Net Clinical Benefit Analysis of Radiation Therapy Oncology Group 0525: A Phase III Trial Comparing Conventional Adjuvant Temozolomide With Dose-Intensive Temozolomide in Patients With Newly Diagnosed Glioblastoma

Terri S Armstrong; Jeffrey S Wefel; Meihua Wang; Mark R Gilbert; Minhee Won; Andrew Bottomley; Tito R Mendoza; Corneel Coens; Maria Werner-Wasik; David G Brachman; Ali K Choucair; Minesh Mehta

doi:10.1200/JCO.2013.49.6067

. 2013 Oct 7;31(32):4076–4084. doi: 10.1200/JCO.2013.49.6067

Net Clinical Benefit Analysis of Radiation Therapy Oncology Group 0525: A Phase III Trial Comparing Conventional Adjuvant Temozolomide With Dose-Intensive Temozolomide in Patients With Newly Diagnosed Glioblastoma

Terri S Armstrong ^1,^✉, Jeffrey S Wefel ¹, Meihua Wang ¹, Mark R Gilbert ¹, Minhee Won ¹, Andrew Bottomley ¹, Tito R Mendoza ¹, Corneel Coens ¹, Maria Werner-Wasik ¹, David G Brachman ¹, Ali K Choucair ¹, Minesh Mehta ¹

PMCID: PMC3816957 PMID: 24101048

Abstract

Purpose

Radiation Therapy Oncology Group trial 0525 tested whether dose-intensifying temozolomide versus standard chemoradiotherapy improves overall survival (OS) or progression-free survival (PFS) in newly diagnosed glioblastoma. Tests of neurocognitive function (NCF) and symptoms (using the MD Anderson Symptom Inventory–Brain Tumor module; MDASI-BT) and of quality of life (European Organisation for the Research and Treatment of Cancer Quality of Life Questionnaire [EORTC QLQ] –C30/BN20) examined the net clinical benefit (NCB) of therapy.

Patients and Methods

NCF tests (Hopkins Verbal Learning Test–Revised, Trail Making Test, and Controlled Oral Word Association), MDASI-BT, and EORTC QLQ-C30/BN20 were completed in a subset of patients. Multivariate Cox proportional hazard regression modeling determined the prognostic value of baseline and early change from baseline to cycle 1 for OS and PFS. Two-sample proportional test statistic was used to evaluate differences between treatments (dose-dense v standard-dose) on NCB measures from baseline to cycle 4 in stable patients.

Results

Overall, 182 patients participated in the study. Baseline NCF tests and the physical functioning quality of life scale were associated with OS and PFS. Baseline to cycle 1 in all NCB components were associated with OS and PFS. There was greater deterioration in the dose-dense arm from baseline to cycle 4 in the Global Health and Motor Function subscales (EORTC QLQ-C30/BN20) as well as in overall symptom burden, overall symptom interference, and activity-related symptom interference subscales (MDASI-BT). There were no between-arm differences in NCF.

Conclusion

Longitudinal collection of NCB measures is feasible in cooperative group studies and provides an added dimension to standard outcome measures. Greater adverse symptom burden and functional interference, as well as decreased global health and motor function were observed in patients randomly assigned to the dose-dense arm. Baseline and early change in NCB measures were associated with decreased rates of survival.

INTRODUCTION

Glioblastoma (GBM) is one of the most malignant primary brain tumors. Most studies show a median survival rate of approximately 14 months, with a treatment course often marked by a decline in overall functional status. Most patients succumb to their illness within 2 years with modern treatments having produced modest survival gains.¹ Because of the impact of the tumor on neurologic and cognitive function, improvements in survival may not be associated with improvements in patient function. As a consequence, traditional end points may not measure the overall impact on the patient. Precedence exists in oncology for measurement of nontherapeutic end points, but no standard approach has been used to measure this effect in neurooncology. To date, approaches include subjective measures such as quality of life (QOL) and symptom burden, as well as objective neurocognitive testing. Using these end points coupled with the primary impact on disease control will allow the investigator to more fully interrogate the net clinical benefit (NCB) of a treatment on the patient.² Radiation Therapy Oncology Group (RTOG) trial 0525 hypothesized that using a dose-dense temozolomide regimen would modulate methylguanine methyltransferase (MGMT) activity, a presumed mechanism of chemotherapy resistance, resulting in improved survival. However, the experimental dose-dense regimen, administered for 21 of 28 days of each cycle, had the potential to either cause more treatment-related toxicity or conversely improve clinical functioning and outcome consequential to improved tumor control. Therefore, this companion analysis was designed to determine the impact of standard-dose versus dose-dense temozolomide on QOL, symptom burden, and neurocognitive function (NCF), providing information to evaluate the NCB of these treatment approaches. Specifically, the study was designed to determine if dose-dense therapy was associated with better QOL, symptoms, and NCF because of better tumor control; if dose-dense therapy slowed expected NCF deterioration owing to tumor progression, or if dose-dense therapy was associated with more or less short- and long-term neurotoxicity. The goal was that this information, coupled with traditional outcome data, could then be used in important risk-benefit considerations for patients and their health care providers.

PATIENTS AND METHODS

Sample

A subset of RTOG 0525 participants who could speak and read English were offered participation in the NCB portion of the study (Fig 1). All patients had histopathologically confirmed GBM (WHO grade 4) with a supratentorial component. Participation in the sub-study was voluntary. All patients provided written informed consent to participate in this institutional review board–approved trial.

Study Design

The NCB assessment schedule is provided in Appendix Figure A1(online-only). The primary end point of the clinical study was overall survival (OS). Symptom and QOL measures were collected at the following timepoints: baseline; before the initiation of cycles 1, 2, 3, 4, 5, 6, and 10 (if administered, or 3 months after completing cycle 6); and 1 month after cycle 12 (if administered, or 6 months after completing cycle 6). NCF data were collected at baseline; before the initiation of cycles 1, 4, 7 (if administered), and 10 (if administered, or 3 months after completing cycle 6); and 1 month after completing cycle 12 (if administered, or 6 months after completing cycle 6).

The primary study objective was to determine whether dose-intensifying adjuvant temozolomide (increasing the dose density) enhances treatment efficacy as measured by overall survival (Gilbert et al³). The primary objective of the NCB portion of the study was to compare changes in QOL, symptom burden, and NCF between the two treatment arms in patients whose disease did not progress. In addition, we evaluated the ability of baseline NCB measures and deterioration in NCB measures after concurrent chemoradiotherapy to predict progression-free and overall survival (see text box for the prespecified NCB study objectives).

Outcome Measures

Patient-reported outcome measures were used to measure QOL and symptoms, and objective NCF testing was used to assess cognition in our study.

QOL.

Health-related quality of life was assessed using the European Organisation for the Research and Treatment of Cancer Quality of Life Questionnaire C30/BN20 (EORTC-QLQ-C30/BN20),^4,5 developed and validated for use in a cancer patient population; the BN20, in particular, was developed as a module for patients with brain cancer. QLQ-C30 is a 30-item self-report questionnaire in which patients rate the items on a four-point scale, with 1 as “not at all” and 4 as “very much.” The measure produces five domain scores including physical, role, emotional, cognitive, and social functioning, as well as nine single-item individual symptom scales (fatigue, pain, nausea, vomiting, dyspnea, insomnia, anorexia, constipation, and diarrhea) and financial impact. The BN20 consists of four domain scores, including future uncertainty, visual disorder, motor dysfunction, and communication deficit, as well as seven individual symptom items (headache, seizures, drowsiness, hair loss, itching, difficulty with bladder control, and weakness of both legs). Reference and normative data exist for this measure and it was scored using the standardized recommended approach of the EORTC.^6,7

Symptom assessment.

Symptom burden and interference was assessed using the MD Anderson Symptom Inventory Brain Tumor (MDASI-BT),⁸ which was developed and validated for use in this patient population. It consists of 23 symptoms rated on an 11-point scale (0 to 10) to indicate the presence and severity of the symptom, with 0 as “not present” and 10 “as bad as you can imagine.” Each symptom is rated at its worst level in the last 24 hours. Symptoms included on the instrument include those commonly associated with cancer therapies, those associated with increased intracranial pressure, and those related to focal deficits. The symptom items on the MDASI-BT were averaged to generate a mean symptom severity score (ie, symptom burden), and they were grouped into six previously identified factors: affective, cognitive, neurologic, treatment-related, generalized/disease, and gastrointenal-related.⁸ Seizure was removed from the neurologic factor score for this analysis because of the low frequency of reporting by patients in this study. The MDASI-BT also includes ratings of how symptoms interfered with different aspects of a patient's life in the preceding 24 hours. The interference items are measured on a 0 to 10 scale and include: general activity, mood, work (includes both work outside the home and housework), relationships with other people, walking, and enjoyment of life. The interference items are averaged to generate a mean symptom interference score (ie, symptom interference) and are grouped into two types: activity-related (general activity, work, and walking) and mood (mood, relations with other people, and enjoyment of life).

NCF.

Neurocognitive function was assessed using the Hopkins Verbal Learning Test–Revised (HVLT-R; tests for Total Recall, Delayed Recall, and Delayed Recognition),⁹ the Trail Making Test (Part A and Part B),¹⁰ and the Controlled Oral Word Association (COWA)¹¹ The NCF tests were administered by a health care professional (eg, nurse, psychologist) who was trained and certified by the study neuropsychologist (J.S.W.). The tests have published normative data that take into account age and, when appropriate, education and gender. The tests in the neurocognitive battery were selected because they are widely used standardized psychometric instruments that have been shown to be sensitive to the impact of cancer and the neurotoxic effects of cancer treatment in other clinical trials.^12–14

Statistical Methods

Participation in the NCB sub-study was not mandatory. Therefore to determine whether the results would be generalizable to the entire study population, patient characteristics were compared between NCB sub-study participants and nonparticipants via χ² tests. NCB sub-study participant characteristics were also compared between two treatment arms using χ² tests. Only patients without disease progression were required to finish the NCB measures at protocol-specified assessment time points. A two-sample z test was used to compare the percentage of received forms between the two treatment arms at baseline, and before cycles 1, 4, and 10. QOL, symptom, and NCF data collected at baseline and before cycle 4 were used for the primary analysis.

Clinically meaningful change.

For the MDASI-BT, a change in symptom severity of one point was classified as the minimum clinically meaningful change. A decline for an individual patient was calculated as an increase of one point or more in the score from the baseline measurement to before cycle 4. For the EORTC QLQ-C30/BN20, differences of 10 points from the baseline measurement to before cycle 4 based on the standardized score (range, 0 to 100) were classified as the minimum clinically meaningful change in QOL.¹⁵ At cycle 4, change in raw test scores relative to baseline was calculated for each NCF test. Neurocognitive status was categorized as improved, stable, or declined, using the Reliable Change Index (RCI)¹⁶ for each test. The RCI is derived from the SE of measurement of each test and represents the 90% CI for the difference in raw score from baseline to the next assessment that would be expected if no real change occurred:

graphic file with name zlj03213-3954-m01.jpg

SEdiff is the SE of difference, SEM is the SE of measurement, SD₁ is the standard deviation, and r_xy is the test-retest reliability statistic. All RCI thresholds were rounded to the nearest whole number. Changes that did not meet the RCI criterion for decline were categorized as stable/improved. The RCI values were determined from published studies^11,17,18 and have been previously reported.¹⁴

Between-arm differences.

For the analysis related to between-arm differences, two-sample proportion tests were used to compare proportions of deterioration (the percentages of patients with clinically meaningful worsening before cycle 4).

Prediction of OS and progression-free survival.

Individual baseline scores and early change scores (ie, baseline to before cycle 1) were evaluated for association with progression-free survival (PFS) and OS using Cox proportional hazards models. The standardized scores for the neurocognitive tests were used at baseline.^10,11,20 OS was defined as the interval from random assignment to patient death as a result of any cause, and PFS was defined as the interval from random assignment to disease progression or death, whichever occurred first. Those items with a P value less than .1 in the univariate analysis were then included in the multivariate analysis with step-wise selection. recursive partitioning analysis (RPA) class and MGMT were forced to remain in the multivariate model. NCB items with a P value less than .05 on multivariate analysis were deemed to be statistically significant and are reported in this article.

All testing was done at the overall significance level of .05 and there were no multiple comparison adjustments owing to the exploratory purpose of this analysis. Statistical Analysis System (SAS Institute, Cary, NC) was used to perform these analyses.

RESULTS

The primary study opened for accrual on January 17, 2006, and closed to accrual on June 13, 2008, with 1,173 patients enrolled. Among registered patients, 1,125 patients (96%) were eligible for the concurrent radiation and temozolomide stage and 48 patients (4%) were ineligible. A total of 833 patients (74%) were randomly assigned for the adjuvant temozolomide stage. The NCB component became available on July 12, 2007. Of the 569 patients subsequently enrolled to the primary study, 246 patients (43%) agreed to participate in the NCB component and 232 patients (94%) have at least one NCB assessment available. Among the randomly assigned patients, 182 patients have NCB data available (Fig 1).

Sample Characteristics

Patient characteristics for NCB sub-study participants and nonparticipants included in this analysis are listed in Table 1. The patient characteristics were balanced except for sex (P = .09), the surgery type (P = .008), and Karnofsky performance score (P = .04). There was no statistically significant difference on important stratification factors including RPA class (P = .76) and MGMT (P = .27). Patient characteristics between two treatment arms for all of the randomly assigned patients with NCB data are listed in Table 2. The patient characteristics were balanced between these two treatment arms.

Table 1.

Pretreatment Characteristics Since NCB Component Available by Randomized Versus Nonparticipating Patients

Characteristic	Randomized (n = 182)		Nonparticipants (n = 337)		P
Characteristic	No. of Patients	%	No. of Patients	%	P
Age, years					.81
Median	58		57
Min-max	21-84		22-84
Q1-Q3	49-66		50-64
< 50	46	25.3	82	24.3
≥ 50	136	74.7	255	75.7
Sex					.09
Male	94	51.6	200	59.3
Female	88	48.4	137	40.7
Race					.20^*^†
American Indian/Alaska Native	1	0.5	0	0.0
Asian	2	1.1	3	0.9
Black or African American	7	3.8	1	0.3
Native Hawaiian or Other Pacific Islander	0	0.0	1	0.3
White	170	93.4	205	60.8
More than one race	0	0.0	2	0.6
Unknown or not reported	2	1.1	125	37.1
Ethnicity					.25^*
Hispanic or Latino	5	2.7	11	3.3
Not Hispanic or Latino	159	87.4	188	55.8
Unknown (individuals not reporting ethnicity)	18	9.9	138	40.9
Education					.26^*
≤ 12 years	61	33.5	80	23.7
> 12 years	107	58.8	117	34.7
Other	9	4.9	5	1.5
Prefers not to answer/unknown	5	2.7	135	40.1
KPS					.04
90-100	128	70.3	207	61.4
Surgery					.001
Biopsy	3	1.6	9	2.7
Partial resection	64	35.2	172	51.0
Total resection	115	63.2	156	46.3
Neurologic function					.28
No symptoms	62	34.1	111	32.9
Minor symptoms	86	47.3	143	42.4
Moderate symptoms	33	18.1	80	23.7
Severe symptoms	1	0.5	3	0.9
RPA class					.24
III	37	20.3	57	16.9
IV	116	63.7	207	61.4
V	29	15.9	73	21.7
Lateralization of tumor					.23
Right side only	104	57.1	172	51.0
Left side only	77	42.3	156	46.3
Bilateral	1	0.5	5	1.5
Unknown	0	0.0	4	1.2

Open in a new tab

Abbreviations: KPS, Karnofsky performance score; max, maximum; min, minimum; NCB, net clinical benefit; RPA, recursive partitioning analysis.

P values are based on the χ² test.

^†

White versus other race.

Table 2.

Pretreatment Characteristics for Randomly Assigned Patients Participating in NCB Component

Characteristic	Arm 1 (n = 92)		Arm 2 (n = 90)		P
Characteristic	No. of Patients	%	No. of Patients	%	P
Age, years					.80
Median	57		59
Min-Max	23-84		21-79
Q1-Q3	49-67.5		50-66
< 50	24	26.1	22	24.4
≥ 50	68	73.9	68	75.6
Sex					.10
Male	53	57.6	41	45.6
Female	39	42.4	49	54.4
Race					.25^*
American Indian/Alaska Native	1	1.1	0	0.0
Asian	1	1.1	1	1.1
Black or African American	5	5.4	2	2.2
White	84	91.3	86	95.6
Unknown or not reported	1	1.1	1	1.1
Ethnicity					.24
Hispanic or Latino	4	4.3	1	1.1
Not Hispanic or Latino	77	83.7	82	91.1
Unknown (individuals not reporting ethnicity)	11	12.0	7	7.8
Education					.40
≤ 12 years	32	34.8	29	32.2
> 12 years	53	57.6	54	60.0
Other	6	6.5	3	3.3
Prefers not to answer/unknown	1	1.1	4	4.4
KPS					.58
60-80	29	31.5	25	27.8
90-100	63	68.5	65	72.2
Surgery					.20
Biopsy	0	0.0	3	3.3
Partial resection	32	34.8	32	35.6
Total resection	60	65.2	55	61.1
Neurologic function					.32
No symptoms	31	33.7	31	34.4
Minor symptoms	40	43.5	46	51.1
Moderate symptoms	21	22.8	12	13.3
Severe symptoms	0	0.0	1	1.1
MGMT status					.65
Methylated	28	30.4	30	33.3
Unmethylated	52	56.5	52	57.8
Unknown (indeterminate, invalid)	12	13.0	8	8.9
RPA class					.85
III	18	19.6	19	21.1
IV	58	63.0	58	64.4
V	16	17.4	13	14.4
Lateralization of tumor					.41^†
Right side only	55	59.8	49	54.4
Left side only	36	39.1	41	45.6
Bilateral	1	1.1	0	0.0

Open in a new tab

Abbreviations: KPS, Karnofsky performance score; max, maximum; MGMT, methylguanine methyltransferase; min, minimum; NCB, net clinical benefit; RPA, recursive partitioning analysis.

White versus other race.

^†

Right versus left.

Table 3 lists the compliance to NCF and QOL assessments at baseline and before cycles 1, 4, and 10. At least 85% of patients completed a baseline assessment on QOL, MDASI-BT, and NCF measures. There were no statistically significant differences in compliance on these instruments between the two arms, except for QOL and MDASI-BT at the assessment performed before cycle 4 (P = .014 and P = .02, respectively). The compliance rates were lower in the dose-dense arm than in the standard-dose arm for the QOL and MDASI-BT instruments at the assessment performed before cycle 4.

Table 3.

Symptom Burden, QOL, and Neurocognitive Assessment Compliance

Evaluation Status by Measure and Treatment Arm	Baseline		Cycle 1		Cycle 4
Evaluation Status by Measure and Treatment Arm	No. of Patients	%	No. of Patients	%	No. of Patients	%
QLQ-C30/BN20
Standard dose
Expected	96		96		77
Received	85	89	73	76	52	68
Dose dense
Expected	95		94		80
Received	87	92	74	79	39	49
Difference		3		3		19
P^*	.48		0.62		.014
MDASI-BT
Standard dose
Expected	96		96		77
Received	87	91	73	76	52	68
Dose dense
Expected	95		94		80
Received	86	91	73	78	40	50
Difference		0		2		12
P^*	1.0		.74		.02
HVLT-R
Standard dose
Expected	96		96		77
Received	83	86	68	71	41	53
Dose dense
Expected	95		94		80
Received	86	91	64	68	38	48
Difference		5		3		5
P^*	.28		.65		.53
TMT part A
Standard dose
Expected	96		96		77
Received	84	88	71	74	41	53
Dose dense
Expected	95		94		80
Received	86	91	63	67	38	48
Difference		3		7		5
P^*	.50		.29		.53
TMT part B
Standard dose
Expected	96		96		77
Received	82	85	67	70	39	51
Dose dense
Expected	95		94		80
Received	85	89	61	65	37	46
Difference		4		5		5
P^*	.41		.46		.53
COWA
Standard dose
Expected	96		96		77
Received	83	86	68	71	41	53
Dose dense
Expected	95		94		80
Received	83	87	63	67	39	49
Difference		1		4		4
P^*	.84		.55		.62

Open in a new tab

NOTE. Patients who discontinued therapy and did not complete assessments were considered noncompliant.

Abbreviations: COWA, Controlled Oral Word Association; HVLT-R, Hopkins Verbal Learning Test–Revised; TMT, Trail Making Test; QOL, quality of life; QLQ-C30/BN20, European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire C30/BN20.

P value is from the two-sample z test to compare the percentage of received forms between the two arms.

Differences in Deterioration Status

Detecting between-arm differences is an important component of assessing the impact of a new therapy versus a standard approach. We were interested in exploring differences in deterioration status from baseline to the assessment before cycle 4 for each component of the NCB assessment separately. Complete results are provided in Table 4.

Table 4.

Between Arm Differences in Deterioration of QOL, Symptoms, and NCF

Test/Measure and Component	Standard-Dose Deterioration		Dose-Dense Deterioration		P^*
Test/Measure and Component	No. of Patients	%	No. of Patients	%	P^*
EORTC QOL C30
Scale
Global health status/QOL	11	23	17	45	.03^*
Physical	12	24	14	36	.25
Role	12	24	14	36	.25
Emotional	6	13	11	28	.07
Cognitive	10	21	15	38	.07
Symptom items
Social	11	23	14	36	.18
Fatigue	12	24	17	44	.06
Nausea/vomiting	8	16	12	31	.11
Pain	8	16	9	23	.43
Dyspnea	8	16	11	28	.18
Insomnia	5	10	7	18	.32
Appetite loss	16	33	13	33	.95
Constipation	8	16	10	26	.29
Diarrhea	8	17	5	13	.58
Financial	9	19	6	17	.77
BN20
Scale
Future uncertainty	3	6	5	14	.26
Visual disorder	15	31	6	16	.09
Motor dysfunction	10	20	17	45	.014^*
Communication deficit	15	31	12	32	.92
Symptom items
Headaches	5	10	9	24	.09
Seizures	3	6	2	5	.86
Drowsiness	15	31	13	34	.77
Hair loss	7	15	11	31	.09
Itchy skin	13	27	12	34	.45
Weak legs	7	14	9	24	.27
Bladder	2	4	5	13	.14
MDASI-BT
Summary scores
Symptom burden	5	10	11	27	.03^*
Symptom interference	7	14	13	32	.03^*
Work and walking	8	16	15	39	.01^*
Activity-related	8	16	15	38	.02^*
Mood-related	12	24	12	30	.49
Factor groups
Affective	7	14	12	30	.06
Cognitive	9	18	11	28	.27
Neurologic	7	14	9	23	.28
Treatment-related	13	25	16	40	.14
Generalized/disease	6	12	8	20	.27
GI-related	4	8	7	18	.19
NCF
HVLT-R
Total Recall	11	27	11	30	.78
Delayed Recall	8	21	8	24	.76
Delayed Recognition	10	24	7	19	.60
TMT
Part A	8	20	11	29	.35
Part B	10	26	11	30	.69
COWA	3	7	6	16	.25
CTB composite^†	11	29	10	29	.99

Open in a new tab

Abbreviations: COWA, Controlled Oral Word Association; CTB, clinical trial battery; EORTC QOL C30, European Organisation for the Research and Treatment of Cancer Quality of Life Questionnaire C30; HR, hazard ratio; HVLT-R, Hopkins Verbal Learning Test–Revised; MDASI-BT, MD Anderson Symptom Inventory–Brain Tumor module; NCF, neurocognitive function; TMT, Trail Making Test; QOL, quality of life.

From two-sample proportional test statistic comparing the percentage of people who deteriorated since baseline.

^†

CTB composite score is the mean of standardized scores (HVLT-R–Total Recall, HVLT-R–Delayed Recall, HVLT-R–Delayed Recognition, TMT parts A and B, and COWA).

QOL.

There were no statistically significant differences in the rates of deterioration between the two treatment arms, except for the EORTC QLQ-C30 global health status factor (P = .03) and for the BN-20 motor dysfunction item (P = .014), although a trend for greater decline in patient-reported QOL in the dose-dense arm was noted.

Symptom burden.

There were statistically significant differences in the rates of deterioration between the two treatment arms in both mean symptom burden and symptom interference. For symptom interference, there was a statistically significant difference in the frequency of deterioration between the two treatment arms for the activity-related and work and walking factors, but no statistically significant difference for the mood-related factor.

NCF.

There were no statistically significant differences in the frequency of deterioration on any of the NCF tests between the two treatment arms.

Prediction of Overall Survival

We examined baseline QOL, symptoms, and NCF as well as changes in these measures after concurrent chemoradiotherapy as predictors of OS. Measures that were significant at the P ≤ .1 level on univariate analyses were retained for inclusion in multivariate analysis. At baseline, QOL and NCF measures yielded prognostic information regarding overall survival time. Specifically, the EORTC physical functioning scale and nausea/vomiting items, the HVLT-R Delayed Recognition, and COWA were prognostic for OS. No baseline MDASI-BT factors were associated with overall survival time.

After concurrent chemoradiotherapy, deterioration on QOL, symptom, and NCF measures were associated with inferior OS. Specifically, deterioration in the MDASI-BT cognitive factor, EORTC cognitive functioning scale, EORTC motor dysfunction scale, EORTC bothered-by-hair-loss item, HVLT-R Total Recall, and Trail Making Test Part B were associated with shorter overall survival time (Table 5).

Table 5.

Prediction of Overall Survival

Variable	Multivariate Analysis
Variable	P	HR	95% CI
Baseline, continuous
EORTC, physical functioning scale	.029	0.99	0.98 to 1.00
EORTC, nausea/vomiting item	.030	1.02	1.00 to 1.04
HVLT-R, delayed recognition	.022	0.87	0.77 to 0.98
COWA	.010	0.81	0.69 to 0.95
Early change, deterioration
EORTC, cognitive functioning scale	.004	1.95	1.23 to 3.09
EORTC, motor dysfunction item	.041	1.59	1.02 to 2.47
EORTC, hair loss item	.039	0.57	0.33 to 0.97
MDASI-BT, cognitive factor	.012	1.82	1.14 to 2.89
HVLT-R, Total Recall	.013	1.90	1.14 to 3.15
TMT, part B	.003	2.11	1.29 to 3.44

Open in a new tab

Abbreviations: COWA, Controlled Oral Word Association; EORTC, European Organisation for the Research and Treatment of Cancer; HR, hazard ratio; HVLT-R, Hopkins Verbal Learning Test–Revised; MDASI-BT, MD Anderson Symptom Inventory–Brain Tumor module; TMT, Trail Making Test.

Prediction of Progression-Free Survival

We examined baseline QOL, symptoms, and NCF as well as changes in these measures after concurrent chemoradiotherapy as predictors of progression-free survival (Table 6). At baseline, QOL and NCF measures yielded prognostic information regarding progression-free survival time. Specifically, the EORTC physical functioning scale, EORTC constipation item, and the COWA were prognostic for progression-free survival time. No baseline MDASI-BT factors were associated with PFS.

Table 6.

Prediction of Progression-Free Survival

Variable	Multivariate Analysis
Variable	P	HR	95% CI
Baseline, continuous
EORTC, physical functioning scale	.009	0.987	0.977 to 0.997
EORTC, constipation item	.012	0.990	0.981 to 0.998
COWA	.007	0.824	0.716 to 0.949
Early change, deterioration
EORTC, motor dysfunction item	.005	1.74	1.18 to 2.55
MDASI-BT neurologic factor	.007	1.87	1.19 to 2.94
HVLT-R Delayed Recall	.030	1.71	1.06 to 2.77

Open in a new tab

After concurrent chemoradiotherapy, deterioration on QOL, symptom, and NCF measures were associated with inferior PFS. Specifically, deterioration on the MDASI-BT neurologic factor, EORTC motor dysfunction score, and on the HVLT-R Delayed Recall measure were associated with shorter progression-free survival time.

Net Clinical Benefit Objectives.

Primary
To compare between the two treatment arms the symptom burden, neurocognitive function (NCF), and health-related quality of life (HRQOL) in patients whose disease has not progressed 6 months after adjuvant therapy (6-month progression-free survival).
To evaluate midcycle differences in symptom burden and HRQOL in patients on the two arms at day 14 of course 1 and course 4.
Secondary
To evaluate longitudinal changes in HRQOL measures and determine the impact of dose-intense chemotherapy on these parameters.
To measure symptom burden and degree of interference over the course of therapy to evaluate differences between patients' individual symptom severity, overall mean symptom severity, and difference in scores on the interference items between the two treatment arms.
To describe the association between quality of life as measured by the European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire C30/BN20 (EORTC-QL30/BCM20) and mean symptom severity as measured by the MD Anderson Symptom Inventory–Brain Tumor module (MDASI-BT) in patients enrolled onto this study.
To describe the variability of symptom severity across the epoch and follow-up period to compare differences between the two treatment arms.
To evaluate these instruments as a useful composite measurement of the impact of treatment and disease response in analysis of efficacy.
To evaluate differences in longitudinal changes on measures of NCF associated with dose-intense chemotherapy.
To evaluate the relationship between self-reported cognitive dysfunction and NCF testing.

DISCUSSION

RTOG 0525 is the first large, randomized phase III trial in GBM to incorporate a multidimensional battery of measures to prospectively assess patient-oriented outcomes, including cognition, symptoms, and quality of life. The net clinical benefit sub-study was initiated 18 months after the primary trial was activated and yet still succeeded in reaching its accrual goals. Baseline data were collected on more than 80% of patients, supporting the feasibility of this approach.

Patients randomly assigned to the dose-dense arm reported experiencing more severe symptoms and overall and activity-related interference on the MDASI-BT and worse global quality of life and motor dysfunction on the EORTC QLQ-C30/BN20 from baseline to before cycle 4, which is likely a reflection of increased toxicities reported with the dose-dense treatment. There were no between-arm differences in cognitive function from baseline to before cycle 4, potentially commensurate with the absence of between-arm differences in treatment efficacy and tumor control.

Analysis of the baseline NCB measures as independent predictors of survival after accounting for traditional prognostic factors including RPA class and MGMT status demonstrated that worse memory, executive function, physical functioning, and nausea/vomiting were associated with shorter overall survival. Our results are similar to other articles recently published that support the prognostic importance of select symptoms assessed by the EORTC measure in brain cancer.^5,21,22

Decline after concurrent chemoradiotherapy in memory, executive function, subjective cognitive function, motor function, and increased bother with hair loss was associated with shorter overall survival. Similarly, worse baseline executive function, physical function, and constipation were independently associated with shorter progression-free survival. Decline after concurrent chemoradiotherapy in memory and neurologic function was associated with shorter progression-free survival.

Employing multidimensional outcome measures as in RTOG 0525 is feasible in large cooperative group clinical trials and provides complimentary information to standard outcome measures. Our study demonstrated that overall and progression-free survival outcomes are predicted by early deterioration of patient function, supporting the utility of these measures and suggesting that this test battery may have profound implications for clinical trial design. These assessments are not without their limitations. Whereas select components of baseline QOL were associated with both PFS and OS, there were no significant between-arm differences in QOL, despite differences in symptom burden and interference. This may represent the effect of a response shift of the patient report, leading to an absence of a reported decline in QOL despite more severe symptoms. Though there were global differences in symptom burden and interference, baseline symptom reporting was not associated with PFS and OS. Neurocognitive testing was associated with the impact of tumor on cognitive function, prognostic for both PFS and OS, but differential treatment effects were not seen. Finally, completion rates declined over time, primarily a consequence site error. This limitation affects the assessment of longitudinal treatment-related changes in patient outcomes. Strategies to encourage data collection and completion similar to traditional measures are warranted.

Although future trials incorporating the NCB measures are needed to validate these findings, our results strongly support making the NCB measures a component of brain tumor clinical trials. These NCB measures can potentially be used for stratification and also provide a dimension of treatment efficacy that is not adequately determined by traditional measures.

Supplementary Material

Protocol

supp_31_32_4076__index.html^{(2.2KB, html)}

Acknowledgment

We thank Kathryn Okrent, Barbara Kaiser, Sandrine Geinoz, and the late Wilma Hoffman for protocol development and management; and Accelerate Brain Cancer Cure and Merck for grant support.

Appendix

Figure A1. — Study schema. (*) increased after cycle 1 if no myelotoxicity. MGMT, methylguanine methyltransferase; NCB, net clinical benefit; NCF, neurocognitive function; PRO, patient-reported outcomes; RPA, recursive partitioning analysis; RT, radiation therapy; TMZ, temozolomide.

Footnotes

See accompanying article on page 4085

Supported by Grants No. NCI U10CA 21661, U10CA37422, and ABC² and by Merck Pharmaceuticals.

Presented at the 47th Annual Meeting of the American Society of Clinical Oncology, Chicago, IL, June 3-7, 2011 (poster presentation) and at the 16th Annual Scientific Meeting of the Society of Neuro-Oncology, Garden Grove, CA, November 17-20, 2011.

Authors' disclosures of potential conflicts of interest and author contributions are found at the end of this article.

Clinical trial information: NCT00304031.

AUTHORS' DISCLOSURES OF POTENTIAL CONFLICTS OF INTEREST

Although all authors completed the disclosure declaration, the following author(s) and/or an author's immediate family member(s) indicated a financial or other interest that is relevant to the subject matter under consideration in this article. Certain relationships marked with a “U” are those for which no compensation was received; those relationships marked with a “C” were compensated. For a detailed description of the disclosure categories, or for more information about ASCO's conflict of interest policy, please refer to the Author Disclosure Declaration and the Disclosures of Potential Conflicts of Interest section in Information for Contributors.

Employment or Leadership Position: None Consultant or Advisory Role: Mark R. Gilbert, Merck (C), Genentech (C), EMD Serono (C); Tito R. Mendoza, Exelixis (C); Minesh Mehta, Merck (C), Roche (C), Novocure (C) Stock Ownership: Minesh Mehta, Accuray, Pharmacyclics Honoraria: Mark R. Gilbert, Merck, Genentech; Minesh Mehta, Merck Research Funding: Terri S. Armstrong, Merck, Genentech; Jeffrey S. Wefel, Schering-Plough; Mark R. Gilbert, GlaxoSmithKline, Genentech Expert Testimony: None Patents: None Other Remuneration: None

AUTHOR CONTRIBUTIONS

Conception and design: Terri S. Armstrong, Jeffrey S. Wefel, Meihua Wang, Mark R. Gilbert, Andrew Bottomley, Tito R. Mendoza, Corneel Coens, Ali K. Choucair, Minesh Mehta

Provision of study materials or patients: David G. Brachman

Collection and assembly of data: Terri S. Armstrong, Jeffrey S. Wefel, Meihua Wang, Minhee Won, Maria Werner-Wasik, David G. Brachman

Data analysis and interpretation: All authors

Manuscript writing: All authors

Final approval of manuscript: All authors

REFERENCES

1.Stupp R, Mason WP, van den Bent MJ, et al. Radiotherapy plus concomitant and adjuvant temozolomide for glioblastoma. N Engl J Med. 2005;352:987–996. doi: 10.1056/NEJMoa043330. [DOI] [PubMed] [Google Scholar]
2.Taphoorn MJ, Sizoo EM, Bottomley A. Review on quality of life issues in patients with primary brain tumors. Oncologist. 2010;15:618–626. doi: 10.1634/theoncologist.2009-0291. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Gilbert MR, Wang M, Aldape KD, et al. Dose-dense temozolomide for newly diagnosed glioblastoma: A randomized phase III clinical trial. J Clin Oncol. 2013;31:4085–4091. doi: 10.1200/JCO.2013.49.6968. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Efficace F, Bottomley A. Health related quality of life assessment methodology and reported outcomes in randomised controlled trials of primary brain cancer patients. Eur J Cancer. 2002;38:1824–1831. doi: 10.1016/s0959-8049(02)00173-9. [DOI] [PubMed] [Google Scholar]
5.Taphoorn MJ, Claassens L, Aaronson NK, et al. An international validation study of the EORTC brain cancer module (EORTC QLQ-BN20) for assessing health-related quality of life and symptoms in brain cancer patients. Eur J Cancer. 2010;46:1033–1040. doi: 10.1016/j.ejca.2010.01.012. [DOI] [PubMed] [Google Scholar]
6.Scott NW, Fayers PM, Aaronson NK, et al. Reference Values Manual. Brussels, Belgium: European Organisation for Research and Treatment of Cancer; 2008. [Google Scholar]
7.Fayers PM, Aaronson NK, Bjordal K, et al. The EORTC QLQ-C30 Scoring Manual. 3rd ed. Brussels, Belgium: European Organisation for Research and Treatment of Cancer; 2001. [Google Scholar]
8.Armstrong TS, Mendoza T, Gning I, et al. Validation of the MD Anderson Symptom Inventory Brain Tumor Module (MDASI-BT) J Neurooncol. 2006;80:27–35. doi: 10.1007/s11060-006-9135-z. [DOI] [PubMed] [Google Scholar]
9.Benedict RH, Schretten D, Groninger L, et al. Hopkins Verbal Learning Test-Revised: Normative data and analysis of inter-form and test-retest reliability. Clin Neuropsychol. 1998;12 [Google Scholar]
10.Tombaugh TN. Trail Making Test A and B: Normative data stratified by age and education. Arch Clin Neuropsychol. 2004;19:203–214. doi: 10.1016/S0887-6177(03)00039-8. [DOI] [PubMed] [Google Scholar]
11.Ruff RM, Light RH, Parker SB, et al. Benton Controlled Oral Word Association Test: Reliability and updated norms. Arch Clin Neuropsychol. 1996;11:329–338. [PubMed] [Google Scholar]
12.Wefel JS, Saleeba AK, Buzdar AU, et al. Acute and late onset cognitive dysfunction associated with chemotherapy in women with breast cancer. Cancer. 2010;116:3348–3356. doi: 10.1002/cncr.25098. [DOI] [PubMed] [Google Scholar]
13.Meyers CA, Smith JA, Bezjak A, et al. Neurocognitive function and progression in patients with brain metastases treated with whole-brain radiation and motexafin gadolinium: Results of a randomized phase III trial. J Clin Oncol. 2004;22:157–165. doi: 10.1200/JCO.2004.05.128. [DOI] [PubMed] [Google Scholar]
14.Wefel JS, Cloughesy T, Zazzali JL, et al. Neurocognitive function in patients with recurrent glioblastoma treated with bevacizumab. Neuro Oncol. 2011;13:660–668. doi: 10.1093/neuonc/nor024. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Maringwa J, Quinten C, King M, et al. Minimal clinically meaningful differences for the EORTC QLQ-C30 and EORTC QLQ-BN20 scales in brain cancer patients. Ann Oncol. 2011;22:2107–2112. doi: 10.1093/annonc/mdq726. [DOI] [PubMed] [Google Scholar]
16.Jacobson NS, Truax P. Clinical significance: A statistical approach to defining meaningful change in psychotherapy research. J Consult Clin Psychol. 1991;59:12–19. doi: 10.1037//0022-006x.59.1.12. [DOI] [PubMed] [Google Scholar]
17.Woodard JL, Benedict RH, Salthouse TA, et al. Normative data for equivalent, parallel forms of the Judgment of Line Orientation Test. J Clin Exp Neuropsychol. 1998;20:457–462. doi: 10.1076/jcen.20.4.457.1473. [DOI] [PubMed] [Google Scholar]
18.Levine AJ, Miller EN, Becker JT, et al. Normative data for determining significance of test-retest differences on eight common neuropsychological instruments. Clin Neuropsychol. 2004;18:373–384. doi: 10.1080/1385404049052420. [DOI] [PubMed] [Google Scholar]
19. Reference deleted.
20.Shapiro AM, Benedict RH, Schretlen D, et al. Construct and concurrent validity of the Hopkins Verbal Learning Test-revised. Clin Neuropsychol. 1999;13:348–358. doi: 10.1076/clin.13.3.348.1749. [DOI] [PubMed] [Google Scholar]
21.Quinten C, Maringwa J, Gotay CC, et al. Patient self-reports of symptoms and clinician ratings as predictors of overall cancer survival. J Natl Cancer Inst. 2011;103:1851–1858. doi: 10.1093/jnci/djr485. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Gotay CC, Kawamoto CT, Bottomley A, et al. The prognostic significance of patient-reported outcomes in cancer clinical trials. J Clin Oncol. 2008;26:1355–1363. doi: 10.1200/JCO.2007.13.3439. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Protocol

supp_31_32_4076__index.html^{(2.2KB, html)}

supp_JCO.2013.49.6067_Protocol_JCO.2013.49.6067.pdf^{(282.5KB, pdf)}

[B1] 1.Stupp R, Mason WP, van den Bent MJ, et al. Radiotherapy plus concomitant and adjuvant temozolomide for glioblastoma. N Engl J Med. 2005;352:987–996. doi: 10.1056/NEJMoa043330. [DOI] [PubMed] [Google Scholar]

[B2] 2.Taphoorn MJ, Sizoo EM, Bottomley A. Review on quality of life issues in patients with primary brain tumors. Oncologist. 2010;15:618–626. doi: 10.1634/theoncologist.2009-0291. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B3] 3.Gilbert MR, Wang M, Aldape KD, et al. Dose-dense temozolomide for newly diagnosed glioblastoma: A randomized phase III clinical trial. J Clin Oncol. 2013;31:4085–4091. doi: 10.1200/JCO.2013.49.6968. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B4] 4.Efficace F, Bottomley A. Health related quality of life assessment methodology and reported outcomes in randomised controlled trials of primary brain cancer patients. Eur J Cancer. 2002;38:1824–1831. doi: 10.1016/s0959-8049(02)00173-9. [DOI] [PubMed] [Google Scholar]

[B5] 5.Taphoorn MJ, Claassens L, Aaronson NK, et al. An international validation study of the EORTC brain cancer module (EORTC QLQ-BN20) for assessing health-related quality of life and symptoms in brain cancer patients. Eur J Cancer. 2010;46:1033–1040. doi: 10.1016/j.ejca.2010.01.012. [DOI] [PubMed] [Google Scholar]

[B6] 6.Scott NW, Fayers PM, Aaronson NK, et al. Reference Values Manual. Brussels, Belgium: European Organisation for Research and Treatment of Cancer; 2008. [Google Scholar]

[B7] 7.Fayers PM, Aaronson NK, Bjordal K, et al. The EORTC QLQ-C30 Scoring Manual. 3rd ed. Brussels, Belgium: European Organisation for Research and Treatment of Cancer; 2001. [Google Scholar]

[B8] 8.Armstrong TS, Mendoza T, Gning I, et al. Validation of the MD Anderson Symptom Inventory Brain Tumor Module (MDASI-BT) J Neurooncol. 2006;80:27–35. doi: 10.1007/s11060-006-9135-z. [DOI] [PubMed] [Google Scholar]

[B9] 9.Benedict RH, Schretten D, Groninger L, et al. Hopkins Verbal Learning Test-Revised: Normative data and analysis of inter-form and test-retest reliability. Clin Neuropsychol. 1998;12 [Google Scholar]

[B10] 10.Tombaugh TN. Trail Making Test A and B: Normative data stratified by age and education. Arch Clin Neuropsychol. 2004;19:203–214. doi: 10.1016/S0887-6177(03)00039-8. [DOI] [PubMed] [Google Scholar]

[B11] 11.Ruff RM, Light RH, Parker SB, et al. Benton Controlled Oral Word Association Test: Reliability and updated norms. Arch Clin Neuropsychol. 1996;11:329–338. [PubMed] [Google Scholar]

[B12] 12.Wefel JS, Saleeba AK, Buzdar AU, et al. Acute and late onset cognitive dysfunction associated with chemotherapy in women with breast cancer. Cancer. 2010;116:3348–3356. doi: 10.1002/cncr.25098. [DOI] [PubMed] [Google Scholar]

[B13] 13.Meyers CA, Smith JA, Bezjak A, et al. Neurocognitive function and progression in patients with brain metastases treated with whole-brain radiation and motexafin gadolinium: Results of a randomized phase III trial. J Clin Oncol. 2004;22:157–165. doi: 10.1200/JCO.2004.05.128. [DOI] [PubMed] [Google Scholar]

[B14] 14.Wefel JS, Cloughesy T, Zazzali JL, et al. Neurocognitive function in patients with recurrent glioblastoma treated with bevacizumab. Neuro Oncol. 2011;13:660–668. doi: 10.1093/neuonc/nor024. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B15] 15.Maringwa J, Quinten C, King M, et al. Minimal clinically meaningful differences for the EORTC QLQ-C30 and EORTC QLQ-BN20 scales in brain cancer patients. Ann Oncol. 2011;22:2107–2112. doi: 10.1093/annonc/mdq726. [DOI] [PubMed] [Google Scholar]

[B16] 16.Jacobson NS, Truax P. Clinical significance: A statistical approach to defining meaningful change in psychotherapy research. J Consult Clin Psychol. 1991;59:12–19. doi: 10.1037//0022-006x.59.1.12. [DOI] [PubMed] [Google Scholar]

[B17] 17.Woodard JL, Benedict RH, Salthouse TA, et al. Normative data for equivalent, parallel forms of the Judgment of Line Orientation Test. J Clin Exp Neuropsychol. 1998;20:457–462. doi: 10.1076/jcen.20.4.457.1473. [DOI] [PubMed] [Google Scholar]

[B18] 18.Levine AJ, Miller EN, Becker JT, et al. Normative data for determining significance of test-retest differences on eight common neuropsychological instruments. Clin Neuropsychol. 2004;18:373–384. doi: 10.1080/1385404049052420. [DOI] [PubMed] [Google Scholar]

[B19] 19. Reference deleted.

[B20] 20.Shapiro AM, Benedict RH, Schretlen D, et al. Construct and concurrent validity of the Hopkins Verbal Learning Test-revised. Clin Neuropsychol. 1999;13:348–358. doi: 10.1076/clin.13.3.348.1749. [DOI] [PubMed] [Google Scholar]

[B21] 21.Quinten C, Maringwa J, Gotay CC, et al. Patient self-reports of symptoms and clinician ratings as predictors of overall cancer survival. J Natl Cancer Inst. 2011;103:1851–1858. doi: 10.1093/jnci/djr485. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B22] 22.Gotay CC, Kawamoto CT, Bottomley A, et al. The prognostic significance of patient-reported outcomes in cancer clinical trials. J Clin Oncol. 2008;26:1355–1363. doi: 10.1200/JCO.2007.13.3439. [DOI] [PubMed] [Google Scholar]

PERMALINK

Net Clinical Benefit Analysis of Radiation Therapy Oncology Group 0525: A Phase III Trial Comparing Conventional Adjuvant Temozolomide With Dose-Intensive Temozolomide in Patients With Newly Diagnosed Glioblastoma

Terri S Armstrong

Jeffrey S Wefel

Meihua Wang

Mark R Gilbert

Minhee Won

Andrew Bottomley

Tito R Mendoza

Corneel Coens

Maria Werner-Wasik

David G Brachman

Ali K Choucair

Minesh Mehta

Abstract

Purpose

Patients and Methods

Results

Conclusion

INTRODUCTION

PATIENTS AND METHODS

Sample

Fig 1.

Study Design

Outcome Measures

QOL.

Symptom assessment.

NCF.

Statistical Methods

Clinically meaningful change.

Between-arm differences.

Prediction of OS and progression-free survival.

RESULTS

Sample Characteristics

Table 1.

Table 2.

Table 3.

Differences in Deterioration Status

Table 4.

QOL.

Symptom burden.

NCF.

Prediction of Overall Survival

Table 5.

Prediction of Progression-Free Survival

Table 6.

Net Clinical Benefit Objectives.

DISCUSSION

Supplementary Material

Acknowledgment

Appendix

Figure A1.

Footnotes

AUTHORS' DISCLOSURES OF POTENTIAL CONFLICTS OF INTEREST

AUTHOR CONTRIBUTIONS

REFERENCES

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases