Abstract
Background
Multiple Sclerosis (MS) is a chronic progressive and disabling disease of the central nervous system with dramatic variations in the combination and severity of symptoms it can produce. The lack of reliable disease specific health-related quality of life (HRQL) measures for use in clinical trials prompted the development of the Neurology Quality of Life (Neuro-QOL) instrument, which includes 13 scales that assess physical, emotional, cognitive, and social domains, for use in a variety of neurological illnesses.
Objective
Initial assessment of the reliability and validation of the Neuro-QOL short forms (SFs) in MS.
Methods
We assessed reliability, concurrent validity, known groups validity, and responsiveness between cross-sectional and longitudinal data in 161 recruited MS subjects.
Results
Internal consistency was high for all measures (α = 0.81 - 0.95) and ICCs were within acceptable range (0.76 - 0.91), concurrent and known groups validity were highest with the Global HRQL question. Longitudinal assessment was limited by the lack of disease progression in the group.
Conclusions
The Neuro-QOL SFs demonstrate good internal consistency, test-re-test reliability, and concurrent and know groups validity in this MS population, supporting the validity of Neuro-QOL in adults with MS
INTRODUCTION
Multiple sclerosis (MS) is a progressive, degenerative disease of the central nervous system1, affecting between 250,000 to 350,000 people in the United States with about 10,400 new diagnoses yearly2. MS typically manifests in young adulthood and follows an unpredictable, widely varying clinical course. Relapses and progression throughout the disease course result in accumulating disability and a profound impact on health-related quality of life (HRQL) and diminished physical, social, and cognitive functioning compared to other chronic disorders3, 4.
The impact of MS on HRQL has prompted the development and implementation of many disease-specific measures in clinical trials, however with limited success.5 Variability in the interventions studied and measures employed, inconsistent use of psychometric methods to develop and implement measures make synthesizing results and determining robustness and validity of available measures challenging. The NIH, FDA, and other federal agencies are interested in evaluating specific aspects of function that are comparable across interventions and diseases. As such, there is a push for implementing measures that cross physical, emotional, and social functioning, especially those developed using modern psychometric techniques such as item response theory (IRT), and can be administered using a variety of formats.
The goal of the NINDS-funded Neuro-QoL project was to develop a core set of universally applicable HRQL questions for patients with chronic neurological conditions supplemented with disease-specific questions. The project underwent multiple phases leading up to the final clinical validation of the Neuro-QoL item banks and associated short forms (SFs), which are brief, fixed-length forms of 6-10 items each. The adult conditions included in Neuro-QoL were Amyotrophic Lateral Sclerosis (ALS), Adult Epilepsy, MS, Parkinson's Disease, and Stroke. Two pediatric conditions were also included: pediatric epilepsy and muscular dystrophy. The developers ensured clinical and psychometric validity of these tools by identifying the needs of the clinical research community6, 7, ensuring clinical and patient-driven evidence of importance and relevance of the selected QoL domains, and an expert-based consensus selection of the priority conditions8. Input from experts, caregivers, and patients determined the QoL domains included in the Neuro-QoL9. Item banks for each of the 13 domains were constructed and calibrated using Item Response Theory (IRT) in a sample of adults and children from the General Population (GP) and Clinical Sample (CS) of those suffering from neurological conditions as previously described9-11 with scoring and interpretation details available online via Neuro-QoL.12 In this paper we report the multi-site validation of the Neuro-QoL SFs with a clinical sample of adult MS patients.
Materials and Methods
Participants and Procedures
We compared cross-sectional and longitudinal data from the Neuro-QoL SFs, MS-specific and generic legacy measures, and the PROMIS global health scale. Patients were recruited as part of a large multi-center study to validate Neuro-QoL measures across five adult and two pediatric neurological diseases. The five MS-sites (Cleveland Clinic, Dartmouth-Hitchcock Medical Center, NorthShore University Health System, University of Chicago Hospital, and the University of Texas Health Science Center at San Antonio) all had Institutional Review Board (IRB) approvals. Study inclusion criteria for adults were: Age 18 or older, English-speaking, community resident, and having sufficient cognitive ability to complete the informed consent process for each participating site. MS-specific inclusion criteria were clinician-confirmed diagnosis of MS. The subjects included were a convenience sample of consecutive clinic attendees.
Data Collection and Management
Data were collected at 3 time points – Baseline, Day-7 and Month-6. Baseline and Month-6 data were collected at the clinical sites and Day-7 data were collected by phone. There was a 5-9 day window for the test-retest assessment (Day-7) and a 5-7 month window for the responsiveness assessment (Month-6). Baseline and Month-6 evaluations included the Neuro-QoL instruments, concurrent validity measures, and socio-demographic and clinical data forms that were self-administered by computer or conducted by study personnel. Clinician ratings and chart reviews were also conducted as part of these two visits. The 30-minute Day-7 visit was conducted to assess test-retest performance of the Neuro-QoL instruments administered to subjects over the phone by study personnel. All data were submitted to, and managed by, the coordinating center at Northwestern University.
Measures
Neuro-QoL SFs were validated in relation to generic and MS-specific measures of physical, mental and social functional status, and disease severity. These data were obtained by subject self-rated and clinical assessments (Table 1).
Table 1.
Data and Outcomes | # items | Time required (minutes) | Baseline | 7 Days | 6 month | Mode of Administration |
---|---|---|---|---|---|---|
Generic Measures | ||||||
Socio-demographic Form | 9 | <5 | X | -- | -- | Self-report |
Clinical Information Form | 19 | <5 | X | -- | X | Interviewer |
Karnofsky Performance Status | 1 | <2 | X | -- | X | Medical Prof Rated |
Oral Digit Symbol Modalities | 0-133 | <3 | X | -- | X | Interviewer |
Global HRQL Question | 1 | <2 | X | -- | X | Self-report |
Global Rating of Change Scores | 1 | <2 | -- | -- | X | Self-report |
EQ-5D | 15 | <3 | x | -- | x | Self-report |
PROMIS Global Health Scale | 10 | <2 | x | -- | x | Self-report |
Neuro-QOL Short Forms and Scales | ||||||
Neuro-QOL Short Forms and Scales | ±100 | 45-60 | X | X | X | Self-Report |
Multiple Sclerosis Specific Mesures | ||||||
Multiple Sclerosis Functional Composite (MSFC) (substitute OSDM for PASAT) | 4 | <5 | X | -- | X | Self-report / Interviewer |
Functional Assessment in Multiple Sclerosis (FAMS) | 44 | 10 | X | -- | X | Self-report |
Multiple Sclerosis Performance Scale (MSPS) | 8 | <5 | X | -- | X | Self-report |
Neuro-QoL
The 13 Neuro-QoL SFs (Figure 1) were self-administered at baseline and Month-6 and administered at Day-7 via phone. T-Scores were calculated with a T=50 indicating an average range of function compared to a reference population with a standard deviation (SD) of 10. G P T-Scores were the reference values for Neuro-QoL Positive Affect and Well-Being, Applied Cognition General Concerns, Applied Cognition-Executive Function, Lower Extremity (Mobility), Upper Extremity (Fine Motor, ADL), Ability to Participate in Social Roles and Activities, Satisfaction with Social Roles and Activities, Depression and Anxiety. CS T-Scores were the reference values for Stigma, Fatigue, Sleep disturbance, and Emotional and Behavioral Dyscontrol.
General Function Measures to Assess Convergent Validity (baseline)
Karnofsky Performance Status Scale (KPS)13 rates functional impairment and diagnosis-independent breakdown of activity level across patients.
Cognitive Function
Symbol Digit Modalities Test (SDMT)14 tests information processing speed, visual acuity, and figural memory. The oral version was administered.
Generic Health Related Quality of Life
EQ-5D15, 16 is a 5-item, three response levels per item self-reported health status measure that provides a simple, generic HRQL preference measure for economic evaluation.
PROMIS 10-Item Global Health Scale (GHS)17 items include global ratings of the five primary PROMIS domains (physical function, fatigue, pain, emotional distress, and social health) and general health perceptions that cut across domains. It can be scored into a Global Physical Health component and Global Mental Health component.
Global HRQL Question (GHQ)18, a single item from the Functional Assessment of Chronic Illness Therapy (FACIT), “I am content with the quality of my life right now,” was used as a global measure of quality of life and assessment of convergent validity. It has five response options, ranging from “not at all” to “very much.”
General Function Measures to Assess Responsiveness
Global Rating of Change (GRC)19 is an assessment of patients’ subjective evaluation of the amount of change they experienced over the six month period of the study. We have previously simplified the original 15-level response option to a 7-level option, now ranging from −3 = “very much worse” to +3 = “very much better”20. Individual GRC questions were developed for 6 life domains including Physical, Emotional, Cognitive, Social/Family and Symptomatic Well-being and overall quality of life .
KPS13 was assessed as described above.
Disease Specific Functional Status to assess Convergent Validity (baseline) and Responsiveness
The Multiple Sclerosis Functional Composite (MSFC)21 consists of three objective quantitative tests of neurological functioning: the 9-Hole Peg Test (9-HPT; upper extremity function), Timed-25-Foot Walk (T-25FW; mobility) and the Symbol Digit Modality Test-oral version (SDMT; cognition). Raw scores for each test were converted into Z-scores for each component; Z-scores were averaged to create an overall composite score, per instructions from the measure developers.
Disease Specific HRQL
Functional Assessment of Multiple Sclerosis (FAMS)22 is a MS-specific measure to assess the convergent validity in this population including 44 items summarized into six subscales: mobility, symptoms, emotional well-being (depression), general contentment, thinking/fatigue, and family/social well-being.
Statistical Analysis
Descriptive statistics were calculated for the Neuro-QoL, external validation measures, and socio-demographic and clinical variables at the baseline assessment and follow-up visits. When comparing MS patients’ Neuro-QoL T- scores with the GP and CS reference groups, score difference less than 0.5 SD units (i.e., 5 points, range 45-55) were considered to be within the range of the reference groups’ average.
Reliability
Internal consistency was calculated for Neuro-QoL SF base-line scores using Cronbach's alpha coefficient with coefficient scores of .70 or higher considered acceptable. Test–Retest Reliability of Neuro-QoL SFs at baseline and Day-7 was assessed with Intraclass Correlation Coefficients (ICC) and corresponding 95% confidence intervals, with coefficient scores of .70 or higher considered acceptable.
Validity
Concurrent validity was assessed at baseline by calculating the Spearman rho correlation coefficients between Neuro-QoL SF scores and the generic and disease specific legacy measures. Interpretation guidelines for these correlations were: <0.30 = nominal; 0.30-0.49 = small; 0.5-0.69 = moderate; ≥0.70 = large. The strength of these correlations was hypothesized a priori to the analysis and results are based on those predictions, with correlations >0.50 considered acceptable. Known groups validity was assessed at baseline using analysis of variance comparing baseline mean Neuro-QoL SF scores between MS patients grouped by MS severity using the MSFC and self-reported GHQ.
Responsiveness
Neuro-QoL sensitivity to change was conducted by evaluating general linear models using each patient's change score between month-6 and baseline relative to change in the generic GRC. A correlation of ≥0.30 was set as the criterion for responsive with p≤ 0.05 considered moderately significant and p=0.001 highly significant.
No imputation of missing data was done for patients who failed to participate at the sensitivity to change assessment; however, we prospectively monitored the reasons for missing data (e.g., refusal, disease progression, death) and compared characteristics of patients who did and did not participate.
RESULTS
Baseline Demographics
A total of 161 MS participants completed the baseline assessment with 132 also completing the Month-6 assessments. Baseline demographics indicated that subjects were predominantly female (86%), white (88%), and non-Hispanic (93%). Their average age was 49.8 years (SD=10.5), 58.4 % were married and 90% had some college education or degree. Thirty-seven percent were on disability and 34% were fully employed. MSFC scores ranged from −2.90 to 1.7, with mean of 0.0 (SD=.69). The most common disease courses were relapsing remitting (62.9%) and secondary progressive (28.9%). No systematic differences were found between participants who did or did not complete all assessments or in these scores across the 5 study-sites (data not shown).
Descriptive and reliability statistics for Neuro-QoL MS sample SF T-scores
MS patients were within 5 T score units of the mean GP for Positive Affect and Well-Being, Applied Cognition-Executive Function, Ability to Participate in Social Roles and Activities, Satisfaction with Social Roles and Activities, Depression and Anxiety. MS patients scored worse than 0.5 SD units for Applied Cognition-General Concerns (M=42.7), Lower Extremity (M=43.5), and Upper Extremity (M=44.3). When compared to CS reference group, they were within 0.5 SD for Stigma, Fatigue and Sleep Disturbance, and Emotional and Behavioral Dyscontrol. Internal consistency was high for all measures (α=0.81-0.95) and ICCs were within acceptable range for all SFs (0.76-0.91) (Table 2).
Table 2.
Internal Consistency | Test-Retest | ||||||||
---|---|---|---|---|---|---|---|---|---|
Min-Max Score | N | MGPT | MCST | Std Dev | Chronbach Alpha | n | ICC*** | CI | |
Short Forms* | |||||||||
Positive Affect and Well-Being** | 34.1 - 68.0 | 161 | 53.61 | 7.72 | .95 | 125 | 0.804 | 0.126 | |
Applied Cognition- General Concerns** | 23.1 – 59.3 | 161 | 42.56 | 8.70 | .95 | 125 | 0.855 | 0.096 | |
Applied Cognition- Executive Function** | 27.3 – 57.6 | 161 | 46.02 | 9.37 | .90 | 125 | 0.862 | 0.092 | |
Lower Extremity (Mobility)** | 25.4 – 58.6 | 149 | 43.55 | 9.44 | .93 | 113 | 0.906 | 0.067 | |
Upper Extremity (Fine Motor, ADL)** | 20.8 – 53.8 | 161 | 44.03 | 9.21 | .86 | 125 | 0.808 | 0.124 | |
Ability to Participate in Social Roles and Activities** | 24.1 – 60.2 | 161 | 46.02 | 7.43 | .95 | 125 | 0.759 | 0.151 | |
Satisfaction with Social Roles and Activities** | 28.4 – 60.5 | 161 | 44.97 | 6.07 | .89 | 125 | 0.791 | 0.133 | |
Depression | 36.9 – 62.7 | 161 | 46.69 | 6.93 | .93 | 125 | 0.719 | 0.172 | |
Anxiety | 36.4 – 71.6 | 161 | 51.32 | 6.88 | .93 | 125 | 0.742 | 0.160 | |
Stigma | 43.7 – 66.0 | 161 | 49.35 | 7.23 | .86 | 125 | 0.751 | 0.155 | |
Fatigue | 29.5 – 69.0 | 161 | 48.81 | 8.52 | .95 | 125 | 0.824 | 0.115 | |
Sleep Disturbance | 32.0 – 72.4 | 161 | 48.50 | 8.60 | .81 | 125 | 0.800 | 0.129 | |
Emotional and Behavioral Dyscontrol | 32.2 – 68.9 | 161 | 46.78 | 8.63 | .91 | 125 | 0.775 | 0.143 |
Positive Affect and Well-being includes 9 items, all other Short Forms included 8 items
For these banks, a high score indicates better functioning; for all other banks a high score indicates worse functioning (continued )
Time 1 (baseline) vs. Time 2 (7 days); MGPT – Mean General Population T-Score; MCST- Mean Clinical Sample T-Score
Correlations for Neuro-QoL SF T-scores with Generic Measures
The KPS correlated highly with Lower Extremity and moderately with Upper Extremity, Satisfaction with Roles and Activities, and Stigma (Table 3). Two correlations for Depression and Fatigue did not meet the criterion (r>0.50). The EQ-5D moderately correlated with 6 of the 13 Neuro-QoL measures and demonstrated low correlations with the remaining 7. The Global Quality of Life question met the high correlation criterion for Ability to Participate in Social Roles and Positive Affect and Well-Being, and correlated moderately with the other scales, except for the Upper and Lower Extremity Functioning and Applied Cognition.
Table 3.
Neuro-QOL Short Forms | KPS | EQ-5D | Global HRQL |
---|---|---|---|
Neuro-QOL PHYSCIAL CONSTRUCTS | |||
Lower Extremity (Mobility)* | 0.802 | 0.653 | 0.350 |
Upper Extremity (Fine Motor, ADL)* | 0.618 | 0.603 | 0.364 |
Fatigue | −0.276 | −0.521 | −0.620 |
Sleep Disturbance | −0.194 | −0.441 | −0.650 |
Neuro-QOL EMOTIONAL CONSTRUCTS | |||
Depression | −0.280 | −0.458 | −0.680 |
Anxiety | −0.150 | −0.396 | −0.522 |
Stigma | −0.588 | −0.563 | −0.542 |
Positive Affect and Well-Being* | 0.282 | 0.484 | 0.814 |
Emotional and Behavioral Dyscontrol | −0.266 | −0.414 | −0.435 |
Neuro-QOL COGNITIVE CONSTRUCTS | |||
Applied Cognition- General Concerns* | 0.232 | 0.485 | 0.424 |
Applied Cognition- Executive Function* | 0.259 | 0.438 | 0.440 |
Neuro-QOL SOCIAL CONSTURCTS | |||
Ability to Participate in Social Roles/Activities* | 0.453 | 0.586 | 0.706 |
Satisfaction with Social Roles/Activities* | 0.512 | 0.623 | 0.678 |
For these banks, a high score indicates better function, for all other banks a high score indicates wore function.
Correlation interpretation
<0.30 ~nominal;
0.30 - 0.49~small
0.5-.69:~moderate; (bolded)
>0.70~ large (bolded and italicized)
Blacked out cell indicates correlations did not reach significance threshold of .05
Correlations for Neuro-QoL SF T-scores with MS-specific measures
Six of the Neuro-QoL measures strongly correlated with the FAMS Total Score and the remaining 7 only moderately correlated, with the highest being Satisfaction with Social Roles (r=0.830) and lowest being Upper Extremity (Fine Motor; r=0.578) (Table 4a).
Table 4a.
Spearman Correlation Coefficient | FAMS | |||||||
---|---|---|---|---|---|---|---|---|
Short Forms | Total Score | Addl. Concerns | Emotional Wellbeing | Family/Social Wellbeing | General Contentment | Mobility | Symptoms | Thinking and Fatigue |
Neuro-QOL PHYSCAL CONSTRUCTS | ||||||||
Lower Extremity (Mobility)* | 0.585 | 0.464 | 0.442 | 0.234 | 0.414 | 0.862 | 0.461 | 0.353 |
Upper Extremity (Fine Motor, ADL)* | 0.578 | 0.458 | 0.452 | 0.300 | 0.435 | 0.660 | 0.423 | 0.455 |
Fatigue | −0.811 | −0.638 | −0.629 | −0.580 | −0.667 | −0.518 | −0.671 | −0.841 |
Sleep Disturbance | −0.672 | −0.625 | −0.595 | −0.527 | −0.619 | −0.324 | −0.565 | −0.690 |
Neuro-QOL EMOTIONAL CONSTRUCTS | ||||||||
Depression | −0.712 | −0.632 | −0.755 | −0.582 | −0.718 | −0.413 | −0.476 | −0.569 |
Anxiety | −0.601 | −0.582 | −0.625 | −0.488 | −0.574 | −0.280 | −0.428 | −0.595 |
Stigma | −0.763 | −0.599 | −0.691 | −0.597 | −0.655 | −0.706 | −0.443 | −0.551 |
Positive Affect and Well-Being* | 0.770 | 0.671 | 0.784 | 0.601 | 0.862 | 0.498 | 0.449 | 0.579 |
Emotional and Behavioral Dyscontrol | −0.601 | −0.610 | −0.505 | −0.517 | −0.468 | −0.318 | −0.446 | −0.647 |
Neuro-QOL COGNITIVE CONSTRUCTS | ||||||||
Applied Cognition- General Concerns* | 0.630 | 0.541 | 0.383 | 0.518 | 0.459 | 0.354 | 0.485 | 0.771 |
Applied Cognition- Executive Function* | 0.608 | 0.494 | 0.417 | 0.477 | 0.457 | 0.375 | 0.436 | 0.692 |
Neuro-QOL SOCIAL CONSTURCTS | ||||||||
Ability to Participate in Social Roles and Activities* |
0.813 | 0.648 | 0.671 | 0.537 | 0.727 | 0.711 | 0.566 | 0.659 |
Satisfaction with Social Roles and Activities* | 0.830 | 0.631 | 0.718 | 0.583 | 0.725 | 0.715 | 0.546 | 0.658 |
For these banks, a high score indicated better function, for all other banks, a high score indicates worse function
Correlation interperation
<0.30~nominal
0.30 - 0.49 ~ small
0.50 - 0.69 ~ moderate (bolded)
>0.70~ large (bolded and italicized)
All correlations <0.05
Five Neuro-QoL measures demonstrated strong correlations with the FAMS subscales; the strongest being between FAMS General Contentment and Neuro-QoL Positive Affect and Well-Being (r=0.862) and between FAMS Mobility and Neuro-QoL Lower Extremity (r=0.862) with the remaining subscales demonstrating moderate correlations (Table 4a).
The MSFC total moderately correlated with Lower Extremity (r=0.546) and Upper Extremity (r=−0.591) but not with cognition or depression Neuro-QoL measures. The MSFC T-25-FW strongly correlated with Mobility (r=0.81), the MSFC 9-HPT moderately correlated with Upper Extremity (r=0.631) and the MSFC-SDMT had only low correlations with the Neuro-QoL scales.
Baseline Known Groups Validity
Comparing the MS subjects grouped by MSFC tertile, the Neuro-QoL Physical Constructs Upper Extremity (eta2=0.30) showed strong discriminative ability and Lower Extremity (eta2=0.19) showed moderate discrimination. The Emotional, Cognitive, and Social Constructs did not discriminate among the MSFC groups (Table 5).
Table 5.
Multiple Sclerosis Functional Composite | Global Health Related Quality of Life Question | |||||
n=161 | n=161 | |||||
Baseline, difference between tertiles ‡,# | Baseline, differences between groups‡,# | |||||
During the past 7 days, I am content with the quality of my life right now. | ||||||
TERTILES | Freq. | Percent | Response | Freq. | Percent | |
Low scoring | 53 | 32.92 | Not at all | 14 | 8.70 | |
Mid range | 54 | 33.54 | A little bit | 16 | 9.94 | |
High scoring | 54 | 33.54 | Somewhat | 42 | 26.09 | |
Quite a bit | 49 | 30.43 | ||||
Very much | 40 | 24.84 | ||||
F | p | eta2* | F | p | eta2* | |
Neuro-QoL PHYSCIAL CONSTRUCTS | ||||||
Lower Extremity (Mobility) | 16.721 | 0.000 | 0.190 | 6.517 | 0.000 | 0.150 |
Upper Extremity (Fine Motor, ADL) | 33.886 | 0.000 | 0.300 | 6.570 | 0.000 | 0.140 |
Fatigue | 2.366 | 0.097 | 0.030 | 26.288 | 0.000 | 0.400 |
Sleep Disturbance | 0.879 | 0.417 | 0.010 | 18.342 | 0.000 | 0.320 |
Neuro-QOL EMOTIONAL CONSTRUCTS | ||||||
Depression | 0.712 | 0.492 | 0.010 | 31.339 | 0.000 | 0.450 |
Anxiety | 1.245 | 0.291 | 0.020 | 14.760 | 0.000 | 0.270 |
Stigma | 6.393 | 0.002 | 0.070 | 16.535 | 0.000 | 0.300 |
Positive Affect and Well-Being | 1.293 | 0.277 | 0.020 | 73.617 | 0.000 | 0.650 |
Emotional and Behavioral Dyscontrol | 3.046 | 0.050 | 0.040 | 8.468 | 0.000 | 0.180 |
Neuro-QOL COGNITIVE CONSTRUCTS | ||||||
Applied Cognition- General Concerns | 3.493 | 0.033 | 0.040 | 8.718 | 0.000 | 0.180 |
Applied Cognition- Executive Function | 7.986 | 0.000 | 0.090 | 9.435 | 0.000 | 0.190 |
Neuro-QOL Social Constructs | ||||||
Ability to Participate in Social Roles/Activities | 2.483 | 0.087 | 0.030 | 30.369 | 0.000 | 0.440 |
Satisfaction with Social Roles/Activities | 5.410 | 0.005 | 0.060 | 30.415 | 0.000 | 0.440 |
‡ df1=2 | ‡ df1=4 | |||||
# df2 between 142 and 158 | # df2 between 140 and 156 |
eta2 is an effect size that captures the proportion of variance in IV explained by DV. Interpretation guidelines:
.02 ~ small
.13 ~ moderate (bold)
.26 ~ strong (bolded and italicized)
When grouped by response to the single item GHQ question, the Physical Constructs, Fatigue (eta2=0.40), and Sleep (eta2=0.32) showed strong discriminative ability. All 5 Emotional Constructs, Depression (eta2=0.44), Anxiety (eta2=0.27), Stigma (eta2=0.30), Positive Affect and Well-being (eta2=0.65), and Social and Emotional Dsycontrol (eta2=0.18) showed strong discriminative ability as did Social Constructs, Ability to Participate in Social Roles and Activities (eta2=0.44), and Satisfaction with Social Roles and Responsibilities (eta2=0.44). Physical Constructs Lower Extremity (eta2=0.15) and Upper Extremity (eta2=0.14) showed moderate discriminative ability as did Cognitive Constructs, Applied Cognition-General Concerns (eta2=0.18) and Applied Cognition-Executive Function (eta2=0.19) (Table 5).
Responsiveness to change
Each of the Neuro-QoL SFs responsiveness from baseline to month-6 was based on the responses to the 6 GRC scores (Table 6). All reported measures met the correlation criteria of ≥0.30 and were strongly significant (p<0.001) For the Physical GRC (41.6% remained about the same) there was no reported change in the Neuro-QoL Physical Construct measures. The Social/Family GRC (55.3% remained about the same) strongly correlated with the Positive Affect and Well-Being measure. Emotional GRC (44.1% remained about the same) strongly correlated with Depression and Positive Affect and Well-Being, Cognitive GRC (54.7% remained about the same) strongly correlated with Fatigue and Positive Affect and Well-Being. Symptomatic GRC (39.8% remained about the same) strongly correlated with Positive Affect and Well-Being. Overall QOL GRC (45.3% remained about the same) strongly correlated with Depression and Positive Affect and Well-Being. Generally, these GRCs were good anchors as demonstrated by their correlations with Neuro-QoL change scores. While there is some support for Neuro-QoL responsiveness, the analysis power was sharply limited due to the fact that many patients did not experience significant change during the six months they were in the study.
TABLE 6.
Physical | Social/Family | Emotional | Cognitive | Symptomatic | Overall QOL | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Freq. | Percent | Freq. | Percent | Freq. | Percent | Freq. | Percent | Freq. | Percent | Freq. | Percent | |
Better | 25 | 15.5 | 32 | 19.9 | 34 | 21.1 | 17 | 10.6 | 25 | 15.5 | 36 | 22.4 |
About the same | 67 | 41.6 | 89 | 55.3 | 71 | 44.1 | 88 | 54.7 | 64 | 39.8 | 73 | 45.3 |
Worse | 39 | 24.2 | 10 | 6.2 | 26 | 16.2 | 26 | 16.2 | 42 | 26.1 | 22 | 13.7 |
Missing | 30 | 18.6 | 30 | 18.6 | 30 | 18.6 | 30 | 18.6 | 30 | 18.6 | 30 | 18.6 |
F | p | F | p | F | p | F | p | F | p | F | p | |
Neuro-QOL PHYSCIAL CONSTRUCTS | ||||||||||||
Lower Extremity (Mobility)* | 0.947 | 0.391 | 0.004 | 0.996 | 0.021 | 0.980 | 2.050 | 0.134 | 1.937 | 0.149 | 0.367 | 0.694 |
Upper Extremity (Fine Motor, ADL)* | 1.697 | 0.187 | 1.241 | 0.293 | 1.096 | 0.337 | 4.762 | 0.010 | 0.936 | 0.395 | 2.510 | 0.085 |
Fatigue | 2.362 | 0.098 | 3.423 | 0.036 | 3.659 | 0.029 | 9.750 | 0.000 | 6.452 | 0.002 | 3.121 | 0.048 |
Sleep Disturbance | 0.865 | 0.423 | 1.263 | 0.286 | 1.352 | 0.263 | 2.593 | 0.079 | 2.134 | 0.123 | 1.046 | 0.354 |
Neuro-QOL EMOTIONAL CONSTRUCTS | ||||||||||||
Depression | 5.604 | 0.005 | 4.173 | 0.018 | 14.818 | 0.000 | 2.492 | 0.087 | 5.019 | 0.008 | 8.991 | 0.000 |
Anxiety | 5.365 | 0.006 | 2.195 | 0.116 | 7.282 | 0.001 | 1.909 | 0.152 | 2.164 | 0.119 | 5.573 | 0.005 |
Stigma | 1.795 | 0.170 | 3.978 | 0.021 | 3.358 | 0.038 | 3.732 | 0.027 | 2.533 | 0.083 | 4.048 | 0.020 |
Positive Affect and Well-Being* | 5.897 | 0.004 | 9.480 | 0.000 | 17.068 | 0.000 | 11.110 | 0.000 | 10.185 | 0.000 | 13.101 | 0.000 |
Emotional and Behavioral Dyscontrol | 2.832 | 0.063 | 3.449 | 0.035 | 3.187 | 0.045 | 1.480 | 0.231 | 3.140 | 0.047 | 3.395 | 0.037 |
Neuro-QOL COGNITIVE CONSTRUCTS | ||||||||||||
Applied Cognition- General Concerns* | 2.123 | 0.124 | 0.263 | 0.769 | 0.848 | 0.431 | 7.086 | 0.001 | 0.056 | 0.945 | 1.050 | 0.353 |
Applied Cognition- Executive Function* | 3.298 | 0.040 | 0.239 | 0.788 | 0.683 | 0.507 | 4.694 | 0.011 | 0.607 | 0.547 | 1.235 | 0.294 |
Neuro-QOL SOCIAL CONSTRUCTS | ||||||||||||
Ability to Participate in Social Roles and Activities* | 2.159 | 0.120 | 0.438 | 0.646 | 2.355 | 0.099 | 2.391 | 0.096 | 1.276 | 0.283 | 3.915 | 0.022 |
Satisfaction with Social Roles and Activities* | 1.775 | 0.174 | 0.559 | 0.573 | 0.563 | 0.571 | 1.906 | 0.153 | 0.536 | 0.586 | 1.768 | 0.175 |
‡ df1=2:# , df2 between 95 and 128
For these banks, a high score indicates better function, for all other banks a high score indicates wore function SEE Line on next p≤0.05 considered moderately significant (bolded); p≤0.001 considered strongly significant (bolded and italicized)
DISCUSSION
We evaluated the reliability and validity of the thirteen Neuro-QoL short forms in a sample of persons with MS living in the community who were recruited from five study sites across the United States.
Not surprisingly, compared to the GP, this MS sample demonstrated scores worse than 0.5 SD on cognitive function-general concerns, worse upper and lower extremity physical function, and poorer satisfaction with social roles and activities. That the MS subjects showed comparable Positive Affect and Well Being, Anxiety and Depression than the GP sample that is likely to be a result of our sample.
Additional testing with more significantly affected MS patients is ongoing and will increase our understanding of this. MS patients are comparable to the CP for Stigma, Fatigue, Sleep Disturbance and Emotional and Behavioral Dyscontrol, highlighting the relevance of stigma and behavioral assessment in the MS population. These findings demonstrates that Neuro-QoL is a valuable tool for understanding the relative impact of different neurological conditions and different sub-groups within a disease group. All of the Neuro-QoL measures demonstrated high internal consistency.
The correlation findings indicate that the Neuro-QoL measures offer unique insight into the experience of persons with MS compared to the other measures. The moderate correlations between each of the generic measures and the Neuro-QoL Stigma measure is of interest, as none of the generic measures specifically address stigma, which is not commonly assessed as a negative consequence of MS.
The MSFC is a clinical assessment of MS disability that includes a total score and a score for each of its components, as expected the total MSFC score showed moderate correlations with the Lower Extremity and Upper Extremity but the overall MSTP did not correlate with the Cognitive, Social or Emotional domains. The lack of correlation between the SDMT and the two Neuro-QoL cognition measures is consistent with reports of poor correlation between self-reported cognition and neuropsychological tests.23-25
Since the only significant correlation the MSFC had was with the Neuro-QoL Lower and Upper scales, it is not surprising that only those two measures demonstrated known groups discrimination. In contrast, when grouped by the GHQ, all of the domains showed between group discrimination.
This sample of MS patients showed very limited change in their status over the 6-month period of this observational study. This is expected given that most studies of MS disease-modifying therapies require a 24-month period to distinguish between treatment arms. It is not surprising that there was limited responsiveness to change in the Neuro-QoL measures, There were similarly limited changes in FAMS scores over the same study period (Data not shown).
Individuals with MS included in this sample experienced a rather limited level of disability as they needed to be able to have walking and hand function that allowed them to complete the MSFC While a T-score is interpreted relative to a reference population, improved understanding of the “meaning” of a score in the context of individuals living with a given disease will lend clinical meaningfulness to these measures. Work is underway to improve such interpretation guidelines.26, 27
Conclusions
These data provide initial validation for the Neuro-QoL SF measures in a sample of persons with MS. The measures demonstrate strong internal consistency and test-retest reliability. Given their focus on quality of life in neurological conditions, their stronger correlation with other disease-specific PROs compared to generic PROs, and stronger correlations with PROs than with clinical measures of MS severity is expected. The Neuro-QoL assesses several domains of well-being not typically assessed using traditional MS-specific PROs. Those additional domains include Positive Affect and Well-Being and Emotional and Behavioral Dyscontrol. The relevance of Stigma as a component of quality of life clearly emerged from these data. The availability of one PRO measure that assesses physical, cognitive, and emotional domains of well-being and has been evaluated using a unified validation approach in five adult and two pediatric neurological conditions represents a major advancement in the ability to assess the impact of different interventions within one disease group and across individuals living with several neurological diseases. We believe Neuro-QoL provides an excellent opportunity for researchers and clinicians alike to explore aspects of MS patients’ experiences that have not been previously studied and advances opportunities to study the impact of different diseases across neurological conditions.
Table 4b.
Spearman Correlation Coefficient | MSFC | MSFC - 25 ft walk | MSFC - 9 hole peg | MSFC - Oral Symbol Digit | Performance Scales |
---|---|---|---|---|---|
Short Forms | |||||
Neuro-QOL PHYSCIAL CONSTRUCTS | |||||
Lower Extremity (Mobility)* | 0.546 | 0.815 | 0.523 | 0.253 | −0.747 |
Upper Extremity (Fine Motor, ADL)* | 0.591 | 0.518 | 0.631 | 0.33 | −0.734 |
Fatigue | −0.165 | −0.303 | −0.225 | −0.045 | 0.628 |
Sleep Disturbance | −0.03 | −0.105 | −0.064 | −0.014 | 0.442 |
Neuro-QOL EMOTIONAL CONSTRUCTS | |||||
Depression | −0.153 | −0.222 | −0.187 | −0.048 | 0.483 |
Anxiety | −0.086 | −0.105 | −0.114 | −0.052 | 0.318 |
Stigma | −0.386 | −0.527 | −0.41 | −0.183 | 0.668 |
Positive Affect and Well-Being* | 0.165 | 0.275 | 0.231 | 0.009 | −0.503 |
Emotional and Behavioral Dyscontrol | −0.206 | −0.199 | −0.246 | −0.114 | 0.455 |
Neuro-QOL COGNITIVE CONSTRUCTS | |||||
Applied Cognition- General Concerns* | 0.207 | 0.162 | 0.17 | 0.229 | −0.574 |
Applied Cognition- Executive Function* | 0.315 | 0.227 | 0.248 | 0.341 | −0.583 |
Neuro-QOL SOCIAL CONSTURCTS | |||||
Ability to Participate in Social Roles and Activities* | 0.236 | 0.453 | 0.285 | 0.087 | −0.676 |
Satisfaction with Social Roles and Activities* | 0.322 | 0.473 | 0.397 | 0.126 | −0.712 |
For these banks, a high score indicates better function, for all other banks a high score indicates wore function.
Correlation interpretation
<0.30 ~nominal;
0.30 – 0.49~small
0.5-.69:~moderate (bold)
≥0.70~ large (bolded and italicized)
Blacked out cell indicates correlations are above significance threshold of .05
References
- 1.Noseworthy JH, Lucchinetti C, Rodriguez M, Weinshenker BG. Multiple sclerosis. N Engl J Med. 2000;343:938–952. doi: 10.1056/NEJM200009283431307. [DOI] [PubMed] [Google Scholar]
- 2.NINDS [February 27, 2012];Multiple Sclerosis: Hope Through Research [online] Available at: http://www.ninds.nih.gov/disorders/multiple_sclerosis/detail_multiple_sclerosis.htm.
- 3.Rudick RA, Miller D, Clough JD, Gragg LA, Farmer RG. Quality of life in multiple sclerosis. Comparison with inflammatory bowel disease and rheumatoid arthritis. Archives of Neurology. 1992;49:1237–1242. doi: 10.1001/archneur.1992.00530360035014. [DOI] [PubMed] [Google Scholar]
- 4.Chopra P, Herrman H, Kennedy G. Comparison of disability and quality of life measures in patients with long-term psychotic disorders and patients with multiple sclerosis: an application of the WHO Disability Assessment Schedule II and WHO Quality of Life-BREF. Int J Rehabil Res. 2008;31:141–149. doi: 10.1097/MRR.0b013e32830150e6. [DOI] [PubMed] [Google Scholar]
- 5.Miller DM, Kattan MW, Fu AZ. Health related quality of life assessment in multiple sclerosis. In: Cohen JA, Rudick RA, editors. Multiple Sclerosis Theraputics. 4th ed. Cambridge University Press; New York: 2011. pp. 79–90. [Google Scholar]
- 6.Cella D, Nowinski C, Victorson D, Miller D, Jai J-S, Moy C. The Neurology Quality of Life Measurement (Neuro-QOL) Initiative. Archives of Physical Medicine & Rehabilitation. 2011;92:528–536. doi: 10.1016/j.apmr.2011.01.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Cella D, Nowinski C, Peterman A, Miller DM. The Neuro-QOL Project: Using Multiple Methods to Develop a HQOL Measurement Platform to be Used in Clinical Research Across Neurological Conditions. Quality of Life Research. 2006;A-14:1315. [Google Scholar]
- 8.MIller DM, Nowinski C, Victorson D, Peterman A, Perez L. The Neuro-QOL Project: Establishing Research Priorities Through Qualitative Research and Consensus Develeopment. Quality of Life Research. 2005;14:2031. [Google Scholar]
- 9.Perez L, Huang J, Jansky L, et al. Using focus groups to inform the Neuro-QOL measurement tool: exploring patient-centered, health-related quality of life concepts across neurological conditions. Journal of Neuroscience Nursing. 2007;39:342–353. doi: 10.1097/01376517-200712000-00005. [DOI] [PubMed] [Google Scholar]
- 10.Cella D, Lai JS, Nowinski CJ, et al. Neuro-QOL: brief measures of health-related quality of life for clinical research in neurology. Neurology. 2012;78:1860–1867. doi: 10.1212/WNL.0b013e318258f744. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Gershon RC, Lai JS, Bode R, et al. Neuro-QOL: quality of life item banks for adults with neurological disorders: item development and calibrations based upon clinical and general population testing. Quality of Life Research. 2012;21:475–486. doi: 10.1007/s11136-011-9958-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.(NINDS). NIoNDaS User Manual for the Quality of Life in Neurological Disorders (Neuro-QOL) Measures, version 1.0 [online] Available at: http://www.neuroqol.org/Instruments%20%20Tools/Neuro-QOL_UserManual.pdf.
- 13.Karnofsky DA, Burchenal JH. The clinical evaluation of chemotherapeutic agents in cancer. In: Macleod CM, editor. Evaluation of Chemotherapeutic Agents. Columbia University Press; New York: 1949. pp. 199–205. [Google Scholar]
- 14.Smith A. Symbol Digit Modalities Test. Psychological Services; Los Angeles, CA.: 1991. [Google Scholar]
- 15.Rabin R, de Charro F. EQ-5D: a measure of health status from the EuroQol Group. Annals of Medicine. 2001;33:337–343. doi: 10.3109/07853890109002087. [DOI] [PubMed] [Google Scholar]
- 16.Johnson J, Coons S, Ergo A, Szava-Kovats G. Valuation of EuroQOL (EQ-5D) health states in an adult US sample. Pharmacoeconomics. 1998;13:421–433. doi: 10.2165/00019053-199813040-00005. [DOI] [PubMed] [Google Scholar]
- 17.Hays RD, Bjorner JB, Revicki DA, Spritzer K, Cella D. Development of Physical and Mental Health Summary Scores From the Patient Reported Outcomes Measurement Information System (PROMIS) Global Items. 2008 doi: 10.1007/s11136-009-9496-9. Unpublished. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Cella D. Manual of the Functional Assessment of Chronic Illness Therapy (FACIT Scales) Center on Outcomes, Research and Education (CORE), Evanston Northwestern Healthcare and Northwestern University; Evanston, IL: 1997. [Google Scholar]
- 19.Jaeschke R, Singer J, Guyatt GH. Measurement of health status. Ascertaining the minimal clinically important difference. Controlled Clinical Trials. 1989;10:407–415. doi: 10.1016/0197-2456(89)90005-6. [DOI] [PubMed] [Google Scholar]
- 20.Yost KJ, Eton DT, Garcia SF, Cella D. Minimally important differences were estimated for six Patient-Reported Outcomes Measurement Information System-Cancer scales in advanced-stage cancer patients. J Clin Epidemiol. 2011;64:507–516. doi: 10.1016/j.jclinepi.2010.11.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Rudick RA, Cutter G, Baier M, et al. Use of the Multiple Sclerosis Functional Composit in predicting disability in relapsing MS. Neurology. 2001;56:1324–1330. doi: 10.1212/wnl.56.10.1324. [DOI] [PubMed] [Google Scholar]
- 22.Cella DF, Dineen K, Arnason B, et al. Validation of the functional assessment of multiple sclerosis quality of life instrument. Neurology. 1996;47:129–139. doi: 10.1212/wnl.47.1.129. [DOI] [PubMed] [Google Scholar]
- 23.Benedict RHB, Wahlig E, Bakshi R, et al. Predicting quality of life in multiple sclerosis: accounting for physical disability, fatigue, cognition, mood disorder, personality, and behavior change. J Neurol Sci. 2005;231:29–34. doi: 10.1016/j.jns.2004.12.009. [DOI] [PubMed] [Google Scholar]
- 24.Langdon DW. Cognition in multiple sclerosis. Curr Opin Neurol. 2011;24:244–249. doi: 10.1097/WCO.0b013e328346a43b. [DOI] [PubMed] [Google Scholar]
- 25.Bruce JM, Bruce AS, Hancock L, Lynch S. Self-reported memory problems in multiple sclerosis: influence of psychiatric status and normative dissociative experiences. Archives of Clinical Neuropsychology. 2010;25:39–48. doi: 10.1093/arclin/acp092. [DOI] [PubMed] [Google Scholar]
- 26.Cella D, Choi S, Garcia S, et al. Setting standards for severity of common symptoms in oncology using the PROMIS item banks and expert judgment. Quality of Life Research. 2014;23:2651–2661. doi: 10.1007/s11136-014-0732-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Cook KF, Victorson DE, Cella D, Schalet BD, Miller DM. Creating meaningful cut-scores for Neuro-QOL measures of fatigue, physical functioning, and sleep disturbance using standard setting with patients and providers. Quality of Life Research. 2014 doi: 10.1007/s11136-014-0790-9. [DOI] [PubMed] [Google Scholar]