2023 Sep 30;12:183. doi: 10.1186/s13643-023-02337-8

Table 5.

Review of psychometric properties of identified questionnaires using a modified version of the COSMIN checklist

| PRO | Study used in | Conceptual model | Reliability | Content validity | Construct validity | Responsiveness to change | Target population of PRO: generic, cancer, disease, or ACT specific |
| --- | --- | --- | --- | --- | --- | --- | --- |
| SF-36 Health Survey (SF-36) [50] | Hoogland, Maziarz | Eight multi-item dimensions covering functional status, wellbeing and overall evaluation of health | Internal consistency and test–retest demonstrated | Patient interviews conducted | Yes: scores distributed as expected for sex, age, social class, use of health services and for patients with chronic disease | Not mentioned | Generic |
| Patient-Reported Outcome Measurement Information System-29 (PROMIS-29) [48] | Hoogland, Mullane, Ruark, Wang | Assesses pain intensity using a single 0–10 numeric rating item and seven health domains (physical function, fatigue, pain interference, depressive symptoms, anxiety, ability to participate in social roles and activities, and sleep disturbance) using four items for each domain | Internal consistency. More reliable than existing summary scores | No details provided | No details provided | No details provided | Generic |
| Patient-Reported Outcomes-Common Terminology Criteria for Adverse Events (PRO-CTCAE) [51] | Hoogland, Sidana | PRO-based measurement system to capture symptomatic adverse events by self-report in cancer clinical trials | Test–retest reliability was acceptable for 36/49 pre-specified items | Patient interviews conducted | Overall, 119/124 items met at least one construct validity criterion | Statistically significant correlations were observed between PRO-CTCAE item changes and corresponding QLQ-C30 scale changes for all 27 pre-specified items (median r = 0.43, range 0.10–0.56; all P ≤ .006) | Cancer specific |
| Inventory of Depression and Anxiety Symptoms (IDAS) [52] | Knight | To create specific symptom scales reflecting distinctive aspects of depression and anxiety | Test–retest reliability figures ranged from 0.72 to 0.83. Good internal consistency | No details provided | Data presented correlating the IDAS with both the HRSD and the IMAS; not yet examined in relation to formal DSM-IV diagnoses of major depression and the anxiety disorders | No details provided | Generic |
| Brief Pain Inventory (BPI) [53] | Knight | Measures sensory and reactive pain. Rates intensity and how much pain interferes with activities | Good internal consistency (Cronbach's alpha (CA) 0.78–0.95 across the two scales). Test–retest reliability is mixed | Patient interviews conducted | Factor analysis was consistent across different clinical groups | Ability to detect clinically meaningful change | Generic |
| Fatigue Severity Index (FSI) | Knight | Questionnaire could not be located | | | | | |
| Pittsburgh Sleep Quality Index (PSQI) [54] | Knight | Aims to discriminate between good and poor sleepers and be a useful tool for researchers and clinicians. Assesses sleep duration and latency and frequency and severity of sleep problems | Good test–retest reliability | Developed using experience with patients but interviews not mentioned | Significant differences across groups | Not mentioned | Generic |
| Simplified QoL questionnaire | Li | None | None | None | None | None | Study specific |
| European Organisation for Research and Treatment of Cancer Quality of Life Core 30 (EORTC QLQ-C30) [55] | Martin, Shah | Generate a core questionnaire incorporating a range of physical, emotional and social health issues relevant to a broad spectrum of cancer patients, irrespective of specific diagnosis. This core instrument could then be supplemented by diagnosis-specific (e.g. lung cancer or breast cancer) and/or treatment-specific questionnaire modules | The recommended 0.7 for good internal consistency between groups was met for 8 of the 9 subscales | Patient interviews | Could discriminate across clinical criteria | Significant changes in the right direction were reported for functional scales | Cancer specific |
| European Organisation for Research and Treatment of Cancer Multiple Myeloma (EORTC MY20) | Martin, Shah | To assess the disease-specific symptoms of myeloma and their impact on everyday life and treatment-related issues, mainly side effects of chemotherapy. To be used in conjunction with QLQ-C30 | Internal consistency was greater than 0.7 CA for all scales | Interviews with patients, oncologists and haematologists | Correlations with QLQ-C30 items. Two subscales (disease symptoms and side effects) and the body image item could discriminate by performance status (PS) and between patients with and without fractures | Pain was the only scale to show significant change over time | Disease specific (myeloma) |
| EuroQoL 5D (EQ-5D-5L) [56] | Martin, Wang, Shah | Generic instrument for describing and valuing health | Korean version reliable in cancer patients | Patient interviews | Scores were reported in expected direction for key characteristics, e.g. age, education, smoking status | Could detect improvements and deterioration in health (breast cancer) | Generic |
| EuroQoL 5D (EQ-5D-3L) [57] | Laetsch | A standardised non-disease-specific instrument for describing and valuing HRQOL | | | Responses conform to what would be expected for key characteristics | | Generic |
| EuroQoL 5D Youth (EQ-5D-Y) [58] | Laetsch | A standardised non-disease-specific instrument suitable for children and adolescents | Test–retest results were good for most domains. Ceiling effects for mobility and self-care | Interviews with healthy and chronically ill young people | High correlations with existing questionnaires. Able to distinguish between those with chronic pain and those without | Largest treatment effect observed in chronically ill children. Poorer responses in children with minimal or no health concerns | Generic |
| Patient-Reported Outcome Measurement Information System Global Health (PROMIS Global Health) [59] | Mullane, Ruark | Global health refers to a person’s general evaluations of health rather than any of its specific components. The global health items include global ratings of the five primary PROMIS domains (physical function, fatigue, pain, emotional distress and social health) and general health perceptions that cut across domains | | | Correlations with comparable items from PROMIS | | Generic |
| Additional questions | Mullane, Ruark | None | None | None | None | None | Study specific |
| MD Anderson Symptom Inventory (MDASI) [60] | Wang | Brief measure of the impact and severity of symptom items | The values of α for the two sets of symptom items and the interference scales, respectively, were 0.85, 0.82 and 0.91 for the validation sample and 0.87, 0.87 and 0.94 for the cross-validation sample, which shows a high level of reliability for these sets of items | Clinician assessment but patients not mentioned | Able to differentiate between PS | Not mentioned | Cancer specific |
| Single-item HRQOL | Wang | None | None | None | None | None | Study specific |
| CAR T-cell therapy-specific symptoms | Wang | None | None | None | None | None | Study specific |
| The Pediatric Quality-of-Life Inventory (PedsQL) [61] | Laetsch | Integrates generic core scales and disease-specific modules into one measurement system. Designed to measure the core health domains delineated by the WHO | Most self-report scales and proxy-report scales approached or exceeded the minimum reliability standard of 0.70 | No details provided | The PedsQL performed as hypothesized utilising the known-groups method. The PedsQL differentiated HRQOL between healthy children and those with acute or chronic health conditions and was correlated with measures of morbidity and illness burden. The multitrait-multimethod (MTMM) analyses tested convergent and discriminant validity across methods. The heterotrait-monomethod analyses are consistent with the conceptualization of the PedsQL as measuring an integrated multidimensional construct | No details provided | Generic (paediatric) |
| Functional Assessment of Cancer Therapy-General (FACT-G) [62] | Sidana | Generic scale which can be combined with disease-specific modules. Quality of life treated as a subjective multidimensional concept | Good internal consistency demonstrated for the subscales | Patient interviews used to generate items | Convergent and divergent validity were demonstrated when compared with other measures. Able to differentiate between stages of disease | Could detect change over time in performance status | Cancer specific |
| Quality of Life in Neurological Disorders (Neuro-QoL v2) [63] | Sidana | Neuro-QoL is a new, standardized approach to measuring HRQL across common neurologic conditions | | Patient focus groups | | Conditional minimal detectable change scores have been estimated for Neuro-QoL short forms. Thresholds for severity of four Neuro-QoL measures (fatigue, upper extremity function, lower extremity function-mobility, sleep disturbance) have been estimated using a modified bookmarking methodology based on the perspective of individuals with multiple sclerosis and clinicians | Disease specific (neurological) |
| Functional Assessment of Cancer Therapy-Lymphoma (FACT-Lym) [64] | Maziarz | Lymphoma-specific questionnaire designed to complement FACT-G | Internal consistency coefficients for the 15-item LymS (0.79, 0.85 and 0.84 T1–T3) and test–retest stability (0.84) indicated good reliability | Interviews with clinicians and patients | Did not differentiate between patient groups defined by NHL grade. Patients currently on treatment had lower FACT-Lym scores. Moderate correlations with POMS, SF-36 and PCS | Able to differentiate between three patient groups over time (worse, unchanged, better) | Disease specific (lymphoma) |
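
Several reliability entries in Table 5 report internal consistency against a 0.70 threshold. As a rough illustration only, and assuming "CA" in the table denotes Cronbach's alpha, the sketch below shows how such a coefficient is commonly computed from item-level responses. The function name `cronbach_alpha` and the response data are hypothetical and are not taken from any of the reviewed studies.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    Sample variances (n - 1 denominator) are used throughout.
    """
    n_respondents = len(scores)
    k = len(scores[0])
    if k < 2 or n_respondents < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    if any(len(row) != k for row in scores):
        raise ValueError("every respondent must answer all k items")

    # Variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Variance of each respondent's summed score.
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance; alpha is undefined")

    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical data: three items that move together perfectly.
consistent = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
# Hypothetical data: two items with no shared variance.
unrelated = [[1, 2], [2, 1], [1, 1], [2, 2]]

alpha_consistent = cronbach_alpha(consistent)
alpha_unrelated = cronbach_alpha(unrelated)
```

Under the convention cited in the table, a scale would be described as having good internal consistency when this coefficient reaches about 0.70 or higher. The original validation papers may have used different software or estimation details.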