. 2023 Sep 30;12:183. doi: 10.1186/s13643-023-02337-8

Table 5.

Review of psychometric properties of identified questionnaires using modified version of COSMIN checklist

PRO	Study used in	Conceptual model	Reliability	Content validity	Construct validity	Responsiveness to change	Target population of PRO: generic, cancer, disease, or ACT specific
SF-36 Health Survey (SF-36) [50]	Hoogland, Maziarz	Eight multi-item dimensions covering functional status, wellbeing and overall evaluation of health	Internal consistency and test–retest demonstrated	Patient interviews conducted	Yes—scores distributed as expected for sex, age, social class, use of health services and for patients with chronic disease	Not mentioned	Generic
Patient-Reported Outcome Measurement Information System-29 (PROMIS-29) [48]	Hoogland, Mullane, Ruark, Wang	Assesses pain intensity using a single 0–10 numeric rating item and seven health domains (physical function, fatigue, pain interference, depressive symptoms, anxiety, ability to participate in social roles and activities, and sleep disturbance) using four items for each domain	Internal consistency. More reliable than existing summary scores	No details provided	No details provided	No details provided	Generic
Patient-Reported Outcomes-Common Terminology Criteria for Adverse Events (PRO-CTCAE) [51]	Hoogland, Sidana	PRO-based measurement system to capture symptomatic adverse events by self-report in cancer clinical trials	Test–retest reliability was acceptable for 36/49 pre-specified items	Patient interviews conducted	Overall, 119/124 items met at least one construct validity criterion	Statistically significant correlations were observed between PRO-CTCAE item changes and corresponding QLQ-C30 scale changes for all 27 pre-specified items (median r = 0.43, range 0.10–0.56; all P ≤ .006)	Cancer specific
Inventory of Depression and Anxiety Symptoms (IDAS) [52]	Knight	To create specific symptom scales reflecting distinctive aspects of depression and anxiety	Test–retest reliability figures ranged from 0.72 to 0.83. Good internal consistency	No details provided	Presented data correlating the IDAS with both the HRSD and the IMAS, we have not yet examined it in relation to formal DSM-IV diagnoses of major depression and the anxiety disorders	No details provided	Generic
Brief Pain Inventory (BPI) [53]	Knight	Measures sensory and reactive pain. Rate intensity and how much pain interferes with activities	Good internal consistency (CA 0.78–0.95 across the two scales). Test–retest reliability is mixed	Patient interviews conducted	Factor analysis was consistent across different clinical groups	Ability to detect clinically meaningful change	Generic
Fatigue Severity Index (FSI)	Knight	Questionnaire could not be located
Pittsburgh Sleep Quality Index (PSQI) [54]	Knight	Aims to discriminate between good and poor sleepers and be a useful tool for researchers and clinicians. Assesses sleep duration and latency and frequency and severity of sleep problems	Good test–retest reliability	Developed using experience with patients but interviews not mentioned	Significant differences across groups	Not mentioned	Generic
Simplified QoL questionnaire	Li	None	None	None	None	None	Study specific
European Organisation for Research and Treatment of Cancer Quality of Life Core 30 (EORTC QLQ-C30) [55]	Martin, Shah	Generate a core questionnaire incorporating a range of physical, emotional and social health issues relevant to a broad spectrum of cancer patients, irrespective of specific diagnosis. This core instrument could then be supplemented by diagnosis-specific (e.g. lung cancer or breast cancer) and/or treatment-specific questionnaire modules	The recommended 0.7 for good internal consistency between groups was met for 8 of the 9 subscales	Patient interviews	Could discriminate across clinical criteria	Significant changes in the right direction were reported for functional scales	Cancer specific
European Organisation for Research and Treatment of Cancer Multiple Myeloma (EORTC MY20)	Martin, Shah	To assess the disease-specific symptoms of myeloma and their impact on everyday life and treatment-related issues, mainly side effects of chemotherapy. To be used in conjunction with QLQ-C30	Internal consistency was greater than 0.7 CA for all scales	Interviews with patients, oncologists and haematologists	Correlations with QLQ-C30 items. Two subscales (disease symptoms and side effects) and the body image item could discriminate by PS and patients with/without fractures	Pain was only scale to show significant change over time	Disease specific (myeloma)
EuroQoL 5D (EQ-5D-5L) [56]	Martin, Wang, Shah	Generic instrument for describing and valuing health	Korean version reliable in cancer patients	Patient interviews	Scores were reported in expected direction for key characteristics, e.g. age, education, smoking, status	Could detect improvements and deterioration in health (breast cancer)	Generic
EuroQoL 5D EQ-5D-3L [57]	Laetsch	A standardised non-disease-specific instrument for describing and valuing HRQOL			Responses conform to what would be expected for key characteristics		Generic
EuroQoL 5D Youth (EQ-5D-Y) [58]	Laetsch	A standardised non-disease-specific instrument suitable for children and adolescents	Test–retest results were good for most domains. Ceiling effects for mobility and self-care	Interviews with healthy and chronically ill young people	High correlations with existing questionnaires. Able to distinguish between those with chronic pain and those without	Largest treatment effect observed in chronically ill children. Poorer responses in children with minimal pr no health concerns	Generic
Patient-Reported Outcome Measurement Information System Global Health (PROMIS Global Health) [59]	Mullane, Ruark	Global health refers to a person’s general evaluations of health rather than any of its specific components. The global health items include global ratings of the five primary PROMIS domains (physical function, fatigue, pain, emotional distress and social health) and general health perceptions that cut across domains			Correlations with comparable items from PROMIS		Generic
Additional questions	Mullane, Ruark	None	None	None	None	None	Study specific
MD Anderson Symptom Inventory (MDASI) [60]	Wang	Brief measure of the impact and severity of symptom items	The values of a for the two sets of symptom items and the interference scales, respectively, were 0.85, 0.82 and 0.91 for the validation sample and 0.87, 0.87 and 0.94 for the cross-validation sample, which shows a high level of reliability for these sets of items	Clinician assessment but patients not mentioned	Able to differentiate between PS	Not mentioned	Cancer specific
Single-item HRQOL	Wang	None	None	None	None	None	Study specific
CAR T-cell therapy-specific symptoms	Wang	None	None	None	None	None	Study specific
The Pediatric Quality-of-Life Inventory (PedsQL) [61]	Laetsch	Integrates generic core scales and disease-specific modules into one measurement system. Designed to measure core health domains covered in WHO	Most self-report scales and proxy-report scales approached or exceeded the minimum reliability standard of 0.70	No details provided	The PedsQL performed as hypothesized utilising the known-groups method. The PedsQL differentiated HRQOL between healthy children and those with acute or chronic health conditions and was correlated with measures of morbidity and illness burden. The MTMM analyses tested convergent and discriminant validity across methods. The heterotrait-monomethod analyses are consistent with the conceptualization of the PedsQL as measuring an integrated multidimensional construct	No details provided	Generic (paediatric)
Functional Assessment Cancer Therapy-General (FACT-G) [62]	Sidana	Generic scale which can be combined with disease-specific modules. Quality of life treated as a subjective multidimensional concept	Good internal consistency demonstrated for the subscales	Patient interviews used to generate items	Convergent and divergent validity were demonstrated when compared with other measures. Able to differentiate between stage of disease	Could detect change over time in performance status	Cancer specific
Quality of Life in Neurological Disorders (Neuro-QoL v2) [63]	Sidana	Neuro-QoL is a new, standardized approach to measuring HRQL across common neurologic conditions		Patient-focus groups		Conditional minimal detectable change scores have been estimated for Neuro-QoL short forms. Thresholds for severity of four Neuro-QoL measures (fatigue, upper extremity function, lower extremity function-mobility, sleep disturbance) have been estimated using a modified bookmarking methodology based on the perspective of individuals with multiple sclerosis and clinicians	Disease specific (neurological)
Functional Assessment of Cancer Therapy (FACT-Lym) [64]	Maziarz	Lymphoma-specific questionnaire designed to compliment FACT-G	Internal consistency coefficients for the 15-item LymS (0.79, 0.85 and 0.84 T1–T3) and test–retest stability (0.84) indicated good reliability	Interviews with clinicians and patients	Did not differentiate between patient groups defined by NHL grade. Patients currently on treatment had lower FACT-Lym scores. Moderate correlations with POMS, SF-36 and PCS	Able to differentiate between three patient groups overtime (worse unchanged better)	Disease specific (lymphoma)