Table 2.
| Measure | Definition | Common methods of assessment | Interpretation^a |
|---|---|---|---|
| **Validity** (degree to which an instrument measures the concept it is intended to measure)28,30,31,59 | | | |
| Content validity | Extent to which the instrument includes the most relevant and important aspects of a concept54 | Stakeholder focus groups, interviews, surveys | Qualitative evidence from development and pre-testing60 |
| Construct validity^b | Evidence that relationships among items, domains, and concepts conform to a priori hypotheses28 | Correlation statistics | ≥0.70 supports strong correlation |
| *Congruent* | Extent to which a measure correlates with another measure assessing the same construct | | |
| *Known group* | Extent to which a measure is sensitive to differences and similarities between groups with known attributes | | |
| Responsiveness | Extent to which an instrument can detect changes in the construct being measured over time30 | Score change statistics | Statistically significant difference in scores pre- and post-clinically relevant events29 |
| **Reliability** (degree to which an instrument is free from measurement error)30,31,59 | | | |
| Internal consistency reliability | Degree of interrelatedness among the items in a multi-item measure30,31 | Cronbach's α | ≥0.70: adequate internal consistency60 |
| Test-retest reliability | Ability of a measure to provide consistent scores over time in a stable population30 | Intraclass correlation coefficient; Kappa statistic | ≥0.70: supports test-retest reliability29 |
^a Interpretation score thresholds are not well established and may differ across populations and sources.
^b Construct validity was considered as congruent and known-group validity, as these were the construct validity subtypes most commonly assessed in the identified instruments.
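For reference, Cronbach's α, listed above as the usual statistic for internal consistency, is conventionally computed for a scale of $k$ items as

$$
\alpha = \frac{k}{k-1}\left(1 - \frac{\sum_{i=1}^{k}\sigma^{2}_{Y_i}}{\sigma^{2}_{X}}\right),
$$

where $\sigma^{2}_{Y_i}$ is the variance of item $i$ and $\sigma^{2}_{X}$ is the variance of the total score. Values of at least 0.70 are the commonly cited benchmark for adequate internal consistency, consistent with the threshold in the table. The intraclass correlation coefficient used for test-retest reliability, by contrast, takes different forms depending on the study design (e.g., one-way vs. two-way models, consistency vs. absolute agreement).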