Abstract
Objective:
Assessing functional limitations for adults at high risk of frailty yields valuable information for identifying those in need of therapy. We evaluate a self-report measure used to assess physical function among Medicare recipients in the United States.
Design:
Secondary analysis of the 2020 Medicare Health Outcomes Survey.
Setting:
A random sample of adult enrollees of 510 managed care plans.
Participants:
287,476 adults (37% completion rate): 58% female; 16% were < 65 years old (entitled via disability), 50% 65–74, and 34% 75 or older; 77% White, 14% Black, and 8% another race; 19% had < high school education.
Interventions:
Not applicable.
Main Outcome Measure:
We evaluate item distributions, dimensionality, monotonicity of response options, reliability, and validity of the 8-item Physical Functioning Activities of Daily Living (PFADL) scale.
Results:
Most reported they could do 6 basic activities of daily living without difficulty. More limitations were reported for the other two PFADL items: 32% were not limited at all in climbing several flights of stairs and 40% in moderate activities. Product-moment correlations among the 8 items ranged from r = 0.19 between the easiest-to-do (eating) and most difficult-to-do (climbing several flights of stairs) items to r = 0.73 between bathing and dressing. The coefficient alpha and omega for the 8-item scale were both 0.86. Item slopes ranged from 2.6 (climbing several flights of stairs and eating) to 4.8 (dressing). Item characteristic curves revealed that response options were most likely to be selected in the appropriate order along the physical functioning continuum. The PFADL had at least 0.80 reliability between about −3 SDs below the mean to the mean. It was negatively correlated with comorbid condition count, disability days, problems with balance or walking, and falling.
Conclusions:
The PFADL is useful for assessing average or below physical function in Medicare recipients.
Keywords: functional status, Medicare, self-assessment, aged
INTRODUCTION
Physical function predicts hospitalizations, institutionalization, and mortality.1–3 It is associated with medical conditions such as chronic obstructive pulmonary disease, back pain, osteoarthritis,4 loneliness,5 and social factors like neighborhood and physical environment, economic stability, and education.6 Assessing functional limitations in older individuals is important,7–8 especially for patients at high risk of frailty (e.g., kidney disease).9 Assessment of physical function yields valuable information for physical therapy, self-management, and referral to community-based exercise programs.10
Random samples of patients from all managed care organizations with Medicare contracts in the United States complete the Medicare Health Outcomes Survey (HOS) annually. Results are used for quality improvement, pay for performance, public reporting, and program oversight. The Physical Functioning Activities of Daily Living (PFADL) scale is included in the HOS.
Six of the PFADL items assess basic activities of daily living (BADL): walking, dressing, bathing, getting in/out of chairs, eating, and toileting: 1) No, I do not have difficulty, 2) Yes, I have difficulty, and 3) I am unable to do this activity. The scoring of the physical function items by the Centers for Medicare & Medicaid Services (CMS) assumes that the response selected is consistent with underlying level of physical function. Each response category of the items should have the highest likelihood of being selected somewhere along the underlying distribution consistent with No, I do not have difficulty representing the best functioning, Yes, I have difficulty the next best functioning, and I am unable to do this activity the worst functioning.11 Some investigators score these items by collapsing the 2nd and 3rd response categories.12–13 Doing this may discard potentially useful information about physical function.
The other 2 PFADL items are from the RAND-36 health survey.14 They assess: 1) moderate activities, such as moving a table, pushing a vacuum cleaner, bowling, or playing golf; and 2) climbing several flights of stairs. For these items, patients are asked to select “No, not limited at all; Yes, limited a little; or Yes, limited a lot.” The extent to which these RAND-36 items are associated with the 6 BADL items and whether they improve the measurement of higher levels of physical functioning is unknown.
The assumptions underlying the scoring of the PFADL are that the 8 items are unidimensional and that the response options monotonically represent the level of physical function. Evidence in support of the reliability and validity of the PFADL for use among Medicare patients is also needed. Testing these assumptions can inform the selection and development of items to optimize future measurements of physical functioning. This study assesses whether the PFADL is unidimensional, its response options are monotonically related to underlying physical function, the reliability of the scale across the underlying physical function continuum, and its construct validity.
METHODS
Ethics approval
The data collected in this study was collected in accordance with the Declaration of Helsinki. We obtained the public use files dataset that was constructed in accordance with CMS and Department of Health and Human Services policies and other applicable statutes and laws. All identifying information was excluded from the files, and demographic categories were aggregated so that identification of any given individual is not possible.
Sample
CMS conducted a random sample of adult enrollees of 510 managed care plans, including those 65 and older and younger enrollees entitled via disability. Data was collected between August 17 through November 9, 2020. Of the 783,389 individuals sampled, 287,476 (37%) completed at least some of the items in the baseline survey (83% by mail and 17% by phone); 13,219 (5%) had missing data for more than half the items. The survey was completed in either English, Spanish, or Chinese language. Further information about the Medicare HOS 2020 (cohort 23) baseline file is available online.15
Measures
The PFADL items are shown in the Appendix. Each item is scored so that a higher number represents better physical functioning: limited a lot or unable to perform the ADL = 0 points, limited a little or difficulty performing the ADL = 1 point, and having no limitation or difficulty performing the ADL = 2 points. The PFADL scale score sum ranges from 0 to 16 and a higher score represents better functioning.
We created a count of the number of 19 medical conditions that were included in the HOS survey: 1) blind or serious difficulty seeing; 2) serious difficulty hearing; 3) serious difficulty concentrating, remembering, or making decisions; 4) difficulty doing errands alone; 5) hypertension or high blood pressure; 6) angina pectoris or coronary artery disease; 7) congestive heart failure; 8) myocardial infarction or heart attack; 9) other hear conditions such as problems with heart values or the rhythm of the heartbeat; 10) stroke; 11) emphysema, or asthma, or chronic obstructive pulmonary disease; 12) Crohn’s disease, ulcerative colitis, or inflammatory bowel disease; 13) arthritis of the hip or knee; 14) arthritis of the hand or wrist; 15) osteoporosis; 16) sciatica; 17) diabetes; 18) depression; and 19) cancer (other than skin cancer). Also included were the Centers for Disease Control’s unhealthy physical days in the last 30 days,16 a question about having a problem with balance or walking in the past 12 months, and a question about falling in the past 12 months. In addition, obesity was measured from self-reported weight in kilograms divided by the square of height in meters (obese = body mass index >=30).
Education was categorized at 8th grade or less; Some high school but did not graduate; high school graduate or General Education Diploma (GED); Some college or 2-year degree; 4-year college graduate; More than a 4-year college degree. Age was categorized as less than 65, 65–74, and 75 or older.
Analyses
We provide demographic information for the sample (Table 1). Then, we compute frequencies and product-moment correlations among the 8 PFADL items. We examine principal component eigenvalues based on polychoric correlations to assess unidimensionality (i.e., number of eigenvalues>1), estimate reliability (coefficient alpha17 and omega18), scale means, and ceiling effects (percent with the best possible score). Next, we provide item characteristic curves, item thresholds and slopes, and scale information from an item response theory graded response model.19 Finally, we assess our a priori hypotheses that worse physical functioning will be associated with the number of comorbid conditions, disability days due to physical health, having a problem with balance or walking in the past 12 months, falling in the past 12 months, obesity, and lower educational attainment. In addition, we hypothesized that Medicare recipients younger than 65 (entitled via disability) would have worse physical functioning than those 65 and older and that those 65–74 would have better physical functioning than those 75 and older. Analyses were conducted using SAS 9.4.20.
Table 1:
Characteristics of the Sample (n = 287476)
| Variable | N (Percent) | Medicare Population29 |
|---|---|---|
| Female | 156296 (58%) | 55% |
| Age | ||
| < 65 years | 46005 (16%) | 14% |
| 65–74 | 142812 (50%) | 50% |
| 75 or older | 98659 (34%) | 37% |
| Race | ||
| White | 197788 (77%) | 77% |
| Black | 36753 (14%) | 11% |
| Another race | 21670 (8%) | 12% |
| Education | ||
| Did not graduate high school | 51462 (19%) | 14% |
| High school graduate/general education diploma | 86034 (32%) | 33% |
| More than high school | 128452 (48%) | 53% |
| Marital Status | ||
| Married | 118771 (44%) | 51% |
| Survey Language | ||
| English | 273109 (95%) | |
| Spanish | 14003 (5%) | |
| Chinese | 364 (<1%) |
RESULTS
The demographic characteristics of the sample were very similar to the Medicare population in 2019 (see Table 1). The sample was 58% female; 77% White, 14% Black, and 8% another race; 16% were less than 65 years old, 50% 65–74, and 34% 75 or older. Nineteen percent had less than high school education, 32% high school or GED, and 48% more than a high school education. Forty-four percent were married. Ninety-five percent completed the survey in English, 5% in Spanish, and <1% in Chinese. The median number of 19 medical conditions reported was 3.8. The most common condition was hypertension (66%), followed by arthritis of the hip or knee (46%) and arthritis of the hand or wrist (39%).
PFADL Item Frequencies
Frequencies for the PFADL items are shown in Table 2, ordered from the most difficult item to the least difficult (to report no limitations or difficulty performing the activity) item. Thirty-two percent of respondents reported not being limited when climbing several flights of stairs while 94% reported no difficulty with eating. The 6 activities of daily living items are “easy” items for Medicare Advantage Organization members—that is, the majority report that they can do them without difficulty. The percentage of the sample not limited at all in several flights of stairs was 32% and in moderate activities was 40%.
Table 2:
Frequencies for the Physical Functioning Activities of Daily Living Items
| Activities of Daily Living | Not limited at all | Limited a little | Limited a lot |
|---|---|---|---|
| Climbing several flights of stairs | 88615 (32%) | 9702 (35%) | 88231 (32%) |
| Moderate activities | 111147 (40%) | 96519 (35%) | 69040 (25%) |
| No difficulty | Have difficulty | Unable to do | |
| Walking | 168686 (62%) | 92188 (34%) | 9144 (3%) |
| Getting in or out of chairs | 201882 (75%) | 64143 (24%) | 4959 (2%) |
| Bathing | 221099 (81%) | 39874 (15%) | 10348 (4%) |
| Dressing | 231088 (85%) | 33692 (12%) | 6488 (2%) |
| Using the toilet | 240899 (89%) | 24393 (9%) | 4080 (2%) |
| Eating | 254111 (94%) | 14819 (5%) | 2600 (1%) |
Note: Sample sizes for items vary due to missing data.
Table 3 compares the percentage of the sample able to do 6 of the PFADL items with two other datasets. The current sample (2020 HOS baseline cohort 23) reported similar but slightly more limitations than the 2010 Medicare Consumer Assessment of Healthcare Providers and Systems (CAHPS®) sample that also included those less than 65 years of age. The 2019 Medicare CAHPS sample reported slightly better physical functioning than the current sample but was limited to those 65 and older and fee-for-service respondents.
Table 3:
Percentage Able to Do 6 Physical Function Activities Without Difficulty in Three Datasets
| 2010 Medicare CAHPS* (n =366,701) | 2019 Medicare CAHPS† (n = 79,725) | 2020 Health Outcomes Survey (HOS) Baseline Cohort 23 | |
|---|---|---|---|
| Walking | 253024 (69%) | 58120 (73%) | 168686 (62%) |
| Getting in or out of chairs | 286027 (78%) | 64737 (81%) | 201882 (75%) |
| Bathing | 311696 (85%) | 69600 (87%) | 221099 (81%) |
| Dressing | 322697 (88%) | 71194 (89%) | 231088 (85%) |
| Using the toilet | 333698 (91%) | 73347 (92%) | 240899 (89%) |
| Eating | 344699 (94%) | 75420 (95%) | 254111 (94%) |
Consumer Assessment of Healthcare Providers and Systems (CAHPS®) Medicare managed care and fee-for-service respondents including those less than 65 years old.21
Medicare fee-for-service 65 years of age and older (unpublished).
Associations Among Items and PFADL Score Distribution
Product-moment correlations among the items ranged from r = 0.19 between the easiest (eating) and most difficult item (climbing several flights of stairs) to r = 0.73 between bathing and dressing. Item correlations with the sum of the other 7 items in the PFADL scale ranged from 0.44 (eating) to 0.72 (walking), indicating that there is substantial common variance among them. The principal component eigenvalues of the polychoric correlations support a single underlying factor (first two eigenvalues = 5.99 and 0.85). The average residual correlation was 0.05.
The coefficient alpha and omega for the 8-item scale were both 0.86, indicating that the 8-item scale is reliable enough for group-level comparisons. The mean was 12.73 (SD = 3.43, range 0–16) and 26% reported the highest possible (ceiling) score.
Item Response Theory Graded Response Model Parameters
Figure 1 shows the item characteristic curves. These curves support the monotonicity assumption—that is, each response option is most likely to be selected somewhere along the physical functioning continuum in the expected order. This supports scoring the items as three levels rather than collapsing categories.
Figure 1:

Item Characteristic Curves
B23vrstair= Climbing stairs
B23vrmact = Moderate activities
B23adlwlk = Walking
B23adlchr = Getting in/out of chairs
B23adlbth = Bathing
B23adldrs = Dressing
B23adldtlt = Using the toilet
B23adleat = Eating
Table 4 provides item means and graded response model parameter estimates. Item thresholds are consistent with the item frequencies and item means. The item thresholds indicate that the ordering from most difficult to easiest items is climbing stairs, moderate activities, walking, getting in and out of chairs, bathing, dressing, using the toilet, and eating. The thresholds indicate the level of physical function on a z-score metric needed to have a 50% chance of scoring below versus above the threshold. For example, a person with a physical function score of −0.56 (a little more than a half standard deviation below the mean) would have a 50% probability of selecting yes, limited a lot versus either yes, limited a little, or No, not limited at all on the question about climbing stairs. Item slopes (discrimination) ranged from 2.60 (climbing stairs and eating) to 4.76 (dressing), indicating that the dressing is the item most strongly associated with underlying physical functioning.
Table 4:
Physical Functioning Activities of Daily Living Graded Response Model Item Parameters
| Item | N (%) | Threshold | Slope |
|---|---|---|---|
| B23vrstair (mean = 2.00) Climbing stairs | 2.60 | ||
| Yes, limited a lot | 88231 (32) | ||
| −0.56* | |||
| Yes, limited a little | 97021 (35) | ||
| 0.56† | |||
| No, not limited at all | 88615 (32) | ||
| B23vrmact (mean = 2.15) Moderate activities | 2.75 | ||
| Yes, limited a lot | 69040 (25) | ||
| −0.81‡ | |||
| Yes, limited a little | 96519 (35) | ||
| 0.28§ | |||
| No, not limited at all | 111147 (40) | ||
| B23adlwlk (mean = 2.59) Walking | 3.49 | ||
| I am unable to do this activity | 9144 (3) | ||
| −1.97 | |||
| Yes, I have difficulty | 92188 (34) | ||
| −0.38 | |||
| No, I do not have difficulty | 168686 (63) | ||
| B23adlchr (mean = 2.73) Getting in/out of chairs | 3.58 | ||
| I am unable to do this activity | 4959 (2) | ||
| −2.25 | |||
| Yes, I have difficulty | 64143 (24) | ||
| −0.75 | |||
| No, I do not have difficulty | 201882 (74) | ||
| B23adlbth (mean = 2.78) Bathing | 4.22 | ||
| I am unable to do this activity | 10348 (4) | ||
| −1.84 | |||
| Yes, I have difficulty | 39874 (15) | ||
| −0.98 | |||
| No, I do not have difficulty | 221099 (81) | ||
| B23adldrs (mean = 2.83) Dressing | 4.76 | ||
| I am unable to do this activity | 6488 (2) | ||
| −2.01 | |||
| Yes, I have difficulty | 33692 (12) | ||
| −1.10 | |||
| No, I do not have difficulty | 231088 (85) | ||
| B23adldtlt (mean = 2.88) Using the toilet | 4.32 | ||
| I am unable to do this activity | 4080 (2) | ||
| −2.25 | |||
| Yes, I have difficulty | 24393 (9) | ||
| −1.32 | |||
| No, I do not have difficulty | 240899 (89) | ||
| B23adleat (mean = 2.93) Eating | 2.63 | ||
| I am unable to do this activity | 2600 (1) | ||
| −2.79 | |||
| Yes, I have difficulty | 14819 (5) | ||
| −1.78 | |||
| No, I do not have difficulty | 254111 (94) | ||
−1.82 in Patient-Reported Outcomes Measurement and Information System (PROMIS®) general population sample.
−0.09 in PROMIS® general population.
−2.20 in PROMIS® general population.
−0.49 in PROMIS® general population.
Figure 2 provides the test (scale) information curve. The scale information curve has two peaks and shows that the scale has at least 0.80 reliability between a little higher than −3 SDs below the mean up to a little bit above the mean.
Figure 2:

Physical Functioning Activities of Daily Living Scale Information Curve
Construct Validity
Each of the hypothesized associations was confirmed. The PFADL was negatively correlated with number of comorbid conditions (r = −0.56, p <.0001), disability days due to physical health (r = −0.55, p <.0001), having a problem with balance or walking in the past 12 months (r = −0.56, p <.0001), falling in the past 12 months (r = −0.32, p <.0001), obesity (r = −0.18, p <.0001), and PFADL was positively related to educational attainment (r = 0.18, p <.0001). Moreover, there were significant differences by age (F (2, 283,220) = 9539.34, p<.0001). Medicare respondents less than 65 years old (disabled) had significantly worse physical functioning than those 65 and older (effect sizes = 0.70 and 0.38 for differences with 65–74 and with 75 and older, respectively), and those 75 and older had significantly worse physical functioning than those 65–74 (effect size = 0.31); all pairwise comparisons significant at p <.0001 using Tukey-Kramer adjustment for multiple comparisons.
DISCUSSION
This study provides a comprehensive psychometric evaluation of the PFADL measure in a large sample of Medicare recipients in the United States. The results of this study indicate that the PFADL items satisfy the monotonicity assumption underlying the scale scoring of the response options (0 = I am unable to do this activity/Yes, limited a lot; 1= Yes, I have difficulty/Yes, limited a little; 2 = No, I do not have difficulty/No, not limited at all), which supports preserving three categories in scoring. The item characteristic curves indicate that collapsing response categories would discard useful information. The analyses provide support for the unidimensionality of the PFADL items, consistent with prior assessments of physical function measures.21–22
Ceiling effects for measures are minimized by matching item difficulty to the level of physical function of respondents. Twenty-six percent of the participants in the current sample scored at the ceiling (best possible physical function) on the PFADL scale. In contrast, 65% of a sample of CAHPS Medicare respondents scored at the ceiling of a physical function scale that was limited to the 6 BADL items in the PFADL.21 The lower ceiling effect found in this study was because the PFADL includes two RAND-36 items that assess higher levels of physical function than the 6 BADL items. Even smaller ceiling effects have been observed in other studies: 2% of inpatient geriatric rehabilitation patients had the best possible score on the PROMIS 24-item physical function scale.23
The reliability of the PFADL scale was 0.86, indicating sufficient reliability for group-level comparisons.24 But scale information curves showed that the scale provides the most reliable information for the HOS respondents who have average or lower levels of physical functioning. The greater reliability of measurement among those with limitations compared to those with higher levels of functioning is consistent with other self-report measures such as the 4-item PROMIS-29 physical function scale,25 the 6-item BADL scale administered to Medicare recipients,21 and a 15-item Medical Outcomes Study scale.26 Scale information might be enhanced for those with better physical functioning by including items that capture higher levels of function, such as items that assess doing chores like vacuuming or yard work, running a short distance, and engaging in 2 hours of physical labor that are included in the PROMIS 20-item short form.27
Study Limitations
This study has limitations. Direct comparisons of the results of this study to other measures of physical function are limited by differences in the wording of survey questions.28 In addition, mode effects are possible because the survey was administered using mail with phone follow-up. There was a small positive correlation of PFADL score with mail administration (r = 0.09, p <.0001), but this may reflect the tendency for those with poorer physical functioning to complete surveys by phone. Because of the 37% response rate, it is unknown how well the results represent Medicare recipients more generally. Nonetheless, the findings of the study provide useful information about the PFADL scale administered annually to large samples of adults in Medicare.
CONCLUSION
This study provides support for the reliability and validity of the PFADL for use in assessing Medicare enrollees but suggests potential value in adding items to assess more difficult activities of daily living. The findings can inform the selection of items to optimize future assessments of physical functioning.
Supplementary Material
Funding support:
Centers for Minority Aging Research Center for Health Improvement of Minority Elderly (RCMAR/CHIME) under NIH/NIA Grant P30-AG021684. The sponsor had no role in the design, methods, participant recruitment, data collection, analysis, or preparation of the paper.
Abbreviations
- BADL
Basic Activities of Daily Living
- CAHPS
Consumer Assessment of Healthcare Providers and Systems
- CMS
Centers for Medicare & Medicaid Services
- HOS
Medicare Health Outcomes Survey
- PFADL
Physical Functioning Activities of Daily Living
Footnotes
Conflict of Interest: The authors report no conflict of interest.
REFERENCES
- 1.Deardorff WJ, Jing B, Jeon SY, et al. Do functional status and Medicare claims data improve the predictive accuracy of an electronic health record mortality index? Findings from a national Veterans Affairs cohort. BMC Geriatr 2022; 22:434. 10.1186/s12877-022-03126-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Han PKJ, Lee M, Reeve BB et al. Development of a prognostic model for six-month mortality in older adults with declining health. Journal of Pain and Symptom Management 2012; 43: 527–539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Liotta G, Lorusso G, Madaro O, et al. Predictive validity of the short functional geriatric evaluation for mortality, hospitalization and institutionalization in older adults: A retrospective cohort survey. International Journal of Nursing Sciences 2023: 10: 38–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Wei MY, Luster JE, Ratz D, et al. Development, validation, and performance of a new physical functioning-weighted multimorbidity index for use in administrative data. J Gen Intern Med 2021;36(8):2427–2433. doi: 10.1007/s11606-020-06486-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Pedroso-Chaparro MDS, Cabrera I, Vara-Garcia C, et al. Physical limitations and loneliness: The role of guilt related to self-perception as a burden. J Am Geriatr Soc 2023;71:903–908. doi: 10.1111/jgs.18149. [DOI] [PubMed] [Google Scholar]
- 6.Tang J, Chen Y, Liu H, et al. Examining racial and ethnic differences in disability among older adults: A polysocial score approach. Maturitas 2023. doi: 10.1016/j.maturitas.2023.03.010. [DOI] [PubMed] [Google Scholar]
- 7.O’Neill D, Forman DE. The importance of physical function as a clinical outcome: Assessment and enhancement. Clin Cardiol 2020;43(2):108–117. doi: 10.1002/clc.23311 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Sin MK, Lee JA, Murphy PJM, et al. Upper extremity weakness: A novel risk factor for non-cardiovascular mortality among community-dwelling older adults. Arch Gerontol Geriatr 2023. doi: 10.1016/j.archger.2023.105021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Roshanravan B, Patel KV. Assessment of physical functioning in the clinical care of the patient with advanced kidney disease. Semin Dial 2019;32(4):351–360. doi: 10.1111/sdi.12813. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Shuman V, Brach JS, Bean JF, et al. Prevalence and predictors of ambulatory care physicians’ documentation of mobility limitations in older adults. Archives of Physical Medicine and Rehabilitation 2023; 104(5): 719–727. doi: 10.1016/j.apmr.2022.11.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hambleton RK, Swaminathan H. Item response theory principles and applications. Boston, MA: Kluwer-Nijhoff, 1985. [Google Scholar]
- 12.Anderson C, Olshan A, Bae-Jump V, et al. Falls, walking or balance problems, and limitations in activities of daily living (ADLs) among older endometrial cancer survivors. Support Care Cancer 2022;30(7):6339–6351. doi: 10.1007/s00520-022-07087-2. [DOI] [PubMed] [Google Scholar]
- 13.Blackwood J, Karczewski H, Huang MH, et al. Katz activities of daily living disability in older cancer survivors by age, stage, and cancer type. J Cancer Surviv 2020;14(6):769–778. doi: 10.1007/s11764-020-00891-x. [DOI] [PubMed] [Google Scholar]
- 14.Hays RD, Morales LS. The RAND-36 measure of health-related quality of life. Annals of Medicine 2001; 33: 350–357. [DOI] [PubMed] [Google Scholar]
- 15.Health Services Advisory Group. Medicare HOS Research Data Files. https://www.hosonline.org/en/data-dissemination/research-data-files/. February 23, 2023. Accessed April 24, 2023.
- 16.Centers for Disease Control and Prevention. Measuring Healthy Days. Atlanta, Georgia:CDC, November 2000. [Google Scholar]
- 17.Cronbach LJ. Coefficient alpha and the internal structure of tests. Psychometrika 1951;16:297–334. doi: 10.1007/BF02310555. [DOI] [Google Scholar]
- 18.McDonald RP Test theory: A unified treatment. Mahwah, NJ: Lawrence Erlbaum, 1999. [Google Scholar]
- 19.Samejima F The graded response model. In: van der Linden WJ, Hambleton R eds. Handbook of modern item response theory. New York, NY/USA: Springer, 1996, pp. 85–100. [Google Scholar]
- 20.SAS Institute Inc., SAS 9.4 Help and Documentation, Cary, NC: SAS Institute Inc., 2004. [Google Scholar]
- 21.Hays RD, Mallett JS, Gaillot S, et al. Performance of the Medicare Consumer Assessment of Healthcare Providers and Systems (CAHPS®) physical functioning items. Med Care,2016; 54: 205–209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hays RD, Morales LS, Reise SP. Item response theory and health outcomes measures in the 21st century. Med Care 2000;38:II-28 –II-42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Smit EB, Bouwstra H, Roorda LD, et al. A Patient-Reported Outcomes Measurement Information System short form for measuring physical function during geriatric rehabilitation: Test-retest reliability, construct validity, responsiveness, and interpretability. J Am Med Dir Assoc, 2021;22(8):1627–1632. doi: 10.1016/j.jamda.2021.01.079. [DOI] [PubMed] [Google Scholar]
- 24.Nunnally JC, Bernstein IH. Psychometric theory, 3rd edition. New York, NY: McGraw-Hill, 1994. [Google Scholar]
- 25.Hays RD, Spritzer KL, Reise SP. Using item response theory to identify responders to treatment: Examples with the Patient-Reported Outcomes Measurement Information System (PROMIS) physical functioning and emotional distress scales. Psychometrika 2021; 86(3): 781–792. doi: 10.1007/s11336-021-09774-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hays RD, Liu H, Spritzer K, et al. Item response theory analyses of physical functioning items in the Medical Outcomes Study. Med Car 2007;45: S32–38. [DOI] [PubMed] [Google Scholar]
- 27.Hays RD, Spritzer KL, Fries JF, et al. Responsiveness and minimally important difference for the Patient-Reported Outcomes Measurement and Information System (PROMIS) 20-Item physical functioning short-form in a prospective observational study of rheumatoid arthritis. Annals of the Rheumatic Diseases 2015;74(1):104–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Ankuda CK, Covinsky K, Freedman VA, et al. The devil’s in the details: Variation in estimates of late-life activity limitations across national cohort studies. J Am Soc 2023;71:858–868. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Tarazi W, Welch W, Nguyen N, et al. Medicare beneficiary enrollment trends and demographic characteristics. Issue Brief. Office of the Assistant Secretary for Planning and Evaluation 2022. https://aspe.hhs.gov/reports/medicare-enrollment [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
