Skip to main content
The Gerontologist logoLink to The Gerontologist
. 2017 Apr 11;58(4):e197–e204. doi: 10.1093/geront/gnx033

The Nursing Home Physical Performance Test: A Secondary Data Analysis of Women in Long-Term Care Using Item Response Theory

Subashan Perera 1,2,, David A Nace 1, Neil M Resnick 1, Susan L Greenspan 1
PMCID: PMC6044406  PMID: 28402474

Abstract

Background and Objectives

The Nursing Home Physical Performance Test (NHPPT) was developed to measure function among nursing home residents using sit-to-stand, scooping applesauce, face washing, dialing phone, putting on sweater, and ambulating tasks. Using item response theory, we explore its measurement characteristics at item level and opportunities for improvements.

Research Design and Methods

We used data from long-term care women. We fitted a graded response model, estimated parameters, and constructed probability and information curves. We identified items to be targeted toward lower and higher functioning persons to increase the range of abilities to which the instrument is applicable. We revised the scoring by making sit-to-stand and sweater items harder and dialing phone easier. We examined changes to concurrent validity with activities of daily living (ADL), frailty, and cognitive function.

Results

Participants were 86 years old, had more than three comorbidities, and a NHPPT of 19.4. All items had high discrimination and were targeted toward the lower middle range of performance continuum. After revision, sit-to-stand and sweater items demonstrated greater discrimination among the higher functioning and/or greater spread of thresholds for response categories. The overall test showed discrimination over a wider range of individuals. Concurrent validity correlation improved from 0.60 to 0.68 for instrumental ADL and explained variability (R2) from 22% to 36% for frailty.

Discussion and Implications

NHPPT has good measurement characteristics at the item level. NHPPT can be improved, implemented in computerized adaptive testing, and combined with self-report for greater utility, but a definitive study is needed.

Keywords: Function/mobility, Long-term care, Measurement, Psychometrics


Physical performance measures in community-dwelling older adults are well-established markers of function and predict many untoward future outcomes such as mortality, disability, hospitalization, functional and quality of life decline, and nursing home placement (Perera et al., 2016; Studenski et al., 2003, 2011; Woo, Ho, & Yu, 1999). Moreover, improvements and declines in physical performance, even if transient, have been shown to be significantly associated with mortality (Hardy, Perera, Roumani, Chandler, & Studenski, 2007; Perera, Studenski, Chandler, & Guralnik, 2005). Criteria for meaningful change in physical performance measures have been developed to facilitate interpretation of such measures in clinical and research settings (Alley et al., 2011; Brach et al., 2010; Kwon et al., 2009; Oh-Park, Perera, & Verghese, 2012; Perera, Mody, Woodman, & Studenski, 2006; Perera et al., 2014). Physical performance measures are increasingly being used as outcomes in randomized trials of the community-dwelling older adults (Brach, VanSwearingen, Perera, Wert, & Studenski, 2013; Pahor et al., 2014; VanSwearingen et al., 2009).

In contrast, the evidence base for physical performance assessment in frail long-term care older adults is less developed due to many challenges. For example, assessments based on walking or climbing stairs that are frequently used in community-dwelling older adults may be too difficult and unsafe for long-term care residents. Nursing Home Physical Performance Test (NHPPT) is one measure of physical performance developed specifically for long-term care residents and seeks to address some of the challenges (Binder, Miller, & Ball, 2001). On an ordinal scale of 0–4 (higher score better), it assesses level of assistance and prompting required as well as time required (quantitative scoring) to complete sit-to-stand, scooping applesauce, face washing, dialing a telephone, putting on and taking off a sweater, and ambulating 6 meters (Supplemental Appendix Table 1). The tasks are more relevant for the long-term care older adults, and the ambulation task is allowed to be completed with a wheelchair if unable to walk. NHPPT has been used as a secondary outcome in a randomized trial of nursing home and assisted living residents examining effects of activities and exercise on sleep (Lorenz et al., 2012; Richards et al., 2011). More recently, NHPPT was included as a secondary outcome in a trial of zoledronic acid to improve bone mineral density (BMD) in a similar population (Greenspan, Perera, Ferchak, Nace, & Resnick, 2015).

The NHPPT was developed, and its psychometric properties such as reliability, validity, and internal consistency have been evaluated using classical methods (Binder et al., 2001). NHPPT has excellent interrater reliability (0.73–0.93); consists of underlying factors of gross motor function as well as fine motor coordination and task sequencing; high internal consistency; has concurrent validity with respect to other physical and cognitive function scales such as that in the Minimum Data Set; Multidimensional Observational Scale for Elderly Subjects; Katz instrument; and the Short Blessed Test of orientation, concentration, and memory and is more responsive to change over time than the other considered scales (Blessed, Tomlinson, & Roth, 1968; Helmes, Csapo, & Short, 1987; Katz, Ford, Moskowitz, Jackson, & Jaffe, 1963; Morris, Fries, & Morris, 1999; Morris, Nonemaker, & Murphy, 1999). However, psychometric properties of NHPPT have not yet been evaluated using techniques of modern test theory, such as item response theory (De Ayala, 2009). Item response theory, a paradigm for scoring tests, was initially developed in the field of educational testing to model the probability of a correct response to a test question as a function of the person’s ability and the question’s difficulty, both measured on the same latent trait scale that the test is designed to assess. Therefore, it is particularly suitable for item-level analyses of instruments. In fact, reviews of other functional performance measures in the residential elderly have suggested increased use of item response theory in measurement strategies in long-term care (Fox, Henwood, Neville, & Keogh, 2014). Item response theory has been widely applied to scale development in other elderly disease conditions such as stroke (Duncan, Bode, Lai, & Perera, 2003; Duncan, Lai, Bode, Perera, & DeRosa, 2003; Lai, Perera, Duncan, & Bode, 2003). To address this gap in knowledge, we perform a secondary analysis of data from frail long-term care women using item response theory. We examine psychometric characteristics of the NHPPT from a modern test theoretic perspective, explore potential changes to scoring, and implications of such changes on concurrent validity with respect to other available measures of function and frailty.

Research Design and Methods

Setting and Participants

In the present secondary analysis, we used cross-sectional data from women that were screened for participating in a randomized trial of zoledronic acid to improve BMD. The methods of the parent study have been previously described in detail elsewhere (Greenspan et al., 2012, 2015). Briefly, participants were recruited to the parent trial from nursing homes and assisted living facilities. Inclusion criteria included not receiving a bisphosphonate, history of vertebral or hip fracture, a BMD lower than 2003 National Osteoporosis Guidelines for treatment, and a BMD lower than 2 SD from a healthy 30-year-old adult (National Osteoporosis Foundation, 2003). Those with cognitive and functional impairment, immobility, multimorbidity, and polypharmacy were not excluded. Only those with a life expectancy of <2 years and estimated glomerular filtration ratio of <30 mL/min were excluded. We included 205 participants with NHPPT data in the present analysis.

Measures

We considered the baseline measurements of quantitatively assessed NHPPT (Binder et al., 2001), Katz Activities of Daily Living Scale (ADL; Katz et al., 1963), Instrumental Activities of Daily Living (IADL; Lawton & Brody, 1969), Short Portable Mental Status Questionnaire (SPMSQ; Pfeiffer, 1975), Comorbidity Index (Rigler, Studenski, Wallace, Reker, & Duncan, 2002), Patient Health Questionnaire for depression (PHQ-9; Kroenke & Spitzer, 2002), and in a subset of 176 participants, the modified Fried Frailty Index (Fried et al., 2001) categorized as robust, prefrail, and frail.

Statistical Analysis

We used appropriate descriptive statistics to summarize the sample characteristics. First, we performed an item response analysis of the NHPPT data using a graded response model, suitable for item analysis when responses are ordinal (De Ayala, 2009; Samejima, 1969). The basic model developed for dichotomous (correct/incorrect) responses (i.e., the one-parameter Rasch model) stipulates the probability of a correct response to a question depends on both a person’s ability and the question’s difficulty, measured on the same underlying latent trait the test is designed to measure (Rasch, 1980). It has been extended to the graded response model for use with ordinal responses, where additional parameters have been added to accommodate different levels of a “correct” response (thresholds), and the level of discrimination afforded by an item. For the NHPPT scale where each item has an ordinal score from 0 to 4, we used the graded response model given by Pijk*=eλj(ηiαjk)/(1+eλj(ηiαjk)),k = 0,…4, where Pijk* is the probability of participant i responding at level k or greater for item j; αjk are the threshold parameters for item j; λj is the slope or discrimination parameter for item j; ηi is the normally distributed latent trait of the ith participant at NHPPT attempts to measure. Item category probability Pijk, the probability of participant i responding at level k item j, can then be estimated by taking differences Pijk*Pij(k+1)* for adjacent categories, where Pij0*=0 and Pij(k+1)*=1. Parameters were estimated; item category probability, item information, and test information curves were plotted against the estimated latent trait; and a histogram of estimated latent trait was made. Second, depending on the observations from the parameter estimates and item curves, and substantive judgment, the published thresholds for quantitative scoring were tweaked with the goals of targeting some items toward frailer (making easier) and some toward higher functioning (making more difficult) and increasing item information for participants with a wider range of function. Third, we fitted the graded response model to the revised NHPPT. Finally, we examined concurrent validity of both original and revised scoring of NHPPT with respect to frailty and other measures of function and mood using correlation and regression analyses. SAS® IRT procedure was used for the main analysis (SAS Institute, Cary, NC).

Results

Participant Characteristics

Participants had a mean age of 86 years, and the majority had had a high-school education (Table 1). Overall, participants had high levels of functional impairment. Mean NHPPT score was 19.4, and on average, participants had more than three comorbidities. Mean ADL and IADL scores were 11.7 and 8.0, respectively.

Table 1.

Participant Characteristics

Characteristic Mean ± SD or N (%)
Age (years) 85.8 ± 5.2
Education
 Elementary school 7 (3.4%)
 High-school equivalent 106 (51.7%)
 College 51 (24.9%)
 Postgraduate 25 (12.2%)
 Other 14 (6.8%)
 Unknown 2 (1.0%)
ADL score (range 0–14; higher is better) 11.7 ± 2.3
IADL score (range 0–14; higher is better) 8.0 ± 3.8
SPMSQ errors (range 0–10; higher is worse) 1.9 ± 2.4
PHQ-9 (range 0–27; higher is worse) 3.8 ± 4.2
Comorbidity index (range 0–8; higher is worse) 3.5 ± 1.4
Frailty
 Robust 11 (5.4%)
 Prefrail 52 (25.4%)
 Frail 113 (55.1%)
 Unknown 29 (14.2%)
NHPPT (range 0–24; higher is better) 19.4 ± 4.5

Note: ADL = Activities of Daily Living; IADL = Instrumental Activities of Daily Living; NHPPT = Nursing Home Physical Performance Test; PHQ-9 = Patient Health Questionnaire; SD = standard deviation; SPMSQ = Short Portable Mental Status Questionnaire.

Item Characteristics of the NHPPT

Table 2 shows that the slope parameters are generally high for all items, ranging from 1.34 for Dial a Telephone to 2.21 for 6-meter walk/wheel, indicating that probability of endorsing an item changes rapidly with a person’s ability thus affording a greater level of discrimination. In addition, threshold parameters for all items appear to have a generally wide range from high negative values to either low negative values or somewhat positive values, indicating that items can discriminate among those with a wider range of person abilities toward the frail end of the spectrum of physical performance. The narrowest of the range was for Put on/Take off Sweater, but still ranged from −1.67 to −1.07. Item category probabilities for each item are plotted against person ability in Figure 1. Different categories (0/1/2/3/4) are largely endorsed with substantial probability by persons with some range of abilities, as exemplified in the Walk/Wheel item. But for some items such as Sit-to-Stand, Dial a Telephone, and Put on/Take off Sweater, no level of ability elicited a substantial probability of performing at the Category 1 level. The extent of information provided by each item, to be interpreted on a comparative basis rather than absolute numbers, is in Figure 2a. Most items provide greatest information for discriminating among lower end of functional performance (−4 to 0 range), whereas Sit-to-Stand and Walk/Wheel in the moderate −2 to +2 range.

Table 2.

Graded Response Model Parameter Estimates ± Standard Error

Item Original scoring Revised scoring
Slope/discrimination parameter (λj) Threshold parameters (αj*) Slope/discrimination parameter (λj) Threshold parameters (αj*)
1. Sit-to-stand 2.20 ± 0.38 0: −2.09 ± 0.27 1.75 ± 0.27 0: −0.79 ± 0.15
1: −2.01 ± 0.26 1: −0.31 ± 0.12
2: −1.40 ± 0.18 2: 0.39 ± 0.13
3: 0.35 ± 0.11 3: 2.19 ± 0.28
2. Scooping applesauce 1.73 ± 0.32 0: −2.97 ± 0.44 1.82 ± 0.32 0: −2.87 ± 0.41
1: −2.46 ± 0.35 1: −2.41 ± 0.33
2: −1.98 ± 0.28 2: −1.96 ± 0.26
3: −0.74 ± 0.14 3: −0.73 ± 0.14
3. Face washing 1.37 ± 0.28 0: −3.81 ± 0.70 1.33 ± 0.26 0: −3.87 ± 0.71
1: −2.92 ± 0.50 1: −2.97 ± 0.51
2: −1.71 ± 0.29 2: −1.73 ± 0.30
3: −1.01 ± 0.20 3: −1.02 ± 0.20
4. Dial a telephone 1.34 ± 0.26 0: −2.38 ± 0.40 1.76 ± 0.38 0: −2.03 ± 0.32
1: −2.18 ± 0.37 1: −1.87 ± 0.29
2: −1.31 ± 0.23 2: −1.55 ± 0.24
3: −0.76 ± 0.17 3: −1.37 ± 0.21
5. Put on/take off sweater 2.19 ± 0.47 0: −1.67 ± 0.24 2.96 ± 0.57 0: −0.26 ± 0.10
1: −1.63 ± 0.23 1: 0.09 ± 0.10
2: −1.42 ± 0.20 2: 0.75 ± 0.12
3: −1.07 ± 0.16 3: 1.68 ± 0.19
6. 6-meter walk/wheel 2.21 ± 0.39 0: −1.79 ± 0.23 1.87 ± 0.28 0: −1.92 ± 0.25
1: −1.09 ± 0.16 1: −1.18 ± 0.17
2: −0.07 ± 0.11 2: −0.09 ± 0.12
3: 0.77 ± 0.13 3: 0.82 ± 0.14

Figure 1.

Figure 1.

Item category probability versus physical performance trait curves.

Figure 2.

Figure 2.

Item information curves for original and rescored Nursing Home Physical Performance Test versus physical performance trait.

Opportunities for Revising NHPPT Scoring

Scoop Applesauce and Washing Face items are most relevant and geared toward those at low end of the performance continuum, whereas Walk/Wheel is most applicable to those in the middle ranges. Figure 2a shows these items already have somewhat wide information curves. Therefore, these items probably do not represent much opportunity for improvement. In a classroom test, it is desirable to have items that have a wide range of difficulty. Similarly, a good instrument should have items with a wide range in terms of difficulty/ease to endorse. Sit-to-Stand and Put on/Take off Sweater items appear candidates for revising their scoring so that they are harder to endorse, and Dial a Telephone easier to endorse. For example, we changed the quantitative scoring so that Sit-to-Stand task should be completed within 1 s rather than 2 s to receive a score of 4, and Dial a Telephone could be completed within 20 s rather than 13 s to receive a score of 4 (Supplemental Appendix Table 1). After making the said changes to item difficulties by changing quantitative scoring, the threshold parameters are more spread out (Table 2). Figure 3b shows that Sit-to-Stand and Put on/Take off Sweater items now provide a greater level of information for discriminating among more high functioning individuals of the performance continuum in the −1 to +3 range. Dial a Telephone item still targets the same range of individuals but provides much greater information. As a complete test, NHPPT with revised scoring provides a greater level of information for individuals with a wider range of abilities. For example, if we arbitrarily consider an amount of information greater than 3, the range of corresponding abilities have widened from approximately (−3 to +1) to (−3 to +2), substantially extending the utility of the test among the higher functioning individuals. Finally, histograms of participants’ estimated latent traits show that some of the ceiling effects present in the original scoring have been alleviated by revised scoring, resulting in a desirable symmetric shape consistent with the normality assumption of participant abilities ηi in the model (Figure 4).

Figure 3.

Figure 3.

Test information curve for original and rescored Nursing Home Physical Performance Test versus physical performance trait.

Figure 4.

Figure 4.

Histogram of estimated latent traits from the original and revised Nursing Home Physical Performance Test.

Concurrent Validity

With the revision of scoring, the correlation of NHPPT with ADL score increased from 0.67 to 0.68, IADL score from 0.60 to 0.68, SPMSQ from 0.34 to 0.39, and comorbidity index from −0.22 to −0.25. Frailty category explained 46% of the variability in the NHPPT and 58% with the revised scoring. Large changes were seen in explained variability by individual IADL items getting to places outside walking distances (32%–41%) and taking own medications (21%–29%).

Discussion and Implications

Using methods of item response theory, we have shown that the NHPPT largely has good psychometric properties at the item level. Items have good discriminatory properties indicated by the slope parameters and are targeted mostly toward the lower end of the physical performance spectrum. NHPPT has concurrent validity with respect to other measures of physical and cognitive function, and frailty. Opportunities were identified for further improvement of the NHPPT that could be accomplished by rescoring. The score revision is shown to improve performance of the instrument at both item and overall test level and also increases the concurrent validity with respect to other measures considered.

Objectively measured and scored performance-based measures such as NHPPT are especially appealing compared with self-reports of function particularly in nursing home residents due to high prevalence of cognitive impairment, reporting bias, and recall bias and that residents simply do not engage in many of the everyday activities on which most self-report questionnaires of function are based. Item analysis of NHPPT is an important next step for several reasons. First, having actually used NHPPT in a randomized trial, we have encountered some practical challenges with some items in the targeted frail population. For example, Putting on/Taking off Sweater caused us to think about whether to use a research set of sweaters in standard sizes. Participants may be uncomfortable using sweaters worn by others but using a resident’s own sweater would introduce substantial variability. Furthermore, sweaters may be uncomfortable during warmer months of the year. Moreover, scooping applesauce can be messy and challenging for some residents depending on their eating habits and cognitive function. If a challenging item also turns out to have poor psychometric characteristics from an item analysis, perhaps it can be revised or skipped. Second, item response theory naturally lends itself to computerized adaptive testing (McDonough et al., 2012) where only a subset of items specially targeted for an individual’s ability is administered. The score derived from the shorter test generally has similar properties to the full test. Shorter and targeted physical performance tests may be especially advantageous in frail populations such as ours where fatigue and safety are important considerations. With the technological advancement new ways of functional assessment are now possible (Lowe et al., 2013), and computerized adaptive testing may play an important role in the said evolution. Third, self-report questionnaires and performance-based measures often provide complementary information about the functional status of the elderly (Perera et al., 2005). Combination of performance measure based items from NHPPT and items from other established self-report questionnaires can be examined using the methods of item response theory to potentially develop new scales that can reap advantages of both assessment paradigms.

Our study has several important strengths. First, to our knowledge, it is the first analysis of NHPPT using item response theory. In fact, application of item response theory in assessment of older long-term care residents thus far has been sparse. We are aware of only a few studies involving assessment of pain and Barthel Index (Edelen & Saliba, 2010; Liu, Unick, Galik, & Resnick, 2015; van Nispen tot Pannerden et al., 2009). Second, it is also the largest study thus far to use NHPPT. The original development and validation work was based on only 95 participants, and a sleep and exercise trial that used NHPPT as an outcome had only 119 participants (Binder et al., 2001; Lorenz et al., 2012; Richards et al., 2011). Apart from gender, our study participants appear to be fairly comparable to those used for original scale development. Third, we were able to revise scoring in an informed manner based on guidance provided by the item analysis. We feel that it is especially appealing that high-end discrimination was improved merely by scoring rather than addition of more challenging tasks such as climbing steps, as the latter approach has the disadvantages of safety concerns and making the instrument longer than necessary. Finally, we examined test information and concurrent validity and showed substantial improvements with the revised scoring.

Our study has several important limitations as well. First, our parent study only included women. How well the results will generalize to the general long-term care population, including men, is not known and difficult to speculate. Studies in community-dwelling older adults show that men perform higher than women on physical performance measures such as gait speed and have lower rates of disability than women (Perera et al., 2016). However, such findings are also influenced by greater longevity in men and the resulting survival bias (Studenski et al., 2011). Second, an even larger sample size is needed for a more comprehensive and definitive item analysis of the NHPPT. Third, we had found that only a very small number of participants from the entire continuum was performing at level that would yield Category 1 for Sit-to-Stand, Dial a Telephone, and Put on/Take off Sweater items. The category can likely be combined with either Category 0 or Category 2 without much loss of information. However, we did not revise this aspect of scoring in this initial investigation with a limited sample because it would have changed the range of scores for NHPPT.

We conclude that NHPPT has good psychometric properties from an item response theoretic perspective, which complements the insights obtained through classical test theory. Through our item analysis, we have identified some opportunities for improvement of NHPPT scoring and demonstrated such revisions can yield improved test information characteristics and concurrent validity. Further study with a larger more representative sample applying item response theory is needed to make more definitive recommendations for improving NHPPT and possibly adapting it for computerized adaptive testing and/or combining with self-report items.

Supplementary Material

Supplementary data is available at The Gerontologist online.

Funding

This work was supported by National Institutes of Health (R01 AG028068, K24 DK062895, and P30 AG024827 to S. L. Greenspan).

Conflict of Interest

S. Perera and S. L. Greenspan receive research funding through a grant from Eli Lilly & Co. to the University of Pittsburgh. S. L. Greenspan receives research funding from Amgen to the University of Pittsburgh and is on the advisory board of Merck. D. A. Nace received past investigator initiated grant funding from Sanofi Pasteur to the University of Pittsburgh for influenza research.

Supplementary Material

Supplemental Appendix Table 1

References

  1. Alley D. E. Hicks G. E. Shardell M. Hawkes W. Miller R. Craik R. L.,…Magaziner J (2011). Meaningful improvement in gait speed in hip fracture recovery. Journal of the American Geriatrics Society, 59, 1650–1657. doi:10.1111/j.1532-5415.2011.03560.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Binder E. F. Miller J. P., & Ball L. J (2001). Development of a test of physical performance for the nursing home setting. The Gerontologist, 41, 671–679. doi:10.1093/geront/41.5.671 [DOI] [PubMed] [Google Scholar]
  3. Blessed G. Tomlinson B. E., & Roth M (1968). The association between quantitative measures of dementia and of senile change in the cerebral grey matter of elderly subjects. The British Journal of Psychiatry: The Journal of Mental Science, 114, 797–811. doi:10.1192/bjp.114.512.797 [DOI] [PubMed] [Google Scholar]
  4. Brach J. S. Perera S. Studenski S. Katz M. Hall C., & Verghese J (2010). Meaningful change in measures of gait variability in older adults. Gait & Posture, 31, 175–179. doi:10.1016/j.gaitpost.2009.10.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Brach J. S. VanSwearingen J. M. Perera S. Wert D. M., & Studenski S (2013). Motor learning versus standard walking exercise in older adults with subclinical gait dysfunction: A randomized clinical trial. Journal of the American Geriatrics Society, 61, 1879–1886. doi:10.1111/jgs.12506 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. De Ayala R. J. (2009). The theory and practice of item response theory. New York: Guilford Press. [Google Scholar]
  7. Duncan P. W. Bode R. K. Lai S. M., & Perera S; Glycine Antagonist in Neuroprotection Americans Investigators (2003). Rasch analysis of a new stroke-specific outcome scale: The Stroke Impact Scale. Archives of Physical Medicine and Rehabilitation, 84, 950–963. doi:10.1016/S0003-9993(03)00035-2 [DOI] [PubMed] [Google Scholar]
  8. Duncan P. W. Lai S. M. Bode R. K. Perera S., & DeRosa J (2003). Stroke Impact Scale-16: A brief assessment of physical function. Neurology, 60, 291–296. doi:10.1212/01.WNL.0000041493.65665.D6 [DOI] [PubMed] [Google Scholar]
  9. Edelen M. O., & Saliba D (2010). Correspondence of verbal descriptor and numeric rating scales for pain intensity: An item response theory calibration. The Journals of Gerontology, Series A: Biological Sciences and Medical Sciences, 65, 778–785. doi:10.1093/gerona/glp215 [DOI] [PubMed] [Google Scholar]
  10. Fox B. Henwood T. Neville C., & Keogh J (2014). Relative and absolute reliability of functional performance measures for adults with dementia living in residential aged care. International Psychogeriatrics, 26, 1659–1667. doi:10.1017/S1041610214001124 [DOI] [PubMed] [Google Scholar]
  11. Fried L. P. Tangen C. M. Walston J. Newman A. B. Hirsch C. Gottdiener J.,…McBurnie M. A; Cardiovascular Health Study Collaborative Research Group (2001). Frailty in older adults: Evidence for a phenotype. The Journals of Gerontology, Series A: Biological Sciences and Medical Sciences, 56, M146–M156. doi:10.1093/gerona/56.3.M146 [DOI] [PubMed] [Google Scholar]
  12. Greenspan S. Nace D. Perera S. Ferchak M. Fiorito G. Medich D.,…Resnick N (2012). Lessons learned from an osteoporosis clinical trial in frail long-term care residents. Clinical Trials (London, England), 9, 247–256. doi:10.1177/1740774511430516 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Greenspan S. L. Perera S. Ferchak M. A. Nace D. A., & Resnick N. M (2015). Efficacy and safety of single-dose zoledronic acid for osteoporosis in frail elderly women: A randomized clinical trial. JAMA Internal Medicine, 175, 913–921. doi:10.1001/jamainternmed.2015.0747 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Hardy S. E. Perera S. Roumani Y. F. Chandler J. M., & Studenski S. A (2007). Improvement in usual gait speed predicts better survival in older adults. Journal of the American Geriatrics Society, 55, 1727–1734. doi:10.1111/j.1532-5415.2007.01413.x [DOI] [PubMed] [Google Scholar]
  15. Helmes E. Csapo K. G., & Short J. A (1987). Standardization and validation of the Multidimensional Observation Scale for Elderly Subjects (MOSES). Journal of Gerontology, 42, 395–405. doi:10.1093/geronj/42.4.395 [DOI] [PubMed] [Google Scholar]
  16. Katz S. Ford A. B. Moskowitz R. W. Jackson B. A., & Jaffe M. W (1963). Studies of illness in the aged. The index of ADL: A standardized measure of biological and psychosocial function. JAMA, 185, 914–919. doi:10.1001/jama.1963. 03060120024016 [DOI] [PubMed] [Google Scholar]
  17. Kroenke K., & Spitzer R. L (2002). The PHQ-9: A new depression and diagnostic severity measure. Psychiatric Annals, 32, 509–521. doi:10.3928/0048-5713-20020901-06 [Google Scholar]
  18. Kwon S. Perera S. Pahor M. Katula J. A. King A. C. Groessl E. J., & Studenski S. A (2009). What is a meaningful change in physical performance? Findings from a clinical trial in older adults (the LIFE-P study). The Journal of Nutrition, Health & Aging, 13, 538–544. doi:10.1007/s12603-009-0104-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Lai S. M. Perera S. Duncan P. W., & Bode R (2003). Physical and social functioning after stroke: Comparison of the Stroke Impact Scale and Short Form-36. Stroke, 34, 488–493. doi:10.1161/01.STR.0000054162.94998.C0 [DOI] [PubMed] [Google Scholar]
  20. Lawton M. P., & Brody E. M (1969). Assessment of older people: Self-maintaining and instrumental activities of daily living. The Gerontologist, 9, 179–186. doi:10.1093/geront/9.3_Part_1.179 [PubMed] [Google Scholar]
  21. Liu W. Unick J. Galik E., & Resnick B (2015). Barthel Index of activities of daily living: Item response theory analysis of ratings for long-term care residents. Nursing Research, 64, 88–99. doi:10.1097/NNR.0000000000000072 [DOI] [PubMed] [Google Scholar]
  22. Lorenz R. A. Gooneratne N. Cole C. S. Kleban M. H. Kalra G. K., & Richards K. C (2012). Exercise and social activity improve everyday function in long-term care residents. The American Journal of Geriatric Psychiatry, 20, 468–476. doi:10.1097/JGP.0b013e318246b807 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Lowe S. A. Rodríguez-Molinero A. Glynn L. Breen P. P. Baker P. M. Sanford J.,…Ólaighin G (2013). New technology-based functional assessment tools should avoid the weaknesses and proliferation of manual functional assessments. Journal of Clinical Epidemiology, 66, 619–632. doi:10.1016/j.jclinepi.2012.12.003 [DOI] [PubMed] [Google Scholar]
  24. Oh-Park M. Perera S., & Verghese J (2012). Clinically meaningful change in stair negotiation performance in older adults. Gait & Posture, 36, 532–536. doi:10.1016/j.gaitpost.2012.05.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. McDonough C. M. Tian F. Ni P. Kopits I. M. Moed R. Pardasaney P. K., & Jette A. M (2012). Development of the computer-adaptive version of the Late-Life Function and Disability Instrument. The Journals of Gerontology, Series A: Biological Sciences and Medical Sciences, 67, 1427–1438. doi:10.1093/gerona/gls108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Morris J. N. Fries B. E., & Morris S. A (1999). Scaling ADLs within the MDS. The Journals of Gerontology, Series A: Biological Sciences and Medical Sciences, 54, M546–M553. doi:10.1093/gerona/54.11.M546 [DOI] [PubMed] [Google Scholar]
  27. Morris J. N. Nonemaker S., & Murphy K (1999). Long Term Care Facility Resident Assessment Instrument (RAI) Users Manual. Washington, DC: Health Care Financing Administration. [Google Scholar]
  28. National Osteoporosis Foundation (2003). Physician’s guide to prevention and treatment of osteoporosis. Belle Mead, NJ: Excerpta Medica. doi:10.1007/s00198-014-2794-2 [Google Scholar]
  29. van Nispen tot Pannerden S. C. Candel M. J. Zwakhalen S. M. Hamers J. P. Curfs L. M., & Berger M. P (2009). An item response theory-based assessment of the pain assessment checklist for Seniors with Limited Ability to Communicate (PACSLAC). The Journal of Pain, 10, 844–853. doi:10.1016/j.jpain.2009.02.007 [DOI] [PubMed] [Google Scholar]
  30. Pahor M. Guralnik J. M. Ambrosius W. T. Blair S. Bonds D. E. Church T. S., . . LIFE Study Investigators. (2014). Effect of structured physical activity on prevention of major mobility disability in older adults: The LIFE study randomized clinical trial. Journal of the American Medical Association, 311, 2387–2396. doi:10.1001/jama.2014.5616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Perera S. Mody S. H. Woodman R. C., & Studenski S. A (2006). Meaningful change and responsiveness in common physical performance measures in older adults. Journal of the American Geriatrics Society, 54, 743–749. doi:10.1111/j.1532-5415.2006.00701.x [DOI] [PubMed] [Google Scholar]
  32. Perera S. Patel K. V. Rosano C. Rubin S. M. Satterfield S. Harris T.,…Studenski S. A (2016). Gait speed predicts incident disability: A pooled analysis. The Journals of Gerontology, Series A: Biological Sciences and Medical Sciences, 71, 63–71. doi:10.1093/gerona/glv126 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Perera S. Studenski S. Chandler J. M., & Guralnik J (2005). Magnitude and patterns of decline over one year in health and function affect survival over five years. The Journals of Gerontology, Series A: Biological Sciences and Medical Sciences, 60, M894–M900. doi:10.1093/gerona/60.7.894 [DOI] [PubMed] [Google Scholar]
  34. Perera S. Studenski S. Newman A. Simonsick E. Harris T. Schwartz A.,…Visser M; Health ABC Study (2014). Are estimates of meaningful decline in mobility performance consistent among clinically important subgroups? (Health ABC study). The Journals of Gerontology, Series A: Biological Sciences and Medical Sciences, 69, 1260–1268. doi:10.1093/gerona/glu033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Pfeiffer E. (1975). A Short Portable Mental Status Questionnaire for the assessment of organic brain deficit in elderly patients. Journal of the American Geriatrics Society, 23, 433–441. doi:10.1111/j.1532-5415.1975.tb00927.x [DOI] [PubMed] [Google Scholar]
  36. Rasch G. (1980). Probabilistic models for some intelligence and attainment tests. Chicago: The University of Chicago Press. [Google Scholar]
  37. Richards K. C. Lambert C. Beck C. K. Bliwise D. L. Evans W. J. Kalra G. K.,…Sullivan D. H (2011). Strength training, walking, and social activity improve sleep in nursing home and assisted living residents: Randomized controlled trial. Journal of the American Geriatrics Society, 59, 214–223. doi:10.1111/j.1532-5415.2010.03246.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Rigler S. K. Studenski S. Wallace D. Reker D. M., & Duncan P. W (2002). Co-morbidity adjustment for functional outcomes in community-dwelling older adults. Clinical Rehabilitation, 16, 420–428. doi:10.1191/0269215502cr515oa [DOI] [PubMed] [Google Scholar]
  39. Samejima F. (1969). Estimation of latent ability using a response pattern of graded scores (Psychometric Monograph No. 17). Richmond, VA: Psychometric Society. [Google Scholar]
  40. Studenski S. A. Perera S. Patel K. Rosano C. Faulkner K. Inzitari M.,…Guralnik J. M (2011). Gait speed and survival in older adults: Potential value in clinical care and research. Journal of the American Medical Association, 305, 50–58. doi:10.1001/jama.2010.1923 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Studenski S. A. Perera S. Wallace D. Chandler J. M. Duncan P. W. Rooney E.,…Guralnik J. M (2003). Physical performance measures in the clinical setting. Journal of the American Geriatrics Society, 51, 314–322. doi:10.1046/j.1532-5415.2003.51104.x [DOI] [PubMed] [Google Scholar]
  42. VanSwearingen J. M. Perera S. Brach J. S. Cham R. Rosano C., & Studenski S. A (2009). A randomized trial of two forms of therapeutic activity to improve walking: Effect on the energy cost of walking. The Journals of Gerontology, Series A: Biological Sciences and Medical Sciences, 64, 1190–1198. doi:10.1093/gerona/glp098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Woo J. Ho S. C., & Yu A. L (1999). Walking speed and stride length predicts 36 months dependency, mortality, and institutionalization in Chinese aged 70 and older. Journal of the American Geriatrics Society, 47, 1257–1260. doi:10.1111/j.1532-5415.1999.tb05209.x [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Appendix Table 1

Articles from The Gerontologist are provided here courtesy of Oxford University Press

RESOURCES