Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2009 Sep 2.
Published in final edited form as: JAMA. 2008 Aug 20;300(7):823–830. doi: 10.1001/jama.300.7.823

Within-person across-neuropsychological test variability and incident dementia

Roee Holtzer 1,2, Joe Verghese 1, Cuiling Wang 3, Charlie Hall 3, Richard B Lipton 1,3,4
PMCID: PMC2736784  NIHMSID: NIHMS130683  PMID: 18714062

Abstract

Context

Neuropsychological tests are used to predict and diagnose dementia. However, no studies to date examined whether within-person across-neuropsychological test variability predicts dementia.

Objective

To examine whether within-person across-neuropsychological test variability predicted future dementia

Design

The Einstein Aging Study (EAS) is a population-based longitudinal study of aging and dementia located in the Bronx county, NY. Cox proportional hazards models using age as the time scale estimated hazard ratios for performance on individual neuropsychological tests (Free and Cued Selective Reminding Test, Digit Symbol, Vocabulary) and for within-person across-neuropsychological test variability as predictors of incident dementia. Analyses were stratified by gender, and controlled for education, and medical illness.

Setting and participants

A total of 1797 participants (age ≥70 years) enrolled in the EAS between October, 1993 and December, 2007. Participants seen for the baseline visit only (n=750), prevalent dementia cases (n=72) and those with missing information (n=78) were excluded. A total of 897 individuals were included in this investigation. Participants had follow-up visits every 12 to 18 months.

Main Outcome Measure(s)

Diagnosis of dementia

Results

Sixty one incident dementia cases were identified during follow-up (M=3.3±2.4 years). Within-person across-neuropsychological test variability predicted incident dementia (hazard ratio for one point difference in variability, 3.926, 95% CI, 2.040 - 7.558) even after adjusting for level of performance on individual neuropsychological tests (hazard ratio, 2.098, 95% CI, 1.041 - 4.225). The significant partial likelihood ratio test (p=0.04) comparing Cox models using neuropsychological tests with and without within-person across-neuropsychological test variability provided further evidence that the former improved the prediction of incident dementia. Finally, within-person across-neuropsychological test variability significantly increased the sensitivity for predicting dementia within one year compared to neuropsychological tests alone (McNemar’s test p=0.014).

Conclusions

Within-person across-neuropsychological test variability is a novel predictor of incident dementia.


Developing strategies to improve the prediction and diagnosis of dementia has paramount therapeutic and public health implications1. Neuropsychological tests support the diagnosis of dementia2, 3,the transitional stages that precede it, such as mild cognitive impairment4, 5, and the delineation of cognitive changes over time6. Level of performance on tests of memory7, 8 and executive function9, 10 has been reported to predict future dementia. Further, memory begins to decline more rapidly up to seven years prior to diagnosis of dementia11, 12.

When neuropsychological tests are used for diagnostic purposes, an individual’s level of performance on specific tests is measured against healthy normative samples to determine cognitive impairment13, 14. This approach, however, does not take into account intra-individual variability in cognitive function. The taxonomy of intra-individual variability, considered as inconsistency in cognitive performance within a person, includes the following definitions: A) Variability on the same cognitive task across multiple assessments over a long period of time. Such variability may be high in healthy individuals across the life span15. B) Variability on repeated trials of a single cognitive task administered on one occasion or over short periods of time. Cross-sectional studies show that this form of variability is increased in aging16 and dementia 17, 18 but has shown variable results in subjects with mild cognitive impairment19, 20. C) Variability in performance across neuropsychological tests administered in a single session. Such variability is expected in the normal population21, 22 including non-demented older adults23 reflecting the individual’s relative cognitive strengths and weaknesses. This study focused on within-person across-neuropsychological test variability as it can be estimated directly from standardized clinical neuropsychological procedures.

Increased within-person across-neuropsychological test variability has been related to poor cognitive function in normal older adults at cross section23. Accordingly, we examined whether increased within-person across-neuropsychological test variability: a) predicted future dementia; b) provided incremental prediction after taking into account level of performance on individual neuropsychological tests; and c) improved sensitivity for the prediction of dementia.

Methods

Study population

Participants were enrolled in the Einstein Aging Study (EAS), a longitudinal study of aging and dementia located at the Albert Einstein College of Medicine in Bronx County, New York. The study design, recruitment and follow-up methods were previously described24, 25. Briefly, EAS has used telephone-based screening procedures to recruit and follow a community-based cohort since 1993. The primary aim of the EAS is to identify risk factors for dementia. Eligibility criteria require that participants be 70 years of age and older, reside in Bronx, and speak English. Exclusion criteria include severe audiovisual disturbances that would interfere with completion of neuropsychological tests, inability to ambulate even with a walking aid or in a wheelchair, and institutionalization. Potential participants over age 70 from the Center for Medicaid Medicare Services population lists of Medicare eligible individuals were first contacted by letter, then by telephone, explaining the purpose nature of the study. The telephone interview included verbal consent, a brief medical history questionnaire, and telephone-based cognitive screening tests24. Following the interview, an age-stratified sample of subjects who matched on a computerized randomization procedure was invited for further evaluation at the medical center. This procedure was implemented to ensure that the individuals included in the EAS represented the Bronx population at that age stratum; and that those who agreed to participate were not different from non-responders in terms of key demographic characteristics. Written informed consents were obtained at clinic visits according to study protocols and approved by the Committee on Clinical Investigation (CCI; the institutional review board of the Albert Einstein College of Medicine).

A total of 1797 participants were enrolled in the EAS between October, 1993 and December, 2007. Participants seen for the baseline visit only (n=750), prevalent dementia cases (n=72) and those with missing information (n=78) were excluded. Hence, a total of 897 participants were included in this investigation. Participants had follow-up visits every 12 to 18 months, at which they underwent detailed neurological and neuropsychological evaluations.

Algorithmic diagnosis of dementia

To address the issue of diagnostic circularity we derived algorithmic diagnosis of dementia that was independent of the neuropsychological tests performance. Two conditions had to be met. First, participants had to make 8 or more errors on the Blessed Information-Memory-Concentration test (BIMC, best score 0 errors, worst possible score, 32 errors)26. This test has high test-retest reliability (0.86) and correlates well with the pathology of Alzheimer’s disease27, 28. The Blessed test is used to screen for cognitive impairment on the baseline telephone and in-person interviews in the EAS, but the scores are not utilized when assigning dementia diagnosis in consensus case conferences24. Second, impairments on basic activities on the Lawton-Brody activities of daily living29 secondary to cognitive impairment had to be documented using previously described procedures30. Incident dementia using the above algorithmic procedures was used as the main outcome measure in the primary analyses reported. There was 96 percent agreement between the algorithmic and clinical diagnosis of dementia in this cohort.

Clinical diagnosis of dementia

EAS participants suspected to have dementia received a complete diagnostic workup as previously described for this cohort31. Diagnoses of dementia were assigned according to the criteria of the Diagnostic and Statistical Manual of Mental Disorders, fourth edition (DSM-IV)32, at case conferences attended by at least one study neurologist, a neuropsychologist, and a geriatric nurse or social worker. Alzheimer’s disease was diagnosed according to the criteria for probable disease detailed by the National Institute of Neurological and Communicative Disorders and Stroke and the Alzheimer’s Disease and Related Disorders Association3. The State of California Alzheimer’s Disease Diagnostic and Treatment Centers criteria was used to assign diagnoses of probable, possible, or mixed vascular dementia 33. In subjects diagnosed with dementia, neuroimaging was used to help allocate the diagnosis of “probable” Alzheimer’s disease or “probable” vascular dementia. We have reported good agreement between clinical diagnoses of Alzheimer’s disease,34 vascular dementia,31, and dementia with Lewy bodies,35 and pathological findings in our study. For the purpose of this paper clinical diagnosis of dementia subtypes was used only in secondary analyses to explore whether associations with variability varied as a function of diseases processes. Hence only “pure” subtypes (Alzheimer’s and Vascular) but not mixed were considered.

Neuropsychological assessment

The tests included in this battery have been validated for use in the aging population in our and other aging studies8, 36, 37. Our recent studies reveal that factor analysis of the neuropsychological test battery consistently yielded three empirically derived and statistically orthogonal cognitive domains including Verbal IQ, Attention/Executive function and Memory38, 39. For the purpose of this study we identified three tests that represented the three cognitive domains mentioned above, and were available for all 897 participants included in this study. The Vocabulary (total score) subtest of the Wechsler Adult Intelligence Scale-revised (WAIS-R40), considered a “hold” test in that it is not sensitive to the effect of aging and age-related diseases, represented the Verbal IQ domain. The Free and Cued Selective Reminding Test (FCSRT, free recall) is commonly used to assess verbal memory and is sensitive to dementia41. The WAIS-R Digit Symbol Substitution subtest40 is commonly used to assess attention and executive function.

Estimate of within person variability across neuropsychological tests

The method used herein to estimate within person across tests variability was described in other studies as well 23, 42. The distribution of scores on the FCSRT, Vocabulary, and Digit Symbol tests were carefully examined to ascertain that assumptions of normality were met. Second, the raw scores of each test were Z transformed on the basis of the distribution of the entire sample (n=897). Then, the Z transformed test scores were used to calculate within person variability across the three tests using this equation:Variability=(ZikAi)2(K1), where Zik is the k th cognitive test score for i th subject. Here, k = 1, ..., K, K =3 (FCSRT, Vocabulary, and Digit Symbol) and Ai=k=1KZik is the individual’s mean z transformed score based on the three tests.

Illness Index

Consistent with our previous studies 38, 39, 43 dichotomous rating (present or absent) of diabetes, chronic heart failure, arthritis, hypertension, depression, stroke, Parkinson’s disease, chronic obstructive lung disease, angina, and myocardial infarction was used to calculate an illness index summary score (range 0-10). Medical history was obtained from multiple sources including significant others and family physicians when available. Trained research assistants used structured clinical interview, and the study physician obtained medical history during the neurological examination independently of the structured clinical interview. Information gathered from all the above sources was carefully scrutinized prior to data entry.

Statistical Analyses

First, to examine whether level of performance on the neuropsychological tests at baseline predicted the development of dementia we used Cox proportional-hazards regression analysis to estimate hazard ratios with 95 percent confidence intervals. The second Cox analysis determined the effect of within-person across-neuropsychological test variability on the development of dementia. To assess whether within-person across-neuropsychological variability provided incremental prediction of incident dementia above and beyond what was predicted by the absolute level of performance on the neuropsychological tests a third Cox proportional-hazards regression analysis was conducted controlling for performance differences on Vocabulary, FCSRT and Digit Symbol. The partial likelihood test is specifically designed to compare the goodness of fit of nested Cox proportional hazard models45. Herein, we compared a model with the three individual neuropsychological tests to a model that added within-person across-neuropsychological test variability using the partial likelihood ratio test to examine whether the latter improved significantly the prediction of dementia. Finally, using the McNemar’s test we examined whether within-person across-neuropsychological test variability significantly improved sensitivity for the prediction of new dementia cases within one year.

Age served as the time-scale because it is considered more appropriate than follow-up time in cohort studies44. When age serves as the time-scale, the hazard function can be directly interpreted as the age-specific incidence function and age is accounted for in the non-parametric term of the hazard function providing a more flexible and effective control of age44. Time to event was from age at the baseline assessment accounting for the left truncation at study inclusion to age at which diagnosis of dementia was ascertained or to final study contact for non-demented participants. In addition, all multivariate analyses were stratified by gender and controlled for education, and medical illness. For all primary analyses incident dementia was defined using the algorithmic procedures described earlier. Proportional hazards assumptions of the models were examined analytically and graphically and were adequately met.

Finally, Cox proportional-hazards regression analysis with age as the time scale were used to examine whether within person across neuropsychological tests variability predicted incident dementia subtypes (e.g. Vascular and Alzheimer’s dementia) defined by consensus diagnosis as previously described. Given the low number of incident cases for of Alzheimer’s dementia and Vascular dementia as well as concerns for potential diagnostic circularity these analyses were considered exploratory. All analyses were performed using SAS 9.1 (SAS Institute Inc., Cary, N.C) and S-Plus 8.0 (Insightful Corp.) The significance level was set at 0.05 and all tests were two-sided.

Results

Demographics

Participants were community residents who were relatively healthy and had normal global mental functioning based on BIMC scores that were within the normal range (Table 1).

Table 1.

Summary of sample characteristics, per quartile of within-person across-neuropsychological test variability at baseline

CHARACTERISTICS Entire sample (N=897) Quartile 1 variability (N=225) Quartile 2 Variability (N=224) Quartile 3 Variability (N= 224) Quartile 4 Variability (N=224)
Age Mean (SD), y 78.6 (5.3) 78.3 (5.1) 78.8 (5.6) 78.4 (5.2) 79 (5.3)
Education Mean (SD), y 13.3 (3.6) 13.6 (3.2) 13.4 (3.3) 13.2 (3.8) 12.9 (3.9)
Women, No, (%) 538 (60) 134 (60) 146 (65) 129 (58) 129, (58)
Blessed: Mean (SD) 2.6 (2.5) 2.2 (2.1) 2.5 (2.5) 2.4 (2.2) 3.5 (2.9)
FCSRT: Mean (SD) 30.0 (6.8) 30.8 (4.9) 30.9 (5.7) 30.4 (5.9) 27.8 (9.3)
Vocabulary: Mean (SD) 46.9 (13.9) 49.9 (10.9) 48.8 (11.7) 47.4 (14.2) 42.6 (17.3)
DSYM: Mean (SD) 34.5 (11.4) 35.9 (8.6) 35.7 (10.5) 33.3 (11.7) 32.9 (14.1)
Variability: Mean (SD) 0.74 (0.40) 0.29 (0.11) 0.56 (0.07) 0.82 (0.09) 1.29 (0.26)
Incident dementia cases, No (%) 61 (7) 3 (1) 19 (8) 13 (6) 26 (12)
Illness Index: Mean (SD) 1.2 (1.0) 1.2 (1.0) 1.1 (1.0) 1.2 (1.0) 1.1 (1.1)
Depression, No, (%) 83 (9) 28 (12) 23 (10) 14 (6) 18 (8)
Angina, No, (%) 90 (10) 22 (10) 22 (10) 25 (11) 21 (9)
Arthritis, No, (%) 51 (6) 13 (6) 12 (5) 15 (7) 11 (5)
Diabetes, No, (%) 120 (13) 29 (13) 24 (11) 31 (14) 36 (16)
Chronic obstructive lung disease, No, (%) 35 (4) 11 (5) 9 (4) 8 (4) 7 (3)
Hypertension, No, (%) 464 (52) 114 (51) 117 (52) 121 (54) 112 (50)
Myocardial infarction, No, (%) 90 (10) 25 (11) 18 (8) 26 (12) 21 (9)
Chronic heart failure, No, (%) 12 (1) 2 (1) 4 (2) 4 (2) 2 (1)
Parkinson’s Disease, No, (%) 12 (1) 4 (2) 2 (1) 2 (1) 4 (2)
Stroke, No, (%) 86 (10) 22 (10) 17 (8) 25 (11) 22 (10)

Variability denotes within-person across-neuropsychological test variability

Increased variability denotes worse cognitive function

FCSRT: Free and Cued Selective Reminding Test

DSYM: Digit Symbol Substitution subtest of the WAIS-R

Vocabulary: Vocabulary subtest of the WAIS-R

Of the 897 participants, 61 (6.8%) incident dementia cases, defined by the algorithmic procedure described earlier, were identified during the follow-up period (M=3.3±2.4 years). On the basis of the consensus clinical diagnostic procedures 47 participants developed incident dementia of the Alzheimer’s type and 18 participants developed incident Vascular dementia.

Prediction of Dementia

All Cox proportional-hazards regression models used age as the time scale, were stratified by gender, and adjusted for education and medical illness. The first model revealed that higher scores on FCSRT (hazard ratio for one point difference on the test, 0.872, 95% CI, 0.840 - 0.906, p<0.001) and Digit Symbol Substitution (hazard ratio for one point difference on the test, 0.971, 95% CI, 0.943 - 0.999, p=0.04) predicted lower risk of developing dementia. Level of performance on the Vocabulary test did not predict future dementia (hazard ratio for one point difference on the test, 1.005, 95% CI, 0.981 - 1.030, p=0.67).

The second Cox proportional-hazards regression analysis showed that within-person across-neuropsychological test variability was a significant predictor of incident dementia defined algorithmically (hazard ratio for one point difference in variability, 3.926, 95% CI, 2.040-7.558, p<0.001). This association was in the expected direction with the risk of dementia increasing as a function of greater within person variability.

The third Cox proportional-hazards regression analysis examined whether within-person across-neuropsychological test variability provided incremental prediction of incident dementia. Similar to the previous analysis, the results revealed that higher scores on the FCSRT (hazard ratio for one point difference on the test, 0.886, 95% CI, 0.853 - 0.921, p<0.001) and Digit Symbol Substitution (hazard ratio for one point difference on the test, 0.973; 95% CI, 0.943 - 1.000, p=0.04) predicted lower risk of incident dementia. As in the previous analysis the Vocabulary test did not predict future dementia (hazard ratio, 1.003 for one point difference on the test, 95% CI, 0.980 - 1.026, p=0.81). However, even after taking level of neuropsychological performance into account, within-person across-neuropsychological test variability remained a significant predictor of incident dementia (hazard ratio for one SD difference in variability, 2.098; 95% CI, 1.041 - 4.225, p=0.03). The reduction in the hazard ratio of variability was expected when adjusting for performance level on the individual tests. However, the low correlations between variability and performance on the neuropsychological tests (FCSRT, r= -0.21, p<0.001; Digit Symbol Substitution, r= -0,115, p<0.001; Vocabulary, r= -0.179, p<0.001) although significant were not suggestive of collinearity. The lower, but significant, hazard ratio for within-person across-neuropsychological test variability in the third compared to the second Cox analysis, along with hazard ratios for level of performance on the individual neuropsychological tests that were similar in the two analyses, are evidence that part, but not all, of the effect of variability is mediated through changes in levels of cognitive performance.

The partial likelihood ratio test45 examined whether adding within-person across-neuropsychological test variability as a predictor to level of performance on the neuropsychological tests (third Cox analysis) improved prediction of dementia compared to using performance level on the neuropsychological tests alone (first Cox analysis). The results were significant (χ2 =4.19, p=0.04) indicating that prediction of incident dementia was improved by adding within-person across-neuropsychological test variability. It was of further interest to examine whether within-person across-neuropsychological test variability improved the sensitivity for the prediction of dementia within one year. In a model that included the three individual neuropsychological tests and controlled for age, sex, education, and disease illness the cut-score that resulted in 80% specificity yielded 83% sensitivity for predicting dementia within one year. Including within-person across-neuropsychological test variability in this model significantly increased the sensitivity for predicting dementia within one year to 88% (p=0.014, McNemar’s test).

Kaplan-Meier Curves delineated the cumulative risk of incident algorithmic dementia using age as the time scale. Within person across neuropsychological tests variability, dichotomized into the highest (ie., worst) quartile versus the rest, revealed that consistent with the previous results higher variability was associated with increased risk of incident dementia (hazard ratio, 2.253; 95% CI, 1.317 - 3.855) (Figure 1a).

Figure 1.

Figure 1

a: Kaplan-Meier Curves for the Cumulative Risk of incident algorithmic dementia as a function of within-person across-neuropsychological test variability (highest quartile illustrated with a blue line versus the remaining three quartiles illustrated with a black line). Variability denotes within-person across-neuropsychological test variability; Variability > Q3 denotes highest (i.e., worst) quartile; Variability <= Q3 denotes remaining lower 3 quartiles

b: Kaplan-Meier Curves for the Cumulative Risk of incident algorithmic dementia as a function FCSRT performance (highest quartile illustrated with a black line versus the remaining three quartiles illustrated with a blue line). FCSRT denotes Free and Cued Selective Reminding Test; FCSRT > Q1 denotes highest (i.e., best) quartile; FCSRT <= Q1 denotes remaining lower 3 quartiles

c: Kaplan-Meier Curves for the Cumulative Risk of incident algorithmic dementia as a function of performance on the Digit Symbol Substitution test of the WAIS-R (highest quartile illustrated with a black line versus the remaining three quartiles illustrated with a blue line). DSYM denotes Digit Symbol Substitution Test; DSYM > Q1 denotes highest (i.e., best) quartile; DSYM <= Q1 denotes remaining lower 3 quartiles

d: Kaplan-Meier Curves for the Cumulative Risk of incident algorithmic dementia as a function of performance on the Digit Symbol Substitution tests (highest quartile illustrated with a black line versus the remaining three quartiles illustrated with a blue line). VOCAB denotes Vocabulary subtest of the WAIS-R VOCAB > Q1 denotes highest (i.e., best) quartile; VOCAB <= Q1 denotes remaining lower 3 quartiles

Additionally, Kaplan-Meier curves for the FCSRT (hazard ratio, 4.772; 95% CI, 2.798-8.1), Digit Symbol Substitution (hazard ratio, 2.428; 95% CI, 1.440-4.093), and Vocabulary tests ((hazard ratio, 1.505; 95% CI, 0.880-2.574) dichotomized into the highest (i.e., best) quartile versus the rest are illustrated in Figure 1b -d, respectively.

Exploratory analyses using clinical consensus diagnosis of dementia subtypes

Cox proportional-hazards regression analysis with age serving as the time scale adjusting for sex, education, and illness index examined whether associations between variability and clinical diagnosis of dementia using consensus case conference procedures varied as a function of dementia subtype. Variability predicted development of both incident dementia of the Alzheimer’s type (n=47; hazard ratio, 3.626; 95 percent confidence interval, 1.786 to 7.365), and incident vascular dementia (n=18; hazard ratio, 5.258; 95 percent confidence interval, 1.696 to 16.303). The disease processes underlying “pure” Alzheimer’s and Vascular dementias are different. Thus, it appears that variability is sensitive to dementia irrespective of disease subtype.

Discussion

Intra-individual variability is concerned with the study of within person differences in cognitive function and its use as a marker of pathology46. The taxonomy of intra-individual variability offers several operational definitions for this construct47. To our knowledge, this is the first study to demonstrate that within-person across-neuropsychological test variability predicts dementia in a population-based cohort age 70 and older even after adjusting for level of performance on each individual neuropsychological test. The potential clinical and diagnostic utility of within-person across-neuropsychological test variability was further substantiated by its contribution to increased sensitivity in the prediction of dementia within one year of the baseline assessment.

The premise guiding the study was that variability or inconsistency in performance across neuropsychological tests would increase in the pre-clinical stages of dementia compared to normal aging. This, in part, is attributed to the decline in memory function years before the diagnosis of dementia11. As opposed to summary scores or intra-individual indices that estimate function within one cognitive domain, within-person across-neuropsychological test variability may be conceptualized as a single representation of variability across multiple domains subserved by a several cortical regions and networks. In this context, this form of cognitive variability maybe considered a signature of decline in cerebral integrity in the early stages of dementing illnesses.

In choosing the individual tests used to calculate within-person across-neuropsychological test variability we aimed to maximize the number of participants, represent different cognitive domains, and increase generalizability by including a few easily administered tests that are commonly used in other studies and by clinicians assessing cognitive impairment in the elderly. Within-person across-neuropsychological test variability can be computed using tests other than those used in this study. Further, this form of cognitive variability can be estimated using standard and widely used clinical neuropsychological assessment procedures that are typically given in one testing session. Hence, the potential clinical utility of this aspect of cognitive function is quite appealing as it requires no changes to standard assessment procedures in aging studies or assessment of cognitive disorder in clinic settings. It is noteworthy that variability on sub-domains of the MMSE48 was related to short term decline on a global measure of cognitive function in a small cohort of centenarians42. Hence, examining across cognitive domain variability in screening measures including those currently used to support the diagnosis of mild cognitive impairments49 is of interest.

Research and theories concerning within person variability on single measures, although relatively recent, suggest that increased inconsistency represents impaired top down executive control processes50 subserved by frontal regions51, 52. However, at present, little is known about within person across cognitive tests variability, the theoretical and neuroanatomical basis for this putative construct. Our findings suggest that even though substantial in non-demented older adults23 within person across neuropsychological tests variability is pathological when a certain threshold is exceeded. It is noteworthy that variability was sensitive to both Alzheimer and Vascular dementia subtypes, which vary in terms of their etiology and cognitive profile33, 53, 54. Therefore, as a signature of early decline in global cerebral integrity, variability may capture the summation of differing sensitivities of brain regions and networks to various disease processes as opposed to estimating the effect of disease on a single brain region. The low correlations between variability and the level of performance on the individual neuropsychological tests appear to support this notion.

The limitations of the study should be considered. First, the sample although representative of the Bronx, consisted of volunteers who resided in the community and who were relatively healthy and willing to travel to the medical center. Hence, the generalizability of the findings may potentially be limited. Second, the diagnosis of incident dementia was based on an algorithmic procedure that was independent of neuropsychological test scores and consensus diagnosis. Hence, diagnostic circularity is less likely to confound the findings. However, optimally, within-person across-neuropsychological test variability should be derived from tests that are not part of the clinical diagnostic procedures to examine longitudinal associations between this form variability and clinical diagnosis of dementia. Third, assignment of subtypes of dementia is fallible. Although the diagnoses were made according to standardized criteria, some misclassification is inevitable. Further, differential associations between variability and dementia subtypes may be demonstrated, perhaps when using neuropsychological tests that are more sensitive to either Vascular or Alzheimer’s diseases33, 53, 54. Fourth, attrition is a major issue of concern in any longitudinal study. A significant number of individuals enrolled in the EAS were not eligible to participate in this investigation because they had only the baseline evaluation and were awaiting the next yearly visit. However, the subsample included in this study was not different from the entire EAS cohort in terms of age, gender distribution and level of education. Nonetheless the potential selection bias should be considered. Finally, we emphasize that a single variability measure cannot replace a complete neuropsychological examination nor do we advocate that three neuropsychological tests are sufficient to provide adequate assessment of the individual’s cognitive function. Instead, we propose that measures of within person across tests variability should be viewed as complementary to standard assessment procedures that are used to predict future risk of dementia.

It is of further interest to examine whether and the extent to which different aspects of intra-individual variability in cognitive functions are related in terms of the theory, underlying mechanism and utility in predicting outcomes of interest. For instance, research suggests that individuals are not consistently diagnosed with MCI across multiple sessions55, 56. Although some fluctuations may be attributed to error in measurement and limitations of this construct, it is of interest to examine the relationship between, across and within testing session variability and whether the latter improves understanding and prediction of transitional stages in cognitive aging.

Acknowledgements

Authors’ Contributions: Dr. Holtzer had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.

Study concept and design: Holtzer, Verghese.

Acquisition of data: Lipton, Verghese.

Analysis and interpretation of data: Holtzer, Verghese, Hall, Wang, Lipton

Drafting of the manuscript: Holtzer, Verghese

Critical revision of the manuscript for important intellectual content: Holtzer, Verghese, Hall, Wang, Lipton,

Statistical analysis: Wang, Hall, Holtzer, Verghese.

Obtained funding: Lipton.

Administrative, technical, or material support: Verghese, Lipton.

Study supervision: Holtzer, Verghese

Financial Disclosures: None reported.

Funding/Support: Funding: The Einstein Aging Study is supported by the National Institutes on Aging program project grant (AGO3949). Roee Holtzer is supported by the National Institute on Aging Paul B Beeson Award (K23 AG030857). Joe Verghese is supported by the National Institute on Aging grant (AG025119).

Role of the Sponsor: The National Institute on Aging did not have any role in the “design and conduct of the study; collection, management, analysis, and interpretation of the data; and preparation, review, or approval of the manuscript.”

References

  • 1.DeCarli C. Mild cognitive impairment: prevalence, prognosis, aetiology, and treatment. Lancet Neurol. 2003 Jan;2(1):15–21. doi: 10.1016/s1474-4422(03)00262-x. [DOI] [PubMed] [Google Scholar]
  • 2.Association AP. Diagnostic and Statistical Manual of Mental Disorders. text revision ed. Author; Washington, DC: 2000. [Google Scholar]
  • 3.McKhann G, Drachman D, Folstein M, Katzman R, Price D, Stadlan EM. Clinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s Disease. Neurology. 1984 Jul;34(7):939–944. doi: 10.1212/wnl.34.7.939. [DOI] [PubMed] [Google Scholar]
  • 4.Petersen RC, Doody R, Kurz A, et al. Current concepts in mild cognitive impairment. Arch Neurol. 2001 Dec;58(12):1985–1992. doi: 10.1001/archneur.58.12.1985. [DOI] [PubMed] [Google Scholar]
  • 5.Petersen RC, Smith GE, Waring SC, Ivnik RJ, Tangalos EG, Kokmen E. Mild cognitive impairment: clinical characterization and outcome. Arch Neurol. 1999 Mar;56(3):303–308. doi: 10.1001/archneur.56.3.303. [DOI] [PubMed] [Google Scholar]
  • 6.Potter GG, Attix DK. An Integrated Model for Geriatric Neuropsychological Assessment. In: Attix DK, Welsh-Bohmer KA, editors. Geriatric neuropsychology: Assessment and intervention. Guilford Publications; New York, NY: 2006. pp. 5–26. [Google Scholar]
  • 7.Grober E, Lipton RB, Hall C, Crystal H. Memory impairment on free and cued selective reminding predicts dementia. Year of Publication 2000. Neurology. 2000 Feb;54(4):827–832. doi: 10.1212/wnl.54.4.827. [DOI] [PubMed] [Google Scholar]
  • 8.Masur DM, Sliwinski M, Lipton RB, Blau AD, Crystal HA. Neuropsychological prediction of dementia and the absence of dementia in healthy elderly persons. Neurology. 1994 Aug;44(8):1427–1432. doi: 10.1212/wnl.44.8.1427. [DOI] [PubMed] [Google Scholar]
  • 9.Albert MS, Moss MB, Tanzi R, Jones K. Preclinical prediction of AD using neuropsychological tests. Journal of the International Neuropsychological Society. 2001 Jul;7(5):631–639. doi: 10.1017/s1355617701755105. [DOI] [PubMed] [Google Scholar]
  • 10.Backman L, Jones S, Berger AK, Laukka EJ, Small BJ. Cognitive impairment in preclinical Alzheimer’s disease: a meta-analysis. Neuropsychology. 2005 Jul;19(4):520–531. doi: 10.1037/0894-4105.19.4.520. [DOI] [PubMed] [Google Scholar]
  • 11.Hall CB, Ying J, Kuo L, et al. Estimation of bivariate measurements having different change points, with application to cognitive ageing. Stat Med. 2001 Dec 30;20(24):3695–3714. doi: 10.1002/sim.1113. [DOI] [PubMed] [Google Scholar]
  • 12.Hall CB, Ying J, Kuoc L, Lipton RB. Bayesian and profile likelihood change point methods for modeling cognitive function over time. Computational Statistics & Data Analysis. 2003;42:91–109. [Google Scholar]
  • 13.Lezak MD, Diane BH, Loring DW. Neuropsychological Assessment. 4 th ed. Oxford University Press; New York: 2004. [Google Scholar]
  • 14.Mitrushina M, Boone KB, Razani J, D’Elia L. Handbook of normative data for neuropsychological assessment. 2 nd ed. Oxford University Press; New York: 2005. [Google Scholar]
  • 15.Salthouse TA. Implications of within-person variability in cognitive and neuropsychological functioning for the interpretation of change. Neuropsychology. 2007 Jul;21(4):401–411. doi: 10.1037/0894-4105.21.4.401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.West R. The transient nature of executive control processes in younger and older adults. European Journal of Cognitive Psychology. 2001 Mar-Jun;13(12):91–105. [Google Scholar]
  • 17.Hultsch DF, MacDonald SWS, Hunter MA, Levy-Bencheton J, Strauss E. Intraindividual variability in cognitive performance in older adults: Comparison of adults with mild dementia, adults with arthritis, and healthy adults. Neuropsychology. 2000 Oct;14(4):588–598. doi: 10.1037//0894-4105.14.4.588. [DOI] [PubMed] [Google Scholar]
  • 18.Knotek PC, Bayles KA, Kaszniak AW. Response consistency on a semantic memory task in persons with dementia of the Alzheimer type. Brain and Language. 1990 May;38(4):465–475. doi: 10.1016/0093-934x(90)90131-y. [DOI] [PubMed] [Google Scholar]
  • 19.Christensen H, Dear KB, Anstey KJ, Parslow RA, Sachdev P, Jorm AF. Within-occasion intraindividual variability and preclinical diagnostic status: is intraindividual variability an indicator of mild cognitive impairment? Neuropsychology. 2005 May;19(3):309–317. doi: 10.1037/0894-4105.19.3.309. [DOI] [PubMed] [Google Scholar]
  • 20.Dixon RA, Garrett DD, Lentz TL, MacDonald SW, Strauss E, Hultsch DF. Neurocognitive markers of cognitive impairment: exploring the roles of speed and inconsistency. Neuropsychology. 2007 May;21(3):381–399. doi: 10.1037/0894-4105.21.3.381. [DOI] [PubMed] [Google Scholar]
  • 21.Matarazzo JD, Daniel MH, Prifitera A, Herman DO. Inter-subtest scatter in the WAIS--R standardization sample. Journal of Clinical Psychology. 1988 Nov;44(6):940–950. [Google Scholar]
  • 22.Matarazzo JD, Prifitera A. Subtest scatter and premorbid intelligence: Lessons from the WAIS--R standardization sample. Psychological Assessment. 1989 Sep;1(3):186–191. [Google Scholar]
  • 23.Schretlen DJ, Munro CA, Anthony JC, Pearlson GD. Examining the range of normal intraindividual variability in neuropsychological test performance. Journal of the International Neuropsychological Society. 2003 Sep;9(6):864–870. doi: 10.1017/S1355617703960061. [DOI] [PubMed] [Google Scholar]
  • 24.Lipton RB, Katz MJ, Kuslansky G, et al. Screening for dementia by telephone using the memory impairment screen. J Am Geriatr Soc. 2003 Oct;51(10):1382–1390. doi: 10.1046/j.1532-5415.2003.51455.x. [DOI] [PubMed] [Google Scholar]
  • 25.Verghese J, Katz MJ, Derby CA, Kuslansky G, Hall CB, Lipton RB. Reliability and validity of a telephone-based mobility assessment questionnaire. Age Ageing. 2004 Nov;33(6):628–632. doi: 10.1093/ageing/afh210. [DOI] [PubMed] [Google Scholar]
  • 26.Blessed G, Tomlinson BE, Roth M. The association between quantitative measures of dementia and of senile change in the cerebral grey matter of elderly subjects. British Journal of Psychiatry. 1968;114(512):797–811. doi: 10.1192/bjp.114.512.797. [DOI] [PubMed] [Google Scholar]
  • 27.Fuld P. Psychological testing in the differential diagnosis of the dementias. In: Katzman RTR, Bick KL, editors. Alzheimer’s disease: senile dementia and related disorders. Vol. 7. Raven Press; New York: 1978. pp. 185–193. [Google Scholar]
  • 28.Grober E, Dickson D, Sliwinski MJ, et al. Memory and mental status correlates of modified Braak staging. Neurobiol Aging. 1999 Nov-Dec;20(6):573–579. doi: 10.1016/s0197-4580(99)00063-9. [DOI] [PubMed] [Google Scholar]
  • 29.Lawton MP, Brody EM. Assessment of older people: self-maintaining and instrumental activities of daily living. Gerontologist. 1969 Autumn;9(3):179–186. [PubMed] [Google Scholar]
  • 30.Sarazin M, Berr C, De Rotrou J, et al. Amnestic syndrome of the medial temporal type identifies prodromal AD: a longitudinal study. Neurology. 2007 Nov 6;69(19):1859–1867. doi: 10.1212/01.wnl.0000279336.36610.f7. [DOI] [PubMed] [Google Scholar]
  • 31.Verghese J, Lipton RB, Hall CB, Kuslansky G, Katz MJ, Buschke H. Abnormality of gait as a predictor of non-Alzheimer’s dementia. N Engl J Med. 2002 Nov 28;347(22):1761–1768. doi: 10.1056/NEJMoa020441. [DOI] [PubMed] [Google Scholar]
  • 32.Association AP. Diagnostic and Statistical Nanual of Mental Disoders. 4 th ed. ed. Author; Washington: DC: 1994. [Google Scholar]
  • 33.Chui HC, Victoroff JI, Margolin D, Jagust W, Shankle R, Katzman R. Criteria for the diagnosis of ischemic vascular dementia proposed by the State of California Alzheimer’s Disease Diagnostic and Treatment Centers. Neurology. 1992 Mar;42(3 Pt 1):473–480. doi: 10.1212/wnl.42.3.473. [DOI] [PubMed] [Google Scholar]
  • 34.Crystal HA, Dickson D, Davies P, Masur D, Grober E, Lipton RB. The relative frequency of “dementia of unknown etiology” increases with age and is nearly 50% in nonagenarians. Arch Neurol. 2000 May;57(5):713–719. doi: 10.1001/archneur.57.5.713. [DOI] [PubMed] [Google Scholar]
  • 35.Verghese J, Crystal HA, Dickson DW, Lipton RB. Validity of clinical criteria for the diagnosis of dementia with Lewy bodies. Neurology. 1999 Dec 10;53(9):1974–1982. doi: 10.1212/wnl.53.9.1974. [DOI] [PubMed] [Google Scholar]
  • 36.Katzman R, Aronson M, Fuld P, et al. Development of dementing illnesses in an 80-year-old volunteer cohort. Ann Neurol. 1989 Apr;25(4):317–324. doi: 10.1002/ana.410250402. [DOI] [PubMed] [Google Scholar]
  • 37.Sliwinski M, Buschke H, Stewart WF, Masur D, Lipton RB. The effect of dementia risk factors on comparative and diagnostic selective reminding norms. J Int Neuropsychol Soc. 1997 Jul;3(4):317–326. [PubMed] [Google Scholar]
  • 38.Holtzer R, Friedman R, Lipton RB, Katz M, Xue X, Verghese J. The relationship between specific cognitive functions and falls in aging. Neuropsychology. 2007 Sep;21(5):540–548. doi: 10.1037/0894-4105.21.5.540. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Holtzer R, Verghese J, Xue X, Lipton RB. Cognitive Processes Related to Gait Velocity: Results From the Einstein Aging Study. Neuropsychology. 2006 Mar;20(2):215–223. doi: 10.1037/0894-4105.20.2.215. [DOI] [PubMed] [Google Scholar]
  • 40.Wechsler D. Wechsler Adult Intelligence Scale-revised. The Psychological Corporation; New York: 1981. [Google Scholar]
  • 41.Grober E, Buschke H, Crystal H, Bang S, Dresner R. Screening for dementia by memory testing. Neurology. 1988 Jun;38(6):900–903. doi: 10.1212/wnl.38.6.900. [DOI] [PubMed] [Google Scholar]
  • 42.Kliegel M, Sliwinski M. MMSE cross-domain variability predicts cognitive decline in centenarians. Gerontology. 2004 Jan-Feb;50(1):39–43. doi: 10.1159/000074388. [DOI] [PubMed] [Google Scholar]
  • 43.Verghese J, Wang C, Lipton RB, Holtzer R, Xue X. Quantitative gait dysfunction and risk of cognitive decline and dementia. J Neurol Neurosurg Psychiatry. 2007 Sep;78(9):929–935. doi: 10.1136/jnnp.2006.106914. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Thiebaut AC, Benichou J. Choice of time-scale in Cox’s model analysis of epidemiologic cohort data: a simulation study. Stat Med. 2004 Dec 30;23(24):3803–3820. doi: 10.1002/sim.2098. [DOI] [PubMed] [Google Scholar]
  • 45.Silvey SD. Statistical inference. Chapman & Hall; London: 1975. [Google Scholar]
  • 46.Stuss DT, Murphy KJ, Binns MA, Alexander MP. Staying on the job: the frontal lobes control individual performance variability. Brain. 2003 Nov;126(Pt 11):2363–2380. doi: 10.1093/brain/awg237. [DOI] [PubMed] [Google Scholar]
  • 47.Li SC, Huxhold O, Schmiedek F. Aging and attenuated processing robustness. Evidence from cognitive and sensorimotor functioning. Gerontology. 2004 Jan-Feb;50(1):28–34. doi: 10.1159/000074386. [DOI] [PubMed] [Google Scholar]
  • 48.Folstein MF, Folstein SE, McHugh PR. “Mini-mental state”. A practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res. 1975 Nov;12(3):189–198. doi: 10.1016/0022-3956(75)90026-6. [DOI] [PubMed] [Google Scholar]
  • 49.Nasreddine ZS, Phillips NA, Bedirian V, et al. The Montreal Cognitive Assessment, MoCA: a brief screening tool for mild cognitive impairment. J Am Geriatr Soc. 2005 Apr;53(4):695–699. doi: 10.1111/j.1532-5415.2005.53221.x. [DOI] [PubMed] [Google Scholar]
  • 50.West R, Murphy KJ, Armilio ML, Craik FI, Stuss DT. Lapses of intention and performance variability reveal age-related increases in fluctuations of executive control. Brain Cogn. 2002 Aug;49(3):402–419. doi: 10.1006/brcg.2001.1507. [DOI] [PubMed] [Google Scholar]
  • 51.Bellgrove MA, Hester R, Garavan H. The functional neuroanatomical correlates of response variability: Evidence from a response inhibition task. Neuropsychologia. 2004;42(14):1910–1916. doi: 10.1016/j.neuropsychologia.2004.05.007. [DOI] [PubMed] [Google Scholar]
  • 52.Stuss DT, Toth JP, Franchi D, Alexander MP, Tipper S, Craik FI. Dissociation of attentional processes in patients with focal frontal and posterior lesions. Neuropsychologia. 1999 Aug;37(9):1005–1027. doi: 10.1016/s0028-3932(98)00158-4. [DOI] [PubMed] [Google Scholar]
  • 53.Jokinen H, Kalska H, Mantyla R, et al. Cognitive profile of subcortical ischaemic vascular disease. J Neurol Neurosurg Psychiatry. 2006 Jan;77(1):28–33. doi: 10.1136/jnnp.2005.069120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.McPherson SE, Cummings JL. Neuropsychological aspects of vascular dementia. Brain Cogn. 1996 Jul;31(2):269–282. doi: 10.1006/brcg.1996.0045. [DOI] [PubMed] [Google Scholar]
  • 55.Bischkopf J, Busse A, Angermeyer MC. Mild cognitive impairment--a review of prevalence, incidence and outcome according to current approaches. Acta Psychiatr Scand. 2002 Dec;106(6):403–414. doi: 10.1034/j.1600-0447.2002.01417.x. [DOI] [PubMed] [Google Scholar]
  • 56.Collie A, Maruff P, Currie J. Behavioral characterization of mild cognitive impairment. J Clin Exp Neuropsychol. 2002 Sep;24(6):720–733. doi: 10.1076/jcen.24.6.720.8397. [DOI] [PubMed] [Google Scholar]

RESOURCES