Abstract
There has been an increasing trend to utilize short cognitive batteries for the diagnosis of dementia. Most of these batteries have been designed in countries with high standards of education and are less suitable for populations with low levels of education.We developed a battery that has been previously shown to be highly accurate in the diagnosis of dementia in individuals with low levels of education. The accuracy of this battery for patients with higher levels of education is unknown.
Objectives
To evaluate the accuracy of a brief cognitive battery in the diagnosis of Alzheimer’s disease (AD) in subjects with medium and high levels of schooling, and to develop a mathematical model that includes the most discriminative tests.
Methods
Seventy-three mildly demented patients with probable AD and 94 control subjects were evaluated. Sixty patients and 60 controls were randomly selected to generate a mathematical model including the most discriminative tests of the battery using logistic regression. The model was back-tested for the remaining sample of patients and controls.
Results
Delayed recall, learning and category fluency tests were included in a mathematical model that obtained an area of 0.917 in the ROC curve in the back-testing. Inter-rater reliabilities of these tests were high (kappa>0.8).
Conclusions
This model showed a high accuracy for the diagnosis of mild AD in patients with medium and high educational levels. Future studies with more heterogeneously educated individuals are necessary to investigate whether the educational level (number of years at school) should also be included in the model.
Keywords: Alzheimers disease, dementia, diagnosis, education, brief cognitive battery, neuropsychological tests
Abstract
Existe crescente tendência de se utilizarem baterias cognitivas breves no diagnóstico de demência. A maioria destas tem sido idealizada em países com altos níveis de escolaridade e é menos adequada para populações com baixa escolaridade. Desenvolvemos uma bateria que tem demonstrado alta acurácia no diagnóstico de demência em indivíduos com baixa escolaridade, mas a acurácia em indivíduos com escolaridade alta ainda não é conhecida.
Objetivos
Avaliar a acurácia de uma bateria cognitiva breve no diagnóstico de doença de Alzheimer (DA) em indivíduos com escolaridade média ou alta, e desenvolver um modelo matemático que inclua os testes mais discriminativos.
Métodos
Setenta e três pacientes com demência leve causada por DA provável e 94 controles foram avaliados com a bateria cognitiva breve. Sessenta pacientes e 60 controles foram selecionados aleatoriamente para gerar um modelo matemático com os testes mais discriminativos, empregando regressão logística. Este modelo foi retro-testado na amostra remanescente de pacientes e controles.
Resultados
Os testes de memória tardia, aprendizado e fluência verbal foram incluídos no modelo matemático que obteve área sob a curva ROC de 0,917 no retro-teste. A confiabilidade inter-examinadores destes testes foi alta (kappa>0.8).
Conclusões
Este modelo apresentou alta acurácia no diagnóstico de demência leve na DA em sujeitos com escolaridade média ou alta. Novos estudos com população com maior heterogeneidade educacional são necessários para investigar se os anos de escolaridade também devem ser incluídos no modelo.
Keywords: doença de Alzheimer, demência, diagnóstico, educação, bateria cognitiva breve, testes neuropsicológicos
There has been an increasing trend to utilize short cognitive batteries for the diagnosis of dementia. These batteries of neuropsychological tests may be used in epidemiological studies or in the busy clinical settings by specialists as well as by primary care physicians. Most, however, have been developed and validated in developed countries1-4, thus not reflecting the reality in developing and underdeveloped countries, where low education and even illiteracy are common, especially among the elderly. Particularly, batteries designed in developed countries usually include items that rely on reading or writing abilities, which are less suitable for populations with low levels of education.
For several years we have been working on a cognitive battery developed to evaluate subjects independent of their educational level, and thus applicable to populations with heterogeneous educational background5-7. In a previous study, we showed that the performance of illiterate non-demented elderly subjects was higher in the delayed recall test of our brief battery than in the Consortium to establish a registry for Alzheimers disease (CERAD) battery6. Subsequently, we showed that the delayed recall test of our battery was also more accurate than the equivalent test of the CERAD battery for diagnosing dementia in illiterates from an elderly cohort7.
On the other hand, a battery designed for diagnosing cognitive impairment in subjects which are illiterate or have low levels of education may not be applicable for those with high educational levels.
Thus, this study aims at evaluating the accuracy of this brief cognitive battery in the diagnosis of mild dementia caused by Alzheimer’s disease (AD) in patients with medium and high levels of schooling, as well as to ascertain the most discriminative tests, and develop a mathematical model that could be applicable in clinical practice.
Methods
Subjects
All patients with mild dementia due to probable AD according to NINCDS-ADRDA criteria8, who were evaluated by two senior neurologists (RN and PC) and submitted to a comprehensive neuropsychological battery by two psychologists (CSP and HCF) were included. All had laboratory examinations and CT or MRI of the head to exclude other causes of dementia. For the diagnosis of dementia, the neuropsychological battery consisted of the Mini-mental state examination – MMSE9,10, the Dementia Rating Scale – DRS11,12, visual reproduction13, Rey complex figure14, logical memory13, Rey auditory verbal learning test14,15, block design16, Hooper visual organization test17, Raven’s progressive matrices14, Boston naming test18,19, trail making test (A and B versions), Stroop color test and phonemic verbal fluency (F.A.S. test)14. Patients were diagnosed as mildly demented when the DRS global scores fell within the 110 to 122 range12. Seventy-three patients were included.
Ninety-four subjects were included in the healthy control group. They were volunteers recruited among spouses or friends of the patients or drawn from senior citizen groups. A semi-structured questionnaire was used to exclude those with memory or cognitive problems. Subjects who were using medications that could have interfered in the cognitive performance were excluded, as well as those with a history of alcoholism or uncontrolled chronic diseases. Normal control subjects were evaluated with the MMSE and the DRS.
Demographic data of patients and control subjects are shown in Table 1.
Table 1.
Demographic data and mean Mini-Mental State Examination (MMSE) and Dementia Rating Scale (DRS) scores of patients with Alzheimer's disease (AD) and control subjects.
Controls (N=94) | AD patients (N=73) | p | |
---|---|---|---|
Age | 71.1(6.4) | 73.3 (7.4 ) | 0.050 |
Gender | 37 M/57 W | 29 M/44W | 0.911 |
Years of schooling (mean) | 10.8 (4.5 ) | 11.1 (4.7 ) | 0.629 |
MMSE mean (SD) | 27.4 (1.6) | 23.2(3.0) | <0.001 |
DRS mean (SD) | 135.6 (6.1) | 116.2 (3.9) | <0.001 |
M, men;W, women.
Procedures
The brief cognitive battery was administered to all patients and controls, through the display of a sheet of paper in which 10 simple objects were presented as line drawings (shoe, house, comb, key, airplane, bucket, turtle, tree, spoon and book)5,6. First, the subject was asked to name each drawing. The examiner scored the number of correct identifications and naming of the drawn objects. If the subject was able to properly name the drawn objects, the identification was also deemed correct. However, if the naming was incorrect, this could indicate either failure in identification or in naming itself, although this differentiation was usually straightforward. For instance, if the patient named the airplane a fish, this was clearly an identification problem.When naming was incorrect or not remembered, the examiner provided the correct names. Immediately after naming all objects, the sheet of paper was removed from view, and the subject was asked to recall the drawings (incidental memory). The sheet of paper was then shown again for about 30 seconds, the patient was explicitly asked to memorize the objects, and recall was requested immediately after presentation. This latter procedure was performed twice, leading firstly to a score of immediate memory and then to a score of what was called learning test. After an interference period in which the category fluency test (number of animals in one minute) and the clock drawing test – CDT20 were performed, subjects were then asked to recall as many items as they could, for no more than one minute, where this constituted the delayed recall test. As a final procedure, a sheet of paper with 20 drawings, including the 10 previously presented along with 10 added distractors was displayed. The score of this recognition test was calculated by subtracting the number of wrong from the correct responses. To summarize, the brief cognitive battery consists of the identification, naming, incidental memory, immediate memory, learning, delayed recall and recognition of 10 simple drawings associated with the category fluency test and CDT.
In order to verify the reliability of the battery, 22 elderly controls were tested in the presence of two examiners (RN and PC) who scored the subject independently. The time taken to accomplish the battery was recorded for each of these 22 subjects. Sixty patients and 60 controls were randomly selected to generate a mathematical model using logistic regression with the most discriminative tests of the battery. The generated model was backtested for the remaining sample of patients (n=13) and controls (n=34).
ROC (receiver operator characteristic) curves were used to evaluate the accuracy of each test. Inter-rater reliability was evaluated using kappa values.
Comparisons between means of demographic data were performed through the t test or chi-square test when appropriate. For comparisons of test scores of patients and controls, the Mann-Whitney test was used. The value of significance accepted was 0.05. The software package SPSS for Windows 10.0 was used for the statistical analysis.
This study was approved by the Ethics Committee of the Hospital das Clínicas of the University of São Paulo School of Medicine. All subjects (or a relative, if necessary) were informed about the study prior to the evaluation, and informed consent was given.
Results
Patients and controls did not differ regarding the identification of the simple drawings (p=0.20), while all other tests of the battery were different for the two groups. Although naming was also different, 83.6% of the patients were able to correctly name the 10 drawings.When the whole sample was included, the delayed recall was the most discriminative test to correctly identify patients and controls. The areas under the curves (AUC), cut-off scores, sensitivities and specificities are shown in Table 2.
Table 2.
Cut-off scores of the most discriminative tests for the whole sample.
Area under the curve (95% CI) | Cut-off scores | Sensitivity (%) | Specificity (%) | |
---|---|---|---|---|
Delayed recall test | 0.931 (0.894;0.968) | <6 | 82.2 | 90.4 |
Learning test | 0.903 (0.857;0.948) | <7 | 90.4 | 74.5 |
Verbal fluency | 0.839 (0.778;0.900) | <15 | 83.6 | 67.0 |
Immediate memory | 0.890.(0.839;0.941) | <7 | 90.4 | 77.7 |
Incidental memory | 0.866 (0.812;0.920) | <5 | 81.7 | 75.5 |
Recognition | 0.903 (0.857;0.948) | <8 | 90.4 | 74.5 |
Clock-drawing | 0.769 (0.692;0.845) | <9 | 77.8 | 72.3 |
In the randomly selected sample of 60 patients and 60 controls, multivariate analysis selected the delayed recall, learning and category fluency tests to generate the model below:
where dr=delayed recall test, lt=learning test and cf= category fluency test, in which the higher the score, the higher the probability of dementia. In the back-testing, the model obtained an AUC-ROC of 0.917. The kappa values of the inter-rater reliability were 1.0 for the learning test, 0.913 for the delayed recall test and 0.846 for the category fluency test. The mean time to accomplish the battery was 487.7 (±87.6) seconds, with a median of 470 seconds.
Discussion
This study showed that this brief cognitive battery is highly accurate for the diagnosis of mild dementia in AD patients with medium-high educational levels, confirming the findings obtained in illiterate and low educated individuals. The inter-rater reliability was high, and the time to accomplish, about 8 minutes in elderly individuals, is not long considering the many cognitive functions that are evaluated.
A comparative advantage of this battery in relation, for instance, to the CERAD, is its similar procedural applicability and validity, regardless of the educational level of the subject or his or her level of mastering of the official language of their country of residence. Because of the possibility of using this battery for the evaluation of subjects with wide educational background, we decide to name it Brief Cognitive Battery – Unbiased by Education (BCB-Edu) to emphasize this information.
Other brief batteries consisting of simple and rapid tests have been proposed for dementia diagnosis. The Mini-Cog is based on the delayed recall of three words and on the CDT, having been used as a screening test3. In a previous study, we found that non-demented illiterate subjects have great difficulty in performing the CDT6,20. While non-demented literate subjects obtained a median of 9 on the CDT, the median for illiterates was 2.56. At least for our population, it is probable that the use of this combination is not appropriate.
Another interesting test is the Memory Impairment Screen, which is based on the delayed recall of four written words that are encoded with semantic cues21. However, its dependency on reading ability hampers its use for low educated subjects or for immigrants unable to read in the official language of their country of residence.More recently, it has been reported that the combination of the recall of a five-item name and address and category verbal fluency (animals in one minute) had high sensitivity and specificity for the diagnosis of dementia in a highly educated study sample22.
Delayed recall tests are considered the best single tests to diagnose AD23,24, while word generation tasks also have shown high specificity and sensitivity for the diagnosis of dementia25,26. The combination of the two types of tests, as proposed by Kilada et al.22 and by the BCB-Edu, is likely to prove very useful for the diagnosis of AD and other types of dementia. However, the animal fluency test is influenced by education, particularly by illiteracy, making it necessary to use different cut-off scores for different educational levels. For instance, while one study reported that the best cut-off score to differentiate AD from normal controls was 15 in a study sample with high education levels25, a recent Brazilian study reported that the best cut-off for diagnosing dementia in illiterate subjects is 927.
A few points deserve consideration. Firstly, as the time to accomplish the BCB-Edu is higher than the most recently proposed tests3,21,22, could only the three most discriminative tests be used rather than all nine items scored in the battery (identification, naming, incidental memory, immediate memory, learning task, delayed recall, recognition, category fluency and CDT)? As it is clear from the application procedures, that the identification, naming, incidental memory, and immediate memory steps are pre-requirements to the learning and delayed recall steps, they cannot be eliminated.
It is possible that the recognition test and CDT could be eliminated, reducing the procedure to about five minutes.
However, as the clock-drawing, together with the category fluency test, is included between the phase of memorizing the 10 figures and their delayed recall, it remains to be decided whether the exclusion of the CDT would impact the delayed recall score. In a recent study, the scores in a delayed recall test were not different when the time delays between encoding and recall were either two or four minutes22, which makes plausible the suggestion that the CDT be removed from the BCB-edu.
We were able to develop a mathematical model for the diagnosis of dementia. However, it is necessary to study whether this model is consistent when individuals with low levels of education or illiterates are included. Future studies should investigate whether the educational level (number of years at school) should be included in the model. Also, it will be necessary to develop a practical instrument for calculating the final score at the clinic, possibly using a palm-top or laptop.
Finally, as suggested above, by not relying on auditory reading or written material, BCB-Edu may allow reliable cognitive evaluation of those elderly individuals who are functional illiterates, or immigrants unable to read or to write in the official language of their country of residence, an ever increasing trend in many developed countries28,29.
References
- 1.Morris JC, Heyman A, Mohs RC. The Consortium to establish a registry for Alzheimer's disease (CERAD) Part I. clinical and neuropsychological assessment of Alzheimer's disease. Neurology. 1989;39:1159–1165. doi: 10.1212/wnl.39.9.1159. [DOI] [PubMed] [Google Scholar]
- 2.Meulen EF, Schmand B, van Campen JP. The seven minute screen a neurocognitive screening test highly sensitive to various types of dementia. J Neurol Neurosurg Psychiatr. 2004;75:700–705. doi: 10.1136/jnnp.2003.021055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Borson S, Scanlan J, Brush M, Vitaliano P, Dokmak A. The mini-cog a cognitive 'vital signs' measure for dementia screening in multi-lingual elderly. Int J Geriatr Psychiatr. 2000;15:1021–1027. doi: 10.1002/1099-1166(200011)15:11<1021::aid-gps234>3.0.co;2-6. [DOI] [PubMed] [Google Scholar]
- 4.Lorentz WJ, Scanlan JM, Borson S. Brief screening tests for dementia. Can J Psychiatr. 2002;47:723–733. doi: 10.1177/070674370204700803. [DOI] [PubMed] [Google Scholar]
- 5.Nitrini R, Lefèvre BH, Mathias SC. Neuropsychological tests of simple application for diagnosing dementia. Arq Neuropsiquiatr. 1994;52:457–465. doi: 10.1590/s0004-282x1994000400001. [DOI] [PubMed] [Google Scholar]
- 6.Nitrini R, Caramelli P, Herrera E Jr. Performance of illiterate and literate nondemented elderly subjects in two tests of long-term memory. J Int Neuropsychol Soc. 2004;10:634–638. doi: 10.1017/S1355617704104062. [DOI] [PubMed] [Google Scholar]
- 7.Takada LT, Caramelli P, Fichman HC. Comparison between two tests of delayed recall for the diagnosis of dementia. Arq Neuropsiquiatr. 2006;64:35–40. doi: 10.1590/s0004-282x2006000100008. [DOI] [PubMed] [Google Scholar]
- 8.McKhann G, Drachman D, Folstein M. Clinical diagnosis of Alzheimer's disease report of the NINCDSADRDA work group under the auspices of department of health and human services task force on Alzheimer's disease. Neurology. 1984;34:939–944. doi: 10.1212/wnl.34.7.939. [DOI] [PubMed] [Google Scholar]
- 9.Folstein MF, Folstein SE, McHugh PR. "Mini-mental State": a practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res. 1975;12:189–198. doi: 10.1016/0022-3956(75)90026-6. [DOI] [PubMed] [Google Scholar]
- 10.Brucki SMD, Nitrini R, Caramelli P, Bertolucci PHF, Okamoto IH. Sugestões para o uso do Mini-Exame do Estado Mental no Brasil. Arq Neuropsiquiatr. 2003;61:777–781. doi: 10.1590/s0004-282x2003000500014. [DOI] [PubMed] [Google Scholar]
- 11.Mattis S. Dementia Rating Scale: professional manual. Florida: Psychol Assess Res; 1988. [Google Scholar]
- 12.Porto SC, Charchat-Fichman H, Caramelli P, Bahia VA, Nitrini R. Dementia Rating Scale - DRS - in the diagnosis of patients with Alzheimer's dementia. Arq Neuropsiquiatr. 2003;61:339–345. doi: 10.1590/s0004-282x2003000300004. [DOI] [PubMed] [Google Scholar]
- 13.Weschler D. Weschler Memory Scale - revised manual. New York: Psychological Corporation; 1987. [Google Scholar]
- 14.Spreen O, Strauss E. A compendium of neuropsychological tests: administration, norms, and commentary. 2ª Ed. New York: Oxford Univ Press; 1998. [Google Scholar]
- 15.Diniz LFM, Cruz MF, Torres VM, Consenza RM. O teste de aprendizagem auditivo-verbal de Rey normas para uma população brasileira. Rev Bras Neurol. 2003;36:79–83. [Google Scholar]
- 16.Wechsler D. Test de inteligência para adultos (WAIS). Manual. 2nd ed. Buenos Aires: Paidos; 1993. [Google Scholar]
- 17.Hooper Visual Organization Test (VOT) Manual. Los Angeles: Western Psychological Services; 1983. [Google Scholar]
- 18.Kaplan E, Goodglass H, Weintraub S, Segal H. Boston Naming Test. Philadelphia: Lea & Febiger; 1983. [Google Scholar]
- 19.Mansur LL, Radanovic M, Araujo G de C, Taquemori LY, Greco LL. Boston Naming Test performance of Brazilian population from Sao Paulo. Pro Fono. 2006;18:13–20. doi: 10.1590/s0104-56872006000100003. [DOI] [PubMed] [Google Scholar]
- 20.Sunderland T, Hill JL, Mellow AM. Clock drawing in Alzheimer's disease a novel measure of dementia severity. J Am Geriatr Soc. 1989;37:725–729. doi: 10.1111/j.1532-5415.1989.tb02233.x. [DOI] [PubMed] [Google Scholar]
- 21.Buschke H, Kuslansky G, Katz M, Stewart WF. Screening for dementia with the Memory Impairment Screen. Neurology. 1999;52:231–238. doi: 10.1212/wnl.52.2.231. [DOI] [PubMed] [Google Scholar]
- 22.Kilada S, Gamaldo A, Grant EA, Moghekar A, Morris JC, O'Brien RJ. Brief screening tests for the diagnosis of dementia comparison with the mini-mental state exam. Alzheimer Dis Assoc Disord. 2005;19:8–16. doi: 10.1097/01.wad.0000155381.01350.bf. [DOI] [PubMed] [Google Scholar]
- 23.Welsh K, Butters N, Hughes J, Mohs R, Heyman A. Detection of abnormal memory decline in mild cases of Alzheimer's disease using CERAD neuropsychological measures. Arch Neurol. 1991;48:278–281. doi: 10.1001/archneur.1991.00530150046016. [DOI] [PubMed] [Google Scholar]
- 24.Bertolucci PH, Okamoto IH, Brucki SM, Siviero MO, Toniolo Neto J, Ramos LR. Applicability of the CERAD neuropsychological battery to Brazilian elderly. Arq Neuropsiquiatr. 2001;59:532–536. doi: 10.1590/s0004-282x2001000400009. [DOI] [PubMed] [Google Scholar]
- 25.Canning SJD, Leach L, Stuss D, Ngo L, Black SE. Diagnostic utility of abbreviated fluency measures in Alzheimer disease and vascular dementia. Neurology. 2004;62:556–562. doi: 10.1212/wnl.62.4.556. [DOI] [PubMed] [Google Scholar]
- 26.Cummings JL. The one-minute mental status examination. Neurology. 2004;62:534–535. doi: 10.1212/wnl.62.4.534. [DOI] [PubMed] [Google Scholar]
- 27.Caramelli P, Carthery-Goulart MT, Porto CS, Charchat-Fichman H, Nitrini R. Category fluency as a screening test for Alzheimer's disease in illiterate and literate patients. Alzheimer Dis Assoc Disord. (in press) doi: 10.1097/WAD.0b013e31802f244f. [DOI] [PubMed] [Google Scholar]
- 28.Gazmararian JA, Baker DW, Williams MV. Health literacy among Medicare enrollees in a managed care organization. JAMA. 1999;281:545–551. doi: 10.1001/jama.281.6.545. [DOI] [PubMed] [Google Scholar]
- 29.Parker C, Philp I. Screening for cognitive impairment among older people in black and minority ethnic groups. Age Ageing. 2004;33:447–452. doi: 10.1093/ageing/afh135. [DOI] [PubMed] [Google Scholar]