Shortening the Alzheimer’s disease assessment scale cognitive subscale

Stephen Z Levine; Yair Goldberg; Anat Rotstein; Myrto Samara; Kazufumi Yoshida; Andrea Cipriani; Takeshi Iwatsubo; Stefan Leucht; Toshiaki A Furukawa

doi:10.1192/j.eurpsy.2024.14

. 2024 Feb 23;67(1):e19. doi: 10.1192/j.eurpsy.2024.14

Shortening the Alzheimer’s disease assessment scale cognitive subscale

Stephen Z Levine ^1,^✉, Yair Goldberg ², Anat Rotstein ³, Myrto Samara ⁴, Kazufumi Yoshida ⁵, Andrea Cipriani ^6,^7,⁸, Takeshi Iwatsubo ⁹, Stefan Leucht ¹⁰, Toshiaki A Furukawa ⁵

PMCID: PMC10966609 PMID: 38389390

Abstract

Background

A short yet reliable cognitive measure is needed that separates treatment and placebo for treatment trials for Alzheimer’s disease. Hence, we aimed to shorten the Alzheimer’s Disease Assessment Scale Cognitive Subscale (ADAS-Cog) and test its use as an efficacy measure.

Methods

Secondary data analysis of participant-level data from five pivotal clinical trials of donepezil compared with placebo for Alzheimer’s disease (N = 2,198). Across all five trials, cognition was appraised using the original 11-item ADAS-Cog. Statistical analysis consisted of sample characterization, item response theory (IRT) to identify an ADAS-Cog short version, and mixed models for repeated-measures analysis to examine the effect sizes of ADAS-Cog change on the original and short versions in the placebo versus donepezil groups.

Results

Based on IRT, a short ADAS-Cog was developed with seven items and two response options. The original and short ADAS-Cog correlated at baseline and at weeks 12 and 24 at 0.7. Effect sizes based on mixed modeling showed that the short and original ADAS-Cog separated placebo and donepezil comparably (ADAS-Cog original ES = 0.33, 95% CI = 0.29, 0.40, ADAS-Cog short ES = 0.25, 95% CI =0.23, 0.34).

Conclusions

IRT identified a short ADAS-cog version that separated donepezil and placebo, suggesting its clinical potential for assessment and treatment monitoring.

Keywords: Alzheimer’s disease, assessment, clinical trials, cognition, item response theory, psychometric

Introduction

Alzheimer’s disease is a progressive neurodegenerative disorder that cumulates in mortality on average 4–8 years after the diagnosis, characterized by impairments in the activities of daily functioning and cognitive decline [1]. Since cognitive impairment is a clinical hallmark of Alzheimer’s disease [1] suitable assessments are essential for treatment and research following onset [2]. The most widely used and researched cognitive impairment outcome in clinical trials of Alzheimer’s disease is the Alzheimer’s disease Assessment Scale Cognitive Subscale (ADAS-Cog) [3]. The ADAS-Cog is one of the two primary cognitive outcome measures required by the Food and Drug Administration for clinical drug trials for the treatment of Alzheimer’s disease in the United States [4]; however, it is quite long to administer (takes on average 30–35 min to complete).

Early evidence based on traditional psychometric approaches reported that the ADAS-Cog demonstrates acceptable levels of reliability and validity [1, 2]. Validity was supported based on evidence showing that the different aspects of cognition that constitute the ADAS-Cog are adequately correlated to form a single factor [3]. However, subsequent research did not replicate the single-factor solution and instead identified two- and three-factor solutions [4, 5] and queried the level of reliability of the ADAS-Cog [6]. Furthermore, some studies suggest that the ADAS-Cog is appropriate for use only in the moderate stages of cognitive impairment. Namely, the ADAS-Cog demonstrates severe floor (i.e., some items are too easy for patients) and ceiling (i.e., some items are too difficult for patients) effects [3, 7, 8]. Hence, contentions exist that the ADAS-Cog is inappropriate for mild and severe stage dementia [3, 7, 8]. In addition, the traditional psychometric approaches to examining the ADAS-Cog cannot examine treatment effects [3, 9, 10]. Hence, given these inconsistent findings, examination of the ADAS-Cog using advanced psychometric approaches is warranted.

To improve the ADAS-Cog, advanced psychometric approaches, such as item response theory (IRT), may be helpful [6]. Unlike traditional psychometric approaches, like factor analysis, IRT offers ADAS-Cog details at different cognitive impairment levels by item, information (i.e., reliability), and response option. It does so graphically and numerically. Estimates are available to map the ability of an item to discriminate underlying cognitive impairment levels. Also, it is possible to estimate the probability of progressing to a higher cognitive impairment response option rating or not. It is possible to identify which response options are likely, unlikely, and superfluous [11]. This feature of IRT is related to identifying items and response options that display ceiling or floor aspects on the ADAS-Cog. This seems of note to clinical trials where a given item may be used as a selection criterion, thereby impacting the response option ratings on the remaining items.

IRT has been implemented in studies to shorten psychiatric [9–11] and cognitive measures in dementia [12]. Studies that use IRT to examine the ADAS-Cog highlight that the measure is optimal within the moderate range of cognitive impairment only [13]. However, research has yet to identify an ADAS-Cog IRT-based shortened version that separates treatment and placebo to detect treatment effects.

We aimed to develop an ADAS-Cog short form (ADAS-Cog) using IRT based on individual-level participant clinical trial data and to examine whether it could separate treatment and placebo groups.

Methods

Participants

Study design

Data were accessed on pivotal individual-level participant data of randomized controlled double-blinded trials of donepezil conducted by Eisai Co. Ltd (see Table S1 published as supplementary material online attached to the electronic version of this paper at https://www.cambridge.org/core/journals/european-psychiatry). Data access was granted after the submission of an analytic plan. The data were analyzed on a secure Internet cloud-based platform (http://www.clinicalstudydatarequest.com). Trials were included in which participants with Alzheimer’s disease were assessed with the ADAS-Cog. Individual-level participant data were ascertained from five randomized clinical trials with similar follow-up intervals [14–18]. Institutional review boards approved each trial.

Measures

ADAS-Cog: The ADAS-Cog is a neuropsychological index of cognitive impairment, indicating the severity of cognitive symptoms in Alzheimer’s disease [19]. This measure has been widely used in Alzheimer’s disease clinical trials [3] and has become as the gold standard for evaluating treatment efficacy [20]. It consists of 11 items to assess memory, language, and praxis functions [19]. The ADAS-Cog total score ranges from 0 to 70, with high scores indicating more severe cognitive impairment.

Analytic plan

First, following the removal of individuals with missing baseline ADAS-Cog item level scores (Table 1), the analytic sample was characterized. Second, items and rating options were removed based on IRT to identify an ADAS-Cog short version. Third, the ADAS-Cog original and short versions were examined with mixed-effects models for repeated-measures analysis (MMRM).

Table 1.

Sample characteristics

Study	N	Donepezil N (%)	Placebo N (%)	Female N (%)	Male N (%)	Age Mean (SD)
All trials	2198	1435 (65.29)	763 (34.71)	1344 (61.15)	854 (38.85)	72.42 (7.47)
Homma, Takeda (14)	268	136 (50.75)	132 (49.25)	179 (66.79)	89 (33.21)	70.51 (7.16)
Rogers and Friedhoff (15)	161	121 (75.16)	40 (24.84)	97 (60.25)	64 (39.75)	72.04 (7.45)
Rogers, Doody (16)	481	324 (67.36)	157 (32.64)	305 (63.41)	176 (36.59)	73.95 (7.56)
Rogers, Farlow (17)	473	311 (65.75)	162 (34.25)	293 (61.95)	180 (38.05)	73.48 (7.17)
Burns, Rossor (18)	815	543 (66.63)	272 (33.37)	470 (57.67)	345 (42.33)	71.62 (7.44)

Open in a new tab

IRT of the ADAS-Cog at baseline

IRT assumes a single component underlies the data. Hence, principal components analysis was implemented to ascertain the number of components underlying the data. Next, the graded response model (GRM) [21], a form of IRT, was implemented in the ltm package in R [22]. The GRM model has been used to shorten measures previously [9–11, 23]. In IRT, item discrimination parameters (α) map the ability of an item to discriminate impairment levels. Discrimination parameter values for items are considered very low (between 0.01 and 0.24), low (0.25 and 0.64), moderate (0.65 and 1.34), high (1.35 and 1.69), and very high (over 1.7) [24]. Threshold parameters (βs) indicate the point at which there is a probability of endorsing a higher cognitive impairment rating than the previous rating option. If a threshold value exceeds 1.96, it suggests that ratings provide accurate information, and the converse applies to negative values.

Three graphs are used in IRT: item response category characteristic curves (a plot of the probability of endorsing a rating option by the level of underlying cognitive impairment), Item information curves (lines at similar information levels indicate overlapping, namely that the items assess similar information and so there exists a degree of item redundancy). Test information shows the reliability of the cognitive functioning assessment at different impairment levels.

Mixed models to assess treatment effects

We examined change scores, marginal means, and effect sizes differences in the marginal mean with their associated bootstrapped confidence intervals between the donepezil and placebo groups using a three-level MMRM analysis with maximum likelihood estimation. The levels accounted for the data structure such that level 1 represented the visit, level 2 represented the individual, and level 3 represented the trial [25]. The covariates were age, sex, baseline ADAS-Cog score, and treatment group, and the outcome was the change score from baseline.

Results

Trial characteristics

After removing 12 participants owing to missing ADAS-Cog item responses, the five trials comprised 2,198 study participants. These formed the basis for the baseline IRT analysis (see Supplementary Table S1).

IRT analysis: Tasks discriminating cognitive impairment levels

A scree plot showed that the data sufficed the unidimensional assumption that IRT requires (see Figure S1 published as supplementary material online attached to the electronic version of this paper at https://www.cambridge.org/core/journals/european-psychiatry). Item discrimination parameters were computed to map the ability of an item to discriminate latent symptom severity levels (see Table 2 alpha values). For example, word recall had the highest ability to discriminate underlying cognitive impairment levels (α=1.92). Four ADAS-Cog tasks (spoken language ability, comprehension of spoken language, remembering test instruction, and word finding difficulty) had low item discrimination parameters (i.e., these tasks lacked the ability to discriminate underlying cognitive impairment levels). Hence, the aforementioned four tasks were considered inappropriate for the IRT-based short-scale, leaving seven possible ADAS-Cog tasks (word recall, commands, naming, constructional praxis, ideational praxis, orientation, word recognition).

Table 2.

Item parameters from IRT

Item	α	β₁	β₂	β₃	β₄	β5	β6	β7	β8	β9	β10	β11	β12
Word recall	1.92*****	−4.82	−4.45	−4.08	−3.33	−2.46	−1.55	−0.66	0.19	1.21	2.59*
Commands	1.32***	−0.06	1.02	1.84	3.34*	4.63*
Naming	1.28***	−0.17	−0.09	−0.06	1.51	2.68*
Constructional praxis	0.96***	−1.55	0.95	2.00*	3.98*	5.93*
Ideational praxis	1.12***	0.01	1.36	2.09*	2.63*	3.34*
Orientation	1.41****	−2.26	−1.33	−0.66	−0.07	0.62	1.3	2.35*	4.45*
Word recognition	1.25***	−4.41	−3.17	−2.25	−1.56	−1.08	−0.58	−0.14	0.3	0.76	1.22	1.74	2.49*
Spoken language ability	−0.59*	3.80*	2.82*	2.55*	2.51*	−2.47
Comprehension of spoken language	−0.62*	3.08*	2.22*	2.13*	2.11*	−2.02
Remembering test instruction	−0.76*	2.84*	1.86	1.51	0.71	−2.18
Word finding difficulty	−0.66*	2.57*	1.69	1.35	1.32	−1.25

Open in a new tab

Note: Item discrimination parameters (α) map the ability of an item to discriminate latent cognitive impairment levels. Discrimination parameter values (α) that range from 0.01 to 0.24 are very low, 0.25 to 0.64 low, 0.65 to 1.34 moderate, 1.35 to 1.69 high, and over 1.7 are very high (Baker, 2001). βs are standardized estimates of the 0.5 probability of endorsing a higher cognitive impairment rating where negative values indicate progression to the next response is unlikely.

IRT analysis: ADAS-cog information ascertained at different cognitive impairment levels

Task information (reliability) is ascertained by IRT for the total scale and each task. The topmost plot in Figure 1 shows the test information along the vertical axis at different cognitive impairment levels along the horizontal axis for the ADAS-Cog total. Figure 1 (top panel) suggests that the ADAS-Cog is more reliable at moderate and moderately high impairment levels but displays a reliability that is not satisfactory at low and very high cognitive impairment levels. Figure 1 (middle panel) shows that the information ascertained by word recall is moderate across impairment levels up to severe levels of impairment from which the information ascertained is low.

Of the remaining seven possible ADAS-Cog tasks, the amount of information captured ranged from low to moderate. Word recall captured information at moderate cognitive impairment levels, commands from moderate to high levels, naming at moderate levels, constructional praxis from low to high levels, ideational praxis from moderate to high levels, orientation from moderate to high levels, and word recognition from very low to high levels (for information plots for all tasks, see Figures S2 and S3 published as supplementary material online attached to the electronic version of this paper at https://www.cambridge.org/core/journals/european-psychiatry).

IRT analysis: Response options

Based on item characteristic curves and the probability of a response option being endorsed (Table 2 beta values), we aimed to remove overlapping response options. For instance, the bottom panel of Figure 1 shows that response option 10 is endorsed with a high likelihood at higher impairment levels. All seven possible ADAS-Cog tasks had at least one response option that would likely be required (see Figure S5 published as supplementary material online attached to the electronic version of this paper at https://www.cambridge.org/core/journals/european-psychiatry and Table 2 beta values). However, not all response options appeared to be necessary.

We examined Table 2 (and see Figure S5 published as supplementary material online attached to the electronic version of this paper at https://www.cambridge.org/core/journals/european-psychiatry) to identify and remove superfluous response options. We identified superfluous sources of information for each of the items: word recall (9–10 errors captured severe impairment, and the remaining response options appeared not to capture severe impairment); commands (up to 3 commands incorrect did not appear to have differential utility in capturing impairment, and subsequent commands incorrect slightly superfluous); naming (the options did not capture severe cognitive impairment except five: “9–11 items incorrect”); constructional praxis and ideational praxis (options 0–3 were unlikely to result in a subsequent rating, and 4 and 5 overlapped to moderate to severe capture impairment); orientation (response options 6–8 reflected more severe impairment); and word recognition (12 incorrect responses represented severe impairment, otherwise transition was unlikely and the item responses were quite superfluous).

The ADAS-Cog IRT short-scale scoring key

Based on the above, we recoded the IRT-based ADAS-Cog short version as follows: word recall (0 except 9–10 recoded as 1); commands (up to 3 as 0, otherwise 1); naming (0 except five as 1); constructional praxis and ideational praxis (options 0–3 as 0, and 4 and 5 as 1); orientation (0–5 as 0, 6–8 as 1); and word recognition (0 except 12 as 1). For consistency and ease of future use, dichotomous scoring was implemented.

Mixed models

The bivariate correlation at baseline, at week 12, and week 24 of the short and original ADAS-Cog measures was 0.7 across time points. MRMMs were implemented to contrast the original and IRT-based short ADAS-Cog (Figure 2). The marginal means differed between the original and short ADAS-Cog (original version: donepezil = −1.85, 95% CI = −2.16, −1.53, placebo = −0.38, 95% CI = −0.77, −0.00; short version: donepezil = −0.04, 95% CI = −0.10, −0.02, placebo = 0.11, 95% CI = 0.05, 0.18) were smaller for donepezil than placebo. Based on the marginal means, examination of the effect sizes showed that placebo and donepezil separated more for the original than the short ADAS-Cog version, but the bootstrapped confidence intervals overlapped between versions (ADAS-Cog original ES = 0.33, 95% CI = 0.29, 0.40, ADAS-Cog short ES = 0.25, 95% CI = 0.23, 0.34).

Figure 2. — Mixed model modeling changes in the original and short *Alzheimer*’*s Disease Assessment Scale Cognitive Subscale* (ADAS-Cog) up to 24 weeks. Note: Upper figure is the original ADAS-Cog and the lower is the short ADAS-Cog based item response theory.

Discussion

Based on five pivotal clinical trials of donepezil compared with placebo for Alzheimer’s disease (N = 2,198), we implemented IRT to shorten the ADAS-Cog and examined whether this short version could separate treatment and placebo groups in a manner similar to the original version. We identified a short ADAS-Cog that consisted of seven items and found that it separated placebo from donepezil in these trials.

IRT identified a short ADAS-Cog consisting of 7 items with dichotomous response options, in contrast to the original, which consists of 11 items with multiple response options. In our estimation, assuming the ADAS-Cog takes 30 min to administer, the test-time for the short version may be approximately 18 min or less, because the short version has seven items (36.37% fewer items than the original ADAS-Cog) and two response options (to ease future administration).

Based on mixed modeling, scores on the ADAS-Cog change short version were separated between placebo from donepezil in these individual participant trial data. Also, mixed modeling to examine ADAS-Cog change showed conclusions concerning efficacy were similar for both the short and original ADAS-Cog scales (i.e., both showed superior efficacy of donepezil compared to placebo). The effect size, however, slightly favored the original compared to the short scale.

Limitations and conclusions

Our study has several primary strengths, such as the use of individual-level participant data. Nonetheless, our study has notable limitations. First, clinical trial selection criteria restrict generalizations from clinical trial data to the general population [26, 27]. Hence, caution is warranted regarding generalizing from the current results to clinical treatment settings. To inform clinical practice, replicating the results in large-scale naturalistic studies with extended observation periods may be warranted. Second, unmeasured factors (e.g., delusions) may have confounded the study results. Nonetheless, the data common to all the trials did not contain such other information. Hence, our study may suffer from residual confounding, and future research may wish to account for other potential confounders. Third, our results are restricted to donepezil and placebo. Research is warranted to scrutinize the generalizability of these results to other antidementia drugs. Fourth, the study duration was restricted to 24 weeks of follow-up. Given the course of cognitive decline in Alzheimer’s disease, further research is warranted with longer study durations. Fifth, an independent prospective study is warranted to test the validity of the scale.

The clinical trials in our study were completed over a decade ago. Today, a significant proportion of participants would not receive a research diagnosis of Alzheimer’s disease. Specifically, perhaps up to 30% would receive diagnoses for other neurodegenerative disorders, including vascular or mixed dementia, based on current-day research diagnostic criteria that involve biomarkers, such as amyloid PET, to confirm neuropathology in Alzheimer’s disease according to the 2018 NIA-AA Research Framework [28]. However, the use of biomarkers is yet to translate to daily clinical practice [29]. In current daily clinical practice, the symptomatological diagnostic criteria, including DSM-5 [30] and NINCDS-ADRDA [31], are the basis for the prescription of donepezil and other antidementia drugs, as were done in the trials included in the current study.

Among the strengths of the current study design are the amount of evidence (five pivotal clinical trials) and the relatively large sample, which make the results robust. These features reinforce our faith in the robustness of the analysis. Clinically, a short ADAS-Cog with a strong correlation with the original offers possibilities in reducing the trial participant burden while keeping reliability intact. In sum, the current study contributes to knowledge on Alzheimer’s disease by identifying a short version of the ADAS-Cog with potential use for treatment monitoring in moderate-stage Alzheimer’s disease.

Supporting information

Levine et al. supplementary material

S0924933824000142sup001.docx^{(1.7MB, docx)}

Acknowledgments

Authors Levine and Goldberg contributed equally to this study and are joint first authors. The authors acknowledge Eisai Co. Ltd for providing us with the study data. Eisai Co. Ltd did not provide study design, critical input, or manuscript review for the study. The authors also acknowledge http://www.clinicalstudydatarequest.com for hosting the study data. Data are available based on a request to http://www.clinicalstudydatarequest.com.

Supplementary material

The supplementary material for this article can be found at http://doi.org/10.1192/j.eurpsy.2024.14.

Author contribution

Levine: Manuscript drafting, data curation, statistical analysis, data management, study conceptualization.

Goldberg: Study conceptualization, critical manuscript feedback, statistical analysis.

Rotstein: Study conceptualization, interpretation, critical manuscript feedback.

Yoshida: Critical manuscript feedback, data management, statistical analysis.

Samara: Study conceptualization, interpretation, critical manuscript feedback.

Cipriani: Study conceptualization, interpretation, critical manuscript feedback.

Iwatsubo: Study conceptualization, interpretation, critical manuscript feedback.

Leucht: Study conceptualization, interpretation, critical manuscript feedback.

Furukawa: Critical manuscript feedback, statistical review, study conceptualization, mentorship.

Financial support

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors. Cipriani is supported by the National Institute for Health Research (NIHR) Oxford Cognitive Health Clinical Research Facility, by an NIHR Research Professorship (grant RP-2017-08-ST2-006), by the NIHR Oxford and Thames Valley Applied Research Collaboration and by the NIHR Oxford Health Biomedical Research Centre (grant BRC-1215-20005). The views expressed are those of the authors and not necessarily those of the UK National Health Service, the NIHR, or the UK Department of Health.

Competing interest

Drs Levine, Yoshida, Rotstein, and Goldberg have nothing to disclose. Dr. Samara has received honoraria as a consultant or for lectures for Viatris, Recordati, Lundbeck, and Viatris. Dr. Iwatsubo has served as a consultant of Eisai and Eli Lilly in the last 3 years. Dr. Cipriani has received research and consultancy fees from INCiPiT (Italian Network for Pediatric Trials), CARIPLO Foundation and Angelini Pharma. In the last 3 years, SL has received honoraria for advising/consulting and/or for lectures and/or for educational material from Angelini, Boehringer Ingelheim, Eisai, Ekademia, GedeonRichter, Janssen, Karuna, Kynexis, Lundbeck, Medichem, Medscape, Mitsubishi, Neurotorium, Otsuka, NovoNordisk, Recordati, Rovi, and Teva. Dr. Furukawa reports royalties from Mitsubishi-Tanabe, consulting fees from Boehringer-Ingelheim, DT Axis, Kyoto University Original, Shionogi, SONY, UPTODATE, and Daiichi Sankyo, and a grant from Shionogi, outside the submitted work. In addition, Dr. Furukawa has patents 2020-548587 and 2022-082495 pending, and intellectual properties for Kokoro-app licensed to Mitsubishi-Tanabe.

References

[1].Alzheimer’s Association Report. 2022 Alzheimer’s disease facts and figures. Alzheimers Dement. 2022;18(4):700–89. [DOI] [PubMed] [Google Scholar]
[2].Robert P, Ferris S, Gauthier S, Ihl R, Winblad B, Tennigkeit F. Review of Alzheimer’s disease scales: is there a need for a new multi-domain scale for therapy evaluation in medical practice? Alzheimer’s Res Therapy. 2010;2(4):24. [DOI] [PMC free article] [PubMed] [Google Scholar]
[3].Rosen WG, Mohs RC, Davis KL. A new rating scale for Alzheimer’s disease. Am J Psychiatry. 1984;141(11):1356–64. [DOI] [PubMed] [Google Scholar]
[4].Manning CA, Ducharme JK. Dementia syndromes in the older adult. In: Lichtenberg PA, editor. Handbook of assessment in clinical gerontology. San Diego: Academic Press; 2010, p. 155–78. [Google Scholar]
[5].Weyer G, Erzigkeit H, Kanowski S, Ihl R, Hadler D. Alzheimer’s disease assessment scale: reliability and validity in a multicenter clinical trial. Int Psychogeriatr. 1997;9(2):123–38. [DOI] [PubMed] [Google Scholar]
[6].Cano SJ, Posner HB, Moline ML, Hurt SW, Swartz J, Hsu T, et al. The ADAS-Cog in Alzheimer’s disease clinical trials: psychometric evaluation of the sum and its parts. J Neurol Neurosurg Psychiatry. 2010;81(12):1363–8. [DOI] [PubMed] [Google Scholar]
[7].Cogo-Moreira H, Krance SH, Black SE, Herrmann N, Lanctôt KL, MacIntosh BJ, et al. Questioning the meaning of a change on the Alzheimer’s disease assessment scale–cognitive subscale (ADAS-Cog): noncomparable scores and item-specific effects over time. Assessment. 2021;28:1708–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
[8].Grochowalski JH, Liu Y, Siedlecki KL. Examining the reliability of ADAS-Cog change scores. Neuropsychol Dev Cogn B Aging Neuropsychol Cogn. 2016;23(5):513–29. [DOI] [PubMed] [Google Scholar]
[9].Levine SZ, Rabinowitz J, Rizopoulos D. Recommendations to improve the positive and negative syndrome scale (PANSS) based on item response theory. Psychiatry Res. 2011;188(3):446–52. [DOI] [PubMed] [Google Scholar]
[10].Wilson JE, Niu K, Nicolson SE, Levine SZ, Heckers S. The diagnostic criteria and structure of catatonia. Schizophr Res. 2015;164(1–3):256–62. [DOI] [PubMed] [Google Scholar]
[11].Levine SZ, Leucht S. Psychometric analysis in support of shortening the scale for the assessment of negative symptoms. Eur Neuropsychopharmacol. 2013;23(9):1051–6. [DOI] [PubMed] [Google Scholar]
[12].McGrory S, Doherty JM, Austin EJ, Starr JM, Shenkin SD. Item response theory analysis of cognitive tests in people with dementia: a systematic review. BMC Psychiatry. 2014;14:47. [DOI] [PMC free article] [PubMed] [Google Scholar]
[13].Benge JF, Balsis S, Geraci L, Massman PJ, Doody RS. How well do the ADAS-cog and its subscales measure cognitive dysfunction in Alzheimer’s disease? Dement Geriatr Cogn Disord. 2009;28(1):63–9. [DOI] [PubMed] [Google Scholar]
[14].Homma A, Takeda M, Imai Y, Udaka F, Hasegawa K, Kameyama M, et al. Clinical efficacy and safety of donepezil on cognitive and global function in patients with Alzheimer’s disease. A 24-week, multicenter, double-blind, placebo-controlled study in Japan. E2020 Study Group. Dement Geriatr Cogn Disord. 2000;11(6):299–313. [DOI] [PubMed] [Google Scholar]
[15].Rogers SL, Friedhoff LT. The efficacy and safety of donepezil in patients with Alzheimer’s disease: results of a US multicentre, randomized, double-blind, placebo-controlled trial. The donepezil study group. Dementia. 1996;7(6):293–303. [DOI] [PubMed] [Google Scholar]
[16].Rogers SL, Doody RS, Mohs RC, Friedhoff LT. Donepezil improves cognition and global function in Alzheimer disease: a 15-week, double-blind, placebo-controlled study. Donepezil Study Group. Arch Intern Med. 1998;158(9):1021–31. [DOI] [PubMed] [Google Scholar]
[17].Rogers SL, Farlow MR, Doody RS, Mohs R, Friedhoff LT. A 24-week, double-blind, placebo-controlled trial of donepezil in patients with Alzheimer’s disease. Donepezil Study Group. Neurology. 1998;50(1):136–45. [DOI] [PubMed] [Google Scholar]
[18].Burns A, Rossor M, Hecker J, Gauthier S, Petit H, Moller HJ, et al. The effects of donepezil in Alzheimer’s disease - results from a multinational trial. Dement Geriatr Cogn Disord. 1999;10(3):237–44. [DOI] [PubMed] [Google Scholar]
[19].Mohs RC, Cohen L. Alzheimer’s disease assessment scale (ADAS). Psychopharmacol Bull. 1988;24(4):627–8. [PubMed] [Google Scholar]
[20].Kueper JK, Speechley M, Montero-Odasso M. The Alzheimer’s disease assessment scale-cognitive subscale (ADAS-Cog): modifications and responsiveness in pre-dementia populations. A narrative review. J Alzheimers Dis. 2018;63(2):423–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
[21].Samejima F. Estimation of latent ability using a response pattern of graded scores. Psychometrika Mon Sup. 1969;34:1–97. [Google Scholar]
[22].Rizopoulos D. ltm: An R package for latent variable modelling and item response theory analyses. J Stat Software. 2006;17(5):1–25. [Google Scholar]
[23].Velthorst E, Levine SZ, Henquet C, de Haan L, van Os J, Myin-Germeys I, et al. To cut a short test even shorter: reliability and validity of a brief assessment of intellectual ability in schizophrenia--a control-case family study. Cogn Neuropsychiatry. 2013;18(6):574–93. [DOI] [PubMed] [Google Scholar]
[24].Baker F. The basics of item response theory. University of Maryland College Park, MD: ERIC Clearinghouse on Assessment and Evaluation; 2001.
[25].Hedeker DR, Gibbons RD. Longitudinal data analysis. Hoboken, NJ: Wiley-Interscience; 2006. [Google Scholar]
[26].Malmivaara A. Generalizability of findings from randomized controlled trials is limited in the leading general medical journals. J Clin Epidemiol. 2019;107:36–41. [DOI] [PubMed] [Google Scholar]
[27].Canevelli M, Bruno G, Vanacore N, de Lena C, Cesari M. Are we really tackling the “evidence-based medicine issue” in Alzheimer’s disease? Eur J Intern Med. 2016;35:e29–e30. [DOI] [PubMed] [Google Scholar]
[28].Jack CR Jr., Bennett DA, Blennow K, Carrillo MC, Dunn B, Haeberlein SB, et al. NIA-AA research framework: toward a biological definition of Alzheimer’s disease. Alzheimers Dement. 2018;14(4):535–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
[29].Frisoni GB, Boccardi M, Barkhof F, Blennow K, Cappa S, Chiotis K, et al. Strategic roadmap for an early diagnosis of Alzheimer’s disease based on biomarkers. Lancet Neurol. 2017;16(8):661–76. [DOI] [PubMed] [Google Scholar]
[30].American Psychiatric Association. Diagnostic and statistical manual of mental disorders, fifth edition (DSM-5). 5th ed. Arlington, VA: American Psychiatric Association; 2013. [Google Scholar]
[31].McKhann G, Drachman D, Folstein M, Katzman R, Price D, Stadlan EM. Clinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA work group under the auspices of department of health and human services task force on Alzheimer’s disease. Neurology. 1984;34(7):939–44. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Levine et al. supplementary material

S0924933824000142sup001.docx^{(1.7MB, docx)}

[r1] [1].Alzheimer’s Association Report. 2022 Alzheimer’s disease facts and figures. Alzheimers Dement. 2022;18(4):700–89. [DOI] [PubMed] [Google Scholar]

[r2] [2].Robert P, Ferris S, Gauthier S, Ihl R, Winblad B, Tennigkeit F. Review of Alzheimer’s disease scales: is there a need for a new multi-domain scale for therapy evaluation in medical practice? Alzheimer’s Res Therapy. 2010;2(4):24. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r3] [3].Rosen WG, Mohs RC, Davis KL. A new rating scale for Alzheimer’s disease. Am J Psychiatry. 1984;141(11):1356–64. [DOI] [PubMed] [Google Scholar]

[r4] [4].Manning CA, Ducharme JK. Dementia syndromes in the older adult. In: Lichtenberg PA, editor. Handbook of assessment in clinical gerontology. San Diego: Academic Press; 2010, p. 155–78. [Google Scholar]

[r5] [5].Weyer G, Erzigkeit H, Kanowski S, Ihl R, Hadler D. Alzheimer’s disease assessment scale: reliability and validity in a multicenter clinical trial. Int Psychogeriatr. 1997;9(2):123–38. [DOI] [PubMed] [Google Scholar]

[r6] [6].Cano SJ, Posner HB, Moline ML, Hurt SW, Swartz J, Hsu T, et al. The ADAS-Cog in Alzheimer’s disease clinical trials: psychometric evaluation of the sum and its parts. J Neurol Neurosurg Psychiatry. 2010;81(12):1363–8. [DOI] [PubMed] [Google Scholar]

[r7] [7].Cogo-Moreira H, Krance SH, Black SE, Herrmann N, Lanctôt KL, MacIntosh BJ, et al. Questioning the meaning of a change on the Alzheimer’s disease assessment scale–cognitive subscale (ADAS-Cog): noncomparable scores and item-specific effects over time. Assessment. 2021;28:1708–22. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r8] [8].Grochowalski JH, Liu Y, Siedlecki KL. Examining the reliability of ADAS-Cog change scores. Neuropsychol Dev Cogn B Aging Neuropsychol Cogn. 2016;23(5):513–29. [DOI] [PubMed] [Google Scholar]

[r9] [9].Levine SZ, Rabinowitz J, Rizopoulos D. Recommendations to improve the positive and negative syndrome scale (PANSS) based on item response theory. Psychiatry Res. 2011;188(3):446–52. [DOI] [PubMed] [Google Scholar]

[r10] [10].Wilson JE, Niu K, Nicolson SE, Levine SZ, Heckers S. The diagnostic criteria and structure of catatonia. Schizophr Res. 2015;164(1–3):256–62. [DOI] [PubMed] [Google Scholar]

[r11] [11].Levine SZ, Leucht S. Psychometric analysis in support of shortening the scale for the assessment of negative symptoms. Eur Neuropsychopharmacol. 2013;23(9):1051–6. [DOI] [PubMed] [Google Scholar]

[r12] [12].McGrory S, Doherty JM, Austin EJ, Starr JM, Shenkin SD. Item response theory analysis of cognitive tests in people with dementia: a systematic review. BMC Psychiatry. 2014;14:47. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r13] [13].Benge JF, Balsis S, Geraci L, Massman PJ, Doody RS. How well do the ADAS-cog and its subscales measure cognitive dysfunction in Alzheimer’s disease? Dement Geriatr Cogn Disord. 2009;28(1):63–9. [DOI] [PubMed] [Google Scholar]

[r14] [14].Homma A, Takeda M, Imai Y, Udaka F, Hasegawa K, Kameyama M, et al. Clinical efficacy and safety of donepezil on cognitive and global function in patients with Alzheimer’s disease. A 24-week, multicenter, double-blind, placebo-controlled study in Japan. E2020 Study Group. Dement Geriatr Cogn Disord. 2000;11(6):299–313. [DOI] [PubMed] [Google Scholar]

[r15] [15].Rogers SL, Friedhoff LT. The efficacy and safety of donepezil in patients with Alzheimer’s disease: results of a US multicentre, randomized, double-blind, placebo-controlled trial. The donepezil study group. Dementia. 1996;7(6):293–303. [DOI] [PubMed] [Google Scholar]

[r16] [16].Rogers SL, Doody RS, Mohs RC, Friedhoff LT. Donepezil improves cognition and global function in Alzheimer disease: a 15-week, double-blind, placebo-controlled study. Donepezil Study Group. Arch Intern Med. 1998;158(9):1021–31. [DOI] [PubMed] [Google Scholar]

[r17] [17].Rogers SL, Farlow MR, Doody RS, Mohs R, Friedhoff LT. A 24-week, double-blind, placebo-controlled trial of donepezil in patients with Alzheimer’s disease. Donepezil Study Group. Neurology. 1998;50(1):136–45. [DOI] [PubMed] [Google Scholar]

[r18] [18].Burns A, Rossor M, Hecker J, Gauthier S, Petit H, Moller HJ, et al. The effects of donepezil in Alzheimer’s disease - results from a multinational trial. Dement Geriatr Cogn Disord. 1999;10(3):237–44. [DOI] [PubMed] [Google Scholar]

[r19] [19].Mohs RC, Cohen L. Alzheimer’s disease assessment scale (ADAS). Psychopharmacol Bull. 1988;24(4):627–8. [PubMed] [Google Scholar]

[r20] [20].Kueper JK, Speechley M, Montero-Odasso M. The Alzheimer’s disease assessment scale-cognitive subscale (ADAS-Cog): modifications and responsiveness in pre-dementia populations. A narrative review. J Alzheimers Dis. 2018;63(2):423–44. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r21] [21].Samejima F. Estimation of latent ability using a response pattern of graded scores. Psychometrika Mon Sup. 1969;34:1–97. [Google Scholar]

[r22] [22].Rizopoulos D. ltm: An R package for latent variable modelling and item response theory analyses. J Stat Software. 2006;17(5):1–25. [Google Scholar]

[r23] [23].Velthorst E, Levine SZ, Henquet C, de Haan L, van Os J, Myin-Germeys I, et al. To cut a short test even shorter: reliability and validity of a brief assessment of intellectual ability in schizophrenia--a control-case family study. Cogn Neuropsychiatry. 2013;18(6):574–93. [DOI] [PubMed] [Google Scholar]

[r24] [24].Baker F. The basics of item response theory. University of Maryland College Park, MD: ERIC Clearinghouse on Assessment and Evaluation; 2001.

[r25] [25].Hedeker DR, Gibbons RD. Longitudinal data analysis. Hoboken, NJ: Wiley-Interscience; 2006. [Google Scholar]

[r26] [26].Malmivaara A. Generalizability of findings from randomized controlled trials is limited in the leading general medical journals. J Clin Epidemiol. 2019;107:36–41. [DOI] [PubMed] [Google Scholar]

[r27] [27].Canevelli M, Bruno G, Vanacore N, de Lena C, Cesari M. Are we really tackling the “evidence-based medicine issue” in Alzheimer’s disease? Eur J Intern Med. 2016;35:e29–e30. [DOI] [PubMed] [Google Scholar]

[r28] [28].Jack CR Jr., Bennett DA, Blennow K, Carrillo MC, Dunn B, Haeberlein SB, et al. NIA-AA research framework: toward a biological definition of Alzheimer’s disease. Alzheimers Dement. 2018;14(4):535–62. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r29] [29].Frisoni GB, Boccardi M, Barkhof F, Blennow K, Cappa S, Chiotis K, et al. Strategic roadmap for an early diagnosis of Alzheimer’s disease based on biomarkers. Lancet Neurol. 2017;16(8):661–76. [DOI] [PubMed] [Google Scholar]

[r30] [30].American Psychiatric Association. Diagnostic and statistical manual of mental disorders, fifth edition (DSM-5). 5th ed. Arlington, VA: American Psychiatric Association; 2013. [Google Scholar]

[r31] [31].McKhann G, Drachman D, Folstein M, Katzman R, Price D, Stadlan EM. Clinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA work group under the auspices of department of health and human services task force on Alzheimer’s disease. Neurology. 1984;34(7):939–44. [DOI] [PubMed] [Google Scholar]

PERMALINK

Shortening the Alzheimer’s disease assessment scale cognitive subscale

Stephen Z Levine

Yair Goldberg

Anat Rotstein

Myrto Samara

Kazufumi Yoshida

Andrea Cipriani

Takeshi Iwatsubo

Stefan Leucht

Toshiaki A Furukawa

Abstract

Background

Methods

Results

Conclusions

Introduction

Methods

Participants

Study design

Measures

Analytic plan

Table 1.

IRT of the ADAS-Cog at baseline

Mixed models to assess treatment effects

Results

Trial characteristics

IRT analysis: Tasks discriminating cognitive impairment levels

Table 2.

IRT analysis: ADAS-cog information ascertained at different cognitive impairment levels

Figure 1.

IRT analysis: Response options

The ADAS-Cog IRT short-scale scoring key

Mixed models

Figure 2.

Discussion

Limitations and conclusions

Supporting information

Acknowledgments

Supplementary material

Author contribution

Financial support

Competing interest

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases