Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Feb 1.
Published in final edited form as: Clin Neuropsychol. 2023 Jun 22;38(2):453–470. doi: 10.1080/13854046.2023.2223389

A comparison of story-recall metrics to predict hippocampal volume in older adults with and without cognitive impairment

Ainara Jauregi Zinkunegi 1,*, Davide Bruno 1, Tobey J Betthauser 2,3, Rebecca Langhough Koscik 2,3,4, Sanjay Asthana 2,4,5, Nathaniel A Chin 2,3, Bruce P Hermann 4,6, Sterling C Johnson 2,3,4,5, Kimberly D Mueller 3,4,7
PMCID: PMC10739621  NIHMSID: NIHMS1917932  PMID: 37349970

Abstract

Objective:

Process-based scores of episodic memory tests, such as the recency ratio (Rr), have been found to compare favourably to, or to be better than, most conventional or “traditional” scores employed to estimate memory ability in older individuals (Bock et al., 2021; Bruno et al., 2019). We explored the relationship between process-based scores and hippocampal volume in older adults, while comparing process-based to traditional story recall-derived scores, to examine potential differences in their predictive abilities.

Methods:

We analysed data from 355 participants extracted from the WRAP and WADRC databases, who were classified as cognitively unimpaired, or exhibited mild cognitive impairment (MCI) or dementia. Story Recall was measured with the Logical Memory Test (LMT) from the Weschler Memory Scale Revised, collected within twelve months of the magnetic resonance imaging scan. Linear regression analyses were conducted with left or right hippocampal volume (HV) as outcomes separately, and with Rr, Total ratio, Immediate LMT, or Delayed LMT scores as predictors, along with covariates.

Results:

Higher Rr and Tr scores significantly predicted lower left and right HV, while Tr showed the best model fit of all, as indicated by AIC. Traditional scores, Immediate LMT and Delayed LMT, were significantly associated with left and right HV, but were outperformed by both process-based scores for left HV, and by Tr for right HV.

Conclusions:

Current findings show the direct relationship between hippocampal volume and all the LMT scores examined here, and that process-based scores outperform traditional scores as markers of hippocampal volume.

Keywords: Recency ratio, hippocampus, Alzheimer’s disease, story recall, logical memory

INTRODUCTION

In research studies and clinical settings, performance on common neuropsychological tests is often calculated by adding the number of correct responses into a total score, which is then interpreted using cut-off points from standard normative data. However, this quantitative or “traditional” scoring and interpretation method does not provide much information on the cognitive functions engaged during the test or the strategies used by patients to obtain the scores (Grant & Adams, 2009), or which of these factors may be contributing to an impaired score (Blanco-Campal et al., 2019). To overcome these limitations, a qualitative or “process-based” approach, also known as the Boston Process Approach (BPA), was first developed by Kaplan (1988). This approach focuses on the strategies used and errors committed during the test (Libon et al., 2022), and can be implemented with procedural modifications of the tasks, such as time limits or adding new components, or by modifying the calculation of their scores (for a review, see Milberg et al., 2009). An example of process approach applied to episodic memory tests (e.g., list-learning and story recall tasks) is analysis of serial position, where primacy, recency, or clustering effects can be analysed in addition to “traditional” scores (Bruno et al., 2013; Diaz-Orueta et al., 2018; Grant & Adams, 2009 Talamonti et al., 2021).

Serial position effects are found when early items (primacy) and late items (recency) on a list are remembered better than items in the middle (Murdock, 1962). Individuals with Alzheimer’s disease (AD) have been reported to show intact or exaggerated immediate memory recall of items learned at the end of a list (recency items, e.g., Foldi et al., 2003), while performing poorly when recalling the same items after a delay (Carlesimo et al., 1995). To leverage this discrepancy in recency performance, Bruno et al. (2016; 2018) proposed a process-based score, the recency ratio (Rr), which is calculated by dividing the recency scores in the immediate recall trial by the corresponding scores in the delayed recall trial. Higher Rr scores, showing disproportionate loss of recency recall from immediate to delayed testing, indicate more recency forgetting and, consequently, more overall risk of cognitive impairment (Bruno et al., 2022). Studies using Rr from list-learning tasks have shown that higher (i.e., worse) scores predict cognitive decline (Bruno et al., 2016), early mild cognitive impairment (MCI; Bruno et al., 2018; Egeland, 2021), and amyloid-β pathology in individuals with MCI (Bruno et al., 2019). Rr scores have also been found to accurately discriminate between individuals diagnosed with AD and other types of dementia, such as frontotemporal, Lewy body and vascular dementias (Turchetta et al., 2018), and to correctly classify MCI patients who are more likely to convert to AD (Turchetta et al., 2020). Bruno et al. (2021a) found Rr to be sensitive to the levels of cerebrospinal fluid (CSF) Neurogranin, a post-synaptic protein reported to reflect synaptic dysfunction in AD and MCI, in individuals with major depressive disorder. Furthermore, higher Rr scores have been found to be associated with increased CSF levels of p-tau and t-tau in older adults with and without cognitive impairment, while total and delayed recall scores were not linked with any of the AD biomarkers (Bruno et al., 2022). Overall, Rr tends to compare favourably to, or perform better than, most conventional or “traditional” scores employed to estimate memory ability in older individuals (Bock et al., 2021; Bruno et al., 2019).

In principle, serial position effects can be examined in any common neuropsychological test of recall, however, until recently, these have been studied primarily in list-learning tasks. In contrast to list-learning tasks, story recall tasks require participants to learn coherent stories comprised of items of varying semantic and lexical categories. These stories must also be recalled immediately and after a delay. Although story recall tasks, such as the Logical Memory test (LMT; Wechsler, 1987), are very commonly used cognitive tools in research and clinical practice, little is known about serial position effects within these tasks compared to list-learning tasks. Only recently, a study by Bruno et al. (2023) found Rr to be applicable to story recall and to be sensitive to CSF levels of amyloid beta 1–40 (Aβ40), p-tau, t-tau, Neurogranin and alpha synuclein—overall outperforming traditional LMT scores, but more evidence that Rr works in story recall tasks is needed.

It has been suggested that higher Rr scores could be due to a combination of reduction in long-term retention, possibly as a result of diminished consolidation ability, paired with a compensatory increased reliance upon short-term memory processing (Bruno et al., 2018). The medial-temporal lobe (MTL) in general, and hippocampus in particular, are essential in the formation and consolidation of long-term memory (Wixted, 2004; Wixted & Cai, 2013). Considering that early neuropathological changes in AD occur within the MTL, especially in the entorhinal cortex and hippocampus (Weiner et al., 2015), it could be argued that higher Rr scores in AD might be associated with changes in these areas (Turchetta et al., 2020). Additionally, verbal episodic memory performance has been shown to be associated with left, rather than right, hippocampal volume (Ezzati et al., 2016; Shi et al.,2009; Travis et al., 2014) and greater left hippocampal reductions, compared to right hippocampal reductions, have been reported in individuals with AD (Dhikav et al., 2016; Ezzati et al., 2016; Li et al., 2016; Müller et al., 2005; Shi et al., 2009; Thompson et al., 2003; 2007; Wicking et al., 2014).Thus, Rr might be sensitive to hippocampal health, and more specifically, to that of the left hippocampus.

To be certain that any possible effects are due to recency forgetting and not total forgetting, we propose a relatively novel process-based score, the Total ratio (Tr) which indices forgetting independently of recency (Bruno et al., 2021b; Bruno et al., 2023). Tr is obtained by dividing the immediate recall trial scores by the delayed recall trial scores, where higher Tr scores reflect more total forgetting. Bruno et al. (2023) found Rr to be more sensitive to CSF biomarkers of neurodegeneration than Tr, but we know little about how Tr fairs, compared to Rr, with different AD biomarker outcomes.

The aim of the current study was to explore the relationship between process-based scores, Rr and Tr, and hippocampal volume in older adults with and without cognitive impairment. We examined this relationship using memory scores collected within twelve months of the MRI scan, while also comparing process-based to traditional story recall-derived memory scores, by analysing data from both WRAP and WADRC samples. We predicted that Rr and Tr from story recall would be negatively associated with hippocampal volume, and that Rr and Tr would be a better predictor of hippocampal volume than traditional LMT and composite scores.

METHODS

Participants

Data were extracted from the Wisconsin Alzheimer’s Disease Research Center (WADRC) and the Wisconsin Registry for Alzheimer’s Prevention (WRAP) studies. Participants were selected based on having completed one T1-weighted structural magnetic resonance imaging (MRI) scan, and one cognitive screening visit within twelve months of the MRI scan, including complete Logical Memory Test (LMT) story recall data, in either WRAP or WADRC. Exclusion criteria for both studies included major neurologic disorder (e.g., head trauma with loss of consciousness, seizures, or neoplasms), current (within the previous 12 months) major psychiatric disorders, or any other significant medical illness. From the total pool of 2,498 participants, 355 participants fulfilled the above criteria: 351 were native English speakers; four participants (1.13%) reported their race as American Indian or Native American, one (0.28%) as Asian, nine (2.54%) as Black, African American, or mixed, 340 (95.78%) as non-Hispanic White or White, and one (0.28%) as Spanish or Hispanic.

Participants were classified as either cognitively unimpaired, with MCI due to presumed AD (MCI-AD), or with dementia due to presumed AD (Dementia-AD), via multi-disciplinary consensus conference review that was blind to AD biomarkers statuses. In WRAP, a two-tiered consensus conference approach was used (for details, see Johnson et al., 2018; Langhough Koscik et al., 2021). For both WRAP and ADRC, cognitive statuses were determined by teams that included physicians, clinical neuropsychologists, and clinical nurse practitioners, and based on core clinical criteria developed by the National Institute on Aging and the Alzheimer’s Association (Albert et al., 2011; McKhann et al., 2011). Among the 355 participants included in the study, 282 individuals were cognitively unimpaired, 39 had a diagnosis of MCI-AD, and 34 had a diagnosis of Dementia-AD. All activities for this study were approved by the institutional review board of the University of Wisconsin–Madison and completed in accordance with the Helsinki Declaration. All participants provided informed consent prior to testing.

Structural MRI

MRI images were acquired using two identical GE 3.0 Tesla MR750 scanners (Waukesha, WI, USA) with an 8-channel head coil (Excite HD Brain Coil; GE Healthcare) in one scanning session. T1-weighted brain volumes were acquired in the axial plane with a 3-D inversion-recovery prepared fast spoiled gradient-echo sequence using the following parameters: inversion time (TI) = 450 ms; repetition time (TR) = 8.2 ms; echo time (TE) = 3.2 ms; flip angle = 12°; acquisition matrix = 256 × 256 × 156 mm; field of view (FOV) = 256 mm; slice thickness = 1.0 mm. Cushions inside the head coil helped reduce head movement during scanning. All MRI scans were read by an experienced clinical neuroradiologist, who excluded participants from further analyses due to structural abnormalities, if required (see other exclusion criteria above).

During pre-processing, T1-weighted volumes were segmented into tissue classes (grey matter, white matter, and cerebrospinal fluid) using Statistical Parametric Mapping (SPM), Version 12 (https://www.fil.ion.ucl.ac.uk/spm). Hippocampal volume was estimated using FSL-FIRST (Patenaude et al., 2011) and total intracranial volume was determined using the reverse brain mask method in SPM, Version 12. See Table 1 for elapsed times between MRI scan and neuropsychological assessment in each diagnostic group.

Table 1.

Demographics, hippocampal volumes, and memory tests scores for the study participants. Variables are reported either as N(%) for categorical variables or mean(sd) for continuous variables, and as adjusted mean(sd) from ANCOVA for LMT scores in each cognitive status.

Characteristic Total
(N=355)
CU (N=282) MCI-AD (N=39) Dementia-AD (N=34) p Value Post-hoc (p < .05)
Sex (female) 225 (71.8%) 200 (70.9%) 12 (30.8%) 13 (38.2%) <.001
Sample (WRAP) 230 (64.8%) 224 (79.4%) 6 (15.4%) 0 (0%) <.001
Age at MRI scan (years) 65.0 (9.4) 62.6 (7.8) 74.8 (7.0) 73.7 (11.7) <.001 CU<MCI
CU<Dem
Education (years) 16.1 (2.8) 16.3 (2.7) 16.2 (3.1) 14.5 (2.9) .002 Dem<CU
Dem<MCI
Elapsed time (months) 4.9 (3.7) 5.6 (3.7) 3.0 (2.9) 1.5 (1.1) <.001 Dem<CU
MCI<CU
APOE risk score 1.3 (0.8) 1.3 (0.8) 1.4 (0.8) 1.8 (1.0) .001 CU<Dem
TIV (mL) 1471.9 (141.5) 1460.4 (139.3) 1542.9 (148.9) 1485.7 (129.3) .002 CU<MCI
Left HV (mL) 3638.8 (528.2) 3770.7 (452.5) 3168.1 (479.8) 3084.1 (506.6) <.001 MCI<CU
Dem<CU
Right HV (mL) 3844.5 (542.0) 3955.4 (477.5) 3497.9 (522.3) 3322.1 (606.0) <.001 MCI<CU
Dem<CU
Recency ratio 1.1 (0.5) 1.0 (0.0) 1.4 (0.1) 2.0 (0.1) <.001 CU<MCI
CU<Dem
MCI<Dem
Total ratio 1.4 (1.0) 1.1 (0.1) 2.4 (0.2) 2.7 (0.2) <.001 CU<MCI
CU<Dem
Immediate LMT 12.7 (4.8) 14.1 (0.2) 9.2 (0.6) 5.8 (0.7) <.001 MCI<CU
Dem<CU
Dem<MCI
Delayed LMT 11.3 (5.3) 13.0 (0.2) 6.1 (0.6) 2.9 (0.7) <.001 MCI<CU
Dem<CU
Dem<MCI

Note: CU = cognitively unimpaired; MCI-AD = MCI due to presumed AD; Dementia-AD or Dem = dementia due to presumed AD; LMT = logical memory test; HV = Hippocampal volume; TIV = total intracranial volume; Elapsed time = time elapsed between MRI scan and neuropsychological assessment. P values for the omnibus test or chi-square test.

Cognitive assessment

The Logical Memory Test

The LMT was used to assess learning and memory performance. The LMT is a subtest of the Weschler Memory Scale Revised (WMS-R; Wechsler, 1987) which comprises two stories, A and B, with 25 items each (“idea units”), of different semantic and lexical categories. Each story is read aloud to the participant and then the participant is asked to recall both stories immediately and again after a 25–30-minute delay. Scoring procedures from the WMS-R manual were applied. Although the scoring criteria permits some alteration from the original item (e.g., “slid off the table” is allowed instead of “fell off the table”), certain items must be recalled verbatim such as numerical expressions or proper names.

In the ADRC sample, story recall was measured with story A of the LMT, as story B was not administered, while in the WRAP sample, story recall was measured with story A and B of the LMT. Regardless of the number of stories tested, because scores from story A and B are averaged, LMT scores from WRAP are computed by averaging story A and B, whereas LMT scores from WADRC are based on one story, resulting in the same number of scores across the WRAP and WADRC samples. Immediate LMT and Delayed LMT recall scores were calculated by adding all the correctly recalled items in the immediate recall trial and delayed recall trial, respectively, and were analysed as raw scores, as in previous studies examining traditional and process-based LMT measures (Bruno et al., 2021b; 2023). Possible scores for Immediate and Delayed Recall trials range from 0 to 25 for each, where higher scores reflect more items being recalled. Immediate LMT and Delayed LMT scores were calculated individually from the closest visit to MRI scan data, if collected within twelve months of the scan.

Process-based measures of the LMT: Recency ratio and Total ratio.

Recency was defined as the final eight items of the story (Bruno et al., 2021b) and immediate and delayed recency scores were calculated as the number of correctly recalled recency items in immediate and delayed recall trials, respectively. Rr was obtained by dividing the recency scores in the immediate recall trial by the corresponding scores in the delayed recall trial. A correction also was applied ((immediate recency score + 1) / (delayed recency score + 1)) to avoid missing data due to zero scores (Bruno et al., 2018), possible Rr scores range from 1 to 9, where higher scores reflect more recency forgetting.

To provide a non-recency-based Rr analogue that would account for memory loss, we also computed a ratio score with Immediate LMT and Delayed LMT ((Immediate LMT + 1) / (Delayed LMT + 1)), the Total ratio (Tr; see also Bruno et al., 2021b). Possible Tr scores range from 1 to 26, where higher scores reflect more total forgetting. As with traditional story recall-derived memory scores, Rr and Tr scores were calculated individually from the closest visit to MRI scan data, if collected within twelve months of the scan.

Global cognitive composite score

A composite score analogous to the Preclinical Alzheimer’s Cognitive Composite 4 (PACC4; for details, see Donohue et al., 2014), based on available tests, was included as a measure of global cognitive functioning, for comparison. In the WRAP study, this composite score was calculated using the average of standardised tests scores of the following tests: total scores for the Logical Memory II subtest (i.e., delayed recall of stories A and B) from the Wechsler Memory Scale—Revised (WMS-R; Wechsler, 1987), total scores from the Digit Substitution test of the Wechsler Abbreviated Intelligence Scale-Revised (WAIS-R; Wechsler, 1981), total recall learning trials 1–5 from the Rey Auditory Verbal Learning Test (RAVLT; Schmidt et al., 1996), and the total score from the Mini-Mental Status Examination (MMSE; Folstein et al., 1975); for more details on how the score was calculated, see Jonaitis et al. (2019). In the WRAP sample, PACC4-analogue scores were collected at the same cognitive screening visit as LMT scores and were also calculated individually; we did not have PACC4-analogue scores for the WADRC sample.

Genotyping

The APOE e4 allele is considered the most important genetic risk factor of AD (Coon et al., 2007; Hobel et al., 2019) and thus, genetic risk was accounted for by calculating an APOE risk score based on the odds ratios of the e2/e3/e4 genotype, as previously reported (Darst et al., 2017). DNA was extracted from whole blood and samples were aliquoted on 96-well plates for determination of APOE genotypes. The APOE risk score was included as a covariate in all the regression analyses.

Statistical Analysis

An analysis of variance (ANOVA) was carried out to compare age, time elapsed between MRI scan and memory assessment (calculated as an absolute value of months), years of education, APOE risk score, total intracranial volume (TIV), left hippocampal volume, and right hippocampal volume among participants, classified by cognitive status (cognitively unimpaired, MCI or dementia). When any of these comparisons was significant, a post-hoc paired comparison was conducted, by using Tukey’s honest significant difference (HSD) test, to account for multiple testing. Differences in sex and sample (WRAP or WADRC) were assessed with a Pearson Chi-square test (p < 0.05). To test for differences in LMT scores, an analysis of covariance (ANCOVA) was conducted by adding age, gender, sample, elapsed time between MRI scan and neuropsychological assessment, years of education and APOE risk score as covariates; post-hoc comparisons were also adjusted using the Tukey’s HSD test. See Table 1 for sample details, reported for the whole sample and by cognitive status.

To understand how correlated the LMT scores were, we ran bivariate Spearman’s rank-order correlations between Rr, Tr, Immediate LMT, and Delayed LMT, as these scores were not normally distributed. Bivariate Spearman’s rank-order correlations were also conducted between left and right hippocampal volume, to explore how associated their size was. Partial correlations, controlling for age, gender, sample, elapsed time between MRI scan and neuropsychological assessment, years of education and APOE risk score, were used to explore the relationship between the LMT scores and left or right hippocampal volume. Steiger’s Z tests were conducted on partial correlation coefficients between any two significant predictors, to determine if the strength of one association between one memory score and left or right HV outcome was stronger than the association between another memory score and the same HV outcome, by using a calculator (http://quantpsy.org; Lee et al., 2013).

Linear regression analyses were conducted with Rr, Tr, Immediate LMT, and Delayed LMT as predictors (in separate models); sex, age at MRI scan, elapsed time between MRI scan and memory assessment (calculated as an absolute value), years of education, sample (WADRC or WRAP), and APOE risk score were used as control variables (covariates). Right or left hippocampal volume (HV) divided by total intracranial volume (TIV) represented the outcomes in separate analyses. We adjusted for multiple testing using a false discovery rate-based approach (FDR; Benjamini & Hochberg, 1995) for the four predictors, corrected across left and right HV. To determine which LMT score is the best predictor of left and right HV, we compared AIC fit statistics (Aiken at al., 1991) across otherwise parallel models, lower AIC values indicate a better fit, and a model with a delta-AIC (i.e., the difference between the two AIC values being compared) greater than 2 is considered significantly better than the model it is being compared to (Burnham & Anderson, 2004).

As an additional analysis, to compare the predictive abilities of process-based scores with a standardised measure of global cognition, separate regression analyses were carried out with left or right HV (divided by TIV) as outcome, Rr, Tr, and PACC4-analogue scores as predictors. The same covariates as above were included, except for sample (WADRC or WRAP), since the analyses were conducted only in participants from WRAP (N = 232; see Global cognitive composite score above). We adjusted for multiple testing using FDR, and AIC fit statistics were compared to determine which model showed the best model fit, as above. Analyses were performed with SPSS, Version 27 (IBM).

RESULTS

In Table 1, means and standard deviations are described for all the variables included in the current study, for the whole sample and by cognitive status closest to MRI scan. All the variables tested showed significant differences between the cognitive status groups. Specifically, post-hoc pairwise analysis revealed that participants with a worse cognitive status were older, with less years of education, and had less left and right HV than participants who were cognitively unimpaired. Mean Rr, Tr, Immediate LMT and Delayed LMT scores were also significantly different between groups, post-hoc analysis indicating the worse the cognitive status was, the higher the Rr and Tr values, and the lower the Immediate LMT and Delayed LMT scores were. In Supplementary materials, we report pairwise plots of left vs. right HV (Table S1), and of left or right HV vs. age by cognitive status (Tables S2.1 and S2.2, respectively).

Bivariate Spearman’s rank-order correlations showed that Rr, Tr, Immediate LMT, and Delayed LMT scores significantly correlated between them, whilst left and right HV were also significantly and positively associated. Partial correlations indicated all memory scores significantly correlated with left and right HV, see Table 2 for bivariate and partial correlation coefficients. Steiger’s Z-tests showed that for left HV, the association with Rr was significantly stronger than with Immediate LMT (Z = −5.42; p < .001) or Delayed LMT (Z = −5.99; p < .001), but not than with Tr (Z = 0.62; p = .267), whereas the association with Tr was significantly stronger than with Immediate LMT (Z = −5.75; p < .001) or Delayed LMT (Z = −6.19; p < .001); the association with Delayed LMT was stronger than with Immediate LMT (Z = 3.47; p < .001). For right HV, the association with Rr was significantly stronger than with Immediate LMT (Z = −4.18; p < .001), but not than with Delayed LMT (Z = −4.46; p < .001) or Tr (Z = 0.65; p = .258), whereas the association with Tr was significantly stronger than with Immediate LMT (Z = −4.53; p < .001) or Delayed LMT (Z = −4.71; p < .001); the association with Delayed LMT was stronger than with Immediate LMT (Z = 2.16; p = 0.015).

Table 2.

Bivariate and partial correlations between memory scores, between left and right hippocampal volume, and between memory score and left or right hippocampal volume. Rr = recency ratio; Tr = total ratio; LMT = logical memory test; HV = hippocampal volume; TIV = total intracranial volume

Rra Tra Immediate LMTa Delayed LMTa Left HV by TIVb Right HV by TIVb c
Rr 1 0.503** −0.162* −0.310** 0.270** −0.186**
Tr 1 −0.290** −0.572** 0.297** 0.215**
Immediate LMT 1 0.929** 0.168* 0.156*
Delayed LMT 1 0.251** 0.208**
Left HV 1 0.733**
Right HV 1

Note: N = 355.

a

Bivariate Spearman’s rank-order correlations between Rr, Tr, Immediate LMT and Delayed LMT.

b

Partial correlations, controlling for age, gender, sample, elapsed time between MRI scan and neuropsychological assessment, years of education and APOE risk score, between memory scores and left or right HV.

c

Bivariate Spearman’s rank-order correlations between left and right HV corrected for TIV.

*

p < .05;

**

p < .01;

***

p < .001. Bold: indicates the stronger partial coefficients in the column, when comparing the strength of the associations between each memory score and left or right HV, as per Steiger’s Z-test.

Linear regression analyses with right HV divided by TIV as outcome, showed that the separate model fits with either Rr, Total ratio, Immediate LMT, or Delayed LMT were significant; as were their coefficients, see Table 3 for details. The AIC fit statistics across the four models with right HV as outcome were compared and showed that the Total ratio and Delayed LMT models had the lowest AIC, as shown in Table 3. Delta-AIC between the Total ratio model and either the Rr or Immediate LMT models were greater than two, indicating the Total ratio model was significantly better. The model with Delayed LMT also showed a delta-AIC difference greater than two with either the Rr or Immediate LMT models, revealing the Delayed LMT model was significantly better; yet the difference was less than two AIC values between the Total ratio and Delayed LMT models.

Table 3.

Linear regression models predicting left or right hippocampal volume, corrected for total intracranial volume.

Predictor Outcome Recency ratio Model1 Total ratio Model2 Immediate LMT Model3 Delayed LMT Model4
β p β p β p β p
Age at MRI scan Left HV −0.34 <.001 −0.33 <.001 −0.31 <.001 −0.29 <.001
Right HV −0.31 <.001 −0.30 <.001 −0.28 <.001 −0.26 <.001
Gender Left HV 0.30 <.001 0.31 <.001 0.30 <.001 0.28 <.001
Right HV 0.26 <.001 0.27 <.001 0.25 <.001 0.24 <.001
Education Left HV 0.07 .114 0.07 .105 0.05 .264 0.04 .343
Right HV 0.02 .613 0.02 .598 0.01 .923 0.00 .993
APOE risk score Left HV −0.08 .063 −0.08 .078 −0.06 .162 −0.05 .241
Right HV −0.04 .363 −0.04 .407 −0.02 .620 −0.02 .730
Elapsed time Left HV −0.05 .318 −0.07 .188 −0.05 .305 −0.06 .256
Right HV −0.06 .258 −0.07 .174 −0.06 .240 −0.07 .210
Sample Left HV 0.07 .226 0.08 .128 0.05 .379 0.03 .580
Right HV 0.13 .022 0.14 .012 0.11 .085 0.10 .111
LMT score Left HV −0.23 <.001* −0.25 <.001* 0.18 .002* 0.26 <.001*
Right HV −0.17 <.001* −0.19 <.001* 0.17 .004* 0.23 <.001*
AIC Left HV −837.50 843.34 −820.72 −833.68
Right HV −818.49 822.83 −814.68 821.69
Adjusted R 2 Left HV .370 .380 .339 .363
Right HV .293 .302 .286 .300

Note: N = 355. β = Standardised regression coefficient; LMT = logical memory test; HV = Hippocampal volume; Elapsed time = time elapsed between MRI scan and neuropsychological assessment; p = p-value;

*

adjusted p-value for multiple comparisons. In all models, variables showed a Variation Inflation Factor (VIF) < 2; AIC = Akaike Information Criterion; Adjusted R2 = adjusted proportion of explained variance. Bold: model with significantly best model fit, as per delta-AIC. Models with left HV by TIV as outcome:

1

Model with Rr: F(7,347) = 30.644 p < .001;

2

Model with Tr: F(7,347) = 31.973 p < .001;

3

Model with Immediate LMT: F(7,347) = 26.940 p < .001;

4

Model with Delayed LMT: F(7,347) = 29.784 p < .001. With right HV by TIV as outcome:

1

Model with Rr: F(7,347) = 22.006 p < .001;

2

Model with Tr: F(7,347) = 22.888 p < .001;

3

Model with Immediate LMT: F(7,347) = 21.244 p < .001;

4

Model with Delayed LMT: F(7,347) = 22.655 p < .001.

With left HV divided by TIV as outcome, separate model fits with either Rr, Total ratio, Immediate LMT, or Delayed LMT were significant; as were their coefficients, see Table 3 for details. The AIC fit statistics across the four models with left HIV by TIV as outcome were compared and showed that the Total ratio model had the lowest AIC, followed by the Rr model, as shown in Table 3. Delta-AIC between the Total ratio model and either the Rr, Immediate LMT, or Delayed LMT models were greater than two, indicating the Total ratio model was significantly better. Delta-AICs between the Rr model and the models with either of the traditional scores were also greater than two, showing that the Rr model fit was significantly better than that of the Immediate LMT or Delayed LMT models for left HV.

As a secondary analysis, we carried out linear regression analyses to compare the model fits of process-based measures of the LMT, Rr and Tr, to that of a standardised composite score of global cognition. With PACC4-analogue as predictor, linear regression analyses showed that the model fit for left HV divided by TIV as outcome was significant (F(6,225) = 7.077, p < .001), as was for right HV divided by TIV (F(6,225) = 5.342, p < .001), yet PACC4-analogue coefficients were not significant for left HV (t = 1.082, adjusted-p = .337, β = .073) or right HV (t = 0.914, adjusted-p = .362, β = .063). The AIC values of the PACC4-analogue models were 85.457 for left HV and 87.719 for right HV.

With process-based scores of story recall, Rr or Tr, as predictors in WRAP participants only, the model fits were significant for both left HV by TIV (Rr, F(6,225) = 8.382, p < .001; Tr, F(6,225) = 8.994, p < .001) and right HV by TIV (Rr, F(6,225) = 5.541, p < .001; Tr, F(6,225) = 6.550, p < .001). The Rr coefficient was significant for left HV (t = −2.791, adjusted-p = .016, β = −.169), but not for right HV (t = −1.373, adjusted-p = .257, β = −.086), while the Tr coefficient was significant for both left HV (t = −3.301, adjusted-p = .006, β = −.199) and right HV (t = −2.684, adjusted-p = .016, β = −.166). The AIC values of the Rr model for left HV was 78.762 and 86.664 for right HV, the Tr model had an AIC value of 75.686 for left HV, 81.268 for right HV. Delta-AIC between the Total ratio model and either the Recency ratio or PACC4-analogue models were greater than two, for left and right HV, indicating the Total ratio model was significantly better, while delta-AIC was only significantly between Rr and PACC4-analogue for left HV.

DISCUSSION

In this study, we explored the relationship between process-based measures of story recall, i.e., Rr and Total ratio, and hippocampal volume in older adults with and without cognitive impairment. We also compared these measures to traditional scoring procedures for the same neuropsychological test. We hypothesised that process-based scores from story recall would be negatively associated with hippocampal volume, and that these scores would be a better predictor of hippocampal volume than traditional LMT and standardised composite scores, while controlling for demographics, elapsed time between MRI scan and neuropsychological assessment, sample, and APOE risk. As hypothesised, significant associations were found between story recall-derived memory scores and hippocampal volume. Although traditional and process-based scores were significantly associated with hippocampal volume, Steiger’s Z-tests indicated Rr and Total ratio outperformed traditional scores for left HV, while Total ratio outperformed traditional scores for right HV. Furthermore, the model with Total ratio had significantly better fit than the models with Rr, Immediate LMT, or Delayed LMT, when predicting left HV; whereas the models with either Total ratio or Delayed LMT showed the best fit for right HV, as indicated by AIC values. Altogether, current findings showed that tracking forgetting in LMT, as opposed to simply measuring total recall performance, is a better option when predicting hippocampal volume.

Higher Rr scores have been suggested to be the consequence of a reduction in long-term memory, due to a loss of consolidation ability, with increased reliance on a more intact ability to retain verbal information in the short-term memory (Bruno et al., 2018; Turchetta et al., 2020). The hippocampus is one of the first structures to atrophy in AD (Weiner et. al., 2015). It is essential for the formation of long-term memory and for synaptic consolidation. Specifically, hippocampal volume reductions are known to occur before converting to AD and are closely related to cognitive decline (Weiner et al., 2015). Previous studies reported hippocampal atrophy to be a useful biomarker for early detection of amnestic MCI and AD (Chincarini et al., 2011; Devanand et al., 2012; Mondragon et al., 2016), for distinguishing patients, such as AD or MCI from controls (Chupin et al., 2009; Karow et al., 2010), or those who will convert from MCI to AD (Wolz et al., 2010). Therefore, it could be argued that finding predictive markers of hippocampal atrophy is crucial for the early detection of cognitive decline (Weiner at al., 2015). Our findings show that Rr applied to story recall is sensitive to hippocampal integrity, as higher Rr scores predicted lower left HV, outperforming Immediate and Delayed LMT scores, as shown by Steiger’s Z-tests, while also having a significantly better model fit than traditional scores for left HV, as indicated by AIC fit statistics.

To be certain that the effects observed here were due to recency forgetting and not to total forgetting, a non-recency based Rr analogue was computed that accounted for memory loss using a ratio score with Immediate LMT and Delayed LMT, the Total ratio. Our findings demonstrated that this novel process-based measure was sensitive to hippocampal volume, as higher Total ratio scores predicted lower left HV, outperforming Immediate and Delayed LMT scores, as shown by Steiger’s Z-tests. Furthermore, the model with Total ratio not only showed a significantly better fit than models with traditional scores, but it was also significantly better that the Rr model in predicting right and left HV. These findings suggest that the effects observed here, especially for left HV, could be due to total forgetting as opposed to recency forgetting. Current findings are in contrast with what Bruno et al. (2023) observed when comparing Rr to Tr as predictors of CSF biomarkers, who reported that Rr outperformed Tr in predicting biomarker outcomes associated with neurodegeneration. It is possible therefore that Tr and Rr might each be better under different circumstances, but more research is needed to determine whether they represent complementary measures. Process-based scores from story recall alone are not intended to serve a diagnostic purpose per se, yet we believe these can be useful tools for clinicians, considering their association with volumetric changes of the hippocampus, and thus, its implications of cognitive decline, in a simple and accessible way.

Against expectations, when evaluating right HV, the models with either Total ratio or Delayed LMT showed the best model fits, with no significant difference between them, as indicated by their AIC values. Typically, verbal episodic memory performance, which was assessed in this analysis, is associated with left, rather than right, hippocampal volume (see Ezzati et al., 2016; Hardcastle et al., 2020). Additionally, greater left hippocampal reductions, compared to right hippocampal reductions, have been reported in individuals with AD (Dhikav et al., 2016; Ezzati et al., 2016; Li et al., 2016; Müller et al., 2005; Shi et al., 2009; Thompson et al., 2003; 2007; Wicking et al., 2014). Therefore, it could be argued that story-recall scores displaying sensitivity to left hippocampal volume may be more helpful, for screening and diagnostic purposes, than scores more sensitive to right hippocampal volume. It is also possible that story recall involves other cognitive abilities, such as narrative coherence, which has been reported to engage the right hippocampus (Cohn-Sheehy et al., 2021). Nevertheless, this empirical finding should be investigated in future research.

In the current study, all LMT scores were analysed as raw scores, as in previous studies examining traditional and process-based LMT measures (Bruno et al., 2021b; 2023). However, composite scores from averaged standardised tests scores, such as the Preclinical Alzheimer Cognitive Composite 4 (PACC4; for details, see Methods section and Donohue et al., 2014), are more likely to be used in clinical settings. Although composite scores have shown reduced variability and stronger associations with amyloid burden-related cognitive changes than raw scores in cognitively unimpaired individuals (Bransby et al., 2019; Jonaitis et al., 2019), the regression models with raw process-based scores showed a significantly better fit than the PACC4-analogue model when predicting left and right hippocampal volume. Specifically, the model with Total ratio was significantly better than the model with PACC4-analogue scores for both left and right HV, while the Rr model was significantly better for left HV. Current findings suggest that process-based scores derived from story recall offer an advantage over standardised composite scores when predicting hippocampal atrophy.

Considering that, as described in Table 1, most cognitively unimpaired participants were female (71%), whilst among the cognitively impaired, most were males (69% of MCI-AD; Dementia-AD, 62%), differences in sex distribution across consensus diagnoses were explored post hoc. Even though a detailed examination of sex-related differences is beyond the scope of the current study, potential differences in LMT scores across sexes were examined with both parametric and non-parametric tests. The post hoc analyses showed that in cognitive unimpaired participants, Delayed LMT scores were significantly higher in females than in males, yet no significant differences were observed in Immediate LMT, Rr or Tr scores, while in cognitively impaired participants, none of the scores significantly differed across sexes. Possible interactions between each LMT score and sex were also examined in regression analyses. We found that none of the interaction terms significantly predicted left or right HV, indicating that sex did not moderate the associations between LMT scores and hippocampal volume in participants with or without cognitive impairment.

A potential limitation of the present paper is the difference in the number of stories to be recalled, as in the WADRC sample, only story A of the LMT was tested, while stories A and B were tested in the WRAP sample. To examine if results differed in each sample, we carried out separate analyses in WADRC (N = 123) and WRAP (N = 232) participants, controlling for the same covariates except sample, and correcting for multiple comparisons across left and right HV (for details, see Table S3 in Supplementary materials). In WADRC only, all LMT scores were significantly associated with left and right HV, except for Immediate LMT, whereas in WRAP only, all LMT scores were significantly associated with left and right HV, except for Rr, which was no longer a significant predictor of right HV. It appears that using a single story varied the predictive ability of Immediate LMT scores, favouring the use of both stories, while Rr scores varied for right HV only, when using both stories. However, considering that the WRAP sample had almost twice the number of participants than WADRC, these results should be taken with caution.

Another limitation is that the sample size for cognitively unimpaired participants is significantly larger than for MCI and AD, this, however, was determined by availability; yet future studies would benefit from larger sample sizes of individuals with MCI, AD, and other dementia pathologies. The samples also consisted mostly of non-Hispanic White participants. Evidence from previous studies indicates that ethnic groups show differences in brain morphology, such as hippocampal volume (DeCarli et al., 2008), white matter hyperintensity volume (Brickman et al., 2008; Divers et al., 2013), and total cerebral brain volume (Stavitsky et al., 2010). Thus, current findings need to be replicated in a more ethnically diverse sample to confirm its generalizability.

Another possible issue is that all the evidence collected so far in support of the utility of Rr and Tr as process scores in story recall derives from a single database (WADRC; Bruno et al., 2023). While this is a drawback, in great part borne out of the novelty of these examinations, we believe it emphasises the need for further research, including in clinical settings, to explore whether these metrics are effective also beyond WRAP and ADRC.

Finally, it should be noted that practice effects were not addressed in the regression models, which could result in skewed scores for some participants. To check whether results differed in participants for whom the LMT score closest to MRI scan was collected in their first cognitive assessment visit, and thus, were unaffected by practice, the same regression analyses were carried out in these participants only (N = 124). Results indicated that all LMT scores, except Immediate LMT, were significantly associated with left and right hippocampal volume, suggesting that only Immediate recall might be affected by practice, while the model with Total ratio showed the best fit of all (for details, see Table S4 in Supplementary materials).

To summarise, the current results demonstrated the direct relationship between story recall-derived memory scores and hippocampal volume. Higher Rr scores, reflecting disproportionate recency recall in the immediate test, were associated with lower left hippocampal volume when controlling for covariates (demographics, elapsed time between MRI scan and neuropsychological assessment, sample, APOE risk score). Higher Total ratio scores, which accounted for total memory loss, significantly predicted lower left hippocampal volume, when controlling for the same covariates, and showed significantly better model fit than any other model, as per AIC fit statistics; suggesting the effects observed here could be due to total forgetting as opposed to recency forgetting. Immediate LMT and Delayed LMT scores, when adjusting for the same covariates, were significantly associated with hippocampal volume, specifically, the model with Delayed LMT showed the best fit along with the model with Total ratio for right HV; yet traditional scores were outperformed by Rr and Total ratio for left HV, and by Total ratio for right HV. Altogether, these findings showed that tracking forgetting in LMT, as opposed to simply measuring total recall performance, is a better option when predicting hippocampal volume in older adults with and without cognitive impairment.

Supplementary Material

Supp 1

Acknowledgements:

We would like to thank the participants, the staff, and researchers at the University of Wisconsin ADRC, it is because of them that this work is possible. WADRC is funded by P50 AG033514. This secondary analysis of ADRC data was funded by a NIH-NIA (R01 144 AAI8612) grant to KDM, in which DB and RL are co-investigators. Additional funding sources that made this work possible include: R01 AG054059; R01 AG027161; R01 AG021155; R01 AG054047.

Footnotes

Disclosure statement:

The authors report there are no competing interests to declare.

Data availability statement:

WRAP and WADRC data can be requested here: https://wrap.wisc.edu/data-requests/.

REFERENCES

  1. Albert MS, DeKosky ST, Dickson D, Dubois B, Feldman HH, Fox NC, … & Phelps CH (2011). The diagnosis of mild cognitive impairment due to Alzheimer’s disease: recommendations from the National Institute on Aging‐Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer’s & dementia, 7(3), 270–279. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Apostolova LG, Morra JH, Green AE, Hwang KS, Avedissian C, Woo E, … & Alzheimer’s Disease Neuroimaging Initiative. (2010). Automated 3D mapping of baseline and 12-month associations between three verbal memory measures and hippocampal atrophy in 490 ADNI subjects. Neuroimage, 51(1), 488–499. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Benjamini Y, & Hochberg Y (1995). Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal statistical society: series B (Methodological), 57(1), 289–300. [Google Scholar]
  4. Blanco-Campal A, Diaz-Orueta U, Navarro-Prados AB, Burke T, Libon DJ, & Lamar M (2021). Features and psychometric properties of the Montreal Cognitive Assessment: Review and proposal of a process-based approach version (MoCA-PA). Applied Neuropsychology: Adult, 28(6), 658–672. [DOI] [PubMed] [Google Scholar]
  5. Bock JR, Russell J, Hara J, & Fortier D (2021). Optimizing Cognitive Assessment Outcome Measures for Alzheimer’s Disease by Matching Wordlist Memory Test Features to Scoring Methodology. Frontiers in Digital Health, 3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bransby L, Lim YY, Ames D, Fowler C, Roberston J, Harrington K, … & Maruff P (2019). Sensitivity of a Preclinical Alzheimer’s Cognitive Composite (PACC) to amyloid β load in preclinical Alzheimer’s disease. Journal of clinical and experimental neuropsychology, 41(6), 591–600. [DOI] [PubMed] [Google Scholar]
  7. Brickman AM, Schupf N, Manly JJ, Luchsinger JA, Andrews H, Tang MX, … & Brown TR (2008). Brain morphology in older African Americans, Caribbean Hispanics, and whites from northern Manhattan. Archives of neurology, 65(8), 1053–1061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bruno D, Reiss PT, Petkova E, Sidtis JJ, & Pomara N (2013). Decreased recall of primacy words predicts cognitive decline. Archives of clinical neuropsychology, 28(2), 95–103. [DOI] [PubMed] [Google Scholar]
  9. Bruno D, Reichert C, & Pomara N (2016). The recency ratio as an index of cognitive performance and decline in elderly individuals. Journal of Clinical and Experimental Neuropsychology, 38(9), 967–973. [DOI] [PubMed] [Google Scholar]
  10. Bruno D, Koscik RL, Woodard JL, Pomara N, & Johnson SC (2018). The recency ratio as predictor of early MCI. International psychogeriatrics, 30(12), 1883–1888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bruno D, Gleason CE, Koscik RL, Pomara N, Zetterberg H, Blennow K, & Johnson SC (2019). The recency ratio is related to CSF amyloid beta 1‐42 levels in MCI‐AD. International journal of geriatric psychiatry, 34(3), 415–419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Bruno D, Reichert Plaska C, Clark DP, Zetterberg H, Blennow K, Verbeek MM, & Pomara N (2021a). CSF α-synuclein correlates with CSF neurogranin in late-life depression. International Journal of Neuroscience, 131(4), 357–361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bruno D, Mueller KD, Betthauser T, Chin N, Engelman CD, Christian B, … & Johnson SC (2021b). Serial position effects in the Logical Memory Test: Loss of primacy predicts amyloid positivity. Journal of Neuropsychology, 15(3), 448–461. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bruno D, Jauregi A, Pomara N, Zetterberg H, Blennow K, Koscik R, … & Mueller K (2022). Cross-sectional associations of CSF tau levels with Rey’s AVLT: A recency ratio study. Neuropsychology. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Bruno D, Zinkunegi AJ, Kollmorgen G, Suridjan I, Wild N, Carlsson C, Bendlin B, Okonkwo O, Chin N, Hermann BP, Asthana S, Zetterberg H, Blennow K, Langhough R, Johnson SC, & Mueller KD (2023). The recency ratio assessed by story recall is associated with cerebrospinal fluid levels of neurodegeneration biomarkers. Cortex, 159, 167–174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Carlesimo GA, Sabbadini M, Fadda L, & Caltagirone C (1995). Different components in word-list forgetting of pure amnesics, degenerative demented and healthy subjects. Cortex, 31(4), 735–745. [DOI] [PubMed] [Google Scholar]
  17. Carmichael O, Xie J, Fletcher E, Singh B, DeCarli C, & Alzheimer’s Disease Neuroimaging Initiative. (2012). Localized hippocampus measures are associated with Alzheimer pathology and cognition independent of total hippocampal volume. Neurobiology of aging, 33(6), 1124–e31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Chincarini A, Bosco P, Calvini P, Gemme G, Esposito M, Olivieri C, … & Alzheimer’s Disease Neuroimaging Initiative. (2011). Local MRI analysis approach in the diagnosis of early and prodromal Alzheimer’s disease. Neuroimage, 58(2), 469–480. [DOI] [PubMed] [Google Scholar]
  19. Chupin M, Gérardin E, Cuingnet R, Boutet C, Lemieux L, Lehéricy S, … & Colliot O (2009). Fully automatic hippocampus segmentation and classification in Alzheimer’s disease and mild cognitive impairment applied on data from ADNI. Hippocampus, 19(6), 579–587. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Cohn-Sheehy BI, Delarazan AI, Reagh ZM, Crivelli-Decker JE, Kim K, Barnett AJ, … & Ranganath C (2021). The hippocampus constructs narrative memories across distant events. Current Biology, 31(22), 4935–4945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Coon KD, Myers AJ, Craig DW, Webster JA, Pearson JV, Lince DH, … & Stephan DA (2007). A high-density whole-genome association study reveals that APOE is the major susceptibility gene for sporadic late-onset Alzheimer’s disease. The Journal of clinical psychiatry, 68(4), 8183. [DOI] [PubMed] [Google Scholar]
  22. Darst BF, Koscik RL, Racine AM, Oh JM, Krause RA, Carlsson CM, … & Engelman CD (2017). Pathway-specific polygenic risk scores as predictors of amyloid-β deposition and cognitive function in a sample at increased risk for Alzheimer’s disease. Journal of Alzheimer’s Disease, 55(2), 473–484. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. DeCarli C, Reed BR, Jagust WJ, Martinez O, Ortega M, & Mungas D (2008). Brain behavior relationships amongst African Americans, caucasians and Hispanics. Alzheimer disease and associated disorders, 22(4), 382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Devanand DP, Bansal R, Liu J, Hao X, Pradhaban G, & Peterson BS (2012). MRI hippocampal and entorhinal cortex mapping in predicting conversion to Alzheimer’s disease. Neuroimage, 60(3), 1622–1629. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Dhikav V, Duraisamy S, Anand KS, & Garga UC (2016). Hippocampal volumes among older Indian adults: Comparison with Alzheimer’s disease and mild cognitive impairment. Annals of Indian Academy of Neurology, 19(2), 195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Díaz-Orueta U, Blanco-Campal A, & Burke T (2018). Rapid review of cognitive screening instruments in MCI: proposal for a process-based approach modification of overlapping tasks in select widely used instruments. International psychogeriatrics, 30(5), 663–672. [DOI] [PubMed] [Google Scholar]
  27. Divers J, Hugenschmidt C, Sink KM, Williamson JD, Ge Y, Smith SC, … & Freedman BI (2013). Cerebral white matter hyperintensity in African Americans and European Americans with type 2 diabetes. Journal of Stroke and Cerebrovascular Diseases, 22(7), e46–e52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Donohue MC, Sperling RA, Salmon DP, Rentz DM, Raman R, Thomas RG, … & Aisen PS (2014). The preclinical Alzheimer cognitive composite: measuring amyloid-related decline. JAMA neurology, 71(8), 961–970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Egeland J (2021). Following HN over 21 years: recency change and reduced retention predict later impairment in memory, and recency ratio may combine both effects. Neurocase, 27(2), 147–154. [DOI] [PubMed] [Google Scholar]
  30. Ezzati A, Katz MJ, Zammit AR, Lipton ML, Zimmerman ME, Sliwinski MJ, & Lipton RB (2016). Differential association of left and right hippocampal volumes with verbal episodic and spatial memory in older adults. Neuropsychologia, 93, 380–385. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Foldi NS, Brickman AM, Schaefer LA, & Knutelska ME (2003). Distinct serial position profiles and neuropsychological measures differentiate late life depression from normal aging and Alzheimer’s disease. Psychiatry research, 120(1), 71–84. [DOI] [PubMed] [Google Scholar]
  32. Folstein MF, Folstein SE, & McHugh PR (1975). “Mini-mental state”: a practical method for grading the cognitive state of patients for the clinician. Journal of psychiatric research, 12(3), 189–198. [DOI] [PubMed] [Google Scholar]
  33. Gicas KM, Honer WG, Wilson RS, Boyle PA, Leurgans SE, Schneider JA, & Bennett DA (2020). Association of serial position scores on memory tests and hippocampal-related neuropathologic outcomes. Neurology, 95(24), e3303–e3312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Grant IE, & Adams KM (2009). Neuropsychological assessment of neuropsychiatric and neuromedical disorders.
  35. Hall S, & Bornstein RA (1991). Serial-position effects in paragraph recall following mild closed-head injury. Perceptual and motor skills, 72(3_suppl), 1295–1298. [DOI] [PubMed] [Google Scholar]
  36. Hardcastle C, O’Shea A, Kraft JN, Albizu A, Evangelista ND, Hausman HK, … & Woods AJ (2020). Contributions of hippocampal volume to cognition in healthy older adults. Frontiers in aging neuroscience, 12, 593833. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Hobel Z, Isenberg AL, Raghupathy D, Mack W, Pa J, & Alzheimer’s Disease Neuroimaging Initiative. (2019). APOE ɛ4 gene dose and sex effects on Alzheimer’s disease MRI biomarkers in older adults with mild cognitive impairment. Journal of Alzheimer’s Disease, 71(2), 647–658. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Johnson SC, Koscik RL, Jonaitis EM, Clark LR, Mueller KD, Berman SE, … & Sager MA (2018). The Wisconsin Registry for Alzheimer’s Prevention: a review of findings and current directions. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring, 10, 130–142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Jonaitis EM, Koscik RL, Clark LR, Ma Y, Betthauser TJ, Berman SE, … & Johnson SC (2019). Measuring longitudinal cognition: individual tests versus composites. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring, 11, 74–84. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Karow DS, McEvoy LK, Fennema-Notestine C, Hagler DJ Jr, Jennings RG, Brewer JB, … & Alzheimer’s Disease Neuroimaging Initiative. (2010). Relative capability of MR imaging and FDG PET to depict changes associated with prodromal and early Alzheimer disease. Radiology, 256(3), 932–942. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. La Rue A, Hermann B, Jones JE, Johnson S, Asthana S, & Sager MA (2008). Effect of parental family history of Alzheimer’s disease on serial position profiles. Alzheimer’s & Dementia, 4(4), 285–290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Lee IA, & Preacher KJ (2013, September). Calculation for the test of the difference between two dependent correlations with one variable in common [Computer software]. Available from http://quantpsy.org.
  43. Li X, Li D, Li Q, Li Y, Li K, Li S, & Han Y (2016). Hippocampal subfield volumetry in patients with subcortical vascular mild cognitive impairment. Scientific reports, 6(1), 20873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Libon DJ, Swenson R, Lamar M, Price CC, Baliga G, Pascual-Leone A, … & Andersen SL (2022). The Boston Process Approach and Digital Neuropsychological Assessment: Past Research and Future Directions. Journal of Alzheimer’s Disease, 87(4),1419–1432. [DOI] [PubMed] [Google Scholar]
  45. McKhann GM, Knopman DS, Chertkow H, Hyman BT, Jack CR Jr, Kawas CH, … & Phelps CH (2011). The diagnosis of dementia due to Alzheimer’s disease: Recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer’s & dementia, 7(3), 263–269. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Milberg WP, Hebben N, Kaplan E, Grant I, & Adams K (2009). The Boston process approach to neuropsychological assessment. Neuropsychological assessment of neuropsychiatric and neuromedical disorders, 3, 42–65. [Google Scholar]
  47. Mondragón JD, Celada-Borja C, Barinagarrementeria-Aldatz F, Burgos-Jaramillo M, & Barragán-Campos HM (2016). Hippocampal volumetry as a biomarker for dementia in people with low education. Dementia and geriatric cognitive disorders extra, 6(3), 486–499. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Müller MJ, Greverus D, Dellani PR, Weibrich C, Wille PR, Scheurich A, … & Fellgiebel A (2005). Functional implications of hippocampal volume and diffusivity in mild cognitive impairment. Neuroimage, 28(4), 1033–1042. [DOI] [PubMed] [Google Scholar]
  49. Mueller KD, Koscik RL, Du L, Bruno D, Jonaitis EM, Koscik AZ, … & Johnson SC (2020). Proper names from story recall are associated with beta-amyloid in cognitively unimpaired adults at risk for Alzheimer’s disease. cortex, 131, 137–150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Mueller KD, Du L, Bruno D, Betthauser T, Christian B, Johnson S, … & Koscik RL (2022). Item-level story recall predictors of amyloid-beta in late middle-aged adults at increased risk for Alzheimer’s disease. Frontiers in Psychology, 3553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Murdock BB Jr (1962). The serial position effect of free recall. Journal of experimental psychology, 64(5), 482. [Google Scholar]
  52. Patenaude B, Smith SM, Kennedy DN, & Jenkinson M (2011). A Bayesian model of shape and appearance for subcortical brain segmentation. Neuroimage, 56(3), 907–922. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Petersen RC, Aisen PS, Beckett LA, Donohue MC, Gamst AC, Harvey DJ, … & Weiner MW (2010). Alzheimer’s disease neuroimaging initiative (ADNI): clinical characterization. Neurology, 74(3), 201–209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Schmidt M (1996). Rey auditory verbal learning test: A handbook (Vol. 17). Los Angeles, CA: Western Psychological Services. [Google Scholar]
  55. Shi F, Liu B, Zhou Y, Yu C, & Jiang T (2009). Hippocampal volume and asymmetry in mild cognitive impairment and Alzheimer’s disease: Meta‐analyses of MRI studies. Hippocampus, 19(11), 1055–1064. [DOI] [PubMed] [Google Scholar]
  56. Stavitsky K, Du Y, Seichepine D, Laudate TM, Beiser A, Seshadri S, … & Au R (2010). White matter hyperintensity and cognitive functioning in the racial and ethnic minority cohort of the Framingham Heart Study. Neuroepidemiology, 35(2), 117–122. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Steiger JH (1980). Tests for comparing elements of a correlation matrix. Psychological bulletin, 87(2), 245. [Google Scholar]
  58. Talamonti D, Koscik R, Johnson S, & Bruno D (2020). Predicting early mild cognitive impairment with free recall: The primacy of primacy. Archives of Clinical Neuropsychology, 35(2), 133–142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Talamonti D, Koscik R, Johnson S, & Bruno D (2021). Temporal contiguity and ageing: The role of memory organization in cognitive decline. Journal of Neuropsychology, 15, 53–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Tatsumi IF, & Watanabe M (2009). Verbal memory. In Encyclopedia of neuroscience (pp. 4176–4178). Springer, Berlin, Heidelberg. [Google Scholar]
  61. Thompson PM, Hayashi KM, De Zubicaray G, Janke AL, Rose SE, Semple J, … & Toga AW (2003). Dynamics of gray matter loss in Alzheimer’s disease. Journal of neuroscience, 23(3), 994–1005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Thompson PM, Hayashi KM, Dutton RA, CHIANG MC, Leow AD, Sowell ER, … & Toga AW (2007). Tracking Alzheimer’s disease. Annals of the New York Academy of Sciences, 1097(1), 183–214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Turchetta CS, Perri R, Fadda L, Caruso G, De Simone MS, Caltagirone C, & Carlesimo GA (2018). Forgetting rate on the recency portion of a word list differentiates mild to moderate Alzheimer’s disease from other forms of dementi. Journal of Alzheimer’s Disease, 66(2), 461–470. [DOI] [PubMed] [Google Scholar]
  64. Turchetta CS, De Simone MS, Perri R, Fadda L, Caruso G, De Tollis M, … & Carlesimo GA (2020). Forgetting Rates on the Recency Portion of a Word List Predict Conversion from Mild Cognitive Impairment to Alzheimer’s Disease. Journal of Alzheimer’s Disease, 73(4), 1295–1304. [DOI] [PubMed] [Google Scholar]
  65. Weiner MW, Veitch DP, Aisen PS, Beckett LA, Cairns NJ, Cedarbaum J, … & Alzheimer’s Disease Neuroimaging Initiative. (2015). 2014 Update of the Alzheimer’s Disease Neuroimaging Initiative: a review of papers published since its inception. Alzheimer’s & dementia, 11(6), e1–e120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Wechsler D (1981). Wechsler adult intelligence scale-revised (WAIS-R). Psychological Corporation. [Google Scholar]
  67. Wechsler D (1987). WMS-R: Wechsler memory scale-revised. Psychological Corporation. [Google Scholar]
  68. Wicking M, Nees F, & Steiger F (2014). Neuropsychological measures of hippocampal function. The hippocampus in clinical neuroscience, 34, 60–70. [DOI] [PubMed] [Google Scholar]
  69. Wixted JT (2004). The psychology and neuroscience of forgetting. Annu. Rev. Psychol, 55, 235–269. [DOI] [PubMed] [Google Scholar]
  70. Wixted JT, & Cai DJ (2013). Memory consolidation. Oxf. Handb. Cogn. Neurosci, 1, 436–456. [Google Scholar]
  71. Wolz R, Heckemann RA, Aljabar P, Hajnal JV, Hammers A, Lötjönen J, … & Alzheimer’s Disease Neuroimaging Initiative. (2010). Measurement of hippocampal atrophy using 4D graph-cut segmentation: application to ADNI. NeuroImage, 52(1), 109–118. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp 1

Data Availability Statement

WRAP and WADRC data can be requested here: https://wrap.wisc.edu/data-requests/.

RESOURCES