Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Mar 5.
Published in final edited form as: Mem Cognit. 2011 Nov;39(8):1423–1434. doi: 10.3758/s13421-011-0119-7

Working Memory in the Oldest-Old: Evidence from Output Serial Position Curves

Emily M Elliott 1, Katie E Cherry 2, Jennifer Silva Brown 3, Emily A Smitherman 4, S Michal Jazwinski 5, Qingzhao Yu 6, Julia Volaufova 7; for the Louisiana Healthy Aging Study
PMCID: PMC3588592  NIHMSID: NIHMS437413  PMID: 21674282

Abstract

We examined adult age differences in short-term and working memory performance in middle-aged (45–64 years), young-old (65–74 years), old-old (75–89 years) and oldest-old adults (90 years and over) in the Louisiana Healthy Aging Study. Previous research suggests that measures of working memory are more sensitive to age effects than simple tests of short-term memory (Bopp & Verhaeghen, 2005; Myerson, Emery, White, & Hale, 2003). To test this hypothesis, we examined output serial position curves of recall data from three span tasks: forward and backward digit span and size judgment span. Participants’ recall patterns in the size judgment span task revealed that the two oldest groups of adults showed the largest decreases in recall performance across output serial positions, but did not differ significantly from each other. Correlation analyses indicated the strongest negative correlation with age occurred with the size judgment span task. Implications of these findings for understanding strategic processing abilities in late life are discussed.

Keywords: Working memory, strategic processing, serial output position curves, oldest-old adults


In the year 2009, the number of people in the United States aged 65 or older accounted for 13% of the population (or 39 million people). This number is projected to grow to 20% of the population by the year 2050 (or 88.5 million; US Census, 2009). Recent demographic trends also indicate major increases in the segment of the population whose age falls into the category of “oldest-old,” yet research into the cognitive capabilities of individuals is limited (Bäckman, Small, Wahlin, & Larsson, 2000). Focusing research efforts on the oldest segment of the adult population is therefore an important challenge for psychology.

In recent years, there has been increasing interest in the role of working memory (WM) as a critical component of adult cognition (Babcock & Salthouse, 1990; Bopp & Verhaeghen, 2005). WM is considered an active portion of the memory system; it is “a temporary storage system under attentional control that underpins our capacity for complex thought” (Baddeley, 2007, p. 1). A number of important cognitive functions have been linked to WM processes. For example, the link between WM and reading comprehension is well-established (Daneman & Carpenter, 1980), as well as the relationship of WM and general fluid intelligence (Salthouse & Pink, 2008; Gray, Chabris, & Braver, 2003). Given the importance of WM to cognitive functioning, age-related differences have been investigated to improve our understanding of the causes of decline in WM performance (Craik & Bialystock, 2006; Dobbs & Rule, 1989). The current study continued in this line of research by conducting an in-depth investigation of adult age differences in three measures of memory span, and extended previous research by including a large number of participants who fall into the age category of oldest-old (90 + years of age).

Researchers investigating the causes of age-related differences in WM have proposed several views, including single mechanism views (such as processing speed declines, e.g., Salthouse, 1996) and multiple mechanism views (such as dynamic contextual control and output forgetting, e.g. Maylor, Vousden, & Brown, 1999). The work ofMaylor et al. (1999) was interpreted within the broader framework of a frontal-function hypothesis, given the large role of the frontal cortex in controlling behavior and its links to WM performance (Engle, 2001). Older adults may be more likely to suffer from changes in functioning of the frontal lobes, which leads to decreased executive functioning abilities (Myerson, Emery, White, & Hale, 2003). This change may also be interpreted in terms of the increased cognitive load that results from tasks requiring the coordination of processing and storage (see Barrouillet, Bernardin, & Camos, 2004, for a discussion of cognitive load).

One matter of theoretical and practical importance concerns the assessment of the constructs of short-term and working memory. In particular, measurement practices have varied, with some researchers using simple span tasks requiring only repetition of items, such as the forward digit span task, and others using more complex span tasks requiring additional processing along with repetition, such as the backward digit span task (see Unsworth & Engle, 2007, for a discussion). Recent research suggests that both forward and backward digit span tasks can be used to effectively assess WM (Shelton, Elliott, Hill, Calamia, & Gouvier, 2009) in a sample of college-aged adults; however, the use of simple and complex span tasks with older adults, especially those older than 90 years of age has not been undertaken to date.

Additionally, the use of serial position information has been central in the development of current theoretical views of working memory (Baddeley, 2007). Researchers examine the performance on each individual item within a list in a memory span task to look for trademark patterns of primacy (improved performance at the beginning of a list) and recency (improved performance at the end of a list). Examining serial position curves in recall can provide additional information about strategic processes that participants may engage in during a task. Very little research exists examining output serial order information in older adults (e.g., Golomb, Peelle, Addis, Kahana, & Wingfield, 2008; Maylor et al., 1999), and even less research exists investigating this important component of memory functioning in the oldest-old. In the Golomb et al. study, younger and older adults’ free and serial recall of uncategorized word lists was examined. They found that older adults (M = 73 years) showed greater declines in serial recall performance than the younger adults (M = 20 years). Follow-up analyses examining the positions of the recalled items, relative to other items in the lists, revealed that older adults were less able to utilize order information. One specific example of this was that the contiguity effect differed by age; in free recall, younger adults tended to recall items together when these items had been originally presented together. Golomb et al.’s detailed analyses of order demonstrated that older adults have greater difficulty utilizing temporal context information, and furthermore, that older adults may try to rely on semantic information even when it is not helpful. However, the older adults were younger than the oldest group in the current sample, and further investigation of these issues with an older group is needed to reveal new information about the use of order information in very late life.

The primary objective of the current research was to provide new evidence on the strategic processing abilities of very old adults. In particular, we examine hypotheses regarding age sensitivity in three different span tasks; we used both forward and backward digit span tasks, and the size judgment span task (SJS; Cherry, Elliott, & Reese, 2007) to evaluate WM performance. In the SJS task, participants heard items spoken aloud, and were asked to recall them in order of physical size, from smallest to largest. The items included were everyday, concrete objects and animal names that were chosen for the ease with which a visual image could be created, thus allowing participants of varying educational and socioeconomic backgrounds to participate in the task (see also Cherry & Park, 1993). Important to consider within the SJS task was that, different from many WM measures that include both storage and processing concurrently, the processing demands most likely occurred after the items were been presented. Although it was possible that participants could try to encode new items while concurrently beginning the process of reordering them, this process would be increasingly difficult to manage as list length increased. Thus, particularly for longer list lengths, it seemed reasonable to assume that participants encoded the list of items, and then engaged in a process of reordering them prior to beginning recall. This process is similar to other tasks involving reordering of items, such as backward digit span or Letter-Number Sequencing tasks that are commonly included in the Wechsler Adult Intelligence Scale versions of assessment (Wechsler, 1997; see also Myerson et al., 2003). This property of the SJS task may place it at an intermediate level of difficulty between simple span tasks and more complex span tasks involving concurrent storage and processing demands (see the meta-analysis by Bopp & Verhaeghen, 2005), which makes it appropriate for use with the oldest-old segment of the population. Furthermore, this reordering of the items may be different from other types of tasks that involve separate item and order information, in which deficits in temporal context have been found with increased age (Hartman & Warren, 2005; Maylor et al, 1999).

The current focus on output serial position effects within both simple repetition and reordering tasks appears to be the first of its kind, especially with the inclusion of a large number of individuals 90 years of age and older. Given the known changes in WM performance over the lifespan, we hypothesized that significant age group differences would emerge, with the largest decreases in performance occurring in the oldest-old group. On the assumption that all three span measures tap into the global construct of WM (see Unsworth, Spillers, & Brewer, 2010; Shelton et al., 2009), this prediction should hold true across the three span measures included here. However, if forward digit span is less resource demanding than the other two span measures (i.e., backward digit span and SJS), then one might expect to observe no significant age differences in the forward digit span task, but age group differences in the other two measures. Alternatively, previous research indicates that older adults have shown declines in temporal context information, and that this type of information may be more sensitive to age than other types of associative memory (Hartman & Warren, 2005). Furthermore, research using the n-back paradigm demonstrated that these declines in temporal context affected performance on the n-back task to a greater degree than probable declines in item information (as assessed through item recognition and responses to lures; McCabe & Hartman, 2008). As mentioned earlier, the demands of the SJS task required that a participant be able to retain item information and then separate this information from the original order information to be able to report the items back in terms of their physical size. Thus, it is possible that older adults would rely upon item information and semantic knowledge, which do not show the same age-related declines as temporal order information, when performing the SJS task. In this case, the differences across the age groups may be less pronounced than in other types of WM tasks (such as complex span tasks; Lustig, May, & Hasher, 2001; McCabe & Hartman, 2003).

The current research investigated adult age differences in memory for temporal order using three tasks, which vary in level of difficulty. The forward digit span task is a simple repetition task, and relies heavily upon temporal order information. The backward digit span task also uses a small, closed set of items, and additionally requires participants to reorder these items. Both of these tasks tap temporal order information without a large contribution from semantic knowledge. If the oldest adults rely upon the combination of semantic and temporal order information more heavily than their younger counterparts, we may find poorer performance by the oldest-old adults in the forward and backward digit span tasks, relative to the SJS task which includes semantic information. Vocabulary level is a factor to consider when discussing the role of semantic knowledge in the SJS task. Vocabulary level, often treated as a proxy for verbal intelligence, might be expected to play a larger role in the SJS task than for the tasks requiring the recall of simple digits. However, vocabulary knowledge does not show large declines across the lifespan (Cherry et al., 2008). We included a measure of vocabulary in the current study to assess the role of verbal intelligence in the performance of the span tasks.

Our second objective was to examine recall performance on the span tasks using multiple methods of scoring which account for overall levels of performance and output serial position performance. We expected that overall performance levels would differ among the age groups, as seen in previous research using span measures and the SJS (Bopp & Verhaeghen, 2005; Cherry & Park, 1993; Cherry et al., 2008); however, the interaction of type of span measure and age group was of specific interest. Furthermore, detailed analyses were conducted using output serial position in these recall tasks, and age-related differences were evaluated using output serial position as the dependent measure (see also Maylor et al., 1999). Based on prior research, we expected to observe a significant interaction of age group with output serial position within each span measure. The comparison of the three types of span tasks in this population provided an opportunity to examine the hypothesis that larger effects of aging would emerge in the more complex tasks, relative to the simplest task (the forward digit span task; see Bopp & Verhaeghen, 2005; Myerson et al., 2003).

Method

Participants

Participants were drawn from the Louisiana Healthy Aging Study (LHAS), an ongoing multidisciplinary study of the determinants of longevity and healthy aging conducted in collaboration with Louisiana State University (LSU) Agricultural and Mechanical College, LSU Health Sciences Center in New Orleans, Tulane University, the University of Alabama at Birmingham, and the Pennington Biomedical Research Center in Baton Rouge, LA. LHAS participants are sampled randomly from the Voters Registration 2000 files for those age 20 to 64 years old and from the Medicare Beneficiary Enrollment Data file of the Center for Medicare and Medicaid Services (CMS) for those age 65 years older and above for the eight parishes (counties) constituting the Greater Baton Rouge community. Participants included 48 middle aged (45–59 years), 40 young old (60–74 years), 40 old-old (75–89 years) and 90 of the oldest-old (90+ years), for a total of 218 participants. All scored at least a 25 on the Mini-Mental State Exam (MMSE; Folstein, Folstein, & McHugh, 1975) and were free of neurologic impairment due to stroke or other trauma to the brain at the time of testing (see Table 1). Although participants completed a battery of tests, the current focus is on the forward and backward digit span tasks and the SJS task (see Cherry et al., 2008, for more details on the procedure). Participants in this study were compared on level of education, using a collapsed version of educational attainment that reduced the categories to two: either high school or above, or less than high school. A χ2 test of independence revealed no significant association between educational attainment and age group, χ2 (3)= 3.69, p = 0.29

Table 1.

Descriptive Statistics for All Participants

Mean Range SD Skew Kurtosis Reliability
Age 76.29 45–96 16.07 −.55 −1.25 --
MMSE 28.11 25–30 1.49 −.37 − .83 .98
Vocabulary 23.99 6 – 39 7.74 −.39 − .57 .96
SJS Score 3.96 2.00–6.50 0.78 .38   .85 .82
SJS Proportion 2–6 0.65 0.11–1.00 0.10 −.28   .03 .91
BDS Score 4.12 2.00–7.00 0.94 .62   .23 .75
BDS Proportion 2–7 0.63 0.17–0.98 0.11 .19 −.46 .92
FDS Score 5.69 2.50–8.50 1.00 .10 −.16 .72
FDS Proportion 2–8 0.50 0.19–1.00 0.13 .42 −.61 .91

Note: N = 218; MMSE refers to the Mini-Mental State Exam (Folstein et al., 1975; with the test-retest reliability estimate drawn from the 28-day comparison in the original work); Vocabulary (Jastak & Jastak, 1964, with the split-half reliability estimate drawn from the original work); Reliability indicates Cronbach’s alpha for all other variables. SJS refers to the size-judgment span task, BDS refers to the backward digit span task, and FDS refers to the forward digit span task. “Score” refers to the all-or-none index of performance on the span tasks, while Proportion refers to the proportion of correct items including list lengths 2–6 for the SJS task, 2–7 for the BDS task, and list lengths 2–8 for the FDS task.

Procedure

Participants were administered a number of different measures across two sessions. The measures of interest for the current investigation included a short form of the Wechsler Adult Intelligence Scale Vocabulary subtest (Jastak & Jastak, 1964) and the MMSE (Folstein et al., 1975) for a global indication of cognitive functioning, as well as the primary measures of interest, the forward and backward digit span tasks and the SJS task. Administration for all three tasks was stopped once a participant did not answer any sequences correctly at a given list length, or if the participant requested that the task be stopped.

During the forward digit span task, participants were read aloud lists of digits, and asked to repeat them back in the same order that they were originally presented. Lists began with three digits and two lists of each length were administered, until the length of 9 digits was reached. The backward digit span task was administered in a similar fashion, with the exception that participants were asked to repeat the items in backwards order from the original presentation. Lists began at two items in length, and continued until 8 items, with two lists at each length. In the SJS task, participants were read a list of easily-imagined objects, and were asked to repeat them back in order of physical size, from smallest to largest. For example, in a list length of three items, a participant might hear, “frog, piano, hairpin” and the correct response would be “hairpin, frog, piano”. The administration of the task began with practice sequence, two items in length. The test sequences consisted of list lengths 2–8, with three instances of each length.

Results

The significance level was set to .05 for all analyses. Participants were first compared, using two separate one-way ANOVAs, to assess age-related differences in performance on the vocabulary and MMSE measures. There were no significant age differences on the vocabulary scores (p = 0.09), but there were significant differences on the MMSE, F (3, 214) = 15.84, MSE = 1.84 (see Table 2). This analysis was followed by a Tukey’s HSD using the harmonic mean sample size, to correct for unequal groups. The post-hoc analysis revealed that the oldest-old group had significantly lower MMSE scores than the three other age groups, which did not differ. The oldest-old age group was examined more closely by dividing the participants into groups, based on their total MMSE scores, and calculating the mean of the SJS score for each group. The means were very similar across the range of MMSE scores (see Appendix A) suggesting that even though the oldest-old age group had a lower overall mean MMSE, relative to the other age groups, that performance on the SJS task was not differentially affected across MMSE scores within this age group.

Table 2.

Mean Total MMSE and Vocabulary Scores (and Standard Deviations) by Age Group

Age Group MMSE Vocabulary N
Middle Age 28.79 (1.37) 25.33 (7.78) 48
Young Old 28.78 (1.21) 24.93 (7.79) 40
Old Old 28.23 (1.23) 25.00 (7.15) 40
Oldest Old 27.39 (1.46) 22.40 (7.80) 90

Scoring of the three span tasks was done in two steps, to provide global and specific indices of performance (see Table 1 for descriptive statistics). An overall span score, referred to as the FDS Score, BDS Score, or SJS Score, was calculated by counting the number of list lengths in which perfect recall was achieved, with an additional half a point for list lengths in which at least one list was answered correctly (for example, all three lists at list length 2 and 3, plus one list length correct at list length 4 = span score of 3.5). The specific index of performance, referred to as either FDS, BDS, or SJS Proportion, was calculated by giving credit for each individual item correctly recalled, regardless of whether or not the entire sequence was recalled correctly across the range of possible list lengths (2–9 for FDS, 2–8 for BDS and SJS). This method allowed for a proportion of correct recall to be calculated for each sequence, by dividing the number of correctly recalled items by the total number of items available in that sequence.

Methods of Scoring: Overall Scores

The analyses began with an age group by span measure mixed-model ANOVA to examine the main effects and potential interaction of age group with type of span measure (see Figure 1 for the means and standard errors). This analysis revealed significant main effects of age group [F (3, 214) = 13.33, MSE = .04, ηp2 = .16], span measure [ F (2, 428) = 28.90, MSE = .014, ηp2 = .12], and a significant interaction of these two factors, [F (6, 428) = 2.75, MSE = .014, ηp2 = .04]. To investigate the hypothesis that the age groups will have differential patterns of performance on the three span measures, the significant interaction was followed by one-way ANOVAs for each task separately.

Figure 1.

Figure 1

This figure depicts the average span score for each of the three span measures for each of the four age groups. Error bars represent standard error of the mean. FDS indicates forward digit span, BDS indicates backward digit span, and SJS indicates size judgment span.

A one-way ANOVA on the overall SJS score yielded a significant age group effect, F (3, 214) = 28.78, MSE = .45. A Tukey’s HSD using the harmonic mean sample size to correct for unequal groups revealed that the middle-aged group and the young-old groups did not differ from each other, but that both differed from the old-old and oldest-old groups, which also did not differ from each other. For a detailed look at the differences, the frequency counts of each SJS score, by age group, are presented in Appendix B.

This same analysis was conducted on the overall BDS score, with a significant age group effect, F (3, 214) = 3.31, MSE = .86. The Tukey’s HSD revealed that the means of the four age groups differed only for the two extremes, M = 3.94 for the old-old age group and M = 4.42 for the young-old age group.

Finally, the FDS score was examined in a similar analysis, resulting in a significant age group effect, F (3, 214) = 3.68, MSE = .96. The Tukey’s HSD post-hoc analysis indicated that the youngest age group had a significantly higher mean score (M = 6.06) than the other age groups, which did not differ from each other.

Methods of Scoring: Proportions

Performance across the list lengths was examined using the proportion correct scores as the dependent measure, for the different age groups. First, results from the SJS task will be discussed, followed by the BDS and FDS tasks. Due to floor effects at the longest list lengths, these highest list lengths were removed from the following analyses to make the comparison of the proportion correct measures more comparable to the “score” measures. That is, these higher lists would not be included in the scores unless they were answered correctly; the proportion correct measure can include one correct item within an otherwise completely incorrect list, thus “artificially” lowering the proportion correct scores by including the highest list lengths.

Due to floor effects at list lengths of 7 and 8 in the SJS task, these lists were dropped from the remaining analyses. A mixed-model type ANOVA was conducted on the average proportion correct scores to adjust for the repeated measures on the same subject at different list lengths. The main effects are age group, list length and we also include the age group by list length interaction in the model. The analysis revealed significant main effects of age group [F (3, 214) = 35.79, MSE = .08, ηp2 = .32], list length [ F (4, 856) = 889.76, MSE = .027, ηp2 = .81], and a significant interaction of these two factors, [F (12, 856) = 9.74, MSE = .027, ηp2 = .12]. These significant findings can be described as a decrease in the mean performance scores across both increasing age and increasing list length, with a significant drop in proportion correct scores for the two oldest age groups at the longest list lengths (see Figure 2A).

Figure 2.

Figure 2

A. This figure depicts the average of the proportion correct scores for list lengths 2–6 in the size-judgment span task for each age group. Error bars represent standard error of the mean.

B. This figure depicts the average of proportion correct scores for list lengths 2–7 in the backward digit span task for each age group. Error bars represent standard error of the mean.

C. This figure depicts the average of proportion correct scores for list lengths 2–8 in the forward digit span task for each age group. Error bars represent standard error of the mean.

Similar analyses were conducted for the BDS task, and performance was very poor on the highest list length (list length 8). The following analysis only included list lengths 2–7 (see Figure 2B). The mixed-model type ANOVA revealed significant main effects of age group [F (3, 214) = 2.76, MSE = .16, ηp2 = .04], and list length [ F (5, 1070) = 645.35, MSE = .039, ηp2 = .75]. The interaction of these two factors was marginally significant, p = 0.07. The change in performance across the age groups in the BDS task was similar to that observed in the SJS task. The average BDS score decreased as the age and/or list length increased.

The FDS task also demonstrated floor effects at the longest list length. The current analysis includes only lists 2–8 (see Figure 2C, which begins with list length 3 due to ceiling effects at list length 2). The results for the FDS task were very similar, revealing two significant main effects, list length [F (6, 1284) = 994.47, MSE = .006, ηp2 = .82], and age group [ F (3, 214) = 4.80, MSE = .24, ηp2 = .06], with a significant interaction, [ F (18, 1284) = 1.69, MSE = .006, ηp2 = .02]. The reduction in overall level of difficulty of the FDS task is observed by noting that participants continued to perform well into higher list lengths as compared to the two previous span measures.

Output Serial Position Effects

To examine performance more closely in the SJS task, list length 4 was chosen, as it was not constrained by either ceiling or floor effects for the different age groups (see Figure 2A). The average of performance on each item within list length 4 was the dependent measure in this analysis of output serial position effects (see Figure 3A). An item was scored in the correct output serial position if it was reported in the correct order of physical size, from smallest to largest item. A mixed-model type ANOVA accounting for the repeated measures of SJS over the same subject with the main effects of age group, serial position and age group by serial position interaction was conducted, revealing significant main effects of age group [F (3, 214) = 21.60, MSE = .21, ηp2 = .23], serial position [F (3, 642) = 72.99, MSE = .03, ηp2 = .25], and a significant interaction of these two factors, [F (9, 642) = 6.03, MSE = .03, ηp2 = .08]. This interaction was followed up with a series of repeated-measures ANOVAs separately for each age group. Although each age group revealed a significant main effect of serial position [middle-age, F (3, 141) = 5.31, MSE = .01, ηp2 = .10; young-old, F (3, 117) = 11.54, MSE = .02, ηp2 = .23; old-old, F (3, 117) = 28.71, MSE = .04, ηp2 = .42; oldest-old, F (3, 267) = 49.59, MSE = .04, ηp2 = .36], follow-up pairwise comparisons reveal that positions 1 and 2 differed significantly from positions 3 and 4 in the middle-age and young-old age groups. All positions differed from each other for the old-old age group, and all pairings except for the positions 3 and 4 were different in the oldest-old age group. Thus, the pattern of output serial positions differed among the age groups, with the exception that the two youngest groups of participants displayed the same patterns in their output serial position curves.

Figure 3.

Figure 3

A. This figure depicts the average of the proportion correct scores for output serial positions 1–4 for list length 4 in the SJS task, for each age group. Error bars represent standard error of the mean.

B. This figure depicts the average of the proportion correct scores for output serial positions 1–5 for list length 5 in the backward digit span task, for each age group. Error bars represent standard error of the mean.

A. This figure depicts the average of the proportion correct scores for output serial positions 1–7 for list length 7 in the forward digit span task, for each age group. Error bars represent standard error of the mean.

The backward digit span task was analyzed at list length 5, again, to avoid ceiling or floor effects (see Figure 2B). The serial position effects appear in Figure 3B. A mixed-model type ANOVA revealed a significant main effect of serial position [F (4, 856) = 99.97, MSE = .06, ηp2 = .32], and a significant interaction of the factors of age group and serial position, [F (12, 856) = 1.87, MSE = .06, ηp2 = .03]. However, the main effect of age group was not significant (p = .41). The significant interaction was followed up with a series of repeated-measures ANOVAs separately for each age group. The results of these analyses were significant for each age group [middle-age, F (4, 188) = 17.17, MSE = .05, ηp2 = .27; young-old, F (4, 156) = 20.86, MSE = .06, ηp2 = .35; old-old, F (4, 156) = 18.63, MSE = .05, ηp2 = .32; oldest-old, F (4, 356) = 70.46, MSE = .06, ηp2 = .44]. Pairwise comparisons revealed that each age group showed a similar pattern across the serial positions, with positions 1 and 2 showing much higher levels of performance, and differing significantly from all positions, whereas positions 3, 4, and 5 did not differ from each other. The oldest-old showed the largest differences in performance at the later serial positions where only positions 3 and 5 are not significantly different.

The forward digit span task had overall higher levels of performance, and was thus examined at list length 7 (see Figure 2C). The serial position effects for the forward digit span task are depicted in Figure 3C. As before, a mixed-model type ANOVA was conducted, revealing significant main effects of age group [F (3, 214) = 3.30, MSE = .99, ηp2 = .04], serial position [F (6, 1284) = 30.66, MSE = .04, ηp2 = .13], and a significant interaction of these two factors, [F (18, 1284) = 1.63, MSE = .04, ηp2 = .02]. Repeated-measures ANOVA were conducted separately to examine the interaction in more depth. The current task asked participants to listen to a sequence of digits, and to repeat them back in forward order. Thus, one would expect to see traditional patterns of performance in recall from the auditory modality: primacy (better recall at the beginning of the list) and recency (better recall for the final item in the list) with poorer performance for the items in the middle of the list. Although it is easy to note the primacy effects (see Figure 3C), the recency effects are less clearly observed, especially in the older age groups of participants, and the middle-aged group showed the most traditional serial position curve. This interpretation was supported by the findings from the ANOVAs and the pairwise comparisons. All four age groups demonstrated significant changes across serial positions, [middle-age, F (6, 282) = 16.71, MSE = .04, ηp2 = .26; young-old, F (6, 234) = 6.17, MSE = .05, ηp2 = .14; old-old, F (6, 234) = 3.41, MSE = .05, ηp2 = .08; oldest-old, F (6, 534) = 11.97, MSE = .04, ηp2 = .12]. The pairwise comparisons indicated that all serial positions differed from the first position in the middle-aged group (demonstrating primacy); however in the young-old group only positions 3, 4, and 5 differed significantly from position 1. In contrast, the old-old age group revealed the flattest curve, with no significant differences emerging among the 7 serial positions. Finally, the oldest-old revealed strong primacy effects, with all but position 2 differing significantly from position 1.

Correlation Analyses

A comparison of the Scores and the Proportion measures (including the list lengths indicated previously for each task) was conducted to address the second objective of the study which was to determine which of these methods allowed for a stronger relationship with other indicators of cognitive aging. For this analysis, correlations were conducted with the multiple methods of scoring SJS, BDS, and FDS, the MMSE, and vocabulary (see Table 3). MMSE and vocabulary scores were chosen for inclusion in the correlation analyses as indicators of global cognitive functioning and verbal intelligence, respectively. In the first analysis, age was included as a variable, and in the second, the correlations were conducted with age partialled out. In terms of the level of difficulty hypothesis of the three span tasks, the patterns of correlations suggested that the SJS, with similar outcomes for both scoring measures, was strongly and negatively correlated with age (r’s = − 0.55 and – 0.58 for the SJS score and SJS proportion, respectively). The other two span measures, FDS and BDS, had correlations with age of similar magnitudes to each other, again regardless of scoring. In particular, the r values were − 0.23 and – 0.26 for FDS score and FDS proportion, respectively, whereas the r values, in order, were − 0.21 and – 0.20 for BDS score and BDS proportion. This pattern of correlations would argue against an increasing level of difficulty as one moved from the FDS task, to BDS. It instead suggests that the demands of the SJS task were more strongly related to differences with increased age, and the FDS and BDS shared similar relationships to each other with age.

Table 3.

Correlations Among Measures with Age, and with Age Partialled Out

1 2 3 4 5 6 7 8 9
Age (1)
MMSE (2) −0.41** 0.37 ** 0.19 ** 0.17 * 0.16 * 0.21 ** 0.03 0.00
Vocabulary (3) −0.15 * 0.40 ** 0.26 ** 0.27 ** 0.14 * 0.28 ** 0.28 ** 0.24 **
SJS Score (4) −0.55 ** 0.37 ** 0.30 ** 0.84 ** 0.35 ** 0.38 ** 0.41 ** 0.42 **
SJS Proportion 2–6 (5) −0.58 ** 0.36 ** 0.30 ** 0.87 ** 0.33 ** 0.35 ** 0.37 ** 0.37 **
BDS Score (6) −0.21 ** 0.22 ** 0.17 * 0.40 ** 0.36 ** 0.78 ** 0.38 ** 0.41 **
BDS Proportion 2–7(7) −0.20 ** 0.27 ** 0.30 ** 0.41 ** 0.38 ** 0.79 ** 0.41 ** 0.44 **
FDS Score (8) −0.23 ** 0.12 0.31 ** 0.46 ** 0.42 ** 0.41 ** 0.43 ** 0.93 **
FDS Proportion 2–8 (9) −0.26 ** 0.11 0.27 ** 0.48 ** 0.43 ** 0.44 ** 0.47 ** 0.93 **

Note:

**

represents p < 0.01,

*

represents p < 0.05.

Values above the diagonal, in bold print, represent the correlations with age partialled out.

These relationships were explored by examining the correlations of the span tasks with the MMSE and vocabulary scores as indicators of cognitive functioning. The MMSE total score was strongly correlated with age (r = − 0.41), as were the measures of SJS span (r’s = 0.37 and 0.36 for the SJS score and SJS proportion, respectively). In contrast, FDS was not correlated with MMSE, suggesting that the requirements of these two tasks differed greatly. Furthermore, these correlations went to zero when age was partialled out. The BDS measures shared modest, but significant correlations with the MMSE (r’s = 0.22 and 0.27 for BDS score and BDS proportion, respectively). Finally, the relationship between vocabulary scores and the SJS task was considered to determine if vocabulary as a proxy for verbal knowledge contributed significantly to overall performance on the SJS task. The correlation was positive and significant (r = .30), although this relationship was not heavily driven by contributions of vocabulary knowledge differentially for the different age groups, given that this relationship did not decrease much in strength when age was partialled out (rpartial = .26). The relationship of vocabulary scores and age was not very strong overall (r = −.15).

General Discussion

Recent demographic trends reveal large increases in the percentages of the oldest members of our society. It is increasingly important to understand the cognitive functioning of older adults, with special emphasis placed on the oldest-old segment (those 90 + years of age). The current study addressed this concern by conducting an in-depth analysis of performance on three measures of memory in a sample of participants ranging in age from 45 to 96 years of age, with a total of 90 participants in the 90 + age group. We were interested in potential differences in strategic processing across the age groups, as well as the sensitivity of three measures of memory span to reveal such differences.

We compared performance on three measures of memory span using both an all-or-none scoring method, and a method that allowed for proportion correct scoring. Both scoring methods revealed that adults had increasing difficulty with these tasks as age increased (see Figures 2A-C). Importantly, output serial position data were examined to determine how the demands of reordering the items in terms of their physical size would affect participants’ performance, relative to the output serial position curves of the two digit span tasks. The size judgment span task placed additional demands upon the particpants, beyond simply repeating the series of items. To perform this task correctly, participants had to maintain the items during the encoding phase, and then reorder them prior to recall. The data suggested that the oldest participants had the most difficulty with the size judgment span task, as revealed by the overall scores, output serial position curves, and the significant, negative correlation of the size judgment span task with age. Thus, the size judgment span task revealed the most sensitivity to age-group differences. Our results are similar to Myerson et al.’s (2003) findings in which the largest declines with increasing age were observed in the Letter-Number Sequencing task, as compared to forward and backward digit span tasks. The Letter-Number Sequencing task places similar demands on participants as the size judgment task, in that participants must categorize the letters and numbers within the list of items separately, and report them in their respective orders (alphabetical and numeric).

The individual tasks were then examined in detail, using the output serial position curves as an index of strategic processing. Typical serial position curves were not expected in the size judgment span task, because it invovled reordering. All age groups revealed primacy effects in the output serial position data for list length 4, and no recency. Primacy effects typically relate to rehearsal of the items; however, in this reordering task the relationship between input and output serial position is not always maintained. The high performance levels at output serial position 1 indicate that the smallest item was tagged within the list, but additional ordering by size became more difficult as the list continued, differentially for the oldest adults.

The two younger age groups had serial position curves that remained relatively flat, while the two older age groups had much larger drops in performance across the list items. The marked decreases in memory of the later list items was much greater in the older two groups of participants, suggesting that the requirements of not only retaining the items, but then placing them in a specific order, was much more difficult for these older participants. This pattern was reflected in the post-hoc comparisons of output serial position performance within each age group. Interestingly, this post-hoc analysis did not show significant differences in output serial position performance between the two oldest age groups, suggesting that these participants may have been able to draw upon item information and semantic associations. The use of semantic associations has shown less steep declines with age, relative to item information (see McCabe & Hartman, 2008; Hartman & Warren, 2005). Future research with the two oldest age groups should continue to investigate the trajectory of the temporal context deficit. The work ofMaylor et al. (1999) supported the hypothesis of context deficits, but did not include adults in the 90+ age range. Comparing the pattern of performance in the size judgment span task to the backward digit span task illustrated the differences in difficulty level of these two tasks, as a higher list length could be examined without the problems of floor effects. Although both tasks required participants to do re-ordering of the items, using familiar digits led to higher performance than ordering items by their physical size. Similar findings of primacy effects emerged as with the size-judgment span task, but the sharp differences among the four age groups were not observed. While the recall performance of the oldest-old decreased more in the later serial positions, the pairwise comparisons for the four age groups did not result in different patterns. All four age groups showed primacy effects, as positions 1 and 2 differed significantly from positions 3, 4, and 5.

Finally, the forward digit span task was investigated, and revealed that this task was easier than the other two, in terms of higher levels of performance at higher list lengths. List length 7 was examined in detail, and again, strong primacy effects were found in all age groups. While the general bow-shaped pattern of serial recall was evident in all groups, it appeared most clearly in the middle-aged group. The shape of the curve suggests that the middle-aged group is relying more heavily upon rehearsal processes, while the older adults may be employing multiple strategies to perform the task successfully. The oldest-old did not have the lowest levels of performance across all of the list lengths in this task. The role of experience with familiar digits, and the lower level of difficulty of the forward digit span task may have allowed for the use of multiple strategies without major decrements in performance across the age groups.

Taken together across the three span measures, the pattern of performance of the oldest-old was not dramatically different than their younger counterparts. In many analyses, they performed very similarly to the old-old age group. However, one methodological limitation of the present research warrants brief mention. Those who participate in a multidisciplinary study of healthy aging may be unusually high functioning. Conceivably, our results may reflect selectivity in the direction of cognitive vitality in late life, which necessarily limits the generalizeability of these results. Notwithstanding the health of our oldest-old, the MMSE scores were significantly lower in this age group, reflecting the presence of healthy aging (see Table 2 and Appendix A). Future studies will continue to address the issues that relate to longevity, and the potential relationship of longevity to cognitive functioning.

In conclusion, these data provided a unique window into the strategic processing abilities of the oldest-old in three memory tasks. Examining output serial position effects allowed for a more detailed understanding of performance, and revealed changes across serial positions for the different age groups. The pattern of results suggested that the two younger age groups performed similarly, and that the two older age groups performed similarly. These data confirmed previous findings that working memory abilities decrease with increasing age, and supported the view that changes in the use of semantic knowledge may be less sensitive to changes with age (see McCabe & Hartman, 2008; Hartman & Warren, 2005). Furthermore, the use of complex measures, like the size judgment span task, which required both memory for items and a reordering of these items, proved to be the most sensitive to age differences, whereas the forward digit span task was less sensitive. Our results replicated similar findings in the literature when forward and backward digit span tasks have been compared across increasing ages (Bopp & Verhaeghen, 2005; Gregoire & Van der Linden, 1997; Myerson et al., 2003), and provided support for the view of aging that includes frontal-function deficits, or changes in executive control (Myerson et al., 2003; Maylor et al., 1999). In addition, these patterns were shown to exist in the oldest-old.

Acknowledgments

This research was supported by grants from the Louisiana Board of Regents through the Millennium Trust Health Excellence Fund [HEF(2001-06)-02] and the National Institute on Aging P01 AG022064. This support is gratefully acknowledged.

We thank Karri Hawley, Erin Jackson Walker, Julieta Spears, Elizabeth Maggiore, Marla Erwin, Todd Ransom, and Travis Chidester for their assistance with participant testing. We also thank Jenny Sentilles, Travis Atkinson, Taylor Roberts, Tia Bradley, Christina Hebbler, Nicole Lowenhaupt, and Alicia Briganti for their help with data scoring.

Corporate Author List

*Meghan B. Allen, BS; Gloria Anderson, BS; Iina E. Antikainen, BS; Arturo M. Arce, MD; Jennifer Arceneaux, RN; Mark A. Batzer, PhD; Emily O. Boudreaux, MA; Lauri Byerley, PhD; Pauline A. Callinan, PhD; Catherine M. Champagne PhD, RD; Hau Cheng, MS; Katie E. Cherry, PhD; Yu-wen Chiu, MPH; Liliana Cosenza, BS; M. Elaine Cress, PhD-Consultant; Jianliang Dai, PhD; James P. DeLany, PhD; Jenny Y. Denver, MS; Andy Deutsch, PhD; Melissa J. deVeer, MS; Devon A. Dobrosielski, MS; Rebecca Ellis, PhD; Andrea Ermolao, MD; Marla J. Erwin, MA; Mark Erwin, MA; Jennifer Fabre, MPT; Elizabeth T. Fontham, PhD; Madlyn Frisard, PhD; Paula Geiselman, PhD; Lindsey Goodwin, BS; Jason Gtray, BS; Valentina Greco, PhD; Tiffany Hall; Michael Hamilton, MD; Karri S. Hawley, PhD; Scott W. Herke, PhD; Jennifer Hayden, MS; Kristi Hebert, BS; Fernanda Holton, MA; Hui-Chen Hsu, PhD; S. Michal Jazwinski, PhD; Darcy Johanssen, PhD; Lumie Kawasaki, MD; Sangkyu Kim, PhD; Beth G. Kimball, BS; Christina King-Rowley, MS; Miriam Konkel, MD; Lynn LaMotte, PhD; Kim Landry; Carl Lavie, MD-Consultant; Daniel LaVie, BS; Matthew Leblanc; Christina M. Lefante, MPH; Li Li, MD; Hui-Yi Lin, PhD, MSPH; Kay Lopez, DSN; James Major; Bridget McEvoy-Hein, MS; John D. Mountz, MD PhD; Leann Myers, PhD; Jennifer Owens, BA; Kim B. Pedersen, PhD; Andrew Pellett, PhD; Eric Ravussin, PhD; Paul Remedios, BS; Ryan Rhodes, BS; Yolanda Robertson, NP; Jennifer Rood, PhD; Henry Rothschild, MD, PhD; Ryan A. Russell, BS; Erin Sandifer, BS; Charles Sasser, BS; Beth Schmidt, MS; Robert Schwartz, MD – Consultant; Donald K. Scott, PhD; Mandy Shipp, RD; Jennifer Silva Brown, PhD; F. Nicole Standberry; L. Joseph Su, PhD MPH; Jolie Thibodeaux; Mary Susan Thomas, APRN, MSN, FNP; Jessica Thomson, PhD; Valerie Toups, LPN; Crystal Traylor, APRN, MSN, WHNP; Cruz Velasco-Gonzalez, PhD; Julia Volaufova, PhD; Celeste Waguespack, BSN, RN; Jerilyn A. Walker, MS; Xui Yun Wang; David A. Welsh, MD; Michael A. Welsch, PhD; Robert H. Wood, PhD; Sarah Zehr, PhD; Pili Zhang, PhD. (Louisiana State University, Baton Rouge; Pennington Biomedical Research Center, Baton Rouge; Louisiana State University Health Sciences Center, New Orleans; Tulane University, New Orleans; University of Alabama, Birmingham).

Appendix A

Mean SJS Scores in the Oldest-Old as a Function of the MMSE Total Score

MMSE total M of SJS Score N
25 3.55 10
26 3.31 18
27 3.60 19
28 3.68 19
29 3.64 18
30 3.75 6

Appendix B

Means and Frequency Counts of SJS Score by Age Group

Middle Age Young Old Old Old Oldest Old
2.0 -- -- -- 4
2.5 -- -- -- 3
3.0 1 1 12 22
3.5 3 4 6 18
4.0 16 18 15 35
4.5 11 7 5 6
5.0 9 6 2 2
5.5 4 2 -- --
6.0 2 2 -- --
6.5 2 -- -- --
Total 48 40 40 90
Mean 4.56 4.33 3.74 3.57

Contributor Information

Emily M. Elliott, Louisiana State University

Katie E. Cherry, Louisiana State University

Jennifer Silva Brown, Louisiana State University.

Emily A. Smitherman, Louisiana State University

S. Michal Jazwinski, Tulane University School of Medicine.

Qingzhao Yu, Louisiana State University Health Science Center.

Julia Volaufova, Louisiana State University Health Science Center.

References

  1. Babcock RL, Salthouse TA. Effects of increased processing demands on age differences in working memory. Psychology & Aging. 1990;5:421–428. doi: 10.1037//0882-7974.5.3.421. [DOI] [PubMed] [Google Scholar]
  2. Bäckman L, Small BJ, Wahlin A, Larsson M. Cognitive functioning in very old age. In: MCraik FI, Salthouse TA, editors. The handbook of aging and cognition. 2nd ed. Mahwah, New Jersey:: Erlbaum; 2000. pp. 499–558. [Google Scholar]
  3. Baddeley A. Working Memory, Thought, and Action. New York, NY: Oxford University Press; 2007. [Google Scholar]
  4. Barrouillet P, Bernardin S, Camos V. Time constraints and resources sharing in adults’ working memory spans. Journal of Experimental Psychology: General. 2004;133:83–100. doi: 10.1037/0096-3445.133.1.83. [DOI] [PubMed] [Google Scholar]
  5. Bopp KL, Verhaeghen P. Aging and verbal memory span: A meta-analysis. Journal of Gerontology: Psychological Sciences. 2005;60:223–233. doi: 10.1093/geronb/60.5.p223. [DOI] [PubMed] [Google Scholar]
  6. Cherry KE, Elliott EM, Reese CM. Age and individual differences in working memory: The size judgment span task. Journal of General Psychology. 2007;134:43–65. doi: 10.3200/GENP.134.1.43-65. [DOI] [PubMed] [Google Scholar]
  7. Cherry KE, Hawley KS, Jackson EM, Volaufova J, Su LJ, Jazwinski SM. Pictorial superiority effects in oldest-old adults. Memory. 2008;16:728–741. doi: 10.1080/09658210802215534. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Cherry KE, Park DC. Individual difference and contextual variables influence spatial memory in younger and older adults. Psychology and Aging. 1993;8:517–526. doi: 10.1037//0882-7974.8.4.517. [DOI] [PubMed] [Google Scholar]
  9. Craik FI, Bialystock E. Cognition through the lifespan: Mechanisms of change. Trends in Cognitive Sciences. 2006;10:131–138. doi: 10.1016/j.tics.2006.01.007. [DOI] [PubMed] [Google Scholar]
  10. Daneman M, Carpenter PA. Individual differences in working memory and reading. Journal of Verbal Learning and Behavior. 1980;19:450–466. [Google Scholar]
  11. Dobbs AR, Rule BG. Adult age differences in working memory. Psychology and Aging. 1989;4:500–503. doi: 10.1037//0882-7974.4.4.500. [DOI] [PubMed] [Google Scholar]
  12. Engle RW. What is working memory capacity? In: Roediger HL, Nairne JS, Neath I, Surprenant AM, editors. The nature of remembering: Essays in honor of Robert G. Crowder. Washington, DC: American Psychological Association; 2001. pp. 297–314. [Google Scholar]
  13. Folstein MF, Folstein SE, McHugh PR. Mini-mental state: A practical method for grading the cognitive state of patients for the clinician. Journal of Psychiatric Research. 1975;12:189–198. doi: 10.1016/0022-3956(75)90026-6. [DOI] [PubMed] [Google Scholar]
  14. Golomb JD, Peelle JE, Addis KM, Kahana MJ, Wingfield A. Effects of adult aging on utilization of temporal and semantic associations during free and serial recall. Memory & Cognition. 2008;36:947–956. doi: 10.3758/mc.36.5.947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Gray JR, Chabris CF, Braver TS. Neural mechanisms of general fluid intelligence. Neuroscience. 2003;6:316–322. doi: 10.1038/nn1014. [DOI] [PubMed] [Google Scholar]
  16. Gregoire J, Van der Linden M. Effects of age on forward and backward digit spans. Aging, Neuropsychology, and Cognition. 1997;4:140–149. [Google Scholar]
  17. Hartman M, Warren LH. Explaining age differences in temporal working memory. Psychology and Aging. 2005;20:645–656. doi: 10.1037/0882-7974.20.4.645. [DOI] [PubMed] [Google Scholar]
  18. Jastak J, Jastak S. Short forms of the WAIS and WISC vocabulary subtests. Journal of Clinical Psychology. 1964;20:167–199. doi: 10.1002/1097-4679(196404)20:2<167::aid-jclp2270200202>3.0.co;2-9. [DOI] [PubMed] [Google Scholar]
  19. Lustig C, May CP, Hasher L. Working memory span and the role of proactive interference. Journal of Experimental Psychology: General. 2001;130:199–207. doi: 10.1037//0096-3445.130.2.199. [DOI] [PubMed] [Google Scholar]
  20. Maylor EA, Vousden JI, Brown GDA. Adult age differences in short-term memory for serial order: Data and a model. Psychology and Aging. 1999;14:572–594. doi: 10.1037//0882-7974.14.4.572. [DOI] [PubMed] [Google Scholar]
  21. McCabe J, Hartman M. Examining the locus of age effects on complex span tasks. Psychology and Aging. 2003;18:562–572. doi: 10.1037/0882-7974.18.3.562. [DOI] [PubMed] [Google Scholar]
  22. McCabe J, Hartman M. Working memory for item and temporal information in younger and older adults. Aging, Neuropsychology, and Cognition. 2008;15:574–600. doi: 10.1080/13825580801956217. [DOI] [PubMed] [Google Scholar]
  23. Myerson J, Emery L, White DA, Hale S. Effects of age, domain, and processing demands on memory span: Evidence for differential decline. Aging, Neuropsychology, and Cognition. 2003;10:20–27. [Google Scholar]
  24. Salthouse TA, Pink JE. Why is working memory related to fluid intelligence? Psychonomic Bulletin & Review. 2008;15:364–371. doi: 10.3758/PBR.15.2.364. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Shelton JT, Elliott EM, Hill BD, Calamia MR, Gouvier WD. A comparison of laboratory and clinical working memory tests and their prediction of fluid intelligence. Intelligence. 2009;37:283–293. doi: 10.1016/j.intell.2008.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. United States Census. Older Americans Month: May 2009. 2009 Retrieved March 26, 2009 from http://www.census.gov/Press-Release/www/releases/archives/facts_for_features_special_editions/013384.html.
  27. Unsworth N, Engle RW. On the division of short-term and working memory: An examination of simple and complex span and their relation to higher order abilities. Psychological Bulletin. 2007;133:1038–1066. doi: 10.1037/0033-2909.133.6.1038. [DOI] [PubMed] [Google Scholar]
  28. Unsworth N, Spillers GJ, Brewer GA. The contributions of primary and secondary memory to working memory capacity: An individual differences analysis of immediate free recall. Journal of Experimental Psychology: Learning, Memory, & Cognition. 2010;36:240–247. doi: 10.1037/a0017739. [DOI] [PubMed] [Google Scholar]
  29. Wechsler D. Wechsler Adult Intelligence Scale-Third Edition: Technical Manual. San Antonio, TX: The Psychological Corporation; 1997. [Google Scholar]

RESOURCES