Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Nov 1.
Published in final edited form as: Neuropsychol Dev Cogn B Aging Neuropsychol Cogn. 2015 Apr 1;22(6):693–711. doi: 10.1080/13825585.2015.1028884

Younger and Older Adults Weigh Multiple Cues in a Similar Manner to Generate Judgments of Learning

Jarrod C Hines, Christopher Hertzog, Dayna R Touron
PMCID: PMC4537673  NIHMSID: NIHMS679306  PMID: 25827630

Abstract

One's memory for past test performance (MPT) is a key piece of information individuals use when deciding how to restudy material. We used a multi-trial recognition memory task to examine adult age differences in the influence of MPT (measured by actual Trial 1 memory accuracy and subjective confidence judgments, CJs) along with Trial 1 judgments of learning (JOLs), objective and participant-estimated recognition fluencies, and Trial 2 study time on Trial 2 JOLs. We found evidence of simultaneous and independent influences of multiple objective and subjective (i.e., metacognitive) cues on Trial 2 JOLs, and these relationships were highly similar for younger and older adults. Individual differences in Trial 1 recognition accuracy and CJs on Trial 2 JOLs indicate that individuals may vary in the degree to which they rely on each MPT cue when assessing subsequent memory confidence. Aging appears to spare the ability to access multiple cues when making JOLs.

Keywords: judgments of learning, memory for past test, aging, metacognition, multilevel modeling

Introduction

During the course of learning, people are often able to study materials multiple times (e.g., when a student spends multiple nights studying for an exam). Actively monitoring one's current state of learning can help to optimize further study efforts (e.g., Nelson & Narens, 1990; see Bjork, Dunlosky, & Kornell, 2013, for a recent review of self-regulated learning). One metric for assessing an individual's perceptions of his or her current state of learning is the judgment of learning (JOL), for which people rate confidence in the likelihood of remembering a piece of information (e.g., an item from a word list) during a subsequent memory test (for an introductory overview, see Dunlosky and Metcalfe, 2009). As reviewed in detail below, research on aging and JOLs indicates age-related invariance in the relative accuracy of JOLs, as indicated by within-person correlations of item JOLs with memory outcomes such as item recall success (Hertzog & Dunlosky, 2011). The present study focuses on age differences in the information people use to make JOLs and whether the cues that are accessed differ across the adult life span (Hertzog & Shing, 2009). Before reviewing the literature on aging and JOLs, we first consider theories of JOL construction that set the stage for understanding age differences in how people use multiple cues to construct JOLs.

Cue Utilization and JOLs

It is widely accepted that metacognitive judgments are explicit or implicit inferences based on accessible information, or cues, about underlying cognitive states without direct access to those internal cognitive states (Koriat, 1997; Nelson, 1996a). There are many potential metacognitive cues available to individuals which differ widely in their validity for predicting later remembering (Dunlosky & Metcalfe, 2009).

Research over the last 25 years has identified a large number of cues that are accessed when making JOLs, including initial encoding fluency (e.g., Besken & Mulligan, 2013; Begg, Duft, Lalonde, Melnick, & Sanvito, 1989; Hertzog, Dunlosky, Robinson, & Kidder, 2003; Undorf & Erdfelder, 2013); retrieval fluency (e.g., Benjamin, Bjork, & Schwartz, 1998; Hines, Hertzog, & Touron, 2009); perceived item difficulty (Undorf & Erdfelder, 2013); learners' degree of prior familiarity with to-be-learned items (e.g., Shanks & Serra, 2014); how JOLs are framed (e.g., whether prediction instructions focus on studying or forgetting; Ariel, Hines, & Hertzog, 2014; Finn, 2008); spontaneous use of effective encoding strategies (Hertzog, Sinclair, & Dunlosky, 2010); and, perhaps most commonly, observable item characteristics such as associative relatedness or word concreteness (e.g., Begg, Duft, Lalonde, Melnick, & Sanvito, 1989; Hertzog, Kidder, Powell-Moman, & Dunlosky, 2002; Koriat, 1997; Rhodes & Castel, 2008).

An open question at present is how many cues individuals can access on a given item, or across a set of items. Most of the existing literature has focused on either one or a small set of potential cues. A multiple-cue utilization perspective (Hertzog, Hines, & Touron, 2013) argues that people can and do access and use multiple cues when making JOLs, although the likelihood of accessing multiple cues during the judgment process will be influenced by factors such as motivation, ability, fatigue, item difficulty, and the capacity of attention and working memory. Certainly, not all available cues are accessed and used when making JOLs. Learners have been found to ignore, neglect, or discount many cues that are highly diagnostic of later remembering, such as an asymmetric direction of associative relatedness of cue-target word pairs (e.g., Koriat & Bjork, 2006) or variation in the effectiveness of instructed encoding strategies (e.g., Hertzog, et al., 2009). In some cases, cue-neglect may be associated with overshadowing of one cue by another (e.g., Hertzog et al., 2002).

An important JOL cue in multi-trial learning tasks (involving more than one study-test trial) is memory-for-past-test performance (MPT; Ariel & Dunlosky, 2011; Finn & Metcalfe, 2007; 2008; Hertzog, Hines, & Touron, 2013; Serra & Ariel, 2014; Tauber & Rhodes, 2012). JOLs made after the first test are typically strongly related to MPT, which has been argued to seed a MPT heuristic (“if I remembered it before, I’ll remember it again”; Finn & Metcalfe, 2007). Individuals could improve JOL accuracy (or resolution) across study-test trials by solely relying on a MPT heuristic, perhaps at the cost of ignoring other valid cues while generating an underconfidence-with-practice effect, in which JOL magnitudes increasingly underestimate future performance by ignoring future learning (Finn & Metcalfe, 2008). Other cues aside from MPT apparently influence multi-trial JOLs (Ariel & Dunlosky, 2011; Tauber & Rhodes, 2012), although the specific cues responsible for the effects reported in those studies were not identified.

Hertzog, Hines, and Touron (2013) used multilevel modeling procedures (Snijders & Bosker, 2012) to show explicitly that encoding fluency was a source of enhanced JOL-memory correlations at a second study trial, in addition to other cues. As expected from the MPT literature studying associative cued recall, Trial 2 JOLs were strongly influenced by prior recognition memory accuracy or success (the index of MPT). The additional cues influencing JOLs included: (a) item-level recognition confidence judgments (CJs), (b) actual recognition response latency, (c) subjective recognition test fluency (measured by recognition response times, RTs), and (d) encoding fluency (measured by strategy completion RTs) immediately prior to the JOL.

CJs in associative recognition tests are influenced by retrieval fluency and accuracy (Hines, Hertzog, & Touron, 2009), while diverging from both due to fallible metacognitive monitoring. For example, factors like specific recollections of study or retrieval strategies (and their degree of success) or the general degree of clarity regarding those recollections may influence memory confidence in associative recognition tests (e.g., Cohn & Moscovitch, 2007; Parks, 2007). In recognition memory contexts, the less-than-perfect accuracy of CJs, as measured by CJ-recognition memory success correlations, allows for detecting influences of both variables on JOLs.

The role of actual retrieval fluency (correct recognition RTs) in influencing CJs and subsequent JOLs is not surprising, given that RTs are typically faster as one's level of underlying item-learning increases (e.g., Cerella, Onyper, & Hoyer, 2006; Logan, 1988; Ratcliff, Thapar, & McKoon, 2011). Monitoring this relationship may lead individuals to adopt a fluency heuristic (“if I remembered it quickly, I remembered it accurately”; e.g., Benjamin et al., 1998), which may provide useful information in addition to the MPT heuristic when constructing JOLs. The concurrent and independent influence of subjective retrieval fluency (as measured by estimated recognition RTs) controlling on actual RTs, might be more unexpected, given a literature that generally focuses on metacognition for memory response accuracy but not for response latency.

As with CJs and recognition accuracy, the relation of prior test-trial subjective retrieval fluency on later JOLs probably arises because temporal monitoring in memory tasks are not always well calibrated, instead showing a substantial level of aggregate underestimation of RTs and trial-level divergence between actual and estimated RT (Craik & Hay, 1999; Hertzog, Touron, & Hines, 2007). The influence of prior-trial CJs and estimated retrieval latencies on JOLs, controlling on recognition accuracy and actual recognition RTs, demonstrates the importance of metacognitive monitoring of past events on subsequent JOLs and study time (e.g., Hines, et al., 2009; Robinson, Hertzog, & Dunlosky, 2006).

Age Differences in JOL Construction

The question of interest for this study was whether there are age differences in multiple-cue utilization when making JOLs. The existing literature on age differences in JOLs indicates age-related invariance in the effects of different cues on JOLs, but only a few aging studies have evaluated concurrent influences of multiple cues. There is ample evidence that younger and older adults' JOLs are influenced similarly by experimentally-manipulated metacognitive cues and that there are few, if any, age differences in JOL accuracy (or resolution), as measured by within-person correlations of JOLs with memory outcomes (e.g., Dunlosky & Hertzog, 2000; Hertzog, Kidder, Powell-Moman, & Dunlosky, 2002; Hertzog, Sinclair, & Dunlosky, 2010; Robinson et al., 2006). With respect to multiple cue utilization, Hertzog, Sinclair, & Dunlosky (2010) found that adults of all ages attended to whether they had successfully used mediational strategies and assigned higher JOLs to well-encoded items for both related and unrelated items.

However, such findings may not generalize to memory task performance monitoring, as required for MPT effects. Older adults are often less accurate in estimating aggregate recall outcomes within levels of experimental variables (e.g., Price, Dunlosky, & Hertzog, 2008), and are prone to high-confidence recognition memory errors (e.g., Dodson, Bawa, & Krueger, 2007; Hertzog & Touron, 2011; Shing, Werkle-Bergner, Li, & Lindenberger, 2009; Wong, Cramer, & Gallo, 2012). Although Tauber and Rhodes (2012) and Serra and Ariel (2014) found that younger and older adults used the MPT heuristic similarly and appropriately to estimate subsequent JOLs in a cued recall task, it could be the case that attempting to monitor the large set of available cues in recognition memory tasks would exceed older adults’ diminished executive control and/or processing capacity (e.g., Hedden & Yoon, 2006; Salthouse, 2004). Furthermore, some aspects of performance monitoring may be impaired in old age, given evidence for age differences in resolution of CJs and estimated retrieval fluency (e.g., Craik & Hay, 1999; Hertzog, et al., 2007; Hines, et al., 2009). Hence, there are reasons to doubt older adults’ ability to monitor multiple cues successfully, especially when potent MPT effects could overshadow other available cues.

Alternatively, Tauber and Rhodes (2012) noted that MPT likely relies not only on explicit recollections of actual test performance, but also on a set of implicit cues that operate in the absence of a high-quality recollection of the testing experience. Given age-related declines in episodic, but not implicit, memory performance (e.g., Fleischman, Wilson, Gabrieli, Beinias, & Bennet, 2004; for reviews see Light et al., 2000; Zacks, Hasher, & Li, 2000), older adults might manifest similar relations of metacognitive cues with JOLs, despite age differences in executive function and working memory capacity.

Current Hypotheses

This study used a two study-test trial design in a yes-no associative recognition task (e.g., Naveh-Benjamin, 2000) to evaluate age differences in multiple cue influences on JOLs. Data on younger adults in this experiment were analyzed previously generating evidence of multiple-cue utilization in younger adults (Hertzog et al., 2013). The older adult data have not previously been reported.

Our hypotheses derived from the multiple cue-utilization perspective. We hypothesized that older adults are capable of monitoring multiple cues when making JOLs on a second study-test trial. We predicted that, given the existing evidence for spared resolution and sensitivity of JOLs in old age, multiple cues would act simultaneously and independently for both younger and older adults when estimating Trial 2 JOLs, in patterns consistent with spared MPT accuracy in the presence of multiple-cue influences on JOLs. Critically, our specific empirical hypotheses were that (a) Trial 1 recognition CJs and estimated RTs would predict Trial 2 JOLs independently of Trial 1 recognition memory accuracy (i.e., MPT), and (b) variations in the fluency of self-paced study time at Trial 2 would also influence Trial 2 JOLs, given known encoding fluency effects that have been seen in single study-test trial tasks (e.g., Robinson et al., 2006). These effects, along with robust MPT effects in older adults, would provide compelling evidence for older adults’ ability to access multiple cues when making Trial 2 JOLs. Finally, we also expected consistency between Trial 1 and Trial 2 JOLs in both item-level and person-level effects, indicating consistency in perceived item difficulty and stable individual differences in both use of the JOL rating scale and levels of confidence about future learning.

Method

Design

The experiment was a 2 (Age: Young, Old) X 2 (Study-Test Trial: 1, 2) mixed design, with age as a between-subjects independent variable and study-test trial as a within-subjects repeated variable.

Participants

Fifty-one younger adults between the ages of 18 and 25 years (M = 19.2) participated, as did 36 older adults who were between the ages of 60 and 75 years (M = 68.4). All younger adults were undergraduates at the Georgia Institute of Technology who received course credit for their participation. All older adults were recruited from the nearby community and were given an honorarium for their participation. All participants were pre-screened for basic health issues that could impede participation (e.g., diagnosed Parkinson’s disease or uncorrected visual impairments). Two cognitive tests were given in addition to the main experimental task to assess the similarity of basic information processing between our younger and older samples. The Shipley Institute of Living Scale-Revised vocabulary test (Zachary, 1986) was given to assess general crystallized abilities and is particularly relevant due to our experimental task involving encoding words (which benefits from knowing their meaning). The Digit-Symbol subtest of the Wechsler Adult Intelligence Scale-Revised (WAIS-R, Wechsler, 1981) was also administered as a test of general fluid ability. This test involves participants being given a sheet of many rows of randomly-ordered numbers (one through nine) under which they are to fill in an associated symbol (a key providing the unique associations is included on the top of the fill-in-the-blank test). Following the 90-second fill-in-the blank interval, the key and response sheet are removed and participants are given another sheet with a single row of response boxes labeled with numbers one through nine and asked to fill in the appropriate symbol below each number using only their memory. Performance in both portions of the Digit-Symbol subtest benefit from learning the unique associations between the numbers and symbols quickly, an ability which serves as a proxy for general fluid intelligence. Participant characteristics are presented in Table 1. As would be expected given the vast literature on age differences in crystallized and fluid abilties, older adults outperformed younger adults on the Shipley vocabulary measure, whereas younger adults performed better on both components of the WAIS-R Digit –Symbol test. Extremely low outliers (greater than two standard deviations from the mean) would have been culled from the sample, but that was unnecessary. Outcomes pertaining to the other variables in Table 1 are discussed in the Results section.

Table 1.

Means and Standard Deviations of Participant Characteristics and Performance Indices for Older Adults

Younger Adults Older Adults

Measure Mean SD Mean SD t d
Age (years) 19.22 1.19 68.36 5.44 −62.56***
Vocabulary 31.41 3.52 34.14 4.02 −3.36** 0.73
Digit-symbol 70.90 10.66 42.91 9.04 12.58*** 2.76
Digit-symbol memory 8.14 1.28 4.50 1.80 10.90*** 2.39
Trial 1 study time (s) 7.36 4.66 10.00 6.94 −2.12* 0.46
Trial 1 judgment of learning 40.76 17.66 31.45 21.49 2.21* 0.48
Trial 1 recognition accuracy (%) 82.16 12.74 72.36 22.02 2.62* 0.57
Trial 1 confidence judgment 85.53 10.09 76.13 18.36 3.07** 0.67
Trial 1 recognition RT (s) 1.65 0.30 3.55 1.08 −11.93*** 2.59
Trial 1 recognition RT estimate (s) 1.49 0.86 1.69 1.38 −0.82 0.18
Trial 2 study time (s) 1.62 2.09 3.50 1.89 −4.31*** 0.93
Trial 2 judgment of learning 81.70 13.41 69.33 25.02 2.98** 0.65

Note. Vocabulary = number of correct out of 40 on the Shipley Institute of Living Scale Revised vocabulary test (Zachary, 1986). Digit-symbol = Wechsler Adult Intelligence Scale-Revised Digit-Symbol Subtest (Wechsler, 1981). Digit-symbol memory = symbol recall memory following the Wechsler Adult Intelligence Scale-Revised Digit-Symbol subtest. Cohen (1988) advised values of d of 0.2, 0.5, and 0.8 should be considered small, moderate, and large effects, respectively.

*

p < .05;

**

p < .01;

***

p < .001

Materials and Procedure

Stimuli and procedural details were identical to those reported by Hertzog et al. (2013). One hundred twenty concrete nouns were used to construct 60 semantically-unrelated word pairs (e.g., IVY-BIRD). Participants completed two study-test trials using the same set of 60 word pairs in both trials. Participants were told prior to beginning the first study trial that their memories of each word pair would be tested. Each study-test trial involved randomized item-level study for each word pair that was terminated by participants pressing the spacebar. After ending study for each item, participants gave a JOL relating their ability to remember the word pair approximately 10 minutes later on a continuous scale from 0% to 100% confidence.

Testing during both trials was also self-paced. Participants were presented with all 60 previously-studied (i.e., intact) items in addition to 60 new (i.e., rearranged) items that were constructed from the same 120 nouns used to create the original items (sampled without replacement). For example, IVY-BARREL might have been a new item constructed for testing by rearranging words from the studied word pairs IVY-BIRD and BARREL-STAR. This procedure resulted in 120 test items for which participants were asked to indicate by keypress if they had studied the presented word pair (YES) or if the word pair presented had not been studied previously (NO). After a Yes/No response was registered, participants gave a confidence judgment (CJ) for their recognition response scaled from 0% to 100%. Participants also gave a RT estimate for their Yes/No keypress using a computer mouse to move a slider along a continuous scale from 0s to 10s. As the mouse moved, the computer displayed a numeric value to participants that was accurate to the nearest 0.1s. The order of collecting CJs and RT estimates was counterbalanced across participants who were randomly assigned to counterbalancing conditions to investigate possible reactivity due to the order of metacognitive judgments. No important interactions were found and data were analyzed after collapsing across counterbalancing conditions. Finally, it is unclear how one would map a Trial 1 rearranged-pair test item onto Trial 2 study behavior, given that rearranged test pairs draw words from two different paired-associate items. Moreover, consistent with past research (e.g., Cohn & Moscovitch, 2007; Hertzog & Touron, 2011) relationships between our variables of interest were considerably weaker for rearranged items at test. Hence the inclusion of data from Trial 1 test was restricted to intact word pairs in all analyses.

Statistical Procedure

The relationships existing between the outcome variable Trial 2 JOL and predictor variables (specified below) were examined via multilevel modeling with within- and between-person centering of predictors in our models to account for both types of effects (see Enders & Tofighi, 2007, and Singer, 1998). Within-person centering rescales each predictor independently as its deviation from that person's mean value. For example, if a participant's average RT is 2000ms, an item-level RT of 1500ms would be scaled as −500ms. This type of centering allows for the evaluation of the influences of within-person variation in cues predicts within-person variation in JOLs. In multilevel regression, these influences are called Level 1 regression effects. Between-person centering involves calculating an average value for each predictor for each participant and calculating a difference score between the mean of each participant and the mean of the overall sample. For example, if the sample mean RT was 3000ms, and a participant's mean RT was 2000ms, the value would be rescaled to −1000ms for every item-level response. Between-person centering allows for the evaluation of individual differences in means on predictors and are referred to as Level 2 regression effects in multilevel models.

Level 1 and Level 2 effects act independently from one-another and can be used as predictors within the same model (Singer, 1998). This is advantageous because questions pertaining to within- and between-person variations in predictors can be investigated simultaneously. For example, questions like "does a faster recognition response for an item in Trial 1 predict a higher JOL for that item in Trial 2?" can be asked along with "do people who respond faster on average in Trial 1 have higher average memory confidence in Trial 2?" Examining only one type of question in isolation can lead to improper conclusions about the relationships between predictors and outcomes because Level 1 and Level 2 effects can operate in opposing directions (e.g., people may be more confident in items that they remember faster, but people who, on average, respond quickly are not necessarily more confident than those who respond more slowly). See Snijders and Bosker (2012) for a more detailed treatment of this topic.

The multilevel regression models included the within- and between-person variables associated with every factor measured explicitly during the course of our experiment: participant age group (younger or older), study time from Trials 1 and 2, Trial 1 JOL, recognition accuracy (a binary measure), CJ, RT, and estimated RT. JOLs and CJs were scaled from 0% to 100% memory confidence, time-based measures were scaled in milliseconds, and age interaction terms were included for each variable to afford an age-based comparison of relationships between variables.

Models were fit to our data using SAS PROC MIXED (Littell, Milliken, Stroup, & Wolfinger, 2000), and our Type I error was set at α = .05 for all main effects and interactions. The F-tests associated with all fixed effects are reported, but only effects that met this standard are discussed. Relevant to our error criterion, it is appropriate to mention another benefit of examining our data via multi-level modeling as opposed to a more traditional measure, such as the gamma correlation (Nelson, 1984; 1996b). Murayama, Sakaki, Yan, and Smith (2014) found that the use of gamma correlations in a two-stage process to assess group-level differences (namely, by first computing participant-level correlations between JOLs and memory outcomes and then analyzing the derived statistics as dependent variables), can inflate the Type I error rate in the presence of one or more random effects (such as variance in item difficulty and individual differences in the effects of MPT cues on JOLs). Our models estimate fixed effects for each item, which controls for item difficulty effects and directly addresses this concern. Hertzog et al. (2013) found multiple random effects (i.e., individual differences) in analyzing the younger adult data presented here, and our analyses control for these effects, whereas the use of gamma correlations does not. Finally, when marginal distributions of the component variables are highly skewed, gamma correlations have high standard errors that are then ignored in the two-stage process, producing distorted standard errors for aggregate gammas. The multi-level modeling procedure produces appropriate asymptotic standard errors of estimate for all regression effects.

Results

The models presented here focus on key predictors and Trial 2 JOL for both younger and older adults. Relevant means and standard deviations of the variables included therein can be found in Table 1. Briefly, older adults studied longer than did younger adults in order to reach a somewhat dissimilar level of recognition memory performance in Trial 1 (favoring younger adults). Older adults were generally less confident in their memories following both study and test experiences, although, like their younger counterparts, their initial underconfidence in Trial 1 was repaired following test, reflecting the benefits of the MPT heuristic and, possibly, the other potential JOL cues examined in detail below. Of particular note were response time estimates, which were discrepant from actual RT (assessed using the gamma correlations; see Nelson, 1984). Older adults’ estimates correlated less with actual RT (M = 0.32, SE = 0.03) than did those of younger adults (M = 0.41, SD = 0.03), d = .46, a medium-sized effect (Cohen, 1988). Given research showing that JOLs are more highly correlated with perceived rather than actual memory fluencies (e.g., Robinson, et al., 2006), this discrepancy provides a good opportunity to determine whether our participants (especially older adults) were attuned to actual RT, estimated RT, or both when estimating JOLs.

Multi-level Model Rationale

We first defined an unconditioned model in which no effects were specified other than the intercept (see Table 2). This model served as a baseline for comparison with our other models because we used it to calculate pseudo-R2 values for each iteration by subtracting the residual (unexplained) variance for those models from the residual of the unconditioned model and dividing that amount by the residual of the unconditioned model. We were thus able to determine the benefit of adding specific components to the model in terms of the additional variance explained in item-level Trial 2 JOL. A second model included a random effect for the intercept, which allowed an examination of the influence of individual differences in average Trial 2 JOL (i.e., memory confidence). A third model added all fixed effects for the potential JOL cues specified above. A fourth model included the aforementioned effects in addition to a random effect of Trial 1 recognition accuracy. As discussed previously, prior research on MPT reported that a large proportion of the variation in subsequent memory confidence (relative to that of other potential cues) is based on whether an individual remembers the item-specific results of a prior memory test, so this effect is hypothesized to be of considerable importance. A reliable fixed effect would indicate that prior response accuracy was an important source of variation in Trial 2 JOL. A reliable random effect would indicate that individuals varied in the extent to which they based JOLs on prior recognition accuracy. A fifth model also included a random effect for Trial 1 CJs, which captured individual differences in this cue’s degree of influence on JOLs above and beyond differences in recognition memory accuracy itself. Model 6 retained the random effect of CJs but did not include the random effect of accuracy and served as a test of whether or not individual differences in Trial 1 recognition accuracy had a significant effect on the model's explanatory power if we also included CJs.

Table 2.

Goodness-of-Fit Statistics, LR χ2 and Psuedo-R2 for Models Predicting Trial 2 JOLs for Younger and Older Adults

Younger Adults

Model −2LL BIC Residual Δ χ 2 Δ df R 2

1. Unconditioned 28146 28163 578.42 - - -
2. Random Intercept 27250 27262 408.81 896.0* 1 .293
3. Add Fixed Effects 26439 26738 315.17 811.7* 0 .455
4. Add Random Accuracy 26375 26682 302.95 63.5* 2 .476
5. Add Random CJ 26325 26643 295.17 50.7* 3 .490
6. Drop Random Accuracy 26343 26650 299.49 18.3* 3 .482
Older Adults

Model −2LL BIC Residual Δ χ 2 Δ df R 2

1. Unconditioned 49788 49805 812.45 - - -
2. Random Intercept 46780 46794 427.11 3007.70* 1 .474
3. Add Fixed Effects 45529 45935 339.71 1322.08* 0 .582
4. Add Random Accuracy 45440 45855 327.85 89.04* 2 .596
5. Add Random CJ 45352 45781 319.09 87.23* 3 .607
6. Drop Random Accuracy 45379 45795 322.82 27.00* 3 .603

Note: R2 is the total pseudo- R2 for all effects (fixed and random).

*

p < .001

Abbreviations: −2LL: −2 Log Likelihood (fit function); BIC: Bayesian Information Criterion; LR: likelihood ratio; JOL: judgment of learning; CJ: confidence judgment

Multi-level Model Results

Fit statistics pertaining to our models can be found in the top panel of Table 2. The following description of findings pertains only to older adult models, although the pattern of improvement in model fit resulting from the inclusion of specific random effects was similar between younger and older adults (see Hertzog, et al., 2013 for further details regarding younger adult models). Model 2, which only included a random effect on intercept, accounted for approximately 47% of the variance in Trial 2 JOL and offered a better fit to the data than did our initial unconditioned model. Participants therefore differed reliably in their average level of leaning confidence as assessed by Trial 2 JOLs. The explanatory power of the model increased considerably following the inclusion of all fixed effects, pseudo-R2 = .58. Smaller increases were found following the inclusion of random effects of Trial 1 recognition accuracy (pseudo-R2 = .60) and Trial 1 recognition CJ (pseudo-R2 = .61). A small but statistically significant reduction in model fit (accounting for 1% of variance explained) resulted from the removal of the random effect of accuracy from the prior model. Individuals therefore varied with respect to the degree of emphasis placed on both recognition accuracy and CJs and the inclusion of both associated random effects in the model is therefore justified. The random effects covariance matrix for the accepted model, Model 5, are reported in Table 3.

Table 3.

Covariance Matrices of Random Effects for Model 5 (Standard Errors in Parentheses)

Intercept Trial 1 Accuracy Trial 1 CJs

Intercept 159.03 (25.13)*** --- ---
Trial 1 Accuracy 14.84 (13.91) 37.36 (14.18)** ---
Trial 1 CJs −0.57 (0.31) 0.29 (0.20) 0.02 (0.01)***

Note:

* p< .05;

**

p < .01;

***

p < .001

The fixed effects solution for Model 5 is reported in Table 4, and the relevant F-tests are presented in Table 5. There was a robust main effect for Stimulus, indicating that items varied in their average JOL at Trial 2 (range: 54.31% - 84.58%). All other effects control for these average between-item differences in perceived degree of learning. The intercept was defined by the average JOL confidence of the older age group, given that age was dummy coded with a zero for older adults, one for younger adults. Therefore each of the reported age-related effects in Table 4 can be interpreted as the differential consequence of being a younger adult. With that in mind, the average Trial 2 JOL for older adults was approximately 70%, t(168) = 18.73, p < .001. The age difference (approximately 4%, favoring younger adults) did not approach statistical reliability (p = .52, d = 0.14), indicating similar levels of post-study confidence in degree of learning for both age groups in Trial 2. This is useful because any age differences in factors influencing Trial 2 JOLs cannot be attributed to scale sensitivity differences (as might be the case if young and old responses tended to occupy different ranges of the JOL scale).

Table 4.

Multi-Level Regression Predicting Trial 2 Judgments of Learning for Younger and Older Adults

Effect Estimate SE
Intercept 69.55 3.71
Age (younger adults) 3.88 5.97
Trial 1 study time (within subjects) 0.01 0.06
Trial 1 study time (between subjects) −0.22 0.41
Trial 1 JOL (within subjects) 0.17 0.03
Trial 1 JOL (between subjects) 0.55 0.14
Trial 1 recognition accuracy (within subjects) 8.55 1.62
Trial 1 recognition accuracy (between subjects) 23.91 13.97
Trial 1 CJ (within subjects) 0.13 0.04
Trial 1 CJ (between subjects) 0.44 0.17
Trial 1 recognition RT (within subjects) 0.06 0.14
Trial 1 recognition RT (between subjects) 3.40 1.80
Trial 1 recognition RT estimate (within subjects) −0.63 0.47
Trial 1 recognition RT estimate (between subjects) −1.79 1.64
Trial 2 study time (within subjects) −0.35 0.07
Trial 2 study time (between subjects) −1.24 1.11
Trial 1 study time (within subjects) × Age −0.02 0.08
Trial 1 study time (between subjects) × Age −0.12 0.59
Trial 1 JOL (within subjects) × Age −0.07 0.03
Trial 1 JOL (between subjects) × Age −0.33 0.18
Trial 1 recognition accuracy (within subjects) × Age 3.47 2.19
Trial 1 recognition accuracy (between subjects) × Age −3.50 25.34
Trial 1 CJ (within subjects) × Age 0.01 0.05
Trial 1 CJ (between subjects) × Age −0.09 0.32
Trial 1 recognition RT (within subjects) × Age 0.01 0.38
Trial 1 recognition RT (between subjects) × Age −4.79 4.58
Trial 1 recognition RT estimate (within subjects) × Age −0.42 0.73
Trial 1 recognition RT estimate (between subjects) × Age 1.46 2.86
Trial 2 study time (within subjects) × Age −0.31 0.17
Trial 2 study time (between subjects) × Age 2.00 1.40

Note: JOL = judgment of learning; CJ = confidence judgment. Temporal measures are scaled in seconds. Effects associated with specific word-pair cues were not included in this table to conserve space.

Table 5.

F-Tests for Multi-Level Regression Fixed Effects

Effect df F p
Word-pair Stimulus 5076 2.40 <.001
Age (younger adults) 89.4 0.42 0.517
Trial 1 study time (within subjects) 5063 0.02 0.885
Trial 1 study time (between subjects 89.6 0.93 0.339
Trial 1 JOL (within subjects) 5120 66.73 <.001
Trial 1 JOL (between subjects) 88.9 17.17 <.001
Trial 1 recognition accuracy (within subjects) 73.9 87.62 <.001
Trial 1 recognition accuracy (between subjects) 87.4 3.06 0.084
Trial 1 CJ (within subjects) 89.8 34.45 <.001
Trial 1 CJ (between subjects) 87.3 6.24 0.014
Trial 1 recognition RT (within subjects) 4818 0.12 0.727
Trial 1 recognition RT (between subjects) 88.4 0.19 0.664
Trial 1 recognition RT estimate (within subjects) 4821 5.22 0.022
Trial 1 recognition RT estimate (between subjects) 88.7 0.55 0.462
Trial 2 study time (within subjects) 5069 35.28 <.001
Trial 2 study time (between subjects) 89.1 0.12 0.73
Trial 1 study time (within subjects) × Age 5065 0.09 0.765
Trial 1 study time (between subjects) × Age 89.6 0.04 0.839
Trial 1 JOL (within subjects) × Age 5121 5.19 0.023
Trial 1 JOL (between subjects) × Age 88.9 3.28 0.074
Trial 1 recognition accuracy (within subjects) × Age 73 2.51 0.118
Trial 1 recognition accuracy (between subjects) × Age 87.4 0.02 0.891
Trial 1 CJ (within subjects) × Age 89.1 0.02 0.878
Trial 1 CJ (between subjects) × Age 87.3 0.09 0.767
Trial 1 recognition RT (within subjects) × Age 4835 0.00 0.98
Trial 1 recognition RT (between subjects) × Age 88.4 1.10 0.298
Trial 1 recognition RT estimate (within subjects) × Age 4794 0.33 0.567
Trial 1 recognition RT estimate (between subjects) × Age 88.7 0.26 0.61
Trial 2 study time (within subjects) × Age 5074 3.44 0.064
Trial 2 study time (between subjects) × Age 89.1 2.04 0.156

Note: JOL = judgment of learning; CJ = confidence judgment. Temporal measures are scaled in seconds.

Within-person JOL effects indicate the degree to which items vary in JOLs between Trial 1 and Trial 2. For each percentage increase in an item’s Trial 1 JOL, its Trial 2 JOL was increased by approximately 0.17% for older adults and approximately 0.10% for younger adults, t(5121) = −2.28, p < .001, d = 0.40, a medium-sized effect (Cohen, 1988). Given that mean item difficulty is controlled for by the Stimulus factor in the regression equation, this effect suggests that older adults’ JOL for each item was more stable across trials than younger adults’ item JOL. Individual differences in mean JOLs were also relatively stable across trials. For each percentage increase in a person's average Trial 1 JOL, their average Trial 2 JOL increased by approximately 0.55%, t(88.9) = 3.97, p < .001, d = 0.84. A trend reflecting a smaller effect for younger adults (an increase of .22% in a person’s Trial 2 JOL for each percent increase in average Trial 1 JOL), also indicated more stability in post-study confidence for older adults, p = .07, d = 0.06, although the size of the effect was small (Cohen, 1988). Irrespective of this trend, a person’s average confidence in their initial learning was a robust predictor of their post-study confidence in Trial 2 for all participants.

Despite some degree of stability, there was evidence of within-person and between-person changes in JOLs across the two study-test trials. As predicted by the MPT hypothesis, Trial 1 recognition accuracy had a substantial impact on Trial 2 JOL, such that items with accurate Trial 1 recognition responses were given 8.6% higher Trial 2 JOLs, t(69.8) = 5.30, p < .001, d = 1.27, than items that had not been correctly recognized. This MPT effect did not differ reliably between age groups, p = .12, d = 0.34.

Several relationships emerged that showed MPT in the recognition memory domain to be broader than just a predictive relationship of memory success to subsequent JOLs. First and foremost, Trial 1 recognition test confidence predicted JOLs independently of recognition test accuracy. For each percent increase in an item's Trial 1 CJ, the item’s Trial 2 JOL increased by approximately 0.13% (t(90.1) = 3.70, p < .001, d = 0.78). At the between-person level, each percent increase in a person's average Trial 1 CJ, their average Trial 2 JOL increased by approximately 0.44% (t(87.7) = 2.56, p < .05, d = 0.55). This relationship emerged despite the substantial correlation of CJs with correct recognition responses (CJs for correct recognition responses were 37% higher than CJs for incorrect responses).

Second, reliable random effects for recognition accuracy and CJs were found (see Table 3), indicating reliable individual differences in the relationships between these two variables and JOLs (irrespective of the mean-level correlations and age interactions otherwise reported). The correlation between each predictor and Trial 2 JOL was therefore higher for some participants than for others, which suggests that some participants weighed accuracy cues more heavily than CJs when constructing subsequent JOLs, and vice versa. Although it is possible that some participants increased their emphasis on both recognition accuracy and CJs at the expense of other measured predictors (e.g., actual or estimated RT), we did not detect the presence of any further individual differences (i.e., random effects), which would seem to indicate that this give-and-take process was limited to recognition accuracy and CJs.

Third, the participant-estimated latency of recognition test responses also influenced JOLs. For each perceived one-second increase in item-level estimated Trial 1 recognition RT, the associated item-level Trial 2 JOL decreased by approximately 0.63%, t(4819) = −1.33, p = .184, d = 0.04. Higher perceived recognition fluency was associated with an increase in subsequent JOL confidence for relevant items, however, unlike the pairing of CJs and objective recognition response accuracy, objective recognition response RT was not also a reliable predictor of subsequent JOLs in our model. A possible explanation for this discrepancy is that the influence of objective RTs on subsequent JOLs may have been mediated by other cues present in the model, thereby decreasing the direct association so much so that it was no longer a reliable predictor, whereas the effect of objective recognition response accuracy is so powerful that it remained in the model even after controlling for other cues with which it is correlated (e.g., CJs; see Hines, et al., 2009, for an example of this phenomenon in a similar associative recognition task).

In sum, the regression model indicated that multiple aspects of prior recognition test outcomes influenced Trial 2 JOLs, consistent with a multiple-cue utilization perspective.

Perhaps the most critical evidence for multiple cue utilization involved the relationship of Trial 2 encoding experiences on JOLs, independent of effects arising from Trial 1 recognition test experience. At the within-person level, Trial 2 self-paced study time also influenced Trial 2 JOLs. For each one-second increase in Trial 2 item-level study, the associated JOL decreased by 0.35%, t(5021) = −5.26, p < .001, d = 0.15.

The most critical effect for evaluating age differences concerned the relation of Trial 2 study time to Trial 2 JOLs. Both younger and older adults exhibited higher confidence for their item memory as they invested more time studying each item at Trial 2, independent of the recognition test influences. However, we could not merely accept the null hypothesis of age equivalence given a statistical trend for an Age X Trial 2 Study Time interaction (p < .10). The trend reflects a larger decrease for subsequent JOLs for younger adults (.66%) than for older adults (.35%) for each additional second of study time, d = 0.09, 95% confidence interval [-0.02, 0.64]. By Cohen’s (1988) standards, this is at best a small effect, and we conclude that any age difference in the influence of Trial 2 encoding fluency is relatively trivial. To make sure that an age difference at Trial 2 had not been suppressed by including Trial 1 Study Time in the model, we re-ran the model without the Trial 1 variable in the equation. The targeted Age X Trial 2 Study Time interaction was not affected.

Contrary to a resource-limitation hypothesis about age differences, both younger and older adults’ JOLs were influenced by the multiple cues we analyzed, as manifested in a general lack of age-related interactions. Even when trends toward age differences were present, the associated effect sizes were generally quite small. To clarify this point, the mean effect size for all age-related interactions was 0.12 (SD = .14), ranging from 0.01 to 0.40.

Discussion

The present study shows that both younger and older adults’ JOLs are influenced by multiple metacognitive cues that can act both simultaneously and independently from one-another. Despite the fact that multiple cues influence JOLs, we saw little indication that older adults are deficient in multiple-cue utilization, consistent with earlier cued-recall evidence (e.g., Tauber & Rhodes, 2012). One might expect older adults, who typically have been out of a formal educational setting for decades, to generate JOLs differently than our sample of younger adult students, who likely have more recent high-stakes experience in doing so. Instead, older adults showed similar MPT effects, and most critically, their encoding fluency at Trial 2 had an additional effect on JOLs -- the evidence for multiple cue utilization.

Regarding encoding fluency, this study shows that previous findings indicating age similarity in encoding fluency effects (Robinson et al., 2006) apparently extend to multi-trial JOLs as well. Granted, we observed a trend for age differences in the magnitude of the effect of self-terminated study time on JOLs, but the associated effect size was small, and a reliable relationship of encoding fluency on Trial 2 JOLs was detected in both age groups. It is clear from our data that researchers typically find null effects of age on JOL estimation because younger and older adults use available cues in a similar way to construct JOLs.

The analysis revealed an interesting pattern that underscores the value of using multi-level regression models to study multiple-cue influences on JOLs. Reliable individual differences (random effects) were found in the effects of both recognition accuracy and CJs on Trial 2 JOLs. This finding suggests the intriguing possibility that individuals vary in profiles of reliance (the subjective weight they placed upon) on recognition outcomes and CJs as cues when making JOLs. This finding warrants additional research to try to tease apart the conditions under which people may trade off utilization of these cues. For instance, it could be the case that individuals with higher proportions of strong, accurate recollective experiences in associative recognition attend more to recognition accuracy as a cue, whereas persons susceptible to high-confidence mis-recollections (Shing et al., 2009; Wong et al., 2012) show a greater reliance on CJs over recognition memory accuracy. Given older adults’ heightened susceptibility to this type of recognition memory illusion, it would be useful to import JOLs into tasks that manipulate false memory in recognition memory contexts (e.g., Fandakova, Shing, & Lindenberger, 2013; Jacoby & Rhodes, 2006) while attending explicitly to the measurement of both individual differences and person X task interaction effects. It is likely that age differences in retrieval monitoring accuracy could have differential impact on JOLs in tasks that produce high rates of memory illusions. For instance, Daniels, Toth, & Hertzog (2009) found that older adults’ JOLs were less predictive of subsequent recollection (remember versus know experiences) in a standard single-trial yes-no recognition task (see also Soderstrom, McCabe & Rhodes, 2012).

Furthermore, some older adults (e.g., individuals with preclinical levels of mild cognitive impairment) may be more susceptible to these effects (Guerdoux, Dressaire, Martin, Adam, & Brouillet, 2012). Given the lack of age differences in the impact of recognition accuracy and CJs on subsequent JOLs in the present data, despite small differences in mean-level recognition memory accuracy and confidence, these effects appear to be minimal in the current sample. This could in part be a consequence of self-paced study, which may have facilitated more accurate recollective experiences in both age groups.

The relative lack of age differences in how cues influence JOLs in this study fits well with the larger literature suggesting spared monitoring of encoding in old age (Hertzog et al., 2010; Hertzog & Dunlosky, 2011). Given that we have extended this finding to the multiple-cue context of multi-trial learning, it seems that reduced-resources views of older adults’ JOL accuracy are not well supported by the present data. However, other explanations are possible that could still be reconciled with a reduced resources view of some aspects of metacognitive monitoring. As suggested by Tauber and Rhodes (2012), it could be the case that JOLs simply do not place strong demands on cognitive resources, being based more on relatively automatic or unconscious influences of cues on confidence (see also Shea et al., 2014). These influences could enable monitoring of multiple cues despite a primary focus of conscious attention on the explicit search for information about the past test as envisioned in the MPT heuristic (Serra & Ariel, 2014). Alternatively, age differences in monitoring accuracy might emerge when individuals are attempting to monitor learning on-line or in tasks with high concurrent demands on cognitive control. In effect, the measurement context of self-paced JOLs may remove processing demands during judgments that constrain active monitoring or monitoring accuracy in at least some task contexts. In any case, the present findings add to a picture of spared metacognitive monitoring of learning using standard JOL measurement techniques, despite strong age effects on what is being monitored -- associative learning itself (e.g., Shing, Werkle-Bergner, Li, & Lindenberger, 2008; Old & Naveh-Benjamin, 2008).

These results have potential implications for the use of metacognitive tasks in neuropsychological assessment of older adults. Although early work on aging, fluid intelligence, and working memory supported a generic argument that frontal lobe deterioration could be responsible for age-related cognitive decline (e.g., West, 1996), recent work in cognitive neuroscience indicates that differentiating sub-regions in prefrontal cortex is critical for understanding cognitive control, executive function, and metacognition (e.g., Speer, Jacoby, & Braver, 2003; Fleming, Huijgen, & Dolan, 2012). Research has shown that areas of anterior and medial prefrontal cortex, particularly ventromedial prefrontal cortex (vmPFC), are important substrates for supporting JOLs (Do Lam et al., 2012; Kao, Davis, & Gabrieli, 2005) and other metacognitive judgments (e.g., Chua, Schacter, & Sperling, 2009a). Older and younger adults activate similar brain regions when making metacognitive judgments (Chua, Schacter, & Sperling, 2009b), including vmPFC and anterior prefrontal cortex, although the latter may be more important for metacognition about percepts than memory (Fleming, Huijgen, & Dolan, 2014). Structural MRI evidence suggests relative sparing of vmPFC volume, compared to other frontal regions such as dorsolateral prefrontal cortex (e.g., Raz et al, 2005, 2010). Most neuropsychological tests of frontal function such as the Wisconsin card sorting test focus on tasks that require working memory and task set maintenance and seem to be more dependent on intact dorsolateral prefrontal regions (e.g., Yuan & Raz, 2014) and are sensitive to age (Carp, Gmeindl, & Reuter-Lorenz, 2010; Toepper, et al., 2014; Salthouse & Davis, 2006). Metacognitive judgments are also sensitive to frontal damage as revealed by neuropsychological tests (e.g., Bastin et al., 2012; Pannu, Kaszniak, & Rapczak, 2005), but often without specificity as to localization of frontal damage. The fact that JOL accuracy is well preserved in old age in a number of associative memory tasks suggests that impairments in JOL accuracy, when they occur, would indicate likely pathology in vmPFC and related frontal network connections that are distinct from the lateral areas important for working memory and the control of memory encoding and retrieval (e.g., Badre & Wagner, 2007; Levy & Wagner, 2013). Future research could consider how task batteries that include metacognitive judgments like JOLs could help elucidate patterns of abnormal function in brain regions in older adults, differentiating consequences of localized frontal pathology from the more benign effects of normal aging on metacognitive monitoring.

Acknowledgements

This research was supported by a grant from the National Institute on Aging, one of the National Institutes of Health (R01 AG024485). Jarrod C. Hines is now at the University of Wisconsin – Eau Claire. We would like to extend thanks to the following personnel for their assistance with subject recruitment and data collection: Teri Boutot, Bethany Geist, Devaki Kumarhia, Colin Malone, Melissa McDonald, and Alisha Monteiro.

Footnotes

Author Note

Jarrod C. Hines, School of Psychology, Georgia Institute of Technology; Christopher Hertzog, School of Psychology, Georgia Institute of Technology; Dayna R. Touron, Department of Psychology, University of North Carolina Greensboro

References

  1. Ariel R, Dunlosky J. The sensitivity of judgments-of-learning resolution to past test performance, new learning, and forgetting. Memory & Cognition. 2011;39:171–184. doi: 10.3758/s13421-010-0002-y. doi: 10.3758/s13421-010-0002-y. [DOI] [PubMed] [Google Scholar]
  2. Ariel R, Hines JC, Hertzog C. Test framing generates a stability bias for predictions of learning by causing people to discount their learning beliefs. Journal of Memory and Language. 2014;75:181–198. doi: 10.1016/j.jml.2014.06.003. doi: 10.1016/j.jml.2014.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Badre D, Wagner AD. Left ventrolateral prefrontal cortex and the cognitive control of memory. Neuropsychologia. 2007;45:2883–3190. doi: 10.1016/j.neuropsychologia.2007.06.015. doi: 10.1016/j.neuropsychologia.2007.06.015. [DOI] [PubMed] [Google Scholar]
  4. Bastin C, Feyers D, Souchay C, Guillaume B, Pepin J, Lemaire C, Degueldre C, Collette F, Salmon E. Frontal and posterior cingulate metabolic impairment in the behavioral variant of frontotemporal dementia with impaired autonoetic consciousness. Human Brain Mapping. 2012;33:1268–1278. doi: 10.1002/hbm.21282. Doi: 10.1002/hbm.21282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Begg I, Duft S, Lalonde P, Melnick R, Sanvito J. Memory predictions are based on ease of processing. Journal of Memory and Language. 1989;28:610–632. doi: 10.1016/0749-596X(89)90016-8. [Google Scholar]
  6. Benjamin AS, Bjork RA, Schwartz BL. The mismeasure of memory: When retrieval fluency is misleading as a metamnemonic index. Journal of Experimental Psychology: General. 1998;127:55–68. doi: 10.1037//0096-3445.127.1.55. doi: 10.1037//0096-3445.127.1.55. [DOI] [PubMed] [Google Scholar]
  7. Besken M, Mulligan NW. Easily perceived, easily remembered? Perceptual interference produces a double dissociation between metamemory and memory performance. Memory and Cognition. 2013;41:897–903. doi: 10.3758/s13421-013-0307-8. doi: 10.3578/s13421-013-0307-8. [DOI] [PubMed] [Google Scholar]
  8. Bjork RA, Dunlosky J, Kornell N. Self-regulated learning: Beliefs, techniques, and illusions. Annual Review of Psychology. 2013;64:417–444. doi: 10.1146/annurev-psych-113011-143823. doi: 10.1146/annurev-psych-113011-143823. [DOI] [PubMed] [Google Scholar]
  9. Carp J, Gmeindl L, Reuter-Lorenz PA. Age differences in the neural representation of working memory revealed by multi-voxel pattern analysis. Frontiers in Human Neuroscience. 2010;4 doi: 10.3389/fnhum.2010.00217. Article 217. doi: 10.3389/fnhum.2010.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cerella J, Onyper SV, Hoyer WJ. The associative memory basis of cognitive skill learning. Psychology & Aging. 2006;21:483–498. doi: 10.1037/0882-7974.21.3.483. doi: 10.1037/0882-7974.21.3.483. [DOI] [PubMed] [Google Scholar]
  11. Chua EF, Schacter DL, Sperling RA. Neural correlates of metamemory: A comparison of feeling-of-knowing and retrospective confidence judgments. Journal of Cognitive Neuroscience. 2009a;21:1751–65. doi: 10.1162/jocn.2009.21123. doi:10.1162/jocn.2009.21123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chua EF, Schacter DL, Sperling RA. Neural basis for recognition confidence in younger and older adults. Psychology and Aging. 2009b;24:139–53. doi: 10.1037/a0014029. doi: 10.1037/a0014029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Cohen J. Statistical power analysis for the behavioral sciences. 2nd Erlbaum; Hillsdale, NJ: 1988. [Google Scholar]
  14. Cohn M, Moscovitch M. Dissociating measures of associative memory: Evidence and theoretical implications. Journal of Memory and Language. 2007;57:437–454. doi: 10.1016/j.jml.2007.06.006. [Google Scholar]
  15. Craik FIM, Hay JF. Aging and judgments of duration: Effects of task complexity and method of estimation. Perception & Psychophysics. 1999;61:549–560. doi: 10.3758/bf03211972. doi: 10.3758/BF03211972. [DOI] [PubMed] [Google Scholar]
  16. Daniels K, Toth JP, Hertzog C. A role for recollection in the accuracy of judgments of learning. Psychology and Aging. 2009;24:494–500. doi: 10.1037/a0015269. doi: 10.1037/a0015269. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Do Lam ATA, Axmacher N, Fell J, Staresina BP, Gauggel S, Wagner T, Olligs J, Weis S. Monitoring the mind: The neurocognitive correlates of metamemory. PLoS ONE. 2012;7 doi: 10.1371/journal.pone.0030009. Article e30009. doi: 10.1371/journal.pone.0030009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Dodson CS, Bawa S, Krueger LE. Aging, metamemory and high confidence errors: A misrecollection account. Psychology and Aging. 2007;22:122–133. doi: 10.1037/0882-7974.22.1.122. doi: 10.1037/0882-7974.22.1.122. [DOI] [PubMed] [Google Scholar]
  19. Dunlosky J, Hertzog C. Updating knowledge about strategy effectiveness: A componential analysis of learning about strategic effectiveness from task experience. Psychology and Aging. 2000;15:462–474. doi: 10.1037//0882-7974.15.3.462. doi: 10.1037/0882-7974.15.3.462. [DOI] [PubMed] [Google Scholar]
  20. Dunlosky J, Metcalfe J. Metacognition. Sage Publications; Thousand Oaks, CA: 2009. [Google Scholar]
  21. Enders CK, Tofighi D. Centering predictor variables in cross-sectional multi-level models: A new look at an old issue. Psychological Methods. 2007;12:121–138. doi: 10.1037/1082-989X.12.2.121. doi: 10.1037/1082-989X.12.2.121. [DOI] [PubMed] [Google Scholar]
  22. Fandakova Y, Shing Y, Lindenberger U. Differences in binding and monitoring mechanisms contribute to lifespan age differences in false memory. Developmental Psychology. 2013;49:1822–1832. doi: 10.1037/a0031361. doi: 10.1037/a0031361. [DOI] [PubMed] [Google Scholar]
  23. Finn B. Framing effects on metacognitive monitoring and control. Memory & Cognition. 2008;36:813–821. doi: 10.3758/mc.36.4.813. doi: 10.3758/MC.36.4.813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Finn B, Metcalfe J. The role of memory for past test in the underconfidence with practice effect. Journal of Experimental Psychology; Learning, Memory, and Cognition. 2007;33:238–244. doi: 10.1037/0278-7393.33.1.238. doi: 10.1037/0278-7393.33.1.238. [DOI] [PubMed] [Google Scholar]
  25. Finn B, Metcalfe J. Judgments of learning are influenced by memory for past test. Journal of Memory and Language. 2008;58:19–34. doi: 10.1016/j.jml.2007.03.006. doi: 10.1016/j.jml.2007.03.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Fleischman DA, Wilson RS, Gabrieli JD, Beinias JL, Bennet DA. A longitudinal study of implicit and explicit memory in old persons. Psychology and Aging. 2004;19:617–625. doi: 10.1037/0882-7974.19.4.617. doi: 10.1037/0882-7974.19.4.617// [DOI] [PubMed] [Google Scholar]
  27. Fleming SM, Huijgen J, Dolan RJ. Prefrontal contributions to metacognition in perceptual decision making. The Journal of Neuroscience. 2012;32:6117–6125. doi: 10.1523/JNEUROSCI.6489-11.2012. doi: 10.1523/JNEUROSCI.6489-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Fleming SM, Ryu J, Golfinos JG, Blackmon KE. Domain-specific impairment in metacognitive accuracy following anterior prefrontal lesions. Brain: A Journal of Neurology. 2014;137:2811–2822. doi: 10.1093/brain/awu221. doi: 10.1037/t15169-000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Guerdoux E, Dressaire D, Martin S, Adam S, Brouillet D. Habit and recollection in healthy aging, mild cognitive impairment, and Alzheimer's disease. Neuropsychology. 2012;26:517–533. doi: 10.1037/a0028718. doi: 10.1037/a0028718. [DOI] [PubMed] [Google Scholar]
  30. Hedden T, Yoon C. Individual differences in executive processing predict susceptibility to interference in verbal working memory. Neuropsychology. 2006;20:511–528. doi: 10.1037/0894-4105.20.5.511. doi: 10.1037/0894-4105.20.5.511. [DOI] [PubMed] [Google Scholar]
  31. Hertzog C, Dunlosky J. Metacognition in later adulthood: Spared monitoring can benefit older adults’ self-regulation. Current Directions in Psychological Science. 2011;20:167–173. doi: 10.1177/0963721411409026. doi: 10.1177/0963721411409026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Hertzog C, Dunlosky J, Robinson AE, Kidder DP. Encoding fluency is used for judgments about learning. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2003;29:22–34. doi: 10.1037//0278-7393.29.1.22. doi: 10.1037/0278-7393.29.1.22. [DOI] [PubMed] [Google Scholar]
  33. Hertzog C, Hines JC, Touron DR. Judgments of learning are influenced by multiple cues in addition to memory for past test. Archives of Scientific Psychology. 2013;1:23–32. doi: 10.1037/arc0000003. doi: 10.1037/arc0000003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Hertzog C, Kidder DP, Powell-Moman A, Dunlosky J. Aging and monitoring associative learning: Is monitoring accuracy spared or impaired? Psychology and Aging. 2002;17:209–225. doi: 10.1037/0882-7974.17.2.209. [PubMed] [Google Scholar]
  35. Hertzog C, Price J, Burpee A, Frentzel WJ, Feldstein S, Dunlosky J. Why do people show minimal knowledge updating with task experience: Inferential deficit or experimental artifact? Quarterly Journal of Experimental Psychology. 2009;62:155–173. doi: 10.1080/17470210701855520. doi: 10.1080/17470210701855520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Hertzog C, Sinclair SM, Dunlosky J. Age differences in monitoring and learning: Cross-sectional evidence of spared resolution across the adult life span. Developmental Psychology. 2010;46:939–948. doi: 10.1037/a0019812. doi: 10.1037/a0019812. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Hertzog C, Touron DR. Age differences in memory retrieval shift: Governed by feelling-of-knowing? Psychology and Aging. 2011;26:647–660. doi: 10.1037/a0021875. doi: 10.1037/a0021875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Hertzog C, Touron DR, Hines JC. Does a time monitoring deficit influence older adults’ delayed retrieval shift during skill acquisition? Psychology and Aging. 2007;22:607–624. doi: 10.1037/0882-7974.22.3.607. doi: 10.1037/0882-7974.22.3.607. [DOI] [PubMed] [Google Scholar]
  39. Hines JC, Touron DR, Hertzog C. Metacognitive influences on study time allocation in an associative recognition task: An analysis of adult age differences. Psychology and Aging. 2009;24:462–475. doi: 10.1037/a0014417. doi: 10.1037/a0014417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Kao Y, Davis ES, Gabrieli JDE. Neural correlates of actual and predicted memory formation. Nature Neuroscience. 2005;8:1776–1783. doi: 10.1038/nn1595. doi: 10.1038/nn1595. [DOI] [PubMed] [Google Scholar]
  41. Koriat A. Monitoring one’s own knowledge during study: A cue-utilization approach to judgments of learning. Journal of Experimental Psychology: General. 1997;126:349–370. doi: 10.1037//0096-3445.126.4.349. [Google Scholar]
  42. Koriat A, Bjork RA. Illusions of competence during study can be remedied by manipulations that enhance learners' sensitivity to retrieval conditions at test. Memory & Cognition. 2006;34:959–972. doi: 10.3758/bf03193244. doi: 10.3758/BF03193244. [DOI] [PubMed] [Google Scholar]
  43. Kornell N, Rhodes MG, Castel AD, Tauber SK. The ease-of-processing heuristic on the stability bias: Dissociating memory, memory beliefs, and memory judgments. Psychological Science. 2011;22:787–794. doi: 10.1177/0956797611407929. doi: 10.1177/0956797611407929. [DOI] [PubMed] [Google Scholar]
  44. Levy BJ, Wagner AD. Measuring memory reactivation with functional MRI: Implications for psychological theory. Perspectives on Psychological Science. 2013;8:72–78. doi: 10.1177/1745691612469031. doi: 10.1177/1745691612469031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Littell RC, Milliken GA, Stroup WW, Wolfinger RD. SAS system for mixed models. 4th SAS Institute, Inc.; Cary, NC: 2000. [Google Scholar]
  46. Logan GD. Toward an instance theory of automatization. Psychological Review. 1988;95:492–597. doi: 10.1037/0033-295X.95.4.492. [Google Scholar]
  47. Miele DB, Finn B, Molden DC. Does easily learned mean easily remembered? It depends on your beliefs about intelligence. Psychological Science. 2011;22:320–324. doi: 10.1177/0956797610397954. doi: 10.1177/0956797610397954. [DOI] [PubMed] [Google Scholar]
  48. Murayama K, Sakaki M, Yan VX, Smith GM. Type I error inflation in the traditional by-participant analysis to metamemory accuracy: A generalized mixed-effects model perspective. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2014;40:1287–1306. doi: 10.1037/a0036914. doi: 10.1037/a0036914. [DOI] [PubMed] [Google Scholar]
  49. Nelson TO. A comparison of the current measures of the accuracy of feeling-of-knowing predictions. Psychonomic Bulletin. 1984;95:109–133. doi: 10.1037/0033-2909.95.1.109. [PubMed] [Google Scholar]
  50. Nelson TO. Consciousness and metacognition. American Psychologist. 1996a;51:102–116. doi: 10.1037/0003-066X.51.2.102. [Google Scholar]
  51. Nelson TO. Gamma is a measure of the accuracy of predicting performance on one item relative to another item, not of the absolute performance on an individual item. Applied Cognitive Psychology. 1996b;10:257–260. doi: 10.1002/(SICI)1099-0720(199606)10:3<257::AID-ACP400>3.0.CO;2-9. [Google Scholar]
  52. Nelson TO, Narens L. Metamemory: A theoretical framework and new findings. In: Bower G, editor. The psychology of learning and motivation: Advances in research and theory. Vol. 26. Academic Press; San Diego, CA: 1990. pp. 125–141. [Google Scholar]
  53. Pannu JK, Kaszniak AW, Rapcsak SZ. Metamemory for faces following frontal lobe damage. Journal of the International Neuropsychological Society. 2005;11:668–676. doi: 10.1017/S1355617705050873. doi: 10.1037/t31298-000. [DOI] [PubMed] [Google Scholar]
  54. Parks CM. The role of noncriterial recollection in estimating recollection and familiarity. Journal of Memory and Language. 2007;57:81–100. doi: 10.1016/j.jml.2007.03.003. doi: 10.1016/j.jml.2007.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Ratcliff R, Thapar A, McKoon G. Effects of aging and IQ on associative memory. Journal of Experimental Psychology: General. 2011;140:464–487. doi: 10.1037/a0023810. doi: 10.1037/a0023810. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Raz N, Lindenberger U, Rodrigue KM, Kennedy KM, Head D, Williamson A, Dahle C, Gerstorf D, Acker JD. Regional brain changes in aging healthy adults: General trends, individual differences, and modifiers. Cerebral Cortex. 2005;15:1679–1689. doi: 10.1093/cercor/bhi044. doi: 10.1093/cercor/bhi044. [DOI] [PubMed] [Google Scholar]
  57. Raz N, Ghisletta P, Rodrigue KM, Kennedy KM, Lindenberger U. Trajectories of brain aging in middle-aged and older adults: Regional and individual differences. Neuroimage. 2010;51:501–511. doi: 10.1016/j.neuroimage.2010.03.020. doi: 10.1016/j.neuroimage.2010.03.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Rhodes MG, Castel AD. Memory predictions are influenced by perceptual information: Evidence for metacognitive illusions. Journal of Experimental Psychology: General. 2008;137:615–625. doi: 10.1037/a0013684. doi: 10.1037/a0013684. [DOI] [PubMed] [Google Scholar]
  59. Robinson AE, Hertzog C, Dunlosky J. Aging, encoding fluency, and metacognitive monitoring. Aging, Neuropsychology, and Cognition. 2006;13:458–478. doi: 10.1080/13825580600572983. doi: 10.1080/13825580600572983. [DOI] [PubMed] [Google Scholar]
  60. Salthouse TA, Davis HP. Organization of cognitive abilities and neuropsychological variables across the lifespan. Developmental Review. 2006;26:31–54. doi: 10.1016/j.dr.2005.09.001. [Google Scholar]
  61. Serra MJ, Ariel R. People use the memory for past-test heuristic as an explicit cue for judgments of learning. Memory and Cognition. 2014 doi: 10.3758/s13421-014-0431-0. Advance online publication. doi: 10.3758/s13421-014-0431-0. [DOI] [PubMed] [Google Scholar]
  62. Shanks LL, Serra MJ. Domain familiarity as a cue for judgments of learning. Psychonomic Bulletin and Review. 2014;21:445–453. doi: 10.3758/s13423-013-0513-1. doi: 10.3758/s13423-013-0513-1. [DOI] [PubMed] [Google Scholar]
  63. Shaughnessy JJ, Zechmeister EB. Memory-monitoring accuracy as influenced by the distribution of retrieval practice. Bulletin of the Psychonomic Society. 1992;30:125–128. doi: 10.3758/BF03330416. [Google Scholar]
  64. Shea N, Boldt A, Bang D, Yeung N, Heyes C, Frith C. Supra-personal cognitive control and metacognition. Trends in Cognitive Science. 2014;18:186–193. doi: 10.1016/j.tics.2014.01.006. doi: 10.1016/j.tics.2014.01.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Shing YL, Werkle-Bergner M, Li S, Lindenberger U. Associative and strategic components of episodic memory: A life-span dissociation. Journal of Experimental Psychology: General. 2008;137:495–513. doi: 10.1037/0096-3445.137.3.495. doi:10.1037/0096-3445.137.3.495. [DOI] [PubMed] [Google Scholar]
  66. Shing YL, Werkle-Bergner M, Li S-C, Lindenberger U. Committing memory errors with high confidence: Older adults do but children don’t. Memory. 2009;17:169–179. doi: 10.1080/09658210802190596. doi: 10.1080/09658210802190596. [DOI] [PubMed] [Google Scholar]
  67. Singer JD. Using SAS PROC MIXED to fit multi-level models, hierarchical models, and individual growth models. Journal of Educational and Behavioral Statistics. 1998;24:323–355. doi: 10.3102/10769986023004323. [Google Scholar]
  68. Snijders TAB, Bosker RJ. Multi-level analysis: An introduction to basic and advanced multi-level modelling. 2nd Sage; London, U.K.: 2012. [Google Scholar]
  69. Soderstrom NC, McCabe DP, Rhodes MG. Older adults predict more recollective experiences than younger adults. Psychology and Aging. 2012;27:1082–1088. doi: 10.1037/a0029048. doi: 10.1037/a0029048. [DOI] [PubMed] [Google Scholar]
  70. Speer NK, Jacoby LJ, Braver TS. Strategy-dependent changes in memory: Effects on behavior and brain activity. Cognitive, Affective, & Behavioral Neuroscience. 2003;3:155–167. doi: 10.3758/cabn.3.3.155. doi: 10.3758/CABN.3.3.155. [DOI] [PubMed] [Google Scholar]
  71. Tauber SK, Rhodes MG. Multiple bases for young and older adults’ judgments of learning in multitrial learning. Psychology and Aging. 2012;26:474–483. doi: 10.1037/a0025246. doi: 10.1037/a0025246. [DOI] [PubMed] [Google Scholar]
  72. Toepper M, Gebhardt H, Bauer E, Beblo T, Gallhofer B, Driessen M, Sammer G. The impact of age on load-related dorsolateral prefrontal cortex activation. Frontiers in Aging Neuroscience. 2014;6 doi: 10.3389/fnagi.2014.00009. Article 9. doi; 10.1037/t00128-000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Undorf M, Erdfelder E. Separation of encoding fluency and item difficulty effects on judgments of learning. The Quarterly Journal of Experimental Psychology. 2013;66:2060–2072. doi: 10.1080/17470218.2013.777751. doi: 10.1080/17470218.2013.777751. [DOI] [PubMed] [Google Scholar]
  74. Wechsler D. Wechsler Adult Intelligence Scale - Revised. Psychological Corporation; New York: 1981. [Google Scholar]
  75. West RL. An application of prefrontal cortex function theory of cognitive aging. Psychological Bulletin. 1996;120:272–292. doi: 10.1037/0033-2909.120.2.272. doi: 10.1037/0033-2909.120.2.272. [DOI] [PubMed] [Google Scholar]
  76. Wong JT, Cramer SJ, Gallo DJ. Age-related reduction in the confidence-accuracy relationship in episodic memory: Effects of recollection quality and retrieval monitoring. Psychology and Aging. 2012;27:1053–1065. doi: 10.1037/a0027686. doi: 10.1037/a0027686. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Yuan P, Raz N. Prefrontal cortex and executive functions in healthy adults: A meta-analysis of structural neuroimaging studies. Neuroscience and Biobehavioral Reviews. 2014;42:180–192. doi: 10.1016/j.neubiorev.2014.02.005. Y. doi: 10.1037/t31298-000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Zachary R. Shipley Institute of Living Scale Revised manual. Western Psychological Services; Los Angeles, CA: 1986. [Google Scholar]
  79. Zacks RT, Hasher L, Li KZH. Human memory. In: Craik FIM, Salthouse TA, editors. The handbook of aging and cognition. Erlbaum; Mahwah, NJ: 2000. pp. 293–357. [Google Scholar]

RESOURCES