Abstract
We tested the hypothesis that feeling of knowing (FOK) after a failed recall attempt is influenced by recalling aspects of the original encoding strategy. Individuals were instructed to use interactive imagery to encode unrelated word pairs. We manipulated item concreteness (abstract versus concrete) and item repetition at study (1 versus 3). Participants orally described the mediator produced immediately after studying each item, if any. After a delay they were given cued recall, made FOK ratings, and attempted to recall their original mediator. Concreteness and item repetition enhanced strategy recall, which had a large effect on FOKs. Controlling on strategy recall reduced the predictive validity of FOKs for recognition memory, indicating that access to original aspects of encoding influenced FOK accuracy. Confidence judgments (CJs) for correctly recognized items covaried with FOKs, but FOKs did not fully track strategy recall associations with CJs, suggesting emergent effects of strategy cues elicited by recognition tests not accessed at the time of the FOK judgment. In summary, cue-generated access to aspects of the original encoding strategy strongly influenced episodic FOK, although other influences are also implicated.
Keywords: feeling of knowing, strategies, imagery, mediation, associative recognition, accessibility
The feeling of knowing is a subjective state of confidence in the availability of information in memory, even when it cannot currently be accessed. Following Hart (1965), empirical studies of feeling-of-knowing have asked individuals to attempt to recall information when prompted by a cue, followed by a rating of confidence in the feeling of knowing (FOK) that scales the likelihood of being able to later recognize the sought-after information. In episodic memory experiments, the cue is usually information paired with the sought-after target during encoding (e.g., study a face-name pair, then try to recall the name when shown the face).
Theoretical bases of FOKs
Metacognitive research largely concerns itself with identifying influences on the magnitude of metacognitive judgments (such as FOKs) and the predictive accuracy of those judgments for subsequent cognitive performance (see Dunlosky & Metcalfe, 2009, for an introductory review). Depending on how they are scaled, these two variables – FOK magnitude and FOK accuracy – are essentially independent of one another. FOK magnitude has been shown to be influenced by multiple variables, including the familiarity of the cue that is used to generate the FOK (e.g., Metcalfe, Schwartz, & Joaquim, 1993) and retrieved information elicited by the cue (the accessibility hypothesis; Koriat, 1995; Koriat & Levy-Sadot, 2001). Predictive accuracy is traditionally defined by FOK resolution -- or the within-person correlation of variation in FOKs for different items with recognition memory outcomes for those items. Typically, resolution is measured by ordinal Goodman-Kruskal gamma correlations of FOKs with recognition accuracy, computed separately for each participant (Gonzalez & Nelson, 1996).1
The accessibility hypothesis on FOKs argues that they are influenced by the amount of information accessed, whether or not it derives from the originally encoded target. We adopt an alternative accessibility-based perspective that individuals construct FOKs based on weighting multiple cues, so that FOK magnitude and resolution depend on the accessed cues that are regarded by the rater as relevant to the criterion outcome. False memories for encoding contexts can influence FOKs and constrain FOK accuracy (Koriat, 1995). Conversely, experimental conditions that increase access to diagnostic cues (i.e., cues that derive from the encoded information in memory and signify later recognition success) will increase FOK resolution when raters base their FOKs upon them (e.g., Schacter & Worling, 1985).
Both FOK magnitude and FOK resolution are influenced by the quality of the original encoding (e.g., Lupker, Harbluk, & Patrick, 1991; T. O. Nelson, Leonesio, Shimamura, Landwehr, & Narens, 1982; Thomas, Bulevich, & Dubois, 2012). For example, both mean FOKs and FOK resolution are higher for items studied with multiple presentations, relative to items studied only once (Carroll & Nelson, 1993; Hertzog, Dunlosky, & Sinclair, 2010). Conversely, divided attention at the time of encoding impairs subsequent FOK resolution (Sacher, Taconnat, Souchay, & Isingrini, 2009).
The typical method of generating FOK resolution — correlating them with recognition memory accuracy — merely contrasts FOKs for correctly recognized items against FOKs for incorrectly recognized items. Thus, it cannot evaluate whether gradations in FOK magnitude are associated with gradations in recollective experiences for correctly recognized items. Functionally, this limitation implies that most of the evidence regarding FOK resolution in the literature to date implicitly concerns discriminating low from high FOKs, which could be driven primarily by what Liu, Su, Xu, and Chan (2007) described as the distinction between “definitely knowing that one doesn't know” versus other FOK states. However, for items that were correctly recognized on the criterion test, Hicks and Marsh (2002) demonstrated that a remember-know judgment after each forced-choice recognition item test correlated with FOKs. This finding showed that FOKs after failed recall tests forecast subsequent recollection experiences during a recognition test.
We have replicated this association of FOKs with remember-know judgments (MacLaverty & Hertzog, 2009) and extended it to confidence judgments for recognition test answers (henceforth, CJs; Hertzog, Dunlosky, et al., 2010). As with Hicks and Marsh's (2002) findings with the remember-know procedure, this correlation is driven by variation in FOKs and CJs within the class of correctly recognized items alone (Eakin, Hertzog, & Harris, in press; Hertzog, Dunlosky, et al., 2010), showing that above-chance FOK-CJ resolution cannot be produced by merely discriminating memory successes from memory errors. In fact, FOKs have no reliable correlation with CJs for items that are incorrectly recognized, consistent with the argument that the FOK-CJ relationship is generated by the degree of encoded cue-target relations that are recollected during the FOK judgment (when the target is absent) and its diagnosticity for later recollective experiences at the time of the recognition test (see also Souchay, Moulin, Clarys, Taconnat, & Isingrini, 2007). Moreover, the effect is observed for different types of stimuli, including verbal paired-associates and a face-name learning task, where faces serve as cues for recall and FOK (Eakin et al., in press). Eakin et al. also showed that the FOK-CJ correlation for correctly recognized name-face pairs is observed for both episodic (previously unknown) and semantic (i.e., normatively famous) faces and names.
This pattern of effects for correctly recognized items validates FOK experiences beyond what can be obtained by the traditional means of discriminating recognition successes from recognition failures. More generally, above-chance FOK-CJ correlations are consistent with the view that the amount and quality of information accessed during an FOK-initiated retrieval search influence gradations in FOKs (Hertzog, Dunlosky, et al., 2010; Koriat, 1995). The present study further establishes and clarifies the connections between FOK states, recognition accuracy, and recognition memory CJs.
Noncriterial Recollection and Strategy Recall
The major goal of this study was to evaluate a hypothesis regarding the diagnostic cues that people can access to enhance FOK accuracy. The noncriterial recollection hypothesis (Brewer, Marsh, Clark-Foos, & Meeks, 2010) is an accessibility view stipulating that FOKs are based in part on retrieving information about either the original encoding context or target features other than the criterion target itself (e.g., Parks, 2007). For example, the participant might recollect emotional reactions to the cue-target combination, or that the target reminded one of a past event, and access to such information is predicted to boost FOK magnitude. Noncriterial recollection could influence FOK magnitude because access to contextual detail about encoding or about features of the target can occur even when people cannot recall the target itself (Cook, Marsh, & Hicks, 2006). Consistent with this hypothesis, Brewer et al. (2010) found that recollection of source context or other item characteristics influences FOKs for unrecalled targets. Thomas, Bulevich, and Dubois (2011) showed that remembering the emotional valence of an unrecalled target increases both FOK magnitudes and FOK resolution. They also showed that explicit instructions to recall target valence prior to the FOK increased the FOK-memory correlation, suggesting that a controlled retrieval search is part of the process of making an accurate FOK.
Unlike previous studies, we tested the noncriterial recollection hypothesis for FOKs and FOK accuracy by focusing on retrieval of encoding strategies that had been generated during study. In particular, during study, individuals were instructed to generate mediators for new associations between normatively unrelated nouns. Immediately after the cued recall attempt, participants were then prompted to recall the mediator that they had originally generated during study. We hypothesized that recall of accurate detail about the original associative mediator would increase FOK magnitudes. Retrieving the original mediator, even when the target itself cannot be accessed at the time of the FOK, was hypothesized to be a potent cue influencing FOKs in standard paired-associate tasks. Given that successful retrieval of the original mediator (vs. unsuccessful retrieval) is also related to memory for the sought after target (Dunlosky et al., 2005), we expected that this cue would also be diagnostic of subsequent recognition performance and hence also boost FOK resolution.
To evaluate the noncriterial recollection hypothesis, we directly estimated the relationship between mediator retrieval during cued recall to subsequent FOKs and their resolution. We assessed strategy recall by using a mediator report and recall method (Dunlosky, Hertzog, & Powell-Moman, 2005) for verbal paired-associate (noun-noun) items. After studying each item, participants described the image (or other mediating strategy) that they had generated, if any. As with the previously cited source-memory experiments (e.g., Brewer, et al., 2010; Cook, et al., 2006), when individuals cannot recall the target during the cued-recall test, they can still for some items report access to aspects of the prior encoding operations, such as partial access to constructed encoding strategies they had generated to form a new association between the paired words (Dunlosky, et al., 2005; Hertzog, Fulton, Mandviwala, & Dunlosky, 2013). Although infrequent, target recall failures even occur when individuals are able to provide verbatim recall of their original encoding strategy. Target recall failures are more likely when only gist-consistent or partial descriptions of the mediator are accessed. Between-item variability in the recall of aspects of original encoding strategies is therefore a candidate source of cues influencing FOKs and FOK resolution in associative memory tasks.
It is also plausible that access to aspects of original encoding would be considered useful information by persons making FOKs, especially when individuals are instructed to use mediational strategies to assist with associative learning. We expected that participants in this study would be likely to deem successful retrieval of original encoding outcomes as diagnostic of future recognition memory success, leading them to use that information when making FOK judgments.
We also experimentally manipulated two variables that were likely to influence the quality of the associative encoding based on strategy use: item concreteness and repetitions. We instructed individuals to use interactive imagery to study normatively unrelated verbal paired associate items (either concrete-concrete [e.g., TICK-SPOON] or abstract-abstract [e.g., LIBERTY-PASSION] items). It is more difficult to generate and retrieve imagery mediators for abstract pairs because imageable tokens must be generated for each abstract concept (Paivio, 2007; Yuille, 1973). Using imagery for abstract pairs is therefore less likely to lead to successful associative recall, in part due to reduced access to the mediator during the test (Hertzog, et al., 2013). Items were presented either once or three times, given that this manipulation influences memory, FOK magnitudes, and FOK accuracy (Hertzog, Dunlosky, et al., 2010; T. O. Nelson, et al., 1982). Cook et al. (2006) also demonstrated that repeated presentations increase the likelihood of source recollection in the absence of target recall.
After a one-week delay following original encoding (to bring recognition memory performance for thrice-presented items off ceiling; see T. O. Nelson, et al., 1982), participants returned to the lab for the recall test. They were cued with one word from a pair (e.g., TICK) and asked to recall its associate. We also asked them to provide FOKs and report what they could remember about the mediator they had generated during encoding.
Research Hypotheses
The critical questions for this experiment concern the relations of strategy recall to FOKs and CJs. Our test of the noncriterial recollection hypothesis stipulates three effects regarding prediction of recognition memory performance by FOKs: (1) remembering the original mediator, in whole or in part (which we shall refer to as strategy recall) will increase FOK magnitudes relative to trials where nothing about the mediator can be recalled; (2) strategy recall will predict recognition memory for unrecalled items; and (3) strategy recall will statistically account for, or mediate (MacKinnon, 2008) the relationship of FOKs to recognition memory for unrecalled items. To foreshadow our results, this experiment shows that manipulating these variables affects noncriterial access to original encoding strategies, which in turn influence FOK magnitudes and accounts for the prediction of recognition memory by FOKs.
With respect to CJs, the hypotheses of interest were that (4) FOKs would predict CJs for correctly recognized items; (5) strategy recall would also predict CJs for those items; and (6) strategy recall would account for the relationship of FOKs to CJs. However, an alternative possibility was that additional (unmeasured) cues besides strategy recall were accessed when making FOKs, so that both FOKs and strategy recall would independently predict correct-recognition CJs.
Statistical Approach
We tested these hypotheses by using multi-level regression models to evaluate simultaneously influences of multiple cues on FOKs, recognition memory accuracy, and CJs. This statistical procedure has been successfully employed to evaluate multiple variables' influences on judgments of learning (e.g., Hertzog, Sinclair, & Dunlosky, 2010; Hines, Touron, & Hertzog, 2009). For instance, Tauber and Rhodes (2012) used multi-level regression to show that a memory-for-past-test heuristic is only one of multiple influences on multi-trial JOLs (see also Hertzog, Hines, & Touron, in press).
This approach has three major advantages. First, one generates regression models that estimate magnitudes of influence of multiple cues on metacognitive judgments, including the proper standard errors of estimate for these effects. Second, one can evaluate whether a cue (such as recollection of original encoding strategies) statistically mediates the relation of other cues and experimentally manipulated variables to metacognitive judgments and to memory outcomes.2 We used multi-level regression to test whether there are effects of FOKs on recognition memory and CJs that are statistically independent of recall of encoding strategies at the time of the cued-recall test. This approach can be used to falsify the hypothesis that recall of original encoding outcomes is a sufficient explanation of both FOKs and their predictive validity (for either recognition memory or for CJs), in favor of the alternative hypothesis that there are multiple influences, including recalled encoding strategies, on FOKs.3
Third, the fact that FOKs are evaluated for unrecalled items implies that each person has in principle a different set of unrecalled items that remain for further analysis of FOK magnitudes and FOK accuracy. In the present experiment, participants will also fail to generate a mediator for some items, further reducing the available item pool on an idiosyncratic basis. The possible biasing influences of residual item sets are typically ignored in metacognitive research; multi-level models that use item as an explicit factor in the analysis help to control for these differences and insure that predictors of FOKs are not an artifact of which items survive the screening criteria.
Method
Participants
Undergraduate students at Kent State University and the Georgia Institute of Technology received course credit for participating in the study. Forty-five young adults were included in the analyses. A total of 69 students were recruited for the study, of which 15 did not return for the second session and nine did not recall enough mediators (5% minimum) to be included in the analysis.
Materials
A list of 80 noun pairs, 40 concrete and 40 abstract (see online supplemental materials, Appendix A), were chosen from University of South Florida Free Association Norms (D. L. Nelson, McEvoy, & Schreiber, 1998) and the MRC Psycholinguistic Database (Fearnley, 1997) and verified with ListCheck Pro 1.2 program (Eakin, 2010).
Design
The experiment was a 2 (Concreteness: concrete, abstract) X 2 (Presentation: 1, 3) within-subjects design.
Procedure
Items were presented in a random order at study, either once or three times (under the constraint that an item could not be presented twice in a row), for 30 seconds each. The instructions acknowledged that multiple encoding strategies exist, but participants were instructed to generate an interactive image if possible. Participants practiced using interactive imagery with three concrete and three abstract word pairs. Then they were presented with the experimental list and were prompted to give an oral description of the imagery mediator, which was digitally recorded, after each item studied. After a seven-day delay, participants returned to the lab and went through the main task, which included a phase of cued-recall, FOK, and encoding strategy report, followed by a recognition memory phase, all of which were self-paced. During cued-recall, individuals typed in the associated target words after being shown the cues, which were presented in a random order. They then were again shown the cue and provided an FOK on a 0-100% confidence scale. After the FOK, they were prompted to report anything they could recall about the strategy they had generated at study. Target recall was scored as correct if the first three letters of the typed response were correct. This method is fast and automated, yet it has high convergent validity with other measures of recall, such as coded oral recall protocols (e.g., Dunlosky, et al., 2005).
After they had completed cued-recall, FOKs, and strategy recall reports for all items, they were given a four-alternative forced-choice recognition test, in which the cue was presented with its target and three randomly selected targets from other pairs, under the constraint that each target was used equally often as a recognition lure. After each recognition test probe, individuals rated their confidence in the correctness of their selection, rated on a 0-100% scale. The FOK and CJ procedures were modeled after (Hertzog, Dunlosky, et al., 2010) which can be accessed for additional procedural details.
Strategy recall was obtained by matching the oral descriptions at study and test, coding for no mediator at study, verbatim recall, gist recall, partial recall, commission errors, and omission errors (see Dunlosky, et al., 2005 for more details). A summary of the coding scheme is available (see online supplemental materials, Appendix B). For purposes of this study, we mapped encoding strategy outcomes on an ordinal scale from the highest fidelity of description recall to the lowest: verbatim recall = 4, gist recall = 3, partial recall = 2, omission errors or commission errors = 1. Treatment of commission errors as low strategy recall is the most defensible scaling of recall outcomes, although it could limit FOK-strategy relations because (1) commission errors in target recall are often accompanied by high FOKs (Krinsky & Nelson, 1985), and (2) commission errors for encoding strategies could be regarded by participants as accurately recalled details about original encoding. As such this scaling of strategy recall might dilute somewhat the connection between perceived recollection of original encoding outcomes and FOKs.
Statistical Methods
We used SAS PROC GLIMMIX (SAS Institute, 2008) to analyze the dependent variables in a generalized mixed model (Littell, Milliken, Stroup, & Wolfinger, 2000). For the categorical dependent variable of associative recognition success, a logit link function was employed. For other variables, a Gaussian (normal distribution) link function was used. In these analyses, individual items (nested within the Concreteness independent variable) were modeled as having specific effects on dependent variables. Hence any significant effects of concreteness, repetition, and mediator recall statistically control for item-specific influences on the dependent variable. In addition to the usual homoscedastic residual error variance, we also modeled a random effect for (person) intercepts (individual differences), retaining the parameter if it was reliably different from zero. A critical value of .05 was used for all significance tests. To aid in result interpretation, we computed an effect size difference in fitted marginal means, where applicable, Cohen's d statistic (Cohen, 1988), which scales mean differences in error standard deviation units (pooled intercept and residual variance). Cohen's benchmarks for large, medium, and small effects are 0.8, 0.5, and 0.2, respectively.
We also estimated multilevel structural regression models in the MPlus 7.0 program (Muthén & Muthén, 1998-2007). This approach allowed us to accomplish two additional aims. First, we were able to estimate direct (partial regression coefficients), indirect (effects of one variable on another mediated by an intervening variable), and total effects (the sum of direct and indirect effects; see Cheong & MacKinnon, 2012), and to get standard errors (and significance tests) for the indirect effects. This feature made it is possible to address questions about the degree to which strategy recall mediated effects of independent variables like repetition and concreteness on FOKs. Second, Mplus produces standardized regression estimates for both the within-person (item-level) and between-person (person-level) of the multilevel model. Standardization in Mplus is achieved by partitioning the total covariance matrix into within-person and between-person submatrices, and then rescaling the regression coefficients with the appropriate estimates of variables' SD. For item-level regression coefficients, the rescaling is done by the associated ratio of item-level standard deviations (i.e., β * SDx(w) / SDy(w), where β is the estimated regression coefficient, SDx(w) is the estimated within-person SD of the predictor, and SDy(w) is the within-person SD of the criterion). For between-person regression coefficients, rescaling is done by the analogous ratio of between-person standard deviations. This feature allowed us to evaluate the relative magnitude of effects of different variables on metacognitive judgments (FOKs and CJs).
Results and Discussion
Target recall results were fully consistent with earlier studies (see online supplemental materials, Appendix C, Table 1), showing greater recall for concrete (vs. abstract) items and for three (vs. one) repetitions (e.g., Dunlosky, et al., 2005; Hertzog, Dunlosky, et al., 2010; Hertzog, et al., 2013). Feeling-of-knowing states are defined as confidence that a target which cannot be accessed is available in memory and will be later recognized. Hence, as is traditional in this area of research, the analyses we report all exclude trials resulting in successful target recall (on average, targets were recalled on 28% of the trials), analyzing data for trials when targets were not recalled.4 We also excluded items for which individuals did not report generating a mediator. Consistent with our earlier work (Hertzog, et al., 2013), successful mediator production was relatively common. The mean proportion of items generating mediator descriptions was .95 (SD = .07), with the values ranging from .63 to 1.0 across all participants.
Table 1 reports strategy recall and recognition accuracy, scaled as proportion correct, as a function of concreteness and repetition. For archival purposes, we also report mean FOKs and mean CJs and their SDs in this table. Note that the low mean levels of strategy recall reflects the fact that the modal outcome for unrecalled items was an omission or commission error (Dunlosky, et al., 2005; Hertzog, et al., 2013) for the generated mediator (M = .80, SD = .12). Nevertheless, verbatim or gist recall of the original encoding strategy (M = .10, SD = .08) still occurred following target recall failures.
Table 1. Mean FOK, Strategy recall, Recognition, and Confidence Judgments as a Function of Concreteness and Repetition.
FOK | Encoding Recall | |||||||
---|---|---|---|---|---|---|---|---|
Concrete | Abstract | Concrete | Abstract | |||||
M | SD | M | SD | M | SD | M | SD | |
1 Presentation |
25.94 | 27.67 | 24.99 | 26.87 | 1.16 | .47 | 1.12 | .41 |
3 Presentations |
48.05 | 35.53 | 40.90 | 32.21 | 1.74 | .95 | 1.56 | .90 |
Recognition | Confidence Judgments | |||||||
---|---|---|---|---|---|---|---|---|
Concrete | Abstract | Concrete | Abstract | |||||
M | SD | M | SD | M | SD | M | SD | |
1 Presentation |
.81 | .40 | .61 | .49 | 73.77 | 30.42 | 58.93 | 31.55 |
3 Presentations |
.96 | .21 | .80 | .40 | 94.00 | 14.82 | 79.14 | 27.97 |
Note. All unrecalled items were included when computing these statistics. FOKs and CJs are scaled in % confidence, recognition memory in proportion correct, and strategy recall in average level (ranging from 1 for recall errors to 4 for verbatim recall).
It would be typical in the metacognitive literature to use aggregated person-level means for the variables reported in Table 1 as dependent measures; for example, by analyzing each person's proportion correct in the associative recognition task. We forgo this approach because of our use of multi-level models for each variable, using item-level data.
Use of item-level data for recognition memory ran into the problem that six items were correctly recognized by all participants, and had to be deleted from the analysis of recognition memory success to obtain converged multi-level regression solutions. To preserve comparability of results across the different dependent variables, we deleted data for these six items from all of the multi-level regression analyses reported in this paper, including the ones analyzing FOKs and CJs.
Strategy recall
We begin with an analysis of strategy recall, because this variable is central to most of the major predictions about FOKs described under Research Hypotheses. We expected that the likelihood of recalling properties of the mediators (i.e., strategy recall) would be influenced by the independent variables of Concreteness and Repetition. The generalized mixed model predicting the strategy recall variable (see Table 2, column 1) showed that Concreteness and Repetition both influenced strategy recall, controlling for the significant specific item effects (some items afforded more memorable encodings than others). Table 3 reports the random effects for each model. The first row of Table 3 reports the unconditioned model (estimating only a person intercept and residual variance, without any experimental effects); reductions in residual variances to models including independent variables enabled computing a pseudo-R2 statistic (Snijders & Bosker, 1999). The full regression model included a residual variance and a significant random effect for intercepts, indicating reliable individual differences in average level of strategy recall. The fixed effects for Concreteness, Repetition, and their interaction accounted for about 56% of the variance in strategy recall; including the intercept variance, the model accounted for 71% of the variance in strategy recall.
Table 2. F-tests for Fixed Effects of Item, Concreteness, and Repetition on Strategy recall and FOK Magnitude from the Mixed Model Analyses.
Factor | Strategy recall | FOK Magnitude | ||||
---|---|---|---|---|---|---|
Model 1 | Model 2 | Model 3 | ||||
F | df | F | df | F | df | |
Item | 1.56* | 72, 2144 | 2.05** | 72, 2217 | 1.58* | 72, 2217 |
Concreteness | 27.77** | 1, 2156 | 28.82** | 1, 2219 | 5.05* | 1, 2218 |
Repetition | 368.94** | 1, 2164 | 340.21** | 1, 2221 | 80.02** | 1, 2220 |
Concreteness X Repetition | 10.10* | 1, 2145 | 12.45* | 1, 2217 | 1.20 | 1, 2217 |
Strategy recall (item level) | - | - | - | - | 536.16** | 1, 2218 |
Strategy recall (person level) | - | - | - | - | 2.44 | 1, 46.5 |
Concreteness X Strategy recall (item level) | - | - | - | - | 6.30* | 1, 2217 |
Concreteness X Strategy recall (person level) | - | - | - | - | 5.44* | 1, 2218 |
Note.
p < .05
p < .001
Table 3. Random Variance Components for the Mixed Models Predicting Strategy recall and FOK Magnitude.
Strategy Recall | Unrecalled-Item FOKs | |||||
---|---|---|---|---|---|---|
Model 1 | Model 2 | Model 3 | ||||
Variance | S.E. | Variance | S.E. | Variance | S.E. | |
Unconditional | 1.31** | .01 | 952.80** | 28.34 | 952.80 | 28.34 |
Residual | .38* | .01 | 483.47** | 14.52 | 372.14** | 11.18 |
Intercept | .19* | .03 | 362.31** | 78.48 | 349.96** | 75.40 |
Overall R2 | .71 | - | .49 | - | .61 | - |
Fixed effects R2 | .56 | - | .11 | - | .24 | - |
p < .05
p < .001
Note: The overall R2 reports the proportion of total item variance accounted for by fixed effect factors effects (items, repetition, concreteness, and random intercepts --individual differences in mean FOKs). The fixed-effects R2 is the proportion of total item variance accounted for by the fixed effect factors, ignoring intercept variance.
Figure 1 shows the corresponding marginal means and standard errors for the strategy recall variable. Recalling something about the encoding strategy was far more likely for items presented three times (M = 1.72, SE = 0.04) than for items presented once (M = 1.15, SE = 0.03), d = 0.75, a large effect. In terms of odds ratios, strategy recall success (attaining either verbatim or gist recall of originally encoded mediators) was three times more likely when items were presented thrice instead of once. Recall of encoding outcomes were also on average more likely for concrete items (M = 1.51, SE = 0.04) than abstract items (M = 1.36, SE = 0.03), d = 0.20. The Concreteness X Repetition interaction was also reliable (see Figure 1), indicating that repetition effects were larger for concrete items, d = 0.89, than abstract items, d = 0.51.
In general then, recall of original encoding strategies for unrecalled targets occurred, varied within-persons, and was influenced by independent variables shown in other studies to influence FOKs. Thus, the quality of strategy recall is a candidate variable to explain variation in FOKs for unrecalled items.
FOK Magnitude
Before evaluating the main Research Hypotheses pertaining to FOKs and strategy recall, FOKs for unrecalled items were first analyzed without reference to encoding outcomes. The mixed model results (Table 2, Model 2) showed reliable effects of Item, Concreteness, and Repetition, along with a Concreteness X Repetition interaction. FOKs were therefore sensitive to the independent variables (see Figure 2). Concreteness on average generated a small effect, d = 0.18, whereas Repetition generated a medium-sized effect, d = 0.67. The reliable interaction reflected larger repetition effects on FOKs for concrete items, d = 0.80, than for abstract items, d = 0.53. The model also included a random effect of FOK intercepts, reflecting individual differences in mean FOKs (Table 3, model 2). The pseudo-R2 indicated that the experimental factors (items, concreteness, and repetition) accounted for about 11% of the total variance in FOKs. Including the random intercept variance, the model accounted for about half the variance in FOKs, showing that individual differences in mean FOKs were a substantial source of FOK variance.
To evaluate the main hypotheses, a critical next step was to consider the contribution of strategy recall to FOKs. In particular, the noncriterial recollection hypothesis stipulates that strategy recall will have a strong relationship to FOKs. As a preliminary step, we computed the average FOK at each level of strategy recall (see Figure 3). This plot suggested a strong relationship between the two variables, with the biggest discrimination between levels of FOKs for strategy recall errors (omissions and commissions) and some level of mediator recall, which is consistent with the prediction from our first hypothesis. The plot indicated little distinction between gist and verbatim recall of encoding strategies in effects on FOKs, which would not necessarily be unexpected for the retrieval of imagery mediators, which might be equally likely to generate gist or verbatim verbal descriptions of retrieved images at the time of cued recall (Hertzog, et al., 2013), with either recollective experience generating relatively high FOKs. Nevertheless, we opted to continue to use the four-level graded strategy recall variable in further analyses of strategy recall-FOK relationships.
We then added the graded strategy recall variable to the mixed model predicting FOKs (Table 2, Model 3). The model included two variables capturing different aspects of graded strategy recall; a person-centered variable measuring within-person variation in strategy recall for different items (i.e., item-to-item variability in strategy recall for a given person), and a grand-mean centered variable that captured between-person variation in each person's mean level of strategy recall. These two variables reflecting between-person and a within-person sources of item variance in strategy recall are statistically independent (see Singer, 1998).
We initially included all higher-order interaction terms with the two strategy recall variables, but then trimmed non-significant effects in the reported final model. Compared to Model 2, Model 3 increased R2 by 13% by adding the fixed effects associated with Strategy recall (see Table 3, Model 3). The model revealed a robust effect of item-level strategy recall on FOKs, β = 19.7, SE = 1.1. Within an individual, an increase in level of strategy recall (e.g., from strategy recall failure to partial mediator recall) increased FOK confidence by about 20%. This effect was moderated by concreteness. Figure 4 shows that the fitted linear effects for strategy recall were stronger for concrete items relative to abstract items.
In contrast, person-level effects of strategy recall were not statistically significant, indicating that individual differences in mean levels of strategy recall did not greatly influence individual differences in mean FOKs. Overall, these results indicated that the within-person variability in strategy recall across items was a more important influence on FOKs than between-person differences in strategy recall. In sum, the signature feature of these results was a very large effect of item-level variation in strategy recall on FOKs for unrecalled items in all experimental conditions, with a magnified effect size for concrete items. This outcome verified a key prediction of the noncriterial recollection hypothesis.
Including mediator recall in the model reduced residual variance and therefore increased statistical power. Nevertheless, the F-tests for the Repetition and Concreteness main effects were reduced in magnitude compared to Model 2, and the Concreteness X Repetition interaction was eliminated. The fitted marginal mean difference in FOK confidence between concrete and abstract items was only 2.4% (d = 0.09) when strategy recall was included in the model, compared to a 14% (d = 0.18) difference when it was not in the model. Likewise, the repetition effect on FOK magnitudes, controlling on strategy recall, was reduced to 9.2%, d = 0.34, compared to the previous effect, d = 0.67, when strategy recall variables were not part of the model. It appeared that strategy recall statistically mediated some of the effects of concreteness and repetition on FOKs.
This inference was supported by a structural regression model with estimated indirect effects run in the Mplus program. The estimated standardized direct effect of strategy recall on FOKs was .50, larger than the standardized direct effects of Concreteness (.04) and Repetition (.17). The indirect effects of Concreteness and Repetition mediated by encoding strategy recall were .04 and .19, respectively, both of which were reliably greater than zero, p < .05. Thus, about half of the total effect of each independent variable on FOKs was mediated by encoding strategy recall.
These outcomes support the noncriterial recollection hypothesis, showing that FOKs in episodic memory tasks are strongly influenced by access to outcomes of encoding operations carried out one week earlier. The effect of strategy recall on FOKs found in this study appears to be larger than the FOK-related effects found in studies that have used accessibility of ancillary encoding-context features (Brewer, et al., 2010) or accessibility of a single manipulated target feature (e.g., emotional valence; Thomas, et al., 2011) to evaluate the noncriterial recollection hypothesis. We speculate that participants in this experiment routinely regarded recovered detail about the original encoding experience to be diagnostic of later target recognition and often based their FOKs on this source of information.
Associative Recognition Accuracy
Table 4 (Column 1, Model 1) reports the F-tests from the SAS PROC GLIMMIX analysis of recognition memory success (for previously unrecalled items only), after logit transformation of that binary dependent variable. Controlling for significant item differences in recognition memory success, there were reliable main effects of Concreteness and Repetition, as well as a reliable Concreteness X Repetition interaction. On average concrete items were more likely to be correctly recognized than abstract items, thrice-presented items were more likely to be correctly recognized than once-presented items, and this effect was larger for concrete items than abstract items. Table 5 (Model 1) reports the estimated random effects for this model.
Table 4. F-tests for the Generalized Mixed Models Using Item, Concreteness, Repetition, and Strategy recall to predict Logit-transformed Recognition Accuracy.
Recognition Accuracy | ||||||
---|---|---|---|---|---|---|
Model 1 | Model 2 | Model 3 | ||||
F | df | F | df | F | df | |
Item | 1.34* | 72, 2138 | 1.29 | 72, 2137 | 1.23 | 72, 2138 |
Concreteness | 71.69** | 1, 2150 | 68.15* | 1, 2148 | 62.56** | 1, 2148 |
Repetition | 95.32** | 1, 2157 | 77.13* | 1, 2156 | 62.49** | 1, 2157 |
Concreteness X Repetition | 4.63* | 1, 2137 | 4.12* | 1, 2136 | 3.45 | 1, 2136 |
FOK (item level) | - | - | 9.71* | 1, 2143 | 2.13 | 1, 2143 |
FOK (participant level) | - | - | 0.06 | 1, 35.5 | 0.07 | 1, 34 |
Strategy recall (participant mean centered) | - | - | - | - | 12.07* | 1, 2141 |
Strategy recall (grand mean centered) | - | - | - | - | 10.67* | 1, 42.2 |
Note.
p < .05
p < .001
Table 5. Random Variance Components for the Generalized Mixed Models Predicting Recognition Accuracy.
Recognition Accuracy | ||||||
---|---|---|---|---|---|---|
Model 1 | Model 2 | Model 3 | ||||
Variance | S.E. | Variance | S.E. | Variance | S.E. | |
Unconditional | 1.08** | .05 | 1.08 | .05 | 1.08 | .05 |
Residual | 0.94* | .03 | .95* | .03 | .97* | .03 |
Intercept | 0.92 | .13 | .93 | .13 | .82 | .12 |
Note.
p < .05
p < .001
FOK-Recognition Accuracy Relationships
It is traditional to evaluate FOK resolution with respect to recognition accuracy by computing ordinal within-person gamma correlations and analyzing them as the dependent variable. As expected, repetition did affect gamma correlations (see online supplemental materials, Appendix D, Table 4). We focus, however, on the use of multilevel regression models in SAS PROC GLIMMIX with logit-transformed recognition accuracy as the dependent variable because of its advantages for evaluating the linkage of strategy recall to FOK accuracy.
We started by adding FOKs to the model already reported. Our earlier analysis with FOKs as the dependent variable had shown reliable random effects in intercepts (individual differences in mean FOKs), so it was important to isolate item-level and person-level FOK effects on recognition accuracy. We again used person-centered and grand-mean centered FOK variables to accomplish this partition.
The initial analysis included all higher-order interactions involving both FOK variables (e.g., Concreteness X item-level FOKs), but none of these interactions were statistically significant, so they were trimmed from the model. Table 4, Model 2, reports the F-tests for effects remaining in the trimmed model. Table 5 reports the estimated random effects. There were reliable effects of item-level variation in FOKs on recognition accuracy, consistent with the gamma correlations. Higher FOKs were associated with higher likelihood of recognition memory accuracy, β = 0.008, SE = 0.003. In contrast, there was no reliable prediction of individual differences in recognition memory by mean FOKs.
The next step was to add strategy recall to the model. Again, we entered item-level strategy recall, person-level strategy recall, and all associated interactions into the model. None of the interactions were statistically significant. Table 4, Model 3, reports the F-tests for fixed effects in trimmed model. Note that there were reliable effects of both encoding recall variables on recognition memory success. Within an individual, items for which aspects of the original encoding could be recalled were more likely to be recognized than items generating less retrieved detail, β = 0.48, SE = 0.14. Between individuals, persons with higher levels of mediator recall were more likely to successfully recognize items they had not previously recalled, β = 2.40, SE = 0.73. This finding corroborates our second hypothesis, demonstrating that there is a substantial relationship between strategy recall and recognition of previously unrecalled items. Hence strategy recall is a diagnostic cue that could account for FOK accuracy.
Controlling on the item-level strategy recall variable completely eliminated the significant effect of item-level FOKs on item recognition memory. This important outcome verifies our third hypothesis, suggesting that noncriterial recollection of original encoding details fully mediated the predictive accuracy of FOKs for recognition memory. To further evaluate this hypothesis, we ran an Mplus model using strategy recall as the mediator of FOKs' relationship to recognition memory success. Whereas the direct effect of FOKs on recognition memory just missed significance when controlling on strategy recall, standardized effect = .05, SE = .03, p = .06, the standardized indirect effect (.03, SE = .01) mediated by strategy recall was statistically significant, p < .05. The standardized .08 total effect of FOKs on recognition accuracy (SE = .02) was reliably greater than zero, p < .05.5
An interesting interpretational twist on these analyses is that one can also argue that FOKs, although influenced by encoding recall, do not fully capture the potential of strategy recall as a cue for generating accurate FOKs, given that strategy recall predicted recognition memory independently of FOKs. This outcome suggests that participants' reliance on this cue to make FOKs was inconsistent across trials, and highlights the idea that this type of metacognitive monitoring potentially could be enhanced by improving attention to available diagnostic cues.
In sum, then, the predictive accuracy of FOKs for recognition memory success appears to be generated in large part by strategy recall, consistent with the noncriterial recollection hypothesis (Brewer, et al., 2010). However, FOKs failed to benefit fully from the available cues of strategy recall, repetition, and concreteness, all of which predicted recognition success independently of FOKs.
FOK-Confidence Judgment Relationships
To evaluate our second group of hypotheses pertaining to CJs, our next goal was to evaluate the predictive validity of FOKs for recollective experiences at the time of the recognition memory test using correct associative recognition trials only. First, as predicted, FOK-CJ gamma correlations were reliably above chance -- greater than zero (see online supplemental materials, Appendix E, Table 5). We were interested in hypotheses about strategy recall and the FOK-CJ relationships that cannot be assessed with these gamma correlations. Specifically, we hypothesized that strategy recall would also predict recollection during the forced choice recognition test, and, given the relationship of strategy recall to FOKs, would therefore mediate, at least in part, the prediction of recognition memory CJs by FOKs.
The first model (Table 6, Model 1) simply included the item effects and the two experimentally manipulated variables, Concreteness and Repetition. We detected significant random effects in intercepts, indicating substantial individual differences in mean CJs (see Table 7). There were robust main effects of both independent variables, and their interaction just missed statistical significance. Concrete items led to higher levels of confidence in recognition decisions (M = 88.5, SE = 2.0) compared to abstract items (M = 76.0, SE = 2.0), d = 0.49, and thrice-presented items led to higher confidence (M = 92.2, SE = 2.1) than did once-presented items (M = 72.3, SE = 2.0), d = 0.78. The trend for an interaction reflected larger repetition benefits on confidence for abstract items, d = 0.86, than for concrete items, d = 0.71.
Table 6. F-tests for Mixed Models Using Item, Concreteness, Repetition, and Strategy recall to Predict Confidence Judgments for Correctly Recognized Items.
Confidence Judgments | ||||||
---|---|---|---|---|---|---|
Model 1 | Model 2 | Model 3 | ||||
F | df | F | df | F | df | |
Item | 1.24 | 72, 1644 | 1.36* | 72, 1644 | 1.34* | 72, 1644 |
Concreteness | 112.64** | 1, 1653 | 109.48** | 1, 1656 | 103.43** | 1, 1655 |
Repetition | 273.46** | 1, 1662 | 196.22** | 1, 1668 | 164.76** | 1, 1669 |
Concreteness X Repetition | 2.66 | 1, 1647 | 0.49 | 1, 1648 | 2.97 | 1,1648 |
FOK (item level) | - | - | 26.66** | 1, 1645 | 9.31* | 1, 1642 |
FOK (person level) | - | - | 4.30* | 1, 44.8 | 3.92 | 1, 44.6 |
Strategy recall (item level) | - | - | - | - | 8.66* | 1,1642 |
Strategy recall (person level) | - | - | - | - | 0.66 | 1, 44.8 |
FOK (item level) X Concreteness | - | - | 4.84* | 1, 1647 | - | - |
FOK (person level) X Concreteness | - | - | 2.50 | 1, 1652 | - | - |
FOK (item level) X Repetition | - | - | 9.05* | 1, 1659 | 9.63* | 1,1665 |
FOK (person level) X Repetition | - | - | 7.88* | 1, 1667 | 6.40* | 1, 1664 |
Note.
p < .05
p < .001
Table 7. Random Variance Components and Coefficients of Determination for Mixed Models with Confidence Judgments.
Confidence Judgments | ||||||
---|---|---|---|---|---|---|
Model 1 | Model 2 | Model 3 | ||||
Variance | S.E. | Variance | S.E. | Variance | S.E. | |
Unconditional | 747.57** | 25.75 | 747.57 | 25.75 | 747.57 | 25.75 |
Residual | 492.30** | 17.19 | 478.94** | 16.74 | 478.69** | 16.72 |
Intercept | 150.22* | 34.81 | 133.67** | 31.24 | 129.66** | 30.38 |
Overall R2 | .34 | .37 | - | .36 | - | |
R2 fixed effects | .14 | .19 | - | .19 | - |
p < .05
p < .001
Note: The overall R2 reports the proportion of total item variance accounted for by fixed effect factors effects (items, repetition, concreteness, and random intercepts—individual differences in mean FOKs). The fixed-effects R2 is the proportion of total item variance accounted for by the fixed effect factors, ignoring intercept variance.
We interpret gradations in CJs for correctly recognized items as reflecting degree of recollective experience at the time of the forced-choice recognition test, including recollective support for recall-to-reject processes (e.g., Cohn & Moscovitch, 2007; Gallo, Bell, Beier, & Schacter, 2006; Yonelinas, 2001). Given that the targets were previously unrecalled for the items included in these analyses, recall-to-reject in this context most likely reflects a process by which recollective detail of encoding context is first triggered when the foils (incorrect alternatives) are presented during the forced-choice recognition test. Thus, unlike a yes/no recognition task, recollection during the forced-choice task will include both recollection of original cue-target encoding and recollection triggered by recall of foils and their originally paired cues.
Next, we entered item-level and person-level FOKs and associated interactions into the model. The analysis detected a reliable effect of item-level FOKs on CJs, qualified by interactions of item-level FOKs with Repetition and Concreteness. Similarly, there was a significant person-level FOK effect on CJs, qualified by an interaction of person-level FOKs with Repetition (see Table 6, Model 2).
To help clarify the repetition-related interactions, we ran multilevel models separately for the once-presented and thrice-presented items. Item-level FOKs had a larger regression coefficient for once-presented items, relative to items presented three times (β = 0.153, SE = 0.050 versus β = 0.061, SE = 0.043); indeed, the effect was not reliable for thrice-presented items. Likewise, person-level FOKs tended to generate a larger effect for once-presented items, β = 0.243, SE = 0.132 than for thrice-presented items, β = 0.090, SE = 0.081.
These results were consistent with the idea that very high levels of recollection for correctly recognized items blunted the connection between FOKs and CJs for the thrice-presented items, given the strong effect of repetition on mean FOKs in the previous analysis. By this interpretation, the interaction does not imply qualitative shifts in the basis for FOK-CJ relations in the different repetition conditions.
In general, these results supported our fourth hypothesis of FOK-CJ relations for correctly recognized items, consistent with previous research (Eakin, et al., in press; Hertzog, Dunlosky, et al., 2010). Access to information about encoding strategies at the time of the FOK forecasts recognition states that generate higher confidence in the accuracy of the forced-choice recognition discrimination. This outcome sets the stage for a test of the noncriterial recollection account of FOK relations to recognition memory CJs.
Strategy recall, FOKs, and CJs
To evaluate contributions of strategy recall to these effects, we added the item-level strategy recall and person-level strategy recall to the previous model. Table 8, Model 3, shows that doing so resulted in significant prediction of CJs by item-level strategy recall, but not at the person level. Furthermore, the effect of FOKs on CJs was reduced, but not eliminated by adding strategy recall to the model, suggesting that strategy recall partially mediated the predictive validity of FOKs for CJs for correct recognition responses. In contrast, strategy recall had little impact on the repetition-related effects in Model 2.
We evaluated indirect effects of item-level FOKs on CJs as mediated by strategy recall by modeling the item-level data using Mplus. Repetition and Concreteness both had relatively robust direct effects on CJs (standardized effects of .33 and .24, respectively). The standardized direct effects of item-level FOKs and item-level strategy recall were reliable, but weaker (.08 and .07, respectively).6 The standardized total effect of strategy recall on CJs was 0.11 (SE = 0.23). The standardized indirect effect mediated through FOKs was smaller, but reliably greater than zero, effect = .04, SE = .02, p < .05.
These results support the more modest version of the noncriterial recollection hypothesis. Strategy recall is indeed one of the cues accounting for the predictive validity of FOKs for recollective experiences during the recognition test. However, other cues must also be operating to generate FOK-CJ relationships.
The fact that strategy recall accounts for most of the FOK-recognition memory correlations, but not for most of the FOK-CJ correlations for correct trials, supports the argument that recognition success and CJs reveal different aspects of recognition memory with which to validate FOKs. As noted previously, awareness that one has accessed little or no information about the target after cued recall failure could lead to a phenomenal experience of knowing that one doesn't know (Liu, et al., 2007), which cannot account for correlations between FOKs and CJs for correct recognition trials. Forced-choice recognition success requires access to specific information about the original cue-target relationship that could in principle be triggered at the time of the FOK by the recall cue, and strategy recall is either the principal source of this information or a correlate of most available sources. However, although strategy recall does predict CJs for correct recognition trials, it does not account in full for the observed predictive validity of FOKs for CJs. Thus, FOKs must be, ipso facto, influenced by other cues that foster a positive FOK-CJ relationship. There are a number of candidate cues not directly measured in this study that could in principle influence FOKs and FOK accuracy, including (1) cue familiarity—promoted in this experiment by the manipulation of repetition (e.g., Metcalfe, et al., 1993), (2) access to target features not integrated into the encoded mediator when making the FOK (Thomas, et al., 2011), and (3) recollection of other aspects of encoding context besides the mediator itself.
However, it is also clear that FOKs did not achieve the level of predictive validity for CJs that was in principle possible, given the magnitude of observed relationships of the available cues of repetition, concreteness, and strategy recall to CJs, as well as the substantial residual variance in CJs owing implicitly to other, unmeasured influences. One possible explanation for the limited FOK-CJ relationship—aside from poor monitoring or suboptimal rating scale behavior—contrasts the types of recollective experience that are not shared in common between cue-generated FOKs and recognition tests. Recollective experiences during the recognition test can also derive from foil-induced recall-to-reject mechanisms cited earlier, and these contributions to recognition test confidence cannot in principle be anticipated at the time of the FOK (when the foils are not yet known; as in Thiede & Dunlosky, 1994). One can draw an analogy here to work on the difference between immediate judgments of learning (JOLs), on the one hand, and delayed JOLs and FOKs, on the other hand (Eakin & Hertzog, 2012). Immediate JOLs are insensitive to cue set size and target set size effects that influence implicit retrieval interference during a cued recall test (see T. O. Nelson & Dunlosky, 1991). In contrast, both delayed JOLs and FOKs, which are influenced by accessibility of information during cued recall, are sensitive to these retrieval-based effects.
Limitations and Conclusions
The major finding of this study is that the mediators that participants produce at encoding play a large role in statistically explaining variation in FOKs, the resolution of FOKs, and (to a lesser degree) the relationship between FOKs and CJs in a subsequent associative recognition test. As such, the degree of strategy recall appears to be one of the pathways by which noncriterial recollection influences FOKs and FOK accuracy.
These results do not necessarily generalize to all other studies of episodic FOK accuracy given that this study instructed individuals to use encoding strategies and required individuals to report their encoded mediators at study and after cued recall. This procedure could have increased the salience of strategy recall as a potential cue for making FOKs. Hence we cannot conclude at present that encoding strategy recall influences FOKs in tasks where participants spontaneously generate the strategies without experimenter intervention. Note, however, that people do spontaneously generate mnemonic strategies—including imagery—during encoding (e.g., Dunlosky & Hertzog, 2001), and that other evidence suggests that recalling spontaneous strategy use may affect FOKs. Hosey, Peynircioğlu, and Rabinovitz (2009) requested post hoc justifications for face-name FOKs from their participants, and found that individuals often reported access to aspects of encoding when they made high FOKs. Hertzog, Sinclair, and Dunlosky (2010) used retrospective strategy reports to measure spontaneous strategy use when learning verbal paired-associates. They showed that reported encoding strategies correlated with judgments of learning made immediately after each item was encoded. One could even argue that the effects seen in the present study might be even stronger under conditions where individuals spontaneously used strategies for some but not all items, so that strategy use would contribute to subsequent item-level variability in recognition and recognition CJs. New research will be needed to investigate whether FOKs are influenced by spontaneous strategy use and the degree of influence this might have on FOK accuracy.
We also acknowledge that the strategy recall-FOK relationships observed in this study are inherently correlational. Although they are consistent with the interpretation that strategy recall in this task has a causal influence on FOKs, we cannot rule out the generic rival explanation that unmeasured cues that are correlated with strategy recall are the actual basis for the strategy recall-FOK relationships observed in this study. We are justified in concluding that the observed relationship is independent of the manipulated variables of repetition and concreteness, and that some of the effects of those independent variables on FOKs covary with strategy recall, rendering strategy recall a plausible candidate explanation of FOK variance.
Scientists have speculated about whether FOKs might be based on nonanalytic feelings of warmth that could derive from indirect (and perhaps unconscious) access to the target (e.g., Metcalfe, 2000) or implicit influences on cue familiarity (Jameson, Narens, Goldfarb, & Nelson, 1990), as opposed to being based on information generated by explicit retrieval searches prompted by the FOK. The present results demonstrate that one product of an explicit retrieval search, strategy recall, appears to have a strong influence on FOKs. This outcome is therefore consistent with what Nelson and Narens (1990) once termed the ‘no-magic’ account of FOKs. Retrieval of information about the mediators created by implementing instructed strategy use apparently influences FOKs for unrecalled items. Access to original encoding strategies accounts for much of the correlation of FOKs with subsequent recognition memory success and for some of the correlation of FOKs with CJs for correct recognition trials.
Supplementary Material
Acknowledgments
We thank Melissa Bishop, Rebekah Stewart, Asha Kumar, Radhika Solanki, Anna Babaie, Megan Bulluck, Shivani Shah, Nabila Nazarali, Natalie Scholpa, and Huiqi Ma for help with data collection, preparation, and management. This research was supported by a grant from the National Institute on Aging, one of the National Institutes of Health (R37 AG013148), and a Ruth L. Kirschstein PHS training grant (T32 AG000175). For more information on our research program, see http://psychology.gatech.edu/CHertzog/
Footnotes
See Dunlosky & Metcalfe (2009) for some discussion of other aspects of predictive accuracy in metacognitive research, including judgment calibration.
In the classic associative learning literature, implementation of encoding strategies to form new associations has been termed mediation (e.g., Richardson, 1998), with the use of imagery, sentence generation, and other strategies to mediate the formation of the new association. We make use of statistical analyses to test mediated regression hypotheses in this paper (MacKinnon, 2008). To avoid confusion of terms, we refer to strategy use for forming new associations as encoding strategies and the recall of encoding strategies as strategy recall.
Under certain conditions, statistical mediation effects can be interpreted as causal influences. In this study we claim no such interpretation, and instead use mediation analysis only to evaluate degree of statistical interdependence between strategy recall, FOKs, and outcome measures. The extent to which strategy recall mediates effects of FOKs on outcome measures may reflect a causal influence of strategy recall on FOKs, but it certainly does not imply from our perspective a causal influence of FOKs on other outcome measures, like recognition memory success. Instead it allows us to quantify the degree to which strategy recall and FOKs have shared or unique prediction of outcomes used to assess FOK accuracy.
One can validate FOKs for all items to show that FOKs are strongly influenced by cued-recall success (Eakin & Hertzog, 2012). This relationship was also found in the present study (see online supplemental materials, Appendix C, Tables 2a-2c), but is not our focus of interest here.
Note that the standardization used here is for within-person (between-item) variance in the different variables. Item-level variation includes sources of error aggregated away when list-level statistics (like mean FOKs) are computed, which helps to explain the modest standardized regression effects.
Given the large standardized direct effect of item-level strategy recall on FOKs (.53), the weak direct effects could have been partly a function of partialling these two variables for each other. However, even when strategy recall was omitted from the model, the effect of item-level FOKs on CJs was still smaller than the effects of the manipulated independent variables, standardized effect = .11 (SE = .02), p < .05.
Contributor Information
Christopher Hertzog, Georgia Institute of Technology.
Erika K. Fulton, Georgia Institute of Technology
Starlette M. Sinclair, Georgia Institute of Technology
John Dunlosky, Kent State University.
References
- Brewer GA, Marsh RL, Clark-Foos A, Meeks JT. Noncriterial recollection influences metacognitive monitoring and control processes. The Quarterly Journal of Experimental Psychology. 2010;63(10):1936–1942. doi: 10.1080/17470210903551638. [DOI] [PubMed] [Google Scholar]
- Carroll M, Nelson TO. Effect of overlearning on the feeling of knowing is more detectable in within-subject than in between-subject designs. The American Journal of Psychology. 1993;106(2):227–235. doi: 10.2307/1423169. [DOI] [PubMed] [Google Scholar]
- Cheong J, MacKinnon DP. Mediation/indirect effects in structural equation modeling. In: Hoyle RH, editor. Handbook of structural equation modeling. New York, NY: Guilford Press; 2012. pp. 417–435. [Google Scholar]
- Cohen J. Statistical power analysis for the behavioral sciences (rev ed.) Hillsdale, NJ England: Lawrence Erlbaum Associates, Inc; 1988. [Google Scholar]
- Cohn M, Moscovitch M. Dissociating measures of associative memory: Evidence and theoretical implications. Journal of Memory and Language. 2007;57(3):437–454. doi: 10.1016/j.jml.2007.06.006. [DOI] [Google Scholar]
- Cook GI, Marsh RL, Hicks JL. Source memory in the absence of successful cued recall. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2006;32(4):828–835. doi: 10.1037/0278-7393.32.4.828. [DOI] [PubMed] [Google Scholar]
- Dunlosky J, Hertzog C. Measuring strategy production during associative learning: The relative utility of concurrent versus retrospective reports. Memory & Cognition. 2001;29(2):247–253. doi: 10.3758/BF03194918. [DOI] [PubMed] [Google Scholar]
- Dunlosky J, Hertzog C, Powell-Moman A. The contribution of mediator-based deficiencies to age differences in associative learning. Developmental Psychology. 2005;41(2):389–400. doi: 10.1037/0012-1649.41.2.389. [DOI] [PubMed] [Google Scholar]
- Eakin DK. ListChecker Pro 1.2: A program designed to facilitate creating word lists using the University of South Florida word association norms. Behavior Research Methods. 2010;42(4):1012–1021. doi: 10.3758/brm.42.4.1012. [DOI] [PubMed] [Google Scholar]
- Eakin DK, Hertzog C. Immediate judgments of learning are insensitive to implicit interference effects at retrieval. Memory & Cognition. 2012;40(1):8–18. doi: 10.3758/s13421-011-0138-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eakin DK, Hertzog C, Harris W. Age invariance in semantic and episodic metamemory: Both younger and older adults provide accurate feeling of knowing for names of faces. Aging Neuropsychology and Cognition. doi: 10.1080/13825585.2013.775217. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fearnley S. MRC Psycholinguistic Database search program. Behavior Research Methods, Instruments & Computers. 1997;29(2):291–295. doi: 10.3758/bf03204829. [DOI] [Google Scholar]
- Gallo DA, Bell DM, Beier JS, Schacter DL. Two types of recollection-based monitoring in younger and older adults: Recall-to-reject and the distinctiveness heuristic. Memory. 2006;14(6):730–741. doi: 10.1080/09658210600648506. [DOI] [PubMed] [Google Scholar]
- Gonzalez R, Nelson TO. Measuring ordinal association in situations that contain tied scores. Psychological Bulletin. 1996;119(1):159–165. doi: 10.1037/0033-2909.119.1.159. [DOI] [PubMed] [Google Scholar]
- Hart JT. Memory and the feeling-of-knowing experience. Journal of Educational Psychology. 1965;56(4):208–216. doi: 10.1037/h0022263. [DOI] [PubMed] [Google Scholar]
- Hertzog C, Dunlosky J, Sinclair SM. Episodic feeling-of-knowing resolution derives from the quality of original encoding. Memory & Cognition. 2010;38(6):771–784. doi: 10.3758/MC.38.6.771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hertzog C, Fulton EK, Mandviwala L, Dunlosky J. Older adults show deficits in retrieving and decoding associative mediators generated at study. Developmental Psychology. 2013;49(6):1127–1131. doi: 10.1037/a0029414. [DOI] [PubMed] [Google Scholar]
- Hertzog C, Hines JC, Touron DR. Judgments of learning are influenced by multiple cues in addition to memory for past test accuracy. Archives of Scientific Psychology. doi: 10.1037/arc0000003. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hertzog C, Sinclair SM, Dunlosky J. Age differences in the monitoring of learning: Cross-sectional evidence of spared resolution across the adult life span. Developmental Psychology. 2010;46(4):939–948. doi: 10.1037/a0019812. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hicks JL, Marsh RL. On predicting the future states of awareness for recognition of unrecallable items. Memory & Cognition. 2002;30(1):60–66. doi: 10.3758/BF03195265. [DOI] [PubMed] [Google Scholar]
- Hines JC, Touron DR, Hertzog C. Metacognitive influences on study time allocation in an associative recognition task: An analysis of adult age differences. Psychology and Aging. 2009;24(2):462–475. doi: 10.1037/a0014417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hosey LA, Peynircioğlu ZF, Rabinovitz BE. Feeling of knowing for names in response to faces. Acta Psychologica. 2009;130(3):214–224. doi: 10.1016/j.actpsy.2008.12.007. [DOI] [PubMed] [Google Scholar]
- Jameson KA, Narens L, Goldfarb K, Nelson TO. The influence of near-threshold priming on metamemory and recall. Acta Psychologica. 1990;73(1):55–68. doi: 10.1016/0001-6918(90)90058-n. [DOI] [PubMed] [Google Scholar]
- Koriat A. Dissociating knowing and the feeling of knowing: Further evidence for the accessibility model. Journal of Experimental Psychology: General. 1995;124(3):311–333. doi: 10.1037/0096-3445.124.3.311. [DOI] [Google Scholar]
- Koriat A, Levy-Sadot R. The combined contributions of the cue-familiarity and accessibility heuristics to feelings of knowing. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2001;27(1):34–53. doi: 10.1037/0278-7393.27.1.34. [DOI] [PubMed] [Google Scholar]
- Krinsky R, Nelson TO. The feeling of knowing for different types of retrieval failure. Acta Psychologica. 1985;58(2):141–158. doi: 10.1016/0001-6918(85)90004-6. [DOI] [PubMed] [Google Scholar]
- Littell RC, Milliken GA, Stroup WW, Wolfinger RD, editors. SAS system for mixed models. 4. Cary, NC: SAS Institute, Inc; 2000. [Google Scholar]
- Liu Y, Su Y, Xu G, Chan RCK. Two dissociable aspects of feeling-of-knowing: Knowing that you know and knowing that you do not know. The Quarterly Journal of Experimental Psychology. 2007;60(5):672–680. doi: 10.1080/17470210601184039. [DOI] [PubMed] [Google Scholar]
- Lupker SJ, Harbluk JL, Patrick AS. Memory for things forgotten. Journal of Experimental Psychology: Learning, Memory, and Cognition. 1991;17(5):897–907. doi: 10.1037/0278-7393.17.5.897. [DOI] [PubMed] [Google Scholar]
- MacKinnon DP. Introduction to statistical mediation analysis. New York, NY: Taylor & Francis Group/Lawrence Erlbaum Associates; 2008. [Google Scholar]
- MacLaverty SN, Hertzog C. Do age-related differences in episodic feeling of knowing accuracy depend on the timing of the judgement? Memory. 2009;17(8):860–873. doi: 10.1080/09658210903374537. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Metcalfe J. Feelings and Judgments of Knowing: Is there a special noetic state? Consciousness and Cognition. 2000;9:178–2000. doi: 10.1006/ccog.2000.0451. [DOI] [PubMed] [Google Scholar]
- Metcalfe J, Schwartz BL, Joaquim SG. The cue-familiarity heuristic in metacognition. Journal of Experimental Psychology: Learning, Memory, and Cognition. 1993;19(4):851–864. doi: 10.1037/0278-7393.19.4.851. [DOI] [PubMed] [Google Scholar]
- Muthén LK, Muthén BO. Mplus User's Guide. 4. Los Angeles, CA: Muthén & Muthén; 1998-2007. [Google Scholar]
- Nelson DL, McEvoy CL, Schreiber TA. The University of South Florida word association, rhyme, and word fragment norms. 1998 doi: 10.3758/bf03195588. from http://www.usf.edu/FreeAssociation/ [DOI] [PubMed]
- Nelson TO, Dunlosky J. When people's judgments of learning (JOLs) are extremely accurate at predicting subsequent recall: The ‘delayed-JOL effect.’. Psychological Science. 1991;2(4):267–270. doi: 10.1111/j.1467-9280.1991.tb00147.x. [DOI] [Google Scholar]
- Nelson TO, Leonesio RJ, Shimamura AP, Landwehr RF, Narens L. Overlearning and the feeling of knowing. Journal of Experimental Psychology: Learning, Memory, and Cognition. 1982;8(4):279–288. doi: 10.1037/0278-7393.8.4.279. [DOI] [PubMed] [Google Scholar]
- Nelson TO, Narens L. Metamemory: A theoretical framework and new findings. In: Bower GH, editor. The psychology of learning and motivation. Vol. 26. New York: Academic Press; 1990. pp. 125–141. [Google Scholar]
- Paivio A. Mind and its evolution: A dual coding theoretical approach. Mahwah, NJ: Lawrence Erlbaum Associates; 2007. [Google Scholar]
- Parks CM. The role of noncriterial recollection in estimating recollection and familiarity. Journal of Memory and Language. 2007;57(1):81–100. doi: 10.1016/j.jml.2007.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Richardson JTE. The availability and effectiveness of reported mediators in associative learning: A historical review and an experimental investigation. Psychonomic Bulletin & Review. 1998;5(4):597–614. doi: 10.3758/bf03208837. [DOI] [Google Scholar]
- Sacher M, Taconnat L, Souchay C, Isingrini M. Divided attention at encoding: Effect on feeling-of-knowing. Consciousness and Cognition: An International Journal. 2009;18(3):754–761. doi: 10.1016/j.concog.2009.04.001. [DOI] [PubMed] [Google Scholar]
- SAS Institute, I. SAS/STAT®9.2 User's Guide. Cary, NC: SAS Institute Inc; 2008. [Google Scholar]
- Schacter DL, Worling JR. Attribute information and the feeling-of-knowing. Canadian Journal of Psychology/Revue canadienne de psychologie. 1985;39(3):467–475. doi: 10.1037/h0080074. [DOI] [Google Scholar]
- Singer JD. Using SAS PROC MIXED to fit multilevel models, hierarchical models, and individual growth models. Journal of Educational and Behavioral Statistics. 1998;23(4):323–355. doi: 10.2307/1165280. [DOI] [Google Scholar]
- Snijders JD, Bosker RJ. Multilevel analysis: An introduction to basic and advanced multilevel modeling. London, UK: Sage; 1999. [Google Scholar]
- Souchay C, Moulin CJA, Clarys D, Taconnat L, Isingrini M. Diminished episodic memory awareness in older adults: Evidence from feeling-of-knowing and recollection. Consciousness and Cognition: An International Journal. 2007;16(4):769–784. doi: 10.1016/j.concog.2006.11.002. [DOI] [PubMed] [Google Scholar]
- Tauber SK, Rhodes MG. Multiple bases for young and older adults' judgments of learning in multitrial learning. Psychology and Aging. 2012;27(2):474–483. doi: 10.1037/a0025246. [DOI] [PubMed] [Google Scholar]
- Thiede KW, Dunlosky J. Delaying students' metacognitive monitoring improves their accuracy in predicting their recognition performance. Journal of Educational Psychology. 1994;86(2):290–302. doi: 10.1037/0022-0663.86.2.290. [DOI] [Google Scholar]
- Thomas AK, Bulevich JB, Dubois SJ. Context affects feeling-of-knowing accuracy in younger and older adults. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2011;37(1):96–108. doi: 10.1037/a0021612. [DOI] [PubMed] [Google Scholar]
- Thomas AK, Bulevich JB, Dubois SJ. An analysis of the determinants of the feeling of knowing. Consciousness and Cognition: An International Journal. 2012;21(4):1681–1694. doi: 10.1016/j.concog.2012.09.005. [DOI] [PubMed] [Google Scholar]
- Yonelinas AP. Consciousness, control, and confidence: The 3 Cs of recognition memory. Journal of Experimental Psychology: General. 2001;130(3):361–379. doi: 10.1037/0096-3445.130.3.361. [DOI] [PubMed] [Google Scholar]
- Yuille JC. A detailed examination of mediation in PA learning. Memory & Cognition. 1973;1(3):333–342. doi: 10.3758/BF03198117. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.