Skip to main content
The Journal of Neuroscience logoLink to The Journal of Neuroscience
. 2008 Aug 27;28(35):8765–8771. doi: 10.1523/JNEUROSCI.1953-08.2008

Temporary Activation of Long-Term Memory Supports Working Memory

Jarrod A Lewis-Peacock 1,, Bradley R Postle 1
PMCID: PMC2699183  NIHMSID: NIHMS68573  PMID: 18753378

Abstract

This study describes a functional magnetic resonance imaging study of humans engaged in long-term memory (LTM) and working memory tasks. A pattern classifier learned to identify patterns of brain activity associated with viewing and making judgments about three categories of pictures (famous people, famous locations, and common objects). The evaluation of these stimuli relied on perception and long-term semantic and/or episodic memories. We investigated whether this classifier could successfully decode brain activity from a subsequent delayed paired-associate recognition working memory task that required the short-term retention of the same stimuli. We reasoned that the LTM-trained classifier would be able to decode delay-period activity only if that activity reflected, to some extent, the temporary activation of LTM. Our results demonstrated successful decoding: delay-period activity from a distributed network of brain regions matched learned patterns of activity for task-relevant stimuli to a greater extent than for task-irrelevant stimuli. In varying degrees throughout the delay, activity reflected the target (a retrospective code) and its associate (a prospective code) with considerable variability among subjects. Although prefrontal cortex (PFC) demonstrated category-specific patterns of activity during the LTM task, these patterns were not reinstated in PFC during the working memory task. We conclude that the short-term retention of information can be supported by the temporary reactivation of LTM representations.

Keywords: memory, long-term memory, working memory, paired association, fMRI, pattern classification

Introduction

Working memory refers to the retention of information in conscious awareness when this information is not present in the environment, to its manipulation, and to its use in guiding behavior. For decades, an influential view has held that working memory functions are supported by the operation of specialized systems that act as buffers for the storage and manipulation of information (Baddeley and Hitch, 1974; Goldman-Rakic, 1987). Neurobiological evidence consistent with this view includes evidence suggesting that prefrontal cortex (PFC) may be a neural substrate for storage buffers of this model (Haxby et al., 2000; Constantinidis et al., 2001; Leung et al., 2002; Courtney, 2004; Narayanan et al., 2005; Zarahn et al., 2005). Furthermore, many neurobiological models of working memory function assume a critical role for PFC neurons whose specialized properties support sustained delay-period activity (Machens et al., 2005; Durstewitz and Seamans, 2006; Mongillo et al., 2008). However, there is also a large body of results from neuropsychological, neurophysiological, and neuroimaging studies that is difficult to reconcile with the depiction of the PFC as a critical substrate for working memory storage (for review, see Postle, 2006). This points to an alternative view that depicts working memory as being emergent from the coordinated recruitment, via attention, of brain systems that have evolved to accomplish sensory-, representation-, and action-related functions. This alternative, therefore, emphasizes the temporary activation of long-term memory (LTM) representations (Anderson, 1983; Cowan, 1995; Oberauer, 2002; Ruchkin et al., 2003). This activated LTM model, however, also remains controversial (Baddeley, 2003; Düzel, 2003; Kroger, 2003; Logie and Della Sala, 2003; Majerus et al., 2003; Vallar, 2003).

Several previous studies have produced data that are consistent with the activated LTM model by demonstrating sustained delay-period activity in regions that are associated with LTM representation of the stimulus domain being tested. For example, face-specific delay-period activity has been localized to regions of inferior temporal cortex that are believed to support the perception and long-term retention of faces (Druzgal and D'Esposito, 2003; Postle et al., 2003; Ranganath et al., 2004b; Postle, 2005). Such results cannot be interpreted as direct tests of this model, however, because to do so would be to commit the logical fallacy of “reverse inference” (affirming the consequent) (Poldrack, 2006). The present experiment was designed to support stronger inference than these previous studies by applying multivariate pattern classification to functional magnetic resonance imaging (fMRI) data: first, we recorded distinct patterns of neural activity corresponding to the engagement of LTM processes; second, we tested for the reinstatement of these patterns during the short-term retention of the same stimuli. This study thereby effected a decisive test of the hypothesis that the temporary activation of LTM representations contributes to the short-term retention of information in working memory.

Materials and Methods

Subjects

Ten subjects (all right-handed; seven males and three females; age, 19–32 years) were recruited from the undergraduate and medical campuses of the University of Wisconsin–Madison. None reported any medical, neurological, or psychiatric illness, and all gave informed consent.

Overview of study

The experiment proceeded in three phases, each implemented with E-Prime software (Psychology Software Tools). First, subjects performed an LTM task for three categories of visual stimuli: famous people, famous locations, and common objects (Fig. 1A). The evaluation of these stimuli required perception and long-term semantic and/or episodic memories. (For example, to render a judgment of like or dislike for the stimulus “John Wayne,” one would have to retrieve knowledge about the famous actor John Wayne, and perhaps also memories of scenes from his movies.) This procedure replicated that of a recent study of free recall which demonstrated that category-specific brain activity patterns arising from this stimulus-judgment task reliably reappeared just before the verbal recall of an item from a category (Polyn et al., 2005). Second, outside the scanner, subjects learned six pairings of 12 stimuli that were selected at random from the training set. Finally, subjects performed delayed paired-associate recognition (a “working memory task”) on these stimulus pairs (Fig. 1B). A pattern classifier was trained to distinguish category-specific patterns of brain activity from the first task and was then used to decode brain activity from the delay-period of the second task. We reasoned that the only way delay-period activity from the working memory task could be classified by an LTM-trained pattern classifier would be if delay-period activity significantly matched LTM patterns of activity, i.e., if the working memory task produced the temporary reinstatement of the LTM activity. In this way, successful decoding of working memory activity would provide conclusive evidence for the activated LTM model.

Figure 1.

Figure 1.

Long-term and working memory trials. A, B, The timelines are shown for LTM trials (A) and delayed paired-associate working memory trials (B). The dashed lines connecting the boxes to the timelines indicate the onset and duration of each trial phase.

Behavioral procedures

Stimulus judgment task (“LTM task”).

Subjects viewed a total of 90 stimuli drawn from three categories: 30 famous people, 30 famous locations, and 30 common objects. They then indicated (on a four-point Likert scale, using a stimulus-response box) how much they liked the celebrity, how much they would like to visit the location, or how often they encountered the object in everyday life. Each stimulus was presented one time only, for a total of 90 randomly ordered stimulus presentations. Each LTM trial consisted of a cue period (2 s), a stimulus period (5 s), and a judgment period (3 s). After each trial, subjects performed an arithmetic task (16 s) to reduce interference between trials (Polyn et al., 2005).

Paired-associate learning.

Twelve stimuli (four people, four locations, and four objects) were then selected at random from the training set and paired arbitrarily so that one stimulus from each category was paired with each of the three categories. This produced three pairs with stimuli from the same category (e.g., person1–person2) and three pairs with stimuli from different categories (e.g., person3–location1). Outside the scanner, subjects learned these within-category and between-category pairs through repeated three-alternative forced-choice testing (with foils drawn from the set of 12) until they achieved a criterion level of performance of 24 consecutive correct trials.

Delayed paired-associate recognition (working memory task).

Subjects then performed delayed paired-associate recognition in the scanner. Each working memory trial consisted of a target stimulus (1 s), a delay period (11 s), a probe stimulus (1 s), and an intertrial interval (ITI) (13 s). Subjects indicated with a “Yes” or “No” button press whether the probe stimulus was the correct associate of the target stimulus. The trial depicted in Figure 1B is an example of a person–location trial: the target is a person (John Wayne), and the probe is a location (Vatican City). Each stimulus pair was presented 12 times (six times in each direction; i.e., the person–location pair was presented six times with the person as target and six times with the location as target). Therefore, there were 36 (3 pairs × 12 exposures) within-category trials and 36 between-category trials. Trials were configured such that there was a probability of 0.5 that the probe stimulus was the correct associate of the target, with foils drawn equally from all categories; thus, the probe stimulus belonged to the correct category on two-thirds of all trials. In addition, subjects performed 24 no-memory trials in which they were presented with a target stimulus not from the learned set. They were instructed ahead of time that these trials did not require memory because the target stimulus would always reappear as the probe stimulus; simply a “No” button press was required when the probe stimulus appeared.

Validation subjects and experimental subjects

To validate our method we instructed two of the subjects (one male and one female) to solve the working memory task using a prospective strategy (i.e., “as soon as you see the first picture, quickly recall its associate and hold the associate in mind for the duration of the delay period”). This created a situation in which we would know the contents of the subjects' working memory independent of the experimental data and could thereby assess the validity of the classification of the delay-period activity. After successful validation, we recruited eight new subjects who received no instruction about performance strategy, allowing us to interrogate delay-period activity in an unbiased manner. The data from these “experimental” subjects were used to test our hypotheses. Statistical analyses in validation subjects focused on the within-subject differences in classifier estimates, and all between-category working memory trials were pooled together and treated as independent replications (n = 2; dferror = 71). For experimental subjects, statistical comparisons also focused on the within-subject differences in classifier estimates, but trial averaging was used, yielding a single set of data for each subject (n = 8; dferror = 7).

fMRI acquisition and preprocessing

Whole-brain images were acquired with a 3T scanner (Signa VH/I; GE Healthcare). For all volunteers, we acquired high-resolution T1-weighted images (30 axial slices, 0.9375 × 0.9375 × 4 mm). We used a gradient-echo, echo-planar sequence [repetition time (TR), 2000 ms; echo time, 50 ms] to acquire data sensitive to the blood oxygen level-dependent signal within a 64 × 64 matrix (30 axial slices coplanar with the T1 acquisition, 3.75 × 3.75 × 4 mm). Six scans of both the LTM and working memory tasks were obtained for each subject, each scan lasting 6 min, 50 s (LTM) and 7 min 16 s (working memory). All task runs were preceded by 20 s of dummy pulses to achieve a steady state of tissue magnetization. Preprocessing of the functional data were done with the Analysis of Functional NeuroImages (AFNI) software package (Cox, 1996) using the following preprocessing steps, in order: correction for slice time acquisition and rigid-body realignment to the first volume from the experimental task with 3dvolreg; removal of signal spikes with 3dDespike; removal of the mean from each voxel and linear and quadratic trends from within each run with 3dDetrend; and correction for magnetic field inhomogeneities (using in-house software). Finally, functional data from the working memory task were aligned to data from the LTM task using 3dAllineate. Note that spatial smoothing was not imposed, and the data were not spatially transformed into a common atlas space before hypothesis testing. Rather, the data from each subject were analyzed in that subject's unsmoothed, native space. For classification analyses, a feature selection ANOVA was applied to the preprocessed images to select those voxels whose activity varied significantly (p < 0.05) between people, location, and object categories over the course of the LTM experiment. These feature-selected voxels served as input nodes to the pattern classifiers (mean, 8489 voxels; SD, 2107).

Classifier training

MVPA (The Princeton Multi-Voxel Pattern Analysis Toolbox) (http://www.csbmb.princeton.edu/mvpa), in conjunction with the Matlab Neural Network Toolbox (MathWorks), was used for all pattern classification analyses. Preprocessed fMRI signal from the initial 10 s of each LTM trial associated with the cue, stimulus, and judgment periods were used to train a two-layer feedforward neural network (via backpropagation) to distinguish three patterns of brain activity corresponding to the study of people, locations, and objects (Fig. 1A). To reduce prediction error in analyses involving the pattern classifier, the reported classifier outputs were the average of 50 backpropagation networks, each initialized with a different set of random weights (Bishop, 1995). Data for all classification analyses were shifted in time by 6 s to account for hemodynamic lag. We evaluated classification accuracy by training on five blocks of data (fMRI task runs) and testing on the novel sixth block. The blocks used for training were then rotated, and a new block of data was tested until all six blocks of data had been classified.

Because interpretation of activity from the LTM task is central to the logic of this experiment, further procedural and theoretical discussion will be useful. Procedurally, inclusion of the cue and judgment periods in training (when the visual stimulus was absent) replicated the procedures of Polyn et al. (2005). To assess empirically whether these data should be included, we calculated the classification accuracy at each time point of the 10 s training window and found that category discrimination was well above chance throughout the entire period, with marginally enhanced discrimination during stimulus presentation. Thus, we are confident that comparable stimulus category-specific activity was being evoked during all three trial phases. At a theoretical level, it is important to establish whether or not working memory was engaged during the LTM task. (After all, successful decoding of working memory task data would not be surprising if the LTM task also engaged working memory.) In this regard, it is helpful to emphasize that the activated LTM hypothesis posits a mechanism for the short-term retention of stimulus-specific information. From this perspective, it is clear that the LTM task did not require subjects to retain a stimulus representation across a delay period when the stimulus was not present in the environment. What was required in the LTM task, in addition to perception and retrieval from LTM, was the maintenance of the rules that guided behavior on this task. Although some definitions of working memory incorporate the retention of rules and/or of behavioral set, the activated LTM model does not relate to these constructs. Therefore, the LTM task was suitable for testing the activated LTM hypothesis.

Voxel discrimination maps

To assess the relative importance of different brain areas to the classification of the stimulus categories, we calculated, from the trained pattern classifier, which voxels discriminated between the patterns of activity corresponding to each stimulus category. We started with the voxel importance formula from Polyn et al. (2005): impij = wij * avgij, where wij is the weight between input unit i and output unit j, and avgij is the average activity of input i during study of category j. To quantify the discriminability of each voxel, we then computed this value for all output categories and created their ratio, yielding discrimij = impij/(impij + impik + impil), where impij is as before, and impik and impil are the voxel importance values for categories k and l, respectively. These discrimination scores for each voxel gave an indication of how much that voxel activated a given output over and above the other outputs (e.g., the ratio indicated how much a voxel activated the “person” output compared with the “location” and “object” outputs). This analysis produced three separate distributions (centered on 0.33) of voxelwise discrimination scores corresponding to each category. We used two SDs above the mean of each distribution as the threshold for including a voxel in the “discrimination map” for a given category.

Decoding delay-period activity

A trained pattern classifier for each subject, trained on all six blocks of LTM data, was used to assess the extent to which category-specific patterns of brain activity could be identified in the delay-period of the subsequent delayed paired-associate recognition task. Preprocessed fMRI signal at intervals of TR = 2 s was classified from the initial 16 s of each working memory trial (Fig. 1B) associated with target presentation (1 s), delay period (11 s), probe presentation (1 s), and the first 3 s of ITI. Possible contamination of delay-period estimates from probe stimulus processing (after the 6 s hemodynamic adjustment) was not a concern, because this processing would be expected to introduce noise, not coherent activity. This follows from the fact that the stimulus presented as the probe was from the same category as the associate of the target on only two-thirds of the trials, with the remaining trials presenting foils, which were drawn from a different category than the expected associate [see above, Delayed paired-associate recognition (working memory task)].

Our paired-associate recognition procedure, modeled after previous studies in monkeys (Takeda et al., 2005), permitted us to investigate retrospective and prospective coding in working memory. Assuming that subjects solved the task by actively representing task-relevant information across the delay period, the delay-period brain activity would be expected to reflect either retention of a representation of the target stimulus (a “retrospective” code), or the recall (from LTM) and retention of a representation of the paired-associate stimulus (a “prospective” code), or some combination thereof. There was an important distinction in our design between the two types of delayed-recognition memory trials: within-category trials and between-category trials. For within-category trials, regardless of whether a subject maintained a retrospective representation of the target or a prospective representation of its associate, the delay-period activity patterns would be expected to reflect the same stimulus category. Between-category trials were the critical trials, however, because they could distinguish retrospective from prospective neural representations. If, for example, a person-like delay-period activity pattern was identified on a person–object trial, this would indicate that the subject was remembering the item presented at the beginning of the trial. Prospective delay-period activity could only occur if, on seeing the target stimulus (a person in this example), the subject retrieved from LTM the representation of its associate, an object, and retained this representation in working memory for the remainder of the trial. Only data for between-category trials are presented here. Results for within-category and no-memory control trials are available in the supplemental material, available at www.jneurosci.org.

It is important to note that support for the activated LTM hypothesis would come in the form of successful decoding of delay-period activity as task-relevant versus task-irrelevant. Whether this task-relevant activity is in the form of prospective or retrospective activity is not critical for this principle hypothesis. Rather, the prospective versus retrospective distinction was included (1) for validation of the methodology (see Fig. 4A) and (2) to explore at a finer grain of detail the nature of the short-term retention of information across individuals (see Fig. 4C).

Figure 4.

Figure 4.

Classification of between-category trials. A, C, Mean classifier estimates of delay-period activity are shown for the two validation subjects who were instructed to solve the working memory task using a prospective strategy (A) and for the eight experimental subjects who received no instruction on which coding strategy to prioritize during the task (C). Note that the results in A validate the method, in that delay-period activity classification matches the cognitive strategy that these two subjects were instructed to adopt. The vertical axis shows the classifier estimates of the correspondence between the delay-period brain activity and the learned patterns of category-specific activity (for people, locations, and objects) associated with the stimuli from each trial. Estimates are collapsed across all 36 between-category trials for each subject into three categories: retrospective (the category of the target), prospective (the category of its associate), and other (task-irrelevant). The horizontal axis shows the timing of the working memory trial. The black bars along this axis indicate the presentations of target and probe stimuli (target, 0–1 s; probe, 12–13 s). C, Bottom, The bar graphs indicate the number of subjects that demonstrated each type of category selectivity at each time point (black, retrospective; purple, prospective; gray, other). These selectivity statistics were calculated by assigning each subject to a particular category based on the largest average estimate of the three categories for each point. B, Statistical comparisons for validation subjects focused on the within-subject difference between the prospective classifier estimate and the average of the nonprospective estimates (retrospective and other). D, For experimental subjects, statistical comparisons focused on the within-subject difference between the average of the task-relevant estimates (retrospective and prospective) and the task-irrelevant estimate (other). *p < 0.01 (B), and *p < 0.05 (D). Error bars represent a 95% confidence interval around the within-subject difference scores.

Prefrontal cortex activity

In monkeys, neural activity in lateral PFC has been found to initially represent the target object (a retrospective code) but, toward the end of the delay period, to reflect the anticipated associate object (a prospective code) (Rainer et al., 1999). In a different study, robust delay-period neuronal activity in inferior temporal cortex was found to actively maintain prospective information, whereas retrospective representations were attenuated and often replaced by distractor information (Takeda et al., 2005). Much of the work on human prospective memory has emphasized the role of the PFC in accounts that emphasize the planning and cognitive control aspects of prospective memory (D'Esposito et al., 2000; Curtis et al., 2004; Burgess et al., 2007). However, there has also been evidence for prospective coding outside PFC, e.g., in caudate nucleus (Postle and D'Esposito, 2003) and in inferior temporal cortex (Ranganath et al., 2004a).

In the present study, we interrogated the role of PFC activity in working memory retention by attempting to decode delay-period activity from (1) only voxels located inside PFC (PFC-only condition) and (2) all voxels posterior to PFC (no-PFC condition). Anatomically derived PFC masks were generated for each subject in AFNI by backward-transforming a TT_Daemon atlas mask (consisting of Brodmann areas 8–11, 44–46) into that subject's native space. The number of voxels passing feature selection in each condition was 1069 ± 283 for PFC-only and 7421 ± 1907 for no-PFC. So that the number of voxels interrogated in each condition was equivalent, for each subject we sampled randomly from the no-PFC voxels and created 50 new masks that contained the same number of voxels as in the PFC-only condition. These PFC-only and (equivalently sized) no-PFC masks were selectively applied during classification to constrain the voxel populations used for analyses.

Results

Classifier training

For all subjects, activity from the LTM task was reliably classified as consistent with the appropriate category of the trial (Fig. 2). Based on one-tailed t tests, classifier prediction accuracy was significantly above chance (0.333) for each trial type, with p < 0.001 (people, t(9) = 21.2; locations, t(9) = 11.0; objects, t(9) = 14.6). We performed further analyses by (1) masking out all voxels in PFC (no-PFC condition) and (2) masking out all voxels outside PFC (PFC-only condition). Classifier training on the LTM data, when restricted to these voxel populations, revealed successful discrimination of category-specific activity, significant at p < 0.001 (Fig. 2) (no-PFC: people, t(9) = 19.7; locations, t(9) = 10.7; objects, t(9) = 12.6; PFC-only: people, t(9) = 9.5; locations, t(9) = 5.5; objects, t(9) = 8.8). Although classification accuracy was greater in the no-PFC compared with the PFC-only condition, the prediction accuracy in both conditions was reliably above chance. That is, activity both in PFC and in more posterior brain regions demonstrated reliable category-specific patterns of activity during the LTM task.

Figure 2.

Figure 2.

Classifier training accuracy on LTM trials. The prediction accuracy of the classifier for identifying the correct category of brain activity in the LTM trials is shown for three analysis conditions: whole-brain (Whole); with PFC voxels masked out (No-PFC); and with all voxels outside of PFC masked out (PFC-Only). The accuracy scores represent the mean prediction accuracy for the initial 10 s of all LTM trials across all subjects. Accuracy is shown separately for each trial type: People, Locations, and Objects. The dashed line represents the chance level of prediction at 0.333 (the classifier could predict person, location, or object at each time point). Error bars are 95% confidence intervals for the one-tailed t tests at p < 0.001.

Distributed representations

Average voxel activity and network weights from the pattern classifier were analyzed to estimate the extent to which each voxel discriminated between the three categories (see Materials and Methods). Figure 3 illustrates the voxels that exerted the strongest influence in discriminating each of the three stimulus categories (for one representative subject). Although canonical category-selective areas contributed to the classification of the three categories (e.g., the mid-fusiform gyrus for person stimuli, parahippocampal gyrus for location stimuli, and lateral occipital cortex for object stimuli), these regions did not solely drive classification. Each was a component of a distributed network of brain regions involved in the classification of a particular category. This replicates the findings of Polyn et al. (2005). Data summarizing the category discrimination analysis in specific anatomical regions across all 10 subjects are presented in Table 1.

Figure 3.

Figure 3.

Classifier-derived voxel discrimination maps. Classifier-derived voxel discrimination maps (from the LTM task) from one representative subject (subject 4). Voxels are colored that were important for detecting patterns of brain activity corresponding to the perception and LTM of people (blue), locations (red), and objects (green). FG, Fusiform gyrus; PG, parahippocampal gyrus; MTG, middle temporal gyrus; LO, lateral occipital cortex; R, right; L, left.

Table 1.

Distribution of classifier-derived discriminant voxels

Category FG (L/R) PG (L/R) LO (L/R) V1 (L/R) msFP (midline) MTG (L/R) MFG (L/R) SFS (L/R) PrCG (L/R) PC (L/R)
People 8/10 1/1 7/3 7/9 8 6/8 6/8 5/2 1/2 8/10
Locations 5/5 9/10 6/8 9/9 3 3/2 4/6 1/3 1/0 5/8
Objects 10/10 9/2 10/8 9/8 10 9/5 7/7 10/7 6/4 7/6

Data indicate the number of subjects (of 10) for whom category-discriminating voxels were found in particular anatomical regions. For example, important voxels for the discrimination of location representations were found in the left parahippocampal gyrus (PG) in 9 subjects, and in the right PG in all 10 subjects. FG, Mid-fusiform gyrus; PG, parahippocampal gyrus; LO, lateral occipital cortex; V1, cuneus, calcarine sulcus, caudal lingual gyrus; msFP, medial superior frontal pole; MTG, middle temporal gyrus; MFG, middle frontal gyrus; SFS, superior frontal sulcus; PrCG, precentral gyrus; PC, posterior cingulate/precuneus; L, left; R, right.

Decoding delay-period activity

Subjects performed the delayed paired-associate recognition trials with near perfect accuracy (98 ± 0.05%). All between-category trials (correct and incorrect) were included in the analyses.

Validation subjects

Classification results for the two subjects whose data served as procedural validation are shown in Figure 4A. These subjects were instructed to concentrate on the anticipated associate stimulus during the delay period (i.e., to solve the working memory task using a prospective strategy). The classifier reliably identified their delay-period activity as reflecting a prospective representation of the associate throughout the delay-period (paired t(71) = 4.9; p < 0.001) (Fig. 4B).

Experimental subjects

Having validated our method by demonstrating successful decoding of brain activity in subjects for whom we had independent knowledge of their mental state, we now consider the eight experimental subjects in Figure 4C. Critically, the pattern classifier reliably decoded delay-period activity as task relevant. Classifier estimates of task-relevant retrospective and prospective activity together were reliably higher throughout the delay period than estimates of task-irrelevant activity (paired t(7) = 3.1; p < 0.01) (Fig. 4D). Had delay-period activity not reinstated the activity patterns from the LTM task, in contrast, the classifier would have been expected to produce indistinguishable estimates for each category. (This was verified empirically by randomizing the labels of delay-period data at each time point and reclassifying, which produced statistically inseparable classification estimates.) For these eight subjects, prospective activity was not separable from retrospective activity until the very end of the delay period (10–12 s), when the patterns of activity were reliably more prospective than retrospective (paired t(7) = 2.8; p < 0.025). This prospective coding was likely a neural reflection of the cognitive anticipation of the probe stimulus that was to appear after the delay. As a result of the prospective bias at this time point, the task-relevant (i.e., average of prospective and retrospective estimates) versus task-irrelevant distinction was not significant (Fig. 4D). Beginning with probe onset (time, 12 s), the retrospective and prospective categories were no longer separable, and these task-relevant categories continued to be indistinguishable from the task-irrelevant category. It is important to emphasize that it is not only the recognition of prospective coding but also retrospective coding that implicates the temporary activation of LTM for working memory retention. Assuming that delay-period activity reflects the active retention of information in working memory, the observation of category-specific activity corresponding either to the target or its associate during the delay indicates that the short-term retention of information was accomplished, at least in part, by the temporary activation of LTM patterns for these stimuli.

For no-PFC analyses, the decoding of subsequent delay-period activity was successful (Fig. 5A,B), with task-relevant activity statistically separated from task-irrelevant activity (paired t(7) = 4.3; p < 0.005). Note that the qualitative pattern of delay-period estimates from the no-PFC analyses was very similar to those from the original whole brain analysis (Fig. 4C). For PFC-only analyses, the decoding of delay period activity failed (Fig. 5C,D). The numerical classifier match values for all categories decreased relative to no-PFC, and task-relevant activity was not distinguished from task-irrelevant activity (paired t(7) = −0.7, NS). Indeed, the latter was numerically higher than the former for much of the delay period.

Figure 5.

Figure 5.

Classification results for PFC. A, C, Mean classifier estimates of delay-period activity averaged across all eight experimental subjects are shown when all PFC voxels were removed from analysis (A) and when only PFC voxels were included (C). B, D, Statistical comparisons of task-relevant versus task-irrelevant estimates for both conditions. Graph conventions are as described in Figure 4.

Discussion

The analyses presented here tested directly the hypothesis that patterns of brain activity evoked when subjects perceived and evaluated visual stimuli (drawing on semantic and episodic LTM) would be reinstated when representations of these stimuli were held in working memory. Consistent with this hypothesis, we decoded delay-period activity with a pattern classifier that was trained on LTM activity. Thus, these data support the idea that the short-term retention of information can be supported by the temporary activation of LTM representations. This model does not necessarily rule out a parallel contribution to delay-task performance by specialized working memory systems. However, in this study we were unable to find evidence for category-specific delay-period activity in the PFC, a region proposed to support such systems. Although activity patterns in PFC during the LTM task were dissociable for people, location, and object stimuli, these category-specific activity patterns were not reinstated during the delay period of the working memory task. This contrasts with more posterior brain regions in which category-specific patterns of LTM activity re-emerged during the delay period (Fig. 5).

Similar to findings from the monkey (Takeda et al., 2005), we observed that sustained delay-period activity in human brains corresponded to both retrospective and prospective representations. Across subjects, there were qualitatively different classification results for delay-period activity. Figure 6 contrasts individual results for two subjects. On location–person trials, the delay-period activity for Subject 1 was classified as being the most consistent with location activity for the first 4 s of the delay. For the remainder of the delay period, the brain activity was classified as most resembling patterns of person activity. We interpret that this subject maintained a representation of the target (a location), then retrieved from LTM a representation of its associate (a person), and actively maintained this representation for the remainder of the delay period. That is, the subject temporarily activated and maintained an LTM representation in the same brain regions in which those representations were initially observed. The crossover from a representation of a location to that of a person suggests a transition from retrospective to prospective coding during the delay period of these trials. This coding strategy was evident across all between-category trials for this subject (Fig. 6B). Subject 6, in contrast, maintained person activity throughout the delay period of person–location trials (Fig. 6C). This demonstrates a retrospective trace of the target stimulus. This retrospective strategy was evident across all between-category trials for this subject (Fig. 6D). These data suggest intersubject variability in the cognitive strategy used to solve the working memory task. We also observed substantial intrasubject variability, in that some subjects displayed retrospective coding for some trial types and prospective coding for others. Averaging classifier estimates across trials for all subjects diminished these opposing trends and produced results that demonstrated no reliable bias across our sample for retrospective versus prospective coding but, importantly, captured a reliable dissociation between task-relevant and task-irrelevant patterns of activity.

Figure 6.

Figure 6.

Individual variability in classification results. Classifier estimates of delay-period activity are shown from between-category trials for two representative subjects, illustrating individual differences in task strategy. A, C, Results for a specific between-category trial type (location–person) are shown for Subject 1 (A) and for the complementary trial type (person–location) for Subject 6 (C). B, D, The classifier estimates for all 36 between-category trials from each subject were recoded into the categories Retrospective, Prospective, and task-irrelevant Other in B and D, respectively. Graph conventions are as described in Figure 4.

Footnotes

This work was supported by National Institutes of Health Grant MH064498 (B.R.P.). We thank Sean Polyn for his helpful suggestions and for stimuli, Ken Norman and his group at Princeton University for providing insightful feedback on our analyses, and Tim Rogers for helpful discussions on this work.

References

  1. Anderson JR. The architecture of cognition. Cambridge, MA: Harvard UP; 1983. [Google Scholar]
  2. Baddeley AD. New data: old pitfalls. Behav Brain Sci. 2003;26:729–730. [Google Scholar]
  3. Baddeley AD, Hitch GJ. Working memory. In: Bower GH, editor. The psychology of learning and motivation. New York: Academic; 1974. pp. 47–89. [Google Scholar]
  4. Bishop CM. Neural networks for pattern recognition. Oxford: Oxford UP; 1995. [Google Scholar]
  5. Burgess PW, Dumontheil I, Gilbert SJ, Okuda J, Schölvinck ML, Simons JS. On the role of rostral prefrontal cortex (area 10) in prospective memory. In: Kliegel M, McDaniel MA, Einstein GO, editors. Prospective memory: cognitive, neuroscience, developmental, and applied perspectives. Mahwah, NJ: Erlbaum; 2007. pp. 235–260. in press. [Google Scholar]
  6. Constantinidis C, Franowicz MN, Goldman-Rakic PS. The sensory nature of mnemonic representation in the primate prefrontal cortex. Nat Neurosci. 2001;4:311–316. doi: 10.1038/85179. [DOI] [PubMed] [Google Scholar]
  7. Courtney SM. Attention and cognitive control as emergent properties of information representation in working memory. Cogn Affect Behav Neurosci. 2004;4:501–516. doi: 10.3758/cabn.4.4.501. [DOI] [PubMed] [Google Scholar]
  8. Cowan N. Attention and memory: an integrated framework. New York: Oxford UP; 1995. [Google Scholar]
  9. Cox RW. Software for analysis and visualization of functional magnetic resonance neuroimages. Comput Biomed Res. 1996;29:162–173. doi: 10.1006/cbmr.1996.0014. [DOI] [PubMed] [Google Scholar]
  10. Curtis CE, Rao VY, D'Esposito M. Maintenance of spatial and motor codes during oculomotor delayed response tasks. J Neurosci. 2004;24:3944–3952. doi: 10.1523/JNEUROSCI.5640-03.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. D'Esposito M, Ballard D, Zarahn E, Aguirre GK. The role of prefrontal cortex in sensory memory and motor preparation: an event-related fMRI study. Neuroimage. 2000;11:400–408. doi: 10.1006/nimg.2000.0571. [DOI] [PubMed] [Google Scholar]
  12. Druzgal TJ, D'Esposito M. Dissecting contributions of prefrontal cortex and fusiform face area to face working memory. J Cogn Neurosci. 2003;15:771–784. doi: 10.1162/089892903322370708. [DOI] [PubMed] [Google Scholar]
  13. Durstewitz D, Seamans JK. Beyond bistability: biophysics and temporal dynamics of working memory. Neuroscience. 2006;139:119–133. doi: 10.1016/j.neuroscience.2005.06.094. [DOI] [PubMed] [Google Scholar]
  14. Düzel E. Some mechanisms of working memory may not be evident in the human EEG. Behav Brain Sci. 2003;26:732. [Google Scholar]
  15. Goldman-Rakic PS. Circuitry of the prefrontal cortex and the regulation of behavior by representational memory. In: Mountcastle VB, Plum F, Geiger SR, editors. Handbook of neurobiology. Bethesda, MD: The American Physiological Society; 1987. pp. 373–417. [Google Scholar]
  16. Haxby JV, Petit L, Ungerleider LG, Courtney SM. Distinguishing the functional roles of multiple regions in distributed neural systems for visual working memory. Neuroimage. 2000;11:380–391. doi: 10.1006/nimg.2000.0592. [DOI] [PubMed] [Google Scholar]
  17. Kroger JK. Long-term memory, features, and novelty. Behav Brain Sci. 2003;26:744–745. [Google Scholar]
  18. Leung HC, Gore JC, Goldman-Rakic PS. Sustained mnemonic response in the human middle frontal gyrus during on-line storage of spatial memoranda. J Cogn Neurosci. 2002;14:659–671. doi: 10.1162/08989290260045882. [DOI] [PubMed] [Google Scholar]
  19. Logie RH, Della Sala S. Working memory as a mental workspace: why activated long-term memory is not enough. Behav Brain Sci. 2003;26:745–746. [Google Scholar]
  20. Machens CK, Romo R, Brody CD. Flexible control of mutual inhibition: a neural model of two-interval discrimination. Science. 2005;307:1121–1124. doi: 10.1126/science.1104171. [DOI] [PubMed] [Google Scholar]
  21. Majerus S, Van der Linden M, Collette F, Salmon E. Does sustained ERP activity in posterior lexico-semantic processing areas during short-term memory tasks only reflect activated long-term memory? Behav Brain Sci. 2003;26:746–747. [Google Scholar]
  22. Mongillo G, Barak O, Tsodyks M. Synaptic theory of working memory. Science. 2008;319:1543–1546. doi: 10.1126/science.1150769. [DOI] [PubMed] [Google Scholar]
  23. Narayanan NS, Prabhakaran V, Bunge SA, Christoff K, Fine EM, Gabrieli JD. The role of prefrontal cortex in the maintenance of verbal working memory information: an event-related fMRI analysis. Neuropsychology. 2005;19:223–232. doi: 10.1037/0894-4105.19.2.223. [DOI] [PubMed] [Google Scholar]
  24. Oberauer K. Access to information in working memory: exploring the focus of attention. J Exp Psychol Learn Mem Cogn. 2002;28:411–421. [PubMed] [Google Scholar]
  25. Poldrack RA. Can cognitive processes be inferred from neuroimaging data? Trends Cogn Sci. 2006;10:59–63. doi: 10.1016/j.tics.2005.12.004. [DOI] [PubMed] [Google Scholar]
  26. Polyn SM, Natu VS, Cohen JD, Norman KA. Category-specific cortical activity precedes retrieval during memory search. Science. 2005;310:1963–1966. doi: 10.1126/science.1117645. [DOI] [PubMed] [Google Scholar]
  27. Postle BR. Delay-period activity in prefrontal cortex: one function is sensory gating. J Cogn Neurosci. 2005;17:1679–1690. doi: 10.1162/089892905774589208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Postle BR. Working memory as an emergent property of the mind and brain. Neuroscience. 2006;139:23–38. doi: 10.1016/j.neuroscience.2005.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Postle BR, D'Esposito M. Spatial working memory activity of the caudate nucleus is sensitive to frame of reference. Cogn Affect Behav Neurosci. 2003;3:133–144. doi: 10.3758/cabn.3.2.133. [DOI] [PubMed] [Google Scholar]
  30. Postle BR, Druzgal TJ, D'Esposito M. Seeking the neural substrates of working memory storage. Cortex. 2003;39:927–946. doi: 10.1016/s0010-9452(08)70871-2. [DOI] [PubMed] [Google Scholar]
  31. Rainer G, Rao SC, Miller EK. Prospective coding for objects in the primate prefrontal cortex. J Neurosci. 1999;19:5493–5505. doi: 10.1523/JNEUROSCI.19-13-05493.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Ranganath C, Cohen MX, Dam C, D'Esposito M. Inferior temporal, prefrontal, and hippocampal contributions to visual working memory maintenance and associative memory recall. J Neurosci. 2004a;24:3917–3925. doi: 10.1523/JNEUROSCI.5053-03.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Ranganath C, DeGutis J, D'Esposito M. Category-specific modulation of inferior temporal activity during working memory encoding and maintenance. Brain Res Cogn Brain Res. 2004b;20:37–45. doi: 10.1016/j.cogbrainres.2003.11.017. [DOI] [PubMed] [Google Scholar]
  34. Ruchkin DS, Grafman J, Cameron K, Berndt RS. Working memory retention systems: a state of activated long-term memory. Behav Brain Sci. 2003;26:709–728. doi: 10.1017/s0140525x03000165. discussion 728–777. [DOI] [PubMed] [Google Scholar]
  35. Takeda M, Naya Y, Fujimichi R, Takeuchi D, Miyashita Y. Active maintenance of associative mnemonic signal in monkey inferior temporal cortex. Neuron. 2005;48:839–848. doi: 10.1016/j.neuron.2005.09.028. [DOI] [PubMed] [Google Scholar]
  36. Vallar G. The short-term/long-term memory distinction: back to the past? Behav Brain Sci. 2003;26:757–758. [Google Scholar]
  37. Zarahn E, Rakitin B, Abela D, Flynn J, Stern Y. Positive evidence against human hippocampal involvement in working memory maintenance of familiar stimuli. Cereb Cortex. 2005;15:303–316. doi: 10.1093/cercor/bhh132. [DOI] [PubMed] [Google Scholar]

Articles from The Journal of Neuroscience are provided here courtesy of Society for Neuroscience

RESOURCES