Abstract
Considerable research has focused on how basic visual features are maintained in working memory, but little is currently known about the precision or capacity of visual working memory for complex objects. How precisely can an object be remembered, and to what extent might familiarity or perceptual expertise contribute to working memory performance? To address these questions, we developed a set of computer-generated face stimuli that varied continuously along the dimensions of age and gender, and we probed participants’ memories using a method-of-adjustment reporting procedure. This paradigm allowed us to separately estimate the precision and capacity of working memory for individual faces, based on the assumptions of a discrete capacity model, and to assess the impact of face inversion on memory performance. We found that observers could maintain up to 4–5 items on average, with equally good memory capacity for upright and upside-down faces. In contrast, memory precision was significantly impaired by face inversion at every set size tested. Our results demonstrate that the precision of visual working memory for a complex stimulus is not strictly fixed, but instead can be modified by learning and experience. We find that perceptual expertise for upright faces leads to significant improvements in visual precision, without modifying the capacity of working memory.
Keywords: visual short-term memory, face inversion effect, face perception, face recognition, visual expertise, perceptual learning
Visual working memory is essential for our ability to maintain information about stimuli that are no longer in direct view. In daily life, we rely on this ability to maintain relevant information about objects in our environment and to facilitate the planning of actions toward such objects during everyday tasks (e.g., Hayhoe, Shrivastava, Mruczek, & Pelz, 2003; Hollingworth, 2004). Given that visual working memory is of such importance for normal functioning and yet has profound limitations, it has been the topic of considerable research and debate (e.g., Luck & Vogel, 1997; Alvarez & Cavanagh, 2004; Zhang & Luck, 2008; Bays & Husain, 2008; van den Berg, Shin, Chou, George, & Ma, 2012; Luck & Vogel, 2013). Researchers have sought to define the mechanisms of successful encoding, maintenance, and retrieval, as well as the properties of working memory that account for its fundamental limits.
Whereas early studies relied on change detection paradigms to estimate the number of visual objects that can be maintained in working memory (Phillips, 1974; Luck & Vogel, 1997), more recent studies have adopted psychophysical procedures that provide separate estimates of memory precision and capacity (Zhang & Luck, 2008). Using the method of adjustment, one can estimate the precision of memory for basic visual features that vary along a continuum, such as color or orientation, by having participants manipulate a probe stimulus along a relevant feature dimension to indicate their memory of a previously viewed test stimulus. Although it is debated whether working memory is better described by a small number of discrete representations (Zhang & Luck, 2008; Rouder et al., 2008) or by a continuous resource that can be flexibly subdivided to represent many items (Bays & Husain, 2008; van den Berg et al., 2012), research has demonstrated the utility of considering precision and memory capacity as separate psychological variables. For example, estimates of memory capacity, but not precision, are highly predictive of individual differences in fluid intelligence (Fukuda, Vogel, Mayr, & Awh, 2010) and further correlate with EEG measures of working memory maintenance (Anderson, Vogel & Awh, 2011). For the purposes of the present study, we rely on the mixture-model analysis of Zhang & Luck (2008) and its assumption of a discrete capacity limit to estimate the precision and capacity of visual working memory.
Recent studies have successfully characterized the precision of working memory for basic visual features, but much less is known about the resolution and capacity of working memory for complex objects. Unlike basic features, which can be readily manipulated to vary along a continuous dimension, it is less obvious how one might devise a set of complex objects that smoothly vary along a continuum. Previous studies using change detection paradigms have suggested that working memory capacity may be more limited for complex visual objects (Wheeler & Treisman, 2002; Xu, 2002; Alvarez & Cavanagh, 2004). For example, Alvarez and Cavanagh (2004) found that participants could successfully maintain more than four colored squares in working memory, on average, but could maintain no more than two shaded cubes or random polygons. Similarly, Curby and Gauthier (2007) reported finding greater working memory capacity for upright than inverted faces, which suggested that extensive perceptual training with upright faces led to an information processing advantage and thereby reduced the load on working memory.
However, a potential concern in these change detection studies is that differences in memory precision might be confused for differences in capacity (Wilken & Ma, 2004; Awh, Barton, & Vogel, 2007). It is now known that memory precision decreases with increasing set size (Zhang & Luck, 2008); thus, working memory performance could be disproportionately impaired at large set sizes if participants were to have generally poorer memory precision for a particular object class. Awh et al. (2007) found that the estimated capacity of working memory across different object categories was strongly predicted by the visual similarity (or confusability) of the stimuli within each object class. Moreover, when participants were required to detect large between-category changes (e.g., from cube to polygon) rather than subtle within-category changes (e.g., polygon to polygon), estimates of memory capacity were comparable across the different object classes. In another study, Scolari, Vogel, and Awh (2008) found an upright face advantage for detecting within-category changes in facial identity (e.g., from one face to another), replicating the findings of Curby and Gauthier (2007). However, the researchers found comparable performance for upright and inverted faces following between-category changes (e.g., from face to cube). The authors interpreted these findings to suggest that differences in memory precision may account for the differences in working memory performance previously reported for upright and inverted faces (Curby & Gauthier, 2007). Although these change detection studies suggest that memory precision is likely to be better for upright faces, the paradigm did not allow the researchers to directly quantify the precision of working memory for faces. Thus, it is difficult to specify how much more precisely participants can maintain upright as compared to inverted faces. Recent studies have demonstrated the importance of using continuous measures of memory for an item to obtain separate estimates of memory precision and capacity (Luck & Vogel, 2013), especially if the research goal is to determine whether an effect on working memory performance should be attributed to changes in memory precision, capacity, or both.
The purpose of our study was to directly evaluate the precision and capacity of visual working memory for the complex stimulus class of faces, and to determine how perceptual expertise might enhance working memory performance. Faces constitute a very important and socially relevant class of visual stimuli for which people have acquired a lifetime of expertise. Evidence of expertise for faces comes from the well-documented face inversion effect: when a face is simply turned upside-down, both perceptual discrimination and recognition judgments are consistently impaired (Yin, 1969; Valentine, 1988; Tanaka & Sengco, 1997; Tong & Nakayama, 1999; McKone & Yovel, 2009). Stimulus inversion causes a disproportionate impairment in face recognition, when compared with the recognition of other types of objects (Yin, 1969), unless participants have acquired considerable expertise for that specific object class (Diamond & Carey, 1986; Gauthier & Tarr, 1997; but see also McKone, Kanwisher & Duchaine, 2007).
In the present study, our motivation for using face stimuli to investigate working memory was two-fold. First, faces vary along multiple dimensions, and people are adept at distinguishing subtle variations between individual faces in many of these dimensions (e.g., O’Toole, Vetter, Troje, & Bülthoff, 1997; Webster, Kaping, Mizokami, & Duhamel, 2004). By generating faces that varied along the dimensions of both gender and age, we created a smoothly changing circular face space (see Figure 1). The circular nature of this space was critical, as it allowed participants to precisely report their memory of individual faces by method of adjustment, and therefore yielded estimates of precision and capacity that were uncontaminated by response biases. Second, people have far greater perceptual expertise for upright than inverted faces. Our goal was to test whether extensive experience with upright faces leads to improvements in the precision or capacity of working memory, based on the separate quantification of these parameters using the mixture model approach.
In Experiment 1, participants were presented with upright and inverted faces at set sizes of one, three and five, and we obtained estimates of working memory precision and capacity for each subject and experimental condition. In Experiment 2, we tested a larger set size of 6 faces and implemented an articulatory suppression procedure to discourage the use of verbal encoding strategies. Across both experiments, we find consistently better memory precision for upright than inverted faces, but no effect of face orientation on memory capacity.
Experiment 1
Methods
Participants
A total of 8 participants (ages 19 – 31, 6 female) participated in Experiment 1, which consisted of 6 one-hour experimental sessions. Participants included students or employees at Vanderbilt University, as well as members of the surrounding Nashville community. All participants had normal or corrected-to-normal vision. Participants were paid for their time, and they could earn additional pay for performing well on the task. Two of the eight participants were removed from the analysis for failing to follow the task instructions (submitting mostly random responses even at set size 1). This study was approved by the Vanderbilt University Institutional Review Board, and all participants gave written informed consent.
Stimuli
A set of 80 grayscale 3D face images were generated, using the FaceGen Modeller software package (Singular Inversions, Inc.), to form an approximately circular face space. First, we created eight 3D-rendered faces that varied along the dimensions of age and gender, to form an octagonal space (Figure 1). Next, each pair of neighboring faces was combined in varying proportions (10/90, 20/80, … 90/10) using a linear morphing procedure to create a total of 80 unique faces. For our analyses of memory performance, we consider these 80 face stimuli to span a 360° circular space and assume even spacing between neighboring faces, such that the difference between any two neighboring faces is equivalent to 4.5°.
All face stimuli were normalized to equate for mean luminance, and were displayed at a size of 5.2° × 6.5° of visual angle. In addition to the face stimuli, a set of 80 noise stimuli were constructed that shared the outline shape of a corresponding face stimulus, but consisted of random noise (1/F noise) presented within the bounding mask.
Stimuli were presented on a 20″ LCD monitor using Matlab and the Psychophysics Toolbox (Brainard, 1997; Pelli, 1997). Participants viewed the display at a distance of 44 cm, with head position stabilized by a forehead and chin rest.
Procedure
Each participant completed 6 one-hour sessions, consisting of 126 trials each, for a total of 756 trials per person. Working memory performance was measured for upright and inverted faces, shown in separate 21-trial blocks. Upright and inverted stimulus blocks were interleaved, and the inversion condition of the first block of the first session was counterbalanced across participants. On a given trial, participants were presented with 1, 3, or 5 faces chosen randomly from the stimulus space (see Figure 2). The stimuli were presented one at a time at different spatial locations at an eccentricity of 6.5° relative to a central fixation dot. The polar angles between stimulus locations were chosen pseudo-randomly on each trial such that no two stimulus locations overlapped. Pilot testing indicated that the faces were too visually similar to allow for accurate discrimination in the periphery, so participants were instructed to fixate on the central dot at the beginning of each trial, and then to look directly at each face as it was presented for 1500ms in its respective spatial location. This relatively long stimulus presentation duration was used to ensure that participants had enough time to saccade to each item and fully encode it into working memory, mitigating the possibility that any observed face inversion effects are due to differences in the speed of visual encoding.
On trials with fewer than five faces, noise stimuli were substituted to equate the total number of stimuli shown. The presentation of all 5 face/noise stimuli was followed by a 3000ms retention interval, and then a 1000ms spatial cue (a circle with thicker outline) indicating which of the previously viewed faces would be tested. In the subsequent test period, one of the 80 possible faces was randomly selected and presented at the center of the screen. The participant used two keys on the keyboard to cycle through the face space, matching the appearance of the central face as precisely as possible to the probed face in memory, and made a key press to submit his/her response. Trials were separated by a 1000ms inter-trial interval. On inverted blocks, the face stimuli, noise stimuli, and test faces were all inverted.
At the beginning of the experiment, participants completed 12 practice trials (2 trials for each face orientation x set size condition). They were encouraged to remember as many stimuli as possible on each trial, as precisely as possible, and were told that they could earn bonus points by reporting the probed item with high accuracy. Participants received 2 bonus points for responding within 10 faces of the correct face (0–45° difference), and 1 bonus point for responding between 11 and 20 faces (45–90° difference) away from the correct face. Participants received an extra dollar for every 50 bonus points earned, and could earn up to $5 in bonus pay per one-hour session in addition to the standard payment rate of $10/hour. During the initial practice session, participants received feedback on every trial regarding the accuracy of their performance. During the main experiment, however, cumulative feedback was provided only at the end of each block of trials, during self-paced breaks. At the end of each experimental session, participants were informed of the cumulative amount of points earned and the corresponding dollar amount of bonus payment.
Data analysis
For each participant, we calculated response error as the angular difference between the reported face and the studied face. We applied Zhang and Luck’s (2008) mixture model to the distributions of these errors separately for each subject, set size, and upright/inverted condition by maximizing the likelihood function. This model assumes that the responses are a mixture of pure guesses, which are distributed uniformly across the face space, and responses based on noisy memory representations, which form a von Mises distribution centered on the studied face. This model has two free parameters: (1) the standard deviation of the von Mises reflects the precision (SD) of faces stored in working memory, and (2) the proportion of responses arising from the von Mises distribution, rather than the uniform guessing distribution, reflects the probability that a face was stored in memory. This proportion is then transformed into an estimate of working memory capacity (K) by multiplying the probability of successful memory by the set size in each condition. Within-subjects analyses of variance (ANOVA) and planned contrasts were performed to test for reliable differences in memory precision and capacity across the manipulations of face orientation and set size.
Results
We estimated the precision and capacity of visual working memory for each participant and condition, based on the observed differences between the reported faces and the studied faces. Figure 3 shows the distributions of response errors of a representative participant, and the corresponding fits of the mixture model. The distribution of response errors was binned for the purposes of illustration, but each data point was considered individually in fitting the model using maximum likelihood estimation. The entire face space is assumed to span 360°, and the error sizes and standard deviations are reported in degrees.
In general, our participants performed the task very well, as indicated by the group-averaged results showing the estimates of memory capacity (Figure 4A) and precision (Figure 4B). Participants were able to retain up to approximately 4.5 items on average, regardless of face orientation. A within-subjects ANOVA confirmed that the number of successfully maintained items increased as a function of set size (F(2,10) = 515.9, p ≤ .0001). More importantly, this analysis revealed no effect of face orientation on the number of items that were successfully maintained (F(1,5) = .56, p = .49), and no evidence of an interaction between face orientation and set size (F(2,10) = .64, p = .55). These results indicate that the capacity of visual working memory was not reliably impaired by face inversion.
In contrast, we found that face inversion led to consistently poorer memory precision, as indicated by larger standard deviations for inverted faces than upright faces across all set sizes. An ANOVA revealed that memory precision was significantly worse for inverted than upright faces (F(1,5) = 27.1, p =.003), with no evidence of an interaction effect between face orientation and set size (F(2,10) = .44, p = .66). Planned contrasts revealed a significant difference in precision at each of the three set sizes (paired t-tests, all p < .02). Thus, we find a consistent advantage for upright faces in memory precision, even when only a single face was maintained and the demands on working memory capacity were very modest.
Further analyses indicated that memory precision declined significantly with increases in set size, F(2,10)=4.918, p=.03. However, precision did not decline as rapidly as one might expect based on Zhang and Luck’s (2008) slots-plus-averaging model. This model assumes that when participants are given only a single item to remember, multiple slots can be used to maintain independent representations of that single item, leading to improvements in memory precision as would be predicted by signal averaging. For example, if a working memory system with 4 slots were used to encode a single item, precision for that single item should improve by a factor of 2, or the square root of the number of slots used for simultaneous encoding. In our study, average memory capacity exceeded a value of 4, yet the improvement in precision at set size 1 was very modest when compared to memory precision at set size 5. Due to our use of sequential presentation, participants could not anticipate exactly how many faces would appear on a given trial; as a consequence, they might not have been able to optimize their encoding strategy on set size 1 trials. Regardless, the results of Experiment 1 indicate a consistent advantage in memory precision for upright faces.
Experiment 2
We performed a second experiment in order to confirm the results from Experiment 1 and rule out potential alternative accounts of the finding that face inversion impairs precision but not capacity. Because the average capacity estimates in Experiment 1 were quite high, approximately 4.5 items out of 5, it is possible that a ceiling effect might have obscured our ability to detect an effect of face inversion on memory capacity in this experiment. In Experiment 2, we used a set size of 6 on every trial to ensure that participants’ working memory capacities were reliably exceeded when testing for differential effects of face orientation on capacity. In addition, we were concerned that some participants might employ a verbal encoding strategy to remember the face stimuli. To discourage the use of verbal encoding strategies, the second experiment included a concurrent articulatory suppression task (Murray, 1967), which is commonly used to prevent verbal rehearsal during memory tasks (Baddeley & Hitch, 1975).
Methods
Participants
A total of 14 participants (ages 18 – 32, 7 female) participated in Experiment 2, and each had normal or corrected-to-normal vision. All participants were members of the Vanderbilt/Nashville community and provided written informed consent. Participants were paid for their time, with additional pay for accurate performance on the working memory task, as described in Experiment 1. None of the participants in Experiment 2 had previously participated in Experiment 1.
Stimuli and Procedure
The stimuli and design of Experiment 2 were identical to those of Experiment 1, with the exception that 6 face stimuli were presented on every trial and participants performed a concurrent articulatory suppression task. Each trial began with the auditory presentation of two randomly chosen digits, followed by a fixation period (total of 3 seconds). Participants were instructed to overtly rehearse these digits out loud throughout the subsequent memory task. An experimenter monitored compliance with this task by either remaining in the room or listening remotely via a microphone. After participants made their responses on the memory task, they were prompted to enter the digits they had spoken throughout the trial. The accuracy of this report provided a quantitative measure of how well participants performed the articulatory suppression task.
Participants completed 4 one-hour sessions, each beginning with 6 practice trials that provided post-trial feedback regarding bonus points earned on the working memory task and the accuracy of performance on the digit report task. Each session included 8 blocks of 8 experimental trials each, and upright and inverted faces were presented in alternating blocks, as in Experiment 1. Feedback about the number of bonus points earned was provided at the end of each block. Each participant completed a total of 256 trials, providing 128 trials per face orientation condition.
Results
Performance on the concurrent articulatory suppression task was highly accurate, with all participants entering at least one of the two digits correctly on every trial. Out of the total 768 digits to be rehearsed across all four sessions, participants had a mean accuracy above 99%, suggesting that the secondary task was performed diligently throughout the session and across conditions.
The results of the working memory task were very consistent with the findings in Experiment 1. As in the first experiment, we found that the mixture model fit the data well (see Figure 5A for model fits to a representative participant’s data). We also found that estimates of memory capacity were almost identical for upright and inverted faces (Figure 5B) with no evidence of a reliable statistical difference (paired samples t-test, t(13) = .37, p =.72). In contrast, the precision of working memory (Figure 5C) was significantly better for upright faces (mean SD = 38.8°) than for inverted faces (mean SD = 49.1°), as indicated by a t-test (t(13) = −5.11, p < .001).
Capacity estimates for participants in Experiment 2 were reliably below the full set size of 6, mitigating the possibility that a ceiling effect might have impeded our ability to observe an effect of face inversion on capacity estimates. Nonetheless, we conducted an additional analysis to address whether the lack of a difference in memory capacity might have resulted from a subset of participants that exhibited high capacity estimates. We repeated the original analysis, excluding any participants with mean capacity estimates greater than 5.0 items. With six participants remaining in the analysis, we still observed no effect of face inversion on memory capacity (t(5) = .4331, p =.683).
We conducted an additional analysis to address whether the null effect of memory capacity was likely to reflect a Type II error, that is, a failure to reject the null hypothesis when it is false. Significance tests like the t-test are not designed to control for Type II error rates, so they are not well suited for firmly accepting the null hypothesis. In contrast, Bayesian hypothesis tests, known as Bayes factors, provide an assessment of the evidence both for and against the null hypothesis. Specifically, the Bayes factor equivalent to the t-test (Rouder, Speckman, Sun, Morey, & Iverson, 2009) provides the likelihood that the null hypothesis is true, as well as the likelihood of the alternative hypothesis that there exists a true difference between conditions. We applied this analysis to our data, and obtained a Bayes factor indicating that the hypothesized face inversion effect on precision was 77 times more likely than the null hypothesis of there being no such effect, in agreement with the t-test reported above. However, our analysis of capacity estimates indicated that the null hypothesis of no inversion effect was 4.7 times more likely to be true than the alternative hypothesis of an effect of face inversion on capacity. Bayes factors greater than 3.0 are considered to be “substantial” evidence for a hypothesis (Jeffreys, 1961). Consequently, these results suggest that face inversion significantly affected precision, and was unlikely to have affected capacity. In addition, for the subset of participants who had overall estimated capacities less than 5.0, the null hypothesis of no inversion effect on capacity was 3.2 times more likely than the alternative hypothesis. Taken together, these results suggest that our failure to observe an effect of face inversion on memory capacity is neither attributable to a ceiling effect, nor is it due to a lack of statistical power.
Each participant completed four experimental sessions on separate days, and we investigated whether memory performance changed with practice over the course of these sessions. There was not sufficient data to fit the mixture model to an individual’s data for a single session (32 trials in each condition per session), so we simply considered the root mean squared error (RMSE) between the studied face and the reported face as an overall measure of memory performance. The results are shown in Figure 6A. An ANOVA performed on these error scores revealed a consistent main effect of both face inversion across sessions (F(1,13) = 33.6, p <.001), as well as a main effect of experimental session due to gradual improvement with prolonged training (F(3,39) = 4.34, p = .01). However, we did not find evidence of an interaction between experimental session and face orientation (F(3,39) = .37, p = .78), suggesting that extended practice with these stimuli led to similar improvements in working memory performance for upright and inverted faces.
Recent studies of working memory for visual orientation have found evidence of a recency effect, with superior precision found for items that appear toward the end of the sequence of items (Gorgoraptis et al., 2011). Because our stimuli were presented individually, we could also explore whether memory performance for faces varied depending on the temporal position of the face that was probed for recall. Again, there was insufficient data to fit the mixture model to the data from individual probed locations, so we considered RMSE as an overall measure of memory performance (see Figure 6B). A repeated-measures ANOVA revealed a significant main effect of the temporal position of the target stimulus on overall performance (F(5,65) = 8.60, p < .001), which appeared to reflect the presence of both primacy and recency effects. There was also a main effect of face orientation (F(1,13) = 29.24, p < .001), but a non-significant interaction between temporal position and face orientation (F(5,65) = 2.02, p =.09). Although this interaction effect was marginal, for both upright and inverted faces there appears to be a trend of better memory performance for the most recent items. It should be noted that the temporal position of the probed face was randomized from trial-to-trial, such that the mixture model results reported above are not sensitive to differences in performance across temporal position. However, these analyses do demonstrate the robustness of the face inversion effect on memory performance: even with practice over multiple days or intervening stimuli in a sequential memory array, participants’ memories are consistently impaired for inverted faces.
Discussion
In the present study, we developed a set of continuously varying faces to measure the precision and capacity of visual working memory for complex stimuli. We found that simply turning a set of faces upside-down leads to a significant loss of memory precision, even though face inversion fails to affect the number of items that can be actively maintained in memory. Our results demonstrate that the precision of working memory for complex stimuli is not strictly fixed, but rather, can be modified by learning and visual experience.
Our findings are relevant to current discussions regarding the properties and limits of visual working memory. One prominent view is that complex objects place greater information processing demands on working memory, and thereby lead to a reduced memory capacity (Alvarez & Cavanagh, 2004; Brady, Konkle, & Alvarez, 2011). Given that inverted faces require more intensive part-based processing, and upright faces allow for more efficient holistic processing (Tanaka & Sengco, 1997; McKone & Yovel, 2009), one might expect that face inversion should lead to diminished memory capacity. Instead, we found that stimulus inversion led to a selective impairment in the precision of visual working memory, without impairing capacity. These findings support and clarify the results of previous studies involving change detection with faces (Curby & Gauthier, 2007; Scolari, Vogel, & Awh, 2008). Furthermore, our study raises the possibility that the presumed impact of object complexity on working memory capacity may instead be attributable to differences in visual precision (see also Awh, Barton, & Vogel, 2007). Previous studies relied on change detection paradigms to estimate memory capacity for complex objects, and were unable to separately measure memory precision and capacity. It would be interesting for future studies to develop continuously varying stimuli in other complex object domains, to evaluate the generality of the present findings.
This study is also relevant to the prominent slot-plus-averaging model proposed by Zhang and Luck (2008). Based on their studies of memory for basic features, they proposed that visual working memory consists of a discrete number of slots, each of which can retain information about a single item with a fixed degree of visual precision. The fact that we find comparable capacity estimates for upright and inverted faces is consistent with the predictions of this discrete capacity model. That said, it should be acknowledged that our capacity estimates for face stimuli ranged from 4.5 to 4.9 items across the two experiments, a value that is somewhat higher than those reported in studies of working memory for basic visual features (Zhang & Luck, 2008; Anderson, Vogel, & Awh, 2011; Rademaker, Tredway & Tong, 2012).
Whereas memory capacity is the same for upright and inverted faces, we find that the precision of visual working memory for a complex stimulus is not strictly fixed, and instead depends on one’s level of familiarity and expertise with that item. Recent studies have shown that experimental manipulations of set size, attentional allocation, encoding duration, and temporal position, can all influence working memory precision (Zhang & Luck, 2008; Bays, Gorgoraptis, Wee, Marshall, & Husain, 2011; Gorgoraptis, Catalao, Bays, & Husain, 2011). In addition, there appears to be some natural stochastic variability in the precision of visual working memory from trial to trial (Rademaker, Tredway, & Tong, 2012; Fougnie, Suchow & Alvarez, 2012). However, much less is known about the extent to which perceptual training might modify the precision of visual representations that are called upon during working memory tasks. Studies have demonstrated improvements in working memory performance for an object class following visual training (e.g., Moore, Cohen & Ranganath, 2006; Curby, Glazek, & Gauthier, 2009), but these change detection studies could not distinguish between effects of memory capacity and precision. In the present study, we observed a consistent advantage in visual precision, even at set size 1 when the capacity of working memory was far from being fully taxed. Such a result cannot be readily explained in terms of a modified capacity limit for the trained object category, but rather, appears to reflect an improvement in the ability to encode and maintain a precise representation of the trained stimulus.
These findings lead us to propose that acquired expertise for a stimulus class, such as upright faces, leads to the development of more precise perceptual representations that can be called upon to support visual working memory. The role of early perceptual areas in visual working memory has been demonstrated in recent fMRI decoding studies (e.g., Harrison & Tong, 2009; Serences, Ester, Vogel, & Awh, 2009; Albers, Kok, Toni, Dijkerman, & de Lange, 2013; Emrich, Riggall, LaRocque, & Postle, 2013), suggesting that perception and working memory rely on common visual representations. Given that perceptual learning is known to modify the representations of both basic features (Sasaki, Nanez & Watanabe, 2010) and complex objects (Palmeri, Wong, & Gauthier, 2004), we predict that extensive perceptual training with other types of stimuli should also lead to improvements in the precision of visual working memory.
Acknowledgments
This research was supported by the National Science Foundation grants BCS-1228526 and BCS-0642633 to F.T., and by National Institutes of Health Fellowship F32-EY-022569 to M.P. This work was also facilitated by support from NEI center grant P30-EY008126 to the Vanderbilt Vision Research Center (Director, Dr. Schall).
References
- Albers AM, Kok P, Toni I, Dijkerman HC, de Lange FP. Shared Representations for Working Memory and Mental Imagery in Early Visual Cortex. Current Biology. 2013;23(15):1427–1431. doi: 10.1016/j.cub.2013.05.065. [DOI] [PubMed] [Google Scholar]
- Alvarez GA, Cavanagh P. The capacity of visual short-term memory is set both by visual information load and by number of objects. Psychological Science. 2004;15(2):106–111. doi: 10.1111/j.0963-7214.2004.01502006.x. [DOI] [PubMed] [Google Scholar]
- Anderson DE, Vogel EK, Awh E. Precision in visual working memory reaches a stable plateau when individual item limits are exceeded. The Journal of Neuroscience. 2011;31(3):1128–1138. doi: 10.1523/JNEUROSCI.4125-10.2011. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- Awh E, Barton B, Vogel EK. Visual working memory represents a fixed number of items regardless of complexity. Psychological Science. 2007;18(7):622–628. doi: 10.1111/j.1467-9280.2007.01949.x. [DOI] [PubMed] [Google Scholar]
- Baddeley AD, Hitch GJ. Working memory. The psychology of learning and motivation. 1975;8(47–89) [Google Scholar]
- Bays PM, Husain M. Dynamic shifts of limited working memory resources in human vision. Science. 2008;321(5890):851–854. doi: 10.1126/science.1158023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bays PM, Gorgoraptis N, Wee N, Marshall L, Husain M. Temporal dynamics of encoding, storage, and reallocation of visual working memory. Journal of Vision. 2011;11(10):6, 1–15. doi: 10.1167/11.10.6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brady TF, Konkle T, Alvarez GA. A review of visual memory capacity: Beyond individual items and toward structured representations. Journal of Vision. 2011;11(5):4, 1–4. doi: 10.1167/11.5.4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brainard DH. The psychophysics toolbox. Spatial Vision. 1997;10(4):433–436. [PubMed] [Google Scholar]
- Chase WG, Simon HA. Perception in chess. Cognitive Psychology. 1973;4(1):55–81. [Google Scholar]
- Curby KM, Gauthier I. A visual short-term memory advantage for faces. Psychonomic Bulletin & Review. 2007;14(4):620–628. doi: 10.3758/bf03196811. [DOI] [PubMed] [Google Scholar]
- Curby KM, Glazek K, Gauthier I. A visual short-term memory advantage for objects of expertise. Journal of Experimental Psychology: Human Perception and Performance. 2009;35(1):94–107. doi: 10.1037/0096-1523.35.1.94. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Diamond R, Carey S. Why faces are and are not special: an effect of expertise. Journal of Experimental Psychology: General. 1986;115(2):107–117. doi: 10.1037//0096-3445.115.2.107. [DOI] [PubMed] [Google Scholar]
- Emrich SM, Riggall AC, LaRocque JJ, Postle BR. Distributed patterns of activity in sensory cortex reflect the precision of multiple items maintained in visual short-term memory. The Journal of Neuroscience. 2013;33(15):6516–6523. doi: 10.1523/JNEUROSCI.5732-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fougnie D, Suchow JW, Alvarez GA. Variability in the quality of visual working memory. Nature Communications. 2012;3:1229. doi: 10.1038/ncomms2237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fukuda K, Vogel E, Mayr U, Awh E. Quantity, not quality: The relationship between fluid intelligence and working memory capacity. Psychonomic Bulletin & Review. 2010;17(5):673–679. doi: 10.3758/17.5.673. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gauthier I, Tarr MJ. Becoming a “Greeble” expert: Exploring mechanisms for face recognition. Vision research. 1997;37(12):1673–1682. doi: 10.1016/s0042-6989(96)00286-6. [DOI] [PubMed] [Google Scholar]
- Gauthier I, Tarr MJ. Unraveling mechanisms for expert object recognition: bridging brain activity and behavior. Journal of Experimental Psychology: Human Perception and Performance. 2002;28(2):431–446. doi: 10.1037//0096-1523.28.2.431. [DOI] [PubMed] [Google Scholar]
- Gorgoraptis N, Catalao RF, Bays PM, Husain M. Dynamic updating of working memory resources for visual objects. The Journal of Neuroscience. 2011;31(23):8502–8511. doi: 10.1523/JNEUROSCI.0208-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harrison SA, Tong F. Decoding reveals the contents of visual working memory in early visual areas. Nature. 2009;458(7238):632–635. doi: 10.1038/nature07832. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hayhoe MM, Shrivastava A, Mruczek R, Pelz JB. Visual memory and motor planning in a natural task. Journal of Vision. 2003;3(1):49–63. doi: 10.1167/3.1.6. [DOI] [PubMed] [Google Scholar]
- Hollingworth A. Constructing visual representations of natural scenes: the roles of short-and long-term visual memory. Journal of Experimental Psychology: Human Perception and Performance. 2004;30(3):519–537. doi: 10.1037/0096-1523.30.3.519. [DOI] [PubMed] [Google Scholar]
- Jeffreys H. The theory of probability. Oxford University Press; 1998. [Google Scholar]
- Luck SJ, Vogel EK. The capacity of visual working memory for features and conjunctions. Nature. 1997;390(6657):279–281. doi: 10.1038/36846. [DOI] [PubMed] [Google Scholar]
- Luck SJ, Vogel EK. Visual working memory capacity: from psychophysics and neurobiology to individual differences. Trends in Cognitive Sciences. 2013;17(8):391–400. doi: 10.1016/j.tics.2013.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McKone E, Kanwisher N, Duchaine BC. Can generic expertise explain special processing for faces? Trends in Cognitive Sciences. 2007;11(1):8–15. doi: 10.1016/j.tics.2006.11.002. [DOI] [PubMed] [Google Scholar]
- McKone E, Yovel G. Why does picture-plane inversion sometimes dissociate perception of features and spacing in faces, and sometimes not? Toward a new theory of holistic processing. Psychonomic Bulletin & Review. 2009;16(5):778–797. doi: 10.3758/PBR.16.5.778. [DOI] [PubMed] [Google Scholar]
- Miller GA. The magical number seven, plus or minus two: some limits on our capacity for processing information. Psychological Review. 1956;63(2):81–97. [PubMed] [Google Scholar]
- Moore CD, Cohen MX, Ranganath C. Neural mechanisms of expert skills in visual working memory. The Journal of Neuroscience. 2006;26(43):11187–11196. doi: 10.1523/JNEUROSCI.1873-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Murray DJ. The role of speech responses in short-term memory. Canadian Journal of Psychology/Revue Canadienne de Ppsychologie. 1967;21(3):263–276. doi: 10.1037/h0082978. [DOI] [PubMed] [Google Scholar]
- O’Toole AJ, Vetter T, Troje NF, Bülthoff HH. Sex classification is better with three-dimensional head structure than with image intensity information. Perception. 1997;26:75–84. doi: 10.1068/p260075. [DOI] [PubMed] [Google Scholar]
- Palmeri TJ, Wong AC, Gauthier I. Computational approaches to the development of perceptual expertise. Trends in Cognitive Sciences. 2004;8(8):378–386. doi: 10.1016/j.tics.2004.06.001. [DOI] [PubMed] [Google Scholar]
- Pelli DG. The VideoToolbox software for visual psychophysics: Transforming numbers into movies. Spatial Vision. 1997;10:437–442. [PubMed] [Google Scholar]
- Phillips WA. On the distinction between sensory storage and short-term visual memory. Perception & Psychophysics. 1974;16(2):283–290. [Google Scholar]
- Rademaker RL, Tredway CH, Tong F. Introspective judgments predict the precision and likelihood of successful maintenance of visual working memory. Journal of Vision. 2012;12(13):1–13. doi: 10.1167/12.13.21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rouder JN, Morey RD, Cowan N, Zwilling CE, Morey CC, Pratte MS. An assessment of fixed-capacity models of visual working memory. Proceedings of the National Academy of Sciences. 2008;105(16):5975–5979. doi: 10.1073/pnas.0711295105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rouder JN, Speckman PL, Sun D, Morey RD, Iverson G. Bayesian t tests for accepting and rejecting the null hypothesis. Psychonomic bulletin & Review. 2009;16(2):225–237. doi: 10.3758/PBR.16.2.225. [DOI] [PubMed] [Google Scholar]
- Sasaki Y, Nanez JE, Watanabe T. Advances in visual perceptual learning and plasticity. Nature Reviews Neuroscience. 2009;11(1):53–60. doi: 10.1038/nrn2737. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scolari M, Vogel EK, Awh E. Perceptual expertise enhances the resolution but not the number of representations in working memory. Psychonomic Bulletin & Review. 2008;15(1):215–222. doi: 10.3758/pbr.15.1.215. [DOI] [PubMed] [Google Scholar]
- Serences JT, Ester EF, Vogel EK, Awh E. Stimulus-specific delay activity in human primary visual cortex. Psychological Science. 2009;20(2):207–214. doi: 10.1111/j.1467-9280.2009.02276.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tanaka JW, Sengco JA. Features and their configuration in face recognition. Memory & Cognition. 1997;25(5):583–592. doi: 10.3758/bf03211301. [DOI] [PubMed] [Google Scholar]
- Tong F, Nakayama K. Robust representations for faces: evidence from visual search. Journal of Experimental Psychology: Human Perception and Performance. 1999;25(4):1016–1035. doi: 10.1037//0096-1523.25.4.1016. [DOI] [PubMed] [Google Scholar]
- Valentine T. Upside-down faces: A review of the effect of inversion upon face recognition. British journal of psychology. 1988;79(4):471–491. doi: 10.1111/j.2044-8295.1988.tb02747.x. [DOI] [PubMed] [Google Scholar]
- van den Berg R, Shin H, Chou WC, George R, Ma WJ. Variability in encoding precision accounts for visual short-term memory limitations. Proceedings of the National Academy of Sciences. 2012;109(22):8780–8785. doi: 10.1073/pnas.1117465109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vogel EK, Woodman GF, Luck SJ. Storage of features, conjunctions, and objects in visual working memory. Journal of Experimental Psychology: Human Perception and Performance. 2001;27(1):92–114. doi: 10.1037//0096-1523.27.1.92. [DOI] [PubMed] [Google Scholar]
- Webster MA, Kaping D, Mizokami Y, Duhamel P. Adaptation to natural facial categories. Nature. 2004;428(6982):557–561. doi: 10.1038/nature02420. [DOI] [PubMed] [Google Scholar]
- Wheeler ME, Treisman AM. Binding in short-term visual memory. Journal of Experimental Psychology: General. 2002;131(1):48–64. doi: 10.1037//0096-3445.131.1.48. [DOI] [PubMed] [Google Scholar]
- Wilken P, Ma WJ. A detection theory account of change detection. Journal of Vision. 2004;4(12):1120–1135. doi: 10.1167/4.12.11. [DOI] [PubMed] [Google Scholar]
- Wong JH, Peterson MS, Thompson JC. Visual working memory capacity for objects from different categories: A face-specific maintenance effect. Cognition. 2008;108(3):719–731. doi: 10.1016/j.cognition.2008.06.006. [DOI] [PubMed] [Google Scholar]
- Xu Y. Limitations of object-based feature encoding in visual short-term memory. Journal of Experimental Psychology: Human Perception and Performance. 2002;28(2):458. doi: 10.1037//0096-1523.28.2.458. [DOI] [PubMed] [Google Scholar]
- Yin RK. Looking at upside-down faces. Journal of Experimental Psychology. 1969;81(1):141–145. [Google Scholar]
- Zhang W, Luck SJ. Discrete fixed-resolution representations in visual working memory. Nature. 2008;453(7192):233–235. doi: 10.1038/nature06860. [DOI] [PMC free article] [PubMed] [Google Scholar]