Curr Biol. 2021 Feb 22;31(4):869–874.e5. doi: 10.1016/j.cub.2020.11.013

When Natural Behavior Engages Working Memory

Dejan Draschkow 1,2,4,, Melvin Kallmayer 3, Anna C Nobre 1,2
PMCID: PMC7902904  PMID: 33278355

Summary

Working memory (WM) enables temporary storage and manipulation of information,1 supporting tasks that require bridging between perception and subsequent behavior. Its properties, such as its capacity, have been thoroughly investigated in highly controlled laboratory tasks.1, 2, 3, 4, 5, 6, 7, 8 Much less is known about the utilization and properties of WM in natural behavior,9, 10, 11 when reliance on WM emerges as a natural consequence of interactions with the environment. We measured the trade-off between reliance on WM and gathering information externally during immersive behavior in an adapted object-copying task.12 By manipulating the locomotive demands required for task completion, we could investigate whether and how WM utilization changed as gathering information from the environment became more effortful. Reliance on WM was lower than WM capacity measures in typical laboratory tasks. A clear trade-off also occurred. As sampling information from the environment required increasing locomotion and time investment, participants relied more on their WM representations. This reliance on WM increased in a shallow and linear fashion and was associated with longer encoding durations. Participants’ avoidance of WM usage showcases a fundamental dependence on external information during ecological behavior, even if the potentially storable information is well within the capacity of the cognitive system. These foundational findings highlight the importance of using immersive tasks to understand how cognitive processes unfold within natural behavior. Our novel VR approach effectively combines the ecological validity, experimental rigor, and sensitive measures required to investigate the interplay between memory and perception in immersive behavior.

Video Abstract

Download video file (20.7MB, mp4)

Highlights

  • Gaze provides a measure of working-memory (WM) usage during natural behavior

  • Natural reliance on WM is low even when searching for objects externally is effortful

  • WM utilization increases linearly as searching for objects requires more locomotion

  • The trade-off between using WM versus external sampling affects performance


Draschkow et al. demonstrate that reliance on working memory is low during natural behavior, even when “holding information in mind to guide future behavior” is the very essence of the task. As searching for objects externally increases in locomotive effort, WM usage increases in a shallow and linear fashion. This trade-off affects performance.

Results and Discussion

In our temporally extended object-copying task (Figure 1A), participants (n = 24) copied a model display by selecting realistic objects from a resource pool and placing them into a workspace (Video S1). The immersive nature of virtual reality enabled us to disentangle different sub-parts of the completed task (Figure 1B). Critically, we varied the model’s location between conditions (0°, 45°, 90°, or 135° in relation to the workspace), thus manipulating the “locomotive effort” required between encoding objects in the model and placing the objects in the workspace (Figures 1A and S1)—a composite variable that combines the effort and time to complete the task. Other than the varying locomotive demands, the task structure remained the same across conditions.

Figure 1. Object-Copying Task in Virtual Reality

(A) Twenty-four participants copied 8 “model” arrangements of objects, here shown at 45°, by selecting and picking up objects from the “resource” section and placing them into a “workspace” area (see Video S1). The model’s location varied between conditions (1 run = 14 displays), either 0°, 45°, 90°, or 135° from the workspace.

(B) Participants could only pick up and carry one object at a time, which imposed a sequential order on the task (for further details, see STAR Methods and Figure S1). Participants would (1) look at the model area in order to encode the to-be-copied object(s), (2) move to the resources in order to (3) search for and select the object, (4) pick it up, and finally (5) place it in the corresponding location in the workspace.

See also Figure S1 and Video S1.

Video S1. Temporally Extended Object-Copying Task in Virtual Reality, Related to Figure 1 and STAR Methods
Download video file (71.7MB, mp4)

Gaze Provides a Measure of Working-Memory Usage during Natural Behavior

Placing each object correctly required two features (i.e., types of information) in memory: its identity (1 feature) and location (1 feature). Gaze measures provided an implicit proxy of working memory (WM) utilization (see Figure 2A and Video S2). For example, participants could look at the model before placing the object, indicating they retained only the identity of the object, but not its location (1 feature). In another example, participants could place the object and then select and place an additional object from the resource without having looked back at the model, indicating they retained the identities and locations of two objects (4 features).

Figure 2. Utilization of WM Representations

(A) Measuring gaze in virtual reality enabled us to develop an implicit metric of working-memory utilization (see Video S2). Because participants needed to sample identity and location information of the to-be-copied object from the model, we could count the number of WM features used in the task between re-fixations of the model. For example, if participants fixated the model before placing the object, they used only 1 feature, that is, the identity of that object. If they fixated the model after placing the object, we counted 2 features used (both identity and location information were utilized).

(B) Probability of using an increasing number of WM features as a function of the experimental condition (i.e., locomotive effort)—see Figure S2A for results of more than 4 features in WM. Error bars depict standard error of the mean.

(C) The shape of the relationship between amount of locomotion and number of WM features utilized was linear. For a data-driven visualization of the relationship, the line in the plot was fit using a nonparametric LOESS smoothing function (thick gray line), with shaded areas representing the 95% confidence intervals. Solid dots indicate group averages, and transparent lines and dots depict the averages of the twenty-four participants.

See also Figure S2 and Video S2.

Video S2. Gaze Provides a Measure of Working-Memory Usage during Natural Behavior, Related to Figure 2 and STAR Methods
Download video file (9.7MB, mp4)

Surprisingly, in the lowest locomotion condition (0°), participants relied on only one feature in memory at a time in the majority (60%) of cases. They correctly picked up an object from the resource pool (maintaining identity) but looked again at the model before placing it (sampling the location information). This result demonstrates that WM usage is lower than WM capacity estimates (∼4 items) in typical laboratory tasks.13, 14, 15

Natural Reliance on WM Is Low Even When Searching for Objects Externally Is Effortful

The reliance on WM was surprisingly low, even in the most effortful case (135°). Most often, participants used two features in memory (close to 50%). Compared to the least effortful (0°) condition, the probability of using one feature dropped from 60% to less than 30% of the cases (Figure 2B; for details, see the Quantification and Statistical Analysis). Using four features also increased from ∼1% in the 0° condition to 10% in the 135° condition. Nevertheless, WM was far from loaded to the capacity derived in laboratory tasks; instead, the results demonstrated a fundamental dependence on external information during ecological behavior.

WM Utilization Increases Linearly as Searching for Objects Requires More Locomotion

In order to describe the shape of the relationship between locomotive effort and WM utilization, we computed the average number of WM features used as a function of the different locomotion conditions (Figure 2C). Generalized linear mixed-model comparisons revealed a clear linear relationship between locomotion and the number of features used, suggesting a shift from external sampling to relying on WM (for details, see Quantification and Statistical Analysis). On average, this was a shallow change, with memory load increasing from approximately one feature in the 0° condition to two features in the 135° movement trials (Figure 2C).

The Trade-Off between Using WM versus External Sampling Affects Performance

In addition to costs related to maintaining features in WM, the time it takes to encode objects is another important factor in determining the optimal trade-off between using WM and sampling information from the environment. We found that the time participants spent viewing the model during the selection of the to-be-copied objects (Figure 3A) increased systematically as locomotive effort increased (comparing the lines between each of the facets of the figure), likely reflecting the increase in time and distance this information needed to be “carried” (for details, see Quantification and Statistical Analysis). Critically, within each locomotion condition, dwell times were longer for trials in which more WM features were used (positive linear slopes for all conditions). This relationship was quadratic (except for the line in the 0° condition [first facet], where it was linear), showing a steep increase in dwell times as participants moved from 3 to 4 features in memory (Figure 3A). The relationship between encoding duration and WM features utilized (see also Figure S2B) suggests participants loaded up more information,16 instead of simply using more. This pattern highlights that, although relying on WM representations may spare locomotive effort, loading WM may bring additional costs in terms of invested time.

Figure 3. Trade-Offs between Sampling Information from WM versus from the Environment

(A) Average viewing time during the initial viewing of the to-be-copied objects in the model area, as a proxy for encoding duration. Viewing times were longer for trials in which more WM features were used (positive slopes for all conditions). Conversely, viewing times also predicted the number of features used (see Figure S2B). Error bars represent 95% confidence intervals.

(B) Display completion times (each point is one display completed by one participant) as a function of the average number of features used in the given display, reflecting the performance outcome of trading off using WM and external information. Greater reliance on WM reduced completion times (significant negative slopes for all conditions), though the quadratic relationship demonstrates that, with higher numbers of features in memory, performance plateaued in this task (for details, see Quantification and Statistical Analysis). For a data-driven visualization of the relationship, the lines in the plot were fit using a nonparametric LOESS smoothing function with shaded areas representing the 95% confidence intervals.

The dots in (A) represent group averages, whereas the dots in (B) depict the data of individual displays. See also Figure S2.

To provide a fuller measure of performance outcome resulting from trading off using WM versus external information, we calculated the time participants required to complete each display—that is, the summed duration required to copy the 8 objects from the model area into the workspace (Figure 3B). Erroneous placements had to be corrected immediately, and fewer than 3% of copying sequences included errors (Figure S3; for details, see Quantification and Statistical Analysis). Therefore, by design, participants’ performance was evaluated by the speed of their copying behavior. For each display, we also calculated the average number of features in WM used. Figure 3B depicts this relationship as a function of the different locomotion conditions. Unsurprisingly, participants were slower to complete displays with increasing locomotion demands. However, greater reliance on WM reduced completion times (significant negative slopes for all movement conditions), though a significant quadratic relationship also revealed diminishing returns for increasing the number of features in memory, particularly when using more than ∼2 features. The observed pattern of regularly sampling information from the environment (i.e., using one feature) was therefore not the most efficient strategy, and using more features in WM increased task performance. On the other hand, encoding more features also becomes taxing in terms of encoding duration (Figure 3A), leading to diminishing returns for overall performance.

Why Is Reliance on WM Low in Natural Behavior?

Laboratory tasks have yielded many views about capacity limits in WM relating to whether resources are continuous6,17,18 or discrete7,14,19 and whether capacity is fixed at all.20,21 These views are moderated by the questions of whether relevant units are integrated objects or features7,22,23 and whether spatial position is necessarily encoded along with specific features.24, 25, 26 By any of these measures, the estimates of capacity in lab tasks exceed the estimates we have obtained for the average WM load in the natural task presented here. We show that, in natural, immersive behavior, WM is costly,27,28 and its usage emerges in a shallow, linear fashion as locomotive effort increases.

Our findings can be reconciled with typical lab estimates in several ways. In classical WM research, objects are briefly “flashed” on a computer screen, whereas in natural behavior, we can usually decide between looking back at the objects of interest and retaining them in memory. Such sensory-motor decisions have been investigated within the framework of statistical decision theory,29, 30, 31 which emphasizes the importance of considering costs and benefits—mediated by the underlying neural reward circuitry32—in the choice of actions. Here, memory use is weighted against locomotive effort, and depending on the reliability of the representation, the actor would rely on the information in mind or update it.33 Looking back can be rather “cheap,” if only a few saccades are required,34, 35, 36 but becomes more costly in an ecological context as we have to move the head, arms, and body.11,37,38 Further, constrained screen-based tasks often require participants to remain still and hold gaze on a single spot. However, eye movements can disrupt visuospatial WM representations.39 Self-movement as well as the related computation of changing object coordinates can also reduce capacity estimates of WM40 and sustained attention.41 Additionally, programming movements in the environment might require the maintenance of relevant locations in the experimental environment, which would further reduce the WM capacity available for objects. Finally, although the overall capacity of WM is likely higher than suggested here, only some of these objects might be prioritized within WM,42, 43, 44 and information out of internal focus may be susceptible to interference when interacting with distractor objects in the resource area. In sum, resource allocation in natural tasks can be vastly different from utilizing resources in laboratory settings, highlighting the need to understand how cognitive processes unfold within natural behavior.

Using versus Not-Using Memories from Different Timescales

Although classical laboratory tasks measure the upper bound of what is possible (i.e., benchmarking), here, we aim to describe how the usage of this cognitive resource emerges during natural behavior as a function of locomotive effort—providing a measure of how WM capacity is used12 rather than its maximum potential. Using versus not-using memory has been investigated more thoroughly with respect to long-term memory (LTM) guidance of visual search.45, 46, 47, 48, 49, 50, 51, 52, 53 This literature provides evidence that, when searching for a target requires only a few fixations, LTM use can be low, whereas effortful (requiring more time and distance) searches in immersive environments recruit a more substantial usage of long-term representations.

Real-world cognition, however, is not restricted to perception, WM, or LTM operating in isolation. Instead, the content from these different timescales is integrated to serve adaptive and purposeful behavior.54 Understanding how information collected over these different timescales works together or competes to guide successful adaptive behavior is an exciting prospect and remains to be addressed in future research. The task we present here offers the opportunity to address these questions while taking into account environmental constraints and energetic costs, thus recognizing—instead of ignoring—the functional and ecological aspects of cognition.33,55

When Natural Behavior Engages WM

Our novel VR task provides a useful naturalistic setting in which the observer is required to (1) actively maintain visuospatial representations in WM, (2) protect them from the interference of visual translations (rotating through the environment) and interference from similar objects (distractor objects in the resource area), and (3) manipulate the contents in memory in order to update the computation of changing object coordinates. We further use (4) realistic novel objects,56 rather than intrinsically confusable and hard-to-remember colored blocks,12,57 in order to adhere more closely to naturalistic constraints. These points constitute a combination of the challenges that have been a hallmark of active WM usage and enable an ecological investigation of the WM properties necessary for bridging between perception and subsequent behavior.

Our novel VR approach provides an ideal starting point for investigations into the factors and mechanisms that support memory usage in naturalistic settings. For example, manipulating the stimuli and their arrangements will inform the role of intrinsic memorability58, 59, 60 when using memory. Further, increasing the granularity of the eye-movement recordings and changing the task relevance of location and identity object features7,22, 23, 24,26 will show whether using some features is costlier than others. Finally, directly comparing WM usage estimates from our task with capacity and precision estimates from standard laboratory tasks will elucidate the relationship between the upper bounds of capacity,5 controlling access to WM,61 and the amount of memory actually used. Such investigations would inform how memory use is related to individual differences in cognitive capacities in a broader sense, specifically how natural memory use relates to IQ and academic attainment62 as well as mental workload63 in more applied settings.

Conclusions

Our estimates show remarkably low levels of WM utilization in immersive behavior, even when “holding information in mind to guide future behavior” is the very essence of the task. We found reliance on ∼1 feature in WM when locomotive demands were at their lowest, which increased to an average of ∼2 features in memory at the highest locomotive demands we tested. Encoding more features bore a cost, as suggested by viewing times of the information to be maintained. Although this cost made the individual copying sequences last longer, it also enabled more information to be copied in each sequence, which reduced the overall completion times of the to-be-copied displays. This increase in efficiency, however, plateaued at ∼2 to 3 features in memory, demonstrating the importance of balancing the reliance on WM with gathering information externally during immersive behavior. Our novel VR approach effectively combines the ecological validity, experimental rigor, and sensitive measures required to investigate the interplay between memory and perception in natural behavior and opens the doors to many interesting future investigations.

STAR★Methods

Key Resources Table

REAGENT or RESOURCE SOURCE IDENTIFIER
Deposited Data

Post-processed data and R code https://osf.io/sbzt6/ https://doi.org/10.17605/OSF.IO/SBZT6

Experimental Models: Organisms/Strains

Twenty-four healthy human volunteers participated in the study (mean age = 26.2, range = 18-36, 17 female, all right-handed) N/A N/A

Software and Algorithms

lmer(), glmer() 64 https://doi.org/10.18637/jss.v067.i01
Wrapper for lmer(), glmer() 65 https://dx.doi.org/10.18637/jss.v082.i13
R 66 http://www.r-project.org
RStudio 67 https://rstudio.com/
ggplot2() 68 https://ggplot2.tidyverse.org/
Box-Cox Transformation 69 https://www.jstor.org/stable/2984418
pairedSamplesTTest() 70 https://learningstatisticswithr.com/book/

Other

HTC Vive Tobii Pro VR Tobii https://www.tobiipro.com/de/produkte/vr-integration/
The Novel Object and Unusual Name (NOUN) Database 56 https://doi.org/10.3758/s13428-015-0647-3

Resource Availability

Lead Contact

Further information and requests for resources should be directed to and will be fulfilled by the Lead Contact, Dejan Draschkow (dejan.draschkow@psych.ox.ac.uk).

Materials Availability

No materials are available for this study.

Data and Code Availability

The data and code generated during this study are available at Open Science Framework: https://osf.io/sbzt6/

Experimental Model and Subject Details

Twenty-four healthy human volunteers participated in the study (mean age = 26.2, range = 18-36, 17 female, all right-handed). All participants had normal or corrected-to-normal (6 participants used lenses) vision and reported no history of neurological or psychiatric disorders. Participants received financial compensation (£10/h) and provided informed consent prior to participating in the experiment. Protocols were approved by the local ethics committee (Central University Research Ethics Committee #R64089/RE001).

Method Details

Apparatus and Virtual Environment

Participants wore an HTC Vive Tobii Pro VR integration with a built-in binocular eye tracker with an accuracy of approximately 0.5° visual angle. We tracked gaze position in 3D space at a sampling rate of 90 Hz. Gaze position in 3D space was obtained by intersecting the gaze vector with objects in the environment. The head-mounted display (HMD) consisted of two 1080 × 1200 pixel resolution OLED screens (refresh rate = 90 Hz, field-of-view = 100° horizontally × 110° vertically). Locations of the headset and the hand-held controller were tracked with sub-millimeter precision using two Lighthouse base stations that emitted infrared pulses, which were detected by 37 infrared sensors in the HMD and 24 in the controller. Tracking was optimized by an accelerometer and a gyroscope embedded in the HMD. A trigger button (operated with the index finger) and a grip button (operated with the thumb) on the wireless controller were used for interacting with the experimental program. By intersecting the controller with a virtual object and holding down the trigger button, the participants could pick up objects – releasing the trigger button released the object from the virtual grip.
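To illustrate the gaze-intersection step described above, the sketch below recovers a 3D gaze point by intersecting the gaze ray with a planar surface. This is a hypothetical illustration in R (the actual computation was performed by the experiment software in Unity); all names and values are assumptions.

intersect_gaze_with_plane <- function(origin, direction, plane_point, plane_normal) {
  # origin: 3D eye position; direction: unit gaze vector (both reported by the eye tracker)
  # plane_point / plane_normal: a point on the surface and its normal vector
  denom <- sum(direction * plane_normal)
  if (abs(denom) < 1e-6) return(NULL)   # gaze ray is parallel to the surface
  t <- sum((plane_point - origin) * plane_normal) / denom
  if (t < 0) return(NULL)               # surface lies behind the observer
  origin + t * direction                # 3D gaze point on the surface
}

# Example: observer at the room center looking straight ahead at a board 2 m away
intersect_gaze_with_plane(origin = c(0, 1.6, 0), direction = c(0, 0, 1),
                          plane_point = c(0, 1.0, 2), plane_normal = c(0, 0, -1))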

The virtual environment was presented and rendered with Unity on a high-performance PC running Windows 10 (Figures 1A and S1). The environment consisted of a 450 × 450 cm room with a ceiling height of 240 cm. Participants were situated in the center of the room, with the Model (120 × 60 cm), Workspace (100 × 50 cm) and Resources (120 × 75 cm) surrounding them.

The objects participants handled were cubes, with each face spanning 10 × 10 cm. The placeholders in the Model area and each face of the cubes in the Resource area were overlaid with images from the Novel Object and Unusual Name Database.56 These stimuli have the advantage of being naturalistic while remaining unfamiliar and difficult to verbalize. For each display, 8 objects were randomly selected from a stimulus pool of 60 for the Model area, and 16 additional objects were drawn for the Resources. The location assignment of all objects was pseudo-randomized, so that a specific display arrangement never repeated across runs for a participant.
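A minimal R sketch of this per-display selection (pool size and counts taken from the text above; that the Resources also contained the 8 Model objects is an assumption that follows from participants having to find those objects there):

pool <- 1:60                                                # stimulus pool of 60 novel objects
model_objects <- sample(pool, 8)                            # 8 objects randomly selected for the Model
distractors   <- sample(setdiff(pool, model_objects), 16)   # 16 additional objects for the Resources
resource_objects <- sample(c(model_objects, distractors))   # shuffled location assignment in the Resources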

Procedure and tasks

Upon arrival, participants were informed that they would perform an object-copying task in which they would have to: (1) find the objects depicted in the Model within the Resources; (2) pick up these objects and move them into the Workspace to copy the arrangement in the Model; and (3) complete each display as quickly as possible (with a timeout of 45 s per display).

The Model contained the configuration of objects to be copied; the Resource contained the objects to be used; and the Workspace was the area in which the copied arrangement was assembled (Figures 1A and S1; Video S1). Participants could only pick up and carry one object at a time with their controller, which imposed a sequential order on the task (Figure 1B). The picked-up object needed to be placed in the appropriate location of the Workspace. Once the participant successfully placed the object, the location was highlighted with green contours (Video S1). Red contours would signal to the participant if the wrong location was chosen (Figure S3). The objects in the Resource would be rendered invisible as long as an object was placed incorrectly in the Workspace, thus making it impossible for the participant to continue until the object was either placed correctly or removed from the Workspace (e.g., if an incorrect object had been moved to begin with). By design, participants’ performance was evaluated by the speed of their copying behavior, because erroneous placements had to be corrected immediately and fewer than 3% of copying sequences contained errors (Figure S3). On average, 1.8% of the sequences contained an identity error (picking up an object which was not contained in the Model) and 2.9% contained a location error (placing an object on the incorrect field). If objects fell on the ground during copying, they automatically re-spawned in the Resource.

Critically, we varied the Model’s location between runs (0°, 45°, 90°, or 135° in relation to the Workspace), thus manipulating the “locomotive effort” required between encoding objects in the Model and placing the objects in the Workspace (Figures 1A and S1) – a composite variable that combines the effort and time to complete the task. Other than the varying locomotive demands, the task structure remained the same across conditions.

A short practice session familiarized participants with the HMD, the wireless controller, the testing procedure, and the lab space. The practice session was identical to the actual task and consisted of copying all objects in 3 displays. Practice continued until all open questions were resolved.

Participants completed 8 runs of trials split between two sessions. Within each run they reproduced 14 displays, each display containing 8 to-be-copied objects. The experimental manipulation (0°, 45°, 90°, or 135°) was varied run-wise, so that every run consisted of trials from a single condition. A mandatory break (approximately 5 minutes) was administered after completing the first four runs (session 1). During the break, participants removed the headset and could rest. In the second session, participants completed four more runs. Thus, each participant completed 28 displays (copied 224 objects) per condition (0°, 45°, 90°, and 135°). The order of the conditions was randomized across sessions and participants. The full experiment lasted approximately 90 minutes.

Quantification and Statistical Analysis

Data recording and pre-processing

Data from 24 displays across participants were removed due to missing values. All remaining data from 2664 displays were included in the analysis.

Frame-by-frame data were written to a csv file during recording. For the purpose of the data analysis, we segmented our measures of interest into sequences. A sequence always started with a gaze sample on the Model and ended with a gaze sample on the Model. Critically, to qualify as a sequence, participants must have looked at either the Model or the Resources between the two views of the Model. For example, a sequence could consist of an initial gaze to the Model, followed by a gaze to the Resource, and finally conclude with another gaze back to the Model. This final gaze sample concluded the sequence and initiated the next one.
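The following R sketch illustrates this segmentation logic. It is not the deposited analysis code; the data frame and its gaze_area column are assumptions.

# frames$gaze_area: area hit by gaze on each recorded frame ("Model", "Resource", "Workspace", or NA)
segment_sequences <- function(frames) {
  looks <- rle(as.character(frames$gaze_area))$values  # collapse consecutive frames into discrete looks
  looks <- looks[!is.na(looks)]
  model_idx <- which(looks == "Model")
  seqs <- list()
  for (i in seq_along(model_idx)[-1]) {
    from <- model_idx[i - 1]; to <- model_idx[i]
    if (to - from > 1) {                 # another area was viewed between the two Model views
      seqs[[length(seqs) + 1]] <- looks[from:to]
    }
  }
  seqs
}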

Number of features in memory

We quantified the number of features in memory during each sequence, according to the actions performed (Figure 2A; Video S2). Specifically, we considered how many object features (identity, location) were acted upon before observers’ gaze returned to the Model for additional encoding. If, for example, the only action performed during a sequence was picking up an object from the Resource, this was counted as a 1-feature sequence; that is, the participant looked at the Model before placing the object. In a 2-feature sequence, for example, an observer not only picked up the object but also placed it in the Workspace before their gaze returned to the Model area. If a second object was picked up before looking back to the Model, this counted as 3 features in memory used, and if the second object was also placed, this was categorized as a 4-feature sequence, and so on. For each display, participants had to place 8 objects. The overall number of features per display therefore added up to 16, since each object contributed 2 features: 1 identity + 1 location feature (Figure S2A).

As participants’ behavior in this unconstrained task is not perfectly characterized by this categorization, we also counted cases in which 0 features were used. This captures cases in which no object was picked up during a sequence (Figure S2A). While this metric captures occurrences in which participants genuinely looked at the Model, turned to the Resource, and realized that they did not remember what they were looking for, it is likely also strongly contaminated by positional adjustments, lapses of attention, reorienting, and other unforeseen idiosyncrasies.
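As a minimal sketch of this counting rule (hypothetical event coding, not the deposited code), each correct pickup between two Model views contributes one identity feature and each placement contributes one location feature:

count_features <- function(actions) {
  # actions: events logged between two looks at the Model, e.g., c("pickup", "place", "pickup")
  sum(actions == "pickup") + sum(actions == "place")
}

count_features(c("pickup"))                      # 1 feature: identity only
count_features(c("pickup", "place"))             # 2 features: identity + location
count_features(c("pickup", "place", "pickup"))   # 3 features
count_features(character(0))                     # 0 features: no object handled during the sequence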

Model viewing time

Model viewing times were derived individually for each sequence. Thus, each viewing time had a corresponding features-in-memory value.

Display completion time

To calculate display completion time, we summed up the individual sequence completion times for each display.
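A minimal dplyr sketch of these two aggregation steps, assuming a sequence-level data frame with hypothetical column names:

library(dplyr)

display_level <- sequences %>%                            # one row per sequence (assumed columns)
  group_by(participant, condition, display) %>%
  summarise(completion_time = sum(sequence_duration),     # display completion time
            mean_features   = mean(features_in_memory),   # average WM features used per display
            .groups = "drop")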

Data analysis

The descriptions in this section are organized according to the figures in the main text and supplementary materials (see Data and Code Availability for access to data and code). Analyses were run using the lsr package,70 lme4 package64 and lmerTest65 in the R statistical programming language66 using RStudio.67 The ggplot2 package68 was used for data visualization and the MASS package71 for conducting the Box–Cox procedure.69
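For instance, the data-driven LOESS visualizations (Figures 2C and 3B) can be reproduced with ggplot2 along the following lines; the data frame and column names are assumptions, not the deposited code:

library(ggplot2)

ggplot(participant_means, aes(x = locomotion, y = features_in_memory)) +
  geom_line(aes(group = participant), alpha = 0.3) +            # per-participant averages
  stat_summary(fun = mean, geom = "point", size = 3) +          # group averages
  geom_smooth(method = "loess", se = TRUE, colour = "grey40") + # nonparametric LOESS fit with 95% CI
  labs(x = "Locomotive effort (degrees)", y = "Number of WM features used")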

All mixed-effects models were fitted with the maximum likelihood criterion. After inspecting the distributions of dependent variables, their residuals, and power coefficient outputs, we transformed the values in order to approximate a normal distribution more closely; here, the Box–Cox procedure suggested a log transformation for all relevant continuous variables. Predictor variables were z-transformed (scaled and centered) and, where relevant, higher-order orthogonal polynomials were evaluated (e.g., linear, quadratic, or cubic).
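A minimal sketch of these preprocessing steps in R (column names are assumptions):

library(MASS)

bc <- boxcox(viewing_time ~ 1, data = sequences, lambda = seq(-2, 2, 0.1))  # Box-Cox profile
bc$x[which.max(bc$y)]                            # a lambda near 0 indicates a log transformation
sequences$log_vt <- log(sequences$viewing_time)

sequences$locomotion_z <- scale(sequences$locomotion)[, 1]       # z-transform (scale and center) predictors
features_poly <- poly(sequences$features_in_memory, degree = 2)  # orthogonal linear and quadratic terms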

Statistical analysis related to Figure 2B – probability of using features in memory

Differences between means of conditions were analyzed using planned pairwise t tests. Planned comparisons revealed a significant difference between the probability values of all neighboring movement conditions, nested within each number of features in memory (see the table below; a sketch of one such comparison follows the table). Only the difference between 90° and 135° for 3 features in memory was not reliable (p = 0.5).

Row Measure Comparison t df p Cohen’s d Mean Diff 95% CI
1 probability 1 feature: 0° versus 45° 4.381 23 < 0.001 0.894 9.754 5.149, 14.359
2 probability 1 feature: 45° versus 90° 4.626 23 < 0.001 0.944 10.564 5.84, 15.289
3 probability 1 feature: 90° versus 135° 6.475 23 < 0.001 1.322 10.475 7.129, 13.822
4 probability 2 features: 0° versus 45° −4.804 23 < 0.001 0.981 −9.153 −13.094, −5.212
5 probability 2 features: 45° versus 90° −3.375 23 .003 0.689 −7.754 −12.508, −3.001
6 probability 2 features: 90° versus 135° −3.665 23 .001 0.748 −6.598 −10.323, −2.874
7 probability 3 features: 0° versus 45° −5.042 23 < 0.001 1.029 −3.387 −4.777, −1.998
8 probability 3 features: 45° versus 90° −3.064 23 .005 0.625 −2.345 −3.928, −0.762
9 probability 3 features: 90° versus 135° 0.681 23 .503 0.139 0.595 −1.214, 2.405
10 probability 4 features: 0° versus 45° −2.382 23 .026 0.486 −0.875 −1.634, −0.115
11 probability 4 features: 45° versus 90° −3.064 23 .005 0.625 −3.547 −5.942, −1.152
12 probability 4 features: 90° versus 135° −5.63 23 < 0.001 1.149 −5.432 −7.428, −3.436

Each row shows the results of a within-subject paired t-test.
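A minimal sketch of one such planned comparison with pairedSamplesTTest() from the lsr package; the data frame and its columns are assumptions:

library(lsr)

# probability of 1-feature sequences per participant, 0 deg versus 45 deg (long format)
sub <- subset(prob_by_condition, n_features == 1 & condition %in% c("0", "45"))
sub$condition <- droplevels(factor(sub$condition))
pairedSamplesTTest(formula = probability ~ condition + (participant), data = sub)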

Statistical analysis related to Figure 2C – number of features in memory

Generalized linear mixed-effects models (GLMMs) with a Poisson distribution were used to investigate how the experimental manipulation (from 0° to 135°) predicted the number of features in memory used. The random effects structure included a by-participant random intercept and a by-participant random slope for the locomotive condition. A model using a third-order polynomial (linear, quadratic, and cubic) demonstrated that an increase in locomotive demands significantly predicted the number of features in memory only when modeled with a first-order polynomial (linear) fit, β = 24.254, SE = 2.393, z = 10.134, p < 0.001 (compared to quadratic, p = 0.493 and cubic, p = 0.818). To be exhaustive, we performed model comparisons on an array of plausible models, which are summarized in the following table (a minimal sketch of the polynomial model fits follows the table).

Row Measure Predictor df AIC BIC logLik χ2 p
1 features in memory exponential (locomotion) 5 69392 69433 −34691 —– —–
2 features in memory logarithmic (locomotion) 5 69397 69438 −34694 0 1
3 features in memory linear (locomotion) 5 69358 69399 −34674 39.780 < 0.001
4 features in memory quadratic (locomotion) 6 69359 69408 −34674 0.448 .503
5 features in memory cubic (locomotion) 7 69361 69419 −34674 0.052 .820

Each row shows the results from a likelihood ratio procedure comparing each model with the preceding one.

Higher order polynomial fits always included the lower order polynomials.
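A minimal sketch of the polynomial Poisson GLMM fits and their likelihood-ratio comparisons (assumed column names; random-effects structure as described above):

library(lme4)

m_lin  <- glmer(features ~ poly(locomotion_z, 1) + (1 + locomotion_z | participant),
                data = sequences, family = poisson)
m_quad <- glmer(features ~ poly(locomotion_z, 2) + (1 + locomotion_z | participant),
                data = sequences, family = poisson)
m_cub  <- glmer(features ~ poly(locomotion_z, 3) + (1 + locomotion_z | participant),
                data = sequences, family = poisson)

anova(m_lin, m_quad, m_cub)   # likelihood-ratio test of each model against the preceding one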

Statistical analysis related to Figures 3A and S2B – Model viewing time

To analyze viewing durations during the initial viewing of the Model, we restricted our analysis to instances in which between 1 and 4 features in memory were used. We excluded sequences with viewing times below 50 ms and above 2000 ms, which excluded less than 1% of the data.

Summed initial viewing time of the Model area was added as a predictor to the best fitting GLMM modeling the number of features in memory used (section above). Viewing times significantly predicted the number of features used (Figure S2B), β = 0.014, SE = 0.006, z = 2.377, p = 0.017, but did not interact with locomotion, β = 0.007, SE = 0.005, z = 1.337, p = 0.181. The effect of locomotion remained significant after the inclusion of the new predictor, β = 0.131, SE = 0.011, z = 11.423, p < 0.001.

To investigate how the number of features in memory as well as locomotion influenced viewing times, we used linear mixed-effects models (Figure 3A). With respect to the random effects of the models, we started with maximal72 random-effects structures, which included by-participant random intercepts and by-participant random slopes for the effect of locomotion, the effect of features in memory, and the interaction of these two. The number of features in memory predictor entered both the fixed- and the random-effects structure with a linear as well as a quadratic fit (Figure 3A). Full models often lead to overparameterization and convergence issues.73 We ran a principal component analysis (PCA) of each fitted model’s random-effects variance-covariance estimates to identify overparameterization and then removed random slopes that were (a) not supported by the PCA and (b) not contributing significantly to the goodness of fit, as assessed by a likelihood ratio test comparing models with and without the slope in question. This procedure justified no simplification of the random-effects structure; thus, we report the inferential outcomes of the full model.
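A minimal sketch of the viewing-time model under assumed column names (not the deposited code):

library(lme4)
library(lmerTest)   # provides Satterthwaite degrees of freedom for the fixed effects

vt <- subset(sequences, features %in% 1:4 & viewing_time >= 50 & viewing_time <= 2000)
vt$log_vt <- log(vt$viewing_time)

m_vt <- lmer(log_vt ~ poly(features, 2) * locomotion_z +
               (1 + poly(features, 2) * locomotion_z | participant),
             data = vt, REML = FALSE)

summary(rePCA(m_vt))   # PCA of the random-effects structure to check for overparameterization
summary(m_vt)          # fixed effects with Satterthwaite p values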

Row Measure Predictor t df p β SE
1 Viewing time linear (features in memory) 1.600 23.55 .123 1.397 0.087
2 Viewing time quadratic (features in memory) 3.328 23.21 .003 2.890 0.087
3 Viewing time locomotion 11.361 22.63 < 0.001 0.145 0.013
4 Viewing time linear (features in memory): locomotion 0.039 22.37 .97 0.030 0.763
5 Viewing time quadratic (features in memory): locomotion 3.148 24.80 .004 2.032 0.645

Each row shows the results of a predictor variable.

The p-values were calculated with Satterthwaite’s degrees of freedom method.

To clarify the interaction (row 5) between locomotion and the number of features in memory (modeled with a quadratic fit), we calculated separate models for each locomotion condition.

Row Measure Predictor t df p β SE
1 Viewing time 0°: linear (features in memory) 3.179 22.31 < 0.001 1.804 0.567
2 Viewing time 0°: quadratic (features in memory) −0.009 3085 .993 −0.003 0.385
3 Viewing time 45°: linear (features in memory) −1.835 2946 .067 −0.757 0.413
4 Viewing time 45°: quadratic (features in memory) 1.069 18.32 .299 0.848 0.793
5 Viewing time 90°: linear (features in memory) 1.072 23.26 .295 0.834 0.779
6 Viewing time 90°: quadratic (features in memory) 4.436 25.88 < 0.001 2.332 0.525
7 Viewing time 135°: linear (features in memory) 3.479 21.95 .002 2.537 0.729
8 Viewing time 135°: quadratic (features in memory) 7.192 1329 < 0.001 3.159 0.439

Each row shows the results of a predictor variable.

The p-values were calculated with Satterthwaite’s degrees of freedom method.

Viewing times were longer for trials in which more WM features were used (positive slopes for all conditions).

Statistical analysis related to Figure 3B – display completion time

For analyzing the overall time it took participants to complete a display, we summed the individual sequence completion times for each display (Figure 3B). We also calculated the average number of features in memory used for each display, enabling us to relate these measures.

To predict display completion time, the linear mixed-effects model included the predictors locomotion and number of features in memory. The random-effects structure consisted of by-participant random intercepts and by-participant random slopes for the effect of locomotion, the effect of features in memory and the interaction of these two. The number of features in memory predictor entered both the fixed and the random-effects structure with a linear, as well as quadratic fit (Figure 3B).

Row Measure Predictor t df p β SE
1 Completion time linear (features in memory) −2.735 2.73 .079 −1.166 0.427
2 Completion time quadratic (features in memory) 15.078 14.97 < 0.001 7.820 0.519
3 Completion time locomotion 20.397 21.09 < 0.001 0.129 0.006
4 Completion time linear (features in memory): locomotion −6.840 6.457 < 0.001 −2.918 0.427
5 Completion time quadratic (features in memory): locomotion −10.085 3.334 .001 −3.784 0.375

Each row shows the results of a predictor variable.

The p-values were calculated with Satterthwaite’s degrees of freedom method.

To clarify the interactions (rows 4 and 5) between locomotion and the number of features in memory, we calculated separate models for each locomotion condition, as sketched below.
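These per-condition fits can be sketched as follows (hypothetical column names; display_level denotes the per-display data frame described above):

library(lmerTest)

per_condition <- lapply(split(display_level, display_level$condition), function(d) {
  lmer(log(completion_time) ~ poly(mean_features, 2) +
         (1 + poly(mean_features, 2) | participant),
       data = d, REML = FALSE)
})
lapply(per_condition, summary)   # linear and quadratic terms within each locomotion condition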

Row Measure Predictor t df p β SE
1 Completion time 0°: linear (features in memory) −19.90 647.97 < 0.001 −2.513 0.126
2 Completion time 0°: quadratic (features in memory) 8.44 634.74 < 0.001 0.907 0.107
3 Completion time 45°: linear (features in memory) −16.84 666.64 < 0.001 −2.281 0.135
4 Completion time 45°: quadratic (features in memory) 10.13 650.40 < 0.001 1.065 0.105
5 Completion time 90°: linear (features in memory) −6.824 23.29 < 0.001 −2.855 0.418
6 Completion time 90°: quadratic (features in memory) 4.359 119.78 < 0.001 0.841 0.193
7 Completion time 135°: linear (features in memory) −6.532 4.88 .001 −1.115 0.170
8 Completion time 135°: quadratic (features in memory) 8.175 3.91 .001 2.346 0.287

Each row shows the results of a predictor variable.

The p-values were calculated with Satterthwaite’s degrees of freedom method.

Acknowledgments

This research was funded by a Wellcome Trust Senior Investigator Award (104571/Z/14/Z), the John Fell Research Fund (University of Oxford), and a James S. McDonnell Foundation Understanding Human Cognition Collaborative Award (220020448) to A.C.N., and by the NIHR Oxford Health Biomedical Research Centre. The Wellcome Centre for Integrative Neuroimaging is supported by core funding from the Wellcome Trust (203139/Z/16/Z). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. We thank Sage Boettcher for helpful discussions and comments.

Author Contributions

Conceptualization, D.D.; Methodology, D.D., M.K., and A.C.N.; Software, M.K.; Formal Analysis, D.D. and M.K.; Investigation, D.D. and M.K.; Writing – Original Draft, D.D.; Writing – Review & Editing, D.D. and A.C.N.; Funding Acquisition, A.C.N.

Declaration of Interests

The authors declare no competing interests.

Published: December 4, 2020

Footnotes

Supplemental Information can be found online at https://doi.org/10.1016/j.cub.2020.11.013.

A video abstract is available at https://doi.org/10.1016/j.cub.2020.11.013#mmc5.

Supplemental Information

Document S1. Figures S1–S3
mmc1.pdf (435.2KB, pdf)
Document S2. Article plus Supplemental Information
mmc4.pdf (1.7MB, pdf)
Video Abstract
Download video file (20.7MB, mp4)

References

  • 1.Baddeley A. Working memory. Science. 1992;255:556–559. doi: 10.1126/science.1736359. [DOI] [PubMed] [Google Scholar]
  • 2.Fuster J.M., Alexander G.E. Neuron activity related to short-term memory. Science. 1971;173:652–654. doi: 10.1126/science.173.3997.652. [DOI] [PubMed] [Google Scholar]
  • 3.Funahashi S., Chafee M.V., Goldman-Rakic P.S. Prefrontal neuronal activity in rhesus monkeys performing a delayed anti-saccade task. Nature. 1993;365:753–756. doi: 10.1038/365753a0. [DOI] [PubMed] [Google Scholar]
  • 4.Awh E., Jonides J. Overlapping mechanisms of attention and spatial working memory. Trends Cogn. Sci. 2001;5:119–126. doi: 10.1016/s1364-6613(00)01593-x. [DOI] [PubMed] [Google Scholar]
  • 5.Vogel E.K., Machizawa M.G. Neural activity predicts individual differences in visual working memory capacity. Nature. 2004;428:748–751. doi: 10.1038/nature02447. [DOI] [PubMed] [Google Scholar]
  • 6.Bays P.M., Husain M. Dynamic shifts of limited working memory resources in human vision. Science. 2008;321:851–854. doi: 10.1126/science.1158023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Luck S.J., Vogel E.K. The capacity of visual working memory for features and conjunctions. Nature. 1997;390:279–281. doi: 10.1038/36846. [DOI] [PubMed] [Google Scholar]
  • 8.Brady T.F., Konkle T., Alvarez G.A. A review of visual memory capacity: beyond individual items and toward structured representations. J. Vis. 2011;11:4. doi: 10.1167/11.5.4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Ballard D.H., Hayhoe M.M., Pook P.K., Rao R.P.N. Deictic codes for the embodiment of cognition. Behav. Brain Sci. 1997;20:723–742, discussion 743–767. doi: 10.1017/s0140525x97001611. [DOI] [PubMed] [Google Scholar]
  • 10.Tatler B.W., Land M.F. Vision and the representation of the surroundings in spatial memory. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2011;366:596–610. doi: 10.1098/rstb.2010.0188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hayhoe M., Ballard D. Eye movements in natural behavior. Trends Cogn. Sci. 2005;9:188–194. doi: 10.1016/j.tics.2005.02.009. [DOI] [PubMed] [Google Scholar]
  • 12.Ballard D.H., Hayhoe M.M., Pelz J.B. Memory representations in natural tasks. J. Cogn. Neurosci. 1995;7:66–80. doi: 10.1162/jocn.1995.7.1.66. [DOI] [PubMed] [Google Scholar]
  • 13.Rouder J.N., Morey R.D., Cowan N., Zwilling C.E., Morey C.C., Pratte M.S. An assessment of fixed-capacity models of visual working memory. Proc. Natl. Acad. Sci. USA. 2008;105:5975–5979. doi: 10.1073/pnas.0711295105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Cowan N. The magical number 4 in short-term memory: a reconsideration of mental storage capacity. Behav. Brain Sci. 2001;24:87–114. doi: 10.1017/s0140525x01003922. [DOI] [PubMed] [Google Scholar]
  • 15.Conway A.R.A., Kane M.J., Bunting M.F., Hambrick D.Z., Wilhelm O., Engle R.W. Working memory span tasks: a methodological review and user’s guide. Psychon. Bull. Rev. 2005;12:769–786. doi: 10.3758/bf03196772. [DOI] [PubMed] [Google Scholar]
  • 16.Meghanathan R.N., van Leeuwen C., Nikolaev A.R. Fixation duration surpasses pupil size as a measure of memory load in free viewing. Front. Hum. Neurosci. 2015;8:1063. doi: 10.3389/fnhum.2014.01063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Huang L. Visual working memory is better characterized as a distributed resource rather than discrete slots. J. Vis. 2010;10:8. doi: 10.1167/10.14.8. [DOI] [PubMed] [Google Scholar]
  • 18.Wilken P., Ma W.J. A detection theory account of change detection. J. Vis. 2004;4:1120–1135. doi: 10.1167/4.12.11. [DOI] [PubMed] [Google Scholar]
  • 19.Zhang W., Luck S.J. Discrete fixed-resolution representations in visual working memory. Nature. 2008;453:233–235. doi: 10.1038/nature06860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Endress A.D., Potter M.C. Large capacity temporary visual memory. J. Exp. Psychol. Gen. 2014;143:548–565. doi: 10.1037/a0033934. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Brady T.F., Störmer V.S., Alvarez G.A. Working memory is not fixed-capacity: more active storage capacity for real-world objects than for simple stimuli. Proc. Natl. Acad. Sci. USA. 2016;113:7459–7464. doi: 10.1073/pnas.1520027113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Wheeler M.E., Treisman A.M. Binding in short-term visual memory. J. Exp. Psychol. Gen. 2002;131:48–64. doi: 10.1037//0096-3445.131.1.48. [DOI] [PubMed] [Google Scholar]
  • 23.Jiang Y., Olson I.R., Chun M.M. Organization of visual short-term memory. J. Exp. Psychol. Learn. Mem. Cogn. 2000;26:683–702. doi: 10.1037//0278-7393.26.3.683. [DOI] [PubMed] [Google Scholar]
  • 24.Foster J.J., Bsales E.M., Jaffe R.J., Awh E. Alpha-band activity reveals spontaneous representations of spatial position in visual working memory. Curr. Biol. 2017;27:3216–3223.e6. doi: 10.1016/j.cub.2017.09.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Schneegans S., Bays P.M. Neural architecture for feature binding in visual working memory. J. Neurosci. 2017;37:3913–3925. doi: 10.1523/JNEUROSCI.3493-16.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Rajsic J., Wilson D.E. Asymmetrical access to color and location in visual working memory. Atten. Percept. Psychophys. 2014;76:1902–1913. doi: 10.3758/s13414-014-0723-2. [DOI] [PubMed] [Google Scholar]
  • 27.Christie S.T., Schrater P. Cognitive cost as dynamic allocation of energetic resources. Front. Neurosci. 2015;9:289. doi: 10.3389/fnins.2015.00289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Solomon R.L. The influence of work on behavior. Psychol. Bull. 1948;45:1–40. doi: 10.1037/h0055527. [DOI] [PubMed] [Google Scholar]
  • 29.Maloney L.T., Zhang H. Decision-theoretic models of visual perception and action. Vision Res. 2010;50:2362–2374. doi: 10.1016/j.visres.2010.09.031. [DOI] [PubMed] [Google Scholar]
  • 30.Wolpert D.M., Landy M.S. Motor control is decision-making. Curr. Opin. Neurobiol. 2012;22:996–1003. doi: 10.1016/j.conb.2012.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Franklin D.W., Wolpert D.M. Computational mechanisms of sensorimotor control. Neuron. 2011;72:425–442. doi: 10.1016/j.neuron.2011.10.006. [DOI] [PubMed] [Google Scholar]
  • 32.Gottlieb J. Attention, learning, and the value of information. Neuron. 2012;76:281–295. doi: 10.1016/j.neuron.2012.09.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Hayhoe M.M. Vision and action. Annu. Rev. Vis. Sci. 2017;3:389–413. doi: 10.1146/annurev-vision-102016-061437. [DOI] [PubMed] [Google Scholar]
  • 34.Hayhoe M.M., Bensinger D.G., Ballard D.H. Task constraints in visual working memory. Vision Res. 1998;38:125–137. doi: 10.1016/s0042-6989(97)00116-8. [DOI] [PubMed] [Google Scholar]
  • 35.Melnik A., Schüler F., Rothkopf C.A., König P. The world as an external memory: the price of saccades in a sensorimotor task. Front. Behav. Neurosci. 2018;12:253. doi: 10.3389/fnbeh.2018.00253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Somai R.S., Schut M.J., Van der Stigchel S. Evidence for the world as an external memory: a trade-off between internal and external visual memory storage. Cortex. 2020;122:108–114. doi: 10.1016/j.cortex.2018.12.017. [DOI] [PubMed] [Google Scholar]
  • 37.Land M.F. Vision, eye movements, and natural behavior. Vis. Neurosci. 2009;26:51–62. doi: 10.1017/S0952523808080899. [DOI] [PubMed] [Google Scholar]
  • 38.Tatler B.W., Hayhoe M.M., Land M.F., Ballard D.H. Eye guidance in natural vision: reinterpreting salience. J. Vis. 2011;11:5. doi: 10.1167/11.5.5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Postle B.R., Idzikowski C., Sala S.D., Logie R.H., Baddeley A.D. The selective disruption of spatial working memory by eye movements. Q J Exp Psychol (Hove) 2006;59:100–120. doi: 10.1080/17470210500151410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Wolbers T., Hegarty M., Büchel C., Loomis J.M. Spatial updating: how the brain keeps track of changing object locations during observer motion. Nat. Neurosci. 2008;11:1223–1230. doi: 10.1038/nn.2189. [DOI] [PubMed] [Google Scholar]
  • 41.Thomas L.E., Seiffert A.E. Self-motion impairs multiple-object tracking. Cognition. 2010;117:80–86. doi: 10.1016/j.cognition.2010.07.002. [DOI] [PubMed] [Google Scholar]
  • 42.Oberauer K. Access to information in working memory: exploring the focus of attention. J. Exp. Psychol. Learn. Mem. Cogn. 2002;28:411–421. [PubMed] [Google Scholar]
  • 43.Öztekin I., Davachi L., McElree B. Are representations in working memory distinct from representations in long-term memory? Neural evidence in support of a single store. Psychol. Sci. 2010;21:1123–1133. doi: 10.1177/0956797610376651. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Myers N.E., Stokes M.G., Nobre A.C. Prioritizing information during working memory: beyond sustained internal attention. Trends Cogn. Sci. 2017;21:449–461. doi: 10.1016/j.tics.2017.03.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Horowitz T.S., Wolfe J.M. Visual search has no memory. Nature. 1998;394:575–577. doi: 10.1038/29068. [DOI] [PubMed] [Google Scholar]
  • 46.Peterson M.S., Kramer A.F., Wang R.F., Irwin D.E., McCarley J.S. Visual search has memory. Psychol. Sci. 2001;12:287–292. doi: 10.1111/1467-9280.00353. [DOI] [PubMed] [Google Scholar]
  • 47.Le-Hoa Võ M., Wolfe J.M. The role of memory for visual search in scenes. Ann. N Y Acad. Sci. 2015;1339:72–81. doi: 10.1111/nyas.12667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Solman G.J.F., Kingstone A. Balancing energetic and cognitive resources: memory use during search depends on the orienting effector. Cognition. 2014;132:443–454. doi: 10.1016/j.cognition.2014.05.005. [DOI] [PubMed] [Google Scholar]
  • 49.Helbing J., Draschkow D., Võ M.L.-H. Search superiority: goal-directed attentional allocation creates more reliable incidental identity and location memory than explicit encoding in naturalistic virtual environments. Cognition. 2020;196:104147. doi: 10.1016/j.cognition.2019.104147. [DOI] [PubMed] [Google Scholar]
  • 50.Draschkow D., Võ M.L.-H.L.-H. Scene grammar shapes the way we interact with objects, strengthens memories, and speeds search. Sci. Rep. 2017;7:16471. doi: 10.1038/s41598-017-16739-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Draschkow D., Võ M.L.-H. Of “what” and “where” in a natural search task: active object handling supports object location memory beyond the object’s identity. Atten. Percept. Psychophys. 2016;78:1574–1584. doi: 10.3758/s13414-016-1111-x. [DOI] [PubMed] [Google Scholar]
  • 52.Li C.L., Aivar M.P., Tong M.H., Hayhoe M.M. Memory shapes visual search strategies in large-scale environments. Sci. Rep. 2018;8:4324. doi: 10.1038/s41598-018-22731-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Tatler B.W., Tatler S.L. The influence of instructions on object memory in a real-world setting. J. Vis. 2013;13:5. doi: 10.1167/13.2.5. [DOI] [PubMed] [Google Scholar]
  • 54.Nobre A.C., Stokes M.G. Premembering experience: a hierarchy of time-scales for proactive attention. Neuron. 2019;104:132–146. doi: 10.1016/j.neuron.2019.08.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Engel A.K., Maye A., Kurthen M., König P. Where’s the action? The pragmatic turn in cognitive science. Trends Cogn. Sci. 2013;17:202–209. doi: 10.1016/j.tics.2013.03.006. [DOI] [PubMed] [Google Scholar]
  • 56.Horst J.S., Hout M.C. The Novel Object and Unusual Name (NOUN) Database: A collection of novel images for use in experimental research. Behav. Res. Methods. 2016;48:1393–1409. doi: 10.3758/s13428-015-0647-3. [DOI] [PubMed] [Google Scholar]
  • 57.Ballard D.H., Hayhoe M.M., Li F., Whitehead S.D. Hand-eye coordination during sequential tasks. Philos. Trans. R. Soc. Lond. B Biol. Sci. 1992;337:331–338, discussion 338–339. doi: 10.1098/rstb.1992.0111. [DOI] [PubMed] [Google Scholar]
  • 58.Isola P., Parikh D., Torralba A., Oliva A. Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011. NIPS; 2011. Understanding the intrinsic memorability of images; pp. 2429–2437. [Google Scholar]
  • 59.Rust N.C., Mehrpour V. Understanding image memorability. Trends Cogn. Sci. 2020;24:557–568. doi: 10.1016/j.tics.2020.04.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Bylinskii Z., Isola P., Bainbridge C., Torralba A., Oliva A. Intrinsic and extrinsic effects on image memorability. Vision Res. 2015;116:165–178. doi: 10.1016/j.visres.2015.03.005. [DOI] [PubMed] [Google Scholar]
  • 61.Vogel E.K., McCollough A.W., Machizawa M.G. Neural measures reveal individual differences in controlling access to working memory. Nature. 2005;438:500–503. doi: 10.1038/nature04171. [DOI] [PubMed] [Google Scholar]
  • 62.Alloway T.P., Alloway R.G. Investigating the predictive roles of working memory and IQ in academic attainment. J. Exp. Child Psychol. 2010;106:20–29. doi: 10.1016/j.jecp.2009.11.003. [DOI] [PubMed] [Google Scholar]
  • 63.Wickens C.D. Multiple resources and mental workload. Hum. Factors. 2008;50:449–455. doi: 10.1518/001872008X288394. [DOI] [PubMed] [Google Scholar]
  • 64.Bates D., Mächler M., Bolker B., Walker S. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 2015;67:1–48. [Google Scholar]
  • 65.Kuznetsova A., Brockhoff P.B., Christensen R.H.B. lmerTest package: tests in linear mixed effects models. J. Stat. Softw. 2017;82:1–26. [Google Scholar]
  • 66.R Development Core Team . R Foundation for Statistical Computing; 2012. R: A language and environment for statistical computing.http://www.r-project.org [Google Scholar]
  • 67.RStudio Team . 2016. RStudio: Integrated development environment for R. [Google Scholar]
  • 68.Wickham H. Springer; 2016. ggplot2: Elegant Graphics for Data Analysis. [Google Scholar]
  • 69.Box G.E.P., Cox D.R. An analysis of transformations. J. R. Stat. Soc. B. 1964;26:211–252. [Google Scholar]
  • 70.Navarro D.J. 2018. Learning statistics with R: a tutorial for psychology students and other beginners.https://open.umn.edu/opentextbooks/textbooks/559 [Google Scholar]
  • 71.Venables W.N., Ripley B.D. Springer; 2002. Modern Applied Statistics with S. [Google Scholar]
  • 72.Barr D.J., Levy R., Scheepers C., Tily H.J. Random effects structure for confirmatory hypothesis testing: keep it maximal. J. Mem. Lang. 2013;68 doi: 10.1016/j.jml.2012.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Bates D., Kliegl R., Vasishth S., Baayen H. Parsimonious mixed models. arXiv. 2015 https://arxiv.org/abs/1506.04967 arXiv:1506.04967v2. [Google Scholar]
