Abstract
When we imagine a picture, we move our eyes even though the picture is physically not present. These eye movements provide information about the ongoing process of mental imagery. Eye movements unfold over time, and previous research has shown that the temporal gaze dynamics of eye movements in mental imagery have unique properties, which are unrelated to those in perception. In mental imagery, refixations of previously fixated locations happen more often and in a more systematic manner than in perception. The origin of these unique properties remains unclear. We tested how the temporal structure of eye movements is influenced by the complexity of the mental image. Participants briefly saw and then maintained a pattern stimulus, consisting of one (easy condition) to four black segments (most difficult condition). When maintaining a simple pattern in imagery, participants restricted their gaze to a narrow area, and for more complex stimuli, eye movements were more spread out to distant areas. At the same time, fewer refixations were made in imagery when the stimuli were complex. The results show that refixations depend on the imagined content. While fixations of stimulus-related areas reflect the so-called ‘looking at nothing’ effect, gaze restriction emphasizes differences between mental imagery and perception.
Keywords: visual imagery, eye movement, looking at nothing, recurrence quantification analysis, individual differences
Introduction
Eye movements during mental imagery carry information about processes that underlie mental imagery. For example, when a previously seen picture is imagined, fixation distributions resemble those from actually looking at the picture. This is known as the ‘looking at nothing’ effect. This spatial similarity has supported the notion of the spatial nature of mental images (Kosslyn, 1994) as opposed to mental images relying on a propositional code (Pylyshyn, 1981, 2002; for a review see J. Pearson & Kosslyn, 2015). Furthermore, a strong spatial similarity of eye fixation distributions in perception and mental imagery is beneficial for later retrieval of the imagined content (Bochynska & Laeng, 2015; Johansson & Johansson, 2014; Jordana S. Wynn, Olsen, Binns, Buchsbaum, & Ryan, 2018). This suggests that ‘looking at nothing’ has functional value. Over all, the ‘looking at nothing’ effect is a consistent and reliable finding. However, it depends on how fine-grained the spatial analysis of fixations is. In a recent paper (Gurtner, Bischof, & Mast, 2019), we showed that the ‘looking at nothing’ effect is present when comparing the number of fixations in quadrants, whereas the underlying scan paths in imagery and perception show few similarities, as analyzed with MultiMatch (Dewhurst et al., 2012) and ScanMatch (Cristino, Mathôt, Theeuwes, & Gilchrist, 2010).1 Thus, the current understanding of the spatial characteristics of eye movements in mental imagery is still incomplete. More importantly though, little attention has been devoted to properties of eye movements in mental imagery other than their spatial similarity with eye movements in perception.
Indeed, eye movements in imagery have other unique characteristics. Typically, fixations in mental imagery have a smaller spread (Brandt & Stark, 1997; Gurtner et al., 2019; Johansson, Holsanova, & Holmqvist, 2006) and last longer (Brandt & Stark, 1997; Recarte & Nunes, 2000). Another dimension of eye movements is their temporal dynamics, that is, how eye movements evolve over time. The temporal dynamics are of special interest for imagery research for at least three reasons. First, the temporal dynamics provide descriptive information about eye movements in imagery themselves, without directly relating them to eye movements made in perception Dewhurst et al. (2012). Second, in two recent studies, we showed that eye movements in imagery differ in terms of their temporal dynamics from those made during perception of the same information (Gurtner et al., 2019; Gurtner, Hartmann, & Mast, 2021). For example, in imagery, participants made more and more systematic refixations and the refixations happened sooner in time. The origin of these differences remains unclear. Third and most importantly, temporal dynamics of eye movements allow to test hypotheses about the maintenance of mental images, because, as we will show, different processes manifest themselves differently in temporal gaze dynamics. Here, we focus on the imagery maintenance processes responsible for refixations during imagery.
There are at least two different possible mechanisms that can lead to refixations in imagery. On the one hand, refixations can occur because the mental images tend to fade rapidly (De Beni, Pazzaglia, & Gardini, 2007; Farah, 1989; Kosslyn, 1994) and the place-bound content needs to be reactivated by refixations to the relevant areas (Ferreira, Apel, & Henderson, 2008; Foerster, 2018; Laeng & Teodorescu, 2002; Mast & Kosslyn, 2002; Scholz, Klichowicz, & Krems, 2018; Jordana S. Wynn et al., 2016). This is in line with findings suggesting that eye movements serve the maintenance of imagined content (Damiano & Walther, 2019; D. G. Pearson, Ball, & Smith, 2014).
On the other hand, refixations in mental imagery can also arise because participants restrict their gaze in mental imagery, as illustrated by the lower spread and the reduced number of fixations in mental imagery (Brandt & Stark, 1997; Johansson et al., 2006). In particular, generating and maintaining mental images requires cognitive resources and therefore, it is particularly cost-sensitive, which in turn influences how many eye movements will be planned and executed. Restricting eye movements could be a means to avoid interference from unavoidable perceptual input when eyes are open. Such input can influence (J. Pearson, Clifford, & Tong, 2008; Teng & Kravitz, 2019) and even interfere with mental imagery (Wais, Rubens, Boccanfuso, & Gazzaley, 2010). Protection against perceptual input by restricting the gaze to one particular area can thus also be a means to maintain imagined visual content. Such a restriction of the gaze would naturally also lead to increased refixations. In summary, refixations can arise from reactivation of place-bound content or from restriction of the gaze.
The goal of the present experiment is to distinguish between these two possible mechanisms of imagery maintenance by means of their temporal dynamics. This requires the analysis of the temporal gaze pattern leading to a more complete picture of the underlying mechanisms. Before we elaborate how eye movement characteristics can disambiguate between reactivation of place-bound content or gaze restriction, we provide more details on how we quantify temporal gaze dynamics.
Temporal gaze dynamics can be analyzed by recurrence quantification analysis (RQA). RQA is commonly applied to the analysis of dynamic nonlinear systems that change over time. For example, it has been used to describe recurring patterns in changing water level of the sea (Webber Jr & Zbilut, 2005) or to the breathing patterns of anesthetized patients (Webber Jr & Zbilut, 1994). It has also been successfully applied to the analysis of eye movements in visual perception (Anderson, Bischof, Laidlaw, Risko, & Kingstone, 2013; Bischof, Anderson, & Kingstone, 2019; Farnand, Vaidyanathan, & Pelz, 2016; Vaidyanathan, Pelz, Alm, Shi, & Haake, 2014): it computes the percentage of recurrent fixations, or refixations of previously inspected areas in a single scan path. Recurrent fixations are fixations that are closer to one another with respect to a predefined threshold, usually the area covered by the fovea (Anderson et al., 2013; Zhang, 2020). To our knowledge, we were the first to apply this method to eye movements during mental imagery (Gurtner et al., 2019, 2021). In this context, three RQA measures provide particularly important information: First, the percentage of recurrent fixations indicates how often an observer refixates previously inspected areas. Second, the CORM value (Center of Recurrence Mass) describes the general timing of refixations. When refixations are temporally close, that is, when the gaze returns to an area after few or no fixations to other areas, the CORM value is low. Conversely, when there are many intermediate fixations to other parts of a stimulus between refixations of an area, the CORM value is high (see also Anderson et al., 2013). Third, the determinism value indicates the extent to which recurrent fixations follow the same sequence of previous fixations. It reflects how systematic or repetitive a sequence of refixations is (Anderson et al., 2013). In previous research, we showed that the imagined content influences RQA measures (Gurtner et al., 2019, 2021). In the present work, we investigate the relevant dimensions further by varying the complexity of mental images. The complexity allows to determine whether refixations in imagery originate from reactivation of place-bound content or whether they originate from gaze restriction.
Specifically, if refixations during mental imagery arise because fading away of the mental images needs to be counteracted, recurrence should increase with image complexity: The more complex an image, the more demands it makes on the cognitive processes of the maintaining individual, and the less this individual can afford to make image-unrelated, intermittent fixations. Complex images also contain more distinct parts or segments, which would have to be visited and reactivated before returning to one of the previous parts, leading to a larger spread of fixations (at least if the different segments are spread apart too), and later refixations whereby CORM values increase.
Conversely, if refixations arise from restricting the gaze, participants would increasingly avoid making eye movements as stimuli become more difficult to maintain, reducing the spread of fixations, increasing the rate of refixations, and a diminished spatial relation to the original stimulus. Restricting one’s gaze would also increase determinism values when imagining complex mental images, because the systematicity or repetitiveness of eye movement patterns would increase. Likewise, it would lead to lower CORM values, as refixations tend to occur in closer temporal proximity when the area is restricted.
In sum, how image complexity affects RQA measures can distinguish between the two distinct hypotheses about the origin of refixations in mental imagery and thus can shed light on processes responsible for image maintenance. Refixations can arise in mental imagery because place-bound, fading pictorial content is reactivated or because gaze is restricted to avoid interfering perceptual input. RQA is the tool to distinguish between these two different accounts. If refixations originate from reactivation, we expect increasing recurrence, spread and CORM with increasing stimulus complexity. If refixations originate from gaze restriction, we expect increasing recurrence and determinism, but CORM and spread to decrease with increasing complexity.
We investigated the origins of refixations and their dependency on image complexity in two experiments. In Experiment A, participants saw a geometric checkerboard pattern of varying complexity (Dror & Kosslyn, 1994, see Figure 1) for 700 ms and maintained it for 20 seconds, during which the screen remained blank. After the maintenance period, a cross was displayed, and participants had to indicate whether the cross would have fallen on a white or black part of the pattern. This task is, in parts, similar to tasks used in visuo-spatial working memory experiments (Schurgin, 2018; Sperling, 1963), where the analysis of eye movements has been key to understand visual continuity in active vision (Hollingworth, Richard, & Luck, 2008; Schurgin, 2018; Stigchel & Hollingworth, 2018). However, the task we used in this experiment differs from classical visual short-term memory tasks in that our participants maintained the stimulus for 20 seconds. Never the less, the proximity to visual working memory tasks and the fact that there is an argument that visual working memory and imagery might be different terms for the same phenomenon (Tong, 2013), inspired us to use dynamic visual noise (DVN) between the pattern and the blank screen in half of the trials (Dean, Dewhurst, Morris, & Whittaker, 2005; Dean, Dewhurst, & Whittaker, 2008; Dent, 2010; McConnell & Quinn, 2004; Orme, 2009), and to include a short-term visuo-spatial memory task (Visual Pattern Test). DVN was expected to interfere with the mental representation of the stimulus, which in turn should influence RQA parameters and thus impair mental image inspection at the end of the task. The main focus was to investigate whether the number of segments in the checkerboard pattern influenced the temporal gaze dynamics while participants maintained the pattern in imagery.
Methods
Participants
We recruited a total of 42 undergraduate students from the University Bern for the two experiments. They received course credit points as compensation for their participation. All participants had normal or corrected to normal vision. In the informed consent, we declared that the study was investigating whether pupil dilation can indicate general cognitive effort, i.e. whether individuals with high cognitive abilities show a smaller increase in pupil dilation when solving a memory task. The study was approved by the Ethics committee of the Faculty of Human Sciences of the University of Bern.
Material
In both experiments, eye movements were registered using the EyeLink 1000 Software, which was provided by SR Research GmbH (Canada). The samples were recorded at 500 Hz. Using an infrared light sensitive camera, the eye tracking device inferred the gaze position from the corneal reflex and the pupil location. At the beginning of the experiment, we presented a 9 point grid for calibration. The participant rested their chin on a head support and faced the 26-inch display (51 × 28 cm, resolution: 1920 × 1080 pixel). The distance between the eyes and the display was approximately 76 cm. For the stimulus, we used a 4 × 5 grid where each square measured 109 × 108 pixels. Together with the square brackets, measuring each 12 pixels in width, the stimulus was presented centrally and measured 462 × 566 pixels (approximately 14.1 × 15.1 cm, 10 × 12.20 degrees of visual angle).
Procedure
Both experiments comprised three parts: a maintenance task during which we measured participants’ eye movements, a paper-pencil test of visual short-term memory, and questionnaires assessing imagery vividness and demographic data. The participants spent two hours in the laboratory and were free to take breaks upon completion of each part.
The maintenance task began with the 9-point calibration, followed by 120 trials (see Figure 2). Each trial began with a drift correction, a fixation cross (for 500 ms) and a blank white screen (for 500 ms), after which participants saw a pattern of black blocks within a 4 × 5 grid (see Figure 1) for 700 ms. The patterns were constructed along the procedure of Dror and Kosslyn (1994). They were composed of straight segments containing one to four filled grid cells and were arranged in the 4 × 5 grid. The segments were allowed to touch and the stimulus also included the four corner brackets. The stimulus was followed by a blank screen that lasted for 20 seconds. Participants had to imagine the pattern they just saw while keeping their eyes on the screen. At the end of each trial, the four corner brackets reappeared and participants had to judge whether a cross on the screen would fall on a black or white background if the stimulus were still present. They indicated their answer by pressing one of two buttons on a keyboard in front of them. Finally, participants rated on a 7-point Likert-scale, how well they were able to maintain the stimulus. Experiment B was identical to Experiment A, except that we introduced two seconds of dynamic visual noise (DVN) between the stimulus and the blank screen in half of the trials. The other half of the trials was identical to all the trials of Experiment A. In previous research, participants imagined the stimuli for 15 seconds (Gurtner et al., 2019, 2021). Here, we chose a 20 second maintenance phase to ensure that a sufficient number of eye movements were made to compute RQA parameters. Moreover, a pretest with a 15 second maintenance phase has shown high accuracy in this task. A longer duration will likely increase difficulty and counteract a possible ceiling effect.
For the test of visual short-term memory, we used the Visual Pattern Test (VPT) (Della Sala, Gray, Baddeley, & Wilson, 1997), where participants looked at paper cards displaying a grid with an increasing number of squares, some of which were black and others white. The grids progressed in size from the smallest, a 2 × 2 matrix (with two black squares), to the largest, a 5 × 6 matrix (with 15 black squares). Each card was presented to the participant for three seconds. Immediately after viewing the card, the participants were required to mark the black squares on a response sheet. There was no time limit for responding, and participants could correct their responses if they wished. There were three patterns at each difficulty level and the test was terminated when the participant made a mistake in one of the three patterns at a given level of complexity. The number of correctly marked patterns was used to measure participants’ visuo-spatial short-term memory. At the end of the experiment, we asked participants what they thought the aim of the experiment was. No participant guessed that we were interested in eye movements during mental imagery.
Dynamic visual noise
For Experiment B, we created the DVN in accordance with the procedures used in the literature. To optimize interference with the stimulus, we chose a short stimulus presentation time (Dent, 2010; Kemps & Andrade, 2012; Zimmer & Speiser, 2002), no inter-stimulus-interval (Borst, Kosslyn, & Denis, 2006), and a high rate of change of the DVN (Dean et al., 2005; McConnell & Quinn, 2004). The size of the noise was matched to the stimulus size (464 × 568 pixels). It was comprised of 116 by 142 cells of 4 × 4 black and white pixels. The monitor we used to present the stimuli had a frame rate of 144 Hz. We changed the noise-picture every fourth frame, leading to 36 changes per second. We decided on a rate of change of 50% since this rate has been found to exert a strong interfering effect on memory (Dean et al., 2005). Thus, in the 36 changing noise pictures per second, we wanted 50% of the cells (that is 8236 cells) to change color. Therefore, we constructed the noise pictures such that in each picture, 228 random cells changed color with respect to the previous one. The code used to generate the DVN can be found at: https://osf.io/h4tjc.
Data analysis
Bayesian generalized regression models
We predicted the temporal gaze dynamics and the spread of fixations by the number of segments of our stimuli. To this end, we used Bayesian generalized hierarchical regressions. In a Bayesian analysis, a prior expectation about an effect is combined with empirical data, which results in a posterior estimate of the effect. Crucially, the posterior estimate is a distribution. In addition to the mean or median of the posterior, the distribution allows to quantify the certainty of the estimation through the range of the distribution. If the 95% density interval of this distribution (the credibility interval, CI) includes zero, the effect is typically absent. Conversely, if zero is not included in the CI, the effect is typically present. We used generalized regressions because they allow a linear model, which is restricted to normal data, to fit data that is not normally distributed. This is achieved by using a link function. In our case, we used a zero-one-inflated-beta distribution to fit the RQA measures (which vary between 0 and 100), and we used an ex-Gaussian distribution to fit the spread of fixations (which cannot be smaller than zero). The hierarchical structure of our regressions allows to model participant- and stimulus-specific variance. Thus, our models account for the fact that some participants have high recurrence values in mental imagery and others do not, and the fact that the influence of the number of segments in a stimulus can vary between participants. To this end, we included random intercepts and slopes for participants. Our model also accounts for the fact that some stimuli might be easier to imagine than their number of segments would actually predict. Therefore, we included random intercepts for stimuli. See Supplementary Material Section 1, for the model specifications of all regression models.
Implementation details
The fixations were defined by the Data Viewer software of SR research. We used the default settings for defining fixations: Each measurement is classified as part of a fixation if the pupil is visible to the eye tracker, the velocity of the gaze position does not exceed 30°/s and if its acceleration does not exceed 8000°/s. After extracting the fixations, we analyzed our data with R (R Core Team, 2015) and R-Studio (RStudio_Team, 2016) and with the packages of tidyverse (Wickham, 2017) to prepare and plot our data. We deleted all fixations that were either outside the screen, shorter than 100 ms or longer than 5 seconds. With these constraints in place, we included 90.30% of fixations in Experiment A, and 84.60% of fixations in Experiment B. To calculate the spread of fixations, we computed the average fixation point for each trial of each participant. Next, we calculated the distances of all fixations in the trial from the average fixation point. The median of these distances is reported as the spread of fixations. To compute the recurrence values, we used MATLAB and the functions made available by Anderson et al. (2013). Finally, the brms package (Bürkner, 2018) was used for the Bayesian generalized hierarchical regressions. To sample from the posterior distribution, we used four chains with 5000 iterations each. Once the regressions were run, we assessed model fit by checking the posterior predictive distributions. Based on the posterior, new data distributions are simulated. If a model fits the data well, the simulated data will resemble the actual empirical data. This was the case in all our models. The manuscript was produced using the papaja package (Aust & Barth, 2020). The data, study materials and analysis code will be made available at the Open Science Framework. This study was not preregistered.
Results
Experiment A investigated the impact of image complexity on eye movements in mental imagery. Specifically, we tested whether the number of refixations (recurrence), their systematicity (determinism), and their timing (CORM) depended on the number of segments in a stimulus that was imagined by participants. In Experiment B, we added DVN to interfere with the mental representation of the stimulus and to make the task more difficult.
Data quality and manipulation control
Before looking at the effect of the number of segments on RQA measures, we ensured that participants performed sufficiently well in the maintenance task and that the different segments had the desired effect on accuracy. Overall, participants were correct in 87.80% (SD = 0.33) of the trials in Experiment A and in 91.20% (SD = 0.28) in Experiment B, see also Figure 3.
Next, we assessed whether complex stimuli were harder to imagine. The number of segments in a stimulus reduced participants’ accuracy in both experiments (binomial regressions, Experiment A: beta = –0.45, CI: –0.74 – –0.17, Experiment B: beta = –0.66, CI: –1.17 – –0.18). This means, the more complex the stimulus was, the more difficult it was to imagine correctly.
In Experiment B, the dynamic visual noise did not change the probability of a correct answer being given (beta coefficient of DVN = –0.31, CI: –1.28 – 0.67) and it did not change the impact of number of segments on accuracy (beta coefficient of the interaction DVN with number of segments = 0.13, CI: –0.18 – 0.45). Furthermore, model comparisons showed that including the DVN as predictor did not add to the predictive power of the model (elpd difference: –2.14, standard error of the difference: 1.51).
Taken together, these results show that the participants complied with the instructions. The number of segments had the desired effect of increasing the task difficulty of the maintenance task. This means that our procedure manipulated the maintenance process as intended and thus, differences in eye movements in the different complexity conditions can inform us about mental image maintenance processes.
Image complexity and temporal gaze dynamics
Given that stimulus complexity increased task difficulty, we analyzed the influence of stimulus complexity on RQA measures to gain insight into the underlying maintenance mechanisms. We only analyzed trials in which we were certain that there was sufficient engagement with the maintenance task and in which imagery was successful. Therefore, we first excluded trials in which participants provided wrong answers. Second, we excluded trials in which participants found it difficult to maintain the stimulus, i.e. when the difficulty of maintenance rating was 1 or 2 out of 7 (18.80% in Experiment A, 15% in Experiment B, comparable amount of deletions for the different stimulus categories, see Supplementary Material, section 2). The number of segments of the stimulus influenced participants’ RQA measures. In both experiments, an increase in segments lead to a decrease in recurrence and determinism (see Figure 4). Thus, the more segments a stimulus had, the fewer refixations were made (i.e. the more fixations were made to previously unfixated areas) and the less systematic the remaining refixations became.
There are two possible confounds for this effect, both of which can be ruled out. First, the effects are independent of the spread of fixations, which was statistically controlled for: We included the spread of fixations as a predictor into the regressions. A second potential confound is the spread of the stimuli themselves. The screen was empty when participants maintained the stimuli. Nevertheless, eye movements in imagery could reflect the spread of the stimuli rather than reflecting the stimulus’ complexity. This would be expected, if eye movements predominantly reflect the ‘looking at nothing’ effect. To address this confound, we investigated whether the distribution of black pixels in each stimulus (i.e. how far black pixels were spread out in the stimulus) altered the relationship between the number of segments and recurrence. We found that the pixels’ distribution did not predict recurrence by itself, and including it as a predictor in a regression did not alter the effect of segment numbers on recurrence or enhance model fit. Therefore, we conclude that the effect of segment number on recurrence is not confounded with the pixels’ distribution in the maintained stimulus. For more details, see the Supplementary Material, section 3.
The reported effects indicate that the effect of the number of segments on RQA parameters is independent of the distribution of fixations. The estimated posterior distributions for the CORM measure showed dependence on the number of segments; they were centered around zero in both experiments. This means that the number of segments did not influence the timing of refixations. Moreover, DVN did not influence gaze dynamics or the spread of fixations (see the Supplementary Material, section 4).
Participants adapted their temporal gaze patterns when the stimulus was more complex in that they made fewer and less systematic refixations (as measured by recurrence and determinism) and more fixations to previously uninspected areas. At the same time, the spread of fixations increased as the number of segments in a stimulus increased (see Figure 4). Thus, participants made larger eye movements when they maintained a complex mental image and they restricted their gaze in trials where they maintained a simple mental image. This is in line with previous research on the relationship between the spread of fixations and task difficulty (Johansson, Holsanova, & Holmqvist, 2011; Scholz, Mehlhorn, Bocklish, & Krems, 2011; Jordana S. Wynn et al., 2018).
Refixations can arise from both restricting the gaze or from making repeated fixations to distal, stimulus-related areas, i.e. the spread of fixations does not necessarily relate to the rate of refixations. In our results, refixation rates (recurrence) decreased as stimulus complexity increased, and at the same time, the spread of fixations increased. To understand the origin of this result, the relationship between the spread and recurrence, Figure 5, is paramount. The figure shows that trials with high recurrence rates (above 50%) have a low median spread of fixations (lower than the threshold distance to define recurrence, dashed line). Thus, refixations happen particularly often in trials with a low spread of fixations. Conversely, there was no trial in which participants moved their eyes considerably and, at the same time, revisited the same distant locations repeatedly (no trial falls into the upper-right corner of the graph). If refixations originated from re-activation of place-bound content, we would expect to find at least some trials (those with more widely-spread segments) in which participants show widely spread fixations and higher recurrence levels. Therefore, refixations mostly arise from restricting the gaze to a small area. It could be argued that the relationship between recurrence and the spread of fixations is caused by the specific spatial layout of our stimuli, as they did not cover the entire screen. However, a reanalysis of previous data (Gurtner et al., 2021), where stimuli covered the entire screen, showed that, in mental imagery, the same pattern is present and even consistent across different types of stimuli (art, faces and landscapes, see Supplementary Material, section 5). Taken together, refixations originate predominantly from gaze restriction. The process responsible for gaze restriction is attenuated as difficulty to maintain the mental image increases.
Interindividual differences
In addition to our general findings, the analysis shows large inter-individual variance in the degree of refixations. In keeping with our previous results (Gurtner et al., 2019), some participants in the current experiment made few refixations (minimum of average recurrence, Experiment A: 9.08%, Experiment B: 8.87%) while others directed the vast majority of fixations to already inspected areas (maximum average recurrence, Experiment A: 72.70%, Experiment B: 95.40%). This was also reflected in the high standard deviations of the random intercepts between participants (Experiment A: 1.27, CI: 0.93 – 1.75; Experiment B: 1, CI: 0.72 – 1.40, please refer to the Supplementary Material, section 6, for the distribution of random intercepts). Thus, participants differed with respect to the number of refixations they made. This is important because it suggests that differences in recurrence are not simply noise in the data but rather reflect systematic differences between participants, suggesting to different strategies employed by different individuals.
At the same time, the differences between individuals’ general recurrence levels did not influence the way participants adapted their temporal gaze patterns to changing stimuli, given that the correlation between the random intercepts of participants and the random slopes for number of segments was estimated to be around zero in both experiments (Experiment A: –0.29, CI: –0.65 – 0.14; Experiment B: 0.05, CI: –0.38 – 0.49). Thus, regardless of how many refixations participants make, in general fewer refixations are made when the imagined content is more complex. This is illustrated in Figure 6. This means that the processes leading to the reduction of refixations in more complex mental images are the same between participants despite the different styles of eye movements during mental imagery.
Turning to the self-report measures, participants also indicated how well they had been able to imagine the stimulus and completed the Visual Pattern Test (VPT) as a measure for short-term memory. Figure 7 shows the correlations between both measures and the temporal gaze dynamics, aggregated over the participants of both experiments. Imagery rating and VPT are uncorrelated. This can be due to the fact that the patterns participants keep in mind in the VPT surpass the complexity of the stimuli they imagined in the maintenance task. It is also possible that the imaginability rating was biased by the answer in the maintenance task that was given immediately before assessing the vividness of the imagined stimulus. Imaginability ratings (i.e. answers to the question “how difficult was it to imagine the stimulus?”) and VPT scores were unrelated to the temporal gaze dynamics, but the temporal gaze dynamics (recurrence, determinism and CORM) are inter-correlated, in accordance with previous literature (Anderson et al., 2013; Zhang, Anderson, & Miller, 2021). The correlation between recurrence and determinism is higher than reported by Zhang et al. (2021), who applied RQA to eye movements in mind wandering during a concurring visual encoding task. This suggests that in mental imagery, recurrent fixations are more systematic compared to when there is concurrent visual input Zhang et al. (2021). This interpretation is confirmed by a re-analysis of the data of Gurtner et al. (2021), where the correlation between recurrence and determinism is significantly higher in mental imagery compared to perception, see Supplementary Material, section 5.2.
Discussion
We investigated the mechanisms involved in mental image maintenance. Specifically, we proposed that two mechanisms, reactivation of place-bound content and gaze restriction, can be responsible for eye movements during image maintenance. To distinguish between them, we analyzed how temporal gaze dynamics change as the maintenance task becomes more difficult, i.e., participants imagine stimuli increasingly complex segments. Participants were able to complete the maintenance task (average correctness Experiment A: 87.80%, Experiment B: 91.20%) and they complied with the instruction, given that the number of segments influenced their accuracy. The general pattern of eye movement results is the following: when participants maintained a simple stimulus, they made numerous and systematic refixations (recurrence and determinism were both high). Furthermore, spread of fixations was low when the imagined stimulus was simple. As the stimuli became increasingly difficult to maintain, i.e. as the number of segments increased, the spread of fixations increased and participants made fewer and less systematic refixations. This shows that temporal patterns of eye movements change as a function of mental activity associated with imaging complex and non-complex patterns. Interestingly, these results support both reactivation of place-bound content and gaze-restriction.
The analysis of the spatial gaze characteristics shows that fixations made when imagining a more complex image were spread out to a greater extent. It is unlikely that this effect is solely due to participants looking at more spread-out segments of the pattern stimulus. First, there were no segments to look at, the screen was blank during the time of eye movement measurement and participants had to maintain the mental image. Second, in principle, there is no one-to-one relationship between complexity and the spread of stimulus elements. Two segments of a simple stimulus can be far apart and four segments in a complex stimulus can be closely arranged (for the relationship between number of segments and the spread of black pixels, see Fig.1 in the Supplementary Material, section 3). Third, we showed that the distribution of the pixels participants had seen did not alter the relationship between the complexity of the imagined stimulus and RQA measures. We argue that as it is more difficult to maintain more complex images, supportive eye movements become necessary. Thus, the spatial gaze characteristics are in line with the idea of ‘looking at nothing’, or reactivation of place-bound content, i.e. the idea that re-fixations can help to counteract fading of a mental image (De Beni et al., 2007; Farah, 1989; Johansson, Holsanova, Dewhurst, & Holmqvist, 2012; Johansson & Johansson, 2014, 2020).
The analysis of the temporal gaze dynamics qualifies and extends the results of spatial gaze properties. The spread of eye movements increases with stimulus complexity, which is consistent with reactivation, while the rate of refixations and their systematicity decrease. Therefore, refixations cannot result exclusively from reactivation. The origin of refixations becomes more clear when analyzing the relationship between the spread of fixations and recurrence (Figure 5). In principle, high recurrence in mental imagery could originate from widely spread fixations repeatedly targeting the same parts of the mental image. This means, stimuli with more segments could increase the likelihood of refixations, which we hypothesized. Alternatively, high recurrence can originate from closely arranged fixations. Our results show that all trials with high recurrence have a low spread of fixations (the median spread of fixations is smaller than the threshold distance between two refixations, see the dotted line in Figure 5). This means that in trials with high recurrence, most fixations are close to each other. High recurrence rates thus originate from keeping the gaze on a restricted area. This pattern cannot be due to the small size of the stimuli used in this study, since the same pattern was present in a re-analysis of previous data based on stimuli covering the entire screen (Gurtner et al., 2021), where stimuli covered the entire screen. Thus, when imagining a simple stimulus, participants actively focus their gaze on a small region, leading to systematic refixations. As the stimulus becomes more complex, additional fixations to stimulus-related areas are needed, and the spread of fixations increases. At the same time, the rate of refixations drops. Thus, we can deduce that these stimulus-related areas are visited only once (if they had been visited repeatedly, spread would increase but recurrence would not drop). Taken together, the analysis of the temporal gaze dynamics suggests that in addition to reactivation of place-bound content (i.e. the ‘looking at nothing’ effect), eye movements in mental imagery also reflect a preference for restricting the gaze on a small area. This conclusion is only possible by analyzing temporal gaze dynamics in addition to summary statistics like the spread of fixations and thus highlights the gains of imagery research when taking into account the temporal dimension of eye movements. The results of the spread of fixations alone would have suggested that ‘looking at nothing’ is the main driver behind eye movements in mental imagery and the importance of gaze restriction would have been overseen.
Gaze restriction makes perceptual information processing less costly, because it implies that over the course of several fixations, the visual input remains rather constant. Constant input, in turn, is more easily suppressed if one wishes to disengage from processing perceptual information and focus on imagined content. Indeed, Craver-Lemley and Reeves (1992) and Reeves and Craver-Lemley (2012) demonstrated that we suppress visual information during visual mental imagery at least when perceptual acuity is high (Reeves, Grayhem, & Craver-Lemley, 2020), and there is evidence that the feedback circles of perceptual inference change during mental imagery (Dijkstra, Ambrogioni, Vidaurre, & Gerven, 2020). This disengagement might be reflected in the restriction of the gaze to a small area, which could help the maintenance of mental images. Accordingly, there is growing evidence that eye movements in mental imagery can interfere with mental imagery (J. Pearson et al., 2008; Teng & Kravitz, 2019; Wais et al., 2010). Exactly how, when, why, and to what degree this disengagement from analyzing visual content during mental imagery takes place is an interesting and important avenue for future research.
We suggest that the complexity of the mental image is an important factor that determines the relative importance of gaze restriction vs. making place-bound eye movements to support mental imagery. Restricting gaze might enable participants to imagine the stimulus more holistically as long as the stimulus’ complexity allows for this strategy. As the stimulus becomes more complex, this strategy might not be feasible because too much information would fall outside of the central part of the mental image. In this case, making eye movements to update the content in the periphery might enable maintenance of the entire mental image. This interpretation is supported by the fact that, as complexity increases, participants make more spread out, and more unique eye movements (lower recurrence and lower determinism). It follows that fixations in mental imagery can reflect reactivation of place-bound content but that they do not do so in every case. Eye movements in mental imagery are thus shaped by another process beyond the ‘looking at nothing’ effect (Chiquet, Martarelli, & Mast, 2020). High rates of refixations can be driven by active disengagement from perceptual information processing. The effect of the complexity on recurrence was controlled for the spread of fixations (Gurtner et al., 2019, 2021) and our results highlight processes that have been neglected in the study of eye movements in mental imagery.
The overall effect of image complexity on temporal gaze dynamics was accompanied by large differences between participants in the number of refixations they made. This is in accordance with previous findings about the temporal dynamics of eye movements during mental imagery (Gurtner et al., 2021). Interestingly, Kinjo, Fooken, and Spering (2020) showed that a third of their participants were “non-saccaders”, as they made few saccades during retrieval. The authors interpreted this as an individual preference in the trade-off between the benefit and the cost of making eye movements. At the same time, both “saccaders” and “non-saccaders” showed comparable memory performance. Likewise, our participants were able to solve the maintenance task despite considerable interindividual variance of the number and systematicity of refixations. It is possible that the individual cost and benefit of eye movements in mental imagery (Kinjo et al., 2020) can explain the variance in temporal gaze dynamics between participants.
This variance can be due to individual representation styles. Palmiero et al. (2019) suggest that individual combinations of imagery ability and strategy (i.e. object and spatial imagery, Bochynska & Laeng, 2015) lead to different forms of mental representations. Indeed, the type of mental representation could be the cause of differences in eye movements between learning styles (Cao & Nishihara, 2012) and it might be associated with the differences in eye movements that accompany navigation styles, i.e. whether an individual memorizes a map by landmarks, the route they have to remember or in a survey-style (Piccardi et al., 2016). The different types of mental representations could further produce the discrepancies in temporal gaze dynamics between experts and novices (Gandomkar, Tay, Brennan, & Mello-Thoms, 2017; Vaidyanathan et al., 2014).
Likewise, the representation participants base their mental images upon could influence temporal gaze dynamics in mental imagery. Participants with a non-spatial representation-style, i.e. object-imagers (Bochynska & Laeng, 2015), show diminished ‘looking at nothing’ (Chiquet, Martarelli, & Mast, 2022) and could be those who do not move their eyes much during imagery (non-saccaders, Kinjo et al., 2020). In their case, planning, executing and verifying eye movements could add noise, while their representation would not gain much from making stimulus-related eye movements. This interpretation is in line with our previous finding that working memory modulates the ‘looking at nothing’ effect (Gurtner et al., 2021). Larger working memory capacity leads to a spatially more accurate mental image, which, in turn, increases the spatial correspondence with the perceptual scenario. There might thus be different forms of mental representations associated with different eye movement dynamics in mental imagery. At the same time, the mechanism by which image complexity influences eye movement dynamics appears to be the same for everyone since individual recurrence levels did not influence the effect of complexity on eye movement dynamics.
Despite our efforts to maximize the interfering potential and to our surprise, the DVN in Experiment B did not influence the temporal gaze dynamics, the spread of fixations, or the accuracy in the maintenance task. The mental images were robust and did not fade more rapidly during the exposure to DVN. One possible reason for this is the type of stimuli we used. It is possible that participants maintained the geometric stimuli in the visual cache, which is more protected from incoming sensory data (J. Pearson & Keogh, 2019), instead of holding them in the visual buffer (Borst et al., 2006; Cocchini, Logie, Della Sala, MacPherson, & Baddeley, 2002) where the DVN might have a stronger effect. Furthermore, the influence of DVN on mental imagery can be particularly strong when visual details of the mental image were tested (Dean et al., 2005, 2008; Dent, 2010; McConnell & Quinn, 2004; Orme, 2009). Our stimuli might not have contained enough visual details for the effect of DVN to unfold. Finally, our task is similar to classical short-term memory tasks, and we thus used non-structured DVN. However, structured DVN might disrupt the mental images more efficiently. Structured noise has been used by Borst, Ganis, Thompson, and Kosslyn (2012), but their task differed substantially in timing (1.5 sec of image generation and maintenance vs. 20 sec of maintenance in our study) and the specific mental operation (image inspection vs. spatial judgment in our study). The task used by Dror and Kosslyn (1994) has several advantages for the aim of our study. First, it is based on the theoretical model of mental imagery by Kosslyn (1994) and it was designed specifically to test the image maintenance phase (separating it from image generation and inspection). Second, the task accounts for the fact that we group several black cells into a single segment (i.e. four black cells in a row result in a complexity of one, not four), which is unlike similar tasks used in the context of DVN. Vasques, Garcia, and Galera (2016) used the number of black cells but re-analysis of our data with their definition did not change the overall results substantially. Third, unlike more naturalistic pictures, the degree of picture complexity is controllable in a systematic fashion. Nevertheless, future research could consider other measures of stimulus complexity, such as entropy, especially with regard to natural pictures in which scene complexity cannot be varied easily. Beyond such methodological considerations, one could also speculate that the lack of an effect of DVN on gaze dynamics suggests that gaze properties are independent of short-term memory processes. Such speculation would also be consistent with the low correlation of the VPT with gaze dynamic measures, but warrants more research in the future.
Our study has two possible limitations. Participants could have restricted their gaze because they were imagining a comparatively small stimulus. However, we re-analyzed the data of a previous paper (Gurtner et al., 2021), where participants imagined scenes that covered the entire screen. In that study, the relationship between spread and recurrence is comparable to the non-linear relationship reported here. Therefore we conclude that temporal gaze dynamics are not a simple function of the size of the stimulus.
In addition, the definition of recurrent fixations depends on the choice of a threshold distance between fixations. The threshold is often defined so that it matches the size of the fovea, which is meaningful in the case of perception. It is not clear in how far this convention is suitable for mental images. We therefore analyzed our data with different threshold definitions (see Supplementary Material, section 7). As expected, the relationship between spread and recurrence changes but still, no participant made many refixations at the same time as moving their eyes over large distances. Thus, altered thresholds do not entail substantial changes in the results.
Conclusion
As the complexity of mental images increased, participants made fewer and less systematic refixations, while the spread of fixations increased. We show that refixations in mental imagery originate, at least partly, from a tendency to restrict the gaze, and this reflects disengagement from perception. In addition, simple mental images can be maintained without making eye movements, whereas complex mental images require stimulus-related fixations. The results suggest that eye movements in mental imagery are shaped by two mechanisms: the ‘looking at nothing’ effect to reactivate part-based pictorial content and the minimization of interference from unrelated perceptual input. The trade-off between the cost and the benefit of making eye movements could cause the pronounced interindividual differences in temporal gaze dynamics.
Data Accessibility Statement
The data will be made available at the Open Science Framework. To access the study materials and the analysis code, please contact the corresponding author (Lilla Gurtner). This study was not preregistered.
Additional File
The additional file for this article can be found as follows:
Acknowledgements
We would like to thank all participants and Teodora Moser for her help during data collection.
Footnotes
Both approaches compute scan path similarity. ScanMatch separates the screen into small quadrants and compares how similar the sequences of quadrants are, in which participants made fixations. MultiMatch converts the scan path into a vector representation, which allows to compare the scan path shape independently from its size, for example.
Ethics and Consent
The study was approved by the Ethics committee of the Faculty of Human Sciences of the University of Bern (number: 2017-04-00001). Participants gave informed consent prior to taking part in the study.
Competing Interests
The authors have no competing interests to declare.
Author contributions
The authors made the following contributions. Lilla M. Gurtner: Conceptualization, Data curation, Formal analysis, Methodology, Project administration, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing; Walter F. Bischof: Conceptualization, Writing – Review & Editing; Fred W. Mast: Conceptualization, Supervision, Funding Acquisition, Writing – Review & Editing.
References
- 1.Anderson, N. C., Bischof, W. F., Laidlaw, K. E. W., Risko, E. F., & Kingstone, A. (2013). Recurrence quantification analysis of eyemovements. Behavior Research Methods, 45, 842–856. DOI: 10.3758/s13428-012-0299-5 [DOI] [PubMed] [Google Scholar]
- 2.Aust, F., & Barth, M. (2020). papaja: Prepare reproducible APA journal articles with R Markdown. Retrieved from https://github.com/crsh/papaja%0D
- 3.Bischof, W. F., Anderson, N. C., & Kingstone, A. (2019). Temporal methods for eye movement analysis. In Klein C. & Ettinger U. (Eds.), Eye movement research: An introduction to its scientific foundations and applications (pp. 407–448). Berlin: Springers. Retrieved from https://arxiv.org/abs/arXiv:1011.1669v3. DOI: 10.1007/978-3-030-20085-5_10 [DOI] [Google Scholar]
- 4.Bochynska, A., & Laeng, B. (2015). Tracking down the path of memory: eye scanpaths facilitate retrieval of visuospatial information. Cognitive Processing, 16(1), 159–163. DOI: 10.1007/s10339-015-0690-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Borst, G., Ganis, G., Thompson, W. L., & Kosslyn, S. M. (2012). Representations in mental imagery and working memory: Evidence from different types of visual masks. Memory & Cognition, 40(2), 204–217. DOI: 10.3758/s13421-011-0143-7 [DOI] [PubMed] [Google Scholar]
- 6.Borst, G., Kosslyn, S. M., & Denis, M. (2006). Different cognitive processes in two image-scanning paradigms. Memory & Cognition, 34(3), 475–490. DOI: 10.3758/BF03193572 [DOI] [PubMed] [Google Scholar]
- 7.Brandt, S. A., & Stark, L. W. (1997). Spontaneous eye movements during visual imagery reflect the content of the visual scene. Journal of Cognitive Neuroscience, 9(1), 27–38. DOI: 10.1162/jocn.1997.9.1.27 [DOI] [PubMed] [Google Scholar]
- 8.Bürkner, P.-C. (2018). Advanced Bayesian multilevel modeling with the R package brms. The R Journal, 10(1), 395–411. DOI: 10.32614/RJ-2018-017 [DOI] [Google Scholar]
- 9.Cao, J., & Nishihara, A. (2012). Understand learning style by eye tracking in slide video learning. Journal of Educational Multimedia and Hypermedia, 21(4), 335–358. Retrieved from https://www.learntechlib.org/p/39182 [Google Scholar]
- 10.Chiquet, S., Martarelli, C. S., & Mast, F. W. (2020). Eye movements to absent objects during mental imagery and visual memory in immersive virtual reality. Virtual Reality, (0123456789). DOI: 10.1007/s10055-020-00478-y [DOI] [Google Scholar]
- 11.Chiquet, S., Martarelli, C. S., & Mast, F. W. (2022). Imagery-related eye movements in 3D space depend on individual differences in visual object imagery. Scientific Reports, 12(1), 1–10. DOI: 10.1038/s41598-022-18080-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Cocchini, G., Logie, R. H., Della Sala, S., MacPherson, S. E., & Baddeley, A. D. (2002). Concurrent performance of two memory tasks: Evidence for domain-specific working memory systems. Memory and Cognition, 30(7), 1086–1095. DOI: 10.3758/BF03194326 [DOI] [PubMed] [Google Scholar]
- 13.Craver-Lemley, C., & Reeves, A. (1992). How visual imagery interferes with vision. Psychological Review, 99(4), 633–649. DOI: 10.1037/0033-295X.99.4.633 [DOI] [PubMed] [Google Scholar]
- 14.Cristino, F., Mathôt, S., Theeuwes, J., & Gilchrist, I. D. (2010). ScanMatch: A novel method for comparing fixation sequences. Behavior Research Methods, 42(3), 692–700. DOI: 10.3758/BRM.42.3.692 [DOI] [PubMed] [Google Scholar]
- 15.Damiano, C., & Walther, D. B. (2019). Distinct roles of eye movements during memory encoding and retrieval. Cognition, 184(December 2018), 119–129. DOI: 10.1016/j.cognition.2018.12.014 [DOI] [PubMed] [Google Scholar]
- 16.De Beni, R., Pazzaglia, F., & Gardini, S. (2007). The generation and maintenance of visual mental images: Evidence from image type and aging. Brain and Cognition, 63(3), 271–278. DOI: 10.1016/j.bandc.2006.09.004 [DOI] [PubMed] [Google Scholar]
- 17.Dean, G. M., Dewhurst, S. A., Morris, P. E., & Whittaker, A. (2005). Selective Interference With the Use of Visual Images in the Symbolic Distance Paradigm. Journal of Experimental Psychology: Learning, Memory, and Cognition, 31(5), 1043–1068. DOI: 10.1037/0278-7393.31.5.1043 [DOI] [PubMed] [Google Scholar]
- 18.Dean, G. M., Dewhurst, S. A., & Whittaker, A. (2008). Dynamic visual noise interferes with storage in visual working memory. Experimental Psychology, 55(4), 283–289. DOI: 10.1027/1618-3169.55.4.283 [DOI] [PubMed] [Google Scholar]
- 19.Della Sala, S., Gray, C., Baddeley, A. D., & Wilson, J. T. L. (1997). Visual pattern test: a test of short-term visual recall. Thames Valley Test Company. [Google Scholar]
- 20.Dent, K. (2010). Dynamic Visual Noise Affects Visual Short-Term Memory for Surface Color, but not Spatial Location. 57(1), 17–26. DOI: 10.1027/1618-3169/a000003 [DOI] [PubMed] [Google Scholar]
- 21.Dewhurst, R., Nyström, M., Jarodzka, H., Foulsham, T., Johansson, R., & Holmqvist, K. (2012). It depends on how you look at it: Scanpath comparison in multiple dimensions with MultiMatch, a vector-based approach. Behavior Research Methods, 44(4), 1079–1100. DOI: 10.3758/s13428-012-0212-2 [DOI] [PubMed] [Google Scholar]
- 22.Dijkstra, N., Ambrogioni, L., Vidaurre, D., & van Gerven, M. (2020). Neural dynamics of perceptual inference and its reversal during imagery. eLife, 9, 1–19. DOI: 10.7554/eLife.53588 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Dror, I. E., & Kosslyn, S. M. (1994). Mental imagery and aging. Psychology and Aging, 9(1), 90–102. DOI: 10.1037//0882-7974.9.1.90 [DOI] [PubMed] [Google Scholar]
- 24.Farah, M. J. (1989). The neural basis of mental imagery. Trends in Neurosciences, 12(10), 395–399. DOI: 10.1016/0166-2236(89)90079-9 [DOI] [PubMed] [Google Scholar]
- 25.Farnand, S. P., Vaidyanathan, P., & Pelz, J. B. (2016). Recurrence metrics for eye movements in perceptual experiments. Journal of Eye Movement Research, 9(4), 1–12. DOI: 10.16910/jemr.9.4.1 [DOI] [Google Scholar]
- 26.Ferreira, F., Apel, J., & Henderson, J. M. (2008). Taking a new look at looking at nothing. Trends in Cognitive Sciences, 12(11), 405–410. DOI: 10.1016/j.tics.2008.07.007 [DOI] [PubMed] [Google Scholar]
- 27.Foerster, R. M. (2018). “Looking-at-nothing” during sequential sensorimotor actions: Long-term memory-based eye scanning of remembered target locations. Vision Research, 144 (June 2017), 29–37. DOI: 10.1016/j.visres.2018.01.005 [DOI] [PubMed] [Google Scholar]
- 28.Gandomkar, Z., Tay, K., Brennan, P. C., & Mello-Thoms, C. (2017). A model based on temporal dynamics of fixations for distinguishing expert radiologists’ scanpaths. Medical Imaging 2017: Image Perception, Observer Performance, and Technology Assessment, 10136, 1013606. DOI: 10.1117/12.2254527 [DOI] [Google Scholar]
- 29.Gurtner, L. M., Bischof, W. F., & Mast, F. W. (2019). Recurrence quantification analysis of eye movements during mental imagery. Journal of Vision, 19(1), 17. DOI: 10.1167/19.1.17 [DOI] [PubMed] [Google Scholar]
- 30.Gurtner, L. M., Hartmann, M., & Mast, F. W. (2021). Eye movements during visual imagery and perception show spatial correspondence but have unique temporal signatures. Cognition, 210, 104597. DOI: 10.1016/j.cognition.2021.104597 [DOI] [PubMed] [Google Scholar]
- 31.Hollingworth, A., Richard, A. M., & Luck, S. J. (2008). Understanding the Function of Visual Short-Term Memory: Transsaccadic Memory, Object Correspondence, and Gaze Correction. Journal of Experimental Psychology. General, 137(1), 163–181. Understanding. DOI: 10.1037/0096-3445.137.1.163 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Johansson, R., Holsanova, J., Dewhurst, R., & Holmqvist, K. (2012). Eye movements during scene recollection have a functional role, but they are not reinstatements of those produced during encoding. Journal of Experimental Psychology. Human Perception and Performance, 38(5), 1289–1314. DOI: 10.1037/a0026585 [DOI] [PubMed] [Google Scholar]
- 33.Johansson, R., Holsanova, J., & Holmqvist, K. (2006). Pictures and spoken descriptions elicit similar eye movements during mental imagery, both in light and in complete darkness. Cognitive Science, 30(6), 1053–1079. DOI: 10.1207/s15516709cog0000_86 [DOI] [PubMed] [Google Scholar]
- 34.Johansson, R., Holsanova, J., & Holmqvist, K. (2011). The dispersion of eye movements during visual imagery is related to individual differences in spatial imagery ability. Proceedings of the 33rd Annual Meeting of the Cognitive Science Society, 1200–1205. DOI: 10.1037/a0026585 [DOI] [Google Scholar]
- 35.Johansson, R., & Johansson, M. (2014). Look here, eye movements play a functional role in memory retrieval. Psychological Science, 25(1), 236–242. DOI: 10.1177/0956797613498260 [DOI] [PubMed] [Google Scholar]
- 36.Johansson, R., & Johansson, M. (2020). Gaze position regulates memory accessibility during competitive memory retrieval. Cognition, 197, 104169. DOI: 10.1016/j.cognition.2019.104169 [DOI] [PubMed] [Google Scholar]
- 37.Kemps, E., & Andrade, J. (2012). Dynamic visual noise reduces confidence in short-Term memory for visual information. Cognitive Processing, 13(2), 183–188. DOI: 10.1007/s10339-011-0429-5 [DOI] [PubMed] [Google Scholar]
- 38.Kinjo, H., Fooken, J., & Spering, M. (2020). Do eye movements enhance visual memory retrieval? Vision Research, 176(July), 80–90. DOI: 10.1016/j.visres.2020.07.013 [DOI] [PubMed] [Google Scholar]
- 39.Kosslyn, S. M. (1994). Image and Brain: The Resolution of the Imagery Debate. Cambridge, MA: MIT Press. DOI: 10.7551/mitpress/3653.001.0001 [DOI] [Google Scholar]
- 40.Laeng, B., & Teodorescu, D.-S. (2002). Eye scanpaths during visual imagery reenact those of perception of the same visual scene. Cognitive Science, 26, 207–231. DOI: 10.1207/s15516709cog2602_3 [DOI] [Google Scholar]
- 41.Mast, F. W., & Kosslyn, S. M. (2002). Visual mental images can be ambiguous: Insights from individual differences in spatial transformation abilities. Cognition, 86(1), 57–70. DOI: 10.1016/S0010-0277(02)00137-3 [DOI] [PubMed] [Google Scholar]
- 42.McConnell, J., & Quinn, J. G. (2004). Complexity factors in visuo-spatial working memory Complexity factors in visuo-spatial working memory. Memory, 12(3), 338–350. DOI: 10.1080/09658210344000035 [DOI] [PubMed] [Google Scholar]
- 43.Orme, E. (2009). Identifying the Functional Architecture Underlying Multiple Representations in Visual Working Memory (PhD thesis). Doctoral Thesis, Northumbria University. [Google Scholar]
- 44.Palmiero, M., Piccardi, L., Giancola, M., Nori, R., D’Amico, S., & Olivetti Belardinelli, M. (2019). The format of mental imagery: from a critical review to an integrated embodied representation approach. Cognitive Processing, 20(3), 277–289. DOI: 10.1007/s10339-019-00908-z [DOI] [PubMed] [Google Scholar]
- 45.Pearson, D. G., Ball, K., & Smith, D. T. (2014). Oculomotor preparation as a rehearsal mechanism in spatial working memory. Cognition, 132(3), 416–428. DOI: 10.1016/j.cognition.2014.05.006 [DOI] [PubMed] [Google Scholar]
- 46.Pearson, J., Clifford, C. W., & Tong, F. (2008). The functional impact of mental imagery on conscious perception. Current Biology, 18(13), 982–986. DOI: 10.1016/j.cub.2008.05.048 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Pearson, J., & Keogh, R. (2019). Redefining visual working memory: A cognitive-strategy, brain-region approach. Current Directions in Psychological Science, 28(3), 266–273. DOI: 10.1177/0963721419835210 [DOI] [Google Scholar]
- 48.Pearson, J., & Kosslyn, S. M. (2015). The heterogeneity of mental representation: Ending the imagery debate. Proceedings of the National Academy of Sciences, 112, 10089–10092. DOI: 10.1073/pnas.1504933112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Piccardi, L., De Luca, M., Nori, R., Palermo, L., Iachini, F., & Guariglia, C. (2016). Navigational style influences eye movement pattern during exploration and learning of an environmental map. Frontiers in Behavioral Neuroscience, 10(Jun), 1–15. DOI: 10.3389/fnbeh.2016.00140 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Pylyshyn, Z. W. (1981). The Imagery Debate: Analogue media versus tacit knowledge. Psychological Review, 88 (1), 16–45. DOI: 10.1037/0033-295X.88.1.16 [DOI] [Google Scholar]
- 51.Pylyshyn, Z. W. (2002). Mental imagery: in search of a theory. Behavioral and Brain Sciences, 25, 157–238. DOI: 10.1017/S0140525X02000043 [DOI] [PubMed] [Google Scholar]
- 52.R Core Team. (2015). R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from http://www.r-project.org/ [Google Scholar]
- 53.Recarte, M. A., & Nunes, L. M. (2000). Effects of verbal and spatial-imagery tasks on eye fixations while driving. Journal of Experimental Psychology: Applied, 6(1), 31–43. DOI: 10.1037/1076-898X.6.1.31 [DOI] [PubMed] [Google Scholar]
- 54.Reeves, A., & Craver-Lemley, C. (2012). Unmasking the Perky effect. Frontiers in Psychology, 3. DOI: 10.3389/fpsyg.2012.00296 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Reeves, A., Grayhem, R., & Craver-Lemley, C. (2020). The Perky effect revisited: Imagery hinders perception at high levels, but aids it at low. Vision Research, 167(June 2019), 8–14. DOI: 10.1016/j.visres.2019.12.004 [DOI] [PubMed] [Google Scholar]
- 56.RStudio_Team. (2016). RStudio: Integrated Development Environment for R. Boston, MA: RStudio, Inc. Retrieved from http://www.rstudio.com/ [Google Scholar]
- 57.Scholz, A., Klichowicz, A., & Krems, J. F. (2018). Covert shifts of attention can account for the functional role of “eye movements to nothing”. Memory and Cognition, 46(2), 230–243. DOI: 10.3758/s13421-017-0760-x [DOI] [PubMed] [Google Scholar]
- 58.Scholz, A., Mehlhorn, K., Bocklish, F., & Krems, J. F. (2011). Looking at nothing diminishes with practice. Proceedings of the Annual Meeting of the Cognitive Science Society, 33(33), 66–67. [Google Scholar]
- 59.Schurgin, M. W. (2018). Visual memory, the long and the short of it: A review of visual working memory and long-term memory. Attention, Perception, and Psychophysics, 80(April), 1035–1056. DOI: 10.3758/s13414-018-1522-y [DOI] [PubMed] [Google Scholar]
- 60.Sperling, G. (1963). A Model for Visual Memory Tasks. Human Factors: The Journal of the Human Factors and Ergonomics Society, 1, 19–31. DOI: 10.1177/001872086300500103 [DOI] [PubMed] [Google Scholar]
- 61.Stigchel, S. V. D., & Hollingworth, A. (2018). Visuospatial Working Memory as a Fundamental Component of the Eye Movement System. Current Directions in Psychological Science, 27(2), 136–143. DOI: 10.1177/0963721417741710 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Teng, C., & Kravitz, D. J. (2019). Visual working memory directly alters perception. Nature Human Behaviour, 3(8), 827–836. DOI: 10.1038/s41562-019-0640-4 [DOI] [PubMed] [Google Scholar]
- 63.Tong, F. (2013). Imagery and visual working memory: one and the same? Trends in Cognitive Sciences, 17(10), 489–490. DOI: 10.1016/j.tics.2013.08.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Vaidyanathan, P., Pelz, J., Alm, C., Shi, P., & Haake, A. (2014). Recurrence quantification analysis reveals eye-movement behavior differences between experts and novices. Proceedings of the Symposium on Eye Tracking Research and Applications, 303–306. Safety Harbor, Florida: ACM. DOI: 10.1145/2578153.2578207 [DOI] [Google Scholar]
- 65.Vasques, R., Garcia, R. B., & Galera, C. (2016). Short-term memory recall of visual patterns under static and dynamic visual noise. Psychology & Neuroscience, 9(1), 46–53. DOI: 10.1037/pne0000039 [DOI] [Google Scholar]
- 66.Wais, P. E., Rubens, M. T., Boccanfuso, J., & Gazzaley, A. (2010). Neural Mechanisms Underlying the Impact of Visual Distraction on Retrieval of Long-Term Memory. Journal of Neuroscience, 30(25), 8541–8550. DOI: 10.1523/JNEUROSCI.1478-10.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Webber, C. L., Jr., & Zbilut, J. P. (1994). Dynamical assessment of physiological systems and states using recurrence plot strategies. Journal of Applied Physiology, 76(2), 965–973. DOI: 10.1152/jappl.1994.76.2.965 [DOI] [PubMed] [Google Scholar]
- 68.Webber, C. L., Jr., & Zbilut, J. P. (2005). Recurrence quantification analysis of nonlinear dynamical systems. Tutorials in Comtemporarty Nonlinear Methods for the Behavioural Sciences (pp. 26–94). [Google Scholar]
- 69.Wickham, H. (2017). tidyverse: Easily Install and Load the ‘Tidyverse’. Retrieved from https://cran.r-project.org/package=tidyverse
- 70.Wynn, J. S., Bone, M. B., Dragan, M. C., Hoffman, K. L., Buchsbaum, B. R., & Ryan, J. D. (2016). Selective scanpath repetition during memory-guided visual search. Visual Cognition, 24(1), 15–37. DOI: 10.1080/13506285.2016.1175531 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Wynn, J. S., Olsen, R. K., Binns, M. A., Buchsbaum, B. R., & Ryan, J. D. (2018). Fixation reinstatement supports visuospatial memory in older adults. Journal of Experimental Psychology: Human Perception and Performance, 44(7), 1119–1127. DOI: 10.1037/xhp0000522 [DOI] [PubMed] [Google Scholar]
- 72.Zhang, H. (2020). Mind-Wandering: What Can We Learn from Eye Movements? DOI: 10.31234/osf.io/n9fbz [DOI] [Google Scholar]
- 73.Zhang, H., Anderson, N. C., & Miller, K. F. (2021). Refixation patterns of Mind-wandering during Real-world Scene Perception. Journal of Experimental Psychology. Human Perception and Performance, 47(1), 36–52. DOI: 10.1037/xhp0000877 [DOI] [PubMed] [Google Scholar]
- 74.Zimmer, H. D., & Speiser, H. R. (2002). The irrelevant picture effect in visuo-spatial working memory: Fact or fiction? Psychological Test and Assessment Modeling, 44(2), 223–247. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data will be made available at the Open Science Framework. To access the study materials and the analysis code, please contact the corresponding author (Lilla Gurtner). This study was not preregistered.