Abstract
Searching for a visual object naturally involves sequences of gaze fixations, during which the current foveal image is analyzed and the next object to inspect is selected as a saccade target. Fixation durations during such sequences are short, suggesting that saccades may be concurrently processed. Therefore, the selection of the next saccade target may occur before the current saccade target is acquired. To test this hypothesis, we trained four female rhesus monkeys (Macaca mulatta) to perform a multiple-fixation visual conjunction search task. We simultaneously recorded the activity of sensorimotor neurons in the midbrain superior colliculus (SC) in two monkeys. In this task, monkeys made multiple fixations before foveating the target. Fixation durations were significantly shorter than the latency of the initial responses to the search display, with approximately one-quarter being shorter than the shortest response latencies. The time at which SC sensorimotor activity discriminated the target from distracters occurred significantly earlier for the selection of subsequent fixations than for the selection of the first fixation. Target selection during subsequent fixations occurred even before the visual afferent delay in more than half of the neuronal sample, suggesting that the process of selection can encompass at least two future saccade targets. This predictive selection was present even when differences in saccade latencies were taken into account. Altogether, these findings demonstrate how neural representations on the visual salience map are processed in parallel, thus facilitating visual search.
Keywords: concurrent processing, monkey, saccade target selection, visual search
Introduction
Active vision entails sequences of multiple fixations and saccadic eye movements (Henderson, 2003). During fixations, the foveal image is analyzed and the next saccade target is selected. This process is thought to occur within the visual salience map, wherein multiple stimulus representations compete for selection. Neurons within the frontal eye fields (FEF), the superior colliculus (SC), and the lateral intraparietal (LIP) area are thought to instantiate the visual salience map. Evidence from visual search studies show how these neurons initially have indiscriminate responses to stimuli in their receptive fields (RFs), but their activity evolves to indicate the target location before saccades (FEF: Thompson et al., 1996, 2005; SC: McPeek and Keller, 2002a; Shen and Paré, 2007; Shen et al., 2011; LIP: Ipata et al., 2006; Thomas and Paré, 2007). However, this account is limited to the selection of the first saccade target after display presentation. How these selection mechanisms extend beyond the first saccade remains mostly unknown.
The unconstrained nature of active vision makes the study of its underlying neural mechanisms using natural scenes difficult: visual stimulation is highly variable across fixations and saccades with similar vectors occur infrequently (Phillips and Segraves, 2010). Difficult search tasks offer an alternative approach because they naturally promote multiple fixations (Motter and Belky, 1998; Shen and Paré, 2006; Nothdurft et al., 2009). Combined with neuronal recordings, search tasks can be used to investigate saccade target selection from time-varying visual representations during sequences of fixations (Bichot et al., 2005; Zhou and Desimone, 2011).
Interestingly, in multi-fixation search, fixation durations that occur after the initial saccade response can be shorter than the latency of the initial response (Viviani and Swensson, 1982; McPeek and Keller, 2001; Van Loon et al., 2002). Similarly, brief intersaccadic intervals have been reported in reading and simple saccade tasks (for review, see Bahill et al., 1975; Rayner, 1998). Short intersaccadic intervals are characteristic of preplanned sequences (Becker and Jürgens, 1979; Zingale and Kowler, 1987) and thought to reflect concurrent saccade processing. Indeed, information about a search target may be acquired at least one fixation ahead. The second saccade of a sequence is influenced by visual information available before the first saccade (Caspi et al., 2004). Observers can identify a search target at least one fixation in advance (Kotowicz et al., 2010), and planning can occur for sequences of up to three saccades (De Vries et al., 2013). Moreover, visual discrimination is enhanced at the locations of future saccade targets before the first saccade (Godijn and Theeuwes, 2003; Baldauf and Deubel, 2008; Gersch et al., 2009), suggesting that visual attention can be allocated in parallel more than one fixation ahead.
Because saccade target selection is thought to rest on the allocation of visual attention, the process of selecting the next saccade target could begin before the current saccade target is acquired. We tested this hypothesis by recording the activity of SC visuo-movement neurons while monkeys searched for a target defined by a unique conjunction of features.
Materials and Methods
Data were collected from four female rhesus monkeys (Macaca mulatta, 5–6.5 kg, 6–10 years old) cared for under experimental protocols approved by the Queen's University Animal Care Committee and in accordance with the Canadian Council on Animal Care guidelines. During the postsurgical recovery period, animals received both antibiotics and analgesic medications. Monkeys were then trained with operant conditioning and positive reinforcement to perform fixation and saccade tasks for a liquid reward until satiation. The surgical procedure, stimulus presentation, data acquisition, and data analyses have been described previously (Shen and Paré, 2006, 2007).
Behavioral paradigms
Grid conjunction search task.
The main data of this report were collected while monkeys performed an unconstrained grid conjunction search task. On each trial, monkeys fixated a fixation stimulus that served as a cue for the target. This fixation stimulus could randomly appear in any of the 16 possible positions for the subsequent grid display (Fig. 1). Once fixated for 500–800 ms, the fixation stimulus disappeared, followed by a gap period of 200 ms during which there was no visual stimulation, but the animal was required to maintain fixation within a defined window. The search display then appeared, consisting of 16 stimuli that were conjunctions of a color [yellow: CIE (for Commission internationale de l'éclairage) x = 0.43, CIE y = 0.48; or green: CIE x = 0.29, CIE y = 0.60] and a shape (circle or square). The single target stimulus was a unique combination of these features, with distracter stimuli that could share either a feature with the target (same color or same shape) or have no feature in common with the target (opposite). The distracter types appeared in equal proportions (i.e., five each). The stimuli were matched for luminance (20 cd/m2) using a Minolta CS-100 Plus photometer and presented on a black background (<0.01 cd/m2). Monkeys were given 500 ms after the presentation of the search display to make their first gaze fixation and were given an additional 5000 ms for subsequent fixations if this initial fixation was not on the target. Monkeys were required to fixate the target for 500 ms to receive a liquid reward. The search target remained the same within each experimental session but changed across sessions. For sessions in which only behavioral data were collected, the stimuli were each 1° in diameter and were spaced 10° apart horizontally and vertically. For sessions in which neuronal activity was also recorded, the stimuli were arranged such that their spacing was determined by the vector of the RF of the neuron. In other words, the search display was positioned so as to maximize the probability that a stimulus fell in the RF of the neuron on each fixation. The stimulus dimensions were then scaled with the eccentricity of the RF and in proportion to striate cortex RF magnification (Van Essen et al., 1984). Behavioral data were collected from all four animals (monkey F, 8; monkey M, 24; monkey G, 4; monkey H, 4). In two of the animals (monkeys M and F), we also simultaneously recorded the extracellular activity of single SC neurons (n = 32) using previously described methods (Paré and Wurtz, 2001). Spike occurrences were sampled at 1 kHz.
Grid detection task.
Each grid conjunction search session was preceded by a grid detection task that served to familiarize the monkey with the target for that session. Trials in this task proceeded as those in the search task, but on display presentation only the target stimulus appeared (with no distracters) and monkeys were rewarded for fixating the single target stimulus within 500 ms of presentation. In neuronal recording sessions, this task was also used to characterize the extent of the RF of the neuron and to ensure proper display alignment for the grid conjunction search task.
Concentric conjunction search task.
This report also includes neuronal data recorded from monkeys G (n = 9) and H (n = 11) using an unconstrained conjunction search task that was the focus of a previous study (Shen and Paré, 2007). In this task, each trial began with a central fixation stimulus that was identical to the target stimulus. After the fixation interval (500–800 ms), a concentric array of one target and seven distracter stimuli (see Fig. 7A) appeared simultaneously with the disappearance of the fixation stimulus. Monkeys were allowed 500 ms to make their first fixation on the array. If the target was successfully foveated on this first fixation, the animals received a maximal liquid reward amount along with a reinforcement tone. If the first fixation was on a distracter stimulus, animals were allowed an additional 2000 ms to fixate the target, and a minimal amount (<0.33 of the maximal amount) of liquid reward was delivered along with the reinforcement tone. Stimuli were conjunctions of a color (red or green) and a shape (circle or square). One stimulus was placed in the RF of the neuron, whereas all other stimuli were positioned equidistantly from it and each other.
Concentric detection task.
A detection task preceded each concentric conjunction search session to familiarize monkeys with the search target of the session, as well as to delineate the extent of the RF of the neuron. The task proceeded as in the concentric conjunction search task, but only the target appeared with the disappearance of the fixation cue.
Delayed saccade task.
In all recording sessions, the responses of neurons were first characterized while monkeys performed a delayed saccade task to temporally dissociate visual stimulation from saccade execution (Paré and Wurtz, 2001). Data collected in this task were used to calculate the visual onset latencies of our neuronal sample and to describe the visuo-movement properties of each neuron (Table 1). This task served to additionally characterize the extent of the RF of each neuron.
Table 1.
Search | n | Visual onset latency (ms) | vis (spikes/s) | Delay activity (spikes/s) | mov (spikes/s) | VMI |
---|---|---|---|---|---|---|
Grid | 32 (monkey M, 24; monkey F, 8) | 41 ± 1 (32–57) | 50 ± 7 (1–171) | 23 ± 5 (0–112) | 288 ± 28 (43–651) | −0.64 ± 0.05 (−0.99 to 0.01) |
Concentric | 20 (monkey G, 9; monkey H, 11) | 42 ± 2 (30–63) | 67 ± 11 (14- 231) | 38 ± 7 (1–123) | 359 ± 41 (42–625) | −0.64 ± 0.06 (−0.90 to −0.03) |
Visual onset latency, delay period activity, vis, mov, and VMI are as described in Materials and Methods [mean ± SEM (range)].
Data analysis
Methods for neuronal data analysis have been described previously (Thompson et al., 1996; Shen and Paré, 2007; Thomas and Paré, 2007). Briefly, neuronal activity was quantified as continuously varying spike density functions aligned on the onset of the visual display presentation (stimulus-aligned), fixations after the first saccade (fixation-aligned), or saccades (saccade-aligned). Spike density functions were constructed by convolving spike trains with a combination of growth (1 ms time constant) and decay (20 ms time constant) exponential functions that resembled a postsynaptic potential (Thompson et al., 1996). To be included in any analysis, neurons had to contribute at least five trials in each condition of interest.
Visual onset latency (Table 1) was taken as the time at which the activation function reached a threshold (meanbaseline + 2 × SDbaseline) and remained above that threshold for 50 consecutive ms after stimulus onset in the delayed saccade task. Baseline activity was computed as the average activity during the last 100 ms before stimulus onset, and updated every millisecond thereafter if the activity remained below the currently determined threshold. For neurons having little to no baseline activity (<5 spikes/s), thresholds were arbitrarily set at 5 spikes/s.
The extent of the RF of each neuron was determined using detection tasks to ensure that only one stimulus in the search display yielded a maximum visual response. Visual responses in these tasks were computed as the mean response during a 25 ms epoch after visual response onset. In the grid detection task, all stimulus positions outside of what was considered the RF position yielded visual responses <50% of the maximal RF response. Figure 3C shows the visual tuning properties of an example neuron during this task. Figure 7C shows the visual tuning properties of an example neuron during the concentric detection task.
We used signal detection theory to quantify the process of saccade target selection in our search task (Thompson et al., 1996; Shen and Paré, 2007; Thomas and Paré, 2007). This method estimates how well an ideal observer of SC activity can discriminate between the activity related to a target falling in the RF of the neuron and that related to a distracter in the RF of the neuron. Receiver operating characteristic (ROC) curves were calculated during successive 5 ms intervals, and the area under each of those ROC curves was then plotted as a function of time and fit with a Weibull function to describe the time course of saccade target selection. The point at which the function reached a criterion value of 0.75 was taken as the discrimination time (DT) of the neuron, and the upper asymptote of the function was taken as the discrimination magnitude (DM) of the neuron. Best-fit functions were calculated only with activity occurring before the initiation of saccades, and calculations were terminated once there were fewer than five target or distracter trials. We also conducted successive rank-sum tests on the same 5 ms intervals used for the ROC analysis to determine whether target- and distracter-related activity were significantly different from each other (Thomas and Paré, 2007). For both the rank-sum tests and the ROC analysis, the ranges of saccade latencies in target and distracter trials were matched. The minimum saccade latency in response to display onset was fixed to 90 ms. Fixations were only included if the preceding saccades were made away from the RF (five opposing directions) of the neuron.
To calculate DT/response time (RT) slopes, we divided the trials for each neuron into short and long RT groups and determined the DT for each group. We did this for both single- and multiple-fixation trials. We then determined the slope of the relationship between DT and RT. A slope of 1 would indicate that DT is predictive of RT and that the trial-to-trial variability in RT may be related to differences in visual processing rather than movement programming. Alternatively, a slope of 0 indicates that DT is fixed despite variability in RT and that trial-to-trial variability in RT may be more related to differences in movement programming (Thompson et al., 1996; Shen et al., 2011).
All 52 neurons were recorded within the SC intermediate layers and identified as visuo-movement neurons using data collected in the delayed saccade task. Their activity before saccades (last 100 ms before saccade onset; Table 1) was significantly greater (rank-sum test, p < 0.001) than the corresponding delay activity (last 300 ms epoch before fixation stimulus disappearance), and they had reliable visually evoked responses within 100 ms of display onset. We calculated a visuo-movement index (VMI; Shen et al., 2011) with data collected in the delayed saccade task (Table 1). This index quantifies the relative magnitude of visually evoked and saccade-related activity of each neuron: VMI = (vis − mov)/(vis + mov), where vis is the mean discharge rate over the first 100 ms after stimulus presentation, and mov is the peak discharge rate within ±40 ms of saccade onset. Neurons with stronger visually evoked activity have VMIs closer to 1.0, and those with stronger saccade related activity have VMIs closer to −1.0.
Results
We collected data from four monkeys over 40 sessions of the grid conjunction search task for a total of 27,747 trials. Monkeys acquired the search target successfully with a single fixation in only 29.7% of trials. Because the fixation stimulus could randomly appear in any of the 16 locations within the grid, the distance between the target and fixation stimuli could vary. As a result, the probability of acquiring the target after a single fixation was significantly greater when the target was located adjacent to the fixation stimulus than when it was further eccentric (0.31 ± 0.01 vs 0.26 ± 0.01; p < 0.01, t test). These results suggest that stimuli were discriminable and that discriminability decreased somewhat with eccentricity. On average, the monkeys needed 2.7 ± 0.26 (range, 1–14) fixations per trial to acquire the target. This was significantly less than that expected by random scanning (2.7 vs 7.5; p < 0.0001, t test), which suggests that search was guided by the visual array composition, i.e., monkeys did not adopt an idiosyncratic, systematic scanning strategy.
The time it took to respond to the initial presentation of the visual search display (RT) was not different for trials in which monkeys made a single fixation compared with trials in which they made a sequence of multiple fixations (188 ± 3 vs 184 ± 2 ms; p = 0.23, t test). RT in multiple-fixation trials was significantly longer than the durations of the subsequent fixations in all four animals (Fig. 2A; p < 0.001, t test). Figure 2B shows how, in a single session for one animal (monkey F), this effect was attributable to a proportion (25%) of fixation durations that were shorter than the shortest RT. This was also the case for the other monkeys: on average, 26 ± 2% (monkey F, 36%; monkey M, 19%; monkey G, 32%; monkey H, 45%) of subsequent fixations had durations shorter than the shortest initial RT, which was on average 116 ± 2 ms (Fig. 2C). These behavioral data suggest that the process of saccade target selection can occur in parallel for at least two saccade targets. Moreover, fixation durations decreased significantly as a function of fixation number for trials having one to four fixations before target foveation (Fig. 2D; all p < 0.05, one-way ANOVA). A post hoc Bonferroni's pairwise analysis revealed that the final fixation just before target foveation was significantly shorter (p < 0.05) than all previous fixation durations in all four trial types. These data additionally suggest that information about target location accumulated over the course of each trial.
To determine whether this parallel selection occurs at the level of the visual salience map, we simultaneously recorded the activity of 32 single SC visuo-movement neurons in two monkeys while they performed the grid conjunction search task. We quantified the process of saccade target selection using signal detection theory (see Materials and Methods) and determined the discrimination probability over time with data aligned to stimulus (search display) onset. All neurons discriminated the target from distracters before saccades were made. On average, DT was 139 ± 3 ms (range, 106–174 ms) after the onset of the search display, and DM was 0.94 ± 0.01 (range, 0.79–1) for trials in which the target was acquired with a single fixation. Using successive rank-sum tests to determine DT produced similar results: all neurons had significantly greater target than distracter activity (p < 0.05) before saccades, with DT occurring on average 130 ± 4 ms (range, 97–187 ms) after stimulus onset.
To examine the process of saccade target selection during subsequent fixations, we compared the above single-fixation trials to ones in which the monkey made multiple fixations before acquiring the target (Fig. 3A,B, insets). Figure 3 illustrates this comparison for one example neuron. In single-fixation trials, this neuron discriminated the target from distracters 126 ms after the onset of the search display (Fig. 3A). In multiple-fixation trials, DT was 98 ms after the onset of subsequent fixations (Fig. 3B). DMs in both single- and multiple-fixation trials were similar and near perfect (∼0.9). Figure 4 depicts the normalized activity profile during single- and multiple-fixation trials for all neurons. The increase in normalized firing rate for target-related activity occurred much earlier in multiple- than single-fixation trials (−40 vs 90 ms). Interestingly, the increase in activity around fixation onset in multiple-fixation trials occurs only for the stimulus that will become the next saccade target. This was also the case for the individual activity profiles of all neurons.
We quantified the saccade target selection process for the population of neurons we recorded by calculating the mean ± SE discrimination probability over time for both single- and multiple-fixation trials. The discrimination probability was already significantly greater at fixation onset for multiple-fixation trials and remained that way until 145 ms after stimulus/fixation onset (Fig. 5A; p < 0.05, t tests), suggesting that the selection of a saccade target that was part of a sequence occurred earlier than the selection of a single-saccade target. On average, DT was significantly shorter in multiple-fixation trials (76 ± 6 ms) than in single-fixation trials (139 ± 3 ms; p < 0.001, paired t test). Discrimination probabilities were not different between single- and multiple-fixation trials 60–100 ms before stimulus appearance (Fig. 5A, open circles; p > 0.05, t test) and were also not different from chance (p > 0.05), indicating that the early selection of a subsequent saccade target was not attributable to any differences in neuronal activity occurring during the previous fixation.
With reference to the visual afferent delay, discrimination occurred only 34 ± 6 ms (range, −51 to 101) after the onset of visually evoked responses (Fig. 5B), whereas it occurred nearly 100 ms later in single-fixation trials (98 ± 3 ms; range, 66–133). These findings were additionally supported by a temporal analysis of target and distracter activity. In single-fixation trials, the earliest time that target activity was significantly greater than distracter activity was 87 ms after the display onset (Fig. 5A, bottom). However, in multiple-fixation trials, more than half (18 of 32, 56%) of the neurons had significantly greater (p < 0.05, rank-sum test) target activity before the onset of their visually evoked responses; greater activity was also observed before fixation onset in 16% (5 of 32) of neurons.
It is possible that the shift toward an earlier selection process in multiple-fixation trials was driven mainly by fixations of very short duration (i.e., 50–100 ms). The earlier onset of movement-related activity in these cases could have contributed to the observed shift to an earlier DT, regardless of the target selection process, when the data were aligned to fixation onset. To account for any differences in saccade timing between the single- and multiple-fixation trials, we performed the analysis again with the data aligned to the onset of saccades. We found that discrimination probabilities were still significantly greater in multiple-fixation trials up to 40 ms before saccades (Fig. 5C; p < 0.05, t tests). Even when the time of movement onset was accounted for, we found an earlier DT for multiple-fixation trials, suggesting that the predictive signals we observed were related to the selection process rather than purely to the programming of an eye movement.
To test whether our behavioral findings of information accrual (Fig. 2D) was supported by the underlying neural activity, we examined the average number of fixations made in each session and related it to DT calculated from multiple-fixation trials (i.e., calculated using the final fixation in the trial just before target foveation). We found that multiple-fixation DT was earliest for sessions that had more fixations per trial for both fixation-aligned (r = −0.57; p < 0.001, Pearson's correlation) and saccade-aligned (r = −0.35; p < 0.05) data (Fig. 5D). These data suggest that generally more information had accumulated over those sessions with longer trials, resulting in shorter DTs and fixation durations.
That the discrimination probabilities were not different just before saccades suggests that the ability of the neurons to discriminate the target stimulus from distracters did not change between the two conditions. McPeek and Keller (2002a) reported previously that DT in some SC visuo-movement neurons predicted saccade latencies in single-fixation visual search. Our observation that there was a shift to earlier DTs even when data were aligned on saccade onset might have occurred because of a change in this relationship between DT and RT in multiple-fixation trials. To test this hypothesis, we calculated DT/RT slopes for each neuron for both single- and multiple-fixation trials. There were sufficient data (at least five trials per condition) in 24 neurons for this analysis. On average, DT/RT slopes in single-saccade trials were 0.99 ± 0.13 and not significantly different from 1 (p = 0.93, t test), with DT leading RT by 65 ± 4 ms. DT/RT slopes for those same neurons in multiple-fixation trials were also not significantly different from 1 (1.0 ± 0.17; p = 0.98), but the lead time was significantly longer than in single-fixation trials (92 ± 5 ms; p < 0.0001, paired t test). Therefore, DT in multiple-fixation trials was equally predictive of RT, although it occurs earlier.
Concentric conjunction search
To determine whether the predictive selection of saccade targets can be observed in other visual search tasks as well as in other monkeys, we reanalyzed the activity of 20 SC visuo-movement neurons recorded in monkeys G and H during a concentric conjunction search task from our previous report (Shen and Paré, 2007). In this task, monkeys were allowed an extra 2000 ms to acquire the search target if their initial saccade landed on a distracter. On ∼30% of trials, they needed more than one fixation to acquire the target. In these trials, the duration of the first fixation was significantly shorter than the time to respond to display presentation (113 ± 2 vs 155 ± 1 ms; p < 0.0001, t test). We examined SC neuronal activity on trials in which monkeys made a second corrective saccade to the target that had been brought into the RF of the neuron after the first saccade (Fig. 6A; Murthy et al., 2007). We compared these “target” trials to trials in which a distracter fell in the RF after the first saccade and the monkey made a correct second saccade to the target located outside of the new RF position of the neuron. There were sufficient data in 16 neurons for this analysis (monkey G, 8; monkey H, 8). An example neuron, depicted in Figure 6B, shows how DT in multiple-fixation trials was at approximately its visual onset latency (43 ms after search display onset) and occurred much earlier for the selection of the second saccade target than for the first. Across this neuronal sample, discrimination in multiple-fixation trials occurred on average 9 ± 9 ms after the first fixation, whereas it occurred 109 ± 4 ms (p < 0.0001, t test; Fig. 6C) after the onset of the search display in single-fixation trials. Therefore, these data replicate our findings of predictive saccade target selection in SC in a different paradigm with different animals.
In our previous report, we also showed how the target representation is enhanced even when it is not selected for the first saccade, i.e., SC neuronal activity signals stimulus identity (Shen and Paré, 2007). Given the evidence presented above for the concurrent selection of saccade targets in SC in our concentric array task, could this enhancement in activity actually be related to the stimulus (i.e., target) identity rather than to the selection of the second saccade target? We examined activity during a 25 ms epoch just before saccades in three conditions: target-related activity in two-fixation trials when the subsequent fixation duration was short and when the fixation duration was long, as well as distracter-related activity in single-fixation trials. We had sufficient data in all 20 neurons to perform this analysis. For the example neuron shown in Figure 7A, we found an enhancement of target-related activity before a subsequent fixation was made to that target. This enhancement was greater for the shortest fixation durations than for longer ones, consistent with previous findings in a feature search task (McPeek and Keller, 2002b). However, the target-related activity in our conjunction search task in the longest fixation duration trials was still greater than distracter-related activity. To quantify these effects across the sample, we calculated a stimulus identity index by normalizing the target-related activity of each neuron in the two conditions (short and long fixation duration) to its distracter-related activity (Fig. 7B). On average, the shortest fixation duration trials had an index greater than that for long-duration trials (1.9 ± 0.1 vs 1.6 ± 0.1; p < 0.05, t test), suggesting that the enhancement in activity was related to the concurrent processing of two saccade targets. However, both target conditions were still significantly >1 (i.e., distracter-related activity; p < 0.0001, t tests), indicating that the signaling of stimulus identity is independent of the preceding fixation duration in the concentric conjunction search task.
Relationship to discharge properties
To determine whether the ability of an individual neuron to select a subsequent saccade target is predicted by its discharge properties, we tested for a relationship between both the DT and DM of a neuron in multiple-fixation trials and its discharge characteristics. Neither DT nor DM was correlated with the position of a neuron along the VMI (p = 0.50 and p = 0.32, respectively, Pearson's correlation). The same was true when DT and DM were tested against the magnitude of the visually evoked responses (p = 0.88 and p = 0.90) and saccade-related activity (p = 0.24 and p = 0.20). Moreover, DT and DM were not predicted by the discharge rate of a neuron during the delay period of the delayed saccade task (p = 0.77 and p = 0.52).
Discussion
We investigated saccade target selection while monkeys executed a multiple-fixation search for a target embedded in a structured search array. Consistent with previous behavioral studies, we found that fixation durations were significantly shorter than the latency of the initial responses to the search array. Accordingly, SC sensorimotor neurons displayed discriminating activity significantly earlier when the target was acquired after several fixations compared with when the target was fixated just after the search array onset. Discriminating activity occurred before the visual afferent delay in more than half of the neuronal sample. These predictive signals existed in addition to the representation of target identity in our conjunction search task. Our data demonstrate how neural representations on the visual salience map are not processed one fixation at a time and how advanced saccade target selection facilitates visual search.
Previous observations that fixation durations are shorter than initial RTs during visual search tasks were interpreted with the view that the processing of at least two saccades can occur in parallel (Viviani and Swensson, 1982; Hooge and Erkelens, 1996; Theeuwes et al., 1998; McPeek et al., 2000; McPeek and Keller, 2001; see also Sommer, 1997; Findlay et al., 2001; Godijn and Theeuwes, 2002). Moreover, visual information presented before the first saccade has been shown to be correlated with the outcome of the second saccade, suggesting that information accumulates for the second saccade before the onset of the first saccade (Caspi et al., 2004). Indeed, the processing of visual information by temporal cortical neurons has been shown to begin before the fixation of the target in a complex search task (Sheinberg and Logothetis, 2001). Our current findings complement this literature by showing how the neuronal selection of a subsequent saccade target can occur before it is brought into the RF of a neuron. This parallel selection of saccade targets is accompanied by a process in which visual information is accumulated across saccades, as evidenced by earlier DTs in sessions with more saccades and shorter fixation durations just before target foveation. These findings suggest that, with each fixation, the evidence for the target location grows as the visual salience map is updated.
A substantial body of literature has described how the selection of the first saccade goal in response to a search display occurs across a network of structures that form a distributed visual salience map. These previous visual search studies have established how neurons in area LIP (Ipata et al., 2006; Thomas and Paré, 2007), FEF (Thompson et al., 1996, 2005; Sato et al., 2001), and SC (Ottes et al., 1987; McPeek and Keller, 2002a; Shen and Paré, 2007; Shen et al., 2011) select the saccade goal in advance of saccades. Neurons are consistently reported to have indiscriminate visual responses to both the target and distracter that occur with a latency of 40–60 ms (but see Buschman and Miller, 2007; Premereur et al., 2011). The target-related activity then increases while the distracter-related activity does not, resulting in reliable discriminating activity after ∼100–150 ms (Schall et al., 2007; Shen et al., 2011). This DT corresponds to ∼50–70 ms before saccades.
Only a few studies have extended this approach to examine the saccade selection process when there is more than a single saccade. McPeek and Keller (2002b) showed that, in response to the presentation of the search display, SC neurons have increased activity for a target that will be foveated after a second saccade if the intervening fixation is short. This observation suggests that the target representation remained competitive for selection even after an initial erroneous saccade. However, part of this enhancement could also have been attributable to the representation of target identity separate from the saccade goal (Shen and Paré, 2007). Indeed, we showed here using a conjunction search task how even the target-related activity in two-saccade trials having the longest fixation durations was still enhanced compared with distracter-related activity. Other studies examined the activity leading up to the selection of the second saccade rather than the first. For example, Murthy et al. (2007) used a feature search task in which the target occasionally jumped to a new location during saccade planning. On trials in which monkeys made a corrective saccade to the new target location, discriminating activity of FEF neurons was found to occur earlier (∼40–60 ms after fixation onset) than on successful single-saccade trials. Similarly, recent studies using visual search of natural scenes (Phillips and Segraves, 2010) and dense grid displays (Zhou and Desimone, 2011) reported how selective FEF activity occurred around the time of their visually evoked responses (∼50 ms after fixation onset). However, these studies either did not entirely control the placement of the stimulus within the RF of the neuron (Murthy et al., 2007; Zhou and Desimone, 2011) or did not control for the visual stimulation of the neuron across fixations (Phillips and Segraves, 2010). Related to these search data are observations of LIP neurons during a task in which the visual display remained on throughout a session (Gottlieb et al., 1998). Monkeys were trained to shift their eyes so that one of the stable stimuli fell in the RF of a neuron before foveating the cued stimulus. LIP neurons modulated their activity to signal the location of a relevant stimulus in advance of the saccade that brought the stimulus into the RF. The study we present here adds to these previous observations by quantifying the time course of saccade target selection during sequences of fixations and providing evidence for predictive selection in a structure that more closely affects visual behavior because of its direct outputs to the saccade-generating system (Rodgers et al., 2006).
The relationship between the selection of a saccade goal and the programming of the movement itself has been shown to be dissociated in FEF (Thompson et al., 1996), as well as for some neurons in SC (McPeek and Keller, 2002a). Therefore, the DT/RT slopes of these neurons tend toward 0, such that saccade latencies cannot be predicted by DTs. In the present study, saccade latencies were predicted by DTs for both single- and multiple-saccade trials, because DT/RT slopes were distributed around 1. We showed previously in a large SC neuronal sample how DT/RT slopes were unimodally distributed around 1 when the search task was unconstrained, suggesting that in natural situations—as in the present multiple-fixation task—visual processing of the next target is not distinct from the selection of the target itself (Shen et al., 2011; see also Ipata et al., 2006; Thomas and Paré, 2007). In the current study, however, the time difference between neuronal discrimination and saccade initiation in these two types of fixations suggests that the process of saccade target selection is distinct from saccade programming. This is consistent with the view that a selection threshold exists separately from the saccade trigger threshold (Shen and Paré, 2007). The observation that the selection of the subsequent saccade target occurred nearly 90 ms in advance of saccade initiation suggests that there is ample time for that saccade to be countermanded. In other words, there is not necessarily a commitment of a saccade program to the selected item.
Predictive neuronal activity for a stimulus about to be brought into the RF has been reported in SC (Walker et al., 1995), FEF (Umeno and Goldberg, 1997), and area LIP (Duhamel et al., 1992). In these studies, a single visual stimulus is displayed, and the monkey is instructed to shift its eyes to a location that will align the RF with the stimulus once the saccade is made. This activity has been interpreted to be a possible neural substrate for perceptual stability, in which visual RFs (and therefore visual stimuli within them) are “remapped” based on an efference copy of the movement program to account for the displacement of those objects once the eyes have moved (Sommer and Wurtz, 2008). However, evidence in favor of this account has been difficult to obtain. Reports of “change blindness” wherein significant changes to a visual scene remain undetected across saccades (Henderson and Hollingworth, 1999) and limited memory for visual objects across saccades (Irwin and Andrews, 1996) put into question whether spatial remapping serves to stabilize visual perception. Observations that saccade targets (i.e., visited locations) are retained across saccades (Beck et al., 2006) and the presence of remapping activity in visual areas within the dorsal but not ventral stream may instead suggest that predictive activity plays a supporting role in action selection rather than perception (Bays and Husain, 2007). Indeed, if the early increase in activity were attributable to the perceptual remapping of the RF, an increase should occur for both target and distracter stimuli. Instead, we observed that the early modulation in activity was associated only with the stimulus that would become the next saccade target (Fig. 4B). This finding is supported by a recent behavioral study that suggested how multiple movement programs, rather than retinotopic perceptual locations, are planned in parallel and updated at the end of each saccade (Quaia et al., 2010). Whether perceptual remapping also occurs cannot be determined from our data because there is a stimulus in the RF of a neuron on every fixation. Even so, its contribution to our task may not be significant because remapping activity has been shown to be diminished when tested in less impoverished environments (Churan et al., 2011). The affect of remapping activity in more naturalistic environments has yet to be fully understood.
We extended our understanding of selection mechanisms beyond the first saccade and showed how selection mechanisms are substantially different for subsequent saccade targets. In addition to an enhanced representation for target identity, the processing of time-varying signals on the visual salience map occurs in parallel to support efficient sequences of saccades.
Footnotes
This research was supported by Canadian Institutes of Health Research Grant MOP 38089 and Queen's University (M.P.). K.S. held a postgraduate scholarship from the Natural Sciences and Engineering Council of Canada.
References
- Bahill AT, Bahill KA, Clark MR, Stark L. Closely spaced saccades. Invest Ophthalmol. 1975;14:317–321. [PubMed] [Google Scholar]
- Baldauf D, Deubel H. Properties of attentional selection during the preparation of sequential saccades. Exp Brain Res. 2008;184:411–425. doi: 10.1007/s00221-007-1114-x. [DOI] [PubMed] [Google Scholar]
- Bays PM, Husain M. Spatial remapping of the visual world across saccades. Neuroreport. 2007;18:1207–1213. doi: 10.1097/WNR.0b013e328244e6c3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beck MR, Peterson MS, Vomela M. Memory for where, but not what, is used during visual search. J Exp Psychol Hum Percept Perform. 2006;32:235–250. doi: 10.1037/0096-1523.32.2.235. [DOI] [PubMed] [Google Scholar]
- Becker W, Jürgens R. An analysis of the saccadic system by means of double step stimuli. Vision Res. 1979;19:967–983. doi: 10.1016/0042-6989(79)90222-0. [DOI] [PubMed] [Google Scholar]
- Bichot NP, Rossi AF, Desimone R. Parallel and serial neural mechanisms for visual search in macaque area V4. Science. 2005;308:529–534. doi: 10.1126/science.1109676. [DOI] [PubMed] [Google Scholar]
- Buschman TJ, Miller EK. Top-down versus bottom-up control of attention in the prefrontal and posterior parietal cortices. Science. 2007;315:1860–1862. doi: 10.1126/science.1138071. [DOI] [PubMed] [Google Scholar]
- Caspi A, Beutter BR, Eckstein MP. The time course of visual information accrual guiding eye movement decisions. Proc Natl Acad Sci U S A. 2004;101:13086–13090. doi: 10.1073/pnas.0305329101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Churan J, Guitton D, Pack CC. Context dependence of receptive field remapping in superior colliculus. J Neurophysiol. 2011;106:1862–1874. doi: 10.1152/jn.00288.2011. [DOI] [PubMed] [Google Scholar]
- De Vries JP, Hooge IT, Verstraten FAJ. Saccades toward the target are planned as sequences rather than as single steps. Psychol Sci. 2014;25:215–223. doi: 10.1177/0956797613497020. [DOI] [PubMed] [Google Scholar]
- Duhamel JR, Colby CL, Goldberg ME. The updating of the representation of visual space in parietal cortex by intended eye movements. Science. 1992;255:90–92. doi: 10.1126/science.1553535. [DOI] [PubMed] [Google Scholar]
- Findlay JM, Brown V, Gilchrist ID. Saccade target selection in visual search: the effect of information from the previous fixation. Vision Res. 2001;41:87–95. doi: 10.1016/S0042-6989(00)00236-4. [DOI] [PubMed] [Google Scholar]
- Gersch TM, Kowler E, Schnitzer BS, Dosher BA. Attention during sequences of saccades along marked and memorized paths. Vision Res. 2009;49:1256–1266. doi: 10.1016/j.visres.2007.10.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Godijn R, Theeuwes J. Programming of endogenous and exogenous saccades: evidence for a competitive integration model. J Exp Psychol Hum Percept Perform. 2002;28:1039–1054. doi: 10.1037/0096-1523.28.5.1039. [DOI] [PubMed] [Google Scholar]
- Godijn R, Theeuwes J. Parallel allocation of attention prior to the execution of saccade sequences. J Exp Psychol Hum Percept Perform. 2003;29:882–896. doi: 10.1037/0096-1523.29.5.882. [DOI] [PubMed] [Google Scholar]
- Gottlieb JP, Kusunoki M, Goldberg ME. The representation of visual salience in monkey parietal cortex. Nature. 1998;391:481–484. doi: 10.1038/35135. [DOI] [PubMed] [Google Scholar]
- Henderson JM. Human gaze control during real-world scene perception. Trends Cogn Sci. 2003;7:498–504. doi: 10.1016/j.tics.2003.09.006. [DOI] [PubMed] [Google Scholar]
- Henderson JM, Hollingworth A. High-level scene perception. Annu Rev Psychol. 1999;50:243–271. doi: 10.1146/annurev.psych.50.1.243. [DOI] [PubMed] [Google Scholar]
- Hooge IT, Erkelens CJ. Control of fixation duration in a simple search task. Percept Psychophys. 1996;58:969–976. doi: 10.3758/BF03206825. [DOI] [PubMed] [Google Scholar]
- Ipata AE, Gee AL, Goldberg ME, Bisley JW. Activity in the lateral intraparietal area predicts the goal and latency of saccades in a free-viewing visual search task. J Neurosci. 2006;26:3656–3661. doi: 10.1523/JNEUROSCI.5074-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Irwin DE, Andrews RV. Integration and accumulation of information across saccadic eye movements. In: Inui T, McClelland JL, editors. Attention and performance XVI: information integration in perception and communication. Cambridge, MA: Massachusetts Institute of Technology; 1996. pp. 125–156. [Google Scholar]
- Kotowicz A, Rutishauser U, Koch C. Time course of target recognition in visual search. Front Hum Neurosci. 2010;4:31. doi: 10.3389/fnhum.2010.00031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McPeek RM, Keller EL. Short-term priming, concurrent processing, and saccade curvature during a target selection task in the monkey. Vision Res. 2001;41:785–800. doi: 10.1016/S0042-6989(00)00287-X. [DOI] [PubMed] [Google Scholar]
- McPeek RM, Keller EL. Saccade target selection in the superior colliculus during a visual search task. J Neurophysiol. 2002a;88:2019–2034. doi: 10.1152/jn.2002.88.4.2019. [DOI] [PubMed] [Google Scholar]
- McPeek RM, Keller EL. Superior colliculus activity related to concurrent processing of saccade goals in a visual search task. J Neurophysiol. 2002b;87:1805–1815. doi: 10.1152/jn.00501.2001. [DOI] [PubMed] [Google Scholar]
- McPeek RM, Skavenski AA, Nakayama K. Concurrent processing of saccades in visual search. Vision Res. 2000;40:2499–2516. doi: 10.1016/S0042-6989(00)00102-4. [DOI] [PubMed] [Google Scholar]
- Motter BC, Belky EJ. The guidance of eye movements during active visual search. Vision Res. 1998;38:1805–1815. doi: 10.1016/S0042-6989(97)00349-0. [DOI] [PubMed] [Google Scholar]
- Murthy A, Ray S, Shorter SM, Priddy EG, Schall JD, Thompson KG. Frontal eye field contributions to rapid corrective saccades. J Neurophysiol. 2007;97:1457–1469. doi: 10.1152/jn.00433.2006. [DOI] [PubMed] [Google Scholar]
- Nothdurft HC, Pigarev IN, Kastner S. Overt and covert visual search in primates: reaction times and gaze shift strategies. J Integr Neurosci. 2009;8:137–174. doi: 10.1142/S0219635209002101. [DOI] [PubMed] [Google Scholar]
- Ottes FP, Van Gisbergen JA, Eggermont JJ. Collicular involvement in a saccadic colour discrimination task. Exp Brain Res. 1987;66:465–478. doi: 10.1007/BF00270679. [DOI] [PubMed] [Google Scholar]
- Paré M, Wurtz RH. Progression in neuronal processing for saccadic eye movements from parietal cortex area lip to superior colliculus. J Neurophysiol. 2001;85:2545–2562. doi: 10.1152/jn.2001.85.6.2545. [DOI] [PubMed] [Google Scholar]
- Phillips AN, Segraves MA. Predictive activity in macaque frontal eye field neurons during natural scene searching. J Neurophysiol. 2010;103:1238–1252. doi: 10.1152/jn.00776.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Premereur E, Vanduffel W, Janssen P. Functional heterogeneity of macaque lateral intraparietal neurons. J Neurosci. 2011;31:12307–12317. doi: 10.1523/JNEUROSCI.2241-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quaia C, Joiner WM, Fitzgibbon EJ, Optican LM, Smith MA. Eye movement sequence generation in humans: motor or goal updating? J Vis. 2010;10:28. doi: 10.1167/10.14.28. pii. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rayner K. Eye movements in reading and information processing: 20 years of research. Psychol Bull. 1998;124:372–422. doi: 10.1037/0033-2909.124.3.372. [DOI] [PubMed] [Google Scholar]
- Rodgers CK, Munoz DP, Scott SH, Paré M. Discharge properties of monkey tectoreticular neurons. J Neurophysiol. 2006;95:3502–3511. doi: 10.1152/jn.00908.2005. [DOI] [PubMed] [Google Scholar]
- Sato T, Murthy A, Thompson KG, Schall JD. Search efficiency but not response interference affects visual selection in frontal eye field. Neuron. 2001;30:583–591. doi: 10.1016/S0896-6273(01)00304-X. [DOI] [PubMed] [Google Scholar]
- Schall JD, Paré M, Woodman GF. Comment on “Top-down versus bottom-up control of attention in the prefrontal and posterior parietal cortices.”. Science. 2007;318:44. doi: 10.1126/science.1144865. author reply 44. [DOI] [PubMed] [Google Scholar]
- Sheinberg DL, Logothetis NK. Noticing familiar objects in real world scenes: the role of temporal cortical neurons in natural vision. J Neurosci. 2001;21:1340–1350. doi: 10.1523/JNEUROSCI.21-04-01340.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shen K, Paré M. Guidance of eye movements during visual conjunction search: local and global contextual effects on target discriminability. J Neurophysiol. 2006;95:2845–2855. doi: 10.1152/jn.00898.2005. [DOI] [PubMed] [Google Scholar]
- Shen K, Paré M. Neuronal activity in superior colliculus signals both stimulus identity and saccade goals during visual conjunction search. J Vis. 2007;7 doi: 10.1167/7.5.15. 15.1–13. [DOI] [PubMed] [Google Scholar]
- Shen K, Valero J, Day GS, Paré M. Investigating the role of the superior colliculus in active vision with the visual search paradigm. Eur J Neurosci. 2011;33:2003–2016. doi: 10.1111/j.1460-9568.2011.07722.x. [DOI] [PubMed] [Google Scholar]
- Sommer MA. The spatial relationship between scanning saccades and express saccades. Vision Res. 1997;37:2745–2756. doi: 10.1016/S0042-6989(97)00078-3. [DOI] [PubMed] [Google Scholar]
- Sommer MA, Wurtz RH. Brain circuits for the internal monitoring of movements. Annu Rev Neurosci. 2008;31:317–338. doi: 10.1146/annurev.neuro.31.060407.125627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Theeuwes J, Kramer AF, Hahn S, Irwin DE. Our eyes do not always go where we want them to go: capture of the eyes by new objects. Psychol Sci. 1998;9:379–395. doi: 10.1111/1467-9280.00071. [DOI] [Google Scholar]
- Thomas NW, Paré M. Temporal processing of saccade targets in parietal cortex area LIP during visual search. J Neurophysiol. 2007;97:942–947. doi: 10.1152/jn.00413.2006. [DOI] [PubMed] [Google Scholar]
- Thompson KG, Hanes DP, Bichot NP, Schall JD. Perceptual and motor processing stages identified in the activity of macaque frontal eye field neurons during visual search. J Neurophysiol. 1996;76:4040–4055. doi: 10.1152/jn.1996.76.6.4040. [DOI] [PubMed] [Google Scholar]
- Thompson KG, Bichot NP, Sato TR. Frontal eye field activity before visual search errors reveals the integration of bottom-up and top-down salience. J Neurophysiol. 2005;93:337–351. doi: 10.1152/jn.00330.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Umeno MM, Goldberg ME. Spatial processing in the monkey frontal eye field. I. Predictive visual responses. J Neurophysiol. 1997;78:1373–1383. doi: 10.1152/jn.1997.78.3.1373. [DOI] [PubMed] [Google Scholar]
- Van Essen DC, Newsome WT, Maunsell JH. The visual field representation in striate cortex of the macaque monkey: asymmetries, anisotropies, and individual variability. Vision Res. 1984;24:429–448. doi: 10.1016/0042-6989(84)90041-5. [DOI] [PubMed] [Google Scholar]
- Van Loon EM, Hooge IT, Van den Berg AV. The timing of sequences of saccades in visual search. Proc Biol Sci. 2002;269:1571–1579. doi: 10.1098/rspb.2002.2062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Viviani P, Swensson RG. Saccadic eye movements to peripherally discriminated visual targets. J Exp Psychol Hum Percept Perform. 1982;8:113–126. doi: 10.1037/0096-1523.8.1.113. [DOI] [PubMed] [Google Scholar]
- Walker MF, Fitzgibbon EJ, Goldberg ME. Neurons in the monkey superior colliculus predict the visual result of impending saccadic eye movements. J Neurophysiol. 1995;73:1988–2003. doi: 10.1152/jn.1995.73.5.1988. [DOI] [PubMed] [Google Scholar]
- Zhou H, Desimone R. Feature-based attention in the frontal eye field and area V4 during visual search. Neuron. 2011;70:1205–1217. doi: 10.1016/j.neuron.2011.04.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zingale CM, Kowler E. Planning sequences of saccades. Vision Res. 1987;27:1327–1341. doi: 10.1016/0042-6989(87)90210-0. [DOI] [PubMed] [Google Scholar]