Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Nov 29.
Published in final edited form as: J Neurosci. 2013 May 29;33(22):9273–9282. doi: 10.1523/JNEUROSCI.0239-13.2013

Attending multiple items decreases the selectivity of population responses in human primary visual cortex

David E Anderson 1, Edward F Ester 2, John T Serences 2,3, Edward Awh 1
PMCID: PMC3710664  NIHMSID: NIHMS486192  PMID: 23719796

Abstract

Multiple studies have documented an inverse relationship between the number of to-be-attended or remembered items in a display (“set size”) and task performance. The neural source of this decline in cognitive performance is currently under debate. Here, we used a combination of fMRI and a forward encoding model of orientation-selectivity to generate population tuning functions for each of two stimuli while human observers attended either one or both items. We observed: (1) clear population tuning functions for the attended item(s) that peaked at the stimulus orientation and decreased monotonically as the angular distance from this orientation increased, (2) a set-size dependent decline in the relative precision of orientation-specific population responses, such that attending two items yielded a decline in selectivity of the population tuning function for each item, (3) that the magnitude of the loss of precision in population tuning functions predicted individual differences in the behavioral cost of attending an additional item. These findings demonstrate that attending multiple items degrades the precision of perceptual representations for the target items, and provides a straightforward account for the associated impairments in visually-guided behavior.

Keywords: Attention, Encoding Model, fMRI, Perception, Primary Visual Cortex


Multiple studies have documented an inverse relationship between the number of to-be-attended or remembered items in a display (“set size”) and task performance (Geisler and Chou, 1995; Palmer 1990; Palmer, Ames and Lindsey, 1993; Zhang and Luck, 2008). Two different explanations of these effects have been offered. Perceptual coding models posit that perceptual quality is constrained by the distribution of a finite neural resource that must be spread more thinly when set size increases (Posner 1980; Palmer et al. 2011; Wolfe et al. 2001; Bundesen, 1990). By contrast, decision integration models posit that the internal noise associated with each perceptual representation is independent of set size (Palmer 1995; Eckstein, Thomas, Palmer, and Shimozaki 2000; Palmer and McClean 1995; Verghese 2001), and that declines in performance reflect increasing decision noise.

Recently, Pestilli and colleagues (2011) attempted to discriminate between these models by measuring blood-oxygenation-level-dependent (BOLD) responses in visual areas during a contrast-discrimination task. Although behavioral performance was better – and BOLD responses were larger – during attend-one relative to attend-four trials, quantitative modeling indicated that attention-related changes in behavior could not be solely explained by changes in the quality of stimulus representations in visual cortex (e.g., changes in the mean or variance of BOLD responses across trials). Instead, they favored a decision integration model where neural responses are subjected to weighted pooling and a “max” rule is applied to identify the target. Two points motivate a re-examination of this question. First, the contrast-discrimination task may be conducive to grouping strategies that could minimize set size effects. Second, Pestilli et al. employed a univariate analytic approach that is not always sensitive to qualitative changes in neural activity (e.g., Harrison and Tong, 2009; Serences et al., 2009; Serences and Saproo, 2012).

Thus, we re-examined these alternative accounts by examining set-size dependent changes in the relative precision of orientation-specific population responses measured via fMRI. We used a forward encoding model of orientation selectivity to reconstruct orientation-specific response profiles -- channel tuning functions (CTFs) -- while observers attended one or two peripheral targets (Brouwer and Heeger, 2011; Serences and Saproo, 2012) and performed a challenging visual discrimination that discouraged perceptual grouping. According to perceptual coding models, CTF precision should decline as set size rises from one to two because two attended representations would each receive a smaller proportion of a finite resource. Alternatively, decision integration models predict no qualitative difference in CTF precision because the internal noise of each representation is independent of set size and load-dependent effects reflect noise in decision-related processes.

To preview our findings, CTF precision was reliably lower when more items were attended. Moreover, the reduction in the precision of the population response profiles was strongly correlated with associated reductions in behavioral performance, showing that reductions in the precision of population responses provides a compelling explanation of declining behavioral performance. Thus, although decision processes may also be challenged by increases in set size, attending additional items can also have consequences for the quality of perceptual codes.

Materials and Methods

Subjects

Fourteen neurologically intact human subjects (6 females) were tested in a single 2.5 hour session. All subjects self-reported normal or corrected-to-normal visual acuity and gave both written and oral informed consent. Prior to scanning, each subject completed a single 1-hour behavioral training session. Compensation for participation was $10/hr for behavioral training and $25/hr for scanning.

Stimuli and apparatus

Visual stimuli were generated using the Psychophysics Toolbox (Brainard 1997; Pelli 1997) implemented in MATLAB (MathWorks, Natick, MA). Stimuli were rendered in white on a black background via a rear projection system. Button press responses were made on an fMRI-compatible response box using the index and middle fingers of the right hand. Subjects were instructed to hold fixation on a small central diamond (subtending .18° from viewing distance of 58 cm) throughout each scan (eye position data was continuously collected to ensure compliance with these instructions; see Eye Tracking, below).

Experimental Task

The sequence of events on each trial is presented in Figure 1. Each trial began with the presentation of two full-contrast square-wave grating stimuli (radius of 4°, 2.67 cycles/°) in the upper left and right quadrants of the display (horizontal and vertical eccentricity of ±7° and +5°, respectively). To elicit a strong, stimulus-specific BOLD response, we presented full-contrast stimuli on a black background, thus introducing both a contrast onset and a luminance increment for each stimulus presentation. Each stimulus was randomly assigned to one of two orientation values from distributions spanning 5–162.5° or 15–172.5° (for the left and right stimuli, respectively; both orientation distributions increased in steps of 22.5°) for a total of 8 unique orientations in each distribution; the distributions were offset from one another to reduce the likelihood of perceptual grouping of identical or similarly oriented gratings. The two gratings flickered at a rate of 3 Hz (166.7 ms on, 166.7 ms off) for a total of 5 s, and the spatial phase of both stimuli was randomized on each cycle. A fixation cue was highlighted on either the left, right, or both sides (red or green, counterbalanced across subjects; see Figure 1) to instruct observers whether to attend the left, right, or both gratings in order to detect brief clockwise or counterclockwise displacements in orientation. Subjects were instructed to press one of two buttons to distinguish whether a clockwise or counterclockwise displacement was detected. Each trial included two target events that were randomly distributed among cued items across time. During set size 1 trials - when only one stimulus was attended - no rotations occurred on the irrelevant stimulus; during set size 2 trials- when both stimuli were attended - targets were distributed randomly so that rotations occurred either only in the left, only in the right, or across both stimuli (balanced across trials). Each trial was followed by a randomly jittered ITI (5, 6, 7, or 8 s).

Figure 1. Behavioral Task.

Figure 1

Subjects were instructed to attend the item indicated by the central cue. During set size 1 trials, either the left or right side of the cue was shaded green, indicating that subjects were to attend only the left or right grating, respectively; during set size 2 trials, both sides of the cue were shaded. Subjects were instructed to detect angular deviations in an attended item and to indicate its direction (counterclockwise or clockwise). Stimuli oscillated (3 Hz) for 5 s, followed by a randomly jittered ITI.

Stimulus orientation and set size (one or two) were balanced within each scan (where “scan” refers to a continuous, 370-s long block of 32 trials). Orientation differences between the two stimuli were fully counterbalanced across a total of eight consecutive scans, resulting in 64 total possible orientation pairings for each condition. For set size 1 trials (16 trials per block), there were an equal number of attend left and attend right trials; for set size 2 trials (16 trials per block), each trial was an attend-both trial.

Staircase Procedure

To ensure that the task was sufficiently challenging, we adjusted task difficulty by computing orientation rotation thresholds (defined as the minimum rotation magnitude yielding 75% correct performance) for each subject in a separate behavioral testing session (completed 1–2 days before the scanning session). The staircase procedure was identical to the experimental task with the only exception being that thresholds were independently modulated for set sizes one and two. Rotation thresholds for set size one trials were used to set the target disparities for both set sizes during the scan session. This enabled a clear behavioral measure of the cost of attending an additional item.

Eye Tracking

To assess compliance with fixation instructions, eye position data was obtained via an ASL 5000 infrared tracking system. Recording was performed at 60 Hz, and data were filtered for blinks and corrected for linear drift offline. For each trial, we identified all stable fixations (defined as a 200 ms epoch during which eye position did not deviate >.25°) during a period extending the entire trial epoch. The x and y positions of fixations made during the trial were saved for offline analyses.

fMRI data acquisition and analysis

fMRI data were collected using a 3T Siemens Allegra system at the Robert and Beverly Lewis Center for Neuroimaging at the University of Oregon. Anatomical images were acquired using a spoiled-gradient-recalled T1-weighted sequence that yielded images with a 1 mm3 resolution. Whole-brain echo-planar images (EPIs) were acquired in 33 transverse slices (3 mm2 in-plane resolution, 2000 ms repetition time, 30 ms echo time, 90° flip angle, 64 × 64 matrix 192 mm field of view, 3.5 mm slice thickness, no gap). EPIs were slice-time corrected, motion corrected (both within and between scans), and high-pass filtered (3 cycles/run). Image preprocessing and data analysis were performed using BrainVoyagerQX (version 1.9; Brain Innovation) and custom time-series and custom routines written in MATLAB (version 2010a; MathWorks).

Retinotopic mapping

Retinotopic mapping data were acquired using a rotating checkerboard wedge flickering at 8 Hz and subtending 45° of polar angle (following standard procedures described in detail elsewhere; e.g., Engel et al. 1994; Sereno et al. 1995). Each participant completed a single scan lasting 480 seconds. This procedure was used to identify retinotopic boundaries of V1. To aid in the visualization of these regions, data were projected onto a computationally inflated representation of each observer’s gray/white matter boundary.

Functional localizer and voxel selection

Each participant also completed one to two scans (15 trials) of a functional localizer task to identify voxels within primary visual cortex that were responsive to the spatial position occupied by the oriented grating stimuli of the primary experiment. Two full-contrast, phase reversing (10 Hz) checkerboard stimuli (radius 4°) that exactly matched the size and spatial position of the oriented grating stimuli of the main task were presented for a total of 10 seconds; participants were required to detect brief (50 ms) reductions in stimulus contrast that occurred simultaneously in both stimuli at unpredictable intervals on each trial. Each trial was followed by a 10 second fixation interval. Visually responsive regions of V1 were identified using a general linear model (GLM) with a single regressor – denoting stimulus presence (i.e. on vs off) – that was constructed by convolving a boxcar model of the stimulus sequence with a gamma function to account for the assumed shape of the hemodynamic response (Boynton et al. 1996). Voxels that showed a stronger response during epochs of stimulation (relative to fixation; thresholded at p < 0.05 using the false discovery rate algorithm provided in BrainVoyagerQX 1.9; The false discovery rate algorithm is a procedure used to prevent multiple comparison problems while avoiding Type II errors prevalent in Bonferroni correction approaches; see Genovese et al., 2002) were used to define functional regions-of-interest (ROIs) in V1.

Multivoxel Pattern Analysis

The purpose of multivoxel pattern analysis (MVPA) and the forward encoding model is to estimate the influence of attending multiple items on orientation selectivity in primary visual cortex. Here, we assumed that each voxel in V1 represents a large population of orientation-selective cells, where a BOLD response within a given voxel reflects the pooled activity across a distribution of cells with dissimilar orientation preferences. Therefore, any observed orientation bias within a given voxel may reflect large-scale feature maps (Freeman et al. 2011; Sasaki et al., 2006) or random anisotropies in the distribution of orientation-selective columns within a voxel (Kamitani and Tong 2005; Swisher et al. 2010). Thus, the BOLD response measured from many of the voxels in V1 exhibit a robust orientation preference (Haynes and Rees 2005; Kamitani and Tong 2005; Serences et al. 2009; Brouwer and Heeger 2011; Freeman et al. 2011).

To examine whether patterns of activation in primary visual cortex contained information about the attended orientation, we first normalized the raw time series from each voxel on a scan-by-scan basis using a z transform. Next, mean BOLD responses were calculated across a time period extending from 4–8 seconds following the onset of the stimulus on each trial, resulting in a single BOLD response value for each trial, and these amplitude estimates were sorted into one of 8 orientation bins. Data from all but one scan were then used to construct a “training” data set that was used to train a linear discriminant function to discriminate between different stimulus conditions separately for each cortical hemisphere. The training set included data from both set sizes. The trained classifier was then used to decode the orientation of the sample stimulus on each trial within the remaining scan (the ‘test’ set). Data could be assigned one of eight orientation labels, so chance discrimination accuracy was 12.5%. Mean responses from each label were removed to spatially normalize the activation patterns. This analysis was iterated using a hold-one-out cross-validation procedure until data from every scan had served as the test set. This analysis was completed for each stimulus for contralateral (e.g. left hemifield stimulus classified in right cortical hemisphere) and ipsilateral (e.g. left hemifield stimulus classified in left cortical hemisphere) hemispheres. For example, to estimate set size 1 classification accuracy in contralateral ROIs, we aggregated single trial classification accuracies from (1) left primary visual cortex during attend right trials (8 trials per block) and (2) right primary visual cortex during attend left trials (8 trials per block); conversely, to estimate set size 2 classification accuracy in contralateral ROIs, we obtained two classification values for each set size 2 trial (i.e. classification in right hemisphere for left stimulus and classification in left hemisphere for right stimulus), and aggregated these values across all set size 2 trials (16 trials per block). Classification accuracies were then averaged across each test set, yielding a single classification accuracy value for each set size and cortical hemisphere (contralateral, ipsilateral).

Forward Encoding Model

To characterize orientation-specific responses in visual cortex, we used a forward encoding model of orientation selectivity. Our approach was similar to one described by Brouwer and Heeger (2009; 2011), and we therefore adopt their terminology and conventions. Briefly, this model assumes that each fMRI voxel in visual cortex samples from a large number of orientation-selective neurons and that the response of any given voxel is proportional to the summed responses of all neurons in that voxel. Thus, one can characterize the orientation selectivity of a given voxel as a weighted sum of N orientation channels, each with an idealized tuning curve. We modeled the response of each voxel using a basis set of 8 half sinusoids (one per sample angle) raised to the 5th power. These functions were chosen to approximate single-unit tuning profiles in primary visual cortex, where the 1/√2 half-bandwidth of orientation-selective cells has been estimated at about 20° (though there is a considerable amount of variability in this estimate, e.g., Schiller et al. 1976; Swindale 1998; Ringach et al. 2002a,b; Gur et al. 2005).

To estimate orientation-selectivity in V1 across variations in the distribution of attention across a variable number of items, we first normalized and sorted the data into training and test sets (see Multivoxel Pattern Analysis for details). Thus, each training set had 224 observations for subjects who underwent 8 runs in the scanner (7 runs in training set × 32 trials). Critically, every run sampled from each orientation equally. Similarly, the test set for these subjects had 32 observations. This analysis was completed for each ROI (left V1, right V1) and each visual field (left, right) across all trials, regardless of target location or set size. This ensured that the estimated weight profiles were unbiased across conditions.

In the first phase of the analysis, data from the training set were used to estimate weights on the hypothetical orientation channels separately for each voxel. Using the terminology of Brouwer and Heeger (2009; 2011), let m be the number of voxels in an ROI, k be the number of hypothetical orientation channels, and n1 and n2 be the number of observations in the training and test sets, respectively. The channel weights (W, m × k) can be derived via least-squares estimation:

W=D1C1T(C1C1T)1

Where D1 (m × n1) is the training set and C1 (k × n1) is the basis set discussed above. In the second phase of the analysis, channel responses (C2, k × n2) were estimated given the weight matrix and responses in the test set (D2, m × n2):

C2=(WTW)1WTD2

The steps of the forward encoding procedure outlined so far were collapsed across each stimulus, because parsing each condition and performing the forward encoding procedure independently for each stimulus condition would have biased estimation of channel weights. Thus, there were an equal number of trials from both set sizes, including the unattended stimulus, in the training set. Following the procedure above, each estimated channel response function was then sorted based on set size, and the columns in C2 were then circularly shifted so that the channel aligned with the stimulus presented on each trial was positioned in the center of orientation space, thereby aligning the estimated channel responses to a common center (i.e., 0°). Channel response functions from both stimuli in set size 2 trials were averaged together to form a single channel response function for set size 2. For example, when estimating contralateral channel response functions, we averaged together left hemisphere channel responses to the right stimulus and right hemisphere channel responses to the left stimulus; similarly, when estimating ipsilateral channel response functions, we averaged together left hemisphere channel responses to the left stimulus and right hemisphere channel responses to the right stimulus. This analysis was repeated iteratively until all scans had served as the test set and the results for each set size were averaged. With respect to a given orientation value, this procedure was repeated across both contralateral and ipsilateral ROIs.

The purpose of this analysis was to reconstruct a separate orientation-selective population response profile for each stimulus, even though there were always two stimuli presented. To this end, each possible orientation value for the left stimulus was paired with all possible orientation values of the right stimulus (across all the scans run for each observer). This allowed us to yoke the analysis to a given stimulus while the channel activity associated with the other stimulus was fully counterbalanced (see Experimental Task). As the results will show, this procedure successfully allowed us to extract orderly tuning functions for each stimulus presented, even though the channel activity for any single trial was influenced by two stimuli.

Note that this procedure makes no assumption regarding the shape (e.g. Gaussian) of voxel-based tuning functions at the level of a single voxel. Instead, this analysis models each voxel’s response as a combination of responses across eight hypothetical orientation-selective channels. Using this approach, any response profile may be observed at the single voxel level. Then, in the testing stage of the analysis, the known weights across all voxels (computed during training) are used to constrain the estimate of the response in each channel given a vector of responses across voxels on each trial in the test set. Thus, this approach is generally more flexible than other approaches that assume a Gaussian distribution of orientation responses within each voxel. However, note also that the Gaussian-like CTF function (see Evaluating population-level feature-selective tuning functions in primary visual cortex), is influenced to by the use of a Gaussian-like basis set, as the basis functions overlap and are thus partially correlated. However, similar results are obtained if we use a basis set of orthogonal delta functions (‘stick’ functions) that are not correlated.

Mutual information analysis

During the MVPA and forward model analyses, we used an information-theoretic quantity known as mutual information (MI) to rank-order voxels within each visual area based on how well their responses in the training set discriminated between different stimulus orientations (following an approach similar to that described by Saproo and Serences 2010, but using the kernel-density estimation software developed by Alex Ihler, freely available at http://www.ics.uci.edu/~ihler/code/kde.html, to estimate probability distributions). Briefly, MI indexes information contained in a voxel’s response distribution about stimulus orientation without making a priori assumptions about the shape of the response distribution. This method ensured that noisy voxels or voxels that did not show orientation selectivity – based only data from the training set – were excluded from the MVPA and forward encoding modeling. Functional ROIs in primary visual cortex were defined using the top 75% of voxels with the highest MI values.

Model Predictions

As shown in Figure 2, perceptual coding and decision integration models make very different predictions regarding the nature of the population response of orientation-selective cells in visual cortex. On the one hand, perceptual coding models posit that the simultaneous encoding of visual items requires the distribution of a finite resource, where the perceptual quality of the stimulus degrades as more items must be encoded at one time (Posner 1980; Palmer et al. 2011; Zhang and Luck 2008; Wolfe et al. 2001; Fisher 1984; Bundesen 1990). According to these models, the number of simultaneously attended items should modulate the quality of the population response, such that a less precise neural representation should be observed when distributing attention across a greater number of items (see Figure 2B). Thus, these models assume that observed set size effects result from a degradation of orientation-selective responses. On the other hand, decision integration models posit that perceptual noise is identical across large variations in set size such that the quality of the stimulus at encoding is independent of set size (Palmer 1995; Eckstein, Thomas, Palmer, and Shimozaki 2000; Wilken and Ma 2004; Palmer and McClean 1995; Verghese 2001). According to this class of models, the quality of the population response is unaffected by the number of items being attended at any given time; rather, noise arising from multiple competing representations during the decision stage influence the ability to accurately apprehend noisy internal representation. Under this view, the probability of a single nontarget representation randomly reaching criterion threshold increases as the number of noisy internal representations increases, resulting in an observed set size effect due to decision-related noise Thus, these models assume no influence of set size on population responses in early visual cortex (Figure 2A).

Figure 2. Model Predictions.

Figure 2

Predicted effect of set size on hypothetical orientation-selective population response profiles for set size 1 (black) or set size 2 (gray) displays. According to the decision integration model (left), the representational quality of each item is independent of set size, leading to no difference in population response profiles. According to the perceptual coding model (right), the quality of each item is determined by a limited neural resource, where the resource is divided more finely among a greater number of items; consequently, this class of model predicts a decline in the selectivity of population responses.

Results

Task Performance – Behavioral Training

In the staircasing procedure, we independently estimated change detection thresholds during the monitoring of one or two flickering gratings. The target event was a change in the orientation of one of the attended gratings. If a subject correctly responded with the direction of orientation change, the magnitude of angular deviation for that condition (set size 1 or 2) decreased; if, on the other hand, a subject responded incorrectly with the direction of orientation change, the magnitude of angular deviation for that condition increased. Thus, we recorded changes in angular deviation thresholds between set sizes 1 and 2.

To estimate discrimination thresholds, we calculated the average angular deviation observed from ten trials after the start of the experiment (Figure 3A; black dots represent individual subject estimates; error bars reflect within-subject error). Consistent with previous studies, we found an effect of set size on detection thresholds (t(13)=3.89, p<.01), where attending two items required a larger angular deviation to reach the same level of performance as when attending one item.

Figure 3. Task Performance.

Figure 3

(A) Performance during the staircasing procedure. Angular deviations were modulated separately for each set size, and detection thresholds were estimated as the angular deviation required to reach 75% detection accuracy. Detection thresholds were significantly lower for set size 1 trials compared to set size 2 trials (p<.01). (B) Task performance during MRI. Angular deviations for both set sizes were yoked to detection thresholds for set size 1 during the staircasing procedure. Accuracy in reporting the direction of angular deviations was higher for set size 1 than set size 2 (p<.01). Error bars represent within-subject confidence intervals.

Task performance - MRI

In the primary task, the magnitude of angular deviations across conditions was yoked to set size one detection thresholds estimated during the staircasing procedure. This method revealed a significant effect of set size on detection accuracy (t(13)=9.47, p<.001; Figure 3B). Thus, this procedure was sensitive to the known performance costs of attending additional items.

Event-related BOLD response

Previous work has reported an effect of attention on overall BOLD response magnitudes in primary visual cortex (Kastner and Ungerleider 2000; Kastner et al. 1998; Ghandi et al. 1999). This observation allows us to set up predictions for the two competing models. According to the perceptual coding model, which posits that set size effects result from a degraded response in early visual areas, we should observe a larger BOLD response within a given ROI when attending one item compared to attending two because fewer attentional resources are allocated to each item with increasing set size. According to the decision integration model, which posits that set size effects arise from competition between multiple noisy representations at the decision stage, we should observe no effect of set size on BOLD activity within a given ROI because unlimited processing capacity allows each item to receive the same amount of resources independent of set size. In line with the predictions of the perceptual coding model, we observed a significant effect of set size (F(1,13)=8.21, p<.05) on the evoked BOLD response (Figure 4) such that the BOLD response was reduced when monitoring two items compared to monitoring one item. This finding mirrors that of Pestilli et al (2011), who found that attending one item evoked a larger BOLD amplitude than when attending four. Furthermore, because the activation of the hemodynamic response requires several seconds to initiate, the hemodynamic response functions indicates no difference between set sizes until peak amplitude is reached, at which point the set size effect is evident; this effect elicits a significant interaction between set size and time was observed (F(10,4)=5.47, p<.01). As a result of this difference around the peak, all subsequent analyses focused on response patterns measured 4–8 s following stimulus onset, which demonstrated a similar set size effect (t(13)=6.44, p<.001). The BOLD response evoked by the unattended stimulus was also reduced relative to set size 1 (t(13)=3.28, p<.01) and set size 2 trials (t(13)=2.35, p<.05). Critically, although an effect of set size was observed in the HRF epoch, this effect did not predict set size effects in behavior (R2=.03; t(13)=.60, p=.28). Here again, this finding echoes that of Pestilli et al., who found that the observed changes in BOLD amplitude were insufficient to explain the observed declines in behavior as set size increased. As our subsequent analyses will suggest, however, the null result we obtained may reflect a limitation in the sensitivity of univariate analyses to qualitative changes in target-related sensory codes.

Figure 4. Event-Related Analysis of BOLD Signal.

Figure 4

The mean BOLD response was estimated from the 75% most selective voxels in primary visual for both set sizes. The solid black window represents the duration of the stimulus display. Data are collapsed across stimulus orientation. BOLD responses were significantly higher for set size 1 trials than set size 2 trials during the main analysis window (gray shaded window; p<.01). BOLD responses were lowest for the unattended stimulus. Error bars represent between-subject confidence intervals.

Multi-Voxel Pattern Classification

Next, we examined whether orientation information could be decoded from the pattern of BOLD-related responses across primary visual cortex. If the quality of a given representation degrades with the addition of multiple attended items as predicted by the perceptual coding model, then we should observe a decline in decoding accuracy for that representation in primary visual cortex. The results of the MVPA analysis on both condition (set size 1 and 2) and hemisphere (contralateral, ipsilateral) are depicted in Figure 5. Classification accuracy declined reliably as set size increased (F(1,13)=8.96, p<.01), but was equivalent between contralateral and ipsilateral hemispheres (p=.98). This latter result corroborates past work showing the spatially-global (i.e., bilateral) character of stimulus-specific activity even when a lateralized stimulus is encoded (Ester et al. 2009; Serences and Boynton 2007). In other words, previous work has shown that during both encoding and storage in WM, lateralized stimuli are represented in both the left and right hemisphere (Jehee, Brady, and Tong, 2011). Thus, in a task like ours with stimuli presented in both the right and left visual fields, we expect to see both a contralateral and an ipsilateral representation of each stimulus, such that the neural representations of these stimuli are interleaved across both sides of visual cortex (and in cortical regions responsive to locations outside of the retinotopic positions associated with the stimuli). All conditions were significantly above chance (p<.01). Critically, left- and right-field stimulus orientations were counterbalanced across trials. Thus, this finding cannot be explained by covariations in stimulus orientation across hemifields. These MVPA results are consistent with the hypothesis that the precision of perceptual representations declines as the number of to-be-attended items increases because the reliability of stimulus decoding concurrently declines. Supporting this inference, we observed a link between declines in stimulus decoding (subtracting classification accuracy for set size 2 from set size 1) and behavioral accuracy, where larger set-size dependent declines in decoding accuracy predicted similar declines in behavioral accuracy (R2=.31, p<.05).

Figure 5. Multi-Voxel Decoding Analysis.

Figure 5

A classification algorithm was trained to recognize orientation-selective patterns from voxels within each V1 ROI, and then inferred the orientation of the attended stimulus. The horizontal dashed line at .125 indicates chance classification accuracy. Above-chance classification accuracy was observed for both set sizes and ROIs (contralateral, ipsilateral). Classification accuracy was higher for set size 1 trials than set size 2 trials, while no difference was observed between ROIs within each set size. Error bars represent between-subject confidence intervals.

However, although MVPA is a valuable tool for inferring the attended feature value across a population of weakly selective voxels, this method is limited in its ability to reveal how the sensory population response was influence by the requirement to attend an additional item. Specifically, we can only conclude from this analysis that the patterns of activation associated with each stimulus became less linearly separable as set size increased, whereas the underlying mechanism leading to the poorer classification accuracy is not specified. For example, worse classification accuracy might emerge due to decreases in the overall amplitude of orientation selective population response profile or due to a decrease in the selectivity of the population response profile; both alternatives can explain the observed link between declines in decoding accuracy and behavior. Thus, we employed a forward encoding model to provide a clearer understanding of how population-level feature-selective tuning functions are affected when attention is distributed across multiple items.

Evaluating population-level feature-selective tuning functions in primary visual cortex

So far, we have demonstrated reliable set size effects in behavioral- and BOLD-related responses during a perceptual monitoring task. To better understand the source of these set size effects, we now turn to the results of the forward encoding model, which estimates population-level orientation-selective responses based on patterns of activation measured in primary visual cortex during our experimental task. These response patterns were estimated using a forward encoding model of orientation selectivity (e.g. Brouwer and Heeger 2009; Serences and Saproo 2012; see Estimating orientation-selectivity using a forward encoding model) for both contralateral and ipsilateral hemispheres. As reported in the multi-voxel pattern classification above, stimulus-specific information was reliably decoded in both contralateral and ipsilateral hemispheres. We therefore implemented a similar approach by examining orientation-selective populations responses in both contralateral and ipsilateral ROIs. Since the MVPA analysis was suggestive of a spatially global, stimulus-specific representation (see also Ester et al., 2009; Serences and Boynton, 2007; Jehee, Brady, and Tong, 2011), we anticipated a clear population tuning function centered over the channel preferring the attended orientation for both set sizes and hemispheres (i.e. contralateral and ipsilateral). As shown in Figure 6, we observed a graded orientation-selective profile of channel activations that peaked in the channel corresponding to the orientation attended during the monitoring task in both contralateral (Figure 6A) and ipsilateral (Figure 6B) hemispheres.

Figure 6. Population-Level Orientation-Selective Responses.

Figure 6

A forward encoding model was used to generate orientation-selective population response profiles for each stimulus in the display based on patterns of activity observed in primary visual cortex. This analysis revealed a similar pattern across set sizes and ROIs (contralateral, ipsilateral): a clear population tuning function emerged that peaked at the channel centered over the stimulus’ orientation and decreased monotonically as the angular distance from this orientation increased. (A) Population tuning functions observed in contralateral hemisphere (collapsed across left and right hemifield stimulus presentations) for set size 1 (black) and set size 2 (gray) trials. (B) Population tuning functions observed in ipsilateral hemisphere (collapsed across left and right hemifield stimulus presentations) for set size 1 and se size 2 trials.

In contralateral hemisphere (Figure 6A), there is an apparent attenuation of on-channel responses (channels at or near the attended feature value, e.g., ±0, 22.5) and an increase in off-channel responses (channels further from the attended feature value, e.g., ±67.5, 90) when monitoring more than one grating. This observed pattern was supported by an analysis of variance (ANOVA), which revealed a significant condition (set size: 1, 2) by channel interaction (F(2.0,26.1)=4.52, p<.05; Greenhouse-Geisser corrected). Given that the effective allocation of attention simultaneously maximizes neuronal firing rates of on-channel cells and minimizes neuronal firing rates of off-channel cells (Martinez-Trujillo and Treue 2004; Treue and Martinez-Trujillo 1999), the observed pattern of results suggests that distributing attention across multiple items leads to an overall loss of selectivity in the population response in regions of visual cortex contralateral to the attended stimulus. We also modeled patterns of activation in regions of interest ipsilateral to the attended stimulus (Figure 6B), and observed no condition by channel interaction (p=.67). However, direct comparison of contralateral and ipsilateral responses revealed a main effect on channel responses (F(1,13)=11.82, p<.01), such that channel responses were smaller in the ipsilateral relative to the contralateral hemisphere.

The critical result here is the loss of selectivity in the population response profile contralateral to the attended stimulus in the set size 2 condition. We considered whether this effect could be explained by contamination from the channel activity associated with the ipsilateral stimulus in the set size 2 condition rather than by a true change in the selectivity of the contralateral population response. That is, we considered whether -- given that the response to the ipsilateral stimulus was amplified by attention in the set size 2 condition relative to the set size 1 condition – the activity associated with the ipsilateral stimulus could have yielded an artifactual change in the shape of the tuning function for the contralateral stimulus. One problem for this explanation, however, is that the amplitude of channel activity at the target value (i.e., on-channel activity) in the set size 2 condition was reliably lower than in the set size 1 condition, consistent with past observations that attenuated feature-based selection yields relative declines in on-channel responses and increases in off-channel responses (Martinez-Trujillo and Treue 2004; Treue and Martinez-Trujillo 1999; Scolari, Byers, and Serences 2012). By contrast, the putative contamination by the ipsilateral stimulus considered here should have yielded a uniform increase in activity across all channels because the orientation of the ipsilateral stimulus was fully counterbalanced with respect to the contralateral stimulus. Thus, even though the presence of the ipsilateral stimulus could have led to increases in off-channel activity, this alternative explanation cannot account for the observation that on-channel and off-channel activity showed opposite effects of attending an additional item. In addition, we report below that these changes in on-channel and off-channel activity for the contralateral stimulus predicted the behavioral cost of attending an additional item; this correlation is also inconsistent with an artifactual source of the observed changes in the channel tuning function. Thus, contamination from the ipsilateral stimulus is unlikely to explain the observed reduction in the selectivity of the channel tuning function in the set size 2 condition.

Another explanation for the observed pattern of results is that subjects simply fixated the to-be-attended stimulus on set size 1 trials. To examine this possibility, we examined the eye-tracking data recorded during the experimental task. Specifically, we computed the mean x, y position of saccades (defined here as stable fixations lasting > 200 ms outside a 0.50° centered on fixation) as a function of target location (left vs. right visual hemifield) during set size 1 trials. We failed to observe a difference in horizontal eye position between set size 1 conditions in which subjects had to attend the item on the left or right (t(5)=.57, p=.61), and both the attend left (t(5)=1.40, p=.26) and attend right (t(5)=.04, p=.97) conditions did not differ significantly from the set size 2 condition in which both items were attended. Thus, our results cannot be explained by different patterns of eye positions across manipulations of set size.

Reductions in orientation-selectivity predict behavioral performance

A key question is whether or not the observed changes in the profile of channel responses were linked with the observed changes in behavioral performance. Thus, we examined whether the set size dependent difference in each channel response predicted the observed cost in behavioral accuracy. First, because channel response functions were symmetric (see Figure 6), we collapsed channel response functions across orientation channels equally distant from the target orientation channel (e.g. −45 and 45 degree orientation channels). Then, we calculated the difference in channel responses between set sizes for every orientation channel (Figure 7). This was estimated by subtracting the channel responses in set size 2 from the channel responses in set size 1. Thus, a positive channel difference reflects a larger channel response in set size 1 for any given channel, and a negative channel difference reflects a larger channel responses in set size 2 than for set size 1. As can be seen in Figure 7A (and Figure 6A), there was an overall reduction in on-channel responses and an overall increase in off-channel responses when attending two items compared to when attention was focused on a single item. This analysis suggests that distributing attention to multiple items effectively reduces orientation-selectivity during perception by suppressing on-channel responses and enhancing off-channels responses in set size 2 relative to set size 1 trials.

Figure 7. Loss of Feature-Selectivity in Population Responses.

Figure 7

Channel differences were estimated by subtracting channel responses in population tuning functions observed in set size 2 trials from those observed in set size 1 trials for contralateral (A) and ipsilateral (B) hemispheres. Positive channel differences represent larger channel responses for set size 1 population tuning functions; negative channel differences represent larger channel responses for set size 2 population tuning functions. For contralateral ROIs, a loss of selectivity associated with attending multiple items was observed – represented as a reduction in on-channel responses and increase in off-channel responses for set size 2 compared to set size 1 population tuning functions – while no loss of selectivity was observed in ipsilateral ROIs. (C,D) Individual differences in the loss of feature-selectivity predict declines in behavioral performance. Differences in behavioral accuracy (set size 1 – set size 2) were plotted as a function of channel modulation in both contralateral (C) and ipsilateral (D) ROIs. For contralateral ROIs, channel modulation predicted behavioral costs in accuracy, such that a more negative slope in channel differences (A) corresponded to a larger set size-dependent cost in accuracy (p<.01). This link was not observed in ipsilateral ROIs (p=.41).

We then used this channel response difference measure to determine whether observed costs in orientation-selectivity predict individual difference in the behavioral cost of attending an additional item. If distributing attention across multiple items effectively reduces the quality of orientation-selective representations measured in primary visual cortex, then we should observe an orderly relationship between the shape of channel difference functions and the relative cost of attending multiple items. Specifically, individuals who evoke a larger gradient in channel response differences as a function of orientation channel should demonstrate a larger cost in behavioral accuracy when attending two relative to one items because feature-selectivity declines as the overall difference between on- and off-channel responses in channel difference functions increases. Consider the consequence of reducing or increasing on-channel or off-channel responses, respectively, on the quality of the information available during perception. On the one hand, by reducing on-channel population responses when attention must be allocated to more than one item, the strength of the signal derived from neural populations tuned toward the attended stimulus decreases and the overall activity level in on-channel cells is more similar to the baseline level of activity. On the other hand, by increasing off-channel responses when attention must be allocated to more than one item, the activity level of neural populations tuned away from the attended stimulus feature increases relative to baseline and the overall activity level in off-channel cells is more similar to the activity level of neural populations tuned toward to the attended stimulus feature. Taken together, decreases in on-channel responses (i.e. a positive channel difference value) and increases in off-channel responses (i.e. a negative channel difference value) are indicative of declines in feature-selectivity. To quantify the magnitude of declines in feature-selectivity, we calculated a channel modulation value by computing the slope of channel response differences as a function of orientation channel.

In line with the hypothesis that set-size dependent declines in feature-selectivity drive the concurrent declines in behavioral performance, we observed a strong link between channel modulation and accuracy costs such that individuals who evoked a larger decline in feature-selectivity – defined as a more negative channel modulation value – were more likely to demonstrate a larger reduction in accuracy when attending multiple items. Importantly, this relationship was only present in cortical ROIs contralateral to the attended stimulus (Figure 7C; R2=.45, p<.01), while no relationship was observed in ipsilateral ROIs (Figure 7D; R2=.06, p=.41). Thus, in line with perceptual coding models, observed set size effects in behavioral performance are attributable to a reduction in the selectivity of orientation-specific neural populations in primary visual cortex during perception.

Discussion

There is active debate regarding the processing locus of declines in performance with concurrent increases in set size. Perceptual coding models posit that the quality of stimulus representations is constrained by the distribution of a finite neural resource (Posner 1980; Palmer et al. 2011; Wolfe et al. 2001; Fisher 1984; Bundesen 1990), such that increased set sizes reduces the resources for each item, yielding declines in representational quality. Conversely, decision integration models posit that the internal noise represented in sensory cortices is independent of set size, and that reported set size effects arise from competing noisy representations during post-perceptual decision stages of information processing (Palmer 1995; Eckstein, Thomas, Palmer, and Shimozaki 2000; Palmer and McClean 1995; Verghese 2001).

In the current work, we estimated population-level, orientation-selective responses in primary visual cortex during a perceptual monitoring task requiring the distribution of attention to one or two visual stimuli. We observed a loss of orientation-selectivity when more than one item was attended. Specifically, there was a relative increase and decrease of off-channel and on-channel responses, respectively, which broadened the distribution of orientation-selective responses. Thus, these findings provide a mechanistic explanation of why decoding results from the MVPA analysis showed a decline in accuracy with set size. Moreover, we observed that the degree to which channel responses were affected by set size predicted costs in behavioral accuracy for each observer in the study. Thus, our findings show a strong relationship between the declining precision in the orientation population response for attended items, and the known behavioral costs of attending additional items.

The main conclusion of the current work must be reconciled with recent work demonstrating support for decision integration models (Pestilli et al., 2011). These authors measured univariate patterns of BOLD-related activity in striate and extrastriate visual areas during a contrast-discrimination task requiring the allocation of focal or distributed attention. In this task, subjects were presented with four gratings of variable contrast in each visual quadrant, and were required to detect a change in the contrast of a single grating between two presentations of the same display. The shape of the BOLD contrast response functions in the focal and distributed attention conditions failed to support perceptual coding models because: (1) behavior could not be explained via response enhancement during focal attention because no change in the slope of BOLD contrast response function was observed, and (2) behavior could not be explained via noise reduction during focal attention because a quantitative model constrained by the psychophysical data was used to show that neural responses would need to undergo an unreasonably high reduction in noise. The authors then demonstrated that behavior was well-described by a weighted pooling across responses to each stimulus, consistent with a decision integration account of efficient selection. .

The empirical pattern observed in Pestilli et al (2011), however, could arguably be task-specific in that their contrast discrimination task encouraged sensory pooling as an optimal strategy. Specifically, observers were required to indicate which frame contained a higher contrast target. This task may have been conducive to a pooling strategy in which a single representation reflecting average contrast was computed and compared across frames, and then observers selected the frame with the higher average contrast. In contrast, in the current work observers were required to maintain individuated representations of multiple stimuli throughout the entire trial to detect an unpredictable change in stimulus orientation. We therefore argue that in situations that require the precise encoding of multiple individuated representations, changes in behavioral performance can be well accounted for via changes in the precision of feature-selective sensory responses.

Multiple studies have demonstrated a relative enhancement of on-channel and suppression of off-channel neural response properties when attending to a single stimulus (Martinez-Trujillo and Treue 2004; Treue and Martinez-Trujillo 1999; Scolari, Byers, and Serences 2012), suggesting that attention increases the selectivity for attended features by increasing the responses of neurons preferring the attended feature value while decreasing responses of neurons tuned to the orthogonal feature value. Within the scope of these studies, our results suggest that attending multiple items encumbers a neural resource that drives the distinction between neural populations tuned either toward or away from the attended feature value. These findings are in line with perceptual coding models, which predict that attending multiple items degrades the quality of perceptual representations. Thus, while our findings do not exclude the possibility of increased decision noise with larger set sizes (Palmer 1995; Eckstein, Thomas, Palmer, and Shimozaki 2000; Palmer and McClean 1995; Verghese 2001), our findings demonstrate that attending more items has a direct impact on the quality of perceptual codes, and that these changes in perceptual quality can account for the associated declines in visually-guided behavior.

Our results suggest that the selectivity of population responses in sensory cortical regions declines with increases in the number of attended items. Increases in perceptual noise during stimulus encoding would effectively degrade the fidelity of information transmission and processing in cortical units (Pouget, Daya, and Zemel 2003), which would lead to lower quality population responses and compromised stimulus decoding. One possible explanation of this decline in the precision of population responses comes from a class of computational models in which multiple items are represented by oscillatory phase coding mechanisms (Canolty and Knight 2010; Jensen and Colgin 2007; Lisman and Idiart 1995; Roelfsema et al. 1996; Singer 1995). According to these phase coding models, subpopulations of feature-selective cells code the details of a single representation through synchronized neural activity (Engel et al. 1991; Fries et al. 2001; Gray et al. 1989; Siegel et al. 2008; for a review, see Fries 2005). To individuate each representation, the phase of each subpopulation of feature-selective cells is decoupled and maintained as asynchronous neural assemblies (Engel et al. 1991; Gray et al. 1989; Eckhorn et al. 1998; Roelfsema et al. 1996). Under this view, declines in the integrity of population responses may result from sensory units firing out of phase with their respective neural assembly and in phase with a different neural assembly. Thus, the probability of a given sensory unit firing in phase with a separate neural assembly would increase with the number of to-be-attended representations, leading to a decline in the precision of population responses.

A similar brand of phase coding models has been proposed for the maintenance of visual working memory representations (Lisman and Idiart 1995; Raffone and Wolters 2001), suggesting that similar declines in perceptual and mnemonic precision should be observed with increases in item load. Indeed, multiple studies have demonstrated an inverse relationship between mnemonic precision in visual working memory and set size (Wilken and Ma 2004; Zhang and Luck 2008; Anderson et al. 2011; Anderson and Awh 2012). Thus, it is possible that a similar oscillatory coding scheme explains how items are individuated during both perception and memory, and that declines in mnemonic precision with increasing set size are also due to increased dispersion in the population responses for items stored in memory (see Emrich et al. (2013) for a recent demonstration of declines in decoding accuracy as a function of set size).

The forward encoding model used here provides a powerful complement to standard MVPA methods by generating an estimate of the feature-selective population response profile based on a set of a priori, physiologically plausible assumptions. For instance, while an MVPA analysis in the current study revealed identical classification accuracies for contralateral and ipsilateral hemispheres in both set size conditions, confirming past observations of spatially global mnemonic and perceptual codes (Ester et al. 2009; Serences and Boynton 2007), the forward encoding model revealed a reduction in the amplitude of channel responses in ipsilateral relative to contralateral cortex. Thus, the forward encoding model revealed a basic distinction between population responses in contralateral and ipsilateral hemispheres that was invisible to the MVPA analysis.

To conclude, we investigated the neural locus of declines in behavioral performance across concurrent increases in set size during a perceptual monitoring task. We measured orientation-selective population response profiles in human primary visual cortex by using a combination of fMRI and a forward encoding model of orientation selectivity. In line with the central tenant of perceptual coding models (Posner 1980; Palmer et al. 2011), we observed a quantifiable attenuation in the selectivity of population responses that predicted the degree to which accuracy declined as a function of set size. This finding lends confidence to claim of perceptual coding models that the quality of encoded representations is constrained by the distribution of a finite neural resource, and that attending multiple items requires a finer distribution of this limited resource. Thus, the current work provides a novel insight into the neural source of decline in behavioral performance when multiple items must be simultaneously attended.

Acknowledgements

Supported by NIMH R01-MH087214 to E.A.

Footnotes

Conflict of Interest: None

Contributions: All authors conceived and designed the experiment. D.E.A. collected the data. D.E.A. and E.F.E. analyzed the data using software developed by J.T.S. All authors contributed to writing the manuscript.

References

  1. Anderson DE, Awh E. The plateau in mnemonic resolution across large set sizes indicates discrete resource limits in visual working memory. Attention, Perception, and Psychophysics. 2012;74:891–910. doi: 10.3758/s13414-012-0292-1. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  2. Anderson DE, Vogel EK, Awh E. Precision in visual working memory reaches a stable plateau when individual item limits are exceeded. Journal of Neuroscience. 2011;31:1128–1138. doi: 10.1523/JNEUROSCI.4125-10.2011. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  3. Anderson DE, Vogel EK, Awh E. A common discrete resource for visual working memory and visual search. Psychological Science. doi: 10.1177/0956797612464380. In Press. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  4. Awh E, Jonides J. Overlapping mechanisms of attention and spatial working memory. Trends in Cognitive Science. 2001;5:119–126. doi: 10.1016/s1364-6613(00)01593-x. [DOI] [PubMed] [Google Scholar]
  5. Bays PM, Husain M. Dynamic shifts of limited working memory resources in human vision. Science. 2008;321:851–854. doi: 10.1126/science.1158023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Boynton GM, Engel SA, Glover GH, Heeger DJ. Linear systems analysis of functional magnetic resonance imaging in human V1. Journal of Neuroscience. 1996;16:4207–4221. doi: 10.1523/JNEUROSCI.16-13-04207.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Brainard DH. The psychophysics toolbox. Spatial Vision. 1997;10:433–436. [PubMed] [Google Scholar]
  8. Brouwer GJ, Heeger DJ. Decoding and reconstructing color from responses in human visual cortex. Journal of Neuroscience. 2009;29:13992–14003. doi: 10.1523/JNEUROSCI.3577-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Brouwer GJ, Heeger DJ. Cross-orientation suppression in human visual cortex. Journal of Neurophysiology. 2011;106:2108–2119. doi: 10.1152/jn.00540.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Bundesen C. A theory of visual attention. Psychological Review. 1990;97:523–547. doi: 10.1037/0033-295x.97.4.523. [DOI] [PubMed] [Google Scholar]
  11. Canolty RT, Knight RT. The functional role of cross frequency coupling. Trends in Cognitive Science. 2010;14:506–514. doi: 10.1016/j.tics.2010.09.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Chun MM. Visual working memory as visual attention sustained over time. Neuropsychologica. 2011;49:1407–1409. doi: 10.1016/j.neuropsychologia.2011.01.029. [DOI] [PubMed] [Google Scholar]
  13. Cowan N. The magical number 4 in short-term memory: A reconsideration of mental storage capacity. Behavioral and Brain Sciences. 2001;24:87–185. doi: 10.1017/s0140525x01003922. [DOI] [PubMed] [Google Scholar]
  14. D’Esposito M. From cognitive to neural models of working memory. Philosophical Transactions of the Royal Society London B: Biological Sciences. 2007;362:761–772. doi: 10.1098/rstb.2007.2086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Desimone R, Duncan J. Neural mechanisms of selective visual attention. Annual Review of Neuroscience. 1995;18:193–222. doi: 10.1146/annurev.ne.18.030195.001205. [DOI] [PubMed] [Google Scholar]
  16. Duncan J. The locus of interference in the perception of simultaneous stimuli. Psychological Review. 1980;87:272–300. [PubMed] [Google Scholar]
  17. Duncan J, Humphreys GW. Visual search and stimulus similarity. Psychological Review. 1989;96:433–458. doi: 10.1037/0033-295x.96.3.433. [DOI] [PubMed] [Google Scholar]
  18. Eckhorn R, et al. Coherent oscillations: a mechanism of feature linking in the visual cortex? Multiple electrode and correlation analyses in the cat. Biological Cybernetics. 1988;60:121–130. doi: 10.1007/BF00202899. [DOI] [PubMed] [Google Scholar]
  19. Eckstein MP, Thomas JP, Palmer J, Shimozaki SS. A signal detection model predicts the effects of set size on visual search accuracy for feature, conjunction, triple conjunction, and disjunction displays. Perception & Psychophysics. 2000;62:425–451. doi: 10.3758/bf03212096. [DOI] [PubMed] [Google Scholar]
  20. Emrich SM, Rigall AC, LaRocque JL, Postle BR. Distributed patterns of activity in sensory cortex reflect the precision of multiple items maintained in visual short-term memory. Journal of Neuroscience. 2013;33:6516–6523. doi: 10.1523/JNEUROSCI.5732-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Engel AK, Konig P, Singer W. Direct physiological evidence for scene segmentation by temporal coding. PNAS. 1991;88:9136–9140. doi: 10.1073/pnas.88.20.9136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Engel SA, Rumelhart DE, Wandell BA, Lee AT, Glover GH, Chichilnisky EJ, Shadlen MN. fMRI of human visual cortex. Nature. 1994;369:525. doi: 10.1038/369525a0. [DOI] [PubMed] [Google Scholar]
  23. Ester EF, Serences JT, Awh E. Spatially global representations in human primary visual cortex during working memory maintenance. Journal of Neuroscience. 2009;29:15258–15265. doi: 10.1523/JNEUROSCI.4388-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Fisher DL. Limited channel models of automatic detection: capacity and scanning in visual search. Psychological Review. 1982;6:662–692. [PubMed] [Google Scholar]
  25. Fisher DL. Central capacity limits in consistent mapping visual search tasks: Four channels or more? Cognitive Psychology. 1984;16:449–484. doi: 10.1016/0010-0285(84)90017-3. [DOI] [PubMed] [Google Scholar]
  26. Freeman J, Brouwer GJ, Heeger DJ, Merriam EP. Orientation decoding depends on maps, not columns. Journal of Neuroscience. 2011;31:4792–4804. doi: 10.1523/JNEUROSCI.5160-10.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Fries P. A mechanism for cognitive dynamics: neuronal communication through neuronal coherence. Trends in Cognitive Science. 2005;9:474–480. doi: 10.1016/j.tics.2005.08.011. [DOI] [PubMed] [Google Scholar]
  28. Fries P, Reynolds JH, Rorie AE, Desimone R. Modulation of oscillatory neuronal synchronization by selective visual attention. Science. 2001;291:1560–1563. doi: 10.1126/science.1055465. [DOI] [PubMed] [Google Scholar]
  29. Gandhi SP, Heeger DJ, Boynton JM. Spatial attention affects brain activity in human primary visual cortex. PNAS. 1999;96:3314–3319. doi: 10.1073/pnas.96.6.3314. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Geisler WS, Chou K. Separation of low-level and high-level factors in complex tasks: Visual search. Psychological Review. 1995;102:356–378. doi: 10.1037/0033-295x.102.2.356. [DOI] [PubMed] [Google Scholar]
  31. Genovese CR, Lazar NA, Nichols TE. Thresholding of statistical maps in functional neuroimaging using the false discovery rate. NeuroImage. 2002;15:870–878. doi: 10.1006/nimg.2001.1037. [DOI] [PubMed] [Google Scholar]
  32. Gray CM, Konig P, Engel AK, Singer W. Oscillatory responses in cat visual cortex exhibit inter-columnar synchronization which reflects global stimulus properties. Nature. 1989;338:334–337. doi: 10.1038/338334a0. [DOI] [PubMed] [Google Scholar]
  33. Gur M, Kagan I, Snodderly DM. Orientation and direction selectivity of neurons in V1 of alert monkeys: functional relationships and laminar distributions. Cerebral Cortex. 2005;15:1207–1221. doi: 10.1093/cercor/bhi003. [DOI] [PubMed] [Google Scholar]
  34. Haynes JD, Rees G. Predicting the orientation of invisible stimuli from activity in human primary visual cortex. Nature Neuroscience. 2005;8:686–691. doi: 10.1038/nn1445. [DOI] [PubMed] [Google Scholar]
  35. Hoffman JE. Search through a sequentially presented visual display. Perception and Psychophysics. 1978;23:1–11. doi: 10.3758/bf03214288. [DOI] [PubMed] [Google Scholar]
  36. Jehee JFM, Brady DK, Tong F. Attention improves encoding of task-relevant features in the human visual cortex. Journal of Neuroscience. 2011;31:8210–8219. doi: 10.1523/JNEUROSCI.6153-09.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Jensen O, Colgin LL. Cross-frequency coupling between neuronal oscillations. Trends in Cognitive Science. 2007;11:267–269. doi: 10.1016/j.tics.2007.05.003. [DOI] [PubMed] [Google Scholar]
  38. Kamitani Y, Tong F. Decoding the visual and subjective contents of the human brain. Nature Neuroscience. 2005;8:679–685. doi: 10.1038/nn1444. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Kastner S, Ungerleider LG. Mechanisms of visual attention in the human cortex. Annual Review of Neuroscience. 2000;23:315–341. doi: 10.1146/annurev.neuro.23.1.315. [DOI] [PubMed] [Google Scholar]
  40. Kastner S, DeWeerd P, Desimone R, Ungerleider LG. Mechanisms of directed attention in the human extrastriate cortex as revealed by functional MRI. Science. 1998;282:108–111. doi: 10.1126/science.282.5386.108. [DOI] [PubMed] [Google Scholar]
  41. Lisman JE, Idiart MAP. Storage of 7 +/− 2 short-term memories in oscillatory subcycles. Science. 1995;267:1513–1515. doi: 10.1126/science.7878473. [DOI] [PubMed] [Google Scholar]
  42. Luck SJ, Vogel EK. The capacity of visual working memory for features and conjunctions. Nature. 1997;390:279–281. doi: 10.1038/36846. [DOI] [PubMed] [Google Scholar]
  43. Martinez-Trujillo JC, Treue S. Feature-based attention increases the selectivity of population responses in primate visual cortex. Current Biology. 2004;14:744–751. doi: 10.1016/j.cub.2004.04.028. [DOI] [PubMed] [Google Scholar]
  44. Mazyar H, Van den Berg R, Ma WJ. Does precision decrease with set size? Journal of Vision. 2012;12 doi: 10.1167/12.6.10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Palmer J. Attentional limits on the perception and memory of visual information. Journal of Experimental Psychology: Human Perception and Performance. 1990;16:332–350. doi: 10.1037//0096-1523.16.2.332. [DOI] [PubMed] [Google Scholar]
  46. Palmer J. Attention in visual search: Distinguishing four causes of set size effects. Current Directions in Psychological Science. 1995;4:118–123. [Google Scholar]
  47. Palmer J, Ames CT, Lindsey DT. Measuring the effects of attention on simple visual search. Journal of Experimental Psychology: Human Perception and Performance. 1993;19:108–130. doi: 10.1037//0096-1523.19.1.108. [DOI] [PubMed] [Google Scholar]
  48. Palmer J, McClean J. Visual search: Large set size effects do not reject models based upon independent channels. Investigative Ophthalmology & Visual Science. 1996;37:62. [Google Scholar]
  49. Palmer EM, Fencsik DE, Flusberg SJ, Horowitz TS, Wolfe JM. Signal detection evidence for limited capacity in visual search. Attention, Perception, and Psychophysics. 2011;73:2413–2414. doi: 10.3758/s13414-011-0199-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Pashler H. Detecting conjunctions of color and form: reassessing the serial search hypothesis. Perception & Psychophysics. 1987;41:191–201. doi: 10.3758/bf03208218. [DOI] [PubMed] [Google Scholar]
  51. Pashler H. Familiarity and visual change detection. Perception and Psychophysics. 1988;44:369–378. doi: 10.3758/bf03210419. [DOI] [PubMed] [Google Scholar]
  52. Pelli DG. The VideoToolbox software for visual psychophysics: Transforming numbers into movies. Spatial Vision. 1997;10:437–442. [PubMed] [Google Scholar]
  53. Pestilli F, Carrasco M, Heeger DJ, Gardner JL. Attentional enhancement via selection and pooling of early sensory responses in human visual cortex. Neuron. 2011;72:832–846. doi: 10.1016/j.neuron.2011.09.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Phillips WA. On the distinction between sensory storage and short-term visual memory. Perception and Psychophysics. 1974;16:283–290. [Google Scholar]
  55. Posner MI, Snyder CRR, Davidson BJ. Attention and the detection of signals. Journal of Experimental Psychology: General. 1980;109:160–174. [PubMed] [Google Scholar]
  56. Postle BR. Working memory as an emergent property of the mind and brain. Neuroscience. 2006;139:23–38. doi: 10.1016/j.neuroscience.2005.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Pouget A, Dayan P, Zemel RS. Inference and computation with population codes. Annual Review of Neuroscience. 2003;26:381–410. doi: 10.1146/annurev.neuro.26.041002.131112. [DOI] [PubMed] [Google Scholar]
  58. Prinzmetal W, Banks WP. Perceptual capacity limits in visual detection and search. Bulletin of the Psychonomic Society. 1983;21:263–266. [Google Scholar]
  59. Raffone A, Wolters G. A cortical mechanism for binding in visual working memory. Journal of Cognitive Neuroscience. 2001;13:766–785. doi: 10.1162/08989290152541430. [DOI] [PubMed] [Google Scholar]
  60. Ringach DL, Shapley RM, Hawken MJ. Orientation selectivity in macaque V1: Diversity and laminar independence. Journal of Neuroscience. 2002a;22:5639–5651. doi: 10.1523/JNEUROSCI.22-13-05639.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Ringach DL, Bredfeldt CE, Shapley RM, Hawken MJ. Suppression of neural responses to nonoptimal stimuli correlates with tuning selectivity in macaque V1. Journal of Neurophysiology. 2002b;87:1018–1027. doi: 10.1152/jn.00614.2001. [DOI] [PubMed] [Google Scholar]
  62. Roelfsema PR, Engel AK, Konig P, Singer W. The role of neuronal synchonization in response selection: A biological plausible theory of structured representations in the visual cortex. Journal of Cognitive Neuroscience. 1996;8:603–625. doi: 10.1162/jocn.1996.8.6.603. [DOI] [PubMed] [Google Scholar]
  63. Saproo S, Serences JT. Spatial attention improves the quality of population codes in human visual cortex. Journal of Neurophysiology. 2010;104:885–895. doi: 10.1152/jn.00369.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Sasaki Y, Rajimehr R, Kim BW, Ekstrom LB, Vanduffel W, Tootell RBH. A different slant on visual orientation selectivity in human and nonhuman primates. Neuron. 2006;51:661–670. doi: 10.1016/j.neuron.2006.07.021. [DOI] [PubMed] [Google Scholar]
  65. Schiller PH, Finlay BL, Volman SF. Quantitative studies of single-cell properties in monkey striate cortex. II. Orientation specificity and ocular dominance. Journal of Neurophysiology. 1976;39:1320–1333. doi: 10.1152/jn.1976.39.6.1320. [DOI] [PubMed] [Google Scholar]
  66. Scolari M, Byers A, Serences JT. Optimal deployment of attention gain during fine discriminations. Journal of Neuroscience. 2012;32:7723–7733. doi: 10.1523/JNEUROSCI.5558-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Serences JT, Ester EF, Vogel EK, Awh E. Stimulus-specific delay activity in human primary visual cortex. Psychological Science. 2009;20:207–214. doi: 10.1111/j.1467-9280.2009.02276.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Serences JT, Saproo S, Scolari M, Ho T, Muftuler LT. Estimating the influence of attention on population codes in human visual cortex using voxel based tuning functions. Neuroimage. 2009;44:223–231. doi: 10.1016/j.neuroimage.2008.07.043. [DOI] [PubMed] [Google Scholar]
  69. Serences JT, Boynton GM. Feature-based attentional modulations in the absence of direct visual stimulation. Neuron. 2007;55:301–312. doi: 10.1016/j.neuron.2007.06.015. [DOI] [PubMed] [Google Scholar]
  70. Sereno MI, Dale AM, Reppas JB, Kwong KK, Belliveau JW, Brady TJ, Rosen BR, Tootell RB. Borders of multiple visual areas in human revealed by functional magnetic resonance imaging. Science. 1995;268:889–893. doi: 10.1126/science.7754376. [DOI] [PubMed] [Google Scholar]
  71. Siegel M, Donner TH, Oostenveld R, Fries P, Engel AK. Neuronal synchronization along the dorsal visual pathway reflects the focus of spatial attention. Neuron. 2008;60:709–719. doi: 10.1016/j.neuron.2008.09.010. [DOI] [PubMed] [Google Scholar]
  72. Singer W, Gray CM. Visual feature integration and the temporal correlation hypothesis. Annual Review of Neuroscience. 1995;18:555–586. doi: 10.1146/annurev.ne.18.030195.003011. [DOI] [PubMed] [Google Scholar]
  73. Swindale NV. Orientation tuning curves: empirical description and estimation of parameters. Biological Cybernetics. 1998;78:45–56. doi: 10.1007/s004220050411. [DOI] [PubMed] [Google Scholar]
  74. Swisher JD, Gatenby JC, Gore JC, Wolfe BA, Moon CH, Kim SG, Tong F. Multiscale pattern analysis of orientation-selective activity in the primary visual cortex. Journal of Neuroscience. 2010;30:325–330. doi: 10.1523/JNEUROSCI.4811-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Treisman AM, Gelade G. A feature-integration theory of attention. Cognitive Psychology. 1980;12:97–136. doi: 10.1016/0010-0285(80)90005-5. [DOI] [PubMed] [Google Scholar]
  76. Treue S, Martinez-Trujillo JC. Feature-based attention influences motion processing gain in macaque visual cortex. Nature. 1999;399:575–579. doi: 10.1038/21176. [DOI] [PubMed] [Google Scholar]
  77. Verghese P. Visual search and attention: A signal detection theory approach. Neuron. 2001;31:523–535. doi: 10.1016/s0896-6273(01)00392-0. [DOI] [PubMed] [Google Scholar]
  78. Wilken P, Ma WJ. A detection theory account of change detection. Journal of Vision. 2004;4:1120–1135. doi: 10.1167/4.12.11. [DOI] [PubMed] [Google Scholar]
  79. Wolfe JM. Guided search 2.0: a revised model of visual search. Psychonomic Bulletin & Review. 1994;1:202–238. doi: 10.3758/BF03200774. [DOI] [PubMed] [Google Scholar]
  80. Zhang W, Luck SJ. Discrete fixed-resolution representations in visual working memory. Nature. 2008;453:233–235. doi: 10.1038/nature06860. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES