Abstract
Attention often requires maintaining a stable mental state over time while simultaneously improving perceptual sensitivity. These requirements place conflicting demands on neural populations, as sensitivity implies a robust response to perturbation by incoming stimuli, which is antithetical to stability. Functional specialization of cortical areas provides one potential mechanism to resolve this conflict. We reasoned that attention signals in executive control areas might be highly stable over time, reflecting maintenance of the cognitive state, thereby freeing up sensory areas to be more sensitive to sensory input (i.e., unstable), which would be reflected by more dynamic attention signals in those areas. To test these predictions, we simultaneously recorded neural populations in prefrontal cortex (PFC) and visual cortical area V4 in rhesus macaque monkeys performing an endogenous spatial selective attention task. Using a decoding approach, we found that the neural code for attention states in PFC was substantially more stable over time compared with the attention code in V4 on a moment-by-moment basis, in line with our guiding thesis. Moreover, attention signals in PFC predicted the future attention state of V4 better than vice versa, consistent with a top-down role for PFC in attention. These results suggest a functional specialization of attention mechanisms across cortical areas with a division of labor. PFC signals the cognitive state and maintains this state stably over time, whereas V4 responds to sensory input in a manner dynamically modulated by that cognitive state.
SIGNIFICANCE STATEMENT Attention requires maintaining a stable mental state while simultaneously improving perceptual sensitivity. We hypothesized that these two demands (stability and sensitivity) are distributed between prefrontal and visual cortical areas, respectively. Specifically, we predicted attention signals in visual cortex would be less stable than in prefrontal cortex, and furthermore prefrontal cortical signals would predict attention signals in visual cortex in line with the hypothesized role of prefrontal cortex in top-down executive control. Our results are consistent with suggestions deriving from previous work using separate recordings in the two brain areas in different animals performing different tasks and represent the first direct evidence in support of this hypothesis with simultaneous multiarea recordings within individual animals.
Keywords: attention, extrastriate, monkey, population, prefrontal, vision
Introduction
Nervous systems provide the following critical functions: (1) the ability to respond to input and (2) the ability to store information in the absence of input. These two functions conflict with each other because information storage requires a stable representation that is robust to outside perturbation, whereas sensitivity to perturbation is the essence of responsiveness. The functional specialization of cortical areas reflects one means to resolve this apparent conflict. Areas that are synaptically distant from the sensory periphery (such as frontal executive control areas) might function through highly stable activity patterns in line with the need to maintain cognitive states across long time scales, whereas activity patterns in sensory areas closer to the periphery might be more volatile, reflecting sensitivity of these patterns to sudden environmental changes (Murray et al., 2014; Runyan et al., 2017).
Indeed, there is substantial evidence for the dynamic nature of sensory responses within sensory cortical areas. For example, although the earliest spikes in response to a visual input generally carry the most information about the stimulus (Osborne et al., 2004), feature selectivity in visual cortex continues to develop over time. Such dynamic sensory coding has been observed in several contexts, including orientation selectivity (Ringach et al., 1997; Müller et al., 2003; Shapley et al., 2003) and contour integration (Chen et al., 2014) in area V1, texture selectivity (Kim et al., 2019) and object occlusion (Fyall et al., 2017) in area V4, and pattern motion sensitivity in area MT (Pack and Born, 2001; Smith et al., 2005).
In contrast, stable representations have been hypothesized to be essential for executive functions involving maintenance of information on long time scales (several seconds and longer). One brain area implicated in such executive functions is the lateral prefrontal cortex (PFC), and indeed evidence for stable neural representations in PFC has been reported for long time-scale behaviors including categorical reasoning (Freedman et al., 2003; Cromer et al., 2011; McKee et al., 2014), flexible rule-based decision-making (Bunge et al., 2003; Siegel et al., 2015), and working memory (Constantinidis et al., 2018; Wasmuht et al., 2018; Parthasarathy et al., 2019). It is important to emphasize that such stable representations can exist at the population level despite considerable temporal instability at the level of individual neurons (Druckmann and Chklovskii, 2012). The case of working memory provides an excellent illustration of this point. Work using population-level analyses has found that although the activity of PFC neurons can be highly dynamic during memory maintenance periods, a stable representation can persist in a subspace of the population activity that encodes the mnemonic information (Murray et al., 2017; Parthasarathy et al., 2019).
One cognitive function that exemplifies the tension between stability and sensitivity is selective attention, which is the ability to marshal limited processing resources preferentially for goal-relevant information while suppressing the processing of less relevant or distracting information. For example, consider searching for a friend wearing a red shirt in a crowd. This requires maintaining a stable representation that red is the relevant visual feature while also endeavoring to improve visual sensitivity to that feature. Correlates of attention have been found in the neural activity of multiple sensory areas (for review, see Hromádka and Zador, 2007; Maunsell, 2015; Gomez-Ramirez et al., 2016), as well as parietal (Desimone and Duncan, 1995) and frontal association cortex, including PFC (Paneri and Gregoriou, 2017). Previous work has suggested that PFC and V4 directly interact in the service of attention (Squire et al., 2013); but the functional specialization of those areas has not been tested with respect to the relative stability of population activity. Moreover, although previous researchers have speculated that diversity of intrinsic time scales across cortical areas reflects a computational division of labor (Murray et al., 2014), this type of interplay of stable and unstable processes has not been directly demonstrated moment to moment with simultaneous recordings in multiple brain areas. Previous studies have focused on trial-averaged activity rather than moment-to-moment signals from populations of neurons.
We asked if the conflicting demands of selective attention to maintain stable task-set representations and to enhance sensitivity to input might be distributed across nodes of this network, and if so, what is the nature of that areal specialization? Thus, for the present study, we sought to compare the temporal stability of, and interaction among, attention signals in two key nodes of the attention network: V4 and PFC. We predicted that PFC population activity would more stably represent the attention state compared with population activity in V4. This prediction is in line with the relative stability of task-set representations previously found in PFC compared with the relative dynamism of sensory processes in V4 but has not yet been explicitly tested with simultaneous measurement of population activity in the two brain areas. Thus, we provide the first direct evidence for this division of labor for temporal stability across frontal and visual cortical areas on a moment-by-moment basis on individual trials.
To test our prediction that PFC signals attention states stably whereas V4 attention states are more dynamic, we simultaneously recorded neural populations in V4 and dorsolateral PFC of monkeys while they performed a visuospatial selective attention task and then used a decoding approach to assay the degree to which the population codes for attention in each brain area were stable. We found neural codes for attention states remained highly stable in a subspace of prefrontal population activity, whereas corresponding codes in V4 were relatively dynamic. We also found that the estimated attention state in PFC predicted the future attention state in V4 on a centisecond time scale, in line with a top-down role of PFC in endogenous attention processes. These results suggest attention relies on areal specialization for subfunctions of selective attention processes; stable task-set representations are maintained over time in frontal cortex, freeing up visual cortex to remain sensitive to external events.
Materials and Methods
Ethical oversight
Experimental procedures were approved by the Institutional Animal Care and Use Committee of the University of Pittsburgh and were performed in accordance with the U.S. National Research Council Guide for the Care and Use of Laboratory Animals.
Subjects
We used two adult male rhesus macaques (Macaca mulatta) for this study. Two animals is the minimum number needed to demonstrate reproducibility and is customary for research with this model species. Surgeries were performed in aseptic conditions under isoflurane anesthesia. Opiate analgesics were used to minimize pain and discomfort perioperatively. A titanium head post was attached to the skull with titanium screws to immobilize the head during experiments. After each subject was trained to perform the spatial attention task, we implanted one 100 electrode Utah array (Blackrock Microsystems) in each of the brain areas V4 and PFC (area 8Ar; Fig. 1A). We implanted the electrode arrays in the right hemisphere for Monkey P and in the left hemisphere for Monkey W. A separate analysis of a portion of these data was previously reported, along with a complete description of the experimental methods (Snyder et al., 2018). The pattern of results was highly similar across the two animals, so we combined across animals for the results reported in the main text.
Visual change-detection task
Subjects maintained central fixation as sequences of drifting Gabor stimuli were presented in one or both of the visual hemifields and were rewarded with water or juice for detecting a change in orientation of one of the stimuli in the sequence (the target) and making a saccade to that stimulus (Fig. 1B). The probable target location was block randomized so that 90% of the targets would occur in one hemifield until the subject made 80 correct detections in that block (including cue trials, described below), at which point the probable target location was changed to the opposite hemifield.
The fixation point was a 0.6° yellow dot at the center of a flat-screen cathode ray tube monitor positioned 36 cm from the eyes of the subjects. The background of the display was 50% gray. We measured monitor luminance γ functions by photometer and linearized the relationship between input voltage and output luminance using look-up tables. We tracked the gaze of the subjects using an infrared eye-tracking system (EyeLink 1000, SR Research). Gaze was monitored online by the experimental control software to ensure fixation within ≈1° of the central fixation point throughout each trial. We excluded from analysis data segments during which the gaze of a subject left the fixation window.
After fixating for a randomly chosen duration of 300–500 ms (uniformly distributed), a visual stimulus was presented for 400 ms or until the gaze gaze of the subjects left the fixation window, whichever came first. For the initial trials within a block, a Gabor stimulus was presented only in the hemifield that was chosen to have a high probability of target occurrence for the block. These cue trials were to alert the subjects to a change in the probable target location and were excluded from the analysis. The initial cue location was counterbalanced across recording sessions. Once a subject correctly detected five orientation changes during the cue trials, bilateral Gabor stimuli were presented for the remainder of the block.
Each trial consisted of a sequence of 400 ms stimulus presentations separated by 300–500 ms interstimulus intervals (uniformly distributed). We varied the interstimulus intervals so that stimulus onset times would be relatively less predictable, encouraging the animal to deploy attention consistently over time (as opposed to transiently disengaging attention during the interstimulus interval). Stimulus sequences continued until the subject made an eye movement (data during saccades were excluded from analysis), or a target was presented, but the subject did not respond to it within 400 ms (i.e., a miss). For the first presentation in a sequence, the orientation of the stimulus at the cued location was randomly chosen to be 45 or 135°, and the orientation of the stimulus in the opposite hemifield, if present, was orthogonal to this. Subsequent stimulus presentations in the sequence each had a fixed probability (uniform hazard function) of containing a target (30% for Monkey P, 40% for Monkey W), that is, a change in orientation of one of the Gabor stimuli compared with the preceding stimulus presentations in the trial. Within a block, 90% of targets (randomly chosen) occurred in one hemifield (valid targets), and 10% of targets occurred in the opposite hemifield (invalid targets). For valid targets, the orientation change was randomly chosen to be 1, 3, 6, or 15° in either the clockwise or anticlockwise direction (number of valid targets of each orientation at each location: Monkey P: 11.49 ± 3.14, mean ± SD, across sessions, Monkey W: 14.56 ± 4.75). For invalid targets, the orientation change was always the near threshold value of 3°, clockwise or anticlockwise (because invalid targets occurred infrequently, we restricted the number of orientation change magnitudes for this condition to derive a reasonable estimate of the target detection rate). We analyzed trials including either valid or invalid targets but excluded from analysis all neural data from the time of target onset through the end of the trial.
Monkey P completed 24 sessions of the experiment, and Monkey W completed 25 sessions. One session for each subject was subsequently excluded from analysis because of recording equipment failure.
Microelectrode array recordings
Signals from the arrays were bandpass filtered (0.3–7500 Hz), digitized at 30 kHz, and amplified by a Grapevine system (Ripple). Signals crossing a threshold (periodically adjusted using a multiple of the root-mean-squared noise) were stored for off-line analysis. We first performed a semisupervised sorting procedure followed by manual refinement using custom MATLAB software (https://github.com/smithlabvision/spikesort), taking into account waveform shapes and interspike interval distributions. These initial sorting steps yielded 93.2 ± 8.9 (mean ± SD) candidate units per session in V4 and 119.6 ± 17.5 units per session in PFC for Monkey P and 61.9 ± 27.4 candidate units per session in V4 and 113.8 ± 21.9 units per session in PFC for Monkey W. We likely recorded a mixture of single units and multiunit activity, although for simplicity we refer to all units as “neurons.” The arrays were chronically implanted and likely recorded some, but not all, neurons over more than one recording session, but we calculated our results within each recording session and treated each session as an independent sample for the analysis. We note with respect to statistical inference that if we had, in fact, repeatedly sampled the same pool of neurons across sessions, then our p values would have a different meaning than if we had sampled a completely new set each time. In the former case, the interpretation would be that a particular sample of neurons shows the effects we found and that that effect was not because of chance (e.g., on the first session), whereas in the latter case the interpretation would be that neurons in general show the effect we found. Both are valid inferences, and both are interesting findings. In fact, our view is that attention mechanisms can be conceived as latent factors that affect neural populations and that we observed the same latent factors each session with our population-level analysis whether or not the neurons were the same or completely different between sessions.
To avoid potential confounds because of our blocked design, we excluded neurons that were not recorded stably throughout a session. These were identified by dividing all the recorded data for a session into 10 equally sized blocks, measuring the average firing rate of each neuron within each block and then calculating the coefficient of variation (CV) of the average firing rate of each neuron over the blocks. Neurons with a CV >1 were deemed to be unstable and were excluded (9.3 ± 8.7 neurons excluded for Monkey P per session, 14.3 ± 16.5 neurons excluded for Monkey W per session). We also excluded neurons with an average firing rate <0.1 spikes per second measured over the entire session (2.3 ± 2.2 additional neurons excluded for Monkey P per session, 6.0 ± 5.7 neurons excluded for Monkey W per session).
We used a factor analysis (FA) method to reduce the dimensionality of the population activity (see below, Dimensionality reduction of population activity). Neural activity recorded on different electrodes can, in rare cases, show highly correlated activity because of electrical cross-talk among electrodes on the array (Yu et al., 2009). Such correlated activity can confound methods such as FA or Gaussian process factor analysis (GPFA), which seek to identify shared variance among neurons. To be conservative, we measured the Pearson correlation between spike trains (1 ms time bins) for all pairs of neurons and randomly excluded one neuron from each pair with a correlation coefficient greater than r = 0.1. This criterion was quite liberal (12.9 ± 9.3 neurons excluded per session), and the results were not affected by substantial variation of this correlation threshold.
One of our goals was to test how much variability in one neural population was explained by the population activity of the other brain area (see below, Granger causal influence). Such an analysis may be biased if we used a greater number of neurons in one brain area to predict the activity of a fewer number of neurons in another brain area. To place the two areas on the same footing, for each session, we randomly selected neurons from the population with greater cardinality to equate the population size used for all analyses between the two brain areas (final population size in each area for the analysis was 79.1 ± 12.5, mean ± SD neurons for Monkey P per session; 50.9 ± 22.0 neurons for Monkey W per session).
From the continuous recording, we extracted data segments from 300 ms before stimulus onset to 300 ms following stimulus offset (1 s total segment duration; Monkey P: 2253.1 ± 591.3 data segments per session; Monkey W: 2894.2 ± 946.8 data segments per session) and counted spikes for each neural unit in 1 ms bins. For the calculation of peristimulus spike time histograms (PSTHs; Fig. 2), we smoothed spike trains with a causal half-Gaussian function with σ = 20 ms before averaging across data segments.
Receptive field mapping
Before beginning the visual change-detection experiment, we mapped the receptive fields (RFs) of the spiking neurons recorded on the V4 arrays by presenting small (≈1°) sinusoidal gratings (four orientations) at a grid of positions. We subsequently used Gabor stimuli scaled and positioned to roughly cover the aggregate RF area determined by the responses to the small gratings at the grid of positions. For Monkey P this was 7.02° full-width at half-maximum (FWHM) centered 7.02° below and 7.02° to the left of fixation, and for Monkey W this was 4.70° FWHM centered 2.35° below and 4.70° to the right of fixation. We next measured tuning curves by presenting gratings at the RF area with four orientations and a variety of spatial and temporal frequencies. For each subject we used full-contrast Gabor stimuli with a temporal and spatial frequency that evoked a robust response from the population overall (i.e., our stimulus was not optimized for any single neuron). For Monkey P this was 0.85 cycles/° and 8 cycles/s. For Monkey W this was 0.85 cycle/° and 7 cycles/s. For the task, we presented a Gabor stimulus at the estimated RF location, at the mirror-symmetric location in the opposite hemifield, or at both locations simultaneously. We did not measure RFs for the PFC neurons; stimulus properties were chosen solely on the basis of V4 responses. In a preliminary step, we electrically microstimulated each of our PFC electrodes at a range of current amplitudes and found that we were unable to evoke eye movements even at current amplitudes several times greater than those that typically evoke eye movements from electrodes implanted in the frontal eye field (FEF; Goldberg and Bruce, 1986), suggesting that our electrodes did not impinge on FEF but were rather exclusively in PFC.
Dimensionality reduction of population activity
To focus the decoding analyses (see below, Decoding attention state) on activity that is shared among neurons, we first applied GPFA to the spike counts recorded in each brain area separately (Yu et al., 2009; Fig. 3). We then orthonormalized the dimensions using the procedure described in Yu et al. (2009). This yielded a single-trial time series of latent factors, which summarize the shared activity among neurons over time. For this analysis, we restricted the dataset to presentations of standard stimuli to which the subject correctly withheld a response. We applied GPFA separately to each of the two stimulus configuration conditions (i.e., when the stimulus in the RF was oriented at 45° and when it was oriented at 135°). For each neuron we square-root transformed the spike counts (as a variance-stabilizing step), then subtracted the average PSTH for the included data segments from each individual trial. That is, our analysis focused on residual trial-to-trial variation in firing rates beyond the average stimulus-evoked response. We then binned the residual spike counts for each neuron in each data segment within 50 nonoverlapping 20 ms time bins. We aimed to specify a common model dimensionality across all brain areas and sessions so that values across brain areas and sessions would be directly comparable. We tested several potential dimensionalities for our GPFA models and found that 10 dimensions captured most of the explainable shared variance in all but one of our datasets, so we used this dimensionality consistently. We also tested different criteria to select the number of dimensions, such as using the minimum number of dimensions needed to retain 90% of the total shared variance of each population, and found the pattern of results to be highly consistent. Thus, our results and conclusions are not dependent on the precise criterion to choose the number of dimensions. Our goal in performing this initial dimensionality reduction step was to reduce the tendency to overfit the logistic regression model to noisy individual neurons and instead to emphasize shared variability. After separately applying GPFA for each stimulus configuration condition separately, we recombined the results for subsequent analyses. The patterns of results and the conclusions were not affected if we restricted our entire analysis to only one stimulus configuration or the other; therefore, we used all the data from both conditions.
In GPFA, the amount of temporal smoothing (i.e., τ, the time scale of the Gaussian processes) was determined in an unsupervised manner from the neural activity (through maximum likelihood estimation using expectation maximization with cross-validation), and not prespecified. Thus, differences across brain areas in how quickly latent variables change over time reflect inherent differences in the activity across areas. To confirm this, we repeated the main analyses using FA, which does not include temporal smoothing. We found similar qualitative results as with GPFA, although the results tended to be noisier without temporal smoothing, which motivated us to use GPFA instead of FA in the first place. Thus, our results and conclusions are robust to the use of temporal smoothing.
Decoding attention state
We used binomial logistic regression models to estimate the probability of which stimulus location was cued (i.e., attended) using the GPFA latent variables obtained from the population activity from each brain area (Fig. 5). Specifically, we used the MATLAB function mnrfit to find the that optimizes the following equation:
where is the vector of 10-dimensional GPFA scores at time t. For each brain area, we fit a separate model at each time point during the stimulus-aligned data segments. We used 10-fold cross-validation to quantify the classification performance of the cue condition (Fig. 4C), as well as all subsequent analyses involving the decoded attention state. Where indicated in the text, we used the single decoder from the time point with the best performance and applied that decoder to the data from all time points, which enables a more straightforward comparison between time points. The pattern of results and conclusions was unchanged if we used a separate decoder trained at each time point for such analyses. To measure cross-temporal generalization of our decoders (Fig. 5C,D), we measured the Pearson product-moment correlation coefficient across trials between P(cueRF) values obtained using a decoder trained on data from one time point (training time), on data from another time point (test time), and P(cueRF) values obtained using decoders trained and tested at the same time point (in a cross-validated manner).
Attention state prediction stability
We sought to quantify how much the estimated attention state in each brain area fluctuated from moment to moment. Because the output of the logistic regression decoder was in units of probability (i.e., that the RF was attended), these values were bounded between zero and one. This boundedness would have created a compression of the moment-to-moment variation near those extreme values. To mitigate this ceiling/floor effect, we transformed the probabilities into z-scores using a normal inverse cumulative distribution function. We then computed the attention prediction stability, st, as one minus the absolute change in the z-transformed probability estimate from one time point to the next (Fig. 6) as follows:
Thus, the maximum value for prediction stability is one, which indicates no change in the attention prediction value over time. To compare the measured stability values to what might be expected if there was no temporal structure in the data, we randomly shuffled the time points of our data within each session, measured the stability of the shuffled data, and then averaged within each session. We repeated this shuffling procedure several times and found that because each session contained hundreds of thousands of time points, the same average stability resulted for each iteration to a precision of <0.0001, so we simply used that value. We then rescaled our observed stability measurements so that a value of 0 equaled the least average stability that was found in our shuffled dataset. Therefore, a value of 1 indicates perfect stability, and a value of 0 indicates the least expected stability if there was no temporal structure. Note that stability can be negative. To ease comparison of P(cueRF) across time, for this analysis of instantaneous stability and the analysis of Granger causal influence (GCI) we used the decoder for each brain area from the time point with the best decoding performance for each session and applied it to the data across all the time points. The qualitative pattern of results and conclusions were the same if we used a separate decoder trained at each time point but would be more difficult to interpret as it would be ambiguous as to whether instability was accounted for by a change in decoder, a change in activity patterns, or both.
Granger causal influence
To estimate how much future information about the attention state in one brain area was predicted from the past history of the other brain area, we performed a GCI analysis using the Multivariate Granger Causality (MVGC) toolbox for MATLAB (Barnett and Seth, 2014; Fig. 7). Although this analysis is traditionally known as Granger causality, we caution that mechanistic causality may not be validly inferred as the procedure is strictly observational and the potential role of unobserved variables cannot be ruled out. Traditionally, GCI analysis involves estimating parameters for both a full and a reduced (nested) autoregressive model. The reduced model aims to predict the future state of one signal (the target, y; in our case the P(cueRF) estimated from activity in one brain area) using the past history of that signal as follows:
where n is a specified model order and η is residual error. In our model for V4, for example, the reduced model aimed to predict the value of P(cueRF) in V4, and it did so by using the preceding values of P(cueRF) from V4 in each trial (up to the model order and excluding any points in time that would span across trials).
The full model aims to predict the future state of the target signal using the past history of the target signal and also an additional signal (the source, x; in our case the P(cueRF) given the activity of the other brain area) as follows:
In this full model for V4, the current value of P(cueRF) in V4 was predicted by the preceding values of P(cueRF) from both V4 and PFC in each trial (up to the model order and, again, excluding any points that span across trials). Note that the model order n is the same for both the reduced and full models.
If the full model predicts the target signal better than the reduced model, then this implies that the source signal carries additional information about the future state of the target. We quantified the GCI as the percentage of residual error from the reduced model, which was accounted for by the full model.
Because GCI analysis assumes stationarity, we restricted our GCI analysis to the following time windows relative to each stimulus onset where this assumption was most reasonable: (1) a 200 ms time window before stimulus onset (−290 to −90 ms) and (2) a 200 ms time window during the sustained stimulus response (i.e., after the initial onset transient; 150–350 ms). Our two signals of interest for this analysis were the time series of P(cueRF) for each of the two brain areas. We first estimated an appropriate autoregressive model order (i.e., the number of 20 ms time bins of history used to predict the target) for each session by estimating optimal parameters for each model order from 1 to 6 (by ordinary least-squares regression) and then calculating the Bayes information criterion (BIC) for each model. The model order with the best BIC was taken as the best model order (almost always order 4, sometimes 3 or 5, rarely 2). Our general pattern of results was robust to deviations in this estimated optimal model order value. For example, if we performed our analysis setting the model order of all sessions to be 3, 4, 5, or 6, the qualitative pattern of results was highly similar to that of our main result. In cases where our estimated autoregressive model parameters resulted in an unstable process (i.e., a spectral radius ≥1, which is not realistic for brain activity), we rescaled the parameters to stabilize the process (using the var_specrad function of the MVGC toolbox).
Experimental design and statistical analyses
For details on subjects, including justification of number used, see above, Subjects. For details on the task design, including independent variables and the numbers of trials per condition, see above, Visual change-detection task. For details on the number of neurons recorded, including exclusion criteria, see above, Microelectrode array recordings.
To test differences between cue conditions for behavioral accuracy (Fig. 1C) and reaction time (Fig. 1D), we used two-tailed dependent samples Student's t tests (24 sessions for Monkey P and 23 sessions for Monkey W) with α = 0.05.
To test for differences between the characteristic time scales of the latent factors estimated by GPFA for the two brain areas of interest, we used a two-tailed independent samples Student's t test (two brain areas, each with 470 observations of latent factors, 47 sessions with 10 factors each) with α = 0.05.
To test the performance of our logistic regression decoders, we used the receiver operating characteristic (ROC; Fig. 4C). For each session, we estimated P(cueRF) at every time point for every trial (with cross-validation; see above, Decoding attention state) for both brain areas. Then, we calculated the area under the curve (AUC) of ROC values at each time point, yielding a time series of AUC values for each session. We then tested the results of the 47 sessions against the null hypothesis chance value of AUC = 0.5 with a two-tailed, one sample Student's t test at each time point with α = 0.05, corrected for multiple comparisons with Bonferroni's method.
To test for differences in cross-temporal generalization of our decoders between brain areas (Fig. 5E), we applied Fisher's r to z transformation to correlation values then tested against a null hypothesis of no difference between brain areas with a two-tailed dependent samples Student's t test (N = 47 sessions, α = 0.05, Bonferroni corrected for the number of time point pairs).
To test the difference in prediction stability between brain areas (Fig. 6C), we used a two-tailed dependent samples Student's t test (47 sessions) with α = 0.05, corrected for multiple comparisons with Bonferroni's method.
For testing GCI values (Fig. 7), we used a nonparametric permutation test. For each session, we pseudorandomly permuted the trial order for PFC data, holding the trial order for the V4 data constant and taking care to ensure that no permuted trial ever ended up in its original sequence position by chance. Because in this permutation the within-trial order of the time points remained the same, it did not affect the prediction from the reduced model. Instead, it simply destroyed the ability of the added brain area to increase the prediction in the full model. This permutation procedure reflects the null hypothesis that there is no directed information flow between brain areas (i.e., the two brain areas are independent, in which case permuting the trial order should make no difference). We then recalculated GCI for the trial-order-permuted dataset (see above, Granger causal influence for all steps, including model selection and parameter estimation). This procedure was iterated 10,000 times for each session to yield a distribution of GCI values reflecting the null hypothesis. We also generated a null hypothesis distribution for GCI asymmetry by taking the ratio of the PFC-to-V4 null hypothesis distribution to the V4-to-PFC null hypothesis distribution. Observed GCI values were considered significant if they were more extreme than 5% of the permutation test distribution, Bonferroni corrected for the number of sessions tested. For testing the distribution of GCI asymmetry across sessions against a null hypothesis of zero asymmetry, we used a two-tailed one-sample t test with α = 0.05.
Results
We recorded neural population activity from PFC and V4 simultaneously in the same hemisphere of two adult male monkeys (Macaca mulatta; Fig. 1A) while they performed a spatial selective attention task (Fig. 1B). The animals were more accurate (Fig. 1C) and faster (Fig. 1D) at discriminating orientation changes at a high-target-probability location (cue-valid targets) than at a low-target-probability location (cue-invalid), confirming that they selectively attended to the high-target-probability location during the task.
Single-neuron attention signals
The firing rate of individual neurons differed depending on whether the RF location had a high target probability (i.e., was likely attended). As we previously reported (Snyder et al., 2018), most V4 neurons fired more vigorously in response to an attended stimulus than an unattended stimulus (Figure 2A). We did not observe a difference in the average spontaneous firing rate in V4 between attention conditions. In PFC, the average firing rate was slightly more vigorous toward the end of the stimulus interval when attention was directed ipsilaterally compared with when it was directed contralaterally (Fig. 2B). As in V4, average spontaneous firing rates in PFC did not differ between attention conditions.
The patterns of attention modulations observed for individual neurons diverged greatly from the population averages, however. For example, in V4 some neurons that had greater stimulus-related firing rates with attention were in contrast suppressed by attention in the absence of a stimulus (Fig. 2C, top). Other V4 neurons fired comparatively less in response to an attended stimulus than an unattended stimulus (Fig. 2C, middle). Specifically, we found that for 19% of neurons in our sample that had a significant attention effect both before and during the stimulus, the direction of that effect changed between the two time periods of interest (Snyder et al., 2018). This pattern of results suggests that the population code for attention in V4 is dynamic and depends on the stimulus context. For PFC, although the average firing rate was similar between attention conditions, individual neurons showed strong and stable attention effects in different directions (Fig. 2D). Some PFC neurons showed fairly constant firing rates over time (i.e., not stimulus dependent) that differed between attention conditions (Fig. 2D, top). Firing of other PFC neurons was suppressed (Fig. 2D, middle) or facilitated (Fig. 2D, bottom) by stimuli but nonetheless showed consistent differences between attention conditions over time. For example, the proportion of PFC neurons with significant attention effects both before and during the stimulus that changed the direction of their effect between the two time periods was significantly smaller than that for V4 (PFC, 59/1027, 5.4%; V4, 96/404, 19.2%; ).
Population-level attention coding
Population-level analyses can reveal dynamics or stability of neural codes hidden at the single-neuron level (Murray et al., 2017; Snyder et al., 2018), so we tested for this possibility. We first reduced the dimensionality of the activity from each population using GPFA (Yu et al., 2009; Fig. 3A). This allowed us to focus subsequent decoding analyses on variance that was shared among neurons within each population, as well as to avoid overfitting when decoding (the qualitative pattern of results was similar if performed on the full population data, although with worse performance because of overfitting). It is important to note that we performed dimensionality reduction after subtracting the trial-averaged response of each neuron from each trial. That is, we were not interested in dynamics simply because of the average stimulus response but rather sought to emphasize nonstimulus-related activity. In our case, the fitted Gaussian processes for the latent factors that best explained the V4 population activity typically had faster characteristic time scales than the latent factors that best explained the PFC population activity (Fig. 3B), indicating that V4 activity changed more quickly than did PFC activity [V4 time scale: τ = 94.63 ± 46.14 ms, mean ± SD; PFC time scale: τ = 201.82 ± 125.80 ms; independent samples t test: t(938) = – 17.24, p = 1.18 · 10– 58, N = 940, 470 factors (i.e., 10 factors for each session) × 2 brain areas]. This suggests that V4 and PFC have different intrinsic time scales, in line with our guiding thesis as well as previous findings (Murray et al., 2014; Runyan et al., 2017).
Although the latent factors for V4 varied more quickly than those for PFC, this does not necessarily mean that the population code for attention in V4 is less stable than in PFC, as previous research has shown encoded information can remain stable despite highly dynamic population activity (Murray et al., 2017; Parthasarathy et al., 2019). Thus, we next applied a decoding approach to the latent factors to assay and compare the time courses of encoded attention information in the two brain areas. We used logistic regression to decode whether the RF (or the opposite hemifield) had been cued from the neural population activity in each brain area (Fig. 4). Because the question of code stability is only interesting if the code carries predictive value, we confirmed our decoders performed significantly better than chance in both brain areas at all time points during a trial (Fig. 4C; t test, p < 0.05 at all time points, Bonferroni corrected). Importantly, we were able to decode attention states even during prestimulus periods during which population average firing rates were quiescent and similar between attention conditions (Fig. 2A). Having established that both V4 and PFC encoded the attention state throughout the experiments, we next asked if we could establish the comparative dynamics of those codes.
Dynamics of population attention codes
In subsequent analyses, rather than decoding a binary value (i.e., cue-in-RF or cue-away), we focused on a more sensitive metric—the probability that the RF had been cued (P(cueRF)) as an estimate of the attention state. This value was obtained from the logistic regression described above and varies between 0 and 1, where 0 corresponds to cue-away and 1 corresponds to cue-in-RF. Because we were especially interested in the temporal stability of attention signals, we next examined the average time course of P(cueRF) in each brain area for trials from each attention condition. On trials for which the RF had been cued, the average P(cueRF) was greater than the equivocal level of 0.5 (Fig. 5A). On average, the P(cueRF) using V4 activity started near low values and then increased to a peak during the later half of the stimulus presentation before falling again to low levels. In contrast, the average P(cueRF) in PFC was more consistent over time through the trial. On trials for which the RF was not cued, a similar time course was seen when estimating the probability that the cue had been away from the RF (P(cueaway)), but with weaker overall prediction strengths (Fig. 5B). This qualitative pattern of results suggests that the average readouts of the attention code are more dynamic in V4 than in PFC.
In the analysis described above, we trained an independent decoder at each time point during the peristimulus interval. This enabled us to assess how consistent the decoder weights were over time for each brain area. In other words, we asked, regardless of any change in P(cueRF), does the pattern of population activity that best encodes the attention state change over time? One way to test this would be to measure correlations between pairs of β weight vectors measured at different time points. Strong correlations persisting over time would be consistent with a stable code, whereas correlations that weakened more rapidly over time would suggest a more dynamic code. We found that β weight vectors in PFC were indeed more strongly correlated over a longer time range than β weight vectors in V4 (data not shown). However, it could be the case that small changes in β weights lead to large changes in decoding performance or that large changes in β weights lead to small changes in decoding performance. That is, it is difficult to interpret the importance of β weight changes by directly comparing these values. Rather, we reasoned it would be more informative to test the ability of decoders to generalize over time. Thus, we measured the correlation across trials among P(cueRF) values obtained using the data from one time point and the β weights from another time point and P(cueRF) values obtained using data and β weights from the same time point (in a cross-validated manner; Fig. 5C,D). For V4, the decoded P(cueRF) values were highly correlated only between nearby time points, which indicates that the attention code in V4 changed over time (Fig. 5C). In contrast, the decoded P(cueRF) values for PFC were correlated over a significantly greater time range than in V4 (Fig. 5D,E), indicative of a more stable attention code.
Our previous analyses considered the trial-averaged readout of the attention code over time and found that it was more stable in PFC than in V4. However, it may have yet been the case that the P(cueRF) in PFC was highly dynamic on individual trials but appeared stable when averaged across trials. To test for this possibility, we examined the stability of P(cueRF) on individual trials for each brain area. At each time point, we calculated the prediction stability as the negative absolute change in P(cueRF) from the preceding time point (Fig. 6A). We scaled the resultant stability values so that a value of 1 indicated perfect stability (i.e., no change in P(cueRF) over time), and a value of 0 indicated the expected value if time points were independent. For both animals, the prediction stability was substantially greater in PFC than in V4 (Fig. 6B,C; t test, p < 0.05 at all time points, Bonferroni corrected). This included prestimulus time points, indicating that differences in P(cueRF) stability were not trivially because of different dynamics of stimulus responsiveness. We note also that although stability dropped transiently in V4 around stimulus onset, in general, stability in V4 was at similar levels during spontaneous and stimulus-evoked activity, indicating stability was not trivially related to overall activity levels. We also found that when comparing the average stability between the two brain areas on the same trial, PFC was consistently more stable than V4 (121,283 total trials, , mean ± SD, t(121282) = 395.46, p ≈ 0, paired t test). These results show that even on single trials, the decoded attention state in PFC was more stable than in V4.
Directionality of attention codes
Our guiding thesis for this study was that a stable attention code in PFC may serve as a consistent beacon of the attention state that may guide the dynamic behavior of sensory populations such as V4. Thus, we asked if there was evidence for a greater directed flow of the encoded attention state information from PFC to V4. To test this, we calculated the GCI between the estimated attention states in PFC and V4 (Fig. 7A). Intuitively, GCI estimates how much the future of one signal can be (linearly) predicted from the past history of another signal, beyond what was predictable from the first signal alone (Barnett and Seth, 2014), and can be quantified as the degree of improvement in that prediction (i.e., as percent residual error explained). Note that GCI does not directly imply mechanistic causality because the role of unobserved variables cannot be ruled out, but it does suggest that the potential for a causal relationship exists. In our case, we wanted to ask how well we could predict the P(cueRF) of V4 neurons based on the history of those neurons alone compared with the history of both V4 and PFC neurons combined. By also making the analogous comparison in the complementary direction (PFC alone or PFC with V4), we could assess whether changes in PFC were more predictive of what happened next with the attention state in V4, or whether changes in V4 were more predictive of what happened next with the attention state in PFC.
We performed our analysis of GCI between PFC and V4 during two time windows we have focused on before: one before stimulus onset (−290 to −90 ms) and one during the sustained response to the stimulus (150 to 350 ms). In each 200 ms time window of interest, the goal was to test how well we could predict the P(cueRF) in each 20 ms time bin based on past history within that larger time window. We chose these time windows for two reasons. First, GCI analysis assumes that the signals are stationary, an assumption that would be violated during the transition from the unstimulated to the stimulated state but was more reasonable during these two time windows before and after that transition. Second, they represent two distinct contexts of attention function—the anticipatory period when an attentional state of readiness is prepared and the period of stimulus processing when the benefits of that readiness are conferred. We found evidence for GCI in the PFC-to-V4 direction and/or the V4-to-PFC direction in many sessions (Fig. 7B,D), indicating that incorporating moment-to-moment knowledge of one brain area aided in prediction of upcoming activity in the other. The absolute quantity of residual variance accounted for by GCI was small (<1%), but this is attributable in large part to the difficulty in explaining the variance in the brief time periods of our analysis (20 ms bins). Most importantly, our measurements of GCI were substantially greater (by a factor of >2) in the PFC-to-V4 direction than in the V4-to-PFC direction, both during the prestimulus anticipatory period (Fig. 7B,C), as well as during the sustained stimulus response (Fig. 7D,E). These results are consistent with a framework in which the attention state in PFC influences the future attention state in V4 more than vice versa, in line with the hypothesized role of PFC in the executive control of endogenous selective attention.
Discussion
We found that the population code for attention in PFC was substantially stabler than the population code for attention in V4. Although evidence has been mounting for this type of areal specialization for temporal stability (Murray et al., 2014), previous studies have typically compared neural recordings from different brain areas measured in separate animals performing separate tasks, often in separate laboratories, limiting direct comparison. Furthermore, previous work has often analyzed trial-averaged activity from populations of neurons that were sometimes not simultaneously recorded. Without direct observation, the hypothesized division of labor of stable and unstable computational processes across cortical areas based on the intrinsic time scales of dynamics had been speculative. Our simultaneous recordings show that even moment to moment within an animal, neural codes for attention in PFC are more stable than those in V4.
We also found that the attention code in PFC carried more information from moment to moment about the future attention state of the V4 populations than vice versa. This type of causal role for attention has previously been suggested for FEF (Bressler et al., 2008; Squire et al., 2013), for which strong electrical stimulation causes eye movements to a particular location (movement field), whereas weaker electrical stimulation leads to attention-like gain modulations in extrastriate neurons sensitive to the movement field (Moore and Fallah, 2001), a pattern of results lending support to the so-called premotor theory of covert attention. The current results suggest an extension of this causal role to more rostral populations, which, in our hands, did not lead to eye movements under strong electrical stimulation (see above, Materials and Methods). We caution, however, that our observational GCI analyses should be taken merely as a suggestion that the potential for a causal relationship exists, and we cannot rule out alternatives such as both PFC and V4 receiving influence from a third, unobserved brain area. Together, however, our results provide the first evidence that PFC provides a stable attention code that leads a dynamic attention code in V4 at a time scale of tens of milliseconds, consistent with the hypothesized role of PFC as a source of top-down endogenous attention control signals in primate cerebral cortex.
For a system to be stable entails dynamics that are robust to outside perturbation. Such stability would be adaptive for maintaining cognitive states in mind while actively exploring a dynamic environment. From this perspective, the finding of a stable representation of a long-term attention state (as in PFC) is intuitive. A more challenging question concerns the corollary, that is, Why is the code for attention states in V4 dynamic? A prominent view of the neural correlates of attention is an enhancement of the activity of neurons tuned for relevant information that comes at the expense of suppressed activity for neurons processing less-relevant information through a normalized gain mechanism (Reynolds and Heeger, 2009). That is, each neuron shows a gain in responses that is roughly proportional to the alignment of the tuning curve of that neuron to the attended information. Under the simplest version of this framework, attention would be viewed as a stable modulation of activity levels; that is, the gain modulation in V4 would remain constant while attention is held constant. The current results are incompatible with this view because even during brief periods of spontaneous activity on single trials, we found attention signals to be dynamic on time scales much briefer than those that have been found to characterize attention shifts (on the order of 200–300 ms; Posner, 1980; VanRullen et al., 2007). Instead, our findings suggest a key role for dynamic coding of attention in sensory cortex; that is, if response gain is a key mechanism of attention, it is not applied equally to all sensory neurons at all times.
One potential explanation of dynamic attention codes in V4 is simply that they are a side effect of the function of sensory cortex to respond to sensory input. This is not to say that dynamic responses to sensory events trivially manifest dynamic attention signals. Indeed, in our experiment we took two steps to rule out a potentially trivial relationship between sensory response dynamics and attention signals in V4. First, we performed our analyses on residual firing rates after the average stimulus response had been subtracted, removing the greatest contribution of sensory response dynamics from the data. Second, we calculated P(cueRF) stability moment to moment to hold stimulus context relatively constant (Fig. 6C). The strongest example of this is that we found P(cueRF) stability differed among brain areas during spontaneous activity when there was no stimulus response, ruling out a difference in stimulus processing dynamics as a trivial explanation for the difference in attention signal dynamics. Instead, our results are consistent with a more interesting relationship in which the properties of sensory networks that enable them to be responsive predispose them to dynamic attention signaling independent of sensory input. For example, if a stable activity pattern were imposed on sensory populations, even weakly, this stabilizing influence would maladaptively reduce the sensitivity of those populations to sensory input. Indeed, evidence indicates that stable dynamics because of synchronizing input are associated with suppressed sensory processing, in particular in the form of 8–10 Hz oscillations measured as field potentials or electroencephalogram (Kelly et al., 2006; Bollimunta et al., 2011; Snyder et al., 2015). Although this provides a simple explanation for the presence of dynamic attention codes in V4, we can further consider whether such dynamic codes provide additional function.
One context where such a dynamic coding scheme might make sense is to distinguish between the stimulated and unstimulated state. Neural populations in visual cortex face different computational demands in the presence versus the absence of a stimulus. A pattern of neural activity that functions to enhance representation during visual processing may be inappropriate if instead there was no stimulus to process. In support of this notion, we and others reported that the pattern of modulations across the V4 population differed fundamentally between spontaneous and visual-evoked activity (Sani et al., 2017; Snyder et al., 2018). These observations are also consistent with human neuroimaging work, where attentional modulations of visual cortical areas are greatest during sensory processing, whereas attention modulations in frontal and parietal association cortices are in contrast greatest during nonsensory periods (Kastner et al., 1999; Hopfinger et al., 2000; Corbetta and Shulman, 2002). Human neuroimaging work further aligns with our Granger causality findings suggesting a top-down role of frontal anticipatory signals influencing sensory areas (Bressler et al., 2008). The stimulus-context-dependent population codes that we observed may provide a means to multiplex multiple signals within a neural population, enabling separable representations of sensory and cognitive information, such as expectation (Rungratsameetaweemana et al., 2018; Rungratsameetaweemana and Serences, 2019) or motor intention (Kaufman et al., 2014; Elsayed et al., 2016).
Even within the context of processing visual stimuli, there is a growing appreciation that attention is an inherently dynamic process. Evidence suggests that attention is automatically reoriented several times per second to sample multiple sources of sensory information (James, 1890; Posner, 1980; VanRullen et al., 2007; Fiebelkorn et al., 2011; Landau and Fries, 2012). These automatic attention cycles are associated with interactions occurring around the 3–5 Hz theta frequency range across a network including lateral prefrontal and parietal areas (Fiebelkorn et al., 2018), anterior cingulate cortex (Voloh et al., 2015), multiple visual cortical areas (Spyropoulos et al., 2018), and pulvinar nucleus of the thalamus (Fiebelkorn et al., 2019). The frequency of these reorientation cycles is similar to that of the periodic planning and execution of saccadic eye movements in primates, which researchers have also linked to attention-like perceptual effects (Hafed and Clark, 2002; Hafed et al., 2011; Hafed, 2013; Lowet et al., 2018). Although the time scale of these effects are too slow to fully explain our results, it underscores that attention and perception rely fundamentally on dynamic processes. Such periodic movements and covert shifts in perception may have evolved to help us explore our environment, even in the face of particularly salient or engrossing stimuli. This ethological tendency to regularly shift attention is sensible in natural contexts, which are highly dynamic. In contrast, our task was relatively static in that it required subjects to attend for long periods to a consistent location while target probabilities and reward expectations were held constant. Thus, one important remaining question for further study concerns how the division of labor of stable and dynamic attention codes between prefrontal and sensory cortex that we found operates in more naturalistic contexts.
Even within a period of relatively constant attention, the underlying mechanisms may be dynamic on fast time scales reflecting differential modulation of specific stages of sensory processing (Sripati and Johnson, 2006; Sani et al., 2017; Snyder et al., 2018). For example, Sani et al. (2017) found that different types of attention gain manifested in V4 activity across time during sensory responses; an early contrast gain followed by a stimulus-tuned multiplicative gain peaking ∼150 ms after stimulus onset, and then another later round of contrast gain. Those authors interpreted the gradual emergence of stimulus-dependent multiplicative effects that they observed as consistent with a mechanism in which relatively weak top-down input becomes amplified locally through a time-consuming dynamic process. Our current results are also consistent with this notion.
This faster time scale for dynamic attention in V4 is reminiscent of the emergence of feature-selectivity in visual cortex. For example, Ringach et al. (1997) used a reverse correlation approach to measure the temporal development of orientation selectivity in primary visual cortex of monkeys. They found that although neurons in input layers showed relatively static orientation tuning preferences, the orientation preferences of neurons in output layers continued to change over time, in many cases preferring one orientation at short delays but the orthogonal orientation at longer delays, or in some cases developing multimodal orientation preferences. The authors compared the predictions of feedforward and feedback models to conclude that the dynamics of orientation preferences in output layers were likely because of feedback influences, and they surmised the slower complex tuning preferences might encode subtler features of images. One implication of this pair of observations—dynamic attention and dynamic feature-selectivity—is that one potential mechanism for feature-based attention would be to preferentially allocate resources to the time period in the sequence of processing steps when selectivity for that feature emerges. One testable prediction of this hypothesis is that attention directed selectively toward the subtler features of images to which Ringach et al. (1997) alluded should manifest later during the sensory response when tuning for those features emerges, which may account for why attention modulation of sensory responses typically develops well after the initial stimulus-onset transient response (Mehta et al., 2000).
The current results are consistent with a framework in which maintaining stable task-set representations relies on brain areas with intrinsically stabler dynamics (Murray et al., 2014; Runyan et al., 2017); as we showed here for PFC populations. This particular brain area has been linked to many processes unfolding on slow time scales, including categorical reasoning (Freedman et al., 2003; Cromer et al., 2011; McKee et al., 2014), flexible rule-based decision-making (Bunge et al., 2003; Siegel et al., 2015), working memory (Constantinidis et al., 2018; Wasmuht et al., 2018; Parthasarathy et al., 2019), and attention (Ikkai and Curtis, 2011; Paneri and Gregoriou, 2017). It is important to note that the stability in this case resides not at the level of individual neuron firing rates, which in fact vary substantially over time, but rather at a population-level readout. This type of population-level stability has previously been reported for PFC populations in the context of working memory. For example, individual PFC neurons encode mnemonic information predominantly transiently during maintenance periods, with peak latencies distributed to tile the entire interval (e.g., Hussar and Pasternak, 2012, 2013; Spaak et al., 2017). In a compelling example, Tremblay et al. (2015) recorded populations of PFC neurons in monkeys performing a spatial attention task and found that patterns of ensemble activity encoding attention states generalized across time periods within a trial and even across multiple days over a month of recording sessions. This type of population-level stability despite single-cell variability may enable individual PFC neurons to temporally multiplex several concurrent processes while maintaining stable signals at the population level (Mante et al., 2013). Indeed, the activity of prefrontal and sensory populations have been linked to a number of cognitive processes other than selective attention relevant for our task, such as monitoring choice history (Mochol et al., 2021), arousal (Cowley et al., 2020), and reward expectation (Roesch and Olson, 2003), which are likely contributing to the mixtures of population activity that we observed. Although it is often difficult to disentangle these cognitive processes in the context of a given task (Maunsell, 2004), some studies have suggested distinct neural and behavioral signatures for attention compared with sensory expectation (Summerfield and de Lange, 2014; Rungratsameetaweemana et al., 2018; Rungratsameetaweemana and Serences, 2019), reward valuation (Baruni et al., 2015), arousal (Luo and Maunsell, 2019; Cowley et al., 2020), and other related processes. The current experiment cannot tease apart the precise contributions of these various cognitive processes because we did not vary the expected value of rewards separately from the attention cue (Thus, reward and expectation were confounded with attention.), and the animals almost always chose the cued target when making actions (Thus choice history and cue were also confounded.). A future study that separately manipulates rewards within the context of a given attention condition could further dissect the nuances of frontosensory interactions during cognition.
Perhaps our most novel finding is that population attention codes in V4 vary rapidly, and the moment-to-moment interactions between PFC and V4 have apparent control over the attention state. One lingering question remains whether the details of these population-level dynamics are specifically germane for behavior. For example, one reasonable but unintuitive prediction would be that stable attention signals in prefrontal cortex would be beneficial for behavior, whereas stable attention signals in sensory cortical areas such as V4 may actually be detrimental to behavior, as they may undermine responsiveness to external events. Another reasonable prediction is that shifts of attention (e.g., to detect invalidly cued targets) might be accompanied by rapid dynamics of attention signals in both V4 and PFC. The current study was not designed to address these predictions because targets were infrequent, and reactive saccades to targets prohibited disentanglement of sensory and motor signals; a future study with delayed responses and a greater proportion of targets could test these predictions and other issues related to target processing. Together, our results suggest that different cortical areas implicated in attention control are specialized to function in different regimes of stability to resolve the competing demands of attention; stable representations of task variables are maintained on long time scales in prefrontal cortical populations, freeing up sensory cortical populations to maintain the sensitivity to outside perturbation essential for efficient perception.
Footnotes
A.C.S. was supported by National Institutes of Health (NIH) Grant K99/R00EY025768 and a National Association For Research on Schizophrenia and Depression Young Investigator Award from the Brain and Behavior Research Foundation. B.M.Y. and M.A.S. were supported by NIH Collaborative Research in Computational Neuroscience Grant R01MH118929, National Science Foundation (NSF) Grant Neural and Cognitive Systems / Behavioral and Cognitive Sciences 1954107/1734916, and NIH Grant R01EB026953. B.Y. was supported by Simons Foundation Grant 543065, NIH Grant R01HD071686, NIH Collaborative Research in Computational Neuroscience Grant R01NS105318, and NSF Grant NCS BCS 1533672. M.A.S. was supported by NIH Grants R01EY022928 and P30EY008098, Research to Prevent Blindness, and the Eye and Ear Foundation of Pittsburgh. We thank Samantha Schmitt for assistance with surgery and data collection.
The authors declare no competing financial interests.
References
- Barnett L, Seth AK (2014) The MVGC multivariate Granger causality toolbox: a new approach to Granger-causal inference. J Neurosci Methods 223:50–68. [DOI] [PubMed] [Google Scholar]
- Baruni JK, Lau B, Salzman CD (2015) Reward expectation differentially modulates attentional behavior and activity in visual area V4. Nat Neurosci 18:1656–1663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bollimunta A, Mo J, Schroeder CE, Ding M (2011) Neuronal mechanisms and attentional modulation of corticothalamic α oscillations. J Neurosci 31:4935–4943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bressler SL, Tang W, Sylvester CM, Shulman GL, Corbetta M (2008) Top-down control of human visual cortex by frontal and parietal cortex in anticipatory visual spatial attention. J Neurosci 28:10056–10061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bunge SA, Kahn I, Wallis JD, Miller EK, Wagner AD (2003) Neural circuits subserving the retrieval and maintenance of abstract rules. J Neurophysiol 90:3419–3428. PMID: . [DOI] [PubMed] [Google Scholar]
- Chen M, Yan Y, Gong X, Gilbert CD, Liang H, Li W (2014) Incremental integration of global contours through interplay between visual cortical areas. Neuron 82:682–694. 10.1016/j.neuron.2014.03.023 [DOI] [PubMed] [Google Scholar]
- Constantinidis C, Funahashi S, Lee D, Murray JD, Qi X-L, Wang M, Arnsten AFT (2018) Persistent spiking activity underlies working memory. J Neurosci 38:7020–7028. 10.1523/JNEUROSCI.2486-17.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corbetta M, Shulman GL (2002) Control of goal-directed and stimulus-driven attention in the brain. Nat Rev Neurosci 3:201–215. 10.1038/nrn755 [DOI] [PubMed] [Google Scholar]
- Cowley BR, Snyder AC, Acar K, Williamson RC, Yu BM, Smith MA (2020) Slow drift of neural activity as a signature of impulsivity in macaque visual and prefrontal cortex. Neuron 108:551–567. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cromer JA, Roy JE, Buschman TJ, Miller EK (2011) Comparison of primate prefrontal and premotor cortex neuronal activity during visual categorization. J Cogn Neurosci 23:3355–3365. PMID: . [DOI] [PMC free article] [PubMed] [Google Scholar]
- Desimone R, Duncan J (1995) Neural mechanisms of selective visual attention. Annu Rev Neurosci 18:193–222. 10.1146/annurev.ne.18.030195.001205 [DOI] [PubMed] [Google Scholar]
- Druckmann S, Chklovskii DB (2012) Neuronal circuits underlying persistent representations despite time varying activity. Curr Biol 22:2095–2103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Elsayed GF, Lara AH, Kaufman MT, Churchland MM, Cunningham JP (2016) Reorganization between preparatory and movement population responses in motor cortex. Nat Commun 7:13239. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiebelkorn IC, Foxe JJ, Butler JS, Mercier MR, Snyder AC, Molholm S (2011) Ready, set, reset: stimulus-locked periodicity in behavioral performance demonstrates the consequences of cross-sensory phase reset. J Neurosci 31:9971–9981. 10.1523/JNEUROSCI.1338-11.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiebelkorn IC, Pinsk MA, Kastner S (2018) A dynamic interplay within the frontoparietal network underlies rhythmic spatial attention. Neuron 99:842–853. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiebelkorn IC, Pinsk MA, Kastner S (2019) The mediodorsal pulvinar coordinates the macaque fronto-parietal network during rhythmic spatial attention. Nat Commun 10:215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Freedman DJ, Riesenhuber M, Poggio T, Miller EK (2003) A comparison of primate prefrontal and inferior temporal cortices during visual categorization. J Neurosci 23:5235–5246. 10.1523/JNEUROSCI.23-12-05235.2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fyall AM, El-Shamayleh Y, Choi H, Shea-Brown E, Pasupathy A (2017) Dynamic representation of partially occluded objects in primate prefrontal and visual cortex. Elife 6:e25784. 10.7554/eLife.25784 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goldberg ME, Bruce CJ (1986) The role of the arcuate frontal eye fields in the generation of saccadic eye movements. Prog Brain Res 64:143–154. [DOI] [PubMed] [Google Scholar]
- Gomez-Ramirez M, Hysaj K, Niebur E (2016) Neural mechanisms of selective attention in the somatosensory system. J Neurophysiol 116:1218–1231. 10.1152/jn.00637.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hafed ZM (2013) Alteration of visual perception prior to microsaccades. Neuron 77:775–786. [DOI] [PubMed] [Google Scholar]
- Hafed ZM, Clark JJ (2002) Microsaccades as an overt measure of covert attention shifts. Vision Res 42:2533–2545. 10.1016/S0042-6989(02)00263-8 [DOI] [PubMed] [Google Scholar]
- Hafed ZM, Lovejoy LP, Krauzlis RJ (2011) Modulation of microsaccades in monkey during a covert visual attention task. J Neurosci 31:15219–15230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hopfinger JB, Buonocore MH, Mangun GR (2000) The neural mechanisms of top-down attentional control. Nat Neurosci 3:284–291. [DOI] [PubMed] [Google Scholar]
- Hromádka T, Zador AM (2007) Toward the mechanisms of auditory attention. Hear Res 229:180–185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hussar CR, Pasternak T (2012) Memory-guided sensory comparisons in the prefrontal cortex: contribution of putative pyramidal cells and interneurons. J Neurosci 32:2747–2761. 10.1523/JNEUROSCI.5135-11.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hussar CR, Pasternak T (2013) Common rules guide comparisons of speed and direction of motion in the dorsolateral prefrontal cortex. J Neurosci 33:972–986. 10.1523/JNEUROSCI.4075-12.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ikkai A, Curtis CE (2011) Common neural mechanisms supporting spatial working memory, attention and motor intention. Neuropsychologia 49:1428–1434. 10.1016/j.neuropsychologia.2010.12.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- James W (1890) The Principles of Psychology. New York: Henry Holt and Company. [Google Scholar]
- Kastner S, Pinsk MA, De Weerd P, Desimone R, Ungerleider LG (1999) Increased activity in human visual cortex during directed attention in the absence of visual stimulation. Neuron 22:751–761. [DOI] [PubMed] [Google Scholar]
- Kaufman MT, Churchland MM, Ryu SI, Shenoy KV (2014) Cortical activity in the null space: permitting preparation without movement. Nat Neurosci 17:440–448. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kelly SP, Lalor EC, Reilly RB, Foxe JJ (2006) Increases in alpha oscillatory power reflect an active retinotopic mechanism for distracter suppression during sustained visuospatial attention. J Neurophysiol 95:3844–3851. [DOI] [PubMed] [Google Scholar]
- Kim T, Bair W, Pasupathy A (2019) Neural coding for shape and texture in macaque area V4. J Neurosci 39:4760–4774. 10.1523/JNEUROSCI.3073-18.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Landau AN, Fries P (2012) Attention samples stimuli rhythmically. Curr Biol 22:1000–1004. [DOI] [PubMed] [Google Scholar]
- Lowet E, Gomes B, Srinivasan K, Zhou H, Schafer RJ, Desimone R (2018) Enhanced neural processing by covert attention only during microsaccades directed toward the attended stimulus. Neuron 99:207–214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luo TZ, Maunsell JHR (2019) Attention can be subdivided into neurobiological components corresponding to distinct behavioral effects. Proc Natl Acad Sci U S A 116:26187–26194. 10.1073/pnas.1902286116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mante V, Sussillo D, Shenoy KV, Newsome WT (2013) Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature 503:78–84. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maunsell JH (2004) Neuronal representations of cognitive state: reward or attention? Trends Cogn Sci 8:261–265. 10.1016/j.tics.2004.04.003 [DOI] [PubMed] [Google Scholar]
- Maunsell JHR (2015) Neuronal mechanisms of visual attention. Annu Rev Vis Sci 1:373–391. PMID: . [DOI] [PMC free article] [PubMed] [Google Scholar]
- McKee JL, Riesenhuber M, Miller EK, Freedman DJ (2014) Task dependence of visual and category representations in prefrontal and inferior temporal cortices. J Neurosci 34:16065–16075. 10.1523/JNEUROSCI.1660-14.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mehta AD, Ulbert I, Schroeder CE (2000) Intermodal selective attention in monkeys. I: distribution and timing of effects across visual areas. Cereb Cortex 10:343–358. 10.1093/cercor/10.4.343 [DOI] [PubMed] [Google Scholar]
- Mochol G, Kiani R, Moreno-Bote R (2021) Prefrontal cortex represents heuristics that shape choice bias and its integration into future behavior. Curr Biol 31:1234–1244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moore T, Fallah M (2001) Control of eye movements and spatial attention. Proc Natl Acad Sci U S A 98:1273–1276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Müller JR, Metha AB, Krauskopf J, Lennie P (2003) Local signals from beyond the receptive fields of striate cortical neurons. J Neurophysiol 90:822–831. 10.1152/jn.00005.2003 [DOI] [PubMed] [Google Scholar]
- Murray JD, Bernacchia A, Freedman DJ, Romo R, Wallis JD, Cai X, Padoa-Schioppa C, Pasternak T, Seo H, Lee D, Wang X-J (2014) A hierarchy of intrinsic timescales across primate cortex. Nat Neurosci 17:1661–1663. 10.1038/nn.3862 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Murray JD, Bernacchia A, Roy NA, Constantinidis C, Romo R, Wang X-J (2017) Stable population coding for working memory coexists with heterogeneous neural dynamics in prefrontal cortex. Proc Natl Acad Sci USA 114:394–399. 10.1073/pnas.1619449114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Osborne LC, Bialek W, Lisberger SG (2004) Time course of information about motion direction in visual area MT of macaque monkeys. J Neurosci 24:3210–3222. 10.1523/JNEUROSCI.5305-03.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pack CC, Born RT (2001) Temporal dynamics of a neural solution to the aperture problem in visual area MT of macaque brain. Nature 409:1040–1042. [DOI] [PubMed] [Google Scholar]
- Paneri S, Gregoriou GG (2017) Top-down control of visual attention by the prefrontal cortex. functional specialization and long-range interactions. Front Neurosci 11:545. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Parthasarathy A, Tang C, Herikstad R, Cheong LF, Yen S-C, Libedinsky C (2019) Time-invariant working memory representations in the presence of code-morphing in the lateral prefrontal cortex. Nat Commun 10:4995–4995. 10.1038/s41467-019-12841-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Posner MI (1980) Orienting of attention. Q J Exp Psychol 32:3–25. 10.1080/00335558008248231 [DOI] [PubMed] [Google Scholar]
- Reynolds JH, Heeger DJ (2009) The normalization model of attention. Neuron 61:168–185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ringach DL, Hawken MJ, Shapley R (1997) Dynamics of orientation tuning in macaque primary visual cortex. Nature 387:281–284. [DOI] [PubMed] [Google Scholar]
- Roesch MR, Olson CR (2003) Impact of expected reward on neuronal activity in prefrontal cortex, frontal and supplementary eye fields and premotor cortex. J Neurophysiol 90:1766–1789. PMID: . 10.1152/jn.00019.2003 [DOI] [PubMed] [Google Scholar]
- Rungratsameetaweemana N, Serences JT (2019) Dissociating the impact of attention and expectation on early sensory processing. Curr Opin Psychol 29:181–186. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rungratsameetaweemana N, Itthipuripat S, Salazar A, Serences JT (2018) Expectations do not alter early sensory processing during perceptual decision-making. J Neurosci 38:5632–5648. 10.1523/JNEUROSCI.3638-17.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Runyan CA, Piasini E, Panzeri S, Harvey CD (2017) Distinct timescales of population coding across cortex. Nature 548:92–96. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sani I, Santandrea E, Morrone MC, Chelazzi L (2017) Temporally evolving gain mechanisms of attention in macaque area V4. J Neurophysiol 118:964–985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shapley R, Hawken M, Ringach DL (2003) Dynamics of orientation selectivity in the primary visual cortex and the importance of cortical inhibition. Neuron 38:689–699. 10.1016/s0896-6273(03)00332-5 [DOI] [PubMed] [Google Scholar]
- Siegel M, Buschman TJ, Miller EK (2015) Cortical information flow during flexible sensorimotor decisions. Science 348:1352–1355. 10.1126/science.aab0551 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith MA, Majaj NJ, Movshon JA (2005) Dynamics of motion signaling by neurons in macaque area MT. Nat Neurosci 8:220–228. [DOI] [PubMed] [Google Scholar]
- Snyder AC, Morais MJ, Willis CM, Smith MA (2015) Global network influences on local functional connectivity. Nat Neurosci 18:736–743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Snyder AC, Yu BM, Smith MA (2018) Distinct population codes for attention in the absence and presence of visual stimulation. Nat Commun 9:4382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spaak E, Watanabe K, Funahashi S, Stokes MG (2017) Stable and dynamic coding for working memory in primate prefrontal cortex. J Neurosci 37:6503–6516. 10.1523/JNEUROSCI.3364-16.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spyropoulos G, Bosman CA, Fries P (2018) A theta rhythm in macaque visual cortex and its attentional modulation. Proc Natl Acad Sci U S A 115:E5614–E5623. 10.1073/pnas.1719433115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Squire RF, Noudoost B, Schafer RJ, Moore T (2013) Prefrontal contributions to visual selective attention. Annu Rev Neurosci 36:451–466. PMID: . [DOI] [PubMed] [Google Scholar]
- Sripati AP, Johnson KO (2006) Dynamic gain changes during attentional modulation. Neural Comput 18:1847–1867. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Summerfield C, de Lange FP (2014) Expectation in perceptual decision making: neural and computational mechanisms. Nat Rev Neurosci 15:745–756. [DOI] [PubMed] [Google Scholar]
- Tremblay S, Pieper F, Sachs A, Martinez-Trujillo J (2015) Attentional filtering of visual information by neuronal ensembles in the primate lateral prefrontal cortex. Neuron 85:202–215. 10.1016/j.neuron.2014.11.021 [DOI] [PubMed] [Google Scholar]
- VanRullen R, Carlson T, Cavanagh P (2007) The blinking spotlight of attention. Proc Natl Acad Sci U S A 104:19204–19209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Voloh B, Valiante TA, Everling S, Womelsdorf T (2015) Theta-gamma coordination between anterior cingulate and prefrontal cortex indexes correct attention shifts. Proc Natl Acad Sci U S A 112:8457–8462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wasmuht DF, Spaak E, Buschman TJ, Miller EK, Stokes MG (2018) Intrinsic neuronal dynamics predict distinct functional roles during working memory. Nat Commun 9:3499. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu BM, Cunningham JP, Santhanam G, Ryu SI, Shenoy KV, Sahani M (2009) Gaussian-process factor analysis for low-dimensional single-trial analysis of neural population activity. J Neurophysiol 102:614–635. [DOI] [PMC free article] [PubMed] [Google Scholar]