Skip to main content
eLife logoLink to eLife
. 2025 Sep 11;14:RP104222. doi: 10.7554/eLife.104222

Dynamic estimation of the attentional field from visual cortical activity

Ilona M Bloem 1,†,, Leah Bakst 1,, Joseph T McGuire 1, Sam Ling 1,
Editors: Ming Meng2, Huan Luo3
PMCID: PMC12425475  PMID: 40932476

Abstract

Navigating around the world, we must adaptively allocate attention to our surroundings based on anticipated future stimuli and events. This allocation of spatial attention boosts visuocortical representations at attended locations and locally enhances perception. Indeed, spatial attention has often been analogized to a ‘spotlight’ shining on the item of relevance. Although the neural underpinnings of the locus of this attentional spotlight have been relatively well studied, less is known about the size of the spotlight: to what extent can the attentional field be broadened and narrowed in accordance with behavioral demands? In this study, we developed a paradigm for dynamically estimating the locus and spread of covert spatial attention, inferred from visuocortical activity using fMRI in humans. We measured BOLD activity in response to an annulus while participants (four female, four male) used covert visual attention to determine whether more numbers or letters were present in a cued region of the annulus. Importantly, the width of the cued area was systematically varied, calling for different sizes of the attentional spotlight. The deployment of attention was associated with an increase in BOLD activity in corresponding retinotopic regions of visual areas V1–V3. By modeling the visuocortical attentional modulation, we could reliably recover the cued location, as well as a broadening of the attentional modulation with wider attentional cues. This modeling approach offers a useful window into the dynamics of attention and spatial uncertainty.

Research organism: Human

Introduction

We bounce attention around all the time. Take, for instance, when we’re monitoring oncoming traffic while driving. It isn’t sufficient to attend to the single most likely source of traffic. Instead, attention adaptively broadens and narrows to cover the anticipated spatial distribution of relevant events. The need to spread attention across different swaths of the visual field is driven, to a large degree, by spatial uncertainty: statistical regularities give us a general sense as to where something useful might happen, and this evolves from moment to moment. We navigate this uncertainty by dynamically deploying spatial attention.

Covert spatial attention improves behavioral performance at attended locations at the cost of performance at unattended locations (Posner, 1980), leading to a common metaphor that spatial selective attention acts as a ‘spotlight’ or ‘zoom lens’ (Shaw and Shaw, 1977; Posner, 1980; Eriksen and St. James, 1986; Carrasco, 2011). This attentional ‘spotlight’ is characterized by a specific size and location and traverses the visual field based on behavioral demands (Eriksen and St. James, 1986; Castiello and Umiltà, 1990), selectively boosting information at the attended location within the visual system while suppressing information elsewhere. Animal studies have observed multiplicative increases in visuocortical neural responses at attended locations (McAdams and Maunsell, 1999; Maunsell, 2015), and human neuroimaging studies have found similar focal modulations of population responses (Kastner et al., 1998; Brefczynski and DeYoe, 1999; McMains and Somers, 2004; Datta and DeYoe, 2009; Sprague and Serences, 2013; Puckett and DeYoe, 2015; Samaha et al., 2016; Shioiri et al., 2016; Bloem and Ling, 2019; Bartsch et al., 2023).

While neural modulation at the locus of attention has been relatively well studied, less is known regarding the neural signatures of the size of the attentional field (Yeshurun, 2019). Spreading attention over a larger region of visual space can decrease behavioral performance, but only a handful of studies have interrogated associated effects within visual cortex (Müller et al., 2003; Herrmann et al., 2010; Itthipuripat et al., 2014; Feldmann-Wüstefeld and Awh, 2020). This is surprising, as the spatial distribution of the attentional field is a key feature in an influential theoretical model of attention (Reynolds and Heeger, 2009). The model assumes that the size of the attentional field can be adjusted based on task demands and that the interaction between attentional field size and stimulus-related factors can predict observed attentional gain effects.

While the studies that have experimentally manipulated the attentional field size found evidence congruent with this prominent theory (Herrmann et al., 2010; Itthipuripat et al., 2014; Kinıklıoğlu and Boyaci, 2022), few studies have directly investigated the spatial extent of the attentional window and its concomitant neural representation. One neuroimaging study revealed that the attentional field expanded in the face of greater task-related uncertainty (Herrmann et al., 2010), while other studies showed that the responsive area of visual cortex increased in size, coupled with a decrease of the overall population response (Müller et al., 2003; Feldmann-Wüstefeld and Awh, 2020). While these studies are consistent with the notion that the attentional field size can be detected in visual cortex, methods for dynamically recovering location and field size from moment to moment are lacking.

In this study, we developed a paradigm that allowed us to dynamically characterize the spatial tuning of spatial attention across the visual field. Using fMRI in humans, we examined whether attentional modulation of the BOLD response spanned a larger area of visual cortex when participants were cued to attend to a larger region of space. Behavioral performance confirmed that participants could successfully allocate their attention to different-sized swaths of the visual field. This deployment of attention was associated with a modulation in cortical activity in the corresponding retinotopic areas of visual cortex. By modeling the location and spread of the visuocortical modulation, we dynamically recovered the cued location from the attentional activity with a high degree of fidelity, together with a broadening of the attentional modulation for wider attentional cues.

Results

Behavioral performance indicates effective deployment of covert spatial attention

We set out to investigate the spatial distribution of attentional modulation within visual cortex. To do so, we first ensured that participants (n=8) could successfully allocate covert spatial attention to cued portions of the visual field. During the experiment, participants’ task was to fixate the center of the screen and report whether there were more numbers or letters in a cued peripheral region (Figure 1a). The cued region varied in location and width: it could be centered on any of 20 polar angles and could span any of four widths (18°, 54°, 90°, and 162° of polar angle). Task performance indicated that participants used the cue effectively, as the proportion of correct responses was significantly above chance for all width conditions (Figure 1b; t-test, all p<0.001). We verified, with eye tracking, that participants performed the task using peripheral vision while maintaining central fixation. The upper bound of the 95% CI for each participant’s average gaze eccentricity ranged from 0.29° (degrees of visual angle) to 0.64° (mean = 0.48°; Figure 1c), suggesting that gaze did not exceed the cue annulus at fixation and that participants used covert spatial attention to perform the task.

Figure 1. Experimental design and behavioral performance.

Figure 1.

(a) Task schematic. Participants were instructed to maintain central fixation and use covert spatial attention to determine whether there were more numbers or letters present within a cued region of a white noise annulus. On each trial, the red cue was displayed alone for 1.35 s and remained present throughout the trial. Twenty digits and letters were then presented for 0.5 s, equally spaced and overlaid on the annulus. Participants had 1.25 s to indicate via button press whether more digits or letters were present in the cued region. The cue remained stable for five trials (10 TRs, 15.5 s), had a width of 1, 3, 5, or 9 segments (18°, 54°, 90°, or 162°), and was centered on any of the 20 digit/letter slots. (b) Behavioral task performance: group mean accuracy for each cue width. Error bars are SEM; gray circles show individual participants (n=8). (c) Group mean gaze eccentricity (in degrees of visual angle) for each cue width, conventions as in (b).

Attentional modulation of BOLD responses broadens with cue width

We assessed the spatial distribution of attention by visualizing how the BOLD response was modulated by the location and width of the cue. To do so, we used each voxel’s population receptive field (pRF) to project BOLD responses for each attentional cue into the visual field. The resulting 2D visual field maps were averaged across trials for each cue width by rotating the maps, so the attentional cue aligned to 0° polar angle (right horizontal meridian). The reconstructed visual field maps revealed that increasing cue width led to a concomitant broadening of attentional modulation in cortex (Figure 2a). While this pattern was evident in all three early visual regions (V1–V3), the effect appeared to strengthen when ascending the visuocortical hierarchy.

Figure 2. Spatial distribution of attentional modulation.

Figure 2.

(a) BOLD response projected into the visual field for each attentional cue width. Heatmaps represent the group mean BOLD activity using each voxel’s population receptive field (pRF) location within the visual field, shown separately for V1, V2, and V3. Maps were rotated to align all attentional cue locations to 0° polar angle (rightward). Concentric circles indicated by black dashed lines represent the location of the white noise annulus. (b) Average spatial modulation profiles at the eccentricity of the annulus. The spatial profiles were recentered to 0° polar angle based on the cue location. Solid lines represent the group mean BOLD response and shaded regions the SEM across participants (n=8).

Next, we computed the one-dimensional (1D) profile of attentional modulation at a fixed eccentricity. We were able to do this because we manipulated the location of the attentional field only as a function of polar angle, so all cues directed the attentional field to iso-eccentric locations. We selected voxels with pRFs that overlapped the white noise annulus and sorted them according to their polar angle preference.

For visualization purposes, the spatial response modulations were recentered to align all cues at 0° polar angle and averaged across trials for each cue width separately. Much like in the visual field reconstructions, there was a clear attentional modulation centered on 0°, which broadened and shifted downward with cue width – a pattern that was particularly evident in area V3 (Figure 2b).

Dynamic model-based recovery of the attentional field

We next applied a modeling approach to estimate the location and width of attentional modulation, allowing us to further investigate the spread of attention in visual cortex. To do this, we averaged the spatial response profiles across TRs within each 10-TR block, in which the cue maintained a consistent location and width, yielding between 27 and 53 averaged spatial response profiles per participant for each width condition. We fit a generalized Gaussian function to each of these spatial profiles to estimate the location and width of attentional modulation per spatial profile (see Figure 3a). The width of attentional modulation was quantified in terms of the full width at half maximum (FWHM) of the best fitting model prediction (see Figure 3b).

Figure 3. Modeling approach.

Figure 3.

(a) The generalized Gaussian model is characterized by parameters for location (μ), scale (σ), and shape (β). (b) Example model fits for two spatial profiles. Dots indicate BOLD response for two attentional cues differing in position and width. Solid lines indicate the best fitting model estimate. To quantify the attentional field, we extracted the location and gain (dashed arrows), as well as the width (full width at half maximum (FWHM); solid arrows).

Can we dynamically recover the attentional field from activity within visual cortex? Model fits explained a substantial proportion of variance in the spatial profiles of BOLD activity (V1: for 18° cues, mean [standard deviation] of R2=0.42 [0.03]; for 54° cues, 0.43 [0.03]; for 90° cues, 0.44 [0.03]; for 162° cues, 0.42 [0.03]; V2: for 18° cues, 0.51 [0.05]; for 54° cues, 0.54 [0.05]; for 90° cues, 0.54 [0.04]; for 162° cues, 0.55 [0.04]; V3: for 18° cues, 0.50 [0.03]; for 54° cues, 0.56 [0.04]; for 90° cues, 0.55 [0.03]; for 162° cues, 0.51 [0.02]). To interpret the estimated model parameters, we excluded the bottom 20% of fits based on a pooled R2 across V1, V2, and V3, leaving roughly equal proportions of included blocks across cue width conditions (18°: mean [standard deviation]=0.78 [0.04], 54°: 0.83 [0.05], 90°: 0.83 [0.04], 162°: 0.77 [0.07]).

To assess how well the model-estimated attentional field matched the cued location, we first calculated the angular error between the cue center and the model’s estimated location parameter. The angular error distribution across blocks, separated by width condition, is shown in Figure 4 for one example participant to display block-to-block variation. The model reliably captured the location of the attentional field with low angular error and with no systematic directional bias. This result was observed across participants. We next examined the absolute angular error to assess the overall accuracy of our estimates. The group mean absolute angular error in V1 was 41.9° (SEM = 2.86°), in V2 was 32.2° (2.31°), and in V3 was 24.7° (1.54°). Additionally, the absolute angular error did not vary linearly with the width of the cue in V1 or V2 (regression slopes tested against zero at the group level using a t-test; V1: t(7)=0.65, p=0.537; V2: t(7)=1.24, p=0.253; Figure 5). In V3, we observed a small but statistically significant increase in absolute angular error associated with greater cue widths (mean slope = 1.4, t(7)=4.18, p=0.004).

Figure 4. Attentional field parameter estimates for an example participant.

Figure 4.

The full parameter estimate distributions across blocks for location, width, gain, and baseline are shown for one example participant in V1, V2, and V3. Median parameter estimates are shown by the white points, with the box plot representing the 25th to 75th percentile, and whiskers extending to all non-outlier points.

Figure 5. Attentional field parameter estimates.

Figure 5.

Group results for location, width, gain, and baseline estimates. Overall group mean and SEM (n=8) are shown in solid black, separated by cue width and brain region. Individual participant median estimates are shown in gray. The example participant from Figure 4 is indicated by a denser dashed dark gray line with triangle symbols to aid in comparison.

Next, we evaluated the width of the attentional field by visualizing the distribution of FWHM for the same example participant (Figure 4) and at the group level (Figure 5). Confirming the broadening of the attentional field observed in the visual field reconstruction maps, we found that the estimated FWHM increased with greater cue widths in V2 and V3 (V2 t(7)=5.63, p<0.001; V3 t(7)=6.49, p<0.001). The effect was not statistically significant in V1 (t(7)=1.68, p=0.136).

We also assessed the gain of the attentional modulation in the model (Figures 4 and 5 for the example participant and group data, respectively). We observed no significant relationship between gain and cue width in V1 and V2 (V1 t(7)=–0.54, p=0.605; V2 t(7)=–2.19, p=0.065), though we did find a significant effect in V3, illustrating that gain decreases with cue width (t(7)=–3.12, p=0.017). We also found that the overall gain was greater in V2 and V3 compared to V1 (paired t-test, both p≤0.01).

Finally, we examined the baseline offset (Figure 4 example participant, and Figure 5 group data). No significant relationship was observed between cue width and baseline offset in any of the three brain regions (V1, t(7)=–1.05, p=0.330; V2, t(7)=–2.00, p=0.086; V3, t(7)=–1.61, p=0.152).

Temporal interval analysis

In the previous analyses, we leveraged the fact that the attentional cue remained constant for five-trial blocks (spatial profiles were computed by averaging BOLD measurements across a block of 10 TRs). We next examined the degree to which we were able to recover the attentional field on a moment-by-moment (TR-by-TR) basis. To do this, we systematically adjusted the number of TRs that contributed to the averaged spatial response profile. To maintain a constant number of observations across the temporal interval conditions, we randomly sampled a subset of TRs from each block. This allowed us to determine the amount of data needed to recover the attentional field, with a goal of examining the usability of our modeling approach in future paradigms involving more dynamic deployment of spatial attention.

When we systematically varied the number of TRs included for each model fit (1, 2, 3, 5, or 10 TRs), we found a significant effect of cue width on recovered FWHM when averaging two or more TRs in V3 (all t(7)≥2.38, all p≤0.049), and 10 TRs in V2 (results as reported in prior section; Figure 6a). As described above, V1 did not reliably show a significant relationship between cue width and FWHM, even when averaging 10 TRs. We found that increasing the number of TRs had a small but significant positive effect on FWHM estimates in V2 and V3 (V2, mean slope = 2.7, t(7)=2.95, p=0.021; V3, mean slope = 1.16, t(7)=3.22, p=0.015), although a significant effect was not observed in V1 (t(7)=1.82, p=0.111).

Figure 6. Effect of number of TRs.

Figure 6.

Model fits were computed using BOLD data averaged across different temporal intervals (1, 2, 3, 5, or 10 TRs). Group means (with SEM (n=8)) are plotted for (a) full width at half maximum (FWHM), (b) absolute angular error, (c) gain, (d) baseline offset, and (e) R2, separated by cue width, brain region, and the number of TRs used for each model fit.

The number of TRs significantly affected the absolute angular error associated with the estimated location of the attentional field (Figure 6b). Error magnitude decreased with TRs in all three visual regions (all t(7)≤–4.48, all p≤0.003), suggesting that more data yielded more accurate estimates, though absolute angular error remained consistently below chance (90°) even when fitting the model to single-TR BOLD responses. Absolute angular error remained stable across width conditions in V1 and V2 (V1, t(7)=–0.55, p=0.598; V2, t(7)=1.92, p=0.098), though we found that larger cue width had a small but significant effect associated with larger errors in V3 (mean slope = 0 .02, t(7)=3.28, p=0.014).

The estimated gain of the attentional modulation showed a dependence on number of TRs, with more TRs associated with lower gain estimates in V1 and V3 (V1, t(7)=–7.21, p<0.001; V3, t(7)=–9.97, p<0.001), though this was not clearly observed in V2 (t(7)=–1.60, p=0.154). There was no evident dependence of gain on cue width in V1 and V2 (V1 t(7)=–0.19, p=0.856; V2 t(7)=–2.34, p=0.052), though we did observe a significant relationship in V3 (t(7)=–2.86, p=0.024; Figure 6c).

The baseline offset tended to increase with number of TRs across all three brain regions (V1, t(7)=8.79, p<0.001; V2, t(7)=6.5, p<0.001; V3, t(7)=5.59, p=0.001; Figure 6d). Baseline offset did not show a significant dependence on cue width in any region (V1, t(7)=1.47, p=0.186; V2, t(7)=–2.16, p=0.068; V3, t(7)=–1.67, p=0.139).

Finally, the model’s goodness of fit improved with more data, with larger R2 associated with greater numbers of TRs included in the average profiles (all t(7)≥2.99, all p≤0.020), though all R2 were above 0.3 across all visual regions even for single-TR model fits. We did not observe a dependence of R2 on cue width (all t(7)≤1.26, all p≥0.249; Figure 6e).

Width of the attentional field mimics perceptual modulation

While the attentional field broadened as expected when participants were cued to attend to a larger portion of the white noise annulus, the size of the estimated attentional modulation was greater than the true size of the cued region. The cue width varied between 18° and 162°, whereas the width estimate derived from spatial profiles of BOLD modulation varied between 103° and 179° (Figure 5). We wondered what the underlying cause of this disparity might be. One possibility is that the BOLD-derived FWHM might tend to overestimate the retinotopic extent of the modulation, perhaps driven by binning and smoothing processing steps to create the 1D spatial profiles. If this were the case, we would expect to obtain similar FWHM estimates when modeling the perceptual modulations as well. Alternatively, the true subjective attentional field might be consistently broader than cued, despite the presence of nearby distractors. If this were the case, modulation driven by perceptual differences should not result in the same large FWHM estimates.

To address this, we compared our estimates of the attentional field with equivalent estimates for spatial profiles induced by a perceptual manipulation. In this additional experiment, we varied the contrast intensity of sections of the white noise annulus. Participants were not asked to deploy spatial attention to the stimulus and were instead instructed to perform a color change detection task at fixation. The regions of increased noise contrast matched the attentional cue widths (18°, 54°, 90°, and 162°, plus an additional intermediate width of 126°) and were centered on one of the four cardinal locations (0°, 90°, 180°, 270° polar angle).

As expected, we observed a broadening of the spatial profile of BOLD modulation in all three visual areas as the region of increased contrast widened (Figure 7a). Using an identical modeling procedure, we estimated the spatial profile of the perceptual BOLD modulation. The model-based estimates revealed that the mean absolute angular error between the model-estimated location and the center of the contrast stimulus had no significant dependence on contrast width in any of the three brain regions (magnitude of all t(4)≤0.915, all p≥0.412). The recovered FWHM increased with contrast width in both V1 and V3 (Figure 7b; V1, t(4)=6.94, p=0.002; V3 t(4)=11.34, p<0.001), though this effect was not clearly observed in V2 (t(4)=1.37, p=0.242). The estimated gain modulation also did not show a relationship to contrast width in any of the visual areas (magnitude of all t(4)≤1.71, all p≥0.163). Finally, we did not observe a significant relationship between contrast width and baseline offset in any visual area (magnitude of all t(4)≤1.93, all p≥0.125). In sum, the group results for model estimates revealed that: (1) the model was highly accurate in estimating the location of the contrast increment; (2) FWHM of the spatial profiles broadened across contrast widths, (3) the gain and baseline remained stable across contrast widths (Figure 7b).

Figure 7. Spatial distribution of perceptual modulation.

Figure 7.

(a) Spatial profiles of perceptual modulation. Solid lines represent the group mean BOLD activity and shaded regions the SEM (n=5). (b) Group-level parameter estimates. Overall group mean and SEM are shown for the absolute angular error, full width at half maximum (FWHM), gain, and baseline, separated by contrast width and brain region. (c) Comparison of FWHM estimates obtained from the attentional manipulation and the physical contrast manipulation. Dot color indicates brain region; each point represents the mean FWHM for a given width condition across participants.

Mirroring the results from the attentional manipulation, FWHM estimates systematically exceeded the nominal size of the perceptually modulated region of the visual field. Comparing the estimated FWHMs of the perceptual and attentional spatial profiles (Figure 7c) revealed that the estimated widths were highly correlated (Pearson correlation r=0.749 across width conditions and visual regions), and though the estimated FWHMs from the perceptual task appear to be smaller, they did not significantly differ from the FWHMs derived from the attentional task (t-test p=0.181). Importantly, the relative differences in FWHM show meaningful effects of both cue and contrast width in a similar manner for attentional and perceptual forms of modulation.

Discussion

We investigated the topographic spread of spatial attention in human visual cortex by characterizing the spatial profile of BOLD responses while participants attended to different portions of the visual field. Behavioral performance confirmed that participants used the fixation cue to dynamically allocate attention to different swaths of the visual field. Attention allocation was associated with a modulation in the BOLD response in corresponding retinotopic areas of visual cortex. To characterize the topography of that modulation, our approach involved selecting voxels with pRF preferred eccentricities that overlapped our white noise annulus and organizing those voxels into 1D profiles of attentional modulation as a function of preferred polar angle. This allowed us to model the location and spread of the attentional field and test how well it tracked the nominal location and width of the cue presented at fixation. Using a generalized Gaussian model, the cued location could be recovered with high fidelity. We observed a broadening of the estimated attentional field in areas V2 and V3 with the cue width, suggesting our method was capable of dynamically recovering the location and size of the attentional field from moment to moment. We also found that the estimated spatial spread of the attentional modulation (as indicated by the recovered FWHM) was consistently wider than the cued region itself. We therefore compared the spread of the attention field with the spatial profile of a perceptually induced width manipulation. The results were comparable in both the attentional and perceptual versions of the task, suggesting that cueing attention to a region results in a similar 1D spatial profile to when the stimulus contrast is simply increased in that region.

This work builds on the concept of an attentional ‘spotlight’ or ‘zoom lens’ that has long been theorized to aid in spatial attention (Shaw and Shaw, 1977; Posner, 1980; Eriksen and St. James, 1986; Carrasco, 2011). By flexibly adjusting and shifting the focus of the spotlight, visual representations are selectively enhanced in a region of the visual field. However, the empirical evidence demonstrating that attention can change its spread across the visual field by modulating brain responses is surprisingly lacking (Yeshurun, 2019). Our understanding of how the attentional window interacts with spatial representations is mainly based on behavioral reports (Gobell et al., 2004; Palmer and Moore, 2009; Herrmann et al., 2010; van Beilen et al., 2011; Taylor et al., 2015; Huang et al., 2017; Kinıklıoğlu and Boyaci, 2022), but see Müller et al., 2003; Hopf et al., 2006; Itthipuripat et al., 2014; Tkacz-Domb and Yeshurun, 2018; Feldmann-Wüstefeld and Awh, 2020. We introduced a novel modeling approach that recovered the location and the size of the attentional field. Our data show that the estimated spatial spread of attentional modulation (as indicated by the recovered FWHM) consistently broadened with the cue width, replicating prior work (Müller et al., 2003; Herrmann et al., 2010). Our results go beyond prior work by linking the spatial profiles to pRF estimates, allowing us to quantify the spread of both attentional and perceptual modulation in degrees of polar angle. Interestingly, the FWHM estimates for the attentional and perceptual spatial profiles were highly similar. Additionally, for area V3, we replicate that the population response magnitude decreased with cue width (Müller et al., 2003; Feldmann-Wüstefeld and Awh, 2020). One innovation of our method is that it directly reconstructs attention-driven modulations of responses in visual cortex, setting it apart from other methods, such as inverted encoding models (e.g. Sprague and Serences, 2013). Finally, we demonstrated that our method has potential to be used in more dynamic settings, in which changes in the attentional field need to be tracked on a shorter timescale.

The ability to change the size of the attentional field is a crucial component in an influential theoretical model of attention. This model proposes that the interaction between stimulus properties (such as its size and specific features) and the attentional field can explain a wide variety of attentional effects reported in behavioral and neurophysiological studies (Herrmann et al., 2010; Itthipuripat et al., 2014; Bloem and Ling, 2019; Jigo et al., 2021). The present study sought to address this gap, with our results showing that the visuocortical attentional field broadened as we increased the cue width (Figure 5). This provides compelling evidence that the attention-related cortical response can, in fact, flexibly vary in its position and spatial distribution.

The observed effects of attentional field width were unlikely to be directly attributable to variation in task difficulty. Participants’ task in our study was to discriminate whether more numbers or more letters were presented within a cued region of an iso-eccentric annulus of white noise. For our different cue widths, the ratios of numbers and letters were selected to be as similar as possible given the size and spacing of our stimuli. Changes in accuracy across the three larger cue widths were small and non-monotonic, implying task difficulty was dissociable from width per se. This dissociation bolsters the interpretability of our model fits; nevertheless, future work should further investigate how task difficulty interacts with the spread of the attentional field and the amplitude of attention-related BOLD effects (cf. Ress et al., 2000).

In this study, we modeled the attentional field using a 1D distribution. This approach aligned with our experimental design, as the attentional cue was manipulated only as a function of polar angle. However, we know that spatial processing varies substantially as a function of eccentricity. Spatial resolution is highest at the fovea and rapidly drops in the periphery (Anton-Erxleben and Carrasco, 2013). The spatial distribution of attention will presumably also vary with eccentricity and will likely take on different functional properties close to the fovea, where spatial resolution is high, compared to the far periphery where spatial resolution is low (Intriligator and Cavanagh, 2001; Jigo et al., 2021). Future work can help provide a better understanding of the contribution of spatial attention by considering how the attentional field interacts with these well-described spatial variations across the visual field. Measuring the full spatial distribution of the attentional field (across both eccentricity and polar angle) will shed light on how spatial attention guides perception by interacting with the nonuniformity of spatial representations.

The spread of the attentional field likely influences the degree to which spatial resolution at the attended location is transformed, leading to enhanced behavioral performance. Spatial attention was vital for this task, as enhanced spatial perception allowed the participants to better discriminate all stimuli within the cued region (Anton-Erxleben and Carrasco, 2013). Future work could unpack the degree to which the size of the attentional field influences the spatial resolution of visual cortical representations (Klein et al., 2014; Vo et al., 2017; Tünçok et al., 2024), and how this influences spatial perception.

Beyond addressing core questions related to the function of spatial attention, this method also lays groundwork for addressing questions about spatial predictive uncertainty and belief updating. Prior work on these topics has relied almost entirely on inferring participants’ predictions from their behavior, often requiring participants to report overt point predictions (Nassar et al., 2010; McGuire et al., 2014; d’Acremont and Bossaerts, 2016; Nassar et al., 2019), or inferring participants’ predictions from their sequences of decisions (Daw et al., 2006; Behrens et al., 2007; Payzan-LeNestour and Bossaerts, 2011; Payzan-LeNestour et al., 2013). These approaches have shed light on how we dynamically adapt our learning and belief updating processes over time in differently structured contexts. However, methods for recovering information about full predictive belief distributions have been limited, relying on indirect measurements such as eye movements (O’Reilly et al., 2013; Bakst and McGuire, 2021; Bakst and McGuire, 2023), and physiological measures of uncertainty and surprise in EEG and pupillometry (Preuschoff et al., 2011; Nassar et al., 2012; Nassar et al., 2019). The methods developed here offer a potential way to recover the location and width of a spatial predictive distribution via the attentional field in contexts in which it is unknown a priori and might be dependent on how a given participant has integrated previous sequential evidence. Future work could extend this method to more directly interrogate how predictive uncertainty is represented throughout the brain on a moment-by-moment basis.

In summary, we found evidence that people could dynamically adapt the spread of spatial attention, and that the retinotopic extent of attentional modulation of the BOLD response reflected this dynamic adaptation. These findings address a gap in our understanding of spatial attentional control, supporting core theoretical models of attention. Our modeling approach also lays the groundwork to address further questions related to how the attentional field interacts with the nonuniformity of spatial representations and how uncertainty in spatial contexts is represented in the human brain.

Materials and methods

Participants

Eight healthy adults (four female, four male, mean age = 30) participated in the main attention experiment, five of whom also participated in a second experiment featuring a contrast manipulation. All participants had normal or corrected-to-normal vision. All procedures were approved by the Boston University Institutional Review Board, and informed consent was obtained from all participants.

Apparatus and stimuli

Participants were presented with stimuli generated using PsychoPy (v1.85.1; Peirce, 2007) on a MacBook Pro. The visual stimuli were displayed on a rear-projection screen (subtending ~20°×16° visual angle) using a VPixx Technologies PROPixx DLP LED projector (maximum luminance 306 cd/m2). Participants viewed the screen through a front surface mirror. Participants were placed comfortably in the scanner with padding to minimize head motion.

Procedure

Attentional width manipulation

Participants were instructed to fixate a central point (radius 0.08° visual angle) while dynamic pixelwise white noise (flickering at 10 Hz, 50% contrast) was presented in the periphery (annulus spanning 4.6° to 7.4° visual angle). The annulus was segmented into 20 bins (18° polar angle per bin) by white grid lines radiating from a white circle at the center of the screen (radius 0.25°), passing behind the annulus, and terminating at 8.5° eccentricity. In the middle of each bin, a number or letter (height: 2.1°) was superimposed on the white noise annulus (see Figure 1a). For a subset of the participants (three out of eight), the screen distance inside the scanner was changed; therefore, for those participants, the letter size was 1.86° visual angle, and the white noise annulus spanned 4.1° to 6.5° visual angle. The set of possible letters included all lowercase letters of the Latin alphabet except a, b, e, g, i, o, and u. The set of numbers included 2, 3, 4, 5, 7, and 8.

Participants were cued to attend covertly to a contiguous subset of the bins, and their task was to report, via button press, whether there were more numbers or letters present within the cued region. The cue was a bold red segment on the central white circle, which corresponded to 1, 3, 5, or 9 bins (18°, 54°, 90°, or 162° polar angle; see Figure 1a). The true proportion of letters versus numbers was controlled within each cue width condition. For cued regions of 1 bin, there was either a single number or letter in the bin. For cued regions of 3 bins, the ratio was always 2:1 (either two numbers and one letter or vice versa). For cued regions of 5 bins, the ratio was 3:2, and for cued regions of 9 bins, the ratio was 6:3. The ratios were selected to be as similar as possible given the size and spacing of our stimuli (aside from the one-bin cue, the proportions for the other cues were 0.67, 0.60, and 0.67). Cues could be centered on any of the 20 bins.

Participants completed 8–12 runs of the task (mean = 10.4), with each run lasting 341 s and containing 100 trials. Each cue remained constant for a block of five trials (lasting 15.5 s, 10 TRs), although the letters and numbers within the cued region changed on every trial. Thus, each participant saw 20 unique cues (combinations of cue location and width) per run. Each run began and ended with 15.5 s of the dynamic noise annulus.

During each trial, the cue and white noise annulus were presented alone for 1.35 s. The numbers and letters were then displayed for 0.5 s. Thereafter, the cue and white noise remained visible while the participant had 1.25 s to indicate whether there had been more digits or letters within the cued region, resulting in a total trial duration of 3.1 s (2 TRs). No accuracy feedback was provided during the main experiment. However, all participants completed three training runs with trial-by-trial feedback prior to the scan session. During training runs, the response window was shortened to 1 s and the remaining 0.25 s presented feedback in the form of a change in color of the fixation point (blue for correct responses and orange for incorrect responses).

Physical contrast manipulation

A subset of participants (n=5) also participated in an experiment that enhanced the physical contrast intensity of the dynamic visual noise in segments of the annulus. This additional experiment was carried out during the same scan session and allowed for benchmarking the detectability of stimulus-evoked modulation in visual cortex using our analyses. The stimuli and trial structure were similar to the attentional manipulation. The task differed in the following ways: (1) the contrast of the white noise annulus was increased to 100% for segments of the annulus corresponding to 1, 3, 5, 7, or 9 bins (18°, 54°, 90°, 126°, or 162° polar angle), with a Gaussian rolloff (σ=15°) that spanned 25% of the furthest included bins and 25% of the adjacent excluded bins; (2) the enhanced segments were always centered on the cardinal directions (0°, 90°, 180°, and 270° polar angle); (3) the contrast increase remained constant for 15.5 s (10 TRs); (4) participants performed a color change detection task at fixation. Each unique combination of four locations and five widths of the contrast enhancement was shown once per run, with the order randomized. To estimate a baseline response, each run started and ended with 15.5 s without contrast modulation. Participants completed two runs total, each lasting 341 s (220 TRs).

Throughout the physical contrast runs, participants were instructed to fixate on a central point (radius 0.08° visual angle) and to press a button when the fixation point switched color (alternating white and red). The fixation point remained a color for at least 1 s and then had a 10% probability of switching every 100 ms. No cues were presented for the regions of increased contrast. Additionally, no letters or numbers were superimposed on the white noise annulus.

pRF mapping

pRF estimates were obtained for each participant in a separate scan session. We used the experimental procedure as described in the Human Connectome Project 7T Retinotopy dataset (Benson et al., 2018). Stimuli were composed of a pink noise background with colorful objects and faces at various spatial scales, displayed on a mean luminance gray background. Stimuli were updated at a rate of 15 Hz while participants performed a color change detection task at fixation. Participants viewed two types of mapping stimuli: (1) contracting/expanding rings and rotating wedges; (2) moving bar stimuli (Dumoulin and Wandell, 2008; Kay et al., 2013). A total of four to six scans (300 TRs) were collected for each participant (two to three scans per stimulus type). In this session, the field of view was restricted to the occipital cortex to maximize signal-to-noise ratio (SNR), thereby limiting the brain regions for which we had pRF estimates to V1, V2, and V3.

MRI data acquisition

All MRI data were acquired at Boston University’s Cognitive Neuroimaging Center (Boston, MA, USA) on a research-dedicated Siemens Prisma 3T scanner using a 64-channel head coil. A scanning session lasted 2 hr.

All functional neuroimaging data were acquired using a simultaneous multislice gradient-echo echo-planar acquisition protocol (Moeller et al., 2010; Setsompop et al., 2012): 2 mm isotropic voxels; FoV = 212 × 212 mm2; 72 axial slices; TR = 1.55 s; TE = 35.60 ms; flip angle = 72°; multiband acceleration factor 4. We computed distortion field maps by using a spin echo echoplanar protocol with opposite y-axis phase encoding directions (2 mm isotropic voxels; FOV = 212 × 212 mm2; TR = 8850 ms; TE = 70.80 ms; flip angle = 90°). During a separate scan session, we acquired a whole-brain anatomical scan using a T1-weighted multi-echo MPRAGE 3d sequence (1 mm isotropic; FoV = 256 × 256 mm2; 176 sagittal slices; TR = 2530 ms; TE = 1.69 ms; flip angle = 7°), and the pRF scans (occipital coverage only; right-left phase encoding; 2 mm isotropic voxels; FoV = 136 × 136 mm2; 36 slices; TR = 1 s; TE = 35.4 ms; flip angle = 64°; multiband acceleration factor 3).

MRI data analysis

Structural data preprocessing

Whole-brain T1-weighted anatomical data were analyzed using the standard ‘recon-all’ pipeline provided by FreeSurfer software (FreeSurfer version 5.3, Fischl, 2012), generating cortical surface models, whole-brain segmentation, and cortical parcellations.

Functional data preprocessing

All analyses were performed in the native space for each participant. First, EPI distortion correction was applied to all fMRI BOLD time-series data using a reverse phase-encode method (Andersson et al., 2003) implemented in FSL (Smith et al., 2004). All functional data were then preprocessed using FS-FAST (Fischl, 2012), including standard motion-correction procedures, Siemens slice timing correction, and boundary-based registration between anatomical and functional volumetric spaces (Greve and Fischl, 2009). To facilitate voxel-wise analysis, no volumetric smoothing was performed, and across-run within-modality robust rigid registration was applied (Reuter et al., 2010), with the middle time point of the first run serving as the target volume, and the middle time point of each subsequent run used as a movable volume for alignment. Lastly, data were detrended (0.005 Hz high-pass filter) and converted to percent signal change for each voxel independently using custom code written in MATLAB (version 2020b).

pRF mapping and voxel selection

The time series were analyzed using the analyzePRF toolbox in MATLAB, implementing a compressive spatial summation pRF model (Kay et al., 2013). The results of the pRF analysis were used to manually draw boundaries between early visual regions (V1, V2, and V3), which served as our regions of interest (ROIs).

Within each ROI, pRF modeling results were used to constrain voxel selection used in the main experiment. We excluded voxels with a preferred eccentricity outside the bounds of the pRF stimulus (<0.7° and >9.1°), with a pRF size smaller than 0.01°, or with poor spatial selectivity as indicated by the pRF model fit (R2<10%). Following our 2D visualizations (see below), we further constrained voxel selection by only including voxels whose pRF overlapped with the white noise annulus. We included all voxels with an estimated eccentricity within the annulus bounds, as well as voxels with an estimated pRF size that would overlap the annulus.

2D visualizations of attentional modulation

To visualize the topography of attentional modulation under different cue widths, we projected the average BOLD responses for a given block (10 TRs with a consistent cue location and width, shifted by 3 TRs [4.65 s] to compensate for the hemodynamic delay) into the visual field using each voxel’s pRF location. This method is similar to that described in Favila et al., 2022. First, we computed the Cartesian (x,y) coordinates from the pRF eccentricity and polar angle estimates for each voxel. Then, within a given ROI, we interpolated the BOLD responses over (x,y) space to produce a full-field representation. Each representation was then z-scored to allow for comparison across blocks, cue conditions, and participants. Finally, the representation was rotated so that the center of the cue was aligned to the right horizontal meridian (see Figure 2a).

1D spatial profile of attentional modulation

We also examined the spatial profile of attentional modulation as a function of polar angle. Voxels with pRFs overlapping the white noise annulus were grouped into 60 bins according to their pRF polar angle estimate (6° polar angle bin width). We computed a median BOLD response within each bin. This facilitated the recentering of each profile to align all cue centers for subsequent combining across trials. To improve the SNR, the resulting profile was smoothed with a moving average filter (width 18° polar angle; see Figure 2b).

Model fitting

We quantified the spatial profile of attentional modulation with a generalized Gaussian model (Nadarajah, 2005). The generalized Gaussian function (G) combines Gaussian and Laplace distributions:

G=exp{|xμσ|β} (1)

The function has free parameters for location (μ), scale (σ), and shape (β). The shape parameter enables the tails of the distribution to become heavier than Gaussian (when β<2), or lighter than Gaussian (when β>2); as β, the model approaches a uniform distribution.

Next, G was normalized to range between 0 and 1, and vertically scaled and shifted by two additional free parameters for gain (a) and baseline offset (b):

y^=aG+b (2)

We fit the five free parameters (μ,σ,β,a,b) using the MATLAB optimization tool fmincon, minimizing the squared error between the model prediction and the 1D profile described above. To avoid local minima, we first ran a grid search to find the initialization values with the lowest SSE (six possible values for μ, equally spaced between 0° and 360°, crossed with six possible values for σ, equally spaced between 9° and 162° polar angle; β = 4; a = 1; b = 0). We imposed the following parameter bounds on the search: σ: [6°, 180° polar angle], β: [1.8, 50], and a: [0, 20]. μ was unbounded, but was wrapped to remain within [0°, 360°].

From the model fits, we computed the following summary metrics: (1) angular error, defined as the polar-angle distance between the true and estimated location; (2) the FWHM of the best-fitting generalized Gaussian function, which served as our measure of the width of attentional modulation. The FWHM was controlled mainly by the scale parameter (σ) but also to a lesser degree by the shape parameter (β; see Figure 3a); (3) the gain modulation of the spatial profile (a); (4) the baseline offset (b); (5) the model’s goodness of fit quantified as the percentage of explained variance (R2) in the spatial response profile:

R2=1(yy^)2(yy¯)2 (3)

Statistical testing

To assess how the attentional cue width manipulation influenced the 1D spatial profile of BOLD modulation, we tested whether the computed summary metrics (absolute angular error, FWHM, gain, and baseline) varied as a function of cue width. Specifically, we performed a linear regression for each metric within each subject and tested whether the slopes differed from zero at the group level using a t-test. This was done independently for each ROI. No multiple comparison correction was applied, as the different tests for each region are treated as separate questions. However, using a threshold of 0.017 for p-values would correct for comparisons across the three brain regions. When testing whether the number of TRs impacted our metrics, the linear regression used both cue width and number of TRs as explanatory variables.

Eye-position monitoring

Gaze data were collected for all participants using an MR-compatible SR Research EyeLink 1000+ eye tracker sampling at 1 kHz. Data from blink periods were excluded from analysis. Participants maintained fixation throughout the task, with average gaze eccentricity below 0.5° for all participants. Gaze eccentricity did not significantly vary by cued width (pairwise comparison of width conditions using a paired t-test, all p≥0.205 with Bonferroni correction for multiple comparisons) nor location (pairwise comparison, all p≥0.522 with Bonferroni correction for multiple comparisons). Additionally, we examined the number of fixations to the white noise annulus itself. No participant had more than 16 fixations (out of 800–1200 trials) to the annulus during the task, further suggesting that participants successfully maintained fixation.

Acknowledgements

This work was supported by National Science Foundation grants SMA-1809071, and BCS-1755757, National Institutes of Health grants F32-EY029134, R01-EY028163, and R01-MH126971, Office of Naval Research grant N00014-17-1-2304, and the Center for Systems Neuroscience Postdoctoral Fellowship at Boston University. The equipment used was funded by NSF Major Instrumentation Grant 1625552. The content of this paper does not necessarily represent the official views of the funding agencies. We thank members of the Ling and McGuire labs for providing helpful feedback and comments.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Sam Ling, Email: samling@bu.edu.

Ming Meng, University of Alabama at Birmingham, United States.

Huan Luo, Peking University, China.

Funding Information

This paper was supported by the following grants:

  • National Science Foundation SMA-1809071 to Leah Bakst, Joseph T McGuire.

  • National Science Foundation BCS-1755757 to Joseph T McGuire, Sam Ling.

  • National Eye Institute F32-EY029134 to Leah Bakst, Joseph T McGuire.

  • National Eye Institute R01-EY028163 to Sam Ling.

  • National Institute of Mental Health R01-MH126971 to Joseph T McGuire.

  • Boston University Center for Systems Neuroscience Postdoctoral Fellowship to Leah Bakst.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review and editing.

Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review and editing.

Conceptualization, Methodology, Writing – original draft, Investigation, Funding acquisition, Formal analysis.

Conceptualization, Methodology, Writing – original draft, Investigation, Data curation, Formal analysis.

Ethics

Human subjects: All procedures were approved by the Boston University Institutional Review Board, and informed consent was obtained from all participants.

Additional files

MDAR checklist

Data availability

All preprocessed fMRI data (extracted time series from each voxel), behavioral data, and eye tracking data reported in this study, as well as all code necessary to reproduce the analysis and recreate the figures are publicly available at Open Science Framework and GitHub (copy archived at Bloem, 2025) respectively.

The following dataset was generated:

Bloem I, Bakst L, McGuire JT, Ling S. 2025. Data and code for: "Dynamic estimation of the attentional field from visual cortical activity". Open Science Framework.

References

  1. Andersson JLR, Skare S, Ashburner J. How to correct susceptibility distortions in spin-echo echo-planar images: application to diffusion tensor imaging. NeuroImage. 2003;20:870–888. doi: 10.1016/S1053-8119(03)00336-7. [DOI] [PubMed] [Google Scholar]
  2. Anton-Erxleben K, Carrasco M. Attentional enhancement of spatial resolution: linking behavioural and neurophysiological evidence. Nature Reviews. Neuroscience. 2013;14:188–200. doi: 10.1038/nrn3443. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bakst L, McGuire JT. Eye movements reflect adaptive predictions and predictive precision. Journal of Experimental Psychology. General. 2021;150:915–929. doi: 10.1037/xge0000977. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bakst L, McGuire JT. Experience-driven recalibration of learning from surprising events. Cognition. 2023;232:105343. doi: 10.1016/j.cognition.2022.105343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bartsch MV, Merkel C, Strumpf H, Schoenfeld MA, Tsotsos JK, Hopf JM. A cortical zoom-in operation underlies covert shifts of visual spatial attention. Science Advances. 2023;9:eade7996. doi: 10.1126/sciadv.ade7996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Behrens TEJ, Woolrich MW, Walton ME, Rushworth MFS. Learning the value of information in an uncertain world. Nature Neuroscience. 2007;10:1214–1221. doi: 10.1038/nn1954. [DOI] [PubMed] [Google Scholar]
  7. Benson NC, Jamison KW, Arcaro MJ, Vu AT, Glasser MF, Coalson TS, Van Essen DC, Yacoub E, Ugurbil K, Winawer J, Kay K. The human connectome project 7 Tesla retinotopy dataset: description and population receptive field analysis. Journal of Vision. 2018;18:23. doi: 10.1167/18.13.23. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bloem IM, Ling S. Normalization governs attentional modulation within human visual cortex. Nature Communications. 2019;10:5660. doi: 10.1038/s41467-019-13597-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bloem I. BloemBakstMcGuireLing_eLife_2025. swh:1:rev:a15e4e39f056652d4e1bba975e8f50b54dee674bSoftware Heritage. 2025 https://archive.softwareheritage.org/swh:1:dir:59df4308df50c1ce860c4d2c239033a5120e4e5f;origin=https://github.com/ilonabloem/BloemBakstMcGuireLing_eLife_2025;visit=swh:1:snp:7d65d32b3f8c9b7b0fedae14eb9fba53163912f8;anchor=swh:1:rev:a15e4e39f056652d4e1bba975e8f50b54dee674b
  10. Brefczynski JA, DeYoe EA. A physiological correlate of the “spotlight” of visual attention. Nature Neuroscience. 1999;2:370–374. doi: 10.1038/7280. [DOI] [PubMed] [Google Scholar]
  11. Carrasco M. Visual attention: the past 25 years. Vision Research. 2011;51:1484–1525. doi: 10.1016/j.visres.2011.04.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Castiello U, Umiltà C. Size of the attentional focus and efficiency of processing. Acta Psychologica. 1990;73:195–209. doi: 10.1016/0001-6918(90)90022-8. [DOI] [PubMed] [Google Scholar]
  13. d’Acremont M, Bossaerts P. Neural mechanisms behind identification of leptokurtic noise and adaptive behavioral response. Cerebral Cortex. 2016;26:1818–1830. doi: 10.1093/cercor/bhw013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Datta R, DeYoe EA. I know where you are secretly attending! The topography of human visual attention revealed with fMRI. Vision Research. 2009;49:1037–1044. doi: 10.1016/j.visres.2009.01.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Daw ND, O’Doherty JP, Dayan P, Seymour B, Dolan RJ. Cortical substrates for exploratory decisions in humans. Nature. 2006;441:876–879. doi: 10.1038/nature04766. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Dumoulin SO, Wandell BA. Population receptive field estimates in human visual cortex. NeuroImage. 2008;39:647–660. doi: 10.1016/j.neuroimage.2007.09.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Eriksen CW, St. James JD. Visual attention within and around the field of focal attention: a zoom lens model. Perception & Psychophysics. 1986;40:225–240. doi: 10.3758/BF03211502. [DOI] [PubMed] [Google Scholar]
  18. Favila SE, Kuhl BA, Winawer J. Perception and memory have distinct spatial tuning properties in human visual cortex. Nature Communications. 2022;13:5864. doi: 10.1038/s41467-022-33161-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Feldmann-Wüstefeld T, Awh E. Alpha-band activity tracks the zoom lens of attention. Journal of Cognitive Neuroscience. 2020;32:272–282. doi: 10.1162/jocn_a_01484. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Fischl B. FreeSurfer. NeuroImage. 2012;62:774–781. doi: 10.1016/j.neuroimage.2012.01.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gobell JL, Tseng C, Sperling G. The spatial distribution of visual attention. Vision Research. 2004;44:1273–1296. doi: 10.1016/j.visres.2004.01.012. [DOI] [PubMed] [Google Scholar]
  22. Greve DN, Fischl B. Accurate and robust brain image alignment using boundary-based registration. NeuroImage. 2009;48:63–72. doi: 10.1016/j.neuroimage.2009.06.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Herrmann K, Montaser-Kouhsari L, Carrasco M, Heeger DJ. When size matters: attention affects performance by contrast or response gain. Nature Neuroscience. 2010;13:1554–1559. doi: 10.1038/nn.2669. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hopf J-M, Luck SJ, Boelmans K, Schoenfeld MA, Boehler CN, Rieger J, Heinze H-J. The neural site of attention matches the spatial scale of perception. The Journal of Neuroscience. 2006;26:3532–3540. doi: 10.1523/JNEUROSCI.4510-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Huang D, Liang H, Xue L, Wang M, Hu Q, Chen Y. The time course of attention modulation elicited by spatial uncertainty. Vision Research. 2017;138:50–58. doi: 10.1016/j.visres.2017.06.008. [DOI] [PubMed] [Google Scholar]
  26. Intriligator J, Cavanagh P. The spatial resolution of visual attention. Cognitive Psychology. 2001;43:171–216. doi: 10.1006/cogp.2001.0755. [DOI] [PubMed] [Google Scholar]
  27. Itthipuripat S, Garcia JO, Rungratsameetaweemana N, Sprague TC, Serences JT. Changing the spatial scope of attention alters patterns of neural gain in human cortex. The Journal of Neuroscience. 2014;34:112–123. doi: 10.1523/JNEUROSCI.3943-13.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Jigo M, Heeger DJ, Carrasco M. An image-computable model of how endogenous and exogenous attention differentially alter visual perception. PNAS. 2021;118:p. e2106436118. 10.1073/pnas.2106436118. doi: 10.1073/pnas.2106436118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kastner S, De Weerd P, Desimone R, Ungerleider LG. Mechanisms of directed attention in the human extrastriate cortex as revealed by functional MRI. Science. 1998;282:108–111. doi: 10.1126/science.282.5386.108. [DOI] [PubMed] [Google Scholar]
  30. Kay KN, Winawer J, Mezer A, Wandell BA. Compressive spatial summation in human visual cortex. Journal of Neurophysiology. 2013;110:481–494. doi: 10.1152/jn.00105.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kinıklıoğlu M, Boyaci H. Increasing the spatial extent of attention strengthens surround suppression. Vision Research. 2022;199:108074. doi: 10.1016/j.visres.2022.108074. [DOI] [PubMed] [Google Scholar]
  32. Klein BP, Harvey BM, Dumoulin SO. Attraction of position preference by spatial attention throughout human visual cortex. Neuron. 2014;84:227–237. doi: 10.1016/j.neuron.2014.08.047. [DOI] [PubMed] [Google Scholar]
  33. Maunsell JHR. Neuronal mechanisms of visual attention. Annual Review of Vision Science. 2015;1:373–391. doi: 10.1146/annurev-vision-082114-035431. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. McAdams CJ, Maunsell JH. Effects of attention on orientation-tuning functions of single neurons in macaque cortical area V4. The Journal of Neuroscience. 1999;19:431–441. doi: 10.1523/JNEUROSCI.19-01-00431.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. McGuire JT, Nassar MR, Gold JI, Kable JW. Functionally dissociable influences on learning rate in a dynamic environment. Neuron. 2014;84:870–881. doi: 10.1016/j.neuron.2014.10.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. McMains SA, Somers DC. Multiple spotlights of attentional selection in human visual cortex. Neuron. 2004;42:677–686. doi: 10.1016/s0896-6273(04)00263-6. [DOI] [PubMed] [Google Scholar]
  37. Moeller S, Yacoub E, Olman CA, Auerbach E, Strupp J, Harel N, Uğurbil K. Multiband multislice GE-EPI at 7 tesla, with 16-fold acceleration using partial parallel imaging with application to high spatial and temporal whole-brain fMRI. Magnetic Resonance in Medicine. 2010;63:1144–1153. doi: 10.1002/mrm.22361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Müller NG, Bartelt OA, Donner TH, Villringer A, Brandt SA. A physiological correlate of the “Zoom Lens” of visual attention. The Journal of Neuroscience. 2003;23:3561–3565. doi: 10.1523/JNEUROSCI.23-09-03561.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Nadarajah S. A generalized normal distribution. Journal of Applied Statistics. 2005;32:685–694. doi: 10.1080/02664760500079464. [DOI] [Google Scholar]
  40. Nassar MR, Wilson RC, Heasly B, Gold JI. An approximately Bayesian delta-rule model explains the dynamics of belief updating in a changing environment. The Journal of Neuroscience. 2010;30:12366–12378. doi: 10.1523/JNEUROSCI.0822-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Nassar MR, Rumsey KM, Wilson RC, Parikh K, Heasly B, Gold JI. Rational regulation of learning dynamics by pupil-linked arousal systems. Nature Neuroscience. 2012;15:1040–1046. doi: 10.1038/nn.3130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Nassar MR, Bruckner R, Frank MJ. Statistical context dictates the relationship between feedback-related EEG signals and learning. eLife. 2019;8:e46975. doi: 10.7554/eLife.46975. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. O’Reilly JX, Schüffelgen U, Cuell SF, Behrens TEJ, Mars RB, Rushworth MFS. Dissociable effects of surprise and model update in parietal and anterior cingulate cortex. PNAS. 2013;110:E3660–E3669. doi: 10.1073/pnas.1305373110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Palmer J, Moore CM. Using a filtering task to measure the spatial extent of selective attention. Vision Research. 2009;49:1045–1064. doi: 10.1016/j.visres.2008.02.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Payzan-LeNestour E, Bossaerts P. Risk, unexpected uncertainty, and estimation uncertainty: Bayesian learning in unstable settings. PLOS Computational Biology. 2011;7:e1001048. doi: 10.1371/journal.pcbi.1001048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Payzan-LeNestour E, Dunne S, Bossaerts P, O’Doherty JP. The neural representation of unexpected uncertainty during value-based decision making. Neuron. 2013;79:191–201. doi: 10.1016/j.neuron.2013.04.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Peirce JW. PsychoPy - Psychophysics software in Python. Journal of Neuroscience Methods. 2007;162:8–13. doi: 10.1016/j.jneumeth.2006.11.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Posner MI. Orienting of attention. The Quarterly Journal of Experimental Psychology. 1980;32:3–25. doi: 10.1080/00335558008248231. [DOI] [PubMed] [Google Scholar]
  49. Preuschoff K, ’t Hart BM, Einhäuser W. Pupil dilation signals surprise: evidence for noradrenaline’s role in decision making. Frontiers in Neuroscience. 2011;5:115. doi: 10.3389/fnins.2011.00115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Puckett AM, DeYoe EA. The attentional field revealed by single-voxel modeling of fMRI time courses. The Journal of Neuroscience. 2015;35:5030–5042. doi: 10.1523/JNEUROSCI.3754-14.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Ress D, Backus BT, Heeger DJ. Activity in primary visual cortex predicts performance in a visual detection task. Nature Neuroscience. 2000;3:940–945. doi: 10.1038/78856. [DOI] [PubMed] [Google Scholar]
  52. Reuter M, Rosas HD, Fischl B. Highly accurate inverse consistent registration: a robust approach. NeuroImage. 2010;53:1181–1196. doi: 10.1016/j.neuroimage.2010.07.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Reynolds JH, Heeger DJ. The normalization model of attention. Neuron. 2009;61:168–185. doi: 10.1016/j.neuron.2009.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Samaha J, Sprague TC, Postle BR. Decoding and reconstructing the focus of spatial attention from the topography of alpha-band oscillations. Journal of Cognitive Neuroscience. 2016;28:1090–1097. doi: 10.1162/jocn_a_00955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Setsompop K, Gagoski BA, Polimeni JR, Witzel T, Wedeen VJ, Wald LL. Blipped-controlled aliasing in parallel imaging for simultaneous multislice echo planar imaging with reduced g-factor penalty. Magnetic Resonance in Medicine. 2012;67:1210–1224. doi: 10.1002/mrm.23097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Shaw ML, Shaw P. Optimal allocation of cognitive resources to spatial locations. Journal of Experimental Psychology. Human Perception and Performance. 1977;3:201–211. doi: 10.1037//0096-1523.3.2.201. [DOI] [PubMed] [Google Scholar]
  57. Shioiri S, Honjyo H, Kashiwase Y, Matsumiya K, Kuriki I. Visual attention spreads broadly but selects information locally. Scientific Reports. 2016;6:35513. doi: 10.1038/srep35513. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Smith SM, Jenkinson M, Woolrich MW, Beckmann CF, Behrens TEJ, Johansen-Berg H, Bannister PR, De Luca M, Drobnjak I, Flitney DE, Niazy RK, Saunders J, Vickers J, Zhang Y, De Stefano N, Brady JM, Matthews PM. Advances in functional and structural MR image analysis and implementation as FSL. NeuroImage. 2004;23 Suppl 1:S208–S219. doi: 10.1016/j.neuroimage.2004.07.051. [DOI] [PubMed] [Google Scholar]
  59. Sprague TC, Serences JT. Attention modulates spatial priority maps in the human occipital, parietal and frontal cortices. Nature Neuroscience. 2013;16:1879–1887. doi: 10.1038/nn.3574. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Taylor JET, Chan D, Bennett PJ, Pratt J. Attentional cartography: mapping the distribution of attention across time and space. Attention, Perception, & Psychophysics. 2015;77:2240–2246. doi: 10.3758/s13414-015-0943-0. [DOI] [PubMed] [Google Scholar]
  61. Tkacz-Domb S, Yeshurun Y. The size of the attentional window when measured by the pupillary response to light. Scientific Reports. 2018;8:11878. doi: 10.1038/s41598-018-30343-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Tünçok E, Carrasco M, Winawer J. Spatial attention alters visual cortical representation during target anticipation. bioRxiv. 2024 doi: 10.1101/2024.03.02.583127. [DOI]
  63. van Beilen M, Renken R, Groenewold ES, Cornelissen FW. Attentional window set by expected relevance of environmental signals. PLOS ONE. 2011;6:e21262. doi: 10.1371/journal.pone.0021262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Vo VA, Sprague TC, Serences JT. Spatial tuning shifts increase the discriminability and fidelity of population codes in visual cortex. The Journal of Neuroscience. 2017;37:3386–3401. doi: 10.1523/JNEUROSCI.3484-16.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Yeshurun Y. The spatial distribution of attention. Current Opinion in Psychology. 2019;29:76–81. doi: 10.1016/j.copsyc.2018.12.008. [DOI] [PubMed] [Google Scholar]

eLife Assessment

Ming Meng

This valuable study addresses a gap in our understanding of how the size of the attentional field is represented within the visual cortex. The evidence supporting the role of visual cortical activity is convincing, based on a novel modeling analysis of fMRI data. The results will be of interest to psychologists and cognitive neuroscientists.

Reviewer #1 (Public review):

Anonymous

The authors conducted an fMRI study to investigate the neural effects of sustaining attention to areas of different sizes. Participants were instructed to attend to alphanumeric characters arranged in a circular array. The size of attention field was manipulated in four levels, ranging from small (18 deg) to large (162 deg). They used a model-based method to visualize attentional modulation in early visual cortex V1 to V3, and found spatially congruent modulations of the BOLD response, i.e., as the attended area increased in size, the neural modulation also increased in size in the visual cortex. They suggest that this result is a neural manifestation of the zoom-lens model of attention and that the model-based method can effectively reconstruct the neural modulation in the cortical space.

The study is well-designed with sophisticated and comprehensive data analysis. The results are robust and show strong support for a well-known model of spatial attention, the zoom-lens model. Overall, I find the results interesting and useful for the field of visual attention research.

Comments on revisions:

The authors have addressed my previous comments satisfactorily. I would encourage the authors to make data and code publicly available, which appears to be the custom in this era.

Reviewer #2 (Public review):

Anonymous

Summary:

The study in question utilizes functional magnetic resonance imaging (fMRI) to dynamically estimate the locus and extent of covert spatial attention from visuocortical activity. The authors aim to address an important gap in our understanding of how the size of the attentional field is represented within the visual cortex. They present a novel paradigm that allows for the estimation of the spatial tuning of the attentional field and demonstrate the ability to reliably recover both the location and width of the attentional field based on BOLD responses.

Strengths:

(1) Innovative Paradigm: The development of a new approach to estimate the spatial tuning of the attentional field is a significant strength of this study. It provides a fresh perspective on how spatial attention modulates visual perception.

(2) Refined fMRI Analysis: The use of fMRI to track the spatial tuning of the attentional field across different visual regions is methodologically rigorous and provides valuable insights into the neural mechanisms underlying attentional modulation.

(3) Clear Presentation: The manuscript is well-organized, and the results are presented clearly, which aids in the reader's comprehension of the complex data and analyses involved.

Weaknesses:

(1) Lack of Neutral Cue Condition: The study does not include a neutral cue condition where the cue width spans 360{degree sign}, which could serve as a valuable baseline for assessing the BOLD response enhancements and diminishments in both attended and non-attended areas.

(2) Clarity on Task Difficulty Ratios: The explicit reasoning for the chosen letter-to-number ratios for various cue widths is not detailed. Ensuring clarity on these ratios is crucial, as it affects the task difficulty and the comparability of behavioral performance across different cue widths. It is essential that observed differences in behavior and BOLD signals are attributable solely to changes in cue width and not confounded by variations in task difficulty.

Comments on revisions:

(1) Please standardize the naming of error metrics across Figures 4-6 to improve clarity (e.g., "angular error" (Figure 4), "|angular error|" (Figure 5), and "absolute error" (Figure 6) appear to refer to the same measure). This inconsistency is also present in the main text.

(2) Consider briefly mentioning the baseline offset in Lines 179-186. It is included in Figures 4-7 and serves as a reference for interpreting attentional modulation alongside gain. Introducing it with other model parameters would improve clarity.

(3) It may be valuable to examine BOLD responses in unattended visual regions. As shown in Figure 2a, suppression patterns (e.g., the most negative responses) appear to vary in extent and distribution with attentional cue width. Analyzing these unattended regions may offer a more complete view of how attention shapes the spatial profile of cortical activity.

eLife. 2025 Sep 11;14:RP104222. doi: 10.7554/eLife.104222.3.sa3

Author response

Ilona M Bloem, Leah Bakst, Joseph T McGuire, Sam Ling

The following is the authors’ response to the original reviews

We thank the three reviewers for their insightful feedback. We look forward to addressing the raised concerns in a revised version of the manuscript. There were a few common themes among the reviews that we will briefly touch upon now, and we will provide more details in the revised manuscript.

First, the reviewers asked for the reasoning behind the task ratios we implemented for the different attentional width conditions. The different ratios were selected to be as similar as possible given the size and spacing of our stimuli (aside from the narrowest cue width of one bin, the ratios for the others were 0.66, .6 and .66). As Figure 1b shows, while the ratios were similar, task difficulty is not constant across cue widths: spreading attention makes the task more difficult generally. But, while the modeled width of the spatial distribution of attention changes monotonically with cue width, task difficulty does not. Furthermore, prior work has indicated that there is a relationship between task difficulty and the overall magnitude of the BOLD response, however we don’t suspect that this will influence the width of the modulation. How task difficulty influences the BOLD response is an important topic, and we hope that future work will investigate this relationship more directly.

Second, reviewers raised interest in the distribution of spatial attention in higher visual areas. In our study we focus only on early visual regions (V1-V3). This was primarily driven by pragmatic considerations, in that we only have retinotopic estimates for our participants in these early visual areas. Our modeling approach is dependent on having access to the population receptive field estimates for all voxels, and while the main experiment was scanned using whole brain coverage, retinotopy was measured in a separate session using a field of view only covering the occipital cortex.

Lastly, we appreciate the opportunity to clarify the purpose of the temporal interval analysis. The reviewer is correct in assuming we set out to test how much data is needed to recover the cortical modulation and how dynamic a signal the method can capture. This analysis does show that more data provides more reliable estimates, though the model was still able to recover the location and width of the attentional cue at shorter timescales of as few as two TRs. This has implications for future studies that may involve more dynamic tracking of the attentional field.

Public Reviews

Reviewer #1 (Public review):

The authors conducted an fMRI study to investigate the neural effects of sustaining attention to areas of different sizes. Participants were instructed to attend to alphanumeric characters arranged in a circular array. The size of attention field was manipulated in four levels, ranging from small (18 deg) to large (162 deg). They used a model-based method to visualize attentional modulation in early visual cortex V1 to V3, and found spatially congruent modulations of the BOLD response, i.e., as the attended area increased in size, the neural modulation also increased in size in the visual cortex. They suggest that this result is a neural manifestation of the zoomlens model of attention and that the model-based method can effectively reconstruct the neural modulation in the cortical space.

The study is well-designed with sophisticated and comprehensive data analysis. The results are robust and show strong support for a well-known model of spatial attention, the zoom-lens model. Overall, I find the results interesting and useful for the field of visual attention research. I have questions about some aspects of the results and analysis as well as the bigger picture.

(1) It appears that the modulation in V1 is weaker than V2 and V3 (Fig 2). In particular, the width modulation in V1 is not statistically significant (Fig 5). This result seems a bit unexpected. Given the known RF properties of neurons in these areas, in particular, smaller RF in V1, one might expect more spatially sensitive modulation in V1 than V2/V3. Some explanations and discussions would be helpful. Relatedly, one would also naturally wonder if this method can be applied to other extrastriate visual areas such as V4 and what the results look like.

We agree with the reviewer. It’s very interesting how the spatial resolution within different visual regions contributes to the overall modulation of the attentional field, and how this in turn would influence perception. Our data showed that fits in V1 appeared to be less precise than in V2 and V3. This can be seen in the goodness of fit of the model as well as the gain and absolute angular error estimates. The goodness of fit and gain were lowest in V1 and the absolute angular error was largest in V1 (see Figure 5). We speculate that the finer spatial granularity of V1 RFs was countered by a lower amplitude and SNR of attention-related modulation in V1, resulting in overall lower sensitivity to variation in attentional field width. Prior findings concur that the magnitude of covert spatial attention increases when moving from striate to extrastriate cortex (Bressler & Silver (2010); Buracas & Boynton (2007)). Notably, in our perception condition, V1 showed more spatially sensitive modulation (see Figure 7), consistent with the known RF properties of V1 neurons.

Regarding the second point: unfortunately, our dataset did not allow us to explore higherorder cortical regions with the model-based approach. While the main experiment was scanned using a sequence with whole brain coverage, the pRF estimates came from a separate scanning session which only had limited occipital coverage. Our modeling approach is dependent on the polar angle estimates from this pRF session. We now explicitly state this limitation in the methods (lines 87-89):

“In this session, the field of view was restricted to the occipital cortex to maximize SNR, thereby limiting the brain regions for which we had pRF estimates to V1, V2, and V3.”

(2) I'm a bit confused about the angular error result. Fig 4 shows that the mean angular error is close to zero, but Fig 5 reports these values to be about 30-40 deg. Why the big discrepancy? Is it due to the latter reporting absolute errors? It seems reporting the overall bias is more useful than absolute value.

The reviewer’s inference here is exactly right: Figure 4 shows signed error, whereas Figure 5 shows absolute error. We show the signed error for the example participant because, (1) by presenting the full distribution of model estimates for one participant, readers have access to a more direct representation of the data, and (2) at the individual level it is possible to examine potential directional biases in the location estimates (which do not appear to be present). As we don’t suspect a consistent directional bias across the group, we believe the absolute error in location estimates is more informative in depicting the precision in location estimates using the model-based approach. In the revised manuscript, we modified Figure 5 to make the example participant’s data visually distinct for easy comparison. We have clarified this reasoning in the text (results lines 59-64):

“The angular error distribution across blocks, separated by width condition, is shown in Figure 4 for one example participant to display block-to-block variation. The model reliably captured the location of the attentional field with low angular error and with no systematic directional bias. This result was observed across participants. We next examined the absolute angular error to assess the overall accuracy of our estimates.”

(3) A significant effect is reported for amplitude in V3 (line 78), but the graph in Fig 5 shows hardly any difference. Please confirm the finding and also explain the directionality of the effect if there is indeed one.

We realize that the y-axis scale of Figure 5 was making it difficult to see that gain decreases with cue width in area V3. Instead of keeping the y-axis limits the same across visual regions, we now adapt the y-axis scale of each subplot to the range of data values:

We now also add the direction of the effect in the text (results lines 83-86):

“We observed no significant relationship between gain and cue width in V1 and V2 (V1 t(7)=.54, p=.605; V2 t(7)=-2.19, p=.065), though we did find a significant effect in V3 illustrating that gain decreases with cue width (t(7)=-3.12, p=.017).”

(4) The purpose of the temporal interval analysis is rather unclear. I assume it has to do with how much data is needed to recover the cortical modulation and hence how dynamic a signal the method can capture. While the results make sense (i.e., more data is better), there is no obvious conclusion and/or interpretation of its meaning.

We apologize for not making our reasoning clear. We now emphasize our reasoning in the revised manuscript (results lines 110-112). Our objective was to quantify how much data was needed to recover the dynamic signal. As expected, we found that including more data reduces noise (averaging helps), but importantly, we found that we still obtained meaningful model fits even with limited data. We believe this has important implications for future paradigms that explore more dynamic deployment of spatial attention, where one would not want to average over multiple repetitions of a condition.

The first paragraph of the Temporal Interval Analysis section in the results now reads:

“In the previous analyses, we leveraged the fact that the attentional cue remained constant for 5-trial blocks (spatial profiles were computed by averaging BOLD measurements across a block of 10 TRs). We next examined the degree to which we were able to recover the attentional field on a moment-by-moment (TR-by-TR) basis. To do this, we systematically adjusted the number of TRs that contributed to the averaged spatial response profile. To maintain a constant number of observations across the temporal interval conditions, we randomly sampled a subset of TRs from each block. This allowed us to determine the amount of data needed to recover the attentional field, with a goal of examining the usability of our modeling approach in future paradigms involving more dynamic deployment of spatial attention.”

(5) I think it would be useful for the authors to make a more explicit connection to previous studies in this literature. In particular, two studies seem particularly relevant. First, how do the present results relate to those in Muller et al (2003, reference 37), which also found a zoom-lens type of neural effects. Second, how does the present method compare with spatial encoding model in Sprague & Serences (2013, reference 56), which also reconstructs the neural modulation of spatial attention. More discussions of these studies will help put the current study in the larger context.

We now make a more explicit connection to prior work in the discussion section (lines 34-54).

“We introduced a novel modeling approach that recovered the location and the size of the attentional field. Our data show that the estimated spatial spread of attentional modulation (as indicated by the recovered FWHM) consistently broadened with the cue width, replicating prior work (Müller et al., 2003; Herrmann et al., 2010). Our results go beyond prior work by linking the spatial profiles to pRF estimates, allowing us to quantify the spread of both attentional and perceptual modulation in degrees of polar angle. Interestingly, the FWHM estimates for the attentional and perceptual spatial profiles were highly similar. Additionally, for area V3 we replicate that the population response magnitude decreased with cue width (Müller et al., 2003; Feldmann-Wüstefeld and Awh, 2020). One innovation of our method is that it directly reconstructs attention-driven modulations of responses in visual cortex, setting it apart from other methods, such as inverted encoding models (e.g. Sprague & Serences, 2013). Finally, we demonstrated that our method has potential to be used in more dynamic settings, in which changes in the attentional field need to be tracked on a shorter timescale.”

(6) Fig 4b, referenced on line 123, does not exist.

We have corrected the text to reference the appropriate figure (Figure 5, results line 136).

Reviewer #2 (Public review):

Summary:

The study in question utilizes functional magnetic resonance imaging (fMRI) to dynamically estimate the locus and extent of covert spatial attention from visuocortical activity. The authors aim to address an important gap in our understanding of how the size of the attentional field is represented within the visual cortex. They present a novel paradigm that allows for the estimation of the spatial tuning of the attentional field and demonstrate the ability to reliably recover both the location and width of the attentional field based on BOLD responses.

Strengths:

(1) Innovative Paradigm: The development of a new approach to estimate the spatial tuning of the attentional field is a significant strength of this study. It provides a fresh perspective on how spatial attention modulates visual perception.

(2) Refined fMRI Analysis: The use of fMRI to track the spatial tuning of the attentional field across different visual regions is methodologically rigorous and provides valuable insights into the neural mechanisms underlying attentional modulation.

(3) Clear Presentation: The manuscript is well-organized, and the results are presented clearly, which aids in the reader's comprehension of the complex data and analyses involved.

We thank the reviewer for summarizing the strengths in our work.

Weaknesses:

(1) Lack of Neutral Cue Condition: The study does not include a neutral cue condition where the cue width spans 360°, which could serve as a valuable baseline for assessing the BOLD response enhancements and diminishments in both attended and non-attended areas.

We do not think that the lack of a neutral cue condition substantially limits our ability to address the core questions of interest in the present work. We set out to estimate the locus and the spread of covert spatial attention. By definition, a neutral cue does not have a focus of attention as the whole annulus becomes task relevant. We agree with the reviewer that how spatial attention influences the magnitude of the BOLD response is still not well defined; i.e., does attending a location multiplicatively enhance responses at an attended location or does it instead act to suppress responses outside the focus of attention? A neutral cue condition would be necessary to be able to explore these types of questions. However, our findings don’t rest on any assumptions about this. Instead, we quantify the attentional modulation with a model-based approach and show that we can reliably recover its locus, and reveal a broadening in the attentional modulation with wider cues.

We realize that throughout the original manuscript we often used the term ‘attentional enhancement,’ which might inadvertently specify an increase with respect to a neutral condition. To be more agnostic to the directionality of the effect, we have changed this to ‘attentional modulation’ and ‘attentional gain’ throughout the manuscript. Additionally, we have added results and visualizations for the baseline parameter to all results figures (Figures 4-7) to help readers further interpret our findings.

(2) Clarity on Task Difficulty Ratios: The explicit reasoning for the chosen letter-to-number ratios for various cue widths is not detailed. Ensuring clarity on these ratios is crucial, as it affects the task difficulty and the comparability of behavioral performance across different cue widths. It is essential that observed differences in behavior and BOLD signals are attributable solely to changes in cue width and not confounded by variations in task difficulty.

The ratios were selected to be as similar as possible given the size and spacing of our stimuli (aside from the narrowest cue width of one bin, the proportions for the others were 0.67, 0.60, and 0.67). We have updated the methods section to state this explicitly (methods lines 36-38):

“The ratios were selected to be as similar as possible given the size and spacing of our stimuli (aside from the one-bin cue, the proportions for the other cues were 0.67, 0.60, 0.67).”

As Figure 1b shows, task accuracy showed small and non-monotonic changes across the three larger cue widths, dissociable from the monotonic pattern seen for the modelestimated width of the attentional field. Furthermore, as prior work has indicated that there is a relationship between task difficulty and the overall magnitude of the BOLD response (e.g., Ress, Backus & Heeger, 2000), we would primarily expect effects of task difficulty on the gain or baseline rather than the width. How exactly task difficulty influences the BOLD response and whether this would, in fact, interact with the width of the attentional field is an important topic, and we hope that future work will investigate this relationship more directly.

We have clarified these points within the text, and now explicitly motivate future work looking at these important interactions (discussion lines 57-67):

“The observed effects of attentional field width were unlikely to be directly attributable to variation in task difficulty. Participants' task in our study was to discriminate whether more numbers or more letters were presented within a cued region of an iso-eccentric annulus of white noise. For our different cue widths, the ratios of numbers and letters were selected to be as similar as possible given the size and spacing of our stimuli. Changes in accuracy across the three larger cue widths were small and non-monotonic, implying task difficulty was dissociable from width per se. This dissociation bolsters the interpretability of our model fits; nevertheless, future work should further investigate how task difficulty interacts with the spread of the attentional field and the amplitude of attention-related BOLD effects (cf. Ress, Backus & Heeger, 2000).”

Reviewer #3 (Public review):

Summary:

In this report, the authors tested how manipulating the contiguous set of stimuli on the screen that should be used to guide behavior - that is, the scope of visual spatial attention - impacts the magnitude and profile of well-established attentional enhancements in visual retinotopic cortex. During fMRI scanning, participants attended to a cued section of the screen for blocks of trials and performed a letter vs digit discrimination task at each attended location (and judged whether the majority of characters were letters/digits). Importantly, the visual stimulus was identical across attention conditions, so any observed response modulations are due to topdown task demands rather than visual input. The authors employ population receptive field (pRF) models, which are used to sort voxel activation with respect to the location and scope of spatial attention and fit a Gaussian-like function to the profile of attentional enhancement from each region and condition. The authors find that attending to a broader region of space expands the profile of attentional enhancement across the cortex (with a larger effect in higher visual areas), but does not strongly impact the magnitude of this enhancement, such that each attended stimulus is enhanced to a similar degree. Interestingly, these modulations, overall, mimic changes in response properties caused by changes to the stimulus itself (increase in contrast matching the attended location in the primary experiment). The finding that attentional enhancement primarily broadens, but does not substantially weaken in most regions, is an important addition to our understanding of the impact of distributed attention on neural responses, and will provide meaningful constraints to neural models of attentional enhancement.

Strengths:

(1) Well-designed manipulations (changing location and scope of spatial attention), and careful retinotopic/pRF mapping, allow for a robust assay of the spatial profile of attentional enhancement, which has not been carefully measured in previous studies.

(2) Results are overall clear, especially concerning width of the spatial region of attentional enhancement, and lack of clear and consistent evidence for reduction in the amplitude of enhancement profile.

(3) Model-fitting to characterize spatial scope of enhancement improves interpretability of findings.

We thank the reviewer for highlighting the strengths of our study.

Weaknesses:

(1) Task difficulty seems to vary as a function of spatial scope of attention, with varying ratios of letters/digits across spatial scope conditions, which may complicate interpretations of neural modulation results

The reviewer is correct in observing that task accuracy varied across cue widths. Though we selected the task ratios to be as similar as possible given the size and spacing of our stimuli (aside from the narrowest cue width of one bin, the proportions for the others were 0.67, 0.60, and 0.67), behavioral accuracy across the three larger cue widths was not identical. Prior research has shown that there is a relationship between task difficulty and the overall magnitude of the BOLD response (e.g., Ress, Backus & Heeger, 2000). Thus, we would primarily expect effects of task difficulty on gain rather than width. How task difficulty influences the BOLD response and whether this would, in fact, interact with the width of the attentional field is an important topic, and we hope that future work will investigate this relationship more directly.

To clarify these points and highlight the potential for future work looking at these important interactions, we added the following text to the discussion section (discussion lines 57-67):

“The observed effects of attentional field width were unlikely to be directly attributable to variation in task difficulty. Participants' task in our study was to discriminate whether more numbers or more letters were presented within a cued region of an iso-eccentric annulus of white noise. For our different cue widths, the ratios of numbers and letters were selected to be as similar as possible given the size and spacing of our stimuli. Changes in accuracy across the three larger cue widths were small and non-monotonic, implying task difficulty was dissociable from width per se. This dissociation bolsters the interpretability of our model fits; nevertheless, future work should further investigate how task difficulty interacts with the spread of the attentional field and the amplitude of attention-related BOLD effects (cf. Ress, Backus and Heeger, 2000).”

(2) Some aspects of analysis/data sorting are unclear (e.g., how are voxels selected for analyses?)

We apologize for not describing our voxel selection in sufficient detail. Some of the questions raised in the private comments are closely related to this point, we therefore aim to clarify all concerns below:

- Voxel selection: To select voxels that contribute to the 1D spatial profiles, we relied on the independent pRF dataset. We first defined some general requirements that needed to be met. Specifically, (1) the goodness of fit (R2) of the pRF fits needed to be greater than 10%; (2) the estimated eccentricity had to fall within [0.7 9.1] degree eccentricity (to exclude voxels in the fovea and voxels with estimated eccentricities larger than the pRF mapping stimulus); (3) the estimated size must be greater than 0.01 degree visual angle.

Next, we included only voxels whose pRF overlapped with the white noise annulus. Estimated eccentricity was used to select all voxels whose eccentricity estimate fell within the annulus bounds. However, here it is also important to take the size of the pRF into account. Some voxels’ estimated eccentricity might fall just outside the annulus, but will still have substantial overlap due to the size of their pRF. Therefore, we further included all voxels whose estimated pRF size resulted in overlap with the annulus.

This implies that some voxels with greater eccentricities and larger pRF sizes contribute to the 1D profile, which will influence the spatial specificity of the 1D profiles. However, we want to emphasize that in our view, the exact FWHM value is not so much of interest, as this will always be dependent on the voxel selection and many other data processing steps. Instead, we focus on the relative differences of the FWHM driven by the parametric attentional cue width manipulation.

- Data sorting and binning. The reviewer raises an important point about how the FWHM value should be interpreted considering the data processing steps. To generate the 1D spatial profile, we binned voxels based on their estimated polar angle preference into 6degree bins and applied a moving average of 18 degrees to smooth the 1D profiles. Both of these processing steps will influence the spatial specificity of the profile. The binning step facilitates recentering based on cue center and combining across trials.

To explore the extent to which the moving average substantially impacted our results, we reran our analyses without that smoothing step. The vast majority of the results held. In V1, we found a significant effect of cue width on FWHM where the result was not significant previously (t(7)=2.52, p=.040). Additionally, when looking at the minimum number of TRs needed to see a significant effect of cue width on FWHM, without the smoothing step in V1 it took 10 TRs (not significant at 10 TRs previously), in V2 it took 5 TRs (10 previously), and in V3 it took 3 TRs (2 previously). The other notable difference is that FWHM was generally a bit larger when the moving average smoothing was performed. We have visualized the group results for the FWHM estimates below to help with comparison.

Author response image 1. No moving average smoothing.

Author response image 1.

Voxel selection methods have been clarified in methods section lines 132-139:

“Within each ROI, pRF modeling results were used to constrain voxel selection used in the main experiment. We excluded voxels with a preferred eccentricity outside the bounds of the pRF stimulus (<0.7° and >9.1°), with a pRF size smaller than 0.01°, or with poor spatial selectivity as indicated by the pRF model fit (R2 < 10%). Following our 2D visualizations (see below), we further constrained voxel selection by only including voxels whose pRF overlapped with the white noise annulus. We included all voxels with an estimated eccentricity within the annulus bounds, as well as voxels with an estimated pRF size that would overlap the annulus.”

Data binning methods have been clarified in methods section lines 154-159:

“Voxels with pRFs overlapping the white noise annulus were grouped into 60 bins according to their pRF polar angle estimate (6° polar angle bin width). We computed a median BOLD response within each bin. This facilitated the recentering of each profile to align all cue centers for subsequent combining across trials. To improve the signal-to-noise ratio, the resulting profile was smoothed with a moving average filter (width 18° polar angle; see Figure 2b).”

(3) While the focus of this report is on modulations of visual cortex responses due to attention, the lack of inclusion of results from other retinotopic areas (e.g. V3AB, hV4, IPS regions like IPS0/1) is a weakness

We agree with the reviewer that using this approach in other retinotopic areas would be of significant interest. In this case, population receptive field mapping occurred in a separate session with a field of view only covering the occipital cortex (in contrast to the experimental session, which had whole-brain coverage). Because our modeling approach relies on these pRF estimates, we were unable to explore higher visual areas. However, we hope future work will follow up on this.

We have added the following text to the methods section describing the pRF mapping session (lines 87-89):

“In this session, the field of view was restricted to the occipital cortex to maximize SNR, thereby limiting the brain regions for which we had pRF estimates to V1, V2, and V3.”

(4) Additional analyses comparing model fits across amounts of data analyzed suggest the model fitting procedure is biased, with some parameters (e.g., FWHM, error, gain) scaling with noise.

In this analysis, we sought to test how much data was needed to recover the attentional field, in view of the need for additional fMRI-based tools for use in tasks that involve more rapid dynamic adaptation of attention. Though we did find that more data reduced noise (and accordingly decreased absolute error and amplitude while increasing FWHM and R2), absolute angular error remained low across different temporal intervals (well below the chance level of 90°). With regard to FWHM, we believe that the more important finding is that the model-estimated FWHM was modulated by cue width at shorter timescales of as few as two TRs while maintaining relatively low angular error. We refrain from drawing conclusions here on the basis of the exact FWHM values, both because we don’t have a ground truth for the attentional field and because various processing pipeline steps can impact the values as well. Rather, we are looking at relative value and overall patterns in the estimates. The observed patterns imply that the model recovers meaningful modulation of the attentional field even at shorter time scales.

Recommendations for the authors:

Reviewer #1 (Recommendations for the authors):

Additional data reporting and discussion of results are needed as outlined in the public review.

Reviewer #2 (Recommendations for the authors):

(1) The current experimental design effectively captured the impact of varying cue widths on the BOLD response in the visual cortex. However, the inclusion of a neutral cue condition, where the cue width spans 360{degree sign} and all peripheral stimuli are attended, could serve as a valuable baseline. This would enable a quantitative assessment of how much the BOLD response is enhanced in specific spatial regions due to focused cues and, conversely, how much it is diminished in non-attended areas, along with the spatial extent of these effects.

Please refer to our response in the public review.

(2) While the study provides valuable insights into BOLD signal changes in visual areas corresponding to the focus of attention, it does not extend its analysis to the impact on regions outside the focus of attention. It would be beneficial to explore whether there is a corresponding decrease in BOLD signal in non-attended regions, and if identified, to describe the spatial extent and position of this effect relative to the attended area. Such an analysis could yield deeper insights into how attention influences activity across the visual cortex.

We agree with the reviewer that it is very interesting to examine the spread of attention across the whole visual field. Our experiment was designed to focus on width modulations at a fixed eccentricity, but future work should explore how the attentional field changes with eccentricity and interacts with spatial variations across the visual field. This is highlighted in our discussion section (lines 76-81):

“Future work can help provide a better understanding of the contribution of spatial attention by considering how the attentional field interacts with these well described spatial variations across the visual field. Measuring the full spatial distribution of the attentional field (across both eccentricity and polar angle) will shed light on how spatial attention guides perception by interacting with the non-uniformity of spatial representations.”

The addition of figure panels for the estimated baseline parameter in Figures 4-7 provides further information about BOLD effects in unattended regions of the annulus.

(3) The rationale behind the selection of task difficulty ratios for different cue widths, specifically the letter-to-number ratios of 1:0, 1:2, 2:3, and 3:6 (or vice versa) for cue widths of 18{degree sign}, 54{degree sign}, 90{degree sign}, and 162{degree sign} respectively, was not explicitly discussed. It would be beneficial to clarify the basis for these ratios, as they may influence the perceived difficulty of the task and thus the comparability of behavioral performance across different cue widths. Ensuring that the task difficulty is consistent across conditions is crucial for attributing differences in behavior and BOLD signals solely to changes in cue width and not confounded by variations in task difficulty.

Please refer to our response in the public review. We now clarify why we selected these ratios, and acknowledge more explicitly that behavioral performance differed across width conditions. See also our reply to private comment 1 from Reviewer 3 for some additional analyses examining task related influences.

Reviewer #3 (Recommendations for the authors):

(1) Task difficulty: the task seems exceptionally challenging. Stimuli are presented at a relativelyeccentric position for a very brief duration, and a large number of comparisons must be made across a broad region of space. This is reflected in the behavioral performance, which decreases rapidly as the scope of attention increases (Fig. 1). Because trials are blocked, does this change in task difficulty across conditions impact the degree to which neural responses are modulated? How should we consider differences in task difficulty in interpreting the conclusions (especially with respect to the amplitude parameter)? Also, note that the difficulty scales both with number of stimuli - as more need to be compared - but also with the ratio, which differs nonmonotonically across task conditions. One way to dissociate these might be RT: for 54/162, which both employ the same ratio of letter/digits and have similar accuracy, is RT longer for 162, which requires attending more stimuli?

In addition to our comments in response to the public review, we emphasize that the reviewer makes an important point that there are differences in task difficulty, though the ratios are as close as they can be given the size and spacing of our stimuli. Behavioral performance varied non-monotonically with cue width, bolstering our confidence that our monotonically increasing model-estimated width is likely not entirely driven by task difficulty. There nevertheless remain open questions related to how task difficulty does impact BOLD attentional modulation, which we hope future work will more directly investigate.

The reviewer's comments identify two ways our data might preliminarily speak to questions about BOLD attentional modulation and task difficulty. First: how might the amplitude parameter reflect task difficulty? This is an apt question as we agree with the reviewer that it would be a likely candidate in which to observe effects of task difficulty. We do find a small effect of cue width on our amplitude estimates (amplitude decreases with width) in V3. Using the same analysis technique to look at the relationship between task difficulty and amplitude, we find no clear relationship in any of the visual areas (all p >= 0.165, testing whether the slopes differed from zero at the group level using a one-sample t-test). We believe future work using other experimental manipulations should look more systematically at the relationship between task difficulty and amplitude of the attentional BOLD enhancement.

Second: Does the same ratio at different widths elicit different behavioral responses (namely accuracy and RT)? We followed the reviewer’s suggestion to compare performance between cue widths of three and nine (identical ratios, different widths; see Author response image 2 and Figure 5). We found that, using a paired t-test, behavioral accuracy differed between the two cue widths (mean accuracy of 0.73 versus 0.69, p = 0.008), with better performance for cue width three. RT did not differ significantly between the two conditions (paired t-test, p = 0.729). This could be due to the fact that participants were not incentivized to respond as quickly as possible, they merely needed to respond before the end of the response window (1.25 s) following the stimulus presentation (0.5 s). The comparisons for accuracy and RT (calculated from time of stimulus appearance) are plotted below:

Author response image 2.

Author response image 2.

In summary, with matched stimulus ratios, the wider cue was associated with worse (though not slower) performance. This could be due to the fact that more elements are involved and/or that tasks become more difficult when attending to a broader swath of space. Given these results, we believe that future studies targeting difficulty effects should use direct and independent manipulations of task difficulty and attentional width.

(2) Eye movements: while the authors do a good job addressing the average eccentricity of fixation, I'm not sure this fully addresses concerns with eye movements, especially for the character-discrimination task which surely benefits from foveation (and requires a great deal of control to minimize saccades!). Can the authors additionally provide data on, e.g., # of fixations within the attended stimulus annulus, or fixation heatmap, or # of saccades, or some other indicator of likelihood of fixating the letter stimuli for each condition?

We agree with the reviewer that this task is surely much easier if one foveated the stimuli, and it did indeed require control to minimize saccades to the annulus. (We appreciate the effort and motivation of our participants!) We are happy to provide additional data to address these reasonable concerns about eye movements. Below, we have visualized the number of fixations to the annulus, separated by participant and width. Though there is variability across participants, there are at most 16 instances of fixations to the annulus for a given participant, combined across all width conditions. The median number of fixations to the annulus per width is zero (shown in red). Considering the amount of time participants engaged in the task (between 8 and 12 runs of the task, each run with 100 trials), this indicates participants were generally successful at maintaining central fixation while the stimuli were presented.

Author response image 3.

Author response image 3.

We added the results of this analysis to the methods section (lines 205-208):

“Additionally, we examined the number of fixations to the white noise annulus itself. No participant had more than 16 fixations (out of 800-1200 trials) to the annulus during the task, further suggesting that participants successfully maintained fixation.”

(3) pRF sorting and smoothing: Throughout, the authors are analyzing data binned based on pRF properties with respect to the attended location ("voxels with pRFs overlapping with the white noise annulus", line 243-244) First, what does this mean? Does the pRF center need to be within the annulus? Or is there a threshold based on the pRF size? If so, how is this implemented? Additionally, considering the methods text in lines 242-247, the authors mention that they bin across 6 deg-wide bins and smooth with a moving average (18 deg), which I think will lead to further expansion of the profile of attentional enhancement (see also below)

We provide a detailed response in the public review. Furthermore, we have clarified the voxel selection procedure in the Methods (lines 132–139 & 154–159).

(4) FWHM values: The authors interpret the larger FWHMs estimated from their model-fitting than the actual size of the attended region as a meaningful result. However, depending on details of the sorting procedure above, this may just be due to the data processing itself. One way to identify how much expansion of FWHM occurs due to analysis is by simulating data given estimates of pRF properties for a 'known' shape of modulation (e.g., square wave exactly spanning the attended aperture) and compare the resulting FWHM to that observed for attention and perception conditions (e.g., Fig. 7c).

We provide a detailed response in the public review. The essence of our response is to refrain from interpreting the precise recovered FWHM values, which will be influenced by multiple processing steps, and instead to focus on relative differences as a function of the attentional cue width. Accordingly, we did not add simulations to the revised manuscript, although we agree with the reviewer that such simulations could shed light on the underlying spatial resolution, and how binning and smoothing influences the estimated FWHM. We have clarified our interpretation of FWHM results in the manuscript as follows:

Results lines 137-141:

“One possibility is that the BOLD-derived FWHM might tend to overestimate the retinotopic extent of the modulation, perhaps driven by binning and smoothing processing steps to create the 1D spatial profiles. If this were the case, we would expect to obtain similar FWHM estimates when modeling the perceptual modulations as well.”

Results lines 169-175:

“Mirroring the results from the attentional manipulation, FWHM estimates systematically exceeded the nominal size of the perceptually modulated region of the visual field. Comparing the estimated FWHMs of the perceptual and attentional spatial profiles (Figure 7c) revealed that the estimated widths were highly comparable (Pearson correlation r=0.664 across width conditions and visual regions). Importantly, the relative differences in FWHM show meaningful effects of both cue and contrast width in a similar manner for both attentional and perceptual forms of modulation.”

Discussion lines 16-22:

“We also found that the estimated spatial spread of the attentional modulation (as indicated by the recovered FWHM) was consistently wider than the cued region itself. We therefore compared the spread of the attention field with the spatial profile of a perceptually induced width manipulation. The results were comparable in both the attentional and perceptual versions of the task, suggesting that cueing attention to a region results in a similar 1D spatial profile to when the stimulus contrast is simply increased in that region.”

(5) Baseline parameter: looking at the 'raw' response profiles shown in Fig. 2b, it looks, at first, like the wider attentional window shows substantially lower enhancement. However, this seems to be mitigated by the shift of the curve downwards. Can the authors analyze the baseline parameter in a similar manner as their amplitude analyses throughout? This is especially interesting in contrast to the perception results (Fig. 7), for which the baseline does not seem to scale in a similar way.

We agree with the reviewer that the baseline parameter is worth examining, and have therefore added panels displaying the baseline parameter into all results figures (Figures 4-7). There was no significant association between cue width and baseline offset in any of the three visual regions.

(6) Outlier: Fig. 5, V2, Amplitude result seems to have a substantial outlier - is there any notable difference in e.g. retinotopy in this participant?

One participant indeed has a notably larger median amplitude estimate in V2. Below, we plot the spatial coverage from the pRF data for this participant (022), as well as all other participants.

Author response image 4.

Author response image 4.

Each subplot represents a participant's 2D histogram of included voxels for the 1D spatial profiles; the colors indicate the proportion of voxels that fell within a specific x,y coordinate bin. Note that this visualization only shows x and y estimates and does not take into account size of the pRF. While there is variation across participants in the visual field coverage, the overall similarity of the maps indicates that retinotopy is unlikely to be the explanation.

To further explore whether this participant might be an outlier, we additionally looked at behavioral performance, angular error and FWHM parameters as well as the goodness of fit of the model. On all these criteria this participant did not appear to be an outlier. We therefore see no reason to exclude this participant from the analyses.

(7) Fig. 4 vs Fig. 5: I understand that Fig. 4 shows results from a single participant, showing variability across blocks, while Fig. 5 shows aggregate results across participants. However, the Angular Error figure shows complementary results - Fig. 4 shows the variability of best-fit angular error, while Fig. 5 shows the average deviation (approximately the width of the error distribution). This makes sense I think, but perhaps the abs(error) for the single participant shown in Fig. 4 should be included in the caption so we can easily compare between figures.

That's right: the Figure 4 results show the signed error, whereas the Figure 5 results show the absolute error. We agree that reporting the absolute error values for the example participant would facilitate comparison. Rather than add the values to the text, we have made the example participant’s data visually distinct within Figure 5 for easy comparison.

(8) Bias in model fits: the analysis shown in Fig. 6 compares the estimated parameters across amounts of data used to compute attentional modulation profiles for fitting those parameters. If the model-fitting procedure were unbiased, my sense is we would likely see no impact of the number of TRs on the parameters (R^2 should improve, abs(error) should improve, but FWHM, amplitude, baseline, etc should be approximately stable, if noisier). However, instead, it looks like more/less data leads to biased estimates, such that FWHM is biased to be smaller with more noise, and amplitude is biased to be larger. This suggests (to me) that the fit is landing on a spiky function that captures a noise wiggle in the profile. I don't think this is a problem for the primary results across the whole block of 10 TRs, which is the main point of the paper. Indeed, I'm not sure what this figure is really adding, since the single-TR result isn't pursued further (see below).

Please refer to our response in the public review, comment 4.

(9) 'Dynamics': The paper, starting in the title, claims to get at the 'dynamics' of attention fields. At least to me, that word implies something that changes over time (rather than across trials). Maybe I'm misinterpreting the intent of the authors, but at present, I'm not sure the use of the word is justified. That said, if the authors could analyze the temporal evolution of the attention field through each block of trials at 1- or 2-TR resolution, I think that could be a neat addition to the paper and would support the claim that the study assays dynamic attention fields.

We thank the reviewer for giving us a chance to speak more directly to the dynamic aspect of our approach. Here, we specifically use the word “dynamic” to refer to trial-to-trial dynamics. Importantly, our temporal interval analysis suggests that we can recover information about the attentional field at a relatively fine-grained temporal resolution (a few seconds, or 2 TRs). Following this methodological proof-of-concept to dynamically track the attentional field, we are excited about future work that can more directly investigate the manner in which the attentional field evolves through time, especially in comparison to other methods that first require training on large amounts of data.

(10) Correction for multiple comparisons across ROIs: it seems that it may be necessary to correct statistical tests for multiple comparisons across each ROI (e.g., Fig. 5 regression tests). If this isn't necessary, the authors should include some justification. I'm not sure this changes any conclusions, but is worth considering.

We appreciate the opportunity to explain our reasoning regarding multiple comparisons. We thought it appropriate not to correct as we are not comparing across regions and are not treating tests of V1, V2, and V3 as multiple opportunities to support a common hypothesis. Rather, the presence or absence of an effect in each visual region is a separate question. We would typically perform correction for multiple comparisons to control the familywise error rate when conducting a family of tests addressing a common hypothesis. We have added this to the Methods section (lines 192-195):

“No multiple comparison correction was applied, as the different tests for each region are treated as separate questions. However, using a threshold of 0.017 for p-values would correct for comparisons across the three brain regions.”

However, we are happy to provide corrected results. If we use Bonferroni correction across ROIs (i.e. multiply p-values by three), there are some small changes from significant to only trending towards significance, but these changes don’t affect any core results. The changes that go from significant to trending are:

Associated with Figure 5 – In V3, the relationship of cue width to amplitude goes from a p-value of 0.017 to 0.051.

Associated with Figure 6 –

V1: the effect of cue width on FWHM goes from p = 0.043 to 0.128.

V2: the effect of TR on both FWHM and R2 goes from p = ~0.02 to ~0.06.

V3: the effect of cue width on amplitude goes from p = 0.024 to 0.073.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Bloem I, Bakst L, McGuire JT, Ling S. 2025. Data and code for: "Dynamic estimation of the attentional field from visual cortical activity". Open Science Framework. [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    MDAR checklist

    Data Availability Statement

    All preprocessed fMRI data (extracted time series from each voxel), behavioral data, and eye tracking data reported in this study, as well as all code necessary to reproduce the analysis and recreate the figures are publicly available at Open Science Framework and GitHub (copy archived at Bloem, 2025) respectively.

    The following dataset was generated:

    Bloem I, Bakst L, McGuire JT, Ling S. 2025. Data and code for: "Dynamic estimation of the attentional field from visual cortical activity". Open Science Framework.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES