Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2025 Apr 10:2025.04.10.648136. [Version 1] doi: 10.1101/2025.04.10.648136

V1 superficial layers create a saliency map that feeds forward to the parietal cortex for attentional guidance

Chen Liu 1,2,, Chengwen Liu 3,, Laurentius Huber 4, Li Zhaoping 5, Peng Zhang 1,2,*
PMCID: PMC12190818  PMID: 40568089

Abstract

A salient visual object with a distinct feature from the surrounding environment automatically captures attention. While the saliency signals have been found in many brain regions, their source remains highly controversial. Here, we investigated the neural origin of visual saliency using cortical layer-dependent functional magnetic resonance imaging (fMRI) of cerebral blood volume (CBV) at 7 Tesla. Behaviorally, human observers were better at detecting foreground bars with a larger orientation contrast from uniformly oriented background bars. Saliency-sensitive signals were strongest in the superficial layers of the primary visual cortex (V1), and in the middle layers of the intraparietal sulcus (IPS) of the parietal cortex. Layer-dependent effective connectivity revealed the transmission of saliency signals along the feedforward pathway from V1 to IPS. Furthermore, behavioral sensitivity to the foreground stimulus correlated significantly with the fMRI response in the superficial layers of V1. Our findings provide mesoscale evidence that a visual saliency map is created by iso-feature suppression through lateral inhibition in the superficial layers of V1, and then feeds forward to attentional control brain regions to guide attention and eye movements.

Keywords: Saliency, Attention, Laminar fMRI, VASO CBV

Introduction

Only a small fraction of sensory inputs can be selected by our attention for further processing. Attention is guided both endogenously by top-down factors and exogenously by bottom-up factors (Baluch & Itti, 2011; Bisley & Goldberg, 2010; Buschman & Kastner, 2015; Hopfinger et al., 2000; Moore & Zirnsak, 2017). Although the bottom-up guidance is simpler, it remains controversial which brain area computes saliency from exogenous visual input. It is believed that the brain generates a saliency map from feature contrast to guide attention shifts and eye movements (Itti et al., 1998; Koch & Ullman, 1985; Li, 2002; Treisman & Gelade, 1980). According to a traditional and dominant computational model (Itti & Koch, 2000, 2001; Koch & Ullman, 1985), different feature maps (e.g., color, luminance, orientation, etc.) are generated by center-surround mechanisms and then combined into a feature-agnostic master map of saliency, presumably in a high-order brain region such as the parietal cortex. However, it has been shown that saliency behavior can also be explained by the V1 Saliency Hypothesis (V1SH) (Li, 2002, 2008), according to which the saliency map is generated by intracortical mechanisms in the primary visual cortex (V1) and represented by the firing rates of feature-selective neurons.

Visual saliency-related signals have been observed in the frontal (Moore & Armstrong, 2003; Thompson & Bichot, 2005) and parietal cortices (Bisley & Goldberg, 2010; Chen et al., 2020), V4 (Mazer & Gallant, 2003) and V1 (Yan et al., 2018; Zhang et al., 2012), as well as subcortical regions including the pulvinar of the thalamus (Shipp, 2004) and the superior colliculus (SC) of the midbrain (Fecteau & Munoz, 2006; White, Berg, et al., 2017; White, Kan, et al., 2017). An important question is which area generates the saliency signals originally and which areas merely receive and utilize the saliency signals generated in another area. For example, a recent study showed that inactivation of the posterior parietal cortex in monkeys significantly weakened the relationship between image salience and visually guided behavior (Chen et al., 2020), this could suggest that the parietal cortex is the original source of the saliency signals, but this is also consistent with the idea that parietal cortex utilizes the saliency signals and combine them with top-down signals for guiding behavior (Bisley & Goldberg, 2010). Using salient but subliminal visual inputs, a human neuroimaging study found responses to saliency in the earliest (C1) component of event-related potentials (ERPs), as well as fMRI responses to saliency in V1 but not in the parietal cortex (Zhang et al., 2012). One monkey electrophysiological study reported that the latency of the saliency signals in SC is shorter than that in V1 (White, Kan, et al., 2017), but this short SC latency is longer than the latency of the saliency signals in V1 reported in another monkey study (Yan et al., 2018). We aim to shed light on the origin of saliency signals through a high-resolution fMRI study, which allows us to examine laminar patterns of responses to saliency signals. As will be explained later, the laminar patterns are diagnostic of the sources of the saliency signals, and we will focus on the question of whether the source for the saliency signals is in V1, IPS, or SC.

Recent advances in high-resolution fMRI at ultra-high magnetic field have enabled non-invasive imaging of mesoscale functional units, such as cortical layers and columns, in the human brain at submillimeter resolution (Cheng, 2018; Huber et al., 2020; Vizioli et al., 2023). Using submillimeter blood oxygen level dependent (BOLD) fMRI techniques, feedforward and feedback pathways of stimulus-driven activity and top-down attentional modulation have been investigated in the human visual cortex (de Hollander et al., 2021; Lawrence et al., 2019; Liu et al., 2021; Van Mourik et al., 2023). However, the laminar specificity of T2* weighted BOLD fMRI is limited by the draining vein effect (Fukuda et al., 2021; Huber et al., 2019; Kay et al., 2019; Uludag & Havlicek, 2021). The Vascular Space Occupancy (VASO)-based cerebral blood volume (CBV) technique is an inversion recovery MRI technique that detects changes in blood volume by nulling the signal from blood water (Lu et al., 2003, 2004), particularly in small vessels such as arterioles and capillaries. Therefore, compared to T2*w BOLD signals that are sensitive to blood oxygen in large veins, CBV changes are more closely related to the site of neuronal activation, offering higher spatial specificity to resolve layer-dependent signals across cortical depth, despite having relatively lower functional sensitivity (Finn et al., 2019; Huber et al., 2017).

Different brain areas for creating the saliency signals should lead to different laminar patterns of the distributions of these saliency signals in themselves as well as in other brain areas as the saliency signals are propagated through feedforward and feedback connections. One can therefore examine these laminar patterns by laminar fMRI techniques to diagnose the source of the saliency signals. Figure 1 schematizes the laminar pattern of saliency-related signals in V1 and parietal cortex according to each of the three possible origins of the saliency signal: parietal cortex, V1, and SC, based on the canonical neural circuits in the primate visual system (Felleman & Van Essen, 1991). If the parietal cortex generates the saliency map and modulates V1 activity through feedback connections, the laminar pattern of saliency-related signals should be consistent with cortico-cortical feedback, stronger in the deep and superficial layers of V1 and in the deep layer of the parietal cortex. According to the V1 Saliency Hypothesis (V1SH), saliency signals emerge through intracortical interactions mediated by horizontal connections in the superficial layers of V1. Consequently, the strongest saliency-related activity should be observed in the superficial layers of V1 and in the middle layers of high-level cortex that receive feedforward input. Finally, if the saliency signals originate in the SC, they would modulate activity more strongly in the middle and superficial cortical layers through the tecto-thalamo-cortical pathway (Shipp, 2003, 2004, 2007).

Figure 1. Three possible hypotheses on the origins of saliency signals and their consequent laminar patterns of the saliency signals through feedforward and feedback neural pathways.

Figure 1.

In the feature combination hypothesis, saliency signal initially emerged in frontoparietal areas and then modulate neural activity of parietal and early visual cortices through feedback connections (Red lines); In V1SH, saliency signals initially emerge via intracortical interactions mediated by the horizontal connections in superficial layers of V1 (primary visual cortex). These signals can then feedforward to middle layers of parietal cortex (Green lines); In subcortical computation hypothesis, saliency signals are generated in SC (superior colliculus) and then modulate middle and superficial layer activity of cerebral cortex through thalamo-cortical pathways. On the right, relative strengths of saliency signals in the deep (D), middle (M), and superficial (S) layers of V1 and parietal cortex that should arise from each hypothesis are shown.

In the current study, to reveal the neural source of visual saliency, we employed submillimeter fMRI at 7 Tesla with a CBV-weighted VASO fMRI technique to investigate cortical depth-dependent responses to a foreground stimulus consisting of four bars presented at different orientation contrasts (θ = 90°, 15° or 0°) relative to a background of iso-oriented bars (Figure 2). The laminar profile of the saliency-sensitive fMRI response, its correlation with behavioral sensitivity to saliency, and the layer-dependent functional connectivity collectively support the hypothesis that the saliency map is generated in the superficial layers of V1 and subsequently projects to attention-control brain regions, such as the parietal cortex.

Figure 2.

Figure 2.

(A) Stimulus and procedure of the psychophysical experiment. The orientation contrast between foreground and background bars is defined as θ. (B) Luminance contrast sensitivity (Sbehavior) for detecting the 90° and 15° foreground. Each dot represents data from one participant. Error bars represent standard errors of the mean (SEM). *** p < 0.001. (C) Stimulus and procedure of the fMRI experiment. Texture stimuli with three θ conditions were presented in separate blocks. The foreground bars were randomly presented to the lower-left or lower-right of fixation in each block. (D) Localizer activations corresponding to the retinotopic location of the foreground bars in V1 and V2 of the early visual cortex and IPS of the parietal cortex (thresholded at p < 0.05 uncorrected in a representative participant), as indicated by the dashed circles. Dashed lines mark the borders of the brain areas.

Results

Behavioral sensitivity was higher to a foreground region with a larger orientation contrast against the background

In a psychophysical experiment, the bar stimulus was presented for 200 ms (Figure 2A). The foreground bars were presented either to the lower-left or the lower-right of fixation, at an eccentricity of 4.3°, and participants pressed one of two buttons to report whether the foreground bars were at lower-left or lower right. The orientation of the uniformly oriented background bars was randomly chosen for each trial, while the foreground bars had an orientation contrast θ of either 90° or 15° relative to the background bars. Across trials, the Michaelson luminance contrast between the bars (all of which had the same luminance) and the uniform gray background was adaptively adjusted using a 3-down-1-up staircase procedure, implemented independently for the two θ conditions, to identify a threshold contrast Cthreshold at which the participant can perform the task at 80% accuracy. The results showed that luminance contrast sensitivity Sbehavior=1/Cthreshold was significantly higher for a foreground with a larger orientation contrast θ (Figure 2B, 90° vs. 15°, t19 = 13.829, p < 0.001). Specifically, the behavioral sensitivity to detect the 90° foreground (Sbehavior90=18.719) was approximately twice that of the sensitivity to detect the 15° foreground Sbehavior15=9.720) (Figure 2B). It should be noted that, at our luminance contrast sensitivity Sbehavior to detect the foreground, the luminance of all the bars were presented at a suprathreshold level. Hence, the luminance contrast threshold Cthreshold measured in our task does not reflect the visibility of the entire stimulus but rather the saliency of the foreground bars in the context of the background texture. When the orientation contrast θ or saliency is higher, the foreground is more easily detected and localized, so that our participants could do our task at a lower luminance contrast of the bars. Hence, our luminance contrast sensitivity Sbehavior is an assessment of the saliency of the foreground bars, and we also call Sbehavior our behavioral saliency score. Furthermore, to correlate with the fMRI signals for saliency later, we define behavioral sensitivity to saliency SSbehavior=Sbehavior90-Sbehavior15.

Saliency-sensitive responses were strongest in the superficial layers of V1 and in the middle layers of IPS

In the 7T fMRI experiment, a slice-saturation slab-inversion VASO (SS-SI-VASO) sequence with 0.82-mm isotropic resolution was used to measure CBV-weighted signals across cortical depth from the posterior part of the brain (Huber et al., 2014, 2017). In each inversion TR, blood-nulled and not-nulled 3D-EPI volumes were acquired to minimize T2*-weighted BOLD contamination. The imaging slab was oriented in an oblique-coronal orientation to cover the early visual cortices and the parietal cortex from the dorsal part of the occipito-parietal lobe (Figure S1C). According to the literature and hypotheses on the neural origin of visual saliency (Figure 1), our analyses focused on V1 and V2 of the early visual cortex, and the intraparietal sulcus (IPS) of the parietal cortex.

The stimulus and procedure of the fMRI experiment are illustrated in Figure 2C. Bar stimuli with three orientation contrast conditions (θ = 90°, 15° and 0°, randomly selected to be either clockwise or counterclockwise to the orientation of the background bars) were presented in the lower visual field in separate 30-second blocks, interleaved with 15-second fixation period. Within a stimulus block, θ is fixed, and each stimulus was presented for 200 ms, with a random inter-stimulus interval (ISI) ranging from 200 to 400 ms. In each stimulus, the iso-oriented background bars had a random orientation, and the foreground bars appeared randomly either to the lower left or lower right of fixation. During the experiment, participants were instructed to maintain fixation and to detect small luminance changes of the fixation cross. In two localizer runs, naturalistic stimuli of the same size and location as the foreground bars were presented in the lower-left and lower-right visual fields in separate blocks (Figure S2). The cortical ROIs of the foreground bars are shown in Figure 2D for a representative participant.

To quantify our results and to account for individual differences in mean response amplitude, we define the normalized CBV foreground response as SfMRI(s,l,θ)=r(s,l,θ)rnorm(s)r. In which, r(s,l,θ) is the original CBV response for subject s, layer l, at foreground orientation contrast θ,rnorm(s)=l,θr(s,l,θ)2l,θ1 and r=srnorm(s)s1. CBV-weighted fMRI responses in the foreground ROIs SfMRI in V1, V2 and IPS are plotted as a function of cortical depth in Figure 3A (see Figure S3 for the BOLD response profiles).

Figure 3.

Figure 3.

(A) Upper: Normalized CBV responses (SfMRI in percent signal change) to the orientation foregrounds in V1, V2, and IPS. Lower: CBV response difference between the 90° and 15° foreground conditions (SSfMRI=SfMRI90-SfMRI15). Each gray dot represents data from one participant. Error bars represent SEM. *, ** and *** indicate p < 0.05, p < 0.01 and p < 0.001, respectively. D: deep, M: middle, S: superficial. (B) Volume activation maps of the saliency-sensitive response SSfMRI in the foreground ROIs of V1, V2, and IPS for a representative participant. Red lines indicate the boundary between gray matter (GM) and cerebrospinal fluid (CSF). Yellow lines indicate the boundary between GM and white matter (WM). (C) Left: Surface activation maps of saliency-sensitive response (SSfMRI in percent signal change) in different cortical depths of V1 in a representative participant (same participant as in B). Dashed circles indicate the location of foreground on the cortical surface. Right: Saliency maps (averaged across all participants) in image space.

A two-way repeated measures (rm) ANOVA with cortical depth (deep, middle, and superficial) and orientation contrast (θ = 90°, 15°, and 0°) as within-subject factors was performed on the normalized foreground response SfMRI. A significant effect of orientation contrast θ was found in all three brain regions (V1: F2, 38 = 7.080, p = 0.002; V2: F2, 38 = 4.763, p = 0.014; IPS: F2, 38 = 4.673, p = 0.015), indicating stronger responses to the foreground with larger orientation contrast (i.e., 90° compared to 15° and 0° conditions). However, CBV responses to the 15° foreground were slightly reduced compared to the 0° condition. This is due to the partial volume effect of background suppression (Figure S4). Similar effects of background suppression were found for the 90° and 15° conditions. Therefore, to avoid the partial volume effect of background suppression, the saliency-sensitive fMRI response SSfMRI=SfMRI90-SfMRI15 was calculated as the difference between the fMRI responses to the 90° foreground and the 15° foreground conditions (Figure 3A, lower).

The amplitude of SSfMRI differed significantly across cortical depth in V1, V2, and IPS (V1: F2, 38 = 9.696, p = 0.002; V2: F2, 38 = 9.833, p = 0.001; IPS: F2, 38 = 5.421, p = 0.008). In V1, the saliency-sensitive response was strongest in the superficial layers (S vs. D: t19 = 3.408, p = 0.003; S vs. M: t19 = 3.660, p = 0.002). In V2, it was stronger in the superficial and middle layers than in the deep layers (S vs. D: t19 = 3.232, p = 0.004; M vs. D: t19 = 4.582, p < 0.001). In IPS, SSfMRI was strongest in the middle layers (M vs. D: t19 = 2.659, p = 0.015; M vs. S: t19 = 2.983, p = 0.008). Saliency-sensitive responses calculated by the original CBV responses showed a similar pattern of results (Figure S5). The laminar profiles of the response difference between the 90° and 0° conditions, SFMRI90-SFMRI0, were also qualitatively similar (Figure S6). In Figure 3B, the activation map in a representative participant clearly showed the strongest saliency sensitivity SSfMRI in the superficial layers of V1, in the middle and superficial layers of V2, and in the middle layers of IPS, consistent with the laminar profiles for a V1 origin of saliency in Figure 3A. To better illustrate the V1 saliency map across cortical depths, saliency-sensitive responses SSfMRI were shown on the cortical surface in the right hemisphere of the representative subject (Figure 3C, left). The saliency maps in the image space were reconstructed using a population receptive field (pRF) model, shown in the right panels of Figure 3C. Saliency signals were strongest in the superficial layer of V1 at the location of foreground bars. These findings are consistent with the V1 hypothesis in Figure 1, in which the saliency signals initially emerge by horizontal interactions in the superficial layers of V1 and subsequently feed forward to the middle layers of IPS.

Behavioral sensitivity to saliency correlated with the saliency-sensitive fMRI response in the superficial layers of V1

To investigate whether the fMRI response can predict the behavioral sensitivity to the foreground bars, we calculated Pearson’s correlations between the behavioral sensitivity to saliency SSbehavior and the saliency-sensitive fMRI response SSfMRI. For each brain region, the cortical depth with the strongest saliency-sensitive fMRI response was selected. SSbehavior showed a significant correlation with SSfMRI in the superficial layers of V1 (Figure 4A, r = 0.542, p = 0.014), but not with those in the superficial layers of V2 (r = −0.218, p = 0.356) or the middle layers of IPS (r = −0.203, p = 0.390). The correlation in V1 remained significant after family-wise error (FWE) correction by permutation test (Figure 4B). The correlation results supported that the saliency-sensitive fMRI response in the superficial layers of V1 can predict the behavioral sensitivity to saliency.

Figure 4.

Figure 4.

(A) Behavioral sensitivity to saliency SSbehavior, i.e., the difference in luminance contrast sensitivity to detect the two different foregrounds, is correlated with fMRI saliency-sensitive response SSfMRI in the superficial layers of V1, giving a significant Pearson’s correlation coefficient r. Each circle represents one participant. (B) Pearson’s correlation coefficients r in the superficial layers of V1 and V2, and in the middle layer of IPS. The dashed line indicates the significance threshold after family-wise error (FWE) correction by a permutation test. (C) The feedforward and feedback pathways of saliency signals between V1 and IPS, assessed by the layer-specific generalized psychophysiological interaction (gPPI) analysis of functional connectivity. (D) Functional connectivity difference (90° vs. 15°) of the two pathways in C. Each gray dot represents a single participant. Error bars represent the standard errors of the mean across participants. ** indicates p < 0.01.

Layer-dependent effective connectivity revealed saliency transmission from V1 to IPS

A generalized psychophysiological interaction (gPPI) analysis was conducted to examine layer-dependent effective connectivity of saliency signals between V1 and IPS. For the feedforward connection, the superficial layer of V1 was used as the seed ROI, and the middle layer of IPS served as the target ROI. For the feedback connection, the seed ROI was the deep layer of IPS, and the target ROI was the deep layer of V1 (Figure 4C). The gPPI results revealed a significant connectivity difference between the 90° and 15° foreground conditions for the feedforward connection from V1 superficial to IPS middle (t19 = 3.407, p = 0.006, Holm corrected). In contrast, no significant difference was observed for the feedback connection from IPS deep to V1 deep (t19 = 1.719, p = 0.102) (Figure 4D). Due to the low signal-to-noise ratio of the VASO signal, here we used the BOLD signals for the gPPI analysis. The gPPI results using VASO and BOLD signals were similar (Figure S7), both showing a significant feedforward connection from V1 to IPS. The connections across all cortical layers are shown in Figure S7, indicating that the middle layer of IPS received significant feedforward input from V1. The layer-dependent connectivity results further support the hypothesis that the saliency signal propagates along the feedforward pathway from V1 to IPS.

Fixation stability was not affected by the presence of foreground bars

Since participants were performing a fixation task in the fMRI experiment, the fixation stability was unlikely to be affected by the presence of the foreground. To confirm that participants could maintain fixation, we monitored the eye movements of 12 participants in a control experiment using the same stimuli and fixation task as in the fMRI experiment. Fixation stability was assessed using the bivariate contour ellipse area (BCEA) of 95% fixations (Steinman, 1965). BCEA results showed no significant difference among the three θ conditions (F2, 22 = 0.291, p = 0.750, BF10 = 0.227) (Figure S8). Therefore, our findings were unlikely to be due to unstable fixation.

Discussion

The neural source of bottom-up saliency has been highly controversial. Using 7T VASO-CBV fMRI at submillimeter resolution, we investigated layer-dependent responses to salient visual input in the early visual cortex and the parietal cortex of human participants, and the functional connectivity between these brain regions to elucidate how saliency signals are propagated. Saliency-sensitive responses were strongest in the superficial layers of V1, and in the middle layers of IPS. Meanwhile, behavioral sensitivity to saliency correlated significantly with the fMRI saliency-sensitive responses in the superficial layers of V1, but not with those in the middle layers of IPS. Furthermore, layer-dependent effective connectivity revealed significant feedforward transmission of saliency signals from V1 to IPS. These findings support the hypothesis that a bottom-up saliency map is created by contextual influences mediated by horizontal connections, which are most abundant in the superficial layers of V1, and is subsequently propagated to other brain regions, such as IPS and SC, to guide attention and eye movements.

While the V1 saliency hypothesis (V1SH) is supported by substantial behavioral evidence (Koene & Zhaoping, 2007; Li, 2008; Zhaoping, 2012; Zhaoping & May, 2007; Zhaoping & Zhe, 2015), only a few neuroscience studies have directly tested it. Using EEG and fMRI, (Zhang et al., 2012) studied the neural responses to salient foregrounds resembling our stimuli, but the bars were made subliminal by backward masking so that observers were at chance level in identifying the foreground location, while the saliency effect of the foreground (manifested as its cueing effect in discriminating a subsequent probe) remained evident. A more salient foreground (by a larger orientation contrast) evoked a stronger C1 component of occipital ERPs, and 3T fMRI results revealed saliency signals in V1 but not in IPS. Although these findings provide strong neuroimaging support for the V1SH, several limitations should be noted. First of all, top-down attention may not be completely abolished by backward masking, since it can modulate stimulus processing even without awareness of visual stimuli (Bahrami et al., 2007; Kanai et al., 2006). In addition, C1 amplitude can also be modulated by top-down attention (Kelly et al., 2008; Rauss et al., 2009, 2012). Finally, the lack of a saliency signal in IPS could be due to the limited spatial resolution of 3T fMRI to isolate the retinotopic signal of the foreground. Meanwhile, for supraliminal salient visual inputs, saliency-sensitive responses in parietal cortex have been observed in fMRI and neurophysiological studies (Bisley & Goldberg, 2010; Geng & Mangun, 2009; Gottlieb et al., 1998). In one neurophysiological study (Yan et al., 2018), neural activity in monkey V1 showed both early (40–60 ms after stimulus onset) and late (80–200 ms) responses to a salient orientation singleton bar in a background of uniformly oriented bars, with the early component being present even during the fixation task, prior to any experience in the singleton detection task. However, another study (White, Kan, et al., 2017) found that saliency-related signals appeared earlier in the SC (around 65 ms) than in V1 (around 121 ms). Therefore, neuroscience studies have found saliency-related signals in multiple brain regions, and it remains controversial as to which brain regions are, respectively, the sources and recipients of the saliency signals.

With a large brain coverage and a submillimeter resolution, the laminar fMRI results in our study provide important mesoscale evidence for the cortical microcircuitry underlying the saliency computations for bottom-up attention in humans. The laminar response patterns from multiple brain regions (Figure 3A/B) and the layer-dependent connectivity (Figure 4C/D) are consistent with a canonical feedforward pathway from V1 to IPS (Felleman & Van Essen, 1991; Markov et al., 2014). Since the intracortical horizontal connections in V1 are most abundant in the superficial layers (Gilbert & Wiesel, 1983; Rockland & Lund, 1983; Angelucci et al., 2017), and since contextual influences mediated by these connections is a key mechanism underlying the saliency computation in the V1 model (Li, 2002), our finding that the strongest saliency-related signal is in the superficial layers of V1 strengthens the evidence for V1 as the source of the saliency signals. While the previous studies cannot rule out the superior colliculus (SC) as a potential source for saliency computation, our findings provide evidence against this possibility. If the saliency map was generated in the SC, the saliency signals should be stronger in the middle layer than in the superficial layer of V1 (Figure 1), since the thalamic input mediated from the SC via the tecto-thalamo-cortical pathway should be strongest in the middle layer of V1 (Shipp, 2003, 2007). Our laminar results (Figure 3) are not consistent with this prediction. Instead, they are aligned with recent evidence that visual responses in the SC of awake primates depend on signals routed through the LGN and V1 (Katz et al., 2024): deactivation of the LGN abolished SC responses to visual stimuli. It is therefore more likely that the saliency signals are generated in V1 and then project to the superficial layers of the SC to guide attention and eye movements (May, 2006; Shipp, 2004).

In contrast to the results reported by (Zhang et al., 2012), we observed significant saliency-sensitive signals in IPS (Figure 3A/B). This discrepancy may be attributed to the higher spatial resolution of our 7T fMRI data, which allowed better isolation of fMRI responses to the foreground from those to the background bars. Another possible reason is the visibility of the texture stimuli, which were made invisible by backward masking in Zhang et al. (2012). However, given the low-level mechanisms in V1 and the feedforward transmission of saliency signals, it is unlikely to be strongly influenced by conscious awareness (He et al., 1996; He & MacLeod, 2001; Rees et al., 2002; Tong, 2003). By making the stimuli invisible, the previous study largely eliminated conscious awareness-related feedback from a high-level cortex such as the IPS, while the saliency signals were still observed in V1. However, as a consequence, the feedforward propagation of the saliency signals from V1 to IPS cannot be illustrated due to the lack of signal in the high-level region. Overcoming the limitations of the previous study, our study represents important advances in demonstrating the neural origin and recipient of saliency signals. We reveal the V1 origin and feedforward propagation of saliency signals from V1 to IPS, by allowing the manifestation of the saliency signals in the parietal cortex and by layer-specific response and connectivity analyses. Furthermore, the laminar response pattern for horizontal contextual influences in the superficial layers of V1 provides cortical support for the proposed mechanism of saliency computation in the V1 model.

To summarize, mesoscale evidence at the level of cortical layers resolves key discrepancies in the neural origin of saliency signals. Our results demonstrate that visual saliency is initially created by iso-feature suppression in the superficial layers of V1 and then propagates to higher-order attentional control brain regions to guide attention and eye movements. These findings not only shed light on the neural and computational mechanisms of visual attention, but also provide valuable guidance for the development of biologically inspired artificial visual systems.

Methods and Materials

Participants

Twenty healthy volunteers (9 females, aged 22–42 years) participated in the psychophysical and fMRI experiments. The sample size was determined based on the large effect (Cohen’s f > 0.4) of orientation contrast on V1 responses in a previous study (Zhang et al., 2012), as calculated by G*Power (Faul et al., 2009). A sample size of 20 participants was sufficient to achieve 90% statistical power for detecting a large effect in a within-subjects ANOVA. Twelve volunteers (5 females, 22–30 years of age) participated in the control movement with eye movement recordings, 6 of them also participated in the fMRI experiment. All participants had normal or corrected-to-normal vision and gave written informed consent before the experiments. Experimental protocols were approved by the Institutional Review Board of the Institute of Biophysics, Chinese Academy of Sciences (No. 2012-IBP-011).

Stimuli and procedures

Visual stimuli were generated in MATLAB (Mathworks Inc.) with psychophysics toolbox extension (Brainard, 1997; Pelli, 1997) on a Windows 10 operating system computer. Stimuli and procedures were illustrated in Figure 2A&2C. Each texture stimulus (12.5 by 6.5 degrees of visual angle) consisted of 17 × 9 bars of 0.5° × 0.05° in visual angle presented in the lower visual field on a gray background (L0=43.8cd/m2), with 0.75° spacing between the bars. In the foreground present conditions, all bars were identical except for a foreground of 2 × 2 bars titled either 90° or 15° (orientation contrast θ) relative to the background bars. The foreground bars were located at 4.3° of eccentricity either to the lower-left or lower-right of the fixation. In the uniform texture condition, all bars were uniformly oriented. The orientation of background bars was randomly chosen from 0° to 180°.

Psychophysical experiment

Visual stimuli were presented on a Cambridge Research System Display++ LCD monitor (32 inches, 1920 × 1080 pixels, 120-Hz refresh rate) at a viewing distance of 100 cm. The head of subject was stabilized using a chin rest.

A bright fixation cross (73 cd/m2) was shown at the center of the screen. Participants were informed to keep fixation throughout the experiment. A trial began after a button press. After 1 s of fixation, the texture stimulus was presented for 200 ms, followed by another 1 s of fixation for the participant to respond (Figure 2A) before the next trial began. Participants were instructed to make 2-alternatives-forced-choice response by pressing one of two buttons to indicate whether the foreground appeared in the lower left or lower right visual field. Bar luminance Lbar (ranged from 43.8 to 78.9 cd/m2) was adjusted by a 3-down-1-up staircase procedure to determine the participant’s contrast sensitivity for detecting the foreground. The luminance contrast of the bars from the uniform gray background was defined by the Michaelson contrast C=Lbar-L0Lbar+L0. Two blocks of trials (120 trials each) were collected for each orientation contrast condition. Each block consisted of two independent staircases (60 trials each) that were randomly intermixed across trials. Thus, a total of four 60-trial staircases were collected for each condition. A psychometric curve was fitted by a Weibull function relating the performance in each condition to the contrast of the bars. The contrast detection threshold Cthreshold was defined as the contrast level at 80% accuracy, and then the contrast sensitivity was defined as S=1/Cthreshold.

FMRI experiment

Stimuli were presented with an MRI safe projector (1024 × 768 pixels @ 60 Hz) on a translucent screen placed behind the head coil. Participants viewed the stimuli through a mirror mounted inside the head coil at a viewing distance of 69 cm from the screen. Visual stimuli were identical as those in the psychophysical experiment except that the luminance of the bars and the uniform background were Lbar=87.5cd/m2 and L0=0.4cd/m2, respectively. The purpose of using a dark background was to enhance activations to the texture stimuli. Texture stimuli with different orientation contrast with the background bars (θ = 90°, 15°, or 0°) were presented in separated 30-s blocks, interleaved with 15-second fixation periods (Figure 2C). Each run lasting 270 s, with 6 blocks of stimuli per run (2 blocks for each θ). In each block, there were 60 texture stimuli, each lasting 200 ms followed by 200/300/400 ms of random intervals. The foreground bars were randomly presented in the lower-left or lower-right visual field. Subjects were instructed to maintain fixation and count the number of luminance changes in the fixation cross. Feedback about the numbers of fixation changes was given at the end of each run. A total of 9 runs were collected for each participant. The order of stimulus conditions was balanced across runs and participants.

Two localizer runs were used for each participant to localize the retinotopic regions of interest (ROIs) for the foreground bars. Each run consisted of eight 30-s stimulus blocks, interleaved with 15-s fixation intervals (Figure S2). In each stimulus block, colored naturalistic stimuli (same size as the foreground in the main experiment) were presented at 4 images per second (116.7 ms presentation interleaved with 133.3 ms fixation), at one of the foreground locations. Participants were instructed to maintain fixation and to count fixation changes as in the main experiment.

Eye tracking experiment

Eye movements were recorded using an SR Research Eyelink 1000 Plus eye tracker. The centroids of the left eye’s pupil were sampled at 1000 Hz. Visual stimuli were presented on a 27 inches LCD monitor at 1920 × 1080 resolution and 60 Hz refresh rate, at a viewing distance of 100 cm. Participants stabilized their heads using a chin rest. The texture stimuli and fixation task were identical to those employed in the fMRI experiment. Eye gaze positions were preprocessed in the following steps: eye blinks removal (200 ms before and after the eye blinks) with linear interpolation of removed data, linear detrend, baseline correction per stimulus block. A 95% bivariate contour ellipse area (BCEA) was used to quantify the distribution of fixation samples (Steinman, 1965).

MRI data acquisition

MRI data were collected on a 7T scanner (Siemens MAGNETOM) with a 32-channel receive single-channel transmit head coil (Nova Medical) in the Beijing MRI Center for Brain Research. The gradient coil had a maximum amplitude of 70 mT/m, 200 us minimum gradient rise time, and 200 T/m/s maximum slew rate. Participants used a bite bar to reduce head motion. Functional data of blood nulled and not-nulled images were acquired using a SS-SI-VASO sequence with 3D-EPI readout (Huber et al., 2014; Poser et al., 2010) (0.82-mm isotropic voxels, 26 oblique-coronal slices, FOV = 133 × 177 mm2, paired-TR = 5020 ms, TE = 25 ms, TI1 = 1744 ms, FA = 26°, bandwidth = 1064 Hz/pixel, Partial Fourier = 6/8 with 8 POCS iterations, GRAPPA = 3 with FLASH reference). The 3D slab of fMRI acquisition was placed in oblique-coronal orientation (Figure S1C). Images with reversed phase and read directions were acquired for susceptibility distortion correction. T1-weighted anatomical images were acquired using an MP2RAGE sequence (Marques et al., 2010) (0.7-mm isotropic voxels, FOV = 224 × 224 mm2, 256 sagittal slices, TE = 3.05 ms, TR = 4000 ms, TI1 = 750 ms, FA = 4°, TI2 = 2500 ms, FA = 5°, bandwidth = 240 Hz/pixel, phase and slice partial Fourier = 7/8, GRAPPA = 3).

MRI data analysis

Preprocessing of functional data

The preprocessing of fMRI data was performed using AFNI (Cox, 1996), LAYNII (Huber et al., 2021) and custom Python code. The following steps were applied for blood nulled and not-nulled volumes separately: EPI image distortion correction with blip-up/down non-linear warping method, rigid-body correction of head motion, spatially up-sampled by a factor of 2 (3dResample in AFNI) (J. Wang et al., 2022). Since the blood nulled and not-nulled volumes were acquired alternately, the 3D timeseries were up-sampled by a factor of 2 (3dUpsample in AFNI) and shifted by TR/2 for BOLD correction. The VASO or CBV-weighted volume was calculated by dividing the nulled by not-nulled volumes (LN_BOCO in LAYNII). To minimize image blur, all spatial transformations were combined and applied to the functional images in one interpolation (3dAllineate in AFNI with sinc method). After per run scaling as percent signal change, a general linear model (GLM) with a canonical HRF (BLOCK4 in AFNI) was used to estimate the VASO and BOLD signal change from baseline. The fMRI signals in all cortical depths were fitted equally well by convolving the stimulus regressor with the canonical HRF (Figure S9). Thus, the GLM analysis should not introduce bias across cortical depth. Head motions and low-order drifts were included as regressors of no interest in the GLM.

Surface segmentation

The T1-w anatomical volume was aligned to the mean EPI image after motion correction, and segmented into white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF) compartments using FreeSurfer with the “high-res” option (Fischl, 2012). FreeSurfer’s segmentation results were visually inspected and manually edited to remove dura mater and sinus, etc., ensuring accurate surface segmentation (Figure S1A). High-density surface mesh was generated by a factor of 4 to better align with the up-sampled volume grid (Polimeni et al., 2018). The cortical depth profiles were constructed using the equi-volume model (Waehnert et al., 2014). Two equi-volume intermediate surfaces between the WM and pial surfaces were generated using mripy (https://github.com/herrlich10/mripy), dividing the GM into three equi-volume layer compartments (Figure S1A). For each voxel, the layer weight (volume percentage among WM, CSF and the three layers compartments) were calculated and subsequently used in a spatial regression approach to unmix layer activity (Kok et al., 2016).

ROI definition

Anatomical ROIs of the early visual areas (V1-V2) were defined on the cortical surface by a 7T retinotopic atlas of the Human Connectome Project using Neuropythy tools (Benson et al., 2014, 2018). The anatomical ROI for IPS was taken from the Wang15 atlas (L. Wang et al., 2015). Vertices with positive activation from baseline in the localizer (p < 0.05, uncorrected) were selected within the anatomical ROI as the foreground ROI (Figure 2D). Accordingly, surround ROIs in V1 and V2 were acquired by subtracting the foreground ROI from vertices with significant activation to the texture stimuli (p < 0.05, uncorrected). Surface ROIs were then projected to the EPI volume to select voxels in a column-wise manner.

Visual Field Map Reconstruction

The visual field map of saliency-related signals was reconstructed for each cortical depth using the saliency-dependent (θ90 - θ15) CBV responses and the population receptive field (pRF) maps from the Benson atlas (Benson et al., 2018; Dumoulin & Wandell, 2008). To account for individual differences in the pRF maps, the pRF locations were linearly transformed by aligning the peak of localizer activation to the foreground location. The visual field map was reconstructed by multiplying the each node’s CBV response with its pRF map and then summed across all nodes (Mo et al., 2018). Finally, the reconstructed map was normalized (divided) by the maximum response across all cortical depths, and then averaged across participants.

Layer-dependent effective connectivity

A generalized psychophysiological interaction (gPPI) method was used to investigate the effective connectivity across cortical depths in V1 and IPS (McLaren et al., 2012). CBV timeseries of the foreground ROI were averaged within each cortical depth of V1 and IPS. Due to the uncertainty about CBV HRF, the PPI term was calculated by the dot product of the stimulus regressor (boxcar function, shifted by one TR due to the HRF delay) and the seed timecourse without deconvolution (O’Reilly et al., 2012; Sharoh et al., 2019). The GLM includes the seed region time course, stimulus regressors, PPI terms of the three θ conditions, along with baseline regressors and head motion parameters. Saliency-dependent functional connectivity was defined as the difference between the PPI beta weights of the θ90 and θ15 conditions.

Statistical analysis

Two-way repeated-measures (rm) ANOVA with cortical depths (deep, middle, and superficial) and θ (90°, 15°, and 0°) as within-subject factors were performed on the ROI-averaged CBV responses. All data met the sphericity assumption of ANOVA. In cases where the sphericity assumption was violated, p values were adjusted using the Greenhouse-Geisser correction. One-way rm ANOVA was performed to evaluate the difference in saliency-related signals across cortical depths, followed paired t-tests if there was a significant difference. No FWE correction is needed for paired t-tests followed by a significant one-way ANOVA with three levels (Levin et al., 1994).

Pearson’s r was calculated to assess the correlation between the fMRI saliency sensitivity SSfMRI and the behavioral saliency sensitivity SSbehavior. Data were assessed for multivariate normality using Shapiro-Wilk test (all p > 0.05). A permutation test was performed to correct for the family-wise error (FWE) across ROIs. In each permutation, Pearson’s r was calculated after shuffling the correspondence between fMRI signal and perceptual sensitivity across participants. The largest r value was taken and the permutation was repeated to generate a null distribution, from which the FWE was derived for the original r values.

Supplementary Material

1

Acknowledgement

This study was supported by STI2030-Major Projects (2022ZD0211900 and 2021ZD0204200 to P.Z.), National Natural Science Foundation of China (31871107 and 31930053 to P.Z.). Hunan Provincial Natural Science Foundation (2024JJ6313 to C.W.L), Scientific Research Foundation of Hunan Provincial Education Bureau (24B0058 to C.W.L). NIH Intramural Program of NIMH/NINDS (#ZIC MH002884 to L.H.).

References

  1. Angelucci A., Bijanzadeh M., Nurminen L., Federer F., Merlin S., & Bressloff P. C. (2017). Circuits and Mechanisms for Surround Modulation in Visual Cortex. Annual Review of Neuroscience, 40, 425–451. 10.1146/annurev-neuro-072116-031418 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bahrami B., Lavie N., & Rees G. (2007). Attentional load modulates responses of human primary visual cortex to invisible stimuli. Current Biology: CB, 17(6), 509–513. 10.1016/j.cub.2007.01.070 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Baluch F., & Itti L. (2011). Mechanisms of top-down attention. Trends in Neurosciences, 34(4), 210–224. 10.1016/j.tins.2011.02.003 [DOI] [PubMed] [Google Scholar]
  4. Benson N. C., Butt O. H., Brainard D. H., & Aguirre G. K. (2014). Correction of Distortion in Flattened Representations of the Cortical Surface Allows Prediction of V1-V3 Functional Organization from Anatomy. PLoS Computational Biology, 10(3), e1003538. 10.1371/journal.pcbi.1003538 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Benson N. C., Jamison K. W., Arcaro M. J., Vu A. T., Glasser M. F., Coalson T. S., Van Essen D. C., Yacoub E., Ugurbil K., Winawer J., & Kay K. (2018). The Human Connectome Project 7 Tesla retinotopy dataset: Description and population receptive field analysis. Journal of Vision, 18(13), 23. 10.1167/18.13.23 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bisley J. W., & Goldberg M. E. (2010). Attention, Intention, and Priority in the Parietal Lobe. Annual Review of Neuroscience, 33(1), 1–21. 10.1146/annurev-neuro-060909-152823 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Brainard D. H. (1997). The Psychophysics Toolbox. Spatial Vision, 10(4), 433–436. [PubMed] [Google Scholar]
  8. Buschman T. J., & Kastner S. (2015). From Behavior to Neural Dynamics: An Integrated Theory of Attention. Neuron, 88(1), 127–144. 10.1016/j.neuron.2015.09.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chen X., Zirnsak M., Vega G. M., Govil E., Lomber S. G., & Moore T. (2020). Parietal Cortex Regulates Visual Salience and Salience-Driven Behavior. Neuron, 106(1), 177–187.e4. 10.1016/j.neuron.2020.01.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cheng K. (2018). Exploration of human visual cortex using high spatial resolution functional magnetic resonance imaging. NeuroImage, 164, 4–9. 10.1016/j.neuroimage.2016.11.018 [DOI] [PubMed] [Google Scholar]
  11. Cox R. W. (1996). AFNI: Software for Analysis and Visualization of Functional Magnetic Resonance Neuroimages. Computers and Biomedical Research, 29(3), 162–173. 10.1006/cbmr.1996.0014 [DOI] [PubMed] [Google Scholar]
  12. de Hollander G., van der Zwaag W., Qian C., Zhang P., & Knapen T. (2021). Ultra-high field fMRI reveals origins of feedforward and feedback activity within laminae of human ocular dominance columns. NeuroImage, 228, 117683. 10.1016/j.neuroimage.2020.117683 [DOI] [PubMed] [Google Scholar]
  13. Dumoulin S. O., & Wandell B. A. (2008). Population receptive field estimates in human visual cortex. NeuroImage, 39(2), 647–660. 10.1016/j.neuroimage.2007.09.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Faul F., Erdfelder E., Buchner A., & Lang A.-G. (2009). Statistical power analyses using G*Power 3.1: Tests for correlation and regression analyses. Behavior Research Methods, 41(4), 1149–1160. 10.3758/BRM.41.4.1149 [DOI] [PubMed] [Google Scholar]
  15. Fecteau J. H., & Munoz D. P. (2006). Salience, relevance, and firing: A priority map for target selection. Trends in Cognitive Sciences, 10(8), 382–390. 10.1016/j.tics.2006.06.011 [DOI] [PubMed] [Google Scholar]
  16. Felleman D. J., & Van Essen D. C. (1991). Distributed Hierarchical Processing in the Primate Cerebral Cortex. Cerebral Cortex, 1(1), 1–47. 10.1093/cercor/1.1.1 [DOI] [PubMed] [Google Scholar]
  17. Finn E. S., Huber L., Jangraw D. C., Molfese P. J., & Bandettini P. A. (2019). Layer-dependent activity in human prefrontal cortex during working memory. Nature Neuroscience, 22(10), 1687–1695. 10.1038/s41593-019-0487-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Fischl B. (2012). FreeSurfer. NeuroImage, 62(2), 774–781. 10.1016/j.neuroimage.2012.01.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Fukuda M., Poplawsky A. J., & Kim S.-G. (2021). Time-dependent spatial specificity of high-resolution fMRI: Insights into mesoscopic neurovascular coupling. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences, 376(1815), 20190623. 10.1098/rstb.2019.0623 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Geng J. J., & Mangun G. R. (2009). Anterior Intraparietal Sulcus is Sensitive to Bottom–Up Attention Driven by Stimulus Salience. Journal of Cognitive Neuroscience, 21(8), 1584–1601. 10.1162/jocn.2009.21103 [DOI] [PubMed] [Google Scholar]
  21. Gilbert C. D., & Wiesel T. N. (1983). Clustered intrinsic connections in cat visual cortex. Journal of Neuroscience, 3(5), 1116–1133. 10.1523/JNEUROSCI.03-05-01116.1983 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Gottlieb J. P., Kusunoki M., & Goldberg M. E. (1998). The representation of visual salience in monkey parietal cortex. Nature, 391(6666), 481–484. 10.1038/35135 [DOI] [PubMed] [Google Scholar]
  23. He S., Cavanagh P., & Intriligator J. (1996). Attentional resolution and the locus of visual awareness. Nature, 383(6598), 334–337. 10.1038/383334a0 [DOI] [PubMed] [Google Scholar]
  24. He S., & MacLeod D. I. A. (2001). Orientation-selective adaptation and tilt after-effect from invisible patterns. Nature, 411(6836), 473–476. 10.1038/35078072 [DOI] [PubMed] [Google Scholar]
  25. Hopfinger J. B., Buonocore M. H., & Mangun G. R. (2000). The neural mechanisms of top-down attentional control. Nature Neuroscience, 3(3), 284–291. 10.1038/72999 [DOI] [PubMed] [Google Scholar]
  26. Huber L., Finn E. S., Handwerker D. A., Bönstrup M., Glen D. R., Kashyap S., Ivanov D., Petridou N., Marrett S., Goense J., Poser B. A., & Bandettini P. A. (2020). Sub-millimeter fMRI reveals multiple topographical digit representations that form action maps in human motor cortex. NeuroImage, 208, 116463. 10.1016/j.neuroimage.2019.116463 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Huber L., Handwerker D. A., Jangraw D. C., Chen G., Hall A., Stüber C., Gonzalez-Castillo J., Ivanov D., Marrett S., Guidi M., Goense J., Poser B. A., & Bandettini P. A. (2017). High-Resolution CBV-fMRI Allows Mapping of Laminar Activity and Connectivity of Cortical Input and Output in Human M1. Neuron, 96(6), 1253–1263.e7. 10.1016/j.neuron.2017.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Huber L., Ivanov D., Krieger S. N., Streicher M. N., Mildner T., Poser B. A., Möller H. E., & Turner R. (2014). Slab-selective, BOLD-corrected VASO at 7 Tesla provides measures of cerebral blood volume reactivity with high signal-to-noise ratio: SS-SI-VASO Measures Changes of CBV in Brain. Magnetic Resonance in Medicine, 72(1), 137–148. 10.1002/mrm.24916 [DOI] [PubMed] [Google Scholar]
  29. Huber L., Poser B. A., Bandettini P. A., Arora K., Wagstyl K., Cho S., Goense J., Nothnagel N., Morgan A. T., Van Den Hurk J., Müller A. K., Reynolds R. C., Glen D. R., Goebel R., & Gulban O. F. (2021). LayNii: A software suite for layer-fMRI. NeuroImage, 237, 118091. 10.1016/j.neuroimage.2021.118091 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Huber L., Uludağ K., & Möller H. E. (2019). Non-BOLD contrast for laminar fMRI in humans: CBF, CBV, and CMRO2. NeuroImage, 197, 742–760. 10.1016/j.neuroimage.2017.07.041 [DOI] [PubMed] [Google Scholar]
  31. Itti L., & Koch C. (2000). A saliency-based search mechanism for overt and covert shifts of visual attention. Vision Research, 40(10–12), 1489–1506. 10.1016/S0042-6989(99)00163-7 [DOI] [PubMed] [Google Scholar]
  32. Itti L., & Koch C. (2001). Computational modelling of visual attention. Nature Reviews Neuroscience, 2(3), 194–203. 10.1038/35058500 [DOI] [PubMed] [Google Scholar]
  33. Itti L., Koch C., & Niebur E. (1998). A model of saliency-based visual attention for rapid scene analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(11), 1254–1259. 10.1109/34.730558 [DOI] [Google Scholar]
  34. Kanai R., Tsuchiya N., & Verstraten F. A. J. (2006). The Scope and Limits of Top-Down Attention in Unconscious Visual Processing. Current Biology, 16(23), 2332–2336. 10.1016/j.cub.2006.10.001 [DOI] [PubMed] [Google Scholar]
  35. Katz L. N., Yu G., & Krauzlis R. J. (2024). Visual activity in primate superior colliculus depends on visual cortex. Journal of Vision, 24(10), 744. 10.1167/jov.24.10.744 [DOI] [Google Scholar]
  36. Kay K., Jamison K. W., Vizioli L., Zhang R., Margalit E., & Ugurbil K. (2019). A critical assessment of data quality and venous effects in sub-millimeter fMRI. NeuroImage, 189, 847–869. 10.1016/j.neuroimage.2019.02.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Kelly S. P., Gomez-Ramirez M., & Foxe J. J. (2008). Spatial Attention Modulates Initial Afferent Activity in Human Primary Visual Cortex. Cerebral Cortex, 18(11), 2629–2636. 10.1093/cercor/bhn022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Koch C., & Ullman S. (1985). Shifts in selective visual attention: Towards the underlying neural circuitry. Human Neurobiology, 4(4), 219–227. [PubMed] [Google Scholar]
  39. Koene A. R., & Zhaoping L. (2007). Feature-specific interactions in salience from combined feature contrasts: Evidence for a bottom–up saliency map in V1. Journal of Vision, 7(7), 6. 10.1167/7.7.6 [DOI] [PubMed] [Google Scholar]
  40. Kok P., Bains L. J., van Mourik T., Norris D. G., & de Lange F. P. (2016). Selective Activation of the Deep Layers of the Human Primary Visual Cortex by Top-Down Feedback. Current Biology, 26(3), 371–376. 10.1016/j.cub.2015.12.038 [DOI] [PubMed] [Google Scholar]
  41. Lawrence S. J., Norris D. G., & de Lange F. P. (2019). Dissociable laminar profiles of concurrent bottom-up and top-down modulation in the human visual cortex. eLife, 8, e44422. 10.7554/eLife.44422 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Levin J. R., Serlin R. C., & Seaman M. A. (1994). A controlled, powerful multiple-comparison strategy for several situations. Psychological Bulletin, 115(1), 153–159. 10.1037/0033-2909.115.1.153 [DOI] [Google Scholar]
  43. Li Z. (2002). A saliency map in primary visual cortex. Trends in Cognitive Sciences, 6(1), 9–16. 10.1016/S1364-6613(00)01817-9 [DOI] [PubMed] [Google Scholar]
  44. Li Z. (2008). Attention capture by eye of origin singletons even without awareness—A hallmark of a bottom-up saliency map in the primary visual cortex. Journal of Vision, 8(5), 1. 10.1167/8.5.1 [DOI] [PubMed] [Google Scholar]
  45. Liu C., Guo F., Qian C., Zhang Z., Sun K., Wang D. J., He S., & Zhang P. (2021). Layer-dependent multiplicative effects of spatial attention on contrast responses in human early visual cortex. Progress in Neurobiology, 207, 101897. 10.1016/j.pneurobio.2020.101897 [DOI] [PubMed] [Google Scholar]
  46. Lu H., Golay X., Pekar J. J., & Van Zijl P. C. M. (2003). Functional magnetic resonance imaging based on changes in vascular space occupancy. Magnetic Resonance in Medicine, 50(2), 263–274. 10.1002/mrm.10519 [DOI] [PubMed] [Google Scholar]
  47. Lu H., Patel S., Luo F., Li S.-J., Hillard C. J., Ward B. D., & Hyde J. S. (2004). Spatial correlations of laminar BOLD and CBV responses to rat whisker stimulation with neuronal activity localized by Fos expression. Magnetic Resonance in Medicine, 52(5), 1060–1068. 10.1002/mrm.20265 [DOI] [PubMed] [Google Scholar]
  48. Markov N. T., Ercsey-Ravasz M. M., Ribeiro Gomes A. R., Lamy C., Magrou L., Vezoli J., Misery P., Falchier A., Quilodran R., Gariel M. A., Sallet J., Gamanut R., Huissoud C., Clavagnier S., Giroud P., Sappey-Marinier D., Barone P., Dehay C., Toroczkai Z., … Kennedy H. (2014). A weighted and directed interareal connectivity matrix for macaque cerebral cortex. Cerebral Cortex (New York, N.Y.: 1991), 24(1), 17–36. 10.1093/cercor/bhs270 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Marques J. P., Kober T., Krueger G., Van Der Zwaag W., Van De Moortele P.-F., & Gruetter R. (2010). MP2RAGE, a self bias-field corrected sequence for improved segmentation and T1-mapping at high field. NeuroImage, 49(2), 1271–1281. 10.1016/j.neuroimage.2009.10.002 [DOI] [PubMed] [Google Scholar]
  50. May P. J. (2006). The mammalian superior colliculus: Laminar structure and connections. In Büttner-Ennever J. A. (Ed.), Progress in Brain Research (Vol. 151, pp. 321–378). Elsevier. 10.1016/S0079-6123(05)51011-2 [DOI] [PubMed] [Google Scholar]
  51. Mazer J. A., & Gallant J. L. (2003). Goal-related activity in V4 during free viewing visual search. Evidence for a ventral stream visual salience map. Neuron, 40(6), 1241–1250. 10.1016/s0896-6273(03)00764-5 [DOI] [PubMed] [Google Scholar]
  52. McLaren D. G., Ries M. L., Xu G., & Johnson S. C. (2012). A generalized form of context-dependent psychophysiological interactions (gPPI): A comparison to standard approaches. NeuroImage, 61(4), 1277–1286. 10.1016/j.neuroimage.2012.03.068 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Mo C., He D., & Fang F. (2018). Attention Priority Map of Face Images in Human Early Visual Cortex. The Journal of Neuroscience, 38(1), 149–157. 10.1523/JNEUROSCI.1206-17.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Moore T., & Armstrong K. M. (2003). Selective gating of visual signals by microstimulation of frontal cortex. Nature, 421(6921), 370–373. 10.1038/nature01341 [DOI] [PubMed] [Google Scholar]
  55. Moore T., & Zirnsak M. (2017). Neural mechanisms of selective visual attention. Annual Review of Psychology, 68(1), 47–72. 10.1146/annurev-psych-122414-033400 [DOI] [PubMed] [Google Scholar]
  56. O’Reilly J. X., Woolrich M. W., Behrens T. E. J., Smith S. M., & Johansen-Berg H. (2012). Tools of the trade: Psychophysiological interactions and functional connectivity. Social Cognitive and Affective Neuroscience, 7(5), 604–609. 10.1093/scan/nss055 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Pelli D. G. (1997). The VideoToolbox software for visual psychophysics: Transforming numbers into movies. Spatial Vision, 10(4), 437–442. 10.1163/156856897X00366 [DOI] [PubMed] [Google Scholar]
  58. Polimeni J. R., Renvall V., Zaretskaya N., & Fischl B. (2018). Analysis strategies for high-resolution UHF-fMRI data. NeuroImage, 168, 296–320. 10.1016/j.neuroimage.2017.04.053 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Poser B. A., Koopmans P. J., Witzel T., Wald L. L., & Barth M. (2010). Three dimensional echo-planar imaging at 7 Tesla. NeuroImage, 51(1), 261–266. 10.1016/j.neuroimage.2010.01.108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Rauss K., Pourtois G., Vuilleumier P., & Schwartz S. (2009). Attentional load modifies early activity in human primary visual cortex. Human Brain Mapping, 30(5), 1723–1733. 10.1002/hbm.20636 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Rauss K., Pourtois G., Vuilleumier P., & Schwartz S. (2012). Effects of attentional load on early visual processing depend on stimulus timing. Human Brain Mapping, 33(1), 63–74. 10.1002/hbm.21193 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Rees G., Kreiman G., & Koch C. (2002). Neural correlates of consciousness in humans. Nature Reviews Neuroscience, 3(4), 261–270. 10.1038/nrn783 [DOI] [PubMed] [Google Scholar]
  63. Rockland K. S., & Lund J. S. (1983). Intrinsic laminar lattice connections in primate visual cortex. The Journal of Comparative Neurology, 216(3), 303–318. 10.1002/cne.902160307 [DOI] [PubMed] [Google Scholar]
  64. Sharoh D., Van Mourik T., Bains L. J., Segaert K., Weber K., Hagoort P., & Norris D. G. (2019). Laminar specific fMRI reveals directed interactions in distributed networks during language processing. Proceedings of the National Academy of Sciences, 116(42), 21185–21190. 10.1073/pnas.1907858116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Shipp S. (2003). The functional logic of cortico-pulvinar connections. Philosophical Transactions of the Royal Society B: Biological Sciences, 358(1438), 1605–1624. 10.1098/rstb.2002.1213 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Shipp S. (2004). The brain circuitry of attention. Trends in Cognitive Sciences, 8(5), 223–230. 10.1016/j.tics.2004.03.004 [DOI] [PubMed] [Google Scholar]
  67. Shipp S. (2007). Structure and function of the cerebral cortex. Current Biology, 17(12), R443–R449. 10.1016/j.cub.2007.03.044 [DOI] [PubMed] [Google Scholar]
  68. Steinman R. M. (1965). Effect of Target Size, Luminance, and Color on Monocular Fixation*. Journal of the Optical Society of America, 55(9), 1158. 10.1364/JOSA.55.001158 [DOI] [Google Scholar]
  69. Thompson K. G., & Bichot N. P. (2005). A visual salience map in the primate frontal eye field. In Progress in Brain Research (Vol. 147, pp. 249–262). Elsevier. 10.1016/S0079-6123(04)47019-8 [DOI] [PubMed] [Google Scholar]
  70. Tong F. (2003). Primary visual cortex and visual awareness. Nature Reviews. Neuroscience, 4(3), 219–229. 10.1038/nrn1055 [DOI] [PubMed] [Google Scholar]
  71. Treisman A. M., & Gelade G. (1980). A feature-integration theory of attention. Cognitive Psychology, 12(1), 97–136. 10.1016/0010-0285(80)90005-5 [DOI] [PubMed] [Google Scholar]
  72. Uludag K., & Havlicek M. (2021). Determining laminar neuronal activity from BOLD fMRI using a generative model. Progress in Neurobiology, 207, 102055. 10.1016/j.pneurobio.2021.102055 [DOI] [PubMed] [Google Scholar]
  73. Van Mourik T., Koopmans P. J., Bains L. J., Norris D. G., & Jehee J. F. M. (2023). Investigation of layer-specific BOLD signal in the human visual cortex during visual attention. Aperture Neuro, 3. 10.52294/001c.87638 [DOI] [Google Scholar]
  74. Vizioli L., Huber L., & Yacoub E. (2023). Laminar and columnar imaging at UHF: Considerations for mesoscopic-scale imaging with fMRI. In Advances in Magnetic Resonance Technology and Applications (Vol. 10, pp. 387–405). Elsevier. 10.1016/B978-0-323-99898-7.00026-2 [DOI] [Google Scholar]
  75. Waehnert M. D., Dinse J., Weiss M., Streicher M. N., Waehnert P., Geyer S., Turner R., & Bazin P.-L. (2014). Anatomically motivated modeling of cortical laminae. NeuroImage, 93, 210–220. 10.1016/j.neuroimage.2013.03.078 [DOI] [PubMed] [Google Scholar]
  76. Wang J., Nasr S., Roe A. W., & Polimeni J. R. (2022). Critical factors in achieving fine-scale functional MRI: Removing sources of inadvertent spatial smoothing. Human Brain Mapping, 43(11), 3311–3331. 10.1002/hbm.25867 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Wang L., Mruczek R. E. B., Arcaro M. J., & Kastner S. (2015). Probabilistic Maps of Visual Topography in Human Cortex. Cerebral Cortex, 25(10), 3911–3931. 10.1093/cercor/bhu277 [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. White B. J., Berg D. J., Kan J. Y., Marino R. A., Itti L., & Munoz D. P. (2017). Superior colliculus neurons encode a visual saliency map during free viewing of natural dynamic video. Nature Communications, 8(1), 14263. 10.1038/ncomms14263 [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. White B. J., Kan J. Y., Levy R., Itti L., & Munoz D. P. (2017). Superior colliculus encodes visual saliency before the primary visual cortex. Proceedings of the National Academy of Sciences, 114(35), 9451–9456. 10.1073/pnas.1701003114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Yan Y., Zhaoping L., & Li W. (2018). Bottom-up saliency and top-down learning in the primary visual cortex of monkeys. Proceedings of the National Academy of Sciences of the United States of America, 115(41), 10499–10504. 10.1073/pnas.1803854115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Zhang X., Zhaoping L., Zhou T., & Fang F. (2012). Neural Activities in V1 Create a Bottom-Up Saliency Map. Neuron, 73(1), 183–192. 10.1016/j.neuron.2011.10.035 [DOI] [PubMed] [Google Scholar]
  82. Zhaoping L. (2012). Gaze capture by eye-of-origin singletons: Interdependence with awareness. Journal of Vision, 12(2), 17–17. 10.1167/12.2.17 [DOI] [PubMed] [Google Scholar]
  83. Zhaoping L., & May K. A. (2007). Psychophysical Tests of the Hypothesis of a Bottom-Up Saliency Map in Primary Visual Cortex. PLoS Computational Biology, 3(4), e62. 10.1371/journal.pcbi.0030062 [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Zhaoping L., & Zhe L. (2015). Primary Visual Cortex as a Saliency Map: A Parameter-Free Prediction and Its Test by Behavioral Data. PLOS Computational Biology, 11(10), e1004375. 10.1371/journal.pcbi.1004375 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES