Skip to main content
eLife logoLink to eLife
. 2022 Aug 11;11:e78712. doi: 10.7554/eLife.78712

Parallel processing, hierarchical transformations, and sensorimotor associations along the ‘where’ pathway

Raymond Doudlah 1,, Ting-Yu Chang 2,, Lowell W Thompson 1, Byounghoon Kim 1, Adhira Sunkara 3, Ari Rosenberg 1,
Editors: David J Freedman4, Tirin Moore5
PMCID: PMC9439678  PMID: 35950921

Abstract

Visually guided behaviors require the brain to transform ambiguous retinal images into object-level spatial representations and implement sensorimotor transformations. These processes are supported by the dorsal ‘where’ pathway. However, the specific functional contributions of areas along this pathway remain elusive due in part to methodological differences across studies. We previously showed that macaque caudal intraparietal (CIP) area neurons possess robust 3D visual representations, carry choice- and saccade-related activity, and exhibit experience-dependent sensorimotor associations (Chang et al., 2020b). Here, we used a common experimental design to reveal parallel processing, hierarchical transformations, and the formation of sensorimotor associations along the ‘where’ pathway by extending the investigation to V3A, a major feedforward input to CIP. Higher-level 3D representations and choice-related activity were more prevalent in CIP than V3A. Both areas contained saccade-related activity that predicted the direction/timing of eye movements. Intriguingly, the time course of saccade-related activity in CIP aligned with the temporally integrated V3A output. Sensorimotor associations between 3D orientation and saccade direction preferences were stronger in CIP than V3A, and moderated by choice signals in both areas. Together, the results explicate parallel representations, hierarchical transformations, and functional associations of visual and saccade-related signals at a key juncture in the ‘where’ pathway.

Research organism: Rhesus macaque

Introduction

The 3D perceptual and sensorimotor capabilities of primates facilitate their ability to shape the world. For instance, 3D spatial reasoning is a key predictor of engineering problem-solving ability (Hsi et al., 1997). These capabilities are supported by the dorsal ‘where’ pathway. In particular, high-level visual transformations are thought to occur in brain areas located at the parieto-occipital junction (Tsao et al., 2003; Chang et al., 2020b). Parietal cortex is thought to then implement sensorimotor transformations that map those sensory representations to motor responses (Pause and Freund, 1989; Rushworth et al., 1997; Buneo and Andersen, 2006). However, assigning particular functions to specific areas has been challenging due to methodological differences across studies. Here, we used a common experimental design to investigate two areas that bridge the parieto-occipital junction in macaque monkeys: intermediate visual area V3A and the caudal intraparietal (CIP) area.

Area CIP is a site of 3D visual processing (Taira et al., 2000; Tsutsui et al., 2002; Rosenberg et al., 2013; Rosenberg and Angelaki, 2014a; Rosenberg and Angelaki, 2014b), which is functionally correlated (Tsutsui et al., 2003; Elmore et al., 2019) and causally linked (Tsutsui et al., 2001; Van Dromme et al., 2016) to 3D perception. Saccade-related activity and sensorimotor associations in CIP may further support goal-directed behaviors (Chang et al., 2020b) via connections to oculomotor and prehensile areas (Lewis and Van Essen, 2000; Premereur et al., 2015; Van Dromme et al., 2016; Lanzilotto et al., 2019).

By comparison, V3A findings have been highly conflicting. Some imply relatively low-level image processing such as spatiotemporal filtering (Gaska et al., 1987; Gaska et al., 1988), basic stereoscopic depth selectivity (Anzai et al., 2011), and 2D direction selectivity (Nakhla et al., 2021). Other findings link V3A to high-level processes underlying stable, allocentric representations of the world. This includes combining visual and extraretinal signals to represent objects in non-retinal coordinates (Galletti and Battaglini, 1989; Galletti et al., 1990; Sauvan and Peterhans, 1999; Nakamura and Colby, 2002), distinguishing veridical object motion from self-induced retinal image motion (Galletti et al., 1990), and 3D spatial processing (Tsao et al., 2003; Elmore et al., 2019). Furthermore, V3A activity is modulated by attention and memory-related factors, and some neurons show postsaccadic activity (Nakamura and Colby, 2000).

To directly compare the functional properties of these interconnected areas, we used a common experimental design to assess (i) selectivity for the 3D pose (orientation and position) of planar surfaces, (ii) choice-related activity during a 3D orientation discrimination task (Chang et al., 2020a), (iii) saccade-related activity during a visually guided saccade task (Munoz and Wurtz, 1995; Hanes and Schall, 1996), and (iv) sensorimotor associations (Chang et al., 2020b). Multiple lines of evidence converged to support a V3A-to-CIP hierarchy. First, our findings revealed that robust 3D pose representations were most prominent in CIP. Second, choice-related activity was associated with robust 3D pose tuning in both areas but most prevalent in CIP. Third, the areas contained similar proportions of neurons with saccade-related activity that predicted the direction and timing of eye movements. Saccade-related activity started earlier in V3A than CIP and the CIP time course closely matched the temporally integrated V3A output, suggesting that saccade-related signals in CIP may originate in V3A. Notably, both areas showed sensorimotor associations (which were stronger in CIP than V3A) that were statistically moderated by choice-related activity. Together, these findings challenge classical notions of sensorimotor dichotomies, argue for a reclassification of V3A as association cortex, and implicate choice-related activity as a novel factor in sensorimotor processing.

Results

To investigate the contributions of areas V3A and CIP to the transformation of retinal images into object-level representations and goal-directed sensorimotor processing, we compared the 3D selectivity, saccade-related properties, and sensorimotor associations of 692 V3A neurons (Monkey L: N = 311; Monkey F: N = 263; Monkey W: N = 118) and 437 previously analyzed CIP neurons (Monkey L: N = 218; Monkey F: N = 219) (Chang et al., 2020b). Areas V3A and CIP were dissociated from each other and adjacent regions using multiple anatomical and functional criteria (Figure 1; ‘Materials and methods’). Supporting a V3A-to-CIP hierarchy, the median visual response latency was shorter in V3A (46 ms) than CIP (52 ms) and the receptive fields were smaller in V3A than CIP (Figure 1—figure supplement 1).

Figure 1. Neuronal recordings.

(A) Lateral (left) and dorsal (right) views of the inflated cortical surface of Monkey L (left hemisphere). Dashed lines mark the coronal sections in (B). (B) Coronal sections (left: AP = –7 mm; right: AP = –9.5 mm) with MRI-based estimates of the boundaries of V3A, CIP, and adjacent areas. Recording locations for V3A (blue-gray circles) and CIP (red squares) were projected along the AP axis onto the closest of the two coronal sections shown. A schematic of a four-tetrode laminar probe with spike waveforms from the V3A recording marked with white circles in the right coronal section is shown (middle). CIP, caudal intraparietal area (light blue); V3A, visual area V3A (orange); PIP, posterior intraparietal area; V3, visual area V3; PO, parieto-occipital area; LIPd, dorsal aspect of the lateral intraparietal area; 7a, area 7a.

Figure 1.

Figure 1—figure supplement 1. Response latencies and receptive field sizes.

Figure 1—figure supplement 1.

(A) Cumulative density functions for the visual latencies of visual area V3A (V3A) (orange) and caudal intraparietal (CIP) area (blue) neurons. Colored circles mark the median latencies. (B) Receptive field (RF) size versus eccentricity for V3A (orange circles) and CIP (blue squares). RF size was defined as the square root of the RF area. Type II regression lines are shown for V3A (solid line; RF size = 2.6 + 0.55 × eccentricity) and CIP (dashed line; RF size = 5.4 + 0.81 × eccentricity).

Behavioral discrimination of 3D surface orientation

To investigate the transformation of visual representations into goal-directed behaviors, we trained three monkeys to report the 3D orientation of a planar surface (Chang et al., 2020a). Specifically, they performed an eight-alternative forced choice (8AFC) tilt discrimination task with planar surfaces presented at different orientations and distances (Figure 2). The orientation was defined by two angular variables (Stevens, 1983; Rosenberg et al., 2013): tilt and slant. Tilt describes which side of the plane was nearest to the monkey and slant describes the rotation in depth (Figure 2A). Planes were presented for 1 s while fixation was maintained on a target at the center of the screen. The monkey then reported the plane’s tilt (the near side) via a saccade to the corresponding choice target, regardless of the slant or distance (Figure 2B).

Figure 2. Stimuli, task, and behavioral performance.

Figure 2.

(A) Planar surfaces were defined using random dots with perspective and stereoscopic cues (illustrated here as red-green anaglyphs). For clarity, the size and number of dots differ from the actual stimuli. (B) Eight-alternative tilt discrimination task. A trial began by fixating a dot at the center of the screen (fixation was always at screen distance, 57 cm) for 300 ms (left). A plane was then presented with a given tilt (0–315°, 45° steps), slant (0–60°, 15° steps), and distance (37, 57, 97, or 137 cm) for 1 s (middle). The fixation target and plane then disappeared and eight choice targets corresponding to the eight tilts appeared (right). This cued the monkey to saccade to one of the targets to report which side of the plane was nearest. (C) Behavioral performance. Error distributions of reported tilts for each slant at 97 cm for Monkey W (left). Data points show the mean probability of a given ΔTilt (reported tilt – presented tilt), and error bars show standard error of the mean (SEM) across sessions (N = 14). Solid curves are von Mises probability density functions with sensitivities (κ) indicated. The black dashed line marks chance level. The heat map (right) shows the mean tilt sensitivity for each slant (rows) and distance (columns) for Monkey W across sessions. Yellow hues indicate higher sensitivities. Green arrow and purple rectangle mark the data shown in the error distribution plots (left).

Behavioral performance was quantified each session (V3A: Monkey L: N = 39; Monkey F: N = 38; Monkey W: N = 14; CIP: Monkey L: N = 26; Monkey F: N = 27) by calculating the distribution of reported tilt errors (∆Tilt = reported tilt – presented tilt) for each non-zero slant and distance (16 conditions) pooled across tilt (Chang et al., 2020a; Chang et al., 2020b). Each error distribution was then fit with a von Mises probability density function (Equation 1) and behavioral sensitivity was quantified as the concentration parameter (κ) of the fit. To assess how sensitivity depended on the viewing conditions, we ran a linear mixed-effects model with distance, slant, and area as main effects and animal as a random effect. Consistent with our previous findings, sensitivity decreased with distance from fixation and increased with slant (Figure 2C). Correspondingly, behavioral sensitivity significantly depended on distance (p=4.1 × 10-30) and slant (p=2.2 × 10-308). There was no significant effect of area (p=0.46), indicating that behavioral performance was similar during the V3A and CIP sessions. To relate this pattern of behavioral sensitivity to V3A and CIP activity, we next characterized the simultaneously recorded neuronal responses.

Hierarchical transformations in the representation of 3D orientation

The visual system is thought to turn ambiguous 2D retinal signals into behaviorally relevant 3D object representations through a series of transformations. We therefore hypothesized that CIP would contain a higher-level representation of 3D pose than V3A. For a 3D pose selective neuron, the shape of its 3D orientation tuning curve will be tolerant to distance, but its overall response amplitude (gain) should be distance-dependent (Janssen et al., 2000; Nguyenkim and DeAngelis, 2003; Alizadeh et al., 2018; Chang et al., 2020b). In contrast, a neuron selective for lower-level visual features (e.g., binocular disparity) will have 3D orientation tuning curves whose shape and gain are highly distance-dependent. To test for 3D pose tuning, we therefore assessed how 3D orientation tuning depended on distance.

The 3D orientation tuning curves of four representative V3A neurons are shown in Figure 3A–D (qualitatively similar examples from CIP are shown in Figure 3 of Chang et al., 2020b). Some neurons had similar 3D orientation tuning across all (Figure 3A) or most (Figure 3B) distances with distance-dependent gain changes, implying 3D pose tuning. Others had significant orientation tuning at a single distance (ANOVA, p<0.05; Bonferroni–Holm corrected for four distances; Figure 3C), which may reflect intermediate selectivity for gradients of absolute binocular disparity (Nguyenkim and DeAngelis, 2003). The orientation tuning of other neurons changed substantially with distance (Figure 3D), implying lower-level visual feature selectivity. These examples suggest that V3A contains a heterogeneous population of neurons whose functional properties range from processing low-level visual features to 3D object pose.

Figure 3. Comparison of 3D orientation tuning across distance.

(A–D) Four example visual area V3A (V3A) neurons from Monkey W (A, C) and Monkey F (B, D). Heat maps show firing rate plotted as a function of tilt (angular axis) and slant (radial axis). Red hues indicate higher firing rates. Black dots mark preferred 3D orientations from Bingham function fits at distances with significant tuning (ANOVA, p<0.05; Bonferroni–Holm corrected for four distances). Some dots are not located on a disc because the largest tested slant was 60° but slant ranges from 0° to 90°. The preferred slant (S) and tilt (T) are indicated for each tuned distance. (E) Proportion of neurons with significant orientation tuning at each distance for V3A (orange; proportions: 37 cm = 0.57, 57 cm = 0.82, 97 cm = 0.57, 137 cm = 0.46) and caudal intraparietal (CIP) area (blue; proportions: 37 cm = 0.70, 57 cm = 0.87, 97 cm = 0.65, 137 cm = 0.55). (F) Proportion of neurons with significant orientation tuning at each possible number of distances for V3A (proportions: #0 = 0.11, #1 = 0.17, #2 = 0.22, #3 = 0.20, #4 = 0.30) and CIP (proportions: #0 = 0.06, #1 = 0.12, #2 = 0.23, #3 = 0.18, #4 = 0.41). (G) Comparison of surface orientation discrimination index (SODI) values at each distance for V3A (orange) and CIP (blue). Data points and error bars are mean and SEM across neurons with significant orientation tuning, respectively. The asterisk indicates a significant difference between V3A and CIP SODI values at 57 cm only (ANOVA followed by Tukey’s HSD test, p<0.05).

Figure 3.

Figure 3—figure supplement 1. Distributions of 3D orientation preferences.

Figure 3—figure supplement 1.

Tilt and slant preferences in visual area V3A (V3A) (top row; 37 cm: N = 397 neurons; 57 cm: N = 570; 97 cm: N = 394; 137 cm: N = 317) and caudal intraparietal (CIP) area (bottom row; 37 cm: N = 304; 57 cm: N = 380; 97 cm: N = 283; 137 cm: N = 240) plotted using an equal area projection (Rosenberg et al., 2013). Marginal histograms show the distributions of tilt and slant. Black triangles mark mean values.
Figure 3—figure supplement 2. Cross-area comparison of 3D orientation tuning curve shape.

Figure 3—figure supplement 2.

Three Bingham function parameters set the bandwidth (λ2), isotropy (λ1), and axis about which tuning anisotropy occurred (Φ). (A) Bandwidth (λ2 ≥ 0). Larger values indicate narrower tuning. Fixed parameters in the schematic: λ1 = 0, Φ = undefined since λ1 = 0. (B) Distribution of λ2 in visual area V3A (V3A) (orange bars) and caudal intraparietal (CIP) area (blue bars). (C) Isotropy (λ1 ≤ 0). More negative values indicate greater anisotropy. Fixed parameters in the schematic: λ2 = 0.7 and Φ = 90°. (D) Distributions of λ1. (E) Axis about which tuning anisotropy occurred (0° ≤ Φ < 180°). Fixed parameters within the schematic: λ2 = 0.7 and λ1 = –1.5. (F) Distributions of Φ. Bars at 0° and 180° are identical. Distributions in (C, E, G) include all distances (V3A: N = 1,799; CIP: N = 1,276) and triangles mark median values. Asterisks in (B, D) mark significant cross-area differences (linear mixed-effects model with area and absolute distance from fixation as fixed effects and neuron as a random effect, p<0.05 for main effect of area).

In both areas, more neurons had 3D orientation tuning at 57 cm (fixation distance) than at the other distances (Figure 3E). Although the proportion of neurons with significant tuning at each distance was greater in CIP than V3A, the cross-area difference was not significant (chi-squared test, across animals: χ2 = 2.4, p=0.50; Monkey L: χ2 = 0.62, p=0.89; Monkey F: χ2 = 1.6, p=0.67). However, CIP neurons were typically tuned for 3D orientation at more distances than V3A neurons (chi-squared test, across animals: χ2 = 21.2, p=2.9 × 10–4; Monkey L: χ2 = 13.0, p=0.01; Monkey F: χ2 = 8.7, p=0.07), implying greater convergence of orientation information across distance within CIP than V3A (Figure 3F).

We next examined the 3D orientation preferences and tuning curve shapes by fitting each significant orientation tuning curve (ANOVA, p<0.05; Bonferroni–Holm corrected for four distances) with a Bingham function (Bingham, 1974). The Bingham function is a low-dimensional, parametric model over tilt and slant that describes V3A and CIP 3D orientation tuning curves (Rosenberg et al., 2013; Rosenberg and Angelaki, 2014a; Elmore et al., 2019; Chang et al., 2020b). The preferred orientation taken from these fits is marked with a black dot for the example neurons in Figure 3A–D. In both V3A and CIP, the full span of 3D orientations was represented at each distance (Figure 3—figure supplement 1), indicating that both areas can support neural codes for 3D pose. To compare the shape of the orientation tuning curves, we used the Bingham parameters describing the bandwidth (λ2), isotropy (λ1), and axis about which any anisotropy occurred (Φ). First, there was a slight but significant tendency for V3A neurons (median λ2 = 0.80) to be more narrowly tuned than CIP neurons (median λ2 = 0.65; linear mixed-effects model with area, absolute distance from fixation, and animal as fixed effects, and neuron as a random effect, p=1.5 × 10–3; Figure 3—figure supplement 2A and B). This difference may reflect convergent input from multiple V3A neurons onto individual CIP neurons. The tuning bandwidths also increased with distance from fixation (p=6.5 × 10–6), implying information loss that mirrored the behavioral finding that tilt discrimination performance decreased with distance from fixation (Chang et al., 2020a; Chang et al., 2020b; Figure 2, see Figure 5). No difference was observed across animals (p=0.77). Second, the V3A tuning curves were less isotropic (more elongated; median λ1 = –1.62) than the CIP tuning curves (median λ1 = – 0.92), and the difference was significant (p=1.3 × 10–9; Figure 3—figure supplement 2C and D). The level of anisotropy also significantly increased with distance from fixation (p=9.4 × 10–6). No difference was observed across animals (p=0.78). Lastly, the distributions of Φ peaked at approximately 90° in both V3A (median Φ = 88°) and CIP (median Φ = 89°), indicating that any anisotropy in the tuning curves generally occurred along the tilt/slant axes (Figure 3—figure supplement 2E and F). These findings indicate greater orientation tuning symmetry in CIP than V3A, which may be important for perceptual sensitivity to changes in object orientation to not depend on the axis of rotation (Chang et al., 2020b).

A previous study found that 3D orientation was better discriminated based on the responses of individual V3A than CIP neurons (Elmore et al., 2019). To follow up on that finding, we computed a surface orientation discrimination index (SODI) that quantifies the difference in responses to preferred and non-preferred orientations relative to the response variability (Equation 2). Neurons with stronger 3D orientation selectivity have SODI values closer to one, whereas those with weaker selectivity have values closer to zero. For each neuron, we calculated the SODI at each distance with significant orientation tuning. In both areas, the mean SODI had an inverted U-shape as a function of distance that peaked at 57 cm (fixation distance; Figure 3G). This indicates that 3D orientation was most discriminable at the fixation distance, which may be a downstream consequence of V1 neurons tending to prefer smaller binocular disparities (Prince et al., 2002). Consistent with the Elmore et al., 2019 study, which measured 3D orientation tuning at the fixation distance only, we found that the SODI values across animals were significantly larger in V3A than CIP at the fixation distance (ANOVA followed by Tukey’s HSD test, p=1.1 × 10–5). However, the differences were not significant at any other distance (p≥0.47). For Monkey L, the SODI values were significantly larger in V3A than CIP at 37, 57, and 97 cm (p≤8.5 × 10–3), but not 137 cm (p=0.95). For Monkey F, they were not significantly different at any distance (p≥0.52). Taken together with the Elmore et al., 2019 findings, these results are consistent with a cross-area difference that, as we consider next, may reflect a transformation from lower-level visual feature selectivity to higher-level 3D pose tuning.

Hierarchical refinement of 3D pose representations

To test for cross-area differences in lower-level visual feature selectivity versus higher-level 3D pose tuning, we assessed how the 3D orientation tuning curves depended on distance (Janssen et al., 2000; Nguyenkim and DeAngelis, 2003; Alizadeh et al., 2018). This approach recently revealed 3D pose tuning in CIP (Chang et al., 2020b) but has not been applied to V3A. It thus remains unknown if 3D pose tuning in CIP is simply inherited or reflects a qualitative transformation of feedforward input.

To quantify the distance-dependence of 3D orientation tuning curve shape, we fit each 3D pose tuning curve with a separable model (Equation 3) and computed a tolerance index (Chang et al., 2020b). Tolerance values near zero indicate that the shape of the orientation tuning curve changed substantially with distance (as expected for neurons selective for low-level visual features). Values near one indicate that the shape changed minimally with distance (implying 3D pose tuning). As shown for the example neurons, larger tolerance values were associated with 3D pose tuning (tolerance = 0.96, 0.74; Figure 3A and B, respectively), modest values with more intermediate representations (tolerance = 0.41; Figure 3C), and low values with low-level feature selectivity (tolerance = 0.17; Figure 3D). Across the V3A population, the tolerance values revealed a heterogeneous population in which neurons ranged from having low-level visual feature selectivity to high-level 3D pose tuning (Figure 4A, orange bars).

Figure 4. Robust 3D pose tuning was less prevalent in visual area V3A (V3A) than the caudal intraparietal (CIP) area.

Figure 4.

(A) Distribution of tolerance values in V3A (orange; N = 692) and CIP (blue; N = 437). Triangles mark mean tolerance values, and the asterisk indicates a statistically significant difference (two-sample t-test, p<0.05). (B) Cumulative density functions over the angular deviations between the orientation preference at each distance and the principal orientation for each neuron.

To test our hypothesis that 3D pose tuning would be more prevalent in CIP than V3A, we performed two complementary analyses. First, we evaluated if there was a cross-area difference in the extent to which the shape of the 3D orientation tuning curves depended on distance by comparing the tolerance distributions. In V3A, the mean tolerance was 0.57 ± 6.7 × 10–3 SEM across animals (Figure 4A, orange bars; N = 692). Individually, the mean tolerances in V3A were 0.56 ± 8.5 × 10–3 (Monkey L, N = 311), 0.58 ± 0.01 (Monkey F, N = 263), and 0.55 ± 0.02 (Monkey W, N = 118). In CIP, the mean tolerance was 0.66 ± 7.7 × 10–3 SEM across animals (Figure 4A, blue bars; N = 437). Individually, the mean tolerances in CIP were 0.65 ± 0.01 (Monkey L, N = 218) and 0.66 ± 0.01 (Monkey F, N = 219). As predicted, the tolerance values in CIP were significantly larger than in V3A (two-sample t-test, across animals: p=7.4 × 10–19; Monkey L: p=4.7 × 10–11; Monkey F: p=3.7 × 10–7), indicating that the shape of 3D orientation tuning curves was more similar across distance in CIP than V3A. Second, we evaluated if there was a cross-area difference in the extent to which the orientation preferences (independent of other tuning parameters such as bandwidth) depended on distance. For each neuron, we calculated the angular deviation between the preferred orientation at each distance and the principal orientation (Chang et al., 2020b; ‘Materials and methods’). Across neurons, we then computed cumulative density functions over the angular deviations (Figure 4B) and found that the deviations were significantly greater in V3A than CIP (Kolmogorov–Smirnov test, across animals: p=1.2 × 10–9; Monkey L: p=5.1 × 10–6; Monkey F: p=5.9 × 10–4). Thus, although both areas represented the full span of 3D orientations at each distance (Figure 3—figure supplement 1), the orientation preferences of individual neurons were more similar across distance in CIP than V3A. These analyses together suggest that a transformation from lower-level visual features to higher-level 3D object representations occurs between V3A and CIP.

Neuronal correlates of behavioral tilt sensitivity

We previously found that behavioral tilt sensitivity, which increases as a function of slant and has an inverted U-shape pattern as a function of distance from fixation (Chang et al., 2020a; Figure 2C and black curves in Figure 5), is correlated with neuronal tilt discriminability in CIP (Chang et al., 2020b). To test if a functional correlation between behavior and neuronal activity also exists for V3A, we calculated a tilt discrimination index (TDI; Equation 2) at each slant–distance combination for each neuron, following Chang et al., 2020b. Analogous to the SODI, the TDI quantifies the difference in responses to preferred and non-preferred tilts relative to the response variability. Notably, the mean TDI values in V3A followed the same trends as the behavioral sensitivity for slant (Figure 5A) and distance (Figure 5B). Indeed, the behavioral tilt sensitivities and TDI values were highly correlated across all 16 slant–distance combinations (Monkey L: Spearman r = 0.92, p=2.2 × 10–308; Monkey F: r = 0.98, p=2.2 × 10–308; Monkey W: r = 0.74, p=1.5 × 10–3). Thus, neuronal tilt discriminability in both V3A and CIP was functionally correlated with the 3D tilt sensitivities of the monkeys across a wide range of viewing conditions.

Figure 5. Neuronal correlates of tilt sensitivity.

Figure 5.

(A) Mean visual area V3A (V3A) tilt discrimination index (TDI) (gray) and behavioral tilt sensitivity (black) increased with slant. TDI values (and behavioral sensitivities) were averaged across neurons (or monkeys) and distances. (B) Mean V3A TDI and behavioral tilt sensitivity had an inverted U-shape relationship with distance. TDI values (behavioral sensitivities) were averaged across neurons (monkeys) and slants. Error bars are SEM.

V3A carries choice-related activity during 3D orientation discrimination

Previous studies found that roughly half of CIP neurons carried choice-related activity during 3D orientation discrimination tasks (Elmore et al., 2019; Chang et al., 2020b). In contrast, one of those studies also reported that choice-related activity was essentially non-existent in V3A, but only tested 23 neurons (Elmore et al., 2019). Given that choice-related activity is preferentially carried by CIP neurons with robust 3D pose tuning (Chang et al., 2020b), the dearth of V3A choice-related activity in the Elmore study could have occurred if the small sample mostly included neurons with low-level feature selectivity. Because that study was not designed to distinguish between low-level visual feature selectivity and 3D pose tuning, we wanted to reassess if V3A carries choice-related activity.

To dissociate choice-related and orientation-selective activity, we analyzed responses to frontoparallel planes (S = 0° and tilt undefined, making them task ambiguous) only. To remove any distance-related response differences, we z-scored the responses at each distance and then pooled across distance. The responses were then grouped according to the monkey’s reported tilt. We first computed eight population-level time courses aligned to the tilt choice that elicited the maximum response for each neuron (Figure 6A). Following an initial transient response, the eight time courses began to separate and showed parametric tuning with amplitudes that symmetrically fell off from the preferred choice (note the similarity of the ±45°, ±90°, and ±135° time courses), thus revealing choice-related activity in V3A. The onset of choice-related activity was defined as the first time point that the eight time courses significantly diverged (191 ms; ANOVA, p<0.05). In contrast, the onset of choice-related activity in CIP was 202 ms (see Figure 5A in Chang et al., 2020b). The finding that choice-related activity appeared first in V3A may reflect that choice signals in CIP contain a large bottom-up contribution from V3A.

Figure 6. Choice tuning in visual area V3A (V3A) and its relationship to tilt tuning.

Figure 6.

(A) Population time courses for each choice option relative to the preferred choice. Curves show z-scored responses averaged over neurons. Stimulus onset = 0 ms. Vertical dashed lines mark the median visual response latency (46 ms) and the onset of choice-related activity (191 ms). (B) Comparison of noise and signal correlations in V3A (orange; N = 404 pairs) and the caudal intraparietal (CIP) area (blue; N = 244 pairs). Solid lines show type II linear regression fits. Marginal histograms show the distributions of noise (right) and signal (top) correlations. Triangles mark mean values. Inset shows proportion of neurons with choice-related activity in V3A (25%) and CIP (46%). (C–F) Choice tuning curves (left axis, purple) and tilt tuning curves marginalized over slant and distance (right axis, black) for the four example V3A neurons from Figure 3. Data points show mean firing rate, and error bars are SEM. Solid purple curves are von Mises fits for neurons with significant choice tuning (ANOVA, p<0.05). Black lines are linear interpolations. (G) Comparison of preferred surface tilt and choice preferences in V3A (left, N = 168) and CIP (right, N = 194). The peaks near 0° indicate that the preferences generally aligned. Bars at ±180° are identical.

Across the populations, 172 (25%) V3A and 201 (46%) CIP neurons carried significant choice-related activity (ANOVA, p<0.05; Figure 6B, inset bar plot). The proportion of neurons with choice-related activity varied across monkeys but was always more prevalent in CIP (Monkey L: N = 72, 33%; Monkey F: N = 129, 59%) than V3A (Monkey L: N = 23, 7%; Monkey F: N = 127, 48%; Monkey W: N = 22, 19%). To test if the strength of choice-related activity differed between the areas, we computed a choice discrimination index (CDI; Equation 2) for each neuron with significant choice tuning (Appendix 1 shows that this index is unaffected by z-scoring, which was used in calculating the choice tuning curves). The CDI values were highly similar in V3A (mean CDI = 0.38 ± 5.6 × 10–3 SEM, N = 172) and CIP (mean CDI = 0.38 ± 5.7 × 10–3 SEM, N = 201) and not significantly different (Wilcoxon rank-sum test, p=0.20). Thus, although there was a cross-area difference in the prevalence of choice-related activity, it was nevertheless present in V3A and similar in strength to CIP.

One potential reason that choice-related activity was more prevalent in CIP than V3A is a difference in the structure of correlated variability between the areas (Nienborg and Cumming, 2006). To test this, we computed noise and signal correlations using the 3D pose data for pairs of neurons simultaneously recorded on the same tetrode (‘Materials and methods’). The noise correlations were almost identical in V3A (across animals: mean r = 0.17 ± 0.01 SEM, N = 404 pairs; Monkey L: 0.15 ± 0.01, N = 206; Monkey F: 0.15 ± 0.03, N = 121; Monkey W: 0.25 ± 0.03, N = 77) and CIP (across animals: 0.17 ± 0.01, N = 244 pairs; Monkey L: 0.14 ± 0.02, N = 107; Monkey F: 0.19 ± 0.02, N = 137), and not significantly different (Wilcoxon rank-sum test, across animals: p=0.62; Monkey L: p=0.58; Monkey F: p=0.49; Figure 6B, right marginal histogram). However, noise correlations alone are not sufficient to account for differences in choice-related activity. Instead, the relationship between noise and signal correlations matters (Liu et al., 2013; Cumming and Nienborg, 2016). We therefore quantified the signal correlations for the same pairs of neurons and found that they were also similar in V3A (across animals: mean r = 0.40 ± 0.02 SEM; Monkey L: 0.32 ± 0.03; Monkey F: 0.41 ± 0.04; Monkey W: 0.61 ± 0.04) and CIP (across animals: 0.36 ± 0.03; Monkey L: 0.26 ± 0.04; Monkey F: 0.43 ± 0.03), and not significantly different (across animals: p=0.15; Monkey L: p=0.32; Monkey F: p=0.86; Figure 6B, top marginal histogram). We then compared the relationship between noise and signal correlations (Figure 6B). As expected, higher noise correlations were associated with higher signal correlations (Shadlen et al., 1996; Cohen and Maunsell, 2009; Gu et al., 2011). Importantly, the linear relationship between noise and signal correlation was not significantly different between the areas (ANCOVA, across animals: p=0.37; Monkey L: p=0.06; Monkey F: p=0.30). Furthermore, between pairs of neurons within each area in which (i) both had choice-related activity, (ii) neither had choice-related activity, or (iii) one had but the other did not have choice-related activity, we found no significant differences in the magnitudes of noise (Kruskal–Wallis test, both p≥0.15) and signal (both p≥0.19) correlations or their linear relationship (ANCOVA, both p≥0.23). These results suggest that differences in the structure of correlated variability either within or across areas did not account for the presence of choice-related activity. As such, the cross-area difference in prevalence of choice-related activity was consistent with a stronger functional correlation between neuronal activity and 3D perceptual decisions in CIP than V3A.

Choice tuning curves are shown for the four example V3A neurons from Figure 3 in Figure 6C–F. The two neurons with higher tolerances (Figure 3A and B) both carried choice-related activity (Figure 6C and D, purple curves). Notably, the tilt tuning curves (marginalized over slant and distance; Figure 6C and D, black curves) were well aligned with the choice tuning curves. In contrast, the neurons with intermediate and low tolerances (Figure 3C and D) did not carry choice-related activity (Figure 6E and F, purple points). To quantify the relationship between tilt and choice preferences for neurons with significant orientation and choice tuning, we took the circular difference between each neuron’s preferred tilt (from the principal orientation) and preferred tilt choice (from the von Mises fit). In both areas, the median circular difference between the preferences (V3A: 1.57°, N = 168; CIP: –0.75°, N = 194) was not significantly different from 0° (circular median test, both p≥0.49), indicating that the tilt and choice preferences tended to align (Figure 6G). To assess if the relationship between tilt and choice preferences differed between the areas, we compared the widths of the two distributions and found that they had similar circular variance (CV) values (V3A: CV = 0.34; CIP: CV = 0.33), which were not significantly different (two-sample concentration difference test, p=0.88) (Fisher, 1995). Thus, the strength of the association between tilt and choice preferences was similar in V3A and CIP.

Choice-related activity was associated with robust 3D tuning

We previously found that CIP neurons with more robust 3D pose tuning (higher tolerance values) preferentially carried choice-related activity (Chang et al., 2020b). To assess this relationship in V3A, we compared the tolerance values of V3A neurons with and without choice-related activity (Figure 7A). Indeed, V3A neurons with choice-related activity had a mean tolerance of 0.65 ± 1.3 × 10–2 SEM (N = 172), whereas those without choice-related activity had a mean tolerance of 0.54 ± 7.2 × 10–3 SEM (N = 520), and the difference was significant (ANOVA followed by Tukey’s HSD test, p=3.8 × 10–9). Thus, choice-related activity was preferentially carried by V3A neurons with more robust 3D pose tuning, as in CIP.

Figure 7. Robust 3D pose tuning was associated with choice-related activity.

Figure 7.

(A) Distribution of tolerance values for visual area V3A (V3A) neurons with (orange bars, N = 172) and without (gray bars, N = 520) choice-related activity. Triangles mark mean tolerances. (B) Cross-area comparison of tolerance values for neurons with (colored bars) and without (gray bars) choice-related activity. The V3A data in (A) is reproduced in (B) for comparison with the caudal intraparietal (CIP) area. Bar height indicates mean tolerance, and error bars are SEM. Horizontal lines and asterisks indicate significant differences in (A, B) (ANOVA followed by Tukey’s HSD test, p<0.05). (C) Cumulative density functions over the angular deviations between the orientation preference at each distance and the principal orientation for neurons with (solid lines) and without (dashed lines) choice-related activity.

Because more robust 3D pose tuning was associated with choice-related activity, we were concerned that the difference in 3D selectivity between V3A and CIP (Figure 4) simply reflected a cross-area difference in the prevalence of choice signals. However, this was not the case. First, tolerance values were greater in CIP than V3A both for neurons with (ANOVA followed by Tukey’s HSD test, p=1.7 × 10–7) and without (p=1.6 × 10–4) choice-related activity (Figure 7B). Second, comparisons of the cumulative density functions over the angular deviations between orientation preferences at each distance and the principal orientation (Figure 7C) revealed significantly smaller deviations for neurons with than without choice-related activity in both areas (Kolmogorov–Smirnov test, p≤1.1 × 10–5). The deviations were also significantly smaller in CIP than V3A both for neurons with (p=1.3 × 10–4) and without (p=8.8 × 10–4) choice-related activity. Thus, 3D pose tuning was more robust in CIP than V3A, regardless of whether the neurons carried choice-related activity.

V3A carries saccade-related activity

Previous studies reported that V3A contains extraretinal signals associated with creating stable, allocentric representations of the world (Galletti and Battaglini, 1989; Galletti et al., 1990; Sauvan and Peterhans, 1999; Nakamura and Colby, 2002), as well as postsaccadic activity (Nakamura and Colby, 2000). Recently, we discovered saccade-related activity in CIP that predicted the direction and timing of eye movements (Chang et al., 2020b). We therefore hypothesized that V3A may possess similar saccade-related activity. To test this possibility, we trained the monkeys to perform a visually guided (pop-up) saccade task (Munoz and Wurtz, 1995; Hanes and Schall, 1996; Figure 8A).

Figure 8. Saccade-related activity in visual area V3A (V3A).

(A) Visually guided (pop-up) saccade task. A target was fixated for 1.3 s (matching the tilt discrimination task duration; left) after which it disappeared and a single saccade target appeared at one of eight locations (matching the choice targets in the tilt discrimination task; right). A saccade was then made to the target. (B) Population time courses for each saccade direction relative to the preferred direction. Curves show responses averaged over neurons. Saccade initiation = 0 ms. Vertical dashed line marks the start of saccade-related activity (–108 ms). (C) Histogram of saccade latencies divided into quartiles (Q). Triangles mark mean values (black for the full distribution). (D) Time courses of saccade-related activity conditioned on the saccade latency quartile. Colored circles mark the start of V3A activity for each quartile (ANOVA, p<0.05). Open black circle marks the point at which the V3A curves approximately coalesced. (E) Inverse linear relationship between the growth rate (GR) of saccade-related activity and mean saccade latency (SL) for each quartile. (F) The temporally integrated V3A time courses for each quartile (dashed curves) were well-aligned to the observed caudal intraparietal (CIP) area time courses (solid curves). Circles mark the start of CIP activity for each quartile and the point at which the curves approximately coalesced.

Figure 8.

Figure 8—figure supplement 1. Comparison of saccade-related activity and responses to visual flashes without eye movements.

Figure 8—figure supplement 1.

(A) Screen schematic. Filled red and black circles show the fixation and choice/saccade targets, respectively. Dashed black circle shows the envelope of the planar stimuli. Squares show the tessellation for receptive field (RF) mapping in caudal intraparietal (CIP) area. Filled green squares are locations included in the analysis shown in (B). In visual area V3A (V3A), the mapped area was matched to the RFs (not shown). All elements are drawn to scale. (B) The saccade-related activity and responses to visual flashes had distinct time courses. The curves are population-averaged responses to the preferred saccade directions (solid curves) and visual flashes at nearby RF mapping tiles (bright flashes only; dashed curves). For V3A, 390/415 neurons with saccade-related activity were included. The others were removed because the preferred saccade target was outside of where the visual RF was mapped. For CIP, all 274 neurons with saccade-related activity were included. Visual flashes were 150 ms in duration. Saccade targets were present until the eyes entered the saccade target window (triangles mark median saccade onset times). Despite the visual flashes being of higher contrast (Weber contrast = 2.2) and larger (1.9° × 1.9° in V3A, 3.9° × 3.9° in CIP) than the saccade targets (Weber contrast = 1.0; 0.7° diameter), the flash responses were weaker and more transient than the saccade-related activity. The saccade-related activity also had conspicuous inflection points at 71 ms in V3A and 78 ms in CIP (marked by open black circles; calculated by finding when the second derivative equaled zero), which speculatively reflect the addition of presaccadic activity to an otherwise visual transient.

To first determine if V3A carries saccade-related activity, we computed eight population-level time courses relative to the saccade direction that elicited the maximum response for each neuron (Figure 8). Noting that the time courses were parametrically tuned, we calculated the start of the activity by finding the first time point at which they significantly diverged (ANOVA, p<0.05). Intriguingly, saccade-related activity started in V3A (a classically defined ‘intermediate’ visual area) 108 ms prior to saccade initiation, which was earlier than in CIP (102 ms; see Figure 8B in Chang et al., 2020b). In both areas, the time course of saccade-related activity was distinct from the visually evoked response measured during receptive field mapping, consistent with presaccadic activity (Figure 8—figure supplement 1; see ‘Discussion’).

The finding of saccade-related activity in V3A prompted us to test if that activity predicted the timing of the saccades. Each trial in which a saccade was made in the preferred saccade direction was labeled with the saccade latency (the time from target appearance to saccade initiation). The distribution of latencies was then divided into quartiles (Figure 8C) and the time course of saccade-related activity computed for each quartile (Figure 8D). On trials that the saccade latency was shorter, saccade-related activity started closer to the saccade initiation. Indeed, the growth rate (linear slope) from the start of saccade-related activity (ANOVA, p<0.05) to when the four curves approximately coalesced (59 ms before saccade initiation; ‘Materials and methods’) was highly correlated and inversely related to the mean saccade latency (r = –0.98, p=1.9 × 10–2; Figure 8E). Intriguingly, these results indicate that V3A activity predicts the direction and timing of upcoming saccadic eye movements.

We further noted that the V3A time courses were visually more similar to step functions than the CIP time courses, which more closely resembled ramping activity. This informal observation was reflected in steeper growth rates for each quartile in V3A (Q1 = 1.8; Q2 = 1.1; Q3 = 0.82; Q4 = 0.44) than CIP (Q1 = 0.73; Q2 = 0.61; Q3 = 0.56; Q4 = 0.39) and led us to speculate that CIP might temporally integrate saccade-related signals from V3A. For each latency quartile, we therefore integrated the V3A time courses from the onset of V3A activity to when the CIP time courses approximately coalesced (15 ms before saccade initiation; ‘Materials and methods’; Chang et al., 2020b) and compared them to the observed CIP time courses. Notably, the integrated V3A output (Figure 8F, dashed curves) aligned well with the CIP activity (solid curves) for each quartile, as might be expected if CIP accumulates evidence from V3A in favor of particular oculomotor responses. Consistent with CIP being closer to the site of saccade initiation than V3A, the CIP time courses coalesced 15 ms before saccade initiation compared to 59 ms in V3A. These results imply that V3A is not simply a ‘visual area’ and may suggest that there are parallel visual and saccade-related V3A-to-CIP hierarchies.

We next examined the saccade direction tuning of individual V3A neurons. Saccade direction tuning curves for the four example V3A neurons are shown in Figure 9A–D (green curves) along with von Mises fits for those with significant tuning (ANOVA, p<0.05). Across animals, 415 (60%) of the V3A neurons had significant saccade direction tuning (Monkey L: 172, 55%; Monkey F: 154, 58%; Monkey W: 89, 75%), similar to that observed in CIP (across animals: 274, 63%; Monkey L: 153, 70%; Monkey F: 121, 55%) (Chang et al., 2020b). Like CIP neurons, the V3A neurons showed parametric tuning for saccade direction with responses that fell off symmetrically from the preferred direction. Correspondingly, the tuning curves were well described by von Mises functions (mean r = 0.92 ± 0.47 × 10–3 SEM, N = 415).

Figure 9. Sensorimotor associations were moderated by choice-related activity.

(A–D) Saccade direction tuning curves (left axis, green) and tilt tuning curves marginalized over slant and distance (right axis, black) for the four example visual area V3A (V3A) neurons from Figures 3 and 6. Data points are mean firing rates and error bars are SEM across trials. Solid green curves are von Mises fits for neurons with significant saccade direction tuning (ANOVA, p<0.05). Black lines are linear interpolations. (E) Differences between choice and saccade direction preferences in V3A (N = 124). (F) Differences between principal surface tilts and saccade direction preferences for neurons with (colored bars) and without (gray bars) choice-related activity in V3A (left) and caudal intraparietal (CIP) area (right). Bars at ±180° are identical in (E, F). (G) Comparison of circular variances for the distributions in (F). Horizontal lines and asterisks indicate significant differences (two-sample concentration difference test, p<0.05).

Figure 9.

Figure 9—figure supplement 1. Saccade-related activity was associated with less robust 3D tuning in visual area V3A (V3A), but not caudal intraparietal (CIP) area.

Figure 9—figure supplement 1.

Tolerance values were compared for neurons with and without saccade-related activity, condition on the presence of choice-related activity in V3A and CIP. Four neuronal subpopulations were defined: those with (1) choice- and saccade-related activity (C+S + , V3A: N = 124; CIP: N = 137), (2) choice- but not saccade-related activity (C+S-, V3A: N = 48; CIP: N = 64), (3) no choice- but saccade-related activity (C-S+, V3A: N = 291; CIP: N = 137), and (4) neither choice- nor saccade-related activity (C-S-, V3A: N = 229; CIP: N = 99). First, we compared neurons with choice-related activity (C+S + and C+S-). Tolerance values were significantly lower for C+S+ than C+S- neurons in V3A (p=6.9 × 10-3) but not CIP (p=0.23). Second, we compared neurons without choice-related activity (C-S+ and C-S-). Tolerance values were significantly lower for C-S+ than C-S- neurons in V3A (p=1.9 × 10-3) but not CIP (p=0.94). Thus, saccade-related activity was associated with less robust 3D tuning in V3A. There was no significant relationship between saccade-related activity and the robustness of 3D tuning in CIP. Horizontal lines and asterisks indicate significant differences (ANOVA followed by Tukey’s HSD test, p<0.05).

To test for cross-area differences in saccade direction tuning, we first calculated a saccade discrimination index (SDI; Equation 2) for neurons with significant tuning. Surprisingly, saccade direction was more discriminable based on V3A (across animals: mean SDI = 0.49 ± 6.2 × 10–3 SEM, N = 415; Monkey L: 0.48 ± 0.02, N = 172; Monkey F: 0.51 ± 9.3 × 10–3, N = 154; Monkey W: 0.48 ± 0.01, N = 89) than CIP (across animals: 0.41 ± 6.2 × 10–3, N = 274; Monkey L: 0.40 ± 8.7 × 10–3, N = 153; Monkey F: 0.42 ± 8.7 × 10–3, N = 121) responses, and the difference was statistically significant (Wilcoxon rank-sum test, across animals: p=4.8 × 10–17; Monkey L: p=5.1 × 10–8; Monkey F: p=2.6 × 10–11). We further compared the tuning bandwidths (κ from the von Mises fits) and found that the V3A tuning curves were narrower (across animals: mean κ = 6.3 ± 0.26 SEM; Monkey L: 6.1 ± 0.45; Monkey F: 6.6 ± 0.40; Monkey W: 6.2 ± 0.53) than the CIP tuning curves (across animals: 4.7 ± 0.29; Monkey L: 4.4 ± 0.38; Monkey F: 5.0 ± 0.46), and the difference was statistically significant (Wilcoxon rank-sum test, across animals: p=1.9 × 10–7; Monkey L: p=0.01; Monkey F: p=8.4 × 10–5). These results are consistent with convergent input from multiple V3A neurons onto individual CIP neurons and parallel the cross-area difference in 3D orientation tuning.

We additionally assessed if choice- and saccade-related activity were functionally dissociable in V3A as they are in CIP (Chang et al., 2020b). Across the V3A population, 124 neurons (18%) had both choice- and saccade-related activity. For this subpopulation, choice and saccade preferences generally aligned (Figure 9E). The median circular difference (–1.7°) between the preferences was not significantly different from 0° (circular median test, p=0.65). Although the preferences aligned, other characteristics were distinct. First, the saccade tuning curves were narrower (mean κ = 6.3 ± 0.26 SEM) than the choice tuning curves (mean κ = 4.7 ± 0.46 SEM), and the difference was significant (Wilcoxon rank-sum test, p=6.4 × 10–11). Second, some neurons carried choice- (48, 7%) or saccade-related (291, 42%) activity only, indicating that the properties were not mutually inclusive. Third, choice-related activity was associated with more robust 3D tuning, whereas saccade-related activity was associated with less robust 3D tuning (Figure 9—figure supplement 1), indicating that choice and saccade signals had opposite functional relationships with 3D selectivity in V3A. These results together suggest that the choice- and saccade-related activities were functionally distinct.

Sensorimotor associations are moderated by choice-related activity

Neurons in CIP form associations between their surface orientation and choice/saccade direction preferences such that the alignment of the tuning curves reflects whether the monkey was trained to report the near or far side of the plane (Elmore et al., 2019; Chang et al., 2020b). However, the existence of a similar sensorimotor association was not immediately evident based on the saccade direction and tilt tuning curves (marginalized over slant and distance) of the example V3A neurons (Figure 9A–D). Assuming that decision-related processing occupies an intermediate position between sensory and motor activity, it is possible that choice signals have a moderating effect on the sensorimotor association that could obscure the relationship if not taken into consideration. Indeed, we previously found that the strength of the sensorimotor association in CIP depended on the presence of choice signals (Chang et al., 2020b).

To test for sensorimotor associations in V3A and evaluate the potential moderating effect of choice signals, we therefore calculated the angular difference between the principal tilt (from the principal orientation) and saccade direction preference for neurons with and without choice-related activity (Figure 9F, left, orange and gray bars, respectively). For neurons without choice-related activity, there was no discernible association between the tilt and saccade direction preferences since the distribution was not significantly different from uniform (Rayleigh test, p=0.08, N = 270). However, for neurons with choice-related activity, the distribution was significantly different from uniform (p=1.2 × 10–18, N = 122) and the median circular difference (–1.6°) was not significantly different from 0° (circular median test, p=0.42), revealing a sensorimotor association. Correspondingly, the distribution of preference differences was significantly narrower for neurons with (CV = 0.44) than without (CV = 0.9) choice-related activity (two-sample concentration difference test, p=1.4 × 10–5; Figure 9G). Thus, a sensorimotor association was only evident for V3A neurons with choice-related activity.

When we repeated this analysis for CIP, we found a striking difference from V3A (Figure 9F, right). Specifically, sensorimotor associations were evident regardless of whether the neurons carried choice signals. The distributions of preference differences were significantly different from uniform both for neurons with (blue bars; p=3.7 × 10–29, N = 134) and without (gray bars; p=9.7 × 10–5, N = 122) choice-related activity. Likewise, the median differences were not significantly different from 0° both for neurons with (–4.2°, p=0.14) and without (–3.2°, p=0.65) choice-related activity. This reveals that choice signals were not necessary for sensorimotor associations in CIP, but the distribution was significantly narrower for neurons with (CV = 0.34) than without (CV = 0.73) choice-related activity (two-sample concentration difference test, p=2.6 × 10–5; Figure 9G), indicating that sensorimotor associations were strongest for neurons with choice-related activity.

We further assessed the cross-area difference in the overall strength of sensorimotor associations by comparing the widths of the V3A and CIP preference difference distributions (including neurons with and without choice-related activity). The distribution was significantly broader in V3A (CV = 0.77) than CIP (CV = 0.53; p=1.2 × 10–3), indicating that sensorimotor associations were strongest in CIP (Figure 9G). These results thus imply a hierarchical transformation in the strength of sensorimotor associations along the ‘where’ pathway and suggest a novel role for choice-related activity in sensorimotor processing as explored next.

The above analyses showed that neurons with choice-related activity exhibited more robust 3D pose tuning (Figure 7) and formed stronger sensorimotor associations (Figure 9G). If choice signals occupy an intermediate position between sensory and motor processing, the strength of sensorimotor associations might depend on the robustness of 3D tuning for neurons with choice-related activity but not those without choice-related activity. In other words, choice-related activity may statistically moderate (Judd et al., 2014) the relationship between the robustness of 3D pose tuning and the strength of sensorimotor associations. To test this, we ran a linear regression model where the absolute angular difference between the principal tilt and saccade direction preference depended on tolerance, choice-related activity, and their interaction. Both areas showed a significant interaction (both p≤1.6 × 10–3) such that tolerance had a negligible impact on the strength of the sensorimotor association for neurons without choice-related activity (V3A: slope = 0.16; CIP: slope = –0.15) but a strong impact for neurons with choice-related activity (V3A: slope = –2.2; CIP: slope = –2.5). Thus, this analysis revealed an intricate relationship between sensory, choice, and saccade-related activity in which choice signals moderated the relationship between the robustness of 3D tuning and sensorimotor associations.

Discussion

Transforming ambiguous 2D retinal images into relevant 3D object representations that can guide action is a fundamental function of the dorsal ‘where’ pathway. Here, we explicated the parallel processing, hierarchical transformations, and functional associations of visual, choice, and saccade-related signals at the juncture of visual and parietal cortex. Our findings challenge classical notions of sensorimotor dichotomies by revealing that V3A possesses saccade-related activity and defining properties of association cortex, and further implicate choice-related activity as a novel factor in moderating sensorimotor processing.

Parallel processing, hierarchical transformations, and sensorimotor associations

Multiple lines of evidence converged to support a V3A-to-CIP hierarchy. Focusing first on the visual properties, we found that the median visual response latency in V3A was 6 ms shorter than in CIP and that the receptive fields were smaller in V3A than CIP. At a functional level, 3D orientation and position information (which are confounded in retinal images) were more separable in CIP than V3A, implying a hierarchical resolution of sensory ambiguities that limit the ability to make 3D perceptual inferences.

Intriguingly, our findings may also reveal a V3A-to-CIP hierarchy related to oculomotor processing. While saccade-related activity that predicted the direction and timing of eye movements was prevalent in both areas, it began 6 ms earlier in V3A than CIP (in agreement with the difference in visual response latencies). The finding that the time course of saccade-related activity in CIP closely resembled the temporally integrated V3A output may further suggest that some presaccadic signals originate in a region of visual cortex whose feedforward input includes V1 and V2 (Zeki, 1980; Felleman and Van Essen, 1991), thus challenging classical notions of sensorimotor dichotomies. This finding is also consistent with CIP accumulating evidence provided by V3A in favor of particular oculomotor responses, which raises the possibility that some of the integration-like properties of LIP (Shadlen and Newsome, 1996; Roitman and Shadlen, 2002) may reflect bottom-up input (Lewis and Van Essen, 2000; Premereur et al., 2015; Van Dromme et al., 2016). In that case, inactivating CIP, which is architectonically distinct from LIP (Katsuyama et al., 2010; Niu et al., 2020), might impair saccadic responses during decision-making tasks. Consistent with this, reversible inactivation of CIP during a depth structure categorization task was found to delay saccadic responses (Van Dromme et al., 2016). While that delay may have reflected degraded visual discrimination, the current findings also point to the possibility of impaired saccade preparation.

While these experiments revealed saccade-related activity in V3A and CIP, the protocol was not designed to dissociate the potential contributions of visual and presaccadic signals. Nevertheless, several lines of evidence suggest that the observed saccade-related activity cannot be explained by visual responses alone. First, the time course of saccade-related activity was distinct from that of the visual flash response. Examination of the time courses provided some evidence that a presaccadic signal may begin in V3A 7 ms before CIP, consistent with the possibility of a V3A-to-CIP oculomotor hierarchy. Second, there was a pronounced relationship between the growth rate of the saccade-related activity and saccade latency. Third, there was an apparent integrative relationship between the V3A and CIP saccade-related activity, which does not exist for the visual responses. Fourth, the sensorimotor association between tilt and saccade direction preferences as well as the moderation by choice-related activity would not be expected for visual flash responses. The current findings therefore suggest that V3A and CIP may contain presaccadic signals, but a thorough evaluation of this possibility will require future studies.

Many V3A neurons showed functional associations between their orientation, choice, and saccade direction preferences, as in CIP (Elmore et al., 2019; Chang et al., 2020b). Finding these associations in V3A implies that sensorimotor processing traditionally linked to parietal cortex already occurs in an ‘intermediate visual area,’ suggesting that V3A may be more appropriately classified as an early association region. Intriguingly, saccade direction discrimination was greater in V3A (where saccade-related activity was associated with poorer 3D pose tuning) while sensorimotor associations were stronger in CIP (where no relationship between saccade-related activity and the robustness of 3D tuning was evident). These cross-area differences may reflect a transition from lower-level representations of saccade-related signals to higher-level sensorimotor associations and parallel the observed changes in visual feature selectivity.

Visual processing in V3A

Previous studies arrived at categorically different conclusions regarding visual processing in V3A (Gaska et al., 1987, Gaska et al., 1988; Galletti and Battaglini, 1989; Galletti et al., 1990; Sauvan and Peterhans, 1999; Nakamura and Colby, 2000; Nakamura and Colby, 2002; Tsao et al., 2003; Anzai et al., 2011; Elmore et al., 2019). Our large V3A sample revealed a heterogeneous population in which neurons ranged from representing low-level visual features to high-level object properties. Earlier discrepancies may reflect a combination of this heterogeneity, small sample sizes, and the possibility of functionally distinct modules within V3A. Another factor that may have contributed to the discrepancies is variability in how V3A was defined (Nakhla et al., 2021), highlighting the continued importance of using functional and anatomical localization methods in future investigations of this relatively understudied area.

For some V3A neurons, we found that orientation tuning curve shape (but not gain) was highly tolerant to distance, implying 3D pose tuning. Because the perspective cues in our stimuli were independent of distance, the gain changes must have been driven by stereoscopic cues. This implies that these neurons were selective for gradients of relative disparity. Other V3A neurons showed orientation tuning at a single distance. These neurons may have been selective for absolute disparity gradients, similar to some middle temporal (MT) area neurons (Nguyenkim and DeAngelis, 2003). Such neurons may reflect an intermediate stage of visual processing whose outputs are combined to create 3D pose representations. Selectivity for absolute disparity gradients may further account for suppressive effects previously reported within the classical receptive fields of V3A neurons (Gaska et al., 1987) since the stimuli were presented at screen distance only and therefore would stimulate portions of the receptive field with non-preferred disparities. Lastly, the orientation tuning curve shape of some V3A neurons was highly distance-dependent, which may reflect tuning for a single absolute disparity. Future studies that perform detailed receptive field submapping of disparity selectivity will be important to explicate the heterogeneity of V3A as well as the transformations by which the visual system achieves 3D pose tuning. One possibility is that absolute disparity representations (in V1) are used to construct absolute disparity gradient detectors (in V3A, MT) and then pose selectivity (in V3A, PIP, CIP).

Origins and functional implications of choice signals

Contemporary interpretations of choice signals include a mix of feedforward contributions to decision processes, the structure of correlated variability, attention, cognitive and behavioral factors, as well as feedback (Celebrini and Newsome, 1994; Britten et al., 1996; Dodd et al., 2001; Haefner et al., 2013; Gu et al., 2014; Ruff and Cohen, 2014; Smolyanskaya et al., 2015; Cumming and Nienborg, 2016). Despite these complexities, the current findings are consistent with a feedforward cascade and build-up of choice signals, which may reflect greater contributions of CIP than V3A to 3D perceptual decisions. First, choice-related activity appeared 11 ms earlier in V3A than CIP. Second, it was about twice as prevalent in CIP as V3A. Notably, the structure of correlated variability could not account for the prevalence of choice-related activity either within or across areas. Third, choice-related activity was preferentially carried by neurons with 3D orientation tuning that was more tolerant to distance. This is consistent with reports that neurons that carry choice signals tend to have resolved sensory ambiguities about the information being discriminated, allowing them to more directly contribute to decisions (Liu et al., 2013; Chang et al., 2020b). Although choice signals may be confounded with feature-based attentional modulation (Cohen and Newsome, 2008), this is unlikely to explain our findings. Attention is largely associated with changes in response magnitude (Reynolds and Heeger, 2009). However, we found more intricate relationships in which choice-related activity (i) was associated with changes in tuning curve shape that made 3D orientation tuning more tolerant to distance (Chang et al., 2020b) and (ii) statistically moderated the strength of sensorimotor associations such that visual and saccade direction preferences were best aligned for neurons that carried choice signals. Indeed, the strength of sensorimotor associations strongly depended on the robustness of 3D pose tuning for neurons with (but not without) choice-related activity. These findings thus reveal a multifaceted landscape of functional associations between sensory, choice-, and motor-related activity. They further imply a novel role for choice signals in sensorimotor processing that might reflect the temporal cascade of sensory processing, response selection, and motor action.

Materials and methods

Animal preparation

All procedures followed the National Institutes of Health’s Guide for the Care and Use of Laboratory Animals and were approved by the Institutional Animal Care and Use Committee at the University of Wisconsin–Madison (Protocol G005229). Three male rhesus monkeys (Macaca mulatta; Monkey L: 6 years of age; Monkey F: 5 years; Monkey W: 5 years) were surgically implanted with a Delrin ring for head restraint and a removable recording grid for guiding electrodes. After recovery, they were trained to sit in a primate chair with head restraint and to fixate visual targets within 2° version and 1° vergence windows for liquid rewards.

Experimental control and stimulus presentation

Experimental control was performed using the REC-GUI software (RRID:SCR_019008) (Kim et al., 2019). Stimuli were rendered using Psychtoolbox 3 (MATLAB R2016b; NVIDIA GeForce GTX 970) and rear-projected onto a polarization preserving screen (Stewart Film Screen, Inc) using a DLP LED projector (PROPixx; VPixx Technologies, Inc) with 1280 × 720 pixel resolution (70° × 43° of visual angle) at 240 Hz. A circular polarizer was used to sequence the presentation of stereoscopic ‘half-images’ to each eye (120 Hz/eye). Polarized glasses were worn. A phototransistor circuit was used to confirm the synchronization of the left and right eye images as well as align neuronal responses to the stimulus onset. Eye tracking was performed optically at 1 kHz (EyeLink 1000 plus, SR Research). The monkeys sat 57 cm from the screen.

Visual stimuli

The stimuli were previously described in detail (Chang et al., 2020a; Chang et al., 2020b). Briefly, planar surfaces subtending 20° of visual angle were presented at the center of the screen. They were defined by 250 nonoverlapping dots that were uniformly distributed across the plane and rendered with stereoscopic and perspective cues. Surface orientation was described using tilt and slant (Stevens, 1983; Rosenberg et al., 2013). All combinations of eight tilts (0–315°, 45° steps) and four slants (15–60°, 15° steps) plus the frontoparallel plane (tilt undefined, slant = 0°) were presented (N = 33). All orientations were presented at four distances (37, 57, 97, and 137 cm; N = 132 unique poses). The dots were scaled with distance such that their screen size only depended on slant. At a slant of 0°, each dot subtended 0.35°.

The fixation point subtended 0.3° and was always at 57 cm (screen distance). Keeping its distance constant while varying the plane’s distance was a key design feature (Nguyenkim and DeAngelis, 2003; Hegdé and Van Essen, 2005; Ban and Welchman, 2015; Alizadeh et al., 2018; Henderson et al., 2019) that conferred two advantages over yoking the distance of the stimulus and fixation point (Banks et al., 2001; Hillis et al., 2004). First, it ensured that the monkeys could not rely on local absolute disparity cues to judge 3D orientation (Elmore et al., 2019; Chang et al., 2020a; Chang et al., 2020b). Second, it allowed us to dissociate the effects of stimulus distance and vergence signals (which would have otherwise been confounded) on the neuronal responses, which is important because V3A and CIP carry extraretinal signals.

Experimental protocol

Tilt discrimination task

The monkeys performed an 8AFC tilt discrimination task (Chang et al., 2020a; Chang et al., 2020b). Each trial began by fixating a circular target at the center of the screen for 300 ms. A planar surface then appeared for 1 s. The fixation target and plane then disappeared and eight choice targets corresponding to the eight possible tilts appeared at polar angles of 0–315° in 45° steps (11° eccentricity). The nearest side of the plane was indicated by making a saccade to the corresponding target (e.g., the right target for a right-near plane) in exchange for a liquid reward for correct responses. Because frontoparallel planes were task ambiguous (tilt is undefined at slant = 0°), responses to those stimuli were rewarded with equal probability (12.5%).

Visually guided saccade task

The monkeys also performed a visually guided (pop-up) saccade task (Chang et al., 2020b). The timing was matched to the tilt discrimination task. Each trial began by fixating a target (identical to the target in the tilt discrimination task) at the center of the screen for 1.3 s. The fixation target then disappeared and a single saccade target appeared at one of the eight choice target locations. A saccade to the target was made for a liquid reward.

The two tasks were interleaved within a block design. For all CIP and 422 V3A neurons, each block included one completed trial for each of the following: (i) tilt discrimination task: (8 tilts × 4 non-zero slants + 8 frontoparallel planes) × 4 distances (160 trials) and (ii) saccade task: 8 directions × 4 repeats (32 trials). For 270 V3A neurons, each saccade direction was repeated once per block. Trials were aborted and the data discarded if fixation was broken before the choice/saccade target(s) appeared, or if a response was not provided within 500 ms. A minimum of five blocks was required to include a neuron for analysis.

Behavioral data analysis

Tilt discrimination performance was quantified by calculating the distribution of reported tilt errors (ΔTilt = reported tilt – presented tilt) and fitting a von Mises probability density function:

VM(ΔTilt)=eκcos(ΔTiltμ)2πI0(κ). (1)

The mean (µ) and concentration (κ) parameters describe the accuracy and sensitivity, respectively (Seilheimer et al., 2014; Dakin and Rosenberg, 2018). Values of µ closer to 0° indicate greater accuracy and larger κ indicate greater sensitivity. An upper bound of κ = 18 was set in the estimation routine (Chang et al., 2020a; Chang et al., 2020b). A modified Bessel function of order 0, I0κ, normalizes the function to have unit area.

Neuronal recordings

To target the areas, magnetic resonance imaging (MRI) scans were collected on a 3-Tesla GE MR750 scanner before and after implanting the head restraint ring. To estimate the penetration trajectories, the CARET software was used to register the structural scans to the F99 atlas (Van Essen et al., 2001) and align the recording grid (Rosenberg et al., 2013; Rosenberg and Angelaki, 2014a; Rosenberg and Angelaki, 2014b; Chang et al., 2020b). During penetrations, observed gray/white matter and sulcal transitions were referenced to the MRI. Area CIP is located in the caudal portion of the lateral bank of the intraparietal sulcus and ventral to LIP. Area V3A is located adjacent to and ventral-laterally to CIP. There is a swath of white matter dorsal to V3A and lateral to CIP.

The order of the recording sessions went as follows. First, 41 V3A sessions were performed (Monkey L: N = 21; Monkey F: N = 20). Then, all 53 CIP sessions were performed (Monkey L: N = 26; Monkey F: N = 27). Finally, 50 more V3A sessions were performed (Monkey L: N = 18; Monkey F: N = 18; Monkey W: N = 14). The recordings were performed using linear array probes with either four or eight tetrodes (NeuroNexus, Inc). The tetrodes were separated by 300 μm and electrodes within a tetrode were separated by 25 μm. Neuronal signals (sampled at 30 kHz) along with eye position and phototransistor signals (each sampled at 1 kHz) were stored using a Scout Processor (Ripple, Inc). Tetrode-based spike sorting was performed offline using the KlustaKwik (K. Harris) semi-automatic clustering algorithm in MClust (MClust-4.0, A.D. Redish et al.) followed by manual refinement using Offline Sorter (Plexon, Inc). Only well-isolated single neurons verified by at least two authors were included.

Receptive field mapping and analysis

Initial receptive field (RF) estimates were made by hand-mapping with patches of random dots, sinusoidal gratings, and/or orientated bars. During hand-mapping, fixation was maintained on a target (0.3°) at the center of the screen and a liquid reward was provided every 2–3 s of continuous fixation. Breaks in fixation resulted in the disappearance of the fixation point and stimulus for 1 s. An automated stimulus in which bright and dark squares were flashed one square at a time was then presented on a gray background. For V3A, 2° × 2° squares tiled a region centered on and larger than the hand estimate. For CIP, the squares were 4° × 4° and tiled the entire screen. A monkey was required to first fixate a target (0.3°) at the center of the screen for 300 ms. Then, while maintaining fixation, the squares were flashed (150 ms duration) in an alternating sequence at pseudorandom locations. A liquid reward was provided after every 20 flashes. If fixation was broken, the monkey was required to again fixate the target for 300 ms before the sequence resumed. A single repetition was completed when both bright and dark squares had been flashed at every location. At least five repetitions were collected each session.

The RF boundary was estimated offline by calculating the firing rate at each tiled location, averaging over the bright and dark square responses. The responses at each location were then compared to baseline, calculated using the last 150 ms of the fixation periods preceding the stimulus sequences, to identify significant responses (ANOVA, p<0.05). The RF map was then manually smoothed and an envelope contour drawn. This procedure produced RF estimates for 355 (51%) V3A and 112 (26%) CIP neurons. The cross-area difference in the proportion of neurons for which a RF could be estimated may reflect that the mapping stimulus was likely more appropriate for lower- than higher-level visual areas.

Neuronal data analyses

Visual response latency

For each neuron, spike trains were aligned to the stimulus onset using the phototransistor signal. Each spike train (1 ms bins) was convolved with a double exponential function and then averaged across trials to create spike density functions (SDFs) (Chang et al., 2020b). A neuron’s visual latency was defined as the first time point after the stimulus onset where the SDF significantly deviated (ANOVA followed by Tukey’s HSD test, p<0.05) from the baseline activity (calculated using the last 150 ms of the fixation periods preceding the stimuli) for at least 30 ms. Firing rates were calculated from the area’s median visual latency to the end of the stimulus presentation.

Discrimination indices (DIs)

We calculated DIs to quantify how well preferred and non-preferred conditions could be discriminated from single-neuron responses (Prince et al., 2002):

DI=RmaxRminRmaxRmin+2SSENM (2)

where Rmax and Rmin are the maximum and minimum mean responses across the tuning curve, SSE is the sum squared error around the mean responses for each condition, N is the total number of trials, and M is the number of conditions. For neurons with large response modulation and low response variability, responses to preferred and non-preferred stimuli will be highly discriminable, and the DI correspondingly closer to one. Otherwise, the DI will be closer to zero. Four DIs were computed following Equation 2. The SODI was used to quantify how well the 3D orientation (over all slant–tilts) could be discriminated at each distance (M = 33). The TDI was used to quantify how well the tilt could be discriminated at given slant–distance combinations (M = 8). The CDI was used to quantify how well the choice could be discriminated (M = 8). SDI was used to quantify how well the saccade direction could be discriminated (M = 8).

Quantifying the dependency of orientation tuning on distance

The tolerance of the 3D orientation tuning curves to distance was quantified as previously described (Chang et al., 2020b). Briefly, we fit the 3D pose (orientation × distance) tuning curve with a multiplicatively separable model:

R(θ,D)=DC+gH(θ)F(D) (3)

where R is the response to orientation θ (tilt and slant) and distance D, DC is an offset, g sets the response amplitude, H(θ) is the orientation tuning curve, and F(D) is the distance tuning curve. A tolerance index quantifying the dependence of the 3D orientation tuning curve shape on distance was calculated as the average correlation between the observed tuning curve and fit at each distance. We additionally tested an additively separable model R(θ,D)=DC+H(θ)+F(D), but did not present the results because the multiplicative model better described the responses of 690/692 V3A and 437/437 CIP neurons.

We also quantified the extent to which orientation preferences differed across distance. For each neuron, the preferred orientation was estimated at each distance with significant orientation selectivity (ANOVA, p<0.05; Bonferroni–Holm corrected for four distances) by fitting a Bingham function (Rosenberg et al., 2013). The principal orientation, the axis about which the preferences clustered, was determined by arranging the surface normal vectors corresponding to the preferences into a matrix and calculating the eigenvectors. The principal orientation was defined by the eigenvector with the largest eigenvalue (Chang et al., 2020b).

Choice-related activity

Choice-related activity was calculated using frontoparallel plane trials only. In every session, each monkey chose every choice target, and the choice distributions were broad (Monkey L: mean circular variance = 0.79 ± 0.09 standard deviation, N = 65 sessions; Monkey F: 0.82 ± 0.09, N = 65; Monkey W: 0.64 ± 0.14, N = 14). Importantly, there were no significant correlations between the mean choices of the monkeys and the choice preferences of the simultaneously recorded neurons (Monkey L: r = 0.13, p=0.24; Monkey F: r = –0.11, p=0.08; Monkey W: r = 0.21, p=0.18).

Choice tuning was assessed from the onset of choice-related activity for the area (calculated in an iterative fashion following Chang et al., 2020b) to the end of the stimulus presentation, as briefly described here. Response differences associated with the plane’s distance were removed by z-scoring the baseline-subtracted responses separately for each distance. For neurons with significant choice tuning (ANOVA, p<0.05), responses were then grouped according to the tilt choice. For each tuned neuron, the average SDF was then computed for each choice direction and labeled relative to the neuron’s preferred choice (preferred choice, ±45°, ±90°, ±135°, and 180°). The SDFs were then averaged across neurons to create eight population-level time courses. The onset was defined as the first time point that the time courses significantly differed (ANOVA, p<0.05) for at least 30 ms (Rosenberg et al., 2013). This process was iteratively performed until the onset no longer changed.

Saccade-related activity

Saccade onset was defined as the first time point that the velocity of either eye was ≥150°/s. Saccade direction tuning was assessed from the start of saccade-related activity for the area to the saccade onset. A neuron was classified as having saccade-related activity if the baseline-subtracted firing rates significantly depended on the saccade direction (ANOVA, p<0.05). The onset of saccade-related activity was calculated following the procedure described above for the choice-related activity (Chang et al., 2020b).

The time point at which the time courses of saccade-related activity conditioned on saccade latency approximately coalesced was defined as when the sum squared error between the time courses and their mean was smallest (Chang et al., 2020b). To visually compare the temporally integrated V3A and CIP time courses, we applied a DC offset and multiplicative gain to each integrated V3A time course to minimize the sum squared error with the CIP time course.

Noise and signal correlations

We calculated noise and signal correlations between pairs of simultaneously recorded neurons on the same tetrode in V3A (N = 404 pairs) and CIP (N = 244 pairs). To calculate noise correlations, we took the spike counts across trials for each 3D pose condition (N = 132) and removed outliers (>3 standard deviations from the mean response) (Zohary et al., 1994; Kohn and Smith, 2005; Huang and Lisberger, 2009; Gu et al., 2011). To remove stimulus-dependent response differences, the remaining spike counts were z-scored separately for each condition. The correlation between the two neurons’ responses was then computed across conditions and trials. Signal correlations were computed between the 3D pose tuning curves.

Vergence control

To determine if the 3D pose responses were significantly affected by small vergence eye movements that did not violate the vergence window, we performed an ANCOVA to test for main effects of stimulus tuning with vergence included as a covariate. Tilt (linearized into cosine and sine components), slant, and distance were independent factors and the mean vergence was a covariate. Only 67 (9.7%) V3A neurons showed a significant effect of vergence (p<0.05). Importantly, the significance of the main effects did not depend on whether vergence was included as a covariate for all but 13 (1.9%) V3A neurons. These results are comparable to those in the CIP data (Chang et al., 2020b) and other studies (DeAngelis and Uka, 2003; Elmore et al., 2019), and suggest that vergence errors had a minimal impact on the findings.

Acknowledgements

We thank Zikang Zhu for helpful discussion, as well as Meghan Lowe and Satchal Postlewaite for help with spike sorting. Further support was provided by the National Institutes of Health Grant P51OD011106 to the Wisconsin National Primate Research Center and the National Institute of Child Health and Human Development Grant P50HD105353 to the Waisman Center.

Appendix 1

Here, we show that the discrimination index is invariant to linear transformations of the neuronal responses using the example of z-scoring. The discrimination index is defined as

.DI= Rmax RminRmaxRmin + 2SSENM

where Rmax and Rmin are the maximum and minimum mean responses across the tuning curve, SSE is the sum squared error around the mean responses for each condition, N is the total number of trials, and M is the number of conditions.

The z-score transformation is defined as

.Z(x)=x-μσ

where x is the observed value, µ is the sample mean, and σ is the sample standard deviation. We define DIZ as the discrimination index of the z-scored responses:

DIZ=RmaxμσRminμσRmaxμσRminμσ+Z(2SSENM)=RmaxRminRmaxRmin+σZ(2SSENM)

Thus, if Z2SSEN-M = 1σ2SSENM , then DI=DIZ . The SSE is given by

SSE=i=1Mj=1niRij-μi2

where M is the total number of conditions, ni is the number of trials for the ith condition, Rij is the response for the ith condition and jth trial, and μi is the mean response for the ith condition. Since i=1Mni=N ,

Z(2SSENM)=Z(2i=1M j=1ni (Rijμi)2NM)
=2i=1M j=1ni 1σ2(Rijμi)2NM=1σ2i=1M j=1ni (Rijμi)2NM

Thus, Z2SSEN-M reduces to 1σ2SSENM, and DI=DIZ.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Ari Rosenberg, Email: ari.rosenberg@wisc.edu.

David J Freedman, The University of Chicago, United States.

Tirin Moore, Stanford University, United States.

Funding Information

This paper was supported by the following grants:

  • National Institutes of Health T32EY027721 to Raymond Doudlah.

  • National Science Foundation DGE-1545481 to Raymond Doudlah, Lowell W Thompson.

  • National Institutes of Health T32NS105602 to Lowell W Thompson.

  • McPherson Eye Research Institute Graduate Student Support Initiative to Lowell W Thompson.

  • Alfred P. Sloan Foundation FG-2016-6468 to Ari Rosenberg.

  • Whitehall Foundation 2016-08-18 to Ari Rosenberg.

  • Greater Milwaukee Foundation Shaw Scientist Award to Ari Rosenberg.

  • National Institutes of Health EY029438 to Ari Rosenberg.

Additional information

Competing interests

No competing interests declared.

No competing interests declared.

Author contributions

Conceptualization, Data curation, Software, Formal analysis, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing - original draft, Writing – review and editing.

Conceptualization, Data curation, Software, Formal analysis, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing – review and editing.

Software, Formal analysis, Funding acquisition, Validation, Writing – review and editing.

Conceptualization, Resources, Data curation, Software, Formal analysis, Supervision, Validation, Methodology, Project administration, Writing – review and editing.

Conceptualization, Resources, Software, Methodology, Writing – review and editing.

Conceptualization, Resources, Data curation, Software, Formal analysis, Supervision, Funding acquisition, Validation, Investigation, Visualization, Methodology, Writing - original draft, Project administration, Writing – review and editing.

Ethics

This study was performed in strict accordance with the recommendations of the National Institutes of Health's Guide for the Care and Use of Laboratory Animals. All experimental procedures and surgeries were approved by the Institutional Animal Care and Use Committee (IACUC) at the University of Wisconsin-Madison (Protocol #: G005229).

Additional files

Transparent reporting form

Data availability

All data generated or analyzed during this study are available through the Open Science Framework via our lab's profile at https://osf.io/8wxk7/.

The following dataset was generated:

Rosenberg A. Doudlah R, Chang T-Y, Thompson L, Kim B. 2022. Parallel processing, hierarchical transformations, and sensorimotor associations along the 'where' pathway. Open Science Framework. a89gx

References

  1. Alizadeh AM, Van Dromme I, Verhoef BE, Janssen P. Caudal intraparietal sulcus and three-dimensional vision: A combined functional magnetic resonance imaging and single-cell study. NeuroImage. 2018;166:46–59. doi: 10.1016/j.neuroimage.2017.10.045. [DOI] [PubMed] [Google Scholar]
  2. Anzai A, Chowdhury SA, DeAngelis GC. Coding of stereoscopic depth information in visual areas V3 and V3A. The Journal of Neuroscience. 2011;31:10270–10282. doi: 10.1523/JNEUROSCI.5956-10.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Ban H, Welchman AE. FMRI analysis-by-synthesis reveals a dorsal hierarchy that extracts surface slant. The Journal of Neuroscience. 2015;35:9823–9835. doi: 10.1523/JNEUROSCI.1255-15.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Banks MS, Hooge IT, Backus BT. Perceiving slant about a horizontal axis from stereopsis. Journal of Vision. 2001;1:55–79. doi: 10.1167/1.2.1. [DOI] [PubMed] [Google Scholar]
  5. Bingham C. An antipodally symmetric distribution on the sphere. The Annals of Statistics. 1974;2:1201–1225. doi: 10.1214/aos/1176342874. [DOI] [Google Scholar]
  6. Britten KH, Newsome WT, Shadlen MN, Celebrini S, Movshon JA. A relationship between behavioral choice and the visual responses of neurons in macaque MT. Visual Neuroscience. 1996;13:87–100. doi: 10.1017/s095252380000715x. [DOI] [PubMed] [Google Scholar]
  7. Buneo CA, Andersen RA. The posterior parietal cortex: sensorimotor interface for the planning and online control of visually guided movements. Neuropsychologia. 2006;44:2594–2606. doi: 10.1016/j.neuropsychologia.2005.10.011. [DOI] [PubMed] [Google Scholar]
  8. Celebrini S, Newsome WT. Neuronal and psychophysical sensitivity to motion signals in extrastriate area MST of the macaque monkey. The Journal of Neuroscience. 1994;14:4109–4124. doi: 10.1523/JNEUROSCI.14-07-04109.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chang TY, Thompson L, Doudlah R, Kim B, Sunkara A, Rosenberg A. Optimized but not maximized cue integration for 3D visual perception. ENeuro. 2020a;7:ENEURO.0411-19.2019. doi: 10.1523/ENEURO.0411-19.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Chang TY, Doudlah R, Kim B, Sunkara A, Thompson LW, Lowe ME, Rosenberg A. Functional links between sensory representations, choice activity, and sensorimotor associations in parietal cortex. eLife. 2020b;9:e57968. doi: 10.7554/eLife.57968. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cohen M.R, Newsome WT. Context-dependent changes in functional circuitry in visual area MT. Neuron. 2008;60:162–173. doi: 10.1016/j.neuron.2008.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Cohen MR, Maunsell JHR. Attention improves performance primarily by reducing interneuronal correlations. Nature Neuroscience. 2009;12:1594–1600. doi: 10.1038/nn.2439. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Cumming BG, Nienborg H. Feedforward and feedback sources of choice probability in neural population responses. Current Opinion in Neurobiology. 2016;37:126–132. doi: 10.1016/j.conb.2016.01.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Dakin CJ, Rosenberg A. In: Handbook of Clinical Neurology. Brian LD, Stephen RL, editors. Elsevier; 2018. Gravity estimation and verticality perception; pp. 43–59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. DeAngelis GC, Uka T. Coding of horizontal disparity and velocity by MT neurons in the alert macaque. Journal of Neurophysiology. 2003;89:1094–1111. doi: 10.1152/jn.00717.2002. [DOI] [PubMed] [Google Scholar]
  16. Dodd JV, Krug K, Cumming BG, Parker AJ. Perceptually bistable three-dimensional figures evoke high choice probabilities in cortical area MT. The Journal of Neuroscience. 2001;21:4809–4821. doi: 10.1523/JNEUROSCI.21-13-04809.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Elmore LC, Rosenberg A, DeAngelis GC, Angelaki DE. Choice-related activity during visual slant discrimination in macaque CIP but not V3A. ENeuro. 2019;6:ENEURO.0248-18.2019. doi: 10.1523/ENEURO.0248-18.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Felleman DJ, Van Essen DC. Distributed hierarchical processing in the primate cerebral cortex. Cerebral Cortex. 1991;1:1–47. doi: 10.1093/cercor/1.1.1-a. [DOI] [PubMed] [Google Scholar]
  19. Fisher N. Statistical Analysis Of Circular Data. Cambridge: Cambridge University Press; 1995. [DOI] [Google Scholar]
  20. Galletti C, Battaglini PP. Gaze-dependent visual neurons in area V3A of monkey prestriate cortex. The Journal of Neuroscience. 1989;9:1112–1125. doi: 10.1523/JNEUROSCI.09-04-01112.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Galletti C, Battaglini PP, Fattori P. Real-motion cells in area V3A of macaque visual cortex. Experimental Brain Research. 1990;82:67–76. doi: 10.1007/BF00230838. [DOI] [PubMed] [Google Scholar]
  22. Gaska JP, Jacobson LD, Pollen DA. Response suppression by extending sine-wave gratings within the receptive fields of neurons in visual cortical area V3A of the macaque monkey. Vision Research. 1987;27:1687–1692. doi: 10.1016/0042-6989(87)90098-8. [DOI] [PubMed] [Google Scholar]
  23. Gaska JP, Jacobson LD, Pollen DA. Spatial and temporal frequency selectivity of neurons in visual cortical area V3A of the macaque monkey. Vision Research. 1988;28:1179–1191. doi: 10.1016/0042-6989(88)90035-1. [DOI] [PubMed] [Google Scholar]
  24. Gu Y, Liu S, Fetsch CR, Yang Y, Fok S, Sunkara A, DeAngelis GC, Angelaki DE. Perceptual learning reduces interneuronal correlations in macaque visual cortex. Neuron. 2011;71:750–761. doi: 10.1016/j.neuron.2011.06.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Gu Y, Angelaki DE, DeAngelis GC. Contribution of correlated noise and selective decoding to choice probability measurements in extrastriate visual cortex. eLife. 2014;3:e02670. doi: 10.7554/eLife.02670. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Haefner RM, Gerwinn S, Macke JH, Bethge M. Inferring decoding strategies from choice probabilities in the presence of correlated variability. Nature Neuroscience. 2013;16:235–242. doi: 10.1038/nn.3309. [DOI] [PubMed] [Google Scholar]
  27. Hanes DP, Schall JD. Neural control of voluntary movement initiation. Science. 1996;274:427–430. doi: 10.1126/science.274.5286.427. [DOI] [PubMed] [Google Scholar]
  28. Hegdé J, Van Essen DC. Role of primate visual area V4 in the processing of 3-D shape characteristics defined by disparity. Journal of Neurophysiology. 2005;94:2856–2866. doi: 10.1152/jn.00802.2004. [DOI] [PubMed] [Google Scholar]
  29. Henderson M, Vo V, Chunharas C, Sprague T, Serences J. Multivariate analysis of BOLD activation patterns recovers graded depth representations in human visual and parietal cortex. ENeuro. 2019;6:eNeuro. doi: 10.1523/ENEURO.0362-18.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Hillis JM, Watt SJ, Landy MS, Banks MS. Slant from texture and disparity cues: optimal cue combination. Journal of Vision. 2004;4:967–992. doi: 10.1167/4.12.1. [DOI] [PubMed] [Google Scholar]
  31. Hsi S, Linn MC, Bell JE. The role of spatial reasoning in engineering and the design of spatial instruction. Journal of Engineering Education. 1997;86:151–158. doi: 10.1002/j.2168-9830.1997.tb00278.x. [DOI] [Google Scholar]
  32. Huang X, Lisberger SG. Noise correlations in cortical area MT and their potential impact on trial-by-trial variation in the direction and speed of smooth-pursuit eye movements. Journal of Neurophysiology. 2009;101:3012–3030. doi: 10.1152/jn.00010.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Janssen P, Vogels R, Orban GA. Three-dimensional shape coding in inferior temporal cortex. Neuron. 2000;27:385–397. doi: 10.1016/s0896-6273(00)00045-3. [DOI] [PubMed] [Google Scholar]
  34. Judd CM, Yzerbyt VY, Muller D. In: Handbook of Research Methods in Social and Personality Psychology. Judd CM, editor. Elsevier; 2014. Mediation and moderation; pp. 653–676. [Google Scholar]
  35. Katsuyama N, Yamashita A, Sawada K, Naganuma T, Sakata H, Taira M. Functional and histological properties of caudal intraparietal area of macaque monkey. Neuroscience. 2010;167:1–10. doi: 10.1016/j.neuroscience.2010.01.028. [DOI] [PubMed] [Google Scholar]
  36. Kim B, Kenchappa SC, Sunkara A, Chang TY, Thompson L, Doudlah R, Rosenberg A. Real-time experimental control using network-based parallel processing. eLife. 2019;8:e40231. doi: 10.7554/eLife.40231. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Kohn A, Smith MA. Stimulus dependence of neuronal correlation in primary visual cortex of the macaque. The Journal of Neuroscience. 2005;25:3661–3673. doi: 10.1523/JNEUROSCI.5106-04.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Lanzilotto M, Ferroni CG, Livi A, Gerbella M, Maranesi M, Borra E, Passarelli L, Gamberini M, Fogassi L, Bonini L, Orban GA. Anterior intraparietal area: A hub in the observed manipulative action network. Cerebral Cortex. 2019;29:1816–1833. doi: 10.1093/cercor/bhz011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Lewis JW, Van Essen DC. Corticocortical connections of visual, sensorimotor, and multimodal processing areas in the parietal lobe of the macaque monkey. The Journal of Comparative Neurology. 2000;428:112–137. doi: 10.1002/1096-9861(20001204)428:1<112::aid-cne8>3.0.co;2-9. [DOI] [PubMed] [Google Scholar]
  40. Liu S, Gu Y, DeAngelis GC, Angelaki DE. Choice-related activity and correlated noise in subcortical vestibular neurons. Nature Neuroscience. 2013;16:89–97. doi: 10.1038/nn.3267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Munoz DP, Wurtz RH. Saccade-related activity in monkey superior colliculus. I. characteristics of burst and buildup cells. Journal of Neurophysiology. 1995;73:2313–2333. doi: 10.1152/jn.1995.73.6.2313. [DOI] [PubMed] [Google Scholar]
  42. Nakamura K, Colby CL. Visual, saccade-related, and cognitive activation of single neurons in monkey extrastriate area V3A. Journal of Neurophysiology. 2000;84:677–692. doi: 10.1152/jn.2000.84.2.677. [DOI] [PubMed] [Google Scholar]
  43. Nakamura K, Colby CL. Updating of the visual representation in monkey striate and extrastriate cortex during saccades. PNAS. 2002;99:4026–4031. doi: 10.1073/pnas.052379899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Nakhla N, Korkian Y, Krause MR, Pack CC. Neural selectivity for visual motion in macaque area V3A. ENeuro. 2021;8:eNeuro. doi: 10.1523/ENEURO.0383-20.2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Nguyenkim JD, DeAngelis GC. Disparity-based coding of three-dimensional surface orientation by macaque middle temporal neurons. The Journal of Neuroscience. 2003;23:7117–7128. doi: 10.1523/JNEUROSCI.23-18-07117.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Nienborg H, Cumming BG. Macaque V2 neurons, but not V1 neurons, show choice-related activity. The Journal of Neuroscience. 2006;26:9567–9578. doi: 10.1523/JNEUROSCI.2256-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Niu M, Impieri D, Rapan L, Funck T, Palomero-Gallagher N, Zilles K. Receptor-driven, multimodal mapping of cortical areas in the macaque monkey intraparietal sulcus. eLife. 2020;9:e55979. doi: 10.7554/eLife.55979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Pause M, Freund HJ. Role of the parietal cortex for sensorimotor transformation. Brain, Behavior and Evolution. 1989;33:136–140. doi: 10.1159/000115916. [DOI] [PubMed] [Google Scholar]
  49. Premereur E, Van Dromme IC, Romero MC, Vanduffel W, Janssen P. Effective connectivity of depth-structure-selective patches in the lateral bank of the macaque intraparietal sulcus. PLOS Biology. 2015;13:e1002072. doi: 10.1371/journal.pbio.1002072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Prince SJD, Pointon AD, Cumming BG, Parker AJ. Quantitative analysis of the responses of V1 neurons to horizontal disparity in dynamic random-dot stereograms. Journal of Neurophysiology. 2002;87:191–208. doi: 10.1152/jn.00465.2000. [DOI] [PubMed] [Google Scholar]
  51. Reynolds JH, Heeger DJ. The normalization model of attention. Neuron. 2009;61:168–185. doi: 10.1016/j.neuron.2009.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Roitman JD, Shadlen MN. Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task. The Journal of Neuroscience. 2002;22:9475–9489. doi: 10.1523/JNEUROSCI.22-21-09475.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Rosenberg A, Cowan NJ, Angelaki DE. The visual representation of 3D object orientation in parietal cortex. The Journal of Neuroscience. 2013;33:19352–19361. doi: 10.1523/JNEUROSCI.3174-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Rosenberg A, Angelaki DE. Gravity influences the visual representation of object tilt in parietal cortex. The Journal of Neuroscience. 2014a;34:14170–14180. doi: 10.1523/JNEUROSCI.2030-14.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Rosenberg A, Angelaki DE. Reliability-dependent contributions of visual orientation cues in parietal cortex. PNAS. 2014b;111:18043–18048. doi: 10.1073/pnas.1421131111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Ruff DA, Cohen MR. Global cognitive factors modulate correlated response variability between V4 neurons. The Journal of Neuroscience. 2014;34:16408–16416. doi: 10.1523/JNEUROSCI.2750-14.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Rushworth MFS, Nixon PD, Passingham RE. Parietal cortex and movement. Experimental Brain Research. 1997;117:292–310. doi: 10.1007/s002210050224. [DOI] [PubMed] [Google Scholar]
  58. Sauvan XM, Peterhans E. Orientation constancy in neurons of monkey visual cortex. Visual Cognition. 1999;6:43–54. doi: 10.1080/713756803. [DOI] [Google Scholar]
  59. Seilheimer RL, Rosenberg A, Angelaki DE. Models and processes of multisensory cue combination. Current Opinion in Neurobiology. 2014;25:38–46. doi: 10.1016/j.conb.2013.11.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Shadlen MN, Britten KH, Newsome WT, Movshon JA. A computational analysis of the relationship between neuronal and behavioral responses to visual motion. The Journal of Neuroscience. 1996;16:1486–1510. doi: 10.1523/JNEUROSCI.16-04-01486.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Shadlen MN, Newsome WT. Motion perception: seeing and deciding. PNAS. 1996;93:628–633. doi: 10.1073/pnas.93.2.628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Smolyanskaya A, Haefner RM, Lomber SG, Born RT. A modality-specific feedforward component of choice-related activity in MT. Neuron. 2015;87:208–219. doi: 10.1016/j.neuron.2015.06.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Stevens KA. Slant-tilt: the visual encoding of surface orientation. Biological Cybernetics. 1983;46:183–195. doi: 10.1007/BF00336800. [DOI] [PubMed] [Google Scholar]
  64. Taira M, Tsutsui KI, Jiang M, Yara K, Sakata H. Parietal neurons represent surface orientation from the gradient of binocular disparity. Journal of Neurophysiology. 2000;83:3140–3146. doi: 10.1152/jn.2000.83.5.3140. [DOI] [PubMed] [Google Scholar]
  65. Tsao DY, Vanduffel W, Sasaki Y, Fize D, Knutsen TA, Mandeville JB, Wald LL, Dale AM, Rosen BR, Van Essen DC, Livingstone MS, Orban GA, Tootell RBH. Stereopsis activates V3A and caudal intraparietal areas in macaques and humans. Neuron. 2003;39:555–568. doi: 10.1016/s0896-6273(03)00459-8. [DOI] [PubMed] [Google Scholar]
  66. Tsutsui K, Jiang M, Yara K, Sakata H, Taira M. Integration of perspective and disparity cues in surface-orientation-selective neurons of area CIP. Journal of Neurophysiology. 2001;86:2856–2867. doi: 10.1152/jn.2001.86.6.2856. [DOI] [PubMed] [Google Scholar]
  67. Tsutsui K-I, Sakata H, Naganuma T, Taira M. Neural correlates for perception of 3D surface orientation from texture gradient. Science. 2002;298:409–412. doi: 10.1126/science.1074128. [DOI] [PubMed] [Google Scholar]
  68. Tsutsui K-I, Jiang M, Sakata H, Taira M. Short-term memory and perceptual decision for three-dimensional visual features in the caudal intraparietal sulcus (area CIP) The Journal of Neuroscience. 2003;23:5486–5495. doi: 10.1523/JNEUROSCI.23-13-05486.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Van Dromme IC, Premereur E, Verhoef BE, Vanduffel W, Janssen P. Posterior parietal cortex drives inferotemporal activations during three-dimensional object vision. PLOS Biology. 2016;14:e1002445. doi: 10.1371/journal.pbio.1002445. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Van Essen DC, Lewis JW, Drury HA, Hadjikhani N, Tootell RB, Bakircioglu M, Miller MI. Mapping visual cortex in monkeys and humans using surface-based atlases. Vision Research. 2001;41:1359–1378. doi: 10.1016/s0042-6989(01)00045-1. [DOI] [PubMed] [Google Scholar]
  71. Zeki S. A direct projection from area V1 to area V3A of rhesus monkey visual cortex. PNAS. 1980;207:499–506. doi: 10.1098/rspb.1980.0038. [DOI] [PubMed] [Google Scholar]
  72. Zohary E, Shadlen MN, Newsome WT. Correlated neuronal discharge rate and its implications for psychophysical performance. Nature. 1994;370:140–143. doi: 10.1038/370140a0. [DOI] [PubMed] [Google Scholar]

Editor's evaluation

David J Freedman 1

This study compares the neuronal activity of two interconnected cortical areas in the dorsal visual pathway, V3A and CIP, during perceptual decisions based on judging the tilt of 3D visual patterns. This is a novel comparison between neural activity in these two cortical areas during perceptual decisions and gives insight into the hierarchical relationship and roles of these two areas. CIP shows higher-order spatial representations and more choice-correlated responses. Furthermore, the study finds modulation of V3A activity by extraretinal factors, suggesting that V3A be better characterized as contributing to higher-order behavioral functions as opposed to low-level visual feature processing.

Decision letter

Editor: David J Freedman1
Reviewed by: Peter Janssen2

Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Decision letter after peer review:

Thank you for submitting your article "Parallel processing, hierarchical transformations, and sensorimotor associations along the 'where' pathway" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by Joshua Gold as the Senior Editor. The following individual involved in the review of your submission has agreed to reveal their identity: Peter Janssen (Reviewer #3).

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions:

(1) R1: The manuscript should provide more information and analysis of behavioral performance in V3A and CIP sessions, to address the possibility that any differences in neuronal results observed between areas can be explained based on the performance of the animals.

(2) R1: The methods should be expanded to include additional information about the sequence and timing of V3A and CIP experimental sessions. For example, were all CIP sessions conducted first followed by V3A sessions, or were they interleaved?

(3) R1: The overall presentation of the results would be improved by adding information about the animal to animal variability in the results, and clarifying whether examples are shown from different animals (e.g. Figures 1, 2, 3)

4) R1: The presentation of results and rationale for pooling data between animals can be improved as described by point #4 from reviewer 1.

(5) R1: Regarding the analysis of presaccadic activity in V3A, it would be good to show that the results are robust to the assumptions and parameters of the analysis and justify the use of ANOVA as the statistical test.

6. R2 questions using the visually guided saccade to dissociate visual and saccade-related responses. The approach should be clarified regarding how "CIP time courses approximately coalesced".

7. R2: The discrimination index should be described more clearly, including understanding what kind of index values are expected under different conditions.

8. R2 suggests clarifying the section entitled, "Hierarchical refinement of 3D pose representation."

9. R2 suggests refining and clarifying the analysis of choice-related activity by assessing the amount of behavioral bias in the monkeys' choices.

(10) It is suggested by R3 to temper claims that the results are strong evidence for a hierarchical relationship between the two brain areas, and to place more emphasis on the novel results of pre-saccadic activity in V3A.

11) R3 recommends dropping statements regarding the conflict between anatomical and functional data, as it is not judged to be critical for the current study.

12) R3: Clarify whether the difference in timing of the onset of choice activity is significantly different between CIP and V3A.

Reviewer #1 (Recommendations for the authors):

The study would benefit from additional details about the experimental methods to understand the temporal relationship between recordings from V3A and CIP. Revisions to the analysis approach, as well as breaking down the analysis and results on a per animal basis, would strengthen the study as well.

(1) The CIP and V3a data were obtained from different recording sessions. The manuscript should provide more information and analysis of behavioral performance in V3A and CIP sessions, to address the possibility that any differences in neuronal results observed between areas can be explained based on the performance of the animals.

(2) Related to the comment above, additional information about the sequence and timing of V3A and CIP sessions should be included in the manuscript. For example, were all CIP sessions conducted first followed by V3A sessions, or were they interleaved?

(3) A general comment regarding the presentation of the results is that there is a notable lack of information about the animal to animal variability in the results, and in many cases, data is shown for only one animal or is not specified regarding whether data from multiple animals are shown.

(3a) Figures 1 & 2 show imaging and behavioral data for only one of the monkeys. In both cases, data/results should be shown for all animals.

(3b) In figure 3, it should be made clear which animals each of the single neuron examples are from.

4) Related to point 3 – for many (but not all) of the population level analyses, data is apparently combined across animals and stats reported on the pooled data. This is not unusual and generally fine, but appropriate tests need to be done and reported regarding whether results were consistent between animals, and how they differ. It is not clear what rationale is being followed to determine which analyses are reported separately for each animal, and which are from pooled data. Also, it should be reported whether findings were statistically significant within each animal, or only when data is combined across animals. The study has recorded sizeable populations of neurons within each animal and brain area, so the data seems likely to support a more granular level of analysis animal by animal.

(5) Regarding the analysis of presaccadic activity in V3A, it would be good to show that the results are robust to the assumptions and parameters of the analysis. For example, picking the divergence time based on an ANOVA at P<0.05 seems like a liberal criterion that could be prone to noise. Also, given the parametric tuning, is ANOVA really the most appropriate choice for a test?

Reviewer #2 (Recommendations for the authors):

I have a few suggestions to make, which I think, would improve the article and its clarity.

1. As stated in the public review, I don't think visually guided saccades allow you to dissociate from visual and saccade-related responses. Moreover, I don't understand how you compute how "CIP time courses approximately coalesced".

2. I think the rationale between DI and how they behave could be better explained. I am not sure I truly understand how it is supposed to behave with different levels of response relative to different variability.

3. I'm a bit confused by the paragraph named Hierarchical refinement of 3D pose representation. Here you show that 3D orientation tuning is less distance dependent in CIP than in V3a (Figure 4a). I don't understand what is the difference with the following analysis of angular deviation.

4. In the analysis of choice-related activity, you focus on trials with 0 degrees tilt and use animals' behavior to infer how they interpret these ambiguous stimuli. The analysis you propose assumes that animals' choices are evenly distributed among the 8 possible directions. Are animals biased-free?

Reviewer #3 (Recommendations for the authors):

I am not convinced that the correspondence between the integrated V3A time courses and the CIP time courses really reflects a parallel hierarchy. It is also possible that these presaccadic signals originating from a higher area arrive differently in the two areas. In any case, the manuscript would be improved if the authors would de-emphasize and maybe reduce the first part of the results, and highlight more the novel results on pre-saccadic activity in V3A.

The authors try to rescue the fact that many results are expected by suggesting an apparent conflict between anatomical and functional data, but the lower selectivity for 3D orientation in CIP compared to V3A as reported in the Elmore study is a very small effect (SODI of 0.63 vs 0.68) with a small sample of V3A neurons which could be explained by other experimental factors. I would strongly recommend dropping these statements in the introduction and discussion (also on p.9) since it is not really necessary for this study.

It is not clear whether the difference in timing of the onset of choice activity is significantly different between CIP and V3A.

eLife. 2022 Aug 11;11:e78712. doi: 10.7554/eLife.78712.sa2

Author response


Essential revisions:

1) R1: The manuscript should provide more information and analysis of behavioral performance in V3A and CIP sessions, to address the possibility that any differences in neuronal results observed between areas can be explained based on the performance of the animals.

This is an excellent point that was not addressed in the first submission. We updated the analysis to test for a difference in performance between the V3A and CIP sessions. The analysis confirmed that there was no significant difference in performance between these sessions (lines: 143-149).

(2) R1: The methods should be expanded to include additional information about the sequence and timing of V3A and CIP experimental sessions. For example, were all CIP sessions conducted first followed by V3A sessions, or were they interleaved?

As suggested, we now report the sequence of the V3A and CIP recording sessions (lines: 898-901). Briefly, we first performed 41 V3A sessions (Monkeys L and F), then all 53 CIP sessions (Monkeys L and F), and then 50 additional V3A sessions (Monkeys L, F, and W).

(3) R1: The overall presentation of the results would be improved by adding information about the animal to animal variability in the results, and clarifying whether examples are shown from different animals (e.g. Figures 1, 2, 3)

We completely agree. For all main analyses, we added statistics for each animal to complement the population results. Where relevant, we now discuss between-animal variability (which was minimal). Analyses that distinguished whether neurons carried choice-related activity were performed at the population-level only because some of the individual sample sizes became rather small (see lines: 399-402). We now indicate from which animal each example neuron came (lines: 192-193).

4) R1: The presentation of results and rationale for pooling data between animals can be improved as described by point #4 from reviewer 1.

We agree that this was not clear in the first submission. To clarify, we ran behavioral analyses as well as comparisons between the behavioral and neuronal responses separately for each animal. The neuronal analyses were run at the population level. As indicated in our response to comment #3, the resubmission includes neuronal analyses for the individual animals as well as pooled across animals.

(5) R1: Regarding the analysis of presaccadic activity in V3A, it would be good to show that the results are robust to the assumptions and parameters of the analysis and justify the use of ANOVA as the statistical test.

We thank the reviewer for bringing up this point. In the first submission, we did not clearly indicate that the onset time was defined as the first time point at which the time courses significantly diverge (ANOVA, p < 0.05) for at least 30 consecutive ms. The 30 ms criterion is widely used when calculating latencies because it makes the estimates more conservative and eliminates false positives. We have clarified this in the Materials and methods section (lines: 937-940 and 994-995).

We further verified the robustness of the results in two ways. First, we recalculated the onset of choice- and saccade-related activity using an ANOVA with more conservative parameters (significance values of 0.01, 0.001, and 0.0001). As expected, the onsets shifted to later times, but always by roughly equal amounts in V3A and CIP. In all cases the cross-area latency difference was within 2 ms of the reported difference, and the conclusions regarding the cross-area differences never changed. Second, we recalculated the onset of choice- and saccade-related activity using a Kruskal-Wallis test (nonparametric). In all cases, the onset times were within 3 ms of the times calculated using an ANOVA, and the conclusions regarding the cross-area differences did not change.

We also used an ANOVA to test if the responses of individual neurons depended on the saccade direction, which allowed for a direct comparison with previous work. Using a Kruskal-Wallis test (nonparametric), the classification (tuned vs. not tuned) was the same for 1053/1129 (93%) of the neurons. Additionally, we note that the saccade direction tuning curves of neurons with significant tuning were well described by von Mises functions (mean r = 0.92 ± 0.47x10-3 SEM), which would not be expected if they were not accurately classified using the ANOVA.

6. R2 questions using the visually guided saccade to dissociate visual and saccade-related responses. The approach should be clarified regarding how "CIP time courses approximately coalesced".

This is an important point which should have been addressed more thoroughly in the first submission. In the resubmission, we discuss that the current study cannot unambiguously distinguish the contributions of visual and presaccadic activity to the observed saccade-related activity (lines: 733-734). To be more agnostic about the possibility of presaccadic activity and to match the terminology used in Chang et al. (2020b), we now use the term “saccade-related activity”. We also added a supplemental figure which shows that the saccade-related activity follows a distinct time course from the visual flash responses measured during receptive field mapping (Figure 8―figure supplement 1). In the Discussion, we additionally highlight that several of the findings suggest that the observed saccade-related activity cannot be attributed to visual responses, and that future experiments will be required to thoroughly dissociate the contributions of visual and presaccadic activity in V3A/CIP (lines: 735-745).

In the Results section (lines: 565-566), we added additional pointers to the Materials and methods section (lines: 1003-1007) where we describe how we calculated the coalescent points for both the V3A and CIP time courses. At the same locations, we reference Chang et al. (2020b) which also used this method.

7. R2: The discrimination index should be described more clearly, including understanding what kind of index values are expected under different conditions.

As suggested, we expanded the description of the discrimination index (DI) to clarify its interpretation and how the mean and variance of the neuronal responses together determine the value (lines: 256-257 and 949-951).

8. R2 suggests clarifying the section entitled, "Hierarchical refinement of 3D pose representation."

We thank the reviewer for this suggestion. We have clarified that the two analyses in this section are complementary to one another in that they provide an assessment of how the overall shape of the orientation tuning curve depends on distance (Tolerance) and how the preferred orientation (independent of tuning bandwidth) depends on distance (lines: 312-334). Together, they provide convergent evidence of a transformation from lower-level visual feature selectivity in V3A to higher-level 3D object representations in CIP.

9. R2 suggests refining and clarifying the analysis of choice-related activity by assessing the amount of behavioral bias in the monkeys' choices.

This is an excellent point that we did not address in the first submission. We now report that each of the monkeys chose all targets in every session and provide summary statistics which show that the choice distributions were very broad. We further report that the session-by-session mean choices of the monkeys and the preferred choice directions of the neurons were not significantly correlated for any monkey, suggesting that their choices were not associated with the neuronal choice preferences (lines: 980-985).

(10) It is suggested by R3 to temper claims that the results are strong evidence for a hierarchical relationship between the two brain areas, and to place more emphasis on the novel results of pre-saccadic activity in V3A.

This is a great point. As suggested, we tempered our interpretation of the differences in saccade-related activity between V3A and CIP as evidence of a hierarchy and focus more on the novel result of saccade-related activity in V3A.

(11) R3 recommends dropping statements regarding the conflict between anatomical and functional data, as it is not judged to be critical for the current study.

We agree that this point is secondary to the goals of the study and removed all such statements.

(12) R3: Clarify whether the difference in timing of the onset of choice activity is significantly different between CIP and V3A.

To statistically compare the V3A and CIP onsets of choice- and saccade-related activity at the population level, it was necessary to perform permutation tests based on bootstrapped values. These individual statistics were not significant, but the latency differences were highly consistent across all domains, supporting the proposed hierarchy. We now highlight in the Discussion that V3A activity preceded CIP activity by a similar amount in every domain: visual onset (6 ms; lines: 710-712), choice-related activity onset (11 ms; lines: 793-794), saccade-related activity onset aligned to the saccade initiation (6 ms; lines: 717-719), and inflection point in the time course of the saccade-related activity aligned to the target onset (7 ms; lines: 737-739). Notably, these analyses were performed using different neuronal subpopulations and experimental trials, providing convergent evidence of a V3A to CIP hierarchy.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Rosenberg A. Doudlah R, Chang T-Y, Thompson L, Kim B. 2022. Parallel processing, hierarchical transformations, and sensorimotor associations along the 'where' pathway. Open Science Framework. a89gx [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    Transparent reporting form

    Data Availability Statement

    All data generated or analyzed during this study are available through the Open Science Framework via our lab's profile at https://osf.io/8wxk7/.

    The following dataset was generated:

    Rosenberg A. Doudlah R, Chang T-Y, Thompson L, Kim B. 2022. Parallel processing, hierarchical transformations, and sensorimotor associations along the 'where' pathway. Open Science Framework. a89gx


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES