Journal of Neurophysiology. 2017 Jun 21;118(3):1515–1531. doi: 10.1152/jn.00831.2016

Dynamic mechanisms of visually guided 3D motion tracking

Kathryn Bonnen, Alexander C. Huk, Lawrence K. Cormack
PMCID: PMC5596126  PMID: 28637820


Keywords: vision, motion, depth, sensorimotor behavior, binocular vision

Abstract

The continuous perception of motion-through-depth is critical for both navigation and interacting with objects in a dynamic three-dimensional (3D) world. Here we used 3D tracking to simultaneously assess the perception of motion in all directions, facilitating comparisons of responses to motion-through-depth to frontoparallel motion. Observers manually tracked a stereoscopic target as it moved in a 3D Brownian random walk. We found that continuous tracking of motion-through-depth was selectively impaired, showing different spatiotemporal properties compared with frontoparallel motion tracking. Two separate factors were found to contribute to this selective impairment. The first is the geometric constraint that motion-through-depth yields much smaller retinal projections than frontoparallel motion, given the same object speed in the 3D environment. The second factor is the sluggish nature of disparity processing, which is present even for frontoparallel motion tracking of a disparity-defined stimulus. Thus, despite the ecological importance of reacting to approaching objects, both the geometry of 3D vision and the nature of disparity processing result in considerable impairments for tracking motion-through-depth using binocular cues.

NEW & NOTEWORTHY We characterize motion perception continuously in all directions using an ecologically relevant, manual target tracking paradigm we recently developed. This approach reveals a selective impairment to the perception of motion-through-depth. Geometric considerations demonstrate that this impairment is not consistent with previously observed spatial deficits (e.g., stereomotion suppression). However, results from an examination of disparity processing are consistent with the longer latencies observed in discrete, trial-based measurements of the perception of motion-through-depth.


The perception of motion-through-depth is crucial to human behavior. It provides information necessary for tracking moving objects in the three-dimensional (3D) world so that we can, for example, duck to avoid being hit. However, motion perception and depth perception are typically examined independently, and each has become a powerful model system for investigating how information is processed in the brain (Julesz 1971; Newsome and Paré 1988; Shadlen and Newsome 2001). Significantly less work has considered the perception of motion and depth as part of one unified perceptual system for processing position and motion information from the 3D world.

In this study, subjects continuously followed objects moving in a Brownian random walk through a 3D environment. The target tracking task we employ here provides a rich and efficient paradigm for examining visual perception and visually guided action in the 3D world (Bonnen et al. 2015). Manual tracking responses can be collected at a much higher temporal resolution compared with the binary decisions in trial-based forced choice psychophysics. Our laboratory’s previous work has demonstrated that target tracking provides measures of visual sensitivity that are comparable to those obtained using traditional psychophysical methods (Bonnen et al. 2015). This prior work relied on the underlying logic that tracking should be more accurate for a clearly visible target than for targets that are difficult to see. Here we extend this logic to investigate 3D motion perception: tracking should be more accurate for clearly visible motion than for motion that is more difficult to see. Tracking also takes advantage of the natural human ability to follow objects in the environment. Forced choice tasks typically require that subjects view a single motion stimulus and make a binary decision about that motion, which they then communicate with a button press or other discrete behavioral response that is often arbitrarily mapped onto the visual perception or decision. While this traditional approach has yielded much information about motion processing in the visual system, tracking allows us to examine motion perception in finer temporal detail in the context of a task that is also more natural for observers.

Our first experiment examined tracking performance for Brownian motion in a 3D space. Subjects were instructed to track the center of a target (by pointing at it with their finger) as it moved in a 3D Brownian random walk. Tracking performance was impaired for motion-through-depth relative to horizontal and vertical motion. Thus the impaired processing of motion-through-depth observed in discrete, trial-based tasks generalizes to naturalistic, continuous visually guided behavior (Brooks and Stone 2006; Cooper et al. 2016; Katz et al. 2015; McKee et al. 1990; Nienborg et al. 2005; Tyler 1971). Follow-up experiments isolated the sources of the deficits for tracking motion-through-depth. Experiments II–IV show that the deficit is partially due to the geometry of motion-through-depth relative to an observer. However, this did not account for the longer latencies for tracking motion-through-depth compared with frontoparallel motion. We hypothesized that this remaining difference was a signature of disparity processing (Wheatstone 1838); previous work has shown behavioral delays for static disparities and slower temporal dynamics for neural responses (Braddick 1974; Cumming and Parker 1994; Nienborg et al. 2005; Norcia and Tyler 1984). Experiment V examined whether the longer latencies observed in experiments I and II can be attributed to disparity processing. When disparity processing was imposed on frontoparallel motion tracking using dynamic random element stereograms (DRES), we found impaired tracking performance that better matched the temporal characteristics of motion-through-depth tracking.

In summary, we found that the diminished performance in depth motion tracking can be explained by a combination of two factors: a geometric penalty, because 3D spatial signals give rise to two-dimensional (2D) retinal signals (projections of the 3D motion), and a disparity processing penalty, because the combination of signals across the two eyes gives rise to different temporal dynamics.

GENERAL METHODS

Observers

Three observers served as the subjects for all of the following experiments. All had normal or corrected-to-normal vision. Written, informed consent was obtained for all observers in accordance with The University of Texas at Austin Institutional Review Board. Observers were treated according to the principles set forth in the Declaration of Helsinki of the World Medical Association. Two of the three observers were authors, and the third [subject 3 (S3)] was naive concerning the purposes of the experiments.

Apparatus

Stimuli were presented using a Planar PX2611W stereoscopic display. This display consists of two monitors (with orthogonal linear polarization) separated by a polarization-preserving beam splitter (Planar Systems, Beaverton, OR). Subjects wore simple, passive linearizing filters to view binocular stereo stimuli. In all experiments, subjects used a forehead rest to maintain constant viewing distance. In experiment V, subjects were fully head-fixed using both a chin cup and a forehead rest. Each monitor was gamma-corrected to produce a linear relationship between pixel values and output luminance.

A Leap Motion controller was used to record the manual tracking data (Leap Motion, San Francisco, CA). It uses two infrared cameras and an infrared light source to track the position of hands and fingers. This device collected measurements of the (x,y,z) position of the observer’s pointer finger over time (see appendix a for an evaluation of the spatiotemporal characteristics of this device).

All experiments and analyses were performed using custom code written in MATLAB using the Psychophysics Toolbox (Brainard 1997; Kleiner et al. 2007; Pelli 1997). Subpixel motion was achieved using the anti-aliasing built into the “DrawDots” function of the Psychophysics Toolbox. During trials, observers controlled a cursor by moving their pointer finger above the Leap Motion controller. The experiments were performed with observers sitting at a viewing distance of 85 cm, except the final experiment (experiment V) in which viewing distance was 100 cm.

Stimuli

In all experiments, subjects tracked the center of a target as it moved in a random walk, controlling a visible cursor with their finger. Each dimension of the random walk (horizontal, vertical, and depth) was defined as follows:

x_{t+1} = x_t + w_t,   w_t ∼ N(0, σ²)    (1)

where x_t is the position and w_t is a random variable drawn from a Normal distribution (denoted N) with a mean of zero and variance σ². Time steps correspond to 0.05 s (20 Hz). A trial consisted of 20 s of tracking.
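As an illustration, stimulus generation under Eq. 1 can be sketched in a few lines of MATLAB (the language used for these experiments); this is our reconstruction under the stated parameters, not the authors' code:

% Sketch: one 20-s, 20-Hz 3D Brownian random walk (Eq. 1)
fs    = 20;                    % stimulus update rate, Hz
n     = 20 * fs;               % samples per 20-s trial
sigma = 1;                     % step standard deviation, mm (experiment I)
steps = sigma * randn(n, 3);   % w_t ~ N(0, sigma^2), columns = x, y, z
pos   = cumsum(steps, 1);      % x_(t+1) = x_t + w_t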

For experiments I–IV, the target and cursor were luminance-defined circles (61.5 cd/m², 0.8° diameter; and 71.3 cd/m², 0.3° diameter, respectively) on a gray background (52.4 cd/m²). Luminance was measured with a photometer (PR 655, Photo Research; Syracuse, NY) through the beam splitter and a polarizing lens. For experiment V, the target was a disparity-defined square (width = 0.8°) created by a DRES (Julesz and Bosche 1966; Norcia and Tyler 1984). Both the target and the background were composed of Gaussian pixel noise clipped at 3 SDs and set to span the range of the monitor output (mean = 52.4 cd/m², maximum = 102.8 cd/m², minimum = 2.043 cd/m²; see Fig. 12 for example). The cursor was a small red square (0.1°).

Fig. 12.

Schematic of the dynamic random element stereogram (DRES) stimulus. The target was constrained to be in front of the background. Both the target and the background were composed of Gaussian pixel noise that updated at 60 Hz.

Looming and focus cues (i.e., accommodation and defocus) are both known to be cues for motion-through-depth, but were not rendered in these stimuli. These cues would have been very small for real-world versions of our stimuli (see the general discussion for more details).

Analysis

Each trial resulted in a time series of target and response positions. To examine tracking performance, we calculated, for each trial, a cross-correlogram (CCG) of the target velocity and the response velocity for the relevant directions of motion (see, e.g., Bonnen et al. 2015; Mulligan et al. 2013). A CCG shows the correlation between the target and response velocities as a function of time lag (the temporal offset between the target and response time series). An average CCG was computed per subject across all the trials in a condition. The CCGs can loosely be interpreted as causal filters or impulse response functions; in fact, for a linear system and white noise input, the CCG is an estimate of the impulse response function.
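For concreteness, the core of such a CCG computation might look like the following MATLAB sketch (targetPos and responsePos are hypothetical column vectors of positions along one motion direction; the 2-s lag range is our choice):

% Sketch: cross-correlogram of target vs. response velocity for one trial
vT = diff(targetPos);                 % target velocity
vR = diff(responsePos);               % response velocity
maxLag = 40;                          % 2 s at 20 Hz
ccg = zeros(maxLag + 1, 1);
for k = 0:maxLag                      % response at time t+k vs. target at time t
    c = corrcoef(vT(1:end - k), vR(1 + k:end));
    ccg(k + 1) = c(1, 2);
end
lags = (0:maxLag)' / 20;              % lag axis, s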

The shape of the average CCG characterizes the overall latency and spatiotemporal fidelity of the tracking response. Some basic features of these CCG response functions (i.e., peak, lag, width) provide simple measures of performance in each condition. The peak is the maximum correlation value. The lag is the temporal offset at the peak correlation value. The width refers to the width of the CCG at one-half of the peak correlation (i.e., height). The lag provides a measure of response latency, while peak and width are related to the spatiotemporal fidelity. The median was chosen as a summary statistic for these features. While the average CCG was robust, outliers were observed on individual trials, particularly in low-amplitude conditions. To be consistent across all conditions in all experiments and avoid ad hoc methods for excluding outliers, we chose to report the median of the features (peak, lag, width). For each condition, the median and its 95% confidence intervals were estimated via bootstrapping. A bootstrapped data set was generated by resampling the original data set [e.g., the peaks for the horizontal direction for subject 1 (S1) in experiment I] with replacement. The median was calculated for that bootstrapped data set. This was repeated N times (N = 1,000). The median and 95% confidence intervals of those N medians are reported (see Figs. 3, 7, 9, 11, and 14). In many figures, the 95% confidence intervals may be hidden behind the plotted CCG curves because they are relatively small intervals.
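A minimal MATLAB sketch of this bootstrap, assuming a hypothetical vector peaks holding one feature value per trial:

% Sketch: bootstrapped median and 95% CI for a per-trial feature
N = 1000;
bootMed = zeros(N, 1);
for i = 1:N
    idx = randi(numel(peaks), numel(peaks), 1);  % resample trials with replacement
    bootMed(i) = median(peaks(idx));
end
bootMed = sort(bootMed);
ci  = bootMed(round([0.025 0.975] * N));         % 95% confidence interval
med = median(bootMed);                           % reported median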

Fig. 3.

Summary of features of tracking performance in experiment I, calculated from CCGs shown in Fig. 2. Features (top: lags; middle: peaks; bottom: width at half peak) indicate consistently better performance (shorter lags, higher peak correlation values, and smaller CCG widths) for tracking frontoparallel motion compared with motion-through-depth for a target moving in a Brownian random walk. Bar height indicates median values, and error bars show 95% confidence intervals.

Fig. 7.

Summary of features of tracking performance for frontoparallel motion tracking and motion-through-depth tracking shown in Fig. 6. Color corresponds to condition, error bars indicate 95% confidence intervals, and the lines correspond to least squares fits of the frontoparallel data. Median lags (top row), median peak correlations (middle row), and median width (bottom row) values for all 3 subjects are shown. Peak correlation increases as a function of σ. Lag changes very little. Width has a negative relationship with σ. See Table 3 for slope values. With one exception (subject 3, peak), the point corresponding to depth tracking is clearly afield from the line describing the frontoparallel data.

Fig. 9.

Summary of features of tracking performance for gain-corrected frontoparallel motion tracking and motion-through-depth tracking. Median lags (top), peak correlations (middle), and median widths (bottom) for motion-through-depth (black) and gain-corrected frontoparallel (brown) tracking for each of the subjects are shown. A pronounced and consistent difference in lags remains between motion-through-depth tracking and frontoparallel motion tracking. In the case of peak correlation and width, corrected gain accounts for the majority of the difference. Error bars represent 95% confidence intervals.

Fig. 11.

Summary of features of tracking performance from depth (black), vertical XZ (light red), and vertical XY (dark red) CCGs pictured in Fig. 10. Features (top: lags; middle: peaks; bottom: width at half peak) indicate consistent difference between motion-through-depth tracking and vertical XZ tracking. Bar heights indicate median values, and error bars show 95% confidence intervals.

Fig. 14.

Summary of features of tracking performance from depth (black) and frontoparallel (blue) CCGs pictured in Fig. 13. Bar heights indicate median values, and error bars show 95% confidence intervals. Features (top: lags; middle: peaks; bottom: width at half peak) are similar across motion-through-depth tracking and frontoparallel tracking. In particular, the latency difference between frontoparallel motion tracking performance and motion-through-depth tracking performance is negligible or reversed (see Table 7).

We performed several planned comparisons of features across conditions in our experiments. For these comparisons, the effect size (effectively a d′) was calculated on the medians:

|m_1 − m_2| / √(s_1² + s_2²)    (2)

where m_1 and m_2 are the medians of the respective conditions and s_1 and s_2 are the corresponding standard deviations. Because the data contained outliers and did not meet the assumption of normality, we could not perform traditional Student's t-tests for these comparisons. To evaluate significance, we resampled (N = 100,000) to obtain the distribution of differences between the medians in the two conditions in question. We report the cumulative probability that the difference is ≤0 as our significance value, where the difference is taken in the direction of the effect (effectively a one-tailed t-test for medians).
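Under our reading of this procedure, the computation might be sketched as follows in MATLAB (x1 and x2 are hypothetical vectors of per-trial feature values for the two conditions being compared):

% Sketch: effect size (Eq. 2) and one-tailed bootstrap test on medians
d = abs(median(x1) - median(x2)) / sqrt(std(x1)^2 + std(x2)^2);
N = 100000;
diffs = zeros(N, 1);
for i = 1:N                     % bootstrap the difference of medians
    m1 = median(x1(randi(numel(x1), numel(x1), 1)));
    m2 = median(x2(randi(numel(x2), numel(x2), 1)));
    diffs(i) = m1 - m2;         % signed in the direction of the effect
end
p = mean(diffs <= 0);           % cumulative probability the difference is <= 0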

Experiment I. 3D Tracking

Observers tracked the center of a target as it moved in a 3D Brownian random walk. Analysis of the resulting time series (target path and subject’s response path) revealed a selective impairment for tracking motion-through-depth.

Methods.

In this experiment, observers were asked to track the center of a luminance-defined stereoscopic target as it moved in a 3D Brownian random walk (σ = 1 mm) using their finger to control a visible cursor. Note that, when referring to motion in each of the three dimensions, we will use the terms horizontal motion, vertical motion, frontoparallel motion (referring to horizontal and/or vertical motion), and motion-through-depth to remain consistent with existing literature. The term 3D motion will refer more generally to motion in all directions. The cursor motion was rendered to match the motion of the observer’s finger in space, such that, when the subjects moved their finger 1 cm in a direction, the cursor appeared to move 1 cm in that direction. Observers completed 50 such trials in blocks of 10.

Each trial yielded a pair of x-y-z time series: the position of the target in 3D space (i.e., the stimulus), and the position of the cursor (i.e., the observer's response). For each trial, we computed a CCG (see, e.g., Bonnen et al. 2015; Mulligan et al. 2013) of the target velocity and the response velocity for the horizontal, vertical, and depth components. We report the average across all trials. In experiment I, the CCG functions are well fit by a skewed Gabor function (e.g., Geisler and Albrecht 1995), a sine function windowed by a skewed Gaussian (a Gaussian with two σ values, σ_1 above the mean and σ_2 below the mean):

f(t) = a · exp[−(t − μ)² / (2σ_1²)] · sin[2πω(t − μ)],    t ≥ μ    (3)
f(t) = a · exp[−(t − μ)² / (2σ_2²)] · sin[2πω(t − μ)],    t < μ    (4)

where t is the function domain; a is the amplitude; μ is the mean of the Gaussian and the offset of the sine wave; σ_1 and σ_2 are the standard deviations, σ_1 for t ≥ μ and σ_2 for t < μ; and ω is the frequency of the sine wave. The location of this function's maximum value (i.e., the lag) is equal to μ + σ_1/2. The skewed Gabor function was fit to CCGs using least squares minimization. The proportion of the variance explained was used to measure the goodness of fit. This measure was calculated by leave-one-out cross validation: all but one trial were used to perform the fit, then the correlation between the left-out trial and the fit was calculated, and this value was squared to find the variance explained. This was repeated 50 times, once for each trial, and the average is reported.
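A sketch of this fit in MATLAB, assuming the ccg and lags vectors from the CCG sketch in the general methods (the initial parameter guess is ours):

% Sketch: skewed Gabor (Eqs. 3 and 4) fit by least squares
% p = [a, mu, omega, sigma1, sigma2]
skewGabor = @(p, t) p(1) .* exp(-(t - p(2)).^2 ./ ...
    (2 * (p(4) .* (t >= p(2)) + p(5) .* (t < p(2))).^2)) .* ...
    sin(2 * pi * p(3) .* (t - p(2)));
sse  = @(p) sum((ccg - skewGabor(p, lags)).^2);    % sum of squared errors
pHat = fminsearch(sse, [0.2, 0.4, 0.5, 0.3, 0.15]);
lagEstimate = pHat(2) + pHat(4) / 2;               % mu + sigma_1/2, per the text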

Results.

Figure 1 shows tracking time series data for one subject from an example trial (20 s). Subjects were able to track the target (gray lines) in each of the cardinal directions (relative to the observer): horizontal (blue; left), vertical (red; middle), and depth (black; right).

Fig. 1.

Example of data generated by target tracking. These data were taken from a single trial completed by subject 1. Each subplot shows the position for a particular cardinal direction (horizontal, vertical, and depth) over time. In every panel, the thick gray line represents the target position. The thinner line in each panel represents the subject’s tracking response (horizontal, blue; vertical, red; depth, black).

Figure 2 shows the mean CCGs (dots) and 95% confidence intervals (cloud) for each of the three subjects, across all trials using the same color conventions as in Fig. 1. Skewed Gabor functions were fit to the CCGs (see Methods for details). The solid lines in Fig. 2 correspond to the fits. The proportion of the variance explained across subjects and cardinal directions ranges from 73 to 88%, with an average of 82% (see Table 1 for goodness-of-fit values per CCG).

Fig. 2.

Experiment I: 3D tracking. Tracking motion-through-depth is impaired relative to tracking frontoparallel motion. Average cross-correlograms (CCGs) are plotted (dots) for all 3 subjects for horizontal (blue), vertical (red), and motion-through-depth (black). The skewed-Gabor fits of those CCGs are plotted as solid lines. Here and in similar figures to follow, error clouds represent 95% confidence intervals on the data, although these error clouds are often not distinguishable from the data and/or fits. Notice the pronounced difference in tracking performance between frontoparallel motion (horizontal/vertical) and motion-through-depth.

Table 1.

The proportion of variance explained by the fits shown in Fig. 2

             Subject 1    Subject 2    Subject 3
Horizontal   0.82         0.83         0.88
Vertical     0.79         0.83         0.83
Depth        0.83         0.73         0.84

Values are goodness of fit per CCG.

All subjects show a significant impairment for tracking motion-through-depth compared with horizontal and vertical motion. In particular, the depth CCG for each subject has a decreased peak correlation, increased lag (of that peak correlation), and increased width (at half peak) compared with either of the frontoparallel CCGs (horizontal and vertical). The differences in these features indicate a longer response latency and decreased spatiotemporal precision for tracking motion-through-depth in experiment I.

Figure 3 shows the median lag (first row), peak (second row), and width (third row) for the horizontal, vertical, and depth CCGs for each observer (error bars indicate 95% confidence intervals, see general methods for how these features are computed). We compared the features of motion-through-depth tracking performance to horizontal motion tracking performance to confirm our previous observations that the depth motion CCGs exhibit increased lags, decreased peaks, and increased widths (P < 1e-5 across all comparisons; see Table 2 for effect sizes and significance values). These differences are indicative of decreased performance in tracking motion-through-depth across all features.

Table 2.

Comparison of frontoparallel motion tracking (horizontal, blue, in Figs. 1, 2, and 3) and motion-through-depth tracking (black in Figs. 1, 2, and 3) in experiment I

         Subject 1          Subject 2          Subject 3
Lag      2.78 (P < 1e-5)    3.46 (P < 1e-5)    3.10 (P < 1e-5)
Peak     1.83 (P < 1e-5)    1.75 (P < 1e-5)    1.09 (P < 1e-5)
Width    2.63 (P < 1e-5)    0.84 (P < 1e-5)    0.66 (P < 1e-5)

Values are the effect sizes, with significance values in parentheses, for the difference of medians.

While there are differences between horizontal and vertical tracking CCGs, they are all relatively small and idiosyncratic to the observer. For example, notice that S1 shows a slightly lower peak and longer lag for vertical tracking compared with horizontal. Informally, we have observed that these individual differences are relatively stable across time and experimental condition (over the course of 2–3 yr of experiments in our laboratory). However, our primary interest is in the general differences in performance between frontoparallel and depth tracking. Therefore, we take horizontal tracking to be representative of frontoparallel tracking for the purposes of comparison.

Discussion.

In experiment I, observers tracked a target as it moved in a 3D Brownian random walk. Observers showed impaired tracking performance for tracking the motion-through-depth compared with the frontoparallel components of motion.

Two potential explanations for this relative impairment are as follows: 1) the egocentric geometry of motion-through-depth results in a smaller signal-to-noise ratio (SNR) (i.e., the visual signals are much smaller for motion-through-depth than for frontoparallel motion); and 2) the perception of motion-through-depth involves additional mechanisms to make use of binocular signals [e.g., interocular velocity differences (IOVDs), changing disparities (CD)], and those additional mechanisms have different spatiotemporal signatures in the context of tracking.

The following experiments examine the contribution of these two explanations to the impairment of motion-through-depth tracking, with experiments II–IV focused primarily on the ramifications of geometry, and experiment V focused on the role of disparity processing.

Experiment II. Geometry of 3D Motion as a Constraint on Motion-Through-Depth Tracking Performance

The magnitude of the retinal signal resulting from an environmental motion depends on the direction of the motion relative to the observer (see Fig. 4). In fact, when the viewing distance is large relative to the interpupillary distance (allowing the small-angle approximation), frontoparallel motion and motion-through-depth result in retinal projections with a relative size of ~1:ipd/d, where d is viewing distance and ipd is interpupillary distance. This approximation assumes that the viewing distance is large compared with the interpupillary distance (d >> ipd) and that the motion's horizontal location is x ≈ 0 (i.e., close to the midpoint between the two eyes). Both are true during the motion-through-depth condition of the following experiment (d = 85 cm, ipd = 6.5 cm, x = 0). The ratio is calculated by similar triangles, as shown in the diagram in Fig. 4, middle. For this experiment, the ratio is 1:0.08, meaning the motion-through-depth signal is <10% of the frontoparallel signal. This geometric reality greatly reduces the SNR for tracking motion-through-depth. We examined the ramifications of this geometry in two ways: 1) a set of simulations that used a Kalman filter observer and manipulated signal size; and 2) a set of experiments that manipulated signal size for frontoparallel motion tracking and then compared it to motion-through-depth tracking performance.

Fig. 4.

Frontoparallel motion and motion-through-depth produce differently sized retinal signals. Left: for an environmental motion vector that remains the same size, regardless of direction, the magnitude of the resulting motion on the retina (measured as the absolute angular difference, i.e., the difference between the gray line and the black/blue lines) is smaller for motion-through-depth (black) than for horizontal motion (blue). Middle: the approximation of the ratio of the size of the retinal projections for frontoparallel motion vs. motion-through-depth is calculated by similar triangles, making the assumption that d >> ipd and x = 0. From this diagram we see that ipd/d = x_proj/z. Let z = 1; then x_proj = ipd/d. Right: the magnitude of the motion on the retina (retinal angle, black line) is periodic over the environmental motion direction (left/right are large, and toward/away are small). Arrows show the two cases illustrated in the left panel.

For this set of experiments/simulations and all remaining experiments, we shift to reporting σ in arcminutes, because we are now concerned with the size of the motion falling on the retina, and this is traditionally reported in degrees (or arcminutes) of visual angle. The standard conversion from motion on the screen to arcminutes is simply 60 × 2 × atand[v/(2d)], where atand is the arctangent in degrees, v is the extent of the motion on the screen, and d is the viewing distance.
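For example, this conversion can be checked in a line of MATLAB; under the parameters given above, the σ = 1 mm step size of experiment I at d = 85 cm works out to the largest amplitude used below:

% Sketch: screen motion (same units as d) to visual angle (arcmin)
mm2arcmin = @(v, d) 60 * 2 * atand(v ./ (2 * d));
mm2arcmin(1, 850)        % 1 mm at 85 cm -> ~4.04 arcmin
ipd = 6.5; d = 85;       % cm
ipd / d                  % ~0.08, the geometric ratio from Fig. 4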

Kalman filter observer.

We simulated tracking at different signal sizes using a Kalman filter observer (ideal observer for the behavioral tracking paradigm; Baddeley et al. 2003; Bonnen et al. 2015). It makes a set of testable predictions for how manipulating SNR should affect measures of performance (e.g., CCGs, peak, lag, and width).

Two equations form a simple linear dynamical system: the random walk of a stimulus (x_{1:T}, see Eq. 1) and the corresponding noisy observations (y_{1:T}) of an observer:

y_t = x_t + v_t,   v_t ∼ N(0, R)    (5)

where x_t is a target position, y_t is the noisy observation of x_t, σ² is the parameter that controls the motion amplitude of the stimulus, R is the observation noise variance, and v_t corresponds to a random variable drawn from N(0, R) at time t. (See appendix in Bonnen et al. 2015 for additional details.) In this formulation, σ²/R is the SNR. The Kalman filter provides an optimal solution for estimating x_t given y_{1:t}, σ², and R. Using the Kalman filter as an observer, we simulated responses to a random walk with different values of σ while R remained fixed, effectively manipulating the SNR. Figure 5 summarizes the results of this simulation. The values of σ were chosen to correspond to those in experiment II (below). R was set to 200 arcmin so that the CCG in the maximum SNR condition had a peak response comparable to the empirical results observed in experiment II (below). Changing σ systematically changes the optimal Kalman gain (K), which specifies how much a new noisy observation is weighted relative to the previous estimate. The CCGs reflect those changes in the optimal Kalman gain, showing a decreased peak correlation (see Fig. 5, bottom middle) and increased width of the CCG (see Fig. 5, bottom right). The lag is unaffected (see Fig. 5, bottom left).
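A minimal MATLAB sketch of this simulation, assuming the scalar formulation above (parameter values follow the text; the implementation details are ours):

% Sketch: Kalman filter observer tracking a 1D random walk (Eqs. 1 and 5)
sigma = 0.51;                             % motion amplitude, arcmin
R     = 200;                              % observation noise variance
T     = 400;                              % 20 s at 20 Hz
x = cumsum(sigma * randn(T, 1));          % target random walk
y = x + sqrt(R) * randn(T, 1);            % noisy observations
xhat = zeros(T, 1); P = R;                % state estimate and its variance
for t = 2:T
    Pprior  = P + sigma^2;                % predict: uncertainty grows by sigma^2
    K       = Pprior / (Pprior + R);      % optimal Kalman gain
    xhat(t) = xhat(t-1) + K * (y(t) - xhat(t-1));  % weight new observation by K
    P       = (1 - K) * Pprior;           % posterior variance
end
% The CCG of diff(x) and diff(xhat), averaged over trials, yields Fig. 5-style curves.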

Fig. 5.

Performance of a Kalman filter observer. Top left: the change in the optimal Kalman gain (K) as a function of motion amplitude (σ). Larger values of σ result in larger values of K, which results in a higher weighting of new observations. Top right: average CCGs were calculated for simulated Kalman filter observer responses at each of the values of σ. Bottom left: crucially, lag was independent of σ. Bottom middle: higher values of σ resulted in higher peak correlations, indicating a higher spatiotemporal fidelity. Bottom right: higher values of σ resulted in lower widths at half height, indicating a higher temporal precision in the response.

Changes in SNR due to geometry predict some of the differences between frontoparallel motion tracking and motion-through-depth tracking observed in experiment I: a drop in the CCG peak and an increase of the width of the CCG, but not the change in lag. The following experiment examines the effects of manipulating SNR on human performance, compares human performance to the Kalman filter observer, and makes a comparison between tracking frontoparallel motion and motion-through-depth.

Methods.

As in experiment I, observers tracked the center of the target using their finger to control the cursor. A trial consisted of 20 s of tracking the target. In this experiment, observers always tracked one-dimensional Brownian random walks as in Eq. 1. We manipulated σ, thus controlling the distribution of the size of the motion steps across blocks of trials.

There were two distinct conditions: frontoparallel motion tracking and motion-through-depth tracking. In the frontoparallel motion tracking, observers tracked the target as it moved left and right on the x-axis. The motion corresponded to a one-dimensional Brownian random walk at σ = 4.04, 2.02, 1.01, 0.51, 0.25 arcmin; or in pixels at σ = 3.49, 1.75, 0.87, 0.44, 0.22, well within the subpixel capabilities of the Psychophysics Toolbox. Cursor responses were also confined to the x-axis. In the motion-through-depth tracking, observers tracked the target as it moved back and forth along the z-axis. The horizontal projections of the depth motion corresponded to a one-dimensional Brownian random walk at σ = 0.51 arcmin. Cursor responses were confined to the z-axis.

The geometry of the stimulus/cursor was drawn to match the observers’ motion in space, such that when the subject moved his/her finger 1 cm in a direction, the cursor appeared to move 1 cm in that direction. Observers completed 20 trials in blocks of 10 at each value of σ (indicated previously). The experiment was block-randomized.

Results.

Figure 6 summarizes the results from this set of experiments in the form of CCGs, whereas Fig. 7 plots the lags (top row), the peak correlations (middle row), and the width-at-half-peak (bottom row) for each condition.

Fig. 6.

Experiment II. Manipulating motion amplitude demonstrates that impairments to motion-through-depth tracking are not purely the result of the smaller SNR. Average CCGs for frontoparallel motion tracking across the 5 different motion amplitudes are shown in blue. Decreased motion amplitude results in decreased tracking performance, primarily manifesting as a decreased peak, with no appreciable change in lag. These behavioral results are comparable to the predictions made in the simulations (see Fig. 5). The average CCG for the motion-through-depth tracking condition is shown in black. Motion-through-depth tracking performance has an obviously increased lag compared with frontoparallel motion tracking performance; i.e., the peak of the depth CCG is right-shifted compared with the frontoparallel CCGs. The motion amplitude in the motion-through-depth condition matches the second smallest motion amplitude in the frontoparallel motion conditions. However, for subjects 1 and 2, the peak of the depth CCG does not match the peak of the frontoparallel condition with comparable motion amplitude. The peak of the depth CCG is actually more consistent with a higher motion amplitude. Error clouds represent 95% confidence intervals on the data.

frontoparallel motion tracking.

Tracking performance decreases with decreasing motion amplitude, as evidenced by the systematic changes in the CCGs (blue) in Fig. 6. This manifests specifically as an increased peak and decreased width for an increased σ, with little to no change in the lags (slopes reported in Table 3). The changes to frontoparallel motion tracking performance with the manipulation of SNR are inconsistent with the impaired motion-through-depth tracking found in experiment I in important ways: 1) decreasing motion amplitude does not shift the lag of responses; and 2) the horizontal projections corresponding to motion-through-depth tracking in the original experiment have a σ of ~0.25 arcmin, but motion-through-depth tracking performance in experiment I, as measured by peak correlation (medians of 0.17, 0.14, and 0.18 for each subject, respectively), is better matched by higher σ values in the frontoparallel motion tracking condition.

Table 3.

Linear fits of lag, peak, and width for changing amplitude in experiment II

         Subject 1             Subject 2              Subject 3
Lag      0.001 (R² = 0.03)     −0.025 (R² = 0.55)     −0.015 (R² = 0.54)
Peak     0.058 (R² = 0.94)     0.057 (R² = 0.99)      0.060 (R² = 0.95)
Width    −0.006 (R² = 0.05)    −0.050 (R² = 0.86)     −0.016 (R² = 0.56)

Values are the slopes, with R² values in parentheses.

relationship of results to kalman observer.

The observed behavioral changes with the manipulation of SNR are consistent with those predicted by the Kalman observer: little change in lag, a drop in the CCG peak, and an increased CCG width (see Figs. 5–7). We should note that the simulations predict a stronger relationship between width and σ than we observed. We believe that this is largely due to limitations of the Kalman filter as an ideal observer of human behavior. Human observers do not have perfect signal transmission of position/motion through the visual system. Previous work has described monocular/binocular temporal filters that would impart some of the features observed in our CCGs, particularly the negative lobes (Neri 2011; Nienborg et al. 2005). This difference in shape due to temporal filters would likely have an effect on our measures of width.

motion-through-depth tracking.

Subjects performed an additional motion-through-depth tracking condition to generate data directly comparable to the frontoparallel motion tracking condition in this experiment; i.e., the visual signal size matched the second smallest motion amplitude condition performed during frontoparallel motion tracking (σ = 0.51 arcmin). The CCG corresponding to this condition is plotted in black in Fig. 6. The motion amplitude of this condition makes it directly comparable to the frontoparallel condition marked by the second lightest shade of blue lines/symbols in Figs. 6 and 7. We compared the peak, lag, and width of the CCGs for these two conditions (see Table 4 for effect sizes and significance values). Again, motion-through-depth tracking performance exhibits an increased lag not present in the frontoparallel motion tracking (P < 1e-5). For S1 and subject 2 (S2), the peak correlation for motion-through-depth tracking was better matched by a higher value of σ in the frontoparallel motion condition (P < 1e-5), but for S3 there was no significant difference (P = 0.07).

Table 4.

Comparison of motion-through-depth tracking and frontoparallel motion tracking at σ = 0.51 arcmin in experiment II

         Subject 1          Subject 2            Subject 3
Lag      1.84 (P < 1e-5)    2.19 (P < 0.1e-5)    7.73 (P < 1e-5)
Peak     1.77 (P < 1e-5)    1.50 (P < 0.1e-5)    0.40 (P = 0.07)
Width    2.16 (P = 1e-5)    0.39 (P = 0.02)      1.69 (P < 1e-5)

Values are the effect sizes, with significance values in parentheses, for the difference of medians.

Discussion.

Changing motion amplitude (as we have done in this experiment) is in essence changing the range of retinal signal sizes and thus the overall SNR, given some fixed level of internal noise. Tracking performance (as measured by peak, i.e., spatial fidelity) does decrease with decreasing motion amplitude, and it is reasonable to conclude that the difference in visual signal size contributes to the impairment to motion-through-depth tracking.

However, a direct comparison of frontoparallel motion and motion-through-depth tracking performance when visual signal size is equivalent (at σ = 0.51 arcmin) revealed substantial differences in performance. Motion-through-depth tracking had an increased latency and for some subjects different spatial fidelity. Based on these results, we conclude that the impairment to tracking depth motion observed in experiment I is not completely explained by the difference in visual signal size due to geometry.

Experiments III and IV address concerns about whether differences in performance are due to differences in motor control. Experiment III changes the cursor motion in the frontoparallel motion condition so that it matches the cursor motion-to-visual signal size ratio of the motion-through-depth condition. This accounts for much of the remaining difference in the spatial fidelity of the tracking response. Experiment IV examines whether physical differences in arm motion direction play a significant role in tracking performance.

Experiment III. Frontoparallel Cursor Motion Consistent with Motion-Through-Depth Tracking

The one component of the geometry not equivalent across frontoparallel motion tracking and motion-through-depth tracking in experiment II was the cursor gain. We intentionally drew the cursor to match the observers’ motion in space, so that when the subject moved his/her finger 1 cm in a direction, the cursor appeared to move 1 cm in that direction. Consider the motion-through-depth and frontoparallel motion conditions where retinal motion amplitude was equivalent (σ = 0.51′). Because retinal signals of equal size result in much larger environmental motions in depth, subjects had to move more on average in the motion-through-depth conditions. That difference in movement also provides finer control of the cursor and may cause delays during depth motion tracking. In this experiment, subjects performed frontoparallel motion tracking using a cursor with the same gain as motion-through-depth tracking (i.e., larger finger motion was required).

Methods.

As in experiment II, observers tracked the target as it moved in a one-dimensional Brownian random walk along the x-axis (at σ = 0.51 arcmin). Like the regular frontoparallel motion tracking condition, cursor responses were confined to the x-axis. However, the gain of the cursor motion was set to match the gain associated with motion-through-depth tracking, such that a larger motor movement was required to produce a relatively small cursor movement. Each subject completed 20 trials in this condition.

Results.

Changing the cursor gain to match the visual signal size does have an impact on tracking performance. Figure 8 illustrates that gain-corrected frontoparallel tracking performance more closely resembles motion-through-depth tracking (replotted in black) than uncorrected frontoparallel tracking does (see Fig. 6 for reference).

Fig. 8.

Experiment III. Cursor motion consistent with visual signal size (i.e., gain-corrected) still cannot account for the difference in latency for motion-through-depth tracking. The average CCG for the condition with gain-corrected cursor motion is shown in brown. The average motion-through-depth tracking CCG from experiment II (see Fig. 6) is replotted in black for each subject. Error clouds represent 95% confidence intervals on the data. The gain-correction accounts for some of the difference between frontoparallel motion and motion-through-depth tracking seen in experiment I. However, there remains an increased lag for motion-through-depth tracking (P < 1e-3 for all subjects; see Table 5).

Figure 9 shows the remaining differences between performance for motion-through-depth tracking vs. gain-corrected, frontoparallel motion tracking (see Table 5 for difference values and associated P values). Even in the gain-corrected frontoparallel motion tracking, there was still a difference in lag compared with motion-through-depth tracking, while the difference in CCG peak height diminished substantially. The CCG width remained significantly different only in the case of S1.

Table 5.

Comparison of gain-corrected frontoparallel motion tracking and motion-through-depth tracking performance in experiment III

         Subject 1          Subject 2          Subject 3
Lag      2.55 (P < 1e-5)    0.55 (P = 1e-3)    0.87 (P < 1e-5)
Peak     0.08 (P = 0.40)    0.58 (P = 0.01)    0.48 (P = 0.09)
Width    1.54 (P < 1e-5)    0.23 (P = 0.15)    0.22 (P = 0.05)

Values are the effect sizes, with significance values in parentheses, for the difference of medians.

Discussion.

The impaired performance in motion-through-depth tracking originally demonstrated in experiment I is partially explained by geometry, as illustrated in experiments II and III. However, there remains at least an unexplained lag (or increased latency) in the motion-through-depth tracking response. One possibility that must be eliminated is that this temporal delay is simply due to motor differences between frontoparallel motor movements (left, right, up, down) and depth motor movements (forward and backward).

Experiment IV. Frontoparallel Cursor Motion and Cursor Control Consistent with Motion-Through-Depth Tracking

During tracking, subjects move their finger left and right, up and down, and back and forth. Each of the cardinal directions is tied to one of these arm/finger motions. It is possible that some or all of the latency differences are due to the different motor demands of moving the arm/finger back and forth. To determine the contribution of these motor differences, subjects performed a frontoparallel motion tracking task in which we manipulated whether a subject controlled XY cursor motion with XY finger motion (as in all of the previous experiments) or controlled XY cursor motion with XZ finger motion (as one does when using a mouse or trackpad where motion toward the screen moves the cursor up the screen).

Methods.

Observers tracked the target as it moved in a 2D Brownian random walk (σ = 4.04 arcmin) in xy space. Cursor responses were confined to the x- and y-axes. In the XY condition, finger motion along the y-axis controlled cursor motion on the y-axis. In the XZ condition, finger motion along the z-axis controlled cursor motion on the y-axis. Each trial was 20 s, and two observers performed 50 trials in each condition.

Results.

Average CCGs were calculated for the horizontal and vertical motion (see Fig. 10). Note that S1 shows nearly identical performance across the two conditions. The main question here is whether the motor demands of moving forward and backward could result in the impairments observed in experiment I. Thus the main comparison here is between the vertical motion XZ tracking performance and the motion-through-depth tracking.

Fig. 10.

Experiment IV. Manipulating the finger motion axis (XY vs. XZ) cannot account for the difference between frontoparallel and motion-through-depth tracking. Average CCGs are shown for XY motion tracking using XY finger motion (solid lines) and using XZ finger motion (dashed lines), where the horizontal motion CCGs are in blue, and the vertical motion CCGs are in red. The average depth CCG from experiment I is replotted in black for convenient comparison. Error clouds represent 95% confidence intervals on the data.

Features of tracking performance are calculated for motion-through-depth, vertical XZ, and vertical XY (see Fig. 11). The comparison of interest is between vertical XZ and motion-through-depth, but vertical XY is provided for reference. The comparison between features of vertical XZ tracking and motion-through-depth tracking performance demonstrates large differences in tracking performance for most features/subjects (see Table 6). Again, there is a consistent difference in the lags.

Table 6.

Comparison of vertical tracking with XZ finger motion and motion-through-depth tracking performance in experiment IV

         Subject 1          Subject 2          Subject 3
Lag      0.60 (P < 1e-5)    2.45 (P < 1e-5)    1.70 (P < 1e-5)
Peak     0.89 (P < 1e-5)    1.97 (P < 1e-5)    0.52 (P = 0.01)
Width    1.02 (P < 1e-5)    0.67 (P < 1e-5)    0.22 (P = 0.03)

Values are the effect sizes, with significance values in parentheses, for the difference of medians.

Discussion.

Tracking XY motion with XZ finger movements does not account for the large differences in lag between frontoparallel motion and motion-through-depth tracking observed in experiment I. The largest remaining difference between motion-through-depth tracking and frontoparallel motion tracking is that disparity computations are required to perform the motion-through-depth tracking. We hypothesized that the remaining response delay for motion-through-depth tracking in experiment III is the consequence of disparity processing. This hypothesis is explored in experiment V.

Experiment V. Disparity Processing as a Constraint on Motion-Through-Depth Tracking Performance

Previous experiments cannot entirely account for the difference between frontoparallel motion tracking and motion-through-depth tracking performance. The remaining difference is primarily in the latency of the tracking response. However, frontoparallel motion tracking does not require processing of binocular signals, e.g., binocular disparities or IOVDs. In this experiment, we imposed disparity processing on frontoparallel motion tracking: subjects tracked a disparity-defined target created by a DRES as it moved in a 3D random walk. We also applied what was learned in experiments II and III, adjusting the amplitude of the depth motion to increase its visual signal size and matching the cursor gain across directions.

Methods.

In this experiment, observers were asked to use their finger to track the center of a disparity-defined square target created by a DRES (see Fig. 12 and general methods), using a cursor (a small red luminance square). The geometry of the stimulus/cursor was drawn as in experiment III, so that the geometry of the cursor motion was matched across frontoparallel and depth motion according to visual signal size. A trial consisted of 20 s of tracking the target as it moved in a 3D Brownian random walk (σ = 2.02′ in the horizontal and vertical dimensions, σ = 0.79′ in depth). The size of σ for depth was adjusted so that the average CCG peak height matched across directions; although this adjustment was made per subject, the resulting σ was the same for all three subjects. Observers completed 30 trials in blocks of 10.

Results.

Figure 13 summarizes the results from this experiment. Average CCGs are shown for frontoparallel motion and depth motion directions. The latency difference present previously is now negligible. The amplitude adjustment required to match the CCG peak height was a ratio of 1:6 for frontoparallel vs. depth. This is much smaller than ~1:15.4 predicted by the size of the disparity signal alone (1: distance/ipd, where viewing distance was 100 cm and ipd was 6.5 cm).

Fig. 13.

Experiment V. Imposing disparity processing on frontoparallel motion results in performance similar to motion-through-depth. Average CCGs for frontoparallel motion (blue) and motion-through-depth (black) during DRES tracking are shown. Error clouds represent 95% confidence intervals on the data. The latency difference between the frontoparallel motion and motion-through-depth CCGs is negligible or reversed (subject 3; see Table 7).

Figure 14 shows the peak, lag, and widths for frontoparallel and depth motion. The lags for all subjects are either not significantly different or reversed, the peaks are not significantly different (by design), and the widths are not significantly different for S2 (see Table 7 for effect sizes and significance).

Table 7.

Comparison of frontoparallel motion tracking and motion-through-depth tracking performance in experiment V

         Subject 1          Subject 2          Subject 3
Lag      0.29 (P = 0.04)    0.10 (P = 0.20)    1.00 (P = 0.002)
Peak     1e-3 (P = 0.61)    0.19 (P = 0.38)    0.21 (P = 0.22)
Width    0.15 (P = 0.03)    0.05 (P = 0.21)    0.84 (P < 1e-5)

Values are the effect sizes, with significance values in parentheses, for the difference of medians.

Discussion.

By creating a disparity-defined target, we imposed binocular disparity processing on frontoparallel motion tracking and removed monocular cues and IOVDs as potential sources of information. This resulted in nearly matched latencies between frontoparallel motion and motion-through-depth tracking. Although motion-through-depth amplitude had to be adjusted to better match the CCG peak height across directions, it did not have to be adjusted as much as is predicted by the geometry of visual signal size. It is possible that motion along the depth axis is privileged in disparity processing (see general discussion).

GENERAL DISCUSSION

Our primary finding was that tracking performance involves an impairment for the perception of motion-through-depth relative to frontoparallel motion. This is consistent with limitations found for vergence vs. version responses during eye tracking of visual targets (Mulligan et al. 2013; see their Fig. 4), demonstrating that this perceptual impairment is still present in the much “quicker” oculomotor plant. After accounting for differences in visual signal size, this impairment to the perception of motion-through-depth is primarily a temporal difference, a lag in the response, which is attributed to disparity processing.

Throughout the course of this paper, we have examined and directly compared frontoparallel motion and motion-through-depth tracking performance. However, the data in experiment I were collected for target/cursor motion in all directions, not for the cardinal directions in isolation. The choice to perform the CCG analysis on the cardinal directions was an arbitrary one in many respects. We can describe the tracking performance in greater detail by systematically calculating a CCG for axes of motion along the sphere of possible directions. Figure 15 shows such an analysis for S1. The CCGs are calculated at 5° intervals around the XY, XZ, and YZ planes. Then the CCGs are plotted as a heatmap in polar coordinates, where θ is the direction of motion on which the CCG was calculated, and ρ is the lag. The main 3D heatmap shows CCGs from the XY, XZ, and YZ planes simultaneously. The smaller 2D heatmaps show the analysis on each of the three planes. Note that this analysis is sign-invariant, and thus there is 180° rotational symmetry.
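To sketch how a CCG might be computed along an arbitrary motion axis, the 3D velocities can simply be projected onto a unit vector before the standard analysis (targetPos and responsePos here are hypothetical T × 3 position matrices; the 30° direction is an arbitrary example):

% Sketch: project 3D velocities onto one motion axis in the XY plane
theta = 30;                                 % direction of motion, deg
u  = [cosd(theta); sind(theta); 0];         % unit vector for the axis
vT = diff(targetPos)   * u;                 % projected target velocity
vR = diff(responsePos) * u;                 % projected response velocity
% ...then compute the CCG of vT and vR as in the general methods sketch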

Fig. 15.

Tracking performance across many directions. Here we show an analysis of tracking performance extended to all possible motion directions for subject 1. Top left: the CCGs for the cardinal motion directions are replotted from experiment I, Fig. 2. Top right: a schematic of the tracking paradigm in experiment I. A subject tracks a circular target, reporting its position by controlling a cursor with his/her pointer finger. Bottom: the main 3D heatmap (center) shows CCGs from the xy, xz, and yz planes simultaneously, where the gray scale axis represents correlation, θ is the direction of motion, and ρ is the lag (see text for additional details). We also show the full CCG heatmap for each plane: xy, xz, and yz. Note the elongation of the peak correlation ridge near the z-axis, and the presence of negative lobes on the xy plane, but not along the z-direction in the other two planes.

The peaks of the CCGs on the 2D heatmaps form visible “rings.” These rings are fairly circular for the XY (frontoparallel) plane, i.e., frontoparallel motion tracking. This is unsurprising given the relative consistency between the previously calculated vertical and horizontal CCGs. The same analyses for the YZ (sagittal) plane and the XZ (horizontal) plane are also shown. The elliptical nature of these heat maps clearly demonstrates the difference in depth vs. frontoparallel, while also revealing the progression of tracking characteristics in between.

Frequency-Domain Analysis of 3D Motion Tracking

Here we reexamine the results of experiment I in the frequency domain. The frequency domain responses were computed on a trial-by-trial basis, and the resulting amplitude and phase responses were averaged within subject and condition to yield mean gains and phase lags as a function of temporal frequency. Figure 16 summarizes this analysis. It contains a Bode plot for each of the three subjects. The top row shows the response gain as a function of frequency, and the bottom row shows the phase (absolute, unwrapped) as a function of frequency.
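One plausible per-trial version of this computation in MATLAB (our sketch, not the authors' code; averaging across trials then proceeds as described above):

% Sketch: per-trial gain and phase of the tracking response
fs = 20;                                  % velocity sample rate, Hz
VT = fft(diff(targetPos));                % target velocity spectrum
VR = fft(diff(responsePos));              % response velocity spectrum
H  = VR ./ VT;                            % empirical transfer function
f  = (0:numel(H) - 1)' * fs / numel(H);   % frequency axis, Hz
gain  = abs(H);                           % Bode magnitude (top row of Fig. 16)
phase = unwrap(angle(H));                 % Bode phase (bottom row of Fig. 16)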

Fig. 16.

Bode plots showing the responses of the three subjects (columns) in the temporal frequency domain. The top row shows response gain, and the bottom row shows response phase, both as a function of frequency. Consistent with the time domain cross-correlation analysis, all subjects show a larger response lag for motion-through-depth tracking relative to frontoparallel tracking (bottom row). Moreover, response gain is higher for frontoparallel tracking above ∼1.5 Hz, and, for two of the three subjects, response gain for motion-through-depth tracking peaks at a lower temporal frequency than it does for frontoparallel tracking. Shaded regions show bootstrapped 95% confidence intervals.

Recall that the stimulus motion was a Brownian random walk in position and Gaussian white noise in velocity. Thus the stimulus velocities were also Gaussian white noise in the frequency domain. Although all frequencies are equally represented in the stimulus, the analysis presented in Fig. 16 demonstrates that subjects primarily track the low frequencies, and that this is even more pronounced for motion-through-depth tracking compared with frontoparallel tracking. Note also the consistently larger phase lags for motion-through-depth tracking where reliable responses were obtained. This result is supported by previous psychophysical and electrophysiological work that demonstrated poorer temporal resolution for disparity modulation (Lu and Sperling 1995; Nienborg et al. 2005; Norcia and Tyler 1984) compared with contrast modulation (Hawken et al. 1996; Kelly 1971, 1976; Williams et al. 2004). Furthermore, the inability to track higher frequency modulations also provides a reasonable explanation for why the correlation values in the reported CCGs are overall quite low.

The Role of Visual Signal Size in Motion-Through-Depth Tracking Performance

Experiment II explored the role of visual signal size and SNR in tracking performance by manipulating frontoparallel motion and motion-through-depth tracking so that they had matched visual signal size. We concluded that subjects’ depth tracking performance had an increased latency and an improved spatial fidelity (for 2 of 3 subjects) compared with the frontoparallel condition (see Fig. 7). A follow-up experiment (III) suggested that the observed spatial improvement was actually related to the gain on cursor control, leaving just a latency difference between the characteristics of motion-through-depth tracking and frontoparallel motion tracking.

It is surprising that the overall spatial fidelity of motion-through-depth tracking and frontoparallel motion tracking is approximately equal (after we account for the differences in geometry). Classical demonstrations of “stereomotion suppression” (Tyler 1971) led us to expect spatial fidelity deficits for motion-through-depth tracking relative to frontoparallel tracking. However, the differences in our experimental task provide an explanation. In Tyler (1971), subjects set the amplitude of a sinusoidal motion at the threshold of their perception; the moving bar oscillated sinusoidally either in depth or horizontally about a reference. Thresholds were consistently higher for depth motion across all frequencies, i.e., two eyes were less sensitive than one at threshold. Thresholds for frequencies >0.5 Hz were consistently between 0.2 and 0.8 arcmin. Similar threshold ranges have been found for static disparities (Badcock and Schor 1985), which may be a better comparison because our motion stimulus is not a single sinusoid. In our experiment, subjects tracked a target moving in all directions with a visible cursor and were instructed to keep the cursor centered on the target in all dimensions (or one dimension, depending on the condition). We examined the distribution of relative disparity between the target and the cursor (see Fig. 17) during the motion-through-depth tracking task in experiment II. Given a conservative threshold of 50 arcsec (Badcock and Schor 1985; Tyler 1971), a high proportion of trial time was spent with the target and the cursor at suprathreshold relative disparities (83, 86, and 84% for the three subjects, respectively). This high proportion of suprathreshold disparities provides a plausible explanation for why we do not observe the deficits predicted by previous work on disparity processing.
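The suprathreshold tally itself is a one-liner; in this sketch the disparity trace is stand-in random data and the variable name is ours, but the threshold logic matches the analysis described above.

```matlab
% Sketch: proportion of time steps at suprathreshold relative disparity.
relDisp   = 600 * randn(1e4, 1);          % stand-in data (arcsec), for illustration
threshold = 50;                           % conservative disparity threshold (arcsec)
pSupra    = mean(abs(relDisp) > threshold)
% Applied to the real target-cursor disparity traces, this proportion was
% 0.83, 0.86, and 0.84 for the three subjects.
```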

Fig. 17.

Histogram of relative disparity between target and cursor at each time step (0.1667 s) across all motion-through-depth tracking trials (see experiment II) for each of the subjects. Given a conservative estimate of disparity threshold (50 arcsec), these histograms demonstrate that subjects spent a high proportion of the trials (83, 86, and 84% for each subject, respectively) with the target and the cursor at suprathreshold relative disparities.

Potential Cue-Conflicts: Accommodation, Defocus, and Looming

Several known cues for motion-through-depth were not rendered in these experiments (accommodation, defocus, and looming). The absence of these cues could, in principle, create cue-conflicts for motion-through-depth stimuli. However, based on further analysis of the results and comparisons with the perceptual thresholds for these cues, effective cue-conflicts are unlikely.

Figure 17 demonstrates that the bulk of relative disparities in our experiment were between −10 and +10 arcmin, or roughly −0.05 to +0.05 diopters (D). Accommodation thresholds are conservatively ~0.1 D (Wang and Ciuffreda 2006). Given the relative disparities between the cursor and target, and assuming that during a trial subjects were looking at the target or the cursor (or somewhere in between), we can conclude that accommodation cues were subthreshold the majority of the time. Similarly, the depth of field of the eye is typically reported as between 0.1 and 0.5 D (Walsh and Charman 1988), and thus the range of predicted defocus is well below that threshold.
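The disparity-to-defocus conversion follows from small-angle geometry: disparity (rad) ≈ ipd × (1/d1 − 1/d2), while the defocus difference in diopters is (1/d1 − 1/d2) with distances in meters, so defocus (D) ≈ disparity (rad) / ipd (m). A sketch of this calculation, with a typical (assumed, not measured) interpupillary distance:

```matlab
% Sketch: converting relative disparity to an equivalent defocus difference.
ipd        = 0.064;                       % interpupillary distance (m), assumed
dispArcmin = 10;                          % relative disparity (arcmin)
dispRad    = dispArcmin * (pi/180) / 60;  % arcmin -> radians
defocusD   = dispRad / ipd                % ~0.045 D, below the ~0.1-D threshold
```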

In the case of looming, the extent of the motion relative to viewing distance is quite small. The maximum extent in depth (either toward or away) was 5 cm from the starting point; the mean was 2.4 cm. This translates to a maximum change in target size of 3.1 arcmin and a mean of 1.5 arcmin over the course of a 20-s trial. The change per stimulus update (20 Hz) was smaller: a maximum of 0.4 arcmin and a mean of 0.2 arcmin. Looming has been studied primarily with sinusoidally oscillating stimuli. Motion-through-depth based on looming cues alone is detectable when the amplitude of the oscillations is 0.5–2 arcmin, depending on the frequency (Regan and Beverley 1979). While the change over the course of a trial is in the perceptible range, the individual stimulus updates are not. Furthermore, our stimuli moved in a random walk, producing a looming cue that lacked a consistent change in size over time, which would probably raise detection thresholds further.
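For the interested reader, the angular size change follows from small-angle geometry, Δθ ≈ wΔd/d². The target width and viewing distance in this sketch are hypothetical values (not taken from the paper); with these particular values the result is ~3 arcmin, the same order as the values reported above.

```matlab
% Sketch: angular size change from a depth excursion (hypothetical w, d).
w  = 0.015;                               % target width (m), hypothetical
d  = 0.90;                                % viewing distance (m), hypothetical
dz = 0.05;                                % maximum depth excursion (m; see text)
dThetaArcmin = (w * dz / d^2) * (180/pi) * 60   % change in angular size (arcmin)
```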

Binocular Cues for Perception of Motion-Through-Depth

With the results of experiments I–IV in mind, we considered the remaining impairment, which was primarily a difference in response latency. Even after accounting for the geometry inherent in depth motion, the perception of motion-through-depth appears to involve binocular mechanisms that exhibit different spatiotemporal signatures in the context of tracking.

Experiment V examined the role of disparity processing (a binocular mechanism) in tracking. We generated a disparity-defined target using a DRES (Julesz and Bosche 1966; Norcia and Tyler 1984). Imposing disparity processing on frontoparallel motion tracking removed the latency difference between frontoparallel motion and motion-through-depth tracking, suggesting that this latency difference is a signature of binocular disparity processing.

Psychophysical and electrophysiological work has shown poor temporal resolution for disparity modulation (Nienborg et al. 2005; Norcia and Tyler 1984) compared with contrast modulation (Kelly 1971, 1976). Psychophysical work on static disparities also shows evidence for a temporal delay (>100 ms) in binocular disparity processing (Neri 2011). Nienborg et al. (2005) provide an explanation for the poorer temporal resolution of binocular disparity processing that accounts for both the behavioral and neuronal temporal resolution deficits observed in previous work. Although the differences in temporal resolution between disparity modulation and contrast modulation appear to suggest separate mechanisms for disparity tuning and contrast tuning, they can be explained by a binocular cross-correlation (i.e., the disparity energy model; Ohzawa 1998). Models of disparity selectivity in neurons compute a cross-correlation between left- and right-eye signals that, although derived from temporally broadband monocular images, have already been temporally band-pass filtered at the monocular stage. Cross-correlating such prefiltered signals yields a binocular response that is low-pass relative to the equivalent monocular signal. Thus poorer temporal resolution is expected for responses to disparity signals; this may be related to the temporal deficits we observed for motion-through-depth tracking, and for disparity-defined tracking in particular.
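This low-pass consequence of binocular cross-correlation can be demonstrated in simulation. The sketch below (with an arbitrary, assumed pass band) multiplies two band-pass filtered broadband signals, which is the cross term of a binocular correlation; the product carries a slowly varying, low-frequency component.

```matlab
% Sketch: the product of two band-pass monocular signals is temporally low-pass.
fs = 1000;                                        % sample rate (Hz)
s  = randn(2*fs, 2);                              % broadband left- and right-eye inputs
[b, a] = butter(2, [20 40]/(fs/2), 'bandpass');   % monocular temporal band-pass, assumed band
m   = filtfilt(b, a, s);                          % filtered monocular signals
bin = m(:,1) .* m(:,2);                           % cross term of the binocular correlation
% The spectrum of 'bin' has energy near 0 Hz (the disparity-related envelope)
% and near the sum of the pass-band frequencies, so the usable disparity
% signal is low-pass relative to the monocular pass band.
```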

Neri (2011) also suggests that some of the temporal dynamics of disparity processing are due to a rigid processing order in which coarse processing precedes and constrains finer, more detailed processing, an idea supported by electrophysiological work (Menz and Freeman 2003; Norcia et al. 1985). In fact, Samonds et al. (2009) demonstrated that disparity selectivity may continue to sharpen as much as 450–850 ms after stimulus onset. Qualitatively similar results have been found in V1 for orientation (Ringach et al. 2003) and spatial frequency (Bredfeldt and Ringach 2002), although these sharpening effects appear to evolve over shorter time scales than those found for disparity processing. Further work examining the temporal dynamics of physiological responses to static disparities and to CD is needed to better understand their connection to the temporal dynamics of behavior.

This experiment and its conclusions focus primarily on a single binocular cue: changing disparity. However, there are two potential binocular sources of information for motion-through-depth: IOVDs and CD. In principle, these provide the same information, but they differ in the order of operations, resulting in either a binocular comparison of velocities (IOVD) or a temporal comparison of disparities (CD). Researchers have debated which of these cues is predominant in the visual system (Cumming and Parker 1994; Czuba et al. 2011; Rokers et al. 2009). Unfortunately, the nature of the target-tracking task is such that we cannot isolate the IOVD cue in the way that we isolated the CD cue in experiment V.

It is also worth noting that the statistics of the random walks used across all of the experiments in this work may not produce motion stimuli that are ideal for IOVD cues. The frontoparallel and depth velocity noise was white, meaning that the velocity at a given time point was uncorrelated with the velocities at surrounding time points. The IOVD signal is therefore less predictable than the CD signal, which involves comparing positions that are correlated over time; this is consistent with the notion that the IOVD signal does not have a large effect on motion-through-depth tracking performance in this paradigm. Recent work suggests that the visual system might use different sources of binocular information depending on the relative fidelity of the cues in a given situation or the demands of a particular task (Allen et al. 2015).

Privilege for Processing Motion-Through-Depth in Disparity-Limited Stimuli?

The stimulus used in experiment V adjusted the depth motion amplitude so that the CCG peak height was matched between the horizontal and depth motion directions. The same amplitude value was used across all three subjects. However, this value was not as large as the ratio derived for the relationship between the magnitude of frontoparallel motion and the retinal projection of depth motion (1 : ipd/d; see the introduction to experiment II). From this we conclude that there may be a privilege for processing the binocular disparities associated with motion-through-depth.
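One way to see this geometric ratio, under standard small-angle assumptions (our reconstruction, treating the change in binocular disparity as the retinal signal for depth motion): a frontoparallel displacement Δx of a target at distance d produces an angular displacement δ_xy, while an equal depth displacement Δz produces a disparity change δ_z,

\[
\delta_{xy} \approx \frac{\Delta x}{d}, \qquad
\delta_{z} \approx \frac{\mathrm{ipd}\,\Delta z}{d^{2}}, \qquad
\frac{\delta_{z}}{\delta_{xy}} \approx \frac{\mathrm{ipd}}{d} \quad (\Delta x = \Delta z),
\]

so, for equal environmental displacements, the retinal signal for depth motion is smaller by a factor of ipd/d.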

However, there is little existing evidence to support this observation. Apparent motion studies using Julesz's dynamic random dot stereogram (Julesz and Bosche 1966) found comparable temporal frequency limits of ~5 Hz for left-right and toward-away motion detection (Julesz and Payne 1968; Norcia and Tyler 1984). This is clearly not a privilege for depth motion, but, unlike experiments with monocular cues present, these studies also found no deficit for motion-through-depth.

Regan and Beverley (1973a) examined detection of “sideways” vs. depth motion in random dot stereograms (not dynamic, so monocular cues remained). At the very slowest frequencies the detection thresholds were comparable in one subject, but, for the most part, monocularly viewed motion detection thresholds were better. In addition, early work with oscillating bars demonstrated better motion detection for monocularly viewed than for stereoscopically viewed bars (Regan and Beverley 1973b; Tyler 1971), with the potential exception of the ±5 arcmin around fixation (Regan and Beverley 1973b). These findings do not support the idea of a privilege for processing the binocular disparities associated with motion-through-depth. However, the stimuli used in these previous experiments were not purely disparity defined, and it is possible that the monocular cues present in those stimuli obscured such a privilege.

More work is needed to determine whether there is indeed a privilege for motion-through-depth in disparity-defined stimuli, and in particular to establish how it changes from threshold to suprathreshold motion and across different types of motion stimuli (e.g., from oscillating motion to random walks).

Conclusions

Despite the crucial importance of egocentric depth motion, we found a significant impairment for depth motion perception compared with frontoparallel motion perception. Closer examination, however, revealed that these deficits are largely accounted for by the geometry of the stimulus and the limitations of the binocular mechanisms used to perceive the motion.

DISCLOSURES

No conflicts of interest, financial or otherwise, are declared by the authors.

AUTHOR CONTRIBUTIONS

K.L.B., A.C.H., and L.K.C. conceived and designed research; K.L.B. and L.K.C. performed experiments; K.L.B. and L.K.C. analyzed data; K.L.B., A.C.H., and L.K.C. interpreted results of experiments; K.L.B. and L.K.C. prepared figures; K.L.B. and L.K.C. drafted manuscript; K.L.B., A.C.H., and L.K.C. edited and revised manuscript; K.L.B., A.C.H., and L.K.C. approved final version of manuscript.

APPENDIX A: LEAP MOTION CONTROLLER

A Leap Motion controller was used to collect measurements of the 3D position (x, y, z, in mm) of our observers' fingers. The Leap Motion controller is an 8 cm × 1 cm × 3 cm USB device that uses two infrared cameras and an infrared light source to track hands, fingers, and “finger-like” tools, reporting their positions and orientations. A line drawing of the device is shown in Fig. A1. Leap Motion reports that the device has a field of view of 150°, with an effective range of 2.5–60 cm above the device (1 in. to 2 ft.). To acquire coordinates in Matlab, we used an open-source Matlab interface for the Leap Motion controller written by Jeff Perry (https://github.com/jeffsp/matleap).

We conducted two experiments to establish the precision of the Leap Motion controller in the context of our task. The first experiment evaluated the spatial precision of the Leap Motion controller. The second experiment measured both the temporal accuracy (lag) and precision.

Spatial Precision of Leap Motion Controller

Methods.

The apparatus was the same as in the original experiment (see general methods). Two subjects (S1 and S2 from above) were asked to extend their index finger and hold it stationary above the Leap Motion controller for 5 s. The same process was repeated with a wooden dowel, which is recognized by the device as a “tool” and was fixed at a typical finger height above the Leap.

Results.

Figure A2 shows x-y-z position over time (blue, red, and black, respectively) for the index fingers of the two subjects and for the fixed wooden dowel. The mean x-y-z drift in millimeters for S1, S2, and the wooden dowel was (0.443, 0.445, 0.439), (−0.128, 0.084, −0.023), and (0.005, −0.006, 0.010), respectively, while the standard errors (also in mm) were (0.015, 0.016, 0.013), (0.008, 0.015, 0.004), and (0.0002, 0.0008, 0.0010), respectively. As expected, the dowel was considerably more stable than the human subjects, demonstrating that any noise or drift intrinsic to the Leap Motion controller is well below the level of motor noise exhibited by human subjects.

Temporal Lag and Precision of Leap Motion Controller

Apparatus.

A schematic of the setup is shown in Fig. A3, right. The basic apparatus was the same as in the original experiment (see general methods). Two photocells (VDT Sensors, Hawthorne, CA) were used. The first photocell was placed against the lower of the two Planar monitors. The second photocell was placed opposite a beam of light generated by a green laser pointer, with the Leap controller placed underneath the beam. Subjects wore an occluder (a small flat piece of plastic attached to a ring) on their index finger to block the beam of light. Both photocells were connected to an oscilloscope (Agilent DSO-X 2014A; Agilent Technologies, Santa Clara, CA).

Procedure.

Just before each trial, subjects arranged their hand so that the occluder was about to block the beam of light. Once a trial began, subjects moved their hand forward, blocking the light beam; any position more than a threshold of 2 mm forward of the initial position triggered the screen to flip from black to white. The oscilloscope was then used to measure the difference between the onset of motion in physical space (when the beam of light was blocked) and the onset of motion on the screen (when the screen flipped from black to white). Oscilloscope traces from a sample trial are shown in Fig. A3, left, with the beam photocell in blue and the screen photocell in red. The measurement taken on each trial was the time difference (s) between the step down of the screen photocell and the step up of the beam photocell. Two subjects (S1 and S2 from before) performed 10 trials for each device. When S1 was using the Leap Motion controller as a subject, S2 was the experimenter taking measurements from the oscilloscope, and vice versa. For comparison, we used exactly the same procedure to evaluate a Bluetooth trackpad (Apple Magic Wireless Trackpad) and a more standard USB mouse (Logitech).

Results.

Figure A4 shows the results for all three devices. Results were consistent across both subjects. The USB mouse was the fastest from motion to screen change at 31 and 34 ms for S1 and S2, followed by the Leap Motion controller at 56 and 66 ms, and finally the Bluetooth trackpad at 102 and 92 ms, respectively. Although the Leap Motion was not the fastest input device, it was clearly within the latency range of common input devices.

Leap Motion Controller Refresh Rate

Leap Motion reports that the device has a refresh rate of 115 Hz. Because each sample collected from the Leap carries a unique ID, this can be tested directly. We wrote a Matlab script that sampled from the Leap controller and counted the unique frames. We ran this script 10 times for 5 s each; the Leap Motion controller's update rate was 114 Hz in every test.
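A sketch of such a frame-counting script is shown below. The matleap_frame call and its id field reflect our understanding of the matleap interface; treat the exact entry point as an assumption.

```matlab
% Sketch: estimate the Leap's update rate by counting unique frame IDs.
nUnique = 0; lastId = -1;
t0 = tic;
while toc(t0) < 5                   % sample for 5 s
    f = matleap_frame;              % poll the Leap Motion controller (assumed call)
    if f.id ~= lastId               % each new frame carries a unique ID
        nUnique = nUnique + 1;
        lastId  = f.id;
    end
end
rateHz = nUnique / 5                % ~114 Hz in our tests
```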

Fig. A1.

Leap Motion controller. At the bottom is a line drawing of a Leap Motion controller. The controller is 8 cm × 1 cm × 3 cm. Its effective range (colored in gray) is a conical frustum above the controller.

Fig. A2.

Measurement of drift of stationary fingers and a fixed wooden dowel. Each panel shows the x (blue), y (red), and z (black) drift in position over time during a 5-s period in which either the subject (S1, S2) was instructed to remain stationary or the dowel was fixed above the Leap Motion controller. Clearly, the intrinsic spatial noise of the Leap Motion controller is much smaller than the unsteadiness of the observers' hands.

Fig. A3.

Schematic of photocell arrangement and oscilloscope readings. Right: the first photocell was placed on the lower Planar monitor. The second was placed to one side of the Leap Motion controller, with a beam from a laser pointer aimed directly at its collector. A forward hand movement broke the laser beam and, via the Leap, also triggered the software to flip the screen from black to white. Left: sample oscilloscope output from a single trial. The oscilloscope reports the voltage over time from the screen photocell (red) and the beam photocell (blue). When the subject moves his/her finger forward, the occluder worn on the finger blocks the laser beam, causing the blue trace to step up; the movement simultaneously triggers the code to change the screen from black to white, causing the red trace to step down. The time difference between these two steps is the measurement of interest.

Fig. A4.

Lag and precision of Leap Motion controller, Bluetooth trackpad, and USB mouse. We measured the lag and temporal precision of 3 devices (Leap, trackpad, and mouse) for two subjects (S1 and S2). The plot shows the temporal lag (milliseconds on the left axis, frames on the right) for each trial (black dots); there are 10 trials per condition per subject, but some data points overlap. The mean is denoted by the thick black horizontal line, and the standard deviation by the gray box.

REFERENCES

  1. Allen B, Haun AM, Hanley T, Green CS, Rokers B. Optimal combination of the binocular cues to 3D motion. Invest Ophthalmol Vis Sci 56: 7589–7596, 2015. doi: 10.1167/iovs.15-17696.
  2. Badcock DR, Schor CM. Depth-increment detection function for individual spatial channels. J Opt Soc Am A 2: 1211–1216, 1985. doi: 10.1364/JOSAA.2.001211.
  3. Baddeley RJ, Ingram HA, Miall RC. System identification applied to a visuomotor task: near-optimal human performance in a noisy changing task. J Neurosci 23: 3066–3075, 2003.
  4. Bonnen K, Burge J, Yates J, Pillow J, Cormack LK. Continuous psychophysics: target-tracking to measure visual sensitivity. J Vis 15: 14, 2015. doi: 10.1167/15.3.14.
  5. Braddick O. A short-range process in apparent motion. Vision Res 14: 519–527, 1974. doi: 10.1016/0042-6989(74)90041-8.
  6. Brainard DH. The psychophysics toolbox. Spat Vis 10: 433–436, 1997. doi: 10.1163/156856897X00357.
  7. Bredfeldt CE, Ringach DL. Dynamics of spatial frequency tuning in macaque V1. J Neurosci 22: 1976–1984, 2002.
  8. Brooks KR, Stone LS. Stereomotion suppression and the perception of speed: accuracy and precision as a function of 3D trajectory. J Vis 6: 1214–1223, 2006. doi: 10.1167/6.11.6.
  9. Cooper EA, van Ginkel M, Rokers B. Sensitivity and bias in the discrimination of two-dimensional and three-dimensional motion direction. J Vis 16: 5–11, 2016. doi: 10.1167/16.10.5.
  10. Cumming BG, Parker AJ. Binocular mechanisms for detecting motion-in-depth. Vision Res 34: 483–495, 1994. doi: 10.1016/0042-6989(94)90162-7.
  11. Czuba TB, Rokers B, Guillet K, Huk AC, Cormack LK. Three-dimensional motion aftereffects reveal distinct direction-selective mechanisms for binocular processing of motion through depth. J Vis 11: 18, 2011. doi: 10.1167/11.10.18.
  12. Geisler WS, Albrecht DG. Bayesian analysis of identification performance in monkey visual cortex: nonlinear mechanisms and stimulus certainty. Vision Res 35: 2723–2730, 1995. doi: 10.1016/0042-6989(95)00029-Y.
  13. Hawken MJ, Shapley RM, Grosof DH. Temporal-frequency selectivity in monkey visual cortex. Vis Neurosci 13: 477–492, 1996. doi: 10.1017/S0952523800008154.
  14. Julesz B. Foundations of Cyclopean Vision. Chicago, IL: University of Chicago, 1971.
  15. Julesz B, Bosche C. Studies on Visual Texture and Binocular Depth Perception. A Computer Generated Movie Series Containing Monocular and Binocular Movies. Murray Hill, NJ: Bell Telephone Laboratories, 1966.
  16. Julesz B, Payne RA. Differences between monocular and binocular stroboscopic movement perception. Vision Res 8: 433–444, 1968. doi: 10.1016/0042-6989(68)90111-9.
  17. Katz LN, Hennig JA, Cormack LK, Huk AC. A distinct mechanism of temporal integration for motion through depth. J Neurosci 35: 10212–10216, 2015. doi: 10.1523/JNEUROSCI.0032-15.2015.
  18. Kelly DH. Theory of flicker and transient responses. II. Counterphase gratings. J Opt Soc Am 61: 632–640, 1971. doi: 10.1364/JOSA.61.000632.
  19. Kelly DH. Pattern detection and the two-dimensional fourier transform: flickering checkerboards and chromatic mechanisms. Vision Res 16: 277–287, 1976. doi: 10.1016/0042-6989(76)90111-5.
  20. Kleiner M, Brainard D, Pelli D, Ingling A. What’s new in Psychtoolbox-3. Perception 36: 1–16, 2007.
  21. Lu ZL, Sperling G. The functional architecture of human visual motion perception. Vision Res 35: 2697–2722, 1995. doi: 10.1016/0042-6989(95)00025-U.
  22. McKee SP, Levi DM, Bowne SF. The imprecision of stereopsis. Vision Res 30: 1763–1779, 1990. doi: 10.1016/0042-6989(90)90158-H.
  23. Menz MD, Freeman RD. Stereoscopic depth processing in the visual cortex: a coarse-to-fine mechanism. Nat Neurosci 6: 59–65, 2003. doi: 10.1038/nn986.
  24. Mulligan JB, Stevenson SB, Cormack LK. Reflexive and voluntary control of smooth eye movements. Proc SPIE Int Soc Opt Eng 8651: 86510Z, 2013. doi: 10.1117/12.2010333.
  25. Neri P. Coarse to fine dynamics of monocular and binocular processing in human pattern vision. Proc Natl Acad Sci USA 108: 10726–10731, 2011. doi: 10.1073/pnas.1101246108.
  26. Newsome WT, Paré EB. A selective impairment of motion perception following lesions of the middle temporal visual area (MT). J Neurosci 8: 2201–2211, 1988.
  27. Nienborg H, Bridge H, Parker AJ, Cumming BG. Neuronal computation of disparity in V1 limits temporal resolution for detecting disparity modulation. J Neurosci 25: 10207–10219, 2005. doi: 10.1523/JNEUROSCI.2342-05.2005.
  28. Norcia AM, Sutter EE, Tyler CW. Electrophysiological evidence for the existence of coarse and fine disparity mechanisms in human. Vision Res 25: 1603–1611, 1985.
  29. Norcia AM, Tyler CW. Temporal frequency limits for stereoscopic apparent motion processes. Vision Res 24: 395–401, 1984. doi: 10.1016/0042-6989(84)90037-3.
  30. Ohzawa I. Mechanisms of stereoscopic vision: the disparity energy model. Curr Opin Neurobiol 8: 509–515, 1998. doi: 10.1016/S0959-4388(98)80039-1.
  31. Pelli DG. The VideoToolbox software for visual psychophysics: transforming numbers into movies. Spat Vis 10: 437–442, 1997. doi: 10.1163/156856897X00366.
  32. Regan D, Beverley KI. The dissociation of sideways movements from movements in depth: psychophysics. Vision Res 13: 2403–2415, 1973a. doi: 10.1016/0042-6989(73)90238-1.
  33. Regan D, Beverley KI. Some dynamic features of depth perception. Vision Res 13: 2369–2379, 1973b. doi: 10.1016/0042-6989(73)90236-8.
  34. Regan D, Beverley KI. Binocular and monocular stimuli for motion in depth: changing-disparity and changing-size feed the same motion-in-depth stage. Vision Res 19: 1331–1342, 1979. doi: 10.1016/0042-6989(79)90205-0.
  35. Ringach DL, Hawken MJ, Shapley R. Dynamics of orientation tuning in macaque V1: the role of global and tuned suppression. J Neurophysiol 90: 342–352, 2003. doi: 10.1152/jn.01018.2002.
  36. Rokers B, Cormack LK, Huk AC. Disparity- and velocity-based signals for three-dimensional motion perception in human MT+. Nat Neurosci 12: 1050–1055, 2009. doi: 10.1038/nn.2343.
  37. Samonds JM, Potetz BR, Lee TS. Cooperative and competitive interactions facilitate stereo computations in macaque primary visual cortex. J Neurosci 29: 15780–15795, 2009. doi: 10.1523/JNEUROSCI.2305-09.2009.
  38. Shadlen MN, Newsome WT. Neural basis of a perceptual decision in the parietal cortex (area LIP) of the rhesus monkey. J Neurophysiol 86: 1916–1936, 2001.
  39. Tyler CW. Stereoscopic depth movement: two eyes less sensitive than one. Science 174: 958–961, 1971. doi: 10.1126/science.174.4012.958.
  40. Walsh G, Charman WN. Visual sensitivity to temporal change in focus and its relevance to the accommodation response. Vision Res 28: 1207–1221, 1988. doi: 10.1016/0042-6989(88)90037-5.
  41. Wang B, Ciuffreda KJ. Depth-of-focus of the human eye: theory and clinical implications. Surv Ophthalmol 51: 75–85, 2006. doi: 10.1016/j.survophthal.2005.11.003.
  42. Wheatstone C. Contributions to the physiology of vision. 1. On some remarkable, and hitherto unobserved, phenomena of binocular vision. Philos Trans R Soc Lond 128: 371–394, 1838. doi: 10.1098/rstl.1838.0019.
  43. Williams PE, Mechler F, Gordon J, Shapley R, Hawken MJ. Entrainment to video displays in primary visual cortex of macaque and humans. J Neurosci 24: 8278–8288, 2004. doi: 10.1523/JNEUROSCI.2716-04.2004.
