Abstract
Induced motion is the illusory motion of a target away from the direction of motion of the unattended background. If, as suggested by the flow parsing hypothesis, it results from attributing background motion to self-motion and judging target motion relative to the scene, then the effect must be mediated at higher levels of the visual motion pathway where self-motion is assessed. We provide evidence for a high-level mechanism in two broad ways. Firstly, we show that the effect is insensitive to a set of low-level spatial aspects of the scene, namely, the spatial arrangement, the spatial frequency content, and the orientation content of the background relative to the target. Secondly, we show that the effect is the same whether the target and background are composed of the same kind of local elements—one-dimensional (1D) or two-dimensional (2D)—or one is composed of 1D elements and the other of 2D elements. The latter finding is significant because 1D and 2D local elements are integrated by two different mechanisms, so the induced motion effect is likely to be mediated in a visual motion processing area that follows the two separate integration mechanisms. Area medial superior temporal (MST) in monkeys, and its equivalent in humans, is suggested as a viable site. We present a simple flow-parsing-inspired model and demonstrate a good fit to our data and to data from a previous induced motion study.
Keywords: higher-order motion, local motion, models, motion, neural mechanisms, optic flow, perception, perceptual organization, scene perception
Introduction
Induced motion or the Duncker effect is a compelling visual illusion that has been studied by psychologists for over a century (Carr & Hardy, 1920; Duncker, 1929; Thelin, 1927). Originally the effect referred to the illusory motion of a stationary object when a surrounding or nearby object moved—the sense that the moon is moving when seen near wind-blown clouds being an often-cited example (see Reinhardt-Rutland, 1988 for a review). Now, more generally, it refers to the “repulsion” of target motion away from the direction of the unattended object/s in the scene; a motion vector opposite to that of the unattended object/s is added to the target whether the target moves or not (Bassili & Farber, 1977; Farrell-Whelan et al., 2012; Gogel, 1979; Wallach & Becklen, 1983; Zivotofsky et al., 1995; Zivotofsky, 2004). For example, when a target pattern moves upward against a background pattern that moves to the right, the target will appear to move upward, as expected, but also to the left, away from the background direction of motion.
During everyday experiences such as catching balls on the run or avoiding obstacles while driving, we accurately calculate target motion in the presence of background motion (arising from self-motion), so why the errors seen in the induced motion illusion? An explanation may lie in a relatively recent proposal to view this two-dimensional (2D) visual phenomenon within a three-dimensional (3D) context. Warren and Rushton (2007) noted that in a real-world, 3D scenario where background motion is due to self-motion, the visual motion caused by self-motion needs to be “parsed out” of the visual scene in order to extract the absolute (world-centered) motion of the target. Parsing out equates to subtracting the background motion in the vicinity of the target (see Figure 1). They suggest that this subtraction may be what causes the repulsion with 2D translating stimuli such as those seen in induced motion experiments (Rushton & Warren, 2005; Warren & Rushton, 2004, 2007, 2008, 2009). Subtracting background motion from target motion effectively adds to the target a motion component in the opposite direction to the background leading to a perceived repulsion away from the background.
Figure 1.
Illustration of the flow parsing hypothesis (Warren & Rushton, 2007). Based on an image in Warren and Rushton (2009). Obtaining the world-centered velocities of the background and object (right panel) from the retinal velocities (left panel) involves subtracting the velocities created by self-motion through the environment, that is, adding the negative of the velocity field created by self-motion (central panel). Note that the oblique retinal motion of the ball is perceived (correctly) as vertical after parsing out self-motion.
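To make the subtraction concrete, consider a minimal worked example (the numbers are ours, chosen purely for illustration). Taking x rightward and y upward, suppose the target's retinal velocity is 1°/s straight up and the background's retinal velocity is 1°/s to the right. Parsing out the background gives

v_perceived = v_target − v_background = (0, 1) − (1, 0) = (−1, 1) °/s,

a velocity pointing up and to the left, as in the example above. Full subtraction of this kind would predict a 45° deviation; measured induced motion effects are typically much smaller, consistent with only a fraction of the background velocity being subtracted (a gain of less than one, as in the illustrative sketch given in the General Discussion below).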
Their proposal relies on the unattended motion being treated as a case of optic flow by the visual system. Optic flow is the type of motion that arises in our visual fields as we move through real environments. As optic flow has to be abstracted from low-level motion signals that can be scattered across the entire visual field and because its cause, ego-motion, is associated with other cues such as those from the vestibular system, it is thought to be represented in relatively late stages of the visual processing pathway (Cullen, 2011; Lappe et al., 1999; Perrone, 1992). Mounting evidence suggests it is represented, at the earliest, in area medial superior temporal (MST) in the monkey (Duffy & Wurtz, 1991; Gu et al., 2007; McCool & Britten, 2008; Wurtz, 1998) and, equivalently, area hMST in humans (Smith et al., 2006, 2012; Wall & Smith, 2008). This puts the locus of the background/target interaction seen in the induced motion effect at a late stage in the visual motion processing stream.
Our aim was to conduct psychophysical tests of the plausibility of a flow-parsing explanation for the induced motion effect. We looked, specifically, at two requirements that must be met if the proposal is to hold. Firstly, the repulsive interaction between target and background needs to occur at a relatively late stage of visual processing, say area MST where optic flow is processed. Secondly, there is a need to reconcile the proposal with previous research; the proposal demands that the unattended motion in the scene be treated as motion belonging to a stationary background against which an observer is moving, but previous psychophysical experiments show induced motion even when the unattended motion belongs to seemingly non-background-like elements such as single dots (e.g., Carr & Hardy, 1920; Duncker, 1929) and other shaped objects (e.g., Farrell-Whelan et al., 2012; Levi & Schor, 1984; Post & Heckmann, 1986; Wallach & Becklen, 1983). If single moving objects work just as well as dispersed motion fields for induced motion, then the concept of what constitutes an optic-flow background under the flow-parsing hypothesis needs to be rather broad.
In order to assess whether these two requirements are met by the visual system, we sought a stimulus that had the following two characteristics (corresponding to the two requirements, respectively): (1) the entities upon which the induced motion effect operates—that is, the target and background motions—should only be available at a relatively late stage in the motion processing pathway. If induced motion occurs as usual in this case then it is very likely to be mediated at a late stage in the motion processing pathway—after the target and background motions have been calculated. A scene that is composed of multiple small dispersed motion signals where the signals need to be separated into distinct target and background pools and then each pool integrated to calculate the individual motions of the target and background has the potential to meet this characteristic. Further, if it can be shown that the target and background can be integrated by functionally separate mechanisms without changing the induced motion effect then we have even stronger evidence that the induced motion mechanism follows the separate integration mechanisms. Further still, the stimulus should allow for the adjustment of low-level aspects of the local elements, such as orientation and spatial frequency, to see if adjusting these low-level properties affects induced motion. A lack of sensitivity to these adjustments would support a higher level mechanism. (2) The shape and position of the background relative to the target should be adjustable without the motions of each causing them to collide or significantly spatially vary relative to one another or vary relative to the fixation point during a trial. This is to allow the testing of what spatial arrangements of the unattended part of the stimulus constitute backgrounds for induced motion.
To this end, a novel induced motion stimulus was designed. It consisted of two sets of dispersed patterned elements; one set for the target and one for the background. Each element remained in place during each trial but the pattern it contained drifted in a way that was consistent with the motions of the other patterns belonging to the same “object”—target or background—as if the stationary elements were windows through which small parts of a larger moving object could be seen; one set of windows for the target and another for the background. Using stationary local elements meant that the background and target shapes, positions and motions could be independently varied without them ever colliding, running off the screen or coming nearer to or further from fixation during a trial.
So that we could specifically control the spatial frequency and orientation content of the target and background and so that no single local element could give away the direction of the target or background to which it belonged, most conditions used randomly oriented local Gabor patches with drifting carriers; sinusoidal gratings within circular Gaussian weighting functions (windows) where the phase of the grating drifted with time. Each element’s drift speed was consistent with the rigid motion of the target or background to which it belonged. See Figure 2A and Video 2A in online Supplemental Material for an example stimulus.
Figure 2.
Example stimuli used in our experiments. (A) A representation of the “1D” stimulus used in our first experiment; target ring against a field background. Shown is the central portion of the stimulus display plus dashed lines to indicate the shape of and direction of motion for the target (red) and the background (blue). All Gabor envelopes were stationary but each sinusoidal carrier drifted at a speed consistent with the overall motion of the target or background to which it belongs. Although the actual target object velocity is upwards it appears to move up and to the left. Please see video 2A in online Supplemental Material. (B) An example “2D” stimulus used in the fourth experiment; ring target with ring “background” where the background is inside the target. Each small patch stayed in place but the plaid patterns within them drifted so as to evoke a separate sense of motion in the target and background. In the video associated with the image, the outer target ring velocity is upwards, but it appears to move up and to the left as the inner background ring has a rightward component to its velocity. Please see video 2B in online Supplemental Material.
Each element in this stimulus provides only a local “1D” picture of the global 2D motion of the object to which it belongs because motion information in the direction parallel to the stripes is undetectable, leaving a single explicit motion component—the one orthogonal to the grating stripes. One needs to integrate over at least two differently oriented 1D elements to get a picture of the 2D global motion of the object to which the elements belong. Computationally, the global motion solution lies at the “intersection of [the] constraints” (IOC) introduced by the motion of each local 1D drifting Gabor element (Adelson & Movshon, 1982). Importantly, it has been shown previously that the integration method employed by the visual system when combining local 1D motion signals corresponds to the IOC solution, whereas an averaging integration method is used if the local 1D signals are replaced with local 2D signals (Amano et al., 2009a; Bowns & Alais, 2006). These constitute two functionally separate integration methods which, following the logic outlined above, can provide strong evidence for a relatively late-stage induced motion mechanism.
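To illustrate how a global 2D velocity can be recovered from such 1D signals, the following minimal sketch (not the actual experimental code; the element count and velocity values are arbitrary) treats each element as one linear constraint on the global velocity, namely that the component of the global velocity along the element's stripe normal equals the element's signed drift speed, and solves the set of constraints by least squares:

```matlab
% Minimal IOC sketch: recover a global 2D velocity from the 1D drift speeds
% of randomly oriented Gabor elements (values are illustrative only).
trueV   = [2; 3];                                % assumed global velocity (deg/s), [x; y]
oris    = rand(28, 1) * pi;                      % random carrier (stripe) orientations, radians
normals = [cos(oris + pi/2), sin(oris + pi/2)];  % unit vectors orthogonal to each element's stripes
drifts  = normals * trueV;                       % each element's signed 1D drift speed (dot products)

% Each row of normals, with its entry in drifts, defines one constraint line in
% velocity space; with two or more distinct orientations the least-squares
% solution is their intersection and, for noiseless input, recovers trueV exactly.
vIOC = normals \ drifts;
```

A single orientation leaves the system under-determined, which is the aperture problem that the integration stage must resolve.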
To achieve local 2D motion signals using stationary patches, a textured pattern needs to drift within each circular window. We used plaid patches, which consist of two overlaid orthogonal gratings within circular Gaussian windows, to produce local 2D motion signals (see Figure 2B and Video 2B in online Supplemental Material).
We define the “target” in our stimulus as the set of local elements to which the observer was instructed to attend in an effort to judge its direction. In all cases this was a set of elements lying on a ring centered on the (instructed) fixation area where the patterns within the local elements moved in a way consistent with a certain global speed and direction which we define as the “motion” of the target. The spatial arrangement of the unattended “background” local elements varied between conditions from field (scattered over the stimulus area including inside the target) to ring-shaped (cf. target) and the background’s “motion” was defined in the same way as it was for the target. We take the directions of the targets and backgrounds defined in this way and compare them with directions of targets and backgrounds in traditional induced motion experiments.
It is important to note that our novel 1D induced motion stimulus requires the visual system to segregate target and background local elements while at the same time calculating the global motions of the target and background. If an induced motion effect, similar to what has been seen in previous studies, can be elicited with this stimulus, even before any other tests are conducted, it would suggest a higher level for the induced motion effect. This is because the target and background motions upon which the induced motion effect rests can only be calculated after local elements have been assigned correctly to the target and background and each set of elements has been integrated to obtain the global target and background motions. Integrated global motion solutions, for both 1D and 2D elements, have been shown to be represented in area MT (Britten et al., 1993; Movshon et al., 1985; Newsome et al., 1989; Smith et al., 2005)—already a mid-level area in the motion processing pathway—so the occurrence of induced motion with our stimulus—especially the 1D stimulus where the global 2D motion can only be obtained by integration—would suggest that induced motion is mediated at this level at the earliest.
Our first experiment, then, was to test for an induced motion effect using our 1D stimulus. To be sure that any repulsion between the target and background was a result of induced motion and not some other related phenomenon, we mapped out the repulsion effect as a function of background direction. Doing so allowed us to distinguish between two plausible causes for the repulsion: (1) induced motion and (2) the so-called “motion direction illusion” (e.g., Wiese & Wenderoth, 2007) or “direction repulsion effect” (e.g., Benton & Curran, 2003) wherein two interspersed populations of dots (or similar) move in two different directions and the perceived direction of each transparent sheet is repulsed away from the other. Each of these possible causes predicts a different relationship between background direction and the repulsion magnitude. Specifically, the induced motion effect peaks when the background direction is orthogonal to the perceived target direction (Farrell-Whelan et al., 2012), whereas the peak tends to be at significantly smaller separation angles for the direction repulsion effect (see Experiment 1 Discussion section below for an explanation). Accordingly, we compared our results to those from both an induced motion experiment using a more traditional stimulus than ours (Farrell-Whelan et al., 2012) and a classic direction repulsion experiment (Marshak & Sekuler, 1979).
Although the simplest explanation for an induced motion effect in Experiment 1 would be that the effect occurs after the integration step, it is possible that there is a much more complicated mechanism that involves interactions before or during the integration step. The experiments described below were designed to test for this possibility. By eliminating low-level explanations we aimed to show that the first requirement for the flow-parsing hypothesis is satisfied, that is, that induced motion is mediated in a higher level area of the motion pathway.
In Experiments 2 and 3, we tested whether the repulsion effect is changed by significant low-level differences between the target and background. By “low-level” we mean differences that would influence the repulsion effect if it were mediated by lower-level areas in the visual motion system such as area V1 or area MT. Specifically, we tested whether the target/background interaction is changed by a 1-octave difference in spatial frequency content between the target and background (Experiment 2) and whether the interaction is changed when target and background local directions are separated in direction space by 45° (Experiment 3).
We expected that, if the influence of the background on the target were mediated by a low-level mechanism, the strength of interaction between the target and background would be altered by these low-level differences between them. For example, if the effect were mediated by V1 neurons there should be a decrease in the interaction between nearby target and background elements with spatial frequency difference as individual V1 neurons are tuned to this feature and interactions between V1 neurons tend to fall off with separation in preferred spatial-frequency (e.g., Aghajari et al., 2020; Blakemore & Campbell, 1969; Polat & Sagi, 1993) just as they do with separation in orientation preference (e.g., Apthorp et al., 2017; Blakemore & Campbell, 1969; Cannon & Fullenkamp, 1991; Petrov & McKee, 2009). In our tests, the target was composed of elements lying in one spatial frequency (Experiment 2) or orientation (Experiment 3) band and the background was composed of elements in another significantly separated band.
The final psychophysical test for the possibility of a low-level induced motion mechanism involved different combinations of 1D and 2D local elements. If the repulsion effect is implemented after the local motion signals in a scene have been segregated into target and background pools and each pool of signals has been integrated to form an estimate of target and background motion then one would expect the characteristics of the repulsion to be the same whether the scene consisted of 2D motion signals or 1D motion signals. They are each processed by different systems in the visual pathway (Amano et al., 2009a; Bowns & Alais, 2006) but if it is the final result of each system that is fed into the repulsion mechanism then the results should be the same. But note that, even if we confirm their similarity, there still remains the possibility that the 1D motion processing system and the 2D motion processing system each possess similar target/background repulsion mechanisms and that the repulsion is implemented before those signals are fully processed, that is, there is cross-talk between target and background motion signals within the 1D (and similarly 2D) motion processing system that causes the repulsion. So, in Experiment 4, we compared the results of same (1D/1D or 2D/2D) target and background type with different (1D/2D or 2D/1D) target and background type. If the results are the same in both cases we have strong evidence against the effect being mediated by cross-talk within integration systems.
The four experiments outlined above address the requirement that induced motion be mediated in a relatively late stage in the motion processing pathway. We addressed the second requirement of the optic-flow explanation—that the unattended part of the scene be treated as a stationary background against which an observer is moving—by running three versions of Experiment 1—each using a different background configuration—and comparing the results. We begin by using a stimulus that is most conducive to an object-moving-against-a-stationary-background interpretation, that is, a ring-like target centered on fixation moving against a dispersed background motion field. We compare the resulting induced motion effect with that from a stimulus where the unattended motion belongs to an object the same shape as the target but surrounding the target. We go further to test the induced motion magnitude when the unattended ring is nested inside the target ring. If our results are the same in the three cases then what is considered background in the induced motion effect may simply be a matter of attention, having nothing to do with the shape or spatial arrangement of the unattended motion elements relative to the target: if the motion elements are unattended, they are considered background. Whether such a notion of background is compatible with the flow-parsing hypothesis is discussed in the General Discussion.
In the General Discussion, we also apply a simple model based on the flow-parsing concept to both our data and the data of Farrell-Whelan et al. (2012). A close fit of the model would be expected if the visual system does, indeed, implement a flow-parsing mechanism.
To summarize the approach to follow: if the flow-parsing interpretation of the classic induced motion illusion is to hold, we need psychophysical evidence that the illusion is mediated at a level in the visual motion pathway capable of optic-flow analysis. We also need to be able to interpret induced motion scene elements within a flow-parsing framework, including interpreting the unattended parts of induced motion scenes as stationary backgrounds against which an observer is moving. We explore the background question and provide evidence for a high-level—possibly optic-flow stage—mechanism in the four experiments below.
Experiment 1. Does Our 1D Stimulus (With Various Background Configurations) Produce Induced Motion?
The aim of Experiment 1 was to test for induced motion using our novel 1D stimulus. Specifically, we assessed perceived target repulsion as a function of background direction in order to compare our results with those from a previous induced motion experiment and, for contrast, a direction repulsion effect experiment. Similarity to the previous induced motion experiment would indicate that induced motion occurs at least at the motion processing stage that calculates global motion from local 1D motion signals, as the global target and background motion signals are not available prior to that stage.
The experiment consisted of three conditions: background field, background ring outside, and background ring inside the target ring. Our aim in using different background types was to investigate the strength of the induced motion effect as a function of the extent and shape of the background; specifically, we tested whether object-like backgrounds could be just as effective at inducing motion in the target as field-like backgrounds, as suggested by previous literature. If so, could the background be placed inside the target and still be just as effective? No change in the size of the effect would mean the flow-parsing hypothesis needs a very broad concept of what constitutes an optic-flow background if it is to hold; something defined more by attention than physical extent or position.
Method
For all experiments, the stimuli were presented on a SONY Trinitron G420 monitor (1024 × 768 pixels at 100 Hz) that was placed 60 cm from a chin rest. Images were created in MATLAB R2013b on a PC running Windows 8. The stimuli were stored and presented using a Cambridge Research Systems (CRS) ViSaGe visual stimulus generator. Two buttons located on the top row of a CB6 Response Box (CRS) were used to register participant responses.
All participants gave written informed consent prior to beginning the experiments. These experiments had ethics approval (RA/4/1/4503) from the Human Ethics Committee at the University of Western Australia.
In all three conditions the target consisted of 28 Gabors evenly distributed around a 4° radius ring centered on the display. Observers were instructed to fixate as close to the center of the ring as they could (a fixation point was not included as it would provide an unwanted reference for judging target motion). In the background field condition, the background consisted of 40 Gabor elements randomly scattered over a 20° × 20° region centered on the display (elements were free to also appear inside the target ring); in condition 2 the 40 Gabor elements were evenly distributed around a 5.6° radius ring surrounding the target; and in condition 3 there were 16 Gabor elements evenly distributed on a 2.4° radius ring inside the target. All Gabor elements were distributed and oriented randomly at the beginning of each trial. The Gaussian envelopes had a standard deviation of 8′, and the carrier had a spatial frequency of 3 cycles per degree (c/°). All Gabor elements in the target ring had a Michelson contrast of 0.40 ([Lmax–Lmin]/[Lmax + Lmin]) or, in the case of the inner and outer background rings, had a contrast that perceptually matched the target contrast. Since piloting showed that this adjustment had little apparent effect, the elements in the field also had 0.4 contrast. Each Gabor element’s phase drifted at a rate that depended on its orientation and the speed and direction of the target or background object to which it belonged. For example, if the Gabor element was part of the background, its rate of drift was the dot product of a unit vector representing the direction of drift (orthogonal to the Gabor’s stripes) and a vector representing the speed and direction of the background. This is equivalent to making all background Gabor elements drift consistently with the IOC solution for the desired background motion. The target was separately constructed in the same manner. This produced the percept of two rigidly moving objects, one being the target and the other the background. Both “moved” in this way at a speed of 6°/s.
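The drift-rate calculation just described can be sketched as follows (a minimal illustration; the variable names, angle conventions, and per-frame phase step are our assumptions rather than the actual stimulus code):

```matlab
% Set one Gabor element's carrier drift so that it is consistent with the rigid
% motion of the object (target or background) to which the element belongs.
objSpeed = 6;                                   % object speed, deg/s
objDir   = 30;                                  % object direction, deg clockwise from vertical
objVel   = objSpeed * [sind(objDir); cosd(objDir)];          % [x; y] velocity components, deg/s

stripeOri  = 70;                                             % this element's stripe orientation, deg from horizontal
driftDirn  = [cosd(stripeOri + 90); sind(stripeOri + 90)];   % unit vector orthogonal to the stripes
driftSpeed = driftDirn' * objVel;                            % signed drift speed along that direction (dot product)

sf        = 3;                                  % carrier spatial frequency, c/deg
frameRate = 100;                                % display refresh, Hz
dPhase    = 360 * sf * driftSpeed / frameRate;  % carrier phase increment per frame, in degrees of phase
```

Because every element belonging to the same object receives the drift speed given by this dot product, the set of elements is, by construction, consistent with the IOC solution for that object's velocity.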
There were two versions of the background ring outside condition; one for making a comparison with the background field condition and the other for making a comparison with the background ring inside condition. They are denoted, respectively, “background ring” and “background ring outside.”
For each background field versus background ring session, the background moved in one of five randomly chosen directions clockwise from vertical: 0°, 15°, 30°, 60°, or 90°. A moving target was simultaneously presented with the background on each trial and participants indicated whether the target appeared to move clockwise or anticlockwise of vertical. The target directions were chosen and responses were analyzed in real time using the Psi Psychophysical Method (Prins & Kingdom, 2018) in order to find the point of subjective vertical (upwards drift) for the target for each background direction. Seventy-five trials were run during each background direction session, as pilot studies showed that threshold estimates tended to be stable at this point, and each participant did each background direction session twice. Background field and background ring sessions were randomly interleaved. Each stimulus was presented for 500 ms and a central fixation point appeared between stimulus presentations.
For the background field and background ring conditions, participants consisted of five experienced observers recruited from the Vision Lab at the University of Western Australia. All participants except KH were naïve to the hypothesis of the experiment. All participants had normal or corrected-to-normal visual acuity. Observer ED has a divergent squint and completed the experiments using an opaque eye patch over the non-dominant eye.
For background ring outside and background ring inside conditions, the background direction was always 30° and the subjective vertical direction for the target was found using the same method as for background field and background ring conditions. As we wanted to compare background outside with background inside the target ring, we needed to control for perceived contrast as a function of eccentricity as it has been shown previously that contrast detection thresholds vary with eccentricity (Koenderink et al., 1978a, 1978b), perceived speed can vary with perceived contrast (Thompson, 1982), and signal segmentation can occur as a result of differences in apparent contrast (Croner & Albright, 1997).
Each participant, therefore, first completed a series of contrast matching sessions—one for each of the eight conditions described below—and their results were used to normalize the apparent contrasts for each participant individually in following sessions. During contrast matching sessions, participants indicated whether the inner or outer ring—one being the same radius as the target ring—was of higher contrast and the Psi Psychophysical Method was used to find the contrast for the non-target ring which equated the appearance of the two rings. Stimulus presentation time was the same as for the main experiment.
Background ring outside and background ring inside conditions were run in conjunction with Experiment 2 outlined below using the same set of participants. Accordingly, there were eight conditions. Each was run twice for each participant. The conditions were randomly interleaved. The eight conditions were all possible combinations of the following pairs: background inside/outside of target (Experiment 1, part 2 conditions), target high spatial frequency/low spatial frequency, and background high spatial frequency/low spatial frequency (Experiment 2 conditions).
Eight participants took part in these experiments. ED, TM, and DP were experienced observers. All participants, except JH, were naïve to the hypotheses of the experiments. Participants all had normal or corrected to normal visual acuity. Observer ED has a divergent squint and completed the experiment using an opaque eye patch and monocular vision.
Results
In the first part of this experiment, the stimulus consisted of a ring target moving in an IOC-defined direction against either a background field of randomly positioned Gabors or a ring of Gabors moving with a different IOC direction—the motion being produced by physically stationary “drifting” 1D motion elements as just described in the Methods section. The deviation of the target direction clockwise from vertical while being perceived as vertical is plotted against the background direction (also in degrees clockwise from vertical) in Figure 3.
Figure 3.
Experiment 1 part 1; background field versus background ring. Shown is target direction that is perceived as vertical for a range of background directions for both the background ring and background field conditions. Both are measured in degrees clockwise of vertical. This convention is used in all graphs to follow. The averaged performance of the group of observers is plotted. Error bars represent 1 SD.
Note that the axes represent the direction of the background (horizontal axis) and the direction of the target (vertical axis) motion relative to vertical in a clockwise direction, that is, 0 means vertically upwards and 90 means horizontally rightwards motion. It is plotted in this way so that the vertical axis is indicative of the size of “repulsion” effect, that is, the deviation of the target from vertical while still appearing vertical. This convention for the vertical axis is used for all plots below.
There was a similar pattern of results across observers but there were large differences in effect size between observers. This is reflected in the large standard deviations in the combined results shown in Figure 3 (note, 95% CIs are not shown as the significance of the means resulting from the spread between observers is not the focus). Individual results are shown in the Supplemental Material for the interested reader. The mean target direction that appears vertical when the background direction is also vertical is non-zero for both data sets, but the difference from zero is non-significant for both the field (t[4] = 1.26, p = .28, two-tailed t-test) and the ring (t[4] = 2.31, p = .08, two-tailed t-test). This is discussed further in the Supplemental Material.
The shape of the two curves in Figure 3 is consistent with the repulsion being an induced motion phenomenon. Specifically, the general shape matches that seen in previous induced motion studies (Farrell-Whelan et al., 2012) and does not match those in previous motion direction illusion studies (Marshak & Sekuler, 1979). This important finding and its implications are discussed in more detail in the Experiment 1 Discussion section below.
When the background field of randomly placed Gabors was replaced with the same number of Gabors lying on a ring outside of the target ring, the results were similar. There was no statistically significant difference between having a ring and a field as background (two-way repeated-measures ANOVA, F [1, 20] = 2.198, p = .154). Giving a shape to the background, that is, making it object-like without changing the number of local Gabors, did not significantly affect its repulsive influence on the target.
We also tested whether the position of the background relative to the target, inside or outside, made a difference to the induced motion effect. The group-averaged results of this comparison are shown in Figure 4 along with the results of Experiment 2. Having the background ring outside versus inside the target ring is compared for both low spatial frequency and high spatial frequency target and background Gabors. To examine the effect of having the background inside versus outside of the target simply compare directly-neighboring “inside” and “outside” Background Type conditions in the graph; the two striped bars in each set of four are Inside conditions and the other two are Outside. The background direction was 30° clockwise from vertical for all conditions as explained in the Methods section.
Figure 4.
Experiment 1 part 2 and experiment 2; background ring inside versus outside of the target ring with high and low spatial frequency (“sf”) local elements. The target direction normalization process is described in the text. Error bars represent 95% confidence intervals.
Despite the similar pattern of results across participants there were large differences in actual target repulsion magnitudes. For this reason, the data was normalized to remove differences between individuals using two baseline conditions, that is, all “background outside” conditions were normalized using the target low frequency/background low frequency outside condition (as it matched our first experiment) and all “background inside” conditions were normalized using the target low frequency/background low frequency inside condition. We used an additive normalization process (new result = mean baseline + [old – individual baseline]). The background outside and background inside conditions were normalized separately because the direction of the biases for each participant varied according to the position of the background. Note that in normalizing the data the group means did not change, only the variance in the group data was affected, that is to say, a graph of the raw data would be identical to the normalized one but error bars would be larger and the pattern of differences between conditions would be the same.
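For clarity, the additive normalization amounts to a one-line operation (a sketch with hypothetical values, where data holds one repulsion value per observer in rows and per condition in columns, and b indexes the baseline condition):

```matlab
% Hypothetical data: rows are observers, columns are conditions.
data = [12 10  9  8;
         4  3  5  2;
         8  7  6  7];
b = 1;    % baseline condition

% Shift each observer's scores by their own baseline, then add back the group
% baseline mean; condition means are unchanged and only between-observer
% offsets are removed (implicit expansion; use bsxfun in releases before R2016b).
normData = data - data(:, b) + mean(data(:, b));
```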
The position of the background relative to the target—inside or outside—had no significant effect on the repulsion interaction between target and background (three-way repeated measures ANOVA, F [1, 56] = 0.1325, p = .717). The RMANOVA was conducted on the raw data using the factors target frequency, background frequency, and background position.
Discussion
The primary goal of Experiment 1 was to see if our novel dispersed 1D local motion stimulus produced an induced motion effect. If the characteristics of the target repulsion seen here compare well to those from previous induced motion experiments it would indicate that induced motion is mediated in a brain area that follows the local element segregation and integration steps as the target and background motions upon which the effect acts are not available until after those steps. Accordingly, we compared our results to those from Farrell-Whelan et al. (2012), who performed a standard induced motion experiment, and, to make sure our results were not consistent with the direction repulsion effect, we plotted our results alongside those of Marshak and Sekuler (1979), whose study is considered a classic direction repulsion experiment.
Both teams looked at the repulsion strength as a function of the directional separation between the target and background. One important difference between the two effects is clear when the repulsion strength is plotted against the background direction relative to the perceived target direction. Farrell-Whelan et al. (2012) measured this directly but for Marshak and Sekuler (1979) the background direction relative to perceived target was, here, calculated by adding the perceived deviation of the target from its actual direction to the difference between the target and background direction. Like Farrell-Whelan et al. (2012) we measured this directly by setting the perceived target direction to vertical and measuring the effect of the background direction relative to vertical. Figure 5 shows the result of the comparison.
Figure 5.
Comparison of our results with those of Farrell-Whelan et al. (2012) and Marshak and Sekuler (1979)—the first being an induced motion experiment and the second a direction repulsion experiment. Note that both the Target Direction and Background Direction are measured relative to the perceived target direction. The dashed arrow points to the peak of the Marshak curve and the full arrow points to the peak of the Farrell-Whelan curve. Pilot studies indicated that our repulsion effects peaked in the same place as Farrell-Whelan. Peaking at 90° is a feature of induced motion (see text). Data are taken from Experiment 1 above, Farrell-Whelan et al. (2012), and Marshak and Sekuler (1979).
For induced motion, the effect size increases with the speed of the background (Gogel, 1979; Post & Heckmann, 1986), albeit at a diminishing rate (Wallach & Becklen, 1983). The maximum induced motion effect corresponds to the point at which the target direction is most deviated from the perceived target direction, that is, if an axis is drawn that is orthogonal to the perceived target direction, the actual target component on this axis is at its maximum. This will occur when the background component on that axis is also maximal. This occurs at 90° on the x-axis in Figure 5 because, for a fixed-speed background, its speed component orthogonal to the perceived target direction is maximum when the background is orthogonal to the perceived target direction. The relationship of the effect size to the speed of the background is not so straightforward for the direction repulsion effect (Curran & Benton, 2003), so the peak need not lie at 90° (Mather & Moulden, 1980). Our pilot studies clearly showed a drop-off in effect size after 90° but, to minimize an already-large observer load, we chose to measure only up to the peak. Our results are qualitatively similar to those of Farrell-Whelan et al. (2012), albeit diminished in strength. This demonstrates that the effect we are seeing in our results is consistent with the induced motion effect. Consequently, this experiment provides evidence that induced motion is mediated in a relatively late stage of the motion processing system—following the 1D motion integration system that works out the motions of the target and background.
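The geometry behind this 90° peak can be written down explicitly under a simple subtractive account (a sketch only, with a gain g applied to the background; this is not necessarily the exact form of the model applied in the General Discussion). With target speed T, background speed B, and background direction θ measured clockwise from the perceived (vertical) target direction, the actual target direction φ that appears vertical is the one whose horizontal component is exactly cancelled by the subtracted background component:

T·sin(φ) = g·B·sin(θ), so φ = arcsin[(g·B/T)·sin(θ)] (for g·B ≤ T),

which increases with θ up to a maximum at θ = 90° and declines thereafter, matching the induced motion signature in Figure 5.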
Our results also demonstrate that the induced motion effect is insensitive to significant changes in the shape and position of the background relative to the target. Even having the background the same shape as the target and positioned inside it did not change the effect size. This confirms previous findings that a background need not surround or be larger than the target (e.g., Carr & Hardy, 1920; Day et al., 1979; Duncker, 1929). It also suggests that what constitutes background is a matter of attention; in the background-inside and background-outside conditions, both the target and background were rings, and the only constant feature of the background was that it was the ring not being attended to. This calls for a more abstract definition of what constitutes background in induced motion experiments (contrast with, e.g., Palmer, 1999) and requires just as broad a definition for what constitutes an optic flow background if the flow-parsing explanation of induced motion is to hold. This is discussed further in the General Discussion section.
The fact that the background-inside results matched those of the background-outside condition also invalidates a simple attentional spotlight mechanism wherein a circular “spotlight” is shone on a retinal area that covers the target and the target’s motion is calculated by integrating signals within that area. If that were the case, for the background-inside condition, the background motion elements would be integrated into the target and a target motion somewhere between the true target and background motions would be used as input into the induced motion system leading to a different set of perceived target directions than those for when the background was outside of the spotlight. This would likely lead to lower repulsion effects. There was no such difference indicating that the visual system was capable of correctly segregating target and background elements independent of where they were placed in the scene.
These results also speak to our aim of uncovering the processing stage at which induced motion occurs. We have demonstrated that what constitutes background in induced motion experiments is far from simple; too complex to be easily compatible with a simple low-level mechanism. Attention-driven feedback from higher areas to a theoretical lower-level induced motion mechanism where the feedback tells the system what is background and what is target is possible, but a simpler solution is to have the induced motion mechanism follow the integration system that determines what is target and what is background. Also, V1-level interactions are likely to be local—occurring between close-neighboring Gabor elements—and the average number of close target-background neighbors is much greater for the background ring conditions than for the background field condition where the background elements are dispersed. We saw no change in the induced motion effect between these conditions.
Experiment 2. Does Spatial Frequency of Background Relative to Target Matter?
There are three possible ways the target and background elements were interacting to produce the effects seen so far. The first is that the individual elements were interacting before they were segmented and integrated into target and background global percepts, the second is that they were interacting during the integration phase, and the third is that they were interacting afterwards, once the global target and background solutions were reached. The first is unlikely as individual elements do not contain information on their own about the target and background motions, and we are seeing interactions that are consistent with induced motion based on the global target and background IOC solutions. But if the interaction were somehow occurring at this low processing level (probably V1), then we would expect to see a decrease in interactions between nearby target and background elements the more they differed in spatial frequency (e.g., Blakemore & Campbell, 1969; Cannon & Fullenkamp, 1991; Kim & Wilson, 1997). In Experiment 2 we used a 1-octave difference which, by low-level explanations, should result in a minimal interaction between the target and background elements (Blakemore & Campbell, 1969; Cannon & Fullenkamp, 1991). If the second type of interaction (in the integration phase) were occurring, then the magnitude of interaction is also likely to be sensitive to spatial frequency based on work conducted previously in our lab. Amano et al. (2009b) tested the spatial frequency tuning of the integration systems for Global Gabor stimuli like the ones used in our experiments by looking at how noise elements of one spatial frequency influence the threshold of detection for global motion carried by elements at another spatial frequency. Thresholds dropped approximately three-fold on average for one-octave differences between signal and noise elements for 0.7 and 1.4 cycle/deg signal elements and the trend appeared to be the same for 2.8 cycle/deg signal elements (although testing did not occur for 5.6 cycle/deg noise elements). Extrapolating these results, we would expect a low magnitude of interaction between low spatial frequency (3 c/°) and high spatial frequency (6 c/°) elements during the integration stage; much lower than for elements of the same spatial frequency. Our aim was to test for the decrease predicted by both low-level explanations.
Method
In this and all remaining conditions the background moved at a direction of 30°. This direction was chosen as our Experiment 1 results showed it was far enough from vertical to produce a reliable significant deviation in perceived target direction from vertical. As outlined above, the Experiment 2 conditions were run in conjunction with Experiment 1 Part 2 to facilitate comparison making a total of eight conditions. In order to assess the influence of spatial frequency differences between the target and background, the target elements were either high or low spatial frequency and the background either high or low. “Low frequency” Gabor elements had a spatial frequency of 3 c/° (the standard spatial frequency used in our other experiments) and “high frequency” Gabor elements had a frequency of 6 c/°; a one octave difference. The target was a ring with radius 4° and, having established that a background ring was just as effective as a field at driving induced motion, the background was also a ring, but with a radius of 5.6° or 2.4° as described in Experiment 1. There were eight participants as described above. The session methods and procedures were the same as those described for Experiment 1.
Results
Our results for low spatial frequency and high spatial frequency target and background combinations are depicted alongside the results of Experiment 1 part 2 in Figure 4 since these conditions were run in parallel. For a given target type, compare high (white bars) and low (grey bars) frequency background results. In our experiments, the spatial frequency of the background for a given target frequency had no significant effect on the repulsion interaction between target and background (three-way repeated measures ANOVA, F [1, 56] = 0.6185, p = .435 for main effect of background frequency and F [1, 56] = 0.0006, p = .981 for target/background frequency interaction). As above, the RMANOVA was conducted on the raw data using the factors target frequency, background frequency, and background position.
Experiment 3. Does Orientation Content of Background Relative to Target Matter?
As our targets and backgrounds, up until now, have been composed of randomly oriented elements, it is possible that interactions are strong between similarly oriented neighboring components and weak or non-existent between differently oriented neighbors and that we are seeing an average of these interactions. This would be consistent with a low-level explanation of the effect, as low-level interactions tend to be localized in retinal space (e.g., Badcock & Westheimer, 1985; Kapadia et al., 2000; Polat & Sagi, 1993) and tend to decrease in strength as orientations differ between components (e.g., Apthorp et al., 2017; Blakemore & Campbell, 1969; Cannon & Fullenkamp, 1991; Petrov & McKee, 2009). To test for this, we compared the induced motion effect when the orientation content of the target and background were separated by 45° with the effect obtained when the orientation content of the target spanned the same space as the background (a full 360°). Specifically, we were looking for a decrease in the target/background interaction with an increase in the smallest differences between local orientations. For most low-level orientation interactions, strength is weak to non-existent with a 45° difference between element orientations (e.g., Apthorp et al., 2017; Cannon & Fullenkamp, 1991), so one would expect to see significantly lower magnitudes of induced motion if the effect were mediated in low-level visual areas.
If the interaction were occurring during or after the integration stage we would expect no change with orientation difference as it was shown by Amano et al. (2009b) that interactions during integration are not tuned to the orientations of the local elements.
Note, also, that differences of up to 45° between local elements normally produce weak tilt illusion effects (Dickinson et al., 2010, 2012; Takao et al., 2020). In this condition that influence was constant unlike in the 360° condition.
Method
Participants each took part in two sessions, one for each of the two conditions, and completed each session twice. The session methods and procedures were the same as those described above. There was no need to control for perceived contrast of the background as we were not comparing a background ring outside to a ring inside of the target ring; the 40 background elements were randomly scattered across a field just as in the first Experiment 1 condition.
In order to produce a condition where the orientation content of the target and background were separated by 45°, each background element’s orientation was randomly assigned to one of the cardinal axes, and the target element’s to one of the intercardinals.
Five participants took part in this experiment. All were experienced observers and all but DB and MF were naïve to the hypotheses of the experiments. Participants all had normal or corrected to normal visual acuity. Observer ED has a divergent squint and completed the experiment using an opaque eye patch and monocular vision.
Results
There was no statistically significant difference between the induced motion effects for the two conditions for the group (paired, two-tailed t-test, p = .23). The mean difference was 2.46° with a 95% CI of 4.79°. Nor was there a statistically significant difference for any individual observer (p > .05).
Experiment 4. Does Element Type (1D or 2D) Matter?
If the induced motion effect is unaffected by particular combinations of 1D and 2D targets and backgrounds, then the effect is very likely to be mediated after the separate 1D and 2D integration systems as explained in the Introduction section.
As in Experiments 1 and 2, we positioned the background ring inside or outside of the target at the same time as testing for the effect of using 1D and 2D targets and backgrounds.
Method
The target and background were contrast matched using the method described above to control for potential contrast effects of the position changes. Eight conditions were used which represented all possible combinations of the pairs: background inside/outside of target, target 1D/2D, and background 1D/2D.
A 1D target or background was made up of Gabor elements just as in previous experiments. A 2D target or background was made up of plaid patterns instead. Each plaid pattern consisted of the sum of two orthogonal 3 c/° gratings set within a Gaussian envelope and drifted at the speed and direction of the target or background to which it belonged. Its orientation was random just as with the 1D Gabors. For both the Gabors and the plaid patterns the target elements had a Michelson contrast of 0.4 (i.e., 0.2 for each of the plaid components). The target ring radius remained 4° and the two background ring radii were 2.4° and 5.6° as in Experiments 1 and 2.
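For illustration, a single plaid element of this kind could be generated as follows (a sketch only; the patch size, the pixels-per-degree value, and the variable names are assumptions rather than the actual stimulus code):

```matlab
% One plaid element: two orthogonal 3 c/deg gratings, each at 0.2 contrast,
% summed within a circular Gaussian window (sigma = 8 arcmin).
sz  = 64;  ppd = 35;                       % patch size (pixels) and assumed pixels per degree
[x, y] = meshgrid(((1:sz) - sz/2) / ppd);  % coordinates in degrees, centered on the patch

sf  = 3;                                   % spatial frequency of each component, c/deg
ori = rand * 180;                          % random plaid orientation, deg
u   =  cosd(ori) * x + sind(ori) * y;      % axis along the first grating's normal
v   = -sind(ori) * x + cosd(ori) * y;      % orthogonal axis for the second grating
ph1 = 0;  ph2 = 0;                         % carrier phases, advanced every frame to produce drift
plaid = 0.2 * cos(2*pi*sf*u + ph1) + 0.2 * cos(2*pi*sf*v + ph2);

win = exp(-(x.^2 + y.^2) / (2 * (8/60)^2));   % Gaussian envelope, sigma = 8 arcmin
img = 0.5 * (1 + plaid .* win);               % luminance image about mean grey (Michelson contrast 0.4)
```

Per frame, ph1 and ph2 would each be advanced in proportion to the component of the parent object's velocity along the corresponding grating's normal, so the plaid as a whole drifts with the object's full 2D velocity.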
Four participants took part in this experiment. All were experienced observers and all but DB were naïve to the hypotheses of the experiments. Participants all had normal or corrected to normal visual acuity. Observer ED has a divergent squint and completed the experiment using an opaque eye patch and monocular vision.
Results
Figure 6 shows the results of assigning plaids (2D) and Gabors (1D) to the two stimulus components in various combinations; 2D target with 2D background, 2D target with 1D background, 1D target with 2D background, and 1D target with 1D background. At the same time, having the background ring inside versus outside of the target is compared to verify Experiment 1 results. Directly comparing Gabor and plaid background types for each target type reveals whether matching 1D with 1D, and 2D with 2D as in standard induced motion experiments produces different results to having mixed target and background types. The data was normalized in the same way and for the same reasons listed for Experiment 1 part 2 using the conditions target 1D/background 1D outside and target 1D/background 1D inside as normalizing conditions. Again, this did not alter the means—only the variation in the data.
Figure 6.
Experiment 4; comparing background and target types—1D and 2D—along with background relative position. The target direction normalization process is described in the text. Error bars represent 95% confidence intervals.
Note. 1D = one-dimensional; 2D = two-dimensional.
The repulsion effect was unaffected by background type (three-way RMANOVA main effect of background type, F [1, 24] = 0.1458, p = .706), nor was there an interaction between target type and background type (F [1, 24] = 0.008104, p = .929); both analyses were conducted on the raw data.
Discussion
Our results indicate that the induced motion effect is the same when the target and background elements are of inconsistent type (one 1D and the other 2D) as when they are consistent (both 1D or both 2D). This means that the induced motion effect is highly likely to be mediated in a brain area that follows the separate 1D and 2D integration pathways in the visual processing stream. As the induced motion effect relies on an interaction between target and background motions, and since knowing the target and background motions requires integration of the local motion elements, if the interaction is the same no matter what combination of integration channels is used, then it must be the final outputs of the integration channels that are fed into the induced motion system, so that system must lie after those integration channels.
General Discussion
Our goal was to conduct psychophysical tests of the plausibility of a flow-parsing explanation for the induced motion effect. In particular, we conducted tests to determine the minimum visual processing stage at which the repulsive interaction between target and background occurs, as the flow-parsing proposal requires a late-stage interaction, that is, at the level at which optic flow is processed. Our results clearly support an interaction occurring after the stage at which (both 1D and 2D) local motions are integrated to produce global 2D motion solutions. We found no evidence for an interaction of the sort that produces induced motion at earlier stages. Secondly, we tested the effect of various configurations for the unattended motion in the scene, as the flow-parsing proposal requires the unattended motion to be treated as motion belonging to a stationary background against which an observer is moving. We found that the induced motion effect is robust to changes in background configuration—even remaining unchanged when the background was the same shape as the target and placed inside the target. We discuss, below, whether these findings are compatible with the flow-parsing view.
With regard to the level at which induced motion is mediated, our results provide support for a three-stage process for the effect when a scene is composed of scattered local motion signals:
Segregation of local motion signals into target and background pools,
separate integration of signals within the two pools and
a target/background interaction where background motion is subtracted from target motion.
Setting aside previous studies, it is feasible that steps 1 and 2 occur separately—segregation based on attending to the ring-shaped target, followed by separate integration of the target and background pools—but it is equally feasible that integration of local signals and segregation into two pools occur simultaneously via a “conversation” between brain areas leading to a viable interpretation of the scene (Braddick, 1993). The question of the simultaneity of these two steps is beyond the scope of this paper. What we argue, given the evidence here, is that step 3 necessarily occurs after the first two and, thus, at a relatively late stage in the motion processing pathway.
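The three stages can be illustrated with a minimal sketch (the velocity values and the gain g are our assumptions for illustration; this is not necessarily the exact form of the model applied to the data in this General Discussion):

```matlab
% Sketch of the three-stage account (values are illustrative only).
% Stages 1 and 2: local elements are segregated into target and background pools
% and each pool is integrated (e.g., via the IOC computation sketched in the
% Introduction), yielding global velocity estimates vTarget and vBackground.
T = 6;  tgtDir = 10;  vTarget     = T * [sind(tgtDir); cosd(tgtDir)];   % deg/s, [x; y]
B = 6;  bgDir  = 30;  vBackground = B * [sind(bgDir);  cosd(bgDir)];

% Stage 3: a fraction g of the background velocity is subtracted from the
% target velocity, repelling the perceived direction away from the background.
g = 0.2;                                                % assumed subtraction gain
vPerceived   = vTarget - g * vBackground;
perceivedDir = atan2d(vPerceived(1), vPerceived(2));    % deg clockwise from vertical
```

Solving, in the same way, for the actual target direction that appears vertical at each background direction reproduces the qualitative shape of the curves in Figures 3 and 5, with a peak when the background is orthogonal to the perceived target direction.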
Warren and Rushton (2009) clearly demonstrate the inadequacy of low-level explanations for the shift in perceived target direction, at least when the target moves against an optic flow background. When expanding radial optic flow is restricted to one hemifield and the moving target is placed in the opposite hemifield against a blank background, a low-level explanation would predict a target direction shift away from the center of the display, opposite to the prevailing direction of the visible expanding field. Instead, they found that the target direction shift was toward the center of the display. This is exactly what would be expected if the global flow were used as the background for the target: the expanding optic flow, extrapolated to the region near the target, points away from the center of the display, so subtracting it produces the observed shift toward the center. This, too, provides evidence that the repulsion effect is mediated in a higher visual area that deals with optic flow.
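The logic of this prediction can be captured in a few lines. The following Python fragment is a hedged illustration only; the focus-of-expansion location, target position, and flow-parsing weight are invented for the example and are not the stimulus parameters Warren and Rushton used.

```python
import numpy as np

focus = np.array([0.0, 0.0])           # display center / focus of expansion (illustrative)
target_pos = np.array([8.0, 0.0])      # target in the hemifield opposite the visible flow
target_motion = np.array([0.0, 1.0])   # target physically moves straight upward

# For an expanding radial field, the flow extrapolated to the target's location
# points away from the display center.
flow_at_target = target_pos - focus
flow_at_target /= np.linalg.norm(flow_at_target)

beta = 0.5                             # partial flow parsing (illustrative weight)
perceived = target_motion - beta * flow_at_target
print(perceived)                       # negative x-component: a shift toward the center
```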
Note, though, that a psychophysical study by Harris and German (2008) indicates that the induced motion effect is implemented prior to the placing of visual motions into a 3D world model. In a stereoscopic display of (1) motion in depth and (2) lateral motion, the induced motion effect (perceived motion of stationary target as fraction of background motion) was the same in terms of the perceived retinal motion of the target and retinal motion of the background, but different when cast in terms of the motions in the real world implied by the display. This may indicate that induced motion falls out of a rough, first-pass estimator of target motion based on retinal motion alone (see also Brenner, 1991; Harris et al., 2004). This is in contrast to work by Warren and Rushton (Rushton & Warren, 2005; Warren & Rushton, 2007) showing that the depth of the target relative to moving background elements is taken into account when judging target motion in 3D scenes. The fact that depth information can be ignored indicates that the repulsion effect on the target may first be calculated at a level in the motion processing pathway that doesn’t represent a full 3D interpretation of the visual scene.
Taken together with our results, this places rough bounds on the location of the induced motion mechanism in the motion processing pathway. Area MST in the monkey (and the equivalent in humans) is a good candidate as it lies after area MT, where the result of the 1D and 2D integration processes is thought to be represented (Smith et al., 2005); it is insensitive to the low-level spatial differences between the target and background we used in our experiments (Duffy & Wurtz, 1991; Geesaman & Andersen, 1996); it is capable of representing optic flow as per Warren and Rushton’s proposal (Wurtz, 1998); and it does not appear to fully encode 3D motion information (Héjja-Brichard et al., 2020). Further support for an MST-mediated effect can be found in Sasaki et al. (2017, 2019), Takemura et al. (2012), and Wild (2018). Note that this differs from the motion repulsion mechanism, which is likely to be a result of mutual inhibition between direction-tuned neurons in lower-level motion-selective visual areas (Blakemore et al., 1970; Chen et al., 2005; Hiris & Blake, 1996; Kim & Wilson, 1996; Marshak & Sekuler, 1979; Rauber & Treue, 1999; Wilson & Kim, 1994). Benton and Curran (2003) provide strong evidence that this lower area is likely to be MT, as the strength of that effect is a function of the average speed of multiple local moving elements, which is calculated in area MT.
As a final test of the validity of a flow-parsing explanation, we apply a simple model based on the concept. The model is nothing more than a mathematical representation of the flow-parsing process depicted in Figure 1, except that the subtraction represented in the central panel of the figure is only partial. This is required because our results above, and those of previous researchers, demonstrate that the background motion subtraction is rarely complete. Accordingly, terms such as “magnitude of induced motion” (Harris & German, 2008; Post & Heckmann, 1986; Schulman, 1981), “extent of induced motion” (Gogel, 1979), “strength of the illusion” (Zivotofsky, 2004), and “measure of induction” (Bassili & Farber, 1977) in the induced motion literature and the “flow parsing gain” (Dupin & Wexler, 2013; Niehorster & Li, 2017) in the target-against-optic-flow literature have been invented to portray the incompleteness of the background subtraction. Our model is represented by the following equation:

\[
\mathbf{p} = \mathbf{t} - \beta\,\mathbf{b}
\]
where p is a vector representing perceived target motion, t is a vector representing actual target motion, b is a vector representing background motion, and β is the weight given to the −b term. The first term can be taken to represent the perceived target motion if there were no self-motion, that is, just the actual target motion across the display, and the second can be thought of as a background-used-as-reference effect (opposite to background motion). β controls the weight given to the background reference effect when determining target motion. According to the flow-parsing account, more weight should be given to this effect when the sense that the background motion is due to self-motion is stronger. In Figure 7, we apply the model to our results as well as to those of Farrell-Whelan et al. (2012) shown previously in Figure 5. The fit was produced using GraphPad Prism’s nonlinear regression method with a least-squares criterion, and it included a parameter that allowed for the vertical offsets seen in our data in Figure 5, which result from biases in the perception of vertical when the background direction was vertical.
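As an illustration of how β and an offset term can be estimated from direction judgments, the Python sketch below fits the model with SciPy's general least-squares routine rather than the GraphPad Prism procedure we actually used; the observed directions are placeholder values, not our data or Farrell-Whelan et al.'s, and the constant angular offset stands in for the vertical-bias term.

```python
import numpy as np
from scipy.optimize import least_squares

def predicted_directions(beta, offset_deg, t, background_dirs_deg, b_speed=1.0):
    """Predicted perceived target direction (deg) under p = t - beta * b,
    plus a constant angular offset standing in for the vertical-bias term."""
    theta = np.deg2rad(background_dirs_deg)
    b = b_speed * np.column_stack([np.cos(theta), np.sin(theta)])
    p = t - beta * b
    return np.rad2deg(np.arctan2(p[:, 1], p[:, 0])) + offset_deg

def angular_residuals(params, t, background_dirs_deg, observed_deg):
    beta, offset_deg = params
    diff = predicted_directions(beta, offset_deg, t, background_dirs_deg) - observed_deg
    # Wrap residuals into (-180, 180] so the fit ignores angle wrap-around.
    return (diff + 180.0) % 360.0 - 180.0

# Placeholder data: an upward-moving target with backgrounds at various directions.
t = np.array([0.0, 1.0])
background_dirs = np.array([0.0, 45.0, 90.0, 135.0, 180.0, 225.0, 270.0, 315.0])
observed = np.array([105.0, 110.0, 92.0, 72.0, 75.0, 70.0, 88.0, 108.0])

fit = least_squares(angular_residuals, x0=[0.3, 0.0],
                    args=(t, background_dirs, observed))
beta_hat, offset_hat = fit.x
print(f"beta = {beta_hat:.2f}, offset = {offset_hat:.1f} deg")
```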
Figure 7.
Simple flow-parsing-inspired model applied to our data and to that of Farrell-Whelan et al. (2012) (F-W). See text for details.
R² values for the fits to the data were 0.98, 0.99, and 0.97 for our field data, our ring data, and Farrell-Whelan’s data, respectively. The fact that such a large proportion of the variation in the data can be explained by this simple equation makes a percept based on a mixture of the actual target motion and motion resulting from a flow-parsing construct a viable explanation. The larger β value for the Farrell-Whelan data may result from a reduced sense of the actual motion of the target due to the black tube through which their stimulus was viewed, which removed peripheral cues; Zivotofsky (2004) used a very similar stimulus to Farrell-Whelan’s but without the tube and obtained β values similar to ours. According to the flow-parsing hypothesis, this would mean a higher tendency to assign background motion to self-motion in the absence of peripheral visual cues indicating that the observer was stationary.
What is curious, given the data pointing to induced motion being, at least partially, an optic-flow-level phenomenon, is that “backgrounds” defined at this level can be as simple as a single dot (e.g., Carr & Hardy, 1920; Duncker, 1929) or as abstract as an object the same shape as the target placed inside the target (Experiment 2 above). It is plausible that a system for determining self-motion should be highly flexible: when walking in a darkened environment with only a few spots of light, those spots ought to be usable for navigation and postural stability, and when the only view of the world is through a window, this “background inside a target” should still provide a valid cue for self-motion (indeed, when we sit in a stationary train and see a neighboring train moving past the window, it is not unusual to get the false impression that we are moving). But by what means do moving, object-like components of a scene, which have traditionally been thought to activate an “object motion” pathway (Eifuku & Wurtz, 1998; Rosa & Tweedale, 2001; Tanaka et al., 1986), effectively “jump ship” and activate the self-motion/optic-flow pathway (Britten & Van Wezel, 2002; Britten, 2008) instead? As demonstrated here, attention has the ability to drive the switch.
With traditional 2D induced motion stimuli, there are no motion-in-depth cues, such as expansive flow fields, to tag the unattended motion as belonging to a background against which an observer is moving. Conceivably, the targets and backgrounds could be treated equally, as moving objects, by the visual system, and by attending to one, the other might be shunted to an optic-flow system. For example, both objects could be represented in area MT in the monkey (Britten et al., 1996; Czuba et al., 2014); then either the attended representation is held there while the unattended one activates area MSTd, which in turn sends feedback signals to MT that modify the perceived motion of the attended stimulus (Layton & Fajen, 2016), or both the attended and unattended signals activate MST, but the unattended one specifically activates MSTd, which is specialized for optic flow and self-motion, while the attended one activates MSTl, which appears to be specialized for representing object motion under conditions of self-motion (Eifuku & Wurtz, 1998; Sasaki et al., 2019). Given the flexibility of area VIP in representing object motion (in the presence of optic flow and other self-motion cues) relative to any task-relevant reference frame (Sasaki et al., 2020), it may act as the final judge of object motion, since induced motion may be considered the (incorrect) perception of the target in reference to the moving background.
We have provided substantial psychophysical evidence for a relatively late-stage mechanism for induced motion by showing that the effect is unaffected by significant low-level differences between local target and unattended element features, and by showing that the effect is the same when the target and unattended background are composed of local elements that are integrated via different mid-level mechanisms. This result is consistent with a flow-parsing explanation of induced motion, which requires targets and unattended backgrounds to be treated as (1) objects and (2) backgrounds against which a person is moving, respectively, that is, it requires the effect to be mediated in an area of the visual motion pathway that deals with optic flow. We confirm earlier reports that what is considered background is a matter of attention; the unattended motion in the scene is taken as background no matter what physical space it occupies relative to the target. Note, though, that the close fit of the data to our simple model implies only partial flow parsing, as the background motion subtraction is incomplete, presumably depending on the extent to which the background motion is considered to be due to self-motion.
Supplemental Material
Supplemental material, sj-docx-1-ipe-10.1177_20416695221118111 for The induced motion effect is a high-level visual phenomenon: Psychophysical evidence by Michael Falconbridge, Kassandra Hewitt, Julia Haille, David R. Badcock in i-Perception
Note that both Gogel (1979) and Wallach and Becklen (1983) used the orientation of the perceived path of a vertically moving target surrounded by a horizontally moving background as their measure of induced motion. For Gogel, the orientation barely changed with a proportional increase in background and target speed, meaning that a constant proportion of the (negative of the) background motion was being added to the target, that is, induced motion was proportional to background motion. For Wallach and Becklen’s higher speeds, induced motion similarly increased with background motion, but less so as the speeds increased.
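Under the model above, the path-orientation measure has a simple form; the short derivation below is our own illustration of why a roughly constant path orientation corresponds to a roughly constant β.

```latex
% Perceived path of a vertically moving target (speed v_t) with a
% horizontally moving background (speed v_b), under p = t - beta * b:
\[
  \mathbf{p} \;=\; \mathbf{t} - \beta\,\mathbf{b} \;=\; (-\beta v_b,\; v_t),
  \qquad
  \theta \;=\; \arctan\!\left(\frac{\beta\, v_b}{v_t}\right),
\]
% where \theta is the tilt of the perceived path away from vertical.
% Scaling v_t and v_b by a common factor leaves \theta unchanged, so a
% near-constant path orientation (Gogel, 1979) implies a near-constant beta.
```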
Footnotes
Declaration of Conflicting Interests: The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding: The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Australian Research Council (grant nos. DP160104211, DP190103103, DP190103474).
ORCID iDs: Michael Falconbridge https://orcid.org/0000-0001-9552-2597
David R. Badcock https://orcid.org/0000-0002-4517-435X
Supplemental Material: Supplemental material for this article is available online.
Contributor Information
David R. Badcock, The University of Western Australia, Australia
Mark Edwards, The Australian National University, Australia.
How to cite this article
Falconbridge, M., Hewitt K., Haille J., Edwards M., & Badcock D. R. (2022). The induced motion effect is a high-level visual phenomenon: Psychophysical evidence. i-Perception, 13(X), 1-24, https://doi.org/10.1177/20416695221118111
References
- Adelson E. H., Movshon J. A. (1982). Phenomenal coherence of moving visual patterns. Nature, 300(5892), 523–525. 10.1038/300523a0 [DOI] [PubMed] [Google Scholar]
- Aghajari S., Vinke L. N., Ling S. (2020). Population spatial frequency tuning in human early visual cortex. Journal of Neurophysiology, 123(2), 773–785. 10.1152/jn.00291.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Amano K., Edwards M., Badcock D. R., Nishida S. (2009a). Adaptive pooling of visual motion signals by the human visual system revealed with a novel multi-element stimulus. Journal of Vision, 9(3), 4–4. 10.1167/9.3.4 [DOI] [PubMed] [Google Scholar]
- Amano K., Edwards M., Badcock D. R., Nishida S. (2009b). Spatial-frequency tuning in the pooling of one- and two-dimensional motion signals. Vision Research, 49(23), 2862–2869. 10.1016/j.visres.2009.08.026 [DOI] [PubMed] [Google Scholar]
- Apthorp D., Griffiths S., Alais D., Cass J. (2017). Adaptation-induced blindness is orientation-tuned and monocular. i-Perception, 8(2), 204166951769814. 10.1177/2041669517698149 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Badcock D. R., Westheimer G. (1985). Spatial location and hyperacuity: The centre/surround localization contribution function has two substrates. Vision Research, 25(9), 1259–1267. 10.1016/0042-6989(85)90041-0 [DOI] [PubMed] [Google Scholar]
- Bassili J. N., Farber J. M. (1977). Experiments on the locus of induced motion. Perception & Psychophysics, 21(2), 157–161. 10.3758/BF03198719 [DOI] [Google Scholar]
- Benton C. P., Curran W. (2003). Direction repulsion goes global. Current Biology, 13(9), 767–771. 10.1016/S0960-9822(03)00285-9 [DOI] [PubMed] [Google Scholar]
- Blakemore C., Campbell F. W. (1969). On the existence of neurons in the human visual system selectively sensitive to the orientation and size of retinal images. The Journal of Physiology, 203(1), 237–260. 10.1113/jphysiol.1969.sp008862 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blakemore C., Carpenter R. H. S., Georgeson M. A. (1970). Lateral inhibition between orientation detectors in the human visual system. Nature, 228(5266), 37–39. 10.1038/228037a0 [DOI] [PubMed] [Google Scholar]
- Bowns L., Alais D. (2006). Large shifts in perceived motion direction reveal multiple global motion solutions. Vision Research, 46(8-9), 1170–1177. 10.1016/j.visres.2005.08.029 [DOI] [PubMed] [Google Scholar]
- Braddick O. (1993). Segmentation versus integration in visual motion processing. Trends in Neurosciences, 16(7), 263–268. 10.1016/0166-2236(93)90179-P [DOI] [PubMed] [Google Scholar]
- Brenner E. (1991). Judging object motion during smooth pursuit eye movements: The role of optic flow. Vision Research, 31(11), 1893–1902. 10.1016/0042-6989(91)90184-7 [DOI] [PubMed] [Google Scholar]
- Britten K. H. (2008). Mechanisms of self-motion perception. Annual Review of Neuroscience, 31(1), 389–410. 10.1146/annurev.neuro.29.051605.112953 [DOI] [PubMed] [Google Scholar]
- Britten K. H., Newsome W. T., Shadlen M. N., Celebrini S., Movshon J. A. (1996). A relationship between behavioral choice and the visual responses of neurons in macaque MT. Visual Neuroscience, 13(1), 87–100. 10.1017/S095252380000715X [DOI] [PubMed] [Google Scholar]
- Britten K. H., Shadlen M. N., Newsome W. T., Movshon J. A. (1993). Responses of neurons in macaque MT to stochastic motion signals. Visual Neuroscience, 10(6), 1157–1169. 10.1017/S0952523800010269 [DOI] [PubMed] [Google Scholar]
- Britten K. H., Van Wezel R. J. (2002). Area MST and heading perception in macaque monkeys. Cerebral Cortex, 12(7), 692–701. 10.1093/cercor/12.7.692 [DOI] [PubMed] [Google Scholar]
- Cannon M. W., Fullenkamp S. C. (1991). Spatial interactions in apparent contrast: Inhibitory effects among grating patterns of different spatial frequencies, spatial positions and orientations. Vision Research, 31(11), 1985–1998. 10.1016/0042-6989(91)90193-9 [DOI] [PubMed] [Google Scholar]
- Carr H. A., Hardy M. C. (1920). Some factors in the perception of relative motion: A preliminary experiment. Psychological Review, 27(1), 24–37. 10.1037/h0068843 [DOI] [Google Scholar]
- Chen Y., Meng X., Matthews N., Qian N. (2005). Effects of attention on motion repulsion. Vision Research, 45(10), 1329–1339. 10.1016/j.visres.2004.11.005 [DOI] [PubMed] [Google Scholar]
- Croner L. J., Albright T. D. (1997). Image segmentation enhances discrimination of motion in visual noise. Vision Research, 37(11), 1415–1427. 10.1016/S0042-6989(96)00299-4 [DOI] [PubMed] [Google Scholar]
- Cullen K. E. (2011). The neural encoding of self-motion. Current Opinion in Neurobiology, 21(4), 587–595. 10.1016/j.conb.2011.05.022 [DOI] [PubMed] [Google Scholar]
- Curran W., Benton C. P. (2003). Speed tuning of direction repulsion describes an inverted U-function. Vision Research, 43(17), 1847–1853. 10.1016/S0042-6989(03)00302-X [DOI] [PubMed] [Google Scholar]
- Czuba T. B., Huk A. C., Cormack L. K., Kohn A. (2014). Area MT encodes three-dimensional motion. Journal of Neuroscience, 34(47), 15522–15533. 10.1523/JNEUROSCI.1081-14.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Day R. H., Millar J., Dickinson R. G. (1979). Induced movement as nonveridical resolution of displacement ambiguity: Effect of enclosure and number of field elements. Perception & Psychophysics, 25(1), 23–28. 10.3758/BF03206105 [DOI] [PubMed] [Google Scholar]
- Dickinson J. E., Almeida R. A., Bell J., Badcock D. R. (2010). Global shape aftereffects have a local substrate: A tilt after effect field. Journal of Vision, 10(13), 5–5. 10.1167/10.13.5 [DOI] [PubMed] [Google Scholar]
- Dickinson J. E., Harman C., Tan O., Almeida R. A., Badcock D. R. (2012). Local contextual interactions can result in global shape misperception. Journal of Vision, 12(11), 3–3. 10.1167/12.11.3 [DOI] [PubMed] [Google Scholar]
- Duffy C. J., Wurtz R. H. (1991). Sensitivity of MST neurons to optic flow stimuli. I. A continuum of response selectivity to large-field stimuli. Journal of Neurophysiology, 65(6), 1329–1345. 10.1152/jn.1991.65.6.1329 [DOI] [PubMed] [Google Scholar]
- Duncker K. (1929). Über induzierte Bewegung. Ein Beitrag zur Theorie optisch wahrgenommener Bewegung [Concerning induced movement. A contribution to the theory of visually perceived movement]. Psychologische Forschung, 12(1), 180–259. 10.1007/BF02409210 [DOI] [Google Scholar]
- Dupin L., Wexler M. (2013). Motion perception by a moving observer in a three-dimensional environment. Journal of Vision, 13(2), 15. 10.1167/13.2.15 [DOI] [PubMed] [Google Scholar]
- Eifuku S., Wurtz R. H. (1998). Response to motion in extrastriate area MSTl: Center-surround interactions. Journal of Neurophysiology, 80(1), 282–296. 10.1152/jn.1998.80.1.282 [DOI] [PubMed] [Google Scholar]
- Farrell-Whelan M., Wenderoth P., Wiese M. (2012). Studies of the angular function of a Duncker-type induced motion illusion. Perception, 41(6), 733–746. 10.1068/p7125 [DOI] [PubMed] [Google Scholar]
- Geesaman B. J., Andersen R. A. (1996). The analysis of complex motion patterns by form/cue invariant MSTd neurons. The Journal of Neuroscience : The Official Journal of the Society for Neuroscience, 16(15), 4716–4732. 10.1523/JNEUROSCI.16-15-04716.1996 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gogel W. C. (1979). Induced motion as a function of the speed of the inducing object, measured by means of two methods. Perception, 8(3), 255–262. 10.1068/p080255 [DOI] [PubMed] [Google Scholar]
- Gu Y., DeAngelis G. C., Angelaki D. E. (2007). A functional link between area MSTd and heading perception based on vestibular signals. Nature Neuroscience, 10(8), 1038–1047. 10.1038/nn1935 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harris J. M., German K. J. (2008). Comparing motion induction in lateral motion and motion in depth. Vision Research, 48(5), 695–702. 10.1016/j.visres.2007.11.018 [DOI] [PubMed] [Google Scholar]
- Harris L. R., Jenkin M. R., Dyde R. T., Jenkin H. L. (2004). Failure to update spatial location correctly using visual cues alone. Journal of Vision, 4(8), 381–381. 10.1167/4.8.381 [DOI] [Google Scholar]
- Héjja-Brichard Y., Rima S., Rapha E., Durand J.-B., Cottereau B. R. (2020). Stereomotion processing in the nonhuman primate brain. Cerebral Cortex, 30(8), 4528–4543. 10.1093/cercor/bhaa055 [DOI] [PubMed]
- Hiris E., Blake R. (1996). Direction repulsion in motion transparency. Visual Neuroscience, 13(1), 187–197. 10.1017/S0952523800007227 [DOI] [PubMed] [Google Scholar]
- Kapadia M. K., Westheimer G., Gilbert C. D. (2000). Spatial distribution of contextual interactions in primary visual cortex and in visual perception. Journal of Neurophysiology, 84(4), 2048–2062. 10.1152/jn.2000.84.4.2048 [DOI] [PubMed] [Google Scholar]
- Kim J., Wilson H. R. (1996). Direction repulsion between components in motion transparency. Vision Research, 36(8), 1177–1187. 10.1016/0042-6989(95)00153-0 [DOI] [PubMed] [Google Scholar]
- Kim J., Wilson H. R. (1997). Motion integration over space: Interaction of the center and surround motion. Vision Research, 37(8), 991–1005. 10.1016/S0042-6989(96)00254-4 [DOI] [PubMed] [Google Scholar]
- Koenderink J. J., Bouman M. A., Bueno de Mesquita A. E., Slappendel S. (1978a). Perimetry of contrast detection thresholds of moving spatial sine wave patterns. I. The near peripheral visual field (eccentricity 0°–8°). Journal of the Optical Society of America, 68(6), 845–849. 10.1364/JOSA.68.000845 [DOI] [PubMed] [Google Scholar]
- Koenderink J. J., Bouman M. A., Bueno de Mesquita A. E., Slappendel S. (1978b). Perimetry of contrast detection thresholds of moving spatial sine wave patterns. II. The far peripheral visual field (eccentricity 0°–50°). Journal of the Optical Society of America, 68(6), 850–854. 10.1364/JOSA.68.000850 [DOI] [PubMed] [Google Scholar]
- Lappe M., Bremmer F., van den Berg A. V. (1999). Perception of self-motion from visual flow. Trends in Cognitive Sciences, 3(9), 329–336. 10.1016/S1364-6613(99)01364-9 [DOI] [PubMed] [Google Scholar]
- Layton O. W., Fajen B. R. (2016). A neural model of MST and MT explains perceived object motion during self-motion. The Journal of Neuroscience, 36(31), 8093–8102. 10.1523/JNEUROSCI.4593-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Levi D. M., Schor C. M. (1984). Spatial and velocity tuning of processes underlying induced motion. Vision Research, 24(10), 1189–1195. 10.1016/0042-6989(84)90174-3 [DOI] [PubMed] [Google Scholar]
- Marshak W., Sekuler R. (1979). Mutual repulsion between moving visual targets. Science (New York, N.Y.), 205(4413), 1399–1401. 10.1126/science.472756 [DOI] [PubMed] [Google Scholar]
- Mather G., Moulden B. (1980). A simultaneous shift in apparent direction: Further evidence for a “distribution-shift” model of direction coding. Quarterly Journal of Experimental Psychology, 32(2), 325–333. 10.1080/14640748008401168 [DOI] [PubMed] [Google Scholar]
- McCool C. H., Britten K. H. (2008). Cortical processing of visual motion. The Senses: Vision II, 2, 157–187. 10.1016/b978-012370880-9.00308-x [DOI] [Google Scholar]
- Movshon J. A., Adelson E. H., Gizzi M. S., Newsome W. T. (1985). The analysis of moving patterns. In Chagas C., Gattass R., Gross C. (Eds.), Pattern recognition mechanisms (pp. 117–151). Springer. [Google Scholar]
- Newsome W. T., Britten K. H., Movshon J. A. (1989). Neuronal correlates of a perceptual decision. Nature, 341(6237), 52–54. 10.1038/341052a0 [DOI] [PubMed] [Google Scholar]
- Niehorster D. C., Li L. (2017). Accuracy and tuning of flow parsing for visual perception of object motion during self-motion. i-Perception, 8(3), 2041669517708206. 10.1177/2041669517708206 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Palmer S. E. (1999). Vision science: Photons to phenomenology. MIT Press. [Google Scholar]
- Perrone J. A. (1992). Model for the computation of self-motion in biological systems. Journal of the Optical Society of America A, 9(2), 177–194. 10.1364/JOSAA.9.000177 [DOI] [PubMed] [Google Scholar]
- Petrov Y., McKee S. P. (2009). The time course of contrast masking reveals two distinct mechanisms of human surround suppression. Journal of Vision, 9(1), 21–21. 10.1167/9.1.21 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Polat U., Sagi D. (1993). Lateral interactions between spatial channels: Suppression and facilitation revealed by lateral masking experiments. Vision Research, 33(7), 993–999. 10.1016/0042-6989(93)90081-7 [DOI] [PubMed] [Google Scholar]
- Post R. B., Heckmann T. (1986). Induced motion and apparent straight ahead during prolonged stimulation. Perception & Psychophysics, 40(4), 263–270. 10.3758/BF03211506 [DOI] [PubMed] [Google Scholar]
- Prins N., Kingdom F. A. A. (2018). Applying the model-comparison approach to test specific research hypotheses in psychophysical research using the Palamedes toolbox. Frontiers in Psychology, 9, 1250–1250. 10.3389/fpsyg.2018.01250 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rauber H. J., Treue S. (1999). Revisiting motion repulsion: Evidence for a general phenomenon? Vision Research, 39(19), 3187–3196. 10.1016/S0042-6989(99)00025-5 [DOI] [PubMed] [Google Scholar]
- Reinhardt-Rutland A. H. (1988). Induced movement in the visual modality: An overview. Psychological Bulletin, 103(1), 57–71. 10.1037/0033-2909.103.1.57 [DOI] [PubMed] [Google Scholar]
- Rosa M. G. P., Tweedale R. (2001). The dorsomedial visual areas in new world and old world monkeys: Homology and function. European Journal of Neuroscience, 13(3), 421–427. 10.1046/j.0953-816X.2000.01414.x [DOI] [PubMed] [Google Scholar]
- Rushton S. K., Warren P. A. (2005). Moving observers, relative retinal motion and the detection of object movement. Current Biology, 15(14), R542–R543. 10.1016/j.cub.2005.07.020 [DOI] [PubMed] [Google Scholar]
- Sasaki R., Angelaki D. E., DeAngelis G. C. (2017). Dissociation of self-motion and object motion by linear population decoding that approximates marginalization. The Journal of Neuroscience, 37(46), 11204–11219. 10.1523/JNEUROSCI.1177-17.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sasaki R., Angelaki D. E., DeAngelis G. C. (2019). Processing of object motion and self-motion in the lateral subdivision of the medial superior temporal area in macaques. Journal of Neurophysiology, 121(4), 1207–1221. 10.1152/jn.00497.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sasaki R., Anzai A., Angelaki D. E., DeAngelis G. C. (2020). Flexible coding of object motion in multiple reference frames by parietal cortex neurons. Nature Neuroscience, 23(8), 1004–1015. 10.1038/s41593-020-0656-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schulman P. H. (1981). Antagonistic effects of adjacency and apparent speed in induced movement. Journal of Experimental Psychology: Human Perception and Performance, 7(5), 985–994. 10.1037/0096-1523.7.5.985 [DOI] [PubMed] [Google Scholar]
- Smith A. T., Wall M. B., Thilo K. V. (2012). Vestibular inputs to human motion-sensitive visual cortex. Cerebral Cortex, 22(5), 1068–1077. 10.1093/cercor/bhr179 [DOI] [PubMed] [Google Scholar]
- Smith A. T., Wall M. B., Williams A. L., Singh K. D. (2006). Sensitivity to optic flow in human cortical areas MT and MST. European Journal of Neuroscience, 23(2), 561–569. 10.1111/j.1460-9568.2005.04526.x [DOI] [PubMed] [Google Scholar]
- Smith M., Majaj N., Movshon J. (2005). Dynamics of motion signaling by neurons in macaque area MT. Nature Neuroscience, 8, 220–228. 10.1038/nn1382 [DOI] [PubMed] [Google Scholar]
- Takao S., Watanabe K., Clifford C. W. G. (2020). Angular tuning of tilt illusion depends on stimulus duration. Vision Research, 175, 85–89. 10.1016/j.visres.2020.07.007 [DOI] [PubMed] [Google Scholar]
- Takemura H., Ashida H., Amano K., Kitaoka A., Murakami I. (2012). Neural correlates of induced motion perception in the human brain. Journal of Neuroscience, 32(41), 14344–14354. 10.1523/JNEUROSCI.0570-12.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tanaka K., Hikosaka K., Saito H. A., Yukie M., Fukada Y., Iwai E. (1986). Analysis of local and wide-field movements in the superior temporal visual areas of the macaque monkey. The Journal of Neuroscience, 6(1), 134–144. 10.1523/JNEUROSCI.06-01-00134.1986 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thelin E. (1927). Perception of relative visual motion. Journal of Experimental Psychology, 10(4), 321–349. 10.1037/h0075447 [DOI] [Google Scholar]
- Thompson P. (1982). Perceived rate of movement depends on contrast. Vision Research, 22(3), 377–380. 10.1016/0042-6989(82)90153-5 [DOI] [PubMed] [Google Scholar]
- Wall M., Smith A. (2008). The representation of ego motion in the human brain. Current Biology, 18(3), 191–194. 10.1016/j.cub.2007.12.053 [DOI] [PubMed] [Google Scholar]
- Wallach H., Becklen R. (1983). An effect of speed on induced motion. Perception & Psychophysics, 34(3), 237–242. 10.3758/BF03202951 [DOI] [PubMed] [Google Scholar]
- Warren P. A., Rushton S. K. (2004). Optic flow components and the induced motion illusion. Journal of Vision, 4(8), 893–893. 10.1167/4.8.893 [DOI] [Google Scholar]
- Warren P. A., Rushton S. K. (2007). Perception of object trajectory: Parsing retinal motion into self and object movement components. Journal of Vision, 7(11), 2–2. 10.1167/7.11.2 [DOI] [PubMed] [Google Scholar]
- Warren P. A., Rushton S. K. (2008). Evidence for flow-parsing in radial flow displays. Vision Research, 48(5), 655–663. 10.1016/j.visres.2007.10.023 [DOI] [PubMed] [Google Scholar]
- Warren P. A., Rushton S. K. (2009). Optic flow processing for the assessment of object movement during ego movement. Current Biology, 19(18), 1555–1560. 10.1016/j.cub.2009.07.057 [DOI] [PubMed] [Google Scholar]
- Wiese M., Wenderoth P. (2007). The different mechanisms of the motion direction illusion and after effect. Vision Research, 47(14), 1963–1967. 10.1016/j.visres.2007.04.010 [DOI] [PubMed] [Google Scholar]
- Wild B. (2018). How does the brain tell self-motion from object motion? The Journal of Neuroscience, 38(16), 3875–3877. 10.1523/JNEUROSCI.0039-18.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilson H. R., Kim J. (1994). A model for motion coherence and transparency. Visual Neuroscience, 11(6), 1205–1220. 10.1017/S0952523800007008 [DOI] [PubMed] [Google Scholar]
- Wurtz R. H. (1998). Optic flow: A brain region devoted to optic flow analysis? Current Biology, 8(16), R554–R556. 10.1016/S0960-9822(07)00359-4 [DOI] [PubMed] [Google Scholar]
- Zivotofsky A. Z. (2004). The Duncker illusion: Intersubject variability, brief exposure, and the role of eye movements in its generation. Investigative Ophthalmology & Visual Science, 45(8), 2867–2872. 10.1167/iovs.04-0031 [DOI] [PubMed] [Google Scholar]
- Zivotofsky A. Z., Averbuch-Heller L., Thomas C. W., Das V. E., Discenna A. O., Leigh R. J. (1995). Tracking of illusory target motion: Differences between gaze and head responses. Vision Research, 35(21), 3029–3035. 10.1016/0042-6989(95)00067-A [DOI] [PubMed] [Google Scholar]