The precedence effect: Fusion and lateralization measures for headphone stimuli lateralized by interaural time and level differences

Andrew D Brown; G Christopher Stecker

doi:10.1121/1.4796113

. 2013 May;133(5):2883–2898. doi: 10.1121/1.4796113

The precedence effect: Fusion and lateralization measures for headphone stimuli lateralized by interaural time and level differences

Andrew D Brown ^1,^a), G Christopher Stecker ¹

PMCID: PMC3663858 PMID: 23654394

Abstract

The present investigation assessed fusion and localization dominance aspects of the precedence effect under headphones across a variety of stimulus conditions in 10 normal-hearing listeners. Listeners were presented with “lead-lag” pairs of brief (123 μs) impulses or trains of such pairs lateralized by interaural time or level differences (ITD or ILD). Listeners used a touch-sensitive display to indicate for the final lead-lag pair presented on each trial (1) whether one or two locations were perceived and (2) the location perceived. In the event two locations were perceived, subjects were further instructed to indicate the left-most location perceived. Results demonstrated that lead-lag fusion was more robust for stimuli lateralized by ITD than ILD, particularly when cues of the test stimulus differed from cues of the preceding “buildup” stimulus, consistent with Krumbholz and Nobbe [(2002). J. Acoust. Soc. Am. 112, 654–663]. Unexpectedly, results also demonstrated reduced localization dominance with increasing lead-lag delay, suggesting that the fusion aspect of the precedence effect may be dissociated from the localization dominance aspect under buildup. It is thus argued that buildup of fusion might be understood more generally as an example of auditory object formation rather than a special facility for enhanced sound localization.

INTRODUCTION

Normal-hearing listeners localize sound sources accurately in ordinary listening environments (e.g., rooms) by responding to the auditory spatial cues carried by the early arriving sound rather than the spurious cues carried by later-arriving reflections and reverberation. The various phenomena associated with this observation are known collectively as the “precedence effect” (Wallach et al., 1949; for excellent reviews, see Blauert, 1997, and Litovsky et al., 1999). The precedence effect, studied in a variety of psychophysical and physiological paradigms over the past six decades, depends on essentially two phenomena: (1) fusion of the early arriving (“lead”) and late-arriving (“lag”) sound and (2) dominance of the localization cues carried by the lead over those carried by the lag (termed localization dominance; see Litovsky et al., 1999). One of the most surprising findings related to the precedence effect since its first description (Wallach et al., 1949) was reported by Clifton and colleagues (e.g., Clifton, 1987; Clifton and Freyman, 1989), who discovered that the temporal limit of lead-lag fusion, known as the “echo threshold,” is dynamic and dependent on prior stimulation. Using two (left and right) loudspeakers in a free field paradigm, Clifton and Freyman (1989) demonstrated that repetition of a fixed lead-lag stimulus (e.g., right-lead, left-lag) led to elevation of the echo threshold (i.e., enhancement of fusion, termed “buildup”), while subsequent presentation of a binaurally opposite lead-lag stimulus (e.g., left-lead, right-lag), led to an apparent resetting of the echo threshold (termed “breakdown”). Multiple studies of this phenomenon over the past two decades (e.g., Clifton et al., 1994; Grantham, 1996; McCall et al., 1998; Djelani and Blauert, 2001) have established that a listener's echo threshold, generally on the order of 5–10 ms at baseline (for lead-lag pairs of clicks or other impulsive stimuli), may be elevated to 15–25 ms following buildup, and reduced back to 5–10 ms following breakdown. Studies focused specifically on the breakdown phenomenon have additionally established that breakdown can be induced not only by a “switching” of the lead and lag speakers, but also by a sudden change in the lead-lag delay (Clifton et al., 1994), or a sudden change in the spectrum of the echo (McCall et al., 1998). Djelani and Blauert (2001) later demonstrated that subsequent presentation of a test pair identical to the original conditioner pairs gave extant buildup of fusion (i.e., that buildup was maintained across presentation of the breakdown stimulus, a condition they termed “re-buildup”).

Based on these observations, several investigators have suggested that dynamic aspects of the precedence effect reflect listeners' “construction of an internal model of auditory space” (Freyman and Keen, 2006; Keen and Freyman, 2009; Sanders et al., 2011) specific to a particular reverberant context: Echoes in agreement with the internal model become more effectively suppressed or fused with the direct sound (buildup), while echoes in violation of the internal model remain treated as separate sources (breakdown) (cf. Clifton et al., 1994; Grantham, 1996; Blauert, 1997; Litovsky et al., 1999; Freyman and Keen, 2006; Keen and Freyman, 2009). While it is tempting to assume that such a model would embody listeners' implicit knowledge of the spatial geometry of the perceived room and its sound source(s), i.e., the angles and distances (or equivalently directions and delays) involved (e.g., Keen and Freyman, 2009), an alternative possibility is that the “internal model of auditory space” actually embodies listeners' implicit knowledge concerning the spatial acoustics of the environment, e.g., the statistics of the interaural time differences (ITD), interaural level differences (ILD), or spectral features of the direct (early) and reflected (later-arriving) sound. According to the first possibility, buildup reflects increasing confidence in knowledge about the geometric arrangement of a listening environment, leading to more accurate localization of the veridical sound source(s) and stronger suppression of misleading echoes. According to the second possibility, buildup reflects enhanced fusion or capture of spatial information across acoustic cues, in much the same way that auditory-visual capture (i.e., ventriloquism) exploits the statistics of auditory and visual spatial cues. Importantly, the two possibilities make distinct predictions for breakdown effects: according to the first, breakdown results from geometric violations (such as change in source position or room shape) without regard to the nature of underlying acoustic cues; according to the second, breakdown results from changes in the acoustic cues that violate the expected statistical relationships of those cues. The current study aims to evaluate these different accounts of dynamic precedence by examining buildup and breakdown effects for stimuli lateralized by either ITD or ILD in the same group of listeners.

Different contributions of ITD and ILD to precedence

In a novel study of buildup and breakdown phenomena, Krumbholz and Nobbe (2002) presented listeners with single pairs or trains of pairs of lead-lag clicks over headphones to measure buildup and breakdown effects for stimuli lateralized by either ITD or ILD. This experiment was a departure from most other studies of the precedence effect, which had almost exclusively employed free field, headphone ITD, or virtual acoustic space stimuli, where the contributions of ITD and ILD to listeners' judgments were not independently evaluated (e.g., Wallach et al., 1949; Thurlow and Parks, 1961; Clifton, 1987; Clifton and Freyman, 1989; Shinn-Cunningham et al., 1993; Yang and Grantham, 1997; Djelani and Blauert, 2001; Freyman and Keen, 2006; Keen and Freyman, 2009; see Litovsky et al., 1999). Krumbholz and Nobbe (2002) measured significant buildup of fusion echo thresholds for ITD stimuli, with thresholds increasing from a mean of 7 ms at baseline to 16 ms after stimulus repetition, and lesser but significant buildup for ILD stimuli, with thresholds increasing from a mean of 5 ms at baseline to 11 ms after stimulus repetition. The breakdown effect, in contrast, was evidenced only for binaurally switched ILD stimuli. Echo thresholds for breakdown-ILD stimuli (buildup conditioner + binaurally switched test stimulus) were comparable to thresholds for baseline-ILD stimuli (mean of ∼4 ms), consistent with the breakdown effect observed in the free field (e.g., Clifton and Freyman, 1989). Echo thresholds for breakdown-ITD stimuli, in contrast, remained comparable to thresholds in the buildup-ITD condition (mean of ∼14 ms). That is, presentation of a lead-lag ITD test stimulus carrying cues opposite the conditioner stimulus produced no breakdown of fusion. The findings of Krumbholz and Nobbe (2002) are thus striking in the context of previous studies of buildup and breakdown (e.g., Clifton, 1987; Clifton and Freyman, 1989; Clifton et al., 1994; Djelani and Blauert, 2001; Freyman and Keen, 2006; Keen and Freyman, 2009). Based on their data, breakdown of fusion in free-field lead-lag “switch” paradigms would seem necessarily mediated by sensitivity to changes in the ILD, while the concomitant change in the ITD might be inconsequential.

A number of questions about the precedence effect might be asked on the basis of Krumbholz and Nobbe's (2002) report. For example, the authors did not include a “re-buildup” condition (after Djelani and Blauert, 2001) in their experiment to assess whether echo threshold had actually “broken down” for ILD stimuli, or whether thresholds reflected baseline thresholds for novel stimuli. Of greater interest, while some psychophysical evidence suggests that ITD and ILD are combined at some level of binaural processing into a common code (e.g., Maier et al., 2010), the results of Krumbholz and Nobbe (2002; see their Discussion in particular) suggest that the precedence effect, in terms of lead-lag fusion, strongly depends on which binaural cue (ITD or ILD) is manipulated. A geometric account of buildup and breakdown phenomena holds that the effects of stimulus repetition should depend only on the spatial perception induced by, and not on the specific acoustic cues carried by, the stimulus. Following on the report of Krumbholz and Nobbe (2002), the present study reexamines buildup and breakdown phenomena with particular attention to the room acoustics hypothesis of Freyman and colleagues (e.g., Freyman and Keen, 2006; Keen and Freyman, 2009; Sanders et al., 2011). Three experiments designed to measure both fusion echo thresholds and subjective lateralization for stimuli carrying nonzero ITD or ILD cues presented in isolation or following appropriately designed conditioner stimuli are described. Data are presented and briefly discussed for each experiment, followed by summary points and general discussion in the context of the precedence literature at large.

COMMON EXPERIMENTAL METHODS

All procedures, including subject recruitment, obtaining subject consent, and subject testing followed the guidelines of the University of Washington Human Subjects Division and were reviewed and approved by the cognizant Institutional Review Board.

Subjects

Ten subjects aged 20–58 (four female) completed participation in this experiment. All subjects were naive to the purpose of the experiment and were compensated for their participation. Subjects reported normal hearing and demonstrated pure-tone detection thresholds <20 dB HL with <10 dB asymmetry between left and right ears at octave frequencies 250–8000 Hz.

Stimuli and procedure

All testing was completed in a sound-attenuated booth (IAC, Bronx, NY). Subjects were seated in a swivel chair facing a large (80-cm diagonal) touch-sensitive display (elo Touchsystems 3200L, Tyco Electronics, Bermuda). Stimuli across all phases of all experiments were comprised of monophasic pulses (“clicks”) 123 μs (6 samples) in duration with an average binaural level of ∼60 dB SPL. Pairs and trains of clicks (to be detailed subsequently for each experiment) were programmed in MATLAB (MathWorks, Natick, NJ), synthesized at 48.828 kHz (Tucker-Davis Technologies RP2.1, Alachua, FL) and presented under closed-back electrostatic headphones (STAX Model 4070, Saitama, Japan). Non-zero ITD values were imposed by delaying the signal to the left channel for right-favoring ITD or delaying the signal to the right channel for left-favoring ITD. Non-zero ILD values were imposed by amplifying the signal to the favored earphone by half the total ILD and attenuating the signal to the opposite earphone by an equal amount. Subjects completed each experiment by interacting with the touch-sensitive display as described in the following sections.

Training

Prior to participation in experiment I (Sec. 3), subjects were trained in a simple two-alternative-forced-choice task. The purpose of this training was (1) to familiarize subjects with the lead-lag experimental stimuli, and (2) to provide a standard for judgments in the inherently subjective experimental task. Specifically, stimuli in the training task were pairs of lead-lag dichotic clicks (separated by 1–50 ms) in which the lag carried an ITD or ILD either identical to or opposite the ITD or ILD carried by the lead. On each trial, the subjects' task was to indicate whether the stimuli had consisted of signals from “one location” (true if lead and lag cues were identical, indicated by touching the panel at the top of display) or “two locations” (true if lead and lag cues favored opposite ears, indicated by touching the panel at the bottom of the display). Immediate correct/incorrect feedback was displayed on the monitor after each trial. In addition to verifying that subjects were sensitive to the nonzero ITD or ILD values carried by the stimuli, this task served to train subjects to select the “two locations” panel only when two discrete locations were perceived, rather than simply when two sounds were perceived. This method emphasizes a more stringent definition of echo threshold (summarized by Blauert, 1997), which requires that the lead and the lag be perceived at discrete, though not necessarily their veridical, locations. All subjects completed at least 2 h of training runs, evenly divided between ITD stimuli and ILD stimuli. Several subjects required additional training to reach criterion (90% correct or better at lead-lag delays >25 ms) for either ITD or ILD conditions (or both), suggestive of lesser sensitivity to the cues in those subjects.1

ITD-ILD matching task

The questions addressed by experiments II and III (see Secs. 4, 5) required that ITD and ILD values employed in test stimuli produced equivalent lateralization for each subject (approximate equivalence was assumed for experiment I based on the data of Krumbholz and Nobbe, 2002). Thus, prior to testing in experiments II and III, an explicit ITD-ILD matching task was completed by each subject. On each trial of the matching task, subjects were initially presented with a “standard” click carrying ±308 μs ITD (experiment II matching task) or ±615 μs ITD (experiment III matching task), followed 1 s later by a second “pointer” click carrying a random ILD from the range −15 to +15 dB ILD. By convention, positive cue values favored the right ear, and negative cue values favored the left ear. On each trial, subjects were instructed to adjust the perceived location of the ILD pointer to match the perceived location the ITD standard by using buttons displayed on the touch screen monitor. Three left arrow buttons and 3 right arrow buttons provided for “coarse” (±6 dB), “medium” (±2 dB), and “fine” (±0.3 dB) adjustment. After each adjustment, the standard-pointer click pair was automatically replayed. Subjects were free to make as many adjustments as necessary to match the location of the pointer to the location of the standard. Once subjects were satisfied with the pointer-standard match, they pressed a “match” button at the bottom of the display, and the next trial began automatically. Each run consisted of 20 trials; subjects completed 4 runs for left- and right-leading ITD standards (8 total) for each matching task (experiment II, experiment III). Matching data are presented in Secs. 4, 5).

Main experimental task

Following completion of training (experiment I) or ITD-ILD matching tasks (experiments II and III), subjects began testing. The subject's task on each trial was the same across all conditions and all experiments. Following presentation of either a brief silent period or a conditioner stimulus (see Secs. 3, 4, 5), a “lead-lag” test stimulus was presented. The subject was instructed to indicate for this stimulus (i.e., ignoring the conditioner stimulus, if present) (1) whether one location (upper panel on the touch screen) or two locations (lower panel on the touch screen) had been perceived and (2) to indicate the apparent lateral location of the stimulus within the selected panel (a perceptual scaling task). If two locations were perceived, subjects were further instructed to indicate the left-most location perceived. Each response thus carried two independent components of data. Specifically, which panel the subject touched indicated whether the lead and lag clicks had appeared “fused,” and where the subject touched within the selected panel indicated the extent of localization dominance (i.e., the extent to which the reported spatial percept agreed with cues carried by the lead versus cues carried by the lag). Both aspects (fusion and localization dominance2) were expected to change as a function of lead-lag delay, the parameter varied from trial-to-trial (see below). Trials within a given run were of a single stimulus condition (e.g., Baseline ITD or Buildup ILD; see Secs. 3, 4, 5); stimulus conditions were presented in random order across subjects. Subjects completed at least one practice run in each condition before testing commenced.

Both adaptive methods (e.g., Krumbholz and Nobbe, 2002) and methods of constant stimuli (e.g., Clifton and Freyman, 1989) have been used for echo threshold estimation in studies of the precedence effect. Each has its advantages (e.g., the efficiency of an adaptive staircase versus the completeness of an empirically measured psychometric function). In efforts to ensure consistency of measurement, three echo thresholds were estimated for each run in the present investigation using three simultaneous and independent procedures (programmed in MATLAB). On each trial within a given run, the lead-lag delay was drawn randomly from one of two adaptive tracks (one ascending, starting value 1 ms; one descending, starting value 50 ms) or a constant set of delays ([0.43,3 3, 6, 9, 15, 25, 50 ms], 5 trials per stimulus, each equally probable on a given trial). Each adaptive staircase followed a 1-up, 1-down rule to estimate the 50% echo threshold; logarithmic step sizes of 0.2 (delay_new = delay_old × 10^±0.2) were employed up to the fourth reversal in each track, then decreased to 0.05 (delay_new = delay_old × 10^±0.05) for the duration of the run. Each track terminated after 8 reversals. The threshold for each was taken as the geometric mean ITD of the final 4 reversals. A third threshold was taken as the lead-lag delay at the interpolated 50% point on a psychometric function that was fit to responses for the constant set of values once the run had finished (using the custom MATLAB function psignfit, Wichmann and Hill, 2001). In experiments I and II, subjects completed four runs in each condition, giving 12 total threshold estimates per condition per subject. In experiment III, subjects completed two runs in each condition, giving six threshold estimates per condition. Since two out of three threshold trackers were adaptive, the lead-lag delay presented on a given trial was occasionally related to the lead-lag delay presented on the previous trial; nonetheless, tracker randomization and the presence of a constant stimulus tracker made it nearly impossible to anticipate the delay on a given trial. Finally, lateralization responses did not affect the trial-to-trial progression of experimental runs; right-lead, left-lag and left-lead, right-lag stimuli were presented in random order over the duration of a run, and lateralization data were analyzed offline after completion of the experiment.

Analysis

Echo thresholds were compared across conditions and across subjects by repeated-measures analysis of variance (ANOVA) and paired t-tests. Significant differences are given by p < 0.05, with corrections for multiple comparisons applied as appropriate. Substantial individual variability in echo thresholds was anticipated on the basis of past studies of the precedence effect and binaural sensitivity in general (e.g., Wallach et al., 1949; McFadden et al., 1973; Freyman et al., 1991; Grantham, 1996; Yang and Grantham, 1997; Brown and Stecker, 2010); such differences were observed in some conditions of the present investigation. Individual subject data are thus given for each experiment along with mean data.

Lateralization data consisted of horizontal position values giving the location within either panel (“one location” or “two locations”) that the subject touched on each trial, ranging from −1 (maximum left) to +1 (maximum right). For all experiments, lateralization responses were first grouped according to the sidedness of the lead stimulus (left-lead, right-lag or right-lead, left-lag), and then again according to whether the subject touched the upper panel (“one location” trials) or the lower panel (“two location” trials). Sorted responses were plotted as a function of lead-lag delay, and weighted lines of best fit (least-squares) were generated to summarize trends in lateralization for each case as a function of lead-lag delay (dashed black and red lines in Figs. 3, 6, and 9). In some cases, one-sample t-tests were conducted on the slopes of these best fit lines (described separately for each experiment) to test the null hypothesis that the slope of lateralization across lead-lag delay was zero (i.e., that the magnitude of lateralization did not depend on lead-lag delay).

Lateralization data for experiment I. (a) Mean “one location” (black) and “two locations” (red) responses for right-lead, left-lag trials. Means are weighted by the number of subjects and trials they consider; error bars give ±1 weighted SD. Dashed black and red lines give weighted linear fits to the mean data (note logarithmic lead-lag delay axis). (b) Responses for left-lead, right-lag stimuli. Note that echo thresholds (inward pointing triangles) were measured independent of lateralization and are included for comparison with lateralization data only. For further explanation of lateralization analyses, see Sec. 3A2.

Lateralization data for experiment II. Legend as in Fig. 3.

Lateralization data for experiment III. Legend as in Fig. 3.

EXPERIMENT I: DYNAMIC PRECEDENCE EFFECTS FOR ITD AND ILD

The goal of this experiment was to replicate and extend the study of Krumbholz and Nobbe (2002), in which echo thresholds were measured for pairs of lead-lag clicks lateralized by ITD or ILD across three different stimulus types: baseline, buildup, and breakdown. Stimuli in the present experiment were of four different types: (1) Baseline stimuli consisted of a single lead-lag click pair, (2) Buildup stimuli consisted of 12 “conditioner” lead-lag click pairs and a final test pair identical to the conditioner pairs, (3) Breakdown stimuli consisted of 12 conditioner pairs and a “switched” test pair in which the interaural cues were swapped between the lead and lag clicks, and (4) Retest stimuli consisted of 11 conditioner pairs, an intervening switched pair, and a final test pair identical to the 11 conditioner pairs [after the re-buildup condition used by Djelani and Blauert (2001) to demonstrate maintenance of buildup following breakdown]. In ITD conditions, lead clicks always carried +/−308 μs ITD (i.e., 308 μs right-favoring or 308 μs left-favoring ITD) and 0 dB ILD, and lag clicks always carried −/+308 μs ITD and 0 dB ILD (i.e., an opposing ITD cue). Correspondingly, in ILD conditions, lead clicks always carried +/−10 dB ILD and 0 μs ITD, and lag clicks always carried −/+10 dB ILD and 0 μs ITD. These and other key stimulus parameters are illustrated in Fig. 1. Stimuli were designed to match those employed by Krumbholz and Nobbe (2002) in their ITD and ILD conditions; cue values were expected to produce approximately equivalent lateralization for ITD and ILD stimuli (see also Fig. 4). The major novelty of the present experiment was its simultaneous assessment of fusion and localization dominance, the latter having never been measured for “buildup” stimuli (cf. Clifton, 1987; Freyman et al., 1991; Clifton et al., 1994; Djelani and Blauert, 2001; Freyman and Keen, 2006; Keen and Freyman, 2009).

Schematic illustration of stimuli for experiment I. Test lead-lag click pairs (bold ticks) were preceded by silence (Baseline), 12 identical lead-lag click pairs (Buildup), 12 binaurally opposite lead-lag click pairs (Breakdown), or 11 identical lead-lag click pairs and one intervening opposite pair (Retest). Stimuli in the left column were for ITD conditions; stimuli in the right column were for ILD conditions. A, lead-lag delay (varied trial-to-trial, see text); B, 308 μs ITD; C, 10 dB ILD; D, 250 ms inter-stimulus interval between conditioner lead-lag pairs; E, 500 ms pause between conditioner and test.

Data from experiment II ITD-ILD matching task. Points give the mean dB ILD matched to a 308 μs ITD standard by each subject (±SEM across 8 runs). For comparison, the dashed line plots the 10 dB ILD used for all subjects in experiment I.

Results

Echo thresholds

Figure 2 gives individual subject (symbols) and mean (filled circles, error bars ± SE) echo thresholds for ITD and ILD conditions of experiment I (12 threshold estimates per condition per subject). Mean echo thresholds were greater for ITD than ILD in every condition, with the disparity particularly evident in the Breakdown condition. Of particular interest in the context of the studies of Djelani and Blauert (2001) and Krumbholz and Nobbe (2002), the echo threshold in the Retest ILD condition appeared to be comparable to the echo threshold in the Buildup ILD condition. Thus, lower Breakdown ILD thresholds notwithstanding, buildup was apparently “maintained” for the original lead-lag ILD stimulus. Individual data support the mean trends, with some individual differences evident (e.g., subjects 0601 and 1014 failing to show any breakdown effect in either ITD or ILD conditions). Threshold data were submitted to a 4 × 2 (condition × cue) repeated-measures ANOVA. The main effects of cue [F(1,9) = 27.51, p < 0.05] and condition [F(3,27) = 16.46, p < 0.05], and the cue × condition interaction [F(3,27) = 4.31, p < 0.05] were all significant. Follow-up paired t-tests with set-wise correction for multiple comparisons demonstrated that ITD-based echo thresholds were significantly higher than ILD-based echo thresholds in Baseline [t(9) = 5.31, p < 0.0125], Buildup [t(9) = 3.16, p < 0.0125], and Breakdown [t(9) = 4.09, p < 0.0125] conditions, but not in the Retest condition [t(9) = 2.57, p = 0.030]. An additional set of tests demonstrated that ILD-based echo thresholds were significantly higher in the Buildup condition than in the Baseline [t(9) = 4.78, p < 0.025] and Breakdown [t(9) = 2.76, p < 0.025] conditions. A final set of tests demonstrated that echo thresholds within ITD conditions were significantly higher than Baseline in Buildup [t(9) = 4.15, p < 0.0125], Breakdown [t(9) = 3.39, p < 0.0125], and Retest [t(9) = 4.52, p < 0.0125] conditions, while Breakdown and Buildup thresholds were not statistically different [t(9) = 1.86, p = 0.100].

Lateralization responses

The adaptive threshold estimation procedure introduced dozens of unique lead-lag delays across runs for each subject additional to the constant set of lead-lag delays tested in all runs for all subjects (see Sec. 2). Thus, to assess lateralization responses at the group level, mean lateralization values were first computed for each subject at each tested lead-lag delay. The cross-subject mean was then computed across lead-lag delay as the running mean of a sliding 3-ms window from 1 to 100 ms (interval 0.1 ms). A weight was determined for each such mean according to the number of subjects for which data existed (minimum 0, maximum 10). For means that comprised two subjects or more, the weighted standard deviation was also computed. Finally, means and their weights were used to compute a weighted line of best fit (least-squares) to lateralization responses as a function of lead-lag delay. This procedure was applied separately to lateralization data for “one location” and “two location” trials (i.e., for trials on which subjects responded in the upper panel on the display versus the lower panel; see Sec. 2).

Figures 3a, 3b plot cross-subject mean lateralization responses as a function of lead-lag delay for all conditions of experiment I [Fig. 3a, right-lead, left-lag stimuli; Fig. 3b, left-lead, right-lag stimuli]. Within each panel, axes are arranged such that the magnitude of lateralization is given by the leftward (toward −1) or rightward (toward +1) deviation from the midline (dotted vertical line). Lead-lag delay is plotted in the vertical dimension. Black filled circles give weighted means for “one location” trials; red filled circles give weighted means for “two locations” trials. The size of each point gives the weight of each mean, and error bars give the weighted standard deviation. Finally, dashed black and red lines give weighted linear fits to the mean lateralization responses (nonlinear in appearance due to the logarithmic lead-lag delay axes). Fusion echo thresholds (see Fig. 2), included for visual reference only, are given by inward-pointing triangles along the lead-lag delay axes; as described in Sec. 2, echo thresholds and lateralization were measured independently.

Considering first Fig. 3a (responses for right-lead, left-lag stimuli), consistent with expectations, lateralization responses on trials for which subjects reported “one location” (black) generally fell to the right of midline, in agreement with the right-favoring ITD or ILD carried by the lead, while lateralization responses on trials for which the subject reported perceiving “two locations” (red) generally fell to the left of midline, in agreement with the left-favoring ITD or ILD carried by the lag. Wholly unexpected was an apparent reduction in the magnitude of lateralization for “one location” responses with increasing lead-lag delay. This pattern was particularly evident in conditions featuring elevated echo thresholds (e.g., Buildup conditions) where mean lateralization responses for “one location” trials at lead-lag delays beyond ∼20 ms fell close to or at the midline. This pattern, also reflected in individual subject data (not shown), appeared to hold for both ITD and ILD conditions. Submitting the slopes of all “one location” best fit lines in Fig. 3a to a one-sample t-test (against 0) revealed that slopes taken across conditions were significantly negative [t(7) = −4.83, p < 0.05].

Critically, this pattern of reduced lateralization of “fused” lead-lag stimuli with increasing lead-lag delay was also apparent for left-lead, right-lag stimuli [Fig. 3b]. For these stimuli, given the task instructions, subjects should have responded to the left-favoring lead regardless of fusion (i.e., whether one or two locations were perceived). Nonetheless, as for right-lead, left-lag trials, the magnitude of lateralization for “fused” responses clearly decreased with increasing lead-lag delay, giving rise to significantly positive slopes for “one location” best fit lines [one-sample t-test against 0, t(7) = 3.93, p < 0.05]. In contrast, in the absence of fusion (i.e., for “two locations” responses), there was a trend for slightly increased lateralization with increasing lead-lag delay [one-sample t-test of “two locations” best fit slopes against 0, t(7) = −2.70, p < 0.05].

Interim discussion

The results of experiment I suggest that the precedence effect in terms of lead-lag fusion is more robust for ITD than ILD, in agreement with the data of Krumbholz and Nobbe (2002) and consistent with several previous headphone studies (e.g., Saberi et al., 2004; Stecker and Brown, 2010; Brown and Stecker, 2010). Across all tested conditions, echo thresholds for stimuli lateralized by ITD exceeded those for stimuli lateralized by ILD. The difference was most notable in the Breakdown condition, where a sudden perturbation in the ITD of the test stimulus relative to that of the conditioner stimulus (specifically, a switching of the lead and lag ITD values) failed to produce a change in echo threshold. These observations support the notion that the precedence effect as described in the free field depends on specific contributions from ITD and ILD. Most critically, the data suggests, consistent with Krumbholz and Nobbe (2002), that the breakdown effect demonstrated in the free field must depend on the sudden change in lead-lag ILD values; the concomitant sudden change in ITD, at least under headphones, is evidently inconsequential.

The present data additionally demonstrated a surprising reduction in localization dominance for fused (“one location”) lead-lag images with increasing lead-lag delay. For many such trials at lead-lag delays beyond ∼20 ms, lateralization responses fell near the midline, suggesting that the lead and lag cues both contributed substantially to the response. As these trials make the greatest contribution to the elevation of measured echo thresholds in buildup conditions, it follows that the “built-up” precedence effect may feature enhanced lead-lag fusion without similarly enhanced localization dominance. This observation is difficult to reconcile with a standard view of the buildup phenomenon, which construes the elevation of echo thresholds as enhancement of the precedence effect per se (i.e., enhanced fusion with enhanced localization dominance). The possibility of reduced localization dominance with increased fusion is explored further in experiments II and III, while an alternative explanation for near-midline lateralization responses (concerning the presence of the diotic ILD in ITD stimuli and the diotic ITD in ILD stimuli) is considered specifically in experiment III.

EXPERIMENT II: CROSS-CUE TRANSFER OF BUILDUP

The present experiment was designed to evaluate spatial geometric versus spatial acoustic accounts of the dynamic precedence effect (considered in Sec. 1) by presenting subjects with two novel buildup conditions where conditioner and test stimuli were lateralized by different cues of equal subjective magnitude. Prior to testing in the main experimental task, subjects completed an ITD-ILD matching task to obtain values of ILD (carried by single clicks) that matched the subjective lateralization of a ±308 μs ITD standard (see Sec. 2). These individually determined values of ILD (see Fig. 4) were used for all ILD conditioner and test stimuli in experiment II. Stimulus conditions in Experiment II consisted of Baseline ITD and ILD conditions identical to those of Experiment I (with the exception that the individually determined values of ILD in Fig. 4 were used in place of the +/−10 dB used in Experiment I), and two novel “buildup” conditions, (1) Buildup ILD, Test ITD and (2) Buildup ITD, Test ILD conditions, in which the conditioner and test were lateralized by ILD and ITD and by ITD and ILD, respectively.

Results and interim discussion

Figure 5 gives echo thresholds for the conditions of experiment II. Mean Baseline ITD and ILD thresholds were nearly identical to those measured in experiment I. Toward the primary question addressed by experiment II, the mean Buildup ILD, Test ILD threshold appeared to be moderately elevated relative to the Baseline ITD threshold, while the Buildup ITD, Test ILD threshold appeared to be equal to the threshold in the Baseline ILD condition. Individual subject data reveal that the mean Buildup ILD, Test ITD threshold was skewed by two subjects, most especially subject 1012, whose threshold in that condition was 37.5 ms. Although this point might be treated as an outlier, subject 1012's data were not unusual in other conditions, and very high buildup thresholds are occasionally measured in subjective fusion tasks (e.g., Yang and Grantham, 1997; Djelani and Blauert, 2001). Paired t-tests (corrected for two comparisons) indicated that the Buildup ILD, Test ITD threshold was not significantly higher than the Baseline ITD threshold, though the difference approached significance [t(9) = 2.65, p = 0.027], while Buildup ITD, Test ILD and Baseline ILD thresholds were not significantly different [t(9) = 1.16, p = 0.274]. In comparison to “within-cue” buildup thresholds, the mean Buildup ILD, Test ITD threshold of the present experiment was ∼15 ms (∼12.5 ms with removal of subject 1012) versus ∼18 ms for Buildup ITD (experiment I), while the Buildup ITD, Test ILD of the present experiment threshold was ∼5.5 ms versus ∼14 ms for Buildup ILD (experiment I). The data thus suggest that even when the subjective location of a repeating stimulus is fixed, maximal buildup requires static ITD and ILD cues. Although limited cross-cue transfer of buildup was measured with a subjectively equivalent ILD conditioner and ITD test stimulus (particularly for two subjects), none was measured in the opposite case.

Echo thresholds for conditions of experiment II. Legend as in Fig. 2.

Lateralization data for experiment II are displayed in Fig. 6. Because trends in lateralization as a function of lead-lag delay closely followed those observed in experiment I, data are given only for right-lead, left-lag trials (see Appendix for left-lead, right-lag data). As in experiment I, when one location was reported at lead-lag delays beyond ∼20 ms, subjects tended to respond near the midline, suggesting weak localization dominance. Consequently, as in experiment I, the slope of lines fit to “one location” responses was significantly negative taken across conditions [t(3) = −3.23, p < 0.05]. Additionally, a subtler trend was observed in responses for “two locations” responses: At lead-lag delays near the echo threshold, “two locations” responses tended to be shifted leftward (i.e., toward the lead). This trend, leading to slightly negative slopes for lines fit to “two locations” data, was also present in the data of experiment I, but was particularly evident in the Baseline ITD and Buildup ILD, Test ITD conditions of the present experiment. This observation and the observation of weak lateralization dominance for lead-lag buildup test stimuli, as well as differences between ITD- and ILD-based precedence effects discussed hereto, are explored further in experiment III.

EXPERIMENT III: DYNAMIC PRECEDENCE EFFECTS WITHIN A SINGLE HEMIFIELD

The majority of precedence effect studies have used lead and lag stimuli that were symmetrically opposed across the interaural midline (e.g., Wallach et al., 1949; Thurlow and Parks, 1961; Freyman et al., 1991; Krumbholz and Nobbe, 2002; experiments I and II, and dozens of others). Use of such stimuli offers certain advantages such as avoidance of differences in sensitivity to perturbations in the lead versus lag resulting simply from differences in spatial sensitivity across azimuth. Nonetheless, exclusive use of binaurally symmetric stimuli also presents certain disadvantages: Of greatest concern, information is only obtained about one type of synthetic listening condition: a single source and single echo arranged in a single spatial configuration. The generalizability of psychophysical performance measured under such conditions to real-world listening in rooms is therefore limited.

In the present investigation, another difficulty related to the use of binaurally symmetric stimuli may be identified: We (and many previous investigators) have adopted the terms “ITD stimuli” and “ILD stimuli” to describe stimulus conditions in which the ITD or ILD was manipulated while the ILD or ITD was held constant (usually at 0 dB or 0 μs). For any binaural stimulus, however, both cues are always present. Thus, the ITD test stimuli of experiments I and II carry both ±308 μs ITD and ±0 dB ILD, and the ILD test stimuli carry both ±10 dB ILD and ±0 μs ITD. This is a critical consideration in light of the lateralization data obtained in experiments I and II. When “one location” was reported at lead-lag delays beyond ∼20 ms, localization dominance appeared to be weak: Rather than responding on the side consistent with cues carried by the lead (as at brief lead-lag delays), subjects responded near the midline. We took these responses to evidence a substantial contribution to lateralization by both lead and lag cues, i.e., “averaging” of the lead and lag. An alternative explanation could be that, given a diffuse image comprised of disparate ITD or ILD lead and lag cues, subjects were compelled to respond to the co-occurring and highly stable unmanipulated ILD or ITD cue. To address this concern, experiment III employed lead and lag stimuli confined to a single “hemifield,” such that the average of the manipulated lead and lag cue values was nonzero.