Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Sep 1.
Published in final edited form as: Vision Res. 2021 May 28;186:71–79. doi: 10.1016/j.visres.2021.05.006

Task-dependent audiovisual temporal sensitivity is not affected by stimulus intensity levels

Alexandra N Scurry 1,*, Zachary Lovelady 1, Fang Jiang 1
PMCID: PMC8273142  NIHMSID: NIHMS1705608  PMID: 34058622

Abstract

Flexibility and robustness of multisensory temporal recalibration is paramount for maintaining perceptual constancy of the surrounding natural world. Different environments impart various impediments, distances and routes that alter the propagation times of sight and sound cues comprising a multimodal event. One’s ability to rapidly calibrate and account for these external variations allows for maintained perception of synchrony which is crucial for coherent and consistent perception. The two common paradigms used to compare precision of temporal processing between experimental and control groups, the simultaneity judgment (SJ) and temporal order judgment (TOJ) tasks, often use supra-threshold stimuli. However, few studies have specifically examined the effects of normalizing stimulus intensities to participant’s unisensory detection thresholds. The current project presented multiple combinations of auditory and visual stimulus intensity levels, based on individual detection thresholds, during a TOJ and a SJ task. While no effect of stimulus intensity was found on temporal sensitivity or perceived temporal synchrony, there was a significant difference in point of subjective simultaneity (PSS) measures between tasks. In addition, PSS estimates were audio-leading, rather than visual-leading as previously reported, suggesting that exposure to the particular combinations of stimulus intensity levels used influenced temporal synchrony perception. Overall, these results support the use of supra-threshold stimuli in TOJ and SJ tasks as a way of minimizing the confound from differences in unisensory processing.

Keywords: temporal sensitivity, temporal synchrony, multisensory perception

1. Introduction

Our ability to react and adapt to the external environment is critical for precise perception and subsequent responses and actions. A classic example is when we attend a baseball game and notice the extreme delay between our visual perception of the bat making contact with the ball and our auditory perception of the sound generated by that contact. However, after just a few pitches, our conscious awareness of this temporal delay is eliminated, and we perceive the visual and auditory features of this event as simultaneous. This type of short-term temporal recalibration is inherent to multisensory processing and allows for precision and flexibility in navigating the environment. Further, it enables individuals to maintain consistency in their perception of synchrony, a sensation that is built from their prior experience with and exposure to natural asynchronies of multisensory stimuli (Aschersleben & Prinz, 1995). For instance, observers generally display a bias toward visual-leading stimuli due to the naturally faster propagation times of light compared to sound (Bedard & Barnett-Cowan, 2016; Keetels & Vroomen, 2012). In addition, the temporal discrepancy required between two signals to perceive synchrony are not correlated between modalities, presumably due to the contextual factors dictating simultaneous perception (Scurry et al., 2019).

The point of subjective simultaneity (PSS) is the standard estimate used to reflect perceptual synchrony and is defined as the temporal delay required between two cues (i.e. a sound and a flash) so that the temporal order is unknown (Stone et al., 2001). Two common paradigms used to assess the PSS are the temporal order judgment (TOJ) and simultaneity judgement (SJ) tasks in which a participant is asked to determine the sequential order or the temporal simultaneity, respectively, of asynchronous as well as synchronous auditory and visual cues. As the TOJ and SJ tasks tap into different perceptual processes (Love et al., 2013), it is not surprising that PSS estimates are not correlated between these two tasks (Bedard & Barnett-Cowan, 2016; van Eijk et al., 2010). Indeed, while TOJ performance is heavily dependent on sequential timing information and is often classified as a more difficult temporal task, the SJ task appears to measure perceptual integration and is influenced by an asymmetric criterion bias at narrow SOAs (Linares & Holcombe, 2014).

PSS measures also reliably estimate the magnitude and effect of temporal recalibration, an innate strategy to perceptually realign one’s temporal bias of audiovisual stimuli in order to retain coherent multisensory percepts when audiovisual signaled are misaligned due to spatial and contextual factors. Classic adaptation paradigms are traditionally used to investigate temporal recalibration aftereffects, measured as the magnitude and direction of the PSS shift following prolonged exposure (i.e. 3 minutes) to a constant temporal asynchrony between two cues (Fujisaki et al., 2004; Vroomen et al., 2004). In conjunction with this prolonged adaptation effect, there is also evidence for a much more rapid, low-level process that can produce immediate perceptual realignment on the current trial (t) toward the temporal delay (a negative aftereffect) presented in the previous trial (t-1) for both simple and complex audiovisual stimuli (Van der Burg et al., 2013; Van Der Burg et al., 2015; Van der Burg & Goodbourn, 2015). This more immediate strategy of temporal realignment makes sense from an ethological perspective where delays between naturally occurring audiovisual signals are vast and highly influenced by environmental obstacles and features. Indeed, as light is most often perceived prior to sound, it is not surprising that the extent of this rapid recalibration is asynchronous. For instance, when an auditory pure tone led a visual flash, the effect of recalibration was significant only for the smallest delay tested (64 ms) while this effect was significant over a larger range of SOAs for visual-leading t-1 trials (Van der Burg et al., 2013).

The previously described studies depicting negative aftereffects induced after short-term exposure to an asynchronous audiovisual stimulus only used the SJ task. However, a more recent study that compared rapid recalibration between an SJ and a TOJ task found opposing directions of temporal recalibration (Roseboom, 2019). Specifically, while modality order of the preceding trial (t-1) elicited an immediate shift in PSS on the current trial (t) toward that delay (negative aftereffect) in the SJ task, in line with prior findings (Van der Burg et al., 2013; Van der Burg & Goodbourn, 2015), the PSS shifted away from the preceding (t-1) temporal delay (positive aftereffect) in the TOJ task (Roseboom, 2019). This positive aftereffect reported in the TOJ task is likely a result from a choice-repetition bias which leads a participant to repeat their temporal order decision from the preceding trial on the current trial. Indeed, choice repetition occurred in both SJ and TOJ tasks however, while this bias affected the amplitude of the SJ response distribution (i.e. shifts in the proportion of synchronous reports) and therefore didn’t affect the PSS, the shift in the TOJ response distribution had a direct effect on PSS estimates (Keane et al., 2020).

Interestingly, the effect of adaptation on temporal sensitivity thresholds, a measure of the smallest temporal difference between the two signals that can be discerned by the observer, is dependent on the temporal scale of the adaptation. For instance, the more classical adaptation designs in which participants are continuously exposed to a constant audiovisual delay result in larger thresholds for visual-leading stimuli (Y. M. Chan et al., 2014a; Fujisaki et al., 2004), likely enabling more flexibility for integrating audiovisual stimuli characterized by this temporal relationship. However, in studies examining rapid realignment of audiovisual simultaneity, the modality order of the preceding trial had no significant effect on temporal thresholds (Van der Burg et al., 2013; Van Der Burg et al., 2015; Van der Burg & Goodbourn, 2015). Although, worse thresholds were associated with increased magnitudes of the rapid recalibration effect (Van der Burg et al., 2013), again likely due to greater flexibility toward perceiving asynchronous audiovisual stimuli.

Temporal sensitivity appears to be driven by separate cortical processes relative to perceptual synchrony. For instance, older adults that demonstrated impaired temporal sensitivity across audiovisual, visual-motor and visual only temporal tasks did not show any differences in perceptual synchrony estimates relative to young adult controls (Scurry et al., 2019). Similar findings from SJ tasks have been reported in individuals with high levels of attention deficit hyperactivity disorder (ADHD) traits (Panagiotidi et al., 2017) and those with autism spectrum disorder (ASD) (Noel et al., 2017) suggesting that in those neurological disorders that normally affect temporal acuity, perceptual synchrony remains stable.

In majority of studies that investigate multisensory temporal processes, the auditory and visual stimuli are presented at a presumed supra-threshold level. However, unisensory detection sensitivities are not necessarily equivalent across control and experimental groups. This is an especially valid consideration in aging studies as the natural aging process generally results in impairments to the separate sensory systems. Indeed, the sensory receptor organs and the primary regions within the central nervous system that are required for sensory perception deteriorate with age (Celesia et al., 1987; Goodin et al., 1978; Kraus & Anderson, 2013; Lemaître et al., 2005; Ostroff et al., 2003; Werner et al., 2010). These common age-related modifications to sensory processing result in poorer detection abilities of unisensory stimuli. For example, hearing ability and visual acuity both decline with age and subsequent auditory and visual detection thresholds worsen (Humes et al., 2009). Therefore, use of a supra-threshold stimulus may actually be closer to detection thresholds for the experimental groups rather than for the control groups and resultant group effects in multisensory processing could be partially due to their differences in unisensory processing impairments. Without specifically equating intensity levels relative to each individual’s threshold, this confound is difficult to rule out.

Stimulus intensity can also significantly affect synchrony perception. For instance, in a rat model, lowered auditory intensity led to asynchronous perception of simultaneous audiovisual stimuli during a SJ task and to a bias toward visual-first perception in a TOJ task (Schormans & Allman, 2018). Stimuli of higher intensity were reported as being perceived first in a TOJ task and the extent of this bias was correlated to the difference in stimulus intensities between the two cues being compared (Neumann & Niepel, 2004). In a separate study where stimuli were presented in the periphery rather than centrally, increasing the intensity of the visual stimulus led to a shift toward an auditory-leading PSS in a TOJ task (Boenke et al., 2009). Despite these differing directions of a PSS shift, together these studies serve as evidence of stimulus intensity contributing to perceptual bias over simultaneity.

As stimulus intensity has an evident impact on temporal processing and integrating multisensory stimuli, it follows that an individual’s ability to detect stimuli has a similar influence and participant’s thresholds should be controlled for during experimentation. A recent study examined how stimulus effectiveness influenced synchrony perception in young adults using an SJ task. Higher rates of synchrony perception were found when the auditory and visual cue were both presented at low intensity compared to when they were both presented at high intensity levels (Fister et al., 2016). However, this finding was specific for trials in which the auditory and visual stimulus were separated by 200 ms suggesting increased perceptual flexibility for less effective stimuli at larger delays (Fister et al., 2016). Another study compared the temporal binding windows, or the length of temporal delay that an individual is likely to perceptually bind asynchronous stimuli, of older and younger adults after performing a version of the SJ task when the stimulus intensity was based on the individual’s threshold and found that the effect of age was not due to reduced unisensory discrimination sensitivity (Y. M. Chan et al., 2014b). However, since visual acuity is positively correlated to the temporal binding window (the temporal limits of integrating audiovisual stimuli) measured by the TOJ task but not the SJ task (Begum & Barnett-Cowan, 2018), stimulus intensity may contribute differently dependent on the mechanism the temporal task requires.

Therefore, the current study sought to investigate how stimulus intensities based on individual detection thresholds affect temporal sensitivity and temporal synchrony in two separate temporal processing tasks, the TOJ and SJ. We expected that in incongruent conditions, where one stimulus is presented at supra-threshold (i.e. visual) and the other is presented at near-threshold (i.e. auditory), temporal sensitivity would be biased toward the supra-threshold stimulus (i.e. visual). In addition, we hypothesized an effect of stimulus intensity on temporal thresholds estimated during a TOJ, not an SJ task (Begum & Barnett-Cowan, 2018). As the brain demonstrates an innate process to maintain perceptual constancy despite external changes, we did not expect an effect of stimulus intensity on perceptual synchrony in either task. However, we did expect an interaction between stimulus intensity and modality order of trial t-1 on PSS estimates. Specifically, when the leading modality (i.e. visual) of trial t-1 was at supra-threshold, we expected a shift in PSS toward that temporal delay (i.e. negative aftereffect) similar to the bias toward supra-threshold vs near-threshold stimuli previously reported (Neumann & Niepel, 2004; Schormans & Allman, 2018).

2. Methods

2.1. Participants

19 young adults (M = 20.42 ± 0.25 years; 9 females) were recruited from the University of Nevada, Reno and the surrounding community. All participants reported no history of psychiatric or neurological disorders, normal cognitive function, and normal or corrected to normal vision. They were additionally tested for normal hearing using AudioScope 3, a screening audiometer (Welch Allyn, Skaneateles Falls, NY, USA), defined as pure tone thresholds ≤ 25 dB for 1000 and 2000 Hz in both ears. All participants provided signed informed consent before any experimentation and were financially compensated for their time. The experimental protocol was reviewed and approved by the Institutional Review Board at the University of Nevada, Reno.

2.2. Apparatus and Stimuli

The visual stimulus was an isotropic two-dimensional Gaussian blob subtending 3.5˚ visual angle presented on a uniform grey background for 30 ms. The contrast of the visual stimulus was set to either near- or supra-threshold levels based on the participant’s detection threshold measured in during the estimation of unisensory detection thresholds (see below). The auditory stimulus was a 30 ms 1000 Hz tone presented at a dB level set at either near- or supra- individual detection threshold levels. All stimuli were generated using MATLAB (Mathworks, Natick, MA, USA) and Psychtoolbox extensions (Brainard, 1997; Pelli, 1997). Auditory and visual stimuli were presented via an AudioFile Stimulus Processor and a Display++ system with a refresh rate of 120 Hz, respectively (Cambridge Research Systems, Rochester, UK) in order to preserve precise timing. Auditory stimuli were presented via a speaker (Fantech HellScream GS 201, Nepal) that was centered underneath the display to align with presentation of the visual stimuli.

2.3. Estimation of Unisensory Detection Thresholds

A 2-down-1-up staircase was used to quantify individual detection thresholds. Participants were instructed to respond if they detected the stimulus or not for a total of 80 trials. Trials were separated by a variable intertrial interval between 1200 and 1400 ms. The stimulus intensities of the final 10 trials were extracted and averaged to determine the 70.7% detection threshold. For each individual, the near-threshold value was defined as 3 SDs above the mean while the supra-threshold value was defined as 10 SDs above the mean (Y. M. Chan et al., 2014b). The near-threshold level should elicit detection of a stimulus 99.7% of the time (Y. Chan et al., 2017) and piloting procedures with 2 participants demonstrated they were able to detect the near-threshold stimuli on 100% of the 15 test trials. The experimental procedure used to determine detection thresholds was carried out separately for the auditory and visual stimuli.

2.3.1. Visual Detection Threshold

The initial contrast, measured as a Weber fraction, between the visual stimulus (white circle) and the grey background was 1%. The step size used was also 1% so that on trials where participants were unable to detect the white circle, the contrast increased by 1% and when participants did detect the white circle for two consecutive trials, the contrast decreased by 2%. In the case of ceiling performance, the minimum contrast allowed was 1%. Following the onset of the visual stimulus, a brief 750 Hz beep played in order to alert the participant to the stimulus presentation and encourage a response. If a participant detected the Gaussian blob, they pressed the ‘y’ key on the keyboard and if they did not detect the stimulus, they pressed the ‘n’ key. Participants began the visual detection staircase with a brief practice session so that on 3 of the 6 practice trials, the white circle was presented at .56 Weber fraction while no circle was presented on the other 3 to ensure participants understood the task.

2.3.2. Auditory Detection Threshold

The 1000 Hz tone was initially presented at .5 dB and varied by 2 dB dependent on the participant’s response. When the participant was able to detect the tone in 2 consecutive trials, the sound level decreased by 2 dB (with a minimum value of .5 dB) and when the participant could not detect the tone, the sound level increased by 2 dB. 200 ms following the tone’s onset, the fixation cross changed to a blue color during the response window so that individuals knew a stimulus had been presented regardless of whether or not they perceived the sound. If a participant detected the sound, they pressed the ‘y’ key on the keyboard and if they did not detect the sound, they pressed the ‘n’ key. Participants began this task with 6 practice trials with the tone presented at 55 dB on 3 of the trials and no tone on the other 3 trials to ensure they understood the task.

2.4. Temporal Processing Tasks

All participants completed two separate temporal processing tasks, a simultaneity judgment (SJ) and a temporal order judgement (TOJ) task. For each task, 4 experimental blocks were completed where each block contained one of the 4 possible combinations of visual and auditory stimulus intensity levels – auditory near-threshold, visual near-threshold (ANVN); auditory near-threshold, visual supra-threshold (ANVS); auditory supra-threshold, visual near-threshold (ASVN); auditory supra-threshold, visual supra-threshold (ASVS). The order of the 4 stimulus intensity conditions within each task, as well as the order of task (SJ vs TOJ), was randomized. However, all 4 stimulus conditions were completed within a single task (i.e. TOJ) prior to performing the second task (i.e. SJ).

2.4.1. Simultaneity Judgment (SJ) Task

On each trial of a single block, the visual and auditory stimuli were presented at variable stimulus onset asynchronies (SOAs) ranging from −350 to +350 ms in 50 ms steps where negative SOAs represent auditory leading visual trials and positive SOAs represent visual leading auditory trials. Each SOA was repeated 5 times for a total of 75 trials/block (5 repetitions x 15 SOAs). Participants were instructed to respond via a keyboard press as to whether or not they perceived the two stimuli as simultaneous.

Prior to performing any of the 4 SJ blocks, participants completed a short practice session to ensure they understood the task. The contrast of the visual stimulus was .56 Weber fraction and the auditory stimulus was presented at 55 dB. Half of the trials were synchronous (SOA of 0 ms) and the other half were asynchronous presented at larger and different delays than the experimental version of the task (420, 460 or 510 ms).

2.4.2. Temporal Order Judgment (TOJ) Task

The same parameters were used for the TOJ task so that 4 blocks were performed by each participant and each AV stimulus intensity combination was tested separately in the 4 blocks. Again, 15 SOAs (−350 to + 350 ms in 50 ms steps) were each repeated 5 times for a total of 75 trials per/block. At the end of each trial, participants were asked to judge the temporal order of the stimuli so that a 1 on the keyboard indicated they perceived the auditory stimulus as preceding the visual and a 2 indicated the perceived the visual preceding the auditory stimulus.

Prior to running any of the TOJ blocks, participants did complete a short practice session of 10 trials to ensure they understood the task and the correct response. Equivalent to the SJ practice session, the visual stimulus was presented at .56 Weber fraction and the auditory stimulus at 55 dB. SOA values were also different from the experimental TOJ blocks (360, 420, 460 or 510 ms).

2.5. Behavioral and Statistical Analysis

In order to understand how stimulus intensity and task affected rapid recalibration and temporal sensitivity, trials were categorized based on the leading modality in the preceding trial (t-1) so that auditory-leading t-1 trials and visual leading t-1 trials data subsets were created. This categorization occurred for each of the stimulus intensity conditions in both the SJ and TOJ task prior to the fitting procedures described below. Modality order is therefore implicated in generating any difference in the fits between the auditory-leading t-1 trials and visual leading t-1 trials (Alais et al., 2015; Van der Burg et al., 2013). Specifically, the difference in PSS estimates (ΔPSS = PSSvisual-leading t-1 trials – PSSaudio-leading t-1 trials) captured how modality order could rapidly recalibrate the system and shift perceptual bias of temporal simultaneity.

For the SJ task, the average response of ‘simultaneous’ was plotted as a function of SOA separately for each participant. The data were then fit with a gaussian function that used three free parameters: the mean, the standard deviation, and the amplitude. The mean of the fitted function reflected the PSS, the standard deviation reflected the temporal window of integration and the amplitude reflected the proportion of simultaneous responses.

Similarly, for the TOJ task, the proportion of ‘flash first’ response was plotted as a function of SOA for each individual and then fit with a normal cumulative distribution function. The mean was estimated from this cumulative distribution and reflected the PSS (Alais et al., 2015; Fechner, 1860; Scurry et al., 2019; van Eijk et al., 2008; Weber, 1834). Temporal order sensitivity thresholds were estimated using the standard deviation of the fitted cumulative distribution function (Alais et al., 2015; Scurry et al., 2019).

4 participants reported simultaneous audiovisual stimuli in the SJ task over 50% of the time when the pair was not veridically synchronous and thus were excluded from further analysis.

To determine statistically significant effects and interactions that affected PSS and threshold estimates, a repeated measures ANOVA was performed using task (SJ vs TOJ), condition (ANVN; ANVS; ASVN; ASVS), and modality order in t-1 trial (auditory vs visual) as factors. A repeated measures ANOVA was also used to understand how task, auditory stimulus intensity (near vs supra) and visual stimulus intensity (near vs supra) affected ΔPSS. To understand and interpret any non-statistically significant results (Dienes, 2014; van Doorn et al., 2020), Bayes factors were calculated using the BayesFactor package in R with default priors (Morey, 2018; Rouder et al., 2012). All statistical analysis was performed in R statistical software.

3. Results

Detection thresholds are shown in Figure 1 for both auditory (left panel) and visual (right panel) stimuli. The average auditory detection threshold was 6.10 ± 1.03 dB while the average visual detection threshold was .094 ± .010 contrast (Weber fraction).

Figure 1.

Figure 1.

Unisensory detection thresholds. Group average and individual data are shown for auditory (left panel) and visual (right panel) detection thresholds. Error bars represent SEM.

Initially, we categorized trials based on the leading modality of the previous trial (t – 1) and fit a normal cumulative distribution and gaussian function to both data subsets for the TOJ and SJ task, respectively. This procedure was done for all 4 stimulus intensity conditions and temporal sensitivity thresholds and PSS estimates were quantified. The individual along with the group fits for both auditory leading (blue) and visual leading (orange) t-1 trial orders are displayed in Figure 2 for the TOJ task and in Figure 3 for the SJ task.

Figure 2.

Figure 2.

Group and individual fits of the cumulative distribution function for TOJ responses. The group-averaged proportion of flash first responses in the TOJ task are plotted as a function of stimulus onset asynchrony (SOA) for both auditory-leading t-1 trials (blue) and visual-leading t-1 trials (orange). The SOAs shown on the x-axis depict the asynchrony level of the current trial (t). Group (bold lines) and individual fits (thin lines) of a normal cumulative distribution function are shown for both subsets of trials. Error bars represent SEM.

Figure 3.

Figure 3.

Group and individual fits of a Gaussian function for SJ responses. The group-averaged proportion of simultaneous responses in the SJ task are plotted as a function of stimulus onset asynchrony (SOA) for both auditory-leading t-1 trials (blue) and visual-leading t-1 trials (orange). The SOAs shown on the x-axis depict the asynchrony level of the current trial (t). Group (bold lines) and individual fits (thin lines) of a Gaussian function are shown for both subsets of trials. Error bars represent SEM.

A 2×4×2 repeated measures ANOVA was conducted to determine the effect of the leading modality in trial t-1, stimulus intensity condition, and task on PSS estimates. Across all conditions and tasks, group-averaged PSS values were negative indicating an auditory-leading bias. PSS values were significantly smaller (a shift toward visual-leading) during the SJ compared to the TOJ task (F(1,190) = 7.61 p < .01) while the leading modality in trial t-1 (F(1,187) = 3.26, p = .07) and stimulus intensity condition (F(3, 187) = 1.35, p = .26) had no significant influence. Similarly, none of the 2-way or the 3-way interactions were significant (Fs(3,187) ≤ 1.49, ps ≥ .22). Group averaged PSS values with individual PSS estimates are plotted in Figure 4 for both the SJ (left panel) and TOJ (right panel) tasks.

Figure 4.

Figure 4.

Point of subjective simultaneity (PSS) estimates. Group averaged and individual PSS estimates for audio-leading t-1 trials (dark grey bars w/black circles) and visual-leading t −1 trials (light grey bars w/light grey circles) are displayed for the SJ (left panel) and TOJ (right panel) tasks for each of the 4 conditions (ANVN; ANVS; ASVN; ASVS). Error bars represent SEM.

This data was further examined using Bayes Factors to determine the likelihood of the null hypotheses compared to other possible models. As expected from the results derived from the 2×4×2 ANOVA, the model with Task as a main effect was preferred (BF = 4.94), followed by a model including both task and the modality order of trial t-1 (BF = 3.61). However, modality order of t-1 trial alone had insufficient evidence (BF = 0.68) to support the null hypothesis that it does not influence the PSS. The null hypothesis that the main effect of interest, stimulus intensity condition, does not significantly influence PSS measures is favored in a model that only contained condition as a main effect (BF = 0.11) and in models containing stimulus intensity condition in the interaction term(s) (BFs ≤ 0.16).

The same statistical procedures were conducted to determine effects on temporal sensitivity measures (Figure 5). Results from the 2×4×2 repeated measures ANOVA showed that none of the main effects (leading modality in trial t-1, stimulus intensity condition, and task) were significant (Fs(3,188) ≤ 0.99, ps ≥ .32). Further, the 2-way and 3-way interactions did not significantly influence temporal thresholds (Fs(3,188) ≤ 2.01, ps ≥ .16). All possible models were again compared to the null model and subsequent Bayes Factors displayed moderate support for the null hypothesis that Task (BF = 0.24) and leading modality in trial t-1 (BF = 0.22) did not influence thresholds. Further, there was strong support for the null hypothesis that stimulus intensity condition did not significantly affect temporal thresholds (BF = 0.05) and for models containing any combination of these main effects or their interactions (BFs ≤ 0.05).

Figure 5.

Figure 5.

Temporal thresholds. Group averaged and individual temporal sensitivity thresholds for audio-leading t-1 trials (dark grey bars w/black circles) and visual-leading t −1 trials (light grey bars w/light grey circles) are displayed for the SJ (left panel) and TOJ (right panel) tasks for each of the 4 conditions (ANVN; ANVS; ASVN; ASVS). Error bars represent SEM.

In order to examine how stimulus intensity and task affected rapid recalibration, a ΔPSS was calculated that compared the PSS in visual leading t-1 trials to the PSS from auditory leading t-1 trials (see Figure 6). A 2×2×2 repeated measures ANOVA that was conducted to examine the effects of task, auditory stimulus intensity and visual stimulus intensity on ΔPSS did not reveal any significant main effects (Fs(1, 104) ≤ 1.34, ps ≥ .25) or interactions (Fs(1, 104) ≤ 3.16, ps ≥ .08). The null model was again compared to all possible models and Bayes Factors were extracted to estimate the likelihood of the null. All possible models with main effects and their interactions demonstrate moderate to strong support for the null hypothesis of no significant influence on ΔPSS (BFs ≤ 0.37).

Figure 6.

Figure 6.

ΔPSS measures to examine the effect of modality order in preceding trial (t-1). Group-averaged and individual differences in PSS between visual- and auditory-leading t-1 trials (ΔPSS) are displayed for the SJ (dark grey) and TOJ (light grey) tasks for all 4 stimulus intensity conditions (ANVN; ANVS; ASVN; ASVS). Error bars represent SEM.

To investigate if the modality order of trial t-1 affected PSS estimates, separate t-tests were used to investigate if ΔPSS values were significantly different from 0. Across conditions in the TOJ task, no significant influence of the modality order of trial t-1 occurred (ts(14) ≤ 1.14, ps ≥ .28). Bayes Factor t-tests were also conducted and revealed only anecdotal support for the null hypothesis that the ΔPSS = 0 (BFs were between 0.32 and 0.47). In the SJ task, only the ANVN condition revealed any significant recalibration (t(14) = 2.84, p < .05, BF = 4.50) while the remaining 3 conditions did not (ts(14) ≤ 1.34, ps ≥ .20). Again, Bayes Factor t-tests were between 0.34 and 0.56 suggesting that there was not sufficient evidence to support that stimulus intensity condition had no influence on the PSS. This coincides with the observed difference in the group-averaged simultaneous response proportion in A-lead versus V-lead conditions in Figure 3 (i.e., A-lead (blue) show higher proportions than V-lead (orange) on left hand side of curve in ASVN and ASVS condition).

4. Discussion

Flexibility in temporal processing of multisensory cues is imperative for proper recalibration dependent on changing environmental contexts. The strength and features driving multisensory recalibration is often measured by exposing participants to a constant temporal offset between signals and then measuring how our system has adapted to this offset by our shift in perceived temporal synchrony. In some experimental groups, such as older adults that present with reduced temporal sensitivity for discriminating temporal delays between multisensory cues, a consequent reduction in the strength of audiovisual recalibration is also reported (Y. M. Chan et al., 2014a). However, without equating stimulus amplitudes across participants, it is difficult to declare our basic sensory processing deficits as contributing to an experimental subject’s recalibration deficit. As the two common paradigms used to assess temporal recalibration, the SJ and TOJ tasks, often rely on supra-threshold stimuli as negating this potential confound, our study sought to explicitly examine the contribution of individualized stimulus intensity levels to temporal sensitivity and synchrony measures estimated from both tasks. The effect of stimulus intensity level did not significantly impact temporal sensitivity or synchrony estimates, regardless of modality order in trial t-1. However, PSS estimates were significantly smaller in the SJ compared to the TOJ task while no difference in thresholds were found between tasks.

Initially, we predicted that stimulus intensity levels would affect temporal sensitivity estimates, particularly when one stimulus was at near-threshold while the other was presented at supra-threshold. For instance, when the auditory cue was presented at near-threshold and the visual at supra-threshold, we would expect improved temporal order detection and thus sensitivity for visual leading stimuli. This prediction was in line with prior reports of temporal order bias toward a stimulus presented at a higher level than the stimulus from the opposing modality (Boenke et al., 2009; Neumann & Niepel, 2004; Schormans & Allman, 2018). Fister et al. (2016), also found reduced temporal sensitivity (reported as increased perceived synchrony) for asynchronous stimuli when both auditory and visual stimuli were at a low intensity compared to when they were both at a high intensity level.

Interestingly, we did not find any significant effects of stimulus intensity on temporal thresholds. While the present study used a total of 15 SOAs, a narrower range of SOAs have previously been used (4 SOAs in Fister et al., 2016; 7 SOAs in Schormans & Allman, 2018; 10 SOAs in Boenke et al., 2009). The extended range of SOAs currently used likely provided participants with additional information to improve their decision strategies and accurately perform the temporal tasks leading to equivalent temporal thresholds. This explanation is supported by the recent finding of reduced proportions of temporally dependent, sound-induced flash illusions when individuals were presented with 10 vs 6 SOAs (J. S. Chan et al., 2018).

Prior studies also manipulated the intensity of stimuli from only 1 modality while keeping the intensity of the other stimulus constant (Boenke et al., 2009; Schormans & Allman, 2018) or had congruent levels of stimulus intensity (both low or both high intensity) (Fister et al., 2016). The increased number of stimulus intensity combinations used in the present study may have negated any potential effects due to complex interactions. In addition, the more salient stimulus may have helped drive the perception of sequential order so the particular combination wouldn’t exert any effect on performance. For instance, when needing to determine temporal order, a participant may be judging the timing of the most salient stimulus relative to the overall trial length. Therefore, even when that salient stimulus lagged the weaker stimulus, a participant could still correctly discern the order in that trial. In addition, similar thresholds across conditions may have been a consequence of a practice effect as participants completed all 4 stimulus intensity blocks for one task (i.e. TOJ) prior to completing the second task (i.e. SJ) providing ample time to develop effective criterions for judging temporal relationships.

Differences in the PSS values derived from visual-leading compared to auditory-leading t-1 trials also did not show any significant effect of stimulus intensity. Prior studies examining rapid recalibration of temporal synchrony perception found that the physical temporal properties between audiovisual cues, and not the perceived temporal relationship, affected temporal realignment (Van der Burg et al., 2013, 2018). Therefore, it is likely that while stimulus intensity may affect perceived temporal order or simultaneity (Boenke et al., 2009; Schormans & Allman, 2018), it cannot alter the veridical temporal relationship and thus does not influence any temporal realignment that occurs between trials. One limitation of the present study was the sparse number of trials per SOA during each experimental condition (5 trials/SOA). This prevented analysis of how the ΔPSS was influenced by the degree of temporal asynchrony from the previous trial, as shown in prior studies (Roseboom, 2019; Van der Burg et al., 2013).

At the group level, the PSS was negative (audio-leading) across all tasks and stimulus intensity conditions. This is contrary to prior studies finding a visual-leading PSS for the SJ (Bedard & Barnett-Cowan, 2016; Fister et al., 2016; van Eijk et al., 2008) and TOJ (Bedard & Barnett-Cowan, 2016; Scurry et al., 2019) tasks indicating that auditory signals are processed at a slightly faster rate than visual signals. Indeed, a PSS shift in the audio-leading direction for higher intensity visual stimuli was reported from a TOJ task suggesting that visual cues were processed faster (Boenke et al., 2009). However, this explanation doesn’t hold when the two cues were presented at a similar intensity (ANVN and ASVS) or when the visual stimulus was presented at a higher intensity relative to auditory (ANVS). Perhaps, the different exposures to stimulus intensities throughout the experiment adjusted the overall temporal sensitivity of participants leading to a shift toward audio-leading PSS values. In prior studies using complex stimuli (i.e. a bouncing ball (van Eijk et al., 2008) or a Newton’s cradle toy with manipulated predictive and postdictive information in the visual signal (van Eijk et al., 2010)) an audio-leading PSS was explained by participant’s enhanced temporal sensitivity for audio-leading trials. Regardless, the negative PSS values do indicate a shift in natural perception as visual information naturally precedes auditory. Further, the individual variability in PSS values across all conditions suggest that inherent bias exists, and additional individual factors likely contribute to the temporal synchrony perception.

The overall auditory-leading PSS was significantly larger in the TOJ task relative to the SJ task indicating that auditory stimuli needed to lead visual by a greater extent to elicit a simultaneous percept. As discussed above, this may have reflected improved discrimination for audio-leading trials specifically during the TOJ task. It has been previously reported that the SJ and TOJ tasks probe different functions involved in temporal processing. SJ tasks are subject to an asymmetric criterion bias that increases the likelihood of perceiving simultaneity in asynchronous cues while TOJ tasks are associated with a difficulty in ordering successive events that impairs performance (Linares & Holcombe, 2014). Therefore, it is probable that participants became more sensitive toward temporal order cues during the TOJ compared to the SJ thus inducing the auditory-lead shift in their PSS.

The absence of any effect of stimulus intensity on temporal sensitivity or synchrony supports abundant prior findings that rely on supra-threshold stimulus presentation levels. Other effects, such as stimulus type, task, or between subject factors (i.e. age) are less likely confounded by any differences at the initial steps of sensory processing that are influenced by detection sensitivity and resultant speed of processing. While the reported null effects certainly substantiate prior findings, replications of the current design should be carried out in experimental groups rather than just young adults. In addition, replicable findings using different stimulus types, especially more ethologically relevant stimuli, would further strengthen the current findings and help support the practice of using a standard supra-threshold intensity level. Finally, extensions of this project that use additional definitions for near- and supra-threshold stimuli may help to parse out the point at which unisensory detection levels, and the specific combination of detection levels between stimulus modalities, affect multisensory temporal processing.

5. Conclusions

This project sought to investigate how individualized stimulus intensity levels influenced temporal precision and perceived temporal synchrony. While stimulus intensity did not have any significant effect on temporal precision or synchrony, the type of temporal processing task did affect perceived temporal synchrony. This finding is likely due to the separate functions required by temporal order vs simultaneity judgments. Determining sequential order decodes asynchronous cues in a less ambiguous way than determining synchrony and likely allows participants’ judgement strategies to improve for more difficult audio-leading trials over experimentation shifting their PSS toward audio-leading stimuli. Future studies that probe experimental groups and expand the definition of near- and supra-threshold stimuli will increase the validity of these findings and further promote the use of supra-threshold stimuli when investigating aspects of temporal processing as a way to reduce the confounding effect of unisensory detection ability.

Acknowledgements

This research has been supported by EY023268 to Fang Jiang and P20 GM103650. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

We thank the Nevada Undergraduate Research Award, provided by the University of Nevada, Reno, for supporting the contribution of Zachary Lovelady to the project.

References

  1. Alais D, Orchard-Mills E, & Van der Burg E (2015). Auditory frequency perception adapts rapidly to the immediate past. Attention, Perception, and Psychophysics, 77(3), 896–906. 10.3758/s13414-014-0812-2 [DOI] [PubMed] [Google Scholar]
  2. Aschersleben G, & Prinz W (1995). Synchronizing actions with events: The role of sensory information. Perception & Psychophysics, 57(3), 305–317. 10.3758/BF03213056 [DOI] [PubMed] [Google Scholar]
  3. Bedard G, & Barnett-Cowan M (2016). Impaired timing of audiovisual events in the elderly. Experimental Brain Research, 234(1), 331–340. 10.1007/s00221-015-4466-7 [DOI] [PubMed] [Google Scholar]
  4. Boenke LT, Deliano M, & Ohl FW (2009). Stimulus duration influences perceived simultaneity in audiovisual temporal-order judgment. Experimental Brain Research, 198(2–3), 233–244. 10.1007/s00221-009-1917-z [DOI] [PubMed] [Google Scholar]
  5. Brainard DH (1997). The psychophysics toolbox. Spatial Vision, 10(4), 433–436. 10.1163/156856897X00357 [DOI] [PubMed] [Google Scholar]
  6. Celesia GG, Kaufman D, & Cone S (1987). Effects of age and sex on pattern electroretinograms and visual evoked potentials. Electroencephalography and Clinical Neurophysiology, 68(3), 161–171. http://www.ncbi.nlm.nih.gov/pubmed/2436875 [DOI] [PubMed] [Google Scholar]
  7. Chan JS, Connolly SK, & Setti A (2018). The number of stimulus-onset asynchronies affects the perception of the sound-induced flash illusion in young and older adults. Multisensory Research, 31(3–4), 175–190. 10.1163/22134808-00002605 [DOI] [PubMed] [Google Scholar]
  8. Chan YM, Pianta MJ, & McKendrick AM (2014a). Reduced audiovisual recalibration in the elderly. Frontiers in Aging Neuroscience, 6, 226. 10.3389/fnagi.2014.00226 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chan YM, Pianta MJ, & McKendrick AM (2014b). Older age results in difficulties separating auditory and visual signals in time. Journal of Vision, 14(11), 1–11. 10.1167/14.11.13 [DOI] [PubMed] [Google Scholar]
  10. Chan Y, Pianta MJ, Bode S, & McKendrick AM (2017). Spatio-temporal patterns of event-related potentials related to audiovisual synchrony judgments in older adults. Neurobiology of Aging, 55, 38–48. 10.1016/J.NEUROBIOLAGING.2017.03.011 [DOI] [PubMed] [Google Scholar]
  11. Dienes Z (2014). Using Bayes to get the most out of non-significant results. Frontiers in Psychology, 5, 781. 10.3389/fpsyg.2014.00781 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Fechner GT (1860). Elements of Psychophysics. Breitkopf & Hartel. https://www.britannica.com/topic/Elements-of-Psychophysics [Google Scholar]
  13. Fister JK, Stevenson RA, Nidiffer AR, Barnett ZP, & Wallace MT (2016). Stimulus intensity modulates multisensory temporal processing. Neuropsychologia, 88, 92–100. 10.1016/j.neuropsychologia.2016.02.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Fujisaki W, Shimojo S, Kashino M, & Nishida S (2004). Recalibration of audiovisual simultaneity. Nature Neuroscience, 7(7), 773–778. 10.1038/nn1268 [DOI] [PubMed] [Google Scholar]
  15. Goodin DS, Squires KC, Henderson BH, & Starr A (1978). Age-related variations in evoked potentials to auditory stimuli in normal human subjects. Electroencephalography and Clinical Neurophysiology, 44(4), 447–458. http://www.ncbi.nlm.nih.gov/pubmed/76553 [DOI] [PubMed] [Google Scholar]
  16. Humes LE, Busey TA, Craig JC, & Kewley-port D (2009). The effects of age on sensory thresholds and temporal gap detection in hearing, vision, and touch. Atten Percept Psychophys, 71(4), 860–871. 10.3758/APP.71.4.860.The [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Keane B, Bland NS, Matthews N, Carroll TJ, & Wallis G (2020). Rapid recalibration of temporal order judgements: Response bias accounts for contradictory results. European Journal of Neuroscience, 51(7), 1697–1710. 10.1111/ejn.14551 [DOI] [PubMed] [Google Scholar]
  18. Keetels M, & Vroomen J (2012). Perception of Synchrony between the Senses. In The Neural Bases of Multisensory Processes. CRC Press/Taylor & Francis. http://www.ncbi.nlm.nih.gov/pubmed/22593865 [PubMed] [Google Scholar]
  19. Kraus N, & Anderson S (2013). The Effects of Aging on Auditory Processing. The Hearing Journal, 66(1), 36. 10.1097/01.HJ.0000425774.80002.ea [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Lemaître H, Crivello F, Grassiot B, Alpérovitch A, Tzourio C, & Mazoyer B (2005). Age- and sex-related effects on the neuroanatomy of healthy elderly. NeuroImage, 26(3), 900–911. 10.1016/J.NEUROIMAGE.2005.02.042 [DOI] [PubMed] [Google Scholar]
  21. Linares D, & Holcombe AO (2014). Differences in perceptual latency estimated from judgments of temporal order, simultaneity and duration are inconsistent. I-Perception, 5(6), 559–571. 10.1068/i0675 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Love SA, Petrini K, Cheng A, & Pollick FE (2013). A Psychophysical Investigation of Differences between Synchrony and Temporal Order Judgments. PLoS ONE, 8(1), e54798. 10.1371/journal.pone.0054798 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Morey RD (2018). Package “BayesFactor” Title Computation of Bayes Factors for Common Designs. https://cran.r-project.org/web/packages/BayesFactor/BayesFactor.pdf [Google Scholar]
  24. Neumann O, & Niepel M (2004). Timing of “Perception” and Perception of “Time.” In Kaernbach C, Schroger E, & Muller H (Eds.), Psychophysics Beyond Sensation: Laws and Invariants of Human Cognition (pp. 245–269). Lawrence Erlbaum Associates. https://books.google.com/books?hl=en&lr=&id=Mo9i5_yidGIC&oi=fnd&pg=PA245&ots=WFbtoHDxEI&sig=PTmU0MDX6lpgg_mxrBDDiSQfsB4#v=onepage&q&f=false [Google Scholar]
  25. Noel J-P, De Niear MA, Stevenson R, Alais D, & Wallace MT (2017). Atypical rapid audio-visual temporal recalibration in autism spectrum disorders. Autism Research, 10(1), 121–129. 10.1002/aur.1633 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Ostroff JM, McDonald KL, Schneider BA, & Alain C (2003). Aging and the processing of sound duration in human auditory cortex. Hearing Research, 181(1–2), 1–7. 10.1016/S0378-5955(03)00113-8 [DOI] [PubMed] [Google Scholar]
  27. Panagiotidi M, Overton PG, & Stafford T (2017). Multisensory integration and ADHD-like traits: Evidence for an abnormal temporal integration window in ADHD. Acta Psychologica, 181, 10–17. 10.1016/j.actpsy.2017.10.001 [DOI] [PubMed] [Google Scholar]
  28. Pelli DG (1997). The VideoToolbox software for visual psychophysics: Transforming numbers into movies. Spatial Vision, 10(4), 437–442. 10.1163/156856897X00366 [DOI] [PubMed] [Google Scholar]
  29. Roseboom W (2019). Serial dependence in timing perception. Journal of Experimental Psychology: Human Perception and Performance, 45(1), 100–110. 10.1037/xhp0000591 [DOI] [PubMed] [Google Scholar]
  30. Rouder JN, Morey RD, Speckman PL, & Province JM (2012). Default Bayes factors for ANOVA designs. Journal of Mathematical Psychology, 56(5), 356–374. 10.1016/J.JMP.2012.08.001 [DOI] [Google Scholar]
  31. Schormans AL, & Allman BL (2018). Behavioral Plasticity of Audiovisual Perception: Rapid Recalibration of Temporal Sensitivity but Not Perceptual Binding Following Adult-Onset Hearing Loss. Frontiers in Behavioral Neuroscience, 12, 256. 10.3389/fnbeh.2018.00256 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Scurry AN, Vercillo T, Nicholson A, Webster M, & Jiang F (2019). Aging Impairs Temporal Sensitivity, but not Perceptual Synchrony, Across Modalities. Multisensory Research, 32(8), 671–692. 10.1163/22134808-20191343 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Stone JV, Hunkin NM, Porrill J, Wood R, Keeler V, Beanland M, Port M, & Porter NR (2001). When is now? Perception of simultaneity. Proceedings of the Royal Society B: Biological Sciences, 268(1462), 31–38. 10.1098/rspb.2000.1326 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Van der Burg E, Alais D, & Cass J (2013). Rapid recalibration to audiovisual asynchrony. Journal of Neuroscience, 33(37), 14633–14637. 10.1523/JNEUROSCI.1182-13.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Van der Burg E, Alais D, & Cass J (2018). Rapid recalibration to audiovisual asynchrony follows the physical—not the perceived—temporal order. Attention, Perception, and Psychophysics, 80(8), 2060–2068. 10.3758/s13414-018-1540-9 [DOI] [PubMed] [Google Scholar]
  36. Van Der Burg E, Alais D, & Cass J (2015). Audiovisual temporal recalibration occurs independently at two different time scales. Scientific Reports, 5. 10.1038/srep14526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Van der Burg E, & Goodbourn PT (2015). Rapid, generalized adaptation to asynchronous audiovisual speech. Proceedings of the Royal Society B: Biological Sciences, 282(1804). 10.1098/rspb.2014.3083 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. van Doorn J, van den Bergh D, Böhm U, Dablander F, Derks K, Draws T, Etz A, Evans NJ, Gronau QF, Haaf JM, Hinne M, Kucharský Š, Ly A, Marsman M, Matzke D, Gupta ARKN, Sarafoglou A, Stefan A, Voelkel JG, & Wagenmakers EJ (2020). The JASP guidelines for conducting and reporting a Bayesian analysis. In Psychonomic Bulletin and Review (pp. 1–14). Springer. 10.3758/s13423-020-01798-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. van Eijk RLJ, Kohlrausch A, Juola JF, & van de Par S (2008). Audiovisual synchrony and temporal order judgments: Effects of experimental method and stimulus type. Perception & Psychophysics, 70(6), 955–968. 10.3758/PP.70.6.955 [DOI] [PubMed] [Google Scholar]
  40. van Eijk RLJ, Kohlrausch A, Juola JF, & van de Par S (2010). Temporal order judgment criteria are affected by synchrony judgment sensitivity. Attention, Perception & Psychophysics, 72(8), 2227–2235. 10.3758/APP [DOI] [PubMed] [Google Scholar]
  41. Vroomen J, Keetels M, de Gelder B, & Bertelson P (2004). Recalibration of temporal order perception by exposure to audio-visual asynchrony. Cognitive Brain Research, 22(1), 32–35. 10.1016/j.cogbrainres.2004.07.003 [DOI] [PubMed] [Google Scholar]
  42. Weber EH (1834). De Tactu. http://onlinebooks.library.upenn.edu/webbin/book/lookupid?key=ha009657826
  43. Werner JS, Schefrin BE, & Bradley A (2010). Optics and vision of the aging eye. In Bass M, Enoch JM, & Lakshminarayanan V (Eds.), Handbook of Optics (3rd ed., pp. 14.11–14.38). McGraw-Hill. [Google Scholar]

RESOURCES