Effects of temporal uncertainty and temporal expectancy on infants’ auditory sensitivity

Lynne A Werner; Heather K Parrish; Nicole M Holmer

doi:10.1121/1.3050254

. 2009 Feb;125(2):1040–1049. doi: 10.1121/1.3050254

Effects of temporal uncertainty and temporal expectancy on infants’ auditory sensitivity¹

Lynne A Werner ^1,^b), Heather K Parrish ¹, Nicole M Holmer ¹

PMCID: PMC2677369 PMID: 19206878

Abstract

Adults are more sensitive to a sound if they know when the sound will occur. In the present experiment, the effects of temporal uncertainty and temporal expectancy on infants’ and adults’ detection of a 1 kHz tone in a broadband noise were examined. In one experiment, masked sensitivity was measured with an acoustic cue and without an acoustic cue to possible tone presentation times. Adults’ sensitivity was greater for the cue than for the no-cue condition, while infants’ sensitivity did not differ significantly between the cue and no-cue conditions. In a second experiment, the effect of temporal expectancy was investigated. The detection advantage for sounds occurring at an expected (most frequent) time, over sounds occurring at unexpected (less frequent) times, was examined. Both infants and adults detected a tone better when it occurred before or at an expected time following a cue than when it occurred at a later time. Thus, despite the fact that the auditory cue did not improve infants’ sensitivity, it nonetheless provided the basis for temporal expectancies. Infants, like adults, are more sensitive to sounds that are consistent with temporal expectancy.

INTRODUCTION

It is well established that infants have higher detection thresholds than adults (e.g., Schneider et al., 1989; Bargones et al., 1995; Berg and Boswell, 1999). Several contributors to infants’ immature sensitivity have been suggested, including inattention and unselective listening across frequency (Bargones and Werner, 1994; Bargones et al., 1995; Werner and Boike, 2001). Another possible contributor is unselective listening in time.

In a typical infant test procedure, the listener hears a continuous background noise that starts at the beginning of the test session. At certain points during the session the target tone is presented. There is no explicit cue informing the listener of when a tone might be presented, although the experimenter observing the listener’s response is aware that a trial is in progress. The listener is thus uncertain about the timing of the signal tone. Temporal uncertainty is a feature of most, if not all, psychoacoustical procedures applied to infant listeners (e.g., Schneider and Trehub, 1985; Werner, 1995; Berg and Boswell, 1999).

It is generally accepted that adults listen selectively in time to optimize sensitivity. Temporal uncertainty reduces adults’ auditory sensitivity (Egan et al., 1961; Lappin and Disch, 1973; Lisper et al., 1977). For example, Egan et al. (1961) examined listeners’ ability to detect a tone in broadband noise when the time at which the tone could be presented was known and when the presentation time varied over intervals of 1–8 s. Their results showed that a signal that was detected with a d^′ of about 1.5, when presentation time was known, was detected with a d^′ of only 0.75 when the interval of uncertainty was 8 s. The results of Egan et al. (1961) indicate that such an increase in uncertainty would be equivalent to a 9 dB decrease in signal-to-noise ratio.

Another indication that adults listen selectively in time is that adult listeners detect tones better at expected presentation times than at unexpected presentation times (Leis Rossio, 1986; Chang, 1991; Chang and Viemeister, 1991). On a majority of trials in these studies, the signal occurred at a fixed time during the observation interval, the “expected” time. On the remaining trials, the signal occurred either before or after the expected time; these signals occurred at “unexpected” times. Leis Rossio (1986) measured adults’ hit rate for a click in noise when the expected presentation time was 500 ms into the observation interval with unexpected presentation times varying between 100 and 1100 ms. A single-interval, yes-no procedure was used in that study. Chang (1991) and Chang and Viemeister (1991) used a two-interval forced-choice method and a 20 ms tone as a signal. Beside a visual indicator of each observation interval, a click was presented in the contralateral ear to indicate precisely the expected presentation time within the observation interval. Unexpected presentation times varied between 100 and 900 ms. Although the details of the procedures and the gradient of performance over time differed between the studies, both showed that as the presentation time of the signal deviated from the expected time, detection of the signal grew poorer. The results of these studies further support the benefit of knowing when to listen for a sound.

The effects of temporal uncertainty on detection have not been studied developmentally. If infants’ detection is more disrupted by temporal uncertainty than adults’, that could at least partially explain why infants’ thresholds are higher than adults’ in a temporally uncertain test procedure.

The effects of frequency uncertainty have been examined in children. Allen and Wightman (1995) found that children’s detection was less affected than adults’ by uncertainty about signal frequency, suggesting that children did not focus on a particular frequency when the signal frequency was known. Other results support the idea that infants do not listen selectively in frequency. For example, while adults detect tones at an expected frequency better than those at unexpected frequencies, infants detect expected and unexpected frequency tones equally well (Bargones and Werner, 1994). If infants do not focus on the time when signals are expected to occur, then decreasing temporal uncertainty may produce little change in infants’ sensitivity. If that were the case, the difference between infants’ and adults’ sensitivity would be greater when temporal uncertainty is reduced, because adults’ sensitivity would improve, but infants’ would not.

The goal of the present experiments was to determine how infants’ and adults’ detection of a tone in noise is affected by temporal uncertainty and temporal expectancy in an infant test procedure. First, detection in the typical, temporally uncertain, infant procedure was compared to detection when a cue to the timing of signal presentation was provided. Second, detection of tones that occurred at expected times was compared to that of tones that occurred at unexpected times.

EXPERIMENT 1: EFFECTS OF TEMPORAL CUES ON SENSITIVITY

Method

Subjects

The data were provided by 98 infants and 93 adults. The age of the infants ranged from 29 to 40 wk (M=33.5 wk; SD=3.3 wk). The age of the adults ranged from 19 to 31 yr (M=24 yr; SD=3 yr). All subjects had normal hearing, as assessed by parent report or self-report. None had any risk factors associated with hearing loss, and all subjects passed screening typanometry on the test day. All infants were full term, healthy, and developing normally by parent report.

Stimuli

Subjects detected a 1 kHz tone, 300 ms in duration, with 16 ms rise and fall times, in the presence of a 2500 Hz low-pass noise. The noise was presented continuously throughout the session. The level of the tone was 50 dB Sound Pressure Level (SPL) for the infants and 42 dB SPL for the adults. The spectrum level of the noise was always 20 dB SPL during trials. These levels were chosen to allow detection of the tone with a d^′=1, based on the results of a previous study (Bargones et al., 1995).

In the cue conditions, the cue indicated when the trial began; when the tone was presented, its onset was at a fixed interval after the cue. The cues were always acoustic cues. Acoustic cues were chosen, because even young infants focus attention within a sensory modality and respond less to stimulation in a modality other than the one on which they are focused (Richards, 2000). Thus, it seemed preferable to present the cue in the same modality as the target stimulus. Two different cue conditions and a no-cue condition were tested. Each subject was tested in only one of these conditions. Data collection for one cue condition was completed before data collection for the other cue condition. To make sure that any effect of the cue type was not due to changes in testers or instrumentation, a separate group of listeners was tested in the no-cue condition for each cue condition.

The stimulus conditions are depicted in Fig. 1. In the noise-decrement-cue condition, a reduction in background noise level was the cue. The spectrum level of the noise was 26 dB SPL until trial onset. At trial onset, the spectrum level dropped to 20 dB SPL. When the tone was presented, its onset was 500 ms after trial onset. Thus, the cue was the drop in the level of the background noise. In the noise-increment-cue conditions, the cue to trial onset was a 200 ms, 10 dB increment in the background noise. When the tone was presented, its onset was 500 ms after the offset of the noise increment. In the corresponding no-cue conditions, no cue was presented to the listener to mark the onset of the trial, but when the tone was presented its onset was 500 ms after trial onset. Previous studies indicate that infants of this age can easily detect noise level changes of the magnitudes used in the noise cue conditions (Berg and Boswell, 1998; Werner and Boike, 2001).

Stimulus configurations in experiment 1. A broadband noise is presented continuously (gray shading). Tones (black rectangles) are presented on tone trials. In the no-cue condition (bottom panel), a tone is presented 500 ms after the observer starts the trial, with no indication to the listener that the trial is underway. In the noise-decrement cue condition, the spectrum level of the background noise drops from 26 to 20 dB SPL, 500 ms before the tone and remains at 20 dB for the duration of the trial. In the noise-increment cue condition, the spectrum level of the background noise increases from 20 to 30 dB for 200 ms, and then returns to 20 dB, 500 ms before the tone. The trial configurations on no-tone trials are the same as the tone trial in each condition, except that no tone is presented.

Data collection in the noise-decrement-cue conditions was completed first. A noise decrement, rather than an increment, was chosen as the cue to ensure that the cue did not mask the signal tone. Subsequent work showed that forward masking of the tone by the cue would not be expected in this condition (Werner, 1999). An unexpected result in the noise-decrement-cue conditions led us to repeat the study using the noise-increment cue. The number of subjects tested in the noise-decrement cue, and no-cue conditions were 26 and 32, respectively, for the infants, and 28 and 25, respectively, for the adults. The number of subjects tested in the noise-increment cue, and no-cue conditions were 21 and 19, respectively, for the infants, and 20 and 20, respectively, for the adults.

The stimuli were presented to the subject’s right ear using an Etymotic ER-1 insert earphone. A computer controlled the presentation of the stimulus and stored the results on each trial. Testing took place in a sound-attenuating booth.

Procedure

Infants’ detection of the tone was measured using the observer-based psychoacoustic procedure (Werner, 1995). The infant, with ear tip in place, was seated on a parent’s lap in the booth. An assistant, seated to the infant’s left, manipulated toys on a table in front of the infant to maintain the infant’s gaze forward. Both the parent and assistant listened to masking sounds over circumaural headphones so that they could not hear any of the sounds presented to the infant. Two mechanical toys in dark Plexiglas boxes with lights were placed to the infant’s right; these toys were activated to reinforce the infants’ response to the tone as described below. An observer watched the infant through a one-way window and on a video monitor. The observer pushed a button interfaced to the computer to begin a trial when the infant was quiet and attentive, without knowing whether a tone would be presented or not.1 Both “tone trials” and “no-tone trials” were presented. Trials were 4 s in duration. If the observer judged on the basis of the infant’s behavior that a tone had occurred, she pushed a button to indicate a “yes” response. If the observer was correct in judging that a tone had occurred, one of the mechanical toys in the test booth was illuminated and activated as reinforcement for the infant. The observer received feedback at the conclusion of all trials. The same general procedure was used to test adults. The adult subject was told to respond “when you hear the sound that will make the toy come on.” An assistant outside the booth recorded the adult’s responses, and a mechanical toy was activated when a response was recorded during a tone trial.

At the beginning of each session, a brief (approximately five trials) training phase was completed during which the tone was clearly audible and the reinforcer toy was turned on after every tone trial. This procedure demonstrated to the infant that the tone (or cue+tone) was associated with the toy. The toy was never turned on after no-tone (or cue alone) trials. In the second training phase, the tone remained clearly audible, tone and no-tone trials were equally probable, and the reinforcer toy only came on if the observer correctly identified a tone trial. The infant∕observer team or the adult subject was required to achieve 80% correct on both tone and no-tone trials. Thus, in the cue conditions, the infant learned to respond to cue+tone, but not to the cue alone. Similarly, the observer learned to differentiate the infant’s response to the cue+tone from the infant’s response to the cue alone. This phase took about 22 trials to complete in all conditions. Once training criterion had been met, 35 test trials were presented, including 15 tone trials, 15 no-tone trials, and 5 probe trials. On probe trials, the level of the tone was chosen to be readily detectable, 51 dB SPL for adults and 60 dB SPL for infants. A subject’s data were only used if at least three of the five probe tones were detected. This provided a check that the subject was “on task.”

If a subject reached training criterion but did not complete all test trials, a new block of test trials was completed in a subsequent visit after an abbreviated training procedure.

Sensitivity was expressed as d^′. Hit or false alarm rates of 1 or 0 were adjusted by 1∕2n where n is the number of trials (Macmillan and Creelman, 2005). Levene’s test of homogeneity of variance was significant in the dataset as a whole (with both infants and adults, p<0.0001), but it was not significant within age groups (both p>0.4). For that reason, the effect of the cues on d^′ was analyzed within age groups, and the pattern of effects compared between age groups.

Results

In the no-cue conditions, both infants and adults generally achieved a d^′ around 1.0, as expected (e.g., Bargones et al., 1995). Mean d^′ in the noise-decrement no-cue group was 1.28 (SD=0.83) for infants and 1.16 (SD=0.64) for adults; in the noise-increment no-cue group mean d^′ was 0.98 (SD=0.60) for infants and 1.15 (SD=0.70) for adults. The differences between the two no-cue groups were not statistically significant by t-test [t(49)=1.4, p=0.17, d=0.4) for infants; t(43)=0.03, p=0.98, d=0.01 for adults]. The data of the two no-cue groups were therefore pooled within age groups in the remainder of the analyses.

Average d^′ in the noise-decrement-cue (dark gray bars), noise-increment-cue (light gray bars), and no-cue (white bars) conditions is plotted in Fig. 2, with infants’ data on the left and adults’ data on the right. In each cue condition, adults’ d^′ was greater than in the no-cue condition. One-way analysis of variance (ANOVA) indicated a significant effect of cue type (noise-decrement cue, noise-increment cue, no cue) for the adults [F(2,90)=4.81, p=0.1, η²=0.10]. Bonferroni post hoc pairwise comparisons showed that d^′ was significantly higher in each of the cue conditions than in the no-cue condition (both p<0.04). The two cue conditions were not significantly different for adults (p>0.99).

Mean d^′ as a function of cue condition in experiment 1 for infants and adults, ±1 SEM.

Infants’ d^′ in each cue condition, however, was actually a little lower than that in the no-cue condition; clearly neither cue improved infants’ detection of the tone. For the infants, the effect of cue type was only marginally significant by one-way ANOVA [F(2,95)=2.70, p=0.7, η²=0.05) Bonferroni post hoc pairwise comparisons showed a marginally significant difference in d^′ between the noise-increment-cue and no-cue condition (p=0.083). The noise-decrement-cue and no-cue conditions were not statistically different (p=0.55), and the two cue conditions were not significantly different (p>0.99).

Adults are typically very conservative in their response bias in an “infant procedure,” while infants∕observers tend to be unbiased or a little liberal in their response bias in the same procedure (e.g., Werner and Marean, 1991). A cue might be expected to change response bias, although it is not clear that infants and adults would be affected in the same way. To examine the effect of the cue on response bias, bias was described as

c = 0.5 [z (hit rate) + z (false alarm rate)]

(Macmillan and Creelman, 2005). Hit or false alarm rates of 1 or 0 were adjusted by 1∕2n, where n is the number of trials. Positive values of c indicate a conservative bias, while negative values indicate a liberal bias. In the no-cue conditions, infants were somewhat liberal responders, while adults were quite conservative, as expected. Mean c in the noise-decrement no-cue group were −0.21 (SD=0.33) for infants and 1.05 (SD=0.39) for adults; in the noise-increment no-cue group −0.18 (SD=0.37) for infants and 1.15 (SD=0.37) for adults. The differences between the two no-cue groups were not statistically significant by t-test [t(49)=−0.32, p=0.74, d=0.1 for infants; t(43)=−0.88, p=0.38, d=0.26 for adults]. The data of the two groups were therefore pooled within age groups in the remainder of the analyses.

Average c in the noise-decrement-cue (dark gray bars), noise-increment-cue (light gray bars), and no-cue (white bars) conditions is plotted in Fig. 3, with infants’ data plotted on the left and adults’ data plotted on the right. As noted, infants tended to be a little liberal, while adults tended to be conservative. Both infants and adults tended to respond more liberally when a cue was provided, although the effect appears smaller for infants than for adults.

Mean c as a function of cue condition in experiment 1 for infants and adults, ±1 SEM.

One-way cue-type ANOVA of c indicated a significant effect for adults [F(2,90)=8.87, p=0.0003, η²=0.16]. Bonferroni post hoc pairwise comparisons indicated that both types of cues made adults significantly more liberal (p<0.0001 for noise-decrement cue, p=0.025 for noise-increment cue). For infants, the one-way cue-type ANOVA of c was marginally significant [F(2,95)=0.246, p=0.09, η²=0.05), but the Bonferroni post hoc pairwise comparisons were not significant (p=0.173 for noise-decrement cue, p=0.305 for noise increment cue). Thus, the cues clearly made adults more liberal in their response bias, and although infants’ bias changed in the same direction with the cues, the effect was not statistically significant.

Discussion

The results of Experiment 1 indicate that a cue to trial onset led to improved performance in adult listeners, but not in infant listeners. For adults, this result held whether the cue was a decrement in the level of the background noise or an increment in the level of the background noise. Cue type also made little difference to infants, although some weak evidence suggested that the noise-increment cue could be detrimental to infants’ performance.

As expected, adults’ pure-tone detection was better when temporal uncertainty was reduced. This result is qualitatively consistent with previous reports (Egan et al., 1961; Lappin and Disch, 1973; Lisper et al., 1977).

The results for infant listeners suggest that infants do not benefit from a reduction in temporal uncertainty. It is possible that some other sort of cue might improve infants’ detection performance. We avoided using a visual cue in the current experiments so that infants would not be required to divide attention between sensory modalities (e.g., Richards, 2000). However, a recent study suggests that visual information can facilitate infants’ ability to separate an auditory target from a masker. Hollich et al. (2005) tested infant’s ability to recognize a word in a background of competing speech. If the target word was paired with a video of a face saying the word, or even with an “oscilloscopelike trace” that was temporally synchronized with the word, infants recognized the word at a lower signal-to-background ratio than they did when no visual information was provided, or if the visual display was not synchronized with the target word. This suggests that a visual cue could improve infants’ detection of a tone, even if an auditory cue does not.

To benefit from any cue, the listener must (1) learn that the cue predicts the possible occurrence of the signal, (2) learn and remember when the signal could occur following the cue, and (3) be able to listen selectively at the predicted time. One explanation for the cue’s failure to improve infant’s detection is that infants do not form expectancies that one event will follow another. Casual observation of infants suggests this is unlikely. Furthermore, it is well established that infants develop expectations that one visual event will follow another (e.g., Haith et al., 1988). Another explanation is that while infants develop expectancies and attempt to direct listening to the appropriate time, their ability to estimate or to remember the interval between events is highly inaccurate. In that case, their uncertainty about the timing of the signal might not be reduced by a cue. A final explanation is that infants are not able to listen at a particular time for an expected event. That would be consistent with their listening along the frequency dimension: Infants detect expected and unexpected frequencies equally well, while adults detect expected frequencies better than unexpected frequencies (Bargones and Werner, 1994).

Experiment 2 was a more direct test of infants’ ability to form and use temporal expectancies about sounds. The probe-signal method (e.g., Greenberg and Larkin, 1968; Scharf, 1987; Schlauch and Hafter, 1991; Dai and Wright, 1995; Arbogast and Kidd, 2000) was used to determine whether infants detect sounds presented at expected times better than they detect sounds presented at unexpected times. In this method, listeners detect a tone. On 75% of the trials, the tone is presented at one temporal position in the observation interval; on the remaining trials, the tone is presented before or after that time. The level of the tone is set so that it is detectable on, perhaps, 80% of the trials if it is presented at a fixed time. Adults detect the tone presented at the more common temporal position more often than they detect the tones at the other temporal positions (Leis Rossio, 1986; Chang, 1991; Chang and Viemeister, 1991), just as they detect tones at a more common frequency (e.g., Schlauch and Hafter, 1991), duration (Wright and Dai, 1994; Dai and Wright, 1995), or spatial position (Arbogast and Kidd, 2000) more often than they detect tones at other frequencies, durations, or spatial positions. We have previously used the probe-signal method to examine infants’ “listening bands” in frequency (Bargones and Werner, 1994). We refer to the effect of temporally selective listening as a “listening window.”