Skip to main content
iScience logoLink to iScience
. 2024 Jul 30;27(9):110611. doi: 10.1016/j.isci.2024.110611

Opposing serial effects of stimulus and choice in speech perception scale with context variability

Carina Ufer 1,2,3,, Helen Blank 1,2
PMCID: PMC11382034  PMID: 39252961

Summary

In this study, we investigated serial effects on the perception of auditory vowel stimuli across three experimental setups with different degrees of context variability. Aligning with recent findings in visual perception, our results confirm the existence of two distinct processes in serial dependence: a repulsive sensory effect coupled with an attractive decisional effect. Importantly, our study extends these observations to the auditory domain, demonstrating parallel serial effects in audition. Furthermore, we uncover context variability effects, revealing a linear pattern for the repulsive perceptual effect and a quadratic pattern for the attractive decisional effect. These findings support the presence of adaptive sensory mechanisms underlying the repulsive effects, while higher-level mechanisms appear to govern the attractive decisional effect. The study provides valuable insights into the interplay of attractive and repulsive serial effects in auditory perception and contributes to our understanding of the underlying mechanisms.

Subject areas: Neuroscience, Behavioral neuroscience, Social sciences

Graphical abstract

graphic file with name fx1.jpg

Highlights

  • Disentangling stimulus and choice history reveals opposing effects on perception

  • Choice history attracts and stimulus history repels current speech perception

  • Repulsive stimulus history effect only revealed after accounting for choice history

  • Stability in speaker context strengthens repulsive effects of stimulus history


Neuroscience; Behavioral neuroscience; Social sciences

Introduction

Perception of incoming sensory information is profoundly influenced by preceding sensory input. In the visual domain, numerous studies have provided evidence that visual perception tends to be biased by previously encountered similar stimuli, a phenomenon labeled as serial dependence.1,2,3,4 However, while serial effects should also be of great importance in audition, as the auditory signal inherently unfolds in the temporal dimension,5 reports of serial dependence in the auditory domain are sparse (but see6,7). Serial dependence has been reported in a positive direction, i.e., attracting the current percept toward the previous stimulus (attractive effect8,9,10), and in a negative direction, i.e., repelling the current percept away from the previous stimulus (repulsive effect11,12,13,14). These opposing serial effects have been explained with the properties of the presented stimuli. It has been suggested that repulsive effects are more likely to result from exposure to strong, salient, high-contrast, and long-duration stimuli, whereas brief, less salient, and low-contrast stimuli are associated with attractive effects.1 Attractive and repulsive stimulus history effects may serve different goals: while repulsive serial dependence, akin to adaptation-like processes, might help to weight previous information about the recent past to maximize sensitivity to change,15,16,17,18 attractive serial dependence might follow a dynamic form of integration whereby the tendency of the physical world to remain stable or change continuously is exploited.3,10

As attractive serial dependencies of the previous stimulus have been observed over many perceptual dimensions — for example, for olfactory stimuli19 along with various visual features like orientation,10 motion,20 numerosity,6,21 position,22 gender,23,24 age,24 face attractiveness,25 food palatableness,26 and art aesthetics27 — it has been argued that it constitutes a domain-general mechanism. This ubiquity led to the hypothesis that serial dependence serves as a functional mechanism integrating similar stimuli over time to maintain perceptual coherence as our external world exhibits temporal stability. However, the view that attractive serial dependence leverages to maintain a continuous perceptual experience has recently been challenged.28 If attractive serial dependence serves to smooth out variations in sensory inputs, the attractive effect should scale with the statistical variability of the environment. Thus, in contexts with high temporal correlations between stimuli, i.e., low variability, the current stimulus should be perceived as stable through attraction to the previous percept. However, contrary to this expectation, a recent study discovered that such correlations across visual stimuli repelled the current sensory input away from the previous one.28 Interestingly, in contrast to the repulsive serial dependence of the stimulus history, an attractive serial effect of the previous choice was observed throughout different temporal correlations.

Crucially, in experiments that observed attractive serial dependence from the previous stimulus, effects by the response were not always controlled for.6,19,25,26 Thus, an attractive dependence from the previous choice could superimpose an effect overshadowing the potential repulsive serial dependence from the stimulus history. Indeed, when the previous response was taken into consideration, many experiments identified this attractive serial effect of choice.14,29,30,31,32,33,34 Moreover, a recent re-analysis of published data revealed that the initially found attractive serial dependence from the previous stimulus could be the result of an attractive decisional process overshadowing a repulsive effect of the previous stimulus.35 With this mounting evidence that two opposing history effects of stimulus and choice counteract each other, we tested these effects in the lesser-studied auditory domain. Specifically, the current study aimed to investigate how previous stimulus features (here speech and voice) together with the previous choice influence the perception of the current auditory stimulus in different contexts of environmental variability.

While temporal context effects have a long-standing history in auditory perception,36,37,38,39 they have little impact on the research field of serial dependence. Our experiment bridges the gap between those two disciplines. Most studies investigating context effects on auditory perception have focused on negative effects, where the current percept is repelled away from the previous sound.40 These studies typically used inducer sounds that did not require any decision and were manipulated to elicit a repulsive effect based on deviating stimulus features.40 In contrast, the few studies investigating the perception of auditory features within the framework of serial dependence have reported an attractive effect toward the previously judged stimulus. Importantly, in these studies, the previous stimulus typically required a response and deviated more gradually from the current stimulus.41,42 Until now, both research fields studying temporal context effects in the auditory domain focused on the influence of previous sensory features but neglected the impact of the previous choice (if it was present). We aimed to test whether these contradicting perceptual context effects could be reconciled by showing repulsive auditory effects in setups of serial dependence after controlling for an attractive choice effect. The impact of choice history is also corroborated by a recent study finding an attractive serial choice effect in audition.43 It raises the importance of controlling for the previous choice in gaining knowledge about auditory processing. Therefore, we hypothesized finding a repulsive serial dependence of auditory features after controlling for the attractive choice effect. Evidence for two opposing processes of sensory and choice history in our auditory study would extend the well-studied effects in the visual domain and contribute to our understanding of generalized, domain-general mechanisms underlying serial dependencies.

In the present series of experiments, we examined the serial dependence of the current auditory stimulus on the preceding stimulus, i.e., speech and voice features, while considering the previous response. Participants listened to words spoken by different speakers. To induce perceptual difficulty and increase the potential of detecting opposing serial effects of stimulus and choice history, we used morphing to create a continuum between two speech sounds, specifically the vowels ‘u’ and ‘o’ contained in the word stimuli. Unlike a clear vowel, a morphed vowel is not consistently perceived as the most probable category based on its physical sensory features. This means that an ambiguous stimulus, such as 75% ‘o’, would be perceived as ‘o’ most of the time and occasionally as ‘u’ thereby leading to response variability for the identical stimulus features. If a stimulus was always perceived as the same category, stimulus features and response would be fully conflated and thus also the stimulus and the choice history. Testing the perception of intermediate steps of the morph continuum enabled us to disentangle the hypothesized opposing effects of the previous choice and stimulus features. A repulsive serial dependence effect of speech should repel the percept of the current stimulus away from the previously heard morph level, but not from the previous choice. Additionally, to test for serial dependence of voice features, we manipulated two salient and prominent voice qualities that account for considerable variability between speakers: the pitch of a voice and the vocal resonance frequencies (formants), which contribute to the distinct timbre and brightness of a voice.44 These two voice features influence speech classification in opposing directions. For example, on the vowel continuum between ‘u’ and ‘o’, a high pitch shifts the perception towards ‘u’, whereas high formants shift the perception towards ‘o’.45,46 A repulsive serial dependence of previous voice features should invert this relationship so that a high previous pitch should shift the perception towards ‘o’, while high previous formants should shift the perception towards ‘u’.

To investigate whether the context effect found in the visual domain28 translates to the auditory domain, we tested whether the serial dependence on previous voice features scales negatively with increasing stability in the context environment. To that end, we manipulated the statistical context variability in which the voice properties were presented. In the most variable context, different speakers were mixed randomly (Experiment 1), in the context with intermediate variability, speakers were presented in short sequences (Experiments 2a and 2b), and in the context with the least variability, speakers were grouped in blocks (Experiment 3, see Figure 1).

Figure 1.

Figure 1

Three experimental setups

(A–C) Across different experiments, we manipulated the variability of auditory voice features (i.e., pitch and formants) from (A) high context variability (mixed), (B) intermediate context variability (short sequences), to (C) low context variability, i.e., stable (block). Two voice features, the formants of a voice (purple color scale) and the pitch (blue color scale) were shifted up or down to evoke the effect of different speakers. Within each speaker, the auditory speech signal (one instance of nine word-pairs) could vary in ambiguous steps between ‘u’ and ‘o’ (morph level, orange-red color scale). Participants were asked to indicate whether they perceived ‘o’ or ‘u’ as a binary response after the audio signal via keypress.

Results

Repulsive serial dependence of stimulus features and attractive serial dependence of choice

To investigate the serial effects in the auditory domain, specifically of voice and speech properties, as well as the simultaneous influence of the previous choice, we recorded participants’ categorical responses, which could be either ‘u’ or ‘o’, based on morphed word stimuli spoken by different speakers. As we were specifically interested in serial effects, i.e., the influence of the preceding stimulus and choice, we accounted for the properties of the current stimulus (i.e., such as morph level, pitch, and formant manipulation) by adding them as baseline predictors to a generalized linear mixed model (GLMM). The current morph level should have a positive effect, i.e., a positive coefficient, on the perception of ‘o’, similar to the effect expected by the current formants. The baseline coefficient of the current pitch was expected to be negative since high pitch values decrease the probability of perceiving ‘o’. Next, to assess the serial dependence of previous stimulus features, i.e., the previous morph level, pitch, and formants, were added as predictors of the current choice. If a repulsive effect was apparent, the baseline effects should reverse, leading to a negative coefficient for the previous morph level and the previous formants and a positive coefficient for the previous pitch. Additionally, the previous choice was added to the model to address its possible attractive influence, shown by a positive model coefficient, on the current choice. Finally, to test for the effect of context variability, which was manipulated across three separate experimental setups, we included interactions of the context (i.e., experimental setup) with all predictors. All baseline predictors of the current stimulus showed the expected pattern of a positive slope for the current morph level (all p < 0.001) and the current formants (all p < 0.001) and a negative slope for the current pitch (all p < 0.001). As hypothesized, we found a reversion of these effects, i.e., a repulsive serial dependence for all stimulus-related parameters, i.e., previous morph level (low context variability −0.27 ± 0.05 [-0.36 -0.18], z = −5.02; intermediate context variability: −0.22 ± 0.03 [-0.27 -0.17], z = −8.37; high context variability: −0.13 ± 0.05 [-0.22 -0.04], z = −2.93), previous formants (low context variability: −1.23 ± 0.07 [-1.36 -1.10], z = −18.63; intermediate context variability: −0.58 ± 0.03 [-0.65 -0.52], z = −18.64; high context variability: −0.23 ± 0.05 [-0.33 -0.13], z = −4.32), and previous pitch (low context variability: b ± SE = 0.96 ± 0.06, CI [0.83 1.08], z = 15.44; intermediate context variability: 0.49 ± 0.03 [0.44 0.55], z = 16.89; high context variability: 0.14 ± 0.05 [0.04 0.24], z = 2.84; all p < 0.01). Note, that the positive slope for the previous pitch still indicates a repulsive effect since the baseline influence of the current pitch on the perception of the vowel ‘o’ is negative. In contrast to the stimulus features and in line with our hypothesis, we observed an attractive serial dependence of choice, attracting the current choice toward the previous one (low context variability: 0.26 ± 0.06 [0.13 0.38], z = 4.08; intermediate context variability: 0.43 ± 0.03 [0.36 0.49], z = 13.29; high context variability: 0.20 ± 0.06 [0.08 0.33], z = 73.18, all p < 0.01). This pattern of negative and positive effects emerged independently of the speaker variability context (see also Figure 2B). Consistently throughout all experiments, the effect of current voice features reversed for the previous voice features, i.e., a repulsive serial dependence. Also, in the speech dimension, the previous morph level influenced the current choice in a repulsive way, such that if the previous morph was an ‘o’, the current morph was more likely to be perceived as an ‘u’. At the same time, the previous choice positively shaped the current choice, attracting it toward what had previously been answered.

Figure 2.

Figure 2

Effects of the voice, speech, and choice within different speaker variability contexts

(A) Effects of current stimulus features. Influence of the current stimulus’ voice features on the speech perception of ‘o’ and ‘u’ for contexts with different levels of speaker variability (solid, dashed, dotted lines). High pitch leads to a lower probability of ‘o’ responses (first panel), whereas high formants lead to a higher probability of ‘o’ responses (second panel).

(B) Effect of previous stimulus features and previous choice. Influence of the previous stimulus features and the previous choice on the perception of ‘o’ and ‘u’ in contexts with different levels of speaker variability (solid, dashed, dotted lines). The effects of the previous pitch, formants, and morph level on the perception of the current stimulus reverse: A high pitch of the previous stimulus leads to a lower probability of ‘o’ responses (first panel in B), whereas high formants of the previous stimulus lead to a lower probability of ‘o’ responses (second panel) and when the previous morph level was shifted toward ‘o’ the probability of ‘u’ responses on the current trial is higher (third panel). The previous choice exhibited an attractive effect on the perception of the current stimulus, i.e., a higher probability of responding ‘o’, when the previous choice was ‘o’ (fourth panel). All psychometric curves are plotted based on the model coefficients and their standard errors.

(C) Model coefficients of previous features. The model coefficients show repulsive effects for the previous sensory features and an attractive effect of the previous choice. Repulsive effects for the sensory features of the previous stimulus (i.e., pitch, formants, and morph level) scale linearly with the levels of variability in speaker context (displayed as square, diamond, dot). The attractive effect for the previous choice exhibits a quadratic trend. All plots are based on the model coefficients and error bars (C) and shaded areas (A, B) indicate their standard errors.

Context variability influences the contrastive stimulus effect linearly and the attractive choice effect quadratically

We manipulated the speaker context variability at three different levels: speakers were presented in mixed blocks (high context variability), short sequences (intermediate context variability), or larger blocks (low context variability). To investigate the different serial effects of both sensory stimulus features and choice depending on context variability, context variability was added as a factor in the GLMM, and the predictor variables of the current and previous trials were allowed to vary freely across contexts. A linear contrast on the parameter estimates revealed that the repulsive serial dependence of voice features increased with context stability. This linear trend was apparent in both voice features of a speaker, i.e., pitch (b ± SE = −0.82 ± 0.08, z = −10.32, p < 0.001) and formants (b ± SE = 1.00 ± 0.08, z = −11.85, p < 0.001). Thus, serial dependence of the voice features scaled linearly with the manipulated context variability. Equivalently, the effect of the previous speech feature, i.e., the previous morph level, also followed a linear trend and had the greatest repulsive influence when speakers were presented in large blocks (b ± SE = 0.14 ± 0.06, z = 2.12, p = 0.03). Hence, all features of the previous stimulus exhibited the greatest impact in the low variability context and the weakest impact in the high variability context.

As reported earlier, the effect of the previous choice was positive throughout the different levels of context variability, but unlike the stimulus features, its magnitude revealed a quadratic effect across contexts (b ± SE = 0.40 ± 0.11, z = 3.62, p < 0.001). While the previous choice had the greatest impact when context variability was intermediate (speakers presented in short sequences), its effect was reduced in the low and high context variability conditions (mixed and blocked speaker presentation).

To understand this quadratic effect of context variability, we explored the possibility that participants perceived the short sequences in the intermediate speaker variability context as distinct units, similar to a chunk. In this context, participants could detect the beginning and the end of a sequence which may facilitate a chunking of physically similar trials within the sequence, in contrast to the low and high variability contexts lacking clear sequence boundaries, as the “sequences” are either very long (n = 207) or very short (the same speaker never appeared after itself, i.e., n = 1). Therefore, this may lead to a higher choice bias for within-sequence trials which are perceived as instances of the same chunk opposed to between-sequence trials that are perceived as instances of different chunks. If the increased attractive serial dependence in the context of intermediate speaker variability indeed reflects a chunking mechanism, choices within a sequence should be more influenced by the previous choices than choices made between different sequences. To test this possible explanation, we performed an additional analysis on the data of the intermediate variability context by adding an interaction term for the previous choice and the binary sequence change (i.e., whether the previous choice was made within or between sequences). Note that in the low variability context with large speaker blocks, too few between-sequence trials were available to test this hypothesis there as well. Our model revealed that indeed a significant interaction between the influence of the previous choice and sequence change was present. When the choice was made within a sequence, the attractive serial dependence became stronger (0.47 ± 0.03 [0.41 0.54], z = 13.76, all p < 0.001), while it was comparable in magnitude with the low and high speaker variability contexts for choices between sequences (0.25 ± 0.05 [0.16 0.34], z = 5.49, all p < 0.001).

The negative effect of the previous morph turns positive when the choice is unaccounted for

Numerous experiments reported an attractive serial dependence of the previous stimulus features when previous decisional processes were not taken into account. This phenomenon can be explained by the intricate interplay of mutually opposing repulsive and attractive effects. When we plotted our raw data against the morph level of the previous stimulus without considering the previous choice, the repulsive effect of the previous morph level, as identified by the GLMM, was indeed not visible (Figure 3A, first panel). However, when we also accounted for the counteracting attractive serial dependence of choice, the formerly obscured negative effect of the previous morph level was revealed in the raw data (Figure 3A, second and third panel). As both effects pull the percept of the current stimulus in opposing directions with comparable weight, the positive choice effect overshadows the negative effect of the previous morph level completely.

Figure 3.

Figure 3

The repulsive effect of the previous morph turns more attractive when the choice is unaccounted for

(A) Repulsive effect of previous morph level hidden by attractive choice effect. Without controlling for the previous choice, no repulsive effect is evident in the raw example data from the low speaker variability context (first panel). When the data are split by the previous choice, a clear repulsive effect of the previous morph level can be seen in the raw data (second and third panel). Additionally, the attractive effect of the previous choice is apparent as the curve is shifted above toward ‘u’ (i.e., most data points fall below the mean curve of the undivided data) when the previous choice was ‘u’ (second panel) and shifted toward ‘o’ (i.e., most data points fall above the mean curve of the undivided data) when the previous choice was ‘o’ (third panel). Logistic regression lines approximate the raw data points (dots) averaged across participants and their SEMs as shaded area.

(B) Model comparison and their β coefficients representing the influence of the previous morph level. First panel: The current stimulus model (1.), neglecting any prior history, outperforms the pure choice history model (2.) but is surpassed by the models incorporating either the previous choice (3.) or the previous stimulus features (4.). The best-fitting model is the full model (5.) accounting concomitantly for both the previous stimulus features and the previous choice. Second panel: In the model not considering the previous choice (4.), the β coefficients for the previous morph level show a shift toward the positive (attractive) direction, indicating a conflation of the previous morph level and previous choice (top). Conversely, a distinct repulsive effect for the β coefficients of the previous morph level revealed in the model (5.), which accounts for the previous choice scaling with the speaker context variability (bottom). Error bars represent the standard error of the model coefficients. ∗ indicates ΔBIC/AIC >10.

We fitted an additional model, which deliberately excluded the previous choice as a predictor, to see how this affected the parameter estimates of the previous morph level. We expected to see a positive shift of the parameter estimates, resembing the here unaccounted effect of the previous choice on the current percept. When the previous choice was not included as a predictor in the model, the evidence for the repulsive effect of the previous morph level was mixed (Figure 3B, second panel). It varied across the contexts variabilities, so that its coefficient was significantly positive when speakers were presented in short sequences (b ± SE = 0.06 ± 0.02 [0.02 0.09], z = 3.23, p = 0.001), and negative when speakers were presented in blocks (b ± SE = −0.06 ± 0.03 [-0.11 -0.01], z = −2.52, p = 0.01) or mixed (b ± SE = −0.01 ± 0.03 [-0.07 0.04], z = −0.47, p = 0.64). Crucially, all parameter estimates for the previous morph level were shifted toward a more positive value and a similar quadratic trend as for the influence of the previous choice parameter can be seen.

To confirm that the model accounting for both, the influence of the previous stimulus and previous choice, was the best fit for the data, we compared five models: a model that included only the current stimulus features (Equation 1), a model that only relied on the previous choice (Equation 2), a model with the current stimulus features and the previous choice (Equation 3), a model with the current and the previous stimulus features (Equation 4), and the full model with serial dependence for both the stimulus and the choice (Equation 5). To determine the most suitable model, we used the Bayesian Information Criterion (BIC) and Akaike Information Criterion (AIC) for model comparisons, which balance the trade-off between goodness of fit and complexity. A BIC/AIC difference of Δ > 10 indicates very strong evidence in favor of the smaller value.47 The model that predicted participants’ responses only based on the previous choice performed the worst, and the model containing only predictors about the current stimulus improved the model fit tremendously (ΔBIC > 49000; ΔAIC > 49000). In addition to the current stimulus features, the previous choice explained further variance in predicting participant’s responses (ΔBIC > 390; ΔAIC > 420). The goodness of model fit improved further compared to the current stimulus model when including the previous stimulus features as predictors (Δ BIC > 1100, Δ AIC = 1200), and even more when adding the previous choice (ΔBIC > 2300, ΔAIC > 1200). Thus, the full model comprising previous stimulus features and the previous choice captured our data best and outperformed all other models (Figure 3B, first panel). In sum, this provides very strong evidence for serial dependence effects and that both the attractive serial dependence of choice and the repulsive serial dependence of morph level coexist and pull the percept into opposing directions.

Discussion

In this study, we investigated the role of serial dependence in the auditory domain by manipulating the voice and speech properties of the sensory input presented in sequences with varying levels of variability. We identified two concurrent opposing serial effects: a repulsive serial dependence of the stimulus history and an attractive serial dependence of the choice history.

This finding is in contrast to many previous experiments in the visual domain, which reported an attractive serial dependence of stimulus history.6,8,10,19,20,24,25,27,42,48,49 However, these studies often neglected possible choice effects. We disentangled the serial dependence of perceptual features from those stemming from decision processes by applying a model that included both stimulus and choice history. In line with previous work,30 this comprehensive model demonstrated superior explanatory power to all other tested models, thus corroborating the plausibility of a unified account for both perceptual and decisional dependencies (Figure 3B, first panel). The direction of the serial dependence effects is in agreement with evidence from visual studies that considered the possibility of two serial effects of sensory history and choice history.7,13,14,28,30,31,33,50 One potential reason why many previous studies did not find a repulsive, but attractive serial dependence of sensory history may be that two concurrent history effects are working in opposing directions.35 This conclusion was further supported by a re-analysis of four previously published datasets which initially pointed toward an attractive effect of stimulus history, however, when the model additionally controlled for a possible effect of the previous choice, no reliable evidence for this effect was apparent.35 Likewise, our results demonstrate a conflated pattern in the influence of the previous morph level within the model that neglects the previous choice, resembling a combination of the repulsive sensory effect and attractive choice effect (Figure 3B, second panel). The combination of the two opposing serial effects has been determined to be additive,30 providing a potential explanation for why repulsive serial dependence appears to emerge from strong, salient, high-contrast, and prolonged stimuli, as the repulsive effect may exert a net dominance over the serial choice effect.1 Efforts have been undertaken to mitigate serial effects of choice by instructing participants to withhold their responses in some trials. This still led to attractive serial dependence in some studies10,22,31,51,52,53 while in others the attractive effect disappeared.31,54,55,56,57 The absence of an executed motor response may not necessarily rule out a (potentially implicit) perceptual decision, or the reliance on the most recent overtly realized decision, which could still be sufficient to induce an attractive serial dependence.31,35 This is supported by the finding that the attractive serial effect disappeared when participants were explicitly instructed to ignore the stimulus that did not require a response.31 Crucially, when the previous choice is accurate (i.e., the participant chose the category that matched the physical properties of the stimulus), the previous stimulus features and the previous choice are inherently and inevitably conflated. Such a conflated pattern may also be the reason that an attractive effect was found when the previous choice was correct, but when the previous choice was incorrect it turned repulsive.58 In sum, it is important to consider both sensory and choice history simultaneously in the analyses of serial dependence.

The effect of context variability on repulsive sensory serial dependence

Our results indicate that the repulsive serial dependence of stimulus features, i.e., voice and speech, increases with context stability. Notably, the slope of the repulsive serial dependence of the voice features was quite steep, indicating the previous stimulus had a significantly greater influence when speakers were presented in blocks, compared to short sequences, compared to the mixed presentation (Figure 2B). This serial effect induced by sensory feature history is akin to many adaptation-like repulsive effects within and beyond the auditory domain.17,18,59,60,61,62 When a stimulus or a feature of a stimulus is presented repeatedly, the sensory system adapts to the statistical context with reduced responsiveness to the stimulus feature, so that the sensitivity to changes around the stimulus feature is maximised.62 From an information-theoretic point of view, the neural response to a stimulus is scaled by its informational value: the same stimulus carries increasingly less new information about the environment with each repeated exposure, while the informational value of a deviating stimulus scales with its amount of surprise.63 This is thought to be an efficient encoding strategy by the sensory system to filter relevant signals and facilitate novelty detection.62 In our experiment, the statistical context varied in terms of speaker variability, and the repulsive effect of the stimulus history increased with low variability likely representing a heightened informational value of the deviating information. Also in visual serial dependence research, the repulsive serial dependence of stimulus history was enhanced by decreasing context variability and further with extended sequence length.28 Moreover, we find that the repulsive serial effect of the speech features became slightly stronger with the context stability of voice features, although the speech variability remained constant across experimental setups. This suggests that small differences in speech features along the morphed continuum between two vowels are discriminated best in a stable voice environment. Again, this can serve as further, though indirect, evidence for an efficient encoding mechanism that facilitates more fine-grained spectral contrast detection between confusable ambiguous morph levels with repeatedly presented speakers. In line with this, increased intelligibility and faster reaction times have been demonstrated when speakers are repeated.64 Future research manipulating the variability of the speech context while keeping the voice context stable could corroborate whether the repulsive effect of the morph level scales steeply with speech context variability.

The effect of context variability on attractive choice serial dependence

For the attractive serial choice effect, we observed a quadratic effect across the different levels of context variability. Note that we kept the decisional environment uncorrelated over all contexts in our experiment, as there was no predictability of the morph level, i.e., participants could not anticipate whether the next stimulus was an ‘u’ or an ‘o’. The current choice was biased strongest toward the previous choice in the intermediate speaker variability context, in which speakers were presented in short sequences. While the smaller choice effect in the high speaker variability context can be explained by the lowered subjective relevance attributed to the past choice due to lacking correlations in the statistical environment,65,66 this does not explain why the choice effect was also reduced in the context with low speaker variability. As an alternative explanation, the effect could be due to a chunking process during decision-making, which refers to the conscious or unconscious grouping of a set of events or stimuli to enable a more efficient encoding in comparison to when they were represented as independent elements.67,68 Our additional analyses support the hypothesis that in the intermediate speaker variability context, participants perceived the short sequences as discrete chunks, as we found a stronger bias toward the previous choice within a sequence. In the low variability context, speaker changes occurred very rarely, so participants most likely did not group the individual stimuli within blocks and if so, may have instead applied a metacognitive rule to not respond repetitively. When accounting for within-sequence biases, the attractive choice effect was comparable over the different levels of speaker context variability across the experimental setups. Previous research findings provided mixed results regarding the relationship between attractive choice history effects and variability of the context. While some research suggests that dependence on past responses increased with the stability of the environment,65,66 others indicate that the attractive choice effect diminished with greater stability.28 In our experiment, we found that the attractive effect of choice history remained consistent across different levels of speaker context variability, after deducting the within-sequence influence. This can be attributed to the indirect influence of voice context variability on the choice, although the decisional environment itself (speech morphs) were uncorrelated. Further research is needed to understand the factors that contribute to the modulation of attractive choice history effects, such as meta-cognition or task demands, in different environmental contexts.

Implementation of serial dependence

The unresolved debate regarding whether “serial dependence” operates at a lower-level sensory or a higher-level post-perceptual level may partly arise from the interplay of these two opposing serial effects.1 Several findings indicate a higher-level origin of the attractive serial dependence, supporting our finding of an attractive decisional process. Studies using traditional analysis techniques, which disregard the previous choice, demonstrated that despite presenting stimuli with the same features, the attractive serial dependence of the stimulus features disappeared when the choice focused on a different feature.33,54 The finding that the attractive serial effect relies on the decision-making process supports its interpretation as a higher-level post-perceptual effect, as well as its persistence even when stimuli are merely imagined.69 Moreover, attractive serial dependence can only be found once the stimulus is consciously perceived and its strength scales with attention.10,48 Additionally, attractive serial dependence was enhanced for higher levels of confidence in a decision as well as for longer memory delays, both processes which are commonly associated with late higher-level processing.14,29,33,70 Furthermore, generative models implementing leaky integration of the previous choice history outperformed purely stimulus-based models43 underlining the great importance of previous decisions. Also, in our data, adding the previous choice as another predictor to the model explained the data significantly better. In most experiments, including ours, only the sensory input and the resulting response output are observable, leaving the intermediate processing stages undisclosed. Since the behavioral response is the overt manifestation of perception, it remains unclear whether the attractive effect already affects processing at the perceptual level in a top-down manner, rather than emerging during post-perceptual processing. The neural signatures of serial dependence may reveal further insights into its implementation. The two opposing serial processes appear to map on different brain networks: areas in early sensory regions were shown to be involved in repelling perception away from sensory history,32,71,72 whereas frontoparietal regions were involved in the attractive effect of the previous choice.72,73 Correspondingly, several studies demonstrated this memory trace of the previous trial at different timepoints relative to the current stimulus presentation in human EEG signals.7,12,74,75,76 The accuracy of decoding the neural signature of the past response correlated positively with the strength of observed serial dependence reflecting the behavioral relevance of this neural representation of past experience.74,76 As most of the mentioned studies focused on the possibility to decode previous information from the current trial, the exact neural mechanism of how previous and current information are combined, is still unclear. Some insight comes from spiking activity analyses from neurons in the mouse brain where a strengthened correlation between the neural pattern and behavioral choice increased across the ascending cortico-thalamic hierarchy.77 This indicates a possible integration of the encoded repulsive sensory signal with attractive top-down mechanisms that finally results in an (attractive) choice behavior.

Why is there an attractive serial dependence?

Intriguingly, the serial effect consistently manifested as attractive across different experiments - regardless of whether it was interpreted as the net dominance of an attractive decisional effect over a repulsive sensory effect7,13,14,28,31,32,35,78 or as an attractive sensory-perceptual effect.6,10,22,24,25,26,27,49 An overall attractive effect has been shown to be difficult to eliminate, as it persists even in entirely uncorrelated experimental contexts, despite impairing performance.65 This raises the question why there is attractive serial dependence. A possible explanation is that in everyday perception, the past is typically correlated with the future which could lead to default top-down prediction that the current precept will resemble the previous one. Once this statistical regularity is learned,79 small negligible variabilities in the sensory input may be smoothed out thereby achieving an adaptive stable representation of the environment.3 Even though a recent study found diminished attractive serial dependence of choice history when introducing correlations in the decisional context, the analysis of the error responses showed that participants made more correct choices.28 This suggests when deliberately introducing a positive bias toward the previous choices in an experimental context, comparable to the real-world environment, attractive serial dependence renders perception more precise and veridical. In sum, serial dependence may involve a balanced interplay between a sensory encoding mechanism that is sensitive to detecting changes which results in repulsive sensory serial dependence, and the attractive decoding mechanism of this repulsive sensory evidence to filter out less meaningful variations leading to a stable attractive serial dependence of choice.

Comparing serial dependence in the visual and auditory modality

In the current study, we examined the effect of the stimulus and choice history on the perception of auditory stimuli. Pre- and post-perceptual mechanisms in the auditory domain may be intrinsically different from the visual domain, as temporal statistics of events in the external world differ for both modalities. The auditory signal is a wave that conveys information by variability over time and thus inherently needs temporal change, whereas changes in the visual environment are typically sparse and slow,80 i.e., there is more variability in the auditory domain and especially in spoken word recognition these fluctuations are meaningful, whereas in the visual domain sensory input may change on a comparative slower timescale which renders small changes as uninformative. Our results complement the traditional serial dependence literature demonstrating attractive effects of the stimulus history,1,8,10 by reconciling a repulsive effect of the previous stimulus features and an attractive effect of the previous choice (as previous studies7,13,14,28,35).

Nevertheless, the pattern of results we observed in our study, i.e., opposing effects of sensory and choice history, could also be explained by inherent differences in the auditory and visual modality. For auditory stimuli the weighting of top-down influence and bottom-up sensory evidence may be shifted, so that in contrast to visual stimuli, small deviations in auditory input are weighted stronger, leading to a higher likelihood of observing a repulsive effect. A recent comparison of visual and auditory serial dependence in duration perception showed no difference in either magnitude of the repulsive sensory or the attractive choice effect between the two sensory domains,50 suggesting that the sensory processing for vision and audition likely relies on similar mechanisms. Although the temporal statistics in the external world of both modalities are inherently different, the pattern of serial dependence is similar, showing repulsive sensory processing together with an attractive decisional effect. Hence, our findings hint at a shared mechanism operating in both vision and audition. Similar to our study, opposing serial effects in agency81 and tactile vibration perception82 support the speculative notion of a domain-general mechanism underlying serial dependence. Accounting for both sensory and choice history across several modalities will elucidate whether a general, modality-invariant mechanism for serial dependence exists.

Limitations of the study

The two-alternative forced choice task implemented here used stimuli that varied in voice (pitch and formants) and in speech features (u-o-continuum) while the binary choice discriminating between ‘u’ and ‘o’ was collected as a response measure. This categorical task may not fully encompass the response variability that may span the entire continuous morph spectrum,83 since it has been suggested that speech processing may involve a gradient mapping84 rather than a strict categorical mapping. However, a recent study design with a non-categorical pitch reproduction task revealed the same repulsive effect of the previous stimulus with an attractive effect of the previous choice.7 Hence, while the magnitude of our effects could be under- or overestimated by the dichotomous response scale, the direction of our observed effects, as well as the relation between the speaker variability contexts should be unaffected by our task. Future studies may test graded, continuous response options similar to the rating responses in the visual domain,10 presumably leading to more differentiated error patterns.

In our study, we manipulated voice features related to speech perception simultaneously with speech features. However, we measured speech perception which captured the impact of voice features only indirectly. This was possible because we previously established the direction of voice influence, i.e., the pitch and the formant frequencies, on the perception of vowel contrast.46 As hypothesized, the repulsive effect of those voice features was shown by reversed perceptual effects for current and previous voice features. Given that similar repulsive effects were noted in other studies where the response dimension directly involved pitch, it is plausible that our results generalise.7 Further, the experimental design choice to focus only on one dimension (the speech dimension) was necessary since serial dependence is thought to be feature-specific and disturbed by decisions associated with a different feature dimension.7,33

The voice features in our experiments only varied in three steps from lower to mid-range, to higher than average. Though improbable, it is still possible that smaller, more nuanced differences in the voice features would have revealed an effect in the opposing direction, since it has been argued that attractive serial dependence operates over a limited range of feature (dis-)similarity.1 In our study, a gradual manipulation of the voice features would have interfered with the speaker variability context as the speakers would not have been discriminable anymore. A more fine-grained manipulation of the voice features could be addressed by future studies focusing on the serial effect of voice features in the absence of context variability.

One further consideration in our study is that in the intermediated speaker variability condition, grouping within the sequence may have been strengthened by the presentation of static face images. This could have contributed to a stronger grouping within than between a sequence, even though the visual information was uninformative of the speech information, and therefore should not affect the choice.

Our design intentionally used a stable mapping of keys and responses because trial-wise key switches would have been confusing for participants, especially in an online experimental environment. Thus, the attractive effect of the previous choices was paralleled by the history of motor activity since in our experiment the choice and motor response are inseparable. Nevertheless, motor history should only have dampened the attractive choice effects because previous studies have shown that motor response history either had no influence on the attractive effect of the previous choice66 or that it had a repulsive effect.7,85

In conclusion, our study showed opposing serial effects on auditory vowel perception with repulsive sensory and attractive decisional effects. Extending findings from visual perception research, we identified parallel serial effects in the auditory domain, with intriguing context variability effects which suggest an adaptive sensory mechanism for repulsion and a higher-level mechanism for attraction.

STAR★Methods

Key resources table

REAGENT or RESOURCE SOURCE IDENTIFIER
Deposited data

Experimental data This paper https://doi.org/10.17605/OSF.IO/9TJ57
Stimuli This paper https://doi.org/10.17605/OSF.IO/9TJ57

Software and algorithms

R2023 R Core Team https://www.r-project.org/
RStudio RStudio Team https://posit.co/products/open-source/rstudio/
lme4 Bates et al.86 https://doi.org/10.18637/jss.v067.i01
Python 3.10 Python Software Foundation https://www.python.org/
MATLAB Mathworks https://mathworks.com/
STRAIGHT Hideki Kawahara https://github.com/HidekiKawahara/legacy_STRAIGHT
Praat Paul Boersma and David Weenink https://www.fon.hum.uva.nl/praat/
Prolific Prolific https://www.prolific.com/
Pavlovia Open Science Tools https://pavlovia.org/
jsPsych 6.3 Josh de Leeuw https://www.jspsych.org/6.3/
Code for analyses This paper https://doi.org/10.17605/OSF.IO/9TJ57

Resource availability

Lead contact

Further information and requests for resources should be addressed to the lead contact, Carina Ufer c.ufer@uke.de.

Materials availability

The dataset, code and stimuli generated in this study have been deposited online (see key resources table). No other materials were produced by this study.

Data and code availability

  • Experimental data and the stimuli have been deposited at OSF and are publicly available as of the date of publication. DOIs are listed in the key resources table.

  • The original code to reproduce the results is publicly available at OSF and the DOI is listed in the key resources table. The code for the online experiment and the code for plotting the results is available from the lead contact upon request.

  • Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

Experimental model and study participant details

We conducted a series of four online experiments, in which we investigated the influence of stimulus and choice history on the current auditory percept. Across four experiments, we recruited 179 healthy participants (experiment 1: 30 participants (17 males, 13 females, 18-60 years, Mage = 28.33, SDage = 9.26); experiment 2a: 33 participants (16 males, 17 females, 19-40 years, Mage = 29.76, SDage = 5.99), and experiment 2b: 87 participants (43 males, 44 females, 19-39 years Mage = 27.53, SDage = 5.20); experiment 3: 30 participants (19 males, 11 females, 18-57 years, Mage = 29.0, SDage = 9.63). Participants were screened to have normal hearing, no neurological or psychiatric illnesses, and German as their first language. Data from experiments 1 and 3 were existing datasets from previous experiments.46 All participants were recruited via the online platform Prolific and were rewarded 7.50 £ per hour for experiments 1, 3, 2a, and inflation-adjusted 12.00 £ per hour for experiment 2b. The sample size was based on previous work.46 Written informed consent was given by each participant at the start of the experiment. The local ethics committee approved the study.

Method details

Stimuli

Across all four experiments, we used the same audio stimuli. These were the following real word pairs in German, that phonetically only differ between the central ‘u’ or ‘o’ vowel:

(Brut / Brot (brood / bread); Bug / Bog (bow / bent); Fuhr / Vor (rode / before); Huhn / Hohn (chicken / derision); Kur / Chor (cure / choir); Ruhm / Rom (glory / Rome); Schuh / Show (shoe / show); Tut / Tot (toot / dead); Zug / Zog (train / drew).

The mean duration of the stimuli was 642 ms (SD = 67 ms). Between the u/o endpoints, morphs were created in 5 steps using the STRAIGHT vocoder in MATLAB, keeping the pitch contour constant at the value of the /u/ endpoint. All audio stimuli were recorded by a professional female speaker. To create the average voice sound based on variance in the population we adjusted the average pitch of the initial recording to 170 Hz and modified the formant frequencies of the original voice by reducing them by 7%. To create the four additional speaker identities, the average speaker's voice was shifted up or down in either pitch or formant frequencies using Praat's "change gender" routine. The shift factors were 1.146 or 1/1.146 = 0.873 for the high and low formant frequency speakers, and 1.24 or 1/1.24 = 0.806 for the high and low pitch speakers, respectively. These factors were chosen with a pitch/formant frequency ratio of approximately 1.6 which is the relative importance of these factors when determining different voice identities.87 The differences between each of the four created voices to the average speaker were chosen to induce reliable percepts of different speakers, i.e., less than 25% formant frequency difference and 45% pitch difference.87 In general, the pitch variation was reduced with a factor of 0.2, to increase the perceived similarity within test stimuli.

Apparatus

We conducted a series of four online experiments in which participants listened to sequences of spoken words via headphones (which was tested via a headphone test at the beginning of each experiment88). Overall experiments, participants indicated their responses by pressing the respective keyboard key (‘u’ / ‘o’). If participants did not respond in time, a 1500 ms long prompt informed them that they had been too slow. After each auditory stimulus, there was a minimum of 500 ms of silence up to the maximum response time.

Experiment 1 - High speaker variability

In experiment 1, the spoken words were presented in random order with the two constraints that the same speaker could not be followed by itself and the same word could not appear consecutively by two different speakers. For each participant, there were 828 trials, resulting from 9 words x 4 repetitions for the 4 shifted speakers with 5 morph levels per word, and for the average speaker with 3 morph levels per word. Thus the probability of occurrence for the average speaker was 13%. On the screen, participants were presented with the response options ‘u / o’ each trial for which they were given 5000 ms time to answer. The experiment was split into 8 blocks with a break of 15 s in between. On average, participants took 21.39 min (range 18.19 to 25.54 min) to complete all experimental trials, without the time allocated for reading and accepting the informed consent and undergoing auditory checks at the beginning of the experiment.

Experiment 2 a + b - Intermediate speaker variability

Experiment 2a

Participants were presented with sequences of words of varying length (7, 8, or 9 words) by the same shifted speaker, however, one word per sequence was exchanged to be spoken by the average speaker. The average speaker was presented at every position once (except the first one to establish context). Each sequence length was presented six times per speaker. This led to a total of 576 trials ((7+8+9) words x 6 repetitions x 4 speakers) per participant, of which 72 were spoken by the average speaker. Thus the probability of a word spoken by the average speaker was 12.5%. Because of the few trials, we restricted the average speaker words to be the most ambiguous 50% morph level. The order of the speakers was pseudo-randomized for each participant and constrained such that between sequences the same speaker could not be followed by itself. Within a sequence, each word could only appear once, and between sequences the same word could not be repeated. Each trial prompted a response and lasted 2500 ms because in experiment 1 participants' response times very rarely exceeded 2000 ms.

During auditory word presentation, one of four faces was shown to be associated with each speaker randomly. To further ensure participants' engagement, we additionally introduced 8 catch trials in which participants had to respond with the space key to a red dot on the screen, which in return disappeared again. If more than 3 catch trials were missed, the experiment would terminate automatically, which did not happen for any participant. On average, participants took 23.25 min (range 20.50 - 27.36 min) to complete all experimental trials.

Experiment 2b

Experiment 2b was equivalent to experiment 2a except that the sequence length was fixed to 7 words. This led to a total of 504 trials (7 words per sequence x 18 sequences x 4 speakers) each prompting a response. Over the course of the experiment, the average speaker was presented at each of the 7 sequence positions at least twice, and at 4 positions three times, resulting in 18 trials per speaker. On average, there was approximately a 14% chance of a word being spoken by the average speaker. The order of the speakers was pseudo-randomized over the experiment. Participants took on average 21.46 (range 18.52 - 28.42 min) to complete all experimental trials.

We merged both sequence experiments 2a and 2b because they were based on identical tasks and procedures. There were no differences in the direction of the repulsive stimulus history effects and the attractive choice history effect and also the magnitude of the effects was comparable for all parameters (all p > 0.15) except for the history effect of the pitch, which was stronger for experiment 2b (b ± SE = 0.14 ± 0.06, p = 0.02).

In the two sequence experiments, we presented images of faces during the spoken words to address a different research question of another study which is reported here for transparency only. One of four faces was shown to be associated with each speaker, assigned randomly across participants. In the middle of both experiments, participants completed a face-voice association training, which was part of the research question of another study and is reported here for transparency only. During the face-voice association training participants listened to the audiobook “Alice im Wunderland” ("Alice in Wonderland”) read by the four speakers. In the first part, they listened passively while in the second active part, they had to actively indicate which speaker they were currently listening to via a mouse click and received feedback. At the end of the experiment, a test of whether participants learned the face-voice associations was conducted. This training did not affect the serial dependence effects of the current study. We assessed if there were differences in serial dependencies from before to after the face-voice association training with an extended GLMM. There were no differences between the two parts that could not be explained better by a linear time trend over all blocks of the experiments (both BIC and AIC > 280). Even though there were small significant differences throughout the experiment attenuating the serial effect that the previous morph and the previous choice had on the current trial, the estimated effect of the previous morph at the end of the experiment was still negative (b ± SE = -0.11 ± 0.04, CI [-0.19 -0.04], z = -2.89) and the estimated effect of previous choice at end of the experiment was still positive (0.36 ± 0.04 [0.29 0.44], z = 9.11, both p < 0.01), the pattern of repulsive sensory history and attractive choice history remained stable.

Experiment 3 - Low speaker variability

In total, there were 828 trials for each participant, resulting from 9 words x 4 shifted speakers x 5 morph levels x 4 repetitions (720), and additional for the average speaker 9 words x 3 morph levels x 4 repetitions (108). The experiment was split into four blocks based on the four speakers, each comprising a total of 207 trials, of which 180 word stimuli were spoken by the shifted speaker, randomly interspersed with 27 words spoken by the average speaker. Thus the probability of a word spoken by the average speaker was 13%. Neither two words by the average speaker, nor the same word could repeat itself. Note here that with this experimental setup, the probability of a word being spoken by the average speaker was nearly identical over the three variability conditions across the three experimental setups and only the speaker context changed. In the middle and between each block there was a 15 s break. For every trial, participants were presented with the question ‘u / o’ for which they had 5000 ms time to answer. They took on average 22.21 min (range 19.95 - 25.67 min) to complete all experimental trials.

Quantification and statistical analysis

All statistical analyses were performed using RStudio version 2023.6.0.421 with the R version 4.2.089 and the lme4 package for generalised linear mixed models.86

Participants' responses were manually checked by counting their consecutively repeating or alternating choice patterns and by plotting psychometric functions. If a random response pattern was obvious, participants were confronted with it via a direct message and could voluntarily return their submission. In total, 7 participants withdrew their participation. We excluded one additional participant who responded 34 times before the audio stimulus ended, thus indicating low effort.

Before analyses, participants’ responses were cleaned for outliers. We considered trials in which participants responded under or over 3 SD of their mean reaction time as outliers, which led to the exclusion of a maximal 4% of trials per participant.

As the responses were binary a logistic linking function was specified. The response variable was coded as 1 if ‘o’ was answered and 0 if ‘u’ was answered. We hypothesised that the responses were influenced by the morph level, the pitch, and the formant frequencies of the voice of the current stimulus, but crucially also by the previous morph level, the pitch, and the formant frequencies of the voice of the previous stimulus, and the previous response. A similar analysis approach of using GLMMs to investigate serial dependence in the visual domain was recently applied.13,78 In accordance with the literature, we followed a maximal approach including as many random effects per participant as possible while retaining model convergence.90 We first included participant random slopes for all fixed effects and their interactions and reduced the model stepwise by prioritising the fixed effects of interest until the model successfully converged. This approach led to the following formula:

Response'o'morphlevel×context+pitchofthevoice×context+formantfrequenciesofthevoice×context+pitchofthevoiceprev×context+formantfrequenciesofthevoiceprev×context+morphlevelprev×context+choiceprev×context+1+morphlevel|word+1+morphlevel+pitchofthevoiceprev+formantfrequenciesofthevoiceprev+morphlevelprev+choiceprev|participant

All fixed effect predictors, except the factorial context predictor, were rescaled and centred to fall within the range from -1 to 1 to ensure comparable effect sizes.91 The context variable was dummy-coded and the blocked speaker condition was defined as the reference level. For all post hoc tests and for the estimation of 95% confidence intervals (corresponding to a significance level of p = 0.05) and p-values the emmeans package was used.92 As a second step, contrasts over the GLMM parameter estimates were applied to characterise the magnitude of the effect over the different contexts. The linear and quadratic effect of the parameter estimates obtained from the GLMM was assessed using custom contrasts (linear: [-1, 0, 1] or quadratic [-0.5, 1, -0.5]) on the estimated marginal means per experiment.

For model comparison, four alternative models to the full model were tested: (1) the stimulus model which contained information about the current stimulus, (2) the previous choice model with the previous choice as the only predictor, (3) the model which contained additional to the previous choice the current stimulus features as a predictor, and (4) the previous stimulus model which included predictors about the current and the previous stimulus features. For comparability reasons, we kept the random effects structure similar across models and therefore also fitted the full serial dependence model (5) with the reduced random effects structure.

Responseomorphlevel*context+pitchofthevoice*context+formantfrequenciesofthevoice*context+(1|word)+(1|participant) (Equation 1)
Responseopreviouschoice*context+(1|word)+(1|participant) (Equation 2)
Responseomorphlevel*context+pitchofthevoice*context+formantfrequenciesofthevoice*context+previouschoice*context+(1|word)+(1|participant) (Equation 3)
Responseomorphlevel*context+pitchofthevoice*context+formantfrequenciesofthevoice*context+previousmorphlevel*context+pitchofpreviousvoice*context+formantfrequenciesofpreviousvoice*context+(1|word)+(1|participant) (Equation 4)
Responseomorphlevel*context+pitchofthevoice*context+formantfrequenciesofthevoice*context+previousmorphlevel*context+pitchofpreviousvoice*context+formantfrequenciesofpreviousvoice*context+previouschoice*context+(1|word)+(1|participant) (Equation 5)

For model comparison, we chose to compare both, Akaike information criterion (AIC) and Bayesian information criterion (BIC), because both information criteria may favour different models as the BIC, different from the AIC, penalises more complex models additional to the number of predictors for the number of observations, thereby preferring simpler models over more complex ones compared to the AIC. The difference in BIC and AIC were interpreted in line with the guidelines for model selection so that a difference > 10 was judged to be very strong evidence in favour of the model with the smaller value.47

We conducted an additional analysis to test for differences in the dependence on the previous choice in the intermediate speaker variability context within a sequence versus between sequences. We filtered out the corresponding data and added the interaction between the previous choice and the factor variable whether the current decision was within or between sequences. Again we followed the maximal approach for model fitting, which resulted in the following model:

Response'o'morphlevel+pitchofthevoice+formantfrequenciesofthevoice+morphlevelprev+pitchofthevoiceprev+formantfrequenciesofthevoiceprev+choiceprev×within/betweensequnece+1+morphlevel|word+1+morphlevel+morphlevelprev+choiceprev|participant

Acknowledgments

This work is funded by the Emmy Noether program of the Deutsche Forschungsgemeinschaft (German Research Foundation; Grant No DFG BL 1736/1-1 to H.B.). We acknowledge financial support from the Open Access Publication Fund of UKE - Universitätsklinikum Hamburg-Eppendorf. We thank Fabian Schneider, Janika Becker, and four anonymous reviewers for their valuable feedback which significantly contributed to the improvement of the manuscript.

Author contributions

Conceptualization, H.B. and C.U.; Methodology, H.B. and C.U.; Formal Analysis, C.U.; Investigation, C.U.; Writing – Original Draft, C.U.; Writing – Review and Editing, H.B. and C.U.; Funding Acquisition, H.B.; Supervision, H.B.

Declaration of interests

The authors declare no competing interests.

Declaration of generative AI and AI-assisted technologies

During the preparation of this work the authors used ChatGPT in order to improve language and readability. After using the tool, the authors reviewed and edited the content as needed and take full responsibility for the content of the publication.

Published: July 30, 2024

References

  • 1.Cicchini G.M., Mikellidou K., Burr D.C. Serial Dependence in Perception. Annu. Rev. Psychol. 2024;75:129–154. doi: 10.1146/annurev-psych-021523-104939. [DOI] [PubMed] [Google Scholar]
  • 2.Manassi M., Murai Y., Whitney D. Serial dependence in visual perception: A meta-analysis and review. J. Vis. 2023;23:18. doi: 10.1167/jov.23.8.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Manassi M., Whitney D. Continuity fields enhance visual perception through positive serial dependence. Nat. Rev. Psychol. 2024;3:352–366. doi: 10.1038/s44159-024-00297-x. [DOI] [Google Scholar]
  • 4.Pascucci D., Tanrikulu Ö.D., Ozkirli A., Houborg C., Ceylan G., Zerr P., Rafiei M., Kristjánsson Á. Serial dependence in visual perception: A review. J. Vis. 2023;23:9. doi: 10.1167/jov.23.1.9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Gori M., Sandini G., Burr D. Development of Visuo-Auditory Integration in Space and Time. Front. Integr. Neurosci. 2012;6 doi: 10.3389/fnint.2012.00077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Fornaciai M., Park J. Serial dependence generalizes across different stimulus formats, but not different sensory modalities. Vis. Res. 2019;160:108–115. doi: 10.1016/j.visres.2019.04.011. [DOI] [PubMed] [Google Scholar]
  • 7.Zhang H., Luo H. Feature-specific reactivations of past information shift current neural encoding thereby mediating serial bias behaviors. PLoS Biol. 2023;21 doi: 10.1371/journal.pbio.3002056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Cicchini G.M., Anobile G., Burr D.C. Compressive mapping of number to space reflects dynamic encoding mechanisms, not static logarithmic transform. Proc. Natl. Acad. Sci. USA. 2014;111:7867–7872. doi: 10.1073/pnas.1402785111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Cicchini G.M., Mikellidou K., Burr D.C. The functional role of serial dependence. Proc. Biol. Sci. 2018;285 doi: 10.1098/rspb.2018.1722. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Fischer J., Whitney D. Serial dependence in visual perception. Nat. Neurosci. 2014;17:738–743. doi: 10.1038/nn.3689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Bae G.-Y., Luck S.J. Interactions between visual working memory representations. Atten. Percept. Psychophys. 2017;79:2376–2395. doi: 10.3758/s13414-017-1404-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Bae G.-Y., Luck S.J. Reactivation of Previous Experiences in a Working Memory Task. Psychol. Sci. 2019;30:587–595. doi: 10.1177/0956797619830398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Bosch E., Fritsche M., Ehinger B.V., de Lange F.P. Opposite effects of choice history and evidence history resolve a paradox of sequential choice bias. J. Vis. 2020;20:9. doi: 10.1167/jov.20.12.9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Fritsche M., Mostert P., de Lange F.P. Opposite Effects of Recent History on Perception and Decision. Curr. Biol. 2017;27:590–595. doi: 10.1016/j.cub.2017.01.006. [DOI] [PubMed] [Google Scholar]
  • 15.Benucci A., Saleem A.B., Carandini M. Adaptation maintains population homeostasis in primary visual cortex. Nat. Neurosci. 2013;16:724–729. doi: 10.1038/nn.3382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Grill-Spector K., Henson R., Martin A. Repetition and the brain: neural models of stimulus-specific effects. Trends Cognit. Sci. 2006;10:14–23. doi: 10.1016/j.tics.2005.11.006. [DOI] [PubMed] [Google Scholar]
  • 17.O’Mahony M. Sensory Adaptation. J. Sensory Stud. 1986;1:237–258. doi: 10.1111/j.1745-459X.1986.tb00176.x. [DOI] [Google Scholar]
  • 18.Pérez-González D., Malmierca M. Adaptation in the auditory system: an overview. Front. Integr. Neuroscience (San Diego, CA, U. S.) 2014;8 doi: 10.3389/fnint.2014.00019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Van der Burg E., Toet A., Brouwer A.-M., van Erp J.B.F. Sequential Effects in Odor Perception. Chemosens. Percept. 2022;15:19–25. doi: 10.1007/s12078-021-09290-7. [DOI] [Google Scholar]
  • 20.Alais D., Fernández Folgueiras U., Leung J. A common mechanism processes auditory and visual motion. J. Vis. 2018;18:1135. doi: 10.1167/18.10.1135. [DOI] [Google Scholar]
  • 21.Burr D., Ross J. A Visual Sense of Number. Curr. Biol. 2008;18:425–428. doi: 10.1016/j.cub.2008.02.052. [DOI] [PubMed] [Google Scholar]
  • 22.Manassi M., Liberman A., Kosovicheva A., Zhang K., Whitney D. Serial dependence in position occurs at the time of perception. Psychon. Bull. Rev. 2018;25:2245–2253. doi: 10.3758/s13423-018-1454-5. [DOI] [PubMed] [Google Scholar]
  • 23.Taubert J., Alais D., Burr D. Different coding strategies for the perception of stable and changeable facial attributes. Sci. Rep. 2016;6 doi: 10.1038/srep32239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Manassi M., Whitney D. Illusion of visual stability through active perceptual serial dependence. Sci. Adv. 2022;8:eabk2480. doi: 10.1126/sciadv.abk2480. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Taubert J., Van Der Burg E., Alais D. Love at second sight: Sequential dependence of facial attractiveness in an on-line dating paradigm. Sci. Rep. 2016;6 doi: 10.1038/srep22740. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Van Der Burg E., Toet A., Abbasi Z., Brouwer A.M., Van Erp J.B.F., Kallen V.L., Kaneko D., Kim Y., Kinnear M., De Kock H.L., et al. Sequential dependency for affective appraisal of food images. Humanit. Soc. Sci. Commun. 2021;8:228. doi: 10.1057/s41599-021-00909-4. [DOI] [Google Scholar]
  • 27.Kim S., Burr D., Alais D. Attraction to the recent past in aesthetic judgments: A positive serial dependence for rating artwork. J. Vis. 2019;19:19. doi: 10.1167/19.12.19. [DOI] [PubMed] [Google Scholar]
  • 28.Blondé P., Kristjánsson Á., Pascucci D. Tuning perception and decisions to temporal context. iScience. 2023;26 doi: 10.1016/j.isci.2023.108008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Bliss D.P., Sun J.J., D’Esposito M. Serial dependence is absent at the time of perception but increases in visual working memory. Sci. Rep. 2017;7 doi: 10.1038/s41598-017-15199-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Moon J., Kwon O.S. Attractive and repulsive effects of sensory history concurrently shape visual perception. BMC Biol. 2022;20:247. doi: 10.1186/s12915-022-01444-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Pascucci D., Mancuso G., Santandrea E., Della Libera C., Plomp G., Chelazzi L. Laws of concatenated perception: Vision goes for novelty, decisions for perseverance. PLoS Biol. 2019;17 doi: 10.1371/journal.pbio.3000144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Sheehan T.C., Serences J.T. Attractive serial dependence overcomes repulsive neuronal adaptation. PLoS Biol. 2022;20 doi: 10.1371/journal.pbio.3001711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Suárez-Pinilla M., Seth A.K., Roseboom W. Serial dependence in the perception of visual variance. J. Vis. 2018;18:4. doi: 10.1167/18.7.4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Eckert A.-L., Gounitski Y., Guggenmos M., Sterzer P. Cross-Modality Evidence for Reduced Choice History Biases in Psychosis-Prone Individuals. Schizophr. Bull. 2023;49:397–406. doi: 10.1093/schbul/sbac168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Sadil P., Cowell R.A., Huber D.E. The push–pull of serial dependence effects: Attraction to the prior response and repulsion from the prior stimulus. Psychon. Bull. Rev. 2024;31:259–273. doi: 10.3758/s13423-023-02320-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Kay R.H., Matthews D.R. On the existence in human auditory pathways of channels selectively tuned to the modulation present in frequency-modulated tones. J. Physiol. 1972;225:657–677. doi: 10.1113/jphysiol.1972.sp009962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Ladefoged P., Broadbent D.E. Information Conveyed by Vowels. J. Acoust. Soc. Am. 1957;29:98–104. doi: 10.1121/1.1908694. [DOI] [PubMed] [Google Scholar]
  • 38.Shu Z.J., Swindale N.V., Cynader M.S. Spectral motion produces an auditory after-effect. Nature. 1993;364:721–723. doi: 10.1038/364721a0. [DOI] [PubMed] [Google Scholar]
  • 39.Summerfield Q., Haggard M., Foster J., Gray S. Perceiving vowels from uniform spectra: Phonetic exploration of an auditory aftereffect. Percept. Psychophys. 1984;35:203–213. doi: 10.3758/BF03205933. [DOI] [PubMed] [Google Scholar]
  • 40.Stilp C. Acoustic context effects in speech perception. WIREs Cogn. Sci. 2020;11 doi: 10.1002/wcs.1517. [DOI] [PubMed] [Google Scholar]
  • 41.Arzounian D., de Kerangal M., de Cheveigné A. Sequential dependencies in pitch judgments. J. Acoust. Soc. Am. 2017;142:3047. doi: 10.1121/1.5009938. [DOI] [PubMed] [Google Scholar]
  • 42.Motala A., Zhang H., Alais D. Auditory Rate Perception Displays a Positive Serial Dependence. Percept. 2020;11 doi: 10.1177/2041669520982311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Schlunegger D., Mast F.W. Probabilistic integration of preceding responses explains response bias in perceptual decision making. iScience. 2023;26 doi: 10.1016/j.isci.2023.107123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Kreiman J., Lee Y., Garellek M., Samlan R., Gerratt B.R. Validating a psychoacoustic model of voice quality. J. Acoust. Soc. Am. 2021;149:457–465. doi: 10.1121/10.0003331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Gao Y., Ding H., Birkholz P., Lin Y. Comparing fundamental frequency of German vowels produced by German native speakers and Mandarin Chinese learners. JASA Express Lett. 2021;1 doi: 10.1121/10.0005593. [DOI] [PubMed] [Google Scholar]
  • 46.Krumbiegel J., Ufer C., Blank H. Influence of voice properties on vowel perception depends on speaker context. J. Acoust. Soc. Am. 2022;152:820–834. doi: 10.1121/10.0013363. [DOI] [PubMed] [Google Scholar]
  • 47.Raftery A.E. Bayesian model selection in social research. Socio. Methodol. 1995;25:111–163. [Google Scholar]
  • 48.Kim S., Burr D., Cicchini G.M., Alais D. Serial dependence in perception requires conscious awareness. Curr. Biol. 2020;30:R257–R258. doi: 10.1016/j.cub.2020.02.008. [DOI] [PubMed] [Google Scholar]
  • 49.Murai Y., Whitney D. Serial dependence revealed in history-dependent perceptual templates. Curr. Biol. 2021;31:3185–3191.e3. doi: 10.1016/j.cub.2021.05.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Li B., Wang B., Zaidel A. Modality-specific sensory and decisional carryover effects in duration perception. BMC Biol. 2023;21:48. doi: 10.1186/s12915-023-01547-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Cicchini G.M., Mikellidou K., Burr D. Serial dependencies act directly on perception. J. Vis. 2017;17:6. doi: 10.1167/17.14.6. [DOI] [PubMed] [Google Scholar]
  • 52.Lau W.K., Maus G.W. Visual serial dependence in an audiovisual stimulus. J. Vis. 2019;19:20. doi: 10.1167/19.13.20. [DOI] [PubMed] [Google Scholar]
  • 53.Kristensen S., Fracasso A., Dumoulin S.O., Almeida J., Harvey B.M. Size constancy affects the perception and parietal neural representation of object size. Neuroimage. 2021;232 doi: 10.1016/j.neuroimage.2021.117909. [DOI] [PubMed] [Google Scholar]
  • 54.Feigin H., Baror S., Bar M., Zaidel A. Perceptual decisions are biased toward relevant prior choices. Sci. Rep. 2021;11:648. doi: 10.1038/s41598-020-80128-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Moon J., Tadin D., Kwon O.S. A key role of orientation in the coding of visual motion direction. Psychon. Bull. Rev. 2023;30:564–574. doi: 10.3758/s13423-022-02181-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Pascucci D., Plomp G. Serial dependence and representational momentum in single-trial perceptual decisions. Sci. Rep. 2021;11:9910. doi: 10.1038/s41598-021-89432-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Togoli I., Fedele M., Fornaciai M., Bueti D. Serial dependence in time and numerosity perception is dimension-specific. J. Vis. 2021;21:6. doi: 10.1167/jov.21.5.6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.St. John-Saaltink E., Kok P., Lau H.C., de Lange F.P. Serial Dependence in Perceptual Decisions Is Reflected in Activity Patterns in Primary Visual Cortex. J. Neurosci. 2016;36:6186–6192. doi: 10.1523/JNEUROSCI.4390-15.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Anstis S., Saida S. Adaptation to auditory streaming of frequency-modulated tones. J. Exp. Psychol. Hum. Percept. Perform. 1985;11:257–271. doi: 10.1037/0096-1523.11.3.257. [DOI] [Google Scholar]
  • 60.Huang J., Holt L.L. Listening for the Norm: Adaptive Coding in Speech Categorization. Front. Psychol. 2012;3 doi: 10.3389/fpsyg.2012.00010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Rakowski A., Hirsh I.J. Poststimulatory pitch shifts for pure tones. J. Acoust. Soc. Am. 1980;68:467–474. doi: 10.1121/1.384759. [DOI] [PubMed] [Google Scholar]
  • 62.Webster M.A. Evolving concepts of sensory adaptation. F1000 Biol. Rep. 2012;4 doi: 10.3410/B4-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Strong S.P., Koberle R., de Ruyter van Steveninck R.R., Bialek W. Entropy and Information in Neural Spike Trains. Phys. Rev. Lett. 1998;80:197–200. doi: 10.1103/PhysRevLett.80.197. [DOI] [Google Scholar]
  • 64.Lee C.Y., Zhang Y. Processing Speaker Variability in Repetition and Semantic/Associative Priming. J. Psycholinguist. Res. 2015;44:237–250. doi: 10.1007/s10936-014-9307-5. [DOI] [PubMed] [Google Scholar]
  • 65.Abrahamyan A., Silva L.L., Dakin S.C., Carandini M., Gardner J.L. Adaptable history biases in human perceptual decisions. Proc. Natl. Acad. Sci. USA. 2016;113:E3548–E3557. doi: 10.1073/pnas.1518786113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Braun A., Urai A.E., Donner T.H. Adaptive History Biases Result from Confidence-Weighted Accumulation of past Choices. J. Neurosci. 2018;38:2418–2429. doi: 10.1523/JNEUROSCI.2189-17.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Hardman K.O., Cowan N. Remembering complex objects in visual working memory: Do capacity limits restrict objects or features? J. Exp. Psychol. Learn. Mem. Cogn. 2015;41:325–347. doi: 10.1037/xlm0000031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Miller G.A. The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychol. Rev. 1956;63:81–97. doi: 10.1037/h0043158. [DOI] [PubMed] [Google Scholar]
  • 69.Sheehan T., Carfano B., Serences J. Serial dependence to prior stimuli and past responses. J. Vis. 2022;22:4401. doi: 10.1167/jov.22.14.4401. [DOI] [Google Scholar]
  • 70.Samaha J., Switzky M., Postle B.R. Confidence boosts serial dependence in orientation estimation. J. Vis. 2019;19:25. doi: 10.1167/19.4.25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Hajonides J.E., van Ede F., Stokes M.G., Nobre A.C., Myers N.E. Multiple and Dissociable Effects of Sensory History on Working-Memory Performance. J. Neurosci. 2023;43:2730–2740. doi: 10.1523/JNEUROSCI.1200-22.2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Schwiedrzik C.M., Ruff C.C., Lazar A., Leitner F.C., Singer W., Melloni L. Untangling Perceptual Memory: Hysteresis and Adaptation Map into Separate Cortical Networks. Cerebr. Cortex. 2014;24:1152–1164. doi: 10.1093/cercor/bhs396. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Urai A.E., Donner T.H. Persistent activity in human parietal cortex mediates perceptual choice repetition bias. Nat. Commun. 2022;13:6015. doi: 10.1038/s41467-022-33237-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Barbosa J., Stein H., Martinez R.L., Galan-Gadea A., Li S., Dalmau J., Adam K.C.S., Valls-Solé J., Constantinidis C., Compte A. Interplay between persistent activity and activity-silent dynamics in the prefrontal cortex underlies serial biases in working memory. Nat. Neurosci. 2020;23:1016–1024. doi: 10.1038/s41593-020-0644-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Luo J., Collins T. The Representational Similarity between Visual Perception and Recent Perceptual History. J. Neurosci. 2023;43:3658–3665. doi: 10.1523/JNEUROSCI.2068-22.2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Ranieri G., Benedetto A., Ho H.T., Burr D.C., Morrone M.C. Evidence of Serial Dependence from Decoding of Visual Evoked Potentials. J. Neurosci. 2022;42:8817–8825. doi: 10.1523/JNEUROSCI.1879-21.2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Siegle J.H., Jia X., Durand S., Gale S., Bennett C., Graddis N., Heller G., Ramirez T.K., Choi H., Luviano J.A., et al. Survey of spiking in the mouse visual system reveals functional hierarchy. Nature. 2021;592:86–92. doi: 10.1038/s41586-020-03171-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Bosch E., Fritsche M., Utzerath C., Buitelaar J.K., de Lange F.P. Adaptation and serial choice bias for low-level visual features are unaltered in autistic adolescents. J. Vis. 2022;22:1. doi: 10.1167/jov.22.6.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Zhou L., Liu Y., Jiang Y., Wang W., Xu P., Zhou K. The distinct development of stimulus and response serial dependence. Psychon. Bull. Rev. 2024 doi: 10.3758/s13423-024-02474-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Dong D., Atick J. Statistics of natural time-varying images. Netw. Comput. Neural Syst. 1995;6:345–358. doi: 10.1088/0954-898X/6/3/003. [DOI] [Google Scholar]
  • 81.Stern Y., Ben-Yehuda I., Koren D., Zaidel A., Salomon R. The dynamic boundaries of the Self: Serial dependence in the Sense of Agency. Cortex. 2022;152:109–121. doi: 10.1016/j.cortex.2022.03.015. [DOI] [PubMed] [Google Scholar]
  • 82.Hachen I., Reinartz S., Brasselet R., Stroligo A., Diamond M.E. Dynamics of history-dependent perceptual judgment. Nat. Commun. 2021;12:6036. doi: 10.1038/s41467-021-26104-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.McMurray B. The myth of categorical perception. J. Acoust. Soc. Am. 2022;152:3819–3842. doi: 10.1121/10.0016614. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Apfelbaum K.S., Kutlu E., McMurray B., Kapnoula E.C. Don’t force it! Gradient speech categorization calls for continuous categorization tasks. J. Acoust. Soc. Am. 2022;152:3728–3745. doi: 10.1121/10.0015201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Zhang H., Alais D. Individual difference in serial dependence results from opposite influences of perceptual choices and motor responses. J. Vis. 2020;20:2. doi: 10.1167/jov.20.8.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Bates D., Mächler M., Bolker B., Walker S. Fitting Linear Mixed-Effects Models Using lme4. J. Stat. Software. 2015;67:1–48. doi: 10.18637/jss.v067.i01. [DOI] [Google Scholar]
  • 87.Gaudrain E., Li S., Ban V.S., Patterson R.D. The role of glottal pulse rate and vocal tract length in the perception of speaker identity. Interspeech; 2009. pp. 148–151. [DOI] [Google Scholar]
  • 88.Woods K.J.P., Siegel M.H., Traer J., McDermott J.H. Headphone screening to facilitate web-based auditory experiments. Atten. Percept. Psychophys. 2017;79:2064–2072. doi: 10.3758/s13414-017-1361-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.R Core Team . R Foundation for Statistical Computing; 2022. R: A Language and Environment for Statistical Computing. [Google Scholar]
  • 90.Barr D.J., Levy R., Scheepers C., Tily H.J. Random effects structure for confirmatory hypothesis testing: Keep it maximal. J. Mem. Lang. 2013;68:255–278. doi: 10.1016/j.jml.2012.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Field A.P., Wright D.B. A Primer on Using Multilevel Models in Clinical and Experimental Psychopathology Research. J. Exp. Psychopathol. 2011;2:271–293. doi: 10.5127/jep.013711. [DOI] [Google Scholar]
  • 92.Lenth R.V. 2023. Emmeans: Estimated Marginal Means, Aka Least-Squares Means. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

  • Experimental data and the stimuli have been deposited at OSF and are publicly available as of the date of publication. DOIs are listed in the key resources table.

  • The original code to reproduce the results is publicly available at OSF and the DOI is listed in the key resources table. The code for the online experiment and the code for plotting the results is available from the lead contact upon request.

  • Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.


Articles from iScience are provided here courtesy of Elsevier

RESOURCES