Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2023 Feb 3;18(2):e0276691. doi: 10.1371/journal.pone.0276691

Rhythmic tapping difficulties in adults who stutter: A deficit in beat perception, motor execution, or sensorimotor integration?

Anneke Slis 1, Christophe Savariaux 1, Pascal Perrier 1, Maëva Garnier 1,*
Editor: Jessica Adrienne Grahn2
PMCID: PMC9897587  PMID: 36735662

Abstract

Objectives

The study aims to better understand the rhythmic abilities of people who stutter and to identify which processes potentially are impaired in this population: (1) beat perception and reproduction; (2) the execution of movements, in particular their initiation; (3) sensorimotor integration.

Material and method

Finger tapping behavior of 16 adults who stutter (PWS) was compared with that of 16 matching controls (PNS) in five rhythmic tasks of various complexity: three synchronization tasks ― a simple 1:1 isochronous pattern, a complex non-isochronous pattern, and a 4 tap:1 beat isochronous pattern ―, a reaction task to an aperiodic and unpredictable pattern, and a reproduction task of an isochronous pattern after passively listening.

Results

PWS were able to reproduce an isochronous pattern on their own, without external auditory stimuli, with similar accuracy as PNS, but with increased variability. This group difference in variability was observed immediately after passive listening, without prior motor engagement, and was not enhanced or reduced after several seconds of tapping. Although PWS showed increased tapping variability in the reproduction task as well as in synchronization tasks, this timing variability did not correlate significantly with the variability in reaction times or tapping force.

Compared to PNS, PWS exhibited larger negative mean asynchronies, and increased synchronization variability in synchronization tasks. These group differences were not affected by beat hierarchy (i.e., “strong” vs. “weak” beats), pattern complexity (non-isochronous vs. isochronous) or presence versus absence of external auditory stimulus (1:1 vs. 1:4 isochronous pattern). Differences between PWS and PNS were not enhanced or reduced with sensorimotor learning, over the first taps of a synchronization task.

Conclusion

Our observations support the hypothesis of a deficit in neuronal oscillators coupling in production, but not in perception, of rhythmic patterns, and a larger delay in multi-modal feedback processing for PWS.

1 Introduction

Stuttering is a neuro-motor disorder [1,2], characterized by episodes of disfluent speech, containing repeated, extended or blocked sounds, and disrupted rhythmic flow [3]. These perceptual disfluencies have been related to quantitative differences in respiratory, glottal and articulatory behavior of people who stutter (PWS), compared to typical individuals [47]. Significant differences in movement duration, movement timing and reaching accuracy have also been reported in upper limb and non-speech orofacial movements [811]. Compared to typical speakers, PWS show larger variability and disrupted timing across and within moving components, such as limbs and articulators [1217], suggesting a timing deficit.

Although the etiology of stuttering is not fully understood yet, evidence suggests that stuttering is related to dysfunctional dopamine receptors and a disrupted basal ganglia-thalamo-cortical network, affecting both motor control and time processing [18,19]. The hypothesis that speech disfluencies in stuttering are caused by a timing deficit [10,20,21] has been explored, behaviorally, by means of finger tapping tasks. In paced tapping tasks, i.e., when tapping in synchrony with an external metronome or musical excerpt [10,20], some of these studies reported a greater tapping variability in PWS, compared to people who do not stutter (PNS) [21,22]. In addition, when tapping along with a metronome marking a simple isochronous sequence, PWS tend to tap more ahead of the beat, i.e., they show a greater “Negative Mean Asynchrony” (NMA) [10,20]. However, another study failed to reveal any difference in variability during sequences of paced tapping tasks [23].

The observed differences in movement behavior potentially originate from deficits at more than one level, since paced tapping involves multiple simultaneous processes, such as the skill to perceive a periodic beat, the capacity to initiate and execute movements to reproduce that beat, and the ability to monitor and update movement timing on-line, using sensory feedback. The sections below define these different processes in more detail, review already available knowledge concerning their possible impairment in PWS and highlight some unresolved issues that still need to be addressed.

1.1 Motor delays and variability in the execution of movements

First, evidence suggests that the inaccurate tapping of PWS originates from difficulties at the motor execution stage [24,25], in particular with regard to initiating and sequencing of movements [3], which is also in accordance with a deficient Basal Ganglia [1,18]. Indeed, several studies have observed longer voice reaction times in PWS [26,27] as well as longer reaction times in non-speech tasks involving finger movements [26,28] (see however Reich et al. [29], who did not find significant differences in finger reaction times). In addition, Max et al. [8] reported longer movement durations, peak velocity latencies, and lower peak velocities for finger flexion. Longer durations were also observed between the peak EMG of lip muscles and the speech onset for PWS [13].

To identify difficulties at the motor execution stage, a complicating factor relates to the exact level at which motor execution hampers. Besides the possibility that muscle functioning can be impaired, another potential explanation for the observed movement variability in PWS concerns inaccurate, unstable, or insufficiently activated internal representations [2,25]. Thus, some authors suggested that PWS do not rely on a feedforward and automatized mode of motor control. Instead, they mainly rely on sensory feedback [3032], inducing additional processing delays, eventually leading to unstable movement behavior of different effectors, especially at fast rate. Supporting this idea, a greater gestural variability was observed in PWS, compared to PNS, not only in the timing of their gestures, but also in their amplitude and target [25,33,34]. Finally, stuttering frequency is also influenced by task complexity [35] and larger differences between PWS and PNS are observed when the task increases in complexity or speed [33,36,37].

One of the hypotheses is that the observed difficulties originate at the level of motor control. In this context, a first objective of the present study is to explore to what extent the increased timing variability and decreased timing accuracy of PWS is related to difficulties in motor planning and execution. In particular, we investigate whether:

  • PWS differ from PNS by increased delays and variability when initiating movements and whether these aspects correlate with the degree of tapping accuracy and consistency observed in synchronization tasks or tapping tasks without an external auditory reference.

  • PWS show a greater variability, not only in timing, but also in the strength of movements, and whether these two types of variability are correlated.

  • the possibly greater variability of PWS is even more enhanced by task complexity, which “pressures” the motor system.

1.2 Beat perception and reproduction

“Beat” perception refers to the emergence of an internal representation of periodicity when listening, seeing, or feeling a regular sequence of stimuli [3842]. One point of view, supported by several theoretical and experimental studies and encompassed under the general term “Oscillators Coupling Hypothesis”, suggests that beat perception involves the in phase tuning of endogenous neuronal oscillations in the brain [4346] (see also [47,48]) in various frequency ranges, with external physical periodic or oscillatory phenomena. Although there is still ongoing debate on this endogenous oscillator entrainment hypothesis [49,50], the observation that steady state-evoked potentials appear in the delta frequency range [0.5–4 Hz] in subjects who were passively listening to a rhythmic sequence at 2.4Hz, provides support for this hypothesis [43,44,51]. In the context of the “Active Sensing” hypothesis applied to auditory perception, Morillon et al. [52,53] have suggested that the tuning of these neuronal oscillators occurring in the delta frequency range in the auditory cortex is modulated by oscillations occurring in the same frequency range in the motor cortex. Thus, the auditory perception of external beats in the delta frequency range [0.5–4 Hz] is expected to be associated with tuned oscillations both in the auditory and motor cortices.

In the framework of coupled oscillators, several authors have suggested that the reduced synchronization accuracy and consistency of PWS during tapping tasks originate from deficient neuronal oscillator coupling affecting time perception and prediction [54,55]. At the behavioral level, coupling deficiencies between neuronal oscillator in the motor cortex are hypothesized to result in the inability to independently produce an isochronous pattern, without the support of external auditory triggers.

A deficit in the coupling mechanism of neuronal oscillations is also theorized to result in the inability to predict and anticipate a repeated periodic stimulus. Etchell et al. [56] showed that, while listening to regular pulses, typical children showed a peak in beta oscillations in the basal ganglia close to stimulus onset–interpreted as an increased attention and prediction of an event at that time–whereas children who stuttered showed a peak after the stimulus occurred. From a behavioral point of view, however, PWS demonstrated Negative Mean Asynchronies in synchronization tasks, like PNS, suggesting that they are able to anticipate external stimuli and that they are not simply “reacting” [10,20].

A subtler deficit of the coupling mechanism is also expected to result in an inaccurate, more variable, and/or drifting reproduction of the period of a previously perceived beat. Studies using synchronization-continuation tasks have reported ambiguous results, however: some showed increased tapping variability in the continuation phase for PWS, compared to PNS [57], whereas others did not observe any significant difference between both groups [21,23].

A final hypothesis is that a deficit in recovering an underlying beat likely results in increased difficulties to add and remove events within a periodic pattern, and therefore to perceive and reproduce complex rhythms, as well as meter, i.e., the hierarchical organization of a rhythmic sequence into “strong” beats and “weaker” ones (a waltz, for instance, is characterized by a triple meter, with a strong initial beat, followed by two weaker ones, whereas a march is duple-metered, with a strong beat every two beats).

In this context, a second objective of the present study is to explore the ability of PWS to perceive and reproduce an intrinsic beat. To identify the level of impairment, the study follows a “differential” and behavioral approach, comparing the performance of PWS and PNS in different rhythmic tasks of varying complexity.

We explore whether PWS differ from PNS in their ability:

  • to produce an isochronous pattern independently, without external auditory triggers.

  • to predict and anticipate the occurrence of periodic events.

  • to reproduce by themselves, without external auditory triggers, an isochronous pattern at a specific tempo, either immediately after passive listening, or after a few seconds of tapping, i.e., after engaging the motor system.

  • to perceive and reproduce higher levels of beat organization, like meter, complex non-isochronous patterns, or patterns in which certain pulses are not explicitly marked by external auditory stimuli.

1.3 On-line control of movement timing: Dealing with multi-sensory feedback

Perceiving the beat, and then reproducing it, is a first step in tapping along with an external trigger. An additional step involves correctly synchronizing movements to the beat, using sensory feedback for on-line monitoring and correcting timing errors [5860]. Resulting delays in the pathway linking motor commands and their sensory consequences need to be compensated by the individual who is tapping. A common phenomenon observed in synchronization tapping tasks is the tendency, even in typical individuals, to anticipate the beat, i.e., demonstrating a Negative Mean Asynchrony (NMA) [61]. This phenomenon is influenced by several factors, such as musical experience (NMA shorter in musicians [62]), beat rate (NMA increases when period increases [58,63]), and rhythmic complexity (NMA is reduced in non-isochronous musical excerpts, compared to an isochronous sequence [20]). The NMA also depends on feedback modalities and is reduced when direct auditory feedback is available compared to information provided by only tactile-kinesthetic feedback [64]. Aschersleben proposed that NMA reflects a slower processing and integration of tactile feedback than auditory or visual feedback [59,65]. In addition to slower processing and integration, this so-called “sensory accumulation” theory further predicts that the magnitude of auditory-tactile delay, and the resulting NMA, depends on stimulation intensity, which, in case of tapping, is hypothesized to concern the tapping force. The NMA is therefore hypothesized to decrease when tactile-kinesthetic feedback in the form of tapping force increases. In line with this, several authors suggested that the greater NMA observed in PWS is related to either a deficit in one sensory modality–in particular, a reduced kinesthetic acuity [30,66,67], or a deficit in multisensory integration [20,59].

Based on this knowledge, a third objective of the present study is to further explore the synchronization abilities of PWS, and to better understand whether:

  • the larger degree of NMA observed in PWS can be explained by a weaker tapping force, as predicted by the sensory accumulation theory.

  • other intra-individual variations in NMA, due to beat strength in particular, correlate with variations in tapping force.

  • the difference in tapping variability between PWS and PNS is similar or larger in a synchronization task than the differences observed in a tapping task without external auditory reference, reflecting possible difficulties with sensorimotor integration, additionally involved in synchronization tasks.

1.4 Influence of motor engagement and sensorimotor learning

Finally, it is uncertain to what extent the motor system influences or is intrinsically involved in timing processes. Some evidence, however, points toward this possibility. First, some brain activity is observed in motor regions during passive listening to a rhythmic pattern, without any movement [39,68,69], supporting the idea that beat perception intrinsically involves the motor system. Second, the coupling of neuronal oscillations to an external beat frequency, observed in passive listening to rhythm, is enhanced when gestures, like finger tapping, are simultaneously produced [70]. Also, a more accurate and less variable reproduction of an isochronous sequence is observed after tapping along with the pattern, compared to passively listening before tapping [71]. Altogether, these observations support the idea that people build an internal representation of the beat by detecting the periodicity in sensory inputs without actual movement, but that this internal representation is nevertheless consolidated with engaging the motor system.

In that context, the fourth objective of the present study is to explore whether the increased timing variability could be due not so much to a deficit in perceiving and reproducing a beat per se, but rather to a deficit in consolidating or updating the sense of the beat with actual motor engagement or sensorimotor learning. These questions are addressed by comparing whether the differences in tapping accuracy and consistency between PWS and PNS, on paced and unpaced tasks, are observed immediately after passive listening to a rhythmic pattern or emerge after several seconds of tapping (In this case, PWS would be expected to improve their accuracy and consistency, whereas PNS would not).

2 Material and methods

2.1 Participants

16 PWS and 16 PNS were recruited via certified speech language pathologists, word-of-mouth, and social media. The experimental and control group matched in age, gender, and musical training (see Table 1, and section A of the supplementary material, for details on musical training). All speakers were native monolingual speakers of French and did not report any hearing, speaking, voice, or language problems other than developmental stuttering for the experimental group. The project was approved by the local ethics committee of the University Grenoble Alpes (IRB00010290-2018-10-16-54).

Table 1. Female and male (F, M) people who stutter (PWS) and not stutter (PNS), with “age” and “musical training” (1: “none”, 2: “moderate”, 3: “high”).

For the PWS, the stuttering severity was both self-evaluated (1: “mild”, 2: “moderate”, 3: “severe”) and evaluated with the SSI-4 Instrument.

PWS PNS
Age Gender Musical
training
SSI-4 score Self-evaluated severity Age Gender Musical
training
PWS1 44 F 0 17 (very mild) 1 PNS1 50 0 0
PWS2 20 F 0 20 (mild) 3 PNS2 20 0 0
PWS3 56 M 0 16 (very mild) 2 PNS3 59 0 0
PWS4 39 M 0 10 (very mild) 2 PNS4 32 0 0
PWS5 54 M 0 12 (very mild) 2 PNS5 51 0 0
PWS6 44 M 0 19 (mild) 1 PNS6 40 0 0
PWS7 42 M 0 30 (moderate) 3 PNS7 39 0 0
PWS8 20 M 0 19 (mild) 3 PNS8 21 0 0
PWS9 48 M 0 26 (moderate) 3 PNS9 46 0 0
PWS10 65 M 0 26 (moderate) 3 PNS10 70 0 0
PWS11 34 M 2 10 (very mild) 2 PNS11 38 2 2
PWS12 27 F 1 13 (very mild) 1 PNS12 25 1 1
PWS13 19 M 1 19 (mild) 2 PNS13 19 1 1
PWS14 35 M 1 18 (mild) 1 PNS14 34 1 1
PWS15 25 M 2 18 (mild) 2 PNS15 24 2 2
PWS16 48 M 2 34 (severe) 3 PNS16 47 2 2
Average 35.7 ± 15.3 Average 36.0 ± 16.4

2.2 Fluency assessment

Participants were asked to self-evaluate their stuttering severity as ‘mild,’ ‘moderate’, or ‘severe’. Based on a reading task and a picture description task, a speech therapist, specialized in stuttering, also assessed the participants’ stuttering severity objectively with the SSI-4 (Stuttering Severity Instrument) [72]. A significant correlation was observed between SSI-4 scores and the self-evaluated severity (see Table 1) (R = 0.57, p = 0.02). Accordingly, the SSI-4 scores were considered for analysis.

2.3 Tasks

Five rhythmic conditions were explored: three isochronous, one non-isochronous and one aperiodic rhythm, summarized in Fig 1.

Fig 1. Summary of the five tasks: 1:1_ISO_SYNC—Synchronization task with a quadruple metered isochronous pattern; 0:1_ISO_REPRO–Reproduction, without any external reference, of a quadruple metered isochronous pattern, after listening passively to it; 1:4_ISO_SYNC: Synchronization task with a quadruple metered isochronous pattern, where only the strong beats (one every four) were marked by an auditory stimulus; NONISO_SYNC—Synchronization task with a quadruple metered non-isochronous pattern; REACT–Reaction task to an unpredictable and aperiodic pattern.

Fig 1

The small lines indicate the metronome beats of an 8- beat cycle. The black dots indicate the auditory stimuli that were played to the participants. The grey triangles indicate the participants finger taps.

  • 1:1_ISO_SYNC—Synchronization task with an isochronous pattern

    The participants were presented with a simple periodic pattern with an Inter-stimulus Onset Interval (IOI) of 500 ms (i.e., a tempo of 120 BPM). Since a metrical organization of beats (into groups of 2, 3, or 4) arises naturally and automatically when listening to an isochronous sequence of identical tones [7375], we controlled for that perceptual grouping and induced the perception of quadruple meter, i.e., with a “strong” or accentuated beat sensed every four pulses, the other beats sensed as “weak” or unaccentuated). To achieve this, auditory stimuli were organized into 8-beat cycles, with a metronome click marking the pulse on each beat, and an additional audio beep (Pitch: 1100 Hz; 20 ms) played simultaneously on the first seven beats only (without variations in pitch, loudness, or duration) (see Fig 1). Participants were instructed to listen passively to two cycles of that pattern before they started tapping in synchrony with the beat. For the analysis, the first 8-beat cycle was distinguished from, and compared to the next two 8-beat cycles (2nd and 3rd) to examine a potential effect of sensorimotor learning during the beginning of these tasks.

  • ISO_REPRO–Reproduction, without any external reference, of an isochronous pattern, after listening passively to it

    The participants were presented with the same pattern as described for the synchronization task 1:1_ISO_SYNC. After listening passively to two cycles of the pattern, the external auditory metronome stopped, and the participants started tapping as regularly as possible, trying to keep the same pace as in the previously perceived pattern (120 BPM) (see Fig 1). For the analysis, the first cycle of taps was distinguished from, and compared to, the next two cycles (Taps 9 to 24), to explore the performance of internalizing and reproducing the beat after passive listening, and the potential improving effect of motor engagement in reproduction.

  • 1:4_ISO_SYNC—Synchronization task with an isochronous pattern, where only the strong beats (one every four) are marked by an auditory stimulus:

    After listening passively to two cycles of the isochronous pattern described earlier in 1:1_ISO_SYNC and ISO_REPRO, the external auditory stimuli were played back every 4 beats only–on the 1st and the 5th beats of the 8- beat cycle, supposed to be perceived as “strong” in a quadruple meter, while the participants started tapping as regularly as possible, trying to keep the same pace as in the previously perceived pattern (see Fig 1). Only the stabilized phase of this task (2nd and 3rd cycles of taps) was considered for analysis.

  • NONISO_SYNC—Synchronization task with a quadruple metered non-isochronous pattern:

    The participants were presented with a non-isochronous pattern of seven taps distributed over the 8-beat cycle, still following a quadruple meter and a tempo of 120 BPM. Five of the notes fell “on the beat” (i.e., synchronized with the metronome pulse) while two fell “half the beat” (i.e., exactly in between two metronome pulses) (see Fig 1). Like in 1:1_ISO_SYNC, a metronome click marked the pulse on each beat, while an audio beep played the seven “notes” of the non-isochronous pattern (without variations in pitch, loudness, or duration) (see Fig 1). After listening passively to two cycles of this pattern, participants started to tap in synchrony with the audio beep. In this task again, only the 2nd and 3rd cycles of taps were considered for analysis.

  • REACT–Reaction task to an unpredictable and aperiodic pattern:

    The reaction task consisted of responding with a tap as quickly as possible after hearing auditory beeps, played in a non-periodic, and therefore unpredictable, way (see Fig 1). The inter-stimulus onset interval (IOI) ranged from 200 to 800 ms, with a quasi-flat distribution over a 1 min interval. Unlike in the previous tasks, the REACT task did not include an example phase and the participant could start tapping when ready. In this task, only the 9th to 24th taps were considered for analysis.

The participants performed two trials of each task. The condition 1:1_ISO_SYNC was always performed first, followed by REACT, then NONISO_SYNC. The more complex tasks 1:4_ISO_SYNC and ISO_REPRO were performed at the end of the session. During a practice session outside the booth, the experimenters explained and practiced the tasks with the participants until they were sure that the participants understood the instructions, which did not mean that they were able to achieve the tasks perfectly.

Next, before the actual rhythmic task started, but already inside the booth and being experimentally set-up, spontaneous French speech was elicited by a “spot-the-difference” task during which the participant was instructed to describe differences within pairs of pictures. Finally, a French reading text was employed to elicit more controlled speech material. Both tasks provided material to evaluate the Stuttering Severity Index and to familiarize participants with the experimental setup.

2.4 Data collection and experimental set-up

During the experiment, the participants sat at a table, with their dominant lower arm and hand resting on the table, such that they were able to move the index finger easily without moving the arm or hand. Finger tapping events were recorded using a gauge strain sensor (EPL-D11-25P from Meas France), attached to the table, and located just under the index finger of the participant. A microphone simultaneously recorded the resulting audio signal. Both the force signal from the sensor and the audio signal were recorded with a Biopac MP150 acquisition system and the associated Acknowledge software, at a sampling rate of 20 kHz, over 16 bits.

The auditory stimuli (metronome click and audio beep) were played binaurally through earphones/earplugs at a comfortable level, indicated by the participant. The earphones and the moderate tapping force prevented the participant from getting direct auditory feedback from their taps. The metronome click and audio beep were also recorded on a second channel of the Biopac system, synchronously with the force signal.

2.5 Extracted descriptors

First, the force signal was low-pass filtered (Chebyshev filter, cutoff frequency of 100 Hz, using the function filtfilt in Matlab (R2018b) to extract its envelop, and normalized, based on its maximum value observed in each executed tapping task. For each tap, the first sharp peak of the force signal, corresponding to the tapping instant, was detected automatically (using the Matlab function “findpeaks”, with a minimum interpeak distance of 200 ms and a 20% threshold for peak height). These tapping instants were saved in PRAAT [76] annotation files, and were all manually verified and corrected.

From each tapping realization, three measures were extracted, based on the output force signal and the auditory signal played to the participant (see Table 2):

Table 2. Summary of the seven descriptors considered in this study, depending on the condition.

CONDITION
1:1_ISO_SYNC 1:4_ISO_SYNC NONISO_SYNC ISO_REPRO REACT
1st cycle 2nd and 3rd cycles 2nd and 3rd cycles 2nd and 3rd cycles 1st cycle 2nd and 3rd cycles 2nd and 3rd cycles
Parameters extracted for each tap
Reaction Time (RT) x
Phase Angle (PA)
• In general, for all taps x x
• Distinguished for
 strong beats x x x
 weak beats x x x
 taps falling “half beat” x
Tapping Force (TF)
• In general, for all taps x x x
• Distinguished for
 strong beats x x x
 weak beats x x x
 taps falling “half beat” x
Parameters extracted for each train of taps
Reaction Time Variability (RT_var) x
Phase Locking Value (PLV)
• In general, for all taps x x
• Distinguished for
 strong beats x x x
 weak beats x x x
 taps falling “half beat” x
Drift in ITI over time x
Coefficient of Variation (CV) x x x
Periodicity Error (PE) x x
Tapping Force Variability (TF_var) x x x
  • Reaction Time (RT, in ms) was measured in the condition REACT as the time difference (ms) between a tap and the closest preceding auditory stimulus. This value was therefore always positive.

  • Phase Angle (PA, in degrees) was measured in the conditions 1:1_ISO_SYNC, 1:4_ISO_SYNC and NONISO_SYNC, as the angular conversion of Tapping Asynchrony, i.e., the time difference (ms) between a tap and the closest metronome pulse, relatively to the Inter-stimulus onset interval of 500 ms (IOI) (see Eq 1). Tapping asynchrony values were always between -250 ms and +250 ms, so that PA values ranged from -180° (completely desynchronized in advance to the auditory stimulus) to +180° (completely desynchronized following the auditory stimulus), passing through 0° (perfectly synchronized with the auditory stimulus). In the analysis, we distinguished taps that were synchronized with “strong” beats of the 8-beat cycle (the 1st and 5th beats, marked by an auditory stimulus in all three conditions 1:1_ISO_SYNC, 1:4_ISO_SYNC and NONISO_SYNC) from those synchronized with “weak” beats of the cycle (all other beats, marked by an auditory stimulus in 1:1_ISO_SYNC and NONISO_SYNC, but only “internalized” in 1:4_ISO_SYNC), and also from those falling “half-beat” (for the condition NONISO_SYNC only).
    PA=Asynchrony*360IOI (1)
  • Tapping Force (TF) was defined as the amplitude of the first sharp peak of the force signal. It was not calibrated in Newtons and was therefore expressed in arbitrary units. However, the same experimental set-up and calibration of the recording equipment was used for all participants, enabling inter-and intra- subject comparisons.

Six other descriptors were measured over a train of taps:

  • Variability in Reaction Time (RT_Var) was measured as the standard deviation of RT values over the taps 9th to 24th in the condition REACT.

  • Phase Locking Value (PLV), characterizing the consistency of the stimulus-tap synchrony, was measured over the 2nd and 3rd 8-beat cycles of the conditions 1:1_ISO_SYNC, NONISO_SYNC and 1:4_ISO_SYNC, as well as over the very first 8-beat cycle of the condition 1:1_ISO_SYNC. PLV is defined as the norm of the sum of all the PA vectors (PA is a unit vector of phase PA in a plane) divided by their number N [22] (see Eq 2). In case the stimulus-tap asynchrony, and therefore the PA values, remain constant over a complete tapping train, the corresponding PA vectors align and their sum results in a vector of maximum length (i.e., ideally a PLV of 1). If the stimulus-tap asynchrony, and therefore the PA values, vary considerably from tap to tap, the PA vectors points into inconsistent directions and their sum results in a vector of smaller length (i.e., a PLV significantly smaller than 1). In case PA varies a lot, the PLV value can also be very small, due to systematic underestimation or over-estimation of the ITI. PLV values were determined separately for the “strong” and “weak” taps during 1:1_ISO_SYNC, 1:4_ISO_SYNC and NONISO_SYNC, as well as for the taps falling “half-beat” for the condition NONISO_SYNC.
    PLV=PAN (2)
  • Drift in Inter-Tap Interval over time was evaluated over the first 24 taps during the condition ISO_REPRO. The Inter-Tap Interval (ITI) was defined as the time difference, in ms, between two consecutive taps. A significantly non-null regression slope between the variation of ITI values and the tap number of a train (from 1 to 24) indicated whether the ITI followed a global acceleration (positive slope) or deceleration (negative slope). A non-significant slope indicated that no significant drift occurred over time.

  • Coefficient of Variation (CV, in %), was measured as the standard deviation of ITI values, relatively to their mean, over the first cycle of taps (1 to 8) and the next two cycles (taps 9 to 24) of the conditions ISO_REPRO and 1:1_ISO_SYNC.

  • Periodicity Error (PE, in ms) was measured again over the first cycle of taps (1–8) and the next two cycles (taps 9 to 24) of the conditions ISO_REPRO, as the time difference, in absolute value, between the “target” period of the previously heard pattern (500 ms) and the actual period of the produced train of taps, i.e., its mean ITI value.

  • Variability in Tapping Force (TF_Var): was measured as the standard deviation of tapping force values over the stabilized phase (2nd and 3rd cycles, or taps 9 to 24) of the three conditions 1:1_ISO_SYNC, NONISO_SYNC and ISO_REPRO.

2.6 Statistical analysis

The statistical analyses were conducted using the software R [77]. Linear mixed models (using the R package nlme) were used to explore the variation of all the descriptors, except Phase Angle, whose variation was explored with Bayesian circular mixed models (using the R package bpnreg [78]). The variables RT and CV were log-transformed, and the variable PLV was logit transformed.

For linear mixed models, hypotheses about the model’s normality and homoscedasticity were validated by looking at the residuals’ graphs. When more than one fixed effect was considered in the model, the interaction between them was tested with Likelihood Ratio Tests, and specific contrasts were further examined with Bonferroni adjustments (using the R package “multcomp”).

For linear mixed models, the contrast between two conditions was considered significant when p < .05. Since Bayesian circular mixed models do not return any p-values, two conditions were considered significantly different if their Highest Posterior Density (HPD) intervals, estimated by the model, did not overlap, or if the HPD interval of their difference did not include zero.

3 Results

For the sake of conciseness and clarity, the RESULTS section focuses on the main and most interesting results. Complementary analyses were conducted, in particular to test the correlation between some of the parameters. These non-significant results are available in section B of the Supplementary Material.

3.1 Motor delays and variability at the initiation of movements

A first question of this study was whether PWS differed from PNS by increased delays and variability at the initiation of movements, revealing a possible deficiency in movement initiation. To this end, differences between PWS and the PNS in the Reaction Time (RT) and its variability (RT_Var) in the condition REACT were tested, based on the mixed models [log(RT) ~ GROUP + 1|Participant] and [RT_Var ~ GROUP + 1|Participant] (i.e., “GROUP” considered as a fixed effect, and Participant as a random effect).

As expected, a positive Reaction Time, of 232 ± 6 ms on average, was observed when following unpredictable auditory stimuli, as evoked in the condition REACT. PWS and PNS did not differ in their average Reaction Time (F(1,30) = 0.39, p = .54) (see Fig 2a), or in its variability (RT_Var) (F(1, 30) = 0.001, p = .97) (see Fig 2b).

Fig 2. (a) Average finger reaction time and (b) variability of this reaction time, in the condition REACT during which participants had to follow aperiodic and unpredictible auditory stimuli.

Fig 2

People who stutter (PWS, N = 16) are compared with typical adults without speech disorder (PNS, N = 16).

The average RT and RT_Var values of PWS were also not found to correlate significantly with stuttering severity (see section B.1.1 of the supplementary material).

3.2 Beat perception and reproduction

3.2.1 Degree of ITI variability of the reproduced pattern in the condition ISO_REPRO

An additional question was whether PWS faced difficulties with tapping an isochronous sequence on their own, without the help of external auditory triggers. The Coefficient of Variation (CV) of the inter-tap intervals (ITIs) is inversely related to the degree of isochrony of the reproduced pattern in the condition ISO_REPRO. Variations of CV were explored, considering the mixed model [log(CV) ~ GROUP * TIME + 1|Participant], with the 2-level factor TIME = {First 8 beat cycle of ISO_ REPRO; Second and third 8-beat cycles of ISO_REPRO}.

  • Group: PWS showed a higher Coefficient of Variation (i.e., less isochronous tapping) compared to PNS (∆log(CV)PNS-PWS = 0.24 ± 0.11, z = 2.19, p = .03) in ISO_REPRO (see Fig 3a). This higher variability of tapping did not correspond to a significant acceleration or deceleration of ITIs is over time (see section B.2.1 of the supplementary Material). The average Coefficient of Variation of PWS was not found to correlate significantly with stuttering severity as well (see section B.2.2 of the supplementary Material).

  • Interaction with motor engagement: The difference in tapping variability between PNS and PWS, in terms of higher Coefficient of Variation, was observed already during the very first taps (First 8-beat cycle) following passive listening (without motor engagement), and the magnitude of this difference did not change during the second and third cycles, after the motor system had been engaged (No significant interaction GROUP*TIME: df = 1, LRatio:1.28, p = 0.26). For both PWS and PNS, no significant reduction of the Coefficient of Variation was observed between the very first taps and the subsequent ones (Δlog(CV)9to24-1to8 = -0.047 ± 0.051, z = -0.92, p = 0.36) (see Fig 3a).

Fig 3. (a) Coefficient of Variation (CV), inversely related to the degree of isochrony of the reproduced pattern, measured over the very first taps or the stabilized phase of ISO_REPRO, compared to the stabilized phase of the synhcronization task 1:1_ISO_SYNC.

Fig 3

People who stutter (PWS, N = 16) are compared with typical adults without speech disorder (PNS, N = 16). (b) Average Periodicity Error (PE) when reproducing the specific 500ms period of the previously heard isochronous pattern of the condition ISO_REPRO, over the very first taps (first 8-beat cycle) or the more stabilized phase (second and third 8-beat cycles) of the condition.

No significant correlation was observed between the Coefficient of Variation and the variability in Tapping Force (TF_Var) on the same trains of taps, or the variability in Reaction Time (RT_Var) in the REACT condition (see section B.2.2 of the supplementary material).

3.2.2 Average Periodicity Error (PE) in the condition ISO_REPRO

One additional question is whether PWS face difficulties at extracting, internalizing, and then reproducing the specific tempo of a periodic pattern. To investigate that question, the variation in Periodicity Error (PE) over a cycle of taps was explored, considering the mixed model [PE ~ GROUP * TIME + 1|Participant], with the 2-level factor TIME = {First 8-beat cycle of ISO_ REPRO; Second and third 8-beat cycles of ISO_REPRO}. No significant interaction GROUP*TIME was observed (df = 1, LRatio = 0.0044, p = 0.95).

  • Group: PWS were not significantly worse than PNS at reproducing the specific period of the previously heard isochronous pattern (ΔPEPWS-PNS = 3 ± 3 ms, z = 1.03, p = .30, see Fig 3b): The participants showed an average Periodicity Error of 13 ± 1 ms, which corresponds to 2.6% of the 500 ms IOI target. The average Periodicity Error of PWS was not found to correlate significantly with stuttering severity (see section B.2.3 of the supplementary Material). For both PNS and PWS, this Periodicity Error did not correspond to a systematic under-estimation or over-estimation or the 500 ms pattern period: both groups produced tapping trains with a comparable mean ITI of 501 ± 4 ms (ΔMean_ITIPWS-PNS = 0 ± 5ms, z = 0.07, p = .95). The average Periodicity Error of each participant also did not correlate with his/her average Reaction Time (RT) in the REACT condition (see section B.2.3 of the supplementary material).

  • Interaction with motor engagement: Periodicity Error was also not significantly reduced after motor engagement (second and third 8-beat cycles), compared to the first eight-beat cycle of taps following passive listening only (ΔPE9to24-1to8 = 1 ± 2 ms, z = 0.89, p = .37) (see Fig 3b).

3.3 Synchronization abilities: Phase Angle (PA: Accuracy) and Phase Locking Value (PLV: Consistency)

3.3.1 Reference condition (1:1_ISO_SYNC)

The variations of Phase Angle (PA), Phase Locking Value (PLV) and Tapping Force (TF) in the condition 1:1_ISO_SYNC were explored, considering the Bayesian circular mixed model [PA ~ GROUP + TIME + 1|Participant], and the linear mixed models [logit(PLV) ~ GROUP * TIME + 1|Participant] and [TF ~ GROUP * TIME + 1|Participant], with the 2-level factor TIME = {first 8-beat cycle of 1:1_ISO_SYNC; 2nd and 3rd cycles of 1:1_ISO_SYNC}.

  • Prediction abilities: Both groups demonstrated negative Phase Angles in 1:1_ISO_SYNC, with an average of -29.4 ± 15.3 degrees (see Fig 4a), which indicated that they were not reacting to the stimulus, as in the condition REACT (see Fig 2a), and that both groups were able to predict and anticipate the beat.

  • Group: Compared to PNS, PWS showed larger negative Phase Angles, indicating a reduced synchronization accuracy (ΔPAPWS-PNS = -10.8 ± 5.8 degrees, HPD = [0.2 22.5]) (see Fig 4a), as well as lower Phase Locking Values, signifying a reduced synchronization consistency (Δlogit(PLV)PWS-PNS = -0.59 ± 0.21, z = -2.81, p < .01) (see Fig 4b). No significant difference in average Tapping Force was observed between PWS and PNS (ΔTFPWS-PNS = -0.025 ± 0.058 a.u, z = -0.44, p = .66) (see Fig 5). The average Phase Angles and Phase Locking Values of PWS in the condition 1:1_ISO_SYNC were not found to correlate significantly with stuttering severity (see section B.3.1 of the supplementary material).

  • Interaction with motor engagement: No significant difference in synchronization accuracy in terms of Phase Angle (PA) was observed between the taps produced during the very beginning of the condition 1:1_ISO_SYNC (first 8-beat cycle), and the next two cycles, for both PNS (ΔPA = 0.9 ± 1.5 degrees, HPD = [-2.1 +4.1]) and PWS (ΔPA = 2.4 ± 2.1 degrees, HPD = [- 1.7 +6.6]) (see Fig 4a). Synchronization consistency, in terms of Phase Locking Values (PLV), however, showed a significant improvement between the first 8-beat cycle and the next two cycles of the condition 1:1_ISO_SYNC (Δlogit(PLV) = 0.33 ± 0.12, z = 2.85, p = .004), for both groups of participants (No significant interaction Time*Group: df = 1, LRatio = 0.35, p = 0.55) (see Fig 4b).

  • Relationship to other indices of motor delays and variability: The lower Phase Locking Values observed for PWS in this simple synchronization task–revealing an increased variability of the asynchrony between a tap and the closest auditory stimulus–was also related to a greater Coefficient of Variation–corresponding to an increased variability of the inter-tap intervals (Δlog(CV)PWS-PNS = 0.24 ± 0.09, z = 2.53, p = .01) (see Fig 3a). However, this average coefficient of variation in 1:1_ISO_SYNC was significantly greater than in the condition ISO_REPRO (Δlog(CV)1:1_ISO_SYNC–ISO_REPRO = 0.13 ± 0.04, z = 3.16, p = .002), for both groups (Non-significant interaction GROUP*TASK: df = 1, LRatio = 3.11, p = 0.08). No significant correlation was observed in 1:1_ISO_SYNC between the degree of NMA and the Tapping Force or between the Phase Locking Values of each train of taps and its corresponding variability in Tapping Force (see section B.3.3 of the supplementary material). No significant correlation was also observed between the average degree of NMA of each participant in 1:1_ISO_SYNC and his/her average reaction time (RT) in the REACT condition, or between the average Phase Locking Value of each participant in 1:1_ISO_SYNC and his/her average Variability in Reaction Time in REACT (see section B.3.2 of the supplementary material).

Fig 4. (a) Average Phase Angle and (b) Phase Locking Value, for the synchronization task with an isochronous pattern (1:1_ISO_SYNC), over the very first 8-beat cycle of taps or the two next cycles.

Fig 4

Fig 5. Tapping Force (in arbitrary unit) on the “strong” vs. “weak” beats of a 8-beat isochronous pattern, in which all the beats were marked by an auditory stiumulus (1:1_ISO_SYNC), or only the strong ones (1:4_ISO_SYNC), and on the “half-beat” pulses of a non-isochronous pattern (NONISO_SYNC).

Fig 5

People who stutter (PWS, N = 16) are compared to with matched control particpants without speech disorders (PNS, N = 16).

3.3.2 Perception and reproduction of meter

To assess how PWS and PNS perceive and reproduce higher levels of beat organization, the variations of Phase Angle (PA), Phase Locking Value (PLV), and Tapping Force (TF) with metrical hierarchy were further explored, considering for the two tasks, 1:1_ISO_SYNC and NONISO_SYNC, the Bayesian circular mixed models [PA ~ GROUP + STRENGTH + 1|Participant] or the linear mixed models [logit(PLV) ~ GROUP + STRENGTH + 1|Participant] and [TF ~ GROUP + STRENGTH + 1|Participant], with STRENGTH = {strong beats; weak beats} in 1:1_ISO_SYNC, and STRENGTH = {strong beats; weak beats; taps falling “half-beat”} in NONISO_SYNC.

  • Beat strength in 1:1_ISO_SYNC: The results showed that in the condition 1:1_ISO_SYNC, taps falling on “strong” beats (in our case, the 1st and 5th of each 8-beat cycle) were indeed produced with greater Tapping Force than taps falling on “weak” beats (remaining beats) (ΔTFstrong-weak = 0.019 ± 0.009 a.u, p = .04), for both PWS and PNS (Interaction GROUP*STRENGTH: df = 1, LRatio = 0.003, p = 0.96) (see Fig 5a). They were also synchronized more accurately, i.e., closer to the beat (ΔPAstrong-weak = 4.5 ± 1.8 degrees, HPD = [1.1 8.1]), with a similar strong-weak contrast in both groups of participants (Interaction GROUP*STRENGTH: HPD = [-6.1 +19.8]) (see Fig 6a). Synchronization consistency was not significantly affected by beat strength (Δlogit(PLV) strong-weak = 0.03 ± 0.19, z = 0.16, p = .87), for both PNS and PWS (Interaction GROUP*STRENGTH: df = 1, LRatio = 0.02, p = .89) (see Fig 7a).

  • Beat strength in NONISO_SYNC: The significant differences in Phase Angles and Tapping Force observed between strong and weak taps in 1:1_ISO_SYNC, were no longer observed in NONISO_SYNC (ΔPAstrong-weak = -0.7 ± 0.4 degrees, HPD = [-4.7 +3.0]) (see Fig 6c) (ΔTF strong-weak = 0.010 ± 0.016 a.u, p = .79; see Fig 5c). Like in 1:1_ISO_SYNC, strong and weak taps also did not differ significantly in NONISO_SYNC in terms of synchronization consistency, i.e., Phase Locking Values (Δlogit(PLV)strong-weak = 0.02 ± 0.13, p = 0.98; see Fig 7c). Taps falling “half-beat” in the condition NONISO_SYNC were also not produced with a significantly reduced consistency, compared to taps falling “on the beat”–synchronized with strong or weak beats (Δlogit(PLV)on the beat-half beat = 0.01 ± 0.12, z = 0.06, p = .99) (see Fig 7c). However, they were synchronized less accurately (ΔPAon the beat-half beat = 5.9 ± 2.0, HPD = [2.2 9.9]; Interaction GROUP*STRENGTH: HPD = [-9.2 +21.6])(see Fig 6c), and with a significantly weaker Tapping Force (ΔTF on the beat—half beat = -0.0730 ± 0.019 a.u, z = -3.87, p = .0002) (see Fig 5c).

Fig 6. Average Phase Angle on the “strong” vs. “weak” beats of a 8-beat isochronous pattern, in which all the beats were marked by an auditory stiumulus (1:1_ISO_SYNC), or only the strong ones (1:4_ISO_SYNC), and on the “half-beat” pulses of a non-isochronous pattern (NONISO_SYNC).

Fig 6

People who stutter (PWS, N = 16) are compared to with matched control particpants without speech disorders (PNS, N = 16).

Fig 7. Phase Locking Value on the “strong” vs. “weak” beats of a 8-beat isochronous pattern, in which all the beats were marked by an auditory stiumulus (1:1_ISO_SYNC), or only the strong ones (1:4_ISO_SYNC), and on the “half-beat” pulses of a non-isochronous pattern (NONISO_SYNC).

Fig 7

People who stutter (PWS, N = 16) are compared to with matched control particpants without speech disorders (PNS, N = 16).

3.3.3 Effect of beat internalization vs. marking by an external auditory stimulus (1:4_ISO_SYNC vs. 1:1_ISO_SYNC)

To investigate how PWS perceive and reproduce internalized beats, the task 1:4_ISO_SYNC in which only the first and fifth beats were marked by an external auditory stimulus was compared to the task 1:1_ISO_SYNC in which a stimulus was played on all beats. The variations of Phase Angle (PA), Phase Locking Value (PLV) and Tapping Force (TF) were further explored in these two tasks, considering the Bayesian circular mixed model [PA ~ GROUP + CONDITION + 1|Participant] and the linear mixed models [logit(PLV) ~ GROUP * CONDITION + 1|Participant] and [TF ~ GROUP * CONDITION + 1|Participant] for either strong or weak beats, distinctly (with CONDITION = {1:1_ISO_SYNC; 1:4_ISO_SYNC}).

  • Synchronization accuracy: No significant difference in Phase Angle was observed between 1:1_ISO_SYNC and 1:4_ISO_SYNC for the strong beats, which were marked by an auditory stimulus in both conditions (ΔPA1:4_ISO_SYNC-1:1_ISO_SYNC = 4.4 ± 6.1 degrees, HPD = [-7.1 +17.0]). This absence of significant differences in accuracy was observed for PWS as well as PNS (No significant Interaction GROUP*CONDITION: HPD = [-16.9 26.1]) (see Fig 6a and 6b). For the condition 1:4_ISO_SYNC, in which weak beats were not marked by an auditory stimulus, both PNS and PWS “synchronized” the weak taps closer to the theoretical beat position, compared to the condition 1:1_ISO_SYNC, in which weak beats were actually marked by an auditory stimulus (ΔPA 1:4_ISO_SYNC-1:1_ISO_SYNC = 11.4 ± 3.5 degrees, HPD = [-18.1–4.5]) (No significant Interaction GROUP*CONDITION: HPD = [-23.7 +18.8]).

  • Synchronization consistency: Phase Locking Values on weak beats were significantly decreased in 1:4_ISO_SYNC, compared to 1:1_ISO_SYNC (Δlogit(PLV)1:4_ISO_SYNC-1:1_ISO_SYNC = -2.06 ± 0.19, z = -10.89, p < .0001), for PWS as well as PNS (No significant interaction GROUP*CONDITION: df = 1, LRatio = 1.52, p = .22) (see Fig 7a and 7b). A similar decrease in synchronization consistency was observed for strong beats (Δlogit(PLV)1:4_ISO_SYNC-1:1_ISO_SYNC = -2.10 ± 0.23, z = -0.10, p < .0001), again similarly in PNS and PWS (No significant interaction GROUP*CONDITION: df = 1, LRatio = 0.046, p = .83).

  • Tapping Force: For both groups, Tapping Force was increased in the condition 1:4_ISO_SYNC, compared to 1:1_ISO_SYNC (ΔTF1:4_ISO_SYNC-1:1_ISO_SYNC = 0.086 ± 0.009 a.u, z = 9.83, p < .0001) (see Fig 5a and 5b).

  • Interaction with beat strength: Finally, the significant difference in synchronization accuracy observed between strong and weak taps in 1:1_ISO_SYNC, was no longer observed in 1:4_ISO_SYNC (ΔPAstrong-weak = 4.9 ± 3.1 degrees, HPD = [-0.7 11.5]) (see Fig 6b). Strong and weak taps also did not differ significantly in synchronization consistency in 1:4_ISO_SYNC (Δlogit(PLV)strong-weak = -0.01 ± 0.18, z = -0.04, p = 0.97), like in 1:1_ISO_SYNC (see Fig 7b). On the contrary, a significant difference in tapping force between strong and weak taps was maintained in 1:4_ISO_SYNC (ΔTFstrong-weak = 0.029 ± 0.012 a.u, z = -2.33, p = .020) for both groups, like in 1:1_ISO_SYNC (see Fig 5b).

3.3.4 Effect of rhythmic complexity (NONISO_SYNC vs. 1:1_ISO_SYNC)

One of the remaining questions was whether rhythmic complexity enhances the difference in synchronization variability, already observed between PWS and PNS in a simple synchronization task. To this end, the variations of Phase Angle (PA), Phase Locking Value (PLV) and Tapping Force (TF) were also further explored, considering the Bayesian circular mixed model [PA ~ GROUP + CONDITION + 1|Participant] and the linear mixed models [logit(PLV) ~ GROUP * CONDITION + 1|Participant] and [TF ~ GROUP * CONDITION + 1|Participant] for either strong or weak beats, separately (with CONDITION = {1:1_ISO_SYNC; NONISO_SYNC}).

Detailed results are available in section B.3.4 of the supplementary material. In summary, the increased rhythmic complexity in NONISO_SYNC was globally associated with an improved synchronization accuracy (i.e., smaller NMA), compared to the simple synchronization task 1:1_ISO_SYNC (see Fig 6a and 6c), a reduced synchronization consistency (i.e., smaller PLV) (see Fig 7a and 7c), and a greater tapping force (see Fig 5a and 5c). These results were observed for strong as well as weak beats, and similar in PWS and PNS groups.

4 Discussion

The study investigated the rhythmic tapping behavior of people who stutter compared to people who do not stutter and considered several levels of processing at which differences were hypothesized to occur: 1- the execution of movements, in particular their initiation (as measured in the task REACT), 2- the perception of beat, at a given periodicity (as measured in the task ISO_REPRO), 3- the on-line adaptation and improvement of their accuracy and consistency, based on sensory feedback (as measured in the tasks 1:1_ISO_SYNC, 1:4_ISO_SYNC and NONISO_SYNC).

4.1 Motor delays and variability in the execution of movements

One of the current theories is that stuttering originates from a dysfunctional Basal Ganglia, and more generally a dysfunctional “Cortico-basal ganglia-thalamocortical loop”, resulting in disrupted motor execution, such as difficulties initiating movements [3,47]. Several previous studies indeed reported longer voice reaction times [26,27,79] and finger reaction times in PWS [26,28]. Our study did not confirm these studies: no significant difference in average finger reaction time, or its variability, was observed here between PNS and PWS. When taking severity into account, the average reaction time and its intra-individual variance did not correlate significantly with the SSI score. The finger reaction time of each individual did also not predict the average accuracy in a simple synchronization task. No significant link was also observed between the intra-individual variability in reaction time and the consistency at synchronizing with a simple isochronous pattern. These different observations support the idea that, in our experiment, the participants who stutter did not demonstrate a deficit of movement initiation or at least, that this did not affect externally triggered movements.

In their “dual premotor” model, Alm et al. [80] distinguished a “medial” premotor circuit (involving the basal ganglia and the supplementary motor area), involved when initiating and sequencing automatized self-triggered actions, and a “lateral” premotor circuit (involving the cerebellum and the lateral premotor cortex), involved in initiating and sequencing non-automatized actions, triggered by external stimuli. They suggested that the medial circuit is impaired in stutterers, while the lateral one is intact, explaining the observed improved fluency of PWS while speaking with a metronome, choral reading, and singing [8184]. It was hypothesized that this external information either provides triggers to initiate speech sequences or forces the speaker to pay close attention to the available sensory information, making the movements less automatized. Based on the dual premotor model, we expected in our study to observe timing differences between PWS and PNS, expressed as the Periodicity Error or difference between the dictated inter-stimulus onset interval and the performed interval in the reproduction task ISO_REPRO, during which the tapping was not triggered by external signals, and thus was mediated by the medial premotor circuit only. Contrary to these expectations, no significant difference in Periodicity Error was observed between PNS and PWS. And on the contrary, PWS and PNS did differ significantly in terms of Negative Mean Asynchrony (or Phase Angle) when an external trigger was provided, i.e., in all the synchronization tasks, during which participants were hypothesized to rely more on their lateral premotor circuit. Furthermore, no significant correlation was observed between the average finger reaction time (in REACT) of each individual and his/her average accuracy (PE) in the reproduction task. No significant correlation was also observed between the intra-individual variability in reaction time in REACT and the average tapping variability (CV) in the reproduction task without external auditory stimuli. These results therefore suggest that the timing differences that were further observed between PNS and PWS in our study were not due to motor difficulties regarding initiating movements, whether self-triggered or by an external stimulus.

It was suggested that other possible motor impairments in PWS were associated either with inaccurate internal models, generating instable movements due to larger delays in feedback processing, or with neural noise corrupting the motor commands or the sensory inputs. Both impairments are expected to result in larger variability in motor actions, in terms of movement magnitude, timing or force, for PWS, in addition to greater variability in timing [21,22]. Supporting these hypotheses, previous studies reported a greater variability in movement amplitude and target in PWS [25,33,34], compared to PNS. Although an increased variability in timing was also observed in our study in PWS, when synchronizing with a simple isochronous pattern, no significant difference was observed between the two groups in terms of tapping force variability for such a “simple” task. In the more complex task NONISO_SYNC, however, an increased variability of the tapping gestures was observed in both timing (decreased PLV) and force, compared to 1:1_ISO_SYNC. Although the group difference in PLV was not significantly enhanced with complexity, PWS showed increased variability of tapping force during complex rhythmic tapping. Finally, no significant correlation was observed between the tapping variability in time (CV) and in force (TF_Var) in the reproduction task, or between the Phase Locking Values and the Tapping Force variability within the simple and complex conditions. These findings suggest that differences in CV and PLV observed between groups (PWS vs. PNS), may not be caused by motor impairment associated with less accurate internal models or by increased neural noise. These results are in line with studies that did not observe a greater motor implementation variance [23] when decomposing the total observed variance in tapping into a motor implementation and a central clock component [85]. This does not mean that PWS may not have any motor difficulties, though, but that these difficulties are not reflected in the tasks that were investigated in our study.

4.2 Beat perception and reproduction

In the condition ISO_REPRO, PWS showed the ability to tap an isochronous sequence on their own, without any external auditory reference, i.e., their tapping trains did not show any significant acceleration or slow-down. Furthermore, PWS showed a significant reduction of tapping asynchronies in a predictable pattern (1:1_ISO_SYNC), compared to an unpredictable pattern (REACT), proving their ability to predict and anticipate a regular event. Furthermore, taps were produced with a Periodicity Error (PE) and a tapping variability (CV) that remained within an “acceptable” range, which provides convincing evidence that PWS have the capacity to perceive the specific frequency of a regular pattern, while passively listening to it and to transfer this frequency in the motor domain. In the framework of the Oscillators Coupling Hypothesis, and considering Morillon et al.’s hypotheses [52,53], these observations therefore exclude the hypothesis of a strong deficit in the tuning of neuronal oscillations with the external beat, both in the auditory and in the motor domain, as well as in their interactions [3842].

Compared to PNS, PWS also did not show a significantly reduced accuracy (i.e., a greater Periodicity Error (PE)) when reproducing a previously perceived isochronous pattern with a specific periodicity. They did, however, show a significantly reduced consistency (i.e., a greater Coefficient of Variation (CV)), which supports the idea that PWS do not have a deficit at perceiving the exact periodicity of an isochronous pattern, but that their difficulties are rather related to reproducing the pattern with tapping gestures.

Several arguments were provided in the preceding section (4.1) that exclude the idea that timing differences between PWS and PNS simply result from an impaired motor execution. However, these differences can possibly be explained in the framework of the Oscillators Coupling Hypothesis, which assumes that neuronal oscillators are tuned in phase and frequency to the frequency of the external periodic stimulation, both in the sensory areas [43,44,51] and in motor areas [52,53]. In our study, the similar level of periodicity error observed in PWS and PNS, but the increased Coefficient of Variation observed in PWS for the condition ISO_REPRO (without external auditory triggers), suggests that the perception mechanism of the beat frequency works properly in PWS, in the sense that they perceive the beat accurately, but their difficulties are related to a deficit in the coupling of the oscillators driving the motor system,–so that they reproduce beat with increased variability. Since, in our study, the higher tapping variability of PWS, compared to PNS, was observed immediately during the first taps of the ISO_REPRO condition, after listening passively to the isochronous pattern to reproduce, and since no significant improvement was observed after several seconds of motor engagement, the oscillatory coupling deficit in the motor system does not seem to be due to the transition between perception and production, but are instead intrinsic and long lasting.

Finally, since an internalized awareness of beat enables us to link certain rhythmic events as more salient or important than others [86], a deficit in internalizing the beat was hypothesized to result in increased difficulties to perceive and reproduce complex rhythms, as well as to perceive and reproduce meter. In our study, we indeed observed that PWS showed more errors than PNS in the reproduction of the NONISO_SYNC pattern, which is in line with the results of Wieland et al. [87] and supports the idea that PWS have more difficulties than PNS in correctly perceiving and/or reproducing complex non-isochronous patterns. On the other hand, PNS and PWS did not differ significantly in their marking of beat hierarchy: For both PWS and PNS, taps falling on strong beats in 1:1_ISO_SYNC and 1:4_ISO_SYNC, were produced with a greater tapping force, compared with taps falling on weak beats. In NONISO_SYNC, for PNS as well as PWS, taps falling on-beat were also produced with a greater tapping force, compared with taps falling half-beat.

4.3 Sensorimotor integration and learning

A significantly reduced timing accuracy and consistency was observed in PWS in synchronization tasks of varying complexity, through greater degrees of Negative Mean Asynchrony and lower Phase Locking Values (PLV). In addition, for both PWS and PNS, the Phase Angles varied with 1- beat strength 2- the presence vs. absence of external auditory stimuli to mark the beat (increased NMA on weak beats in 1:1_ISO_SYNC, in which they are marked by an auditory stimulus, compared to 1:4_ISO_SYNC, in which they are “internalized” by the participants), and 3- task complexity and pulse rate (reduced NMA in both strong and weak beats of NON_ISO_SYNC, compared to 1:1_ISO_SYNC–in agreement with the reduced NMA observed on non-isochronous musical excerpts [20] or with shorter ITI [58,63]). The fact that Phase Angles depended on the task and the lack of correlation with the average Reaction Time (measured in REACT), excludes the idea that NMA in general, and the greater NMA of PWS in particular, correspond to an anticipation strategy aiming at compensating for motor delays at the initiation of movements.

Our results also reject the idea that NMA reflects an under-estimation of Inter-stimulus Onset or Inter-Tap Intervals [20,88]. If this was the case, PWS, who showed a greater NMA, would have demonstrated a global acceleration or an average ITI lower than the pattern’s period in the ISO_REPRO condition, when no external stimulus was provided. In this reproduction task, however, PWS did not show any significant drift in ITI over time. The mean ITI of their tapping train was also not systematically “lower” than 500 ms, and comparable to that of PNS.

Many of our observations provide support for a slower processing of tactile and proprioceptive information in PWS, compared to auditory information [59,89]. As a consequence, this reduced kinesthetic sensitivity in PWS [30,66,67] increases this integration delay between auditory and kinesthetic feedback. Several studies indeed show significant differences between PWS and PNS in integrating kinesthetic feedback [30,66,67]. Such a theory explains why PWS perform taps even more in advance to the beat than PNS, so that they more accurately synchronize the perception of the tactile input with the perception of the auditory input. The key-role of synchronizing multiple sensory channels is also compatible with the well-known observation that speech fluency is improved by delayed auditory feedback in PWS [90,91]. If this delayed processing of tactile feedback originates from a systematic slower nerve conduction at the peripheral level (‘Fraisse-Paillard’ hypothesis [89]), it is predicted that the NMA remains constant within a same individual, regardless of beat strength. This, however, is not the case: in both groups, we observed that (1) PA is reduced in the weak beats of 1:4_ISO_SYNC, in the absence of acoustic beeps, as compared to the weak beats of 1:1_ISO_SYNC marked with beeps, while this reduction is not observed in strong beats that are marked by acoustic beeps in both tasks; and (2) NMA is reduced in the non-isochronous task (NONISO_SYNC as compared to 1:1_ISO_SYNC). An alternative model, the “sensory accumulation” model [59,65], assumes that the central nervous system detects a sensory stimulation when the number of afferent signals–that increases quickly from the onset of a sensory stimulation, following a so-called “accumulation function”–reaches a certain threshold of sensitivity. The steepness of that accumulation function depends not only on the stimulation intensity, but also on the density of sensory receptors, which is greater for the auditory modality than for the tactile one. Thus, the model assumes that the NMA observed in finger tapping synchronization corresponds to the compensation for the slower accumulation function of tactile feedback received from the finger, compared to that of the auditory metronome stimulation, so that both accumulation functions reach the detection threshold by the central system at the same time. The model furthermore predicts that the amplitude of that auditory-tactile delay, and the resulting NMA, depends on the stimulation intensity. In particular, the NMA is hypothesized to decrease when tapping force increases. In agreement with these predictions, it was observed in our study that in the condition 1:1_ISO_SYNC, strong taps differed from weak taps by both a greater force and a reduced NMA, in the same way that taps falling on the beat differed from half-beat taps in the condition NONISO_SYNC. Furthermore, PNS tapped in the non-isochronous task NONISO_SYNC with both an increased force and a reduced average NMA, compared to the simple synchronization task 1:1_ISO_SYNC.

Following a similar reasoning, the fact that PLV also did not remain constant within an individual but varied with 1- the presence vs. absence of external auditory stimuli to mark the beat (overall decrease in 1:4_ISO_SYNC, compared to 1:1_ISO_SYNC, for all taps) and with 2- complexity and pulse rate (overall decrease in NONISO_SYNC, compared to 1:1_ISO_SYNC, for all taps), and the lack of correlation with the variability in Reaction Time (RT_Var) or Tapping Force (TF_Var) again excludes the idea that the decreased PLV in PWS simply relates to motor difficulties, either with the initiation of movements, or more generally with their execution. Instead, the greater variability in inter-tap intervals (CV), similarly observed for both PNS or PWS in the synchronization task 1:1_ISO_SYNC, compared to the reproduction task ISO_REPRO, indicates that synchronizing with an auditory stimulus involves an additional source of variability (probably of sensori-motor nature), in addition to just tapping periodically–which already involves timing and motor variabilities. In any case, this increase in tapping variability in a synchronization task was not significantly enhanced in PWS, compared to PNS, so that there does not appear to be a deficit at this stage.

Finally, the variations in synchronization performance over time in 1:1_ISO_SYNC showed that PWS are distinguished from PNS by reduced synchronization accuracy and consistency as soon as the very first taps following a passive listening of the rhythmic pattern, and that synchronization consistency was then significantly improved after a few seconds of synchronous tapping (one 8-beat cycle). However, this improvement was similar for both PNS and PWS, and no such improvement was observed in terms of synchronization accuracy. These observations therefore exclude the hypothesis that timing difficulties in PWS originate from a deficit in sensorimotor learning, to consolidate internal beat representations.

5 Conclusions

Following a “differential” and behavioral approach, this study compared the performance of people who stutter (PWS) and people who do not stutter (PNS) in different rhythmic tasks of various complexity, to better understand the rhythmic deficits of PWS and to identify at which level some cognitive processes might be impaired, leading to the observed differences.

The data were analyzed from three theoretical perspectives: (1) stuttering is associated with motor deficits affecting the initiation and sequencing of movements or the accuracy of movements due to inaccurate predictive models or neural noise; (2) stuttering is associated with impaired coupling between external physical cyclical phenomena and neural oscillators both in perception and movement production, resulting in deficient beat perception and/or reproduction; (3) stuttering is associated with delays in the processing and integration of (multi)sensory feedbacks, resulting in deficient sensorimotor control and synchronization.

The results from our study, exploring a rhythmic deficiency in PWS, point towards (1) a deficit in neural oscillator coupling in production, but not in perception, of rhythmic patterns in PWS, and (2) a larger delay in multi-modal feedback processing for PWS.

Supporting information

S1 File

S1 Fig. (a) Correlation between this average Tapping Asynchrony in the condition REACT (i.e. the average finger Reaction Time) and the SSI score of PWS. (b) Correlation between this reaction time variability and the SSI score of PWS. S2 Fig. Tapping Force Variability in the reproduction task of an isochronous pattern, after passive listening (ISO_REPRO) and in both tasksof synhcronization to a 4-beat metered isochronous pattern (1:1_ISO_SYNC) or to a non-isochronous pattern (NONISO_SYNC). S3 Fig. (a) Correlation between the average log(CV) value and SSI score of PWS. (b) Correlation between the average log(CV) and the Reaction Time Variability (in the condition REACT) of each participant (N = 32). (c) Correlation between log(CV) and the Tapping Force Variability on each train of taps produced in the condition ISO_REPRO. S4 Fig. (a) Correlation between the average PE and the SSI score of PWS. (b) Correlation between the average PE in the condition ISO_REPRO, and the average Reaction Time in the condition REACT of each participant (N = 32). S5 Fig. (a) Correlation between the average PA in the condition 1:1_ISO_SYNC, and the SSI score of PWS. (b) Correlation between the average log(PLV) vaalue in the condition 1:1_ISO_SYNC, and the SSI score of PWS. S6 Fig. (a) Correlation between the average Phase Angle (PA) in the condition 1:1_ISO_SYNC, and the average Reaction Time in the condition REACT of each participant (N = 32). (b) Correlation between the average Phase Locking Value (logit((PLV)) in the condition 1:1_ISO_SYNC, and the average Variability in Reaction Time in the condition REACT of each participant. S7 Fig. (a) Correlation between the Phase Angle (PA) and the Tapping Force (TF) of each tap in the conditions 1:1_ISO_SYNC and 1:4_ISO_SYNC. (b) Correlation between the logit(PLV))value and the Tapping Force Variability on each train of taps produced in the conditions 1:1_ISO_SYNC and 1:4_ISO_SYNC. People who stutter (PWS, N = 16) are compared to with matched control participants without speech disorders (PNS, N = 16).

(PDF)

Acknowledgments

We also thank Silvain Gerber, Ladislas Nalborczyk and Pierre Baraduc for their advice on circular statistics, as well as the two reviewers for their constructive comments.

Data Availability

Data cannot be shared publicly because of the confidentiality clause in the ethical approval (CERGA: IRB00010290-2018-10-16-54). Data are available from the GIPSA-lab (contact via Laurent Girin: laurent.girin@gipsa-lab.fr) for researchers who meet the criteria for access to confidential data.

Funding Statement

This study was funded by the Agence Nationale de la Recherche (https://anr.fr/fr/)(Project StopNCo; ANR-14-CE30-0017; PI: MG) The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Civier O, Bullock D, Max L, Guenther FH. Computational modeling of stuttering caused by impairments in a basal ganglia thalamo-cortical circuit involved in syllable selection and initiation. Brain Lang. 2013;126(3):263–78. doi: 10.1016/j.bandl.2013.05.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Max L, Guenther FH, Gracco VL, Ghosh SS, Wallace ME. Unstable or insufficiently activated internal models and feedback-biased motor control as sources of dysfluency: A theoretical model of stuttering. Contemp Issues Commun Sci Disord. 2004;31:105–22. [Google Scholar]
  • 3.Mackay DG, Macdonald MC. Stuttering as a sequencing and timing disorder. In: In. College Hill Press; 1984.
  • 4.Blomgren M, Robb M, Chen Y. A Note on Vowel Centralization in Stuttering and Nonstuttering Individuals. J Speech Lang Hear Res. 1998;41(5):1042–51. doi: 10.1044/jslhr.4105.1042 [DOI] [PubMed] [Google Scholar]
  • 5.Freeman FJ, Ushijima T. Laryngeal muscle activity during stuttering. J Speech Hear Res. 1978;21(3):538–62. doi: 10.1044/jshr.2103.538 [DOI] [PubMed] [Google Scholar]
  • 6.Hutchinson JM, Norris GM. The differential effect of three auditory stimuli on the frequency of stuttering behaviors. J Fluen Disord. 1977;2(4):283–93. [Google Scholar]
  • 7.van Lieshout PH, Peters HF, Starkweather CW, Hulstijn W. Physiological differences between stutterers and nonstutterers in perceptually fluent speech: EMG amplitude and duration. J Speech Lang Hear Res. 1993;36(1):55–63. doi: 10.1044/jshr.3601.55 [DOI] [PubMed] [Google Scholar]
  • 8.Max L, Caruso AJ, Gracco VL. Kinematic analyses of speech, orofacial nonspeech, and finger movements in stuttering and nonstuttering adults. 2003; [DOI] [PubMed]
  • 9.Daliri A, Prokopenko RA, Flanagan JR, Max L. Control and Prediction Components of Movement Planning in Stuttering Versus Nonstuttering Adults. J Speech Lang Hear Res. 2014;57(6):2131–41. doi: 10.1044/2014_JSLHR-S-13-0333 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Sares AG, Deroche ML, Shiller DM, Gracco VL. Adults who stutter and metronome synchronization: evidence for a nonspeech timing deficit. Ann N Y Acad Sci. 2019;1449(1):56. doi: 10.1111/nyas.14117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.de Felício CM, Freitas RLRG, Vitti M, Regalo SCH. Comparison of upper and lower lip muscle activity between stutterers and fluent speakers. Int J Pediatr Otorhinolaryngol. 2007;71(8):1187–92. doi: 10.1016/j.ijporl.2007.04.008 [DOI] [PubMed] [Google Scholar]
  • 12.Caruso AJ, Abbs JH, Gracco VL. Kinematic analysis of multiple movement coordination during speech in stutterers. Brain. 1988; 111(2):439–55. doi: 10.1093/brain/111.2.439 [DOI] [PubMed] [Google Scholar]
  • 13.Hulstijn W, Summers JJ, van Lieshout PHM, Peters HFM. Timing in finger tapping and speech: A comparison between stutterers and fluent speakers. Hum Mov Sci. 1992;11(1–2):113–24. [Google Scholar]
  • 14.Loucks TMJ, De Nil LF, Sasisekaran J. Jaw-phonatory coordination in chronic developmental stuttering. J Commun Disord. 2007; 40(3):257–72. doi: 10.1016/j.jcomdis.2006.06.016 [DOI] [PubMed] [Google Scholar]
  • 15.Max L, Gracco VL. Coordination of oral and laryngeal movements in the perceptually fluent speech of adults who stutter. 2005; [DOI] [PubMed]
  • 16.McClean MD, Runyan CM. Variations in the relative speeds of orofacial structures with stuttering severity. J Speech Lang Hear Res. 2000; 43(6):1524–31. doi: 10.1044/jslhr.4306.1524 [DOI] [PubMed] [Google Scholar]
  • 17.Alfonso PJ. Implications of the concepts underlying task-dynamic modeling on kinematic studies of stuttering. Speech Mot Control Stuttering. 1991;79100. [Google Scholar]
  • 18.Alm PA. Stuttering and the basal ganglia circuits: a critical review of possible relations. J Commun Disord. 2004;37(4):325–69. doi: 10.1016/j.jcomdis.2004.03.001 [DOI] [PubMed] [Google Scholar]
  • 19.Etchell AC, Johnson BW, Sowman PF. Behavioral and multimodal neuroimaging evidence for a deficit in brain timing networks in stuttering: a hypothesis and theory. Front Hum Neurosci. 2014; 8. doi: 10.3389/fnhum.2014.00467 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Falk S, Müller T, Dalla Bella S. Non-verbal sensorimotor timing deficits in children and adolescents who stutter. Front Psychol. 2015; 6:847. doi: 10.3389/fpsyg.2015.00847 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Zelaznik HN, Smith A, Franz EA, Ho M. Differences in bimanual coordination associated with stuttering. Acta Psychol. 1997;96(3):229–43. doi: 10.1016/s0001-6918(97)00014-0 [DOI] [PubMed] [Google Scholar]
  • 22.Sares AG, Deroche MLD, Shiller DM, Gracco VL. Adults who stutter and metronome synchronization: evidence for a nonspeech timing deficit. Ann N Y Acad Sci. 2019; 29. doi: 10.1111/nyas.14117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Max L, Yudman EM. Accuracy and Variability of Isochronous Rhythmic Timing Across Motor Systems in Stuttering Versus Nonstuttering Individuals. J Speech Lang Hear Res. 2003; 46(1):146–63. doi: 10.1044/1092-4388(2003/012) [DOI] [PubMed] [Google Scholar]
  • 24.Amster BJ, Starkweather CW. Articulatory Rate, Stuttering and Speech Motor Control. In: Peters HFM, Hulstijn W, editors. Speech Motor Dynamics in Stuttering. Vienna: Springer; 1987; p. 317–28. [Google Scholar]
  • 25.Namasivayam AK, Van Lieshout P. Speech motor skill and stuttering. J Mot Behav. 2011;43(6):477–89. doi: 10.1080/00222895.2011.628347 [DOI] [PubMed] [Google Scholar]
  • 26.Cross DE, Luper HL. Voice reaction time of stuttering and nonstuttering children and adults. J Fluen Disord. 1979; 4(1):59–77. [DOI] [PubMed] [Google Scholar]
  • 27.Watson BC, Alfonso PJ. Foreperiod and stuttering severity effects on acoustic laryngeal reaction time. J Fluen Disord. 1983; 8(3):183–205. [Google Scholar]
  • 28.Starkweather CW, Franklin S, Smigo TM. Vocal and finger reaction times in stutterers and nonstutterers. J Speech Lang Hear Res. 1984;27(2):193–6. [DOI] [PubMed] [Google Scholar]
  • 29.Reich A, Till J, Goldsmith H. Laryngeal and manual reaction times of stuttering and nonstuttering adults. J Speech Lang Hear Res. 1981;24(2):192–6. doi: 10.1044/jshr.2402.192 [DOI] [PubMed] [Google Scholar]
  • 30.Namasivayam AK, Van Lieshout P, McIlroy WE, De Nil L. Sensory feedback dependence hypothesis in persons who stutter. Hum Mov Sci. 2009;28(6):688–707. doi: 10.1016/j.humov.2009.04.004 [DOI] [PubMed] [Google Scholar]
  • 31.Civier O, Tasko SM, Guenther FH. Overreliance on auditory feedback may lead to sound/syllable repetitions: Simulations of stuttering and fluency-inducing conditions with a neural model of speech production. J Fluen Disord. 2010;35(3):246–79. doi: 10.1016/j.jfludis.2010.05.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Tourville JA, Guenther FH. The DIVA model: A neural theory of speech acquisition and production. Lang Cogn Process. 2011;26(7):952–81. doi: 10.1080/01690960903498424 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Smith A, Kleinow J. Kinematic correlates of speaking rate changes in stuttering and normally fluent adults. J Speech Lang Hear Res. 2000;43(2):521–36. doi: 10.1044/jslhr.4302.521 [DOI] [PubMed] [Google Scholar]
  • 34.Walsh B, Mettel KM, Smith A. Speech motor planning and execution deficits in early childhood stuttering. J Neurodev Disord. 2015;7(1):1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Ratner NB, Sih CC. Effects of gradual increases in sentence length and complexity on children’s dysfluency. J Speech Hear Disord. 1987;52(3):278–87. doi: 10.1044/jshd.5203.278 [DOI] [PubMed] [Google Scholar]
  • 36.Bishop JH, Williams HG, Cooper WA. Age and task complexity variables in motor performance of stuttering and nonstuttering children. J Fluen Disord. 1991;16(4):207–17. [Google Scholar]
  • 37.Kleinow J, Smith A. Influences of length and syntactic complexity on the speech motor stability of the fluent speech of adults who stutter. J Speech Lang Hear Res. 2000;43(2):548–59. doi: 10.1044/jslhr.4302.548 [DOI] [PubMed] [Google Scholar]
  • 38.Dalla Bella S, Farrugia N, Benoit CE, Begel V, Verga L, Harding E, Kotz SA. BAASTA: Battery for the assessment of auditory sensorimotor and timing abilities. Behav Res Methods. 2017;49(3):1128–45. doi: 10.3758/s13428-016-0773-6 [DOI] [PubMed] [Google Scholar]
  • 39.Grahn JA, Rowe JB. Feeling the beat: premotor and striatal interactions in musicians and nonmusicians during beat perception. J Neurosci. 2009;29(23):7540–8. doi: 10.1523/JNEUROSCI.2018-08.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Lewis PA, Miall RC. Brain activation patterns during measurement of sub-and supra-second intervals. Neuropsychologia. 2003;41(12):1583–92. doi: 10.1016/s0028-3932(03)00118-0 [DOI] [PubMed] [Google Scholar]
  • 41.Repp BH. Sensorimotor synchronization: a review of the tapping literature. Psychon Bull Rev. 2005;12(6):969–92. doi: 10.3758/bf03206433 [DOI] [PubMed] [Google Scholar]
  • 42.Repp BH, Su YH. Sensorimotor synchronization: a review of recent research (2006–2012). Psychon Bull Rev. 2013;20(3):403–52. doi: 10.3758/s13423-012-0371-2 [DOI] [PubMed] [Google Scholar]
  • 43.Fujioka T, Trainor LJ, Large EW, Ross B. Internalized timing of isochronous sounds is represented in neuromagnetic beta oscillations. J Neurosci. 2012;32(5):1791–802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Nozaradan S, Peretz I, Missal M, Mouraux A. Tagging the neuronal entrainment to beat and meter. J Neurosci. 2011;31(28):10234–40. doi: 10.1523/JNEUROSCI.0411-11.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Ghitza O. Linking speech perception and neurophysiology: speech decoding guided by cascaded oscillators locked to the input rhythm. Front Psychol. 2011;2:130. doi: 10.3389/fpsyg.2011.00130 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Giraud AL, Poeppel D. Cortical oscillations and speech processing: emerging computational principles and operations. Nat Neurosci. 2012;15(4):511–7. doi: 10.1038/nn.3063 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Large EW, Palmer C. Perceiving temporal regularity in music. Cogn Sci. 2002;26(1):1–37. [Google Scholar]
  • 48.Merchant H, Honing H. Are non-human primates capable of rhythmic entrainment? Evidence for the gradual audiomotor evolution hypothesis. Front Neurosci. 2014;274. doi: 10.3389/fnins.2013.00274 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Doelling KB, Assaneo MF, Bevilacqua D, Pesaran B, Poeppel D. An oscillator model better predicts cortical entrainment to music. Proc Natl Acad Sci. 2019;116(20):10113–21. doi: 10.1073/pnas.1816414116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Zoefel B, Ten Oever S, Sack AT. The involvement of endogenous neural oscillations in the processing of rhythmic input: More than a regular repetition of evoked neural responses. Front Neurosci. 2018;12:95. doi: 10.3389/fnins.2018.00095 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Nozaradan S, Peretz I, Mouraux A. Selective neuronal entrainment to the beat and meter embedded in a musical rhythm. J Neurosci. 2012;32(49):17572–81. doi: 10.1523/JNEUROSCI.3203-12.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Morillon B, Hackett TA, Kajikawa Y, Schroeder CE. Predictive motor control of sensory dynamics in auditory active sensing. Curr Opin Neurobiol. 2015;31:230–8. doi: 10.1016/j.conb.2014.12.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Morillon B, Arnal LH, Schroeder CE, Keitel A. Prominence of delta oscillatory rhythms in the motor cortex and their relevance for auditory and speech perception. Neurosci Biobehav Rev. 2019;107:136–42. doi: 10.1016/j.neubiorev.2019.09.012 [DOI] [PubMed] [Google Scholar]
  • 54.Cooper MH, Allen GD. Timing Control Accuracy in Normal Speakers and Stutterers. J Speech Hear Res. 1977;20(1):55–71. doi: 10.1044/jshr.2001.55 [DOI] [PubMed] [Google Scholar]
  • 55.Wieneke G, Janssen P, Brutten GJ. Variance of central timing of voiced and voiceless periods among stutterers and nonstutterers. J Fluen Disord. 1995. Jun;20(2):171–89. [Google Scholar]
  • 56.Etchell AC, Ryan M, Martin E, Johnson BW, Sowman PF. Abnormal time course of low beta modulation in non-fluent preschool children: A magnetoencephalographic study of rhythm tracking. NeuroImage. 2016;125:953–63. doi: 10.1016/j.neuroimage.2015.10.086 [DOI] [PubMed] [Google Scholar]
  • 57.Olander L, Smith A, Zelaznik HN. Evidence that a motor timing deficit is a factor in the development of stuttering. J Speech Lang Hear Res. 2010. Aug;53(4):876–86. doi: 10.1044/1092-4388(2009/09-0007) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Mates J, Müller U, Radil T, Pöppel E. Temporal integration in sensorimotor synchronization. J Cogn Neurosci. 1994;6(4):332–40. doi: 10.1162/jocn.1994.6.4.332 [DOI] [PubMed] [Google Scholar]
  • 59.Aschersleben G. Temporal control of movements in sensorimotor synchronization. Brain Cogn. 2002;48(1):66–79. doi: 10.1006/brcg.2001.1304 [DOI] [PubMed] [Google Scholar]
  • 60.Vorberg D, Schulze HH. Linear phase-correction in synchronization: Predictions, parameter estimation, and simulations. J Math Psychol. 2002;46(1):56–87. [Google Scholar]
  • 61.Repp BH, Su YH. Sensorimotor synchronization: a review of recent research (2006–2012). Psychon Bull Rev. 2013;20(3):403–52. doi: 10.3758/s13423-012-0371-2 [DOI] [PubMed] [Google Scholar]
  • 62.Krause V, Pollok B, Schnitzler A. Perception in action: the impact of sensory information on sensorimotor synchronization in musicians and non-musicians. Acta Psychol (Amst). 2010;133(1):28–37. doi: 10.1016/j.actpsy.2009.08.003 [DOI] [PubMed] [Google Scholar]
  • 63.Zendel BR, Ross B, Fujioka T. The effects of stimulus rate and tapping rate on tapping performance. Music Percept. 2011;29(1):65–78. [Google Scholar]
  • 64.Mates J, Radil T, Pöppel E. Cooperative tapping: Time control under different feedback conditions. Percept Psychophys. 1992;52(6):691–704. doi: 10.3758/bf03211706 [DOI] [PubMed] [Google Scholar]
  • 65.Gehrke J. Sensorimotor synchronization: The intensity of afferent feedback affects the timing of movements. Max Planck Inst Psychol Res Afferente Informationsverarbeitung Synchronisation Von. 1995; Paper 15.
  • 66.Archibald L, De Nil LF. The relationship between stuttering severity and kinesthetic acuity for jaw movements in adults who stutter. J Fluen Disord. 1999;24(1):25–42. [Google Scholar]
  • 67.Loucks TM, De Nil LF. Oral kinesthetic deficit in adults who stutter: a target-accuracy study. J Mot Behav. 2006;38(3):238–47. doi: 10.3200/JMBR.38.3.238-247 [DOI] [PubMed] [Google Scholar]
  • 68.Chen JL, Penhune VB, Zatorre RJ. Listening to musical rhythms recruits motor regions of the brain. Cereb Cortex. 2008;18(12):2844–54. doi: 10.1093/cercor/bhn042 [DOI] [PubMed] [Google Scholar]
  • 69.Grahn JA, Brett M. Rhythm and beat perception in motor areas of the brain. J Cogn Neurosci. 2007;19(5):893–906. doi: 10.1162/jocn.2007.19.5.893 [DOI] [PubMed] [Google Scholar]
  • 70.Chemin B, Mouraux A, Nozaradan S. Body movement selectively shapes the neural representation of musical rhythms. Psychol Sci. 2014;25(12):2147–59. doi: 10.1177/0956797614551161 [DOI] [PubMed] [Google Scholar]
  • 71.Kliger Amrani A, Zion Golumbic E. Memory-Paced Tapping to Auditory Rhythms: Effects of Rate, Speech, and Motor Engagement. J Speech Lang Hear Res. 2022;65(3):923–39. doi: 10.1044/2021_JSLHR-21-00406 [DOI] [PubMed] [Google Scholar]
  • 72.Riley G, Bakker K. SSI-4: Stuttering severity instrument. PRO-ED, an International Publisher; 2009.
  • 73.Woodrow H. A quantitative study of rhythm: The effect of variations in intensity, rate and duration. Science Press; 1909. [Google Scholar]
  • 74.Fraisse P. Rhythm and tempo. Psychol Music. 1982;1:149–80. [Google Scholar]
  • 75.Drake C, Botte MC. Tempo sensitivity in auditory sequences: Evidence for a multiple-look model. Percept Psychophys. 1993;54(3):277–86. doi: 10.3758/bf03205262 [DOI] [PubMed] [Google Scholar]
  • 76.Boersma P. Praat: doing phonetics by computer. www.Praat.Org. 2006.
  • 77.Team RC. R: A language and environment for statistical computing. 2013.
  • 78.Cremers J, Klugkist I. One direction? A tutorial for circular data analysis using R with examples in cognitive psychology. Front Psychol. 2018;9:2040. doi: 10.3389/fpsyg.2018.02040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Reich A, Till J, Goldsmith H. Laryngeal and manual reaction times of stuttering and nonstuttering adults. J Speech Lang Hear Res. 1981;24(2):192–6. doi: 10.1044/jshr.2402.192 [DOI] [PubMed] [Google Scholar]
  • 80.Alm PA. The dual premotor model of cluttering and stuttering: A neurological framework. In: Proceedings of the First World Conference on Cluttering, Katarino, Bulgaria. Citeseer; 2007. p. 207–10.
  • 81.Brady JP. Studies on the metronome effect on stuttering. Behav Res Ther. 1969;7(2):197–204. doi: 10.1016/0005-7967(69)90033-3 [DOI] [PubMed] [Google Scholar]
  • 82.Glover H, Kalinowski J, Rastatter M, Stuart A. Effect of instruction to sing on stuttering frequency at normal and fast rates. Percept Mot Skills. 1996. Oct 1;83(2):511–22. doi: 10.2466/pms.1996.83.2.511 [DOI] [PubMed] [Google Scholar]
  • 83.Hutchinson JM, Navarre BM. The effect of metronome pacing on selected aerodynamic patterns of stuttered speech: Some preliminary observations and interpretations. J Fluen Disord. 1977;2(3):189–204. [Google Scholar]
  • 84.Falk S, Schreier R, Russo FA. Singing and stuttering. In: The Routledge Companion to Interdisciplinary Studies in Singing. Routledge; 2020. p. 50–60. [Google Scholar]
  • 85.Wing AM, Kristofferson AB. The timing of interresponse intervals. Percept Psychophys. 1973;13(3):455–60. [Google Scholar]
  • 86.Grahn JA, Watson SL. Perspectives on rhythm processing in motor regions of the brain. Music Ther Perspect. 2013;31(1):25–30. [Google Scholar]
  • 87.Wieland EA, McAuley JD, Dilley LC, Chang SE. Evidence for a rhythm perception deficit in children who stutter. Brain Lang. 2015;144:26–34. doi: 10.1016/j.bandl.2015.03.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Wohlschläger A, Koch R. Synchronization error: An error in time perception. In: Rhythm perception and production. Swets; 2000. p. 115–27. [Google Scholar]
  • 89.Fraisse P. Les synchronisations sensori-motrices aux rythmes [The sensorimotor synchronization of rhythms]. Anticip Comport. 1980;233–57.
  • 90.Howell P, El-Yaniv N, Powell DJ. Factors affecting fluency in stutterers when speaking under altered auditory feedback. In: Speech motor dynamics in stuttering. Springer; 1987. p. 361–9. [Google Scholar]
  • 91.Kalinowski J, Stuart A, Sark S, Armson J. Stuttering amelioration at various auditory feedback delays and speech rates. Int J Lang Commun Disord. 1996;31(3):259–69. doi: 10.3109/13682829609033157 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Jessica Adrienne Grahn

10 Feb 2022

PONE-D-21-37460Rhythmic tapping difficulties in adults who stutter: a deficit in Central Clock and/or Motor Implementation?PLOS ONE

Dear Dr. HUEBER,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. As you will see, both reviewers encountered several major issues with the framing, the methods, the data, and the discussion. I agree with their conclusions that the introduction does not accurately convey the theories used to motivate the study, therefore the rationale is not sound. The methods need significant improvements in clarity and consistent use of terminology, as well as correction of several errors. I also agree that it is statistically unsound to analyze the musicians as groups with insufficient sample sizes, and that reviewer 1's recommendation to remove these groupings entirely is necessary. The discussion also needs to be made more coherent. The reviewers are both experts in the field, and their more minor comments and suggestions also warrant careful attention.

Please submit your revised manuscript by Mar 27 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Jessica Adrienne Grahn

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please provide additional details regarding participant consent. In the ethics statement in the Methods and online submission information, please ensure that you have specified (1) whether consent was informed and (2) what type you obtained (for instance, written or verbal, and if verbal, how it was documented and witnessed). If your study included minors, state whether you obtained consent from parents or guardians. If the need for consent was waived by the ethics committee, please include this information.

If you are reporting a retrospective study of medical records or archived samples, please ensure that you have discussed whether all data were fully anonymized before you accessed them and/or whether the IRB or ethics committee waived the requirement for informed consent. If patients provided informed written consent to have data from their medical records used in research, please include this information.

3. Thank you for stating the following in the Acknowledgments Section of your manuscript:

“We thank our volunteer subjects and the Agence Nationale de la Recherche for support (Project StopNCo; ANR-14-CE30-0017; PI: Maëva Garnier).”

We note that you have provided additional information within the Acknowledgements Section that is not currently declared in your Funding Statement. Please note that funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

 “This study was funded by the Agence Nationale de la Recherche (https://anr.fr/fr/)(Project StopNCo; ANR-14-CE30-0017; PI : MG)

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.”

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

4. In your Data Availability statement, you have not specified where the minimal data set underlying the results described in your manuscript can be found. PLOS defines a study's minimal data set as the underlying data used to reach the conclusions drawn in the manuscript and any additional data required to replicate the reported study findings in their entirety. All PLOS journals require that the minimal data set be made fully available. For more information about our data policy, please see http://journals.plos.org/plosone/s/data-availability.

Upon re-submitting your revised manuscript, please upload your study’s minimal underlying data set as either Supporting Information files or to a stable, public repository and include the relevant URLs, DOIs, or accession numbers within your revised cover letter. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. Any potentially identifying patient information must be fully anonymized.

Important: If there are ethical or legal restrictions to sharing your data publicly, please explain these restrictions in detail. Please see our guidelines for more information on what we consider unacceptable restrictions to publicly sharing data: http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. Note that it is not acceptable for the authors to be the sole named individuals responsible for ensuring data access.

We will update your Data Availability statement to reflect the information you provide in your cover letter.

5. Please ensure that you refer to Figure 8 in your text as, if accepted, production will need this reference to link the reader to the figure.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: No

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

Reviewer #2: No

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors report a single study that compares timing performance of adults who stutter compared to controls for three tapping tasks that vary in rhythmic complexity. Also considered is the potential role of music training in mediating any differences between groups. A general assumption of the study is that there is larger tapping variability for adults who stutter compared to controls, with the goal of their study to identify / pinpoint the nature of the observed differences, using the Wing & Kristofferson (1973) model as one approach to decomposing tapping variability into separate clock and motor components. The main conclusion is that worse tapping performance of adults who stutter compared to controls is due to both increased central clock variability and increased motor variability.

Strengths

The topic of investigating potential timing deficits in stuttering warrants investigation, as there has been an increasing number of studies that have proposed timing deficits in stuttering, with mixed results. Studies that clarify the nature of timing differences between adults who stutter and controls are certainly needed.

Weaknesses

Although the topic of the study is interesting and warrants investigation, the reported study has a number of significant weaknesses that in my view preclude publication. It is further difficult for me to see how a revision would adequately address these concerns without effectively yielding a new manuscript. Here are the main issues.

First, the methodology for the study is poorly described and motivated. The descriptions of the tasks are very hard to unambiguously interpret and also are non-standard versions of the task(s) to assess timing performance that have been used with the W&K model. With respect to the latter, it is not clear why the authors have chosen to use very non-standard versions of synchronize-continue tapping tasks, which makes it difficult to assess the data in relation to previous work. For example, the standard synchronization-continuation task has individuals synchronize finger taps with an isochronous series of tones, which after a certain number of tones, cut out, and participants continue tapping at the same pace/rate until there is a cue to stop. The version described in the manuscript seems somewhat odd as there are two tones presented (one high and one low), with the high tone indicating that the person should tap and tones are organized in eight element groups. Participants were then asked to tap for a certain number of repetitions of the 8-element pattern, but based on the text this was for only the first seven of the eight low tones (??). The continuation task is then treated as a separate task – and seems as far as I can tell to be separate with participants listening during the synchronization phase and tapping during the continuation phase. In general, this alternative version of the sync-continue task is not well motivated and in general I would recommend using the standard version so that the data are more directly comparable to previous data.

There are further a number of confusing elements of the task and procedure descriptions. For example, in the description of the task order, the synchronization to complex rhythm task is listed twice and the continuation task is not listed at all. This seems like it has to be an error.

Second, I find the description of theory in the introduction to be overly loose/muddled. In this respect, in the consideration of the view that individuals who stutter have a temporal processing deficit, the authors equate the theoretical concept of a central clock and with the generation of an internal beat. This connection is problematic because what is meant by a central clock in the literature is a time interval measuring mechanism that is akin to ‘stop watch’ that measures the duration of each to-be-timed interval independently, with some variance (i.e., in the literature this is viewed as an interval-based timing mechanism) whereas internal-beat generation emphasizes the concept of a rhythm and a beat-based (entrainment) mechanism of timing. It is thus odd to be making connections between (correlated) estimates of central clock variance where each successive interval is estimated independently and synchronization measures where synchronization relies on stimulus-driven entrainment. This feels like comparing ‘apples’ with ‘oranges’.

Third, there are a number of questions about how the dependent variables are measured or the resulting data itself, which reduces confidence in interpreting the results and drawing clear conclusions. As one example, the formula for measuring asynchrony seems like it has an error. As far as I can tell, the authors are taking the asynchrony in msec and then multiplying by 360 and then dividing by 0.5. Dividing by 0.5 is multiplying by 2, so they are taking the raw asynchrony in msec and then multiplying by 720, which would not yield a value between -180 and +180 degrees as claimed. The text also describes “with a 360 modulo”, which does not make any sense to me. Here, I believe what the authors mean is that they took the raw time point of each tap (modulo 500 ms – the inter-beat-interval of the metronome), which if then divided by 500 ms would give a value between 0 and 1; they then would need to rescale between -0.5 and 0.5 and multiply by 360 to get a value between -180 and +180 (or an equivalent procedure).

Note here also that the phase values of -180 and +180 correspond to the same phase, which is relevant for the calculation of mean phase angle. If the authors are simply averaging the phases to calculate mean phase angle, then average -180 and +180 would give a value of zero, which is incorrect. In order to compute mean phase angle, it is necessary to use circular descriptive methods.

With respect to the application of the W&K model to decompose tapping variability into clock and motor components, it is not clear whether the authors linearly de-trended the data before applying the model (removing drift), which is standard for use of this model. The model assumes a stationary time series of produced intervals and the effect of consistent drift is to reduce the negative correlation of successive intervals (reducing the estimate of motor variance – and sometimes yielding negative estimates). Positive correlation of successive intervals is a violation of the model and notably occurred frequently for the PWS group (on 45% of the trials as far as I can tell!). That means that the authors threw out almost half of the PWS data for the W&K part of the study – and completely excluded 4 of the PWS participants (see p. 14).

A further comment on the use of the W&K model is that this model assumes that the mean clock interval is exactly equal to the target inter-tap-interval (in this case 500 ms). The purpose of the synchronization phase is simply to set the clock interval in listeners’ minds. With that said, I find it confusing for the authors to be discussing deviations in the mean produced inter-tap-interval (amount of drift) in terms of the W&K model.

With respect to music training, it is not clear why dance was included as part of the music training measure. Dance is not music training, but an individual with > 5 years of dance and consistent practice would be classified according to the authors’ procedure as having a high level of music training. ??

A final comment about dependent measures is that the authors indicated that they used a peak-picking algorithm in MATLAB to determine peaks, but provide very little additional detail about how this algorithm determines peaks, which can be tricky. Along these lines, did the authors conduct any low pass filtering of the data before picking peaks to remove noise? Details of the peak-picking procedure need to be spelled out so that the reader can better evaluate the method used.

Fourth, I have two general comments about the results that detract and significantly limit the contribution of the work. It is unclear (and problematic) to me why so much of the results and conclusions about the difference in timing performance between PWS and PNS, and the relation between decomposed measures of clock and motor variance and synchronization measures rely on breaking each group down into three musical training categories with very small sample sizes (n = 3) for the moderate and high levels of music training for both the PWS and PNS groups. It is very unlikely that based on such small samples sizes that any differences between levels of music training and any mediating effects are due to self-reported levels of music training, but rather due to a combination of other individual difference factors. For the analyses, at a minimum, it would better to include level of music training as a covariate rather than interpreted as an independent variable, which it is not, and not place so much emphasis on it in the write up and focus on the group comparisons and the main questions of interest. 2. Along these lines, I find it odd that although one on the main conclusions of the study is about the relation/correlation between estimates of central clock and motor variance during continuation tapping with the synchronization measures, none of the graphs show these correlations. Rather, Figures 6, 7, and 8, focus on group comparisons of PWS and PNS for the three levels of music training – which given the very small sample sizes for each level of music training are not very meaningful and unrelated to the main question of interest.

Finally, I do not find the level of precision in the writing up to be up to par for publication. The writing would need to be significantly improved and some sections rewritten to improve precision and clarity to a level that is publication quality.

Reviewer #2: The current study tested tapping abilities of 16 adults who stutter compared to a matched control group across five tapping tasks. The authors aimed to try and tease apart whether individuals who stutter have a deficit in a central clock mechanism or a motor execution deficit. The experiment is very interesting, and it’s great to have these measurements within this relatively large group of stutterers compared to matched controls. However, the presentation and analyses were particularly difficult to follow, and there were some questionable analyses made which make the results and conclusions difficult to interpret. Please see more detailed comments below.

Major Points:

Outline and naming of tasks

It was very difficult to follow the tasks and measures taken, largely because the naming conventions seemed to keep changing, and many different measures were taken and not outlined clearly. Perhaps one way to structure this more clearly would be to have all measures and all measurements in a table for an easy-to-understand overview? Or summarise somehow more clearly in a visual way? The listing of tasks as dot points (e.g., Pg. 8-9) and the listing of all extracted measures (pg. 11 – 14 with various levels depending on task etc) is very difficult to take in. There were numerous naming inconsistencies throughout, for example, line 222, it is unclear what is meant by “metronome and tapping instants”. Is this all tasks? In the discussion (ln 614): greater tapping variability during unpaced tapping is mentioned: is this referring to the synchronization continuation task? Continuation isn’t a pure unpaced tapping measure as they had a cue to begin. I was also wondering why there was no pure unpaced tapping task, as people who stutter have been shown to be aided by an external cue (i.e., as they had in the CONT task at the beginning). Inter-response interval could be more clearly labelled as inter-tap interval to fit with previous literature. These inconsistencies and presentation really need to be improved otherwise it’s very difficult to follow the results.

Musical training

One of the main concerns I had while reading these analyses was with the statistics related to musical training. From Table 1, there are only 3 participants in the “moderate” music training group and 3 participants in the “high” music training group in each stuttering vs. non-stuttering group. All the analyses including musical training are therefore reflecting very few participants, with a big group difference compared to those with “low” music training (n = 10 in each group). These different group sizes are also covered up by bar graphs, and it’s impossible to see the spread of data, and whether there are important outliers. Musical training is also confounded with stuttering severity, as for the “moderate” group, there was 1 very mild stutterer and 2 mild stutterers, and in the “high” group there was 1 very mild, 1 mild, and 1 severe stutterer. I suggest to remove all of these analyses involving musical training. This would also streamline and clarify the results and allow for a focus on the results of interest. Perhaps the authors could instead add some additional, i.e., supplementary material looking at correlations with years of musical training (rather than a categorical, arbitrary grouping measure) and some of the tapping measures, as this would give a more continuous measure. However, I don’t think this should be part of the main analysis or story based on the small sample size. Based on these concerns, many of the conclusions in the discussion are not justified.

Some other small comments about the musical training analyses:

- the measure of musical training is very course, and it is unclear what participants were asked. If musical training was an important aspect of the current study, a more sophisticated measure should have been used, such as the Goldsmiths musical sophistication index. Were there any participants who had more than 5 years of training but were not currently playing? This case does not seem to be captured by the current descriptions.

- Line 378 paragraph: when “musicians” and “non-musicians” are compared – is this group 0 vs. group 1 + 2? Please specify. Then in line 388-389 “highly trained” musicians are mentioned – is this just the one group (with 3 participants?)

Analyses

Were there convergence issues in your linear models? Adding musical training and severity as categorical fixed factors (with three levels each) into your model seems like it would have lots of problems, considering e.g., there are only 3 participants with moderate or high music training, and within each group, for those with high training, 1 is very mild, 1 is mild, and 1 is severe. It doesn’t seem like you have enough data to model these interactions, and I would assume that R will tell you this. The statistical analysis section (2.7) seems to suggest that you could combine all of these factors (musical training + severity) in one model, but I couldn’t find this in the results themselves.

Figures

Individual variation should be displayed in all graphs by including individual data points, and/or better representations of the spread of the data (e.g., box plots, but individual data points would be ideal). This would allow the reader to easily see the spread/variance of data, and also group size differences between bars.

Discussion

The numerous theories presented in the discussion also make for some tough reading, with no strong conclusions being made. It seems in the end that it’s unclear what the results show and how they could be reflected in the different models. Perhaps a clearer summary or more integration across theories is necessary here. The final conclusion that “the dual premotor model and the sensory accumulation model” are compatible with most of the observations didn’t come out easily from the discussion. Some reframing and streamlining seems necessary here.

Paragraph starting line. 658 – starts suggesting that there was support for a global deficit in motor skill. I therefore expected this paragraph to show this. However, the conclusion of the paragraph is that stuttering is NOT caused by differences in motor skill. Please make a topic sentence that is consistent with the evidence presented in the paragraph.

Minor Points:

Abstract: Authors mention that there are three finger-tapping synchronization tasks, but then they list 5. Figure 1 also lists 5. It would be useful throughout to be more consistent with the labelling of each task and order of presentation, to make it easier for the reader to process.

Pg. 3 line 62-63 – lower tapping variability compared to what?

Pg. 4, lines. 66-69 - Can you explain the Wing and Kristofferson method, or rephrase the sentence so the reader isn’t expecting an explanation? Is there a reason it can only be applied on unpaced tapping (ln. 75)?

Pg. 4, lines 83-84: please rephrase, as it currently reads as if the hypothesis itself would significantly contribute to variability than central clock variance.

Pg. 5, lines 91-92 – please fix up this sentence. PWS and what?

Pg. 5, lines 106-107 - Couldn’t central clock variance be related also to motor execution problems?

109 – could tapping force just measure confidence?

Data Cleaning: were any taps excluded from the analysis? E.g., while they were beginning the task? From section 2.6 line 215 it seems that all taps were included? Could this increase variability?

Pg. 15, were CV, CCV, MIV, IRI, Finger RT all in the same model? Aren’t there big collinearities between these measurements? And if there’s only 55% of the PWS group with MIV and CCV calculations, it’s missing a lot of data (The MIV and CCV estimations were considered in the analysis only in these cases, which represented 68% of the tapping trains (82% of the PNS group and 55% for the PWS group) and no single value could be calculated for 4 PWS participants.)

Line 318 – was there a reason not to use the Watson-Williams test here?

Line 325 – why is a generalized linear model suddenly used here? What distribution was used?

Lines 331-335 – then we have coefficients bc and SAM – it’s unclear what this adds to the analysis.

Results

Figure 2: Can you please show individual data in this graph (e.g., as small dots)? Please also mention how many participants are in each group. The phase angles would be better represented as a circular plot in my opinion, e.g., by using the library “circular” in R, or in Matlab using the CircStat toolbox.

Figure 1: from Ln 365-365 it seems that there were both strong and weak beats in SYNCSimp – can you include this information in the figure? Was there an emphasis placed on these beats? Otherwise how are they considered as strong?

The sheer number of acronyms in the results makes it almost impossible to follow at times.

Line 561: Please outline again what REAC means, or use consistent terminology so it’s clear which task is which.

Typos:

Pg. 5 line 91, observed should be observe

Line 95 – “these evidences of” should be “this evidence for”

Line 717: in this line of “though”

Line 760, has two commas.

Please fix up others throughout

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2023 Feb 3;18(2):e0276691. doi: 10.1371/journal.pone.0276691.r002

Author response to Decision Letter 0


21 Jul 2022

Reviewer #1:

The authors report a single study that compares timing performance of adults who stutter compared to controls for three tapping tasks that vary in rhythmic complexity. Also considered is the potential role of music training in mediating any differences between groups. A general assumption of the study is that there is larger tapping variability for adults who stutter compared to controls, with the goal of their study to identify / pinpoint the nature of the observed differences, using the Wing & Kristofferson (1973) model as one approach to decomposing tapping variability into separate clock and motor components. The main conclusion is that worse tapping performance of adults who stutter compared to controls is due to both increased central clock variability and increased motor variability.

Strengths

The topic of investigating potential timing deficits in stuttering warrants investigation, as there has been an increasing number of studies that have proposed timing deficits in stuttering, with mixed results. Studies that clarify the nature of timing differences between adults who stutter and controls are certainly needed.

Weaknesses

Although the topic of the study is interesting and warrants investigation, the reported study has a number of significant weaknesses that in my view preclude publication. It is further difficult for me to see how a revision would adequately address these concerns without effectively yielding a new manuscript. Here are the main issues.

• First, the methodology for the study is poorly described and motivated. The descriptions of the tasks are very hard to unambiguously interpret and also are non-standard versions of the task(s) to assess timing performance that have been used with the W&K model.

The section on task description (L250-299) has been written again to improve its clarity. The introduction now details further the cognitive subprocesses involved in paced and unpaced tapping, in order to better introduce the research questions and to justify the different rhythmic tasks used in this study. Figure 1 gives a visual summary of the different tasks and the new Table 1 recapitulates which parameter was considered in each task.

With respect to the latter, it is not clear why the authors have chosen to use very non-standard versions of synchronize-continue tapping tasks, which makes it difficult to assess the data in relation to previous work. For example, the standard synchronization-continuation task has individuals synchronize finger taps with an isochronous series of tones, which after a certain number of tones, cut out, and participants continue tapping at the same pace/rate until there is a cue to stop.

The continuation task is treated as a separate task – and seems as far as I can tell to be separate with participants listening during the synchronization phase and tapping during the continuation phase.

In general, this alternative version of the sync-continue task is not well motivated and in general I would recommend using the standard version so that the data are more directly comparable to previous data.

First of all, we modified the label of that task to avoid confusion and comparison with a standard synchronization-continuation task, as used in many previous studies. The condition is now referred to as ISO_REPRO, since it aimed at testing the ability to perceive and reproduce a periodic pattern on one’s own, at a specific tempo. The choice of starting by a passive listening phase, instead of a synchronization phase like in the classical synchronization-continuation paradigm, aimed at distinguishing 1- the ability to build an accurate representation of the beat from passive listening only, without motor engagement, and 2 - the possible “consolidation”, or improvement in accuracy of that representation after several seconds of actual tapping. Our initial hypothesis was that PWS might present a tapping accuracy comparable to that of PNS immediately after passive listening, but that a group difference might emerge only after several seconds of taps, due to a deficient ‘motor consolidation’ mechanism in PWS. The choice of this non-standard task is now better explained in the material & methods section (L 265-274). The underlying theoretical background and the research question, in relationship to neural oscillations, are now more clearly presented in introduction (L108-122+ L194-213). The results section has been re-organized address these questions more explicitly (see L436-492). In particular, additional analyses have been conducted to test the possible improvement in tapping accuracy and consistency after motor engagement, compared to after passive listening only (L451-457 and L489-492).

The version [of the synchronization task] described in the manuscript seems somewhat odd as there are two tones presented (one high and one low), with the high tone indicating that the person should tap and tones are organized in eight element groups. Participants were then asked to tap for a certain number of repetitions of the 8-element pattern, but based on the text this was for only the first seven of the eight low tones (??).

The label of that task has also been modified to “1:1_ISO_SYNC” and its description has also been improved (L250-264)

The choice of a metered isochronous pattern (instead of a simple metronome) was motivated by

- Controlling for the metrical organization that would naturally arise when listening to an isochronous sequence (1–3), but that could vary from one listener to another.

- Comparing a simple synchronization task with a more complex one, in which an external stimulus is played every 4 taps only, which clearly induces a perception of quadruple meter. It was therefore necessary to induce the same metrical organization in the first simple task, in which an external stimulus was played on every beat.

- Exploring whether PWS show different abilities with meter compared with PNS, which might indicate a more global and lower level deficit in beat perception.

1. Woodrow H. A quantitative study of rhythm: The effect of variations in intensity, rate and duration. Science Press; 1909.

2. Fraisse P. Rhythm and tempo. Psychol Music. 1982;1:149–80.

3. Drake C, Botte MC. Tempo sensitivity in auditory sequences: Evidence for a multiple-look model. Percept Psychophys. 1993;54(3):277–86.

There are further a number of confusing elements of the task and procedure descriptions. For example, in the description of the task order, the synchronization to complex rhythm task is listed twice and the continuation task is not listed at all. This seems like it has to be an error.

The description of the tasks has been improved and clarified (L250-299), and accompanied by the recapitulative Figure1. The label of the tasks has been changed in order to avoid the confusion between the complexity related to non-isochrony (in the condition now labeled “NONISO_SYNC”) and the complexity related to hearing an external auditory stimulus every 4 taps only (in the condition now labeled “1:4_ISO_SYNC”).

• Second, I find the description of theory in the introduction to be overly loose/muddled. In this respect, in the consideration of the view that individuals who stutter have a temporal processing deficit, the authors equate the theoretical concept of a central clock and with the generation of an internal beat.

This connection is problematic because what is meant by a central clock in the literature is a time interval measuring mechanism that is akin to ‘stop watch’ that measures the duration of each to-be-timed interval independently, with some variance (i.e., in the literature this is viewed as an interval-based timing mechanism) whereas internal-beat generation emphasizes the concept of a rhythm and a beat-based (entrainment) mechanism of timing.

It is thus odd to be making connections between (correlated) estimates of central clock variance where each successive interval is estimated independently and synchronization measures where synchronization relies on stimulus-driven entrainment. This feels like comparing ‘apples’ with ‘oranges’.

Thank you for your relevant comment. The introduction has been completely re-written and complemented, following your recommendations. It now details the different subprocesses involved in paced and unpaced tapping, better defines the notions of “beat” and “meter” perception, and the theory of neural oscillators coupling (L108-122).

The term “central clock” is now avoided, and the correlation between CCV (Central clock variance) and PLV (synchronization consistency) is no longer considered in the new version of the article.

• Third, there are a number of questions about how the dependent variables are measured or the resulting data itself, which reduces confidence in interpreting the results and drawing clear conclusions. As one example, the formula for measuring asynchrony seems like it has an error. As far as I can tell, the authors are taking the asynchrony in msec and then multiplying by 360 and then dividing by 0.5. Dividing by 0.5 is multiplying by 2, so they are taking the raw asynchrony in msec and then multiplying by 720, which would not yield a value between -180 and +180 degrees as claimed. The text also describes “with a 360 modulo”, which does not make any sense to me. Here, I believe what the authors mean is that they took the raw time point of each tap (modulo 500 ms – the inter-beat-interval of the metronome), which if then divided by 500 ms would give a value between 0 and 1; they then would need to rescale between -0.5 and 0.5 and multiply by 360 to get a value between -180 and +180 (or an equivalent procedure). Note here also that the phase values of -180 and +180 correspond to the same phase, which is relevant for the calculation of mean phase angle. If the authors are simply averaging the phases to calculate mean phase angle, then average -180 and +180 would give a value of zero, which is incorrect. In order to compute mean phase angle, it is necessary to use circular descriptive methods.

Our measure of tapping asynchrony, as a phase angle is similar to previous studies (Sares et al. 2019; Falk et al. 2015). The value “0.5” corresponds to the Inter-stimulus Onset Interval (IOI, in seconds). So, dividing the asynchrony (between the stimulus onset and the tapping instant, also in seconds) by the IOI and multiplying it by 360 (degrees) gives a phase angle (in degrees).

We clarified the definition of that descriptor as follows (L 341-355):

“Phase Angle (PA, in degrees) was measured in the conditions 1:1_ISO_SYNC, 1:4_ISO_SYNC and NONISO_SYNC, as the angular conversion of Tapping Asynchrony, i.e. the time difference (ms) between a tap and the closest metronome pulse, relatively to the Inter-stimulus onset interval of 500ms (IOI) (see Eq. 1). By definition, Tapping Asynchrony values were always between -250ms and +250ms, so that PA values ranged from -180° (completely desynchronized in advance to the auditory stimulus) to +180° (completely desynchronized following the auditory stimulus), passing through 0° (perfectly synchronized with the auditory stimulus).“

Bayesian circular mixed models were already used in the previous version of the paper (R package bpnreg). Following your advice, PA results are now also presented with circular plots in the revised version of the manuscript.

• With respect to the application of the W&K model to decompose tapping variability into clock and motor components, it is not clear whether the authors linearly de-trended the data before applying the model (removing drift), which is standard for use of this model. The model assumes a stationary time series of produced intervals and the effect of consistent drift is to reduce the negative correlation of successive intervals (reducing the estimate of motor variance – and sometimes yielding negative estimates). Positive correlation of successive intervals is a violation of the model and notably occurred frequently for the PWS group (on 45% of the trials as far as I can tell!). That means that the authors threw out almost half of the PWS data for the W&K part of the study – and completely excluded 4 of the PWS participants (see p. 14). A further comment on the use of the W&K model is that this model assumes that the mean clock interval is exactly equal to the target inter-tap-interval (in this case 500 ms). The purpose of the synchronization phase is simply to set the clock interval in listeners’ minds. With that said, I find it confusing for the authors to be discussing deviations in the mean produced inter-tap-interval (amount of drift) in terms of the W&K model.

No significant acceleration or deceleration was observed over the 24 first taps of the condition ISO_REPRO in any of the participants (see section B.2.1 of the supplementary material, L40-49).

Nevertheless, we agree that it was problematic that the W&K decomposition was only valid for half of the tapping trains in the PWS group. We therefore decided to remove the analysis based on the W&K model and variance decomposition in the revised version of the article.

• With respect to music training, it is not clear why dance was included as part of the music training measure. Dance is not music training, but an individual with > 5 years of dance and consistent practice would be classified according to the authors’ procedure as having a high level of music training?

We considered that both the practice of dance and music can influence synchronization abilities, since they consist of producing gestures in synchrony with external rhythms. It has been proven that amongst musicians, drummers and professional pianists show a particularly high synchronization accuracy and consistency, better than singers or non-musicians (Krause, Pollok, and Schnitzler, 2010) (4). Other studies also showed greater synchronization abilities in skilled dancers, compared to non-dancers (Miura, Kudo, Ohtsuki, and Kanehisa, 2011) (5).

In any case, the 2 dancers in our study also had a corresponding level of musical training. So, to avoid confusion or discussion, we removed any mention to dancing in the assessment of Musical Training (see section A of the supplementary material).

Furthermore, Musical Training was no longer considered as an explicative factor in the revised version of the article. It was only used as an individual factor, like gender and age, to match participants between the PWS and PNS groups.

4. Krause V, Pollok B, Schnitzler A. Perception in action: the impact of sensory information on sensorimotor synchronization in musicians and non-musicians. Acta Psychol (Amst). 2010;133(1):28–37.

5. Miura A, Kudo K, Ohtsuki T, Kanehisa H. Coordination modes in sensorimotor synchronization of whole-body movement: a study of street dancers and non-dancers. Hum Mov Sci. 2011;30(6):1260–71.

• A final comment about dependent measures is that the authors indicated that they used a peak-picking algorithm in MATLAB to determine peaks, but provide very little additional detail about how this algorithm determines peaks, which can be tricky. Along these lines, did the authors conduct any low pass filtering of the data before picking peaks to remove noise? Details of the peak-picking procedure need to be spelled out so that the reader can better evaluate the method used.

More details on peak detection are now given on L328-334:

“First, the force signal was low-pass filtered (Chebyshev filter, cutoff frequency of 100 Hz, using the function filtfilt in Matlab (R2018b) to extract its envelop, and normalized, based on its maximum value observed in each executed tapping task. For each tap, the first sharp peak of the force signal, corresponding to the tapping instant, was detected automatically (using the Matlab function “findpeaks”, with a minimum interpeak distance of 200ms and a 20% threshold for peak height). These tapping instants were saved in PRAAT (73) annotation files, and were all manually verified and corrected.”

• Fourth, I have two general comments about the results that detract and significantly limit the contribution of the work. It is unclear (and problematic) to me why so much of the results and conclusions about the difference in timing performance between PWS and PNS, and the relation between decomposed measures of clock and motor variance and synchronization measures rely on breaking each group down into three musical training categories with very small sample sizes (n = 3) for the moderate and high levels of music training for both the PWS and PNS groups.

It is very unlikely that based on such small samples sizes that any differences between levels of music training and any mediating effects are due to self-reported levels of music training, but rather due to a combination of other individual difference factors. For the analyses, at a minimum, it would better to include level of music training as a covariate rather than interpreted as an independent variable, which it is not, and not place so much emphasis on it in the write up and focus on the group comparisons and the main questions of interest.

Following your advice, as well as those of the second reviewer, we no longer consider Musical Training as an explicative factor in the revised version of the article. It is now only used as an individual factor, like gender and age, to match participants between the PWS and PNS groups. The figures, the results section and the discussion have also been modified accordingly and do not mention that factor any longer.

• Along these lines, I find it odd that although one of the main conclusions of the study is about the relation/correlation between estimates of central clock and motor variance during continuation tapping with the synchronization measures, none of the graphs show these correlations. Rather, Figures 6, 7, and 8, focus on group comparisons of PWS and PNS for the three levels of music training – which given the very small sample sizes for each level of music training are not very meaningful and unrelated to the main question of interest.

As indicated above, we removed any mention to musical training.

All the correlations tested in the article are now supported by graphs and presented in section B of the supplementary material.

• Finally, I do not find the level of precision in the writing be up to par for publication. The writing would need to be significantly improved and some sections rewritten to improve precision and clarity to a level that is publication quality.

Many sections of the manuscript have been rewritten.

Furthermore, we had the manuscript proof-read by a native English speaker.

We therefore hope that it now reaches the expected level of precision and clarity.

Reviewer #2:

The current study tested tapping abilities of 16 adults who stutter compared to a matched control group across five tapping tasks. The authors aimed to try and tease apart whether individuals who stutter have a deficit in a central clock mechanism or a motor execution deficit. The experiment is very interesting, and it’s great to have these measurements within this relatively large group of stutterers compared to matched controls. However, the presentation and analyses were particularly difficult to follow, and there were some questionable analyses made which make the results and conclusions difficult to interpret. Please see more detailed comments below.

Major Points:

• Outline and naming of tasks

It was very difficult to follow the tasks and measures taken, largely because the naming conventions seemed to keep changing, and many different measures were taken and not outlined clearly. Perhaps one way to structure this more clearly would be to have all measures and all measurements in a table for an easy-to-understand overview? Or summarise somehow more clearly in a visual way? The listing of tasks as dot points (e.g., Pg. 8-9) and the listing of all extracted measures (pg. 11 – 14 with various levels depending on task etc) is very difficult to take in.

The section Material & Methods has been rewritten to improve its clarity. The tasks have been renamed and are summarized in the Figure 1. The extracted measures are now summarized in the Table 1.

There were numerous naming inconsistencies throughout, for example, line 222, it is unclear what is meant by “metronome and tapping instants”. Is this all tasks?

It is now clarified (L330-333) that tapping instants were detected from the first sharp peak of the force signal, recorded with the force pressure sensor. It is now also clarified (L257-260) that external auditory stimuli, or triggers, were made of both a metronome click marking the pulse, and of audio beeps marking the rhythmic pattern to reproduce. The composition of these external auditory stimuli in each rhythmic condition is summarized in Figure 1.

In the discussion (ln 614): greater tapping variability during unpaced tapping is mentioned: is this referring to the synchronization continuation task?

The discussion has been fully re-written. Greater tapping variability in PWS has been found in our study as well as most of the previous studies, for both synchronization tasks and tasks in which participants tapped on their own, without any external auditory reference (continuation tasks, reproduction tasks like ISO_REPRO here, or self-paced tasks).

Continuation isn’t a pure unpaced tapping measure as they had a cue to begin. I was also wondering why there was no pure unpaced tapping task, as people who stutter have been shown to be aided by an external cue (i.e., as they had in the CONT task at the beginning).

True. Following your comment, we now avoid the term “unpaced” throughout the article and talk, instead, of tapping without the help of external auditory reference.

Inter-response interval could be more clearly labelled as inter-tap interval to fit with previous literature. These inconsistencies and presentation really need to be improved otherwise it’s very difficult to follow the results.

Done. IRI was changed to ITI.

• Musical training

One of the main concerns I had while reading these analyses was with the statistics related to musical training. From Table 1, there are only 3 participants in the “moderate” music training group and 3 participants in the “high” music training group in each stuttering vs. non-stuttering group. All the analyses including musical training are therefore reflecting very few participants, with a big group difference compared to those with “low” music training (n = 10 in each group). These different group sizes are also covered up by bar graphs, and it’s impossible to see the spread of data, and whether there are important outliers. Musical training is also confounded with stuttering severity, as for the “moderate” group, there was 1 very mild stutterer and 2 mild stutterers, and in the “high” group there was 1 very mild, 1 mild, and 1 severe stutterer. I suggest to remove all of these analyses involving musical training. This would also streamline and clarify the results and allow for a focus on the results of interest. Perhaps the authors could instead add some additional, i.e., supplementary material looking at correlations with years of musical training (rather than a categorical, arbitrary grouping measure) and some of the tapping measures, as this would give a more continuous measure. However, I don’t think this should be part of the main analysis or story based on the small sample size. Based on these concerns, many of the conclusions in the discussion are not justified.

Some other small comments about the musical training analyses:

- the measure of musical training is very course, and it is unclear what participants were asked. If musical training was an important aspect of the current study, a more sophisticated measure should have been used, such as the Goldsmiths musical sophistication index. Were there any participants who had more than 5 years of training but were not currently playing? This case does not seem to be captured by the current descriptions.

- Line 378 paragraph: when “musicians” and “non-musicians” are compared – is this group 0 vs. group 1 + 2? Please specify. Then in line 388-389 “highly trained” musicians are mentioned – is this just the one group (with 3 participants?)

Following your advice, as well as those of the first reviewer, we do not consider any longer Musical Training as an independent variable of our experimental design. It is now only used as an individual factor, like gender and age, to match participants between the PWS and PNS groups. The figures, the results section and the discussion have also been modified accordingly and do not mention that factor any longer.

• Analyses

Were there convergence issues in your linear models? Adding musical training and severity as categorical fixed factors (with three levels each) into your model seems like it would have lots of problems, considering e.g., there are only 3 participants with moderate or high music training, and within each group, for those with high training, 1 is very mild, 1 is mild, and 1 is severe. It doesn’t seem like you have enough data to model these interactions, and I would assume that R will tell you this. The statistical analysis section (2.7) seems to suggest that you could combine all of these factors (musical training + severity) in one model, but I couldn’t find this in the results themselves.

The different mixed models considered are now clearly presented at the beginning of each paragraph in the Results section.

Musical Training is no longer considered as an independent variable.

Correlations with stuttering severity (though the SSI score) are tested in the PWS group only, and independently from the effect of other factors, such as Group, Task or Beat strength.

• Figures

Individual variation should be displayed in all graphs by including individual data points, and/or better representations of the spread of the data (e.g., box plots, but individual data points would be ideal). This would allow the reader to easily see the spread/variance of data, and also group size differences between bars.

Following your advice, the figures now represent the average value observed for each participant (N=16 in each group), instead of standard deviations with errorbars.

• Discussion

The numerous theories presented in the discussion also make for some tough reading, with no strong conclusions being made. It seems in the end that it’s unclear what the results show and how they could be reflected in the different models. Perhaps a clearer summary or more integration across theories is necessary here. The final conclusion that “the dual premotor model and the sensory accumulation model” are compatible with most of the observations didn’t come out easily from the discussion. Some reframing and streamlining seems necessary here.

The theoretical framework of the whole article has been improved and complemented. Instead of considering two hypotheses: that the reduced tapping accuracy and consistency of PWS may be related to a timing deficit vs. a motor deficit, we now consider different subprocesses that may be involved in sensorimotor synchronization and that may possibly be impaired in PWS (1- motor execution, 2- beat perception and reproduction, 3- sensorimotor integration and learning). The whole article is now organized around the examination of these abilities. This also enables clearer conclusions on the cognitive levels that appear unimpaired, and those at which a significant difference is observed between PWS and PNS. Thus, the results from our study, point towards (1) a deficit in neural oscillators coupling in production, but not in perception, of rhythmic patterns in PWS, and (2) a larger delay in multi-modal feedback processing for PWS.

• Paragraph starting line. 658 – starts suggesting that there was support for a global deficit in motor skill. I therefore expected this paragraph to show this. However, the conclusion of the paragraph is that stuttering is NOT caused by differences in motor skill. Please make a topic sentence that is consistent with the evidence presented in the paragraph.

The discussion section has been fully rewritten. It now makes a clearer distinction between 1- motor difficulties in movement initiation, 2- other possible sources of increased motor variability, and 3- possible deficits in sensorimotor control and learning.

In that framework, our results support the idea that PWS present (1) a deficit in neural oscillators coupling in production, but not in perception, of rhythmic patterns and (2) a larger delay in multi-modal feedback processing

• Minor Points:

Abstract: Authors mention that there are three finger-tapping synchronization tasks, but then they list 5. Figure 1 also lists 5. It would be useful throughout to be more consistent with the labelling of each task and order of presentation, to make it easier for the reader to process.

There are indeed three synchronization tasks:

- a simple synchronization task with a 1:1 quadruple metered isochronous pattern (1:1_ISO_SYNC)

- a synchronization task with a quadruple metered isochronous pattern, where only the strong beats (every four taps) are marked by an external auditory stimulus (1:4_ISO_SYNC)

- a synchronization task with a non-isochronous pattern (NONISO_SYNC)

and two other tapping tasks, where participants do not synchronize with auditory stimuli:

- a “reaction” task, in which participants follow as quickly as possible an unpredictable pattern (REACT)

- a reproduction task, in which participants reproduce on their own a quadruple metered isochronous pattern, just after listening to it passively (ISO_REPRO)

Pg. 4, lines. 66-69 - Can you explain the Wing and Kristofferson method, or rephrase the sentence so the reader isn’t expecting an explanation? Is there a reason it can only be applied on unpaced tapping (ln. 75)?

Since the W&K decomposition could not be applied to a substantial part of our data, this analysis was removed from the article and no mention is made to it any longer.

Pg. 5, lines 106-107 - Couldn’t central clock variance be related also to motor execution problems?

The whole idea of the W&K decomposition was to disentangle, from the total tapping variance, what can be attributed to Central Clock Variance (CCV) and what can come from Motor Implementation Variance (MIV). We decided to approach the data differently and remove the analysis from the article, as well as from the discussion of its results.

Pg. 3 line 62-63 – lower tapping variability compared to what?

Pg. 4, lines 83-84: please rephrase, as it currently reads as if the hypothesis itself would significantly contribute to variability than central clock variance.

Pg. 5, lines 91-92 – please fix up this sentence. PWS and what?

The discussion has been fully rewritten.

109 – could tapping force just measure confidence?

Yes, this is possible, and could be used in future studies to evaluate indirectly the difficulty of a task, for example. In this study, it was measured seaking 3 objectives:

1- to test whether participants, and in particular PWS, were able to perceive and reproduce the quadruple meter of the proposed patterns

2- to test the sensory accumulation theory, i.e. whether the degree of NMA correlates with the tapping force

3- to have a descriptor of the amplitude variability of finger movements, and not only of their timing variability

Data Cleaning: were any taps excluded from the analysis? E.g., while they were beginning the task? From section 2.6 line 215 it seems that all taps were included? Could this increase variability?

Following your comment, the revised version of the article now distinguishes the very first taps in a task (first 8-beat cycle) and the “stabilized” phase (2nd and 3rd 8-beat cycles).

Instead of simply excluding the very first taps, we found it relevant, in the two conditions 1:1_ISO_SYNC and ISO_REPRO, to explore the variation in tapping accuracy and consistency over time, between the very first taps and the stabilized phase, and to discuss these variations in terms of motor engagement and sensorimotor learning.

Pg. 15, were CV, CCV, MIV, IRI, Finger RT all in the same model? Aren’t there big collinearities between these measurements? And if there’s only 55% of the PWS group with MIV and CCV calculations, it’s missing a lot of data (The MIV and CCV estimations were considered in the analysis only in these cases, which represented 68% of the tapping trains (82% of the PNS group and 55% for the PWS group) and no single value could be calculated for 4 PWS participants.)

We agree that this was problematic that the W&K decomposition could apply to only half of the tapping trains in the PWS group. We therefore decided to remove any mention of the W&K model and variance decomposition in the revised version of the article.

The article still presents variations in RT, RT_Var = std(RT), mean(ITI), PE = abs(0.5 s – mean(ITI)), and CV = std(ITI)/mean(ITI).

No significant correlation was observed between CV and RT_Var (see section B.2.2 of the supplementary material).

No significant correlation was also observed between PE and RT (see section B.2.3 of the supplementary material).

The variation of each parameter was explored independently, with different mixed models, as clearly indicated at the beginning of each paragraph in the Results section.

Line 318 – was there a reason not to use the Watson-Williams test here?

A Watson-Williams test unfortunately did not enable to consider the factor “Participant” underlying the repeated measures of a same individual. Applying a Watson-Williams test over the whole set of “aggregated” data would not be correct, since it would artificially inflate the cohort size (as if we had 32*N repetitions of participants, instead of 32 participants). On the other hand, applying a Watson-Williams test over the 32 average value of each participant would be correct, but would considerably reduce the statistical power of the analysis. Instead, using mixed models (and here Bayesian circular mixed models) is the most recommended method to deal with repeated (angular) data, since it enables to consider the whole dataset, and to account at the same time for the factor “Participant” underlying the intra-individual variation of the dependent variable.

Line 325 – why is a generalized linear model suddenly used here? What distribution was used?

A generalized linear model was used, when testing the correlation between two variables that were measured in different conditions, or when testing the correlation between a variable and the SSI score, since repeated measures could not be considered in that case. Only average values measured for each participant in a condition could be compared, which therefore did not require to use mixed models and to consider a random factor on the participant.

In the revised version of the article, correlations (that are presented in section B of the supplementary material) are simply tested with a Pearson’s correlation test for linear data, and with an angular-linear correlation test (with the toolbox Circstats) for PA.

Lines 331-335 – then we have coefficients bc and SAM – it’s unclear what this adds to the analysis.

In the first version of the manuscript, the reporting of these 3 parameters followed the recommendations of Cremers & Klugkist (2018) for the use of the bpnreg package and the reporting of correlations, based on Bayesian mixed models.

As indicated above, the correlation of PA with other linear variables is now simply tested with an angular-linear correlation test (with the toolbox Circstats) in the revised version of the article.

Results

Figure 2: Can you please show individual data in this graph (e.g., as small dots)? Please also mention how many participants are in each group.

Done. The size of each group (N=16) is also given in the legend.

The phase angles would be better represented as a circular plot in my opinion, e.g., by using the library “circular” in R, or in Matlab using the CircStat toolbox.

PA variations are now represented with circular plots, following your recommendation.

Figure 1: from Ln 365-365 it seems that there were both strong and weak beats in SYNCSimp – can you include this information in the figure? Was there an emphasis placed on these beats? Otherwise how are they considered as strong?

The description of that task, now labelled 1:1_ISO_SYNC, has been clarified L252-260, and clearly indicates that:

“Since a metrical organization of beats (into groups of 2, 3 or 4) arises naturally and automatically when listening to an isochronous sequence of identical tones (1-3), we controlled for that perceptual grouping and induced the perception of quadruple meter, i.e. with a “strong” or accentuated beat sensed every four pulses, the other beats sensed as “weak” or unaccentuated). To achieve this, auditory stimuli were organized into 8-beat cycles, with a metronome click marking the pulse on each beat, and an additional audio beep (Pitch: 1100 Hz; 20 ms) played simultaneously on the first seven beats only (without variations in pitch, loudness, or duration)

No emphasis was placed on the 1st and 5th beat of the stimuli, which we reckon, is disputable and could be improved in a next study. However, we verified that participants perceived that quadruple metrical organization, since they tapped with greater force on these 1st and 5th beats (see section 3.3.2 and Figure 7).

The sheer number of acronyms in the results makes it almost impossible to follow at times.

We improved the notation consistency throughout the article, and gave a summary of the task and variable names in the Figure 1 and the Table 1. We hope that it is now easier to follow.

Line 561: Please outline again what REAC means, or use consistent terminology so it’s clear which task is which.

REACT stands for the “reaction” task. The description of the tasks has been improved and summarized in Figure 1.

Typos:

Pg. 5 line 91, observed should be observe

Line 95 – “these evidences of” should be “this evidence for”

Line 717: in this line of “though”

Line 760, has two commas.

Please fix up others throughout

These typos have been fixed or corresponded to paragraphs that have been fully re-written.

Attachment

Submitted filename: Rebuttal-PlosOne-150622.pdf

Decision Letter 1

Jessica Adrienne Grahn

12 Sep 2022

PONE-D-21-37460R1Rhythmic tapping difficulties in adults who stutter: a deficit in beat perception, motor execution, or sensorimotor integration?PLOS ONE

Dear Dr. HUEBER,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. You will see that generally the reviewer finds the manuscript easier to follow and has just a few remaining suggestions that I agree might be helpful.

Please submit your revised manuscript by Oct 27 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Jessica Adrienne Grahn

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: No

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: The current manuscript revision is greatly improved and easier to follow, and the discussion in particular is a lot clearer. It was also nice that the results were condensed to the most interesting, but also that the full analyses were reported in supplementary material. The result structure was easier to follow since it has been condensed, but it was still a bit tricky to track which task and which dependent variable was being analysed. It helped to have the models written out. Perhaps the authors could consider having a more systematic labelling system of headings. At the moment, there are changes in structure, e.g., levels of headings, bolding etc, which make it difficult to clearly see the patterns. Some of the writing is sometimes a bit unclear as well and could be edited further. But overall, the paper is getting into good shape, and the authors have done a good job condensing a lot of dependent variables into a digestible manuscript.

Here are some minor comments to improve clarity:

Pg. 4, lines 64-65: “some other studies” are mentioned, but only one is cited.

Pg. 7: “the observation that steady state-evoked potentials appear in the delta frequency range in subjects who were passively listening to a rhythmic sequence at 2.4Hz provides strong support to this hypothesis”. The existence of steady-state evoked potentials could actually be based on populations of neurons firing in synchrony, not necessarily that they are reflecting the entrainment on endogenous oscillations. There is a big debate about this in the field, so it’s important to clarify this point. E.g., see Zoefel, ten Oever & Sack, 2019: Neural oscillations in the processing of rhythmic input: More than a regular repetition of evoked neural responses. Frontiers in Neuroscience.

Pg. 10, line 185: the greater negative mean asynchrony can be explained by a weaker tapping force – please explain why. The logic behind this is unclear for the moment.

Table 1: musical training is listed as 0, 1, 2 for PWS and as no/yes for PNS – these should be the same scale.

Figure 1: For 1:4_ISO_SYNCH – were the 1st and 4th beats in the example stimuli accented? If so, this should be made clear in the diagram with e.g., an accent marker.

Pg. 16: Only the 9th and 24th taps were considered for analysis. Why? Please motivate the reason for this in the text.

Table 2 is helpful for following the measures taken across the different tasks.

Figure 3 caption seems to be switched around, with 3a as periodicity in the caption, but coefficient of variation in the figure.

Pg. 27 – why was the Bayesian model Group + Time and no interaction?

Pg. 27, line 512: “however, no significant difference…” – I don’t think you need a “however” here. You could just say “there was no significant difference…”.

Pg. 3, line 645 – after explaining all the results across the various measures, it might be nice in this first paragraph to more clearly link the theoretical side with the tasks measured. E.g., 1- in the execution of movements (as measured in xx task/s).

Pg. 36, Lines 739-743: this sentence is very long and refers to already presented information. This could be written more concisely to avoid having to refer to information “already stated above”.

Pg. 40, line 812, do you mean central nervous system?

Pg. 41, line 835: do you mean *decreased* PLV in PWS?

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2023 Feb 3;18(2):e0276691. doi: 10.1371/journal.pone.0276691.r004

Author response to Decision Letter 1


28 Sep 2022

Reviewer #2: The current manuscript revision is greatly improved and easier to follow, and the discussion in particular is a lot clearer. It was also nice that the results were condensed to the most interesting, but also that the full analyses were reported in supplementary material. The result structure was easier to follow since it has been condensed, but it was still a bit tricky to track which task and which dependent variable was being analysed. It helped to have the models written out.

Perhaps the authors could consider having a more systematic labelling system of headings. At the moment, there are changes in structure, e.g., levels of headings, bolding etc, which make it difficult to clearly see the patterns.

We worked on this, so that there are now 3 clear levels of headings. A 4 fourth level of organization is sometimes marked by bullet points.

Some of the writing is sometimes a bit unclear as well and could be edited further.

The entire manuscript has been edited once again by a native speaker of English.

But overall, the paper is getting into good shape, and the authors have done a good job condensing a lot of dependent variables into a digestible manuscript.

Here are some minor comments to improve clarity:

Pg. 4, lines 64-65: “some other studies” are mentioned, but only one is cited.

This was replaced by ‘another study’

Pg. 7: “the observation that steady state-evoked potentials appear in the delta frequency range in subjects who were passively listening to a rhythmic sequence at 2.4Hz provides strong support to this hypothesis”. The existence of steady-state evoked potentials could actually be based on populations of neurons firing in synchrony, not necessarily that they are reflecting the entrainment on endogenous oscillations. There is a big debate about this in the field, so it’s important to clarify this point. E.g., see Zoefel, ten Oever & Sack, 2019: Neural oscillations in the processing of rhythmic input: More than a regular repetition of evoked neural responses. Frontiers in Neuroscience.

We agree with this comment and this paragraph on L111-115 was re-worded as;

“Although there is still ongoing debate on this endogenous oscillator entrainment hypothesis (49,50), the observation that steady state-evoked potentials appear in the delta frequency range [0.5 – 4 Hz] in subjects who were passively listening to a rhythmic sequence at 2.4Hz, provides support for this hypothesis (43,44,51)”, with

[49]. Doelling KB, Assaneo MF, Bevilacqua D, Pesaran B, Poeppel D. An oscillator model better predicts cortical entrainment to music. Proc Natl Acad Sci. 2019;116(20):10113–21.

[50]. Zoefel B, Ten Oever S, Sack AT. The involvement of endogenous neural oscillations in the processing of rhythmic input: More than a regular repetition of evoked neural responses. Front Neurosci. 2018;12:95.

Pg. 10, line 185: the greater negative mean asynchrony can be explained by a weaker tapping force – please explain why. The logic behind this is unclear for the moment.

We added an explanation about 10 lines above (L173-177), and clarified the sensory accumulation theory: “In addition to slower processing and integration, this so-called “sensory accumulation” theory further predicts that the magnitude of auditory-tactile delay, and the resulting NMA, depends on stimulation intensity, which, in case of tapping, is hypothesized to concern the tapping force. The NMA is therefore hypothesized to decrease when tactile-kinesthetic feedback in the form of tapping force increases”

We hope that we addressed the confusion re. the logic behind that sentence on L183 “the larger degree of NMA observed in PWS can be explained by a weaker tapping force, as predicted by the sensory accumulation theory”.

Table 1: musical training is listed as 0, 1, 2 for PWS and as no/yes for PNS – these should be the same scale.

The PNS and PWS were matched in musical training. The notation is modified in the table, accordingly.

Figure 1: For 1:4_ISO_SYNCH – were the 1st and 4th beats in the example stimuli accented? If so, this should be made clear in the diagram with e.g., an accent marker.

In that condition, the first and fifth beats were not “accentuated” but were the only ones marked with an external stimulus.

We explained this on P15, 271-272, in the description of the task 1:4_ISO_SYNC, as follows: “the external auditory stimuli were played back every 4 beats only – on the 1st and the 5th beats of the 8- beat cycle”.

In addition, this is also illustrated in the recapitulative Figure 1.

Pg. 16: Only the 9th and 24th taps were considered for analysis. Why? Please motivate the reason for this in the text.

We did not consider the 9th and 24th taps, but the taps number 9 to 24, i.e., the 2nd and 3rd 8-beat cycles of taps, taken as the “stabilized” phase of the task. It was hypothesized that some sensorimotor adaptation and learning might occur during the first cycle of 8 taps and carried over to the remainder of the trial.

The word “Taps 9-24” was replaced by “Taps 9 to 24” to avoid confusion

Table 2 is helpful for following the measures taken across the different tasks.

We are happy that it improved the paper.

Figure 3 caption seems to be switched around, with 3a as periodicity in the caption, but coefficient of variation in the figure.

Modified

Pg. 27 – why was the Bayesian model Group + Time and no interaction?

The notation in the bpnreg package (for circular mixed models) is actually different from that of the nlme package (for linear mixed models). In the package bpnreg, the notation “Group + Time” actually considers the possible interaction Group*Time between both factors. We refer the readers to Cremers et & Klugkist (2018) for more details about the use of the bpnreg package.

Cremers J, Klugkist I. One direction? A tutorial for circular data analysis using R with examples in cognitive psychology. Front Psychol. 2018;9:2040.

Pg. 27, line 512: “however, no significant difference…” – I don’t think you need a “however” here. You could just say “there was no significant difference…”.

The word “however” has been removed.

Pg. 33, line 645 – after explaining all the results across the various measures, it might be nice in this first paragraph to more clearly link the theoretical side with the tasks measured. E.g., 1- in the execution of movements (as measured in xx task/s).

This paragraph, L622-628, has been modified as follows:

“The study investigated the rhythmic tapping behavior of people who stutter compared to people who do not stutter and considered several levels of processing at which differences were hypothesized to occur: 1- the execution of movements, in particular their initiation (as measured in the task REACT), 2- the perception of beat, at a given periodicity (as measured in the task ISO_REPRO), 3- the on-line adaptation and improvement of their accuracy and consistency, based on sensory feedback (as measured in the tasks 1:1_ISO_SYNC, 1:4_ISO_SYNC and NONISO_SYNC).”

Pg. 36, Lines 739-743: this sentence is very long and refers to already presented information. This could be written more concisely to avoid having to refer to information “already stated above”.

The sentence on L712-713 has been modified as follows:

“Several arguments were provided in the preceding section (4.1) that exclude the idea that timing differences between PWS and PNS simply result from an impaired motor execution.”

Pg. 40, line 812, do you mean central nervous system?

Thank you for pointing this out. It has been corrected accordingly.

Pg. 41, line 835: do you mean *decreased* PLV in PWS?

Yes., thank you for noticing the mistake. This has been corrected.

Attachment

Submitted filename: Rebuttal_270922.pdf

Decision Letter 2

Jessica Adrienne Grahn

12 Oct 2022

Rhythmic tapping difficulties in adults who stutter: a deficit in beat perception, motor execution, or sensorimotor integration?

PONE-D-21-37460R2

Dear Dr. HUEBER,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Jessica Adrienne Grahn

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: No

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: All my concerns were addressed. Goodluck!

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

**********

Acceptance letter

Jessica Adrienne Grahn

18 Oct 2022

PONE-D-21-37460R2

Rhythmic tapping difficulties in adults who stutter: a deficit in beat perception, motor execution, or sensorimotor integration?

Dear Dr. Garnier:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr Jessica Adrienne Grahn

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File

    S1 Fig. (a) Correlation between this average Tapping Asynchrony in the condition REACT (i.e. the average finger Reaction Time) and the SSI score of PWS. (b) Correlation between this reaction time variability and the SSI score of PWS. S2 Fig. Tapping Force Variability in the reproduction task of an isochronous pattern, after passive listening (ISO_REPRO) and in both tasksof synhcronization to a 4-beat metered isochronous pattern (1:1_ISO_SYNC) or to a non-isochronous pattern (NONISO_SYNC). S3 Fig. (a) Correlation between the average log(CV) value and SSI score of PWS. (b) Correlation between the average log(CV) and the Reaction Time Variability (in the condition REACT) of each participant (N = 32). (c) Correlation between log(CV) and the Tapping Force Variability on each train of taps produced in the condition ISO_REPRO. S4 Fig. (a) Correlation between the average PE and the SSI score of PWS. (b) Correlation between the average PE in the condition ISO_REPRO, and the average Reaction Time in the condition REACT of each participant (N = 32). S5 Fig. (a) Correlation between the average PA in the condition 1:1_ISO_SYNC, and the SSI score of PWS. (b) Correlation between the average log(PLV) vaalue in the condition 1:1_ISO_SYNC, and the SSI score of PWS. S6 Fig. (a) Correlation between the average Phase Angle (PA) in the condition 1:1_ISO_SYNC, and the average Reaction Time in the condition REACT of each participant (N = 32). (b) Correlation between the average Phase Locking Value (logit((PLV)) in the condition 1:1_ISO_SYNC, and the average Variability in Reaction Time in the condition REACT of each participant. S7 Fig. (a) Correlation between the Phase Angle (PA) and the Tapping Force (TF) of each tap in the conditions 1:1_ISO_SYNC and 1:4_ISO_SYNC. (b) Correlation between the logit(PLV))value and the Tapping Force Variability on each train of taps produced in the conditions 1:1_ISO_SYNC and 1:4_ISO_SYNC. People who stutter (PWS, N = 16) are compared to with matched control participants without speech disorders (PNS, N = 16).

    (PDF)

    Attachment

    Submitted filename: Rebuttal-PlosOne-150622.pdf

    Attachment

    Submitted filename: Rebuttal_270922.pdf

    Data Availability Statement

    Data cannot be shared publicly because of the confidentiality clause in the ethical approval (CERGA: IRB00010290-2018-10-16-54). Data are available from the GIPSA-lab (contact via Laurent Girin: laurent.girin@gipsa-lab.fr) for researchers who meet the criteria for access to confidential data.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES