Skip to main content
eLife logoLink to eLife
. 2022 Nov 1;11:e74816. doi: 10.7554/eLife.74816

Collective dynamics support group drumming, reduce variability, and stabilize tempo drift

Dobromir Dotov 1,2,3,, Lana Delasanta 4,5, Daniel J Cameron 1,2, Edward W Large 4,5,6,7, Laurel Trainor 1,2,8
Editors: Martin Hartmann9, Aleksandra M Walczak10
PMCID: PMC9678363  PMID: 36317963

Abstract

Humans are social animals who engage in a variety of collective activities requiring coordinated action. Among these, music is a defining and ancient aspect of human sociality. Human social interaction has largely been addressed in dyadic paradigms, and it is yet to be determined whether the ensuing conclusions generalize to larger groups. Studied more extensively in non-human animal behavior, the presence of multiple agents engaged in the same task space creates different constraints and possibilities than in simpler dyadic interactions. We addressed whether collective dynamics play a role in human circle drumming. The task was to synchronize in a group with an initial reference pattern and then maintain synchronization after it was muted. We varied the number of drummers from solo to dyad, quartet, and octet. The observed lower variability, lack of speeding up, smoother individual dynamics, and leader-less inter-personal coordination indicated that stability increased as group size increased, a sort of temporal wisdom of crowds. We propose a hybrid continuous-discrete Kuramoto model for emergent group synchronization with a pulse-based coupling that exhibits a mean field positive feedback loop. This research suggests that collective phenomena are among the factors that play a role in social cognition.

Research organism: Human

Introduction

Humans are social animals who engage in a variety of collective activities requiring coordinated joint action. Collective goals can be achieved through spontaneously distributed workloads among group members, such that the emerging collective dynamics confer benefits to performance not available to individuals—the ‘wisdom of crowds’ (Galton, 1907; Surowiecki, 2005).

Principles of collective dynamics explain the adaptive value of collective behavior in some species (Couzin, 2018). For example, collective dynamics can overcome the limitations of individuals’ knowledge and ability to communicate (Goldstone and Roberts, 2006) by integrating information quickly during group decision making (Ispolatov, 2015; Miller et al., 2013; Rosenthal et al., 2015). Large coherent swarming and flocking in groups can arise from short-range interactions among proximal individuals in non-human animals including primates (Farine et al., 2017), insects (Jacobs et al., 2007), fish, and birds (Miller et al., 2013; Parrish et al., 2002). As such dynamics have been demonstrated in numerous species, theoretical models have been developed to provide additional quantitative explanatory support. In contrast, in spatial tasks such as navigation, aggregate group behavior, formalized as a mean field, can serve to stabilize feedback to individuals by virtue of being a group average (Berdahl et al., 2018; Sumpter, 2006; Torney et al., 2009). Here, we test whether mean field behavior accounts for stability of temporal group dynamics in humans during a drumming task.

Collective dynamics are observable in animal as well as in human group behavior. Crowds of walking individuals achieve globally coherent states based on local inter-individual interactions (Rio et al., 2018; Warren, 2018). In audiences, individual-group interactions and social contagion govern the spontaneous onset and offset of applause (Mann et al., 2013). Collective dynamics can also play a constitutive role in sports by permitting advantages not available to individuals alone (Vilar et al., 2013). Evidence of collective dynamics is seen even in the minimal group, a dyad: spontaneous synchronization between two individuals emerges from constraints such as weak coupling (Oullier et al., 2008; Schmidt et al., 1990; Schmidt and O’Brien, 1997).

Synchronized group action is an essential element of music making, a defining social behavior of human interaction (Honing et al., 2015; Patel and Iversen, 2014; Salimpoor et al., 2011; Savage et al., 2015; Trainor, 2015), and the value of precise synchronization may vary from culture to culture and across musical contexts (Benadon et al., 2018; Davies et al., 2013; Lucas et al., 2011).The archaeological evidence of musical instruments goes back 30,000 years, and singing and drumming are thought to be even older (Conard et al., 2009). The evolutionary origins of musical rhythmic actions may relate to social motor behaviors in non-human species, such as the synchronization and desynchronization of vocalizations between individuals in group chorusing, arising from pressures to either collaborate or compete (Gamba et al., 2016; Greenfield et al., 2017; Ravignani et al., 2014; Ravignani et al., 2019). Fundamental aspects of music are present in humans from early stages in development: infants show early musical preferences, social-emotional responses to music, and rate-sensitive motoric responses to musical rhythm (Cirelli et al., 2014; Cirelli et al., 2018; Trainor and Marsh-Rollo, 2019; Zentner and Eerola, 2010). Although musical behavior is found in individuals alone, it occurs mainly in groups ranging from duets to hundreds of participants. Yet, the role of collective dynamics, especially in groups larger than dyads, has largely remained untested. Here, we consider a musical task in which timing consistency and synchrony are crucial. We investigate in what ways the group average performs better than the individuals, a temporal version of the wisdom of the crowd phenomenon.

To account for group interaction, one popular strategy is to first develop a theory for single-person tasks and then extrapolate into dyadic contexts. For example, the idea that the brain is a prediction machine that enables timely motor control in an uncertain environment can be extrapolated as two brains mutually predicting each other (Friston and Frith, 2015; Wolpert et al., 2003). This means that each individual instantiates two processes: the first for controlling one’s own rhythm and timing and the second for predicting the rhythm and timing of the partner in the dyad (Heggli et al., 2019; Heggli et al., 2021; Keller et al., 2014; van der Steen and Keller, 2013). It is not clear, however, if this can serve as an adequate foundation for behavior in larger groups. Every individual would have to predict every other individual and potentially factor-in higher-order predictions. Furthermore, at the perceptual level, group size changes the auditory information available to an individual. A listener is likely to hear an entire choir as one, or a small number of, coherent sounds rather than perceive every singer’s voice within that choir.

Group interaction can also be addressed in terms of theoretical models explicitly developed for synchronization in systems made of many dynamic units (Alderisio et al., 2016; Oullier et al., 2008; Schmidt et al., 1990; Schmidt and O’Brien, 1997). The individual units in such models usually do not contain processes dedicated separately to self-timing and other-predicting. Instead, each unit has only a self-timing and a phase-correction coupling term. The propagation of phase adjustment across all units is sufficient for the collective to enter a group-synchronized state, despite that each unit is only trying to cancel a phase difference. An important benefit of this approach is that it is inherently collective, converging on the same formalisms used for small or large groups of animals (Kelso, 2021; Zhang et al., 2019). Here, we consider such a system of coupled oscillators to account for empirical data on individual and group variability in a drumming task.

Group music making constitutes an ecologically valid and convenient paradigm for studying group action and collective experience in the laboratory (e.g. D’Ausilio et al., 2015). We used a timing task performed by groups of different sizes. Larger group dynamics are less studied because measuring highly precise timing while collecting group data is difficult for both logistical and methodological reasons (cf. Alderisio et al., 2016; Chang et al., 2017; Chang et al., 2019; Chauvigné et al., 2019; Shahal et al., 2020). Participants completed a group synchronization-continuation task (SCT; Figure 1A). It required them to drum in synchrony with an isochronous auditory stimulus and continue drumming at the same rate after the stimulus stopped, while we collected the onset times of each drum hit (Figure 1B). Stimulus tempo was varied across trials. We tested groups of two (dyads), four (quartets), and eight (octets) participants (Figure 1C). The task tested participants’ ability to synchronize to an external reference as well as to other participants, while also minimizing temporal variability and maintaining the initial stimulus rate. This ensemble drumming was also compared against the solo condition. Specifically, participants tested in duets or quartets completed both solo and ensemble conditions. The octet group did not complete the solo conditions for logistical reasons, but they instead completed a control condition in which the synchronization phase continued for the whole trial (i.e. there was no continuation phase) as well as trials using a more complex and musically realistic rhythm. Analyses of the more complex rhythm will be reported separately.

Figure 1. Experiment setup.

Figure 1.

(A) In ensemble condition, drummers faced each other in a circle. (B) The main task was synchronization-continuation where participants were paced initially by an auditory stimulus and then had to maintain the rhythm, tempo, and synchronization among each other. The inter-onset intervals between drum hits, shown schematically as vertical lines, were used to obtain individual-level measures of variability and speeding up. Cross-correlation and transfer entropy were used as pair-level measures of synchronization and interaction. (C) Transfer entropies, color, and width-coded, from three sample trials from different groups. Network analysis was applied to these graphs to obtain group-level characterization.

Inspired by theoretical models from the animal literature (Sumpter, 2006), we assumed that individual interactions average to overall group behavior, a mean field, which provides stabilizing feedback to individual group members (Figure 2). A key prediction was that the relative influence of the stabilizing feedback would increase with increasing ensemble size. To this end, we measured separately the variability of individual drummers and the group. We constructed a theoretical group-aggregate onset time as the center of clusters of individual onset times. We expected that larger groups would exhibit lower variability as measured using the coefficient of variation of inter-onset intervals. We also tested this idea formally by adapting a Kuramoto model of group synchronization.

Figure 2. Group synchronization of eight oscillators with random initial conditions (A) and coherent phases later in the trial (B).

The middle inset shows individual trajectories (red lines) and the mean field (blue line) in time (t). Averaging the phase oscillators θ (red lines) gives a so-called mean field (blue lines) with phase Ψ and amplitude r. As the individual oscillators become more coherent from (A) to (B), r increases which leads to stronger influence from Ψ to diverging θ’s (weight of the downward vectors).

Figure 2.

Figure 2—figure supplement 1. The gamma distribution (a=1.25 and b=.02) in the pulse-coupled model of group synchronization localizes the coupling in time, Equation 5.

Figure 2—figure supplement 1.

The (Kuramoto, 1975) dynamic system of coupled phase oscillators, Equation 1, was conceptualized as a large population of oscillators with different natural frequencies capable of spontaneously locking to a common frequency.

θ˙i=ωi+KNj=1Nsin(θjθi) (1)

Here, θi is the phase of oscillator i, ωi is its preferred frequency, i.e., how fast around the unit circle it likes to go, K is coupling strength, and N is the number of oscillators.

The model gives a mathematical account of group synchronization as dependent on a mean field, referred to here as group aggregate. A central feature of the model is that the feedback is positive: the amplitude of the mean field grows as a function of inter-individual synchronization, and reciprocally, the individual oscillators are affected more by the mean field if its amplitude is larger, see Figure 2. This is shown by using the definition of the mean of phases, Equation 2, to express Equation 1 equivalently (Strogatz, 2000) in terms of the coupling between individual oscillators and the mean field, Equation 3. Ψ is the mean field phase, and r is the mean field coherence, also called order parameter (Kuramoto, 1975).

reiΨ=1Nj=1Neiθj (2)
θ˙i= ωi+rKsin(Ψθi) (3)

Such individual-collective positive feedback loops enable ant trails and other phenomena in swarming animals (Sumpter, 2006) as well as acoustic herding in chorusing animals (Ravignani et al., 2014). The system also serves as a model for neuronal collective synchronization (Breakspear et al., 2010; Frank et al., 2000; Noori et al., 2020). Theoretically, the principles embodied by this system should apply to group action in humans too (Zhang et al., 2019). It has found application in understanding inter-personal synchronization of dyads (Dotov et al., 2019; Heggli et al., 2019; Roman et al., 2019), individual rate preferences in the dyad (Bégel et al., 2022), and effects of coupling topology in larger groups (Alderisio et al., 2016). In the present context, the model predicts that ensembles will be more stable than individuals because of the feedback loop between the timing of individuals and the group aggregate of individuals.

The original Kuramoto model involves continuous dynamics and continuous coupling with constant gain and no delay. In a drumming task, participants are coupled by way of discrete acoustic onset times. Here, we propose a hybrid continuous-discrete system of oscillators with event-based feedback updated once per cycle, Equation 4. It utilizes a pulse function shaped like the acoustic envelope of a drum hit. To this end, Equation 5 specifies an asymmetric gamma distribution that rises sharply at time 0 and then decays slowly (Figure 2—figure supplement 1). In the model, an oscillator emits a pulse once per cycle when its phase angle crosses zero. Furthermore, we posit additive phase variability with a Gaussian distribution N(0,σ).

θ˙i=ωi+KNj=1NPji(t)sinθi+N(0,σ),i=1,...,N (4)
P(t)=f(t|a,b)=1baΓ(a)ta1etb (5)

Like the original system, this model involves intrinsic dynamics (ωi) and interaction dynamics given by the whole coupling term. For comparison, we tested two additional models: the classic Kuramoto system with constant full coupling, Equation 1, as well as the hybrid continuous-discrete model, Equations 4; 5, with a sparse coupling matrix in a ring topology. Specifically, the individual units were only coupled to their two immediate neighbors, j=i±1, with a periodic boundary condition, θN+1=θ1 and θ0=θN .

Results

In Performance, Dynamics and coordination in ensemble, SCT conditions, and Group dynamics, we present the analyses of the participants’ drumming behavior. In Variability in the mean-field model with discretely updated feedback, we present the results from our adapted hybrid model consisting of a continuous-discrete system of oscillators with pulse-based feedback updated once per cycle.

Performance

Variability

Does individual variability increase in ensemble compared to solo conditions?

To answer this question, we fitted a model including only individuals’ data in dyads and quartets (octets did not complete solo conditions). As Figure 3A shows, individuals’ variability increased when playing in an ensemble relative to playing solo (β=0.009, SE=0.001, t=6.65, and ω2=0.10).

Figure 3. Task performance measured in terms of mean (SE) coefficient of variation of inter-onset intervals (IOIs).

(A) Synchronization-continuation task (SCT) drumming trials (no data collected in N8 solo condition). (B) Pulse-coupled Kuramoto model. N2=dyad; N4=quartet; N8=octet; Inds = individual participants; Aggr = group-aggregate; Solo = solo condition; Ensemble = ensemble condition. Error bars are standard errors. With missing trials, the number of observations in the respective conditions was n=(88, 88, 44, 111, 110, 32, ∅, 953, 133) in (A), and n=(200, 198, 99, 400, 400, 100, 800, 688, 86) in (B).

Figure 3—source data 1.

Figure 3.

Figure 3—figure supplement 1. Task performance measured in terms of mean (SE) coefficient of variation of IOIs.

Figure 3—figure supplement 1.

N2=dyad; N4=quartet; N8=octet; Inds = individual participants; Agg = group-aggregate; Solo = solo condition; Ens = ensemble condition; no data collected in N8 Solo condition. (A) Synchronization-continuation task (SCT) drumming trials. (B) Simulations. As a sanity check, group-aggregate variability of IOIs in the solo (pseudo-group-ensemble) was much higher. A linear model was fitted to duet and quartet drumming trials (N2 and N4), the groups where the participants performed both in solo and ensemble condition. The statistical model confirmed that the group-aggregate variability in ensemble condition was lower than the pseudo-group-aggregate in solo playing condition (β=−0.171, SE = 0.016, and t=−10.56). In addition, the pseudo-group-aggregate was higher than individuals’ variability in solo conditions (β=0.153, SE=0.011, and t=13.38). There was an interaction between group size and the difference between individuals and group aggregate. Pseudo-group-aggregate variability in solo trials increased as group size increased from duet to quartet (β=0.017, SE=0.004, and t=4.67), i.e., as more individual solo performances were put into calculating the mean field. In ensemble playing, group-aggregate variability decreased as group size increased from duet to quartet (β=−0.016, SE=0.005, and t=−3.22), the more individuals playing together, the lower the variability of the group average. Error bars are standard errors. With missing trials, the number of observations was n=(88, 111, ∅, 44, 32, ∅, 88, 110, 953, 44, 32, 133) in (A), and n=(200, 400, 800, 100, 100, 100, 198, 400, 688, 99, 100, 86) in (B).
Figure 3—figure supplement 2. Variability in alternative models of group synchronization.

Figure 3—figure supplement 2.

(A) The classical Kuramoto model with constant coupling term. (B) The pulse-coupled model but with selective coupling in a ring topology, where units only interact with their immediate neighbors. In contrast to the pulse-coupled model developed in the present paper, these models do not exhibit the pattern of data across conditions found empirically, see Figure 3. Error bars are standard errors. With missing trials, the number of observations in the respective conditions was n=(200, 200, 100, 400, 400, 100, 800, 800, 100) in (A), and n=(200, 198, 99, 400, 396, 99, 800, 784, 98) in (B).
In the context of true ensemble playing, how does variability of the individuals and group aggregate change across dyads, quartets, and octets?

To specifically test the main hypothesis, we fitted a model to continuation (SCT) trials from all three group sizes in the ensemble playing condition. We found that individual variability increased with group size (β=0.001, SE=0.0005, t=2.20, and ω2=0.09), group-aggregate variability was lower than individual variability (β=−0.010, SE=0.003, t=−3.29, and ω2=0.01), and the difference between individual and group-aggregate variability increased for larger group sizes (β=−0.0025, SE=0.0005, t=−5.19, and ω2=0.03), see Figure 3A. As a sanity check, we verified that the procedure for group-aggregate onset times did not lead to spuriously periodic data. We observed very high variability for the pseudo-group-aggregate in solo condition where individuals did not hear each other, see Figure 3—figure supplement 1A.

How does variability compare between continuation and synchronization?

The octet group completed trials in conditions of both SCT (i.e. after the offset of the reference metronome) and synchronization-only (with a constant reference metronome). Surprisingly, there were no differences between these conditions either in individual (t<1) or in group-aggregate variability (t<1).

Speeding up

Do individuals speed up more when playing solo than in dyad or quartet ensembles?

Speeding up was defined as the linear increase in tempo (i.e. decrease of IOIs) over the course of a trial. As expected, individuals sped up even when playing solo (β=0.018, SE=0.007, t=2.67, and ω2=0.10), and greater speeding up was observed in ensemble than in solo conditions (β=0.022, SE=0.007, t=3.29, and ω2=0.16). The effect of group size was not significant (t<1).

Do dyad, quartet, and octet ensembles speed up to different extents?

To test the effect of group size across the duets, quartets, and octets, a second model was fitted with group size as a continuous predictor, but only including trials in SCT ensemble playing. As suggested by Figure 4, there was a trend for speeding up to decrease as group size increased. The best linear model included an effect of tempo (β=−0.001, SE=0.00005, t=−1.98, and ω2=0.005) and an interaction between group size and tempo (β=−.00004, SE=0.000007, t=−6.20, and ω2=0.11), reflecting that at higher tempos larger ensembles sped up less than smaller ensembles, see Figure 4—figure supplement 1.

Figure 4. Mean (SE) speeding up defined as the linear slope of tempo over time, computed from the IOIs.

N2=dyad; N4=quartet; N8=octet (no data collected in solo N8 condition). Error bars are standard errors. With missing trials, the number of observations in the respective conditions was n=(88, 111, ∅, 88, 110, 482).

Figure 4—source data 1.

Figure 4.

Figure 4—figure supplement 1. Task performance in ensemble, synchronization-continuation task (SCT) trials, measured in terms of (A) speeding up, defined as the linear slope of tempo over time, and (B) variability.

Figure 4—figure supplement 1.

Individuals’ data across group sizes (color-coded) and tempos are shown (abscissa jittered for visibility) along with the model fit. Note that tempos 50, 100, and 240 were not included in the statistical analysis and main text.

Dynamics and coordination in ensemble, SCT conditions

Individual and group-aggregate dynamics (autocorrelations)

Individuals

Figure 5A shows that individuals’ autocorrelations at lag 1 were negative across all group sizes and tempos. This is typical for the alternating long-short IOIs seen in synchronization tasks, reflecting alternating fast-slow synchronization errors when adapting to the previous interval. There was also a trend across all group sizes and tempo conditions for positive peaks at even lags (2, 4, 6, and 8) and negative peaks at odd lags (3, 5, and 7), see Figure 5—figure supplement 1. However, this general pattern became smoother with increasing group size. To test this statistically, in each trial, we took the average absolute difference between successive lags, thus measuring the average range of the autocorrelation function up to lag 8 and applied the same linear modeling approach as in Performance. The effect of group size was significant (β=−0.0054, SE=0.0023, t=−2.35, and ω2=0.13). There was also a decrease of range with increasing tempo (β=−0.00039, SE=0.0001, t=−4.92, and ω2=0.04). Additionally, we verified the signs, relative magnitudes, and the effects of group size and tempo on the autocorrelations by fitting separate linear models for each lag, see Supplementary file 1a.

Figure 5. Autocorrelations in the synchronization-continuation task (SCT) in ensemble conditions.

(A) Autocorrelations of individual IOIs, averaged (SE) across participants’ trials and tempos, separately per group size (color-coded). (B) Same for group-aggregate IOIs. Error bars are bootstrap 95% confidence intervals. The number of observations in the respective group sizes was n=(88, 110, 474) in (A) and n=(44, 32, 66) in (B).

Figure 5—source data 1.

Figure 5.

Figure 5—figure supplement 1. Autocorrelations of individual IOIs in ensemble synchronization-continuation task (SCT) drumming, averaged (SE) across individual participants’ trials, shown separately per tempo (panels) and group size (color-coded lines).

Figure 5—figure supplement 1.

Tempos 50, 100, and 240 were not included in the statistical analysis in the main text. Error bars are bootstrap 95% confidence intervals. The number of observations in the respective group sizes was n=(20, 14) in bpm=50, n=(22, 28, 83) in bpm=80, n=(77) in bpm=100, n=(22, 26, 85) in bpm=120, n=(22, 28, 83) in bpm=160, n=(22, 28, 84) in bpm=200, n=(22, 27) in bpm=240.
Group-aggregate

Interestingly, group-aggregate IOIs revealed the same overall pattern of results even though by definition the group-aggregate timing was smoother and less variable (see Group-aggregate (mean field) measure for ensembles and pseudo-ensembles), see Figure 5B and Supplementary file 1b. There was a trend across all group sizes for relative positive peaks at even lags 2, 4, 6, and 8 and relative negative peaks at odd numbered lags 1, 3, 5, and 7. The similarity of the dynamics of individuals and the group aggregate is not trivial because it implies that even groups of up to eight participants spontaneously acquire the alternating long-short interval dynamics characteristic of when individual participants synchronize with a stimulus.

Inter-personal coordination (cross-correlations)

Cross-correlation applied to the pre-processed IOIs assessed inter-personal coordination between pairs of participants drumming together in the continuation phase of ensemble performances. On rare occasions, individuals produced diverging beat times by, for example, missing the drum or hitting it two times. The resulting outlier IOIs were removed, and the remaining were re-aligned relative to the other participants by temporal adjacency. The last pre-processing step consisted of whitening each time-series of IOIs (see From beat onset times to IOIs, outlier removal, and clustering–Pre-whitening).

We analyzed lags in the range 0–4 because cross-correlation coefficients tended to be symmetric between positive and negative lags, see Figure 6. As expected, cross-correlations at lag 0 were negative in duets (β=−0.039, SE=0.011, t=−3.60, and ω2=0.04), consistent with the pattern of results for autocorrelations, and inter-personal coordination dynamics were smoother in larger group sizes. Furthermore, lag 0 cross-correlations became less negative with increasing group size (β=0.012, SE=0.002, t=5.93, and ω2=0.13). Lag 1 cross-correlations were positive (β=0.14, SE=0.012, t=11.80, and ω2=0.32), but their magnitude decreased with group size (β=−0.016, SE=0.002, t=−7.14, and ω2=0.20). There were also effects of tempo, see Figure 6—figure supplement 1.

Figure 6. Cross-correlations in the synchronization-continuation task (SCT) in ensemble conditions.

Averages (SE) across participant pairs and tempos are shown separately per group size (color-coded lines). IOIs were aligned across participants and pre-whitened by filtering with an autoregressive model. Error bars are bootstrap 95% confidence intervals. The number of observations in the respective group sizes was n=(44, 140, 1479).

Figure 6—source data 1.

Figure 6.

Figure 6—figure supplement 1. Cross-correlations in the synchronization-continuation task (SCT) task, ensemble conditions, averaged across participant pairs (±CIs), per tempo (panels) and group size (color-coded lines).

Figure 6—figure supplement 1.

IOIs were aligned across participants and pre-whitened by filtering with an autoregressive model. Tempos 50, 100, and 240 were not included in the statistical analysis. Error bars are bootstrap 95% confidence intervals. The number of observations in the respective group sizes was n=(10, 19) in bpm=50, n=(11, 36, 248) in bpm=80, n=(232) in bpm=100, n=(11, 32, 260) in bpm=120, n=(11, 36, 248) in bpm=160, n=(11, 36, 254) in bpm=200, n=(11, 34) in bpm=240.

Group dynamics

Network analysis

We used network analysis in the continuation phase of trials of ensemble playing to describe group coordination at an even higher level of organization than interpersonal coordination. First, we obtained the directed graphs of functional connectivity among participant drummers. In each trial, the graph consisted of the real-valued directed links between pairs of drummers estimated by way of the transfer entropy (TE), see Figure 1C. Then, network properties of these graphs were computed, see Network analysis. There was a significant effect of group size on causal density (β=−0.0006, SE=0.0001, t=−5.88, and ω2=0.47), see Figure 7A. Causal density increased with tempo (β=0.008, SE=0.0019, t=4.01, and ω2=0.12). As Figure 7B shows, mean node strength increased with group size (β=0.0032, SE=0.00032, t=10.13, and ω2=0.73). Node strength was affected by tempo (β=0.016, SE=0.005, t=2.96, and ω2=0.07). For better model conditioning, tempos values were reduced and compressed by transforming them with a logistic function. Comparing between SCT trials and synchronization-only trials in the octet group found no difference for causal density (t<1) or mean node strength (t<1), suggesting group dynamics were similar regardless of whether a pacing stimulus was present or not.

Figure 7. Network dynamics.

(A) Mean node strength increases with group size in the continuation phase of synchronization-continuation task (SCT) ensemble drumming trials. (B) Causal density decreases with group size (Abscissa jittered for visibility).

Figure 7—source data 1.

Figure 7.

Figure 7—figure supplement 1. Association between drumming performance measures and network properties in synchronization-continuation task (SCT) ensemble trials.

Figure 7—figure supplement 1.

Figure 7—figure supplement 2. Transfer entropy (TE) is highest in synchronization-continuation task (SCT) trials with minimal tempo increase relative to the instructed tempo.

Figure 7—figure supplement 2.

The Gaussian model (green) was selected over the linear (red) using model comparison with Akaike information criterion and Bayesian information criterion (AIC and BIC), indicating that higher TE values tended to concentrate in the center of the distribution. Specifically, we compared a Gaussian and a linear model’s ability to predict TE values from all pairs during continuation tapping (including all group sizes) from their respective changes in tempo (maximum likelihood estimation, mle in R). Indeed, the AIC method selected the Gaussian model (–39,838.89) over the linear (–39,566.81). The estimated 95% CIs of model parameters indicated that the center of the Gaussian curve was just above zero, change of 1.56–2.37 beats per minute (bpm), over the continuation phase (tempo changes in the data ranged from about –20–40 bpm), suggesting that best group coordination was achieved when participants were slowly increasing their tempo or that slowly increasing the tempo allowed for best coordination.

Network properties and task performance

The relation between group dynamics and group performance was evaluated by regressing the network properties separately with respect to speeding up and IOI variability in ensemble playing. Speeding up decreased with increasing causal density (β=−0.005, SE=0.002, t=−2.22, and ω2=0.02) and was not associated with node strength (β=−0.01, SE=0.007, and t=−1.66), see Figure 7—figure supplement 1A–B. Variability was not associated with causal density (β=−.003, SE=0.0017, and t=−1.97) but decreased with increasing node strength (β=−0.007, SE=0.0019, t=−4.01, and ω2=0.09), see Figure 7—figure supplement 1C–D. Interestingly, we also observed that TE tended to be higher in trials that deviated less from the stimulus tempo, see Figure 7—figure supplement 2.

Variability in the mean-field model with discretely updated feedback

The results of the modeling showed that the cycle duration variabilities of the mean-field hybrid Kuramoto model, adapted to have discretely updated feedback, reproduced the main experimental findings. The model results are summarized in Figure 3B and can be compared to the experimental results in Figure 3A. The adapted model reproduced the pattern of behavioral results at both the level of individual oscillators (relating to individuals in our drumming data) and the mean field (relating to our group-aggregate analyses). Specifically, for all group sizes, variability was higher for individual oscillators when playing in the group than when playing solo, but variability was always lowest for the mean field (group aggregate). In addition, with increasing group size, the difference between individual variability and that of the mean field became greater, indicating increasing collective benefits with increasing group size. See Figure 3—figure supplement 1B for a comparison with the null (pseudo-ensemble) condition. A separate report will address the model’s individual unit dynamics and co-variation in more detail (Delasanta et al., in preparation). The importance of the mean-field hybrid model for understanding the present results can be seen in that other versions of the Kuramoto model did not successfully replicate the pattern of the data. Specifically, a pulse-coupled network in a neighbors-only ring-topology and a constant coupling network in full connectivity did not exhibit the same pattern of variability as the empirical data, see Figure 3—figure supplement 2A–B.

Discussion

The present study examined how group performance in a synchronization timing task depends on group size and interactions among group members. Our overall hypothesis was that the mean field would influence individuals in a group, thereby stabilizing group performance. We expected that larger groups would exhibit more stable performance due to a more stable mean field. This hypothesis was motivated by the collective dynamics that allow animals with limited capacity for interacting with each other to achieve mutual goals by exploiting processes such as positive collective feedback and wisdom of crowds (Sumpter, 2006). As expected, we observed that temporal variability in group drumming decreased with group size. This occurred only at the level of the ensemble or mean field when the group was taken as a single entity. The effect was reversed for individual participants; their variability increased from solo to group playing conditions and with group size. We showed that this result agrees with a collective dynamics framework based on an adapted Kuramoto model. The model is consistent with the idea from theoretical biology that self-organizing systems can successfully coordinate with an external constraint by reducing their disorder at the macro-level at the expense of increasing their micro-level disorder (Kugler and Turvey, 1987; Prokopenko et al., 2014) and that medium-sized groups exhibit commonalities with principles of organization found both in two-unit and large collectives (Kelso, 2021; Zhang et al., 2019). Our theory is related to the Vicsek model, an influential account of collective animal behavior which explains the spontaneous coherent heading direction arising in collective animal movement, also in human crowds, in terms of local interactions of attraction and repulsion (Silverberg et al., 2013; Vicsek et al., 1995). The Vicsek model also relies on individual coupling with the average of a group, with the important difference that only a small radius of neighbors defines the interaction field, called a local order parameter (Chaté et al., 2008).

The continuous-discrete Kuramoto system introduced here included fluctuating intrinsic frequencies and event-based feedback. Consistent with the discrete nature of time asynchronies in a drumming task, the model used a pulse function instead of continuous coupling. The implications of this idea reach beyond the topic of the present study. The combination of continuous movement and discrete sensory sampling of the environment is a frequently observed scenario; in the present study, participants used the discrete auditory feedback of drum taps to inform their continuous movements leading up to future drum taps. Our task reduced the opportunity of visual coupling but did not eliminate it completely, leaving the possibility that group drumming involved both discrete and continuous feedback such as visual information about continuous arm, hand, and drumstick movements related to the timing of individual drumming sounds. Future studies should address whether visual feedback during group drumming is informative, as well as explore how that visual information is captured within the individual and collective dynamics of the system.

Interestingly, in the octet group, individual variability was similar in both the synchronization-only condition, where the metronome pacing stimulus was present, and in the self-paced continuation stage of the SCT trials, where no pacing stimulus was present. This result implies that, although tempo accuracy and drift were superior when the pacing stimulus was present, the stability of group timing in a group of eight participants may hit a ceiling, where performance becomes as stable as with a pacing stimulus.

Individuals’ tempos tended to increase in solo trials, and as expected, this effect was amplified (i.e. speeding up was greater) when trying to stay synchronized with other participants (Okano et al., 2017). Surprisingly, the amount of speeding up was not higher for larger groups of drummers, and it was even reduced for larger groups, at least at faster tempos. In contrast, accounts based on mutual prediction imply more speeding up for larger groups (Thomson et al., 2018; Wolf et al., 2019). This discrepancy highlights the importance of testing tempo drift with specific task constraints and avoiding hasty generalizations from smaller to larger groups. Indeed, tempo can also slowdown in some circumstances. This was observed, for example, in a group synchronization task when participants synchronized their oscillations using swung pendulums, a task involving continuous visual coupling rather than discrete acoustic event coupling (Bardy et al., 2020).

Having summarized task performance in terms of consistency and accuracy, we next considered drumming dynamics. Individuals’ autocorrelations exhibited negative coefficient peaks at odd lags and positive coefficient peaks at even lags. This is the typical error-correction process demonstrated when individuals synchronize their tapping with a stimulus or with each other in pairs (Konvalinka et al., 2010; Repp, 2005). With increasing group size, however, the magnitudes of the autocorrelation coefficient fluctuations with lag decreased; individuals’ alternating patterns became smoother. This suggests that as group size increases, drumming dynamics are dominated less by tap-by-tap error-correction and more by a group dynamic. Interestingly, the group-aggregate onsets exhibited the same autocorrelation pattern as individuals, even in octets. This implies that the collective mean field behavior spontaneously acquired some of the dynamic properties typically associated with individual performance.

Inter-personal coordination in duets exhibited the negative cross-correlation at lag 0 and positive cross-correlation at lag 1 expected from past studies (Repp, 2005). The results for speeding up, variability, and cross-correlations in the dyads were consistent with the literature on dyadic interaction with tapping tasks. Yet, as group size increased in the present study, the coefficient magnitudes for cross-correlations decreased. In fact, they all but disappeared for groups of eight, suggestive of a leader-less group coordination scenario (Heggli et al., 2019). In a small number of trials, we even observed positive lag 0 cross-correlations in groups of eight, a signature of anticipation emerging from dynamic interaction. This result is consistent with the idea that group members coordinated less with each other and more with the mean field. Along with a more slowly decaying autocorrelation function, this suggests the emergence of a larger-scale group dynamic in the octet. As we hypothesized, mutual prediction is not necessarily the only basis for coordination in larger groups. The present model would be more like a predictive account had we included a new variable and an additional equation in the model, θ˙j,i=f(θi,θj,θj,i), corresponding to a separate internal oscillatory process θj,i that individual i dedicates to the prediction of an external signal j. Yet, there is no reason to assume that collective dynamics and mutual prediction are exclusive alternatives, only that collective dynamics are a mode of social interaction that emerges under certain constraints such as large groups of interacting agents.

Characterizing inter-personal interaction on the basis of bivariate measures is typical in the context of dyadic paradigms. Larger groups, however, create the possibility for higher-order interactions. Network analysis was applied to the connectivity graph of TEs among drummers. Larger groups had lower causal density but higher node strength. This is possible because individual nodes in larger groups project to a larger number of other individual nodes and suggests that in larger groups, each individual has a decreased influence on the group, both less potential to destabilize the group and lower requirements to keep the group stable. We also found that higher node strength was associated with better overall performance, both in terms of variability and tempo accuracy.

Group timing tasks address some of the foundations of human social interaction. This area of research has been labeled the ‘dark matter’ of social neuroscience (Dumas et al., 2014; Insel, 2010; Schilbach et al., 2013) because it has not received enough attention despite the fact that large parts of the human cortex respond to social stimuli (Frith and Frith, 2001). The temporal regularities in music make it an ideal stimulus to promote synchronous and cooperative movement, providing evidence as to why music is often present at emotional social gatherings such as weddings, funerals, parties, and political rallies. Feelings of affiliation and cooperation increase between people who have experienced synchronous movement with each other, even during infancy (Cirelli et al., 2014; Cirelli et al., 2016; Hove and Risen, 2009; Mogan et al., 2017; Trainor and Cirelli, 2015; Valdesolo et al., 2010). The ability of music to strengthen social bonds within a group may have evolutionary roots that enhance survival capabilities of the group (Huron, 2001; Savage et al., 2020; Trainor, 2015). Arguably, the presence of a convergent regular rhythm defined over the ensemble allows individuals of varying skill level to perform together without having to predict each note they play, an idea that remained largely untested in the present study because of the relative simplicity of the isochronous stimulus.

Participants in group activities sometimes report a so-called state of flow characterized by absorption, loss of sense of control, loss of self-consciousness, togetherness, and effortless action toward a shared goal (Csikszentmihalyi, 1990; Gaggioli et al., 2017; Hart et al., 2014). Little is known about the underlying group processes that enable such experiences. Some of the required conditions have been studied in the context of group musical performance (Bishop et al., 2019; D’Amario et al., 2018). Collective intentionality is often seen as an extrapolation of single-person cognitive processes in that individuals predict other individuals’ goals and actions (Tomasello, 2000, Tomasello, 2014). Yet, the neural and cognitive mechanisms historically identified in single-person paradigms do not always map onto the ontology of social behavior (Przyrembel et al., 2012). Collective intentionality can stand for an independent mode of social cognition (Satne and Salice, 2018; Zahavi and Satne, 2015). Social cognition theory can even be turned upside-down by observing cases where individual agency arises from social interaction (De Jaegher and Froese, 2009; Froese et al., 2014). The present work shows how collective dynamics can absorb individual action.

Beyond the group average

We have argued that attraction to the group average constitutes an organizing principle that supports the coordinated performance of medium and large groups of individuals. There are theoretical and empirical reasons, however, to assume that other organizing tendencies can compete with the group average and potentially lead to more complicated dynamics. To begin with, our synchronization task was designed to foster collaboration, did not challenge individual skill, and did not create any incentives for competitive individual behavior. This collaborative aspect corresponds to social behaviors where crowds want to appear as a strong unity, for instance, when chanting a protest song or storming into battle. In contrast, choir singing demonstrates that both cooperation and competition are involved often in group musical performance (Keller et al., 2017). And different players in musical ensembles often simultaneously play different parts with different melodic and rhythmic features. While these parts must fit together, reflecting cooperation, at any one point, the player of the ‘melody’ or most important part influences other players more than vice versa, setting up leadership dynamics within the group (e.g. Chang et al., 2017). Different cultural contexts and performance styles may even emphasize the creative and expressive de-synchronization from the regular group beat, or avoid isochrony altogether, allowing musicians to unfold their own ideas or belonging to a specific tradition (Benadon et al., 2018; Davies et al., 2013; Lucas et al., 2011). Understanding musical creativity requires a situated approach and ethnographic engagement to find which features of the emergent collective acoustic environment constitute meaningful affordances for given musicians in a given context (Linson and Clarke, 2017).

The combination of attractive and repulsive coupling among individuals is particularly evident in the context of collective animal behavior. The well-known Vicsek model accounts for the coherent heading direction in collective movement in terms of a combination of local attraction and repulsion forces (Vicsek et al., 1995). In chorusing animals, there is a tendency for male individuals to cluster their acoustic behavior in time but also to advance ahead of the cluster, presumably to have better chances at attracting the attention of other relevant individuals (Gamba et al., 2016; Greenfield et al., 2017; Ravignani et al., 2014; Ravignani et al., 2019).

The present study focused on a scenario with uniform coupling by keeping group members close to each other and restricting the timbre and pitch of their drums to a very narrow range, making them acoustically inseparable from each other when playing synchronously. Musicians who play on large stages sometimes report noticeable delays amongst each other. In the present study, the delay was limited in the range from 4 ms to 16 ms depending on partners’ spatial separation, and the sound intensity drop was from 6.3 to 15.1 dB. Yet, selective attention plays an important role in animal chorusing (Greenfield et al., 2021). Variations in the layout of the acoustic environment, individualized timbres, and the availability of alternative coupling modalities such as vision, all create affordances for selective attention and individualized roles in the ensemble. For example, low-pitch sounds afford better beat-based timing precision in humans (Burger et al., 2013; Hove et al., 2014; Hove et al., 2020). The discernability of individual sounds facilitates temporal coordination tasks, presumably through the involvement of neural mechanisms for self-other segregation and integration mechanisms (Liebermann-Jordanidis et al., 2021; Novembre et al., 2016; Ragert et al., 2014).

In conclusion, we assume that the aforementioned factors fit within the collective dynamics framework as symmetry-breaking terms, complications in task space that enable more interesting stabilities than mere attraction to the group average. Future studies can address this idea by manipulating individual heterogeneity, task difficulty, spatial layout, and coupling topology.

Materials and methods

Participants

A total of 102 university students participated in the duet (n=11 sessions; 22 participants), quartet (n=8 sessions; 32 participants), and octet (n=6 sessions; 48 participants) groups. In addition, 88 high-school students participated in the octet condition (n=11 sessions). Participants were recruited from the department’s participant pool with course credit and from local high schools as part of science field trips to the university. Across the set of all participants, musical training was distributed in the range from non-musician to amateur musician. We did not control the composition of musical skill within each ensemble.

Apparatus

In ensemble playing conditions, participants were seated in chairs facing each other, equally spaced in a linear, square, or octagon arrangement, with diagonal distances of 3 meters in the duet and quartet groups and 6 meters in octet group. Ensemble playing conditions were performed in a large, tall-ceiling dance studio, part of the LIVELab, a laboratory dedicated to auditory and movement neuroscience (http://livelab.mcmaster.ca). In the duet and quartet groups, solo trials were recorded with individual participants immediately before or after the ensemble sessions in an adjacent sound-proof room.

In the duet and quartet groups, participants used drumsticks on plastic buckets that were dampened and placed upside-down. Their fundamental frequencies varied in a narrow range (117–129 Hz). They are of the type seen in street-style group drumming observed in the community and were used previously by a group drumming workshop instructor. Piezoelectric sensor elements were taped on the inner side of the attack surface of each bucket. The analog signal was recorded on a computer at 48 kHz via a multi-channel studio-grade analog-digital interface (Focusrite RedNet 4) and the digital audio production software Reaper (Cockos Incorporated, Rosendale, NY, USA). This setup provided for a clean signal with minimal noise and a sharp, high-amplitude attack that was digitized into onset times in Matlab using a custom algorithm with thresholding and rejection of rebounds under 150 ms.

The octet group used a set of electronic pads which, unlike the bucket drums, allowed tuning the drums to distinct fundamental frequencies across a larger number of individual drums. Eight identical pads (Yamaha TP70 7.5” snare/tom) and eight independent powered studio reference monitors (Yorkville Sound YSM5) placed under each pad were connected to the head module (Yamaha DTX 920 K). The pads had linear amplitude response and, on impact, produced the same woodblock sound. The woodblock fundamental frequencies, about double those of the bucket drums, also varied in a narrow range (249–284 Hz). The two setups were similar as they both involved polymer impact surfaces and piezoelectric sensors embedded within those surfaces to detect impacts and record onset times. To record the electronic drums, the head module was connected to the computer which stored drum hits as MIDI events. The recording line was parallel to the drum set and did not contribute additional latency between drum impact and drum sound (5 ms). Separately, we measured that the recording pipeline added negligible timing variability with a SD of 1 ms relative to the ground truth based on a direct analog recording of the drum pad impacts.

The reference metronome that participants were asked to synchronize with was played over a dedicated speaker placed in the middle between participants. A custom patch for the sound synthesis and production software Pure Data (Puckette, 1997) ran on the computer to generate and time a sharp electronic kick drum sound, with a timbre very distinct from the individuals’ drums (600 Hz sine wave with 250 ms duration, 20 ms amplitude attack, and 230 ms decay). The stimulus was fed back and recorded as an additional channel along with the participants’ drum beats.

Stimuli

The stimulus tempo of the isochronous reference metronome varied between 50 and 240 beats per minute (bpm) depending on condition (see below). Trials were 90 s long, and the synchronization phase always included 32 stimulus events. Thus, slower tempos had longer duration synchronization phases.

Task

Drumming is ecologically valid in terms of common musical practice, and tapping with sticks has been shown to improve timing relative to finger tapping (Madison et al., 2013; Manning et al., 2017). The task on each trial was a SCT. During the synchronization phase, participants were presented with trials containing a reference isochronous metronome at a particular tempo and were asked, using drumsticks, to: (1) synchronize with the given reference metronome, (2) keep their own variability as low as possible, (3) keep their drumming rate as steady as possible, and (4) synchronize with the other drummers when in an ensemble condition. During the continuation phase, the reference metronome stopped, and participants were told to continue to keep their drumming rate as steady as possible while synchronizing with other drummers in an ensemble. Those in the octet group also completed trials in a synchronization-only control task in which the isochronous metronome played for the entire trial (i.e. there was no continuation phase). Participants were instructed to focus their eyes on an ‘X’ taped in the centers of their drums, thus reducing the visual coupling among group members in the ensemble conditions.

Design

Participants were assigned to one of three group sizes: duet (2), quartet (4), or octet (8). Those in the duet and quartet groups completed the SCT task in both solo and ensemble conditions. Duet and quartet groups completed six trials at each of the following tempos: 50, 80, 120, 160, 200, and 240 bpm (or 1200, 750, 500, 375, 300, and 250 ms onset-to-onset intervals between beats) in each of the solo and ensemble conditions, with the order of tempos randomized. For each set of participants, there was random assignment to either the solo or ensemble condition first. Trials lasted 90 s with a synchronization phase of 32 metronome beats.

For octet groups, there were three differences in the design. First, they did not complete solo trials because of time and space constraints. Second, the set of tempos was slightly different (80, 100, 120, 160, and 200 bpm or 750, 600, 500, 375, and 300 ms intervals). Finally, an additional condition was included, consisting of a synchronization-only control task (i.e. with the reference metronome throughout).

We found that the very low and very high tempos led to poor performance; hence, we eliminated extreme tempo trials and only analyzed the set of matching tempos across dyads, quartets, and octets, namely 80, 120, 160, and 200 bpm (750, 500, 375, and 300 ms). See the figure supplements for the extreme tempo conditions. For all groups, prior to completing the trials in each condition, participants engaged in practicing synchronized steady drumming without an initial metronome to set the tempo so that participants could become familiar with the sticks and drums and playing together in an ensemble.

Procedure

After providing informed consent, filling out demographic and music background questionnaires, reading instructions, and observing a demonstration of the apparatus, participants were instructed to find a comfortable posture on their chair and adjust the drum position during practice. They were asked not to adjust these further during the trials. Participants were allowed to practice as needed to feel comfortable with the task. After each trial, the experimenter verified with the participants that they were ready to start drumming again before the next trial was initiated. The whole session from arrival to departure lasted about an hour. Before asking for informed consent, the overall goals of the study were explained to the participants, including that the collected drumming data would potentially be presented anonymously at meetings and in scientific papers, and that the procedures had been approved by the McMaster Research Ethics Board.

During the solo performance in the duet condition, one participant was moved to an adjacent sound-proof room while the second remained in the large dance studio. For the quartet condition, two of the participants were randomly assigned to arrive earlier and perform their solo trials before the group trials and the other two to perform their solo trials after the group trials. In the octets, individuals did not perform solo trials.

Analysis

Pre-processing of onset times

Onset times were processed using custom Matlab scripts (an example script for cleaning and matching IOIs is available at https://gitlab.com/dodo_bird/group_sync_elife, copy archived at swh:1:rev:6479ed2e3409ebc156fbc917a246cf4c4edf4f44, Dobromir, 2022). In SCT trials, only data from the continuation phase was analyzed. In the synchronization-only control trials of the octet group, the full-trial length was analyzed. The following steps were taken to remove artifacts and condition the data according to the requirements of the analyses used.

Signal

On a few occasions, participants accidentally pulled the cable from the drum and disconnected or damaged the sensor. A total of 15 bad channels (i.e. participants) were detected and removed from further analysis with a custom script using the onset variability, signal noise, and amplitude consistency. For further verification, we visually inspected the signals and onset times from all channels and trials.

Group-aggregate (mean field) measure for ensembles and pseudo-ensembles

For ensemble trials, we created a measure of the beat onset times of the group as a whole by constructing theoretical group-aggregate onsets in the center of clusters of individuals’ onset times, tn,g, where n is the beat onset number, and g signifies it is the group-aggregate onset. Specifically, we: (1) convolved onset times with a Gaussian kernel 200 ms in width, (2) smoothed the resulting group activity rate with a moving average window of a quarter of the reference stimulus beat onset-to-onset time, and (3) took the peak locations for group-aggregate beat onset times (see Figure 8). This algorithm, typically used in neuroscience to extract a firing rate or population activity from the spike times of multiple recorded units (Dayan and Abbott, 2001), provides a measure of the central tendency of one or more events clustered in time. The method does not necessarily restrict the group-aggregate pattern to one average onset because if a single onset time is isolated away from other clusters, for example, when a participant erroneously hits the drum while the other participants were silent, it will also be detected as a single-event cluster. Such events of large individual deviations from the group were very rare in the present study and eliminated from analysis because, overall, participants performed the task consistently and in agreement with group members. Group-aggregate beat onset times were calculated for each trial in each group during ensemble playing conditions. In solo trials of dyads and quartets, participants could not see or hear each other, but we computed onset times of the pseudo-group-aggregate for a baseline.

Figure 8. Group-aggregate onset times from convolving the drummers’ onsets with a Gaussian kernel.

Figure 8.

From beat onset times to IOIs, outlier removal, and clustering

For each trial for each participant, the sequence of onset times tn,i, where i is participant and n onset number, was differenced to produce a time-series of inter-onset-intervals, IOIn,i=tnn,i – tn−1,i. Where appropriate for a more intuitive interpretation, we expressed these as tempos, Tn,i=60/IOIn,i. The same procedure was carried out to obtain the time-series of IOIs for the group-aggregate measure in ensemble conditions and the pseudo-aggregate measure in solo playing conditions.

If a participant skipped a beat on a certain occasion, this would produce an outlier IOI twice as large as most others in the trial, thus biasing subsequent measures. IOIs exceeding 50% difference above or below the trial median were removed. The measures of individual variability and tempo trend (see below) were based on these IOIs.

Alignment of IOIs

Quantifying inter-personal coordinated tapping with the cross-correlation requires pairing IOIs from all participants in a given trial. To this end, we used a custom algorithm to detect IOIs that were matching across all group members. To illustrate, if one group member skipped a beat, the produced IOI was about double that of the others, and the corresponding IOIs for all group members were removed.

Pre-whitening

Cross-correlations are difficult to interpret in the presence of non-stationarities such as slow drift up and down of the IOI. We applied a method for removing such non-stationarities by pre-whitening the IOIs (Dean and Dunsmuir, 2016). Specifically, each time-series of IOIs was filtered using an auto-regressive model with coefficients estimated with automatic model-order selection (‘pre-whiten’ function from the time series analysis package TSA for R).

Performance measures and dynamics

All subsequent measures were based on the individual participants’ timing data. For the sake of characterizing overall group performance, the group-aggregate timing was analyzed in the same way as was individual participant timing and reported as a separate condition.

Variability and tempo trend

The tempo trend was defined as the linear trend of IOIs along successive beats in a trial. The slope b in the regression equation Tn,i=a + bni was fitted separately in each trial for each individual. The variabilities of each individual and group-aggregate were estimated separately using the coefficient of variation of their de-trended and pre-whitened IOIs.

Autocorrelation

Individuals’ and group-aggregate dynamics were analyzed using autocorrelation of IOIs.

Inter-personal coordination

Cross-correlation

The temporal pattern of co-variation among participants was analyzed using cross-correlation applied to the pre-whitened IOIs.

Transfer entropy

The bidirectional interaction between every pair of participants in a group was measured using TE (Schreiber, 2000). This is an information-theoretic measure of directed mutual information used for quantifying the effective connectivity in, for example, moving animals (Brown et al., 2020), sensori-motor and motor-motor interactions (Dotov and Froese, 2018; Lungarella and Sporns, 2006; Stoffregen et al., 2009), and complex networks of neurons (Gourévitch and Eggermont, 2007). Effective connectivity, sometimes also referred to as causality, is understood in the Wiener sense that having information about past source dynamics improves the prediction of future target dynamics (Wiener, 1956). We used a Matlab toolbox for TE (Ito et al., 2011) adapted for point processes such as neuronal firing times where a delay d separated the source (J) and target (I) dynamics are binary events, TEJI(d)=p(it+1,it,jt+1d)log2p(it+1|it,jt+1d)p(it+1|it). The t indices here are bins of time, set to 10 ms. We took the maximum TE over a range of up to 100 delay steps (i.e. 1 s). We performed the analysis either with or without removing the trend of decreasing intervals between time indices in a trial (i.e. trend for tempo to speed up) and found this did not change the pattern of results.

Arguably, TE has several benefits over comparable methods (Lungarella et al., 2005). We identified another reason for applying TE to the sequence of time onsets t rather than what would be the first choice historically, namely TE or Granger causality among sequences of IOIs paired across participants. The latter scenario involves several steps of subtractive pre-processing consisting of removing outliers due to missed taps, pruning and aligning observations across participants, and filtering smooth trends that could be intrinsic to the performance, whereas applying a version of TE for point processes (sequences of onsets) avoids the steps which risk excessive data removal.

Network analysis

The measures in Performance measures and dynamics are univariate and describe drumming performance of individuals. This individual-level analysis does not say much about how participants coordinate with each other to achieve group performance. Inter-personal coordination lists bi-variate measures typically used to analyze dyadic tapping and sensorimotor coordination problems. Yet, even such pair-level analysis is not sufficient in the group context where more than two participants coordinate simultaneously with each other. Hence, we obtained group-level measures by analyzing the network of effective connectivity among drummers (Newman et al., 2011). Drummers were treated as nodes (or vertices) and TE values as the weights of links (or edges) of a directed graph.

Causal density and mean node strength

Among the rich set of phenomena that potentially can be quantified in a large network, only a few measures are relevant in the present context because of the low number of nodes and the real-valued nature of the graph consisting of continuous connection weights rather than binary links. We defined causal density as the average of all edge weights (Seth et al., 2011; Seth and Edelman, 2004). Mean node strength (Barrat et al., 2004), obtained using the igraph package for R (Csardi and Nepusz, 2006), was defined as the average across drummers of each drummer’s total outbound connectivity. This is a version of the more popular measure of node degree which is used for binary graphs.

Statistical analyses

Linear mixed effects models (Bates et al., 2015; Singer and Willett, 2003) were used instead of ANOVAs because of the complex, unbalanced design and missing observations. By taking different subsets of the data and fitting separate models, we tested for: (1) effects of group size in SCT trials comparing dyads, quartets, and octets, (2) differences between solo and ensemble playing conditions in dyads and quartets, (3) differences between SCT and synchronization-only in octets, and (4) differences between individual and group-aggregate performance. For brevity, only the selected model with best fit is reported in Results. The modeling procedure was incremental, starting with the minimal specification of a grand mean and iteratively including predictors and their interactions until the model became over-specified or too complex to converge, which tended to be the case when including tempo. The Satterthwaite method was used to determine significant effects and interactions (Kuznetsova et al., 2017). Effect sizes are given by the ω2 (Ben et al., 2020).

Model parameters

We simulated 100 runs per condition and each run corresponded to a trial with duration of 100 s. We used a sampling rate of 300 Hz and a third-order solver. Different sampling rates and using solvers with the Euler method or with an adaptive step size produced similar results. Performance variability was measured in terms of the coefficient of variation of cycle durations, treating zero-phase crossings as onset times, reproducing the behavioral data qualitatively. The pulse in Equation 5 with parameters a=1.25 and b=0.02 specifies an asymmetric gamma distribution that rises sharply at time 0 and then decays slowly in about 200 ms. The code needed to run the full simulation is available at https://gitlab.com/dodo_bird/group_sync_elife.

Conditions consisted of the full crossing of group size N=(2,4,8) with coupling strength (K=0 in solo and K=8 in ensemble). The noise term had a mean of zero and variability σ=6. Intrinsic frequencies ωi were drawn from a normal distribution with a mean of 2 Hz and SD of 0.5. Comfortable finger tapping in humans is in the range 1.5–2 Hz (McAuley et al., 2006), tapping is optimally precise in the range 0.8–2.5 Hz (Moelants et al., 2002), and walking cadence on average is at 2 steps/s (MacDougall and Moore, 2005).

Acknowledgements

We would like to thank Daniel Bosnyak, Sally Stafford, Susan Marsh-Rollo, and LIVELab staff. This research was supported by grants to LJT from the Social Science and Humanities Research Council of Canada, the Natural Sciences and Engineering Research Council of Canada, the Canadian Institutes of Health Research, and the Canadian Institute for Advanced Research.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Dobromir Dotov, Email: ddotov@unomaha.edu.

Martin Hartmann, University of Jyväskylä, Finland.

Aleksandra M Walczak, CNRS LPENS, France.

Funding Information

This paper was supported by the following grants:

  • Social Sciences and Humanities Research Council of Canada Insight Grant 435-2020-0442 to Laurel Trainor.

  • Natural Sciences and Engineering Research Council of Canada RGPIN-2019-05416 to Laurel Trainor.

  • CIFAR to Laurel Trainor.

Additional information

Competing interests

No competing interests declared.

No competing interests declared.

Author contributions

Conceptualization, Data curation, Software, Formal analysis, Methodology, Writing – original draft, Writing – review and editing.

Formal analysis, Writing – original draft, Writing – review and editing.

Conceptualization, Methodology, Writing – review and editing.

Conceptualization, Software, Formal analysis, Supervision, Writing – original draft, Writing – review and editing.

Conceptualization, Formal analysis, Supervision, Funding acquisition, Methodology, Writing – original draft, Writing – review and editing.

Ethics

Before asking for informed consent, the overall goals of the study were explained to the participants, including that the collected drumming data would potentially be presented anonymously at meetings and in scientific papers, and that the procedure had been approved by the McMaster Research Ethics Board, protocol #2164.

Additional files

Supplementary file 1. Supplementary table.

Separate linear mixed-effects models were fitted for each of lags 1–8 of the autocorrelations of individuals (a) and group aggregates (b) in the continuation phases of synchronization-continuation task (SCT) trials, ensemble conditions. The predictors were group size (N) and Tempo (80, 120, 160, and 200 beats per minute [bpm]). For simplicity, the same full model was fitted in all cases for individuals, Yik=(β00g)+β1Nik2Tempoik3NikTempoikg, where i is participant, k trial, and g group. For the group aggregate (b) where there was not enough data to fit models with all predictors, Yik=(β00g)+β1Nik2Tempoikg was used. Significant coefficients (p<.05, Satterthwaite method) are in bold.

elife-74816-supp1.docx (20.2KB, docx)
Transparent reporting form

Data availability

The current manuscript is a computational study, so no data have been generated for this manuscript.

References

  1. Alderisio F, Bardy BG, di Bernardo M. Entrainment and synchronization in networks of rayleigh-van der pol oscillators with diffusive and haken-kelso-bunz couplings. Biol Cybern. 2016;110:151–169. doi: 10.1007/s00422-016-0685-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bardy BG, Calabrese C, De Lellis P, Bourgeaud S, Colomer C, Pla S, di Bernardo M. Moving in unison after perceptual interruption. Scientific Reports. 2020;10:1–13. doi: 10.1038/s41598-020-74914-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Barrat A, Barthélemy M, Pastor-Satorras R, Vespignani A. The architecture of complex weighted networks. PNAS. 2004;101:3747–3752. doi: 10.1073/pnas.0400087101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. Journal of Statistical Software. 2015;67:1–48. doi: 10.18637/jss.v067.i01. [DOI] [Google Scholar]
  5. Bégel V, Demos AP, Wang M, Palmer C. Social interaction and rate effects in models of musical synchronization. Frontiers in Psychology. 2022;13:865536. doi: 10.3389/fpsyg.2022.865536. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Ben M, Lüdecke D, Makowski D. Effectsize: estimation of effect size indices and standardized parameters. Journal of Open Source Software. 2020;5:2815. doi: 10.21105/joss.02815. [DOI] [Google Scholar]
  7. Benadon F, McGraw A, Robinson M. Quantitative analysis of temporal structure in cuban guaguancó drumming. Music & Science. 2018;1:205920431878264. doi: 10.1177/2059204318782642. [DOI] [Google Scholar]
  8. Berdahl AM, Kao AB, Flack A, Westley PAH, Codling EA, Couzin ID, Dell AI, Biro D. Collective animal navigation and migratory culture: from theoretical models to empirical evidence. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 2018;373:20170009. doi: 10.1098/rstb.2017.0009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bishop L, Cancino-Chacón C, Goebl W. Moving to communicate, moving to interact. Music Perception. 2019;37:1–25. doi: 10.1525/mp.2019.37.1.1. [DOI] [Google Scholar]
  10. Breakspear M, Heitmann S, Daffertshofer A. Generative models of cortical oscillations: neurobiological implications of the kuramoto model. Frontiers in Human Neuroscience. 2010;4:190. doi: 10.3389/fnhum.2010.00190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Brown J, Bossomaier T, Barnett L. Information flow in finite flocks. Scientific Reports. 2020;10:3837. doi: 10.1038/s41598-020-59080-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Burger B, Thompson MR, Luck G, Saarikallio S, Toiviainen P. Influences of rhythm- and timbre-related musical features on characteristics of music-induced movement. Frontiers in Psychology. 2013;4:183. doi: 10.3389/fpsyg.2013.00183. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chang A, Livingstone SR, Bosnyak DJ, Trainor LJ. Body sway reflects leadership in joint music performance. PNAS. 2017;114:E4134–E4141. doi: 10.1073/pnas.1617657114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Chang A, Kragness HE, Livingstone SR, Bosnyak DJ, Trainor LJ. Body sway reflects joint emotional expression in music ensemble performance. Scientific Reports. 2019;9:1–11. doi: 10.1038/s41598-018-36358-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Chaté H, Ginelli F, Grégoire G, Peruani F, Raynaud F. Modeling collective motion: variations on the vicsek model. The European Physical Journal B. 2008;64:451–456. doi: 10.1140/epjb/e2008-00275-9. [DOI] [Google Scholar]
  16. Chauvigné LAS, Walton A, Richardson MJ, Brown S. Multi-person and multisensory synchronization during group dancing. Human Movement Science. 2019;63:199–208. doi: 10.1016/j.humov.2018.12.005. [DOI] [PubMed] [Google Scholar]
  17. Cirelli LK, Einarson KM, Trainor LJ. Interpersonal synchrony increases prosocial behavior in infants. Developmental Science. 2014;17:1003–1011. doi: 10.1111/desc.12193. [DOI] [PubMed] [Google Scholar]
  18. Cirelli LK, Wan SJ, Trainor LJ. Social effects of movement synchrony: increased infant helpfulness only transfers to affiliates of synchronously moving partners. Infancy. 2016;21:807–821. doi: 10.1111/infa.12140. [DOI] [Google Scholar]
  19. Cirelli LK, Trehub SE, Trainor LJ. Rhythm and melody as social signals for infants. Annals of the New York Academy of Sciences. 2018;1423:66–72. doi: 10.1111/nyas.13580. [DOI] [PubMed] [Google Scholar]
  20. Conard NJ, Malina M, Münzel SC. New flutes document the earliest musical tradition in southwestern Germany. Nature. 2009;460:737–740. doi: 10.1038/nature08169. [DOI] [PubMed] [Google Scholar]
  21. Couzin ID. Synchronization: the key to effective communication in animal collectives. Trends in Cognitive Sciences. 2018;22:844–846. doi: 10.1016/j.tics.2018.08.001. [DOI] [PubMed] [Google Scholar]
  22. Csardi G, Nepusz T. The igraph software package for complex network research. InterJournal Complex Systems. 2006;1695:1–9. [Google Scholar]
  23. Csikszentmihalyi M. Flow: The Psychology of Optimal Experience. Harper & Row; 1990. [Google Scholar]
  24. D’Amario S, Daffern H, Bailes F. Synchronization in singing DUO performances: the roles of visual contact and leadership instruction. Frontiers in Psychology. 2018;9:1208. doi: 10.3389/fpsyg.2018.01208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. D’Ausilio A, Novembre G, Fadiga L, Keller PE. What can music tell us about social interaction? Trends in Cognitive Sciences. 2015;19:111–114. doi: 10.1016/j.tics.2015.01.005. [DOI] [PubMed] [Google Scholar]
  26. Davies M, Madison G, Silva P, Gouyon F. The effect of microtiming deviations on the perception of groove in short rhythms. Music Perception. 2013;30:497–510. doi: 10.1525/mp.2013.30.5.497. [DOI] [Google Scholar]
  27. Dayan P, Abbott LF. Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems. MIT Press; 2001. [Google Scholar]
  28. Dean RT, Dunsmuir WTM. Dangers and uses of cross-correlation in analyzing time series in perception, performance, movement, and neuroscience: the importance of constructing transfer function autoregressive models. Behavior Research Methods. 2016;48:783–802. doi: 10.3758/s13428-015-0611-2. [DOI] [PubMed] [Google Scholar]
  29. De Jaegher H, Froese T. On the role of social interaction in individual agency. Adaptive Behavior. 2009;17:444–460. doi: 10.1177/1059712309343822. [DOI] [Google Scholar]
  30. Dobromir D. Group_sync_elife. swh:1:rev:6479ed2e3409ebc156fbc917a246cf4c4edf4f44Software Heritage. 2022 https://archive.softwareheritage.org/swh:1:dir:a203ffdbf29ac2532c1625c7aa397e43fbdb6eb4;origin=https://gitlab.com/dodo_bird/group_sync_elife;visit=swh:1:snp:7d9f1691d935937d923f4090877180af0adc599b;anchor=swh:1:rev:6479ed2e3409ebc156fbc917a246cf4c4edf4f44
  31. Dotov D, Froese T. Entraining chaotic dynamics: a novel movement sonification paradigm could promote generalization. Human Movement Science. 2018;61:27–41. doi: 10.1016/j.humov.2018.06.016. [DOI] [PubMed] [Google Scholar]
  32. Dotov DG, Cochen de Cock V, Geny C, Ihalainen P, Moens B, Leman M, Bardy B, Dalla Bella S. The role of interaction and predictability in the spontaneous entrainment of movement. Journal of Experimental Psychology. General. 2019;148:1041–1057. doi: 10.1037/xge0000609. [DOI] [PubMed] [Google Scholar]
  33. Dumas G, de Guzman GC, Tognoli E, Kelso JAS. The human dynamic clamp as a paradigm for social interaction. PNAS. 2014;111:E3726–E3734. doi: 10.1073/pnas.1407486111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Farine DR, Strandburg-Peshkin A, Couzin ID, Berger-Wolf TY, Crofoot MC. Individual variation in local interaction rules can explain emergent patterns of spatial organization in wild baboons. Proceedings. Biological Sciences. 2017;284:20162243. doi: 10.1098/rspb.2016.2243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Frank TD, Daffertshofer A, Peper CE, Beek PJ, Haken H. Towards a comprehensive theory of brain activity. Physica D. 2000;144:62–86. doi: 10.1016/S0167-2789(00)00071-3. [DOI] [Google Scholar]
  36. Friston K, Frith C. A duet for one. Consciousness and Cognition. 2015;36:390–405. doi: 10.1016/j.concog.2014.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Frith U, Frith C. The biological basis of social interaction. Current Directions in Psychological Science. 2001;10:151–155. doi: 10.1111/1467-8721.00137. [DOI] [Google Scholar]
  38. Froese T, Iizuka H, Ikegami T. Embodied social interaction constitutes social cognition in pairs of humans: a minimalist virtual reality experiment. Scientific Reports. 2014;4:3672. doi: 10.1038/srep03672. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Gaggioli A, Chirico A, Mazzoni E, Milani L, Riva G. Networked flow in musical bands. Psychology of Music. 2017;45:283–297. doi: 10.1177/0305735616665003. [DOI] [Google Scholar]
  40. Galton F. Vox populi (the wisdom of crowds) Nature. 1907;75:450–451. doi: 10.1038/075450a0. [DOI] [Google Scholar]
  41. Gamba M, Torti V, Estienne V, Randrianarison RM, Valente D, Rovara P, Bonadonna G, Friard O, Giacoma C. The indris have got rhythm! timing and pitch variation of a primate song examined between sexes and age classes. Frontiers in Neuroscience. 2016;10:249. doi: 10.3389/fnins.2016.00249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Goldstone RL, Roberts ME. Self-organized TRAIL systems in groups of humans. Complexity. 2006;11:43–50. doi: 10.1002/cplx.20135. [DOI] [Google Scholar]
  43. Gourévitch B, Eggermont JJ. Evaluating information transfer between auditory cortical neurons. Journal of Neurophysiology. 2007;97:2533–2543. doi: 10.1152/jn.01106.2006. [DOI] [PubMed] [Google Scholar]
  44. Greenfield MD, Marin-Cudraz T, Party V. Evolution of synchronies in insect choruses. Biological Journal of the Linnean Society. 2017;122:487–504. doi: 10.1093/biolinnean/blx096. [DOI] [Google Scholar]
  45. Greenfield MD, Aihara I, Amichay G, Anichini M, Nityananda V. Rhythm interaction in animal groups: selective attention in communication networks. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 2021;376:20200338. doi: 10.1098/rstb.2020.0338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Hart Y, Noy L, Feniger-Schaal R, Mayo AE, Alon U. Individuality and togetherness in joint improvised motion. PLOS ONE. 2014;9:e87213. doi: 10.1371/journal.pone.0087213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Heggli OA, Cabral J, Konvalinka I, Vuust P, Kringelbach ML. A kuramoto model of self-other integration across interpersonal synchronization strategies. PLOS Computational Biology. 2019;15:e1007422. doi: 10.1371/journal.pcbi.1007422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Heggli OA, Konvalinka I, Kringelbach ML, Vuust P. A metastable attractor model of self-other integration (MEAMSO) in rhythmic synchronization. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 2021;376:20200332. doi: 10.1098/rstb.2020.0332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Honing H, ten Cate C, Peretz I, Trehub SE. Without it no music: cognition, biology and evolution of musicality. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 2015;370:20140088. doi: 10.1098/rstb.2014.0088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Hove MJ, Risen JL. It’s all in the timing: interpersonal synchrony increases affiliation. Social Cognition. 2009;27:949–960. doi: 10.1521/soco.2009.27.6.949. [DOI] [Google Scholar]
  51. Hove MJ, Marie C, Bruce IC, Trainor LJ. Superior time perception for lower musical pitch explains why bass-ranged instruments lay down musical rhythms. PNAS. 2014;111:10383–10388. doi: 10.1073/pnas.1402039111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Hove MJ, Martinez SA, Stupacher J. Feel the bass: music presented to tactile and auditory modalities increases aesthetic appreciation and body movement. Journal of Experimental Psychology. General. 2020;149:1137–1147. doi: 10.1037/xge0000708. [DOI] [PubMed] [Google Scholar]
  53. Huron D. Is music an evolutionary adaptation? Annals of the New York Academy of Sciences. 2001;930:43–61. doi: 10.1111/j.1749-6632.2001.tb05724.x. [DOI] [PubMed] [Google Scholar]
  54. Insel TR. The challenge of translation in social neuroscience: a review of oxytocin, vasopressin, and affiliative behavior. Neuron. 2010;65:768–779. doi: 10.1016/j.neuron.2010.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Ispolatov Y. Computing in fish schools. eLife. 2015;5:e12852. doi: 10.7554/eLife.12852. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Ito S, Hansen ME, Heiland R, Lumsdaine A, Litke AM, Beggs JM. Extending transfer entropy improves identification of effective connectivity in a spiking cortical network model. PLOS ONE. 2011;6:e27431. doi: 10.1371/journal.pone.0027431. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Jacobs PA, Lengauer CC, Bruschi CV, Garre M, Gjuracic K, Mccoy EE. From disorder to order in marching locusts. Science. 2007;317:490–494. doi: 10.5061/dryad.5t110.Supplementary. [DOI] [Google Scholar]
  58. Keller PE, Novembre G, Hove MJ. Rhythm in joint action: psychological and neurophysiological mechanisms for real-time interpersonal coordination. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 2014;369:20130394. doi: 10.1098/rstb.2013.0394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Keller PE, König R, Novembre G. Simultaneous cooperation and competition in the evolution of musical behavior: sex-related modulations of the singer’s formant in human chorusing. Frontiers in Psychology. 2017;8:1559. doi: 10.3389/fpsyg.2017.01559. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Kelso JAS. Unifying large- and small-scale theories of coordination. Entropy. 2021;23:537. doi: 10.3390/e23050537. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Konvalinka I, Vuust P, Roepstorff A, Frith CD. Follow you, follow me: continuous mutual prediction and adaptation in joint tapping. Quarterly Journal of Experimental Psychology. 2010;63:2220–2230. doi: 10.1080/17470218.2010.497843. [DOI] [PubMed] [Google Scholar]
  62. Kugler PN, Turvey MT. Information, Natural Law, and the Self-Assembly of Rhythmic Movement. New Jersey: Lawrence Erlbaum Associates, Inc; 1987. [Google Scholar]
  63. Kuramoto Y. Self-entrainment of a population of coupled non-linear oscillators. International Symposium on Mathematical Problems in Theoretical Physics.1975. [Google Scholar]
  64. Kuznetsova A, Brockhoff PB, Christensen RHB. lmertest package: tests in linear mixed effects models. Journal of Statistical Software. 2017;82:1–26. doi: 10.18637/jss.v082.i13. [DOI] [Google Scholar]
  65. Liebermann-Jordanidis H, Novembre G, Koch I, Keller PE. Simultaneous self-other integration and segregation support real-time interpersonal coordination in a musical joint action task. Acta Psychologica. 2021;218:103348. doi: 10.1016/j.actpsy.2021.103348. [DOI] [PubMed] [Google Scholar]
  66. Linson A, Clarke EF. In: Distributed Creativity: Collaboration and Improvisation in Contemporary Music. Clarke EF, Doffman M, editors. New York: Oxford University Press; 2017. Distributed cognition, ecological theory, and group improvisation; pp. 52–69. [Google Scholar]
  67. Lucas G, Clayton M, Leante L. Inter-group entrainment in afro-brazilian congado ritual. Empirical Musicology Review. 2011;6:75–102. doi: 10.18061/1811/51203. [DOI] [Google Scholar]
  68. Lungarella M, Pegors T, Bulwinkle D, Sporns O. Methods for quantifying the informational structure of sensory and motor data. Neuroinformatics. 2005;3:243–262. doi: 10.1385/NI:3:3:243. [DOI] [PubMed] [Google Scholar]
  69. Lungarella M, Sporns O. Mapping information flow in sensorimotor networks. PLOS Computational Biology. 2006;2:e144. doi: 10.1371/journal.pcbi.0020144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. MacDougall HG, Moore ST. Marching to the beat of the same drummer: the spontaneous tempo of human locomotion. Journal of Applied Physiology. 2005;99:1164–1173. doi: 10.1152/japplphysiol.00138.2005. [DOI] [PubMed] [Google Scholar]
  71. Madison G, Karampela O, Ullén F, Holm L. Effects of practice on variability in an isochronous serial interval production task: asymptotical levels of tapping variability after training are similar to those of musicians. Acta Psychologica. 2013;143:119–128. doi: 10.1016/j.actpsy.2013.02.010. [DOI] [PubMed] [Google Scholar]
  72. Mann RP, Faria J, Sumpter DJT, Krause J. The dynamics of audience applause. Journal of the Royal Society, Interface. 2013;10:20130466. doi: 10.1098/rsif.2013.0466. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Manning FC, Harris J, Schutz M. Temporal prediction abilities are mediated by motor effector and rhythmic expertise. Experimental Brain Research. 2017;235:861–871. doi: 10.1007/s00221-016-4845-8. [DOI] [PubMed] [Google Scholar]
  74. McAuley JD, Jones MR, Holub S, Johnston HM, Miller NS. The time of our lives: life span development of timing and event tracking. Journal of Experimental Psychology. General. 2006;135:348–367. doi: 10.1037/0096-3445.135.3.348. [DOI] [PubMed] [Google Scholar]
  75. Miller N, Garnier S, Hartnett AT, Couzin ID. Both information and social cohesion determine collective decisions in animal groups. PNAS. 2013;110:5263–5268. doi: 10.1073/pnas.1217513110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Moelants D, Stevens C, Burnham D, McPherson G, Schubert E, Renwick J. Preferred tempo reconsidered. Proceedings of the 7th International Conference on Music Perception and Cognition; 2002. pp. 580–583. [Google Scholar]
  77. Mogan R, Fischer R, Bulbulia JA. To be in synchrony or not? A meta-analysis of synchrony’s effects on behavior, perception, cognition and affect. Journal of Experimental Social Psychology. 2017;72:13–20. doi: 10.1016/j.jesp.2017.03.009. [DOI] [Google Scholar]
  78. Newman MEJ, Barabási AL, Watts DJ. The Structure and Dynamics of Networks. Princeton University Press; 2011. [DOI] [Google Scholar]
  79. Noori R, Park D, Griffiths JD, Bells S, Frankland PW, Mabbott D, Lefebvre J. Activity-dependent myelination: A glial mechanism of oscillatory self-organization in large-scale brain networks. PNAS. 2020;117:13227–13237. doi: 10.1073/pnas.1916646117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Novembre G, Sammler D, Keller PE. Neural alpha oscillations index the balance between self-other integration and segregation in real-time joint action. Neuropsychologia. 2016;89:414–425. doi: 10.1016/j.neuropsychologia.2016.07.027. [DOI] [PubMed] [Google Scholar]
  81. Okano M, Shinya M, Kudo K. Paired synchronous rhythmic finger tapping without an external timing cue shows greater speed increases relative to those for solo tapping. Scientific Reports. 2017;7:43987. doi: 10.1038/srep43987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Oullier O, de Guzman GC, Jantzen KJ, Lagarde J, Kelso JAS. Social coordination dynamics: measuring human bonding. Social Neuroscience. 2008;3:178–192. doi: 10.1080/17470910701563392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Parrish JK, Viscido SV, Grünbaum D. Self-organized fish schools: an examination of emergent properties. The Biological Bulletin. 2002;202:296–305. doi: 10.2307/1543482. [DOI] [PubMed] [Google Scholar]
  84. Patel AD, Iversen JR. The evolutionary neuroscience of musical beat perception: the action simulation for auditory prediction (ASAP) hypothesis. Frontiers in Systems Neuroscience. 2014;8:57. doi: 10.3389/fnsys.2014.00057. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Prokopenko M, Polani D, Ay N. In: In Guided Self-Organization: Inception. Prokopenko M, editor. Berlin, Heidelberg: Springer; 2014. On the cross-disciplinary nature of guided self-organisation; pp. 3–15. [DOI] [Google Scholar]
  86. Przyrembel M, Smallwood J, Pauen M, Singer T. Illuminating the dark matter of social neuroscience: considering the problem of social interaction from philosophical, psychological, and neuroscientific perspectives. Frontiers in Human Neuroscience. 2012;6:190. doi: 10.3389/fnhum.2012.00190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Puckette MS. Pure Data. Proceedings of the International Computer Music Conference; Thessaloniki, Greece. 1997. [Google Scholar]
  88. Ragert M, Fairhurst MT, Keller PE. Segregation and integration of auditory streams when listening to multi-part music. PLOS ONE. 2014;9:e84085. doi: 10.1371/journal.pone.0084085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Ravignani A, Bowling DL, Fitch WT. Chorusing, synchrony, and the evolutionary functions of rhythm. Frontiers in Psychology. 2014;5:1–15. doi: 10.3389/fpsyg.2014.01118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Ravignani A, Verga L, Greenfield MD. Interactive rhythms across species: the evolutionary biology of animal chorusing and turn-taking. Annals of the New York Academy of Sciences. 2019;1453:12–21. doi: 10.1111/nyas.14230. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Repp BH. Sensorimotor synchronization: a review of the tapping literature. Psychonomic Bulletin & Review. 2005;12:969–992. doi: 10.3758/bf03206433. [DOI] [PubMed] [Google Scholar]
  92. Rio KW, Dachner GC, Warren WH. Local interactions underlying collective motion in human crowds. Proceedings of the Royal Society Biological Sciences. 2018;285:20180611. doi: 10.1098/rspb.2018.0611. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Roman IR, Washburn A, Large EW, Chafe C, Fujioka T. Delayed feedback embedded in perception-action coordination cycles results in anticipation behavior during synchronized rhythmic action: a dynamical systems approach. PLOS Computational Biology. 2019;15:e1007371. doi: 10.1371/journal.pcbi.1007371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Rosenthal SB, Twomey CR, Hartnett AT, Wu HS, Couzin ID. Revealing the hidden networks of interaction in mobile animal groups allows prediction of complex behavioral contagion. PNAS. 2015;112:4690–4695. doi: 10.1073/pnas.1420068112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Salimpoor VN, Benovoy M, Larcher K, Dagher A, Zatorre RJ. Anatomically distinct dopamine release during anticipation and experience of peak emotion to music. Nature Neuroscience. 2011;14:257–262. doi: 10.1038/nn.2726. [DOI] [PubMed] [Google Scholar]
  96. Satne G, Salice A. In: Minimal Cooperation and Shared Agency. Fiebich A, editor. Springer; 2018. Shared intentionality and the cooperative evolutionary hypothesis; pp. 71–92. [Google Scholar]
  97. Savage PE, Brown S, Sakai E, Currie TE. Statistical universals reveal the structures and functions of human music. PNAS. 2015;112:8987–8992. doi: 10.1073/pnas.1414495112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Savage PE, Loui P, Tarr B, Schachner A, Glowacki L, Mithen S, Fitch WT. Music as a coevolved system for social bonding. The Behavioral and Brain Sciences. 2020;44:e59. doi: 10.1017/S0140525X20000333. [DOI] [PubMed] [Google Scholar]
  99. Schilbach L, Timmermans B, Reddy V, Costall A, Bente G, Schlicht T, Vogeley K. Authors’ response: a second-person neuroscience in interaction. The Behavioral and Brain Sciences. 2013;36:441–462. doi: 10.1017/s0140525x12002452. [DOI] [PubMed] [Google Scholar]
  100. Schmidt RC, Carello C, Turvey MT. Phase transitions and critical fluctuations in the visual coordination of rhythmic movements between people. Journal of Experimental Psychology. Human Perception and Performance. 1990;16:227–247. doi: 10.1037//0096-1523.16.2.227. [DOI] [PubMed] [Google Scholar]
  101. Schmidt RC, O’Brien B. Evaluating the dynamics of unintended interpersonal coordination. Ecological Psychology. 1997;9:189–206. doi: 10.1207/s15326969eco0903_2. [DOI] [Google Scholar]
  102. Schreiber T. Measuring information transfer. Physical Review Letters. 2000;85:461–464. doi: 10.1103/PhysRevLett.85.461. [DOI] [PubMed] [Google Scholar]
  103. Seth AK, Edelman GM. Environment and behavior influence the complexity of evolved neural networks. Adaptive Behavior. 2004;12:5–20. doi: 10.1177/105971230401200103. [DOI] [Google Scholar]
  104. Seth AK, Barrett AB, Barnett L. Causal density and integrated information as measures of conscious level. Philosophical Transactions. Series A, Mathematical, Physical, and Engineering Sciences. 2011;369:3748–3767. doi: 10.1098/rsta.2011.0079. [DOI] [PubMed] [Google Scholar]
  105. Shahal S, Wurzberg A, Sibony I, Duadi H, Shniderman E, Weymouth D, Davidson N, Fridman M. Synchronization of complex human networks. Nature Communications. 2020;11:3854. doi: 10.1038/s41467-020-17540-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Silverberg JL, Bierbaum M, Sethna JP, Cohen I. Collective motion of humans in mosh and circle pits at heavy metal concerts. Physical Review Letters. 2013;110:228701. doi: 10.1103/PhysRevLett.110.228701. [DOI] [PubMed] [Google Scholar]
  107. Singer JD, Willett JB. Applied Longitudinal Data Analysis: Modeling Change and Event Occurrence. Oxford University Press; 2003. [DOI] [Google Scholar]
  108. Stoffregen TA, Villard S, Kim CG, Ito K, Bardy BG. Coupling of head and body movement with motion of the audible environment. Journal of Experimental Psychology. Human Perception and Performance. 2009;35:1221–1231. doi: 10.1037/a0014251. [DOI] [PubMed] [Google Scholar]
  109. Strogatz SH. From kuramoto to Crawford: exploring the onset of synchronization in populations of coupled oscillators. Physica D. 2000;143:1–20. doi: 10.1016/S0167-2789(00)00094-4. [DOI] [Google Scholar]
  110. Sumpter DJT. The principles of collective animal behaviour. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 2006;361:5–22. doi: 10.1098/rstb.2005.1733. [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. Surowiecki J. The Wisdom of Crowds. Anchor; 2005. [Google Scholar]
  112. Thomson M, Murphy K, Lukeman R. Groups clapping in unison undergo size-dependent error-induced frequency increase. Scientific Reports. 2018;8:808. doi: 10.1038/s41598-017-18539-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  113. Tomasello M. The Cultural Origins of Human Cognition. Cambridge: Harvard University Press; 2000. [DOI] [Google Scholar]
  114. Tomasello M. A Natural History of Human Thinking. Harvard University Press; 2014. [DOI] [Google Scholar]
  115. Torney C, Neufeld Z, Couzin ID. Context-dependent interaction leads to emergent search behavior in social aggregates. PNAS. 2009;106:22055–22060. doi: 10.1073/pnas.0907929106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  116. Trainor LJ. The origins of music in auditory scene analysis and the roles of evolution and culture in musical creation. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 2015;370:20140089. doi: 10.1098/rstb.2014.0089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Trainor LJ, Cirelli L. Rhythm and interpersonal synchrony in early social development. Annals of the New York Academy of Sciences. 2015;1337:45–52. doi: 10.1111/nyas.12649. [DOI] [PubMed] [Google Scholar]
  118. Trainor LJ, Marsh-Rollo S. In: The Oxford Handbook of Music and the Brain. Thaut MH, Hodges DA, editors. Oxford Academic; 2019. Rhythm, meter, and timing: the heartbeat of musical development; pp. 591–622. [DOI] [Google Scholar]
  119. Valdesolo P, Ouyang J, DeSteno D. The rhythm of joint action: synchrony promotes cooperative ability. Journal of Experimental Social Psychology. 2010;46:693–695. doi: 10.1016/j.jesp.2010.03.004. [DOI] [Google Scholar]
  120. van der Steen MCM, Keller PE. The adaptation and anticipation model (ADAM) of sensorimotor synchronization. Frontiers in Human Neuroscience. 2013;7:253. doi: 10.3389/fnhum.2013.00253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  121. Vicsek T, Czirók A, Ben-Jacob E, Cohen I, Shochet O. Novel type of phase transition in a system of self-driven particles. Physical Review Letters. 1995;75:1226–1229. doi: 10.1103/PhysRevLett.75.1226. [DOI] [PubMed] [Google Scholar]
  122. Vilar L, Araújo D, Davids K, Bar-Yam Y. Science of winning soccer: emergent pattern-forming dynamics in association football. Journal of Systems Science and Complexity. 2013;26:73–84. doi: 10.1007/s11424-013-2286-z. [DOI] [Google Scholar]
  123. Warren WH. Collective motion in human crowds. Current Directions in Psychological Science. 2018;27:232–240. doi: 10.1177/0963721417746743. [DOI] [PMC free article] [PubMed] [Google Scholar]
  124. Wiener N. In: In Modern Mathematics for Engineers. Wiener N, editor. Dover Publications; 1956. The theory of prediction; pp. 323–329. [Google Scholar]
  125. Wolf T, Vesper C, Sebanz N, Keller PE, Knoblich G. Combining phase advancement and period correction explains rushing during joint rhythmic activities. Scientific Reports. 2019;9:9350. doi: 10.1038/s41598-019-45601-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  126. Wolpert DM, Doya K, Kawato M. A unifying computational framework for motor control and social interaction. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 2003;358:593–602. doi: 10.1098/rstb.2002.1238. [DOI] [PMC free article] [PubMed] [Google Scholar]
  127. Zahavi D, Satne G. In: Beyond the Analytic-Continental Divide: Pluralist Philosophy in the Twenty-First Century. Bell JA, Cutrofello A, Livingston PM, editors. Taylor and Francis; 2015. Varieties of shared intentionality: tomasello and classical phenomenology. [DOI] [Google Scholar]
  128. Zentner M, Eerola T. Rhythmic engagement with music in infancy. PNAS. 2010;107:5768–5773. doi: 10.1073/pnas.1000121107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  129. Zhang M, Beetle C, Kelso JAS, Tognoli E. Connecting empirical phenomena and theoretical models of biological coordination across scales. Journal of the Royal Society, Interface. 2019;16:20190360. doi: 10.1098/rsif.2019.0360. [DOI] [PMC free article] [PubMed] [Google Scholar]

Editor's evaluation

Martin Hartmann 1

Taking joint drumming as a model of collective dynamics, and combining solid quantitative methods, the authors characterize how human behavior changes, at the individual- and group-level, as a function of group numerosity. A take-home message of this important work is that not everything we know from studies involving dyads should be necessarily generalized to larger groups. This study will be of great interest to scientists looking for new approaches to understanding group behavior, especially within the fields of human cognition, neurosciences, and musicology.

Decision letter

Editor: Martin Hartmann1
Reviewed by: Michael Greenfield2

Our editorial process produces two outputs: i) public reviews designed to be posted alongside the preprint for the benefit of readers; ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Decision letter after peer review:

Thank you for submitting your article "Collective dynamics support group drumming, reduce variability, and stabilize tempo drift" for consideration by eLife. Your article has been reviewed by 3 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Aleksandra Walczak as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Michael Greenfield (Reviewer #2).

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential revisions:

1) The data do fit, to some extent, the proposed hypothesis (response to the mean of all neighbors). However, as noted by Reviewer #2, there are alternative hypotheses which may be as good or better.

2) The fact that the 'agents' could see as well as hear each other poses some problems for adapting the Kuramoto model correctly. Moreover, selective attention (to only some neighbors) has not been dealt with in the proposed hypotheses or in the perceptual and response mechanisms of the individual agents. The issue that individuals might have adapted more strongly to the output of their neighbours should be discussed in the article.

3) Both reviewers #1 and #3 recommended that the article should clarify which of the conclusions from the study might be generalisable to musical activities.

4) Statements regarding predictive mechanisms should be fully justified. Note also the point made by Reviewer #1: the task used here requires minimal prediction due to the constant tempo of the task.

Reviewer #1 (Recommendations for the authors):

Here are some comments and suggestions.

1. Abstract. I would try to be more explicit about potential dissociations between dyadic vs. large group interactions (i.e. comparing this study with previous ones focusing on dyads). The authors might even attempt to refer to this issue more explicitly in the abstract.

2. Introduction, page 4, "arising from pressures to either collaborate or compete". The study by Keller et al., 2017 (Frontiers in Psychology), showing that collaboration and competition are not always alternatives in joint music making, might also be relevant here.

3. Introduction, page 4, "Group music making constitutes an ecologically valid and convenient paradigm for studying group action and collective experience in the laboratory". This statement would be better supported by a reference, e.g. D'Ausilio et al., 2015 (TICS).

4. Introduction. The role of "prediction" in "interpersonal synchronization" is discussed briefly on two occasions in the introduction (page 4 and page 5, final sentence). I am not sure the authors have sufficiently conveyed their message. I believe they should better explain how prediction would emerge "implicitly" from the dynamic models they are referring to. Ideally, they should provide a concrete example of this, and support this with some data from previous studies. Otherwise, I am not sure a naïve reader will be able to make much of this issue.

5. Methods. The equipment used across small and large groups is slightly different. I don't think this should be a problem. However, I believe it would be good to explain why this was the case. This might be important for colleagues planning other studies in the future.

6. Methods and results. Page 9: "We found that the very low and very high tempos led to poor performance, hence we eliminated extreme tempo trials and only analyzed the set of matching tempos across dyads, quartets, and octets, namely 80, 120, 160, and 200 bpm". I believe it would be important to show this data (ideally as a supplement, i.e. not in the main text).

7. Methods and results. Section 2.7.2.1. "In ensemble conditions, we only kept the group-aggregate tempo trend". For comparability with previous studies testing smaller groups, it might be important to conduct this analysis looking at each individual separately, and then average the results. I realize this might yield qualitatively similar results, but I still think it's an important detail to report somewhere.

8. Methods and results. Page 19. "IOIs were matched across participants by proximity". Does this mean that cross correlations were computed between pairs of participants that were physically proximal? Or have the authors computed these coefficients by cross-correlating each individual's IOIs with the aggregate IOI measure? This is an important detail, which I am not sure I have fully understood.

9. General comment. The authors have selectively tested experimental conditions in which the effects produced by the performers were qualitatively similar (i.e. same acoustic output). I would like the authors to make a prediction about what would happen should the effects produced be different. Considering that the authors often refer to music, and that musical ensembles are often composed of players producing distinct sounds, I believe it might be important to discuss this.

10. General comment. Following on the previous point, the authors might consider that in this experiment participants only needed to "integrate" the acoustic outputs produced by the other players. However, in more complex coordination dynamics (e.g. an orchestra), each individual might need to weight the outputs produced by the other members, eventually "integrating" with some and "segregating" from others. This principle is well known in auditory cognition and has also been proposed to be important for ensemble music performance (see e.g. Ragert et al., 2014 (Plos One), Novembre et al., 2016 (Neuropsychologia), Liebermann-Jordanidis et al., 2021 (Acta Psychologica)). Could the proposed model account for this kind of behavior? And, if so, how? Would this require modelling individuals as separate processes (see also point 4 above)?

11. General comment. In relation to the role of "prediction", the authors state that this "is implicit in the emergent synchronization of coupled individual oscillatory dynamics". I am not sure this can be generalized to all instances of interpersonal coordination though. Take, as an example, the instance where one of the drummers gradually changes tempo (I am not referring to microtiming deviations or phase adjustments, but to an actual gradual change of the period). This might strongly call for predictive processes, more so than in the current scenario (where there is not much to predict given that all participants have to drum at a stable, non-changing tempo). A more ecological example would be joint musical improvisation, where players notably introduce unexpected behaviors, yet their peers are able to synchronize with (and sometimes even anticipate) these. To what extent can the discussed results and model be generalized to similar instances?

Reviewer #2 (Recommendations for the authors):

Several details not addressed above:

It is noted that the human subjects filled out a questionnaire which included information on musical experience/background. Was this information used to eliminate or retain subjects? Would musical background, or lack thereof, potentially influence the results of drummer timing? Some detailed information on this issue would be helpful.

The paragraph at the bottom of page 4, beginning with 'There are models of coordination where prediction... ' could benefit from a better explanation. There are some profound statements here, and it would behove the authors to avoid possibilities of misinterpretation.

Reviewer #3 (Recommendations for the authors):

Please, have a look at the "groove" and expressivity studies (groove has many culturally specific definitions: swing, swingue, ginga, balanço, etc) and reflect on how these studies solve and shed light on some aspects of the contribution. In special, understand that cultural forms profit from intentional and idiomatic variability and not simply avoid them. In my opinion, the ontological stance of the musical phenomenon as a creative act violates important assumptions behind traditional statistical methods, and researchers must take care to avoid building Procrustean beds of predictability in a context where predictability is not welcome.

Discuss and explain how procedures could have affected the shape and magnitude of timing variability.

Explain the axes, indicate magnitudes and units, and define ALL elements, terminologies, and abbreviations presented in formulas and figures. Repeat explanations in the captions. The proper visualization of time in the graphs is important because the stimuli designed with 20 ms attack slope and perception times add 10ms or more in the thresholds of perception. Therefore the clear indication of IOIs or another timing reference in terms of milliseconds helps to understand the results.

Explain the procedures in the attempt to clarify the chain that could add delays (latency) in the system: sampling, midi protocols, soundboards, Since we are dealing with timing issues and small perception thresholds, small delays and their nature (constant, variable, random) greatly affect the validity of measurements.

Verify if your statistical/numerical methods are not violated by homogeneity and independence (e.g. a delay in tempo is often compensated by the performer in the following attacks).

Try to communicate and describe the musical/performative effect of the results to open a dialogue with researchers in the music field.

eLife. 2022 Nov 1;11:e74816. doi: 10.7554/eLife.74816.sa2

Author response


Essential revisions:

1) The data do fit, to some extent, the proposed hypothesis (response to the mean of all neighbors). However, as noted by Reviewer #2, there are alternative hypotheses which may be as good or better.

We agree and we included a discussion of alternative mechanisms and scenarios (bottom of page 3, the bottom of page 8, the end of first paragraph on page 16, the end of paragraph 1 on page 18, end of paragraph 2 on page 15, and of course the whole new section 3.1). We consider the mean field approach to be the most likely for the present scenario because the task makes it hard to pay selective attention to individual participants. We discussed the role of individual heterogeneity and spatial separation in breaking symmetry and creating affordances for more interesting forms of coordination. An additional argument is provided by comparing different versions of the model. We included a neighbour-only coupling (ring topology) version of the model which does not correspond well to the empirical data. We showed this in Figure 3—figure supplement 2B.

2) The fact that the 'agents' could see as well as hear each other poses some problems for adapting the Kuramoto model correctly. Moreover, selective attention (to only some neighbors) has not been dealt with in the proposed hypotheses or in the perceptual and response mechanisms of the individual agents. The issue that individuals might have adapted more strongly to the output of their neighbours should be discussed in the article.

We included further discussion of this limitation of the task. Indeed, we cannot rule out visual coupling which is of a different nature than the short auditory pulses. We also included the following points. First, we emphasized that participants were instructed to focus on X’s in the center of their drums, hence partners were only visible in the peripheral field (pages 22-23, section 4.4). Second, our results in the dyad group essentially replicated other dyadic tapping studies (end of page 17). Third, we included two new versions of our model, one with a continuous coupling (classical Kuramoto), and one with a restricted topology, a ring, where individual units are only coupled to their immediate neighbours (bottom of page 8 and Figure 3—figure supplementary 2). The classical model would be expected to fit the data if the primary input was continuous (i.e., visual) and the ring model would be expected to fit the data if participants were indeed focusing on the visual information from those beside them. However, neither model fit the empirical data well.

3) Both reviewers #1 and #3 recommended that the article should clarify which of the conclusions from the study might be generalisable to musical activities.

We agree that the present task is an idealized impoverished scenario. We inserted a more thorough discussion of the limitations of the present study and where future research can be directed to capture more realistic musical activities in a new section, “Beyond the group average”.

4) Statements regarding predictive mechanisms should be fully justified. Note also the point made by Reviewer #1: the task used here requires minimal prediction due to the constant tempo of the task.

We agree that the isochronous task is not challenging as a pattern. Because we did not design the study to specifically test predictive processes, we prefer to minimize the discussion of predictive mechanisms. It is hard to talk about predictive processes and whatever is the alternative to such processes without engaging in a lengthy discussion including definitions and how prediction can be used to mean different things. Having said that, this has been an important topic of research in the last few years that should not be ignored. To this end, we made revisions so that the topic is acknowledged but without committing to uncovering new knowledge about prediction as our study was not designed to do that. We do now, however, pointed out a definite way for distinguishing models in that context. Specifically, one set of models, typically associated with predictive processing theories, assume that two separate (oscillatory) processes are needed to account for an individual: one for controlling one’s own timing, and the second for tracking an external process (the partner). Our approach makes use of only one oscillator per individual; doesn’t separate the mechanism into self-timing and other-tracking. We added a brief mention of this distinction at the beginning of page 18.

Reviewer #1 (Recommendations for the authors):

Here are some comments and suggestions.

1. Abstract. I would try to be more explicit about potential dissociations between dyadic vs. large group interactions (i.e. comparing this study with previous ones focusing on dyads). The authors might even attempt to refer to this issue more explicitly in the abstract.

Thank you for the suggestion, we have now done this.

2. Introduction, page 4, "arising from pressures to either collaborate or compete". The study by Keller et al., 2017 (Frontiers in Psychology), showing that collaboration and competition are not always alternatives in joint music making, might also be relevant here.

Thank you for the suggestion. We included this at a different location (bottom paragraph of page 19), in the final section where we discuss other potential principles of organization that come into play in tasks with both competitive and cooperative aspects.

3. Introduction, page 4, "Group music making constitutes an ecologically valid and convenient paradigm for studying group action and collective experience in the laboratory". This statement would be better supported by a reference, e.g. D'Ausilio et al., 2015 (TICS).

Thank you for the useful suggestion. We had added this (beginning of paragraph 2, page 5).

4. Introduction. The role of "prediction" in "interpersonal synchronization" is discussed briefly on two occasions in the introduction (page 4 and page 5, final sentence). I am not sure the authors have sufficiently conveyed their message. I believe they should better explain how prediction would emerge "implicitly" from the dynamic models they are referring to. Ideally, they should provide a concrete example of this, and support this with some data from previous studies. Otherwise, I am not sure a naïve reader will be able to make much of this issue.

Indeed, our introduction of this issue was very cursory in an attempt to avoid turning this manuscript into a comparison of predictive and non-predictive theoretical models of coordination. We made changes so as to acknowledge the existence of predictive accounts without committing to them as our study was not designed to specifically examine predictive processes (last paragraph of page 4 and paragraph 1 on page 18).

5. Methods. The equipment used across small and large groups is slightly different. I don't think this should be a problem. However, I believe it would be good to explain why this was the case. This might be important for colleagues planning other studies in the future.

The main reason was circumstance. Additionally, while the drums in the initial setup had distinct sounds, they didn’t allow us to manipulate them to produce eight drums with nominally different fundamental frequencies. Thus, when the eight-drum set became available we switched to it.

6. Methods and results. Page 9: "We found that the very low and very high tempos led to poor performance, hence we eliminated extreme tempo trials and only analyzed the set of matching tempos across dyads, quartets, and octets, namely 80, 120, 160, and 200 bpm". I believe it would be important to show this data (ideally as a supplement, i.e. not in the main text).

We edited the relevant figures and captions in the figure supplements and inserted the data from all tempos.

7. Methods and results. Section 2.7.2.1. "In ensemble conditions, we only kept the group-aggregate tempo trend". For comparability with previous studies testing smaller groups, it might be important to conduct this analysis looking at each individual separately, and then average the results. I realize this might yield qualitatively similar results, but I still think it's an important detail to report somewhere.

We verified that this does not make a difference, see our response above. For simplicity, we replaced this with the stats based on individual data.

8. Methods and results. Page 19. "IOIs were matched across participants by proximity". Does this mean that cross correlations were computed between pairs of participants that were physically proximal? Or have the authors computed these coefficients by cross-correlating each individual's IOIs with the aggregate IOI measure? This is an important detail, which I am not sure I have fully understood.

We edited to be clearer (first paragraph of 2.2.2, page 12). What was meant here was not spatial proximity with the neighbours but deciding which drum hits from different participants go together and which ones, on very rare occasions, are mistakes. The issue of spatial proximity, raised by the other reviewers as well, is very important for understanding large ensembles but is less relevant here because of the relative proximity of all drummers (see also section 3.1. “Beyond …”).

9. General comment. The authors have selectively tested experimental conditions in which the effects produced by the performers were qualitatively similar (i.e. same acoustic output). I would like the authors to make a prediction about what would happen should the effects produced be different. Considering that the authors often refer to music, and that musical ensembles are often composed of players producing distinct sounds, I believe it might be important to discuss this.

We believe that introducing acoustic heterogeneity, as well as difficulty, will create affordances for selective attention. For example, low-pitch sounds may afford better beat-based timing precision. In general, heterogeneity should make it possible for specialized individual roles, sub-groups and leader-follower dynamics to emerge. See 3.1. “Beyond the group average” at the end of the Discussion.

10. General comment. Following on the previous point, the authors might consider that in this experiment participants only needed to "integrate" the acoustic outputs produced by the other players. However, in more complex coordination dynamics (e.g. an orchestra), each individual might need to weight the outputs produced by the other members, eventually "integrating" with some and "segregating" from others. This principle is well known in auditory cognition and has also been proposed to be important for ensemble music performance (see e.g. Ragert et al., 2014 (Plos One), Novembre et al., 2016 (Neuropsychologia), Liebermann-Jordanidis et al., 2021 (Acta Psychologica)). Could the proposed model account for this kind of behavior? And, if so, how? Would this require modelling individuals as separate processes (see also point 4 above)?

Thank you for the relevant literature and ideas. We have incorporated the literature, bottom of page 20. Additionally, we now discuss that the present approach needs to be expanded to be able to account for competitive pressure and selective attention. See also our discussion of the well-known Vicsek model (middle of page 16 and second paragraph of page 20).

11. General comment. In relation to the role of "prediction", the authors state that this "is implicit in the emergent synchronization of coupled individual oscillatory dynamics". I am not sure this can be generalized to all instances of interpersonal coordination though. Take, as an example, the instance where one of the drummers gradually changes tempo (I am not referring to microtiming deviations or phase adjustments, but to an actual gradual change of the period). This might strongly call for predictive processes, more so than in the current scenario (where there is not much to predict given that all participants have to drum at a stable, non-changing tempo). A more ecological example would be joint musical improvisation, where players notably introduce unexpected behaviors, yet their peers are able to synchronize with (and sometimes even anticipate) these. To what extent can the discussed results and model be generalized to similar instances?

We agree. We removed the idea of implicit prediction and discussed other, richer forms of performance that are not really tested by the present paradigm. Please see our responses to similar comment above and below from the other reviewers.

Reviewer #2 (Recommendations for the authors):

Several details not addressed above:

It is noted that the human subjects filled out a questionnaire which included information on musical experience/background. Was this information used to eliminate or retain subjects? Would musical background, or lack thereof, potentially influence the results of drummer timing? Some detailed information on this issue would be helpful.

We have inserted a short description of this aspect of the demographics, stating that the sample consisted of participants who all had some musical and dance experience without extreme outliers. We believe that expertise will interact with the task especially if it is more difficult, creating the opportunity for more interesting dynamics including clear asymmetries between participants.

The paragraph at the bottom of page 4, beginning with 'There are models of coordination where prediction... ' could benefit from a better explanation. There are some profound statements here, and it would behove the authors to avoid possibilities of misinterpretation.

We edited this part as suggested here, and also by Reviewer 1.

Reviewer #3 (Recommendations for the authors):

Please, have a look at the "groove" and expressivity studies (groove has many culturally specific definitions: swing, swingue, ginga, balanço, etc) and reflect on how these studies solve and shed light on some aspects of the contribution. In special, understand that cultural forms profit from intentional and idiomatic variability and not simply avoid them. In my opinion, the ontological stance of the musical phenomenon as a creative act violates important assumptions behind traditional statistical methods, and researchers must take care to avoid building Procrustean beds of predictability in a context where predictability is not welcome.

We agree. A lot about musical expression is left out from the present study. We now discuss the limitations related to issues of context, individual heterogeneity, and the ability to resist the group average, and encourage future research to incorporate the influence of the mean field as one factor that is likely necessary for understand the range of musical behaviours.

Discuss and explain how procedures could have affected the shape and magnitude of timing variability.

Explain the axes, indicate magnitudes and units, and define ALL elements, terminologies, and abbreviations presented in formulas and figures. Repeat explanations in the captions. The proper visualization of time in the graphs is important because the stimuli designed with 20 ms attack slope and perception times add 10ms or more in the thresholds of perception. Therefore the clear indication of IOIs or another timing reference in terms of milliseconds helps to understand the results.

We tried to make corresponding changes but we found there is very little in the figures and results that is actually defined at the millisecond scale. Since we are not focused on asynchronies but variability of the intervals, the CV is the only really relevant measure that reflects sub-second variations but it is a dimensionless variable.

Explain the procedures in the attempt to clarify the chain that could add delays (latency) in the system: sampling, midi protocols, soundboards, Since we are dealing with timing issues and small perception thresholds, small delays and their nature (constant, variable, random) greatly affect the validity of measurements.

We included more detail about this to clarify that the recording was parallel to sound triggering, hence participants did not experience added delays between surface contact and sound onset, other than the minimal latency of the professional electronic drum set which is about 5 ms.

With respect to timing variability resulting from the chain from the electronic drums module and USB connection to MIDI-recording on the computer, we tested the apparatus specifically for the sake of the present revisions and created Author response image 1. It summarizes the timing error from the ground truth based on a direct analogue recording of the piezoelectric elements. It shows that the digital pipeline adds very little timing variability. On very few occasions it drops drum hits, and how we handled these was already explained in the methods.

Author response image 1.

Author response image 1.

Verify if your statistical/numerical methods are not violated by homogeneity and independence (e.g. a delay in tempo is often compensated by the performer in the following attacks).

Thank you for the thorough suggestions. The main assumption of the statistical method (linear mixed effects models with maximum likelihood estimation) is normality of the residuals and random effects of the given fitted model. We included examples of how we verified that this assumption is satisfied to a reasonable degree. For example, Author response image 2 show how the given data fit a normal distribution. The first row is for the residuals of the model for tempo speeding up. Second row, the random effects from the same model (the random effects estimate individuals’ or groups’ overall tendency to speed up aside from effects of experimental condition). Left, model fitted to individuals. Right, model fitted to the group-average of individuals for better statistical properties.

Author response image 2.

Author response image 2.

Try to communicate and describe the musical/performative effect of the results to open a dialogue with researchers in the music field.

Thank you for encouraging us to do so. We hope the responses to the previous comments are sufficient to also address this important one.Equations 4; 5Equations 4; 5

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Figure 3—source data 1.
    Figure 4—source data 1.
    Figure 5—source data 1.
    Figure 6—source data 1.
    Figure 7—source data 1.
    Supplementary file 1. Supplementary table.

    Separate linear mixed-effects models were fitted for each of lags 1–8 of the autocorrelations of individuals (a) and group aggregates (b) in the continuation phases of synchronization-continuation task (SCT) trials, ensemble conditions. The predictors were group size (N) and Tempo (80, 120, 160, and 200 beats per minute [bpm]). For simplicity, the same full model was fitted in all cases for individuals, Yik=(β00g)+β1Nik2Tempoik3NikTempoikg, where i is participant, k trial, and g group. For the group aggregate (b) where there was not enough data to fit models with all predictors, Yik=(β00g)+β1Nik2Tempoikg was used. Significant coefficients (p<.05, Satterthwaite method) are in bold.

    elife-74816-supp1.docx (20.2KB, docx)
    Transparent reporting form

    Data Availability Statement

    The current manuscript is a computational study, so no data have been generated for this manuscript.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES