Abstract
Creativity in human societies may be sustained not by isolated minds but by embodied, collective intelligence. Using music as a tractable model system, this paper reviews evidence that creativity depends on the dynamic tuning of uncertainty: moderate, time-varying surprise—often accompanied by interoceptive bodily sensations—can promote exploration, while interpersonal synchrony and social evaluation can stabilize and transmit what is new. Because creativity itself is value-neutral, its social consequences depend on the co-occurrence of ethics, morality, and empathy. This paper argues that ethical norms bound admissible deviation, and empathic perspective-taking—supported by bodily and physiological coupling—acts as a social form of precision weighting that increases (or decreases) the likelihood that novelty is admitted, reshaped, or rejected by a community. From these ingredients emerges field-level intelligence (FLI): a process in which groups preserve diversity, selectively integrate deviations, and maintain “just-right” uncertainty under partially connected social structures. Although music provides a high-resolution measurement sandbox, this paper articulates boundary conditions and testable predictions for other domains (e.g., scientific discovery, design, and everyday collaboration) and for human–AI co-creativity. This paper outlines an empirical agenda: operationalize FLI and social precision weighting, quantify uncertainty dynamics in embodied interaction, and identify network conditions that sustain socially shared creativity without overgeneralizing across domains.
Keywords: collective intelligence, embodied cognition, empathy, ethics, uncertainty dynamics
1. Introduction
Creativity is a fundamental human capacity that drives societal advancement. It is the ability to generate novel ideas and frameworks and to embed them in forms that can be shared with others. This capacity extends across domains as varied as science and technology, the arts, education, industry, and everyday collaboration (Hennessey and Amabile, 2010). Crucially, creativity is not a transient spark of inspiration; rather, it is a dynamic process that is continuously sustained and renewed across generations and communities. Creativity is not a phenomenon confined to the solitary mind; it emerges, is tested, and ultimately selected through the dynamics of social interaction. Within networks of collaborators and teams, ideas are challenged, refined, and either disseminated or discarded—underscoring that creativity is as relational and collective as it is cognitive (Uzzi and Spiro, 2005; Wuchty et al., 2007). This cumulative renewal of knowledge—often termed the “ratchet effect”—has long been a central theme in cultural evolution research (Dean et al., 2012). Elucidating the mechanisms that sustain creativity is therefore indispensable for understanding the very foundations of sustainable intelligence and the dynamics of cultural development.
A useful contrast for these dynamics is the recent rise of large language models (LLMs). These systems, accessed through natural-language interfaces, have broadened access to knowledge (Brown et al., 2020). Yet their control still relies heavily on post hoc alignment, safety filters by humans, and output-level moderation (Ouyang et al., 2022; Gehman et al., 2020; Weidinger et al., 2022). As a result, vulnerabilities remain, including susceptibility to unexpected behaviors and the reproduction of biases inherited from training data (Bender et al., 2021). Moreover, empirical and theoretical work has shown that when models iteratively incorporate their own generated data into retraining, the distributional diversity erodes and both quality and variability decline—a phenomenon termed “model collapse” (Shumailov et al., 2024). In other words, this trend raises the concern that the very source of creativity—the richness of uncertainty and diversity—may be lost. This contrast forces us to confront a fundamental question: why is creativity in human societies far less vulnerable to collapse? Why, instead, is diversity maintained and even amplified across generations? And what mechanisms sustain such a resilient system?
This review addresses these questions from the perspective that creativity is sustained not at the level of the individual, but at the level of the collective. This paper focuses in particular on music as a model system. From the standpoint of predictive processing—namely, the interplay between prediction errors and their precision—music provides an ideal domain in which to examine how novelty is linked to surprise and how new information is assigned “value” or experienced as pleasure (Friston, 2010; Koelsch et al., 2019; Pearce and Wiggins, 2012; Salimpoor et al., 2011). Moreover, decision-making and emotional processes grounded in the norms and violations of musical grammar have been visualized in neural activity (Maess et al., 2001; Koelsch, 2014). The social and collective diversity and universality that arise through music further illuminate a “maintenance system” that enables creativity to be continuously renewed (Mehr et al., 2019). Building on these insights, our aim is to delineate the fundamental mechanisms that sustain the ongoing renewal of creativity in human societies and to propose a theoretical framework that integrates them. Ultimately, this paper extends this perspective beyond music to domains such as scientific research and human–AI collaboration, thereby advancing an interdisciplinary understanding of the fundamental question: how is creativity not only preserved, but actively amplified across generations?
Music is used here as a high-temporal-resolution domain in which predictive models, surprise, and uncertainty can be quantified and experimentally manipulated. Musical creativity is not assumed to be identical to creativity in science, visual arts, or everyday collaboration. Rather, the paper treats music as an existence proof and a measurement sandbox for linking uncertainty dynamics to embodied experience and social coordination, and derives hypotheses that require cross-domain testing. Accordingly, this distinguishes (i) components that plausibly generalize across domains (e.g., “just-right” uncertainty, partial connectivity, and interaction-level selection of deviations) from (ii) components that are likely domain-specific (e.g., ethical norms, institutional gatekeeping, and the cost of errors). This framing motivates a set of operational definitions and falsifiable predictions. Appendix A summarizes the core constructs used throughout the paper along with example operationalizations, and provides a concise glossary for quick reference.
2. Sustenance of creativity in human societies
2.1. Dynamics of uncertainty emerging from embodied-collective intelligence
The sustained renewal of creativity across generations and communities cannot be explained solely by individual predictive models. Instead, the evidence points toward a fundamentally collective substrate of human predictive processing, continuously shaped through embodied and social interaction. Here, this paper uses “uncertainty” for how unpredictable the next event is (often quantified as entropy) and “surprise” for how much an event violates expectations (prediction error). In predictive processing, “precision” refers to the confidence (roughly the inverse of uncertainty) assigned to prediction errors, which controls how strongly deviations update predictions. This paper uses the term conceptually, and Supplementary lists candidate empirical proxies.
Using music as a predictive model system, this paper has developed an integrative approach that couples predictive processing with behavioral and neuroscientific measurements in humans (Daikoku and Tanaka, 2024). This work has analyzed a large corpus of jazz improvisations spanning the early twentieth to the twenty first century (456 pieces, 78 musicians) with a hierarchical Bayesian statistical learning model (Daikoku et al., 2023; Daikoku, 2023). This analysis revealed that while the acoustic properties of musical signals themselves remain strikingly stable across eras, the temporal dynamics of prediction error (surprise) and entropy (uncertainty) exhibit gradual change, period-specific signatures (Daikoku, 2024). In particular, pitch-related sequences displayed gradual, community-level shifts in their temporal patterns of uncertainty, whereas rhythmic structures—such as fundamental ratios of 1:1 and 1:2 (e.g., a quarter note followed by another quarter note = 1:1, a quarter note followed by a half note = 1:2)—remained relatively invariant across time. The key point is this: historical change is reflected not in the raw acoustic signal, but in the probabilistic structure of sequences—transition probabilities, their uncertainty, and how these fluctuate over time. These findings suggest that the dynamics of deviation and alignment within shared predictive models are what drive creativity as a collective, emergent form of intelligence rather than as an individual cognitive process.
Complementary studies reinforce this view. Figure 1 schematically summarizes the temporal dynamics of uncertainty in jazz improvisation and chord progressions, together with the proposed bodily-synchrony overlay. In a large-scale experiment with 353 participants, emotional evaluations and bodily maps were collected in response to chord progressions. By constructing a population-level predictive model of chord progression (Daikoku et al., 2024) and manipulating the temporal dynamics of uncertainty and prediction error, this paper demonstrated that chord progressions with higher uncertainty elicited stronger subjective feelings of creativity and heightened arousal (Daikoku and Tanaka, 2024). Importantly, this effect was tightly linked to interoceptive sensations in the cardiac region, whereas the perception of beauty and pleasure was more strongly associated with predictable and less uncertain chord progressions. These results indicate that fluctuations in high uncertainty, grounded in collective predictive models, recruit embodied pathways to promote the feeling of creativity, whereas aesthetic value is stabilized by predictability (for theoretical background, see Critchley et al., 2004; Seth, 2013). Crucially, these findings are consistent with broader cross-cultural evidence on the embodied grounding of affect and music. Volynets et al. (2020) demonstrated that bodily maps of basic emotions are culturally universal, suggesting that interoceptive patterns provide a shared biological template for linking internal bodily states with affective experiences. Extending this framework, Putkinen et al. (2024) revealed that bodily maps of musical sensations likewise display systematic and cross-culturally consistent patterns. Together, these results highlight that the coupling between predictive dynamics and embodied sensations is not culturally idiosyncratic but reflects a universal mechanism through which human societies anchor and transmit affective and creative experiences (Daikoku and Tanaka, 2026). Taken together, these converging findings underscore a key principle: creativity is not simply sustained by novelty per se, but by the temporal dynamics of uncertainty embedded in collective predictive frameworks, which engage the body and its interoceptive signals as mediators of creative experience. This perspective aligns with recent affective neuroscience and music cognition studies showing that both surprise and uncertainty fluctuations shape emotional intensity and social bonding in music (e.g., Vuust et al., 2022b). Moreover, the cross-cultural universality of bodily maps of both emotions and musical sensations may suggest that embodied pathways form a robust and species-wide foundation for the resilience and renewal of creativity in human societies.
Figure 1.
Schematic overview of the proposed framework linking uncertainty dynamics, embodiment, and collective intelligence. (A) The upper panel illustrates the hypothesized pathway from creative deviation to field-level intelligence. (B) The lower panel illustrates the network-level consequences of admissibility. In fully connected networks, rapid information diffusion tends to produce fast consensus, increasing the risk of premature convergence and reducing diversity. In contrast, partially connected networks allow deviations to persist locally and be explored in parallel, thereby preserving diversity and supporting cumulative innovation over time. Together, the panels illustrate how uncertainty dynamics, embodied valuation, social regulation, and network structure interact to sustain creativity in human collectives.
These observations reinforce the view that value judgments—including those underlying creativity—do not reside within the isolated individual but emerge in social and collective contexts of resonance. Neural evidence has shown that coupling of neural activity between speakers and listeners, or performers and audiences, is a critical determinant of mutual understanding and joint action in language, music, and gesture (Stephens et al., 2010; Hasson et al., 2012). Likewise, physiological synchronization during collective rituals may be linked to the strengthening of social bonds (Konvalinka et al., 2011; Hove and Risen, 2009; Mogan et al., 2017), while synchronized movement more broadly promotes cooperative behavior (Wiltermuth and Heath, 2009). Together, such findings highlight how “collective intelligence” can emerge from the interactions of diverse individuals (Woolley et al., 2010). Diversity itself plays a decisive role: heterogeneous groups consistently outperform homogeneous ones (Hong and Page, 2004), and success in cultural markets often follows unpredictable and unequal trajectories driven by social influence (Salganik et al., 2006). These strands of evidence converge on a central principle: the value of creativity is distributed across a collective/social “field,” shaped through the alignment and deviation of mutual predictions.
The computational basis of such dynamics can be situated within predictive processing frameworks, which emphasize the representation and weighting of uncertainty in Bayesian inference (Knill and Pouget, 2004; Körding and Wolpert, 2004). Humans appear to prefer an intermediate rate of information—neither too little nor too much—a phenomenon known as the “Goldilocks effect,” observed in infants' allocation of visual and auditory attention (Kidd et al., 2012). This preference for “just-right” uncertainty resonates with musical groove, where moderate deviations in prediction sustain engagement and drive affective responses (Vuust et al., 2022a). It also suggests a precision-tuning mechanism that regulates the optimal balance of pleasure, arousal, and immersion that defines creative experience. At the societal level, the dynamics of maintaining uncertainty without excess or depletion resemble probabilistic search algorithms. For instance, the Metropolis–Hastings algorithm accepts even rare deviations with a finite probability, thereby preventing premature convergence and preserving diversity (Metropolis et al., 1953; Hastings, 1970). Experimental studies of cultural evolution further support this analogy: partially connected networks, which avoid overly dense synchronization, foster the accumulation of diverse and complex solutions (Derex and Boyd, 2016). Figure 1 contrasts fully connected and partially connected networks, highlighting why moderated connectivity can preserve diversity while avoiding premature convergence.
To sharpen the analogy with probabilistic search, it can be mapped (i) a creative deviation to a proposed candidate state, (ii) the field's evaluative landscape to a (time-varying) target distribution over admissible states, and (iii) social uptake to an acceptance step whose probability increases with expected value but decreases with expected social and ethical cost. In this view, a tolerance parameter captures how permissive a community is toward exploratory deviation. This mapping is heuristic rather than literal: real creative fields are non-stationary, multi-criteria, and institutionally constrained. Nevertheless, the mapping makes a concrete point—finite acceptance of imperfect deviations can prevent premature convergence and help sustain long-run diversity.
Taken together, these findings suggest that human creativity is sustained not by optimizing individual internal models, but by collective processes of acceptance and selection mediated through bodily synchronization, emotional resonance, and reciprocal prediction. Accordingly, intelligence and creativity are better understood at the level of interaction patterns than at the level of isolated individuals alone. These structures flexibly regulate the temporal profile of surprise and uncertainty, allowing norms and deviations to coexist in dynamic tension. It may be precisely this capacity for continual adjustment that safeguards prediction uncertainty as the wellspring of creativity.
2.2. Co-occurrence of ethics, morality, and empathy
Having outlined how uncertainty dynamics are embodied and socially shared, this paper next considers how novelty is regulated by social norms and social understanding. The social and collective dynamics that sustain creativity do not terminate within individual brains. This paper proposes that three processes intertwine within collective dynamics, forming what can be viewed as an “intelligence of fields”: (i) innovative ideas that underlie uncertainty; (ii) ethical/moral constraints that stabilize predictability by delimiting what is acceptable; and (iii) empathy and perspective taking, which bridge the two by rendering innovation intelligible and admissible to others.
Empirical evidence supports the key components of this framework. Individuals with stronger creative problem-solving skills show superior ethical decision-making (N = 258), and creative reasoning can unlock moral insights that reconcile value conflicts—particularly when actors generate multiple “What could I do?” options (Mumford et al., 2010; Zhang et al., 2018). On the other hand, creativity has a dark side. Creativity can facilitate the self-justification of unethical conduct (e.g., creatively finding loopholes) and is associated with increases in dishonest behavior (Gino and Ariely, 2012). Furthermore, malevolent creativity—novel and effective ideas serving harmful purposes—has been theorized and functionally modeled in domains such as terrorism and crime (Cropley et al., 2008). These findings imply that creativity per se is value-neutral; the field conditions that align innovations with social norms—and thereby shape the probability of acceptance—determine whether novelty crystallizes as ethical innovation or deviates into harmful ingenuity.
Within this regulatory framework, empathy and perspective taking may function not merely as correlates of social behavior but as mechanisms through which a social field assigns precision to prediction errors arising from others' actions. In predictive processing terms, when another agent introduces a deviation from shared expectations, this generates a social prediction error. Whether such a deviation updates a shared model depends on the precision assigned to it—that is, the inferred reliability of the signal and the anticipated consequences of adopting it. This paper proposes that empathic cues—such as inferred benevolent intent, shared affect, or physiological synchrony—can increase the perceived reliability of the deviating agent and reduce expectations of harm. In doing so, they effectively increase the precision assigned to the deviation, thereby raising the probability that it will be explored and integrated rather than rejected. Conversely, cues signaling malicious intent or norm violation reduce the assigned precision, suppressing learning from the deviation even when it is highly surprising. The causal claim advanced here is modest: empathic and norm-related cues are hypothesized to alter the precision assigned to deviations, which in turn changes whether they are explored, adopted, or rejected. This mechanism is not uniformly prosocial; in domains such as advertising or political messaging, affective attunement and perspective taking can also amplify ethically ambiguous or manipulative deviations, underscoring that empathy-like alignment must be constrained by explicit norms.
This interpretation is consistent with converging evidence from affective neuroscience linking empathy to interoceptive processes. Neural activity in regions associated with interoceptive awareness—particularly the anterior insula—has been shown to contribute to empathic experience (Critchley et al., 2004). Similarly, heartbeat-evoked potentials (HEP) have been implicated in the neural representation of others' affective states (Fukushima et al., 2011). Interoception—the perception and interpretation of internal bodily states—provides a critical substrate through which prediction errors are coupled to affective value. The framework of interoceptive inference further suggests that emotional experience emerges from predictive regulation of bodily signals, whereby prediction errors about internal states shape subjective feeling and behavioral responses (Seth, 2013). In this sense, empathy may be understood as a process through which representations of others become integrated with predictive models of bodily states, allowing deviations introduced by others to acquire affective and motivational significance.
Beyond its neural grounding, empathy operates socially by shaping how novel deviations are interpreted and evaluated within collective contexts. Perspective taking—the capacity to adopt another person's standpoint and infer their intentions—plays a particularly important role in this process. In negotiation research, perspective taking facilitates the discovery of hidden agreements and enables the creation of joint value (Galinsky et al., 2008), while improving outcomes in mixed-motive situations (Gilin et al., 2013). It also promotes intergroup contact and increases cohesion across social boundaries (Wang et al., 2014). From this perspective, perspective taking may transform exploratory deviations into forms that are intelligible to others. When combined with interoceptive bodily signals and interpersonal synchrony, this process allows creative mismatches to be socially interpreted rather than dismissed. In doing so, empathy and perspective taking help regulate the balance between exploratory creativity and ethical acceptability, enabling social systems to incorporate novelty without collapsing into either rigid conformity or uncontrolled deviation.
These mechanisms are particularly vivid when performing music, where interactions among creative behavior, morality/ethics, and empathy prevent both collapse from excessive novelty and stagnation from rigid adherence to norms. In a jazz session, the groove established by the rhythm section scaffolds the soloist's bold and novel insertions; the ensemble and audience momentarily hold their breath as a bodily response to surprise, but then stabilize interpretation and predictability by catching the phrase harmonically and metrically. In these processes, performers, listeners, and co-listeners alike become physically coupled to the music, their bodies moving spontaneously (Janata et al., 2012). Such bodily entrainment naturally gives rise to interpersonal synchrony, which in turn enhances affiliation (Hove and Risen, 2009). Building on this foundation, evidence shows that joint music-making promotes prosocial behavior (Kirschner and Tomasello, 2010), while group singing and synchronous dance strengthen social bonding (Tarr et al., 2014; Weinstein et al., 2016). From lullabies and choral singing to the dance floor, this loop re-emerges across contexts, sustaining a dynamic balance between novelty and stability—likely grounded in musical moral–ethical sensibilities and collectively negotiated through embodied empathy and perspective taking.
Meta-analyses indicate that interpersonal synchrony in musical activities (choral singing, ensemble performance, dance) increases cooperation, altruism, and cohesion (Mogan et al., 2017; Rennung and Göritz, 2016). Synchrony heightens self–other overlap and—sometimes via endogenous opioids—boosts trust and cooperation (Lang et al., 2017). In this light, the ethics of musical practice can be viewed as the coordination rules that sustain “creative exploration within a shared temporal action space” by satisfying: (1) care (protecting others' agency and welfare), (2) fairness (equitable distribution of participation and resources—time, loudness, solos, credit), and (3) authority/norm-respect (adherence to agreed forms, rituals, and copyrights) as foundations in moral foundations theory (Graham et al., 2018). These may be scaffolded by bodily synchrony and perspective taking. The result is the maintenance of socially optimized uncertainty, steering between the twin failure modes of over-stability (rigidity) and over-uncertainty (chaos). The next section integrates these ingredients—uncertainty dynamics, embodied coupling, and normative/empathic regulation—into a field-level account of how collective intelligence can sustain creativity over time.
3. Collective intelligence emerging from uncertainty dynamics mediated by creativity, ethics, and empathy
The foregoing arguments indicate that the sustainment of creativity cannot be explained from an individual's internal model. Rather, it depends on collective interaction grounded in embodied sensibility. In music, for example, the timing of prediction and deviation is socially shared; the resulting mismatch and surprise are experienced as a communicable sense of creativity through bodily synchrony and resonance among performers and listeners. Crucially, what supports creativity is not novelty per se, but the temporal dynamics of predictive uncertainty. Novelty functions as a probabilistic fluctuation that is continuously modulated by the group and distributed through bodily experience, rather than as a single, isolated jolt.
Creativity is likewise inseparable from ethics/moral/insight and empathy. While it can generate moral insight, it also carries the risk of norm violations and self-justification. This ambivalence is governed by social regulatory mechanisms—bodily synchrony, empathy, and perspective taking. Whether a creative deviation is admitted or rejected by a community is decided within a social field through embodied empathic understanding and its alignment with ethical norms. By “collective,” this paper therefore does not mean a mere aggregation of parallel individuals. The term designates an emergent, field-level intelligence (FLI) that arises from the dynamics of bodily coordination and misalignment. Operationally, FLI refers to a group's capacity to preserve diversity, selectively admit useful deviations, and sustain cumulative innovation over time; candidate observables therefore include diversity retention, acceptance rates of borderline deviations, and longitudinal innovation uptake under different network structures. This intelligence of the field continuously fine-tunes between stabilizing norms and exploratory deviations, avoiding both rigidity and chaos. Through this ongoing micro-regulation, creativity should be continually renewed and transmitted across generations and cultures.
Jazz improvisation exemplifies this process. When a soloist launches a bold phrase, listeners' bodies register a brief rise in uncertainty (e.g., faster heartbeat). The ensemble then absorbs the deviation by adjusting harmony and meter, turning it into shared, intelligible invention. Rather than rejecting deviation and novelty, the group incorporates momentary misfit into the field, allowing musical creativity to be updated while retaining ethical and empathic meaning. A similar logic operates in choral singing and joint rhythmic activity. Synchrony of voices and movement creates strong collective cohesion. Individual predictions are calibrated with respect to others, and small deviations are folded back into the overall harmony. Here, normative constraints (pitch, rhythm and musical grammar) remain salient, yet micro-deviations are tolerated as the signature of living music, often enriching creative vitality. Bodily synchrony thus sustains ethical order while keeping creativity alive.
The same dynamics appear in human–AI collaboration, particularly with generative systems. Model outputs often contain unexpected deviations or noise. Through human interpretation and empathic uptake, these fragments can be converted into creative ideas; without such alignment to empathic and ethical constraints, they are dismissed as harmful or meaningless. Hence, even in human–AI settings, embodiment- and empathy-based social regulation is pivotal, opening the possibility of a genuinely co-creative field intelligence spanning humans and machines.
Taken together, uncertainty dynamics mediated by creativity, ethics, and empathy are observable in musical practice. What persists is not a simple sum of individual knowledge but a collective intelligence of the field that rises through the interplay of misalignment and coordination. This structure plausibly underwrites the robustness and sustainability of human creativity. This paper now turns to a broader discussion of implications (including human–AI co-creativity), limitations, and concrete steps for operationalizing and testing the framework.
4. Discussion
Intelligence in human societies is rarely the mere sum of individual abilities. Rather, it frequently arises from the space of interaction—the dynamic field of exchanges among people. This emergent capacity is often termed collective intelligence, and group performance on complex tasks is frequently explained less by the distribution of individual IQs than by the quality of coordination within the group (Woolley et al., 2010). Beyond a simple aggregation of minds, intelligence is distributed across bodies, temporal synchrony, and environments, thereby functioning as a field-level phenomenon that extends cognition into the world (Clark and Chalmers, 1998). This paper refers to this distributed, interaction-born capacity as field-level (collective) intelligence. This framework does not deny individual variation in creative ability; rather, it proposes that collective outcomes arise from the interaction between individual-level variability and network-level coordination.
Although music is a privileged measurement domain for timing, prediction, and synchrony, the proposed components—uncertainty regulation, interaction-level selection of deviations, and norm-bounded admissibility—are not inherently musical. In scientific creativity, for instance, novelty is filtered through peer review, replication, and ethical oversight; in everyday collaboration, novelty is filtered through trust, role expectations, and local norms. A scientific example is CRISPR (Jinek et al., 2012): the core methodological innovation diffused rapidly, while ethically sensitive applications remained subject to explicit normative debate and governance. These fields differ in the cost of errors and in the kinds of norm violations that matter, which implies that the same magnitude of “surprise” can be welcomed in one domain and rejected in another. Therefore, generalization requires specifying which variables carry across domains (e.g., uncertainty dynamics and network connectivity) and which variables are domain-specific (e.g., the normative cost function and institutional gatekeeping). This distinction also clarifies how the framework can be used productively: music provides measurable proxies, while cross-domain tests establish external validity. Useful non-musical test beds include scientific collaboration, collaborative design, and human–AI co-creation, where novelty is evaluated under explicit institutional or user-level constraints. Figure 1 schematizes the proposed framework, linking uncertainty dynamics, embodied coupling, ethical filtering, and partial connectivity. A non-musical illustration is CRISPR, where novelty was rapidly adopted as a methodological advance while ethically sensitive applications remained subject to strong normative filtering.
Contemporary neuroscience frames the brain as a predictive organ that minimizes prediction error (Friston, 2010; Knill and Pouget, 2004). Crucially, this predictive machinery is not confined within individuals: during communication, neural activity couples between speakers and listeners, supporting shared understanding and joint action (Stephens et al., 2010; Hasson et al., 2012). In this sense, prediction and error correction are not only intra-cranial processes but also shared across the social field that links brains together. Music is a particularly tractable model system for examining such collective dynamics and FLI. Musical experience hinges on probabilistic “grammars”: listeners form expectations about upcoming notes and chords, and affective responses arise from the balance of confirmation and violation (Koelsch, 2014; Pearce and Wiggins, 2012). A moderate degree of fluctuation of predictability—neither too little nor too much—maximizes pleasure and movement tendencies (Vuust et al., 2022a; Stupacher et al., 2022). Harmony violations recruit regions that also support syntactic processing in language, including Broca's area (Maess et al., 2001). Put simply, when listening to Western tonal music, many listeners strongly expect a tonic after a dominant. Resolution produces feelings of beauty and relief; deliberate detours produce surprise and tension which, when subsequently integrated back into the prevailing grammar, are often shared and appraised as creative (Daikoku et al., 2024). Thus, musical affect emerges from time-varying play between prediction and deviation that is scaffolded by culturally shared grammatical regularities. Music thereby functions as a collective prediction device—a window onto the mechanisms of FLI.
The substrate of this field is bodily. Interoceptive pathways—particularly those involving the anterior insula—support predictions about cardiac and visceral states and the processing of interoceptive prediction errors, shaping emotion and the sense of self (Critchley et al., 2004; Seth, 2013). People also share broadly similar bodily maps of emotion—for example, heat or pressure in the chest for anger or sadness—suggesting a common bodily “lexicon” for affect (Nummenmaa et al., 2014; Daikoku et al., 2026). These embodied templates provide a shared substrate through which affective experiences can be communicated and socially stabilized, thereby supporting collective creativity. Recent work extends these insights directly to music. Cross-cultural mapping shows that bodily sensations evoked by music exhibit systematic topographies that align across Western and East Asian participants—for example, danceable, joyful tracks map more strongly to the limbs, while gentle, sad tracks map to the chest (Putkinen et al., 2024). This pattern generalizes Nummenmaa's bodily maps of basic emotions to the musical domain, suggesting that the interaction among prediction of external phenomenon, bodily sensation, and emotion is, in broad outline, culturally shareable (Putkinen et al., 2024). Such a shared bodily communication makes it easier for performers and audiences to achieve understanding and synchrony, providing a scaffold for collective intelligence. Complementing this, body-map studies focused on the felt sense of creativity show that the experience of creativity tracks the temporal dynamics of uncertainty and surprise in chord progressions. Higher entropy and surprise increase ratings of “felt creativity” and are accompanied by stronger sensations in the cardiac region; by contrast, beauty and pleasantness peak under more predictable, low-uncertainty progressions (Daikoku and Tanaka, 2026). Individuals high in interoceptive sensibility report stronger creative feelings and clearer bodily sensations (Daikoku and Tanaka, 2024). Hence, creative experience is likely mediated not only by informational novelty but also by interoceptive prediction-error processing and its conscious access. In musical settings, momentary mismatches can thus be absorbed via the body into empathic synchrony, and thereby transformed into socially shared judgments of “creative and interesting.”
Taken together, (1) cross-culturally shared emotion–body mappings (Putkinen et al., 2024) and (2) the coupling between uncertainty dynamics and interoception in the experience of creativity (Daikoku and Tanaka, 2024) jointly explain why music readily functions as a collective prediction device. On a widely legible bodily substrate, moderate fluctuations in predictability ignite a sense of creativity that is then distributed through bodily synchrony and empathy into the shared field. This circulation—the rise of creative feeling on a shared bodily code, its uptake through synchrony, and its stabilization in collective appraisal—captures a core mechanism of FLI.
A distinctive strength of collective intelligence is its ability to maintain diversity while keeping uncertainty at a just-right level. Heterogeneous groups tend to outperform homogeneous high-ability groups (Hong and Page, 2004). In cultural evolution experiments, partially connected networks (in which not everyone is tightly linked to everyone else, slowing premature consensus and preserving diversity)—avoiding over-dense coupling—accumulate more diverse and complex solutions than fully connected ones (Derex and Boyd, 2016). Algorithmically, such dynamics echo search schemes that accept rare deviations with finite probability (e.g., Metropolis-Hastings), thereby preventing premature convergence (Metropolis et al., 1953; Hastings, 1970). The Collective Predictive Coding (CPC) framework (Taniguchi et al., 2016; Taniguchi, 2025; Taniguchi et al., 2025) seeks to formalize these ideas computationally. CPC emphasizes that predictive models are not only individual constructs but are also generated and calibrated within the social field, which regulates whether deviations are admissible as shareable innovations. This view aligns with theories of symbol emergence, in which meanings and norms arise not top-down but through interaction, progressively stabilizing within the field (Taniguchi et al., 2016).
To reduce conceptual density and increase falsifiability, this paper proposes several concrete predictions that can be tested across domains: (1) Just-right uncertainty: Perceived creativity/interest and long-run adoption will show an inverted-U relationship with uncertainty (entropy rate or prediction-error variance), both in music and in non-musical sequential tasks, (2) Social precision weighting: Empathy, trust, and physiological synchrony will increase the acceptance probability of borderline deviations when expected harm is low, but will have little or negative effect when ethical risk is high, (3) Partial connectivity: Partially connected communication networks will preserve higher diversity and support cumulative innovation over time compared with fully connected networks.
These “musical collective intelligence” principles may sketch design cues for future AI. Current AI systems—especially LLMs—lack humanlike bodies and therefore struggle to ground symbols in perception and action. This long-standing symbol grounding problem highlights a structural gap between current AI and embodied human cognition (Harnad, 1990). Decades of work in grounded and extended cognition show that human understanding is deeply embedded in body and environment, implying qualitative differences between human learning and prevailing AI pipelines (Barsalou, 2008). The gap is stark for affect: inferring emotion from surface labels or facial/vocal cues alone is insufficient, casting doubt on whether present AI understands emotion in a human-relevant sense. Although affect detection using physiological signals has progressed (Calvo and D'Mello, 2010; Koelstra et al., 2011), AI systems themselves lack interoceptive generative models that predict and correct internal bodily states—the very machinery that supports affect and self in humans (Critchley et al., 2004; Seth, 2013). Consequently, they may poorly be coupled to the bodily synchrony and empathy that scaffold FLI. Relatedly, recent creativity scholarship argues that generative systems at most exhibit “artificial creativity”—outputs that can appear original and effective while lacking the authenticity and intentional agency characteristic of human creative processes (Runco, 2023). Complementing this view, Lockhart (2025) emphasizes that human creativity is grounded in lived, embodied experience and moral agency, which current machine generation does not instantiate.
Music suggests how to close part of this gap. In musical practice, deviations are not discarded as noise but admitted as creativity when they are absorbed through bodily sensation and empathic uptake. Translating this into AI design requires going beyond rule-based governance of morality and ethics to a sensory–affective alignment layer in which systems optionally respond to bio-backed signals—heart rate, respiration, or HRV—under strict privacy-preserving protocols (Critchley et al., 2004; Thayer et al., 2012). A feasible near-term version would not attempt to endow AI with genuine interoception; rather, it would use opt-in human physiological signals as contextual inputs in tightly bounded co-creative settings, building on precedents from affect-aware systems that already estimate human state from physiological data. This should be read as a testable interface proposal, not as a claim that current AI can reproduce human affective embodiment. To avoid model collapse caused by training on self-generated data (Shumailov et al., 2024), novelty should be filtered through multi-agent, heterogeneous reception rather than a single average preference, leveraging diversity's first-order benefits and partially connected evaluation networks that preserve exploration (Derex and Boyd, 2016). In parallel, interactions among humans and AIs can incorporate synchrony-like signals—overlap in attentional allocation, coordinated response latencies, and measures of interpretive agreement—coupled to ethical metrics so that the admissibility of deviations is tuned by the state of the shared field. A practical illustration is the use of generative design-assistance tools in commercial workflows, where AI-generated variations are surfaced for human selection and revision under user control and responsible-AI guardrails rather than being treated as autonomously valid outputs. A further lesson is to regulate “just-right” uncertainty. Inspired by the Goldilocks effect, systems should maintain moderate predictive variability—neither boring nor chaotic—by adaptively controlling sampling temperature, search breadth, and presentation order (Kidd et al., 2012). Conceptually, these ingredients align with Collective Predictive Coding: meanings and norms emerge through interaction (Taniguchi et al., 2016, 2025), and the social field dynamically assigns precision to deviations—accepting, reshaping, or rejecting them in context. Operationalizing these mechanisms would begin to instantiate FLI in AI, opening a path toward systems that cohere with human social dynamics rather than merely optimizing in isolation.
Several limitations qualify the scope of this review. First, the strongest empirical demonstrations reviewed here come from music-cognition studies, so the present framework should be read as a music-grounded proposal rather than a cross-domain law. The extent to which the proposed mechanisms generalize to other coordination-intensive domains—scientific collaboration, education, or collective decision-making—remains uncertain, and the rhythmic invariances discussed here are best treated as domain-specific observations until analogous tests are conducted outside music. Even where cross-cultural similarities in bodily maps for music are observed, cultural ecologies may still shape norms for “admissible deviation” and sanctioning in ways this paper has not fully captured. Second, constructs such as FLI and admissibility of deviation risk remaining theoretical unless tied to operational definitions. They should be validated against longitudinal outcomes—innovation uptake, retention of knowledge—and aligned with existing findings that partially connected networks help preserve diversity (Derex and Boyd, 2016), thereby providing a coherent evaluative framework. Third, while the “Goldilocks” hypothesis for creative engagement is promising, the optimal uncertainty is likely task-, expertise-, and risk-dependent, plausibly nonlinear and hysteretic. Infant learning and musical groove studies establish the principle (Kidd et al., 2012; Vuust et al., 2022b), but parameterizing it for complex adult decisions will require closed-loop, adaptively tuned experimental designs. Fourth, our AI proposals—synchrony-like interaction signals and bio-informed sensory–affective alignment—are, at best, proxies for interoception rather than true interoceptive generative models. Their limits must be quantified against strong non-embodied baselines to demonstrate the incremental value of embodied proxies. Finally, there is no standard benchmark for FLI. Community-agreed tasks and metrics are needed to jointly test collapse resistance under self-generated data, diversity preservation, and socially accepted novelty under partial connectivity; without such shared benchmarks, cumulative progress will be difficult.
Acknowledging these constraints, this paper nevertheless advances a framework that centers FLI—beyond individual internal models—as a key to the persistence of creativity in human societies. Using music as a tractable model, this paper has highlighted how dynamic fluctuations between prediction and deviation are socially shared via bodily sensation and empathy, and how this circulation sustains creative capacity. This paper further argued that these insights can be extended to scientific collaboration, education, and human–AI co-creativity, offering a theoretical basis for explaining why creativity does not merely endure across generations but is actively renewed.
5. Conclusion
Creativity persists in human societies not as a property of isolated minds but as a phenomenon distributed across bodies, artifacts, and social relations. Viewing creativity through the lens of predictive processing and musical practice reveals a simple but powerful logic: moderate, time-varying uncertainty ignites exploration; interoceptive sensation and empathic uptake translate deviation into shared value; and partially connected social structures preserve diversity while preventing premature convergence. In this dynamic, music functions as a collective prediction device, making visible a putative operational signature of FLI.
This review synthesized evidence that, in music and related coordination-rich settings, prediction–deviation dynamics are stabilized and renewed by bodily synchrony, shared grammars, and socially negotiated admissibility. It further proposes—rather than demonstrates—that some of these mechanisms may generalize as design principles, provided that domain-specific norms, risks, and ethical costs are explicitly modeled.
A concrete research agenda follows. First, formalize field-level constructs—social precision weighting, admissibility of deviation, collapse resistance—into operational metrics and community benchmarks. Second, develop closed-loop, ecologically valid experiments that manipulate uncertainty and synchrony while tracking longitudinal outcomes such as innovation uptake and knowledge retention. Third, prototype sensory–affective alignment layers that are strictly opt-in, privacy-preserving, and auditable, and evaluate their value over strong non-embodied baselines. Finally, embed these components within collective predictive coding frameworks so that meanings and norms can emerge and stabilize through interaction rather than prescription.
In sum, the persistence of creativity is supported not solely within individual brains but across the field—through interoceptive, empathic coupling and culturally shared musical grammars and bodily maps. Music makes these dynamics especially visible: moderate uncertainty elicits a sense of creativity that, via bodily synchrony and empathy, is socially shared and stabilized. Within music, this may represent an operational signature of FLI; whether comparable signatures hold in other creative domains remains an empirical question.
Funding Statement
The author(s) declared that financial support was received for this work and/or its publication. This research was supported by the JSPS KAKENHI (25K03206, 25H00448, 24K22302), Japan. The funding sources had no role in the decision to publish or prepare the manuscript.
Footnotes
Edited by: Miguel Pic, University of Valladolid, Spain
Reviewed by: Ezra N. S. Lockhart, Northcentral University, United States
Francisco Fialho, Federal University of Santa Catarina, Brazil
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
TD: Writing – review & editing, Investigation, Conceptualization, Validation, Software, Supervision, Funding acquisition, Methodology, Formal analysis, Writing – original draft, Project administration, Data curation, Visualization, Resources.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was used in the creation of this manuscript. For language proofreading and stylistic refinement of the manuscript's text. The author(s) reviewed, verified, and take full responsibility for all generated content.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2026.1752280/full#supplementary-material
References
- Barsalou L. W. (2008). Grounded cognition. Annu. Rev. Psychol. 59, 617–645. doi: 10.1146/annurev.psych.59.103006.093639 [DOI] [PubMed] [Google Scholar]
- Bender E. M., Gebru T., McMillan-Major A., Shmitchell S. (2021). “On the dangers of stochastic parrots: can language models be too big?,” in Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency 610-623. doi: 10.1145/3442188.3445922 [DOI] [Google Scholar]
- Brown T., Mann B., Ryder N., Subbiah M., Kaplan J. D., Dhariwal P., et al. (2020). Language models are few-shot learners. Adv. Neural Inform. Process. Syst. 33, 1877–1901. [Google Scholar]
- Calvo R. A., D'Mello S. (2010). Affect detection: An interdisciplinary review of models, methods, and their applications. IEEE Transactions on. Affect.ive Comput. 1, 18–37. doi: 10.1109/T-AFFC.2010.1 [DOI] [Google Scholar]
- Clark A., Chalmers D. (1998). The extended mind. Analysis 58, 7–19. doi: 10.1093/analys/58.1.7 [DOI] [Google Scholar]
- Critchley H. D., Wiens S., Rotshtein P., Öhman A., Dolan R. J. (2004). Neural systems supporting interoceptive awareness. Nature Neurosci. 7, 189–195. doi: 10.1038/nn1176 [DOI] [PubMed] [Google Scholar]
- Cropley D. H., Kaufman J. C., Cropley A. J. (2008). Malevolent creativity: A a functional model of creativity in terrorism and crime. Creat. Res. J. 20, 105–115. doi: 10.1080/10400410802059424 [DOI] [Google Scholar]
- Daikoku T. (2023). Temporal dynamics of statistical learning in children's song contributes to phase entrainment and production of novel information in multiple cultures. Sci. Rep. 13:18041. doi: 10.1038/s41598-023-45493-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Daikoku T. (2024). Temporal dynamics of uncertainty and prediction error in musical improvisation across different periods. Sci. Rep. 14:22297. doi: 10.1038/s41598-024-73689-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Daikoku T., Tanaka M. (2026). Body maps for feeling of creativity in musical chord progression. Think. Skills Creat. 60:102103. doi: 10.1016/j.tsc.2025.102103 [DOI] [Google Scholar]
- Daikoku T., Kamermans K., Minatoya M. (2023). Exploring cognitive individuality and the underlying creativity in statistical learning and phase entrainment. EXCLI. J. 22:828. doi: 10.1101/2023.07.29.551091 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Daikoku T., Minatoya M., Tanaka M. (2026). Mapping emotional feeling in the body: a tripartite framework for understanding the embodied mind. Neurosci. Biobehav. Rev. 180:106469. doi: 10.1016/j.neubiorev.2025.106469 [DOI] [PubMed] [Google Scholar]
- Daikoku T., Tanaka M. (2024). Protocol to visualize a bodily map of musical uncertainty and prediction. STAR Protoc. 5:103473. doi: 10.1016/j.xpro.2024.103473 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Daikoku T., Tanaka M., Yamawaki S. (2024). Bodily maps of uncertainty and surprise in musical chord progression and the underlying emotional response. Iscience 27:109498. doi: 10.1016/j.isci.2024.109498 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dean L. G., Kendal R. L., Schapiro S. J., Thierry B., Laland K. N. (2012). Identification of the social and cognitive processes underlying human cumulative culture. Science. 335, 1114–1118. doi: 10.1126/science.1213969 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Derex M., Boyd R. (2016). Partial connectivity increases cultural accumulation within groups. Proc. Nat. Acad. Sci. 113, 2982–2987. doi: 10.1073/pnas.1518798113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friston K. (2010). The free-energy principle: a unified brain theory? Nature Rev. Neurosci. 11, 127–138. doi: 10.1038/nrn2787 [DOI] [PubMed] [Google Scholar]
- Fukushima H., Terasawa Y., Umeda S. (2011). Association between interoception and empathy: evidence from heartbeat-evoked brain potential. Int. J. Psychophysiol. 79, 259–265. doi: 10.1016/j.ijpsycho.2010.10.015 [DOI] [PubMed] [Google Scholar]
- Galinsky A. D., Maddux W. W., Gilin D., White J. B. (2008). Why it pays to get inside the head of your opponent: the differential effects of perspective taking and empathy in negotiations. Psychol. Sci. 19, 378–384. doi: 10.1111/j.1467-9280.2008.02096.x [DOI] [PubMed] [Google Scholar]
- Gehman S., Gururangan S., Sap M., Choi Y., Smith N. A. (2020). realtoxicityprompts: evaluating neural toxic degeneration in language models. Findings of EMNLP 2020, 3356–3369. doi: 10.18653/v1/2020.findings-emnlp.301 [DOI] [Google Scholar]
- Gilin D., Maddux W. W., Carpenter J., Galinsky A. D. (2013). When to use your head and when to use your heart: The differential value of perspective-taking versus empathy in competitive interactions. Pers. Soc. Psychol. Bull. 39, 3–16. doi: 10.1177/0146167212465320 [DOI] [PubMed] [Google Scholar]
- Gino F., Ariely D. (2012). The dark side of creativity: original thinkers can be more dishonest. J. Pers. Soc. Psychol. 102:445. doi: 10.1037/a0026406 [DOI] [PubMed] [Google Scholar]
- Graham J., Haidt J., Motyl M., Meindl P., Iskiwitch C., Mooijman M. (2018). “Moral Foundations Theory: On the Advantages of Moral Pluralism Over Moral Monism,” in Atlas of moral psychology, eds. K. Gray and J. Graham (New York, NY: The Guilford Press; ), 211–222. [Google Scholar]
- Harnad S. (1990). The symbol grounding problem. Phys. D Nonlinear Phenomena 42, 335–346. [Google Scholar]
- Hasson U., Ghazanfar A. A., Galantucci B., Garrod S., Keysers C. (2012). Brain-to-brain coupling: a mechanism for creating and sharing a social world. Trends Cogn. Sci. 16, 114–121. doi: 10.1016/j.tics.2011.12.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hastings W. K. (1970). Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57, 97–109. doi: 10.1093/biomet/57.1.97 [DOI] [Google Scholar]
- Hennessey B. A., Amabile T. M. (2010). Creativity. Annu. Rev. Psychol. 61:569-98. doi: 10.1146/annurev.psych.093008.100416 [DOI] [PubMed] [Google Scholar]
- Hong L., Page S. E. (2004). Groups of diverse problem solvers can outperform groups of high-ability problem solvers. Proc. Nat. Acad. Sci. 101, 16385–16389. doi: 10.1073/pnas.0403723101 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hove M. J., Risen J. L. (2009). It's all in the timing: interpersonal synchrony increases affiliation. Soc. Cogn. 27, 949–961. doi: 10.1521/soco.2009.27.6.949 [DOI] [Google Scholar]
- Janata P., Tomic S. T., Haberman J. M. (2012). Sensorimotor coupling in music and the psychology of the groove. J. Exp. Psychol. Gen. 141:54. doi: 10.1037/a0024208 [DOI] [PubMed] [Google Scholar]
- Jinek M., Chylinski K., Fonfara I., Hauer M., Doudna J. A., Charpentier E. (2012). A programmable dual-RNA–guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816–821. doi: 10.1126/science.1225829 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kidd C., Piantadosi S. T., Aslin R. N. (2012). The Goldilocks effect: human infants allocate attention to visual sequences that are neither too simple nor too complex. PloS ONE 7:e36399. doi: 10.1371/journal.pone.0036399 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirschner S., Tomasello M. (2010). Joint music making promotes prosocial behavior in 4-year-old children. Evol. Hum. Behav. 31, 354–364. doi: 10.1016/j.evolhumbehav.2010.04.004 [DOI] [Google Scholar]
- Knill D. C., Pouget A. (2004). The Bayesian brain: the role of uncertainty in neural coding and computation. Trends Neurosci. 27, 712–719. doi: 10.1016/j.tins.2004.10.007 [DOI] [PubMed] [Google Scholar]
- Koelsch S. (2014). Brain correlates of music-evoked emotions. Nature Rev. Neurosci. 15, 170–180. doi: 10.1038/nrn3666 [DOI] [PubMed] [Google Scholar]
- Koelsch S., Vuust P., Friston K. (2019). Predictive processes and the peculiar case of music. Trends Cogn. Sci. 23, 63–77. doi: 10.1016/j.tics.2018.10.006 [DOI] [PubMed] [Google Scholar]
- Koelstra S., Muhl C., Soleymani M., Lee J. S., Yazdani A., Ebrahimi T., et al. (2011). Deap: A database for emotion analysis; using physiological signals. IEEE Trans. Affect. Comput. 3, 18–31. doi: 10.1109/T-AFFC.2011.15 [DOI] [Google Scholar]
- Konvalinka I., Xygalatas D., Bulbulia J., Schjødt U., Jegindø E. M., Wallot S., et al. (2011). Synchronized arousal between performers and related spectators in a fire-walking ritual. Proc. Nat. Acad. Sci. 108, 8514–8519. doi: 10.1073/pnas.1016955108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Körding K. P., Wolpert D. M. (2004). Bayesian integration in sensorimotor learning. Nature 427, 244–247. doi: 10.1038/nature02169 [DOI] [PubMed] [Google Scholar]
- Lang M., Bahna V., Shaver J. H., Reddish P., Xygalatas D. (2017). Sync to link: Endorphin-mediated synchrony effects on cooperation. Biol. Psychol. 127, 191–197. doi: 10.1016/j.biopsycho.2017.06.001 [DOI] [PubMed] [Google Scholar]
- Lockhart E. N. (2025). Creativity in the age of AI: the human condition and the limits of machine generation. J. Cult. Cogn. Sci. 9, 83–88. doi: 10.1007/s41809-024-00158-2 [DOI] [Google Scholar]
- Maess B., Koelsch S., Gunter T. C., Friederici A. D. (2001). Musical syntax is processed in Broca's area: An MEG study. Nat. Neurosci. 4, 540–545. doi: 10.1038/87502 [DOI] [PubMed] [Google Scholar]
- Mehr S. A., Singh M., Knox D., Ketter D. M., Pickens-Jones D., Atwood S., et al. (2019). Universality and diversity in human song. Science 366:eaax0868. doi: 10.1126/science.aax0868 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Metropolis N., Rosenbluth A. W., Rosenbluth M. N., Teller A. H., Teller E. (1953). Equation of state calculations by fast computing machines. J. Chem. Phys. 21, 1087–1092. doi: 10.1063/1.1699114 [DOI] [PubMed] [Google Scholar]
- Mogan R., Fischer R., Bulbulia J. (2017). To be in synchrony or not? Meta-analysis of synchrony's effects on behavior, perception, cognition and affect. J. Exp. Soc. Psychol. 72, 13–20. doi: 10.1016/j.jesp.2017.03.009 [DOI] [Google Scholar]
- Mumford M. D., Waples E. P., Antes A. L., Brown R. P., Connelly S., Murphy S. T., et al. (2010). Creativity and ethics: the relationship of creative and ethical problem-solving. Creat. Res. J. 22, 74–89. doi: 10.1080/10400410903579619 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nummenmaa L., Glerean E., Hari R., Hietanen J. K. (2014). Bodily maps of emotions. Proc. Nat. Acad. Sci. 111, 646–651. doi: 10.1073/pnas.1321664111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ouyang L., Wu J., Jiang X., Almeida D., Wainwright C., Mishkin P., et al. (2022). Training language models to follow instructions with human feedback. Adv. Neural Inf. Process. Syst. 35, 27730–27744. doi: 10.52202/068431-2011 [DOI] [Google Scholar]
- Pearce M. T., Wiggins G. A. (2012). Auditory expectation: the information dynamics of music perception and cognition. Top. Cogn. Sci. 4, 625–652. doi: 10.1111/j.1756-8765.2012.01214.x [DOI] [PubMed] [Google Scholar]
- Putkinen V., Zhou X., Gan X., Yang L., Becker B., Sams M., et al. (2024). Bodily maps of musical sensations across cultures. Proc. Nat. Acad. Sci. 121:e2308859121. doi: 10.1073/pnas.2308859121 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rennung M., Göritz A. S. (2016). Prosocial consequences of interpersonal synchrony. Z Psychol. doi: 10.1027/2151-2604/a000252 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Runco M. A. (2023). AI can only produce artificial creativity. J. Creat. 33:100063. doi: 10.1016/j.yjoc.2023.100063 [DOI] [Google Scholar]
- Salganik M. J., Dodds P. S., Watts D. J. (2006). Experimental study of inequality and unpredictability in an artificial cultural market. Science. 311, 854–856. doi: 10.1126/science.1121066 [DOI] [PubMed] [Google Scholar]
- Salimpoor V. N., Benovoy M., Dagher A., Zatorre R. J. (2011). Anatomically distinct dopamine release during anticipation and experience of peak emotion to music. Nat. Neurosci. 14, 257–262. doi: 10.1038/nn.2726 [DOI] [PubMed] [Google Scholar]
- Seth A. K. (2013). Interoceptive inference, emotion, and the embodied self. Trends Cogn. Sci. 17, 565–573. doi: 10.1016/j.tics.2013.09.007 [DOI] [PubMed] [Google Scholar]
- Shumailov I., Shumaylov Z., Zhao Y., Papernot N., Anderson R., Gal Y. (2024). AI models collapse when trained on recursively generated data. Nature 631, 755–759. doi: 10.1038/s41586-024-07566-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stephens G. J., Silbert L. J., Hasson U. (2010). Speaker–listener neural coupling underlies successful communication. Proc. Nat. Acad. Sci. 107, 14425–14430. doi: 10.1073/pnas.1008662107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stupacher J., Witek M. A. G., Vuust P., Schroeder S. C., Kringelbach M. L. (2022). Music, rhythm, and the brain: Synchronization, anticipation, and the pleasure of groove. Neurosci. Biobehav. Rev. 137:104619.35331819 [Google Scholar]
- Taniguchi T. (2025). Beyond individuals: collective predictive coding for memory, attention, and the emergence of language. Cogn. Neurosci. 2518942, 1-2. doi: 10.1080/17588928.2025.2518942 [DOI] [PubMed] [Google Scholar]
- Taniguchi T., Nagai T., Nakamura T., Iwahashi N., Ogata T., Asoh H. (2016). Symbol emergence in robotics: A survey. Adv. Robot. 30, 706–728. doi: 10.1080/01691864.2016.1164622 [DOI] [Google Scholar]
- Taniguchi T., Takagi S., Otsuka J., Hayashi Y., Hamada H. T. (2025). Collective predictive coding as model of science: Formalizing scientific activities towards generative science. R. Soc. Open Sci. 12:241678. doi: 10.1098/rsos.241678 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tarr B., Launay J., Dunbar R. I. (2014). Music and social bonding:“self-other” merging and neurohormonal mechanisms. Front. Psychol. 5:1096. doi: 10.3389/fpsyg.2014.01096 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thayer J. F., Åhs F., Fredrikson M., Sollers III J. J., Wager T. D. (2012). A meta-analysis of heart rate variability and neuroimaging studies: implications for heart rate variability as a marker of stress and health. Neurosci. Biobehav. Rev. 36, 747–756. doi: 10.1016/j.neubiorev.2011.11.009 [DOI] [PubMed] [Google Scholar]
- Uzzi B., Spiro J. (2005). Collaboration and creativity: the small world problem. Am. J. Sociol. 111, 447–504. doi: 10.1086/432782 [DOI] [Google Scholar]
- Volynets S., Glerean E., Hietanen J. K., Hari R., Nummenmaa L. (2020). Bodily maps of emotions are culturally universal. Emotion 20:1127. doi: 10.1037/emo0000624 [DOI] [PubMed] [Google Scholar]
- Vuust P., Heggli O. A., Friston K. J., Kringelbach M. L. (2022a). Music in the brain. Nature Rev. Neurosci. 23, 287–305. doi: 10.1038/s41583-022-00578-5 [DOI] [PubMed] [Google Scholar]
- Vuust P., Witek M. A. G., Dietz M. J., Kringelbach M. L., Overy K. (2022b). Now you hear it: A predictive coding model of music perception. Psychol. Rev. 129, 234–259. [Google Scholar]
- Wang C. S., Kenneth T., Ku G., Galinsky A. D. (2014). Perspective-taking increases willingness to engage in intergroup contact. PloS ONE 9:e85681. doi: 10.1371/journal.pone.0085681 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weidinger L., Uesato J., Rauh M., Griffin C., Huang P. S., Mellor J., et al. (2022). “Taxonomy of risks posed by language models,” in Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency, 214-229. doi: 10.1145/3531146.3533088 [DOI] [Google Scholar]
- Weinstein D., Launay J., Pearce E., Dunbar R. I., Stewart L. (2016). Singing and social bonding: changes in connectivity and pain threshold as a function of group size. Evol. Hum. Behav. 37, 152–158. doi: 10.1016/j.evolhumbehav.2015.10.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiltermuth S. S., Heath C. (2009). Synchrony and cooperation. Psychol. Sci. 20, 1–5. doi: 10.1111/j.1467-9280.2008.02253.x [DOI] [PubMed] [Google Scholar]
- Woolley A. W., Chabris C. F., Pentland A., Hashmi N., Malone T. W. (2010). Evidence for a collective intelligence factor in the performance of human groups. Science 330, 686–688. doi: 10.1126/science.1193147 [DOI] [PubMed] [Google Scholar]
- Wuchty S., Jones B. F., Uzzi B. (2007). The increasing dominance of teams in production of knowledge. Science 316, 1036–1039. doi: 10.1126/science.1136099 [DOI] [PubMed] [Google Scholar]
- Zhang T., Gino F., Margolis J. D. (2018). Does “could” lead to good? On the road to moral insight. Acad. Manag. J. 61, 857–895. doi: 10.5465/amj.2014.0839 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.

