Abstract
Communication depends on precise coordination between motor execution and cognition. Here we reveal that the cerebellum exerts real-time control over social vocalizations in adult mice. Optogenetic activation of excitatory cerebellar output in the superior cerebellar peduncle suppressed ultrasonic vocalizations with frequency-dependent potency while inducing distinct motor phenotypes. Systematic, functional mapping across cerebellar regions and cell types revealed that vocal suppression can occur in the absence of overt motor impairment, suggesting selective control of vocal output beyond gross movement disruption. We identified the periaqueductal gray (PAG), a conserved midbrain vocal control center, as a key downstream mediator of this effect. Deep brain stimulation of the PAG rescued vocal deficits in a model of cerebellar dysfunction without rescuing motor incoordination. These findings define a cerebellar–midbrain pathway that gates vocal behavior and demonstrate that targeted therapeutic neuromodulation can selectively restore communication-related output even in the presence of persistent cerebellar-driven motor deficits.
Language acquisition is a defining feature of human development, linking cognition to social communication. The neural circuits that govern language span distributed cortical networks (1, 2), with the cerebellum often being overlooked. However, cerebellar participation in language (3–8) and other cognitive domains (9–12) has been increasingly recognized. As a critical hub that integrates motor and non-motor functions, cerebellar connectivity with the cerebral cortex dynamically adapts across development (13, 14) in parallel with the emergence of higher-order cognition and coordinated motor behavior. Human imaging studies underscore the complex relationship between cerebellar circuits and behavior across functional domains (15, 16), raising the question of how specific cerebellar regions contribute to distinct behavioral processes.
Many neurodevelopmental and neurodegenerative conditions that may be rooted in cerebellar dysfunction present with co-occurring motor and non-motor challenges that profoundly affect quality of life (17). Autism spectrum disorder (ASD), for example, is characterized primarily by differences in social communication, alongside variable motor and repetitive behavioral features, with converging evidence implicating cerebellar involvement (18–24). Despite this association, it remains unclear whether cerebellar circuits that influence communication are separable from those that control movement, or whether non-motor deficits arise secondarily from disrupted motor output. Parsing cerebellar circuits that differentially influence movement and communication is therefore a critical step toward uncovering mechanisms of cognitive dysfunction and identifying therapeutic targets. Animal models provide a powerful platform to address this problem using temporally precise circuit-level manipulations.
Ultrasonic vocalizations (USVs) in mice provide a sensitive and quantifiable readout of vocal output during social interaction and are widely used as a proxy for social communication. Whether cerebellar circuits directly influence ongoing adult vocalizations, however, has not been established. To address this, we manipulated cerebellar output at its primary efferent pathway to determine whether vocal behavior could be acutely modulated. We find that cerebellar output rapidly and reversibly suppresses USVs in adult mice, demonstrating that the cerebellum can directly influence ongoing vocal behavior. This suppression can be mediated through projections to the periaqueductal gray (PAG), a conserved midbrain vocal control center. We then systematically mapped cerebellar subregions and cell types using frequency-specific optogenetic stimulation, revealing that modulation of vocal output does not uniformly track with overt motor phenotypes, including tremor and motor incoordination.
Cerebellar excitatory output gates vocal communication in mice
The immense computational processing that occurs within the cerebellar cortex may be distilled into its main output neurons. In the rodent, billions of granule cells make hundreds of thousands of synapses onto individual Purkinje cells, which are additionally contacted by several types of interneurons and one or two functionally potent climbing fibers. This extensive convergence enables Purkinje cells to integrate diverse synaptic inputs and translate them into a unified output signal. As the sole output neurons of the cerebellar cortex, Purkinje cells transmit this integrated activity via their axons (with local collaterals) predominantly to the cerebellar nuclei. Similarly, the main output pathway of the entire cerebellum is contained within the superior cerebellar peduncle (SCP), a white matter tract formed by the projections of cerebellar nuclei neurons that target multiple extra-cerebellar brain regions (25).
We capitalized on this organized output architecture by expressing channelrhodopsin (ChR2) in excitatory output cells of the cerebellar nuclei, enabling us to directly manipulate cerebellar signaling, and determine whether it influenced adult rodent vocalizations in a social behavior paradigm. As Atoh1 is expressed in excitatory neurons of the cerebellum, including granule cells and nuclei neurons (26, 27), we used it to drive Cre-dependent ChR2 expression. 3D light sheet imaging confirmed strong expression of Atoh1-tdTomato in excitatory cells of the cerebellar cortex (Fig. 1A, Movie S1). To selectively target cerebellar output, optic fibers were placed bilaterally over the SCP after it exits the cerebellar nuclei. This white matter tract contains the axons of cerebellar nuclei neurons and is spatially segregated from the cerebellar cortex by hundreds of microns (Movie S1). As a result, optical stimulation at this location only engages ChR2-expressing output axons. This strategy enables selective manipulation of cerebellar output pathways despite broader ChR2 expression in the cortex.
Figure 1. Cerebellar excitatory output via the superior cerebellar peduncle acutely suppresses ultrasonic vocalizations.
(A) Light-sheet reconstruction of Atoh1-tdTomato expressing cerebellar excitatory neurons and their axonal projections. Bilateral optical fibers are positioned above the superior cerebellar peduncle (SCP) to selectively target cerebellar excitatory output pathways while avoiding stimulation of granule cells, Purkinje cells or inhibitory nuclei cells (Movie S1).
(B) Behavioral assay schematic. Male mice implanted bilaterally above the SCP are recorded in a sound attenuated chamber during social interaction with a female. Ultrasonic vocalizations (USVs) are detected using an ultrasound microphone positioned above the chamber, synchronized with video tracking. Optogenetic stimulation consists of 10 ms square-wave light pulses delivered at 10, 20, 30 or 50 Hz during defined behavioral epochs. Representative USV traces are shown below the chamber schematic.
(C) Time course of USV production aligned to female entry. Optogenetic stimulation of SCP excitatory output (blue shading) produces a rapid, frequency-dependent suppression of USVs. Quantification of the number of USVs produced during the first 30 s of stimulation revealed a significant effect of stimulation condition (one-way ANOVA, F(6,77) = 25.64, p < 0.0001). Dunnett’s multiple comparisons test showed that each stimulation frequency differed significantly from baseline (10 Hz: p < 0.0001; 20 Hz: p < 0.0001; 30 Hz: p < 0.0001; 50 Hz: p < 0.0001). Higher stimulation frequencies resulted in near-complete vocal silencing that often persisted beyond stimulation offset. (c’) Ridge plots depict individual animals (the same line across trials) tracked across stimulation frequencies. When stimulation is initiated 10 s after female entry, USVs are abruptly silenced and remain suppressed in a subset of animals following termination of stimulation. Each line represents the same animal across frequencies on separate days. N = 12 animals.
(D) Social interaction quantified by the distance between the centers (torsos, Movie S3) of interacting animals during the first 0–30 s of stimulation and the subsequent 30–60 s interval. During the 0–30 s stimulation window, inter-animal distance differed significantly across conditions (one-way ANOVA, F(4,55) = 8.903, p < 0.0001). Dunnett’s post hoc comparisons revealed significant increases in distance relative to baseline at all stimulation frequencies (10 Hz: p = 0.0038; 20 Hz: p = 0.0018; 30 Hz: p = 0.0004; 50 Hz: p < 0.0001). During the 30–60 s post-stimulation interval, a smaller but significant effect of stimulation condition persisted (one-way ANOVA, F(4,55) = 3.042, p = 0.0245), with Dunnett’s comparisons indicating continued differences from baseline (30 Hz: p = 0.044; 50 Hz: p = 0.014). Error bars indicate mean ± SD; individual points represent animals. Additional comparisons shown in Fig. S1. N = 12 animals.
(E) Composition of ultrasonic call types classified using VocalMat (27). Heatmaps depict the centered log-ratio (CLR) difference between stimulation and baseline conditions for the first 100 calls following vocal onset in each session. CLR values were computed per animal and averaged across animals within each stimulation frequency (10–50 Hz). Only animals producing ≥100 calls in both baseline and stimulation sessions were included (N = 10, 9, 8, and 6 animals for 10, 20, 30, and 50 Hz, respectively). No call type reached significance after false discovery rate correction when assessed using paired-sample effect sizes across animals; however, stimulation consistently biased vocal output toward short calls, indicative of a restricted vocal repertoire following SCP activation. Corresponding control animal distributions are shown in Fig. S1. Color scale denotes ΔCLR (stimulation − baseline).
For this experiment, male mice with bilateral SCP implants were habituated in a sound attenuation box containing an ultrasound microphone and camera to monitor USVs, social interactions, and motor behavior (Fig. 1B). In this behavioral paradigm, the males drive USVs, whereas females rarely vocalize (28–30). We focus on males due to their reliability to vocalize at specific time points that can be aligned with optogenetic stimulation during a social behavior. At baseline (Fig. 1C, gray), all 12 males (previously exposed to a female), began USV emissions after entry of a female into the chamber (without any before). Over the course of five minutes, USVs gradually tapered off, making the initial 30 seconds following female entry the period with the highest number and most reliable period for USV generation. We therefore chose to drive excitatory cerebellar output stimulation for 30 seconds preceding, and 30 seconds after female entry, when males are most motivated to vocalize. Increasing stimulation frequencies (10, 20, 30, and 50 Hz) showed higher likelihood to silence USVs (Fig. 1C, c’), with persistent silencing at higher frequencies, even after stimulation was ceased. Twelve control mice without ChR2 expression maintained USVs across each trial (Fig. S1). Each stimulation frequency and behavioral assay were performed on separate days using novel females to reduce variability of USV emissions. Each line in Fig. 1c’ represents the same male across trials.
To ensure that daily stimulation of the entire cerebellar excitatory output did not damage the ability of mice to generate appropriate USVs, and to determine the time-course of USV silencing effects, we next delayed the stimulation until 10 seconds after the female entered the chamber and observed that all males again increased USV number (Fig. 1 c’, orange). Immediately upon 30 Hz light pulse delivery USVs were robustly silenced throughout the 30 second stimulation, and the stimulation was sufficient to silence USVs for the duration of the assay in several mice. The observed silencing effect was not lateralized, as restricting the stimulation to either side could drive the effect (Fig. S1). Further, social interactions paralleled the USV changes, in that mice at baseline stayed very close together (measured by center-to-center distance, additional analyses in Fig. S1), while mice spent further time away from each other at increasing stimulation intensities (Fig. 1D). This effect manifested more intensely with higher stimulation frequencies, with males primarily avoiding females, while females explored freely (Movie S2). When USVs (first 100 emissions) resumed, we noted a tendency toward more up-frequency modulation and shorter calls (Fig. 1E). Overall, these data show that targeted activation of cerebellar excitatory output silences USVs and leads to decreased sociability in male mice.
Functional mapping of segregated vocal and motor domains
Targeting excitatory projections within the SCP robustly suppressed USVs across stimulation frequencies; however, distinct and frequency-dependent motor phenotypes emerged in parallel. At 10 Hz, stimulation produced a prominent body-wide tremor. At 20 Hz, no overt motor phenotype was apparent, although a low-power tremor was detectable using an accelerometer-based tremor monitor (Fig. S2). At 30 Hz, neither tremor nor gross motor abnormalities were observed, whereas at 50 Hz, mild motor incoordination emerged. Stimulation at all frequencies increased inter-animal distance (Fig. 1D, Movie S2).
Because the SCP contains axons that originate from multiple cerebellar nuclei, it was unclear whether the specific circuits responsible for USV suppression were the same as those producing motor phenotypes. To disentangle these effects, we constructed a “phenotypic” heat map (Fig. 2) that summarized the presence or absence of vocal, social, and motor phenotypes across stimulation frequency, cerebellar region, and cell type. This structure-function mapping revealed a dissociation between vocal suppression and motor dysfunction. Notably, a robust 10 Hz tremor was elicited by stimulation of every cerebellar region tested and across excitatory neurons, inhibitory SCP projection neurons, and Purkinje cells, revealing that tremor is not strictly localized to a particular region or cell type. However, only a subset of these manipulations affected USVs. For example, stimulation of excitatory neurons in the dentate nucleus and/or Purkinje cell projections targeting the dentate nucleus reliably induced tremor, yet animals continued to generate appropriate USVs (Fig. 2, Fig. S3) and maintained social distance. Stimulation of SCP excitatory projections suppressed USVs and increased social distance when tremor was present, but stimulation of SCP inhibitory projections caused tremor with no effect on USVs or social interaction. Therefore, tremor alone was insufficient to account for deficits in social communication and vocal output, likely reflecting the downstream impact of these different pathways on particular regions.
Figure 2. Frequency-specific functional mapping of social and motor phenotypes across cerebellar cells and regions.
Binary summary of behavioral phenotypes elicited by optogenetic stimulation of distinct cerebellar subregions and cell types at 10, 20, 30, or 50 Hz. Phenotypic outcomes varied systematically with stimulation frequency, cell type, and anatomical target, revealing a frequency-dependent functional organization of cerebellar signaling.
Ultrasonic vocalizations (USVs) were quantified using the machine-learning–based classifier VocalMat. Given the exploratory nature of this dataset and limited sample sizes for several stimulation targets, phenotypes were operationally defined using within-group consistency criteria rather than population-level statistical inference. Specifically, for groups with N < 5, a USV phenotype was scored as present if all animals exhibited a ≥20% reduction relative to baseline during stimulation (Fig. S3). Social disengagement was assessed by changes in center-to-center distance (Movie S3), with a phenotype assigned when all animals in a group showed a ≥12% increase in distance (Fig. S3). Superior cerebellar peduncle comparisons are shown in Fig. 1A and periaqueductal gray comparisons are shown in Fig. 3E.
Tremor was identified through direct behavioral observation and independently confirmed using an accelerometer-based tremor monitor (Fig. S2) and marked positive when physiological tremor (increased power near 10 Hz) was present. Motor incoordination was classified as mild (ataxia) or severe (dystonia) based on overt abnormalities in posture and movement evident in behavioral videos (Movie S4). Severe motor incoordination co-occurred with increased social distance, consistent with impaired motor control contributing to apparent social disengagement under these conditions.
Colored boxes indicate the presence of a phenotype under the specified stimulation condition, whereas white boxes indicate its absence. Numbers in parentheses denote the number of animals tested for each stimulation target. Quantitative analyses underlying each behavioral measure are provided in the supplementary figures. Bilateral optical stimulation was performed in Atoh1Cre (excitatory output neurons and axon terminals in the PAG), Pcp2Cre (Purkinje cells), and Ptf1aCre (inhibitory output neurons) mice crossed to Rosa26lsl-ChR2-eYFP. The PAG was targeted either bilaterally or unilaterally.
Further arguing against motor dysfunction alone physically constraining vocal output, severe motor incoordination that resembled dystonia could also co-occur with USV suppression (Crus I/II Purkinje axon bundle), or with normal vocalizations (vermis VI). Frequencies of at least 50 Hz were required to produce severe motor incoordination resembling dystonia, while ataxia could be produced with stimulation as low as 20 Hz when targeting Purkinje cells to the fastigial or interposed nuclei. Additionally, within the fastigial and interposed, a complicated mixture of motor phenotypes was apparent (Movie S3), with co-occurring dystonia, ataxia, and tremor similar to our previous findings (32). CrusI and CrusII are heavily implicated in cerebellar associations to language (8, 33) and targeting the CrusI/II projection bundle en route to the nuclei led to silencing of USVs, while individually targeting subsets of cells within the lobules of Crus I or Crus II did not. The lateral areas of lobule IV/V (in the paravermis) did not silence USVs, but targeting the same lobule (IV/V) in the vermis reduced USVs during motor incoordination episodes. In some regions (Crus I/II bundle), 20 Hz did not cause motor incoordination, but a USV deficit persisted. These findings demonstrate that cerebellar suppression of vocalizations cannot be explained by gross motor impairment alone, and instead implicate downstream circuits capable of selectively gating communication.
Cerebellar-periaqueductal gray circuit drives vocal suppression
We next considered regions with significant cerebellar projections that may contribute to USVs. The SCP projects widely throughout the brain, but we focused on areas likely to mediate USVs independent of gross motor disruption. Of the potential targets, the periaqueductal gray (PAG) was chosen due to its well-known role as a vocal control center in mammals (34–36), which is densely innervated by the cerebellum (25). Targeting the PAG may eliminate the optogenetically induced motor symptoms if it is specialized for cognitive/affective states or more specific vocal control mechanisms. Indeed, targeting cerebellar excitatory terminals within the PAG led to a substantial reduction in USVs with no apparent motor phenotypes (Fig. 2, Fig. 3A). Interestingly, the PAG was the only region targeted that did not display a tremor at 10 Hz (Fig. S2), suggesting that the PAG is a significant target of cerebellar output influencing USVs and not gross motor dysfunction. Both the SCP and the PAG stimulated animals had similar overall locomotion compared to control (no ChR2 expression) animals (Fig. S4).Optogenetic manipulations of cerebellar projections within the PAG led to a similar reduction seen in a previous study that directly manipulated PAG neurons (36). Therefore, projections from the cerebellum are sufficient to drive a reduction in USVs that resembles direct PAG modulation.
Figure 3. Cerebellar vocalization circuit map identifies a cerebellar–periaqueductal gray pathway that suppresses vocalization and encodes negative valence.
(A) Summary circuit map of cerebellar regions where stimulation significantly suppressed ultrasonic vocalizations. Regions are indicated based on optogenetic targeting and behavioral outcomes summarized across experiments (see Fig. 2). Representative histological validation shows optic fiber placement and channelrhodopsin expression (green) in cerebellar projections terminating in the periaqueductal gray (PAG), along with induction of c-Fos (magenta) following PAG stimulation, confirming downstream activation of PAG neurons (Fig. S3) and general activity across the midbrain. I = interposed, F = fastigial, SCP = superior cerebellar peduncle, PAG = periaqueductal gray, Crus I/II = Purkinje cell fiber tract
(a’) Direct stimulation of cerebellar excitatory projections within the PAG reduced vocal output. Optogenetic stimulation using laser pulses across frequencies decreased the number of ultrasonic vocalizations produced during the stimulation epoch (left) and reduced the duration of individual calls (right) relative to baseline. Each dot represents an individual animal; bars indicate mean ± SD. One-way ANOVA did not detect a significant main effect of stimulation frequency on USV number (F(4,30) = 2.184, p = 0.0948) or call duration (F(4,28) = 2.619, p = 0.0562). However, planned post hoc comparisons using Dunnett’s multiple comparisons test revealed a significant reduction at 50 Hz relative to baseline for USV number (adjusted p = 0.0209) and call duration (adjusted p = 0.0141).
(B) Social interaction metrics during PAG stimulation. Mean center-to-center (left), nose-to-nose (middle), and male nose-to-female tail base (right) distances are shown for the stimulation epoch (0–30 s; blue bar) and the subsequent post-stimulation epoch (30–60 s). One-way ANOVA results 0–30 s: center-to-center F(4,30) = 13.86, p < 0.0001, 50 Hz Dunnett’s p < 0.0001; nose-to-nose F(4,30) = 5.539, p = 0.0018, 50 Hz Dunnett’s p = 0.0001; nose-to-tail base F(4,30) = 6.625, p = 0.0006, 50 Hz Dunnett’s p = 0.0008. 30–60 s: center-to-center F(4,30) = 13.86, p = 0.0005, 50 Hz Dunnett’s p = 0.0001; nose-to-nose F(4,30) = 4.510, p = 0.0057, 50 Hz Dunnett’s p = 0.0001; nose-to-tail base F(4,30) = 5.533, p = 0.0019, 50 Hz Dunnett’s p = 0.0010. Error bars indicate mean ± SD; individual points represent animals (N = 7).
(b’) Real-time place preference (RTPP) assay demonstrates that activation of excitatory cerebellar projections to the PAG carries negative valence. When 50 Hz optogenetic stimulation was triggered upon entry into one side of the chamber, animals spent significantly less cumulative time in the stimulation-paired zone (left), traveled less total distance (middle), and made fewer entries into the stimulation zone (right) compared to light-off epochs. Gray bars denote non-stimulation periods; green bars denote stimulation periods. * indicates p < 0.05.
(C) Call-type composition during optogenetic stimulation across cerebellar regions and stimulation frequencies. Heatmaps depict changes in call-type proportions during the first 30 s of stimulation relative to baseline. For each animal, the proportion of each call type was calculated during baseline and stimulation epochs, and the difference (Δ proportion) was summarized across animals. Call types are shown along the y-axis and cerebellar regions along the x-axis, grouped by stimulation frequency (10, 20, 30, and 50 Hz). No call-type category survived false discovery rate (FDR) correction across regions or frequencies; however, changes were directionally consistent across frequencies and regions, with a bias toward simpler call types, particularly short and up-frequency–modulated calls. Red region labels indicate excitatory cells, green are Purkinje cells and blue are inhibitory cells.
Having established the PAG as a key downstream mediator of vocal suppression, we summarize the overall regions that led to a silencing or reduction (Fig. 2, Fig. 3A). Bilateral optic implants to each cerebellar nuclei revealed that excitatory cells within the interposed nucleus were a primary driver of the silencing effect. In fact, neither the dentate nucleus nor the fastigial nucleus excitatory cells could drive the reduction. Since both excitatory cells of the interposed nucleus and the projections to those cells by Purkinje cells could silence USVs, this implicated the paravermis as a likely candidate, as the majority of interposed projections originate there. Most projections to the fastigial nucleus originate in the vermis, so lobules of the vermis may contribute as well (as seen with VI), with lateral lobules less likely since the dentate (lateral) nucleus did not drive USV changes targeting either excitatory nuclei cells or the Purkinje cell projections to them. The Crus I and II projection bundle could silence USVs at every frequency, as did the excitatory SCP output. The SCP excitatory projections terminating in the PAG reduced/silenced USVs and increased C-Fos across the midbrain after stimulation (quantification in Fig. S5).
Because PAG activation influenced both vocal output and social behavior, we next examined whether these effects reflected changes in affective state. When targeting both the SCP (Fig. 1D) and the PAG (Fig. 3B), males routinely disengaged from social interactions, measured by increased distance between the mice and was directly observable in the videos. The PAG has been shown to be involved in defensive/aversive behaviors (37) and projections from the cerebellum through the SCP are known to contribute to defensive behaviors driven by the PAG (38, 39). Therefore, we tested whether, in addition to affecting USV properties (Fig. 3a’), activating cerebellar excitatory projections in the PAG was an aversive stimulus that could lead to the observed social disengagement, particularly because no motor impairments were observed. Using a real time place preference (RTPP) assay we found that when 50 Hz stimulation to the PAG was coupled to half of an arena, mice displayed a robust aversion to this stimulation and quickly avoided the stimulation side of the box (Movie S6), leading to less overall distance traveled and decreased entries to the stimulation side (Fig. 3b’). As the mice spent roughly equal time in each side of the box without light stimulation, these data indicate that cerebellar excitatory projections to the PAG can elicit negative valence, and may contribute to both avoidance behaviors during social interaction paradigms, and inter-animal distance (Fig. 3B).
Finally, stimulation of the PAG resulted in shorter duration calls (Fig. 1a’), which led us to expect that overall vocal repertoire could shift depending on the region manipulated. Indeed, cerebellar stimulation of specific cell types across multiple regions was found to induce a marked simplification of the vocal repertoire, defined by a relative enrichment of short and up-frequency modulation call types and a depletion of more complex, multi-step calls (Fig. 3C). Overall, we conclude the cerebellum gates USVs by eliciting negative valence as a state shift, which leads to silencing or simplifying vocal output.
Neuromodulation restores vocalization deficits in a model with cerebellar circuit dysfunction
Collectively, our data show that altered cerebellar output to the PAG disrupts vocalizations and leads to an aversive state. Conversely, we wondered whether mice with severe cerebellar deficits could improve vocalizations with deep brain stimulation (DBS) targeted to the PAG. Our lab had previously shown that pup vocalizations were reduced when excitatory signaling is abolished from climbing fibers to Purkinje cells from birth (40), a model with persistent severe motor incoordination resembling dystonia (Fig. 4A-C) that could improve with DBS targeted to the interpose nucleus (41). For this, we first determined typical vocal behaviors across the lifespan by recording pup vocalizations in a maternal separation paradigm at postnatal day 0, 4, 7, 14, and 21 (Fig. 4b’). At P0, there was no significant difference between conditional knockout mice (CSKO, Ptf1acre; Vglut2fl/fl) and their littermates, reflecting the feature that climbing fibers are just beginning to contact Purkinje cell somas with immature synapses. At P4, when climbing fibers are capable of eliciting EPSCs at Purkinje cell somas, USV reduction was apparent, though the pups could still produce USVs, which persisted at P7 (replicating our previous report). However, while control pups began to reduce the number of USVs generated, the CSKO mice showed more USVs at P14, potentially reflecting a delay in development as the climbing fibers move from soma to Purkinje cell dendrites to form their mature connections (42). By P21, both control and CSKO pups do not elicit USVs after maternal separation, showing that USV’s cease at an appropriate time point of development. This established that mice are physically capable of generating USVs during development and that the timeline of their disruption correlates with specific cerebellar functional circuit establishment.
Figure 4. Neuromodulation of the periaqueductal gray (PAG) restores vocalization deficits in a complex spike knockout mouse model.
(A) Schematic illustrating elimination of vesicular glutamate transporter 2 (Vglut2) selectively from the olivocerebellar pathway in Ptf1acre/+; Vglut2fl/fl mice (complex spike knockout; CSKO). Loss of Vglut2 abolishes fast glutamatergic neurotransmission from inferior olive climbing fibers onto Purkinje cells and cerebellar nuclei neurons, eliminating complex spike activity in Purkinje cells.
(B–C) CSKO mice exhibit a pronounced dystonic motor phenotype beginning in early postnatal development (P7) that persists into adulthood.
(b’) Developmental timeline of ultrasonic vocalizations (USVs) in dystonia pups. A two-way ANOVA revealed a significant interaction between age and genotype on pup ultrasonic vocalization (USV) production (F(4,242) = 3.076, p = 0.0170), indicating that the effect of complex spike knockout on vocal output depends on developmental stage. There was a significant main effect of age (F(4,242) = 15.76, p < 0.0001), reflecting robust developmental changes in USV production. In contrast, there was no significant main effect of genotype when averaged across ages (F(1,242) = 2.965, p = 0.0864), consistent with a developmentally restricted rather than global genotype effect. Post hoc simple-effects comparisons between control and dystonia pups at each age using Šídák correction did not identify any individual age at which the groups differed significantly (all adjusted p > 0.05). However, given the known developmental timeline of climbing fiber–Purkinje cell circuit establishment, we performed planned comparisons at specific early postnatal ages. These analyses revealed significant reductions in USV production in dystonia pups at P4 (p = 0.010, N = 24 control, 25 CSKO) and P7 (p = 0.020, N = 31 control, 28 CSKO) with unpaired t-tests.
(c’) In adulthood, CSKO males exhibit a marked reduction in USV production during social interaction compared with control mice. Baseline USV output in CSKO males was significantly lower than that of control animals (unpaired t-test, p = 0.0077) and remained significantly reduced after sham DBS experiments with repeated exposure to novel females (p = 0.0085). N = 4 mice. In a separate cohort of males that showed reductions at baseline (p = 0.0002), deep brain stimulation (DBS) of the PAG significantly increased USV production in CSKO animals (paired t-test, p = 0.0080), restoring vocal output toward control levels. N = 8 mice. Gray bars indicate control mice; orange bars indicate CSKO mice. Points represent individual animals; bars show mean ± SD.
(D) Schematic of deep brain stimulation targeting the periaqueductal gray (PAG). Biphasic stimulation was delivered at 130 Hz (60 μs pulse width, 30 μA) for 1 hour per day across 5 consecutive days.
(E) Call-type composition analysis shows that short calls are disproportionately represented in CSKO mice at baseline, consistent with a restricted vocal repertoire. Following PAG DBS, call-type distributions shift toward those observed in control animals, driven primarily by a reduction in the proportion of short calls (paired t-test on short-call proportion, p = 0.0010). N = 4 control, 8 CSKO
(F) Representative spectrograms from the same CSKO animal before DBS and after the fifth day of PAG DBS illustrate restoration of call diversity and increased vocal output.
(G) Despite robust rescue of vocalization deficits, dystonic motor symptoms, quantified using a standardized dystonia rating scale, are not significantly improved by PAG DBS. Illustrations adapted from (42, 57).
We next tested whether severely dystonic CSKO mice have disrupted USVs in adulthood. To give the best chance of the male mice vocalizing, each CSKO mouse utilized was experienced, in that they produced at least one litter and were co-housed with a female for several weeks. They were then singly housed, implanted with DBS electrodes to the PAG, and allowed at least one week of recovery before the social interaction paradigm described in Fig. 1B was initiated. Baseline USVs of control vs CSKO adults were recorded over three days, and we report the third trial for each animal as the baseline number. CSKO mice exhibited limited vocalizations (Fig. 4c’), and this deficit was not improved by either sham DBS (no electrical stimulus delivered) or repeated exposure to novel females across five days. However, delivering 1 hour of DBS (130 Hz, 30 μA, 60 μs, Fig. 4D) to the PAG for five days was able to rescue the USV deficit (measured 30 minutes into the DBS protocol). Unlike our previous observation when targeting the interposed nucleus with DBS (41), dystonic motor symptoms were not rescued (Fig. 4G). The call types of CSKO adults were dominated by short calls before DBS, while after DBS the full range of typical call categories were observed (Fig. 4E-F).
Given that the main phenotype observed in the CSKO mouse is the elimination of complex spikes from Purkinje cells, this demonstrates that cerebellar signaling to downstream regions during postnatal and adolescent development may be important for driving social vocalizations in adulthood. Before climbing fiber circuitry is established (P0), pups generate typical USVs, suggesting that the cerebellum reshapes vocal behavior across development, which was recently demonstrated by assessing USV spectral properties in this dystonic model (43). Even though complex spikes are still eliminated in adulthood after DBS, which significantly disrupts cerebellar function and motor coordination, DBS to the downstream PAG could “bypass” the cerebellar dysfunction and promote USV generation in adults even with continued severe motor incoordination. This suggests that when cerebellar dysfunction is present, neuromodulation targeted to impactful downstream areas for specific behaviors may serve to rescue select behavioral deficits, and after critical periods of development.
Discussion
We define a causal cerebellar–midbrain circuit that directly gates social communication, bridging a longstanding gap between motor control and higher-order behavior through the mapping of behaviors to specific cerebellar structures and cell types to distinct behavioral outputs. Optogenetic manipulation of excitatory cerebellar output robustly suppressed vocalizations, a finding with direct relevance to human clinical observations. In pediatric patients, surgical disruption of the SCP during posterior fossa tumor resection is frequently associated with cerebellar mutism syndrome, characterized by mutism, social withdrawal, and motor deficits (44). These features bear a striking resemblance to the phenotypes observed following SCP manipulation in mice. By identifying projections from the cerebellum to the PAG, we reveal a subcortical relay through which cerebellar output may influence affective and social behaviors. Consistent with this, recent imaging studies have linked cerebellar–PAG dysfunction to mutism in human patients (45), highlighting the overlapping functional capacities of this circuit across species.
Overall, low frequency stimulation reliably entrains motor loops, as a 10 Hz stimulation drove robust tremor across all cerebellar regions and cell types targeted. Intermediate frequencies (20 – 30 Hz) more clearly decouples motor vs affective circuits, and high frequency stimulation (50 Hz) tended to prevent vocal output and simultaneously cause severe motor incoordination. Combined, it appears that, under these conditions, the cerebellum does not encode vocal motor patterns per se, but rather regulates the permissiveness and timing of vocal output by modulating midbrain affective/vocal gating nodes (PAG), thereby biasing whether vocalizations are permitted, suppressed, or simplified.
Several observations support this interpretation. First, vocalizations were often preserved during the most severe motor disturbances, including tremor, ataxia and dystonia, depending on the region targeted. Second, animals maintained physical social interaction and produced USVs even under conditions of substantial motor disruption, arguing against a simple impairment of the vocal apparatus or gross motor impairment driving differences. Third, during stimulation conditions that suppressed vocal output, animals exhibited clear behavioral avoidance despite preserved locomotion, suggesting a shift in motivational or affective state rather than a purely motor deficit. Finally, direct stimulation of cerebellar projections within the PAG reduced, but did not abolish vocalizations and altered their acoustic features while preserving broad call categories, further indicating that vocal suppression is not explained by an inability to generate motor output.
Together, these findings indicate that cerebellar output can suppress vocal behavior independently of overt motor impairment. When vocalizations were reduced or absent, particularly in the PAG, this was accompanied by behavioral disengagement rather than generalized motor incapacity, supporting a model in which cerebellar signaling influences the decision to vocalize rather than the mechanics of vocal production within these behavioral paradigms. An important question that emerges is what information is conveyed by the cerebellar–PAG pathway. Real-time place preference assays suggest that activation of this pathway is aversive, consistent with growing evidence that cerebellar outputs engage PAG circuits involved in defensive and fear-related behaviors (39, 46). These findings raise the possibility that cerebellar activity modulates vocal output by influencing internal state, biasing behavior away from social engagement. While USVs ultimately require motor execution, our data suggest that cerebellar contributions occur upstream of brainstem motor circuits, potentially regulating whether vocalization is initiated. Consistent with this, animals retained the capacity for locomotion but exhibited a shift from active pursuit to avoidance, supporting a reorganization of behavior rather than a loss of motor function. Although these results are best explained by a change in behavioral/brain state, we cannot exclude more subtle influences on motor control of the vocal apparatus.
Remarkably, DBS of the PAG restored vocal output in mice with genetically induced cerebellar dysfunction, even when severe dystonia persisted. This finding demonstrates that targeted neuromodulation can bypass impaired cerebellar computations to reinstate specific behavioral functions, a principle with broad translational potential for cerebellar ataxias, dystonias, and neurodevelopmental disorders that involve communication deficits. Determining the precise neural mechanisms that reinstate vocal output without restoring gross motor coordination will be important for determining how to leverage neuromodulation in promoting vocal output. Overall, disrupting key nodes in the vocalization network compresses output toward brief, simplified calls: dystonic mice exhibit short-call dominance that is reversed by DBS, PAG manipulation produces significantly shorter calls, and optogenetic stimulation across many regions shows a consistent trend toward short and up-frequency modulation calls, suggesting that overall disruption to cerebellar signaling biases vocal output toward a simpler repertoire.
Our work frames the cerebellum as an integrative hub that regulates the temporal precision of social signaling, linking motor, cognitive, and affective domains through overlapping, yet distinct circuitry. Frequency-specific activation patterns separately drive communication and movement behaviors, underscoring the importance of rate codes as behavioral gates. Functional mapping reveals dedicated locations where phenotypes originate, highlighting functional specificity at the level of the cerebellar cortex and nuclei driven by defined cell types. The restoration of vocal behavior through downstream stimulation further suggests that communication deficits resulting from cerebellar pathology may be therapeutically tractable even beyond developmental critical periods. Together, our results begin to illuminate how the cerebellum contributes to flexible coordination of complex behaviors, advancing a unifying framework for cerebellar function across domains of movement, affect, and communication.
Materials and Methods
Animals
Mice were housed with a 20:00–6:00 dark cycle within a level 3, AALAS-certified vivarium. The Institutional Animal Care and Use Committee of Baylor College of Medicine reviewed and approved all experimental animal procedures under protocol AN-5996. The following transgenic mouse lines on a C57BL/6J background were utilized: Atoh1Cre (47), Pcp2Cre (also known as L7Cre) (48), Ptf1aCre (gift from Dr. Chris Wright) (49), Slc17a6tm1Lowl/J (also known as Vglut2flox; JAX:012898) (50), Ai32 (Rosa26lsl-ChR2-eYFP, JAX:012569) (51) and Ai65 (Rosalsl-tdTomato, JAX:007908) (52). The conditional knock-out mice that resulted in dystonia were generated by crossing Ptf1aCre;Vglut2fx/fx with homozygote Vglut2fx/fx mice (41). Heterozygous expression of Cre in males was confirmed through ear punches and genotyping of each animal, then confirmed by positive expression of ChR2/tdTomato in each cell type when relevant. Due to the behavioral paradigm, we analyzed males (2–6 months old) only in this study, due to their reliability to produce USVs in response to a female and overrepresentation of developmental speech deficits in humans.
Clearing and light sheet imaging
Whole brains from Atoh1Cre; Rosalsl-tdTomato or Pcp2Cre; Rosalsl-tdTomato were first cleared in Cubic-L at 37° for delipidation (50–52) for one-two weeks (changing solution every few days), rinsed with distilled water then placed in EZ Clear (53) for two days until imaging with a Zeiss Light Sheet Z1 at a refractive index of 1.52 with a 5x objective. Image tiles were stitched together with Stitchy and visualized with Imaris.
Surgery
The same surgical technique was performed to implant 200 μm diameter optical fibers (typically bilateral) (ThorLabs, R-FOC-L200C-39NA) or two custom 75 mm twisted platinum DBS electrodes (PlasticsOne, Roanoke, VA, USA; 8IMS3039SPCE) into the various brain regions (Fig. S6) calculated through Pinpoint (57). Preemptive analgesics (buprenorphine, 1 mg/kg subcutaneous (SC), and meloxicam, 5 mg/kg SC) were delivered, with additional meloxicam doses the following 3 days as part of the standard post-operative analgesic protocol. Mice were continuously anesthetized with isoflurane while secured in a stereotaxic device (David Kopf Instruments), fur was removed with Nair, the skin was disinfected with alternating alcohol and betadine wipes, a midline incision was made and the skull was exposed and lightly etched with a blade. Small craniotomies were made at the target coordinates, the optical probes or DBS electrodes were slowly advanced into the brain, antibiotic ointment was added to cover the craniotomy, C&B Metabond (S380) was applied and allowed to harden then dental cement (A-M systems, #525000 and #526000) was applied over the top. Behavioral experiments started typically one week or more after the day of surgery. After behavioral experiments, brains were perfused, sectioned with a vibratome and targeting positions confirmed with fluorescent nissl staining and/or endogenous ChR2 expression on a Zeiss Axio Zoom. Only mice with confirmed appropriate expression and implant targeting (directly above the region of interest) were included in the dataset.
Optogenetics and behavioral assay
Male mice with optical implants were attached to a bifurcated bundle (Thor Labs, BFYL2LS01) and fiber coupled LED (M470F4) connected to a Thor LED driver (LEDD1B) or Crystalaser (CL473) then allowed to habituate in the sound attenuation chamber for ten minutes. An Avisoft condenser ultrasound microphone (CM16/CMPA #40011) hovered 27 cm above the 27 cm wide Plexiglass animal chamber for ultrasonic vocalization (USV) recordings with UltraSoundGate 116B (#51161) and Avisoft-RECORDER USGH software. The Avisoft software was initiated simultaneously with Spike2 software for video behavior recordings and time-stamped delivery of stimulation pulses via the LED driver through a CED Power1401 data acquisition interface (CED, Cambridge, UK). All stimulation paradigms were delivered at full power (output 2.1 mW) with square pulses of 10 ms, with varying times between pulses (typically 10, 20, 30 or 50 Hz).
For the core behavioral paradigm used across all regions, baseline behavior of the male alone in the chamber was video recorded for one minute, the optogenetic stimulation paradigm was introduced and proceeded for 30 seconds, the female was introduced so that it fully entered the chamber through a side port by 90 seconds, then the optogenetic stimulation was terminated at 120 seconds (30 second stimulation alone, 30 second stimulation with female for a total of one minute stimulation) and the mice were allowed to freely interact for a total of 5 minutes. Variations to this protocol are indicated in figures/legends, such as when the stimulation was introduced after the female entered to abruptly terminate USVs when targeting the SCP (Fig. 3C orange). Each session was conducted on separate days with novel females, generally increasing frequency on subsequent days, but randomizing frequencies resulted in the same phenotype. Control males vocalized consistently across days with no effect from the stimulation (Fig. S1). Males did not vocalize when alone in the chamber (before female entry) and were singly housed before and between experiments.
Deep Brain Stimulation
Male mice implanted with DBS electrodes (targeting the PAG; AP −4.7 mm, ML ±0.7 mm, DV −2.21 mm relative to bregma) were connected to the deep brain stimulation device (Multi Channel Systems, #STG4002) to deliver a biphasic 130 Hz stimulus with 60 μs duration at 30 μA for 1 h per day for 5 days (41). A female was introduced, as described in the optogenetic behavioral methods, with one full assay occurring 30 minutes into the one hour of DBS stimulation. Videos during the assay were examined for dystonic postures. All dystonic males were co-housed with females for several weeks and produced at least one litter before being single housed. Baseline USVs were recorded on the third trial (to allow the mice to have experience with the assay) while tethered to the DBS system, but no pulses were delivered (as with the sham DBS trials). Each trial was on a separate, consecutive day.
Dystonia rating scale (55) from 0 to 5 was used wherein 0 = no motor abnormalities; 1 = slightly slowed or abnormal motor behavior, no dystonia; 2 = mild impairment, sometimes limited ambulation, dystonic postures when disturbed; 3 = moderate impairment, frequent spontaneous dystonic postures; 4 = severe impairment, sustained dystonic postures and limited ambulation; and 5 = prolonged immobility in dystonic postures.
Tremor, ataxia and dystonia behavior
Motor behavior was visually identified from videos taken during the behavioral assay (Movie S4) and binary presence of tremor, ataxia and dystonia were manually recorded by at least two expert observers. To confirm the presence of a tremor induced by the optogenetic stimulation and to verify the frequency of tremor induced with different stimulation paradigms, the mice were placed in a custom-made tremor monitor as described previously (56). Frequency and amplitude of the tremor were measured before, during and after optogenetic stimulation via optical patch cables connected above the tremor monitor, which is a Plexiglass box suspended by elastic chords at each corner with an accelerometer mounted to the box to detect subtle shifts in box movement. Mice were gently encouraged to ambulate a similar amount during each recording period for consistent baseline movement of the box. The output from the tremor monitor was amplified and lowpass filtered at 5 kHz (Brownlee Precision, Model 410) before being digitized by a CED board and analyzed using Spike2 Scripts. Recordings are centered on zero, downsampled using the Spike2 interpolate function and a power spectrum analysis with a Hanning window was performed for each recording window of 2 minutes (before, during and after stimulation). The peak power was calculated as the maximum power within the frequency band window. The mice are allowed to freely ambulate within the box, and given 2 minutes to acclimate before sequential 2 minute recording periods.
Machine learning software analysis
USVs were recorded using Avisoft software and exported as .wav files for downstream analysis. USV files were processed in MATLAB (MathWorks) using VocalMat, an automated computer vision–based pipeline for detection and classification of ultrasonic calls (57). VocalMat identifies individual vocalizations based on spectral features and temporal structure then assigns calls to predefined categories using supervised machine learning classifiers trained on annotated datasets. Quantitative features describing call timing, duration, and spectral content were exported for each recording. Custom Python scripts were used to aggregate vocalization metrics across trials and animals, and summary data were visualized and statistically analyzed in GraphPad Prism (version 10.3.1 for Windows).
To quantify social interaction dynamics during male–female encounters, pose estimation was performed using Social LEAP Estimates Animal Poses (SLEAP) (58). A custom multi-animal bottom-up deep learning model was trained to detect and localize individual body landmarks (nose, ears, neck, mid-torso, tail base, tail midpoint, and tail tip) for each animal in video frames (Movie S3). The bottom-up architecture was selected because it provided robust performance across variable postures and partial occlusions typical of freely interacting animals. Model training produced confidence maps for individual body parts and part-affinity fields used to associate detected keypoints into distinct animal instances.
For inference, pose predictions were generated across entire video sequences using a multi-animal bottom-up inference pipeline with a maximum of two tracked instances. Identity tracking across frames was performed using optical-flow–based instance matching with greedy assignment, constrained by instance-level similarity metrics. Tracking incorporated a temporal window of five frames and optional Kalman filter–based smoothing after an initial warm-up period to improve cross-frame identity consistency. Single-frame track discontinuities were automatically reconnected during post-processing. Output pose trajectories were used for downstream quantification of movement metrics and between animal spatial relationships using custom Python analysis scripts.
Real-time place preference assay
Atoh1-Cre mice expressing ChR2 with optic fibers targeting the periaqueductal gray (PAG) were assessed after USV assays. Both unilateral (n = 2) and bilateral (n = 5) fiber implant configurations were used. Mice were tested in a rectangular open-field arena divided into two spatially distinct halves. On the first day, animals were allowed to freely explore the arena for 30 minutes in the absence of photostimulation to establish baseline place preference. On a subsequent testing day, animals were returned to the same arena, and photostimulation (473 nm light, 50 Hz, 10 ms pulses) was delivered when the animal occupied one predefined half of the arena. The stimulated side was randomized across animals. No stimulation was delivered when animals occupied the opposite half of the arena.
Animal position and locomotor behavior were continuously tracked using EthoVision software (Noldus), which was used to quantify time spent in each half of the arena and to assess spatial occupancy. All behavioral metrics were computed from EthoVision outputs and visualized with Graph Pad Prism. Place preference was assessed by comparing the proportion of time spent in the stimulated versus unstimulated zones across sessions.
At the conclusion of the assay, mice were stimulated for 30 seconds on and 90 seconds off for 10 minutes at 50 Hz to induce c-Fos expression. Mice were perfused one hour later, brains were cryoprotected with 30% sucrose, sectioned on a cryostat and immunostained against GFP (Invitrogen A10262, 1:1000, Secondary: Invitrogen 11039 1:500) to detect ChR2 expression and c-Fos (Abcam ab190289, 1:1000, Secondary: Invitrogen A-21245 1:500). Sections containing the PAG implant were compared to control mice without stimulation to detect upregulation in c-Fos (Fig. S4).
Statistics
All statistical analyses were performed using GraphPad Prism (version 10.3.1 for Windows) and custom Python scripts for data aggregation and preprocessing. No statistical methods were used to predetermine sample sizes; sample sizes were chosen based on prior studies using comparable behavioral and optogenetic paradigms. Data distribution was not formally tested for normality due to modest sample sizes; parametric tests were selected based on experimental design and robustness to minor deviations from normality. All tests were two-tailed and an alpha level of 0.05 was used to determine statistical significance. Exact p values, test statistics, and degrees of freedom are reported in figure legends where applicable.
Ultrasonic vocalizations
USVs were quantified as total call number within defined time windows (typically the first 30 s of stimulation unless otherwise stated), call duration, and call-type composition. Comparisons involving more than two stimulation conditions (e.g., baseline vs. 10, 20, 30, and 50 Hz stimulation) were analyzed using ordinary one-way ANOVA. When a significant main effect was detected, Dunnett’s multiple comparisons test was used to compare each stimulation condition to baseline while controlling the familywise error rate. In cases where the omnibus ANOVA did not reach significance, post hoc comparisons were still reported when prespecified comparisons to baseline were performed.
For developmental analyses of pup USVs across postnatal ages (P0–P21), one-way ANOVA was used to assess effects of age within genotype, with genotype comparisons performed at individual time points using unpaired two-tailed t tests where appropriate.
Call-type composition analyses
Call-type proportions were analyzed using centered log-ratio (CLR) transformation to account for the compositional nature of vocal repertoire data. For each animal, CLR differences between stimulation and baseline were computed for the first 100 calls following vocal onset. Effect sizes were summarized using paired-sample Cohen’s d (mean ΔCLR / SD across animals). Statistical significance across call categories was assessed using paired comparisons with false discovery rate (FDR) correction (Benjamini–Hochberg procedure). Heatmaps display mean ΔCLR values across animals. For region exploration experiments (Fig. 3A), the proportion of call types were compared to baseline and averaged across animals for each region within the 30 s stimulation window.
Social interaction metrics
Social interaction was quantified using SLEAP-derived pose tracking to compute Euclidean distances between animals (nose-to-nose, nose-to-tail base, and center-to-center distances, Movie S3). For optogenetic experiments, distances were averaged over defined epochs (0–30 s stimulation and 30–60 s post-stimulation). Effects of stimulation frequency were assessed using ordinary one-way ANOVA followed by Dunnett’s multiple comparisons test versus baseline. Control littermate datasets were analyzed in parallel using identical statistical procedures.
Tremor analysis
Tremor power spectra obtained from the accelerometer-based tremor monitor were analyzed by calculating the maximum power within the stimulation frequency band for baseline and stimulation periods. Paired two-tailed t tests were used to compare baseline and stimulation power for each frequency and targeting condition. For experiments involving multiple stimulation frequencies, each frequency was treated as an independent comparison relative to baseline.
Motor phenotype scoring
Binary presence or absence of tremor, ataxia, and dystonia was recorded by at least two expert observers blinded to stimulation condition. For dystonia severity ratings (0–5 scale), baseline and post-DBS values were compared using paired two-tailed t tests. Because the dystonia scale is ordinal, results are interpreted conservatively and reported alongside individual data points.
Real-time place preference (RTPP)
RTPP behavior was analyzed by comparing time spent in stimulated versus unstimulated zones, total distance traveled, and number of zone entries. Paired two-tailed t tests were used to compare light-OFF and light-ON conditions within animals. Unilateral and bilateral stimulation cohorts were pooled after confirming no qualitative differences in behavioral directionality.
Data presentation
Bar plots depict mean ± standard deviation unless otherwise indicated. Individual animals are represented as dots. Heatmaps represent averaged values across animals. Exact N values for each analysis are reported in the corresponding figure panels or legends. No data points were excluded unless predefined criteria were not met (e.g., insufficient number of USVs for call-type analyses).
Supplementary Material
Acknowledgments:
We sincerely thank all members of the Sillitoe lab for useful discussions and technical advice, particularly Dr. Amanda M. Brown, Dr. Linda H. Kim, Sarah Donofrio and Dr. Jason S. Gill.
Funding:
Simons Foundation SFI-AN-AR-Fellow-00012389 (CB)
Department of Defense PR22135 (BRA)
National Institute of Diabetes and Digestive and Kidney Diseases R01DK138518 (BRA)
National Institute of Diabetes and Digestive and Kidney Diseases R01DK109934 (BRA)
United States Department of Agriculture CRIS 1-58-3092-0-001 (BRA)
National Institute of Neurological Disorders and Stroke R01NS119301 (RVS)
National Institute of Neurological Disorders and Stroke R01NS127435 (RVS)
Eunice Kennedy Shriver National Institute of Child Health and Human Development of the National Institutes of Health P50HD103555 for use of the Cell and Tissue Pathogenesis Core (BCM IDDRC)
Baylor College of Medicine, Texas Children’s Hospital (RVS)
Ting Tsung and Wei Fong Chao Foundation (RVS)
Footnotes
Competing interests: Authors declare that they have no competing interests.
Data and materials availability:
All data are available in the main text or the supplementary materials.
References
- 1.Tuckute G., Kanwisher N., Fedorenko E., Language in Brains, Minds, and Machines. Annual Review of Neuroscience 47, 277–301 (2024). [Google Scholar]
- 2.Arriaga G., Jarvis E. D., Mouse vocal communication system: Are ultrasounds learned or innate? Brain and Language 124, 96–116 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.LeBel A., D’Mello A. M., A seat at the (language) table: incorporating the cerebellum into frameworks for language processing. Current Opinion in Behavioral Sciences 53, 101310 (2023). [Google Scholar]
- 4.LeBel A., Jain S., Huth A. G., Voxelwise Encoding Models Show That Cerebellar Language Representations Are Highly Conceptual. J Neurosci 41, 10341–10355 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Jobson K. R., Hoffman L. J., Metoki A., Popal H., Dick A. S., Reilly J., Olson I. R., Language and the Cerebellum: Structural Connectivity to the Eloquent Brain. Neurobiology of Language 5, 652–675 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Mariën P., Ackermann H., Adamaszek M., Barwood C. H. S., Beaton A., Desmond J., De Witte E., Fawcett A. J., Hertrich I., Küper M., Leggio M., Marvel C., Molinari M., Murdoch B. E., Nicolson R. I., Schmahmann J. D., Stoodley C. J., Thürling M., Timmann D., Wouters E., Ziegler W., Consensus Paper: Language and the Cerebellum: an Ongoing Enigma. Cerebellum 13, 386–410 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Mechtenberg H., Heffner C. C., Myers E. B., Guediche S., The cerebellum is sensitive to the lexical properties of words during spoken language comprehension. Neurobiology of Language, 1–44 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Casto C., Poliak M., Tuckute G., Small H., Sherlock P., Wolna A., Lipkin B., D’Mello A. M., Fedorenko E., The cerebellar components of the human language network. Neuron 0 (2026). [Google Scholar]
- 9.Kim L. H., Heck D. H., Sillitoe R. V., Cerebellar Functions Beyond Movement and Learning. Annual Review of Neuroscience 47, 145–166 (2024). [Google Scholar]
- 10.Koziol L. F., Budding D., Andreasen N., D’Arrigo S., Bulgheroni S., Imamizu H., Ito M., Manto M., Marvel C., Parker K., Pezzulo G., Ramnani N., Riva D., Schmahmann J., Vandervert L., Yamazaki T., Consensus paper: the cerebellum’s role in movement and cognition. Cerebellum (London, England) 13, 151–77 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Van Overwalle F., Manto M., Cattaneo Z., Clausi S., Ferrari C., Gabrieli J. D. E., Guell X., Heleven E., Lupo M., Ma Q., Michelutti M., Olivito G., Pu M., Rice L. C., Schmahmann J. D., Siciliano L., Sokolov A. A., Stoodley C. J., van Dun K., Vandervert L., Leggio M., Consensus Paper: Cerebellum and Social Cognition. Cerebellum 19, 833–868 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Schmahmann J. D., Guell X., Stoodley C. J., Halko M. A., The Theory and Neuroscience of Cerebellar Cognition. Annual Review of Neuroscience 42, 337–364 (2019). [Google Scholar]
- 13.Lyu W., Thung K.-H., Huynh K. M., Wang L., Lin W., Ahmad S., Yap P.-T., Functional development of the human cerebellum from birth to age five. Nat Commun 16, 6350 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Gaiser C., van der Vliet R., de Boer A. A. A., Donchin O., Berthet P., Devenyi G. A., Mallar Chakravarty M., Diedrichsen J., Marquand A. F., Frens M. A., Muetzel R. L., Population-wide cerebellar growth models of children and adolescents. Nat Commun 15, 2351 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Guell X., Schmahmann J., Cerebellar Functional Anatomy: a Didactic Summary Based on Human fMRI Evidence. Cerebellum 19, 1–5 (2020). [DOI] [PubMed] [Google Scholar]
- 16.Nettekoven C., Zhi D., Shahshahani L., Pinho A. L., Saadon-Grosman N., Buckner R. L., Diedrichsen J., A hierarchical atlas of the human cerebellum for functional precision mapping. Nat Commun 15, 8376 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Schmahmann J. D., Disorders of the Cerebellum: Ataxia, Dysmetria of Thought, and the Cerebellar Cognitive Affective Syndrome. JNP 16, 367–378 (2004). [Google Scholar]
- 18.Hampson D. R., Blatt G. J., Autism spectrum disorders and neuropathology of the cerebellum. Frontiers in neuroscience 9, 420 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Becker E. B. E., Stoodley C. J., Autism spectrum disorder and the cerebellum. International review of neurobiology 113, 1–34 (2013). [DOI] [PubMed] [Google Scholar]
- 20.Brandenburg C., Griswold A. J., Van Booven D. J., Kilander M. B. C., Frei J. A., Nestor M. W., Dykxhoorn D. M., Pericak-Vance M. A., Blatt G. J., Transcriptomic analysis of isolated and pooled human postmortem cerebellar Purkinje cells in autism spectrum disorders. Front Genet 13, 944837 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Rogers T. D., McKimm E., Dickson P. E., Goldowitz D., Blaha C. D., Mittleman G., Is autism a disease of the cerebellum? An integration of clinical and pre-clinical research. Frontiers in systems neuroscience 7, 15 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Wang S. S.-H., Kloth A. D., Badura A., The cerebellum, sensitive periods, and autism. Neuron 83, 518–32 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Fatemi S. H., Aldinger K. A., Ashwood P., Bauman M. L., Blaha C. D., Blatt G. J., Chauhan A., Chauhan V., Dager S. R., Dickson P. E., Estes A. M., Goldowitz D., Heck D. H., Kemper T. L., King B. H., Martin L. A., Millen K. J., Mittleman G., Mosconi M. W., Persico A. M., Sweeney J. A., Webb S. J., Welsh J. P., Consensus paper: pathological role of the cerebellum in autism. Cerebellum (London, England) 11, 777–807 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Watanabe T., Kano M., Molecular and cellular mechanisms of developmental synapse elimination in the cerebellum: Involvement of autism spectrum disorder-related genes. Proc. Jpn. Acad., Ser. B 100, 508–523 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Novello M., Bosman L. W. J., De Zeeuw C. I., A Systematic Review of Direct Outputs from the Cerebellum to the Brainstem and Diencephalon in Mammals. Cerebellum, doi: 10.1007/s12311-022-01499-w (2022). [DOI] [Google Scholar]
- 26.Ben-Arie N., Bellen H. J., Armstrong D. L., McCall A. E., Gordadze P. R., Guo Q., Matzuk M. M., Zoghbi H. Y., Math1 is essential for genesis of cerebellar granule neurons. Nature 390, 169–172 (1997). [DOI] [PubMed] [Google Scholar]
- 27.Rose M. F., Ahmad K. A., Thaller C., Zoghbi H. Y., Excitatory neurons of the proprioceptive, interoceptive, and arousal hindbrain networks share a developmental requirement for Math1. Proc Natl Acad Sci U S A 106, 22462–22467 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Chabout J., Serreau P., Ey E., Bellier L., Aubin T., Bourgeron T., Granon S., Adult Male Mice Emit Context-Specific Ultrasonic Vocalizations That Are Modulated by Prior Isolation or Group Rearing Environment. PLOS ONE 7, e29401 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Warburton V. L., Sales G. D., Milligan S. R., The emission and elicitation of mouse ultrasonic vocalizations: the effects of age, sex and gonadal status. Physiol Behav 45, 41–47 (1989). [DOI] [PubMed] [Google Scholar]
- 30.Warren M. R., Spurrier M. S., Roth E. D., Neunuebel J. P., Sex differences in vocal communication of freely interacting adult mice depend upon behavioral context. PLOS ONE 13, e0204527 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Chilosi A. M., Brovedani P., Cipriani P., Casalini C., Sex differences in early language delay and in developmental language disorder. Journal of Neuroscience Research 101, 654–667 (2023). [DOI] [PubMed] [Google Scholar]
- 32.van der Heijden M. E., Brown A. M., Kizek D. J., Sillitoe R. V., Cerebellar nuclei cells produce distinct pathogenic spike signatures in mouse models of ataxia, dystonia, and tremor. eLife 12, RP91483 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Stoodley C. J., Schmahmann J. D., Functional topography of the human cerebellum. Handb Clin Neurol 154, 59–70 (2018). [DOI] [PubMed] [Google Scholar]
- 34.Sharif A., Matsumoto J., Choijiljav C., Badarch A., Setogawa T., Nishijo H., Nishimaru H., Characterization of Ultrasonic Vocalization-Modulated Neurons in Rat Motor Cortex Based on Their Activity Modulation and Axonal Projection to the Periaqueductal Gray. eNeuro 11, ENEURO.0452–23.2024 (2024). [Google Scholar]
- 35.Brudzynski S. M., Handbook of Mammalian Vocalization: An Integrative Neuroscience Approach (Academic Press, 2009). [Google Scholar]
- 36.Tschida K., Michael V., Takato J., Han B.-X., Zhao S., Sakurai K., Mooney R., Wang F., A specialized neural circuit gates social vocalizations in the mouse. Neuron 103, 459–472.e4 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Tovote P., Esposito M. S., Botta P., Chaudun F., Fadok J. P., Markovic M., Wolff S. B. E., Ramakrishnan C., Fenno L., Deisseroth K., Herry C., Arber S., Lüthi A., Midbrain circuits for defensive behaviour. Nature 534, 206–212 (2016). [DOI] [PubMed] [Google Scholar]
- 38.Lawrenson C., Paci E., Pickford J., Drake R. A., Lumb B. M., Apps R., Cerebellar modulation of memory encoding in the periaqueductal grey and fear behaviour. eLife 11, e76278 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.da Silva G. N., Seiffert N., Tovote P., Cerebellar contribution to the regulation of defensive states. Front Syst Neurosci 17, 1160083 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Van Der Heijden M. E., Gill J. S., Rey Hipolito A. G., Salazar Leon L. E., Sillitoe R. V., Quantification of Behavioral Deficits in Developing Mice With Dystonic Behaviors. Dystonia 1, 10494 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.White J. J., Sillitoe R. V., Genetic silencing of olivocerebellar synapses causes dystonia-like behaviour in mice. Nat Commun 8, 14912 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Brandenburg C., Gill J. S., Sillitoe R. V., “Olivocerebellar circuit development as a substrate for dystonia pathogenesis” in Current Topics in Developmental Biology (Elsevier, 2025; https://linkinghub.elsevier.com/retrieve/pii/S0070215325001012), p. S0070215325001012. [Google Scholar]
- 43.Fitzgerald A. L., Coello J. A., Lyon A. M., Dao B. L., van der Heijden M. E., Assessing vocal changes through spectral analyses of vocalizations in a cerebellar-specific dystonia mouse model. Dystonia 5, 15530 (2026). [Google Scholar]
- 44.Fabozzi F., Margoni S., Andreozzi B., Musci M. S., Del Baldo G., Boccuto L., Mastronuzzi A., Carai A., Cerebellar mutism syndrome: From pathophysiology to rehabilitation. Front Cell Dev Biol 10, 1082947 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.McAfee S. S., Robinson G., Gajjar A., Zhang S., Bag A. K., Raches D., Conklin H. M., Khan R. B., Scoggins M. A., Cerebellar mutism is linked to midbrain volatility and desynchronization from speech cortices. Brain 146, 4755–4765 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Urrutia Desmaison J. D., Sala R. W., Ayyaz A., Nondhalee P., Popa D., Léna C., Cerebellar control of fear learning via the cerebellar nuclei–Multiple pathways, multiple mechanisms? Front. Syst. Neurosci. 17 (2023). [Google Scholar]
- 47.Yang H., Xie X., Deng M., Chen X., Gan L., Generation and characterization of Atoh1-Cre knock-in mouse line. Genesis 48, 407–413 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Lewis P. M., Gritli-Linde A., Smeyne R., Kottmann A., McMahon A. P., Sonic hedgehog signaling is required for expansion of granule neuron precursors and patterning of the mouse cerebellum. Dev Biol 270, 393–410 (2004). [DOI] [PubMed] [Google Scholar]
- 49.Kawaguchi Y., Cooper B., Gannon M., Ray M., MacDonald R. J., Wright C. V. E., The role of the transcriptional regulator Ptf1a in converting intestinal to pancreatic progenitors. Nat Genet 32, 128–134 (2002). [DOI] [PubMed] [Google Scholar]
- 50.Tong Q., Ye C.-P., Jones J. E., Elmquist J. K., Lowell B. B., Synaptic release of GABA by AgRP neurons is required for normal regulation of energy balance. Nat Neurosci 11, 998–1000 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Madisen L., Mao T., Koch H., Zhuo J., Berenyi A., Fujisawa S., Hsu Y.-W. A., Garcia A. J., Gu X., Zanella S., Kidney J., Gu H., Mao Y., Hooks B. M., Boyden E. S., Buzsáki G., Ramirez J. M., Jones A. R., Svoboda K., Han X., Turner E. E., Zeng H., A toolbox of Cre-dependent optogenetic transgenic mice for light-induced activation and silencing. Nat Neurosci 15, 793–802 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Madisen L., Zwingman T. A., Sunkin S. M., Oh S. W., Zariwala H. A., Gu H., Ng L. L., Palmiter R. D., Hawrylycz M. J., Jones A. R., Lein E. S., Zeng H., A robust and high-throughput Cre reporting and characterization system for the whole mouse brain. Nat Neurosci 13, 133–140 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Matsumoto K., Mitani T. T., Horiguchi S. A., Kaneshiro J., Murakami T. C., Mano T., Fujishima H., Konno A., Watanabe T. M., Hirai H., Ueda H. R., Advanced CUBIC tissue clearing for whole-organ cell profiling. Nature protocols 14, 3506–3537 (2019). [DOI] [PubMed] [Google Scholar]
- 54.Blatt G. J., Brandenburg C., Poulopoulos A., India ink to 3D imaging: The legacy of Dr. Deepak “Dee” N. Pandya and his influence on generations of neuroanatomists. The Journal of comparative neurology 531, 1875–1882 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Brandenburg C., Poulopoulos A., “Sparse Labeling, Rapid Clearing, and Native Fluorescence Light Sheet Imaging in the Developing Rodent Cerebellum” in Synapse Development: Methods and Protocols, Soykan T., Poulopoulos A., Eds. (Springer US, New York, NY, 2025; 10.1007/978-1-0716-4446-1_16), pp. 263–276. [DOI] [Google Scholar]
- 56.Hsu C.-W., Cerda J., Kirk J. M., Turner W. D., Rasmussen T. L., Flores Suarez C. P., Dickinson M. E., Wythe J. D., EZ Clear for simple, rapid, and robust mouse whole organ clearing. Elife 11, e77419 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Birman D., Yang K. J., West S. J., Karsh B., Browning Y., the I. B. Laboratory, Siegle J. H., Steinmetz N. A., Pinpoint: trajectory planning for multi-probe electrophysiology and injections in an interactive web-based 3D environment. eLife 12 (2023). [Google Scholar]
- 58.Pizoli C. E., Jinnah H. A., Billingsley M. L., Hess E. J., Abnormal cerebellar signaling induces dystonia in mice. J Neurosci 22, 7825–7833 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Brown A. M., White J. J., van der Heijden M. E., Zhou J., Lin T., Sillitoe R. V., Purkinje cell misfiring generates high-amplitude action tremors that are corrected by cerebellar deep brain stimulation. Elife 9, e51928 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Fonseca A. H., Santana G. M., Bosque Ortiz G. M., Bampi S., Dietrich M. O., Analysis of ultrasonic vocalizations from mice using computer vision and machine learning. eLife 10, e59161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Pereira T. D., Tabris N., Matsliah A., Turner D. M., Li J., Ravindranath S., Papadoyannis E. S., Normand E., Deutsch D. S., Wang Z. Y., McKenzie-Smith G. C., Mitelut C. C., Castro M. D., D’Uva J., Kislin M., Sanes D. H., Kocher S. D., Wang S. S.-H., Falkner A. L., Shaevitz J. W., Murthy M., SLEAP: A deep learning system for multi-animal pose tracking. Nat Methods 19, 486–495 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data are available in the main text or the supplementary materials.




