Abstract
While motor cortical circuits contain information related to specific movement parameters1, long-range inputs also play a critical role in action execution2, 3. Thalamic projections can shape premotor activity2–6 and have been suggested7 to mediate the selection of short, stereotyped actions comprising more complex behaviors8. However, the mechanisms by which the thalamus interacts with motor cortical circuits to execute such movement sequences remain unknown. Here we find that thalamic drive engages a specific subpopulation of premotor neurons within the zebra finch song nucleus HVC (proper name) and that these inputs are critical for the progression between vocal motor elements (i.e., ‘syllables’). In vivo 2-photon imaging of thalamic axons in HVC revealed robust song-related activity, and online perturbations of thalamic function caused song to be truncated at syllable boundaries. We used thalamic stimulation to identify a sparse set of thalamically-driven neurons within HVC, representing ~15% of the premotor neurons within that network. Surprisingly, this population of putative thalamorecipient neurons is robustly active immediately preceding syllable onset, leading to the possibility that thalamic input can initiate individual song components through selectively targeting these ‘starter cells’. Our findings highlight the motor thalamus as a director of cortical dynamics in the context of an ethologically-relevant behavioral sequence.
The songbird is an advantageous model system for understanding the neural mechanisms underlying a learned complex behavior. During singing, adult male zebra finches produce a fixed sequence of typically 2–7 ‘syllables’, which represent functionally9, 10 and developmentally11 distinct motor elements. The motor thalamic nucleus Uvaeformis (Uva) has been shown to be a critical node in the vocal production pathway necessary for singing12–17. Consistent with these results, we found that vocalizations following Uva lesions (n = 6 birds) – but not sham lesions (n = 2 birds) – became diffuse and no longer categorically distinct (i.e., the syllable structure was lost) (Fig. 1b,c, Extended Data Fig. 1). Uva provides input to the cortical nucleus HVC (proper name) through direct projections as well as indirectly via the nucleus interfacialis of the nidopallium (NIf)17 and Avalanche (Av)18 (Fig. 1a). During singing, intrinsic circuitry within HVC16, 19–22 generates a sparse sequence of activity tiling the entirety of song23, with each premotor neuron contributing a burst at a single invariant timepoint relative to the song24, 25. Using this sparse HVC code as a temporal readout, we examine the functional connections from Uva to HVC to assess the impact of motor-related thalamic circuitry in the context of an ethologically-relevant complex behavior.
Fig. 1 |. Uva perturbations impact normal syllabic song structure.

a, Schematic of the zebra finch song control pathway (sagittal view), including HVC (proper name), nucleus Uvaeformis (Uva), nucleus interfacialis of the nidopallium (NIf) and nucleus Avalanche (Av). Inset shows a histological slice through Uva with GFP labelled HVC-projecting neurons (n = 3 birds). b, c, Example sonograms from two individual zebra finches before (top) and after (bottom) a unilateral Uva lesion (b) or a similarly sized lesion located outside of Uva (c). Vocalizations produced by each bird are represented by single points in a t-SNE plot at bottom (prelesion green, postlesion black). d, Example songs truncated by bilateral Uva stimulation (yellow lines = stimulation times, red lines = song truncation times). Reference song at top. e, Uva stimulation and song truncation times (black and red ticks, respectively, n = 325 trials) for all stimulated trials resulting in a truncation aligned to the sonogram in (d). Lower panel represents probability of stimulation and song truncation (black and red lines, respectively). f, Example songs truncated by bilateral HVC stimulation. g, HVC stimulation and song truncation times (n = 331 trials) aligned to the sonogram in (f). h, i, Probability of song truncation related to closest syllable offset (h) or stimulation time (i) following Uva (n = 4 birds, left) or HVC (n = 3 birds, right) stimulation. Each line represents data from a single bird; dashed lines refer to birds shown in (e) and (g), respectively.
Perturbation of thalamus during singing
Though its principal necessity had long been established12–17, the dynamic role of Uva activity for vocal production remains an important point of disagreement. Uva has been proposed to be part of a fast feedback loop during singing (i.e., ‘fast feedback model’) whose persistent activity is required to propagate moment-to-moment sequence generation in HVC26. In the alternative ‘initiation model’, Uva activity may provide a ‘go cue’ for song-related HVC sequences27, 28 which can then propagate without the need for additional thalamic drive16. These models make distinct predictions concerning the impact of an ‘online’ perturbation during singing: The ‘fast feedback model’ predicts that Uva perturbations will lead to an indiscriminate song truncation at all time points, while the ‘initiation model’ predicts that the impact of Uva perturbations will be restricted to specific moments within the song (i.e., syllable boundaries).
To test these hypotheses, we manipulated ongoing network activity using electrical microstimulation (5 pulses at 500 Hz, current range: 20–120 μA) administered bilaterally during singing. This manipulation profoundly suppressed local HVC activity for ~200 ms (n = 64 neurons from 2 birds), likely through the activation of local inhibitory interneurons (Extended Data Fig. 2), consistent with previous observations in other circuits29, 30, including thalamus30. In singing birds, bilateral stimulation of Uva (Fig. 1d, Extended Data Fig. 3a–d) led to truncations at the end of syllables (Fig. 1e, Extended Data Fig. 3e–h) (n = 4 birds, modal latencies: 47, 49, 60 and 71 ms). Additionally, song was interrupted at discrete transition points near the center of long duration, complex syllables (Extended Data Fig. 3e–h), consistent with previous observations that these are often composed of smaller (~ 100–150 ms) functional elements10, 22. Similar effects were observed when we stimulated Uva unilaterally in 2 birds (Extended Data Fig. 3l,m). Conversely, an equivalent, bilateral perturbation administered to HVC (reanalyzed data from31) truncated singing throughout the entirety of the song (Fig. 1f,g, Extended Data Fig. 3i–k) with a modal latency of ~ 60 ms (range: 49 – 65 ms, n = 3 birds), in agreement with previous results32, 33. Uva and HVC stimulation-related truncation time distributions were significantly different when aligned to either the end of syllables (Fig. 1h) or the time of the stimulation (Fig. 1i) (Kruskal-Wallis test, p < 0.0001 and p < 0.01, respectively). The truncation of song at syllable boundaries following Uva perturbation indicates that thalamic activity is critical for starting the following syllable but not for completing the ongoing syllable, consistent with the ‘initiation model’ introduced above.
HVC receives song-related thalamic input
After demonstrating a role for Uva in the initiation of song elements, we next examined the anatomical features of the Uva-to-HVC thalamocortical pathway. We first quantified the number of Uva neurons directly projecting to HVC. Using an intersectional retrograde approach (see Methods), we counted 739, 643, and 887 HVC-projecting Uva (UvaHVC) neurons in the ipsilateral hemisphere of 3 zebra finches (Extended Data Fig. 4). We then turned our attention to the impact of these Uva inputs into HVC. Anterograde labeling of Uva neurons with GFP revealed widespread thalamic innervation throughout the extent of HVC (Fig. 2a). Furthermore, at a single neuron level, large axonal swellings (likely synaptic boutons) from Uva were observed near the soma and dendrites of some premotor HVC neurons (Fig. 2b), suggesting direct synaptic connections to these cells13.
Fig. 2 |. Structure and function of thalamic inputs to HVC.

a, GFP-labeled Uva axons densely innervate HVC (borders indicated with dashed white line). b, At top, GFP-labeled Uva axons located near a single HVC premotor neuron (tdTomato). At bottom, a closer view of putative synaptic connections #1 (z-stack = 4 μm) and #2 (z-stack = 2.5 μm) indicated above (n = 2 birds). c, Single-trial activity of an Uva axon labeled with GCaMP6s. Raw audio trace shown above; axonal fluorescence of ROI 13 shown below. d, GCaMP6s activity from three example Uva axons. ROIs indicated by white lines in micrographs on left (scale bars = ROI 13: 5 μm, ROI 6: 2 μm, ROI 10: 10 μm). Average and SEM fluorescence traces aligned to vocal onsets (ΔF/F scale bars = intro notes: 0.1, song: 0.4, and distance call: 0.4). Asterisks denote significant responses. e, The ‘modulation index’ indicates response strength for all imaged axons (n = 2 birds; blue = decrease, red = increase, black = n.s.).
A simple prediction from our behavioral result is that thalamic inputs to HVC will be modulated during singing. In prior electrophysiological studies of Uva14, 15, no efforts were made to distinguish the contribution of specific projection neuron subtypes (to HVC, to NIf, or to Avalanche) from the activity of local circuit interneurons. To test whether HVC-projecting Uva neurons exhibit vocalization-related responses, we expressed the calcium indicator GCaMP6s in Uva and used 2-photon imaging to examine the activity of individual thalamic axons in HVC during vocalization (Fig. 2c). Uva inputs recorded from 2 birds displayed heterogeneous responses across a range of vocal types (Fig. 2d,e), as determined by the ‘modulation index’, or the proportion of trials in which a significant increase (1) or decrease (−1) in fluorescence was detected (see Methods). Specifically, we found that 12 of 13 ROIs responded significantly during singing, while only 7 changed their activity during introductory notes (5 increasing and 2 decreasing), which typically precede song. Thalamic axons also sometimes increased their activity during calls, vocalizations which are distinct from song (distance call: 5 out of 7 ROIs; tet call: 4 out of 8 ROIs) (Fig. 2e). In sum, these results indicate that although thalamic inputs to HVC exhibit some selectivity across the zebra finch vocal repertoire, they appear to be universally activated during song.
Thalamically driven HVC subpopulation
Our next step was to estimate the impact of thalamic activation onto downstream HVC circuitry in the context of song by functionally identifying neurons within HVC that were driven by thalamic activity. To accomplish this, we expressed GCaMP6s in HVC neurons and used 2-photon imaging to monitor Uva stimulation-evoked responses in HVC (n = 6 birds) (Fig. 3a). We imaged different HVC cell types as defined by retrograde tracers and functional criteria (see Methods). In addition to HVC premotor neurons, which project to the robust nucleus of the arcopallium (RA) – or HVCRA neurons (n = 251), we also imaged HVC neurons projecting to the avian basal ganglia called Area X (HVCX neurons, n = 667) (Fig. 3b). The remaining neurons (n = 351) had functional profiles inconsistent with these cell classes and may represent either local inhibitory interneurons25 or members of a considerably smaller population of cells projecting to Avalanche (HVCAv neurons)34. Consistent with previous findings13, electrical stimulation in Uva resulted in short latency, stimulus-locked activation in a subset of HVC neurons (e.g., Fig. 3c,d, Cell #1) independent of the state of the bird (i.e., awake or ketamine anesthetized, see Methods), while most imaged neurons showed no stimulation-induced fluorescence changes (e.g., Fig. 3c,e, Cell #2).
Fig. 3 |. Classifying HVC premotor neurons based on thalamic drive.

a, Schematic depicting simultaneous in vivo 2-photon imaging in HVC along with electrical microstimulation in Uva. b, In vivo 2-photon image of an example plane featuring GCaMP6s-labeled somata (white outlines = ROIs, white arrows = HVC premotor neurons, blue arrow = HVC interneuron, red soma label = basal ganglia-projecting HVC neurons) (n = 6 birds). c, Average fluorescence changes following 25 Uva stimulation events; only Cell 1 shows significant response (n = 6 birds). d, e, HVC premotor neuron examples from (c) which are driven (d) or not driven (e) by Uva stimulation. f, g, Distribution of activation ratios (f) and a scatterplot displaying activation ratios and simulation-evoked ΔF/F (g) for 251 HVC premotor neurons. Colors indicate functional distinction as UvD or nUvD neurons following clustering procedure (see Methods).
We established a quantitative metric to identify putative thalamorecipient HVC neurons. Specifically, we calculated an ‘activation ratio’ consisting of the proportion of trials with significant stim-induced responses across trials (24.8 ± 7.0 stimulation trials/neuron, see Methods). We found a bimodal distribution of activation ratios in our population of HVCRA premotor neurons (Fig. 3f) but not in HVCX or ‘HVC interneurons + HVCAv’ (Extended Data Fig. 5a–c). We then used k-means clustering to assign each premotor neuron (n = 251, n = 6 birds) to one of two populations: ‘Uva driven’ (UvD) cells activated by Uva stimulation (15.9%; median latency = 5.8 ms; Extended Data Fig. 5d) and cells which were ‘not Uva driven’ (nUvD) (Fig. 3g). Because only a small subset of cells was driven by Uva stimulation, these results indicate that thalamic input primarily targets a specific subgroup of premotor neurons within HVC.
Function of thalamus driven HVC neurons
Once the premotor neurons had been classified either as UvD or nUvD cells, we revisited these neurons to determine their activity profile in the singing bird (Fig. 4a–c, supplementary video 1). The vast majority (99.2%) of song-related HVC neurons classified as premotor (i.e., both UvD and nUvD cells) produced a single, extremely reliable phasic increase in fluorescence for each rendition of song, whereas 29.4% of neurons labeled with a retrograde marker from X exhibited multiple phasic increases, consistent with previous electrophysiological recordings24, 25. We used an established Bayesian inference method23 to pinpoint the burst onset time across multiple song trials (27.6 ± 13.2 trials/neuron) (Fig. 4d). This approach enabled us to estimate the timing of song-related activity for individual premotor neurons with sufficient temporal precision (temporal uncertainty: 2.9 ± 1.4 ms) to characterize the activity profiles of UvD and nUvD neurons in relation to vocal production (Fig. 4e,f).
Fig. 4 |. Differential roles of UvD and nUvD neurons during song production.

a, b, Average response of two premotor neuron examples to Uva stimulation (Cell #6, n = 50 trials; Cell #9, n = 74 trials; shading = SEM). c, GCaMP6s responses during singing (ΔF/F, scale bar: 100%) for the two HVC premotor neurons highlighted in (a) and (b). d, Song-related activity of Cells #6 (n = 41 trials) and #9 (n = 66 trials) aligned to song motif. Dotted line indicates estimated median spiking event onset time. Traces normalized by maximum value. e, Distribution of spiking event time estimates (see Methods) relative to the median estimate for Cells #6 and #9. f, Distribution of spiking event time uncertainties for all 241 HVC premotor neurons active in the context of vocal production. g, Spiking event times for UvD (n = 11) and nUvD (n = 58) neurons (red and black ticks, respectively) in Bird 490 aligned to sonogram and song amplitude. Arrows indicate the song-related event times of Cells #6 and #9 shown in (d). h–k, Median, normalized song amplitude envelopes aligned to UvD and nUvD spiking event times for Bird 490 (h, j) and all 6 birds (i, k) (UvD, n = 30 events; nUvD, n = 174 events). The zero point represents the timing of each HVCRA neuron’s spiking event. Red bar in (i) indicates time for which envelope exceeds the 99% CI (see Methods). l, m, Spiking event times aligned to vocal onsets (l) or offsets (m) for UvD and nUvD HVC premotor neurons (n = 6 birds). Shading indicates presence of vocalization. Horizontal red lines indicate moments in which the population exceeds 99% CI of a uniform distribution. Syllable, introductory note, and call-related event times were pooled; see Extended Data Fig. 7a,b for each vocal type analyzed separately. n, Uva preferentially engages a subpopulation of HVC ‘starter neurons’ (in red) directly or via NIf or Avalanche to initiate syllables.
If the thalamic nucleus Uva is playing a role in vocal initiation, then the activity of UvD neurons should be nonuniformly distributed throughout the song (i.e., active before vocal onsets). Our song perturbation results (see above, Fig. 1) and previous work27 suggest that the brief (~ 20–50 ms) inspiratory pulses preceding syllable onsets could be relevant landmarks in this regard. Upon initial inspection, we observed that UvD neuron activity appeared clustered around these silent periods preceding syllables or shortly before local amplitude minima of long duration complex syllables, while the activity of nUvD neurons did not correspond to these features (Fig. 4g, Extended Data Fig. 6a–f). To quantify this difference, we calculated the median event-triggered song amplitude for both the UvD and nUvD populations (Fig. 4h–k). UvD events were associated with a significant dip in song amplitude from 4 to 19 ms (i.e., exceeding the 99% confidence interval (CI) of a uniform distribution, see Methods) (Fig. 4h,i, Extended Data Fig. 6g). Conversely, the song amplitude did not vary as a function of the timing of nUvD spiking events, as expected for event times that are uniformly distributed across the song (Fig. 4j, k, Extended Data Fig. 6). Furthermore, the event-triggered song amplitude associated with the UvD population (window: 0 – 25 ms) was significantly lower than that associated with the nUvD population (Wilcoxon, two-sided, p<0.05). Taken together, these results demonstrate that UvD population firing is associated with minima in the song amplitude profile, consistent with moments where syllable initiation is likely to occur.
To directly test the notion that UvD firing events cluster around vocal boundaries, we examined the timing of these neurons with respect to the onsets and offsets of song syllables and other vocal behaviors, specifically introductory notes and calls (Fig. 4l,m, Extended Data Fig. 7a,b). We therefore use the term ‘vocal onsets and offsets’ inclusively to refer to all these vocal types. We found that nUvD spiking events were uniformly distributed within our analysis window (± 50 ms relative to vocalization) while UvD spiking events consistently occurred ~25 ms before vocal onsets (Fig. 4l, see Methods), roughly equivalent to the delay inherent between HVC premotor commands and song production35. We then aligned responses with respect to vocal offsets and found that the activity of UvD neurons (but not nUvD neurons) was clustered at this boundary as well (Fig. 4m), potentially serving as a stop signal or a ‘go cue’ for the following syllable or song motif35. We observed cases in which UvD neurons were active at the end of the last syllable of song as well as in advance of the first syllable, consistent with UvD premotor neurons activity mediating both stop and start signals (Extended Data Fig. 8). Importantly, the state of the bird during our functional classification of thalamorecipient neurons had no bearing on the outcome; we observed significant clustering of song-related spiking activity around vocal boundaries for neurons designated as Uva driven in either awake or ketamine anesthetized birds (Extended Data Fig. 7c,d). Additionally, syllable onset activity was observed in HVCX neurons (Extended Data Fig. 7e), whose role in singing behavior in the adult bird remains unclear36. Taken together, these data indicate that the thalamic nucleus Uva is preferentially wired onto cortical neurons that represent vocal boundaries and are therefore well-positioned to initiate or terminate behaviorally relevant network activity.
DISCUSSION
Our results support a conceptual model in which thalamic inputs from Uva can selectively engage specific elements within a cortical premotor circuit to direct vocal behavior (Fig. 4n). To accomplish this, Uva could drive HVC neurons directly via the large synaptic inputs that we found to be active during song production (Fig. 2). Uva may also exert its influence through NIf or Av, nuclei that are important for song learning and auditory processing34, 37 but – unlike Uva12–16 – appear to play a considerably lesser role in adult courtship song production38, 39. The temporal clustering of UvD neurons along with the Uva perturbation results are inconsistent with existing models requiring continuous thalamic activation for song production26. Instead, we provide a compelling biological example of a recently proposed theoretical proposal7 in which thalamic inputs can precisely initiate individual elements within the context of a skilled behavioral sequence.
Uva is embedded within a complex song production network. In addition to the multiple postsynaptic targets described above, it also receives input from dorsomedial nucleus (DM) of the intercollicular complex and nucleus parambigualis (PAm), a respiratory premotor center in the medulla33. Electrical stimulation in Uva may therefore activate these upstream regions antidromically or trigger other descending fibers (e.g., from RA) passing nearby, which may complicate the interpretation of our behavioral observations. However, previous work has demonstrated that direct stimulation of either PAm33 or RA32, 33 results in fixed-latency song interruptions, like those observed following HVC stimulation and categorically different from what we report following Uva stimulation. Additionally, although the song-related activity in DM and PAm has not yet been characterized, signals arising from these upstream nuclei must first pass through Uva to interface with the HVC premotor network, further reinforcing the role of the thalamic nucleus Uva as an important functional bottleneck.
Our findings have demonstrated that thalamocortical inputs can mediate their effect through preferential wiring onto a key premotor population. We have also unveiled new testable hypotheses regarding the nature of this thalamic engagement of cortical circuitry. First, although the timing of Uva and NIf neuron activity14, 15, 40 and depolarizing postsynaptic potentials in HVC neurons26 appear to favor syllable boundaries, activity is consistently found at multiple times throughout the song. How can this broader activity translate into a single, precise spiking event, for instance, at the level of an HVC ‘starter neuron’ (i.e., UvD neurons active at the beginning of a vocalization) (Fig. 4n)? One hypothesis is that multiple coactive thalamic inputs must converge on a single HVC neuron to initiate a downstream response. Alternately or in addition, gaps in inhibition mediated by local HVC interneurons could provide sparse permissive windows during which thalamic input can effectively drive individual HVC premotor neurons41, 42. Consistent with this notion, local minima in interneuron firing rates have been observed to proceed syllable onsets in singing birds25. Likewise, ‘starter neurons’ may be more excitable than other cells within the HVC circuit43, which could facilitate their activation. Finally, HVCX neurons, which were less robustly driven by Uva stimulation than HVCRA neurons (Extended Data Fig. 5a,b), could play a role in relaying Uva drive to HVC premotor neurons19, 21. These factors may further explain the ability of thalamic drive to initiate local HVC dynamics that lead to the formation of individual vocalizations.
In conclusion, our results highlight a mechanism by which motor thalamus can initiate cortical dynamics to generate a complex motor sequence, an arrangement that carries several systems-wide advantages. Uva is well-positioned to coordinate bilateral activity at key moments in the song, which can provide an important synchronizing cue given the lack of interhemispheric projections in the cortical song motor pathway28. Furthermore, Uva relays dense inputs from the brainstem respiratory centers to HVC44, and this feedback likely aids in the integration of breathing and vocal production45. A similar arrangement has been suggested for the coordination of human speech and respiration46–48, but direct anatomical pathways supporting this hypothesis have so far not been described. Future work can investigate how sensory inputs – including those contributing to the well-characterized visual responses in Uva – may exploit this wiring to initiate singing behavior49. In addition, the role of Uva in motor initiation is reminiscent of recent rodent studies2, 3, which have demonstrated the necessity of thalamic input to motor cortex for cued movements. Specifically, thalamic input can provide a brief ‘go cue’ that rapidly reorganizes cortical dynamics from a motor planning to a motor execution phase2. Taken together with our present results, thalamus appears to act as a critical initiator of premotor sequences and their corresponding behaviors across vertebrate taxa.
METHODS
Animals
We used adult (> 90 days posthatch) male zebra finches (Taeniopygia guttata) that were obtained from an outside breeder and maintained in a temperature- and humidity-controlled environment with a 12/12-hour light/dark schedule. All animal maintenance and experimental procedures were performed according to the guidelines established by the Institutional Animal Care and Use Committee at the New York University Langone Medical Center.
Surgical procedures
All surgical procedures were performed under isoflurane anesthesia (1%–3% in oxygen) following established guidelines. To access Uva (nucleus Uvaeformis), the beak bar was positioned 5 degrees up from horizontal and the electrode was inserted vertically. At this head angle, Uva coordinates varied from 1.5–1.7 mm lateral and 3.0–3.7 mm anterior from the bifurcation of the superior sagittal sinus and 5.1–5.5 mm ventral from the brain’s surface. For all Uva-related manipulations (i.e., lesions, chronic stimulation-electrode implantations, and viral injections into Uva), Uva was confirmed antidromically via HVC stimulation. A platinum-iridium or tungsten search electrode (0.5 MΩ, MicroProbes) was used to measure spiking responses in Uva evoked by HVC stimulation (0.5 Hz, biphasic single pulse, 0.2 ms/phase, 100 μA). Stimulation was enabled using a bipolar electrode in HVC constructed from Teflon coated stainless steel wire (0.002” California Fine Wire) inserted 2.3 mm lateral and 0.25 mm anterior from the bifurcation of the superior sagittal sinus with the beak bar at 45 degrees down from the horizontal.
Chronically implanted bipolar electrodes for lesions and microstimulation
We performed unilateral lesions acutely or chronically either in or adjacent to Uva, with the latter case categorized as a ‘sham’ condition. In some cases, lesions were made using custom-built bipolar stainless-steel electrodes (0.002” California Fine Wire) inserted acutely into Uva during anesthesia. In other cases, lesions were produced using custom bipolar electrodes composed of a platinum-iridium electrode (0.5 MΩ, MicroProbes, PI20030.5B3) and an insulated stainless-steel wire (0.002” California Fine Wire) directly affixed to the Pt/Ir-electrode using cyanoacrylate. The tips of the stainless-steel and platinum-iridium wires were displaced vertically by approximately 400 μm. The platinum-iridium tip of this electrode allowed for unit recordings during implantation and therefore an extremely precise, electrophysiology-guided placement in Uva. Unilateral lesions were achieved with 30-second biphasic pulses (acute: 50 μA, 6 total pulses; chronic: 50–500 μA, 6–10 total pulses). Lesion locations were confirmed histologically (details below). For all microstimulation experiments (i.e., Fig. 1, 3), we used the latter Pt/Ir type bipolar electrode with 0.001” stainless steel wire (California Fine Wire) displaced 220–300 μm vertically (Extended Data Fig. 3b). For behavioral microstimulation experiments (i.e., Fig. 1) both hemispheres were implanted with bipolar electrodes.
In vivo stereotaxic injections
Retrograde tracers
We used glass pipettes (opening diameter: 20 μm) with an oil-based pressure injection system (Nanoject III, Drummond Scientific) for all tracer and virus injections. For post hoc assessment of lesion success or stimulation electrode position, the ipsilateral HVC was injected with 200 nl of either dextran Alexa Fluor 488 (Invitrogen, D22910) or a virus (scAAV-DJ9-hCMV-chl-EGFP) to retrogradely label neurons projecting from Uva to HVC (e.g, Extended Data Fig. 4d–f). Counting Uva-HVC projecting neurons required a dense labeling of the HVC-projecting Uva neuronal population. Therefore, we chose a two-tracer approach: EGFP50 (Extended Data Fig. 4c) together with cholera toxin subunit B (100 nL, CTB, Alexa Fluor™ 647 conjugate, Invitrogen, C34778) (Extended Data Fig. 4b), and injected into the right hemisphere.
Axonal labeling
To label direct projections from Uva to HVC, we injected virus (AAV9-CAG-GFP, Addgene: #37825-AAV9) into two locations within Uva (62 nL/site), one at the ventral edge of Uva and one in its center. To mark potential HVC postsynaptic targets, we sparsely labeled HVC-RA projection neurons with tdTomato. To accomplish this, we first electrophysiologically identified RA with a tungsten electrode (0.5 MΩ, MicroProbes) and injected 50 nl of a retrograde Cre virus diluted in physiological saline solution into its center (dilution: 1:20; scAAV-DJ9-hCMV-CRE)50. 200 nL of Cre-dependent tdTomato virus (AAV9-FLEX-tdTomato; Addgene: #28306-AAV9) was then injected into HVC.
GCaMP injections
We used viral vectors to express the calcium indicator GCaMP6s (AAV9-CAG-GCaMP6s-WPRE-SV40, Addgene: #100844) in either Uva or HVC. Uva injections were performed at two sites (ventral edge and center, 62 nL of virus per site). To cover the entirety of HVC, we performed injections at 9 to 12 sites separated by approximately 250 μm in a grid-like fashion. At each point, we used either single injections at 400 μm below the brain’s surface or two injections at 400 and 550 μm depth. To minimize the impact on the tissue, HVC virus injections were performed with beveled pipettes (opening diameter: 20 μm, length of bevel: 80–100 μm).
RetroBead injections
We injected red RetroBeads (Lumafluor Inc.) into Area X (two injection sites, 70 nL each). Area X was accessed via a small craniotomy centered at 1.5 mm lateral and 5.9 mm anterior of the superior sagittal sinus with the beak bar at 8 degrees up from the horizontal. Injections were made 2.8 mm below the brain’s surface at 5.8 and 6.0 mm anterior (first and second injection, respectively) and 1.5 mm lateral. This procedure robustly labeled a large majority of basal ganglia projecting HVC neurons (HVCX) and allowed for an unambiguous identification of that cell class during in vivo 2-photon imaging.
Implantation of cranial window
Cranial windows were placed above HVC for both axonal imaging of Uva-HVC projection neurons and somatic imaging of HVC neurons. In each case, we carefully removed a circular piece of dura (diameter approximately 1 mm) above HVC while leaving the transparent pia intact. The window was implanted at the end of the injection procedure (HVC injection) or two weeks after the virus was injected (Uva injection). We sealed a 3-mm-diameter circular coverglass (#1 thickness, Warner Instruments) with Kwik-Sil adhesive (WPI) and fixed the edge of the glass with cyanoacrylate. We also cemented a black plastic ring (inner diameter: 5 mm, outer diameter: 7.5 mm) around the cranial window to prevent light contamination during imaging.
Head-fixed singing protocol
We habituated zebra finches to sing to a female while head-fixed, following a previously published protocol from our laboratory23. The female was initially visually occluded using a curtain which was lifted by a stepper motor (Wantai, Model 57BYGH420–2) controlled by an Arduino (Arduino UNO with Arduino Motor Shield). We recorded female-directed singing with a lavalier condenser microphone (Audio-Technica, AT831) placed approximately 4 cm away from the beak of the head-fixed bird. The signal was amplified using a solid-state preamplifier (Behringer, Ultragain Pro MIC2200). Song was detected using a digital signal processor (RX8, Tucker-Davis Technologies), and custom software was written using the RPvdsEx interface (Tucker-Davis Technologies) and MATLAB to open a solenoid (NResearch) for 200 ms to dispense approximately 20 μl of water. Because the zebra finch produces a song containing short gaps between syllables and motifs, singing behavior was defined as time periods in which the ratio of high-frequency power to low-frequency power (1–7 kHz and 0–1 kHz, respectively) was greater than 3 for more than 50% of a 1-second sliding window.
2-Photon Imaging
We used a customized movable objective microscope (Sutter Instrument) to scan our field of view (frame rate: 28.8 Hz) with a resonant system (Thorlabs) and ScanImage 4.2 software51. All imaging was done using a 25x water immersion objective (Olympus, XLPLN25XWMP) with a numerical aperture of 1.05 and a working distance of 2 mm. To protect the microscope from external light contamination, we wrapped it with a light-attenuating material. Additionally, we fit a black balloon to the tip of the objective on one end and the black ring surrounding the optical window on the other52. The excitation source was a mode-locked Ti:sapphire laser (Chameleon, Coherent) tuned at 920 nm and controlled by a Pockels cell (Conoptics, Model 350–80; Controller 302RM). Fluorescent light was detected using a GaAsP photomultiplier tube (H10770PA-40 PMT module, Hamamatsu) and a wide detection path (2-inch collection lens).
To apply electrical stimulation to Uva while the ipsilateral HVC of a head-fixed bird was imaged, we connected the chronic stimulation electrode to a pulse generator (A-M Systems, Model 2100). For a given imaging plane, Uva was stimulated > 20 times at pseudo-random intervals (interval range, 2 – 4 s). Each stimulation event consisted of a train of five 100 μA bipolar, biphasic current pulses (0.2 ms/phase) delivered at 500 Hz. The stimulation-triggered activity of most cells was imaged in the awake nonsinging bird or under ketamine anesthesia (50 mg/kg, mixed with xylazine, 5 mg/kg).
Song-triggered Microstimulation
Data related to gauging the behavioral impact of perturbing Uva activity with electrical stimulation were collected as part of the present study; HVC stimulation data were re-analyzed from a previously published data set31. For the Uva stimulation experiments, we tested the effectiveness of the chronically implanted stimulation electrodes under anesthesia. A single electrical stimulus (biphasic current pulse, 0.2 ms/phase) was applied to Uva, separately for each hemisphere. Uva stimulation (5–50 μA) elicited orthodromic responses in the ipsilateral HVC, which we measured with a 0.5 MΩ tungsten electrode (MicroProbes) through an open craniotomy above HVC.
Once the electrode performance was confirmed, each bird was transferred into a sound-isolation chamber where singing was recorded with an omnidirectional microphone (Audio-Technica, AT803) and amplified with a tube preamplifier (Presonus, Studio Channel). We continuously monitored the sound inside the chamber with custom written MATLAB software, which recorded singing in a sound-triggered manner. Simultaneously, the microphone signal was sent to a digital signal processor (RX8, Tucker-Davis Technologies). Custom software was written using the RPvdsEx interface (Tucker-Davis Technologies) to detect song and subsequently send out a TTL pulse to trigger stimulation consisting of a train of five bipolar, biphasic current pulses (0.2 ms/phase) delivered at 500 Hz (A-M Systems, Model 2100); copying the previously established HVC stimulation protocol31. Stimulators were connected to the chronic Uva implants via a commutator (Doric Lenses, AERJ_12_HARW or Dragonfly, SL-88–10). The lowest stimulation current level necessary to truncate song in > 80% of cases was established empirically for each bird. Stimulation currents were kept the same for each hemisphere. After an effective current level was established, we used this current level to stimulate during a minimum of 100 motifs per bird. To avoid more than one stimulation event per motif and to distribute stimulation times uniformly across the motif, a random delay was chosen, uniformly distributed between 0 and 1200 ms, after which the trigger signal was sent. Following each stimulation, subsequent stimulations were blocked for 1.5 seconds.
Simultaneous Microstimulation and Silicon Probe Recordings
A high-density silicon probe (Diagnostic Biochips, 128–5) was lowered into HVC (2.25 mm lateral and 0.25 mm anterior to lambda) and a stainless-steel ground wire (0.001” AM systems) was inserted between the skull and the dura. Silicon elastomer (Kwik-Cast, WPI) was applied to the craniotomy once the target region was successfully identified (depth: ~0.4mm-0.7mm). We stimulated locally (~ 100–200 μm) using a platinum-iridium electrode (0.5 MOhm, MicroProbes) in head-fixed, awake birds that were held in a foam restraint for the duration of the experiment. A biphasic current (5 pulses of 100 μA at 500 Hz) was applied every 2–5 sec for > 100 trials. Neural data and TTL pulses time-locked to the triggered stimulation were acquired using the Intan Recording Controller.
Histology
Brains were fixed via cardiac perfusion of phosphate buffered saline (PBS) followed by 4% formaldehyde (PFA) in PBS. After removal, the brains were stored in PFA overnight and sliced at 50 μm (100 μm for electrode site verification) using a vibrating microtome (VT1200S, Leica). Slices were mounted on microscope slides (Fisher Scientific, Colorfrost Plus) with Vectashield Antifade mounting media with DAPI staining (Vector Laboratories). We imaged using a Zeiss confocal microscope (Axio ObserverZ1, LSM800) and took tiled image stacks (z-step size: 1 μm or 0.5 μm) for Uva cell counting and axonal images. For electrode site verification, we used an Olympus fluorescence microscope (BX53).
To detect Uva axons and potential postsynaptic targets in HVC, birds were perfused 2–3 weeks post-injection. Slices were incubated in a blocking solution containing, 0.2 g Bovine Serum Albumin, 10% Normal Donkey Serum, 0.5% Triton X-100 for 1 hour at room temperature. Slices were then incubated in blocking solution containing anti-GFP (1:1000; ThermoFisher, A-11122) and mouse anti-NeuN (1:500; Sigma, MAB377). After rinsing 3 times for 10 minutes each using PBS, slices were transferred to secondary antibodies, specifically anti-mouse AF647 and anti-rabbit AF488 (1:1000; ThermoFisher, A-32787 & A-21206 respectively), for two hours at room temperature. Slices were then rinsed 3 times for 10 minutes and mounted using Vectashield Antifade mounting media (Vector Laboratories). In another case, slices were stained against GFP, as described above, and against tdTomato using goat anti-tdT (1:200; OriGene, AB8181–200) and anti-goat AF594 (1:1000; ThermoFisher, A-11058) as the secondary.
To quantify the number of HVC-projecting Uva neurons, birds were perfused 6 days post injection. Slices were blocked with the same protocol as described above and then rinsed with PBS 3 times for 10 minutes each before incubation with secondary antibody anti-mouse AF555 (1:1000; ThermoFisher, A-31570) for 2 hours at room temperature. Slices were then rinsed 3 times for 10 minutes in PBS and mounted using Vectashield. Manual cell counting analysis was performed using Amira Software.
Data Analysis: Behavioral effects of Uva lesions
We recorded the songs of birds 1–12 days following Uva lesions. To determine lesion-induced changes to the structure of vocal elements, we used Principal Component Analysis (PCA). First, vocalizations were manually annotated by identity (i.e., syllable A, syllable B). For each bird, all segments were zero-padded to match the length of the longest syllable. Spectrograms were then embedded using PCA in MATLAB, restricting out analysis to the first five PCs. Clusters were determined using k-means for k = 2–10 and scored using silhouette analysis, which requires a minimum of two clusters. Distances were calculated using cosine distance to ensure the measure was invariant to rescaling of the data. Barnes-Hut t-distributed stochastic neighbor embedding (t-SNE) was used to visualize the distributions of vocal elements pre- and postlesion. Spectrograms (processed as described above) for both pre- and postlesion vocalizations were analyzed using an established t-SNE algorithm (perplexity hyperparameter = 45)53.
Data Analysis: Microstimulation perturbation of singing behavior
For all song recordings, if stimulation onset times fell within the range of one of the bird’s song motifs (i.e., from 50 ms before motif onset to the end of the motif), we manually annotated the start and stop time of all syllables, including truncated syllables. The ends of syllables were defined as the times at which the amplitude falls back to baseline. We used a custom graphical user interface for syllable annotations, which allowed for detailed spectrogram and raw audio inspection. The motif stop time was defined by the final syllable produced. For each bird, a representative reference motif was chosen, and the syllables of every individual motif were aligned to the corresponding syllable onsets of the reference motif. To avoid motif stop times unrelated to stimulation, we excluded stop times that happened after the length of the reference motif minus 30 ms from further analysis. We used a Kruskal-Wallis test to ask whether aligned Uva and HVC motif stop times came from the same distribution. To visualize and normalize truncation probabilities across individual birds, aligned stop times were smoothed with a sliding average (window: 10 ms, step size: 1 ms) and divided by the total number of truncations.
Data analysis: Simultaneous Microstimulation and Silicon Probe Recordings
Custom MATLAB software was used to analyze recordings and to align single unit data to stimulation times. Automated spike detection and sorting was carried out using Kilosort software54 and manual post-processing was performed using Phy55 as described previously20. Peristimulus time histograms were calculated using either 5 ms bins (Extended Data Fig. 2a–c) or 25 ms bins (Extended Data Fig. 2c). Spike histograms were normalized by the maximum firing rate during the 2-sec window centered on the stimulus. Neurons were classified as stimulus activated if they met our criterion: frstim/frpre - frstim < 0 (pre-stimulus window = −1000 – 0ms; stimulus window = 0 – 200ms). Waveforms were measured for all spike times occurring in the 1 sec prior to stimulus onset with rise time calculated as the time to reach 90% of the maximum peak-trough amplitude.
Data analysis: 2-photon Imaging
For both soma and axon imaging experiments, we used a multichannel analog-to-digital converter (Digidata 1550, Molecular Devices) and Clampex software (pCLAMP, v9) to synchronize the microphone and stimulation trigger signal with the start and stop time of each frame acquisition (frame resolution: 512 × 512 pixels; frame size: 228 × 228 μm or 341 × 341 μm for somatic imaging and 137 × 137 μm for axonal imaging). We defined the scan time for each individual frame-line as the moment that a lines’ central pixel was recorded. We motion-corrected all acquired images within and across trials using a non-rigid motion correction algorithm (NoRMCorre)56, 57. Following motion correction, ROIs (regions of interest) were manually drawn, and the fluorescence of each ROI was estimated using freely available software (ImageJ). Fluorescence traces were extracted by taking the mean pixel value within the respective ROIs and each ROI’s scan time was defined as the scan time of its central line. Because of the stability of zebra finch song-motor coding across time58, we matched ROIs across days in cases where the same ROI was recorded in more than one session. Next, we segmented and labeled all song motifs, introductory notes, distance calls, and tet calls using custom MATLAB software. Given the slight variability in song timing across trials9, 20, it was necessary to linearly warp the vocalizations to a uniform duration. For each vocalization, we applied an identical warp factor to the corresponding scan times. Finally, for each ROI and trial, we defined as baseline (F) the average of 250 samples before the curtain was lifted and calculated the ΔF/F.
Neurons labeled with red RetroBeads injected into Area X were defined as basal ganglia-projecting (i.e., HVCX) neurons. HVC premotor neurons (i.e., HVCRA neurons) were defined as cells without RetroBead labeling and with at least one stereotyped, vocalization locked phasic fluorescence increase. The remainder of imaged neurons, distinguished by an absence of a retrograde tracer and the lack of a stereotyped, vocalization locked phasic fluorescence increase may represent either HVC interneurons or Avalanche-projecting neurons.
The onset times of phasic fluorescence increases in HVC premotor neurons were previously shown to correspond to the burst onset times (i.e., spiking events) of these neurons23. We deconvolved the onset times of HVCRA neurons using a Markov Chain Monte Carlo (MCMC) inference method23. Briefly, the MCMC chain was run for 5000 sweeps over all parameters (with a burn-in of 2000 sweeps). The burst onset time was defined as the median onset estimate and uncertainty as the standard deviation of the last 3000 sweeps.
Data Analysis: Uva stimulation evoked ΔF/F
To quantify the fluorescence increase of individual HVC neurons following electrical microstimulation in Uva, we compared single-cell fluorescence before and after stimulation. In a first step, we subtracted local background fluorescence. Specifically, for a given somatic ROI, we generated an annulus ROI surrounding the somatic ROI while excluding close by other somatic ROIs. The area of the annulus was 20 times the area of the corresponding somatic ROI. The mean pixel value of the annulus was subtracted from the mean pixel value of the somatic ROI for each sample. After background correction, we quantified fluorescence changes on a trial-by-trial basis. For each stimulation event, we tested whether 15 samples of fluorescence values (approximately 500 ms) following the stimulation were significantly larger than the 15 samples before the stimulation (Wilcoxon test, one-tailed, p < 0.01). The sum of stimulations followed by a significant fluorescence increase was then divided by the total number of stimulations to obtain an ‘activation ratio’ for each individual neuron. Additionally, we calculated the ΔF/F, individually for each stimulation, using the mean fluorescence across the 500 ms preceding a stimulation as baseline (F). The average ΔF/F across the 500 ms following stimulation, across all stimulations, was then calculated to obtain a single, post-stimulation ΔF/F value for each neuron. Most HVC neurons were observed during two Uva stimulation conditions, under ketamine anesthesia and in the awake bird. For those cells, the condition with the maximum activation ratio and ΔF/F was selected for further analysis. We then used a k-means clustering algorithm on the activation ratios and ΔF/F of all premotor neurons to assign each cell to one of two clusters. Specifically, we used the sum of absolute differences, where each centroid is the component-wise median of the points in its cluster (MATLAB, k-means function, distance measure: ‘cityblock’). We refer to these two clusters as Uva driven (UvD) and not Uva driven (nUvD) HVC premotor neurons.
Data Analysis: Comparison of HVC premotor neuron spiking event times
We used the amplitude envelope of each bird’s reference motif (i.e., the song motif we used for linear time warping, see above) to calculate an event-triggered median song amplitude. For each reference motif, we calculated a sliding root-mean-square (RMS, window: 3.7 ms, step size: 25 μs) level and then divided by the 99th percentile of the motif’s RMS values to normalize amplitudes. These amplitude envelopes were then used to cut and align a 100 ms envelope cutout to each HVC premotor neuron spiking event (−25 to +75 ms relative to event time). For each cutout, the minimum amplitude was subtracted, and envelope edges were zero padded to account for event times at the onset or offset of the motif. Median event-aligned amplitude envelopes were calculated for UvD and nUvD HVC premotor neuron populations. Next, we took a bootstrapping approach to calculate a distribution of median event-aligned amplitude envelopes separately for UvD and nUvD premotor neuron populations. For each bird, we generated a random uniform distribution of time points matching the number of actual UvD or nUvD spiking events of a given bird and derived a 100 ms amplitude envelope cutout for each of those time points. We then registered the median of these amplitude envelopes across all birds and repeated this procedure 100,000 times to arrive at a distribution of median event-aligned amplitude envelopes. Finally, we compared the 99% confidence interval (CI) of these distributions to our data (UvD and nUvD separately).
To align premotor neuron spiking events to the onset or offset of song syllables, introductory notes or calls, we manually labeled those timepoints for all vocalizations. Syllable boundaries were defined as points where song amplitude falls to baseline for a minimum of 10 ms to include all inspiratory gaps, which typically range between 20 – 50 ms45. We then aligned all premotor neuron spiking events that happened within ± 50 ms relative to a vocal onset or offset to the closest onset and offset. These relative time points were assigned to 1 ms bins, their distribution was smoothed with a sliding average (window: 10 ms, step size: 1 ms) and divided by the total number of time points to obtain a normalized distribution of firing probabilities, separately for UvD and nUvD time points. Finally, taking a bootstrapping approach, we asked whether those distributions exceeded the 99% CI of a uniform distribution. We generated a uniform distribution of time points by the random assignment of matching the number of actual UvD or nUvD time points within our ±50 ms window into 100 bins (bin size: 1 ms). This random uniform distribution was then smoothed with a sliding average as described above. We repeated this step 100,000 times to obtain a distribution of uniform time point distributions and calculated the 99% CI for each 1 ms bin.
Data Analysis: UvaHVC axon activity
To quantify the responses of Uva-HVC projecting axons in HVC, we looked at the fluorescence modulation (ΔF/F) of individual axonal ROIs relative to the onset of vocalizations (song, introductory notes, distance calls and tet calls). If two or more axonal ROIs on a given plane showed the same activity profile, they were joined to one ROI resulting in 13 axonal ROIs. An ROI was defined as significantly modulated relative to a given vocalization type if at least 10 renditions of this vocalization were collected and, if the mean ΔF/F across the 400 ms window preceding vocalization onset was significantly different from the mean ΔF/F across the 400 ms window following vocalization onset across all renditions (paired Wilcoxon, two-sided, p < 0.01). For single song motifs and song bouts (i.e., several repeated song motifs in a row) the analysis deviated from this standard procedure as follows: The 400 ms before bout onset were used as a reference for every motif in the bout. This reference was then compared to a 400 ms window, sliding over a given motif in 100 ms steps. For example, if the motif had a length of 600 ms, 6 comparisons were made with the reference window and alpha was Bonferroni corrected accordingly. If at least one comparison was significant, the ROI was defined as significantly modulated by song.
To visualize modulation strength and direction (i.e., fluorescence increase or decrease) for all significant modulations, we calculated a ‘modulation index’. Here we asked for each individual rendition of one vocalization type, whether a ROI’s mean ΔF/F across 400 ms before vocalization onset was significantly different from the mean ΔF/F across a 400 ms window following vocalization onset. If there was no significant difference, we registered a 0. If there was a significant increase in fluorescence, we registered a 1. If there was a significant decrease, we registered a −1. The sum of the resulting vector was then divided by the total number of renditions to obtain a value between −1 and 1, i.e., the modulation index.
Throughout the main text, average values are given with standard deviation unless otherwise noted.
Extended Data
Extended Data Fig. 1 |. Uva lesions impact syllabic song structure.

a,b, Silhouette criteria calculated on the prelesion (a) and postlesion (b) vocalizations for each bird (n = 8) for k-means of various sizes embedded using the first five PCs. Black circles superimposed on curves indicate the number of syllables in each bird’s song motif. c,d, Histology of Uva lesioned (n = 6) (c) and sham lesioned (n = 2) (d) birds (brightfield images, sagittal slices). White arrows indicate the extent of the lesions. CP: Posterior commissure, OM: occipitomesencephalic tract, SpM: nucleus spiriformis medialis, Cb: cerebellum. e, Schematic of the zebra finch brain at the medio-lateral level of Uva (sagittal view) based on an established anatomical resource (http://www.zebrafinchatlas.org/). The highlighted region is used in (f) and (g) to show the lesion extent (in transparent blue) for individual birds. f, g, t-SNE plots representing vocalizations taken from Uva lesioned (f) and sham lesioned (g) birds (prelesion green, postlesion black). Example sonograms depict pre- and postlesion song motifs.
Extended Data Fig. 2 |. Circuit-level consequences of electrical stimulation.

a,b, Example spike rasters (top) and peristimulus time histograms (bottom) for 6 HVC neurons that respond transiently (a) or are suppressed (b) by local high-frequency stimulation (see Methods). Red line indicates the onset of stimulation. Intensity and timing of stimulation identical to that used in behavioral experiments (i.e., Fig. 1). Inset: Average waveform for each neuron shows a variety of spike widths (Spike rise times - Cell a: 0.09 ms, Cell b: 0.34 ms, Cell c: 0.20 ms, Cell i: 0.36 ms, Cell ii: 0.41 ms, Cell iii: 0.30 ms. c, Activity profiles of 64 neurons recorded during local stimulation from 2 birds (bin size: 25 ms). At right, a closer view of responses from transiently active neurons (bin size: 5 ms). Color scale ranges from 0 (no firing) to 1 (maximum firing rate during observation period). d, Histogram showing the spike rise times (see Methods) for all recorded neurons. Transiently active neurons (red) often displayed shorter rise times, indicative of inhibitory interneurons.
Extended Data Fig. 3 |. Stimulation-related song truncation times across all birds.

a, Histology was used to confirm the placement of our stimulating electrodes (n = 10 birds). Shown here is an example bird in which we labeled upstream nuclei labeled with a retrograde GFP virus into HVC (n = 3 birds), including nucleus Uvaeformis (Uva) and nucleus interfacialis of the nidopallium (NIf). Arrows indicate the bipolar electrode tract (left) and the axonal tract emanating from Uva (right). b, Photography of bipolar stimulation electrode. c,d, Magnified views of Uva from the bird shown in (a). In (d), the dotted white line indicates the electrode tract and the diffuse green fluorescence nearby are small electrolytic lesions made to identify the electrode site. e–k, Stimulation (black ticks) and song truncation times (Uva: red ticks; HVC: blue ticks) for all stimulated motifs aligned to sonogram shown on top (E: 325 trials, F: 119 trials, G: 114 trials, H: 187 trials, I: 331 trials, J: 163 trials, K: 214 trials). For each plot, a histogram (bin size: 10 ms) shows the probability of a stimulation event and song truncation to occur (stimulation = black, Uva truncation: red, HVC truncation: blue). Orange bars indicate examples of long duration, complex syllables. l, Example songs truncated by unilateral Uva stimulation in either the left or the right hemisphere (yellow lines = stimulation times, red lines = song truncation times). Reference song at top. m, Proportion of song truncations related to closest syllable offset for unilateral stimulation (bird 156, left hemisphere: 86 trials; bird 156, right hemisphere: 64 trials; bird 471, right hemisphere: 32 trials).
Extended Data Fig. 4 |. Quantifying the population of HVC-projecting Uva neurons.

a–c, Sagittal view of HVC (white dotted line) into which retrograde tracers were injected (right is anterior) (n = 3 birds). NeuN labeled neurons shown in orange (a). Extent of fluorescently labeled cholera toxin subunit B (red) relative to HVC (b). Local labeling of cells with EGFP (green) after the injection of a retrograde virus into HVC (c). d–f, Spheres indicate retrogradely labeled HVC-projecting Uva neurons from three zebra finches shown in parasagittal (d), coronal (e), and transverse (f) views. Scale bars: 200 μm.
Extended Data Fig. 5 |. Uva-stimulation responses in each HVC cell type.

a–c, Activation ratios following Uva stimulation for premotor neurons (HVCRA) (a), basal ganglia-projecting neurons (HVCX) (b), and putative interneurons (HVCInt) and Avalanche-projecting neurons (HVCAv) (c). Left histogram same data as plots at right but on a logarithmic axis. d, Individual (gray) and average (red) fluorescence changes following Uva stimulation for each Uva driven HVCRA neuron from Fig. 3g.
Extended Data Fig. 6 |. UvD and nUvD spiking event times relative to song for premotor neurons across all birds.

a–f, All spiking event times for UvD and nUvD neurons across all birds aligned to sonogram and amplitude of song motif. Bird 490 (UvD = 11, nUvD = 58) (a), Bird 471 (UvD = 1, nUvD = 13) (b), Bird 513 (UvD = 10, nUvD = 30) (c), Bird 520 (UvD = 6, nUvD = 51) (d), Bird 486 (UvD = 2, nUvD = 1) (e), Bird 467 (UvD = 0, nUvD = 21) (f). g, Comparison of song amplitude following UvD and nUvD spiking events. nUvD (black) and UvD (red) lines show the median song amplitude following spiking events for recorded data (window: 4 to 19 ms post spiking event; nUvD, n = 174, UvD, n = 30) compared with a 99% confidence interval (CI) based on the distribution of 100,000 median song amplitude values (same window) drawn from data matched numbers of uniformly distributed, random event times (see Methods).
Extended Data Fig. 7 |. Spiking event times of UvD and nUvD HVCRA and HVCX neurons aligned to vocalization onset and offset.

a, Spiking event times of HVCRA neurons active during singing. Left: Aligned to the closest syllable onset (UvD, n = 23; nUvD, n = 106). Right: Aligned to the closest syllable offset (UvD, n = 19; nUvD, n = 89). b, Spiking event times of HVCRA neurons active during introductory notes, distance calls, or tet calls. Left: Aligned to the closest vocalization onset (UvD, n = 8, nUvD, n = 19). Right: Aligned to the closest vocalization offset (UvD, n = 0, nUvD n = 11). No spiking events from UvD HVCRA neurons were found within 50 ms of introductory note or call offsets. c, Left: Scatterplot displaying activation ratios and Uva stimulation evoked ΔF/F for 227 HVCRA neurons observed in the ketamine condition. Filled circles in the UvD cluster (red) were also classified as UvD cells in the awake condition shown in (d). Middle: Spiking event times of HVCRA neurons – classified based on the ketamine condition – aligned to the closest syllable onset (UvD, n = 29; nUvD, n = 114). Right: Aligned to the closest syllable offset (UvD, n = 18; nUvD, n = 90). d, Left: Same as (c) for 220 HVCRA neurons observed in the awake condition. Filled circles in the UvD cluster (red) were also classified as UvD cells in the ketamine condition shown in (c). Middle: Spiking event times of HVCRA neurons – classified based on the awake condition – aligned to the closest syllable onset (UvD, n = 24; nUvD, n = 113). Right: Aligned to the closest syllable offset (UvD, n = 13; nUvD, n = 88). e, Left: Scatterplot displaying activation ratios and Uva stimulation evoked ΔF/F for 667 HVCX neurons. Neurons were classified using either k-means clustering (top) or a ‘strict’ activation ratio threshold (bottom), which was set to 0.38 (i.e. the lowest activation ratio of the UvD cluster in Fig. 3g). Middle: Spiking event times of UvD HVC neurons aligned to the closest syllable onset (k-means: HVCRA, n = 31; HVCX, n = 50; strict: HVCX, n = 28). Right: Aligned to the closest syllable offset (k-means: HVCRA, n = 19; HVCX, n = 23, strict: HVCX, n = 13). Horizontal red, green or blue lines indicate moments in which the population exceeds a 99% CI of a uniform distribution.
Extended Data Fig. 8 |. Uva driven (UvD) premotor neurons active at motif and bout transition points.

a, Average fluorescence responses (± SEM) to Uva stimulation for 5 functionally defined UvD HVC neurons. Range: 23–26 stimulation trials. b, Activity of UvD neurons during singing. Song amplitude shown at top; fluorescence trace in red (scale bar = 2 ΔF/F). Individual song motifs indicated by green background shading. c, Song motif aligned fluorescence responses for all songs during which Cell #1 (n = 42 trials), Cell #70 (n = 15 trials), Cell #8 (n = 65 trials), Cell # 85 (n = 23 trials), and Cell #117 (n = 24 trials) were recorded. Black dotted line indicates the median spiking event time estimated across all fluorescence traces (see Methods). Traces normalized by maximum value.
Supplementary Material
ACKNOWLEDGEMENTS
We thank Arkarup Banerjee, Dezhe Jin, Nader Nikbakht, Andreas Nieder, Lena Veit, and members of the Long laboratory for comments on earlier versions of this manuscript. Abby Paulson provided technical assistance.
Funding:
This research was supported by R01 NS075044 (M.A.L.), F31 NS116933 (M.E.), and Simons Collaboration on the Global Brain (M.A.L.).
Footnotes
COMPETING INTERESTS
The authors declare no competing interests.
DATA AVAILABILITY STATEMENT
The data and code that support the findings of this study are available from the corresponding authors upon request.
REFERENCES
- 1.Shenoy KV, Sahani M & Churchland MM Cortical control of arm movements: a dynamical systems perspective. Annu Rev Neurosci 36, 337–359 (2013). [DOI] [PubMed] [Google Scholar]
- 2.Inagaki HK et al. A midbrain-thalamus-cortex circuit reorganizes cortical dynamics to initiate movement. Cell 185, 1065–1081 e1023 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Dacre J et al. A cerebellar-thalamocortical pathway drives behavioral context-dependent movement initiation. Neuron 109, 2326–2338 e2328 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Guo ZV et al. Maintenance of persistent activity in a frontal thalamocortical loop. Nature 545, 181–186 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gaidica M, Hurst A, Cyr C & Leventhal DK Distinct Populations of Motor Thalamic Neurons Encode Action Initiation, Action Selection, and Movement Vigor. J Neurosci 38, 6563–6573 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Sauerbrei BA et al. Cortical pattern generation during dexterous movement is input-driven. Nature 577, 386–391 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Logiaco L, Abbott LF & Escola S Thalamic control of cortical dynamics in a model of flexible motor sequencing. Cell Rep 35, 109090 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Tanji J Sequential organization of multiple movements: involvement of cortical motor areas. Annu Rev Neurosci 24, 631–651 (2001). [DOI] [PubMed] [Google Scholar]
- 9.Glaze CM & Troyer TW Temporal structure in zebra finch song: implications for motor coding. J Neurosci 26, 991–1005 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Cynx J Experimental determination of a unit of song production in the zebra finch (Taeniopygia guttata). J Comp Psychol 104, 3–10 (1990). [DOI] [PubMed] [Google Scholar]
- 11.Okubo TS, Mackevicius EL, Payne HL, Lynch GF & Fee MS Growth and splitting of neural sequences in songbird vocal development. Nature 528, 352–357 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Coleman MJ & Vu ET Recovery of impaired songs following unilateral but not bilateral lesions of nucleus uvaeformis of adult zebra finches. J Neurobiol 63, 70–89 (2005). [DOI] [PubMed] [Google Scholar]
- 13.Coleman MJ, Roy A, Wild JM & Mooney R Thalamic gating of auditory responses in telencephalic song control nuclei. J Neurosci 27, 10024–10036 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Danish HH, Aronov D & Fee MS Rhythmic syllable-related activity in a songbird motor thalamic nucleus necessary for learned vocalizations. PLoS One 12, e0169568 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Williams H & Vicario DS Temporal patterning of song production: participation of nucleus uvaeformis of the thalamus. J Neurobiol 24, 903–912 (1993). [DOI] [PubMed] [Google Scholar]
- 16.Elmaleh M, Kranz D, Asensio AC, Moll FW & Long MA Sleep replay reveals premotor circuit structure for a skilled behavior. Neuron 109, 3851–3861 e3854 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Nottebohm F, Kelley DB & Paton JA Connections of vocal control nuclei in the canary telencephalon. J Comp Neurol 207, 344–357 (1982). [DOI] [PubMed] [Google Scholar]
- 18.Akutagawa E & Konishi M New brain pathways found in the vocal control system of a songbird. J Comp Neurol 518, 3086–3100 (2010). [DOI] [PubMed] [Google Scholar]
- 19.Mooney R & Prather JF The HVC microcircuit: the synaptic basis for interactions between song motor and vocal plasticity pathways. J Neurosci 25, 1952–1964 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Egger R et al. Local Axonal Conduction Shapes the Spatiotemporal Properties of Neural Sequences. Cell 183, 537–548 e512 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Kornfeld J et al. EM connectomics reveals axonal target variation in a sequence-generating network. Elife 6 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Long MA & Fee MS Using temperature to analyse temporal dynamics in the songbird motor pathway. Nature 456, 189–194 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Picardo MA et al. Population-Level Representation of a Temporal Sequence Underlying Song Production in the Zebra Finch. Neuron 90, 866–876 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Hahnloser RH, Kozhevnikov AA & Fee MS An ultra-sparse code underlies the generation of neural sequences in a songbird. Nature 419, 65–70 (2002). [DOI] [PubMed] [Google Scholar]
- 25.Kozhevnikov AA & Fee MS Singing-related activity of identified HVC neurons in the zebra finch. J Neurophysiol 97, 4271–4283 (2007). [DOI] [PubMed] [Google Scholar]
- 26.Hamaguchi K, Tanaka M & Mooney R A Distributed Recurrent Network Contributes to Temporally Precise Vocalizations. Neuron 91, 680–693 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Andalman AS, Foerster JN & Fee MS Control of vocal and respiratory patterns in birdsong: dissection of forebrain and brainstem mechanisms using temperature. PLoS One 6, e25461 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Schmidt MF Pattern of interhemispheric synchronization in HVc during singing correlates with key transitions in the song pattern. J Neurophysiol 90, 3931–3949 (2003). [DOI] [PubMed] [Google Scholar]
- 29.Valverde S et al. Deep brain stimulation-guided optogenetic rescue of parkinsonian symptoms. Nat Commun 11, 2388 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Lymer J, Prescott IA & Levy R Microstimulation-induced inhibition of thalamic reticular nucleus in non-human primates. Exp Brain Res 237, 1511–1520 (2019). [DOI] [PubMed] [Google Scholar]
- 31.Arfin SK, Long MA, Fee MS & Sarpeshkar R Wireless neural stimulation in freely behaving small animals. J Neurophysiol 102, 598–605 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Vu ET, Mazurek ME & Kuo YC Identification of a forebrain motor programming network for the learned song of zebra finches. J Neurosci 14, 6924–6934 (1994). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Ashmore RC, Wild JM & Schmidt MF Brainstem and forebrain contributions to the generation of learned motor behaviors for song. J Neurosci 25, 8543–8554 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Roberts TF et al. Identification of a motor-to-auditory pathway important for vocal learning. Nat Neurosci 20, 978–986 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Fee MS, Kozhevnikov AA & Hahnloser RH Neural mechanisms of vocal sequence generation in the songbird. Ann N Y Acad Sci 1016, 153–170 (2004). [DOI] [PubMed] [Google Scholar]
- 36.Scharff C, Kirn JR, Grossman M, Macklis JD & Nottebohm F Targeted neuronal death affects neuronal replacement and vocal behavior in adult songbirds. Neuron 25, 481–492 (2000). [DOI] [PubMed] [Google Scholar]
- 37.Zhao W, Garcia-Oscos F, Dinh D & Roberts TF Inception of memories that guide vocal learning in the songbird. Science 366, 83–89 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Cardin JA, Raksin JN & Schmidt MF Sensorimotor nucleus NIf is necessary for auditory processing but not vocal motor output in the avian song system. J Neurophysiol 93, 2157–2166 (2005). [DOI] [PubMed] [Google Scholar]
- 39.Otchy TM et al. Acute off-target effects of neural circuit manipulations. Nature 528, 358–363 (2015). [DOI] [PubMed] [Google Scholar]
- 40.Vyssotski AL, Stepien AE, Keller GB & Hahnloser RH A Neural Code That Is Isometric to Vocal Output and Correlates with Its Sensory Consequences. PLoS Biol 14, e2000317 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Kosche G, Vallentin D & Long MA Interplay of inhibition and excitation shapes a premotor neural sequence. J Neurosci 35, 1217–1227 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Cannon J, Kopell N, Gardner T & Markowitz J Neural Sequence Generation Using Spatiotemporal Patterns of Inhibition. PLoS Comput Biol 11, e1004581 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Miri A et al. Spatial gradients and multidimensional dynamics in a neural integrator circuit. Nat Neurosci 14, 1150–1159 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Reinke H & Wild JM Identification and connections of inspiratory premotor neurons in songbirds and budgerigar. J Comp Neurol 391, 147–163 (1998). [PubMed] [Google Scholar]
- 45.Schmidt MF & Wild JM The respiratory-vocal system of songbirds: anatomy, physiology, and neural control. Progress in brain research 212, 297–335 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Johnson MD & Ojemann GA The role of the human thalamus in language and memory: evidence from electrophysiological studies. Brain Cogn 42, 218–230 (2000). [DOI] [PubMed] [Google Scholar]
- 47.Jurgens U Neural pathways underlying vocal control. Neurosci Biobehav Rev 26, 235–258 (2002). [DOI] [PubMed] [Google Scholar]
- 48.Pattinson KT et al. Determination of the human brainstem respiratory control network and its cortical connections in vivo using functional and structural imaging. Neuroimage 44, 295–305 (2009). [DOI] [PubMed] [Google Scholar]
- 49.Wild JM Visual and somatosensory inputs to the avian song system via nucleus uvaeformis (Uva) and a comparison with the projections of a similar thalamic nucleus in a nonsongbird, Columba livia. J Comp Neurol 349, 512–535 (1994). [DOI] [PubMed] [Google Scholar]
METHODS REFERENCES
- 50.During DN et al. Fast Retrograde Access to Projection Neuron Circuits Underlying Vocal Learning in Songbirds. Cell Rep 33, 108364 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Pologruto TA, Sabatini BL & Svoboda K ScanImage: flexible software for operating laser scanning microscopes. Biomed Eng Online 2, 13 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Dombeck DA, Harvey CD, Tian L, Looger LL & Tank DW Functional imaging of hippocampal place cells at cellular resolution during virtual navigation. Nat Neurosci 13, 1433–1440 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Kollmorgen S, Hahnloser RHR & Mante V Nearest neighbours reveal fast and slow components of motor learning. Nature 577, 526–530 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Pachitariu M, Steinmetz N, Kadir S, Carandini M & Kenneth D,H Kilosort: realtime spike-sorting for extracellular electrophysiology with hundreds of channels. bioRxiv, 061481 (2016). [Google Scholar]
- 55.Rossant C et al. Spike sorting for large, dense electrode arrays. Nat Neurosci 19, 634–641 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Guizar-Sicairos M, Thurman ST & Fienup JR Efficient subpixel image registration algorithms. Opt Lett 33, 156–158 (2008). [DOI] [PubMed] [Google Scholar]
- 57.Pnevmatikakis EA & Giovannucci A NoRMCorre: An online algorithm for piecewise rigid motion correction of calcium imaging data. J Neurosci Methods 291, 83–94 (2017). [DOI] [PubMed] [Google Scholar]
- 58.Katlowitz KA, Picardo MA & Long MA Stable Sequential Activity Underlying the Maintenance of a Precisely Executed Skilled Behavior. Neuron 98, 1133–1140 e1133 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data and code that support the findings of this study are available from the corresponding authors upon request.
