Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Apr 2.
Published in final edited form as: Neuron. 2014 Apr 2;82(1):208–223. doi: 10.1016/j.neuron.2014.01.039

Emergence of context-dependent variability across a basal ganglia network

Sarah C Woolley 1,2,*, Raghav Rajan 1,3,4, Mati Joshua 3,5, Allison J Doupe 1,3
PMCID: PMC4132189  NIHMSID: NIHMS581477  PMID: 24698276

Summary

Context-dependence is a key feature of cortical-basal ganglia circuit activity, and in songbirds, the cortical outflow of a basal ganglia circuit specialized for song, LMAN, shows striking increases in trial-by-trial variability and bursting when birds sing alone rather than to females. To reveal where this variability and its social regulation emerge, we recorded stepwise from cortico-striatal (HVC) neurons and their target spiny and pallidal neurons in Area X. We find that cortico-striatal and spiny neurons both show precise singing-related firing across both social settings. Pallidal neurons, in contrast, exhibit markedly increased trial-by-trial variation when birds sing alone, created by highly variable pauses in firing. This variability persists even when recurrent inputs from LMAN are ablated. These data indicate that variability and its context-sensitivity emerge within the basal ganglia network, suggest a network mechanism for this emergence, and highlight variability generation and regulation as basal ganglia functions.

Introduction

The basal ganglia are critical for the learning, planning and execution of movement (Graybiel, 2008; Grillner et al., 2005). Moreover, the activity of cells within the basal ganglia appears to integrate motor commands with signals reflecting motivation. For example, basal ganglia activity is modulated by behavioral context such that the neural response associated with performing a movement when a reward is expected can be entirely different, or even absent, under conditions when no reward is forthcoming (Gdowski et al., 2001; Handel and Glimcher, 2000; Kawagoe et al., 1998). Such an integration of motor control and reward signals may be critical for on-line modulation of motor behavior as well as for motor and reinforcement learning.

A recent addition to our view of basal ganglia motor signals and their modulation by behavioral context has come from studies of a songbird cortical-basal ganglia circuit, the anterior forebrain pathway (AFP), dedicated to a single behavioral output, the bird’s learned song. This circuit, which is essential for both juvenile and adult song plasticity (Fig. 1A; Andalman and Fee, 2009; Bottjer et al., 1984; Kao et al., 2005; Olveczky et al., 2005; Scharff and Nottebohm, 1991; Warren et al., 2011), is strikingly sensitive to social context. Both immediate early gene (IEG) induction and neural activity change markedly when adult birds switch from singing alone (‘undirected’ song) to singing courtship song ‘directed’ at a female (Hessler and Doupe, 1999; Jarvis et al., 1998; Kao et al, 2008). In particular, during undirected singing, activity in the cortical output nucleus of the AFP, the lateral magnocellular nucleus of the anterior nidopallium (LMAN), is dominated by bursts and is highly variable in the timing and reliability of firing across song renditions (Fig. 1B & C). During directed singing, however, LMAN neurons rapidly shift to single spike activity that is precisely timed and reliably locked to song across trials (Fig. 1C; Hessler and Doupe, 1999; Kao et al, 2008). With the same switch from solo to social contexts, the song itself changes, in a dopamine-dependent manner, from song that is variable to stereotyped ‘performance’ song that is less plastic, preferred by females and seems to represent the bird’s best current version of song (Kao and Brainard, 2006; Kojima and Doupe 2011; Leblois et al., 2010; Murugan et al., 2013; Sakata and Brainard, 2009; Sasaki et al., 2006; Woolley and Doupe, 2008). Lesions or inactivations of LMAN eliminate the song variability seen in adult undirected song as well as that of juveniles still learning to sing, indicating that the behavioral variability is not a default state but actively requires cortical basal ganglia-circuitry (Goldberg and Fee, 2011; Kao and Brainard, 2006; Olveczky et al., 2005, 2011; Scharff and Nottebohm, 1991). Together these data have suggested that one crucial function of cortical basal ganglia circuits is to generate behavioral variability. In adult birds such variability has been shown to generate motor ‘exploration’ that can enable reinforcement learning (Andalman and Fee, 2009; Tumer and Brainard, 2007; Warren et al., 2011), and the structure of the variance in song crucially determines what is learned (Charlesworth et al., 2011).

Figure 1. Social context influences activity of single neurons in LMAN.

Figure 1

(A) In the ‘motor’ pathway (white boxes), HVC (proper name) projects to the robust nucleus of the arcopallium (RA) which innervates motor neurons used for singing (‘vocal output’). The ‘anterior forebrain pathway’ (AFP, gray boxes) also receives input from HVC. Specifically, HVC neurons project to the basal ganglia nucleus Area X, which in turn projects to the lateral magnocellular nucleus of the anterior nidopallium (LMAN) via the medial nucleus of the dorsolateral thalamus (DLM). LMAN provides the output of the AFP to the motor pathway and sends a recurrent projection back to Area X. The connectivity of putative cell types that we recorded from (filled in black) is indicated by circles and lines; glutamatergic HVCX neurons project to GABAergic spiny neurons (SN) in Area X that inhibit pallidal output neurons (P). Pallidal neurons send strong inhibitory projections to the thalamus. The other classes of neurons in Area X (interneurons and GPe neurons) are omitted for simplicity. (B–C) As described in Kao et al., 2008, social context influences the firing rate and variability of neurons in LMAN. (B) Spectrogram of a representative song motif. (C) Raster plots of firing of a single LMAN neuron during 25 motif renditions of undirected (top) and directed (middle) song and during ten non-singing epochs (bottom).

Because most neurophysiological studies of context-dependent variability have focused on the cortical nucleus LMAN, it remains unclear whether this neural variability and its social sensitivity emerge in the cortical-basal ganglia circuit or elsewhere in the brain. To investigate these questions, we recorded neural activity during singing at multiple earlier stages in the circuit, including 1) the inputs to basal ganglia from cortical premotor neurons in HVC, and both 2) putative spiny neurons (SNs) and 3) internal globus pallidus-like (GPi) neurons in the avian basal ganglia nucleus Area X (Fig. 1A). We found that, in marked contrast to LMAN, the activity of striatal-projecting neurons in HVC was highly stereotyped and equally precise in both behavioral conditions. Similarly, the firing of Area X SNs was equally precisely timed in both social conditions, although the firing rate of SNs showed some social context dependence. In contrast, putative GPi output cells in Area X, displayed dramatic social modulation of both firing rate and spike timing variability, and this variability did not depend on the recurrent cortical input from LMAN neurons. These data provide a striking illustration of how neural variability emerges and acquires context-sensitivity over the course of a circuit, and point to the striato-pallidal network as a key locus in the generation of variability and its context sensitivity.

Results

To investigate where the marked trial-by-trial neural variability of LMAN and its social context-dependence arise (Fig. 1B, C), we recorded and analyzed the singing-related activity of 28 neurons in the cortical input nucleus to the AFP, HVC, and of 70 neurons within the basal ganglia nucleus Area X (Fig. 1A) in male zebra finches within and across renditions of the bird’s ‘motif’ (Fig. 1B).

AFP-projecting HVC neurons

We first examined the activity of HVC neurons projecting to Area X (HVCX, n=28 single units in 6 birds, including 10 antidromically identified from Area X, Fig. 2 A–C, see Experimental Procedures). HVCX neurons exhibited low but measurable spontaneous activity (Table 1; Fig. 2E, bottom panel). During singing directed at a female, HVCX neurons increased their activity and displayed sparse firing ‘events’ (1–5/motif) that consisted of single spikes or bursts that were tightly locked to song (Fig. 2E; e.g., Kozhevnikov and Fee, 2007). For example, the cell in Fig. 2 shows a firing event at the start of syllable ‘b’. We recorded the activity of the same HVCX neurons during interleaved trials of undirected singing and found that neither the firing rate (p=0.69; Fig. 2E–G, Table 1) nor the percentage of spikes in bursts (p=0.62; Fig 2H, Table 1; see Experimental Procedures), differed between the two contexts. The lack of social modulation of HVCX neurons, which is also apparent in IEG expression (Jarvis et al., 1998), stands in striking contrast to LMAN.

Figure 2. Social context has no effect on the firing of individual HVCX neurons.

Figure 2

(A) Recordings from HVCX neurons (gray circle) (B) Amplitude oscillogram of a song (top) and raw trace of the associated neural activity. (C) Trace of the stimulus artifact (black arrow) and evoked spikes (gray arrow) resulting from antidromic stimulation of HVC from Area X. (D) Spectrogram of a representative song motif (‘abcdefg’). (E) Raster plots of a single HVCX neuron during ten motif renditions of directed (DS; top) and undirected (US; middle) song and during six non-singing epochs (SP; bottom). (F) Mean firing pattern during DS (gray line) and US (black line) singing and during spontaneous activity (SP; dashed line). There were no significant differences in firing rate (G), percent of spikes in bursts (H) or the CC (I) between DS and US for identified HVCX neurons (gray diamonds) or putative HVCX single units (black diamonds). For this and all subsequent figures, open diamonds indicate unpaired data and * indicates p<0.05.

Firing Rate
(Hz)
p value
D vs. UD
CC p value
D vs. UD
Reliability
(%)
p value
D vs. UD
Spikes in
Bursts (%)
p value
D vs. UD
Cross Trial
Spike Count CV
p value
D vs. UD
HVCX Directed 12.2 ± 1.2 0.28 0.97 ± 0.01 0.67 99.3 ± 0.62 0.66 77.2 ± 3.1 0.58 0.19 ± 0.02 0.12
Undirected 11.8 ± 1.1 0.95 ± 0.01 77.7 ± 2.7 0.22 ± 0.02
Spontaneous 2.67 ± 0.9
SN Directed 7.3 ± 0.8 0.05 0.79 ± 0.07 0.40 82.7 ± 4.41 0.14 47.0 ± 7.6 0.018 0.70 ± 0.03 0.09
Undirected 11.3 ± 2.0 0.74 ± 0.10 90.6 ± 3.24 67.4 ± 3.8 0.58 ± 0.05
Spontaneous 0.19 ± 0.2

Along with their stable firing rates, HVCX neurons had extremely precise and reliable firing in both social conditions (Fig. 2E–F). To quantify the reproducibility of the pattern of spikes across trials, we calculated the correlation coefficient (CC) between the instantaneous firing rates across the entire motif for all possible pairs of trials in each social condition (see Experimental Procedures; Kao et al., 2008). This measure was close to the maximum of 1.00 during both directed and undirected singing (Fig. 2I, Table 1). Moreover, when we focused specifically on the burst events (see Experimental Procedures), we found that for both directed and undirected singing, spikes occurred on every trial with almost 100% reliability (see Experimental Procedures; Table 1), and the number of spikes per event was highly consistent from trial-to-trial (cross trial spike count variance, Table 1).

Altogether, these data demonstrate marked stereotypy of HVCX neurons regardless of context. Thus, striatal projecting HVC neurons do not appear to contribute to either the variability in LMAN neuron firing or its modulation by social context.

Area X neurons

Given the equally precise and reliable firing of HVC neurons in both social contexts, we turned to the striatopallidal nucleus Area X to determine whether the increased variability and context dependence of LMAN activity emerge in this basal ganglia nucleus. We focused in particular on the input and output cells (Fig. 1A), the putative spiny neurons (SNs), which are a major target of input from HVC, and the globus pallidus internus (GPi)-like pallidal neurons, which receive input from HVC and SNs and other Area X neurons and project strongly to the thalamus (Farries et al., 2005; Goldberg et al., 2010; Leblois et al., 2009; Luo and Perkel, 1999).

In this study, we recorded from multiple cell types within Area X. Consistent with prior descriptions (Goldberg and Fee, 2010), we could clearly distinguish pallidal and striatal neurons based on their spontaneous firing rates (see Fig. S1 and Experimental Procedures). In addition, using a measure of spike count variance, the Fano factor, during undirected singing, we could further subdivide the striatal and pallidal neurons into additional classes, including discrete populations, comparable to SNs and to GPi neurons, that we focus on here (Goldberg et al., 2010; Goldberg and Fee 2010; see Experimental Procedures, Supplemental Text and Fig. S1).

Spiny neurons in Area X

Like SNs in mammals and birds (Farries and Perkel, 2002; Goldberg and Fee, 2010; Hikosaka et al., 1989; Kimura, 1990; Schmitzer-Torbert and Redish, 2008), our putative SNs all had extremely low spontaneous rates (Table 1; n=8 in 3 birds) and sparse firing ‘events’ during singing. These firing events, like those of HVCX neurons, consisted of single spikes or short bursts that were tightly locked to song and were precise from trial-to-trial (average 1.7 events/motif, range 1–4; Fig. 3). For example, the SN in Figure 3 had one firing event during syllable g, and rarely fired at other times during song.

Figure 3. Social context modulates the firing rate but not the precision of SNs.

Figure 3

(A) Recordings from spiny neurons (SNs; gray circles) in Area X. (B) Amplitude oscillogram of a song (top) and a raw trace of the associated single unit activity. (C) Spectrogram of a representative song motif (‘abcdefg’). (D) Raster plots of firing of a single SN during 20 motif renditions of directed (DS; top) and undirected (US; middle) song and during 10 non-singing epochs (bottom). (E) Mean firing pattern during DS (gray line) and US (black line) and during spontaneous activity (SP; dashed line). (F) Like HVCX neurons, SN firing was highly precise and equally stereotyped for both behavioral conditions, indicated by the high pairwise CCs that were not significantly different between social contexts. (G) HVCX and Area X SNs differed in the variability of their cross-trial firing rates, in particular the variance (CV) of the number of spikes per trial was significantly higher for SNs than HVCX neurons regardless of social context. (H) The average SN firing rate was slightly but significantly higher during US than during DS. (I) A larger percentage of spikes were produced in bursts than as single spikes during US compared to DS. See also Fig. S1.

When we compared the variability of SN spike timing between directed and undirected singing, we found that, like HVCX neurons, SN firing was equally well timed during both conditions. Specifically, neither the average trial-by-trial CC (p=0.41; Fig. 3F, Table 1) nor the temporal jitter of the first spike within a spike event (1.69 ± 0.27 ms vs. 1.75 ± 0.26 ms, p=0.59; see Experimental Procedures) was significantly different between directed and undirected song. Finally, there were no significant differences between directed and undirected song in the reliability of firing across trials (p=0.14; Table 1). These data indicate that the temporal precision of singing-related SN firing was not significantly affected by the social context in which song was produced.

While spiking timing variability and reliability of SNs were not modulated by social context, they differed from that of the HVCx inputs. First, the average trial-by-trial CC of SNs was significantly lower than that of HVCX neurons (p<0.0001, Table 1). Second, the number of spikes per event was significantly lower (p<0.0001) and more variable from trial-to-trial (p<0.0001, Fig. 3G) in SNs than in HVCX neurons. Third, SN firing was less reliable than that of HVCX neurons (p<0.0001, Table 1). Thus, although the song-locked firing of SNs is precisely timed in both social conditions, these neurons spike less and show a slight but significant decrease in their cross-trial consistency and reliability compared to their inputs from HVC. Similar cross-trial variance of spike count has been observed in SNs of young birds (Goldberg and Fee, 2010).

Finally, in contrast to HVCX neurons, we found that the average firing rates of SNs in Area X were modulated by social context. For the SN depicted in Figure 3, the firing rate during undirected song was double the firing rate during directed song (17.6 Hz vs. 8.7 Hz). Such differences were consistent across SNs, and, on average, the firing rate during undirected song was 62% higher than during directed song (p=0.05; Fig. 3H, Table 1). The increase in the undirected SN firing rate was due primarily to more spikes/burst within the same song-locked events as those during directed song, resulting in a significantly greater percentage of spikes occurring in bursts during undirected than directed singing (p=0.018; Fig. 3I, Table 1). This difference in the average firing rates of SNs across conditions provides the first neurophysiological evidence within the AFP of social context modulation. However, individual SNs appear to faithfully transmit the timing of the precise song-locked HVCX events, and SN cross-trial firing reliability although lower than that of HVCX neurons, is similar between social conditions. Thus, the considerable social modulation of spike timing variability in LMAN neurons is not present in the individual striatal neurons that receive a major input from the cortical nucleus HVC.

GPi output neurons in Area X

Because the dramatic trial-by-trial spike timing variability and marked sensitivity to social context of LMAN neurons were not evident in SNs, we investigated whether these phenomena were present in the putative GPi output neurons in Area X. We recorded the activity of 23 such GPi neurons (in 10 birds; Fig 4, Experimental Procedures and Fig. S1).

Figure 4. Social context modulates firing of GPi neurons in Area X.

Figure 4

(A) Recordings from pallidal projection neurons in Area X (P; gray circle). (B) Spectrogram of a representative song motif (‘abcdefgh’). (C) Raster plots of singing-related activity of a GPi neuron during 25 motif renditions each of directed song (DS; top) and undirected song (US; middle) and during spontaneous activity (SP; bottom). (D) The mean firing rate over time during US (black line) and DS (gray line) singing as well as during SP (dashed line). The pattern of activity during DS and US is highly similar, despite the difference in firing rate. (E) Average firing rate for DS (gray line) and US (black line) and SP (dashed line) prior to the start of the motif (vertical dashed line). Asterisks indicate the first introductory note for DS (gray) and US (black). During US, the firing rate increases earlier prior to the motif and to the introductory notes (Fig. S2) than during DS. (F) Group data showing the highly consistent firing rate increases between SP, DS and US (all groups are significantly different from each other). (G) Group data showing the time relative to the onset of the motif (time 0) when activity increased to 25, 50, and 75 percent of the peak firing rate. The US firing rate (black) reaches each of those benchmarks significantly earlier than does the DS firing rate (gray). See also Fig. S2.

In general, adult GPi neuron activity was strongly modulated during singing and showed a consistent pattern associated with the motif. Notably, the song-related activity consisted of a tonic increase in the neurons’ already high firing rates, punctuated by phasic, song-locked slowing of this firing. For example, the cell illustrated in Figure 4 exhibits consistent decreases in activity during syllables c and g. Moreover, the pattern of activity, when averaged across all trials, was highly similar regardless of the social context in which song was produced (e.g. Fig. 4D; Hessler and Doupe, 1999). To quantify the degree of similarity in the firing pattern of individual GPi neurons between contexts, we computed the correlation coefficient between the average activity pattern during directed and undirected singing. For the neuron depicted in Figure 4, this correlation coefficient was 0.68, and across all cells the average correlation coefficient was 0.70 ± 0.03. Consequently, as is the case for neurons in LMAN, on average GPi neurons in Area X produce a common signal during directed and undirected singing.

Despite similar average firing patterns between the two social contexts, there were striking differences in activity between directed and undirected singing. First, the firing rate was significantly affected by social context. For example, the activity of the GPi neuron in Figure 4 increased from a baseline rate of 205 Hz to 292 Hz during directed song and 437 Hz during undirected song. Across all GPi neurons, firing rates increased significantly during directed singing relative to spontaneous but showed even greater increases during undirected singing (Table 2; Fig. 4F). The context-dependent difference in firing rate was apparent even in the activity leading up to singing. While both directed and undirected GPi neuron activity increased from the same spontaneous baseline activity level, those increases in activity occurred significantly earlier prior to undirected singing than prior to directed singing (Figs. 4E,G, Fig. S2, Supplemental Text).

Firing Rate
(Hz)
p value
D vs. UD
CC p value
D vs. UD
Reliability
(%)
p value
D vs. UD
GPi spikes (intact) Directed 208.3 ± 11.0 <0.0001 0.49 ± 0.03 <0.0001
Undirected 337.2 ± 18.3 0.25 ± 0.02
Spontaneous 158.4 ± 7.4
GPi spikes (LMAN lesion) Directed 232.7 ± 12.2 0.0001 0.53 ± 0.05 0.0001
Undirected 305.2 ± 17.3 0.28 ± 0.03
Spontaneous 139.3 ± 7.7
GPi pauses (intact) Directed 0.24 ± 0.9 0.0001 40.1 ± 4.0 <0.0001
Undirected 7.3 ± 0.8 21.7 ± 3.3
GPi pauses (LMAN lesion) Directed 11.3 ± 2.0 0.0001 42.3 ± 5.4 0.0003
Undirected 0.19 ± 0.2 23.3 ± 4.1

In addition to the changes in firing rate, there was a marked increase in the trial-by-trial spike timing variability of GPi neurons over neurons earlier in the circuit, and this variability was socially modulated, with undirected singing showing a more variable pattern of activity across trials. In particular, across all cells, the average CC of spiking between all pairs of trials was significantly lower during undirected singing (0.25) than during directed singing (0.49; Table 2, p<0.0001; Fig. 5B).

Figure 5. Social context modulates pauses in GPi neuron firing.

Figure 5

For the population of GPi cells (A), the variability of the spike trains was consistently higher during undirected singing (US) than directed singing (DS), illustrated by the significantly lower CC for US (B). (C) Similarly, pause onset times across the motif were also more variable during US, as evidenced by the lower CC. (D) Within pause events, the reliability of pauses across trials was significantly lower during US. (E–F) The firing rate immediately preceding a pause was significantly higher during US than DS. (E) Depicted is the mean spike rate preceding a pause for a single neuron during US (black) and DS (gray). (F) For all but one cell, the activity immediately preceding a pause was significantly greater during US. (G) To measure the duration of ‘decelerations’ in the firing rate we smoothed each spike train (black curve; see Experimental Procedures) then thresholded the smoothed spike trains at 2 SD below the mean (gray horizontal line). Durations of these ‘decelerations’ were then calculated as the difference between onsets and offsets determined based on threshold crossings of the smoothed curve (gray vertical lines). (H) Group data showing that durations of decelerations were significantly longer during US than DS. See also Fig. S3.

The trial-by-trial spike timing variability and its change with context were particularly apparent in the pattern of breaks or pauses in firing, which differed markedly between directed and undirected singing. To analyze context-related differences in pauses we defined a pause as any ISI more than two standard deviations above the mean directed ISI, and a pause onset as the start time of each ISI above the threshold (see Experimental Procedures, Supplementary text, and Fig. S3 for discussion of pause definition). As was the case for spikes, the pattern of pauses was less consistent from trial-to-trial during undirected singing. To quantify the reproducibility of the pattern of pauses across the entire motif, we calculated the average CC specifically for the pause onsets as defined above (see Experimental Procedures; Fig. S3). For the cell in Fig. 4 the CC of pause onsets was low during directed singing (0.24), but decreased even further during undirected singing (0.05). Across all GPi neurons, the average CC of pause onsets was significantly lower during undirected than directed singing (Table 2; p=0.0001; Fig. 5C).

The difference in the trial-by-trial CC of pauses between directed and undirected song resulted not only from differences in pause timing but also from context-dependent differences in the reliability of pauses across trials. In general, the consistent pattern of decreases in activity associated with the song (e.g. Fig. 4D) was evident as increases or peaks in the average pattern of pause onsets (Fig. S3D). We called such increases in the average pause rate ‘pause events’ and investigated the reliability of pause onsets across trials within these singing-related pause events (see Experimental Procedures). We found that pause events were significantly less reliable from trial to trial during undirected singing (Table 2; p<0.0001; Fig. 5D). Thus, salient song-related pause events were not only variable in timing but occurred less reliably across trials during undirected than directed singing.

Previous work has demonstrated that pauses in GPi neuron activity allow DLM neurons to fire single spikes or bursts, and that the probability of spiking in DLM is influenced by changes in firing rate of GPi input neurons, with greater decreases in GPi neuron activity leading to more DLM spikes (Goldberg and Fee, 2012; Goldberg et al., 2012; Kojima and Doupe, 2009; Person and Perkel, 2005). To examine context-dependent differences in the firing rate preceding pauses, we used a pause-triggered average (Fig. 5E), and found that the mean firing rate of GPi neurons immediately preceding a pause was significantly higher during undirected than directed singing (379 ± 7 vs. 248 ± 5 spikes/sec; Fig. 5F; p<0.0001).

Spiking in DLM is also influenced by the duration of decreases in firing rate (‘decelerations’), with longer decelerations leading to more DLM spikes (Goldberg and Fee, 2012; Goldberg et al., 2012; Kojima and Doupe, 2009; Person and Perkel, 2005).

Thus, we also looked at whether there was a context-dependent difference in how long the firing rate remained low following the onset of deceleration. To investigate this, we smoothed each trial and measured the duration of epochs when the firing rate dropped at least 2 SD below the mean rate (see Experimental Procedures; Fig. 5G). We found that these periods of decreased firing were significantly longer during undirected song (14.9 ± 0.9 ms) than during directed song (9.5 ± 1.0 ms; Fig. 5H; p<0.0001).

Together, these data showing both greater and longer slowing of GPi neuron firing rates during undirected singing, suggest that undirected GPi activity could allow more thalamic spikes, including bursts, than during directed singing. Moreover, in combination with the marked spike timing variability that we observed during undirected singing, the contextual differences in pallidal firing could explain context-dependent differences in both the frequency and variability of bursts in LMAN.

Sources of variability

Activity of GPi neurons following LMAN lesions

The considerable increase in spike and pause timing variability of GPi neurons specifically during undirected singing, compared to SNs and HVC neurons earlier in the circuit, suggests that such variability and its context-dependence arise within Area X. Alternatively, because GPi neurons and SNs receive inputs not only from HVC but also recurrently from LMAN, they could arise first in LMAN, and be fed back to Area X. We therefore lesioned LMAN and assessed whether such lesions abolished the variability or context-dependence of Area X GPi neurons activity (Fig. 6; n =15 GPi neurons from 3 birds with lesions of over 75% of LMAN; see Experimental Procedures).

Figure 6. LMAN lesions do not eliminate context-dependent differences in variability.

Figure 6

(A) Recordings from GPi neurons (P; gray circle) in LMAN lesioned males. (B) Spectrogram of a representative song motif. (C) Raster plots of firing of a single GPi neuron during 25 motif renditions of directed (DS; top) and undirected song (US; middle) and during ten non-singing epochs (bottom). (D) Mean firing pattern during singing (DS in gray, US in black) and spontaneous activity (SP; dashed line). (E) As was the case for GPi neurons in unlesioned birds (gray dots and line in E–F are the means for unlesioned birds), the firing rate of GPi neurons in birds with LMAN lesions was significantly higher during US than DS. In addition, the variability was significantly higher during US than DS, indicated by the lower cross correlation of the spike trains (F) as well as of the pauses (G).

Firing rate modulations of GPi neurons in LMAN-lesioned birds were not different from those of control birds. These cells had high spontaneous firing rates and significantly increased their firing rates during both directed and undirected singing (Table 2; p=0.0001). As was the case in unlesioned birds, the firing rate was significantly higher during undirected than directed singing (p=0.0001; Fig 6E). Moreover, there were no significant differences in either the spontaneous, directed, or undirected firing rates between lesioned and unlesioned birds (p=0.44; Fig. 6E; Table 2).

In addition to showing normal social modulation of firing rate, GPi neurons in LMAN-lesioned males continued to exhibit significant spike timing variability, and social modulation of this variability. There were no significant effects of the LMAN lesions on any of our measures of variability during undirected singing (for CC of spikes p=0.27 between lesions and controls; for CC of pauses p=0.93; for reliability of pauses p=0.66). Moreover, activity was significantly more variable during undirected than directed singing, as evidenced by a lower trial-by-trial CC of spikes during undirected than directed singing (p=0.0001, Fig. 6F, Table 2) and pauses were more variable (p=0.0001; Fig 6G, Table 2) and less reliable across trials (p=0.0003; Table 2) during undirected than directed singing. These data indicate that LMAN inputs into Area X are not necessary for the variability of adult GPi neuron activity or its social context-dependence, and suggest that neural variability and its social modulation emerge within Area X.

Context-dependent SN correlations as a source of GPi neuron variability: hypothesis and computational model

Our data indicate that the spiking of SNs is precisely timed in both social contexts, with only a social modulation of the average firing rate. However, the output cells of this circuit, the GPi neurons, exhibit a marked social modulation not only of firing rate but also of spike timing variability, particularly in the pattern of pauses presumed to reflect SN inputs. How might such a transformation occur? Our results, along with existing data on striatum in both mammals and songbirds, suggest one simple hypothesis for how the spike timing variability of GPi neurons and its social regulation might arise within the Area X network.

This hypothesis builds on several observations. For one, GPi neurons in mammals are a site of major convergence of SN inputs (Bergman et al., 1998; Pecheron et al., 1984). Our neurophysiological recordings are consistent with this: the SNs we recorded had sparse firing events during each motif (range 1–4 events; e.g. Fig. 7B,F) while, in contrast, GPi neurons exhibited multiple pauses often distributed across the entire motif (range 1–14, average 4.85 pauses per motif; Fig. 7C,D,G,H). Thus, if SNs provide a major source of the inhibition generating pauses in GPi neurons, there must be convergence of multiple SNs onto a single GPi neuron (Fig. 7 illustrates this possibility with recordings of SNs and GPi cells from the same bird). Second, our own observations, as well as those in juvenile birds (Goldberg and Fee, 2010; Fig 3G), suggest that there is greater cross-trial variance of spike count in adult SNs than in their inputs from HVC, and that this does not differ between social contexts. Finally, existing data suggest that dopamine levels in Area X are lower during undirected than directed singing (Sasaki et al., 2006). In mammalian basal ganglia, low dopamine states are associated with increased cross-neuronal correlation and highly synchronized bursting in ensembles of striatal neurons (Bergman et al., 1998; Costa et al., 2006; Goldberg et al., 2004). This raises the possibility such an increase in correlation in lower dopamine states might occur in songbird basal ganglia as well.

Figure 7. Singing-related activity of a population of SNs and GPi neurons.

Figure 7

Sample plots of the song-aligned activity for a randomly selected subset of trials during directed (DS; A–D) and undirected (US; E–H) singing of four SNs and two GPi neurons recorded in a single male. (A, E) Representative spectrograms of DS (A) and US (E). (B, F) Raster plots of four SNs ordered based on the timing of their activity relative to the motif. Different neurons are indicated by different colors and separated by black lines. (C, G) Raster plots of two GPi neurons indicated by different colors and separated by black lines. (D, H) Average pause rate for the second (purple) GPi neuron, generated from 19 DS and 21 US trials. Note the larger number of singing-related events, distributed across the song, compared to SNs.

These observations suggest a simple model in which 1) the degree of correlation in firing rate across SNs can greatly influence the trial-by-trial variability of their downstream target neurons, and 2) this SN rate correlation varies between social contexts. This model is illustrated intuitively in Fig. 8A. This shows three SNs firing during one event locked to song, and the resulting inhibition of the GPi neuron onto which they converge, in two rate correlation conditions. In the upper panel of Fig. 8A, individual SNs are uncorrelated in the number of spikes they fire on each trial, so that the downstream neuron will effectively receive the average of all the input neurons’ variable firing rates on each trial, resulting in equal inhibition of intermediate intensity across trials. However, if changes in the SN network occur so that SNs co-vary in the number of spikes they produce on each trial (Fig. 8A, lower panel), the summed effect of these inputs on the downstream neuron’s firing will differ across trials. Thus co-variation in SN rate will propagate the individual cross-trial spike count variance forward, and result in downstream pauses that are highly variable in magnitude across trials. According to this model, undirected singing, and the associated decrease in dopamine levels, should result in greater cross-SN rate correlations, just as suggested in mammalian basal ganglia (Bergman et al., 1998; Costa et al., 2006), and this will create greater trial-by-trial pause variability in GPi neurons.

Figure 8. Increasing the rate correlation in SNs increases pause variability in GPi neurons.

Figure 8

(A) Illustration of how altering the degree of rate correlation among SNs could influence the output of GPi neurons. The top panel depicts the responses of three SNs (SN 1–3) and one GPi neuron to three different motif renditions (trials 1–3) when the rates of the SNs are uncorrelated. The SN responses are depicted as raster plots, the GPi neuron response is the inhibitory potential created by the SN inputs, with greater deflection from zero indicating a greater response. The bottom panel depicts the responses of these same neurons when the firing rates are correlated. Note that the GPi responses are predicted to be of similar size in the uncorrelated case, while the GPi responses in the correlated condition vary depending on the quantity of spikes in the SN burst. (B) Example of the output of the model. Top panel is a generic spectrogram. The activity of a hypothetical GPi neuron in response to uncorrelated (middle panel) and correlated (bottom panel) SN inputs is depicted as raster plots. The group data for 50 runs of the model shows that the CC of spikes (C) and pauses (D), like the data from GPi neurons, was higher for the uncorrelated, directed-like repetitions than for the correlated, undirected-like repetitions. See also Fig. S4.

To investigate more quantitatively whether altering the degree of rate correlation among a population of SNs like ours could result in differences in the variability and reliability of pauses in GPi neurons, we constructed a simple computational model (see Experimental Procedures, Fig. 8), incorporating 1) the observed cross trial variance in SN firing rate, 2) the observed directed-undirected differences in the number of spikes per burst of SNs, and 3) postulated differences in correlations between neurons, assuming higher correlations among SNs during the lower dopamine, undirected state. We found that just by manipulating the degree of rate correlation between SNs, it was possible to generate differences in the reliability of pauses across trials of a high firing neuron, thereby affecting the neuron’s overall variability in a manner similar to what we see in GPi neurons in Area X (Fig. 8B). For example, across 50 simulations, each involving 50 trials and1000 SNs synapsing onto a single GPi neuron, the average CCs of firing rate and pause onsets were both significantly higher and less variable when SN rates were uncorrelated than when they were correlated (Fig. 8C,D). Although the simplicity of our model creates spike trains with greater regularity than we observed, it nonetheless indicates that substantial differences in GPi neuron pause variability can arise from the very small differences in trial-by-trial firing rates seen in SNs, via the plausible mechanism of changing SN co-variation in firing rate, in a state-dependent manner.

Discussion

Neural variability and its context sensitivity have been well-described in LMAN, the output of the songbird cortical-basal ganglia circuit (Hessler and Doupe, 1999; Kao et al., 2008; Murugan et al., 2013). Here, by systematically recording neurons at earlier stages in the circuit, we traced the emergence of trial-by-trial variability and its social modulation. We found that individual HVC neurons, the cortico-striatal inputs to the circuit, were highly stereotyped in all respects in both social conditions, underscoring the generation of variability downstream of HVC. SNs, a major target of HVCX inputs in the striato-pallidum, also were not differentially affected in their spike timing and cross-trial reliability by social context. The first neurophysiological sign within the circuit of social modulation appeared in the average firing rates of SNs, which were lower during directed singing than undirected singing. However, by the output stage of the striato-pallidal circuit, in the GPi neurons, marked trial-by-trial variability in spike timing and its dependence on social context were both apparent. Variability in the timing of pauses of GPi neurons and their modulation by social context were maintained even after lesions of LMAN, indicating that these properties were not simply fed back into the basal ganglia from LMAN. Taken together, our findings show that variable singing-related firing patterns and their social modulation first emerge over multiple steps within the songbird basal ganglia, and highlight several critical loci and potential mechanisms in the generation and regulation of variability.

More recent data reveal that lesions of Area X reduce song plasticity (Ali et al., 2013; Kojima et al. 2013) but only temporarily reduce behaviorally-measured song variability (Goldberg and Fee, 2011; Ali et al., 2013). This would seem to support a model proposing LMAN as the initial source of neural and behavioral variability (Fee and Goldberg, 2011). However, our data here, which have the advantage of measuring pallidal neural activity directly rather than only behavior, reveal the striking persistence of neural rate and timing variability in GPi neurons after lesions of LMAN, and thus indicate that basal ganglia variability is not strictly dependent on the presence of LMAN. One of several possible explanations for the residual song variability after Area X lesions is that such lesions do not silence LMAN but create abnormal firing. Specifically, after Area X lesions, LMAN activity still increases during singing, but there is a marked loss of patterned neuronal bursting (Kojima et al., 2013) and associated egr-1 expression (Kubikova et al., 2009). While such activity appears insufficient for allowing plastic changes to song (Ali et al., 2013; Kojima et al., 2013), the presence of elevated and unpatterned activity in LMAN could provide an alternate source of variability that could lead to the residual variability in song.

In birds and mammals, SNs, and more specifically cortical-SN synapses, are well-described loci of neuromodulatory control (Ding and Perkel, 2002; Surmeier et al., 2007). Overall, such synapses are thought to be weak (Wilson, 1995), which could explain the greater trial-by-trial spike count variance in SNs compared to HVC neurons. Moreover, while HVCX neurons had similar firing rates and bursting in both social settings, SNs in Area X showed less bursting during directed than undirected singing. Such a diminished occurrence of SN bursting during directed singing is consistent with the dopaminergic gating of striatal excitability seen both in mammalian striatum (Nicola et al., 2000; Surmeier et al., 2007) and in songbirds (Ding and Perkel, 2002; LeBlois et al., 2010). In songbirds, more dopamine is released in Area X during directed than undirected singing (Sasaki et al., 2006), and the decreased bursting of SNs during directed song could result from a dopamine-dependent damping of the influence of HVCX inputs on SNs. Additionally, such a reduction in bursting is also consistent with studies of IEG expression, which have raised the possibility that norepinephrine also participates in the social modulation of gene expression (Castelino and Ball, 2005; Hara et al., 2009; Jarvis et al., 1998).

Despite the social modulation of SN rate, the precision and reliability of SN firing were unchanged between directed and undirected singing. SNs inputs are thought to give rise to pauses in the firing of GPi neurons downstream (Chevalier and Deniau, 1990; Farries et al., 2005; Luo and Perkel, 1999). However, the highly variable timing and reliability of pauses in GPi neurons during undirected singing indicates that their context-dependent variability is not simply inherited from individual SNs. One possible mechanism suggested by these results is that GPi variability arises from context-dependent differences in the network properties of Area X. In particular, in mammals, decreases in dopaminergic tone are associated with increases in synchronized bursting and firing rate co-variation in the cortex and basal ganglia (Goldberg et al., 2004; Heimer et al., 2002; Costa et al., 2006). Using a simple computational model incorporating our observations about individual SNs, we compared the effects of altering the degree of rate correlation among multiple SN inputs to the same GPi neuron. We found that increasing the degree of SN rate correlation (as might occur during a low dopamine state) resulted in significant decreases in the reliability of pauses across trials in this downstream neuron, in a manner similar to what we observed in actual GPi neurons in Area X during undirected singing (Fig. 8). These data suggest that if there are context dependent differences in network correlations, even the small cross-trial changes in firing rates that we observed in SNs could lead to large context-dependent differences in the trial-by-trial variability of individual downstream pallidal neurons. While our model is not the only possible explanation for how changes in variability could be generated within Area X (for example, the role of other interneurons in this circuit remains unclear; Gittis et al., 2011), it highlights the need for future studies to investigate whether social context and dopamine release modulate the degree of rate correlation among SNs, and how such modulations influence the precision of song.

In addition to modulating the trial-by-trial variability of GPi neuron pauses, social context also influenced the average singing-related firing rate of GPi neurons. There are several possible mechanisms for the larger increase in GPi firing rate during undirected versus directed song. Other than SNs, known inputs to GPi neurons include the glutamatergic HVCX neurons (Farries et al., 2005; Fig. 1A), which are thought to provide the functional equivalent of a ‘hyperdirect’ pathway. Such excitatory HVC inputs have been shown to increase pallidal neuron firing in vitro and in anesthetized birds, and this increase is damped by dopamine (LeBlois et al., 2009, 2010). Thus, increased dopamine during directed singing could decrease the effectiveness of synaptic inputs to pallidal neurons from HVC neurons. Dopamine can also directly decrease pallidal neuron firing rates (LeBlois et al., 2010), and could have additional effects on SN synapses and other inputs to GPi neurons. Thus, the contextual differences in Area X GPi neuron firing rate may have multiple, independent sources.

Regardless of the sources of Area X pallidal neuron firing and its modulation by context, differences in the magnitude and duration of pallidal decelerations between directed and undirected singing are particularly well-suited to drive strong context-dependent differences at pallido-thalamic synapses. Neurons in the thalamic nucleus DLM are tonically inhibited by GPi projections, through GABAergic calyceal synapses (Luo and Perkel, 1999), and fire single spikes or bursts during pauses or decelerations in GPi activity. Moreover, greater and longer decelerations of GPi neuron activity lead to more DLM spikes (Person and Perkel, 2005, 2007; Kojima and Doupe, 2009; Goldberg and Fee, 2012). Our findings of social context-dependent differences in GPi firing rates preceding and during pauses predict that DLM neurons should exhibit marked social differences in firing. Indeed, the socially modulated switch from isolated spike to bursting modes of firing in LMAN could first emerge in the thalamus, which is well-known to switch between bursting and non-bursting states across other contexts, such as sleep and wake (Fanselow et al., 2001; McCormick and Bal, 1997).

Reinforcement learning is critically dependent on the basal ganglia and behavioral variability (Bar Gad and Bergman, 2001; Doya, 2002; Doya and Sejnowski, 1998, 2000; Graybiel et al., 1994; Sutton and Barto, 1998). Studies in songbirds have suggested that basal ganglia circuits contribute to reinforcement learning by providing a source of behavioral variability for trial-and-error learning (Andalman and Fee, 2009; Charlesworth et al., 2011; Hampton et al., 2009; Kao et al., 2005, 2008; Kao and Brainard, 2006; Olveczky et al., 2005, 2011; Tumer and Brainard, 2007; Warren et al., 2011). By systematically recording at multiple stages in the basal ganglia circuit, we demonstrate that neural variability is introduced and amplified as activity progresses through the striato-pallidal network, and acquires its context-dependence there as well. Our findings indicate that the striato-pallidal circuit is integral for generating and modulating variable firing patterns downstream that normally promote behavioral variation and plasticity.

Experimental Procedures

Animals

Adult (>100 days old, n=22) male zebra finches (Taeniopygia guttata) from our colony were used. A few days prior to surgery males were isolated and housed in a small cage inside a sound attenuation chamber (soundbox; Acoustic Systems, Austin, TX) where they received seed, water, and grit ad libitum. Procedures adhered to UCSF IACUC approved protocols and NIH guidelines for the care and use of animals.

Surgery

Surgery for the implantation of a lightweight microdrive carrying electrodes was performed as previously described in Kao et al. (2008). Tungsten electrodes (2–3) with impedances of 3–5 MOhms (Microprobe, Gaithersburg, MD) were stereotaxically targeted to either Area X or HVC. For HVC recordings, two birds had custom-made bipolar stimulating electrodes implanted into Area X that were used for antidromic identification of HVC neurons projecting to Area X. For LMAN lesions, a unilateral electrolytic lesion of LMAN was made using a low impedance electrode (Kao and Brainard, 2006; Hampton et al., 2009). A microdrive carrying recording electrodes was then implanted ipsilateral to the lesion into Area X during the same surgery (n=2). For a third bird, a bilateral lesion of LMAN was made, and electrodes were implanted into Area X following recovery. The effectiveness of all lesions was histologically verified (see below).

Physiological recordings

Sound, spontaneous activity (both alone and in the presence of a female), and singing-related activity were recorded using custom written LabVIEW software [A. Leonardo (California Institute of Technology, Pasadena, CA) and C. Roddey (University of California, San Francisco)]. Neural signals were digitized, amplified (X1000), filtered (0.3 to 10 kHz) and recorded. Sounds were digitized, bandpass filtered (0.3 to 10 kHz; Krohn-Hite, Brockton, MA) and recorded. Behavior was monitored through a video camera inside the soundbox. Activity during directed singing was recorded when males performed courtship song for females (see Kao et al., 2008; Woolley and Doupe, 2008).

Upon completion of individual experiments, small electrolytic lesions were made at different depths along the recording track. All recorded sites were verified to lie within Area X or HVC based on their position relative to marker lesions on Nissl stained sections. For birds with LMAN lesions we also performed immunohistochemistry for calcitonin gene related peptide (CGRP), which reliably labels the boundaries of LMAN, and quantified the volume of LMAN remaining using ImageJ (Rasband, W.S., ImageJ, NIH, Bethesda, Maryland, USA, http://imagej.nih.gov/ij/). The average lesion size across all birds was 84% (range 76–88%; Supplemental Text).

Analysis

Spike Sorting and Cell-type Classification

Neural activity was analyzed off-line using software written in Matlab (Mathworks, Natick, MA). Units in HVC (10 antidromically identified and 18 putatively identified single HVCX units; Table 1) were sorted off-line using custom written Matlab programs and Klustakwik (Ken Harris). For HVC units, we assumed a premotor latency of 40 ms.

To identify putative HVCX neurons, we used the spontaneous firing rate as well as the number of song-locked firing events. Putative HVCx neurons had sparse song-locked firing events (1–5 per motif), distinguishing them from interneurons which fire throughout song (Kozhevnikov and Fee, 2007; Prather et al., 2009; Sakata and Brainard, 2008), and spontaneous firing rates that were higher than the extremely low rates of HVC neurons projecting to RA (HVCRA; Table 1; Hahnloser et al, 2002; Prather et al., 2009).

For Area X recordings, large single units were discriminated using a Bayesian spike-sorting program (Kao et al., 2008). Isolation of single units was based on waveform shape and the presence of a refractory period in the ISI histogram; examples of typical waveforms and ISI distributions are shown in Figure S2. For all neurons, fewer than 1% of ISIs were less than 1.0 ms.

Using a Ward’s hierarchical cluster analysis of spontaneous firing rates (JMP v.8.0.1; SAS institute, Cary, NC) we found that the Area X cells we recorded fell into 3 discrete groups: one with almost no spontaneous activity (mean 0.028 Hz, range 0–1.6 Hz), one with medium spontaneous firing rates (mean 11.3 Hz, range 5–18 Hz), and one with high spontaneous firing rates (mean 146.2 Hz, range 75–204 Hz). The Fano factor (F) during undirected activity further distinguished these groups (Supplemental Text, Fig. S1). For example, cells with medium firing rates had highly variable firing during undirected singing, indicated by a higher F value, further distinguishing them from the regular and precise singing-related firing of cells with very low spontaneous firing (HVCRA neurons). The F also subdivided the neurons with high firing rates into a more variable (higher F) population, with long irregular singing-related bursts similar to avian and mammalian GPe neurons (Elias et al., 2007; Goldberg et al., 2010) and a second less variable population that displayed greater increases firing during singing and very brief singing-related pauses, similar to avian and mammalian GPi neurons (Goldberg et al., 2010; Goldberg and Fee, 2012; see Supplemental Text, Fig. S1 for more details).

Time-Warping of Spike Trains

In order to compensate for differences in syllable and interval timing between individual renditions of the bird’s song, we performed a piecewise linear time warp of each motif (see Kao et al., 2008) to linearly stretch or compress each syllable and interval to match the corresponding duration of a reference motif (Supplemental Text). All analyses were conducted on warped spike trains.

Analysis of Bursts

For SNs and HVC projection neurons, we identified bursts as groups of spikes with ISIs less than 10 ms (instantaneous firing rates of greater than 100 Hz). We then calculated the percentage of spikes occurring in bursts. Our results were not significantly different when we defined bursts using ISIs of 5 ms.

Measures of Variability - Spike Trains

Pairwise correlation coefficients (CC)

We quantified trial-by-trial variability across the entire motif by computing the CC between the instantaneous firing rates for all pairs of trials (Kao et al., 2008). Briefly, for each trial we smoothed the spike train with a Gaussian filter (standard deviation 10 ms) then subtracted the time-averaged firing rate for that trial to estimate the instantaneous firing rate for that trial. We then computed the average CC between the instantaneous firing rates for all pairs of trials within a social condition.

Cross trial reliability and spike count variance

For SNs and HVCX neurons we also measured the cross-trial reliability of spiking during a particular ‘event’. Events were defined by taking a threshold of the time-averaged firing rate and defining a window based on the threshold crossings. Event reliability was calculated as the percentage of trials in which a spike occurred during the event window. The spike count variance was calculated as the coefficient of variation (CV) of the number of spikes across all trials within an event.

RMS Jitter

For SNs we also measured the root mean square (RMS) of spike times across trials for spikes occurring during an event. The cross trial RMS jitter was calculated for the first spike time within the event for each trial. The average RMS jitter for all events was then calculated for each cell in each behavioral condition.

Analysis of Pauses in GPi Neuron Activity

GPi neurons were characterized by brief breaks or pauses in activity associated with particular elements of song. For each cell, we defined a pause as an ISI greater than two standard deviations above the mean ISI during directed singing, and a pause onset as the start time of each ISI above the threshold (see also Supplemental Text, Fig. S4). Once we identified pause onsets, we looked at the depth of the pause by computing a pause-triggered average. For all trials, we aligned the pause onsets and measured the firing rate occurring during a 50 ms window immediately preceding the pause (where the firing rate was zero).

Measures of Variability – Pauses

We analyzed the variability of pauses by treating the pause onsets like spike trains and then performing the same analyses of variability (e.g. the CC and reliability) described above for spikes (see also Supplemental Text).

Duration of Decelerations

In addition to pauses, we also examined epochs during which GPi firing rates slowed significantly but did not cross an absolute ISI threshold. Often, such epochs appeared as long pauses interrupted by a few (1–3) spikes (e.g. Fig. 5G). We identified such ‘decelerations’ by smoothing the spike trains associated with each motif rendition (20 ms Hanning window). The deceleration onsets and offsets were then defined as the times when the smoothed spike train fell below a threshold set to the mean smoothed firing rate - 2 SD of the smoothed firing rate. The durations of decelerations were calculated as the difference between offsets and onsets for each deceleration.

Computational Modeling

The computational model (Matlab; code available on request) consists of an input layer of 1000 neurons, corresponding to the SNs, that all project to and inhibit a single output neuron, corresponding to a GPi neuron. To simulate the response of a population of SNs during one motif we divided the input layer into five groups of cells, with each group responding at a different time in the motif. Each SN spike contributed an exponentially decaying (τ=9.5 ms) post-synaptic inhibition to the GPi cell. To set the effectiveness of the inhibition on the postsynaptic firing rate, the magnitude of the synaptic inhibition was scaled by a gain factor that was the same for all synapses. We then calculated the instantaneous firing rate of the output cell as the singing-related rate from the data (200 Hz and 300 Hz for directed and undirected song, respectively), minus the sum of the inhibition from all the input cells at each time point. To generate a spike train from the instantaneous rate of the GPi neuron, we randomly generated spikes in a 1 ms bin resolution.

We modeled the effects of SN rate correlations or lack thereof, on pallidal output as follows. For each trial, the number of spikes in SNs was drawn from a Poisson distribution based on the actual data (rate = 0.7 spikes/event for directed, 1.1 for undirected). In the directed song condition, the number of SN spikes was drawn independently for each SN in a group, for each of the 5 groups. In the undirected condition, all the cells in a group had the same number of spikes during a particular song event. Then, during both directed and undirected song, the number of spikes in each group varied across repetitions (trials) of the simulation. We analyzed the differences in variability between conditions using the trial-by-trial CC of spikes and pause onsets, as for the real data (Fig. 8C, D show the CC resulting from 50 runs of the model). For each run, the timing of firing events for neurons in each of the 5 groups was randomly seeded and the inhibitory gain was set to 2500. We also assessed the effect of synaptic inhibitory gains ranging from 1–10,000. At all gains besides very low gains that did not drive pauses consistently even in the directed condition, the CC’s of spike rates and pauses were higher for uncorrelated (directed) than correlated (undirected; Fig. S4A,B) input.

The variability results were qualitatively the same regardless of whether average GPi neuron firing rates during singing were set to exhibit the context-dependent difference in rates seen in our data, as described above, or set to be equal (Fig. S4A,C).

Statistics

Statistics were performed using JMP software (JMP 8.0.1 for Mac, SAS Institute). For comparisons between nuclei (e.g. HVC and Area X) we performed repeated measures ANOVAs (rmANOVAs) with social context (directed vs. undirected) as the repeated measure. Similarly, we used rmANOVAs with social context (directed vs. undirected) as the repeated measure to look at the effects of lesion (e.g. lesioned vs. unlesioned). Unless otherwise noted, we used an alpha value of p < 0.05. All other comparisons between directed and undirected singing were done using paired t-tests.

Supplementary Material

01

Acknowledgements

We thank Michael Brainard, Mimi Kao, Jon Sakata, Satoshi Kojima and anonymous reviewers for helpful comments. This research was supported by grants from the National Institutes of Health (MH68114 to S.C.W., MH55987 and MH078824 to A.J.D.), the National Alliance for Research on Schizophrenia and Depression (A.J.D.) and the Human Frontiers Science Program (R.R.).

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Ali F, Otchy TM, Pehlevan C, Fantana AL, Burak Y, Olveczky BP. The Basal Ganglia is necessary for learning spectral, but not temporal, features of birdsong. Neuron. 2013;80:494–506. doi: 10.1016/j.neuron.2013.07.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Andalman AS, Fee MS. A basal ganglia-forebrain circuit in the songbird biases motor output to avoid vocal errors. Proc. Natl. Acad. Sci. U.S.A. 2009;106:12518–12523. doi: 10.1073/pnas.0903214106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bar Gad I, Bergman H. Stepping out of the box: information processing in the neural networks of the basal ganglia. Curr. Opin. Neurobiol. 2001;11:689–695. doi: 10.1016/s0959-4388(01)00270-7. [DOI] [PubMed] [Google Scholar]
  4. Bergman H, Feingold A, Nini A, Raz A, Slovin H, Abeles M, Vaadia E. Physiological aspects of information processing in the basal ganglia of normal and parkinsonian primates. TINS. 1998;21:32–38. doi: 10.1016/s0166-2236(97)01151-x. [DOI] [PubMed] [Google Scholar]
  5. Bottjer SW, Miesner EA, Arnold AP. Forebrain lesions disrupt development but not maintenance of song in passerine birds. Science. 1984;224:901–903. doi: 10.1126/science.6719123. [DOI] [PubMed] [Google Scholar]
  6. Castelino CB, Ball GF. A role for norepinephrine in the regulation of context-dependent ZENK expression in male zebra finches (Taeniopygia guttata) Eur. J. Neurosci. 2005;21:1962–1972. doi: 10.1111/j.1460-9568.2005.04028.x. [DOI] [PubMed] [Google Scholar]
  7. Charlesworth JD, Tumer EC, Warren TL, Brainard MS. Learning the microstructure of successful behavior. Nat. Neurosci. 2011;14:373–380. doi: 10.1038/nn.2748. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chevalier G, Deniau JM. Disinhibition as a basic process in the expression of striatal functions. TINS. 1990;13:277–280. doi: 10.1016/0166-2236(90)90109-n. [DOI] [PubMed] [Google Scholar]
  9. Costa RM, Lin SC, Sotnikova TD, Cyr M, Gainetdinov RR, Caron MG, Nicolelis MA. Rapid alterations in corticostriatal ensemble coordination during acute dopamine-dependent motor dysfunction. Neuron. 2006;52:359–369. doi: 10.1016/j.neuron.2006.07.030. [DOI] [PubMed] [Google Scholar]
  10. DeAngelis GC, Newsome WT. Organization of disparity-selective neurons in macaque area MT. J. Neurosci. 1999;19:1398–1415. doi: 10.1523/JNEUROSCI.19-04-01398.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. DeLong MR. Activity of pallidal neurons during movement. J. Neurophysiol. 1971;34:414–427. doi: 10.1152/jn.1971.34.3.414. [DOI] [PubMed] [Google Scholar]
  12. Ding L, Perkel DJ. Dopamine modulates excitability of spiny neurons in the avian basal ganglia. J. Neurosci. 2002;22:5210–5218. doi: 10.1523/JNEUROSCI.22-12-05210.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Doya K. Metalearning and neuromodulation. Neural Netw. 2002;15:495–506. doi: 10.1016/s0893-6080(02)00044-8. [DOI] [PubMed] [Google Scholar]
  14. Doya K, Sejnowski TJ. A computational model of birdsong learning by auditory experience and auditory feedback. In: Poon PWF, Brugge JF, editors. Central auditory processing and neural modeling. New York, NY: Plenum; 1998. pp. 77–88. [Google Scholar]
  15. Elias S, Joshua M, Goldberg JA, Heimer G, Arkadir D, Morris G, Bergman H. Statistical properties of pauses of the high-frequency discharge neurons in the external segment of the globus pallidus. J Neurosci. 2007;27:2525–2538. doi: 10.1523/JNEUROSCI.4156-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Fanselow EE, Sameshima K, Baccala LA, Nicolelis MA. Thalamic bursting in rats during different awake behavioral states. PNAS. 2001;98:15330–15335. doi: 10.1073/pnas.261273898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Farries MA, Perkel DJ. A telencehpalic nucleus essential for song learning contains neurons with physiological characteristics of both striatum and globus pallidus. J. Neurosci. 2002;22:3773–3787. doi: 10.1523/JNEUROSCI.22-09-03776.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Farries MA, Ding L, Perkel DJ. Evidence for “direct” and “indirect” pathways through the song system basal ganglia. J. Comp. Neurol. 2005;484:93–104. doi: 10.1002/cne.20464. [DOI] [PubMed] [Google Scholar]
  19. Fee MS, Goldberg JH. A hypothesis for basal ganglia-dependent reinforcement learning in the songbird. Neurosci. 2011;198:152–170. doi: 10.1016/j.neuroscience.2011.09.069. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Gdowski MJ, Miller LE, Parrish T, Nenonene EK, Houk JC. Context dependency in the globus pallidus internal segment during targeted arm movements. J. Neurophysiol. 2001;85:998–1004. doi: 10.1152/jn.2001.85.2.998. [DOI] [PubMed] [Google Scholar]
  21. Gittis AH, Hang GB, LaDow ES, Shoenfeld LR, Atallah BV, Finkbeiner S, Kreitzer AC. Rapid target-specific remodeling of fast-spiking inhibitory circuits after loss of dopamine. Neuron. 2011;71:858–868. doi: 10.1016/j.neuron.2011.06.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Goldberg JH, Adler A, Bergman H, Fee MS. Singing-related neural activity distinguishes two putative pallidal cell types in the songbird basal ganglia: comparison to the primate internal and external pallidal segments. J Neurosci. 2010;30:7088–7098. doi: 10.1523/JNEUROSCI.0168-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Goldberg JH, Farries MA, Fee MS. Integration of cortical and pallidal inputs in the basal ganglia-recipient thalamus of singing birds. J. Neurophys. 2012;108:1403–1429. doi: 10.1152/jn.00056.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Goldberg JH, Fee MS. Singing-related neural activity distinguishes four classes of putative striatal neurons in the songbird basal ganglia. J Neurophysiol. 2010;103:2002–2014. doi: 10.1152/jn.01038.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Goldberg JH, Fee MS. Vocal babbling in songbirds requires the basal ganglia-recipient motor thalamus but not the basal ganglia. J Neurophysiol. 2011;105:2729–2739. doi: 10.1152/jn.00823.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Goldberg JH, Fee MS. A cortical motor nucleus drives the basal ganglia-recipient thalamus in singing birds. Nat Neurosci. 2012;15:620–627. doi: 10.1038/nn.3047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Goldberg JA, Rokni U, Boraud T, Vaadia E, Bergman H. Spike synchronization in the cortex/basal-ganglia networks of Parkinsonian primates reflects global dynamics of the local field potentials. J Neurosci. 2004;24:6003–6010. doi: 10.1523/JNEUROSCI.4848-03.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Graybiel AM. Habits, rituals, and the evaluative brain. Annu. Rev. Neurosci. 2008;31:359–387. doi: 10.1146/annurev.neuro.29.051605.112851. [DOI] [PubMed] [Google Scholar]
  29. Graybiel AM, Aosaki T, Flaherty AW, Kimura M. The basal ganglia and adaptive motor control. Science. 1994;265:1826–1831. doi: 10.1126/science.8091209. [DOI] [PubMed] [Google Scholar]
  30. Grillner S, Hellgren J, Ménard A, Saitoh K, Wikström MA. Mechanisms for selection of basic motor programs--roles for the striatum and pallidum. TINS. 2005;28:364–370. doi: 10.1016/j.tins.2005.05.004. [DOI] [PubMed] [Google Scholar]
  31. Hampton CM, Sakata JT, Brainard MS. An avian basal ganglia-forebrain circuit contributes differentially to syllable versus sequence variability of adult Bengalese finch song. J. Neurophysiol. 2009;101:3235–3245. doi: 10.1152/jn.91089.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Hahnloser RH, Kozhevnikov AA, Fee MS. An ultra-sparse code underlies the generation of neural sequences in a songbird. Nature. 2002;419:65–70. doi: 10.1038/nature00974. [DOI] [PubMed] [Google Scholar]
  33. Handel A, Glimcher PW. Contextual modulation of substantia nigra pars reticulata neurons. J. Neurophysiol. 2000;83:3042–3048. doi: 10.1152/jn.2000.83.5.3042. [DOI] [PubMed] [Google Scholar]
  34. Hessler NA, Doupe AJ. Social context modulates singing-related neural activity in the songbird forebrain. Nat Neurosci. 1999;2:209–211. doi: 10.1038/6306. 1999b. [DOI] [PubMed] [Google Scholar]
  35. Hikosaka O, Sakamoto M, Usui S. Functional properties of monkey caudate neurons. I. Activities related to saccadic eye movements. J. Neurophysiol. 1989;61:780–798. doi: 10.1152/jn.1989.61.4.780. [DOI] [PubMed] [Google Scholar]
  36. Jarvis ED, Scharff C, Grossman MR, Ramos JA, Nottebohm F. For whom the bird sings: context-dependent gene expression. Neuron. 1998;21:775–788. doi: 10.1016/s0896-6273(00)80594-2. [DOI] [PubMed] [Google Scholar]
  37. Kao MH, Brainard MS. Lesions of an avian basal ganglia circuit prevent context-dependent changes to song variability. J Neurophysiol. 2006;96:1441–1455. doi: 10.1152/jn.01138.2005. [DOI] [PubMed] [Google Scholar]
  38. Kao MH, Doupe AJ, Brainard MS. Contributions of an avian basal ganglia-forebrain circuit to real-time modulation of song. Nature. 2005;433:638–643. doi: 10.1038/nature03127. [DOI] [PubMed] [Google Scholar]
  39. Kao MH, Wright BD, Doupe AJ. Neurons in a forebrain nucleus required for vocal plasticity rapidly switch between precise firing and variable bursting depending on social context. J. Neurosci. 2008;28:13232–13247. doi: 10.1523/JNEUROSCI.2250-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Kawagoe R, Takikawa Y, Hikosaka O. Expectation of reward modulates cognitive signals in the basal ganglia. Nat. Neurosci. 1998;1:411–416. doi: 10.1038/1625. [DOI] [PubMed] [Google Scholar]
  41. Kimura M. Behaviorally contingent property of movement related activity in the primate putamen. J. Neurophysiol. 1990;63:1277–1297. doi: 10.1152/jn.1990.63.6.1277. [DOI] [PubMed] [Google Scholar]
  42. Kojima S, Doupe AJ. Activity propagation in an avian basal ganglia-thalamocortical circuit essential for vocal learning. J. Neurosci. 2009;29:4782–4793. doi: 10.1523/JNEUROSCI.4903-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Kojima S, Doupe AJ. Social performance reveals unexpected vocal competency in young songbirds. PNAS. 2011;108:1687–1692. doi: 10.1073/pnas.1010502108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Kojima S, Kao MH, Doupe AJ. Task-related “cortical” bursting depends critically on basal ganglia input and is linked to vocal plasticity. PNAS. 2013;110:4756–4761. doi: 10.1073/pnas.1216308110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Kozhevnikov AA, Fee MS. Singing-related activity of identified HVC neurons in the zebra finch. J. Neurophysiol. 2007;97:4271–4283. doi: 10.1152/jn.00952.2006. [DOI] [PubMed] [Google Scholar]
  46. Kubikova L, Turner EA, Jarvis ED. The pallial basal ganglia pathway modulates the behaviorally driven gene expression of the motor pathway. Eur J Neurosci. 2007;25:2145–2160. doi: 10.1111/j.1460-9568.2007.05368.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Leblois A, Bodor AL, Person AL, Perkel DJ. Millisecond timescale disinhibition mediates fast information transmission through an avian basal ganglia loop. J Neurosci. 2009;29:15420–15433. doi: 10.1523/JNEUROSCI.3060-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Leblois A, Wendel BJ, Perkel DJ. Striatal dopamine modulates basal ganglia output and regulates social context-dependent behavioral variability through D1 receptors. J Neurosci. 2010;30:5730–5743. doi: 10.1523/JNEUROSCI.5974-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Luo M, Perkel DJ. A GABAergic, strongly inhibitory projection to a thalamic nucleus in the zebra finch song system. J. Neurosci. 1999;19:6700–6711. doi: 10.1523/JNEUROSCI.19-15-06700.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. McCormick DA, Bal T. Sleep and arousal: thalamocortical mechanisms. Ann. Rev. Neurosci. 1997;20:185–215. doi: 10.1146/annurev.neuro.20.1.185. [DOI] [PubMed] [Google Scholar]
  51. Mooney R. Neurobiology of song learning. Curr Opin Neurobiol. 2009;19:654–660. doi: 10.1016/j.conb.2009.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Murugan M, Harward S, Scharff C, Mooney R. Diminished FoxP2 levels affect dopaminergic modulation of corticostriatal signaling important to song variability. Neuron. 2013 doi: 10.1016/j.neuron.2013.09.021. In Press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Nicola SM, Surmeier J, Malenka RC. Dopaminergic modulation of neuronal excitability in the striatum and nucleus accumbens. Annu. Rev. Neurosci. 2000;23:185–215. doi: 10.1146/annurev.neuro.23.1.185. [DOI] [PubMed] [Google Scholar]
  54. Ölveczky BP, Andalman AS, Fee MS. Vocal experimentation in the juvenile songbird requires a basal ganglia circuit. PLoS Biol. 2005;3:e153. doi: 10.1371/journal.pbio.0030153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Ölveczky BP, Otchy TM, Goldberg JH, Aronov D, Fee MS. Changes in the neural control of a complex motor sequence during learning. J Neurophysiol. 2011;106:386–397. doi: 10.1152/jn.00018.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Pecheron G, Yelnik J, François C. A Golgi analysis of the primate globus pallidus. III. Spatial organization of the striato-pallidal complex. J Comp Neurol. 1984;227:214–227. doi: 10.1002/cne.902270207. [DOI] [PubMed] [Google Scholar]
  57. Person AL, Perkel DJ. Unitary IPSPs drive precise thalamic spiking in a circuit require for learning. Neuron. 2005;46:129–140. doi: 10.1016/j.neuron.2004.12.057. [DOI] [PubMed] [Google Scholar]
  58. Prather JF, Nowicki S, Anderson RC, Peters S, Mooney R. Neural correlates of categorical perception in learned vocal communication. Nat. Neurosci. 2009;12:221–228. doi: 10.1038/nn.2246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Sakata JT, Brainard MS. Online contributions of auditory feedback to neural activity in avian song control circuitry. J Neurosci. 2008;28:11378–11390. doi: 10.1523/JNEUROSCI.3254-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Sakata JT, Brainard MS. Social Context Rapidly Modulates the Influence of Auditory Feedback on Avian Vocal Motor Control. J. Neurophysiol. 2009;102:2485–2497. doi: 10.1152/jn.00340.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Sasaki A, Sotnikova TD, Gainetdinov RR, Jarvis ED. Social context-dependent singing regulated dopamine. J. Neurosci. 2006;30:9010–9014. doi: 10.1523/JNEUROSCI.1335-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Scharff C, Nottebohm F. A comparative study of the behavioral deficits following lesions of various parts of the zebra finch song system: implications for vocal learning. J Neurosci. 1991;11:2896–2913. doi: 10.1523/JNEUROSCI.11-09-02896.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Schmitzer-Torbert NC, Redish AD. Task-dependent encoding of space and events is dependent on neural subtype. Neurosci. 2008;153:349–360. doi: 10.1016/j.neuroscience.2008.01.081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Surmeier DJ, Ding J, Day M, Wang Z, Shen W. D1 and D2 dopamine-receptor modulation of striatal glutamatergic signaling in striatal medium spiny neurons. Trends Neurosci. 2007;30:228–335. doi: 10.1016/j.tins.2007.03.008. [DOI] [PubMed] [Google Scholar]
  65. Sutton RS, Barto AG. Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press; 1998. [Google Scholar]
  66. Tumer EC, Brainard MS. Performance variability enables adaptive plasticity of ‘crystallized’ adult birdsong. Nature. 2007;450:1240–1244. doi: 10.1038/nature06390. [DOI] [PubMed] [Google Scholar]
  67. Warren TL, Tumer EC, Charlesworth JD, Brainard MS. Mechanisms and time course of vocal learning and consolidation in the adult songbird. J Neurophysiol. 2011;106:1806–1821. doi: 10.1152/jn.00311.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Woolley SC, Doupe AJ. Social context-induced song variation affects female behavior and gene expression. PLoS Biol. 2008;6:e62. doi: 10.1371/journal.pbio.0060062. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

01

RESOURCES