Abstract
Vocal learning in songbirds, like speech acquisition in humans, entails a period of sensorimotor integration during which vocalizations are evaluated via auditory feedback and progressively refined to achieve an imitation of memorized vocal sounds. This process requires the brain to compare feedback of current vocal behavior to a memory of target vocal sounds. We report the discovery of two distinct populations of neurons in a cortico-basal ganglia circuit of juvenile songbirds (zebra finches, Taeniopygia guttata) during vocal learning: (1) one in which neurons are selectively tuned to memorized sounds and (2) another in which neurons are selectively tuned to self-produced vocalizations. These results suggest that neurons tuned to learned vocal sounds encode a memory of those target sounds, whereas neurons tuned to self-produced vocalizations encode a representation of current vocal sounds. The presence of neurons tuned to memorized sounds is limited to early stages of sensorimotor integration: after learning, the incidence of neurons encoding memorized vocal sounds was greatly diminished. In contrast to this circuit, neurons known to drive vocal behavior through a parallel cortico-basal ganglia pathway show little selective tuning until late in learning. One interpretation of these data is that representations of current and target vocal sounds in the shell circuit are used to compare ongoing patterns of vocal feedback to memorized sounds, whereas the parallel core circuit has a motor-related role in learning. Such a functional subdivision is similar to mammalian cortico-basal ganglia pathways in which associative-limbic circuits mediate goal-directed responses, whereas sensorimotor circuits support motor aspects of learning.
Introduction
Both humans and songbirds display the rare trait of vocal learning; juveniles hear and memorize vocal sounds of adult “tutors” during a sensitive period of development and then translate the neural memory of those sounds into a motor program (Doupe and Kuhl, 1999). The existence of a stable memory of tutor sounds in songbirds was first demonstrated behaviorally by seminal work in the 1950s (Marler, 1956; Thorpe, 1958; Marler, 1970). Acquiring a neural representation of tutor song is the first essential step in the process of vocal learning, and, once formed, this memory guides sensorimotor integration as each bird learns to imitate its tutor's song by vocalizing and using auditory feedback to compare its incipient vocalizations to that tutor memory (Konishi, 1965).
Parallel cortico-basal ganglia loops in mammals process distinct but related aspects of sequential motor and reward-based learning (Hikosaka et al., 1999; O'Doherty et al., 2004; Tanaka et al., 2004; Yin et al., 2009; Thorn et al., 2010). In songbirds, the cortical region LMAN (lateral magnocellular nucleus of the anterior nidopallium) is required for vocal learning (Bottjer et al., 1984; Scharff and Nottebohm, 1991; Aronov et al., 2008). LMAN is composed of two subregions, a core and a surrounding shell, whose pathways make parallel connections to form recurrent loops through the basal ganglia and through other cortical regions (Bottjer, 2004; Fig. 1A). Neurons in LMANcore project to vocal motor cortex and drive motor variability that is characteristic of babbling in juvenile birds to facilitate trial-and-error learning (Olveczky et al., 2005; Warren et al., 2011; Ravbar et al., 2012). However, strong evidence for a neural representation unique to tutor song in this circuit has been lacking. Neurons in the shell region of LMAN project to an adjacent area of motor cortex necessary for accurate temporal sequencing and copying of tutor syllables but not for normal vocal production (Johnson et al., 1995; Iyengar et al., 1999; Bottjer et al., 2000; Bottjer and Altenau, 2010). Furthermore, the shell pathway receives a strong transient projection from LMANcore only in juvenile birds (Miller-Sims and Bottjer, 2012; Fig. 1B), indicating that core neurons convey efference copy to LMANshell circuitry as birds are engaged in vocal learning. This pattern of results suggests that neurons in LMANshell may contribute to evaluating whether current vocal behavior matches the tutor song memory.
Neurons in LMANcore develop auditory selectivity to each bird's own song during vocal learning, but individual core neurons in young juveniles respond to both the bird's own song and the tutor song even when they have been manipulated to be acoustically dissimilar (Solis and Doupe, 1997, 1999, 2000; Yazaki-Sugiyama and Mooney, 2004). We predicted that, in contrast to LMANcore, LMANshell in young birds should receive information regarding learned tutor sounds and hence contain neurons that respond selectively to the tutor song. In accord with this prediction, we found that LMANshell contains distinct populations of neurons that encode either the tutor song or self-produced vocal babbling in birds in early stages of sensorimotor integration. In adult birds that have completed the process of vocal learning, the neural representation of each bird's own song is strengthened as the representation of the tutor song is diminished, indicating that these neuronal populations are specialized for the period of vocal learning.
Materials and Methods
Subjects.
All procedures were performed in accordance with protocols approved by the University of Southern California Animal Care and Use Committee. Twenty-eight urethane-anesthetized male zebra finches (Taeniopygia guttata) of three different ages were used: (1) 43–47 d post-hatch (dph; mean, 45.5; n = 6); (2) 54–68 dph (mean, 60.9; n = 10); and (3) adult birds (>90 dph; n = 12). All birds were bred in group aviaries and remained with their natural parents until at least 35 dph to ensure that they received normal tutor song exposure and social experience (Immelmann, 1969; Böhner, 1983, 1990; Eales, 1985; Clayton, 1987; Catchpole and Slater, 1995; Mann and Slater, 1995; Roper and Zann, 2006). Because birds were raised by their parents, they were exposed to their father's song (the tutor song) during both the early period of social imprinting as well as during the tutor memorization period; although the presence of the tutor during the imprinting period does not affect song learning (Roper and Zann, 2006), exposure to the tutor song during imprinting might influence the development of neural responsiveness.
Previous work in our laboratory has shown, consistent with the studies cited above, that natural rearing until 35 dph constitutes sufficient tutor exposure to develop an accurate copy of the tutor song (Foster and Bottjer, 2001). Most birds raised under these naturalistic conditions (i.e., by their biological parents within a colony) memorize the song of their father (see references above). Because juveniles had access to other adult males, they could have memorized some aspects of other males' songs (Williams, 1990). However, this tendency would result in weaker tuning to the tutor song of the natural father and would therefore work against an overall pattern of tutor tuning. Therefore, the tutor for each bird was its natural father; because the songs of individual zebra finches vary considerably, each experimental bird learned a specific and unique tutor song (TUT).
Song stimuli.
Birds were placed in individual sound attenuation chambers, and songs were recorded using Sound Analysis Pro (44 kHz sampling rate; Tchernichovski et al., 2000). All song stimuli were chosen based on how frequently they were produced by each bird, as in previous studies (Solis and Doupe, 1999; Phan et al., 2006; Kojima and Doupe, 2007); at least 10 song bouts (periods in which multiple song motifs are sung) for adult song and at least 20 song bouts for juvenile song were inspected both aurally and visually to pick the song motif that represented the most frequently produced song for each bird. Each song stimulus was high-pass filtered at 400 Hz and edited to include one or two song motifs (Solis and Doupe, 1997; Rosen and Mooney, 2000; Solis and Doupe, 2000). Because juvenile song is variable, at the onset of each recording session, we tested several (four to five) motifs of each juvenile bird's own song (BOS) that were produced within 24 h before the experiment. Each version of BOS elicited similar responses in LMANcore and LMANshell of all birds tested (Solis and Doupe, 1997, 1999, 2000); we chose the version of BOS that produced the largest neural response and used it thereafter. This song was also played back as a mirror-image reverse (REV). Conspecific songs (CON) were recorded from adult breeders from non-adjacent aviaries. Age-matched conspecific songs (AMC) were recorded from unfamiliar juveniles matched to age ± 1 d of the experimental bird. Although previous studies have found no intensity preferences in song control nuclei (Margoliash and Fortune, 1992; Doupe, 1997), we equalized song stimuli to 68.6 ± 0.3 dB (mean ± SEM) measured with a sound level meter (Extech Instruments). Twenty iterations of each stimulus were presented. Song stimuli were presented in blocks of three to five different stimuli (depending on how many stimuli were presented for a given bird). The order of stimuli within a block was random (without replacement), and the interstimulus interval was 10 ± 2 s.
Electrophysiology.
One to 2 d before an experiment, birds were anesthetized with 1.5% isoflurane (inhalation), and a stainless steel post was attached to the skull with dental cement. On the day of the experiment, birds were anesthetized with an intramuscular injection of 20% urethane in dH2O (100–110 μl; Sigma), and the steel post was fixed to a stereotaxic apparatus so that the ears were unobstructed during recording. Extracellular recordings were made using single electrodes (0.4–0.8 MΩ; Carbostar) or linear electrode 16-pad arrays (1.5–5 MΩ; NeuroNexus). Spike data was amplified (Neuralynx), band passed between 300 and 5000 Hz, and digitized at 20 or 32 kHz using Spike 2 software (Cambridge Electronic Design). Small electrolytic lesions (4–7 μA for 20 s) were made above and below LMANcore for single electrodes and in the bottom-most and top-most recording pads of linear arrays for electrode tract reconstruction. Coronal sections (50 μm thick) were Nissl stained to reveal LMANcore borders, and, for seven birds, an alternate series was stained for calbindin using standard immunohistochemical procedures to confirm LMANshell borders (Fig. 2A; Pinaud et al., 2007).
Analyses.
A recording site was considered for analysis if it was confirmed histologically to be in either LMANcore or LMANshell (excluding 50 μm on either side of the core/shell border). The evoked responses of LMAN neurons tended not to persist throughout the whole stimulus, as has been observed previously (Doupe, 1997; Solis and Doupe, 1997, 1999; Kojima and Doupe, 2007). Therefore, response strengths calculated for song stimuli longer than ∼1 s underestimated the actual response by averaging across both the early phasic response and the period of decreased response (cf. Doupe, 1997; Solis and Doupe, 1999). To correct for stimulus duration bias (e.g., longer songs would yield speciously low response strengths), all analyses were performed using neural data collected during the first second of song playback (if the song was >1 s; some adult songs were slightly <1 s in duration). Our results for LMANcore replicate those reported previously (Doupe, 1997; Solis and Doupe, 1997, 2000; Rosen and Mooney, 2000; Kojima and Doupe, 2007), showing that calculation of response strength based on the first second did not change the nature of the results. In addition, because neurons showed either an enhanced or a suppressed response to song playback (see Results), it was important to determine whether rate increases and rate decreases were evaluated equally when restricting the analysis to the first second of the response. To address this point quantitatively, we examined the time course of responses of 20 randomly selected neurons that showed decreased firing rates during playback. Decreases in firing rate in this sample of neurons lasted on average 0.13–1.20 s after stimulus onset. This pattern was equivalent to that of neurons that show increases in firing rate; that is, like rate increases, rate decreases occurred primarily during the first second of a song stimulus. We also assessed the responses of 20 randomly selected neurons that showed no significant change in firing rate during playback to test whether any of these neurons showed a significant increase or decrease in firing rate if the response window was extended to the entire stimulus length. None of these neurons showed any significant responses when tested this way. Therefore, restriction of analyses to the first second does not differentially reject neurons that show decreased firing rates, and extending the analysis window did not increase the incidence of song-responsive neurons.
Isolated units were clustered manually using MClust-3.5 (A. David Redish, University of Minnesota, Minneapolis, MN; Fig. 2B,C) in MATLAB (MathWorks). To confirm the quality of clustering, all 45 dph recordings were reclustered automatically using KlustaKwik (Ken Harris, Rutgers University, Newark, NJ) based on the maximum and first derivative of energy of the waveforms, the first two principal components, and the rising and falling slope of the waveforms. Within KlustaKwik, the minimum and maximum number of clusters required was 4 and 30, respectively, and the KlustKwik clustering algorithm was used. Those clusters were then manually inspected and corrected when needed. The two different sorting methods yielded highly similar patterns of data; the results presented here are from the 45 dph data that were automatically clustered. Spike sorting methods inevitably include some error (Harris et al., 2000; Pedreira et al., 2012), but standard precautions were taken to ensure that any error was minimal: single units were considered for analysis if the signal-to-noise ratio was ≥3 (mean ± SEM, 5.9 ± 0.09) and if <1% of spikes had an interspike interval <2 ms (core: 45 dph, n = 93, 60 dph, n = 214; adult, n = 308; shell: 45 dph, n = 114; 60 dph, n = 172; adult, n = 265).
We determined whether each single unit was responsive by testing for a significant change in firing rate from baseline (1 s before stimulus) during presentation of any song stimulus (matched t test, p < 0.05). To compare differences in firing rates across neurons, standardized responses (SRs) were calculated:
where S is firing rate during stimulus, and B is firing rate during baseline so that a positive value indicates an increased rate to a stimulus and a negative value indicates a decreased rate to a stimulus. We did not adjust the SRs for the number of trials (i.e., multiply SR by √# trials). Therefore, when accounting for the number of trials, all neurons that were considered responsive to a stimulus showed an adjusted SR of at least 2.24. In describing the responses of single units, use of the word “only” signifies that an individual neuron gave a statistically significant response to only one stimulus of three to five tested (see Results). For example, “TUT only” refers to neurons that gave a significant response only to TUT and not to any other stimulus. In contrast, “TUT-responsive” neurons included those that showed a significant response to TUT, regardless of whether they also responded significantly to other stimuli. Similarly, “BOS only” refers to the subset of neurons that responded significantly only to BOS and not to any other stimulus, and “BOS responsive” refers to the larger group of neurons that responded significantly to BOS regardless of responses to other song stimuli. When describing the responses of single units, “excited” and “suppressed” are used as operational terms that signify firing rate increases and decreases compared with baseline rates and are not descriptions of the underlying synaptic mechanisms responsible for changes in firing rate.
To measure selectivity, a difference score was calculated for “Song A” as follows: Song AΔSR = SRSong A − SRSong B. This is an alternative to the psychometric discriminability index d′ used in previous studies. We chose to standardize responses before subtracting as opposed to subtracting response strengths and dividing by the standard deviations as in d′. As other groups have noted (Solis and Doupe, 1997; Coleman and Mooney, 2004), this use of d′ scores has limitations, including the fact that the score is sensitive to response strength and is less useful for revealing the degree to which the response of one stimulus is greater or less than another. Selectivity scores based on proportions were not appropriate for the data because there were frequently negative response strengths that would confound the interpretation of the score. Use of SR differences corrects for differences in response strength, allows the interpretation of scores calculated from a negative response, and retains one of the benefits of d′, insensitivity to sample size. To address whether d′ scores would produce substantially different patterns of selectivity, we calculated d′ values for 45 dph TUT- and BOS-responsive neurons (see Tables 2, 3). Comparison of d′ scores with ΔSR scores revealed no substantial differences. As shown for ΔSR scores, d′ values revealed that neurons in shell were more selective for BOS or TUT than core neurons. To include all song-responsive neurons in selectivity measures, regardless of whether cells showed increased or decreased firing rates, we reversed the ΔSR scores and d′ values of song-suppressed neurons.
Table 2.
d′ comparison | TUT-responsive 45 dph neurons |
|
---|---|---|
Core | Shell | |
TUT–BOS | 0.00 ± 0.06 | 0.26 ± 0.07** |
TUT–REV | 0.26 ± 0.07 | 0.48 ± 0.07* |
TUT–CON | 0.23 ± 0.09 | 0.63 ± 0.12** |
TUT–AMC | 0.16 ± 0.08 | 0.54 ± 0.12** |
Core, n = 41 versus BOS and REV, n = 24 versus CON and AMC; shell, n = 37 versus BOS and REV, n = 19 versus CON and AMC.
*p < 0.05 and **p < 0.01 for significant differences between mean d′ values of core and shell. We used ΔSR scores in preference to d′ scores for methodological reasons (see Materials and Methods). However, both measures showed good agreement in terms of assessing differences in selectivity between shell and core neurons: TUT selectivity of shell neurons was higher than that in core neurons.
Table 3.
d′ comparison | BOS-responsive 45 dph neurons |
|
---|---|---|
Core | Shell | |
BOS–REV | 0.41 ± 0.05 | 0.58 ± 0.06* |
BOS–TUT | 0.27 ± 0.04 | 0.49 ± 0.07** |
BOS–CON | 0.39 ± 0.06 | 0.62 ± 0.10* |
BOS–AMC | 0.32 ± 0.05 | 0.50 ± 0.08 |
Core, n = 65 versus REV, n = 55 versus TUT, n = 38 versus CON and AMC; shell, n = 36 versus REV, n = 32 versus TUT, n = 16 versus CON and AMC.
*p < 0.05 and **p < 0.01 for significant differences between mean d′ values of core and shell. We used ΔSR scores in preference to d′ scores for methodological reasons (see Materials and Methods). However, both measures showed good agreement in terms of assessing differences in selectivity between shell and core neurons: BOS-selectivity of shell neurons was higher than that in core neurons.
Statistics.
All continuous data were tested for normality using Kolmogorov–Smirnov and Shapiro–Wilk tests. t tests were used to compare means for normally distributed data, and Mann–Whitney tests were used to compare medians for non-normally distributed data. Differences in proportions were tested using χ2 tests, and differences in distributions were tested with Kolmogorov–Smirnov Z tests. A Z test for proportions was used to test whether the relative proportions of excited and suppressed 45 dph shell neurons were different from chance. Bonferroni's corrections were applied wherever multiple comparisons were made.
Results
Distinct neuronal populations in LMANcore and LMANshell of juvenile birds
To test whether LMANshell contains a neural representation of the tutor song during vocal learning, we recorded from core and shell regions of LMAN in juvenile zebra finches (Taeniopygia guttata) that have completed memorization of a tutor song and begun to practice their song vocalizations (45 dph). We tested each neuron for auditory responsiveness, expressed as a significant change in firing rate from baseline during playback of each song stimulus (p < 0.05, t test). A high proportion of neurons in both core and shell gave an auditory response to playback of at least one song stimulus; the proportion of core neurons that showed a significant auditory response was higher than that for shell neurons (0.89 vs 0.68, p < 0.001, χ2 test; core, n = 83 of 93; shell, n = 77 of 114; core/shell auditory neurons per birds for n = 6 birds, n = 18/17, 0/16, 20/36, 2/0, 32/4, and 11/4).
As an initial population analysis, we categorized each auditory neuron according to whether it exhibited a significant change in firing rate to BOS only, TUT only, or both BOS and TUT (Fig. 3A; all of these neurons were tested with TUT, BOS, and a mirror-image reverse of BOS in which temporal structure is reversed while leaving spectral content intact). For example, TUT-only neurons showed a statistically significant response only to TUT and not to BOS or REV (see Materials and Methods). A large proportion of neurons in shell showed a significant response only to TUT (0.32, 21 of 65 neurons). In contrast, only a small proportion of core neurons responded to TUT only (0.07, 5 of 68 neurons; p = 0.003, χ2 test). However, approximately equal proportions of core and shell neurons responded only to BOS. In addition, a large proportion of core neurons responded to both BOS and TUT, whereas few shell neurons responded significantly to both stimuli (0.47 vs 0.15; 32 of 68 vs 10 of 65 neurons; p < 0.0001, χ2 test). These results show that LMANshell contains two distinct populations of auditory neurons during the period of sensorimotor integration: those that respond only to the tutor song and a separate population that responds only to the current version of the bird's own song. Relatively few shell neurons respond to both BOS and TUT. In contrast, the majority of neurons in LMANcore respond either only to the bird's own song or to both BOS and TUT.
To more fully characterize the populations of core and shell neurons, we presented a subset of neurons with an expanded stimulus set of five different songs, including an adult CON and AMC in addition to BOS, REV, and TUT (Table 1). Despite the inclusion of more song types, these data confirmed that a large proportion of neurons gave a significant response only to the tutor song in LMANshell compared with a smaller proportion in LMANcore (0.28 vs 0.04, 11 of 40 vs 2 of 52 neurons; p < 0.01, χ2 test). A relatively large proportion of core neurons responded significantly to combinations of TUT plus other songs compared with shell neurons (sum of rows 2–4; 0.43 vs 0.15; p = 0.005, χ2 test). Expressed as a relative percentage, 91% of individual TUT-responsive neurons in core also responded to other songs compared with only 35% in shell (p < 0.001, χ2 test). The proportion of neurons that responded only to BOS did not differ between core and shell (0.15 vs 0.13, 8 of 52 vs 5 of 40 neurons), confirming that each subregion contains a population of neurons that responds only to BOS (compare with Fig. 3A). Fewer shell than core neurons were BOS-responsive overall (sum of rows 3–6; 0.73 vs 0.43; p = 0.003, χ2 test), such that, although more neurons in core than shell responded to BOS plus other stimuli (sum of rows 3, 4, and 6; 0.58 vs 0.30; p = 0.008, χ2 test), the relative percentage of BOS-responsive neurons that also responded to other songs did not differ between core and shell (79 vs 70%). Interestingly however, many neurons in core were very broadly tuned (i.e., responded significantly to BOS, TUT, plus at least one other song stimulus; row 4) compared with shell (0.25 vs 0.05; 13 of 52 vs 2 of 40 neurons).
Table 1.
Row | Neuron category | Core (n = 52) | Shell (n = 40) |
---|---|---|---|
1 | TUT only | 0.04 | 0.28** |
2 | TUT + REV, CON or AMC | 0.10 | 0.05 |
3 | BOS + TUT only | 0.08 | 0.05 |
4 | BOS + TUT + other songs | 0.25 | 0.05* |
5 | BOS only | 0.15 | 0.13 |
6 | BOS + REV, CON or AMC | 0.25 | 0.20 |
Sum 2–4/sum 1–4 | Relative percentage of TUT-responsive neurons that respond to other songs | 91 | 35*** |
Sum 3,4,6/sum 3–6 | Relative percentage of BOS-responsive neurons that respond to other songs | 79 | 70 |
For this analysis, “TUT only” indicates neurons that showed a significant response to TUT and not to BOS, REV, CON, or AMC; “BOS only” indicates neurons that responded significantly to BOS and not to REV, TUT, CON, or AMC. Rows 1–6 sum to 87% for core and 76% for shell (remaining neurons did not respond to BOS or TUT).
*p < 0.05, **p < 0.01, and ***p < 0.001 for significant differences between LMANcore and LMANshell proportions (χ2 tests).
Figure 4 shows examples of individual TUT-responsive neurons in 45 dph juvenile LMANshell and LMANcore during playback of various song stimuli. Rasters for each neuron indicate that responses in both subregions tended to be somewhat sparse and variable from trial to trial, as reported previously for LMANcore (Doupe, 1997). A representative TUT-responsive LMANshell neuron showed a strong response only to TUT, as well as a modest but nonsignificant response to BOS and an even lower response to BOS played in reverse (Fig. 4A). In contrast, a typical TUT-responsive neuron from LMANcore gave a significant increased rate to BOS, REV, and TUT (Fig. 4B). This broad responsiveness concurs with previous studies that found that most TUT-responsive neurons in core also respond to BOS (Solis and Doupe, 1997; 1999; 2000; Yazaki-Sugiyama and Mooney, 2004), as was also true for TUT-responsive neurons found in other song-control nuclei (Nick and Konishi, 2005a). Another difference between shell and core subregions was that song-evoked activity in shell neurons could manifest as either increases or decreases in firing rate, whereas core neurons showed increased rate responses only. Figure 4C shows a shell neuron that showed a suppressed response to TUT but not to BOS or CON (see below).
Figure 5 shows examples of individual BOS-responsive neurons in shell and core of 45 dph juveniles. The core neuron fired strongly in response to BOS, REV, and TUT, whereas the shell neuron showed significant increased firing only to BOS. Thus, as for TUT-responsive neurons, BOS-responsive neurons in core tended to respond to multiple other stimuli, whereas those in shell were likely to respond strongly only to BOS. In this example, the response of the shell neuron distinguished the bird's own babbling song from that of another young bird (AMC) as well as from the tutor song.
Neurons in LMANshell of 45 dph birds were either excited or suppressed by playback
As indicated above, many auditory neurons in shell, but not core, of 45 dph birds showed suppressed responses to playback of song (Fig. 4C). Among all BOS-responsive shell neurons (i.e., those that showed a significant response to BOS alone or in combination with other song stimuli), 81% (29 of 36) were excited, whereas 19% (7 of 36) showed a suppressed response to BOS. Among TUT-responsive shell neurons, 76% (28 of 37) showed excitation and 24% (9 of 37) showed suppression to TUT. Interestingly, BOS- and TUT-excited shell neurons never showed suppression to any other stimulus. This mutual exclusion was true for suppressed neurons as well (except for 1 of 73 cases in which the neuron showed increased firing to REV and suppression to TUT). Thus, shell neurons encoded a representation of TUT and BOS via either increases or decreases in response strength.
Among core neurons that were tested with TUT, BOS, and REV, none showed suppression to any song (n = 68). In contrast, among shell neurons that responded only to BOS, 84% (16 of 19) showed an enhanced response, whereas 16% (3 of 19) showed a suppressed response (p = 0.003, Z test for proportion; Fig. 3B). Among shell neurons that responded only to TUT, 67% (14 of 21) showed increased firing and 33% (7 of 21) showed suppression (p = 0.12, Z test for proportion). This pattern suggests that neurons that responded only to TUT were more likely to show suppression than were neurons that responded only to BOS in LMANshell of 45 dph birds. However, despite the tendency toward a greater incidence of song suppression within neurons that responded only to TUT, the relative proportion of excited/suppressed neurons was not significantly different between neurons that responded only to TUT and those that responded only to BOS (p = 0.20, χ2 test).
TUT-responsive neurons were more selectively tuned in juvenile shell than core
To test the relative magnitude of responses across all TUT-responsive neurons, we compared response strengths with all song stimuli in 45 dph LMANshell and LMANcore. SRs for TUT-excited shell neurons were largest to TUT compared with all other songs (Fig. 6A, filled red bars; p < 0.05 in all cases, independent t test, unequal variance, Bonferroni's corrected). In contrast, TUT-excited core neurons responded equally well to TUT and BOS (filled gray bars; p = 1.0, Mann–Whitney test, Bonferroni's corrected) but, like shell neurons, responded less to REV, CON, and AMC (p < 0.05 in all cases, Mann–Whitney test, Bonferroni's corrected). Furthermore, shell neurons responded much less to BOS, REV, CON, and AMC than did those in core (p < 0.05 in all cases, independent t test, unequal variance for CON comparison). Neurons in shell that exhibited suppression to TUT showed similar response patterns to TUT-excited neurons: response suppression was greatest to TUT compared with other songs (open red bars; p < 0.01 in all cases, Mann–Whitney test, Bonferroni's corrected). As indicated above, TUT-suppressed neurons were not detected in core. In summary, TUT-responsive neurons in core and shell showed comparable response strength to TUT, but neurons in shell gave a much weaker response to non-tutor songs than did those in core.
To quantify response selectivity between pairs of stimuli, we computed a difference score for each neuron: ΔSR = SRSong A − SRSong B (a score of zero indicates no song preference). Across all TUT-responsive neurons in shell of 45 dph birds, mean selectivity (ΔSR) scores revealed a significant preference for TUT versus all other songs (Fig. 6B; ΔSR significantly different from zero, p < 0.001 in all cases, one-sample t test; Fig. 6E shows data for individual birds; for d′ scores, see Table 2). Neurons in core also preferred TUT over REV, CON, and AMC (all cases significantly different from zero, p < 0.05, one-sample t test). However, there was no significant selectivity for TUT versus BOS (p = 0.30, one-sample t test), indicating no song preference between TUT and BOS in 45 dph core neurons. Furthermore, shell neurons exhibited much greater selectivity for TUT over all other songs than did core neurons (p value always <0.005, independent t test). These results for TUT-responsive neurons demonstrate that shell neurons showed greater selectivity for TUT compared with those in core for all comparisons with other song stimuli.
Cumulative distributions of selectivity scores for TUT versus BOS were shifted to the right for shell relative to core neurons (Fig. 6C), demonstrating that more neurons preferred TUT over BOS in shell (p = 0.002, Kolmogorov–Smirnov Z test). As a control for whether the selective response to TUT simply reflected a preference for an adult song versus an immature juvenile song, we also compared cumulative distributions for adult CON versus AMC (Fig. 6D). Shell neurons evinced no preference for CON over AMC (p = 0.56, one-sample t test), and the distributions of core versus shell selectivity scores did not differ (p = 0.62, Kolmogorov–Smirnov Z test). Thus, TUT-responsive shell neurons show a specific preference for the tutor song they have memorized as opposed to a generic preference for general characteristics of adult song over immature song.
BOS-responsive neurons were more selectively tuned in juvenile shell than core
We also tested BOS-responsive neurons in shell versus core of 45 dph birds by comparing relative response strength across song stimuli. BOS-excited neurons in both core and shell showed a stronger response to BOS than to all other songs (Fig. 7A; p < 0.01 for all cases, Mann–Whitney test, Bonferroni's corrected). However, shell neurons responded less strongly to REV, TUT, and AMC than did core neurons (p = 0.005, 0.002, and 0.047, respectively, Mann–Whitney test for REV comparison, independent t test for TUT and AMC comparisons). The response to CON did not differ between core and shell (p = 0.38, independent t test, unequal variance). BOS-suppressed neurons were found only in shell and showed greater suppression to BOS compared with REV (p = 0.002, Mann–Whitney test, Bonferroni's corrected). Thus, neurons in core and shell responded comparably with the bird's own babbling sounds, but with the exception of CON, neurons in shell gave a much weaker response to non-BOS songs than did those in core.
As in the case of TUT-responsive neurons, higher selectivity scores among BOS-responsive shell neurons reflected their weaker response to non-preferred songs (Fig. 7B). BOS-responsive neurons in both core and shell were selective for BOS over all other songs (ΔSR different from zero, p < 0.001 in all cases, one-sample t test; Fig. 7E shows data for individual birds; for d′ scores, see Table 3). However, shell neurons showed higher selectivity than core neurons for BOS compared with all other songs (p values always <0.03, Mann–Whitney test for REV and AMC comparisons, independent t test for TUT and CON comparisons). Likewise, more neurons in shell were selective for BOS over TUT than in core, as illustrated by the right-shifted cumulative distribution (p = 0.009, Kolmogorov–Smirnov Z test; Fig. 7C). If BOS-responsive shell neurons simply preferred a juvenile song over adult songs, then one would expect high selectivity scores for AMC–CON comparisons. However, the cumulative distributions of AMC–CON ΔSR scores revealed no selectivity (Fig. 7D; scores not different from zero; core, p = 0.15 and shell, p = 0.86, one sample t test), and there was no difference between the distributions (p = 0.277, Kolmogorov-Smirnov Z test). These results show that, although response strength to BOS is similar in core versus shell, responses of shell neurons are more selective as a result of diminished responses to other song stimuli. Therefore, LMANshell of 45 dph birds contains a population of neurons that encodes a selective representation of vocal babbling sounds and a distinct population of neurons that is selective to the adult tutor song of each bird.
Selective tuning to tutor song is restricted to a phase of early sensorimotor integration
The presence of a substantial population of neurons that respond only to the tutor song in LMANshell of 45 dph birds shows that the shell pathway has access to a representation of target vocal sounds during the period of active imitative learning. To test whether a representation of the tutor song persists in shell neurons after the conclusion of vocal learning, we recorded from core and shell in older juveniles at a later stage of sensorimotor integration when birds are singing “plastic” song (60 dph) and in adults that are producing a stereotyped song after the learning period (>90 dph). These neurons were categorized as in Figure 3 according to whether BOS, TUT, or both songs evoked a significant response. This developmental analysis revealed that the high proportion of shell neurons showing a significant response only to TUT at 45 dph was greatly reduced by 60 dph (p = 0.01, χ2 test, Bonferroni's corrected), and this proportion remained low in adult birds (Fig. 8A, left). The substantial reduction of TUT-only shell neurons by 60 d eliminated the significantly greater incidence of tutor-tuned neurons in shell versus core seen at 45 dph. The proportion of core neurons that responded only to TUT did not change between 45 and 60 d (p = 0.52, χ2 test, Bonferroni's corrected).
Developmental changes in BOS responses showed a different pattern (Fig. 8A, middle). The proportion of neurons that showed a significant response only to BOS increased in both core and shell between 45 and 60 d (core, p = 0.03 and shell, p = 0.07, χ2 test, Bonferroni's corrected) and was not different between the two subregions at any age. Therefore, the neural representation of each bird's own song became stronger in both subregions of LMAN as vocal development progressed from vocal babbling to plastic song. The proportion of core neurons that responded to both BOS and TUT decreased substantially between 45 and 60 d (Fig. 8A, right; p = 0.002, χ2 test, Bonferroni's corrected) and remained low in adulthood. Although the proportion of shell neurons that responded to both BOS and TUT did not change throughout development, by adulthood a greater proportion of neurons in shell responded to BOS and TUT than in core (p = 0.008, χ2 test, Bonferroni's corrected). In summary, these data show that, with learning, a larger proportion of neurons in both subregions respond significantly only to BOS as song matures, and the proportion of neurons in LMANshell that respond only to the tutor song diminishes.
We found that core neurons became more selectively tuned to BOS during development, in accord with reports of selectivity in previous studies (although this represents the first test of BOS selectivity in birds younger than 55 dph) (Doupe, 1997; Solis and Doupe, 1997). The selectivity of core neurons for BOS increased between 45 and 60 dph (p < 0.001 for both REV and TUT comparisons, independent t test, unequal variance for TUT comparison), such that there was no difference in ΔSR scores of shell versus core neurons by 60 dph (Fig. 8B,C; independent t test, unequal variance for TUT comparison). The selectivity of shell neurons for BOS over both REV and TUT did not change between 45 and 60 dph (p > 0.05 in both cases, independent t test, Bonferroni's corrected), although BOS–REV selectivity did increase between 45 dph and adulthood (p = 0.03, independent t test, Bonferroni's corrected). Response strength to BOS did not change in either subregion during development (Fig. 8D; Mann–Whitney test, Bonferroni's corrected), indicating that the developmental increases in BOS selectivity were attributable to decreased responses to other stimuli. These results indicate that shell neurons show relatively more mature levels of selectivity than core neurons early in vocal learning and that the increase in selectivity throughout development in both subregions is the result of weaker responses to nonpreferred stimuli.
Because relative levels of inhibition contribute to neural selectivity (Rosen and Mooney, 2000; 2003; Pinaud et al., 2008) as well as regulation of sensitive periods (Hensch et al., 1998), we examined the proportion of all auditory neurons with enhanced versus suppressed responses throughout development. LMANcore is known to include song-suppressed neurons in adult but not juvenile birds (Doupe, 1997). In accord with this fact, we found that the proportion of core neurons that showed a suppressed response to at least one song stimulus increased throughout development (Fig. 9A, gray bars; 0.01 at 45 dph, 0.13 at 60 dph, 0.25 in adults; p < 0.001, three-way χ2 test). Among BOS-responsive neurons, the percentage in core that were suppressed by BOS also increased significantly from 0% at 45 dph to 8% at 60 dph and 12% in adults (Fig. 9B, gray bars; p = 0.016, three-way χ2 test). In contrast to core, the proportion of shell neurons that showed suppression to at least one song stimulus did not change during development (Fig. 9A, red bars; 0.26 in 45 dph birds, 0.33 in 60 dph, 0.18 in adults; p = 0.12, three-way χ2 test). Moreover, the percentage of BOS-responsive neurons in shell that were suppressed by BOS did not change during development (Fig. 9B, red bars; 19% at 45 dph, 19% at 60 dph, 13% in adults; p = 0.55, three-way χ2 test). Thus, suppressive responses to song in LMANcore mature throughout development and gradually come to match the level of song suppression seen in LMANshell. In addition, these results suggest that maturation of inhibitory circuits may underlie increased selective tuning in LMAN and represent a correlate of learning.
Discussion
These results demonstrate that LMANshell contains a population of neurons that encodes a neural representation of learned tutor sounds in juvenile songbirds during early stages of sensorimotor integration for vocal learning (45 dph). This discovery represents the first instance in which a substantial population of vocal-control neurons responds selectively to tutor song in birds that are acquiring learned vocal behavior. The TUT-tuned neurons are present at a time when lesions of AId (dorsal intermediate arcopallium), which interrupt the trans-cortical recurrent pathway from LMANshell through dorsal thalamus, prevent both imitation of the tutor song and the development of a stable sequence of syllables (Bottjer and Altenau, 2010). Furthermore, the large proportion of neurons that responds only to tutor song is not present in LMANshell of adult birds that have finished learning to produce a stereotyped imitation of that song. Tutor-tuned neurons in 45 dph LMANshell coexist with a separate population that responds selectively to the bird's own song, suggesting that discrete populations of shell neurons receive signals from regions of higher-level auditory cortex that process either vocal-related feedback (BOS tuning) or aspects of tutor song (TUT tuning) (Phan et al., 2006; London and Clayton, 2008; Gobes et al., 2010; Hahnloser and Kotowicz, 2010). In contrast to LMANshell, most neurons in LMANcore of young birds responded to BOS or to BOS plus other songs, in agreement with previous studies (Solis and Doupe, 1999, 2000; Yazaki-Sugiyama and Mooney, 2004). The overall pattern of results indicates that distinct representations of both the tutor song and the bird's own song are available only in shell at 45 dph and that neurons in core do not develop a selective representation of self-produced vocalizations until late in learning.
At 45 dph, LMANshell contained TUT- and BOS-suppressed neurons, demonstrating that shell neurons represent the tutor song and bird's own song via either increases or decreases in firing rate, similar to responses to learned sounds in prefrontal cortex of mammals (Suzuki et al., 1997; Fritz et al., 2010). Thalamic input to LMAN is excitatory (Livingston and Mooney, 1997; Boettiger and Doupe, 1998; Bottjer et al., 1998; Pinaud et al., 2007), so suppression presumably results from intrinsic inhibitory interneurons such as those that contribute to selectivity in core (Rosen and Mooney, 2000). Thus, TUT-tuned and BOS-tuned excitatory neurons in DLM (dorsolateral medial thalamus) and LMANshell may drive intrinsic inhibitory activity onto specific subgroups of shell neurons (BOS- and TUT-suppressed neurons) and contribute to greater selectivity (Blättler and Hahnloser, 2011).
Comparisons of response strength and selectivity across all TUT-responsive neurons showed that individual cells in 45 dph birds were more selectively tuned to TUT in shell than in core. This selectivity was attributable to the fact that shell neurons showed a much weaker response to non-tutor songs than did core neurons. In addition, BOS-responsive core neurons at 45 dph were less selective for BOS than shell neurons. By the time birds reached an intermediate stage of sensorimotor integration (60 dph), a robust representation of tutor song was no longer present in shell. By this age, BOS-suppressed responses have emerged in core, and core neurons have developed a selective representation of self-produced sounds (Eliades and Wang, 2008). The earlier maturation of selective tuning in LMANshell suggests that shell neurons may contribute to emergent auditory tuning in core neurons between 45 and 60 dph.
Sixty days post-hatch corresponds to the earliest time when birds have formed a fairly stable sequence of song syllables and lesions to LMAN (including both core and shell) have lost the ability to substantially disrupt song behavior (Bottjer et al., 1984). Additionally, the volume of LMANshell increases dramatically during early sensorimotor integration and regresses by 60 dph (Johnson and Bottjer, 1992; Johnson et al., 1995). Thus, the suite of developmental changes that occurs in LMAN by this age, including the loss of TUT-tuned shell neurons, may reflect a diminished capacity for plasticity and thereby contribute to the close of the sensitive period for vocal learning.
A neural memory of the tutor song represents a necessary target to which feedback of current song behavior can be compared (Thorpe, 1958; Konishi, 1965). Multiple sites are likely to encode a representation of the tutor song (Nick and Konishi, 2005b; London and Clayton, 2008; Prather et al., 2010; Adret et al., 2012), some of which may convey information to LMANshell. Another source of tutor song information may come through a postsynaptic target of LMANshell, dNCL (dorsal caudolateral nidopallium; Fig. 10), which contributes to the trans-cortical recurrent loop. Indeed, singing in combination with hearing TUT strongly induces immediate early gene expression in dNCL neurons (Bottjer et al., 2010).
The presence of TUT-tuned neurons during a restricted interval of vocal learning suggests that they may serve as a referent that enables a correct imitation of tutor song to gradually emerge during sensorimotor integration. During vocal babbling, juvenile birds will occasionally produce a sound that is similar enough to tutor song to activate TUT-selective neurons; this would enable TUT-tuned neurons to act as a filter for correct performance. Concurrent feedback of babbling sounds could drive BOS-selective neurons such that both TUT-tuned and BOS-tuned neurons are activated simultaneously. Convergence of signals from TUT- and BOS-tuned neurons in downstream circuitry would constitute a means of evaluating matches between current and target behavior. Interestingly, the strong transient projection from juvenile core to AId (Miller-Sims and Bottjer, 2012; Fig. 1B) during early stages of song development indicates that shell circuitry receives an efference copy of the motor signal for vocal output. In addition, AId projects to the region of dopaminergic neurons in the ventral tegmental area that projects to Area X (Bottjer et al., 2000) and hence could relay error/reinforcement signals into the cortico-basal ganglia circuits that collectively regulate vocal learning.
Auditory responses are generally reduced during alert wakefulness in the song motor system (Dave et al., 1998; Cardin and Schmidt, 2004), although BOS-selective responses have been observed in LMANcore of awake birds (Hessler and Doupe, 1999). Our results were obtained in anesthetized birds, and the role of behavioral state in neural responses of shell neurons is not known. It is possible that, in addition to or instead of an online role, TUT-tuned neurons in LMANshell are important for offline learning that occurs during sleep (Shank and Margoliash, 2009). Chronic recordings in core and shell of juvenile singing birds will be needed to test directly the role of TUT- and BOS-selective neurons.
Models of reinforcement learning in which motor skills are acquired via evaluation of goal-oriented outcomes suggest that parallel basal ganglia circuits like those of LMANcore and LMANshell mediate different aspects of learning (Nakahara et al., 2001; Samejima and Doya, 2007). In mammals, neurons in parallel cortico-basal ganglia loops process distinct aspects of reward-based learning, such as reward prediction error and selected motor actions (Hikosaka et al., 1999; Miyachi et al., 2002; Tanaka et al., 2004; Yin and Knowlton, 2006; Atallah et al., 2007; Samejima and Doya, 2007; Graybiel, 2008; Yin et al., 2009; Thorn et al., 2010; Redgrave et al., 2011). Thorn et al. (2010) describe a model for cortico-basal ganglia circuitry in which sensorimotor pathways regulate motor-related aspects of learning, whereas associative-limbic circuits process goal-directed responses and modulate the sensorimotor loop's control over action. Models based on reinforcement learning have suggested that the LMANcore circuit may function as an “actor/experimenter” to drive variable vocal production (Sutton and Andrew, 1998; Doya and Sejnowski, 2000; Troyer and Doupe, 2000; Fiete et al., 2007; Miller et al., 2010; Fee and Goldberg, 2011). The presence of neurons representing the target behavior—a memory of the tutor song—in juvenile LMANshell is consistent with a model in which shell participates in evaluative goal-oriented responses (Abe and Watanabe, 2011). Like shell neurons, associative cortico-basal ganglia neurons in prefrontal cortex of mammals show activity related to goal states (Saito et al., 2005; Mushiake et al., 2006; Fritz et al., 2010). The LMANshell circuit contributes to both integrative feedforward and feedback pathways, including connections to limbic and reward circuitry (Fig. 10) and so is well suited to integrate these signals with specific song-related information.
The presence and nature of the auditory responses in LMANshell, in addition to lesion and anatomical evidence (Johnson and Bottjer, 1992; Johnson et al., 1995; Bottjer and Altenau, 2010; Bottjer et al., 2010; Miller-Sims and Bottjer, 2012), challenge the idea of a single song-control basal ganglia pathway. More generally, these findings show that two distinctly tuned neuronal populations can represent current and target behaviors during learning. Therefore, these results raise the possibility that the division of learning-related functions into parallel cortico-basal ganglia loops may be a conserved feature of goal-directed motor learning.
Footnotes
This work was supported by National Institute of Neurological Disorders and Stroke Grant NS 037547 and Training Fellowship NS 073323 and National Institute on Deafness and Other Communication Disorders Training Fellowship DC 009975. We thank Vanessa Miller-Sims for RA/Ai tissue and Tansu Celikel for technical advice.
The author(s) declare(s) no competing financial interests.
References
- Abe K, Watanabe D. Songbirds possess the spontaneous ability to discriminate syntactic rules. Nat Neurosci. 2011;14:1067–1074. doi: 10.1038/nn.2869. [DOI] [PubMed] [Google Scholar]
- Adret P, Meliza CD, Margoliash D. Song tutoring in presinging zebra finch juveniles biases a small population of higher-order song-selective neurons toward the tutor song. J Neurophysiol. 2012;108:1977–1987. doi: 10.1152/jn.00905.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aronov D, Andalman AS, Fee MS. A specialized forebrain circuit for vocal babbling in the juvenile songbird. Science. 2008;320:630–634. doi: 10.1126/science.1155140. [DOI] [PubMed] [Google Scholar]
- Atallah HE, Lopez-Paniagua D, Rudy JW, O'Reilly RC. Separate neural substrates for skill learning and performance in the ventral and dorsal striatum. Nat Neurosci. 2007;10:126–131. doi: 10.1038/nn1817. [DOI] [PubMed] [Google Scholar]
- Blättler F, Hahnloser RH. An efficient coding hypothesis links sparsity and selectivity of neural responses. PLoS One. 2011;6:e25506. doi: 10.1371/journal.pone.0025506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boettiger CA, Doupe AJ. Intrinsic and thalamic excitatory inputs onto songbird LMAN neurons differ in their pharmacological and temporal properties. J Neurophysiol. 1998;79:2615–2628. doi: 10.1152/jn.1998.79.5.2615. [DOI] [PubMed] [Google Scholar]
- Böhner J. Song learning in the zebra finch (Taeniopygia guttata): selectivity in the choice of a tutor and accuracy of song copies. Anim Behav. 1983;31:231–237. doi: 10.1016/S0003-3472(83)80193-6. [DOI] [Google Scholar]
- Böhner J. Early acquisition of song in the zebra finch, Taeniopygia guttata. Anim Behav. 1990;39:369–374. doi: 10.1016/S0003-3472(05)80883-8. [DOI] [Google Scholar]
- Bottjer SW. Developmental regulation of basal ganglia circuitry during the sensitive period for vocal learning in songbirds. Ann N Y Acad Sci. 2004;1016:395–415. doi: 10.1196/annals.1298.037. [DOI] [PubMed] [Google Scholar]
- Bottjer SW, Altenau B. Parallel pathways for vocal learning in basal ganglia. Nat Neurosci. 2010;13:153–155. doi: 10.1038/nn.2472. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bottjer SW, Miesner EA, Arnold AP. Forebrain lesions disrupt development but not maintenance of song in passerine birds. Science. 1984;224:901–903. doi: 10.1126/science.6719123. [DOI] [PubMed] [Google Scholar]
- Bottjer SW, Brady JD, Walsh JP. Intrinsic and synaptic properties of neurons in the vocal-control nucleus lMAN from in vitro slice preparations of juvenile and adult zebra finches. J Neurobiol. 1998;37:642–658. doi: 10.1002/(SICI)1097-4695(199812)37:4<642::AID-NEU12>3.0.CO%3B2-G. [DOI] [PubMed] [Google Scholar]
- Bottjer SW, Brady JD, Cribbs B. Connections of a motor cortical region in zebra finches: relation to pathways for vocal learning. J Comp Neurol. 2000;420:244–260. doi: 10.1002/(SICI)1096-9861(20000501)420:2<244::AID-CNE7>3.0.CO%3B2-M. [DOI] [PubMed] [Google Scholar]
- Bottjer SW, Alderete TL, Chang D. Conjunction of vocal production and perception regulates expression of the immediate early gene ZENK in a novel cortical region of songbirds. J Neurophysiol. 2010;103:1833–1842. doi: 10.1152/jn.00869.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cardin JA, Schmidt MF. Auditory responses in multiple sensorimotor song system nuclei are co-modulated by behavioral state. J Neurophysiol. 2004;91:2148–2163. doi: 10.1152/jn.00918.2003. [DOI] [PubMed] [Google Scholar]
- Catchpole C, Slater PJ. Bird song: biological themes and variations. Cambridge: Cambridge UP; 1995. [Google Scholar]
- Clayton NS. Song tutor choice in zebra finches. Anim Behav. 1987;35:714–721. doi: 10.1016/S0003-3472(87)80107-0. [DOI] [Google Scholar]
- Coleman MJ, Mooney R. Synaptic transformations underlying highly selective auditory representations of learned birdsong. J Neurosci. 2004;24:7251–7265. doi: 10.1523/JNEUROSCI.0947-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dave AS, Yu AC, Margoliash D. Behavioral state modulation of auditory activity in a vocal motor system. Science. 1998;282:2250–2254. doi: 10.1126/science.282.5397.2250. [DOI] [PubMed] [Google Scholar]
- Doupe AJ. Song- and order-selective neurons in the songbird anterior forebrain and their emergence during vocal development. J Neurosci. 1997;17:1147–1167. doi: 10.1523/JNEUROSCI.17-03-01147.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doupe AJ, Kuhl PK. Birdsong and human speech: common themes and mechanisms. Annu Rev Neurosci. 1999;22:567–631. doi: 10.1146/annurev.neuro.22.1.567. [DOI] [PubMed] [Google Scholar]
- Doya K, Sejnowski TJ. The new cognitive neurosciences. Cambridge, MA: Massachusetts Institute of Technology; 2000. A computational model of avian song learning; pp. 469–488. [Google Scholar]
- Eales LA. Song learning in zebra finches: some effects of song model availability on what is learnt and when. Anim Behav. 1985;33:1293–1300. doi: 10.1016/S0003-3472(85)80189-5. [DOI] [Google Scholar]
- Eliades SJ, Wang X. Neural substrates of vocalization feedback monitoring in primate auditory cortex. Nature. 2008;453:1102–1106. doi: 10.1038/nature06910. [DOI] [PubMed] [Google Scholar]
- Fee MS, Goldberg JH. A hypothesis for basal ganglia-dependent reinforcement learning in the songbird. Neuroscience. 2011;198:152–170. doi: 10.1016/j.neuroscience.2011.09.069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiete IR, Fee MS, Seung HS. Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances. J Neurophysiol. 2007;98:2038–2057. doi: 10.1152/jn.01311.2006. [DOI] [PubMed] [Google Scholar]
- Foster EF, Bottjer SW. Lesions of a telencephalic nucleus in male zebra finches: influences on vocal behavior in juveniles and adults. J Neurobiol. 2001;46:142–165. doi: 10.1002/1097-4695(20010205)46:2<142::AID-NEU60>3.0.CO%3B2-R. [DOI] [PubMed] [Google Scholar]
- Fritz JB, David SV, Radtke-Schuller S, Yin P, Shamma SA. Adaptive, behaviorally gated, persistent encoding of task-relevant auditory information in ferret frontal cortex. Nat Neurosci. 2010;13:1011–1019. doi: 10.1038/nn.2598. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gobes SM, Zandbergen MA, Bolhuis JJ. Memory in the making: localized brain activation related to song learning in young songbirds. Proc Biol Sci. 2010;277:3343–3351. doi: 10.1098/rspb.2010.0870. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Graybiel AM. Habits, rituals and the evaluative brain. Annu Rev Neurosci. 2008;31:359–387. doi: 10.1146/annurev.neuro.29.051605.112851. [DOI] [PubMed] [Google Scholar]
- Hahnloser RH, Kotowicz A. Auditory representations and memory in birdsong learning. Curr Opin Neurobiol. 2010;20:332–339. doi: 10.1016/j.conb.2010.02.011. [DOI] [PubMed] [Google Scholar]
- Harris KD, Henze DA, Csicsvari J, Hirase H, Buzsáki G. Accuracy of tetrode spike separation as determined by simultaneous intracellular and extracellular measurements. J Neurophysiol. 2000;84:401–414. doi: 10.1152/jn.2000.84.1.401. [DOI] [PubMed] [Google Scholar]
- Hensch TK, Fagiolini M, Mataga N, Stryker MP, Baekkeskov S, Kash SF. Local GABA circuit control of experience-dependent plasticity in developing visual cortex. Science. 1998;282:1504–1508. doi: 10.1126/science.282.5393.1504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hessler NA, Doupe AJ. Singing-related neural activity in a dorsal forebrain–basal ganglia circuit of adult zebra finches. J Neurosci. 1999;19:10461–10481. doi: 10.1523/JNEUROSCI.19-23-10461.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hikosaka O, Nakahara H, Rand MK, Sakai K, Lu X, Nakamura K, Miyachi S, Doya K. Parallel neural networks for learning sequential procedures. Trends Neurosci. 1999;22:464–471. doi: 10.1016/S0166-2236(99)01439-3. [DOI] [PubMed] [Google Scholar]
- Immelmann K. Song development in the zebra finch and other estrildid finches. In: Hinde RA, editor. Bird vocalisations. Cambridge, UK: Cambridge UP; 1969. pp. 61–74. [Google Scholar]
- Iyengar S, Viswanathan SS, Bottjer SW. Development of topography within song control circuitry of zebra finches during the sensitive period for song learning. J Neurosci. 1999;19:6037–6057. doi: 10.1523/JNEUROSCI.19-14-06037.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johnson F, Bottjer SW. Growth and regression of thalamic efferents in the song-control system of male zebra finches. J Comp Neurol. 1992;326:442–450. doi: 10.1002/cne.903260309. [DOI] [PubMed] [Google Scholar]
- Johnson F, Sablan MM, Bottjer SW. Topographic organization of a forebrain pathway involved with vocal learning in zebra finches. J Comp Neurol. 1995;358:260–278. doi: 10.1002/cne.903580208. [DOI] [PubMed] [Google Scholar]
- Kojima S, Doupe AJ. Song selectivity in the pallial-basal ganglia song circuit of zebra finches raised without tutor song exposure. J Neurophysiol. 2007;98:2099–2109. doi: 10.1152/jn.00916.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Konishi M. The role of auditory feedback in the control of vocalization in the white crowned sparrow. Zeitschrift fur Tierpsychologie. 1965;22:770–783. [PubMed] [Google Scholar]
- Livingston FS, Mooney R. Development of intrinsic and synaptic properties in a forebrain nucleus essential to avian song learning. J Neurosci. 1997;17:8997–9009. doi: 10.1523/JNEUROSCI.17-23-08997.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- London SE, Clayton DF. Functional identification of sensory mechanisms required for developmental song learning. Nat Neurosci. 2008;11:579–586. doi: 10.1038/nn.2103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mann NI, Slater PJ. Song tutor choice by zebra finches in aviaries. Anim Behav. 1995;49:811–820. doi: 10.1016/0003-3472(95)80212-6. [DOI] [PubMed] [Google Scholar]
- Margoliash D, Fortune ES. Temporal and harmonic combination-sensitive neurons in the zebra finch's HVC. J Neurosci. 1992;12:4309–4326. doi: 10.1523/JNEUROSCI.12-11-04309.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marler P. The voice of the Chaffinch and its function as a language. Ibis. 1956;98:231–261. [Google Scholar]
- Marler P. A comparative approach to vocal learning: Song development in white-crowned sparrows. J Comp Physiol Psychol. 1970;71:1–25. doi: 10.1037/h0029144. [DOI] [Google Scholar]
- Miller JE, Hilliard AT, White SA. Song practice promotes acute vocal variability at a key stage of sensorimotor learning. PLoS One. 2010;5:e8592. doi: 10.1371/journal.pone.0008592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miller-Sims VC, Bottjer SW. Auditory experience refines cortico-basal ganglia inputs to motor cortex via re-mapping of single axons during vocal learning in zebra finches. J Neurophysiol. 2012;107:1142–1156. doi: 10.1152/jn.00614.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miyachi S, Hikosaka O, Lu X. Differential activation of monkey striatal neurons in the early and late stages of procedural learning. Exp Brain Res. 2002;146:122–126. doi: 10.1007/s00221-002-1213-7. [DOI] [PubMed] [Google Scholar]
- Mushiake H, Saito N, Sakamoto K, Itoyama Y, Tanji J. Activity in the lateral prefrontal cortex reflects multiple steps of future events in action plans. Neuron. 2006;50:631–641. doi: 10.1016/j.neuron.2006.03.045. [DOI] [PubMed] [Google Scholar]
- Nakahara H, Doya K, Hikosaka O. Parallel cortico-basal ganglia mechanisms for acquisition and execution of visuomotor sequences—a computational approach. J Cogn Neurosci. 2001;13:626–647. doi: 10.1162/089892901750363208. [DOI] [PubMed] [Google Scholar]
- Nick TA, Konishi M. Neural auditory selectivity develops in parallel with song. J Neurobiol. 2005a;62:469–481. doi: 10.1002/neu.20115. [DOI] [PubMed] [Google Scholar]
- Nick TA, Konishi M. Neural song preference during vocal learning in the zebra finch depends on age and state. J Neurobiol. 2005b;62:231–242. doi: 10.1002/neu.20087. [DOI] [PubMed] [Google Scholar]
- O'Doherty J, Dayan P, Schultz J, Deichmann R, Friston K, Dolan RJ. Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science. 2004;304:452–454. doi: 10.1126/science.1094285. [DOI] [PubMed] [Google Scholar]
- Olveczky BP, Andalman AS, Fee MS. Vocal experimentation in the juvenile songbird requires a basal ganglia circuit. PLoS Biol. 2005;3:e153. doi: 10.1371/journal.pbio.0030153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pedreira C, Martinez J, Ison MJ, Quian Quiroga R. How many neurons can we see with current spike sorting algorithms? J Neurosci Methods. 2012;211:58–65. doi: 10.1016/j.jneumeth.2012.07.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Phan ML, Pytte CL, Vicario DS. Early auditory experience generates long-lasting memories that may subserve vocal learning in songbirds. Proc Natl Acad Sci U S A. 2006;103:1088–1093. doi: 10.1073/pnas.0510136103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pinaud R, Saldanha CJ, Wynne RD, Lovell PV, Mello CV. The excitatory thalamo-“cortical” projection within the song control system of zebra finches is formed by calbindin-expressing neurons. J Comp Neurol. 2007;504:601–618. doi: 10.1002/cne.21457. [DOI] [PubMed] [Google Scholar]
- Pinaud R, Terleph TA, Tremere LA, Phan ML, Dagostin AA, Leão RM, Mello CV, Vicario DS. Inhibitory network interactions shape the auditory processing of natural communication signals in the songbird auditory forebrain. J Neurophysiol. 2008;100:441–455. doi: 10.1152/jn.01239.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prather JF, Peters S, Nowicki S, Mooney R. Persistent representation of juvenile experience in the adult songbird brain. J Neurosci. 2010;30:10586–10598. doi: 10.1523/JNEUROSCI.6042-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ravbar P, Lipkind D, Parra LC, Tchernichovski O. Vocal exploration is locally regulated during song learning. J Neurosci. 2012;32:3422–3432. doi: 10.1523/JNEUROSCI.3740-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Redgrave P, Vautrelle N, Reynolds JN. Functional properties if the basal ganglia's re-entrant loop architecture: selection and reinforcement. Neuroscience. 2011;198:138–151. doi: 10.1016/j.neuroscience.2011.07.060. [DOI] [PubMed] [Google Scholar]
- Reiner A, Perkel DJ, Bruce LL, Butler AB, Csillag A, Kuenzel W, Medina L, Paxinos G, Shimizu T, Striedter G, Wild M, Ball GF, Durand S, Güntürkün O, Lee DW, Mello CV, Powers A, White SA, Hough G, Kubikova L, Smulders TV, Wada K, Dugas-Ford J, Husband S, Yamamoto K, Yu J, Siang C, Jarvis ED. Revised nomenclature for avian telencephalon and some related brainstem nuclei. J Comp Neurol. 2004;473:377–414. doi: 10.1002/cne.20118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roper A, Zann R. The onset of song learning and song tutor selection in fledgling zebra finches. Ethology. 2006;112:458–470. doi: 10.1111/j.1439-0310.2005.01169.x. [DOI] [Google Scholar]
- Rosen MJ, Mooney R. Intrinsic and extrinsic contributions to auditory selectivity in a song nucleus critical for vocal plasticity. J Neurosci. 2000;20:5437–5448. doi: 10.1523/JNEUROSCI.20-14-05437.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rosen MJ, Mooney R. Inhibitory and excitatory mechanisms underlying auditory responses to learned vocalizations in the songbird nucleus HVC. Neuron. 2003;39:177–194. doi: 10.1016/S0896-6273(03)00357-X. [DOI] [PubMed] [Google Scholar]
- Saito N, Mushiake H, Sakamoto K, Itoyama Y, Tanji J. Representation of immediate and final behavioral goals in the monkey prefrontal cortex during an instructed delay period. Cereb Cortex. 2005;15:1535–1546. doi: 10.1093/cercor/bhi032. [DOI] [PubMed] [Google Scholar]
- Samejima K, Doya K. Multiple representations of belief states and action values in cortico-basal ganglia loops. Ann N Y Acad Sci. 2007;1104:213–228. doi: 10.1196/annals.1390.024. [DOI] [PubMed] [Google Scholar]
- Scharff C, Nottebohm F. A comparative study of the behavioral deficits following lesions of various parts of the zebra finch song system: implications for vocal learning. J Neurosci. 1991;11:2896–2913. doi: 10.1523/JNEUROSCI.11-09-02896.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shank SS, Margoliash D. Sleep and sensorimotor integration during early vocal learning in a songbird. Nature. 2009;458:73–77. doi: 10.1038/nature07615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Solis MM, Doupe AJ. Anterior forebrain neurons develop selectivity by an intermediate stage of birdsong learning. J Neurosci. 1997;17:6447–6462. doi: 10.1523/JNEUROSCI.17-16-06447.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Solis MM, Doupe AJ. Contributions of tutor and bird's own song experience to neural selectivity in the songbird anterior forebrain. J Neurosci. 1999;19:4559–4584. doi: 10.1523/JNEUROSCI.19-11-04559.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Solis MM, Doupe AJ. Compromised neural selectivity for song in birds with impaired sensorimotor learning. Neuron. 2000;25:109–121. doi: 10.1016/S0896-6273(00)80875-2. [DOI] [PubMed] [Google Scholar]
- Sutton RSB, Andrew G. Reinforcement learning: an introduction. Cambridge, MA: Massachusetts Institute of Technology; 1998. [Google Scholar]
- Suzuki WA, Miller EK, Desimone R. Object and place memory in the macaque entorhinal cortex. J Neurophysiol. 1997;78:1062–1081. doi: 10.1152/jn.1997.78.2.1062. [DOI] [PubMed] [Google Scholar]
- Tanaka SC, Doya K, Okada G, Ueda K, Okamoto Y, Yamawaki S. Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops. Nat Neurosci. 2004;7:887–893. doi: 10.1038/nn1279. [DOI] [PubMed] [Google Scholar]
- Tchernichovski O, Nottebohm F, Ho CE, Pesaran B, Mitra PP. A procedure for an automated measurement of song similarity. Anim Behav. 2000;59:1167–1176. doi: 10.1006/anbe.1999.1416. [DOI] [PubMed] [Google Scholar]
- Thorn CA, Atallah H, Howe M, Graybiel AM. Differential dynamics of activity changes in dorsolateral and dorsomedial striatal loops during learning. Neuron. 2010;66:781–795. doi: 10.1016/j.neuron.2010.04.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thorpe WH. The learning of song patterns by birds, with especial reference to the song of the Chaffinch Fringilla coelebs. Ibis. 1958;100:535–570. [Google Scholar]
- Troyer TW, Doupe AJ. An associational model of birdsong sensorimotor learning. I. Efference copy and the learning of song syllables. J Neurophysiol. 2000:1204–1223. doi: 10.1152/jn.2000.84.3.1204. [DOI] [PubMed] [Google Scholar]
- Warren TL, Tumer EC, Charlesworth JD, Brainard MS. Mechanisms and time course of vocal learning and consolidation in the adult songbird. J Neurophysiol. 2011;106:1806–1821. doi: 10.1152/jn.00311.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Williams H. Models for song learning in the zebra finch: fathers or others? Anim Behav. 1990;39:745–757. doi: 10.1016/S0003-3472(05)80386-0. [DOI] [Google Scholar]
- Yazaki-Sugiyama Y, Mooney R. Sequential learning from multiple tutors and serial retuning of auditory neurons in a brain area important to birdsong learning. J Neurophysiol. 2004;92:2771–2788. doi: 10.1152/jn.00467.2004. [DOI] [PubMed] [Google Scholar]
- Yin HH, Knowlton BJ. The role of the basal ganglia in habit formation. Nat Rev Neurosci. 2006;7:464–476. doi: 10.1038/nrn1919. [DOI] [PubMed] [Google Scholar]
- Yin HH, Mulcare SP, Hilário MR, Clouse E, Holloway T, Davis MI, Hansson AC, Lovinger DM, Costa RM. Dynamic reorganization of striatal circuits during the acquisition and consolidation of a skill. Nat Neurosci. 2009;12:333–341. doi: 10.1038/nn.2261. [DOI] [PMC free article] [PubMed] [Google Scholar]