Skip to main content
eLife logoLink to eLife
. 2014 Jun 16;3:e02152. doi: 10.7554/eLife.02152

A role for descending auditory cortical projections in songbird vocal learning

Yael Mandelblat-Cerf 1,*,, Liora Las 1,, Natalia Denisenko 1,, Michale S Fee 1
Editor: Ronald L Calabrese2
PMCID: PMC4113997  PMID: 24935934

Abstract

Many learned motor behaviors are acquired by comparing ongoing behavior with an internal representation of correct performance, rather than using an explicit external reward. For example, juvenile songbirds learn to sing by comparing their song with the memory of a tutor song. At present, the brain regions subserving song evaluation are not known. In this study, we report several findings suggesting that song evaluation involves an avian 'cortical' area previously shown to project to the dopaminergic midbrain and other downstream targets. We find that this ventral portion of the intermediate arcopallium (AIV) receives inputs from auditory cortical areas, and that lesions of AIV result in significant deficits in vocal learning. Additionally, AIV neurons exhibit fast responses to disruptive auditory feedback presented during singing, but not during nonsinging periods. Our findings suggest that auditory cortical areas may guide learning by transmitting song evaluation signals to the dopaminergic midbrain and/or other subcortical targets.

DOI: http://dx.doi.org/10.7554/eLife.02152.001

Research organism: other

eLife digest

Most new skills, from playing a sport to learning a language, are acquired through a gradual process of trial and error. While some of this learning is driven by direct external rewards, such as praise, much of it occurs when the individual compares their current performance with their own impression of what a ‘correct’ performance should be. The way that the brain responds to external rewards is relatively well understood, but much less is known about the processes used by the brain to evaluate its own performance.

One way to study this process is to examine how songbirds learn their songs. While in the nest, young male birds memorize another bird's song, usually that of their father. They learn to sing by comparing their own vocals with this memorized template, tweaking their song until the two versions match. Now, Mandelblat-Cerf et al. have identified a pathway in the brain that enables the birds to make this comparison and to use any discrepancies to improve their subsequent attempts.

Anatomical labeling experiments revealed that a brain structure called the arcopallium has a key role in this process. The ventral part of this structure (known as AIV) receives inputs from the auditory cortex—meaning that it has access to the bird’s own song—and then forms connections with a specific group of neurons in the midbrain. These midbrain neurons, which communicate using the chemical transmitter dopamine, project to brain regions that ultimately control the movements involved in singing. This means that the AIV is ideally positioned to be able to evaluate and then adjust the song as required.

Consistent with this possibility, young zebra finches were less able to imitate a template song if their AIV was destroyed before they had started practicing. By contrast, destroying the AIV in adult birds who had already learned their song did not impair performance, indicating that the arcopallium circuit supports song learning rather than singing per se. Finally, recordings of neurons in the AIV made during singing revealed that this brain area sends signals about discrepancies between what the young bird tries to sing and what he hears himself sing.

In addition to providing further clues as to how the songbirds learn their songs, this work also highlights the fact that dopaminergic neurons in the midbrain—which are best known for being involved in our response to external rewards such as food and drugs—also contribute to learning that is driven internally.

DOI: http://dx.doi.org/10.7554/eLife.02152.002

Introduction

Most human behaviors, such as speech, music, or athletic performance, are learned through a gradual process of trial and error. In all of these behaviors, the motor actions are shaped during learning by an internal model of good performance (Wolpert et al., 1995). While a great deal has been learned about the neural mechanisms by which external rewards, such as food or juice drops, are represented in the brain and might shape future behavior (Schultz et al., 1997; Hikosaka, 2007), little is known about how the brain evaluates its own performance, where such self-evaluation is computed and how these signals might shape future behavior.

Here we use the songbird as a model system to understand how complex behaviors can be learned by self-evaluation of motor performance, rather than relying on explicit external rewards such as food or social reinforcement. Songbirds learn their vocalizations by storing a memory, or ‘template’, of their tutor's song (Doupe and Kuhl, 1999). By listening to themselves sing and comparing their own song to this template (Konishi, 1965), they gradually converge to what can be a nearly exact copy of the tutor's song. Vocal learning requires a basal-ganglia forebrain circuit called the anterior forebrain pathway (AFP), the output of which projects to cortical premotor vocal areas (Nottebohm et al., 1976; Bottjer et al., 1984; Scharff and Nottebohm, 1991). The AFP is crucial for driving learned changes in song and shaping plasticity in the song motor pathway (Brainard and Doupe, 2000; Andalman and Fee, 2009; Warren et al., 2011). A key component of this vocal learning circuit is Area X, a basal ganglia homologue containing both striatal and pallidal components (Person et al., 2008).

Of critical importance in understanding the mechanisms of vocal sensorimotor learning in the songbird is to determine where song is evaluated and how the resulting evaluation signals are transmitted to the AFP to shape learning. Some hypotheses have emphasized the possible role of HVC (used as a proper name), a premotor cortical circuit that receives auditory inputs and also projects to Area X (Troyer and Doupe, 2000; Mooney, 2006; Prather et al., 2008; Roberts et al., 2012). While the involvement of HVC in evaluating ongoing song remains an open question, several lines of evidence suggest that HVC does not transmit error-related signals to the AFP (Hessler and Doupe, 1999; Kozhevnikov and Fee, 2007), or receive auditory inputs (Hamaguchi et al., 2014) during singing.

Another possibility is suggested by the large projection to Area X from dopaminergic nuclei in the midbrain—the ventral tegmental area (VTA) and substantia nigra pars compacta (SNc) (Person et al., 2008; Figure 1A). Inspired by the recently hypothesized role of dopaminergic signaling in reinforcement learning (Houk et al., 1994; Schultz et al., 1997; Bayer and Glimcher, 2005; Tsai et al., 2009), we and others have proposed that this dopaminergic input to Area X may play a role in song learning by signaling vocal errors (Hara et al., 2007; Gale et al., 2008; Gale and Perkel, 2010; Fee and Goldberg, 2011). Notably, a recent anatomical study in songbirds has described a projection to the dopaminergic midbrain from the arcopallium (Gale et al., 2008), an avian cortical region containing subtelencephalically projecting neurons analogous to those in deep layer-5 of mammalian cortex (Dugas-Ford et al., 2012; Karten, 2013).

Figure 1. Characterization of avian ‘cortical’ areas projecting to the dopaminergic midbrain.

(A) Left panel: schematic of the songbird brain (in sagittal view) showing classical song control brain areas. Nuclei HVC and RA form the descending song motor pathway, necessary for song production. Nucleus RA is in the arcopallium, a cortical-like region homologous to layer-5 neurons of the mammalian cortex. Also shown is the anterior forebrain pathway (AFP), a circuit necessary for vocal learning but not vocal production. The AFP consists of a basal ganglia homologue Area X, thalamic nucleus DLM, and cortical-like nucleus LMAN, which projects to RA. Right panel: schematic showing a set of pathways that are the focus of this paper, including a previously described projection to Area X from neurons in the dopaminergic midbrain nuclei VTA and SNc (Gale et al., 2008). Also shown is AIV, a part of the intermediate arcopallium found to project to VTA and SNc. Projections to AIV from auditory cortical areas L1, CM, and HVC-shelf are elucidated here. (B) Sagittal section showing the injection site of a retrograde tracer (CTB, white color, arrow) within VTA/SNc (TH-stained neurons, red). (C) Three sagittal sections through the arcopallium showing retrogradely labeled neurons in AIV (fluorescence, white) and relation to RA (dark field image, purple). Panel at left is most lateral; sections 200 µm apart. (D) Upper panel: image of injection site of a GFP-expressing virus (HSV, fluorescence white) in the anterior part of AIV, showing relation to RA (dark field image). Bottom panel: sagittal section of VTA/SNc showing anterogradely labeled fibers from AIV (green) and TH-stained neurons (red). (E) Same as D, with an injection site in AIV ventral to RA. All scale bars 200 µm unless indicated otherwise. In panels AE, anterior is left; dorsal is up. (F) Coronal section showing axons in RA and Ad (green), anterogradely labeled from LMAN and LMAN-shell, respectively (Dextran 10 MW Alexa 488). AIV neurons (white) labeled by injection of a retrograde tracer (CTB) in VTA and SNc (medial, left; dorsal, up).

DOI: http://dx.doi.org/10.7554/eLife.02152.003

Figure 1.

Figure 1—figure supplement 1. Retrograde labeling of AIV neurons from VTA and SNc.

Figure 1—figure supplement 1.

(A) Site of CTB injection (white, 13.8 nl total injected volume) into VTA (tyrosine hydroxylase immunostain, red). (B) Serial sagittal sections through the arcopallium showing retrogradely labeled neurons (white) in the vicinity of RA. (Most lateral image is #1,100 µm between slices). (C and D) Site of CTB injection into SNc and retrograde label (white) in the arcopallium. (EH) In each pair, the image on the left shows the injection site in VTA/SNc. The image on the right shows a single sagittal section through the arcopallium near the lateral third of RA (corresponding to sections 2 or 3 in panels B and D). We refer collectively to the regions containing neurons retrogradely labeled from the dopaminergic midbrain (VTA/SNc) as the ventral intermediate arcopallium (AIV). Scale bars for all images are 200 µm.

Here we set out to examine the role of the cortical area projecting to VTA/SNc in song learning. First, we characterized the spatial extent of the arcopallial neurons projecting to VTA or SNc, and characterized the inputs to these neurons from auditory cortical fields. We then carried out lesions targeted to this area to examine the effect on tutor imitation and on vocal plasticity that occurs after deafening. Finally, we recorded from VTA/SNc-projecting neurons in the arcopallium of singing birds and observed fast transient responses to disruptive auditory feedback during singing. Altogether, our findings suggest that descending auditory cortical projections may play a role in song evaluation via projections to the dopaminergic midbrain and/or other subcortical targets.

Results

An auditory cortical pathway to the dopaminergic midbrain

To elucidate the spatial pattern of cortical neurons projecting to the dopaminergic midbrain in songbirds, we made small injections of a retrograde tracer (cholera toxin subunit β, CTB) into VTA and SNc (n = 12 zebra finches, Figure 1B, see ‘Materials and methods’). Consistent with an earlier report (Gale et al., 2008), we observed a distinct mass of retrogradely labeled cell bodies within the ventral-most extent of the intermediate arcopallium. Labeled neurons were distributed in a complex pattern that was highly consistent across birds (Figure 1C, Figure 1—figure supplement 1). The main cluster of labeled neurons was arranged in a stem-like column ventral to RA (robust nucleus of the arcopallium), while a distinct ‘stripe’ of labeled neurons was observed 300–400 µm anterior to the main cluster. A smaller number of labeled neurons formed a thin shell around the dorsal surface of RA, as previously described (Gale et al., 2008). We will refer specifically and collectively to the arcopallial areas containing neurons retrogradely labeled from the dopaminergic midbrain, described above, as the ventral intermediate arcopallium (AIV).

Injections of anterograde tracer (HSV viral vector expressing green fluorescent protein, GFP) within the anterior and posterior clusters of AIV neurons each revealed axonal arborization intermingled with TH-positive neurons in both VTA and SNc (Figure 1D,E, n = 6 birds).

It is important to elucidate the anatomical relation between the area we identify as AIV and the dorsal arcopallium (Ad), which has previously been implicated in vocal learning (Bottjer and Altenau, 2010), and also in locomotory control (Feenders et al., 2008; Jarvis et al., 2013) (called lateral intermediate arcopallium, LAI, in the latter studies). Ad was labeled by injections of anterograde tracer (Alexa 488 conjugated dextran 10 MW) into the nidopallium lateral to LMAN (LMAN-shell), and AIV was retrogradely labeled in the same birds by injection of CTB into VTA and SNc (n = 8 hemispheres). In coronally sectioned tissue, Ad was seen as a band of labeled fibers extending lateral to RA (Figure 1F), as previously described. Neurons retrogradely labeled from VTA/SNc were visible as a wedge of cell bodies ventrolateral to RA and ventromedial to Ad. Few labeled cell bodies were observed within the region of labeled fibers in Ad, although some retrogradely labeled neurons were seen in the thin ‘neck’ of labeled fibers connecting RA and Ad. Injections of retrograde tracer (CTB) into the anterior or posterior parts of AIV (n = 5) revealed no retrogradely labeled neurons in the nidopallium adjacent to LMAN (LMAN-shell). Thus, AIV and Ad appeared to be largely distinct non-overlapping regions.

To further elucidate the possible cortical afferents to AIV, we made injections of retrograde tracer (CTB) into different locations anterior and ventral to RA (n = 10 birds). Following injections into the anterior parts of AIV (n = 5 birds), retrogradely labeled neurons were reliably observed in auditory cortical areas HVC-shelf, caudal mesopallium (CM) and L1 (Figure 2A–F, Figure 2—figure supplement 1A–C) (Kelley and Nottebohm, 1979; Vates et al., 1996; Mello et al., 1998). We note that, while these previous studies have identified inputs to ventral arcopallium (RA-cup) from L1 and HVC-shelf, the projection described here from CM is novel. Following injections into the more posterior parts of AIV (n = 5 birds), retrogradely labeled neurons were observed in the caudal nidopallium (NC) posterior to HVC-shelf (Figure 2G,H).

Figure 2. Cortical inputs to ventral intermediate arcopallium (AIV): retrograde and anterograde tracing.

(A) Injection of a retrograde tracer (CTB) anterior to RA in the vicinity of AIV. (B) Retrograde label in cortical auditory areas L1 and CM resulting from the injection shown above. (C) Labeled neurons in HVC-shelf of the same bird. (DF) Same as panels AC. Injection of CTB was targeted to the anterior ‘stripe’ part of AIV. (G) Injection of CTB into the posterior part of AIV, directly ventral to RA. (H) Retrogradely labeled neurons in caudal nidopallium (NC) resulting from the injection shown above. (IL) Anterograde tracing from auditory cortical areas L1, CM, and HVC-shelf. (I) Upper panel—injection of a GFP-expressing virus (HSV, green) into auditory field L1. Middle panel shows labeled axons in the vicinity of AIV neurons retrogradely labeled (purple) from VTA/SNc. Color in RA is due to auto fluorescence, not label. Bottom panel shows higher magnification view from image above. (J) Same as (I), but the injection of HSV-GFP was made into the auditory caudal mesopallium (CM). (K) Same as (I), but the injection of HSV-GFP was made into HVC-shelf. Scale bars: 100 µm for lower panels of IL; 200 µm for all other panels.

DOI: http://dx.doi.org/10.7554/eLife.02152.005

Figure 2.

Figure 2—figure supplement 1. Cortical inputs to ventral intermediate arcopallium (AIV).

Figure 2—figure supplement 1.

(A) Injection site of CTB in anterior AIV (see Figure 1—figure supplement 1B,D panels 2–4). (B) Retrogradely labeled cell bodies in L1 and CM resulting from the injection shown in panel A. Retrograde label was also observed in L3 following injections of tracer anterior to RA. But the presence of labeled neurons in L3 was less reliable than in L1 and CM. Further studies will be required to resolve the connectivity between L3 and AIV. (C) Retrogradely labeled cell bodies in posterior HVC-shelf resulting from the injection shown in panel A. (DH) Upper panels—injection of a GFP-expressing virus (HSV, green) into L1(D), CM (E), HVC-shelf (F) and NC (G and H). Middle panels show labeled fibers arborizing in AIV in the vicinity of neurons retrogradely labeled from VTA/SNc (purple). Bottom panels show a higher-magnification view of the sections shown in the middle panels. Scale bars: 100 µm for the bottom images in panels DI; 200 µm for all other images.

To confirm that neurons in these four areas terminate within AIV, we carried out anterograde tracing using a GFP-expressing virus (HSV-GFP). Injections of anterograde tracer into the caudal nidopallium (NC) posterior to HVC-shelf revealed axonal arborization primarily in the posterior ‘stem’ region of AIV (Figure 2—figure supplement 1G–H, n = 8 birds). Injections of anterograde tracer into auditory cortical areas L1, CM, and HVC-shelf all revealed axonal arborization overlapping with neurons retrogradely labeled from VTA/SNc, particularly within the anterior ‘stripe’ region of AIV (Figure 2I–L, Figure 2—figure supplement 1D–F; L1 and CM, n = 4 birds each; HVC-shelf, n = 10 birds). In these experiments, axonal arbors were not restricted to the region of retrogradely labeled neurons in AIV, but were also seen anterior to the AIV ‘stripe’ and in the area directly anterior to RA.

The functional connectivity of projections from the auditory areas L1, CM, and HVC-shelf was further confirmed by electrically stimulating these areas and demonstrating short latency activation of multi-unit recording sites in AIV and of antidromically identified VTA/SNc-projecting AIV neurons (Figure 3). Together these findings suggest the existence of multiple descending pathways by which auditory information could reach the dopaminergic midbrain and other downstream auditory targets of the arcopallium (Kelley and Nottebohm, 1979; Vates et al., 1996; Mello et al., 1998).

Figure 3. Electrophysiological verification of functional connectivity.

Figure 3.

Electrical stimulation in auditory cortical areas L1, CM, and HVC-shelf drives spiking in VTA/SNC-projecting neurons in AIV. (A) Schematic at left illustrates the location of the stimulating electrodes (red) in L1 and the location of the recording electrode (blue) in AIV. Middle traces: responses of 3 antidromically identified AIV neurons to L1 stimulation (overlayed responses from ten trials). Right traces: antidromic response of the same neurons to electrical stimulation in VTA/SNc. (B and C) The panels are analogous to those shown in (A), except the stimulation electrode is placed in caudal mesopallium (CM) or in the posterior part of HVC-shelf.

DOI: http://dx.doi.org/10.7554/eLife.02152.007

Lesions of AIV disrupt song learning

To test the role of AIV in vocal learning and imitation, we carried out excitotoxic lesions targeted to AIV in young zebra finches, and examined the subsequent effect on song imitation. Birds were lesioned after being tutored in their home cage but prior to evidence of vocal imitation (Figure 4A,B, 48–51 days post hatch, dph). After the lesion surgery (‘Materials and methods’), birds were maintained in isolation and their songs were recorded until adulthood (90 dph). Control siblings underwent the same tutoring protocol but did not receive AIV lesions. A quantification of song imitation (by analysis of pupil-tutor similarity, see ‘Materials and methods’) revealed that the AIV-lesioned birds exhibited deficits in vocal learning (Figure 4C–F). Lesioned birds had a significantly lower similarity to their tutor (imitation score) than their unlesioned control siblings (Figure 4—figure supplement 1A, paired t test, p=0.0048, rank-sum test between all controls and all AIV-lesioned birds p=0.0024). The overall distribution of acoustic features (Tchernichovski et al., 2000) was not significantly different between AIV-lesioned and control groups, suggesting that these lesions did not create gross abnormalities in song production (Figure 4—figure supplement 1B–D).

Figure 4. Bilateral lesions of AIV impair tutor imitation.

(A) Excitotoxic lesion targeted to AIV by injection of NMA. Lesion borders indicated by arrowheads (sagittal section; anterior left, dorsal up). (B) Schematic timeline of experimental protocol. (C) Song spectrograms of an adult bird (90 dph) that underwent bilateral lesion of AIV as a juvenile, compared to an unlesioned control sibling. Top: spectrogram of tutor song. Middle: two example song spectrograms of the lesioned bird (45% lesion, 0.205 imitation score). Bottom: two example song spectrograms of the control sibling (0.235 imitation score). (D) Same as panel C, but for a different pair of birds (lesioned bird, 66% lesion, 0.1566 imitation score; control sibling, 0.27 imitation score). (E and F) Song spectrograms of two additional adult birds that underwent lesions of AIV as juveniles. These birds did not have control siblings. (Bird in panel E, 61% lesion, 0.097 imitation score; bird in panel F, 77% lesion, 0.114 imitation score.)

DOI: http://dx.doi.org/10.7554/eLife.02152.008

Figure 4.

Figure 4—figure supplement 1. Effect of AIV lesions on song motor production and imitation.

Figure 4—figure supplement 1.

(A) AIV-lesioned birds showed significantly reduced tutor imitation score compared to unlesioned controls. Lines connect data from siblings. (B) Average distribution of song spectral features (FM, Weiner entropy, pitch goodness, and pitch) of AIV lesioned birds (black) and unlesioned controls (blue) at 90 dph. (C and D) AIV lesions produce no immediate effect on juvenile songs. (C) Song example of a juvenile bird just prior to AIV-lesion (50 dph; top two spectrograms) and in the first day of singing after AIV lesion (53 dph; bottom two spectrograms). This bird had a 55% AIV-lesion and ultimately exhibited a severe deficit in imitation (tutor imitation score 0.094 at 90 dph). (D) Distributions of several acoustic features prior the AIV lesion (blue) and after AIV lesion (red), for the bird shown in panel C. Numbers in each subplot show the cross correlation between the feature distributions before and after lesion. Across all AIV-lesioned birds (n = 17 birds) the correlation coefficients between pre- and post-lesion distributions were 0.9837 ± 0.03, 0.944 ± 0.06, 0.988 ± 0.13, and 0.96 ± 0.03 for FM, Weiner entropy, pitch goodness, and pitch, respectively.

We investigated the possibility that the observed learning deficits after AIV lesions might be caused by the loss of neurons in dorsal arcopallium (Ad), which is dorsolaterally adjacent to AIV. In a subset of juvenile birds (n = 7), lesions were targeted to the portion of Ad most likely affected by our AIV lesion procedure (Figure 5A, ‘Materials and methods’). These birds underwent the same tutor exposure and learning protocol as did the AIV-lesioned birds described above. The Ad-lesioned birds exhibited significantly higher song imitation scores compared to AIV-lesioned birds (Wilcoxon rank-sum test p=0.012), and no significant deficit in song imitation compared to non-lesioned controls (Figure 5B, Wilcoxon rank-sum test p=0.74). These birds exhibited no overt motor or locomotor deficits.

Figure 5. AIV lesions, but not Ad lesions, impair tutor imitation.

(A) Left: coronal section showing the relation between AIV neurons (retrogradely labeled from VTA/SNc, blue) and nucleus Ad (anterograde labeling from LMAN-shell, green; medial, left; dorsal, up) Right: coronal section showing excitotoxic lesion of Ad, revealed by loss of NeuN staining (white, note that the lesion affected some of the overlying nidopallium). (B) Tutor imitation score plotted as a function of the extent of AIV lesion for each bird (hollow circles, n = 17 AIV-lesioned birds). Ad-lesioned birds (n = 7) are shown as 0% lesion (hollow gray diamonds) since Ad lesions had minimal impact on AIV. No significant impairment of song-imitation was observed in Ad-lesioned birds as compared to unlesioned controls (blue diamonds, n = 19 birds). Solid line denotes least square fit to Ad-lesioned and AIV lesioned data points. Red dashed horizontal line indicates mean of all similarity comparisons between 20 unrelated adult birds (red shaded area indicates SEM) (C) Boxplot of the distribution of imitation scores of all control birds (cyan, unlesioned, and Ad-lesioned controls) and birds with large AIV-lesions (black, >50% lesion). Also shown is a boxplot of the distribution of similarity scores of all pairwise comparisons between 20 unrelated adult birds (red). Whiskers denote 10–90 percentile. Asterisk in each boxplot denotes the mean, heavy line denotes median. (D) Distribution of syllable self-similarity in AIV-lesioned birds (top) and in control birds (bottom, unlesioned, and Ad-lesioned controls combined). Dashed lines denote mean of distributions.

DOI: http://dx.doi.org/10.7554/eLife.02152.010

Figure 5.

Figure 5—figure supplement 1. Effect of AIV lesions on song imitation, development of song stereotypy, and rate of singing.

Figure 5—figure supplement 1.

The tutor imitation scores shown in main Figure 5B are based on a product of the acoustic similarity score the sequence similarity score (Mandelblat-Cerf and Fee, 2014). Here the contribution of acoustic and sequence similarity are shown separately in panels A and B respectively. (C) Plot of maturity index (a measure of song stereotypy; see ‘Note on methods’ below) as a function of AIV lesion size. Larger AIV lesions in juvenile birds resulted in lower stereotypy in the final adult song. In panels AC, dashed line denotes averaged scores of each metric for comparisons between unrelated tutors. Solid lines denote linear regression using least square. R-values are 0.69, 0.46, and 0.50 respectively. F-statistic p-values are noted. The song maturity index of AIV-lesioned birds is significantly smaller than that of control birds (unlesioned and Ad-lesioned birds; Wilcoxon rank-sum test, p=0.01 and p=0.025, respectively). (DE) To test whether the deficits in learning (tutor imitation scores) after AIV lesion are due to a reduced rate of singing (i.e., less practice), we quantified the amount of singing in AIV-lesioned birds and Ad-lesioned controls. (D) Total seconds of singing per day for Ad-lesioned birds (average of 7 birds, red) and AIV-lesioned birds (average of 17 birds, black). There was no significant difference in the accumulated amount of singing (t test, p=0.52), nor in the daily singing rate, between these populations, as tested every 5 days (t tests, p-values of all comparisons, 0.8>p>0.22). (E) Tutor imitation scores are not correlated with the amount of singing both for AIV-lesioned (upper panel, black), Ad-lesioned (upper panel, red), and unlesioned control birds (bottom panel, blue) (F-statistics all comparisons, p>0.4). Note on methods: our song similarity algorithm compares many renditions of the pupil song with the tutor motif. This comparison results in an acoustic similarity score and a sequence similarity score. Acoustic similarity estimates the similarity of each of the tutor's syllables to the best match in the pupil's song. The sequence similarity score represents the acoustic similarity of song segments preceding and following these closest-match syllables. The overall tutor imitation score is the product of the acoustic and sequence similarity scores. Song stereotypy (panel C) was quantified by measuring the peak of the spectrogram cross-correlation between different bouts of the birds song with itself (Aronov et al., 2008). This measure was shown to gradually increase through song development, and therefore we refer to it as the maturity index.

The lack of effect of Ad lesions, on either vocal learning or other aspects of motor behavior, led us to wonder whether complete lesions of Ad might have an observable effect. Larger lesions of Ad were carried out in three additional birds (the lesions were extended 0.2 mm more laterally, but within Ad). All three birds exhibited severe akinesia and immobility requiring the termination of the experiment. Because these birds did not sing, we were not able to assess the effects of the larger Ad lesions on vocal learning.

We wondered if the song imitation deficits produced by the AIV lesions described above were correlated with the size of the lesion. Several different lesion protocols were tested in this study, resulting in different extents of AIV lesion across the data set (n = 17 birds total). In all lesioned birds, the extent to which AIV was lesioned was quantified histologically at the age 90 dph (‘Materials and methods’). Because Ad lesions minimally impacted AIV, Ad-lesioned birds (n = 7) were included in this analysis as sham lesions (0% AIV lesion). Song imitation scores were negatively correlated with the fraction of AIV that was lesioned (Figure 5B, linear regression using least square, R = 0.63, F-statistic p=0.0008, Figure 5—figure supplement 1A,B).

Birds for which we estimated that more than half of AIV was lesioned (n = 8 birds, referred to as a large AIV lesion) produced poor imitations of the tutor song compared to controls (Figure 5C, rank-sum test, p=0.00016; Ad-lesioned and unlesioned control groups were pooled in this comparison). On average, birds with large AIV lesion exhibited an 87 ± 9% loss of imitation capacity compared to the controls (‘Materials and methods’). The distribution of tutor imitation scores for birds with large lesions was not significantly different from the distribution of pairwise similarity scores between unrelated adult birds in our colony (Figure 5C, Wilcoxon rank-sum test, p=0.31). Altogether, our findings suggest that the songs of birds with large AIV lesions were nearly as dissimilar to their own tutors as unrelated adult birds in our colony are to each other.

Importantly, the impaired vocal imitation in AIV-lesioned birds cannot be attributed to the amount of vocal practice. We compared the amount of singing in the period from surgery up to 90 dph for AIV-lesioned birds vs Ad-lesioned controls. No difference was observed between these groups (Figure 5—figure supplement 1D,E, t test, p=0.56). Not surprisingly, both Ad- and AIV-lesioned birds sang less than the unlesioned control group (ranksum p=0.003 and 0.016 respectively). However, within any experimental or control group, there was no correlation between the amount of singing and the degree of song imitation (F-statistics all comparisons, p>0.4).

In addition to impaired vocal imitation, AIV-lesioned birds developed adult songs that exhibited a significantly reduced overall song stereotypy (p=0.0045, rank-sum test, Figure 5—figure supplement 1C) (Aronov et al., 2008) and syllable stereotypy (p=0.01, rank-sum test, Figure 5D), compared to the control groups (Ad-lesioned and unlesioned control groups were individually significant, but were pooled in the comparisons stated above). Another common feature of the songs of AIV-lesioned birds was the presence of a large number of ‘atypical’ syllables that were uncharacteristically long in duration or contained patterns of acoustic modulation not usually observed in zebra finch song.

AIV lesions do not have the same effect as deafening

Auditory feedback is necessary for vocal learning in juvenile birds, as revealed by experiments showing that birds deafened as juveniles are unable to learn normal songs (Konishi, 1965). Because AIV is a downstream target of auditory brain areas, we wondered if the effects of AIV lesions might be similar to deafening. We took advantage of the fact that deafening results in a dramatic degradation of song in young adults that have largely finished learning their songs (Nordeen and Nordeen, 1992; Lombardino and Nottebohm, 2000). For example, deafening at the age of 75–80 dph resulted in a near complete loss of song structure, typically within 1 week (n = 14 birds, Figure 6A,D, see ‘Materials and methods’ for quantification). In contrast to the effect of deafening, bilateral lesions targeted to AIV (n = 6 birds, 75–80 dph) produced no significant song degradation within a 2-week period following the lesion (t test p=0.28, Figure 6B,D).

Figure 6. Effect of AIV lesion on adult song and on song degradation after deafening.

(A) Examples of song spectrograms from a bird deafened at the age 80 dph. Shown from top to bottom; song before surgery (deafening), first song post-surgery, song 1 week post-surgery, and song 2 weeks post-surgery. Note rapid degradation of song structure within 1 week after deafening. (B) Examples of song spectrograms from a bird that underwent complete bilateral lesion of AIV at the age 80 dph. Note the lack of song degradation. (C) Song spectrograms from a bird that underwent both bilateral lesion of AIV and deafening at the age 80 dph. (D) Plot of song self-similarity, normalized to the average self-similarity of pre-surgery song. Note that lesioned birds exhibited a reduced rate of song degradation after deafening, compared to deafening alone, 1 week and 2 weeks post-surgery (t test, p<0.01).

DOI: http://dx.doi.org/10.7554/eLife.02152.012

Figure 6.

Figure 6—figure supplement 1. Immediate effects on song of excitotoxic and electrolytic lesions in AIV.

Figure 6—figure supplement 1.

(A) Excitotoxic lesions in AIV do not cause a decrease in song variability, as would be expected if the lesions impacted LMAN axons entering RA. Example song spectrograms prior to AIV lesion surgery (top) and the first day of singing after the surgery (bottom). Data are from a bird that received bilateral lesion of AIV at the age 77 dph. (B) Quantification of song variability before and after AIV lesion in older juvenile birds (75–80 dph). Data are from the same birds shown in the main Figure 6. Song variability was assessed by computing the variance of song self-similarity scores. (CF) Partial electrolytic lesion anterior to RA produces an immediate effect on song acoustic structure, presumably due to the destruction of axons in the RA output tract. Such immediate effects on song structure were never observed after excitotoxic lesions in AIV, which also extended anterior to RA. (C) Sagittal section (NeuN, white) showing RA and the electrolytic lesion anterior to it (red arrow). (D) Song spectograms prior to the electrolytic lesion (upper panels) and the first day of singing post lesion (lower panels), for the same bird in panel C. (E) Plot of song self-similarity post-surgery, normalized to the average self-similarity of pre-surgery song (75–80dph). Post-electrolytic-lesion similarity scores (n = 5 birds, red) were significantly smaller than pre-lesion similarity scores (ranksum test, p=0.008), and significantly smaller than the normalized self-similarity scores of AIV-lesioned birds (n = 6 birds, gray). (ranksum test, p=0.0043). (F) Plot of song self-similarity of AIV-lesioned birds (gray) and electrolytically lesioned birds for 2 weeks after surgery. Self-similarity of electrolytically lesioned birds showed an immediate drop after lesion that remained significantly lower than pre-surgery, and lower than AIV-lesioned birds, for at least 2 weeks (all comparisons, p<0.01). Note on methods: bilateral electrolytic lesions were carried out by passing cathodal current of 70 µA for 45 s at multiple sites in the arcopallium anteroventral to RA. The head was oriented stereotaxically with the flat anterior portion of the skull rotated forward at a pitch of 45°. RA was completely mapped based on its characteristic electrophysiological signature (regular spiking interspersed with bursts). Once all of the borders of RA were localized, these were used to calculate the coordinates of the lesion sites. Two penetrations were made 200 µm anterior to the anterior edge of RA, spaced 300 µm from each other on the medial–lateral axis. For each penetration, two lesions were made in two depths: 200 µm above and below the bottom edge of RA. Songs were acquired for 2 weeks (n = 3 birds) or for 2 days (n = 2 birds), after which birds were sacrificed and the brain analyzed histologically.

Notably, AIV lesions had no immediate effect on song structure or song variability (paired t test p=0.52, Figure 6—figure supplement 1A,B), suggesting that AIV plays no direct role in song production or in the generation of vocal variability. These results also suggest that our AIV lesions did not significantly disrupt fibers of passage, either afferent fibers entering RA laterally from LMAN (lateral magnocellular nucleus of the anterior nidopallium, an area which conveys variability into the motor output during singing [Kao et al., 2005; Ölveczky et al., 2005]), or efferent fibers exiting RA rostrally (Figure 6—figure supplement 1C–F).

AIV lesions inhibit deafening-induced song degradation

It has been shown that lesions of the basal ganglia–forebrain learning circuit, AFP, largely prevent the degradation of song that occurs after deafening, leading to the view that such degradation is an active process requiring vocal learning circuitry (Brainard and Doupe, 2000; Nordeen and Nordeen, 2010). Since we have hypothesized that AIV is involved in vocal learning, we wondered whether lesions targeted to this area might slow the degradation of song after deafening. To test this idea, we carried out bilateral cochlear removal in combination with bilateral AIV lesions (n = 15, 75–80 dph). Songs were recorded for 2 weeks following this procedure and were analyzed for similarity to the pre-surgery song. Song degradation after deafening was significantly slower in AIV-lesioned birds than in birds with intact AIV (Figure 6C,D, comparison at 7 days and 14 days post-lesion, t test p=0.0074, 0.0096 respectively). The reduced rate of deafening-induced song degradation after AIV lesion suggests a role for descending auditory cortical pathways, including possibly those projecting to the dopaminergic midbrain, in mediating the plastic changes in the AFP and motor pathways that occurs after deafening.

AIV neurons respond to disruptive auditory feedback during singing

We next wanted to test if neurons in AIV—specifically those projecting to VTA/SNc—exhibit auditory responses during singing consistent with their hypothesized role in vocal learning. To address this question we used a motorized microdrive to record neural activity in AIV of singing zebra finches (n = 7 birds, 90-120 dph). Recordings were targeted based on electrophysiological mapping of RA borders (‘Materials and methods’). Recordings were made from 37 single units, of which 17 were antidromically identified and collision-tested VTA/SNc projectors (Figure 7A,B). We also recorded 13 multiunit sites exhibiting robust antidromic response to VTA/SNc stimulation (‘Materials and methods’, antidromic responses to VTA/SNc stimulation were restricted to the borders of AIV as defined by retrograde labeling, Figure 7—figure supplement 1). AIV single-units discharged at low rates during singing (1–10 Hz, Figure 7C), and exhibited only small changes in average firing rate compared to non-singing (Figure 7D, significant firing rate change in 15/37 neurons, paired t test p<0.05). Only a small fraction of AIV neurons exhibited significant firing rate modulations locked to the song motif (n = 4/37). These modulations did not appear to be premotor in nature, and were substantially weaker than those reported in premotor song control nuclei (Leonardo and Fee, 2005), or auditory cortical areas (Sen et al., 2001).

Figure 7. AIV neurons projecting to VTA/SNc exhibit error-related auditory responses during singing.

(A) Schematic diagram of recording setup showing stimulating electrode placed in VTA/SNc for antidromic identification of AIV neurons. (B) Voltage traces showing an antidromically evoked spike of a VTA/SNc-projecting AIV neuron (red spikes). Collision test shown in black traces. (C) Distribution of firing rates of AIV neurons during singing. (D) Firing rates of AIV neurons during singing vs non-singing (antidromically identified neurons, hollow circles; non-identified AIV neurons, filled circles). (E) Recording of an antidromically identified AIV neuron during singing (song spectrogram, top; extracellular voltage trace, bottom) with presentation of noise bursts (red arrows). (F) Motif-aligned spike raster plot of the AIV neuron from panel E during singing with no noise bursts presented. Each row in the raster plot corresponds to a rendition of the song motif. Each dot corresponds to a spike. Along each row, gray areas denote syllables and white areas denote silent gaps. Two vertical red lines denote motif onset and offset. Shaded area along the histogram denotes SEM. Note lack of singing-related firing rate modulations. (G) Spike raster plot and spike histogram aligned to noise bursts during singing. Red vertical line denotes noise onset. Other notations are as in F. Note the brief response to the noise burst during singing. (HJ) Another example of an antidromically identified AIV neuron (larger spike waveform) recorded during singing. (I) Motif-aligned raster plot of the AIV neuron from panel H. (J) Spike raster plot and histogram aligned to noise bursts during singing. Rasters are sorted by the duration of the syllable in which the noise burst occurred. Note that the response to noise burst occurs during different syllable types. (K) Average peri-stimulus histogram for all antidromically identified AIV neurons, aligned to noise burst onset during singing. (L) Distribution of response latency after noise burst (left) and response duration (right), for AIV neurons that showed a significant response to noise bursts.

DOI: http://dx.doi.org/10.7554/eLife.02152.014

Figure 7.

Figure 7—figure supplement 1. Simultaneous mapping of VTA/SNc-projecting neurons in AIV by antidromic activation and retrograde tracing.

Figure 7—figure supplement 1.

Does electrical stimulation in VTA/SNc antidromically activate only AIV neurons, or does it also activate other descending arcopallial pathways? To assess this question, we carried out mapping of antidromically stimulated activity in ventral arcopallium and compared this, in the same animal, to the pattern of AIV neurons retrogradely labeled from VTA/SNc. (A) Sagittal section showing neurons in AIV retrogradely labeled by an injection of a CTB into VTA/SNc. Antidromic responses were measured along the electrode tracks indicated (diagonal gray lines). Yellow circles indicate sites where statistically significant antidromic activation from electrical stimulation in VTA/SNc was observed. White circles indicate where no significant response was observed. Dashed line indicates the penetration along which reference electrolytic burn marks were made. (BD) Example responses to VTA/SNc stimulation are shown at selected recording sites along each of three electrode tracks. Significant antidromic responses were observed only at locations where retrogradely labeled neurons were subsequently found. Specifically, there was no antidromic response seen in the gap between the main posterior mass of retrogradely neurons and the more anterior stripe. Nor were antidromic responses observed antero-ventral to the anterior stripe, within the borders of RA, nor dorsal to the thin rim of retrogradely labeled neurons along the dorsal border of RA. Note on methods: Retrograde tracer (CTB) was injected into VTA and SNc, and bipolar stimulating electrodes were implanted into the same region. The head was then oriented with the flat anterior portion of the skull rotated forward at a pitch of 70°. RA was completely mapped based on its characteristic electrophysiological signature (regular spiking interspersed with bursts). Electrode penetrations (Carbostar, Kation Scientific) were made in a plane 200 µm medial from the lateral edge of RA and penetrations were made at several locations along the AP axis in this plane. For each penetration, recordings were made along a 1–1.5 mm range of DV coordinates spanning through the arcopallium. For each recording site the existence of antidromic responses was tested with single 0.2 ms monopolar pulses, current up to 350 µA and with both polarities of stimulation current. If no antidromic response was observed visually, responses were recorded using the maximum current (350 µA), otherwise, responses were recorded at 130% of threshold current. In each recording penetration, we noted the most ventral coordinate at which the characteristic spontaneous activity of RA was found. After recordings were complete, another electrode was used to make an electrolytic burn at known locations relative to the recording sites. Alignment of antidromic and retrograde maps. The brains were sectioned and stained for NeuN, and retrogradely labeled neurons were visualized. The AP coordinate of every recording site was determined with respect to the electrolytic burns. The DV coordinate of every recording site was determined by aligning the ventral border of RA as determined histologically (NeuN stain) and electrophysiologically (spontaneous activity). Significance of an antidromic response: we analyzed the significance of antidromic responses as follows. For each recording site we computed the frequency spectrum of the averaged response to the stimulus (15 ms window, spanning 2–17 ms after stimulus onset). We then computed the power between 200 Hz and 2 KHz. A t test was used to test the hypothesis that the power in this frequency band was significantly higher than the power obtained from recordings with no antidromic response (recorded at 1–2 mm dorsal to RA and inspected visually to verify they did not show antidromic response). Note that results were consistent for different frequency bands.

While natural song learning proceeds slowly (Tchernichovski et al., 2001), playback of disruptive auditory feedback, such as brief bursts of broadband noise, can induce rapid learned changes in song structure over the course of a few hours (Tumer and Brainard, 2007; Andalman and Fee, 2009). Such learning has been interpreted as evidence that distorted auditory feedback can be used to introduce experimentally controlled ‘errors’ in song performance (Tumer and Brainard, 2007; Andalman and Fee, 2009; Fee and Goldberg, 2011). More than 40% of the VTA/SNc-projecting neurons exhibited a significant neuronal response to distorted auditory feedback (50 ms noise bursts) presented during singing (Figure 7E–K, p<0.02 for n = 7/17 neurons, comparison of spike count in a 150 ms window before and after noise burst onset for each neuron; paired t test, p<10−5 for average response, bootstrap analysis, ‘Materials and methods’). Significant responses were also observed at 54% of the multiunit recording sites (n = 7/13), and in 30% of AIV neurons that did not meet the criteria for antidromic identification (n = 6/20 neurons). The response of AIV neurons to noise bursts was brief, occurring with an average latency of 23 ± 12 ms from noise onset and having an average duration of 90 ± 43 ms (‘Materials and methods’, Figure 7L). There was no significant difference between the responses of antidromically identified neurons and those that were not antidromically identified. Thus, these data are pooled in the statistics on latency and duration.

Are AIV neurons responsive to noise bursts only during singing, or can similar auditory responses be evoked during non-singing? We recorded the responses of AIV neurons to noise bursts presented during playback of the bird's own song (BOS) in 25 identified VTA/SNc-projecting AIV neurons under anesthesia (n = 3 birds), and another 28 such neurons in freely behaving birds (n = 4 birds). Under these conditions, noise bursts elicited no significant response in any of these neurons (Figure 8A,C,E, ‘Materials and methods’). We also examined whether AIV neurons respond to noise bursts presented in a silent background (non-singing, no BOS). In this case, auditory responses were observed in a small fraction of neurons in both awake and anesthetized birds: Altogether, 12/53 neurons exhibited a significant response within a 1 s window after noise onset (bootstrap, p<0.02, ‘Materials and methods’). However, these responses were significantly slower and exhibited longer latency than during singing (p<0.001, Figure 8B,D,E, only one neuron responded significantly within 150 ms of noise onset). In comparison, all the neurons that were found to be noise-responsive during singing (significant activity within a 1 s window), exhibited this response within 150 ms of the noise burst onset. Altogether, we have found that the auditory responsiveness of AIV neurons to disruptive auditory stimuli is highly state-dependent, exhibiting fast and robust firing-rate changes only during singing.

Figure 8. Response of AIV neurons to noise bursts during non-singing.

Figure 8.

(A) Activity of an antidromically identified AIV neuron recorded during presentation of noise bursts during playback of birds own song (BOS). Top to bottom panels: song spectrogram and simultaneous recording of neuronal activity; raster plot and spike histogram aligned to BOS onset (black line). Time of noise bursts indicated by vertical red lines. Yellow band denotes period of BOS playback. Note lack of response to noise bursts. (B) Response of the same neuron in panel A to presentation of isolated noise bursts (no BOS, no singing). Note slow timecourse of the response. (C and D) Recording of another antidromically identified AIV neuron (notation same as in panels A and B). (E) Averaged PSTH for all VTA/SNc-projecting AIV neurons that exhibited a significant response to isolated noise bursts within 1 s after noise onset. Average response of these same neurons to noise bursts presented during BOS playback (blue). In comparison, averaged response is shown for all VTA/SNc-projecting AIV neurons that responded significantly to noise burst during singing (black trace).

DOI: http://dx.doi.org/10.7554/eLife.02152.016

Discussion

Motivated by the hypothesized role of dopaminergic signaling in reinforcement learning (Houk et al., 1994; Schultz et al., 1997; Bayer and Glimcher, 2005; Tsai et al., 2009), we set out to examine the role in vocal learning of a recently discovered songbird cortical area (Gale et al., 2008) that projects to VTA and SNc. We have characterized the spatial extent of neurons within the intermediate arcopallium that project to VTA and SNc, and refer to the region of arcopallium retrogradely labeled from these midbrain dopaminergic areas as AIV. Using a combination of anatomical and electrophysiological techniques, we have elucidated the afferent inputs to AIV neurons from other cortical areas. We examined the effect on vocal learning of lesions targeted to AIV, and carried out electrophysiological recordings of AIV neurons. Our findings are broadly consistent with the hypothesis that an arcopallial region with descending auditory cortical projections, both to the dopaminergic midbrain and to midbrain and brainstem auditory centers, plays a role in vocal learning.

The area we have identified as AIV is adjacent to other structures that have been hypothesized to play a role in vocal learning (Bottjer and Altenau, 2010). The dorsal arcopallium (also referred to as the lateral intermediate arcopallium, LAI, by Jarvis et al., 2013), receives inputs from the nidopallium adjacent to LMAN (LMAN-shell), and projects to the basal ganglia, motor thalamus, tectum, and the reticular formation (Bottjer et al., 2000). Our anatomical findings suggest that AIV, while adjacent to Ad, is part of a distinct, largely non-overlapping, anatomical circuit. In contrast to Ad, AIV does not receive a projection from LMAN-shell; rather it receives inputs from auditory cortical areas (CM, L1, and HVC-shelf) and caudal nidopallium. Furthermore, our findings show that AIV, but not Ad, projects to VTA/SNc. Given its anatomical overlap with RAcup, parts of AIV also likely project to MLD or to the shell of the thalamic nucleus ovoidalis (Mello et al., 1998).

Earlier studies have presented conflicting views on the function of Ad/LAI, particularly in regard to its role in vocal learning (Bottjer et al., 2000; Achiro and Bottjer, 2013). Bottjer and Altenau (2010) describe evidence that lesions of Ad produce deficits in vocal learning, not unlike those we report here for lesions of AIV. In contrast, Feenders et al. showed immediate early gene activation in Ad/LAI and in LMAN-shell (which projects to Ad/LAI) during locomotory activity (i.e., hopping), but find no evidence for activation in these areas during singing. This led them to suggest that Ad/LAI and RA form the output of a general avian cortical motor circuit, of which RA is a component specialized for vocal production (Feenders et al., 2008).

Our findings are consistent with the view that AIV, but not Ad, play a role in vocal motor learning. We found that lesions of AIV caused deficits in vocal imitation, while lesions of Ad did not cause any such deficit, suggesting that the loss of song imitation following lesions targeted to AIV was not a secondary consequence of an unintended lesion in Ad. Our finding that larger lesions of Ad produced severe akinesia and immobility is consistent with the hypothesis that Ad plays a role in locomotion and other motor behaviors, rather than vocal learning (Feenders et al., 2008).

There are several potential explanations for the discrepancy between our findings on the effects of Ad lesions and those of Bottjer and Altenau (2010). Clearly, the Ad lesion protocol of Bottjer and Altenau impacted a subset of neurons important for vocal learning, while our Ad lesion protocol did not. This subset of neurons, in principle, could be within Ad, within AIV, or could be in another, as yet undescribed, adjacent region important for vocal learning. In our view, the most parsimonious explanation for the detrimental effects on vocal learning is that the Ad lesions of Bottjer and Altenau had an unintended impact on neurons in AIV. More detailed electrophysiological studies of the pathways into and out of Ad will be needed to completely resolve these different views of its function.

The effects of lesions directed to AIV in juvenile birds bear some resemblance to the effects of lesions in the AFP, a basal ganglia–thalamocortical circuit necessary for vocal learning. Lesions of Area X, the basal ganglia component of the AFP, in young birds have a substantial detrimental effect on song imitation and result in persistent song variability into adulthood (Scharff and Nottebohm, 1991). In contrast, lesions of Area X have relatively little immediate effect on song performance in adult and juvenile birds (Goldberg and Fee, 2011; Kojima et al., 2013). Similarly, we found that birds in which AIV was lesioned at an early age produced a poor imitation of the tutor song and developed a less stereotyped song than did control birds, and AIV lesions in adult or juvenile birds have little immediate effect on vocal performance. Lesions of Area X also largely block the degradation of song following deafening (Kojima et al., 2013), and similarly, but to a lesser extent, we found that lesions of AIV slow song degradation in deafened birds. The similar patterns of findings in AIV-lesioned birds and Area X-lesioned birds is consistent with the possibility that the AIV, or some components of it, may interact with the AFP, perhaps by transmitting to it a signal important for its function.

In principle, there are several ways the loss of a descending auditory cortical projections might affect vocal learning. We first address the possible role of the projection from AIV to the dopaminergic midbrain in vocal learning. In the songbird, dopaminergic inputs to Area X have been hypothesized to carry signals related to motivational aspects of singing and to regulate vocal variability, in particular while switching between less variable song directed to a female bird to more variable song produced in social isolation (Yanagihara and Hessler, 2006; Hara et al., 2007; Leblois et al., 2010). Notably, we find that juvenile and young adult birds exhibit normal song variability after AIV lesions, suggesting that AIV may not be involved in the regulation of vocal variability, and that the deficits in vocal learning do not result from a loss of exploratory song variability. Alternatively, reduced glutamatergic drive after AIV lesion could result in reduced tonic dopaminergic input to Area X. In mammals, reduced dopaminergic input to the striatum can have deleterious effects on BG function, including reduced spontaneous activity and the loss of spines in the medium spiny neurons (Kreitzer and Malenka, 2008). Songbirds exhibit patterns of dopamine receptor expression in cortex and Area X (Kubikova et al., 2010), and responses to dopamine stimulation (Ding and Perkel, 2002) that parallel those found in mammals. Finally, another possibility is that impoverished vocal imitation after lesions of AIV results from the loss of a reinforcement signal that carries information about recent song performance. The short-latency response of AIV neurons to disrupted auditory feedback is consistent with the possibility that AIV may play a role in computing or transmitting a fast online signal to VTA/SNc.

Previous studies have shown that neurons in the vicinity of RA, and potentially within AIV, have several sub-telencephalic targets other than VTA/SNc, including a higher-order auditory thalamic nucleus Ov-shell and parts of the auditory midbrain (Vates et al., 1996). Thus, deficits in vocal learning following lesions targeted to AIV could potentially arise from loss of signaling in these other pathways. Further studies will be required to elucidate the relation between neurons projecting to these different downstream targets, and to determine the role of these different pathways in vocal learning.

One key to understanding the function of AIV is to characterize its cortical afferents, and several candidates have been identified. Güntürkün et al. have described a projection in pigeons from the lateral caudal nidopallium (NCL) to the ventral arcopallium (Kroner and Gunturkun, 1999). The projection we describe here in the zebra finch—from the caudal nidopallium (NC) posterior to HVC—terminates primarily in the posterior ‘stem’ part of AIV. This projection may be related to the previously described nidopallium caudolaterale (NCL) projection, although NCL appears to be more lateral than NC. NCL was recently found to be involved in error and reward processing (Starosta et al., 2013), and based on several lines of evidence, it has been suggested that NCL may be analogous to mammalian prefrontal cortex (Kroner and Gunturkun, 1999). Notably, in mammals the major cortical projection to VTA arises from prefrontal cortex, a key structure for cognitive control and goal-directed actions (Murase et al., 1993).

Previous studies in the zebra finch have also identified inputs to the intermediate arcopallium from auditory cortical regions, including primary auditory cortical field L1 and HVC-shelf in the nidopallium (Kelley and Nottebohm, 1979; Vates et al., 1996; Mello et al., 1998). These projections were found to terminate near RA, in a region termed RA-cup, and are likely homologous to projections in the pigeon from the dorsal nidopallium (Nd) to the ventromedial intermediate arcopallium (AIvm) (Wild et al., 1993). We have confirmed the existence of these projections, and using anatomical tracing and electrophysiological techniques, we have demonstrated that these inputs directly or indirectly innervate AIV neurons projecting to the dopaminergic midbrain. Inputs from L1 and anterior parts of HVC shelf appear localized to the parts of AIV anteroventral to RA. Inputs from posterior HVC shelf appear to terminate in both the anterior and posterior parts of AIV, and show significant overlap with inputs from the more posterior caudal nidopallium described above.

Our anatomical studies have also revealed a novel projection to AIV from the caudal mesopallium (CM), an avian auditory area previously identified as homologous to higher-order auditory cortex (Bauer et al., 2008) or to upper layers of primary auditory cortex (Karten, 2013). The projection from CM terminates within AIV, and electrical stimulation in CM causes spiking activity in VTA/SNc-projecting AIV neurons, likely through orthodromic activation. The existence of a projection to AIV from CM and L1 may be important for two reasons. The cluster of neurons in CM retrogradely labeled from AIV was overlapped with the region of CM, also known as Avalanche (Av), that forms bidirectional synaptic interactions with both HVC and NIf (Bauer et al., 2008; Akutagawa and Konishi, 2010), two song control nuclei also involved in vocal learning (Roberts et al., 2012). Furthermore, a small subpopulation of neurons in CM and L1 have been shown to respond to noise bursts presented during singing, but not to noise bursts presented during playback of the birds own song (Keller and Hahnloser, 2009). These responses, highly reminiscent of responses in AIV, have led to the suggestion that one computational function of CM and L1 may be to compare actual auditory feedback with the song template (Keller and Hahnloser, 2009). Further studies are required to determine if the hypothesized error-related responses of AIV neurons derive from activity in this population of CM and L1 neurons.

One can speculate about the potential role for AIV in computing or transmitting a fast online signal to VTA/SNc that potentially carries information about recent song performance. Most VTA/SNc-projecting AIV neurons responded to disruptive auditory feedback with a latency of less than 25 ms and response duration of less than 100 ms. This response may have a sufficiently fast temporal resolution to mediate vocal learning, assuming that the synaptic learning rules in Area X and the motor pathway employ a short-term synaptic memory such as an eligibility trace (Sutton and Barto, 1998; Fee and Goldberg, 2011; Redondo and Morris, 2011). Such a fast dopaminergic reinforcement signal could, in principle, be used in Area X to correlate vocal performance with an efference copy of vocal motor commands (Fee and Goldberg, 2011) and to drive corticostriatal plasticity at HVC inputs to medium spiny neurons (MSNs). The MSNs could then act through their downstream pallidothalamic pathway to (1) bias the motor system in favor of vocal commands that previously led to better song performance and (2) drive plasticity in the motor pathway such that the juvenile song gradually converges to the desired vocal output (Andalman and Fee, 2009; Fee and Goldberg, 2011; Warren et al., 2011; Charlesworth et al., 2012).

Additional studies would be required to establish the nature of auditory reinforcement signals in the pathway we have described here, or possibly through other downstream targets, and to establish a causal role for these circuits in vocal learning.

Materials and methods

Subjects

Animal subjects were male zebra finches (n = 110) (45–120 days post hatch, dph). Birds were obtained from the Massachusetts Institute of Technology zebra finch breeding facility (Cambridge, Massachusetts). The care and experimental manipulation of the animals were carried out in accordance with guidelines of the National Institutes of Health and were reviewed and approved by the Massachusetts Institute of Technology Committee on Animal Care.

Identification of AIV

Retrograde labeling of neurons in AIV was obtained by injecting fluorescently labeled cholera toxin β subunit (Molecular Probes) into the ventral tegmental area (VTA) and substantia nigra pars compacta (SNc). VTA/SNc were localized stereotaxically with reference to the principal thalamic auditory relay nucleus Ovoidalis, as described below.

Reliable targeting of VTA/SNc

We have developed a technique that allows us to target VTA and SNc with high reliability, based on the mapped location of the thalamic auditory relay nucleus Ovoidalis (Ov). Specifically, the head was oriented with the flat anterior portion of the skull rotated forward at a pitch of 50°. Using an extracellular electrode (Carbostar-1, Kation Scientific, Minneapolis, MN), tilted in the LM direction 2° towards the midline, Ov was located and mapped based on its robust auditory responses. Injections were made into VTA relative to Ov as follows: 300 µm anterior from the center of Ov, 200 µm medial from the medial edge of Ov, and 1800 µm ventral to the middle of Ov in the DV direction. Injections into SNc were targeted 350 µm posterior and 350 µm lateral to the VTA coordinates. For electrophysiology experiments requiring antidromic stimulation in VTA/SNc, bipolar stimulating electrodes were placed such that, to the extent possible, one wire was in VTA and the other wire was in SNc.

Excitotoxic lesions of AIV

Bilateral lesions of AIV were carried out by injecting 2% N-Methyl-DL aspartic acid (NMA, Sigma, St Louis, MO) at multiple injection sites in ventral arcopallium around RA chosen to maximally lesion VTA/SNc-projecting neurons. All AIV lesions were carried out stereotaxically. The head was oriented with the flat anterior portion of the skull rotated forward at a pitch of 70°. Because a large portion of AIV is located ventral to RA, access to AIV was achieved by a laterally rotated penetration. The head was rotated 45° (roll) to the left for lesions of AIV in the right hemisphere, and rotated to the right for lesions of AIV in the left hemisphere. Using a carbon fiber electrode (Mod #E1011-20; Carbostar-1, Kation Scientific.) RA was then completely mapped based on its characteristic electrophysiological signature (regular spiking interspersed with bursts). Once all of the edges of RA were localized in the rotated frame, these were used to calculate the coordinates of the injection sites. In the injection coordinates given below (ML, AP, DV), dimensions are in µm; positive numbers represent more lateral, more anterior, and more ventral displacements, respectively. The coordinates are specified relative to the lateral edge of RA in the medial–lateral direction, relative to the center of RA in the anterior–posterior direction, and relative to the most ventral edge of RA in the dorsal–ventral direction. Injections were made through a glass pipette using a digitally controlled injection system (Nanoject, Drummond Scientific, Broomall, PA). Injections at each site were made in multiple boluses of 13.8 nl, which were spaced at 5-min intervals to avoid backflow along the pipette. AIV lesions (n = 17 birds) were created by up to seven different injection sites (in five different penetrations) at the coordinates specified below. The last number in each coordinate indicates the number of injection boluses made at that site: (+200,−400,+200; 4); (+300,+0,+200; 4); (+200,+300,+0; 4); (+0,+700,+0; 2); (+0,+700,+300; 2); (−300,+900,+0; 2) ; (−300,+900,+300; 2).

Note that, given the complex spatial structure of AIV, it was difficult to achieve a complete lesion of VTA/SNc projecting neurons. For example, we did not attempt to lesion the thin shell of AIV neurons dorsal to RA. Furthermore, our lesions may have affected areas of the intermediate arcopallium adjacent to AIV.

Excitotoxic lesions of Ad

We targeted the portion of Ad most likely affected by our AIV lesion procedure (n = 7 birds) as follows: Bilateral lesions of Ad were carried out stereotaxically by injecting 1% NMA at multiple injection sites. The stereotaxic procedure was similar to that described above for AIV lesions. RA was mapped electrophysiologically to determine the lateral border and center (in the AP direction) of RA. Ad was also mapped electophysiologically by its characteristic bursting patterns, which distinguished it from the overlying arcopallium. Injection sites are listed below: the (ML, AP) coordinates are given in the same notation given above for the AIV lesion. However, the injections were targeted to the center of Ad in the dorso–ventral dimension as determined from electrophysiological mapping of Ad. Injections at each site were made in 3 boluses of 13.8 nl, which were spaced at 5-min intervals to avoid backflow along the pipette. Injections were made at the following locations: (+250,−100); (+250,+300); (+650,−100); (+650,+300); (+1000,−100); (+1000,+300). This lesion protocol typically spared the lateral-most ∼200 µm of Ad (see Figure 5A).

To determine the effects of complete lesions of Ad on vocal learning, we also carried out (n = 3 birds) an Ad lesion protocol with two additional lateral injection sites. Injections were made at the following locations: (+250,−100); (+250,+300); (+650,−100); (+650,+300); (+950,−100); (+950,+300); (+1200,−100); (+1200,+300). These birds exhibited severe immobility after surgery, and failed to eat or drink, requiring early termination of the experiment.

Developmental study of song imitation (Figure 4)

Male juvenile birds were maintained in the aviary in their home cages with both parents until the age 44–45 dph, at which point they were transferred to sound isolation/recording chambers until they began to sing and baseline song was acquired. For experimental birds, bilateral AIV or Ad lesions were carried out as described above. After surgery, birds typically began to sing within 5 days (range 2–5 days). Birds were maintained in isolation and recorded continuously until they reached the age of 90 dph. Once it was confirmed that post-90 dph, songs had been recorded, we histologically verified the extent of AIV lesions, as described below. Unmanipulated control birds were isolated at 44–45 dph (as for experimental birds) and were maintained in sound isolation/recording cages until they reached the age of 90 dph, or until post-90 dph song had been acquired. Assignment of birds to unmanipulated control or experimental groups was initially made randomly, but subsequent juveniles from a given breeding cage (with a given tutor) were placed in control or experimental groups to provide sibling matches between these conditions.

Histological verification of AIV lesions

AIV lesions were examined histologically at the end of each experiment. Retrograde tracer (fluorescently labeled cholera toxin β subunit, Molecular Probes) was injected into VTA/SNc and 4 days later the bird was sacrificed and perfused transcardially with saline and then 4% paraformaldehyde. The brain was removed and post-fixed in 4% paraformaldehyde overnight. The brain was sectioned, stained for NeuN, and imaged under a fluorescence microscope (Zeiss Axioplan, Germany). The extent of AIV lesions was analyzed on the basis of loss of NeuN staining and the density of retrogradely labeled neurons in these regions. Quantification was based on the size of the lesion in each parasagittal section extending from the medial edge of RA to 200 µm past the lateral edge, and yielded two numbers indicating the fraction of AIV lesioned on the left and right side. The extent of any damage to RA was also monitored and birds with more than 5% RA lesion were excluded from the analysis (n = 2). The histological assessment of lesion size was conducted blind to the behavioral effects on song, and the quantification of the behavioral effects was done blind to the histological analysis of lesion size.

Testing the functionality of anatomical projections

To test the functional connectivity of projections from L1, CM, and HVC-shelf to VTA/SNc-projecting neurons in AIV, we implanted bipolar stimulating electrodes in each of these auditory cortical areas (L1, n = 2 birds; CM, n = 3 birds; HVC-shelf, n = 3 birds). A bipolar stimulating electrode was also placed in VTA/SNc to antidromically identify neurons in AIV. Guided by the results of the anterograde tracing experiments (Figure 2, Figure 2—figure supplement 1), recordings were made in the regions of AIV ventral and anterior to RA using a Carbostar electrode (1 MOhm, Kation Scientific). Within this region, single-unit and multi-unit activity was broadly responsive to stimulation in both VTA/SNc and in L1, CM, or HVC-shelf. Threshold currents required to elicit spiking responses were between 50 and 200 µA (single 0.2-ms unipolar pulses). Location of the stimulating electrodes was verified histologically.

Analysis of song imitation

We modified the previously published Song Analysis Pro (SAP) algorithm (Tchernichovski et al., 2000) and developed an automated procedure for comparison of pupil songs with the tutor motif (Software is available at Mandelblat-Cerf and Fee, 2014). Following SAP, songs were represented by acoustic features. Similarity between time points in the two songs is based on the Euclidean distance in the feature space between these points. The procedure builds a similarity matrix for all possible pairs of time points between the two song samples.

To quantify tutor–pupil similarity the tutor motif was segmented into separate syllables. While syllable segmentation is highly reliable in tutor song, it is less reliable in the more variable pupil song, which was thus segmented into equally sized sections, each twice the duration of the tutor motif. For each tutor syllable our procedure uses the similarity matrix described above to find the sections of the pupil song bout that have the highest similarity and assigns an overall similarity score. For each match between a tutor's syllable and a pupil's song section, a sequencing score is evaluated by computing the match of the next syllable in the tutor song with the next section of the pupil song. The algorithm then computes a composite ‘tutor imitation score’ as the product of song similarity and sequence similarity.

For each bird we examined song spectrograms and computed the imitation scores for the first day of singing after isolation, and before any further manipulation. Three birds exhibited clear evidence of tutor imitation and were excluded from further experiments.

Imitation capacity

To determine the fractional extent to which AIV-lesions reduced the capacity of birds to imitate their tutor song, imitation scores of each bird were placed on a linear scale ranging from the average imitation score of control birds 0.197 ± 0.011 to the average similarity score from all pairwise comparisons of 20 unrelated adult birds in our colony 0.104 ± 0.0027, yielding a total dynamic range of 0.093 [=0.197–0.104] ± 0.00636. For birds in which we estimate that more than half of AIV was lesioned, the average imitation score was 0.116 ± 0.008. Therefore, we can compute the similarity of AIV-lesioned birds to their tutor songs relative to the song similarity of unrelated birds, as a fraction of the total dynamic range, as 12.9% [=(0.116–0.104)/0.093] ± 9.2%. Thus, AIV-lesioned birds suffered a loss of imitation capacity compared to controls of 87% [=1.0–0.129] ± 9%.

Quantification of song degradation following deafening and/or AIV lesion (Figure 6)

Male juvenile birds were obtained from the aviary at the age 65–70 dph and were maintained in sound isolation cages until they began to sing. Upon reaching age the 75–80 dph, pre-surgery songs were recorded, and birds were randomly placed in three experimental groups: deafening, AIV-lesion, and combined deafening and AIV-lesion. Deafening was achieved by bilateral cochlear extirpation, which was carried out by accessing the cochlea through the posterio-lateral cranium. AIV lesions were carried out as described above. Upon recovery from surgery, birds were maintained in sound isolation cages. Undirected song was recorded continuously for 2 weeks.

To quantify song degradation, song was recorded every day post-surgery and compared to song recorded prior to surgery. Self-similarity scores were computed using the same algorithm used to evaluate tutor imitation.

Chronic neural recordings in AIV

Single-unit recordings of VTA/SNc-projecting AIV neurons were obtained from seven young adult birds (90–120 dph) during undirected singing. For antidromic identification of AIV neurons, bipolar stimulating electrodes were placed in VTA/SNc, according to the methodology described above. AIV was localized with reference to the boundaries of RA, as electrophysiologically identified by the characteristic tonic activity of RA neurons. Placement of the chronically implanted electrode array was further refined by finding the regions in AIV that exhibited strong antidromic responses to brief unipolar square current pulses (0.2 ms duration) applied to the stimulating electrodes in VTA/SNc. Peak currents up to 350 µA were used to search for antidromically activated neurons: threshold currents for antidromic activation of neurons in AIV ranged from 90 µA to 250 µA. Recordings in AIV were made using an array of three recording electrodes (200 µm spacing, Microprobe, Gaithersburg, MD, #PI20033.0A3, 3 MΩ), mounted in a motorized microdrive (n = 20 birds implanted). Data were acquired and analyzed using custom Matlab software.

Disruptive auditory stimuli consisted of 50-ms duration pulses of white noise (95 dBSPL) similar to that previously used to drive song plasticity (Tumer and Brainard, 2007; Andalman and Fee, 2009), which were presented with random timing during undirected singing.

Recordings during playback of birds own song (BOS)

Auditory stimuli

A week before surgery, birds were isolated and undirected song was recorded. The song motif was extracted and a BOS stimulus was constructed from concatenation of 2 motifs. To mimic the auditory input during disruptive auditory feedback during singing, we constructed ‘BOS with noise’ stimulus in which noise bursts were added to a single syllable within each repetition of the motif. For each bird, the noise burst was added at a fixed time near the middle of the motif. The peak loudness of the BOS playback was set to be equal to the average peak loudness of song measured on the head of the singing birds (90 dBSPL) (Andalman and Fee, 2009). Noise burst loudness was the same as for the presentation of disruptive auditory feedback during singing (95 dBSPL).

Recordings in anesthetized birds

Birds were anesthetized with three intramuscular injections of 25 µl each of 20% urethane administered at 20 min intervals. If needed, another injection was given during the recording session. Playback of bird's own song (BOS) was delivered to the birds using miniature custom-built headphones (E-A-RTONE Mod-3A Audiometric Insert Earphone, Indianapolis, IN). Sound was played at 70dBSPL and 90dBSPL, calibrated using a probe microphone (Model ER-7C, Etymotic, Inc, Elk Grove Village, IL). VTA/SNc-projecting AIV neurons were antidromically identified using the protocol described above for chronic recordings.

Recordings in awake (non-singing) birds

A motorized microdrive was implanted for recordings in AIV as described above. After the surgery, the bird was maintained in a sound attenuation chamber. VTA/SNc-projecting AIV neurons were recorded during playback of BOS and BOS with noise on alternating presentations. Sound level for BOS playback was 90 dBSPL and calibrated using a Bruel & Kajer Model 2236 sound level meter placed at approximately the center of the bird cage. Histological verification of electrode positions was carried out after the recordings. Data were collected from four birds.

Analysis of neuronal responses

The significance of single neuron and multiunit responses to noise bursts were analyzed in two different ways: the first was a comparison of the spike counts in a window 150 ms before vs 150 ms after noise onset (paired t test). The second was a test for a significant peak in the peri-stimulus time histogram (PSTH) using a bootstrap analysis: over a 2 s window (1 s before stimulus onset to 1 s after stimulus onset) spikes were shifted circularly, where the extent of the shifts were drawn from a uniform distribution of ±1 s. Given the resulting PSTH, the peak within 1 s after stimulus was noted. This random shuffling was repeated 1000 times to obtain the distribution of peaks in the shuffled data sets. Neurons that exhibited a real PSTH peak that exceeded the 95th percentile of the shuffled data were considered significant. The significance of song-locked firing rate modulations of AIV neurons during singing was analyzed similarly. The highest observed peak in the motif-aligned PSTH was compared to the distribution of PSTH peaks obtained from surrogate data sets in which spike times were circularly within the motif.

The significance of the averaged response to noise bursts of all projecting neurons

Over a 2 s window (1 s before noise onset to 1 s after noise onset) the averaged response of each neuron was shifted circularly, where the extent of the shifts were drawn from a uniform distribution of ±1 s. Given the resulting averaged response across all neurons, two values were noted: the peak in the firing rate and the maximal spike count over 150 ms. This random shuffling was repeated 1000 times to obtain the distributions in the shuffled data sets. The peak of the measured averaged response within 150 ms from noise onset, and the spike count over this period, were tested against the distribution of peaks and spike counts. The indicated p-value is the higher of these two tests.

Response latency and duration

Activity around stimulus onset was binned in 2 ms bins. Each bin after stimulus onset was tested against the distribution of the activity prior to the stimulus, using a z-test. Response onset (latency) was detected by the first bin for which the following five consecutive bins (10 ms) were significantly different than the activity prior to the stimulus (z-test, all with p<0.05). To assess response duration, we searched for the first bin after response onset for which, the following 10 consecutive bins (20 ms) were insignificantly different than the pre-stimulus activity (z-test, all with p>0.05).

Stereotypy of identified syllables

To evaluate syllable stereotypy, independent of sequence stereotypy, we used the peak of the spectral cross-correlation (Aronov et al., 2008), on a syllable-by-syllable basis. A database of song syllables was constructed for each bird (n = 500–600 syllables). The spectrogram of each syllable was computed using the multi-taper method (k = 2 tapers, bandwidth parameter p=1.5, 10 ms window, 1 ms step size). 100 syllables were randomly selected (without replacement) from the data set. For each selected syllable, a sample of 100 syllables was drawn randomly from the data set. The similarity of the selected syllable to each of the sampled syllables was evaluated following the spectral correlation method: first, the spectrogram correlation matrix was computed from the correlation of power spectra (860 Hz and 8.6 kHz) between each 1 ms time slice in the selected syllable and each 1 ms time slice in the sampled syllable. Second, a lag correlation function was computed as the sum along the diagonals of the correlation matrix. Third, the maximum of the lag correlation function was determined. The values resulting from all comparisons to the selected syllable formed a distribution. The value of the 95 percentile of this distribution represents the stereotypy of the selected syllables. For each bird, the syllable stereotypy metric (Figure 5D) was computed as the average over all 100 selected syllables.

Acknowledgements

We thank Jesse Goldberg for helpful suggestions. We also thank Daniel Rubin, Anton Konovchenko, and Tim Currier for technical assistance. Funding for this work was provided by the National Institutes of Health (R01 MH067105) and McGovern Institute internal funding.

Funding Statement

The funder had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Funding Information

This paper was supported by the following grant:

Additional information

Competing interests

The authors declare that no competing interests exist.

Author contributions

YM-C, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article, Contributed unpublished essential data or reagents.

LL, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article, Contributed unpublished essential data or reagents.

ND, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article, Contributed unpublished essential data or reagents.

MF, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article, Contributed unpublished essential data or reagents.

Ethics

Animal experimentation: This study was performed in strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health, were reviewed and approved by the Massachusetts Institute of Technology Committee on Animal Care (IACUC). Permit Number: 0712-071-15. Every effort was made to minimize suffering.

References

  1. Achiro JM, Bottjer SW. 2013. Neural representation of a target auditory memory in a cortico-basal ganglia pathway. The Journal of Neuroscience 33:14475–14488. doi: 10.1523/JNEUROSCI.0710-13.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Akutagawa E, Konishi M. 2010. New brain pathways found in the vocal control system of a songbird. The Journal of Comparative Neurology 518:3086–3100. doi: 10.1002/cne.22383 [DOI] [PubMed] [Google Scholar]
  3. Andalman AS, Fee MS. 2009. A basal ganglia-forebrain circuit in the songbird biases motor output to avoid vocal errors. Proceedings of the National Academy of Sciences of the United States of America 106:12518–12523. doi: 10.1073/pnas.0903214106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Aronov D, Andalman AS, Fee MS. 2008. A specialized forebrain circuit for vocal babbling in the juvenile songbird. Science 320, 630–634. doi: 10.1126/science.1155140 [DOI] [PubMed] [Google Scholar]
  5. Bauer E, Coleman M, Roberts T, Roy A, Prather J, Mooney R. 2008. A synaptic basis for auditory–vocal integration in the songbird. The Journal of Neuroscience 28:1509–1522. doi: 10.1523/JNEUROSCI.3838-07.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bayer HM, Glimcher PW. 2005. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 47:129–141. doi: 10.1016/j.neuron.2005.05.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bottjer S, Brady J, Cribbs B. 2000. Connections of a motor cortical region in zebra finches: relation to pathways for vocal learning. The Journal of Comparative Neurology 420:244–260. doi: 10.1002/(SICI)1096-9861(20000501)420:23.0.CO;2-M [DOI] [PubMed] [Google Scholar]
  8. Bottjer SW, Altenau B. 2010. Parallel pathways for vocal learning in basal ganglia of songbirds. Nature Neuroscience 13:153–155. doi: 10.1038/nn.2472 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bottjer SW, Miesner EA, Arnold AP. 1984. Forebrain lesions disrupt development but not maintenance of song in passerine birds. Science 224:901–903. doi: 10.1126/science.6719123 [DOI] [PubMed] [Google Scholar]
  10. Brainard MS, Doupe AJ. 2000. Interruption of a basal ganglia-forebrain circuit prevents plasticity of learned vocalizations. Nature 404:762–766. doi: 10.1038/35008083 [DOI] [PubMed] [Google Scholar]
  11. Charlesworth JD, Warren T, Brainard M. 2012. Covert skill learning in a cortical-basal ganglia circuit. Nature 486:251–255. doi: 10.1038/nature11078 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Ding L, Perkel DJ. 2002. Dopamine modulates excitability of spiny neurons in the avian basal ganglia. The Journal of Neuroscience 22:5210–5218 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Doupe AJ, Kuhl PK. 1999. Birdsong and human speech: common themes and mechanisms. Annual Review of Neuroscience 22:567–631. doi: 10.1146/annurev.neuro.22.1.567 [DOI] [PubMed] [Google Scholar]
  14. Dugas-Ford J, Rowell JJ, Ragsdale CW. 2012. Cell-type homologies and the origins of the neocortex. Proceedings of the National Academy of Sciences of the United States of America 109:16974–16979. doi: 10.1073/pnas.1204773109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Fee MS, Goldberg JH. 2011. A hypothesis for basal ganglia-dependent reinforcement learning in the songbird. Neuroscience 198:152–170. doi: 10.1016/j.neuroscience.2011.09.069 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Feenders G, Liedvogel M, Rivas M, Zapka M, Horita H, Hara E, Wada K, Mouritsen H, Jarvis ED. 2008. Molecular mapping of movement-associated areas in the avian brain: a motor theory for vocal learning origin. PLOS ONE 3:e1768. doi: 10.1371/journal.pone.0001768 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Gale SD, Perkel DJ. 2010. A basal ganglia pathway drives selective auditory responses in songbird dopaminergic neurons via disinhibition. The Journal of Neuroscience 30:1027–1037. doi: 10.1523/JNEUROSCI.3585-09.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Gale SD, Person AL, Perkel DJ. 2008. A novel basal ganglia pathway forms a loop linking a vocal learning circuit with its dopaminergic input. The Journal of Comparative Neurology 508:824–839. doi: 10.1002/cne.21700 [DOI] [PubMed] [Google Scholar]
  19. Goldberg JH, Fee MS. 2011. Vocal babbling in songbirds requires the basal ganglia-recipient motor thalamus but not the basal ganglia. Journal of Neurophysiology 105:2729–2739. doi: 10.1152/jn.00823.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Hamaguchi K, Tschida KA, Yoon I, Donald BR, Mooney R. 2014. Auditory synapses to song premotor neurons are gated off during vocalization in zebra finches. eLife 18:e01833. doi: 10.7554/eLife.01833 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Hara E, Kubikova L, Hessler NA, Jarvis ED. 2007. Role of the midbrain dopaminergic system in modulation of vocal brain activation by social context. The European Journal of Neuroscience 25:3406–3416. doi: 10.1111/j.1460-9568.2007.05600.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hessler N, Doupe A. 1999. Singing-related neural activity in a dorsal forebrain–basal ganglia circuit of adult zebra finches. The Journal of Neuroscience 19:10461–10481 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Hikosaka O. 2007. Basal ganglia mechanisms of reward-oriented eye movement. Annals of the New York Academy of Sciences 1104:229–249. doi: 10.1196/annals.1390.012 [DOI] [PubMed] [Google Scholar]
  24. Houk JC, Adams JL, Barto AG. 1994. In: Models of information processing in the basal ganglia. MIT press; 249–270 [Google Scholar]
  25. Jarvis ED, Yu J, Rivas MV, Horita H, Feenders G, Whitney O, Jarvis SC, Jarvis ER, Kubikova L, Puck AE, Siang-Bakshi C, Martin S, McElroy M, Hara E, Howard J, Pfenning A, Mouritsen H, Chen CC, Wada K. 2013. Global view of the functional molecular organization of the avian cerebrum: mirror images and functional columns. The Journal of Comparative Neurology 521:23462. doi: 10.1002/cne.23404 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kao MH, Doupe AJ, Brainard MS. 2005. Contributions of an avian basal ganglia-forebrain circuit to real-time modulation of song. Nature 433:638–643. doi: 10.1038/nature03127 [DOI] [PubMed] [Google Scholar]
  27. Karten HJ. 2013. Neocortical evolution: neuronal circuits arise independently of lamination. Current Biology 23:R12–R15. doi: 10.1016/j.cub.2012.11.013 [DOI] [PubMed] [Google Scholar]
  28. Keller GB, Hahnloser RH. 2009. Neural processing of auditory feedback during vocal practice in a songbird. Nature 457:187–190. doi: 10.1038/nature07467 [DOI] [PubMed] [Google Scholar]
  29. Kelley D, Nottebohm F. 1979. Projections of a telencephalic auditory nucleus-field L-in the canary. The Journal of Comparative Neurology 183:455–469. doi: 10.1002/cne.901830302 [DOI] [PubMed] [Google Scholar]
  30. Kojima S, Kao MH, Doupe AJ. 2013. Task-related “cortical” bursting depends critically on basal ganglia input and is linked to vocal plasticity. Proceedings of the National Academy of Sciences of the United States of America 110:4756–4761. doi: 10.1073/pnas.1216308110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Konishi M. 1965. The role of auditory feedback in the control of vocalization in the white-crowned sparrow. Zeitschrift für Tierpsychologie 22:770–793. doi: 10.1111/j.1439-0310.1965.tb01688.x [DOI] [PubMed] [Google Scholar]
  32. Kozhevnikov AA, Fee MS. 2007. Singing-related activity of identified HVC neurons in the zebra finch. Journal of Neurophysiology 97:4271–4283. doi: 10.1152/jn.00952.2006 [DOI] [PubMed] [Google Scholar]
  33. Kreitzer AC, Malenka RC. 2008. Striatal plasticity and basal ganglia circuit function. Neuron 60:543–554. doi: 10.1016/j.neuron.2008.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Kroner S, Gunturkun O. 1999. Afferent and efferent connections of the Caudolateral Neostriatum in the Pigeon (Columba livia): a retro- and anterograde pathway tracing study. The Journal of Comparative Neurology 407:228–260. doi: [DOI] [PubMed] [Google Scholar]
  35. Kubikova L, Wada K, Jarvis ED. 2010. Dopamine receptors in a songbird brain. The Journal of Comparative Neurology 518:741–769. doi: 10.1002/cne.22255 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Leblois A, Wendel BJ, Perkel DJ. 2010. Striatal dopamine modulates basal ganglia output and regulates social context-dependent behavioral variability through D1 receptors. The Journal of Neuroscience 30:5730–5743. doi: 10.1523/JNEUROSCI.5974-09.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Leonardo A, Fee MS. 2005. Ensemble coding of vocal control in birdsong. The Journal of Neuroscience 25:652–661. doi: 10.1523/JNEUROSCI.3036-04.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Lombardino AJ, Nottebohm F. 2000. Age at deafening affects the stability of learned song in adult male zebra finches. The Journal of Neuroscience 20:5054–5064 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Mandelblat-Cerf Y, Fee MS. 2014. An automated procedure for evaluating song imitation. PLOS ONE 9:e96484. doi: 10.1371/journal.pone.0096484 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Mello CV, Vates GE, Okuhata S, Nottebohm F. 1998. Descending auditory pathways in the adult male zebra finch (Taeniopygia guttata). The Journal of Comparative Neurology 395:137–160. doi: [DOI] [PubMed] [Google Scholar]
  41. Mooney R. 2006. Synaptic mechanisms for auditory-vocal integration and the correction of vocal errors. Annals of the New York Academy of Sciences 1016:476–494. doi: 10.1196/annals.1298.011 [DOI] [PubMed] [Google Scholar]
  42. Murase S, Grenhoff J, Chouvet G, Gonon FG, Svensson TH. 1993. Prefrontal cortex regulates burst firing and transmitter release in rat mesolimbic dopamine neurons studied in vivo. Neuroscience Letters 157:53–56. doi: 10.1016/0304-3940(93)90641-W [DOI] [PubMed] [Google Scholar]
  43. Nordeen K, Nordeen E. 2010. Deafening-induced vocal deterioration in adult songbirds is reversed by disrupting a basal ganglia-forebrain circuit. The Journal of Neuroscience 30:7392–7400. doi: 10.1523/JNEUROSCI.6181-09.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Nordeen KW, Nordeen EJ. 1992. Auditory feedback is necessary for the maintenance of stereotyped song in adult zebra finches. Behavioral and Neural Biology 57:58–66. doi: 10.1016/0163-1047(92)90757-U [DOI] [PubMed] [Google Scholar]
  45. Nottebohm F, Stokes TM, Leonard CM. 1976. Central control of song in the canary, Serinus canarius. The Journal of Comparative Neurology 165:457–486. doi: 10.1002/cne.901650405 [DOI] [PubMed] [Google Scholar]
  46. Ölveczky BP, Andalman AS, Fee MS. 2005. Vocal experimentation in the juvenile songbird requires a basal ganglia circuit. PLOS Biology 3:e153. doi: 10.1371/journal.pbio.0030153 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Person A, Gale S, Farries M, Perkel D. 2008. Organization of the songbird basal ganglia, including area X. The Journal of Comparative Neurology 508:840–866. doi: 10.1002/cne.21699 [DOI] [PubMed] [Google Scholar]
  48. Prather JF, Peters S, Nowicki S, Mooney R. 2008. Precise auditory-vocal mirroring in neurons for learned vocal communication. Nature 451:305–310. doi: 10.1038/nature06492 [DOI] [PubMed] [Google Scholar]
  49. Redondo R, Morris R. 2011. Making memories last: the synaptic tagging and capture hypothesis. Nature Reviews Neuroscience 12:17–30. doi: 10.1038/nrn2963 [DOI] [PubMed] [Google Scholar]
  50. Roberts T, Gobes S, Murugan M, Olveczky B, Mooney R. 2012. Motor circuits are required to encode a sensory model for imitative learning. Nature Neuroscience 15:1454–1459. doi: 10.1038/nn.3206 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Scharff C, Nottebohm F. 1991. A comparative study of the behavioral deficits following lesions of various parts of the zebra finch song system: implications for vocal learning. The Journal of Neuroscience 11:2896–2913 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Schultz W, Dayan P, Montague PR. 1997. A neural substrate of prediction and reward. Science 275:1593–1599. doi: 10.1126/science.275.5306.1593 [DOI] [PubMed] [Google Scholar]
  53. Sen K, Theunissen FE, Doupe AJ. 2001. Feature analysis of natural sounds in the songbird auditory forebrain. Journal of Neurophysiology 86:1445–1458 [DOI] [PubMed] [Google Scholar]
  54. Starosta S, Gunturkun O, Stutttgen M. 2013. Stimulus-response-outcome coding in the pigeon nidopallium caudolaterale. PLOS ONE 8:e57407. doi: 10.1371/journal.pone.0057407 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Sutton RS, Barto AG. 1998. Reinforcement learning: an introduction. IEEE Transactions on Neural Networks 9:1054. doi: 10.1109/TNN.1998.712192 [DOI] [Google Scholar]
  56. Tchernichovski O, Mitra PP, Lints T, Nottebohm F. 2001. Dynamics of the vocal imitation process: how a zebra finch learns its song. Science 291:2564–2569. doi: 10.1126/science.1058522 [DOI] [PubMed] [Google Scholar]
  57. Tchernichovski O, Nottebohm F, Ho CE, Pesaran B, Mitra PP. 2000. A procedure for an automated measurement of song similarity. Animal Behaviour 59:1167–1176. doi: 10.1006/anbe.1999.1416 [DOI] [PubMed] [Google Scholar]
  58. Troyer TW, Doupe AJ. 2000. An associational model of birdsong sensorimotor learning I. Efference copy and the learning of song syllables. Journal of Neurophysiology 84:1204–1223 [DOI] [PubMed] [Google Scholar]
  59. Tsai HC, Zhang F, Adamantidis A, Stuber GD, Bonci A, de Lecea L, Deisseroth K. 2009. Phasic firing in dopaminergic neurons is sufficient for behavioral conditioning. Science 324:1080–1084. doi: 10.1126/science.1168878 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Tumer EC, Brainard MS. 2007. Performance variability enables adaptive plasticity of 'crystallized' adult birdsong. Nature 450:1240–1244. doi: 10.1038/nature06390 [DOI] [PubMed] [Google Scholar]
  61. Vates GE, Broome BM, Mello CV, Nottebohm F. 1996. Auditory pathways of caudal telencephalon and their relation to the song system of adult male zebra finches. The Journal of Comparative Neurology 366:613–642. doi: [DOI] [PubMed] [Google Scholar]
  62. Warren T, Tumer E, Charlseworth J, Brainard M. 2011. Mechanisms and time course of vocal learning and consolidation in the adult songbird. Journal of Neurophysiology 106:1806–1821. doi: 10.1152/jn.00311.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Wild JM, Karten HJ, Frost BJ. 1993. Connections of the auditory forebrain in the pigeon (Columba livia). The Journal of Comparative Neurology 337:32–62. doi: 10.1002/cne.903370103 [DOI] [PubMed] [Google Scholar]
  64. Wolpert DM, Ghahramani Z, Jordan MI. 1995. An internal model for sensorimotor integration. Science 269:1880–1882. doi: 10.1126/science.7569931 [DOI] [PubMed] [Google Scholar]
  65. Yanagihara S, Hessler NA. 2006. Modulation of singing-related activity in the songbird ventral tegmental area by social context. The European Journal of Neuroscience 24:3619–3627. doi: 10.1111/j.1460-9568.2006.05228.x [DOI] [PubMed] [Google Scholar]
eLife. 2014 Jun 16;3:e02152. doi: 10.7554/eLife.02152.017

Decision letter

Editor: Ronald L Calabrese1

eLife posts the editorial decision letter and author response on a selection of the published articles (subject to the approval of the authors). An edited version of the letter sent to the authors after peer review is shown, indicating the substantive concerns or comments; minor concerns are not usually shown. Reviewers have the opportunity to discuss the decision before the letter is sent (see review process). Similarly, the author response typically shows only responses to the major concerns raised by the reviewers.

[Editors’ note: although it is not typical of the review process at eLife, in this case the editors decided to include the reviews in their entirety for the authors’ consideration as they prepared their revised submission.]

Thank you for sending your work entitled “A descending auditory cortical projection is necessary for vocal learning in a songbird” for consideration at eLife. Your article has been favorably evaluated by a Senior editor and 3 reviewers, one of whom is a member of our Board of Reviewing Editors.

The following individuals responsible for the peer review of your submission have agreed to reveal their identity: Ronald L. Calabrese (BRE) and Marc Schmidt (reviewer).

The Reviewing editor and the other reviewers discussed their comments before we reached this decision, and the Reviewing editor has assembled the following comments to help you prepare a revised submission. The full initial reviews are appended below as well, and many of the minor comments found in them should be addressed.

Overview:

In this manuscript the authors present careful anatomical, electrophysiological, and behavioral analyses in songbirds of a brain area they term ventral intermediate arcopallium (AIV), which receives auditory cortical input and has output to the dopaminergic VTA/SNc. They further show that AIV neurons that are demonstrated to project to VTA by antidromic activation respond to disruptive auditory feedback during singing but not during the playback of the bird's own song or the bird's own song with disruptive auditory feedback by in vivo single unit (and multiunit) recording in awake unrestrained birds during singing and quiescence. They also show that lesions of the AIV leads to deficits in vocal learning but do not alter song in mature birds. The data set presented is extensive and the analyses quantitative. The AIV neurons described provide an opening to the dopaminergic reward system that promises in future to help elucidate how auditory feedback can alter vocal output: a significant problem in song birds and humans. This analysis will generate wide interest in the songbird and sensory motor integration communities.

Concerns:

There are several concerns in the reviews by the BRE and the external reviewers that are provided to the authors to help them in revision. To focus this revision and remove any apparent conflict in the reviews, a consensus has formed about three areas.

Anatomy:

The reviewers are aware that as work has proceeded in the songbird, focus has shifted from distinct nuclei like RA or X to more diffusely organized areas like the area described here as AIV. AIV must be as precisely delimited as is possible and the authors should be at pains to do so. The anatomy presented in support of AIV must be discussed in the context of past findings, particularly Bottjer and Altenau (2010) (Parallel pathways for vocal learning in basal ganglia of songbirds. Nature Neuroscience 13, 153-155.), even if this makes sections of the Discussions more for birdsong experts. It would also serve the community if some other nearby areas could be differentiated from or included in AIV e.g. RA cup. This is not to set previous work as some anatomical standard but rather to motivate further clarification that will be healthy for the field. Not only should the authors specify exactly what the boundaries of AIV are but they should thoroughly discuss how it is related to Ad. They need to state specifically why they think they have lesioned one and not the other and how the AIV vs. Ad lesions lead to different or similar behavioral results. They should try to explain discrepancies in their data and that from Bottjer and Altenau (2010).

Many of the projections described to AIV have been previously described, at least in part. The projection from CM to AIV is novel and potentially most functionally significant for the physiological responses of AIV to VTA projecting neurons; it should be highlighted.

Behaviour:

The authors should substantially increase the presentation of their primary data on song that is encompassed in their imitation score. There are no figure limitations in eLife, so it is possible to present more sonograms so that other workers can more easily judge how song is affected or not affected by a lesion. In addition to changes in imitation score, they should give a qualitative description of how song is affected by their lesions. They should then thoroughly discuss how the results are similar to or different than Ad and X lesions by others.

Physiology:

There are a number of specific comments by Reviewer #2 that should be addressed.

The Abstract is written quite conservatively and is highly appropriate; it might be best to adjust the title so that it more matches the conservative tone of Abstract.

Other issues to address:

Reviewer #1:

In this manuscript the authors present careful anatomical, electrophysiological, and behavioral analyses in songbirds of a previously unstudied brain area (ventral intermediate arcopallium (AIV)). They show that AIV receives auditory cortical input and has output to the dopaminergic VTA/SNc. They further show that AIV neurons respond to disruptive auditory feedback during singing but not during the playback of the bird's own song or the bird's own song with disruptive auditory feedback by in vivo single unit (and multiunit) recording in awake unrestrained birds during singing and quiescence. They also show that lesions of the AIV leads to deficits in vocal learning but do not alter song in mature birds. The data set presented is extensive and the analyses thorough and quantitative.

This AIV neurons discovered provide an opening to the dopaminergic reward system that promises in future to help elucidate how auditory feedback can alter vocal output: a significant problem in song birds and humans. This elegant analysis will generate wide interest in the songbird and sensory motor integration communities.

Concerns:

1) The work is carefully done with detailed analysis and the figures appropriately illustrate the data. Nevertheless the figures are very complicated and could benefit from more detailed legends and perhaps better labeling. A tremendous amount of data is presented in supplementary figures. eLife discourages use of supplementary data and rather includes all important data in the text. The authors should ruthlessly select the relevant data from the supplementary data and present those integrated into the text. For example the lower panels in Figure 5—figure supplement 5 could be eliminated. I defer to the expert reviewers and the authors on what is essential, however.

2) A better general explanation (and figure) of the system than afforded in the Introduction, Discussion and by Figure 1A would greatly help the general reader especially in Discussion. I fully understand why Figure 1A is simplified to explain the anatomical experiments, but by the time we move beyond these experiments through Results and in Discussion the acronyms for brain areas start to fly thick and fast and the general reader like myself has a difficult time following. For example, Figure 1A shows lateral caudal nidopallium (NCL) (without definition in Figure 1 legend), which is not mentioned until the Discussion but ignores LMAN (first mentioned with no definition or explanation). RA is mentioned early (first mentioned without definition; also not defined in Figure 1 legend) and often, but its supposed function and anatomic definition is not given. I suggest that a more functional figure would aid discussion; one that includes all the brain areas mentioned (including, e.g., LMAN) with some at least minimal explanation of supposed function and clear anatomic definitions. Think of the general reader.

3) The writing is succinct but the paper is not always easy to follow and could benefit from more explanation for the general reader particularly in the Discussion.

Reviewer #2:

This is a very carefully executed study identifying a new pathway by which higher-order auditory information can be relayed from “cortical” auditory areas to dopamine reward centers in the midbrain. Identification of this pathway is of fundamental importance not only for our understanding of vocal learning in songbirds but more generally for setting up a framework for understanding how the dopamine system might be involved in the evaluating self-performance, a question that surprisingly has received little attention to date. The significance of this study lies not only in the identification of this new pathway, but more importantly in the evidence they provide that this pathway carries information from higher-order auditory neurons about auditory feedback error (i.e. neurons in AIV only respond to auditory feedback when feedback is perturbed) and that interruption of this pathway prevents error based vocal learning. Suggestion that this pathway might exist has been hinted at over the past two years from a number of different (behavioral, anatomical and physiological) studies but its existence remained elusive. This careful and extremely thorough study provides a significant step forward in the study of vocal learning and reinforcement learning more generally.

This study can roughly be divided into three parts:

Part I: The authors use targeted injections of retrograde and anterograde tracers to identify a major area in the arcopallium (just ventral of RA, which they call AIV (ventral intermediate arcopallium) that projects directly to the VTA. The authors then show that the AIV receives many major projections from higher-order auditory forebrain areas such as CM, L1 and HVC-shelf. Of significance is that one of these areas, CM, has been suggested to respond selectively to auditory feedback error.

Part II: To test the role this pathway might play in processing auditory feedback error, and to settle potential controversies in the field regarding an area (Ad) which lies adjacent to the AIV, the authors then perform a number of careful targeted lesions to both the AIV and Ad and test the effect of these lesions on song performance and learning. The authors show that lesions of AIV (but not Ad) in juvenile birds prevents vocal imitation of the tutor song even though birds are able to sing songs that have some degree of stereotypy. Lesions do not appear to affect adult song but are able to delay song degradation caused by deafening, a mechanism that is known to require the basal-ganglia forebrain pathway.

Part III: To evaluate the type of information that might be carried by this pathway, the authors use (mostly) antidromic stimulation to record from VTA-projecting neurons in AIV. These neurons show little responsiveness during normal singing and respond weakly, and with relatively long latencies (∼100 msec), to passively presented white noise bursts. However, when short noise bursts are presented during singing (therefore introducing an error in the feedback signal), neurons respond reliably with a short latency response. This response is specific to noise presented during singing and is not mimicked by presentation of noise during passive presentation of the bird's own song.

In general, I find this study compelling and very important and I only have one major concern:

One of the assumptions in this study is that the CM → AIV → VTA pathway “is computing or transmitting a fast online signal to VTA/SNc that potentially carries information about recent song performance”. As such each instance of song where feedback does not match motor intent, there should be a neural response in the neurons. From the data presented in Figure 5G and 5J, this seems to be the case but it is difficult to be sure. This is an important question, because an alternate hypothesis is that this pathway computes the accumulation of feedback error if it occurs in a consistent manner rather than occurring online. If the neurons are responding to an accumulation of error, then one might expect responses to white noise bursts to change (increase of decrease) across trials. On the other hand, if this pathway is truly processing/monitoring error online, then responses should be similar independent of trial number. To address these potentially competing hypotheses, it would be useful for the authors to quantify responses in a slightly different ways to resolve these issues. It is likely that such analysis can be done on the data they already have.

Reviewer #3:

Mandelblat-Cerf et al. (MC et al.) investigate the connections and function of a 'cortical' area in the songbird brain that sits near the song motor nucleus 'RA'. The authors confirm and extend previous findings (especially from Gale and Perkel, and Bottjer) that neurons in this region send projections to the VTA and Substantia Nigra, and that they receive inputs from other 'cortical' structures, including auditory regions. They define the region that projects to the VTA/SN as 'AIV' and go on to demonstrate that lesions of AIV in juvenile birds cause development of abnormal song as evaluated at 90 d of age, and lesions in adults reduce deafening induced song plasticity. Additionally, they show that neurons in AIV exhibit response properties to feedback perturbation that suggest a possible role in registering vocal errors and contributing to song learning and plasticity. Overall, there is an impressive amount of data in this study that is consistent with the possibility that this brain region plays an important role in feedback guided song learning and plasticity. The authors note that the anatomically and physiologically defined outputs from AIV to the VTA/SN additionally raise the possibility that such learning depends on descending auditory evaluation signals that may reach the song system via brainstem circuitry. The study adds to our understanding of the neural substrates that contribute to vocal learning in the songbird and thereby sheds light on more general mechanisms of feedback evaluation and learning.

1) One major concern about the current study is how the results relate to previous work (Bottjer and Altenau, Nature Neuroscience, 2010) that assessed the effects of lesions in an adjacent/overlapping brain region. The B&A study performed lesions in a very similar brain region, that they termed 'Ad', and reported what seem to be very similar behavioral results. While CM et al. carry out some control lesions in an area that they term 'Ad' (and do not see behavioral effects reported by B&A) it is unclear whether these lesions correspond to the 'Ad' of B&A. Broadly, I found it very difficult to figure out whether CM et al. and B&A are examining the same brain regions (and calling them different things) versus examining different brain regions. This is a key issue that needs to be more clearly addressed and discussed.

2) An additional and related area of concern is that of the specificity of the chemical lesions used in the study, and specifically what damage is responsible for the observed behavioral deficits. The AIV as defined by the authors has multiple distinct regions (with different connectivity) some of which are spatially adjacent to or overlapping not only with the region termed 'Ad' by B&A, but also the 'RA cup' region and RA itself. The complex and tight packing of multiple brain regions around RA make it difficult to assess from the data that are presented what are the subdivisions of AIV, where are its boundaries, and whether the effects of lesions should be attributed specifically to damage to one or more components of AIV or to damage of other structures in the vicinity. These concerns are elaborated further below.

B&A examined the consequences of 'Ad' lesions placed lateral and ventral to RA on song learning that appear to significantly overlap with the locations of 'AIV' lesions in the current study. Also, the current study and the B&A study appear to report similar effects of these lesions (though see more questions on this important point below). B&A termed the region that they lesioned 'Ad' based on prior anatomical studies that identified several afferent and efferent projections to this region. These included projections from LMAN shell and from NCL, and included projections to VTA. Because MC et al. define AIV as the VTA/SN projecting regions surrounding RA, it would seem that the AIV as so defined includes Ad? One portion of AIV that is lesioned by MC et al. also receives projections from NC and sends projections to VTA. Hence, this portion of AIV (which appears to make up a significant part of its bulk), shares some major defining characteristics with the similarly located Ad. I presume that this portion of AIV might be distinguished (as far as anatomy is concerned) by identifying a region from which LMAN-shell afferents are excluded? I am not sure whether the authors have examined this possibility, or can otherwise clarify this issue. However, the anatomy components of the study (especially as they relate to lesion locations and locations of neural recordings) require more clarification before the relationship between the current study and previous work can be understood and evaluated.

3) The lesion coordinates for 'AIV' in MC et al. appear to overlap significantly with the lesion coordinates for 'Ad' in B&A. For example, both studies include lesions that are several hundred microns lateral to the lateral most edge of RA, approximately at the depth of the center of RA (as well as more ventrally), and at approximately the R-C center of RA (as well as more rostrally and caudally). With the listed coordinates for both studies it would seem plausible if not likely that the two studies have lesioned overlapping brain regions. Additionally, MC et al. carry out separate control 'Ad' lesions (intended to test whether the reported effects could be due to lesions of Ad rather than AIV. However, the R-C and M-L locations of these Ad lesions (in MC et al.) fall into this same region lateral to RA. The D-V locations of these lesions are not provided, but instead the authors note that Ad lesions were made at depths that corresponded to the electrophysiologically defined boundaries of Ad. The authors should provide the D-V locations of Ad lesions, and some validation of the physiological criteria for defining the boundaries of Ad. I suppose that the authors' intent with their Ad lesions was to show that lesions restricted to the region lateral to RA (that may have been lesioned in both the 'AIV' lesions of CM et al. and the 'Ad' lesions of B&A) are insufficient to elicit the behavioral deficits associated with the 'AIV' lesions. However, because the authors use a lower concentration (and volume) of NMA for their Ad lesions than for their AIV lesions (1% versus 2%) this argument needs to be bolstered with some careful analysis of histology to establish exactly what was destroyed in each experiment. If I understand, the authors' argument is that lesions to 'AIV' that exclude the region lateral to RA (where they made ineffective 'Ad lesions') would be effective for disrupting learning. Are there examples with this pattern of damage? More broadly, the authors need to show us more of the histology that is associated with their lesions of AIV and of Ad and provide more detail on the criteria used for distinguishing between these regions, and also provide some more help in determining the relationship between the current study and prior work.

4) Other concerns about the lesions include the possibility of damage to RA. The authors note that 2 birds were excluded for which RA was substantially damaged by AIV lesions. However, other birds that had lesions of up to 5% of RA were included as 'AIV lesions'. Could this have contributed to behavioral deficits? Again, the close packing of structures in the vicinity of RA, and the difficulty of defining the borders of chemical lesions, raises concerns over the degree to which the observed effects can be attributed specifically to lesions of the anatomically defined AIV.

5) Effects of lesions on amount of singing. The authors dismiss the possibility that the poor learning in AIV lesioned birds could have occurred because lesions reduced the amount of singing by juvenile birds during the learning period: ”Importantly, the impaired vocal imitation in AIV-lesioned birds cannot be attributed to the amount of vocal practice. We compared the amount of singing in the period from surgery up to 90dPh for AIV-lesioned birds versus AD-lesioned controls. No difference was observed...” I believe that most people would read this as an indication that AIV lesions did not reduce the amount of singing. However, the supplementary material appears to show that birds with AIV lesions sang significantly less than intact control birds (though this is not reported, it looks like there is about a 30% reduction, on average, in amount of singing in AIV lesioned birds). If this is the case, the authors should note this important observation in the main text and address how this influences interpretation of lesions results with respect to deficits of learning and song maturity as assessed at 90d of age. Given the large amount of individual variation.

6) Effects of lesions on learning: I found it difficult to get a sense of how song learning was affected in the AIV lesion birds. The authors devise a 'tutor imitation score' that is a single number intended to capture how closely the juvenile songs (at 90d of age) match the tutor song (in terms of both the copying of individual notes and of the sequencing of notes). Because this measure has not been used in previous studies of birdsong, it is difficult to interpret. The authors provide as reference the imitation score for unrelated adult songs, and suggest that AIV lesion birds are almost as dissimilar to the tutor song as unrelated adult birds' songs are to each other. However, there are many ways in which the quality of match between experimental and tutor songs may differ and it is important in reporting results and for purposes of comparing with previous studies for the authors to figure out how to provide more clarity about how the AIV songs differ from the tutor songs. For example, deafening, lesions of LMAN, lesions of X and lesions of Ad (in B&A) in juvenile birds all appear to disrupt imitation, but in different ways. CM et al., suggest that the nature of song disruption caused by their AIV lesions may be similar to the effects of X-lesions in juvenile birds (as reported by Scharff and Nottebohm). However, the single example that is shown (Figure 3C) does not look particularly like the examples of X-lesion birds in Scharff's study.

This may be because this example comes from an AIV lesion bird that is less disrupted than most. From supplementary data it appears that this AIV lesion bird has an imitation score of about .21, while its control sibling has an imitation score of about .24. Hence, this example of disrupted learning from an AIV lesion bird shows very modest disruption (this AIV lesion bird has a better tutor imitation score than the majority of control birds!). It would be helpful if the authors presented spectrograms from more birds to give a clearer qualitative sense of what is going in, and some context for better understanding what different values of the imitation score (and other new measures introduced here) mean. This should include examples of good and bad imitation from controls and from AIV lesion (and perhaps also Ad lesion) birds. For example, do the control birds that have imitation scores around 0.1 and below look as dissimilar to the tutor song as do the unrelated adults? Are these control birds disrupted in a qualitatively similar manner to the AIV lesion birds that also have scores in that range? Regardless of what measures the authors use, they need to more clearly convey to the reader the extent to which the effects of AIV lesions parallel the effects of X-lesions (as reported by Scharff) and/or Ad lesions (as reported by B&A).

eLife. 2014 Jun 16;3:e02152. doi: 10.7554/eLife.02152.018

Author response


Overview:

In this manuscript the authors present careful anatomical, electrophysiological, and behavioral analyses in songbirds of a brain area they term ventral intermediate arcopallium (AIV), which receives auditory cortical input and has output to the dopaminergic VTA/SNc. They further show that AIV neurons that are demonstrated to project to VTA by antidromic activation respond to disruptive auditory feedback during singing but not during the playback of the bird's own song or the bird's own song with disruptive auditory feedback by in vivo single unit (and multiunit) recording in awake unrestrained birds during singing and quiescence. They also show that lesions of the AIV leads to deficits in vocal learning but do not alter song in mature birds. The data set presented is extensive and the analyses quantitative. The AIV neurons described provide an opening to the dopaminergic reward system that promises in future to help elucidate how auditory feedback can alter vocal output: a significant problem in song birds and humans. This analysis will generate wide interest in the songbird and sensory motor integration communities.

Concerns:

There are several concerns in the reviews by the BRE and the external reviewers that are provided to the authors to help them in revision. To focus this revision and remove any apparent conflict in the reviews, a consensus has formed about three areas.

Anatomy:

The reviewers are aware that as work has proceeded in the songbird, focus has shifted from distinct nuclei like RA or X to more diffusely organized areas like the area described here as AIV. AIV must be as precisely delimited as is possible and the authors should be at pains to do so. The anatomy presented in support of AIV must be discussed in the context of past findings, particularly Bottjer and Altenau (2010) (Parallel pathways for vocal learning in basal ganglia of songbirds. Nature Neuroscience 13, 153-155.), even if this makes sections of the Discussions more for birdsong experts.

We have very precisely defined the boundaries of AIV in terms of the extent of neurons retrogradely-labeled from VTA/SNc. While the extent of AIV is quite reliable from bird-to-bird relative to nucleus RA, all of our anatomical work identifying the inputs to AIV is based on retrograde-labeling of neurons from VTA/SNc.

Although we have found no clear cytoarchitectonic features (visible without retrograde labeling) by which to identify the boundaries of AIV, we believe the retrograde labeling provides a practical approach to defining AIV. Fortunately, injections into VTA/SNc are relatively straightforward, as the thalamic nucleus Ovoidalis can be used as a local reference point.

As described below, we have substantially expanded our description of the relation between AIV and Ad.

It would also serve the community if some other nearby areas could be differentiated from or included in AIV e.g. RA cup. This is not to set previous work as some anatomical standard but rather to motivate further clarification that will be healthy for the field.

Definitions of RA cup rely on two types of information: 1) Afferents to RAcup from primary auditory fields L1, L3, and auditory area HVC shelf. 2) Efferents from RAcup to thalamic and midbrain areas Ovoidalis shell and MLD. We have a large amount of data now, from double labeling studies, suggesting that all of the regions in ventral arcopallium identified by these earlier studies correspond to different partially-overlapping or even non-overlapping regions. Thus, in our view now, there is no such thing as ‘RA cup’ per se, but rather a complex heterogeneous pattern of auditory-related projections into and out of ventral arcopallium.

In our study, we have shown the relation between AIV neurons and projections from HVC-shelf and L1, two of the defining features of RAcup, and have identified a new input to the ventral arcopallium from auditory area CM.

An effort to further elucidate the relation between AIV and RA cup is ongoing. We are now focusing on the efferents from the ventral arcopallium to Ov-shell and MLD. We fully agree with the reviewers that this needs to be done, and it will be done. However, this is a substantial effort, with another set of 3-4 anatomical studies involving double or triple labeling. We feel that this new work is beyond the scope of the present paper.

Not only should the authors specify exactly what the boundaries of AIV are but they should thoroughly discuss how it is related to Ad. They need to state specifically why they think they have lesioned one and not the other and how the AIV vs. Ad lesions lead to different or similar behavioral results. They should try to explain discrepancies in their data and that from Bottjer and Altenau (2010).

We have added additional anatomical findings that more clearly elucidate the relation between AD and AIV. Our data suggest that the effects of AD lesions reported by Bottjer and Altenau were the result of unintended lesions of AIV in that study. We have now extensively expanded the discussion of the earlier findings on Ad, and specifically addressed an explanation for the discrepancies.

Many of the projections described to AIV have been previously described, at least in part. The projection from CM to AIV is novel and potentially most functionally significant for the physiological responses of AIV to VTA projecting neurons; it should be highlighted.

We are delighted that the reviewer appreciates the novelty of the finding that CM projects to AIV. The Discussion already includes an extensive paragraph on the anatomical and function significance of this finding. However, we have added a sentence to the Results section pointing out the novelty of this result.

Behaviour:

The authors should substantially increase the presentation of their primary data on song that is encompassed in their imitation score. There are no figure limitations in eLife, so it is possible to present more sonograms so that other workers can more easily judge how song is affected or not affected by a lesion. In addition to changes in imitation score, they should give a qualitative description of how song is affected by their lesions. They should then thoroughly discuss how the results are similar to or different than Ad and X lesions by others.

In order to incorporate more sonograms, we have split the old Figure 3 into two new figures. The first of these, now Figure 4, shows four sets of sonograms from lesioned birds. (Two of these have non-lesioned siblings from the same tutor.) The second of the new figures, now Figure 5, shows the quantitative analysis of song imitation.

We have also added additional qualitative descriptions of how song is affected, specifically the apparent high incidence of atypical non-song-like vocal elements.

However, the incidence of such vocal elements is difficult to quantify, and we would emphasize that we have been justifiably cautious about making such statements.

Physiology:

There are a number of specific comments by Reviewer #2 that should be addressed.

See below for response to these comments.

The Abstract is written quite conservatively and is highly appropriate; it might be best to adjust the title so that it more matches the conservative tone of Abstract.

The title has been changed: ‘A role for descending auditory cortical projections in songbird vocal learning’

Other issues to address:

Reviewer #1:

1) The work is carefully done with detailed analysis and the figures appropriately illustrate the data. Nevertheless the figures are very complicated and could benefit from more detailed legends and perhaps better labeling. A tremendous amount of data is presented in supplementary figures. eLife discourages use of supplementary data and rather includes all important data in the text.

We have indeed presented a lot of data in the supplementary information and have now made an effort to move some of this to the main figures. We feel that all the essential data are now included in the main figures, and that the data presented in supplementary information only support those presented in the main figures, rather than representing significant findings on their own. We feel that moving more of these results or analysis to the main findings would be unduly burdensome to the reader.

At the same time, we feel that some readers will find the supplementary data and analysis extremely helpful, and would very much like to keep these as part of the paper.

The authors should ruthlessly select the relevant data from the supplementary data and present those integrated into the text. For example the lower panels in Figure 5—figure supplement 5 could be eliminated. I defer to the expert reviewers and the authors on what is essential, however.

We have moved parts of the old Supplementary Figure 2 to the main text as the new Figure 3. As requested, we have eliminated the lower panels of Figure 5—figure supplement 5 (now Figure 7—figure supplement 1)

2) A better general explanation (and figure) of the system than afforded in the Introduction, Discussion and by Figure 1A would greatly help the general reader especially in Discussion. I fully understand why Figure 1A is simplified to explain the anatomical experiments, but by the time we move beyond these experiments through Results and in Discussion the acronyms for brain areas start to fly thick and fast and the general reader like myself has a difficult time following. For example, Figure 1A shows lateral caudal nidopallium (NCL) (without definition in Figure 1 legend), which is not mentioned until the Discussion but ignores LMAN (first mentioned with no definition or explanation). RA is mentioned early (first mentioned without definition; also not defined in Figure 1 legend) and often, but its supposed function and anatomic definition is not given. I suggest that a more functional figure would aid discussion; one that includes all the brain areas mentioned (including, e.g., LMAN) with some at least minimal explanation of supposed function and clear anatomic definitions. Think of the general reader.

We have substantially augmented the explanation of the various song components. We have added a new panel to Figure 1 showing the classical song-control brain areas, and have been more generous with the explanations in the Discussion. Finally, the original Figure 1a erroneously labeled NC as NCL, which explains why this did not appear until the Discussion. We thank the reviewers for catching this, and apologize for the confusion it must have caused.

3) The writing is succinct but the paper is not always easy to follow and could benefit from more explanation for the general reader particularly in the Discussion.

Reviewer #2:

In general, I find this study compelling and very important and I only have one major concern:

One of the assumptions in this study is that the CM → AIV → VTA pathway “is computing or transmitting a fast online signal to VTA/SNc that potentially carries information about recent song performance”. As such each instance of song where feedback does not match motor intent, there should be a neural response in the neurons. From the data presented in Figure 5G and 5J, this seems to be the case but it is difficult to be sure. This is an important question, because an alternate hypothesis is that this pathway computes the accumulation of feedback error if it occurs in a consistent manner rather than occurring online. If the neurons are responding to an accumulation of error, then one might expect responses to white noise bursts to change (increase of decrease) across trials. On the other hand, if this pathway is truly processing/monitoring error online, then responses should be similar independent of trial number. To address these potentially competing hypotheses, it would be useful for the authors to quantify responses in a slightly different ways to resolve these issues. It is likely that such analysis can be done on the data they already have.

We thank the reviewer for this important suggestion. We have analyzed the data as suggested: i.e. look to determine if the error-related responses increase or decrease over the period of exposure to noise bursts. Our analysis shows that the responses do not show a significant trend over time (no significant correlation of response with trial number, F-statistics p>0.11 for all neurons). Furthermore, the responses are statistically indistinguishable in the first 20 trials and last 20 trials. (Ranksum test, p>0.15)

Reviewer #3:

1) One major concern about the current study is how the results relate to previous work (Bottjer and Altenau, Nature Neuroscience, 2010) that assessed the effects of lesions in an adjacent/overlapping brain region. The B&A study performed lesions in a very similar brain region, that they termed 'Ad', and reported what seem to be very similar behavioral results. While CM et al. carry out some control lesions in an area that they term 'Ad' (and do not see behavioral effects reported by B&A) it is unclear whether these lesions correspond to the 'Ad' of B&A. Broadly, I found it very difficult to figure out whether CM et al. and B&A are examining the same brain regions (and calling them different things) versus examining different brain regions. This is a key issue that needs to be more clearly addressed and discussed.

We have carried out extensive anatomical characterization of the relation between AIV and Ad. AIV is defined as a region in the ventral arcopallium containing neurons projecting to VTA and SNc. Ad is defined as a region receiving afferents from the shell-region around LMAN (LMANshell). The distinction between these areas is most clearly visible in coronal slices (Figure 1F and 5A). To show this relation more clearly in this paper, we have carried out additional experiments using double labeling (n=3 birds, 6 hemispheres). Anterograde tracer (dextran-Alexa488) was injected into LMANshell and retrograde tracer (CTB-Alext647) was injected into VTA/SNc. Anterogradely-labelled fibers were visible in a band lateral to RA, consistent with earlier descriptions of Ad. Neurons retrogradely-labelled from VTA/SNc were visible in a ‘wedge’ ventral and medial to Ad. Very few neurons retrogradely labeled from VTA/SNc were seen within AD.

Furthermore, injections of retrograde tracer into AIV revealed no evidence of labeled neurons in LMAN-shell. Thus, the anterograde and retrograde tracing experiments suggest that Ad and AIV are distinct circuits with non-overlapping patterns of afferents and efferents.

2) An additional and related area of concern is that of the specificity of the chemical lesions used in the study, and specifically what damage is responsible for the observed behavioral deficits. The AIV as defined by the authors has multiple distinct regions (with different connectivity) some of which are spatially adjacent to or overlapping not only with the region termed 'Ad' by B&A, but also the 'RA cup' region and RA itself. The complex and tight packing of multiple brain regions around RA make it difficult to assess from the data that are presented what are the subdivisions of AIV, where are its boundaries, and whether the effects of lesions should be attributed specifically to damage to one or more components of AIV or to damage of other structures in the vicinity. These concerns are elaborated further below.

We have carried out preliminary tracing experiments using double-labeling to characterize the relation between AIV and RA-cup. Our results suggest that the various afferents and efferents previously used to define RA cup do not correspond to a single area, but are rather a set of partially-overlapping inter-related areas. Some of these areas overlap with AIV, as we have defined it, and some do not. Our efforts to elucidate the relation between AIV and RAcup are ongoing, and we believe that the present manuscript will provide strong motivation for future work in this area.

We emphasize that our definition of AIV is very precise: ‘The region of arcopallium retrogradely labeled by injection of retrograde tracer into VTA/SNc.’

B&A examined the consequences of 'Ad' lesions placed lateral and ventral to RA on song learning that appear to significantly overlap with the locations of 'AIV' lesions in the current study. Also, the current study and the B&A study appear to report similar effects of these lesions (though see more questions on this important point below). B&A termed the region that they lesioned 'Ad' based on prior anatomical studies that identified several afferent and efferent projections to this region. These included projections from LMAN shell and from NCL, and included projections to VTA. Because MC et al. define AIV as the VTA/SN projecting regions surrounding RA, it would seem that the AIV as so defined includes Ad? One portion of AIV that is lesioned by MC et al. also receives projections from NC and sends projections to VTA. Hence, this portion of AIV (which appears to make up a significant part of its bulk), shares some major defining characteristics with the similarly located Ad. I presume that this portion of AIV might be distinguished (as far as anatomy is concerned) by identifying a region from which LMAN-shell afferents are excluded? I am not sure whether the authors have examined this possibility, or can otherwise clarify this issue. However, the anatomy components of the study (especially as they relate to lesion locations and locations of neural recordings) require more clarification before the relationship between the current study and previous work can be understood and evaluated.

The reviewer is correct that the anatomical relation between AIV and Ad must be described before it is possible to understand our findings in comparison to the Altenau and Bottjer. We have done this very carefully. Afferents from LMAN-shell overlap minimally with neurons retrogradely-labeled from VTA/SNc. This might suggest that the claim that Ad projects to VTA could be an artifact of leakage of tracer from Ad into neighboring AIV.

As noted above, we have re-examined our earlier histological material in which retrograde tracers were injected into AIV. We found no evidence for retrogradely-labeled neurons in the vicinity of LMAN (LMAN-shell). Thus both the anterograde and retrograde tracing experiments strongly support the conclusion that AIV and Ad are anatomically distinct circuits, with little or no overlap in either afferent or efferent connections.

3) The lesion coordinates for 'AIV' in MC et al. appear to overlap significantly with the lesion coordinates for 'Ad' in B&A. For example, both studies include lesions that are several hundred microns lateral to the lateral most edge of RA, approximately at the depth of the center of RA (as well as more ventrally), and at approximately the R-C center of RA (as well as more rostrally and caudally). With the listed coordinates for both studies it would seem plausible if not likely that the two studies have lesioned overlapping brain regions. Additionally, MC et al. carry out separate control 'Ad' lesions (intended to test whether the reported effects could be due to lesions of Ad rather than AIV. However, the R-C and M-L locations of these Ad lesions (in MC et al.) fall into this same region lateral to RA. The D-V locations of these lesions are not provided, but instead the authors note that Ad lesions were made at depths that corresponded to the electrophysiologically defined boundaries of Ad. The authors should provide the D-V locations of Ad lesions, and some validation of the physiological criteria for defining the boundaries of Ad. I suppose that the authors' intent with their Ad lesions was to show that lesions restricted to the region lateral to RA (that may have been lesioned in both the 'AIV' lesions of CM et al. and the 'Ad' lesions of B&A) are insufficient to elicit the behavioral deficits associated with the 'AIV' lesions. However, because the authors use a lower concentration (and volume) of NMA for their Ad lesions than for their AIV lesions (1% versus 2%) this argument needs to be bolstered with some careful analysis of histology to establish exactly what was destroyed in each experiment.

The reviewer is presumably concerned here with the possibility that the Ad lesions may have been too small to have an effect on vocal learning. We point out that the protocol for lesioning Ad was validated in three different birds. The pattern of lesion injections was targeted by first electrophysiologically mapping Ad, in each lesioned bird, based on its distinctive bursting activity (similar to that observed in RA). The lesions were confirmed histologically after the birds reached adulthood.

Finally, in an attempt to reproduce the results of Bottjer and Altenau, we carried out an additional set of experiments in which we made complete lesions of Ad. These birds exhibited severe akinesia and immobility, such that the experiments had to be terminated and song imitation could not be assessed. These findings are consistent with the hypothesis (Jarvis, 2008) that Ad is involved in locomotion and other motor behaviors, rather than song learning.

Given the reviewers’ concern about the relation between our findings and those of Bottjer and Altenau (2010), we have now reported the results of these larger lesions.

If I understand, the authors' argument is that lesions to 'AIV' that exclude the region lateral to RA (where they made ineffective 'Ad lesions') would be effective for disrupting learning. Are there examples with this pattern of damage? More broadly, the authors need to show us more of the histology that is associated with their lesions of AIV and of Ad and provide more detail on the criteria used for distinguishing between these regions, and also provide some more help in determining the relationship between the current study and prior work.

We assessed our lesions of AIV in saggitally-sectioned tissue, because it is much easier to identify the boundaries of AIV in these sections, where RA serves as a clearly-visible landmark. However, it is not possible to assess the impact of these lesions on Ad in sagittal sections, which is lateral to RA. Therefore, we are not able to produce the evaluation, requested by the reviewer, of the impact of our ‘AIV lesions’ on Ad in the experimental birds.

However, we carried out a careful calibration study, prior to our lesion experiments, showing that our AIV lesion protocol had minimal impact on AD, as assessed in coronally-sliced tissue in which Ad was labeled by anterograde tracing from LMAN-shell. Furthermore, our Ad lesion control experiments were specifically designed to address this issue. Again, we found no effect of Ad lesions on vocal imitation.

4) Other concerns about the lesions include the possibility of damage to RA. The authors note that 2 birds were excluded for which RA was substantially damaged by AIV lesions. However, other birds that had lesions of up to 5% of RA were included as 'AIV lesions'. Could this have contributed to behavioral deficits? Again, the close packing of structures in the vicinity of RA, and the difficulty of defining the borders of chemical lesions, raises concerns over the degree to which the observed effects can be attributed specifically to lesions of the anatomically defined AIV.

In only three of the birds included as ‘AIV lesions’ did the lesions have any impact on RA. In these cases, a thin rim of RA was impacted, and only in a few of the 100um sections was this damage observed. Furthermore, in these cases only one hemisphere was involved. We have reanalyzed the behavioral results with these birds removed from the dataset, and our findings are still highly significant (p=0.0084).

5) Effects of lesions on amount of singing. The authors dismiss the possibility that the poor learning in AIV lesioned birds could have occurred because lesions reduced the amount of singing by juvenile birds during the learning period: “Importantly, the impaired vocal imitation in AIV-lesioned birds cannot be attributed to the amount of vocal practice. We compared the amount of singing in the period from surgery up to 90dPh for AIV-lesioned birds versus AD-lesioned controls. No difference was observed...” I believe that most people would read this as an indication that AIV lesions did not reduce the amount of singing. However, the supplementary material appears to show that birds with AIV lesions sang significantly less than intact control birds (though this is not reported, it looks like there is about a 30% reduction, on average, in amount of singing in AIV lesioned birds). If this is the case, the authors should note this important observation in the main text and address how this influences interpretation of lesions results with respect to deficits of learning and song maturity as assessed at 90d of age. Given the large amount of individual variation.

The reviewer is correct that the AIV lesions and the AD lesions had an effect on the total amount of singing compared to unmanipulated birds. This is not surprising, given that both of groups of lesioned birds underwent a significant surgical manipulation, while the unmanipulated birds had no surgery. However, we found that the AD lesioned birds showed a reduction in the amount of singing, but no deficits in song learning. Furthermore, we found no correlation between the amount of singing and the degree of song imitation. We believe we have properly controlled for any effect of amount of singing on song imitation. These points have been raised in the results section.

6) Effects of lesions on learning: I found it difficult to get a sense of how song learning was affected in the AIV lesion birds. The authors devise a 'tutor imitation score' that is a single number intended to capture how closely the juvenile songs (at 90d of age) match the tutor song (in terms of both the copying of individual notes and of the sequencing of notes). Because this measure has not been used in previous studies of birdsong, it is difficult to interpret. The authors provide as reference the imitation score for unrelated adult songs, and suggest that AIV lesion birds are almost as dissimilar to the tutor song as unrelated adult birds' songs are to each other. However, there are many ways in which the quality of match between experimental and tutor songs may differ and it is important in reporting results and for purposes of comparing with previous studies for the authors to figure out how to provide more clarity about how the AIV songs differ from the tutor songs.

Song imitation is a highly variable process: different birds normally copy different aspects of the tutor song. Furthermore, different pupil birds have different degrees of acoustic and sequence stereotypy. This makes it very difficult to convey, in a few examples, the precise nature of the imitation. We have selected two aspects of song imitation on which to focus our quantification (acoustic similarity and sequence similarity with the tutor song). We have developed a largely automated procedure for quantifying these measures, and a paper describing this procedure has now been published in PLOS One. The Matlab code for implementing this algorithm was made publicly available.

For example, deafening, lesions of LMAN, lesions of X and lesions of Ad (in B&A) in juvenile birds all appear to disrupt imitation, but in different ways. CM et al., suggest that the nature of song disruption caused by their AIV lesions may be similar to the effects of X-lesions in juvenile birds (as reported by Scharff and Nottebohm). However, the single example that is shown (Figure 3C) does not look particularly like the examples of X-lesion birds in Scharff's study.

This may be because this example comes from an AIV lesion bird that is less disrupted than most. From supplementary data it appears that this AIV lesion bird has an imitation score of about .21, while its control sibling has an imitation score of about .24. Hence, this example of disrupted learning from an AIV lesion bird shows very modest disruption (this AIV lesion bird has a better tutor imitation score than the majority of control birds!). It would be helpful if the authors presented spectrograms from more birds to give a clearer qualitative sense of what is going in, and some context for better understanding what different values of the imitation score (and other new measures introduced here) mean. This should include examples of good and bad imitation from controls and from AIV lesion (and perhaps also Ad lesion) birds. For example, do the control birds that have imitation scores around 0.1 and below look as dissimilar to the tutor song as do the unrelated adults? Are these control birds disrupted in a qualitatively similar manner to the AIV lesion birds that also have scores in that range? Regardless of what measures the authors use, they need to more clearly convey to the reader the extent to which the effects of AIV lesions parallel the effects of X-lesions (as reported by Scharff) and/or Ad lesions (as reported by B&A).

As requested, we have added additional examples of song spectrograms in AIV-lesioned birds. However, we feel that the quantitative procedure we have developed is much more informative, as it captures the quality of many hundreds of song renditions rather than just one or two examples.

As the reviewer suggests, we have also examined the songs of the two low-performing control birds. Their songs appear to be quite dissimilar to their tutor songs, and qualitatively speaking, are as different from their tutors as unrelated adults. The songs of these controls birds differ from their tutors in a manner that is not inconsistent with the way AIV-lesioned birds exhibited poor imitation performance, including a lack of acoustic and sequence similarity and high variability. We should note here that the imitation of tutor song by zebra finches can be variable, particularly in birds that are isolated from their tutors at an early age, as was done in our experiments. We don’t feel that adding a few examples of song spectrograms from good and bad imitators in each experimental group will clarify our findings. Our claims are based on an automated quantitative assessment of song imitation and variability.

Our statement that AIV lesions resemble lesions of Area X was based on the fact that these lesions in juvenile birds result in poor imitation and higher song variability, while the same lesions in adult birds have little effect on learned song. We do not wish to claim that the AIV lesions produced the exact same effect as X lesions. There is no reason to expect such a precise correspondence of outcomes from lesions in these different brain areas. We have softened the wording in the relevant paragraph of the Discussion.

The reviewer raises an interesting point about the relation between the effects of AIV lesions and those of Ad lesions. The Bottjer and Altenau study claim that lesions of Ad have no immediate effect on song, produce poor imitation and increase song variability. These findings are consistent with our suggestion that the effects of lesions targeted by Ad reported by Bottjer and Altenau are the result of an unintended lesion of AIV.


Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

RESOURCES