Abstract
Songbirds learn and maintain their songs via auditory experience. Neurons in many telencephalic nuclei important to song production and development are song selective, firing more to forward auditory playback of the bird's own song (BOS) than to reverse BOS or conspecific songs. Elucidating circuits that generate these responses can localize where auditory experience influences vocalization, bridging cellular and systems analyses of song learning. Song-selective responses in many song nuclei, including the vocal premotor nucleus robustus archistriatalis (RA) and the basal ganglia homolog area X, are thought to originate in nucleus HVc (used as a proper name), which contains interneurons and relay cells that innervate either RA or area X. Previous studies indicated that only X-projecting neurons have auditory responses, leaving open the source of RA's auditory input and the degree to which song selectivity may be refined in HVc. Here,in vivo intracellular recordings from morphologically and electrophysiologically identified HVc neurons revealed that both relay cell types fire song-selectively. However, their firing arises via markedly different subthreshold processes, and only X-projecting neurons appear to be sites for auditory refinement. RA-projecting neurons exhibited purely depolarizing subthreshold responses that were highly song selective and that were excitatory. In contrast, subthreshold responses of X-projecting neurons included less-selective depolarizing and highly selective hyperpolarizing components. Within individual birds, these BOS-evoked hyperpolarizations closely matched interneuronal firing, suggesting that HVc interneurons make restricted inputs onto X-projecting neurons. Because of the two relay cell types' subthreshold differences, factors affecting their resting membrane potentials could enable them to transmit distinct song representations to their targets.
Keywords: HVc, in vivo intracellular, vocal learning, songbirds, song, auditory selectivity, communication, zebra finch
Songbirds and humans listen to their own vocalizations to develop and maintain learned communication sounds (Doupe and Kuhl, 1999). These processes are likely to require the integration of auditory and vocal circuitry. Indeed, in songbirds, neurons in many brain nuclei used for song production, development, and perception are song selective, firing more to forward auditory playback of the bird's own song (BOS) than to reverse BOS or conspecific songs (Margoliash, 1983; Doupe and Konishi, 1991; Janata and Margoliash, 1999). Elucidating the circuitry underlying song selectivity could reveal where audition influences vocalization, thus providing a cellular framework for vocal learning. In fact, auditory responses in many song-related nuclei may arise from two distinct relay cell classes in the nucleus HVc (used as a proper name), but the correspondence between auditory responsiveness and HVc cell type is poorly understood. This study uses in vivo intracellular recordings from identified HVc neurons to establish this correspondence and to probe the cellular mechanisms underlying song selectivity.
The telencephalic nucleus HVc is a major auditory–vocal interface. HVc exhibits singing-related premotor activity (McCasland, 1987; Yu and Margoliash, 1996), and many HVc neurons are song selective (Margoliash, 1986; Lewicki and Arthur, 1996; Volman, 1996; Theunissen and Doupe, 1998). Beyond affording a site for auditory–vocal integration, HVc is the probable source of auditory input to other nuclei important to song production, development, and perception. HVc is structurally heterogeneous, containing two relay cell types and interneurons, and functionally heterogeneous, because its two relay cell types give rise to anatomically separate pathways implicated in either audition-dependent vocal plasticity or vocal control (see Fig.2) (Nottebohm et al., 1976; Fortune and Margoliash, 1995; Foster and Bottjer, 1998). One relay cell type innervates area X, a basal ganglia homolog within the anterior forebrain pathway (AFP). The AFP, which is essential to juvenile and adult audition-dependent vocal plasticity (Bottjer et al., 1984; Scharff and Nottebohm, 1991; Brainard and Doupe, 2000) and may facilitate song perception (Scharff et al., 1998) is thought to derive auditory input from HVc (Vates et al., 1996). In agreement with this idea, in vivo intracellular recordings from X-projecting neurons reveal that they have song-selective auditory responses (Lewicki, 1996). Furthermore, certain X-projecting neurons display BOS-evoked hyperpolarizations (Lewicki, 1996), perhaps driven by HVc interneurons, that might refine these relay cells' song selectivity relative to HVc's auditory afferents (Lewicki and Arthur, 1996). However, the auditory properties of HVc interneurons and their interactions with either relay cell type are unknown. The second relay cell type innervates the nucleus robustus archistriatalis (RA), which controls the vocal and respiratory neurons used for singing (Vicario, 1993; Wild, 1993). Although RA has auditory responses requiring direct HVc input (Doupe and Konishi, 1991; Vicario and Yohay, 1993) and RA-projecting neurons afford attractive sites for potential auditory–vocal interaction, previous intracellular studies concluded they were nonauditory (Katz and Gurney, 1981; Lewicki, 1996). The difficulty in obtaining in vivo intracellular recordings from identified HVc neurons has obscured the nature of auditory information transmitted from HVc to other song nuclei, while impeding insight into the mechanisms underlying song selectivity.
Here, in vivo intracellular recordings from identified HVc neurons were used to measure their responses to BOS playback. These studies show that HVc's two relay cell populations generate similar song-selective firing via distinct subthreshold processes, which may arise via segregated input from highly selective interneurons onto X-projecting cells. Ultimately, factors affecting the two relay cell types' membrane potentials may unmask previously unidentified subthreshold differences, enabling distinct auditory representations of song to be transmitted by HVc.
MATERIALS AND METHODS
Subjects. Experiments were performed using 31 adult [85–440 posthatch days (PHD)] male zebra finches (Taeniopygia guttata) in accordance with a protocol approved by the Duke University Institutional Animal Care and Use Committee.
Stimuli. Before the experiment, songs were recorded from a male zebra finch placed with an adult female zebra finch in a recording chamber (Industrial Acoustics, Bronx, NY). Songs were amplified and low-pass filtered at 10 kHz, digitized at 20 kHz (National Instruments data acquisition board AT-MIO-16E2; Austin, TX), and stored on a hard drive. Songs were recorded and edited with LabView software (National Instruments; all custom software for this study was written by M. Rosen, F. Livingston, R. Neumann, and R. Balu). Edited songs included either one or two motifs (a motif is the largest repeated unit in the bird's song and usually comprises single and multinote syllables; zebra finch song bouts typically consist of several motifs). Stimuli presented always included the BOS and reverse BOS (i.e., song played backward) and were 1–3 sec in duration. Forward and reverse versions of the BOS contain the same spectral energy but differ in both their local and global temporal organization. Differential responsiveness to forward over reverse BOS can be used to measure neuronal sensitivity to time-varying auditory cues (see Data analysis below).
Preparatory surgery. Two days before electrophysiological recording, birds were food and water deprived for 1 hr, anesthetized with equithesin (2 mg/kg, i.m.: 0.85 gm of chloral hydrate, 0.21 gm of pentobarbitol, 0.42 gm of MgSO4, 2.2 ml of 100% ethanol, and 8.6 ml of propylene glycol brought to a 20 ml final volume with dH2O), and placed in a stereotaxic device (45° head angle; H. Adams, California Institute of Technology). The scalp was dissected along the midline, and HVc's location was determined using stereotaxic coordinates (approximately −0.2 mm caudal and 2.4 mm lateral, measured from the caudal edge of the bifurcation of the midsagittal sinus). A stainless steel post was mounted to the rostral part of the bird's skull with dental cement, the wound was closed with cyanoacrylate, antibiotic (Neosporin ointment) was applied, and the bird was warmed under a heat lamp (33°C) until recovery (2–4 hr).
In vivo electrophysiology and song presentation. On the morning of the day of the electrophysiological recording, birds were injected in the pectoral muscle with 20% urethane (75–105 μl total; Sigma, St. Louis, MO), administered in 25–35 μl doses at 30 min intervals. Birds were immobilized via the mounted post in a sound-attenuating chamber (Industrial Acoustics) on an air table (Technical Manufacturing Corporation, Peabody, MA); body temperature was maintained via an electric blanket at 37°C (Harvard Apparatus, Holliston, MA). After topical application of xylocaine (2%), the scalp was retracted, a small craniotomy (<300 μm wide) was made over HVc, and the dura was slit open with a fine insect pin (Minuten, Carolina Biological Supply).
Sharp electrodes (borosilicate glass, BF100-50-10; Sutter Instrument, Novato, CA) were pulled to yield a resistance of 100–250 MΩ when filled with 3 m K-acetate and 5% neurobiotin. A hydraulic microdrive (Soma Scientific, Irvine, CA) was used to lower electrodes into the nucleus (∼300 μm in depth). Brief (∼1 msec) capacitance overcompensation was used to “ring” the electrode to achieve entry into the cell. An AxoClamp 2B intracellular amplifier (Axon Instruments, Foster City, CA) was used in bridge mode to record intracellular potentials, which were low-pass filtered at 3 kHz, digitized at 10 kHz, and stored on a personal computer hard drive. HVc neurons were identified on-line by their firing properties in response to injected currents (see Results) and usually by spontaneous, intermittent, high-frequency bursts of action potentials and were verified histologically after the recording session. Cells were tested with auditory stimuli if their resting potentials were negative of −50 mV. Ten to thirty iterations of each auditory stimulus, delivered at intervals ranging from 6 to 10 sec, were presented at ∼70 dB (rms; A-weighting) through a speaker positioned 20 cm directly in front of the bird. Peristimulus time histograms (PSTHs; 25 msec bin width) and median-filtered averaged membrane potential traces were computed on-line (see Data analysis) to aid in experimental decisions.
When possible, after characterization of a cell's auditory responses to BOS playback, its intrinsic properties were studied. Hyperpolarizing responses to negative current pulses (−200 to −400 pA) were collected to estimate input resistance, and instantaneous and mean firing rates, as well as the latency to the first spike, were calculated in response to positive current pulses (+200 to 1000 pA; 1 sec duration) using custom LabView software. Three- to five-minute-long spontaneous membrane potential records were also collected, and a software event detector was used to estimate spontaneous firing rates. Action potential widths were measured at the base or shoulder of the spike, where the membrane potential described a sharp positive inflection. The amplitudes of spike after hyperpolarizations were measured from the spike shoulder to the trough of the hyperpolarization after the spike. Resting membrane potential was determined by subtracting any DC offset observed after electrode withdrawal from the membrane potential recorded during the 5–10 min before the end of the recording. All values are reported as the mean ± SEM; statistics and tests for statistical significance (other than those described in Data analysis) are reported in the figure legends, Table1, and/or the Results.
Table 1.
Cell type (n)* | Resting potential, mV | Rinput, MΩ | Spontaneous firing rate, Hz | Evoked firing rate, Hz/nA | Spike width, msec | AHP, mV | Latency to first spike, msec (number of spike onsets; +400 and +600 pA) |
---|---|---|---|---|---|---|---|
X-projecting (16) | −76 ± 2 | 48 ± 6 | 1.5 ± 0.4 (14) | 42 ± 2 | 2.0 ± 0.2 | 14.2 ± 0.6 | 28 ± 10 (40) |
RA-projecting (10) | −78 ± 3 | 50 ± 10 (9) | 0.6 ± 0.4 (8) | 24 ± 6 (9) | 1.8 ± 0.4 (9) | 13.4 ± 1.9 (9) | 172 ± 67 (14) |
p< 0.001, Xp p < 0.01, Int two-tailed t test | |||||||
Interneuron (11) | −66 ± 2 | 44 ± 6 | 12.0 ± 4.31-160 (6) | 172 ± 22 | 0.7 ± 0.06 | 17.6 ± 0.6 | 15 ± 9 (23) |
p < 0.01 vs RAp,Xp two-tailed t test | p < 0.0001, RAp, Xp, two-tailed t test | p < 0.01 vs RAp, p < 0.0001 vs Xp, two tailed ttest | p < 0.05 RAp, p < 0.001 Xp, two-tailedt test |
Numbers in this column apply to all other columns, except as noted in parentheses.
F1-160: Values in bold are significantly different from similar values in the other cell types.
Data analysis. The suprathreshold responsiveness (Rsupra) of cells with spiking activity was calculated by Rsupra =SFR −BFR, whereSFR andBFR are the firing rates during each stimulus presentation and during a 1.5 sec baseline period before each stimulus presentation, respectively. To assess subthreshold responsiveness in spiking and nonspiking cells, raw traces first were median-filtered (each point was replaced by the median value of the surrounding 50 points, equivalent to 5 msec at the 10 kHz sampling rate used here). Median-filtering removed the action potential (which was typically ∼1 msec in duration) and yet did not distort slower membrane potential movements [for an example, see Jagadeesh et al. (1997), their Fig. 1]. The subthreshold depolarizing responsiveness (RVm) of these cells was measured by RVm =Sarea −Barea, whereSarea andBarea are the integrals of the positive-going deviations in membrane potential either during (i.e.,Sarea) or before (i.e.,Barea) stimulus presentation relative to the mode membrane potential measured during the baseline period. That is, the total positive area during the baseline (measured from the mode) was subtracted from the total positive area during the stimulus (measured from the mode; see Fig. 9 for examples of how these subthreshold values were measured). The mode membrane potential was calculated for the baseline data array with an automated LabView routine and was used instead of the mean or median of the baseline data array because we observed that it gave the most reliable measure of the central tendency of the baseline membrane potential. Similar calculations were made for the subthreshold hyperpolarizing area, using the negative-going deviations in membrane potential from the baseline mode value. The net subthreshold hyperpolarization was multiplied by −1, so that a net increase in subthreshold hyperpolarization relative to the baseline would be represented by a negative number. It was possible for single cells to respond to a stimulus with an increase in both negative and positive area relative to baseline. Furthermore, strong hyperpolarizing responses to a stimulus could yield negative values for positive and negative area (e.g., when the baseline positive area was greater than the positive area evoked by the stimulus). Average Rsupra orRVm were computed for 10–30 stimulus iterations. Significance was determined with pairedt tests comparing stimulus-evoked suprathreshold, subthreshold depolarizing or subthreshold hyperpolarizing responses to corresponding baseline measures.
To compare suprathreshold and subthreshold responses, response strengths were expressed as z-scores. The suprathreshold z-score (Zsupra) is given by the difference between the average firing rate during stimulus presentation and that during a 1.5 sec baseline period before stimulus presentation, divided by the SD of this difference:
where FR is the mean firing rate during the stimulus,FR is the mean firing rate during the baseline period, and the denominator is the SD ofSFR −BFR. For nonspiking cells and median-filtered spiking cells, the subthreshold z-score (ZVm) is given by the difference between the average area during stimulus presentation and that during baseline, divided by the SD of this difference. TheZVm formula is the same as that for Zsupra with substitutions of area for FR, wherearea is the mean deviation in Vm (from the baseline mode, calculated separately for negative or positive area, as stated above) during song presentation and area is the mean deviation in Vm during baseline; the denominator is the SD of Sarea −Barea.
The selectivity of a given neuron for forward over reverse BOS playback was measured using the psychophysical metric d′ (Green and Swets, 1966), which estimates the discriminability between two stimuli. A difference in response to these two stimuli has been used previously as the criterion for the auditory selectivity of neurons in HVc, as well as in other song nuclei (Solis and Doupe, 1997; Theunissen and Doupe, 1998; Janata and Margoliash, 1999; Rosen and Mooney, 2000). Thed′ value comparing the response to BOS relative to reverse BOS is given by:
where d′supra represents suprathreshold responsiveness andd′Vm represents subthreshold responsiveness. is the mean value ofR (as described above), and ς2 is its variance. This measure of selectivity is similar to a ratio measure but takes into account both the mean and the variance of a cell's responses and can report negative values. A d′ value > 0.7 (or < −0.7, reflecting a relative excitatory bias toward reverse BOS) was used as the criterion for identifying a cell as “selective”; this corresponded to a significance level of p = 0.036 as measured by a paired t test comparingRVm orRFR values for 20 presentations of BOS versus reverse BOS. Note that d′ values for subthreshold responses (d′Vm) were calculated separately for positive and negative areas. Tests for statistical significance are reported in the figure legends, except where noted in the Results.
To quantify the relative spike timing of the different cell types during BOS playback, the action potential PSTH of an RA-projecting neuron was correlated (Origin, Microcal Corporation) with the PSTH of either an X-projecting neuron or interneuron recorded from within the same bird. To examine the relationship between spike timing and membrane polarity (i.e., interneuron spiking vs X-projecting neuronal polarity), the cross-correlation analysis was performed using the interneuron action potential PSTH and the median-filtered average membrane potential of an X-projecting cell from the same bird; the membrane potential trace was further divided into 25-msec-long average “points” [i.e., the trace was divided into sequential segments 250 points in length (25 msec at the 10 kHz sample rate), and the average of these 250 points was calculated to yield a single point every 25 msec, equaling the temporal resolution of the PSTH]. In each case, the cross-correlation analysis was performed on data collected during forward BOS playback. Each cross-correlation was performed using data collected from a single-cell pair within a single bird, each resultant cross-correlogram was normalized to its peak value, and the normalized values from a given pair type (e.g., RA-projecting vs X-projecting) were then averaged across all pairs that were obtained (i.e., across birds).
Histology. Cells were filled with neurobiotin using positive currents (+0.5 to +1 nA; 500 msec at 1 Hz). After the recording session, birds were deeply anesthetized with equithesin and transcardially perfused with 0.9% saline for 3 min, followed by 4% paraformaldehyde (PFA) in 25 mm sodium phosphate buffer for 30 min. Brains were removed and post-fixed in 4% PFA with 30% sucrose overnight, blocked sagittally, and sectioned on a freezing microtome at 60 μm. Sections either were processed using a standard HRP–DAB reaction technique (see Kittelberger and Mooney, 1999) or occasionally were visualized using avidin–Oregon green (Molecular Probes, Eugene, OR; sections were incubated overnight at 4°C in a 1:500 dilution of avidin–Oregon green and 0.4% Triton X-100 in 0.025 m PBS, rinsed four times for 20 min each in PBS, mounted, and coverslipped). Camera lucida drawings were made using both 10× and 63× (Zeiss; 1.3 nA) objectives. Cell bodies were traced under the 63× objective, and their diameters were measured along the major axis of the cell body using an eyepiece reticule. A neuron's dendritic extent also was estimated using the eyepiece reticule and was measured from the middle of the cell body to the tip of the longest dendrite in the same section. Only DAB-reacted cells were used for dendritic measurements; because of this, as well as some incomplete fills, only a subset of labeled cells were included for this analysis.
RESULTS
Intracellular recordings were made in the HVc of urethane-anesthetized adult male zebra finches to (1) establish the relationship between an HVc neuron's intrinsic electrophysiological properties and its morphological type, (2) use this information to determine which HVc neuron types are auditory, (3) examine the subthreshold events that contribute to song-selective action potential responses in different HVc neurons, and (4) enable within-bird comparisons between the song-evoked responses in different HVc neuron types.
Intrinsic properties of morphologically identified HVc neurons
Previous in vitro studies indicated that different morphological classes of HVc neurons in adult male zebra finches possess distinct intrinsic electrophysiological properties (Dutar et al., 1998). The existence of similar distinctions in vivowould enable sorting of the cells to class on the basis of their intrinsic properties alone, especially advantageous when intracellular recordings are too brief to afford good staining (<15 min) or when many cells are impaled and filled in a single nucleus, making an unambiguous assignment of an electrophysiological record to a dye-filled cell difficult. Therefore, the in vivo intrinsic properties of HVc neurons (morphologically identified post hoc via intracellular neurobiotin staining; see below) were first characterized by collecting spontaneous membrane potential records and by injecting them with hyperpolarizing and depolarizing currents (Fig.1A; n = 16 X-projecting, 10 RA-projecting, and 11 interneurons). The three morphological classes of HVc neurons had distinct intrinsic propertiesin vivo, differing in the distributions of their instantaneous spike frequencies in response to similar injected currents (Fig. 1A–C), their average firing frequencies in response to depolarizing current pulses (Fig.1D), their spontaneous firing rates, their action potential widths, the amplitude of their individual spike afterhyperpolarizations (AHPs), and the onset latencies to their first action potential during a depolarizing pulse (Table 1). Interneurons were readily distinguished from the two relay cell types by their extremely high-frequency, nonadapting action potential trains in response to depolarizing pulses (Fig. 1A–C), their markedly steeper average firing frequencies in response to these injected currents (Fig. 1D, Table 1), their narrower action potential widths, their larger spike AHPs, and their higher rates of spontaneous firing (Table 1). X-projecting neurons also fired relatively regular action potential trains when depolarized (Fig.1A), but with moderate spike frequency adaptation over the first 5–10 spikes (Fig. 1C), a shallower current-to-spike-frequency relationship (Fig. 1D, Table 1), broader action potentials, and lower rates of spontaneous firing (Table 1). Finally, unlike X-projecting neurons, RA-projecting neurons exhibited highly variable (Fig. 1A,C) spike timing during depolarizing current injection; these evoked responses could include very high-frequency bursts (>200 Hz: Fig.1A,B). In further contrast to both X-projecting neurons and interneurons, RA-projecting HVc neurons displayed firing behavior that could be described as “sluggish,” as reflected in their significantly longer and more variable latencies to first spike onset when injected with positive current (Table 1) and the trends these cells manifested toward both the lowest average current-to-frequency relationship (Fig. 1D, Table 1) and the lowest mean rates of spontaneous firing (Table 1). In practice, this sluggish behavior along with the highly irregular action potential discharge of RA-projecting neurons in response to moderate positive currents (approximately +500 pA) was the most useful feature for distinguishing them from X-projecting neurons on-line, whereas the fast-spiking behavior of interneurons enabled them to be readily distinguished from the two relay cell classes. Input resistances, as well as resting potentials, were not significantly different for the three cell classes (Table 1).
Morphology
In vivo intracellular neurobiotin staining revealed that the two HVc relay cell types differed morphologically from one another, as well as from interneurons (see Figs. 2-4). Differences among all three cell types included the presence or absence of a projection axon, the projection axon's termination, cell body size, dendritic extent, and the relative abundance or absence of dendritic spines. These differences are very similar to those that have been noted in previous morphological analyses of HVc neurons (Nixdorf et al., 1989; Fortune and Margoliash, 1995; Benton et al., 1998).
The main axons of X- and RA-projecting HVc neurons had different trajectories and termini, projecting either rostrally from HVc to area X or caudally to nucleus RA, whereas interneurons elaborated processes wholly restricted to HVc (see Figs. 2, 3). The main axons of almost all (14/16) X-projecting neurons traveled immediately in the rostral direction after exiting HVc (Fig.2C). These rostral trajectories were quite varied, including dorsal paths that initially ran along the ventricle, above the lamina hyperstriatica (LH), as well as more ventral routes within and between the LH and the lamina medullaris dorsalis. X-projecting axons that described the most dorsal trajectories turned sharply in the ventral direction close to the rostrocaudal plane of the lateral magnocellular nucleus of the anterior neostriatum (LMAN) and then traveled near or through LMAN en route to area X (an example of this is shown in Fig. 2C). In two cases, X-projecting axons traveled ventrally into the archistriatum after leaving HVc and then ran rostrally along the ventral floor of the telencephalon toward area X. Both of these ventral-going axons followed a more medial trajectory in their descent from HVc and were medial to RA at the point they initiated their rostral turn toward area X. In all RA-projecting neurons (10/10 cells), the main axon traveled ballistically in a caudal and ventral direction via the tractus dorsalis archistriatalis to RA (Fig. 2C). Both relay cell types elaborated extensive axon collaterals within HVc, indicating that they are likely to function as local circuit neurons in HVc as well as serving as relay cells to other song nuclei (Fig.2C). In contrast to the two relay cell types, HVc interneurons had processes entirely restricted to HVc; usually, an interneuron's dendrites and axons were difficult to distinguish from one another (Figs. 3,4).
In agreement with previous studies indicating that X-projecting neurons have larger cell bodies than other HVc neuron types (Paton et al., 1985; Alvarez-Buylla et al., 1988), the cell body diameters of X-projecting neurons stained here were larger than those of interneurons and RA-projecting neurons (soma diameters: X-projecting, 23.0 ± 0.9 μm; n = 16 cells; interneurons, 18.7 ± 1.0 μm; n = 10; RA-projecting, 16.8 ± 0.8 μm; n = 9; Xp > Int, p< 0.005; Xp > RAp, p < 0.0001; unpairedt test). Qualitatively, the cell bodies of RA-projecting neurons had more rounded profiles than had those of both X-projecting neurons and interneurons, which tended to be polyhedral or fusiform in shape. Both relay cell types had spinous dendrites, but the dendrites of RA-projecting neurons were more compact (Fig. 2; mean of greatest radial dendritic extent: X-projecting, 168 ± 4 μm;n = 12 cells; RA-projecting, 123 ± 0.9 μm;n = 6 cells; p < 0.001, unpairedt test) and were thinner and more sparsely spined (Figs. 2,4). The RA-projecting neurons stained here strongly resemble the short dendrite class of HVc neurons described in the canary HVc, which were tentatively classified from Golgi-impregnated material as RA-projecting on the basis of their small soma sizes (Nixdorf et al., 1989). In contrast to the two relay cell types, all 11 HVc interneurons had aspinous processes. The processes of 10 of the 11 interneurons had swellings, or varicosities, along their lengths (Figs. 3A,4C). Swellings on some interneuronal processes could be seen in close apposition to other cell bodies, suggestive of axosomatic synapses (Fig. 4D). The one remaining interneuron had smooth dendrites, which were readily distinguished from a thinner axon and its very fine collaterals (Fig. 3B). This cell had the highest average firing-frequency-to-current relationship of any interneuron measured (∼390 Hz/nA), although its action potential width, AHP height, input resistance, and resting potential fell within the range described for the other 10 interneurons. The combination of fast-spiking electrophysiological properties and varicose, aspinous dendrites seen in HVc interneurons is reminiscent of inhibitory interneurons that have been described in other song areas (Spiro et al., 1999) and in the mammalian telencephalon (Azouz et al., 1997;Thomson and Deuchard, 1997).
Auditory responses of identified HVc neurons
In vivo intracellular recordings were made from a total of 172 neurons situated within and around HVc. Many of these cells were held too briefly to characterize their auditory responses or to identify the cell type. Sixty neurons were sorted unambiguously to type relying on both morphological and electrophysiological criteria, and their auditory responses were characterized using forward and reverse playback of the BOS. These identified HVc neurons included 20 X-projecting neurons (all confirmed morphologically), 23 RA-projecting neurons (11 confirmed morphologically), and 17 interneurons (7 confirmed morphologically).
Previous extracellular studies have shown that the BOS is a potent auditory stimulus for many HVc neurons (Sutter and Margoliash, 1994;Volman, 1996). Here, all identified HVc neuron types were found to respond robustly to forward playback of the BOS, firing significantly above the baseline rate during presentation of the stimulus (firing rate during BOS > baseline firing rate, p < 0.05, 17/20 X-projecting neurons; 15/23 RA-projecting neurons; 14/17 interneurons). Representative responses to BOS playback are shown for each cell type in Figure 5, which includes individual raw current-clamp records collected during five consecutive playbacks of the BOS. These individual traces were used to construct both the action potential PSTHs as well as the median-filtered, averaged membrane potential records to quantify a neuron's subthreshold and suprathreshold responses to song playback (see Materials and Methods).
Several features distinguished the song-evoked auditory responses of the three HVc cell types (Fig. 5). First, the majority of BOS-evoked action potential responses in both relay cell classes were phasic, occurring at restricted points during a motif (see Figs. 5, 6,8A,B, top; 10/17 X-projecting neurons and 9/15 RA-projecting neurons that fired significantly to forward BOS did so in a phasic manner). In contrast, almost all interneurons exhibited BOS playback-evoked action potential responses that were sustained throughout the stimulus, or throughout each motif (see Figs. 5, 6; 12/14 interneurons that responded significantly to forward BOS fired in a sustained manner to the stimulus). Second, all RA-projecting neurons encountered here showed robust subthreshold responses to BOS playback (i.e., the BOS-evoked depolarizing response > spontaneous depolarizations before playback;p < 0.05; 23/23 cells). Even the eight RA-projecting neurons that did not show elevated firing to BOS playback (including three cells that evinced neither spontaneous nor stimulus-evoked firing) still exhibited significant subthreshold depolarizing responses to BOS. Third, the subthreshold responses of RA-projecting neurons were always strongly depolarizing from rest (23/23 cells), in contrast to those of X-projecting cells, which could include hyperpolarizing (7/20 cells; p < 0.05) or depolarizing (8/20 cells;p < 0.05) components (see Materials and Methods and below). These recordings demonstrate that all three HVc cell types have auditory responses and raise the possibility that all three types generate BOS-evoked firing via different synaptic mechanisms.
Song selectivity
Birdsongs vary in their temporal as well as spectral organization. Previous extracellular studies have shown that many HVc neurons are sensitive to time-varying acoustical features of the bird's own song, firing selectively for forward over reverse BOS playback (these two stimuli contain the same spectral energy but differ in their temporal organization) (Margoliash, 1986; Lewicki and Arthur, 1996; Volman, 1996; Theunissen and Doupe, 1998). Here, intracellular recordings confirmed that all three HVc neuron types could fire selectively for forward over reverse BOS playback [Fig.6; d′ > 0.7, 18/20 X-projecting neurons; 15/23 RA-projecting neurons; 15/17 interneurons (one interneuron that did not display significant firing-rate elevation to forward BOS did display significant firing-rate suppression to reverse BOS)].
The median-filtered average membrane potential traces in the three cells shown in Figure 6 illustrate that distinct subthreshold membrane potential responses accompanied their song-selective firing. For example, the interneuron and the X-projecting cell underwent depolarizing and hyperpolarizing movements to forward song, with little subthreshold responsiveness to reverse BOS playback. In contrast, the RA-projecting neuron displayed substantial subthreshold depolarizations to reverse as well as forward BOS playback.
Relationships between subthreshold and suprathreshold selectivity
As a first step toward identifying differences in the subthreshold auditory responsiveness of the three HVc neuron types, their subthreshold depolarizations and firing-rate responses to forward and reverse BOS playback were measured. Z-scores were calculated for depolarizing (positive) subthreshold areas (see Materials and Methods) and firing-rate responses to forward and reverse BOS for individual neurons of each HVc neuron class. For individual cells, either subthreshold area or firing-rate z-scores for forward BOS responses were plotted against those same measures of reverse BOS responses. These comparisons revealed that subthreshold versus suprathreshold responsiveness varied systematically according to HVc cell type (Fig.7A). Both X-projecting neurons and interneurons exhibited firing-rate responses biased to forward BOS, without a similar bias in their depolarizing subthreshold responses. In contrast, RA-projecting neurons displayed firing-rate and depolarizing subthreshold responses biased to forward over reverse BOS playback (see Fig. 7 and legend for statistical tests of significance).
The different patterns of subthreshold and suprathreshold responsiveness of the three HVc neuron types to forward and reverse BOS playback suggest that X-projecting neurons and interneurons produce more selective action potential output than would be predicted from their depolarizing subthreshold responses alone. To confirm that this was indeed the case, subthreshold and firing-rate d′statistics (see Materials and Methods) were computed to allow direct comparisons of subthreshold versus firing-rate selectivity within single HVc neurons of a given class. Plotting d′ values for depolarizing subthreshold responses (area) against those for firing rate revealed that selectivity was greater at a firing-rate level than at a subthreshold level for X-projecting neurons and interneurons but lower at a firing-rate than subthreshold level for RA-projecting neurons (Fig. 7B; see figure and legend for relevant statistics). At a population level, mean d′ values for firing rate in the three HVc cell types greatly exceeded 0.7 (i.e., were selective for forward over reverse BOS playback), with the two relay cell classes displaying almost identical absolute levels of firing-rate selectivity (mean d′, X-projecting, 1.35 ± 0.26; RA-projecting, 1.34 ± 0.17; interneuron, 3.05 ± 0.44). In almost every case, selectivity stemmed from an augmentation of firing rate in response to forward BOS either alone or in combination with firing-rate suppression to reverse BOS, rather than only a suppression of firing rate in response to reverse BOS (selective cells with an absolute excitatory bias to forward BOS, 18/18 X-projecting neurons; 15/15 RA-projecting neurons; 14/15 interneurons). In contrast, significantly elevated firing-rate responses to reverse BOS playback were rare in all song-selective HVc neurons (p < 0.05; 0/18 X-projecting neurons; 1/15 RA-projecting neurons; 3/14 interneurons).
Membrane potential effects on BOS-evoked subthreshold and action potential responses
BOS playback could elicit hyperpolarizing responses in some X-projecting neurons (Figs. 5, 6) (see also Lewicki, 1996), while evoking exclusively depolarizing responses in RA-projecting neurons (Figs. 5, 6). To test whether different patterns of song-evoked inhibition and excitation distinguish song-selective responses in the two HVc relay cell types, tonic-positive current was injected via the recording electrode to depolarize the resting membrane potential of individual HVc neurons during song playback (Fig.8). These membrane potential manipulations were used to unmask or augment potentially latent song-evoked inhibition and to determine whether subthreshold depolarizations that occurred at the cell's normal resting potential were excitatory or reversed inhibitory events. BOS-evoked excitation and inhibition were assessed by measuring both subthreshold and firing-rate responses for the two types of relay cells at their normal resting potentials and when the cells were tonically depolarized. To measure subthreshold depolarizations and hyperpolarizations, positive and negative areas between the median-filtered average membrane potential traces and the prestimulus baseline mode potential were estimated before and during playback, and the positive or negative prestimulus area was then subtracted from the stimulus-evoked area of the same sign to yield net subthreshold responses (see Materials and Methods; Fig. 9). In parallel to these subthreshold measurements, firing rates to forward and reverse BOS playback relative to baseline rates were calculated at the two different membrane potentials. Manipulating membrane potential in this way revealed different patterns of subthreshold activity accompanied by firing-rate suppression or augmentation in the two cell types (Figs. 8,9).
Tonic depolarization of X-projecting neurons resulted in periods of pronounced membrane hyperpolarization to forward BOS playback that were paralleled by action potential suppression, suggestive of an inhibitory synaptic mechanism (Fig. 8A). For X-projecting neurons at their normal resting potentials, forward BOS playback could evoke either net depolarizing (4/11 cells; p < 0.05) or net hyperpolarizing (3/11 cells; p < 0.05) subthreshold responses relative to baseline fluctuations in membrane potential (Fig. 9A). Tonically depolarizing these cells (Vnormal = −76.5 ± 2.1 mV vs Vdepolarized = −57.6 ± 3.5 mV;p < 0.001; n = 11 cells) strongly augmented BOS-evoked hyperpolarization, such that all 11 cells displayed only net hyperpolarizing responses to forward BOS playback (Fig. 9A; net hyperpolarizing, p < 0.05; 11/11 cells; net depolarizing, p < 0.05; 0/11 cells; see Fig. 9 legend for mean z-scores for negative and positive areas). Although highly phasic depolarizing responses to forward BOS could still persist during tonic-positive current injection (Fig.8A; ∼2.7 sec), they failed to exceed spontaneous baseline depolarizations in total area. Subthreshold responses of any kind to reverse BOS were rare, and tonic depolarization did not unmask latent inhibition (or excitation) evoked by this stimulus (Fig.8A; population data not shown; net depolarizing,p < 0.05; 2/11 cells at normal Vm and 1/11 cells at positive Vm; net hyperpolarizing, p < 0.05; 2/11 cells at normal Vm and 1/11 cells at positive Vm).
In X-projecting neurons, the subthreshold effects of altering membrane potential were paralleled by changes in BOS-evoked firing (Figs.8A, 9B). At normal resting membrane potentials, significant firing-rate increases above baseline accompanied forward BOS playback in all 11 cells, whereas only one of these cells displayed a significant suprathreshold response to reverse BOS, which consisted of firing-rate suppression. During tonic depolarization, forward BOS playback evoked either firing-rate suppression (BOS-evoked firing rate < baseline firing rate;p < 0.05; 5/11 cells) or no significant suprathreshold response of any kind relative to baseline firing rates (p > 0.05; 6/11 cells), whereas none of these cells displayed significant firing-rate responses to reverse BOS in the tonically depolarized state (p > 0.5; 11/11 cells). Paired comparisons of z-scores for firing rate in response to either forward or reverse BOS playback at the two membrane potentials show that tonic depolarization selectively altered action potential responses to forward, but not reverse, BOS (Fig. 9B). Forward BOS-evoked firing-rate responses changed from net positive (i.e., augmentation) to net negative (i.e., suppression) relative to baseline with depolarization, whereas reverse BOS did not evoke significant suprathreshold responses in either the resting or depolarized state (mean z-scores, ZnormalBOS = 0.972 ± 0.103 vs ZdepolarizedBOS = −0.731 ± 0.336; p < 0.001, paired ttest; ZnormalBOSrev = −0.090 ± 0.118 vs ZdepolarizedBOSrev = 0.121 ± 0.186;p = 0.24, paired t test; n = 11 cells). Taken together, the subthreshold and suprathreshold effects of changing membrane potential are consistent with the idea that X-projecting neurons receive excitatory and inhibitory inputs activated primarily by forward BOS playback but do not receive inputs of either sign that are strongly activated by reverse BOS playback.
Unlike X-projecting neurons, membrane potential manipulations in RA-projecting neurons suggest that they receive predominantly excitatory inputs strongly activated by both forward and reverse BOS playback. At their normal resting membrane potentials, RA-projecting neurons exhibited exclusively depolarizing subthreshold responses to forward as well as reverse BOS playback (Figs. 8B,9A; net depolarizing, p < 0.05; forward BOS, 8/8 cells; reverse BOS, 8/8 cells; net hyperpolarizing,p < 0.05; forward BOS, 0/8 cells; reverse BOS, 0/8 cells). Unlike X-projecting neurons, tonic depolarization (Vnormal = −79.6 ± 2.8 mV vs Vdepolarized = −57.4 ± 3.2 mV;p < 0.0001; n = 8 cells) did not reveal robust forward BOS-evoked inhibition (Figs.8B, 9A; net depolarizing,p < 0.05; forward BOS, 5/8 cells; reverse BOS, 4/8 cells; net hyperpolarizing, p < 0.05; forward BOS, 2/8 cells; reverse BOS, 0/8 cells). Of the two RA-projecting neurons that did display song-evoked hyperpolarization, one simultaneously maintained a net positive subthreshold response, while the other exhibited the highest stimulus-evoked firing rate observed for any RA-projecting neuron, inconsistent with an inhibitory input. One possibility is that, at higher firing frequencies, spike AHPs contributed significantly to the median-filtered traces and increased the average membrane potential negativity.
RA-projecting neurons differed from X-projecting neurons in that tonic depolarization almost always augmented their song-evoked firing, especially to reverse BOS, and never resulted in firing-rate suppression to forward BOS playback (Figs. 8B,9B). Six of seven RA-projecting neurons displayed significant firing-rate responses to forward BOS playback at their normal resting membrane potentials, and five of these cells exhibited higher absolute firing rates to forward BOS in the depolarized state. In addition, a seventh cell that did not display a significant firing-rate response to forward BOS playback at either membrane potential still exhibited a >15-fold increase in response strength after depolarization (Znormal = 0.33;p = 0.33; vs Zdepolarized = 5.38;p = 0.07). As a group, the mean firing-rate response of RA-projecting neurons to forward BOS playback more than doubled after depolarization, although this effect fell short of significance (mean z-scores, ZcontrolBOS = 1.01 ± 0.15; ZdepolarizedBOS = 2.34 ± 0.60;p = 0.12, paired t test; n = 6; two cells did not spike in the depolarized state. In RA-projecting neurons, firing-rate suppression was never observed in response to forward BOS playback, regardless of membrane potential (0/8 cells). Finally, in further contrast to X-projecting neurons, tonic depolarization could unmask latent excitatory responses to reverse BOS in RA-projecting neurons (reverse BOS-evoked firing rate > baseline; p < 0.05; 2/8 cells at the normal membrane potential vs 4/8 cells during tonic depolarization), and the mean firing-rate response strength to reverse BOS playback increased >10-fold with depolarization (ZnormalBOSrev = 0.14 ± 0.19; ZdepolarizedBOSrev = 1.47 ± 0.29; p < 0.01, paired t test). The contrasting effects of membrane polarity on song-evoked responses in X- and RA-projecting neurons suggest that the same auditory stimuli drive different synaptic responses in the two HVc relay cell types; X-projecting neurons receive excitatory and inhibitory inputs activated primarily by forward BOS playback, whereas RA-projecting neurons receive predominantly excitatory inputs activated both by forward and reverse BOS.
The effects of membrane polarity on song selectivity
Membrane potential manipulations differentially affected statistical measures of selectivity in the two relay cell classes, because of contrasting effects of membrane potential on firing-rate responses to forward and reverse BOS playback. Selectivity (d′) values calculated in both the normal and depolarized conditions revealed that the mean firing-rate selectivity for the population of X-projecting neurons, but not RA-projecting neurons, changed as a function of membrane polarity (Fig.10). In X-projecting neurons, manipulating membrane potential altered mean firing-rate selectivity (d′normal = 1.65 ± 0.22 vsd′depolarized = −1.21 ± 0.52;p < 0.005, paired t test), whereas in RA-projecting neurons, mean firing-rate selectivity did not vary with membrane potential (d′normal = 1.47 ± 0.21; d′depolarized = 1.54 ± 0.27;p = 0.85, paired t test; membrane potential values in the two states are those given in the previous section and are in the legend for Fig. 10). On a single-cell level, membrane potential manipulations decreased the d′ values of all X-projecting neurons. At their normal membrane potentials, 11/11 X-projecting neurons had d′ values > 0.7, whereas in the tonically depolarized state, 9/11 cells had d′values < 0.7 (6/9 cells; d′ < −0.7), and the two other cells that maintained d′ values > 0.7 still exhibited lower d′ values when depolarized. In contrast, the six RA-projecting neurons that were (positively) selective at their normal resting membrane potentials maintained selectivity during depolarization (the d′ values decreased for 4 cells and increased for 2 cells, but all were > 0.7), whereas a seventh cell that was initially nonselective exhibited selectivity during depolarization (d′normal = 0.54;d′depolarized = 2.1). At a population level, these different effects on selectivity were caused by the distinctly different effects that depolarization had on firing rate to forward or reverse BOS playback, as noted previously (Figs. 8, 9). For X-projecting neurons, the specific membrane potential-dependent changes to forward but not reverse BOS responses resulted in altered selectivity, whereas for RA-projecting neurons, the parallel changes in forward and reverse BOS responses resulted in equivalent selectivity at the two membrane potentials.
Hyperpolarizing subthreshold and firing-rate responses in X-projecting neurons
In tonically depolarized X-projecting neurons, the pronounced hyperpolarizations evoked by forward but not reverse BOS playback suggest that these cells receive highly song-selective inhibitory input. To explore this idea further, the relationship between subthreshold hyperpolarizing responses and firing-rate selectivity was examined for X-projecting neurons. First, as performed previously for depolarizing responses, subthreshold hyperpolarizing responses to forward and reverse BOS were compared using z-scores (Fig.11A). These comparisons revealed that hyperpolarizing responses were biased toward forward BOS (mean z-score for hyperpolarization, BOS, −0.49 ± 0.21; BOSrev, 0.16 ± 0.12; p < 0.005, pairedt test). Second, direct comparisons of subthreshold hyperpolarizing selectivity (d′ for negative area) and firing-rate selectivity for individual X-projecting neurons were made (Fig. 11B). Unlike the mismatch between depolarizing subthreshold selectivity and firing-rate selectivity (see Fig. 6), hyperpolarizing subthreshold responses were as selective as firing-rate measures in these cells (d′negative area = 1.05 ± 0.28; d′firing rate = 1.35 ± 0.26; p = 0.46, paired t test). Taken together with the effects of membrane potential manipulations on forward (but not reverse) BOS-evoked hyperpolarizations, these results are consistent with the idea that X-projecting neurons receive highly song-selective inhibition.
Within-bird comparisons of the timing of song-evoked responses
HVc interneurons are one possible source of the highly song-selective inhibition seen in X-projecting neurons. To study this possibility further, sequential recordings from two or more different HVc neuron types were obtained within a single bird. Comparisons of the timing of forward BOS-evoked action potential firing and subthreshold responsiveness in these different cells were made by directly superimposing PSTHs and/or membrane potential traces collected from within birds (Fig. 12). In addition, cross-correlation analyses were performed on cell pairs to examine during BOS playback the relative spike timing of RA-projecting neurons with that of either X-projecting neurons or interneurons and to compare interneuron spike timing with X-projecting neuronal membrane polarity (Fig. 13; Materials and Methods). These comparisons reveal that the timing of song-evoked responses varies systematically according to cell class, with the closest temporal coincidence between interneuron firing and X-projecting neuronal hyperpolarization. In six animals, recordings were obtained from at least one interneuron and one X-projecting neuron. In every case, song-evoked interneuron firing closely coincided with when hyperpolarization and firing-rate suppression occurred in the X-projecting neuron (examples in Fig. 12). Cross-correlation analysis of interneuron firing and X-projecting neuronal membrane hyperpolarization during playback confirmed a maximum correlation at zero offset (Fig. 13; n = 6 pairs in 6 birds). Although recordings obtained from both interneurons and RA-projecting neurons revealed that song-evoked firing could occur near the same times (Fig.12B, compare bottom, topPSTHs), cross-correlating their spike times revealed a slight offset, with RA-projecting neuronal firing leading that of interneurons by ∼25 msec (the resolution of the analysis; see Fig. 13;n = 4 pairs in 4 animals). Finally, recordings from the two relay cell types in single birds showed that they fired reciprocally during BOS playback and that the BOS-evoked changes in membrane potential described an antiparallel motion, with the RA-projecting neuron exhibiting a prolonged depolarization during the same period when the X-projecting neuron was sustaining a net hyperpolarization (Fig. 12B). Cross-correlation analysis confirmed a large temporal offset in song-evoked firing between X-projecting and RA-projecting neurons, with a maximum of ∼75 msec (Fig. 13; n = 4 pairs in 4 animals). These analyses indicate that, during forward BOS playback, interneuron firing closely coincides with X-projecting neuronal hyperpolarization, RA-projecting neuron firing and interneuron firing are separated by short delays, and RA- and X-projecting neurons fire reciprocally at relatively long time intervals.
DISCUSSION
All HVc neuron types fire selectively for forward over reverse BOS playback. Relay cells innervating either area X or nucleus RA generate remarkably similar patterns of selective firing via different subthreshold mechanisms, likely because of segregated inhibitory input from interneurons onto X-projecting cells. Ultimately, factors that modulate the relay cells' membrane potentials may enable them to transmit distinct song representations, providing the potential for auditory comparisons used in song learning, maintenance, and perception.
Different morphological classes of HVc neurons examined in vivo display distinct intrinsic electrophysiological properties, extending previous in vitro studies (Dutar et al., 1998;Kubota and Taniguchi, 1998). One difference in vivo was that all cell types had lower input resistances than seen in brain slices, likely because of higher spontaneous synaptic activity in the intact brain. In vivo, as in vitro, positive currents drove interneurons to fire “fast” action potentials, often at high rates, while causing X-projecting neurons to fire regular action potential trains, although with a lower current–frequency relationship than for interneurons. Although RA-projecting neurons could fire repetitively in vivo, their DC-evoked spiking was characterized by an unusually prolonged and variable spike onset. Their firing remained intermittent and highly variable even with large currents (∼1 nA), perhaps because of spontaneous synaptic activity interacting with injected currents. If these RA-projecting neurons are the same that become highly active during singing, then they undergo dramatic state-dependent modulation.
All three HVc neuron types are highly song selective. Previous intracellular studies concluded that X- but not RA-projecting neurons had auditory responses and left the auditory properties of HVc interneurons undetermined (Katz and Gurney, 1981; Lewicki, 1996). Here, interneurons fired tonically and selectively to BOS, whereas both relay cells fired more phasically. Such tonic responses, detected in previous intracellular studies (Lewicki, 1996), were not linked with a cell type; the present study shows they arise from HVc interneurons with fast spikes and varicose dendrites, two features of inhibitory interneurons in RA and in other systems (Azouz et al., 1997; Thomson and Deuchard, 1997; Spiro et al., 1999). If these HVc interneurons are also inhibitory, then they likely provide song-selective inhibition to X-projecting neurons. Previous reports classifying RA-projecting neurons as nonauditory (Katz and Gurney, 1981; Lewicki, 1996) are surprising because here they usually fired song-selectively and they always displayed robust song-selective subthreshold responses. Perhaps previous studies failed to detect auditory responses because song stimuli were not used (Katz and Gurney, 1981) or because these cells' relatively refractory behavior caused them to be disregarded.
The existence of song-responsive RA-projecting neurons, like those described here, has been inferred by reversibly inactivating HVc, which transiently abolishes RA's auditory responsiveness (Doupe and Konishi, 1991). Furthermore RA's auditory responses persist even when LMAN (RA's other likely auditory afferent) is lesioned (Doupe and Konishi, 1991; Vicario and Yohay, 1993). A confound of understanding the genesis of RA's auditory responses in control birds is that inactivating HVc also silences X-projecting neurons, ultimately abolishing auditory responses in LMAN. Therefore, detecting auditory responses in RA-projecting neurons reveals a direct path from HVc to RA by which auditory information could influence vocalization. A parsimonious model is one in which the same RA-projecting HVc neurons that convey elaborate premotor patterns for singing (McCasland, 1987;Yu and Margoliash, 1996) also respond selectively to the resulting vocalizations, providing a cellular substrate for auditory feedback. Separate auditory and vocal projections from HVc to RA may be less likely to exist, because here RA-projecting neurons always responded to the BOS. In either case, HVc is the source of two song-selective auditory pathways.
Song-evoked subthreshold responses partly reflect the firing of neurons presynaptic to the impaled cell and thus can illuminate the origins of song selectivity. One caveat is that signal averaging measuresnet excitation and inhibition, and stronger elements may mask weaker components. Another qualification is that excitatory and inhibitory contributions may differ in states other than those studied here, perhaps via voltage-dependent effects (e.g., NMDA receptor-mediated currents). Nonetheless, one striking feature of RA-projecting neurons is that their song-evoked subthreshold responses appear exclusively (if latently) excitatory and, although biased toward forward song, can be quite pronounced to reverse BOS. The elevated and sustained firing in tonically depolarized RA-projecting neurons evoked by either stimulus (Fig. 8B, top) suggests that these cells receive little or no song-evoked inhibition. Instead, such sustained depolarizations indicate that cells presynaptic to RA-projecting neurons are excitatory and fire throughout forward and reverse BOS playback, although with a forward song bias. Most neurons in NIf, an HVc afferent, display sustained, song-selective firing (Janata and Margoliash, 1999), but the nature of their connections to different HVc neurons is unknown. In addition, a minority of neurons in the primary avian auditory telencephalon (field L) fire selectively to BOS (Lewicki and Arthur, 1996; Janata and Margoliash, 1999), but it is unknown whether they innervate HVc. A necessary step will be to determine the auditory nature and synaptic sign of extrinsic inputs onto various HVc neurons.
Unlike RA-projecting neurons, X-projecting neurons exhibited pronounced forward BOS-evoked hyperpolarizations and lack reverse BOS responses altogether. The contrasting song-evoked inhibition in the two relay cell types suggests that X-projecting neurons are privileged targets of HVc interneurons. Indeed, within birds, elevated interneuron firing closely coincides with X-projecting neuronal hyperpolarization during forward BOS playback. In the songbird HVc, BOS-evoked hyperpolarizations in a subset of X-projecting neurons may be critical to generating their supralinear sensitivity to syllable sequences (Lewicki and Konishi, 1995; Lewicki, 1996). Because such combination-sensitive neurons are rare [∼15–25% of HVc neurons (Margoliash and Fortune, 1992; Lewicki and Arthur, 1996)], yet BOS-evoked hyperpolarizations were detected in all X-projecting neurons (at least when sufficiently depolarized), song-evoked inhibition alone may not suffice for this form of temporal sensitivity. Finally, the absence of reverse BOS responses in X-projecting cells contrasts directly with RA-projecting neurons, raising the possibility that the two relay cell types receive distinct extrinsic auditory inputs and/or form a serial or circular hierarchy in HVc.
The function of BOS-evoked inhibition on X-projecting neurons could be to sculpt excitatory responses to BOS, generate more precise spike timing (Mainen and Sejnowski, 1995), elevate signal-to-noise levels (Lewicki, 1996), or facilitate excitatory responses by either priming the postsynaptic membrane (e.g., by deinactivating voltage-gated conductances) or synchronizing the firing of excitatory presynaptic inputs. If X- and RA-projecting neurons receive similar song-selective excitatory inputs, then inhibition in X-projecting neurons could partially suppress subthreshold depolarizing responses to forward BOS, causing them to appear nonselective. However, postsynaptic inhibition is unlikely to account for the lack of reverse BOS responses in X-projecting neurons, because hyperpolarizing responses to this stimulus were not detected even when cells were held tonically depolarized. Support for inhibitory maintenance of signal-to-noise levels and against postsynaptic priming comes from finding that artificially depolarizing X-projecting neurons suppresses their mean firing below baseline during forward BOS playback, even though highly phasic firing persists (Fig. 8A; ∼2.7 sec). Distinguishing between these various roles will require selectively blocking inhibition and assessing any effects on BOS-evoked responses in X-projecting neurons. Finally, these results underscore that highly similar action potential responses can arise from distinct subthreshold processes, even within a single brain area; both RA- and X-projecting neurons fired phasically and with almost identical song selectivity, yet only X-projecting neurons displayed BOS-evoked hyperpolarizations.
Chronic recordings in nonsinging, awake zebra finches show that BOS playback evokes responses in HVc (Dave et al., 1998; but see Schmidt and Konishi, 1998), whereas after singing, HVc's auditory responsiveness is transiently suppressed (McCasland and Konishi, 1981). Because of HVc's functional heterogeneity, it will be necessary to distinguish whether auditory suppression also occurs during singing and in all HVc cell types and whether suppression varies in different developmental or attentional states, any of which could profoundly impact auditory–vocal interactions. One speculation is that the same circuitry used for singing is co-opted for the auditory processing of song. Specifically RA-projecting neurons could act via local interneurons to drive feedforward inhibition onto X-projecting cells both during singing and passive listening. In the singing state, vocal premotor-coupled inhibition may define a time window for auditory feedback, suppressing auditory responses when they fail to coincide closely with the vocal efference. During passive listening, auditory-evoked activity in RA-projecting neurons may again drive feedforward inhibition onto X-projecting neurons, shaping but not suppressing their responses. This model predicts that NIf neurons, which are likely to provide auditory and premotor drive to HVc, innervate RA-projecting neurons, which in turn excite interneurons providing segregated input to X-projecting neurons.
Dual auditory representations of song emerge from HVc, potentially facilitating comparisons important to song learning and maintenance. The subthreshold events underlying BOS-evoked firing in the two relay cell types enable them to transmit different auditory representations of song, at least under certain conditions. When sufficiently depolarized, RA-projecting neurons fire continuously to forward BOS, and even to reverse BOS, normally a weak stimulus for HVc neurons (Margoliash, 1986; Lewicki and Arthur, 1996; Volman, 1996; Theunissen and Doupe, 1998). Therefore, the HVc-to-RA pathway may be able to convey information about song quality even when notes are distorted or sung out of sequence, as occurs in juveniles during song learning and in adults of those species that seasonally reexpress vocal plasticity. In contrast, inhibitory interactions may restrict X-projecting neuronal firing to certain times during the song, perhaps after distinct note sequences. Coincident activation of the two pathways, perhaps when vocalizations include these specific sequences, could then reinforce motor patterns in HVc and/or in RA, the direct and indirect target, respectively, of RA- and X-projecting cells. Furthermore X- and not RA-projecting neurons are likely sites for auditory refinement; varying the membrane potential of single X-projecting neurons altered their firing-rate selectivity, and as a whole these neurons displayed a marked increase in firing rate over subthreshold depolarizing selectivity. Because song selectivity arises via auditory experience (Volman, 1993), synapses onto X-projecting neurons may be especially sensitive to this experience. Future work can gauge the functional significance of multiple song representations, how HVc's local circuits refine song selectivity relative to HVc's extrinsic auditory afferents, and the sources of these extrinsic afferents.
Footnotes
This work was supported by National Institutes of Health Grant R01 DC 02524 and by grants from the McKnight, Sloan, and Klingenstein Foundations. I am grateful for helpful discussions and comments on this manuscript provided by Dr. David Fitzpatrick, Dr. John Spiro, Merri Rosen, and other members of the Mooney lab, as well as two anonymous reviewers. Special thanks to Stacey James for expert histology, Jim Adelman for data analysis, and Merri Rosen for expert computer programming.
Correspondence should be addressed to Dr. Richard Mooney, Department of Neurobiology, Box 3209, Duke University Medical Center, Durham, NC 27710. E-mail: mooney@neuro.duke.edu.
REFERENCES
- 1.Alvarez-Buylla A, Theelen M, Nottebohm F. Birth of projection neurons in the higher vocal center of the canary forebrain before, during, and after song learning. Proc Natl Acad Sci USA. 1988;85:8722–8726. doi: 10.1073/pnas.85.22.8722. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Azouz R, Gray C, Nowak L, McCormick D. Physiological properties of inhibitory interneurons in cat striate cortex. Cereb Cortex. 1997;7:534–545. doi: 10.1093/cercor/7.6.534. [DOI] [PubMed] [Google Scholar]
- 3.Benton S, Cardin JA, Devoogd TJ. Lucifer yellow filling of area X-projecting neurons in the high vocal center of female canaries. Brain Res. 1998;799:138–147. doi: 10.1016/s0006-8993(98)00485-5. [DOI] [PubMed] [Google Scholar]
- 4.Bottjer SW, Miesner EA, Arnold AP. Forebrain lesions disrupt development but not maintenance of song in passerine birds. Science. 1984;224:901–903. doi: 10.1126/science.6719123. [DOI] [PubMed] [Google Scholar]
- 5.Brainard MS, Doupe AJ. Interruption of a basal ganglia-forebrain circuit prevents plasticity of learned vocalizations. Nature. 2000;404:762–766. doi: 10.1038/35008083. [DOI] [PubMed] [Google Scholar]
- 6.Dave AS, Yu AC, Margoliash D. Behavioral state modulation of auditory activity in a vocal motor system. Science. 1998;282:2250–2254. doi: 10.1126/science.282.5397.2250. [DOI] [PubMed] [Google Scholar]
- 7.Doupe A, Kuhl P. Birdsong and human speech: common themes and mechanisms. Annu Rev Neurosci. 1999;22:567–631. doi: 10.1146/annurev.neuro.22.1.567. [DOI] [PubMed] [Google Scholar]
- 8.Doupe AJ, Konishi M. Song-selective auditory circuits in the vocal control system of the zebra finch. Proc Natl Acad Sci USA. 1991;88:11339–11343. doi: 10.1073/pnas.88.24.11339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Dutar P, Vu HM, Perkel DJ. Multiple cell types distinguished by physiological, pharmacological, and anatomic properties in nucleus Hvc of the adult zebra finch. J Neurophysiol. 1998;80:1828–1838. doi: 10.1152/jn.1998.80.4.1828. [DOI] [PubMed] [Google Scholar]
- 10.Fortune ES, Margoliash D. Parallel pathways and convergence onto HVc and adjacent neostriatum of adult zebra finches (Taeniopygia guttata). J Comp Neurol. 1995;360:413–441. doi: 10.1002/cne.903600305. [DOI] [PubMed] [Google Scholar]
- 11.Foster EF, Bottjer SW. Axonal connections of the high vocal center and surrounding cortical regions in juvenile and adult male zebra finches. J Comp Neurol. 1998;397:118–138. [PubMed] [Google Scholar]
- 12.Green D, Swets J. Signal detection theory and psychophysics. Wiley; New York: 1966. [Google Scholar]
- 13.Jagadeesh B, Wheat HS, Kontsevich LL, Tyler CW, Ferster D. Direction selectivity of synaptic potentials in simple cells of the cat visual cortex. J Neurophysiol. 1997;78:2772–2789. doi: 10.1152/jn.1997.78.5.2772. [DOI] [PubMed] [Google Scholar]
- 14.Janata P, Margoliash D. Gradual emergence of song selectivity in sensorimotor structures of the male zebra finch song system. J Neurosci. 1999;19:5108–5118. doi: 10.1523/JNEUROSCI.19-12-05108.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Katz LC, Gurney ME. Auditory responses in the zebra finch's motor system for song. Brain Res. 1981;221:192–197. doi: 10.1016/0006-8993(81)91073-8. [DOI] [PubMed] [Google Scholar]
- 16.Kittelberger J, Mooney R. Lesions of an avian forebrain nucleus that disrupt song development alter synaptic connectivity and transmission in the vocal premotor pathway. J Neurosci. 1999;19:9385–9398. doi: 10.1523/JNEUROSCI.19-21-09385.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Kubota M, Taniguchi I. Electrophysiological characteristics of classes of neuron in the HVc of the zebra finch. J Neurophysiol. 1998;80:914–923. doi: 10.1152/jn.1998.80.2.914. [DOI] [PubMed] [Google Scholar]
- 18.Lewicki MS. Intracellular characterization of song-specific neurons in the zebra finch auditory forebrain. J Neurosci. 1996;16:5855–5863. [PubMed] [Google Scholar]
- 19.Lewicki MS, Arthur BJ. Hierarchical organization of auditory temporal context sensitivity. J Neurosci. 1996;16:6987–6998. doi: 10.1523/JNEUROSCI.16-21-06987.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Lewicki MS, Konishi M. Mechanisms underlying the sensitivity of songbird forebrain neurons to temporal order. Proc Natl Acad Sci USA. 1995;92:5582–5586. doi: 10.1073/pnas.92.12.5582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Mainen ZF, Sejnowski TJ. Reliability of spike timing in neocortical neurons. Science. 1995;268:1503–1506. doi: 10.1126/science.7770778. [DOI] [PubMed] [Google Scholar]
- 22.Margoliash D. Acoustic parameters underlying the responses of song-specific neurons in the white-crowned sparrow. J Neurosci. 1983;3:1039–1057. doi: 10.1523/JNEUROSCI.03-05-01039.1983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Margoliash D. Preference for autogenous song by auditory neurons in a song system nucleus of the white-crowned sparrow. J Neurosci. 1986;6:1643–1661. doi: 10.1523/JNEUROSCI.06-06-01643.1986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Margoliash D, Fortune ES. Temporal and harmonic combination-sensitive neurons in the zebra finch's HVc. J Neurosci. 1992;12:4309–4326. doi: 10.1523/JNEUROSCI.12-11-04309.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.McCasland JS. Neuronal control of birdsong production. J Neurosci. 1987;7:23–39. doi: 10.1523/JNEUROSCI.07-01-00023.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.McCasland JS, Konishi M. Interaction between auditory and motor activities in an avian song control nucleus. Proc Natl Acad Sci USA. 1981;78:7815–7819. doi: 10.1073/pnas.78.12.7815. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Nixdorf BE, Davis SS, DeVoogd TJ. Morphology of Golgi-impregnated neurons in hyperstriatum ventralis, pares caudalis in adult male and female canaries. J Comp Neurol. 1989;284:337–349. doi: 10.1002/cne.902840302. [DOI] [PubMed] [Google Scholar]
- 28.Nottebohm F, Stokes TM, Leonard CM. Central control of song in the canary, Serinus canarius. J Comp Neurol. 1976;165:457–486. doi: 10.1002/cne.901650405. [DOI] [PubMed] [Google Scholar]
- 29.Paton JA, O'Loughlin BE, Nottebohm F. Cells born in adult canary forebrain are local interneurons. J Neurosci. 1985;5:3088–3093. doi: 10.1523/JNEUROSCI.05-11-03088.1985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Rosen M, Mooney R. Intrinsic and extrinsic contributions to auditory selectivity in a song nucleus critical for vocal plasticity. J Neurosci. 2000;20:XXXX–XXXX. doi: 10.1523/JNEUROSCI.20-14-05437.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Scharff C, Nottebohm F. A comparative study of the behavioral deficits following lesions of various parts of the zebra finch song system: implications for vocal learning. J Neurosci. 1991;11:2896–2913. doi: 10.1523/JNEUROSCI.11-09-02896.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Scharff C, Nottebohm F, Cynx J. Conspecific and heterospecific song discrimination in male zebra finches with lesions in the anterior forebrain pathway. J Neurobiol. 1998;36:81–90. [PubMed] [Google Scholar]
- 33.Schmidt MF, Konishi M. Gating of auditory responses in the vocal control system of awake songbirds. Nat Neurosci. 1998;1:513–518. doi: 10.1038/2232. [DOI] [PubMed] [Google Scholar]
- 34.Solis MM, Doupe AJ. Anterior forebrain neurons develop selectivity by an intermediate stage of birdsong learning. J Neurosci. 1997;17:6447–6462. doi: 10.1523/JNEUROSCI.17-16-06447.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Spiro J, Dalva M, Mooney R. Long-range inhibition within the zebra finch song nucleus RA can coordinate the firing of multiple projection neurons. J Neurophysiol. 1999;81:3007–3020. doi: 10.1152/jn.1999.81.6.3007. [DOI] [PubMed] [Google Scholar]
- 36.Sutter ML, Margoliash D. Global synchronous response to autogenous song in zebra finch HVc. J Neurophysiol. 1994;72:2105–2123. doi: 10.1152/jn.1994.72.5.2105. [DOI] [PubMed] [Google Scholar]
- 37.Theunissen FE, Doupe AJ. Temporal and spectral sensitivity of complex auditory neurons in the nucleus HVc of male zebra finches. J Neurosci. 1998;18:3786–3802. doi: 10.1523/JNEUROSCI.18-10-03786.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Thomson A, Deuchard J. Synaptic interactions in neocortical local circuits: dual intracellular recordings in vitro. Cereb Cortex. 1997;7:510–522. doi: 10.1093/cercor/7.6.510. [DOI] [PubMed] [Google Scholar]
- 39.Vates GE, Broome BM, Mello CV, Nottebohm F. Auditory pathways of caudal telencephalon and their relation to the song system of adult male zebra finches. J Comp Neurol. 1996;366:613–642. doi: 10.1002/(SICI)1096-9861(19960318)366:4<613::AID-CNE5>3.0.CO;2-7. [DOI] [PubMed] [Google Scholar]
- 40.Vicario DS. A new brain stem pathway for vocal control in the zebra finch song system. NeuroReport. 1993;4:983–986. doi: 10.1097/00001756-199307000-00037. [DOI] [PubMed] [Google Scholar]
- 41.Vicario DS, Yohay KH. Song-selective auditory input to a forebrain vocal control nucleus in the zebra finch. J Neurobiol. 1993;24:488–505. doi: 10.1002/neu.480240407. [DOI] [PubMed] [Google Scholar]
- 42.Volman SF. Development of neural selectivity for birdsong during vocal learning. J Neurosci. 1993;13:4737–4747. doi: 10.1523/JNEUROSCI.13-11-04737.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Volman SF. Quantitative assessment of song-selectivity in the zebra finch “high vocal center.”. J Comp Physiol [A] 1996;178:849–862. doi: 10.1007/BF00225832. [DOI] [PubMed] [Google Scholar]
- 44.Wild JM. Descending projections of the songbird nucleus robustus archistriatalis. J Comp Neurol. 1993;338:225–241. doi: 10.1002/cne.903380207. [DOI] [PubMed] [Google Scholar]
- 45.Yu AC, Margoliash D. Temporal hierarchical control of singing in birds. Science. 1996;273:1871–1875. doi: 10.1126/science.273.5283.1871. [DOI] [PubMed] [Google Scholar]