Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2012 Mar 28.
Published in final edited form as: J Neurosci. 2011 Sep 28;31(39):13936–13948. doi: 10.1523/JNEUROSCI.6809-10.2011

Short bouts of vocalization induce long lasting fast gamma oscillations in a sensorimotor nucleus

Brian Lewandowski 1, Marc Schmidt 1
PMCID: PMC3196533  NIHMSID: NIHMS328330  PMID: 21957255

Abstract

Performance evaluation is a critical feature of motor learning. In the vocal system, it requires the integration of auditory feedback signals with vocal motor commands. The network activity that supports such integration is unknown, but it has been proposed that vocal performance evaluation occurs offline. Recording from NIf, a sensorimotor structure in the avian song system, we show that short bouts of singing in adult male zebra finches (Taeniopygia guttata) induce persistent increases in firing activity and coherent oscillations in the fast gamma range (90–150 Hz). Single units are strongly phase-locked to these oscillations, which can last up to 30 s, often outlasting vocal activity by an order of magnitude. In other systems, oscillations often are triggered by events or behavioral tasks but rarely outlast the event that triggered them by more than 1 second. The present observations are the longest reported gamma oscillations triggered by an isolated behavioral event. In mammals, gamma oscillations have been associated with memory consolidation and are hypothesized to facilitate communication between brain regions. We suggest that the timing and persistent nature of NIf’s fast gamma oscillations make them well suited to facilitate the integration of auditory and vocal motor traces associated with vocal performance evaluation.

INTRODUCTION

Synchronized network oscillations in the gamma (30 – 80 Hz) and fast gamma (90 – 150 Hz) range are observed in many different areas of the brain (Schoffelen et al., 2005; Siegel et al., 2008) and appear to be an emergent property of neural circuits (Fries, 2009). These oscillations have been observed in both vertebrate and invertebrate species (Neuenschwander et al., 1996; Engel and Singer, 2001) and are associated with a wide range of processes including sensory processing (Cardin et al., 2009), motor preparation (Schoffelen et al., 2005; Uhlhaas et al., 2009), selective attention (Fries et al., 2001; Buehlmann and Deco, 2008), and memory consolidation (Fell et al., 2001; Axmacher et al., 2006). Despite the rich literature on gamma oscillations in mammals, relatively little is known regarding their role and properties in birds (Neuenschwander et al., 1996; Low et al., 2008).

In songbirds, auditory feedback is critically necessary for song learning and maintenance (Leonardo and Konishi, 1999; Lombardino and Nottebohm, 2000). In order to shape and maintain vocal output so that it matches a stored template of song, vocal motor patterns are evaluated through comparisons with auditory feedback. Neurons have been observed in auditory areas that are sensitive to auditory feedback perturbations during singing (Keller and Hahnloser, 2009). However, similar sensitivity to perturbations of auditory feedback has not been observed in vocal motor nuclei during singing (Konishi, 2004; Kozhevnikov and Fee, 2007; but see Sakata and Brainard, 2008). This has prompted some investigators to suggest that consolidation of premotor vocal commands with auditory feedback may occur offline (Margoliash and Schmidt, 2010). The most likely timeframes for such offline performance evaluation would be during sleep, where replay of premotor patterns is known to occur (Dave and Margoliash, 2000; Rauske et al., 2010), and shortly after the offset of vocalization, when auditory and motor traces of song would be most reliably encoded by short-term neural plasticity. We hypothesized that such offline integration of sensorimotor traces would require persistent changes in network activity following singing, and that these persistent changes may involve oscillatory activity, which has been implicated in memory consolidation and the facilitation of communication between connected brain areas (Buzsaki and Draguhn, 2004; Dan and Poo, 2004; Fries, 2005; Fries et al., 2008).

To test this hypothesis, we recorded both unit activity and local field potentials (LFPs) from the nucleus interfacialis of the nidopallium (NIf) in singing adult male zebra finches. NIf is part of the song system (Figure 1a), a discrete network of interconnected nuclei that is necessary for the learning, maintenance, and production of song. NIf is also part of a sensorimotor loop, along with nucleus HVC (used as a proper name) and nucleus Avalanche (Av), that links the song system and the auditory forebrain (Akutagawa and Konishi, 2010), making it ideally situated to facilitate offline sensorimotor integration. We found that singing caused dramatic and lasting changes in firing rate and gamma oscillation power in NIf. We propose that these persistent changes in NIf’s network activity following singing could facilitate the integration of auditory and vocal motor traces associated with vocal performance evaluation.

Figure 1. Connectivity and characteristic premotor activity distinguish NIf from surrounding auditory nuclei.

Figure 1

(A) Schematic of the main auditory and vocal motor pathways in the avian brain. Auditory information reaches NIf from the caudal mesopallium (CM), a secondary auditory area (Vates et al., 1996), and nucleus Avalanche (Av), which is part of the CM. HVC and the robust nucleus of the arcopallium (RA) make up the telencephalic portion of the descending motor pathway which controls vocal output through projections onto brainstem respiratory nuclei and the tracheosyringeal portion of the hypoglossal nucleus (nXIIts). Song pattern generation and interhemispheric coordination are believed to involve a recurrent loop that includes HVC, RA, nuclei of the respiratory complex, and thalamic nucleus uvaeformis (Uva), which also projects to NIf and Av. Av is included with the darker shaded song system nuclei to highlight its strong interconnectivity with NIf and HVC, and the projection it receives from Uva. Abbreviations: MLd: dorsal lateral nucleus of the mesencephalon; Ov: nucleus ovoidalis.

(B) Exemplar site demonstrating NIf’s characteristic premotor activity patterns. Top, spectrogram and amplitude waveform of a song with four motifs. Bottom, expanded time views of multiunit activity around the first introductory note and the final motif of song. Plotted below are peri-stimulus time histograms (PSTH) showing average firing rate centered on the onset of the first introductory note (left) and the offset of the last motif (right) of song. The time lag from premotor onset to vocal onset is highlighted in green. Premotor activity ceases shortly before the offset of vocal production (highlighted in dark blue) and remains suppressed past the end of vocal output (highlighted in light blue). Red line represents baseline firing rate.

EXPERIMENTAL PROCEDURES

Animals

Adult male and female zebra finches (Taeniopygia guttata) ranging in age from 120 to 500 days of age were obtained from a local supplier (Canary Bird Farm). Subjects were administered antibiotics and monitored for a week before becoming eligible for experimental use. Birds were housed under 12 h light/dark conditions and provided with food and water ad libitum. All procedures described here were approved by an institutional animal care and use committee at the University of Pennsylvania.

Acute Surgery

A custom built motorized microdrive (RP Metrix, Princeton, NJ) wired with 0.8–1.2 MΩ tungsten microelectrodes (FHC, Bowdoin, ME) was implanted in the right NIf using the following procedure. Adult male zebra finches were anesthetized using 0.02 mL of ketamine/xylazine (40 mg/kg ketamine and 8 mg/kg xylazine; Phoenix Pharmaceuticals, Belmont, CA) and 0.06–0.1 mL of diazepam (7.5 mg/kg; Abbott Laboratories, Abbott Park, IL). NIf was located using previously established stereotaxic coordinates, and identified based on a combination of its characteristic neural activity patterns and auditory selectivity for BOS. In addition, once the subject had recovered from surgery, premotor activity during vocalization and the state-dependence of auditory response properties in NIf (Cardin and Schmidt, 2004b) were used as additional criteria for the identification of NIf. These criteria have been used successfully to identify NIf in previous studies from our lab (Cardin and Schmidt, 2004b; Cardin et al., 2005). Before implanting, electrodes were retracted to 200 μm above NIf to reduce gliosis. The electrodes and exposed brain were surrounded with GelFoam (Pfizer, New York, NY) and the microdrive was secured to the skull using dental cement. A silver wire implanted under the skull acted as ground. Subjects were allowed to recover overnight and then transferred to a modified holding cage where a tether containing a small operational amplifier (Texas Instruments, Dallas, TX) connected the implanted microdrive to a swivel connector on the roof of the cage. An elastic thread built into the tether helped to support the weight of the implant. Subjects remained tethered both during and between experiments.

Chronic Recordings

During experimental sessions subjects were transferred to a recording chamber and tethered to a mercury commutator. Neural signals were collected using a Bioamp headstage and DB4 module (Bioamp headstage, DB4, and FT5 units: Tucker-Davis Technologies, Alachua, FL). Signals were amplified and filtered between 1 Hz and 7 kHz using the DB4 module and a four channel amplifier (Model 440, Brownlee, San Jose, CA). The signal was then digitized and recorded using a Power 1401 mark II board and Spike2 software (Cambridge Electronic Design, Cambridge, England). Neural recordings were collected while the subject vocalized in response to conspecifics housed nearby and playback of conspecific vocalizations through a hidden speaker.

Experimental Conditions

The goal of this experiment was to record neural activity in NIf during natural vocal behavior. To elicit the necessary amount of vocal communication behavior in the 1–2 hours that we could typically hold a multiunit site in a moving subject, we adopted an adaptive experimental protocol tailored for each subject. In the simplest condition, a subject was housed alone and responded to playback of BOS, REVBOS, and conspecific vocalizations played at approximately 60 dB SPL measured at the location of the subject. Playback alone was rarely enough to elicit the quantity and variety of vocalizations we required. We progressively enriched a subject’s social environment through the addition of both seen and unseen male and female zebra finches until the subject was vocalizing at a high level. The subject was able to hear and be heard by these conspecifics. For most subjects the presence of a familiar female visible in a nearby cage combined with playback was enough to induce strong vocal activity. Our most laconic subject required playback with one female visible and a second female housed out of sight with a highly vocal male. Despite our best efforts to actively shape the experimental environment to encourage vocal behavior, we still were unable to use data from many sites. These included sites where a subject did not vocalize enough and sites where the subject vocalized too much leaving very few periods of silence between vocalizations during which we could measure dynamic state changes. During periods of vocal inactivity, subjects were regularly inspected to ensure that they remained awake.

Histology

In many cases, electrode sites were confirmed by creating electrolytic lesions using an isolated pulse stimulator (model 2100; AM-Systems). Subjects were then euthanized using 0.2 mL of 50 mg/mL Nembutal (Abbott Laboratories, Chicago, IL). Their brains were removed and fixed in 4% formaldehyde and cryoprotected using 30% sucrose. Tissue was Nissl stained in order to visualize the lesion sites.

Data Analysis

Data were analyzed using Spike2 and Matlab (The MathWorks, Natick, MA) routines written by B. C. Lewandowski unless otherwise noted. Local field potentials (LFPs) were isolated by offline filtering of neural data between 1 and 200 Hz. Multiunit activity (MUA) was isolated by highpass filtering neural data at or above 250 Hz (filtering above 250 Hz was occasionally used to remove movement artifact). Multiunit spike times were determined by setting a threshold visually to a point high enough to be clearly above background noise, but low enough to capture all distinct spike events as performed in our previous studies in NIf (Cardin and Schmidt, 2004a). Attempts to use a threshold based upon standard deviations above background noise were unreliable because of the difficulty of establishing baseline periods to measure background noise due to the high spontaneous multiunit spike rate in NIf,. All LFP and MUA data used for analysis were visually inspected to ensure they were free of movement artifacts.

Measuring vocalization onsets and offsets

Microphone recordings were bandpass filtered (3–7 kHz) to eliminate ambient noise while retaining vocalization related information. The cutoff frequencies of the bandpass filter were chosen to optimize the signal-to-noise ratio in order to enhance the accuracy of onset and offset measurements. Vocal onsets were determined using the root mean square (RMS, 2 ms window) of the filtered microphone trace and a threshold set above background noise. To account for mild reverberation in the recording room, Vocal offsets were defined as the point in the microphone RMS trace when sound amplitude decreased the most rapidly. Operationally these inflection points were identified as minima in the average slope (10 ms window) of the RMS (1 ms window) of the filtered audio recording.

Defining the measurement windows

To determine the effect of vocal behavior on NIf’s network activity, spike rate and fast gamma power changes were measured in three non-overlapping periods: Pre-Vocal, Post-Vocal, and Baseline.

Pre-Vocal

Defined as a period 1–3 s prior to vocal onset, significantly outside the range of premotor activity (see Results). To ensure that changes observed in the Pre-Vocal period were not actually Post-Vocal changes that had not fully decayed, vocal onsets had to be preceded by at least 70 s of vocal inactivity.

Post-Vocal

Defined as a window of 2–4 s after vocal offset. This window captures stable peak spike rate and oscillation values. Any vocalization that is followed by at least 4.5 s of vocal inactivity is considered for Post-Vocal analysis. Each site had to have at least 7 song offsets that fit the criteria for the Post-Vocal period to be included in our final analysis.

Baseline

Defined as a period of vocal inactivity starting 90 s after a vocalization and lasting until 30 s before the next vocalization.

Premotor activity

Premotor onset

Average MUA was calculated in 10 ms windows (90% overlap) aligned to the onset of at least 10 introductory notes. Counting backwards from vocal onset, the time of the first bin significantly above baseline (Student’s t-test) that is preceded by two non-significant bins is considered premotor onset. Baseline is the average firing during Baseline periods.

Premotor offset

Defined as the measure of the lag time between the end of premotor activity and the end of vocal production. Calculated from average MUA aligned to song offsets, premotor offset for a given site is the time from the last bin significantly above baseline to the end of song.

Post-Vocal inhibition

Defined as the time it takes MUA to return to baseline levels following song offset. Using average MUA, significance with respect to baseline was determined for each bin following vocal offset (Student’s t-test, p<0.05). To account for the gradual return to baseline, we found the first 40 ms after vocal-offset where half the bins were not significantly below baseline. Post-Vocal inhibition is the time of the last significant bin within this 40 ms window.

Dynamic state changes

Early in our analyses we observed significant variability in the magnitude of changes in spike rate and synchronized spiking activity (dynamic state changes) following different vocalization types. Because song produced the most consistent changes, we restrict our analysis of dynamic state changes in NIf to song offsets unless otherwise noted. Firing rate was quantified using z-scores for comparison both within and across sites.

Zscore=(xμ)/σ

Where x is the experimental value, μ is the average during Baseline periods, and σ is the Baseline standard deviation. Spike counts were measured in non-overlapping 2 s bins across Baseline periods. Most sites had multiple Baseline periods separated by intervals of vocal activity. To account for any possible drift in firing rate, the nearest Baseline period was always chosen to compute the z-score. Significance was determined using two-sample t-tests comparing Pre- or Post-Vocal z-scores to Baseline z-scores (average baseline z-score will be 0, but measurements from individual bins provide variance).

For spectral analysis of spike signals, data were converted to a binary representation of spike activity with a 0.5 ms resolution. Power spectral density was measured from these binary time series using Thomson’s multitaper method (Thomson, 1982) as implemented by Matlab with a time-bandwidth product of 6. The multitaper method computes spectral estimates using a series of orthogonal tapers derived from Slepian sequences that seek to minimize errors in the frequency domain caused by the use of finite windows. To control for the possibility that increases in spike rate might cause broadband increases in spectral power (Manning et al., 2009), power measurements across a given frequency band were normalized by the total power of the signal. The resulting ‘normalized power’ metric represents the percentage of the total power of the signal contained within the frequency band of interest.

Power spectra for LFP data were also calculated using Thompson’s multitaper method. Power in LFP spectra was unevenly distributed across frequencies in a manner resembling the 1/f power law distribution observed in mammals (Bedard and Destexhe, 2009). Due to this uneven distribution of power, low frequencies contained significantly more power than high frequencies and thus were over-represented in LFP power measurements. To control for this, we normalized LFP power spectra calculated from experimental periods (Pre-Vocal, Post-Vocal, or the final 1 s of song) by the average Baseline spectrum calculated in non-overlapping 2 s windows across Baseline periods. The result is a modulation spectrum, which shows the change in power relative to baseline for each frequency (Henrie and Shapley, 2005). Modulation spectra control for the uneven distribution of power across frequencies, but do not control for broadband shifts in spectral power. Therefore, the ‘normalized power’ metric described above was also used to measure power across a given frequency band in modulation spectra.

A two-sample t-test was used to test for significant differences between normalized power calculated from experimental periods (Pre-Vocal, Post-Vocal, or the final 1 s of song) and normalized power calculated in non-overlapping 2 s bins across Baseline periods (1 s bins were used for comparison with data from the last 1 s of song).

Dynamic State Duration Measurements

For each site MUA was used to calculate spike rate and normalized power in 2 s windows centered on times beginning at song offset and then moved piecewise in 50 ms steps. For each time window, the collection of values for spike rate and normalized power was tested for significance vs. baseline (two-sample t-test, alpha 0.01). This resulted in a vector of significance vs. time relative to song offset with a 50 ms resolution. The duration of spike rate and fast gamma power increases for a given site was defined as the first significant bin followed by 1 s of non-significant bins. This value of 1 s was determined empirically after we observed sites where brief (< 1 s) dips below significance were followed by many seconds of significant values.

Single Unit Isolation and Analysis

Spike waveforms were identified and isolated using the WaveMark feature of Spike2. These waveforms were exported for analysis using custom built software developed in our laboratory by Dr. Chris Glaze. The software identifies single unit clusters by modeling data from the first three principle components of spike waveforms using a mixture of t-distributions (Svensen and Bishop, 2005). Each cluster was then manually inspected for separation and accuracy. Finally, Spike2’s built-in principle component analysis was used to follow the evolution of identified clusters over time. Single unit analysis was only conducted for the period of time during which a unit formed a distinct cluster. For single unit analyses, we expanded the Post-Vocal window to 1.5–5.5 s following vocal offset to account for smaller spike sample sizes.

Phase Locking Measurements

Phase times were recovered from the analytic signal of a Hilbert transform performed on signals filtered in the fast gamma range (90–150 Hz) (see Le Van Quyen et al., 2001 for a detailed description of Hilbert transforms). For each subject in which we recorded LFP signals, two electrodes were implanted. Each electrode was separated by approximately 250 μm. Due to NIf’s small size and oblong shape, it was rare for both of these electrodes to be within the nucleus simultaneously. To avoid concerns that spike waveforms could influence the LFP signal (David et al., 2010), we used the electrode not recording the single unit to calculate phase times, hereafter referred to as Eref. Though LFPs lose power as they propagate through the extracellular matrix from their source, in mammals they can often be detected 1 mm away from their source (for review see Logothetis, 2003). As expected the LFPs that we recorded outside of NIf had lost some power as they propagated to Eref, but they still contained reliable phase information. Using fast gamma phase times measured from Eref, we determined the phase of each spike in the modified Post-Vocal window and an equal number of spikes selected randomly from Baseline. A Rayleigh test for circular uniformity was used to measure the significance of phase locking, and a Kuiper two-sample test was used to test for differences in phase-locking between Baseline and Post-Vocal conditions. These tests were conducting using routines from the CircStat toolbox for Matlab (Berens, 2009).

Spike Field Coherence

Spike field coherence (SFC) is a normalized measure of the degree of phase synchronization between spike activity and the LFP as a function of frequency. This method has been described in detail elsewhere (Fries et al., 2001). Briefly, a spike-triggered waveform average (STA) is calculated by averaging the LFP around each spike. The absolute power of the STA is calculated and then normalized by the average power spectrum of all the individual LFP segments used to create the STA. The result is the SFC, a measure independent of spike rate or LFP amplitude, which takes a value between 0 and 1 for each frequency. A value of 1 indicates that all spikes occur at exactly the same phase for that frequency. A value of 0 indicates no phase relationship exists between spike activity and oscillations at that frequency. SFC was calculated using multitaper methods (5 tapers with a time-bandwidth product of 3) as implemented by the coherencycpt.m function from the Chronux data analysis toolbox for Matlab (Mitra and Bokil, 2008), available at http://chronux.org.

Auditory response testing

To test the influence of fast gamma oscillations on auditory processing, we examined auditory responses to long calls given by freely behaving female conspecifics. To determine whether a site had significant auditory responses to conspecific calls, we compared spike counts in 200 ms windows beginning at long call onset with spike counts in the 200 ms preceding long call onset using a paired t-test. We then identified all conspecific long calls that were produced within 12 s of the experimental subject singing. Because gamma oscillations are strongest after song, we classified these conspecific calls as Strong Oscillation Calls. We compared these against conspecific calls given during the Baseline period when fast gamma power is low (Weak Oscillation Calls). Auditory responses were quantified using the auditory response index:

RSINDEX=(XResponseXbaseline)/(XResponse+Xbaseline)

The value for XResponse was either spike count or normalized fast gamma power, both measured in 200 ms windows beginning at the onset of a conspecific call. The baseline for the Weak Oscillation Calls was the 200 ms preceding conspecific call onset. Because spike rate and oscillation power are not constant following song, we compared the measured response to a Strong Oscillation Call with the average value during the same time window following other songs. RSINDEX values for the two groups were compared using a two-sample t-test.

Measuring premotor activity

To investigate the relationship between premotor activity and fast gamma power at each site, we compared the oscillation power measured in the Post-Vocal period after each bout of vocalizations with measures of the premotor activity that accompany these vocalizations. Vocal bouts are typically made up of some combination of song, long calls, and tet/stack calls. Male long calls have stereotyped spectrotemporal features that are learned from a tutor; these calls are typically 100–200 ms long and are used to maintain long-range vocal contact. Tet calls and stack calls are unlearned vocalizations used in short-range vocal communication; they are shorter (~40–60 ms) and quieter than long calls (Zann, 1996). For simplicity, the term ‘tets’ is used to refer to both tet calls and stack calls in the remainder of the text.

It is not uncommon for vocalizations within a bout to be separated by 1–2 s of silence. The presence of fast gamma oscillations during these silent intervals complicates the estimation of premotor activity. The short bursts of multiunit firing associated with fast gamma oscillations have firing rates as high as those seen during premotor activity, which makes instantaneous firing rate an inaccurate tool for identifying premotor spikes. However, the increase in firing rate during premotor activity persists for tens of milliseconds, while fast gamma bursts only last ~2–4 ms. We therefore estimated premotor activity by calculating firing rate in 20 ms non-overlapping bins starting at vocal offset and moving backwards in time. All bins with firing rates greater than two standard deviations above the mean Baseline firing rate were considered premotor activity. Spike counts derived from these “premotor” bins were used to calculate the average premotor firing rate.

If premotor activity and fast gamma power are correlated, then the slow decay of fast gamma power beginning ~3–5 s after vocal offset (see RESULTS, Figure 6) suggests that the influence of premotor activity on fast gamma power decays with time. The average normalized oscillation power for all sites drops to ~70% of its maximum value 7 s after premotor activity ends. This suggests the magnitude of fast gamma oscillations is maximally influenced by premotor activity that occurred within the previous 7 s. In order to test if premotor activity was correlated with fast gamma power, we counted all premotor spikes, as described above, within 7 s of the start of each Post-Vocal period. Thus, for each Post-Vocal period a normalized power value and an estimate of premotor firing rate were calculated. The correlation between these values for all vocalizations at a site was measured using Pearson’s linear correlation coefficient and tested for significance against the null hypothesis of no correlation.

Figure 6. Increases in fast gamma power and spike rate persist for tens of seconds.

Figure 6

(A) Average increase in spontaneous firing rate before and after vocal activity for a representative site. Spontaneous firing rate was measured in 2 s sliding windows (97.5% overlap; dark grey +/− SD). Data plotted preceding vocal onset is the average for all vocal onsets used in Pre-Vocal analysis. Averaged data following vocal offsets were calculated for all songs that did not have any vocalizations in the subsequent 40 s. The dashed line represents the average value during baseline (0 for a z-score). The vertical hatched bar represents a typical length for a bout of song (~2 s). Firing rate remained elevated above baseline levels at this site for 20.65 s. Dashed box, the relatively short period of post-vocal inhibition cannot be properly resolved using a 2 s bin size, which is why spike counts do not drop below baseline here; larger bin sizes increase the reliability of measurements at the cost of temporal resolution.

(B) Average normalized fast gamma power over time for the same site as (A). Fast gamma power was calculated from multiunit activity in 2 s sliding windows as described in (A) (dark grey +/− SD). Oscillation power remained elevated above baseline levels for 26.95 s at this site.

(C) Time relative to vocal offset is plotted against the fraction of all sites whose firing rate remained significantly above baseline up to that time, essentially a reverse cumulative histogram (e.g. 100% of sites had firing rates significantly above baseline 5 s after vocal offset, 0% were significant at 55 s). Only sites that showed a significant increase in spike rate or gamma during the Post-Vocal period were included in this plot.

RESULTS

To investigate NIf’s role in vocal production and performance evaluation, we recorded multiunit activity and LFPs in awake behaving male zebra finches engaging in natural vocal communication behavior. Changes in network activity associated with vocalizations are reported for 18 sites across 6 birds.

NIf shows characteristic patterns of premotor activity

NIf is embedded in the auditory forebrain but is distinguishable from the surrounding auditory nuclei by its strong bursts of premotor activity during vocal production. Premotor activity during songs and long calls follows a general pattern characterized by three distinguishing features: large increases in firing rate prior to and during vocalization, cessation of neural activity prior to the offset of the vocalization, and continued suppression of neural activity for a short time after vocal offset (Figure 1b). Measured from the onset of the introductory notes that precede song, NIf premotor activity preceded song by an average 45.7 +/− 15.7 ms (18 sites across 6 birds). The pattern of premotor activity during song is related to the syllables that are produced, and thus cannot be easily generalized across birds singing different songs. However, in all six birds recorded for this study, premotor activity ended shortly before song offset. The average time between the end of premotor activity and the offset of song was 26.5 +/− 13.5 ms. Cessation of premotor activity was accompanied by a general suppression of neural activity in NIf. Firing rates remained below baseline levels for an average of 240 +/− 204 ms. This general pattern of premotor activity was consistent across all sites for songs and long calls and agrees with latencies recorded in earlier studies (McCasland, 1987).

Vocal activity causes long lasting changes in the dynamic network state of NIf

To determine whether vocal behavior caused changes in NIf’s neural activity beyond the transient increases in firing rate associated with premotor activity, we compared neural firing rates in NIf before and after periods of vocal activity with those measured during extended periods of silence. Neural activity was examined in three periods: Pre-Vocal, Post-Vocal, and a Baseline period defined by prolonged vocal inactivity (Figure 2a, see METHODS for details). To allow for comparison across sites, z-scores were calculated for the Pre-Vocal and Post-Vocal periods based on firing rates measured during Baseline periods.

Figure 2. Vocal behavior increases spontaneous firing rate and induces coherent oscillations in NIf.

Figure 2

(A) Schematic showing how measurement periods were defined (for details see METHODS). Grey boxes represent vocalizations (songs, long calls or tets). All analysis was restricted to Post-Vocal periods following song unless stated otherwise.

(B) Sound spectrogram and neural activity surrounding the production of a song with expanded time-views of Pre-Vocal and Post-Vocal periods. There is a marked increase in spontaneous firing after vocalization offset. Periods of post-vocal inhibition between each song motif can also be seen.

(C) Expanded time-view of 90 ms of neural activity during the Pre- and Post-Vocal periods. Also plotted are the simultaneously recorded local field potential (1–200 Hz) and a sinusoid at the peak frequency of oscillation for this site. Phase-locking in the Post-Vocal period can be visualized by comparing neural activity with this peak frequency sinusoid.

A representative example of neural activity in NIf both before and after the production of song is shown in Figure 2b. The brief suppression of activity following vocal offset transitions rapidly into a prolonged increase in firing rate. Most sites exhibited a significant increase in spiking relative to Baseline during the Post-Vocal period (15/18 sites, p<0.01, two-sample t-test, Figure 4). In contrast, few sites had a significant increase in Pre-Vocal firing rate (4/18, p<0.01), and all of the sites with significant increases in Pre-Vocal firing had an even larger increase in firing rate during the Post-Vocal period.

Figure 4. Power in the fast gamma band is positively correlated with spike rate, and the distribution of power in the fast gamma band is conserved across sites within birds.

Figure 4

(A) Z-score for normalized fast gamma power and spike rate during the Post-Vocal period for each site. There is a positive correlation between the magnitude of these changes across sites. Histograms of the z-scores for fast gamma power and spike rate are plotted on the top and left axes, respectively. Vertical and horizontal bars represent a count of 1 for the respective histograms.

(B) Peak frequency is conserved across sites within birds. Each point represents the average peak frequency during Post-Vocal periods for a given site. Peak frequency was defined as the frequency with the greatest power in the fast gamma band calculated from multiunit spike times. Sites are clustered by subject to show the consistency in peak frequency values across sites. Individual subjects are identified by tag number on the x-axis.

The increase in firing rate following vocalizations was accompanied by a striking change in action potential synchronization characterized by short windows of firing followed by periods of nearly complete silence (Figure 2c). Using multi-taper methods, we measured the power and frequency of this synchronized spiking activity. Figure 3 illustrates the frequency and time course of vocal activity induced spike synchronization. There is a strong and specific increase in the power of the 90 to 150 Hz band following each bout of vocalizations. Based on other studies, we refer to this frequency range as fast gamma (Buzsaki and Draguhn, 2004).

Figure 3. Vocal activity induces fast gamma oscillations in NIf.

Figure 3

Representative 90 s epoch during which the subject vocalized several times. Each vocal bout is followed by a selective and long-lasting increase in fast gamma power. (A) Microphone trace showing the location of the three vocal bouts (i, ii, and iii): i) a string of 12 closely spaced long calls, ii) a song bout with 3 motifs, iii) a song bout with 5 motifs. Plotted below is the normalized power calculated from multiunit spike times in 2 s sliding windows (75% overlap) across named frequency bands: Delta 1–3 Hz, Theta 4–8 Hz, Alpha 8–14 Hz, Beta 15–30 Hz, Gamma 30–90 Hz, Fast Gamma 90–150 Hz.

(B) Spectrogram calculated from MUA demonstrating the distribution of power increases across fast gamma and other frequencies for the same time period as (A). The peak frequency of oscillation for this subject was approximately 110 Hz. Each subject had a characteristic distribution of power increases across the fast gamma band in the Post-Vocal period. Spectrogram values were smoothed for display purposes (pillbox filter, 3 Hz × 1.5 s). Warmer colors represent greater Power / Hz. In both (A) and (B) premotor activity has been removed because power during vocal activity is dominated by the temporal spacing of premotor bursts, which is largely a reflection of call spacing or the syllabic structure of individual songs.

Synchronized spiking in the fast gamma range was strongest in the 1–5 s following vocalization and then decayed slowly over the course of many seconds as neural activity became less synchronized (Fig 3b). Nearly all sites showed significant increases in fast gamma power during the Post-Vocal period when compared to either Pre-Vocal or Baseline periods (Figure 4, 17/18 sites, p<0.0001, two-sample t-test). Only 1 site showed an increase in fast gamma power during the Pre-Vocal period (1/18 sites, p<0.01). All sites with significant increases in Post-Vocal spike rate also had significant increases in fast gamma power during the Post-Vocal period. Across the sites recorded in this study there was a strong correlation between gamma power and increased spike rate (Figure 4a, r = 0.88, p<0.001, Pearson linear correlation). This is in keeping with findings from previous studies where increases in gamma power are correlated with increased spike rates (Fries et al., 2001; Manning et al., 2009).

Similar to observations of gamma in mammals (Hall et al., 2009; Jansen et al., 2009), the frequency with the greatest power in the fast gamma range (peak frequency) was consistent across sites for an individual subject, but could differ considerably across subjects (Figure 4b). Across subjects, the average peak frequency of oscillation in the Post-Vocal period ranged from 110.34 Hz to 135.42 Hz; in contrast, the average frequency spread across sites within subjects was only 3.70 Hz. The difference in peak frequencies across subjects was significant (p<0.0001, one-way ANOVA). These results suggest each subject has an individually characteristic distribution of power across the fast gamma band.

Local field potential recordings in NIf demonstrate selective increases in fast gamma power

We hypothesized that the highly coherent nature of neural firing during fast gamma oscillations should allow their detection in LFPs even though the avian telencephalon is not ideal for such measurements given its non-laminar structure without any clear neural architecture to sum and amplify synchronous neural events. We recorded LFPs and MUA simultaneously from 15 sites in 5 birds and looked for changes in power during the Pre-and Post-Vocal periods. To control for natural variations in LFP power across frequencies, we computed ‘modulation’ spectra (Figure 5a, see METHODS), which show the change in power for each frequency relative to the Baseline average. Most sites showed a significant increase in LFP power across the fast gamma band during the Post-Vocal period (12/15 sites, p<0.01, t-test). No sites had increased fast gamma power during the Pre-Vocal period (0/15, p<0.01, t-test). Fast gamma power measured from LFPs was positively correlated with spike rate (r = 0.82, p<0.001 Pearson’s test for nonzero correlation). Figure 5b shows the average modulation spectrum for all sites that had significant increases in Post-Vocal fast gamma power.

Figure 5. Local field potentials show selective increases in fast gamma power following vocal offset.

Figure 5

(A) Top, average LFP power spectra during Baseline and Post-Vocal periods from an exemplar site. Bottom, a modulation spectrum for this site shows the percentage increase or decrease in fast gamma power during the Post-Vocal period relative to Baseline. This procedure controls for the uneven distribution of power across frequencies in the LFP.

(B) Average modulation spectra for all sites in each bird are plotted in thin grey lines. The grand average across all five birds is also plotted (thick black line, SEM in dark grey). Note that the grand average spans the entire fast gamma range, but power increases in individual birds only span a portion of the full range.

(C) Premotor activity and increases Post-Vocal fast gamma power are correlated across sites within NIf. Data is plotted from five sites recorded from the same electrode. The depth of each recording site is plotted on the x-axis. The average multiunit firing rate during the last 1 s of songs immediately preceding Post-Vocal periods is plotted in solid black. The dotted line represents the average normalized Post-Vocal fast gamma power calculated from LFPs at each site. Standard deviations are indicated with vertical lines.

The ability to detect fast gamma oscillations in the spike output of NIf neurons (Figure 3) is strong evidence that either fast gamma oscillations are generated locally by the neural network in NIf, or else a second structure connected to NIf oscillates in the fast gamma range and entrains NIf’s network through strong synaptic connections. NIf is not connected to the auditory structures that surround it, and therefore it is unlikely that these structures are the source of the fast gamma oscillations that shape neural activity within NIf. Further evidence that the increase in LFP fast gamma power originates from NIf comes from a comparison of LFPs recorded from electrodes located within NIf with reference electrodes located a short distance (~250 um) outside of NIf. At all sites, fast gamma power was stronger at the LFP recorded within NIf than at the LFP recorded from outside of NIf; at all sites but one, this difference reached significance (10/11, p<0.01, paired t-test).

To further confirm that fast gamma power observed in the LFP recordings originates from NIf, we compared activity across five recording sites from a single electrode penetration through NIf., We hypothesized that the power of fast gamma oscillations should increase with proximity to the center of NIf given that NIf has no clear neural architecture that would support segregated layers of sinks and sources (Fortune and Margoliash, 1992). As shown in Figure 5c, as the recording electrode moved through NIf there was first an increase and then a decrease in the magnitude of premotor activity, a characteristic feature neural activity in NIf (see Figure 1). These changes in premotor activity across sites were strongly correlated with changes in the magnitude of normalized fast gamma measured from the LFPs recorded at each site (r = 0.8905, p=0.043, Pearson’s test for non-zero correlation).

Taken together, these findings all support the conclusion that NIf is the source of the increases in fast gamma power observed in our local field potential recordings.

Increases in fast gamma power and neural firing rate often persist for tens of seconds after the offset of song

Increases in gamma band activity triggered by a stimulus or event typically last for a second or less (Laurent and Naraghi, 1994; Siegel and Konig, 2003; Brugge et al., 2009; Fukuda et al., 2010). In stark contrast, NIf’s vocalization-induced increases in fast gamma power often persisted for ten or more seconds, with some sites not returning to baseline activity levels until more than 30 s after vocal offset. Figure 6a shows the average time-course of fast gamma power and firing rate changes in NIf following vocalization in a representative site. After a sharp increase following post-vocal inhibition, fast gamma power and spike rate remain at or near peak values from 1.5 – 4.5 s after vocal offset before slowly falling back to baseline levels over the course of many seconds.

The duration of the changes in NIf’s network activity varied considerably from site to site (Figure 6c). The duration of elevated fast gamma power following the offset of song ranged between 9.4 s and 34.7 s, with an average of 21.62 +/− 8.76 s (17 sites in 6 birds). Spike rate in NIf remained elevated following the offset of song for periods ranging from 10.2 s to 51.1 s, with an average of 32.09 +/− 13.96 s (13 sites in 5 birds). Only sites that demonstrated significant increases in fast gamma power or spike rate were included in this analysis.

Fast gamma oscillations in NIf during song production

The presence of fast gamma oscillations during song could indicate that fast gamma is involved in providing a temporal framework for motor production. This would fit with previous results which showed that network dynamics within RA evolve on a timescale of approximately 125 Hz (Leonardo and Fee, 2005). Measurements of fast gamma power during song production must be assessed using LFPs because the power spectrum derived from multiunit spike times is dominated by the premotor bursts associated with the production of song syllables. To determine if fast gamma power increased prior to the offset of song, we measured normalized LFP power during the final 1 s of each song that directly preceded a Post-Vocal period. In both the recording (ERec) and reference (ERef) electrodes, we observed either no change or a significant decrease (ERec 10/15 sites, ERef 5/10 sites, p<0.01, t-test) in normalized LFP fast gamma power during singing as compared to Baseline.

Isolated single units are phase locked to fast gamma oscillations

We were able to isolate 14 single units from our multiunit recording sites in NIf (see METHODS for details). To test for phase locking, we determined the phase of fast gamma oscillations corresponding to single-unit spike times from the Post-Vocal and Baseline periods. To control for any possible influence of spike waveforms on the simultaneously recorded LFP, we measured phase-locking of single units against LFPs recorded from a second electrode (ERef) located approximately 250 μm away from the single unit electrode (see METHODS for details). ERef was located outside of NIf for all but two units, and neither of these two units showed significant phase-locking.

Of 14 single units, 8 were significantly phase locked to the LFP during the Post-Vocal period (8/14, p < 0.05, Rayleigh test). Figure 7a shows the phase distribution and spike triggered waveform average (STA) of an example unit during Post-Vocal and Baseline periods. The clearly visible fast gamma waveform in the STA demonstrates the tight coupling between single unit firing and oscillations during the Post-Vocal period. In this example, the STA calculated from Baseline spike times shows a small but significant amount of phase locking during non-vocal periods. Four units showed weak phase locking during the baseline period (4/14, p<0.05, Rayleigh test). Figure 7b shows the phase preference for each of the phase-locked neurons. Most phase-locked neurons fired preferentially during the trough or rising phase of fast gamma.

Figure 7. Single units phase lock to underlying fast gamma oscillations.

Figure 7

(A) Circular phase plot (left) and spike-triggered waveform average (STA; right) for an exemplar unit that showed significant phase locking to fast gamma frequencies during the Post-Vocal period (red). Long from the center in the circular phase plot represents the probability of a spike occurring at that phase. To create the STA, LFPs (filtered in the fast gamma range 90–150 Hz using a Butterworth IIR filter) occurring around each spike in the Post-Vocal period, and an equal number of spikes in the Baseline period, were aligned by spike onset and averaged. This exemplar unit was one of a small number of single units that were weakly, but significantly, phase-locked to fast gamma during the Baseline period (blue).

(B) Phase relationship between fast gamma and each significantly phase-locked single unit. Two oscillation cycles are plotted against the probability of firing at each point in the cycle. The dotted line is a representation of the fast gamma waveform. The exemplar from (A) is plotted in dark red. In order to eliminate phase-shifts that occur as the oscillations propagate through the extracellular matrix, phase values for this plot were derived from LFPs recorded at the site of the single unit.

(C) Spike field coherence during Post-Vocal (red) and Baseline (blue) periods for the exemplar site shown in (A). Means are plotted in dark lines with shaded areas indicating 5% jack-knife error bars. Note the similarity between this SFC plot and the modulation spectrum in Figure 5a; these data were calculated using two different sites from the same bird.

(D) Distribution of SFC for each significantly phase locked single unit. The dark red trace represents the exemplar shown in (C).

To quantify the phase locking properties of identified single units across frequencies, we calculated their spike-field coherence (SFC). In this analysis a coherence of 1 for a given frequency indicates that spikes always occur at the same phase, while a value of 0 indicates no correlation between spike times and the phase of oscillation. Figure 7c shows the SFC for an exemplar unit. For this unit the increase in phase-locking during the Post-Vocal period is restricted to a frequency band between 95 and 120 Hz. The SFC during the Post-Vocal period for each of the significantly phase-locked units is shown in Figure 7d.

Of the 14 single units, 5 had significantly increased spike rates in the Post-Vocal period relative to Baseline (5/14, p<0.05, two-sample t-test), 8 had significantly decreased firing rates (8/14, p < 0.05, two-sample t-test), and 1 showed no significant change in firing rate. All of the single units with increased Post-Vocal firing rates were also significantly phase-locked during the Post-Vocal period. Of the neurons that did not increase their firing rate in the Post-Vocal period, only 3 were significantly phase-locked (3/9, p<0.05 Raleigh test). The fact that most of our single units had decreased firing rates in the Post-Vocal period, while nearly all our multiunit recordings increased firing during this time, may be a reflection of the fact that larger neurons are easier to isolate from multiunit recordings (Logothetis, 2003). NIf is known to contain populations of neurons with different soma sizes (Fortune and Margoliash, 1992), and the larger of these neural subtypes will be naturally over-represented in single unit isolations from extracellular recordings.

Auditory responses during fast gamma oscillations in NIf

NIf shows a high degree of variability in both the strength and selectivity of auditory responses in awake birds (Cardin and Schmidt, 2004b). It is possible that fast gamma oscillations in NIf represent a specialization for processing conspecific responses during vocal communication. We compared auditory responses to conspecific long calls given during periods of strong oscillations (following song production) and weak oscillations (during the Baseline period). Because these calls were produced during natural vocal interactions, only 6 sites had enough activity to be analyzed. All six of these sites had significant auditory responses to conspecific calls, measured as an increase in multiunit firing (6/6 sites, p<0.01, t-test). However, none of the sites showed any significant difference in auditory responses during periods of strong vs weak oscillations (0/6 sites, p<0.05, t-test). In addition, no sites demonstrated significant changes in normalized fast gamma power in response to conspecific calls (0/6 sites, p<0.05, t-test). Because of our small sample size we cannot conclude with certainty that fast gamma oscillations have no influence on auditory responses in NIf. It is worth noting, however, that antiphonal calling (the exchange of calls between two or more conspecifics) is the primary form of reciprocal vocal communication in zebra finches (Zann, 1996). Antiphonal call responses are typically produced within a few hundred milliseconds after hearing a conspecific long call. Thus, an antiphonal response call will often arrive during the period of post-vocal inhibition in NIf when both spike rate and oscillatory activity are largely suppressed (see Figures 1, 2).

The relationship between fast gamma oscillations and premotor activity in NIf

To clarify the relationship between vocal activity and fast gamma oscillations, we expanded our analysis to include both tets (short contact calls) and long calls. We found that including these additional vocalizations expanded the range of Post-Vocal fast gamma power values we observed. Song generally induced the strongest oscillations and tets the weakest, with long calls falling somewhere in between (Figure 8a). However, a series of closely spaced long calls could induce oscillations as strong as song (for example see Figure 3a). These observations led us to hypothesize that premotor activity, as opposed to vocalization type, was correlated with the power of subsequent oscillations.

Figure 8. Premotor activity is positively correlated with fast gamma power.

Figure 8

(A) Scatter plots demonstrate the positive correlation between normalized fast gamma power and average premotor firing rate for exemplar sites in two different birds. For each vocal offset, an estimate of premotor firing (see METHODS) during the preceding 5 s is plotted against normalized fast gamma power in the Post-Vocal period. The vocalization type preceding each measurement window is indicated.

(B) Normalized fast gamma power nears its maximal value following the level of premotor activity typically produced during a bout of song. For comparison across sites and birds, fast gamma power and premotor firing estimates were normalized between 0 and 1 based on the minimal and maximal measurements at each individual site. These normalized values were used to divide premotor firing estimates into three bins Low (0–0.33), Medium (0.33–0.66), and High (0.66–1). Normalized fast gamma power associated with the premotor estimates in each bin was averaged, and these values were used to calculate a grand average across all sites. Even small amounts of premotor activity cause a significant increase in fast gamma power when compared to baseline (** p<0.01, t-test). However, fast gamma power saturates before premotor activity estimates reach maximal values. Most of the variation in fast gamma power is found at lower concentrations of premotor activity.

To test this hypothesis, we measured the correlation between fast gamma power in the Post-Vocal period and the amount of premotor activity prior to vocalization offset (only sites with significant increases in Post-Vocal fast gamma power were included in this analysis). Examples of the covariation between premotor activity and oscillation power for two exemplar sites are shown in Figure 8a. All sites had a significant positive correlation between oscillation power and premotor activity (17/17, p<0.001, Pearson linear correlation) with an average correlation coefficient of 0.69 +/− 0.12. Oscillation power tended to asymptote before maximal levels of premotor activity were reached (Figure 8b). As a result of this, oscillations induced by singing would often approach the maximal power for fast gamma at a given site. Thus, while fast gamma power increased following tets and long calls, most strong oscillations occurred after song.

DISCUSSION

In this study, we show that vocal activity causes dramatic and prolonged changes in network dynamics in NIf that are characterized by an increase in spike rate and neural oscillations in the fast gamma range (90–150 Hz). The observation of fast gamma oscillations in LFPs recorded outside of NIf suggests an unusually high degree of coherency in these oscillations given that NIf, and most structures in the avian brain (Wang et al., 2010), lack a laminated architecture and any clear concentration of sinks and sources that would amplify synchronous synaptic activity (Fortune and Margoliash, 1992; Logothetis, 2003). NIf’s fast gamma oscillations could play an important role in integrating vocalization-related sensorimotor information important for the encoding and maintenance of learned vocalizations.

A remarkable characteristic of our observations is that relatively short bouts of vocal activity (2–3 s) can induce oscillations that persist for an order of magnitude longer than the stimulus that initiated them. We are unaware of any examples where an isolated stimulus or behavioral event causes oscillations that last more than a few seconds (Laurent and Naraghi, 1994; Siegel and Konig, 2003; Brugge et al., 2009; Fukuda et al., 2010). Most models of gamma producing networks require tonic input to reproduce oscillatory activity (Bartos et al., 2007). While we cannot rule out that NIf receives such input from its auditory afferents, it is unlikely to come from within the song system because activity in the premotor thalamic nucleus Uva, which sends a strong projection to NIf, remains elevated for only 1–2 s following song offset (Williams and Vicario, 1993). Another possibility is that neuromodulatory inputs (Ryan and Arnold, 1981; Cardin and Schmidt, 2004a) are active during and/or following vocalization in NIf and cause the network to express oscillatory activity (Fisahn et al., 1998).

Fast oscillations and the song system

Though no previous studies have reported fast gamma oscillations in the song system, results from studies in RA hint at the involvement of fast gamma in the encoding and production of song. In RA, each moment of song is encoded by a stereotyped ensemble of neurons (Yu and Margoliash, 1996; Leonardo and Fee, 2005). The motor code for song in RA consists of a succession of unique neural ensembles, each of which is active for ~8 ms (125 Hz), suggesting that, at the level of RA, the structure of song may be encoded on a fixed time-scale in the fast gamma range. Activity patterns in RA also suggest that fast gamma may be important for the offline encoding of tutor song during song learning in juvenile birds. During the first night following exposure to the tutor song, RA neurons transition from being tonically active to becoming more phasic in nature with a dramatic increase in inter-spike intervals (ISI) in the fast gamma range (Shank and Margoliash, 2009). Finally, RA and HVC both contain a population of fast-spiking, parvalbumin-expressing interneurons (Wild et al., 2001; Rauske et al., 2003; Wild et al., 2005), similar to those that underlie gamma oscillations in mammals, and experiments in RA have shown that activation of this interneuronal network can synchronize firing in projection neurons (Spiro et al., 1999).

Gamma oscillations in NIf are unlikely to play a direct role in motor production or sensory processing

Despite its song-related auditory and motor activity, it remains unclear what role NIf plays in vocal production and/or auditory processing. Anatomically, NIf is located at the interface of the auditory and vocal motor systems. It receives motor projections from the thalamic nucleus Uva and auditory input from the secondary auditory area CM (Vates et al., 1996), including a strong projection from nucleus Avalanche, a subdivision of CM that is connected reciprocally with both NIf and HVC (Akutagawa and Konishi, 2010). In addition to auditory information, NIf provides strong excitatory drive to nucleus HVC (Coleman and Mooney, 2004; Hahnloser and Fee, 2007), which along with RA makes up the telencephalic portion of the direct motor pathway of the song system. Unlike, HVC and RA, however, lesions of NIf do not cause immediate degradation of the spectrotemporal features of song (Simpson and Vicario, 1990; Cardin et al., 2005; Roy and Mooney, 2009). This lack of short-term spectral degradation of song following NIf lesions, combined with our failure to find significant increases in normalized fast gamma power prior to or during song, make it unlikely that NIf’s oscillations are directly involved in song production.

Auditory responses in NIf range from variable and non-selective in awake birds to strong and highly selective for BOS during sleep (Janata and Margoliash, 1999; Cardin and Schmidt, 2004b). In mammals gamma oscillations have been linked to attentional modulation of sensory responses (Fries et al., 2001; Buehlmann and Deco, 2008); similar modulation in NIf could help explain the variability of auditory responses in the awake bird (Cardin and Schmidt, 2004b). To test this possibility, we measured auditory responses to calls produced by freely behaving conspecifics. We failed, however, to observe any differences in auditory response strength between calls heard under conditions of high or low gamma power.

NIf and the auditory feedback dependent maintenance of song

Bilateral lesions of NIf cause a significant decrease in spontaneous activity and auditory responses in HVC, which is the source of auditory information to both the descending motor and anterior forebrain pathways (Coleman and Mooney, 2004; Cardin et al., 2005; Roy and Mooney, 2009). Given the necessity of auditory feedback for the maintenance of crystallized song, lesions of NIf in zebra finches might be expected to cause spectrotemporal degradation of song similar to that seen following deafening (Nordeen and Nordeen, 1992; Lombardino and Nottebohm, 2000). In Bengalese finches, both deafening (Woolley and Rubel, 1997) and NIf lesions (Hosino and Okanoya, 2000) cause, within a relatively short time period (days to weeks), syntactically complex songs to become less variable and more deterministic. In contrast, zebra finch song retains its spectrotemporal structure following bilateral NIf lesions for at least 5 weeks (Cardin et al., 2005; Roy and Mooney, 2009), which is longer than song remains intact in deafened birds of the same age (Lombardino and Nottebohm, 2000). However, the songs of older zebra finches can maintain their spectrotemporal structure for more than a year following deafening (Lombardino and Nottebohm, 2000) suggesting that the motor code of crystallized zebra finch song is resilient to disruptions of auditory feedback. It is perhaps not surprising then that lesions of NIf do not have an immediate or near immediate effect on the spectrotemporal structure of adult song (Cardin et al., 2005; Roy and Mooney, 2009) especially since HVC also receives auditory inputs from Av (Akutagawa and Konishi, 2010). Whether NIf lesions would eventually cause degradations in song similar to those observed following deafening is unknown, and the role of NIf in auditory feedback based song learning and maintenance remains unclear.

Fast gamma oscillations as a possible mechanism for offline vocal-motor performance evaluation

In other systems, the presence of sustained oscillations is an indication of sustained neural processing and/or communication between brain regions (Hoogenboom et al., 2006; Fries et al., 2008; Koch et al., 2009). In NIf, persistent oscillatory activity is specifically induced by the production of vocalizations, particularly song. It is reasonable to hypothesize, therefore, that fast gamma oscillations in NIf represent sustained processing of information associated with vocal production. The question then is what sort of information a song system nucleus would be processing in the period immediately following song production?

Given the lack of immediate effects of feedback perturbation on song motor activity (Konishi, 2004; Kozhevnikov and Fee, 2007; but see Sakata and Brainard, 2008), offline processing has been proposed as a possible mechanism for comparing motor performance (auditory feedback) with motor instruction (premotor activity) (Margoliash and Schmidt, 2010). Fast gamma oscillations in NIf are tightly linked to premotor activity and may therefore be well suited to facilitate offline integration of auditory and premotor information following singing bouts. The anatomical connectivity between Av, NIf, and HVC suggests a sensorimotor loop that is well situated to play a role in offline vocal processing, especially given that Av forms part of the auditory forebrain known to contain neurons sensitive to perturbations in vocalization-related auditory feedback (Keller and Hahnloser, 2009). If Av tracks auditory feedback during vocal production, fast gamma oscillations could simultaneously enhance communication between NIf and Av while providing a temporal framework for the integration of vocal motor and auditory feedback traces. If such offline sensorimotor integration were occurring, then we would predict that field oscillations and/or single unit activity in Av should phase-lock to fast gamma oscillations in NIf. Furthermore, if NIf is involved in the process of evaluating vocal performance to correct errors in vocal output, then lesions of NIf should affect the recovery of normal song following decrystallization caused by distorted auditory feedback (Leonardo and Konishi, 1999).

Comparisons of gamma oscillations in the songbird brain with those in mammalian systems will expand our understanding of the universality and versatility of gamma oscillations (Fries, 2009). Furthermore, as a discrete nucleus with a limited set of inputs, NIf provides a rich model system for the study of gamma oscillations, particularly given the prolonged duration of the oscillations and their established relationship with an easily quantifiable vocal behavior.

Acknowledgments

This work was supported by grants from the National Institutes of Health (DC006102 and DC006453). We would like to thank Kristina Penikis for her assistance with chronic recordings, and Dr. Josh Jacobs for his comments on the manuscript.

Footnotes

SUPPLEMENTAL MATERIAL - None

CONFLICTS OF INTEREST - None

References

  1. Akutagawa E, Konishi M. New brain pathways found in the vocal control system of a songbird. J Comp Neurol. 2010;518:3086–3100. doi: 10.1002/cne.22383. [DOI] [PubMed] [Google Scholar]
  2. Axmacher N, Mormann F, Fernandez G, Elger CE, Fell J. Memory formation by neuronal synchronization. Brain Res Rev. 2006;52:170–182. doi: 10.1016/j.brainresrev.2006.01.007. [DOI] [PubMed] [Google Scholar]
  3. Bartos M, Vida I, Jonas P. Synaptic mechanisms of synchronized gamma oscillations in inhibitory interneuron networks. Nat Rev Neurosci. 2007;8:45–56. doi: 10.1038/nrn2044. [DOI] [PubMed] [Google Scholar]
  4. Bedard C, Destexhe A. Macroscopic models of local field potentials and the apparent 1/f noise in brain activity. Biophys J. 2009;96:2589–2603. doi: 10.1016/j.bpj.2008.12.3951. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Berens P. CircStat: A Matlab toolbox for circular statistics. Journal of Statistical Software. 2009;31:1–21. [Google Scholar]
  6. Brugge JF, Nourski KV, Oya H, Reale RA, Kawasaki H, Steinschneider M, Howard MA., 3rd Coding of repetitive transients by auditory cortex on Heschl’s gyrus. J Neurophysiol. 2009;102:2358–2374. doi: 10.1152/jn.91346.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Buehlmann A, Deco G. The neuronal basis of attention: rate versus synchronization modulation. J Neurosci. 2008;28:7679–7686. doi: 10.1523/JNEUROSCI.5640-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Buzsaki G, Draguhn A. Neuronal oscillations in cortical networks. Science. 2004;304:1926–1929. doi: 10.1126/science.1099745. [DOI] [PubMed] [Google Scholar]
  9. Cardin JA, Schmidt MF. Noradrenergic inputs mediate state dependence of auditory responses in the avian song system. J Neurosci. 2004a;24:7745–7753. doi: 10.1523/JNEUROSCI.1951-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cardin JA, Schmidt MF. Auditory responses in multiple sensorimotor song system nuclei are co-modulated by behavioral state. J Neurophysiol. 2004b;91:2148–2163. doi: 10.1152/jn.00918.2003. [DOI] [PubMed] [Google Scholar]
  11. Cardin JA, Raksin JN, Schmidt MF. Sensorimotor nucleus NIf is necessary for auditory processing but not vocal motor output in the avian song system. J Neurophysiol. 2005;93:2157–2166. doi: 10.1152/jn.01001.2004. [DOI] [PubMed] [Google Scholar]
  12. Cardin JA, Carlen M, Meletis K, Knoblich U, Zhang F, Deisseroth K, Tsai LH, Moore CI. Driving fast-spiking cells induces gamma rhythm and controls sensory responses. Nature. 2009;459:663–667. doi: 10.1038/nature08002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Coleman MJ, Mooney R. Synaptic transformations underlying highly selective auditory representations of learned birdsong. J Neurosci. 2004;24:7251–7265. doi: 10.1523/JNEUROSCI.0947-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Dan Y, Poo MM. Spike timing-dependent plasticity of neural circuits. Neuron. 2004;44:23–30. doi: 10.1016/j.neuron.2004.09.007. [DOI] [PubMed] [Google Scholar]
  15. Dave AS, Margoliash D. Song replay during sleep and computational rules for sensorimotor vocal learning. Science. 2000;290:812–816. doi: 10.1126/science.290.5492.812. [DOI] [PubMed] [Google Scholar]
  16. David SV, Malaval N, Shamma SA. Decoupling action potential bias from cortical local field potentials. Comput Intell Neurosci. 2010:393019. doi: 10.1155/2010/393019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Engel AK, Singer W. Temporal binding and the neural correlates of sensory awareness. Trends Cogn Sci. 2001;5:16–25. doi: 10.1016/s1364-6613(00)01568-0. [DOI] [PubMed] [Google Scholar]
  18. Fell J, Klaver P, Lehnertz K, Grunwald T, Schaller C, Elger CE, Fernandez G. Human memory formation is accompanied by rhinal-hippocampal coupling and decoupling. Nat Neurosci. 2001;4:1259–1264. doi: 10.1038/nn759. [DOI] [PubMed] [Google Scholar]
  19. Fisahn A, Pike FG, Buhl EH, Paulsen O. Cholinergic induction of network oscillations at 40 Hz in the hippocampus in vitro. Nature. 1998;394:186–189. doi: 10.1038/28179. [DOI] [PubMed] [Google Scholar]
  20. Fortune ES, Margoliash D. Cytoarchitectonic organization and morphology of cells of the field L complex in male zebra finches (Taenopygia guttata) J Comp Neurol. 1992;325:388–404. doi: 10.1002/cne.903250306. [DOI] [PubMed] [Google Scholar]
  21. Fries P. A mechanism for cognitive dynamics: neuronal communication through neuronal coherence. Trends Cogn Sci. 2005;9:474–480. doi: 10.1016/j.tics.2005.08.011. [DOI] [PubMed] [Google Scholar]
  22. Fries P. Neuronal gamma-band synchronization as a fundamental process in cortical computation. Annu Rev Neurosci. 2009;32:209–224. doi: 10.1146/annurev.neuro.051508.135603. [DOI] [PubMed] [Google Scholar]
  23. Fries P, Reynolds JH, Rorie AE, Desimone R. Modulation of oscillatory neuronal synchronization by selective visual attention. Science. 2001;291:1560–1563. doi: 10.1126/science.1055465. [DOI] [PubMed] [Google Scholar]
  24. Fries P, Womelsdorf T, Oostenveld R, Desimone R. The effects of visual stimulation and selective visual attention on rhythmic neuronal synchronization in macaque area V4. J Neurosci. 2008;28:4823–4835. doi: 10.1523/JNEUROSCI.4499-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Fukuda M, Rothermel R, Juhasz C, Nishida M, Sood S, Asano E. Cortical gamma-oscillations modulated by listening and overt repetition of phonemes. NeuroImage. 2010;49:2735–2745. doi: 10.1016/j.neuroimage.2009.10.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hahnloser RH, Fee MS. Sleep-related spike bursts in HVC are driven by the nucleus interface of the nidopallium. J Neurophysiol. 2007;97:423–435. doi: 10.1152/jn.00547.2006. [DOI] [PubMed] [Google Scholar]
  27. Hall MH, Taylor G, Sham P, Schulze K, Rijsdijk F, Picchioni M, Toulopoulou T, Ettinger U, Bramon E, Murray RM, Salisbury DF. The Early Auditory Gamma-Band Response Is Heritable and a Putative Endophenotype of Schizophrenia. Schizophr Bull. 2009 doi: 10.1093/schbul/sbp134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Henrie JA, Shapley R. LFP power spectra in V1 cortex: the graded effect of stimulus contrast. J Neurophysiol. 2005;94:479–490. doi: 10.1152/jn.00919.2004. [DOI] [PubMed] [Google Scholar]
  29. Hoogenboom N, Schoffelen JM, Oostenveld R, Parkes LM, Fries P. Localizing human visual gamma-band activity in frequency, time and space. Neuroimage. 2006;29:764–773. doi: 10.1016/j.neuroimage.2005.08.043. [DOI] [PubMed] [Google Scholar]
  30. Hosino T, Okanoya K. Lesion of a higher-order song nucleus disrupts phrase level complexity in Bengalese finches. Neuroreport. 2000;11:2091–2095. doi: 10.1097/00001756-200007140-00007. [DOI] [PubMed] [Google Scholar]
  31. Janata P, Margoliash D. Gradual emergence of song selectivity in sensorimotor structures of the male zebra finch song system. J Neurosci. 1999;19:5108–5118. doi: 10.1523/JNEUROSCI.19-12-05108.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Jansen R, Linkenkaer-Hansen K, Heistek T, Timmerman J, Mansvelder HD, Brussaard AB, de Gunst M, van Ooyen A. Inbred mouse strains differ in multiple hippocampal activity traits. Eur J Neurosci. 2009;30:1092–1100. doi: 10.1111/j.1460-9568.2009.06883.x. [DOI] [PubMed] [Google Scholar]
  33. Keller GB, Hahnloser RH. Neural processing of auditory feedback during vocal practice in a songbird. Nature. 2009;457:187–190. doi: 10.1038/nature07467. [DOI] [PubMed] [Google Scholar]
  34. Koch SP, Werner P, Steinbrink J, Fries P, Obrig H. Stimulus-induced and state-dependent sustained gamma activity is tightly coupled to the hemodynamic response in humans. J Neurosci. 2009;29:13962–13970. doi: 10.1523/JNEUROSCI.1402-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Konishi M. The role of auditory feedback in birdsong. Ann N Y Acad Sci. 2004;1016:463–475. doi: 10.1196/annals.1298.010. [DOI] [PubMed] [Google Scholar]
  36. Kozhevnikov AA, Fee MS. Singing-related activity of identified HVC neurons in the zebra finch. J Neurophysiol. 2007;97:4271–4283. doi: 10.1152/jn.00952.2006. [DOI] [PubMed] [Google Scholar]
  37. Laurent G, Naraghi M. Odorant-induced oscillations in the mushroom bodies of the locust. J Neurosci. 1994;14:2993–3004. doi: 10.1523/JNEUROSCI.14-05-02993.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Le Van Quyen M, Foucher J, Lachaux J, Rodriguez E, Lutz A, Martinerie J, Varela FJ. Comparison of Hilbert transform and wavelet methods for the analysis of neuronal synchrony. J Neurosci Methods. 2001;111:83–98. doi: 10.1016/s0165-0270(01)00372-7. [DOI] [PubMed] [Google Scholar]
  39. Leonardo A, Konishi M. Decrystallization of adult birdsong by perturbation of auditory feedback. Nature. 1999;399:466–470. doi: 10.1038/20933. [DOI] [PubMed] [Google Scholar]
  40. Leonardo A, Fee MS. Ensemble coding of vocal control in birdsong. J Neurosci. 2005;25:652–661. doi: 10.1523/JNEUROSCI.3036-04.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Logothetis NK. The underpinnings of the BOLD functional magnetic resonance imaging signal. J Neurosci. 2003;23:3963–3971. doi: 10.1523/JNEUROSCI.23-10-03963.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Lombardino AJ, Nottebohm F. Age at deafening affects the stability of learned song in adult male zebra finches. J Neurosci. 2000;20:5054–5064. doi: 10.1523/JNEUROSCI.20-13-05054.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Low PS, Shank SS, Sejnowski TJ, Margoliash D. Mammalian-like features of sleep structure in zebra finches. Proc Natl Acad Sci U S A. 2008;105:9081–9086. doi: 10.1073/pnas.0703452105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Manning JR, Jacobs J, Fried I, Kahana MJ. Broadband shifts in local field potential power spectra are correlated with single-neuron spiking in humans. J Neurosci. 2009;29:13613–13620. doi: 10.1523/JNEUROSCI.2041-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Margoliash D, Schmidt MF. Sleep, off-line processing, and vocal learning. Brain Lang. 2010;115:45–58. doi: 10.1016/j.bandl.2009.09.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. McCasland JS. Neuronal control of bird song production. J Neurosci. 1987;7:23–39. doi: 10.1523/JNEUROSCI.07-01-00023.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Mitra P, Bokil H. Observed brain dynamics. Oxford ; New York: Oxford University Press; 2008. [Google Scholar]
  48. Neuenschwander S, Engel AK, Konig P, Singer W, Varela FJ. Synchronization of neuronal responses in the optic tectum of awake pigeons. Vis Neurosci. 1996;13:575–584. doi: 10.1017/s0952523800008257. [DOI] [PubMed] [Google Scholar]
  49. Nordeen KW, Nordeen EJ. Auditory feedback is necessary for the maintenance of stereotyped song in adult zebra finches. Behav Neural Biol. 1992;57:58–66. doi: 10.1016/0163-1047(92)90757-u. [DOI] [PubMed] [Google Scholar]
  50. Rauske PL, Shea SD, Margoliash D. State and neuronal class-dependent reconfiguration in the avian song system. J Neurophysiol. 2003;89:1688–1701. doi: 10.1152/jn.00655.2002. [DOI] [PubMed] [Google Scholar]
  51. Rauske PL, Chi Z, Dave AS, Margoliash D. Neuronal stability and drift across periods of sleep: premotor activity patterns in a vocal control nucleus of adult zebra finches. J Neurosci. 2010;30:2783–2794. doi: 10.1523/JNEUROSCI.3112-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Roy A, Mooney R. Song decrystallization in adult zebra finches does not require the song nucleus NIf. J Neurophysiol. 2009;102:979–991. doi: 10.1152/jn.00293.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Ryan SM, Arnold AP. Evidence for cholinergic participation in the control of bird song; acetylcholinesterase distribution and muscarinic receptor autoradiography in the zebra finch brain. J Comp Neurol. 1981;202:211–219. doi: 10.1002/cne.902020207. [DOI] [PubMed] [Google Scholar]
  54. Sakata JT, Brainard MS. Online contributions of auditory feedback to neural activity in avian song control circuitry. J Neurosci. 2008;28:11378–11390. doi: 10.1523/JNEUROSCI.3254-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Schoffelen JM, Oostenveld R, Fries P. Neuronal coherence as a mechanism of effective corticospinal interaction. Science. 2005;308:111–113. doi: 10.1126/science.1107027. [DOI] [PubMed] [Google Scholar]
  56. Shank SS, Margoliash D. Sleep and sensorimotor integration during early vocal learning in a songbird. Nature. 2009;458:73–77. doi: 10.1038/nature07615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Siegel M, Konig P. A functional gamma-band defined by stimulus-dependent synchronization in area 18 of awake behaving cats. J Neurosci. 2003;23:4251–4260. doi: 10.1523/JNEUROSCI.23-10-04251.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Siegel M, Donner TH, Oostenveld R, Fries P, Engel AK. Neuronal synchronization along the dorsal visual pathway reflects the focus of spatial attention. Neuron. 2008;60:709–719. doi: 10.1016/j.neuron.2008.09.010. [DOI] [PubMed] [Google Scholar]
  59. Simpson HB, Vicario DS. Brain pathways for learned and unlearned vocalizations differ in zebra finches. J Neurosci. 1990;10:1541–1556. doi: 10.1523/JNEUROSCI.10-05-01541.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Spiro JE, Dalva MB, Mooney R. Long-range inhibition within the zebra finch song nucleus RA can coordinate the firing of multiple projection neurons. J Neurophysiol. 1999;81:3007–3020. doi: 10.1152/jn.1999.81.6.3007. [DOI] [PubMed] [Google Scholar]
  61. Svensen M, Bishop CM. Robust Bayesian mixture modeling. Neurocomputing. 2005;64:235–252. [Google Scholar]
  62. Thomson DJ. Spectrum estimation and harmonic analysis. Proceedings of the IEEE. 1982;70:1055–1096. [Google Scholar]
  63. Uhlhaas PJ, Pipa G, Lima B, Melloni L, Neuenschwander S, Nikolic D, Singer W. Neural synchrony in cortical networks: history, concept and current status. Front Integr Neurosci. 2009;3:17. doi: 10.3389/neuro.07.017.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Vates GE, Broome BM, Mello CV, Nottebohm F. Auditory pathways of caudal telencephalon and their relation to the song system of adult male zebra finches. J Comp Neurol. 1996;366:613–642. doi: 10.1002/(SICI)1096-9861(19960318)366:4<613::AID-CNE5>3.0.CO;2-7. [DOI] [PubMed] [Google Scholar]
  65. Wang Y, Brzozowska-Prechtl A, Karten HJ. Laminar and columnar auditory cortex in avian brain. Proc Natl Acad Sci U S A. 2010;107:12676–12681. doi: 10.1073/pnas.1006645107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Wild JM, Williams MN, Suthers RA. Parvalbumin-positive projection neurons characterise the vocal premotor pathway in male, but not female, zebra finches. Brain Res. 2001;917:235–252. doi: 10.1016/s0006-8993(01)02938-9. [DOI] [PubMed] [Google Scholar]
  67. Wild JM, Williams MN, Howie GJ, Mooney R. Calcium-binding proteins define interneurons in HVC of the zebra finch (Taeniopygia guttata) J Comp Neurol. 2005;483:76–90. doi: 10.1002/cne.20403. [DOI] [PubMed] [Google Scholar]
  68. Williams H, Vicario DS. Temporal patterning of song production: participation of nucleus uvaeformis of the thalamus. J Neurobiol. 1993;24:903–912. doi: 10.1002/neu.480240704. [DOI] [PubMed] [Google Scholar]
  69. Woolley SM, Rubel EW. Bengalese finches Lonchura Striata domestica depend upon auditory feedback for the maintenance of adult song. J Neurosci. 1997;17:6380–6390. doi: 10.1523/JNEUROSCI.17-16-06380.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Yu AC, Margoliash D. Temporal hierarchical control of singing in birds. Science. 1996;273:1871–1875. doi: 10.1126/science.273.5283.1871. [DOI] [PubMed] [Google Scholar]
  71. Zann RA. The zebra finch : a synthesis of field and laboratory studies. Oxford ; New York: Oxford University Press; 1996. [Google Scholar]

RESOURCES