SUMMARY
The zebra finch brain features a set of clearly defined and hierarchically arranged motor nuclei that are selectively responsible for producing singing behavior. One of these regions, a critical forebrain structure called HVC, contains premotor neurons that are active at precise timepoints during song production. However, the neural representation of this behavior at a population level remains elusive. We used 2-photon microscopy to monitor ensemble activity during singing, integrating across multiple trials by adopting a Bayesian inference approach to more precisely estimate burst timing. Additionally, we examined spiking and motor-related synaptic inputs using intracellular recordings during singing. With both experimental approaches, we find that premotor events do not preferentially occur at the onsets or offsets of song syllables or at specific subsyllabic motor landmarks. These results strongly support the notion that HVC projection neurons collectively exhibit a temporal sequence during singing that is uncoupled from ongoing movements.
INTRODUCTION
Song production in the zebra finch provides an excellent opportunity to examine the processes that shape the representation of a single complex learned behavior as it progresses from dedicated higher order centers through downstream targets, ultimately leading to the flexion of muscles needed to produce the song (Ashmore et al., 2005; Leonardo and Fee, 2005 Although the motor pathway in the songbird is composed of well-identified and spatially segregated neural circuits (Nottebohm et al., 1976), the means by which this singing behavior is represented in these regions is still a matter of debate (Troyer, 2013). Debate One centers on a single forebrain area (called HVC), which plays a central role in song production (Long and Fee, 2008; Vu et al., 1994). In one view, HVC projection neurons reflect motor-related aspects of the ongoing song (Amador et al., 2013; Boari et al., 2015), similar to the coding scheme observed in the mammalian primary motor cortex (M1) (Churchland et al., 2012; Evarts, 1968; Georgopoulos et al., 1986; Todorov, 2000). In the other view, these neurons may represent relative time within the motor act (Fee et al., 2004; Hahnloser et al., 2002; Kozhevnikov and Fee, 2007; Long et al., 2010) without regard to kinematics of the vocal apparatus Similar abstract motor representations have been shown to exist in higher order cortical sites, such as the supplementary motor area (Matsuzaka et al., 2007; Mita et al., 2009; Nachev et al., 2008; Tanji and Shima, 1994 An intermediate hypothesis, in which HVC may represent both movement and elapsed time, may also be valid and has been suggested to exist in M1 (Carpenter et al., 1999; Lu and Ashe, 2005, 2015; Matsuzaka et al., 2007). Distinguishing between these mechanisms of motor control would be a significant step forward in our understanding of how singing behavior is encoded within HVC, potentially extending to skilled behaviors in other forebrain circuits.
Technical challenges using traditional electrophysiological approaches in singing birds have prevented a clear characterization of the HVC premotor network (Amador et al., 2013; Day et al., 2013; Hahnloser et al., 2002; Kozhevnikov and Fee, 2007; Long et al., 2010; Markowitz et al., 2015). Previously, song-related neural responses were collected one at a time over the span of many weeks and then aligned to singing behavior (Hahnloser et al., 2002; Kozhevnikov and Fee, 2007; Long et al., 2010; Okubo et al., 2015). This process is inefficient, and coding of the behavior at an ensemble level could have shifted considerably during that time (Huber et al., 2012; Okubo et al., 2015; Peters et al., 2014), potentially leading to misinterpretations that would be avoided using a method that could allow for a ‘snapshot’ of large neural populations in a single recording session. In the rodent, in vivo 2-photon imaging has enabled the study of large populations of motor cortical neurons during the performance of tractable behaviors (Huber et al., 2012; Li et al., 2015; Peters et al., 2014).
For these experiments, we developed a head-fixed preparation that enabled us to image populations of HVC projection neurons during singing. We then applied a new computational algorithm that enabled high precision estimates for event timing by integrating across song trials. Additionally, we analyzed a series of intracellular recordings in which spikes and excitatory postsynaptic events were used to reflect motor-related network activity within HVC. First, we demonstrate that the rate of HVC premotor bursts during silent gaps in the song does not differ relative to epochs of active singing. Additionally, we found that HVC activity occurs continuously within the context of a syllable, rather than concurrent with identified motor components of the song. Our results show that behaviorally relevant temporal sequences within HVC of the zebra finch are uncoupled from the properties of each constitutive movement, akin to higher order cortical areas in primates (e.g., Tanji and Shima, 1994).
RESULTS
Two-photon calcium imaging in the head-fixed singing bird
We wanted to test the proposed hypotheses of song representation by examining the neuronal activity at a population level with 2-photon microscopy (Denk et al., 1990). Recent experiments have used 2-photon imaging of HVC activity in the anesthetized zebra finch to measure sensory responses within that structure (Graber et al., 2013; Peh et al., 2015). In order to adapt this approach to address motor dynamics, we developed a head-fixed singing bird preparation. Since we were not able to elicit head-fixed song in initial attempts (n=6 birds), we used operant conditioning to encourage this behavior. Male zebra finches were separated from females by a glass partition whose transparency was under experimental control, and they were given visual access to females at regular intervals (Figure 1A). Males were rewarded for singing during those epochs (Figure 1B, Figures S1A–S1D) and then transitioned to a head-fixed context once song could be reliably evoked (Figure 1C, Figures S1E, S1F and S1I). The vast majority of these birds (71 out of 78) produced at least one head-fixed song motif (Table 1). Whereas head-fixation can alter behavior and the associated neural activity for certain tasks (Ravassard et al., 2013), we find that the quality (p=0.27, F=1.47, one-way ANOVA) and timing (p=0.95, paired t-test) of singing behavior for five individual birds were not significantly different compared with song produced during free movement (Figures S1J–S1L).
Figure 1. Imaging neural activity in HVC of a head-fixed zebra finch.
(A) A schematic of the arena used to train head-fixed zebra finches. In our training paradigm, we used polarized glass to provide visual access to a female.
(B) An example spectrogram (frequency: 0.5–8 kHz) showing singing behavior elicited during a trial.
(C) A timeline displaying the progression of training conditions. The percentage of trials with song for each day is plotted for two birds.
(D) GCaMP6s-expressing neurons within HVC.
(E) Examples of fluorescence measurements in the neurons circled in (D). Individual song motifs are highlighted in gray.
Table 1. Zebra finch training.
Performance of zebra finches on each phase of the training program. In the last two conditions, only a subpopulation of head-fixed birds were used for this study, and further training was carried out under the 2-photon microscope (2P).
Freely Moving | Head-Fixed (no 2P) | Head-Fixed 2P | Head-Fixed 2P & Imaging | |
---|---|---|---|---|
# Birds Trained | 109 | 78 | 17 | 6 |
# Sessions | 540 | 2,474 | 187 | 85 |
# Trials | 18,500 | 61,750 | 2,834 | 1,405 |
Trials with Song | 7,463 | 11,811 | 1,509 | 756 |
# Song Motifs | 37,112 | 66,025 | 6,800 | 2,868 |
Motifs per Trial | 2.0 | 1.1 | 2.4 | 2.0 |
A subset of the head-fixed singing birds was acclimated to the 2-photon microscope (Figures S1G and S1H, Supplementary Movie). In those individuals, we virally expressed a genetically encoded calcium indicator (GCaMP6s or GCaMP6f) (Chen et al., 2013) to report the activity of HVC neurons during singing. To calibrate the calcium indicator in vivo, we performed simultaneous juxtacellular recordings and calcium imaging (Figure S2A) of bursting activity in an anesthetized and pharmacologically disinhibited zebra finch (Figures S2B–S2E). The latency from the first spike of the burst to the onset of the calcium transient was 5.9 ± 3.7 ms (n = 3 neurons from 2 birds). The standard deviation of onset timing within individual neurons was 3.5 ± 1.3 ms (e.g. Figure S2E), which supported the notion that GCaMP6 could reliably report neural activity with high temporal precision. During singing, the majority of HVC neurons exhibited one (Cells #1, #4, and #5) or a few (Cells #2 and #3) distinct calcium transients (Figures 1D and 1E), consistent with previous measurements taken with a head-mounted CMOS camera (Markowitz et al., 2015). For this study, we restrict our analysis to these sparsely active neurons, which are likely to represent HVC premotor [HVC(RA)] or basal ganglia-projecting [HVC(X)] cells (Hahnloser et al., 2002; Kozhevnikov and Fee, 2007; Long et al., 2010), as opposed to other observed neurons that showed sustained fluorescence during singing and probably represent inhibitory interneurons (Hahnloser et al., 2002; Kosche et al., 2015).
Increasing temporal resolution by integrating data across behavioral renditions
We next asked whether our approach could provide sufficient temporal resolution to test hypotheses concerning the representation of singing behavior within the HVC circuit. High temporal resolution is critical for detecting the precise onset of (~10 ms) bursting responses within HVC (Hahnloser et al., 2002; Long et al., 2010). However, the onset time of putative burst events was difficult to precisely estimate on individual trials (e.g., Figures 2A, 2C and 2E). To address this problem, we took advantage of the stereotyped timing inherent in both the zebra finch song as well as the underlying HVC bursting activity (Hahnloser et al., 2002; Kozhevnikov and Fee, 2007). For a given focal plane, we imaged during several renditions of the song (Figure 2B) and aligned calcium traces from multiple trials to the singing behavior (Figure 2D). Using a Bayesian method (Figures S2D and S2F–S2I), we found that combining fluorescence measurements across trials could greatly sharpen the temporal resolution of our estimates for burst onsets (Figures 2C–2G, see Experimental Procedures for details). For instance, in an example cell that had been imaged for 23 song motifs, the burst events could be detected with significantly more precision when considering all trials (mean σonset = 3.0 ms, Figure 2F) compared with estimates taken from single trials (mean σonset = 16.3 ms, Figure 2E). Across our dataset, the uncertainty of our estimate for burst onsets decreased as a function of the number of motifs analyzed (Figure 2G) and as a function of the signal-to-noise ratio of measurements from individual neurons (Figure 2H).
Figure 2. Integrating imaging data across trials.
(A,B) Spectrograms with vertical lines representing the scan times for a single motif in (A) and across 23 motifs in (B).
(C,D) Fluorescence transients for one neuron (image inset). We show the results from a single trial (29 points) in (C) as well as measurements taken across all trials (667 points) in (D).
(E,F) Histograms represent an estimate of the posterior probability of burst onsets in the above panels. Standard deviations (in milliseconds) provided for each peak (horizontal bar =mean ±1 SD).
(G,H) The standard deviations of onset estimates varied as a function of the number of trials (n=376 bursts) in (G) and the signal-to-noise ratio of the fluorescence measurements (n= 361 bursts) in (H) for single neurons. The gray shaded region indicates values less than 10 ms.
Using calcium imaging to investigate motor representation at a population level
Once the timing of HVC bursts was determined, we examined the relationship between these events and singing behavior. We asked whether bursts preferentially occurred during periods of active singing. Zebra finches produce a repeated series of vocal elements, known as ‘syllables’ that are approximately 75–300 ms in length and are separated by shorter (15–100 ms) silent ‘gaps’ during inspiration (Hartley and Suthers, 1989) (Figures 3A and 3B). Although syringeal muscles can be activated during both syllables and gaps, electromyogram recordings during song production show dynamic patterns of muscle activation that correlate with the rapidly changing acoustic features of song (Goller and Cooper, 2004; Riede and Goller, 2010; Suthers et al., 1994; Vicario, 1991a). As a result, the kinematic hypothesis would predict that HVC projection neurons would be significantly more active during syllables than silent gaps, while the temporal sequence hypothesis would not anticipate a difference between these conditions. In total, we considered 294 burst events (250 cells) from five birds producing a total of 45 syllables (Figures 3 and S3). We found no relationship between the relative frequency of HVC projection neuron bursts and the presence of a syllable (74.0 bursts/sec) or a gap (76.7 bursts/sec) (p > 0.05, paired t-test), supporting the notion that bursts are forming a continuous sequential representation throughout the song motif. Additionally, there was no temporal alignment of burst events to the onsets (Figure 3C) or offsets (Figure 3D) of syllables.
Figure 3. Timing of HVC network activity during singing.
(A) The spectrogram (top) and waveform (bottom) of a song motif with 58 GTE overlaid (green lines).
(B) Inferred onset times of 103 burst events from 90 HVC projection cells aligned to the song motif. Horizontal bars represent burst onset times (±1 SD). Below, we show burst onsets (black), silent gaps (light blue), and GTE times (green).
(C–E) Normalized cross-correlation (see Experimental Procedures) between burst onset times and syllable onsets in (C), syllable offsets in (D), and subsyllabic timepoints in (E). Negative offsets mean that the burst precedes the motor event. Dashed lines = ±3 SD from null model (see Experimental Procedures).
(F) The normalized cross-correlation for simulated data based on syllable onsets, offsets, and subsyllabic timepoints.
(G) Probabilities of burst times in two alternative models (uniform=black, GTE=green).
(H) The normalized log-likelihood ratio of the predictive model as a function of the relative contributions from the uniform and GTE models, with positive values for R favoring the uniform model. The R value for Bird #102 is 23.6 and for the population of five birds is 84.2, which provides strong support for the uniform model over the GTE model. Note that the value for ‘c’ is plotted on the x-axis (see Experimental Procedures).
(I) The normalized log-likelihood for simulated data. The large negative values for R (R102 = − 45.6 and RAll = −94.7) demonstrate that our positive control simulation strongly supports the GTE model.
We next wanted to examine the fine structure of syllables in order to test whether HVC projection neuron activity is best explained by the presence of specific motor movements or by a continuous temporal sequence. Because a direct assessment of muscular activity was not feasible in our preparation, we turned to a previously established tool for estimating vocal muscle gestures from recorded songs (Amador et al., 2013; Boari et al., 2015) (Figure S4). These events, called gesture-trajectory extrema (GTE) are associated with changes in syringeal tension or subsyringeal pressure (Amador et al., 2013) and can be estimated by analyzing song structure (Boari et al., 2015). We first examined the relationship between GTE and burst onsets (e.g., Figure 3B) and found no significant correlation between these events at any time lag in both individual birds and across the population (Figure 3E). To validate our method, we performed a simulation where burst events were placed at syllabic and subsyllabic timepoints and adjusted for associated experimental uncertainties as well as the inherent temporal variance previously suggested to exist between these events (Amador et al., 2013). In these simulated positive controls, we were able to detect a significant correlation between motor events and bursting activity (Figure 3F).
Hypothesis testing of network representation
In order to formally test hypotheses concerning motor representations within HVC, we defined the predictions of two models: a movement model based on GTE events and a uniform distribution model, in which bursts are evenly spread throughout the song (Figure 3G). When we smoothly interpolated between these two models with a likelihood-ratio test in order to consider a range of mixed coding schemes (see Experimental Procedures for details), we found that our data strongly favor a uniform distribution (Figure 3H), supporting the notion of a song-related temporal sequence within HVC. As before, we confirmed the efficacy of this method by finding a strong preference for the GTE model in our simulated positive control (Figure 3I). One limitation of the likelihood-ratio approach is that it tests the GTE model with no temporal offset, which was a choice that was originally motivated by the previous claim that GTE and bursts co-occur (Amador et al., 2013). Although our decision to focus on a zero latency relationship is consistent with our observation of a lack of strong correlations at a variety of time offsets (e.g., Figure 3E), we repeated the log likelihood analysis at a range of possible offsets (−30 to 30 ms). The repeated testing at various negative time offsets can help to counteract any systematic inaccuracy in our estimate of the lag between the true burst time and the calcium onset resulting from our calibration conditions (Figure 4A). The positive time offsets enable us to consider a ‘causal’ scenario in which the accepted premotor delays (~15–20 ms) existing between HVC activity and song (Kozhevnikov and Fee, 2007) are incorporated into the analysis (Figure 4B). Across these various tests, we continued to find strong support for a uniform distribution compared with a subsyllabic kinematic model.
Figure 4. Testing the temporal relationship between neural activity and movement.
(A,B) Normalized log-likelihood of data acquired across all five imaged birds as a function of the relative strength of the GTE model presented with a series of additional time offsets in which burst times are either delayed in (A) or advanced in (B) relative to gestural timepoints. The values for RAll across all delays range from 77.4 to 118.1.
We considered a number of potential sources of error that may have biased our findings. First, our ability to precisely estimate burst onset times for individual neurons was variable (Figure 2G and 2H). Although our analyses take this variability into account (see Experimental Procedures), it is possible that those neurons with the highest amount of uncertainty were obscuring an existing relationship between bursts and GTE. To address this concern, we restricted our analysis to only those bursts with an onset uncertainty less than 10 ms (186 out of 294 events) and again saw no significant relationship between GTE and HVC burst timing (Figures S5A–S5D). Second, zebra finches may generate some muscle movements during silent gaps, such as ‘mini-breaths’ (Hartley and Suthers, 1989), whose relationship with HVC activity is unclear (Andalman et al., 2011). Because we are unable to reconstruct gestures outside of phonation, we repeated the analysis after removing bursts that occurred during gaps and found no significant change in the results (Figure S5E and S5F).
Using intracellular recording to investigate motor representation at a population level
To provide an additional test of our hypotheses, we next used intracellular recordings to examine the activity of song-related populations of cells within HVC. We first considered the onset of burst responses of individual projection neurons (Bird #33, n = 12 cells, 17 bursts, Figures 5A and 5B) in order to provide an additional validation of our analytical approach in a condition with traditionally less temporal uncertainty than calcium imaging. In our analysis of 28 bursting events in 21 cells from 3 birds, we did not find any evidence of a relationship between burst times and GTE (Figure 5C), despite simulations indicating that we would be able to find a relationship if one existed (Figure 5D).
Figure 5. Timing of HVC bursts during singing.
(A) The spectrogram (top) and waveform (bottom) of a song motif with 45 GTE overlaid (green lines).
(B) Onset times of 17 burst events measured electrophysiologically from 12 HVC projection cells aligned to the song motif. Horizontal bars, depicted with a standard width of 10 ms, are centered on burst onset times. Below, we show burst onsets (blue) and GTE times (green).
(C) The normalized log-likelihood of the acquired data as a function of the relative strength of the GTE model (R33 = 32 and RAll = 49.7).
(D) The normalized log-likelihood for simulated data (R33 = −9.0 and RAll = −17.7).
We next turned our attention to the subthreshold membrane potential of these HVC neurons. We have recently demonstrated that the series of stereotyped synaptic events onto identified HVC projection neurons is driven by motor-related inputs (Vallentin and Long, 2015). Previous paired recordings in slices of HVC have shown extensive interconnectivity within that nucleus, including links between excitatory neurons (Kosche et al., 2015; Mooney and Prather, 2005). As a result, we reasoned that identified excitatory events onto individual HVC projection neurons (Figure 6B, see example traces) could be used to simultaneously monitor the activity of multiple premotor neurons. We estimated onset times for the most consistently detected postsynaptic potentials (Figures 6A–6C, see Experimental Procedures). We then repeated this analysis for all 15 HVC projection neurons within one bird (n = 187 events, Figure 6D) and across a group of three birds (n = 333 events from 26 cells, Figure S6A–S6C). Additionally, there was no connection between the incidence of synaptic onset times and the presence of either a syllable (165.7 events/sec) or a gap (164.1 events/sec) (p > 0.05, paired t-test). When considering syllabic and subsyllabic structure, we also found no significant relationship between synaptic events and behavior (Figures 6E–6K, Figure S6D and S6E), including separate analyses where gaps were not included (Figures S6F and S6G) and a range of delays were introduced (Figure S6H). The possibility existed that GTE-related events may be observed in one cell class but not the other (n = 19 HVC(RA) and 7 HVC(X), Figure 6D), but we found no additional relationship when considering each cell type separately (Figure S7). Because of potential errors in our ability to detect synaptic events, we also analyzed the average membrane potential at syllable onsets and offsets as well as GTE times, and we did not find any significant deviation from zero across all neurons tested (n = 33 cells).
Figure 6. Timing of HVC synaptic inputs during singing.
(A,B) A sonogram and GTE timepoints in (A) for the bird shown in Figure 5, song-related intracellular recordings from an HVC(RA) neuron aligned to the song motif in (B). Traces below have been truncated (oblique lines). Vertical gray lines indicate identified synaptic events.
(C) All EPSPs detected across 13 song motifs. Consistently detected events (see Experimental Procedures) are indicated with red lines (mean values below).
(D) The consensus synaptic event times for 11 HVC(RA) and 4 HVC(X) neurons (red and blue respectively) recorded in the same bird. Below, we show EPSP onsets (red or blue, as above), silent gaps (light blue), and GTE times (green).
(E–G) Normalized cross-correlation (see Experimental Procedures) between burst onset times and syllable onsets in (E), syllable offsets in (F), and subsyllabic timepoints in (G). Dashed lines = ±3 SD from null model.
(H) Normalized cross-correlation for simulated data based on syllable onsets, offsets, and subsyllabic timepoints.
(I) Probabilities of burst times in two alternative models (uniform=black, GTE=green).
(J) The normalized log-likelihood of the acquired data as a function of the relative strength of the GTE model (R33 = 240.9 and RAll = 493.9).
(K) The normalized log-likelihood for simulated data (R33 = −97.8 and RAll = −157.7).
DISCUSSION
In our investigation, we focused on the forebrain nucleus HVC of the zebra finch, as these neurons represent a well-defined skilled behavior using a sparse and reliable code. We developed a method for investigating the neural mechanisms underlying song production in a head-fixed context, which represents a rare example of a restrained social behavior (Lenschow and Brecht, 2015; Oomura et al., 1983). Using this approach, we provided population-level support for a notion that HVC projection neurons exhibit an abstract representation of elapsed time, suggesting a scheme for encoding behavior that is divorced from the kinematics of ongoing movements (Lu and Ashe, 2015; Matsuzaka et al., 2007; Tanji and Shima, 1994).
Our first step was to determine whether the population data that we collected from the imaging (n = 5 birds) and electrophysiology (n = 3 birds) experiments were temporally modulated. Despite observing a large number of motor timepoints in each bird, obvious gaps in activity were still evident (e.g., Figures S3A and S6C). These nonuniformities may arise from a number of sources. For instance, because our recordings were spatially restricted within HVC, a sampling bias may exist if burst events from nearby cells exhibit similar timing (Graber et al., 2013; Markowitz et al., 2015; Peh et al., 2015). We next tested whether any nonuniformities could be explained by the coincident activity of neurons with motor movements. We greatly expanded on previous efforts to address this issue (Amador et al., 2013; Kozhevnikov and Fee, 2007) by quantitatively examining a number of possible relationships between HVC neuronal activity and singing. First, we find that HVC projection neurons are not preferentially active during syllables compared with silent gaps, despite the clear increase in the number and complexity of motor commands during these times (Goller and Cooper, 2004; Riede and Goller, 2010; Suthers et al., 1994; Vicario, 1991a). Second, we added to this analysis by further showing that no structured neural activity existed when considering syllable onsets or offsets separately. Third, we tested whether the timing of HVC projection neuron activity is modulated by subsyllabic motor gestures, as measured by GTE (Amador et al., 2013; Boari et al., 2015).
To analyze the relationship between GTE and neural activity, we used a quantitative approach to demonstrate that the data are better described by a simple uniform model than by a gestural hypothesis. Moreover, this analysis allowed us to test for the possibility of a mixed coding scheme, in which a population of neurons represents both kinematics and some other feature independent of the ongoing movement. By examining these intermediate models in our likelihood analysis, we do not see support for the idea of partial coding of GTEs by the population. It should be stated that the poor fit of our data with the GTE model does not necessarily mean that the underlying event times are uniformly distributed, but rather that any nonuniformities cannot be explained by aspects of motor behavior tested as part of this study. The method for identifying these GTE timepoints was derived from models of subsyringeal pressure and syringeal muscle activity (Goller and Cooper, 2004; Mendez et al., 2010; Perl et al., 2011). However, future work should directly measure these factors in conjunction with population activity in order to further confirm our findings.
An important future direction is to understand the processes that enable a transformation between the abstract temporal sequence within HVC and the production of singing behavior. A candidate site that is likely to be central to this conversion is the robust nucleus of the archopallium (RA), which sits between HVC and the motorneurons that coordinate syringeal and respiratory activity (Vicario, 1991b). Each RA neuron receives synaptic input from multiple HVC neurons (Fee et al., 2004; Garst-Orozco et al., 2014), and spiking within this structure covaries with certain song features (Sober et al., 2008). Despite a wealth of single-unit recordings in this region during singing (Leonardo and Fee, 2005; Yu and Margoliash, 1996), an analysis linking RA spiking activity to GTE has not yet been performed.
Sequences similar to those described in this study have been shown to exist across a variety of brain regions, such as the hippocampus (Pastalkova et al., 2008), parietal cortex (Harvey et al., 2012), and basal ganglia (Mello et al., 2015). Within HVC, computational models (Fiete et al., 2010; Jun and Jin, 2007) and recent experimental findings (Okubo et al., 2015) suggest that this kind of population sequence may arise as part of a developmental process. Previous electrophysiological results have shown that the circuitry within HVC (Kosche et al., 2015; Mooney and Prather, 2005; Solis and Perkel, 2005) is likely to play an important role in generating these sequences (Long and Fee, 2008; Vu et al., 1994). Specifically, strong excitatory collaterals within HVC appear to underlie the propagation of network activity through a feed-forward synaptic chain (Abeles, 1991; Long et al., 2010). A combination of our high temporal resolution imaging technique with emerging anatomical approaches for circuit reconstruction (Briggman et al., 2011) or electrophysiological methods (Ko et al., 2011) could help to identify mechanisms underlying sequence generation within HVC.
EXPERIMENTAL PROCEDURES
Animals
We used adult (>90 days post hatch) male and female zebra finches that were obtained from an outside breeder and maintained in a temperature-and humidity-controlled environment with a 12 hour:12 hour light:dark schedule. All animal maintenance and experimental procedures were performed according to the guidelines established by the Institutional Animal Care and Use Committee at the New York University Langone Medical Center.
Song detection and reward
We recorded singing behavior with an omni-directional lavalier condenser microphone (AT803, Audio-Technica) and amplified the signal with a solid-state preamplifier (Ultragain Pro MIC2200, Behringer). Song was detected using a digital signal processor (RX8, Tucker-Davis Technologies) and custom software written using the RPvdsEx interface (Tucker-Davis Technologies) and MATLAB. Because the zebra finch produces a song containing short gaps between syllables and motifs, singing behavior was defined as time periods in which the ratio of high frequency power to low frequency power (0–1 kHz and 1–7 kHz respectively) was greater than 3 for more than 50% of a one second sliding window. Singing behavior was only evaluated during 20 second trial periods when the female zebra finch was visible (e.g. Figure 1A). Liquid rewards were administered using a gravity-fed line though a solenoid (NResearch) that was opened for 200 ms in order to dispense approximately 20 μL of water. To evaluate the detection algorithm, we manually noted false negatives and false positives, enabling us to calculate the sensitivity [True Positives/(True Positives + False Negatives)] and positive predictive value [True Positives/(True Positives + False Positives)].
Surgical procedures
All surgical procedures were performed under isoflurane anesthesia (1–3% in oxygen) following established guidelines. Prior to the onset of training, a small (3.9 mm by 4.55 mm, 1.3 mm) stainless steel headplate with two threaded holes was affixed with dental acrylic over the inner leaflet of the skull with the posterior edge approximately 5 mm rostral to the bifurcation of the sagittal sinus. Cranial window surgery and viral injection were performed once the zebra finch demonstrated the ability to sing reliably in the head-fixed condition (see Figure 1 and S1). HVC was identified electrophysiolgically during surgery using a bipolar stimulating electrode placed into RA (Long et al., 2010). In most birds, we injected AAV9.Syn.GCaMP6s.WPRE.SV40 (Penn Vector Core) into HVC using a beveled pipette (opening diameter: 30 μm, length of bevel: 100μm). In two birds, we injected a 1:1 mix of AAV9.CamKII0.4.Cre.SV40 and either AAV9.CAG.Flex.GCaMP6f.WPRE.SV40 (Bird #192) or AAV9.CAG.Flex.GCaMP6s.WPRE.SV40 (Bird #193). We performed 3–6 injections (30–100 nL per site) separated by approximately 300 μm using an oil-based pressure injection system (Nanoject II, Drummond Scientific). At the end of the injection procedure, we sealed a 3 mm diameter circular cover glass (#1 thickness, Warner Instrument) with Kwik-Sil adhesive (WPI) and fixed the edge of the glass with cyanoacrylate. We also cemented a black plastic ring (inner diameter: 5 mm, outer diameter: 7.5 mm) around the cranial window to prevent light contamination during imaging.
Two-photon imaging
We used a customized Movable Objective Microscope (Sutter instrument Company) to scan our field of view (frame rate: 28.8 Hz unless otherwise noted) with a resonant system (Thorlabs) and ScanImage 4.2 software (Pologruto et al., 2003). All imaging was done using a 16× water immersion objective (Nikon) with a numerical aperture of 0.8 and a working distance of 3 mm. In order to protect the microscope from external light contamination, we wrapped it with a light-attenuating material. Additionally, we fit a black balloon to the tip of the objective on one end and the black ring surrounding the optical window on the other (Dombeck et al., 2010). The excitation source was a mode-locked Ti:sapphire laser (Chameleon, Coherent) tuned at 920nm and controlled by a Pockels cell (Conoptics 302RM).Fluorescent light was detected using a GaAsP photomultiplier tube (H10770PA-40 PMT Module, Hamamatsu) and a wide detection path (2″ collection lens).
Two-photon targeted electrophysiological recordings
In vivo electrophysiological recordings of HVC neurons expressing GCaMP6s were performed in isoflurane-anesthetized zebra finches. For these birds, we did not affix the cranial window to the brain with Kwik-Sil. Rather, the glass coverslip was sealed at the edges with cyanoacrylate and a small opening (~300 μm diameter) was produced near the center of the glass using a carbide burr. Borosilicate glass pipettes were fabricated on a horizontal puller (Sutter Instrument Company) with impedance values in the range of 4–5 MΩ and filled with an solution of K-gluconate (150 mM) and 5 μM of Red Alexa 594 (Molecular Probes, Invitrogen) to visualize the pipette under the microscope. Because HVC projection neurons produce only infrequent bursts outside of the context of singing (Hahnloser et al., 2002; Long et al., 2010), we applied 0.1 mM of the GABAA receptor antagonist gabazine (Sigma) to the surface of the craniotomy in order to increase the rate of spontaneous bursting activity (Mooney and Prather, 2005). Signals were recorded using a Neurodata IR183A single channel amplifier (Cygnus Technology) and custom MATLAB acquisition software. Data were low-pass filtered at 5 kHz and digitized with a National Instruments digital-to-analog converter (acquisition rate: 40 kHz). For these experiments, the soma was scanned at 52 Hz, and these data were aligned to spiking activity.
Intracellular recordings
Intracellular recordings during singing were carried out as described previously (Vallentin and Long, 2015). Briefly, a motorized intracellular microdrive was installed on the head of the zebra finch. For antidromic identification of projection neurons, we implanted a bipolar stimulating electrode into RA and/or area ×. Sharp electrodes with an impedance of 70–130 MΩ were backfilled with 3 M of potassium acetate and inserted into HVC. Acceptable recordings were defined as having a spike height greater than 40 mV, a resting membrane potential more hyperpolarized than −50 mV, and a total recording duration greater than 3 minutes. Once stable recordings were achieved, a female bird zebra finch was presented to elicit directed singing. Neurons with fewer than two song motifs were excluded from our analysis.
Supplementary Material
Acknowledgments
This research was supported by the NIH (R01NS075044), the New York Stem Cell Foundation, the Rita Allen Foundation, the Esther A & Joseph Klingenstein Foundation, Simons Foundation (Global Brain Initiative), DARPA N66001-15-C-4032 (SIMPLEX), and EMBO (ALTF 1608–2013). We thank Aimee Chow, Celine Cammarata, and Brandon Robinson for their technical assistance and their help with further developing our training protocol. We thank Florin Albeanu and Simon Peron for their assistance with 2-photon imaging and Jeff Gauthier and Arnaud Malvache for their help with analysis. We thank Florin Albeanu, Dmitriy Aronov, Brenton Cooper, Kishore Kuchibhotla, and John Long for comments on earlier versions of this manuscript. We thank Gabriel Mindlin and Marcos Trevisan for their assistance in identifying GTE timepoints. We acknowledge the GENIE Program and the Janelia Farm Research Campus, specifically Vivek Jayaraman, Ph.D., Rex A. Kerr, Ph.D., Douglas S. Kim, Ph.D., Loren L. Looger, Ph.D., Karel Svoboda, Ph.D. from the GENIE Project, Janelia Farm Research Campus, Howard Hughes Medical Institute.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
AUTHOR CONTRIBUTIONS
M.P., D.O., and M.L. designed the experiments. M.L., M.P., J.M., L.P., and K.K. wrote the paper. M.P., D.V., S.B., R.C., and K.K. collected the data. M.P., E.P., J.M., and K.K. analyzed the data. J.M., E.P., L.P., and K.K. created relevant computational tools for analysis.
AUTHOR INFORMATION
The authors declare no competing financial interests.
Additional experimental procedures related to analytical methods are included in supplementary information.
References
- Abeles M. Corticonics: Neural Circuits of the Cerebral Cortex. Cambridge University Press; 1991. [Google Scholar]
- Amador A, Perl YS, Mindlin GB, Margoliash D. Elemental gesture dynamics are encoded by song premotor cortical neurons. Nature. 2013;495:59–64. doi: 10.1038/nature11967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Andalman AS, Foerster JN, Fee MS. Control of vocal and respiratory patterns in birdsong: dissection of forebrain and brainstem mechanisms using temperature. PLoS One. 2011;6:e25461. doi: 10.1371/journal.pone.0025461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ashmore RC, Wild JM, Schmidt MF. Brainstem and forebrain contributions to the generation of learned motor behaviors for song. J Neurosci. 2005;25:8543–8554. doi: 10.1523/JNEUROSCI.1668-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boari S, Sanz Perl Y, Amador A, Margoliash D, Mindlin GB. Automatic reconstruction of physiological gestures used in a model of birdsong production. J Neurophysiol. 2015;jn 00385:02015. doi: 10.1152/jn.00385.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Briggman KL, Helmstaedter M, Denk W. Wiring specificity in the direction-selectivity circuit of the retina. Nature. 2011;471:183–188. doi: 10.1038/nature09818. [DOI] [PubMed] [Google Scholar]
- Carpenter AF, Georgopoulos AP, Pellizzer G. Motor cortical encoding of serial order in a context-recall task. Science. 1999;283:1752–1757. doi: 10.1126/science.283.5408.1752. [DOI] [PubMed] [Google Scholar]
- Chen TW, Wardill TJ, Sun Y, Pulver SR, Renninger SL, Baohan A, Schreiter ER, Kerr RA, Orger MB, Jayaraman V, et al. Ultrasensitive fluorescent proteins for imaging neuronal activity. Nature. 2013;499:295–300. doi: 10.1038/nature12354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Churchland MM, Cunningham JP, Kaufman MT, Foster JD, Nuyujukian P, Ryu SI, Shenoy KV. Neural population dynamics during reaching. Nature. 2012;487:51–56. doi: 10.1038/nature11129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Day NF, Terleski KL, Nykamp DQ, Nick TA. Directed functional connectivity matures with motor learning in a cortical pattern generator. J Neurophysiol. 2013;109:913–923. doi: 10.1152/jn.00937.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Evarts EV. Relation of pyramidal tract activity to force exerted during voluntary movement. J Neurophysiol. 1968;31:14–27. doi: 10.1152/jn.1968.31.1.14. [DOI] [PubMed] [Google Scholar]
- Fee MS, Kozhevnikov AA, Hahnloser RH. Neural mechanisms of vocal sequence generation in the songbird. Ann N Y Acad Sci. 2004;1016:153–170. doi: 10.1196/annals.1298.022. [DOI] [PubMed] [Google Scholar]
- Fiete IR, Senn W, Wang CZ, Hahnloser RH. Spike-time-dependent plasticity and heterosynaptic competition organize networks to produce long scale-free sequences of neural activity. Neuron. 2010;65:563–576. doi: 10.1016/j.neuron.2010.02.003. [DOI] [PubMed] [Google Scholar]
- Garst-Orozco J, Babadi B, Olveczky BP. A neural circuit mechanism for regulating vocal variability during song learning in zebra finches. Elife. 2014;3:e03697. doi: 10.7554/eLife.03697. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Georgopoulos AP, Schwartz AB, Kettner RE. Neuronal population coding of movement direction. Science. 1986;233:1416–1419. doi: 10.1126/science.3749885. [DOI] [PubMed] [Google Scholar]
- Goller F, Cooper BG. Peripheral motor dynamics of song production in the zebra finch. Ann N Y Acad Sci. 2004;1016:130–152. doi: 10.1196/annals.1298.009. [DOI] [PubMed] [Google Scholar]
- Graber MH, Helmchen F, Hahnloser RH. Activity in a premotor cortical nucleus of zebra finches is locally organized and exhibits auditory selectivity in neurons but not in glia. PLoS One. 2013;8:e81177. doi: 10.1371/journal.pone.0081177. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hahnloser RH, Kozhevnikov AA, Fee MS. An ultra-sparse code underlies the generation of neural sequences in a songbird. Nature. 2002;419:65–70. doi: 10.1038/nature00974. [DOI] [PubMed] [Google Scholar]
- Hartley RS, Suthers RA. Airflow and pressure during canary song: direct evidence for mini-breaths. J Comp Physiol A. 1989;165:15–26. [Google Scholar]
- Harvey CD, Coen P, Tank DW. Choice-specific sequences in parietal cortex during a virtual-navigation decision task. Nature. 2012;484:62–68. doi: 10.1038/nature10918. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huber D, Gutnisky DA, Peron S, O’Connor DH, Wiegert JS, Tian L, Oertner TG, Looger LL, Svoboda K. Multiple dynamic representations in the motor cortex during sensorimotor learning. Nature. 2012;484:473–478. doi: 10.1038/nature11039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jun JK, Jin DZ. Development of neural circuitry for precise temporal sequences through spontaneous activity, axon remodeling, and synaptic plasticity. PLoS One. 2007;2:e723. doi: 10.1371/journal.pone.0000723. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ko H, Hofer SB, Pichler B, Buchanan KA, Sjostrom PJ, Mrsic-Flogel TD. Functional specificity of local synaptic connections in neocortical networks. Nature. 2011;473:87–91. doi: 10.1038/nature09880. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kosche G, Vallentin D, Long MA. Interplay of inhibition and excitation shapes a premotor neural sequence. J Neurosci. 2015;35:1217–1227. doi: 10.1523/JNEUROSCI.4346-14.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kozhevnikov AA, Fee MS. Singing-related activity of identified HVC neurons in the zebra finch. J Neurophysiol. 2007;97:4271–4283. doi: 10.1152/jn.00952.2006. [DOI] [PubMed] [Google Scholar]
- Lenschow C, Brecht M. Barrel cortex membrane potential dynamics in social touch. Neuron. 2015;85:718–725. doi: 10.1016/j.neuron.2014.12.059. [DOI] [PubMed] [Google Scholar]
- Leonardo A, Fee MS. Ensemble coding of vocal control in birdsong. J Neurosci. 2005;25:652–661. doi: 10.1523/JNEUROSCI.3036-04.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li N, Chen TW, Guo ZV, Gerfen CR, Svoboda K. A motor cortex circuit for motor planning and movement. Nature. 2015;519:51–56. doi: 10.1038/nature14178. [DOI] [PubMed] [Google Scholar]
- Long MA, Fee MS. Using temperature to analyse temporal dynamics in the songbird motor pathway. Nature. 2008;456:189–194. doi: 10.1038/nature07448. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Long MA, Jin DZ, Fee MS. Support for a synaptic chain model of neuronal sequence generation. Nature. 2010;468:394–399. doi: 10.1038/nature09514. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lu X, Ashe J. Anticipatory activity in primary motor cortex codes memorized movement sequences. Neuron. 2005;45:967–973. doi: 10.1016/j.neuron.2005.01.036. [DOI] [PubMed] [Google Scholar]
- Lu X, Ashe J. Dynamic reorganization of neural activity in motor cortex during new sequence production. Eur J Neurosci. 2015 doi: 10.1111/ejn.12979. [DOI] [PubMed] [Google Scholar]
- Markowitz JE, Liberti WA, 3rd, Guitchounts G, Velho T, Lois C, Gardner TJ. Mesoscopic patterns of neural activity support songbird cortical sequences. PLoS Biol. 2015;13:e1002158. doi: 10.1371/journal.pbio.1002158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matsuzaka Y, Picard N, Strick PL. Skill representation in the primary motor ortex after long-term practice. J Neurophysiol. 2007;97:1819–1832. doi: 10.1152/jn.00784.2006. [DOI] [PubMed] [Google Scholar]
- Mello GB, Soares S, Paton JJ. A scalable population code for time in the striatum. Curr Biol. 2015;25:1113–1122. doi: 10.1016/j.cub.2015.02.036. [DOI] [PubMed] [Google Scholar]
- Mendez JM, Dall’Asen AG, Cooper BG, Goller F. Acquisition of an acoustic emplate leads to refinement of song motor gestures. J Neurophysiol. 2010;104:984–993. doi: 10.1152/jn.01031.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mita A, Mushiake H, Shima K, Matsuzaka Y, Tanji J. Interval time coding by neurons in the presupplementary and supplementary motor areas. Nat Neurosci. 2009;12:502–507. doi: 10.1038/nn.2272. [DOI] [PubMed] [Google Scholar]
- Mooney R, Prather JF. The HVC microcircuit: the synaptic basis for interactions between song motor and vocal plasticity pathways. J Neurosci. 2005;25:1952–1964. doi: 10.1523/JNEUROSCI.3726-04.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nachev P, Coulthard E, Jager HR, Kennard C, Husain M. Enantiomorphic normalization of focally lesioned brains. Neuroimage. 2008;39:1215–1226. doi: 10.1016/j.neuroimage.2007.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nottebohm F, Stokes TM, Leonard CM. Central control of song in the canary, Serinus canarius. J Comp Neurol. 1976;165:457–486. doi: 10.1002/cne.901650405. [DOI] [PubMed] [Google Scholar]
- Okubo TS, Mackevicius EL, Payne HL, Lynch GF, Fee MS. Growth and splitting of neural sequences in songbird vocal development. Nature. 2015;528:352–357. doi: 10.1038/nature15741. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oomura Y, Yoshimatsu H, Aou S. Medial preoptic and hypothalamic neuronal activity during sexual behavior of the male monkey. Brain Res. 1983;266:340–343. doi: 10.1016/0006-8993(83)90666-2. [DOI] [PubMed] [Google Scholar]
- Pastalkova E, Itskov V, Amarasingham A, Buzsaki G. Internally generated cell assembly sequences in the rat hippocampus. Science. 2008;321:1322–1327. doi: 10.1126/science.1159775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peh WY, Roberts TF, Mooney R. Imaging auditory representations of song and syllables in populations of sensorimotor neurons essential to vocal communication. J Neurosci. 2015;35:5589–5605. doi: 10.1523/JNEUROSCI.2308-14.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perl YS, Arneodo EM, Amador A, Goller F, Mindlin GB. Reconstruction of physiological instructions from Zebra finch song. Phys Rev E Stat Nonlin Soft Matter Phys. 2011;84:051909. doi: 10.1103/PhysRevE.84.051909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peters AJ, Chen SX, Komiyama T. Emergence of reproducible spatiotemporal activity during motor learning. Nature. 2014;510:263–267. doi: 10.1038/nature13235. [DOI] [PubMed] [Google Scholar]
- Ravassard P, Kees A, Willers B, Ho D, Aharoni D, Cushman J, Aghajan ZM, Mehta MR. Multisensory control of hippocampal spatiotemporal selectivity. Science. 2013;340:1342–1346. doi: 10.1126/science.1232655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Riede T, Goller F. Functional morphology of the sound-generating labia in the syrinx of two songbird species. J Anat. 2010;216:23–36. doi: 10.1111/j.1469-7580.2009.01161.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sober SJ, Wohlgemuth MJ, Brainard MS. Central contributions to acoustic variation in birdsong. J Neurosci. 2008;28:10370–10379. doi: 10.1523/JNEUROSCI.2448-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Solis MM, Perkel DJ. Rhythmic activity in a forebrain vocal control nucleus in vitro. J Neurosci. 2005;25:2811–2822. doi: 10.1523/JNEUROSCI.5285-04.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suthers RA, Goller F, Hartley RS. Motor dynamics of song production by mimic thrushes. J Neurobiol. 1994;25:917–936. doi: 10.1002/neu.480250803. [DOI] [PubMed] [Google Scholar]
- Tanji J, Shima K. Role for supplementary motor area cells in planning several movements ahead. Nature. 1994;371:413–416. doi: 10.1038/371413a0. [DOI] [PubMed] [Google Scholar]
- Todorov E. Direct cortical control of muscle activation in voluntary arm movements: a model. Nat Neurosci. 2000;3:391–398. doi: 10.1038/73964. [DOI] [PubMed] [Google Scholar]
- Troyer TW. Neuroscience: The units of a song. Nature. 2013;495:56–57. doi: 10.1038/nature11957. [DOI] [PubMed] [Google Scholar]
- Vallentin D, Long MA. Motor origin of precise synaptic inputs onto forebrain neurons driving a skilled behavior. J Neurosci. 2015;35:299–307. doi: 10.1523/JNEUROSCI.3698-14.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vicario DS. Contributions of syringeal muscles to respiration and vocalization in the zebra finch. J Neurobiol. 1991a;22:63–73. doi: 10.1002/neu.480220107. [DOI] [PubMed] [Google Scholar]
- Vicario DS. Organization of the zebra finch song control system: II. Functional organization of outputs from nucleus Robustus archistriatalis. J Comp Neurol. 1991b;309:486–494. doi: 10.1002/cne.903090405. [DOI] [PubMed] [Google Scholar]
- Vu ET, Mazurek ME, Kuo YC. Identification of a forebrain motor programming network for the learned song of zebra finches. J Neurosci. 1994;14:6924–6934. doi: 10.1523/JNEUROSCI.14-11-06924.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu AC, Margoliash D. Temporal hierarchical control of singing in birds. Science. 1996;273:1871–1875. doi: 10.1126/science.273.5283.1871. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.