Skip to main content
Journal of Neurophysiology logoLink to Journal of Neurophysiology
. 2010 Jun 16;104(2):984–993. doi: 10.1152/jn.01031.2009

Acquisition of an Acoustic Template Leads to Refinement of Song Motor Gestures

Jorge M Méndez 1,, Analía G Dall'Asén 1, Brenton G Cooper 2, Franz Goller 1
PMCID: PMC2934944  PMID: 20554848

Abstract

Vocal learning, a key behavior in human speech development, occurs only in a small number of animal taxa. Ontogeny of vocal behavior in humans and songbirds involves acquisition of an acoustic model, which guides the development of self-generated vocalizations (sensorimotor period). How vocal development proceeds in the absence of an acoustic model is largely unknown and cannot be studied directly in humans. Here we explored the effects of an acoustic model on song motor control by comparing peripheral motor gestures (respiration and syringeal muscles) of tutored birds with those of birds raised in acoustic isolation. Although the overall use of syringeal muscles during song was similar in both groups, tutored birds displayed enhanced temporal patterns of activation in respiratory and syringeal motor gestures. Muscle activation was more uniformly distributed throughout the song of tutored birds than that of untutored birds. Similarly, the respiratory effort was similar for both groups, but the expiratory pulses of song contained more modulations and temporal complexity in tutored birds. These results indicate that the acquisition of an acoustic template guides a refinement of experience-independent motor gestures by increasing temporal fine structure, but there is no difference in bilateral activation patterns for a given sound between the two groups. Nevertheless, these subtle temporal changes in muscle activation give rise to pronounced acoustic differences between the songs of the tutored and untutored birds. Experience with song during ontogeny therefore guides a more refined use of experience-independent motor programs.

INTRODUCTION

Vocal learning is an imprinting-like process in which acoustic models of species-typical vocalizations are acquired during a sensory period (Adret 2008; Brainard and Doupe 2002). These stored representations are thought to serve as a template for vocal development (Konishi 2004; Nick and Konishi 2005). Regardless of the presence of an acoustic model, vocal development proceeds through a sensorimotor practice phase in which vocal production is gradually perfected through acoustic feedback (Brainard and Doupe 2002). The timing of song ontogeny is species specific. For example, zebra finches acquire song during a sensory phase that typically lasts from posthatching day (phd) 25–60, whereas the sensorimotor phase starts at phd 30 and lasts until phd 90, at which time crystallization occurs. It is thought that, in the absence of a learned template, vocal ontogeny is shaped by an experience-independent representation of genetically predetermined vocalizations (Forstmeier et al. 2009).

Humans and songbirds share these basic features of vocal learning (Doupe and Kuhl 2008). However, in humans, it is not possible to study the role of an acquired acoustic template in modifying innate motor instructions for speech. Deaf children are unable to perceive the acoustic models during the sensory phase but are also not able to hear their own vocalizations during the sensorimotor practice phase, likely resulting in much greater speech deficits. Although a few historic “Kaspar Hauser” cases (children deprived of social contact and therefore potentially of acoustic models for speech) have been reported, it is unclear to what degree they have been exposed to speech at an early age (Fromkin 1974; Lane 1976). In songbirds, it is possible to test the influence of an acquired acoustic model on the motor instructions for learned vocal behavior by rearing birds in isolation, preventing exposure to song early in development, and comparing song motor patterns between group-reared and isolate-reared birds.

Song production involves the coordinated activity of motor systems for respiration, vocal organ (syrinx), and articulatory systems of the upper vocal tract (Goller and Cooper 2004). The respiratory system generates sequences of expiratory and inspiratory pulses, which determine the coarse temporal pattern of the song (Goller and Cooper 2004; Suthers and Zollinger 2004; Wild et al. 1998). The activity of the syringeal muscles is thought to provide fine control of airflow and thus temporal features as well as the acoustic structure of vocalizations (Goller and Suthers 1996a,b). In brown thrashers (Toxostoma rufum) for example, the activity of the ventral syringeal muscle (vS) is correlated with the fundamental frequency of the vocalization, and the dorsal and ventral tracheobronchial muscles regulate the syringeal valve. Although the respiratory dynamics of zebra finches during song were studied (Goller and Daley 2001; Franz and Goller 2003), the syringeal dynamics for this species are insufficiently understood, and the specific role of individual muscles is likely to deviate from the respective roles described in thrashers (Goller and Cooper 2004; Vicario 1991).

In the zebra finch, song is a stereotyped sequence of four to eight different syllables (motif), which is repeated a variable number of times in a song bout. Males exposed to a tutor during the critical period acquire song elements from its song and refine their own vocalizations by practice (Price 1979). Males raised without exposure to a tutor model (untutored) develop song that is somewhat similar to that of tutored birds but is characterized by atypical frequency structure of syllables (syllables are defined as uninterrupted traces on a spectrogram) and the presence of notes (acoustic units within a syllable) with atypically long duration (Fehér et al. 2009).

The coordinated activity of respiratory, syringeal, and upper vocal tract motor systems emerges from a well-characterized neural song production circuit (Brenowitz et al. 1997). The detailed nature of the motor program for song in the zebra finch is still largely unknown, despite recent advances in chronic recordings from neurons in premotor areas of this circuit (Hahnloser et al. 2002; Leonardo and Fee 2005). Nevertheless, recent behavioral and neural studies have unraveled exciting details about song learning in the zebra finch. Exposure of the juvenile bird to the song of a tutor causes rapid emergence of some features of the song model in the juvenile bird, such as syllable duration, frequency modulation (FM), and syllable sequence (Tchernichovski et al. 2004). Hearing tutor song also causes a rapid change in the activity of neurons in a premotor nucleus [robust nucleus of the arcopallium (RA)] of the song circuit (Shank and Margoliash 2009). In tutored birds, individual RA neurons fire short bursts during specific segments of song (Yu and Margoliash 1996), such that the combined output of many neurons may constitute the full sequence of premotor information (Leonardo and Fee 2005). It is unknown whether an acquired acoustic model of song leads to changes in the firing pattern of individual neurons or a change in the population output in RA.

Here we explored how the presence of an acoustic template during development affected the motor instructions for song by studying respiratory and syringeal motor gestures (i.e., the activity of the motor systems related to the execution of an action). By comparing the motor gestures after song development is completed in tutored and untutored birds, we are able to infer how the acoustic experience affects motor control of song. The main specific questions about the role of acoustic experience are as follows: 1) does acoustic experience qualitatively change the motor control of song production and 2) how do differences in motor gestures control the acoustic differences between experience-dependent and experience-independent song?

METHODS

Animals and isolation protocol

We used live tutored (tutored; n = 10) zebra finches and birds with no exposure to tutor song during development (untutored; n = 14) in this study. The tutored birds were bred and raised in a flight aviary, whereas the untutored birds were bred with parents and siblings in acoustic isolation chambers until posthatching day 7 (phd 7). At this point, the father was removed from the breeding cage, and at 35 phd, the bird was put in an acoustic isolation chamber and kept in a room with no adult male zebra finches until the bird's song was recorded (after 120 phd).

Surgical procedure and recording

All experimental procedures were approved by the IACUC of the University of Utah. For the analysis of air sac pressure, data were collected from 10 tutored birds and 14 untutored birds. Data from five tutored birds and seven untutored birds were used for the analysis of the EMG activity of the syringeal muscles. For all the analyses, we used data from directed song.

The surgical procedures and recording methods were previously described in detail (Goller and Suthers 1996a,b). In summary, an elastic belt with a Velcro connector was fitted around the bird's thorax to which a wire was attached for tethering the bird to a counterbalanced tether arm. After the birds resumed singing, the surgical procedure was carried out. The bird was deprived of food and water for 1 h before the surgery. Chloropent anesthesia was used for implanting the EMG electrodes, and isoflurane anesthesia was used for implanting a cannula into an air sac.

Air sac pressure recording

A small hole was made in the abdominal wall, and a flexible cannula (Silastic tubing, Dow Corning, Midland, MI) was inserted into an anterior thoracic air sac. The cannula was sutured to the ribs, and the free end of the cannula was connected to a pressure transducer (FPM-02PG, Fujikura) mounted on the backpack.

Syringeal EMG recordings

The syrinx was accessed through the interclavicular air sac, and bipolar electrodes were inserted into the syringeal muscles and secured into place with tissue adhesive. Two pairs of electrodes were implanted in left and right vS or dorsal tracheobronchial (dTB) muscles. The electrodes were routed subcutaneously to the backpack. Both syringeal muscle pairs were recorded in some individuals by performing successive surgeries on each bird with 1–3 mo between experiments.

Data analysis

All the analysis methods for EMG and air sac pressure data described in the following were programmed in Matlab (version 7.7.0, Mathworks, Natick, MA).

ACTIVE TIME RATIO FOR THE AIR SAC PRESSURE.

The pressure pattern was adjusted and normalized to make the pressure amplitudes of the motif fit into the interval [0, 1].

The probability distribution of the amplitude of the pressure patterns were calculated for both tutored and untutored groups. Based on the results of a previous study measuring air sac pressures in zebra finches (Goller and Daley 2001), a threshold value separating expiratory from inspiratory values was established (threshold for normalized pressure = 0.3). The areas beneath the probability distributions between the threshold and 1 were computed. The sets of values for tutored and untutored groups were statistically compared.

CV OF THE PRESSURE PATTERN.

The CV (CV) was calculated for each bird as the ratio between SD and the mean of the pressure pattern during the motif.

PROBABILITY DISTRIBUTION OF THE PRESSURE PATTERN DERIVATIVE (PP′).

The value of the derivative of the pressure pattern was calculated at each time point evaluating the derivative of the polynomial (degree 5) that provides the best fit for the pressure pattern in an interval of 10 ms around that time point. The probability distribution of these values was built (PP′). From the probability distributions, the following cumulative distributions C(x) were computed

c(x)=xxPp'(y)dy

For each group, the values xc so that C(xc) = 0.9 were calculated, and a Kolmogorov-Smirnov test was run to assess the significance of the differences between groups.

Quantification of the modulation of pressure patterns

To quantify the modulation characteristics of air sac pressure for individual pulses of the song motif, we fitted a fourth-order polynomial (using the least square method) to the expiratory pulse. This procedure reproduces the main shape of the pressure pulse but without the superimposed fine modulations of pressure. We calculated the summed error between the actual pressure signal during the expiratory pulse and the fitted curve per unit time. This pressure modulation index (PMI) provides a quantification of the modulation complexity that is independent of pulse duration.

PERMUTATION ENTROPY [H(N)].

The method proposed by Bandt and Pompe (2002) to compute the permutation entropy of a time series was used. The permutation entropy is a widely used quantifier for the degree of complexity of a time series. Briefly, the method consists of the following algorithm. The time series {xi}i=1 … N is first embedded in a phase space that gives the order (n) of the entropy with a time-delay T (given in number of points). Hence, the set of vectors (xi, xi+T,…, xi+(n−1)T) is the result of the embedding. A permutation (p) of the elements of each vector puts the elements of the rearranged vector in ascending order. The relative frequency [P(p)] of each permutation in the set of vectors is calculated. The permutation entropy h(n) for n > 2 is defined as

h(n)=P(p)log[P(p)]log(n!)

In the sum, all the possible permutations of n elements are added.

For the various analyses of EMG recordings, the raw EMG signals were rectified, integrated with a 2.5-ms sliding time window, and normalized. We named this transformed signal “the activation pattern” (|∇EMG|).

MUSCLE USE (MU).

To assess the absolute activity of a muscle per unit time during the song, the following quantifier was calculated

MU=Tquiet¯Tmotifmotif|EMG(s)|dsrespcycle|EMG(s)|ds

The upper line means average calculated over several respiratory cycles, Tquiet is the period of the quiet respiratory cycle, and Tmotif is the duration of the motif. To calculate the average, usually five quiet respiratory cycles were used.

ACTIVE TIME RATIO OF THE EMG (ATR_EMG).

We computed the ratio of time during while the muscle is active during the motif. The active state was defined whenever the muscle was activated at an amplitude 1.5 times greater than the maximal value during normal breathing.

INTEGRAL VARIATION OF THE ACTIVATION PATTERN FROM ITS MEAN (V).

The sum of the absolute values of the difference between |∇EMG| and its mean during the motif was calculated.

CUMULATIVE VARIATION OF THE ACTIVATION PATTERN (VC).

If the motif goes from t = 0 to t = T, VC is calculated as

E(t)=0t|EMG(s)|ds0T|EMG(s)|ds,D(t)=tT
VC(t)=E(t)D(t)

The ratio of time during the motif while VC is negative is used to quantify how long the activation amplitude is lower than what it would be if the activation was uniformly distributed across the motif. A statistical comparison between these sets of values for tutored and untutored birds was calculated.

CORRELATIONS.

The autocorrelation function was computed for each EMG time series of birds in both groups. The lag was converted to time values for plotting. Also, the Pearson's correlation coefficient was computed for each bird for the time series corresponding to the EMG activity of right and left muscles.

QUANTIFICATION OF THE DEGREE OF SYNCHRONY.

To quantify the degree of synchrony of left and right muscles at a precise moment (corresponding to the element i of the time series), the instantaneous cross-correlation coefficient [Co(i)] at that time was calculated as

c0(i)=(x(i)x¯)(y(i)y¯)σxσy
HIGH-FREQUENCY MODULATIONS IN THE EMG SIGNAL.

Power spectral density of the EMG activation patterns was calculated based on Thomson's multitaper method (Thomson 1982). For frequencies between 10 and 300 Hz, we tested whether power spectral densities were different between tutored and untutored birds. To do so, we divided the frequency range into 20-Hz bins, calculated the signal power in that range, and performed a Kolmogorov-Smirnov test on the pooled data from all individuals in each group for each bin.

Lateralization

The difference in activation of the left and right muscles per unit time is used for quantifying the degree of lateralization

L=1T0TEMG1(t)||EMG2(t)dt

where the interval [0,T] refers to the whole duration of the motif (Lmotif) or the duration of each syllable (Lsyl).

SOUND ANALYSIS.

The analysis of the songs of every individual in both experimental groups was performed using SOUND ANALYSIS PRO (Tchernichovski et al. 2000; http://ofer.sci.ccny.cuny.edu:2001/html/sound_analysis.html).

RESULTS

Examples for simultaneously recorded respiratory pressure and EMG activity of syringeal muscles during a song motif are shown in Fig. 1 for an untutored and a tutored bird. Because songs varied substantially between individuals in each group, quantitative analyses are used to explore systematic differences between the two groups.

Fig. 1.

Fig. 1.

Representative recordings of song and physiological parameters for tutored (right) and untutored (left) zebra finches. A: spectrogram of the song. B: air sac pressure patterns. C: activation pattern of the left ventral syringeal muscle (vS) muscles during the motif. D: activation pattern of the right vS muscles during the motif. Air sac pressure and EMG traces are shown as relative voltages. EMG activity was rectified, integrated, and normalized so that the maximal value during the motif is 1. The time series of the EMG patterns were not shifted to account for the delay between electrical activation and movement of syringeal effectors. Generally, the syllables are produced during expiratory air sac pressure pulses, although sometimes zebra finches produce sound during deep inspirations as the one shown for the tutored bird (2nd to last syllable accompanied by deep inspiratory pressure).

Activation level of peripheral motor systems

The first question to address is whether exposure to a tutor song changes the overall activation level of peripheral motor systems for song. We compared the ratio of expiratory and inspiratory durations for the entire motif [active time ratio for the air sac pressure (ATRP)] and the CV (CV) of the amplitude of air sac pressure patterns between both tutored and untutored birds and found no significant differences [ATRP: Kolmogorov-Smirnov test, D = 0.134 and CV (untutored) = 0.47 ± 0.01, CV (tutored) = 0.46 ± 0.02, Kolmogorov-Smirnov test, D = 0.24].

Next, we asked whether overall activation of syringeal muscles differs between both groups. The mean muscle use (MU) for the motif did not differ between tutored and untutored groups for the left and right vS or for dTB muscles (Kolmogorov-Smirnov test, D = 0.27 for vS and D = 0.9 for dTB). Additionally, there was no significant difference in the time ratio of muscle activity during the motif (ATR_EMG) for vS and for dTB (Kolmogorov-Smirnov test, D = 0.87 for vS and D = 0.67 for dTB). These results indicate that tutoring does not change the overall use of respiratory or syringeal muscles for song production.

Temporal structure of respiratory and syringeal patterns

One important question to address is whether the acquired acoustic template leads to a change in the degree of modulation and complexity of the respective contributions to song by different motor systems. In a comparison of different syllables for tutored and untutored groups, songs in the tutored group contained a higher degree of respiratory modulation (Fig. 2). The increased modulation of respiratory pressure during song syllables in tutored birds produced more variable acoustic behavior than was found in the untutored group. Furthermore, to understand the complexity at the behavioral level, it is necessary to record the respiratory and syringeal motor patterns simultaneously. For example, in the syllables presented in Fig. 2 for the untutored birds, the pressure patterns do not reflect the frequency and AM displayed in the acoustic output, suggesting that these features were generated by syringeal control.

Fig. 2.

Fig. 2.

In tutored birds, air sac pressure of expiratory pulses is modulated more frequently. A and B: examples showing differences in modulation of pressure between tutored and untutored birds for long duration syllables. Untutored birds produce long syllables with simple, unmodulated pressure patterns despite high-frequency modulations of the acoustic output, which are produced by activity of the syringeal muscles. Air sac pressure patterns are shown as relative voltages and are offset to allow comparison of tutored and untutored birds. Each of the expiratory pulses shown corresponds to a syllable. The spectrograms of these syllables are displayed on the right side and identified by arrows.

Based on these qualitative observations of increased modulation in expiratory pressure for the tutored birds, we compared the temporal structure of the motor gestures quantitatively between tutored and untutored birds. The dynamical complexity of the time series of pressure patterns was assessed by computing the permutation entropy (Bandt and Pompe 2002). The comparison between both groups yielded that tutored birds display more complex pressure patterns (Kolmogorov-Smirnov test, D <0.05 for time delays between 3 and 9 ms and order 5; see methods; Fig. 3A). Furthermore, the shape of the probability distribution of the derivative of the air sac pressure patterns for the tutored birds is broader (see methods; Kolmogorov-Smirnov test, D = 0.01 for a cumulative value of 0.9), showing more temporal modulation of air sac pressure (Fig. 3B). This indicates that respiratory pressure patterns of tutored birds present more fluctuations along a broad time scale, consistent with the existence of more variable values of the pressure derivative. All these results point to a more frequent occurrence of high-frequency fluctuations in the respiratory motor gestures of tutored birds.

Fig. 3.

Fig. 3.

Air sac pressure pattern analysis. A: averaged permutation entropy (h) for the tutored (black) and untutored (gray) birds. Inset: output of the Welch's t-test between both groups of recordings. B: representative probability distribution of the pressure pattern derivative (PP′) during the motif for the tutored (black) and untutored (gray) birds. The results of these analyses are consistent with the representative examples shown in Fig. 2. The existence of more modulations in the pressure patterns of tutored birds increased their complexity and the width of the probability distribution of the pressure pattern derivative.

The relationship between the mean pressure and sound frequency differs between tutored and untutored birds. Whereas high-frequency sounds in tutored birds are generated with higher mean air sac pressure than low-frequency sounds, this difference is less distinct in untutored birds (Fig. 4A). Air sac pressure values for high-frequency sounds in untutored birds overlap widely with those for low-frequency sounds.

Fig. 4.

Fig. 4.

Differences exist in the ranges of air sac pressure and EMG activity between tutored and untutored birds. A: the relation between fundamental frequency of sound elements (AFFs) and the mean pressure (taken from the normalized pressure pattern) used for their production is plotted for untutored (top) and tutored birds (bottom). Untutored birds produce sounds of a given fundamental frequency with a wider range of mean pressures than tutored birds. B: the ranges of EMG activation and complexity of pressure modulation are different between tutored (black symbols) and untutored (gray symbols) birds. For each syllable of the motif, the modulation of its pressure pattern is quantified by calculating its pressure modulation index (PMI), and the use of the syringeal muscles is calculated by computing the average EMG activity per unit time (EMG activity normalized to the average EMG activity during quiet breathing). There is an increased tendency toward complex modulation patterns in the songs of tutored birds relative to those of untutored birds.

The relationship between modulation index of the expiratory air sac pressure and the activation of syringeal muscles also indicates a difference in motor control between tutored and untutored birds (Fig. 4B). The values for average EMG activity during syllables span a wider range, particularly for high values, in untutored birds. Pressure modulation indices extend to higher values in tutored birds. This difference suggests that tutored birds use a combination of respiratory and syringeal control to achieve sound modulation characteristics, whereas untutored birds predominantly use syringeal control.

We inspected the high-frequency sound elements that were generated with relatively low air sac pressure in untutored birds more closely to test whether these constituted examples of sounds that are generated with different motor gestures by tutored and untutored birds. This was not the case, because the sound elements in this pressure range are atypical sound elements that are rarely produced by tutored birds. The motor gestures for producing similar sounds were therefore similar in both groups, and the differences in the respective ranges of air sac pressure and EMG activity of syringeal muscles correspond to acoustic differences. This is further shown by similarities in motor gestures for other shared acoustic characteristics. Tutored and untutored birds produced frequency modulations in syllables by modulating the activity of the syringeal muscles. In both groups, harmonic stacks were generated with low EMG activation of vS muscles (Fig. 5, A and B).

Fig. 5.

Fig. 5.

Untutored birds produce long sequences of frequency and amplitude modulated sounds with synchronized bursts of high activation of syringeal muscles, whereas long sound segments in tutored birds tend to be harmonic stacks. Representative examples of a long syllable (spectrographic representation in bottom panels) for an untutored bird (A) and for a tutored bird (B) are shown with the corresponding pressure pattern (black), and the activity of the left (red) and right (blue) vS muscles. Both groups produce frequency modulations in some of their syllables by modulating the activity of syringeal muscles (bursts), but in tutored birds, the duration of these sound elements is typically short. Also, during the production of harmonic stacks, the activation of vS is low in both groups. Air sac pressure and EMG traces are normalized to the [0, 1] interval.

The main difference in song between untutored and tutored birds is how various notes and their corresponding motor gestures are used to compose the song motif. For example, untutored birds include atypically long notes that are modulated in frequency and amplitude at a high rate. This modulation is accompanied by burst-like, high-amplitude activation of the syringeal muscles (Fig. 5A). Tutored birds also produced such burst-like activity of their vS muscles, but these activation patterns and corresponding sound segments were much shorter than in untutored birds.

An analysis of the spectral content of vS muscle activation for tutored and untutored birds shows a difference in their respective use (Fig. 6A). In tutored birds, signal power between 10 and 300 Hz is more evenly spread across the full spectral range. Untutored birds had significantly lower power in the range of 40–100 Hz (Kolmogorov-Smirnov test, D < 0.05) and a distinct peak in the range of 140–180 Hz (Fig. 5A). This difference in the spectral distribution of power is most likely explained by the fact that untutored birds sang uncharacteristically long syllables with high-frequency amplitude and frequency modulations, which are accompanied by EMG bursts that match the modulation frequency (Fig. 5A).

Fig. 6.

Fig. 6.

Analysis of the activation patterns of vS syringeal muscles. A: averaged power spectral density of the tutored (black) and untutored (gray) birds. The black horizontal line indicates the region where the signal power of the tutored group is significantly greater than that of the untutored group (40–100 Hz), and the gray line indicates significantly greater power in the untutored group (140–180 Hz). B: cumulative analysis of variations for 2 representative examples of the activation pattern during the motif for the tutored (bottom) and untutored birds. The time is normalized to the duration of the motif. The quantifier D(t) represents the ratio of the cumulative activity of the muscle up to time t if the EMG activity were uniformly distributed over the motif, and E(t) corresponds to the actual ratio of cumulative activity of the muscle up to time t. This allows a quantification of how much the distribution of the muscle activity deviates from a uniform distribution across the motif. C: representative examples of the autocorrelation function of the vS muscles for tutored (black) and untutored (gray) birds. Tutored birds switch more frequently between high and low levels of activation.

How is muscle activation distributed across the motif? Although per unit time, the integrated variation of the muscular activation patterns does not differ significantly between the songs of the two groups (Kolmogorov-Smirnov test, D = 0.65 for vS and D = 0.32 for dTB), the distribution of vS activation across the motif is significantly different (see methods; cumulative analysis of variations; Kolmogorov-Smirnov test, D = 5 × 10−4; Fig. 6B). Activity in untutored birds is typically concentrated toward the end of the motif, whereas in tutored birds, it is more evenly distributed across the motif.

The differences in the temporal structure of muscle activation between tutored and untutored birds go beyond this coarse level. The analysis of the autocorrelation function shows a clear difference in the temporal organization of muscle activation bursts. In tutored birds, autocorrelation functions are less broad, which is consistent with more frequent switching between high and low activation and thus a more variable structure of the EMG pattern [mean of the full width at half-maximum (FWHM) for tutored birds = 9 ms and for untutored birds = 28 ms; Kolmogorov-Smirnov test, D <5 × 10−4; Fig. 6C]. Untutored birds maintain a certain activation level (high or low) for a longer time.

Bilateral syringeal activation

The quantification of lateralization of EMG activity (Lmotif) for the whole motif or individual syllables did not yield significant differences between tutored and untutored birds (Kolmogorov-Smirnov tests, D = 0.44 for the motifs and D = 0.7 for the syllables). Also, the Pearson's correlation coefficients for left-right comparisons were not significantly different between both groups (Kolmogorov-Smirnov test, D = 0.53). Both analyses show therefore that lateralization of vS activity is not different between tutored and untutored birds.

The beginning of an expiratory pulse of the song motif is typically accompanied by a unilateral or bilateral activation of the syringeal muscles. For these syllable onset events, activation of the left and right vS muscles was more synchronous in tutored birds than in untutored birds, which resulted in a bigger instantaneous cross-correlation at the beginning of the pressure pulse (Kolmogorov-Smirnov test, D = 0.002). Figure 7 shows one example of a synchronous burst occurring in the left and right vS muscles. Another example (Fig. 5) also shows this difference; the syllable from an untutored bird (Fig. 5A) shows no synchronous burst, whereas the one from a tutored bird (Fig. 5B) shows again the clear occurrence of this burst in both muscles.

Fig. 7.

Fig. 7.

At syllable onset, tutored birds typically show synchronized activation of the left and right vS. In this example, a high-amplitude EMG burst in the left (red) and right (blue) vS occurs at a point where air sac pressure levels (black) reach a level sufficient for phonation.

Speed of syringeal muscles

To test whether the differences in the sensorimotor learning process impact the biomechanics of vocal production, we compared the maximal EMG burst rates of the syringeal muscles between tutored and untutored birds. The maximal EMG burst rates for vS muscles were 237.0 ± 14.2 and 217.4 ± 10.6 Hz for tutored and untutored birds, respectively; the difference was not statistically significant (Kolmogorov-Smirnov test, D = 0.2). In addition, the relationships between the mean vS EMG activity and fundamental frequency of sound did not differ between tutored and untutored birds. These results suggest that biomechanics of vocal control are not strikingly different between tutored and untutored birds.

In summary, song motor gestures of tutored birds contain modulations more evenly distributed across time and across frequencies. This is consistent with shorter periods of increased activation of the muscles and also shorter periods of time displaying the maximal EMG burst rate. Furthermore, coordination between the left and right sides of the syrinx is different between songs of tutored and untutored birds. How are these differences in motor gestures expressed in the acoustic signals they generate?

Corresponding acoustic differences

As was found previously (Fehér et al. 2009), note duration is significantly longer in untutored birds. This tendency in untutored birds to maintain a particular acoustic feature within a syllable was particularly common for high-frequency notes. Although the acoustic features of high-frequency notes do not differ between tutored and untutored birds (AM: tutored = 0.0036 ± 0.0006, untutored = 0.0038 ± 0.0005; FM: tutored = 43.62 ± 15.68; untutored = 46.09 ± 14.01; entropy: tutored = −1.76 ± 0.80; untutored = −2.16 ± 0.92), the much longer mean duration (102.3 ± 15.9, n = 28 vs. 36.30 ± 3.13, n = 24) distinguishes notes between the two groups. This acoustic difference in note duration is consistent with the observation that respiratory and syringeal motor gestures show more frequent switching between activation levels in tutored birds than in untutored birds.

A second striking feature of untutored songs is the more frequent occurrence of simultaneous, independent fundamental frequencies in harmonic stack elements. Whereas such elements were only present in 2.6% of the harmonic stack segments in tutored birds, they occurred in 28.6% of the elements in untutored birds. The two independent frequencies most likely originate from the two sound generators (Goller and Cooper 2004; Jensen et al. 2007) and may therefore indicate different coordination between the two sides in untutored birds. This difference, however, was not reflected in the integrated analyses of lateralization, in which data from all syllable types were pooled.

DISCUSSION

This study provides insight into how the acquisition of an acoustic model early in life shapes the motor control for song production at the level of the respiratory and syringeal motor gestures. Understanding the changes in peripheral motor gestures is a first step toward unraveling the effects of early acoustic experience on the central motor program for song. The results indicate that untutored and tutored birds make similar use of the motor systems over the course of a motif and use similar respiratory and syringeal motor gestures to generate similar sounds and that there are no functional differences in the achieved activation rate of syringeal muscles. The main differences in motor gestures are found in the temporal features, which are a direct result of the temporal arrangement of acoustic elements within syllables and the motif. We first discuss the implications of these findings on the mapping of motor gestures onto acoustic output and then draw some implications about the effects of an acoustic model on the central control of song production.

Motor control in both groups uses qualitatively similar motor gestures

The exposure to an auditory tutor model results in a refinement of the experience-independent motor gestures and does not lead to qualitatively different motor control patterns. Such a difference could be expected if more complex acoustic output was achieved by an overall increase in muscle use throughout the motif. This expectation is consistent with syringeal biomechanics. Rapid gating of airflow and high-frequency sounds have been associated with high EMG activity in different syringeal muscles (Goller and Suthers 1996a,b). If sophisticated frequency control and rapid modulation of airflow are the physiological control mechanisms for generation of diverse acoustic features, more muscle use would translate into greater acoustic complexity. However, our data suggest that tutored zebra finches do not show increased utilization of muscles.

Another possibility how tutoring might affect motor control is through direct influences on the peripheral musculature. During development, muscle activation patterns could influence the fiber type composition of muscles. Innervation patterns of muscles are known to affect the development of motor units in other systems (Nacarrete and Vrbová 1993). Whereas this study did not directly investigate the organization of syringeal muscles, our data on the burst-like activation patterns suggest that both untutored and tutored birds used equally high burst rates of syringeal muscle activation to generate amplitude and FM patterns in the song. Similar burst rates indicate that syringeal muscles can perform at high contraction speeds in both groups (Elemans et al. 2008) and suggest that very fast motor units are present in both groups.

Exposure to model song causes changes in temporal distribution of qualitatively similar motor gestures

Differences in the temporal activation patterns of the peripheral muscles is an important difference between tutored birds and those without exposure to adult song. This difference is manifested as more frequent modulation of both the respiratory and the syringeal motor gestures over the course of the motif in the songs of tutored birds. It is particularly striking that the respiratory contributions to fine modulation of air pressure and flow conditions are so much more pronounced in tutored birds. The quantitative differences in respiratory modulation patterns result mainly from the unusually long high-frequency notes in the songs of untutored birds. As most of the quantitative analyses show, other changes to the experience-independent motor gestures are relatively subtle at the level of muscle activation patterns but result in the well-described, distinct acoustic differences between tutored and untutored songs (Price 1979). Longer duration of acoustic notes reflects less temporal structure of motor gestures as a given activation pattern is generated for a longer period of time.

These analyses of respiratory and syringeal motor patterns suggest that exposure to an acoustic model does not result in the development of novel motor gestures. Instead, the same set of motor gestures present in untutored birds is used, but distributed differently over the course of the song motif. This similar mapping of neural control patterns onto acoustic output in the two groups makes it possible to predict changes in motor gestures from the acoustic signals in each group. However, whereas changes in the respiratory and syringeal motor gestures are subtle, the resulting acoustic effects are quite pronounced, suggesting a strongly nonlinear mapping of motor gestures onto acoustic output. This poses the question of whether additional nonlinearity exists at higher levels in the central control of the song motor program. This translation of motor activity into behavior could represent a more general mechanism of how motor skills are added without increasing the overall muscle activation or energy expenditure. Behavioral change may be effected more generally by the use of nonlinearities in mapping neural instructions onto movement.

Implications for how experience shapes vocal motor programs

Although single unit recordings in RA in tutored birds suggest that the output signal to the syringeal and respiratory areas is comprised of the combined activation of individual neurons, we still lack a thorough understanding of the central motor program of song production. More detailed data on functionally connected populations of neurons and their combined output to syringeal and respiratory areas may be necessary to elucidate the motor program, but such information is currently not available. However, syringeal motor gestures reflect motor activity that is directly controlled by premotor activity in RA and thus reflect the population output signal from RA. This motor information parallels insights gained from studying peripheral activation patterns in the process of learning other motor skills (Shadmehr and Holcomb 1997; Shadmehr and Thoroughman 2000; Thoroughman and Shadmehr 1999). However, the song learning system is unique in that it does not only represent a motor learning process but motor learning as a consequence of the acquisition of an acoustic template for song.

Experience-dependent modification of the song motor program seems to operate within an experience-independent framework of motor gestures. These basic gestures may be determined by the neural architecture and emerging motor control mechanisms of the song control network. A restriction of experience-dependent modification to temporal redistribution of a basic set of motor commands may be a general mechanism of vocal motor learning and may therefore reflect aspects of speech development in human infants as well. If these specific modifications to the motor program for birdsong are paralleled by how the motor program for human speech changes with acoustic and social experience (Goldstein et al. 2003), the notion of an experience-independent basic speech program is strongly supported (Chomsky 1975). The effects of skill learning on motor control in other systems are consistent with this interpretation. For example, the effect of learning a reaching task by rodents is manifested in a selection and tuning of an already present motor program (Kargo and Nitz 2003).

The two experimental groups differed in their experience with an acoustic model early in life, whereas both underwent the second learning component of vocal ontogeny, the sensorimotor learning phase. However, interactions between the acquisition of an acoustic model and the subsequent perfection of self-generated vocalizations are likely because the development of the neural architecture of the motor circuitry is directly affected by the acquisition of an acoustic model (Wilbrecht et al. 2006) and therefore must influence the sensorimotor learning process. This presumed interactive process may provide a mechanistic explanation for the observation that social experience, and not specific features of the acoustic model, may cause enhanced temporal structure in the songs of birds tutored by males singing untutored song (Fehér et al. 2009). Because the tutored birds used in this study also had social experience during vocal ontogeny, this interpretation cannot be excluded. Irrespective of which aspect of experience led to the observed changes in motor control and acoustic output, the interactive aspects of experience and development of neural circuitry and control patterns most likely play an important role. It is therefore important that the changes to the motor program have been studied at the endpoint of vocal ontogeny, because experimental manipulation during development may affect this interaction.

GRANTS

This work was supported by National Institute on Deafness and Other Communication Disorders Grants DC-04390 and DC-06876.

DISCLOSURES

No conflicts of interest, financial or otherwise, are declared by the authors.

ACKNOWLEDGMENTS

We thank the reviewers for the thoughtful and constructive comments.

REFERENCES

  1. Adret P. The template concept: crafting a song replica from memory. In: Neuroscience of Birdsong, edited by Zeigler HP, Marler P. Cambridge, MA: Cambridge University Press, 2008, p. 282–299 [Google Scholar]
  2. Bandt C, Pompe B. Permutation entropy: a natural complexity measure for time series. Phys Rev Lett 88: 174102, 2002 [DOI] [PubMed] [Google Scholar]
  3. Brainard MS, Doupe AJ. What songbirds teach us about learning. Nature 417: 351–358, 2002 [DOI] [PubMed] [Google Scholar]
  4. Brenowitz EA, Margoliash D, Nordeen KW. An introduction to birdsong and the avian song system. J Neurobiol 33: 495–500, 1997 [PubMed] [Google Scholar]
  5. Chomsky N. Logical Structure of Linguistic Theory. New York: Plenum Press, 1975 [Google Scholar]
  6. Doupe AJ, Kuhl PK. Birdsong and human speech: common themes and mechanisms. In: Neuroscience of Birdsong, edited by Zeigler HP, Marler P. Cambridge, MA: Cambridge University Press, 2008, p. 5–31 [Google Scholar]
  7. Elemans CP, Mead AF, Rome LC, Goller F. Superfast vocal muscles control song production in songbirds. PLoS One 3: e2581, 2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Fehér O, Wang H, Saar S, Mitra PP, Tchernichovski O. De novo establishment of wild-type song culture in the zebra finch. Nature 459: 564–568, 2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Forstmeier W, Burger C, Temnow K, Deregnaucourt S. The genetic basis of zebra finch vocalizations. Evolution 63: 2114–2130, 2009 [DOI] [PubMed] [Google Scholar]
  10. Franz M, Goller F. Respiratory patterns and oxygen consumption in singing zebra finches. J Exp Biol 206: 967–978, 2003 [DOI] [PubMed] [Google Scholar]
  11. Fromkin V. The development of language in Genie: a case of language acquisition beyond the ‘critical period’. Brain Lang 1: 81–197, 1974 [Google Scholar]
  12. Goldstein MH, King AP, West MJ. Social interaction shapes babbling: testing parallels between birdsong and speech. Proc Natl Acad Sci USA 100: 8030–8035, 2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Goller F, Cooper B. Peripheral motor dynamics of song production in the zebra finch. Ann NY Acad Sci 1016: 130–152, 2004 [DOI] [PubMed] [Google Scholar]
  14. Goller F, Daley MA. Novel motor gestures for phonation during inspiration enhance the acoustic complexity of birdsong. Proc R Soc Lond B 268: 2301–2305, 2001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Goller F, Suthers RA. Role of syringeal muscles in gating airflow and sound production in singing brown thrashers. J Neurophysiol 765: 867–876, 1996a [DOI] [PubMed] [Google Scholar]
  16. Goller F, Suthers RA. Role of syringeal muscles in controlling the phonology of bird song. J Neurophysiol 76: 287–300, 1996b [DOI] [PubMed] [Google Scholar]
  17. Hahnloser RH, Kozhevnikov AA, Fee MS. An ultra-sparse code underlies the generation of neural sequences in a songbird. Nature 419: 65–70, 2002 [DOI] [PubMed] [Google Scholar]
  18. Jensen KK, Cooper BG, Larsen ON, Goller F. Songbirds use pure tone register in two voices to generate low frequency sound. Proc Biol Sci 274: 2703–2710, 2007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Kargo WJ, Nitz DA. Early skill learning is expressed through selection and tuning of cortically represented muscle synergies. J Neurosci 23: 11255–11269, 2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Konishi M. The role of auditory feedback in songbird. Ann NY Acad Sci 1016: 463–475, 2004 [DOI] [PubMed] [Google Scholar]
  21. Lane H. The Wild Boy of Aveyron. Cambridge, MA: Harvard University Press, 1976 [Google Scholar]
  22. Leonardo A, Fee MS. Ensemble coding of vocal control in birdsong. J Neurosci 25: 652–661, 2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Nacarrete R, Vrbová G. Activity-dependent interactions between motoneurones and muscles: their role in the development of the motor unit. Prog Neurobiol 41: 93–124, 1993 [DOI] [PubMed] [Google Scholar]
  24. Nick TA, Konishi M. Neural auditory selectivity develops in parallel with song. J Neurobiol 62: 469–481, 2005 [DOI] [PubMed] [Google Scholar]
  25. Price PH. Developmental determinants of structure in zebra finch song. J Comp Physiol Psychol 2: 260–277, 1979 [Google Scholar]
  26. Shadmehr R, Holcomb HH. Neural correlates of motor memory consolidation. Science 277: 821–825, 1997 [DOI] [PubMed] [Google Scholar]
  27. Shadmehr R, Thoroughman KA. Learning and memory formation of arm movements. In: Biomechanics and Neural Control of Posture and Movement, edited by Winters J, Crago P. New York: Springer Verlag, 2000, p 347–353 [Google Scholar]
  28. Shank SS, Margoliash D. Sleep and sensorimotor integration during early vocal learning in a songbird. Nature 458: 73–77, 2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Suthers RA, Zollinger SA. Producing song: the vocal apparatus. Ann NY Acad Sci 1016: 109–129, 2004 [DOI] [PubMed] [Google Scholar]
  30. Tchernichovski O, Lints TJ, Deregnaucourt S, Cimenser A, Mitra PP. Studying the song development process: rationale and methods. Ann NY Acad Sci 1016: 348–363, 2004 [DOI] [PubMed] [Google Scholar]
  31. Tchernichovski O, Nottebohm F, Ho CE, Pesaran B, Mitra PP. A procedure for an automated measurement of song similarity. Anim Behav 59: 1167–1176, 2000 [DOI] [PubMed] [Google Scholar]
  32. Thomson D. Spectrum estimation and harmonic analysis. Proc IEEE 70: 1055–1096, 1982 [Google Scholar]
  33. Thoroughman KA, Shadmehr R. Electromyographic correlates of learning an internal model of reaching movements. J Neurosci 19: 8573–8588, 1999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Vicario DS. Contributions of syringeal muscles to respiration and vocalization in the zebra finch. J Neurobiol 22: 63–73, 1991 [DOI] [PubMed] [Google Scholar]
  35. Wilbrecht L, Williams H, Gangadhar N, Nottebohm F. High levels of new neurons addition persist when the sensitive period for song learning is experimentally prolonged. J Neurosci 26: 9135–9141, 2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Wild JM, Goller F, Suthers RA. Inspiratory muscle activity during bird song. J Neurobiol 36: 441–453, 1998 [DOI] [PubMed] [Google Scholar]
  37. Yu AC, Margoliash D. Temporal hierarchical control of singing in birds. Science 273: 1871–1875, 1996 [DOI] [PubMed] [Google Scholar]

Articles from Journal of Neurophysiology are provided here courtesy of American Physiological Society

RESOURCES