Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 May 13.
Published in final edited form as: Nat Commun. 2012;3:1184. doi: 10.1038/ncomms2165

Behavioural and neurobiological implications of linear and non-linear features in larynx phonations of horseshoe bats

Kohta I Kobayasi a,b,c,*, Steffen R Hage a,d,*, Sean Berquist a, Jiang Feng e, Shuyi Zhang f, Walter Metzner a,c
PMCID: PMC3552533  NIHMSID: NIHMS425643  PMID: 23149729

Abstract

Mammalian vocalizations exhibit large variations in their spectrotemporal features, although it is still largely unknown which result from intrinsic biomechanical properties of the larynx and which are under direct neuromuscular control. Here we show that mere changes in laryngeal air flow yield several non-linear effects on sound production, in an isolated larynx preparation from horseshoe bats. Most notably, there are sudden jumps between two frequency bands used for either echolocation or communication in natural vocalizations. These jumps resemble changes in “registers” as in yodelling. In contrast, simulated contractions of the main larynx muscle produce linear frequency changes, but are limited to echolocation or communication frequencies. Only by combining non-linear and linear properties can this larynx therefore produce sounds covering the entire frequency range of natural calls. This may give behavioural meaning to yodelling-like vocal behaviour and reshape our thinking about how the brain controls the multitude of spectral vocal features in mammals.

Introduction

Echolocating bats produce a wide variety of different sonar pulses and also have a rich repertoire of communication calls. In horseshoe bats, for example, a long constant frequency (CF) component characterizes their echolocation pulses. When the bat is perched (“at rest”), it emits the CF-portion of its echolocation calls at its so called “resting frequency”, RF2, which normally represents the second harmonic of the calls1. The pulses usually terminate with a brief and rapid drop in frequency and sometimes also contain an initial brief frequency rise, each extending 10–15kHz below RF2. During flight, horseshoe bats change their CF-frequency by up to 5kHz below and 1–2kHz above RF213 to compensate for Doppler-shifts in the returning echo signal. This maintains the echo frequency practically constant (Doppler-shift compensation, DSC1). Communication signals of horseshoe bats, on the other hand, are spectro-temporally more complex. Their second harmonic frequencies are more than 15kHz below the RF2 of echolocation pulses4. It is still unknown, however, how the bat larynx can produce such low call frequencies.

In our present study, we chose a simplified approach as an initial step towards tackling the question of how the horseshoe bat larynx could generate the entire range of call frequencies observed in natural vocalizations. We used an isolated larynx preparation from these bats, which lacked any neuronal innervation, to test if we could reproduce the same range of frequencies in larynx phonations by merely changing the airflow in this preparation. If so, this would indicate that direct neuromuscular control by larynx muscles is not necessary. Instead, sound production would largely result from intrinsic biomechanical properties of the larynx. Alternatively, we examined if we could generate the same frequency range by simulating cricothyroid contractions in our larynx preparation. This directly addresses which role neuromuscular control by the cricothyroid plays in call frequency production. We found that changes in laryngeal air flow yielded several non-linear effects on sound production, most notably sudden jumps between two independent frequency bands, reminiscent of changes in “registers”, as in yodelling. These frequency bands matched the frequency ranges used for either echolocation or communication in natural vocalizations. On the other hand, simulated contractions of the main larynx muscle produced linear frequency changes, but only within each of the bands used for echolocation or communication. Therefore, only by combining non-linear and linear operational modes, i.e. intrinsic properties and neuromuscular control, can the horseshoe bat larynx produce sounds covering the entire frequency range of natural calls.

Results

Frequencies for communication are lower than for echolocation

In horseshoe bats, the second harmonic frequencies of communication calls are more than 15kHz below the RF2 of echolocation pulses4 (vertical pairs of arrows in Fig. 1). Interestingly, neurophysiological recordings of the spike activity5 in various motor structures demonstrate that their activity virtually ceases at call frequencies only 4–6kHz below RF2. These structures include the cricothyroid muscle, which is the sole laryngeal muscle controlling call frequencies in bats6, its motor nerve, the superior laryngeal nerve7, and neurons within the motor nucleus of laryngeal control, the nucleus ambiguus, itself (Fig. 2; surgery and stereotaxic approaches follow those outlined in detail in ref. 8, and the behavioral tests resulting in call frequency changes are identical to those published previously9,10.). While this frequency range is sufficient to account for call frequencies produced during DSC, it poses a challenge for explaining how such extremely low discharge patterns can encode for the low frequencies and large spectral modulations seen in communication calls. In addition to direct tension control of the mammalian vocal fold by a single larynx muscle, however, other mechanisms may also exist that affect tension and thus sound frequency1115.

Fig. 1. Sonagrams of typical echolocation and communication calls from two subspecies of greater horseshoe bats.

Fig. 1

Rhinolophus ferrumequinum tragatus (two individuals in a and b, respectively) and R.f. nippon (two individuals in c and d, respectively). Individual resting frequencies (RF2) of echolocation pulses (marked by “E”) given for each bat and indicated by pink hatched lines. Note the large frequency differences between RF2 and the second harmonic of communication calls, which are indicated as numbers (kHz) above the white vertical pairs of arrows.

Fig. 2. Changes in spike activity with echolocation call frequency in laryngeal motor neurons.

Fig. 2

Spike activity virtually ceases in three single neurons in the laryngeal motor nucleus, the nucleus ambiguus. Data points represent means (bin width: 0.4 kHz) ± SD (error bars). Outliers for low call frequencies are given without error bars.

Testing sound production in isolated horseshoe bat larynges

In order to determine how horseshoe bat larynges may be able to generate the entire frequency range of echolocation and communication call frequencies, we excised larynges from 11 Greater Horseshoe Bats, Rhinolophus ferrumequinum (5 males, 6 females) post mortem to produce an in vitro model of vocalization by varying tracheal air pressure. Adapting the approach by Fee et al.16 for the bird syrinx, we removed larynx, trachea and both lung lobes from the bat, cleaned them of excess connective tissue and mounted them on the experimental platform. The vocal tract was cut immediately dorsal to the epiglottis (Fig. 3a, Methods).

Fig. 3. linear and non-linear frequency changes caused by varying air pressure.

Fig. 3

Side (a.) and ventral view (b.) of excised larynx. Instead of using direct action of a vocalis muscle to change vocal fold tension, echolocating bats tilt the thyroid cartilage through action of the cricothyroid muscle (a) in a visor-like fashion. This tautens the vocal folds, resulting in higher frequencies. Direction of thyroid adduction towards cricoid, simulating cricothyroid contraction, indicated by red arrow (a,b). c.–e. Sounds generated by excised larynx (Bat 1, Table 1) during slow changes in air flow. Frequency increases linearly for rising air pressures until it suddenly jumps from higher (F1high) to lower (F1low) frequencies. All frequency changes reverse for decreasing air pressures. F1high corresponds to echolocation and F1low to communication call frequencies. c. Subglottal air pressures (top), sound fundamental frequency (F1; center) and amplitude (bottom). Dashed line in center trace: “resting frequency”, RF1, determined in live animal. d. Variations in subglottal air pressure and changes in sound frequency (blue and red dots) and amplitude (green squares). Same preparation as c (10 increases and decreases of air pressure, each). Rate of linear changes in frequency for F1high (blue dots): 2.94 kHz/kPa, and F1low (red dots): 2.4 kHz/kPa. Horizontal dashed line and RF1: fundamental component of RF of this animal determined in vivo. e. Hysteresis of sudden, non-linear changes in sound frequency. Same data as d. Arrows: upper and lower ranges of sudden frequency shifts during increases (blue and red dots) and decreases (red and blue squares) of air flow, respectively. f. Mean (symbols) and standard deviation (error bars) of F1high (blue triangles) and F1low (red triangles) produced by two larynx preparations for R.f. tragatus (left) and R.f. nippon (right). 10 trials each for increasing and decreasing air pressure. Asterisks: means of fundamental frequencies of echolocation (blue asterisks) and communication calls (red asterisks). R. f. tragatus: RF1 34.5 – 39.2kHz (n=32 bats, mean: 36.1±1.82kHz) and communication calls: 20.4 – 39.1kHz (n=32 bats; mean: 28.1±3.23kHz). RF1 in R.f. nippon: 29.7 –34.9kHz (n=25 bats, mean 31.9±2.33kHz), and communication calls: 14.4 – 33.4kHz (n=25 bats; mean: 25.1±3.58kHz).

All 11 excised larynges exhibited consistent phonations to repeated trials of changes in subglottal air pressure. Because of the large number of experimental parameters that we tested and variations in the surgical extraction of larynges, we were not able to test all parameters in all preparations. The results of each of our experimental manipulations are based on at least two preparations (actual numbers given below). All sounds had their dominant frequencies at the fundamental (F1), because the excised larynx preparation lacked the resonance cavities (especially of pharynx and nose) that, in the vocal tract of live horseshoe bats, shift the peak frequencies towards the second harmonic of the naturally produced calls1719. The range of absolute air pressures that we used to elicit phonations in the excised larynges varied between 1.5 and 3kPa. It was thus well within the normal range of subglottal air pressures measured in naturally vocalizing bats (0.9–5.9kPa20). The sound amplitudes for F1 ranged from 63 to 82dB SPL (median: 70.2±7.43dB SPL). The second harmonic was on average 23.8±7.40dB weaker.

Changing air pressure

We were able to test the effects of increases and decreases of air pressure on the production of tonal sounds in a total of 6 preparations. Figures 3 and 4 depict typical results. At the lowest subglottal air pressures that phonated the excised larynx (at approximately 1.1kPa), the sounds had fundamental frequencies between 30 and 40kHz (F1high; Figs. 3c–f). This corresponded to the range of the fundamental in naturally produced echolocation calls. The lowest air pressures produced the lowest sound frequencies, which were always more than 2kHz below the bat’s resting fundamental frequency (RF1; Figs.3c,d). Further increases in air pressure yielded a slow gradual rise in F1high (Pearson’s correlation, P<0.001, n=17) over an average range of 2.76±0.67kHz, until a sudden and rapid drop of 13.1±0.03kHz occurred that resulted in a lower F1 (F1low). This F1low was unrelated with F1high (Table 1). The sudden jump in sound frequency occurred within a pressure change of less than 0.05kPa (Fig. 3d; Suppl. Fig. S1). Further increases in laryngeal air flow restored the smooth, graded rise in sound frequency (Pearson’s correlation, P<0.001, n=17) over an average range of 3.25±0.81kHz, though at a lower fundamental frequency (Figs.3c,d). The rates of frequency change in the other 5 preparations tested were similar and there was no significant difference for changes of F1high and F1low between preparations (paired t-tests, P>0.05, n=51).

Fig. 4. High air pressures cause aperiodic larynx phonations.

Fig. 4

Below 2.1kPa, changes are similar to those shown in Fig. 3. Above 2.1kPa, the tonal (periodic) signal suddenly becomes aperiodic. Similar aperiodic components are also part of natural communication calls. a. Increases in air pressure, b. bandwidth of sound signal 30 dB below max sound amplitude, c. sonogram of produced sound, and d. waveform of sound signal for sudden transition between periodic and aperiodic sound emission (red box). e,f. Effects of changes in absolute air pressure on relative amplitude (e), as well as sound frequency (f, left ordinate and blue and red dots) and signal bandwidth (f, bottom, right ordinate and pink squares). Same larynx preparation as in ad. Data represent averages of 10 increases and decreases of air pressure, each. Vertical bars in e,f represent SDs and vertical dashed lines indicate the range of air pressure for rapid shifts between F1high and F1low. g,h. Transition between periodic and aperiodic components in sonagram (g) and waveform (h, red box) of natural communication call of R.f. nippon.

Table 1. Frequency profiles for individual bats.

Average values for sudden, non-linear falls (F1high to F1low) and rises (F1low to F1high) in sound frequency for all 6 preparations tested.

# trials (n) fall (Hz) rise (Hz) ratio F1low/F1high
Bat 1 11 13,175 ± 37 12,731 ± 146 0.626 ± 0.004
Bat 2 6 9,803 ± 462 10,022 ± 344 0.778 ± 0.010
Bat 3 17 6,698 ± 297 6,758 ± 313 0.833 ± 0.010
Bat 4 7 6.313 ± 561 5,669 ± 624 0.813 ± 0.021
Bat 5 5 5,528 ± 580 5,633 ± 580 0.831 ± 0.024
Bat 6 5 7,200 ± 470 6,932 ± 351 0.760 ± 0.0

All frequency shifts, slow graded as well as non-linear sudden jumps, reversed for decreases in air pressure (Figs.3c–e). Slow decreases in F1high as well as F1low for decreasing air pressures occurred at the same rate as the rises in frequency for increasing air flows (no significant difference; paired t-test, P>0.05, n=17). The amount of frequency drop or rise that occurred during sudden, non-linear changes did not differ between increases and decreases of air pressure (Table 1; paired t-test, P>0.3, n=51). The absolute frequency values at which the sudden changes took place did, however: for increasing air pressures, the sudden drops occurred at significantly higher pressures and therefore higher frequencies than the sudden rises during decreases in air flow (Fig. 3e, paired t-test, P<0.001, n=17). The frequency values for the jumps were very consistent in each preparation resulting in a very robust hysteresis behavior.

Interestingly, sudden non-linear jumps in frequency were not accompanied by sudden shifts in sound amplitude. Instead, sound amplitude continued to rise (or fall) linearly with further increases (or decreases) in subglottal air pressure (Fig. 3c and 4b; R=0.75, Pearson’s correlation, P<0.001, n=17).

Were the values of F1high and F1low related to any naturally produced call frequencies? We found that F1high matched the frequency range of the fundamental CF-component of echolocation calls emitted at rest (RF1), whereas F1low overlapped with the range of fundamental frequencies in communication calls for individuals of R. f. tragatus in our colony of captive horseshoe bats4(Fig. 3f left; paired t-test, P>0.3, n=23). Interestingly, we found that such a match also occurred in another subspecies of horseshoe bats, R. f. nippon, which generally emitted lower call frequencies (Fig. 3f, right; paired t-test, P>0.3, n=16).

When we increased subglottal air pressure to even higher levels than the maximum levels depicted in Figure 3 (6 preparations), we observed yet another nonlinear effect in addition to the sudden drop from F1high to F1low described above. Sound production suddenly switched from periodic (tonal) signals at F1low to aperiodic (chaotic) emission of signals. These aperiodic signals had bandwidths of more than 10 kHz mainly around F1low (Fig. 4). The switch from periodic to aperiodic sounds occurred within less than 200μs or only 2–3 cycles of the tonal signal. Decreasing the air pressure reversed the effect and the signal switched back to a tonal sound emitted at F1low.

Figure 4b depicts average results of this switch to aperiodic sound emission from 10 trials (same larynx preparation as in Fig. 4a). It shows how frequency, bandwidth and amplitude changed with altering air pressure. Below approximately 2.1kPa, increasing air pressures caused frequency changes similar to those depicted in Fig. 3. Initially, the frequency (Fig. 4b, blue dots in bottom panel) increased within F1high until a rapid downward shift to F1low around 1.9kPa (red dots in Fig. 4b). At approximately 2.1kPa, the bandwidth of the signal suddenly increased (Fig. 4b, pink squares in bottom panel). This marked the transition from periodic to aperiodic sound emission.

Rapid transitions between periodic and aperiodic signals are also commonly found in naturally produced communication calls of horseshoe and other bats (Fig. 4c), accounting for almost 10% of a bat’s repertoire4,21. Fig. 5 illustrates such spectro-temporal similarities between larynx phonations and natural horseshoe bat calls.

Fig. 5. Features of periodic and aperiodic components in larynx phonations and natural communication calls.

Fig. 5

Sonagrams of a phonation of an excised larynx (a) and a natural communication call (b, similar to Fig 2c). c–f Spectral features analyzed in a 0.8 ms time window at the time points indicated by vertical bars in (a) and (b). Upper traces: embedded orbits of the extracted segment with a time delay of 0.004ms. Center traces: corresponding waveforms. Lower traces: corresponding power spectra.

Simulated cricothyroid contractions

The cricothyroid appears to be the sole laryngeal muscle controlling call frequencies in bats5,6. We examined its role in sound production by simulating its contractions. For this purpose, we adducted the thyroid cartilage towards the cricoid cartilage (Fig. 3a,b) over a total distance of up to 500μm. This corresponded to a shortening of the cricothyroid by approximately 25–30%. As a result, the thyroid tilted downward, tautening the vocal folds (Fig. 3a). The subglottal air pressure remained unchanged and was set at a value that produced stable phonation of periodic signals within the range of F1high (n=3) and F1low (n=2), respectively. Figure 6a,b shows one representative trial for quasi-sinusoidal changes in adduction and the resulting variations in sound F1. When the cricothyroid cartilage was not adducted and presumably close to its resting position, call frequencies emitted at F1high were approximately 3.5kHz below the bat’s RF1.

Fig. 6. Effects of simulated cricothyroid contractions on phonation frequencies.

Fig. 6

Different degrees of adduction of the thyroid (simulated contraction of cricothyroid muscle) on sound frequency caused frequency changes that were highly linear but limited to sounds emitted either within F1high or F1low. a. Different degrees of adduction of the thyroid towards the cricoid cartilage (top panel) and b., resulting changes in sound frequency. Air pressure was maintained constant at a level yielding sounds emitted at F1high. c. Frequency changes for two preparations (pink dots and green squares, respectively; both from R.f. nippon) caused by adduction of the thyroid over a distance up to approximately 500 μm towards the cricoid cartilage (10 trials each). Pink dots depict same preparation as shown in a. and b., and horizontal dashed line indicates the fundamental of the resting frequency (RF1 = 34.8 kHz) for this animal obtained in vivo. Ranges of echolocation pulses and communication calls for this subspecies indicated at right (same data as in Fig. 3f).

Different degrees of adduction caused frequency changes that were highly linear for sounds emitted at both F1high and F1low. Figure 6c shows data for two preparations of R.f. nippon. We tested one preparation of R.f. tragatus for changes in F1high only and it also exhibited a linear correlation with R=0.86 (Pearson’s correlation, P<0.001, n>250). In the two cases depicted in Figure 6c, adduction of the thyroid from 10μm to over 400μm increased the values for F1high from 30.5 to 34.8kHz (regression line for pink dots; RF1 of this bat was 33.8kHz) and from 29.7 to 32.0kHz (regression line for green squares). For the second harmonics, this corresponded to a range of 8.6kHz (up to 6.6kHz below and 2.0kHz above the RF2 of 67.6kHz) and 4.6kHz, respectively. This was sufficient to explain the range of natural echolocation pulses produced while at rest and during DSC13. (Note that sounds emitted at F1high for adductions of more than approximately 400μm became extremely weak and virtually disappeared in the background noise. These data were therefore not included.) For sound frequencies emitted at F1low, thyroid adductions yielded average frequencies that increased from 21.4 to 27.5kHz (regression line for pink dots) and from 23.0 to 27.0kHz (regression line for green squares). The average second harmonics for F1low therefore reached frequencies that were only 3.0 and 2.7kHz, respectively, lower than the lowest values emitted for F1high. For the case in which we were able to determine RF2 in vivo (pink dots), the second harmonics reached on average up to 24.8kHz below RF2, which matched the frequency difference observed between echolocation and most communication signals (Fig. 1). The rates of frequency change per 100μm movement of the thyroid cartilage were 0.6kHz and 1.2kHz for F1high and 0.6 and 1.1kHz for F1low. In summary, combining the linear frequency changes of F1high and F1low that were caused by thyroid adductions at the two different air flows covered virtually the entire range of the call repertoire of horseshoe bats4(Fig. 6c).

Discussion

Our data show that changes in subglottal air pressure within a naturally occurring range20 were sufficient to reproduce many features of natural vocalizations in horseshoe bats. Most notably, they were small linear frequency changes and non-linear jumps between frequency bands used for echolocation and communication, respectively. This indicates that these characteristics are based upon biomechanical properties of the horseshoe bat larynx and do not require neuromuscular control by the brain.

Many other vertebrate larynges exhibit similar frequency jumps, including in humans, where they reflect “register changes”16,2225, such as the switch from chest voice to falsetto during yodelling, for example. Such register changes result from changes in vocal fold vibrations. Therefore, only direct measurements of these vibrations will of course determine how they relate to acoustic features that one observes in larynx phonations.

Nevertheless, what distinguishes the observations in horseshoe bats is that the frequency bands occupied by the two states matched the two major frequency bands that these bats use for sonar and social communication, respectively. It is therefore tempting to speculate that a behavior similar to yodelling enables horseshoe bats to switch between echolocation and communication. Borrowing terminology used for the human voice, our bats would then emit communication sounds, for example, in “modal” or “chest voice” and echolocation pulses in “falsetto”.

We measured the effects of changes in air pressure on sound production in excised larynges, similar to studies in humans26,27, dogs28, or the bird syrinx16. Therefore, this raises the question how sound production may differ between in vitro and in vivo conditions. An excised larynx (or syrinx) may show a reduced structural stability and/or it may exhibit different air pressures22. We are aware of only one study that examined differences between in vitro and intact vocal organs, the zebra finch syrinx, in detail29. The authors found that nonlinearities occurred only in vitro, probably because of altered air pressures. The sudden rapid acoustic transitions observed in natural zebra finch song are therefore most likely caused by direct control from superfast muscles29,30. Results from preparations of excised vocal organs therefore should always be interpreted cautiously.

Nevertheless, pressures in the in vitro preparations of humans, dogs and our horseshoe bats, on the other hand, were similar to the natural pressures measured in vivo20,2628. Although it is impossible to determine whether the anatomical position of the cartilages (and therefore of the muscles) in our in vitro preparations corresponded to the condition in vivo, we found numerous striking similarities between phonations produced by excised larynges and natural vocalizations: the frequency bands of F1high and F1low matched the frequency ranges of echolocation and communication calls. This even occurred in two subspecies with very different average frequencies of their echolocation and communication calls (Fig. 3f). In addition, the frequency ranges that we produced by simulating cricothyroid contractions corresponded extremely well with the ranges naturally used for echolocation (DSC) and communication (Fig. 5c). Finally, RFs measured in live bats just a few days before we examined their excised larynges also fell within the proper range of F1high frequencies produced by the excised larynx (RF1 in Figs. 3c and 6c).

What causes linear frequency changes in horseshoe bat vocalizations? Although changes in air pressure did produce smooth, gradual changes of F1high and F1low, it seems unlikely that changes in air pressure alone are the immediate cause for linear call frequency changes in live bats: Echolocating and flying horseshoe bats can change the CF-portion of their second harmonics during DSC between 6kHz below and 1.5kHz above their RF213, and communication call frequencies vary by at least 10kHz4. In comparison, subglottic pressure changes generated much smaller frequency changes (2.76±0.67kHz for F1high and 3.25±0.81kHz for F1low, i.e., twice the amount for the second harmonics emitted by live bats). Variations in subglottic pressure alone are therefore insufficient to explain the entire range of echolocation and communication call frequencies produced by live bats. Other work also indicates that the subglottal pressure has only a limited, if any, effect on the frequency of bat echolocation pulses31.

Instead, cricothyroid contractions represent most likely the major source for these linear changes, but only within each of the frequency bands. When we simulated cricothyroid contraction, the phonation frequencies changed linearly within the frequency bands of either F1high (echolocation) or F1low (communication). This limitation is most likely not caused by an insufficient adduction of the thyroid cartilage for several reasons. The amount of cricothyroid shortening that we simulated in our preparations corresponds to length changes observed in other larynx preparations32,33. It also represents the upper limit of adduction in human larynges31. Linear relationships between vocal pitch and the distance between cricoid and thyroid cartilage have also been reported in humans34. Similarly, electrical stimulations of the cricothyroid muscle in horseshoe bats revealed highly linear call frequency changes5, although the data were limited to the narrow range between RF2 and 1.2 kHz above. Neurophysiological recordings from the cricothyroid muscle, its motor nerve, the superior laryngeal nerve, and from neurons in the nucleus ambiguus demonstrated a linear decline in spike activity from about 40 to 10 spikes/s with decreasing call frequency from RF2 to 3–5kHz below5,7,35(see also Suppl. Fig. 1). Extrapolating these data suggests that the neuronal activity would cease only 5–6kHz below RF2. Even if spike activities did not completely cease at these lower call frequencies, it is challenging to imagine how such extremely reduced spike levels could encode for the large modulations in call frequencies seen in communication calls (Fig. 1). The frequency range covered by linear changes in spike activity corresponds only to the frequencies produced by simulated cricothyroid contractions at F1low (Fig. 4c). This covers the range of call frequencies emitted during echolocation, including DSC13, but not lower-pitched communication signals. Assuming that the cricothyroid is the sole, or at least major muscle responsible for call frequency control in bats5,20, horseshoe bats could generate all behaviorally relevant call frequencies therefore only by combining linear changes in frequency caused by cricothyroid contraction with non-linear effects from varying subglottal air pressure.

In addition to direct tension control of the mammalian vocal fold by a single larynx muscle, such as the cricothyroid, however, other mechanisms may also affect vocal fold tension and thus phonation frequency. For example, additional muscles, such as the thyroarytenoid11,15 or changes in the subglottal pressure12,14,36 could also contribute. Finally, when muscle contractions or relaxations approach their maxima, non-linear relationships between muscle contractions and the resulting vocal fold tension may occur. Nevertheless, various evidence suggests that the cricothyroid represents the major source for call frequency control in bats. Only sections of the superior laryngeal, not of the recurrent nerve, lowered call frequency and amplitude5,6,20. Conversely, only spike activities of the superior laryngeal nerve correlated with call frequencies7,37. Finally, simulated contractions of the cricothyroid yielded phonation frequencies that strikingly matched the frequencies emitted during echolocation and communication, including RFs that we obtained from individual live bats.

In conclusion, it appears therefore that the neural networks that control the activity of the cricothyroid muscle and those that generate the subglottal air pressure through the amount of adduction of the vocal folds are more tightly interconnected than previously assumed38,39. The cricothyroid is innervated by the superior laryngeal nerve, whose motoneurons are located in the anterior portion of the nucleus ambiguus5,7,37. In contrast, vocal fold adduction is controlled by the recurrent laryngeal nerve, which originates from motoneurons in the posterior nucleus ambiguus7,35,40. Hence, we generally assumed that separate motor circuits mediate spectral and temporal call features: supralaryngeal nerve activity would account for call frequency. On the other hand, the recurrent laryngeal nerve would control the amount of opening and closing of the glottis, i.e., vocal fold adduction, and thus determine mostly temporal call features7,35,3840. The data presented here indicate, however, that call frequency changes involve more than merely supralaryngeal motor commands and contractions of the cricothyroid. They also require coordinating vocal fold adduction with subtle modulations of the subglottal airflow. The latter result in non-linear changes in vocal fold vibrations that give rise to sudden jumps in laryngeal phonations. Therefore, the neural circuit that controls glottal airflow not only determines temporal features, such as on-and offset, and the amplitude of calls but also allows bats to switch between generating echolocation and communication calls. In addition, it is intricately linked with neuromuscular control of the cricothyroid. Call amplitudes are also intricately linked with contractions of expiratory, mostly abdominal muscles41, which are innervated by spinal nerves. How these different motoneuron pools achieve such a precise gating of activity is currently unknown.

Finally, it is important to note that recent phylogenetic evidence suggests that several acoustic features of bat echolocation evolved convergently multiple times and horseshoe bats with their sophisticated echolocation share a common ancestry with non-echolocating Old-World fruit bats42. This implies that there is most likely more than one design principle for the neural circuits and mechanisms underlying bat echolocation.

Methods

Animals

Greater Horseshoe Bats, Rhinolophus ferrumequinum, were collected in China and belonged to two different subspecies with slightly different call features (see Results): 3 males and 3 females (from Beijing) belonged to R.f. tragatus, and 2 males and 3 females belonged to R.f. nippon (from Ji An/Jilin). Procedures were in accordance with NIH guidelinesfor experiments involving vertebrate animals and were approved by UCLA’s IACUC.

Preparation and setup

Following the approach of Fee et al.16 for the bird syrinx, we removed larynx, trachea and both lung lobes from the bat, cleaned them of excess connective tissue and mounted them on the experimental platform. We cut the vocal tract immediately dorsal to the epiglottis (Fig. 3b; length of attached trachea between larynx and bronchial branches: approximately 13 mm) and inserted a small piece of polyethylene tubing (2 cm long, I.D. 1.14 mm, O.D. 1.57 mm; Clay Adams Inc., New York, NY, USA) into the trachea and held it in place with a string. The opposite end of the tubing was inserted into a polyvinyl chloride tubing (I.D. 4.7 mm, O.D. 7.9 mm; Fisher Scientific. CA, USA) and connected to a jar providing warm, humid air (temperature: 35–40°C; humidity: >94%) via a pressurized air outlet with manually adjustable pressure. We measured subglottal changes in air pressure with an electric pressure sensor (MPX5050DP; Freescale Semiconductor, Austin, TX, USA), which was inserted into the tubing close to the trachea. We determined absolute tracheal air pressures based on the pressure measured by the sensor and corrected for the 0.45 times smaller diameter of the tubing inserted into the trachea. We varied tracheal air pressures over a range of approximately 3.5 kPa. Applying isotonic saline solution (0.9% NaCl) to the surface of the larynx and trachea every 5 min maintained a moist preparation during experimental procedures. To simulate contraction of the cricothyroid muscle, we altered the distance between the thyroid and cricoid cartilage with a pair of forceps (Dumont #5, Fine Science Tools, Foster City, CA; width of tips: approximately 50 μm) and an attached slide potentiometer (linear motion, Type V448 MONO, CTS electronic components; Taiwan). This allowed us to quantify the simulated cricothyroid motion with an accuracy of 15 μm (standard error of potentiometer).

Experimental procedure

Before we began to systematically measure the effects of varying air flow on sound production, we determined the overall range of air pressure that generated sound. For this purpose, we increased air pressure linearly from a level slightly below the lowest level at which sounds were produced and then decreased it again. Subsequently, we varied air pressure over this range at least 10 times for a period of 1–5 s each.

We simulated cricothyroid muscle contractions by changing the distance between the cricoid and thyroid cartilage with a pair of forceps. One tip of the measuring forceps was placed immediately above the protruding rostroventral edge of the thyroid and the other tip immediately below the equally bulging caudoventral edge of the cricoid cartilage (see Fig. 2a,b). We used a resistor positioned between the shanks of the forceps to measure the changes in the distance between the tips by converting them into changes in DC voltage. Particular care was taken not to damage the laryngeal tissue and therefore, the forceps were not permanently attached to the cartilages (e.g., by gluing them to the tissue) but merely touched the edges of the cartilages. Therefore, we could only compress the forceps simulating muscle contraction. Relaxation of the cricothyroid muscle could not be measured reliably and was therefore not analyzed.

Recording procedure, data analysis, and statistical analysis

We used a ¼ in ultrasonic microphone (4939 with preamplifier 2633, Brüel & Kjær, Nærum, Denmark), positioned at a distance of 15 cm from the rostral end of the larynx to record the phonations. The recording system had a flat frequency response of ±3 dB between 10 kHz and 100 kHz. Sound signals and outputs of the pressure sensor and the resistor were recorded (16 bit, 300 kHz sample rate; UltraSoundgate 116, Avisoft Bioacoustics, Berlin, Germany) and analyzed with commercial analysis software (SASLab Pro; Avisoft Bioacoustics, Berlin, Germany; length of FFT: 1024 points with 94% overlap, spectral resolution: 195 Hz). We determined the bandwidth of aperiodic signals by measuring the width of the power spectrum 30dB below the peak. Statistical analysis was performed using the Pearson’s correlation coefficient (level of significance: probability of error <5%).

Supplementary Material

1

Acknowledgments

Support from the Sound Technology Promotion Foundation, Japan, to KIK, Deutsche Forschungsgemeinschaft to SRH, Chinese Natural Science Foundation to JF and SZ, a cooperation grant from the Chinese Education Ministry to JF and WM, and NIDCD to WM. We thank Jie Ma, Tinglei Jiang, and Jennifer Ho for their assistance during the experimental procedures and manuscript preparation and Jürgen Neubauer for discussion and comments on the manuscript.

Footnotes

Author contributions:

KIK and SRH (equal contribution) as well as WH designed the study, KIK and SRH (equal contribution) performed the experiments, SB assisted in experiments, KIK, SRH and WH analyzed and interpreted the data and wrote the manuscript, and JF and SZ were detrimental in providing experimental animals and support.

None of the authors have any competing financial interests.

None declared

References

  • 1.Schnitzler HU. Die Ultraschallortungslaute der Hufeisennasen-Fledermäuse (Chiroptera, Rhinolophidae) in verschiedenen Orientierungssituationen. Z Vergl Physiol. 1968;57:376–408. [Google Scholar]
  • 2.Metzner W, Zhang SY, Smotherman MS. Doppler-shift compensation behavior in horseshoe bats revisited: auditory feedback controls both a decrease and an increase in call frequency. J Exp Biol. 2002;205:1607–1616. doi: 10.1242/jeb.205.11.1607. [DOI] [PubMed] [Google Scholar]
  • 3.Schuller G, Beuter K, Schnitzler HU. Response to frequency-shifted artificial echoes in the bat, Rhinolophus ferrumequinum. J Comp Physiol A. 1974;89:275–286. [Google Scholar]
  • 4.Ma J, Kobayasi K, Zhang S, Metzner W. Vocal communication in adult greater horseshoe bats, Rhinolophus ferrumequinum. J Comp Physiol A. 2006;192:535–550. doi: 10.1007/s00359-006-0094-9. [DOI] [PubMed] [Google Scholar]
  • 5.Schuller G, Suga N. Laryngeal mechanism for the emission of CF-FM sounds in the Doppler-shift compensating Greater Horseshoe Bat, Rhinolophus ferrumequinum. J Comp Physiol A. 1976;107:253–262. [Google Scholar]
  • 6.Novick A, Griffin DR. Laryngeal mechanisms in bats for the production of orientation sounds. J Exp Zool. 1961;148:125–145. doi: 10.1002/jez.1401480203. [DOI] [PubMed] [Google Scholar]
  • 7.Schuller G, Rübsamen R. Laryngeal nerve activity during pulse emission in the CF-FM bat, Rhinolophus ferrumequinum. I. Superior laryngeal nerve. J Comp Physiol A. 1981;143:317–321. [Google Scholar]
  • 8.Metzner W. An audio-vocal interface in echolocating horseshoe bats. J Neurosci. 1993;13:1899–1915. doi: 10.1523/JNEUROSCI.13-05-01899.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Smotherman M, Kobayasi K, Ma J, Zhang S, Metzner W. A mechanism for vocal-respiratory coupling in the mammalian parabrachial nucleus. J Neurosci. 2006;26:4860–4869. doi: 10.1523/JNEUROSCI.4607-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Smotherman M, Metzner W. Fine control of call frequency by horseshoe bats. J Comp Physiol A. 2003;189:435–446. doi: 10.1007/s00359-003-0422-2. [DOI] [PubMed] [Google Scholar]
  • 11.Kempster GB, Larson CR, Kistler MK. Effects of electrical stimulation of cricothyroid and thyroarytenoid muscles on voice fundamental frequency. J Voice. 1988;2:221–229. doi: 10.1044/jshr.3004.552. [DOI] [PubMed] [Google Scholar]
  • 12.Riede T. Subglottal pressure, tracheal airflow, and intrinsic laryngeal muscle activity during rat ultrasound vocalization. J Neurophysiol. 2011;106:2580–2592. doi: 10.1152/jn.00478.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Riede T, Suthers R. Vocal tract motor patterns and resonance during constant frequency song: the white-throated sparrow. J Comp Physiol A. 2009;195:183–192. doi: 10.1007/s00359-008-0397-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Titze IR. On the relation between subglottal pressure and fundamental frequency in phonation. J Acoust Soc Am. 1989;85:901–906. doi: 10.1121/1.397562. [DOI] [PubMed] [Google Scholar]
  • 15.Titze IR, Luschei ES, Hirano M. Role of the Thyroarytenoid Muscle in Regulation of Fundamental Frequency. J Voice. 1989;3:213–224. [Google Scholar]
  • 16.Fee MS, Shraiman B, Pesaran B, Mitra PP. The role of nonlinear dynamics of the syrinx in the vocalizations of a songbird. Nature. 1998;395:67–71. doi: 10.1038/25725. [DOI] [PubMed] [Google Scholar]
  • 17.Hartley DJ, Suthers RA. The acoustics of the vocal tract in the horseshoe bat rhinolophus-hildebrandti. J Acoust Soc Am. 1988;84:1201–1213. [Google Scholar]
  • 18.Suthers RA, Hartley DJ, Wenstrup JJ. The acoustic role of tracheal chambers and nasal cavities in the production of sonar pulses by the horseshoe bat, Rhinolophus hildebrandti. J Comp Physiol A. 1988;162:799–813. doi: 10.1007/BF00610969. [DOI] [PubMed] [Google Scholar]
  • 19.Roberts LH. Cavity resonances in the production of orientation cries. Period Biol. 1973;75:27–32. [Google Scholar]
  • 20.Suthers RA, Fattu JM. Selective laryngeal neurotomy and the control of phonation by the echolocating bat, Eptesicus. J Comp Physiol A. 1982;145:529–537. [Google Scholar]
  • 21.Kanwal JS, Matsumura S, Ohlemiller K, Suga N. Analysis of acoustic elements and syntax in communication sounds emitted by mustached bats. J Acoust Soc Am. 1994;96:1229–1254. doi: 10.1121/1.410273. [DOI] [PubMed] [Google Scholar]
  • 22.Alipour F, Scherer RC. On pressure-frequency relations in the excised larynx. J Acoust Soc Am. 2007;122:2296–2305. doi: 10.1121/1.2772230. [DOI] [PubMed] [Google Scholar]
  • 23.Berry DA, Herzel H, Titze IR, Story BH. Bifurcations in excised larynx experiments. J Voice. 1996;10:129–138. doi: 10.1016/s0892-1997(96)80039-7. [DOI] [PubMed] [Google Scholar]
  • 24.Suthers RA, et al. Voices of the dead: complex nonlinear vocal signals from the larynx of an ultrasonic frog. J Exp Biol. 2006;209:4984–4993. doi: 10.1242/jeb.02594. [DOI] [PubMed] [Google Scholar]
  • 25.Wilden I, Herzel H, Peters G, Tembrock G. Subharmonics, biphonation, and deterministic chaos in mammal vocalization. Bioacoustics. 1998;9:171–196. [Google Scholar]
  • 26.Švec JG, Schutte HK, Miller DG. On pitch jumps between chest and falsetto registers in voice: Data from living and excised human larynges. J Acoust Soc Am. 1999;106:1523–1531. doi: 10.1121/1.427149. [DOI] [PubMed] [Google Scholar]
  • 27.Tokuda IT, Horáček J, Švec JG, Herzel H. Bifurcations and chaos in register transitions of excised larynx experiments. Chaos. 2008;18:013102. doi: 10.1063/1.2825295. [DOI] [PubMed] [Google Scholar]
  • 28.Jiang JJ, Zhang Y, Ford CN. Nonlinear dynamics of phonations in excised larynx experiments. J Acoust Soc Am. 2003;114:2198–2205. doi: 10.1121/1.1610462. [DOI] [PubMed] [Google Scholar]
  • 29.Elemans CP, Laje R, Mindlin GB, Goller F. Smooth operator: avoidance of subharmonic bifurcations through mechanical mechanisms simplifies song motor control in adult zebra finches. J Neurosci. 2010;30:13246–13253. doi: 10.1523/JNEUROSCI.1130-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Elemans CP, Mead AF, Rome LC, Goller F. Superfast vocal muscles control song production in songbirds. PLoS One. 2008;3:e2581. doi: 10.1371/journal.pone.0002581. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Fattu JM, Suthers RA. Subglottic pressure and the control of phonation by the echolocating bat, Eptesicus. J Comp Physiol A. 1981;143:465–475. [Google Scholar]
  • 32.Alipour F, Titze I. Active and passive characteristics of the canine cricothyroid muscles. J Voice. 1999;13:1–10. doi: 10.1016/s0892-1997(99)80056-3. [DOI] [PubMed] [Google Scholar]
  • 33.Perlman AL, Alipour-Haghighi F. Comparative study of the physiological properties of the vocalis and cricothyroid muscles. Acta Oto-laryngologica. 2009;105:372–378. doi: 10.3109/00016488809097021. [DOI] [PubMed] [Google Scholar]
  • 34.Kitajima K, Tanabe MNI. Cricothyroid distance and vocal pitch. Experimental surgical study to elevate the vocal pitch. Ann Otol Rhinol Laryngol. 1979;88:52–55. doi: 10.1177/000348947908800109. [DOI] [PubMed] [Google Scholar]
  • 35.Rübsamen R, Betz M. Control of echolocation pulses by neurons of the nucleus ambiguus in the rufous horseshoe bat, Rhinolophus rouxi. I. Single unit recordings in the ventral motor nucleus of the laryngeal nerves in spontaneously vocalizing bats. J Comp Physiol A. 1986;159:675–687. doi: 10.1007/BF00612040. [DOI] [PubMed] [Google Scholar]
  • 36.Riede T, Tokuda IT, Farmer CG. Subglottal pressure and fundamental frequency control in contact calls of juvenile Alligator mississippiensis. J Exp Biol. 2011;214:3082–3095. doi: 10.1242/jeb.051110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Rübsamen R, Schuller G. Laryngeal nerve activity during pulse emission in the CF-FM bat, Rhinolophus ferrumequinum. II. The recurrent laryngeal nerve. J Comp Physiol A. 1981;143:323–327. [Google Scholar]
  • 38.Metzner W, Schuller G. In: Handbook of mammalian vocalization. Brudzynski SM, editor. Academic Press; New York: 2009. pp. 403–416. [Google Scholar]
  • 39.Jürgens U. The neural control of vocalization in mammals: a review. J Voice. 2009;23:1–10. doi: 10.1016/j.jvoice.2007.07.005. [DOI] [PubMed] [Google Scholar]
  • 40.Schweizer H, Rübsamen R, Rühle C. Localization of brain stem motoneurones innervating the laryngeal muscles in the rufous horseshoe bat, Rhinolophus rouxi. Brain Res. 1981;230:41–50. doi: 10.1016/0006-8993(81)90390-5. [DOI] [PubMed] [Google Scholar]
  • 41.Lancaster WC, Henson OW, Jr, Keating AW. Respiratory muscle activity in relation to vocalization in flying bats. J Exp Biol. 1995;198:175–191. doi: 10.1242/jeb.198.1.175. [DOI] [PubMed] [Google Scholar]
  • 42.Jones G, Teeling EC. The evolution of echolocation in bats. Trends in Ecology & Evolution. 2006;21:149–156. doi: 10.1016/j.tree.2006.01.001. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES