Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Feb 21.
Published in final edited form as: Curr Biol. 1993 Nov 1;3(11):755–758. doi: 10.1016/0960-9822(93)90023-h

Distortion in those good vibrations

Mario A Ruggero 1
PMCID: PMC3578393  NIHMSID: NIHMS273181  PMID: 15335839

Abstract

The features that make inner ear hair cells so sensitive to vibrations may also be responsible for the introduction of surprisingly large distortions.


In their unending search for the ultimate stereo electronic components, high-fidelity aficionados eagerly peruse manufacturers' specifications, always hoping for lower, distortion figures. Whether the electronic components are loudspeakers, amplifiers, compact-disk players or digital-tape machines, the goal is the same: components should be ‘transparent’, delivering only the beauty of the original musical composition and adding no additional sounds of their own. Paradoxically, the human ear, the biological receptor that the electronic stereo components are designed to gratify, generates distortion to an extent that would be unacceptable in electronic equipment. Most astonishingly, it is becoming increasingly clear that this generation of distortion is part and parcel of the very biological apparatus that endows hearing with its marvelous sensitivity and acuity.

That the human auditory system generates distortion of its own has been known since the 18th century. Armed with little scientific apparatus, probably nothing more sophisticated than a violin, the Italian musician Giuseppe Tartini discovered circa 1754 that when listening to a pair of tones, humans hear additional tones that are not present in the physical stimulus [1]. These additional tones, whose perceived magnitude can exceed 10% of that of the acoustic stimulus (at least 100 times what is common in high-fidelity equipment), have frequencies that consist of simple combinations of the frequencies of the primary tones. For example, for primary-tone frequencies of f1 and f2, where f2 > f1, the resulting ‘combination’, or Tartini, tones, also known as intermodulation-distortion products, will have frequencies of 2f1–f2, f2–fl, 2f2–f1, 3f1–2f2 and so on.

The study of combination tones, first by means of psychophysical measurements in humans and more recently via physiological experiments in the auditory nerve and inner ear of animal subjects, has played a central role in the development of auditory theory. To appreciate why such a ‘second order’ effect has been deemed worthy of intensive investigation by hearing scientists, it is helpful to keep in mind an overview of the organization of the auditory system. In brief, the auditory system consists of three components: the ear (external, middle and inner), the auditory nerve and the auditory regions of the brain (Fig. 1). Sound impinges upon the external ear and the eardrum, setting in motion ossicles in the middle ear; these, in turn, create pressure fluctuations in the largely fluid-filled inner ear or cochlea, causing a slow displacement wave to propagate along the organ of Corti (the organ of hearing proper) and its substratum, the basilar membrane.

Fig. 1.

Fig. 1

A schematic diagram of the structure of the human ear.

The organ of Corti, the basilar membrane and the cochlear fluids jointly perform a spatial frequency analysis: waves resulting from low-frequency stimulation travel the length of the cochlea and peak toward its apex, whereas waves evoked by high-frequency stimuli reach a peak near the cochlear base and do not propagate further. Thus, each cochlear site is tuned to a single, or ‘characteristic’ frequency. The displacement of the basilar membrane deflects the ‘hairs’, or stereocilia, located at the top of the hair cells of the organ of Corti, causing them to generate electrical receptor potentials that mimic the acoustic stimulus, much like tiny microphones. Finally, the receptor potentials of inner hair cells produce chemically-mediated excitation in the peripheral terminals of cochlear afferent neurons, generating trains of action potentials that travel to the brain via the auditory nerve (Fig. 2).

Fig. 2.

Fig. 2

Simplified bird's eye view of the mammalian cochlea. The organ of Corti, here represented by three rows of outer hair cells and one row of inner hair cells, rests on the basilar membrane. Acoustic-stimulus frequencies are organized spatially along the basilar membrane, with the basal region responding maximally to high frequencies and the apical region responding mainly to low frequencies. Stimuli consisting of paired tones (f1 and f2, with f2 > f1) appear to generate mechanical distortion products in the responses of the outer hair cells, which are injected into the local vibration of the basilar membrane (blue arrow). Subsequently, each distortion product propagates (red arrow) toward the cochlear site with appropriate characteristic frequency (2f1–f2 in this illustration), where it is detected by the inner hair cells and signaled to the brain via the auditory nerve (green arrow). Distortion products also propagate toward the middle ear, being measurable at the external ear canal as otoacoustic emissions (left red arrow).

A century after the discovery of combination tones, the polymath Herman LF Helmholtz dedicated a chapter of his influential book, On the Sensations of Tone [2], to this topic, suggesting that the distortion products arise as a result of overloading the eardrum and ossicles of the middle ear with too-intense stimuli. More modern psycho-acoustical findings led Goldstein, in 1967, to dispute Helmholtz's hypothesis and to put forth prescient views of cochlear function that are amazingly in accord with our present knowledge [3]. Firstly, Goldstein emphasized that combination tones are audible even at low stimulus intensities and, more importantly, that their relative audibility (in other words, audibility normalized to stimulus level) remains fairly constant over wide ranges of stimulus intensities. In the case of acoustic overload, the expectation is that combination tones should be inaudible at moderate levels and that their relative audibility should grow as a power function with stimulus intensity. Secondly, based on the strong dependence of combination-tone perception on the frequency separation of the primary tones, Goldstein ruled out the middle ear as the site of origin of distortion products and proposed that they arise in the cochlea, the only sensory structure capable of performing a frequency analysis of acoustic signals. Thirdly, he tentatively pinpointed the site of origin of mechanical distortion products to be the hair cells in the cochlear region most sensitive to the primary tones (blue arrow in Figure 2) and suggested that, upon generation, each mechanical distortion product propagates along the basilar membrane to the site whose characteristic frequency matches its own frequency (hatched area toward apex in Figure 2), where it is detected by other hair cells and signaled to the brain via the auditory nerve. Goldstein's hypothesis was especially daring because, at the time, basilar membrane vibrations were almost universally viewed as linear and uninfluenced by the physiological state of the organ of Corti.

Indirect support for Goldstein's hypothesis came from the study of humans and animals suffering from partial deafness confined to restricted frequency bands [4,5]. For example, when a human subject's ear is stimulated by tones with frequencies in the region of impaired hearing, the subject is unable to hear combination tones with frequencies corresponding to regions of normal hearing [4]. Also consistent with Goldstein's hypothesis was the finding that auditory-nerve fibers [6] and inner hair cells [7] tuned to 2f1–f2 can respond to a pair of tones, f1 and f2, even when they are unable to respond to either tone in isolation. Perhaps most spectacular, however, was the discovery that the ear responds to pairs of tones by broadcasting distortion-product otoacoustic emissions into the surrounding environment (left arrow in Figure 2) [8]. This discovery established that mechanical distortion products are generated in the inner ear, and provided a tool for testing the role of hair cells in their production. Taking advantage of the fact that the endings of certain neurons of the olivocochlear efferent system synapse with outer, but not inner, hair cells, Mountain [9] then showed that distortion-product otoacoustic emissions can be altered by electrical activation of these neurons, thus demonstrating that the outer hair cells can influence cochlear mechanics.

A tantalizing hint that distortion products might exist in basilar membrane vibrations was suggested in Rhode's seminal discovery [10] of a non-linearity in basilar membrane responses to characteristic-frequency single tones. Working on the squirrel monkey cochlea, Rhode measured compressive input–output functions that, at least to a first approximation, could be described by power laws with exponents less than one. It can be shown mathematically that when the sum of two sinusoids is operated on by such a simple power law, combination tones arise of necessity and that, as noted by Goldstein in the case of psycho-acoustically measured distortion products, their magnitudes remain more-or-less constant relative to the magnitude of the stimulus. In addition, Rhode showed that this non-linearity was metabolically vulnerable, disappearing upon death or cochlear trauma. Although Rhode's findings were eventually confirmed and extended (reviewed in [11]), nearly two decades elapsed before definitive proof could be obtained for the propagation of mechanical distortion products on the basilar membrane. Two investigations arrived at the conclusion that if 2f1–f2 distortion products exist on the basilar membrane, they are vanishingly small [12,13] and, probably, additional failures to detect mechanical distortion products went unreported. In retrospect, it seems that the long-lasting failure to find basilar membrane correlates of the 2f1–f2 distortion product was due to two main reasons, namely experimenter-induced cochlear trauma and inadequate technology for the recording of submicroscopic displacement or motion.

Finally, in the past few years, investigators in three laboratories have independently used laser interferometry to demonstrate that various distortion products, including those with a frequency equal to 2f1–f2 and 2f2–f1 are indeed present in the vibration of the basilar membranes in guinea pigs, cats and chin chillas [1418]. In the case of the 2f1–f2 distortion product, the main features of psycho-acoustical observations have appropriate counterparts in the basilar membrane, including the dependence of their audibility on the frequency separation between the primary tones, and on their combined or relative intensities [17]. The mechanical distortion products are physiologically vulnerable, being absent in traumatized cochlea. Most interestingly, it is possible to use acoustic overstimulation to reduce the magnitude of the 2f1–f2 distortion product selectively, while leaving relatively unaffected the response to a single tone with a frequency equal to 2f1–f2. Such selectivity implies that the distortion product travels on the basilar membrane from its site of origin, with a characteristic frequency near f1 and f2, to its site of analysis, with a characteristic frequency equal to 2f1–f2. Surprisingly, it is now doubtful whether basilar-membrane mechanical f2–f1 distortion products are large enough to account for their conspicuous psycho-acoustical counterparts [13,18,19].

Which cochlear structure generates the basilar-membrane distortion products? The basilar membrane itself can probably be ruled out because, being largely acellular, it should be insensitive (at least in the short term) to various insults capable of abolishing mechanical non-linearities, including distortion products. The most likely candidates are the outer hair cells, which in isolation exhibit electromotility; in other words, they act like alternating-current motors, changing their length in response to electrical stimulation [20], (reviewed in [21]). Accordingly, cochlear damage that destroys outer hair cells but leaves inner hair cells apparently intact abolishes distortion products in auditory-nerve responses [5]. Activation of the medial olivocochlear efferent system, which innervates outer hair cells exclusively, can alter distortion-product otoacoustic emissions [9]. Furthermore, reduction of the receptor potentials of hair cells (due to death or the effects of ototoxic drugs) drastically degrades the sensitivity of basilar-membrane responses and eliminates its frequency-specific non-linearity [22].

If the outer hair cells are ultimately responsible for the generation of distortion products in basilar-membrane vibrations and in otoacoustic emissions, the mechanism could conceivably be a byproduct of mechanical-to-electrical transduction at the stereocilia, or of electromotility (electrical-to-mechanical transduction) at the lateral cell membrane. Mechanical-to-electrical transduction at the hair cells of both the frog sacculus and the mammalian cochlea is substantially non-linear. The function relating receptor potentials to stereocilia displacement is sigmoidal, saturating at large displacements [23,24]. Most significantly in the generation of distortion products, which can occur at low and moderate levels, the ‘stiffness’ with which the stereocilia resist displacement is constant for large stimuli but dips to a minimum for small stimuli [25,26].

In a recent paper, Jaramillo et al. [27] convincingly show that such low-level transducer non-linearity leads to the generation of mechanical distortion products in frog-sacculus hair cells, reminiscent of those present in basilar-membrane vibration. Even if the electromotility in outer hair cells was fully linear, the non-linearity of forward transduction should cause the responses of the basilar membrane to tone pairs to contain distortion products. In fact, electromotility is non-linear, with a voltage-to-displacement relationship described by a Boltzman function [28,29]. Whether this non-linearity actually leads to significant two-tone distortion is controversial. Under voltage-clamp conditions, the mechanical response of outer hair cells to moderate-level sinusoidal voltages contains little harmonic distortion [28]; presumably, responses to tone pairs would also generate negligible intermodulation distortion. On the other hand, transmembrane electrical stimulation of outer hair cells held in micro-chambers does appear to generate prominent two-tone distortion products [30].

Thus, somewhat paradoxically, both of the mechanisms that endow the sense of hearing with exquisite sensitivity, frequency selectivity and time resolution — stereociliar transduction in hair-cell organs throughout the phylogenetic scale and electromotility in the outer hair cells of the mammalian cochlea — may also be responsible for distorting the auditory sensation.

References

  • 1.Jones AT. The discovery of difference tones. Amer Phys Teacher. 1935;3:49–51. [Google Scholar]
  • 2.Helmholtz HLF. On the Sensations of Tone. New York: Dover; 1954. Republication of 1885 Ellis translation of 1877 edition of Die lehre von den Tonempfindungen. [Google Scholar]
  • 3.Goldstein JL. Auditory nonlinearity. J Acoust Soc Am. 1967;41:676–689. doi: 10.1121/1.1910396. [DOI] [PubMed] [Google Scholar]
  • 4.Smoorenburg GF. Combination tones and their origin. J Acoust Soc Am. 1972;52:615–632. [Google Scholar]
  • 5.Siegel JH, Kim DO, Molnar CE. Effects of altering organ of Corti cochlear distortion products f2–f1 and 2f1–f2. J Neurophysiol. 1982;47:303–328. doi: 10.1152/jn.1982.47.2.303. [DOI] [PubMed] [Google Scholar]
  • 6.Goldstein JL, Kiang NYS. Neural correlates of the aural combination tone 2f1–f2. Proc IEEE. 1968;56:981–992. [Google Scholar]
  • 7.Nuttall AL, Dolan DF. Inner hair cell responses to the 2f1–f2 intermodulation distortion product. J Acoust Soc Am. 1990;87:782–790. doi: 10.1121/1.398890. [DOI] [PubMed] [Google Scholar]
  • 8.Kemp DT. Evidence of mechanical nonlinearity and frequency selective wave amplification in the cochlea. Arch Otorhinolaryngol. 1979;224:37–45. doi: 10.1007/BF00455222. [DOI] [PubMed] [Google Scholar]
  • 9.Mountain DC. Changes in endolymphatic potential and crossed olivocochlear bundle alter cochlear mechanics. Science. 1980;210:71–72. doi: 10.1126/science.7414321. [DOI] [PubMed] [Google Scholar]
  • 10.Rhode WS. Observations of the vibration of the basilar membrane in squirrel monkeys using the Mossbauer technique. J Acoust Soc Am. 1971;49:1218–1231. doi: 10.1121/1.1912485. [DOI] [PubMed] [Google Scholar]
  • 11.Ruggero MA. Responses to sound of the basilar membrane of the mammalian cochlea. Curr Opin Neurobiol. 1992;2:449–456. doi: 10.1016/0959-4388(92)90179-o. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wilson JP, Johnstone JR. Basilar membrane correlates of the combination tone 2f1–f2. Nature. 1973;241:206–207. doi: 10.1038/241206a0. [DOI] [PubMed] [Google Scholar]
  • 13.Rhode WS. Some observations on two-tone interaction measured with the Mössbauer effect. In: Evans EF, Wilson JP, editors. Psychophysics and Physiology of Hearing. London: Academic Press; 1977. pp. 27–38. [Google Scholar]
  • 14.Nuttall AL, Dolan DF, Avinash G. Measurements of basilar membrane tuning and distortion with laser Doppler velocimetry. In: Dallos P, Geisler CD, Matthews JW, Ruggero MA, Steele CR, editors. Mechanics and Biophysics of Hearing. Berlin: Springer Verlag; 1990. pp. 288–295. [Google Scholar]
  • 15.Robles L, Ruggero MA, Rich NC. Two-tone distortion products in the basilar membrane of the chinchilla cochlea. In: Dallos P, Geisler CD, Matthews JW, Ruggero MA, Steele CR, editors. Mechanics and Biophysics of Hearing. Berlin: Springer Verlag; 1990. pp. 304–311. [Google Scholar]
  • 16.Robles L, Ruggero MA, Rich NC. Two-tone distortion in the basilar membrane of the cochlea. Nature. 1991;349:413–414. doi: 10.1038/349413a0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Robles L, Ruggero MA, Rich NC. Distortion products at the basilar membrane of the cochlea: dependence on stimulus frequency and intensity and effect of acoustic trauma. Soc Neurosci Abstr. 1993;19:1421. [Google Scholar]
  • 18.Rhode WS, Cooper NP. Two-tone suppression and distortion products on the basilar membrane in the hook region of cat and guinea pig cochleae. Hear Res. 1993;66:31–45. doi: 10.1016/0378-5955(93)90257-2. [DOI] [PubMed] [Google Scholar]
  • 19.Nuttall AL, Dolan DF. Intermodulation distortion (F2–F1) in inner hair cell and basilar membrane response. J Acoust Soc Am. 1993;93:2061–2068. doi: 10.1121/1.406692. [DOI] [PubMed] [Google Scholar]
  • 20.Brownell WE, Bader CR, Bertrand D, De Ribaupierre Y. Evoked mechanical responses of isolated cochlear hair cells. Science. 1985;227:194–196. doi: 10.1126/science.3966153. [DOI] [PubMed] [Google Scholar]
  • 21.Ashmore J. The ear's fast cellular motor. Curr Biol. 1993;3:38–40. doi: 10.1016/0960-9822(93)90145-e. [DOI] [PubMed] [Google Scholar]
  • 22.Ruggero MA, Rich NC. Furosemide alters organ of Corti mechanics: evidence for feedback of outer hair cells upon the basilar membrane. J Neurosci. 1991;11:1057–1067. doi: 10.1523/JNEUROSCI.11-04-01057.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Hudspeth AJ, Corey DP. Sensitivity, polarity, and conductance change in the response of vertebrate hair cells to controlled mechanical stimuli. Proc Natl Acad Sci USA. 1977;74:2407–2411. doi: 10.1073/pnas.74.6.2407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Russell IJ, Richardson GP, Cody AR. Mechanosensitivity of mammalian auditory hair cells in vitro. Nature. 1986;321:517–519. doi: 10.1038/321517a0. [DOI] [PubMed] [Google Scholar]
  • 25.Howard J, Hudspeth AJ. Compliance of the hair bundle associated with gating of mechanoelectrical transduction channels in the bullfrog's saccular hair cell. Neuron. 1988;1:189–199. doi: 10.1016/0896-6273(88)90139-0. [DOI] [PubMed] [Google Scholar]
  • 26.Russell IJ, Kössl M, Richardson GP. Nonlinear mechanical responses of mouse cochlear hair bundles. Proc R Soc Lond B. 1992;250:217–227. doi: 10.1098/rspb.1992.0152. [DOI] [PubMed] [Google Scholar]
  • 27.Jaramiuo F, Markin VS, Hudspeth AJ. Auditory illusions and the single hair cell. Nature. 1993;364:527–529. doi: 10.1038/364527a0. [DOI] [PubMed] [Google Scholar]
  • 28.Santos-Sacchi J. Harmonics of outer hair cell motility. Biophys J. 1993 doi: 10.1016/S0006-3495(93)81247-5. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Dallos P, Hallworth R, Evans BN. Theory of electrically driven shape changes of cochlear outer hair cells. J Neurophysiol. 1993;70:299–323. doi: 10.1152/jn.1993.70.1.299. [DOI] [PubMed] [Google Scholar]
  • 30.Hu XT, Evans BN, He ZZ, Dallos P. Distortion products in electromotile responses of isolated outer hair cells; Assoc Res Otolaryngol Mid-winter meeting abstracts; 1994; in press. [Google Scholar]

RESOURCES