Abstract
In their article, “Measurement of cochlear power gain in the sensitive gerbil ear,” Ren et al. (Nat Commun 2:216, 2011) claim to provide “the first direct experimental evidence of power amplification in the sensitive living cochlea.” While we recognize the technical challenges of the experiments and appreciate the beauty of the data, the authors’ analysis and interpretation of the measurements are invalid. We review the concept of impedance (i.e., the ratio of pressure to velocity) as it applies to cochlear mechanics and show that Ren et al. mistakenly equate the impedances near the basilar membrane and stapes with the impedance characteristic of an infinite, uniform tube of fluid. As a consequence of this error, Ren et al.’s measurements and analysis provide no evidence for power amplification in the cochlea. Compelling evidence for power amplification has, however, been previously provided by others.
Keywords: cochlea, cochlear amplifier, power gain, impedance
Introduction
Electrohydromechanical processes within the mammalian inner ear collectively amplify the sound-induced motion of the cochlear partition, thereby enhancing the sensitivity, frequency selectivity, and, perhaps most importantly, the dynamic range of hearing (reviewed in Dallos 1996). Theoretical analyses suggest that the mammalian cochlear amplifier operates by boosting the power carried by the pressure waves propagating within it (Neely 1980; Zweig 1991; de Boer 1995; Shera 2007). However, the active enhancement of cochlear vibration amplitudes does not, in principle, require cycle-by-cycle power amplification. Displacement enhancement could arise, for example, via mechanisms similar to mechanical transformers (e.g., levers) or by reducing the impedance (e.g., stiffness) of the vibrating structures (e.g., Kolston 1988). Although cochlear power amplification is biologically feasible—direct measurements have established that the bundles of bullfrog saccular hair cells are capable of doing mechanical work (Martin and Hudspeth 1999)—the biophysical processes subserving amplification in the mammalian cochlea evidently differ from those operating in the bullfrog sacculus, at least at high frequencies. In particular, the enhancement of basilar-membrane motion in the mammalian cochlea appears to depend primarily on forces produced by outer hair cell soma rather than by the bundles (Liberman et al. 2002). Thus, determining whether or not the mammalian cochlea provides actual power (as opposed to displacement) amplification—and if so, how much—has been a central question in auditory biophysics.
In a recent report, Ren, He, and Gillespie (2011) claim to resolve this issue by providing “the first direct experimental evidence of power amplification in the sensitive living cochlea.” Since the measurement of power is technically difficult, especially in vivo, Ren et al.’s results, if substantiated, would be an experimental and analytical tour de force worthy of publication in a high-profile journal. Unfortunately, and notwithstanding the merits of the data, a careful reading of the paper reveals that the authors’ analysis and interpretation are invalid. In fact, the measurements provide no evidence whatsoever for power amplification in the cochlea.
Critique
Our critique of Ren et al.’s analysis and conclusions has two main parts. In the first part, we identify a major conceptual problem underlying their analysis and point out a serious technical error in their estimates of energy and power gain. In the second part, we consider whether Ren et al.’s metric might nevertheless provide a useful assay of cochlear power amplification.
A problem with impedance
Power is force times velocity, and it is important to recognize at the outset that Ren et al. measured only displacement, not power. Contrary to their claims, the authors therefore provide no direct evidence for anything involving power, which would require simultaneous measurements of both velocity (as obtained from displacement in their study) and force or pressure (which they did not measure). Lacking direct measurements of force or pressure, Ren et al. must convert measurements of velocity into power. For this, the relevant impedance must be known.
A few remarks about impedance Impedance is a physical concept that relates force to velocity, or in acoustics, pressure to velocity. Here, we consider the acoustic case and define impedance as pressure divided by velocity. In certain simple situations, such as in free space or at the driving end of a tube or horn of standard mathematical shape, the impedance has been determined analytically and can be looked up in a book (e.g., Rayleigh 1896). For plane waves in free space, or in an infinite lossless tube, the impedance is ρc, the product of the mass density of the medium, ρ, and the speed of sound, c. Because these systems are uniform (homogeneous) and infinite, the impedance is constant everywhere. The impedance ρc is also real and positive. This tells us that sound energy is absorbed by the medium (i.e., that it radiates away to infinity). For a closed tube, by contrast, the impedance is equal to ρc multiplied by an imaginary factor whose value depends both on frequency and on the length of the tube. As the frequency is changed, the magnitude of the multiplicative factor varies between zero and infinity, reflecting the fact that at some frequencies the pressure, and at others the particle velocity, becomes zero at the point of measurement. Thus, for the closed tube the impedance at any given location varies with frequency, and conversely, at any given frequency varies with location. The impedance is also purely imaginary, rather than real, which means that energy is neither absorbed nor radiated away. This makes sense; since the sound is reflected at the end and cannot escape to infinity, energy is “trapped” or stored within the tube.Now, consider impedances and power flow within the cochlea. At the macromechanical level considered here, the relevant velocity is that of the cochlear partition, and the relevant pressure is that at the surface of the basilar membrane—or, indeed, the pressure difference across the basilar membrane, for it is this force that moves the partition up and down, stimulating the sensory cells (Voss et al. 1996; Olson 2001; Dong and Olson 2009). Given the cochlea’s convoluted, three-dimensional structure, and the complexity of its myriad mechanical components, it should come as no surprise that impedances in the cochlea differ substantially from those in tubes and horns. Although tubes have no internal parts to interact with the sound they carry, the structures embedded within the cochlea’s fluid medium—the basilar membrane and organ of Corti—have a profound effect on the mechanical response. When the stapes or any of the cochlea’s constituents vibrates, the motion produces forces, communicated through the fluid, that cause the other structures within the cochlea to move or deform. Indeed, forces produced by outer hair cells couple back to the stapes and therefore even result in the movement of structures outside the cochlea (e.g., otoacoustic emissions). Thus, at any given frequency and location, the quantitative relationship between the pressure and the basilar-membrane (BM) velocity—the impedance—depends on a complex network of moving parts and geometrical relationships. As a result, impedances in the cochlea are much more complicated than in infinite tubes, and their values are not so easily computed.Although impedances express the relationship between the local pressure and the local velocity, they depend not only on the local properties of the medium, but on everything the medium is coupled to, directly or indirectly. In an infinite, homogeneous medium, plane waves propagate freely, uncoupled to any other structures, and the corresponding impedance therefore has the constant value ρc. In the cochlea, however, the fluid is strongly coupled to many other things; as a result, cochlear impedances vary both with frequency and with location. They depend not only on the characteristics of the fluid but also on the size and shape of the cochlea, on the mechanical properties of the cochlear partition (e.g., its mass, stiffness, and active forces), and on the load presented to the cochlea by the middle ear and beyond. Although impedances measured in the fluid near the BM are sometimes referred to as “fluid impedances” in the literature (e.g., Steele and Taber 1979), they depend on far more than just the fluid.
Misidentification of the relevant impedances In their analysis of cochlear power flow, Ren et al. ignore the observations about impedance reviewed above and assume that the relevant impedance in the cochlea is everywhere ρc, the impedance of a plane wave propagating in an infinite, homogeneous medium (e.g., in an infinite fluid-filled tube). As we have indicated, this assumption is wrong, for two reasons. First, the cochlea is not infinite in extent. Indeed, the cochlear fluid and the sound waves propagating within it are confined to a finite chamber whose dimensions are small compared to the wavelength of sound. Just as the impedance of a pipe closed at one end differs from that of an infinite tube (Fletcher 1992), so the boundary conditions at the cochlear walls have a significant influence on cochlear impedances (Mammano and Nobili 1993).Second, the cochlea is not homogeneous; it has an internal structure that enables it to function as a sensory transducer. As a result, the impedances relevant to the computation of power depend not only on the fluid and the walls, but also on the properties of the cochlear partition, the boundary conditions at the middle ear, and so on. At the stapes, for example, the effective impedance is influenced by the stiffness of the basilar membrane—the more compliant the membrane, the easier it is to move the stapes and the lower the impedance. As a result, the relevant impedance at the stapes (the cochlear input impedance) has a value very different from the specific acoustic impedance of the cochlear fluid, as demonstrated both by theory and by direct measurement (Lynch et al. 1982; Rosowski et al. 1986; Shera and Zweig 1991).Although Ren et al. correctly note that classical cochlear models assume that hydromechanical energy is propagated through the cochlear fluids, they ignore what those models say about where and how the energy is injected (e.g., via electromechanical transduction in the outer hair cells) and absorbed. When they employ the specific acoustic impedance of the fluid and ignore the properties of the cochlear partition, Ren et al. effectively assume that acoustic energy, once introduced into the fluid, propagates away to infinity. The cochlear models to which Ren et al. refer, however, indicate that this is not the case. Instead, energy transferred to the fluid in one region, either from the stapes or from the organ of Corti, is absorbed by the partition in another (e.g., near the peak of the wave). The models indicate that the direction of time-averaged power transfer to or from the traveling wave is controlled by the sign of the real part of the impedance of the cochlear partition. This impedance depends, among a great many other things, on the stiffness of the basilar membrane and the magnitudes and phases of the forces produced by outer hair cells. Consequently, the relevant impedance varies with frequency and location within the cochlea; the impedance seen by the stapes is not the same as the impedance seen by the basilar membrane near the peak of the traveling wave. By using the specific acoustic impedance of the fluid at all frequencies and at all locations, Ren et al. ignore the role of the cochlear partition in energy transfer and seriously distort the physics underlying the models they cite. To suggest that the relevant cochlear impedance is ρc, as Ren et al. do, is to regard the cochlea as an infinite tube containing no basilar membrane, no organ of Corti, and no hair cells—indeed, containing nothing but salty water. One’s ears can only be deaf to such a proposition.
Technical error in the analysis In addition to misidentifying the relevant impedances, Ren et al. make a technical error that renders their analysis dimensionally incorrect and therefore logically flawed at the most basic level. In particular, Ren et al. compute power using an incorrect formula. To convert measurements of volume velocity (which they denote V) into power (which they denote I), Ren et al. use the formula I = ρcV2, where ρc is the specific acoustic impedance of the medium (e.g., the cochlear fluid). But I defined in this way has dimensions of power × area, not of power. To see this, note that if M, L, and T represent, respectively, the dimensions of mass, length, and time, then the dimensions of ρcV2, denoted [ρcV2], are
![]() |
1 |
where the dimensions of power are those of energy (E) per time, and energy has the dimensions of mass times velocity squared ([E] = ML2/T2). As a result of this dimensional discrepancy, Ren et al.’s calculations of the “energy input and output functions of the basilar membrane” (obtained by integrating the formula for I over some unspecified time interval) do not have units of joules (J), as indicated on the ordinate of their Fig. 5b, but of J·m2. Although Ren et al. refer to the value given by their formula as “the sound energy passing through [a certain area] … in 1 s,” the quantity they compute actually has no clear physical interpretation.The formula used by Ren et al. to compute “power” is typographically but not semantically identical to a standard equation for power flow in acoustics, namely I = ρcV2 (e.g., Kinsler et al. 1982). In the standard equation, I is the acoustic intensity (time-averaged power per unit area) carried by a plane wave propagating in an infinite medium of specific acoustic impedance ρc, and V is the corresponding root mean square (rms) acoustic particle velocity. When the variables I and V are thus correctly interpreted, the equation is, of course, dimensionally consistent. But Ren et al.’s substitution of a volume–velocity amplitude for an rms particle velocity, perhaps facilitated by use of the symbol V for both, renders the equation both dimensionally inconsistent and physically meaningless.
An ad hoc metric unrelated to power gain
Despite these basic physical errors in the analysis, the reader may wonder whether the value of the gain metric introduced by Ren et al. might nevertheless be indicative of cochlear amplification. Using their data, Ren et al. are able to compute a quantity, Ge, that they call the energy (or power) gain of the basilar membrane. As they explain in the “Methods,” Ren et al. interpret Ge as the ratio of the power associated with basilar-membrane vibration near the peak of the wave to the power input at the stapes. Empirical values of Ge greater than one, they argue, indicate that the cochlear amplifier has boosted the power carried by the traveling wave, establishing that the cochlea provides power amplification. Although their physical derivation is clearly flawed (see above), Ren et al.’s demonstration of power gain might still be logically viable: If values of Ge > 1 do, in fact, characterize systems with positive power gain, as Ren et al. claim, then their empirical finding that Ge can be greater than 1 would indeed establish, albeit somewhat fortuitously, that the mammalian cochlea provides power amplification.
Figure 1 demonstrates, however, that values of Ge > 1 can arise even in a completely passive structure. The figure shows values of Ge computed for a simple, passive model of the cochlea (e.g., Zweig et al. 1976) as a function of the effective damping constant of the cochlear partition. For the results shown in the figure, the damping constant was everywhere positive and the model therefore entirely passive. Although the cochlear partition in the model always attenuates the power carried by the traveling wave and never amplifies it, the figure demonstrates that the values of Ge can be significantly greater than 1. (The parameters of the model can also be adjusted to yield values of Ge less than 1.) Thus, Ren et al.’s finding that empirical values of Ge are generally greater than 1 provides no evidence of cochlear power gain. Despite its name, Ge does not measure gain associated with energy or power. For comparison, the figure also shows corresponding values of a physically based metric of net power gain proposed previously (de Boer and Nuttall 2001; Shera 2007). The metric involves the total power carried by the wave at the peak of its envelope normalized by the input power. Computing its value involves integrating the power transferred to or from the organ of Corti between the stapes and the peak of the wave. As expected for a passive system, the power gain measured in this way is always less than 1.
FIG. 1.
Two metrics computed in a passive cochlear model as a function of the assumed damping constant. The physically based gain metric involving integrated power (×) is always less than 1, correctly indicating the absence of cochlear amplification in the model. By contrast, the metric Ge proposed by Ren et al. (white square) does not measure power gain; its value can be significantly greater than 1, even in a passive system.
What determines the value of Ge? Because of the physical errors outlined above, Ren et al.’s “power” gain Ge is nothing but the square of a volume-velocity gain
. In general, volume-velocity gains are in no way equivalent to power gains. It is easy to construct acoustical–mechanical systems that manifest significant volume-velocity gains while having power gains less than 1 (i.e., energy losses). For example, volume-velocity gains greater than 1 can occur in passive acoustic transformers, such as horns (Fletcher 1992). The middle ear driven in reverse provides another example, this time from the auditory periphery (Puria 2003). In the cochlea, the fact that Ren et al.’s ad hoc metric of basilar-membrane “power” or volume-velocity gain (re stapes) can be greater than 1 even in the absence of amplification can be understood as the consequence of conservation of fluid mass and the fact that BM traveling waves contain multiple wavelengths. Because the cochlear fluids are nearly incompressible and a negligible amount of fluid flows through the helicotrema at the stimulus frequency employed, the total instantaneous volume displacement of the BM, integrated along its length, must equal the volume displacement of the stapes. But the integrated BM volume displacement is the sum of contributions that alternate polarity every half wavelength as the BM displacement alternates between positive and negative (“up” and “down”) along its length. Figure 1c in Ren et al.’s paper, which shows a snapshot of the BM traveling wave as it appears at a particular moment of time, illustrates that the volume displacements of adjacent half-wavelength regions alternate in sign. As a consequence of this alternation, the volume displacements associated with different regions largely cancel one another when added together, and the total volume displacement of the BM can therefore be much smaller than the volume displacement of any given section. Thus, if one chooses to compute the instantaneous volume displacement of some particular BM section at some particular moment—such as the one rather arbitrarily selected by Ren et al. near the peak of the wave—one can readily find that the volume displacement of the chosen BM section is larger than the total volume displacement of the BM and hence also larger than that of the stapes. In other words, Ren et al.’s metric Ge of BM volume-velocity gain (re stapes) can be greater than 1, even in a passive cochlea. Simply put, Ge provides an arbitrary measure of volume-velocity gain but does not quantify power gain.1Of course, the exact value of Ge one obtains will depend on the location and spatial extent of the particular BM section one chooses to analyze. It will also depend on the instantaneous wave pattern of the traveling wave in that region, a pattern that can be modified by cochlear amplification and can vary with stimulus intensity. The crucial point, however, is that with an advantageous choice of moving boundary, volume-velocity gains greater than 1 can, in principle, occur under entirely passive conditions, as is easily verified in cochlear models (see Fig. 1). Thus, although Ren et al.’s measurements and analysis corroborate previous studies showing that cochlear responses are nonlinear, they provide no evidence whatsoever for actual power amplification.
Previous discussions of cochlear power gain
Although Ren et al.’s published analysis does not meaningfully address the issue, previous studies have reported and discussed estimates of the power gain of the cochlear amplifier. These estimates have been rigorously derived both by analyzing BM measurements within the context of hydromechanical models (Zweig 1991; Brass and Kemp 1993; de Boer and Nuttall 2001; Shera 2007) and by combining direct measurements of BM velocity and intracochlear pressure (Olson 2001; Dong and Olson 2009). Some of these studies have already convincingly established that the mammalian cochlea provides power amplification to boost the sensitivity and dynamic range of hearing. Going beyond this qualitative statement to obtain reliable quantitative estimates of the power gain of the amplifier remains challenging. The principal difficulty resides in making accurate simultaneous measurements of both pressure and velocity and/or in deriving credible theoretical estimates of these quantities from other measurements. In their report, Ren et al. would have us believe that they have discovered a significant shortcut for this challenging problem. They have not.
Acknowledgments
This work was supported by grants R01 DC003687 (CAS), DC003130 (ESO), and DC00235 (JJG) from the NIDCD, National Institutes of Health.
Footnotes
Physically correct definitions of cochlear power gain involve averaging over a stimulus cycle and integrating the power flow (i.e., acoustic intensity) along the entire extent of the wave (de Boer and Nuttall 2001; Shera 2007). The computation requires knowledge of the real part of the relevant impedances (i.e., the impedances at the BM and at the stapes).
References
- Brass D, Kemp DT. Analyses of Mössbauer mechanical measurements indicate that the cochlea is mechanically active. J Acoust Soc Am. 1993;93:1502–1515. doi: 10.1121/1.406808. [DOI] [PubMed] [Google Scholar]
- Dallos P. Overview: cochlear neurobiology. In: Dallos P, Popper AN, Fay RR, editors. The cochlea. New York: Springer; 1996. pp. 1–43. [Google Scholar]
- Boer E. The ‘inverse problem’ solved for a three-dimensional model of the cochlea. I. Analysis. J Acoust Soc Am. 1995;98:896–903. doi: 10.1121/1.413515. [DOI] [PubMed] [Google Scholar]
- Boer E, Nuttall AL. Power gain of the cochlear amplifier. In: Breebaart DJ, Houtsma AJM, Kohlrausch A, Prijs VF, Schoonhoven R, editors. Physiological and psychological bases of auditory function. Maastricht: Shaker; 2001. pp. 1–7. [Google Scholar]
- Dong W, Olson ES. In vivo impedance of the gerbil cochlear partition at auditory frequencies. Biophys J. 2009;97:1233–1243. doi: 10.1016/j.bpj.2009.05.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fletcher NH. Acoustic systems in biology. New York: Oxford Univ. Press; 1992. [Google Scholar]
- Kinsler LF, Frey AR, Coppens AB, Sanders JV. Fundamentals of acoustics. New York: Wiley; 1982. [Google Scholar]
- Kolston PJ. Sharp mechanical tuning in a cochlear model without negative damping. J Acoust Soc Am. 1988;83:1481–1487. doi: 10.1121/1.395903. [DOI] [PubMed] [Google Scholar]
- Liberman MC, Gao J, He DZZ, Wu X, Jia S, Zuo J. Prestin is required for electromotility of the outer hair cell and for the cochlear amplifier. Nature. 2002;419:300–304. doi: 10.1038/nature01059. [DOI] [PubMed] [Google Scholar]
- Lynch TJ, Nedzelnitsky V, Peake WT. Input impedance of the cochlea in cat. J Acoust Soc Am. 1982;72:108–130. doi: 10.1121/1.387995. [DOI] [PubMed] [Google Scholar]
- Mammano F, Nobili R. Biophysics of the cochlea: linear approximation. J Acoust Soc Am. 1993;93:3320–3332. doi: 10.1121/1.405716. [DOI] [PubMed] [Google Scholar]
- Martin P, Hudspeth AJ. Active hair-bundle movements can amplify a hair cell’s response to oscillatory mechanical stimuli. Proc Natl Acad Sci USA. 1999;96:14,306–14,311. doi: 10.1073/pnas.96.25.14306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Neely ST. Backward solution of a two-dimensional cochlear model. J Acoust Soc Am. 1980;67:S75. doi: 10.1121/1.2018388. [DOI] [Google Scholar]
- Olson ES. Intracochlear pressure measurements related to cochlear tuning. J Acoust Soc Am. 2001;110:349–367. doi: 10.1121/1.1369098. [DOI] [PubMed] [Google Scholar]
- Puria S. Measurements of human middle ear forward and reverse acoustics: implications for otoacoustic emissions. J Acoust Soc Am. 2003;113:2773–2789. doi: 10.1121/1.1564018. [DOI] [PubMed] [Google Scholar]
- Rayleigh JW. Theory of sound. New York: Dover; 1896. [Google Scholar]
- Ren T, He W, Gillespie PG. Measurement of cochlear power gain in the sensitive gerbil ear. Nat Commun. 2011;2:216. doi: 10.1038/ncomms1226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rosowski JJ, Carney LH, Lynch TJ, Peake WT. The effectiveness of external and middle ears in coupling acoustic power into the cochlea. In: Allen JB, Hall JL, Hubbard A, Neely ST, Tubis A, editors. Peripheral auditory mechanisms. Berlin: Springer; 1986. pp. 56–59. [Google Scholar]
- Shera CA. Laser amplification with a twist: traveling-wave propagation and gain functions from throughout the cochlea. J Acoust Soc Am. 2007;122:2738–2758. doi: 10.1121/1.2783205. [DOI] [PubMed] [Google Scholar]
- Shera CA, Zweig G. A symmetry suppresses the cochlear catastrophe. J Acoust Soc Am. 1991;89:1276–1289. doi: 10.1121/1.400650. [DOI] [PubMed] [Google Scholar]
- Steele CR, Taber LA. Comparison of WKB calculations and experimental results for threedimensional cochlear models. J Acoust Soc Am. 1979;65:1007–1018. doi: 10.1121/1.382570. [DOI] [PubMed] [Google Scholar]
- Voss SE, Rosowski JJ, Peake WT. Is the pressure difference between the oval and round window the effective stimulus for the cochlea? J Acoust Soc Am. 1996;100:1602–1616. doi: 10.1121/1.416062. [DOI] [PubMed] [Google Scholar]
- Zweig G. Finding the impedance of the organ of Corti. J Acoust Soc Am. 1991;89:1229–1254. doi: 10.1121/1.400653. [DOI] [PubMed] [Google Scholar]
- Zweig G, Lipes R, Pierce JR. The cochlear compromise. J Acoust Soc Am. 1976;59:975–982. doi: 10.1121/1.380956. [DOI] [PubMed] [Google Scholar]


