Abstract
The auditory system faithfully represents sufficient details from sound sources such that downstream cognitive processes are capable of acting upon this information effectively even in the face of signal uncertainty, degradation or interference. This robust sound source representation leads to an invariance in perception vital for animals to interact effectively with their environment. Due to unique nonlinearities in the cochlea, sound representations early in the auditory system exhibit a large amount of variability as a function of stimulus intensity. In other words, changes in stimulus intensity, such as for sound sources at differing distances, create a unique challenge for the auditory system to encode sounds invariantly across the intensity dimension. This challenge and some strategies available to sensory systems to eliminate intensity as an encoding variable are discussed, with a special emphasis upon sound encoding.
Keywords: Efficient coding, robust sensory processing, perception, intensity invariance, auditory system, sound processing, auditory cortex, adaptation, forward masking
1. Introduction
An important feature of biological sensory systems is their ability to extract meaningful environmental signals under a wide variety of conditions. This ability is necessary for successful prey acquisition, predator avoidance and mate localization, among other crucial behavioral tasks. These biological systems perform remarkably well even in the face of incomplete information, signal degradation or competing signals. Systems capable of extracting relevant information consistently under extremely variable environmental conditions are termed robust, and robust sensory pattern recognition is extremely useful for the survival of many species, including humans.
One stimulus dimension across which sounds are relatively consistently perceived is intensity or sound level. In other words, as the total power of a target sound is varied over many orders of magnitude, listeners are able either to correctly identify it or to process it correctly relative to other similar sounds (Buus and Florentine, 1991; Hanna, von Gierke et al., 1986; Viemeister and Bacon, 1988). This type of intensity invariance typically represents a straightforward achievement for artificial pattern recognizers because the overall stimulus pattern (e.g., the spectrotemporal distribution of sound power) remains relatively constant as overall power is added to the signal in a linear medium such as air. A simple signal normalization in the pattern recognizer can therefore enable robust identification of the signal relative to changes in its intensity. Intrinsic nonlinearities in biological sensory systems, on the other hand, often mean that even an operation as simple as adding power to a signal could alter the neural representation of that signal. These nonlinearities are particularly strong in the auditory periphery, leaving open the question of how the central auditory system is able to create a consistent perception of a given sound as it changes in intensity.
Under environmental conditions, differences in sound intensity are often associated with differences in sound source distances. In fact, overall sound source intensity is one of the key stimulus features used to estimate target distance in adult humans (Ashmead, LeRoy et al., 1990; Litovsky and Clifton, 1992; Mershon and Bowers, 1979; Strybel and Perrott, 1984; Zahorik and Kelly, 2007). Manipulation of sound intensity (and therefore perceived loudness) leads to systematic errors in distance judgment for virtual sound sources (Mershon, Desaulniers et al., 1981). Loudness perception itself has received considerable attention (Glasberg and Moore, 2006; Plack and Carlyon, 1995; Zhang and Zeng, 1997), while leaving relatively unexplored the mechanism of perceptual invariance across intensity in the auditory system. The goal of this review, therefore, will be to focus upon some of the potential strategies available to the nervous system for encoding sensory signals over a wide dynamic range while still preserving a representation of the signal that can be exploited for invariant or nearly invariant perception of the corresponding object. Strategies that appear to be used by the auditory system will be emphasized.
2. Dynamic Range Stitching
No individual coding element (e.g., receptor or neuron) in the sensory systems of higher animals is capable of encoding the entire intensity range to which the organism is sensitive. Perhaps the most obvious means of building an invariant representation across a wide intensity range using discrete elements of much narrower intensity ranges is to construct these elements such that their individual input-output functions combine to collectively span the total range of interest. In such a case, one would expect a range of neuronal thresholds such that combining or “stitching” together individual neuronal responses would allow the sensory system to represent the full range of intensity normally available to the organism. To some degree this strategy appears to be used by the auditory system and is depicted graphically in Figure 1. Threshold measurements made in auditory nerve support the notion that individual neuronal dynamic ranges are dispersed somewhat across the total intensity range of hearing (Evans, 1972; Liberman, 1978; Liberman and Kiang, 1978; Sachs and Abbas, 1974). Thresholds of auditory nerve fibers have classically been evaluated as absolute spiking rate measures evoked by stimuli versus spontaneous rates, but similar trends hold true when statistical properties of rate responses are taken into account (Geisler, Deng et al., 1985; Young and Barta, 1986) and are logically extended when temporal information in the spike trains is considered (Carney, 1994; Colburn, Carney et al., 2003).
The strategy of dynamic range stitching can be also seen in responses in primary auditory cortex (A1). Figure 2 depicts the distribution of relative tone thresholds measured from over 500 neurons located in awake marmoset monkey A1. The pattern that emerges reflects essentially the same pattern seen in the auditory nerve, whereby a subset of thresholds is indeed distributed across a wide intensity range, but the bulk of thresholds trend toward lower values. Because the average dynamic range of these neurons is around 15 dB, the overall machinery of auditory encoding appears to be biased largely toward intensities within 30 dB or so of hearing threshold (Watkins and Barbour, 2010b). This finding is problematic for two reasons. First, for the “stitching” process to be most effective, the neuronal dynamic ranges would be expected to span more or less uniformly the complete intensity range of hearing, which is not the case here. In fact, prima facie evidence for the auditory system’s ability to encode loud or moderately loud sounds is surprisingly lacking from these data. The issue of not having sufficient unsaturated coding elements to account for observed intensity discrimination capabilities for loud sounds has been explored previously (Florentine, Buus et al., 1987; Viemeister, 1988a; Viemeister, 1988b). Second, a Bayesian argument could be made for a nonuniform distribution of individual neuronal dynamic ranges, but in that case one would expect on efficient coding grounds that these ranges would match at least approximately the environmental stimulus statistics. While the distribution of environmental sound intensities to which these animals were exposed is unknown, it is extremely unlikely to be biased so strongly toward intensities near the animal’s hearing threshold. In fact, these laboratory animals are reared and housed in a room with acoustically reflective walls, implying that intensities to which these animals are regularly exposed are likely to be substantially greater than the intensities regularly encountered by marmosets in the wild or reared in quiet conditions (Liberman, 1978). Given this likelihood, the finding of so many low-threshold neurons in the laboratory animals is even more striking. Therefore, the actual distribution of sound levels that appears to be represented in A1 does not match even the most basic expectations of actual sound levels that the auditory system would need to encode.
One proposal to account for this discrepancy for the stitching hypothesis involves invoking a unique subpopulation of neurons with dynamic ranges more thoroughly spanning the full intensity range of hearing (Phillips, Semple et al., 1994; Sadagopan and Wang, 2008; Suga, 1992; Suga and Manabe, 1982). The neurons in question have an expanded overall dynamic range because at intensities greater than a preferred intensity, their responses systematically diminish, hence the categorization of their input/output functions as “nonmonotonic” or “intensity-tuned.” The result is an expanded overall dynamic range relative to monotonic or intensity-untuned neurons. In some cases, the dynamic ranges of nonmonotonic neurons have been reported to be more uniformly distributed across the full intensity range of hearing than intensity-untuned neurons (e.g., see Sadagopan and Wang, 2008). In data collected from marmoset monkey A1 in the author’s laboratory, however, this distribution of thresholds and dynamic ranges of nonmonotonic neurons is still heavily biased toward lower intensities, implying that sound level representation in even this subgroup of cortical neurons is concentrated toward lower intensities (Watkins and Barbour, 2010b). Given that these animals are likely exposed to more intense sounds on average than animals in the wild, the finding of so many low-threshold neurons in the laboratory animals is even more striking.
The precise distribution of auditory neuronal input/output functions notwithstanding, a particular challenge to inferring auditory system function from these data likely relates to the discrepancy between how the data are collected in the laboratory compared with the properties of natural acoustic stimuli. In traditional laboratory studies of input/output (i.e., rate-intensity or rate-level) functions, an experimental animal is placed into an acoustic isolation chamber that enables very low background sound levels during the experiment. Additional wideband sounds present in an acoustic environment have been shown in many contexts and throughout the auditory system to affect neuronal dynamic ranges measured by tones, usually by shifting dynamic ranges toward higher intensities and possibly compressing them (Aitkin, 1991; Costalupes, Young et al., 1984; Geisler and Sinex, 1980; Gibson, Young et al., 1985; Phillips, 1985; Phillips and Cynader, 1985; Phillips and Hall, 1986; Ramachandran, Davis et al., 2000; Rees and Palmer, 1988; Sadagopan and Wang, 2008; Young and Barta, 1986). Furthermore, each probe tone in these experiments is typically presented with sufficient silence between stimulus presentations in order to avoid the response of one stimulus influencing the response of another. It is widely understood that the reason for waiting between stimulus presentations and, furthermore, randomizing the order of stimulus presentations is that the auditory system being probed has memory associated with it. In other words, the recent past affects neuronal responses because auditory neurons adapt to their recent history, thereby prompting experimental data collection procedures designed to average out these effects. Adaptation is another technique that sensory neurons can use to encode a larger overall dynamic range using individual coding elements possessing smaller dynamic ranges.
3. Adaptation
While sensory systems are capable of encoding a wide range of intensities, over relatively short periods of time a much smaller range of intensities is typically present in the environment. Figure 3, for example, depicts the mean amplitude ranges present in time intervals of different lengths for a jungle and an urban recording. Naturally, longer intervals contain a wider range of amplitudes on average than shorter intervals, which would be true for the vast majority of sounds. Even at relatively long intervals of several seconds, however, no more than 1/4 of the total dynamic range of the recording apparatus was occupied by the stimulus. A logical approach to encoding stimuli over a wide intensity range, therefore, is to adapt the narrower dynamic ranges of individual coding elements in response to recent stimulus statistics at an appropriate time scale. The effect would be to shift the dynamic ranges of coding elements to align with the most common recent signal values encountered. Depending upon the statistics of the sensory stimulus and the rate of adaptation, the coding element dynamic ranges could be relatively narrow yet still effectively encode sensory stimuli over a much greater intensity range. An additional advantage of this strategy is that it utilizes fixed resources more effectively by shifting many of them away from sensitivity to events that have an extremely low probability of either occurring or being detected or both. Adaptive processes could also explain the distribution of thresholds in Figure 2: adaptive neurons in silent conditions would naturally adapt to have their lowest thresholds and highest gains.
Adaptative coding strategies are commonly used in biological sensory systems and take numerous forms. Modulating the amount of signal energy entering the sensory system transduction apparatus is an approach used by both visual and auditory systems. The familiar pupillary light reflex represents a negative feedback circuit that constricts the pupil of the eye in response to bright light (Loewy, 1990). Two acoustic reflexes similar to the pupillary reflex dampen middle ear vibrations in the face of loud sounds: the tensor tympani reflex and the stapedial reflex. These reflexes appear to be much more dynamic than the pupillary reflex, however, contracting rapidly after the onset of loud sounds and synchronously with the speaker’s own vocalizations (Sellari-Franceschini, Bruschini et al., 1986; Silman, 1984). Presumably the adaptive nature of these reflexes is well-suited to the temporal dynamics of natural acoustics, which might reasonably require faster reaction for acoustic stimulation than the pupillary reflex requires for light stimulation.
Sensory adaptation also commonly occurs in the receptors themselves. Mechanoreceptors of the somatosensory system, for example, adapt their responses to recent pressure magnitude, apparently by alterations in their membrane properties that affect depolarization (Nakajima and Onodera, 1969). Chemical receptors for taste and smell desensitize when a ligand activating them is present for extended periods of time (Borisy, Hwang et al., 1993; Song, Cygnar et al., 2008; Wei, Zhao et al., 1998). In the visual system, photoreceptors convert light energy into a molecular conformational change that signals the presence of a photon. The photoreceptors become less sensitive upon transducing at a high rate, which occurs at higher light intensities and can be regulated by additional chemical processes (Kefalov, Estevez et al., 2005; Pepperberg, 2003). Sound is converted in the mammalian ear into neuronal signals through a mechanotransduction mechanism whereby cochlear fluid vibrations are converted into transmembrane ion flows in hair cells of the inner ear through special ion channels sensitive to mechanical forces. The mechanical properties of this transduction mechanism, as well as the chemical properties of the hair cells themselves, appear to adapt to recent stimulus history in a way that can affect the gain of this mechanism (Eatock, 2000; Holt and Corey, 2000; Le Goff, Bozovic et al., 2005).
This approach of adjusting signal gain at the initial stages of processing in order to center the much lower dynamic ranges of coding elements upon the mean signal intensity is a logical one and also represents a standard approach deployed in engineered systems. The use of automatic gain control (AGC) circuits, of which there are many types, has automated numerous processes of engineered gain adaptation. For example, photographic film and modern imaging sensors have dynamic ranges far smaller than the intensity ranges found in nature, hence the need to use an aperture to modulate the total amount of light entering the camera lens and center the dynamic range of the scene onto the sensor. Photographic camera apertures previously needed adjustment by hand to avoid overexposing or underexposing film. Modern cameras virtually all have some kind of light meter coupled to an electronically controlled aperture through an AGC circuit. AGC circuits are also present in modern telephones, hearing aids, voice recorders, etc., and simplify the process of using such devices by insulating the user from the need to adjust gain by hand. One potential drawback of this approach, however, is that the memory inherent in AGCs introduces a nonlinearity that can distort some signals excessively.
Downstream adaptation to stimulus statistics can take multiple forms and can occur across different levels within a sensory system. Raw intensity is largely adapted out early in the visual pathway, and higher order adaptations such as to stimulus contrast become more prevalent at later processing stages (Baccus and Meister, 2002; Brown and Masland, 2001; Chander and Chichilnisky, 2001; Hosoya, Baccus et al., 2005; Kim and Rieke, 2001; Laughlin, 1989; Rieke and Rudd, 2009; Shapley and Victor, 1978; Smirnakis, Berry et al., 1997; Solomon, Peirce et al., 2004). This scenario is advantageous for the visual system because images are formed by contrasts between light and dark areas rather than overall intensity. A pattern recognizer for a face, for example, would ideally have similar performance in a dim room as on a bright day so long as sufficient contrast existed in the image. Early removal of absolute intensity is an evolutionarily adaptive and efficient solution for extending the dynamic range of individual photoreceptors to create the full range available to the visual system (Hosoya, Baccus et al., 2005). This intensity information is still acquired and transmitted to the brain by an alternate pathway, which enables perceptual judgments of brightness as well as proper function of the pupillary light reflex.
A similar argument can be made for somatosensation, where changes in absolute pressure might occur with some low frequency but not represent the most relevant sensory information. Sitting, wearing clothes or holding an item are all actions that will create substantial pressures on the mechanoreptors in the skin. While some information about absolute pressure detected by the skin would be important to represent (e.g., to be able to pick up an egg without crushing it), the most relevant ongoing information to represent is likely to be fluctuations in pressure at relatively short time scales (e.g., to determine if a recently touched surface is smooth or rough). This is equivalent to saying that the AC signal is likely to carry more useful information than the DC signal or that low-probability (novel) stimuli may have more behavioral relevance than signals that have been present for some time.
Adaptation in response to recent stimulus history also occurs in the auditory system and can be detected in auditory nerve fibers (Relkin and Doucet, 1991; Rhode and Smith, 1985; Wen, Wang et al., 2009) in addition to more centrally located neurons (Dean, Harper et al., 2005; Dean, Robinson et al., 2008; Kvale and Schreiner, 2004; Watkins and Barbour, 2008; Watkins and Barbour, 2010a). Like adaptation in other sensory systems, auditory adaptation appears to be present within the transduction mechanism itself (Goutman and Glowatzki, 2007), as well as downstream (Bartlett and Wang, 2005; Nelson, Smith et al., 2009; Ulanovsky, Las et al., 2004; Xu, Kotak et al., 2007). This adaptation results in neuronal input/output functions being modified to align their regions of maximum slope more closely with the most probable stimuli, thereby maximizing discriminability for the most probable stimuli as well as overall information transmission (Brenner, Bialek et al., 2000; Dean, Harper et al., 2005; Dean, Robinson et al., 2008; Kvale and Schreiner, 2004). This is equivalent to saying that the distribution shown in Figure 2 becomes less leftward skewed (i.e., the mean threshold increases) as the average sound intensity increases. Important exceptions to this rule have been reported for some response types, however, that preserve sensitivity to lower probability stimuli of presumably significant behavioral relevance (Watkins and Barbour, 2008; Watkins and Barbour, 2010a). This sensitivity preservation could be particularly useful under conditions of multiple concurrent dynamic sounds, particularly if each sound has a different mean intensity. Consider, for example, how the two stimuli depicted in Figure 3 would best be encoded by elements of narrow dynamic ranges if they were present simultaneously.
Unique psychophysical phenomena can be used to infer adaptive physiological mechanisms (Wolfson and Graham, 2007; Wolfson and Graham, 2009) and, conversely, physiological evidence for adaptive phenomena can be used to predict the existence of particular psychophysical phenomena (Zeng, Turner et al., 1991). In the latter case, neurophysiological experimental results indicated that high-spontaneous rate (low-threshold) auditory nerve fibers recover from adaptation induced by a loud preceding sound an order of magnitude faster than low-spontaneous rate (high-threshold) fibers (Relkin and Doucet, 1991). This kind of experimental protocol is termed forward-masking to signify that the output of interest is the threshold shift or “masking” induced forward in time by a preceding stimulus. Very shortly following a loud sound, then, psychophysical hearing thresholds are masked because all the neuronal thresholds have been shifted upward (Figure 4A,B). Waiting a bit longer, however, (up to 400 ms in these experiments) creates in intriguing situation where the most sensitive neurons have almost completely recovered their initial thresholds while the least sensitive neurons have not yet begun to recover (Figure 4C). The result is about a 20 dB “coding gap” where almost all of the auditory neurons are either saturated or nonresponsive. A straightforward prediction of this neurophysiological finding is that psychophysical thresholds would be expected to be elevated at this intermediate intensity range for the time scales in question. Thresholds for greater or lesser intensities than the gap location would be lower, thereby creating a nonmonotonic threshold curve as a function of intensity. When intensity discrimination thresholds were measured in human listeners shortly after an intense masking stimulus, this hypothesis was borne out robustly (Zeng, Turner et al., 1991). Given that intensity-tuned neurons all the way to auditory cortex appear to be the primary inheritors of high-spontaneous rate/low-threshold auditory nerve fiber activity (Watkins and Barbour, 2010b), these central neurons might be expected to have distinct adaptive properties, as well. This prediction has been shown to be correct, leading to the surprising finding that intensity-tuned neurons adapt in such a way not to maximize encoding of overall stimulus statistics, but to maximize encoding of the particular case of soft sounds preceded by loud sounds, much as in the case of the forward masking paradigm (Watkins and Barbour, 2008; Watkins and Barbour, 2010a). The complete role of these neurons in encoding dynamic acoustic stimuli remains the subject of further investigation.
Peculiar adaptations such as these bear upon issues of efficient coding with limited resources and sensory discrimination across stimulus intensity but do not directly address the issue of invariance in perception across intensity. In fact, given the previous discussion that the neural code actually changes depending upon the temporal stimulus context and multiple strategies to implement optimal stimulus encoding, achieving an invariant neural representation across intensity likely represents a considerable challenge for downstream circuits to implement. This task is made considerably more difficult in the auditory system because of the intensity-dependence of neuronal receptive fields beginning at the auditory periphery.
4. Variation in Neuronal Responses Across Intensity
The nature of the mechanical vibrations in the cochlea leads to a map of acoustic frequency onto position along the cochlea. The nonlinear nature of cochlear mechanics leads to an alteration in vibration pattern as sound intensities increase such that cochlear auditory filters become less selective to frequency (i.e., filters have increasing bandwidth) as stimulus intensity increases (Glasberg and Moore, 1990; Kiang, Watanabe et al., 1965; Liberman and Kiang, 1978). Potentially compounding this situation is a finding in songbird auditory neurons that not only does the spectral filter change with intensity, but so can the temporal filter, most likely due to additional nonlinearities (Nagel and Doupe, 2006). While intensity represents a straightforward stimulus parameter to normalize out in engineered linear systems (or even in some engineered nonlinear systems such as those with automatic gain control circuits), the auditory system presumably must dedicate substantial neural resources toward inverting nonlinearities introduced in the auditory periphery by the sensory transduction apparatus.
All auditory nerve fibers exhibit an increasing bandwidth with increasing intensity, resulting in a filter shape resembling a “V” when response is plotted as a function of both intensity and frequency. Many central auditory neurons also share this response characteristic, even in auditory cortex. Another set of central auditory neuronal response classes, however, has been described by numerous authors as “level tolerant” because their bandwidths remain relatively constant with sound level or intensity (Ehret and Schreiner, 1997; Sadagopan and Wang, 2008; Suga and Manabe, 1982; Suga and Tsuzuki, 1985; Sutter, 2000). In some cases these neurons have a monotonic rate-intensity profile, while in others they are nonmonotonic or intensity-tuned. Taken together, these three classic response types have been classified as “Type V,” “Type I,” and “Type O” responses based upon the shape of their frequency response areas, with the latter two corresponding to level-tolerant neurons (Ramachandran, Davis et al., 1999). Model neurons of all three classes with the same characteristic frequency, threshold and bandwidth at 10 dB above threshold are depicted graphically in Figure 5.
Type I and type O neurons are not present in the auditory nerve and must therefore be created by neural circuitry in the auditory system. Sadagopan and Wang (2008) have suggested that the express purpose for the creation of these neurons is to counter the frequency-dependent nonlinearities in the cochlea that give rise to intensity-dependent bandwidths in the auditory nerve. This is a compelling hypothesis that is consistent with the additional hypothesis that auditory receptive field bandwidths gradually narrow along the ascending auditory pathway from the periphery to primary auditory cortex (Suga, 1995). Neural pathways in other sensory systems tend to produce increasing receptive field sizes more centrally through divergent projections, so the unique nature of the frequency-dependent nonlinearities in the auditory system may be the driving force behind the putative narrowing of auditory receptive fields. Questions remain regarding these hypotheses, however. Auditory neurons with nonmonotonic input/output functions are nonexistent in auditory nerve but become steadily more common at higher auditory centers until they represent a majority of the neurons in A1 (Watkins and Barbour, 2010b). Of considerable interest, however, is the sizable minority of neurons that still exhibit Type V responses similar to those observed in the auditory nerve. Why would this type of response be valuable for the auditory system to retain all the way to cortex if a major purpose of the intervening circuitry was to eliminate that type of response?
Furthermore, the type of decoder necessary to extract intensity-invariant natural stimuli from any of these three neuronal subpopulations is not entirely clear. Figure 6 depicts a plausible map of characteristic frequency, bandwidth and threshold of model neurons in a square grid representing A1 (Chen, Watkins et al., 2010). This version of the map assumes a uniform distribution of thresholds, as depicted in Figure 1. By combining the models of neuronal responses depicted in Figure 5 with the map in Figure 6, a graphic depiction can be obtained of the spatial activity of neurons driven by any particular stimulus. In fact, one could imagine performing a functional imaging experiment on a similarly arrayed collection of real neurons to discern the pattern of activation for different stimuli. A simulated version of such an experiment was done in this case for two steady-state vowels at two different intensities. Each model neuron was assumed to act as a simple linear integrator of stimulus energy falling within its frequency response area (i.e., the dark areas plotted in Figure 5). Population response patterns for each neuronal subtype were calculated separately, as was a mixture of all three types combined in equal proportion. The results can be seen in Figure 7.
The vowel-driven population response patterns for linear Type V and Type I neurons appear to be rather similar to one another for a given stimulus, especially at higher sound levels. This finding was somewhat surprising given the extremely wide bandwidths of Type V neurons at higher intensities, but could potentially be brought about by the wideband nature of the stimulus. Type V neurons do result in a sparser population response at lower intensities, as would be expected from their tapered frequency response areas at the lowest intensities. Type O neurons represent a sparser population at higher intensities. Because of the nature of the model, however, all three neuronal classes exhibit overcomplete representations of the stimuli. Noticeable is how similar the response patterns of all the subtypes and the pooled population are to perceptually different stimuli at the same intensity. Additionally, the response patterns to the same stimuli at different intensities exhibit little resemblance to one another. For any vowel, type O responses show a unique pattern of activation at different intensities. The similarities between Type O responses to different vowels at the same intensity, however, are just as striking as the differences between responses to the same vowels at different intensities, raising a logical question regarding what kind of advantage for downstream decoding the “place code” Type O neurons might provide relative to other response classes (Shamma, 2003). In fact, a level-invariant decoder based upon Type O responses would not have a fundamentally different structure than for Type I and possibly even Type V responses.
Known flaws exist in this computational model that remove it somewhat from physiological reality. Realistic silence-adapted threshold distributions were not taken into account, for example; had they been, though, the model would predict even more population activity for the vowel stimuli. Furthermore, adaptation phenomena were not taken into account in this model. Cortical neurons are also unlikely to be purely linear integrators of sound energy at different frequencies and intensities, especially for wideband sounds such as those used here. In fact, studies of auditory receptive fields using wideband stimuli at different intensities have revealed intensity invariance, even in the auditory nerve (Barbour and Wang, 2003; Calhoun, Miller et al., 1998; Valentine and Eggermont, 2004; Yu and Young, 2000). This phenomenon is apparently mediated by adaptive processes away from a neuron’s characteristic frequency (Blake and Merzenich, 2002). In like fashion, songbird vocalizations appear to be extracted reliably across intensity by some auditory neurons (Billimoria, Kraus et al., 2008). Frequency-dependent nonlinearities in the cochlea therefore result in intensity-dependent neuronal bandwidths, which appear to be most prominent when measured with pure tones in isolation and are not as prominent when other stimuli are used. The question remains whether the neurons that account for invariance in perception across intensity have already been identified in A1 and are simply in need of proper experimental probing to reveal this fact, or if more downstream circuits must be involved to decode the A1 responses. In any case, Figure 7 remains a graphic testament to the nature of the problem the auditory system must solve. Stated another way that can be used to formulate specific research questions, what nonlinearities can be added to this linear model to create an intensity-invariant neural code?
One interesting parallel can be made between the auditory system and another sensory system along these lines. Like the auditory system but unlike somatosensation or vision, receptors in olfaction also lose specificity at higher odorant concentrations, and this phenomenon has actually been visualized by real functional imaging studies like the one simulated in Figure 7 (Ng, Roorda et al., 2002). Like the auditory system, downstream neural circuitry appears to be performing a sparsification on the stimulus representation, resulting in more narrowly “tuned” neurons than receptors. One proposal for how this might occur is that some downstream circuits are specialized to evaluate ratios of odorant concentrations rather than absolute concentrations (Uchida and Mainen, 2007). This idea could be the olfactory equivalent of the observation that central auditory neuron receptive fields tend to maintain the same bandwidth when measured by wideband stimuli. Nevertheless, the auditory and olfactory systems are still quite capable of processing single tones and single odorants in isolation, so this hypothesis is unlikely to account for the full abilities of the respective sensory systems. Modern methods of dynamical systems analysis may begin to reveal solutions to invariant stimulus encoding across stimulus intensity not with individual neurons but with dynamic networks of continually interacting neurons (Galan, Weidert et al., 2006; Stopfer, Jayaraman et al., 2003).
5. Concluding Remarks
The process by which images are parsed for behaviorally relevant content may be referred to as “object recognition.” The process by which this task takes place over noisy variations in signal power is known as “robust object recognition.” While a physical object may persist unaltered in the environment, variations in lighting, distance, viewing angle, orientation and intervening objects can dramatically alter the initial visual representation of that object. Nevertheless, the higher visual systems of many animals implement astounding robust object recognition engines.
An equivalent ability to segregate individual sound sources from interfering acoustic stimulation is possessed by the auditory system. The rules by which invariant perception is achieved by these two systems are likely to differ, but the result is similar. The intensity domain is one arena likely to differ substantially between vision and audition. Retinotopic-specific nonlinearities do not dramatically distort early visual representations the same way that frequency-specific nonlinearities distort early auditory representations. Intensity therefore reflects an intriguing stimulus parameter to ponder from the perspective of robust sound recognition and a challenging one to probe effectively. It is likely that in the process of peeling away the neural mechanisms underlying perceptual invariance of sounds in the face of variable intensity, other rules of stimulus encoding affecting perception will also be elucidated. The result will be an improved understanding of the neural representation of complex sounds available to downstream cognitive processes to assess, interpret and ultimately act upon.
Acknowledgments
Funding
This work was supported by National Institutes of Health grant R01-DC009215.
References
- Aitkin L. Rate-level functions of neurons in the inferior colliculus of cats measured with the use of free-field sound stimuli. J Neurophysiol. 1991;65(2):383–392. doi: 10.1152/jn.1991.65.2.383. [DOI] [PubMed] [Google Scholar]
- Ashmead DH, LeRoy D, Odom RD. Perception of the relative distances of nearby sound sources. Percept Psychophys. 1990;47(4):326–331. doi: 10.3758/bf03210871. [DOI] [PubMed] [Google Scholar]
- Baccus SA, Meister M. Fast and slow contrast adaptation in retinal circuitry. Neuron. 2002;36(5):909–919. doi: 10.1016/s0896-6273(02)01050-4. [DOI] [PubMed] [Google Scholar]
- Barbour DL, Wang X. Auditory cortical responses elicited in awake primates by random spectrum stimuli. J Neurosci. 2003;23(18):7194–7206. doi: 10.1523/JNEUROSCI.23-18-07194.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bartlett EL, Wang X. Long-lasting modulation by stimulus context in primate auditory cortex. J Neurophysiol. 2005;94(1):83–104. doi: 10.1152/jn.01124.2004. [DOI] [PubMed] [Google Scholar]
- Billimoria CP, Kraus BJ, Narayan R, Maddox RK, Sen K. Invariance and sensitivity to intensity in neural discrimination of natural sounds. J Neurosci. 2008;28(25):6304–6308. doi: 10.1523/JNEUROSCI.0961-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blake DT, Merzenich MM. Changes of AI receptive fields with sound density. J Neurophysiol. 2002;88(6):3409–3420. doi: 10.1152/jn.00233.2002. [DOI] [PubMed] [Google Scholar]
- Borisy FF, Hwang PN, Ronnett GV, Snyder SH. High-affinity cAMP phosphodiesterase and adenosine localized in sensory organs. Brain Res. 1993;610(2):199–207. doi: 10.1016/0006-8993(93)91401-d. [DOI] [PubMed] [Google Scholar]
- Brenner N, Bialek W, de Ruyter van Steveninck R. Adaptive rescaling maximizes information transmission. Neuron. 2000;26(3):695–702. doi: 10.1016/s0896-6273(00)81205-2. [DOI] [PubMed] [Google Scholar]
- Brown SP, Masland RH. Spatial scale and cellular substrate of contrast adaptation by retinal ganglion cells. Nat Neurosci. 2001;4(1):44–51. doi: 10.1038/82888. [DOI] [PubMed] [Google Scholar]
- Buus S, Florentine M. Psychometric functions for level discrimination. J Acoust Soc Am. 1991;90(3):1371–1380. doi: 10.1121/1.401928. [DOI] [PubMed] [Google Scholar]
- Calhoun BM, Miller RL, Wong JC, Young ED. Rate encoding of stimulus spectra by auditory nerve fibers. In: Palmer AR, Rees A, Summerfield AQ, Meddis R, editors. Psychophysical and Physiological Advances in Hearing. London: Whurr Publishers; 1998. pp. 170–177. [Google Scholar]
- Carney LH. Spatiotemporal encoding of sound level: models for normal encoding and recruitment of loudness. Hear Res. 1994;76(1–2):31–44. doi: 10.1016/0378-5955(94)90084-1. [DOI] [PubMed] [Google Scholar]
- Chander D, Chichilnisky EJ. Adaptation to temporal contrast in primate and salamander retina. J Neurosci. 2001;21(24):9904–9916. doi: 10.1523/JNEUROSCI.21-24-09904.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen TL, Watkins PV, Barbour DL. Theoretical limitations on functional imaging resolution in auditory cortex. Brain Res. 2010;1319:175–189. doi: 10.1016/j.brainres.2010.01.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Colburn HS, Carney LH, Heinz MG. Quantifying the information in auditory-nerve responses for level discrimination. J Assoc Res Otolaryngol. 2003;4(3):294–311. doi: 10.1007/s10162-002-1090-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Costalupes JA, Young ED, Gibson DJ. Effects of continuous noise backgrounds on rate response of auditory nerve fibers in cat. J Neurophysiol. 1984;51(6):1326–1344. doi: 10.1152/jn.1984.51.6.1326. [DOI] [PubMed] [Google Scholar]
- Dean I, Harper NS, McAlpine D. Neural population coding of sound level adapts to stimulus statistics. Nat Neurosci. 2005;8(12):1684–1689. doi: 10.1038/nn1541. [DOI] [PubMed] [Google Scholar]
- Dean I, Robinson BL, Harper NS, McAlpine D. Rapid neural adaptation to sound level statistics. J Neurosci. 2008;28(25):6430–6438. doi: 10.1523/JNEUROSCI.0470-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eatock RA. Adaptation in hair cells. Annu Rev Neurosci. 2000;23:285–314. doi: 10.1146/annurev.neuro.23.1.285. [DOI] [PubMed] [Google Scholar]
- Ehret G, Schreiner CE. Frequency resolution and spectral integration (critical band analysis) in single units of the cat primary auditory cortex. J Comp Physiol [A] 1997;181(6):635–650. doi: 10.1007/s003590050146. [DOI] [PubMed] [Google Scholar]
- Evans EF. The frequency response and other properties of single fibres in the guinea-pig cochlear nerve. J Physiol. 1972;226(1):263–287. doi: 10.1113/jphysiol.1972.sp009984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Florentine M, Buus S, Mason CR. Level discrimination as a function of level for tones from 0.25 to 16 kHz. J Acoust Soc Am. 1987;81(5):1528–1541. doi: 10.1121/1.394505. [DOI] [PubMed] [Google Scholar]
- Galan RF, Weidert M, Menzel R, Herz AV, Galizia CG. Sensory memory for odors is encoded in spontaneous correlated activity between olfactory glomeruli. Neural Comput. 2006;18(1):10–25. doi: 10.1162/089976606774841558. [DOI] [PubMed] [Google Scholar]
- Geisler CD, Deng L, Greenberg SR. Thresholds for primary auditory fibers using statistically defined criteria. Journal of the Acoustical Society of America. 1985;77(3):1102–1109. doi: 10.1121/1.392228. [DOI] [PubMed] [Google Scholar]
- Geisler CD, Sinex DG. Responses of primary auditory fibers to combined noise and tonal stimuli. Hear Res. 1980;3(4):317–334. doi: 10.1016/0378-5955(80)90026-x. [DOI] [PubMed] [Google Scholar]
- Gibson DJ, Young ED, Costalupes JA. Similarity of dynamic range adjustment in auditory nerve and cochlear nuclei. J Neurophysiol. 1985;53(4):940–958. doi: 10.1152/jn.1985.53.4.940. [DOI] [PubMed] [Google Scholar]
- Glasberg BR, Moore BC. Derivation of auditory filter shapes from notched-noise data. Hear Res. 1990;47(1–2):103–138. doi: 10.1016/0378-5955(90)90170-t. [DOI] [PubMed] [Google Scholar]
- Glasberg BR, Moore BC. Prediction of absolute thresholds and equal-loudness contours using a modified loudness model. J Acoust Soc Am. 2006;120(2):585–588. doi: 10.1121/1.2214151. [DOI] [PubMed] [Google Scholar]
- Goutman JD, Glowatzki E. Time course and calcium dependence of transmitter release at a single ribbon synapse. Proc Natl Acad Sci U S A. 2007;104(41):16341–16346. doi: 10.1073/pnas.0705756104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hanna TE, von Gierke SM, Green DM. Detection and intensity discrimination of a sinusoid. J Acoust Soc Am. 1986;80(5):1335–1340. doi: 10.1121/1.394385. [DOI] [PubMed] [Google Scholar]
- Holt JR, Corey DP. Two mechanisms for transducer adaptation in vertebrate hair cells. Proc Natl Acad Sci U S A. 2000;97(22):11730–11735. doi: 10.1073/pnas.97.22.11730. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hosoya T, Baccus SA, Meister M. Dynamic predictive coding by the retina. Nature. 2005;436(7047):71–77. doi: 10.1038/nature03689. [DOI] [PubMed] [Google Scholar]
- Kefalov VJ, Estevez ME, Kono M, Goletz PW, Crouch RK, et al. Breaking the covalent bond--a pigment property that contributes to desensitization in cones. Neuron. 2005;46(6):879–890. doi: 10.1016/j.neuron.2005.05.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kiang NYS, Watanabe T, Thomas EC, Clark LF. Discharge Patterns of Single Fibers in the Cat's Auditory Nerve. Cambridge, MA: The MIT Press; 1965. [Google Scholar]
- Kim KJ, Rieke F. Temporal contrast adaptation in the input and output signals of salamander retinal ganglion cells. J Neurosci. 2001;21(1):287–299. doi: 10.1523/JNEUROSCI.21-01-00287.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kvale MN, Schreiner CE. Short-term adaptation of auditory receptive fields to dynamic stimuli. J Neurophysiol. 2004;91(2):604–612. doi: 10.1152/jn.00484.2003. [DOI] [PubMed] [Google Scholar]
- Laughlin SB. The role of sensory adaptation in the retina. J Exp Biol. 1989;146:39–62. doi: 10.1242/jeb.146.1.39. [DOI] [PubMed] [Google Scholar]
- Le Goff L, Bozovic D, Hudspeth AJ. Adaptive shift in the domain of negative stiffness during spontaneous oscillation by hair bundles from the internal ear. Proc Natl Acad Sci U S A. 2005;102(47):16996–17001. doi: 10.1073/pnas.0508731102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liberman MC. Auditory-nerve response from cats raised in a low-noise chamber. J Acoust Soc Am. 1978;63(2):442–455. doi: 10.1121/1.381736. [DOI] [PubMed] [Google Scholar]
- Liberman MC, Kiang NY. Acoustic trauma in cats. Cochlear pathology and auditory-nerve activity. Acta Otolaryngol Suppl. 1978;358:1–63. [PubMed] [Google Scholar]
- Litovsky RY, Clifton RK. Use of sound-pressure level in auditory distance discrimination by 6-month-old infants and adults. J Acoust Soc Am. 1992;92(2 Pt 1):794–802. doi: 10.1121/1.403949. [DOI] [PubMed] [Google Scholar]
- Loewy AD. Autonomic control of the eye. In: Loewy AD, Spyer KM, editors. Central Regulation of Autonomic Functions. 1990. pp. 268–285. [Google Scholar]
- Mershon DH, Bowers JN. Absolute and relative cues for the auditory perception of egocentric distance. Perception. 1979;8(3):311–322. doi: 10.1068/p080311. [DOI] [PubMed] [Google Scholar]
- Mershon DH, Desaulniers DH, Kiefer SA, Amerson TL, Jr, Mills JT. Perceived loudness and visually-determined auditory distance. Perception. 1981;10(5):531–543. doi: 10.1068/p100531. [DOI] [PubMed] [Google Scholar]
- Nagel KI, Doupe AJ. Temporal processing and adaptation in the songbird auditory forebrain. Neuron. 2006;51(6):845–859. doi: 10.1016/j.neuron.2006.08.030. [DOI] [PubMed] [Google Scholar]
- Nakajima S, Onodera K. Adaptation of the generator potential in the crayfish stretch receptors under constant length and constant tension. J Physiol. 1969;200(1):187–204. doi: 10.1113/jphysiol.1969.sp008688. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nelson PC, Smith ZM, Young ED. Wide-dynamic-range forward suppression in marmoset inferior colliculus neurons is generated centrally and accounts for perceptual masking. J Neurosci. 2009;29(8):2553–2562. doi: 10.1523/JNEUROSCI.5359-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ng M, Roorda RD, Lima SQ, Zemelman BV, Morcillo P, et al. Transmission of olfactory information between three populations of neurons in the antennal lobe of the fly. Neuron. 2002;36(3):463–474. doi: 10.1016/s0896-6273(02)00975-3. [DOI] [PubMed] [Google Scholar]
- Pepperberg DR. Bleaching desensitization: background and current challenges. Vision Res. 2003;43(28):3011–3019. doi: 10.1016/s0042-6989(03)00484-x. [DOI] [PubMed] [Google Scholar]
- Phillips DP. Temporal response features of cat auditory cortex neurons contributing to sensitivity to tones delivered in the presence of continuous noise. Hear Res. 1985;19(3):253–268. doi: 10.1016/0378-5955(85)90145-5. [DOI] [PubMed] [Google Scholar]
- Phillips DP, Cynader MS. Some neural mechanisms in the cat's auditory cortex underlying sensitivity to combined tone and wide-spectrum noise stimuli. Hear Res. 1985;18(1):87–102. doi: 10.1016/0378-5955(85)90112-1. [DOI] [PubMed] [Google Scholar]
- Phillips DP, Hall SE. Spike-rate intensity functions of cat cortical neurons studied with combined tone-noise stimuli. J Acoust Soc Am. 1986;80(1):177–187. doi: 10.1121/1.394178. [DOI] [PubMed] [Google Scholar]
- Phillips DP, Semple MN, Calford MB, Kitzes LM. Level-dependent representation of stimulus frequency in cat primary auditory cortex. Exp Brain Res. 1994;102(2):210–226. doi: 10.1007/BF00227510. [DOI] [PubMed] [Google Scholar]
- Plack CJ, Carlyon RP. Loudness Perception and Intensity Coding. In: Moore BCJ, editor. Hearing. San Diego: Academic Press, Inc.; 1995. pp. 123–156. [Google Scholar]
- Ramachandran R, Davis KA, May BJ. Single-unit responses in the inferior colliculus of decerebrate cats. I. Classification based on frequency response maps. J Neurophysiol. 1999;82(1):152–163. doi: 10.1152/jn.1999.82.1.152. [DOI] [PubMed] [Google Scholar]
- Ramachandran R, Davis KA, May BJ. Rate representation of tones in noise in the inferior colliculus of decerebrate cats. J Assoc Res Otolaryngol. 2000;1(2):144–160. doi: 10.1007/s101620010029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rees A, Palmer AR. Rate-intensity functions and their modification by broadband noise for neurons in the guinea pig inferior colliculus. J Acoust Soc Am. 1988;83(4):1488–1498. doi: 10.1121/1.395904. [DOI] [PubMed] [Google Scholar]
- Relkin EM, Doucet JR. Recovery from prior stimulation. I: Relationship to spontaneous firing rates of primary auditory neurons. Hear Res. 1991;55(2):215–222. doi: 10.1016/0378-5955(91)90106-j. [DOI] [PubMed] [Google Scholar]
- Rhode WS, Smith PH. Characteristics of tone-pip response patterns in relationship to spontaneous rate in cat auditory nerve fibers. Hear Res. 1985;18(2):159–168. doi: 10.1016/0378-5955(85)90008-5. [DOI] [PubMed] [Google Scholar]
- Rieke F, Rudd ME. The challenges natural images pose for visual adaptation. Neuron. 2009;64(5):605–616. doi: 10.1016/j.neuron.2009.11.028. [DOI] [PubMed] [Google Scholar]
- Sachs MB, Abbas PJ. Rate versus level functions for auditory-nerve fibers in cats: tone-burst stimuli. J Acoust Soc Am. 1974;56(6):1835–1847. doi: 10.1121/1.1903521. [DOI] [PubMed] [Google Scholar]
- Sadagopan S, Wang X. Level invariant representation of sounds by populations of neurons in primary auditory cortex. J Neurosci. 2008;28(13):3415–3426. doi: 10.1523/JNEUROSCI.2743-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sellari-Franceschini S, Bruschini P, Pardini L, Berrettini S. Quantification of the parameters of the acoustic reflex in normal ears. Audiology. 1986;25(3):165–175. doi: 10.3109/00206098609078383. [DOI] [PubMed] [Google Scholar]
- Shamma SA. Auditory cortex. In: Arbib MA, editor. The Handbook of Brain Theory and Neural Networks. Cambridge, Massachusetts: The MIT Press; 2003. pp. 122–127. [Google Scholar]
- Shapley RM, Victor JD. The effect of contrast on the transfer properties of cat retinal ganglion cells. J Physiol. 1978;285:275–298. doi: 10.1113/jphysiol.1978.sp012571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Silman S, editor. The Acoustic Reflex: Basic Principles and Clinical Applications. Orlando: Academic Press; 1984. [Google Scholar]
- Smirnakis SM, Berry MJ, Warland DK, Bialek W, Meister M. Adaptation of retinal processing to image contrast and spatial scale. Nature. 1997;386(6620):69–73. doi: 10.1038/386069a0. [DOI] [PubMed] [Google Scholar]
- Solomon SG, Peirce JW, Dhruv NT, Lennie P. Profound contrast adaptation early in the visual pathway. Neuron. 2004;42(1):155–162. doi: 10.1016/s0896-6273(04)00178-3. [DOI] [PubMed] [Google Scholar]
- Song Y, Cygnar KD, Sagdullaev B, Valley M, Hirsh S, et al. Olfactory CNG channel desensitization by Ca2+/CaM via the B1b subunit affects response termination but not sensitivity to recurring stimulation. Neuron. 2008;58(3):374–386. doi: 10.1016/j.neuron.2008.02.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stopfer M, Jayaraman V, Laurent G. Intensity versus identity coding in an olfactory system. Neuron. 2003;39(6):991–1004. doi: 10.1016/j.neuron.2003.08.011. [DOI] [PubMed] [Google Scholar]
- Strybel TZ, Perrott DR. Discrimination of relative distance in the auditory modality: the success and failure of the loudness discrimination hypothesis. J Acoust Soc Am. 1984;76(1):318–320. doi: 10.1121/1.391064. [DOI] [PubMed] [Google Scholar]
- Suga N. Philosophy and stimulus design for neuroethology of complex-sound processing. Philos Trans R Soc Lond B Biol Sci. 1992;336(1278):423–428. doi: 10.1098/rstb.1992.0078. [DOI] [PubMed] [Google Scholar]
- Suga N. Sharpening of frequency tuning by inhibition in the central auditory system: tribute to Yasuji Katsuki. Neurosci Res. 1995;21(4):287–299. doi: 10.1016/0168-0102(94)00868-g. [DOI] [PubMed] [Google Scholar]
- Suga N, Manabe T. Neural basis of amplitude-spectrum representation in auditory cortex of the mustached bat. J Neurophysiol. 1982;47(2):225–255. doi: 10.1152/jn.1982.47.2.225. [DOI] [PubMed] [Google Scholar]
- Suga N, Tsuzuki K. Inhibition and level-tolerant frequency tuning in the auditory cortex of the mustached bat. J Neurophysiol. 1985;53(4):1109–1145. doi: 10.1152/jn.1985.53.4.1109. [DOI] [PubMed] [Google Scholar]
- Sutter ML. Shapes and level tolerances of frequency tuning curves in primary auditory cortex: quantitative measures and population codes. J Neurophysiol. 2000;84(2):1012–1025. doi: 10.1152/jn.2000.84.2.1012. [DOI] [PubMed] [Google Scholar]
- Uchida N, Mainen ZF. Odor concentration invariance by chemical ratio coding. Front Syst Neurosci. 2007;1:3. doi: 10.3389/neuro.06.003.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ulanovsky N, Las L, Farkas D, Nelken I. Multiple time scales of adaptation in auditory cortex neurons. J Neurosci. 2004;24(46):10440–10453. doi: 10.1523/JNEUROSCI.1905-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Valentine PA, Eggermont JJ. Stimulus dependence of spectro-temporal receptive fields in cat primary auditory cortex. Hear Res. 2004;196(1–2):119–133. doi: 10.1016/j.heares.2004.05.011. [DOI] [PubMed] [Google Scholar]
- Viemeister NF. Intensity coding and the dynamic range problem. Hear Res. 1988a;34(3):267–274. doi: 10.1016/0378-5955(88)90007-x. [DOI] [PubMed] [Google Scholar]
- Viemeister NF. Psychophysical aspects of auditory intensity coding. In: Edelman GM, Gall WE, Cowan WM, editors. The Auditory System: Development, Physiology and Psychophysics. New York: Wiley; 1988b. [Google Scholar]
- Viemeister NF, Bacon SP. Intensity discrimination, increment detection, and magnitude estimation for 1-kHz tones. J Acoust Soc Am. 1988;84(1):172–178. doi: 10.1121/1.396961. [DOI] [PubMed] [Google Scholar]
- Watkins PV, Barbour DL. Specialized neuronal adaptation for preserving input sensitivity. Nat Neurosci. 2008;11(11):1259–1261. doi: 10.1038/nn.2201. [DOI] [PubMed] [Google Scholar]
- Watkins PV, Barbour DL. Level-tuned neurons in primary auditory cortex adapt differently to loud versus soft sounds. Cereb Cortex. 2010a;21(1):178–190. doi: 10.1093/cercor/bhq079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watkins PV, Barbour DL. Rate-level responses in awake marmoset auditory cortex. Hear Res. 2010b doi: 10.1016/j.heares.2010.11.011. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watkins PV, Chen TL, Barbour DL. A computational framework for topographies of cortical areas. Biol Cybern. 2009;100(3):231–248. doi: 10.1007/s00422-009-0294-9. [DOI] [PubMed] [Google Scholar]
- Wei J, Zhao AZ, Chan GC, Baker LP, Impey S, et al. Phosphorylation and inhibition of olfactory adenylyl cyclase by CaM kinase II in Neurons: a mechanism for attenuation of olfactory signals. Neuron. 1998;21(3):495–504. doi: 10.1016/s0896-6273(00)80561-9. [DOI] [PubMed] [Google Scholar]
- Wen B, Wang GI, Dean I, Delgutte B. Dynamic range adaptation to sound level statistics in the auditory nerve. J Neurosci. 2009;29(44):13797–13808. doi: 10.1523/JNEUROSCI.5610-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wolfson SS, Graham N. An unusual kind of contrast adaptation: shifting a contrast comparison level. J Vis. 2007;7(8):12. doi: 10.1167/7.8.12. [DOI] [PubMed] [Google Scholar]
- Wolfson SS, Graham N. Two contrast adaptation processes: contrast normalization and shifting, rectifying contrast comparison. J Vis. 2009;9(4):30, 31–23. doi: 10.1167/9.4.30. [DOI] [PubMed] [Google Scholar]
- Xu H, Kotak VC, Sanes DH. Conductive hearing loss disrupts synaptic and spike adaptation in developing auditory cortex. J Neurosci. 2007;27(35):9417–9426. doi: 10.1523/JNEUROSCI.1992-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Young ED, Barta PE. Rate responses of auditory nerve fibers to tones in noise near masked threshold. J Acoust Soc Am. 1986;79(2):426–442. doi: 10.1121/1.393530. [DOI] [PubMed] [Google Scholar]
- Yu JJ, Young ED. Linear and nonlinear pathways of spectral information transmission in the cochlear nucleus. Proc Natl Acad Sci U S A. 2000;97(22):11780–11786. doi: 10.1073/pnas.97.22.11780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zahorik P, Kelly JW. Accurate vocal compensation for sound intensity loss with increasing distance in natural environments. J Acoust Soc Am. 2007;122(5):EL143–EL150. doi: 10.1121/1.2784148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zeng FG, Turner CW, Relkin EM. Recovery from prior stimulation. II: Effects upon intensity discrimination. Hear Res. 1991;55(2):223–230. doi: 10.1016/0378-5955(91)90107-k. [DOI] [PubMed] [Google Scholar]
- Zhang C, Zeng FG. Loudness of dynamic stimuli in acoustic and electric hearing. J Acoust Soc Am. 1997;102(5 Pt 1):2925–2934. doi: 10.1121/1.420347. [DOI] [PubMed] [Google Scholar]