Skip to main content
The Journal of Neuroscience logoLink to The Journal of Neuroscience
. 2024 Nov 8;45(2):e1502232024. doi: 10.1523/JNEUROSCI.1502-23.2024

Distinct Inhibitory Neurons Differently Shape Neuronal Codes for Sound Intensity in the Auditory Cortex

Melanie Tobin 1, Janaki Sheth 1, Katherine C Wood 1, Erin K Michel 1, Maria N Geffen 1,2,3,
PMCID: PMC11714344  PMID: 39516042

Abstract

Cortical circuits contain multiple types of inhibitory neurons which shape how information is processed within neuronal networks. Here, we asked whether somatostatin-expressing (SST) and vasoactive intestinal peptide-expressing (VIP) inhibitory neurons have distinct effects on population neuronal responses to noise bursts of varying intensities. We optogenetically stimulated SST or VIP neurons while simultaneously measuring the calcium responses of populations of hundreds of neurons in the auditory cortex (AC) of male and female awake, head-fixed mice to sounds. Upon SST neuronal activation, noise burst representations became more discrete for different intensity levels, relying on cell identity rather than strength. By contrast, upon VIP neuronal activation, noise bursts of different intensity levels activated overlapping neuronal populations, albeit at different response strengths. At the single-cell level, SST and VIP neuronal activation differentially modulated the response-level curves of monotonic and nonmonotonic neurons. SST neuronal activation effects were consistent with a shift of the neuronal population responses toward a more localist code with different cells responding to sounds of different intensities. By contrast, VIP neuronal activation shifted responses toward a more distributed code, in which sounds of different intensity levels are encoded in the relative response of similar populations of cells. These results delineate how distinct inhibitory neurons in the AC dynamically control cortical population codes. Different inhibitory neuronal populations may be recruited under different behavioral demands, depending on whether categorical or invariant representations are advantageous for the task.

Keywords: auditory cortex, computational neuroscience, imaging, localist code, optogenetics, sparse coding

Significance Statement

Information about sounds is represented in the auditory cortex by neuronal population activity that has a characteristic sparse structure. Cortical neuronal populations comprise multiple types of excitatory and inhibitory neurons. Here, we find that activating different types of inhibitory neurons differentially controls population neuronal representations, with one type of inhibitory neurons increasing the differences in the identity of the cells recruited to represent the different sounds and another inhibitory neuron type changing the relative activity level of overlapping neuronal populations. Such transformations may be beneficial for different types of auditory behaviors, suggesting that these different types of inhibitory neurons may be recruited under different behavioral constraints in optimizing neuronal representations of sounds.

Introduction

Sensory cortical neuronal networks are comprised of multiple subtypes of neurons, including excitatory and inhibitory neurons. Inhibitory neurons can be further divided into multiple subclasses, including somatostatin-expressing (SST) and vasoactive intestinal peptide-expressing (VIP) neurons, which mutually inhibit each other (Campagnola et al., 2022). The activity of these neurons modulates stimulus representations in the auditory cortex (AC). Specifically, activating SST neurons reduces and decorrelates cortical activity (Chen et al., 2015), sharpens frequency tuning of AC neurons (Phillips and Hasenstaub, 2016), and contributes to surround suppression (Lakunina et al., 2020) and adaptation to stimulus context (Natan et al., 2015, 2017). By contrast, VIP neurons disinhibit excitatory neurons (Millman et al., 2020), largely via their projections onto SST neurons (Pfeffer et al., 2013; Pi et al., 2013), without affecting frequency tuning (Bigelow et al., 2019) and can enable high-excitability states in the cortex (Jackson et al., 2016). Both SST and VIP neurons can be modulated: by noradrenergic and cholinergic inputs for SST neurons (Kawaguchi and Shindou, 1998; Fanselow et al., 2008; Chen et al., 2015) and multiple neuromodulators for VIP neurons (Fu et al., 2014; Zhang et al., 2014; Chen et al., 2015) and therefore may serve differential modulatory functions in the context of different behavioral demands.

Sound pressure level representation supports sound detection, hearing in noise, source localization, and distance to target calculation (Litovsky and Clifton, 1992). Most neurons in AC respond selectively to sounds at different sound pressure levels, either in a monotonic or a nonmonotonic fashion. Monotonic neurons increase their firing rate with sound intensity, differing in their threshold and slope of the response functions. Nonmonotonic neurons exhibit preference for specific sound pressure level ranges, differing in their preferred sound pressure level (Zhang et al., 2013). Previous work found that a mix of monotonic and nonmonotonic neurons in AC is important for sound encoding (Sun et al., 2017). Because the excitatory neurons in the cortex form tightly connected circuits with inhibitory neurons, inhibitory neuronal activity can shift the sound-level response functions of excitatory neurons across monotonic and nonmonotonic neurons. These changes can in turn affect the representation of the sound pressure level by cortical populations. Whereas multiple studies have examined the effects of SST and VIP neuronal modulation on sound responses in individual neurons in AC (Natan et al., 2015, 2017; Seybold et al., 2015; Phillips and Hasenstaub, 2016; Bigelow et al., 2019; Millman et al., 2020; Seay et al., 2020), their effect on population representation of the sound pressure level remains to be fully understood.

Here, we studied whether and how cortical inhibitory neurons control the representation of sound pressure levels in populations of neurons, by presenting periodic noise bursts at different sound pressure levels to awake, head-fixed mice and imaging calcium responses in populations of hundreds of neurons while simultaneously activating SST or VIP neurons optogenetically (Figs. 1B, 2). First, we tested whether and how activation of SST or VIP neurons differentially modulated sound pressure level responses at the level of individual neuronal response functions in AC. Next, we tested how the representation of the sound pressure level changed in the neuronal space with and without SST and VIP neuronal activation. We tested for the effects of SST and VIP neuronal activation on response sparseness and separation angle of population response vectors. Finally, we tested whether and how changes in the response-level curves of monotonic and nonmonotonic individual neurons upon SST or VIP neuronal activation mediated the changes in representation at the population level for monotonic and nonmonotonic neurons. Our results suggest that SST and VIP neuronal activation differentially affect both monotonic and nonmonotonic neuronal sound pressure level response functions, thereby shifting the neuronal population codes between localist and distributed representations.

Figure 1.

Figure 1.

Experimental setup and optogenetic stimulation of interneuron populations. A, Two-photon imaging and laser stimulation through the round window of a mouse injected with viruses encoding Syn.jGCaMP7f and Flex.ChrimsonR.tdTomato in the left AC. Flex.ChrimsonR.tdTomato is injected in AC of SST-Cre and VIP-Cre mice and is activated by a 635 nm laser. The mouse lines used were SST-Cre × Cdh23+/+ and VIP-Cre × Cdh23+/+. A speaker delivers a broadband noise stimulus at sound pressure levels within 0–90 dB to the right ear. B, The cell tissue with two-photon imaging in the green channel (left) and cell identification (right) using the Suite2p software, with yellow lines delineating cell borders and red lines indicating the neurons expressing ChrimsonR.tdTomato. C, Left, The outline of the spread of the viral injection in a representative brain. Signal in the green channel (center panels) and red channel (right panels) of a SST-Cre mouse (top panels) and a VIP-Cre mouse (bottom panels). Cells identified as VIP or SST interneurons are indicated with an arrow (see Materials and Methods). D, Top, Diagrams for optogenetic manipulation in the circuit and experimental setup. Bottom left, Response of a SST neuron to no sound and sound stimulation (50 dB) when activating SST neurons with different laser powers. Bottom right, Average fluorescence over a 1-s fixed time window (delay from the stimulus onset: 90 ms) as a function of the sound pressure level for the example cell in the left panel. E, Left, Response of a sound-increasing neuron to no sound and sound stimulation (70 dB) when activating SST neurons with different laser powers. Right, Average fluorescence over a 1-s fixed time window (delay from the stimulus onset: 150 ms) as a function of the sound pressure level for the example cell in the left panel. F, Top, Diagrams for optogenetic manipulation in the circuit and experimental setup. Bottom left, Response of a VIP neuron to no sound and sound stimulation (70 dB) when activating VIP neurons with different laser powers. Bottom right, Average fluorescence over a 1-s fixed time window (delay from the stimulus onset: 300 ms) as a function of the sound pressure level for the example cell in the left panel. G, Left, Response of a sound-increasing neuron to no sound and sound stimulation (70 dB) when activating VIP neurons with different laser powers. Right, Average fluorescence over a 1-s fixed time window (delay from the stimulus onset: 270 ms) as a function of the sound pressure level for the example neuron in the left panel. H, Top, Experimental setup for the control experiment - laser effect in the absence of an opsin. Bottom, Average fluorescence over a 1-s fixed window as a function of the sound pressure level for the whole population of VIP neurons recorded, tagged with Flex.tdTomato, when the laser illuminates AC. I, Average fluorescence over a 1-s fixed window as a function of the sound pressure level for the whole population of neurons recorded (VIP neurons excluded) when the laser illuminates AC. For all panels, black, pink, and red colors correspond to no laser power (0 mW/mm2), medium laser power (∼0.3 mW/mm2), and high laser power (∼3.5 mW/mm2), respectively (see Materials and Methods for power calibration). The gray and red horizontal bars below the example traces in panels DG indicate the presence of the sound and laser stimulus, respectively.

Figure 2.

Figure 2.

Population response to SST and VIP neuronal activation and sparseness A. Rasters of the average fluorescence versus the sound pressure level for all neurons imaged in the SST-Cre mice, calculated over the 1-s fixed time window (see Materials and Methods). Rasters from left to right correspond to SST neuronal activation with no laser power, medium laser power, and high laser power. The thick blue line at the bottom of each raster indicates the SST interneurons. Cells are ordered given their response at 90 dB and no laser power. B, Absolute average fluorescence (solid lines) and change in average fluorescence relative to the laser and silence condition (dashed lines) over a 1-s fixed window as a function of the sound pressure level for the whole population of SST neurons recorded (132 neurons), when the SST neurons are activated. The solid and dashed black lines nearly overlap. C, Absolute average fluorescence (solid lines) and change in average fluorescence relative to the laser and silence condition (dashed lines) over a 1-s fixed window as a function of the sound pressure level for the whole population of non-SST neurons recorded (2,152 neurons), when the SST neurons are activated. D, Cumulative distribution function of sparseness normalized on the population of non-SST neurons, when the SST neurons are activated. Sparseness was defined for non-SST neurons with an increase in response to sound compared with silence at a given laser, corresponding to 2,033, 2,029, and 1,994 neurons for no, medium, and high laser, respectively. E, Average activity sparseness measured from silence at a given laser power as a function of the sound pressure level for each population of neurons (SST neurons excluded), when the SST neurons are activated. The points at 0 dB were by design 100% and thus omitted from the plot and the statistical test. F, Decoding accuracy of SVM decoder at each laser power, decoding individual sound pressure levels using non-SST neuronal responses within 1-s fixed window, when the laser illuminates AC and stimulates SST neurons. G, Average activity sparseness measured from baseline as a function of the sound pressure level for each population of neurons (SST neurons excluded), when the SST neurons are activated. There was a significant interaction between sound and laser amplitude, but no significant sound-independent laser effect in the activity sparseness measured from baseline upon SST neuronal activation (plaser = 0.37; *plaser:sound = 4.5 × 10−2; GLME). H, Rasters of the average fluorescence versus the sound pressure level for all neurons imaged in the VIP-Cre mice, calculated over the 1-s fixed time window (see Materials and Methods). Rasters from left to right correspond to VIP activation with no laser power, medium laser power and high laser power. The thick green line at the bottom of each raster indicates the VIP interneurons. Cells are ordered given their response at 90 dB and no laser power. I, Absolute average fluorescence (solid lines) and change in average fluorescence relative to the laser and silence condition (dashed lines) over a 1-s fixed window as a function of the sound pressure level for the whole population of VIP neurons recorded (226 neurons), when the VIP neurons are activated. The solid and dashed black lines nearly overlap. J, Absolute average fluorescence (solid lines) and change in average fluorescence relative to the laser and silence condition (dashed lines) over a 1-s fixed window as a function of the sound pressure level for the whole population of non-VIP neurons recorded (3,095 neurons), when the VIP neurons are activated. K, Cumulative distribution function of sparseness normalized on the population of non-VIP neurons, when the VIP neurons are activated. Sparseness was defined for non-VIP neurons with an increase in response to sound compared with silence at a given laser, corresponding to 2,979, 2,921, and 3,020 neurons for no, medium, and high laser, respectively. L, Average activity sparseness measured from silence at a given laser power as a function of the sound pressure level for each population of neurons (VIP neurons excluded) when the VIP neurons are activated. The points at 0 dB were by design 100% and thus omitted from the plot and the statistical test. M, Decoding accuracy of SVM decoder at each laser power, decoding individual sound pressure levels using non-VIP neuronal responses within 1-s fixed window, when the laser illuminates AC and stimulates VIP neurons. N, Average activity sparseness measured from baseline as a function of the sound pressure level for each population of neurons (VIP neurons excluded), when the VIP neurons are activated. There was a significant decrease in activity sparseness measured from baseline upon VIP activation (***plaser = 1.8 × 10−14; GLME). For all panels, black, pink, and red colors correspond to no laser power (0 mW/mm2), medium laser power (∼0.3 mW/mm2), and high laser power (∼3.5 mW/mm2), respectively (see Materials and Methods for power calibration).

Materials and Methods

Animals

We performed experiments in 14 adult mice (7 males and 7 females), which were crosses between Cdh23 mice (B6.CAST-Cdh23Ahl+/Kjn, JAX: 002756) and Sst-Cre mice (Ssttm2.1(cre)Zjh/J, JAX: 013044; n = 5 in experimental group) or Vip-IRES-Cre mice (Viptm1(cre)Zjh/J, JAX: 010908; n = 4 in experimental group, n = 5 in control group; Table 1). Mice had access to food and water ad libitum and were exposed to light/dark on a reversed 12 h cycle at 28°C. Experiments were performed during the animals’ dark cycle. Mice were housed individually after the cranial window implant. All experimental procedures were in accordance with NIH guidelines and approved by the Institutional Animal Care and Use Committee at the University of Pennsylvania.

Table 1.

Mouse strains and numbers

Experiment Figures Strain Number of mice Number of recordings
GCaMP7f + ChrimsonR Figures 1 5 CDH23 × VIP-Cre 2 7
CDH23 × SST-Cre 5 13
GCaMP6m + ChrimsonR Figures 1 5 CDH23 × VIP-Cre 2 9
Control: GCaMP7f + Flex.tdTomato – VIP cells Figure 1H CDH23 × VIP-Cre 4 7
Control: GCaMP7f + Flex.tdTomato – Non-VIP cells Figure 1I CDH23 × VIP-Cre 2 4

Surgery procedures

Mice were implanted with cranial windows over AC following a published procedure (Wood et al., 2022). Briefly, mice were anesthetized with 1.5–3% isoflurane, and the left side of the skull was exposed and perforated by a 3-mm biopsy punch over the left AC. We injected in that region 3 × 750 nl of an adeno-associated virus (AAV) mix of AAV1.Syn.GCaMP (6m: Addgene 100841, or 7f: Addgene 104488; dilution 1:10 ∼ 1 × 1013 GC/ml) and AAV1.Syn.Flex.Chrimson.tdTomato (UNC Vector Core; dilution 1:2 ∼ 2 × 1012 GC/ml). In the control mice, we injected a mix of AAV1.Syn.jGCaMP7f (Addgene 104488; dilution 1:10 ∼ 1 × 1013 GC/ml) and AAV1.Syn.Flex.tdTomato (Addgene 28306; dilution 1:100–1:20 ∼ 2 × 1011–1 × 1012 GC/ml) in VIP-Cre mice. We then sealed the craniotomy with a glass round window, attached a head plate to the mouse, and let it recover for 3–4 weeks. After habituating the mouse to being head fixed for 3 d, we mapped the sound-responsive areas of the brain and located AC using wide-field imaging and then performed two-photon imaging in AC (Fig. 2C).

Two-photon imaging

We imaged calcium activity in neurons in Layer 2/3 of AC of awake, head-fixed mice (VIP-Cre mice: 3,321 neurons over 16 recordings; SST-Cre mice: 2,284 neurons over 13 recordings) using the two-photon microscope (Ultima in vivo multiphoton microscope, Bruker) with a laser at 940 nm (Chameleon Ti-Sapphire). The fluorescence from the tissue went through a primary dichroic low pass (620 LP), through an IR Blocker (625 SP), through an emission dichroic long pass (565 LP) which separated the light in two beams. The shorter wavelengths went through an additional bandpass filter (525/70) before being captured by a PMT (“green channel”); the longer wavelengths went through a bandpass filter (595/50) before being captured by a PMT (“red channel”). This set up was used to minimize the contamination of the green channel by the optogenetic stimulus at 635 nm. There was nevertheless some bleedthrough during the optogenetic stimulus which was small enough not to saturate the green channel, and thus the activity of neurons could be recorded continuously without interruption during optogenetic stimulation. We checked there was no saturation for the highest laser power used by plotting the average grayscale profile over the dimension of the image perpendicular to the scanning, and verifying it was well below saturation. During a 5-ms laser pulse, while the whole field of view is illuminated by the laser, it appeared on the image only for the lines that were being scanned during the laser stimulus. These bands were identified using the average grayscale value of each image line, and the contaminated pixels inside these bands were removed before processing the recordings with Suite2p. We imaged a surface of 512 × 512 pixels2 at 30 Hz. If we recorded from a mouse several times, we changed the location or depth within Layer 2/3 of AC in order to not image the same neurons twice.

Optogenetic laser: power calibration

We first calibrated the laser power by measuring the curve of command voltage versus output power for the laser (Optoengine LLC, MRL-III-635-300 mW). The laser's peak frequency was 635 nm. Prior to every recording, we calibrated the laser power at the tissue level as follows: we used an empty cannula to reduce the power of the laser by a factor 10–15 and positioned the optical fiber on the objective so it would shine a spot of 1 mm diameter centered on the focal point of the objective. Thus, the calibrated power at the imaging plane was as follows: for the medium laser power: 0.3 ± 0.09 mW/mm2 [mean ± standard deviation (SD); n = 29; range: 0.14–0.47 mW/mm2], and for the high laser power: 3.4 ± 1.0 mW/mm2 (mean ± SD; n = 29; range, 1.6–5.3 mW/mm2).

Identification of interneurons being stimulated

We started each recording by taking a 2,600 frame video both in the green and the red channels (thus imaging GCaMP and tdTomato). As tdTomato is not dependent on the cell activity, any modulation in the signal in the red channel is due to bleedthrough from the GCaMP. We plotted for all cells the raw signal from the red channel versus the signal from the green channel and did a linear fit to extract the bleedthrough coefficient. We then subtracted the bleedthrough in the red signal and calculated the average fluorescence of the processed red signal for every cell. We then z-scored the signal of the red channel to the background fluorescence and selected the cells with a fluorescence higher than 2 σ (SD of the background) as the targeted interneurons. The percentage of cells labeled as VIP or SST interneurons with this criterion was consistent with the percentage of VIP or SST neurons expected within the cortex (Rudy et al., 2011).

Stimulus presentation

We presented combinations of sound and optogenetic stimuli. The auditory stimulus consisted in 1-s-long click trains of 25 ms pulses of broadband white noise (range 3−80 kHz) at 10 Hz, at seven sound pressure levels within 0−90 dB SPL (0; 30; 50; 60; 70; 80; 90 dB SPL). The optogenetic stimulus consisted in a 1-s long pulse train of 635 nm laser with 5-ms pulses at 20 Hz, at three amplitudes with no, medium, or high laser power (see power at the tissue level above, Optogenetic laser: power calibration). The two stimuli were presented simultaneously, with the optogenetic stimulus preceding the sound stimulus by 20 ms (Blackwell et al., 2020) for maximal optogenetic effect; the interstimulus interval was 5 s. All 21 combinations of sound and optogenetic stimuli were presented randomly and with 10 repeats per combination.

Analysis of single-cell activity: fixed time window

For each trial of a stimulus, the response as a function of time was defined as the change in fluorescence ΔF/FSD compared with the baseline fluorescence Fbaseline over the 1-s window preceding the stimulus: ΔF / FSD =[F − mean(Fbaseline)] / SD(Fbaseline), with F the fluorescence of the cell as a function of time. Fixed window: In order to compare how neuronal responses changed with laser stimulation, keeping all parameters similar besides that one, we defined the fixed window of neuronal response for each recording (one window for all stimulus combinations) as the 1-s window with the largest number of responsive neurons. The fixed time window was selected as the 1-s averaging window between [0−1 s] and [2−3 s] which maximized the number of cells with a significant response to at least one of the stimuli pairs for each recording compared with the prestimulus fluorescence (paired t-test, p < 0.01 with multiple-comparison correction). The delay between the stimulus onset and the beginning of the fixed window was, in SST-Cre mice: 385 ± 419 ms [mean ± SD; range (min–max): 0–1,350 ms; n = 13], and, in VIP-Cre mice: 336 ± 10 ms [mean ± SD; range (min–max): 180–480 ms; n = 16].

Our method for selecting the fixed window gives results similar to a fixed [0–1 s] window (Wood et al., 2022), with improvements which we believe have a better chance at capturing the effects of SST or VIP neuronal activation: by allowing for a delay in the beginning of the fixed window, our analysis leads to response-level curves that are closer to the maximum change in fluorescence for most cells.

Sparseness of a neuron’s response and activity sparseness of a population of neurons

To quantify how many stimuli a neuron responds to, we calculated the sparseness of each neuron adapted from Vinje and Gallant (2000) as follows:

S=111n(1(rin)2(ri2n)),

where ri is the average response of a neuron to the ith sound pressure level at a given laser activation calculated over its fixed time window minus the neuron's response to laser activation and silence and n is the number of sound pressure levels (response in silence excluded). Similar to how this measure is computed with the firing rate of excited neurons (Rolls and Tovee, 1995; Vinje and Gallant, 2000; Olsen and Wilson, 2008; Feigin et al., 2021), we adapted the sparseness measure to fluorescence data by setting any values of ri<0 to zero before calculating the sparseness and by calculating this measure only for neurons with at least one positive ri, for which the sparseness is well-defined. A sparseness value of 0% indicates that a neuron's responses to all sound pressure levels are equal, and a sparseness value of 100% indicates that a neuron only responds to one sound pressure level.

To quantify how many neurons in a population are active in response to a given stimulus, we calculated the activity sparseness of each population (Willmore and Tolhurst, 2001) at a given sound pressure level and laser power (Fig. 2E,L) by subtracting each neuron's response over its fixed window to its response at 0 dB at the same laser power and computing the ratio of neurons that had an increase in response above threshold, with the threshold set as described below. The activity sparseness from baseline was computed by taking each neuron's response over its fixed window and computing the ratio of neurons with an increase in response above threshold (Fig. 2G,N). The threshold was set as the SD of the population's response to no sound and no laser power. An activity sparseness value of 0% indicates that all neurons in a population are active, and a value of 100% indicates that none of the neurons are active.

Separation angle and vector length

To quantify whether mean population vectors were collinear in the neuronal space, we calculated the separation angle between mean population vectors adapted from Vinje and Gallant (2000). For each recording, we computed the mean population vectors over the fixed time window at each laser power from 0 dB to each non-zero sound pressure level (Fig. 3A). We then computed the angle between each pair of mean population vectors at a given laser power (Fig. 3B) and represented the mean ± standard error of the mean (SEM; Fig. 3D,E,GF) or the mean difference in separation angle from a given laser power to no laser power (Fig. 3F,I) across recordings for each sound pair. To quantify whether mean population responses were close in the neuronal space, we computed the vector length between mean population vectors (Fig. 3C). For each recording, we computed the mean population vectors over the fixed time window at each laser power between all pairs of sound pressure levels (Fig. 3A). We then computed the Euclidian norm of each mean population vector at a given laser power (Fig. 3C) and represented the mean ± SEM (Fig. 3J,K,M,N) or the mean difference in vector length from a given laser power to no laser power (Fig. 3L,O) across recordings for each sound pair.

Figure 3.

Figure 3.

Separation angle and vector length are differentially controlled by SST and VIP neuronal activation. A, Schematic representation of the population's response to sound and laser stimulation, for single trials (small dots) and the average response over all trials (large dots). The sound pressure level is represented by the shading of the dot and laser power by the outline of the dot. We simplify the representation of the population's response to two dimensions, keeping only the color code for the laser power. B, Low-dimensional schematic of the separation angle between mean population vectors to each sound pressure level at a given laser power, starting from 0 dB at each laser power. C, Low-dimensional schematic of the vector length of mean population vectors between sound pressure levels at a given laser power. D, Confusion matrix of the separation angle between pairs of sound stimuli for no (left), medium (middle), and high (right) laser power of SST neuronal activation. E, Separation angle between pairs of mean population vectors to different sound pressure levels as a function of the difference in the sound pressure level for no, medium, and high laser powers of SST neuronal activation (circles). The dotted lines are the result from the GLME fit at the different laser powers of SST neuronal activation. F, Confusion matrix of the difference in separation angle between pairs of sound stimuli from medium (left) or high (right) laser power to no laser power of SST neuronal activation. G, Confusion matrix of the separation angle between pairs of sound stimuli for no (left), medium (middle), and high (right) laser power of VIP neuronal activation. H, Separation angle between pairs of mean population vectors to different sound pressure levels as a function of the difference in the sound level for no, medium, and high laser powers of VIP neuronal activation (circles). The dotted lines are the result from the GLME fit at the different laser powers of VIP neuronal activation. I, Confusion matrix of the difference in separation angle between pairs of sound stimuli from medium (left) or high (right) laser power to no laser power of VIP neuronal activation. J, Confusion matrix of the vector length between pairs of sound stimuli for no (left), medium (middle), and high (right) laser power of SST neuronal activation. K, Length of the mean population vector between pairs of sound pressure levels as a function of the difference in the sound pressure level for no, medium, and high laser powers of SST neuronal activation (circles). The dotted lines are the result from the GLME fit at the different laser powers of SST neuronal activation. L, Confusion matrix of the difference in vector length between pairs of sound stimuli from medium (left) or high (right) laser power to no laser power of SST neuronal activation. M, Confusion matrix of the vector length between pairs of sound stimuli for no (left), medium (middle), and high (right) laser power of VIP neuronal activation. N, Length of the mean population vector between pairs of sound pressure levels as a function of the difference in the sound pressure level for no, medium, and high laser powers of VIP neuronal activation (circles). The dotted lines are the result from the GLME fit at the different laser powers of VIP neuronal activation. O, Confusion matrix of the difference in vector length between pairs of sound stimuli from medium (left) or high (right) laser power to no laser power of VIP neuronal activation. For all panels, black, pink, and red colors correspond to no laser power (0 mW/mm2), medium laser power (∼0.3 mW/mm2), and high laser power (∼3.5 mW/mm2), respectively (see Materials and Methods for power calibration), and the mean population vectors were computed over the fixed time window.

Fitting of response-level curves

Mean response curves and the SEM for every neuron were determined by averaging over the fixed-time window its responses to all 10 trials of each sound pressure level. Thus, for a given cell, we constructed three response curves, one for every light condition.

To characterize responses as monotonic or nonmonotonic, we first normalized the response curves such that abs[max(response)] is 1 and computed the monotonicity index (MI). This metric refers to the relative responses at higher stimulus levels (Watkins and Barbour, 2011) and was calculated from the mean curve as follows:

MI=|responsemaxlevelresponsespontaneous||max(response)min(response)|,(1)

where responsemaxlevel is the response to 90 dB which is the highest level of sound presented and responsespontaneous is the spontaneous response measured at 0 dB. A response curve was classified as nonmonotonic if its MI was <0.3 and monotonic if it was >0.7. We refrain from a hard cutoff at 0.5 since preliminary analysis of the response curves indicated that due to stochasticity both monotonic and nonmonotonic curves may have MI values between 0.3 and 0.7. Furthermore, note that a given cell could change its monotonicity in the presence of optogenetic stimulation.

After determining the monotonicity of the neuronal response, we fitted the monotonic and nonmonotonic curves with a four-parameter sigmoid function and a four-parameter Gaussian function, respectively. The sigmoid function is given by the following equation:

y=y0+yrange1+ex0soundlevelΔx,(2)

while the Gaussian function can be written as follows:

y=y0+yrange*e(soundlevelxmean)22*σ2.(3)

y refers to the response curve, y0 is the offset response, yrange is its range in amplitude, x0 is the x value of the sigmoid midpoint, and Δx denotes the width of the sigmoid. In the Gaussian, the parameter y0 is the offset response. The amplitude, mean, and SD of the Gaussian are denoted by yrange, xmean, and σ, respectively, and have their regular interpretations. During the fitting procedure, we minimize (1 − McFadden pseudo R2) using the Powell optimizer (scipy.minimize.optimize in Python). The formula for this error value is as follows:

1R2=lnL^(Mfull)lnL^(Mintercept).(4)

Assuming L^(Mfull) is Gaussian with the experimentally computed response average as its mean value and the response SEM as its SD, we can rewrite the formula as follows:

1R2=(mean(response)response)2sem2(yresponse)2sem2.(5)

We chose the McFadden R2 since it allows us to account for different values of SEM at the different intensities. The regular R2 equation constrains the SEM values to be equal at all intensities. A cell's response curve is considered well fit by its respective function if the Mcfadden R2 is >0.8. Due to the nonlinear nature of our optimization we chose 16 random starting points for the optimizer and cells fitted using two or more of the starting conditions were characterized using the fitting curve which had the highest R2 value. For neurons whose mean response curve MI lay between 0.3 and 0.7, we follow a similar procedure but fit the curve with both the sigmoid and Gaussian functions to find the better fitting function. Furthermore, we constrain the mean of all our Gaussian fits to lie between 10 and 80 dB. Response curves with means <10 dB or >80 dB were recharacterized using the sigmoid function since only one sound pressure level data point (0 or 90 dB) is insufficient to adequately distinguish if the cell is monotonic or nonmonotonic. Roughly ∼5% of the total response curves (combined across all three light conditions) were refitted in this manner. Lastly, we tested if the fitting curve was overfitted to the empirical sound intensities by calculating a new variable—interpolated error. The interpolated sigmoid/Gaussian curve is constructed by interpolating the fitted curve at the intermediate sound pressure levels (15, 40, 55, 65, 75, 85 dB). The interpolated error is the regular R2 value evaluated using the following equation:

R2=1(yinterpolatedlevelsinterpolatedfittedcurve)2(interpolatedfittedcurvemean(interpolatedfittedcurve))2.(6)

yinterpolatedlevels refers to the Gaussian/sigmoid equations (2, 3) computed at the sound pressure levels 0, 15, 30, 40, 50, 55, 60, 65, 70, 75, 80, 85, and 90 dB. When computing statistics on different parameters, we remove neurons with interpolated error <0.25 for both the Sigmoid and Gaussian fits. This threshold allows us to select at least 90% of the fitted curves.

Decoding the sound pressure level using an SVM decoder

We linearly decoded the seven different sound pressure levels at each optostimulated condition using an SVM decoder with a linear kernel. Specifically, we decoded each individual pressure level versus the remaining six. The input to the SVM consisted of the fixed time-window responses of all neurons in the population.

To individually decode each of the seven amplitudes, for every given experimental dataset, we projected the average responses of all neurons derived using a fixed window onto a lower-dimensional space using PCA. The lower-dimensional space had (n) dimensions such that these dimensions accounted for 70% of the variance in the dataset.

Next, because at a given optostimulation condition we had 10 trials per sound pressure level, the input data to the SVM decoder were unbalanced as 10:60. We balanced the dataset by oversampling the sound pressure level of interest, i.e., if we were decoding the 0 dB stimulus from the rest, we oversampled to construct 60 trials for the 0 dB stimulus. Specifically, we constructed these 60 trials by fitting the 10 experimentally obtained 0 dB trials using a Gaussian kernel and sampling the corresponding distribution. The resulting oversampled dataset comprising 120 trials total was input into the two-class SVM decoder which was trained using 10-fold cross-validation. Because we were not tuning the hyperparameters of the SVM, we did not have separate validation and test sets; rather we computed the model's accuracy directly using the validation set which was chosen randomly for each of the 10 iterations. Figure 2, F and M, illustrates the decoding accuracies for all three conditions of optostimulation of SST and VIP interneurons.

Statistics

All responses are plotted as mean ± SEM with the number of measurements above the figure. We tested the significance with a generalized linear mixed-effect (GLME) model with the MATLAB function fitglme, using laser power, sound pressure level, and the interaction term between laser power and sound pressure level as fixed-effect terms and cell identity or session number as grouping variables. All results from the statistical analyses are reported in Table 2.

Table 2.

Statistics table

Comparison Figure N Test Test statistic p value Effect size
Figure 1
SST neuron with SST neuronal activation Figure 1D 10 repeats GLME tlaser = 7.89 ***plaser = 1.8 × 10−13 ηlaser2 = 0.58
tsound = 0.34 psound = 0.74 ηsound2 = 3.8 × 10−3
tlaser:sound = −0.55 plaser:sound = 0.58 ηlaser:sound2 = 1.5 × 10−2
df = 206
Sound-increasing neuron with SST neuronal activation Figure 1E 10 repeats GLME tlaser = 0.33 plaser = 0.74 ηlaser2 = 2.3 × 10−3
tsound = 12.37 ***psound = 1.2 × 10−26 ηsound2 = 0.84
tlaser:sound = −8.34 ***plaser:sound = 1.0 × 10−14 ηlaser:sound2 = 0.78
df = 206
VIP neuron with VIP neuronal activation Figure 1F 10 repeats GLME tlaser = 5.40 ***plaser = 1.8 × 10−7 ηlaser2 = 0.39
tsound = 0.93 psound = 0.35 ηsound2 = 2.8 × 10−2
tlaser:sound = −2.56 *plaser:sound = 1.1 × 10−2 ηlaser:sound2 = 0.25
df = 206
Sound-increasing neuron with VIP neuronal activation Figure 1G 10 repeats GLME tlaser = 2.45 *plaser = 1.5 × 10−2 ηlaser2 = 0.12
tsound = 3.06 **psound = 2.5 × 10−3 ηsound2 = 0.24
tlaser:sound = 1.11 plaser:sound = 0.27 ηlaser:sound2 = 5.8 × 10−2
df = 206
Control: VIP neurons with laser activation Figure 1H 54 cells GLME tlaser = 1.27 plaser = 0.20 ηlaser2 = 6.5 × 10−4
tsound = 2.99 **psound= 2.8 × 10−3 ηsound2 = 5.5 × 10−3
tlaser:sound = −0.11 plaser:sound = 0.91 ηlaser:sound2 = 1.1 × 10−5
df = 11,336
Control: All neurons (VIP excluded) with laser activation Figure 1I 492 cells GLME tlaser = 0.46 plaser = 0.64 ηlaser2 = 8.5 × 10−6
tsound = 12.23 ***psound = 2.1 × 10−34 ηsound2 = 9.0 × 10−3
tlaser:sound = 3.27 **plaser:sound = 1.1 × 10−3 ηlaser:sound2 = 9.8 × 10−4
df = 103,316
Figure 2
SST neurons with SST neuronal activation Figure 2B 132 cells GLME tlaser = 36.91 ***plaser = 3.1 × 10−291 ηlaser2 = 0.14
tsound = 1.32 psound = 0.19 ηsound2 = 3.2 × 10−4
tlaser:sound = 0.16 plaser:sound = 0.88 ηlaser:sound2 = 6.7 × 10−6
df = 27,716
All non-SST neurons with SST neuronal activation Figure 2C 2,152 cells GLME tlaser = −1.27 plaser = 0.20 ηlaser2 = 1.5 × 10−5
tsound = 10.75 ***psound = 5.9 × 10−27 ηsound2 = 1.6 × 10−3
tlaser:sound = −6.35 ***plaser:sound = 2.2 × 10−10 ηlaser:sound2 = 8.5 × 10−4
df = 451,916
Sparseness with SST neuronal activation Figure 2D None: 2,033 GLME tlaser = 5.21 ***plaser = 2.0 × 10−7 ηlaser2 = 4.5 × 10−3
Med: 2,029
High: 1,994 df = 5,702
Activity sparseness with SST neuronal activation Figure 2E 13 populations GLME tlaser = 3.20 **plaser = 1.6 × 10−3 ηlaser2 = 0.22
tsound = 0.87 psound = 0.38 ηsound2 = 1.3 × 10−2
tlaser:sound = −0.79 plaser:sound = 0.43 ηlaser:sound2 = 2.5 × 10−2
df = 230
SST: Decoding accuracy of a linear SVM decoder with SST neuronal activation Figure 2F 13 populations GLME tlaser = −3.92 ***plaser = 1.1 × 10−4 ηlaser2 = 0.19
tsound = −2.84 **psound = 4.9 × 10−3 ηsound2 = 0.16
tlaser:sound = 1.92 plaser:sound = 5.6 × 10−2 ηlaser:sound2 = 0.11
df = 269
Activity sparseness from baseline with SST neuronal activation Figure 2G 13 populations GLME tlaser = −0.89 plaser = 0.37 ηlaser2 = 6.1 × 10−3
tsound = −3.28 **psound = 1.2 × 10−3 ηsound2 = 0.11
tlaser:sound = 2.01 *plaser:sound = 4.5 × 10−2 ηlaser:sound2 = 6.7 × 10−2
df = 269
VIP neurons with VIP neuronal activation Figure 2I 226 cells GLME tlaser = 41.50 ***plaser = 0 ηlaser2 = 0.12
tsound = 2.71 **psound = 6.7 × 10−3 ηsound2 = 9.3 × 10−4
tlaser:sound = −1.96 *plaser:sound = 4.95 × 10−2 ηlaser:sound2 = 7.3 × 10−4
df = 47,456
All non-VIP neurons with VIP neuronal activation Figure 2J 3,095 cells GLME tlaser = 34.18 ***plaser = 7.8 × 10−256 ηlaser2 = 6.0 × 10−3
tsound = 11.32 ***psound = 1.1 × 10−29 ηsound2 = 1.0 × 10−3
tlaser:sound = 8.41 ***plaser:sound = 4.1 × 10−17 ηlaser:sound2 = 8.4 × 10−4
df = 649,946
Sparseness with VIP neuronal activation Figure 2K None: 2,979 GLME tlaser = −2.78 **plaser = 5.5 × 10−3 ηlaser2 = 7.0 × 10−4
Med: 2,921
High: 3,020 df = 8,350
Activity sparseness with VIP neuronal activation Figure 2L 16 populations GLME tlaser = −1.46 plaser = 0.15 ηlaser2 = 2.8 × 10−2
tsound = −1.18 psound = 0.24 ηsound2 = 1.2 × 10−2
tlaser:sound = 0.17 plaser:sound = 0.86 ηlaser:sound2 = 6.4 × 10−4
df = 284
VIP: Decoding accuracy of a linear SVM decoder with VIP neuronal activation Figure 2M 13 populations GLME tlaser = −0.69 plaser = 0.49 ηlaser2 = 4.3 × 10−3
tsound = −3.58 ***psound = 4.0 × 10−4 ηsound2 = 0.15
tlaser:sound = −0.11 plaser:sound = 0.91 ηlaser:sound2 = 2.7 × 10−4
df = 332
Activity sparseness from baseline with VIP neuronal activation Figure 2N 16 populations GLME tlaser = −8.02 ***plaser = 1.8 × 10−14 ηlaser2 = 0.24
tsound = −4.01 ***psound = 7.5 × 10−5 ηsound2 = 0.11
tlaser:sound = 0.76 plaser:sound = 0.45 ηlaser:sound2 = 6.4 × 10−3
df = 332
Figure 3
Separation angle from 0 dB at each laser power—SST neuronal activation Figure 3E 15 angles, 13 recordings GLME tlaser = 8.80 ***plaser = 1.6 × 10−17 ηlaser2 = 0.30
tΔsound = 12.37 ***pΔsound = 2.3 × 10−31 ηΔsound2 = 0.58
tlaser:Δsound = −7.44 ***plaser:Δsound = 3.6 × 10−13 ηlaser:Δsound2 = 0.43
df = 581
Separation angle from 0 dB at each laser power—VIP neuronal activation Figure 3H 15 angles, 16 recordings GLME tlaser = −2.75 **plaser = 6.1 × 10−3 ηlaser2 = 3.0 × 10−2
tΔsound = 7.32 ***pΔsound = 6.9 × 10−13 ηΔsound2 = 0.26
tlaser:Δsound = 0.73 plaser:Δsound = 0.47 ηlaser:Δsound2 = 5.2 × 10−3
df = 716
Vector length at each laser power—SST neuronal activation Figure 3K 21 lengths, 13 recordings GLME tlaser = −4.71 ***plaser = 2.9 × 10−6 ηlaser2 = 6.1 × 10−2
tΔsound = 5.71 ***pΔsound = 1.6 × 10−8 ηΔsound2 = 0.16
tlaser:Δsound = −3.57 ***plaser:Δsound = 3.8 × 10−4 ηlaser:Δsound2 = 9.1 ×10−2
df = 815
Vector length at each laser power—VIP neuronal activation Figure 3N 21 lengths, 16 recordings GLME tlaser = 2.50 *plaser = 1.2 × 10−2 ηlaser2 = 9.7 × 10−3
tΔsound = 4.03 ***pΔsound = 6.1 × 10−5 ηΔsound2 = 4.8 × 10−2
tlaser:Δsound = 4.84 ***plaser:Δsound = 1.5 × 10−6 ηlaser:Δsound2 = 9.0 × 10−2
df = 1,004
Figure 4
Sigmoid fit—offset—with SST neuronal activation Figure 4D None: 109 GLME tlaser = 0.83 plaser = 0.41 ηlaser2 = 2.4 × 10−3
Med: 103
High: 64 df = 274
Sigmoid fit—offset—with VIP neuronal activation Figure 4D None: 267 GLME tlaser = 1.74 plaser = 8.1 × 10−2 ηlaser2 = 3.6 × 10−3
Med: 239
High: 269 df = 773
Sigmoid fit—range—with SST neuronal activation Figure 4E None: 109 GLME tlaser = −1.24 plaser = 0.22 ηlaser2 = 3.1 × 10−3
Med: 103
High: 64 df = 274
Sigmoid fit—range—with VIP neuronal activation Figure 4E None: 267 GLME tlaser = 3.11 **plaser = 1.9 × 10−3 ηlaser2 = 9.4 × 10−3
Med: 239
High: 269 df = 773
Sigmoid fit—midpoint—with SST neuronal activation Figure 4F None: 109 GLME tlaser = 2.65 **plaser = 8.6 × 10−3 ηlaser2 = 2.1 × 10−2
Med: 103
High: 64 df = 274
Sigmoid fit—midpoint—with VIP neuronal activation Figure 4F None: 267 GLME tlaser = −0.88 plaser = 0.38 ηlaser2 = 5.9 × 10−4
Med: 239
High: 269 df = 773
Sigmoid fit—width—with SST neuronal activation Figure 4G None: 109 GLME tlaser = −0.019 plaser = 0.99 ηlaser2 = 8.8 × 10−7
Med: 103
High: 64 df = 274
Sigmoid fit—width—with VIP neuronal activation Figure 4G None: 267 GLME tlaser = 0.56 plaser = 0.57 ηlaser2 = 2.2 × 10−4
Med: 239
High: 269 df = 773
Figure 5
Gaussian fit—offset—with SST neuronal activation Figure 5D None: 224 GLME tlaser = −3.16 **plaser = 1.7 × 10−3 ηlaser2 = 1.8 × 10−2
Med: 175
High: 130 df = 527
Gaussian fit—offset—with VIP neuronal activation Figure 5D None: 243 GLME tlaser = 0.71 plaser = 0.48 ηlaser2 = 5.3 × 10−4
Med: 278
High: 310 df = 829
Gaussian fit—range—with SST neuronal activation Figure 5E None: 224 GLME tlaser = −5.41 ***plaser = 9.4 × 10−8 ηlaser2 = 3.8 × 10−2
Med: 175
High: 130 df = 527
Gaussian fit—range—with VIP neuronal activation Figure 5E None: 243 GLME tlaser = 5.60 ***plaser = 2.9 × 10−8 ηlaser2 = 1.7 × 10−2
Med: 278
High: 310 df = 829
Gaussian fit—mean—with SST neuronal activation Figure 5F None: 224 GLME tlaser = 0.35 plaser = 0.73 ηlaser2 = 1.9 × 10−4
Med: 175
High: 130 df = 527
Gaussian fit—mean—with VIP neuronal activation Figure 5F None: 243 GLME tlaser = 3.34 ***plaser = 8.6 × 10−4 ηlaser2 = 7.4 × 10−3
Med: 278
High: 310 df = 829
Gaussian fit—SD—with SST neuronal activation Figure 5G None: 224 GLME tlaser = −0.63 plaser = 0.53 ηlaser2 = 7.3 × 10−4
Med: 175
High: 130 df = 527
Gaussian fit—SD—with VIP neuronal activation Figure 5G None: 243 GLME tlaser = 3.96 ***plaser = 8.1 × 10−5 ηlaser2 = 1.7 × 10−2
Med: 278
High: 310 df = 829

We used a GLME model to compute the statistics for the data.

For Figures 1 and 2, B, C, E, G, I, J, L, and N; the data (“table”) had four columns: cell, sound level, laser power, and output. The formula used was (MATLAB): glme=fitglme(table,‘outputsound+laser+sound*laser+(1|cell)’);

For Figures 3, D and K, 4, and 5, the data (“table”) had three columns: cell, laser power, and output. The formula used was (MATLAB): glme=fitglme(table,‘outputlaser+(1|cell)’); For Figure 3, E, H, K, and N, the data (“table”) had four columns: cell, sound level difference, laser power, and output. The formula used was (MATLAB): glme=fitglme(table,‘outputsounddiff+laser+sounddiff*laser+(1|cell)’);

For Figure 2, F and M, the data (“table”) had four columns: population, sound level, laser power, and output. The formula used was (MATLAB): glme = fitglme(table,‘output ∼ sound + laser + sound*laser+(1|population)’).

Data availability

The data and code are available on the dryad depository: https://doi.org/10.5061/dryad.t1g1jwt6d (Tobin et al., 2024 link for reviewers, https://datadryad.org/stash/share/wk9BuKO0A8ZzqEfsrAUtd__bHqPOe2pJ5P8WExEOpY0.)

Results

SST and VIP neuronal activation modulate the response of sound-increasing neurons

To investigate whether and how distinct classes of inhibitory neurons in AC, SST and VIP neurons, affect sound representation at a population level, we imaged calcium activity of neurons in AC of awake, head-fixed mice presented with sounds while activating SST or VIP neurons using submillisecond optogenetic manipulation with Chrimson (Fig. 1A; Klapoetke et al., 2014). We monitored calcium activity by measuring fluorescence of GCaMP expressed in hundreds of neurons at a time (VIP-Cre mice: 3,321 neurons over 16 recordings; SST-Cre mice: 2,284 neurons over 13 recordings) and identified the cells expressing the opsin through co-expression of tdTomato (Fig. 1B,C). This approach allowed us to quantify the transformations of sound representations within a large population of cortical neurons driven by SST or VIP neuronal activation.

We first confirmed that the optogenetic manipulations produced expected responses. SST neurons directly inhibit excitatory cells and other cells within the neuronal population, and so we expected that optogenetic manipulation would increase SST neuronal activity but decrease the responses of other cells. By contrast, VIP neurons mostly inhibit other inhibitory neurons (Campagnola et al., 2022), and therefore we expected the VIP neuronal activation would increase both VIP neuronal activity and provide a release of inhibition to other cells in the network. In SST-Cre mice, a representative SST neuron increased activity at all sound pressure levels with laser (example neuron; ***plaser = 1.8 × 10−13; GLME; Fig. 1D). The change in the response of a representative non-SST neuron to SST neuronal activation was sound level-dependent, with a decrease at most sound pressure levels for the medium laser power. Sound responses were abolished with strong SST neuronal activation at the high laser power (plaser = 0.74; ***plaser:sound = 1.0 × 10−14; GLME; Fig. 1E). In VIP-Cre mice, the response of a representative VIP neuron increased at all sound pressure levels with laser, and the increase was sound level-dependent (example neuron; ***plaser = 1.8 × 10−7; *plaser:sound = 1.1 × 10−2; GLME; Fig. 1F). The activity of a representative non-VIP neuron increased at most sound pressure levels during activation of VIP neurons, with a larger increase at the high than medium laser power (*plaser = 1.5 × 10−2; GLME; Fig. 1G). As a control, we injected mice with Flex.tdTomato instead of the opsin in VIP-Cre × Cdh23 mice (n = 5) and verified that laser stimulation of AC in the absence of the opsin did not lead to significant changes in the neuronal responses of VIP neurons (plaser = 0.20; GLME; Fig. 1H) and non-VIP neurons (plaser = 0.64; GLME; Fig. 1I). These representative effects of SST or VIP neuronal activation are consistent with previous reports, suggesting that the activation method worked as expected (Natan et al., 2015; Seybold et al., 2015; Phillips and Hasenstaub, 2016; Bigelow et al., 2019).

Differential distribution of population activity and sparseness with SST and VIP neuronal activation

We next tested whether and how SST and VIP neuronal activation differentially affects the sound pressure level response functions of neurons in AC. We characterized the response of each population to any stimulus combination over its fixed time window (Fig. 2AC,HJ; see Materials and Methods).

Because SST neurons directly inhibit excitatory cells, we hypothesized that SST neuronal activation would lead to fewer neurons responding at increasing sound pressure levels. At baseline, SST neurons have on average lower responses than the non-SST neurons, with the difference in response strongest at low sound pressure levels (Fig. 2B,C). During the SST neuronal activation, the average response of SST neurons increased at all sound pressure levels (n = 132 SST neurons; ***plaser < 1 × 10−100; GLME; Fig. 2A; Fig. 2B, solid lines), whereas non-SST neurons exhibited a mix of decreased and increased responses (Fig. 2A). At medium and high laser, the overall shape of the average response-level curve is preserved for SST neurons (plaser:sound = 0.88; GLME; Fig. 2B). The overall effect of SST neuronal activation on the population of non-SST neurons had a significant interaction between sound and laser amplitude (dashed lines, Fig. 2C), but no significant sound-independent laser effect (solid and dashed lines; n = 2,152 neurons; plaser = 0.20; ***plaser:sound = 2.2 × 10−10; GLME; Fig. 2C). The average response-level curve for non-SST neurons shifted downward for the medium laser power, and at a high laser power, the modulation of the population's response by sound was lost with an average response to silence equal to the average response to sounds.

To further assess the representation of the sound pressure level in the neuronal population, we studied two characteristics of sparse distributed representations: (1) each neuron responds only to a few stimuli (high sparseness), and (2) only a few neurons respond to each stimulus (high activity sparseness). To measure how many stimuli a neuron responds to, we computed the sparseness of each non-SST neuron (see Materials and Methods). The sparseness of non-SST neurons with a positive sound response increased significantly from 62 to 67% upon SST neuronal activation (median; n = 2033, 2029, and 1,994 neurons for no, medium, and high laser power, respectively; ***plaser = 2.0 × 10−7; GLME; Fig. 2D), indicating that neurons responded to fewer stimuli. To measure how many neurons responded to each stimulus, we computed the activity sparseness for each population of non-SST neurons, which is the ratio of neurons that are not active in response to a given stimulus. The activity sparseness compared with silence at a given laser power increased significantly upon SST neuronal activation (n = 13 populations; **plaser = 1.6 × 10−3; GLME; Fig. 2E), indicating that at each successive sound pressure level, there were fewer non-SST neurons that were active. Compared to baseline, the activity sparseness showed a significant interaction between sound and laser amplitude but no sound-independent laser effect upon SST neuronal activation (n = 13 populations; plaser = 0.37; *plaser:sound = 4.5 × 10−2; GLME; Fig. 2G). However, whereas decoding accuracy differed across different sound pressure levels and was slightly lower with SST neuronal activation, there was no interaction between the laser and sound amplitudes (n = 13 populations; **psound = 4.9 × 10−3; ***plaser = 1.1 × 10−4; plaser:sound = 0.056; GLME; Fig. 2F). This suggests that despite the neuronal population responses becoming relatively sparser across sound pressure levels, the relative decoding accuracy was preserved across SPLs. Combined with the change in sound modulation of the population's response, these results point to a more localist representation for sound pressure level with SST neuronal activation.

Because VIP neuron activity provides a release of SST neuron inhibition on excitatory cells, we hypothesized that VIP neuronal activation would lead to more neurons being active in response to each sound level and having stronger responses. At the baseline, VIP neurons have on average lower responses than the non-VIP neurons, with the difference being smallest at 30 dB (Fig. 2I,J). When VIP neurons were activated, the average response of VIP neurons increased at all sound pressure levels (n = 226 VIP neurons; ***plaser < 1 × 10−100; GLME; Fig. 2H; Fig. 2I, solid lines). At high laser, the sound modulation weakens in VIP neurons (*plaser:sound = 4.95 × 10−2; GLME; Fig. 2I). Non-VIP neurons similarly exhibited an increase in response, both in silence and to sounds at different sound pressure levels (Fig. 2H). The average response-level curve for non-VIP neurons shifted upward at all sound pressure levels as VIP neurons were activated (solid and dashed lines; n = 3,095 neurons; ***plaser < 1 × 10−100; GLME; Fig. 2J), reflective of a more distributed stimulus representation. At all laser powers, the modulation of the population's response by sound was maintained: the population average to sounds was higher than to silence (Fig. 2J, dashed lines).

Neuronal sparseness decreased with VIP neuronal activation for all non-VIP neurons with a positive sound response, with a significant decrease from 54 to 52% upon VIP neuronal activation (median; n = 2,979; 2,921, and 3,020 neurons for no, medium, and high laser power, respectively; **plaser = 5.5 × 10−3; GLME; Fig. 2K), indicating that each neuron responded more equally to the different sound pressure levels. The activity sparseness measured from silence at a given laser power did not change upon VIP neuronal activation (n = 16 populations; plaser = 0.15; GLME; Fig. 2L), indicating that the same number of neurons showed an increase in response at each sound pressure level from silence at a given laser power. Consistent with the overall increase in responses with VIP neuronal activation in silence, the activity sparseness measured from the baseline significantly decreased (n = 16 populations; ***plaser = 1.8 × 10−14; GLME; Fig. 2N). Importantly, this change in population activity did not affect the decoding performance (n = 13 populations; psound = 4.0 × 10−4; plaser = 0.49; plaser:sound=0.91; GLME; Fig. 2M). Combined, these results suggest that activating VIP neurons transforms population responses to a more distributed representation while preserving the decoding accuracy.

Overall, SST neuronal activation led to weaker and sparser responses in the population, with neurons responding to fewer stimuli and each stimulus eliciting a response in fewer neurons, shifting the population responses toward a more localist stimulus representation. By contrast, VIP neuronal activation leads to a global increase in the neuronal population's response, along with each neuron responding to more stimuli, leading to a more distributed stimulus representation.

The sound pressure level is represented more discretely or continuously in the neuronal population with SST or VIP neuronal activation, respectively

There are various ways that a neuronal network can implement a representation of a sensory feature. For example, a distributed code may rely on the magnitude of response of the population of neurons or on the relative response of each cell. To investigate this, we examined next the properties of the representation of sound pressure level upon SST and VIP neuronal activation in the neuronal space. We computed the mean population vector from 0 dB at a given laser amplitude to each nonzero sound pressure level at that laser amplitude (Fig. 3A) and computed the separation angle between pairs of mean population vectors at the same laser power (Fig. 3B), as well as the length of mean population vectors between pairs of sounds (Fig. 3C).

The separation angle (Vinje and Gallant, 2000) computes the angle between the mean population vectors to two different sound pressure levels taking 0 dB at each laser power as the origin (Fig. 3B). A smaller angle indicates the population vectors are more collinear, meaning similar neurons respond to the different stimuli, although perhaps with differing magnitude. A larger angle indicates that there is less overlap between the populations of neurons responding to each stimulus.

When there is no interneuron activation, the separation angle increased with the difference in the sound pressure level (Fig. 3E,H, black circles), indicating that there is less overlap in the groups of neurons responding to sounds with a large difference in the sound pressure level than a small difference in the sound pressure level. Upon SST neuronal activation, the curve flattened ∼60°, meaning that there was an increase in the separation angle for small differences in the sound pressure level and a decrease in the separation angle for large differences in the sound pressure level (dotted lines correspond to GLME estimates; ***plaser = 1.6 × 10−17; ***pΔsound = 2.3 × 10−31; ***plaser:Δsound = 3.6 × 10−13; GLME; Fig. 3D,E). On average, the difference in the separation angle between sound pairs from medium or high laser to no laser was positive, at 3.7 ± 1.7° for the high laser power (mean ± SEM; 15 angles; Fig. 3F). This indicates that the population vectors to the different tested sound pressure levels were more equally distributed in the neuronal space. There is, however, still an overlap between groups of neurons responding to different sound pressure levels, as a fully orthogonal coding of the sound pressure level would lead to a 90° separation angle. In contrast, upon VIP neuronal activation, the separation angle decreased equally for all differences in the sound pressure level (dotted lines correspond to GLME estimates; **plaser = 6.1 × 10−3; ***pΔsound = 6.9 × 10−13; plaser:Δsound = 0.47; GLME; Fig. 3G,H). On average, the difference in the separation angle between sound pairs from medium or high laser to no laser was negative, at −4.6 ± 0.7° at the high laser power (mean ± SEM; 15 angles; Fig. 3I). This indicates that the population vectors to the different tested sound pressure levels were more collinear, with more overlap between groups of neurons responding to different sound pressure levels with VIP neuronal activation.

The vector length computes the Euclidian norm of the mean population vector between two sound pressure levels at a given laser power (Fig. 3C). A small length indicates that the responses to two different stimuli are close in the neuronal space, either due to small magnitudes of response, to small differences in separation angle, or both, whereas a large length indicates that there is a large difference in the magnitude, the separation angle, or both. Therefore, we tested whether SST and VIP neuronal activation differentially affected the vector length.

Upon SST neuronal activation, the vector length decreased for all sound pressure level differences to ∼2 a.u. at high laser power, along with a decrease in the slope of the GLME estimate by 81% at high laser power (dotted lines correspond to GLME estimates; ***plaser = 2.9 × 10−6; ***pΔsound = 1.6 × 10−8; ***plaser:Δsound = 3.8 × 10−4; GLME; Fig. 3J,K). The average change in length from medium or high laser power to no laser was negative, at −1.00 ± 0.10 a.u. for the high laser power (mean ± SEM; 21 lengths; Fig. 3L). Upon VIP neuronal activation, the vector length increased for all sound pressure level differences, and the slope of the GLME estimate also increased by 72% at high laser power (dotted lines correspond to GLME estimates; *plaser = 1.2 × 10−2; ***pΔsound = 6.1 × 10−5; ***plaser:Δsound = 1.5 × 10−6; GLME; Fig. 3M,N). The average change in length from medium or high laser power to no laser was positive, at 1.27 ± 0.12 a.u. for the high laser power (mean ± SEM; 21 lengths; Fig. 3O).

Combined with the differences in the separation angle, these results point to emergence of two types of modulations of population codes: upon SST neuronal activation, the encoding of the sound pressure level resembles a localist pattern coding where the magnitude of response is less relevant than the identity of responding cells: the strength of response is reduced and similar for all sound pressure levels, but the population vector angles are more spread out in neuronal space, indicating that there is less overlap between groups of neurons responding to different sound pressure levels. Upon VIP neuronal activation, the encoding of the sound pressure level resembles a rate code, which is a type of distributed representation in which the varying strength of the whole population encodes a continuously varying parameter of the stimulus. Therefore, VIP neuronal activation promotes the strength of response more than the identity of responding neurons: there is more overlap between the groups of neurons responding to different sound pressure levels, but the strength of response is increased.

Response-level curves of sound-modulated cells exhibit a narrower response upon SST neuronal activation and a broader response upon VIP neuronal activation

We next tested how the shifts in stimulus representation mediated by SST and VIP neurons at the scale of the neuronal population are implemented at the single-cell level by analyzing the changes with SST or VIP neuronal activation in response-level curves of single neurons responding positively to sound. Some AC neurons exhibit increased responses with increased sound pressure levels (monotonic response-level curve), while others are tuned with a peak response to a specific sound pressure level (nonmonotonic response-level curve; Schreiner et al., 1992; Phillips et al., 1995; Wu et al., 2006). We classified cells depending on their MI (see Materials and Methods) and fit response functions of monotonically responding cells with a sigmoid function (Fig. 4; see Materials and Methods) and those of nonmonotonically responding cells with a Gaussian function (Fig. 5; see Materials and Methods). We then tested how the parameters of the fits change with interneuron activation.

Figure 4.

Figure 4.

Single-cell fits of response-level curves for monotonic cells. A, Example neuron with a monotonic response-level curve (solid black line) with a sigmoid fit estimated at the probed sound pressure levels (blue dashed line with circles) and the sigmoid function with the same parameters (dotted black line). The parameters for the sigmoid fit are as follows: offset amplitude y0 = 0.1; range yrange = 2.4; midpoint x0 = 60 dB; width Δx = 5 dB. B, Example of changes to the response-level curve of a sound-increasing monotonic neuron upon SST neuronal activation. Response-level curves (solid lines) are fit by sigmoid functions (dotted lines) at no (black lines) and high (red lines) laser powers of SST neuronal activation. The parameters for the sigmoid fit are, for no SST neuronal activation: y0 = −0.3; yrange = 13.4; x0 = 93 dB; and Δx = 11 dB, and for SST neuronal activation: y0 = −0.2; yrange = 8.6; x0 = 96 dB; Δx = 8 dB. C, Example of changes to the response-level curve of a sound-increasing monotonic neuron upon VIP neuronal activation. Response-level curves (solid lines) are fit by sigmoid functions (dotted lines) at no (black lines) and high (red lines) laser powers of VIP neuronal activation. The parameters for the sigmoid fit are, for no VIP neuronal activation: y0 = −0.1; yrange = 5.6; x0 = 66 dB; Δx = 5 dB, and, for VIP neuronal activation: y0 = −0.4; yrange = 9.9; x0 = 60 dB; Δx = 5 dB. D, Schematic showing the offset amplitude y0 parameter of the sigmoid fit (left panel) and cumulative distribution function of y0 for monotonic sound-increasing neurons at different laser powers of SST (middle panel) and VIP (right panel) neuronal activation. E, Schematic showing the amplitude range yrange parameter of the sigmoid fit (left panel) and cumulative distribution function of yrange for monotonic sound-increasing neurons at different laser powers of SST (middle panel) and VIP (right panel) neuronal activation. F, Schematic showing the midpoint x0 parameter of the sigmoid fit (left panel) and cumulative distribution function of x0 for monotonic sound-increasing neurons at different laser powers of SST (middle panel) and VIP (right panel) neuronal activation. G, Schematic showing the width Δx parameter of the sigmoid fit (left panel) and cumulative distribution function of Δx for monotonic sound-increasing neurons at different laser powers of SST (middle panel) and VIP (right panel) neuronal activation. H, Fit types (sigmoid in red, Gaussian in yellow, no fit in white) at one or both of the other laser powers for the non-SST neurons fit by a sigmoid function at no, medium, and high laser powers of SST neuronal activation. The overlap between yellow and red (orange) indicates neurons fit by a sigmoid function at one laser power and a Gaussian function at the other laser power. I, Fit types (sigmoid in red, Gaussian in yellow, no fit in white) at one or both of the other laser powers for the non-VIP neurons fit by a sigmoid function at no, medium, and high laser powers of VIP neuronal activation. The overlap between yellow and red (orange) indicates neurons fit by a sigmoid function at one laser power and a Gaussian function at the other laser power. J, Schematic of the mean significant changes to the response-level curve of a monotonic sound-increasing cell (gray line) upon SST (blue line) and VIP (green line) neuronal activation. For all panels, black, pink, and red colors correspond to no laser power (0 mW/mm2), medium laser power (∼0.3 mW/mm2), and high laser power (∼3.5 mW/mm2), respectively (see Materials and Methods for power calibration). For all panels, the statistical test GLME was performed on the distributions at the three different levels of interneuron activation, with n.s. corresponding to non-significant; *p < 0.05, **p < 0.01; ***p < 0.001. There are n = 109, n = 103, and n = 64 sound-increasing monotonic cells fit at no, medium, and high laser powers of SST neuronal activation, respectively. There are n = 267, n = 239, and n = 269 sound-increasing monotonic cells fit at no, medium, and high laser powers of VIP neuronal activation, respectively.

Figure 5.

Figure 5.

Single-cell fits of response-level curves for nonmonotonic cells A. Example neuron with a nonmonotonic response-level curve (solid black line) with a Gaussian fit estimated at the probed sound pressure levels (blue dashed line with circles) and the Gaussian function with the same parameters (dotted black line). The parameters for the Gaussian fit are as follows: offset amplitude y0 = 0.15; range yrange = 3.8; mean xmean = 49 dB; SD σ = 9 dB. B, Example of changes to the response-level curve of a sound-increasing nonmonotonic neuron upon SST neuronal activation. Response-level curves (solid lines) are fit by Gaussian functions (dotted lines) at no (black lines) and high (red lines) laser powers of SST neuronal activation. The parameters for the Gaussian fit are, for no SST neuronal activation: y0 = 0.2; yrange = 3.8; xmean = 49 dB; σ = 9 dB, and, for SST neuronal activation: y0 = 0.1; yrange = 0.8; xmean = 47 dB; σ = 11 dB. C, Example of changes to the response-level curve of a sound-increasing nonmonotonic neuron upon VIP neuronal activation. Response-level curves (solid lines) are fit by Gaussian functions (dotted lines) at no (black lines) and high (red lines) laser powers of VIP neuronal activation. The parameters for the Gaussian fit are, for no VIP neuronal activation: y0 = 0.0; yrange = 0.2; xmean = 49 dB; σ = 4 dB, and, for VIP neuronal activation: y0 = 0.0; yrange = 0.4; xmean = 66 dB; σ = 7 dB. D, Schematic showing the offset amplitude y0 parameter of the Gaussian fit (left panel) and cumulative distribution function of y0 for nonmonotonic sound-increasing neurons at different laser powers of SST (middle panel) and VIP (right panel) neuronal activation. E, Schematic showing the amplitude range yrange parameter of the Gaussian fit (left panel) and cumulative distribution function of yrange for nonmonotonic sound-increasing neurons at different laser powers of SST (middle panel) and VIP (right panel) neuronal activation. F, Schematic showing the mean xmean parameter of the Gaussian fit (left panel) and cumulative distribution function of xmean for nonmonotonic sound-increasing neurons at different laser powers of SST (middle panel) and VIP (right panel) neuronal activation. G, Schematic showing the SD σ parameter of the Gaussian fit (left panel) and cumulative distribution function of σ for nonmonotonic sound-increasing neurons at different laser powers of SST (middle panel) and VIP (right panel) neuronal activation. H, Fit types (sigmoid in red, Gaussian in yellow, no fit in white) at one or both of the other laser powers for the non-SST neurons fit by a Gaussian function at no, medium, and high laser powers of SST neuronal activation. The overlap between yellow and red (orange) indicates neurons fit by a sigmoid function at one laser power and a Gaussian function at the other laser power. I, Fit types (sigmoid in red, Gaussian in yellow, no fit in white) at one or both of the other laser powers for the non-VIP neurons fit by a Gaussian function at no, medium, and high laser powers of VIP neuronal activation. The overlap between yellow and red (orange) indicates neurons fit by a sigmoid function at one laser power and a Gaussian function at the other laser power. J, Schematic of the mean significant changes to the response-level curve of a nonmonotonic sound-increasing cell (gray line) upon SST (blue line) and VIP (green line) neuronal activation. For all panels, black, pink, and red colors correspond to no laser power (0 mW/mm2), medium laser power (∼0.3 mW/mm2), and high laser power (∼3.5 mW/mm2), respectively (see Materials and Methods for power calibration). For all panels, the statistical test GLME was performed on the distributions at the three different levels of interneuron activation, with n.s. corresponding to non-significant; *p < 0.05; **p < 0.01; ***p < 0.001. There are n = 224, n = 175, and n = 130 sound-increasing nonmonotonic cells fit at no, medium, and high laser powers of SST neuronal activation, respectively. There are n = 243, n = 278, and n = 310 sound-increasing nonmonotonic cells fit at no, medium, and high laser powers of VIP neuronal activation, respectively.

We first characterized the responses of sound-modulated cells that exhibited a monotonic response-level curve by fitting this curve for individual cells at different levels of laser power (Fig. 4AC). Out of the four sigmoidal fit parameters (Fig. 4DG, middle panels), only the midpoint of the sigmoid fit exhibited a significant change upon SST neuronal activation, from 58 dB at no laser power to 73 dB at high laser power (median values; n = 109, 103, and 64 neurons for no, medium, and high laser power, respectively; offset: plaser = 0.41; range: plaser = 0.22; midpoint: **plaser = 8.6 × 10−3; width: plaser = 0.99; GLME; Fig. 4DG, middle panels). With VIP activation, only the range of the sigmoid fit showed a significant increase (n = 267, 239, and 269 neurons for no, medium, and high laser power, respectively; offset: plaser = 0.081; range: **plaser = 1.9 × 10−3; midpoint: plaser = 0.38; width: plaser = 0.57; GLME; Fig. 4DG, right panels). Among the non-SST neurons fit by a sigmoidal function, two-thirds of the cells were fit at a single laser power of SST neuronal activation (n = 75, 60, and 40 neurons for no, medium, and high laser power, respectively; Fig. 4H) and around a tenth of the neurons switched monotonicity with SST neuronal activation (Gaussian fit at other laser powers for n = 10, 21, and 8 neurons at no, medium, and high laser power, respectively). Among the non-VIP neurons fit by a sigmoidal function, ∼40% of the cells were fit at a single laser power of VIP neuronal activation (n = 108, 88, and 109 neurons for no, medium, and high laser power, respectively; Fig. 4I) and ∼15% of the neurons switched monotonicity with VIP neuronal activation (Gaussian fit at other laser powers for n = 49, 35, and 39 neurons at no, medium, and high laser power, respectively). Thus, SST neuronal activation led to monotonic response-level functions that were shifted rightward toward higher sound pressure levels, leading to responses to a narrower range of sounds at higher sound pressure levels. VIP activation expanded the neuronal response-level curves upward, leading to responses to a broader range of sound pressure levels (Fig. 4J).

We then characterized the responses of sound-modulated cells that exhibited a nonmonotonic response-level curve by fitting this curve for individual cells at different levels of laser power (Fig. 5AC). Whereas the mean and SD of the Gaussian fit remained unchanged (mean: plaser = 0.73; SD: plaser = 0.53; GLME; Fig. 5F,G, middle panels), the offset and the range of the Gaussian fit decreased with SST neuronal activation (n = 224, 175, and 130 neurons for no, medium, and high laser power, respectively; offset: **plaser = 1.7 × 10−3; range: ***plaser = 9.4 × 10−8; GLME; Fig. 5D,E, middle panels). The offset of response did not change with VIP neuronal activation (offset: plaser = 0.48; GLME; Fig. 5D, right panel), whereas the range increased significantly as well as the Gaussian mean and SD (n = 243, 278, and 310 neurons for no, medium, and high laser power, respectively; range: ***plaser = 2.9 × 10−8; mean: ***plaser = 8.6 × 10−4; SD: ***plaser = 8.1 × 10−5; GLME; Fig. 5EG, right panels). Among the non-SST neurons fit by a Gaussian function, two-thirds of the cells were fit at a single laser power of SST neuronal activation (n = 147, 111, and 89 neurons for no, medium, and high laser power, respectively; Fig. 5H) and less than a tenth of the neurons switched monotonicity with SST neuronal activation (sigmoidal fit at other laser powers for n = 17, 6, and 13 neurons at no, medium, and high laser power, respectively). Among the non-VIP neurons fit by a Gaussian function, half of the cells were fit at a single laser power of VIP neuronal activation (n = 120, 150, and 170 neurons for no, medium, and high laser power, respectively; Fig. 5I), and ∼15% of the neurons switched monotonicity with VIP neuronal activation (sigmoidal fit at other laser powers for n = 32, 42, and 40 neurons at no, medium, and high laser power, respectively). Thus, SST neuronal activation led to nonmonotonic response-level functions that were shifted downward with a decreased range of responses, leading to responses above the noise level to a narrower range of sound pressure levels. VIP neuronal activation shifted the neuronal response-level curves rightward with an increased range of response, leading to increased peak responses at higher sound pressure levels (Fig. 5J).

These results demonstrated that SST neuronal activation promoted a more localist representation of the sound pressure level at the population scale by eliciting responses above the noise level over a narrower range of sound pressure levels and increasing separation between the sound pressure levels covered by monotonic and nonmonotonic neurons. By contrast, VIP neuronal activation promoted a more distributed representation of the sound pressure level by broadening response-level curves of single neurons and increasing the overlap between the sound pressure levels covered by monotonic and nonmonotonic neurons.

Could the changes to the fitted parameters of sound-modulated cells (Figs. 4, 5) explain the differential effect of SST and VIP neuronal activation on the separation angle and length between mean population vectors to different sound pressure levels (Fig. 3)? To answer this question, we constructed a qualitative model including a monotonic and a nonmonotonic cell, with response-level fit parameters for no and high laser power taken as the mean parameters from the data in Figures 4 and 5 (Fig. 6A,B). With this simple two-cell population, we could qualitatively reproduce the increase in separation angle upon SST neuronal activation and the decrease in separation angle upon VIP neuronal activation over the range of sound pressure levels we tested (Fig. 6DF) and similarly the decrease in vector length upon SST neuronal activation and the increase in vector length upon VIP neuronal activation (Fig. 6GI). We could also model different regimes of multiplicative (or divisive) and additive (or subtractive) effects of SST or VIP neuronal activation on monotonic and nonmonotonic neurons (Fig. 6J,K). Thus, the changes to the fit parameters observed in Figures 4 and 5 can explain the change in representation of sound pressure levels observed in Figure 3.

Figure 6.

Figure 6.

Two-cell model A. Response-level curve of the monotonic cell with parameters taken as the mean parameters from Figure 4 at no and high laser power. The parameters for the sigmoid curve with no interneuron activation (black) are as follows: offset amplitude y0 = −0.12; range yrange = 0.88; midpoint x0 = 55 dB; width Δx = 11 dB. Upon SST neuronal activation (blue), all parameters remain constant except for: midpoint x0SST = 68 dB; upon VIP neuronal activation (green), all parameters remain constant except for: range yrangeVIP = 1.43. B, Response-level curve of the nonmonotonic cell with parameters taken as the mean parameters from Figure 5 at no and high laser power. The parameters for the Gaussian curve with no interneuron activation (black) are as follows: offset amplitude y0 = −0.04; range yrange = 0.54; mean xmean = 50 dB; SD σ = 13 dB. Upon SST neuronal activation (blue), all parameters remain constant except for the following: offset amplitude y0SST = −0.07; range yrangeSST = 0.27; upon VIP neuronal activation (green), the offset amplitude remains constant, and range yrangeVIP = 0.86; mean xmeanVIP = 54 dB; SD σVIP = 17 dB. C, Trajectory of the population's response from 0 to 90 dB in the neural space, with the response of the monotonic cell on the x-axis and the response of the nonmonotonic cell on the y-axis. The response of both cells at 0 dB has been subtracted from the curves; thus the dots at the (0,0) coordinate are the response to 0 dB, and the end of the curves on the right indicates the response to 90 dB. The trajectories are computed from 0 to 90 dB with 1 dB increments, and circles on a line represent 10 dB increments from 0 to 90 dB. D, Confusion matrix of the separation angle between population responses to each sound and laser power from silence at a given laser power, for no (left), SST (middle), and VIP (right) neuronal activation. The sound pressure level is in 1 dB increments, and the gray box indicates the sound levels sampled in the experiments (Fig. 3D,G) E. Schematic in the neural space (see panel C) of the angle between 50 and 70 dB when there is no (black), SST (blue), or VIP (green) neuronal activation, starting from the population's response to silence for each case of (or lack of) interneuron activation. The angle is greatest when SST neurons are activated and smallest when VIP neurons are activated. F, Confusion matrix of the difference in the separation angle from SST (left) or VIP (right) neuronal activation to no interneuron activation, with the angles calculated as in D. The sound level is in 1 dB increments, and the gray box indicates the sound pressure levels sampled in the experiments (Fig. 3F,I). The mean angle difference for SST neuronal activation is, over 1–90 dB: +3.6° and over 30–90 dB: +3.7°; and for VIP neuronal activation is, over 1–90 dB: −4.0° and over 30–90 dB: −4.1°. G, Confusion matrix of the length of the population vector between each sound pressure level at a given laser power, for no (left), SST (middle), and VIP (right) neuronal activation. The sound pressure level is in 1 dB increments, and the gray box indicates the sound pressure levels sampled in the experiments (Fig. 3J,M). H, Schematic in the neural space (see panel C) of the vector length between 50 and 70 dB when there is no (black), SST (blue), or VIP (green) neuronal activation. The vector length is greatest when VIP neurons are activated and smallest when SST neurons are activated. I, Confusion matrix of the difference in the vector length from SST (left) or VIP (right) neuronal activation to no interneuron activation, with the lengths calculated as in G. The sound pressure level is in 1 dB increments, and the gray box indicates the sound pressure levels sampled in the experiments (Fig. 3I,L). The mean length difference for SST neuronal activation is, over 1–90 dB: −0.12 a.u. and over 30–90 dB: −0.07 a.u.; and for VIP neuronal activation is, over 1–90 dB: +0.27 a.u. and over 30–90 dB: +0.21 a.u. J, Response of the monotonic cell with laser activation versus no laser activation. SST neuronal activation shows a divisive regime, a subtractive regime, and a combination of divisive and subtractive or multiplicative and subtractive regimes depending on the range of responses sampled. VIP neuronal activation shows a multiplicative regime or an additive and multiplicative regime depending on the range of responses sampled. K, Response of the nonmonotonic cell with laser activation versus no laser activation. SST neuronal activation shows a combination of divisive and subtractive regime. VIP neuronal activation shows a multiplicative regime and additive regime or an additive and multiplicative regime depending on the range of responses sampled. For all panels, black indicates no interneuron activation, blue indicates SST neuronal activation, and green indicates VIP neuronal activation.

Overall, when neither SST nor VIP neurons are activated, the neuronal population encoded different sound pressure levels using two strategies: the identity of the responsive cells (different cells respond to different sound pressure levels, discrete encoding of the sound pressure level) and the strength of the neuronal response (continuous encoding of the sound pressure level). When SST neurons are activated, the neuronal population shifts toward a more localist representation of the sound pressure level. Specifically, the encoding of the sound pressure level relies more on the identity of the responsive cells and less on the magnitude of response: there is less overlap between populations of cells responding to different sound pressure levels, but the strength of response is similar for all sound pressure levels. This can be explained by narrower bandwidths of response, albeit of reduced magnitude, of both monotonic and nonmonotonic sound-increasing cells with SST neuronal activation. By contrast, when VIP neurons are activated, the neuronal population shifts toward a more distributed representation of the sound pressure level. Specifically, the encoding of sound pressure level by the magnitude of the neuronal response is enhanced, while the representation by different cell groups declines: the neuronal responses are of higher magnitude and over a higher range, but there is more overlap between neurons responding to different sound pressure levels. This can be explained by larger and broader responses of monotonic and nonmonotonic sound-increasing neuronal responses with VIP neuronal activation.

Discussion

Within the brain, neurons form intricate networks, which represent sensory information. A sensory stimulus, such as a specific sound or a visual image, elicits activity in a subset of neurons in a network. A neuronal network can use a multitude of codes to represent information. A stimulus can be encoded discretely with a localist, pattern-separated representation, in which a specific group of neurons represents a specific stimulus and different stimuli elicit activity in different groups of neurons (Fig. 7A). Such localist representations have the advantage of discreteness: they can separate stimuli in different categories. Alternatively, in a distributed representation, stimulus-evoked activity can be distributed across the network, such that the relative activity of neurons within a group represents different stimuli (Fig. 7A,B). An example of a distributed representation is a rate code, in which the firing rate of the active neurons represents a continuously varying stimulus feature, such as intensity or sound location (Belliveau et al., 2014). Distributed representations have the advantage of invariance: a small change in stimulus parameter will elicit a small variation in the neuronal response. Neuronal population responses have been measured at various positions along the localist to distributed representation spectrum across many features and areas [such as memory (Wixted et al., 2014), sound (Hromádka et al., 2008), sound localization (Lesica et al., 2010; Belliveau et al., 2014), and vision (Christensen and Pillow, 2022)] and can change dynamically along the spectrum (Kato et al., 2015; Honey et al., 2017; Kuchibhotla et al., 2017). A defining feature of the AC is sparse coding (DeWeese et al., 2003), which can lead to both distributed and localist representations. Along the auditory pathway, the coexistence of neurons with monotonic and nonmonotonic response-level curves indicates that the sound pressure level is represented by both localist and distributed codes (Schreiner et al., 1992; Wu et al., 2006; Tan et al., 2007; Watkins and Barbour, 2011). More generally, stimulus representation within neuronal networks is mixed between local and distributed codes, in so-called sparse distributed representations, with both the level of activity and the identity of activated neurons encoding the stimulus (Fig. 7A; Rolls and Tovee, 1995; Vinje and Gallant, 2000; Hromádka et al., 2008; Wixted et al., 2014). Based on the environmental and behavioral demands, it may be beneficial for neuronal representations to shift dynamically toward a more localist or a more distributed representation of a stimulus feature.

Figure 7.

Figure 7.

Predictions for testing for differential control of localist and distributed representations of sounds by inhibitory neurons. A, A schematic of example localist versus distributed neuronal codes. B, Localist versus distributed representations can be implemented in many ways. An example of localist code is the pattern code without redundancy (known as the “grandmother cell”; Bowers, 2009). An example of distributed code is a code relying only on the relative response of each cell and not the magnitude of response of the population vector. C, Neuronal circuit manipulation: optogenetic stimulation of SST and VIP neurons in AC (simplified connectivity circuit). D, Noise bursts at different SPLs were presented to an awake mouse. E. For each sound pressure level, some neurons responded (filled circles), while many cells did not respond (empty circles). F. The response to different sound pressure levels can be described in the neuronal space by the separation angle between the mean population vectors and the length separating them. G. Changes to the representation of the sound pressure level at the population level are implemented through changes to the response-level curve of each neuron. Gray: baseline; blue: response transformation with SST neuronal activation; green: response transformation with VIP neuronal activation.

Our results suggest that distinct inhibitory neurons in the AC affect population neuronal response code by differentially shifting the responses toward a distributed or a localist representation. Previous work found that SST neuronal activation decreases the activity of the neuronal population to sounds (Phillips and Hasenstaub, 2016; Natan et al., 2017) and leads to a rightward shift of monotonic response-level curves (Wilson et al., 2012), while VIP neuronal activation, through a disinhibitory circuit, increases the activity of the population (Pfeffer et al., 2013; Pi et al., 2013; Zhang et al., 2014). We found that activation of SST neurons led to a sparser, more localist representation (Fig. 2D–E), where the sound pressure level is encoded in discrete steps by distinct groups of neurons (Fig. 3E) with a similar low strength (Fig. 3K). By contrast, activation of VIP neurons led to a more distributed representation (Fig. 2K,L), with more overlap between the cell populations responding to different sound pressure levels (Fig. 3H): the sound pressure level is encoded continuously by varying the strength of response of a large group of neurons (Fig. 3N). These shifts in representation are implemented at the single-neuron level through changes to the response-level function of monotonic (Fig. 4) and nonmonotonic neurons (Fig. 5). SST neuronal activation shifts the response-level curves of sound-modulated neurons by further separating the sound pressure levels that monotonic and nonmonotonic neurons represent (Figs. 4I, 5I). With VIP neuronal activation, the changes to the response-level curves of sound-modulated neurons allow for stronger responses and more overlap in bandwidth between monotonic and nonmonotonic neurons (Figs. 4J, 5J).

As distinct inhibitory populations can be recruited during different behaviors, their ability to transform the neuronal code can be advantageous to distinct behaviors and computations. Localist versus distributed representations, modulated by the relative strength of global (SST) versus local (PV or VIP) inhibition, respectively, may provide support for neuronal computations such as discreteness versus invariance (Kuchibhotla and Bathellier, 2018), segmentation versus concatenation (Haga and Fukai, 2021), and integration of bottom–up versus top–down information (Honey et al., 2017; Hertäg and Sprekeler, 2019). The two types of interneurons receive neuromodulatory inputs and may dynamically change the network's state toward one or the other type of representation for a given task: SST neuronal activation may help with tasks requiring focused attention such as discriminating different stimuli (Lee and Middlebrooks, 2011) or detecting in noise (Lakunina et al., 2022), by sharpening tuning, decreasing the activity for nonrelevant stimuli, and enhancing the information-per-spike (Phillips and Hasenstaub, 2016). In contrast, VIP neuronal activation may help with tasks requiring receptive attention such as active sensing (Gentet et al., 2012) or detecting small stimuli by amplifying weak signals (Millman et al., 2020) and increasing detectability (Cone et al., 2019) without increasing the stimulus-response mutual information (Bigelow et al., 2019).

Our results expand on the results of previous studies that measured more general effects of interneuron modulation. The changes to the response-level curves (Figs. 4, 5) we measured upon SST or VIP neuronal activation can explain how the frequency–response functions of neurons in AC change with interneuron modulation. The excitatory and inhibitory inputs to pyramidal cells of AC are frequency-tuned (Isaacson and Scanziani, 2011; Li et al., 2013; Kato et al., 2017), and intracortical inhibition further shapes the tuning (Wu et al., 2008). From the response-level curves, we can plot the response with interneuron activation versus without and thus predict within which range of amplitudes we can expect multiplicative or additive effects to the frequency tuning curve. Previous studies have shown that SST neuronal activation leads to either subtractive, divisive, or a combination of both subtractive and divisive effects on the frequency tuning curve (Wilson et al., 2012; Seybold et al., 2015; Phillips and Hasenstaub, 2016). Our data are consistent with these results: for monotonic neurons, SST neuronal activation can lead to divisive and/or subtractive effects at a low range of sound amplitudes, and for nonmonotonic neurons, SST neuronal activation leads to both divisive and subtractive effects (Fig. 6J,K). Previous studies have also shown that VIP neuronal activation leads to an additive shift in the frequency tuning curve (Pi et al., 2013; Bigelow et al., 2019), and similarly SST inactivation leads to a multiplicative or additive shift mostly (Phillips and Hasenstaub, 2016). Our data can explain these results as well, with multiplicative effects for monotonic neurons and a range of additive and multiplicative effects for nonmonotonic neurons (Fig. 6J,K). One component that can contribute to a change in representation is a change in the noisiness of the responses. Indeed, changes in the signal-to-noise ratio (SNR) could drive populations to appear as more localist or distributed in their coding. The change in SNR may be one of the components explaining how the change in representation is implemented; however, it is not the sole factor as assessed by the decoding accuracy (Fig. 2F,M).

Nonmonotonic neurons in AC either can have their nonmonotonicity inherited from the nonmonotonic excitatory input into those cells while the monotonic inhibitory input, which shows a peak in delay at the cell's best pressure level, sharpens the nonmonotonicity (Wu et al., 2006) or can be constructed de novo with an imbalance between excitatory and inhibitory inputs (Wu et al., 2006; Tan et al., 2007). This means that for some of the nonmonotonic cells we recorded from, the input into the cell does not covary with the sound pressure level, and so for these cells, we are not assessing the input–output function of the cell through the response-level curve, rather how inhibition further shapes the already intensity-tuned input. From our experiments, we observe that SST neuronal activation decreases the range of responses of nonmonotonic neurons but does not change the sound pressure level of peak response nor the width of responses (Fig. 5). This may indicate that SST neuronal activation does not change the timing of the inhibitory input into the nonmonotonic cells but rather the overall strength of inhibition across all sound pressure levels. In contrast, VIP neuronal activation leads to a shift of the sound pressure level of peak response toward higher levels, along with a broadening of the response and an increase in the range of response (Fig. 5). This could simply be explained if VIP neuronal activation changes the timing of the inhibition, with the delay between excitatory and inhibitory inputs peaking at a higher sound pressure level.

In our sample, the relative proportion of nonmonotonic versus monotonic neurons is higher than would be expected from the literature. We note that the number of neuronal responses that we were able to fit is likely an underestimate of the truly monotonic or nonmonotonic neurons in the population, as we used stringent selection criteria. The relatively high proportion of nonmonotonic neurons may be due to inclusion of the nonprimary region VAF, which has previously been shown to have a higher proportion of nonmonotonic neurons than A1 (Wu et al., 2006; Polley et al., 2007). Furthermore, we used a mouse line in which the hearing loss mutation in Cdh23 commonly found in C57B6 mice is corrected, thus our mice may have lower detection thresholds than the mice in previous studies, leading to a larger observed proportion of nonmonotonic neurons tuned to lower intensities. Additionally, because of the time course of GCaMP, we are not distinguishing between the onset and offset responses, and they may be integrated in this window. These estimates contribute to the ongoing discussion of the differences in monotonicity of responses between the non-human primates (Gao and Wang, 2019) and rodents. It is plausible that our current setup, with imaging performed in awake mice rather than under anesthesia, allows us to sample the responses in a more accurate fashion than previous studies.

In our analysis, the majority of the neurons remained monotonic or non-monotonic between laser conditions, with only 10–15% switching monotonicity (Figs. 4H,I, 5H,I). Additionally, the ratio between the neurons which we identify as monotonic or non-monotonic was generally preserved across the laser conditions. A significant fraction of neurons were fit at a single laser power of SST or VIP neuronal activation. SST and VIP neuronal activation thus elicited responses in new pools of neurons for each laser, however with more consistency between laser powers upon VIP neuronal activation than upon SST neuronal activation. Because of the stringent fitting criteria, we believe that our fitting procedure underestimates the true fraction of monotonic or nonmonotonic neurons. Nonetheless, we are able to compare the fits across conditions because the criteria and fraction of well-fitted responses remain the same. Therefore, the change in representation (localist or distributed) likely relies on changes to the response-level curves rather than on changes in the proportion of monotonic and nonmonotonic neurons.

A limitation of our study is that we measured the response only from a subset of neurons from Layer 2/3 while broadly stimulating many SST or VIP neurons across different cortical layers. However, neurons across the cortical column may perform additional computations in other layers, which would be important to record in future experiments. Our sample ended up including a relatively low number of SST-positive and VIP-positive neurons. Their sound intensity responses largely trended the mean recorded responses and did not differ strongly from each other. From the literature, we were expecting VIP neurons to be selective for low-decibel sounds (Mesik et al., 2015). Similarly, in the visual cortex, VIP neurons prefer low contrast, whereas SST neurons prefer high contrast visual stimuli (Millman et al., 2020). Here, SST and VIP neurons have similar tuning properties with the non-SST and non-VIP neurons within their sessions, but with optogenetic stimulation, sound modulation becomes weaker. Future study should focus on the responses within the SST and VIP neuronal populations across layers. Furthermore, whereas we combined imaging sessions across the AC, future studies should also examine differences in function of inhibitory neurons across the different primary and nonprimary auditory areas.

A related caveat in interpreting our results is that optogenetic activation of inhibitory neurons may differ across samples and can potentially drive higher activity level of excitatory or inhibitory neurons than physiological levels, saturating the responses. To mitigate this limitation, we included multiple activation levels in each of the imaging sessions by modulating the strength of the laser between high and low levels. This allowed within-sample comparison of activity patterns rather than comparisons of absolute changes across multiple imaging sessions. Furthermore, by comparing the activity levels of imaged neurons between low and high laser intensities, we ensured that the modulation of activity was not saturating due to the laser.

An additional caveat is that the opsin-expressing cells may be depolarized at the baseline due to off-target laser stimulation during two-photon imaging (Forli et al., 2018). In SST-Cre mice, the depolarization of SST neurons would lead monotonic non-SST neurons to shift their range of responsiveness toward higher sound pressure levels (Fig. 4), while nonmonotonic non-SST neurons still respond to the same best sound level. This would lead to a proportionally larger response to the lower sound pressure levels (covered mainly by nonmonotonic neurons) than to the high sound pressure levels, as we observe in Figure 2C. In VIP-Cre mice, the shape of the population response-level curve (Fig. 2J) is similar to that in SST-Cre mice. The depolarization of VIP neurons at the baseline may lead to an increase in the amplitude range of nonmonotonic neurons combined with a slight increase in the number of nonmonotonic neurons, which may result in similar changes to the population response-sound level curve compared with the curve in the control.

Another potential caveat in interpreting the data is that while we were able to identify a subset of neurons as SST or VIP positive, the analysis combined multiple types of neurons as non-SST or non-VIP neurons, and for some neurons the tdTomato expression might not have been strong enough for detection as SST or VIP positive. The population of unlabeled neurons may include SST neurons explaining the increase in population response for non-SST neurons in response to high laser power and no sound (Fig. 2C). SST neurons also suppress PV neurons (Pfeffer et al., 2013), which control frequency discrimination (Aizenberg et al., 2015; Briguglio et al., 2018), and at high laser power, in combination with firing rate saturation and indirect disinhibition of excitatory neurons via PV neurons, the overall activity might be increasing in the absence of sound. A targeted experimental approach which would allow either in vivo or post hoc identification of imaged neuronal types would allow to further distinguish response parameters among different types of inhibitory and excitatory neurons (Kerlin et al., 2010; Khan et al., 2018). Furthermore, future studies inactivating these inhibitory neurons would further complement and extend our findings (Phillips and Hasenstaub, 2016).

In our experiments, we tested the results of SST or VIP neuronal activation on the neuronal representation of the sound pressure level, and an important next step will be to investigate whether the changes in neuronal representation of the sound pressure level correlate with behavioral effects. One approach would be to image the activity of SST neurons while a mouse is engaged in a task that may require SST neuronal activation or similarly VIP neurons. SST neurons may be involved in tasks leading to a sharpening in the neuronal tuning properties or to a filtering out of irrelevant stimuli through the overall decrease in the firing rate, such as discrimination tasks and signal-in-noise tasks (Otazu et al., 2009; Lee and Middlebrooks, 2011; Kuchibhotla et al., 2017; Christensen et al., 2019; Lakunina et al., 2022). VIP neurons may be involved in tasks requiring amplification of weak signals, such as a detection at threshold task or active sensing (Fritz et al., 2003; Gentet et al., 2012; Bennett et al., 2013; Kato et al., 2015; Cone et al., 2019; Millman et al., 2020). A second approach would be to measure how SST or VIP neuronal activity changes the performance of a mouse engaged in a task. In a detection task, we may expect that detection thresholds increase with SST neuronal activation and decrease with VIP neuronal activation, as seen in the visual cortex (Cone et al., 2019). In a discrimination task or detection of sounds in the background task, we may expect SST neuronal activation to increase the performance, while VIP neuronal activation may decrease or not affect performance: SST neuronal inactivation decreases performance in the detection of sounds in background noise (Lakunina et al., 2022), and at the neuronal level, VIP neuronal activation decreases “encoding efficiency” (Bigelow et al., 2019), while SST neuronal activation may increase it (Phillips and Hasenstaub, 2016).

It should be noted that the dichotomy between the functional roles of SST and VIP neurons might not be so clear cut: VIP and SST neurons may cooperate to simultaneously amplify relevant stimuli and filter out irrelevant stimuli, respectively, or VIP neurons may be more active for weak stimuli and SST neurons for loud stimuli (Zhang et al., 2014; Mesik et al., 2015; Karnani et al., 2016; Kuchibhotla et al., 2017; Dipoppa et al., 2018; Millman et al., 2020). For example, activation of the cingulate cortex can elicit spiking activity in PV, SST, and VIP neurons, with SST neurons contributing to surround suppression, and VIP neurons to facilitation of the center of the receptive field (Zhang et al., 2014); cholinergic modulation depolarizes multiple types of inhibitory neurons, including SST and VIP neurons (Kuchibhotla et al., 2017); and locomotion increases activity in VIP neurons primarily for small stimuli but in SST neurons for large stimuli (Dipoppa et al., 2018). VIP neurons inhibit SST neurons, creating attentional spotlights through targeted disinhibition (Karnani et al., 2016). Therefore, examination of the function of SST and VIP neurons in complex behaviors needs to take the circuit mechanisms into account. Interestingly, the connectivity between cortical neurons is largely conserved across layers, primary and nonprimary cortices, and sensory and nonsensory areas (Douglas and Martin, 2004; Markram et al., 2004; Yu et al., 2019; Campagnola et al., 2022), with SST and VIP neurons mutually inhibiting each other as a common motif: perhaps the change in representation we observe with varying sound-level pressure upon SST or VIP neurons extends to other stimulus features, sensory and nonsensory alike.

References

  1. Aizenberg M, Mwilambwe-Tshilobo L, Briguglio JJ, Natan RG, Geffen MN (2015) Bidirectional regulation of innate and learned behaviors that rely on frequency discrimination by cortical inhibitory neurons. PLoS Biol 13:e1002308. 10.1371/journal.pbio.1002308 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Belliveau LAC, Lyamzin DR, Lesica NA (2014) The neural representation of interaural time differences in gerbils is transformed from midbrain to cortex. J Neurosci 34:16796–16808. 10.1523/JNEUROSCI.2432-14.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bennett C, Arroyo S, Hestrin S (2013) Subthreshold mechanisms underlying state-dependent modulation of visual responses. Neuron 80:350–357. 10.1016/j.neuron.2013.08.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bigelow J, Morrill RJ, Dekloe J, Hasenstaub AR (2019) Movement and VIP interneuron activation differentially modulate encoding in mouse auditory cortex. eNeuro 6:ENEURO.0164-19.2019. 10.1523/ENEURO.0164-19.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Blackwell JM, Lesicko AM, Rao W, De Biasi M, Geffen MN (2020) Auditory cortex shapes sound responses in the inferior colliculus ( Shinn-Cunningham BG, Carr CE, Woolley S, Read H, eds). Elife 9:e51890. 10.7554/eLife.51890 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bowers JS (2009) On the biological plausibility of grandmother cells: implications for neural network theories in psychology and neuroscience. Psychol Rev 116:220–251. 10.1037/a0014462 [DOI] [PubMed] [Google Scholar]
  7. Briguglio JJ, Aizenberg M, Balasubramanian V, Geffen MN (2018) Cortical neural activity predicts sensory acuity under optogenetic manipulation. J Neurosci 38:2094–2105. 10.1523/JNEUROSCI.2457- 17.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Campagnola L, et al. (2022) Local connectivity and synaptic dynamics in mouse and human neocortex. Science 375:eabj5861. 10.1126/science.abj5861 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chen N, Sugihara H, Sur M (2015) An acetylcholine-activated microcircuit drives temporal dynamics of cortical activity. Nat Neurosci 18:892–902. 10.1038/nn.4002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Christensen RK, Lindén H, Nakamura M, Barkat TR (2019) White noise background improves tone discrimination by suppressing cortical tuning curves. Cell Rep 29:2041-2053.e4. 10.1016/j.celrep.2019.10.049 [DOI] [PubMed] [Google Scholar]
  11. Christensen AJ, Pillow JW (2022) Reduced neural activity but improved coding in rodent higher-order visual cortex during locomotion. Nat Commun 13:1676. 10.1038/s41467-022-29200-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Cone JJ, Scantlen MD, Histed MH, Maunsell JHR (2019) Different inhibitory interneuron cell classes make distinct contributions to visual contrast perception. eNeuro 6:ENEURO.0337-18.2019. 10.1523/ENEURO.0337-18.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. DeWeese MR, Wehr M, Zador AM (2003) Binary spiking in auditory cortex. J Neurosci 23:7940–7949. 10.1523/JNEUROSCI.23-21-07940.2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Dipoppa M, Ranson A, Krumin M, Pachitariu M, Carandini M, Harris KD (2018) Vision and locomotion shape the interactions between neuron types in mouse visual cortex. Neuron 98:602–615.e8. 10.1016/j.neuron.2018.03.037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Douglas RJ, Martin KAC (2004) Neuronal circuits of the neocortex. Annu Rev Neurosci 27:419–451. 10.1146/annurev.neuro.27.070203.144152 [DOI] [PubMed] [Google Scholar]
  16. Fanselow EE, Richardson KA, Connors BW (2008) Selective, state-dependent activation of somatostatin-expressing inhibitory interneurons in mouse neocortex. J Neurophysiol 100:2640–2652. 10.1152/jn.90691.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Feigin L, Tasaka G, Maor I, Mizrahi A (2021) Sparse coding in temporal association cortex improves complex sound discriminability. J Neurosci 41:7048–7064. 10.1523/JNEUROSCI.3167-20.2021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Forli A, et al. (2018) Two-photon bidirectional control and imaging of neuronal excitability with high spatial resolution in vivo. Cell Rep 22:3087–3098. 10.1016/j.celrep.2018.02.063 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Fritz J, Shamma S, Elhilali M, Klein D (2003) Rapid task-related plasticity of spectrotemporal receptive fields in primary auditory cortex. Nat Neurosci 6:1216–1223. 10.1038/nn1141 [DOI] [PubMed] [Google Scholar]
  20. Fu Y, Tucciarone JM, Espinosa JS, Sheng N, Darcy DP, Nicoll RA, Huang ZJ, Stryker MP (2014) A cortical circuit for gain control by behavioral state. Cell 156:1139–1152. 10.1016/j.cell.2014.01.050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gao L, Wang X (2019) Subthreshold activity underlying the diversity and selectivity of the primary auditory cortex studied by intracellular recordings in awake marmosets. Cereb Cortex 29:994–1005. 10.1093/cercor/bhy006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Gentet LJ, Kremer Y, Taniguchi H, Huang ZJ, Staiger JF, Petersen CCH (2012) Unique functional properties of somatostatin-expressing GABAergic neurons in mouse barrel cortex. Nat Neurosci 15:607–612. 10.1038/nn.3051 [DOI] [PubMed] [Google Scholar]
  23. Haga T, Fukai T (2021) Multiscale representations of community structures in attractor neural networks. PLoS Comput Biol 17:e1009296. 10.1371/journal.pcbi.1009296 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hertäg L, Sprekeler H (2019) Amplifying the redistribution of somato-dendritic inhibition by the interplay of three interneuron types Cuntz H, ed. PLoS Comput Biol 15:e1006999. 10.1371/journal.pcbi.1006999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Honey CJ, Newman EL, Schapiro AC (2017) Switching between internal and external modes: a multiscale learning principle. Netw Neurosci 1:339–356. 10.1162/NETN_a_00024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hromádka T, DeWeese MR, Zador AM (2008) Sparse representation of sounds in the unanesthetized auditory cortex. PLoS Biol 6:e16. 10.1371/journal.pbio.0060016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Isaacson JS, Scanziani M (2011) How inhibition shapes cortical activity. Neuron 72:231–243. 10.1016/j.neuron.2011.09.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Jackson J, Ayzenshtat I, Karnani MM, Yuste R (2016) VIP+ interneurons control neocortical activity across brain states. J Neurophysiol 115:3008–3017. 10.1152/jn.01124.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Karnani MM, Jackson J, Ayzenshtat I, Sichani XH, Manoocheri K, Kim S, Yuste R (2016) Opening holes in the blanket of inhibition: localized lateral disinhibition by vip interneurons. J Neurosci 36:3471–3480. 10.1523/JNEUROSCI.3646-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Kato HK, Asinof SK, Isaacson JS (2017) Network-level control of frequency tuning in auditory cortex. Neuron 95:412–423. 10.1016/j.neuron.2017.06.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kato HK, Gillet SN, Isaacson JS (2015) Flexible sensory representations in auditory cortex driven by behavioral relevance. Neuron 88:1027–1039. 10.1016/j.neuron.2015.10.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Kawaguchi Y, Shindou T (1998) Noradrenergic excitation and inhibition of GABAergic cell types in rat frontal cortex. J Neurosci 18:6963–6976. 10.1523/JNEUROSCI.18-17-06963.1998 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Kerlin AM, Andermann ML, Berezovskii VK, Reid RC (2010) Broadly tuned response properties of diverse inhibitory neuron subtypes in mouse visual cortex. Neuron 67:858–871. 10.1016/j.neuron.2010.08.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Khan AG, Poort J, Chadwick A, Blot A, Sahani M, Mrsic-Flogel TD, Hofer SB (2018) Distinct learning-induced changes in stimulus selectivity and interactions of GABAergic interneuron classes in visual cortex. Nat Neurosci 21:851–859. 10.1038/s41593-018-0143-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Klapoetke NC, et al. (2014) Independent optical excitation of distinct neural populations. Nat Methods 11:338–346. 10.1038/nmeth.2836 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Kuchibhotla K, Bathellier B (2018) Neural encoding of sensory and behavioral complexity in the auditory cortex. Curr Opin Neurobiol 52:65–71. 10.1016/j.conb.2018.04.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Kuchibhotla KV, Gill JV, Lindsay GW, Papadoyannis ES, Field RE, Sten TAH, Miller KD, Froemke RC (2017) Parallel processing by cortical inhibition enables context-dependent behavior. Nat Neurosci 20:62–71. 10.1038/nn.4436 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Lakunina AA, Menashe N, Jaramillo S (2022) Contributions of distinct auditory cortical inhibitory neuron types to the detection of sounds in background noise. eNeuro 9:ENEURO.0264-21.2021. 10.1523/ENEURO.0264-21.2021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Lakunina AA, Nardoci MB, Ahmadian Y, Jaramillo S (2020) Somatostatin-expressing interneurons in the auditory cortex mediate sustained suppression by spectral surround. J Neurosci 40:3564–3575. 10.1523/JNEUROSCI.1735-19.2020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Lee CC, Middlebrooks JC (2011) Auditory cortex spatial sensitivity sharpens during task performance. Nat Neurosci 14:108–114. 10.1038/nn.2713 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Lesica NA, Lingner A, Grothe B (2010) Population coding of interaural time differences in gerbils and barn owls. J Neurosci 30:11696–11702. 10.1523/JNEUROSCI.0846-10.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Li L, Li Y, Zhou M, Tao HW, Zhang LI (2013) Intracortical multiplication of thalamocortical signals in mouse auditory cortex. Nat Neurosci 16:1179–1181. 10.1038/nn.3493 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Litovsky RY, Clifton RK (1992) Use of sound-pressure level in auditory distance discrimination by 6-month-old infants and adults. J Acoust Soc Am 92:794–802. 10.1121/1.403949 [DOI] [PubMed] [Google Scholar]
  44. Markram H, Toledo-Rodriguez M, Wang Y, Gupta A, Silberberg G, Wu C (2004) Interneurons of the neocortical inhibitory system. Nat Rev Neurosci 5:793–807. 10.1038/nrn1519 [DOI] [PubMed] [Google Scholar]
  45. Mesik L, Ma WP, Li LY, Ibrahim LA, Huang ZJ, Zhang L, Tao HW (2015) Functional response properties of VIP-expressing inhibitory neurons in mouse visual and auditory cortex. Front Neural Circuits 9:22. 10.3389/fncir.2015.00022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Millman DJ, et al. (2020) VIP interneurons in mouse primary visual cortex selectively enhance responses to weak but specific stimuli. Elife 9:1–22. 10.7554/eLife.55130 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Natan RG, Briguglio JJ, Mwilambwe-Tshilobo L, Jones SI, Aizenberg M, Goldberg EM, Geffen MN (2015) Complementary control of sensory adaptation by two types of cortical interneurons ( King AJ, ed). Elife 4:e09868. 10.7554/eLife.09868 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Natan RG, Rao W, Geffen MN (2017) Cortical interneurons differentially shape frequency tuning following adaptation. Cell Rep 21:878–890. 10.1016/j.celrep.2017.10.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Olsen SR, Wilson RI (2008) Lateral presynaptic inhibition mediates gain control in an olfactory circuit. Nature 452:956–960. 10.1038/nature06864 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Otazu GH, Tai L-H, Yang Y, Zador AM (2009) Engaging in an auditory task suppresses responses in auditory cortex. Nat Neurosci 12:646–654. 10.1038/nn.2306 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Pfeffer CK, Xue M, He M, Huang ZJ, Scanziani M (2013) Inhibition of inhibition in visual cortex: the logic of connections between molecularly distinct interneurons. Nat Neurosci 16:1068–1076. 10.1038/nn.3446 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Phillips EA, Hasenstaub AR (2016) Asymmetric effects of activating and inactivating cortical interneurons (Uchida N, ed). Elife 5:e18383. 10.7554/eLife.18383 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Phillips DP, Semple MN, Kitzes LM (1995) Factors shaping the tone level sensitivity of single neurons in posterior field of cat auditory cortex. J Neurophysiol 73:674–686. 10.1152/jn.1995.73.2.674 [DOI] [PubMed] [Google Scholar]
  54. Pi HJ, Hangya B, Kvitsiani D, Sanders JI, Huang ZJ, Kepecs A (2013) Cortical interneurons that specialize in disinhibitory control. Nature 503:521–524. 10.1038/nature12676 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Polley DB, Read HL, Storace DA, Merzenich MM (2007) Multiparametric auditory receptive field organization across five cortical fields in the albino rat. J Neurophysiol 97:3621–3638. 10.1152/jn.01298.2006 [DOI] [PubMed] [Google Scholar]
  56. Rolls ET, Tovee MJ (1995) Sparseness of the neuronal representation of stimuli in the primate temporal visual cortex. J Neurophysiol 73:713–726. 10.1152/jn.1995.73.2.713 [DOI] [PubMed] [Google Scholar]
  57. Rudy B, Fishell G, Lee S, Hjerling-Leffler J (2011) Three groups of interneurons account for nearly 100% of neocortical GABAergic neurons. Dev Neurobiol 71:45–61. 10.1002/dneu.20853 [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Schreiner CE, Mendelson JR, Sutter ML (1992) Experimental brain research functional topography of cat primary auditory cortex: representation of tone intensity.
  59. Seay MJ, Natan RG, Geffen MN, Buonomano DV (2020) Differential short-term plasticity of PV and SST neurons accounts for adaptation and facilitation of cortical neurons to auditory tones. J Neurosci 40:9224–9235. 10.1523/JNEUROSCI.0686-20.2020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Seybold BA, Phillips EAK, Schreiner CE, Hasenstaub AR (2015) Inhibitory actions unified by network integration. Neuron 87:1181–1192. 10.1016/j.neuron.2015.09.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Sun W, Marongelli EN, Watkins PV, Barbour DL (2017) Decoding sound level in the marmoset primary auditory cortex. J Neurophysiol 118:2024–2033. 10.1152/jn.00670.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Tan AYY, Atencio CA, Polley DB, Merzenich MM, Schreiner CE (2007) Unbalanced synaptic inhibition can create intensity-tuned auditory cortex neurons. Neuroscience 146:449–462. 10.1016/j.neuroscience.2007.01.019 [DOI] [PubMed] [Google Scholar]
  63. Tobin M, Geffen M, Sheth J, Wood K (2024) Data from: “distinct inhibitory neurons differently shape neuronal codes for sound intensity in the auditory cortex.” Dryad.
  64. Vinje WE, Gallant JL (2000) Sparse coding and decorrelation in primary visual cortex during natural vision. Science 287:1273–1276. 10.1126/science.287.5456.1273 [DOI] [PubMed] [Google Scholar]
  65. Watkins PV, Barbour DL (2011) Rate-level responses in awake marmoset auditory cortex. Hear Res 275:30–42. 10.1016/j.heares.2010.11.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Willmore B, Tolhurst DJ (2001) Characterizing the sparseness of neural codes. Network 12:255–270. 10.1080/net.12.3.255.270 [DOI] [PubMed] [Google Scholar]
  67. Wilson NR, Runyan CA, Wang FL, Sur M (2012) Division and subtraction by distinct cortical inhibitory networks in vivo. Nature 488:343–348. 10.1038/nature11347 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Wixted JT, Squire LR, Jang Y, Papesh MH, Goldinger SD, Kuhn JR, Smith KA, Treiman DM, Steinmetz PN (2014) Sparse and distributed coding of episodic memory in neurons of the human hippocampus. Proc Natl Acad Sci U S A 111:9621–9626. 10.1073/pnas.1408365111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Wood KC, Angeloni CF, Oxman K, Clopath C, Geffen MN (2022) Neuronal activity in sensory cortex predicts the specificity of learning in mice. Nat Commun 13:1167. 10.1038/s41467-022-28784-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Wu GK, Arbuckle R, Liu B, Tao HW, Zhang LI (2008) Lateral sharpening of cortical frequency tuning by approximately balanced inhibition. Neuron 58:132–143. 10.1016/j.neuron.2008.01.035 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Wu GK, Li P, Tao HW, Zhang LI (2006) Nonmonotonic synaptic excitation and imbalanced inhibition underlying cortical intensity tuning. Neuron 52:705–715. 10.1016/j.neuron.2006.10.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Yu J, Hu H, Agmon A, Svoboda K (2019) Recruitment of GABAergic interneurons in the barrel cortex during active tactile behavior. Neuron 104:412–427. 10.1016/j.neuron.2019.07.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Zhang JW, Lau C, Cheng JS, Xing KK, Zhou IY, Cheung MM, Wu EX (2013) Functional magnetic resonance imaging of sound pressure level encoding in the rat central auditory system. Neuroimage 65:119–126. 10.1016/j.neuroimage.2012.09.069 [DOI] [PubMed] [Google Scholar]
  74. Zhang S, Xu M, Kamigaki T, Do JPH, Chang WC, Jenvay S, Miyamichi K, Luo L, Dan Y (2014) Long-range and local circuits for top-down modulation of visual cortex processing. Science 345:660–665. 10.1126/science.1254126 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data and code are available on the dryad depository: https://doi.org/10.5061/dryad.t1g1jwt6d (Tobin et al., 2024 link for reviewers, https://datadryad.org/stash/share/wk9BuKO0A8ZzqEfsrAUtd__bHqPOe2pJ5P8WExEOpY0.)


Articles from The Journal of Neuroscience are provided here courtesy of Society for Neuroscience

RESOURCES