Abstract
Cortical circuits contain multiple types of inhibitory neurons which shape how information is processed within neuronal networks. Here, we asked whether somatostatin-expressing (SST) and vasoactive intestinal peptide-expressing (VIP) inhibitory neurons have distinct effects on population neuronal responses to noise bursts of varying intensities. We optogenetically stimulated SST or VIP neurons while simultaneously measuring the calcium responses of populations of hundreds of neurons in the auditory cortex of male and female awake, head-fixed mice to sounds. Upon SST neuronal activation, noise bursts representations became more discrete for different intensity levels, relying on cell identity rather than strength. By contrast, upon VIP neuronal activation, noise bursts of different intensity level activated overlapping neuronal populations, albeit at different response strengths. At the single-cell level, SST and VIP neuronal activation differentially activated the response-level curves of monotonic and nonmonotonic neurons. SST neuronal activation effects were consistent with a shift of the neuronal population responses toward a more localist code with different cells responding to sounds of different intensity. By contrast, VIP neuronal activation shifted responses towards a more distributed code, in which sounds of different intensity level are encoded in the relative response of similar populations of cells. These results delineate how distinct inhibitory neurons in the auditory cortex dynamically control cortical population codes. Different inhibitory neuronal populations may be recruited under different behavioral demands, depending on whether categorical or invariant representations are advantageous for the task.
INTRODUCTION
Sensory cortical neuronal networks are comprised of multiple subtypes of neurons, including excitatory and inhibitory neurons. Inhibitory neurons can be further divided into multiple sub-classes, including somatostatin-expressing (SST) and vasoactive intestinal peptide-expressing (VIP) neurons, which mutually inhibit each other (Campagnola et al., 2022). The activity of these neurons modulates stimulus representations in auditory cortex (AC). Specifically, activating SST neurons reduces and decorrelates cortical activity (Chen et al., 2015), sharpens frequency tuning of AC neurons (Phillips and Hasenstaub, 2016) and contributes to surround suppression (Lakunina et al., 2020) and adaptation to stimulus context (Natan et al., 2015, 2017). By contrast, VIP neurons disinhibit excitatory neurons (Millman et al., 2020), largely via their projections onto SST neurons (Pfeffer et al., 2013; Pi et al., 2013), without affecting frequency tuning (Bigelow et al., 2019) and can enable high-excitability states in the cortex (Jackson et al., 2016). Both SST and VIP neurons can be modulated by noradrenergic and cholinergic inputs (Kawaguchi and Shindou, 1998; Fanselow et al., 2008; Chen et al., 2015) for SST neurons, and multiple neuromodulators for VIP neurons (Fu et al., 2014; Zhang et al., 2014; Chen et al., 2015) and therefore may serve differential modulatory functions in the context of different behavioral demands.
Sound pressure level representation supports sound detection, hearing in noise, source localization and distance to target calculation (Litovsky and Clifton, 1992). Most neurons in AC respond selectively to sounds at different sound pressure levels, either in a monotonic or a non-monotonic fashion. Monotonic neurons increase their firing rate with sound intensity, differing in their threshold and slope of the response functions. Non-monotonic neurons exhibit preference for specific sound pressure level ranges, differing in their preferred sound pressure level (Zhang et al., 2013). Previous work found that a mix of monotonic and non-monotonic neurons in the auditory cortex is important for sound encoding (Sun et al., 2017). Because the excitatory neurons in the cortex form tightly connected circuits with inhibitory neurons, inhibitory neuronal activity can shift the sound level response functions of excitatory neurons across monotonic and non-monotonic neurons. These changes can in turn affect the representation of sound pressure level by cortical populations. Whereas multiple studies have examined the effects of SST and VIP neuronal modulation on sound responses in individual neurons in AC(Natan et al., 2015, 2017; Seybold et al., 2015; Phillips and Hasenstaub, 2016; Bigelow et al., 2019; Millman et al., 2020; Seay et al., 2020), their effect on population representation of sound pressure level remains to be fully understood.
Here, we studied whether and how cortical inhibitory neurons control the representation of sound pressure levels in populations of neurons, by presenting periodic noise bursts at different sound pressure levels to awake head-fixed mice and imaging Calcium responses in populations of hundreds of neurons while simultaneously activating SST or VIP neurons optogenetically (Figure 1B, Figure 2). First, we tested whether and how activation of SST or VIP neurons differentially modulated sound pressure level responses at the level of individual neuronal response functions in AC. Next, we tested how the representation of sound pressure level changed in the neuronal space with and without SST and VIP neuronal activation. We tested for the effects of SST and VIP neuronal activation on response sparseness and separation angle of population response vectors. Finally, we tested whether and how changes in the response-level curves of monotonic and nonmonotonic individual neurons upon SST or VIP neuronal activation mediated the changes in representation at the population for monotonic and nonmonotonic neurons. Our results suggest that SST and VIP neuronal activation differentially affect both monotonic and non-monotonic neuronal sound pressure level response functions, thereby shifting the neuronal population codes between localist and distributed representations.
METHODS
Animals
We performed experiments in fourteen adult mice (7 males and 7 females), which were crosses between Cdh23 mice (B6.CAST-Cdh23Ahl+/Kjn, JAX: 002756) and Sst-Cre mice (Ssttm2.1(cre)Zjh/J, JAX: 013044; n=5 in experimental group) or Vip-IRES-Cre mice (Viptm1(cre)Zjh/J, JAX: 010908; n=4 in experimental group, n=5 in control group) (Table 1). Mice had access to food and water ad libitum and were exposed to light/dark on a reversed 12h cycle at 28°C. Experiments were performed during the animals’ dark cycle. Mice were housed individually after the cranial window implant. All experimental procedures were in accordance with NIH guidelines and approved by the Institutional Animal Care and Use Committee at the University of Pennsylvania.
Table 1.
Experiment | Figures | Strain | Number of mice | Number of recordings |
---|---|---|---|---|
GCaMP7f + ChrimsonR | Figs 1–5 | CDH23 x VIP-Cre | 2 | 7 |
CDH23 x SST-Cre | 5 | 13 | ||
GCaMP6m + ChrimsonR | Figs 1–5 | CDH23 x VIP-Cre | 2 | 9 |
Control: GCaMP7f + Flex.tdTomato – VIP cells | Fig 1H | CDH23 x VIP-Cre | 4 | 7 |
Control: GCaMP7f + Flex.tdTomato – Non-VIP cells | Fig 1I | CDH23 x VIP-Cre | 2 | 4 |
Surgery procedures
Mice were implanted with cranial windows over Auditory Cortex following a published procedure (Wood et al., 2022). Briefly, mice were anesthetized with 1.5–3% isoflurane and the left side of the skull was exposed and perforated by a 3mm biopsy punch over the left Auditory Cortex. We injected in that region 3×750nL of an adeno-associated virus (AAV) mix of AAV1.Syn.GCaMP (6m: Addgene 100841 or 7f: Addgene 104488; dilution 1:10 ~ 1×1013 GC/mL) and AAV1.Syn.Flex.Chrimson.tdTomato (UNC Vector Core; dilution 1:2 ~ 2×1012 GC/mL). In the control mice, we injected a mix of AAV1.Syn.jGCaMP7f (Addgene 104488; dilution 1:10 ~ 1×1013 GC/mL) and AAV1.Syn.Flex.tdTomato (Addgene 28306; dilution 1:100–1:20 ~ 2×1011 – 1×1012 GC/mL) in VIP-Cre mice. We then sealed the craniotomy with a glass round window, attached a head plate to the mouse and let it recover for 3–4 weeks. After habituating the mouse to being head fixed for 3 days, we mapped the sound-responsive areas of the brain and located Auditory Cortex using wide field imaging, then performed two-photon imaging in Auditory Cortex (Figure 2C).
Two-photon imaging
We imaged calcium activity in neurons in layer 2/3 of Auditory Cortex of awake, head-fixed mice (VIP-Cre mice: 3321 neurons over 16 recordings, SST-Cre mice: 2284 neurons over 13 recordings) using the two-photon microscope (Ultima in vivo multiphoton microscope, Bruker) with a laser at 940nm (Chameleon Ti-Sapphire). The fluorescence from the tissue went through a Primary Dichroic long pass (620 LP), through an IR Blocker (625 SP), through an Emission Dichroic Long pass (565 LP) which separated the light in two beams. The shorter wavelengths went through an additional bandpass filter (525/70) before being captured by a PMT (“green channel”); the longer wavelengths went through a bandpass filter (595/50) before being captured by a PMT (“red channel”). This set up was used to minimize the contamination of the green channel by the optogenetic stimulus at 635nm. There was nevertheless some bleedthrough during the optogenetic stimulus which was small enough not to saturate the green channel, and thus the activity of neurons could be recorded continuously without interruption during optogenetic stimulation. We checked there was no saturation for the highest laser power used by plotting the average grayscale profile over the dimension of the image perpendicular to the scanning, and verifying it was well below saturation. During a 5-ms laser pulse, while the whole field of view is illuminated by the laser, it appears on the image only for the lines that were being scanned during the laser stimulus. These bands were identified using the average grayscale value of each image line, and the contaminated pixels inside these bands were removed before processing the recordings with Suite2p. We imaged a surface of 512×512 pixels2 at 30Hz. If we recorded from a mouse several times, we changed the location or depth within layer 2/3 of Auditory Cortex in order to not image the same neurons twice.
Optogenetic laser: Power calibration
We first calibrated the laser power by measuring the curve of command voltage versus output power for the laser (Optoengine LLC, MRL-III-635-300mW). The laser’s peak frequency was 635nm. Prior to every recording, we calibrated the laser power at the tissue level as follows: we used an empty cannula to reduce the power of the laser by a factor 10–15 and positioned the optical fiber on the objective so it would shine a spot of 1mm diameter centered on the focal point of the objective. Thus, the calibrated power at the imaging plane was for the medium laser power: 0.3 ± 0.09 mW/mm2 (mean ± std, n=29; range: 0.14–0.47 mW/mm2) and for the high laser power: 3.4 ± 1.0 mW/mm2 (mean ± std, n=29; range: 1.6–5.3 mW/mm2).
Identification of interneurons being stimulated
We started each recording by taking a 2600 frame video both in the green and the red channels (thus imaging GCaMP and tdTomato). As tdTomato is not dependent on the cell activity, any modulation in the signal in the red channel is due to bleedthrough from the GCaMP. We plotted for all cells the raw signal from the red channel versus the signal from the green channel and did a linear fit to extract the bleedthrough coefficient. We then subtracted the bleedthrough in the red signal and calculated the average fluorescence of the processed red signal for every cell. We then z-scored the signal of the red channel to the background fluorescence and selected the cells with a fluorescence higher than 2 σ (standard deviation of the background) as the targeted interneurons. The percentage of cells labeled as VIP or SST interneurons with this criterion was consistent with the percentage of VIP or SST neurons expected within cortex (Rudy et al., 2011).
Stimulus presentation
We presented combinations of sound and optogenetic stimuli. The auditory stimulus consisted in 1-s long click trains of 25-ms pulses of broadband white noise (range 3−80 kHz) at 10Hz, at 7 sound pressure levels within 0–90 dB SPL (0; 30; 50; 60; 70; 80; 90 dB SPL). The optogenetic stimulus consisted in a 1-s long pulse train of 635nm laser with 5-ms pulses at 20 Hz, at 3 amplitudes with no, medium or high laser power (see power at tissue level in section Optogenetic laser: Power calibration). The two stimuli were presented simultaneously, with the optogenetic stimulus preceding the sound stimulus by 20 ms (Blackwell et al., 2020) for maximal optogenetic effect, the inter-stimulus interval was 5 s. All 21 combinations of sound and optogenetic stimuli were presented randomly and with 10 repeats per combination.
Analysis of single-cell activity: Optimal time window versus fixed time window
For each trial of a stimulus, the response was defined as the mean over the baseline one-second window fluorescence preceding the stimulus:, with the fluorescence of the cell. Optimal window: In order to compute the best responses across different neurons, which may respond with different time courses of calcium signals, we defined the optimal window of neuronal response of each neuron for each stimulus combination as the one-second window for which the average response most reliably differs from the baseline activity. The optimal time window was selected as the 1-s averaging window which maximized the sensitivity index (d’) between [0–1s] and [4–5s]. The optimal time window was only used in Figure 2, panels A,D,E,G,H,K,L,N. Fixed window: In order to compare how neuronal responses changed with laser stimulation, keeping all parameters similar besides that one, we defined the fixed window of neuronal response for each recording (one window for all stimulus combinations) as the one-second window with the largest number of responsive neurons. The fixed time window was selected as the 1-s averaging window which maximized the number of cells with a significant response to at least one of the stimuli pairs for each recording compared to the pre-stimulus fluorescence (paired t-test, p<0.01 with multiple comparison correction). The delay between the beginning of the stimulus and the beginning of the fixed window was in SST-Cre mice: 385 ± 419 ms (mean ± std, n=13) and in VIP-Cre mice: 336 ± 10 ms (mean ± std, n=16).
Our methods for selecting the optimal window gives results similar to a fixed [0 1]s window (Wood et al., 2022) (Extended Figure 2.1, Extended Table 2.1) or with our computation on a fixed window as described above ( Extended Figure 2.2, Extended Table 2.2), with improvements which we believe have a better chance at capturing the effects of SST or VIP neuronal activation: by allowing for a delay in the beginning of the fixed window, our analysis leads to response-level fits that are closer to the maximum change in fluorescence for most cells.
Sparseness of a neuron’s response and activity sparseness of a population of neurons
To quantify how many stimuli a neuron responds to, we calculated the sparseness of each neuron adapted from (Vinje and Gallant, 2000):
where is the average response of a neuron to the sound pressure level at a given laser activation calculated over its optimal time window minus the neuron’s response to laser activation and silence, and is the number of sound pressure levels (response in silence excluded). Similar to how this measure is computed with the firing rate of excited neurons (Rolls and Tovee, 1995; Vinje and Gallant, 2000; Olsen and Wilson, 2008; Feigin et al., 2021), we adapted the sparseness measure to fluorescence data by setting any values of to zero before calculating the sparseness, and by calculating this measure only for neurons with at least one positive , for which the sparseness is well-defined (at least one positive ). A sparseness value of 0% indicates that a neuron’s responses to all sound pressure levels are equal, and a sparseness value of 100% indicates that a neuron only responds to one sound pressure level.
To quantify how many neurons in a population are active in response to a given stimulus, we calculated the activity sparseness of each population (Willmore and Tolhurst, 2001) at a given sound level pressure and laser power (Figure 2E and L) as the ratio of neurons that had an increase in response above threshold from silence at the same laser power. The activity sparseness from silence and no laser was computed by subtracting each neuron’s response over its optimal window to its response at 0dB and no laser power, and computing the ratio of neurons with an increase in response above threshold (Figure 2G and N). The threshold was set as the standard deviation of the population’s response to no sound and no laser power and the response was calculated over each neuron’s optimal time window. An activity sparseness value of 0% indicates that all neurons in a population are active, and a value of 100% indicates that none of the neurons are active.
Separation angle and Vector length
To quantify whether mean population vectors were collinear in the neuronal space, we calculated the separation angle between mean population vectors adapted from (Vinje and Gallant, 2000). For each recording, we computed the mean population vectors over the fixed window time at each laser power from 0dB to each non-zero sound pressure level (Figure 3A). We then computed the angle between each pair of mean population vectors at a given laser power (Figure 3B), and represented the mean ± s.e.m (Figure 3D–E and G–F) or the mean difference in separation angle from a given laser power to no laser power (Figure 3F and I) across recordings for each sound pair. To quantify whether mean population responses were close in the neuronal space, we computed the vector length between mean population vectors (Figure 3C). For each recording, we computed the mean population vectors over the fixed window time at each laser power between all pairs of sound pressure levels (Figure 3A). We then computed the Euclidian norm of each mean population vector at a given laser power (Figure 3C), and represented the mean ± s.e.m (Figure 3J–K and M–N) or the mean difference in vector length from a given laser power to no laser power (Figure 3L and O) across recordings for each sound pair.
Fitting of response-level curves
Mean response curves and the standard error of the mean (s.e.m) for every neuron were determined by averaging over the fixed-time window its responses to all 10 trials of each sound pressure level. Thus, for a given cell we constructed three response curves, one for every light condition.
To characterize responses as monotonic or nonmonotonic, we first normalized the response curves such that abs(max(response)) and computed the monotonicity index (MI). This metric refers to the relative responses at higher stimulus levels (Watkins and Barbour, 2011) and was calculated from the mean curve as
(1) |
where is the response to 90 dB which is the highest level of sound presented and is the spontaneous response measured at 0 dB. A response curve was classified as nonmonotonic if its was less than 0.3 and monotonic if it was greater than 0.7. We refrain from a hard cutoff at 0.5 since preliminary analysis of the response curves indicated that due to stochasticity both monotonic and nonmonotonic curves may have values between 0.3 and 0.7. Furthermore, note that a given cell could change its monotonicity in the presence of optogenetic stimulation.
After determining the monotonicity of the neuronal response, we fitted the monotonic and nonmonotonic curves with a 4-parameter sigmoid function and a 4-parameter Gaussian function, respectively. The sigmoid function is given by the equation,
(2) |
while the Gaussian function can be written as,
(3) |
refers to the response curve, is the offset response, is its range in amplitude, is the x value of the sigmoid midpoint and denotes the width of the sigmoid. In the Gaussian, the parameter is the offset response. The amplitude, mean and standard deviation of the Gaussian are denoted by , and , respectively, and have their regular interpretations. During the fitting procedure, we minimize (1 – McFadden pseudo R-squared) using the Powell optimizer (scipy.minimize.optimize in Python). The formula for this error value is,
(4) |
Assuming is gaussian with the experimentally computed response average as its mean value and the response s.e.m as its standard deviation, we can rewrite the formula as,
(5) |
We chose the McFadden R-squared since it allows us to account for different values of s.e.m at the different intensities. The regular R-squared equation constrains the s.e.m values to be equal at all intensities. A cell’s response curve is considered well fit by its respective function if the Mcfadden is greater than 0.8. Due to the nonlinear nature of our optimization we chose 16 random starting points for the optimizer and cells fitted using 2 or more of the starting conditions were characterized using the fitting curve which had the highest value. For neurons whose mean response curve lay between 0.3 and 0.7 we follow a similar procedure but fit the curve with both the sigmoid and Gaussian functions to find the better fitting function. Furthermore, we constrain the mean of all our Gaussian fits to lie between 10 and 80 dB. Response curves with means less than 10 dB or greater than 80 dB were recharacterized using the sigmoid function since only one sound pressure level data point (0 dB or 90 dB) is insufficient to adequately distinguish if the cell is monotonic or nonmonotonic. Roughly ~5% of the total response curves (combined across all three light conditions) were refitted in this manner. Lastly, we tested if the fitting curve was overfitted to the empirical sound intensities by calculating a new variable – interpolated error. The interpolated sigmoid/Gaussian curve is constructed by interpolating the fitted curve at the intermediate sound pressure levels – (15, 40, 55, 65, 75, 85) dB. The interpolated error is the regular R-squared value evaluated using the equation,
(6) |
refers to the Gaussian/sigmoid equations (2, 3) computed at the sound pressure levels 0, 15, 30, 40, 50, 55, 60, 65, 70, 75, 80, 85, and 90 dB. When computing statistics on different parameters we remove neurons with interpolated error less than 0.25 for both the Sigmoid and Gaussian fits. This threshold allows us to select at least 90% of the fitted curves.
Decoding sound pressure level using an SVM Decoder
We linearly decoded the 7 different sound pressure levels at each opto-stimulated condition using an SVM decoder with a linear kernel. Specifically, we decoded each individual pressure level versus the remaining six. Input to the SVM consisted of the fixed time-window responses of all neurons in the population.
To individually decode each of the 7 amplitudes, for every given experimental dataset we projected the average responses of all neurons derived using a fixed window onto a lower-dimensional space using PCA. The lower-dimensional space had (n) dimensions such that these dimensions accounted for 70% of the variance in the dataset.
Next, because at a given opto-stimulated condition we had 10 trials per sound pressure level, the input data to the SVM decoder was unbalanced as 10:60. We balanced the dataset by oversampling the sound pressure level of interest, i.e., if we were decoding the 0dB stimulus from the rest, we oversampled to construct 60 trials for the 0dB stimulus. Specifically, we constructed these 60 trials by fitting the 10 experimentally obtained 0dB trials using a Gaussian kernel and sampling the corresponding distribution. The resulting oversampled dataset comprising 120 trials total was input into the two-class SVM decoder which was trained using 10-fold cross-validation. Because we were not tuning the hyperparameters of the SVM, we did not have separate validation and test sets, rather we computed the model’s accuracy directly using the validation set which was chosen randomly for each of the 10 iterations. Figure panels 2F and 2M illustrate the decoding accuracies for all 3 conditions of opto-stimulation of SST and VIP interneurons.
Statistics
All responses are plotted as mean ± s.e.m (standard error of the mean) with the number of measurements above the figure. We tested significance with a Generalized Linear Mixed Effects (GLME) Model with the matlab function fitglme, using laser power, sound pressure level and the interaction term between laser power and sound pressure level as fixed-effect terms and cell identity or session number as grouping variables. All results from the statistical analyses are reported in Table 2.
Table 2. Statistics Table.
Comparison | Figure | N | Test | Test Statistic | p-value | Effect size |
---|---|---|---|---|---|---|
FIGURE 1 | ||||||
SST neuron with SST activation | Fig 1D | 10 repeats | GLME | tlaser=7.89 tsound=0.34 tlaser:sound=−0.55 DF = 206 |
***plaser=1.8e-13 psound=0.74 plaser:sound=0.58 |
ηlaser2 =0.58 ηsound2=3.8e-3 ηlaser:sound2=1.5e-2 |
Sound-increasing neuron with SST activation | Fig 1E | 10 repeats | GLME | tlaser=0.33 tsound=12.37 tlaser:sound=−8.34 DF = 206 |
plaser=0.74 ***psound=1.2e-26 ***plaser:sound=1.0e-14 |
ηlaser2=2.3e-3 ηsound2=0.84 ηlaser:sound2=0.78 |
VIP neuron with VIP activation | Fig 1F | 10 repeats | GLME | tlaser=5.40 tsound=0.93 tlaser:sound=−2.56 DF = 206 |
***plaser=1.8e-7 psound=0.35 *plaser:sound=1.1e-2 |
ηlaser2=0.39 ηsound2=2.8e-2 ηlaser:sound2=0.25 |
Sound-increasing neuron with VIP activation | Fig 1G | 10 repeats | GLME | tlaser=2.45 tsound=3.06 tlaser:sound=1.11 DF = 206 |
*plaser=1.5e-2 **psound=2.5e-3 plaser:sound=0.27 |
ηlaser2=0.12 ηsound2=0.24 ηlaser:sound2=5.8e-2 |
Control: VIP neurons with laser activation | Fig 1H | 54 cells | GLME | tlaser=1.27 tsound=2.99 tlaser:sound=−0.11 DF = 11336 |
plaser=0.20 **psound=2.8e-3 plaser:sound=0.91 |
ηlaser2=6.5e-4 ηsound2=5.5e-3 ηlaser:sound2=1.1e-5 |
Control: All neurons (VIP excluded) with laser activation | Fig 1I | 492 cells | GLME | tlaser=0.46 tsound=12.23 tlaser:sound=3.27 DF = 103316 |
plaser=0.64 ***psound=2.1e-34 **plaser:sound=1.1e-3 |
ηlaser2=8.5e-6 ηsound2=9.0e-3 ηlaser:sound2=9.8e-4 |
FIGURE 2 | ||||||
SST neurons with SST activation | Fig 2B | 132 cells | GLME | tlaser=36.91 tsound=1.32 tlaser:sound=0.16 DF = 27716 |
***plaser=3.1e-291 psound=0.19 plaser:sound=0.88 |
ηlaser2=0.14 ηsound2=3.2e-4 ηlaser:sound2=6.7e-6 |
All non-SST neurons with SST activation | Fig 2C | 2152 cells | GLME | tlaser=−1.27 tsound=10.75 tlaser:sound=−6.35 DF = 451916 |
plaser=0.20 ***psound=5.9e-27 ***plaser:sound=2.2e-10 |
ηlaser2=1.5e-5 ηsound2=1.6e-3 ηlaser:sound2=8.5e-4 |
Sparseness with SST activation | Fig 2D | None: 2059 Med: 1984 High: 1989 |
GLME | tlaser=5.44 DF = 5680 |
***p laser =5.5e-8 | ηlaser2=4.9e-3 |
Activity sparseness with SST activation | Fig 2E | 13 populations | GLME | tlaser=3.20 tsound=0.69 tlaser:sound=−0.99 DF = 230 |
**plaser=1.6e-3 psound=0.49 plaser:sound=0.33 |
ηlaser2=0.26 ηsound2=9.8e-3 ηlaser:sound2=4.8e-2 |
SST: Decoding accuracy of a linear SVM decoder with laser activation | Fig 2F | 13 populations | GLME | tlaser=−3.92 tsound=−2.84 tlaser:sound=1.92 DF = 269 |
***plaser=1.1e-4 **psound=4.9e-3 plaser:sound=5.6e-2 |
ηlaser2=0.19 ηsound2=0.16 ηlaser:sound2=0.11 |
Activity sparseness from 0dB and no laser power with SST activation | Fig 2 | 13 populations | GLME | tlaser=1.60 tsound=1.17 tlaser:sound=−1.54 DF = 230 |
plaser=0.11 psound=0.24 plaser:sound=0.12 |
ηlaser2=5.5e-2 ηsound2=1.9e-2 ηlaser:sound2=7.7e-2 |
VIP neurons with VIP activation | Fig 2I | 226 cells | GLME | tlaser=41.50 tsound=2.71 tlaser:sound=−1.96 DF = 47456 |
***p
laser
=0
**p sound =6.7e-3 *p laser:sound =4.95e-2 |
ηlaser2=0.12 ηsound2=9.3e-4 ηlaser:sound2=7.3e-4 |
All non-VIP neurons with VIP activation | Fig 2J | 3095 cells | GLME | tlaser=34.18 tsound=11.32 tlaser:sound=8.41 DF = 649946 |
***p
laser
=7.8e-256
***p sound =1.1e-29 ***p laser:sound =4.1e-17 |
ηlaser2=6.0e-3 ηsound2=1.0e-3 ηlaser:sound2=8.4e-4 |
Sparseness with VIP activation | Fig 2K | None: 2996 Med: 2883 High: 2980 |
GLME | tlaser=−3.44 DF = 8268 |
***p laser =5.8e-4 | ηlaser2=1.2e-3 |
Activity sparseness with VIP activation | Fig 2L | 16 populations | GLME | tlaser=0.78 tsound=−0.01 tlaser:sound=−0.49 DF = 284 |
plaser=0.44 psound=0.99 plaser:sound=0.62 |
ηlaser2=1.3e-2 ηsound2=1.8e-6 ηlaser:sound2=8.2e-3 |
VIP: Decoding accuracy of a linear SVM decoder with laser activation | Fig 2M | 13 populations | GLME | tlaser=−0.69 tsound=−3.58 tlaser:sound=−0.11 DF = 332 |
plaser=0.49 ***psound=4.0e-4 plaser:sound=0.91 |
ηlaser2=4.3e-3 ηsound2=0.15 ηlaser:sound2=2.7e-4 |
Activity sparseness from 0dB and no laser power with VIP activation | Fig 2N | 16 populations | GLME | tlaser=−3.52 tsound=−0.49 tlaser:sound=0.0059 DF = 284 |
***plaser=4.9e-4 psound=0.62 plaser:sound=0.995 |
ηlaser2=0.15r ηsound2=2.1e-3 ηlaser:sound2=7.6e-7 |
FIGURE 3 | ||||||
Separation angle from 0dB at each laser power – SST activation | Fig 3E | 15 angles, 13 recordings | GLME | tlaser=8.80 tΔsound=12.37 tlaser:Δsound=−7.44 DF = 581 |
***p
laser
=1.6e-17
***p Δsound =2.3e-31 ***p laser:Δsound =3.6e-13 |
ηlaser2=0.30 ηΔsound2=0.58 ηlaser:Δsound2=0.43 |
Separation angle from 0dB at each laser power – VIP activation | Fig 3H | 15 angles, 16 recordings | GLME | tlaser=−2.75 tΔsound=7.32 tlaser:Δsound=0.73 DF = 716 |
**plaser=6.1e-3 ***pΔsound=6.9e-13 plaser:Δsound=0.47 |
ηlaser2=3.0e-2 ηΔsound2=0.26 ηlaser:Δsound2=5.2e-3 |
Vector length at each laser power – SST activation | Fig 3K | 21 lengths, 13 recordings | GLME | tlaser=−4.71 tΔsound=5.71 tlaser:Δsound=−3.57 DF = 815 |
***p
laser
=2.9e-6
***p Δsound =1.6e-8 ***p laser:Δsound =3.8e-4 |
ηlaser2=6.1e-2 ηΔsound2=0.16 ηlaser:Δsound2=9.1e-2 |
Vector length at each laser power – VIP activation | Fig 3N | 21 lengths, 16 recordings | GLME | tlaser=2.50 tΔsound=4.03 tlaser:Δsound=4.84 DF = 1004 |
*p
laser
=1.2e-2
***p Δsound =6.1e-5 ***p laser:Δsound =1.5e-6 |
ηlaser2=9.7e-3 ηΔsound2=4.8e-2 ηlaser:Δsound2=9.0e-2 |
FIGURE 4 | ||||||
Sigmoid fit – offset – with SST activation | Fig 4D | None: 109 Med: 103 High: 64 |
GLME | tlaser=0.83 DF = 274 |
plaser=0.41 | ηlaser2=2.4e-3 |
Sigmoid fit – offset – with VIP activation | Fig 4D | None: 267 Med: 239 High: 269 |
GLME | tlaser=1.74 DF = 773 |
plaser=8.1e-2 | ηlaser2=3.6e-3 |
Sigmoid fit – range – with SST activation | Fig 4E | None: 109 Med: 103 High: 64 |
GLME | tlaser=−1.24 DF = 274 |
plaser=0.22 | ηlaser2=3.1e-3 |
Sigmoid fit – range – with VIP activation | Fig 4E | None: 267 Med: 239 High: 269 |
GLME | tlaser=3.11 DF = 773 |
**p laser =1.9e-3 | ηlaser2=9.4e-3 |
Sigmoid fit – midpoint – with SST activation | Fig 4F | None: 109 Med: 103 High: 64 |
GLME | tlaser=2.65 DF = 274 |
**p laser =8.6e-3 | ηlaser2=2.1e-2 |
Sigmoid fit – midpoint – with VIP activation | Fig 4F | None: 267 Med: 239 High: 269 |
GLME | tlaser=−0.88 DF = 773 |
plaser=0.38 | ηlaser2=5.9e-4 |
Sigmoid fit – width – with SST activation | Fig 4G | None: 109 Med: 103 High: 64 |
GLME | tlaser=−0.019 DF = 274 |
plaser=0.99 | ηlaser2=8.8e-7 |
Sigmoid fit – width – with VIP activation | Fig 4G | None: 267 Med: 239 High: 269 |
GLME | tlaser=0.56 DF = 773 |
plaser=0.57 | ηlaser2=2.2e-4 |
FIGURE 5 | ||||||
Gaussian fit – offset – with SST activation | Fig 5D | None: 224 Med: 175 High: 130 |
GLME | tlaser=−3.16 DF = 527 |
**p laser =1.7e-3 | ηlaser2=1.8e-2 |
Gaussian fit – offset – with VIP activation | Fig 5D | None: 243 Med: 278 High: 310 |
GLME | tlaser=0.71 DF = 829 |
plaser=0.48 | ηlaser2=5.3e-4 |
Gaussian fit – range – with SST activation | Fig 5E | None: 224 Med: 175 High: 130 |
GLME | tlaser=−5.41 DF = 527 |
***p laser =9.4e-8 | ηlaser2=3.8e-2 |
Gaussian fit – range – with VIP activation | Fig 5E | None: 243 Med: 278 High: 310 |
GLME | tlaser=5.60 DF = 829 |
***p laser =2.9e-8 | ηlaser2=1.7e-2 |
Gaussian fit – mean – with SST activation | Fig 5F | None: 224 Med: 175 High: 130 |
GLME | tlaser=0.35 DF = 527 |
plaser=0.73 | ηlaser2=1.9e-4 |
Gaussian fit – mean – with VIP activation | Fig 5F | None: 243 Med: 278 High: 310 |
GLME | tlaser=3.34 DF = 829 |
***p laser =8.6e-4 | ηlaser2=7.4e-3 |
Gaussian fit – standard deviation – with SST activation | Fig 5G | None: 224 Med: 175 High: 130 |
GLME | tlaser=−0.63 DF = 527 |
plaser=0.53 | ηlaser2=7.3e-4 |
Gaussian fit – standard deviation – with VIP activation | Fig 5G | None: 243 Med: 278 High: 310 |
GLME | tlaser=3.96 DF = 829 |
***p laser =8.1e-5 | ηlaser2=1.7e-2 |
Data availability
The data and code are available on the dryad depository: https://doi.org/10.5061/dryad.t1g1jwt6d (Melanie Tobin et al., n.d.).
RESULTS
SST and VIP neuronal activation modulate the response of sound-increasing neurons
To investigate whether and how distinct classes of inhibitory neurons in AC, SST and VIP neurons, affect sound representation at population level, we imaged Calcium activity of neurons in AC of awake, head-fixed mice presented with sounds while activating SST or VIP neurons using sub-millisecond optogenetic manipulation with Chrimson (Figure 1A) (Klapoetke et al., 2014). We monitored calcium activity by measuring fluorescence of GCaMP expressed in hundreds of neurons at a time (VIP-Cre mice: 3321 neurons over 16 recordings, SST-Cre mice: 2284 neurons over 13 recordings) and identified the cells expressing the opsin through co-expression of tdTomato (Figure 1B and 1C). This approach allowed us to quantify the transformations of sound representations within a large population of cortical neurons driven by SST or VIP neuronal activation.
We first confirmed that the optogenetic manipulations produced expected responses. SST neurons directly inhibit excitatory cells and other cells within the neuronal population, and so we expected that optogenetic manipulation would increase SST neuronal activity, but decrease the responses of other cells. By contrast, VIP neurons mostly inhibit other inhibitory neurons (Campagnola et al., 2022), and therefore we expected the VIP neuronal activation would increase both VIP neuronal activity and provide a release of inhibition to other cells in the network. In SST-Cre mice, a representative SST neuron increased activity at all sound pressure levels with laser power (example neuron, ***plaser=1.8e-13, GLME, Figure 1D). The change in the response of a representative non-SST neuron to SST neuronal activation was sound level-dependent, with a decrease at most sound pressure levels for the medium laser power. Sound responses were abolished with strong SST neuronal activation at the high laser power (plaser=0.74, ***plaser:sound=1.0e-14, GLME, Figure 1E). In VIP-Cre mice, the response of a representative VIP neuron increased at all sound pressure levels with laser power and the increase was sound-level dependent (example neuron, ***plaser=1.8e-7, *plaser:sound=1.1e-2, GLME, Figure 1F). The activity of a representative non-VIP neuron increased at most sound pressure levels during activation of VIP neurons, with a larger increase at the high than medium laser power (*plaser=1.5e-2, GLME, Figure 1G). As a control, we injected mice with Flex.tdTomato instead of the opsin in VIP-Cre x Cdh23 mice (n=5), and verified that laser stimulation of AC in the absence of the opsin did not lead to significant changes in the neuronal responses of VIP neurons (plaser=0.20, GLME, Figure 1H) and non-VIP neurons (plaser=0.64, GLME, Figure 1I). These representative effects of SST or VIP neuronal activation are consistent with previous reports, suggesting that the activation method worked as expected(Natan et al., 2015; Seybold et al., 2015; Phillips and Hasenstaub, 2016; Bigelow et al., 2019).
Differential distribution of population activity and sparseness with SST and VIP neuronal activation
We next tested whether and how SST and VIP neuronal activation differentially affects the sound pressure level response functions of neurons in AC. We characterized the response of each cell over its optimal time window for each sound and laser combination (Figure 2 panels A and H; see Methods) and the response of each population to any stimulus combination over its fixed time window (Figure 2 panels B–C and I–J, see Methods).
Because SST neurons directly inhibit excitatory cells, we hypothesized that SST neuronal activation would lead to fewer neurons responding at increasing sound pressure levels. At baseline, SST neurons have on average lower responses than the non-SST neurons at low sound pressure levels, and a stronger average response above 70dB (Figure 2B–C). During the SST neuronal activation, the average response of SST neurons increased at all sound pressure levels (n=132 SST neurons, ***plaser<1e-100, GLME, Figure 2A, solid lines Figure 2B), whereas non-SST neurons exhibited a mix of decreased and increased responses (Figure 2A). At medium and high laser, the overall shape of the average response-level curve is preserved for SST neurons (plaser:sound=0.88, GLME, Figure 2B). The overall effect of SST neuronal activation on the population of non-SST neurons had a significant interaction between sound and laser amplitude (dashed lines, Figure 2C), but no significant sound-independent laser effect (solid and dashed lines, n=2152 neurons, plaser=0.20, ***plaser:sound=2.2e-10, GLME, Figure 2C). The average response-level curve for non-SST neurons shifted downwards for the medium laser power, and at a high laser power, the modulation of the population’s response by sound was lost with an average response to silence equal to the average response to sounds, consistent with expectations for a sparser, more localist population code (Polley et al., 2004).
To further assess the representation of sound pressure level in the neuronal population, we studied two characteristics of sparse distributed representations: (1) each neuron responds only to a few stimuli (high sparseness) and (2) only a few neurons respond to each stimulus (high activity sparseness). To measure how many stimuli a neuron responds to, we computed the sparseness of each non-SST neuron (see Methods). The sparseness of non-SST neurons with a positive sound response increased significantly from 62% to 69% upon SST neuronal activation (median, n=2059, 1984, and 1989 neurons for no, medium and high laser power, respectively; ***plaser = 5.5e-8, GLME, Figure 2D), indicating that neurons responded to fewer stimuli. To measure how many neurons responded to each stimulus, we computed the activity sparseness for each population of non-SST neurons, which is the ratio of neurons that are not active in response to a given stimulus, compared to silence at a given laser power. The activity sparseness increased significantly upon SST neuronal activation (n=13 populations, **plaser=1.6e-3, GLME, Figure 2E), indicating that at each successive sound pressure level, there were fewer non-SST neurons that were active. However, whereas decoding accuracy differed across different sound pressure levels, and was slightly lower with SST neuronal inactivation, there was no interaction between the laser and sound amplitudes (n=13 populations, psound=0.0049, plaser =1.11e-4, plaser:sound=0.056, GLME, Figure 2F). This suggests that despite the neuronal population responses becoming relatively sparser across sound pressure levels, the relative decoding accuracy was preserved across SPLs. Combined these results point to a more localist representation for sound pressure level with SST neuronal activation.
Because VIP neuron activity provides a release of SST neuron inhibition on excitatory cells, we hypothesized that VIP neuronal activation would lead to more neurons being active in response to each sound level and having stronger responses. At baseline, VIP neurons have a similar average response at 30dB, and a lower average response at sound levels above 30dB compared to the average non-VIP population’s response (Figure 2I–J). When VIP neurons were activated, the average response of VIP neurons increased at all sound pressure levels (n=226 VIP neurons, ***plaser<1e-100, GLME, Figure 2H, solid lines Figure 2I). At high laser, the sound modulation weakens in VIP neurons (*plaser:sound=4.95e-2, GLME, Figure 2I). Non-VIP neurons similarly exhibited an increase in response, both in silence and to sounds at different sound pressure levels (Figure 2H). The average response-level curve for non-VIP neurons shifted upwards at all sound pressure levels as VIP neurons were activated (solid and dashed lines, n=3095 neurons, ***plaser<1e-100, GLME, Figure 2J), reflective of a more distributed stimulus representation. At all laser powers, the modulation of the population’s response by sound was maintained: the population average to sounds was higher than to silence (dashed lines, Figure 2J).
Neuronal sparseness decreased with VIP neuronal activation for all non-VIP neurons with a positive sound response, with a significant decrease from 53% to 50% upon VIP neuronal activation (median, n=2996; 2883 and 2980 neurons for no, medium and high laser power, respectively; ***plaser=5.8e-4, GLME, Figure 2K), indicating that each neuron responded more equally to the different sound pressure levels. The activity sparseness measured from silence at a given laser power did not change upon VIP neuronal activation (n=16 populations, plaser=0.44, GLME, Figure 2L), indicating that the same number of neurons showed an increase in response at each sound pressure level from silence at a given laser power. Consistent with the overall increase in responses with VIP neuronal activation in silence, the activity sparseness measured from silence at no laser power significantly decreased (Figure 2N). Importantly, this change in population activity did not affect the decoding performance (n=13 populations, psound=3.99e-4, plaser=0.49, plaser:sound=0.91, GLME, Figure 2M). Combined, these results suggest that activating VIP neurons transforms population responses to a more distributed representation, while preserving the decoding accuracy.
Overall, SST neuronal activation led to weaker and sparser responses in the population, with neurons responding to fewer stimuli and each stimulus eliciting a response in fewer neurons, shifting the population responses toward a more localist stimulus representation. By contrast VIP neuronal activation leads to a global increase in the neuronal population’s response, along with each neuron responding to more stimuli, leading to a more distributed stimulus representation.
Sound pressure level is represented more discretely or continuously in the neuronal population with SST or VIP neuronal activation, respectively.
There are various ways that a neuronal network can implement a representation of a sensory feature. For example, a distributed code may rely on the magnitude of response of the population of neurons or on the relative response of each cell. To investigate this, we examined next the properties of the representation of sound pressure level upon SST and VIP neuronal activation in the neuronal space. We computed the mean population vector from 0dB at a given laser amplitude to each nonzero sound pressure level at that laser amplitude (Figure 3A), and computed the separation angle between pairs of mean population vectors at the same laser power (Figure 3B), as well as the length of mean population vectors between pairs of sounds (Figure 3C).
The separation angle (Vinje and Gallant, 2000) computes the angle between the mean population vectors to two different sound pressure levels taking 0dB at each laser power as the origin (Figure 3B). A smaller angle indicates the population vectors are more collinear, meaning similar neurons respond to the different stimuli, although perhaps with differing magnitude. A larger angle indicates that there is less overlap between the populations of neurons responding to each stimulus.
When there is no interneuron activation, the separation angle increased with the difference in sound pressure level (Figure 3E and H, black circles), indicating that there is less overlap in the groups of neurons responding to sounds with a large difference in sound pressure level than a small difference in sound pressure level. Upon SST neuronal activation, the curve flattened around 60°, meaning that there was an increase in separation angle for small differences in sound pressure level, and a decrease in separation angle for large differences in sound pressure level (dotted lines correspond to GLME estimates, ***plaser=1.6e-17, ***pΔsound=2.3e-31, ***plaser:Δsound=3.6e-13, GLME, Figure 3D–E). On average, the difference in separation angle between sound pairs from medium or high laser to no laser was positive, at 3.7° ± 1.7° for the high laser power (mean ± s.e.m, 15 angles, Figure 3F). This indicates that the population vectors to the different tested sound pressure levels were more equally distributed in the neuronal space. There is, however, still an overlap between groups of neurons responding to different sound pressure levels, as a fully orthogonal coding of sound pressure level would lead to a 90° separation angle. In contrast, upon VIP neuronal activation, the separation angle decreased equally for all differences in sound pressure level (dotted lines correspond to GLME estimates, **plaser=6.1e-3, ***pΔsound=6.9e-13, plaser:Δsound=0.47, GLME, Figure 3G–H). On average, the difference in separation angle between sound pairs from medium or high laser to no laser was negative, at − 4.6° ± 0.7° degrees at the high laser power (mean ± s.e.m, 15 angles, Figure 3I). This indicates that the population vectors to the different tested sound pressure levels were more collinear, with more overlap between groups of neurons responding to different sound pressure levels with VIP neuronal activation.
The vector length computes the Euclidian norm of the mean population vector between two sound pressure levels at a given laser power (Figure 3C). A small length indicates that the responses to two different stimuli are close in the neuronal space, either due to small magnitudes of response, to small differences in separation angle or both, whereas a large length indicates that there is a large difference in magnitude, in separation angle or both. Therefore, we tested whether SST and VIP neuronal activation differentially affected the vector length.
Upon SST neuronal activation, the vector length decreased for all sound pressure level differences to about 2 a.u. at high laser power, along with a decrease in the slope of the GLME estimate by 81% at high laser power (dotted lines correspond to GLME estimates, ***plaser=2.9e-6, ***pΔsound=1.6e-8, ***plaser:Δsound=3.8e-4, GLME, Figure 3J–K). The average change in length from medium or high laser power to no laser was negative, at −1.00 ± 0.10 a.u. for the high laser power (mean ± s.e.m, 21 lengths, Figure 3L). Upon VIP neuronal activation, the vector length increased for all sound pressure level differences, and the slope of the GLME estimate also increased by 72% at high laser power (dotted lines correspond to GLME estimates, *plaser=1.2e-2, ***pΔsound=6.1e-5, ***plaser:Δsound=1.5e-6, GLME, Figure 3M–N). The average change in length from medium or high laser power to no laser was positive, at 1.27 ± 0.12 a.u. for the high laser power (mean ± s.e.m, 21 lengths, Figure 3O).
Combined with the differences in the separation angle, these results point to emergence of two types of modulations of population codes: Upon SST neuronal activation, the encoding of sound pressure level resembles a localist pattern coding where the magnitude of response is less relevant than the identity of responding cells: the strength of response is reduced and similar for all sound pressure levels, but the population vector angles are more spread out in neuronal space, indicating that there is less overlap between groups of neurons responding to different sound pressure levels. Upon VIP neuronal activation, the encoding of sound pressure level resembles a rate code, which is a type of distributed representation in which the varying strength of the whole population encodes a continuously varying parameter of the stimulus. Therefore, VIP neuronal activation promotes the strength of response more than the identity of responding neurons: there is more overlap between the groups of neurons responding to different sound pressure levels, but the strength of response is increased.
Response-level curves of sound-modulated cells exhibit a narrower response upon SST neuronal activation, and a broader response upon VIP neuronal activation.
We next tested how the shifts in stimulus representation mediated by SST and VIP neurons at the scale of the neuronal population are implemented at the single-cell level by analyzing the changes with SST or VIP neuronal activation in response-level curves of single neurons responding positively to sound. Some AC neurons exhibit increased responses with increased sound pressure levels (monotonic response-level curve) while others are tuned with a peak response to a specific sound pressure level (nonmonotonic response-level curve) (Schreiner et al., 1992; Phillips et al., 1995; Wu et al., 2006). We classified cells depending on their Monotonicity Index (MI, see Methods) and fit response functions of monotonically responding cells with a Sigmoid function (Figure 4 see Methods) and those of non-monotonically responding cells with a Gaussian function (Figure 5, see Methods). We then tested how the parameters of the fits change with interneuron activation.
We first characterized the responses of sound-modulated cells that exhibited a monotonic response-level curve by fitting this curve for individual cells at different levels of laser power (Figure 4A–C). Out of the four sigmoidal fit parameters (Figure 4D–G, middle panels), only the midpoint of the sigmoid fit exhibited a significant change upon SST neuronal activation from 58 dB at no laser power to 73 dB at high laser power (median values, n=109; 103 and 64 neurons for no, medium and high laser power, respectively; offset: plaser=0.41; range: plaser=0.22; midpoint: **plaser=8.6e-3; width: plaser=0.99; GLME, Figure 4D–G, middle panels). With VIP activation, only the range of the sigmoid fit showed a significant increase (n=267, 239 and 269 neurons for no, medium, and high laser power, respectively; offset: plaser=0.081; range: **plaser=1.9e-3; midpoint: plaser=0.38; width: plaser=0.57; GLME, Figure 4D–G, right panels). Among the non-SST neurons fit by a sigmoidal function, two thirds of the cells were fit at a single laser power of SST neuronal activation (n=75, 60 and 40 neurons for no, medium and high laser power, respectively, Figure 4H) and around a tenth of the neurons switched monotonicity with SST neuronal activation (Gaussian fit at other laser powers for n=10, 21 and 8 neurons at no, medium and high laser power, respectively). Among the non-VIP neurons fit by a sigmoidal function, around 40% of the cells were fit at a single laser power of VIP neuronal activation (n=108, 88 and 109 neurons for no, medium and high laser power, respectively, Figure 4I) and around 15% of the neurons switched monotonicity with VIP neuronal activation (Gaussian fit at other laser powers for n=49, 35 and 39 neurons at no, medium and high laser power, respectively). Thus, SST neuronal activation leads to monotonic response-level functions that are shifted rightwards towards higher sound pressure levels, leading to responses to a narrower range of sounds at higher sound pressure levels. VIP activation expanded the neuronal response-level curves upwards, leading to responses to a broader range of sound pressure levels (Figure 4J).
We then characterized the responses of sound-modulated cells that exhibited a nonmonotonic response-level curve by fitting this curve for individual cells at different levels of laser power (Figure 5A–C). Whereas the mean and standard deviation of the Gaussian fit remained unchanged (mean: plaser=0.73; standard deviation: plaser=0.53; GLME, Figure 5F–G, middle panels), the offset and the range of the Gaussian fit decreased with SST neuronal activation (n=224, 175 and 130 neurons for no, medium and high laser power, respectively; offset: **plaser=1.7e-3; range: ***plaser=9.4e-8; GLME, Figure 5D–E, middle panels). The offset of response did not change with VIP neuronal activation (offset: plaser=0.48, GLME, Figure 5D, right panel), whereas the range increased significantly as well as the Gaussian mean and standard deviation (n=243, 278 and 310 neurons for no, medium and high laser power, respectively; range: ***plaser=2.9e-8; mean: ***plaser=8.6e-4; standard deviation: ***plaser=8.1e-5; GLME, Figure 5E–G, right panels). Among the non-SST neurons fit by a Gaussian function, two thirds of the cells were fit at a single laser power of SST neuronal activation (n=147, 111 and 89 neurons for no, medium and high laser power, respectively, Figure 5H) and less than a tenth of the neurons switched monotonicity with SST neuronal activation (sigmoidal fit at other laser powers for n=17, 6 and 13 neurons at no, medium and high laser power, respectively). Among the non-VIP neurons fit by a Gaussian function, half of the cells were fit at a single laser power of VIP neuronal activation (n=120, 150 and 170 neurons for no, medium and high laser power, respectively, Figure 5I) and around 15% of the neurons switched monotonicity with VIP neuronal activation (sigmoidal fit at other laser powers for n=32, 42 and 40 neurons at no, medium and high laser power, respectively). Thus, SST neuronal activation led to nonmonotonic response-level functions that were shifted downwards with a decreased range of responses, leading to responses above noise level to a narrower range of sound pressure levels. VIP neuronal activation shifted the neuronal response-level curves rightwards with an increased range of response, leading to increased peak responses at higher sound pressure levels (Figure 5J).
These results demonstrated that SST neuronal activation promoted a more localist representation of sound pressure level at the population scale by eliciting responses above noise level over a narrower range of sound pressure levels, and increasing separation between the sound pressure levels covered by monotonic and nonmonotonic neurons. By contrast, VIP neuronal activation promoted a more distributed representation of sound pressure level by broadening response-level curves of single neurons and increasing the overlap between the sound pressure levels covered by monotonic and nonmonotonic neurons.
Could the changes to the fitted parameters of sound-modulated cells (Figure 4 and Figure 5) explain the differential effect of SST and VIP neuronal activation on the separation angle and length between mean population vectors to different sound pressure levels (Figure 3)? To answer this question, we constructed a qualitative model including a monotonic and a nonmonotonic cell, with response-level fit parameters for no and high laser power taken as the mean parameters from the data in Figure 4 and Figure 5 for no and high laser power (Figure 6A–B). With this simple two-cell population, we could qualitatively reproduce the increase in separation angle upon SST neuronal activation and the decrease in separation angle upon VIP activation over the range of sound pressure levels we tested (Figure 6D–F), and similarly the decrease in vector length upon SST neuronal activation and the increase in vector length upon VIP (Figure 6G–I). Thus, the changes to the fit parameters observed in Figures 4–5 can explain the change in representation of sound pressure levels observed in Figure 3.
Overall, when neither SST or VIP neurons are activated, the neuronal population encoded different sound pressure levels using two strategies: the identity of the responsive cells (different cells respond to different sound pressure levels, discrete encoding of sound pressure level) and the strength of the neuronal response (continuous encoding of sound pressure level). When SST neurons are activated, the neuronal population shifts towards a more localist representation of sound pressure level. Specifically, the encoding of sound pressure level relies more on the identity of the responsive cells and less on the magnitude of response: there is less overlap between populations of cells responding to different sound pressure levels, but the strength of response is similar for all sound pressure levels. This can be explained with the narrower bandwidths of response, albeit of reduced magnitude, of both monotonic and nonmonotonic sound-increasing cells with SST neuronal activation. By contrast, when VIP neurons are activated, the neuronal population shifts towards a more distributed representation of sound pressure level. Specifically, the encoding of sound pressure level by the magnitude of the neuronal response is enhanced, while the representation by different cell groups declines: the neuronal responses are of higher magnitude and over a higher range, but there is more overlap between neurons responding to different sound pressure levels. This can be explained with the larger and broader responses of monotonic and nonmonotonic sound-increasing neuronal responses with VIP neuronal activation.
DISCUSSION
Within the brain, neurons form intricate networks, which represent sensory information. A sensory stimulus, such as a specific sound or a visual image, elicits activity in a subset of neurons in a network. A neuronal network can use a multitude of codes to represent information. A stimulus can be encoded discretely with a localist, pattern-separated representation, in which a specific group of neurons represents a specific stimulus, and different stimuli elicit activity in different groups of neurons (Figure 7A). Such localist representations have the advantage of discreteness: they can separate stimuli in different categories. Alternatively, in a distributed representation, stimulus-evoked activity can be distributed across the network, such that the relative activity of neurons within a group represent different stimuli (Figure 7A–B). An example of a distributed representation is a rate code, in which the firing rate of the active neurons represent a continuously varying stimulus feature, such as intensity or sound location (Belliveau et al., 2014). Distributed representations have the advantage of invariance: a small change in stimulus parameter will elicit a small variation in the neuronal response. Neuronal population responses have been measured at various positions along the localist to distributed representation spectrum across many features and areas (such as memory: (Wixted et al., 2014), sound (Hromádka et al., 2008), sound localization (Lesica et al., 2010; Belliveau et al., 2014), vision (Christensen and Pillow, 2022) ) and can change dynamically along the spectrum (Kato et al., 2015; Honey et al., 2017; Kuchibhotla et al., 2017). A defining feature of the auditory cortex is sparse coding (DeWeese et al., 2003), which can lead to both distributed and localist representations. Along the auditory pathway, the coexistence of neurons with monotonic and nonmonotonic response-level curves indicates that sound pressure level is represented by both localist and distributed codes (Schreiner et al., 1992; Wu et al., 2006; Tan et al., 2007; Watkins and Barbour, 2011). More generally, stimulus representation within neuronal networks is mixed between local and distributed codes, in so-called sparse distributed representations, with both the level of activity and identity of activated neurons encoding the stimulus (Figure 7A) (Rolls and Tovee, 1995; Vinje and Gallant, 2000; Hromádka et al., 2008; Wixted et al., 2014). Based on the environmental and behavioral demands, it may be beneficial for neuronal representations to shift dynamically towards a more localist or a more distributed representation of a stimulus feature.
Our results suggest that distinct inhibitory neurons in the auditory cortex affect population neuronal response code by differentially shifting the responses toward a distributed or a localist representations. Previous work found that SST neuronal activation decreases the activity of the neuronal population to sounds (Phillips and Hasenstaub, 2016; Natan et al., 2017), and leads to a rightward shift of monotonic response-level curves (Wilson et al., 2012), while VIP neuronal activation, through a disinhibitory circuit, increases the activity of the population (Pfeffer et al., 2013; Pi et al., 2013; Zhang et al., 2014). We found that activation of SST neurons, led to a sparser, more localist representation (Figure 2D–E), where sound pressure level is encoded in discrete steps by distinct groups of neurons (Figure 3E) with a similar low strength (Figure 3K). By contrast, activation of VIP neurons, led to a more distributed representation (Figure 2K–L), with more overlap between the cell populations responding to different sound pressure levels (Figure 3H): sound pressure level is encoded continuously by varying the strength of response of a large group of neurons (Figure 3N). These shifts in representation are implemented at the single-neuron level through changes to the response-level function of monotonic (Figure 4) and nonmonotonic neurons (Figure 5). SST neuronal activation shifts the response-level curves of sound-modulated neurons by further separating the sound pressure levels that monotonic and nonmonotonic neurons represent (Figure 4I and 5I). With VIP neuronal activation, the changes to the response-level curves of sound-modulated neurons allow for stronger responses and more overlap in bandwidth between monotonic and nonmonotonic neurons (Figure 4J and 5J).
As distinct inhibitory populations can be recruited during different behaviors, their ability to transform the neuronal code can be advantageous to distinct behaviors and computations. Localist versus distributed representations, modulated by the relative strength of global (SST) versus local (PV or VIP) inhibition respectively, may provide support for neuronal computations such as discreteness versus invariance (Kuchibhotla and Bathellier, 2018), segmentation versus concatenation (Haga and Fukai, 2021), integration of bottom-up versus top-down information (Honey et al., 2017; Hertäg and Sprekeler, 2019). The two types of interneurons receive neuromodulatory inputs and may dynamically change the network’s state towards one or the other type of representation for a given task: SST neuronal activation may help with tasks requiring focused attention such as discriminating different stimuli (Lee and Middlebrooks, 2011) or detecting in noise (Lakunina et al., 2022), by sharpening tuning, decreasing the activity for non-relevant stimuli, and enhancing the information-per-spike (Phillips and Hasenstaub, 2016). In contrast, VIP neuronal activation may help with tasks requiring receptive-attention such as active sensing (Gentet et al., 2012) or detecting small stimuli by amplifying weak signals (Millman et al., 2020), increasing detectability (Cone et al., 2019) without increasing the stimulus-response mutual information (Bigelow et al., 2019).
Our results expand on the results of previous studies that measured more general effects of interneuron modulation. The changes to the response-level curves (Figures 4 and 5) we measured upon SST or VIP neuronal activation can explain how the frequency-response functions of neurons in AC change with interneuron modulation. The excitatory and inhibitory inputs to pyramidal cells of AC are frequency-tuned (Isaacson and Scanziani, 2011; Li et al., 2013; Kato et al., 2017), and intracortical inhibition further shapes the tuning (Wu et al., 2008). From the response-level curves, we can plot the response with interneuron activation versus without and thus predict within which range of amplitudes we can expect multiplicative or additive effects to the frequency tuning curve. Previous studies have shown that SST neuronal activation leads to either subtractive, divisive or a combination of both subtractive and divisive effects on the frequency tuning curve (Wilson et al., 2012; Seybold et al., 2015; Phillips and Hasenstaub, 2016). Our data are consistent with these results: for monotonic neurons, SST neuronal activation can lead to divisive and/or subtractive effects at a low range of sound amplitudes and for nonmonotonic neurons, SST neuronal activation leads to both divisive and subtractive effects (Figure 6J–K). Previous studies have also shown that VIP leads to an additive shift in the frequency tuning curve (Pi et al., 2013; Bigelow et al., 2019) and similarly SST inactivation leads to a multiplicative or additive shift mostly (Phillips and Hasenstaub, 2016). Our data can explain these results as well, with multiplicative effects for monotonic neurons and a range of additive and multiplicative effects for nonmonotonic neurons (Figure 6J–K). One component that can contribute to a change in representation is a change in the noisiness of the responses. Indeed, changes in the signal-to-noise ratio (SNR) could drive populations to appear as more localist or distributed in their coding. The change in SNR may be one of the components explaining how the change in representation is implemented, however it is not the sole factor as assessed by the decoding accuracy (Figure 2F and M).
Nonmonotonic neurons in AC either can have their nonmonotonicity inherited from the nonmonotonic excitatory input into those cells while the monotonic inhibitory input, which shows a peak in delay at the cell’s best pressure level, sharpens the nonmonotonicity (Wu et al., 2006) or can be constructed de novo with an imbalance between excitatory and inhibitory inputs (Wu et al., 2006; Tan et al., 2007). This means that for some of the nonmonotonic cells we recorded from, the input into the cell does not covary with the sound pressure level, and so for these cells, we are not assessing the input-output function of the cell through the response-level curve, rather how inhibition further shapes the already intensity-tuned input. From our experiments, we observe that SST neuronal activation decreases the range of responses of nonmonotonic neurons but does not change the sound pressure level of peak response nor the width of responses (Figure 5). This may indicate that SST neuronal activation does not change the timing of the inhibitory input into the nonmonotonic cells, but rather the overall strength of inhibition across all sound pressure levels. In contrast, VIP neuronal activation leads to a shift of the sound pressure level of peak response towards higher levels, along with a broadening of the response and an increase in the range of response (Figure 5). This could simply be explained if VIP neuronal activation changes the timing of the inhibition, with the delay between excitatory and inhibitory inputs peaking at a higher sound pressure level.
In our sample, the relative proportion of nonmonotonic versus monotonic neurons is higher than would be expected from the literature. We note that the number of neuronal responses that we were able to fit are likely an underestimate of the truly monotonic or non-monotonic neurons in the population, as we used stringent selection criteria. The relatively high proportion of non-monotonic neurons may be due to inclusion of the non-primary region VAF, which has previously been shown to have a higher proportion of nonmonotonic neurons than A1 (Wu et al., 2006; Polley et al., 2007). Furthermore, we used a mouse line in which the hearing loss mutation in Cdh23 commonly found in C57B6 mice is corrected, thus our mice may have lower detection thresholds than the mice in previous studies, leading to a larger proportion of nonmonotonic neurons tuned to lower frequencies. Additionally, because of the time course of GCaMP, we are not distinguishing between onset and offset responses, and they may be integrated in this window. These estimates contribute to the ongoing discussion of the differences in monotonicity of responses between the non-human primates (Gao and Wang, 2019) and rodents. It is plausible that our current setup, with imaging performed in awake mice rather than under anesthesia, allows us to sample the responses in a more accurate fashion than previous studies.
In our analysis, the majority of the neurons remained monotonic or non-monotonic between laser conditions, with only 10–15% switching monotonicity (Figures 4H–I and 5H–I). Additionally, the ratio between the neurons which we identify as monotonic or non-monotonic was generally preserved across the laser conditions. A significant fraction of neurons were fit at a single laser power of SST or VIP neuronal activation. SST and VIP neuronal activation thus elicited responses in new pools of neurons for each laser, however with more consistency between laser powers upon VIP neuronal activation than upon SST neuronal activation. Because of the stringent fitting criteria, we believe that our fitting procedure underestimates the true fraction of monotonic or non-monotonic neurons. Nonetheless, we are able to compare the fits across conditions because the criteria and fraction of well-fitted responses remain the same. Therefore, the change in representation (localist or distributed) likely relies on changes to the response-level curves rather than on changes in the proportion of monotonic and nonmonotonic neurons.
A limitation of our study is that we measured the response only from a subset of neurons from layer 2/3 while broadly stimulating many SST or VIP neurons across different cortical layers. However, neurons across the cortical column may perform additional computations in other layers, which would be important to record in future experiments. Our sample ended up including a relatively low number of SST-positive and VIP-positive neurons. Their sound intensity responses largely trended the mean recorded responses and did not differ strongly from each other. From the literature, we were expecting VIP neurons to be selective for low dB sounds (Mesik et al., 2015). Similarly, in the visual cortex, VIP neurons prefer low contrast whereas SST neurons prefer high contrast visual stimuli (Millman et al., 2020). Here, SST and VIP neurons have similar tuning properties with the non-SST and non-VIP neurons within their sessions, but with optogenetic stimulation, sound modulation becomes weaker. Future study should focus on the responses within the SST and VIP neuronal populations across layers. Furthermore, whereas we combined imaging sessions across the auditory cortex, future studies should also examine differences in function of inhibitory neurons across the different primary and non-primary auditory areas.
A related caveat in interpreting our results is that optogenetic activation of inhibitory neurons may differ across samples, and can potentially drive higher activity level of excitatory or inhibitory neurons than physiological levels, saturating the responses. To mitigate this limitation, we included multiple activation levels in each of the imaging sessions by modulating the strength of the laser between high and low levels. This allowed within sample comparison of activity patterns, rather than comparisons of absolute changes across multiple imaging sessions. Furthermore, by comparing the activity levels of imaged neurons between low and high laser intensities, we ensured that the modulation of activity was not saturating due to the laser.
An additional caveat is that the opsin-expressing cells may be depolarized at baseline due to off-target laser stimulation during two-photon imaging (Forli et al., 2018). In SST-Cre mice, the depolarization of SST neurons would lead monotonic non-SST neurons to shift their range of responsiveness towards higher sound pressure levels (Figure 4), while nonmonotonic non-SST neurons still respond to the same best sound level. This would lead to a proportionally larger response to the lower sound pressure levels (covered mainly by nonmonotonic neurons) than to the high sound pressure levels, as we observe in Figure 2C. In VIP-Cre mice, the shape of the population response-level curve (Figure 2J) is similar to that in SST-Cre mice. The depolarization of VIP neurons at baseline may lead to an increase in the amplitude range of nonmonotonic neurons combined with a slight increase in the number of nonmonotonic neurons, which may result in similar changes to the population response-sound level curve compared to the curve in the control.
Another potential caveat in interpreting the data is the while we were able to identify a subset of neurons as SST or VIP positive, the analysis combined multiple types of neurons as non-SST or non-VIP neurons, and for some neurons the tdTomato expression might not have been strong enough for detection as SST or VIP positive. The population of unlabelled neurons may include SST neurons explaining the increase in population response for non-SST neurons in response to high laser power and no sound (Figure 2C). SST neurons also suppress PV neurons (Pfeffer et al., 2013), and at high laser power, in combination with firing rate saturation and double-disinhibition, the overall activity might be increasing in the absence of sound. Additionally, this increase may stem from the previously observed paradoxical effect of increased inhibition driving an increase in the firing rates in a reciprocally connected circuit (Tsodyks et al., 1997; Soldado-Magraner et al., 2022). A targeted experimental approach which would allow either in vivo or post-hoc identification of imaged neuronal types would allow to further distinguish response parameters among different types of inhibitory and excitatory neurons (Kerlin et al., 2010; Khan et al., 2018). Furthermore, future studies inactivating these inhibitory neurons would further complement and extend our findings (Phillips and Hasenstaub, 2016).
In our experiments, we tested the results of SST or VIP neuronal activation on the neuronal representation of sound pressure level, and an important next step will be to investigate whether the changes in neuronal representation of sound pressure level correlate with behavioral effects. One approach would be to image the activity of SST neurons while a mouse is engaged in a task that may require SST neuronal activation, or similarly VIP neurons. SST neurons may be involved in tasks leading to a sharpening in the neuronal tuning properties, or to a filtering out of irrelevant stimuli through the overall decrease in firing rate, such as discrimination tasks and signal-in-noise tasks (Otazu et al., 2009; Lee and Middlebrooks, 2011; Kuchibhotla et al., 2017; Christensen et al., 2019; Lakunina et al., 2022). VIP neurons may be involved in tasks requiring amplification of weak signals, such as a detection at threshold task or active sensing (Fritz et al., 2003; Gentet et al., 2012; Bennett et al., 2013; Kato et al., 2015; Cone et al., 2019; Millman et al., 2020). A second approach would be to measure how SST or VIP neuronal activity changes the performance of a mouse engaged in a task. In a detection task, we may expect that detection thresholds increase with SST neuronal activation and decrease with VIP neuronal activation, as seen in the visual cortex (Cone et al., 2019). In a discrimination task or detection of sounds in background task, we may expect SST neuronal activation to increase the performance, while VIP neuronal activation may decrease or not affect performance: SST neuronal inactivation decreases performance in the detection of sounds in background noise (Lakunina et al., 2022), and at the neuronal level, VIP neuronal activation decreases “encoding efficiency” (Bigelow et al., 2019) while SST neuronal activation may increase it (Phillips and Hasenstaub, 2016).
It should be noted that the dichotomy between the functional roles of SST and VIP neurons might not be so clear cut: VIP and SST neurons may cooperate to simultaneously amplify relevant stimuli and filter out irrelevant stimuli, respectively, or VIP neurons may be more active for weak stimuli and SST neurons for loud stimuli (Zhang et al., 2014; Mesik et al., 2015; Karnani et al., 2016; Kuchibhotla et al., 2017; Dipoppa et al., 2018a; Millman et al., 2020). For example, activation of the cingulate cortex can elicit spiking activity in PV, SST and VIP neurons, with SST neurons contributing to surround suppression, and VIP neurons to facilitation of the center of the receptive field (Zhang et al., 2014); cholinergic modulation depolarizes multiple types of inhibitory neurons, including SST and VIP neurons (Kuchibhotla et al., 2017); and locomotion increases activity in VIP neurons primarily for small stimuli, but in SST neurons for large stimuli (Dipoppa et al., 2018b). VIP neurons inhibit SST neurons, creating attentional spotlights through targeted disinhibition (Karnani et al., 2016). Therefore, examination of the function of SST and VIP neurons in complex behaviors needs to take the circuit mechanisms into account. Interestingly, the connectivity between cortical neurons is largely conserved across layers, primary and non-primary cortices, sensory and non-sensory areas (Douglas and Martin, 2004; Markram et al., 2004; Yu et al., 2019; Campagnola et al., 2022), with SST and VIP neurons mutually inhibiting each other as a common motif: perhaps the change in representation we observe with varying sound level pressure upon SST or VIP neurons extends to other stimulus features, sensory and non-sensory alike.
Supplementary Material
SIGNIFICANCE.
Information about sounds is represented in the auditory cortex by neuronal population activity that has a characteristic sparse structure. Cortical neuronal populations comprise multiple types of excitatory and inhibitory neurons. Here, we find that activating different types of inhibitory neurons differentially controls population neuronal representations, with one type of inhibitory neurons increasing the differences in the identity of the cells recruited to represent the different sounds, and another inhibitory neuron type changing the relative activity level of overlapping neuronal populations. Such transformations may be beneficial for different types of auditory behaviors, suggesting that these different types of inhibitory neurons may be recruited under different behavioral constraints in optimizing neuronal representations of sounds.
ACKNOWLEDGEMENTS
The authors thank Dr. Anna Schapiro and members of the Geffen laboratory for helpful discussions and advice. This work was supported by NIDCD R01DC015527, R01DC014479, NINDS R01NS113241 to MNG and NIDCD K99DC019504 to KCW.
Footnotes
CONFLICT OF INTEREST
The authors declare no competing interests.
REFERENCES
- Belliveau LAC, Lyamzin DR, Lesica NA (2014) The Neural Representation of Interaural Time Differences in Gerbils Is Transformed from Midbrain to Cortex. Journal of Neuroscience 34:16796–16808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bennett C, Arroyo S, Hestrin S (2013) Subthreshold Mechanisms Underlying State-Dependent Modulation of Visual Responses. Neuron 80:350–357. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bigelow J, Morrill RJ, Dekloe J, Hasenstaub AR (2019) Movement and VIP Interneuron Activation Differentially Modulate Encoding in Mouse Auditory Cortex. eNeuro 6:ENEURO.0164–19.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blackwell JM, Lesicko AM, Rao W, De Biasi M, Geffen MN (2020) Auditory cortex shapes sound responses in the inferior colliculus Shinn-Cunningham BG, Carr CE, Woolley S, Read H, eds. eLife 9:e51890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bowers JS (2009) On the biological plausibility of grandmother cells: Implications for neural network theories in psychology and neuroscience. Psychological Review 116:220–251. [DOI] [PubMed] [Google Scholar]
- Campagnola L et al. (2022) Local connectivity and synaptic dynamics in mouse and human neocortex. Science 375 Available at: 10.1126/science.abj5861. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen N, Sugihara H, Sur M (2015) An acetylcholine-activated microcircuit drives temporal dynamics of cortical activity. Nat Neurosci 18:892–902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christensen AJ, Pillow JW (2022) Reduced neural activity but improved coding in rodent higher-order visual cortex during locomotion. Nat Commun 13:1676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christensen RK, Lindén H, Nakamura M, Barkat TR (2019) White Noise Background Improves Tone Discrimination by Suppressing Cortical Tuning Curves. Cell Reports. [DOI] [PubMed] [Google Scholar]
- Cone JJ, Scantlen MD, Histed MH, Maunsell JHR (2019) Different inhibitory interneuron cell classes make distinct contributions to visual contrast perception. eNeuro. [DOI] [PMC free article] [PubMed] [Google Scholar]
- DeWeese MR, Wehr M, Zador AM (2003) Binary spiking in auditory cortex. J Neurosci 23:7940–7949. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dipoppa M, Ranson A, Krumin M, Pachitariu M, Carandini M, Harris KD (2018a) Vision and Locomotion Shape the Interactions between Neuron Types in Mouse Visual Cortex. Neuron. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dipoppa M, Ranson A, Krumin M, Pachitariu M, Carandini M, Harris KD (2018b) Vision and Locomotion Shape the Interactions between Neuron Types in Mouse Visual Cortex. Neuron. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Douglas RJ, Martin KAC (2004) Neuronal Circuits of the Neocortex. Annual Review of Neuroscience 27:419–451. [DOI] [PubMed] [Google Scholar]
- Fanselow EE, Richardson KA, Connors BW (2008) Selective, state-dependent activation of somatostatin-expressing inhibitory interneurons in mouse neocortex. J Neurophysiol 100:2640–2652. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Feigin L, Tasaka G, Maor I, Mizrahi A (2021) Sparse Coding in Temporal Association Cortex Improves Complex Sound Discriminability. J Neurosci 41:7048–7064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Forli A, Vecchia D, Binini N, Succol F, Bovetti S, Moretti C, Nespoli F, Mahn M, Baker CA, Bolton MM, Yizhar O, Fellin T (2018) Two-Photon Bidirectional Control and Imaging of Neuronal Excitability with High Spatial Resolution In Vivo. Cell Rep 22:3087–3098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fritz J, Shamma S, Elhilali M, Klein D (2003) Rapid task-related plasticity of spectrotemporal receptive fields in primary auditory cortex. Nat Neurosci 6:1216–1223. [DOI] [PubMed] [Google Scholar]
- Fu Y, Tucciarone JM, Espinosa JS, Sheng N, Darcy DP, Nicoll RA, Huang ZJ, Stryker MP (2014) A cortical circuit for gain control by behavioral state. Cell 156:1139–1152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gao L, Wang X (2019) Subthreshold Activity Underlying the Diversity and Selectivity of the Primary Auditory Cortex Studied by Intracellular Recordings in Awake Marmosets. Cereb Cortex 29:994–1005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gentet LJ, Kremer Y, Taniguchi H, Huang ZJ, Staiger JF, Petersen CCH (2012) Unique functional properties of somatostatin-expressing GABAergic neurons in mouse barrel cortex. Nat Neurosci 15:607–612. [DOI] [PubMed] [Google Scholar]
- Haga T, Fukai T (2021) Multiscale representations of community structures in attractor neural networks. PLOS Computational Biology 17:e1009296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hertäg L, Sprekeler H (2019) Amplifying the redistribution of somato-dendritic inhibition by the interplay of three interneuron types Cuntz H, ed. PLoS Comput Biol 15:e1006999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Honey CJ, Newman EL, Schapiro AC (2017) Switching between internal and external modes: A multiscale learning principle. Network Neuroscience 1:339–356. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hromádka T, DeWeese MR, Zador AM (2008) Sparse Representation of Sounds in the Unanesthetized Auditory Cortex. PLOS Biology 6:e16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Isaacson JS, Scanziani M (2011) How inhibition shapes cortical activity. Neuron 72:231–243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jackson J, Ayzenshtat I, Karnani MM, Yuste R (2016) VIP+ interneurons control neocortical activity across brain states. Journal of Neurophysiology 115:3008–3017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karnani MM, Jackson J, Ayzenshtat I, Sichani XH, Manoocheri K, Kim S, Yuste R (2016) Opening holes in the blanket of inhibition: Localized lateral disinhibition by vip interneurons. Journal of Neuroscience 36:3471–3480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kato HK, Asinof SK, Isaacson JS (2017) Network-Level Control of Frequency Tuning in Auditory Cortex. Neuron. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kato HK, Gillet SN, Isaacson JS (2015) Flexible Sensory Representations in Auditory Cortex Driven by Behavioral Relevance. Neuron. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kawaguchi Y, Shindou T (1998) Noradrenergic excitation and inhibition of GABAergic cell types in rat frontal cortex. J Neurosci 18:6963–6976. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kerlin AM, Andermann ML, Berezovskii VK, Reid RC (2010) Broadly tuned response properties of diverse inhibitory neuron subtypes in mouse visual cortex. Neuron 67:858–871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khan AG, Poort J, Chadwick A, Blot A, Sahani M, Mrsic-Flogel TD, Hofer SB (2018) Distinct learning-induced changes in stimulus selectivity and interactions of GABAergic interneuron classes in visual cortex. Nature neuroscience 21:851–859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klapoetke NC et al. (2014) Independent optical excitation of distinct neural populations. Nat Methods 11:338–346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kuchibhotla K, Bathellier B (2018) Neural encoding of sensory and behavioral complexity in the auditory cortex. Current Opinion in Neurobiology 52:65–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kuchibhotla KV, Gill JV, Lindsay GW, Papadoyannis ES, Field RE, Sten TAH, Miller KD, Froemke RC (2017) Parallel processing by cortical inhibition enables context-dependent behavior. Nat Neurosci 20:62–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lakunina AA, Menashe N, Jaramillo S (2022) Contributions of Distinct Auditory Cortical Inhibitory Neuron Types to the Detection of Sounds in Background Noise. eneuro 9:ENEURO.0264–21.2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lakunina AA, Nardoci MB, Ahmadian Y, Jaramillo S (2020) Somatostatin-expressing interneurons in the auditory cortex mediate sustained suppression by spectral surround. Journal of Neuroscience. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee C-C, Middlebrooks JC (2011) Auditory cortex spatial sensitivity sharpens during task performance. Nat Neurosci 14:108–114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lesica NA, Lingner A, Grothe B (2010) Population Coding of Interaural Time Differences in Gerbils and Barn Owls. Journal of Neuroscience 30:11696–11702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li L, Li Y, Zhou M, Tao HW, Zhang LI (2013) Intracortical multiplication of thalamocortical signals in mouse auditory cortex. Nat Neurosci 16:1179–1181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Litovsky RY, Clifton RK (1992) Use of sound-pressure level in auditory distance discrimination by 6-month-old infants and adults. J Acoust Soc Am 92:794–802. [DOI] [PubMed] [Google Scholar]
- Markram H, Toledo-Rodriguez M, Wang Y, Gupta A, Silberberg G, Wu C (2004) Interneurons of the neocortical inhibitory system. Nat Rev Neurosci 5:793–807. [DOI] [PubMed] [Google Scholar]
- Tobin Melanie, Geffen M, Sheth J, Katherine Wood (n.d.) Data from: “‘Distinct inhibitory neurons differently shape neuronal codes for sound intensity in the auditory cortex.” Dryad. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mesik L, Ma WP, Li LY, Ibrahim LA, Huang ZJ, Zhang L, Tao HW (2015) Functional response properties of VIP-expressing inhibitory neurons in mouse visual and auditory cortex. Frontiers in Neural Circuits 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Millman DJ, Ocker GK, Caldejon S, Kato I, Larkin JD, Lee EK, Luviano J, Nayan C, Nguyen TV, North K, Seid S, White C, Lecoq J, Reid C, Buice MA, de Vries SEJ (2020) VIP interneurons in mouse primary visual cortex selectively enhance responses to weak but specific stimuli. eLife 9:1–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Natan RG, Briguglio JJ, Mwilambwe-Tshilobo L, Jones SI, Aizenberg M, Goldberg EM, Geffen MN (2015) Complementary control of sensory adaptation by two types of cortical interneurons King AJ, ed. eLife 4:e09868. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Natan RG, Rao W, Geffen MN (2017) Cortical Interneurons Differentially Shape Frequency Tuning following Adaptation. Cell Reports 21:878–890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Olsen SR, Wilson RI (2008) Lateral presynaptic inhibition mediates gain control in an olfactory circuit. Nature 452:956–960. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Otazu GH, Tai L-H, Yang Y, Zador AM (2009) Engaging in an auditory task suppresses responses in auditory cortex. Nat Neurosci 12:646–654. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pfeffer CK, Xue M, He M, Huang ZJ, Scanziani M (2013) Inhibition of inhibition in visual cortex: The logic of connections between molecularly distinct interneurons. Nature Neuroscience. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Phillips DP, Semple MN, Kitzes LM (1995) Factors shaping the tone level sensitivity of single neurons in posterior field of cat auditory cortex. Journal of Neurophysiology 73:674–686. [DOI] [PubMed] [Google Scholar]
- Phillips EA, Hasenstaub AR (2016) Asymmetric effects of activating and inactivating cortical interneurons Uchida N, ed. eLife 5:e18383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pi HJ, Hangya B, Kvitsiani D, Sanders JI, Huang ZJ, Kepecs A (2013) Cortical interneurons that specialize in disinhibitory control. Nature 503:521–524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Polley DB, Heiser MA, Blake DT, Schreiner CE, Merzenich MM (2004) Associative learning shapes the neural code for stimulus magnitude in primary auditory cortex. Proceedings of the National Academy of Sciences 101:16351–16356. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Polley DB, Read HL, Storace D a, Merzenich MM (2007) Multiparametric auditory receptive field organization across five cortical fields in the albino rat. Journal of neurophysiology 97:3621–3638. [DOI] [PubMed] [Google Scholar]
- Rolls ET, Tovee MJ (1995) Sparseness of the neuronal representation of stimuli in the primate temporal visual cortex. Journal of Neurophysiology 73:713–726. [DOI] [PubMed] [Google Scholar]
- Rudy B, Fishell G, Lee S, Hjerling-Leffler J (2011) Three groups of interneurons account for nearly 100% of neocortical GABAergic neurons. Developmental Neurobiology 71:45–61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schreiner CE, Mendelson JR, Sutter ML (1992) Experimental Brain Research Functional topography of cat primary auditory cortex: representation of tone intensity. [DOI] [PubMed] [Google Scholar]
- Seay MJ, Natan RG, Geffen MN, Buonomano DV (2020) Differential short-term plasticity of PV and SST neurons accounts for adaptation and facilitation of cortical neurons to auditory tones. Journal of Neuroscience 40:9224–9235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seybold BA, Phillips EAK, Schreiner CE, Hasenstaub AR (2015) Inhibitory Actions Unified by Network Integration. Neuron 87:1181–1192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soldado-Magraner S, Seay MJ, Laje R, Buonomano DV (2022) Paradoxical self-sustained dynamics emerge from orchestrated excitatory and inhibitory homeostatic plasticity rules. Proc Natl Acad Sci USA 119:e2200621119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun W, Marongelli EN, Watkins PV, Barbour DL (2017) Decoding sound level in the marmoset primary auditory cortex. J Neurophysiol 118:2024–2033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tan AYY, Atencio CA, Polley DB, Merzenich MM, Schreiner CE (2007) Unbalanced synaptic inhibition can create intensity-tuned auditory cortex neurons. Neuroscience 146:449–462. [DOI] [PubMed] [Google Scholar]
- Tsodyks MV, Skaggs WE, Sejnowski TJ, McNaughton BL (1997) Paradoxical effects of external modulation of inhibitory interneurons. The Journal of neuroscience : the official journal of the Society for Neuroscience 17:4382–4388. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vinje WE, Gallant JL (2000) Sparse Coding and Decorrelation in Primary Visual Cortex During Natural Vision. Science 287:1273–1276. [DOI] [PubMed] [Google Scholar]
- Watkins PV, Barbour DL (2011) Rate-level responses in awake marmoset auditory cortex. Hearing Research 275:30–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Willmore B, Tolhurst DJ (2001) Characterizing the sparseness of neural codes. Network: Computation in Neural Systems 12:255–270. [PubMed] [Google Scholar]
- Wilson NR, Runyan CA, Wang FL, Sur M (2012) Division and subtraction by distinct cortical inhibitory networks in vivo. Nature 488:343–348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wixted JT, Squire LR, Jang Y, Papesh MH, Goldinger SD, Kuhn JR, Smith KA, Treiman DM, Steinmetz PN (2014) Sparse and distributed coding of episodic memory in neurons of the human hippocampus. Proceedings of the National Academy of Sciences 111:9621–9626. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wood KC, Angeloni CF, Oxman K, Clopath C, Geffen MN (2022) Neuronal activity in sensory cortex predicts the specificity of learning in mice. Nat Commun 13:1167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu GK, Arbuckle R, Liu B, Tao HW, Zhang LI (2008) Lateral Sharpening of Cortical Frequency Tuning by Approximately Balanced Inhibition. Neuron 58:132–143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu GK, Li P, Tao HW, Zhang LI (2006) Nonmonotonic Synaptic Excitation and Imbalanced Inhibition Underlying Cortical Intensity Tuning. Neuron 52:705–715. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu J, Hu H, Agmon A, Svoboda K (2019) Recruitment of GABAergic Interneurons in the Barrel Cortex during Active Tactile Behavior. Neuron. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang JW, Lau C, Cheng JS, Xing KK, Zhou IY, Cheung MM, Wu EX (2013) Functional magnetic resonance imaging of sound pressure level encoding in the rat central auditory system. NeuroImage 65:119–126. [DOI] [PubMed] [Google Scholar]
- Zhang S, Xu M, Kamigaki T, Do JPH, Chang WC, Jenvay S, Miyamichi K, Luo L, Dan Y (2014) Long-range and local circuits for top-down modulation of visual cortex processing. Science. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data and code are available on the dryad depository: https://doi.org/10.5061/dryad.t1g1jwt6d (Melanie Tobin et al., n.d.).