Skip to main content
eLife logoLink to eLife
. 2019 Oct 8;8:e47611. doi: 10.7554/eLife.47611

Structured inhibitory activity dynamics in new virtual environments

Moises Arriaga 1, Edward B Han 1,
Editors: Helen Scharfman2, Laura L Colgin3
PMCID: PMC6850773  PMID: 31591960

Abstract

Inhibition plays a powerful role in regulating network excitation and plasticity; however, the activity of defined interneuron types during spatial exploration remain poorly understood. Using two-photon calcium imaging, we recorded hippocampal CA1 somatostatin- and parvalbumin-expressing interneurons as mice performed a goal-directed spatial navigation task in new visual virtual reality (VR) contexts. Activity in both interneuron classes was strongly suppressed but recovered as animals learned to adapt the previously learned task to the new spatial context. Surprisingly, although there was a range of activity suppression across the population, individual somatostatin-expressing interneurons showed consistent levels of activity modulation across exposure to multiple novel environments, suggesting context-independent, stable network roles during spatial exploration. This work reveals population-level temporally dynamic interneuron activity in new environments, within which each interneuron shows stable and consistent activity modulation.

Research organism: Mouse

Introduction

Excitation is balanced by inhibition in neuronal networks (Andersen et al., 1963). In cortical circuits, feedforward inhibition is rapidly and robustly recruited by excitatory inputs, while pyramidal neuron firing elicits feedback inhibition to further dampen excitability (Alle et al., 2001; Lamsa et al., 2005; Pouille and Scanziani, 2001; Pouille and Scanziani, 2004). Furthermore, inhibition strongly controls synaptic plasticity, a putative cellular mechanism of learning (Bliss and Lomo, 1973; Whitlock et al., 2006). Intact inhibition limits potentiation to relatively low levels while pharmacologically blocking inhibition facilitates both the induction and magnitude of potentiation (Artola and Singer, 1987; Bear et al., 1992; Steward et al., 1990; Wigström and Gustafsson, 1983). Thus, inhibition suppression is a potential mechanism for enhancing learning by favoring synaptic plasticity in excitatory neurons.

Notably, inhibition can be strongly modulated in vivo in freely moving rodents. CA1 fast-spiking putative interneurons are suppressed when rats explore a novel spatial environment (Frank et al., 2004; Nitz and McNaughton, 2004; Wilson and McNaughton, 1993). Learning new food locations in a familiar environment dynamically modulated fast-spiking interneuron activity and altered the associations of these interneurons with pyramidal cell ensembles (Dupret et al., 2013). Hippocampal CA3 parvalbumin-expressing interneurons (PV-ints) have decreased network connectivity during the initial learning of the Morris Water Maze, but this connectivity increases with task performance, a modulation that is is necessary for learning (Donato et al., 2013; Ruediger et al., 2011). Furthermore, numerous studies have shown that suppression of somatostatin- and/or parvalbumin-expressing interneurons (SOM-ints and PV-ints) is necessary for certain types of cortical and amygdalar learning. This suppression is often triggered by disinhibitory inputs from interneurons that preferentially target other interneurons for inhibition, including vasoactive intestinal peptide-expressing interneurons (VIP-ints) (Gentet et al., 2012; Karnani et al., 2016; Lee et al., 2013; Letzkus et al., 2011; Makino and Komiyama, 2015; Mardinly et al., 2016; Pi et al., 2013; Turi et al., 2019; Wolff et al., 2014). Finally, the activity or plasticity of numerous other interneuron cell-types has been implicated in controlling animal behaviors (Basu et al., 2016; Hartzell et al., 2018). Together, this work demonstrates the dynamic nature of inhibition during spatial exploration and learning, and identifies the importance of inhibition suppression in certain types of learning.

Our understanding of in vivo inhibitory activity in the hippocampus is primarily driven by recordings of soma-targeting fast-spiking interneurons (likely PV-ints) since their distinctive firing characteristics make them relatively identifiable in extracellular electrophysiology recordings (Frank et al., 2004; Klausberger et al., 2004; Nitz and McNaughton, 2004; Wilson and McNaughton, 1993). These studies found transient suppression in firing of fast-spiking units during exploration of novel environments, consistent with a model in which decreased inhibition is permissive for excitatory plasticity and downstream learning. Interestingly, studies using calcium imaging in head-fixed animals in visual virtual reality (VR) contexts have found conflicting results with freely moving animals, with either no change in PV-int calcium activity in new virtual environments or a decrease in somatic calcium activity coupled with increased putative axonal calcium fluorescence, both suggesting no decrease in perisomatic inhibition (Hainmueller and Bartos, 2018; Sheffield et al., 2017).

SOM-ints, although far less understood, are of interest because they selectively innervate the dendrites of pyramidal neurons and can directly control dendritic excitability. Dendritic spikes, typically characterized by calcium entry through Ca2+ channels or NMDA receptors, generate burst firing of neurons and can mediate long-term plasticity, place field formation, and learning (Bittner et al., 2015; Bittner et al., 2017; Cichon and Gan, 2015; Golding et al., 2002; Larkum et al., 1999). Formation of place fields is associated with dendritic spikes that occur during transient periods of SOM-int activity suppression in novel environments (Sheffield et al., 2017). In contrast, SOM-int activation rather than suppression is required for fear learning, both in CA1 or in the dentate gyrus (Lovett-Barron et al., 2014; Stefanelli et al., 2016).

Interneuron activity may be dynamically modulated in new environments or during learning, but the stability of these activity dynamics in individual interneurons across time remain unknown. This is partially a technical issue as extracellular electrode recordings are typically stable for a few hours and have limited ability to identify interneurons, making the longitudinal recording of single interneuron activity difficult. Many inhibitory interneurons are strongly influenced by pyramidal neurons, driven in a feedforward and/or feedback manner. Thus, if activated pyramidal neuron ensembles are stochastic, as is the case for place cells in different environments, then stochastic ensembles of strongly activated interneurons should result. An alternate possibility is that individual interneurons play consistent and reproducible roles in a context-independent manner, reflecting an underlying structure that determines how interneurons regulate the pyramidal network. While inhibitory neurons are composed of multiple cell-types playing distinct network roles (Klausberger and Somogyi, 2008; Pelkey et al., 2017; Wamsley and Fishell, 2017), little is understood about the functional specialization of interneurons within a defined cell-type. Previous work from our lab and others have found functional diversity within the same putative interneuronal cell-types, with individual neurons being consistently activated or inhibited by locomotion (Arriaga and Han, 2017; Garcia-Junco-Clemente et al., 2019). Here we investigated whether learning to adapt a previously learned task to a new spatial context could similarly reveal the ‘set’ functional properties of specific interneurons.

We examined the activity dynamics of PV- and SOM-ints using calcium imaging over multiple days of exposure to initially novel environments as animals performed a goal-directed spatial navigation task. We found that PV- and SOM-int activity was strongly suppressed specifically during the initial exploration of new virtual worlds, with activity returning to baseline levels as animals learned to adapt the task to the new context. In contrast, for animals where the recovery of task performance is blocked (static visual scene with no task), SOM-int activity remained persistently suppressed for days, suggesting that the recovery of interneuron activity is tied to recovery of task performance, rather than to either habituation to the context switch or familiarity with the new visual environment. Surprisingly, suppressed interneuron activity triggered by context changes showed a defined population structure across both new environments and the ‘No Task’ condition in SOM-ints. Each interneuron exhibited consistent activity suppression, with high correlation of suppression across multiple novel contexts as well as the ‘No Task’ condition. These data reveal interneuron activity suppression during spatial exploration in new contexts, as well as a functional inhibitory network structure that may route the encoding of information within the pyramidal network.

Results

Virtual reality behavior

We used two-photon calcium imaging to stably record from neurons over weeks to study the activity dynamics of the same cells in a goal-directed, spatial navigation task across different virtual environments. We initially trained water-scheduled mice to run to alternating ends of a virtual visual track to receive water rewards, using their movement on a floating spherical treadmill (styrofoam ball) to control their movement in VR (Arriaga and Han, 2017; Dombeck et al., 2010; Harvey et al., 2009). Note that mice need to physically rotate and run on the ball in order to turn and maneuvre through VR, making this an internally consistent and continuous virtual world that contrasts with VR tasks requiring unidirectional movement on a track with either ‘teleportation’ back to the starting point after each trial (Hainmueller and Bartos, 2018; Sato et al., 2017; Sheffield et al., 2017) or an infinitely repeating corridor (Gauthier and Tank, 2018).

We used this task to study interneuronal activity dynamics as animals learned to adapt this previously learned behavior to new virtual contexts. We ran animals for 7 min in the well-trained familiar environment (Fam), immediately switched mice to a new VR environment for 14 min (New), and then switched them back for another 7-min session in the original familiar environment (Fam′) (schematic, Figure 1A). The task was identical in familiar and New epochs but the visual environment was different, with changes to the walls, track, and distal landmarks. We repeated this protocol over 5 days, with the same New environment each time.

Figure 1. Behavior in new visual virtual reality (VR) environments.

(A) Head-fixed mice run to alternating ends of the VR track by controlling the movement of a floating spherical treadmill (Styrofoam ball). Mice run forward on the ball to traverse the track and rotate the ball to turn around in VR. Animals spend 7 min in a familiar environment (Fam), which is instantaneously replaced with a new environment (New) for 14 min, before returning to the same familiar environment (Fam’). The task is the same but the visual scene differs in the two environments. (B) Example mouse position in VR shows running to alternating ends of track with water rewards (green) in Fam, with worse performance in New. Lick bouts (black bars) are tracked with an electronic sensor on the lick tube. Ball speed shows similar magnitude in New and Fam environments. (C) Behavioral performance is initially impaired in New (rewards/min in New normalized to FamAve, the average performance in flanking Fam and Fam′ epochs) but improves over time. (D) Mice slow down prior to reward in the familiar environments, measured as deceleration in the 3 s window before reward. Deceleration before reward is initially lower in New but increases over days, suggesting anticipation of reward sites (N = 14 mice, *p<0.05, **p<0.01, ***p<0.001 by one-sample t-test with Bonferroni-Holm Correction (C) or paired sample t-test (D)).

Figure 1—source data 1. Statistical tests and results for Figure 1.
DOI: 10.7554/eLife.47611.005

Figure 1.

Figure 1—figure supplement 1. Behavior metrics in New world.

Figure 1—figure supplement 1.

(A) Most licks in FamAve are within a 1 s window centered on reward delivery (defined as ‘correct’ licks). The fraction of ‘correct’ licks is initially lower in New relative to Fam but increases over days. (B) The fraction of incorrect, unrewarded, entries into a track endzone which coincide with a bout of licking across five days of remapping. Mice are significantly more likely to lick in expectation of a reward upon incorrectly entering an endzone in the familiar environment (when they return to the same reward zone twice in a row), suggesting that they know the reward locations in Fam but not New. This difference decreases over the course of exposure to the New world. (C) Mice lick at lower rates in New. (D) Mean distance from endzone of missed trial, identified as the location of a premature turn in the track. Failed trials (when animals turn around too early in the track and return to the same end zone, resulting in no reward) in the New world occur further from the correct end zones than they do in the familiar environment. This difference decreases over 5 days of remapping. (E) Mice spend a similar amount of time stopped in both environments over 5 days, indicating that stopped periods do not significantly contribute to decreased behavioral performance. (F) Running speed in New across five days of remapping, normalized to mean speed in FamAve. Average running speed is the same in FamAve and New, suggesting slower running does not contribute to decreased behavioral performance. (G) Recovery of behavioral performance does not accelerate when animals undergo a second sequence of remapping into another distinct new world (New 2). Performance in New 1 (original new environment) normalized to flanking FamAve was compared to performance in New 2 (second distinct new environment) across 5 days (N = 6, a subset of animals used in other panels, p>0.99 by paired sample t-test with Bonferroni-Holm Corrections). (H) Mice were initially impaired in behavioral performance in the new environment relative to both Fam and Fam′, but the data show return to high performance in Fam′, suggesting that satiation or fatigue do not contribute to impaired behavioral performance in New. Mean performance was measured in terms of rewards per minute in the New world over 5 days of remapping. Black asterisks represent Fam vs New, Gray asterisks represent Fam′ vs New. All Fam vs Fam′ comparisons are not significantly different. (N = 14 for all panels except (G); *p<0.05, **p<0.01, ***p<0.001 by paired sample t-test or one-sample t-test with Bonferroni-Holm corrections).
Figure 1—figure supplement 2. No track location preference in New.

Figure 1—figure supplement 2.

Occupancy time along the length of the track was used to determine whether mice developed a new preference for a region of the track in New as compared to Fam. (A) Mouse residency in Fam over 5 days. Animals spend far more time in end zones than in the middle of the track. (B) Mouse residency in New over 5 days. Animals spend more time in the middle of the track upon initial exposure to New, but still spend more time in the end zones than in the middle of the track, with no sign of preferred areas on the track. (C) Mouse residency in Fam′ over 5 days. Animals spend far more time in the end zones than in the middle of the track (paired t-test between endzones and track p<0.001 for all days, all environments). (D) Average time animals spend in the endzone in each environment over 5 days of exposure to New. (N = 14, Paired t-test between environment endzone residency, *p<0.05 paired sample t-test with Bonferroni-Holm corrections).

We characterized behavioral performance in novel contexts using a cohort of mice, a subset of which was used for SOM- and PV-int imaging (the remainder used for experiments not discussed here). One quantification of task performance was the number of rewards per minute (rew/min) in Fam, New, and Fam’ epochs. Upon initial exposure to the New virtual world, animal behavior was dramatically altered. Performance in New was significantly worse on Days 1, 2, and 4 compared to FamAve (the average performance in the flanking Fam and Fam′ epochs) (Figure 1C, performance in New normalized to FamAve). This impairment was largest on Day 1 and gradually decreased over the next four days of exposure to the same ‘New’ world, indicating that animals learned to adapt previously learned behavior to the new context.

The recovery of task performance over days in New could be due to context-specific adaptation of the task (which is likely to be hippocampal-dependent) or the result of a more general context-independent strategy. One example of the latter is the identification of a virtual corridor and navigation to the end of this corridor, without regard for the specific visual context (either Fam or New), because rewards are given at the end zones of the track. This strategy is intuitive in the real world; once animals are trained to run to alternating ends of a track, they can perform the task in new tracks immediately in a non-hippocampal-dependent manner (Kim and Frank, 2009). Similarly in a task where mice run forward on a treadmill (constrained to 1D movement), move down a VR track for reward and then are teleported back to the beginning for each trial, task performance recovers within a few minutes of exposure to a new track (Sheffield et al., 2017). In marked contrast, recovery of behavioral performance occurs over days in our task.

Furthermore, if mice rapidly identify virtual corridors, we would expect to see early indications of track location awareness in New. In familiar environments, animals typically slow down before entering the end zone (marked by a period of deceleration beginning several seconds prior the end zone) in anticipation of receiving reward and consuming water (Gauthier and Tank, 2018). However, in new environments, animals initially did not decelerate as they approached the end zone, but over 5 days of repeated exposure to New, they decelerated more and more, approaching the levels of deceleration seen in FamAve (Figure 1D, Day 1, p<0.001, t = 5.1; Day 2, 3, and 4, p<0.05, Day 2 t = 3.44, Day 3 t = 3.48, Day 4 t = 2.8). Similarly, animals in familiar environments generally begin licking within a 1 s window centered on reward delivery (which we define as ‘correct’ licks); in comparison, mice lick with higher frequency outside of this window in New (Figure 1—figure supplement 1A, Day 1, p<0.01, t = −4.82; Day 2, p<0.05, t = −3.13), with their behavior improving over repeated training. We characterized other behavioral metrics that show a similar pattern of initial impairment and recovery (Figure 1—figure supplement 1B,C,D). Other measures of behavior did not change, such as percent of time stopped or average speed (Figure 1—figure supplement 1E,F).

If mice employ a strategy of using a virtual corridor as a cue for finding the reward zone, behavioral recovery should be accelerated due to improved cue recognition when exposed to another new environment. Alternatively, if behavioral recovery is context-dependent rather than cued, the time course of recovery should be similar for a second new environment. We put a subset of mice through a second remapping protocol in which they were exposed to another distinct and novel visual virtual environment, labeled ‘New 2’, with the original novel environment now labeled ‘New 1'. We found that the time course of behavioral recovery was the same for both new environments (Figure 1—figure supplement 1G), suggesting context-dependence. We also confirmed there was no significant difference in behavioral performance (using non-normalized rew/min) between the flanking Fam and Fam′ epochs, indicating that time-dependent effects such as satiation or fatigue were not responsible for task impairment in the intervening New epoch (Figure 1—figure supplement 1H).

While the exact nature of the learning that takes place in new environments is not clear, these data are consistent with mice adapting this previously learned task to new environments by learning to navigate the context-specific sequence of visual cues that lead to alternating end zones in the new world. Furthermore, our data do not support the hypothesis that mice immediately recognize the significance of a virtual ‘corridor’ and use this strategy to navigate to track ends. However, we note that the recovery of behavioral performance in new environments is not an unambiguous measure of learning, as performance is affected by other factors such as surprise at the context switch and running speed.

One such confound that could inadvertently alter mouse behavior is the brightness of the VR worlds, with bright areas potentially inhibiting movement. We designed the VR environments to be as dim as possible (both through minimal projector brightness and an additional dimming film applied to the rear projection screen) with approximately equivalent brightness across worlds. Even the brightest VR features (which constitute a small fraction of the entire visual scene) were 3 cd/m2, which is within the mesoscopic range for visual function at which mice are frequently behaviorally active (Denman et al., 2018; Schmucker et al., 2005). Furthermore, we looked for signs of localized inhibition of locomotion by calculating the occupancy time in spatial bins of the track for the Fam, New, and Fam′ epochs. Mice spend most time in the end zones (where they stop to lick their water rewards) with no obvious indication of preferential areas of stopped or slowed locomotion (Figure 1—figure supplement 2), suggesting that VR brightness was not a significant factor in mouse behavior in our experiments.

Characterization of neuronal calcium activity in novel virtual environments

To investigate in vivo interneuronal activity dynamics during exposure to novel environments, we used two-photon imaging of neuronal calcium activity during a spatial navigation task in visual virtual reality (Arriaga and Han, 2017). We used an electric tunable lens to image a 3-D volume of mouse hippocampal dorsal CA1 by capturing sequential imaging frames along the z-axis moving from stratum pyramidale through oriens, over four to six planes at a frame rate of 5.2–7.8 Hz per plane. Cre-dependent AAV1-Syn-Flex-GCaMP6f was injected into Cre+ transgenic mice to drive a genetically encoded calcium sensor specifically in SOM+ or PV+ hippocampal interneurons. Calcium activity can be taken as a proxy for neuronal activity as multiple studies using simultaneous in vivo imaging and cell-attached patch electrophysiology on the same neurons have found strong correlation between spiking and calcium signals (Chen et al., 2013; Dana et al., 2016). We, and others, have measured the activity dynamics and coding properties of hippocampal neurons in visual virtual environments and found significant similarity between VR and real world behavior in several aspects of function such as place coding, direction-specificity, place cell remapping in novel environments, and interneuron activity correlation and anti-correlation with locomotion (Arriaga and Han, 2017; Gauthier and Tank, 2018; Hainmueller and Bartos, 2018; Harvey et al., 2009; Sheffield et al., 2017).

Interneuron activity suppression in a New environment

To investigate the functional activity dynamics of SOM-ints during spatial exploration, we recorded calcium activity from the same cells while animals performed the VR track-running task in Fam, New, and Fam′ over 5 days. SOM-int neuronal activity, measured as ΔF/F, was strongly suppressed upon transition into New (ΔF/F of 6 sample cells from one imaging plane of one mouse, Figure 2A; Video 1 shows activity suppression in another set of SOM-ints). Individual neurons were differentially suppressed with some being relatively unaffected. On returning to Familiar after New in Fam′, calcium activity rapidly recovered, as did behavioral performance (example cells and behavior from an additional three animals in Figure 2—figure supplement 1). Similar results can be seen in all cells from this animal (Figure 2B). We quantified the neuronal activity of all cells in this example animal as mean ΔF/F and compared this activity across Fam, New, and Fam′. Activity suppression is calculated as the percent difference of mean ΔF/F for each cell between New and FamAve using the formula PercentChange=ΔF/FNew-ΔF/FFamΔF/FFamX100. The histogram of percent differences for each cell in the sample mouse shows a distribution of cells that are suppressed in New (Figure 2C). The calcium activity from another sample mouse over 5 days of exposure to New follows a similar pattern of activity dynamics (Figure 2—figure supplement 2). Across all animals, suppression histograms of SOM activity over the 5-day protocol show a large initial suppression in New that diminishes over days of exposure (Figure 2D).

Figure 2. SOM+ interneuron (SOM-int) activity suppression in new environments.

(A–C) Example data from individual mouse (SOM 1). (A) Top, position in VR track of example mouse. Middle, ΔF/F of sample SOM-ints showing activity suppression in New. (B) Mean ΔF/F of all cells from example mouse on Day 1 of New exposure in each environment (gray), mean ΔF/F of example cells from (A) in corresponding colors, with mean of all cells (black). (C) Histogram of percent change in ΔF/F of SOM-ints shown in panel (B) in New relative to FamAve on Day 1. (D) Activity suppression in New decreases with exposure over days (cells from all mice). (E) SOM-int activity is initially suppressed but recovers over days of exposure to New. (F) Performance in New world increases over days. (G) Mice increasingly slow down prior to reward in New. (N = 10, n = 209 cells; n.s. p>0.05, *p<0.05, **p<0.01, ***p<0.001 by paired sample t-test or one-sample t-test with Bonferroni-Holm corrections).

Figure 2—source data 1. Statistical tests and results for Figure 2.
DOI: 10.7554/eLife.47611.011

Figure 2.

Figure 2—figure supplement 1. SOM-int activity suppression in multiple example animals.

Figure 2—figure supplement 1.

Example cell activity on a smaller time scale for four example SOM mice on Day 1 of New environment. Top, position in VR track, middle, ΔF/F of sample cells, bottom, ball speed. (A) Example cells from mouse SOM 1, seen in Figure 2. (B) Example cells from mouse SOM 2, seen in Figure 2—figure supplement 2. (C) Example cells from mouse SOM 3, not otherwise shown. (D) Example cells from mouse SOM 4, not otherwise shown.
Figure 2—figure supplement 2. SOM-int activity suppression over 5 days of remapping into New.

Figure 2—figure supplement 2.

(A) Cellular activity is initially strongly suppressed but recovers over multiple exposures to New. Top, position in VR track, middle, ΔF/F of sample cells, and bottom, ball speed (SOM 2). (B) Mean ΔF/F of all cells from an example mouse on Day 1 of remapping (colors) and mean (black). (C) Histogram of percent change in ΔF/F of SOM-ints in New world relative to FamAve across 5 days of remapping. (n = 25, mouse SOM 2.)
Figure 2—figure supplement 3. Broad SOM-int firing fields in Fam and New on Day 1.

Figure 2—figure supplement 3.

Data from the same sample mouse for which data are shown in Figure 2A – C (SOM 1). (A) Top, position in VR track, middle, ΔF/F of sample cells, and bottom, ball speed. (B) SOM-int firing is broadly tuned in Fam and suppressed in New. Heatmaps of neuronal activity in the VR track on Day 1 of remapping for the 27 cells in this example mouse. Cells 1–6 are the cells shown in panel (A), with the same heatmap colors.
Figure 2—figure supplement 4. Suppression of SOM-int neurite activity.

Figure 2—figure supplement 4.

(A) Sample plane of imaging from SOM-ints. The red box indicates the neuropil region of interest (ROI), consisting of putative SOM-int axon and dendrites, avoiding cell somata. (B) Pixel-wise percent change in a sample plane of imaging, showing broadly distributed suppression of activity in both soma and neurites. (C) ΔF/F trace of example plane (red), shown with position (middle) and running speed (bottom). (D) Average percent change in New over 5 days of exposure (N = 10, *p<0.05 by one-sample t-test with Bonferroni-Holm correction).

Video 1. Activity suppression of SOM-ints in New world.

Download video file (525.2KB, mp4)
DOI: 10.7554/eLife.47611.012

GCaMP6f fluorescence in SOM-ints in mouse running in Fam, New (marked ‘New World’ in movie, 1st day of exposure), and Fam′. Field of view is ~170 µm x ~ 90 µm. Movie is binned into 2 s frames and played at 57.9X speed (original movie is 29.8 min long).

We also examined the spatial correlates of SOM-int activity in Fam, New, and Fam′ during Day 1 exposure. We typically observed broad firing fields in Fam and Fam′, while activity in New was similarly broad, with the magnitude of activity being reduced in comparison to familiar environments (Figure 2—figure supplement 3). Occasional SOM-ints showed firing fields restricted to end zones, representing interneurons activated during immobility (Arriaga and Han, 2017).

To quantify this suppression over time, we calculated a percent difference for each mouse by averaging all cells per mouse and then calculated a grand mean for all mice on each day (Figure 2E). Indeed, SOM-int activity exhibited significant suppression in New that gradually decreased over days (Day 1, p<0.001, t = −8.70; Day 2, p<0.05, t = −4.06). This decrease in suppression paralleled the increase in behavioral performance in New, as normalized to the average performance in Fam and Fam′ (Figure 2F, performance worse in New on Days 1 and 4, p<0.05, Day 1 t = −4.34, Day 4 t = −5.57). (Note that metrics in Figure 2 are from ten SOM-cre mice, a partially overlapping set of the larger behavioral cohort of 14 mice in Figure 1). We also found decreased deceleration before rewards in New, which recovered with continued exposure to a New environment (Figure 2G, Day 1, p<0.05, t = 3.73). While we used other behaviors such as licking to characterize behavior in Figure 1—figure supplement 1, licking behavior was highly variable across animals, leading us to focus on task performance and deceleration before rewards for characterizing behavior in New. These data show, on average, strong suppression of SOM-int upon exposure to a New environment, with recovery of activity over repeated exposures. At the same time, behavioral performance was initially impaired and then increased over time.

Next, we investigated whether SOM-int somatic activity dynamics correlated with activity from nearby SOM-int neurites (axons and dendrites). A previous report found decreased PV-int somatic calcium levels with simultaneous elevation in neuritic calcium, raising the possibility of differential regulation of soma and axon activity (Sheffield et al., 2017). By analyzing regions of interest in the statum oriens that included neurites and excluded somata, we found that SOM-int neurite activity was similarly suppressed (Figure 2—figure supplement 4).

Our data show strong initial inhibition of SOM-int activity in new virtual environments, yet, at the same time, behavior is altered in new environments. Could decreased interneuronal activity directly result from changes in behavior rather than from network reorganization due to spatial exploration? For example, many SOM-ints have activity that is positively correlated with locomotion (Arriaga and Han, 2017; Turi et al., 2019), making it possible that decreased interneuron activity is due to decreased locomotion in New. Although average locomotion was the same in New and FamAve (Figure 1—figure supplement 1F), we more closely investigated the relationship between neuronal activity and several behavioral variables, such as locomotion (including speed and acceleration), rewards, and location. As a first pass, we individually correlated these variables with cellular activity and found that, in familiar environments, locomotion-related variables had relatively high correlation with activity, on average. In new environments, these correlations were much lower, indicating a disruption in the relationship between behavioral variables and neuronal activity in new environments (Figure 3—figure supplement 1). Similar to the recovery of calcium activity and behavioral performance over time, correlation between behavior and SOM-int activity also recovered with increasing experience in New.

To explore the relationship between behavior and SOM-int activity more thoroughly and quantitatively, we turned to computational modeling using general linear models (GLM) to predict each cell’s fluorescence based on behavior. Models were trained using fluorescence data from Fam and fit using forward and rotational components of ball speed, the timing of rewards, and position and speed in the VR environment. Modeled ΔF/F was very similar to actual ΔF/F in Fam, while in New, the fit of modeled ΔF/F was much worse (six sample cells, Figure 3A). To quantify, we compared modeled ΔF/F to actual ΔF/F in two ways: root mean square (RMS) error, which decreases with better fit, and percent of variance explained (R2), which increases with better fit. Using both measures, we found that model fit was significantly worse in New versus FamAve for all 5 days, suggesting that changes in behavior, including locomotion, did not explain decreased ΔF/F in New (RMS error, Figure 3B; R2, Figure 3C).

Figure 3. Decreased SOM-int activity in New is not explained by altered behavior.

(A) Gaussian general linear models (GLMs) for individual SOM-ints were trained to predict calcium activity as a function of locomotion, VR movement, and rewards in Fam. In New, modeled ΔF/F (black) is larger than actual ΔF/F (colored traces), indicating that the suppression of activity is greater than that predicted from the model (using example mouse [SOM 5]). Note that mice can move on the ball but not change their VR position, as seen here shortly after transition into New. This occurs when animals ‘run’ directly into a VR wall so that they are stationary in VR but still moving. (B) Model fits are significantly worse in New versus FamAve based on average Root Mean Square (RMS) error (lower errors mean better model fit). (C) The average amount of variance (R2) capturedby model also shows worse model fit in New (greater R2 means better model fit) (*p<0.001 by paired sample t-test Bonferroni-Holm corrections, N = 10, n = 209).

Figure 3—source data 1. Statistical tests and results for Figure 3.
DOI: 10.7554/eLife.47611.018

Figure 3.

Figure 3—figure supplement 1. Behavioral variables are poorly correlated with SOM-int activity in New.

Figure 3—figure supplement 1.

Correlation between cell activity and behavioral variables in FamAve and New (measured as the peak cross-correlation within a 2 s window). Activity correlations with behavior in New are decreased initially, and recover over days of exposure to New. Forward: forward component of running speed; Rotation: rotation component of running speed; Reward: timing of rewards; Location: position in VR track; VR Speed: speed in virtual reality environment; Tot. Acc.: total acceleration from mouse running speed; For. Acc: forward component of acceleration from mouse running speed. (N = 10, n = 209, *p<0.05, by paired sample t-test with Bonferroni-Holm corrections).
Figure 3—figure supplement 2. SOM-int GLM performance in different environments.

Figure 3—figure supplement 2.

(A) On Day 1 in New, modeled ΔF/F (black) is larger than actual ΔF/F (colored traces), while in Fam′, modeled fit improves relative to New (in example mouse (SOM 5)). (B) RMS error of model fit is significantly different in Fam versus New on all days, while New is different from Fam′ on Day 1 and 4. (C) Average R2 between modeled fluorescence and cell fluorescence across environments and days. (*p<0.05, **p<0.01, ***p<0.001 by paired sample t-test with Bonferroni-Holm corrections, N = 10, n = 209).
Figure 3—figure supplement 3. Locomotion variables strongly contribute to SOM-int model fits.

Figure 3—figure supplement 3.

(A) Using behavioral data at increasing possible maximum lag values for behavioral variables improves model performance. Linear models were trained on behavioral data with a varying amount of time permitted in the range used to identify the peak of the cross correlation between cell activity and behavioral parameters. Model error (root mean square) decreases with amount of lag included in the model. (B) Linear models were trained using only one of the parameters used to train the full model to examine the relative importance of different parameters to model performance. Model error (2 s lag used) is lowest when including all of the features used to train model. Relative performance of model trained on only one feature varies. For: forward component of running speed; Rot: rotation component of running speed; Rew: timing of rewards; Loc: position in VR track; VR Speed: speed in virtual reality environment; Null: constant model at mean firing rate. (C) Fraction of variance explained by model (R2) increases with amount of lag included in the model. (D) Fraction of variance (R2) explained by model (2 s lag used) is highest when including all of the features used to train the model. (N = 10, n = 209).
Figure 3—figure supplement 4. Behavioral variables poorly estimate SOM-int activity in New on Day 1.

Figure 3—figure supplement 4.

(A) Performance of versions of linear model (R2) trained in Fam using all behavioral variables, as well as each parameter individually, and tested in each of the three environments. Models trained in Fam accurately estimate cell activity when using locomotion variables in Fam and Fam′. Model accuracy is much worse in New. (B) Performance in FamAve and New compared for each model type. (C) Performance of models trained in New, tested in each of the three environments. Even when trained in New, the model performs poorly in New. (D) Models trained in New actually perform better in FamAve, indicating an unpredictable relationship between behavior and activity in New. (E) Performance of models trained in Fam′ using all behavioral variables, as well as each parameter individually, and tested in each of the three environments. Models trained in Fam′ perform well in Fam and Fam′, but not in New. (F) Performance in FamAve and New compared for each model type. All: model trained using all variables; Forward: forward component of running speed; Rotation: rotation component of running speed; Reward: timing of rewards; Location: position in VR track; VR Speed: speed in virtual reality environment; Tot Acc: total acceleration; For Acc: forward component of acceleration; Null: constant model at mean ΔF/F (N = 10, n = 209,*p<0.05, by paired sample t-test with Bonferroni-Holm corrections).

We also compared model fits in New to Fam and Fam′ individually (rather than as FamAve) and found that model fit was worse in New relative to Fam; however, there was not always a significant difference between the RMS errors of the New and Fam′ fits, and this difference became non-significant on Day 2 in New (Figure 3—figure supplement 2B). By contrast, R2 values were consistently worse in New vs. Fam′ (Figure 3—figure supplement 2C). Overall model fits appeared worse in Fam′ relative to Fam. This probably results from multiple factors that contribute to worse fits in both Fam′ and New. First, there is drift in neuronal representations over time, in this case, the relationship between behavioral variables and cellular activity. Thus predicted activity will tend to be best in data temporally close to the data used to train the model, in this case Fam (we note that we used cross validation where training and test data are independent, although both within the Fam epoch). Second, photobleaching over time contributes to decreased model fit. The model is trained on brighter cells in Fam, resulting in predicted fluorescence changes that are larger than those seen in the same dimmer, photobleached cells later on (photobleaching can be observed in the fluorescence traces shown in Figure 2—figure supplement 1). While we attempted to correct for photobleaching using common approaches (such as fitting and correcting with double exponentials), no single solution was appropriate for all cells, primarily because of variability in photobleaching across cells. This confound preferentially impacts RMS error, which is more sensitive to the absolute differences between the actual and predicted fluorescence trace than the R2 value. Finally, there may be a prolonged network effect of exposure to the New environment that crosses into the subsequent Fam′ epoch, perhaps due to long lasting neuromodulatory effects. These time-varying factors will also affect model fit in the New epoch, although with less effect because less time has elapsed since model training. These incremental, time-dependent effects appear much weaker than the immediate and drastic loss of model prediction in New.

To evaluate the contribution of individual behavioral variables to cellular activity, we quantified model fits based on single variables (Figure 3—figure supplement 3). Overall, models based on all variables had the best fits, while among the individual variables, those representing locomotion (forward and rotational ball speed, and VR speed) contributed the most to model fits of cellular fluorescence. Finally, we investigated the relationship between behavior and neuronal activity by training GLMs in different behavioral epochs. Earlier, we trained models in Fam and asked how well those GLMs predicted activity in New. This assumes that cells have the same relationship between behavior and activity in Fam as in New (otherwise the fit will be poor). By training the GLMs in New, we should achieve a high model fit if there is any predictable relationship between behavior and neuronal activity in New. In fact, GLMs trained on behavioral variables in New were still very poor at predicting activity in New (Figure 3—figure supplement 4), whereas in contrast, GLMs trained in either Fam and Fam′ predicted activity in familiar environments well, but poorly in New. This suggests a weakened influence of ongoing behavior on SOM-int activity, specifically in new environments. Taken together, these data show that the strong suppression of SOM-int activity in new environments is not simply explained by changes in mouse behavior but is likely to be triggered by contextual novelty itself.

Next, we tested whether soma-targeting PV-ints also show activity suppression in new environments, as shown in real-world experiments (Frank et al., 2004; Nitz and McNaughton, 2004; Wilson and McNaughton, 1993), although not in previous VR experiments (Hainmueller and Bartos, 2018; Sheffield et al., 2017). Similar to SOM-ints, PV-int somatic calcium activity was also strongly suppressed in New and recovered with repeated exposure to New (Figure 4, Figure 4—figure supplement 1). Qualitatively, suppression of activity seemed even stronger in PV-ints, perhaps reflecting decreased activity from a higher starting firing rate, as PV-ints generally have greater basal firing rate in vivo (Royer et al., 2012; Varga et al., 2012). There was also a spectrum of activity suppression in individual PV-ints, ranging from strong suppression to moderate activation in New (Figure 4B). Behavioral performance (rew/min) was also impaired in New relative to Fam, although in this batch of mice, performance was impaired over 4 days of exposure to New (Figure 4D). Deceleration before stops showed the same trend as SOM+ mice, with initially less deceleration and then increasing deceleration with repeated exposures to New; however, this change in deceleration was not significant (Figure 4E, note that N = 6 in this cohort). Using a GLM trained on behavioral data and cell fluorescence in Fam, we found that model fit was significantly worse in New compared to FamAve (Figure 4F,G,H), suggesting that changes in behavior were not responsible for decreased cellular activity.

Figure 4. PV+ interneuron (PV-int) activity suppression in new environments.

(A) Example data from an individual mouse (PV 1). Top, position in the VR track of an example mouse. Middle, ΔF/F of sample PV-ints showing activity suppression in New. (B) Histogram of percent change in ΔF/F of PV-ints from all mice in New relative to FamAve on Day 1, showing initial activity suppression in New that decreases with exposure over days. (C) PV-int activity is initially suppressed but recovers over days of exposure to New. (D) Performance in a New world increases over days. (E) Mice show a non-significant trend toward decreased deceleration before reward in New. (F) Gaussian general linear models (GLMs) for individual PV-ints were trained as a function of locomotion, VR movement, and rewards in Fam to predict calcium activity. In New, modeled ΔF/F (black) is larger than actual ΔF/F (colored traces), indicating that suppression of activity is greater than that predicted by the model (in example mouse (PV 2)). (G) Model fits are significantly worse in New versus FamAve based on average Root Mean Square (RMS) error (lower errors mean better model fit). (H) Average amount of variance (R2) predicted by model also shows worse model fit in New (greater R2 means better model fit) (*p<0.001 by paired sample t-test Bonferroni-Holm corrections, N = 6, n = 172).

Figure 4—source data 1. Statistical tests and results for Figure 4.
DOI: 10.7554/eLife.47611.026

Figure 4.

Figure 4—figure supplement 1. PV-int activity suppression over 5 days of remapping into New.

Figure 4—figure supplement 1.

(A) Cellular activity is initially strongly suppressed but recovers over multiple exposures to New in an example mouse (PV 3). Top, position in VR track, middle, ΔF/F of sample cells, and bottom, ball speed. (B) Mean ΔF/F of all cells from an example mouse on Day 1 of remapping (colors) and mean (black). (C) Histogram of percent change in ΔF/F of SOM-ints in New world relative to FamAve across 5 days of remapping (n = 46).
Figure 4—figure supplement 2. Broad PV-int firing fields in Fam and New on Day 1.

Figure 4—figure supplement 2.

Data from the same sample mouse as that shown in Figure 4—figure supplement 1 panels (A–C). (A) Top, position in VR track, middle, ΔF/F of sample cells (PV 1), and bottom, ball speed. (B) PV-int firing is broadly tuned in Fam and suppressed in New. Heatmaps of neuronal activity in the VR track on day 1 of remapping 28 cells from this example mouse. Cells 1–6 are the cells shown in panel (A), with the same color of heatmap.
Figure 4—figure supplement 3. Suppression of PV-int neurite activity.

Figure 4—figure supplement 3.

(A) Sample plane of imaging from PV-expressing interneurons. The red box indicates the neuropil region of interest (ROI), consisting of putative PV-int axon and dendrites, avoiding cell soma. (B) Pixel-wise percent change in the sample plane of imaging, showing broadly distributed suppression of activity in both soma and neurites. (C) ΔF/F trace of example plane (red), shown with position (middle) and running speed (bottom). (D) Average percent change in New over 5 days of exposure (N = 6, *p<0.05 by one-sample t-test with Bonferroni-Holm correction).
Figure 4—figure supplement 4. Behavioral variables are poorly correlated with PV-int activity in New.

Figure 4—figure supplement 4.

Correlation between PV-int activity and behavioral variables in Famave and New (measured as peak cross-correlation within a 2 s window). Activity correlations with behavior in New are decreased initially, and recover over days of exposure to New. Forward: forward component of running speed; Rotation: rotation component of running speed; Reward: timing of rewards; Location: position in VR track; VR Speed: speed in virtual reality environment; Tot. Acc: total acceleration from mouse running speed; For. Acc: forward component of acceleration from mouse running speed (N = 6, n = 172, *p<0.05, by paired sample t-test with Bonferroni-Holm corrections).
Figure 4—figure supplement 5. PV-int GLM performance in different environments.

Figure 4—figure supplement 5.

(A) RMS error of model fit is significantly different in Fam versus New on all days, while New is different from Fam′ on Days 1 and 4. (B) Average R2 between modeled fluorescence and cell fluorescence across environments and days. (*p<0.05 by paired sample t-test with Bonferroni-Holm corrections N = 6, n = 172).
Figure 4—figure supplement 6. Behavioral variables poorly estimate PV-int activity in New on Day 1.

Figure 4—figure supplement 6.

(A) Performance of versions of linear model (R2) trained in Fam using all behavioral variables, as well as each parameter individually, and tested in each of the three environments. Models trained in Fam accurately estimate cell activity when using locomotion variables in Fam and Fam′. Model accuracy is much worse in New. (B) Performance in FamAve and New compared for each model type. (C) Performance of models (R2) trained in New, tested in each of the three environments. Even when trained in New, the model performs poorly in New. (D) Models trained in New often perform better in FamAve, indicating an unpredictable relationship between behavior and activity in New. (E) Performance of models trained in Fam′ using all behavioral variables, as well as each parameter individually, and tested in each of the three environments. Models trained in Fam′ perform well in Fam and Fam′, but not in New. (F) Performance in FamAve and New compared for each model type. All: model trained using all variables; Forward: forward component of running speed; Rotation: rotation component of running speed; Reward: timing of rewards; Location: position in VR track; VR Speed: speed in virtual reality environment; Tot Acc: total acceleration; For Acc: forward component of acceleration; Null: constant model at mean ΔF/F (N = 6, n = 172, *p<0.05, by paired sample t-test with Bonferroni-Holm corrections).

Firing fields of PV-ints were broad in both environments (Figure 4—figure supplement 2) and, similar to cell somata, PV-int neurites were suppressed in New (Figure 4—figure supplement 3). To investigate the relationship between behavior and PV-int activity in Fam and New, we first correlated these variables with neuronal activity, again finding relatively high correlation between locomotion and activity in Fam and Fam′, but markedly decreased correlation in New (Figure 4—figure supplement 4). GLMs trained on behavioral data in Fam and PV-int activity predicted significantly better neuronal activity in Fam and Fam′ than in New (Figure 4—figure supplement 5), although again GLM model fits were better in Fam than in Fam′, likely for the same reasons described for SOM-ints. To further investigate the role of behavior on PV-int activity in New, we trained GLMs in New, rather than in Fam. Similar to SOM-ints, GLMS trained in New were poor at predicting activity in New (Figure 4—figure supplement 6), whereas GLMs trained in either Fam or Fam′ predicted activity in familiar environments well, but again poorly in New.

Overall, the results from PV-ints were similar to those of SOM-ints when exposed to new environments, with strong suppression of activity that recovered with repeated exposure and paralleled the recovery of task performance. Modeling of activity also suggests that changes in behavior in New were not responsible for decreased activity. Taken together, these data from PV- and SOM-ints show that two major classes of inhibitory neurons, targeting distinct subcellular targets, are inhibited in new environments, consistent with a critical role for inhibition suppression in the reorganization of network activity in new environments.

Another potential interpretation of interneuron activity suppression in new environments is that suppression is driven by surprise at the context switch, with habituation to this surprise gradually restoring interneuron activity, with no relationship between the return of interneuron activity and the recovery of task performance in the new context. To test this possibility, we dissociated surprise at the context switch from task performance in the new environment by replacing the New environment with a no-task, no-reward epoch and a static visual scene (black screen). Under these conditions, if surprise drives interneuron activity suppression and activity recovery is due to habituation, we would see the same suppression and recovery over time as previously shown. On the other hand, if performance recovery is necessary for the restoration of interneuron activity, we should see sustained inhibitory suppression.

Taking advantage of the long-term recording stability of two-photon calcium imaging, we recorded the same SOM-int cells from a subset of the mice (N = 6) used in the remapping experiment, allowing us to compare directly the kinetics of activity recovery when performance recovery was present or absent. The ‘No Task’ environment evoked strong suppression of SOM-int activity, similar to the suppression seen when mice are switched into New (Figure 5A–D). However, over 5 days of exposure to the same ‘No Task’ environment, activity remained strongly suppressed (Figure 5E, Figure 5—figure supplement 1). In marked contrast, the same cells from the same mice showed strong recovery of activity over 5 days of exposure to New (Figure 5E, showing data from six mice used in Figure 2 that were also, later, switched into the ‘No Task’ environment). Thus, recovery of SOM-int activity is unlikely to be the result of habituation to surprise and may rather depend on the recovery of task performance in the new context.

Figure 5. SOM-int activity suppression remains high when performance recovery is blocked in a ‘No Task’ environment.

(A–C) Example data from an individual mouse (SOM 6). (A) Cells from the sample mouse are strongly suppressed during the ‘No Task’ epoch (static black screen, no rewards). Top, position in VR track, middle, ΔF/F of sample cells, and bottom, ball speed. (B) ΔF/F of all cells from an example mouse on Day 1 of ‘No Task’ exposure showing activity suppression (mean of all cells in black). (C) Histogram of percent change in ΔF/F of SOM-ints from the example mouse on Day 1 of ‘No Task’ showing strong suppression. (n = 18). (D) Interneurons remain suppressed over several days of ‘No Task’ exposure. Histogram of percent change of all cells. (E) In ‘No Task’ exposure, SOM-ints remain suppressed in contrast to recovery during exposure to New. ( The same six mice, which are a subset of the ten mice used in Figure 2E, were exposed to No Task and New). (F) Average speed in ‘No Task’ environment increases relative to Familiar, in contrast to New. (n.s. p>0.05, *p<0.05, **p<0.01, ***p<0.001 by paired sample t-test or one-sample t-test with Bonferroni-Holm corrections, N = 6, n = 116).

Figure 5—source data 1. Statistical tests and results for Figure 5.
DOI: 10.7554/eLife.47611.030

Figure 5.

Figure 5—figure supplement 1. SOM-int activity suppression in the No Task environment.

Figure 5—figure supplement 1.

Cellular activity remains strongly suppressed over multiple exposures to the No Task environment. (A) Top, position in VR track, middle, ΔF/F of sample cells (SOM 7), and bottom, ball speed. (B) Mean ΔF/F of all cells from an example mouse on Day 1 of the No Task environment. (C) Histogram of percent change in ΔF/F of SOM-ints in No Task relative to FamAve environments across 5 days of exposure. (n = 24).
Figure 5—figure supplement 2. Short-time correlation declines in new environments.

Figure 5—figure supplement 2.

Short-time correlation, measured as the overall mean of all pairwise activity correlations within a time bin, decreases in new environments. (A) SOM-int short-time (5 s non-overlapping bins) cell-cell activity correlation across time for each of 5 days of exposure to New. (B) SOM-int mean activity correlation decrease during New in comparison to Fam on the first day. (C) SOM-int short-time cell-cell correlation across time for each of 5 days of exposure to No Task. (D) SOM-int mean activity correlation decreased on all days in No Task compared to Fam. (E) PV-int short-time cell-cell activity correlation across time for each of 5 days of exposure to New. F, PV-int mean activity correlation decreased on all days in Ne compared to Fam. (SOM-int: New, N = 10; No Task, N = 6; PV-int: New, N = 6; *p<0.05 comparing either Fam and New, or Fam and No Task, by paired sample t-test with Bonferroni-Holm corrections).

Our previous analyses examined the activity of individual interneurons. Next, we asked how interneuron ensemble activity was altered in our experiments. Here, we measured short-time scale population correlation by calculating all pairwise cell-cell activity correlations within 5 s non-overlapping time bins, averaging those values to get a mean activity correlation for that time bin, then averaging across all mice. The resulting correlation value reports how similar activity is across all cells for that time bin, and the time series of all values shows how this short-time population correlation measure evolves over time. SOM-ints show stable short-time population correlation in Fam that drops significantly on the first day of New, with a return to high short-time population correlation on return to Fam′. By the second day of New, population activity shows similar levels of correlation across Fam, New, and Fam′ (Figure 5—figure supplement 2A,B). In contrast, SOM-ints in ‘No Task’ periods show low short-time population correlation, which remains depressed throughout 5 days of exposure (Figure 5—figure supplement 2C,D). PV-ints in new environments also showed loss of short-time population correlation in New relative to Fam, although unlike SOM-ints, this decrease was significant for all 5 days (Figure 5—figure supplement 2E,F). These results show that interneuron activity at the population level is less similar in new environments than in familiar environments, likely due to a combination of suppression of activity and decreased influence of locomotion on activity.

Consistent inhibitory structure across new contexts

On average, SOM-ints are suppressed in a new environment (Figure 2E), but the degree of activity suppression is heterogeneous across neurons, ranging from strong inhibition to moderate activation in individual cells (Figure 2D). It is clear that interneurons respond differently in new environments, raising the question of whether individual interneurons show different activity dynamics in different virtual worlds, similar to the way that hippocampal pyramidal neurons stochastically remap. Alternatively, are these activity dynamics context-invariant, suggesting a consistent network role for individual interneurons?

We tested whether the structure of SOM-int activity suppression was stochastic or consistent by putting a subset of the animals previously described through a second remapping protocol, in which they were exposed to another distinct and novel visual virtual environment, labeled ‘New 2,’ with the original novel environment now labeled ‘New 1’ (performance recovery is similar between the two New environments, Figure 1—figure supplement 1G). By recording the same cells across the two remapping protocols, we could correlate the magnitude of each cell’s activity suppression in New 1 vs. New 2. If SOM-ints are stochastically recruited by network activity, there should be no correlation in activity suppression across New 1 vs. New 2. To the contrary, we found strong correlation between activity suppression in New 1 vs. New 2 in individual SOM-ints. This correlation was strong on Day 1, and strikingly, this correlation was significant across all days of the remapping protocol (Figure 6A, Day 1, p<0.001; Day 2, p<0.001; Day 3, p<0.001; Day 5, p<0.001). We, and others, have previously verified place cell global remapping across different virtual environments (Arriaga and Han, 2017; Gauthier and Tank, 2018; Hainmueller and Bartos, 2018; Sheffield et al., 2017), strongly suggesting that this consistent functional inhibitory network structure occurs despite differing ensembles of activated pyramidal neurons.

Figure 6. Consistent SOM-int activity responses across different new environments and in ‘No Task’ epoch.

(A) Individual SOM-ints show correlation in activity modulation in two distinct New environments. (B) Similar correlation of activity modulation is seen between New 1 and ‘No Task’ exposures for Day 1. On subsequent days, correlation disappears as SOM-int activity begins to return in New 1 while remaining suppressed in ‘No Task’. (C) Summary of correlation data from (A) and (B). Correlation between percent change of cells between two remapping sessions or between remapping session one and the ‘No Task’ exposure session. (D) Mean difference in percent change in activity in cells between remapping and ‘No Task’ exposure settings (*p<0.01, **p<0.001, one-sample t-test with Bonferroni-Holm correction N = 6, n = 116).

Figure 6—source data 1. Statistical tests and results for Figure 6.
DOI: 10.7554/eLife.47611.035

Figure 6.

Figure 6—figure supplement 1. Characterization of suppressed SOM-ints.

Figure 6—figure supplement 1.

(A) SOM-ints most inhibited on Day 1 in New remain the most suppressed over the course of the experiment. Percent change in activity in New relative to FamAve, across 5 days, stratified by degree of suppression on Day 1. (B) Percent change in activity in ‘No Task’ session, stratified by degree of suppression on Day 1. (C) Soma location was not associated with activity suppression in New. Percent change of activity on Day 1 in New based on soma location, either stratum oriens (SO) or stratum pyramidale (SP). Most SOM-expressing bistratified interneurons have somas in SP, whereas most Oriens-lacunosum moleculare (OLM) interneurons have somas in SO. (D) Mean cell brightness was not associated with activity suppression in New. Percent change of activity on Day 1 in New versus mean cell brightness. (E) Soma cell size was not associated with activity suppression in New. Percent change of activity on Day 1 of New versus cell size. (F) SOM-int activity correlation with locomotion was not associated with activity suppression in New. Percent change of activity on Day 1 in New versus phase angle of the Hilbert transform for the correlation between stop-triggered mean activity and running speed for each cell. Cells with a positive activity correlation with locomotion have a phase angle near 0, whereas those that are anti-correlated (activated by immobility) are shifted ~180° or π radians. (G) GLM model fit was not associated with activity suppression in New. Percent change of activity on Day 1 in New versus RMS error of model fit to actual cellular ΔF/F in Fam. (H) Percent change of activity on Day 1 in New versus R2 of model fit to actual cellular ΔF/F in Fam. (I) There is no association between cell proximity and strength of response to New (p>0.99). (J) Closer cells have more correlated activity (Spearman Rho −0.1, p<0.001). (A–H: N = 10, n = 209. A, B: vertical bars are binned mean with s.e.m., horizonal is connecting line. I, J: N = 10, n = 1793 cell pairs, vertical bars are binned mean with s.d., horizonal is connecting line.)
Figure 6—figure supplement 2. SOM-int network activity structure is stable across environments.

Figure 6—figure supplement 2.

(A) Correlation matrix of pairwise comparisons of SOM-int ΔF/F of cells from a single imaging plane in Fam. The color of each square represents the Pearson correlation between cell-activity traces. (B) Correlation matrix for same cells in a subsequent New epoch. Cell-cell activity relationships are generally maintained, although the magnitude of activity correlation changes in New. (C) Colored traces, ΔF/F of cells 1–5 from panels (A) and (B) at the transition from Fam to New. Cell-cell correlation values between traces are shown to the left for Fam epoch and to the right for New. Cell pairs have similar activity correlation in Fam and New. (D) Scatter plot of cell pair activity correlation in New vs. Fam. (E) Scatter plot of cell pair activity correlation in No Task vs. Fam. (F) Scatter plot of cell pair activity correlation in New1 vs. New2. (G) Scatter plot of each pair activity correlation in No Task vs. New. (H) Scatter plot of cell pair activity correlation vs. absolute difference in percent change in activity in New between each cell pair. Cell pairs with more dissimilar activity in Fam show larger differences in activity suppression in New. (I) Scatter plot of cell pair activity correlation vs. mean percent change of activity in New for each cell pair. (N = 6, n = 1036 cell pairs. In panels (D–E), the line is unity, tests are Spearman Rank Correlation. In (H, I), vertical bars are binned mean with s.d.; Spearman rho).
Figure 6—figure supplement 3. SOM-int network activity structure is stable across days in the same environment.

Figure 6—figure supplement 3.

(A) Scatter plot of cell pair activity correlation in Fam across 5 days. (B) Scatter plot of cell pair activity correlation in New across 5 days. (C) Scatter plot of cell pair activity correlation in No Task across 5 days. (N = 6, n = 1036 cell pairs, line is unity, test by Spearman Rank Correlation).

A trivial explanation for these results could be that this correlation results from general similarities across the two behavioral epochs, such as having equivalent tasks or a shared layout of the virtual worlds. To probe the structure of SOM-int activity suppression in a drastically different context, we used the ‘No Task’ epoch described previously (Figure 5). Here, the visual scene is distinct and static, and there is no behavioral task. Even here, when comparing activity suppression in New 1 vs. ‘No Task’ epochs, we found significant correlation for each cell on Day 1, indicating that the functional inhibitory network structure for these two very different behavioral epochs is very similar. We also measured correlation in activity suppression throughout the 5-day protocol. Although the correlation was significant on Day 1 (Figure 6B, p<0.001), it was not on Days 2–5. This was not surprising because inhibitory activity in New 1 recovers with increasing task performance, whereas suppression remains strong in ‘No Task’ epochs.

The rapid time-dependent loss of correlation between the New 1 and ‘No Task’ epochs reinforces how striking the correlation structure is for New 1 vs. New 2, indicating not only that initial suppression is correlated but that there is also consistency in the temporal dynamics of this structure in time across 5 days (Figure 6C). Similarly, the mean difference in percent change for each cell between New 1 and New 2 remains stable across the remapping paradigm, whereas the difference between New 1 and the ‘No Task’ epoch steadily increases over the 5-day course of exposure to each environment (Figure 6D).

These results show that each cell exhibits a consistent degree of activity suppression across multiple new environments. To understand more about how this structure arises, we looked for other factors that were associated with each cell’s magnitude of activity suppression. First, we examined the recovery of activity in cells as a function of initial level of activity suppression on Day 1 of remapping into New 1. Cells stratified by the magnitude of their initial suppression continue to be stratified by activity suppression across the 5-day remapping protocol, with the most strongly suppressed on Day 1 remaining the most suppressed on Day 5, while the least suppressed remain the least suppressed (Figure 6—figure supplement 1A). This finding indicates that each cell’s initial activity suppression is predictive of future activity throughout the protocol, suggesting that neurons may not be drawn from the same population of functionally homogeneous interneurons. Similarly, cells that were most suppressed in the ‘No Task’ epoch remained the most suppressed through subsequent days of exposure to this epoch (Figure 6—figure supplement 1B).

Does the degree of activity suppression differentially map onto distinct SOM-int cell types? SOM-ints labeled by cre-driver lines are primarily composed of two functionally and anatomically distinct types, OLM and bistratified interneurons (Klausberger et al., 2004; Royer et al., 2012). The somata of OLM neurons lie in stratum oriens (SO) extending into stratum pyramidale (SP), whereas bistratified interneurons are mostly in SP. We found no difference in activity suppression between cells with somata in SO vs. SP (Figure 6—figure supplement 1C, p=0.38), suggesting that each cell’s magnitude of activity suppression was not defined by cell-type, at least at the level of classification of OLM and bistratified interneurons. It remains possible that even more specific cell-type classification, perhaps through single-cell sequencing methods, may identify such a distinction (Harris et al., 2018). We also found that baseline fluorescence (which could be indicative of basal firing rate, with strong caveats) exhibited no relationship to activity suppression (Figure 6—figure supplement 1D). There was no relationship between soma area and activity suppression (Figure 6—figure supplement 1E, p>0.99).

We previously identified two distinct populations of SOM-ints, one whose activity was positively correlated with locomotion and another whose activity was anti-correlated (Arriaga and Han, 2017). These two populations, as measured by the phase angle of the hilbert transform of the cell’s correlation between stop-triggered mean activity and running speed, were also present in these data. However, there was no difference in activity suppression between the two (Figure 6—figure supplement 1F, p>0.99). Thus, the degree of activity suppression was not readily explained by these previously identified cell classes or cellular properties.

Across the cell population, there was variability in the goodness of model fit, so we investigated whether model fit in Fam was predictive of activity suppression in New. Using both RMS error and R2, we found no relationship between the two variables (Figure 6—figure supplement 1G,H). It is also possible that activity suppression is locally organized, or otherwise distance-dependent, with nearby cells having similar magnitudes of fluorescence changes in New. However, there was no relationship between cell-cell distance and activity suppression (Figure 6—figure supplement 1I). We did, however, find that closer cells have more similar activity, although the relationship was weak (Figure 6—figure supplement 1J).

Our results identify stable SOM-int activity dynamics across different environmental contexts. Is this stability restricted to activity dynamics or is the general population activity structure similarly stable? Earlier we found that overall SOM-int population activity becomes less coordinated when comparing average population correlation between familiar and new or ‘No Task’ environments (Figure 5—figure supplement 2). Here we examined population activity structure by calculating cell-cell activity correlations and asking whether these pairwise activity relationships were stable across environments. In an example animal, in the matrix of cell-cell activity comparisons, there is general preservation of activity relationships between cells in Fam and New (Figure 6—figure supplement 2A,B). This can also be seen directly in activity traces, in which cells that have similar activity in Fam also have similar activity in New (Figure 6—figure supplement 2C, cells 1 and 2), whereas cells that have dissimilar activity in Fam are also dissimilar in New (cells 4 and 5). Indeed, we found correlation in cell-cell activity when looking at Fam vs. New, New 1 vs. New 2, Fam vs. No Task, and New vs. No Task (Figure 6—figure supplement 2D–G). This finding of stable activity structure across environments is seemingly at odds with our earlier finding of decreased short-time population correlation between Fam and New (Figure 5—figure supplement 2). However, our measure of short-time population correlation is an average of all correlation values across the population, while activity structure relies on individual cell-pair correlations. Thus on average, short-time population correlation goes down in New (as can be seen by the overall decrease in correlation values in Figure 6—figure supplement 2B relative to Figure 6—figure supplement 2A), while cell-pair correlations are maintained (Figure 6—figure supplement 2D–G)).

Having identified overall stability in cell-cell activity correlation in different environments, we asked whether there was a relationship between each pair’s correlation level and activity suppression in New. To assign a single activity suppression number to each cell pair, we used the absolute difference between activity suppression values for individual cells; thus pairs with similar activity suppression will have a low difference, whereas pairs with dissimilar suppression will have a larger value. We found that cell pairs with dissimilar amounts of activity suppression in New also had lower activity correlations in Fam (Figure 6—figure supplement 2H). In particular, this relationship seemed driven by cell pairs with low activity correlation that also have strongly differing amounts of activity suppression (left side of graph). We also asked whether there was a bias toward higher or lower mean activity suppression on the basis of the cell-pair correlation value. We found that cell pairs with more similar activity had significantly less activity suppression, although this relationship was relatively weak (Figure 6—figure supplement 2I). Finally, we found that activity structure based on cell-pair activity correlation was stable across 5 days of exposure in Fam, New, and ‘No Task’ (Figure 6—figure supplement 3). These data identify overall preserved SOM-int activity structure, in addition to consistent activity dynamics, across different environments.

Discussion

In this work, we addressed critical questions in neuronal network function: how is interneuron activity dynamically regulated during spatial exploration, and is there a persistent structure in the resulting functional interneuron activity dynamics? We found that the activity of SOM- and PV-ints was initially strongly suppressed upon exposure to a novel virtual environment, and that activity recovered as animals learned to adapt a goal-directed spatial navigation task to this new environment. Furthermore for SOM-ints, there was a persistent functional network structure in the transition from familiar to novel environments. Each SOM-int exhibited a characteristic amount of activity suppression in multiple new environments, as well as in a drastically different ‘No Task’ environment where there was no recovery of task performance.

Our findings are consistent with a model in which entering new environments triggers decreased network inhibition, which then gradually recovers over the course of learning to stabilize the network (Frank et al., 2004; Wilson and McNaughton, 1993). We found suppression of both dendrite- and soma-targeting interneurons (SOM- and PV-ints, respectively), suggesting general disinhibition of postsynaptic pyramidal neurons during spatial exploration in this task, rather than subregion-specific changes (Chen et al., 2015; Hainmueller and Bartos, 2018; Sheffield et al., 2017). This loss in inhibition may act as a permissive gate for learning by enhancing synaptic plasticity in the pyramidal neurons. However, an important limitation of our data is that, while we measured calcium activity in interneuronal somata and neurites (axonal and dendritic), the actual inhibitory drive onto pyramidal neurons during spatial exploration remains unknown.

We hypothesized that the recovery of interneuron activity in new environments was dependent on task performance recovery and tested this by switching mice into a ‘No Task’ environment. Indeed SOM-int activity then remained suppressed rather than recovering, consistent with a role for task performance in modulating interneuron activity dynamics. However, although we eliminated task recovery, multiple variables change in the ‘No Task’ epoch, including the loss of virtual world, water rewards, and clear task rules. These other factors rather than any dependence on behavioral performance, either singly or in combination, could be responsible for the extended suppression of interneuron activity. More controlled manipulations of rewards and task rules in novel VR environments should be able to isolate the contribution of these variables to the recovery of interneuron activity. One factor that merits more in-depth investigation is the role of task engagement in controlling network activity. It was striking that, in the ‘No Task’ epoch, the normal positive correlation between locomotion and interneuronal activity was greatly weakened: even though mice ran on the ball at high speed, SOM- and PV-int activity remained low. This is in marked contrast to head-fixed animals running on a track that have never been trained in a VR task, which show a customary positive correlation (Katona et al., 2014). In our mice, it is likely that even the expectation of some VR world and task is sufficient to engage a state-dependent change in network activity. Thus, the low interneuron activity state may signify a network that is primed for learning, rather than an indication of novelty.

We have shown long-lived SOM- and PV-int suppression that is temporally distinct from transient suppression of SOM- and PV-ints during place cell global remapping (Nitz and McNaughton, 2004; Sheffield et al., 2017; Wilson and McNaughton, 1993). How much interneuron activity suppression is associated with novelty-induced place cell global remapping and how much is due to task performance? In both real world and VR experiments, switching animals to new environments triggers a few minutes of activity suppression and the formation of new place cell maps on the same timescale (Frank et al., 2004; Hainmueller and Bartos, 2018; Muller and Kubie, 1987; Nitz and McNaughton, 2004; Sheffield and Dombeck, 2015; Wilson and McNaughton, 1993). By contrast, in a learning task with no global remapping where freely moving rats learned new goal locations in a familiar environment, fast-spiking putative interneurons both increased and decreased activity as performance increased (Dupret et al., 2013). This reflected dissolution of ensembles encoding old reward maps and the formation of new cell assemblies, comprising pyramidal and interneurons, that represent new reward locations. Taken together, it is likely that the initial SOM-int activity suppression in our experiments is triggered by the switch into a new context and that associated global remapping, while slowly increasing interneuron activity thereafter, is associated with additional map refinement related to task performance. Further experiments using more detailed simultaneous recordings of pyramidal and interneuronal activity will refine this picture, whereas more subtle environmental manipulations will help to dissociate global remapping effects from learning.

Another crucial contribution of this study was the discovery of stable inhibitory interneuron activity structure in different environments, which was enabled by our long-term recording of identified cell-types. First, network activity structure (measured as pairwise cell-cell activity) is stable across different contexts, including the drastically different ‘No Task’ epoch. This suggests that the SOM-int network either continues to receive similar afferent inputs in familiar and new environments, or that network activity structure is locally stabilized, perhaps by gap junctional coupling in the interneuronal network. Second, interneuron activity dynamics were stable on a cell-by-cell basis, with each interneuron having a consistent level of activity suppression, both across multiple new environments and in the ‘No Task’ epoch. These findings reveal an underlying inhibitory circuit infrastructure that may serve to shape information encoding in the local pyramidal cell network. Future work will investigate whether PV-ints and other interneuron types show similar stability in activity dynamics across multiple environments or whether SOM-ints are unique in this regard.

Our finding of stable interneuron activity dynamics suggests that inhibition may play an active role in encoding information in the network, perhaps by regulating activity in connected ensembles of pyramidal neurons. Pyramidal neurons downstream of strongly suppressed interneurons are more likely to express plasticity, as a direct result of increased activity due to release of inhibition. Conversely pyramidal neurons downstream of less suppressed interneurons will receive relatively normal levels of inhibition, perhaps limiting plasticity. Thus, this functionally diverse, but consistent, inhibitory structure may act as a mechanism to address a fundamental tradeoff in neuronal network function: balancing stability with plasticity (Abraham and Robins, 2005; McClelland et al., 1995; McCloskey and Cohen, 1989). By modulating the functional properties of downstream neurons, inhibition can create plastic and stable pyramidal ensembles, allowing the integration of new information while preserving existing network function. The regulation of specific pyramidal neuron ensembles through interneuron control has not been well-studied (Rao-Ruiz et al., 2019); however, intriguing evidence finding distinct pyramidal neuron populations, which code either for learning (engram cells) or for stable place coding over learning, support the existence of hippocampal microcircuits that specialize in plasticity or stability (Tanaka et al., 2018).

One requirement of this model is preferential or targeted connectivity in the outputs of SOM-ints onto pyramidal neurons. Such a scenario is at odds with ‘pooled’ or ‘blanket’ inhibition, where interneurons make promiscuous and non-selective synapses (Fino and Yuste, 2011; Packer and Yuste, 2011), but significant evidence exists for preferential connectivity in both the hippocampus and cortex. In the hippocampus, PV-expressing basket cells preferentially inhibit deep pyramidal neurons projecting to the amygdala, while also being more likely to receive excitation from superficial pyramidal neurons or from deep pyramidal neurons projecting to the prefrontal cortex (Lee et al., 2014). In the medial entorhinal cortex, cholecystokinin-expressing basket cells selectively target pyramidal neurons that project extra-hippocampally (Varga et al., 2010). Furthermore, in the hippocampus, interneurons participate in cell assemblies with pyramidal neurons and can share coding properties such as place fields (Ego-Stengel and Wilson, 2007; Kubie et al., 1990; Marshall et al., 2002). Similarly, functional subnetworks of interneurons and pyramidal neurons have been identified in the cortex (Khan et al., 2018; Najafi et al., 2019; Znamenskiy et al., 2018). Finally, this work identifying the specialization of interneuron function is complemented by evidence of functionally distinct subsets of CA1 pyramidal neurons (Danielson et al., 2016; Graves et al., 2012; Mizuseki et al., 2011; Soltesz and Losonczy, 2018).

We identified structured activity dynamics in the functional responses of interneurons as animals adapted a goal-oriented spatial navigation task to novel virtual environments. What mechanisms might generate the activity dynamics and structure within the interneuron population? Neuromodulatory transmitters targeting G-protein coupled receptors are likely to play a significant role. Novelty or arousal produce strong changes in neuromodulation, with sharp increases in acetylcholine (Acquas et al., 1996; Vinck et al., 2015), norepinephrine (Sara et al., 1995), and dopamine (Kempadoo et al., 2016; McNamara et al., 2014; Takeuchi et al., 2016). Differing levels of inhibitory activity suppression could be set by the expression levels of neuromodulatory receptors in each cell. For example, interneurons show markedly divergent responses to acetylcholine depending on the composition and expression of their receptors (McQuiston and Madison, 1999). In vivo, hippocampal SOM-ints are activated by air puffs and this activation depends on muscarinic acetylcholine receptor signaling (Lovett-Barron et al., 2014; Schmid et al., 2016). Furthermore, synapse gain on SOM-ints after fear conditioning depends on cholinergic input, suggesting additional neuromodulatory effects on SOM-int plasticity (Schmid et al., 2016). Finally, recent work investigating the mechanisms of locomotion activation and the suppression of PV-ints in the visual cortex identified norepinephrine and acetylcholine, respectively, as critical effectors (Garcia-Junco-Clemente et al., 2019).

Another possible mechanism for suppressing interneuron activity is disinhibitory connections from other interneurons targeting SOM-ints. In the hippocampus, this disinhibitory input can come from local VIP, PV, and SOM interneurons (Francavilla et al., 2015; Lovett-Barron et al., 2012). Indeed, in our experiments, some SOM- and PV-ints were activated in novel environments, although it remains unclear whether these interneurons provide disinhibitory input. VIP interneurons are strongly associated with disinhibition, and previous work showed that these neurons are necessary for hippocampal-dependent learning (Donato et al., 2013; Turi et al., 2019).

Finally, it is possible that decreased SOM-int activity is inherited from reduced upstream excitatory input. In this case, feed-forward inhibition is driven by EC and CA2/3 input while feed-back excitation is driven by local CA1 neurons. However, during learning or novelty, CA3 and EC pyramidal neurons don’t change their firing rates, while CA1 pyramidal neurons increase activity (Barry et al., 2012; Karlsson and Frank, 2008; Nitz and McNaughton, 2004; Wilson and McNaughton, 1993), making it unlikely that suppression of inhibitory cell activity is purely a function of reduced excitatory drive.

Our work identifies inhibitory activity dynamics while revealing that individual interneurons have a consistent functional role across multiple new environments. Together, this work and the work of others highlight functional specialization within defined sets of neurons, which may serve to allow efficient incorporation of new information while maintaining overall network stability.

Materials and methods

Animals

All experiments were approved by the Washington University Animal Care and Use Committee. Heterozygotes (+/–) from two cre-driver mice lines on a C57Bl/6J genetic background were used to label parvalbumin-expressing and somatostatin-expressing inhibitory interneurons: SSTtm2.1(cre)Zjh/J (SOM-cre) and Pvalbtm1(cre)Arbr/J (PV-cre; Jackson Labs). All imaging data were from SOM-ints while behavioral data in Figure 1 were from PV- and SOM-ints.

Viral injections and hippocampal window implantation

Surgical procedures, VR track running behavior, and two-photon imaging have been described previously (Arriaga and Han, 2017). Briefly, mice were injected with adeno-associated virus (AAV) at 2–4 months of age. Mice were anesthetized with isoflurane, and a small (0.5 mm) craniotomy was opened above the left cortex. Virus (AAV1.Syn.Flex.GCaMP6f.WPRE.SV40, Penn Vector Core, University of Pennsylvania, 1.71 × 1013 genome copies, diluted 1:1–1:4 with PBS,~50 nL total volume) was pressure injected through a beveled micro-pipette targeting CA1 (−1.8 ML, −2.0 AP, −1.3 DV).

After virus injection, mice were water-scheduled for 1–3 weeks and an imaging cannula (2.8 mm diameter) was implanted above the hippocampus by aspirating the overlying cortex. Mice recovered for at least two weeks after surgery before beginning training.

VR track running behavior

The virtual reality display used a custom-built semi-cylindrical projection screen (1 ft radius) and two rear projectors (Optima 750 ST). The projection screen was positioned ~30 cm in front of the mouse and occupied 180° of the horizontal view, 16° below the horizon and 35° above. VR world brightness was kept low, both to minimize behavioral inhibition and because stray VR light contributes to noise during calcium imaging. Projectors were set at minimal brightness with an additional darkening film applied to the rear projection screen (Gila Glare Control, Smoke). To measure brightness directly, we used a chromameter (Minolta CS100A). On the basis of the lens configuration of the instrument, we could not measure total brightness to compare luminance across environments, but we could measure smaller visual features (diameter ~1.4 cm). The luminance of all features fell between 0.03 and 3 candela/m2, within the mesopic range of visual function, in which mice are frequently behaviorally active (Denman et al., 2018).

Mice were head-fixed on a spherical Styrofoam treadmill supported on a cushion of air from a 3D printed base, which allowed free ball rotation with mouse locomotion. Treadmill movement was tracked using a G400 Logitech mouse configured in LabView (National Instruments). The VR environment was rendered using ViRMEn (Virtual Reality Matlab Engine; Aronov and Tank, 2014). Mice were trained to run to alternating ends of a linear VR track (180 cm) for 2–5 weeks until they consistently achieved target performance (>2 rewards/min for one week). After training, mice were imaged during exposure to a new visual virtual world. Remapping experiments consisted of 7 min of behavior in the familiar track (Fam), an instantaneous switch to a novel track of the same length with different visual textures and landmarks (New) for 14 min, and then a return to the familiar (Fam′) environment for 7 min. This remapping protocol was repeated for five successive days with the same, decreasingly novel, New world. In a subset of animals, a second remapping task was also performed. This task was identical to the first with the exception of a different New environment (New 2). In addition, this same subset of animals was imaged in a ‘No Task’ session. This session consisted of 7 min of navigation in the familiar track, 14 min of exposure to a dark screen with no rewards, and a return to the initial familiar environment for 7 min.

Two-photon imaging

Calcium imaging was performed on a Neurolabware laser-scanning two-photon microscope, with the addition of an electric tunable lens (ETL; Optotune, EL-10–30-NIR-LD) and an f=–100 mm offset lens to change axial focal length rapidly. We imaged 4–6 axial planes spanning up to 250 µm in the z-axis at a total frame rate of 31 Hz, resulting in a per plane sampling rate of 5.2 Hz for a six plane recording and 7.8 Hz for a four plane recording. Field of view in x-y was 500 × 500 µm. Laser power (at 920 nm) was ~25–50 mW after the objective and was set independently for each plane imaged.

Data analysis

Data were analyzed using custom programs written in Matlab (RRID:SCR_001622) available at GitHub (Arriaga and Han, 2019, https://github.com/Han-Lab-WUSM/MA-scripts; copy archived at https://github.com/elifesciences-publications/MA-scripts). Images were motion-corrected using cross-correlation registration and rigid translation of individual frames. Slow fluctuations in fluorescence were removed from calculations of ΔF/F0 by calculating F0 using the eighth percentile of fluorescence intensity from a sliding window 300 s around each time point. ROIs were selected using a semi-automated process. Possible ROIs were identified as contiguous regions with SD >1.5 and an area >90 µm2. Overlapping ROIs were manually separated, ROIs were redrawn by hand to separate adjacent cells into distinct ROIs. Unresponsive puncta, or those with low signal-to-noise ratios (initially identified as having a skewness of ΔF/F in the first familiar environment less than 0.3) were dropped from further analysis. When the same cell was recorded in multiple planes, the brightest ROI was used. Neuropil contamination was removed by subtracting a perisomatic fluorescence signal from an annulus between 5 and 20 µm from each ROI, excluding any other possible ROIs (FCorrected–ROI = FROI – 0.8 * FNeuropil).

The percent change in the New environment was calculated on each day for each cell as the ratio between the mean fluorescence in the 14 min New world exposure and the mean of the fluorescence from the two 7 min familiar worlds exposures, normalized by the sum of these means: Percent Change=1 μNewμ(Fam+Fam).

Cell activity correlation with behavioral variables was calculated by taking the maximum value of the cross-correlation of demeaned time series within a 2 s window of lag. We followed the evolution of cell ensemble correlation over time by taking short-time cell-cell correlation of cell activity using the mean Pearson correlation of all pairwise comparisons in non-overlapping 5 s windows.

Behavior analysis

Ball movement data, sampled at 1 kHz, was downsampled to match the imaging frame rate. All normalized behavioral metrics were normalized by taking the ratio of the metric in the New world to FamAve (mean of Fam and Fam′). Task performance was calculated as the rewards received per minute. Speed was calculated as the Euclidean sum of the forward and rotation components of ball velocity. Deceleration was calculated as the first derivative of the forward component of the ball speed during a 3 s window prior to reward. Deceleration before all rewards in the epoch were averaged (ranging from 13 to 30 rewards/epoch on average). The location of trial failure was identified as the distance from the correct destination end zone at which the animal turns around before reaching the end zone.

Lick behavior was detected using a two-transistor lick detection circuit (Slotnick, 2009). Individual licks were not resolvable, so lick responses were binned into lick bouts, defined as a period of repeated lick responses with less than 200 ms between repeated lick signals. The lick rate was calculated as the number of these licking bouts per minute. The fraction of ‘correct’ licks was calculated as the fraction of licking bouts that occur within ± 500 ms of reward delivery (marked by an audible solenoid click to dispense water). The fraction of licks in unrewarded end zones was calculated as the fraction of incorrect, unrewarded, entries into the track end zone that elicited a bout of licking. Mouse location residency was calculated by binning the track in 20 cm bins and by measuring the amount of time that the animals spent in each bin of the track in each environment.

General linear model of activity

A general linear model was used to estimate fluorescence as a function of the behavioral parameters that are correlated with cell activity. The model predicts fluorescence as the linear combination of weighted, time-lagged behavior components. The lag used for each component was determined by the time of the peak of its cross-correlation with cell activity. Modeling of interneuron fluorescence was done using the glmfit function in Matlab with a normal distribution and an identity link function. The oscillatory nature of interneuron fluorescent activity series, without the large transients typical in pyramidal cells, were better fitted using a normal distribution than using the Poisson distribution commonly used in generalized linear models of pyramidal cell activity. Models were trained using fluorescence data from Fam and fitted using the forward and rotation components of ball speed, the timing of rewards, and the position and speed in the virtual reality environment. Behavioral data were included at a lag of up to 2 s, as determined by the maximum value of the cross-correlation between each parameter and cell activity. Root mean square (RMS) error and model correlation, for models tested in the environment in which they were trained, were calculated using 10-fold cross validation. Successive models were trained on 9/10 of the data set and tested on 1/10 of the data. The average model performance across these ten sessions was used as the performance of the model. Performance in environments other than the one in which the model was trained was calculated by directly applying the model behavioral data from test epochs, and by calculating RMS and model correlation on the entire resultant time series. Correlation was measured as the Pearson correlation between modeled traces and ΔF/F in each context, and R2 was taken as the square of the Pearson correlation.

Experimental design and statistical analysis

Behavior data are reported from 14 mice (seven male, seven female). We recorded 209 somatostatin-cre positive cells (mean = 20.9 ± 5.45) from ten mice (eight male, two female) across all 5 days of the initial remapping experiment. For the second remapping and ‘No Task’ paradigms, we recorded from six of these mice (five male, one female), tracking 107 cells across all three contexts. We recorded from 176 parvalbumin-cre cells (mean = 28.67 ± 15.02) from six animals (two male, four female).

The significance of normalized data metrics was calculated using one-sample t-tests of mean values. Differences between familiar and new epochs were calculated using paired sample t-tests of mean values, RMS error values were calculated on each cell in each epoch with paired-sample t-tests.

Significance of RMS error and R2 values were calculated on each cell in each epoch with paired-sample t-tests with Bonferroni-Holm corrections. Correlations of percent change across paradigms were calculated using Pearson correlation. Pearson correlations were used to calculate the correlation between percent change in each New world or No Task epoch. All multiple comparisons were corrected with Bonferroni-Holm corrections.

Locomotion and immobility associated interneurons were identified using the phase angle of the cross-correlation of interneuron activity and running speed, as described previously (Arriaga and Han, 2017).

Statistical analyses were performed in Matlab.

Acknowledgements

This work was supported by the McDonnell Centers for Systems Neuroscience, and Cellular and Molecular Neurobiology, and through the Cognitive, Computational, Systems Neuroscience Pathway at Washington University in St. Louis. We thank Martha Bagnall and Suyash Harlalka for comments.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Edward B Han, Email: ehan23@wustl.edu.

Helen Scharfman, New York University Langone Medical Center, United States.

Laura L Colgin, University of Texas at Austin, United States.

Funding Information

This paper was supported by the following grants:

  • McDonnell Center for Systems Neuroscience to Edward B Han.

  • McDonnell Center for Cellular and Molecular Neurobiology to Edward B Han.

  • Washington University in St. Louis Cognitive, Computational, Systems Neuroscience Pathway (Graduate Student Fellowship) to Moises Arriaga.

Additional information

Competing interests

No competing interests declared.

Author contributions

Resources, Data curation, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing—review and editing.

Conceptualization, Supervision, Investigation, Methodology, Writing—original draft, Writing—review and editing.

Ethics

Animal experimentation: This study was performed in strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health. All of the animals were handled according to approved institutional animal care and use committee (IACUC) protocols of Washington University (Animal Welfare Assurance # A-3381-01). The protocol was approved by the Washington University School of Medicine IACUC (#20170230). All surgery was performed under isofluorane anesthesia, and every effort was made to minimize suffering.

Additional files

Transparent reporting form
DOI: 10.7554/eLife.47611.036

Data availability

Source data are available at https://doi.org/10.5061/dryad.f83kt85.

The following dataset was generated:

Arriaga M, Han E. 2019. Data from: Structured Inhibitory Activity Dynamics During Learning. Dryad Digital Repository.

References

  1. Abraham WC, Robins A. Memory retention--the synaptic stability versus plasticity dilemma. Trends in Neurosciences. 2005;28:73–78. doi: 10.1016/j.tins.2004.12.003. [DOI] [PubMed] [Google Scholar]
  2. Acquas E, Wilson C, Fibiger HC. Conditioned and unconditioned stimuli increase frontal cortical and hippocampal acetylcholine release: effects of novelty, habituation, and fear. The Journal of Neuroscience. 1996;16:3089–3096. doi: 10.1523/JNEUROSCI.16-09-03089.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Alle H, Jonas P, Geiger JR. PTP and LTP at a hippocampal mossy fiber-interneuron synapse. PNAS. 2001;98:14708–14713. doi: 10.1073/pnas.251610898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Andersen P, Eccles JC, LøYning Y. Hippocampus of the brain: recurrent inhibition in the Hippocampus with identification of the inhibitory cell and its synapses. Nature. 1963;198:540–542. doi: 10.1038/198540a0. [DOI] [PubMed] [Google Scholar]
  5. Aronov D, Tank DW. Engagement of neural circuits underlying 2D spatial navigation in a rodent virtual reality system. Neuron. 2014;84:442–456. doi: 10.1016/j.neuron.2014.08.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Arriaga M, Han EB. Dedicated hippocampal inhibitory networks for locomotion and immobility. The Journal of Neuroscience. 2017;37:9222–9238. doi: 10.1523/JNEUROSCI.1076-17.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Arriaga M, Han EB. Interneuron_calcium_imaging_scripts. feeb537GitHub. 2019 https://github.com/Han-Lab-WUSM/MA-scripts
  8. Artola A, Singer W. Long-term potentiation and NMDA receptors in rat visual cortex. Nature. 1987;330:649–652. doi: 10.1038/330649a0. [DOI] [PubMed] [Google Scholar]
  9. Barry C, Ginzberg LL, O'Keefe J, Burgess N. Grid cell firing patterns signal environmental novelty by expansion. PNAS. 2012;109:17687–17692. doi: 10.1073/pnas.1209918109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Basu J, Zaremba JD, Cheung SK, Hitti FL, Zemelman BV, Losonczy A, Siegelbaum SA. Gating of hippocampal activity, plasticity, and memory by entorhinal cortex long-range inhibition. Science. 2016;351:aaa5694. doi: 10.1126/science.aaa5694. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bear MF, Press WA, Connors BW. Long-term potentiation in slices of kitten visual cortex and the effects of NMDA receptor blockade. Journal of Neurophysiology. 1992;67:841–851. doi: 10.1152/jn.1992.67.4.841. [DOI] [PubMed] [Google Scholar]
  12. Bittner KC, Grienberger C, Vaidya SP, Milstein AD, Macklin JJ, Suh J, Tonegawa S, Magee JC. Conjunctive input processing drives feature selectivity in hippocampal CA1 neurons. Nature Neuroscience. 2015;18:1133–1142. doi: 10.1038/nn.4062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bittner KC, Milstein AD, Grienberger C, Romani S, Magee JC. Behavioral time scale synaptic plasticity underlies CA1 place fields. Science. 2017;357:1033–1036. doi: 10.1126/science.aan3846. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bliss TV, Lomo T. Long-lasting potentiation of synaptic transmission in the dentate area of the anaesthetized rabbit following stimulation of the perforant path. The Journal of Physiology. 1973;232:331–356. doi: 10.1113/jphysiol.1973.sp010273. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Chen TW, Wardill TJ, Sun Y, Pulver SR, Renninger SL, Baohan A, Schreiter ER, Kerr RA, Orger MB, Jayaraman V, Looger LL, Svoboda K, Kim DS. Ultrasensitive fluorescent proteins for imaging neuronal activity. Nature. 2013;499:295–300. doi: 10.1038/nature12354. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Chen SX, Kim AN, Peters AJ, Komiyama T. Subtype-specific plasticity of inhibitory circuits in motor cortex during motor learning. Nature Neuroscience. 2015;18:1109–1115. doi: 10.1038/nn.4049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Cichon J, Gan W-B. Branch-specific dendritic Ca2+ spikes cause persistent synaptic plasticity. Nature. 2015;520:180–185. doi: 10.1038/nature14251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Dana H, Mohar B, Sun Y, Narayan S, Gordus A, Hasseman JP, Tsegaye G, Holt GT, Hu A, Walpita D, Patel R, Macklin JJ, Bargmann CI, Ahrens MB, Schreiter ER, Jayaraman V, Looger LL, Svoboda K, Kim DS. Sensitive red protein calcium indicators for imaging neural activity. eLife. 2016;5:e12727. doi: 10.7554/eLife.12727. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Danielson NB, Zaremba JD, Kaifosh P, Bowler J, Ladow M, Losonczy A. Sublayer-Specific coding dynamics during spatial navigation and learning in hippocampal area CA1. Neuron. 2016;91:652–665. doi: 10.1016/j.neuron.2016.06.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Denman DJ, Luviano JA, Ollerenshaw DR, Cross S, Williams D, Buice MA, Olsen SR, Reid RC. Mouse color and wavelength-specific luminance contrast sensitivity are non-uniform across visual space. eLife. 2018;7:e31209. doi: 10.7554/eLife.31209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Dombeck DA, Harvey CD, Tian L, Looger LL, Tank DW. Functional imaging of hippocampal place cells at cellular resolution during virtual navigation. Nature Neuroscience. 2010;13:1433–1440. doi: 10.1038/nn.2648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Donato F, Rompani SB, Caroni P. Parvalbumin-expressing basket-cell network plasticity induced by experience regulates adult learning. Nature. 2013;504:272–276. doi: 10.1038/nature12866. [DOI] [PubMed] [Google Scholar]
  23. Dupret D, O’Neill J, Csicsvari J. Dynamic Reconfiguration of Hippocampal Interneuron Circuits during Spatial Learning. Neuron. 2013;78:166–180. doi: 10.1016/j.neuron.2013.01.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Ego-Stengel V, Wilson MA. Spatial selectivity and theta phase precession in CA1 interneurons. Hippocampus. 2007;17:161–174. doi: 10.1002/hipo.20253. [DOI] [PubMed] [Google Scholar]
  25. Fino E, Yuste R. Dense inhibitory connectivity in neocortex. Neuron. 2011;69:1188–1203. doi: 10.1016/j.neuron.2011.02.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Francavilla R, Luo X, Magnin E, Tyan L, Topolnik L. Coordination of dendritic inhibition through local disinhibitory circuits. Frontiers in Synaptic Neuroscience. 2015;7:5. doi: 10.3389/fnsyn.2015.00005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Frank LM, Stanley GB, Brown EN. Hippocampal plasticity across multiple days of exposure to novel environments. Journal of Neuroscience. 2004;24:7681–7689. doi: 10.1523/JNEUROSCI.1958-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Garcia-Junco-Clemente P, Tring E, Ringach DL, Trachtenberg JT. State-Dependent subnetworks of Parvalbumin-Expressing interneurons in neocortex. Cell Reports. 2019;26:2282–2288. doi: 10.1016/j.celrep.2019.02.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Gauthier JL, Tank DW. A dedicated population for reward coding in the Hippocampus. Neuron. 2018;99:179–193. doi: 10.1016/j.neuron.2018.06.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Gentet LJ, Kremer Y, Taniguchi H, Huang ZJ, Staiger JF, Petersen CC. Unique functional properties of somatostatin-expressing GABAergic neurons in mouse barrel cortex. Nature Neuroscience. 2012;15:607–612. doi: 10.1038/nn.3051. [DOI] [PubMed] [Google Scholar]
  31. Golding NL, Staff NP, Spruston N. Dendritic spikes as a mechanism for cooperative long-term potentiation. Nature. 2002;418:326–331. doi: 10.1038/nature00854. [DOI] [PubMed] [Google Scholar]
  32. Graves AR, Moore SJ, Bloss EB, Mensh BD, Kath WL, Spruston N. Hippocampal pyramidal neurons comprise two distinct cell types that are countermodulated by metabotropic receptors. Neuron. 2012;76:776–789. doi: 10.1016/j.neuron.2012.09.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Hainmueller T, Bartos M. Parallel emergence of stable and dynamic memory engrams in the hippocampus. Nature. 2018;558:292–296. doi: 10.1038/s41586-018-0191-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Harris KD, Hochgerner H, Skene NG, Magno L, Katona L, Bengtsson Gonzales C, Somogyi P, Kessaris N, Linnarsson S, Hjerling-Leffler J. Classes and continua of hippocampal CA1 inhibitory neurons revealed by single-cell transcriptomics. PLOS Biology. 2018;16:e2006387. doi: 10.1371/journal.pbio.2006387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Hartzell AL, Martyniuk KM, Brigidi GS, Heinz DA, Djaja NA, Payne A, Bloodgood BL. NPAS4 recruits CCK basket cell synapses and enhances cannabinoid-sensitive inhibition in the mouse hippocampus. eLife. 2018;7:e35927. doi: 10.7554/eLife.35927. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Harvey CD, Collman F, Dombeck DA, Tank DW. Intracellular dynamics of hippocampal place cells during virtual navigation. Nature. 2009;461:941–946. doi: 10.1038/nature08499. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Karlsson MP, Frank LM. Network dynamics underlying the formation of sparse, informative representations in the Hippocampus. Journal of Neuroscience. 2008;28:14271–14281. doi: 10.1523/JNEUROSCI.4261-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Karnani MM, Jackson J, Ayzenshtat I, Hamzehei Sichani A, Manoocheri K, Kim S, Yuste R. Opening holes in the blanket of inhibition: localized lateral disinhibition by VIP interneurons. Journal of Neuroscience. 2016;36:3471–3480. doi: 10.1523/JNEUROSCI.3646-15.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Katona L, Lapray D, Viney TJ, Oulhaj A, Borhegyi Z, Micklem BR, Klausberger T, Somogyi P. Sleep and Movement Differentiates Actions of Two Types of Somatostatin-Expressing GABAergic Interneuron in Rat Hippocampus. Neuron. 2014;82:872–886. doi: 10.1016/j.neuron.2014.04.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Kempadoo KA, Mosharov EV, Choi SJ, Sulzer D, Kandel ER. Dopamine release from the locus coeruleus to the dorsal Hippocampus promotes spatial learning and memory. PNAS. 2016;113:14835–14840. doi: 10.1073/pnas.1616515114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Khan AG, Poort J, Chadwick A, Blot A, Sahani M, Mrsic-Flogel TD, Hofer SB. Distinct learning-induced changes in stimulus selectivity and interactions of GABAergic interneuron classes in visual cortex. Nature Neuroscience. 2018;21:851–859. doi: 10.1038/s41593-018-0143-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Kim SM, Frank LM. Hippocampal lesions impair rapid learning of a continuous spatial alternation task. PLOS ONE. 2009;4:e5494. doi: 10.1371/journal.pone.0005494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Klausberger T, Márton LF, Baude A, Roberts JD, Magill PJ, Somogyi P. Spike timing of dendrite-targeting bistratified cells during hippocampal network oscillations in vivo. Nature Neuroscience. 2004;7:41–47. doi: 10.1038/nn1159. [DOI] [PubMed] [Google Scholar]
  44. Klausberger T, Somogyi P. Neuronal diversity and temporal dynamics: the unity of hippocampal circuit operations. Science. 2008;321:53–57. doi: 10.1126/science.1149381. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Kubie JL, Muller RU, Bostock E. Spatial firing properties of hippocampal theta cells. The Journal of Neuroscience. 1990;10:1110–1123. doi: 10.1523/JNEUROSCI.10-04-01110.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Lamsa K, Heeroma JH, Kullmann DM. Hebbian LTP in feed-forward inhibitory interneurons and the temporal fidelity of input discrimination. Nature Neuroscience. 2005;8:916–924. doi: 10.1038/nn1486. [DOI] [PubMed] [Google Scholar]
  47. Larkum ME, Kaiser KM, Sakmann B. Calcium electrogenesis in distal apical dendrites of layer 5 pyramidal cells at a critical frequency of back-propagating action potentials. PNAS. 1999;96:14600–14604. doi: 10.1073/pnas.96.25.14600. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Lee S, Kruglikov I, Huang ZJ, Fishell G, Rudy B. A disinhibitory circuit mediates motor integration in the somatosensory cortex. Nature Neuroscience. 2013;16:1662–1670. doi: 10.1038/nn.3544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Lee SH, Marchionni I, Bezaire M, Varga C, Danielson N, Lovett-Barron M, Losonczy A, Soltesz I. Parvalbumin-positive basket cells differentiate among hippocampal pyramidal cells. Neuron. 2014;82:1129–1144. doi: 10.1016/j.neuron.2014.03.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Letzkus JJ, Wolff SB, Meyer EM, Tovote P, Courtin J, Herry C, Lüthi A. A disinhibitory microcircuit for associative fear learning in the auditory cortex. Nature. 2011;480:331–335. doi: 10.1038/nature10674. [DOI] [PubMed] [Google Scholar]
  51. Lovett-Barron M, Turi GF, Kaifosh P, Lee PH, Bolze F, Sun X-H, Nicoud J-F, Zemelman BV, Sternson SM, Losonczy A. Regulation of neuronal input transformations by tunable dendritic inhibition. Nature Neuroscience. 2012;15:423–430. doi: 10.1038/nn.3024. [DOI] [PubMed] [Google Scholar]
  52. Lovett-Barron M, Kaifosh P, Kheirbek MA, Danielson N, Zaremba JD, Reardon TR, Turi GF, Hen R, Zemelman BV, Losonczy A. Dendritic Inhibition in the Hippocampus Supports Fear Learning. Science. 2014;343:857–863. doi: 10.1126/science.1247485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Makino H, Komiyama T. Learning enhances the relative impact of top-down processing in the visual cortex. Nature Neuroscience. 2015;18:1116–1122. doi: 10.1038/nn.4061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Mardinly AR, Spiegel I, Patrizi A, Centofante E, Bazinet JE, Tzeng CP, Mandel-Brehm C, Harmin DA, Adesnik H, Fagiolini M, Greenberg ME. Sensory experience regulates cortical inhibition by inducing IGF1 in VIP neurons. Nature. 2016;531:371–375. doi: 10.1038/nature17187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Marshall L, Henze DA, Hirase H, Leinekugel X, Dragoi G, Buzsáki G. Hippocampal pyramidal cell-interneuron spike transmission is frequency dependent and responsible for place modulation of interneuron discharge. The Journal of Neuroscience. 2002;22:RC197. doi: 10.1523/JNEUROSCI.22-02-j0001.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. McClelland JL, McNaughton BL, O'Reilly RC. Why there are complementary learning systems in the hippocampus and neocortex: Insights from the successes and failures of connectionist models of learning and memory. Psychological Review. 1995;102:419–457. doi: 10.1037/0033-295X.102.3.419. [DOI] [PubMed] [Google Scholar]
  57. McCloskey M, Cohen NJ. Catastrophic Interference in Connectionist Networks: The Sequential Learning Problem. In: Bower G. H, editor. Psychology of Learning and Motivation. Vol. 24. Academic Press; 1989. pp. 109–165. [DOI] [Google Scholar]
  58. McNamara CG, Tejero-Cantero Á, Trouche S, Campo-Urriza N, Dupret D. Dopaminergic neurons promote hippocampal reactivation and spatial memory persistence. Nature Neuroscience. 2014;17:1658–1660. doi: 10.1038/nn.3843. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. McQuiston AR, Madison DV. Muscarinic receptor activity induces an afterdepolarization in a subpopulation of hippocampal CA1 interneurons. The Journal of Neuroscience. 1999;19:5703–5710. doi: 10.1523/JNEUROSCI.19-14-05703.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Mizuseki K, Diba K, Pastalkova E, Buzsáki G. Hippocampal CA1 pyramidal cells form functionally distinct sublayers. Nature Neuroscience. 2011;14:1174–1181. doi: 10.1038/nn.2894. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Muller RU, Kubie JL. The effects of changes in the environment on the spatial firing of hippocampal complex-spike cells. The Journal of Neuroscience. 1987;7:1951–1968. doi: 10.1523/JNEUROSCI.07-07-01951.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Najafi F, Elsayed GF, Cao R, Pnevmatikakis E, Latham PE, Cunningham J, Churchland AK. Excitatory and inhibitory subnetworks are equally selective during decision-making and emerge simultaneously during learning. bioRxiv. 2019 doi: 10.1101/354340. [DOI] [PMC free article] [PubMed]
  63. Nitz D, McNaughton B. Differential modulation of CA1 and dentate gyrus interneurons during exploration of novel environments. Journal of Neurophysiology. 2004;91:863–872. doi: 10.1152/jn.00614.2003. [DOI] [PubMed] [Google Scholar]
  64. Packer AM, Yuste R. Dense, unspecific connectivity of neocortical parvalbumin-positive interneurons: a canonical microcircuit for inhibition? Journal of Neuroscience. 2011;31:13260–13271. doi: 10.1523/JNEUROSCI.3131-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Pelkey KA, Chittajallu R, Craig MT, Tricoire L, Wester JC, McBain CJ. Hippocampal GABAergic inhibitory interneurons. Physiological Reviews. 2017;97:1619–1747. doi: 10.1152/physrev.00007.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Pi HJ, Hangya B, Kvitsiani D, Sanders JI, Huang ZJ, Kepecs A. Cortical interneurons that specialize in disinhibitory control. Nature. 2013;503:521–524. doi: 10.1038/nature12676. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Pouille F, Scanziani M. Enforcement of temporal fidelity in pyramidal cells by somatic feed-forward inhibition. Science. 2001;293:1159–1163. doi: 10.1126/science.1060342. [DOI] [PubMed] [Google Scholar]
  68. Pouille F, Scanziani M. Routing of spike series by dynamic circuits in the hippocampus. Nature. 2004;429:717–723. doi: 10.1038/nature02615. [DOI] [PubMed] [Google Scholar]
  69. Rao-Ruiz P, Yu J, Kushner SA, Josselyn SA. Neuronal competition: microcircuit mechanisms define the sparsity of the engram. Current Opinion in Neurobiology. 2019;54:163–170. doi: 10.1016/j.conb.2018.10.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Royer S, Zemelman BV, Losonczy A, Kim J, Chance F, Magee JC, Buzsáki G. Control of timing, rate and bursts of hippocampal place cells by dendritic and somatic inhibition. Nature Neuroscience. 2012;15:769–775. doi: 10.1038/nn.3077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Ruediger S, Vittori C, Bednarek E, Genoud C, Strata P, Sacchetti B, Caroni P. Learning-related feedforward inhibitory connectivity growth required for memory precision. Nature. 2011;473:514–518. doi: 10.1038/nature09946. [DOI] [PubMed] [Google Scholar]
  72. Sara SJ, Dyon-Laurent C, Hervé A. Novelty seeking behavior in the rat is dependent upon the integrity of the noradrenergic system. Cognitive Brain Research. 1995;2:181–187. doi: 10.1016/0926-6410(95)90007-1. [DOI] [PubMed] [Google Scholar]
  73. Sato M, Kawano M, Mizuta K, Islam T, Lee MG, Hayashi Y. Hippocampus-Dependent goal localization by Head-Fixed mice in virtual reality. Eneuro. 2017;4:ENEURO.0369-16.2017. doi: 10.1523/ENEURO.0369-16.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Schmid LC, Mittag M, Poll S, Steffen J, Wagner J, Geis HR, Schwarz I, Schmidt B, Schwarz MK, Remy S, Fuhrmann M. Dysfunction of Somatostatin-Positive interneurons associated with memory deficits in an Alzheimer's Disease Model. Neuron. 2016;92:114–125. doi: 10.1016/j.neuron.2016.08.034. [DOI] [PubMed] [Google Scholar]
  75. Schmucker C, Seeliger M, Humphries P, Biel M, Schaeffel F. Grating acuity at different luminances in Wild-Type mice and in mice lacking rod or cone function. Investigative Opthalmology & Visual Science. 2005;46:398–407. doi: 10.1167/iovs.04-0959. [DOI] [PubMed] [Google Scholar]
  76. Sheffield MEJ, Adoff MD, Dombeck DA. Increased prevalence of calcium transients across the dendritic arbor during place field formation. Neuron. 2017;96:490–504. doi: 10.1016/j.neuron.2017.09.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Sheffield ME, Dombeck DA. Calcium transient prevalence across the dendritic arbour predicts place field properties. Nature. 2015;517:200–204. doi: 10.1038/nature13871. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Slotnick B. A simple 2-transistor touch or lick detector circuit. Journal of the Experimental Analysis of Behavior. 2009;91:253–255. doi: 10.1901/jeab.2009.91-253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Soltesz I, Losonczy A. CA1 pyramidal cell diversity enabling parallel information processing in the hippocampus. Nature Neuroscience. 2018;21:484–493. doi: 10.1038/s41593-018-0118-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Stefanelli T, Bertollini C, Lüscher C, Muller D, Mendez P. Hippocampal Somatostatin Interneurons Control the Size of Neuronal Memory Ensembles. Neuron. 2016;89:1074–1085. doi: 10.1016/j.neuron.2016.01.024. [DOI] [PubMed] [Google Scholar]
  81. Steward O, Tomasulo R, Levy WB. Blockade of inhibition in a pathway with dual excitatory and inhibitory action unmasks a capability for LTP that is otherwise not expressed. Brain Research. 1990;516:292–300. doi: 10.1016/0006-8993(90)90930-A. [DOI] [PubMed] [Google Scholar]
  82. Takeuchi T, Duszkiewicz AJ, Sonneborn A, Spooner PA, Yamasaki M, Watanabe M, Smith CC, Fernández G, Deisseroth K, Greene RW, Morris RG. Locus coeruleus and dopaminergic consolidation of everyday memory. Nature. 2016;537:357–362. doi: 10.1038/nature19325. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Tanaka KZ, He H, Tomar A, Niisato K, Huang AJY, McHugh TJ. The hippocampal engram maps experience but not place. Science. 2018;361:392–397. doi: 10.1126/science.aat5397. [DOI] [PubMed] [Google Scholar]
  84. Turi GF, Li WK, Chavlis S, Pandi I, O'Hare J, Priestley JB, Grosmark AD, Liao Z, Ladow M, Zhang JF, Zemelman BV, Poirazi P, Losonczy A. Vasoactive intestinal Polypeptide-Expressing interneurons in the Hippocampus support Goal-Oriented spatial learning. Neuron. 2019;101:1150–1165. doi: 10.1016/j.neuron.2019.01.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Varga C, Lee SY, Soltesz I. Target-selective GABAergic control of entorhinal cortex output. Nature Neuroscience. 2010;13:822–824. doi: 10.1038/nn.2570. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Varga C, Golshani P, Soltesz I. Frequency-invariant temporal ordering of interneuronal discharges during hippocampal oscillations in awake mice. PNAS. 2012;109:E2726–E2734. doi: 10.1073/pnas.1210929109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Vinck M, Batista-Brito R, Knoblich U, Cardin JA. Arousal and locomotion make distinct contributions to cortical activity patterns and visual encoding. Neuron. 2015;86:740–754. doi: 10.1016/j.neuron.2015.03.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Wamsley B, Fishell G. Genetic and activity-dependent mechanisms underlying interneuron diversity. Nature Reviews Neuroscience. 2017;18:299–309. doi: 10.1038/nrn.2017.30. [DOI] [PubMed] [Google Scholar]
  89. Whitlock JR, Heynen AJ, Shuler MG, Bear MF. Learning induces long-term potentiation in the Hippocampus. Science. 2006;313:1093–1097. doi: 10.1126/science.1128134. [DOI] [PubMed] [Google Scholar]
  90. Wigström H, Gustafsson B. Large long-lasting potentiation in the dentate gyrus in vitro during blockade of inhibition. Brain Research. 1983;275:153–158. doi: 10.1016/0006-8993(83)90428-6. [DOI] [PubMed] [Google Scholar]
  91. Wilson MA, McNaughton BL. Dynamics of the hippocampal ensemble code for space. Science. 1993;261:1055–1058. doi: 10.1126/science.8351520. [DOI] [PubMed] [Google Scholar]
  92. Wolff SB, Gründemann J, Tovote P, Krabbe S, Jacobson GA, Müller C, Herry C, Ehrlich I, Friedrich RW, Letzkus JJ, Lüthi A. Amygdala interneuron subtypes control fear learning through disinhibition. Nature. 2014;509:453–458. doi: 10.1038/nature13258. [DOI] [PubMed] [Google Scholar]
  93. Znamenskiy P, Kim M-H, Muir DR, Iacaruso MF, Hofer SB, Mrsic-Flogel TD. Functional selectivity and specific connectivity of inhibitory neurons in primary visual cortex. bioRxiv. 2018 doi: 10.1101/294835. [DOI]

Decision letter

Editor: Helen Scharfman1
Reviewed by: Andre A Fenton2

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your article "Structured Inhibitory Activity Dynamics During Learning" for consideration by eLife. Your article has been reviewed by three peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Laura Colgin as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Andre A Fenton (Reviewer #3).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary of major concerns:

1) Clarify the behavioral task and other aspects of the paper.

Regarding the task, the interpretation is not clear (e.g., does it study learning specifically). It also is not clear whether the authors adequately controlled for several potentially important variables, such as bright light. Clarification about locomotion as a variable is also important to address (see comments of reviewers 1 and 2). In the subsection “Consistent inhibitory structure during learning”, the rationale for the experiments is not clear (see comments of reviewer 2).

2) The model is useful but the authors need to address the ability to make predictions about both 'New' as well as 'Fam' (comments of reviewers 1 and 2).

3) Regarding sample sizes and statistics, sample sizes were small. Also, stating the numbers of animals is necessary for each analysis. For statistics, critical values and degrees of freedom should be included. Also, for transparency, raw data should be included, not only normalized values.

Essential revisions:

The paper would be substantially strengthened by addressing specificity of the findings to the somatostatin subtype of interneuron (see comments of reviewer 1). This could be achieved by more experiments to look at another interneuron subtype. However it could be that some data already available or new analyses can address this point. Note that when making the revisions, the authors need to be careful about what this study (and other studies) proved and did not prove about interneurons. For example, activity of interneurons is what was studied here, not inhibition of pyramidal cells (see comments of reviewer 2). Population analyses, including analysis of cell pairs, rather than individual cells, could be an important and welcome addition (reviewer 3).

Reviewer #1:

In this very nice study, the authors tackle the complex problem of untangling changes in the hippocampus during spatial learning. They use 2-photon imaging of genetically identified interneurons in the hippocampus during learning and performance of a well-characterized linear track task, where the animals have to learn to navigate a virtual environment to acquire rewards. Overall, there are some intriguingly novel results, and the authors have given some thought to not only the observational data but also to modeling the behavior and the evolution of neural responses. However, there are several places where a deeper dive into the data will likely yield more satisfying results.

One thread that runs through the data is that the activity of the SOM interneurons is highly correlated with locomotion. It is thus surprising that the authors do not spend more time and effort to dissociate the elements of the task, including locomotion, reward availability, learning trajectory, and familiarity. Although the authors note that mean running speed does not change in the 'New' condition as compared to the 'Fam' condition, the dynamics of running are clearly different. In the 'Fam' condition, the running peaks in between the reward sites and the animals slow down as they get closer to the reward. The SOM interneuron activity follows these dynamics closely. In the 'New' condition, the running is fairly constant around the mean and the SOM interneurons are also constant. The interneurons do decrease in activity, but based on this data it is possible that the SOM cells are merely tied to the acceleration/deceleration dynamics. It would be informative to include a condition where the animals observed familiar and novel environments and received reward but did not locomote.

Likewise, the model results in Figure 3 and the supplement raise as many questions as they answer. The model does a pretty good job of recapitulating the SOM interneuron activity in the 'New' condition, based solely on locomotion, position, and rewards. Indeed, speed and forward motion are the largest influences on the model fits. As noted above, these results suggest that the SOM activity is closely tied to the animal's movement. Finally, the authors do not sufficiently address why the model fails to generate predictions for the Fam' condition that are as good as the predictions for the Fam condition.

Are the occasional peaks in SOM activity during 'New' correlated with some behavioral event?

Are these findings generic to all interneurons, or only SOM cells? What do excitatory neurons do during this learning period?

Introduction, third paragraph: interneuron contributions to learning have indeed been studied previously, just not in hippocampus – see Makino and Komiyama, 2015, etc.

Reviewer #2:

In this study, the authors used in vivo calcium imaging from somatostatin interneurons over multiple days coupled with a virtual reality set up to investigate the learning-related changes in activity of these inhibitory interneurons in a spatial learning task. The authors described a diminution of activity when animals are in a new environment compared to a familiar environment, with a gradual return of activity associated with familiarization to the new environment. In contrast, in animals surrounded by black screens that can locomote freely with no specific task, somatostatin interneuron activity is also massively reduced during the first exposures and the suppression is maintained during several days, suggesting that the return of activity in the new environment is tied with the re-learning of goal locations instead of a context habituation.

Globally the manuscript is well-written, figures are easy to read and interpret, and the findings are interesting. The paper provides novel information about detailed activity of an identified hippocampal interneuron type during rewarded navigation in a virtual environment, as well as activity changes with exposure to the task in a novel environment, a task analogous to "remapping". Although the findings appear correct and convincing, I find that some conclusion statements are overstatements, particularly about re-learning the goal location task and the "no task" learning. In addition, several points need to be addressed by the authors.

1) The main point of the study is to report activity of SOM-int. However, throughout the paper the authors interchangeably use the term inhibition instead of SOM-int activity. Of course SOM-ints are inhibitory interneurons but since the authors never assessed if the SOM-int activity resulted in inhibition of pyramidal cells, it is incorrect to refer to inhibition (or inhibitory activity), instead of inhibitory cell activity. This needs to be corrected throughout the manuscript (Abstract, Introduction (last paragraph), Results (subsection “SOM-int activity suppression during learning in new environment”, second and fifth paragraphs), Discussion (first paragraph)).

2) It is not very clear what kind of learning is occurring in the new environment task. The animal is trained first to run across a visual-cued track (fam env) to obtain reward at the end, then turn around and repeat in the other direction. This task takes weeks to learn. Then switching to the new environment, the animal must keep running to each end of the new environment for reward. So the learned behavior is running to end, which is not an ideal spatial learning task. Also in the new environment, the animal must learn to run to the end of the new environment, regardless of visual cues. It is not clear to me if as stated by the authors "mice need to re-learn the task with new visual cues" in the new environment. Rather it seems the same task in a new visual cued environment (remapping?). In fact the performance of the mice in the new environment is significantly different from that in the fam environment, only for the first exposure. Over the next 2-5 days, most performance measures (Figure 1 and Figure 1—figure supplement 1; most importantly correct licks) are not different from the familiar environment, except for deceleration before reward. In addition, the changes in cell activity in new environment (Figure 2E-H) are also significantly different for day 1 only. To me this is not an unambiguous learning task, but a change of context in which the animal adapts the initial running task. Thus, the authors must change their interpretation of the results regarding learning in the new environment (Abstract, end of Introduction, Results and Discussion).

3) Authors state in the text that they used a "large cohort of mice" to characterize behavioral performances and use a subset to perform somatostatin interneurons imaging. The authors report only 9 mice (5 male, 4 female) for behavioral data and 8 mice (6 male, 2 female) for SOM-int imaging which is NOT a subset (at least concerning male mice). This needs clarification. Also 9 mice is not a large cohort for behavioral experiments, so this should be corrected. It would be nice to put number of animals used in each experiment in the text core to avoid back and forth consultation with the material section.

4) A lot of plots in this study have normalized values. It would be interesting to communicate more about raw values, even if effects are not as impressive as with normalized representation, just to get an idea of mice mean performances (Figure 1C, E, Figure 1—figure supplement 1E; Figure 2F, H; Figure 4F). Also, Figure 1D shows a difference in mean deceleration before a reward in a new and a fam environment without specifying the average speed of animals or number of rewards used to calculate the deceleration vector. Also it appears that the example given in Figure 1B is not representative. The performance in new environment appears much worse than the group data (1C). A more representative example would be relevant.

5) Another concern is the method used in the Figure 3 to answer a question raised by the authors. They recognize that the effect they observed could be linked with the locomotion of the animal as they described locomotion-correlated and anti-correlated Som-int activity in their previous work (Arriaga and Han, 2017). To address this question, they used a General Linear Model, that is a useful and supportive tool, but it should be considered an additional analysis of their data. Indeed, one would expect the analysis to show the direct correlation between the locomotion or speed of the animal and cell activity. It is one of the most important correlation to establish in association with the GLM. Another concern is the result (Figure 3—figure supplement 1) that the model established in the Fam situation, does not predict well the cell activity in the Fam' (thru day 1-5) when behavior returned similar to Fam. This seems to indicate that the model is not a good predictor of fluo/activity as a function of various VR and behavioral parameters.

6) Another major concern is about the brightness of screens. One of the drawbacks of virtual reality set up is that they are often bright which could be stressful for the animal and also provide artefacts in imaging. There is some concern about the new environment used in this study because a pattern in the track is clearly darker than other patterns. Could the drastic change of behavior or/and activity seen in this new environment be linked with this darker part VR? For the behavior shown in Figure 1B, do the animal spent more time in the darker part? Indeed the return of activity in New environment experiment is also linked with an improvement of performance in this last one, so with more exposure of the brighter part. In accordance with that, Figure 4 shows a drastic suppression of activity during black screens used in the "no task" exposure. It is thus important to check if the behavioral changes and the suppression effect are still there in a new environment that is equally bright (as the Fam). In addition, concerning the "no task" experiment, it is important to check cell activity with animals placed either with fixed lit screens (for example a fixed visual scene of a maze), or in a new environment with no goal (that is not excluding a learning of the environment but separate the goal directed linked activity).

7) Subsection “Consistent inhibitory structure during learning”, first paragraph. It is not clear to me how the two explanations given by the authors can account for the spectrum of Som-int activity modulation. The first explanation of equipotential interneuron hypothesis is quite unlikely and the second of different interneurons with distinct functional roles is most likely, given the body of literature on interneuron specialization. Also it is not evident how the experiments with a 2nd new environment was a test of the 2 explanations. The experiments are very interesting, but another rationale must be given for them.

Reviewer #3:

This manuscript can be an important contribution to the literature. The authors demonstrate a cell-specific tendency of SOM+ interneurons in hippocampus to decrease their activity in novel environments and that the activity returns across time/exposures to the environment in parallel with learning, not with mere familiarity with the novel environment. The manuscript is clearly written and the data clearly presented. I have a few requests that are intended to improve the already strong paper.

1) The "no-task" control is used to dissociate the confounded relationships between novelty and/or learning on the one hand, and the activity suppression of interest, on the other. It also serves to demonstrate the activity suppression is not due to reduced locomotion. However, it should be acknowledged that the no-task condition changed more than one variable; it removed a goal-directed task component as intended, but it also removed a number of other variables such as consumption of water, visual stimuli, exposure to locomotion-contingent changes in sensation, sensorimotor integration. Any subset or weighted-combination of these changes could have contributed to the initial or maintained no-task activity decrease, not only the exclusion of the conditioned learning component.

Perhaps a better or additional control would have been a new environment with yoked-reward (i.e. reward delivered in a spatially random schedule temporally equivalent to the Fam rewarded trials). This would have removed the ability to learn the location of reward but kept all other physical variables the same. Since such unreinforced exploration of an environment is standard, perhaps the authors have relevant data on activity of these cells in such a condition. In any case the authors need to acknowledge the confound and explain the reasons one should or should not take this confound to be seriously.

Figure 3 shows that the position soon after the change to New is invariant at "0" but the ball is moving. The authors need to explain how that can be.

The one thing I am disappointed by in this paper is the authors did not take real advantage the fact that they recorded many cells at the same time. Almost all analyses performed could have been done if the cells were recorded one at a time across the exposures in 69 or more separate mice. Would population/ensemble analyses have led to the same conclusions without adding anything more? As it is there are important unanswered questions that are contained in the data. For example, to what extent are the within-session fluctuations of dF/F activity correlated across the population of cells? From the presented single cell data, I imagine that the activity correlation structure across the population is stable both within and between exposures and that this even maintains in the New and No-task conditions. However, this need not be the case even though on average, activity decreased across the population. A reader is left wondering. Consistent with their speculative conceptual framework that the cells form a structured inhibitory network (setting an inhibitory infrastructure for activity within the excitatory cell population), the authors should examine how short-time scale population activity vectors evolve within and across the exposures. To evaluate and bolster their conceptual framework, they should also examine if the population of cell-pair activity correlations maintains within and across the exposures. Analysis of these between-cell activity relationships would speak directly to the conceptual framework that the authors promote in the Discussion.

Following on from the preceding comment on lack of between-cell activity relationship investigations, the authors could also determine if the topographic proximity between cell pairs has a role in the inter-cell activity relationships. As mentioned above, these additional analyses would empirically help to address some of the issues that the authors raise in their adroit speculations in the Discussion.

eLife. 2019 Oct 8;8:e47611. doi: 10.7554/eLife.47611.041

Author response


All comments have been addressed and we begin with a summary of major experimental and analysis revisions, with direct point-by-point responses below.

Summary of major revisions

1) To determine if other interneuron classes show similar activity dynamics as somatostatin-expressing interneurons (SOM-ints), we measured calcium activity from parvalbumin-expressing interneurons (PV-ints) during exposure to novel virtual environments, now detailed in Figure 4 and Figure 4—figure supplements 1-6. We confirm similar suppression of activity in new environments in PV-ints.

2) To investigate population dynamics, we examined short-time scale population correlation across different environments (Fam, New, and Fam’) and found decreased population activity similarity between Fam and New that gradually recovered with repeated exposures (Figure 5—figure supplement 2). This loss of population coordination was likely due to a combination of activity suppression and decreased coupling of locomotion to interneuron activity in New.

We examined network activity structure using pairwise cell-cell SOM-int activity correlations. We found these correlations were largely preserved across different environments, including the drastically different “No Task” epoch, indicating persistent activity structure in the interneuron network. (Figure 6—figure supplement 2, 3). This finding of stable activity structure complements and expands on our finding of consistent activity dynamics in new environments.

3) In addition to above changes, we added six new figure supplements and additional panels featuring requested raw data and additional analyses of behavior, calcium activity, and cellular/behavioral correlates of neuronal activity (further detailed below).

4) We added additional mice to existing experiments, four mice to virtual reality (VR) behavior in Figure 1 (N=14) and two mice to SOM-ints in current Figures 2 and 3 (N=10), and Figures 5 and 6 (N=6).

Summary of major concerns:

1) Clarify the behavioral task and other aspects of the paper.

Regarding the task, the interpretation is not clear (e.g., does it study learning specifically).

We take the reviewer’s point that the task (running to alternating ends of a virtual track in new visual environments) is not solely a learning task and quantification of task performance includes multiple variables such as surprise and motor performance (subsection “Virtual reality (VR) behavior”, seventh paragraph). As such, we now avoid the unqualified use of “learning” throughout the paper, except when describing other studies, during speculation, or with specific qualifications. Generally we agree with the statement from reviewer 2 that mice adapt the initial behavior to new environments and we use the specific construction “learning to adapt the task” (e.g. Abstract). The amount and type of learning that is required for this “adaptation” remains unclear (discussed in the aforementioned paragraph) and bears further investigation. Most importantly we have de-emphasized the usage of “learning” throughout the paper to avoid implying any causal relationship between learning and the cellular activity dynamics reported here.

It also is not clear whether the authors adequately controlled for several potentially important variables, such as bright light.

Regarding brightness in VR as a potential confound, we were careful to minimize environmental brightness, both as a behavioral confound and because excess light contributes to noise during calcium imaging. Projectors were set at minimal brightness with an additional darkening film applied to the rear projection screen (subsection “VR track running behavior”, first paragraph). To directly measure brightness, we used a chromameter. Based on the lens configuration of the instrument, we could not measure total brightness to compare luminance across environments but we could measure smaller visual features. Even the brightest VR features (which constitute a small fraction of the entire visual scene) were 3 cd/m2 which is within the mesoscopic range of visual function where mice are frequently behaviorally active (subsection “Virtual reality (VR) behavior”, last paragraph). Furthermore based on track occupancy in different VR worlds, we saw no indication of preferred areas resulting from brightness differences (Figure 1—figure supplement 2). In general, mice were highly motivated to pursue rewards and ran continuously in all worlds, except when consuming reward.

Clarification about locomotion as a variable is also important to address (see comments of reviewers 1 and 2).

We agree that a thorough investigation into the relationship between locomotion and cellular activity is critical and this was the motivation for using general linear models (GLMs) in the original submission. We have supplemented this analysis with additional analyses that more straightforwardly and directly quantify the correlation between various behavioral variables (including locomotion, e.g. forward and rotational components of ball speed, acceleration, etc.) with interneuron calcium activity in familiar and new environments. Similar to the conclusions from our GLM analyses, we found that locomotion variables correlated well with fluctuations in calcium activity in the familiar environment but correlated poorly in the new environment (Figure 3—figure supplement 1, Figure 4—figure supplement 4).

We further investigated the relationship between behavior and neuronal activity by now training GLMs in New. Previously we trained models in Fam (first epoch in the familiar environment) and asked how well those GLMs predicted activity in New (very badly). Here by training the GLMs in New, we do not make the assumption that whatever relationship between behavior and activity that exists in Fam is recapitulated in New. Now if there is any consistent relationship between behavior and neuronal activity in New, modeled fit should be high. In fact, GLMs trained in New poorly predicted neuronal activity in New (Figure 3—figure supplement 4, Figure 4—figure supplement 6).

Taken together, these new analyses further bolster our original conclusion that locomotion is strongly coupled to interneuron activity (both SOM- and PV-ints) in familiar environments, but contributes much more weakly to activity in new environments. The drivers of remaining activity in new environments remain unclear.

In the subsection “Consistent inhibitory structure during learning”, the rationale for the experiments is not clear (see comments of reviewer 2).

The rationale for investigating the consistency of interneuron activity dynamics across different environments is now more simply stated: do interneurons show stochastic activity dynamics in different virtual worlds, similar to the way hippocampal pyramidal neurons stochastically remap, or are their activity dynamics context-invariant, which suggests a consistent network role for individual interneurons (subsection “Consistent inhibitory structure during across new contexts”, first paragraph).

2) The model is useful but the authors need to address the ability to make predictions about both 'New' as well as 'Fam' (comments of reviewers 1 and 2).

Models trained in Fam did not predict activity as well in Fam’ (the return to the original familiar environment after exposure to New) as it did in Fam. This is likely explained by a combination of time-varying factors (subsection “Interneuron activity suppression in new environment”, seventh paragraph).

First, there is drift in neuronal representations over time, in this case, the relationship between behavioral variables and cellular activity. Thus predicted activity will tend to be best in data temporally close to the data used to train the model, in this case Fam (we note that training and test data are independent, although both within the Fam epoch).

Second, photobleaching contributes to decreased model fit. The model is trained on brighter cells in Fam, resulting in predicted fluorescence changes that are larger than those seen in the same dimmer, photobleached cells in Fam’.

Finally there may be a prolonged network effect of exposure to the New environment that crosses into the subsequent Fam’ epoch, perhaps due to long lasting neuromodulatory effects. These time-varying factors will also affect model fit in the New epoch, although with less effect since less time has elapsed since model training. These incremental, time-dependent effects appear much weaker than the immediate and drastic loss of model prediction in New.

3) Regarding sample sizes and statistics, sample sizes were small. Also, stating the numbers of animals is necessary for each analysis. For statistics, critical values and degrees of freedom should be included. Also, for transparency, raw data should be included, not only normalized values.

We added four mice to the overall behavioral cohort and two mice to the SOM-int cohort. We more clearly label the number of animals used in analysis, either directly on the figure or in the figure legend. Critical values and degrees of freedom are now listed in the excel sheet that accompanies all figures. Extensive raw data (activity traces and behavior) have been included from six SOM+ (Figure 2: SOM 1, Figure 2—figure supplement 2: SOM 2, Figure 2—figure supplement 1: SOM 1-4, Figure 3: SOM 5, Figure 5, SOM 6) and two PV+ mice. Additionally non-normalized performance, speed, and lick rate are displayed in Figure 1—figure supplement 1.

Essential revisions:

The paper would be substantially strengthened by addressing specificity of the findings to the somatostatin subtype of interneuron (see comments of reviewer 1). This could be achieved by more experiments to look at another interneuron subtype. However it could be that some data already available or new analyses can address this point.

We have added data investigating the activity dynamics of PV-int in new virtual environments. Indeed we found that PV-ints are profoundly suppressed in New with recovery of activity over repeated exposures. These data now identify suppression of two major classes of inhibitory neurons in novel environments.

We did not address the activity dynamics of PV-ints across multiple new environments in this revision. Preliminary data from PV-ints show some similarity to SOM-ints in this regard, but significant differences. We feel that meaningfully investigating these differences is beyond the scope of this current work. As such, we would prefer to more thoroughly explore these issues in subsequent work.

Note that when making the revisions, the authors need to be careful about what this study (and other studies) proved and did not prove about interneurons. For example, activity of interneurons is what was studied here, not inhibition of pyramidal cells (see comments of reviewer 2).

This is a very important distinction and we have adopted this more precise usage throughout the paper.

Population analyses, including analysis of cell pairs, rather than individual cells, could be an important and welcome addition (reviewer 3).

See Summary of major revisions (#2) above for population analysis revisions.

Reviewer #1:

[…] Overall, there are some intriguingly novel results, and the authors have given some thought to not only the observational data but also to modeling the behavior and the evolution of neural responses. However, there are several places where a deeper dive into the data will likely yield more satisfying results.

One thread that runs through the data is that the activity of the SOM interneurons is highly correlated with locomotion. It is thus surprising that the authors do not spend more time and effort to dissociate the elements of the task, including locomotion, reward availability, learning trajectory, and familiarity. Although the authors note that mean running speed does not change in the 'New' condition as compared to the 'Fam' condition, the dynamics of running are clearly different. In the 'Fam' condition, the running peaks in between the reward sites and the animals slow down as they get closer to the reward. The SOM interneuron activity follows these dynamics closely. In the 'New' condition, the running is fairly constant around the mean and the SOM interneurons are also constant. The interneurons do decrease in activity, but based on this data it is possible that the SOM cells are merely tied to the acceleration/deceleration dynamics. It would be informative to include a condition where the animals observed familiar and novel environments and received reward but did not locomote.

As noted above, we now directly examine correlation between behavioral variables and cellular activity. We agree that the dynamics between locomotion and activity is critical, which led us to use GLMs to more rigorously and quantitatively model this relationship in individual cells. Acceleration contributed little to the modeled fit of activity (Figure 3—figure supplement 1, Figure 3—figure supplement 4, Figure 4—figure supplement 5, Figure 4—figure supplement 7).

The role of reward is likely central to some of the effects we observe. Dissociating reward from locomotion modulation of interneurons would be highly desirable but preventing head-fixed animals from moving is highly stressful for them. This could be more reasonably accomplished using a different task with trial structure, rather than free running.

Likewise, the model results in Figure 3 and the supplement raise as many questions as they answer. The model does a pretty good job of recapitulating the SOM interneuron activity in the 'New' condition, based solely on locomotion, position, and rewards. Indeed, speed and forward motion are the largest influences on the model fits. As noted above, these results suggest that the SOM activity is closely tied to the animal's movement. Finally, the authors do not sufficiently address why the model fails to generate predictions for the Fam' condition that are as good as the predictions for the Fam condition.

See above, Major Concerns, #1 (Clarificationabout locomotion…) and 2.

Are the occasional peaks in SOM activity during 'New' correlated with some behavioral event?

Occasional peaks in SOM-int activity were intriguing but we did not identify any obvious behavioral correlates (such as rewards, etc.). However, future in-depth analysis of behavior or internal states may shed light on their origins.

Are these findings generic to all interneurons, or only SOM cells? What do excitatory neurons do during this learning period?

As mentioned above, PV-ints also show strong activity suppression in new virtual environments and this data is now included in the paper. Preliminary data from our lab shows that a subset of VIP-expressing interneurons is strongly activated in novel VR worlds, consistent with a role in disinhibition. Several published studies (referenced in the paper) have shown that hippocampal pyramidal neurons remap in new virtual environments. Preliminary data show that after initial remapping, pyramidal neuron representations continue to refine over the next few days before stabilizing.

Introduction, third paragraph: interneuron contributions to learning have indeed been studied previously, just not in hippocampus – see Makino and Komiyama, 2015, etc.

This is now more clearly stated and these references have been added (Introduction, second paragraph).

Reviewer #2:

[…] Although the findings appear correct and convincing, I find that some conclusion statements are overstatements, particularly about re-learning the goal location task and the "no task" learning. In addition, several points need to be addressed by the authors.

1) The main point of the study is to report activity of SOM-int. However, throughout the paper the authors interchangeably use the term inhibition instead of SOM-int activity. Of course SOM-ints are inhibitory interneurons but since the authors never assessed if the SOM-int activity resulted in inhibition of pyramidal cells, it is incorrect to refer to inhibition (or inhibitory activity), instead of inhibitory cell activity. This needs to be corrected throughout the manuscript (Abstract, Introduction (last paragraph), Results (subsection “SOM-int activity suppression during learning in new environment”, second and fifth paragraphs), Discussion (first paragraph)).

We agree and have adopted this more precise usage throughout the paper.

2) It is not very clear what kind of learning is occurring in the new environment task. The animal is trained first to run across a visual-cued track (fam env) to obtain reward at the end, then turn around and repeat in the other direction. This task takes weeks to learn. Then switching to the new environment, the animal must keep running to each end of the new environment for reward. So the learned behavior is running to end, which is not an ideal spatial learning task. Also in the new environment, the animal must learn to run to the end of the new environment, regardless of visual cues. It is not clear to me if as stated by the authors "mice need to re-learn the task with new visual cues" in the new environment. Rather it seems the same task in a new visual cued environment (remapping?). In fact the performance of the mice in the new environment is significantly different from that in the fam environment, only for the first exposure. Over the next 2-5 days, most performance measures (Figure 1 and Figure 1—figure supplement 1; most importantly correct licks) are not different from the familiar environment, except for deceleration before reward. In addition, the changes in cell activity in new environment (Figure 2E-H) are also significantly different for day 1 only. To me this is not an unambiguous learning task, but a change of context in which the animal adapts the initial running task. Thus, the authors must change their interpretation of the results regarding learning in the new environment (Abstract, end of Introduction, Results and Discussion).

See above, Major Concerns, #1.

3) Authors state in the text that they used a "large cohort of mice" to characterize behavioral performances and use a subset to perform somatostatin interneurons imaging. The authors report only 9 mice (5 male, 4 female) for behavioral data and 8 mice (6 male, 2 female) for SOM-int imaging which is NOT a subset (at least concerning male mice). This needs clarification. Also 9 mice is not a large cohort for behavioral experiments, so this should be corrected. It would be nice to put number of animals used in each experiment in the text core to avoid back and forth consultation with the material section.

We apologize for the poor wording. This phrasing has been changed and additional animals have been added to the behavior cohort and the SOM-int cohort.

4) A lot of plots in this study have normalized values. It would be interesting to communicate more about raw values, even if effects are not as impressive as with normalized representation, just to get an idea of mice mean performances (Figure 1C, E, Figure 1—figure supplement 1E; Figure 2F, H; Figure 4F). Also, Figure 1D shows a difference in mean deceleration before a reward in a new and a fam environment without specifying the average speed of animals or number of rewards used to calculate the deceleration vector. Also it appears that the example given in Figure 1B is not representative. The performance in new environment appears much worse than the group data (1C). A more representative example would be relevant.

See above Major Concerns, #3.

Average speed is shown in Figure 1—figure supplement 1F. Deceleration before all rewards in the epoch were averaged (ranging from 13 – 30 rewards/epoch on average) (subsection “Behavior Analysis”).

5) Another concern is the method used in the Figure 3 to answer a question raised by the authors. They recognize that the effect they observed could be linked with the locomotion of the animal as they described locomotion-correlated and anti-correlated Som-int activity in their previous work (Arriaga and Han, 2017). To address this question, they used a General Linear Model, that is a useful and supportive tool, but it should be considered an additional analysis of their data. Indeed, one would expect the analysis to show the direct correlation between the locomotion or speed of the animal and cell activity. It is one of the most important correlation to establish in association with the GLM. Another concern is the result (Figure 3—figure supplement 1) that the model established in the Fam situation, does not predict well the cell activity in the Fam' (thru day 1-5) when behavior returned similar to Fam. This seems to indicate that the model is not a good predictor of fluo/activity as a function of various VR and behavioral parameters.

See above, Major Concerns, #1 (Clarification about locomotion…) and 2.

6) Another major concern is about the brightness of screens. One of the drawbacks of virtual reality set up is that they are often bright which could be stressful for the animal and also provide artefacts in imaging. There is some concern about the new environment used in this study because a pattern in the track is clearly darker than other patterns. Could the drastic change of behavior or/and activity seen in this new environment be linked with this darker part VR? For the behavior shown in Figure 1B, do the animal spent more time in the darker part? Indeed the return of activity in New environment experiment is also linked with an improvement of performance in this last one, so with more exposure of the brighter part. In accordance with that, Figure 4 shows a drastic suppression of activity during black screens used in the "no task" exposure. It is thus important to check if the behavioral changes and the suppression effect are still there in a new environment that is equally bright (as the Fam). In addition, concerning the "no task" experiment, it is important to check cell activity with animals placed either with fixed lit screens (for example a fixed visual scene of a maze), or in a new environment with no goal (that is not excluding a learning of the environment but separate the goal directed linked activity).

See above, Major Concerns, #1, It also is not clear….

7) Subsection “Consistent inhibitory structure during learning”, first paragraph. It is not clear to me how the two explanations given by the authors can account for the spectrum of Som-int activity modulation. The first explanation of equipotential interneuron hypothesis is quite unlikely and the second of different interneurons with distinct functional roles is most likely, given the body of literature on interneuron specialization. Also it is not evident how the experiments with a 2nd new environment was a test of the 2 explanations. The experiments are very interesting, but another rationale must be given for them.

See above, Major Concerns, #1 (… the rationale.…).

We also emphasize more clearly in the revision that these functionally distinct interneurons are likely from the same subtype of inhibitory interneuron (based on anatomical evidence in Figure 6—figure supplement 1C). While functional specialization of distinct interneuron types is well established, little is known about diversity of functional roles within the same type.

Reviewer #3:

[…] I have a few requests that are intended to improve the already strong paper.

1) The "no-task" control is used to dissociate the confounded relationships between novelty and/or learning on the one hand, and the activity suppression of interest, on the other. It also serves to demonstrate the activity suppression is not due to reduced locomotion. However, it should be acknowledged that the no-task condition changed more than one variable; it removed a goal-directed task component as intended, but it also removed a number of other variables such as consumption of water, visual stimuli, exposure to locomotion-contingent changes in sensation, sensorimotor integration. Any subset or weighted-combination of these changes could have contributed to the initial or maintained no-task activity decrease, not only the exclusion of the conditioned learning component.

Perhaps a better or additional control would have been a new environment with yoked-reward (i.e. reward delivered in a spatially random schedule temporally equivalent to the Fam rewarded trials). This would have removed the ability to learn the location of reward but kept all other physical variables the same. Since such unreinforced exploration of an environment is standard, perhaps the authors have relevant data on activity of these cells in such a condition. In any case the authors need to acknowledge the confound and explain the reasons one should or should not take this confound to be seriously.

While the “No Task” condition provided essential data regarding the stability of activity dynamics of individual interneurons in drastically different environments, we agree that it was not the optimal experiment to tease apart the role of task and rewards from activity modulation. We acknowledge and discuss this in more detail (Discussion, third paragraph).

Figure 3 shows that the position soon after the change to New is invariant at "0" but the ball is moving. The authors need to explain how that can be.

Mice can move on the ball but not change their VR position, as seen in Figure 3 shortly after the transition to New. This occurs when animals “run” directly into a VR wall so they are stationary in VR but still running (Figure 3 legend). This typically does not occur in familiar environments.

The one thing I am disappointed by in this paper is the authors did not take real advantage the fact that they recorded many cells at the same time. Almost all analyses performed could have been done if the cells were recorded one at a time across the exposures in 69 or more separate mice. Would population/ensemble analyses have led to the same conclusions without adding anything more? As it is there are important unanswered questions that are contained in the data. For example, to what extent are the within-session fluctuations of dF/F activity correlated across the population of cells? From the presented single cell data, I imagine that the activity correlation structure across the population is stable both within and between exposures and that this even maintains in the New and No-task conditions. However, this need not be the case even though on average, activity decreased across the population. A reader is left wondering. Consistent with their speculative conceptual framework that the cells form a structured inhibitory network (setting an inhibitory infrastructure for activity within the excitatory cell population), the authors should examine how short-time scale population activity vectors evolve within and across the exposures. To evaluate and bolster their conceptual framework, they should also examine if the population of cell-pair activity correlations maintains within and across the exposures. Analysis of these between-cell activity relationships would speak directly to the conceptual framework that the authors promote in the Discussion.

See Summary of Major revisions (#2) above for population analysis revisions.

Following on from the preceding comment on lack of between-cell activity relationship investigations, the authors could also determine if the topographic proximity between cell pairs has a role in the inter-cell activity relationships. As mentioned above, these additional analyses would empirically help to address some of the issues that the authors raise in their adroit speculations in the Discussion.

We examined the relationship between activity and distance between SOM-int cell somata and found a weak, but significant, relationship where closer cells have greater activity correlation in Fam (Figure 6—figure supplement 1J). The origins or importance of this finding is unclear. There was no relationship between cell pair proximity and similarity in magnitude of activity suppression in New (Figure 6—figure supplement 1I).

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Arriaga M, Han E. 2019. Data from: Structured Inhibitory Activity Dynamics During Learning. Dryad Digital Repository. [DOI]

    Supplementary Materials

    Figure 1—source data 1. Statistical tests and results for Figure 1.
    DOI: 10.7554/eLife.47611.005
    Figure 2—source data 1. Statistical tests and results for Figure 2.
    DOI: 10.7554/eLife.47611.011
    Figure 3—source data 1. Statistical tests and results for Figure 3.
    DOI: 10.7554/eLife.47611.018
    Figure 4—source data 1. Statistical tests and results for Figure 4.
    DOI: 10.7554/eLife.47611.026
    Figure 5—source data 1. Statistical tests and results for Figure 5.
    DOI: 10.7554/eLife.47611.030
    Figure 6—source data 1. Statistical tests and results for Figure 6.
    DOI: 10.7554/eLife.47611.035
    Transparent reporting form
    DOI: 10.7554/eLife.47611.036

    Data Availability Statement

    Source data are available at https://doi.org/10.5061/dryad.f83kt85.

    The following dataset was generated:

    Arriaga M, Han E. 2019. Data from: Structured Inhibitory Activity Dynamics During Learning. Dryad Digital Repository.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES