Abstract
Valence detection and processing are essential for the survival of animals and their life quality in complex environments. Neural circuits underlying the transformation of external sensory signals into positive valence coding to generate appropriate behavioral responses remain not well-studied. Here, we report that somatostatin (SOM) subtype of GABAergic neurons in the mouse medial septum complex (MS), but not parvalbumin subtype or glutamatergic neurons, specifically encode reward signals and positive valence. Through an ascending pathway from the nucleus of solitary tract and then parabrachial nucleus, the MS SOM neurons receive rewarding taste signals and suppress the lateral habenula. They contribute essentially to appetitive associative learning via their projections to the lateral habenula: learning enhances their responses to reward-predictive sensory cues, and suppressing their responses to either conditioned or unconditioned stimulus impairs acquisition of reward learning. Thus, MS serves as a critical hub for transforming bottom-up sensory signals to mediate appetitive behaviors.
Subject terms: Emotion, Neural circuits, Sensory processing
Reward behaviour such as food seeking is essential for survival. Shen et al. describe an ascending neural pathway mediating transformation of rewarding taste signals and reward-cue associative learning via somatostatin neurons in the medial septum.
Introduction
Detecting the positive or negative valence of external sensory cues and transforming the valence value into appropriate behavioral reactions are essential for animals’ survival and well-being in complex and challenging environments. Negative valence signals induce aversion/avoidance behaviors, while positive valence signals result in appetitive/approaching behaviors1–3. Although extensive previous studies have been focused on how neural structures in an emotional processing network and their distinct cell groups contribute to the aversion or rewarding type of behaviors1,2,4–6, how ascending sensory signals are transmitted to these structures is poorly studied in general. There has been increasing evidence suggesting that for each sensory modality there could be a distinct ascending neural pathway devoted to valance detection and processing, which is independent of the canonical thalamocortical pathway for generating sensory perception. For example, the medial septum complex (MS) in the basal forebrain receives bottom-up aversive sensory information of multiple modalities, such as auditory and somatosensory signals, from pontine nuclei and mediates related sensory-induced avoidance behaviors7,8. In comparison, analogous bottom-up sensory pathways to process and transform positive valence signals into appetitive behaviors have been less well understood2,9.
For an efficient and economic architecture of emotional processing networks, multisensory signals of the same valence from distinct ascending pathways may arrive and converge at the same critical processing node, following which a common effector (motor-related) pathway is modulated to elicit an aversive or approaching behavior. Within a well-studied emotional processing network2,9, the lateral habenula (LHb) has been identified as one of the critical structures to mediate reward/aversion10–12. It modulates both dopamine (DA) and serotonin (5-HT) neuromodulatory systems, which are considered to be central for reward/aversion decision making2,9. A distinct feature of LHb is that it contains predominantly excitatory glutamatergic neurons13. These neurons are found to respond opposingly to sensory stimuli with positive and negative valences, i.e. being activated by aversive signals and suppressed by reward signals10,14. Several input structures upstream of LHb, such as the ventral pallidum and lateral preoptic area, project both excitatory and inhibitory axons to LHb5,15,16. The excitatory projections are found to carry negative valence, which activates LHb to negatively influence the DA and 5-HT systems partly through the GABAergic rostromedial tegmental nucleus17,18, resulting in anti-rewarding effects or aversion9,11. On the other hand, the GABAergic projections carry positive valence, which reduces LHb activity and results in rewarding emotional effects5,15. Thus, by receiving convergent but opponent positive and negative valence signals, LHb may serve as a hub to integrate valence information from a variety of sources.
Both top-down and bottom-up information is relayed to LHb. Among its input sources, MS has been found to transmit bottom-up aversive sensory signals to LHb via its glutamatergic neurons, resulting in negative emotion related behaviors6,8. Although GABAergic neurons in MS are also found to innervate LHb neurons7, their functional contributions under different behavioral contexts remain largely unclear. MS contains different subtypes of GABAergic neurons, including parvalbumin (PV) and somatostatin (SOM) expressing neurons, which may receive distinct inputs and form intra-nuclear connections with glutamatergic neurons19,20. One possibility is that some specific types of MS GABAergic neurons may receive and transform ascending reward-related information via distinct sensory pathways to mediate appetitive behaviors.
In the present study, we tested this idea by combining behavioral assays, in vivo and in vitro electrophysiology as well as opto/chemogenetic and pharmacological manipulations. We found that MS GABAergic neurons encode positive motivational valence, respond preferentially to rewarding gustatory signals relayed from the parabrachial nucleus (PBN) in the pons, primarily through the SOM subtype of these inhibitory neurons. The SOM neurons in MS acquire responses to reward-predicative cues through cue-reward associative learning and mediate the acquisition of cue-reward association via their projections to LHb. Together, these results highlight an essential role of MS in the formation of reward associative memory.
Results
MS GABAergic neurons encode reward signals
To test whether MS GABAergic neurons can encode positive valence, we utilized a pan-GABAergic Cre diver line, Vgat-Cre, and injected adeno-associated virus (AAV) encoding Cre-dependent channelrhodopsin2 (ChR2)21 (or GFP alone as control) to activate GABAergic neurons in the medial septum complex (shortened as MS in this study). Post-hoc histology revealed that the infected area included the medial septum and the medial part of diagonal band nucleus (NDB) (Supplementary Fig. 1a). For photostimulation, 470 nm blue LED light (at 20 Hz, 5 ms pulse duration) was delivered to MS via an implanted optical fiber (see “Methods”). We first adopted a two-choice self-stimulation paradigm to assess the reinforcing value associated with the activation. LED stimulation was triggered whenever the animal nose-poked the designated LED-on port, whereas nose-poking the other port did not trigger any photostimulation (Fig. 1a). During a 30-min test session, we observed a much greater number of pokes into the LED-associated port than the control port, while in GFP-expressing control mice the number of pokes was not significantly different between the two ports (Fig. 1b, c). This result demonstrates that acute optogenetic activation of MS GABAergic neurons reinforces self-stimulation behavior, which is an important hallmark for reward-encoding neurons22–24. Moreover, in a two-chamber light-dark box test25, ChR2-expressing animals spent significantly more time in the light chamber, which was paired with LED stimulation, than GFP control mice (Fig. 1d, e), consistent with the notion that activation of these neurons is rewarding and counteracts the tendency of mice to avoid bright areas25. In a two-chamber real-time place preference (RTPP) test, LED stimulation was applied whenever the animal entered and stayed in the designated stimulation (LED-on) chamber. Similar to GAD2-Cre animals7,26, we found that the ChR2-expressing Vgat-Cre animals spent significantly more time in the stimulation chamber compared to GFP-expressing controls (Fig. 1f, g), confirming that activation of MS GABAergic neurons as a population drives place preference and thus is rewarding. Together, these results indicate that MS GABAergic neurons encode positive motivational valence.
Using optrode recording (see “Methods”), we next directly examined whether MS GABAergic neurons could respond to reward signals in awake head-fixed mice licking sucrose water (5% w/v, 10 µL per trial) (Fig. 2a). The GABAergic neurons were tagged either by crossing the Vgat-Cre with Ai27 (Cre-dependent ChR2) mice or by injecting AAV-floxed-ChR2 into MS of Vgat-Cre animals. For ChR2-tagged (GABAergic) neurons, an increase of firing rate following the licking onset was observed in a majority of the neurons recorded (79%, 59 out of 75 units), while 13% of the population (10/75) showed no change in firing rate and 8% (6/75) showed a decrease in firing rate (Fig. 2b, c). In the untagged population which presumably contained largely non-GABAergic neurons, only 24% (23/94) showed excitatory responses to sucrose and the majority showed no responses (63%, 59/94) (Fig. 2b, c). In all the recorded neurons that showed excitatory responses to sucrose, 72% were ChR2-tagged (Fig. 2d). These results suggest that the GABAergic population was preferentially activated by sucrose licking. The firing rate increase in the GABAergic neurons was due to the consumption of sucrose water (i.e. reward) but not to the licking behavior per se, as it was not observed in trials when the animal licked but no sucrose water was delivered (Fig. 2g). We performed similar recordings in animals passively receiving quinine (5 mM, 10 µL), a bitter tastant, via an intraoral cheek fistula into the oral cavity (see “Methods”). We did not observe significant changes in firing rate (Fig. 2g). In contrast to the GABAergic neurons, MS glutamatergic neurons, as tagged in Vglut2-Cre mice, did not respond to sucrose consumption but were activated by quinine (Fig. 2e–g), consistent with the previous finding that activity of these neurons encodes aversive emotional outcome7. Together, these results indicate that MS GABAergic neurons as a population are preferentially activated by reward signals in contrary to their glutamatergic counterpart. More importantly, their responses to sucrose became stronger with increasing volumes (3, 6, 10 µL) consumed (Fig. 2h, i), suggesting that the activity level can encode reward value.
PBN provides bottom-up rewarding gustatory input to MS
We wondered what input source relayed the rewarding (sweet) gustatory input to MS. It is known from work in rodents that taste information from peripheral receptors is relayed centrally to the nucleus of the solitary tract (NST) and then to the PBN in the pons27,28. A number of taste-related projections arise from PBN, with some carrying taste information to the gustatory thalamus (VPMpc) before reaching the gustatory cortex29,30. To identify the input source of MS GABAergic neurons, we performed cell-type specific monosynaptic input tracing with pseudo-typed rabies virus31,32 (Fig. 3a). We observed numerous retrogradely labeled cells in PBN, but not in NST, VPMpc or gustatory cortex (Fig. 3b). Consistent with previous work33, in vivo extracellular recording revealed that PBN neurons responded to sucrose by increasing their firing rates (Fig. 3c, d). Anterograde tracing by injecting AAV-floxed-GFP in PBN of Vglut2-Cre mice confirmed that PBN glutamatergic neurons projected their axons to MS (Fig. 3e). Slice whole-cell recordings from MS neurons in Vglut2-Cre mice injected with AAV-floxed-ChR2 in PBN further confirmed monosynaptic connectivity between PBN glutamatergic axons and MS neurons (Fig. 3f). Application of CNQX completely blocked the light-evoked synaptic current recorded at −70 mV, further confirming that the current was mediated by AMPA receptors (Supplementary Fig. 2). Together, these results demonstrate that MS receives gustatory input directly from PBN.
To further test in vivo whether PBN relays rewarding taste signals to MS GABAergic neurons, we silenced PBN bilaterally with muscimol during optrode recording (Fig. 3g). The recording was performed before (as baseline) and 15 min after muscimol infusion. The muscimol application resulted in a profound reduction (by ~80% on average) in the response of MS GABAergic neurons to passively received sucrose (Fig. 3h–j). Likewise, bilateral silencing of NST (Fig. 3k) nearly completely blocked the sucrose response of MS GABAergic neurons (Fig. 3l–n). The reduction of sucrose responses was not observed in control animals that received vehicle infusion in either of these structures (Supplementary Fig. 3).
Previously, sucrose-induced suppressive responses have been observed in LHb neurons14. Indeed, in vivo recordings revealed a sucrose-induced suppressive response in LHb neurons, which was largely blocked by silencing MS (Fig. 3o–r). Therefore, the MS GABAergic projection to LHb7 mediates the reward-related suppression in LHb. Consistent with the changes of neuronal responses, silencing PBN, NST or MS reduced sucrose-induced licking behavior (Supplementary Fig. 4), suggesting reduced appetitive motivation. Together, our data reveal a bottom-up ascending pathway for relaying appetitive taste information: from NST to PBN and then to MS GABAergic neurons, which then inhibit LHb neurons.
MS GABAergic neurons are required for reward associative learning
Since MS GABAergic neurons encode reward signals, they may play a role in reward associative learning. To test this idea, we employed a spatial reward learning paradigm (see “Methods”). Water-restricted mice were trained to perform reward-seeking in a hole-board34, with a target hole hiding sucrose water (Fig. 4a). Over four days’ training, latency for the animal to locate the reward (Fig. 4b, left) became gradually shortened (Fig. 4c, black), indicating successful association of a spatial location with reward. We then chemogenetically suppressed MS GABAergic neurons by expressing the inhibitory designer receptor exclusively activated by designer drugs (DREADD) receptor, hM4Di35, and administrating the DREADD agonist, CNO, daily before the training. The effectiveness of CNO was verified using in vitro slice recording (Supplementary Fig. 5). The chemogenetic silencing greatly impaired the spatial reward learning (Fig. 4b, right), as shown by the much longer latency to locate the reward on day 4 as compared to mCherry control animals (Fig. 4c). This result suggests that MS GABAergic neurons are required for the reward associative learning.
Next, we used a cue-reward associative learning paradigm36,37, where the animal learned to lick for reward (sucrose water) upon hearing a sound (see “Methods”). As MS GABAergic neurons can encode reward value, they may provide the unconditioned stimulus (US) signal for this type of associative learning. To test this idea, we optogenetically silenced MS GABAergic neurons by injecting AAV encoding Cre-dependent archaerhodopsin-3 (AAV-DIO-ArchT, or AAV-DIO-GFP as control) in Vgat-Cre mice (Fig. 4d). Green LED light illumination was applied during the sucrose consumption (US) window to silence US-related activity of the neurons, over 5 days’ training of coupling the conditioned stimulus (CS, 2-s tone) and US (10 µL sucrose) (Fig. 4e). On the 6th (test) day, no LED light illumination was applied. Behavioral responses to both CS and US were monitored. We found that over the course of conditioning (day 1- day 5) the anticipatory licking rate (i.e. licks during the time window between the onsets of CS and US, with baseline subtracted) was gradually increased in GFP control mice (Fig. 4e, right). This increase in anticipatory licking was also observed on the test day, indicating that the animal had learned to associate the sound cue with reward. The increase in anticipatory licking was however largely prevented in ArchT-expressing animals (Fig. 4f, g). Meanwhile, licking during the sucrose consumption window was reduced by silencing MS GABAergic neurons as compared to GFP control animals (Supplementary Fig. 6a). These results indicate that the US responses of MS GABAergic neurons are necessary for forming the cue-reward association.
Learning strengthens responses of MS GABAergic neurons to reward-predictive cues
Neurons involved in reward learning may change their responses during conditioning14,36,37. To test how the responses of MS GABAergic neurons are shaped by learning, we performed optrode recording before, during, or after conditioning for two days (Fig. 5a). Before conditioning (i.e. in naïve mice), we did not observe significant responses in these neurons to the tone that would be used as the CS (Fig. 5b, f). During the first day of conditioning, the neurons exhibited at best a weak response to the CS while a strong response to sucrose (Fig. 5c, f). On the second day of conditioning, the response to the CS became much stronger (Fig. 5d, f). Following two days’ conditioning, the robust response to CS persisted into the third day, when only CS was presented without US (Fig. 5e, f). On average, the response to CS was increased by 5.8-fold compared to the first day of conditioning (Fig. 5f). These results demonstrate learning-induced plasticity of MS GABAergic neuron responses to reward-predictive cues.
SOM but not PV neurons account for the reward processing in MS
In the basal forebrain, GABAergic neurons are known to be functionally diverse19. Parvalbumin (PV) and somatostatin (SOM) have been used as markers for different subtypes of GABAergic neurons in both the basal forebrain19,20,38 and other brain regions39,40. We investigated which of these GABAergic cell subtypes played a role in the reward learning by first examining their responses to reward signals. We expressed Cre-dependent ChR2 in MS of PV-Cre or SOM-Cre mice and performed optrode recording. PV or SOM positive units were identified by recorded spikes time-locked to LED light pulses applied (Fig. 6a). Overall, 86.4% of the recorded SOM neurons were activated by sucrose and 13.6% showed no response (Fig. 6b, d, e), whereas 87.0% of the PV neurons showed no response to sucrose and 13.0% were suppressed (Fig. 6c–e). Also consistent with the properties of the general GABAergic population, the SOM neurons responded more strongly to sucrose than just water, as shown by Ca2+ signals recorded by in vivo fiber photometry (Fig. 6f, g), indicating that they can encode reward value. By expressing ChR2 in PBN and recording from tdTomato-labeled SOM neurons in MS of slice preparations, we further confirmed that the SOM neurons received excitatory monosynaptic input from PBN (Fig. 6h, i).
In addition, activation of the SOM neurons produced place preference in the RTPP test (Fig. 6j–l), similar to GABAergic neurons, whereas activation of the PV neurons resulted in weak place avoidance (Fig. 6m, n). Together, these results suggest that the SOM neurons encode positive valence and convey reward information. In contrast, the PV neurons encode negative valence and they are unlikely involved in appetitive learning.
To test SOM neurons’ involvement in reward learning, using optrode recording we examined spiking activity of these neurons before and during reward associative learning with the same appetitive Pavlovian conditioning paradigm. Before conditioning, the SOM neurons showed no response to the CS tone (Fig. 7a). During the first and second day of conditioning, we observed gradual emergence and strengthening of their responses to the CS tone (Fig. 7b–e), similar to what had been observed for the GABAergic neurons as a whole population. Altogether, these results suggest that the SOM subtype of MS GABAergic neurons likely contributes to the reward associative learning.
MS SOM neurons are required for reward associative learning
To directly examine the contribution of SOM neurons to reward associative learning, we expressed Cre-dependent ArchT in MS of SOM-Cre mice. We applied photoinhibition during either the US window (Fig. 8a–d) or the CS plus delay window (Fig. 8e–h). Compared to GFP control animals, suppressing either CS-related or US-related activity of the SOM neurons largely impaired the reward associative learning, as shown by the prevention of increases of anticipatory licking over conditioning sessions and in the test session after the conditioning (Fig. 8d, h). Silencing the SOM neurons during the US window also reduced licking to sucrose as compared to GFP control mice (Supplementary Fig. 6b). Interestingly, in well-trained mice, photoinhibition of the SOM neurons during the CS plus delay window had no effect on anticipatory licking (Fig. 8i–l), suggesting that after the acquisition of associative learning the activity of the SOM neurons is not required for the expression of cue-reward association. However, it is indispensable for the formation of cue-reward association during conditioning. Together, these results indicate that SOM neurons in MS are required for the acquisition of reward associative learning.
The MSSOM → LHb pathway mediates reward associative learning
The blockade of the suppressive reward response in LHb by silencing MS (Fig. 3o–r) suggested that MS SOM neurons might mediate reward associative learning through LHb. We then anterogradely traced axons from GFP-labeled MS SOM neurons and found robust projections to LHb (Fig. 9a). We further expressed ChR2 in MS SOM neurons and photostimulated the ChR2-expressing axon terminals in LHb (at 20 Hz) bilaterally via implanted optic fibers above LHb (Fig. 9b). The stimulation-induced place preference in the RTPP test (Fig. 9c, d), indicating that the MS SOM to LHb projection is rewarding. In addition, we expressed Cre-dependent halorhodopsin (AAV-DIO-eNpHR3.0) in MS SOM neurons and photoinhibited their axons in LHb bilaterally by delivering yellow light covering the entire CS plus US window over 5 days’ conditioning sessions. (Fig. 9e). The inhibition of the MSSOM → LHb projection resulted in significant impairment of reward associative learning (Fig. 9f–h). In addition, the licking behavior during the sucrose consumption window was reduced (Supplementary Fig. 6c). These data demonstrate that indeed MS SOM neurons mediate the reward associative learning through their projection to LHb.
Discussion
In this study, using cell-type specific analysis we have discovered a bottom-up ascending sensory pathway (Fig. 9i) mediated by MS SOM neurons to relay rewarding taste signals. The activity of the SOM neurons was increased by rewarding stimuli, and precise optogenetic interventions revealed that the activation of these neurons was causally linked to appetitive/approaching behaviors. Positive reinforcement learning resulted in a plastic change (emergence and enhancement) in their responses to CS. Temporal silencing of the SOM neurons largely impaired the positive reinforcement learning, by intervening either the CS- or US-related activity of these neurons. These results have revealed physiological properties of a cell-type specific, reward-coding pathway that underlies specific appetitive reinforcement learning.
MS transmits divergent valences and bidirectionally modulates LHb
MS receives aversive auditory and somatosensory signals from the pontine central gray (PCG) and transmits negative valence information to LHb via its glutamatergic neurons7,8. In the present study, we found that MS GABAergic neurons, specifically SOM inhibitory neurons, receive rewarding gustatory signals from PBN, which is also in the pons. Thus, the SOM neurons are selectively activated by bottom-up rewarding sensory stimuli, while the glutamatergic neurons are selectively activated by aversive sensory stimuli. It remains to be investigated whether the SOM neurons can be activated by a broad range of rewarding stimuli, but at least the observation of their activation by water consumption (Fig. 6g) suggests that the rewarding stimuli are not limited to taste modality. Since MS GABAergic and glutamatergic axons both project to LHb and can co-innervate the same LHb neuron7, MS can convey both rewarding and aversive signals to LHb through these parallel and convergent projections, which then bidirectionally modulates LHb neuronal activity through their opponent interactions. Thus, in addition to previously proposed function roles of MS in speed coding41, sleep control19, generation of hippocampal theta rhythms42, exploratory behavior43, and attention44, we further demonstrate that this structure also serves as an important hub for processing valence of sensory information.
LHb has been shown to play a critical role in mediating aversion/reward10,11,14 through top-down modulation of dopaminergic and serotonergic systems17,18, and is strongly implicated in major depressive disorders9,12. Accumulating evidence supports that attenuating neuronal activity in LHb can provide therapeutic effects9. Because there are few GABAergic neurons within LHb13, in physiological conditions the suppression of LHb relies on its afferents which are GABAergic. Parallel excitatory and inhibitory projections may be used as a common strategy for input sources to LHb, so that the same or cross modality input signals (internal or external) can bidirectionally modulate LHb neuronal activity depending on the balance between the excitatory and inhibitory presynaptic neurons. As such, MS may serve as an important hub where bottom-up sensory signals with opposite valences converge. Through MS-LHb mediated pathways, external sensory cues of different modalities can then be transformed into emotional valences and affect the mood and behavior of animals.
A non-canonical taste-related valence processing pathway
Taste sensation is essential for animal to discriminate and evaluate the palatability of food cues and to approach nutritional food or avoid toxic substances. Once sensed by the peripheral taste receptor cells, gustatory information is first relayed by taste ganglia and then NST45. As a direct downstream target of NST, PBN broadcasts the gustatory information to many regions throughout the brain, including the taste thalamus, gustatory cortex, amygdala, and hypothalamus29,30. Sweet and bitter are two basic taste qualities with opposite valences, i.e. appetitive and aversive respectively. However, whether valences of sweet and bitter taste are represented by molecularly different populations of neurons and where this segregation occurs have remained open questions. Sweet and bitter taste qualities are represented by different cortical fields in the gustatory cortex and such segregation remains in the cortical projections to the amygdala46–48, with the latter considered to be able to drive valence-specific taste behaviors48. Nevertheless, it has been proposed that each taste modality specified by a taste receptor type has its own neuronal pathway45,49,50, implying that segregation of taste quality or valence can occur before the stage of the cortex. Indeed, separate neuronal populations in PBN are reported to encode appetitive and aversive taste stimuli respectively22,33. Therefore, the appetitive and aversive taste information may have already been segregated in PBN and then further utilized by its downstream targets to regulate emotional states and related appetitive/aversive behaviors. Our results demonstrate that MS SOM neurons specifically receive appetitive taste information from PBN and encode its valence value. Such information is further relayed to LHb important for reward processing. Therefore, the NST → PBN → MS → LHb pathway constitutes an ascending gustatory pathway devoted to emotional processing (Fig. 9i), and could be independent of the conventional NST → thalamus→cortex pathway required for taste discrimination. How this pathway interacts with the amygdala in driving valence-dependent taste behaviors needs to be further investigated. It should be noted that PBN projects not only to MS, but also to the lateral septum (LS) (Fig. 3e). In addition, NST, MS and LHb also have multiple projection targets and rich collaterals. The specific NST-PBN-MS-LHb pathway we focus on in this study could be one of many pathways contributing to reward processing of taste signals. This is evidenced by results showing that inactivation of each component of the pathway only partially impeded the licking behavior to sucrose delivery (Supplementary Fig. 6).
MS is a critical hub for associative learning
MS is part of the basal forebrain, which is one of the earliest structures that show degenerative changes in Alzheimer’s disease and has been implicated in its characteristic memory disfunction51. Previous studies have demonstrated involvements of MS in learning and memory, but with a focus on the cholinergic system52. In Pavlovian conditioning, the unconditioned stimulus has to contain an inherent valence value (positive or negative), which is then associated with the conditioned stimulus during conditioning so that the latter acquires the same valence. Cholinergic neurons in MS however appear to encode salience but not valence of unconditioned stimuli7,52, suggesting that other cell types20 such as glutamatergic and GABAergic neurons may be involved in valence coding. Previous studies suggest that MS glutamatergic neurons encode aversive signals in various modalities7 and that silencing MS impairs sound-cued fear conditioning8. Here, we further demonstrate that the SOM type of MS GABAergic neurons specifically encode rewarding signals and positive valence and that their responses to the unconditioned stimulus are required for the reward-cue association. Therefore, via separate molecularly defined neuronal populations, MS can convey both positive and negative valences of unconditioned stimuli required for associative learning. The CS-US association is mediated at least partially by changes of MS neuron responses to the CS (e.g. an increase of CS responses during reward learning). The acquired valence of the CS could be relayed from MS to LHb, which contributes to the observed changes of CS responses in LHb14. In addition, previous studies have demonstrated that neuronal activity in MS can modulate hippocampal spatial representation via the septal-hippocampal projection53. Therefore, it is likely that MS also provides valence signals to the hippocampus and contributes to the formation of reward-associated spatial memory54. Notably, in the septal-hippocampal pathway, MS GABAergic neurons have been shown to mostly innervate GABAergic interneurons in the hippocampus38,55. Thus, activation of MS GABAergic neurons may result in disinhibition of pyramidal neurons in the hippocampus, facilitating the formation of reward site-specific place fields54. Together, these results suggest that MS may act as a critical hub for both aversive and reward learning, by receiving sensory inputs of multiple modalities and distributing associated valence values to various downstream structures. This view fills in a gap in our understanding of roles of MS in learned behaviors.
Plasticity of SOM neurons and their role in reward learning
The functional roles of SOM neurons in the medial septum complex have been poorly studied. Previous investigations on the function of SOM neurons around the region primarily focused on the population residing mostly in the lateral NDB, which is more ventral and lateral to the area we focused on in this study. The neurons are found to powerfully inhibit all other cell types and gate the basal forebrain input to the cortex19,56. A recent study has shown that optogenetic inactivation of basal forebrain SOM neurons impairs spatial working memory57. Moreover, SOM neurons in the hippocampus and cortical regions have been shown to undergo anatomical and functional changes in experience-dependent plasticity58. Our previous study suggests that MS neurons in general do not respond to tones, while selectively respond to broadband noise with a high intensity threshold8. That SOM neurons do not respond to a CS tone in naïve mice (Fig. 7a) is consistent with this previous observation. After conditioning, the SOM neurons acquire responses to the reward-predicative tone, suggesting that the initially subthreshold tone-evoked input to these neurons has been strengthened and become suprathreshold through mechanisms such as long-term potentiation (LTP). Pairing a subthreshold input and a suprathreshold input with an appropriate temporal order can lead to LTP of the subthreshold input59,60. That both the CS (subthreshold) and US (suprathreshold) responses of SOM neurons are required for the acquisition of learning is consistent with the idea that LTP-like synaptic plasticity is a cellular substrate for the formation of CS-US association61,62. The conditioning-induced enhancement of CS responses has also been observed in the basal forebrain63,64, as well as in many other brain regions5,65,66.
The finding that MS SOM neuron activity is required for the formation of reward association memory but not for the retrieval of this memory (Fig. 8i–l) suggests a dissociation of neural circuits for the acquisition and expression phase of reward learning. Possibly, after conditioning, the plasticity of CS responses is transferred to other neuronal substrates, which can then support MS-independent behavioral responses to the CS tone alone. Similar dissociation has been reported elsewhere. For example, the paraventricular thalamic nucleus (PVT) is involved in the acquisition but not expression of associative memory37, while dopamine neurons in the dorsal raphe only affect the expression phase of associative learning67.
In the reward processing system, the activity of some neurons such as dopamine neurons is proposed to encode reward prediction errors65,68. For this type of neurons, reward-predictive cues gain the ability while the predicted reward gradually fail to activate them during learning. However, in our results, after learning the predicted reward remains to be able to activate the MS SOM neurons (Fig. 7d), suggesting that activity of the SOM neurons does not serve as reward prediction errors. In addition, the SOM neurons do not maintain a high-level activity during the anticipatory phase (i.e. delay period) of reward responses, unlike serotonergic neurons in the dorsal raphe69,70. Through the inhibitory projection to LHb, the plasticity of MS responses to reward-predictive cues can be relayed directly to LHb, which accounts for the enhanced suppressive responses to the cues in LHb14.
In summary, the results of the current study reveal an important role of MS in associative learning. By broadly receiving multisensory inputs of both rewarding and aversive nature, MS integrates and broadcasts positive and negative valences of sensory cues via different neuronal populations, which are required for the formation of related associative memory.
Methods
All experimental procedures in this study were in accordance with the guidelines for the care and use of laboratory animals of the US National Institutes of Health (NIH), and were approved by Animal Care and Use Committee (IACUC) of the University of Southern California.
Animals
Experiments were performed in adult (2–3 months old) male and female mice. Wild-type (C57BL/6J) and transgenic (Vgat-ires-Cre, Vglut2-ires-Cre, SOM-ires-Cre, and PV-ires-Cre) mice were obtained from the Jackson Laboratory and were housed with a 12 h light-dark cycle, at 65–75 °F temperature and 40–60% humidity. All recordings and behavioral tests were conducted in the dark cycle.
Virus
AAV1-EF1α-DIO-hChR2(H134R)-EYFP-WPRE (UPenn vector core, Addgene, 20298), AAV1-CAG-FLEX-ArchT-GFP (UNC GTC vector core, Addgene, 29777), AAV1-CAG-FLEX-GFP-WPRE (UPenn vector core, Addgene, 51502), AAV5-EF1a-DIO-hM4D(Gi)-mCherry (Addgene, 50461), AAV5-EF1a-DIO-mCherry (Addgene, 50462), AAV1-CA-FLEX-RG (Addgene, 38043), AAV1-EF1α-FLEX-TVA-mCherry (Addgee, 38044), EnvA-G-deleted Rabies-GFP (Salk vector core), AAV9-Syn-FLEX-GCaMP6s-WPRE-SV4 (Addgene, 100845), AAV1-hSyn-hChR2(H134R)-EYFP (Addgene, 26973), and AAV1-EF1α-DIO-eNpHR3.0-EYFP (Addgene, 26966) were used in this study.
Surgical procedures
Stereotaxic injections of virus were carried out as we previously described7,8. The mouse was anesthetized with isoflurane (1.5–2% by volume). A heating pad was placed underneath the animal body to maintain the body temperature of the animal. A small incision was made on along the midline to expose the skull. One ~0.2 × 0.2 mm2 craniotomy window was made for the target region (MS: 0.98 mm anterior to the bregma, 1 mm lateral to the midline, 4.25 mm below the pia with a 13.5° angle; PBN: 5.2 mm posterior to the bregma, 1.25 mm lateral to the midline, 2.8 mm below the pia with 0° angle; NST: 7.5 mm posterior than the bregma, 0.3 mm lateral to the midline, 3 mm below the pia with 0° angle; LHb: 1.5 mm posterior than the bregma, 1 mm lateral to the midline, 2.5 mm below the pia with a 10° angle). The adeno-associated viruses (AAVs, encoding ChR2, GFP, ArchT, NpHR, or GCamp6s) or pseudo-typed rabies virus (encoding GFP) were used depending on the purpose of experiments and strain of mice. Virus was delivered through a pulled glass micropipette (inner diameter of tip: ~20–30 µm) using pressure injection via a micropump (World Precision Instruments). For each injection, 60 nL of viral solution was injected at a rate of 15 nL/min. Right after the injection, the pipette stayed for 4 min before withdrawal. The scalp was then suture closed and the animal was administered ketoprofen (5 mg/kg) and buprenorphine (0.5 mg/kg) to minimize inflammation and discomfort. Animals were recovered from anesthesia on a heating pad and then returned to their home cages.
Optogenetic preparation and stimulation
2 weeks after the virus injection, an optic cannula (200 µm core, RWD Inc.) was stereotaxically implanted above the MS or LHb. After at least 1 week of recovery, animals were habituated to connecting to an optic fiber cord. For photoactivation, 20 Hz (5 ms pulse duration) light stimulation was delivered through the optic fiber cord which was connected to a blue LED source (470 nm, Thorlabs). For photoinhibition, a sustained green LED light (530 nm, Thorlabs, for ArchT) or yellow light (594 nm, Thorlabs, for NpHR) was applied during the entire period for inhibition. The efficiency of photoactivation and photoinhibition has been verified in slice recording and in vivo optrode recording experiments. The LED power measured at the tip of the fiber (connected with the optic cannula) is around 3–5 mW.
Behavioral tests
Spatial reward learning
Spatial reward learning was performed with a hole-board, where sucrose water was filled in a target hole. Behavioral tests were performed after water deprivation for 24 h. The animal was placed on a platform which was connected with one corner of the hole-board. Time counting was started when the animal left the platform and entered the board, and ended when it found the target hole and started to lick the water. 5 trials were given each day for 4 consecutive days. Compensatory water was given to the animal after daily trial so that the animal maintained > 90% of the original body weight.
Real-time place preference
Real-time place preference test was performed as described before7. A clear acrylic behavior box (40 cm × 20 cm × 20 cm, divided into two chambers, put in a larger white foam box) with normal bedding materials was used. For each trial, the mouse was initially placed in the non-stimulation chamber, and LED (480 nm, 10 Hz, 5-ms pulse duration) stimulation was constantly delivered once the animal entered the stimulation chamber and was stopped once the animal exited. The total duration of each test session was 20 min. Animals were returned to their home cage after each test session. The stimulation chamber was randomly assigned and balanced for the whole group of animals. This test was controlled by a customized close-loop optogenetic control system with online real-time mouse detection software and a computer-controlled Arduino microcontroller (https://www.arduino.cc/) described in detail below.
Self-stimulation
Mice were placed in an operant box equipped with two ports for nose poke at symmetrical locations on one of the cage walls. The ports were connected to a photo-beam detection device allowing for measurements of responses. A valid nose poke at the LED-on port lasting for at least 500 ms triggered a 1 s long 20 Hz (5-ms pulse duration) LED pulse train delivery controlled by an Arduino microcontroller. The LED-on port was randomly assigned and balanced within the group of tested animals. The test lasted for 40 mins. Video and time stamps associated with nose poke and laser events were saved in a computer file for post-hoc analysis.
Light-dark box test
An acrylic behavior box (40 cm × 20 cm × 20 cm) was divided into a dark chamber (10 cm × 20 cm × 20 cm) and a light chamber (30 cm × 20 cm × 20 cm). The dark chamber was shielded with black aluminum foil. A small opening located at floor level in the center of the dividing wall allowed the animal to freely move between the dark and light chambers. To prevent hindering movements of the optic fiber cable, a narrow opening was made on the divider between the two chambers. Besides that, on top of the light side chamber, two curved metal wires were placed to guide the movement of the optic fiber cable. Animal was placed in the light side at the beginning and its behavior was recorded by a camera above the box. The time spent on the light side was analyzed using the object-detection software described below.
Real-time animal detection and closed-loop optogenetic control
A customized mouse detection software was used for online real-time animal detection (written by Guang-Wei Zhang, in Python 3.4, www.python.org, with OpenCV library, https://opencv.org). The behavior of the animal was monitored using an infrared camera at 24 frames per second. Each video frame was gaussian blurred and then binarized. The centroid of the detected contour was used to determine the location of the animal. In the two-chamber place preference test, once the mouse entered the assigned LED-on chamber, computer-controlled Arduino microcontroller (www.arduino.cc) generated TTL signals to drive the LED light source (ThorLabs Inc.). The behavior test was run automatically without the experimenter’s interference and the result was quantified right after each experiment.
Intraoral infusion of sucrose/quinine and Pavlovian conditioning
For passive sucrose or quinine delivery, a soft silastic tubing was subcutaneously inserted to the oral cavity of the mouse through a small incision on cheek69. The tube was adhesive to the cheek with sutures. This approach allowed us to precisely control the time and volume of sucrose or quinine delivered into the mouth. After three days of recovery, a micro pump (Lee-Company) was used to infuse either quinine (5 mM, 10 µL) or sucrose (5% w/v) into the oral cavity through the tube.
For active licking, mice were water-deprived and habituated to the head-fixation condition to lick the waterspout for 2–3 days. Once they reliably licked free water, they were subjected to Pavlovian conditioning. During conditioning, a 16 kHz tone cue was presented for 2 s, followed by a 1 s delay period, and then 10 µL 5% w/v sucrose water was delivered. Each training session contained 50 trials. The inter-trial interval varied between 30–40 s. For each training session, the animal could drink about 1 mL water before returned to home cage, where no water was available. An infrared camera was used to record videos for detection of licking events. An infrared LED was also captured to serve as a trigger signal for each trial and was used to align extracellular recording with video recording. Licking events were detected by a customized software (written by Li Shen, in Python with OpenCV library) and further confirmed by visual inspection. The data were further processed with customized Matlab scripts (Mathworks). The anticipatory lick rate was quantified between the onset of the tone cue and that of sucrose delivery, with the baseline lick rate (before the onset of CS tone) subtracted. The anticipatory lick rate reached a plateau after 5–6 training sessions, and animals were considered as well-trained. To test the effect of photoinhibition in well-trained animals, optogenetic stimulation was applied in randomly selected 50 trials out of 100 total trials.
Optrode recording and spike sorting
Three days before recording, the mouse was anesthetized with isoflurane (1.5–2% by volume). A head post for fixation and another for angle marker were mounted on top of the skull with dental cement and a craniotomy was performed over the MS. Silicone adhesive (Kwik-Cast Sealant, WPI Inc) was applied to cover the craniotomy window until the recording experiment. For pharmacological silencing with muscimol, a drug delivery cannula (internal diameter: 140 µm) was stereotaxically implanted into the target area. After baseline recording, fluorescent muscimol-bodipy (0.7 mM in ACSF with 5% DMSO) was infused via the implanted cannula. Recording session to evaluate silencing effects was started 10 min after the infusion.
Recording was carried out with an optrode (A16-Poly2-5mm-50 s-177-OA16LP, 16 contacts separated by 50 mm, the distance between the tip of the optic fiber and the probes is 200 µm, NA 0.22, Neuronexus Technologies) connected to a laser source (473 nm) via an optic fiber. The optrode was lowered into the MS region and signals were acquired using the Plexon recording system. The Vglut2, Vgat, PV or SOM neurons were optogenetically tagged by injecting AAV-floxed-ChR2 in a corresponding Cre-driver mouse or by crossing a Cre-driver line with a ChR2 reporter line (Vglut2-Cre × Ai27, Vgat-Cre × Ai27, PV-Cre × Ai27, or SOM-Cre × Ai27). To identify ChR2+ neurons, 16-Hz or 32-Hz (5-ms pulse duration) laser pulse trains were delivered intermittently. Extracellular signals were recorded at 30 kHz sample rate and were filtered through a bandpass filter (0.3–10 kHz). The nearby four channels of the probe were grouped as tetrodes, and semiautomatic spike sorting was performed by using Offline Sorter (Plexon). Semiautomated clustering was carried out on the basis of the first three principal components of the spike waveform or Peak-Valley values on each tetrode channel using a T-Dist E-M scan algorithm (scan over a range of 10–30 degree of freedom) and then evaluated with sort quality metrics. Clusters with isolation distance <20 and L-Ratio > 0.1 were discarded7. Spike clusters were classified as single units only if the waveform SNR (Signal Noise Ratio) exceeded 4 (12 dB) and the inter-spike intervals exceeded 1.2 ms for >99.5% of the spikes. MS ChR2+ neurons were optogenetically identified by their time-locked spike responses to blue laser pulses. Only spikes with onset latencies (relative to the light pulse) < 4 ms were considered as being directly evoked. The correlation coefficient between the average waveforms of these laser-evoked spikes and sucrose- or quinine-evoked spikes was calculated and only units with cc > = 0.97 were considered as a valid unit.
Response quantification
After spike sorting, spike trains for each cell were analyzed with customized MATLAB scripts (Mathworks). Peri-stimulus spike time histograms (PSTHs) with 10 ms time bins were generated. Firing rates were normalized to the baseline activity by calculating a Z-score (z = (x−µ)/σ), with µ being the average spontaneous firing rate and σ being the standard deviation in the 2 s window preceding the stimulus onset. For each cell, the evoked response within 2 s after the stimulus onset was compared with the spontaneous activity to determine whether the stimulus could excite or suppress the neuronal activity (unpaired t test, p < 0.05). Recording location and virus expression were verified post-hoc with standard histological procedures.
Slice whole-cell recording
To confirm the connectivity between PBN glutamatergic axons and MS neurons, Vglut2 -Cre mice injected with AAV1-EF1α-DIO-hChR2(H134R)-EYFP-WPRE in PBN were used for slice recording. To confirm the connectivity between PBN axons and MS SOM neurons, SOM-Cre-Ai14 mice injected with AAV1-hSyn-hChR2(H134R)-EYFP in PBN were used for slice recording. Four weeks following the injection, animals were decapitated following anesthesia and the brain was rapidly removed and immersed in an ice-cold dissection buffer (composition: 60 mM NaCl, 3 mM KCl, 1.25 mM NaH2PO4, 25 mM NaHCO3, 115 mM sucrose, 10 mM glucose, 7 mM MgCl2, 0.5 mM CaCl2; saturated with 95% O2 and 5% CO2; pH = 7.4). Coronal slices at 350 µm thickness were sectioned by a vibrating microtome (Leica VT1000s), and recovered for 30 min in a submersion chamber filled with warmed (35 °C) ACSF (composition:119 mM NaCl, 26.2 mM NaHCO3, 11 mM glucose, 2.5 mM KCl, 2 mM CaCl2, 2 mM MgCl2, and 1.2 NaH2PO4, 2 mM Sodium Pyruvate, 0.5 mM VC). MS neurons labeled with tdTomato (SOM-Cre-tdTomato mice) or surrounded by EYFP+ fibers were visualized under a fluorescence microscope (Olympus BX51 WI). Patch pipettes (~4–5 MΩ resistance) filled with a cesium-based internal solution (composition: 125 mM cesium gluconate, 5 mM TEA-Cl, 2 mM NaCl, 2 mM CsCl, 10 mM HEPES, 10 mM EGTA, 4 mM ATP, 0.3 mM GTP, and 10 mM phosphocreatine; pH = 7.25; 290 mOsm) were used for whole-cell recordings. Signals were recorded with an Axopatch 700B amplifier (Molecular Devices) under voltage clamp mode at a holding voltage of –70 mV for excitatory currents, filtered at 2 kHz and sampled at 10 kHz. Tetrodotoxin (TTX, 1 μM) and 4-aminopyridine (4-AP, 1 mM) were added to the external solution for recording monosynaptic responses to blue light stimulation (5 ms pulse, 3 mW power, 10–30 trials).
For testing the efficacy of ChR2 or ArchT, brain slices were prepared similarly, and whole-cell current-clamp recordings were made from neurons expressing ChR2 or ArchT. A train of blue light pulses at different frequencies (5-20 Hz, 5-ms pulse duration) was applied to measure spike responses of ChR2-expressing neurons. Green light stimulation (5-s duration) was applied to measure hyperpolarizations in ArchT-expressing neurons. In vitro slice recording data were analyzed by using pClamp (Molecular Devices) and customized Python codes.
Cell-type-specific retrograde tracing of monosynaptic inputs
To trace monosynaptic inputs to Vgat neurons in MS, AAV1-CA-FLEX-RG and AAV1-EF1α-FLEX-TVA-mCherry were stereotactically injected into MS of Vgat-Cre mice. After two weeks, EnvA-G-deleted Rabies-GFP was injected into MS. The animal was sacrificed one week later. Brain tissue was fixed, sectioned, and imaged using a confocal microscope.
Fiber photometry recording
SOM-Cre mice were injected with Cre-dependent GCaMP6s (AAV1-Syn-FLEX-GCamp6s-WPRE-SV4) into the MS (60 nL). An optic fiber (400 µm, NA = 0.5, Thorlabs) was implanted over the MS injection site and secured with dental cement. Following 3 weeks of viral expression, GCaMP6s fluorescence was detected through the optic fiber using a fiber photometry system as described previously71. LED light (480 nm, Thorlabs) was bandpass filtered (ET470/24 M, Chroma), focused by an objective lens (Olympus) and coupled to an optical fiber (O.D. = 400 µm, NA = 0.48, 1 m long, Doric), which connected to the implanted optic fiber using a ceramic sleeve. The LED power was adjusted to 0.02 mW at the tip of the optical fiber. The fluorescent calcium signal was bandpass filtered (ET525/36 M, Chroma) and collected by a photomultiplier tube (H11706-40, Hamamatsu). Before digitized (250 Hz sampling rate) by a data acquisition card (PCI-MIO-16E-4, National Instruments), the signal was amplified (Model SR570, Stanford Research System) and low-pass filtered (30 Hz). Data were obtained and analyzed using custom LabVIEW and MATLAB software.
Image acquisition
To check the expression of EYFP, GFP, or mCherry, or electrode tracks (coated with DiI), the animals were deeply anesthetized using urethane (25%) and transcardially perfused with phosphate-buffered saline (PBS) and paraformaldehyde (4% in PBS). Coronal brain sections (150 μm) were made with a vibratome (Leica Microsystems) and stained with Nissl reagent (Deep red, Invitrogen) for 2 h at room temperature. Each slice was imaged under a confocal microscope (Olympus).
Statistics
Pilot experiments were conducted to determine the sample size. Mann-Whitney test was used for non-normality data. Kolmogorov-Smirnov test was used for comparing the difference between optogenetic or chemogenetic silencing and control group. One-way ANOVA and Two-way ANOVA with post-hoc Tukey’s test or Bonferroni test were used to test significance between samples. For two-group comparison of normality data, significance was determined by using t test. Paired t test was used to compare data from the same neuron or the same animal.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
This work was supported by grants from the U.S. National Institute of Health (EY019049 and MH116990 to H.W.T.; DC008983 and MH116990 to L.I.Z.).
Source data
Author contributions
H.W.T., L.I.Z., and L.S. designed the experiments. L.S., G.W.Z., and M.B.S. performed behavioral experiments. L.S. and G.W.Z. performed in vivo electrophysiology and photometry experiments. L.S. and G.W.Z. performed data analysis. C.T. performed slice recording experiments. J.J.H. and N.K.Z. helped with viral injections and in vivo experiments. H.W.T. and L.I.Z. supervised the project. H.W.T., L.S., G.W.Z., and L.I.Z. wrote the manuscript.
Peer review
Peer review information
Nature Communications thanks Yunlei Yang, Jelena Radulovic and the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer review reports are available.
Data availability
The full datasets generated in the current study are available from the corresponding author upon requests. Source data are provided with this paper.
Code availability
Animal detection and lick detection programs are available on https://github.com/li-shen-amy/behavior. DIO for the repository is 10.5281/zenodo.5992295.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Li Shen, Guang-Wei Zhang.
Contributor Information
Li I. Zhang, Email: liizhang@usc.edu
Huizhong W. Tao, Email: htao@usc.edu
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-022-28854-z.
References
- 1.Hu H. Reward and aversion. Annu. Rev. Neurosci. 2016;39:297–324. doi: 10.1146/annurev-neuro-070815-014106. [DOI] [PubMed] [Google Scholar]
- 2.Tye KM. Neural circuit motifs in valence processing. Neuron. 2018;100:436–452. doi: 10.1016/j.neuron.2018.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Berridge KC. Affective valence in the brain: modules or modes? Nat. Rev. Neurosci. 2019;20:225–234. doi: 10.1038/s41583-019-0122-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Jennings JH, et al. Distinct extended amygdala circuits for divergent motivational states. Nature. 2013;496:224–228. doi: 10.1038/nature12041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Stephenson-Jones M, et al. Opposing contributions of GABAergic and glutamatergic ventral pallidal neurons to motivational behaviors. Neuron. 2020;105:921–933. doi: 10.1016/j.neuron.2019.12.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Xiao X, et al. A genetically defined compartmentalized striatal direct pathway for negative reinforcement. Cell. 2020;183:211–227. doi: 10.1016/j.cell.2020.08.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Zhang G-W, et al. Transforming sensory cues into aversive emotion via septal-habenular pathway. Neuron. 2018;99:1016–1028. doi: 10.1016/j.neuron.2018.07.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Zhang G-W, et al. A non-canonical reticular-limbic central auditory pathway via medial septum contributes to fear conditioning. Neuron. 2018;97:406–417. doi: 10.1016/j.neuron.2017.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hu H, Cui Y, Yang Y. Circuits and functions of the lateral habenula in health and in disease. Nat. Rev. Neurosci. 2020;21:277–295. doi: 10.1038/s41583-020-0292-4. [DOI] [PubMed] [Google Scholar]
- 10.Matsumoto M, Hikosaka O. Lateral habenula as a source of negative reward signals in dopamine neurons. Nature. 2007;447:1111–1115. doi: 10.1038/nature05860. [DOI] [PubMed] [Google Scholar]
- 11.Hikosaka O. The habenula: from stress evasion to value-based decision-making. Nat. Rev. Neurosci. 2010;11:503–513. doi: 10.1038/nrn2866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Knowland D, Lim BK. Circuit-based frameworks of depressive behaviors: the role of reward circuitry and beyond. Pharmacol. Biochem. Behav. 2018;174:42–52. doi: 10.1016/j.pbb.2017.12.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Aizawa H, Kobayashi M, Tanaka S, Fukai T, Okamoto H. Molecular characterization of the subnuclei in rat habenula. J. Comp. Neurol. 2012;520:4051–4066. doi: 10.1002/cne.23167. [DOI] [PubMed] [Google Scholar]
- 14.Wang D, et al. Learning shapes the aversion and reward responses of lateral habenula neurons. eLife. 2017;6:e23045. doi: 10.7554/eLife.23045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Barker DJ, et al. Lateral preoptic control of the lateral habenula through convergent glutamate and GABA transmission. Cell Rep. 2017;21:1757–1769. doi: 10.1016/j.celrep.2017.10.066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Faget L, et al. Opponent control of behavioral reinforcement by inhibitory and excitatory projections from the ventral pallidum. Nat. Commun. 2018;9:849. doi: 10.1038/s41467-018-03125-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Balcita-Pedicino JJ, Omelchenko N, Bell R, Sesack SR. The inhibitory influence of the lateral habenula on midbrain dopamine cells: ultrastructural evidence for indirect mediation via the rostromedial mesopontine tegmental nucleus. J. Comp. Neurol. 2011;519:1143–1164. doi: 10.1002/cne.22561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Sego C, et al. Lateral habenula and the rostromedial tegmental nucleus innervate neurochemically distinct subdivisions of the dorsal raphe nucleus in the rat. J. Comp. Neurol. 2014;522:1454–1484. doi: 10.1002/cne.23533. [DOI] [PubMed] [Google Scholar]
- 19.Xu M, et al. Basal forebrain circuit for sleep-wake control. Nat. Neurosci. 2015;18:1641–1647. doi: 10.1038/nn.4143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Do JP, et al. Cell type-specific long-range connections of basal forebrain circuit. eLife. 2016;5:e13214. doi: 10.7554/eLife.13214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Boyden ES, Zhang F, Bamberg E, Nagel G, Deisseroth K. Millisecond-timescale, genetically targeted optical control of neural activity. Nat. Neurosci. 2005;8:1263–1268. doi: 10.1038/nn1525. [DOI] [PubMed] [Google Scholar]
- 22.Han W, et al. A neural circuit for gut-induced reward. Cell. 2018;175:665–678. doi: 10.1016/j.cell.2018.08.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Olds J, Milner P. Positive reinforcement produced by electrical stimulation of septal area and other regions of rat brain. J. Comp. Physiol. Psychol. 1954;47:419–427. doi: 10.1037/h0058775. [DOI] [PubMed] [Google Scholar]
- 24.Stuber GD, et al. Excitatory transmission from the amygdala to nucleus accumbens facilitates reward seeking. Nature. 2011;475:377–380. doi: 10.1038/nature10194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Bourin M, Hascoët M. The mouse light/dark box test. Eur. J. Pharmacol. 2003;463:55–65. doi: 10.1016/s0014-2999(03)01274-3. [DOI] [PubMed] [Google Scholar]
- 26.Zhang, G. W., Shen, L., Li, Z., Tao, H. W. & Zhang, L. I. Track-Control, an automatic video-based real-time closed-loop behavioral control toolbox. bioRxiv10.1101/2019.12.11.873372 (2019).
- 27.Norgren R. Projections from the nucleus of the solitary tract in the rat. Neuroscience. 1978;3:207–218. doi: 10.1016/0306-4522(78)90102-1. [DOI] [PubMed] [Google Scholar]
- 28.Karimnamazi H, Travers SP, Travers JB. Oral and gastric input to the parabrachial nucleus of the rat. Brain Res. 2002;957:193–206. doi: 10.1016/s0006-8993(02)03438-8. [DOI] [PubMed] [Google Scholar]
- 29.Fulwiler CE, Saper CB. Subnuclear organization of the efferent connections of the parabrachial nucleus in the rat. Brain Res. Rev. 1984;319:229–259. doi: 10.1016/0165-0173(84)90012-2. [DOI] [PubMed] [Google Scholar]
- 30.Tokita K, Inoue T, Boughter JD. Subnuclear organization of parabrachial efferents to the thalamus, amygdala and lateral hypothalamus in C57BL/6J mice: a quantitative retrograde double labeling study. Neuroscience. 2010;171:351–365. doi: 10.1016/j.neuroscience.2010.08.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Wickersham IR, Finke S, Conzelmann K, Callaway EM. Retrograde neuronal tracing with a deletion- mutant rabies virus. Nat. Methods. 2007;4:2006–2008. doi: 10.1038/NMETH999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Watabe-Uchida M, Zhu L, Ogawa SK, Vamanrao A, Uchida N. Whole-brain mapping of direct inputs to midbrain dopamine. Neurons Neuron. 2012;74:858–873. doi: 10.1016/j.neuron.2012.03.017. [DOI] [PubMed] [Google Scholar]
- 33.Fu O, et al. SatB2-expressing neurons in the parabrachial nucleus encode sweet taste. Cell Rep. 2019;27:1650–1656. doi: 10.1016/j.celrep.2019.04.040. [DOI] [PubMed] [Google Scholar]
- 34.van der Staay FJ, Gieling ET, Pinzón NE, Nordquist RE, Ohl F. The appetitively motivated ‘cognitive’ holeboard: a family of complex spatial discrimination tasks for assessing learning and memory. Neurosci. Biobehav. Rev. 2012;36:379–403. doi: 10.1016/j.neubiorev.2011.07.008. [DOI] [PubMed] [Google Scholar]
- 35.Zhu H, Roth BL. DREADD: A chemogenetic GPCR signaling platform. Int. J. Neuropsychopharmacol. 2015;18:pyu007. doi: 10.1093/ijnp/pyu007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Otis JM, et al. Prefrontal cortex output circuits guide reward seeking through divergent cue encoding. Nature. 2017;543:103–107. doi: 10.1038/nature21376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Zhu Y, et al. Dynamic salience processing in paraventricular thalamus gates associative learning. Science. 2018;362:423–429. doi: 10.1126/science.aat0481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Freund TF. GABAergic septohippocampal neurons contain parvalbumin. Brain Res. 1989;478:375–381. doi: 10.1016/0006-8993(89)91520-5. [DOI] [PubMed] [Google Scholar]
- 39.Xu X, Roby KD, Callaway EM. Immunochemical characterization of inhibitory mouse cortical neurons: Three chemically distinct classes of inhibitory cells. J. Comp. Neurol. 2010;518:389–404. doi: 10.1002/cne.22229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Fang Q, et al. A differential circuit via retino-colliculo-pulvinar pathway enhances feature selectivity in visual cortex through surround suppression. Neuron. 2020;105:355–369. doi: 10.1016/j.neuron.2019.10.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Fuhrmann F, et al. Locomotion, theta oscillations, and the speed-correlated firing of hippocampal neurons are controlled by a medial septal glutamatergic circuit. Neuron. 2015;86:1253–1264. doi: 10.1016/j.neuron.2015.05.001. [DOI] [PubMed] [Google Scholar]
- 42.Buzsáki G. Theta oscillations in the hippocampus. Neuron. 2002;33:325–340. doi: 10.1016/s0896-6273(02)00586-x. [DOI] [PubMed] [Google Scholar]
- 43.Köhler C, Srebro B. Effects of lateral and medial septal lesions on exploratory behavior in the albino rat. Brain Res. 1980;182:423–440. doi: 10.1016/0006-8993(80)91199-3. [DOI] [PubMed] [Google Scholar]
- 44.Baxter MG, Chiba AA. Cognitive functions of the basal forebrain. Curr. Opin. Neurobiol. 1999;9:178–183. doi: 10.1016/s0959-4388(99)80024-5. [DOI] [PubMed] [Google Scholar]
- 45.Carleton A, Accolla R, Simon SA. Coding in the mammalian gustatory system. Trends Neurosci. 2010;33:326–334. doi: 10.1016/j.tins.2010.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Chen X, Gabitto M, Peng Y, Ryba NJP, Zuker CS. A gustotopic map of taste qualities in the mammalian brain. Science. 2011;333:1262–1266. doi: 10.1126/science.1204076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Peng Y, et al. Sweet and bitter taste in the brain of awake behaving animals. Nature. 2015;527:512–515. doi: 10.1038/nature15763. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Wang L, et al. The coding of valence and identity in the mammalian taste system. Nature. 2018;558:127–131. doi: 10.1038/s41586-018-0165-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Yarmolinsky DA, Zuker CS, Ryba NJP. Common sense about taste: from mammals to insects. Cell. 2009;139:234–244. doi: 10.1016/j.cell.2009.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Barretto RPJ, et al. The neural representation of taste quality at the periphery. Nature. 2015;517:373–376. doi: 10.1038/nature13873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Auld DS, Kornecook TJ, Bastianetto S, Quirion R. Alzheimer’s disease and the basal forebrain cholinergic system: Relations to β-amyloid peptides, cognition, and treatment strategies. Prog. Neurobiol. 2002;68:209–245. doi: 10.1016/s0301-0082(02)00079-5. [DOI] [PubMed] [Google Scholar]
- 52.Hangya B, Ranade SP, Lorenc M, Kepecs A. Central cholinergic neurons are rapidly recruited by reinforcement feedback. Cell. 2015;162:1155–1168. doi: 10.1016/j.cell.2015.07.057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Mamad O, McNamara HM, Reilly RB, Tsanov M. Medial septum regulates the hippocampal spatial representation. Front. Behav. Neurosci. 2015;9:166. doi: 10.3389/fnbeh.2015.00166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Gauthier JL, Tank DW. A dedicated population for reward coding in the hippocampus. Neuron. 2018;99:179–193. doi: 10.1016/j.neuron.2018.06.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Freund TF, Antal M. GABA-containing neurons in the septum control inhibitory interneurons in the hippocampus. Nature. 1988;336:170–173. doi: 10.1038/336170a0. [DOI] [PubMed] [Google Scholar]
- 56.Espinosa N, et al. Basal forebrain gating by somatostatin neurons drives prefrontal cortical activity. Cereb. Cortex. 2019;29:42–53. doi: 10.1093/cercor/bhx302. [DOI] [PubMed] [Google Scholar]
- 57.Espinosa N, Alonso A, Lara-Vasquez A, Fuentealba P. Basal forebrain somatostatin cells differentially regulate local gamma oscillations and functionally segregate motor and cognitive circuits. Sci. Rep. 2019;9:1–12. doi: 10.1038/s41598-019-39203-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Urban-Ciecko J, Barth AL. Somatostatin-expressing neurons in cortical networks. Nat. Rev. Neurosci. 2016;17:401–409. doi: 10.1038/nrn.2016.53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Zhang LI, Tao HW, Holt CE, Harris WA, Poo MM. A critical window for cooperation and competition among developing retinotectal synapses. Nature. 1998;395:37–44. doi: 10.1038/25665. [DOI] [PubMed] [Google Scholar]
- 60.Markram H. Regulation of synaptic efficacy by coincidence of postsynaptic APs and EPSPs. Science. 1997;275:213–215. doi: 10.1126/science.275.5297.213. [DOI] [PubMed] [Google Scholar]
- 61.Malenka RC. The long-term potential of LTP. Nat. Rev. Neurosci. 2003;4:923–926. doi: 10.1038/nrn1258. [DOI] [PubMed] [Google Scholar]
- 62.Nabavi S, et al. Engineering a memory with LTD and LTP. Nature. 2014;511:348–352. doi: 10.1038/nature13294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Harrison TC, Pinto L, Brock JR, Dan Y. Calcium imaging of basal forebrain activity during innate and learned behaviors. Front. Neural Circuits. 2016;10:1–12. doi: 10.3389/fncir.2016.00036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Lin S-C, Nicolelis MAL. Neuronal ensemble bursting in the basal forebrain encodes salience irrespective of valence. Neuron. 2008;59:138–149. doi: 10.1016/j.neuron.2008.04.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Cohen JY, Haesler S, Vong L, Lowell BB, Uchida N. Neuron-type-specific signals for reward and punishment in the ventral tegmental area. Nature. 2012;482:85–88. doi: 10.1038/nature10754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Steinberg EE, et al. Amygdala-midbrain connections modulate appetitive and aversive learning. Neuron. 2020;106:1026–1043. doi: 10.1016/j.neuron.2020.03.016. [DOI] [PubMed] [Google Scholar]
- 67.Lin R, et al. The raphe dopamine system controls the expression of incentive memory. Neuron. 2020;106:498–514. doi: 10.1016/j.neuron.2020.02.009. [DOI] [PubMed] [Google Scholar]
- 68.Schultz W. Behavioral theories and the neurophysiology of reward. Annu. Rev. Psychol. 2006;57:87–115. doi: 10.1146/annurev.psych.56.091103.070229. [DOI] [PubMed] [Google Scholar]
- 69.Li, Y. et al. Serotonin neurons in the dorsal raphe nucleus encode reward signals. Nature Commun.7, 10503 (2016). [DOI] [PMC free article] [PubMed]
- 70.Luo M, Li Y, Zhong W. Do dorsal raphe 5-HT neurons encode “beneficialness”? Neurobiol. Learn. Mem. 2016;135:40–49. doi: 10.1016/j.nlm.2016.08.008. [DOI] [PubMed] [Google Scholar]
- 71.Zhang G, et al. Medial preoptic area antagonistically mediates stress-induced anxiety and parental behavior. Nat. Neurosci. 2021;24:516–528. doi: 10.1038/s41593-020-00784-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The full datasets generated in the current study are available from the corresponding author upon requests. Source data are provided with this paper.
Animal detection and lick detection programs are available on https://github.com/li-shen-amy/behavior. DIO for the repository is 10.5281/zenodo.5992295.