Abstract
The dorsal raphe nucleus (DRN) is involved in organizing reward-related behaviours; however, it remains unclear how genetically defined neurons in the DRN of a freely behaving animal respond to various natural rewards. Here we addressed this question using fibre photometry and single-unit recording from serotonin (5-HT) neurons and GABA neurons in the DRN of behaving mice. Rewards including sucrose, food, sex and social interaction rapidly activate 5-HT neurons, but aversive stimuli including quinine and footshock do not. Both expected and unexpected rewards activate 5-HT neurons. After mice learn to wait for sucrose delivery, most 5-HT neurons fire tonically during waiting and then phasically on reward acquisition. Finally, GABA neurons are activated by aversive stimuli but inhibited when mice seek rewards. Thus, DRN 5-HT neurons positively encode a wide range of reward signals during anticipatory and consummatory phases of reward responses. Moreover, GABA neurons play a complementary role in reward processing.
How neurons in the dorsal raphe nucleus (DRN) respond to reward related behaviours is not known. Here, Li and colleagues report that DRN serotoninergic neurons are phasically activated by rewards such as sex, food and sucrose, and tonically activated during reward anticipation, while GABAergic neurons respond to punishment.
Various brain stations cooperate to organize reward-related behaviours. Best known is the midbrain ventral tegmental area, where dopamine neurons fire phasically to encode the discrepancy between the predicted and currently experienced reward1,2,3,4. More mysterious is the midbrain dorsal raphe nucleus (DRN). Its principle neurons project widely over the brain and strongly interconnect with several reward-related brain areas5,6,7,8. These neurons release the transmitter serotonin (5-hydroxytryptamine, 5-HT), which compels interest because 5-HT affects mood and drugs that increase 5-HT levels treat depression9,10.
Increasing evidences implicate that reward processing involves DRN neurons11. The 5-HT signalling system regulates feeding and social behaviours12,13,14. Slow and diffusive 5-HT signals may determine global reward states to modulate mood15. The DRN represents one of the most effective brain sites that drive electrical and optogenetic self-stimulation16,17,18,19. Surprisingly, the stimulation-evoked reward signal is mainly mediated by glutamate, although a majority of 5-HT neurons are glutamatergic and 5-HT also plays a role17. Optogenetic stimulation of 5-HT neurons also promotes waiting for expected reward, suggesting their role in the reward anticipatory behaviour20,21,22,23.
Knowing neuronal activity patterns can help clarify how the DRN contributes to reward processing. In primates and rodents, the activity of DRN neurons is correlated with arousal24, sensory cues25,26, motor activity25,27,28, reward values29,30,31, delay20, task progress32, aversive stimuli31,33 and the absence of reward26. The response diversity may reflect the heterogeneity of DRN neurons in morphology, location and neurotransmitter phenotypes34,35,36. It is challenging to precisely identify neuron types using electrophysiological criteria in extracellular recordings37,38. Using optogenetic tagging, two recent recordings revealed that reward-predicting cues activate approximately half of 5-HT neurons17,39. These two recordings, however, were performed from head-fixed mice that chronically experienced restraint stress. Moreover, the behavioural tasks was limited to classic conditioning, which trained mice to link an olfactory cue with the expected delivery of liquid reward17,39.
Many key questions remain. As food and sex are fundamental to animal survival and reproduction, how do DRN 5-HT neurons respond to these important natural rewards? Moreover, are these cells activated only when a well-trained animal is waiting for expected rewards? Alternatively, can they be also activated after an animal receives the reward, either expected or unexpected? As GABA neurons comprise a major neuron population in the DRN6,40,41, do they behave differently from 5-HT neurons? In the present study, we tackled these questions using fibre photometry of Ca2+ signals and electrophysiological recordings of action potential firing from genetically identified 5-HT and GABA neurons in freely behaving mice. Recording from freely behaving animals is essential for studying the responses to food intake and social interactions. It also avoids restraint-associated inescapable stress that may change the activity of DRN neurons and 5-HT signalling42,43.
Our recordings reveal that DRN 5-HT neurons are rapidly activated when a mouse voluntarily seeks and acquires sucrose, food, sex and social interaction. Moreover, 5-HT neurons are activated by surprising delivery of appetitive but not aversive stimuli. For a mouse that has been trained to wait for sucrose reward, 5-HT neurons encode reward waiting and acquisition through a tonic-then-phasic activation mode. Finally, GABA neurons in the DRN are suppressed during reward seeking but activated by footshock. These results indicate that DRN 5-HT neurons and GABA neurons respond complementarily when an animal expects and receives various rewards, suggesting that the dorsal raphe serves as an important reward-processing station in parallel to the midbrain dopamine centre.
Results
Food and social rewards activate DRN 5-HT neurons
We used fibre photometry to record Ca2+ signals from 5-HT neurons of behaving mice (Fig. 1a). Similar setups accomplished stable recordings from specific neuron populations in the striatum, cortex and ventral tegmental area44,45,46. Following stereotaxic infusion of the Cre-dependent adeno-associated virus AAV-DIO-GCaMP6m into the DRN of SERT-Cre transgenic mice (henceforth referred to as SERT-DRNGCaMP6 mice)47,48, the calcium indicator GCaMP6 was expressed with >98.7% accuracy and efficiency in neurons expressing tryptophan hydroxylase 2 (Supplementary Fig. 1a), the rate-limiting enzyme for central 5-HT synthesis49. We implanted a small optical fibre (230 μm diameter) with its tip into the DRN for chronic recordings of GCaMP fluorescence changes (Supplementary Fig. 1b).
We first examined GCaMP signals when SERT-DRNGCaMP6 mice sought and acquired sucrose solution. A water-deprived mouse received sucrose by licking a nozzle that was linked to a contact lickometer (Fig. 1a). The mouse completed 14 licking bouts within a 20-min behaviour session. Sucrose lick reliably increased GCaMP fluorescence across the behavioural bouts for this mouse (Fig. 1b,c). Multivariate permutation tests revealed statistically significant GCaMP signals in six out of seven test mice (P<0.05; Fig. 1d and Supplementary Fig. 1c). Post-mortem examination revealed that the six responsive recording sites were located in the centre DRN, whereas the non-responsive site was located in the edge of the DRN where GCaMP6 expression was weak (Supplementary Fig. 1b). For the six recordings in the centre DRN, the average signal peak (ΔF/F) was 23.6±4.4% (mean±s.e.m., n=6 mice). The increase of fluorescence signals was tightly coupled to the lick onset and lasted for approximately the same duration of licking (12.8±1.1 s for GCaMP signals versus 14.7±2.1 s for lick, mean±s.e.m., n=6 mice).
Using fibre photometry, we next measured GCaMP responses to food intake. A food-deprived mouse retrieved a small food pellet that was manually delivered to a tray (Fig. 1a). The events of food intake were identified by offline video scoring and time-locked GCaMP signals were segmented and aligned (Fig. 1b,e). Similar to sucrose lick, the events of food intake were reliably associated with significant increase of GCaMP signals (ΔF/F=20.8±1.4%; duration of significant signals=12.1±2.7 s; mean±s.e.m., P<0.05, permutation test, n=7 mice; Fig. 1e,f and Supplementary Fig. 1d). Mice that expressed enhanced membrane green fluorescent protein (EmGFP) did not show any clear fluorescence change following sucrose lick or food intakes (n=6 SERT-DRNEmGFP mice; Supplementary Fig. 1e,f), indicating that the signals in SERT-DRNGCaMP6 mice were Ca2+ in nature and not motion artefacts.
We then investigated how DRN 5-HT neurons respond to social rewards. We applied fibre photometry to record from the DRN of a male mouse in its home cage and simultaneously videotaped its interaction with another mouse. The test male often initiated mating behaviour following the introduction of a sexually receptive female (Fig. 2a). Ca2+ signals increased slightly during the chasing phase and peaked immediately after the onset of successful mounting (Fig. 2b and Supplementary Video 1). We observed a strong Ca2+ increase from all test mice (ΔF/F=47.6%±11.8%, mean±s.e.m., n=6 SERT-DRNGCaMP6 mice; Fig. 2c and Supplementary Fig. 2a). The activation remained significant throughout the mounting episodes (P<0.05, permutation test, 9.0±1.3 s for GCaMP signals versus 9.4±1.3 s for mounting, mean±s.e.m., n=6 mice). We did not observe any significant change of fluorescence signals from SERT-DRNEmGFP control mice (P>0.05, permutation test, n=4 mice; Supplementary Fig. 2b), again indicating that the mating-associated fluorescence increase in SERT-DRNGCaMP6 mice was produced by Ca2+ changes in 5-HT neurons.
To examine the effect of male–male interaction, we introduced a novel male mouse into the home cage of the test male and recorded GCaMP fluorescence for 5 min. The test male frequently investigated the body of the intruder male (Fig. 2d). The chemoinvestigation were associated with increase of GCaMP fluorescence that peaked when the nose of the test mouse touched the body of the social target (Fig. 2e). We observed significant Ca2+ signals from all test mice (ΔF/F=23.7±3.4%, mean±s.e.m., P<0.05, permutation test, n=7 mice; Fig. 2f and Supplementary Fig. 2c). The duration of activation was sublinear to the duration of chemoinvestigation (5.6±1.4 s for GCaMP signals versus 9.5±1.9 s for investigation).
As novelty is rewarding50, we tested whether novelty alone underlay the Ca2+ signals in social interactions. In separate behavioural sessions, we exposed the test male to a novel inanimate object (a cloth dummy mouse) in the home cage (Fig. 2g). Investigation of the neutral object was associated with small and brief Ca2+ transients (ΔF/F=17.0±2.1%, 0.5±0.1 s for GCaMP signals versus 0.8±0.2 s for investigation, mean±s.e.m., n=7 mice, Fig. 2h,i and Supplementary Fig. 2d). Overall, mating produced substantially stronger Ca2+ signals than chemoinvestigation of another male, which in turn produced stronger and much more prolonged responses than object investigation (Supplementary Fig. 2e). The signal amplitudes (peak ΔF/F) were poorly correlated with the duration of behavioural episodes (Supplementary Fig. 2f), indicating that behaviour types rather than duration determined Ca2+ signal amplitudes. Therefore, social rewards activate DRN 5-HT neurons more effectively than general novelty.
Unexpected rewards activate 5-HT neurons
Our results so far demonstrate that DRN 5-HT neurons are activated when a mouse voluntarily seeks rewards and receives them. We next asked what if the mouse received reward unexpectedly—without seeking or effort. We implanted intraoral cheek fistulae to deliver sucrose solution into the oral cavity of a water-deprived mouse (Fig. 3a)51. This approach allowed us to precisely control the volume and duration of sucrose delivery. Reward expectation was minimized by dispensing sucrose solution randomly with an inter-trial interval of 20–40 s. Sucrose delivery (5% w/v, 10 μl in 0.5 s) reliably elicited Ca2+ transients from 5-HT neurons of a SERT-DRNGCaMP6 mouse (Fig. 3b1). Significant Ca2+ increase was observed from all test mice (ΔF/F=40.0±3.4%, mean±s.e.m., P<0.05, permutation test, n=8 mice; Fig. 3d and Supplementary Fig. 3a). The Ca2+ signals rose and decayed rapidly (median latency to peak=0.84 s, decay duration=3.82±0.65 s, mean±s.e.m.; Fig. 3d and Supplementary Fig. 3b). Therefore, sucrose transiently activates 5-HT neurons even when the mouse is not expecting it.
To test whether 5-HT neurons responded to general saliency rather than rewarding quality of sucrose, we examined the effect of two aversive stimuli, a bitter tastant and footshock. Random intraoral delivery of quinine (5 mM, 10 μl in 0.5 s) reduced Ca2+ signals in some trials (Fig. 3b2,e), but the inhibition was not statistically significant for the entire test population (P>0.05, n=8 mice; permutation test; Fig. 3f and Supplementary Fig. 3c). Similarly, electric footshock resulted in a trend of Ca2+ signal reduction that were not statistically significant for the entire test group (P>0.05, n=5 mice, permutation test; Fig. 3b3 and Supplementary Fig. 3d). Thus, aversive stimuli do not activate the 5-HT neuron population in the DRN, indicating that 5-HT neurons encode reward but not general saliency.
To examine the effect of reward value, we compared the GCaMP signals produced by sucrose (5%) with those by water alone. Within a single session of 90 trials, we applied 30 trials of sucrose (5%, 10 μl in 0.5 s), followed by 30 trials of water (10 μl in 0.5 s) and additional 30 trials of sucrose. GCaMP signals in both sucrose blocks were significantly larger than water-evoked signals (P<0.01, Dunn's multiple comparisons test, n=6 mice; Fig. 3i,j). To examine the effect of reward size, we varied the volume of sucrose drops by adjusting infusion speed (16 versus 4 μl s−1) to generate ‘big' rewards (8 μl in 0.5 s) and ‘small' rewards (2 μl in 0.5 s). Big rewards evoked significantly stronger GCaMP signals than small rewards (P<0.01, Dunn's multiple comparisons test, n=6 mice; Fig. 3k,l). Thus, reward quality and size modulate the response amplitudes of 5-HT neurons.
Expected reward activates 5-HT neurons before delivery
How do 5-HT neurons respond when a thirsty mouse expects a reward but is made to wait? We addressed this question by training mice with a simplified sucrose foraging paradigm52. A water-deprived mouse learned to move back and forth between the chamber of ‘trigger zone' and the chamber of ‘reward zone' to collect sucrose drops (Fig. 4a). Exit from the trigger zone triggered an auditory tone to signal trial initiation. Following entries into the reward zone, a mouse typically started poking into the reward port and licked a nozzle for sucrose, which was supplied through a solenoid valve briefly triggered open (50 ms) with a 2-s delay following reward-zone entry. Initially, a thirsty mouse explored and accidentally received sucrose solution from the liquid nozzle. Following two daily training sessions, a mouse learned to move repeatedly from the trigger zone to the reward zone to acquire sucrose drop following the delay (Fig. 4b and Supplementary Fig. 4a,b). By controlling delay duration and reward outcome, we could precisely correlate the Ca2+ signals of 5-HT neurons to various parameters of the self-paced reward-seeking behaviour, such as reward-zone entry, waiting and sucrose acquisition.
In trained mice, the Ca2+ signals ramped up when a mouse entered the reward zone, remained elevated throughout the 2-s delay period and further increased when the mouse finally received the sucrose drop (Fig. 4c,d). On entry to the reward zone, the entire test group reached the peak ΔF/F of 38.9±2.6% (mean±s.e.m.) after a median latency of 2.3 s (n=7 mice; Fig. 4e and Supplementary Fig. 4c–e). The Ca2+ signals rapidly decayed to the baseline after sucrose delivery (Fig. 4f), although the mouse continued to lick for a few more seconds (Supplementary Fig. 4d,e). We did not observe any clear change of Ca2+ signals correlated to the events of trigger-zone exit and re-entry (Fig. 4g,h). Therefore, 5-HT neuron activation is specifically related to waiting and sucrose acquisition in the reward zone.
We varied the reward outcome and delays to dissociate their effects on Ca2+ signals. In separate behavioural sessions, sucrose drops were either delivered following a 2-s delay or randomly omitted in half of trials. The Ca2+ signals were identical during the 2-s delay period for both trials of reward delivery and those of omission (Fig. 5a–c). However, sucrose omission abolished the additional transient increase associated with sucrose delivery (P=0.016, Wilcoxon's signed-rank tests, n=7 mice; Fig. 5a–c and Supplementary Fig. 4d,e), demonstrating an excitatory effect of reward acquisition on 5-HT neurons.
In separate behavioural sessions, we randomly assigned delays of 2 or 5 s to equal number of trials. Increasing the delay extended the initial plateau of Ca2+ signals to 5 s and the signals further increased following sucrose acquisition (n=7 mice; Fig. 5d–f). During the delay period, three out of the seven test mice started licking during the waiting period before sucrose delivery, whereas the other four mice licked only after sucrose delivery (Supplementary Fig. 4d). Regardless of the difference in licking behaviour, all mice exhibited strong Ca2+ signals during the delay (Supplementary Fig. 4e), indicating that waiting-associated activation does not depend on lick movement.
5-HT neurons exhibit the tonic-then-phasic activation pattern
To examine the electrophysiological basis of Ca2+ signals and the response pattern of individual neurons, we carried out single-unit recordings from the DRN, while a mouse performed the sucrose foraging task (Fig. 6a). Extracellular spikes were recorded with a microdrive-controlled optetrode consisting of four tetrodes and a single optical fibre43,53. Torque signals from the recording cable actively controlled a motorized commutator to facilitate mouse moving54. For in vivo optogenetic tagging of 5-HT neurons, we expressed Channelrhodopsin-2 (ChR2) by infusing AAV-DIO-ChR2-mCherry vectors into the DRN of SERT-Cre mice (Fig. 6b). After isolation of single units (Supplementary Fig. 5a), we examined whether brief light pulses (5 ms) could reliably and rapidly elicit spike firing (Fig. 6c–e). We referred to the statistical method that was recently developed for analysing the significance of tagging4,52.
We recorded 80 optogenetically identified 5-HT neurons from 14 mice. Most of 5-HT neurons exhibited activation in the reward zone (‘reward-on') (Fig. 6f,g). The spike firing rates of a 5-HT cell were often low outside the reward zone (<1 spike s−1) and increased to ∼10 spikes s−1 within the reward zone. The neuronal activity remained elevated, while the mouse waited, briefly soared to as high as 60 spikes s−1 following sucrose delivery and then decreased to baseline in 2 s (Fig. 6f,g). Population average revealed that in general, 5-HT neurons fired tonically at ∼6 spikes s−1 during waiting and then phasically at the peak rate of ∼20 spikes s−1 on sucrose delivery (tonic-then-phasic pattern; Fig. 6h). There was no activation when a mouse entered or left the trigger zone (Supplementary Fig. 5b,c), indicating that neuronal activation were specific to the behaviour events in the reward zone.
Some 5-HT neurons deviated from the typical tonic-then-phasic pattern (Fig. 6i). Principal component analysis and hierarchical clustering classified 5-HT neurons into four subtypes. In addition to the typical type-1 cells (46/80 cells, 58%; Fig. 6i,j), type-2 cells (17/80 cells, 21%) exhibited low firing activity in the trigger zone, tonic activation during waiting in the reward zone and weak or no phasic response following sucrose delivery (Fig. 6i,k and Supplementary Fig. 5d,e). Type-3 cells (9/80 cells, 11%) were more active in the trigger zone and exhibited mild but sustained excitation following reward zone entry (Fig. 6i,l and Supplementary Fig. 5d,f). Finally, type-4 5-HT cells (8/80 cells, 10%) also showed higher spontaneous activity but were inhibited for up to 10 s following reward zone entry (‘reward-off'; Fig. 6i,m and Supplementary Fig. 5d,g).
Compared with the optogenetically tagged 5-HT neurons, non-tagged DRN neurons responded much more heterogeneously (n=129 cells; Supplementary Fig. 6). For example, the percentage of ‘reward-off' cells was substantially higher (35% non-tagged neurons versus 10% tagged 5-HT neurons), demonstrating the importance of genetic identification for accurately revealing the response properties of 5-HT neurons.
Given that reward delay and outcome affected the Ca2+ signals of 5-HT neuron populations (Fig. 5), we examined the effects of omitting rewards and extending delay on the spike firing of single units. Regardless of reward outcome, 5-HT neurons exhibited similar tonic activation when mice waited during the 2-s delay, but sucrose omission eliminated the burst firing associated with sucrose delivery (n=20 cells; Supplementary Fig. 7a,b). Increasing the reward delay from 2 to 5 s extended the tonic activation to 5 s, which was switched to brief bursting following reward delivery (n=22 cells; Supplementary Fig. 7c,d). Thus, the tonic excitation and phasic firing of 5-HT neurons correspond to waiting and reward acquisition, respectively.
DRN GABA neurons are inhibited during reward seeking
We performed fibre photometry to investigate how DRN GABA neurons respond in reward-associated tasks. We expressed GCaMP6 in GABA neurons by injecting AAV-DIO-GCaMP6m into the DRN of VGAT-Cre mice (Supplementary Fig. 8a), in which Cre expression was directed to GABA neurons by the promoter region of the gene encoding vesicular GABA transporter (VGAT)55. We tested the effects of sucrose lick, food intake, intraoral delivery of sucrose and quinine, and footshock on GCaMP6 fluorescence in eight VGAT-DRNGCaMP6 mice.
The Ca2+ signals of DRN GABA neurons typically decreased when a mouse voluntarily licked for sucrose (Fig. 7a). Seven out of eight test mice showed this inhibitory response and the inhibition was significant for the entire test population (P<0.05; permutation test; n=7 mice; range of ΔF/F=(−5 to −20%); duration of inhibition=8.4±1.1 s versus duration of lick=13.9±0.9 s; mean±s.e.m.; Fig. 7b and Supplementary Fig. 8b). Food intake resulted in more variable responses. Six of eight test mice exhibited an initial brief activation (∼1 s) followed by more prolonged inhibition, whereas the other two mice showed slow activation (Fig. 7c and Supplementary Fig. 8c). On average, food intake was associated with a mild but significant reduction of Ca2+ signals (P<0.05; permutation test; n=8 mice; duration of inhibition=10.4±2.0 s; mean±s.e.m.; Fig. 7d). The inhibition probably requires voluntary seeking, because random intraoral delivery of sucrose did not change Ca2+ signals (n=6 mice; Supplementary Fig. 8d,e).
We tested the effect of two aversive stimuli, quinine and footshock, on the activity of GABA neurons. Intraoral delivery of quinine did not affect Ca2+ signals (Supplementary Fig. 8f,g). However, footshock reliably elicited rapid and strong increase of Ca2+ signals. The Ca2+ transients reached the peak within 1 s of footshock onset and decayed to baseline in 3 s (ΔF/F=44.6±3.8%; median latency to peak=1.0 s; decay duration=2.8±0.5 s; mean±s.e.m.; n=8 mice; Fig. 7e,f and Supplementary Fig. 8h,i). Thus, aversive stimuli activate GABA neurons in a modality-dependent manner.
We carried out single-unit recordings of DRN GABA neurons from VGAT-ChR2-EYFP transgenic mice (Supplementary Fig. 9a,b)56. Optical tagging identified 45 GABA neurons from four VGAT-ChR2-EYFP mice that underwent training for the sucrose foraging task (Supplementary Fig. 9c). Outside the reward zone, GABA neurons fired spikes at over ten times higher frequency than 5-HT neurons (18.2±2.2 versus1.6±0.2 spikes s−1; mean±s.e.m.; P<0.001; Wilcoxon's rank-sum test; n=45 GABA cells and 80 5-HT cells; Supplementary Fig. 9d).
Most GABA neurons reduced spike firing when a mouse sought sucrose in the reward zone. After a mouse entered the reward zone, a GABA cell decreased its spike firing rates gradually to ∼10 spikes s−1 at the point of reward delivery, remained low while the mouse continued to lick and then recovered when the mouse ceased to poke its nose into the reward port (Fig. 8a and Supplementary Fig. 9e,f). Cluster analysis revealed that 38 cells (84%) exhibited this typical ‘reward-off' activity pattern (type-1 GABA cells; Fig. 8b,c). In addition to the typical type-1 GABA cells, a small number of GABA cells (type-2 GABA cells, 7/45 cells, 16%) did not show significant activity change that was associated with behavioural events in the reward zone or the trigger zone (Supplementary Fig. 9e–i). Despite the presence of this small subset of nonresponsive cells, on average the entire population of GABA cells suppressed their activity when the animals waited and acquired rewards (Supplementary Fig. 9j).
Discussion
We studied how reward signals are encoded by DRN 5-HT and GABA neurons using fibre photometry and single-unit recordings from freely behaving mice. The validity of fibre photometry was demonstrated by the similarity between Ca2+ and electrophysiological signals in the sucrose foraging task. We find that 5-HT neurons are activated by diverse natural rewards, including food and social rewards. Moreover, a majority of 5-HT neurons encodes reward expectation and acquisition using the tonic-then-phasic firing pattern. Finally, GABA neurons are inhibited during reward seeking.
One key finding is that genetically identified 5-HT neurons are activated by different types of natural rewards. Strong Ca2+ transients were evoked by sucrose, food, sex and social interaction, but not by the aversive stimuli of quinine and footshock. DRN 5-HT neurons thus encode not only liquid-based reward signals but also food and social rewards. As sucrose lick, food intake, mounting a female and chemoinvestigation of another male engage different motor acts, the reward responses are unlikely linked to specific motor behaviour27. The lack of Ca2+ responses to quinine and footshock indicates that the rewarding quality but not general saliency mediates neuronal activation. For all rewards tested, the Ca2+ signals rise and decay in seconds. Therefore, in addition to slowly modulating mood at the time scale of minutes to hours15, DRN 5-HT neurons can rapidly broadcast reward signals from different modalities to organize a broad array of reward-related behaviours including drinking, feeding, social interaction, emotion and cognition13.
Moreover, DRN 5-HT neurons are activated when an animal receives rewards that are either expected or surprising. Ca2+ signals increase when a mouse voluntarily seeks and receives a reward. More importantly, random intraoral delivery of sucrose produces similarly strong activation, thus excluding the potential contribution of reward expectation. After a mouse learns to wait for sucrose delivery in the foraging task, earning the sucrose results in additional Ca2+ increase and phasic firing of spikes, indicating that acquisition of expected rewards also activates 5-HT neurons. These observations are in line with the prior reports that the activity of some DRN neurons in primates tracks the actual reward values of water or juice29. Through genetic tagging, this study demonstrates that the reward-tracking cells are serotonergic, suggesting that 5-HT neurons and the 5-HT signalling system mediate reward consummatory behaviours.
To obtain reward following a predicting sensory cue, an animal needs to analyse the cue, approach, wait, consume and finally evaluate reward value and cost. When a mouse waits for sucrose, both Ca2+ signals and spike firing rates are increased during the entire delay period and further enhanced following sucrose delivery. Possibly because of higher temporal precision of electrophysiological recordings, 5-HT neurons exhibit phasic firings of action potentials upon reward delivery. Manipulating delays and reward outcome further revealed that the initial tonic activation corresponds to reward waiting and the later phasic firing corresponds to reward acquisition. Several earlier recordings reported that a subset of DRN neurons were activated when animals waited for reward20,57, again without strong evidences that they were indeed 5-HT cells. The tonic-then-phasic activation pattern of optogenetically verified 5-HT neurons may reconcile recent debates on whether 5-HT neurons function before or after reward acquisition17,21,22,58,59,60. Using different activation modes, these neurons may perform the dual roles of expecting and evaluating rewards.
There exists some heterogeneity among the response patterns of individual 5-HT neurons. Although the tonic-then-phasic mode was typical, some cells are activated only during the delay and some exhibit mild but prolonged activation following reward acquisition. A small number of 5-HT neurons (10%) are even inhibited during waiting and reward acquisition (Fig. 3). 5-HT neurons exhibit different neurochemical phenotypes36,61,62,63. For example, a substantial number of 5-HT neurons express VGLUT3 and are probably glutamatergic, whereas some 5-HT neurons seem to be purely serotonergic and even smaller percentage of 5-HT neurons expresses GABA markers17,34,36,64,65. Genetic manipulation that combine multiple markers may help determine whether the response subtypes correspond to neurochemical phenotypes.
Our experiments show a very different picture from what was described by two recent recordings from head-fixed mice17,39. Liu et al.17 found that ∼60% of Pet-1 neurons were tonically activated following reward-predicting cue. Possibly because the ePet1-Cre mice were head restraint and trained with classical conditioning, and because some Pet-1 neurons (up to 10%) lacked the markers for 5-HT neurons, Pet-1 neurons showed higher spontaneous firing rates (5 versus 1 spikes s−1 here) and lacked phasic responses to sucrose delivery17. Cohen et al.39 reported that, out of 29 optogenetically tagged SERT neurons, 22 cells were robustly and phasically activated by the aversive stimulus of air puff and only 11 cells were weakly and slowly activated by unexpected water delivery (typically <3 spikes s−1). Immediately following reward-predicting cues, 15 cells were briefly (duration ∼200 ms) activated and 2 were briefly inhibited39. From this study, it appeared that 5-HT neurons respond more strongly to aversive stimuli than to rewards39. In contrast, we find that 5-HT neurons are activated predominantly by rewards but not aversive stimuli. During reward waiting, 5-HT neurons exhibit continuous activation throughout the tested delay of up to 5 s. This activation pattern matches well with the theory that 5-HT neurons promotes patience during reward waiting21,22,59,60. Key technical differences may cause the obvious discrepancies. We recorded from freely behaving mice that are probably much less stressed than head-fixed mice undergoing chronic, inescapable restraint. In addition, we used 5% sucrose solution, which is more rewarding than water alone17.
Some 5-HT neurons may also be involved in processing aversive signals66,67. In the mouse DRN, air puff evoked brief action potential firing from a majority of 5-HT neurons39. In the monkey DRN, air puff slowly and mildly modulated the activity of reward-responsive neurons31. Here, the aversive stimuli of quinine and footshock do not evoke any obvious Ca2+ transients from 5-HT neurons, although footshock reliably activates GABA cells. It cannot be excluded that aversive stimuli do activate some 5-HT cells, but the activation may be too weak and/or brief to produce detectable Ca2+ signals. The type-4 5-HT neurons may be the candidate subset of neurons that positively encode aversive signals, as they are inhibited by reward.
In contrast to 5-HT neurons, GABA cells are mostly inhibited by reward. Voluntary sucrose lick and food intake reduce Ca2+ signals in GABA cells, indicating inhibition from their high baseline activity. Consistently, most of individual GABA cells show prolonged inhibition when a mouse waits and receives reward in the sucrose-foraging tasks. On the other hand, these cells are strongly activated by footshock, suggesting excitation by painful stimuli. GABA cells receive overlapping but different inputs from those to 5-HT neurons, interact with local 5-HT neurons, and may even project axons outside the DRN6,8,40,41. Optogenetic inhibition of GABA cells disinhibits 5-HT cells and prevents the acquisition of social avoidance in mice exposed to social threats40. Suppressing the activity of GABA cell may facilitate the activation of DRN 5-HT neurons and contribute to reward signalling. Thus, 5-HT neurons and GABA neurons may have complementary reward codes and synergistically organize behavioural responses to rewards and aversive stimuli.
Methods
Mice
Animal care and use conformed to institutional guidelines of the National Institute of Biological Sciences, Beijing, and to the governmental regulations of China. We used a total of 41 (39 males, 2 females; 8–14 weeks old) SERT-Cre mice (strain name B6.Cg-Tg(Slc6a4-Cre)ET33Gsat; MMRRC, Davis, CA, USA48), 8 (all males, 8–14 weeks old) VGAT-Cre mice (strain name B6.FVB-Tg(Slc32a1-cre)2.1Hzo/FrkJ; Jackson Laboratory, Ben Harbor, Maine, USA) and 4 (all males, 8–14 weeks old) VGAT-ChR2-EYFP mice (Jax strain name B6.Cg-Tg(Slc32a1-COP4*H134R/EYFP)8Gfng/J; a gift from G. Feng at MIT56). Mice were maintained on a 12/12 light/dark cycle and housed in groups of five for 6–8 weeks. After surgery, mice were housed individually on a reverse light–dark cycle (light off at 0800, h) for at least 1 week before further experiments.
Surgery and virus injection
Mice were anaesthetized with pentobarbital (intraperitoneal, 80 mg kg−1) and mounted to a stereotaxic apparatus equipped with an electric heating pad. The skin was cut and a small craniotomy was made 5 mm posterior to the bregma along the midline. Using a microsyringe pump (Nanoliter 2000 Injector, WPI), AAV virus (500 nl) was slowly injected (40 nl min−1) into the DRN through a glass pipette with a 15° angle from caudal to rostral (dorsal–ventral depth=2.5 mm). The glass pipette was left in place for five additional minutes and then slowly withdrawn.
AAV vectors carrying the DIO-ChR2-mCherry, DIO-EmGFP or DIO-GCaMP6m construct were packaged into AAV2/9 serotype with titres 1–5 × 1012 viral particles per ml. We constructed these plasmids by replacing the coding region of ChR2-mCherry in the pAAV-EF1α-DIO-hChR2(H134R)-mCherry plasmid (a gift from K. Deisseroth) with those encoding enhanced membrane GFP (Addgene Plasmid 14757) or GCaMP6m (Addgene Plasmid 40754), respectively.
We implanted intraoral cheek fistula following a previously described procedure51. The surgery was performed 4 days after the injection of AAV-DIO-GCaMP6m virus into the DRN. An incision was made in the cheek lateral and rostral to the first molar and another incision was made in the scalp in front of the ear or in the back neck. A soft Silastic tubing (30 mm in length, 0.30 mm inner diameter (I.D.) and 0.46 mm outer diameter (O.D.); Dow Corning) was inserted 2 mm into the oral cavity through the cheek incision site and then guided subcutaneously out through the scalp incision. An L-shaped 26-gauge (O.D. 0.48 mm) stainless steel tubing was connected to the Silastic tubing and imbedded aside the ceramic ferrule for optical fibre. A piece of polyethylene tubing (10 mm in length, 0.4 mm I.D. and 1.1 mm O.D.) was fit to the exposed end of the L-shaped tubing. A plug was inserted to the exposed end of the polyethylene tubing, to prevent blockage.
Fibre photometry
Following AAV-DIO-GCaMP6m virus injection, an optical fibre (230 μm O.D., 0.37 numerical aperture (NA); Shanghai Fiblaser) was placed in a ceramic ferrule and inserted towards the DRN through the craniotomy. The ceramic ferrule was supported with a skull-penetrating M1 screw and dental acrylic. Mice were individually housed for at least 1 week to recover.
To record fluorescence signals, laser beam from a 488-nm laser (OBIS 488LS; Coherent) was reflected by a dichroic mirror (MD498; Thorlabs), focused by a × 10 objective lens (NA=0.3; Olympus) and then coupled to an optical commutator (Doric Lenses). An optical fibre (230 μm O.D., NA=0.37, 2-m long) guided the light between the commutator and the implanted optical fibre. The laser power was adjusted at the tip of optical fibre to the low level of 0.01–0.02 mW, to minimize bleaching. The GCaMP fluorescence was bandpass filtered (MF525-39, Thorlabs) and collected by a photomultiplier tube (R3896, Hamamatsu). An amplifier (C7319, Hamamatsu) was used to convert the photomultiplier tube current output to voltage signals, which was further filtered through a low-pass filter (40 Hz cut-off; Brownlee 440). The analogue voltage signals were digitalized at 500 Hz and recorded by a Power 1401 digitizer and Spike2 software (CED, Cambridge, UK).
Behavioural tasks
Sucrose lick and food consumption. A mouse was water and food deprived for 12 h before it was placed in a chamber (20 × 20 × 22, L × W × H in cm) equipped with a contact lickometer connected to a bottle filled with 5% sucrose solution (w/v). Dustless precision food pellets (14 mg, Bio-serv) were manually delivered to a small platform (1.2 mm diameter, 1 cm height) in the centre chamber floor with only one pellet per time. A Power 1401 digitizer simultaneously recorded the lick signals from the lickometer and fluorescence signals from the photomultiplier tube. Videos (15–20 min per session) from an overhead infrared camera were synchronized with Power 1401 data acquisition. Bout initiation time for food intake was determined by video scoring.
Intraoral infusion. A mouse was water deprived for 12 h before sucrose but not quinine infusions. A peristaltic pump (AniLab) delivered either sucrose solution (5% w/v), water or quinine solution (5 mM) through the L-shaped tubing. The pump was controlled through an IC board (Arduino Uno R3) using a self-developed MATLAB programme. To examine the effect of sucrose or quinine, 20 infusions of sucrose or quinine solution (speed 20 μl s−1 and duration 0.5 s) were given within one recording session with random intervals ranging from 20 to 40 s. To examine the effect of reward quality, a mouse was tested with 30 consecutive trials of 5% sucrose (speed 20 μl s−1 and duration 0.5 s) that were followed by 30 trials of water and then 30 trials of 5% sucrose again. To examine the effect of reward size, a mouse was tested with 30 trials of big reward (5% sucrose, speed 16 μl s−1 and duration 0.5 s), 30 trials of small reward (5% sucrose, speed 4 μl s−1 and duration 0.5 s) and 30 trials of big reward again. The fluorescence data were aligned to the events of infusion onset.
Footshock. Animals were introduced into an acrylic box (25 × 25 × 30, L × W × H in cm) with metal grid floor. In a single test session, ten footshocks (0.7 mA scrambled, 0.5 s; Beijing TianMingHongYuan) were randomly delivered with inter-trial intervals of 20–40 s. Shock delivery onset was used as the trigger event for data alignment.
Social interaction. The test male mice were housed individually at least for 1 week after surgery. To study the effect of sexual behaviour, a test mouse was habituated in the dark with an optical fibre connected to the fibre ferrule on its head. A female mouse was introduced into the homecage of the test male so that the male gained sexual experience. A single recording session lasted 30 min and the behaviour of the test mouse was video taped with an overhead infrared camera. Mounting onset was characterized as the placement of the forequarter of the male over the hindquarter of the female. Only mountings with rhythmic contractions of the hindquarter of the male were considered successful mounting.
In the male–male interaction sessions, a test male mouse was placed in its own homecage and the mouse behaviour was video recorded in the dark with an overhead infrared camera. After 1 min habituation, we introduced a group-housed male mouse (intruder) and videoed the social interactions for 5 min. In the separate sessions of object investigation, a cloth-made dummy mouse was introduced in the middle of the cage after 1 min habituation and the animal behaviour was recorded for another 5 min. Offline video scoring identified the events of male–male interaction or object investigation, which was defined as when the test mouse touched the intruder mouse or the object with its nose.
Foraging. The simplified foraging task was described in details elsewhere52. Briefly, a mouse was kept on a water restriction schedule to maintain 85–90% of free-drinking body weight. The foraging field consisted of a linear track (45 × 5 × 10, L × W × H in cm) that connected with two end chambers (10 × 10 × 8.3 cm; defined as ‘trigger zone' and the ‘reward zone'). The mouse was trained to run from the trigger zone to the reward zone, to collect drops of sucrose solutions (5%, w/v) and to return from the reward zone to the trigger zone before a new trial was initiated. Mouse body positions were monitored using an overhead camera (Logitech C510, 20 FPS). Exit from the trigger zone activated a brief auditory cue (200 ms at 4 KHz square wave with 0.5 duty) to signal the exit from the trigger zone (trigger-zone-out) and thus trial initiation. Reward delays (2 or 5 s) or reward outcomes were set in pseudorandom orders. Sucrose solution was delivered through a metal tubing (1.5 mm O.D., 3 cm above the floor) in the end wall of reward zone and the delivery was controlled with a solenoid valve (20 μl in 50 ms). Two separate infrared photodiodes detected nose poking through the reward port and licking behaviour. Experimental control and behavioural data acquisition were implemented with an IC board (Arduino Uno R3) and a customized MATLAB programme.
Trial start cue was identical in the omission and delay sessions. In omission sessions, omission trials were randomly intermingled with rewarding trials of 2 s delay after animal's entry into the reward zone. The omission was accompanied by the opening of an adjacent solenoid valve at the same delay to minimize potential difference in auditory cues25. In delay sessions, sucrose was randomly delivered either at 2 s or at 5 s after reward-zone-in events. A mouse completed 40–80 trials in a single session.
Electrophysiological recording and optical tagging
To record single units using optical tagging, we adopted the recent design of optetrode microdrives43. The microdrive was positioned just above the craniotomy with a protruding optetrode in alignment with the track of glass micropipette for virus injection. The optetrode comprised four tetrodes and one optical ferrule (125 μm diameter, NA=0.37). Tetrodes were twined from 12.7 μm Ni-Cr-Fe wires (Stablohm 675, California Fine Wire, CA, USA) and gold plated to reduce impedance to 250–500 KΩ (ref. 17). The optical fibre was housed in a ceramic fibre optic ferrule (2.5 mm O.D. and 126 μm I.D.). The optic fibre and the tetrodes were then glued together, with the tetrode tips extending 500 μm away from the tip of optical fibre53. The optetrode was gradually lowered to a depth of 0.7 mm above the DRN. A silver wire (127 μm dia, A-M System) was attached to one of the four skull-penetrating M1 screws to serve as ground. The microdrive was secured to the skull with dental acrylic. Mice were individually housed for at least 2 weeks for recovery and AAV expression. Food and water were available ad libitum until mice started behavioural training using the sucrose foraging paradigm.
A self-developed 16-channel amplifier recorded electrophysiological signals with built-in band pass filters (0.5–3.6 kHz). One of the channels was selected as a virtual reference to minimize moving artefact. Analogue signals were digitalized at 25 kHz and collected using a Power1401 digitizer and Spike2 software. Single units were sorted offline with Spike2 software. A 25-channel commutator (Crist Instruments) was pulled by a torque-controlled servomotor (Faulhaber 2036U), to facilitate torqueless movement of the test mouse54. At the end of each recording session, the optetrode was lowered 20–60 μm by manually turning an M1 machine screw in the microdrive. The recording was resumed on the next day unless the electrode tips were judged to be outside the DRN.
Optical tagging confirmed the cell type of recorded single units. Light pulses (5 ms, 0.1 or 10 Hz) were passed through the optical fibre to stimulate action potential firing. The light intensity was lowered to reduce spike jittery. Four parameters were calculated: the reliability of light-evoked spiking within 10 ms from light onset (R), correlation coefficient of spike waveform for spontaneous spikes and evoked spikes (C), the latency of triggering spikes after light onset (L) and a statistical P-value to determine whether the spikes were truly evoked by light stimulation52. The P-value was determined by comparing the distribution latencies of light-evoked spikes and a bootstrapped distribution of latencies of spontaneous spikes52. Only neurons with P≤0.001 and C≥0.85 were considered as optically tagged neurons4,52.
Immunostaining
A mouse was deeply anaesthetized with an overdose of pentobarbital and then intracardially perfused with 0.9% saline followed by 4% paraformaldehyde in PBS. After cryoprotection with 30% sucrose, the mouse brain was sectioned coronally (35 μm thick) with a cryostat (Leica CM1900). For immunohistochemistry, the sections were blocked with 3% BSA in PBS with 0.3% Triton X-100 and incubated with a rabbit polyclonal antibody against Tph2 (1:400, Millipore; 20 h) and chicken polyclonal antibody to GFP (1:400, Abcam) at 4 °C overnight. After washing with PBS, the sections were incubated with Cy3-conjugated goat anti-rabbit IgG (1:500; Jackson ImmunoResearch) and fluorescein-conjugated donkey anti-chicken IgG (1:500; Jackson ImmunoResearch) for 1 h at room temperature. PBS-washed sections were then coverslipped with 50% glycerol mounting medium.
Slice recording
VGAT-ChR2 mice were deeply anaesthetized with pentobarbital (100 mg kg−1 intraperitoneally) and intracardially perfused with ∼5 ml ice-cold oxygenated perfusion solution. The mouse brain was quickly placed in ice-cold oxygenated slicing solution. Coronal sections (250 or 300 μm thick) were prepared with a Leica VT1200S vibratome. The slices were incubated for 1 h at 34 °C with Ringer solution saturated with mixed 95% O2 with 5% CO2. After incubation, a slice was transferred to a chamber and superfused with ringer solution (2 ml min−1) at 28 °C for patch clamp recording. Perfusion solution consisted of (in mM) 225 sucrose, 119 NaCl, 2.5 KCl, 0.1 CaCl2, 4.9 MgCl2, 1.0 NaH2PO4, 26.2 NaHCO3, 1.25 glucose, 3 kynurenic acid and 1 Na L-ascorbate. Slicing solution consisted of (in mM) 110 choline chloride, 2.5 KCl, 0.5 CaCl2, 7 MgCl2, 1.3 NaH2PO4, 25 NaHCO3, 20 glucose, 1.3 Na ascorbate and 0.6 Na pyruvate. Ringer solution consisted of (in mM) 125 NaCl, 2.5 KCl, 2 CaCl2, 1.3 MgCl2, 1.3 NaH2PO4, 25 NaHCO3, 10 glucose, 1.3 Na ascorbate and 0.6 Na pyruvate. For the whole-cell recording, the pipettes were filled with internal solution that contained (in mM) 130 K-gluconate, 10 HEPES, 0.6 EGTA, 5 KCl, 3 Na2ATP, 0.3 Na3GTP, 4 MgCl2 and 10 Na2-phosphocreatine (pH 7.2–7.4). Picrotoxin (50 μM) was added to the superfusion aCSF through the dilution of stock solutions. All chemicals for brain slicing recording were from Sigma-Aldrich. Whole-cell voltage-clamp recordings were performed with a MultiClamp 700B amplifier (Molecular Devices; USA). The neurons were clamped between −50 mV. The traces were low-pass filtered at 3 kHz and digitized at 10 kHz (DigiData 1440, Molecular Devices).
Data analysis and statistical tests
No data were excluded. Photometry data were exported to MATLAB Mat files from Spike2 for further analysis. After smoothing the data with a moving average filter (20 ms span), we segmented the data based on behavioural events within individual trials or bouts. We derived the values of fluorescence change (ΔF/F) by calculating (F−F0)/F0, where F0 is the baseline fluorescence signal averaged over a 1.5-s-long control time window, which was typically set 0.5 s preceding the trigger events. For analysing the responses to mounting behaviour, the control time window was set 2.5 s before mounting onset to minimize the effect of active chasing. ΔF/F values were presented with heatmaps or average plots with a shaded area indicating s.e.m.
We used multivariate permutation tests to analyse the statistical significance of the event-related fluorescence change (ERF) or peri-event time histograms (PETHs) of spike firing rates as described elsewhere68. We used 1,000 permutations for an α-level of 0.05 to compare the values of ΔF/F or spike firing rate at each time point with the ERF or PETH baseline values. A series of statistical P-value at each time point were generated and the statistical results were superimposed on the average ERF or PETH curve with red and blue lines indicating statistically significant (P<0.05) increase or decrease, respectively.
To calculate the amplitude and activation duration of ΔF/F values, we first segmented the data based on the behavioural events of sucrose lick, food intake, mounting, male–male investigation or object investigation. The 95% fraction of the baseline ΔF/F values defined as the up threshold value and the 5% fraction as the low threshold. We then detected the local signal peaks during a given interaction bout using the MATLAB findpeaks function. The response amplitude of a behavioural bout was calculated by averaging all the local peaks above the up threshold value. The activation duration and inhibition duration was calculated by summing the time points above the up threshold or below the low threshold, respectively. The average bout peaks, activation duration and inhibition duration were averaged across the behavioural session to report the corresponding values of a given mouse. Wilcoxon's rank-sum test was applied to report the statistical significance of difference between mounting and male–male interaction. Non-parametric Wilcoxon's signed-rank test was performed to test the statistical significance of the difference between male–male interaction and object investigation.
PETH of spike firing rates (bin width 50 ms) were smoothed with a Gaussian kernel (σ=50 ms) and then presented either with heatmaps or average plots. To calculate Z-scores, we used the mean firing frequency of the entire trial instead of a control period, because most 5-HT neurons were silent (<1 spikes s−1) outside the reward zone. Hierarchical clustering was carried out in three steps. We first reduced the dimensionality of Z-scored firing activity via principle component analysis. The first three major principle components were then used to calculate Euclidean distance metric. The complete agglomeration method was finally applied to build the hierarchy of clusters and plot dendrograms in MATLAB.
Kolmogorov–Smirnov tests were carried out to evaluate the difference of trial durations, running times and poke latencies between training days in the one-arm foraging task. To examine the effects of reward omission, we first determined the peak values of the ERF or PETH data 2–3 s after the reward-zone-in event, when sucrose was delivered in rewarding trials. To examine the effect of reward delay, we measured the peak values of the ERF or PETH data at the point of sucrose delivery and the areas under the ERF or PETH curves during the delay phase (0–2 s and 0–5 s, respectively). Non-parametric Wilcoxon's signed-rank tests were performed to calculate statistical P-values.
Additional information
How to cite this article: Li, Y. et al. Serotonin neurons in the dorsal raphe nucleus encode reward signals. Nat. Commun. 7:10503 doi: 10.1038/ncomms10503 (2016).
Supplementary Material
Acknowledgments
We thank P. Sterling for comments and G. Feng (MIT) for VGAT-ChR2-EYFP mice. M.L. is supported by China MOST (2012CB837701 and 2012YQ03026005), NNSFC (91432114) and Beijing Municipal Government.
Footnotes
Author contributions Y.L., W.Z., D.W., Q.F., J. Zhou, F.H., Z.L. and J. Zeng. performed experiments and analysed data. J. Zhou. and C.J. built the tetrodes amplifier. Q.G., J. Zhou. and L.F. built the fibre photometry system. Y.L. and M.L. conceived the project, designed experiments and wrote the paper.
References
- Schultz W., Dayan P. & Montague P. R. A neural substrate of prediction and reward. Science 275, 1593–1599 (1997). [DOI] [PubMed] [Google Scholar]
- Wise R. A. Dopamine, learning and motivation. Nat. Rev. Neurosci. 5, 483–494 (2004). [DOI] [PubMed] [Google Scholar]
- Berridge K. C. The debate over dopamine's role in reward: the case for incentive salience. Psychopharmacology (Berl) 191, 391–431 (2007). [DOI] [PubMed] [Google Scholar]
- Cohen J. Y., Haesler S., Vong L., Lowell B. B. & Uchida N. Neuron-type-specific signals for reward and punishment in the ventral tegmental area. Nature 482, 85–88 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jacobs B. L. & Azmitia E. C. Structure and function of the brain serotonin system. Physiol. Rev. 72, 165–229 (1992). [DOI] [PubMed] [Google Scholar]
- Weissbourd B. et al. Presynaptic partners of dorsal raphe serotonergic and GABAergic neurons. Neuron 83, 645–662 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ogawa S. K., Cohen J. Y., Hwang D., Uchida N. & Watabe-Uchida M. Organization of monosynaptic inputs to the serotonin and dopamine neuromodulatory systems. Cell Rep. 8, 1105–1118 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dorocic I. P. et al. A whole-brain atlas of inputs to serotonergic neurons of the dorsal and median raphe nuclei. Neuron 83, 663–678 (2014). [DOI] [PubMed] [Google Scholar]
- Mann J. J. Role of the serotonergic system in the pathogenesis of major depression and suicidal behavior. Neuropsychopharmacology 21, 99S–105S (1999). [DOI] [PubMed] [Google Scholar]
- Vaswani M., Linda F. K. & Ramesh S. Role of selective serotonin reuptake inhibitors in psychiatric disorders: a comprehensive review. Prog. Neuropsychopharmacol. Biol. Psychiatry 27, 85–102 (2003). [DOI] [PubMed] [Google Scholar]
- Luo M., Zhou J. & Liu Z. Reward processing by the dorsal raphe nucleus: 5-HT and beyond. Learn. Mem. 22, 452–460 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simansky K. J. Serotonergic control of the organization of feeding and satiety. Behav. Brain Res. 73, 37–42 (1996). [DOI] [PubMed] [Google Scholar]
- Lucki I. The spectrum of behaviors influenced by serotonin. Biol. Psychiatry 44, 151–162 (1998). [DOI] [PubMed] [Google Scholar]
- Liu Y. et al. Molecular regulation of sexual preference revealed by genetic studies of 5-HT in the brains of male mice. Nature 472, 95–99 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cools R., Nakamura K. & Daw N. D. Serotonin and dopamine: unifying affective, activational, and decision functions. Neuropsychopharmacology 36, 98–113 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simon H., Le Moal M. & Cardo B. Intracranial self-stimulation from the dorsal raphe nucleus of the rat: effects of the injection of para-chlorophenylalanine and of alpha-methylparatyrosine. Behav. Biol. 16, 353–364 (1976). [DOI] [PubMed] [Google Scholar]
- Liu Z. et al. Dorsal raphe neurons signal reward through 5-HT and glutamate. Neuron. 81, 1360–1374 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qi J. et al. A glutamatergic reward input from the dorsal raphe to ventral tegmental area dopamine neurons. Nat. Commun. 5, 5390 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- McDevitt R. A. et al. Serotonergic versus nonserotonergic dorsal raphe projection neurons: differential participation in reward circuitry. Cell Rep. 8, 1857–1869 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miyazaki K., Miyazaki K. W. & Doya K. Activation of dorsal raphe serotonin neurons underlies waiting for delayed rewards. J. Neurosci. 31, 469–479 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miyazaki K. W. et al. Optogenetic activation of dorsal raphe serotonin neurons enhances patience for future rewards. Curr. Biol. 24, 2033–2040 (2014). [DOI] [PubMed] [Google Scholar]
- Fonseca Madalena S., Murakami M. & Mainen Zachary F. Activation of dorsal raphe serotonergic neurons promotes waiting but is not reinforcing. Curr. Biol. 25, 306–315 (2015). [DOI] [PubMed] [Google Scholar]
- Zhou J., Jia C., Feng Q., Bao J. & Luo M. Prospective coding of dorsal raphe reward signals by the orbitofrontal cortex. J. Neurosci. 35, 2717–2730 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sakai K. & Crochet S. Differentiation of presumed serotonergic dorsal raphe neurons in relation to behavior and wake–sleep states. Neuroscience 104, 1141–1155 (2001). [DOI] [PubMed] [Google Scholar]
- Ranade S. P. & Mainen Z. F. Transient firing of dorsal raphe neurons encodes diverse and specific sensory, motor, and reward events. J. Neurophysiol. 102, 3026–3037 (2009). [DOI] [PubMed] [Google Scholar]
- Li Y., Dalphin N. & Hyland B. I. Association with reward negatively modulates short latency phasic conditioned responses of dorsal raphe nucleus neurons in freely moving rats. J. Neurosci. 33, 5065–5078 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fornal C. A., Metzler C. W., Marrosu F., Ribiero-do-Valle L. E. & Jacobs B. L. A subgroup of dorsal raphe serotonergic neurons in the cat is strongly activated during oral-buccal movements. Brain Res. 716, 123–133 (1996). [DOI] [PubMed] [Google Scholar]
- Waterhouse B. D., Devilbiss D., Seiple S. & Markowitz R. Sensorimotor-related discharge of simultaneously recorded, single neurons in the dorsal raphe nucleus of the awake, unrestrained rat. Brain Res. 1000, 183–191 (2004). [DOI] [PubMed] [Google Scholar]
- Nakamura K., Matsumoto M. & Hikosaka O. Reward-dependent modulation of neuronal activity in the primate dorsal raphe nucleus. J. Neurosci. 28, 5331–5343 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bromberg-Martin E. S., Hikosaka O. & Nakamura K. Coding of task reward value in the dorsal raphe nucleus. J. Neurosci. 30, 6262–6272 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hayashi K., Nakao K. & Nakamura K. Appetitive and aversive information coding in the primate dorsal raphe nucleus. J. Neurosci. 35, 6195–6208 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Inaba K. et al. Neurons in monkey dorsal raphe nucleus code beginning and progress of step-by-step schedule, reward expectation, and amount of reward outcome in the reward schedule task. J. Neurosci. 33, 3477–3491 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schweimer J. V. & Ungless M. A. Phasic responses in dorsal raphe serotonin neurons to noxious stimuli. Neuroscience 171, 1209–1215 (2010). [DOI] [PubMed] [Google Scholar]
- Hioki H. et al. Vesicular glutamate transporter 3-expressing nonserotonergic projection neurons constitute a subregion in the rat midbrain raphe nuclei. J. Comp. Neurol. 518, 668–686 (2010). [DOI] [PubMed] [Google Scholar]
- Fu W. et al. Chemical neuroanatomy of the dorsal raphe nucleus and adjacent structures of the mouse brain. J. Comp. Neurol. 518, 3464–3494 (2010). [DOI] [PubMed] [Google Scholar]
- Calizo L. H. et al. Raphe serotonin neurons are not homogenous: electrophysiological, morphological and neurochemical evidence. Neuropharmacology 61, 524–543 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allers K. A. & Sharp T. Neurochemical and anatomical identification of fast- and slow-firing neurones in the rat dorsal raphe nucleus using juxtacellular labelling methods in vivo. Neuroscience 122, 193–204 (2003). [DOI] [PubMed] [Google Scholar]
- Hajos M. et al. Neurochemical identification of stereotypic burst-firing neurons in the rat dorsal raphe nucleus using juxtacellular labelling methods. Eur. J. Neurosci. 25, 119–126 (2007). [DOI] [PubMed] [Google Scholar]
- Cohen J. Y., Amoroso M. W. & Uchida N. Serotonergic neurons signal reward and punishment on multiple timescales. Elife 4,, doi:10.7554/eLife.06346 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Challis C. et al. Raphe GABAergic neurons mediate the acquisition of avoidance after social defeat. J. Neurosci. 33, 13978–13988 13988a (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spoida K., Masseck O. A., Deneris E. S. & Herlitze S. Gq/5-HT2c receptor signals activate a local GABAergic inhibitory feedback circuit to modulate serotonergic firing and anxiety in mice. Proc. Natl Acad. Sci. USA 111, 6479–6484 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Amat J. et al. Medial prefrontal cortex determines how stressor controllability affects behavior and dorsal raphe nucleus. Nat. Neurosci. 8, 365–371 (2005). [DOI] [PubMed] [Google Scholar]
- Warden M. R. et al. A prefrontal cortex-brainstem neuronal projection that controls response to behavioural challenge. Nature 492, 428–432 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Flusberg B. A. et al. Fiber-optic fluorescence imaging. Nat. Methods 2, 941–950 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cui G. et al. Concurrent activation of striatal direct and indirect pathways during action initiation. Nature 494, 238–242 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gunaydin L. A. et al. Natural neural projection dynamics underlying social behavior. Cell 157, 1535–1551 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen T. W. et al. Ultrasensitive fluorescent proteins for imaging neuronal activity. Nature 499, 295–300 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gong S. et al. Targeting Cre recombinase to specific neuron populations with bacterial artificial chromosome constructs. J. Neurosci. 27, 9817–9823 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang X., Beaulieu J. M., Sotnikova T. D., Gainetdinov R. R. & Caron M. G. Tryptophan hydroxylase-2 controls brain serotonin synthesis. Science 305, 217 (2004). [DOI] [PubMed] [Google Scholar]
- Bevins R. A. & Besheer J. Novelty reward as a measure of anhedonia. Neurosci. Biobehav. Rev. 29, 707–714 (2005). [DOI] [PubMed] [Google Scholar]
- Hintiryan H., Hayes U. L. & Chambers K. C. Intraoral cheek fistulae: a refined technique. Lab. Anim. 40, 456–464 (2006). [DOI] [PubMed] [Google Scholar]
- Kvitsiani D. et al. Distinct behavioural and network correlates of two interneuron types in prefrontal cortex. Nature 498, 363–366 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Anikeeva P. et al. Optetrode: a multichannel readout for optogenetic control in freely moving mice. Nat. Neurosci. 15, 163–170 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luo M., Fee M. S. & Katz L. C. Encoding pheromonal signals in the accessory olfactory bulb of behaving mice. Science 299, 1196–1201 (2003). [DOI] [PubMed] [Google Scholar]
- Chao H. T. et al. Dysfunction in GABA signalling mediates autism-like stereotypies and Rett syndrome phenotypes. Nature 468, 263–269 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao S. et al. Cell type-specific channelrhodopsin-2 transgenic mice for optogenetic dissection of neural circuitry function. Nat. Methods 8, 745–752 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miyazaki K. W., Miyazaki K. & Doya K. Activation of the central serotonergic system in response to delayed but not omitted rewards. Eur. J. Neurosci. 33, 153–160 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ranade S., Pi H. J. & Kepecs A. Neuroscience: waiting for serotonin. Curr. Biol. 24, R803–R805 (2014). [DOI] [PubMed] [Google Scholar]
- Miyazaki K., Miyazaki K. W. & Doya K. The role of serotonin in the regulation of patience and impulsivity. Mol. Neurobiol. 45, 213–224 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miyazaki K. W., Miyazaki K. & Doya K. Activation of dorsal raphe serotonin neurons is necessary for waiting for delayed rewards. J. Neurosci. 32, 10451–10457 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kocsis B., Varga V., Dahan L. & Sik A. Serotonergic neuron diversity: identification of raphe neurons with discharges time-locked to the hippocampal theta rhythm. Proc. Natl Acad. Sci. USA 103, 1059–1064 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jensen P. et al. Redefining the serotonergic system by genetic lineage. Nat. Neurosci. 11, 417–419 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu C. et al. Pet-1 is required across different stages of life to regulate serotonergic function. Nat. Neurosci. 13, 1190–1198 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gras C. et al. A third vesicular glutamate transporter expressed by cholinergic and serotoninergic neurons. J. Neurosci. 22, 5442–5451 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shikanai H. et al. Distinct neurochemical and functional properties of GAD67-containing 5-HT neurons in the rat dorsal raphe nucleus. J. Neurosci. 32, 14415–14426 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dayan P. & Huys Q. J. Serotonin in affective control. Annu. Rev. Neurosci. 32, 95–126 (2009). [DOI] [PubMed] [Google Scholar]
- Boureau Y. L. & Dayan P. Opponency revisited: competition and cooperation between dopamine and serotonin. Neuropsychopharmacology 36, 74–97 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Groppe D. M., Urbach T. P. & Kutas M. Mass univariate analysis of event-related brain potentials/fields II: Simulation studies. Psychophysiology 48, 1726–1737 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.