Abstract
Serotonin is known to play a key role in the regulation of emotional behavior. There have been conflicting hypotheses about whether the central serotonergic system is involved in positive or negative emotional information processing. To reveal whether and how such opposing information processing can be achieved by single neurons in the dorsal raphé nucleus (DRN), the major source of serotonin in the forebrain, we recorded the activity of DRN neurons while monkeys were conditioned in a Pavlovian procedure with two distinct contexts: an appetitive block where a reward was available; and an aversive one where an airpuff was delivered. We found that single DRN neurons were involved in several aspects of both appetitive and aversive information processing. First, more than half of the recorded DRN neurons discriminated between appetitive and aversive contexts by tonic changes in their activity. In the appetitive context, they then kept track of the expected reward value indicated by the conditioned stimuli. Some of them also encoded an error between the obtained and expected values. In the aversive context, the same neurons maintained tonic modulation in their activity throughout the block. However, modulation of their responses to aversive task events depending on airpuff probability was less common. Together, these results indicate that single DRN neurons encode both appetitive and aversive information, but over differing time scales: relatively shorter for appetitive, and longer for aversive. Such temporally distinct processes of value coding in the DRN may provide the neural basis of emotional information processing in different contexts.
Keywords: dorsal raphé nucleus, monkey, Pavlovian conditioning, serotonin, single-unit recording
Introduction
Making a decision requires the brain to compute values associated with different contexts, stimuli, and outcomes in appetitive and aversive terms. Dopamine and serotonin appear to be the major neurotransmitters in this process. The role of the dopaminergic system has been studied extensively, and it is implicated in signaling a “reward prediction error” (RPE), which is the difference between the expected reward, called the “state value,” and the actual reward (Mirenowicz and Schultz, 1994; Montague et al., 1996).
In contrast, the role of serotonin remains more elusive. One of the leading hypotheses is that the serotoninergic system is involved in aversive information processing. Aversive cues or outcomes enhance c-Fos expression (Pezzone et al., 1993; Grahn et al., 1999; Takase et al., 2004), neuronal activity (Schweimer and Ungless, 2010), and serotonin levels (Kirby et al., 1995) in the dorsal raphé nucleus (DRN), the major source of serotonin in the forebrain. Serotonin depletion enhances motivational behavior in rats (Fletcher et al., 1999). The inhibition of serotonergic neurons also leads to dopamine-dependent rewarding effects (Liu and Ikemoto, 2007). Thus, the serotonergic system has been thought to encode aversive information and have an opponent influence on the reward circuitry. The function of the serotonergic system in aversive information processing has also been reported in humans; systemic reduction in serotonin levels led to exaggerated aversive processing and deficient response inhibition (for review, Cools et al., 2008).
However, some electrophysiological studies in animals suggest a seemingly opposite hypothesis: DRN neurons encode expectation and the receipt of rewards (Nakamura et al., 2008; Ranade and Mainen, 2009; Bromberg-Martin et al., 2010; Miyazaki et al., 2011; Inaba et al., 2013). This notion is supported by the facts that electrical (Miliaressis et al., 1975; Miliaressis, 1977) or optogenetic (Liu et al., 2014) stimulation of the DRN induced reward-oriented behavior. How can serotonergic neurons be involved in such opposing processes? One possibility is that different populations of DRN neurons compute appetitive and aversive information separately. Alternatively, single DRN neurons are capable of encoding both appetitive and aversive information.
Using a saccade task, we previously showed that DRN neurons encoded expected reward value continuously throughout given trials (Bromberg-Martin et al., 2010). However, what was lacking in the previous study was an examination of whether the same DRN neurons also respond to aversive stimuli. To answer this question, we made recordings from neurons in the DRN while monkeys were conditioned in a Pavlovian procedure. We used a discrete task with two contexts, one in which rewards may occur (appetitive blocks) and another in which airpuffs may occur (aversive blocks). Additionally, to investigate how DRN neurons encode the expectation of appetitive and aversive stimuli, rewards or airpuffs were probabilistically delivered in each block. These distinct “blocks” enabled us to examine continuous encoding of the emotional contexts as well as value coding of task events with different probabilities. The characteristics of this task were such that, for single DRN neurons, we could directly compare appetitive and aversive information processing throughout the block over different time scales and discrete events.
Materials and Methods
We used both hemispheres of two cynomolgus monkeys (Macaca fascicularis; laboratory designations: monkey H, female, 5.4 kg; monkey S, male, 7.0 kg). All experimental procedures were performed in accordance with the National Institutes of Health Guidelines for the Care and Use of Laboratory Animals and were approved by the institutional Animal Care and Use Committee at Kansai Medical University.
Each monkey was implanted with a post to allow for the maintenance of a stable head position. Eye position was monitored using an infrared video-tracking system at a time resolution of 240 Hz and spatial resolution of 0.1° (EYE-TRAC 6; ASL). Stimuli generated by an active matrix liquid crystal display projector (ELP-505; Epson) were rear projected onto a frontoparallel screen located 60 cm from the monkey's eyes. All aspects of the behavioral experiment, including stimuli presentation, monitoring of eye movements and neuronal activity, and outcome delivery were under the control of a real-time experimentation data acquisition system (Tempo; Reflective Computing).
The activity of single neurons was recorded using tungsten electrodes (diameter, 0.25 mm; impedance, 1–3 MΩ; Frederick Haer). The signal was amplified with a bandpass filter (150 Hz–3 kHz; MEG-5100; Nihon Kohden) and collected at 1 kHz. Single-neuron activity was isolated by a template-matching spike discriminator (time resolution, 50 kHz for waveform matching and spike sampling; Alpha Omega). We also collected spike waveforms for each recorded neuron.
Behavioral task.
The monkeys were trained to learn stimulus–outcome associations in a Pavlovian trace-conditioning task that consisted of the following two distinct blocks of trials: appetitive blocks (Fig. 1A, left) and aversive blocks (Fig. 1A, right). In the appetitive block, three different conditioned stimuli (CSs; green triangle, red open square, and yellow ring for monkey H; red circle, green cross, and blue square for monkey S) were associated with a liquid reward as an unconditioned stimulus (US) with 100%, 50%, and 0% probability, respectively. In the aversive block, three CSs (red circle, blue square, and yellow ring for monkey H; yellow ring, green triangle, and blue square for monkey S) were associated with an airpuff directed at the monkey's face as an aversive US with 100%, 50%, and 0% probability, respectively. The CSs for the “0% reward” and “0% airpuff” trials were identical for both types of blocks. When a reward or an airpuff was not presented (i.e., “0% reward” trials or non-reward trials of the “50% reward” trials in the appetitive block, and “0% airpuff” trials or non-airpuff trials of the “50% airpuff” trials in the aversive block), a tone was sounded. The CSs were 10° in size. For the US, a liquid reward (0.1 ml) was delivered through a spigot that was positioned in front of the monkey's mouth or an airpuff (0.03–0.05 MPa) was delivered through a narrow tube placed 1–2 cm from the face. A tone was presented from a speaker located under the screen. All of these outcomes were delivered for 100 ms.
Each trial started after the presentation of a timing cue (TC; white circle; size, 1–5°) in the middle of the screen for both blocks. The monkeys were not required to fixate on it. After 1 s, the TC disappeared, and one of the three CSs was presented for 1.5 s. The CS disappeared, and after a 1.0 s delay (trace period) the US was delivered. To examine the responses to unpredicted outcomes, in addition to the cued trials, noncued trials were also included, in which a reward alone (free reward) or a tone alone (free tone) was delivered in the appetitive block, or an airpuff alone (free airpuff) or a tone alone (free tone) was delivered in the aversive block. All trials were presented with a random intertrial interval (ITI) of 7–9 s. One block typically consisted of 64 trials with fixed proportions of trial types (100%, 16 trials; 50%, 16 trials; 0%, 16 trials; free reward or airpuff, 8 trials; free tone; 8 trials).
For each neuron, we collected data by repeating the appetitive and aversive blocks two or more times. The order of the blocks was counterbalanced across the experiments. There was a several minute delay between the blocks, and there was no cue for when and which block would be started (Fig. 1B).
Analyses of behavioral data.
We analyzed anticipatory licking and blinking as behavioral measures to estimate the monkey's valuation of appetitive and aversive stimuli. Licking behavior was detected by a vibration sensor, and the signal was digitized by a threshold-crossing algorithm. Blinking was detected by the saturation of the eye position signal in either the horizontal or vertical axis. The data were normalized as (x − Min/(Max − Min), where x was the mean frequency of licking and blinking for the given task period, and Max and Min were the maximum and minimum frequencies of licking or blinking in the entire recording session, respectively. Because anticipatory licking and blinking were observed most often during the second half of the trace period (Fig. 1C), quantitative comparisons between conditions were made by analyzing the rates of licking and blinking during 500 ms before the US onset. To examine the difference in the frequency of these measures depending on the CS, we performed one-way repeated-measures ANOVA followed by multiple comparisons using Bonferroni correction (Fig. 1D).
Localization of the DRN.
We used the same technique to localize the DRN as has been described previously (Nakamura et al., 2008). A circular CILUX recording chamber (Crist Instruments), angled 35° posteriorly, was implanted over the midline of the parietal cortex to access the brainstem between the superior and inferior colliculi. The location of the DRN was verified by overlaying penetration record maps on magnetic resonance (MR) images (0.3 T; AIRIS; Hitachi). As shown in Figure 1E, MR images were obtained with the recording chamber filled with povidone-iodine ointment, which clarified the relative location of the recording chamber and the underlying brain structure.
During the initial survey of the DRN, the following brain structures were identified and were particularly useful as landmarks: the superior colliculus, with receptive fields in the upper visual field with large eccentricities; the mesencephalic trigeminal nucleus, with responses to mouth movements; and, most importantly, the trochlear nucleus with increased firing during downward eye movements. The DRN is located ventral to the central aqueduct (Molliver, 1987). It has also been reported that, in the primate, the lateral components (wings) of the DRN form the larger division, and the wings are best developed at the level of the trochlear nucleus (Jacobs and Azmitia, 1992). We therefore analyzed neurons that were located (1) 0–2 mm anterior to the trochlear nucleus and (2) 1–2 mm from the midline, all of which were well estimated by neuronal landmarks, as described above.
At the end of the recording session, microlesions were placed in the DRN to histologically identify the recording sites where many task-related neurons were recorded. After completion of all experiments, monkey S was deeply anesthetized with sodium pentobarbital and intracardially perfused with saline followed by 4% paraformaldehyde. The brain was removed and cut in coronal sections (50 μm) for Nissl staining with cresyl violet (Fig. 1F).
Analyses of neuronal activity.
The DRN includes serotonergic and nonserotonergic neurons. Serotonergic neurons reportedly do not always differ significantly from non-serotonergic neurons in terms of their electrophysiological features, such as the baseline firing rate, spike waveform, and irregularity of firing pattern (Allers and Sharp, 2003; Kocsis et al., 2006). Therefore, rather than choosing neurons with specific electrophysiological properties, we unselectively recorded the activity of neurons encountered in the DRN during the experimental sessions. The data were chosen for further analyses off-line, as described below.
For each neuron, we computed the mean firing rates for the following task periods: the ITI during 500 ms before TC onset in the second and later trials; the TC period 0–1000 ms after TC onset in the second and later trials; and the trace period 1000 ms before US onset.
The time window for analyzing CS and US responses was chosen based on their latencies. To measure neural response latencies, we referred to methods reported previously (Maunsell and Gibson, 1992; Morrison and Salzman, 2009) that compared activity following CS or US onset with the preceding TC (for CS) or trace (for US) activity. Specifically, we first constructed a histogram of the number of spikes in 20 ms bins, shifted by 2 ms. Next, we determined which of the 20 ms bins, shifted in 2 ms steps, met a criterion response during the 500 ms after CS or US onset. The criterion for an excitatory (or inhibitory) response was met if at least 10 consecutive overlapping bins contained more (or fewer) spikes than 95% of the activity during the preceding period. Latency was defined as the beginning of the first of these 10 consecutive significantly different bins (20 ms duration). We also performed similar analyses using 20 consecutive bins, which gave consistent latencies.
As 82.5% of the analyzed neurons (203 of 246 neurons) started to change their activity immediately following the CS presentation, the standard window of 100–500 ms after CS onset was applied to these neurons. For 43 of 246 neurons whose response to the CS started later than 400 ms, the activity during the 500–1500 ms after CS onset was analyzed. For each neuron, the same window was used for the appetitive and aversive blocks, blind to the valence. The standard time window for the US responses was 100–500 ms after US onset. For the neurons whose US response latency was very short, 25–125 ms (latency <100 ms) or 50–250 ms (latency <125 ms) windows were used. This was often the case for the response to an airpuff in the aversive block (102 of 246 neurons), but also occurred after rewards in the appetitive block (39 of 246 neurons). Thus, for analyses of the US responses in each block, different windows may have been applied to some neurons, depending on their response latencies (Matsumoto and Hikosaka, 2009a,b).
The magnitudes of the CS and US responses were computed as the mean firing rate during each time window relative to the activity during the preceding period (i.e., for the CS response, the TC period; for the US response, the trace period). However, because DRN neurons often showed significant modulations in TC and trace period activities, we also measured the CS response relative to the ITI, and the US response relative to the ITI and TC activities. The results were essentially consistent regardless of the way that the CS and US responses were measured.
Each neuron was judged to be task related if its activity during at least one of the task periods (ITI, TC, CS, trace, and US) for at least one of the trial conditions (four trial types in the appetitive and aversive blocks) was significantly modulated. We used the Mann–Whitney U test to compare activity between two conditions or time windows, and the Kruskal–Wallis test to compare activity among multiple conditions or time windows. After the Kruskal–Wallis test, if a significant difference was found (p < 0.05), we used the Mann–Whitney U test with Bonferroni correction. We defined activity during a particular task epoch as significantly modulated if (1) it was different from the preceding period or from other trial types (p < 0.01), (2) the outcome probability/predictability produced a significant effect (p < 0.05), or (3) a block-dependent effect was observed (ITI and TC, p < 0.05).
For the task-related neurons, we also examined the graded modulation of CS, trace, and US responses depending on the outcome probability. We first performed Spearman's rank correlation test (p < 0.05). Then, we performed a one-way repeated-measures ANOVA followed by multiple comparisons using Bonferroni correction.
We further computed the “baseline” activity of each neuron, which was defined as the mean firing rate during the 500 ms before TC onset on the first trial of the appetitive and aversive blocks (i.e., before the block was initiated). Because there was no cue for when and which block would be started next, the baseline while the task was stopped between the blocks was regarded as activity “outside of task” in a neutral context. To confirm this hypothesis, we compared the baseline activity before each block using the Wilcoxon signed-rank test. If neurons exhibited temporary bursting activity 500 ms before the initiation of the first trial of the block, we analyzed the mean firing rate during the 1000–500 ms before TC onset to compute the baseline activity.
To generate population histograms, neuronal activity was normalized by computing receiver operating characteristic (ROC) values based on the comparison of the firing rates between the baseline activity and a test window (100 ms duration). The analysis was repeated by moving the test window in 20 ms steps. The dataset for the baseline activity was collected as the mean firing rates of 100 ms windows shifted in 20 ms steps during the 500 ms before the initiation of both appetitive and aversive blocks (i.e., 20 windows for each block). Because the results indicated consistency between the subjects (χ2 test, p > 0.05), the data were collapsed across the two monkeys for further analyses.
Analyses of the electrophysiological characteristics of neurons.
We characterized the physiological properties of the sampled neurons, including spike waveform, baseline firing rate, and irregularity of firing pattern. Consistent with previous studies, the typical spike shapes in the current experiment were characterized as follows: first, sharp and small negative; second, sharp and large positive; third, long-duration negative, and fourth, long-duration positive waves (Kocsis et al., 2006; Nakamura et al., 2008). Due to the limited duration of data collection of the spike waveform (up to 2 ms), we measured spike duration from the first sharp negative to the peak of the third, long-duration negative deflection. Nevertheless, among 214 of 246 neurons in which a spike shape was obtained, 114 showed spike durations longer than 2 ms; therefore, we computed the ratio of neurons with spike durations longer than 2 ms. To quantify the irregularity of spike trains, we used an irregularity metric introduced by Davies et al. (2006), which was called “IR” and computed as log (ISIi/ISIi+1): the difference between two adjacent interspike intervals (ISIs). A small IR indicates regular firing, and a large IR indicates irregular firing. We then computed the median of all IR values during the whole task period for all trial types.
Results
Behavioral data
As a prelude to determining whether serotonergic neurons are involved in processing appetitive and aversive information, we first showed that the monkeys could discriminate different CSs by comparing behavioral measures: licking a spout in anticipation of a reward and blinking in anticipation of an airpuff during the late trace period, where they were detected most often (Fig. 1C). We also observed that frequent blinking occurred during the entire task periods, including the ITI and TC in the aversive blocks, but not in the appetitive blocks.
The licking frequencies of both monkeys were different among reward CSs in the appetitive block (ANOVA, F(2,288) = 47.6, p < 0.0001, for monkey H; F(2,34) = 49.0, p < 0.0001 for monkey S); the blinking frequencies were different among airpuff CSs in the aversive block (ANOVA, F(2,288) = 19.2, p < 0.0001, for monkey H; F(2,22) = 9.5, p = 0.001 for monkey S). As shown in Figure 1D, both monkeys licked more often when the reward was more predicted and blinked more often when the airpuff was more predicted. Multiple comparisons revealed significant graded differences in the licking frequencies, depending on reward probability for monkey H (p < 0.0001 for 0% vs 50%; p = 0.007 for 50% vs 100%), but not for monkey S; and the blinking frequencies among airpuff CSs for monkey S (p = 0.01 for 0% vs 50%, p = 0.0049 for 50% vs 100%), but not for monkey H, suggesting individual differences and block dependency in discriminating between the CSs based on outcome probabilities. Compared with the 50% reward CS, there was a significant increase in blinking following the 0% reward CS in both monkeys (p < 0.01 after Bonferroni correction), presumably because of stimulus generalization; blinks were slightly evoked even by the 0% airpuff CS in the aversive block, and the same stimulus was also used as the 0% reward CS in the appetitive block.
Neuronal data
We recorded single-unit activity from a total of 300 neurons (160 from monkey H and 140 from monkey S) in the DRN of both hemispheres of the two monkeys. Among them, we performed analyses for a total of 246 neurons (145 from monkey H and 101 from monkey S) that met the criteria of task-related modulation. To obtain a more detailed picture of neuronal responses so as to determine whether and how single DRN neurons respond to both appetitive and aversive stimuli, we conducted extensive analyses of different aspects of their responses during the Pavlovian conditioning task.
Tonic activity during ITI and TC periods
We first asked whether DRN neurons encoded differential emotional contexts. In the present task, the appetitive and aversive trials were run as a block. Therefore, although the visual stimulus, a white circle, was identical during the TC period in both types of blocks, expected outcomes were different, which may result in differential TC activity depending on the kind of block. We indeed found that >50% of DRN neurons showed discrete activity during the TC period in the appetitive and aversive blocks (Mann–Whitney U test, p < 0.05; Table 1; Fig. 3A).
Table 1.
TC | ITI | Number of cells |
---|---|---|
ap > av (n = 109) | ap > av | 72 |
av > ap | 5 | |
n.s. | 32 | |
av > ap (n = 40) | ap > av | 1 |
av > ap | 22 | |
n.s. | 17 | |
n.s. (n = 97) | ap > av | 19 |
av > ap | 7 | |
n.s. | 71 |
Data are the numbers of neurons that showed a significant difference (p < 0.05) in TC and ITI activities depending on the block. ap, Appetitive block; av, aversive block; n.s., no significant difference.
For example, the neuron shown in Figure 2A exhibited a phasic excitation after TC onset, followed by a tonic elevation of activity in the appetitive block. In the subsequent aversive block, the activity of the same neuron remained low (Fig. 2A, bottom row). Thus, TC activity was significantly higher in the appetitive block than in the aversive block (Mann–Whitney U test, p = 0.1 × 10−8). There were also neurons with opposite responses. The example neuron in Figure 2B exhibited significantly higher tonic activity during the TC period in the aversive block than in the appetitive block (p = 0.59 × 10−10). Such TC activity of these two neurons was reproduced when the blocks were repeated (data not shown).
The differences in TC activity between the blocks may reflect differences in the appetitive and aversive contexts. If that were the case, the activity during the ITI may also be modulated by contexts. Indeed, this block-dependent modulation was observed in ITI activity. Among 149 neurons that showed a significant difference in block-dependent TC activity, 94 showed a consistent and significant difference in ITI activity (Mann–Whitney U test, p < 0.05; Table 1). Moreover, there was a positive correlation in block discrimination between ITI and TC activities (Spearman's rank correlation, ρ = 0.73, p = 0.12 × 10−42; Fig. 3B), indicating that if a neuron showed stronger ITI activity in the appetitive block than in the aversive block, it also showed stronger TC activity in the appetitive block than in the aversive block, and vice versa.
While the block-dependent differences in TC activity suggest that DRN neurons discriminate emotional contexts, the results might also be explained by behavioral distinctions between the blocks because the monkeys' blinking and licking behaviors were discrepant depending on the block during the ITI and TC periods (Fig. 1C), and motor aspects of the raphe nuclei have been reported repeatedly (Jacobs and Fornal, 1993; Ranade and Mainen, 2009; Li et al., 2013).
To examine this issue, we first analyzed, for each neuron, whether there was a significant correlation between licking frequency and neuronal activity during the TC period for each block. If licking behavior were the cause of differences in TC activity, neurons with appetitive (aversive) block preference in TC activity should show a positive (negative) correlation between TC activity and licking frequency. Statistical analyses revealed that a small number of neurons had a significant correlation between licking frequency and TC activity (Spearman's rank correlation, p < 0.05). Among 54 appetitive block-preferring neurons recorded when a sufficient licking signal was measured, 7 showed a positive correlation and 1 showed a negative correlation between licking frequency and TC activity in the appetitive block. Among 27 aversive block-preferring neurons, 3 showed a negative correlation between licking frequency and TC activity. However, for most other neurons, including the one in Figure 2A, TC activity was not significantly correlated with licking frequency in the appetitive block (ρ = −0.009, p = 0.95), indicating that licking had only a minimal effect on the block-dependent modulation in TC activity.
We next analyzed whether different eye positions or blinking behaviors, as shown in Figure 1C, were the cause of the block-dependent differences by comparing TC activity between the appetitive and aversive blocks, but now separately in trials in which the monkeys' eye positions were within or outside (including blinking) the central 16° window. All 120 neurons, with a sufficient number of trials in which blinking occurred, exhibited consistent differences in TC activity between the blocks regardless of the position of the eyes (Mann–Whitney U test, p < 0.05), suggesting that eye position and blinking were not the cause of differential TC activity.
The block-dependent differences in TC and ITI activities might also be due to the residual effects of outcomes in the previous trials, as many DRN neurons respond tonically to outcomes (Nakamura et al., 2008). To control for this possibility, we broke down the data into two groups depending on the preceding outcomes: rewards and tones in the appetitive block; and airpuffs and tones in the aversive block. As shown in Figure 3C–E, the preceding outcomes had only a modest effect on TC and ITI activities. At the single-neuron level, some neurons showed significant differences in their activities depending on outcomes in the previous trials (Mann–Whitney U test, p < 0.05; Fig. 3F–H). The aversive-preferring neurons were likely to fire more strongly after relatively negative outcomes (i.e., tones in the appetitive block and airpuffs in the aversive block) than after positive ones (i.e., rewards) during the ITI and TC periods (Fig. 3F), while the appetitive-preferring neurons were likely to fire more strongly after positive outcomes than after negative ones in the appetitive block (Fig. 3H). Nevertheless, the analysis revealed that the differences in ITI and TC activities were predominantly consistent regardless of the preceding outcome in each preference type (Fig. 3C–E). Together, these results suggest that DRN neurons exclusively encoded appetitive and aversive contexts during the ITI and TC periods, rather than behavioral differences or information of the preceding outcome.
Changes in ITI and TC activities relative to baseline
We have revealed that DRN neurons discriminated the appetitive and aversive blocks even during the ITI. This predicts that ITI activity should change, on a trial-by-trial basis, when compared with the baseline measured outside of the task. In our task, there was no clue as to what the upcoming block was before each block was initiated. Thus, the monkeys should not discriminate between the appetitive and aversive blocks during the “baseline epoch” (500 ms before the first trial started). In support of this idea, a comparison of the baseline among the blocks revealed no significant differences across neurons, suggesting that the baseline activity was not yet modulated depending on the blocks (Wilcoxon signed-rank test, p = 0.1). By comparing the activity 1000–500 ms before TC onset and 500 ms before TC onset on the first trial, we also confirmed that monkeys did not predict the timing of the initiation of a block (Wilcoxon signed-rank test: p = 0.8 before the appetitive, p = 0.2 before the aversive block started).
To examine how the activity was modulated during the ITI, we analyzed trial-by-trial changes in ITI activity relative to the baseline (Fig. 4). As the trials progressed in the aversive block, neurons that preferred the aversive block showed a gradual increase in ITI activity that remained positive relative to the baseline. In the appetitive block, the same neurons initially exhibited increased activity, which soon decreased to around the baseline level (Fig. 4A). Conversely, DRN neurons that preferred the appetitive block showed a gradual decrease in ITI activity in the aversive block, while the same neurons exhibited increased activity in the appetitive block, followed by a decrease in activity during the later trials of the blocks (Fig. 4B). An example neuron with appetitive block preference is shown in Figure 4C. During the earlier trials, ITI and TC activities gradually increased in the appetitive block and decreased in the aversive block. In summary, many DRN neurons discriminated the appetitive and aversive blocks by tonic modulation of the activity during the ITI and TC periods. Such context-dependent tonic modulation in activity occurred, on trial-by-trial bases, after the particular block started.
Responses to CSs
DRN neurons responded to the CSs. Among 246 task-related neurons, 114 (46.3%) showed significant responses to the CSs in the appetitive block only, 35 (14.2%) to the CSs in both the appetitive and aversive blocks, and 10 (4.1%) to the CSs in the aversive block only (Mann–Whitney U test, p < 0.05 after Bonferroni correction). Thus, a significant response to the CSs was primarily observed in the appetitive block rather than in the aversive block.
The CS responses of many neurons were modulated by the outcome probability. For example, relative to its activity during the TC period, the neuron in Figure 2A exhibited a significant excitatory response to the 100% CS (Mann–Whitney U test, p = 0.1 × 10−4) and an inhibitory response to the 0% CS (p = 0.9 × 10−3), but not to the 50% CS. Thus, there was a significant positive correlation between reward probability and CS responses (Spearman's rank correlation, ρ = 0.84, p = 0.1 × 1012). The neuron in Figure 2B also responded to the reward CSs, but in a mirror image manner; when compared with its response during the TC period, it showed inhibition in the response to the 100% reward CS (p = 0.0002) and strong excitation to the 0% reward CS (p = 0.1 × 10−5), resulting in a significant negative correlation between reward probability and CS response (ρ = −0.87, p = 0.5 × 10−17). Neither neuron, however, exhibited significant responses to any of the CSs associated with an airpuff. A small number of neurons, like the one in Figure 2C, showed significant modulation in the CS response, depending on airpuff probability (ρ = −0.47, p = 0.001).
In the appetitive block, 60 neurons (24.4%) and 55 neurons (22.4%) showed significant positive (100% > 50% > 0% CSs; Fig. 5A) and negative (100% < 50% < 0% CSs; Fig. 5C) correlations between reward probability and CS response, respectively (Spearman's rank correlation, p < 0.05). Compared with TC activity, the CS responses of these neurons were strongly inhibited by the 0% reward CS and excited by the 50% and 100% reward CSs (Fig. 5B), and vice versa (Fig. 5D). As a population, both types of neurons exhibited significantly different responses to reward CSs (ANOVA: F(2,118) = 41.4, p < 0.0001 for positive correlation; F(2,108) = 42.2, p < 0.0001 for negative correlation). Multiple comparisons revealed significant differences in the CS responses, depending on reward probability for neurons with positive correlations (p < 0.0001 for 0% vs 50%; p = 0.0023 for 50% vs 100%; Fig. 5B) and negative correlations (p < 0.0001 for 0% vs 50%; p = 0.01 for 50% vs 100%; Fig. 5D).
In the aversive block, 12 neurons (4.9%) and 12 neurons (4.9%) showed significant positive correlations (100% > 50% > 0% CSs; Fig. 5E) and negative correlations (100% < 50% < 0% CSs; Fig. 5G) between airpuff probability and CS response, respectively (Spearman's rank correlation, p < 0.05). As a population, these neurons exhibited a significant difference in the CS responses relative to TC activity (ANOVA: F(2,22) = 33.2, p < 0.0001 for positive correlation; F(2,22) = 23.9, p < 0.0001 for negative correlation). Multiple comparisons revealed significantly different responses to the aversive CSs depending on airpuff probability for neurons with positive correlations (p = 0.0002 for 0% vs 50%; p = 0.0017 for 50% vs 100%; Fig. 5F) and negative correlations (p = 0.0046 for 0% vs 50%; p = 0.0011 for 50% vs 100%; Fig. 5H). These results suggest that DRN neurons are involved in graded coding of values associated with reward or airpuff CSs. However, such value coding was more prominent in the appetitive than in the aversive blocks.
We further asked whether single DRN neurons similarly encoded the values of both appetitive and aversive CSs. We found this to be only partially true. First, there was a significant positive correlation between the response to the 0% reward (“worst”) CS in the appetitive block and the response to the 100% airpuff (“worst”) CS in the aversive block (Spearman's rank correlation, ρ = 0.3, p = 0.7 × 10−6). However, the correlation between the response to the 100% reward (“best”) CS in the appetitive block and the response to the 0% airpuff (“best”) CS in the aversive block was not significant. Finally, whereas the CS responses of many DRN neurons were modulated by reward probability in the appetitive block, the CS responses of the same neurons did not encode airpuff probability on a similar value scale in the aversive block. In the joint distribution of correlation coefficients for the appetitive and aversive blocks (Fig. 5I), the top-right and bottom-left quadrants exhibit a straightforward correlation between CS value coding in the appetitive and aversive blocks. For example, if reward and airpuff CSs are evaluated on a common value scale in the DRN, then a neuron with a stronger response to the 100% reward (“better”) CS than to the 0% reward (“worse”) CS in the appetitive block (positive correlation coefficient between reward probability and neuronal firing rate; Fig. 5I, right half) should respond more strongly to the 0% airpuff (“better”) CS than to the 100% airpuff (“worse”) CS in the aversive block (Fig. 5I, top half). Instead, there was a small, but significant, correlation for neurons that had stronger responses to the 100% reward CS that also showed stronger responses to the 100% airpuff CS; similarly, neurons that had stronger responses to the 0% reward CS also showed stronger responses to the 0% airpuff CS (Spearman's rank correlation, ρ = 0.15, p = 0.02).
The TC and CS responses were often jointly modulated (Table 2). Neurons with appetitive block preference in TC activity tended to exhibit stronger responses to the 100% reward CS than to the 50% and 0% reward CSs. Conversely, neurons with aversive block preference during the TC period tended to exhibit stronger CS responses to the 0% reward CS than to the 50% and 100% reward CSs.
Table 2.
TC type | Appetitive CS |
Appetitive US |
Aversive CS |
Aversive US |
Reward omission |
|||||
---|---|---|---|---|---|---|---|---|---|---|
100% > 0% | 0% > 100% | Free >100% | 100% > Free | 100% > 0% | 0% > 100% | Free >100% | 100% > Free | Increase | Decrease | |
ap > av | 39 | 15 | 39 | 16 | 7 | 3 | 10 | 7 | 9 | 27 |
(n = 109) | (35.8) | (13.8) | (35.8) | (14.7) | (6.4) | (2.8) | (9.1) | (6.4) | (8.3) | (24.8) |
av > ap | 2 | 14 | 9 | 3 | 3 | 1 | 2 | 3 | 8 | 1 |
(n = 40) | (5.0) | (35.0) | (22.5) | (7.5) | (7.5) | (2.5) | (5.0) | (7.5) | (20.0) | (2.5) |
n.s. | 19 | 26 | 19 | 9 | 2 | 8 | 10 | 6 | 14 | 6 |
(n = 97) | (19.6) | (26.8) | (19.6) | (9.3) | (2.1) | (8.2) | (10.3) | (6.2) | (14.4) | (6.2) |
Total | 60 | 55 | 67 | 28 | 12 | 12 | 22 | 16 | 31 | 34 |
Number (%) of neurons that showed a significant modulation in the CS response depending on the reward/airpuff probability, in the US response depending on the reward/airpuff predictability, and in the response to reward omission. The data are shown for separate groups of neurons classified by preferred activity during the TC period. ap, Appetitive block; av, aversive block; n.s., no significant difference.
This was confirmed statistically at the single-neuron level (Fig. 6). We used ROC analysis to measure the change in firing rate of each neuron during the TC period. The ROC area was 0 if the neuron was strongly inhibited, 0.5 if it maintained its ITI firing rate, and 1 if it was strongly excited. We also examined the correlation coefficients between US probability and CS response. This coefficient was positive if the CS response was stronger when US probability was higher; it was negative if the CS response was lower when US probability was higher. We found that the activity of a neuron during the TC period was positively correlated with its degree of CS discrimination in the appetitive block (Spearman's rank correlation, p < 0.01; Fig. 6A). In the aversive context, however, changes in TC activity relative to the ITI as well as changes in the CS responses were both small and did not show any significant correlation with one another (Fig. 6B).
To summarize, CS value coding by DRN neurons was much stronger in the appetitive than in the aversive blocks. The strength of CS value coding was positively correlated with changes in activity during the TC period in the appetitive blocks, but not in the aversive blocks.
Responses to USs
Many neurons also responded to the USs (Fig. 7A). Unlike their responses to the CSs, the US responses were observed in both blocks: 74 neurons (30.1%), 43 neurons (17.5%), and 88 neurons (35.8%), respectively, showed significant responses to rewards only, airpuffs only, and both rewards and airpuffs. We first asked whether the responses of DRN neurons to the USs encoded their value on a single dimension. If that were the case, the responses to rewards and airpuffs should be in the opposite direction (e.g., excitatory for rewards and inhibitory for airpuffs, compared with the trace period). Instead, many DRN neurons responded to rewards and airpuffs in the same direction, and responses to free rewards and free airpuffs were positively correlated (Spearman's rank correlation, ρ = 0.27, p = 0.26 × 10−6; Fig. 7B).
The US responses were strongly modulated by the preceding CSs. The example neuron in Figure 2A showed the strongest phasic responses to free, unpredicted rewards (Mann–Whitney U test, p = 0.0003), and a significant correlation between the US response and outcome predictability indicated by the CS (ρ = −0.40, p = 0.02). We also computed the correlation coefficient between them for each neuron, separately for the appetitive and aversive blocks (Fig. 7C). Changes in the US responses depending on the preceding CSs were commonly observed in the appetitive block: 67 neurons (27.2%) responded more strongly to an unpredicted reward than to a predicted reward, whereas 28 neurons (11.4%) responded more strongly to a predicted reward than to an unpredicted reward (Spearman's rank correlation, p < 0.05). As a population, the correlation coefficient between the response to a reward and its predictability was significantly negative (sign test, median correlation coefficient = −0.13, p = 0.34 × 10−6), indicating a stronger response to an unpredicted reward than to a predicted reward. Such CS-dependent modulation in the US response was less common in the aversive block: 22 neurons (8.9%) responded more strongly to an unpredicted airpuff than to a predicted airpuff, whereas 18 neurons (7.3%) responded more strongly to a predicted airpuff than to an unpredicted airpuff. As a population, there was no significant bias to either a positive or negative correlation coefficient (sign test, median correlation coefficient = −0.04, p = 0.07).
We also found that the predictability-dependent modulations in the responses to rewards were positively correlated with those to airpuffs (Spearman's rank correlation, ρ = 0.22, p = 0.0004). In other words, neurons that exhibited a stronger response to an unpredicted reward than to a predicted reward showed a stronger response to an unpredicted airpuff than to a predicted airpuff (Fig. 7C).
Responses to reward omission
Although a tone was a neutral outcome, it was relatively negative in the appetitive block. On the other hand, the same tone was relatively positive in the aversive block. Hence, we examined in more detail how DRN neurons responded to the tone. As shown in Figure 7A, only a small number of neurons responded to tones in the 0% reward trials (i.e., 100% predictability; n = 15, 6.1%) and free tone trials (i.e., 0% predictability; n = 11, 4.5%) in the appetitive block. However, many neurons (n = 66, 26.8%) exhibited significant excitatory or inhibitory modulation in the response to a tone in the 50% reward trials, when a reward was partly expected following the 50% reward CS, but was not given, and a tone was presented.
To characterize the neurons that responded to reward omission, we analyzed 65 neurons whose response to a tone in the 50% reward trials was (1) significantly modulated relative to the preceding trace period and (2) significantly different from the response to a free tone in the appetitive block (Mann–Whitney U test, p < 0.05). Among these, 31 showed increases (Fig. 8A) and 34 showed decreases (Fig. 8B) in their responses to reward omission or a tone in the 50% reward trials. The modulation of activity in these neurons had several characteristics. First, they did not demonstrate a significant response to the same tone when it was presented as expected (0% reward) or unexpectedly (free tone), or to any tones in the aversive block. Second, these neurons also showed modulation in TC activity and CS responses, with consistent value coding in the response to the reward omission. The neurons that showed excitation to reward omission were tonically inhibited during the TC period and exhibited negative reward coding in their CS responses (i.e., stronger response to the 0% reward CS than to the 100% reward CS; Fig. 8A). Conversely, neurons like the one in Figure 2A that showed inhibition to the tone in the 50% reward trials exhibited enhanced activity during the TC period and positive coding in their CS responses in the appetitive block (i.e., stronger response to the 100% reward CS than to the 0% reward CS; Fig. 8B).
Appetitive and aversive information coding by different groups of DRN neurons
Table 2 summarizes the different modulations in the CS and US responses depending on the outcome probability. Many neurons with appetitive block preference in TC activity responded more strongly to the 100% reward CS than to the 0% reward CS and tended to be inhibited by reward omission. Conversely, many neurons with aversive block preference in TC activity responded more strongly to the 0% reward CS than the 100% reward CS and tended to be excited by reward omission. These mirror image responses were not observed for the US responses. Such distinct activity profiles by different groups of neurons might be associated with differences in electrophysiological characteristics, including baseline activity, spike width, and irregularity of firing pattern; however, there were no significant differences among the groups (Table 3).
Table 3.
TC type | Baseline | Spike width ≥2 ms | Irregularity metric |
---|---|---|---|
ap > av | 7.0 (12.0) Hz | 48/92 (52.2%) | 0.65 (0.37) |
av > ap | 8.0 (10.5) Hz | 19/34 (55.9%) | 0.82 (0.29) |
n.s. | 7.0 (9.0) Hz | 47/88 (53.4%) | 0.72 (0.43) |
The physiological properties of the recorded neurons characterized by their baseline firing rate, spike duration, and irregularity of spike trains. For the baseline firing rate and irregularity metric, the median and interquartile range (in parentheses) are shown. For spike duration, only neurons whose spike shape was recorded were analyzed; the number (%) of neurons with duration longer than 2 ms is shown. ap, Appetitive block; av, aversive block; n.s., no significant difference.
Discussion
Using a Pavlovian conditioning paradigm, we obtained evidence that single DRN neurons are involved in computing both appetitive and aversive information. Specifically, DRN neurons discriminated emotional contexts by tonically modulating their activity. In the appetitive block, they further kept track of reward values associated with CSs and USs. In the aversive block, such probability-dependent responses were observed less often. Rather, many neurons continued to encode the aversive context in tonic activity. Thus, the activity of DRN neurons is characterized by dual processes: a tonic and categorical one to discriminate emotional contexts; and a relatively phasic, quantitative one to encode rewarding events.
These dual processes that are dependent on contexts stand in strong contrast to the activity of midbrain dopamine neurons. The dopamine neurons typically show phasic responses to the relative value of appetitive and aversive stimuli; however, sustained modulation such as block-dependent ITI and TC activities was much less common (Schultz, 1998; Matsumoto and Hikosaka, 2009b). Therefore, while midbrain dopaminergic neurons are suitable for updating appetitive and aversive values in stimulus–outcome associations, DRN neurons may be better suited for encoding the overall value of the context.
Context-dependent tonic modulation
Analyses of tonic activity during the ITI and TC periods revealed that more than half of DRN neurons discriminated the appetitive and aversive blocks. Context-dependent activity in the DRN was also demonstrated by comparing reward and non-reward contexts (Li et al., 2013). However, even though DRN neurons were reported to process both appetitive and aversive information, their activity in the aversive context was never compared directly to that in the appetitive context.
The modulations of ITI and TC activities followed different courses, depending on contexts (Fig. 3C,E). In the appetitive block, if a neuron showed significant context dependency in ITI activity, the activity changed further during the TC period and had a strong positive correlation with changes in the subsequent CS response, indicating its participation in encoding valence during the course of a trial. In the aversive block, the same neuron showed tonic modulation starting at the ITI that remained stable throughout a trial, suggesting that tonic activity discriminated context rather than single trials and events. One of the possible functions of a context signal in the DRN may be the control of sensitivity to aversive stimuli. This is supported by evidence that the lowering serotonin levels enhances punishment signals (Cools et al., 2008) or cognitive engagement with aversive stimuli (Dayan and Huys, 2008). Moreover, serotonergic modulation of amygdala function may contribute to such control, as serotonergic projections from the DRN inhibit aversive sensory information processing in the amygdala via immune activation (Lowry et al., 2007) or corticotropin-releasing factor (Lowry et al., 2000; Hammack et al., 2002).
A particularly relevant source of tonic activity in the DRN could be the ventromedial prefrontal cortex (vmPFC). In a similar task, neurons in subregions of the vmPFC showed comparable activities, including tonic modulation during an appetitive-preferring block (ventral vmPFC) or an aversive-preferring block (dorsal vmPFC; Monosov and Hikosaka, 2012). The mechanisms controlling the context-dependent neuronal activity may also involve projections from the PFC and habenula to the DRN, which have often been discussed in relation to stressor controllability. DRN neurons are activated by aversive events (Pezzone et al., 1993; Grahn et al., 1999; Takase et al., 2004; Schweimer and Ungless, 2010). However, once the PFC detects that a stressor is controllable, it inhibits the aversive stimulus-driven DRN activity (Amat et al., 2001). Conversely, if a stressor is uncontrollable, projections from the habenula cause increases in serotonin levels that lead to action inhibition.
A recent study found that optogenetic activation of the PFC–DRN pathway was causally involved in increased effortful movement in rats during forced swimming, a challenging and inescapable situation, whereas activation of the PFC–habenula pathway had the opposite effect (Warden et al., 2012). Although we compared appetitive to aversive Pavlovian responses rather than controllability using the current task, PFC–DRN and habenula–DRN interactions might also be important for the regulation of emotional behavior during conditioning in the aversive context.
Responsiveness to CSs and USs
The responses to aversive CSs were scarce even though many DRN neurons phasically responded to airpuffs. This finding was unexpected because the DRN was implicated in aversive information processing in previous studies. We showed that the CS responses were rarely modulated by airpuff probability in the aversive block. However, the monkeys behaviorally distinguished the aversive CSs. Thus, this distinction may be performed by other brain structures such as the lateral habenula. While monkeys were conditioned in a Pavlovian procedure, habenula neurons discriminated the aversive CSs probabilistically associated with airpuffs (Matsumoto and Hikosaka, 2009a). Additional candidate areas that might play a key role in evaluating aversive stimuli are the orbitofrontal cortex (Hosokawa et al., 2007; Morrison and Salzman, 2009) and the amygdala (Paton et al., 2006). On the other hand, the results are consistent with the finding that serotonin depletion did not affect fear conditioning to an explicit CS in rats (Wilkinson et al., 1995). In humans, lowering serotonin levels reduced punishment-induced inhibition, indicating that the serotonergic system is crucial for linking response inhibition with predictions of aversive outcomes rather than aversive information processing per se (Crockett et al., 2009). Together, our data suggest that DRN neurons are involved in the coding of aversive context, but not of detailed stimulus contingencies.
In contrast, the activity of the same DRN neurons was richly modulated in the appetitive context. First, many DRN neurons showed reward probability-dependent modulation in the CS responses. There was also a strong positive correlation between TC activity and CS value coding, which is consistent with our previous finding that DRN neurons keep track of expected rewards throughout the task (Bromberg-Martin et al., 2010). Second, DRN neurons showed reward probability-dependent modulation in the US responses. Specifically, many responded stronger when rewards were unpredicted than when predicted (Fig. 7C). Some also responded to the omission of expected rewards (Fig. 8). This is partly in agreement with the theoretical prediction of serotonergic function proposed by Daw et al. (2002), which has never been examined experimentally at the single-neuron level. They predicted that a phasic serotonergic signal might report a negative RPE, implying that actual punishments are stronger than predicted punishments, such as the omission of rewards. However, we found that DRN neurons that responded to the omission of rewards did not exhibit (1) excitatory responses to free tones in the appetitive block, (2) excitatory responses to aversive CSs and USs that were modulated by airpuff probability in the aversive block, or (3) inhibitory responses to rewards. Therefore, the responses to the omission of an expected reward observed in some DRN neurons cannot be fully explained by a negative RPE signal hypothesis in the traditional reinforcement–learning framework.
Furthermore, the present findings indicate that value coding in the responses to CSs and USs in the appetitive and aversive blocks was not simply scaled in the opposite direction. Neurons that responded more strongly to a “better” CS than to a “worse” one in the appetitive block (100% > 0% reward) did not respond more strongly to a “better” CS than to a “worse” one in the aversive block (0% > 100% airpuff). Instead, they tended to respond more strongly to the 100% airpuff CS than to the 0% airpuff CS. Likewise, neurons that responded more strongly to unpredicted rewards also responded more strongly to unpredicted airpuffs. Thus, the responses of DRN neurons may be influenced by multiple factors including valence, but also by salience of the stimuli.
Appetitive and aversive information processing in the DRN
We identified at least three distinct types of TC activity modulation: appetitive preferring, aversive preferring, and no preference. Neurons in each group also exhibited different modulations of their CS and US responses (Table 2), indicating functionally distinct populations of cells. The DRN is a heterogeneous structure consisting of neurons with different electrophysiological, neurochemical, and anatomical properties. Several neurotransmitters other than serotonin are also present in the DRN, including GABA, dopamine, glutamate, nitric oxide, substance P, and neuropeptide Y (Charara and Parent, 1998; Michelsen et al., 2007). Different populations of DRN neurons have afferent and efferent connections with distinct brain areas (for review, see Nakamura, 2013). Thus, functional differences may be attributed to neurochemical and/or anatomical characteristics of neuron populations in the DRN. In the present study, we did not determine the chemical identities of recorded neurons. Nevertheless, the activity we measured should still provide important insights into the roles of serotonin in emotional information processing for the following reasons. First, a substantial proportion of DRN neurons are serotonergic: 30% in rats (Descarries et al., 1982); 70% of medium-sized DRN neurons in cats (Wiklund et al., 1981); and 70% in humans (Baker et al., 1991). Second, extensive interconnectivity exists among neurons with different chemical characteristics in the DRN (Weissbourd et al., 2014).
Although previous results concerning the involvement of the serotonergic system in appetitive and/or aversive processing are inconsistent, we found that single DRN neurons encode both appetitive and aversive information using different time scales. This may be due partly to methodological differences. Commonly used pharmacological manipulations generate relatively global, continuous modulation of serotonin levels, which affect the sensitivity to aversive stimuli. Conversely, electrophysiological measures, including single-unit recording or local stimulation, allow us to analyze neural data on a finer time scale. Manipulations focusing on neurotransmitters and anatomical connections, together with analyses of correlations between neuronal activity and behavioral changes, as demonstrated in this study, will reveal more detailed, circuit-level mechanisms of appetitive and aversive information processing by the serotonergic system.
In conclusion, DRN neurons encode emotional contexts and events reflecting different time scales of prediction depending on their valence, as follows: tonic discrimination of the appetitive and aversive blocks; and phasic representation of reward value throughout the trials. Our findings may provide the neural basis of positive and negative emotional information processing required for value-based decision making in differential contexts.
Footnotes
This work was supported by the Human Frontier Science Program; Precursory Research for Embryonic Science and Technology; the Takeda Foundation; the Nakayama Foundation; Grant-in-Aid for Scientific Research B 20300139; Grant-in-Aid for Scientific Research on Priority Areas 20020028; the Strategic Research Program for Brain Sciences by the Ministry of Education, Culture, Sports, Science and Technology of Japan (K.N.); and Grant-in-Aid for Young Scientists B 24730632 (K.H.). We thank Drs. B. Seymour, I.E. Monosov, M. Matsumoto, E. Bromberg-Martin, and O. Hikosaka for helpful comments; Y. Tokimoto for obtaining magnetic resonance images; A. Noritake and R. Matsuzaki for helping with data analysis; Y. Ueda for histological reconstruction; Y. Kobayashi for help with the experimental setup; and M. Habiro, K. Shiomi, and H. Kuland for technical assistance.
The authors declare no competing financial interests.
References
- Allers KA, Sharp T. Neurochemical and anatomical identification of fast- and slow-firing neurones in the rat dorsal raphe nucleus using juxtacellular labelling methods in vivo. Neuroscience. 2003;122:193–204. doi: 10.1016/S0306-4522(03)00518-9. [DOI] [PubMed] [Google Scholar]
- Amat J, Sparks PD, Matus-Amat P, Griggs J, Watkins LR, Maier SF. The role of the habenular complex in the elevation of dorsal raphé nucleus serotonin and the changes in the behavioral responses produced by uncontrollable stress. Brain Res. 2001;917:118–126. doi: 10.1016/S0006-8993(01)02934-1. [DOI] [PubMed] [Google Scholar]
- Baker KG, Halliday GM, Hornung JP, Geffen LB, Cotton RG, Törk I. Distribution, morphology and number of monoamine-synthesizing and substance P-containing neurons in the human dorsal raphe nucleus. Neuroscience. 1991;42:757–775. doi: 10.1016/0306-4522(91)90043-N. [DOI] [PubMed] [Google Scholar]
- Bromberg-Martin ES, Hikosaka O, Nakamura K. Coding of task reward value in the dorsal raphé nucleus. J Neurosci. 2010;30:6262–6272. doi: 10.1523/JNEUROSCI.0015-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Charara A, Parent A. Chemoarchitecture of the primate dorsal raphe nucleus. J Chem Neuroanat. 1998;15:111–127. doi: 10.1016/S0891-0618(98)00036-2. [DOI] [PubMed] [Google Scholar]
- Cools R, Roberts AC, Robbins TW. Serotoninergic regulation of emotional and behavioural control processes. Trends Cogn Sci. 2008;12:31–40. doi: 10.1016/j.tics.2007.10.011. [DOI] [PubMed] [Google Scholar]
- Crockett MJ, Clark L, Robbins TW. Reconciling the role of serotonin in behavioral inhibition and aversion: acute tryptophan depletion abolishes punishment-induced inhibition in humans. J Neurosci. 2009;29:11993–11999. doi: 10.1523/JNEUROSCI.2513-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Davies RM, Gerstein GL, Baker SN. Measurement of time-dependent changes in the irregularity of neural spiking. J Neurophysiol. 2006;96:906–918. doi: 10.1152/jn.01030.2005. [DOI] [PubMed] [Google Scholar]
- Daw ND, Kakade S, Dayan P. Opponent interactions between serotonin and dopamine. Neural Netw. 2002;15:603–616. doi: 10.1016/S0893-6080(02)00052-7. [DOI] [PubMed] [Google Scholar]
- Dayan P, Huys QJ. Serotonin, inhibition, and negative mood. PLoS Comput Biol. 2008;4:e4. doi: 10.1371/journal.pcbi.0040004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Descarries L, Watkins KC, Garcia S, Beaudet A. The serotonin neurons in nucleus raphe dorsalis of adult rat: a light and electron microscope radioautographic study. J Comp Neurol. 1982;207:239–254. doi: 10.1002/cne.902070305. [DOI] [PubMed] [Google Scholar]
- Fletcher PJ, Korth KM, Chambers JW. Selective destruction of brain serotonin neurons by 5,7-dihydroxytryptamine increases responding for a conditioned reward. Psychopharmacology (Berl) 1999;147:291–299. doi: 10.1007/s002130051170. [DOI] [PubMed] [Google Scholar]
- Grahn RE, Will MJ, Hammack SE, Maswood S, McQueen MB, Watkins LR, Maier SF. Activation of serotonin-immunoreactive cells in the dorsal raphé nucleus in rats exposed to an uncontrollable stressor. Brain Res. 1999;826:35–43. doi: 10.1016/S0006-8993(99)01208-1. [DOI] [PubMed] [Google Scholar]
- Hammack SE, Richey KJ, Schmid MJ, LoPresti ML, Watkins LR, Maier SF. The role of corticotropin-releasing hormone in the dorsal raphé nucleus in mediating the behavioral consequences of uncontrollable stress. J Neurosci. 2002;22:1020–1026. doi: 10.1523/JNEUROSCI.22-03-01020.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hosokawa T, Kato K, Inoue M, Mikami A. Neurons in the macaque orbitofrontal cortex code relative preference of both rewarding and aversive outcomes. Neurosci Res. 2007;57:434–445. doi: 10.1016/j.neures.2006.12.003. [DOI] [PubMed] [Google Scholar]
- Inaba K, Mizuhiki T, Setogawa T, Toda K, Richmond BJ, Shidara M. Neurons in monkey dorsal raphé nucleus code beginning and progress of step-by-step schedule, reward expectation, and amount of reward outcome in the reward schedule task. J Neurosci. 2013;33:3477–3491. doi: 10.1523/JNEUROSCI.4388-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jacobs BL, Azmitia EC. Structure and function of the brain serotonin system. Physiol Rev. 1992;72:165–229. doi: 10.1152/physrev.1992.72.1.165. [DOI] [PubMed] [Google Scholar]
- Jacobs BL, Fornal CA. 5-HT and motor control: a hypothesis. Trends Neurosci. 1993;16:346–352. doi: 10.1016/0166-2236(93)90090-9. [DOI] [PubMed] [Google Scholar]
- Kirby LG, Allen AR, Lucki I. Regional differences in the effects of forced swimming on extracellular levels of 5-hydroxytryptamine and 5-hydroxyindoleacetic acid. Brain Res. 1995;682:189–196. doi: 10.1016/0006-8993(95)00349-U. [DOI] [PubMed] [Google Scholar]
- Kocsis B, Varga V, Dahan L, Sik A. Serotonergic neuron diversity: identification of raphé neurons with discharges time-locked to the hippocampal theta rhythm. Proc Natl Acad Sci U S A. 2006;103:1059–1064. doi: 10.1073/pnas.0508360103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Y, Dalphin N, Hyland BI. Association with reward negatively modulates short latency phasic conditioned responses of dorsal raphé nucleus neurons in freely moving rats. J Neurosci. 2013;33:5065–5078. doi: 10.1523/JNEUROSCI.5679-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu ZH, Ikemoto S. The midbrain raphé nuclei mediate primary reinforcement via GABA(A) receptors. Eur J Neurosci. 2007;25:735–743. doi: 10.1111/j.1460-9568.2007.05319.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Z, Zhou J, Li Y, Hu F, Lu Y, Ma M, Feng Q, Zhang JE, Wang D, Zeng J, Bao J, Kim JY, Chen ZF, El Mestikawy S, Luo M. Dorsal raphé neurons signal reward through 5-HT and glutamate. Neuron. 2014;81:1360–1374. doi: 10.1016/j.neuron.2014.02.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lowry CA, Rodda JE, Lightman SL, Ingram CD. Corticotropin-releasing factor increases in vitro firing rates of serotonergic neurons in the rat dorsal raphé nucleus: evidence for activation of a topographically organized mesolimbocortical serotonergic system. J Neurosci. 2000;20:7728–7736. doi: 10.1523/JNEUROSCI.20-20-07728.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lowry CA, Hollis JH, de Vries A, Pan B, Brunet LR, Hunt JR, Paton JF, van Kampen E, Knight DM, Evans AK, Rook GA, Lightman SL. Identification of an immune-responsive mesolimbocortical serotonergic system: potential role in regulation of emotional behavior. Neuroscience. 2007;146:756–772. doi: 10.1016/j.neuroscience.2007.01.067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matsumoto M, Hikosaka O. Representation of negative motivational value in the primate lateral habenula. Nat Neurosci. 2009a;12:77–84. doi: 10.1038/nn.2233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matsumoto M, Hikosaka O. Two types of dopamine neuron distinctly convey positive and negative motivational signals. Nature. 2009b;459:837–841. doi: 10.1038/nature08028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maunsell JH, Gibson JR. Visual response latencies in striate cortex of the macaque monkey. J Neurophysiol. 1992;68:1332–1344. doi: 10.1152/jn.1992.68.4.1332. [DOI] [PubMed] [Google Scholar]
- Michelsen KA, Schmitz C, Steinbusch HW. The dorsal raphé nucleus—From silver stainings to a role in depression. Brain Res Rev. 2007;55:329–342. doi: 10.1016/j.brainresrev.2007.01.002. [DOI] [PubMed] [Google Scholar]
- Miliaressis E. Serotonergic basis of reward in median raphé of the rat. Pharmacol Biochem Behav. 1977;7:177–180. doi: 10.1016/0091-3057(77)90204-0. [DOI] [PubMed] [Google Scholar]
- Miliaressis E, Bouchard A, Jacobowitz DM. Strong positive reward in median raphé: specific inhibition by para-chlorophenylalanine. Brain Res. 1975;98:194–201. doi: 10.1016/0006-8993(75)90521-1. [DOI] [PubMed] [Google Scholar]
- Mirenowicz J, Schultz W. Importance of unpredictability for reward responses in primate dopamine neurons. J Neurophysiol. 1994;72:1024–1027. doi: 10.1152/jn.1994.72.2.1024. [DOI] [PubMed] [Google Scholar]
- Miyazaki K, Miyazaki KW, Doya K. Activation of dorsal raphé serotonin neurons underlies waiting for delayed rewards. J Neurosci. 2011;31:469–479. doi: 10.1523/JNEUROSCI.3714-10.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Molliver ME. Serotonergic neuronal systems: what their anatomic organization tells us about function. J Clin Psychopharmacol. 1987;7:3S–23S. doi: 10.1097/00004714-198712001-00001. [DOI] [PubMed] [Google Scholar]
- Monosov IE, Hikosaka O. Regionally distinct processing of rewards and punishments by the primate ventromedial prefrontal cortex. J Neurosci. 2012;32:10318–10330. doi: 10.1523/JNEUROSCI.1801-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Montague PR, Dayan P, Sejnowski TJ. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J Neurosci. 1996;16:1936–1947. doi: 10.1523/JNEUROSCI.16-05-01936.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morrison SE, Salzman CD. The convergence of information about rewarding and aversive stimuli in single neurons. J Neurosci. 2009;29:11471–11483. doi: 10.1523/JNEUROSCI.1815-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nakamura K. The role of the dorsal raphé nucleus in reward-seeking behavior. Front Integr Neurosci. 2013;7:60. doi: 10.3389/fnint.2013.00060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nakamura K, Matsumoto M, Hikosaka O. Reward-dependent modulation of neuronal activity in the primate dorsal raphé nucleus. J Neurosci. 2008;28:5331–5343. doi: 10.1523/JNEUROSCI.0021-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paton JJ, Belova MA, Morrison SE, Salzman CD. The primate amygdala represents the positive and negative value of visual stimuli during learning. Nature. 2006;439:865–870. doi: 10.1038/nature04490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pezzone MA, Lee WS, Hoffman GE, Pezzone KM, Rabin BS. Activation of brainstem catecholaminergic neurons by conditioned and unconditioned aversive stimuli as revealed by c-Fos immunoreactivity. Brain Res. 1993;608:310–318. doi: 10.1016/0006-8993(93)91472-5. [DOI] [PubMed] [Google Scholar]
- Ranade SP, Mainen ZF. Transient firing of dorsal raphé neurons encodes diverse and specific sensory, motor, and reward events. J Neurophysiol. 2009;102:3026–3037. doi: 10.1152/jn.00507.2009. [DOI] [PubMed] [Google Scholar]
- Schultz W. Predictive reward signal of dopamine neurons. J Neurophysiol. 1998;80:1–27. doi: 10.1152/jn.1998.80.1.1. [DOI] [PubMed] [Google Scholar]
- Schweimer JV, Ungless MA. Phasic responses in dorsal raphé serotonin neurons to noxious stimuli. Neuroscience. 2010;171:1209–1215. doi: 10.1016/j.neuroscience.2010.09.058. [DOI] [PubMed] [Google Scholar]
- Takase LF, Nogueira MI, Baratta M, Bland ST, Watkins LR, Maier SF, Fornal CA, Jacobs BL. Inescapable shock activates serotonergic neurons in all raphé nuclei of rat. Behav Brain Res. 2004;153:233–239. doi: 10.1016/j.bbr.2003.12.020. [DOI] [PubMed] [Google Scholar]
- Warden MR, Selimbeyoglu A, Mirzabekov JJ, Lo M, Thompson KR, Kim SY, Adhikari A, Tye KM, Frank LM, Deisseroth K. A prefrontal cortex-brainstem neuronal projection that controls response to behavioural challenge. Nature. 2012;492:428–432. doi: 10.1038/nature11617. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weissbourd B, Ren J, DeLoach KE, Guenthner CJ, Miyamichi K, Luo L. Presynaptic partners of dorsal raphe serotonergic and GABAergic neurons. Neuron. 2014;83:645–662. doi: 10.1016/j.neuron.2014.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiklund L, Léger L, Persson M. Monoamine cell distribution in the cat brain stem. A fluorescence histochemical study with quantification of indolaminergic and locus coeruleus cell groups. J Comp Neurol. 1981;203:613–647. doi: 10.1002/cne.902030405. [DOI] [PubMed] [Google Scholar]
- Wilkinson LS, Humby T, Robbins TW, Everitt BJ. Differential effects of forebrain 5-hydroxytryptamine depletions on Pavlovian aversive conditioning to discrete and contextual stimuli in the rat. Eur J Neurosci. 1995;7:2042–2052. doi: 10.1111/j.1460-9568.1995.tb00627.x. [DOI] [PubMed] [Google Scholar]