Abstract
The basolateral amygdala complex (BLA) has been identified as a critical structure mediating fear extinction. However, little is known about the functional role of neurons in the BLA during the extinction of a reward-seeking behavior. Here, we used in vivo electrophysiology procedures in freely moving rats to investigate whether and how the BLA encodes the extinction of responding for sucrose. We recorded 330 neurons from 15 rats during a within-session extinction procedure following training under a partial reinforcement schedule. Several distinct populations of neurons change their response profiles as the rat ceases to respond for the omitted reinforcer. One population of neurons (32 of 330; 10%), which responded selectively to port entries in the presence, but not the absence, of sucrose during maintenance, subsequently developed a phasic response to port entries in the absence of sucrose during the extinction epoch only. The relative proportion of these “reinforcement-omission” neurons per rat was correlated with response intensity during extinction, as well as with the rate at which reward-seeking behavior was extinguished. A subpopulation of neurons responded with opposite phasic changes in activity to port entries in the presence of sucrose and to port entries during extinction, demonstrating that BLA neurons may contribute to the detection of value differences between expected and actual outcomes. Another population of neurons (47 of 330; 14%) responded to the empty port only during extinction. Because BLA neural correlates reflect the omission of an expected reward, these neuronal populations may contribute to the expression of behavior during extinction.
Introduction
Environmental contingencies for reward availability may change unexpectedly. Thus, the ability of an organism to quickly and appropriately adapt its behavior to a change in the environment is essential to survival. Furthermore, understanding the neural circuitry that mediates the extinction of a reward-seeking behavior is essential for improving therapeutic interventions for a variety of psychopathologies such as compulsive overeating or drug addiction.
Though extinction is likely mediated by a distributed neural network (Quirk and Mueller, 2008), several studies have implicated the basolateral amygdala (BLA) as a critical structure for fear extinction. Intra-BLA infusions of NMDAR antagonists, kinase inhibitors, or the GABAA agonist, muscimol, impair the acquisition of fear extinction (Falls et al., 1992; Lu et al., 2001; Lin et al., 2003; Sotres-Bayon et al., 2007). Although there is a rapidly accumulating literature describing the neural basis of fear extinction, little is known about the neural circuitry underlying the extinction of reward-seeking behavior.
In humans, functional magnetic resonance imaging (fMRI) studies demonstrate that the amygdala responds differently to reward receipt and omission (Ernst et al., 2005). In primates, single-unit recording within the amygdala has identified a subset of neurons that respond to reinforcer omission (Belova et al., 2007). The amygdala has been implicated in processing the distinction between stimuli that predict positive and negative hedonic outcomes in primates (Paton et al., 2006) and in rats (Schoenbaum et al., 1998; Saddoris et al., 2005). However, little is known about the relationship between amygdala neuron activity and the behavioral response to the unexpected omission of reward during the extinction of reward-seeking behavior.
Because the BLA is critically involved in processing outcome expectation (Schoenbaum et al., 1998; Corbit and Balleine, 2005; Belova et al., 2007), fear extinction (Repa et al., 2001; Myers and Davis, 2002; Herry and Mons, 2004; Herry et al., 2008; Likhtik et al., 2008; Quirk and Mueller, 2008; Ehrlich et al., 2009), and the expression and modulation of conditioned reward seeking (Cador et al., 1989; Baxter and Murray, 2002; Carelli et al., 2003; McLaughlin and See, 2003; McLaughlin and Floresco, 2007; Tye and Janak, 2007; Tye et al., 2008), we hypothesized that BLA neurons encode the change in outcome expectation that underlies the extinction of reward-seeking behavior. To test this hypothesis, we recorded the activity of BLA neurons in vivo in freely moving rats during a within-session extinction procedure.
Materials and Methods
In vivo electrophysiological recordings.
Male Sprague Dawley rats were bilaterally implanted with electrodes in the BLA to enable chronic neural recording during maintenance and extinction. Rats were anesthetized with isoflurane and stereotaxically implanted with fixed-wire electrode arrays (NeuroBiological Laboratories), directed at the BLA (anteroposterior, −2.8 to −3.6; mediolateral, ±5.0; dorsoventral, −7.2 to −7.5). Rats were allowed to heal for 5–10 d, during which time they received food and water ad libitum. After healing, rats were food restricted to 90% of their ad libitum consumption for at least 24 h before initial training. All procedures were approved by the Gallo Center Institutional Animal Care and Use Committee and were in accordance with the National Institute of Health guidelines.
Behavioral training.
Rats (n = 16) were trained to perform operant responses at a nosepoke (NP) operandum on a partial reinforcement schedule (∼50% of responses were reinforced) with a subsequent delivery (after 2 s) of 0.1 ml of a sucrose solution (15% sucrose in water) at an adjacent port. Rats were trained in three variations of the same behavioral procedure. The “reward-predictive cue” procedure was as follows: ∼50% of nosepokes were reinforced by a cue presentation followed by sucrose delivery during maintenance, with the cue omitted during extinction. The “random cue” procedure was as follows: ∼50% of nosepokes were reinforced with sucrose delivery only, with cues presented at random intervals. The “response-contingent cue” procedure was as follows: ∼50% of nosepokes were reinforced by a cue presentation and sucrose delivery during maintenance, with nosepokes reinforced only by the cue during extinction. For all groups, rats that did not reach learning criterion (at least 30 sucrose deliveries earned in a 2 h session) by the fourth session were excluded (n = 1 rat excluded, resulting in a final n of 5 per group). All rats then received an additional posttraining “maintenance” session before the within-session extinction session. The within-session extinction procedure consisted of ∼40 min of maintenance followed by a 1 h extinction epoch during which the sucrose reward was not available. Because there were no significant differences among the three groups in neural or behavioral activity, the data from these groups were pooled, unless otherwise indicated.
Histology.
After the completion of the last session, animals were anesthetized with isoflurane and decapitated. A 19 μA current was passed through each recording electrode that had identifiable single units. The brain was fixed in 10% formalin, 3% potassium ferrocyanide overnight. Brains were then submerged in 20% sucrose and 3% potassium ferrocyanide overnight. Potassium ferrocyanide was used to visualize and determine the location of each electrode tip. Brains were sectioned (50 μm) throughout the extent of the amygdala. Alternating sections were stained with Neutral Red Nissl staining, allowing the visualization of the blue electrode placement marking in relation to the subnuclei of the amygdala. Serial sections were examined under a light microscope, and the location of each electrode tip was plotted on coronal sections taken from the rat stereotaxic atlas (Paxinos and Watson, 1998).
Single-unit recording and discrimination.
Neural activity was recorded using commercial hardware and software, including headstage and programmable amplifiers, filters (0.5 and 5 kHz, 3 dB cutoffs), and multichannel spike sorting software (Plexon Instruments). Discrimination of individual units was performed off-line using principal component analysis of waveform shape. Single units were identified by constancy of waveform shape, cross-correlogram, autocorrelogram, and interspike interval.
Analyzing neural response properties.
Sorted files were processed in NeuroExplorer (Plexon Instruments) to extract unit timestamps and relevant reference event markers. Neural activity was characterized via perievent raster and perievent histogram analysis. NeuroExplorer-extracted timestamps were then exported to MATLAB and analyzed for statistical significance of changes in phasic activity to various reference events. To determine whether a neuron altered firing during port entry, firing rate in each of five 100 ms bins following the reference event was compared with baseline firing rate during the 0.5 s baseline response window on a trial-by-trial basis using the nonparametric Wilcoxon signed-rank test. The baseline response window was determined on an individual rat-by-rat basis during which the animal had the least behavioral activity. Responses were deemed statistically significant if any 100 ms bins in the response window was statistically significant relative to the baseline epoch. Significant correlations with behavioral variables of interest (response intensity and extinction resistance) and neural activity surrounding the nosepoke operant response and the cue were not observed; thus the analyses reported here focus on reward delivery/port entry neural correlates. Categorization of neuronal types was confirmed by ensuring significant differences between port entry responses for a given neuron by comparing the three port entry conditions (sucrose port entry during maintenance, empty port entry during maintenance, and empty port entry during extinction) using the nonparametric ANOVA, the Friedman test. Significance was set at p < 0.01. For the surface plots shown in Figures 5 and 6, the Z-score was calculated for each neuron on each trial for every 100 ms bin from 4 s before to 4 s after each port entry (PE), as follows: Ztrial,bin = FRtest bin − FRbaseline avg/SDbaseline, where FRtest bin is the firing rate of a neuron for each 100 ms bin in the response window, FRbaseline avg is the average firing rate per trial of each 100 ms bin within the baseline epoch, and SDbaseline is the SD of the firing rates of the 100 ms bins of the baseline epoch for each trial. These data were then used to construct the surface plots in MATLAB.
Behavioral analyses.
The latency between each NP and the PE was calculated for each rat throughout the session. Early extinction was defined as the period between extinction onset and the first nosepoke–port entry trial for which the latency between the nosepoke and the subsequent port entry was >2 SDs from the mean (SDs and means were calculated for the entire session). Late extinction was defined as the period between the first trial for which the nosepoke–port entry latency was >2 SDs from the mean latency to the end of the session. Reinforced and nonreinforced nosepokes and port entries were tabulated in 10 min bins as depicted in Figure 2, B and C. Total nonreinforced nosepokes and port entries were summed for the duration of the 60 min of extinction. The extinction slope was calculated using 10 min bins across 60 min of extinction.
Response bout analysis.
To quantitatively analyze bouts of behavioral responding, we used a parameter-based burst analysis method in NeuroExplorer with nosepoke response bouts defined as a minimum of two responses with an initial interevent interval <1 s and a maximum interresponse interval of 60 s (Janak et al., 1998). Mean peak frequency was calculated by NeuroExplorer by using the minimum interevent interval between two responses per bout averaged across bouts.
Statistical analyses.
All correlations were calculated using Pearson's correlation test. For all scatter plots used for correlational analysis where the relative proportion of neurons within a given response profile category was plotted on the y-axis, only rats with n ≥ 8 total recorded neurons were included, resulting in the exclusion of two rats from these plots. Differences in the probability of behavioral responses following a nosepoke or port entry in early and late extinction were calculated using χ2 tests.
Results
We recorded a total of 330 BLA neurons across 15 adult male rats (Fig. 1). After recovery, rats were trained to respond at a nosepoke operandum on a partial (50%) reinforcement schedule for sucrose delivered to an adjacent port (Fig. 2A). During maintenance, when reward was available to be earned, the partial reinforcement schedule enabled comparison of neural responses for rewarded and nonrewarded port entries. After five training sessions, rats underwent within-session extinction wherein nosepoke responses had no consequence and sucrose deliveries were unavailable (Fig. 2). Note that although the presence of a reward-predictive cue was varied across the three experimental groups, no significant differences were observed among the reward-predictive cue, random cue, and response-contingent cue groups for nosepoke responses (Fig. 2B) or port entries (Fig. 2C) during maintenance and extinction (repeated-measures ANOVA, all F values <2.0, all p values >0.16), and therefore the subjects were pooled for subsequent analyses.
Extinction resistance increases with response intensity
Despite some behavioral variability, rats typically displayed a distinct change in the pattern of nosepoke responding upon extinction onset (Fig. 3A). During maintenance, responding at the nosepoke operandum was steady and continuous (Fig. 3A). In contrast, upon extinction onset, the rate of responding at both the nosepoke operandum and the reward port became irregular and was characterized by discrete bouts of responding (Fig. 3A). To quantify the intensity of nosepoke responding, the peak frequency (Fig. 3A, inset) from each bout of responding was averaged to find the mean peak frequency. The mean peak frequency during extinction was normalized to the mean peak frequency during maintenance to obtain a measure we termed “response intensity.”
Additionally, we examined the rate of extinction by comparing the nosepoke response rate during the last 10 min of extinction relative to the entire extinction session, a measure we termed “extinction resistance.” We found a significant correlation (r = 0.6327, p = 0.0114; Pearson's correlation test) between an individual rat's response intensity and extinction resistance (Fig. 3B).
Neural encoding of port entry and sucrose
Of 330 BLA neurons we recorded during this session, 194 (59%) neurons exhibited a change in activity upon reward port entry. The response profiles of these neurons were heterogeneous both within and across groups and the characterization of these neuronal subpopulations is detailed in Table 1. One subpopulation of neurons showed phasic responses to port entries under any condition (Fig. 4A–D), regardless of whether sucrose was present or absent, or if the port entry occurred during maintenance or extinction, and was hence termed “all port entry” (n = 31; 9% of all neurons recorded). While “all port entry”-responsive neurons may just encode the motoric output of entering the port, many other populations showed conditional specificity. The largest population of port-entry-responsive neurons were neurons that selectively responded to the presence of sucrose (sucrose port entry; n = 50; 15%), hence termed “sucrose-only” neurons (Fig. 4E–H). Conversely, another population of neurons termed “empty port” neurons, showed phasic responses only to port entries when sucrose was absent (n = 15; 5%), regardless of whether the port entry occurred during maintenance or extinction.
Table 1.
Population | Port entries during |
All rats pooled |
Rew. predictive |
Random |
Resp. contingent |
||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Maintenance | ExtinctionEmpty | n | % oftotal(n = 330) | n | % oftotal(n = 88) | n | % oftotal(n = 93) | n | % oftotal(n = 149) | ||
Sucrose | Empty | ||||||||||
Reinforcement omission (n = 32/330; 10%) | ↓ | — | ↓ | 22/32 | 7% | 5/88 | 6% | 6/93 | 6% | 11/149 | 7% |
↑ | — | ↓ | 7/32 | 2% | 3/88 | 3% | 2/93 | 2% | 2/149 | 1% | |
↓ | — | ↑ | 3/32 | 1% | 1/88 | 1% | 0/93 | 0% | 2/149 | 1% | |
Extinction only (n = 47/330; 14%) | — | — | ↓ | 41/47 | 9% | 12/88 | 14% | 13/93 | 14% | 16/149 | 11% |
— | — | ↑ | 6/47 | 2% | 3/88 | 3% | 3/93 | 3% | 0/149 | 0% | |
Maint. only (n = 4/330; 1%) | ↑ | ↑ | — | 3/4 | 1% | 0/88 | 0% | 3/93 | 3% | 0/149 | 0% |
↓ | ↓ | — | 1/4 | 0.30% | 0/88 | 0% | 0/93 | 0% | 1/149 | 0.70% | |
Empty in Maint. (n = 17/330; 5%) | — | ↓ | — | 15/17 | 4.50% | 5/88 | 6% | 2/93 | 2% | 8/149 | 5% |
— | ↑ | — | 2/17 | 0.60% | 1/88 | 1% | 0/93 | 0% | 1/149 | 0.70% | |
Empty port (n = 15/330; 5%) | — | ↓ | ↓ | 10/15 | 3% | 2/88 | 2% | 2/93 | 2% | 6/149 | 4% |
— | ↑ | ↑ | 3/15 | 1% | 1/88 | 1% | 1/93 | 1% | 1/149 | 0.70% | |
— | ↑ | ↓ | 1/15 | 0.30% | 0/88 | 0% | 0/93 | 0% | 1/149 | 0.70% | |
— | ↓ | ↑ | 1/15 | 0.30% | 0/88 | 0% | 0/93 | 0% | 1/149 | 0.70% | |
Sucrose only (n = 50/330; 15%) | ↓ | — | — | 37/50 | 11% | 12/88 | 14% | 11/93 | 12% | 14/149 | 9% |
↑ | — | — | 13/50 | 4% | 6/88 | 7% | 7/93 | 8% | 0/149 | 0% | |
All port entries (n = 31/330; 9%) | ↓ | ↓ | ↓ | 23/31 | 7% | 5/88 | 6% | 5/93 | 5% | 13/149 | 9% |
↑ | ↑ | ↑ | 5/31 | 1.60% | 3/88 | 3% | 1/93 | 1% | 1/149 | 0.70% | |
↓ | ↓ | ↑ | 1/31 | 0.30% | 0/88 | 0% | 0/93 | 0% | 1/149 | 0.70% | |
↓ | ↑ | ↓ | 1/31 | 0.30% | 0/88 | 0% | 1/93 | 1% | 0/149 | 0% | |
↑ | ↓ | ↓ | 1/31 | 0.30% | 1/88 | 1% | 0/93 | 0% | 0/149 | 0% | |
No response (n = 134/330; 41%) | — | — | — | 134/330 | 41% | 28/88 | 32% | 36/93 | 39% | 70/149 | 47% |
Here, each population and subpopulation is detailed with neuron counts and changes in phasic activity in each port entry scenario. Upward arrows indicate an increase in firing during the response window (0–500 ms following port entry; p < 0.01), downward arrows indicate a decrease in phasic activity, and the flat line indicates no change. Categorization by response profile in terms of cell number and proportions are shown for all rats pooled, the reward-predictive cue group, the random cue group, and the response contingent cue group in separate columns. Maint., Maintenance; Rew., reward; Resp, response.
A population of neurons develops a response to empty port entries only upon extinction
A total of 79 of 330 neurons (24%) developed a response to the empty port only during extinction. Within this population of neurons, two primary subpopulations can be identified by their responsiveness to port entries when sucrose is present. Importantly, our partial reinforcement schedule allowed us to compare neuronal responses to the same event, port entry when sucrose is absent (empty port entry), during either maintenance or extinction. A significant population of neurons termed “extinction-only” neurons (n = 47; 14%) showed no phasic change in activity upon port entries during maintenance, regardless of whether sucrose was present or absent, but developed a phasic response to empty port entries during extinction (Figs. 5, 6).
Forty-one of forty-seven “extinction-only” neurons exhibited an inhibition in phasic activity upon port entries during extinction. Ten of these forty-one neurons showed a tonic change in firing upon extinction, in addition to a phasic change in firing. Figure 5A shows a population histogram of three neurons (out of 11 total simultaneously recorded neurons) that show a selective inhibition to empty port entries only during the within-session extinction procedure from a representative rat. Throughout the course of extinction, the latency between the nosepoke and the subsequent port entry increased as the animal extinguished reward-seeking behavior (values represent group mean seconds ± SEM; early extinction = 6.132 ± 1.194 and late extinction = 76.026 ± 18.986; p = 0.002). The boundary between early and late extinction is defined as the trial for which the latency between the nosepoke and the subsequent port entry was >2 SDs from the session mean. Furthermore, given a port entry response, the probability that the next behavioral response that the rat will perform is a nosepoke response, rather than another port entry response, significantly decreased in late extinction relative to early extinction (Fig. 5B) (χ2 = 6.578; p = 0.010). Of these 31 “extinction-only” neurons that showed phasic inhibitions, the relative proportion of neurons of this type per rat was inversely correlated with response intensity (Fig. 5C), suggesting that the more neurons recruited to this population, the less intensity of responding during extinction.
This correlation was not observed when the “extinction-only” neurons displaying phasic excitation to the empty port during extinction (n = 6) (Table 1) were included, suggesting that this subpopulation of cells may be functionally distinct from “extinction-only” cells showing phasic inhibitions. The response of a representative neuron that developed a phasic increase in activity in response to empty port entries upon extinction onset is shown in a surface plot in Figure 6A. During early extinction, this neuron displays an increase in tonic firing rate as the phasic excitation develops. The transient increase in tonic firing rate returns to baseline upon late extinction, leaving a pronounced phasic excitation in response to empty port entry. Similar to the behavior of the representative rat shown in Figure 5B, this rat also shows a significant shift (decrease) in the probability of a nosepoke following a port entry response in late extinction (Fig. 6B) (χ2 = 4.338; p = 0.036).
Response intensity and extinction resistance correspond to the proportion of reinforcement-omission-encoding neurons per rat
The “extinction-only” neurons (n = 47), discussed above, respond selectively to the empty port entries occurring during extinction (Fig. 7). However, 32 of the 79 neurons that developed a response to the empty port only upon extinction onset also showed a phasic response to port entries when sucrose was present (10% of all 330 neurons recorded) (Fig. 8A–D). Since this population of neurons was selectively responsive to the presence of sucrose during maintenance and developed a phasic response to the empty port only upon extinction, we termed them “reinforcement-omission” neurons, as these neurons remapped their response profiles upon unexpected reinforcement omission during our within-session extinction procedure. Twenty-two of these thirty-two neurons were phasically inhibited to both port entries when sucrose was present and port entries during extinction (Fig. 8A,B). The other 10 cells of these 32 units showed opposite phasic changes: 7 showed a phasic excitation upon port entries when sucrose was present, no phasic response upon empty port entries during maintenance, and then a phasic decrease in activity upon empty port entries during extinction (Fig. 8C,D, Table 1). The other 3 units in this population were inhibited upon sucrose port entries, and excited upon empty port entries during extinction (Table 1).
Although the majority of rats in our task did not show a dramatic increase in response intensity during extinction relative to maintenance (Fig. 3C), perhaps due to the nature of training under a partial reinforcement schedule (see Discussion), the behavioral variability during extinction allowed us to observe the relationship between behavioral measures and the proportion of neurons with a given response profile. We found that the proportion of neurons in the “reinforcement-omission” population out of the total number of neurons per rat was significantly correlated with each rat's response intensity (Fig. 8E) (r = 0.7476, p = 0.0033) and extinction resistance (Fig. 8F) (r = 0.5671; p = 0.0433). These correlations were nonrandom, as no other population of neurons was significantly correlated to either of these measures (Table 2). Furthermore, none of the neuronal populations we identified were correlated with total numbers of nonreinforced responses or with extinction slope for responses at either the nosepoke operandum or the sucrose delivery port (Table 2).
Table 2.
Population | Extinction resistance | Response intensity | Total nonreinforced responses |
Extinction slope |
||
---|---|---|---|---|---|---|
Nosepoke | Port entry | Nosepoke | Port entry | |||
Reinforcement omission (n = 32/330; 10%) | ||||||
r2 | 0.3216 | 0.5590 | 0.0206 | 0.0148 | 0.0430 | 0.0224 |
p | 0.0433 | 0.0033 | 0.6398 | 0.6919 | 0.4964 | 0.6253 |
Extinction only (n = 47/330; 14%) | ||||||
r2 | 0.1764 | 0.2521 | 0.0660 | 0.0729 | 0.0118 | 0.1127 |
p | 0.1530 | 0.0084 | 0.3967 | 0.3721 | 0.7242 | 0.2622 |
Maint. only (n = 4/330; 1%) | N/A | N/A | N/A | N/A | N/A | N/A |
Empty in Maint. (n = 17/330; 5%) | ||||||
r2 | 0.07386 | 0.1130 | 0.0630 | 0.0573 | 0.0214 | 0.0239 |
p | 0.3691 | 0.2614 | 0.4080 | 0.4306 | 0.6336 | 0.6143 |
Empty port (n = 15/330; 5%) | ||||||
r2 | 0.0299 | 0.06619 | 0.0800 | 0.0032 | 0.0243 | 0.0738 |
p | 0.5724 | 0.3916 | 0.3491 | 0.8536 | 0.6113 | 0.3639 |
Sucrose only (n = 50/330; 15%) | ||||||
r2 | 0.07117 | 0.07795 | 0.0115 | 0.0388 | 0.0061 | 0.2316 |
p | 0.3783 | 0.3556 | 0.7279 | 0.5188 | 0.8005 | 0.0959 |
All port entries (n = 31/330; 9%) | ||||||
r2 | 0.00814 | 0.0005 | 0.5658 | 0.4435 | 0.3434 | 0.2666 |
p | 0.7694 | 0.9419 | 0.0030 | 0.0130 | 0.0353 | 0.0708 |
No response (n = 134/330; 41%) | ||||||
r2 | 0.0686 | 0.0459 | 0.0924 | 0.1912 | 0.0535 | 0.2114 |
p | 0.3874 | 0.4823 | 0.3125 | 0.1351 | 0.4473 | 0.1140 |
Discussion
Here, we report that BLA neurons encode the changes in outcome availability that occur during extinction by remapping their phasic response profiles upon extinction of reward-seeking behavior. Specifically, amygdala neurons responded to the absence of reward delivery during extinction. Because we found that the activity of a population of these neurons, termed “reinforcement-omission” neurons, was positively correlated with response intensity (Fig. 8E) and extinction resistance (Fig. 8F), these neurons may contribute to the maintenance of behavioral responding during extinction.
Neural correlates of extinction in the BLA
Because we used a partial reinforcement schedule, we were able to compare neuronal responses to the empty port during extinction to port entries during maintenance in the presence or absence of sucrose. Of 330 neurons, 79 developed a phasic response to empty port entries during extinction (24%) that had no response to the empty port during maintenance. Of these, the largest population (14%; “extinction-only”) developed a response to the reward port during extinction that was not present during maintenance in the presence or absence of sucrose.
The second major population of neurons (10%) responded to the presence of sucrose during maintenance and developed a response to the empty port during extinction. As these neurons were initially sensitive to reinforcement, and then remapped their response profiles upon unexpected reinforcement omission, we termed them “reinforcement-omission” neurons. This population was primarily neurons with inhibitions to sucrose and to the empty port during extinction; however, one-third of this population displayed opposite changes in firing to sucrose and to the empty port during extinction. Together, the relative proportion of these neurons recorded within each rat was predictive of both response intensity and extinction resistance. Thus, subsets of BLA neurons are highly responsive to the change in outcome availability in the context of extinction.
Neurons responsive to all port entries (Fig. 4) may encode recognition of the port as a motivationally significant cue. Similarly, neurons responsive to port entries when sucrose was present (Fig. 4) may encode reinforcement in general, or the specific value or sensory attributes of the outcome (Schoenbaum et al., 1998; Belova et al., 2007). In contrast, neurons that responded to the empty port only during extinction must specifically encode the change in reward availability. We demonstrate that the unexpected absence of reward during extinction is encoded differently than the expected absence of reward during responding in maintenance.
Interestingly, 66 of the 79 neurons (84%) that developed phasic responses to the empty port upon extinction were phasic inhibitions. Eighty-five to ninety-five percent of BLA neurons are glutamatergic projection neurons with the remainder comprised of GABA-ergic neurons, either interneurons distributed within the nuclei or part of the intercalated cell masses around the BLA (McDonald, 1984; Washburn and Moises, 1992; McDonald and Augustine, 1993; Marowsky et al., 2005; Ehrlich et al., 2009). As the majority of neurons we recorded were likely glutamatergic projection neurons receiving inhibitory input from local interneurons and intercalated cells, our findings are consistent with the notion that local inhibitory circuits in the amygdala play a critical role in extinction (Ehrlich et al., 2009).
Our results are consistent with a growing body of literature implicating the amygdala in fear extinction (Quirk and Mueller, 2008; Ehrlich et al., 2009). Inactivation of the BLA with muscimol blocks acquisition of fear extinction (Sotres-Bayon et al., 2007; Herry et al., 2008). In addition, a distinct population of neurons in the BLA develops a response to a shock-conditioned cue after behavioral extinction (Herry et al., 2008). Our studies extend these neural recording findings by identifying changes in the neural response to the reinforcer/outcome, which is relatively more difficult to measure in fear conditioning studies, and extend the study of neural activity during extinction into instrumentally conditioned responses rather than Pavlovian conditioned responses. Our findings support a more general role of the BLA in processing extinction learning, independent of reinforcer valence.
Our data also complement findings implicating the BLA in the extinction of appetitively conditioned responses; specifically, McLaughlin and Floresco (2007) reported that pharmacological inactivation of the BLA impairs extinction of responding for food reward, in agreement with the idea that neural activity within the BLA contributes to changes in responding produced by extinction. Finally, our findings are congruent with lesion studies that demonstrate a role for the BLA in representing specific outcomes, and, critically, for the ability of changes in the value of an outcome to impact responding (Balleine et al., 2003; Corbit and Balleine, 2005; McLaughlin and Floresco, 2007; Ehrlich et al., 2009).
Role of BLA neuronal correlates in reward-seeking extinction
In extinction, an animal learns that the previously conditioned response is no longer reinforced (Pavlov, 1927), creating a distinct action–no outcome association. The neural mechanisms of extinction, particularly the extinction of reward seeking, are poorly understood. Extinction has been hypothesized to be an “unlearning” process due to a violation of the learned association (Rescorla and Wagner, 1972), and this hypothesis is supported by evidence that extinction reverses fear conditioning-induced potentiation at thalamo-amygdala synapses (Kim et al., 2007). However, a competing hypothesis suggests that extinction is mediated by formation of new associations that inhibit, rather than erase, expression of the original learned association (Pavlov, 1927; Bouton and King, 1983; Rescorla, 2001; Myers and Davis, 2002; Quirk et al., 2006; Ehrlich et al., 2009).
If extinction were new learning, then one would expect to observe the appearance of new neural correlates during extinction that signal the change in reinforcer contingencies. In the present study of sucrose-reinforced instrumental responding, the development of an inhibitory action–no outcome association should require a neural signal corresponding to the absence of reward. The “extinction-only” neurons may provide, at least in part, this signal of reward omission. When considering only the 41 of 47 neurons in the “extinction-only” population that showed phasic inhibitions to port entries only during extinction, the proportion of these neurons per rat showed a strong trend toward an inverse correlation with response intensity (r = −0.5543, p = 0.0493) (Fig. 5C). While speculative, this idea is consistent with findings that inactivation of the BLA impairs extinction acquisition (Sotres-Bayon et al., 2007; Herry et al., 2008), including extinction of operant responding for food (McLaughlin and Floresco, 2007).
The intensity of responding during extinction varied considerably among our subjects (Fig. 2). Competition between newly forming inhibitory associations and the original excitatory association may contribute to the behavioral variability during a within-session extinction procedure in which the animal has recently experienced both contingencies, and may explain the sporadic, high-intensity bouts of responding we observed during extinction. We used this to our advantage to probe neural mechanisms underlying responding during extinction. We speculated that the intensity of responding was related to the continued expectation of reward. A subpopulation of “reinforcement-omission” neurons (22 of 32 cells), showed phasic inhibitions upon both sucrose port entries during maintenance and empty port entries during extinction, but not empty port entries during maintenance. During extinction, the continued phasic responsiveness of these neurons related to the expected but omitted reward may contribute to maintained motivation of the animal to continue responding for rewards despite a change in response-outcome contingency. We hypothesize that this neuronal subpopulation encodes the memory or expectation of the sucrose reward during extinction. The remaining neurons (10 of 32) in the “reinforcement-omission” population, which showed opposite phasic changes in response to the sucrose port entries and extinction port entries, may mediate the distinction between the expected reward and reward omission.
We hypothesized that, together, these neuronal subpopulations comprising the “reinforcement-omission” population support the continued responding for reward during extinction and detect the value difference between expected and actual outcomes, therefore contributing to enhanced instrumental response intensity and extinction resistance. Indeed, we found that the proportion of neurons in the “reinforcement-omission” per rat was strongly correlated with each rat's response intensity (Fig. 8E) and extinction resistance (Fig. 8F).
We did not observe significant correlations between reinforcement-omission and extinction-only neuronal populations and measures of total nonreinforced responding during extinction or with the slope of the extinction curve (Table 2). This may be due to the highly variable and sporadic response patterns observed upon extinction onset in our within-session extinction procedure, and suggests to us that the examination of neural correlates of extinction was facilitated by normalizing the behavior of each subject to that subject's baseline responding, as in the response intensity measure.
BLA involvement in expression of “the frustration effect”
The amygdala has been implicated in mediating the behavioral phenomenon, termed “the reinforcement-omission effect” or “frustration effect,” that occurs when an animal trained to respond for a positive reinforcer unexpectedly finds that the reinforcer is omitted following the operant response previously associated with reinforcer procurement (Amsel and Roussel, 1952; Henke, 1973; Henke and Maxwell, 1973; Dudley and Papini, 1997; Abler et al., 2005; Ernst et al., 2005). Hull (1952) first described this behavioral phenomenon, stating that “the abrupt cessation of a customary reinforcement” results in an initial, transient increase in operant responding. Although using a partial reinforcement procedure while recording neural activity in vivo allows us to control for specific conditions under different contingencies, it also results in greater variability in the expression of “the reinforcement-omission effect” during extinction (Amsel, 1958). BLA lesions attenuate the expression of “the frustration effect” (Henke, 1973; Henke and Maxwell, 1973), consistent with our finding that the proportion of neurons encoding the omitted sucrose reinforcer predicts response intensity during extinction (Fig. 8). It is an intriguing possibility that the “reinforcement-omission” neuronal responses we have identified here contribute to expression of “the frustration effect” during extinction.
Conclusion
In conclusion, we report that upon unexpected reward omission during a within-session extinction procedure, BLA neurons remap their phasic response profiles with the acquisition of extinction. These findings encourage future experimentation to elucidate the synaptic and circuit mechanisms in the BLA and other neural substrates such as the prefrontal (Quirk et al., 2000, 2006; Rosenkranz et al., 2003; Peters et al., 2008) and orbitofrontal (Schoenbaum et al., 1998; Tremblay and Schultz, 2000; Schoenbaum and Roesch, 2005) cortices that may mediate the responses to the absence of reward during extinction.
Footnotes
This research was supported by the State of California for Medical Research on Alcohol and Substance Abuse through the University of California at San Francisco (P.H.J.) and a National Science Foundation Graduate Research Fellowship (K.M.T.). We thank E. E. Steinberg, T. M. Gill, R. Keiflin, and R. Kapur for helpful comments.
References
- Abler B, Walter H, Erk S. Neural correlates of frustration. Neuroreport. 2005;16:669–672. doi: 10.1097/00001756-200505120-00003. [DOI] [PubMed] [Google Scholar]
- Amsel A. The role of frustrative nonreward in noncontinuous reward situations. Psychol Bull. 1958;55:102–119. doi: 10.1037/h0043125. [DOI] [PubMed] [Google Scholar]
- Amsel A, Roussel J. Motivational properties of frustration. I. Effect on a running response of the addition of frustration to the motivational complex. J Exp Psychol. 1952;43:363–366. doi: 10.1037/h0059393. [DOI] [PubMed] [Google Scholar]
- Balleine BW, Killcross AS, Dickinson A. The effect of lesions of the basolateral amygdala on instrumental conditioning. J Neurosci. 2003;23:666–675. doi: 10.1523/JNEUROSCI.23-02-00666.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baxter MG, Murray EA. The amygdala and reward. Nat Rev Neurosci. 2002;3:563–573. doi: 10.1038/nrn875. [DOI] [PubMed] [Google Scholar]
- Belova MA, Paton JJ, Morrison SE, Salzman CD. Expectation modulates neural responses to pleasant and aversive stimuli in primate amygdala. Neuron. 2007;55:970–984. doi: 10.1016/j.neuron.2007.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bouton ME, King DA. Contextual control of the extinction of conditioned fear: tests for the associative value of the context. J Exp Psychol Anim Behav Process. 1983;9:248–265. [PubMed] [Google Scholar]
- Cador M, Robbins TW, Everitt BJ. Involvement of the amygdala in stimulus-reward associations: interaction with the ventral striatum. Neuroscience. 1989;30:77–86. doi: 10.1016/0306-4522(89)90354-0. [DOI] [PubMed] [Google Scholar]
- Carelli RM, Williams JG, Hollander JA. Basolateral amygdala neurons encode cocaine self-administration and cocaine-associated cues. J Neurosci. 2003;23:8204–8211. doi: 10.1523/JNEUROSCI.23-23-08204.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corbit LH, Balleine BW. Double dissociation of basolateral and central amygdala lesions on the general and outcome-specific forms of pavlovian-instrumental transfer. J Neurosci. 2005;25:962–970. doi: 10.1523/JNEUROSCI.4507-04.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dudley RT, Papini MR. Amsel's frustration effect: a pavlovian replication with control for frequency and distribution of rewards. Physiol Behav. 1997;61:627–629. doi: 10.1016/s0031-9384(96)00498-2. [DOI] [PubMed] [Google Scholar]
- Ehrlich I, Humeau Y, Grenier F, Ciocchi S, Herry C, Lüthi A. Amygdala inhibitory circuits and the control of fear memory. Neuron. 2009;62:757–771. doi: 10.1016/j.neuron.2009.05.026. [DOI] [PubMed] [Google Scholar]
- Ernst M, Nelson EE, Jazbec S, McClure EB, Monk CS, Leibenluft E, Blair J, Pine DS. Amygdala and nucleus accumbens in responses to receipt and omission of gains in adults and adolescents. Neuroimage. 2005;25:1279–1291. doi: 10.1016/j.neuroimage.2004.12.038. [DOI] [PubMed] [Google Scholar]
- Falls WA, Miserendino MJ, Davis M. Extinction of fear-potentiated startle: blockade by infusion of an NMDA antagonist into the amygdala. J Neurosci. 1992;12:854–863. doi: 10.1523/JNEUROSCI.12-03-00854.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henke PG. Effects of reinforcement omission on rats with lesions in the amygdala. J Comp Physiol Psychol. 1973;84:187–193. doi: 10.1037/h0035015. [DOI] [PubMed] [Google Scholar]
- Henke PG, Maxwell D. Lesions in the amygdala and the frustration effect. Physiol Behav. 1973;10:647–650. doi: 10.1016/0031-9384(73)90137-6. [DOI] [PubMed] [Google Scholar]
- Herry C, Mons N. Resistance to extinction is associated with impaired immediate early gene induction in medial prefrontal cortex and amygdala. Eur J Neurosci. 2004;20:781–790. doi: 10.1111/j.1460-9568.2004.03542.x. [DOI] [PubMed] [Google Scholar]
- Herry C, Ciocchi S, Senn V, Demmou L, Müller C, Lüthi A. Switching on and off fear by distinct neuronal circuits. Nature. 2008;454:600–606. doi: 10.1038/nature07166. [DOI] [PubMed] [Google Scholar]
- Hull CL. New Haven, CT: Yale UP; 1952. A behavior system: an introduction to behavior theory concerning the individual organism. [Google Scholar]
- Janak PH, Redfern JE, Samson HH. The reinforcing effects of ethanol are altered by the endogenous neurosteroid, allopregnanolone. Alcohol Clin Exp Res. 1998;22:1106–1112. [PubMed] [Google Scholar]
- Kim J, Lee S, Park K, Hong I, Song B, Son G, Park H, Kim WR, Park E, Choe HK, Kim H, Lee C, Sun W, Kim K, Shin KS, Choi S. Amygdala depotentiation and fear extinction. Proc Natl Acad Sci U S A. 2007;104:20955–20960. doi: 10.1073/pnas.0710548105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Likhtik E, Popa D, Apergis-Schoute J, Fidacaro GA, Paré D. Amygdala intercalated neurons are required for expression of fear extinction. Nature. 2008;454:642–645. doi: 10.1038/nature07167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin CH, Yeh SH, Lu HY, Gean PW. The similarities and diversities of signal pathways leading to consolidation of conditioning and consolidation of extinction of fear memory. J Neurosci. 2003;23:8310–8317. doi: 10.1523/JNEUROSCI.23-23-08310.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lu KT, Walker DL, Davis M. Mitogen-activated protein kinase cascade in the basolateral nucleus of amygdala is involved in extinction of fear-potentiated startle. J Neurosci. 2001;21 doi: 10.1523/JNEUROSCI.21-16-j0005.2001. RC162(1–5) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marowsky A, Yanagawa Y, Obata K, Vogt KE. A specialized subclass of interneurons mediates dopaminergic facilitation of amygdala function. Neuron. 2005;48:1025–1037. doi: 10.1016/j.neuron.2005.10.029. [DOI] [PubMed] [Google Scholar]
- McDonald AJ. Neuronal organization of the lateral and basolateral amygdaloid nuclei in the rat. J Comp Neurol. 1984;222:589–606. doi: 10.1002/cne.902220410. [DOI] [PubMed] [Google Scholar]
- McDonald AJ, Augustine JR. Localization of GABA-like immunoreactivity in the monkey amygdala. Neuroscience. 1993;52:281–294. doi: 10.1016/0306-4522(93)90156-a. [DOI] [PubMed] [Google Scholar]
- McLaughlin J, See RE. Selective inactivation of the dorsomedial prefrontal cortex and the basolateral amygdala attenuates conditioned-cued reinstatement of extinguished cocaine-seeking behavior in rats. Psychopharmacology (Berl) 2003;168:57–65. doi: 10.1007/s00213-002-1196-x. [DOI] [PubMed] [Google Scholar]
- McLaughlin RJ, Floresco SB. The role of different subregions of the basolateral amygdala in cue-induced reinstatement and extinction of food-seeking behavior. Neuroscience. 2007;146:1484–1494. doi: 10.1016/j.neuroscience.2007.03.025. [DOI] [PubMed] [Google Scholar]
- Myers KM, Davis M. Behavioral and neural analysis of extinction. Neuron. 2002;36:567–584. doi: 10.1016/s0896-6273(02)01064-4. [DOI] [PubMed] [Google Scholar]
- Paton JJ, Belova MA, Morrison SE, Salzman CD. The primate amygdala represents the positive and negative value of visual stimuli during learning. Nature. 2006;439:865–870. doi: 10.1038/nature04490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pavlov IP. Conditioned reflexes: an investigation of the physiological activity of the cerebral cortex. London: Oxford UP; 1927. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paxinos G, Watson C. The rat brain in stereotaxic coordinates. New York: Academic; 1998. [DOI] [PubMed] [Google Scholar]
- Peters J, Vallone J, Laurendi K, Kalivas PW. Opposing roles for the ventral prefrontal cortex and the basolateral amygdala on the spontaneous recovery of cocaine-seeking in rats. Psychopharmacology (Berl) 2008;197:319–326. doi: 10.1007/s00213-007-1034-2. [DOI] [PubMed] [Google Scholar]
- Quirk GJ, Mueller D. Neural mechanisms of extinction learning and retrieval. Neuropsychopharmacology. 2008;33:56–72. doi: 10.1038/sj.npp.1301555. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quirk GJ, Russo GK, Barron JL, Lebron K. The role of ventromedial prefrontal cortex in the recovery of extinguished fear. J Neurosci. 2000;20:6225–6231. doi: 10.1523/JNEUROSCI.20-16-06225.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quirk GJ, Garcia R, González-Lima F. Prefrontal mechanisms in extinction of conditioned fear. Biol Psychiatry. 2006;60:337–343. doi: 10.1016/j.biopsych.2006.03.010. [DOI] [PubMed] [Google Scholar]
- Repa JC, Muller J, Apergis J, Desrochers TM, Zhou Y, LeDoux JE. Two different lateral amygdala cell populations contribute to the initiation and storage of memory. Nat Neurosci. 2001;4:724–731. doi: 10.1038/89512. [DOI] [PubMed] [Google Scholar]
- Rescorla RA. Retraining of extinguished Pavlovian stimuli. J Exp Psychol Anim Behav Process. 2001;27:115–124. [PubMed] [Google Scholar]
- Rescorla RA, Wagner AR. A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. In: Black AM, Prokasy WF, editors. Classical conditioning II: current research and theory. New York: Appleton-Century-Crofts; 1972. pp. 64–99. [Google Scholar]
- Rosenkranz JA, Moore H, Grace AA. The prefrontal cortex regulates lateral amygdala neuronal plasticity and responses to previously conditioned stimuli. J Neurosci. 2003;23:11054–11064. doi: 10.1523/JNEUROSCI.23-35-11054.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saddoris MP, Gallagher M, Schoenbaum G. Rapid associative encoding in basolateral amygdala depends on connections with orbitofrontal cortex. Neuron. 2005;46:321–331. doi: 10.1016/j.neuron.2005.02.018. [DOI] [PubMed] [Google Scholar]
- Schoenbaum G, Roesch M. Orbitofrontal cortex, associative learning, and expectancies. Neuron. 2005;47:633–636. doi: 10.1016/j.neuron.2005.07.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schoenbaum G, Chiba AA, Gallagher M. Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning. Nat Neurosci. 1998;1:155–159. doi: 10.1038/407. [DOI] [PubMed] [Google Scholar]
- Sotres-Bayon F, Bush DE, LeDoux JE. Acquisition of fear extinction requires activation of NR2B-containing NMDA receptors in the lateral amygdala. Neuropsychopharmacology. 2007;32:1929–1940. doi: 10.1038/sj.npp.1301316. [DOI] [PubMed] [Google Scholar]
- Tremblay L, Schultz W. Modifications of reward expectation-related neuronal activity during learning in primate orbitofrontal cortex. J Neurophysiol. 2000;83:1877–1885. doi: 10.1152/jn.2000.83.4.1877. [DOI] [PubMed] [Google Scholar]
- Tye KM, Janak PH. Amygdala neurons differentially encode motivation and reinforcement. J Neurosci. 2007;27:3937–3945. doi: 10.1523/JNEUROSCI.5281-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tye KM, Stuber GD, de Ridder B, Bonci A, Janak PH. Rapid strengthening of thalamo-amygdala synapses mediates cue-reward learning. Nature. 2008;453:1253–1257. doi: 10.1038/nature06963. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Washburn MS, Moises HC. Electrophysiological and morphological properties of rat basolateral amygdaloid neurons in vitro. J Neurosci. 1992;12:4066–4079. doi: 10.1523/JNEUROSCI.12-10-04066.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]