Skip to main content
eLife logoLink to eLife
. 2016 Feb 2;5:e12678. doi: 10.7554/eLife.12678

Dopamine regulates stimulus generalization in the human hippocampus

Thorsten Kahnt 1,*, Philippe N Tobler 2
Editor: Michael J Frank3
PMCID: PMC4755747  PMID: 26830462

Abstract

The ability to generalize previously learned information to novel situations is fundamental for adaptive behavior. However, too wide or too narrow generalization is linked to neuropsychiatric disorders. Previous research suggests that interactions between the dopaminergic system and the hippocampus may play a role in generalization, but whether and how the degree of generalization can be modulated via these pathways is currently unknown. Here, we addressed this question in humans using pharmacology, functional magnetic resonance imaging, and computational modeling. Blocking dopamine D2-receptors (D2R) altered generalization behavior as revealed by an increased kurtosis of the generalization gradient, and a decreased width of model-derived generalization parameters. Moreover, D2R-blockade modulated similarity-based responses in the hippocampus and decreased midbrain-hippocampal connectivity, which in turn correlated with individual differences in generalization. These results suggest that dopaminergic activity in the hippocampus may relate to the degree of generalization and highlight a potential target for treatment.

DOI: http://dx.doi.org/10.7554/eLife.12678.001

Research Organism: Human

eLife digest

In the 1920s, two psychologists taught a young child known as ‘Little Albert’ to fear a white rat. They did so by striking a metal bar with a hammer whenever the rat was present. After experiencing the rat and noise together on multiple occasions, Little Albert eventually began to cry whenever the rat appeared. However, he also showed a similar response to a number of other white furry objects, including a rabbit and even a fur coat. By applying knowledge about a familiar object to other similar stimuli, humans and other animals avoid having to learn about each and every stimulus from scratch. However, this stimulus generalization is only effective if it occurs to the correct degree: under- or over-generalization (as shown by Little Albert) can lead to behaviors that are less than optimal.

Nerve cells use molecules called neurotransmitters to communicate with each other. For example, one nerve cell might release a neurotransmitter called dopamine, which can be detected on the surface of another nerve cell by a protein called the dopamine D2 receptor. Research suggests that dopamine is involved in stimulus generalization in rat and pigeons, but the effects of dopamine have not been studied in humans. Kahnt and Tobler have now explored how the degree of stimulus generalization is determined in human adults.

For the experiments, volunteers viewed images that contained sets of parallel lines, while lying inside a “functional MRI” brain scanner. The volunteers learned to associate lines with a particular orientation (e.g. 39° from vertical) with receiving a reward, and lines with another specific orientation with the absence of a reward. The next day, half the volunteers were given a drug that blocks the dopamine D2 receptor, while the other half received a placebo. All volunteers were then asked to classify lines of different orientations as either “rewarded” or “non-rewarded”.

Both groups classified lines that were aligned similarly to the previously rewarded orientation as also rewarded. However, those given the D2 receptor blocker classified a narrower range of line orientations as rewarded than those who received the placebo. In other words, blocking the D2 receptors reduced stimulus generalization. It also reduced activity in a region of the brain called the hippocampus, and the extent to which this activity was connected to activity in another area called the midbrain.

Taken together, these results suggest that activation of dopamine D2 receptors in the hippocampus may determine the extent to which we generalize between stimuli. Given that over- and under-generalization is a feature of psychiatric disorders such as depression and anxiety, manipulating D2 receptor activity could have therapeutic benefits in these patients.

DOI: http://dx.doi.org/10.7554/eLife.12678.002

Introduction

Generalization enables neural systems to apply stimulus-outcome associations that have been acquired for one particular stimulus to other, related stimuli. A key aspect of generalization is the degree to which learned associations are applied to novel stimuli: the width of generalization. Generalization relieves individuals from having to learn outcome predictions for every single stimulus from scratch before using them to guide behavior. However, generalizing too widely can be maladaptive because it leads to the indiscriminate approach (or avoidance) of stimuli that are unlikely to be associated with reward (or punishment). Indeed, aberrant generalization is implicated in several neuropsychiatric diseases including schizophrenia, anxiety disorders, depression, and drug abuse (Dunsmoor and Paz, 2015Gotlib and Joormann, 2010Lissek et al., 2014bLucantonio et al., 2015Moustafa et al., 2010Shohamy et al., 2010). Insights into the neurobiological mechanisms controlling the width of generalization are therefore important for understanding adaptive behavior and its disruption in these conditions.

Two basic forms of generalization can be distinguished based on what constitutes the relation among stimuli; associative and stimulus generalization. In the case of associative generalization, such as transitive inference and acquired equivalence, the associative relationship among stimuli determines similarity. This relationship can be established for example by sensory preconditioning (e.g., train stimuli A-B and B-C, test whether A comes to predict C) or by a common associate (e.g. train A-C and B-C, test whether A and B are associated). In contrast, in stimulus generalization the relationship among stimuli is based on the similarity along one or more perceptual dimensions (frequency of sounds, color, line orientation, etc.). Most of what we know about stimulus generalization comes from behavioral experiments utilizing intradimensional stimulus discrimination (Dunsmoor and LaBar, 2013Ghirlanda and Enquist, 2003Guttman and Kalish, 1956Hanson, 1959). In these paradigms, one stimulus (e.g. one particular line orientation) is paired with reward (rewarded conditioned stimulus, CS+), while a second stimulus (e.g. a slightly different line orientation), which differs from the first in only one dimension, is paired with no reward (unrewarded conditioned stimulus, CS−). Generalization is then tested using a range of stimuli that vary along the defining stimulus dimension (e.g. line orientation). Although the test stimuli have never been paired with reward, animals and humans show robust generalization in that they respond to test stimuli that are similar to the CS+. Several psychological models have been developed based on these experiments (Ghirlanda and Enquist, 1998Pearce, 19871994Shepard, 1987McLaren et al., 2012), but the neurobiological mechanisms regulating the width of stimulus generalization have remained unknown.

Early research suggested a role for dopamine in mediating generalization by demonstrating that blockade of dopamine receptors during generalization tests alters response gradients in rats and pigeons (Lyons et al., 1973a1973bTerrace, 1963). While the effects of dopamine have not been investigated in humans, evidence from neuroimaging suggests that dopaminoceptive and dopaminergic regions such as the striatum and the midbrain are involved in generalization (Kahnt et al., 2012Wimmer et al., 2012Wimmer and Shohamy, 2012). However, because standard neuroimaging relies on indirect measurements of neural responses, these studies were unable to inform questions about neurotransmitter-specific activity.

A candidate brain region for a prominent role in stimulus generalization is the hippocampus, based on its involvement in learning, memory, and associative generalization (Dickerson and Delgado, 2015Eichenbaum, 2000Frank et al., 2006Kumaran, 2012Norman and O'Reilly, 2003Squire and Wixted, 2011). Specifically, the hippocampus is thought to contribute to associative generalization by representing higher-order relationships among different stimuli (Eichenbaum, 1999Howard et al., 2005Kumaran and McClelland, 2012). However, human imaging and animal lesion studies suggest that the hippocampus may also be involved in similarity-based stimulus generalization not requiring inference (Casasola et al., 2007Kahnt et al., 2012Lissek et al., 2014aSolomon and Moore, 1975).

In the current study, we directly examined the role of dopamine and hippocampal processing in regulating the width of stimulus generalization. For this purpose, we combined a visual intradimensional discrimination task (Kahnt et al., 2012) with pharmacologic dopamine D2-receptor (D2R) blockade (Kahnt et al., 2015) and functional magnetic resonance imaging (fMRI). Our central hypothesis was that dopamine is involved in regulating the width of stimulus generalization via the modulation of similarity-based processing in the hippocampus. In order to test this hypothesis, we fit a computational model of stimulus generalization to the behavioral data, and examined the model parameters regulating the width of generalization. We then used model-based fMRI and functional connectivity analyses to identify the brain processes that potentially mediate the effects of dopamine on generalization. We predicted that D2R blockade during the test session would reduce the computational parameters governing the width of generalization, and thus lead to narrower generalization gradients. Moreover, we hypothesized that this reduction would be mirrored in hippocampal activity as well as in reduced functional coupling between the dopaminergic midbrain and the hippocampus.

Results

The experiment was carried out on two consecutive days. On the first day, subjects underwent intradimensional discrimination training to learn stimulus-specific reward associations. On the second day, a generalization test was performed in extinction. All subjects received placebo on the first day, whereas on day two, one group received placebo (PP group) and the other group received 400 mg of the D2R blocker amisulpride (PA group) (Figure 1A). By verifying that the two groups learned equally well on day one, we ensured that any observed group differences in behavioral or neural responses during day two resulted from effects of dopamine on generalization at retrieval, independent of encoding-related effects. In the following sections we first describe the behavioral data during the training, and then examine how D2R blockade affected the neurocomputational mechanisms governing generalization using fMRI and a model of stimulus generalization.

Figure 1. Experimental design and stimuli.

Figure 1.

(A) Subjects were pseudo-randomly assigned into a placebo-placebo (PP) or placebo-amisulpride (PA) group. Subjects in the PP group received placebo on both days, whereas subjects in the PA group received placebo before the discrimination training and amisulpride before the generalization test. (B) Discrimination training on day I. During each trial, one orientation (CS+ or CS–) was shown for 600 ms. Subjects had to indicate whether the current orientation may be rewarded (+) or not rewarded (–) using a button press. Outcomes were delivered independently of subjects’ response. After the response, the outcome was presented (20 or 0 cents). (C) Generalization test on day II. On each trial, subjects were presented with one of 15 different test orientations and indicated whether the current orientation was the one previously associated with reward (+), no reward (–) or neither (×). The mapping between buttons and +/–/× was randomized on each trial and the generalization test was performed in extinction. (D) Stimuli used during the training and test session. Associations between stimuli and outcomes were counterbalanced across subjects.

DOI: http://dx.doi.org/10.7554/eLife.12678.003

Robust acquisition of stimulus-outcome associations during discrimination training

The visual intradimensional discrimination task used the orientation of a Gabor patch as reward-relevant dimension (Figure 1B). Specifically, one orientation (39°) served as the CS+ and was paired with the delivery of 20 cents in 50% of trials, whereas a second orientation (51°) served as the CS− and was paired with no reward in all trials (Figure 1D). The association between orientation and reward was counterbalanced across subjects. In order to track the acquisition of stimulus-outcome associations, subjects were asked to make a discriminatory response after the stimulus was displayed, but before the outcome was shown. Subjects had to indicate whether the current stimulus was the rewarded stimulus, the non-rewarded stimulus, or whether they were unsure, by pressing buttons associated with +, -, or x, respectively. Subjects learned the discrimination within the first 50 trials and performed at high levels afterwards (Figure 2A). Performance increased as a function of time (two-way, group-by-time ANOVA, main effect of time, F(39,1716) = 18.98, P < 0.001), but did not differ between groups (main effect of group, F(1,44) = 0.41, P = 0.53; group-by-time interaction, F(39,1716) = 1.27, P = 0.13; two-way, group-by-CS type ANOVA, main effect of group, F(1,44) = 0.35, P = 0.56; group-by-CS type interaction, F(1,44) = 0.39, P = 0.53, Figure 2B). Moreover, learning-related activity in the ventral striatum (Delgado, 2007) did not differ between groups (see Figure 3), suggesting that both groups were also comparable in terms of neural responses during discrimination training. This demonstrates that, as expected, both groups acquired stimulus-outcome associations and performed at comparable levels during training. Accordingly, effects of D2R blockade during the generalization test session on the next day can be compared independent of potential group differences in discrimination training.

Figure 2. Behavioral responses during discrimination training and generalization test.

(A) Percentage of correct responses during discrimination training across time (bins of 5 trials). Subjects learned the stimulus-outcome associations within the first 50 trials, and maintained performance at a high level afterwards. Given that both groups received placebo during the training session, performance was not expected to, and indeed did not, differ. (B) Percentage of correct responses for CS+ and CS− is plotted for both groups separately. (C) Generalization gradients reflect the probability of a + response as a function of stimulus orientation during the test session. Responses reveal a peak-shift (stronger responding on the side of the CS+ that is opposite to the CS−). Subjects in the amisulpride group (PA, blue), showed a narrower generalization gradient with a higher peak compared to subjects receiving placebo (PP, black). (D) Generalization gradients of a similarity-based generalization model with parameters estimated from subjects’ behavioral responses in both groups, separately. The model accurately reproduces the empirical generalization gradients and the differences between the groups. Error bars are SEM for N=25 (PA) and N=21 (PP).

DOI: http://dx.doi.org/10.7554/eLife.12678.004

Figure 2.

Figure 2—figure supplement 1. Comparison of Gaussian and exponential similarity functions.

Figure 2—figure supplement 1.

(A) Behavioral responses, (B) P(+) responses derived from models with a Gaussian and (C) exponential similarity functions.
The model with the Gaussian similarity function better captures the behavior and the effect of the dopaminergic manipulation than the model with the exponential similarity function.

Figure 3. Prediction error responses in the ventral striatum during discrimination training.

Figure 3.

(A) Regions in the ventral striatum (VS, left, x = -15, y = 8, z = -16, t = 6.97, P < 0.001, FWE whole brain corrected; right, x = 6, y = 14, z = -10, t = 6.22, P = 0.003, FWE whole brain corrected) in which activity is correlated with model-derived prediction errors (PE) during the training session across both groups. T-map from one-sample t-test (across the two groups) is thresholded at P < 0.05, FWE whole brain corrected and overlaid on a T1-weigthed image averaged across subjects. (B) Bar plots depict parameter estimates for PE-related activity in the VS. Given that both groups received placebo during the training session, neural PE-responses were not expected to differ, and indeed did not differ between groups (two-sample t-test, t = 0.58, P = 0.56). Error bars are SEM for N = 25 (PA) and N = 21 (PP).

DOI: http://dx.doi.org/10.7554/eLife.12678.006

D2R blockade during test narrows generalization gradients

On the second day, subjects in the PP group received placebo, whereas subjects in the PA group received 400 mg of the D2R antagonist amisulpride (Rosenzweig et al., 2002). One hour later, subjects performed a generalization test session in extinction (without feedback). Specifically, on each trial, one of 15 orientations (17°–73°, Figure 1D) was presented, and subjects performed the same discrimination as during training (Figure 1C; the original CS+ and CS− orientations were not shown during the test). Responses described a bell-shaped gradient around the CS+ and revealed a peak shift (Derenne, 2010Purtle, 1973Wisniewski et al., 2009), such that subjects responded most frequently to an orientation that was never paired with reward (Figure 2C). Specifically, average responding was stronger to orientations on the side of the CS+ that was farther away from the CS− (paired t-test on responding to stimuli left vs. right of the CS+, t = 2.99, P = 0.006). Such peak shifts are typical for intradimensional discrimination with one CS+ and one CS−; they are thought to result from the summation of excitatory and inhibitory gradients around the CS+ and CS−, respectively, and have been observed across many species and stimuli (Ghirlanda and Enquist, 2003Pearce et al., 2008Spence, 1937).

Direct group comparisons of the individual data points along the generalization gradient did not reveal any significant differences (two-sample t-tests, all Ps > 0.29). However, visual inspection of the gradients suggested that the amisulpride group had a narrower gradient than the placebo group, with enhanced responding at the peak of the curve, reduced responding at both flanks, and enhanced responding at the tail of the curve. These shape features are parsimoniously described by the 4th moment of probability distributions, namely, their kurtosis. Accordingly, a test for differences in the kurtosis of group-specific distributions (Pearson type VII distribution, see Materials and methods) revealed a significantly greater kurtosis in the amisulpride group compared to placebo (PA: 6.73, PP: 3.29; permutation test, P = 0.043). This finding demonstrates that amisulpride narrowed the width and increased the peak of the behavioral generalization gradient, and suggests that D2R activity alters the neurocomputational processes that mechanistically control generalization behavior.

To further address this possibility, and to identify the specific computational parameters that are affected by D2R blockade, we utilized a mathematical model for similarity-based stimulus generalization (see Materials and methods and Figure 4). The model assumes that the reward prediction of a given stimulus reflects the integrated excitatory and inhibitory associations of that stimulus, plus the excitatory and inhibitory associations of stimuli that are similar to it (Pearce, 1987). Critically, associations of stimuli that are similar to the currently presented stimulus have a stronger contribution than the associations of dissimilar stimuli. Because the shape of the function determining the similarity between the currently presented stimulus and other stimuli (i.e. the generalization coefficient) is of critical importance (Ghirlanda and Enquist, 2003), we directly compared the most commonly used models, i.e. one with Gaussian (Kahnt et al., 2012) and the other with exponential similarity functions (Shepard, 1987). While the exact shape of the similarity functions differs between models, for both models, the extent to which inhibitory and excitatory associations generalize to the current stimulus is controlled by the width of the similarity functions (sand se) (Figure 4B). The larger sand se, the stronger the impact of the inhibitory and excitatory associations of dissimilar stimuli on the currently predicted reward, respectively, and thus the more generalization takes place.

Figure 4. A computational model of similarity-based generalization.

Figure 4.

(A) Schematic of the model. When a stimulus orientation xk is presented, the predicted reward for this orientation Vk is computed by integrating the excitatory and inhibitory associations E and I of all stimuli j that are similar to k, (including its own associations, j = k), weighted by the similarity between stimuli j and k. The similarity between j and k is determined by the excitatory and inhibitory generalization coefficients eSjk and iSjk, respectively, which are assumed to be Gaussian (or exponential, not shown here). The width of the excitatory and inhibitory generalization coefficients, and thus the degree to which excitatory and inhibitory associations generalize from j to k is determined by the parameters se and si. The reward prediction Vk is used to generate approach behavior P(+) and to compute a reward prediction error δ, which in turn updates the excitatory and inhibitory associations of k. (B) Illustration of the effects of changes in the width of excitatory and inhibitory generalization coefficients on generalization gradients.

DOI: http://dx.doi.org/10.7554/eLife.12678.007

In order to assess the explanatory power of the Gaussian and exponential similarity functions, we directly compared their fit to the behavioral data. The free parameters of both models (width of inhibitory generalization coefficient, si; width of excitatory generalization coefficient, se; slope, β; offset, a; learning rate, α) were estimated for the entire group of subjects by maximizing the likelihood of subjects’ responses during the generalization test given the model (see Materials and methods). Visual inspections suggested that the model with the Gaussian similarity function fitted the behavioral data better than the model with the exponential similarity function (Figure 2—figure supplement 1). This was confirmed by a formal model comparison using the Akaike information criterion (AIC) and Bayesian information criterion (BIC) (Gaussian: AIC = 9586.4, BIC = 9629.2; Exponential: AIC = 9691.6, BIC = 9734.4). We also compared the fit of the two models by comparing the regression coefficients from a logistic regression of the trial-by-trial responses on the modeled P(+) responses. Although both models predicted behavioral responses reliably (Gaussian: t = 11.06, P < 0.001; exponential: t = 11.38, P < 0.001), we found significantly higher regression coefficients for the Gaussian model (paired t-test t = 5.34, P < 0.001). Taken together, this demonstrates that in our experiment, a Gaussian similarity function fits behavior better than an exponential similarity function.

To determine the effects of dopamine on generalization, in a next step, the free parameters of the Gaussian model were estimated separately for each group (see Materials and methods). As can be seen in Figure 2D, the model (with group-specific parameters) accurately reproduced responses in both groups, including the differences in the shape of the generalization gradients. Logistic regression coefficients were significantly different from zero (PA: t = 7.76, P < 0.001; PP: t = 7.95, P < 0.001), and did not differ between groups (t = -0.21, P = 0.84), suggesting that responses in both groups were well described by the model. Notably, comparing the model parameters between groups (Table 1), revealed significant group differences in the width of the excitatory generalization coefficient, with a smaller coefficient in the amisulpride group compared to the placebo group (permutation test, P = 0.035). The width of the inhibitory coefficient was also smaller in the amisulpride group, but this difference was not significant (P = 0.069). Importantly, the learning rate during test did not differ between groups (P = 0.21), demonstrating that additional learning during the test session, which might have been altered by amisulpride, cannot account for the differences in generalization gradients. In order to obtain individual estimates of model parameters, we re-estimated the model using a leave-one-out procedure (see Materials and methods). Comparing the resulting individual estimates between groups confirmed a significant difference in the width of the excitatory (two-sample t-test, t = -2.03, P = 0.024), and to a lesser degree, the inhibitory generalization coefficient (t = -1.59, P = 0.059, Figure 5). These results suggest that D2R blockade modulates the computational processes that control the width of stimulus generalization, resulting in narrower generalization.

Table 1.

Group-specific model parameters.

DOI: http://dx.doi.org/10.7554/eLife.12678.009

Group Inhibitory coefficient si Excitatory coefficient se Slope
β
Offset
a
Learning rate
αtest
PA group N=25 20.121 17.599 3.093 0.370 0.002
PP group N=21 30.587 24.584 3.124 0.361 0.006
Difference PA-PP -10.465 -6.985 -0.031 0.009 -0.004
*P-value (PA-PP) 0.069 0.035 0.495 0.462 0.214

Note: *P-value is based on 10.000 permutations.

Figure 5. Estimates of individual differences in model parameters.

Figure 5.

(A-–E) Each bar plot depicts the average difference between model parameters estimated based on the whole group (N) and a reduced group when one subject is left out (N-1). Values are averaged according to the group of the left out subject (i.e. PA or PP). Positive values indicate that removing subjects from the group decreased the parameter estimates in the N-1 group. Amisulpride reduced parameters controlling the width of generalization (width of inhibitory and excitatory generalization coefficients, A, B) but had no effect on the slope (C) and offset of responding (D), as well as the learning rate (E). P-values are based on two-sample t-tests. Error bars are SEM for N = 25 (PA) and N = 21 (PP).

DOI: http://dx.doi.org/10.7554/eLife.12678.008

In principle, the observed effects of amisulpride on the width of generalization could have resulted from a drug-induced improvement in perceptual orientation-discrimination. To control for such perceptual effects, subjects performed a challenging orientation discrimination task, once before the drug took effect and once after (see Materials and methods and Figure 6). Performance (percentage correct) on this task did not differ between groups (two-way, time-by-group ANOVA, main effect of group, F(1,43) = 0.99, P = 0.33) and there was no group-by-time interaction (F(1,43) = 0.42, P = 0.52). Moreover, while perceptual discrimination performance was reliable across time (correlation between pre- and post-drug performance, r = 0.52, P < 0.001), it was not related to the width of the estimated generalization coefficients (all Ps > 0.38). Taken together, this control analysis demonstrates that the effects of amisulpride on generalization cannot be explained by perceptual improvements in orientation discrimination per se.

Figure 6. Amisulpride does not enhance perceptual discrimination performance.

Figure 6.

(A) Design and timing of the perceptual orientation discrimination task. Stimulus I was always 135°, whereas stimulus II was tilted by ± 0.2°, ± 0.4°, ± 0.9°, ± 1.9°, or ± 4°. Subjects had to indicate whether stimulus II is tilted counter-clockwise or clockwise relative to stimulus I. (B) Performance differed as a function of the difference in orientation between stimulus I and stimulus II (3-way ANOVA, F(4,172) = 74.46, P < 0.001), but not between groups (F(1,43) = 0.01, P = 0.93) or time (pre vs. post drug, F(1,43) = 2.73, P = 0.11). Importantly, there was no significant group-by-time interaction (F(1,43) = 0.07, P = 0.80). (C) Same data as in B, but collapsed across levels of orientation difference. Again, performance did not differ between groups (two-way ANOVA, F(1,43) = 0.99, P = 0.33), time (F(1,43) = 0.22, P = 0.65), and there was no significant group-by-time interaction (F(1,43) = 0.42, P = 0.52). Error bars are SEM for N = 24 (PA) and N = 21 (PP).

DOI: http://dx.doi.org/10.7554/eLife.12678.010

D2R blockade reduces similarity-based activity in the hippocampus

Having established an effect of D2R blockade on the computational processes that govern stimulus generalization, we next examined the neural circuits that mediate these changes. We first identified brain regions involved in generalization of reward predictions during retrieval. As a proxy of generalized value, we focused on prediction error responses derived from our model, which reflect the extent to which reward predictions have generalized from the original CS+ and CS− to the current stimulus (please note that because no outcomes were shown, prediction errors are perfectly but negatively correlated with expected value). Accordingly, to identify brain regions involved in similarity-based computations during generalization, we searched for regions in which fMRI activity correlated with generalized prediction errors at the time of the expected outcome. Based on previous empirical and modeling work linking hippocampal activity to the representation of relationships between stimuli and their predicted value (Kumaran et al., 2009Kumaran and McClelland, 2012Lee et al., 2012Lissek et al., 2014aWimmer and Shohamy, 2012), we expected fMRI signals in the hippocampus to positively correlate with generalized prediction errors. In line with this hypothesis, across the entire group (one sample t-test) we found significant correlations in the bilateral hippocampus (extending into the parahippocampal gyrus, left, x = -30, y = -22, z = -16, t = 6.53, P = 0.001, FWE whole brain corrected; right, x = 33, y = -19, z = -16, t = 7.11, P < 0.001, FWE whole brain corrected, Figure 7A). Similar effects were found in the left amygdala (x = -24, y = -4, z = -19, t = 6.04, P = 0.006, FWE whole brain corrected) and the bilateral middle temporal gyrus (left, x = -48, y = -73, z = 14, t = 6.41, P = 0.002, FWE whole brain corrected; right, x = 28, y = -64, z = 14, t = 5.57, P = 0.022, FWE whole brain corrected). In addition, supporting recent work highlighting a role for medial (mPFC) and ventromedial PFC (vmPFC) in generalization (Dunsmoor and Paz, 2015Onat and Buchel, 2015), at an uncorrected threshold of P < 0.001, we found a cluster in the mPFC (x = -3, y = 56, z = 8, t = 4.03) extending into the vmPFC.

Figure 7. Similarity-based prediction error responses in the hippocampus during generalization.

Figure 7.

(A) Across both groups, activity in the hippocampus is significantly correlated with model-derived prediction errors during generalization test. T-map from one-sample t-test is thresholded at P < 0.05, FWE whole brain corrected and overlaid on a T1-weigthed image averaged across subjects. (B) Prediction error-related activity in the hippocampus is significantly reduced by D2R blockade (two-sample t-test, t = -2.12, P = 0.02). Error bars are SEM for N = 25 (PA) and N = 21 (PP).

DOI: http://dx.doi.org/10.7554/eLife.12678.011

We next tested whether reduced behavioral generalization observed in the amisulpride group was paralleled by a decrease in generalization-related activity in the hippocampus. In line with this idea, we found significantly reduced activity in the hippocampus in the amisulpride relative to the placebo group (two-sample t-test, t = -2.12, P = 0.02, Figure 7B). To examine whether these effects of dopamine on generalization-related activity are specific to the hippocampus, as a control, we tested for similar group differences in the amygdala, middle temporal cortex and mPFC. No significant group differences were observed in the amygdala (P = 0.43), the middle temporal gyrus (P = 0.31), or the medial PFC (P = 0.98). However, post-hoc analyses directly comparing the effect of the drug in the hippocampus to the drug effect in the other regions (i.e. group-by-region interactions), demonstrated that while the effect of D2R blockade in the hippocampus was significantly stronger than in the mPFC (P = 0.02) similar interactions involving the amygdala (P = 0.097) and the middle temporal gyrus (P = 0.127) did not reach significance. These data suggest specificity of the effects of D2R blockade on similarity-based processing in the hippocampus relative to the mPFC, but not necessarily relative to the amygdala and middle temporal lobe.

Midbrain-hippocampal connectivity correlates with the width of generalization

In a next step, we examined the neural pathways on which DR2 blockade may mediate its effects on generalization. Given the anatomical origin of the dopaminergic projections to the hippocampus (Swanson, 1982), the relevance of hippocampal D2R for memory functions (Takahashi et al., 2008), and the modulation of hippocampal processing reported above, we hypothesized that amisulpride would reduce the functional connectivity between the midbrain and the hippocampus. In line with this prediction, a functional connectivity analysis with the midbrain as a seed region (Figure 8A, see Materials and methods) revealed decreased midbrain connectivity in the right hippocampus (x = 33, y = -19, z = -19, t = 3.68, P = 0.02, FWE small volume corrected, Figure 8B,C) and the left striatum (x = -9, y = 8, z = -19, t = 4.26, P = 0.001, FWE small volume corrected), in the amisulpride compared to the placebo group. This finding suggests that D2R blockade may modulate the functional connectivity between the midbrain and dopaminergic target regions such as the hippocampus and the striatum. To examine the specificity of these findings, we tested for similar drug-related effects on functional connectivity in the regions involved in similarity-based processing defined above. While connectivity estimates differed significantly between groups in middle temporal gyrus (P = 0.01), no drug effects were observed in the amygdala (P = 0.296) and the mPFC (P = 0.17). Accordingly, for all regions except the middle temporal gyrus (P = 0.13), the corresponding drug-by-region interactions were significant (all Ps < 0.05), suggesting that amisulpride-related decreases in midbrain connectivity are relatively specific to the hippocampus.

Figure 8. Amisulpride modulates midbrain-hippocampal connectivity.

Figure 8.

(A) Anatomical seed region in the midbrain is overlaid on a T2-weigthed image averaged across subjects. (B) Functional connectivity between the midbrain and the hippocampus during generalization test is significantly reduced in the amisulpride group (P < 0.05, FWE small volume corrected). For illustration, t-map from two-sample t-test (PP > PA) is thresholded at P < 0.001, uncorrected and overlaid on a T1-weigthed image averaged across subjects. (C) For illustration, bar plot depicting average midbrain-hippocampus connectivity in both groups. (D) Strength of midbrain-hippocampal connectivity is positively correlated with the width of the model-estimated inhibitory generalization coefficient (r = 0.47, P = 0.01) in the amisulpride group. Error bars are SEM for N = 25 (PA) and N = 21 (PP).

DOI: http://dx.doi.org/10.7554/eLife.12678.012

Finally, we examined whether the relationship between D2R blockade and the width of generalization is associated with midbrain-hippocampal connectivity. Specifically, we tested the correlation between midbrain-hippocampal connectivity and the estimated width of generalization coefficients in the amisulpride group. This correlation was significant for the inhibitory generalization coefficient (r = 0.47, P = 0.01, Figure 8D), but not for the excitatory coefficient (r = 0.18, P = 0.19). We replicated these findings in our previous data set (Kahnt et al., 2012), showing that midbrain-hippocampal connectivity was significantly correlated with the inhibitory generalization coefficient (r = 0.44, P = 0.036), but not with the excitatory coefficient (r = 0.15, P = 0.49). Interestingly, this relationship was not observed for midbrain-striatal connections (P > 0.16), and was significantly stronger for midbrain-hippocampal compared to midbrain-striatal connectivity (Z = 2.31, P = 0.011), suggesting that the relation between dopamine and the width of generalization may arise primarily from midbrain-hippocampal connections. In summary, these findings suggest a link between dopamine-mediated midbrain-hippocampal coupling and the width of stimulus generalization in humans.

Discussion

The degree to which individuals generalize outcome predictions across similar stimuli is important for adaptive behavior. Here, using D2R pharmacology, fMRI, and computational modeling, we demonstrate that D2R blockade results in narrower behavioral generalization gradients and changes in the computational parameters controlling the width of generalization. Moreover, D2R blockade altered similarity-based processing in the hippocampus and decreased the functional coupling between the midbrain and the hippocampus. This coupling was in turn related to the computational parameter controlling the width of generalization.

Previous empirical and modeling work suggests that the hippocampus contributes to generalization by detecting the relationship between items in memory (Eichenbaum, 1999Howard et al., 2005Kumaran and McClelland, 2012). However, this function was thought to only apply to associative forms of generalization involving higher-order relationships, whereas basic stimulus generalization involving perceptual similarity is suggested to be hippocampus-independent (Kumaran, 2012). Opposing this view, here we provide evidence that the hippocampus is involved in stimulus generalization. Accordingly, our findings suggest that hippocampal similarity computations are not restricted to detecting higher-order relationships among stimuli as previously thought, but can also exploit the perceptual similarity between stimuli to establish meaningful relationships. While the hippocampus may not be necessary for stimulus generalization per se, our data suggest that it facilitates generalization by allowing a flexible modulation of its width. Specifically, such flexibility could not be achieved if discrimination and generalization were entirely based on static, hippocampus-independent, stimulus-outcome associations.

Our results contribute to an ongoing debate regarding the time point at which hippocampal processes support generalization. Two alternative accounts suggest that the relationship among stimuli is established by the hippocampus either during encoding via overlapping neural codes (Eichenbaum, 1999Howard et al., 2005), or at the time of retrieval by means of recurrent similarity computations that are based on separated neural codes (Kumaran and McClelland, 2012). The former proposal has received empirical support from studies showing that hippocampal activity during encoding is related to transfer performance at test (Shohamy and Wagner, 2008Wimmer and Shohamy, 2012). In contrast, in line with the second account, our model assumes that reward predictions are generalized and integrated during test (Pearce, 1987). It should be noted though that this model is mathematically equivalent to a model in which generalization occurs at encoding (Kahnt et al., 2012), and thus, the model alone does not provide evidence for either account. However, only if generalization occurs at retrieval can the width of generalization be modulated after encoding has occurred, and thus, our results are only compatible with a retrieval-based account of generalization. Specifically, because dopamine receptor blockade modulated generalization gradients during test, without affecting additional learning, our pharmacological manipulation provides evidence that stimulus generalization occurs – at least in part – at retrieval.

Our data suggest that the functional connectivity between the dopaminergic midbrain and the hippocampus is related to the width of generalization. Specifically, we found that participants with D2R blockade showed decreased midbrain-hippocampal connectivity, which in turn correlated with the width of generalization. While the presence of dopamine receptors in the hippocampus is undisputed, it is worth keeping in mind that although direct dopaminergic innervation from the midbrain to the hippocampus is present (Gasbarri et al., 1994) and functionally relevant for memory stabilization (McNamara et al., 2014), this pathway is not very strong (Mingote et al., 2015Swanson, 1982). Moreover, it is conceivable that some of the hippocampal dopamine is co-released from noradrenergic neurons, in which it may not have been completely metabolized.

As such, our data is in line with the idea that reduced activity of hippocampal D2R decreases the extent to which reward associations generalize across stimuli. By extension, enhanced dopaminergic activity in the hippocampus may increase the width of generalization. More specifically, dopamine release in the hippocampus could increase the likelihood that ensemble patterns representing similar stimuli are activated, which would effectively lower the threshold for similarity detection, and, by broadening the range of stimuli for which reward predictions are taken into account, facilitate generalization. Conversely, reduced levels of dopamine would decrease the likelihood of ensemble pattern activation, increase the threshold for similarity detection, and, by enhancing fine-tuned discrimination, reduce generalization. This provides a simple neurobiological mechanism by which dopamine may flexibly adjust generalization during retrieval. Such flexibility is highly adaptive as it allows for different levels of generalization based on the state of the organism or the environment. We speculate that tonic dopamine levels, similar to their enabling effects on movements (Schultz, 2007), play an enabling role in generalization. For instance, in situations where dopamine transmission is high, such as in novel environments (Ihalainen et al., 1999Lisman and Grace, 2005), elevated levels of dopamine may not only support memory formation (Shohamy and Adcock, 2010), but also broaden generalization and thereby facilitate exploratory behavior.

Of note, although D2R blockade reduced generalization coefficients for both excitatory and inhibitory associations (albeit less reliably), midbrain-hippocampal connectivity was only correlated with the width of the inhibitory coefficient. This raises the possibility that only the effects of D2R blockade on the inhibitory coefficient are mediated via a modulation of midbrain-hippocampal connectivity, whereas the effects on the excitatory coefficient are mediated via a different, yet to be explored, mechanism. The dissociation between inhibitory and excitatory generalization coefficients is in line with the interpretation of previous results suggesting that administration of chlorpromazine, a nonspecific dopamine receptor antagonist, specifically reduces the strength of inhibitory associations in a dose-dependent manner (Lyons et al., 1973bTerrace, 1963). Moreover, it adds to previous findings indicating that generalization involving appetitive and aversive outcomes may involve different mechanisms (Schechtman et al., 2010).

By revealing the effects of dopamine on generalization, our current results substantially extend those of our previous study (Kahnt et al., 2012). Specifically, the current experiment suggests an association between dopamine and the width of generalization in the hippocampus, and dissociates effects of generalization during retrieval vs. encoding, which could not be achieved with the previous design. However, whereas here we find that prediction errors correlate primarily with activity in the hippocampus, the previous study identified prediction error related activity primarily in the ventral striatum. Also, in the previous study functional connectivity between the striatum and the hippocampus was related to the modeled generalization coefficients, whereas our current results suggest a facilitating role of midbrain-hippocampal connections. Several notable differences in the design of both experiments might explain these discrepancies. Most importantly, the previous design consisted of multiple alternating training and testing blocks, all conducted within one session on a single day, whereas in the current study, training and test did not alternate but were conducted in distinct sessions that were separated by a 24 hr delay. Moreover, in the 2012 study, CS-outcome associations were deterministic (100% contingency), whereas the current experiment involved a 50% reinforcement schedule in order to slow down extinction during test. These differences could have shifted the primary focus of generalization-related processing from the striatum to the hippocampus.

In conclusion, here we propose a neurobiological mechanism for the control of stimulus generalization, in which midbrain dopamine changes similarity computations in the hippocampus, resulting in altered generalization gradients. As such, our results demonstrate that the width of stimulus generalization is not hard-wired but flexible, and can change under pharmacological interventions. Accordingly, our results have important clinical implications for a number of neuropsychiatric disorders in which generalization is disrupted. Specifically, aberrant generalization is implicated in depression, anxiety, and schizophrenia (Buss and Daniell, 1967Gotlib and Joormann, 2010Lissek et al., 2014bShohamy et al., 2010), and our findings indicate that blocking D2R activity may provide a potential treatment of overgeneralization in these disorders.

Materials and methods

Subjects

Subjects were assigned to one of three groups in a double-blind and pseudo-random fashion: amisulpride-placebo (AP), placebo-amisulpride (PA), and placebo-placebo (PP). To avoid confounds related to the effects of dopamine on neural processing during discrimination training, only subjects in the PA and PP group are considered in this manuscript. A total of seven subjects was excluded because they either failed to acquire stimulus-outcome associations during the discrimination training on day one (performance <60%, 4 subjects) or because they failed to follow instructions on day two (3 subjects). Subjects in both groups received placebo on the first day of the experiment, whereas on the second day subjects in the PP and PA group received placebo and 400 mg of the D2R blocker amisulpride, respectively (Figure 1A). All subjects were healthy and had normal or corrected-to-normal vision. Groups did not differ significantly in number (PA: N = 25, PP: N=21, chi-square = 0.348, P = 0.56), average age, (PA: 22.72 years ± 2.17 SD, PP: 22.19 years ± 1.83 SD; t = 0.88, P = 0.38), and average weight (PA: 75.04 kg ± 7.89 SD, PP: 75.48 kg ± 10.92 SD; t = -0.03, P = 0.98). Moreover, subjects were not aware of whether they received placebo or amisulpride on both days as assessed by a post experimental questionnaire (day I, chi-square = 0.49, P = 0.48, day II: chi-square = 0.29, P = 0.60). The study was approved by the Cantonal Ethics Review Board of Zurich, and subjects provided informed consent to participate.

Experimental design and stimuli

On the first day, subjects were briefed about the details of the experiment, signed the consent form, and were administered a pill that was swallowed in front of the experimenter. To minimize and equalize absorption time across subjects, subjects were asked to not eat 6 hr before the experiment.

One hour after taking the pill, subjects entered the MRI scanner to perform an intradimensional discrimination task. During the task, subjects learned the association between oriented Gabor patches (CS+ and CS−) and reward or no reward (0 and 20 cents, respectively). In each trial, subjects were presented with an oriented Gabor patch for 600 ms (Figure 1B). Immediately after the stimulus, subjects had to indicate whether the currently displayed stimulus may lead to reward (+) or no reward (-), or whether they did not know (x) by pressing a button with the index, middle, or ring finger of their right hand, corresponding to the signs (+/-/x) on a response mapping screen. The mapping between buttons (fingers) and +/-/x was randomized in each trial to dissociate signals related to motor preparation and execution from reward predictions and prediction errors. When subjects pressed a button, the brightness of the signs on the screen slightly decreased to indicate that a response has been made. The screen disappeared after 1500 ms (maximum decision time) and was replaced by an outcome screen (1000 ms) indicating the amount of money they received (20 or 0 cents). When subjects failed to respond within 1500 ms, “‘too slow”’ was presented instead of the outcome. The CS+ was paired with reward and no reward in 50% of the trials, whereas the CS− was always paired with no reward. The outcome was independent of the correctness of the behavioral response, and the association between stimulus orientation (39° and 51°) and reward was counterbalanced across subjects. The training phase consisted of 100 repetitions of CS+ and CS− trials, in pseudorandom order. Trials were separated by a variable interval ranging from 1.9 to 9.9 s (1.9 s fix, plus a variable interval drawn from an exponential distribution, truncated at 8 s).

On the second day of the experiment, subjects performed the generalization test in extinction, one hour after taking the pill containing either placebo (PP) or amisulpride (PA). In each trial, subjects saw one of 15 orientations (17°, 21°, 25°, 29°, 33°, 37°,41°, 45°, 49°, 53°, 57°, 61°, 65°, 69°, and 73°; Figure 1D) for 600 ms (Figure 1C). The original CS+ and CS− were not shown during the test. Each orientation was presented 14 times in pseudorandom order resulting in a total of 210 trials. Directly after presentation of the stimulus, subjects had to make the same discrimination response as during training (see above). Importantly, the test was performed in extinction, i.e. no outcomes were shown for all orientations. This design ensured that subjects made motor responses to all stimuli and thus allowed us to observe reward prediction error responses to all orientations independent of potential confounds attributable to reward feedback, different visual stimulation, and different cognitive or motor demands. Trials were separated by a variable interval ranging from 2.9 to 10.9 s (2.9 s fix, plus a variable interval drawn from an exponential distribution, truncated at 8 s).

Orientation discrimination performance

In order to control for potential effects of D2R blockade on perceptual performance per se, subjects performed an orientation discrimination task. In each trial, two oriented Gabor patches were presented for 200 ms each, separated by a blank screen of 300 ms. The first stimulus had an orientation of 135° (i.e.orthogonal to the stimuli used in the main experiment), whereas the second stimulus was tilted -4°, -1.9°, -0.9°, -0.4°, -0.2°, 0.2°, 0.4°, 0.9°, 1.9°, or 4° relative to the first stimulus. Thus, the task consisted of 10 trial types, reflecting 5 levels of difficulty (i.e. absolute difference between first and second orientation). Subjects had to indicate as fast and accurately as possible whether the second orientation was tilted counterclockwise or clockwise relative to the first orientation by pressing a button. No feedback was provided to minimize feedback-based perceptual learning (Kahnt et al., 2011). Each trial type was repeated 7 times, resulting in a total of 70 trials. Discrimination performance was computed by averaging accuracy across trials and difficulty levels. The task was administered twice on each day: once immediately after administration of the pill to obtain a baseline measure in the absence of drug effects, and once after the scanning session (2 hr after drug/placebo administration) to measure discrimination performance under the influence of the drug (amisulpride plasma levels have a first peak after ~1 hr (Rosenzweig et al., 2002)). Due to technical problems, discrimination accuracy data from one subject were not saved.

Comparing the kurtosis of behavioral generalization gradients

In order to compare the overall shape of the behavioral generalization gradients, we estimated the 4th moment of the distributions underlying the behavioral gradients (because the behavioral gradients were bounded (17-–73 degrees) and not centered on 45 degrees, direct numerical estimation of the kurtosis was not possible). For this, we fitted a Pearson type VII distribution to the behavioral generalization gradient of each group and numerically computed the kurtosis of the group-specific distributions according to:

kurt(X)=E[(Xμ)4]σ4

The kurtosis was then compared between groups, and statistical inference on the observed group difference was performed using a permutation test.

A computational model of similarity-based generalization

We designed a similarity-based generalization model (Figure 4A), based on previous computational approaches to stimulus generalization (Ghirlanda and Enquist, 1998Kahnt et al., 2012Pearce, 1987). The model assumes that each orientation k holds inhibitory and excitatory associations, Ik and Ek that change with learning. In a given trial t, the net associative strength V (or predicted reward) of the currently presented stimulus k equals the aggregated excitatory and inhibitory associative strengths of all stimuli j that that are generalized to stimulus k (including j = k):

Vt=j Et,j·eSjkIt,j·iSjk

The degree to which associations generalize from stimulus j to k is determined by the inhibitory and excitatory generalization coefficients iSjk and eSjk (Figure 4B), respectively, which vary continuously between 0 and 1 (for j = k). These coefficients can take the form of Gaussians or exponential functions, and their widths (i.e. the width of generalization) are controlled by the parameters sand se, respectively:

iSjk=exp(xjxk)22·si2 and eSjk=exp(xjxk)22·se2

Where xj and xk are the orientations (in degrees) of stimuli j and k, respectively. The corresponding exponential similarity functions are given by:

iSjk=exp|xj-xk|2·si2 and eSjk=exp|xj-xk|2·se2

The excitatory and inhibitory strengths, E and I, are updated on every trial. Specifically, when the outcome R (1 or 0 for reward and no reward, respectively) is experienced, a prediction error δ is generated according to:

δt=RVt

Because the prediction error is based on generalized reward predictions, it directly reflects the extent to which excitatory and inhibitory associations generalize to the currently presented stimulus. The prediction error is used to update the inhibitory and excitatory associative strengths according to:

Et+1,k=Et,k+α·δt  if  δt>0  and  It+1,k=It,kα·δt  if  δt<0

where α is the learning rate. To account for different learning rates during discrimination training with feedback and generalization test without feedback, separate learning rates were allowed during training and test (αtrain, and αtest).

The probability of making an approach response on a given trial P(+)t is given by the net associative strength Vt, passed through a biasing sigmoid function (softmax), which is controlled by its slope β and offset a.

P(+)t=11+expβ·(Vta)

The free parameters of the model (si, se, β, a, αtrain, αtest) were estimated by maximizing the log likelihood estimate (LLE) of subjects’ responses during test given the model

LLE=tlogP(+|θ)t,*

where P(+|θ)t* is the probability of the model for making the same response as the subject in trial t. In order to compare models with Gaussian and exponential similarity functions, we estimated the free parameters of both models by combining the LLE from subjects in both groups, and compared the aggregate LLE from the best fitting parameter sets using AIC and BIC. In order to generate reliable parameter estimates for each group separately, the same fitting procedure was performed for each group (PA and PP) separately by combining LLE across all subjects within a group when evaluating the model. This yielded two sets of estimated model parameters (Table 1). In order to reduce the number of free parameters, and because both groups received placebo during training, the training learning rate (αtrain) was estimated based on data from all subjects, and then fixed to that value when estimating group-wise model parameters.

We tested the statistical significance of the observed differences in the group-wise model parameters using a permutation test. Specifically, we randomly assigned all subjects into one of the two groups, estimated a set of parameters for each group, and computed the difference between the estimated model parameters between groups. This was repeated 10,000 times, resulting in a distribution of group-wise parameter differences that should be expected by chance (i.e. if group assignments were random). This random distribution was then used to generate P-values for the empirically observed group-wise parameter differences.

To obtain an individual difference measure of model parameters, and as an alternative way to perform inference on the model parameters, we used a leave-one-out estimation procedure. For this, we first fitted the model using the data from all but one subject (N-1). The resulting model parameters were then subtracted from the parameters obtained when using all subjects (N). This difference (parameter based on N-1 subjects – parameter based on N subjects) is proportional to the relative contribution of the left-out subject to the entire group. For instance, if the left-out subject has a large “‘true”’ parameter, leaving this subject out when estimating the model will reduce the parameter obtained in the reduced sample (N-1) relative to the parameter obtained from the entire sample (N). In other words, this procedure generates estimates that reflect the individual differences in model parameters.

fMRI data acquisition and preprocessing

Functional imaging was performed on a Philips Achieva 3 T whole-body scanner equipped with an eight-channel head coil. During the training and test sessions, 675 and 711 T2*-weighted whole-brain EPI images with 37 transversal slices were acquired with a repetition time (TR) of 2000 ms. Imaging parameters were as follows: slice thickness, 3 mm; in-plane resolution, 2.75 x 2.75 mm; echo time (TE), 30 ms; flip angle, 90°. For anatomical reference and identification of the dopaminergic midbrain, T1- and T2-weighted high-resolution (1 x 1 x 1 mm) anatomical images were acquired using the following imaging parameters. T1-weigthed: matrix size, 256 x 256; field of view, 256; 181 slices; flip angle, 8°; TR = 8.2 ms; TE = 3.8 ms. T2-weighted: matrix size, 256 x 256; field of view, 256; 181 slices; flip angle, 8°; TR = 2500 ms; TE = 248 ms. Preprocessing of functional images was performed using SPM12 and consisted of slice-time correction, realignment, coregistration of anatomical (T1-weighted) and functional images, spatial normalization to the standard template of the Montreal Neurological Institute (MNI) by estimating normalization parameters based on the T1-weighted image, and spatial smoothing using a Gaussian kernel of 8 mm FWHM.

fMRI data analysis

To identify brain regions in which activity correlates with prediction errors derived from the similarity-based generalization model, we used a general linear model (GLM) with parametric modulators (Buchel et al., 1998) that included the following regressors: (1) onset of actual (training data) or expected (test data) time of outcome (offset of the response mapping screen), (2) a parametric modulator of stimulus orientation (z-standardized), and (3) a parametric modulator of prediction errors derived from the model with group-wise parameters (z-standardized). All regressors were convolved with a canonical hemodynamic response function (HRF) and together with the head movement parameters from the realignment procedure regressed against the BOLD signal in each voxel. Independent GLMs were estimated for the training and test session. Voxel-wise one-sample t-tests were applied to the resulting parameter estimates of the prediction error regressor.

To test for global effects of amisulpride on blood flow, and thereby BOLD response, we tested whether cue-evoked activity in visual cortex differed between groups. For this, we set up a GLM including one regressor for the onset of the visual cue (HRF convolved) and the six head movement parameters. Cue-related activity in an anatomical mask of the calcarine sulcus (AAL) did not differ between groups (t = -0.80, P = 0.42), suggesting that amisulpride did not unspecifically affect the BOLD response. This is in line with previous studies that found no differences in visually evoked activity between amisulpride and placebo (Jocham et al., 2011).

Functional connectivity analysis

We examined dopamine-related differences in the functional connectivity between the midbrain and the hippocampus by using a variant of the psycho-physiological interaction (PPI) model (McLaren et al., 2012). For each subject, the average time course was extracted from voxels in a 50% probabilistic mask of the substantia nigra (Murty et al., 2014), and together with the six head movement parameters regressed against the time course in each voxel. The parameter estimate of the midbrain-seed regressor reflects the correlation between activity in the midbrain and activity in every other voxel in the brain. To identify regions where connectivity differed depending on D2R blockade, the connectivity maps were compared between groups using a two-sample t test.

Statistical analysis

To identify significant voxels in the fMRI analysis (prediction error during test or training), we used one-sample t-tests and a threshold of P < 0.05, FWE whole brain corrected, in combination with a cluster extent threshold of k>10. To test for group differences, parameter estimates were extracted from significant voxels in the striatum (training) and hippocampus (test), and compared between groups using two-sample t-tests at a statistical threshold of P < 0.05. Significant group differences in functional connectivity were identified using a threshold of P < 0.05, FWE small volume corrected for a functional region of interest in the hippocampus that was identified in the independent prediction error contrast during test in the entire group of subjects (P < 0.05, FWE-corrected). A priori comparisons using t-tests and correlations with directed hypotheses are tested one-tailed.

Although amisulpride is generally known to not alter RT (Jocham et al., 2011; Rosenzweig et al., 2002), we found a trend-level group difference in RT (t = 1.83, P = 0.074), with slower responding in amisulpride (mean RT ± SEM = 816 ± 16.6) compared to placebo (mean RT ± SEM = 770 ± 18.5). Importantly, all group comparisons remained significant when RT was included as a covariate in the statistical models, suggesting that non-significant differences in RT did not confound the imaging results.

Acknowledgements

We thank H Haker, T Baumgartner, S Weber, and M Wälti for assistance in subject recruitment and data acquisition, and JD Howard for helpful comments on the manuscript.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Funding Information

This paper was supported by the following grants:

  • Swiss National Science Foundation PP00P1_128574 to Philippe N Tobler.

  • Swiss National Science Foundation PP00P1_150739 to Philippe N Tobler.

  • Swiss National Science Foundation CRSII3_141965 to Philippe N Tobler.

Additional information

Competing interests

The authors declare that no competing interests exist.

Author contributions

TK, Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.

PNT, Conception and design, Drafting or revising the article.

Ethics

Human subjects: The study was approved by the Cantonal Ethics Review Board of Zurich (Ref. Mr. EK: KEK-ZH-Nr. 2012-0568), and subjects provided informed consent to participate.

References

  1. Buss AH, Daniell EF. Stimulus generalization and schizophrenia. Journal of Abnormal Psychology. 1967;72:50–53. doi: 10.1037/h0020082. [DOI] [PubMed] [Google Scholar]
  2. Büchel C, Holmes AP, Rees G, Friston KJ. Characterizing stimulus–response functions using nonlinear regressors in parametric fMRI experiments. NeuroImage. 1998;8:140–148. doi: 10.1006/nimg.1998.0351. [DOI] [PubMed] [Google Scholar]
  3. Casasola C, Mejía-Gervacio S, Cruz-Pérez M, Sánchez-Castillo H, Velázquez-Martínez DN. Participation of the dorsal hippocampus in stimulus discrimination with scopolamine. Brain Research. 2007;1178:125–131. doi: 10.1016/j.brainres.2007.08.003. [DOI] [PubMed] [Google Scholar]
  4. Delgado MR. Reward-related responses in the human striatum. Annals of the New York Academy of Sciences. 2007;1104:70–88. doi: 10.1196/annals.1390.002. [DOI] [PubMed] [Google Scholar]
  5. Derenne A. Shifts in postdiscrimination gradients within a stimulus dimension based on bilateral facial symmetry. Journal of the Experimental Analysis of Behavior. 2010;93:485–494. doi: 10.1901/jeab.2010.93-485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Dickerson KC, Delgado MR. Contributions of the hippocampus to feedback learning. Cognitive, Affective, & Behavioral Neuroscience. 2015;15:861–877. doi: 10.3758/s13415-015-0364-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Dunsmoor JE, LaBar KS. Effects of discrimination training on fear generalization gradients and perceptual classification in humans. Behavioral Neuroscience. 2013;127:350–356. doi: 10.1037/a0031933. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Dunsmoor JE, Paz R. Fear generalization and anxiety: behavioral and neural mechanisms. Biological Psychiatry. 2015;78:336–343. doi: 10.1016/j.biopsych.2015.04.010. [DOI] [PubMed] [Google Scholar]
  9. Eichenbaum H. The hippocampus and mechanisms of declarative memory. Behavioural Brain Research. 1999;103:123–133. doi: 10.1016/S0166-4328(99)00044-3. [DOI] [PubMed] [Google Scholar]
  10. Eichenbaum H. A cortical-hippocampal system for declarative memory. Nature Reviews Neuroscience. 2000;1:41–50. doi: 10.1038/35036213. [DOI] [PubMed] [Google Scholar]
  11. Frank MJ, O'Reilly RC, Curran T. When memory fails, intuition reigns: midazolam enhances implicit inference in humans. Psychological Science. 2006;17:700–707. doi: 10.1111/j.1467-9280.2006.01769.x. [DOI] [PubMed] [Google Scholar]
  12. Gasbarri A, Verney C, Innocenzi R, Campana E, Pacitti C. Mesolimbic dopaminergic neurons innervating the hippocampal formation in the rat: a combined retrograde tracing and immunohistochemical study. Brain Research. 1994;668:71–79. doi: 10.1016/0006-8993(94)90512-6. [DOI] [PubMed] [Google Scholar]
  13. Ghirlanda S, Enquist M. Artificial neural networks as models of stimulus control. Animal Behaviour. 1998;56:1383–1389. doi: 10.1006/anbe.1998.0903. [DOI] [PubMed] [Google Scholar]
  14. Ghirlanda S, Enquist M. A century of generalization. Animal Behaviour. 2003;66:15–36. doi: 10.1006/anbe.2003.2174. [DOI] [Google Scholar]
  15. Gotlib IH, Joormann J. Cognition and depression: current status and future directions. Annual Review of Clinical Psychology. 2010;6:285–312. doi: 10.1146/annurev.clinpsy.121208.131305. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Guttman N, Kalish HI. Discriminability and stimulus generalization. Journal of Experimental Psychology. 1956;51:79–88. doi: 10.1037/h0046219. [DOI] [PubMed] [Google Scholar]
  17. Hanson HM. Effects of discrimination training on stimulus generalization. Journal of Experimental Psychology. 1959;58:321–334. doi: 10.1037/h0042606. [DOI] [PubMed] [Google Scholar]
  18. Howard MW, Fotedar MS, Datey AV, Hasselmo ME. The temporal context model in spatial navigation and relational learning: toward a common explanation of medial temporal lobe function across domains. Psychological Review. 2005;112:75–116. doi: 10.1037/0033-295X.112.1.75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Ihalainen JA, Riekkinen Jr P, Feenstra MGP. Comparison of dopamine and noradrenaline release in mouse prefrontal cortex, striatum and hippocampus using microdialysis. Neuroscience Letters. 1999;277:71–74. doi: 10.1016/S0304-3940(99)00840-X. [DOI] [PubMed] [Google Scholar]
  20. Jocham G, Klein TA, Ullsperger M. Dopamine-mediated reinforcement learning signals in the striatum and ventromedial prefrontal cortex underlie value-based choices. Journal of Neuroscience. 2011;31:1606–1613. doi: 10.1523/JNEUROSCI.3904-10.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Kahnt T, Grueschow M, Speck O, Haynes J-D. Perceptual learning and decision-making in human medial frontal cortex. Neuron. 2011;70:549–559. doi: 10.1016/j.neuron.2011.02.054. [DOI] [PubMed] [Google Scholar]
  22. Kahnt T, Park SQ, Burke CJ, Tobler PN. How glitter relates to gold: similarity-dependent reward prediction errors in the human striatum. Journal of Neuroscience. 2012;32:16521–16529. doi: 10.1523/JNEUROSCI.2383-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Kahnt T, Weber SC, Haker H, Robbins TW, Tobler PN. Dopamine D2-receptor blockade enhances decoding of prefrontal signals in humans. Journal of Neuroscience. 2015;35:4104–4111. doi: 10.1523/JNEUROSCI.4182-14.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Kumaran D, McClelland JL. Generalization through the recurrent interaction of episodic memories: a model of the hippocampal system. Psychological Review. 2012;119:573–616. doi: 10.1037/a0028681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Kumaran D, Summerfield JJ, Hassabis D, Maguire EA. Tracking the emergence of conceptual knowledge during human decision making. Neuron. 2009;63:889–901. doi: 10.1016/j.neuron.2009.07.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kumaran D. What representations and computations underpin the contribution of the hippocampus to generalization and inference? Frontiers in Human Neuroscience. 2012;6 doi: 10.3389/fnhum.2012.00157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Lee H, Ghim J-W, Kim H, Lee D, Jung M. Hippocampal neural correlates for values of experienced events. Journal of Neuroscience. 2012;32:15053–15065. doi: 10.1523/JNEUROSCI.2806-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Lisman JE, Grace AA. The hippocampal-VTA loop: controlling the entry of information into long-term memory. Neuron. 2005;46:703–713. doi: 10.1016/j.neuron.2005.05.002. [DOI] [PubMed] [Google Scholar]
  29. Lissek S, Bradford DE, Alvarez RP, Burton P, Espensen-Sturges T, Reynolds RC, Grillon C. Neural substrates of classically conditioned fear-generalization in humans: a parametric fMRI study. Social Cognitive and Affective Neuroscience. 2014b;9:1134–1142. doi: 10.1093/scan/nst096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Lissek S, Kaczkurkin AN, Rabin S, Geraci M, Pine DS, Grillon C. Generalized anxiety disorder is associated with overgeneralization of classically conditioned fear. Biological Psychiatry. 2014a;75:909–915. doi: 10.1016/j.biopsych.2013.07.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Lucantonio F, Kambhampati S, Haney RZ, Atalayer D, Rowland NE, Shaham Y, Schoenbaum G. Effects of prior cocaine versus morphine or heroin self-administration on extinction learning driven by overexpectation versus omission of reward. Biological Psychiatry. 2015;77:912–920. doi: 10.1016/j.biopsych.2014.11.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Lyons J, Klipec WD, Eirick R. The effect of chlorpromazine on the peak shift in the albino rat. Physiological Psychology. 1973a;1:165–168. doi: 10.3758/BF03326895. [DOI] [Google Scholar]
  33. Lyons J, Klipec WD, Steinsultz G. The effect of chlorpromazine on discrimination performance and the peak shift. Physiological Psychology. 1973b;1:121–124. doi: 10.3758/BF03326882. [DOI] [Google Scholar]
  34. McLaren DG, Ries ML, Xu G, Johnson SC. A generalized form of context-dependent psychophysiological interactions (gPPI): a comparison to standard approaches. NeuroImage. 2012;61:1277–1286. doi: 10.1016/j.neuroimage.2012.03.068. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. McNamara CG, Tejero-Cantero Álvaro, Trouche S, Campo-Urriza N, Dupret D. Dopaminergic neurons promote hippocampal reactivation and spatial memory persistence. Nature Neuroscience. 2014;17:1658–1660. doi: 10.1038/nn.3843. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Mingote S, Chuhma N, Kusnoor SV, Field B, Deutch AY, Rayport S. Functional connectome analysis of dopamine neuron glutamatergic connections in forebrain regions. Journal of Neuroscience. 2015;35:16259–16271. doi: 10.1523/JNEUROSCI.1674-15.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Moustafa AA, Keri S, Herzallah MM, Myers CE, Gluck MA. A neural model of hippocampal–striatal interactions in associative learning and transfer generalization in various neurological and psychiatric patients. Brain and Cognition. 2010;74:132–144. doi: 10.1016/j.bandc.2010.07.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Murty VP, Shermohammed M, Smith DV, Carter RM, Huettel SA, Adcock RA. Resting state networks distinguish human ventral tegmental area from substantia nigra. NeuroImage. 2014;100:580–589. doi: 10.1016/j.neuroimage.2014.06.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Norman KA, O'Reilly RC. Modeling hippocampal and neocortical contributions to recognition memory: a complementary-learning-systems approach. Psychological Review. 2003;110:611–646. doi: 10.1037/0033-295X.110.4.611. [DOI] [PubMed] [Google Scholar]
  40. Onat S, Büchel C. The neuronal basis of fear generalization in humans. Nature Neuroscience. 2015;18:1811–1818. doi: 10.1038/nn.4166. [DOI] [PubMed] [Google Scholar]
  41. Pearce JM, Esber GR, George DN, Haselgrove M. The nature of discrimination learning in pigeons. Learning & Behavior. 2008;36:188–199. doi: 10.3758/LB.36.3.188. [DOI] [PubMed] [Google Scholar]
  42. Pearce JM. A model for stimulus generalization in pavlovian conditioning. Psychological Review. 1987;94:61–73. doi: 10.1037/0033-295X.94.1.61. [DOI] [PubMed] [Google Scholar]
  43. Pearce JM. Similarity and discrimination: a selective review and a connectionist model. Psychological Review. 1994;101:587–607. doi: 10.1037/0033-295X.101.4.587. [DOI] [PubMed] [Google Scholar]
  44. Purtle RB. Peak shift: a review. Psychological Bulletin. 1973;80:408–421. doi: 10.1037/h0035233. [DOI] [Google Scholar]
  45. Rosenzweig P, Canal M, Patat A, Bergougnan L, Zieleniuk I, Bianchetti G. A review of the pharmacokinetics, tolerability and pharmacodynamics of amisulpride in healthy volunteers. Human Psychopharmacology: Clinical and Experimental. 2002;17:1–13. doi: 10.1002/hup.320. [DOI] [PubMed] [Google Scholar]
  46. Schechtman E, Laufer O, Paz R. Negative valence widens generalization of learning. Journal of Neuroscience. 2010;30:10460–10464. doi: 10.1523/JNEUROSCI.2377-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Schultz W. Multiple dopamine functions at different time courses. Annual Review of Neuroscience. 2007;30:259–288. doi: 10.1146/annurev.neuro.28.061604.135722. [DOI] [PubMed] [Google Scholar]
  48. Shepard R. Toward a universal law of generalization for psychological science. Science. 1987;237:1317–1323. doi: 10.1126/science.3629243. [DOI] [PubMed] [Google Scholar]
  49. Shohamy D, Adcock RA. Dopamine and adaptive memory. Trends in Cognitive Sciences. 2010;14:464–472. doi: 10.1016/j.tics.2010.08.002. [DOI] [PubMed] [Google Scholar]
  50. Shohamy D, Mihalakos P, Chin R, Thomas B, Wagner AD, Tamminga C. Learning and generalization in schizophrenia: effects of disease and antipsychotic drug treatment. Biological Psychiatry. 2010;67:926–932. doi: 10.1016/j.biopsych.2009.10.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Shohamy D, Wagner AD. Integrating memories in the human brain: hippocampal-midbrain encoding of overlapping events. Neuron. 2008;60:378–389. doi: 10.1016/j.neuron.2008.09.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Solomon PR, Moore JW. Latent inhibition and stimulus generalization of the classically conditioned nictitating membrane response in rabbits (oryctolagus cuniculus) following hippocampal ablation. Journal of Comparative and Physiological Psychology. 1975;89:1192–1203. doi: 10.1037/h0077183. [DOI] [PubMed] [Google Scholar]
  53. Spence KW. The differential response in animals to stimuli varying within a single dimension. Psychological Review. 1937;44:430–444. doi: 10.1037/h0062885. [DOI] [Google Scholar]
  54. Squire LR, Wixted JT. The cognitive neuroscience of human memory since H.M. Annual Review of Neuroscience. 2011;34:259–288. doi: 10.1146/annurev-neuro-061010-113720. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Swanson LW. The projections of the ventral tegmental area and adjacent regions: a combined fluorescent retrograde tracer and immunofluorescence study in the rat. Brain Research Bulletin. 1982;9:321–353. doi: 10.1016/0361-9230(82)90145-9. [DOI] [PubMed] [Google Scholar]
  56. Takahashi H, Kato M, Takano H, Arakawa R, Okumura M, Otsuka T, Kodaka F, Hayashi M, Okubo Y, Ito H, Suhara T. Differential contributions of prefrontal and hippocampal dopamine D1 and D2 receptors in human cognitive functions. Journal of Neuroscience. 2008;28:12032–12038. doi: 10.1523/JNEUROSCI.3446-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Terrace HS. Errorless discrimination learning in the pigeon: effects of chlorpromazine and imipramine. Science. 1963;140:318–319. doi: 10.1126/science.140.3564.318. [DOI] [PubMed] [Google Scholar]
  58. Wimmer GE, Daw ND, Shohamy D. Generalization of value in reinforcement learning by humans. European Journal of Neuroscience. 2012;35:1092–1104. doi: 10.1111/j.1460-9568.2012.08017.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Wimmer GE, Shohamy D. Preference by association: how memory mechanisms in the hippocampus bias decisions. Science. 2012;338:270–273. doi: 10.1126/science.1223252. [DOI] [PubMed] [Google Scholar]
  60. Wisniewski MG, Church BA, Mercado E. Learning-related shifts in generalization gradients for complex sounds. Learning & Behavior. 2009;37:325–335. doi: 10.3758/LB.37.4.325. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Wisniewski MG, Radell ML, Guillette LM, Sturdy CB, Mercado E. Predicting shifts in generalization gradients with perceptrons. Learning & Behavior. 2012;40:128–144. doi: 10.3758/s13420-011-0050-6. [DOI] [PubMed] [Google Scholar]
eLife. 2016 Feb 2;5:e12678. doi: 10.7554/eLife.12678.013

Decision letter

Editor: Michael J Frank1

eLife posts the editorial decision letter and author response on a selection of the published articles (subject to the approval of the authors). An edited version of the letter sent to the authors after peer review is shown, indicating the substantive concerns or comments; minor concerns are not usually shown. Reviewers have the opportunity to discuss the decision before the letter is sent (see review process). Similarly, the author response typically shows only responses to the major concerns raised by the reviewers.

[Editors’ note: a previous version of this study was rejected after peer review, but the authors submitted for reconsideration. The first decision letter after peer review is shown below.]

Thank you for choosing to send your work entitled "Dopamine controls stimulus generalization in the human hippocampus" for consideration at eLife. Your full submission has been evaluated by Timothy Behrens (Senior Editor), Michael Frank (Reviewing Editor), and two peer reviewers. Based on our discussions and the individual reviews below, we regret to inform you that your work will not be considered further for publication in eLife in its current form.

All reviewers and editors expressed interest in the findings and framework, but questioned the level of advance over and above what your group has published in 2012 with a similar model and relation to hippocampal-striatal connectivity. The main issues that came up in the review discussion (some of which are reiterated by the individual reviewer comments below) are as follows.

– First and foremost, while the dopaminergic effect is of real potential interest, all were concerned that there was no observable effect of the drug on generalization behavior, but only through the lens of the model parameters. In general a model can certainly be helpful to refine behavioral analysis, but (1) it has to be shown to fit the data well before its parameters can be interpreted, (2) it should be compared to alternative potential models, and (3) if both of these are successful the winning model should then guide an analysis of the behavioral data that would reveal the significant effects. Reviewers were also concerned that other (unmodelled) factors (e.g. new learning during testing session affected by dopamine) could be at play which make it less clear that dopamine is specifically changing the hippocampal generalization gradient, and that the current findings weren't well integrated with your previous findings.

– The specificity of the effects was not fully established. Reviewers suggest control analyses that would address whether indeed what was seen in the hippocampus is something that is unique to the hippocampus, or whether it was a general effect of pharmacology in BOLD signal that was not observed elsewhere.

Reviewer #1: This study examined the role of dopamine in modulation of perceptual generalization, combining pharmacology, fMRI and computational modeling.

On the first day participants trained on a visual discrimination task. On the second day they were tested for generalization of the training to new samples varying in similarity to the trained items. The generalization test took place either under drug (D2 receptor blocker) or under placebo, in a between-subject design. The results suggest that both groups generalize similarly, overall. But the application of a computational model of generalization revealed subtle but interesting differences in the width of the generalization gradient, with a narrower gradient in the drug-treated group. This difference in the generalization gradient was associated with differences in BOLD activity in the hippocampus, with the drug group showing a weaker generalization-related BOLD response.

Overall this is a solid and informative study using a convergence of methods to address an interesting question. I have just a few comments:

1) As with any pharmacological study, one must raise concerns about selectivity. How do you know the effects of the drug are not due to global differences in dopamine transmission, that are having an impact on BOLD activity in multiple places in parallel with (but not necessarily related to) the subtle differences in behavior? The study already addresses this somewhat by showing selectivity of effects on functional connectivity between the hippocampus and the midbrain, but not with the striatum. And the findings ruling out mere differences in perception also help address this point. But similar control analyses are also needed for the generalization gradient, to determine whether there is really a selective effect of the drug in the hippocampus, as the authors currently conclude. For example, Paz and colleagues have suggested an important role for the amygdala and the PFC in perceptual generalization. These regions are also targets of midbrain region and it would be useful to know whether or not they show parallel effects.

– Related to the point about possible global differences between drug and placebo, were there any reaction time differences between the groups?

2) The Discussion makes the important distinction between associative and perceptual generalization, but this distinction is muddled in the introduction where the two appear to be conflated. It will help readers if this distinction is clarified earlier on.

Reviewer #2: The study by Kahnt and Tobler investigates the role of dopamine in stimulus generalization using fMRI, computational modeling and dopamine blockade (D2R). Two groups of subjects first learnt which of two oriented gabor patches (CS+ 39 deg, CS- 51) were associated with positive or neutral outcome. Following drug (PA) or placebo (PP) administration they then performed the generalization test where they responded to 15 different orientations not presented during training (between 17-73 deg; no outcome presented) with multiple repetitions of each item. Main findings of interest: i) as in their (2012) and other studies a classic peak shift generalization curve was observed (i.e. away from CS-) ii) whilst there was no significant difference between the groups in the raw data, groups different when a similarity based model of generalization was used (i.e. excitatory generalization coefficient was larger in PP group; n.s. trend for inhib coefficient). Iii) Modelled prediction errors during generalization were observed in the hippocampus, and greater in the PP group. Further there was a reduction in midbrain-hippocampus functional connectivity in the PA group, that was associated with individual differences in modeled inhibitory coefficient. Overall findings are discussed as implicating dopaminergic circuit and hippocampus in flexible stimulus generalization based on neuromodulatory state.

This study addresses a question of importance and the findings are potentially of interest. However, I have some concerns about the interpretation of the findings (see below). In addition I am not sure whether the current paper represents a substantial enough advance above their published 2012 work (where an analogous computational model was used and interactions between the hippocampus-striatum implicated) to warrant publication in eLife. The main novel aspect is the dopaminergic manipulation, but as I see it there are some inconsistencies between the 2012 and this paper which make it difficult to have a coherent picture.

Major concerns:

Given the effect of drug on generalization curve is relatively small (i.e. only detectable by model fitting), it seems important that the authors consider alternative models. If these also showed a significant difference in group parameters this would make the findings appear more robust. In particular, exemplar models (e.g. Nosofsky, 1984) and models of similarity based generalization (e.g Shepard, 1987) typically use an exponential similarity function. Was there a particular reason for opting for a Gaussian function instead, and could the authors please provide results from an analogous model with exponential function for comparison? This would be particularly useful given the possible relationship between exemplar models and the hippocampus.

One important claim of the paper is that dopamine widens the generalization curve (e.g. modeled by increase in excitatory coefficient relating to width of Gaussian). Whilst this makes sense at face value, I wonder whether that there are potentially complicated interactions at play during the generalization test that may (also/instead) be affected by dopamine. In brief, and detailed subsequently: dopamine is known to affect the strength of hippocampal encoding and perhaps this is the reason for changes in the generalization curve. This would be important (apart from changing the mechanistic explanation) because it suggests that the dopamine/hippocampal effect is to some extent paradigm specific: it does not apply to "one-shot" generalization (perhaps more naturalistic), but only under settings where lots of stimuli are presented and generalization required.

To state this in more detail: subjects are learning during the generalization phase: i.e. storing new representations of the previously unseen test items and their predicted values (which are being constantly updated). It certainly seems conceivable that in this setting the hippocampus is important in setting up these representations, and the behavioral generalization curve then reflects the concurrent retrieval of these representations in parallel (i.e. as in exemplar models) to compute an EV for the current stimulus (as the authors suggest). My question is how a difference in the strength of these item representations in the hippocampus could influence the generalization curve? At present this is not modeled, and given the known role of dopaminergic modulation of hippocampal encoding this seems an additional factor to consider. Assuming the authors agree with this, could they incorporate this as an additional parameter in the model (distinct from the learning rate, which applies to value updating), and report the results?

It seems important for the authors relate these findings more closely to those of their previous study, which is not discussed in detail at present and consider possible inconsistencies. Firstly, no hippocampal PE signals were reported in that study: can the authors comment on any reasons for why this might be the case (e.g. paradigm differences etc? Secondly, stronger hippocampal-striatal connectivity in that study was associated with narrower generalization: here stronger midbrain-hippocampal connectivity was associated with broader generalization. Can the authors comment on the difference in findings, and also whether the midbrain-hippocampal connectivity result is replicable in the original dataset?

Hippocampal activity is reported to correlate with generalized PE during test phase. Given that no outcomes are presented, is the PE not equivalent to EV? If so, it would be worth stating this, partly because in recent years a number of studies have found EV to be reflected in hippocampal activity.

[Editors’ note: what now follows is the decision letter after the authors submitted for further consideration.]

Thank you for resubmitting your work entitled "Dopamine regulates stimulus generalization in the human hippocampus" for further consideration at eLife. Your revised article has been favorably evaluated by Timothy Behrens (Senior Editor), Michael Frank (Reviewing Editor), and two reviewers. We all agreed that the manuscript has been substantially improved and that you have done a nice job addressing the main issues. Still, there are a few remaining issues that need to be addressed before acceptance, as outlined below by each reviewer in turn.

I highlight two points here.

First, both Reviewers appreciated the new analyses showing specificity of the effects to the hippocampus, but each would like to see a follow-up analysis to further evaluate this specificity. Reviewer 1 notes that any strong claims regarding specificity need to show a region by drug effect interaction and not just a significant effect in hippocampus and null effect elsewhere (see e.g., Nieuwenhuis, Forstmann & Waenmakkers 2011 Nature Neuroscience who emphasize this point strongly). Reviewer 2 asks whether you also see specificity in the PPI analysis.

Second, Reviewer 2 would like to see a somewhat more fleshed out motivation for the prediction error analysis and to clarify whether the PE was evaluated at the outcome or during the choice.

Please address these and the other comments below, and we will be able to make a decision without further review.

Reviewer #1: This is a resubmission of a previous paper relating to the role of the hippocampus & dopamine in stimulus generalization. Overall, the authors have been very responsive to the concerns I had with the original paper: I was convinced by the new findings that distinguish the effect of drug in the absence of a model (through differences in kurtosis), the results of the inclusion of a similarity based model with exponential function, and consideration of learning during test as a possible explanatory factor. Also the discussion of the current findings with respect to their previous paper is informative.

I have few remaining concerns:

– The authors report a significant difference between drug/placebo groups in terms of prediction error signals in the hippocampus, but report that this effect was not significant in other regions showing prediction error signals (e.g. amygdala). Can they confirm whether there was a significant interaction (i.e. drug/placebo x brain region), for example in an appropriately constructed region of interest analysis? This seems important to demonstrate the specificity of the drug effect on PE signals.

– They observe that midbrain-hippocampal functional connectivity correlates with the inhibitory (but not excitatory) generalization coefficient of the model. Although there is a comment that relates to this in the Discussion, could the authors expand on why the behavioral findings between drug groups should be driven by differences in the excitatory coefficient, but not the functional connectivity findings? Also was there a significant difference between placebo and drug group in this correlation?

Reviewer #2: The authors did an extensive revision in which they addressed the main concerns raised in the last round, particularly the lack of a behavioral difference under drug and the lack of specificity of the fMRI findings. The result is an interesting paper presenting solid findings that are likely to be of broad interest.

I have a couple of remaining comments:

1) The rational for the prediction error analysis in the test phase is not spelled out clearly enough. Perhaps I misunderstood, but I thought the authors were arguing that the effect of dopamine at test is essentially on the retrieval of a learned representation, the generalization itself, and that it is explicitly not related to trial by trial updating during the test phase. This leaves me questioning what the link is to the prediction error at the outcome phase rather than the response/choice at the stimulus presentation phase.

2) The selectivity of the fMRI effects to the hippocampus are reassuring. What about the PPI effects? Here too it seems important to show regional selectivity, ideally using the same control regions, to show that the PPI differences are not generally related to pharmacological effects on BOLD.

eLife. 2016 Feb 2;5:e12678. doi: 10.7554/eLife.12678.014

Author response


[Editors’ note: the author responses to the first round of peer review follow.]

We have now thoroughly revised the manuscript based on these comments. Specifically, we provide evidence for an effect of amisulpride on generalization behavior by showing significant group differences in the shape (i.e., kurtosis) of the behavioral generalization gradients (see reviewer #2, point 1). We also show that the effect of amisulpride on model parameters controlling the width of generalization does not depend on the choice of the specific model (see reviewer #2, point 1). We have now compared our model to alternative models, including a model with an exponential similarity function and a model with additional sensory learning during test (see reviewer #2, point 2). We demonstrate that a Gaussian similarity function explains the data better than an exponential similarity function and that additional learning during test cannot account for our results. Importantly, we now show that the effects of dopamine on generalization related fMRI signals are specific to the hippocampus and do not extend into other regions involved in generalization such as the amygdala and the mPFC (see reviewer #1, point 1). We also clarify the conceptual advances over our previous study (Kahnt et al. 2012, J Neurosci), provided by using a pharmacological manipulation of dopamine, and a dissociation of encoding- and retrieval-based generalization (see reviewer #2, point 3).

All reviewers and editors expressed interest in the findings and framework, but questioned the level of advance over and above what your group has published in 2012 with a similar model and relation to hippocampal-striatal connectivity. The main issues that came up in the review discussion (some of which are reiterated by the individual reviewer comments below) are as follows. – First and foremost, while the dopaminergic effect is of real potential interest, all were concerned that there was no observable effect of the drug on generalization behavior, but only through the lens of the model parameters. In general a model can certainly be helpful to refine behavioral analysis, but (1) it has to be shown to fit the data well before its parameters can be interpreted, (2) it should be compared to alternative potential models, and (3) if both of these are successful the winning model should then guide an analysis of the behavioral data that would reveal the significant effects. Reviewers were also concerned that other (unmodelled) factors (e.g. new learning during testing session affected by dopamine) could be at play which make it less clear that dopamine is specifically changing the hippocampal generalization gradient, and that the current findings weren't well integrated with your previous findings. – The specificity of the effects was not fully established. Reviewers suggest control analyses that would address whether indeed what was seen in the hippocampus is something that is unique to the hippocampus, or whether it was a general effect of pharmacology in BOLD signal that was not observed elsewhere.

We have now revised the manuscript in light of these comments and believe that they have strengthened the manuscript considerably. Specifically, we now demonstrate an effect of amisulpride on the shape of behavioral generalization gradients by showing that compared to placebo, gradients in the amisulpride group have a significantly greater kurtosis. We also show that our model fits the behavioral data well, and compare our model to alternative potential models. Specifically, we demonstrate that the effect of amisulpride on model parameters controlling the width of generalization does not depend on the choice of the specific model. Moreover, we show that our original model with a Gaussian similarity function outperforms a model with an exponential similarity function and that adding a sensory learning mechanism does not improve the fit of the model. We then use the winning model to guide the analysis of the behavioral and neural data and characterize significant effects of the dopamine manipulation on model parameters. We also show that additional learning during test does not explain our results.

Importantly, we show that the effects of dopamine on generalization related fMRI signals are specific to the hippocampus and do not occur in other regions such as the amygdala, middle temporal gyrus and mPFC.

We also clarify the conceptual advancements over our previous study (Kahnt et al. 2012, J Neurosci), provided by using a dopamine receptor type-specific manipulation as opposed to a simple correlation between brain activity and behavior, and the dissociation of encoding- and retrieval-based generalization. The point-by-point responses below describe in detail how we addressed each of the reviewers’ comments.

Reviewer #1:

This study examined the role of dopamine in modulation of perceptual generalization, combining pharmacology, fMRI and computational modeling.

On the first day participants trained on a visual discrimination task. On the second day they were tested for generalization of the training to new samples varying in similarity to the trained items. The generalization test took place either under drug (D2 receptor blocker) or under placebo, in a between-subject design. The results suggest that both groups generalize similarly, overall. But the application of a computational model of generalization revealed subtle but interesting differences in the width of the generalization gradient, with a narrower gradient in the drug-treated group. This difference in the generalization gradient was associated with differences in BOLD activity in the hippocampus, with the drug group showing a weaker generalization-related BOLD response. Overall this is a solid and informative study using a convergence of methods to address an interesting question. I have just a few comments: 1) As with any pharmacological study, one must raise concerns about selectivity. How do you know the effects of the drug are not due to global differences in dopamine transmission, that are having an impact on BOLD activity in multiple places in parallel with (but not necessarily related to) the subtle differences in behavior? The study already addresses this somewhat by showing selectivity of effects on functional connectivity between the hippocampus and the midbrain, but not with the striatum. And the findings ruling out mere differences in perception also help address this point. But similar control analyses are also needed for the generalization gradient, to determine whether there is really a selective effect of the drug in the hippocampus, as the authors currently conclude. For example, Paz and colleagues have suggested an important role for the amygdala and the PFC in perceptual generalization. These regions are also targets of midbrain region and it would be useful to know whether or not they show parallel effects.

We thank the reviewer for raising this important point. There are two potential sources of non-selectivity that need to be considered:

1) First, there could be pharmacological effects on global blood flow that are independent of dopaminergic activity. Because the early visual cortex contains only very few dopamine receptors (Lidow et al. 1989, PNAS), visual activity provides a good control for non-specific (i.e., non-dopaminergic) effects of the drug on blood flow and the BOLD response. We therefore tested for drug-related differences in visual cue-related activity in the early visual cortex (AAL, calcarine sulcus). We did not find any differences in cue-related activity in early visual cortex between groups (t = -0.80, P = 0.42), suggesting that amisulpride did not affect global blood flow. This is in line with previous studies that found no differences between amisulpride and placebo in visually evoked activity (e.g., Jocham, et al. 2011, J Neurosci).

2) Second, because we delivered amisulpride systemically rather than by intracerebral infusion, it could have altered BOLD responses in all brain regions that contain dopamine receptors (such as the amygdala), independent of their role in generalization. In other words, the question is whether the observed effects of amisulpride on hippocampal activity are unique to the hippocampus, or can also be identified in other dopaminoceptive brain regions that are involved in generalization. As a control, we therefore tested whether generalization-related activity in the amygdala and middle temporal cortex was also altered by amisulpride. We did not find significant differences in generalization-related activity in either the amygdala (P = 0.43) or the middle temporal gyrus (P = 0.31), suggesting that the effects of amisulpride on generalization-related activity are specific to the hippocampus.

While we do not find generalization-related activity in the PFC at our corrected threshold, we do find a cluster in the medial PFC (mPFC, x, y, z = -3, 56, 8, t = 4.03) at a more liberal, uncorrected threshold (P < 0.001). In order to test for potential drug effects in the PFC, we compared activity in this cluster across groups. The comparison between groups did not reveal any differences (t = -0.023, P = 0.98), further supporting the specificity of the hippocampus finding. Because the mPFC finding is not based on a corrected, but a more liberal uncorrected threshold, we decided to not include this finding in the manuscript. However, we would be happy to include these results if the reviewer feels this would strengthen the manuscript.

Taken together, the results from the two control analyses demonstrate that our findings cannot be explained by non-specific drug effects on blood flow, and that the effects of dopamine on generalization-related processing are specific to the hippocampus. We now include the results of these control analyses in the manuscript:

Methods:

“To test for global effects of amisulpride on blood flow, and thereby BOLD response, we tested whether cue-evoked activity in visual cortex differed between groups. For this, we set up a GLM including one regressor for the onset of the visual cue (HRF convolved) and the six head movement parameters. Cue-related activity in an anatomical mask of the calcarine sulcus (AAL) did not differ between groups (t = -0.80, P = 0.42), suggesting that amisulpride did not unspecifically affect the BOLD response. This is in line with previous studies that found no differences between amisulpride and placebo in visually evoked activity (Jocham et al., 2011).”

Results:

“To examine whether these effects of dopamine on generalization-related activity are specific to the hippocampus, as a control, we tested for similar group differences in the amygdala and middle temporal cortex. Importantly, we did not find significant differences in either the amygdala (P = 0.43) or the middle temporal gyrus (P = 0.31), in line with the idea that the effects of amisulpride on generalization-related activity are specific to the hippocampus.“

– Related to the point about possible global differences between drug and placebo, were there any reaction time differences between the groups?

As suggested by the reviewer, we tested for group differences in response time (RT). Although amisulpride is generally known not to alter RT (e.g., Rosenzweig 2002, Hum Psychopharmacol Clin Exp; Jocham et al. 2011, J Neurosci), we found a non-significant group difference in RT (t = 1.83, P = 0.074), with slower responding in amisulpride (mean RT +/- SEM = 816 +/- 16.6) compared to placebo (mean RT +/- SEM = 770 +/- 18.5). All group comparisons remained significant when RT was included as a covariate in the statistical models, suggesting that non-significant differences in RT did not confound the imaging results. We now add this to the manuscript.

Methods:

“Although amisulpride is generally known to not alter RT (Jocham et al., 2011; Rosenzweig et al., 2002), we found a trend-level group difference in RT (t = 1.83, P = 0.074), with slower responding in amisulpride (mean RT ± SEM = 816 ± 16.6) compared to placebo (mean RT ± SEM = 770 ± 18.5). Importantly, all group comparisons remained significant when RT was included as a covariate in the statistical models, suggesting that non-significant differences in RT did not confound the imaging results.”

2) The Discussion makes the important distinction between associative and perceptual generalization, but this distinction is muddled in the introduction where the two appear to be conflated. It will help readers if this distinction is clarified earlier on.

We now explicitly mention this distinction in the Introduction.

Introduction: “Two basic forms of generalization can be distinguished based on what constitutes the relation among stimuli; associative and stimulus generalization. In the case of associative generalization, such as transitive inference and acquired equivalence, the associative relationship among stimuli determines similarity. This relationship can be established for example by sensory preconditioning (train stimuli A-B and B-C, test whether A comes to predict C) or by a common associate (e.g. train A-C and B-C, test whether A and B are associated). In contrast, in stimulus generalization the relationship among stimuli is established by the similarity along one or more perceptual dimensions (frequency of sounds, color, line orientation, etc.).“

Reviewer #2:

Major concerns: Given the effect of drug on generalization curve is relatively small (i.e. only detectable by model fitting), it seems important that the authors consider alternative models. If these also showed a significant difference in group parameters this would make the findings appear more robust. In particular, exemplar models (e.g. Nosofsky, 1984) and models of similarity based generalization (e.g Shepard, 1987) typically use an exponential similarity function. Was there a particular reason for opting for a Gaussian function instead, and could the authors please provide results from an analogous model with exponential function for comparison? This would be particularly useful given the possible relationship between exemplar models and the hippocampus.

Thank you for raising this important point. We now report evidence for an effect of amisulpride on the shape of the observed behavioral generalization gradients independent of a specific computational model. Specifically, while none of the individual data points along the generalization gradient individually differed between groups, overall, gradients in the amisulpride group were reduced at both flanks and enhanced at the peak and tail of the curve. These features of the shape of the gradients are parsimoniously captured by the kurtosis of the underlying distributions. We therefore compared the kurtosis of the group-specific distributions (Pearson type VII distribution), and found significantly greater kurtosis in the distribution of the amisulpride group (permutation test, P = 0.043). This result provides evidence for effects of the drug on the overall shape of the behavioral generalization gradients. We now include this in the manuscript.

Abstract:

“Blocking dopamine D2-receptors (D2R) altered generalization behavior as revealed by an increased kurtosis of the generalization gradient, and a decreased width of model-derived generalization parameters.”

Results:

“Direct group comparisons of the individual data points along the generalization gradient did not reveal any significant differences (two-sample t-tests, all Ps > 0.29). However, visual inspection of the gradients suggested that the amisulpride group had a narrower gradient than the placebo group, with enhanced responding at the peak of the curve, reduced responding at both flanks, and enhanced responding at the tail of the curve. These shape features are parsimoniously described by the 4th moment of probability distributions, namely, their kurtosis. Accordingly, a test for differences in the kurtosis of group-specific distributions (Pearson type VII distribution, see Materials and methods) revealed a significantly greater kurtosis in the amisulpride group compared to placebo (PA: 6.73, PP: 3.29; permutation test, P = 0.043). This finding demonstrates that amisulpride narrowed the width and increased the peak of the behavioral generalization gradient, and suggests that D2R activity alters the neurocomputational processes that mechanistically control generalization behavior.”

Materials and methods: “Comparing the kurtosis of behavioral generalization gradients”.: “In order to compare the overall shape of the behavioral generalization gradients, we estimated the 4th moment of the distributions underlying the behavioral gradients (because the behavioral gradients were bounded (17-73 degrees) and not centered on 45 degrees, direct numerical estimation of the kurtosis was not possible). For this, we fitted a Pearson type VII distribution to the behavioral generalization gradient of each group and numerically computed the kurtosis of the group-specific distributions according to:

kurt(X)=E[(Xμ)4]σ4

The kurtosis was then compared between groups, and statistical inference on the observed group difference was performed using a permutation test.”

Regarding the shape of the modeled similarity function (i.e., generalization coefficient), we note that Shepard (1987) indeed proposed that similarity decreases exponentially with distance from the CS. However, he also listed several factors, such as repeated discrimination training and delays between training and test (both apply to our current experiment), that can render similarity functions Gaussian. Accordingly, and in line with the results of systematic reviews of empirical data (Ghirlanda & Enquist, 2003, Animal Behaviour), we initially opted for a Gaussian similarity function for modeling the behavior.

To assess the validity of our choice, and as suggested by the reviewer, we formally compared models with Gaussian and exponential similarity functions. The exponential similarity functions for the inhibitory and excitatory gradients were implemented as:

iSjk=exp(xjxk)22si2andeSjk=exp(xjxk)22se2As can be seen in Figure 2–figure supplement 1, compared to the Gaussian model, the exponential model was substantially worse in capturing the behavioral generalization gradients of both groups. Also, the exponential model was unable to capture the group differences in the shapes of the generalization gradients, and accordingly, the estimated generalization coefficients did not differ between groups (inhibitory coefficient, P = 0.34, excitatory coefficient, P = 0.24). A formal model comparison showed that the model with the Gaussian similarity function outperformed the model with the exponential similarity function as indicated by smaller AIC and BIC scores (Gaussian: AIC = 9586.4, BIC = 9629.2; Exponential: AIC = 9691.6, BIC = 9734.4). We also compared the fit of the two models by comparing the average regression coefficients from a logistic regression of the trial-by-trial responses on the modeled P(+) responses. This revealed significantly better fits of the Gaussian compared to the exponential model in the entire group of subjects (paired t-test t = 5.47, P < 0.001), as well as in both groups individually (PA, t = 3.34, P = 0.003 and PP, t = 5.72, P < 0.001).

Taken together, these findings are in line with the observations by Shepard (1987) and demonstrate that the Gaussian model fits our data better than a model with an exponential similarity function. As suggested by the Editors, we now address this point by including a formal model comparison before testing for group differences in the estimated parameters of the winning (i.e., Gaussian) model.

Results:

“Because the shape of the function determining the similarity between the currently presented stimulus and other stimuli (i.e., the generalization coefficient) is of critical importance (Ghirlanda and Enquist, 2003), we directly compared the most commonly used models, i.e. one with Gaussian (Kahnt et al., 2012) the other with exponential similarity functions (Shepard, 1987). While the exact shape of the similarity functions differs between models, for both models, the extent to which inhibitory and excitatory associations generalize to the current stimulus is controlled by the width of the similarity functions (si and se) (Figure 4B). […] Although both models predicted behavioral responses reliably (Gaussian: t = 11.06, P < 0.001; exponential: t = 11.38, P < 0.001), we find significantly higher regression coefficients for the Gaussian model (paired t-test t = 5.34, P < 0.001). Taken together, this demonstrates that in our experiment, a Gaussian similarity function fits behavior better than an exponential similarity function. “

Materials and methods:

“In order to compare models with Gaussian and exponential similarity functions, we estimated the free parameters of both models by combining the LLE from subjects in both groups, and compared the aggregate LLE from the best fitting parameter sets using AIC and BIC.”

It is important to note that the effect of amisulpride on the model parameters controlling the width of generalization does not depend on the a priori choice of the Gaussian model. Specifically, the difference between the exponential and Gaussian similarity functions is determined by the exponent of the (absolute) difference between xj and xk (1 and 2 for the exponential and Gaussian model, respectively). Instead of comparing the fit of different theoretical shapes (exponential and Gaussian), we have also directly estimated the best fitting exponent by including the exponent as an additional free parameter in the model. This resulted in a parameter of 2.04, close to the assumed Gaussian model. Also for this model, the width of the inhibitory and excitatory generalization coefficients differed significantly between groups (P = 0.013 and P = 0.016, for the inhibitory and excitatory coefficients, respectively). However, because this exponential model came at the expense of an additional parameter, without substantially increasing the LLE, it was outperformed by the Gaussian model as indicated by lower AIC and BIC values for the Gaussian model (AIC = 9586.4 vs. 9588.3 and BIC = 9629.2 vs. 9638.2 for Gaussian vs. exponent as free parameter, respectively). These results demonstrate that the effect of amisulpride on the parameters controlling the width of generalization does not depend on the choice of the specific model. In the interest of parsimony, we tentatively opted against including this model in the manuscript. However, if the reviewer feels this should be included, we would be happy to revise the manuscript accordingly.

One important claim of the paper is that dopamine widens the generalization curve (e.g. modeled by increase in excitatory coefficient relating to width of Gaussian). Whilst this makes sense at face value, I wonder whether that there are potentially complicated interactions at play during the generalization test that may (also/instead) be affected by dopamine. In brief, and detailed subsequently: dopamine is known to affect the strength of hippocampal encoding and perhaps this is the reason for changes in the generalization curve. This would be important (apart from changing the mechanistic explanation) because it suggests that the dopamine/hippocampal effect is to some extent paradigm specific: it does not apply to "one-shot" generalization (perhaps more naturalistic), but only under settings where lots of stimuli are presented and generalization required. To state this in more detail: subjects are learning during the generalization phase: i.e. storing new representations of the previously unseen test items and their predicted values (which are being constantly updated). It certainly seems conceivable that in this setting the hippocampus is important in setting up these representations, and the behavioral generalization curve then reflects the concurrent retrieval of these representations in parallel (i.e. as in exemplar models) to compute an EV for the current stimulus (as the authors suggest). My question is how a difference in the strength of these item representations in the hippocampus could influence the generalization curve? At present this is not modeled, and given the known role of dopaminergic modulation of hippocampal encoding this seems an additional factor to consider. Assuming the authors agree with this, could they incorporate this as an additional parameter in the model (distinct from the learning rate, which applies to value updating), and report the results?

The reviewer addresses an interesting point here, namely that dopamine-related differences in learning during the test session could have affected the shape of the generalization gradient. It is important to point out that our original model already accounts for additional associative learning during test. That is, test stimuli acquire stimulus-outcome associations by means of the same learning mechanism that is used during training (δ rule). The estimated learning rates for value learning during test (αtest) did not differ between groups (P = 0.214), and therefore cannot account for differences in the shape of generalization gradients between groups.

If we understand correctly, the reviewer proposes that dopamine could affect a different form of learning during test, instead of, or in addition to, value learning. Specifically, it is possible that dopamine alters the sensory encoding of novel test stimuli for which further associative learning takes place. Such sensory representations could gradually build up over the test session as test stimuli are repeatedly encountered. In this scenario, test stimuli can contribute to generalized expected value (V) only to the degree that sensory representations for these stimuli actually have been established.

To explore this possibility, we modeled the gradual encoding of test stimuli using a mnemonic stimulus buffer M. Before a given test stimulus xk is encountered for the first time, this buffer is zero, and is then updated on a trial by trial basis using the δ rule and a sensory learning rate (αsensory):

ΔMk=sensory(1Mk)

Please note that a sensory learning rate of 1 (αsensory = 1) renders the model equivalent to our original model, in which sensory representations of test stimuli (M) are fully established after the first encounter.

When the generalized expected value (V) is computed on a given trial, M serves to discount the contribution of the test stimuli’s excitatory and inhibitory associations (Ik and Ek):

Vt=jMkEt,jeSjkMkIt,jiSjkAs in our original model, the excitatory and inhibitory associations of the test stimuli (Ik and Ek) are updated using the value learning rate (αtest):

Et+1,k=Et,k+αδtifδt>0 and

It+1,k=It,kαδtifδt<0With δt=RVtFitting αsensory for the entire group of subjects revealed an αsensory of 1, demonstrating that the fit of our initial model was not enhanced by adding the gradual sensory encoding mechanism. Accordingly, because of the additional free parameter, both AIC and BIC were in favor of our original model (AIC = 9588.4 vs. 9586.4; and BIC = 9638.3 vs. 9629.2).

Moreover, estimating the model in both groups separately also revealed an αsensory of 1 in both groups, suggesting that dopamine did not affect how sensory representations of test stimuli are encoded over the course of the test session.

It is worth noting that the two learning rates αsensory and αtest are not entirely independent. Specifically, within certain ranges (i.e., αsensory > 0 and αtest > 0), changes in αsensory can be compensated by changes in αtest, and vice versa. This is because across test trials the δ rule for the update of Ik and Ek similarly induces a gradually increasing contribution of individual test stimuli to the generalized value prediction V. Nevertheless, the fact that both learning rates did not differ between groups indicates that additional learning during the test session was not altered by dopamine, and can therefore not account for the differences in generalization gradients.

In summary, these analyses suggest that our results cannot be explained by effects of dopamine on the encoding of sensory stimulus representations during test. Even though this is an interesting possibility, for the sake of clarity, we tentatively opted to not include the above results in the manuscript. However, if the reviewer feels that this should be included, we would be happy to incorporate these passages.

We now state explicitly that additional learning during the test session did not differ between groups.

Results:

“Importantly, the learning rate during test did not differ between groups, demonstrating that additional learning during the test session (P = 0.21), which might have been altered by amisulpride, cannot account for the differences in generalization gradients.”

It seems important for the authors relate these findings more closely to those of their previous study, which is not discussed in detail at present and consider possible inconsistencies. Firstly, no hippocampal PE signals were reported in that study: can the authors comment on any reasons for why this might be the case (e.g. paradigm differences etc? Secondly, stronger hippocampal-striatal connectivity in that study was associated with narrower generalization: here stronger midbrain-hippocampal connectivity was associated with broader generalization. Can the authors comment on the difference in findings, and also whether the midbrain-hippocampal connectivity result is replicable in the original dataset?

We now describe the differences between our previous (2012) and the current study more clearly. Most importantly, our previous study was silent about the pharmacology and timing of stimulus generalization. Procedurally, it consisted of several alternating training and testing sessions on a single day, whereas the current study used a single training and a single test session, which were separated by a 24 h delay. Moreover, in the 2012 study CS-US associations were deterministic, whereas the current study involved a 50% reinforcement schedule in order to slow down extinction during test. These factors could have caused the differences in PE related activity in the striatum vs. hippocampus. However, it is worth stating that in the 2012 data we did observe PE related activity in the hippocampus (left [-36, -25, -8], t = 3.84; right [30, -7, -14], t = 3.97, P < 0.001), but these activations did not survive correction for multiple comparisons and were therefore not reported in the final manuscript.

Moreover, in the 2012 study, hippocampal-striatal connectivity was negatively correlated with the width of the excitatory vs. inhibitory gradients (excitatory minus inhibitory). As such, this negative correlation could have been driven by a negative correlation with the excitatory gradient, and/or a positive correlation with the inhibitory gradient. In fact, going back to the original data, we find only a relatively weak negative correlation between striatum-hippocampus connectivity and the excitatory gradient (r = -0.42, P = 0.053), but a strong and positive correlation between connectivity and the inhibitory gradient (r = 0.57, P = 0.005). This shows that the relationship between striatal-hippocampal connectivity and the inhibitory gradient in the previous study is compatible with the results reported here.

We also revisited the data from the 2012 study with regard to the current finding of a positive correlation between midbrain-hippocampal connectivity and the width of the inhibitory coefficient. Replicating the results of our current study, we find that midbrain-hippocampal connectivity was significantly correlated with the inhibitory generalization coefficient (r = 0.44, P = 0.036), but not with the excitatory coefficient (r = 0.15, P = 0.49).

We now discuss the differences to our previous study, and relate the findings more thoroughly.

Discussion:

“By revealing the effects of dopamine on generalization, our current results substantially extend those of our previous study (Kahnt et al., 2012). Specifically, the current experiment suggests that dopamine is involved in regulating the width of generalization in the hippocampus, and dissociates effects of generalization during retrieval vs. encoding, which could not be achieved with the previous design. However, whereas here we find that prediction errors correlate primarily with activity in the hippocampus, the previous study identified prediction error related activity primarily in the ventral striatum. […] Thus, the two studies converge in showing a preferential relation of the connection between the midbrain and the hippocampus to inhibitory generalization.”

Hippocampal activity is reported to correlate with generalized PE during test phase. Given that no outcomes are presented, is the PE not equivalent to EV? If so, it would be worth stating this, partly because in recent years a number of studies have found EV to be reflected in hippocampal activity.

The reviewer is correct that because no outcomes were shown in the test session, PEs are perfectly (but negatively) correlated with EV. We now mention this in the paper and include references reporting value signals in the hippocampus.

Results:

“Please note that because no outcomes were shown, prediction errors are perfectly (but negatively) correlated with expected value.”

[Editors' note: the author responses to the re-review follow.]

First, both Reviewers appreciated the new analyses showing specificity of the effects to the hippocampus, but each would like to see a follow-up analysis to further evaluate this specificity. Reviewer 1 notes that any strong claims regarding specificity need to show a region by drug effect interaction and not just a significant effect in hippocampus and null effect elsewhere (see e.g., Nieuwenhuis, Forstmann & Waenmakkers 2011 Nature Neuroscience who emphasize this point strongly). Reviewer 2 asks whether you also see specificity in the PPI analysis. Second, Reviewer 2 would like to see a somewhat more fleshed out motivation for the prediction error analysis and to clarify whether the PE was evaluated at the outcome or during the choice.

We thank the editors for the positive evaluation of our resubmitted manuscript. Below we address the remaining comments. Most importantly, we now include explicit tests of drug-by-region interactions to formally evaluate the specificity of the drug effects in the hippocampus. We also address the remaining interpretative issues.

Reviewer #1:

This is a resubmission of a previous paper relating to the role of the hippocampus & dopamine in stimulus generalization. Overall, the authors have been very responsive to the concerns I had with the original paper: I was convinced by the new findings that distinguish the effect of drug in the absence of a model (through differences in kurtosis), the results of the inclusion of a similarity based model with exponential function, and consideration of learning during test as a possible explanatory factor. Also the discussion of the current findings with respect to their previous paper is informative. I have few remaining concerns:

We thank the reviewer for these positive comments on the resubmitted version of our manuscript. Below we address the remaining comments.

– The authors report a significant difference between drug/placebo groups in terms of prediction error signals in the hippocampus, but report that this effect was not significant in other regions showing prediction error signals (e.g. amygdala). Can they confirm whether there was a significant interaction (i.e. drug/placebo x brain region), for example in an appropriately constructed region of interest analysis? This seems important to demonstrate the specificity of the drug effect on PE signals.

We agree with the reviewer that in order to demonstrate specificity, interactions need to be computed. In line with specificity, post-hoc analyses comparing the effect of the drug in the hippocampus to the drug effect in the other regions (i.e., group-by-region interactions), demonstrated that the drug effect in the hippocampus was significantly stronger than in the mPFC (P = 0.02). However, similar interactions involving the amygdala (P = 0.097) and the middle temporal gyrus (P = 0.127) did not reach significance. This suggests that while the drug effects are specific to the hippocampus relative to the mPFC, they do not exceed the non-significant drug effects in the amygdala and the middle temporal gyrus. We now add these results to the manuscript and adjust our interpretation regarding specificity accordingly.

Results:

“No significant group differences were observed in the amygdala (P = 0.43), the middle temporal gyrus (P = 0.31), or the medial PFC (P = 0.98). However, post-hoc analyses directly comparing the effect of the drug in the hippocampus to the drug effect in the other regions (i.e., region-by-group interactions), demonstrated that while the effect of D2R blockade in the hippocampus was significantly stronger than in the mPFC (P = 0.02) similar interactions involving the amygdala (P = 0.097) and the middle temporal gyrus (P = 0.127) did not reach significance. These data suggest specificity of the effects of D2R blockade on similarity-based processing in the hippocampus relative to the mPFC, but not necessarily relative to the amygdala and middle temporal lobe.”

– They observe that midbrain-hippocampal functional connectivity correlates with the inhibitory (but not excitatory) generalization coefficient of the model. Although there is a comment that relates to this in the Discussion, could the authors expand on why the behavioral findings between drug groups should be driven by differences in the excitatory coefficient, but not the functional connectivity findings? Also was there a significant difference between placebo and drug group in this correlation?

One possibility is that the decrease in the inhibitory and the excitatory coefficient induced by D2R blockade relies on separate neural mechanisms. Specifically, it is possible that only the effects of D2R blockade on the inhibitory coefficient are mediated by connectivity between midbrain and the hippocampus, whereas the D2R effects on excitatory coefficient are mediated via a different mechanism. The difference in the correlation between the placebo and the drug group was only at trend level (p = 0.054). We now spell out this possibility more explicitly in the revised manuscript:

Discussion:

“This raises the possibility that only the effects of D2R blockade on the inhibitory coefficient are mediated via a modulation of midbrain-hippocampal connectivity, whereas the effects on the excitatory coefficient are mediated via a different, yet to be explored, mechanism.”

Reviewer #2:

The authors did an extensive revision in which they addressed the main concerns raised in the last round, particularly the lack of a behavioral difference under drug and the lack of specificity of the fMRI findings. The result is an interesting paper presenting solid findings that are likely to be of broad interest. I have a couple of remaining comments: 1) The rational for the prediction error analysis in the test phase is not spelled out clearly enough. Perhaps I misunderstood, but I thought the authors were arguing that the effect of dopamine at test is essentially on the retrieval of a learned representation, the generalization itself, and that it is explicitly not related to trial by trial updating during the test phase. This leaves me questioning what the link is to the prediction error at the outcome phase rather than the response/choice at the stimulus presentation phase.

The reviewer is right, our results do suggest that the effect of dopamine is on the retrieval of learned information. Because no outcomes are shown during the test phase, we can use prediction errors as a proxy for generalized retrieved value. The prediction error was modeled at the time of outcome. However, we note that because of the fast timing of our task, it is difficult to dissociate effects during the stimulus/choice period from that of the outcome. We now make this more explicit in the manuscript.

Results:

“As a proxy of generalized value, we focused on prediction error responses derived from our model, which reflect the extent to which reward predictions have generalized from the original CS+ and CS− to the current stimulus (please note that because no outcomes were shown, prediction errors are perfectly but negatively correlated with expected value). Accordingly, to identify brain regions involved in similarity-based computations during generalization, we searched for regions in which fMRI activity correlated with generalized prediction errors at the time of the expected outcome.”

2) The selectivity of the fMRI effects to the hippocampus are reassuring. What about the PPI effects? Here too it seems important to show regional selectivity, ideally using the same control regions, to show that the PPI differences are not generally related to pharmacological effects on BOLD.

We examined the specificity of the connectivity effects and now include these results in the manuscript.

Results:

“To examine the specificity of these findings, we tested for similar drug-related effects on functional connectivity in the regions involved in similarity-based processing defined above. While connectivity estimates differed significantly between groups in middle temporal gyrus (P = 0.01), no drug effects were observed in the amygdala (P = 0.296) and the mPFC (P = 0.17). Accordingly, for all regions except the middle temporal gyrus (P = 0.13), the corresponding drug-by-region interactions were significant (all Ps < 0.05), suggesting that amisulpride-related decreases in midbrain connectivity are relatively specific to the hippocampus.”


Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

RESOURCES