Abstract
Considerable evidence suggests that, in instrumental conditioning, rats can encode both the specific action–outcome associations to which they are exposed and the degree to which an action is causal in producing its associated outcome. Three experiments assessed the involvement of the hippocampus in encoding these aspects of instrumental learning. In each study, rats with electrolytic lesions of the dorsal hippocampus and sham-lesioned controls were trained while hungry to press two levers, each of which delivered a unique food outcome. Experiments 1A and 1B used an outcome devaluation procedure to assess the effects of the lesion on encoding the action–outcome relationship. After training, one of the two outcomes was devalued using a specific satiety procedure, after which performance on the two levers was assessed in a choice extinction test. The lesion had no detectable effect on either the acquisition of instrumental performance or on the rats' sensitivity to outcome devaluation; lesion and sham groups both reduced responding on the lever associated with the devalued outcome compared with the other lever. In experiment 2, the sensitivity of hippocampal rats to the causal efficacy of their actions was assessed by selectively degrading the contingency between one of the actions and its associated outcome. Whereas sham rats selectively reduced performance on the lever for which the action–outcome contingency had been degraded, hippocampal rats did not. These results suggest that, in instrumental conditioning, lesions of the dorsal hippocampus selectively impair the ability of rats to represent the causal relationship between an action and its consequences.
Keywords: hippocampus, instrumental conditioning, outcome devaluation, reward, contingency, rat
The ability of a hungry rat to acquire seemingly arbitrary actions, such as lever pressing, to gain access to food is one of the most robust forms of learning one can observe in the animal laboratory. Nevertheless, there have been few recent attempts to systematically assess the neural basis of this form of learning. Although considerable research has investigated the brain processes that mediate learning generally, the vast majority of this work has focussed on the learning of predictive relationships between events primarily using the Pavlovian conditioning paradigm. As a consequence, the neural structures controlling instrumental conditioning remain poorly understood.
Significant advances have, however, been made in our understanding of the psychological determinants of instrumental action in the rat. Although for many years instrumental learning was characterized solely in stimulus–response (S-R) terms, recent evidence from outcome devaluation studies has made it clear that, in instrumental conditioning, rats are able to encode the specific consequences of their actions and that the encoded action–outcome (A-O) relationship plays a critical role in the initial acquisition and performance of an instrumental action (for review, see Colwill and Rescorla, 1986;Dickinson and Balleine, 1994). Nevertheless, evidence that A-O associations play a role in instrumental conditioning does not rule out the involvement of an S-R process. In fact there is considerable evidence suggesting that, when overtrained, instrumental performance can become stimulus-bound, independent of the current value of the outcome and so impervious to outcome devaluation (Adams, 1982;Dickinson and Balleine, 1995; Balleine and Dickinson, 1998a). It is evident, therefore, that both A-O and S-R processes can contribute to instrumental performance, although they serve quite distinct functions. The A-O process controls initial acquisition and performance of goal-directed actions, whereas the S-R process exerts more control over performance as an action becomes more habitual (cf. Dickinson and Balleine, 1994, 1995).
This functional distinction between the A-O and S-R learning processes has often been interpreted as implying that different memory systems contribute to instrumental conditioning. For example, a number of authors have suggested that A-O associations are encoded in declarative memory, whereas S-R associations are encoded in procedural memory (Winograd, 1975; Dickinson, 1980; Dickinson and Balleine, 1993; Squire and Zola-Morgan, 1996). With respect to the neural bases of instrumental conditioning, considerable recent evidence suggests that declarative memory is dependent on the integrity of the hippocampal formation (Squire, 1992; Eichenbaum et al., 1996; Squire and Zola-Morgan, 1996). This suggests that, in instrumental conditioning, A-O learning is instantiated in declarative memory and therefore, that this form of learning may be hippocampally dependent. This hypothesis predicts that damage to the hippocampus should render animals unable to encode the relationship between actions and their outcomes with the effect that their instrumental performance should be controlled predominantly by the S-R process. As a consequence, the instrumental performance of hippocampal rats should be relatively insensitive to outcome devaluation. This prediction was assessed in experiment 1.
EXPERIMENT 1A
In experiment 1A, rats with electrolytic lesions of the dorsal hippocampus and sham-lesioned controls were trained to press two levers, with one lever delivering food pellets and the other delivering a polycose solution. After training, one of these two outcomes was devalued using a specific satiety treatment (Balleine and Dickinson, 1998a) (for review, see Dickinson and Balleine, 1994, 1995). In this procedure, the animals are allowed access to one outcome ad libitum for 1 hr immediately before a choice extinction test conducted on the two levers. In line with previous findings, we anticipated that, on test, sham rats would show an outcome devaluation effect, i.e., they would perform fewer responses on the lever that, in training, delivered the now devalued outcome. In contrast, if, as predicted above, the hippocampus is essential to the formation of A-O associations during instrumental training, then any outcome devaluation effect established in controls should be severely attenuated in the lesioned group.
Method
Subjects and apparatus
The subjects were 17 experimentally naive adult female Long–Evans rats. Subjects were housed singly and were handled daily for 1 week before surgery. Training and testing took place in twelve Med Associates (East Fairfield, VT) operant chambers housed within sound- and light-resistant shells. Each chamber was equipped with a pump fitted with a syringe that delivered 0.1 ml of a 20% polycose solution into a recessed magazine in the chamber. Each chamber was also equipped with a pellet dispenser that delivered one 45 mg Noyes pellet (formula A/I) when activated. The chambers contained two retractable levers that could be inserted to the left and right of the magazine and were illuminated by a 3 W, 24 V house light mounted on the top center of the wall opposite the magazine. Microcomputers equipped with the MED-PC program (Med Associates) controlled the equipment, delivered reinforcers, and recorded the lever presses. For the outcome specific satiety prefeeding, animals received either 50 ml of the polycose solution in a calibrated glass drinking tube affixed to the front of the animal's home cage or a bowl containing 50 gm of Noyes pellets placed inside the home cage.
Surgery
At the time of surgery, animals weighed between 270 and 325 gm. Animals were anesthetized using sodium pentobarbital (Nembutal, 50 mg/kg), treated with atropine (0.1 mg), and then placed in a stereotaxic frame with the incisor bar adjusted so that lambda and bregma were level. Half the subjects received electrolytic lesions of the dorsal hippocampus, and the other half received sham surgery that consisted of lowering the electrode into the hippocampus without running any current. Electrodes, made from insect pins (size 00) covered in Epoxylite except for 1 mm at the tip, were lowered into four sites in the dorsal hippocampus (all coordinates relative to bregma; anteroposterior, −2.8 and −4.2; mediolateral, ±2.0 and ±3.0; dorsoventral, −4.0). When the electrodes were in place, a 1 mA, 20 sec current was passed through the electrode. After surgery, the animals were given 1 week to recover, during which they were handled daily.
Histology
At the end of the experiment, the animals were scarificed using a lethal barbiturate overdose and perfused transcardially with 0.9% saline, followed by 10% formalin solution. The brains were stored in 10% formalin solution for 48 hr and then transferred to a 25% sucrose–formalin solution before 40 μm coronal sections were cut throughout the region of the hippocampus. The slices were stained using thionin. Slides were examined for placement and extent of the lesion, with the latter assessed by microscopically examining sections for areas of marked cell loss and gliosis.
Procedure
After recovery from surgery, subjects were placed on a food deprivation schedule such that they received 15 gm of their maintenance diet daily to maintain them at ∼85% of their free-feeding weight. During training, the animals were fed after each training session. Animals had access to tap water ad libitum while in the home cage. Each session started with the illumination of the house light and insertion of the levers when appropriate and ended with the retraction of the levers and turning off of the house light. All sessions were 30 min in duration.
Magazine training. Initially, all subjects received two 30 min sessions of magazine training in which 15 presentations of each of the two reinforcers were given on a random time 60 sec schedule with the levers withdrawn.
Lever acquisition. In the next session, the animals were trained to press one of the two levers until 100 reinforcers had been earned. The animals were then trained on the other lever until 100 of the alternate reinforcer were earned. Half of the animals were trained with the left lever earning pellets and the right lever earning polycose, and the remaining animals received the opposite action–outcome assignments. In this initial phase of lever training, outcomes were delivered on a fixed interval schedule (FI-20). Once all animals had earned 100 of each reinforcer type, they were shifted to a random ratio (RR) schedule of reinforcement in which the appropriate outcome was delivered at a fixed probability after each action.
Lever training. The animals were first trained on an RR-5 schedule (i.e., each action delivered an outcome with a probability of 0.2). After 3 d of training, this was changed to an RR-10 (or a probability of 0.1) schedule for 3 d and then to an RR-20 schedule (or a probability of 0.05) for an additional 3 d of training. The animals received two training sessions each day, one with each action–outcome pair. The animals had a break of at least 30 min between sessions.
Devaluation test. After the final day of RR-20 training, all of the rats were given access to one of the two outcomes ad libitum for 1 hr in the home cage. Half of the animals in each action–outcome pair assignment received pellets, and the remaining animals received polycose. Immediately after the prefeeding, the animals were placed in the operant chambers. A 20 min choice extinction test was then conducted in which both levers were extended and the number of presses was counted on each lever. No outcomes were delivered during the test.
Results and Discussion
Histology
No recovery problem or weight loss was observed after surgery. In all lesioned animals (n = 8), damage to the dorsal hippocampus was bilateral and complete except at the rostral and caudal extremes. Figure 1 summarizes the maximum and minimum damage resulting from the lesions for the animals included in the behavioral analysis. There was no systematic damage to the overlying cortex observed in either group.
Acquisition
No effect of the lesion on the acquisition of lever pressing was observed. The two groups acquired the lever press response in the same number of days. Furthermore, no difference in the level of lever pressing between the lesion [mean (M), 350; SE, 111.8] and sham (M, 408; SE, 100.6) groups was evident at the end of initial training (F < 1).
Devaluation test
Two hippocampal subjects failed to consume the free outcome in the home cage and were excluded from the analysis. The results of the extinction test are illustrated in Figure2. It is clear from this figure that both the sham and the lesioned groups performed fewer responses on the lever that, in training, earned the subsequently devalued outcome. As such, this figure suggests that a comparable outcome devaluation effect was observed in both groups. This description was confirmed by the statistical analysis. A 2 × 2 mixed ANOVA was conducted with a between-subjects factor of group and a within-subjects factor of devaluation, the later separating performance on the devalued and nondevalued lever. This analysis revealed a significant effect of devaluation (F(1,13) = 16.562;p < 0.01), but not an effect of group (F < 1) or a group × devaluation interaction (F(1,13) = 1.406; p > 0.05). Inspection of Figure 2 may suggest that the devaluation effect was smaller in the hippocampal animals, but the hippocampal animals showed numerically smaller (although not statistically different) response levels in training, which may account for the lower response levels in the devaluation test.
The results of experiment 1A failed to confirm the prediction that, in instrumental conditioning, A-O learning depends on the integrity of the hippocampal formation. Despite considerable destruction to the dorsal hippocampus, lesioned rats showed a clear outcome devaluation effect in the choice extinction test.
EXPERIMENT 1B
The results of experiment 1A suggest that animals with hippocampal lesions are sensitive to changes in the value of an outcome and can integrate this change in value with an association between a specific action and the paired outcome encoded during training. It has been shown that, in normal animals, the incentive value of an outcome can be mediated by a motivationally arbitrary stimulus, such as taste (Balleine and Dickinson, 1998a; Rescorla, 1990). This is a much more subtle and presumably more difficult discrimination to make. In experiment 1A, the outcomes differed not only in taste but also in texture and nutritional value. The purpose of experiment 1B was to examine whether animals with hippocampal lesions are capable of the more difficult discrimination of outcomes that differ only in their taste.
Method
Subjects and apparatus
Seventeen experimentally naive female Long–Evans rats were housed and maintained under the same conditions and trained in the apparatus as described for experiment 1A, except that a second pump and syringe assembly was added to deliver a second solution. Both pumps delivered a 20% sucrose solution, but for one pump, the solution was flavored with orange Kool-Aid, whereas for the other pump, the solution was flavored with grape Kool-Aid.
Surgery
At the time of surgery, animals weighed between 290 and 380 gm. The surgical and histological procedures were identical to those described for experiment 1A.
Procedure
The procedure was similar to that of experiment 1A with the following exceptions. The left lever earned grape-flavored sucrose and right lever earned orange-flavored sucrose for half the animals, whereas for the remainder, these action–outcome relationships were reversed. The animals were given 2 d of magazine training and then trained on the FI-20 schedule until they earned 100 orange outcomes and then 100 grape outcomes as described for experiment 1A. After acquisition, training on the random ratio schedules was conducted exactly as in experiment 1A, except that the rats received 5 d of RR-5 and 5 d of RR-10 training on each action before testing.
Devaluation test. All animals underwent the outcome specific satiety devaluation treatment described above; each animal received 1 hr access to one of the two outcomes in the home cage ad libitum. Immediately after this exposure period, the animals were placed in the operant chambers and given a single two-lever choice extinction test as described for experiment 1A.
Results and Discussion
Histology
Representative lesions for animals included in the lesion group are displayed in Figure 1.
Training
Over the course of training, both groups acquired the lever press response in the same number of days and, at the end of training, there was no difference in response rate between the lesion (M, 294.5; SE, 47.7) and sham (M, 247.6; SE, 55.6) groups (F < 1).
Devaluation test
Figure 3 illustrates the data from the extinction test. As was observed in experiment 1A, a comparable outcome devaluation effect was observed in both the lesion and sham rats, with both groups animals performing fewer presses on the lever that, in training, had delivered the subsequently devalued outcome. Analysis of the extinction test data reveals a significant effect of devaluation (F(1,15) = 7.325;p < 0.01), but not an effect of group or a group × lesion interaction (F < 1 for both).
This finding replicates that of experiment 1A and indicates that animals with lesions of the dorsal hippocampus are able to discriminate and encode the value of outcomes based on a single, motivationally irrelevant flavor feature. Additionally, it should be noted that there is no suggestion that the devaluation effect may be smaller in hippocampal animals as may have been suspected in experiment 1A. Together, experiments 1A and 1B provide consistent evidence against the suggestion that the hippocampus mediates the encoding of the action–outcome association in memory. Indeed, these experiments provide no evidence whatever to suggest that the hippocampus plays a role in instrumental conditioning. It has been suggested by White and colleagues (McDonald and White, 1993) that there are multiple memory systems that may have different roles in specific tasks, and so, it remains possible that sensitivity to outcome devaluation may be dependent on another neural structure, such as the amygdala.
EXPERIMENT 2
There are a number of features of instrumental conditioning that stand discordant with a simple S-R approach and that support the view that animals can encode the relationship between an action and its consequences. The first of these has been described above, namely demonstrations that instrumental conditioning is sensitive to devaluation of the instrumental outcome. A second demonstration of this same principle comes from experiments indicating that the instrumental performance of animals is sensitive to changes in the contingency between a particular action and its consequences (Hammond, 1980; Dickinson and Mulatero, 1989; Balleine and Dickinson, 1998b).
In one study, for example, Balleine and Dickinson (1998b) found that, when the response–outcome contingency was degraded for one of two action–outcome pairs, animals selectively decreased performance of that response. With respect to the current analysis, this demonstration is important because it suggests that, although hippocampal animals may be able to encode adventitious action–outcome relationships that they are exposed to in training, it is still possible that they remain insensitive to the causal consequences of their actions. There is, in fact, some evidence that accords with this suggestion. Devenport (1979,1980) and Devenport and Holloway (1980) argued that the ability of rats to detect causal, as opposed to adventitious, relationships in their environment depends on the hippocampus; that, without an intact hippocampus, instrumental performance is controlled solely by response–reward contiguity as envisaged within S-R reinforcement theory. In support of this claim, Devenport and Holloway (1980) found that rats with hippocampal lesions and trained to lever press for food continued to press at high rates, even when the causal relationship between action and outcome was removed and reward delivery was shifted from a random-interval to a random-time schedule.
Although suggestive, there are several features of these experiments that make this conclusion premature. The results of Devenport and Holloway's (1980) study are particularly open to question because, in their study, the hippocampal lesioned group was pressing at much higher rates than intact controls before the shift to the random time schedule. Response–reward contiguity is positively related to baseline response rate on random-time schedules, and so there are good grounds for suggesting that the effectiveness of the shift in contingency differed between groups. If this is the case, the claim that the instrumental performance of hippocampal animals suffered any general loss of control by the action–outcome contingency must be reexamined.
In experiment 2, therefore, we assessed whether the hippocampal rats in our studies were sensitive to the causal consequences of their actions using a procedure pioneered by Hammond (1980) but as modified in the experiment of Balleine and Dickinson (1998b). To achieve this, the rats used in experiment 1 were given several sessions of retraining after the outcome devaluation test, after which they were given several training sessions in which one of the two instrumental action–outcome contingencies was degraded. Finally, the rats were tested in extinction on the levers to assess the impact of the shift in contingency. If, as suggested by Devenport and Holloway (1980), the hippocampus is critical for encoding the causal relationship between an action and outcome, hippocampal lesions should significantly impair the sensitivity of rats to this shift in the instrumental contingency. As such, although it is predicted that sham rats should reduce performance of the response that is no longer causal with respect to its outcome relative to the other response, hippocampal rats should not show this effect and perform both responses at comparable rates in the extinction test.
Method
Subjects and apparatus
The subjects were those described for experiment 1A above. The equipment and general apparatus were also the same as that described above.
Surgery
Surgical procedures are described in experiment 1.
Procedure
Contingency degradation training. After the training and testing described for experiment 1A, the animals were retrained on RR-20 schedules for 2 d, after which one of the two instrumental contingencies was degraded. At the end of training, each lever earned a unique outcome (pellets or a 20% polycose solution) with a fixed probability, p(O/A) = 0.05. In subsequent sessions, in addition to being earned by one of the actions, one of the outcomes was now also delivered noncontingently with the same probability [p(O/A) = 0.05] in each second without a response. For one lever, the free reinforcer was the same as that which was earned by a response on that lever. Thus, the experienced probability of the delivery of that particular outcome was the same whether or not the animals performed that action, a procedure that should have acted to degrade that action–outcome contingency. For the other lever, the free reinforcer was different from the earned reinforcer, and so this contingency was nondegraded. For half of the animals, the degraded contingency involved pellet delivery, whereas for the remainder, it involved the delivery of the polycose solution. The animals had two 30 min training sessions per day, one on each lever and, hence, on each contingency. The animals had a break of ∼1 hr between sessions, and the order of the sessions was alternated each day. This training continued for 3 d.
Contingency test. On the day after the final day of contingency training, rats in both groups received a choice extinction test. The test began with the insertion of the levers and the onset of the house light and ended 20 min later with the retraction of the levers and the offset of the house light. No outcomes were presented during this session.
Results and Discussion
Histology
Representative lesions displaying the maximum and minimum damage resulting from the lesions are represented in Figure 1.
Contingency degradation training
Figure 4 shows the effects of contingency degradation across training days on the lever press response. The animals were sensitive to the degradation of the instrumental contingency, performing more responses on the lever for which the contingency had not been degraded. The statistical analysis of the contingency training data revealed a main effect of contingency (F(1,15) = 10.540; p< 0.01), a main effect of training day (F(2,30) = 12.241; p< 0.01), but no main effect of group (F < 1). Examination of Figure 4 suggests that the hippocampal animals were not sensitive to the degradation of the instrumental contingency; however, the group × contingency interaction failed to reach significance (F(1,15) = 1.660; p > 0.05). None of the other interactions were significant.
Extinction test
The critical data from the experiment, those arising from the choice extinction test, are presented in Figure5. It is clear that the two groups performed very differently on this test. Whereas the sham group showed a clear difference in performance of the action whose contingency was degraded relative to the other action, this was not true of the hippocampal rats, which appeared to perform both actions at a comparable rate. The analysis of the extinction test data confirmed this description. This analysis revealed no main effect of contingency (F(1,15) = 2.204; p > 0.05) and a marginal main effect of group (F(1,15) = 3.948; p = 0.06), but, most importantly, there was a significant group × contingency interaction (F(1,15) = 5.862; p = 0.02). Simple main effects analysis revealed that, for the sham group, there is a significant difference induced by the degraded and nondegraded contingency treatments (F(8) = 7.88; p< 0.01). As shown in Figure 5, these animals performed fewer presses on lever for which the contingency had been degraded. However, for the lesion group, there was no difference in response rate on the two levers after the contingency treatment (F < 1).
These results are consistent with the suggestion that the hippocampal lesion rendered rats insensitive to the causal consequences of their instrumental actions. Thus, whereas the sham rats showed considerable sensitivity to the selective degradation of a specific instrumental contingency, the hippocampal animals did not appear sensitive to this manipulation and continued to perform both actions at a comparable and somewhat depressed rate.
GENERAL DISCUSSION
The general aim of this study was to assess the effects of dorsal hippocampal lesions on instrumental conditioning. Specifically, we investigated the effects of these lesions on the acquisition and maintenance of instrumental performance, sensitivity to changes in the value of the instrumental outcome, and sensitivity to degradation of the instrumental contingency.
The results indicate that these hippocampal lesions had no effect on the acquisition of the instrumental action. The lesioned animals did not differ from shams in either the rate of acquisition of the lever press response or the level of responding at the end of training. Furthermore, the hippocampal lesioned rats were just as sensitive as sham controls to devaluation of the instrumental outcome. Indeed, in experiment 1B, it was found that lesioned animals still showed this selective devaluation effect even when the sole factor discriminating between the two outcomes was a motivationally irrelevant factor: the flavor. These results provide evidence that animals with lesions of the hippocampus both encode the value of an outcome and are able to establish an A-O association that controls their selective responding on test.
Of greater interest, however, is the finding of experiment 2 that hippocampal rats were relatively insensitive to treatments that ensured the degradation of the action–outcome contingency. When evaluated in a two-lever choice extinction test, intact animals decreased responding on the lever previously associated with the noncontingent reinforcer relative to the other lever, whereas rats with hippocampal lesions continued to respond at equal rates on both levers. These results tend to confirm, therefore, the suggestion of Devenport and colleagues (Devenport, 1979; Devenport and Holloway, 1980) that the hippocampus is critical for the detection of the causal relationship between an action and outcome. As suggested by experiment 2, without an intact hippocampus, animals continue to respond as though reward is dependent on their actions, even when one action is no longer causal with respect to the delivery of its specific outcome. Thus, it appears that the hippocampus plays a relatively specific role in instrumental conditioning; without an intact hippocampus, animals seem unable to differentiate between actions that are causal with respect to their associated outcomes and those that are merely adventitiously related. Because it is the learning of this relationship that critically distinguishes instrumental learning from other forms of relational learning, this interpretation may be thought to accord with the suggested role of the hippocampus in declarative memory (Squire and Zola-Morgan, 1996).
However, when considered together with the data from the devaluation experiments above, the interpretation becomes less clear. In experiments 1A and 1B, it was demonstrated that animals with hippocampal lesions selectively decrease their responding after the devaluation of one instrumental outcome. This shows that they must still depend on an intact A-O association. The deficit in hippocampal animals is not their ability to associate their actions with subsequent outcomes but their sensitivity to the probability of outcome delivery in the absence of an action. An alternative explanation is therefore that animals with hippocampal lesions are unable to calculate background rates of reinforcement. Whereas contingent reinforcers are best predicted by an A-O association, noncontingent reinforcers are better predicted by the training context, and thus the observed deficit in hippocampal animals may be attributable to a failure to form a context–outcome association. Rather than mediating the encoding of causal as opposed to chance relationships between events, the role of the hippocampus is often thought to be to encode the context within which two events are related (Jarrard, 1995; Holland and Bouton, 1999). It is important, therefore, to consider the degree to which the current data accord with this kind of position.
Although often obscured within cognitive analyses of relational learning, considerable evidence suggests that the ability to encode situational cues (i.e., contexts) is essential if animals are to discern causal from merely contiguous relationships between events. This point was recognized quite early on in the associative analysis of Pavlovian conditioning and subsequently encapsulated within the concept of predictive validity (Dweck and Wagner, 1970; Rescorla and Wagner, 1972). According to this position, the learning of predictive relationships between events is subject to a competitive process; encoding one event as predictive of another is subject to competition from other events and so is determined by how good a predictor it is relative to those other events. If, for example, a hungry rat is strongly motivated to predict the delivery of food, then learning that food follows the presentation of, say, a tone would accord with that need, and an association between the tone and delivery of food is likely to be formed. However, if the food is also delivered at other times in the absence of the tone, this association is expected to be substantially weaker. It was argued, and subsequently supported by experimental evidence, that degrading the predictive status of the tone was based on the animal learning to encode the relationship between the food and the background or situational cues in which the noncontiguous food presentations occurred (Dweck and Wagner, 1970; Baker, 1990).
In the current situation, a comparable analysis is available. Thus, for example, when the delivery of a specific food outcome follows the performance of an instrumental action and at no other time, the rat appears to learn that the performance of the action is causally related to the delivery of that outcome. However, when the outcome is also delivered unpaired with performance of the action, any association between the context and the food will naturally compete with the action for association with food delivery and so degrade its causal relationship with the outcome. Indeed, when an outcome is equally probable in the presence of an action (A) and in the presence of a context cue (C) alone, it is clear, given that the action is also performed in C (i.e., AC), that C is a better predictor of the food than is A.
This argument suggests that sensitivity to noncontingent outcome delivery should strongly depend on the ability of rats to form context–outcome associations. If hippocampal animals fail to form an association between the context and the noncontingent outcome, they may associate the delivery of both outcomes only with their actions. This account would explain why the lesioned animals in the current study maintain equal response rates for the contingent and noncontingent outcomes and is consistent with the suggestion that the hippocampus is important for contextual retrieval (Hirsh, 1974).
There are, in fact, numerous experiments that suggest that animals with lesions of the hippocampus show deficits in contextual conditioning (Honey and Good, 1993; Maren and Fanselow, 1997) (but see Holland, 1997). For example, Honey and Good (1993) found that hippocampal lesions did not impair the initial acquisition of conditioned responding but did disrupt the contextual specificity of conditioning in a latent inhibition paradigm. Additionally, there is evidence that, despite their impairment in contextual conditioning, hippocampal animals remain able to form associations with discrete cues (Jarrard, 1993). Related findings have been reported by Winocur and Olds (1978)who found that a reversal learning deficit in hippocampal animals could be reduced if external cues were added, which served to enhance the discriminability of successive tasks and minimized the need for context retrieval. This argument predicts, therefore, that if a discrete cue were used to signal delivery of both the outcomes that are paired with performance of the instrumental action and those that are unpaired, this cue would be a better predictor of that outcome than either the action or the context. Given that hippocampal animals are able to form associations with such cues, this should alleviate their deficit, and these lesioned animals should then to be as sensitive to a change in the instrumental contingency as intact controls.
In summary, lesions of the dorsal hippocampus produce an interesting set of effects on instrumental performance. Although the initial acquisition of the instrumental response appears normal, animals with hippocampal lesions do not modify their behavior when the A-O contingency is selectively degraded. This is not because of a simple failure to encode the difference between two actions, two outcomes, or even the A-O association because these animals were shown to be able to selectively control their responding when one of the two outcomes was devalued. What appears to be impaired, therefore, is the ability of the hippocampal animals to distinguish between contingent and free reinforcers. Although this may suggest that hippocampal animals are unable to encode causal relationships between their actions and their consequences, it remains a possibility that this effect reflects a failure of hippocampal animals to calculate background rates of reinforcement based on the formation of context–outcome associations. It is an important aim for future experimentation to differentiate between these two accounts of the role of the hippocampus in instrumental conditioning.
Footnotes
This work was supported by National Institute of Mental Health Grant MH 56446. We thank Sandra Cetl and Chris Park who assisted with data collection.
Correspondence should be addressed to Laura Corbit, Department of Psychology, University of California, Los Angeles, Box 951563, Los Angeles, CA 90095. E-mail: corbit@ucla.edu.
REFERENCES
- 1.Adams CD. Variations in the sensitivity of instrumental responding to reinforcer devaluation. Q J Exp Psychol. 1982;34B:77–98. [Google Scholar]
- 2.Baker AG. Contextual conditioning during free-operant extinction: unsignaled, signaled, and backward-signaled noncontingent food. Anim Learn Behav. 1990;18:59–70. [Google Scholar]
- 3.Balleine BW, Dickinson A. The role of incentive learning in instrumental outcome revaluation by sensory-specific satiety. Anim Learn Behav. 1998a;26:46–59. [Google Scholar]
- 4.Balleine BW, Dickinson A. Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology. 1998b;37:407–419. doi: 10.1016/s0028-3908(98)00033-1. [DOI] [PubMed] [Google Scholar]
- 5.Colwill RC, Rescorla RA. Associative structures in instrumental learning. In: Bower GH, editor. The psychology of learning and motivation, Vol 20. Academic; New York: 1986. pp. 55–104. [Google Scholar]
- 6.Devenport LD. Superstitious bar pressing in hippocampal and septal rats. Science. 1979;205:721–723. doi: 10.1126/science.462183. [DOI] [PubMed] [Google Scholar]
- 7.Devenport LD. Response-reinforcer relations and the hippocampus. Behav Neural Biol. 1980;29:105–110. doi: 10.1016/s0163-1047(80)92526-1. [DOI] [PubMed] [Google Scholar]
- 8.Devenport LD, Holloway FA. The rat's resistance to superstition: role of the hippocampus. J Comp Physiol Psychol. 1980;4:691–705. doi: 10.1037/h0077703. [DOI] [PubMed] [Google Scholar]
- 9.Dickinson A. Contemporary animal learning theory. Cambridge UP; Cambridge: 1980. [Google Scholar]
- 10.Dickinson A, Balleine BW. Actions and responses: the dual psychology of behaviour. In: Eilan N, McCarthy R, Brewer MW, editors. Spatial representation. Basil Blackwell; Oxford: 1993. pp. 277–293. [Google Scholar]
- 11.Dickinson A, Balleine BW. Motivational control of goal-directed action. Anim Learn Behav. 1994;22:1–18. [Google Scholar]
- 12.Dickinson A, Balleine BW. Motivational control of instrumental action. Curr Direct Psychol Sci. 1995;4:162–167. [Google Scholar]
- 13.Dickinson A, Mulatero CW. Reinforcer specificity of the suppression of instrumental performance on a non-contingent schedule. Behav Processes. 1989;19:167–180. doi: 10.1016/0376-6357(89)90039-9. [DOI] [PubMed] [Google Scholar]
- 14.Dweck CS, Wagner AR. Situational cues and correlation between CS and US as determinants of the conditioned emotional response. Psychon Sci. 1970;18:145–147. [Google Scholar]
- 15.Eichenbaum H, Schoenbaum G, Young B, Bunsey M. Functional organization of the hippocampal memory system. Proc Natl Acad Sci USA. 1996;93:13500–13507. doi: 10.1073/pnas.93.24.13500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Hammond LJ. The effect of contingency upon the appetitive conditioning of free-operant behavior. J Exp Anal Behav. 1980;34:297–304. doi: 10.1901/jeab.1980.34-297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Hirsh R. The hippocampus and contextual retrieval from memory: a theory. Behav Biol. 1974;12:421–444. doi: 10.1016/s0091-6773(74)92231-7. [DOI] [PubMed] [Google Scholar]
- 18.Holland PC. Brain mechanisms for changes in processing of conditioned stimuli in Pavlovian conditioning: implications for behavior theory. Anim Learn Behav. 1997;25:373–399. [Google Scholar]
- 19.Holland PC, Bouton ME. Hippocampus and context in classical conditioning. Curr Opin Neurobiol. 1999;9:195–202. doi: 10.1016/s0959-4388(99)80027-0. [DOI] [PubMed] [Google Scholar]
- 20.Honey RC, Good M. Selective hippocampal lesions abolish the contextual specificity of latent inhibition and conditioning. Behav Neurosci. 1993;107:23–33. doi: 10.1037//0735-7044.107.1.23. [DOI] [PubMed] [Google Scholar]
- 21.Jarrard LE. On the role of the hippocampus in learning and memory in the rat. Behav Neural Biol. 1993;60:9–26. doi: 10.1016/0163-1047(93)90664-4. [DOI] [PubMed] [Google Scholar]
- 22.Jarrard LE. What does the hippocampus really do? Behav Brain Res. 1995;71:1–10. doi: 10.1016/0166-4328(95)00034-8. [DOI] [PubMed] [Google Scholar]
- 23.Maren S, Fanselow MS. Electrolytic lesions of the fimbria/fornix, dorsal hippocampus, or entorhinal cortex produce anterograde deficits in contextual fear conditioning in rats. Neurobiol Learn Mem. 1997;67:142–149. doi: 10.1006/nlme.1996.3752. [DOI] [PubMed] [Google Scholar]
- 24.McDonald RJ, White NM. A triple dissociation of memory systems: Hippocampus, amygdala, and dorsal striatum. Behav Neurosci. 1993;107:3–22. doi: 10.1037//0735-7044.107.1.3. [DOI] [PubMed] [Google Scholar]
- 25.Paxinos G, Watson C. The rat brain in stereotaxic coordinates. Academic; San Diego: 1998. [DOI] [PubMed] [Google Scholar]
- 26.Rescorla RA. Instrumental responses become associated with reinforcers that differ in one feature. Anim Learn Behav. 1990;18:206–211. [Google Scholar]
- 27.Rescorla RA, Wagner AR. A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and non-reinforcement. In: Black AH, Prokasy WF, editors. Classical conditioning. II. Current research and Theory. Appelton-Century-Crofts; New York: 1972. pp. 64–99. [Google Scholar]
- 28.Squire LR. Memory and the hippocampus: a synthesis from findings with rats, monkeys, and humans. Psychol Rev. 1992;99:195–231. doi: 10.1037/0033-295x.99.2.195. [DOI] [PubMed] [Google Scholar]
- 29.Squire LR, Zola-Morgan S. Structure and function of declarative and nondeclarative memory systems. Proc Natl Acad Sci. 1996;93:13515–13522. doi: 10.1073/pnas.93.24.13515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Winograd T. Frames representations and the declarative-procedural controversy. In: Bobrow DG, Collins A, editors. Representation and understanding. Academic; New York: 1975. pp. 185–210. [Google Scholar]
- 31.Winocur G, Olds J. Effects of context manipulation on memory and reversal learning in rats with hippocampal lesions. J Comp Physiol Psychol. 1978;92:312–321. doi: 10.1037/h0077468. [DOI] [PubMed] [Google Scholar]