Feature- versus rule-based generalization in rats, pigeons and humans

Elisa Maes; Guido De Filippo; Angus B Inkster; Stephen E G Lea; Jan De Houwer; Rudi D’Hooge; Tom Beckers; Andy J Wills

doi:10.1007/s10071-015-0895-8

. 2015 Jul 19;18(6):1267–1284. doi: 10.1007/s10071-015-0895-8

Feature- versus rule-based generalization in rats, pigeons and humans

Elisa Maes ¹, Guido De Filippo ^2,³, Angus B Inkster ⁴, Stephen E G Lea ³, Jan De Houwer ⁵, Rudi D’Hooge ⁷, Tom Beckers ^1,^6,^✉, Andy J Wills ^4,^✉

PMCID: PMC4607717 PMID: 26188712

Abstract

Humans can spontaneously create rules that allow them to efficiently generalize what they have learned to novel situations. An enduring question is whether rule-based generalization is uniquely human or whether other animals can also abstract rules and apply them to novel situations. In recent years, there have been a number of high-profile claims that animals such as rats can learn rules. Most of those claims are quite weak because it is possible to demonstrate that simple associative systems (which do not learn rules) can account for the behavior in those tasks. Using a procedure that allows us to clearly distinguish feature-based from rule-based generalization (the Shanks–Darby procedure), we demonstrate that adult humans show rule-based generalization in this task, while generalization in rats and pigeons was based on featural overlap between stimuli. In brief, when learning that a stimulus made of two components (“AB”) predicts a different outcome than its elements (“A” and “B”), people spontaneously abstract an opposites rule and apply it to new stimuli (e.g., knowing that “C” and “D” predict one outcome, they will predict that “CD” predicts the opposite outcome). Rats and pigeons show the reverse behavior—they generalize what they have learned, but on the basis of similarity (e.g., “CD” is similar to “C” and “D”, so the same outcome is predicted for the compound stimulus as for the components). Genuinely rule-based behavior is observed in humans, but not in rats and pigeons, in the current procedure.

Electronic supplementary material

The online version of this article (doi:10.1007/s10071-015-0895-8) contains supplementary material, which is available to authorized users.

Keywords: Rats, Pigeons, Humans, Generalization, Rule-based, Associative models

Introduction

Across the animal kingdom, organisms are capable of transferring what they have learned about a certain stimulus to novel stimuli. Generalizing newly acquired behavior is an important part of learning and allows the organism to respond quickly and adaptively. In the current article, we consider two types of generalization. First, generalization might be based on the perceptual features of stimuli. For example, when a tone (stimulus A) is followed by a shock, conditioned fear will generalize to another tone (stimulus B) to the extent that A and B are perceptually similar. If generalization is based on the perceptual features of stimuli, then it is said that generalization is feature-based. The second hypothesized type of generalization is rule-based. Humans can spontaneously create rules, which are not easily reducible to perceptual features, and which allow for efficient generalization of what is learned to novel situations (see below). The main question of this article is whether this rule-based route is uniquely human, as has been posited by some researchers (e.g., Penn et al. 2008).

Feature-based generalization is easily captured by association-formation theories, which state that when a stimulus (e.g., stimulus A) is presented, a set of representational elements is activated. Those elements might encode distinct features of stimulus A such as its pitch, duration, intensity, spatial location. When stimulus B is presented, some of the representational elements that are activated might be identical to those activated by stimulus A. The amount of generalization from stimulus A to stimulus B would then be a function of the number or proportion of elements A and B have in common (and/or the number or proportion of differences). The higher the featural overlap between A and B, the more generalization will be observed (e.g., Estes 1955; McLaren and Mackintosh 2000, 2002; Rescorla and Wagner 1972; Thorndike 1911; Tversky 1977). Other association-formation theories are based on variants of this general notion but incorporate additional assumptions about how exactly featural overlap is determined (e.g., Pearce 1994). In the current experiments, the latter theories make similar predictions to purely element-based accounts.

However, not all generalization outcomes observed in humans can be explained on the basis of featural similarity. Some instances of generalization seem instead to be rule-based and involving more complex cognitive mechanisms. In light of the enduring debate on the cognitive capacities of non-human animals, it has been suggested that rule-based generalization may be a uniquely human capacity (e.g., Penn et al. 2008). Hierarchies of cognitive ability have often been constructed on the basis of learning differences in abstract concepts and relational learning tasks (e.g., Wright 2010). However, as we will point out, much of this evidence has been inconclusive since viable associative explanations have not been ruled out convincingly.

Researchers have investigated whether pigeons can create arbitrary categories based on common consequences and then generalize within such categories. The general idea in those experiments is that if arbitrary categories of perceptually different stimuli are formed based on a common outcome (Vaughan 1988) or a common response (Wasserman et al. 1992), then changing the outcome or the required response for a subset of stimuli from one category should generalize to the other stimuli of the same category. Both Vaughan and Wasserman have observed such a generalization effect. However, if it is assumed that during generalization training, the presentation of a stimulus activates the representation of the response, which becomes associated with the new response, then association-formation models can explain generalization on the basis of common consequences (Wills et al. 2006).

A second line of research has focused on the ability to judge the relationship between two stimuli through an understanding of concepts such as same and different. It has been investigated whether pigeons (e.g., Blaisdell and Cook 2005; Katz and Wright 2006; Young and Wasserman 1997), rats (Wasserman et al. 2012), monkeys (e.g., Katz et al. 2002; Wright et al. 2003) and baboons (Fagot et al. 2001) can learn abstract concepts, such as same/different. Katz et al. (2007) have proposed several criteria that are important to rule out alternative explanations for abstract-concept learning. The procedure used by Blaisdell and Cook (2005) does not fulfill most criteria, e.g., due to questionable novelty of stimuli used during testing. Further, it seems that when multi-array stimuli are used [as in Fagot et al. 2001 (baboons), Wasserman et al. 2012 (rats), and Young and Wasserman 1997 (pigeons)], a simple measure of item variability can explain the behavior of the animals. Katz and Wright (2006) themselves have obtained evidence for same/different concept learning in pigeons, capuchin monkeys (Wright et al. 2003) and rhesus monkeys (Katz et al. 2002). However, it is possible that the pigeons in both the two-item same/different task (Katz and Wright 2006) and the matching-to-sample tasks (Bodily et al. 2008; Katz et al. 2008) performed the tasks by responding to recently seen items, because the target was always presented first followed by the choice options.

Rule-based generalization may also appear to underlie apparent analogical transfer, where the equivalence of the relationship between two sets of stimuli determines performance. Beckers and colleagues argued that rats can extract additivity rules and apply them to novel stimuli, shown as a modulation of the blocking effect by pretraining that provided information about the additivity of cues (Beckers et al. 2006). However, Haselgrove (2010) and Schmajuk and Kutlu (2010) suggested that the results of Beckers et al. (2006) can be accounted for by associative models (but see Guez and Stevenson 2011). Gillan and colleagues, reporting on the performance of the chimpanzee Sarah on both geometric and functional analogy problems, argued that she possessed the ability to reason on the basis of analogy (Gillan et al. 1981). In follow-up experiments, it was shown that Sarah could not only complete analogy problems, but could also construct analogies (Oden et al. 2001). However, as Penn et al. (2008) argue, replication and further examination of the underlying mechanisms are probably merited. Similar arguments apply to reports that an African grey parrot, Alex, can name the attribute on which a pair of objects are the same or different (Pepperberg 1987). Thus, a few observations suggest the presence of relational learning in animals, but further research is required.

Evidence from procedures developed to specifically investigate rule-based generalization seems to be mixed as well. While Preston (1986) did not find support for the generalization of a contextual rule, Murphy et al. (2008) did find that rats are able to generalize very basic sequential rules. On the other hand, several experiments point to the conclusion that pigeons are very efficient rote learners, but fail to learn overarching rules or concepts (Mackintosh 1988). The criterial-attribute procedure (Kemler Nelson 1984) and procedures based on the COVIS (COmpetition between Verbal and Implicit Systems; Ashby et al. 1998) framework, both originally aimed at investigating rule-based versus feature-based categorization in humans, have subsequently been used in comparative studies. Humans show rule-based generalization in the criterial-attribute procedure, while feature-based responding was observed in macaques (Couchman et al. 2010). However, recent work indicates that these conclusions may be an artifact of the inadequate analysis techniques employed (Wills et al. 2015) and comparative studies using less confounded techniques have found comparable levels of feature-based generalization responding across pigeons, squirrels and undergraduates (Wills et al. 2009). Similarly, in experiments based on the COVIS framework, it has been suggested that rule-based processes are available to humans (for a review see Ashby and Maddox 2005), and macaques (Smith et al. 2011), but not to pigeons (Smith et al. 2010). However, the evidence in humans has been challenged (e.g., Newell et al. 2011) and a number of issues have been raised with the results of the pigeon study (Edmunds et al. 2015). To complicate matters further, both in the criterial-attribute procedures and in comparative studies within the COVIS framework, the purportedly “rule-based” and “feature-based” behaviors also differ in the number of stimulus dimensions relevant for the different routes (Edmunds et al. 2015). For rule-based categorization, only one stimulus dimension is relevant, while for feature-based categorization multiple dimensions are relevant. This difference in dimensionality is problematic when considering the possibility that non-rule-based systems may have some mechanism of dimensional attention (e.g., Sutherland and Mackintosh 1971; Kruschke 1992). In other words, the seemingly rule-based responding in these procedures is explicable within an associative account under the assumption that participants attend to and learn about a subset of features (perhaps the most diagnostic features; Kruschke 1992). In consequence, those procedures do not allow us to clearly disentangle feature-based and rule-based mechanisms, so the controversy regarding the cognitive capacities of non-human animals remains.

In the human literature, there is one procedure for which nearly everyone on both sides of the debate agrees that rule-based generalization in this task is beyond simple associative accounts, the Shanks–Darby procedure. Shanks and Darby (1998), building on earlier work by Lachnit and Kimmel (1993), tested generalization after training on negative and positive patterning problems in human predictive learning. In negative patterning (NP) problems, stimuli A and B individually predict a certain outcome, but not when presented in compound (A+, B+, AB−). In positive patterning (PP) problems, a compound of two stimuli predicts an outcome, while the components do not (C−, D−, CD+). A general rule characterizes both patterning problems, namely compounds have the opposite outcome to their individual components (henceforth, an opposites rule). In the experiment of Shanks and Darby (1998), participants received training with complete positive and negative patterning problems, as well as incomplete positive and negative patterning problems. For example, in addition to training on A+, B+, AB−, C−, D− and CD+, participants saw I+ and J+, but not IJ and saw KL−, but not K or L. During testing, participants were confronted with the stimuli omitted during training. If generalization were feature-based, participants should predict the outcome on IJ trials, but not on K and L trials. A subset of participants, however, did not predict the outcome on IJ trials, but did predict the outcome on K and L trials—a pattern consistent with the opposites rule present in the training patterns. Participants who reached a high level of accuracy during training showed a generalization pattern consistent with an opposites rule, while participants that performed less well on the trained patterns showed a generalization pattern consistent with featural overlap.

Non-human animals have been shown to be capable of solving positive and negative patterning problems, even simultaneously (Dopson et al. 2011; Grand and Honey 2008; Harris et al. 2008; North and Price 1959; Pearce and George 2002). However, mastery of positive and negative patterning problems per se can be explained on the basis of associative mechanisms. For example, according to some association-formation theories, compounds generate configural cues, which emerge from the unique combination of A and B, and which in turn activate certain elements that are unique for the compound and are not shared with the components (Spence 1952). Negative patterning can then be solved by assuming that a configural cue, emerging from the combination of A and B, acquires strong inhibitory strength that cancels the combined excitatory strengths of the components A and B (Rescorla 1972). Thus, the evidence that animals can solve positive and negative patterning problems does not necessarily imply that they have also learned the underlying rule. Association-formation theories cannot, however, account for the rule-based generalization following successful simultaneous positive and negative patterning discrimination observed in humans. After all, when a new compound is presented for the first time, the configural cue has not yet gained any associative strength and therefore responding should depend entirely on generalization from the components to the compound (i.e., feature-based generalization).

Despite the clear superiority of the Shanks and Darby procedure over other procedures to test for rule-based generalization, to the best of our knowledge there are no reports of this paradigm being utilized with non-human animals. There is one report, by Davidson et al. (1993), where generalization of a negative patterning problem in rats was investigated, but generalization after simultaneous acquisition of a positive and negative patterning problems has never been tested in non-humans. Apparently rule-based generalization after mere negative patterning discrimination learning can be explained associatively, because low responding to the generalization compound could be explained by assuming that the inhibitory strength gained by the compound during the training phases generalized to the test compounds (on the assumption that compounds are more similar to other compounds than to non-compound stimuli). Our aim in the present studies, therefore, was to investigate whether non-human animals, rats (“Experiment 1A”) and pigeons (“Experiment 2A”), would be able to demonstrate generalization of negative and positive patterning rules. The conditions faced by the animals in the two experiments described here were quite different from the conditions ordinarily present in human studies of generalization of patterning rules. To allow for a fair comparison between the capacities of humans on the one hand and rats and pigeons on the other hand, we conducted two analog studies in humans that mimicked the conditions of the animal experiments as closely as possible (“Experiment 1B” and “Experiment 2B”).

Experiment 1A: rats

In Experiment 1A, two groups of rats were trained on a negative patterning (A+, B+, AB−) and a positive patterning (C−, D−, CD+) problem simultaneously, in an operant conditioning procedure. One group was then trained on an incomplete positive patterning problem (E−, F−), while the other group was trained on an incomplete negative patterning problem (E+, F+). The crucial test consisted out of presentations of the novel compound (EF). According to feature-based models of generalization, responding to the novel compound should be similar to responding to its components (thus high for those animals for which E and F were reinforced and low for those animals for which E and F were not reinforced). If, on the other hand, rats were able to detect and apply the opposites rule, the reverse pattern should be observed, that is higher responding to the EF compound if E and F were not reinforced and vice versa.

Methods

Subjects

The subjects were 24 experimentally naïve female Sprague–Dawley rats obtained from Janvier (France), with body weights ranging between 256 and 303 g at the start of training. Subjects were randomly assigned to one of the two groups (Ns = 12). The animals were pair housed in standard cages in a colony room that was illuminated from 8:00 a.m. to 8:00 p.m. The animals were allowed free access to food pellets (Sniff Spezialdiäten GmbH, Soest, Germany), whereas water availability was limited to 20 min per day following a progressive deprivation schedule initiated 1 week prior to the start of the study.

Apparatus

Eight standard operant chambers (34 cm length × 33 cm width × 33 cm height; Coulbourn Instruments, Leigh Valley, PA) housed in sound- and light-shielding cabinets (Coulbourn Instruments, Leigh Valley, PA) were used. All chambers had metal ceilings and side walls and clear Plexiglas front and back walls. The floor was made of stainless steel grids (0.5 cm in diameter). On one metal wall of each chamber, there was an operant lever, and adjacent to it was a recess (4 cm × 3 cm) centered 2 cm above the floor. A liquid dipper could deliver 0.04 cc of water into the bottom of the recess. Two speakers were mounted on each side wall. One was used to deliver a white noise at an intensity of approximately 73 dB(C). The second speaker was used to produce two tones, a low, pulsing tone [1000 Hz, 0.2 s on, 0.2 s off, ~79 dB(C)] or a high, complex tone [5000 Hz (0.6 s on, 0.1 s off) and 7000 Hz (0.6 s off, 0.1 s on), ~70 dB(C)]. A clicker was able to deliver a clicking sound, at an intensity of approximately 72 dB(C). A buzzer was used to deliver a buzzing sound, at an intensity of approximately 77 dB(C). The operation of a ventilation fan for each chamber contributed to the background level of noise that was approximately 65 dB(C). A light bulb, placed above the lever, was used to deliver a flashing light. Each chamber was illuminated by a dim house light placed on the opposite side of the light bulb. Those six different stimuli formed three sets of stimulus pairs: buzzer and flashing light (pair 1), low tone and house light turning off (pair 2) and high, complex tone and clicker (pair 3). Thus, two of the three compounds consisted of an auditory and a visual stimulus and one compound consisted of two auditory stimuli. All CSs were 30 s in duration. Water delivery was indicated by the onset of the white noise and the magazine light for 0.5 s.

Procedure

Before the beginning of the experiment, the three different stimulus pairs were assigned to the roles of AB, CD and EF in a counterbalanced fashion, yielding six counterbalancing types (see Table 1). Animals were run in three squads of eight rats balanced with respect to experimental condition and counterbalancing type. Each session was 62 min long.

Table 1.

Design of Experiment 1A

Group	Phase 1
NP transfer	6 A+, 6 B+, 12 AB−, 6 C−, 6 D−, 12 CD+
PP transfer	6 A+, 6 B+, 12 AB−, 6 C−, 6 D−, 12 CD+
Group	Phase 2
NP transfer	2 A+, 2 B+, 4 AB−, 10 C−, 10 D−, 4 CD+, 8 E+, 8 F+
PP transfer	10 A+, 10 B+, 4 AB−, 2 C−, 2 D−, 4 CD+, 8 E−, 8 F−
Group	Phase 3
NP transfer	1 A+, 1 B+, 2 AB−, 2 C−, 2 D−, 2 CD+, 1 E+, 1 F+ / 2 EF− / 4 E−, 4 F−, 4 EF−
PP transfer	2 A+, 2 B+, 2 AB−, 1 C−, 1 D−, 2 CD+, 1 E−, 1 F− / 2 EF− / 4 E−, 4 F−, 4 EF−

Open in a new tab

The + represents 5-s access to 0.04 cc of water upon lever press, the − represents the absence of water; A/B, C/D and E/F represent buzzer/light off, clicker/low tone, and high tone/flashing light, counterbalanced. All stimulus presentations were 30 s in duration. The numbers represent the number of stimulus presentations per session. Commas separate interspersed trials, slashes separate different blocks of a phase that are not intermixed

Shaping

Standard procedures were used to train the rats to press the lever in order to obtain water. A fixed-time 120-s (FT-120-s) schedule of non-contingent water delivery was operated while the levers were retracted at the start of training; shaping ended on a variable interval 20-s (VI-20-s) schedule.

Phase 1

From days 1–27, rats received six presentations each of components A, B, C and D and twelve presentations each of compounds AB and CD (see Table 1). Stimuli A, B and the compound CD were followed by 0.04 cc of water accessible for 5 s upon lever press. Lever pressing during the components C and D and the compound AB was not reinforced. For the first five days, reinforcement was delivered on a continuous reinforcement (CRF) schedule. For the next 3 days (days 6–8), reinforcement was delivered on a variable ratio (VR) 2 schedule. Thereafter, reinforcement was delivered on a VR 4 schedule.

Trial order was semi-random so that no more than two trials of the same type and no more than four reinforced or unreinforced trials appeared in a row. The intertrial interval (ITI) ranged from 35 to 55 s with an average of 45 s. For the first 7 days of this phase, the lever was retracted during the ITI. After those 7 days, the lever was present throughout the whole session.

Phase 2

From days 28–36, rats continued to be trained on the negative and positive patterning problems, but additionally received eight presentations each of the generalization stimuli E and F. For the PP transfer group, lever pressing during presentation of the components E and F was not reinforced, while pressing to those components was reinforced for the NP transfer group. The number of A, B, C and D component trials was not equal between groups (see Table 1) in order to keep outcome frequency at 50 % overall as well as for presentations of components (20 reinforced, 20 unreinforced) and compounds (4 reinforced, 4 unreinforced).

Phase 3 (test phase)

On day 37, during the first part of the test phase all animals received presentations of the complete negative and positive patterns and the incomplete patterning stimuli as before. In the second part of this phase, the EF compound was presented twice, without reinforcement. In the third part, four unreinforced presentations of E and F were intermixed with another four unreinforced presentations of EF (see Table 1). This session lasted for 40 min.

Data archiving

The session-level raw data are archived at www.willslab.co.uk/kulmaes1 with md5 checksum a4be13dfaa3476942874a930805a9198.1

Results

For the first phase, the mean number of responses (lever presses) made during the reinforced components A and B, the unreinforced components C and D, the reinforced compound CD and unreinforced compound AB are shown in Fig. 1. As can be seen, the mean number of responses made during the reinforced components and compound increased, while the number of responses made during the unreinforced components and compound decreased. Repeated-measures analysis of variance (ANOVA) with session and reinforcement (reinforced vs. unreinforced) as within-subject factors revealed an effect of reinforcement, F(1, 23) = 220.30, p < 0.01, $η_{partial}^{2}$ = 0.91, indicating an overall higher response rate to reinforced than unreinforced cues, a linear trend over sessions, F(1, 23) = 91.42, p < 0.01, $η_{partial}^{2}$ = 0.80, indicating an increasing response rate over training and an interaction between reinforcement and linear trend over sessions, F(1, 23) = 220.99, p < 0.01, $η_{partial}^{2}$ = 0.91, indicating an increase in discrimination between the reinforced and unreinforced stimuli over sessions. Follow-up analyses revealed that the response rate to the reinforced stimuli was higher than the response rate to the unreinforced stimuli from the fourth day of discrimination training onward, t(23) = 8.55, p < 0.01, 95 % confidence interval (CI) [1.21–1.99]. To investigate the apparent difference in speed of discrimination learning between NP and PP, an ANOVA with Session and Pattern (NP and PP) as within-subject factors was conducted on the difference between CS+ and CS− for each pattern. This analysis revealed an overall effect of Pattern, F(1, 23) = 12.62, p < 0.01, $η_{partial}^{2}$ = 0.35, a linear trend over sessions, F(1, 23) = 220.99, p < 0.01, $η_{partial}^{2}$ = 0.91, and an interaction between Pattern and linear trend over session, F(1, 23) = 6.79, p < 0.05, $η_{partial}^{2}$ = 0.23. These results indicate that the PP problem was learned more readily than the NP problem, as in previous reports (e.g., Harris et al. 2008, 2009). From the eighth day onwards, the lever was presented during the ITI and the number of responses during a 30-s prestimulus period was recorded. As can be seen in Fig. 1, the prestimulus response rate decreased over days.

Fig. 1 — Mean number of responses over 30 s during reinforced and unreinforced components and compounds across the 27 days of Phase 1 training and mean number of responses over all 30-s prestimulus periods from the eighth day onwards. Error bars represent within-subject standard error of the mean for each stimulus as calculated by the SPSS plug-in of O’Brien and Cousineau (2014)

During the second phase, the lever was available throughout the whole session and an elevation score was calculated for each stimulus as the mean number of responses during each component or compound stimulus presentation minus the mean number of responses during the 30-s prestimulus interval for that specific stimulus. Responding to components E and F was higher in group NP transfer than in group PP transfer, as shown in Fig. 2, top panel. Since this difference was already apparent on the first day, we also examined responding on each trial of the first day (Fig. 2, bottom panel). Responding increased over trials for the NP transfer group, while responding decreased in the PP transfer group. An ANOVA with trial as within-subject factor and group as between-subject factor revealed an interaction between group and linear trend over trials, F(1, 22) = 8.87, p < 0.01, $η_{partial}^{2}$ = 0.29. Planned comparisons revealed a linear trend over trials in both groups, although only marginally significant for group NP transfer [NP transfer: F(1, 11) = 3.91, p = 0.07, $η_{partial}^{2}$ = 0.26; PP transfer: F(1, 11) = 7.93, p < 0.05, $η_{partial}^{2}$ = 0.42], suggesting that rats in the NP transfer group learned to respond to the new components and rats in the PP transfer group learned to not respond to those components. The average number of all 30-s pre-CS responses on this day was 0.35.

Fig. 2 — Mean elevation scores over 30 s for the generalization components E and F for groups NP transfer and PP transfer across the eight days of Phase 2 training (a) and across all trials of the first Phase 2 training day (b). *Error bars* represent within-subject standard error of the mean with group as between-subject factor as calculated by the SPSS plug-in of O’Brien and Cousineau (2014)

During the actual test (Phase 3, parts 2 and 3), the EF compound was presented twice, unreinforced, followed by four unreinforced presentations of the components E and F, intermixed with four unreinforced presentations of the compound EF. The problem here is that extinction from the first two unreinforced presentations of EF might generalize to E and F (generalization of extinction effect), so that the response to E and F would be low. A lower response to E and F compared to EF might also be due to a higher chance to forget the E+/F+ training for E/F test trials than EF test trials. The crucial comparison is, therefore, the between-group difference in elevation score for the first presentation of EF. An independent t test revealed a higher elevation score for EF in the NP transfer group than in the PP transfer group t(11.06) = 10.82, p < 0.01, 95 % CI [26.82–40.51] (see Fig. 3). The average number of all 30-s pre-CS responses on this day was 0.54.

Fig. 3 — Mean elevation scores for the first 30-s presentation of the EF compound for groups NP transfer and PP transfer. *Error bars* represent standard error of the mean

Finally, we determined the apparent generalization strategy (feature- vs. rule-based) for each individual rat. For animals in the PP transfer group, a standard deviation (SD) was calculated based on the responses to the unreinforced trials of the first part of Phase 3 (2 AB−, 1 C−, 1 D−, 1 E−, 1 F−). Rats in this group were classified as rule-based if the number of responses to the first presentation of EF was at least one SD above the mean number of responses to the first presentations of E and F. For animals in the NP transfer group, a standard deviation (SD) was calculated based on the responses to the reinforced trials of the first part of Phase 3 (1 A+, 1 B+, 2 CD+, 1 E+, 1 F+). Rats in the NP transfer group were classified as rule-based if the number of responses to the first presentation of EF was at least one SD below the mean number of responses to the first presentations of E and F. Using this criterion, none of the rats were classified as rule-based generalizers.

Discussion

In this experiment, rats were trained on a positive and a negative patterning discrimination simultaneously. After 4 days of training, rats showed behavior consistent with having learned both the positive and negative patterning discriminations, which is considerably faster than published reports using purely Pavlovian training methods (Bussey et al. 2000; Harris et al. 2008, 2009). However, the use of an operant procedure in which the reinforcer is administered during the trial entails a potential problem. The first reinforcer delivered during a reinforced trial could serve as a cue for the availability of food during the remainder of the trial. This would lead to a high response rate on reinforced trials compared to unreinforced trials irrespective of any discrimination learning between the different stimuli (McDonald et al. 1997). There are two reasons for assuming that the rats did not rely solely on the presentation of the reinforcer to guide their behavior. Given that the reinforcer was delivered on a VR 4 schedule, on average four responses would be necessary to determine whether the trial would be reinforced or not. However, response rates to the unreinforced stimuli dropped below two by the end of Phase 1 (see Fig. 1). Moreover, high response rates to the EF compound were observed in the rats from the NP transfer group in the test phase, which was conducted under extinction (see Fig. 3), so that reinforcement could not serve as a cue for responding.

Despite the fact that the rats learned to solve the patterning problems quickly and reliably, generalization to the novel EF compound seemed to be fully feature-based. That is, elevation scores to the compound were higher in the NP transfer group than the PP transfer group. This is in sharp contrast with the human literature, where it has been shown that around 50 % of participants who learn to solve patterning problems generalize according to the opposites rule (Wills et al. 2011; see further analysis reported in Wills 2014).

A number of reasons might explain the discrepancy between the present results and the typical results in humans. The combination of auditory and visual cues might have made it more difficult for the rats to discern the underlying rule. Moreover, it might also limit generalization from an auditory–visual compound to an auditory–auditory compound. Also, by the time the generalization test was conducted, rats might have been overtrained on the patterning problems, which could have influenced retention of the rule. Another important note is that rats were trained on only one example each of positive and negative patterning, while humans are typically trained on at least two problems of each kind (Shanks and Darby 1998; Wills et al. 2011).

Experiment 1B: humans

In Experiment 1A, rats did not demonstrate rule-based generalization after training on one negative and one positive patterning problem. In the rats’ defense, it is not clear from the human literature whether humans would demonstrate rule-based generalization under the conditions faced by the rats in Experiment 1A. Therefore, we conducted a very similar study with human participants. As in the rat study, an operant procedure using both auditory and visual stimuli was employed to train the participants on a negative and a positive pattern as well as an incomplete negative or positive pattern. Because humans learn this kind of discrimination much more quickly than rats, the procedure was compressed into a single session.