Elemental, configural, and occasion setting mechanisms in biconditional and patterning discriminations

Andrew R Delamater; Eric Garr; Samantha Lawrence; Jesse W Whitlow, Jr

doi:10.1016/j.beproc.2016.10.013

Author manuscript; available in PMC: 2018 Apr 1.

Published in final edited form as: Behav Processes. 2016 Nov 5;137:40–52. doi: 10.1016/j.beproc.2016.10.013

PMCID: PMC5352505 NIHMSID: NIHMS830955 PMID: 27826037

Abstract

Three experiments explored the utility of considering mechanisms of occasion setting for understanding patterning and biconditional discriminations - two more complex conditional discriminations in which the stimulus-outcome relations of occasion setting are embedded. In Experiment 1, rats were trained in an appetitive conditioning task with either a biconditional or a patterning discrimination using relatively brief CSs (10 s) and differential outcomes as USs. In this study, rats learned the positive patterning task before they had learned negative patterning, and the biconditional task was the most difficult. However, a detailed examination of the results suggested that rats trained in the biconditional task responded to the stimulus compounds mainly on the basis of individual stimulus-outcome associations. Different conditioned response (CR) topographies as a function of reinforcer type complicated interpretation of these results. Experiment 2 confirmed that the biconditional task, with the parameters used here, was not learned, regardless of whether training involved differential or non-differential outcomes. In Experiment 3 the CS duration was increased to 30 s and two different USs were used that each supported similar CR topographies. Under these conditions, we observed that whereas the positive patterning task was learned most rapidly, the biconditional discrimination was learned faster than the negative patterning task. Considered in relation to other findings on patterning and biconditional discriminations, the results suggest that elemental, configural, and/or modulatory occasion setting mechanisms may play different roles in these complex conditional discrimination tasks especially as a function of stimulus duration and differential outcome training.

Keywords: Configural learning, modulation, conditional discrimination learning, differential outcomes, appetitive conditioning, rats

Occasion setting has been a remarkable stimulant for both empirical and theoretical work on the nature of associative learning, generating an extensive literature in the past 30 plus years that has substantially broadened the way we think of conditioning paradigms. Here we examine the potential involvement of occasion setting mechanisms in certain kinds of complex discrimination learning tasks and ask whether we can gain insight into the mechanisms by which those complex discriminations are solved.

Our investigation starts from the recognition that the feature positive/feature negative discriminations used to demonstrate occasion setting can be thought of as the simplest of a nested set of conditional discriminations that increase in complexity. The basic idea is illustrated in Table 1, which shows arrangements of these conditional discrimination tasks (feature positive/feature negative, positive and negative patterning, ambiguous feature positive/feature negative, and biconditional discriminations) that emphasize the way the simpler discriminations are embedded in the more complex ones.

Table 1.

Comparisons among feature positive/negative, positive/negative patterning, ambiguous occasion setting, and biconditional discriminations

Feature Negative:	B+, AB−
Feature Positive:		D−, CD+
Negative Patterning:	A+, B+, AB−
Positive Patterning:		C−, D−, CD+
Ambiguous Occasion Setting:	AC+, C−, B+, AB−	AC−, A+, D−, CD+
Biconditional:	AC+, CD−, BD+, AB−	AC−, AB+, BD−, CD+

Consideration of the relations among different complex discriminations has recently become a matter of theoretical interest because such comparisons help distinguish among accounts of associative learning. The importance of comparing these tasks was initially highlighted in the contemporary literature by two papers published in 2008 by Justin Harris and his colleagues. Harris and Livesey (2008) showed with human subjects and Harris, Livesey, Gharaei and Westbrook (2008) showed with rat subjects that biconditional discriminations were more difficult to learn than were positive and negative patterning tasks. These findings were particularly notable because they were the first to test directly a prediction made by the Rescorla-Wagner theory (1972), among others, that the biconditional task should be more easily learned than the negative patterning task. As will be described below, the predicted advantage of the biconditional over negative patterning reflects the assumption in the Rescorla-Wagner model that these discriminations are learned through associations to configural cues. Consequently, the failure to confirm the prediction was taken by Harris and Livesey (2008) and Harris et al. (2008) as evidence that configural cues did not play a role in learning the discriminations. Because occasion setting might itself reflect the contribution of configural cues (e.g., Wilson & Pearce, 1989; Brandon & Wagner, 1998; Wagner & Brandon, 2001) rather than the operation of other “modulatory” mechanisms (e.g., Bonardi, 1998; Bouton & Nelson, 1998; Delamater, 2012; Holland, 1985; Rescorla, 1985; Schmajuk, Lamoureux, & Holland, 1998), we think it is important for the understanding of occasion setting to have a clear idea of how to interpret procedures that purport to show the role of configural cues.

Configural cues in complex discriminations

According to the Rescorla-Wagner theory, both the negative patterning and the biconditional tasks usually require the involvement of configural cues for successful solution of the discriminations. This requirement is readily seen in the case of the biconditional, with 4 trial types represented as AC+, BD+, AB− and CD−. Each component stimulus, A, B, C and D, is equally often reinforced (+) and non-reinforced (−), and all stimuli appear in compounds. Thus, the only way to differentiate between the reinforced and non-reinforced compounds is to identify the specific stimulus configurations. In the case of negative patterning, with 3 trial types represented as A+, B+, and AB−, the component stimuli are also equally often reinforced and non-reinforced, and successful discrimination requires learning to suppress responding to the compound despite consistent reinforcement of the component stimuli when they are presented alone.

The prediction of the Rescorla-Wagner theory (Rescorla & Wagner, 1972; Wagner & Rescorla, 1972) that biconditional discriminations should be easier than negative patterning arises from the different demands of the two tasks. In the biconditional task, differential responding will occur as soon as the configural cues for the reinforced compounds have more excitatory strength than the configural cues for the non-reinforced compounds; the strengths of the component stimuli are essentially neutralized. In the negative patterning task, however, correct differential responding will occur only after the configural cue has acquired sufficient inhibitory strength to outweigh the combined excitatory strengths of the component stimuli. Since that inhibitory strength will only be established after excitatory strength develops to the component stimuli, this learning will proceed relatively slowly.
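The argument above can be illustrated with a small simulation. The following is a minimal sketch rather than an analysis reported in this paper. It assumes the standard Rescorla-Wagner update with a single learning rate (alpha = 0.1), asymptotes of 1 on reinforced and 0 on non-reinforced trials, one configural cue per compound with the same salience as the elements, and a fixed trial order. All function names, parameter values, and the 0.5 criterion are illustrative assumptions.

```python
# Minimal Rescorla-Wagner sketch. Assumptions (not from the paper): one shared
# learning rate, lambda = 1 on reinforced and 0 on non-reinforced trials,
# configural cues as salient as elements, and a fixed trial order.

def compound(*elements):
    """Cues active on a trial: the elements plus, for compounds, one configural cue."""
    if len(elements) == 1:
        return tuple(elements)
    return tuple(elements) + ("cfg:" + "".join(elements),)

BICONDITIONAL = [(compound("A", "C"), 1.0), (compound("B", "D"), 1.0),
                 (compound("A", "B"), 0.0), (compound("C", "D"), 0.0)]
NEG_PATTERNING = [(compound("A"), 1.0), (compound("B"), 1.0),
                  (compound("A", "B"), 0.0)]

def strength(V, cues):
    """Summed associative strength of all cues present on a trial."""
    return sum(V.get(c, 0.0) for c in cues)

def rw_train(trials, n_cycles, alpha=0.1):
    """Run n_cycles passes through the trial list; return a copy of V after each pass."""
    V, history = {}, []
    for _ in range(n_cycles):
        for cues, lam in trials:
            delta = alpha * (lam - strength(V, cues))  # shared prediction error
            for c in cues:
                V[c] = V.get(c, 0.0) + delta
        history.append(dict(V))
    return history

def discrimination(V, trials):
    """Mean strength on reinforced trials minus mean strength on non-reinforced trials."""
    pos = [strength(V, cues) for cues, lam in trials if lam > 0]
    neg = [strength(V, cues) for cues, lam in trials if lam == 0]
    return sum(pos) / len(pos) - sum(neg) / len(neg)

def cycles_to_criterion(trials, criterion=0.5, max_cycles=500):
    """First training cycle at which the discrimination score reaches the criterion."""
    for i, V in enumerate(rw_train(trials, max_cycles), start=1):
        if discrimination(V, trials) >= criterion:
            return i
    return None

if __name__ == "__main__":
    print("biconditional:", cycles_to_criterion(BICONDITIONAL))
    print("negative patterning:", cycles_to_criterion(NEG_PATTERNING))
```

Under these assumptions the biconditional score is positive from the first cycle, whereas the negative patterning score is initially negative (the compound is stronger than either element) and reaches the criterion only later, in line with the verbal argument.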

Harris and his colleagues have emphasized an alternative way to conceptualize the nature of the stimulus components in complex discriminations. Specifically, they have followed Estes’ (1950) approach in stimulus sampling theory and represented stimuli as collections of hypothetical microstimulus elements from which samples are drawn on different occasions. Whereas Estes treated all microstimuli as interchangeable, later theorizing has found it useful to distinguish among at least four classes, namely, common elements (e.g., Rescorla & Wagner, 1972; McLaren & Mackintosh, 2000), distinctive elements (Wagner, 2003), suppressed (or replaced) elements (e.g., Harris, 2006; Wagner, 2003), and configural elements (e.g., Wagner & Brandon, 2001; Pearce, 1994). One notable consequence of this conceptualization was the idea that some microstimulus elements might be suppressed when a stimulus was presented in compound with another stimulus. Furthermore, this possibility provided a way to solve the negative patterning task without needing to invoke configural elements at all. Elements that were available when a stimulus was presented alone but were suppressed when it was presented in compound could provide the foundation for learning a patterning discrimination. Moreover, with this alternative approach to characterizing stimuli, a simple prediction was that negative patterning should be easier than the biconditional task, which was in fact the result that Harris and his colleagues found. This prediction stems from the fact that, because stimuli are always presented in compounds in the biconditional task, only the salient elements of each stimulus would be active on all trials. For learning to occur in this situation, then, a compound-unique pattern of suppression would need to occur in order for a different constellation of elements to be present on the different trial types.

The available literature that allows comparisons among the 4 tasks in Table 1 is not large, and it is also somewhat inconsistent. For example, among studies that make comparisons directly, negative patterning is sometimes harder (Whitlow & Loatman, 2015) and sometimes easier (Harris & Livesey, 2008; Harris et al., 2008) than the biconditional. One noteworthy difference between these two contrasting findings is that Whitlow and Loatman (2015) used a procedure with humans in which the elements of the patterning task were combined with a separate novel stimulus on every reinforced trial. In this way, the negative patterning task was trained under conditions more similar to those of the biconditional, in that each stimulus was always presented within a stimulus compound. Under these conditions, Harris’s (2006) model would predict that the negative patterning task would be especially difficult to learn because only the strongest microstimuli of each stimulus would tend to be activated on both reinforced and non-reinforced trials, and so there would be no strong basis for learning the discrimination. In contrast, Harris and his colleagues found, with humans (Harris & Livesey, 2008) and rats (Harris et al., 2008), that the negative patterning task was learned better than the biconditional when they used the more typical procedure of presenting stimuli in isolation on reinforced trials in the negative patterning task.

Another complex discrimination problem is the so-called ambiguous occasion setting task (e.g., Holland, 1991). This close cousin of the biconditional task takes the form: AC+, AB−, C−, B+. The only difference between the two is whether C− and B+ occur on their own or as part of CD− and BD+ stimulus compounds. Holland and Reeve (1991) compared learning both the positive (AC+, C−) and negative (AB−, B+) occasion setting components of this task to learning (in different groups of rats) positive and negative patterning discriminations. They found that learning negative patterning was sometimes no different from learning the feature negative occasion setting component of the ambiguous occasion setting task (Holland & Reeve, 1991, Exp. 1), and sometimes a little easier (Holland & Reeve, 1991, Exp. 2).

Delamater, Kranjec, and Fein (2010) studied the impact of a differential outcomes treatment on rats learning ambiguous occasion setting and biconditional discriminations. They found that both the biconditional and ambiguous occasion setting tasks were learned much more rapidly and successfully when each reinforced stimulus was rewarded with a distinctive US, and that animals trained with non-differential outcomes failed to learn either the positive or negative occasion setting components of the ambiguous occasion setting task. These results suggest that the course of learning is strongly affected by whether a differential or non-differential outcomes treatment is used. However, Delamater et al. (2010) did not assess learning in patterning discriminations.

Given this somewhat mixed set of findings, we thought it important to examine how rapidly biconditional and patterning discriminations are learned in rats trained with differential outcomes. Given Delamater et al.’s (2010) finding that biconditional discriminations were learned faster with differential than with non-differential outcomes, we asked whether training with differential outcomes would change the relative difficulty of patterning and biconditional discriminations.

There are at least two reasons why this might matter. First, the Rescorla-Wagner model anticipates that the biconditional task could be solved without recourse to configural cues when training with differential outcomes. This follows from the fact that each element of each reinforced compound bears an excitatory relationship to one US and an inhibitory relationship to the other US. This could render the biconditional task easier to solve than the negative patterning. Second, Delamater (2012) interpreted the differential outcome effects in the Delamater et al (2010) biconditional and ambiguous occasion setting tasks in terms of an acquired distinctiveness of cues effect, whereby training with differential outcomes enabled the animals to perceptually distinguish more effectively between the two auditory cues and also between the two visual cues in their tasks. This mechanism would allow for a more rapid solution to the biconditional problem, but whether such learning would be more rapid than in the negative patterning task is not known.

Experiment 1

The present study examines the relative rates of learning Pavlovian biconditional, positive patterning, and negative patterning tasks when combined with a differential outcomes treatment. In these tasks we used two different auditory (A1, A2) and visual (V1, V2) stimuli and two qualitatively distinct unconditioned stimuli (US1, US2). The form of these discriminations is: A1V1-US1, A1V2-, A2V1-, A2V2-US2 (for the biconditional task), A1V1-US1, A1-, V1- (positive patterning), and A2-US2, V2-US2, A2V2- (negative patterning). These procedures are very similar to those used by Harris et al. (2008) except that we employed two USs here, instead of just one. Their experiment was unique in that the total number of reinforced trials was matched between groups given the biconditional and patterning discrimination procedures, and in that the patterning procedures were trained using a within-subject method in which the different audiovisual stimulus sets used in the biconditional group were also used for each type of patterning problem. We see these design features as an advantage and used them here as well. In short, the present study asked if the pattern of findings reported by Harris et al. (2008) would also occur when using a differential outcome procedure.

Method

Subjects

Subjects were 16, experimentally naïve, male (8) and female (8) Long-Evans rats bred at Brooklyn College, but derived from Charles River laboratories. The free feeding body weights varied between 358 and 421 g for the males and between 225 and 271 g for the females at the beginning of the experiment. The rats were housed in groups of 2–4 animals in plastic tub cages with wood chip bedding (17 × 8.5 × 8 in, l × w × h) in a colony room that was on a 14 hr light/10 hr dark cycle, and they were maintained at 85% of their free feeding body weights by daily supplemental feedings (given following the experimental session each day). Experimental sessions occurred during the light phase of their light/dark cycle, approximately 6 hours after light onset.

Apparatus

The apparatus consisted of a set of eight identical standard conditioning chambers (BRS Foringer RC series), each of which was housed in a custom made sound- and light-resistant shell. The conditioning chambers measured 30.5 cm × 24.0 cm × 25.0 cm. Two end walls were constructed of aluminum, and the sidewalls and ceiling were made from clear Plexiglas. The floor consisted of 0.60 cm diameter stainless steel rods spaced 2.0 cm apart. In the center of one end wall 1.2 cm above the grid floor was a recessed food magazine measuring 3.0 × 3.6 × 2.0 cm (length × width × depth). The reinforcers were 2, 45-mg pellets supplied by TestDiet (MLab rodent grain pellets) and a 0.1 ml droplet of a 20% sucrose solution. The sucrose reward was delivered via a gravity-feed valve (ASCO Red-Hat valve) to one of two wells positioned at the entrance of the food magazine, and the food pellets were dropped onto the floor of the same food magazine. On the inner walls of the recessed magazine were an infrared detector and emitter enabling the automatic recording of head movements inside the magazine. These were located 0.9 cm above the magazine floor and 0.8 cm recessed from the front wall. Located 3.0 cm to the right and left of the magazine and 8.0 cm above the floor were different response levers (4 cm in width). These levers protruded into the chamber at all times, but separate sheet metal coverings prevented access to both levers at all times throughout the experiment. A 6-W light bulb, located above the experimental chamber and towards the top portion of the rear wall of the sound attenuating outer chamber, flashed (F), with equal on/off periods, at a rate of approximately 2 cycles/sec when activated. Another 6-W light bulb, located towards the bottom right corner of the rear wall of the outer chamber, emitted light continuously (L) when activated. Approximately 22 cm behind the end wall of the chamber (behind the food magazine) were two audio speakers. 
One speaker, when activated, emitted a 1500-Hz pure tone generated by a computer and amplified by a Radio Shack amplifier. The other speaker emitted white noise produced by a Grason-Stadler white-noise generator. The pure tone (T) measured 4 dB and the white noise stimulus (N) 12 dB above a background noise level of 78 dB (measured by a Radio Shack Sound Level Meter, C weighting (Cat #33-2050)). The chamber remained dark during trials except during presentations of the visual stimuli. Fans mounted to the outer shells of the chambers supplied cross ventilation and produced the background noise. All experimental events were controlled and recorded automatically by a Pentium-based PC and interfacing equipment (Alpha Products) located in the same room.

Procedure

The rats were initially magazine trained with the two reward types. On each of two days, one magazine training session with one outcome was followed immediately by a second session with the other outcome. The order in which magazine training sessions occurred with the two outcomes was counterbalanced across days. In each session, 20 rewards of one kind were delivered according to a random time 60 sec schedule.

Biconditional Discrimination training

Over the next 56 sessions half of the rats (4 male, 4 female) were trained on a biconditional discrimination task using procedures similar to those described by Harris, Livesey, Gharaei, and Westbrook (2008), with exceptions noted below. In each session there were 8 presentations of each of 4 trial types. These trial types consisted of 4 distinct audio-visual compound stimuli (FN, FT, LN, LT), where two were reinforced with different outcomes and the other two were non-reinforced. Specifically, FN was reinforced (at stimulus offset) with pellets and LT with sucrose. In this study the specific stimulus compound-reinforcer type assignments were not counterbalanced because our primary interest was to compare learning of this biconditional task to learning different patterning discriminations, and the F and N stimuli in the patterning task (see below) were also trained with pellets while L and T were trained with sucrose. All stimuli were 10 s in duration and the trial types were pseudo-randomly presented in each session in four 8-trial blocks with the constraint that each trial type occurred twice in each block. There were 8 different running sequences used irregularly across days. The inter-trial interval averaged 2 min, with a range from 1 to 3 min.

On Day 57, the four individual stimuli were tested on non-reinforced probe trials that were irregularly interleaved with normal compound training trials. Each stimulus was tested 4 times throughout the session (once in each block), and each compound was presented 4 times.

Patterning Discrimination training

The remaining rats (randomly chosen) were trained for 56 sessions on a patterning task using parameters similar to those described above. One set of stimuli (F, N) was used with the pellet reward and the other (L, T) with the sucrose reward, but the patterning task (positive, negative) was counterbalanced across these stimulus sets (i.e., FN-pel, F-, N-, L-sucr, T-sucr, LT- for one subset of rats and FN-, F-pel, N-pel, L-, T-, LT-sucr for the other subset). Each session consisted of four 8-trial blocks in which each compound stimulus occurred twice and each element once. Following Harris et al. (2008), this procedure equated the overall number of reinforcers in each session with that in the biconditional discrimination group.
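The equating of reinforcers across groups can be checked with a short bookkeeping sketch. This is an illustration only: the block composition follows the description above (using the first patterning subset), but the simple within-block shuffle, the seed, and all names are assumptions, and the 8 fixed running sequences actually used are not modeled.

```python
import random
from collections import Counter

# (stimulus, outcome) pairs for one 8-trial block; outcome is None when non-reinforced.
BICONDITIONAL_BLOCK = 2 * [("FN", "pellet"), ("LT", "sucrose"), ("FT", None), ("LN", None)]
PATTERNING_BLOCK = (
    2 * [("FN", "pellet"), ("LT", None)]                                # compounds, twice each
    + [("F", None), ("N", None), ("L", "sucrose"), ("T", "sucrose")]   # elements, once each
)

def make_session(block, n_blocks=4, seed=0):
    """Build one session by shuffling trial order independently within each block."""
    rng = random.Random(seed)  # hypothetical seed, for reproducibility only
    session = []
    for _ in range(n_blocks):
        trials = list(block)
        rng.shuffle(trials)
        session.extend(trials)
    return session

def reinforcers(session):
    """Count reinforcer deliveries of each type in a session."""
    return Counter(outcome for _, outcome in session if outcome is not None)

if __name__ == "__main__":
    print(reinforcers(make_session(BICONDITIONAL_BLOCK)))
    print(reinforcers(make_session(PATTERNING_BLOCK)))
```

Both groups receive 32 trials per session with the same number of pellet and sucrose deliveries, which is the property the design relies on.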

Statistical Analysis

The rate and duration of magazine entry responding were assessed during each stimulus presentation as well as in 10 s pre stimulus periods. Elevation scores were then calculated by subtracting pre stimulus responding from that occurring during the stimuli. A discrimination score was also calculated by subtracting the elevation scores for non-reinforced stimuli from those for reinforced stimuli. Positive scores reflect greater responding to reinforced than non-reinforced stimuli.
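The two derived measures can be written out as a brief sketch. The function names are illustrative assumptions, and the numbers are hypothetical values chosen only to show the arithmetic; they are not data from the experiment.

```python
def elevation(cs, pre):
    """Elevation score: responding during the stimulus minus pre stimulus responding."""
    return cs - pre

def discrimination_score(reinforced, nonreinforced):
    """Difference between elevation scores on reinforced and non-reinforced trials.

    Each argument is a (cs, pre) pair in the same units (e.g., % time in the magazine).
    """
    return elevation(*reinforced) - elevation(*nonreinforced)

# Hypothetical values for illustration only (not data from the experiment).
score = discrimination_score(reinforced=(60.0, 25.0), nonreinforced=(35.0, 25.0))
print(score)  # 25.0: positive, so responding was greater on reinforced trials
```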

The data were then analyzed using analysis of variance (ANOVA) techniques recommended by Rodger (1974; 1975; see Appendix for details). Briefly, these methods entail reconceptualizing factorial designs (e.g., with I and J factors) in terms of a one-way design (e.g., with I × J levels). When a given one-way ANOVA test achieves significance, then interesting interactions among the conditions and groups are uncovered through post-hoc analysis. The outcomes of these post-hoc tests are then used to construct a quantitatively precise statement about the effect sizes observed. One measure of effect size this method produces is an estimate of the non-centrality parameter of the non-central F distribution, Δ, which states how much overall variation exists among the means that comprise the F score. In the present study, we report these values for each significant F test. Since Rodger’s method is a decision-based post-hoc testing approach, type I error rate is defined in terms of the expected proportion of true null contrast rejections (out of a set of ν1 mutually orthogonal and linearly independent contrasts) and is assessed against Rodger’s table of critical F scores (Rodger, 1974). In the present studies our type I error rate was set to 0.05. Moreover, our sample sizes were chosen in order to achieve a reasonably high rate (0.85) of detecting moderately sized effects when they exist.
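The Appendix is not reproduced in this section, so the exact estimator of Δ is not stated here. As a hedged sketch, the standard method-of-moments estimator of the non-centrality parameter, obtained by solving the expected value of a non-central F for Δ, closely matches the values reported in the Results. The function name is an illustrative assumption.

```python
def noncentrality_estimate(F, nu1, nu2):
    """Method-of-moments estimate of the non-centrality parameter from an observed F.

    Assumed form (not confirmed by this section): solve
    E[F] = nu2 * (nu1 + Delta) / (nu1 * (nu2 - 2)) for Delta.
    """
    return nu1 * (nu2 - 2) * F / nu2 - nu1

# Values of F and degrees of freedom taken from the Results of Experiment 1.
print(round(noncentrality_estimate(3.24, 13, 91), 1))   # 28.2
print(round(noncentrality_estimate(25.17, 6, 84), 1))   # 141.4
```

Because the reported F values are rounded, small discrepancies from the published Δ values are to be expected with this sketch.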

This method was chosen over others because the method avoids any ambiguity regarding statistical decisions concerning all of the data to be evaluated and because it is among the most powerful presently available ANOVA techniques at detecting true effects (see also Rodger & Roberts, 2013).

Results

Positive versus Negative Patterning Discrimination Learning

Figure 1 displays the course of acquisition of the patterning and biconditional discriminations. The mean % time spent in the magazine during the reinforced and non-reinforced stimuli (expressed as elevation scores) is shown over 8-session blocks. It is clear that the positive patterning discrimination was learned more rapidly than the negative patterning discrimination, and that the negative patterning task was somewhat superior to the biconditional discrimination task. Pre CS responding did not differ among trial types or between groups throughout training. The mean % time (and SEM) averaged across training blocks in Group Patterning and Group Biconditional, respectively, was 27.5 (6.5) and 26.2 (5.1).

Figure 1.

Mean % time in the magazine elevation scores (CS – Pre) on reinforced and non-reinforced trials across 8-session blocks of Experiment 1 for Group Patterning on the positive (A) and negative (B) patterning tasks and for Group Biconditional (C). Responding is shown in Group Patterning for reinforced (+) and non-reinforced (−) compound (Cpd) and Element trials (El), and for reinforced and non-reinforced compound trials in Group Biconditional.

Group Patterning’s data were first analyzed by performing a repeated measures ANOVA across the reinforced and non-reinforced stimuli and blocks. This analysis revealed significant differences across these conditions, F(27,189) = 2.82, MSE = 279.681, Δ = 48.4. Subsequent post-hoc tests revealed that differences in responding to the reinforced stimulus compound and non-reinforced stimulus elements in the positive patterning task first started to emerge in block 2 of training, and persisted through block 7. In contrast, responding to the reinforced elements was greater than to the non-reinforced compound in the negative patterning task only in blocks 5, 6, and 7. Thus, the positive patterning task was learned more rapidly than the negative patterning task.

Negative Patterning versus Biconditional Discrimination Learning

Since Group Biconditional also appeared to have learned their discrimination to some degree, a separate repeated measures ANOVA was performed on this group to determine where in training their discrimination emerged. This analysis revealed that differences in responding across the reinforced and non-reinforced trials did, indeed, emerge over training, F(13,91) = 3.24, MSE = 81.859, Δ = 28.2. Subsequent post-hoc tests revealed that differences in responding between reinforced and non-reinforced trials emerged in blocks 5, 6, and 7. There were no differences in responding during pre stimulus periods throughout training.

In order to examine whether the negative patterning task was more successfully learned than the biconditional discrimination task, a further analysis was performed on these data after first constructing a discrimination score. This index was a difference score between reinforced and non-reinforced stimuli. The data are presented in Figure 2. As was observed by Harris et al. (2008), the negative patterning task initially produced somewhat greater responding to the non-reinforced compound compared to the reinforced elements (revealed by small negative difference scores), but then greater discriminative responding, relative to that seen in Group Biconditional, emerged across training.

Figure 2.

Mean (+/− SEM) discrimination scores for the negative patterning (Neg Patt) task in Group Patterning and the biconditional (Bicon) task in Group Biconditional over 8-session blocks in Experiment 1. The discrimination score reflects a difference in elevation scores (see Figure 1) on reinforced and non-reinforced trials. Larger numbers indicate greater levels of conditioned responding on reinforced than non-reinforced trials.

These data were analyzed by performing repeated measures ANOVAs on each group, based on a common error term (MSE = 66.231) as well as a between group main effects test. The groups did not differ, overall, from one another, but each group displayed significant differences across training although there was substantially greater variation across training in Group Patterning, F(6,84) = 25.17 and Δ = 141.4 than in Group Biconditional, F(6,84) = 2.63 and Δ = 9.4. Post-hoc analyses confirmed that the negative patterning discrimination score was initially less than that of the biconditional group, but then exceeded the biconditional discrimination by the end of training. The small, but significant, increase in discriminative responding over training seen in Group Biconditional suggests that this group did learn the task albeit to a lesser degree. However, a further analysis of the data helps identify the nature of the learning seen in this group.

The discrimination data in Group Biconditional were broken down in terms of responding seen to the stimulus compounds reinforced with pellets, with sucrose, or not reinforced. The mean response rate and mean % time data across 8-session blocks are depicted in Figure 3. It is clear that this group responded with different topographies in the presence of the pellet- and sucrose-paired stimuli, by responding with a high rate of magazine entries in the presence of the pellet-paired compound, and with a high % of time spent in the magazine in the presence of the sucrose-paired compound. Responding to each of these stimuli greatly exceeded responding during the non-reinforced compounds, but only with one of the response measures.

Figure 3.

Mean elevation scores (CS-Pre) in Group Biconditional on trials in which the compound stimulus was paired with the pellet US (Cpd-Pel), the sucrose US (Cpd-Sucr), or was non-reinforced in Experiment 1. Panel A displays responding in terms of mean responses per minute, while panel B shows responding in terms of % time in the food magazine.

The data were analyzed by conducting separate repeated measures ANOVAs on the data from the final two blocks of training for the two response measures. Significant differences emerged across the three stimuli in these blocks with both the response rate measure, F(5,35) = 1.50, MSE = 265.365, Δ = 2.0, and the % time measure, F(5,35) = 8.94, MSE = 129.12, Δ = 37.1. Post-hoc tests confirmed that, with the response rate measure, responding to the pellet-paired stimulus was greater than to the sucrose-paired and non-reinforced stimuli, which did not differ, whereas with the % time measure, responding to the sucrose-paired stimulus was greater than to the pellet-paired and non-reinforced stimuli, which did not differ. Thus, although overall responding to the reinforced stimulus compounds was greater than to the non-reinforced compounds in the biconditional task, response topography differences to pellet and sucrose reinforced stimuli need to be taken into consideration. The data for Group Patterning were similarly examined, but because of the small sample sizes per sub-group a composite patterning score was created by combining across both positive and negative components for the problem trained with the pellet versus sucrose USs. By the end of training both measures revealed higher response levels in the presence of reinforced than non-reinforced stimuli; however, the magnitude of this difference with each response measure differed as a function of reinforcer type (as might be anticipated from the data in Group Biconditional). In particular, the difference in response rate to reinforced and non-reinforced stimuli was greater with the pellet US than with the sucrose US (reinforced and non-reinforced responding: 20.3, 2.0 for pellet and 6.8, 1.7 for sucrose). Conversely, the difference in % time to reinforced and non-reinforced stimuli was greater with the sucrose than with the pellet US (46.6, 7.5 for sucrose and 22.1, 13.3 for pellet). 
Thus, these data generally mirror what was found with these two response measures in Group Biconditional; however, it is important to note that successful discriminative responding, in particular, on the negative patterning task cannot be described in terms of a simple summation of response tendencies conditioned to each separate element.

One final analysis was performed on the element test session data for Group Biconditional on day 57 of the experiment. Because there were no reliable differences among the various compounds or elements with the response rate data, only the % time data are presented. Responding to the compound stimuli paired with pellets, with sucrose, or non-reinforced is shown in Figure 4, as is responding to the elements paired with pellets or sucrose. The stimuli paired with sucrose (compounds or elements) evoked a higher level of magazine responding than the stimuli (compounds or elements) paired with pellets. In addition, overall responding was higher to the compounds than the elements. Pre stimulus responding did not differ between these two trial types. The mean (SEM) % time scores for compound and element trials, respectively, were 23.5 (6.9) and 28.2 (7.7).

Mean % time in the magazine, expressed as elevation scores (CS-Pre), during compound and element test trials on session 57 for Group Biconditional in Experiment 1. The data are shown separately for the conditioned stimuli that had been paired with the pellet US (CS-Pel), paired with the sucrose US (CS-Sucr), or non-reinforced (CS−).

A one-way repeated measures ANOVA was performed on these data and revealed a significant difference among these conditions, F(4,28) = 10.87, MSE = 221.373, Δ = 36.4. Subsequent post-hoc tests revealed that responding to the compound stimulus paired with sucrose was significantly greater than to the compound paired with pellets, the element paired with sucrose, and the non-reinforced compound. In addition, responding to the elements paired with pellets was lower than to all other test stimuli.
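For readers less familiar with the analysis, the following is a minimal sketch of how a one-way repeated measures ANOVA partitions variance and why five test conditions in eight rats give the degrees of freedom (4, 28). It is not the authors' analysis code. The score matrix is entirely hypothetical, so the printed F and MSE will not match the reported values, and the p-value step (which requires the F distribution) is omitted.

```python
def repeated_measures_anova(scores):
    """One-way repeated measures ANOVA.

    scores: list of rows, one row per subject, one column per condition.
    Returns (F, df_conditions, df_error, MSE).
    """
    n = len(scores)       # number of subjects
    k = len(scores[0])    # number of conditions
    grand_mean = sum(sum(row) for row in scores) / (n * k)
    cond_means = [sum(row[j] for row in scores) / n for j in range(k)]
    subj_means = [sum(row) / k for row in scores]

    # Partition total variability into conditions, subjects, and error.
    ss_total = sum((x - grand_mean) ** 2 for row in scores for x in row)
    ss_cond = n * sum((m - grand_mean) ** 2 for m in cond_means)
    ss_subj = k * sum((m - grand_mean) ** 2 for m in subj_means)
    ss_error = ss_total - ss_cond - ss_subj

    df_cond = k - 1
    df_error = (k - 1) * (n - 1)
    mse = ss_error / df_error
    f_ratio = (ss_cond / df_cond) / mse
    return f_ratio, df_cond, df_error, mse


# HYPOTHETICAL elevation scores (8 rats x 5 test conditions), for illustration.
# Columns: compound-sucrose, compound-pellet, compound-nonreinforced,
#          element-sucrose, element-pellet.
hypothetical_scores = [
    [45, 20, 22, 25, 5],
    [50, 18, 25, 30, 8],
    [38, 25, 15, 20, 2],
    [60, 30, 28, 35, 10],
    [42, 15, 20, 22, 0],
    [55, 22, 30, 28, 6],
    [35, 12, 18, 15, -3],
    [48, 28, 24, 27, 9],
]

f_ratio, df_cond, df_error, mse = repeated_measures_anova(hypothetical_scores)
print(f"F({df_cond},{df_error}) = {f_ratio:.2f}, MSE = {mse:.3f}")
```

Removing the between-subject sum of squares from the error term is what distinguishes this from a between-groups ANOVA on the same numbers.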

Discussion

The main findings of the present study were that (1) a positive patterning task was more easily learned than a negative patterning task, and (2) a negative patterning task was more successfully learned than a biconditional task in which the same set of visual and auditory stimuli was presented and the number of reinforcements was equated. These results are largely consistent with those reported by Harris et al (2008), but were obtained under circumstances in which a differential outcome manipulation was employed. One complication introduced by this manipulation was that different response topographies developed in the presence of the stimulus compounds reinforced by pellets and by sucrose. In particular, the animals displayed a higher rate of magazine entries in the presence of the pellet-paired stimulus compound, but a higher % of time spent in the magazine in the presence of the sucrose-paired stimulus compound. Perhaps Group Biconditional subjects merely learned to associate each individual stimulus with its paired reinforcer, and did not actually solve the biconditional problem by utilizing complex representational strategies. For example, responding to the sucrose-reinforced stimulus compound could merely reflect an additive sum of response tendencies to its two stimuli, since these two stimuli were only ever paired with sucrose. In contrast, the non-reinforced stimulus compounds each consisted of one stimulus paired with pellets and one paired with sucrose, and since only the sucrose-paired stimulus evoked a high % of time spent in the magazine, the total level of responding to these compounds was less than to the sucrose-reinforced compound. This analysis was supported by the tests with individual elements on day 57 of the experiment. In this test, the sucrose-paired stimuli evoked a higher % time spent in the magazine than the pellet-paired stimuli when tested individually. Further, this effect was smaller than when the stimulus compounds were tested. This pattern of results is what would be expected if the separate tendencies to enter the magazine in the presence of the individual stimuli additively contributed to responding. Thus, although responding was greater overall to the reinforced than to the non-reinforced compounds, it is not clear whether this reflects control by anything other than learning about the individual stimulus elements. It remains to be determined, therefore, whether rats could learn the biconditional discrimination at all when these differential response tendencies controlled by the individual stimuli are eliminated. The next experiment examined this further.
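The elemental summation account discussed above can be made concrete with a small sketch. The element labels (A, X, B, Y), their assignment to reinforcers, and the numerical % time tendencies are all hypothetical values chosen for illustration and are not taken from the experiment. The only assumptions are that sucrose-paired elements evoke more % time in the magazine than pellet-paired elements, and that responding to a compound is the sum of the tendencies of its elements.

```python
# HYPOTHETICAL % time tendencies controlled by each element (illustration only).
element_tendency = {
    "A": 5.0, "X": 5.0,    # elements paired only with pellets
    "B": 20.0, "Y": 20.0,  # elements paired only with sucrose
}

# Biconditional design with differential outcomes (labels are assumptions).
compounds = {
    "AX": "pellet",
    "BY": "sucrose",
    "AY": "non-reinforced",
    "BX": "non-reinforced",
}


def summed_response(compound):
    """Predicted % time to a compound as the sum of its element tendencies."""
    return sum(element_tendency[element] for element in compound)


predicted = {name: summed_response(name) for name in compounds}

for name, outcome in compounds.items():
    print(f"{name} ({outcome}): {predicted[name]}")
```

With these values, simple summation alone predicts more % time to the sucrose-reinforced compound than to either non-reinforced compound, without any learning about the compounds themselves.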

Experiment 2

The present experiment compared two groups of rats trained on biconditional discrimination tasks. One group was trained using a differential outcomes procedure similar to that used in Experiment 1. A second group, however, was trained using a non-differential outcomes procedure in which each reinforced stimulus compound was reinforced half the time with pellets and half the time with sucrose. Thus, in this group each individual stimulus would have been reinforced on some trials with pellets and on other trials with sucrose. If the rats are capable of learning the biconditional task, then they should learn to respond more to reinforced than non-reinforced compounds, and the fact that pellets and sucrose support different response topographies should be without any effect. However, if rats given the differential outcome treatment learn to respond more to reinforced than non-reinforced stimulus compounds because they respond in different ways to pellet- and sucrose-paired stimuli, then the non-differential rats may not be capable of acquiring the discrimination. Harris et al (2008) trained their rats using pellets only, and observed that rats could slowly learn the biconditional discrimination task. The present experiment re-examines this question using the current procedures, which involve training with multiple reinforcer types. The results should help us better interpret the findings of Experiment 1, in which only a relatively subtle difference in learning the biconditional and negative patterning tasks was found.
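The logic of the two training procedures can be illustrated by tallying which reinforcer follows each individual stimulus. This is a minimal sketch under stated assumptions: the compound labels (AX, BY) and the trial counts are hypothetical, and only reinforced trials are listed because non-reinforced compounds add no stimulus-outcome pairings.

```python
from collections import Counter


def element_outcome_counts(trials):
    """Tally, for each element, how often it was followed by each US."""
    counts = {}
    for compound, outcome in trials:
        for element in compound:
            counts.setdefault(element, Counter())[outcome] += 1
    return counts


# HYPOTHETICAL reinforced trials (labels and counts are illustrative only).
differential = [("AX", "pellet")] * 4 + [("BY", "sucrose")] * 4
non_differential = (
    [("AX", "pellet"), ("AX", "sucrose")] * 2
    + [("BY", "pellet"), ("BY", "sucrose")] * 2
)

diff_counts = element_outcome_counts(differential)
nondiff_counts = element_outcome_counts(non_differential)

for label, counts in (("differential", diff_counts),
                      ("non-differential", nondiff_counts)):
    for element in sorted(counts):
        print(label, element, dict(counts[element]))
```

In the differential case each element is followed by a single reinforcer, so element-specific response topographies can develop. In the non-differential case every element is followed equally often by both reinforcers, which removes that basis for discriminating the compounds.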