Stimulus Equivalence: Testing Sidman's (2000) Theory

Sara Tepaeru Minster; Max Jones; Douglas Elliffe; Suresh D Muthukumaraswamy

doi:10.1901/jeab.2006.15-05

. 2006 May;85(3):371–391. doi: 10.1901/jeab.2006.15-05

Stimulus Equivalence: Testing Sidman's (2000) Theory

Sara Tepaeru Minster ^1,^✉, Max Jones ¹, Douglas Elliffe ¹, Suresh D Muthukumaraswamy ¹

PMCID: PMC1459848 PMID: 16776057

Abstract

Sidman's (2000) theory regarding the origin of equivalence relations predicts that a reinforcing stimulus common to distinct equivalence classes must drop out of the equivalence relations. This prediction was tested in the present study by arranging class-specific reinforcers, R1 and R2, following correct responding on the prerequisite conditional discriminations (Ax–Bx, Cx–Bx) for two stimulus classes, A1B1C1 and A2B2C2. A class-common reinforcer, R3, was presented following correct responding on the prerequisite conditional discriminations for a further two stimulus classes, A3B3C3 and A4B4C4. Sidman's theory predicts reinforcer inclusion within Classes 1 and 2 only, given this training arrangement. Experiment 1 tested for the emergence of four equivalence classes and of stimulus–reinforcer and reinforcer–stimulus relations in each class. Four of the 6 subjects demonstrated the reinforcer-based relations in all four equivalence classes, rather than in only those classes with a class-specific reinforcer, as Sidman's theory predicts. One of the remaining 2 subjects showed the reinforcer-based relations in three of the four classes. Experiment 2 extended these findings to document the emergence of interclass matching relations based on the common reinforcer R3, in 5 of 6 subjects, such that a Class 3 sample occasioned the selection of a Class 4 sample when the Class 3 comparison was absent, and similarly, a Class 4 sample occasioned the selection of a Class 3 comparison when the Class 4 comparison was absent. These interclass relations emerged despite the simultaneous maintenance of Class 3 and 4 baseline conditional discriminations, so that the Class 3 and 4 stimuli and reinforcer simultaneously were, and were not, part of a single larger equivalence class. These data are irreconcilable with Sidman's theory, and question the utility of the application of the equivalence relation in describing derived stimulus relations.

Keywords: stimulus control, equivalence relations, outcome-specific consequences, matching to sample, mouse-click, humans

Stimulus classes with the defining properties of equivalence are typically established using conditional discrimination procedures. In these procedures, at least two different sample stimuli are presented alone and successively across trials. At least two comparison stimuli are presented either concurrently with, or following the presentation of a sample stimulus. Responses to one comparison are reinforced only in the presence of a particular sample, and responses to the second comparison are reinforced only in the presence of the second sample. The comparison stimulus designated as correct is thus conditional on the sample presented (Cumming & Berryman, 1965).

Sidman and Tailby (1982) formalized the properties necessary to assert that a set of stimuli are equivalent by applying mathematical set theory to conditional discrimination performances. Stimuli are said to be members of an equivalence class when conditional discrimination performance shows the properties of reflexivity, symmetry, and transitivity. A reflexive relation demands that a stimulus be matched to itself (if A1 then A1). A symmetrical relation requires the reversal of the learned relation (if A1–B1 then B1–A1). To satisfy the property of transitivity, a subject must match the sample of one trained conditional discrimination (e.g., A1 from A1–B1) to the comparison of another trained conditional discrimination (e.g., C1 from B1–C1) where the comparison of the first, and sample of the second, discriminations are the same stimulus (if A1–B1 and B1–C1, then A1–C1). When these stimulus relations are demonstrated by the subject without explicit training, the relations are described as emergent, and the stimuli are said to be members of an equivalence class.

The following notation will be used to describe Sidman's (2000) theory and throughout the remainder of the present paper: stimuli will be referred to by an alphanumeric term so that, for example, A1 refers to the member, A, of Class 1. The hyphenation of two stimulus terms (e.g., A1–B1) indicates the selection of the class-consistent second term (the comparison) given the first term (the sample) from a set of stimuli that typically includes all numeric forms of the second term (i.e., B1, B2, B3, etc.). The response and reinforcer terms of Sidman's analytic units are abbreviated as “resp” and “R”, respectively, and in cases of multiple response and reinforcer elements, each is given a numeric descriptor (e.g., resp1, resp2, R1, R2). Note that for response and reinforcer elements, the numeric descriptor does not necessarily correspond to stimulus class membership as it does for stimuli. Hyphenation of stimulus, response, and reinforcer elements (e.g., A1–B1–resp1–R1) indicates the consistent selection (or emission) of those elements given the initial element. Terms grouped together (e.g., A1B1C1) indicate that those terms are members of a stimulus class defined by the properties of equivalence. For ease of exposition, X is used to denote all members of the class defined by the numeric descriptor. X4, for example, refers to all members of Class 4. Where x is used as the numeric descriptor, x is used to denote all possible classes. For example, Ax–Bx refers to all possible A–B relations (e.g., A1–B1, A2–B2, A3–B3) given the specified training arrangement.

Sidman (1994, 2000) proposed that “a reinforcement contingency produces at least two types of outcome: analytic units and equivalence relations” (2000, p. 128). The analytic units referred to here are those of an earlier exposition (Sidman, 1986). In that exposition, reinforcement of an operant gives rise to a two-term, response–reinforcer unit of analysis (cf. Donahoe, 1994; Moxley, 1996). Two-term units can come under the control of discriminative stimuli if the response is reinforced only in the presence of a defined stimulus. Alternative responses in the presence of the defined stimulus, or the emission of the defined response in the presence of alternative stimuli, do not result in reinforcement. Control of a two-term unit by antecedent stimuli requires a simple discrimination, and constitutes a three-term unit of analysis: discriminative stimulus–response–reinforcer. The four-term analytic unit arises from the conditional discrimination in which the three-term unit itself comes under the control of a conditional stimulus: conditional stimulus–discriminative stimulus–response–reinforcer. The conditional stimulus is said to function as a “selector of discriminations”, strengthening the discriminative function of the stimulus to which a response will be reinforced (Cumming & Berryman, 1965).

Sidman (2000) argued that reinforcement contingencies produce not only the units of analysis predicted by the experimenter, but also equivalence relations. Equivalence relations, he proposed, can consist of pairs of all positive terms participating in particular reinforcement contingencies, including responses and reinforcers. Suppose we arranged conditional discrimination contingencies to establish the baseline analytic units: A1–B1–resp1–R1; B1–C1–resp1–R1; A2–B2–resp1–R1; and B2–C2–resp1–R1. The stimulus members of these analytic units will form two three-member equivalence classes (A1B1C1 and A2B2C2). If the resulting equivalence relations also included the common response and reinforcer elements as members, then all stimuli would be equivalent via the common reinforcer and response members (see also Dube, McIlvane, Mackay, & Stoddard, 1987). For example, A1 would become equivalent to B2 through the common elements resp1 and R1, so that a subject could not learn even the baseline conditional discriminations. Yet training arrangements such as this are commonplace, and are sufficient to produce both the outcomes that Sidman argues result from reinforcement contingencies—namely, the baseline analytic units and the stimulus–stimulus equivalence relations.

In order to resolve the dilemma of contingencies that on the one hand specify class distinction, and on the other hand promote class union through equivalence relations, Sidman (2000) made an assumption to maintain the logical consistency of his theory:

Our theory requires us to assume that when the two outcomes of the reinforcement contingency come into conflict, the analytic unit takes precedence over the equivalence relation… In order for the common response and reinforcer elements to retain their membership in the analytic unit, they must selectively drop out of the equivalence relation… making it possible for the smaller classes, A1B1C1 and A2B2C2, to form. (p. 132).

This suggestion is crucial to Sidman's theory which sees the reinforcement contingency as producing two outcomes, for without the analytic units, which are instantiated as the baseline conditional relations, the equivalence relations are not possible. According to this theory, then, demonstrating the inclusion of all the possible members in the equivalence class requires arranging a set of contingencies in which there is no conflict between the units of analysis and the equivalence pairs that may arise from experience with those contingencies. For example, we may train a subject to perform: A1–B1–resp1–R1; B1–C1–resp1–R1; and A2–B2–resp2–R2; B2–C2–resp2–R2. Here, Classes 1 and 2 have specific response and reinforcer elements which may then participate in the equivalence relations resulting from those reinforcement contingencies.

Sidman's (1994, 2000) notion that responses and reinforcers can become members of equivalence classes involving samples and their corresponding comparisons has some empirical support. Class expansion via stimulus-reinforcer relations has been reported in a number of studies (Dube et al., 1987; Dube, McIlvane, Maguire, Mackay, & Stoddard, 1989; Goyos, 2000; Schenk, 1994). In these experiments, arbitrary matching-to-sample training established two equivalence classes, A1B1C1, and A2B2C2. Correct responses on Class 1 trials were always followed by R1, and correct responses on Class 2 trials were always followed by R2. In addition, subjects received identity matching trials for all A, B, and C stimuli, and also for D1 and D2 stimuli. Again, correct selection of D1 on identity matching trials produced R1, whereas correct responding on D2 trials produced R2. On test trials subjects matched D1 to A1, B1, and C1, and matched D2 to A2, B2, and C2, even though D stimuli never featured on arbitrary matching trials. This demonstrates class expansion via common reinforcers to include D stimuli. The inclusion of D1 and D2 into their respective classes can only have resulted from their relations with class-specific reinforcers, demonstrating that reinforcer stimuli can participate in equivalence relations.

Class establishment via stimulus–reinforcer relations alone was demonstrated by Schenk (1994, Experiment 2). Subjects received identity matching trials only, where correct selections of A1, B1, C1, and D1 produced R1, and correct selections of A2, B2, C2, and D2 produced R2. On arbitrary matching probes for equivalence-class formation, 6 of 8 subjects were able to match A1, B1, C1, and D1 to one another at levels of accuracy that were significantly greater than chance, and similarly for Class 2 stimuli. The 6 subjects who demonstrated the emergence of two four-member equivalence classes also correctly matched R1 to Class 1 stimuli, and R2 to Class 2 stimuli. As subjects in Experiment 2 of Schenk's study never received arbitrary matching-to-sample training, the equivalence classes shown by 6 subjects could only have been possible if the training procedures resulted in equivalence classes A1B1C1D1R1, and A2B2C2D2R2, showing that class-specific reinforcers can become members of the stimulus classes they are used to establish, and that stimulus–reinforcer relations are sufficient to produce equivalence class formation.

Sidman (1994, 2000) outlined various testable predictions arising from his theory regarding the formation of equivalence classes through class-specific reinforcement or class-specific responses. As stimulus–reinforcer relations are the focus of the present study, only his test for class formation via reinforcer relations will be described here. In this test, four conditional discriminations (A1–B1, D1–C1, A2–B2, and D2–C2) are established with class-specific reinforcers, R1 and R2. Correct selections of B1 and C1 in the presence of A1 and D1 respectively are reinforced with R1, whereas correct selections of B2 and C2 in the presence of A2 and D2 respectively are reinforced with R2. Each individual conditional discrimination has no common conditional- or discriminative-stimulus members with any other conditional discrimination. There are, however, common reinforcing stimuli to draw stimuli into two potential equivalence classes. A1, B1, C1, and D1 should form an equivalence class due to the common reinforcer R1. Similarly, A2, B2, C2, and D2 should form another class due to the common R2 element. Tests for the emergence of these equivalence classes include all between-discrimination pairs for each class (i.e., Ax–Dx, Ax–Cx, Bx–Dx, Bx–Cx, and the reverse of each).

Maki, Overmier, Delos, and Gutmann (1995, Experiment 3) carried out this proposed experiment and reported results consistent with Sidman's predictions, though they did not test all possible emergent relations. In their differential-outcomes group, 2 normal children were trained to choose B1 conditionally upon A1 in order to receive R1, a bead of a particular color (either blue or red, where blue beads were exchangeable for toys and red beads were exchangeable for food). R1 also served as the outcome following correct selections of C1 after D1 had served as a sample. Another set of stimuli were trained with R2, a bead of a different color from R1 (red or blue). R2 was used to reinforce selections of B2 following A2, and of C2 following D2 (i.e., Ax–Bx–Rx, and Dx–Cx–Rx training). Maki et al. carried out two blocks of test trials after training. The first test block arranged Ax–Cx trials for 1 subject and Dx–Bx trials for the second subject. The mean accuracy score in Test Block 1 was 72% correct. The second test block presented the first subject with Dx–Bx trials, and the second subject with Ax–Cx trials. The mean accuracy in Test 2 was 90.5% correct. That these test block accuracies were significantly greater than 50% correct can only have been possible through the common reinforcing stimulus used to train the baseline relations, and indicates that Rx was substitutable for each Class X stimulus.

The experiments described above support Sidman's (2000) prediction that class-specific reinforcers can become members of an equivalence class and, therefore, form the basis for the emergence of new stimulus–stimulus relations. However, his prediction that a reinforcer common to more than one stimulus class must drop out of all equivalence relations to allow the preservation of the analytic units remains to be tested. This was the goal of the present study.

Subjects were exposed to many-to-one training (Ax–Bx, Cx–Bx; x ranges from 1 to 4) where correct Class 1 responses were reinforced with R1, correct Class 2 responses were reinforced with R2, and both correct Class 3 and 4 responses were reinforced with a common reinforcer, R3 (see Figure 1). This training arrangement should result in the emergence of four classes (AxBxCx). With regard to inclusion of Rx stimuli, Sidman's (2000) theory predicts the following results: subjects should include R1 as a member of Class 1 and R2 as a member of Class 2. If, for example, we present R1 as a sample, subjects should consistently choose A1 when A1 to A4 are presented as comparisons, B1 when B1 to B4 are presented as comparisons, and C1 when C1 to C4 are presented as comparisons. Similarly, the theory predicts the emergence of the relations Ax–Rx, Bx–Rx, and Cx–Rx when stimulus members of Classes 1 and 2 are presented as samples and the three reinforcers R1, R2, and R3 are presented as comparisons.

Fig 1 — Class 1 and 2 relations are reinforced with class-specific reinforcers whereas Class 3 and 4 relations are reinforced with a common reinforcer.

However, Classes 3 and 4 share a common reinforcer, R3. According to Sidman's (2000) theory, this should create conflict between the units of analysis and the equivalence relations so that the reinforcer element R3 drops out of the equivalence relations. This means that if we present R3 as a sample, subjects should not consistently choose A3, B3, C3, or A4, B4, or C4, when available as a choice stimulus. Also, the reverse relations (e.g., A3–R3) should not emerge when the three reinforcers are presented as comparison stimuli.

These relations were trained using a modified version of a computer-controlled conditional discrimination training procedure with differential outcomes for adult subjects reported by Miller, Waugh, and Chambers (2002). In the present study, adult subjects were trained on conditional discriminations involving Japanese kanji characters as both samples and comparisons. Reinforcing stimuli were pictures representing entries into a draw to win specific prizes. The prizes were indicated by presenting pictures of cash, chocolates, or movie tickets following correct choices. Incorrect choices were followed only by feedback informing the subject of his or her error. Stimulus relations were trained and tested as outlined above to evaluate the prediction arising from Sidman's (2000) theory that reinforcing stimuli common to at least two stimulus classes must drop out of those classes in order to maintain the integrity of the baseline analytic units.

Experiment 1

Reinforcer test trials presented R stimuli in sample or comparison positions as shown in Figure 2. The top left panel shows a test trial for a Class 1 reinforcer–stimulus relation (Class 2 reinforcer–stimulus trials were similar) with all category A stimuli presented as comparisons. Here, Sidman's theory would predict the selection of A1 given the described baseline training. The bottom left panel shows an A3–R3 test trial (stimulus–reinforcer test trials for all other classes were similar). As only three R stimuli were used during training, the number of comparisons to be presented on stimulus–reinforcer test trials is, unavoidably, only three. The top and bottom right panels illustrate reinforcer–stimulus test trials for Classes 3 and 4, respectively. On a Class 3 reinforcer–stimulus test trial the Class 4 comparison was not presented in the comparison array, and on a Class 4 reinforcer–stimulus test trial the Class 3 comparison was not presented as a comparison1.

Fig 2 — The top left panel shows a test trial for a Class 1 reinforcer–stimulus relation (Class 2 reinforcer–stimulus test trials were similar). The bottom left panel shows a Class 3 stimulus–reinforcer test trial (stimulus–reinforcer test trials for all other classes were similar). The top right panel shows a Class 3 reinforcer–stimulus test trial in which the Class 4 exemplar is absent from the comparison array. The bottom right panel shows a Class 4 reinforcer–stimulus test trial in which the Class 3 exemplar is absent from the comparison array.

Method

Subjects

Six undergraduate psychology students, 3 men and 3 women, served as subjects and were numbered S1 to S6. Subjects were not familiar with stimulus equivalence research, had never participated in experiments involving conditional discriminations, and had no prior experience with Japanese kanji characters.

Apparatus

Subjects sat at a table that supported a standard computer mouse and a computer monitor measuring 228 mm high and 306 mm wide, on which all stimuli were displayed. All responses were made via the left button of the computer mouse. Stimulus presentation and data collection were controlled by an IBM-PC^©-compatible personal computer running customized software programmed in Delphi5^©. The computer recorded the identity and position of all stimuli and the choice response emitted on each trial, together with the times of all experimental events.

Stimuli

Figure 3 shows the stimuli used as samples, comparisons, and reinforcers. Each stimulus was contained within an area 160 pixels high and 150 pixels wide. The samples and comparisons were black Japanese kanji characters on a white background. Class-specific reinforcer stimuli were pictures of movie tickets (R1), cash (R2) and chocolates (R3). The incorrect feedback stimulus comprised the words “That was incorrect” written in black on a red background and was accompanied by the words “Click for the next trial” written above the incorrect feedback stimulus. Sample stimuli and reinforcer stimuli appeared at the center of the screen and comparison stimuli were presented at the corners of the screen, 63 mm from the side of the screen and 25 mm from the top or bottom of the screen. During training and equivalence testing (symmetry; and combined symmetry and transitivity, hereafter referred to as combined tests), all four corner positions were used for comparison presentation. On stimulus–reinforcer tests, only three of the four corner positions, pseudorandomly selected for each trial, displayed a comparison stimulus. On Class 1 and 2 reinforcer–stimulus tests, all four corner positions displayed a comparison, whereas on Class 3 and 4 reinforcer–stimulus tests, only three corner positions displayed a comparison.

Fig 3 — The reinforcing stimuli were colored pictures of movie tickets, cash, and chocolates.

Procedure

All trials involved a zero-delay matching-to-sample task. A trial began with the presentation of the sample stimulus at the center position on the computer screen. A mouse click within the stimulus area resulted in the removal of the sample stimulus and the immediate presentation of comparison stimuli in the corners. A response to any comparison resulted in the immediate removal of all comparison stimuli, and the presentation of feedback at the center of the screen. Correct responses resulted in the presentation of the reinforcer stimulus corresponding to the stimulus class being trained on that trial (see Figure 1). Incorrect responses resulted in the presentation of feedback informing the subject that his or her response was incorrect. A response to either the reinforcer stimulus or the incorrect feedback stimulus resulted in its removal after 1 s, followed by a 1-s intertrial interval, after which the next trial began.

Part 1: Instructions and Baseline Training

Before an experimental session began, subjects were presented with the following instructions on the screen:

Welcome. In this experiment you will be shown a number of kanji characters. Your task is to learn which kanji go together. In a single trial, a kanji will appear in the centre of the screen. Click the kanji to make another four kanji appear. Choose the kanji you think goes with the first one. You will be given feedback about your choice. A correct choice will give you an entry into one of three different prize draws: movie tickets, cash, or chocolates. To begin with, you will have to guess, so you will get about 1 in 4 correct. As you progress through the experiment though, you will learn which kanji go together, and you will get more entries into the prize draws. The computer will tally these for you and tell you at the end how many of each you received.

At some point in the experiment different trials will appear.

Answer these as correctly as possible.

More instructions will appear later in the experiment.

A many-to-one (Ax–Bx, Cx–Bx) training structure was used such that four three-member classes (excluding reinforcer stimuli) could emerge from baseline training. Class-specific reinforcers were arranged for Classes 1 and 2, so that correct responding on Class 1 trials always resulted in the presentation of R1 (cash picture) and correct responding on Class 2 trials always resulted in the presentation of R2 (movie tickets picture) (see Figures 1 and 3). A common reinforcer was arranged for Classes 3 and 4 where correct responding on trials for both classes always resulted in the presentation of R3 (chocolates picture). Feedback was presented on every trial in Part 1. Every presentation of a reinforcing stimulus advanced a counter for the number of entries into the corresponding prize draw. This counter was not visible to the subject during the experiment.

Training was conducted in blocks of 64 trials. For each block, trials were sampled without replacement from all possible sample-comparison combinations. One block contained eight trials of each relation to be trained (e.g., A1-B1) with the correct comparison being presented in each of the four comparison locations twice. Part 1 was continued until responding was at least 85% correct, averaged across all baseline trials in each of two successive training blocks.2 When this criterion was reached subjects were given the following instructions:

The experiment will continue but you will now receive no feedback.

Feedback may appear later. Take a break and click to continue.

A response to the instruction text advanced training to Part 2.

Part 2: Removal of Feedback and Baseline Maintenance

All feedback was removed in Part 2. Selection of a comparison stimulus was, therefore, immediately followed by the 1-s intertrial interval. The block structure in Part 2 was as in Part 1. In order to progress to testing, responding had to be maintained at 85% correct, averaged across all classes in one block of trials. No new instructions were given prior to testing. However, if the performance criterion for baseline maintenance was not satisfied, the subject was presented with the following instructions:

The experiment will continue but you will now receive feedback.

Take a break and click to continue.

A response to the instruction text was followed by a repetition of Part 1. The criteria for advancement of training remained the same. All subjects maintained 85% correct during Part 2 so that a return to Part 1 was never required.

Part 3: Testing

Tests for symmetrical stimulus relations (Bx–Ax, Bx–Cx), combined symmetrical and transitive stimulus relations (combined trials: Ax–Cx, Cx–Ax), stimulus–reinforcer and reinforcer–stimulus matching relations (Ax–Rx, Bx–Rx, Cx–Rx, Rx–Ax, Rx–Bx, and Rx–Cx) were conducted. The test block contained 224 trials and comprised 64 baseline trials, 32 symmetry trials, 32 combined trials, 48 stimulus–reinforcer, and 48 reinforcer–stimulus trials. Baseline trials were sampled as for Part 1. Four trials of each symmetry and combined relation were randomly sampled without replacement from all possible sample-comparison combinations. Four trials of each stimulus–reinforcer and reinforcer–stimulus relation were sampled in the same manner. Trials testing symmetrical, combined, stimulus–reinforcer, and reinforcer–stimulus matching relations were mixed with baseline trials and presented in a random order. No feedback was programmed for any of these trials.

On all baseline, symmetry, and combined trials, four comparison stimuli were presented. For example, when an A1–C1 trial was arranged, C1 to C4 were presented as comparisons. On all stimulus–reinforcer test trials, only three comparisons (R1 to R3) were presented (see bottom left panel of Figure 2). Reinforcer–stimulus trials were different: Class 1 and 2 reinforcer–stimulus trials always had four comparison stimuli. For example, on an R2–A2 trial, R2 was the sample and A1 to A4 were comparisons. Class 3 and 4 reinforcer–stimulus trials always had three comparison stimuli. On R3–X3 test trials the Class 4 comparison was not presented (see top right panel of Figure 2). Similarly, the Class 3 comparison was not presented on R3–X4 test trials (see bottom right panel of Figure 2).

The criterion for maintenance of baseline relations was set at 85% correct, averaged over all baseline trials during Part 3 testing, and was assessed upon completion of an entire test block. If this criterion was not met, instructions that feedback was to be implemented again (see Part 2) were presented, and Part 1 procedures were reintroduced. As in Part 2, this retraining procedure was never needed because all subjects maintained the baseline relations at greater than 85% correct.

All parts were conducted within one experimental session with a duration of about 1 hr. Upon completion of the entire experiment, recipients of the cash, chocolate, and movie tickets prizes were randomly selected and provided with their prize.

Results

Baseline Training

Figure 4 shows acquisition of Class 1 to 4 baseline relations for each subject where percent correct is plotted as a function of the number of training blocks for each class separately. All subjects except S2 attained 100% correct on baseline trials with all four classes by the end of baseline training. The number of training blocks required for a subject to meet the criterion for progressing to testing ranged from 6 to10. Figure 4 shows no systematic differences among the acquisition functions for the four classes. In particular, there is no evidence of a differential-outcomes effect (Trapold, 1970) in acquisition of class relations with class-specific reinforcers (Classes 1 and 2) compared with the acquisition of class relations reinforced with shared outcomes (Classes 3 and 4). A differential-outcomes effect would have been expressed in Figure 4 by steeper functions for Class 1 and 2 relations, reaching asymptotic values earlier than the corresponding functions for Classes 3 and 4. However, idiosyncratic patterns of acquisition are shown by subjects in Figure 4. All subjects maintained baseline relations with an accuracy of at least 85% correct when feedback was removed in Part 2.

Fig 4 — The rightmost data point for each subject and class denotes accuracy in Part 2 (removal of feedback).

Testing

Figure 5 shows percent correct during Part 3 trials for all subjects. Data from symmetry and combined trials are pooled together and presented as equivalence tests. Data from stimulus–reinforcer and reinforcer–stimulus trials are pooled together and presented as reinforcer tests. All subjects maintained accuracies of at least 85% correct for overall conditional discrimination accuracy on baseline trials, and also for each individual stimulus class. The criterion for documenting the emergence of equivalence, stimulus–reinforcer, and reinforcer–stimulus relations was 85% correct, averaged over all trials of a particular test. Figure 5 shows that all subjects met the criterion for the emergence of four equivalence classes (open bars) with the exception of Subjects 2 and 3. S2 met the accuracy criterion for equivalence relations with Class 2 stimuli only, whereas S3 did so for only Classes 1 and 2. Figure 5 shows that all subjects met the criterion for reversible stimulus–reinforcer relations (dark bars) in all classes with the exception of S2 and S4. S2 failed to meet this criterion with the Class 3 stimuli and reinforcer. S4 failed to meet this criterion with stimuli from all classes.

Fig 5 — The 85% criterion for baseline maintenance, equivalence class formation, and reinforcer inclusion is indicated by the dotted line on each graph.

Discussion

Sidman's (2000) theory states that a reinforcement contingency gives rise to two outcomes: the unit of analysis and equivalence relations between all members of that unit, including reinforcer elements. If these outcomes conflict, any member of the equivalence relation involved in such conflict must drop out of the equivalence relation in order that the analytic units remain intact. As discussed earlier, Sidman's theory predicts the inclusion of the class-unique reinforcers R1 and R2 in Classes 1 and 2, respectively, so that A1B1C1R1 and A2B2C2R2 emerge. However, the class-common reinforcer R3 should not be included in Class 3 or 4, so that only A3B3C3 and A4B4C4 should emerge.

Behavior on reinforcer test trials in the present experiment did not support this prediction. Five subjects (except S4) met the criterion for stimulus–reinforcer and reinforcer–stimulus relations with Class 1 and 2 stimuli. However, 4 of these subjects (except S2) also met this criterion with Class 3 and 4 stimuli, whereas the fifth (S2) did so with Class 4 stimuli. The remaining subject (S4) failed to meet the criterion for stimulus–reinforcer and reinforcer–stimulus relations in all classes. In summary, R3 usually, though not always, remained a member of Classes 3 and 4 simultaneously, inconsistent with Sidman's (2000) theory.

The observation of stimulus–reinforcer relations in Classes 3 and 4 could perhaps be accounted for by appealing to exclusion (Type R control; e.g., Carrigan & Sidman, 1992) in the following way. Suppose Sidman's (2000) theory is correct in that a reinforcing stimulus can only ever be included in an equivalence class if that reinforcing stimulus is specific to one class. In the present experiment, we should only have observed symmetric X1–R1 and X2–R2 relations emerging from baseline training. It could be argued that subjects who demonstrated stimulus-reinforcer relations in all classes may have done so with only these X1–R1 and X2–R2 relations and without any relations involving R3. Recall that stimulus–reinforcer test trials presented R1, R2, and R3 as comparisons. On X3–R3 trials then, choosing R3 may be a result of responding controlled by exclusion, with both R1 and R2 being rejected due to their equivalence with X1 and X2, respectively, and the nonequivalence of X1 and X2 with X3, thus leaving the choice of R3 as the only remaining option. In this way, high levels of accuracy on Class 3 and 4 stimulus–reinforcer test trials may not reflect the emergence of Class 3 and 4 stimulus-reinforcer relations.

However, the case for exclusion on R3–X3 reinforcer–stimulus trials is not so clear. Recall that on an R3–X3 trial, X1, X2, and X3 were comparisons in the presence of an R3 sample. If R3 was not a member of any stimulus class, then we might expect responding on these trials to be equally distributed among X1, X2, and X3. Progression through the testing block may have resulted in the gradual emergence of R3–X3 relations due to responding by exclusion on X3–R3 trials, but then we would not expect accuracies of 100% correct on Class 3 and 4 reinforcer test trials, as some subjects showed. Nevertheless, responding controlled by exclusion may have played some part in Experiment 1.

To eliminate the possibility of an account based on exclusion, Experiment 2 included trials testing for interclass matching relations between Class 3 and 4 stimuli. Trials testing for X3–X4 matching relations were arranged with Class 3 samples (i.e., X3) presented with comparisons from Classes 1, 2, and 4 only. Similarly, by arranging trials with Class 4 samples and Class 1, 2, and 3 comparisons, we tested for X4–X3 matching relations. X3–X4 matching relations cannot be accounted for by exclusion given these trial arrangements as X1, X2, and X4 are equally unrelated to X3, just as X1, X2, and X3 are equally unrelated to X4 on X4–X3 test trials. Observation of these interclass matching relations would therefore support the suggestion of Experiment 1 that R3 had not “dropped out” of equivalence relations with X3 and X4 stimuli, contrary to the predictions of Sidman's theory.

Experiment 2

A pilot experiment to test for X3–X4 and X4–X3 interclass matching relations was conducted with 1 subject. This pilot experiment was identical to Experiment 1, with the addition of X3–X4 and X4–X3 test trials in Part 3. Part 3 therefore contained baseline trials to assess maintenance of baseline, symmetry and combined tests, reinforcer test trials for all classes, and the additional X3–X4 and X4–X3 interclass matching test trials. The pilot subject showed the emergence of four three-member stimulus classes defined by equivalence, but reinforcer relations with Classes 2 and 4 only, scoring 67% and 88% correct respectively. Because the addition of another type of test trial to the only testing block seemed to have a detrimental effect compared to Experiment 1 procedures, with regard to observing all stimulus–reinforcer relations necessary for the interclass matching relations, tests in Experiment 2 were introduced sequentially and after a return to baseline.