Journal of the Experimental Analysis of Behavior. 2005 Mar;83(2):147–168. doi: 10.1901/jeab.2005.33-03

Variation, Repetition, And Choice

Josele Abreu-Rodrigues, Kennon A. Lattal, Cristiano V. dos Santos, and Ricardo A. Matos

Abstract

Experiment 1 investigated the controlling properties of variability contingencies on choice between repeated and variable responding. Pigeons were exposed to concurrent-chains schedules with two alternatives. In the REPEAT alternative, reinforcers in the terminal link depended on a single sequence of four responses. In the VARY alternative, a response sequence in the terminal link was reinforced only if it differed from the n previous sequences (lag criterion). The REPEAT contingency generated low, constant levels of sequence variation whereas the VARY contingency produced levels of sequence variation that increased with the lag criterion. Preference for the REPEAT alternative tended to increase directly with the degree of variation required for reinforcement. Experiment 2 examined the potential confounding effects in Experiment 1 of immediacy of reinforcement by yoking the interreinforcer intervals in the REPEAT alternative to those in the VARY alternative. Again, preference for REPEAT was a function of the lag criterion. Choice between varying and repeating behavior is discussed with respect to obtained behavioral variability, probability of reinforcement, delay of reinforcement, and switching within a sequence.

Keywords: variation, choice, concurrent-chains schedules, key peck, pigeons

Two lines of operant research, both concerned with the controlling sources of behavioral variability, have challenged the conclusion that behavioral stereotypy is an inherent and inevitable result of contingencies of reinforcement (e.g., Schwartz, 1980, 1982a, 1982b). One line, characterized by the absence of operant contingencies between response variation and reinforcement, has shown a negative correlation between reinforcement rate and degree or amount of variability in steady-state performance (e.g., Antonitis, 1951; Eckerman & Lanson, 1969; Notterman & Mintz, 1965; Tatham, Wanchisen, & Hineline, 1993). The other line has asserted that variability itself can be selected by contingencies of reinforcement. That is, specifying that a given response sequence must differ from those emitted recently results in greater variation in the sequences than occurs in the absence of such a requirement (e.g., Page & Neuringer, 1985). This outcome has been functionally related to a number of variables, such as reinforcement history (Hunziker, Caramori, da Silva, & Barba, 1998) and response topography (Morgan & Neuringer, 1990).

Behavioral variability also has been investigated in the context of choice. To demonstrate that “pigeons choose systematically to vary and to repeat their behaviors” (Neuringer, 1992, p. 249), Neuringer trained pigeons to vary or repeat sequences of four responses. A sequence was designated VARY if it differed from each of the three previous sequences, and REPEAT if it matched any one of them. Before each sequence occurred, a computer selected whether VARY or REPEAT would be reinforced. The probability of VARY reinforcement was varied over conditions, with the probability of REPEAT reinforcement always equal to 1.0 minus the VARY probability. The percentage of VARY sequences emitted was an increasing function of the probability of VARY reinforcement. Neuringer interpreted these findings as evidence that choice between varying and repeating response sequences was controlled by reinforcement probability, a variable that he also described in terms of relative frequency of reinforcement.

Fantino (1977) noted that in concurrent schedules, the index of preference, or relative rate of responding, is confounded by the direct reinforcement of the two concurrent responses. For example, FR schedules maintain higher response rates than VI schedules; consequently, when these two schedules are concurrently arranged, higher FR response rates do not necessarily indicate preference for that schedule. In Neuringer's (1992) experiment, although he claimed to be investigating choice between varying and repeating contingencies, preference for either contingency cannot be asserted because the index of preference, the ratio of VARY to REPEAT sequences, was confounded with the effects of reinforcement of those sequences. That is, because the percentage of VARY sequences was a direct function of the probability of VARY reinforcers, this index may have changed either because the pigeons actually preferred one type of sequence over the other or because of the direct action of reinforcement on the sequences. Fantino proposed that a better index of preference could be obtained with concurrent-chains schedules because choice could be separated from the response patterns that are directly reinforced.

Following Fantino's (1977) observation, the present experiments used concurrent-chains schedules to assess preference for VARY and REPEAT sequences as a function of the required degree of variability. This allowed an assessment of whether, as Neuringer's (1992) data suggest, the degree of variability is a factor in determining preference. With this procedure it also was possible to investigate such preferences while holding overall reinforcement rate constant.

Choice between variation and repetition also was of interest for two reasons. First, its demonstration would provide further evidence of the sensitivity of sequences of responses to their consequences and, as a result, of the general idea of variation and repetition as dimensions of the operant. Second, such sensitivity is predicted by recent findings. Doughty and Lattal (2001) showed that variability was more resistant to disruption by prefeeding or response-independent food delivery during blackouts than was repeatability. Nevin and Grace (2000) found that pigeons' preferences were greater for those response alternatives that were more resistant to disruption by response-independent food and to extinction (see also Grace & Nevin, 1997). If variable behavior is more resistant to disruption than repetitive behavior, and if contingencies correlated with greater resistance are preferred over those generating less resistance, then it follows that variability and repeatability contingencies should differentially control preference.

Experiment 1

Using a concurrent-chains schedule, Experiment 1 investigated choice between VARY and REPEAT contingencies as a function of the degree of behavioral variability required by the contingencies. At issue was whether choice between varying and repeating sequences of responses could be predicted and controlled by the level of sequence variability. If the variability requirement is a good predictor of preference, a systematic relation between preference and behavior variation should be observed.

Method

Subjects

Four experimentally naive White Carneau pigeons (P10, P20, P30, and P40) were maintained at 75% to 80% of their free-feeding weights throughout the experiment. The pigeons were housed individually with free access to grit and water in a temperature-controlled room with a 12:12 hr light/dark cycle. Supplementary food was given 1 hr after the end of the session as necessary to maintain prescribed weights.

Apparatus

The experimental chamber had a workspace measuring 29.5 cm long, 31 cm deep, and 32 cm high. Four 2.8 cm diameter, translucent keys were displayed horizontally on the work panel, 22 cm above the floor, with 9 cm separating the two middle keys and 3 cm separating each of the outermost keys. The keys are identified here, from left to right, as Keys 1, 2, 3, and 4. The keys were transilluminated by red, white, or green lights. Pecks on an illuminated key at a minimum force of 0.14 N operated the key. A Gerbrands food magazine delivered mixed grain through an aperture (4.5 cm by 6 cm) centered in the middle of the work panel and located 7.5 cm above the floor. A white houselight was located in the lower-right corner of the work panel. The houselight and the keys were darkened and inoperative during 3-s grain presentations, when the hopper was illuminated by a white light. A Sonalert® tone generator, located behind the work panel, provided an auditory stimulus. The chamber was housed in a light- and sound-attenuating box equipped with a fan for ventilation and masking noise. A Tandy 1000 TX microcomputer connected to the chamber by a MED-PC® interface system arranged the experimental conditions and recorded the pigeons' responding. MED-PC® software was used to program the experimental contingencies.

Procedure

Preliminary REPEAT/VARY Training.

Previous studies have shown that, when variability is not demanded, the most frequent four-response sequence is one confined to a single operandum, followed by the LRRR and RLLL (L for left-key responses and R for right-key responses) sequences (McElroy & Neuringer, 1990; Morgan & Neuringer, 1990). Based on that finding, the REPEAT sequence throughout this study was LRRR; the LLLL and RRRR sequences were not selected so that both keys would be used during both the REPEAT and VARY conditions.

The training of the REPEAT sequence occurred in several stages, and was similar to that provided by Cohen, Neuringer, and Rhodes (1990). During the REPEAT training, two keys were white: Keys 1 (left) and 2 (right) for Pigeons P30 and P40, and Keys 3 (left) and 4 (right) for Pigeons P10 and P20. In Stage 1, only the right key was illuminated, and two consecutive right-key responses were required for reinforcement. In Stage 2, reinforcers followed three consecutive right-key responses. In Stage 3, the left key was initially illuminated; a left-key response darkened that key and illuminated the right key, and three consecutive right-key responses then resulted in reinforcement. In Stage 4, both left and right keys were illuminated simultaneously, and one left-key response followed by three right-key responses resulted in reinforcement. Left-key responses after the first left-key response, and right-key responses at the beginning of the session or after reinforcement, had no consequences. A left-key response after the first right-key response initiated a 5-s blackout (BO) during which the chamber was dark and the Sonalert was on continuously. In Stage 5, the reinforcement contingencies were similar to those in Stage 4 except that the BO also followed right-key responses at the beginning of the session or after reinforcement. In Stage 6, reinforcement was contingent upon the emission of the LRRR sequence, and any response that disrupted this sequence immediately produced the BO. In Stage 7, the LRRR sequence was required for reinforcement but BOs occurred only at the end of a four-response sequence; thus any other sequence of four responses terminated in a BO. During these stages, each of the first three key pecks was followed by a 0.5-s darkening of the response key. Responses to the darkened key and during the BO reset the respective interval and were not counted toward the REPEAT contingency (cf. Neuringer, 1991). Reinforcement and BO were immediately followed by a new trial. Each session ended after 60 reinforcers.

When at least 50% of the REPEAT sequences were reinforced for three consecutive sessions, the VARY contingency was introduced. During this contingency, two keys were green: Keys 1 and 2 for Pigeons P10 and P20, and Keys 3 and 4 for Pigeons P30 and P40. Each VARY sequence consisted of four responses, with each of the first three responses producing a 0.5-s darkening of the response key. A sequence was reinforced if it met the variability criterion; otherwise it initiated a 5-s BO during which the tone alternated on and off every 500 ms. Reinforceable VARY sequences included the sequence used in REPEAT (LRRR) as well as sequences that used only one key. The VARY contingency alternated with the REPEAT contingency according to a multiple schedule. Initially, each session began with the REPEAT contingency; when 10 REPEAT reinforcers had been obtained, the VARY contingency then operated for 50 reinforcers. This difference in the frequency of VARY and REPEAT reinforcers per session was implemented because, prior to this point in the experiment, the pigeons had accumulated more reinforcers for REPEAT sequences than for VARY sequences. When the total numbers of VARY and REPEAT reinforcers obtained since the beginning of training were approximately the same, sessions began randomly with either the REPEAT or VARY component. Each component lasted until 10 reinforcers had been earned, and was followed by a 5-s intercomponent interval during which the chamber was dark but, unlike the BOs, no tone was presented. Each session lasted for 60 reinforcers throughout the REPEAT/VARY training.

Across three training conditions, the degree of variability required for reinforcement was manipulated according to a lag procedure (Page & Neuringer, 1985). To be reinforced in the VARY component, a sequence had to differ from the immediately preceding sequence (Lag 1 criterion), from each of the last three sequences (Lag 3 criterion), or from each of the five previous sequences (Lag 5 criterion) in different conditions of the experiment. Each lag criterion remained in effect until at least 50% of the VARY and 50% of the REPEAT sequences were reinforced over three consecutive sessions. The preliminary training typically was completed within 90 sessions.
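To make the lag contingency concrete, a minimal Python sketch of a Lag n check is given below. It assumes, as is conventional in lag procedures, that every emitted sequence enters the comparison window whether or not it is reinforced; the function names are illustrative, not part of the original apparatus code.

```python
from collections import deque

def make_lag_checker(lag):
    """Return a checker for a Lag-n variability criterion: the current
    four-response sequence is reinforceable only if it differs from each
    of the last `lag` emitted sequences (Page & Neuringer, 1985)."""
    history = deque(maxlen=lag)

    def meets_criterion(sequence):
        ok = sequence not in history
        history.append(sequence)  # every emitted sequence enters the window
        return ok

    return meets_criterion

# Under Lag 1, strict alternation between any two sequences satisfies
# the criterion indefinitely:
check = make_lag_checker(1)
print([check(s) for s in ("LRRR", "RRRR", "LRRR", "LRRR")])
# -> [True, True, True, False]
```

Note that under Lag 1 two alternating sequences suffice, a point that becomes relevant to the higher-order stereotypies discussed later.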

Concurrent-Chains Training

Figure 1 illustrates the concurrent-chains procedure used to investigate preference for VARY versus REPEAT contingencies. In the initial links, Keys 2 and 3 were red, and a concurrent variable-interval (VI) 30-s VI 30-s schedule was programmed according to Stubbs and Pliskoff's (1969) procedure. Thus the VI schedules, generated according to the Fleshler and Hoffman (1962) progression with 12 intervals, operated interdependently. That is, after an average of 30 s, a terminal link entry was arranged on either Key 2 or Key 3. A single peck on the appropriate key initiated its terminal link; pecks on the other key were ineffective. Once a terminal link entry was assigned, the VI timer stopped until the end of the intertrial interval (ITI).
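A sketch of how this interdependent initial-link arrangement can be programmed appears below. The Fleshler and Hoffman (1962) formula is standard, but the sampling details (e.g., with or without replacement) are assumptions here rather than specifics reported in the article.

```python
import math
import random

def fleshler_hoffman(mean_s, n):
    """Fleshler & Hoffman (1962) progression of n intervals with mean mean_s:
    t_k = mean * [1 + ln n + (n-k) ln(n-k) - (n-k+1) ln(n-k+1)],
    using the convention 0 * ln 0 = 0."""
    def xlnx(x):
        return 0.0 if x == 0 else x * math.log(x)
    return [mean_s * (1 + math.log(n) + xlnx(n - k) - xlnx(n - k + 1))
            for k in range(1, n + 1)]

def next_assignment(intervals):
    """Interdependent scheduling in the spirit of Stubbs & Pliskoff (1969):
    a single timer runs, and each elapsed interval is assigned to the left
    or right key with probability .5; the timer then stops until the
    assigned terminal link is collected."""
    return random.choice(intervals), random.choice(("left", "right"))

intervals = fleshler_hoffman(30.0, 12)
print([round(t, 1) for t in intervals])  # 12 intervals averaging 30 s
print(next_assignment(intervals))
```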

Fig. 1. Schematic diagram of the concurrent-chains schedule in Experiment 1.

REPEAT and VARY contingencies were available in the terminal links. The arrangement of these contingencies was similar to that in the REPEAT/VARY training. With both contingencies, each trial consisted of a sequence of four responses. Each of the first three responses darkened the keylight for 0.5 s. Responses to either darkened key reset the 0.5-s period, but these responses were not counted toward the sequence requirement. The fourth response terminated the trial. If the sequence requirement was met, a 3-s reinforcer ensued. Otherwise, a 5-s BO occurred, during which the chamber was dark, the keys were inoperative, and a continuous (REPEAT condition) or intermittent (VARY condition) tone was presented. Responses during the BO reset the 5-s interval. A new trial began immediately after the reinforcer or BO.

During REPEAT terminal links, the keys were white, and reinforcement was contingent on the occurrence of a single sequence (LRRR). During VARY terminal links, the keys were green, and a Lag n variability condition operated such that a sequence was reinforced only if it differed from the previous n sequences. Each terminal link remained in effect until five reinforcers were obtained. To maintain a constant overall reinforcement rate, a timeout (TO) was added after the fifth reinforcer for the shorter terminal link, which was always the REPEAT one, such that the total time in both terminal links was equal. The TO duration was determined within each session. For example, if it took the pigeon 50 s to complete the VARY terminal link, but only 30 s to complete the REPEAT terminal link, an additional 20-s period was introduced after the fifth REPEAT reinforcer was delivered (see Figure 1). During the TO, the keylights were off but the houselight remained on. The delivery of the fifth reinforcer, or the end of the TO, initiated a 5-s ITI, followed by the start of another initial link. The ITI and BO were similar except that: (a) ITIs did not include the presentation of the tone, and (b) ITIs occurred at the end of the terminal link whereas BOs occurred after incorrect sequences. The first two terminal links of the concurrent-chains schedule in each session were forced ones, with one occurrence of either the VARY or the REPEAT components randomly ordered across sessions. They were considered warm-up terminal links and were excluded from the data analysis.

For Pigeons P10 and P20, pecks on the left initial-link key (Key 2) produced the VARY terminal link that operated on Keys 1 and 2 whereas pecks on the right initial-link key (Key 3) led to the REPEAT terminal link, which was in effect on Keys 3 and 4. For Pigeons P30 and P40, left initial-link key responses initiated the REPEAT terminal link (Keys 1 and 2) whereas right initial-link key responses initiated the VARY terminal link (Keys 3 and 4).

Table 1 shows the order of experimental conditions and the number of sessions per condition. The lag variability criterion in the VARY terminal link was manipulated across experimental conditions. In the initial condition, the pigeons were exposed to a Lag 5 criterion. Two pigeons then were exposed to Lag 1 followed by Lag 10, and the remaining 2 to Lag 10 followed by Lag 1. Finally, all pigeons were returned to Lag 5. Each lag requirement was in effect until the relative number of responses in the initial link (REPEAT responses/total responses), averaged over three sessions, differed by no more than 0.05 from the average of the three previous sessions. Sessions were conducted 6 days a week. Each session ended after eight left and eight right terminal links, excluding warm-up terminal links.

Table 1. Total number of sessions and average number of sequences per reinforcer for the REPEAT and VARY terminal links in each condition of Experiment 1.

Data are averaged over six sessions. Standard deviations are shown in parentheses.

Subject   Condition   Sessions   Sequences per reinforcer
                                 Repeat        Vary
P10       Lag 5       32         1.1 (0.1)     1.8 (0.3)
          Lag 1       31         1.1 (0.1)     1.3 (0.1)
          Lag 10      36         1.1 (0.1)     2.1 (0.2)
          Lag 5       35         1.2 (0.1)     1.6 (0.2)
P20       Lag 5       48         1.1 (0.0)     1.9 (0.2)
          Lag 1       36         1.1 (0.1)     1.2 (0.1)
          Lag 10      34         1.1 (0.1)     2.5 (0.2)
          Lag 5       33         1.1 (0.1)     1.7 (0.1)
P30       Lag 5       35         1.3 (0.1)     1.7 (0.2)
          Lag 10      30         1.3 (0.1)     2.4 (0.4)
          Lag 1       32         1.4 (0.2)     1.5 (0.1)
          Lag 5       35         1.3 (0.1)     1.7 (0.2)
P40       Lag 5       38         1.3 (0.1)     1.5 (0.1)
          Lag 10      32         1.4 (0.1)     2.5 (0.3)
          Lag 1       34         1.2 (0.1)     1.1 (0.1)
          Lag 5       33         1.1 (0.4)     1.5 (0.1)

Results

The following analyses are based on choice responding in the initial links and on the REPEAT and VARY performances in the terminal links during the last six sessions of each condition.

Performance in the Initial Links

Figure 2 shows the results on choice between the varying and repeating contingencies. Choice was measured by dividing the number of responses on the initial-link key correlated with the REPEAT terminal link by the total number of responses in the initial links. Choice proportions of 0.5 indicate that responding was distributed equally between the REPEAT and VARY keys; higher proportions indicate greater responding on the REPEAT key. Preference for the REPEAT terminal link was lowest with the Lag 1 requirement and highest with the Lag 10 requirement in the VARY component, although the Lag 1 and Lag 5 preferences sometimes overlapped. That is, shifts in preference for REPEAT tracked the lag criterion: a direct relation was obtained between preference for the REPEAT terminal link and the extreme values of the lag requirement (Lag 1 and Lag 10).

Fig. 2. Proportion of REPEAT choices in the initial links for each condition of Experiment 1. Data are averaged over six sessions. Error bars represent one standard deviation.

Performance in the Terminal Links

Because one purpose of Experiment 1 was to evaluate the suitability of concurrent-chains schedules for studying choice between varying and repeating responses, the main requirement was that the REPEAT and VARY contingencies produce distinct terminal-link performances. Figure 3 shows U values for each pigeon across conditions. The U value is an index of overall sequence variability calculated according to the following equation:

U = \frac{-\sum_{i=1}^{n} p_i \log_2 p_i}{\log_2 n}

where p_i is the probability of occurrence of sequence i, and n is the number of possible sequences, or 16 (Neuringer, 1991; Page & Neuringer, 1985). According to the U statistic, if each of the 16 possible sequences were emitted equally often in a given session, U would equal 1; if only one sequence were emitted, U would equal 0. The degree of behavioral variability engendered by REPEAT contingencies (open bars) was much lower than that obtained with VARY contingencies (filled bars), with REPEAT generating low and approximately constant U values for all pigeons, and VARY producing U values that changed with the lag criterion for Pigeons P10, P20, and P30. For these pigeons, the Lag 1 condition engendered less sequence variation than the Lag 5 condition, which tended to engender levels of sequence variation close to those observed for the Lag 10 condition. Sequence variability of Pigeon P40 was high regardless of the lag manipulations in the variability requirement.
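For concreteness, the U statistic can be computed directly from a session's emitted sequences; the sketch below simply transcribes the equation above (function name illustrative):

```python
import math
from collections import Counter

def u_value(sequences, n_possible=16):
    """U statistic: Shannon entropy of the emitted-sequence distribution,
    normalized by log2 of the number of possible sequences."""
    counts = Counter(sequences)
    total = len(sequences)
    entropy = sum((c / total) * math.log2(total / c) for c in counts.values())
    return entropy / math.log2(n_possible)

print(u_value(["LRRR"] * 40))  # 0.0: a single sequence emitted exclusively
all_16 = [a + b + c + d for a in "LR" for b in "LR" for c in "LR" for d in "LR"]
print(u_value(all_16))         # 1.0: all 16 sequences equally often
```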

Fig. 3. U values in the REPEAT (open bars) and VARY (filled bars) terminal links for each condition of Experiment 1. Data are averaged over six sessions. Error bars represent one standard deviation.

Figure 4 shows the percentage of correct sequences in the REPEAT (open bars) and VARY (filled bars) terminal links for each pigeon in each condition. This measure was calculated by dividing the number of sequences that met the REPEAT (or the VARY) requirement by the total number of REPEAT (or VARY) sequences in a session, and then multiplying the result by 100. The percentage of correct sequences was greater in the REPEAT than in the VARY terminal link across conditions, with the exception of the Lag 1 condition for Pigeon P40. The percentage of REPEAT correct sequences was constant throughout the experiment whereas the percentage of VARY correct sequences changed inversely with manipulations in the lag criterion.

Fig. 4. Percentage of reinforced sequences in the REPEAT (open bars) and VARY (filled bars) terminal links for each condition of Experiment 1. Data are averaged over six sessions. Error bars represent one standard deviation.

Taken together, Figures 3 and 4 indicate that, with more stringent criteria, VARY behavior tended to be more variable, but such variation was not necessarily followed by greater reinforcement. Across conditions, under static contingencies, REPEAT behavior remained both stereotyped and effective in producing reinforcement.

To examine further the performance in the terminal links, the percentage of occurrence of each of the 16 possible sequences in the VARY terminal links under each experimental condition is plotted in Figure 5. Congruent with the U values of Figure 3, frequency distributions varied with the lag criterion, except for Pigeon P40. The Lag 10 condition produced the flattest, and the Lag 1 condition the sharpest, frequency distributions. An alternative way to summarize the variability shown in Figure 5 is to consider each pigeon's four most frequently emitted sequences as a percentage of total sequence occurrences. As lag increased, this percentage tended to decrease. Across Lags 1, 5 (averaged across determinations), and 10, respectively, the percentages were: for Pigeon P10, 86.1%, 58%, and 37.2%; for Pigeon P20, 91.5%, 56.1%, and 44.2%; for Pigeon P30, 88.0%, 47.1%, and 42.3%; and for Pigeon P40, 48.1%, 44.7%, and 44.2%.
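The top-four summary is a simple frequency computation; a sketch follows, in which the sample data are hypothetical illustrations, not the obtained distributions:

```python
from collections import Counter

def top4_percentage(sequences):
    """Percentage of all emitted sequences accounted for by the four most
    frequent ones; higher values indicate greater higher-order stereotypy."""
    counts = Counter(sequences)
    top4 = sum(c for _, c in counts.most_common(4))
    return 100.0 * top4 / len(sequences)

# Hypothetical repertoire dominated by a few sequences:
sample = ["LRRR"] * 30 + ["RLLL"] * 25 + ["LLRR"] * 20 + ["LLLL"] * 15 + ["RRLL"] * 10
print(top4_percentage(sample))  # 90.0
```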

Fig. 5. Frequency distributions of the 16 possible sequences in the VARY terminal link in Experiment 1. In each graph, and for each condition, sequences are ordered from left to right, from the most to the least frequent one. Data are averaged over six sessions.

There is evidence that the stereotypy in the VARY terminal link during the Lag 1 and Lag 5 conditions may reflect efficient responding, as long as such stereotypy comprises sequences with minimal switching. Figure 6 presents the relative frequency distributions of the number of switches per sequence for the VARY responding across conditions. Individual and average data are shown in the left and right columns, respectively. The binomial distribution predicted by random responding, that is, when the 16 possible sequences occur equally often, is also presented in the right column (dashed function). The left column indicates that intersubject differences in the switching distribution tended to decrease as the lag criterion increased. With Lag 1, the emission of any two sequences in alternating order would produce the reinforcer, and because there were 16 possible sequences, the two selected sequences were expected to differ across subjects. With Lag 10, however, several different sequences (at least 11) were required for reinforcement such that sequence overlap across subjects would be the rule. The right column shows that under all lag contingencies, one-switch sequences occurred more often than sequences incorporating zero, two, or three switches. In the Lag 1 condition, mean performance tended to deviate from random in that zero-switch sequences were overrepresented and two-switch and three-switch sequences were underrepresented. As the lag criterion increased, performance increasingly approximated the random distribution.
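Both the switch counts and the random reference distribution in Figure 6 follow from elementary definitions: each four-response sequence has three transitions, and under random responding each transition is a switch with probability .5, so the number of switches is Binomial(3, .5). A sketch:

```python
from math import comb

def switches(seq):
    """Number of key switches across the three transitions of a
    four-response sequence, e.g., 'RLRL' -> 3 and 'RRRR' -> 0."""
    return sum(a != b for a, b in zip(seq, seq[1:]))

# Random responding (all 16 sequences equiprobable) implies Binomial(3, .5):
random_dist = [comb(3, k) / 2 ** 3 for k in range(4)]
print(random_dist)  # [0.125, 0.375, 0.375, 0.125]
print(switches("LRRR"), switches("LLLL"), switches("RLRL"))  # 1 0 3
```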

Fig. 6. Frequency distributions of the number of switches per sequence for the VARY performance in each condition of Experiment 1. Data are averaged over six sessions for Lag 1 and Lag 10 conditions, and over 12 sessions for Lag 5. The left columns indicate individual performances, and the right columns show average and random performances.

Choice versus Performance in the Terminal Links

Figure 7 shows the log proportion of REPEAT choices in the initial links as a function of the log proportion of REPEAT reinforced sequences in the terminal link. Solid lines are fitted least-squares regression lines. The equation of the fitted line (y = ax + b, where a is the slope and b is the intercept) appears in each graph. Preference for REPEAT increased as the relative percentage of reinforced sequences in the REPEAT terminal link increased. The slope values showed that proportional changes in REPEAT preference were greater than proportional changes in the percentage of reinforced sequences. The R2 values ranged from .52 to .95, suggesting that changes in preference may be accounted for by the relative probability of REPEAT versus VARY reinforcement in the terminal links.
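A sketch of this analysis in Python: a least-squares fit of log choice proportion on log reinforced-sequence proportion, returning the slope, intercept, and R² reported in the figure. The numeric inputs below are hypothetical illustrations, not the obtained data.

```python
import numpy as np

def fit_choice_function(rep_choice_prop, rep_reinf_prop):
    """Fit y = a*x + b by least squares, where y is the log proportion of
    REPEAT initial-link responses and x is the log proportion of REPEAT
    reinforced sequences; returns slope, intercept, and R^2."""
    x = np.log10(np.asarray(rep_reinf_prop))
    y = np.log10(np.asarray(rep_choice_prop))
    a, b = np.polyfit(x, y, 1)
    r2 = np.corrcoef(x, y)[0, 1] ** 2
    return a, b, r2

# Hypothetical condition means (illustration only):
a, b, r2 = fit_choice_function([0.45, 0.55, 0.70, 0.62],
                               [0.48, 0.52, 0.58, 0.55])
print(f"slope = {a:.2f}, intercept = {b:.2f}, R2 = {r2:.2f}")
```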

Fig. 7. Proportion of REPEAT choices in the initial links as a function of the proportion of REPEAT reinforced sequences in the terminal links of Experiment 1. Solid lines are fitted least-squares regression lines. Data are averaged over six sessions.

An analysis of log proportion of REPEAT choices as a function of log proportions of U values in the REPEAT terminal links, not shown here, revealed the lack of a systematic relation between preference and the obtained level of behavioral variability for all pigeons.

Discussion

Choice was a direct function of the degree of variability required by the VARY contingency. This finding is consistent with Neuringer's (1992) suggestion that pigeons choose to vary and repeat, but the present use of a concurrent-chains schedule demonstrates such preference in the absence of the confounding effects of the direct action of the reinforcement contingency on the choice responses. Although overall reinforcement rate was held constant, thereby eliminating this variable as a source of control, several other indirect variables (Zeiler, 1977) also might have contributed to the choice along with the direct variable of variability requirements.

If behavioral variability per se influences choice, a systematic relation between the U value and preference must be obtained. The results obtained with Pigeon P40 indicated that preference changed even though the U values remained constant across conditions.

For the other pigeons, the behavioral variability obtained in the terminal link also was approximately similar with the Lag 5 and Lag 10 criteria, indicating that this variable per se was not likely a factor in the choice performances beyond the Lag 1 criterion. The percentage of correct sequences (or the probability of reinforcement), however, may not be ruled out as a potential source of control. As the lag criterion increased, the relative number of incorrect VARY sequences per reinforcer increased, and preference for REPEAT increased correspondingly. The TOs at the end of the REPEAT terminal links also may have affected choice. Preference for terminal links delivering single versus multiple reinforcers is accentuated when TOs follow the single-reinforcer terminal link (Poniewaz, 1984; cf. also Dunn, Williams, & Royalty, 1987; Logan, 1965; Snyderman, 1983). These studies together suggest that the addition of TOs in one terminal link does not change the direction of preference, but it leads to less extreme preference for that terminal link. An analogous effect may have occurred here such that the inclusion of TOs in the REPEAT terminal link may have attenuated the degree of preference for that alternative.

Another variable that was not constant was the delay to the first, and therefore to the four subsequent, reinforcers in each terminal link. Time to the first and subsequent reinforcers in terminal links of concurrent-chains schedules can affect initial-link responding, with greater preference for terminal links that represent relatively greater delay reduction to reinforcement (e.g., Fantino, 1969; Fantino, Preston, & Dunn, 1993; Mazur, 1986; Shull, Spear, & Bryson, 1981). Although data on such delays were not obtained in this experiment, the general effect can be discerned from the extant data. During the Lag 1 condition, because the duration as well as the number of sequences per reinforcer in the VARY terminal link were comparable to that in the REPEAT terminal link (see Table 1), the delays to each reinforcer in either link were expected to be similar. With the two longer lag requirements, it is likely that the REPEAT terminal link involved relatively shorter delays to reinforcement because it was increasingly probable that a nonreinforceable VARY sequence would be emitted (see Table 1). The initial-link responding across the two keys is consistent with these observations in that preference for REPEAT increased with increased lag requirements. The second experiment investigated this issue.

Experiment 2

This experiment was performed to evaluate the effects of the obtained variability on preference in the absence of the confounding effects of delay to reinforcement and TOs. The interreinforcer intervals (IRIs) in the REPEAT terminal link were yoked to the IRIs in the VARY terminal link such that the REPEAT contingency closely replicated the delays to the first and subsequent reinforcers obtained with the VARY contingency. Because of this procedure, it also was possible to eliminate the TOs following the REPEAT terminal links.

Method

Subjects

Four experimentally naive homing pigeons (P50, P60, P70, and P80) were housed in individual cages, with water and grit freely available, and with a dark/light cycle in effect. Each was maintained at approximately 80% of its free-feeding body weight throughout the experiment.

Apparatus

The apparatus was slightly different from that in Experiment 1. The experimental chamber measured 35 cm long, 28 cm deep, and 28 cm high. The work panel contained four translucent keys, 2.5 cm in diameter, displayed horizontally on the wall, and located 18 cm above the floor. Keys 1 and 2, and Keys 3 and 4, were 3.5 cm apart, whereas Keys 2 and 3 were 7 cm apart. A Gerbrands food magazine delivered mixed grain through a 4-cm by 4-cm opening centered on the work panel, and located 4 cm above the floor. A white houselight was set in the middle of the opposite wall, 19 cm above the floor. The chamber did not contain a Sonalert® tone generator. The other aspects of the chamber were identical to those described for the chamber used in Experiment 1. All events were controlled by a 486 DX2 40 MHz microcomputer connected to the chamber by a MED-PC® interface system.

Procedure

The preliminary REPEAT/VARY training was as in Experiment 1. The pigeons then were exposed to concurrent-chains training that was similar to that in Experiment 1, with four exceptions. First, a VI 30-s schedule was superimposed on the VARY contingencies such that each sequence produced one of the following consequences: (a) if the sequence met the lag criterion after the current interval had timed out, a 3-s reinforcer was delivered; (b) if the sequence met the lag criterion but the current interval had not elapsed, the feeder and the feeder light were operated for 0.5 s; and (c) if the sequence did not meet the lag criterion, a 5-s BO was initiated. A VI schedule also was superimposed on the REPEAT contingency such that: (a) a 3-s reinforcer followed LRRR sequences only when they were emitted after the current interval had elapsed; (b) if the LRRR sequence was emitted before the VI timed out, the feeder and the feeder light were operated for 0.5 s; and (c) any other sequence produced a 5-s BO. The critical feature of this arrangement was that the IRIs in the current REPEAT terminal links were yoked to the IRIs in the last VARY terminal link. To illustrate, if the first VARY reinforcer was delivered 35 s after the beginning of the terminal link, and the second, third, fourth, and fifth reinforcers were delivered 45 s, 20 s, 10 s, and 40 s after the previous one, respectively, then the first REPEAT reinforcer also was programmed to occur 35 s after the onset of the terminal link, and the remaining four reinforcers were programmed to be delivered, following a correct sequence, 45 s, 20 s, 10 s, and 40 s after the preceding one, respectively.
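The yoking logic can be illustrated with the worked example in the text: the five IRIs recorded in a VARY terminal link become the scheduled intervals for the next REPEAT terminal link, and a correct LRRR sequence produces the 3-s reinforcer only after the current yoked interval has elapsed. Function names and outcome labels below are ours, not the authors':

```python
def iris_from_times(reinforcer_times_s):
    """Convert clock times of the five VARY reinforcers (measured from
    terminal-link onset) into interreinforcer intervals for yoking."""
    iris, last = [], 0.0
    for t in reinforcer_times_s:
        iris.append(t - last)
        last = t
    return iris

def repeat_consequence(is_lrrr, interval_elapsed):
    """Outcome of a sequence in the yoked REPEAT terminal link."""
    if not is_lrrr:
        return "5-s blackout"
    if interval_elapsed:
        return "3-s reinforcer"
    return "0.5-s hopper presentation"  # correct, but before the yoked VI timed out

# The worked example from the text: reinforcers at 35, 80, 100, 110, and 150 s
print(iris_from_times([35.0, 80.0, 100.0, 110.0, 150.0]))
# -> [35.0, 45.0, 20.0, 10.0, 40.0]
```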

The second difference concerned the auditory stimuli. In the present experiment, a tone was not programmed in either terminal link. The third difference was that each session ended after six REPEAT and six VARY terminal links, excluding the warm-up ones (cf. Exp. 1). The final difference was that only two lag criteria, Lag 1 and Lag 10, were programmed in different conditions during the VARY terminal link. Table 2 shows the order of conditions and the number of sessions at each condition for all pigeons. Other details of the procedure were as described in the first experiment.

Table 2. Total number of sessions, interreinforcer intervals, and number of reinforced sequences per reinforcer for the REPEAT and VARY terminal links in each condition of Experiment 2.

Data are averaged over six sessions. Standard deviations are shown in parentheses.

Subject   Condition   Sessions   Interreinforcer intervals (s)    Sequences per reinforcer
                                 Repeat        Vary               Repeat        Vary
P50       Lag 1       62         36.2 (3.2)    31.9 (2.5)         5.3 (0.4)     5.0 (0.4)
          Lag 10      56         49.0 (9.4)    39.8 (7.1)         5.4 (0.8)     4.4 (0.3)
          Lag 1       17         35.3 (3.9)    32.0 (2.7)         4.6 (0.6)     4.6 (0.5)
P60       Lag 1       67         39.9 (3.7)    31.8 (2.0)         4.6 (0.2)     4.3 (0.4)
          Lag 10      30         44.1 (6.1)    31.2 (5.2)         6.1 (0.7)     4.9 (0.5)
          Lag 1       14         37.3 (0.6)    30.2 (0.2)         5.5 (0.2)     4.6 (0.3)
P70       Lag 10      63         59.2 (6.4)    43.7 (3.8)         10.3 (1.0)    5.2 (0.6)
          Lag 1       62         39.9 (2.3)    34.1 (2.2)         6.0 (0.6)     4.4 (0.4)
          Lag 10      41         49.7 (1.6)    38.1 (1.8)         7.6 (0.3)     4.2 (0.2)
P80       Lag 10      63         41.9 (23.2)   32.4 (3.8)         7.4 (1.7)     4.7 (0.6)
          Lag 1       56         31.1 (2.2)    27.8 (2.1)         5.9 (0.4)     5.1 (0.4)
          Lag 10      16         46.0 (3.8)    37.0 (3.8)         7.6 (0.8)     4.7 (0.3)

Results

The data in Table 2 show that the IRIs for VARY were slightly longer than the average IRI programmed (30 s). This occurred because the interval rarely timed out immediately after a correct sequence was emitted; consequently, after the current interval had elapsed, the emission of a correct sequence added time to the IRI for both REPEAT and VARY. This additional time was greater with Lag 10 than with Lag 1 due to the greater number of incorrect sequences in the VARY terminal link with the more restrictive requirement. Also, the IRIs in the REPEAT terminal links were consistently slightly longer (i.e., the unconditioned reinforcement rates were lower) than those in the VARY terminal links (t = .526, p = .001 for Lag 1; t = 6.623, p = .001 for Lag 10). A least-squares regression analysis, not shown here, revealed no systematic relation between rates of unconditioned reinforcement and preference.

Performance in the Initial Links

For each pigeon, the proportion of REPEAT choices increased when the lag criterion increased, as shown in Figure 8. Also, the differences between choice proportions in Lag 1 and Lag 10 conditions were more accentuated than in Experiment 1, probably due to the elimination of TOs in the REPEAT terminal link (cf. Dunn et al., 1987; Logan, 1965; Poniewaz, 1984; Snyderman, 1983).

Fig. 8. Proportion of REPEAT choices in the initial links for each condition of Experiment 2. Data are averaged over six sessions. Error bars represent one standard deviation.

Performance in the Terminal Links

Figure 9 indicates the U values for each pigeon across the three conditions. The U values for the REPEAT terminal link (open bars) were lower than those for the VARY terminal links (filled bars), and were not systematically affected by changes in the lag criterion. The U values for VARY tended to be higher under the Lag 10 criterion than under the Lag 1 criterion.

Fig. 9. U values in the REPEAT (open bars) and VARY (filled bars) terminal links for each condition of Experiment 2. Data are averaged over six sessions. Error bars represent one standard deviation.

The percentage of correct sequences, shown in Figure 10, was calculated by dividing the number of sequences that met the REPEAT (or VARY) criterion, including sequences that produced the 0.5-s food presentation (conditioned reinforcer) and those that produced the 3-s food presentation, by the total number of REPEAT (or VARY) sequences in a session, and then multiplying the result by 100. The percentage of REPEAT correct sequences (open bars) remained constant across conditions whereas the percentage of VARY correct sequences (filled bars) changed with the lag criterion. With the Lag 1 criterion, both terminal links produced high and similar percentages of correct sequences; with the Lag 10 criterion, a higher percentage of correct sequences was engendered by the REPEAT contingency than by the VARY one.

Fig. 10. Percentage of reinforced sequences in the REPEAT (open bars) and VARY (filled bars) terminal links for each condition of Experiment 2. Data are averaged over six sessions. Error bars represent one standard deviation.

The results of Figures 9 and 10 indicate that stringent variability requirements produced greater variation in responding than did less stringent ones, but such greater variation was not necessarily accompanied by a higher frequency of sequences that met the requirements for reinforcement (see also Page & Neuringer, 1985).

Figure 11 shows frequency distributions of each of the 16 possible sequences in the VARY terminal link. As in the first experiment, the Lag 10 condition produced flatter distributions than the Lag 1 condition. The percentage of total sequences represented by the four most frequently emitted ones varied as a function of the lag. In the Lag 1 and Lag 10 (averaged across determinations) conditions, respectively, the percentages were: for Pigeon P50, 68.2% and 39.2%; for Pigeon P60, 89.9% and 38.6%; for Pigeon P70, 70.4% and 59.6%; and for Pigeon P80, 51.3% and 40.1%. These results suggest, as in Experiment 1, that the pigeons' behavior varied only enough to meet the lag criterion, and with minimal switching between keys.

Fig. 11. Frequency distributions of the 16 possible sequences in the VARY terminal link in Experiment 2. In each graph, and for each condition, sequences are ordered from left to right, from the most to the least frequent one. Data are averaged over six sessions.

Figure 12 shows the relative frequency distributions of the number of switches per sequence. The left column indicates individual data and the right column shows average data as well as the binomial distribution predicted by random performance. The results replicated those found in Experiment 1. For all subjects, one-switch sequences were either the most frequent ones (Lag 1) or occurred as often as two-switch sequences (Lag 10). Average performance tended to deviate from random in the Lag 1 condition in that one-switch sequences were overrepresented and two-switch sequences were underrepresented. With the Lag 10 requirement, switching was close to random.

Fig. 12. Frequency distributions of the number of switches per sequence for the VARY performance in each condition of Experiment 2. For Pigeons P50 and P60, data are averaged over 12 sessions for Lag 1, and over six sessions for Lag 10; for Pigeons P70 and P80, data are averaged over six sessions for Lag 1, and over 12 sessions for Lag 10. The left columns indicate individual performances, and the right columns show average and random performances.

Choice versus Performance in the Terminal Links

Figure 13 shows the relation between the log proportion of REPEAT choices and the log proportion of sequences reinforced (by either conditioned or unconditioned reinforcers) in the REPEAT terminal link. As in Experiment 1, preference for the REPEAT contingency varied directly with the proportion of REPEAT reinforced sequences. Again, the slope values indicate that changes in preference were greater than changes in reinforced sequences. Variance in the data accounted for by the fitted functions averaged 92%.

Fig. 13. Proportion of REPEAT choices in the initial links as a function of the proportion of REPEAT reinforced sequences in the terminal links of Experiment 2. Solid lines are fitted least-squares regression lines. Data are averaged over six sessions.

The log proportion of REPEAT choices also was analyzed as a function of the log proportion of U values in the REPEAT terminal link. In contrast to Experiment 1, an inverse relation was found between preference for the REPEAT terminal link and the degree of behavioral variability for the pigeons exposed to the Lag 1–Lag 10–Lag 1 order of conditions (R2 = .73 for Pigeon P50, and R2 = .80 for Pigeon P60).

Discussion

The increased preference for the REPEAT terminal link as the lag requirement increased replicated the effect obtained in the first experiment. Here, however, the increase occurred without the confounding effects in Experiment 1 of either the added TO at the end of the REPEAT terminal links or the delays to unconditioned reinforcement in the terminal links. Although the IRIs for REPEAT were slightly longer than those for VARY, this effect was observed similarly with the Lag 1 and Lag 10 conditions. Also, although the IRIs were longer with Lag 10 than with Lag 1, this effect occurred for both REPEAT and VARY terminal links. Thus the IRI bias was in the same direction in both terminal links but performance still tracked the lag contingency requirement, suggesting that the IRI bias was not the controlling variable of the performance.

All pigeons preferred the VARY terminal link under the Lag 1 condition. At first glance, it could be argued that such preference resulted from a relatively greater delay reduction to conditioned reinforcement in the VARY terminal link. That is, the pigeons could meet the Lag 1 criterion by emitting only the two sequences with no switches (i.e., LLLL or RRRR). Sequences with no switches imply a shorter delay to reinforcement than sequences with one or more switches, such as the REPEAT sequence (i.e., LRRR). Several factors, however, challenge this argument. First, the switching analysis revealed that sequences with one or more switches corresponded to about 80% of the total sequence occurrences (see Figure 12). Second, because the number of sequences per reinforcer and the probability of conditioned plus unconditioned reinforcers were similar in the REPEAT and VARY terminal links, as shown in Table 2 and Figure 10, respectively, the delays to reinforcement should not have differed between the two contingencies. It follows, then, that the higher frequency of VARY choices in the Lag 1 condition may reflect control by variation per se. This latter result is consistent with the finding of preference for variation in a number of investigations of choice between fixed versus variable schedules (e.g., Fantino, 1967; Mazur & Romano, 1992).

Whether the U values were a good predictor of preference depended on the order of lag-criterion presentation. In the order of Lag 1–Lag 10–Lag 1, preference for REPEAT varied directly with U value in the VARY terminal link. This relation did not hold for the opposite order of lag-criterion presentations. That is, with the Lag 10–Lag 1–Lag 10 order of conditions, changes in the lag criterion were accompanied by changes in preference even though the U values remained high and similar across conditions. Although these results, by themselves, do not necessarily rule out variation as a controlling variable of choice, they certainly suggest that other, indirect, variables were affecting preference for variation.

Under the Lag 1 condition, the percentage of correct sequences in the REPEAT and VARY terminal links was approximately equal. Under the Lag 10 condition, there were about twice as many correct response sequences in the REPEAT terminal link. Thus preference for the REPEAT terminal link covaried with the relative percentage of correct REPEAT sequences. These correct response sequences represent a mix of those followed by unconditioned reinforcers according to the VI schedule and those followed by conditioned reinforcers (0.5-s hopper presentations). This raises the possibility that the indirect variable of greater probability of conditioned reinforcement in the REPEAT terminal link was responsible for the increased preference for that link during the Lag 10 condition. Such an interpretation, however, is challenged by results reported by Schuster (1969). In his Experiments 3 and 4, pigeons responded on concurrent-chains schedules in which one terminal link provided only unconditioned reinforcers and the other provided both unconditioned reinforcers and additional stimuli that were paired with those unconditioned reinforcers (which he labeled SEs). He found that “after continued exposure to the SE schedule, it was avoided by 7 of 9 birds” (p. 220). In relation to the present results, Schuster's data suggest that the larger number of conditioned reinforcers in the REPEAT terminal link would decrease, rather than increase, preference for that link.

Another indirect controlling variable may be represented by delay to reinforcement. Although the delays to unconditioned reinforcers were approximately similar in both terminal links, the same may not be true with respect to the delays to conditioned reinforcers. In accord with the delay reduction hypothesis (Fantino, 1969; Fantino et al., 1993), it might be argued that the increase in preference for the REPEAT terminal link, obtained with the Lag 10 criterion, occurred because the high percentage of incorrect sequences probably was accompanied by relatively longer delays to the conditioned reinforcers. With the Lag 1 criterion, however, because the delays to reinforcement were comparable across terminal links, preference for either contingency should not develop. The present findings are consistent with the first (Lag 10), but not with the second (Lag 1), prediction.

General Discussion

Using concurrent-chains schedules, these two experiments support and extend Neuringer's (1992) suggestion that contingencies on varying and repeating can affect choice. The control of responding by contingencies requiring response stereotypy or response variability was similar to that previously reported (e.g., Cohen et al., 1990; McElroy & Neuringer, 1990; Neuringer, 1991, Experiments 2 and 3; Page & Neuringer, 1985). In addition, behavioral variation tended to change directly with the degree of variability required by the reinforcement contingencies (cf. Morris, 1989; Page & Neuringer, 1985). Preference for either of the two contingencies was directly related to the variability requirement.

Directly imposing the variability requirement in a choice procedure also brings into play several indirect variables within the concurrent-chains procedure that may have contributed to the preferences. One (differences in reinforcement rate) was controlled in the first experiment and others (unconditioned reinforcement delay and TO) were controlled in the second experiment. With these variables held constant, preference for the repeat terminal link increased with increasing variability requirements in the VARY terminal link.

A related but different question is whether repetition and variation reduce to other behavioral processes. For example, choice could be related to response switching, percentage of correct sequences, reinforcement delay, or all of the above. In the case of switching, Machado (1997) argued that variability requirements operate directly on switching behavior such that variation is only a by-product of such a process. Applying this argument to the present data, which indicated that increases in the lag criterion increased the number of switches per sequence (see Figures 6 and 12), it is plausible to assume that switching may have influenced choice. This effect might occur because increased switching functionally increases the number of responses. Hunziker et al. (1998) noted, for example, that sequences with no switches (e.g., RRRR) contain only four pecking responses whereas those with three switches (e.g., RLRL) comprise seven responses: four pecking and three switching responses. Thus increases in the number of responses could affect choice through either the effort involved or the inherently more frequent reinforcement of the shorter sequences.

In the case of percentage of correct sequences, in both experiments, increasing the lag requirements in the terminal links decreased the percentage of reinforced sequences for all subjects. As a result, it is difficult to rule out the percentage of reinforced sequences as a possible source of control over choice when VARY and REPEAT contingencies operate. If increasing the lag requirement does decrease the percentage of reinforced sequences, then the resistance to change of those sequences also might be weakened because responding correlated with less probable reinforcement is less resistant to change than responding where reinforcement is more probable (Nevin, 1974). Thus, under this circumstance, the less variable responding would be more resistant to change than would be the more variable.

At first glance, these latter findings appear to contradict those of Doughty and Lattal (2001), who showed that variable response sequence production is more resistant to change than repeat sequence production. Doughty and Lattal, however, held reinforcement probability constant for the repeat and vary sequences, thereby eliminating reinforcement differences as a factor in their results. Their results, along with an observation of Nevin and Grace (2000) and the present data, do raise a theoretical question about the relation between resistance to change of repeat and vary sequences and choice. Nevin and Grace concluded that preference and resistance to change are positively correlated. As noted, Doughty and Lattal concluded that varied response sequences are more resistant to change than repeated ones. Thus their data would predict that, under the parameters of their experiment, there would be preference for the VARY requirement. The present results, however, may qualify this prediction in the following way. Because there was not always greater preference for the VARY requirement over the REPEAT requirement during each of the different lag requirement conditions of the present experiment, greater resistance to change would be predicted to occur only under contingencies giving rise to greater preference for the VARY condition. Furthermore, the relation may not be a simple either–or one but rather may be a graded function of the degree of preference or, going the other way, resistance to change.

Although delays to either unconditioned or conditioned reinforcement were not measured, the analysis of the switching responses, as well as of the percentage of reinforced sequences, suggests that increases in the lag requirement probably were accompanied by relatively greater reduction in time to REPEAT reinforcement. As a consequence, the delay reduction hypothesis would predict an increasing preference for the REPEAT terminal link (Fantino, 1969; Fantino et al., 1993). It is important to note, however, that when the terminal links comprise multiple reinforcers, the delay to each reinforcer, not only the delay to the first reinforcer (as in Fantino's studies), may differentially contribute to preference. In a study by Shull et al. (1981, Experiment 2), for example, when the delays to the first and third reinforcers were held constant, preference decreased as the delay to the second reinforcer increased. A similar effect was obtained when four reinforcers were available and only the delay to the third reinforcer was manipulated. Also, the magnitude of changes in preference was greater with manipulations of the delay to the second than to the third reinforcer (see also Mazur, 1986; McDiarmid & Rilling, 1965). These findings indicate that measuring time to reinforcement from the onset of the choice period to the delivery of the last reinforcer available in multiple-reinforcer terminal links may not produce results comparable to those obtained with standard-delay studies in which time to the first reinforcer is the critical variable.

Although the present results suggest that the indirect variables discussed so far may be better predictors of preference between varying and repeating behavior than the U values, they do not rule out behavioral variation per se as a contributor to preference. Two aspects of the present data suggest that variation may be a critical variable. First, when delay to unconditioned reinforcement was held constant across terminal links, preference for REPEAT varied inversely with the relative U values in the REPEAT terminal link for 2 pigeons (P50 and P60). Second, and most important, greater preference for VARY was observed for all pigeons in both experiments when the Lag 1 condition was in effect. That is, when switching, probability of reinforcement, and delay to reinforcement were constant such that the terminal links differed primarily with respect to varying or repeating behavior, preference for variation occurred. This finding may be related to that reported by Catania (1975, 1980). In his studies, pigeons were exposed to a concurrent-chains schedule and required to choose between a terminal link with two keys correlated with concurrent fixed-interval (FI) schedules (free choice) and a terminal link with only one FI key (forced choice). The values of the FI schedules were identical in both terminal links. This arrangement is comparable to that of the Lag 1 condition. The forced-choice terminal link resembles the REPEAT terminal link in that both included only one alternative to reinforcement, responding on a single key or emitting a specific sequence, respectively. The free-choice and VARY terminal links comprised a greater number of alternatives to reinforcement, responding on two keys or emitting up to 16 different sequences (although the pigeons developed a stereotyped pattern of three or four sequences), respectively. Also, the probability of and delay to reinforcement were identical in the free- versus forced-choice, and REPEAT versus VARY, terminal links. With both arrangements, as in Catania's experiments, preference for the terminal link providing variable alternatives to reinforcement was observed.

One question that follows from the present results is why a systematic relation between preference and behavioral variability was not observed with the more stringent lag criteria, that is, why preference for variation decreased as higher levels of variability were required for reinforcement. The findings from the present experiments suggest that when behaving repetitively or variably are similarly adaptive (as in the Lag 1 condition), preference for variation is observed. But when the task of generating new response sequences increasingly leads to less profitable consequences (lower probability of reinforcement, longer delays to reinforcement), as in the Lag 5 and Lag 10 conditions, preference for repetition progressively increases.

The present results also revealed higher-order stereotypies in the VARY terminal link: variation in behavior was just sufficient to meet the contingency. Although the 16 possible sequences could have been emitted with similar frequencies, the pigeons developed a predominant pattern of about three (in the Lag 1 condition) or eight (in the Lag 5 condition) sequences, an efficient strategy in those conditions. With the most stringent criterion (Lag 10), however, sequence stereotypy was not observed, in that the pigeons tended to emit all sequences with equal probability. These findings are consistent with previous reports of higher-order stereotypies with humans (Barret, Deitz, Gaydos, & Quinn, 1987; Schwartz, 1982b). Considering that stereotypy occurred within a context of variable behavior, it can be argued that reinforcement maintained variability while at the same time selecting particular instances of behavior within the range of variation. That is, because reinforcers engender repetition, it is not surprising that repetitions occurred in both terminal links; but the occurrence of repetitions also depended on the extent to which the contingency allowed them. Neuringer (1993) demonstrated that reinforcement can promote variation and selection simultaneously. Another possibility is that stereotypy reflects efficient responding, as suggested by Schwartz (1982a).
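
For readers unfamiliar with the U statistic, it is conventionally computed as the relative entropy of the obtained distribution of sequences (cf. Page & Neuringer, 1985), so that U = 1.0 when all 16 possible sequences occur equally often and U = 0 when a single sequence occurs exclusively. The short sketch below shows that computation under this assumption; the function name and the example values are ours, not the authors'.

    import math
    from collections import Counter

    def u_value(sequences, n_possible=16):
        """Relative entropy of a sequence distribution:
        0 = complete stereotypy, 1 = maximal variability."""
        counts = Counter(sequences)
        total = len(sequences)
        entropy = -sum((c / total) * math.log2(c / total)
                       for c in counts.values())
        return entropy / math.log2(n_possible)

    # A repertoire dominated by about three sequences (cf. Lag 1)
    # yields a low U; near-equal use of all 16 (cf. Lag 10) yields
    # a U near 1.0.
    print(round(u_value(["LLRR"] * 20 + ["LRLR"] * 15 + ["RRLL"] * 15), 2))  # 0.39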

In both of the present experiments, when a shorter lag requirement followed a longer one, variability tended to remain as high, or nearly as high, under the shorter requirement as it had been under the longer one. This probably occurred because the previously established performance still met the new, shorter criterion; indeed, under these conditions the shorter lag contingency may not even have been contacted. These results are related to a finding reported by Hunziker et al. (1998). Rats completed a fixed-ratio (FR) 4 requirement with two operanda available for responding. If the FR 4 condition was preceded by a condition requiring a sequence of four responses under a Lag 4 contingency, responding under the subsequent FR 4 schedule was more variable than if the FR 4 requirement was not preceded by the Lag 4 requirement. Reinforcement rates in the FR and lag conditions were equated through a yoking procedure. Unlike in the present experiments, however, Hunziker et al. found that variability decreased with continued exposure to the FR schedule after the lag condition. This difference may have arisen, at least in part, because in Hunziker et al.'s procedure sequence variation no longer was required once contingencies changed from Lag 4 to FR 4, whereas in the present procedures some degree of variation still was required after contingencies changed from a longer to a shorter lag criterion (e.g., from Lag 10 to Lag 1).

More generally, the present findings bear on an understanding of creativity. Creative behavior corresponds to unique combinations of previously selected responses evoked by a new environment (Donahoe & Palmer, 1994). When the environment changes and novel forms of behavior are required for reinforcement, the already-selected responses set a context for new combinations. The greater the variety of responses in an individual's behavioral repertoire, the greater the number of possible combinations, and hence the more “creative” the behavior is labeled. Thus one way to promote creative behavior is to arrange contingencies that select behavioral variability. If, however, generating new behavior becomes too difficult and alternatives to varying exist, the present data suggest that behaving creatively may become less preferred. Given the benefits of creative behavior (e.g., scientific and technological innovation), experimental research like that described here is important for identifying the variables that control choosing creative over repetitive behavior.

Acknowledgments

This research was supported by a grant from the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) to the first author. Experiment 1 was derived from a doctoral dissertation submitted by the first author to West Virginia University. Portions of these data were presented at the annual meeting of the Association for Behavior Analysis (Chicago, 1997) and at the annual meetings of the Sociedade Brasileira de Psicologia (Ribeirão Preto, 1996 and 1997). The assistance of Roselany Viegas in data collection and analysis is gratefully acknowledged. We are indebted to Allen Neuringer for a constructive suggestion regarding Experiment 2.

REFERENCES

  1. Antonitis, J. J. (1951). Response variability in the white rat during conditioning, extinction, and reconditioning. Journal of Experimental Psychology, 42, 273–281.
  2. Barret, D. H., Deitz, S. M., Gaydos, G. R., & Quinn, P. C. (1987). The effects of programmed contingencies and social conditions on response stereotypy with human subjects. The Psychological Record, 37, 489–505.
  3. Catania, A. C. (1975). Freedom and knowledge: An experimental analysis of preference in pigeons. Journal of the Experimental Analysis of Behavior, 24, 89–106.
  4. Catania, A. C., & Sagvolden, T. (1980). Preference for free choice over forced choice in pigeons. Journal of the Experimental Analysis of Behavior, 34, 77–86.
  5. Cohen, L., Neuringer, A., & Rhodes, D. (1990). Effects of ethanol on reinforced variations and repetitions by rats under a multiple schedule. Journal of the Experimental Analysis of Behavior, 54, 1–12.
  6. Donahoe, J. W., & Palmer, D. C. (1994). Learning and complex behavior. Boston: Allyn and Bacon.
  7. Doughty, A. H., & Lattal, K. A. (2001). Resistance to change of operant variation and repetition. Journal of the Experimental Analysis of Behavior, 76, 195–215.
  8. Dunn, R., Williams, B., & Royalty, P. (1987). Devaluation of stimuli contingent on choice: Evidence for conditioned reinforcement. Journal of the Experimental Analysis of Behavior, 48, 117–131.
  9. Eckerman, D. A., & Lanson, R. N. (1969). Variability of response location for pigeons responding under continuous reinforcement, intermittent reinforcement, and extinction. Journal of the Experimental Analysis of Behavior, 12, 73–80.
  10. Fantino, E. (1967). Preference for mixed- versus fixed-ratio schedules. Journal of the Experimental Analysis of Behavior, 10, 35–43.
  11. Fantino, E. (1969). Choice and rate of reinforcement. Journal of the Experimental Analysis of Behavior, 12, 723–730.
  12. Fantino, E. (1977). Conditioned reinforcement: Choice and information. In W. K. Honig & J. E. R. Staddon (Eds.), Handbook of operant behavior (pp. 313–339). Englewood Cliffs, NJ: Prentice-Hall.
  13. Fantino, E., Preston, R. A., & Dunn, R. (1993). Delay reduction: Current status. Journal of the Experimental Analysis of Behavior, 60, 159–169.
  14. Fleshler, M., & Hoffman, H. S. (1962). A progression for generating variable-interval schedules. Journal of the Experimental Analysis of Behavior, 5, 529–530.
  15. Grace, R. C., & Nevin, J. A. (1997). On the relation between preference and resistance to change. Journal of the Experimental Analysis of Behavior, 67, 43–65.
  16. Hunziker, M. H. L., Caramori, F. C., da Silva, A. P., & Barba, L. S. (1998). Efeitos da história de reforçamento sobre a variabilidade comportamental [Effects of reinforcement history on behavioral variability]. Psicologia: Teoria e Pesquisa, 14, 149–159.
  17. Logan, F. A. (1965). Decision making by rats: Uncertain outcome choice. Journal of Comparative and Physiological Psychology, 59, 246–251.
  18. Machado, A. (1997). Increasing the variability of response sequences in pigeons by adjusting the frequency of switching between two keys. Journal of the Experimental Analysis of Behavior, 68, 1–25.
  19. Mazur, J. E. (1986). Choice between single and multiple delayed reinforcers. Journal of the Experimental Analysis of Behavior, 46, 67–77.
  20. Mazur, J. E., & Romano, A. (1992). Choice with delayed and probabilistic reinforcers: Effects of variability, time between trials, and conditioned reinforcers. Journal of the Experimental Analysis of Behavior, 58, 513–525.
  21. McDiarmid, C. G., & Rilling, M. E. (1965). Reinforcement delay and reinforcement rate as determinants of schedule preference. Psychonomic Science, 2, 195–196.
  22. McElroy, E., & Neuringer, A. (1990). Effects of alcohol on reinforced repetitions and reinforced variation in rats. Psychopharmacology, 102, 49–55.
  23. Morgan, L., & Neuringer, A. (1990). Behavioral variability as a function of response topography and reinforcement contingency. Animal Learning & Behavior, 18, 257–263.
  24. Morris, C. J. (1989). The effects of lag value on the operant control of response variability under free-operant and discrete-response procedures. The Psychological Record, 39, 263–270.
  25. Neuringer, A. (1991). Operant variability and repetition as functions of interresponse time. Journal of Experimental Psychology: Animal Behavior Processes, 17, 3–12.
  26. Neuringer, A. (1992). Choosing to vary and repeat. Psychological Science, 3, 246–250.
  27. Neuringer, A. (1993). Reinforced variation and selection. Animal Learning & Behavior, 21, 83–91.
  28. Nevin, J. A. (1974). On the form of the relation between response rates in a multiple schedule. Journal of the Experimental Analysis of Behavior, 21, 237–248.
  29. Nevin, J. A., & Grace, R. C. (2000). Preference and resistance to change with constant-duration schedule components. Journal of the Experimental Analysis of Behavior, 74, 79–100.
  30. Notterman, J. M., & Mintz, P. E. (1965). Dynamics of response. New York: Wiley.
  31. Page, S., & Neuringer, A. (1985). Variability is an operant. Journal of Experimental Psychology: Animal Behavior Processes, 11, 429–452.
  32. Poniewaz, W. R. (1984). Effects on preference of reinforcement delay, number of reinforcers, and terminal-link duration. Journal of the Experimental Analysis of Behavior, 42, 255–266.
  33. Schuster, R. H. (1969). A functional analysis of conditioned reinforcement. In D. P. Hendry (Ed.), Conditioned reinforcement (pp. 192–234). Homewood, IL: Dorsey Press.
  34. Schwartz, B. (1980). Development of complex, stereotyped behavior in pigeons. Journal of the Experimental Analysis of Behavior, 33, 153–166.
  35. Schwartz, B. (1982a). Failure to produce response variability with reinforcement. Journal of the Experimental Analysis of Behavior, 37, 171–181.
  36. Schwartz, B. (1982b). Reinforcement-induced behavioral stereotypy: How not to teach people to discover rules. Journal of Experimental Psychology: General, 111, 23–59.
  37. Shull, R. L., Spear, D. J., & Bryson, A. E. (1981). Delay or rate of food delivery as a determiner of response rate. Journal of the Experimental Analysis of Behavior, 35, 129–143.
  38. Snyderman, M. (1983). Delay and amount of reward in a concurrent chain. Journal of the Experimental Analysis of Behavior, 39, 437–447.
  39. Stubbs, D. A., & Pliskoff, S. S. (1969). Concurrent responding with fixed relative rate of reinforcement. Journal of the Experimental Analysis of Behavior, 12, 887–895.
  40. Tatham, T. A., Wanchisen, B. A., & Hineline, P. N. (1993). Effects of fixed and variable ratios on human behavioral variability. Journal of the Experimental Analysis of Behavior, 59, 349–359.
  41. Zeiler, M. D. (1977). Schedules of reinforcement: The controlling variables. In W. K. Honig & J. E. R. Staddon (Eds.), Handbook of operant behavior (pp. 201–232). Englewood Cliffs, NJ: Prentice-Hall.
