Earning And Obtaining Reinforcers Under Concurrent Interval Scheduling

James S MacDonall

doi:10.1901/jeab.2005.76-04

. 2005 Sep;84(2):167–183. doi: 10.1901/jeab.2005.76-04

Earning And Obtaining Reinforcers Under Concurrent Interval Scheduling

James S MacDonall ^1,^✉

PMCID: PMC1243978 PMID: 16262185

Abstract

Contingencies of reinforcement specify how reinforcers are earned and how they are obtained. Ratio contingencies specify the number of responses that earn a reinforcer, and the response satisfying the ratio requirement obtains the earned reinforcer. Simple interval schedules specify that a certain time earns a reinforcer, which is obtained by the first response after the interval. The earning of reinforcers has been overlooked, perhaps because simple schedules confound the rates of earning reinforcers with the rates of obtaining reinforcers. In concurrent variable-interval schedules, however, spending time at one alternative earns reinforcers not only at that alternative, but at the other alternative as well. Reinforcers earned for delivery at the other alternative are obtained after changing over. Thus the rates of earning reinforcers are not confounded with the rate of obtaining reinforcers, but the rates of earning reinforcers are the same at both alternatives, which masks their possibly differing effects on preference. Two experiments examined the separate effects of earning reinforcers and of obtaining reinforcers on preference by using concurrent interval schedules composed of two pairs of stay and switch schedules (MacDonall, 2000). In both experiments, the generalized matching law, which is based on rates of obtaining reinforcers, described responding only when rates of earning reinforcers were the same at each alternative. An equation that included both the ratio of the rates of obtaining reinforcers and the ratio of the rates of earning reinforcers described the results from all conditions from each experiment.

Keywords: preference, concurrent schedule, earning reinforcers, optimal foraging theory, generalized matching law, lever press, rats

Empirical and theoretical investigations of preference using concurrent schedules continue more than 40 years after Herrnstein's (1961) first systematic investigation. He reported that exposing pigeons to concurrent variable-interval (VI) schedules resulted in the proportion of responses at one alternative approximately equaling the proportion of reinforcers obtained at that alternative. Subsequent investigations extended this result to proportions of time at one alternative equaling the proportion of reinforcers obtained at that alternative (e.g., Baum & Rachlin, 1969). Because of systematic deviations from this relation, Baum (1974) introduced the generalized matching law:

graphic file with name jeab-84-02-03-e01.jpg

where B₁ and B₂ are behaviors (i.e., responses or time) allocated to, and R₁ and R₂ are the rates of reinforcers obtained at, Alternatives 1 and 2, respectively. There are two fitted parameters: s, called sensitivity to reinforcement, which measures the degree to which log response ratios change with changes in reinforcer ratios; and b, which is interpreted as response bias unrelated to reinforcer allocation (Trevett, Davison, & Williams, 1972; see Baum, 1974 for a review).

A basic characteristic of concurrent VI VI schedules is that, as subjects earn reinforcers by spending time at one alternative, they also earn reinforcers at a second alternative because the VI timer for the second alternative continues operating (Houston & McNamara, 1981; MacDonall, 1998, 1999). A reinforcer arranged at the second alternative while the subject is at the first alternative is obtained following the first response at the second alternative, possibly after a changeover delay (COD). Thus there are two modes of obtaining reinforcers on concurrent VI VI schedules: stay reinforcers that are earned and obtained while working at the same alternative, and switch reinforcers that are earned by working at one alternative but obtained following a switch to the other alternative.

The distinction between earning and obtaining reinforcers is based on differences in contingencies (Rachlin, Green, & Tormey, 1988). Some contingencies specify how to earn reinforcers. For example, responding on a ratio contingency earns reinforcers. Once earned, however, reinforcers are held until a contingency for obtaining the earned reinforcers is satisfied. Thus on standard fixed- or variable-ratio schedules, the same operant response earns and obtains reinforcers (Ferster & Skinner, 1957). The contingency, however, can involve a different operant response as in counting (Mechner, 1958) in which different responses earn and obtain reinforcers. The contingency for earning reinforcers on simple interval schedules is spending time on the schedule, but because the schedule operates throughout the session, reinforcers do not appear to be earned. Once it has been earned, the next response obtains the reinforcer. On concurrent interval schedules, the differing contingencies for earning and obtaining reinforcers are clearer: Spending time at an alternative earns reinforcers for staying at that alternative and earns reinforcers for switching to the other alternative. The contingency for obtaining reinforcers earned for staying at an alternative is a response at the associated alternative. The contingency for obtaining reinforcers earned for switching to the other alternative is a response at the other alternative.

Simple schedules of reinforcement (fixed ratio, fixed interval, variable ratio, and variable interval) confound the rates of earning reinforcers and the rates of obtaining reinforcers. The rate of obtaining reinforcers is the number of delivered reinforcers divided by the time at the schedule, which is the duration of the session. The rate of earning reinforcers is the number of reinforcers divided by the time earning those reinforcers, which is also the duration of the session. Thus in simple schedules of reinforcement, the rate of earning reinforcers always equals the rate of obtaining reinforcers. Earning and obtained reinforcers can be unconfounded in complex schedules, such as concurrent VI VI schedules. This is because switch reinforcers are earned by spending time at one alternative but obtained by responding at the other alternative. The contingency for earning stay reinforcers makes immediate contact with the stay response because the response that earns the reinforcer also delivers the reinforcer. The contingency for earning switch reinforcers makes contact, sometimes delayed, with the switch response because the switch reinforcer is delivered when the animal commences responding on the other alternative (Dreyfus, Dorman, Fetterman, & Stubbs, 1982). When a COD is not used, there is immediate contact between earning and delivering the switch reinforcer, but when a COD is used the contact is delayed, which may alter the effect of earning reinforcers.

The generalized matching law is a special case of the concatenated generalized matching law (Baum & Rachlin, 1969; Davison & McCarthy, 1988, ch. 4) that was developed to include all obtained reinforcer parameters that affect behavior (i.e., rates, magnitudes, immediacies [reciprocal of delay], and qualities of reinforcers). The concatenated generalized matching law states that preference is a function of the ratios of the rates, magnitudes, immediacies, and qualities of reinforcement obtained at the alternatives. Note that the concatenated generalized matching law allows a different sensitivity value for each of these parameters of reinforcement. When the magnitudes, immediacies, and qualities of reinforcement are the same at each alternative, then the respective ratios equal 1.0 and drop from the equation. This leaves the ratio of the rates of obtaining reinforcers, that is, Equation 1.

If the ratio of the rates of earning reinforcers influences preference, then that ratio may be able to be included in the concatenated generalized matching law. An appropriate form of Equation 1 would be:

graphic file with name jeab-84-02-03-e02.jpg

where, E_n is the rate of earning reinforcers at alternative n, and h is the sensitivity to the ratio of the rates of earning reinforcers.

The separate effects of the rates of earning and of obtaining reinforcers are seldom investigated. Earning reinforcers, however, influenced preference in a simplified analog of a concurrent VI variable-ratio (VR) schedule (Rachlin et al., 1988). Only time at Alternative 1 earned reinforcers, and these were obtained only by responses at Alternative 2. Time at Alternative 2 never earned reinforcers and responses at Alternative 1 never obtained reinforcers. Although responding at Alternative 1 obtained no reinforcers, from 10% to 40% of the time was spent at Alternative 1. This suggests that rates of earning reinforcers affect preference. The purpose of the following experiments was to examine further the effects of rates of earning reinforcers on preference.

EXPERIMENT 1

An operant and the associated contingencies correspond to each of the stay and switch reinforcers at an alternative. Thus there are four operants; two stay operants for staying at each of the alternatives, and two switch operants for switching to each of the alternatives. Each stay operant is any response at an alternative that is reinforced according to the associated stay schedule, and each switch operant is any response that is reinforced according to the associated switch schedule. Thus a concurrent schedule can be implemented with four separate VI timers, one for each operant, that operate in pairs as the stay and switch schedules when the animal is at each alternative (Houston & McNamara, 1981; MacDonall, 2000). Each pair consists of a stay schedule that arranges reinforcers for staying and responding at one alternative and a switch schedule that arranges reinforcers for switching to the other alternative. For example, in a concurrent VI 36 s VI 320 s schedule, the stay schedule at Alternative 1 is VI 36 s and the switch schedule is VI 320 s, both of which operate only while the subject is at Alternative 1. The VI 36 s arranges reinforcers for staying and responding at Alternative 1, whereas the VI 320 s arranges reinforcers for switching to Alternative 2. At Alternative 2, the stay schedule is VI 320 s and the switch schedule is VI 36 s, both of which operate only while the subject is at Alternative 2. The VI 320 s arranges reinforcers for staying and responding at Alternative 2, whereas the VI 36 s arranges reinforcers for switching to Alternative 1. Changing between alternatives exchanges the pair of schedules operating. When using a COD, reinforcers arranged for staying and for switching would follow the first or subsequent response after the COD had elapsed.

The previous example demonstrates that the pairs of schedules are symmetrical in the standard concurrent procedure, that is, the value of the stay schedule that operates at each alternative equals the value of the switch schedule that operates at the other alternative. When using four separate schedules, the values of the schedules may be arranged differently. For example, swapping the values of the switch schedules between alternatives produces an asymmetrical arrangement: The value of the stay schedule at each alternative equals the value of the switch schedule at the same alternative. Swapping the values of the switch schedules in the previous example produces an asymmetrical arrangement with the following pairs of schedules: A VI 36 s for staying at the first alternative and VI 36 s for switching to the second alternative, and a VI 320 s for staying at the second alternative and VI 320 s for switching back to the first alternative.

The symmetrical arrangement produces the same rates of earning reinforcers at the alternatives. The rate of earning reinforcers at an alternative is the sum of the rates of earning stay and earning switch reinforcers. The rate of earning stay reinforcers is the number of stay reinforcers delivered at an alternative divided by the time spent at that alternative, which earned those reinforcers. The rate of earning switch reinforcers is the number of reinforcers delivered for switching to an alternative divided by the time spent at the other alternative, which earned those switch reinforcers. Thus the rate of earning reinforcers at Alternative 1 is the sum of the reinforcers earned for staying at Alternative 1 plus the reinforcers earned for switching to Alternative 2 divided by the time at Alternative 1.

The asymmetrical arrangement, however, produces different rates of earning reinforcers at the alternatives. For example, at the alternative with VI 36 s for staying and VI 36 s for switching, 18 reinforcers, on average, are earned every 320 s: Nine reinforcers are earned from the VI 36 s for staying plus nine reinforcers are earned from the VI 36 s for switching. At the alternative with VI 320 s for staying and VI 320 s for switching, two reinforcers, on average, are earned every 320 s: One reinforcer is earned from the VI 320 s for staying and one reinforcer is earned from the VI 320 s for switching.

If the allocation of behavior between alternatives is sensitive to rates of earning reinforcers, then the difference in the rates of earning reinforcers under the asymmetrical arrangement will affect preference: There will be a preference for the alternative associated with a higher rate of earning reinforcers. This preference occurs even though, in the asymmetrical arrangement, the rate of obtaining reinforcers at each alternative is the same. At the alternative with, say, VI 36 s for staying and VI 36 s for switching, nine reinforcers, on average, will be obtained from the VI 36 s for staying at Alternative 1 and nine reinforcers will be obtained at Alternative 2 from the VI 36 s for switching. At Alternative 2, one reinforcer will be obtained from the VI 320 s for staying at that alternative and one reinforcer will be obtained at Alternative 1 from the VI 320 s for switching. Thus, on average, a total of 10 reinforcers will be obtained every 320 s at each alternative.

The analysis of concurrent schedules into pairs of stay and switch schedules applies to the Findley (1958) procedure as well as to the usual two-manipulandum choice procedure. In the Findley procedure, responses at one (changeover) manipulandum switch alternatives, which are signaled by different stimuli (lights, tones, etc.), whereas responses at a second (main) manipulandum earn and obtain the stay and switch reinforcers. In the present experiment, preference was examined on concurrent VI VI schedules using a Findley procedure. In half of the conditions, symmetrical schedules were arranged and in half of the conditions asymmetrical schedules were arranged. In this way, the effects of rates of earning reinforcers at each alternative were investigated. A COD was not used because this would modify the contingency for obtaining switch reinforcers. The generalized matching law should describe the data from the symmetrical conditions, which would verify that the four-schedule procedure used here produces data consistent with the generalized matching law. If the rates of earning reinforcers do not affect preference, the generalized matching law also should describe data from the asymmetrical arrangement. If the generalized matching law describes data from the symmetrical conditions but not those from the asymmetrical conditions, then rates of earning reinforcers need to be included in the list of choice-affecting variables in the concatenated generalized matching law.

Method

Subjects

The subjects were 6 naive female Sprague-Dawley rats obtained from Hilltop Lab Animals (Scottdale, PA) and maintained at 85% of their free-feeding weights. They were approximately 100 days old when the experiment began and were housed individually in a temperature-controlled colony room on a 14:10 hr light/dark cycle with free access to water in their home cages.

Apparatus

Six operant conditioning chambers were used. Four chambers were approximately 225 mm wide and 195 mm high; three of the chambers were 235 mm in length, whereas the fourth was 350 mm in length. Each chamber was located in a light- and sound-controlled box. The 50 mm square opening for the food cup was centered horizontally on one 225-mm by 195-mm wall, 20 mm above the floor. Two response levers (Model G6312, R. Gerbrands Co.), 45 mm long by 13 mm thick, protruded 15 mm into the chamber. The centers of the levers were 60 mm to the left or right of the center of the food cup and 50 mm above the floor. Each lever required a force of approximately 0.3 N to operate. A Gerbrands feeder (Model G5120), located behind the wall containing the food cup, dispensed 45-mg food pellets (Formula A/1, P. J. Noyes Co.), which were 85% Purina® Rodent Chow. The other two chambers were 305 mm wide, 270 mm high, and 250 mm long. On one 305-mm by 270-mm wall were three response levers (Model G6312) 95 mm above the floor. One lever was centered on the wall and the other two levers were 90 mm on either side of the center of the center lever. Only the two outside levers were used in this experiment. Centered horizontally on the opposite wall, 35 mm above the floor, was a 50-mm square opening to the food cup. The food cup was located on the opposite wall so that it was approximately equidistant to each of the three levers, which was important in experiments examining three-alternative choice. A Gerbrands feeder, located behind the food cup, dispensed 45-mg food pellets (Formula A/1). A 24-V DC stimulus light was centered approximately 75 mm above each lever. All chambers were illuminated during sessions by a pair of houselights mounted on the top center of the chamber. White noise was presented through a speaker centered between the houselights. An IBM®-compatible computer and MED-PC® software and hardware (MED Associates Inc.) controlled the experimental events and recorded responses.

Procedure

All conditions used a changeover-lever procedure (Findley, 1958) to expose rats to concurrent VI VI schedules. The alternative in effect at the beginning of a session, signaled by either a light or white noise, was randomly determined. One pair of stay and switch schedules arranged stay and switch reinforcers during light, and a different pair arranged reinforcers during noise (Table 1). Pressing the changeover (right) lever switched stimuli and the associated pairs of schedules in effect. During either light or noise, when a stay schedule arranged a reinforcer that schedule stopped until a press of the main (left) lever delivered that reinforcer. If the rat switched alternatives before the reinforcer was delivered, that reinforcer was held until the rat switched back to that alternative, and the first press at the main lever then delivered that reinforcer. Also during either light or noise, when a switch schedule arranged a reinforcer that schedule stopped. It resumed when the rat returned to that alternative, as in standard concurrent VI VI schedules. Switch reinforcers, when arranged, were delivered by the first response at the main lever after pressing the changeover lever. Thus, when a switch reinforcer was arranged, the rat was required to press the changeover lever to change alternatives, and then to press the main lever to deliver the switch reinforcer. There was no COD. If both a stay and a switch reinforcer were arranged, the first response delivered the switch reinforcer and the next response delivered the stay reinforcer. For this to occur, the stay reinforcer must be arranged after the last press at the main lever during the brief time traveling to the changeover lever. Then during the visit at the other alternative, a reinforcer for switching back to the first alternative must also be arranged. It was rare for both reinforcers to be arranged at the same time.

Table 1. For each rat in Experiment 1, the sequence of conditions, the number of sessions in each condition, and the values of the stay and switch schedules of reinforcement at each alternative for the symmetrical (S) and asymmetrical (A) conditions are shown. Also shown are the sums over the last five sessions in each condition of responses at each alternative, total time spent at each alternative, the total number of stay and switch reinforcers obtained at each alternative, and the total number of changeovers to the other alternative.

Arrangement	Sessions	Variable-interval schedule (in s) for				Responses in		Time (s) in		Reinforcers from				Changeover to
Arrangement	Sessions	Stay in light	Switch to noise	Stay in noise	Switch to light	Light	Noise	Light	Noise	Stay in light	Switch to noise	Stay in noise	Switch to light	Noise	Light
Rat 407
A	42	320	320	36	36	1,425	6,187	3,788	10,059	18	11	264	207	627	625
A	27	43	43	128	128	4,434	2,986	9,209	7,110	192	186	61	61	1,314	1,312
S	11	64	64	64	64	3,442	3,558	8,672	8,550	124	130	117	129	1,416	1,413
A	23	128	128	43	43	3,162	4,708	7,830	8,991	68	56	191	185	1,072	1,069
A	15	36	36	320	320	4,329	2,991	9,107	6,755	250	203	29	18	1,492	1,489
S	18	36	320	320	36	4,993	2,064	13,220	4,142	334	48	18	100	1,104	1,103
S	35	43	128	128	43	4,680	3,084	11,760	5,788	261	77	35	127	1,060	1,060
S	11	64	64	64	64	3,356	3,616	8,894	7,988	137	125	134	104	1,626	1,623
S	22	128	43	43	128	2,958	3,771	7,360	9,777	56	167	204	73	1,608	1,604
S	21	320	36	36	320	2,466	3,697	6,127	10,919	22	148	290	40	1,584	1,584
Rat 408
A	22	43	43	128	128	4,573	2,695	9,530	6,221	213	208	34	45	1,205	1,208
A	20	320	320	36	36	1,782	6,117	3,717	9,733	16	6	289	189	689	690
A	32	36	36	320	320	6,565	1,947	9,594	4,054	268	208	10	14	861	864
S	22	64	64	64	64	5,472	3,278	10,792	6,939	164	138	102	96	1,033	1,036
A	24	128	128	43	43	3,389	5,900	7,044	9,255	52	44	221	183	1,350	1,354
S	25	43	128	128	43	4,273	3,135	10,432	6,807	228	68	56	148	1,196	1,201
S	26	320	36	36	320	3,563	8,195	5,434	11,523	20	126	321	33	1,240	1,240
S	26	36	320	320	36	9,294	3,055	12,810	4,372	331	30	24	115	700	705
S	19	128	43	43	128	3,868	7,521	6,770	10,305	42	150	230	78	1,243	1,245
S	19	64	64	64	64	4,666	6,115	8,114	9,234	128	86	155	131	1,073	1,076
Rat 409
S	32	320	36	36	320	1,806	8,201	2,597	14,171	5	53	390	52	999	995
S	19	43	128	128	43	5,325	3,823	10,993	6,225	252	71	59	118	1,383	1,386
S	10	64	64	64	64	4,914	4,469	9,068	7,807	149	129	114	108	1,591	1,589
A	26	128	128	43	43	3,401	4,012	8,938	8,475	58	73	201	168	1,507	1,504
A	41	36	36	320	320	5,088	3,554	8,868	7,090	236	219	17	28	1,426	1,426
S	18	64	64	64	64	4,542	3,534	9,216	7,812	141	131	102	126	1,368	1,369
A	23	43	43	128	128	4,984	3,424	9,419	6,499	208	196	54	42	1,324	1,322
A	33	320	320	36	36	3,901	4,470	7,865	9,315	14	34	260	192	1,064	1,062
S	25	128	43	43	128	2,823	5,368	5,885	11,640	46	122	253	79	1,260	1,258
Rat 410
S	21	43	128	128	43	4,121	1,947	12,051	4,934	261	96	33	110	1,301	1,300
S	29	320	36	36	320	2,719	5,843	4,271	12,667	22	104	337	37	1,184	1,180
S	19	36	320	320	36	5,257	2,508	12,320	5,001	314	45	13	128	1,158	1,158
S	13	64	64	64	64	3,074	2,542	9,240	8,049	135	142	107	116	1,126	1,123
A	15	36	36	320	320	2,227	1,114	10,427	4,056	270	215	9	6	615	611
A	41	128	128	43	43	1,554	2,239	5,101	10,927	45	31	243	181	591	590
A	28	43	43	128	128	2,081	1,610	9,908	7,221	200	195	48	57	608	606
S	20	64	64	64	64	2,078	2,115	8,554	9,477	114	110	158	118	574	572
A	29	320	320	36	36	1,357	1,754	5,051	10,498	7	20	247	226	641	637
S	19	128	43	43	128	1,313	2,311	4,874	13,621	33	103	274	90	567	566
Rat 411
A	34	36	36	320	320	3,690	2,093	9,688	4,680	243	223	20	14	1,140	1,140
A	45	128	128	43	43	2,249	2,856	6,642	10,001	47	38	227	188	1,109	1,111
A	14	320	320	36	36	1,864	3,334	5,459	9,834	13	6	256	225	973	975
S	17	64	64	64	64	2,799	4,235	6,123	11,185	94	70	182	154	1,551	1,555
A	27	43	43	128	128	2,369	2,394	9,380	7,955	208	173	62	57	1,427	1,431
S	28	128	43	43	128	3,202	3,363	9,465	7,810	69	199	162	70	1,661	1,659
S	16	320	36	36	320	2,739	3,437	8,658	9,368	28	183	259	30	1,654	1,654
S	10	64	64	64	64	3,449	3,551	8,875	8,291	127	131	136	106	1,609	1,612
S	24	36	320	320	36	4,278	2,426	13,708	3,791	359	31	16	94	1,516	1,520
Rat 412
S	32	36	320	320	36	4,008	1,502	13,174	4,534	320	51	13	116	1,004	1,005
S	24	128	43	43	128	2,216	3,142	6,221	11,074	67	128	229	76	1,305	1,301
S	15	320	36	36	320	2,053	4,102	3,871	13,843	19	98	346	37	1,363	1,358
S	15	64	64	64	64	3,184	3,179	8,579	8,578	145	125	109	121	1,504	1,503
A	26	320	320	36	36	2,964	3,481	7,551	8,983	33	29	219	219	1,580	1,577
S	19	64	64	64	64	2,563	3,485	6,807	10,530	106	91	165	138	1,371	1,371
A	44	36	36	128	128	3,088	2,161	8,894	5,994	221	185	53	41	853	852
S	15	43	128	128	43	4,789	2,601	12,984	4,278	284	91	33	92	1,681	1,680
A	27	43	43	128	128	4,178	2,219	10,426	4,778	225	179	51	45	1,126	1,123

Open in a new tab

Conditions differed according to the arrangement of stay and switch schedules at the alternatives. During the symmetrical conditions, the value of the stay schedule that operated at each alternative equaled the value of the switch schedule that operated at the other alternative. The values of the schedules were selected to provide a wide range of log ratios of rates of obtaining reinforcers. During the asymmetrical conditions, the value of the stay schedule that operated at each alternative equaled the value of the switch schedule that operated at the same alternative. The operation of the schedules and arranging of reinforcers was identical in the symmetrical and asymmetrical arrangements.

Each VI schedule consisted of 10 intervals obtained by the method described by Fleshler and Hoffman (1962) that gives an approximately exponential distribution of intervals, and the intervals were randomly selected without replacement. For the symmetrical arrangement, the stay and switch schedules in each condition were selected to maintain an approximately constant overall rate of reinforcement of one per 32 s. This was accomplished by selecting values of the stay and switch schedules whose reciprocals summed to the reciprocal of 32 (see Herrnstein, 1961). The values of the stay and switch schedules for the asymmetrical arrangement were obtained by exchanging the values of the switch schedules in the symmetrical arrangement.

Because the rats were naive, they were first trained to approach the food cup at the sound of the feeder operating. Then their behavior was shaped to press the main lever. Pressing the changeover lever emerged when responses at the main lever were reinforced intermittently. The rats then were exposed to the first condition in Table 1, which shows, for each rat, the sequence of conditions, the values of the schedules in each condition, and the number of sessions that each condition was in effect.

To identify possible order effects, 3 rats first were exposed to the symmetrical conditions and the other 3 to the asymmetrical conditions. A condition remained in effect for at least 10 sessions and until visual inspection showed there were no apparent upward or downward trends in the logs of the ratios of the rates of obtaining reinforcers, of responses, and of times for five consecutive sessions. Sessions were typically conducted 7 days a week and ended after the first changeover response following the 100th reinforcer.

Results and Discussion

All the results reported here are based on the sums of the data from the last five sessions of each condition. Table 1 presents these sums of presses at the main lever in the light and noise alternatives, total time (in seconds) spent at the light and noise alternatives, reinforcers obtained for staying at and switching to the light and noise alternatives, and total changeovers to the noise and to the light alternatives. For completeness, Table 1 includes the time allocation data although only the results of analyses of response allocations are shown.

The generalized matching law described the data from the symmetrical conditions. The rate of obtaining reinforcers at an alternative is the sum of the reinforcers obtained for staying plus the reinforcers obtained for switching to that alternative divided by the session time. When rates of obtaining stay and switch reinforcers are explicitly noted, the generalized matching law may be expressed in logarithmic form as:

where Rt₁ and Rt₂ represent the number of stay reinforcers obtained at Alternatives 1 and 2, respectively. Rw₁ and Rw₂ represent the number of switch reinforcers obtained at Alternatives 1 and 2, respectively, which were earned at Alternatives 2 (light) and 1 (noise), respectively. T is the session time, which of course divides out of Equation 3. It is important to note that in all equations the subscripts refer to the alternative in which reinforcers were obtained. The other symbols are the same as in Equation 1. This equation describes a straight line, and can be fitted to the data using least-squares linear regression. Ratios of the rates of obtaining reinforcer were the sum of the stay reinforcers obtained (and earned) by a response in light plus the switch reinforcers obtained by a response in light that were earned in noise divided by session time. This quotient was divided by the sum of the stay reinforcers obtained (and earned) by a response in noise plus the switch reinforcers obtained by a response in noise that were earned in light divided by session time. This analysis is the standard application of the generalized matching law to concurrent choice with the stay and switch reinforcers explicitly noted.

Figure 1 shows that for each condition, the logs of the response ratios increased with increases in the logs of the ratios of the rates of obtaining reinforcers. For the symmetrical conditions, the generalized matching law (Equation 3) described the logs of the response ratios (r² > .93). Undermatching, that is, behavior ratios changing less than the reinforcer ratios, was consistently found, probably because a COD was not used. Log b did not consistently differ from zero so there was no consistent bias. For the asymmetrical conditions, the generalized matching law did not describe the results; inspection of Figure 1 shows that the data deviated systematically from the path of the data described by the symmetrical conditions. The data are aligned almost vertically and appear to be described by a sensitivity (s in Equation 3) considerably greater than 1 (i.e., overmatching).

Fig. 1 — Also shown is the best-fitting line to data from the symmetrical conditions, using Equation 3 and least-squares linear regression, and the resulting equation for that line; the standard errors of sensitivity and bias and percentage of the variance accounted for by the equation are below it.

Figure 2 plots log response ratios as a joint function of the ratios of the rates of earning reinforcers and obtaining reinforcers and shows that log response ratios increased as a joint function of these two variables. This figure also shows that the symmetrical conditions produced variations in the ratios of obtaining reinforcers but little change in the ratios of the rates of earning reinforcers; in contrast, the asymmetrical conditions produced different ratios of earning reinforcers but little change in the ratios of the rates of obtaining reinforcers.

Fig. 2 — The tilting plane shows the best-fitting plane using Equation 4 and the data from the symmetrical conditions combined with the data from the asymmetrical conditions. The vertical lines show the difference between the obtained and predicted data from each condition. When the residuals are not visible they are smaller than the data point.

Explicitly noting the stay and switch reinforcers in Equation 2, and expressing the resulting equation in logarithmic form produces:

graphic file with name jeab-84-02-03-e04.jpg

where T_n represents the time spent at Alternative n and parameters s′ and b′ correspond to parameters s and b in Equation 1; the primes are used to distinguish them. The other symbols are the same as in the previous equations. Equation 4 plots as a flat plane and may be fitted to the data using least-squares multiple linear regression. The results of regressions using Equation 4 are shown in Table 2. The descriptions of the response ratios were good (r² > .87) for Rats 408, 409, 410, and 412 but poorer for Rats 407 and 411 (.79 > r² > .75).

Table 2. Results of least-squares multiple linear regressions for response allocations using Equation 4 for data from the symmetrical conditions combined with data from the asymmetrical conditions in Experiment 1.

Rat	s′	SE	h	SE	log b′	SE	r²	df
407	0.32	0.11	0.44	0.11	−0.03	−0.05	.79	8
408	0.45	0.07	0.49	0.07	0.02	0.03	.93	8
409	0.64	0.10	0.08	0.07	0.02	0.04	.90	7
410	0.43	0.05	0.19	0.05	0.03	0.02	.93	8
411	0.16	0.07	0.21	0.06	0.00	0.03	.75	7
412	0.44	0.07	0.21	0.11	0.04	0.03	.87	7

Open in a new tab

The parameter estimates in Table 2 show that the parameters s′ and h are both necessary to describe the data. Parameter values for s′ and h were more than two standard errors greater than zero in all six comparisons and in five of the six comparisons, respectively. This indicates that s′ and h were not equal to zero. The two parameters differed by more than two standard errors for Rats 409 and 410, which indicates that both of these parameters need to be included in Equation 4, so it is unlikely that s′ and h could be taken as equal.

EXPERIMENT 2

Experiment 2 replicated Experiment 1 using a different method of varying the ratio of the rates of earning reinforcers, namely, a two-manipulandum concurrent procedure instead of a Findley procedure, and random-interval (RI) rather than VI schedules. In the weighted conditions in Experiment 2, the rate of earning reinforcers was always greater at one alternative. This was accomplished by using a pair of stay and switch schedules at one alternative; the values of the schedules in the corresponding symmetrical pair were multiplied by a constant, and this pair of schedules was assigned to the other alternative. Consider the schedules in the first condition for Rat 476. The values of the pair of schedules at the left alternative were VI 16.7 s for staying and VI 50 s for switching to the right alternative. The symmetrical pair would be VI 50 s for staying at the right alternative and VI 16.7 s for switching to the left alternative. The values of this pair of schedules were multiplied by 5 producing VI 250 s for staying at the right alternative and VI 83.2 s for switching to the left alternative. The values of the pair of schedules at the left alternative were not changed. Across weighted conditions, the multiplier was constant for each rat, but the values in the initial pair of schedules varied. As in the previous experiment, several conditions used the symmetrical arrangement and several conditions used the weighted arrangement.