Abstract
In a seminal study, R. N. Shepard, C. I. Hovland, and H. M. Jenkins (1961) assessed potential mechanisms involved in categorization learning. To do so, they (SHJ) sequentially trained human participants with six different visual categorization tasks that varied in structural complexity. Humans’ exceptionally strong performance on one of these tasks (Type 2, organized around XOR relations) could not be solely explained by structural complexity, and has since been considered the hallmark of rule-use in these tasks. In the present project, we concurrently trained pigeons on all six SHJ tasks. Our results revealed that the structural complexity of the tasks was highly correlated with group-level performance. Nevertheless, we observed notable individual differences in performance. Two extensions of a prominent categorization model, ALCOVE (Kruschke, 1992), suggested that disparities in the discriminability of the dimensions used to construct the experimental stimuli could account for these differences. Overall, our pigeons’ generally weak performance on the Type 2 task provides no evidence of rule-use on the SHJ tasks. Pigeons thus join monkeys in the contingent of species that solve these categorization tasks solely on the basis of the physical properties of the training stimuli.
Keywords: pigeon, categorization, rule use, ALCOVE, selective attention
Categorization is an important cognitive function. With it, we sort the objects around us and recognize them as members of different classes (Murphy, 2002). For example, we treat high heels, sneakers, and flip flops as members of the same category (shoes), because they serve the same basic function, even though they have distinctly different physical properties.
In a seminal study, Shepard, Hovland and Jenkins (1961; henceforth SHJ) assessed the psychological processes involved in human categorization. Specifically, SHJ evaluated whether participants’ performance in a series of classification problems was based on simple associative principles (stimulus conditioning and generalization) and/or logical rules. In order to do so, SHJ trained participants to classify multidimensional stimuli that varied along three binary-valued, visual dimensions: shape, brightness, and size. When each of the 8 stimuli resulting from the combination of these features (Figure 1A) is assigned to one of two categories, six distinct types of task emerge (Figure 1C).
Figure 1.
Stimuli and task types. A. Shepard, Hovland, and Jenkins (1961) originally used stimuli which were composed from three binary-valued dimensions: brightness, size, and shape. B. The stimuli used in the present experiment were composed from three binary-valued dimensions: brightness, size, and line orientation. C. The six types of tasks used by Shepard, Hovland, and Jenkins (1961); see main text for details regarding the categorical structures of each type.
In the Type 1 task, category membership is determined solely by the values along a single dimension. For example, in a color-relevant Type 1 task, all black stimuli belong to one category, whereas all white stimuli belong to the other. Shape and size are irrelevant dimensions, as they do not convey any information regarding category membership. In the Type 2 task, category membership is determined by a two-dimensional, exclusive-OR rule. For example, in a color/shape-relevant Type 2 task, all black triangles and white squares belong to one category, whereas all white triangles and black squares belong to the other. In this case, size is irrelevant for categorization. In task Types 3, 4, and 5, category membership is determined by rules and exceptions. For example, in a color-relevant Type 3 task, all black stimuli belong to one category except for one (the large black triangle in Figure 1A). Importantly, task Types 3, 4, and 5 differ mainly in the number of rules that can be used to determine category membership. The Type 4 task supports rules for all three dimensions, the Type 3 task supports rules for two dimensions, and, the Type 5 task supports rules for a single dimension. Of additional importance, Type 4 is a linearly separable task solvable by family resemblance. Lastly, in the Type 6 task, category membership cannot be determined by any simple rule. Therefore, it is often said that subjects learning this task must resort to rote memorization of each individual stimulus in order to solve this task. These six types of tasks also differ in structural complexity. When quantified with several different metrics (e.g., Homa, Rhoads, & Chambliss, 1979; Alfonso-Reese, Ashby, & Brainard, 2002; Kloos & Sloutsky, 2008), Type 1 is the least complex task, followed by task Types 4, 3, and 5. Finally, task Types 2 and 6 are—in that order—the two most complex, due to their nonlinear structures.
In order to test whether humans were classifying the training stimuli based on stimulus generalization alone, SHJ compared the number of errors participants made while learning each of the tasks with those predicted by stimulus generalization. Under the assumption that classification is a product of between- and within-category discriminability, SHJ predicted the expected number of errors using different generalization gradients. Particularly, if subjects’ generalization gradients are monotonic, decreasing, and convex (Shepard, 1987), the expected number of errors under stimulus generalization alone is directly related to the structural complexity of each task: Type 1 < Type 4 ≈ Type 3 < Type 5 ≈ Type 2 < Type 6.
Contrary to what was predicted by their stimulus generalization framework, SHJ observed that errors were ordered: Type 1 < Type 2 < Type 3 ≈ Type 4 ≈ Type 5 < Type 6. Notably, errors on the Type 2 task were lower than errors on the Type 4 task (a finding since known as the Type 2 advantage). The fact that performance on the Type 2 task did not fall in line with that predicted by stimulus generalization alone led SHJ to propose that humans learn the Type 2 task with the help of rules. That is, humans not only use stimulus generalization during categorization, but they also deploy rules to encode the regularities of some category structures.
The SHJ study elucidated the potential cognitive systems involved in human categorization learning, and has subsequently exerted a profound influence on the categorization literature. To this day, SHJ’s results have served as a benchmark to evaluate most models of human categorization (e.g., Kruschke, 1992; Nosofsky, et al. 1994; Love, Medin, & Gureckis, 2004; Kurtz, 2007). Importantly, these modeling efforts have demonstrated that the Type 2 advantage typically observed in humans can be reproduced by models that lack explicit rule-production systems, as long as they have dimensional representations of the stimuli and the ability to selectively attend to the relevant dimensions (see Nosofsky et al., 1994, for a review).
In one exceptional project, the SHJ tasks were given to nonhuman primates (Smith, Minda, & Washburn, 2004). In contrast to human participants, Smith et al. (2004) found that the percentage of errors that their four rhesus macaques (Macaca mulatta) committed on the Type 2 task nicely aligned with that predicted by stimulus generalization theory (reproduced in Figure 2B). This finding suggests that the Type 2 advantage commonly observed in the SHJ tasks might be uniquely human, and depend on the availability of explicit rule-production systems or the opportunity to deploy selective attention to exploit category structure (Kurtz et al. 2013; Nosofsky & Palmeri, 1996).
Figure 2.
Percentage of errors in each of the Shepard, Hovland, and Jenkins (1961) tasks, as a function of each task’s structural ratio. A. Pigeon performance in each task across blocks of the concurrent training phase. Each task is denoted by an annotated, dashed, vertical line. B. Comparison of human (Nosofsky, et al., 1994), monkey (Smith et al., 2004), and pigeon performance in the Shepard, et al. (1961) tasks. Just as was the case for monkeys, the percentage of errors commited by pigeons was linearly related to the structural ratio of each task. In contrast, human performance in the Type 2 task was not linearly correlated with the task’s structural ratio.
To expand on the species generality of these intriguing findings, we developed a pigeon model of the SHJ tasks. Several anatomical disparities exist between mammalian and avian brains, although recent evidence suggests that the telencephalon of both taxonomic groups supports similar kinds of high-level cognition (Maler, 2018). Thus, comparing the categorization performance of nonhuman primates and birds is both especially timely and important.
Our pigeon model differed from Smith et al.’s primate model in two significant procedural respects. First, we concurrently trained our three pigeons on all six tasks instead of doing so sequentially. Such concurrent training avoids sequence effects, which could seriously contaminate categorization performance depending on the particular order of problem presentation given to individual subjects. Furthermore, because all of the problems were trained concurrently, monitoring the acquisition of individual subject performance on each of the six problem types was possible. Finally, concurrent training of the different problem types avoids the difficulty in deciding how long to continue training on challenging (or possibly unsolvable) problems before switching to other problems. We could thus train all of our subjects on all of the problems for the same number of sessions.
Notwithstanding the considerable merits of concurrent training, this method is not without potential pitfalls. In particular, concurrent training might hinder a subject’s ability to learn one or more of the tasks because of interference between tasks. Therefore, in a final phase, we retrained all of our pigeons with four of the same problems they had been given concurrently, now one at a time, in a successive manner. In this way, we could determine if there was any crosstalk from one task to another as well as assess the replicability of the patterns of categorization performance that we had earlier observed.
Our design also differed from Smith et al.’s in that instead of using stimuli that varied in brightness, size, and shape (e.g., Smith et al., 2004), we used stimuli that varied in brightness, size, and line orientation (Figure 1B). Our research group has previously documented the discriminability and separability of these dimensions for pigeons (Teng, Vyazovska, & Wasserman, 2015; Vyazovska, Teng, & Wasserman, 2014; Vyazovska, Navarro, & Wasserman, 2016), thereby giving us confidence that they would be suitable for the present project.
How did we expect our pigeons would learn the six SHJ tasks? Prior studies have suggested that pigeons lack the ability to exploit the structural regularities of rule-based categories (Berg & Grace, 2011; Smith et al., 2012). Furthermore, pigeons show feature-based, instead of rule-based generalization in the Shanks-Darby procedure (Shanks & Darby, 1998; Maes et al., 2015). Therefore, we predicted that our pigeons would perform just as did the monkeys in Smith et al.’s (2004) study (Figure 2B). Specifically, their performance on the Type 2 task should comport with the linear performance pattern predicted by the structural complexity of each task, instead of the nonlinear pattern of performance typically reported with human participants (Nosofsky, et al., 1994).
Method
Subjects
Three captive-bred pigeons (Columba livia) were studied. Pigeons were kept at 85% of their free-feeding weights and had free access to grit and water. The animals had previously participated in unrelated studies of visual discrimination and categorization of real world images (Castro & Wasserman, 2014; Navarro & Wasserman, 2015; Couto, Navarro, Smith & Wasserman, 2016). None of the subjects had any experience with the visual dimensions used to construct the stimuli used in this study. The institutional Animal Care and Use Committee at the University of Iowa approved the housing conditions and training procedures.
Apparatus
We used three 36 × 36 × 41 cm conditioning chambers (detailed in Gibson, Wasserman, Frei, & Miller, 2004), located in a dark room with continuous white noise. Each chamber was equipped with a 15-in LCD monitor behind a resistive touchscreen. The visible portion of the screen was 28.5 × 17 cm. Within it, a centered, 3.5 × 3.5 cm area was used to display the start stimulus, and two, equally spaced 3 × 3 cm areas were used to display the response buttons. The response buttons were positioned with a 2-cm and a 1-cm separation from the small and large stimuli, respectively. The stimuli to be categorized were presented centered, 8 cm above the wire mesh floor. A serial controller outside the chamber processed pecks to the touchscreen. A rotary dispenser delivered 45-mg food pellets through a vinyl tube into a plastic cup in the center of the rear wall opposite the touchscreen. Illumination during experimental sessions was provided by a houselight on the rear wall of the chamber. The pellet dispenser and houselight were controlled by a serial I/O interface. An iMac computer controlled each chamber, using programs developed in MATLAB with Psychtoolbox-3-extensions (Brainard, 1997; Pelli, 1997).
Stimuli
Two response buttons and a target stimulus were presented on the screen, over a colored background. The target stimuli were composed of features from three, binary-valued visual dimensions (Gottselig, Wasserman & Young, 2001; see Figure 1B): the diameter of a circle (small or large), its brightness (bright or dark), and the orientation of a thin bar (horizontal or vertical) bisecting the circle. The small and large diameter circles were 4.1 and 6.6 cm, respectively. The bright and dark luminance values were RGB: 165, 165, 165, and RGB: 105, 105, 105, respectively. The background could be any of six different colors: red (RGB: 255, 0, 0), blue (RGB: 0, 0, 255), yellow (RGB: 240, 240, 40), pink (RGB: 240, 40, 240), orange (RGB: 255, 140, 0), and green (RGB: 25, 255, 0) depending on the task being given. The response buttons were 12 different multicolored patterns (see Supplemental Material 1 for examples of the different visual displays).
Procedure
Given the multiplicity of possible problems per task type, the particular problems that each subject received were carefully selected to balance the informational value of each of the three stimulus dimensions and the dyadic relationships they could form (Kloos & Sloutsky, 2008). Critically, this selection process required that the relevant dimension in Task 1 was the irrelevant dimension in Task 2. Conversely, the two relevant dimensions in Task 2 had to be the irrelevant dimensions in Task 1. Finally, our concurrent programming used different background colors and report buttons for each task type (assigned randomly on an individual pigeon basis). Both the background and button stimuli were available to serve as cues informing the pigeons as to which of the six different types of problems was in force on any given trial.
Concurrent training.
Pigeons were given 180 daily sessions of training. Each session contained the 16 unique trials per task type (8 stimuli × 2 randomly arranged locations of the correct and incorrect response buttons), for a total of 96 trials. Each trial began with a start stimulus. Once the pigeon pecked it, the stimulus to be categorized was presented in the center of the screen, over the colored background that corresponded to the task type. After the pigeon pecked the stimulus a fixed number of times (increased daily on the basis of each pigeon’s performance, to a maximum of 25), the two response buttons—one correct and the other incorrect—appeared to the sides of the target stimulus. A final peck to either of the two response buttons determined the remainder of the trial. If the chosen button corresponded to the category of the stimulus being shown, then 1 to 3 food pellets were delivered randomly, followed by a 6-s to 10-s random intertrial interval. If the chosen button did not correspond to the category of the stimulus being shown, then no food pellets were delivered, and a correction trial with the same stimuli was given after the intertrial interval, under the same reinforcement contingencies. Correction trials continued to be given until the correct response was performed, but data from correction trials were excluded from analysis.
Successive training.
As noted earlier, concurrent training might somehow have interfered with our pigeons’ ability to learn each task by itself, possibly because the relevant dimensions in one task were irrelevant in another task, and vice versa. In order to assess such inter-task interference, the same three pigeons were subsequently given four of the tasks they had received during the concurrent phase (Types 1, 2, 4, and 6), now one at a time, in a random order. The background colors and response button stimuli that distinguished the tasks in the concurrent training phase were carried forward to this phase. The pigeons received daily sessions of 96 trials until they reached 85% of correct responses in each of the two categories for 2 days in a row or until they had completed 30 sessions. When a new problem was first introduced, the number of pecks required for the observing response was reduced to 10 and then increased daily by 1 to a maximum of 25.
Results
To test whether our pigeons’ behavior comported with the structural complexity of the categories in each task, we analyzed their choice performance as a function of the structural ratio of the tasks. The structural ratio is a measure of category structural complexity, based on the interstimulus distances along each dimension. This measure is expressed as the ratio of within-category to between-category distances among category members (Homa et al., 1979; Smith et al., 2004); it is bounded between 0 and 1, with 0 indicating perfectly separated categories and 1 indicating perfectly overlapped categories. Type 1, the simplest task, has a structural ratio of .37 (there is no overlap between categories along the relevant dimension, but there is perfect overlap along the other two, irrelevant dimensions). Type 6, the hardest task, has a structural ratio of .82. Also worth noting, Type 2 has a structural ratio of .65. Finally, Types 3, 4, and 5 have structural ratios of .49, .46, and .60, respectively.
Concurrent phase.
The mean overall percentage of choice errors in task Types 1, 2, 3, 4, 5, and 6 were 27.7% (SD = 11.21), 42.8% (SD = 11.02), 35.5% (SD = 4.71), 32.5% (SD = 3.11), 41.1% (SD = 7.32), and 50.7% (SD = 1.89), respectively. Figure 2A shows the percentage of errors in each task across blocks of training, as a function of each task’s structural ratio.
As illustrated in the figure, our pigeons’ errors increased with the structural ratio of each task, and this relation steepened across blocks of training (c.f. slopes of Block 1 and Blocks 5 & 6 in Figure 2A). Because we repeatedly measured each individual pigeon across time, we assessed our pigeons’ errors using a logistic mixed-effects model. The model included structural ratio of each task and the logarithm of session as fixed effects (see Supplemental Material 2 for further details on mixed-effects models and model selection). As expected, the model revealed that the overall percentage of errors reliably decreased across sessions, B = −0.20, SE = 0.03, 95% CI [−0.25, −0.15], Z = −7.94, P < .001. More importantly, the percentage of errors in each task was positively related to the structural ratio of the task, B = 2.24, SE =0.41, 95% CI [1.48, 3.00], Z = 5.42, P < .001. Furthermore, the reduction in the percentage of errors across training in each task was inversely related to the task’s structural ratio, B = 0.71, SE = 0.23, 95% CI [0.58, 0.84], Z = 3.05, P < .01. That is, the reduction in the pigeons’ errors across sessions was larger for tasks with smaller structural ratios. Critically, pigeons made more errors in Type 2 than in Type 4 (42.8% versus 32.5%, respectively); so, just like the monkeys in the Smith et al. (2004) study, our pigeons failed to show a Type 2 advantage (Figure 2B).
Finally, we asked whether the pigeons’ errors fell reliably below chance in each task, during the last 10 sessions of concurrent training. As assessed by binomial tests, all pigeons were below chance in Type 1 (all ps < .001), 42Y and 81B were below chance in Type 2 (18% and 42%, respectively, p < .001 and p < .05, respectively), all pigeons were below chance in Types 3, 4, and 5 (all ps < .01), but no pigeon was below chance in Type 6 (all ps > .10).
Successive phase.
Recall that in this phase, each problem was given alone until the learning criterion was met or until 30 days of training had been completed. Therefore, in order to compare pigeons’ performance across both phases, if bird met criterion on a given problem, we assumed that the remaining sessions would have had the same score as the session on which criterion was met (see Nosofsky et al., 1994 for a similar treatment of post-criterion performance in humans). Consequently, the mean percentages of errors observed in task Types 1, 2, 4, and 6 were 15.9% (SD = 8.15), 35.2% (SD =15.75), 21.1% (SD = 4.91), and 47.2% (SD = 1.97), respectively (see Supplemental Material 3 for individual scores). Similar to the concurrent phase, the pigeons’ errors were positively related to the structural ratio of the tasks. A mixed-effects logistic model including structural ratio and session as fixed effects (see Supplemental Material 2 for details) again revealed that the overall percentage of errors reliably decreased across sessions, B = −0.33, SE = 0.01, 95% CI [−0.35, −0.30], Z = −22.87, p < .001. Furthermore, the percentage of errors in each task and their decrease across sessions were again inversely related to the structural ratio of the task, B = 3.59, SE = 0.31, 95% CI [2.98, 4.20], Z = 11.48, p < .001, and B = 0.82, SE = 0.22, 95% CI [0.40, 1.26], Z = 3.72, p < .001, respectively. Most importantly and just as in the concurrent phase, our pigeons’ errors in the Type 2 task (35.2%) exceeded those in the Type 4 task (21.1%).
A comparison of the overall percentage of errors across the two training phases suggests that concurrent presentation of the tasks might have made the tasks more difficult than successive presentation (see Supplemental Material 3). Assessing errors during the last 2 sessions of the successive phase (which approximates the number of trials per task we used to assess terminal performance in the concurrent phase) revealed that all of the pigeons’ errors fell below chance in Type 2 (40%, 9%, and 31% for 17B, 42Y, and 81B, respectively; all ps < .01). Furthermore, errors for 42Y and 81B fell below chance in Type 6 as well (41% and 30%, respectively; both ps < .05).
More importantly, our results confirm that concurrent programing of the tasks did not materially alter the way our pigeons performed in each task. Indeed, the rank ordering of errors in each of the four tasks was conserved across the concurrent and successive training phases, for each bird; the correlation between the mean percentage of errors between phases by task type was high and significant (Pearson’s r = .98, p < .001).
Strategy analysis.
We further assessed our pigeons’ performance by measuring the extent to which the birds were using unidimensional strategies during the concurrent phase of the experiment. Recall that a unidimensional strategy is necessary to solve the Type 1 task, in which category membership depends on a single dimension. However, a variety of unidimensional strategies can also be applied to Types 3, 4, and 5. Under these suboptimal strategies, subjects will correctly categorize 75% of the stimuli. To measure the extent to which our pigeons adopted these strategies, we calculated the Phi coefficient to measure the relation between the dimensional values of the stimuli and the categorical choices our pigeons made in each task type (see Minda, Desroches, & Church, 2008 for a similar analysis of human data in the SHJ tasks).
These analyses revealed that our pigeons’ use of perfect unidimensional strategies (i.e. an absolute Phi coefficient of 1) was generally rare and varied substantially among individuals. We found perfect coefficients in 4 sessions for pigeon 17B, 1 session for pigeon 42Y, and 39 sessions for pigeon 81B (approximately 2%, 0.6%, and 22% of sessions, respectively). A closer look at these coefficients revealed that the single perfect coefficient for 42Y came from performance on the Type 3 task, whereas all of the perfect coefficients obtained for pigeons 17B and 81B came from their Type 1 tasks, especially later in training. Despite perfect coefficients being rare, our pigeons’ use of each dimension tended to match the availability of optimal or suboptimal unidimensional strategies (as denoted by the presence of high, but imperfect coefficients). For example, the relevant dimension in Type 1 yielded the largest coefficient value for all of our pigeons. Similarly, our pigeons adopted imperfect unidimensional strategies on those tasks that permitted them (Types 3, 4, and 5, in which perfect unidimensional strategies lead to 25% of errors), but did not do so in tasks on which individual dimensional values were orthogonal to category membership (Types 2 and 6). Overall, these results suggest that our pigeons’ performance on the tasks was dimensionally oriented (see Supplemental Information 4 for individual coefficients across the training blocks of both concurrent and successive phases).
Model-based analyses of individual pigeon performance.
As the previous analyses indicate, our pigeons’ errors, as a group, increased as a linear function of each task’s structural complexity (Figure 2B). However, individual pigeons’ performances did vary considerably (see Figure 3A, black circles). Specifically, although the errors of pigeons 17B and 81B increased monotonically as a function of the structural ratio of each task, the errors of pigeon 42Y did not. Notably, 42Y’s errors for Type 2 (30.2%) were not only similar to that for Type 4 (28.9%), but also lower than for Type 1 (38.6%). In other words, the errors of pigeon 42Y were nonmonotonically related to the structural complexity of the tasks, much as is typically observed for human participants (cf. Figure 2B).
Figure 3.
Model fits of our three pigeons’ performance in the SHJ tasks. A. Observed and predicted percentage of errors for the three pigeons in each of the SHJ tasks (1 to 6, annotated). The fits in the top row were obtained using Kruschke’s original (1992) ALCOVE model. The fits in the middle row were obtained using a version of ALCOVE that allowed for differences in each dimension’s attentional learning rate. The fits in the bottom row were obtained using a version of ALCOVE that allowed for differences in discriminability among the three dimensions (see text for details). B. Observed-predicted plots assessing the quality of the fit provided by the original ALCOVE model (top) and its two expanded versions (middle and bottom), for the group and the three individual pigeons. Each model’s annotated coefficient of determination (R2) was calculated over individual scores only. See the online article for the colored version of this figure.
A potential explanation for this unusual pattern of performance lies at the junction between the discriminability of the dimensional values we used to construct the stimuli and their role in determining category membership. Recall that each pigeon received unique versions of each task; moreover, there was no overlap between the relevant dimensions in task Types 1 and 2. In the case of 42Y, this meant that brightness was the dimension determining category membership in task Type 1, whereas size and line orientation determined category membership in task Type 2. Thus, 42Y’s performance may be at least partially explained by the brightness values having low discriminability and the size and line values having high discriminability. If so, then 42Y’s ability to learn task Type 1 might have been hindered by low discriminability of the relevant dimension and high discriminability of the irrelevant dimensions. Similarly, 42Y’s success in learning the substantially more complex task Type 2 might have been promoted by high discriminability of the relevant dimensions and low discriminability of the irrelevant dimension.
In order to test this idea, we fitted individual pigeon performance in each task, on a trial-by-trial basis, using the Attention Learning Covering Map model (ALCOVE; Kruschke, 1992). Briefly, ALCOVE is a three-layer, fully connected model for supervised category learning that uses exemplar-based representation and learns by gradient descent on error. Its exemplar-based representation allows ALCOVE to solve nonlinear learning problems (e.g., SHJ’s Types 2 and 6), which models like that of Rescorla and Wagner (1972), without configural cues, fail to solve. More importantly, ALCOVE can learn to attend to (or to ignore) the portions of the input that lead to correct (or incorrect) categorization (i.e., attentional learning; Mackintosh, 1975).
ALCOVE’s ability to learn which dimensions are relevant to the task at hand allows it to predict human performance in the SHJ tasks (see Pitt, Kim, Navarro & Myung, 2006 for a comprehensive analysis of the behaviors that ALCOVE can exhibit in these tasks, and the model’s falsifiability). Recall that the SHJ tasks vary in the number of dimensions that are necessary for categorization. Type 1 requires a single dimension; Type 2 requires two dimensions; and, Types 3–6 require all three dimensions to different degrees (although the structural complexity of Type 6 is greater than that of Types 3, 4, and 5). Humans learn the six types of task in accord with this attentional ordering, and their performance can thus be reproduced by ALCOVE. Furthermore, the model is able to accommodate the performance of the monkeys in Smith et al. (2004) and the average performance of our three-pigeon cohort, by driving its attentional learning abilities toward zero. When unable to learn about the relevant and irrelevant dimensions of a task, the model’s performance is entirely determined by the structural complexity of the tasks.
Despite its success in explaining our pigeons’ group-level data, ALCOVE cannot faithfully reproduce 42Y’s pattern of performance (Figure 3A, top row). This is because, although ALCOVE has dimensional attentional weights that control exemplar activation, the model does not have dimension-specific parameters that would regulate the subject’s ability to learn to attend (or ignore) at different rates for each dimension or that would control the discriminability of the values along each dimension. If a dimensional discriminability account has merit, then such flexibility might be needed to explain the individual differences in performance that we observed. We thus created two extensions of ALCOVE that might have just this kind of flexibility.
We first explored whether our pigeons learned to attend to different dimensions at different rates. A single attentional learning rate parameter (λa) in ALCOVE controls how much the model adjusts all of its dimensional attention weights after receiving feedback. Thus, our “ALCOVE + dimensional learning” model expanded ALCOVE’s single attentional learning rate parameter into three dimension-specific attentional learning rates (λbrightness, λorientation, λsize), thereby allowing the model to learn to attend or ignore the three dimensions at differential rates, if necessary. (see Supplemental Material 5 for details on the fitting process.)
We then explored whether the dimensions we used to construct the stimuli varied in discriminability. A single sensitivity parameter (c) in ALCOVE scales the distance between the presented stimulus and the exemplar representations. Thus, our “ALCOVE + dimensional discriminability” model expanded ALCOVE’S single sensitivity parameter into three dimension-specific sensitivity parameters (cbrightness, corientation, csize), allowing the model to differentially weight the distances along each dimension, if necessary. In a sense, the inclusion of dimension-specific sensitivity parameters amounts to the estimation of inter-stimulus distances along each dimension using categorization data only (a process that would otherwise require gathering similarity-judgments; e.g. Lee & Navarro, 2002). The addition of these distance-scaling parameters is particularly relevant to the present stimuli, because they were constructed from binary-valued dimensions.
Figure 3 depicts the fits that were produced by each of the three models. Table 1 shows the parameters for each individual model and pigeon after optimization with a genetic algorithm over the negative log likelihood of the models. First, note in Figure 3A (top row) that the original ALCOVE model can predict the performance of 17B and 81B to close degree, by driving the attentional learning rate (λa) toward zero (Table 1). However, the model does not closely reproduce 42Y’s pattern of performance (e.g., the model’s percentages of errors in Types 1 and 2 are similar, and its errors in Type 4 are less than in Type 2). Note, however, that because the performance of two out of our three pigeons was directly related to the structural complexity of each task, the model did a good job in reproducing average group-level performance (Figure 3B, top panel, black symbols).
Table 1.
Best-fitting parameters and Akaike Information Criterion (AIC) for ALCOVE and its extensions.
Model | Parameter | 17B | 42Y | 81B | Mean |
---|---|---|---|---|---|
ALCOVE | c | 0.5788 | 0.4253 | 0.6206 | 0.5416 |
φ | 0.5786 | 0.4288 | 0.6211 | 0.5429 | |
λw | 0.0111 | 0.0223 | 0.0098 | 0.0144 | |
λa | 0.0008 | 0.0022 | 0.0004 | 0.0011 | |
AIC | 22,883 | 22,008 | 21,515 | 22,135 | |
ALCOVE + Dimensional Learning Rates | c | 0.2204 | 0.3972 | 0.5281 | 0.3819 |
φ | 0.5498 | 0.3714 | 1.0649 | 0.6620 | |
λw | 0.0189 | 0.0963 | 0.0030 | 0.0394 | |
λbrightness | 0.0000 | 0.0000 | 0.0000 | 0.0000 | |
λorientation | 1.0000 | 0.1537 | 0.0058 | 0.3865 | |
λsize | 0.0019 | 0.0541 | 0.0658 | 0.0406 | |
AIC | 22,885 | 21,966 | 21,504 | 22,118 | |
ALCOVE + Dimensional Discriminability | cbrightness | 0.0950 | 0.1223 | 0.3383 | 0.1852 |
corientation | 0.5841 | 0.6456 | 0.6385 | 0.6228 | |
csize | 0.5040 | 0.6165 | 0.6606 | 0.5937 | |
φ | 0.7761 | 1.1862 | 0.9878 | 0.9834 | |
λw | 0.0060 | 0.0173 | 0.0046 | 0.0093 | |
λa | 0.0000 | 0.0000 | 0.0037 | 0.0012 | |
AIC | 22,878 | 21,829 | 21,530 | 22,079 |
Note: c = overall sensitivity parameter, φ = mapping constant for choice rule; λw = learning rate for output weights; λa = learning rate for attentional weights; λbrightness, λorientation, and λsize = attentional learning rates for the attentional weights of brightness, orientation, and size dimensions; cbrightness, corientation, and csize = sensitivity to brightness, orientation, and size dimensions; AIC = Akaike Information Criterion.
The fits of ALCOVE improved when we allowed for variations in the rate at which the model learned to attend to each dimension. Whereas the original ALCOVE’s R2 for individual performance was 0.93, the “ALCOVE + Dimensional Learning” R2 was 0.96 (c.f. top and middle rows of Figures 3A and 3B). As Table 1 illustrates, the “ALCOVE + Dimensional Learning” model converged on different learning rates for each dimension. In doing so, the expanded model more closely approximated the performance of 17B and 81B, and far more faithfully reproduced 42Y’s peculiar pattern of errors. To test whether the improvement in model fit was substantial for each pigeon and not merely due to an increase in parameters, we calculated the Akaike Information Criterion (AICs; Akaike, 1974) for the ALCOVE and “ALCOVE + Dimensional Learning” fits of each pigeon’s data. The AICs of the original ALCOVE fits were 22,883, 22,008, and 21,515, for pigeons 17B, 42Y, and 81B, respectively, whereas the AICs for the “ALCOVE + Dimensional Learning” for the same birds were 22,885, 21,966, 21,504, respectively. In other words, the additional parameters substantially improved the fits for both 42Y and 81B, but not for 17B. Note that “ALCOVE + Dimensional Learning” learned to attend to orientation faster than it learned to attend to size, but more critically, the model accounted for the performance of all three birds by being incapable of learning to modulate attention to brightness (λbrightness = 0). The value for this parameter, however, does not imply that the pigeons could not learn to discriminate between brightness values. All birds showed control by brightness in the tasks that allowed it (see Supplemental Material 4). Furthermore, even 42Y managed to discriminate brightness to a decent degree during the last 10 sessions of concurrent training (28% errors in Type 1; see Supplemental Material 6). Instead of an inability to discriminate brightness, a low attentional learning rate for brightness suggests that the pigeons might have been unable to modulate their attention to brightness values when they were relevant or irrelevant for the categorization at hand.
ALCOVE’s fits improved even more when we allowed for variations in the sensitivity to each dimension (R2 = 0.99; Figure 3B, bottom row). Similar to the previous extension, “ALCOVE + Dimensional Discriminability” also improved upon ALCOVE’s fits for 17B and 81B, and produced a remarkably close fit for 42Y’s nonmonotonic pattern of performance. The AICs of “ALCOVE + Dimensional Discriminability” fits for pigeons 17B, 42Y, and 81B, were 22,878, 21,829, and 21,530, respectively. Therefore, this time around the additional parameters improved the fits for both 42Y and 17B, but not 81B. As Table 1 illustrates, the “ALCOVE + Dimensional Discriminability” model converged on different sensitivity parameters for each dimension, but most critically, the average sensitivity to brightness was the lowest (cbrightness = 0.19). Also worth noting is the fact that the model managed to obtain remarkably close fits to the pigeons’ performance by driving the attentional learning rate parameter to 0.0037 for 81B and to 0.0 for both 17B and 42Y. These results prove that the performance exhibited by our pigeons, including 42Y, does not need to be explained by appealing to changes in attention across training. Indeed, a model with parameters to scale the distance between the presented stimulus and the exemplars stored in memory should suffice to capture our pigeons’ performance with extremely high fidelity.
Discussion
The current experiment assessed pigeons’ category learning with the highly influential Shepard et al. (1961) tasks. Although we observed notable individual differences, the performance of the pigeons as a group closely conformed to the structural complexity of the given tasks; pigeons committed more errors to the more structurally complex tasks (Figure 2B).
Importantly, the percentage of errors the pigeons committed in the Type 2 task was not lower than the percentage of errors they committed in the Type 4 task. Thus, our pigeons did not show the Type 2 advantage that is often seen with human participants. This advantage is often considered to be a hallmark of rule use (Shepard, et al., 1961) or a signature of selective attention to relevant dimensions (Nosofsky, et al. 1994; Nosofsky & Palmeri, 1996).
Departing from traditional experimental designs, we concurrently programmed all of the problem types in each training session. Data from a subsequent training phase in which each of the problems was individually and successively presented suggest that concurrent programming might have increased the overall difficulty of the tasks. Nevertheless, the successive training data also suggest that concurrent programing might have not meaningfully affected the way our pigeons learned the tasks. Specifically, the rank order of difficulty of the tasks was the same across both training phases, for all pigeons. Do note that the order of the concurrent and successive phases was confounded in the present design; so, future studies will be needed to assess whether concurrent programming of the SHJ tasks limited our pigeons’ ability to learn the tasks, even when they were later presented in isolation. To our knowledge, no concurrent programming of the SHJ tasks has been attempted with human participants.
Given the specific tasks that we assigned to each pigeon, we observed interesting individual differences (Figure 3A). Most notably, 42Y’s errors on the Type 2 task were as low as the errors it committed on the Type 4 task. Although this low percentage of errors is impressive when compared to the other pigeons’ performance, 42Y’s performance still does not comport with human performance in the Type 2 task. For example, 42Y’s learning curve shows no evidence of a sudden decrease in errors over tens of trials, often deemed to be a possible indicator of rule discovery (see Supplemental Material 6). Instead, it is likely that this pigeon’s pattern of performance, as well as the other individual differences that we observed, resulted from disparities in the discriminability of the dimensions from which the stimuli were constructed.
In a sense, 42Y’s exceptional pattern of errors helped us prove a more general point. By expanding a relatively simple, yet generally successful categorization model (ALCOVE; Kruschke, 1992), we showed that all of the birds’ individual differences can be readily reproduced by incorporating either dimension-specific attentional learning rates or dimension-specific sensitivities. Both approaches suggested that brightness was the least discriminable dimension, not only for 42Y, but also for the other two pigeons. Moreover, this modelling process revealed that attentional learning is sufficient but not necessary to explain the individual differences that we documented. This is because in ALCOVE, both dimensional attentional weights and sensitivity parameters serve to scale inter-stimulus distances.
Do appreciate that we are not claiming that pigeons are unable to modulate their attention; evidence clearly indicates that they do learn to attend more to relevant than to irrelevant features in visual categorization tasks (e.g., Castro & Wasserman, 2017). Rather, we believe that the excellent fits that resulted from our expanded ALCOVE models underscore the complex relationship between the perceived psychological distance between different dimensional values (dimensional discriminability) and the ability to flexibly modulate the amount of attentional resources devoted to different dimensions (attentional learning). It is possible that organisms are unable to modulate the attention they pay to stimulus dimensions that are hard to discriminate. Conversely, organisms might have difficulty discriminating between dimensional values because they fail to increase the attention they pay to them. Future studies should aim to disentangle these two closely related possibilities.
The pigeons in our study had considerable difficulty learning the Type 6 task; all three of our pigeons’ errors were at chance level by the end of the concurrent training phase, and the errors for only two of our pigeons fell below chance level by the end of the successive training phase. The Type 6 task is remarkably complex in structure, because the categories it contains are perfect pseudocategories. Although pigeons can learn pseudocategories comprised of natural stimuli (e.g., Navarro & Wasserman, 2015; Wasserman, Kiedinger, & Bhatt, 1988) where unique stimulus-specific physical features are available, in the present case, the stimuli assigned to each category contain no such features (Soto & Wasserman, 2010). Stimulus-specific features allow organisms to memorize the arbitrary mappings arranged in a pseudocategorization task. In the present case, however, the categories in Type 6 have the same average value along each dimension (e.g., for any of the categories, half of the exemplars are dark and the remaining half of the exemplars are bright), thus placing any single dimension orthogonal to category membership. Furthermore, the dyadic relations that can be formed with any two dimensions are also orthogonal to category membership. Effectively, in order to solve the Type 6 task, organisms need to combine all three dimensional values. The nonlinear combination of these dimensional values would allow them, for instance, to extract a configural cue and associate it with the correct categorization response. Previous studies have documented pigeons’ ability to nonlinearly combine more than three dimensions at a time (e.g., Vyazovska, Navarro, & Wasserman, 2018, Lea, Lohmann, & Ryan, 1993), so it is unlikely that our pigeons were unable to do so with the current stimuli. Instead, it is possible that unidimensional control from the readily discriminable dimensions (and even the control exerted by dyadic relations) competed with configural control (Delamater, Sosa, & Katz, 1999; Redhead & Pearce, 1995). If discrimination along one or more dimensions overwhelms discrimination of configural cues, then failure to learn the Type 6 task is to be expected. Future studies should explore this possibility.
Our results further expand our general understanding of the category learning abilities of nonhuman animals (Lazareva & Wasserman, 2017). With particular regard to the SHJ tasks, both pigeons’ and monkeys’ errors can be seen to fall in line with the structural complexity of the tasks (Figure 2B). Importantly, for both species, Type 2 is the second hardest task to learn, whereas for humans, Type 2 is the second easiest task to learn. Our evidence thus further confirms that the Type 2 advantage may be uniquely observed in humans and may require conditions in which more explicit categorization systems can help encode the nonlinear structure of the Type 2 task (Ashby, Alfonso-Reese, Turken, & Waldron, 1998).
Although evidence of a Type 2 advantage has not yet been obtained in nonhuman animals, there are several reports of humans behaving more like pigeons and monkeys. For example, Nosofsky and Palmeri (1996) showed that humans fail to show the Type 2 advantage when stimuli are composed from non-analyzable dimensions, such as saturation, brightness, and hue (Shepard & Chang, 1963). Furthermore, Minda et al. (2008) found that humans’ learning of the Type 2 task was impaired if they were performing a concurrent, verbal task. Finally, Kurtz et al. (2012) have reported that the likelihood of a Type 2 advantage decreases when participants are not given instructions that prompt them to decompose the stimuli into their constituent dimensions or to find rules for category membership.
The present findings thus inform a possible continuum of categorization learning abilities among different species (Smith et al., 2012). At one end of the continuum, human categorization learning can be—under proper conditions—both analytical and rule-based. Categorization is analytical in the sense that certain categories prompt human participants to encode those dimensions that are necessary to determine category membership and to ignore those dimensions that are not. As a result, humans learn rule-based tasks faster than information-integration tasks (Smith et al., 2010; Ashby & Gott, 1988). Rule-based tasks contain categories whose structures can be described with simple unidimensional, conjunctive, or disjunctive rules, whereas information-integration tasks involve categories whose structures cannot be so easily described. Consequently, humans can often produce verbal rules to encode representations of rule-based categories, thereby giving them an advantage when learning a category and inferring the membership of new category exemplars (Zakrzewski et al., 2018).
In the middle of the continuum, monkey categorization learning has been shown to be analytical (Smith et al., 2012), but apparently not rule-based. Although monkeys do learn rule-based tasks faster than information-integration tasks, they may have difficulty transferring such rules to novel exemplars (Zakrzewski et al., 2018).
Finally, at the other end of the continuum, we find that pigeon categorization learning is nonanalytical; pigeons appear to encode both relevant and irrelevant dimensions when learning the same tasks that encourage analytical category learning in humans and monkeys (Berg & Grace, 2011; Lea, et al., 2018; Smith et al., 2011). Additionally, as the present study shows, the categorization learning of the pigeons we studied shows no sign of rule-use. We obtained no evidence to suggest that pigeons learn the SHJ tasks by relying on anything other than the physical properties of the stimuli.
Supplementary Material
Author notes
Correspondence concerning this article should be addressed to Edward. A. Wasserman, Department of Psychological and Brain Sciences, The University of Iowa, Iowa City, IA 52242. ed-wasserman@uiowa.edu. This research was supported by the Eunice Kennedy Shriver National Institute of Child Health & Human Development of the NIH under Award Number P01HD080679.
References
- Akaike H (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control 19, 716–723. 10.1109/TAC.1974.1100705 [DOI] [Google Scholar]
- Alfonso-Reese LA, Ashby FG, & Brainard DH (2002). What makes a categorization task difficult? Perception & Psychophysics, 64, 570–583. 10.3758/BF03194727 [DOI] [PubMed] [Google Scholar]
- Ashby FG, Alfonso-Reese LA, Turken AU, & Waldron EM (1998). A neuropsychological theory of multiple systems in category learning. Psychological Review, 105, 442–481. https://doi.Org/10.1037//0033-295X.105.3.442 [DOI] [PubMed] [Google Scholar]
- Ashby FG, & Gott RE (1988). Decision rules in the perception and categorization of multidimensional stimuli. Journal of Experimental Psychology: Learning, Memory, and Cognition, 14, 33 10.1037/0278-7393.14.1.33 [DOI] [PubMed] [Google Scholar]
- Berg ME, & Grace RC (2011). Categorization of multidimensional stimuli by pigeons. Journal of the Experimental Analysis of Behavior, 95, 305–326. 10.1901/jeab.2010.94-305 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brainard DH (1997) The psychophysics toolbox, Spatial Vision, 10, 433–436 10.1163/156856897X00357 [DOI] [PubMed] [Google Scholar]
- Castro L, & Wasserman EA (2014). Pigeons’ tracking of relevant attributes in categorization learning. Journal of Experimental Psychology: Animal Learning and Cognition, 40, 195–211. 10.1037/xan0000022 [DOI] [PubMed] [Google Scholar]
- Castro L, & Wasserman EA (2017). Feature predictiveness and selective attention in pigeons’ categorization learning. Journal of Experimental Psychology: Animal Learning and Cognition, 43, 231–242. 10.1037/xan0000146 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Couto K, Navarro V, Smith T, & Wasserman EA (2016). Concept learning without differential reinforcement in pigeons by means of contextual cueing. Journal of Experimental Psychology: Animal Learning and Cognition, 42, 221–227. 10.1037/xan0000099 [DOI] [PubMed] [Google Scholar]
- Gibson BM, Wasserman EA, Frei L, & Miller K (2004). Recent advances in operant conditioning technology: A versatile and affordable computerized touch screen system. Behavior Research Methods, Instruments and Computers, 36, 355–362. 10.3758/bf03195582 [DOI] [PubMed] [Google Scholar]
- Gottselig JM, Wasserman EA, & Young ME (2001). Attentional tradeoffs in pigeons learning to discriminate newly relevant visual stimulus dimensions. Learning & Motivation, 32, 240–253. 10.1037/0097-7403.28.4.347 [DOI] [Google Scholar]
- Delamater AR, Sosa W, & Katz M (1999). Elemental and configural processes in patterning discrimination learning. The Quarterly Journal of Experimental Psychology, 52, 97–124. 10.1080/71393269810101975 [DOI] [Google Scholar]
- Homa D, Rhoads D, & Chambliss D (1979). Evolution of conceptual structure. Journal of Experimental Psychology: Human Learning and Memory, 5, 11 http://dx.doi.Org/10.1037/0278-7393.5.1.11 [Google Scholar]
- Kloos H, & Sloutsky VM (2008). What’s behind different kinds of kinds: Effects of statistical density on learning and representation of categories. Journal of Experimental Psychology: General, 137, 52–72. 10.1037/0096-3445.137.1.52 [DOI] [PubMed] [Google Scholar]
- Kruschke JK (1992). ALCOVE: An exemplar-based connectionist model of category learning. Psychological Review, 99, 22–44. https://doi.Org/0033-295X/92/3.00 [DOI] [PubMed] [Google Scholar]
- Kurtz KJ (2007). The divergent autoencoder (DIVA) model of category learning. Psychonomic Bulletin & Review, 14, 560–576. 10.3758/BF03196806 [DOI] [PubMed] [Google Scholar]
- Kurtz KJ, Levering KR, Stanton RD, Romero J, & Morris SN (2013). Human learning of elemental category structures: Revising the classic result of Shepard, Hovland, and Jenkins (1961). Journal of Experimental Psychology: Learning, Memory, and Cognition, 39, 552–572. 10.1037/a0029178 [DOI] [PubMed] [Google Scholar]
- Lazareva OF, & Wasserman EA (2017). Categories and concepts in animals In Menzel R (Ed.), Learning theory and behavior. Learning and memory: A comprehensive reference 2e, Vol. 1 (Byrne J, Ed.), pp. 111–139. Oxford: Elsevier. [Google Scholar]
- Lea SEG, Lohmann A, & Ryan CME (1993). Discrimination of 5-dimensional stimuli by pigeons: Limitations of feature analysis. The Quarterly Journal of Experimental Psychology B: Comparative and Physiological Psychology, 46B, 19–42. [Google Scholar]
- Lea SEG, Pothos EM, Wills AJ, Leaver LA, Ryan CME, & Meier C (2018). Multiple feature use in pigeons’ category discrimination: The influence of stimulus set structure and the salience of stimulus differences. Journal of Experimental P sychology: Animal Learning and Cognition, 44, 114–127. 10.1037/xan0000169 [DOI] [PubMed] [Google Scholar]
- Lee MD, & Navarro DJ (2002). Extending the ALCOVE model of category learning to featural stimulus domains. Psychonomic Bulletin & Review, 9, 43–58. [DOI] [PubMed] [Google Scholar]
- Love BC, Medin DL, Gureckis TD (2004). SUSTAIN: A network model of category learning. Psychological Review, 111, 209–332. 10.1037/0033-295X.111.2.309 [DOI] [PubMed] [Google Scholar]
- Mackintosh NJ (1975). A theory of attention: Variations in the associability of stimuli with reinforcement. Psychological Review, 82, 276–298. 10.1037/h0076778 [DOI] [Google Scholar]
- Maes E, De Filippo G, Inkster AB, Lea SEG, De Houwer J, D’Hooge R, Beckers T, & Wills AJ (2015). Feature- versus rule-based generalization in rats, pigeons and humans. Animal Cognition, 18, 1267–1284. 10.1007/s10071-015-0895-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maler L (2018). Brain evolution: Intelligence without a cortex. Current Biology, 28, R213–R215. https://doi.Org/10.1016/j.cub.2018.01.065 [DOI] [PubMed] [Google Scholar]
- Minda JP, Desroches AS & Church BA (2008). Learning rule-described and non-rule-described categories: A comparison of children and adults. Journal of Experimental Psychology: Learning Memory and Cognition, 34, 1518–1533. 10.1037/a0013355 [DOI] [PubMed] [Google Scholar]
- Murphy GL (2002). The big book of concepts. Cambridge, MA: MIT Press. [Google Scholar]
- Navarro VM, & Wasserman EA (2015). Stepwise conceptualization in pigeons. Journal of Experimental Psychology:Animal Learning and Cognition, 42, 44–50. 10.1037/xan0000083 [DOI] [PubMed] [Google Scholar]
- Nosofsky RM, Gluck MA, Palmeri TJ, McKinley SC, & Glauthier P (1994). Comparing models of rule-based classification learning: A replication and extension of Shepard, Hovland, and Jenkins (1961). Memory & Cognition, 22, 352–369. 10.3758/bf03200862 [DOI] [PubMed] [Google Scholar]
- Nosofsky RM, & Palmeri TJ (1996). Learning to classify integral-dimensión stimuli. Psychonomic Bulletin & Review, 3, 222–226. 10.3758/bf03212422 [DOI] [PubMed] [Google Scholar]
- Pelli DG (1997). The VideoToolbox software for visual psychophysics: Transforming numbers into movies. Spatial Vision, 10, 437–442. 10.1163/156856897X00366 [DOI] [PubMed] [Google Scholar]
- Pitt MA, Kim W, Navarro DJ, Myung JI (2006). Global model analysis by parameter space partitioning. Psychological Review, 113, 57–83. 10.1037/0033-295X.113.1.57 [DOI] [PubMed] [Google Scholar]
- Redhead ES, & Pearce JM (1995). Stimulus salience and negative patterning. Quarterly Journal of Experimental Psychology: Comparative and Physiological Psychology, 48B, 67–83. [PubMed] [Google Scholar]
- Rescorla RA, & Wagner AR (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and non-reinforcement In Black AH & Prokasy WF (Eds.), Classical Conditioning II. Current research and theory (pp. 64–99). New York: Appleton-Century-Crofts. [Google Scholar]
- Shanks DR & Darby RJ (1998). Feature- and rule-based generalization in human associative learning. Journal of Experimental Psychology: Animal Behavior Processes, 24, 405–415. http://dx.doi.Org/10.1037/0097-7403.24.4.405 [DOI] [PubMed] [Google Scholar]
- Shepard RN (1987). Toward a universal law of generalization for psychological science. Science, 237, 1317–1323. 10.1126/science.3629243 [DOI] [PubMed] [Google Scholar]
- Shepard RN & Chang JJ (1963). Stimulus generalization in the learning of classifications. Journal of Experimental Psychology, 65, 94–102. 10.1037/h0043732 [DOI] [PubMed] [Google Scholar]
- Shepard RN, Hovland CI, & Jenkins HM (1961). Learning and memorization of classifications. Psychological Monographs: General and Applied, 75, 1–42. 10.1037/h0093825 [DOI] [Google Scholar]
- Smith JD, Ashby FG, Berg ME, Murphy MS, Spiering B, Cook RG, & Grace RC (2011). Pigeons’ categorization may be exclusively nonanalytic. Psychonomic Bulletin & Review, 18, 414–421. 10.3758/s13423-010-0047-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith JD, Beran MJ, Crossley MJ, Boomer JT, & Ashby FG (2010). Implicit and explicit category learning by macaques (Macaca mulatta) and humans (Homo sapiens). Journal of Experimental Psychology. Animal Behavior Processes, 36, 54–65. 10.1037/a0015892 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith JD, Berg ME, Cook RG, Murphy MS, Crossley MJ, Boomer J, Grace RC (2012). Implicit and explicit categorization: A tale of four species. Neuroscience & Biobehavioral Reviews, 36, 2355–2369. https://doi.Org/10.1016/j.neubiorev.2012.09.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith JD, Minda JP, & Washburn DA (2004). Category learning in rhesus monkeys: A study of the Shepard, Hovland, and Jenkins (1961) Tasks. Experimental Psychology: General, 133, 398–414. 10.1037/0096-3445.133.3.398 [DOI] [PubMed] [Google Scholar]
- Soto F, & Wasserman EA (2010). Error-driven learning in visual categorization and object recognition: a common-elements model. Psychological Review, 117, 349–381. 10.1037/a0018695 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Teng Y, Vyazovska O, & Wasserman E (2015). Selective attention and pigeons’ multiple necessary cues discrimination learning. Behavioural Processes, 112, 61–71. 10.1016/j.beproc.2014.08.004 [DOI] [PubMed] [Google Scholar]
- Vyazovska OV, Navarro VM, & Wasserman EA (2016). Stagewise multidimensional visual discrimination by pigeons. Journal of the Experimental Analysis of Behavior, 106, 58–74. 10.1002/jeab.217 [DOI] [PubMed] [Google Scholar]
- Vyazovska OV, Navarro VM, & Wasserman EA (2018). Pigeons deploy selective attention to efficiently learn a stagewise multidimensional visual discrimination task. Journal of Experimental Psychology: Animal Learning and Cognition, 44, 162–167. 10.1037/xan0000168 [DOI] [PubMed] [Google Scholar]
- Vyazovska OV, Teng Y, & Wasserman EA (2014). Attentional tradeoffs in the pigeon. Journal of the Experimental Analysis of Behavior, 101, 337–354. 10.1002/jeab.82 [DOI] [PubMed] [Google Scholar]
- Wasserman EA, Kiedinger RE, Bhatt RS (1988). Conceptual behavior in pigeons: Categories, subcategories, and pseudocategories. Journal of Experimental Psychology: Animal Behavior Processes, 14, 235–246. 10.1037/0097-7403.14.3.235 [DOI] [Google Scholar]
- Zakrzewski AC, Church BA, & Smith JD (2018). The transfer of category knowledge by macaques (Macaca mulatta) and humans (Homo sapiens). Journal of Comparative Psychology, 132, 58–74. 10.1037/com0000095 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.