What Are My Chances? Closing the Gap in Uncertainty Monitoring between Rhesus Monkeys (Macaca mulatta) and Capuchin Monkeys (Cebus apella)

Michael J Beran; Bonnie M Perdue; J David Smith

doi:10.1037/xan0000020

Author manuscript; available in PMC: 2015 Jul 1.

Published in final edited form as: J Exp Psychol Anim Learn Cogn. 2014 Apr 7;40(3):303–316. doi: 10.1037/xan0000020

PMCID: PMC4215522 NIHMSID: NIHMS591273 PMID: 25368870

Abstract

Previous studies have indicated that rhesus monkeys (Macaca mulatta) but not capuchin monkeys (Cebus apella) respond to difficult or ambiguous situations by choosing not to respond or by seeking more information. Here we assessed whether a task with very low chance accuracy could diminish this species difference, presumably indicating that capuchins—compared to macaques—are less risk averse as opposed to less sensitive to signals of uncertainty. Monkeys searched for the largest of six stimuli on a computer screen. Trial difficulty was varied, and monkeys could choose to opt out of any trial. All rhesus monkeys, including some with no prior use of the uncertainty response, selectively avoided the most difficult trials. The majority of capuchins sometimes made uncertainty responses, but at lower rates than rhesus monkeys. Nonetheless, the presence of some adaptive uncertainty responding suggests that capuchins also experience uncertainty and can respond to it, though with less proficiency than macaque monkeys.

Keywords: uncertainty monitoring, metacognition, rhesus monkeys, capuchin monkeys, comparative psychology, psychophysics

A student freezes in dread reading an essay question—she knows she doesn't know that answer. A diner in a restaurant frantically searches memory—he knows he can't place the name of the acquaintance approaching his table. These situations demonstrate humans’ capacity for metacognition—that is, their cognitive awareness of mental states like knowing, uncertainty, and doubt. A large research literature explores this capacity in humans (e.g., Balcomb & Gerken, 2008; Benjamin, Bjork, & Schwartz, 1998; Dunlosky & Bjork, 2008; Flavell, 1979; Koriat & Goldsmith, 1994; Nelson, 1992; Scheck & Nelson, 2005; Schwartz, 2008). An active debate centers on the question of whether humans are unique in their metacognitive abilities, or whether this characteristic is shared with other species (Carruthers, 2008, 2009; Crystal & Foote, 2009; Hampton, 2009; Jozefowiez, Staddon, & Cerutti, 2009; Kornell, 2009, 2013; Le Pelley, 2012; Smith, 2009; Smith, Beran, Couchman, & Coutinho, 2008).

The answer to this question could reveal the earliest evolutionary roots and the waypoints of metacognition's emergence in the primates, and since its inception, animal metacognition research has focused heavily on the performance of primates (Shields, Smith, & Washburn, 1997; Smith, Shields, Schull, & Washburn, 1997; Smith, Shields, Allendoerfer, & Washburn, 1998). This research area is very active now within comparative psychology (e.g., Basile, Hampton, Suomi, & Murray, 2009; Beran, Smith, Redford, & Washburn, 2006; Call, 2010; Castro & Wasserman, 2013; Foote & Crystal, 2007; Fujita, 2009; Hampton, 2001; Iwasaki, Watanabe, & Fujita, 2013; Kornell, Son, & Terrace, 2007; Marsh & MacDonald, 2012a; Paukner, Anderson, & Fujita, 2006; Roberts, Feeney, McMillan, MacPherson, & Musolino, 2009; Smith, Beran, Redford, & Washburn, 2006; Smith, Coutinho, Church, & Beran, 2012; Suda-King, 2008; Sutton & Shettleworth, 2008; Washburn, Gulledge, Beran, & Smith, 2010), and a number of species have been tested. Among primates, important behavioral contrasts have been seen across different species and different tasks.

Great apes

Call and Carpenter (2001) devised an information-seeking task in which subjects sometimes had all information needed to locate food, but sometimes needed more information. Chimpanzees and orangutans (and human children, for that matter) performed in a manner consistent with metacognition. Subjects responded immediately when they had full information, but they invested time and effort into seeking more information when more was needed to locate food. This suggests that subjects knew when they already knew the location of food and also knew when they did not (also see Call, 2010; Marsh & MacDonald, 2012a, 2012b).

Beran, Smith, & Perdue (2013) added to this paradigm a communicative, symbolic component. They tested language-trained chimpanzees by placing food items in an opaque container. Sometimes the chimpanzees saw the food item placed, other times not. They received the food if they named it correctly on their lexigram symbol keyboard. Chimpanzees were more likely to visit the food container first—before choosing a lexigram—on trials in which they did not know the container's contents. They were more likely to immediately name the item on the keyboard—without looking into the container—on trials in which they knew the container's contents. Thus, chimpanzees showed efficient information-seeking behavior that suggested they knew whether they knew the item's identity when it was time to name it.

Because apes are the closest living relatives of humans, it is not entirely surprising that they have fared well on metacognitive tasks. All the great-ape species (chimpanzees, orangutans, gorillas, and bonobos) have now shown success in tests of metacognition (Beran et al., 2013; Call, 2010; Call & Carpenter, 2001; Marsh & MacDonald, 2012a, 2012b; Suda-King, 2008), suggesting that this ability is shared among the taxa.

Macaques

Rhesus monkeys have succeeded in information-seeking tasks of the type used by Call and others (Hampton, Zivin, & Murray, 2004). They have succeeded in computer-based information-seeking tasks devised by Beran and Smith (2011). They have shown evidence of metamemory—choosing adaptive behaviors based on the strength of internal memory representations (e.g., Hampton, 2001). They have shown evidence of uncertainty monitoring during multitasking (Smith, Redford, Beran, & Washburn, 2010) and during tests of Harlow's learning set (Washburn, Smith, & Shields, 2006).

Finally, rhesus monkeys have shown evidence of uncertainty monitoring in psychophysical tasks. In these tasks, on which the present tasks are based, subjects are presented with a psychophysical discrimination. For example, if pixel density is the relevant dimension, the stimuli would range from sparsely pixelated to densely pixelated, including many steps between. When a stimulus is presented, the subject would be asked to make a dichotomous sparse-or-dense classification of the stimulus. Stimuli near the sparse-dense breakpoint of the discrimination will be most difficult and cause errors. However, subjects in these tasks are also given an uncertainty response (UR) with which they can opt out of any trial they choose. This response need not bring any reward and can be designed to operate only to end a given trial and bring the next randomly chosen trial instead. In this way, subjects may be able to cope with difficulty and even report uncertainty behaviorally. If subjects can monitor the signal of their own uncertainty as they face a difficult, near-breakpoint trial, then they can decline the trial and avoid the risk of error. Rhesus monkeys use the UR appropriately and adaptively, and even isomorphically with the way that consciously metacognitive humans use it (e.g., Smith et al., 2006, 2007). Thus, for many empirical reasons, there is a growing theoretical consensus that macaques show a close analog to humans’ uncertainty awareness and a form of metacognitive monitoring. For example, Roberts et al. (2009, p. 130) concluded that “substantial evidence from several laboratories converges on the conclusion that rhesus monkeys show metacognition in experiments that require behavioral responses to cues that act as feeling of knowing and memory confidence judgments.”

Of course, the role of associative learning in these paradigms continues to be of theoretical interest (Hampton, 2009; Smith, 2009; Smith et al., 2008; Smith et al., 2009; Smith, Beran, & Couchman, 2012), and there are ongoing efforts to produce formal models that reproduce macaques’ uncertainty performances (e.g., Jozefowiez et al., 2009; Le Pelley, 2012). These models were discussed extensively in a recent target article (Smith, Couchman, & Beran, in press a) and commentaries (Basile & Hampton, in press; Carruthers, in press; Le Pelley, in press; Smith, Couchman, & Beran, in press b).

Smith et al. (in press a, also Smith et al., 2008) showed that associative models cannot capture certain crucial phenomena in this area. For example, Smith et al. (2006) gave a macaque monkey a density discrimination task along with an uncertainty response that allowed the monkey to avoid making a classification of density. Unlike in studies that provide trial-by-trial feedback, in this task four trials were completed before any feedback was given, and then such feedback was rearranged to group rewards and timeouts rather than give feedback in an order that matched response order. The result was that the macaque's subjective uncertainty region displaced away from the discrimination's true breakpoint (Smith et al., 2006). Although Le Pelley (2012) reported that an associative model also could produce such a displacement, that outcome occurred only when uncertain responses were the dominant responses made by the simulated subject, which does not match the performance pattern of the actual monkey (see Smith et al., in press). Associative models are locked to the task's associative structure. Subjective uncertainty is not. Smith et al. (2008) discussed the problematic instability of associative models of uncertainty responding. These models can produce qualitatively different performance patterns based on just 2 chance occurrences early in a 6,000-trial simulation run. Smith et al. pointed out that Le Pelley's simulations of Hampton's (2001) metamemory data embodied a mistake about the reinforcement structure of that task, rendering those published simulations theoretically unhelpful. (In personal communications, Le Pelley acknowledged this mistake, while suggesting that the model might still apply once the mistake was remedied.)

Smith and colleagues also showed that associative models could not apply to other demonstrations of macaque metamemory (e.g., Smith et al., 1998) without being granted up to 9 freely varying fitting parameters. With this many free parameters, a model could become an empty mathematical abstraction that has little bearing on anything psychological. Even with the 5 free parameters in Le Pelley's (2012) model, this is already a serious concern. Smith et al. also pointed out that associative models cannot explain Smith et al.'s (2012) demonstration that uncertainty responses are psychologically different from the primary perceptual responses in uncertainty tasks because they are more heavily dependent on working-memory resources. Le Pelley (in press) was not able to address this important failure of associative models that was discussed in depth in Smith et al. (in press a).

Basile and Hampton (2013) concluded that Smith et al. (in press a) persuasively identified the problems with current associative models. They agreed that “the associative models proposed by Le Pelley et al. and Jozefowiez et al. do not currently explain the breadth of nonhuman metacognitive performance.” Carruthers (in press) endorsed Smith et al.'s (in press a) evaluations of associationist explanations in this area. Pointedly, he said: “an obsessive focus on associationist accounts of animal behavior impedes progress in comparative psychology and obstructs attempts to understand animal precursors and homologies of components of human cognition.” Regarding apes and macaques, therefore, it is largely agreed that they have an analog to humans’ uncertainty system and that this system may be a form of metacognitive monitoring. Obviously, this does not imply that their uncertainty system is identical in every conscious and self-reflective way to that in humans (Carruthers, in press; Smith et al., in press b). In fact, understanding these likely differences will be an important task for the field of comparative metacognition, for such differences could illuminate the emergence of reflective minds during primate evolution.

Capuchin monkeys

This conclusion does not yet extend to primates broadly. Capuchin monkeys (Cebus apella), a New World primate species, represent another primate lineage. Basile et al. (2009) adapted Call and Carpenter's (2001) food-concealment paradigm for capuchins. Unlike rhesus monkeys, their capuchin monkeys showed minimal evidence of adaptive information-seeking behavior. Paukner et al. (2006) tested capuchins in a similar paradigm. The monkeys searched bent food tubes uselessly (because the bait could not possibly be seen in a visual search). They searched clear tubes unnecessarily (because the baited tube was obvious). Thus, capuchins’ behavior contrasted with that of macaques, apes, and humans. Fujita (2009), adapting Hampton's (2001) metamemory paradigm, provided a third demonstration of capuchins’ tenuous response to internal psychological signals of uncertainty and doubt. Likewise, in the computer-based information-seeking tasks of Beran and Smith (2011), capuchins did not show strong evidence that they would seek only the necessary information needed for correct performance, whereas rhesus monkeys did.

Finally, Beran, Smith, Coutinho, Couchman, and Boomer (2009) tested capuchins’ uncertainty monitoring using the sparse-dense discrimination task already described. Capuchin monkeys, again very unlike rhesus monkeys, essentially never responded uncertain. They did not do so even when the penalty for an incorrect response was raised from a 20 s timeout to a 90 s timeout, so that capuchins potentially sacrificed the opportunity for 30 trials and 30 food rewards for every discrimination error they made. Thus, Beran et al. provided another converging line of evidence that the uncertainty-monitoring capacities of capuchins and macaques are different.

These differences between capuchin monkeys and rhesus monkeys across a variety of tasks could be important for revealing a fundamental discontinuity in the phylogenetic distribution of metacognition. They raise the possibility that metacognition may have emerged selectively as a characteristic of the Catarrhine primates. It is exciting that we might place an evolutionary pushpin onto the phylogenetic map identifying where metacognition emerged (i.e., approximately 30 million years ago before Old World monkeys split from apes and humans). However, potential extraneous factors should be carefully considered before pressing this assertion. That is, one must consider whether there are methodological aspects of these tasks that might produce species differences without there being a species discontinuity in the domain of cognitive awareness.

In the present article, we examined one important methodological facet of uncertainty tasks. The standard uncertainty task that we have used (e.g., Smith et al., 1997; Beran et al., 2009) provides a subject with two response options in addition to the UR. Subjects have a 50% chance of being correct even given random responding. It could be that macaques and capuchins have different set points for risk-taking and risk-aversion behaviors. A 50% chance of reward might be an acceptable level of “risk” for a capuchin monkey, but not for a rhesus monkey. In that case, capuchins would naturally not make URs in these tasks, but macaques would. In short, capuchin monkeys may not use responses that allow them to avoid primary classifications on standard tasks because they are more tolerant of 50% reward, and not because they lack a metacognitive capacity.

To examine this possibility, we created new psychophysical discriminations in which the probability of reward based on random responding was less than 50%. We did this by having monkeys seek a correct response from among six options, not from among two options as in the standard tasks. If the previously observed species differences represent true differences in uncertainty monitoring, then we would expect those differences to remain intact even in our low-probability paradigm. Alternatively, if the observed species differences reflect different tolerances for risky, exploratory behavior, then we would expect the species differences to be less pronounced. The latter finding would suggest that the species differences in uncertainty monitoring are more quantitative than qualitative in nature, and that one needs to consider carefully the kind of tasks presented across species that can assess uncertainty monitoring comparably and equitably.

The paradigm used here was an extension of a basic uncertainty paradigm, the perceptual psychophysical paradigm that originated the field of animal metacognition (e.g., Smith et al., 1995; Smith et al., 1997). This paradigm, like every human- and animal-metacognition paradigm, is imperfect (see Hampton, 2009; Smith et al., 2012). Our purpose was to foster capuchins’ uncertainty responding in a task in which they have demonstrably failed previously to respond uncertain, and to ask whether their uncertainty-response capacity has sometimes been underestimated. Our purpose was not to definitively prove capuchin metacognition by choosing the most sophisticated possible paradigm (a choice on which there would not even be agreement across researchers).

Experiment 1

Methods

Participants

We tested six adult rhesus macaques (Macaca mulatta; all males, ages 10 to 26 years) and eight adult capuchin monkeys (Cebus apella; 5 males and 3 females, ages 5 to 23 years). All monkeys had previously been trained to use a joystick with their hands to control a cursor on a computer screen (see Evans, Beran, Chan, Klein, & Menzel, 2008; Richardson, Washburn, Hopkins, Savage-Rumbaugh, & Rumbaugh, 1990). They all had participated in numerous previous computerized experiments (e.g., Beran, 2007, 2008; Beran, Evans, Klein, & Einstein, 2012; Beran & Parrish, 2012, 2013), including participation by most of these animals in previous related tests (e.g., Beran & Smith, 2011; Beran et al., 2009; Smith et al., 2006; Smith et al., 2010). The monkeys had continuous access to water and worked for fruit-flavored food pellets. They also received a daily diet of fruits and vegetables independent of the amount of work they completed on the task, and thus they were not food deprived for the purposes of this experiment. The experiments were conducted with approval of the Georgia State University Institutional Animal Care and Use Committee and followed all federal guidelines.

Apparatus

The monkeys were tested using the Language Research Center's Computerized Test System, which consists of a personal computer, digital joystick, color monitor, and pellet dispenser (Evans et al., 2008; Richardson et al., 1990). Monkeys manipulated the joystick with their hands to produce isomorphic movements of a small cursor on the computer screen. Contacting stimuli with the cursor sometimes resulted in the delivery of 45-mg (capuchins) or 94-mg (rhesus) banana-flavored chow pellets (Bio-Serv, Frenchtown, NJ) via a pellet dispenser that was connected to the computer through a digital I/O board (PDISO8A; Keithley Instruments, Cleveland, OH). The task program was written in Visual Basic 6.0.

Design and procedure

Each computerized session consisted of a training phase and a test phase, and monkeys completed as many trials as they chose during daily sessions that lasted from 2 to 6 hours. During the training phase of each session, monkeys initially saw two squares on the computer screen, presented in one of six designated locations that formed a rough semicircle on the screen from bottom left upward and then back down to the bottom right part of the screen (Figure 1). These squares were presented in widths and heights measured in twips in Visual Basic. Each twip is 1/567 cm. Monkeys moved a cursor that was centered on the screen into contact with one of those squares. The cursor moved at approximately 5 cm per second when the joystick was fully displaced in a direction. If the selected square was the largest, the monkey heard a melodic chime (approximately 1.5 seconds in duration) and received a single food pellet. If another square was selected, a buzz tone sounded (approximately 1.5 seconds in duration), no food was given, and a 30 second time-out period occurred during which the screen remained blank. A new trial was then presented after a 1 s inter-trial interval. The location of the largest square on the screen was randomly determined on each trial from one of the six possible locations, with no restrictions on how many trials in a row this might occur.

Figure 1. A size-discrimination trial illustrated. The squares were presented in an orange color, and the largest had to be selected for food reward. Here, the largest square is to the bottom left. The ? is the uncertainty response.

After reaching a criterion of 17 trials correct in the most recent 24 trials, three squares appeared onscreen instead of two, and as criterion was met again that number increased, up to a maximum of six squares onscreen. The position of the largest square in the semicircle of squares was determined randomly on each trial. The size of the largest square on each trial was chosen randomly to be one of 13 sizes, ranging from 1000 twips per side to 2200 twips per side, in 100-twip increments (1000, 1100, ..., 2200).
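The training criterion described above can be sketched as a sliding-window check. This is only an illustration in Python, not the original Visual Basic task program. The function name and the choice to clear the window after each increase are assumptions, as is the choice to let the criterion be met before a full 24 trials have accumulated.

```python
from collections import deque

WINDOW = 24       # most recent trials considered (from the text)
REQUIRED = 17     # correct trials needed within that window (from the text)
MAX_SQUARES = 6   # the number of squares stops increasing at six (from the text)


def squares_after_training(outcomes, start_squares=2):
    """Return how many squares are on screen after a sequence of trial outcomes.

    `outcomes` is an iterable of booleans (True = largest square chosen).
    Assumptions: the window is cleared whenever a square is added, and the
    criterion can be met before 24 trials have been completed.
    """
    n_squares = start_squares
    recent = deque(maxlen=WINDOW)  # automatically drops trials older than 24
    for correct in outcomes:
        recent.append(correct)
        if n_squares < MAX_SQUARES and sum(recent) >= REQUIRED:
            n_squares += 1
            recent.clear()
    return n_squares
```

For example, 17 consecutive correct trials would move a monkey from two squares to three under these assumptions, and the count never exceeds six.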

There were two methods for choosing the sizes of the foil squares that accompanied the target, largest square. In the All Foils Different Condition, the next closest square to the target was from 75 twips (Level 3) to 550 twips (Level 22) less wide and less high compared to the target. All remaining squares presented on trials in this condition were, successively, 25 twips less wide and less high than the previous foil square. So, for example, if the target was 2000 twips per side, and the trial was a Level 5 trial, the foil squares had heights/widths of 1875 (2000 − 5 × 25), 1850, 1825, 1800, and 1775 twips. All these squares were randomly assigned their screen position.

In the All Foils Same Condition, all foil squares were the same size on a given trial, from 25 twips (Level 1) to 500 twips (Level 20) less wide and less high than the target. Thus, in both conditions, difficulty varied across trials as a function of the level randomly chosen for the first foil square. In addition, trials were generally more difficult in the All Foils Same Condition, because all foil options were the same size and no individual foil stimulus could be more readily discounted as the correct choice than any other foil stimulus. In the All Foils Different condition, the variability in size across foils made some of them easier to discount as not being the largest.
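The two foil-selection rules can be made concrete with a short sketch. It is written in Python for illustration and is not the original task code. The function name and the condition labels "different" and "same" are hypothetical, while the 25-twip step and the level ranges come from the text.

```python
TWIPS_PER_LEVEL = 25  # each difficulty level corresponds to 25 twips (from the text)
N_FOILS = 5           # six squares on a test trial: one target plus five foils


def foil_sizes(target, level, condition, n_foils=N_FOILS):
    """Return the side lengths (in twips) of the foil squares for one trial.

    `condition` is "different" (All Foils Different) or "same" (All Foils Same);
    these labels are illustrative, not taken from the original program.
    """
    first_foil = target - level * TWIPS_PER_LEVEL
    if condition == "different":
        if not 3 <= level <= 22:
            raise ValueError("All Foils Different uses Levels 3 to 22")
        # Each successive foil is 25 twips smaller than the previous one.
        return [first_foil - i * TWIPS_PER_LEVEL for i in range(n_foils)]
    if condition == "same":
        if not 1 <= level <= 20:
            raise ValueError("All Foils Same uses Levels 1 to 20")
        # Every foil has the same size.
        return [first_foil] * n_foils
    raise ValueError("condition must be 'different' or 'same'")


# Reproduces the worked example in the text: target 2000 twips, Level 5.
print(foil_sizes(2000, 5, "different"))  # [1875, 1850, 1825, 1800, 1775]
```

The same target and level in the All Foils Same condition would give five foils of 1875 twips each, which is why no single foil can be ruled out more easily than another in that condition.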

Throughout testing, the UR was present at the bottom center of the screen, and could always be accessed as a response option by the monkeys. Its selection led to the clearing of the screen and the presentation of the next trial. No food pellet, timeout, or auditory feedback was given. The next trial was not guaranteed to be easier or harder than the present trial, and so the UR operated solely as a means of not making the primary discrimination to a given trial. During the test phase, approximately 15% of trials involved mandatory selection of that stimulus, so that the monkey would experience its effects. These forced-UR trials were randomly selected and could occur for any Level, so they were not specifically presented on trials for which they were optimal to use. During these trials, the cursor would only move downward on the screen and into contact with the question mark. Any other directional movement of the joystick had no effect on the cursor.

Monkeys completed variable numbers of test sessions weekly, depending on their participation in other (unrelated) experimental tasks. During sessions of this experiment, they completed as many trials as they chose in 4-hour test sessions, and they could also disengage from the task and attend to other things in the home cage area, such as enrichment devices or other animals within view. Thus, monkeys completed variable numbers of trials, but all monkeys continued working on the task until a sufficiently large number of trials had been collected for analyses across the range of trial difficulty levels that we had established.

Results

Data analyses were restricted to only the testing phase data from the sessions in which six squares were present on each trial. Trials in which the UR was mandatory were excluded from analysis. Table 1 gives trial counts for the analyzed test data from each monkey.

Table 1.

Correlations between the percentage of trials correct and the percentage of trials in which the UR was selected.

	Experiment 1				Experiment 2
Rhesus Monkeys	All Foils Different		All Foils Same		UR Available
	N	R	N	R	N	R
Chewie	7294	−.96*	7137	−.958*	4730	−.953*
Gale	5825	−.984*	5954	−.946*	5801	−.916*
Han	5811	−.985*	5606	−.952*	7562	−.877*
Hank	2428	−.837*	2387	−.86*	3068	−.942*
Lou	2689	−.865*	2695	−.949*	3405	−.866*
Murph	2538	−.74*	2639	−.829*	2773	−.706*

Capuchin Monkeys	All Foils Different		All Foils Same		UR Available
	N	R	N	R	N	R
Drella	5971	−.924*	6243	−.948*	Not tested	Not tested
Gabe	3944	.071	3938	−.007	Not tested	Not tested
Griffin	8956	−.991*	8959	−.988*	6155	−.936*
Lily	3958	−.899*	3958	−.935*	Not tested	Not tested
Logan	4836	−.936*	4810	−.968*	3987	−.972*
Nkima	3792	−.928*	3913	−.944*	4417	−.915*
Widget	3327	−.90*	3415	−.978*	3476	−.969*
Wren	3888	−.218	3918	.129	Not tested	Not tested

Note. Asterisks indicate p < .01.

Figure 2 presents the overall performance patterns for each species as a function of the 18 shared levels (levels 3-20) that were presented in the two test conditions. Group-level analyses were conducted using repeated measures analysis of variance, with Level and Condition as within-subject factors and species as a between-subjects factor. First, we examined performance when monkeys chose to attempt the discrimination trial by choosing one of the squares rather than the UR.

Figure 2. Performance of each species as a function of trial level and condition. Error bars show 95% confidence intervals.

The ANOVA indicated that there was a main effect of Level, F(17, 204) = 196.29, p < .001, η_p² = .94, 95% CIs = .92, .95. Performance in the primary discrimination task improved as Level increased and the presented trials were easier. There was a main effect of Condition, F(1, 12) = 300.93, p < .001, η_p² = .96, 95% CIs = .88, .98. Performance was better in the All Foils Different Condition in which more of the foils were more easily rejected as possible answers. There was no main effect of Species, F(1, 12) = .63, p = .44, η_p² = .05, 95% CIs = <.001, .34.
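The reported effect sizes can be checked against the F ratios using the standard identity η_p² = (F × df_effect) / (F × df_effect + df_error). The Python sketch below is offered only as a reader's consistency check. The article does not state how its effect sizes were computed, and the function name is hypothetical.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared recovered from an F ratio and its degrees of freedom.

    Uses the identity eta_p^2 = F * df_effect / (F * df_effect + df_error),
    which follows from eta_p^2 = SS_effect / (SS_effect + SS_error).
    """
    return (f_value * df_effect) / (f_value * df_effect + df_error)


# Main effects on discrimination performance, as reported in the text.
for label, f_value, df1, df2 in [
    ("Level", 196.29, 17, 204),
    ("Condition", 300.93, 1, 12),
    ("Species", 0.63, 1, 12),
]:
    print(label, round(partial_eta_squared(f_value, df1, df2), 2))
```

Rounded to two decimals, these three values agree with the .94, .96, and .05 given in the text.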

There was a significant interaction of Level and Condition, F(17, 204) = 19.50, p < .001, η_p² = .62, 95% CIs = .51, .65. The effect of level on performance was stronger in the All Foils Same condition. There was a significant interaction of Condition and Species, F(1, 12) = 5.99, p = .031, η_p² = .33, 95% CIs = <.001, .60. Capuchin monkeys’ performance differed less between the two conditions than did the macaques’ performance. There was not a statistically significant interaction between Level and Species, F(17, 204) = 1.37, p = .15, η_p² = .10, 95% CIs = <.001, .11, and there was no three-way interaction, F(17, 204) = 1.16, p = .30, η_p² = .09, 95% CIs = <.001, .09.

Next, we examined monkeys’ use of the UR to decline trials. The ANOVA indicated that there was a main effect of Level, F(17, 204) = 19.16, p < .001, η_p² = .62, 95% CIs = .50, .65. The UR was used more for lower (objectively more difficult) trial levels. There was a main effect of species, F(1, 12) = 6.95, p = .022, η_p² = .37, 95% CIs = .00, .62. Figure 2 clearly shows that capuchins used the UR far less than the macaques did. There was a main effect of condition, F(1, 12) = 22.19, p = .001, η_p² = .65, 95% CIs = .21, .79. The macaques, at least, used the UR more in the All Foils Same condition. All two-way interactions were significant: Level and Condition, F(17, 204) = 4.39, p < .001, η_p² = .27, 95% CIs = .11, .31; Condition and Species, F(1, 12) = 10.94, p = .006, η_p² = .48, 95% CIs = .05, .69; Level and Species, F(17, 204) = 10.43, p < .001, η_p² = .47, 95% CIs = .32, .51. There was also a significant three-way interaction, F(17, 204) = 1.77, p = .034, η_p² = .13, 95% CIs = <.001, .15.

Thus, rhesus monkeys used the UR much more often than capuchin monkeys, and they did so across a wider range of trial levels than did the capuchin monkeys. However, capuchin monkeys apparently did use the UR slightly, and used it when it was most appropriate. Figure 2's aggregate graphs could not convey whether many capuchins made URs slightly, or whether a few capuchins made URs generously. Therefore, we also examined the performance of individual monkeys in more detail to better understand the extent and nature of the similarities and differences between individuals and between species.

Figures 3 and 4, respectively, present the individual data for each rhesus monkey and each capuchin monkey for all trial levels. Four of six rhesus monkeys showed a robust use of the UR for the hardest trial levels. All rhesus monkeys showed a statistically significant negative correlation across trial difficulty levels between the percentage of trials that were correct when they chose one of the squares and the percentage of trials on which they chose the UR (Table 1). One capuchin monkey (Griffin) showed a robust use of the UR at difficult trial levels. Six of eight capuchin monkeys (including Griffin) showed a statistically significant negative correlation between URs and correct percentages across trial levels (Table 1). It is important to say that this correlation overlooks the absolute level of URs, and that significant and even strong correlations could result from this analysis no matter how small the actual level of URs was. Nonetheless, many of the individual animals in both species used the UR appropriately for difficult trial levels when they chose it, although qualitatively one can see in the figures that capuchin monkeys made far fewer URs than did rhesus monkeys.
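The per-monkey correlation in Table 1 relates two quantities computed at each trial level: the percentage correct on trials the monkey attempted, and the percentage of trials on which it chose the UR. A minimal Python sketch of that computation follows. The trial format, the outcome labels, and the function names are hypothetical, and Pearson's r is assumed because the article does not name the coefficient.

```python
from collections import defaultdict
from math import sqrt


def level_summary(trials):
    """Summarise (level, outcome) pairs, where outcome is 'correct', 'error', or 'ur'.

    Returns one (level, percent_correct, percent_ur) tuple per level.
    Percent correct is based on attempted trials only; percent UR on all trials.
    """
    counts = defaultdict(lambda: {"correct": 0, "error": 0, "ur": 0})
    for level, outcome in trials:
        counts[level][outcome] += 1
    rows = []
    for level in sorted(counts):
        c = counts[level]
        attempted = c["correct"] + c["error"]
        if attempted == 0:
            continue  # no discrimination attempts, so accuracy is undefined
        total = attempted + c["ur"]
        rows.append((level, 100 * c["correct"] / attempted, 100 * c["ur"] / total))
    return rows


def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def ur_accuracy_correlation(trials):
    """Correlate percent correct with percent UR across trial levels."""
    rows = level_summary(trials)
    return pearson_r([r[1] for r in rows], [r[2] for r in rows])
```

As the text cautions, a strongly negative value from this kind of analysis says nothing about how often the UR was used overall. A monkey that opts out on 2% of the hardest trials and 0% of the easiest can produce the same correlation as one that opts out on 60% and 0%.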

Figure 3. Data from each rhesus monkey. The legend for the lines is shown in the top left graph.

Figure 4. Data from each capuchin monkey, presented as in Figure 3.

Discussion

The results of the present experiment are much in line with previous reports assessing the uncertainty-monitoring capacities of rhesus monkeys and capuchin monkeys. The rhesus monkeys in this experiment often used the UR and did so especially on those trials in which they were at greatest risk of making an error. The manipulation of lowering the chance probability of responding to the correct stimulus led to all rhesus monkeys showing typical levels of uncertainty responding, or even use of the UR in some cases at very high levels. Some rhesus monkeys that have failed to use the UR, or used it only minimally, in past experiments used it often here, and highly proficiently. For example, Lou is often “reluctant” to use the UR relative to other rhesus monkeys in our past experiments (e.g., Smith et al., 2010), and yet here he used it often.

Capuchin monkeys also had shown little or no use of the UR in past psychophysical discrimination tasks, even though they had had no trouble including a third response within a task if it was another perceptual response like Sparse and Dense. Here we saw once again a very strong species difference in the overall level of adaptive uncertainty responding. However, we saw some capuchin monkeys producing performance patterns that were qualitatively more like those of rhesus monkeys. Some capuchins did show a UR pattern in which they used the UR most often on the most difficult trials. Perhaps the present low-probability paradigm did begin to foster more URs in these monkeys. These data, along with the success of all macaques in showing that pattern, indicate that tasks in which chance levels of responding are as high as 50% may not elicit cognitive monitoring and adaptive response patterns as strongly as do low-probability paradigms.

One possibility, then, for the species difference seen between capuchins and rhesus monkeys, is that capuchin monkeys are less risk sensitive than rhesus monkeys. For capuchins, the chance for food reward may outweigh the aversion of timeouts. Beran et al. (2009) did take this possibility into account, at least partially, when they greatly increased the timeout period, and that manipulation did foster some use of the UR, although only from one monkey. However, that manipulation did not increase the chances of a timeout, but rather its duration, whereas here a change to the chance of suffering a timeout (at least for the hardest levels) did seem to evoke some URs from more monkeys in the present experiment.

Experiment 2

To assess whether capuchin monkeys might increase their URs under some circumstances, we conducted a second experiment. We focused on the macaques and on the four capuchin monkeys (Griffin, Logan, Nkima, and Widget) that had made URs more generously in Experiment 1, to allow the latter species to make the strongest uncertainty-monitoring statement possible. We restricted testing to the All Foils Same condition only, creating an environment of maximum difficulty that might maximally foster uncertainty responding. We manipulated the availability of the UR across trials, sometimes allowing URs and sometimes requiring animals to complete discrimination trials. This created an environment of actually experienced errors and timeouts that might foster URs so as to avert additional errors and timeouts. This last manipulation also allowed us to differentiate the performance of animals on forced-discrimination trials from that on chosen-discrimination trials. If the latter performance levels were higher, it might signal that animals were choosing to accept trials strategically in a way that improved their overall performance in the primary task itself.