Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Dec 1.
Published in final edited form as: Dev Psychol. 2021 Dec;57(12):2150–2164. doi: 10.1037/dev0001253

Testimony bias lingers across development under uncertainty

Rista C Plate 1, Kristin Shutts 2,3, Aaron Cochrane 4, C Shawn Green 2, Seth D Pollak 2,3
PMCID: PMC9036618  NIHMSID: NIHMS1795727  PMID: 34928665

Abstract

Children have a powerful ability to track probabilistic information, but there are also situations in which young learners simply follow what another person says or does at the cost of obtaining rewards. This latter phenomenon, sometimes termed bias to trust in testimony, has primarily been studied in children preschool-age and younger, presumably because reasoning capacities improve with age. Less attention has been paid to situations in which testimony bias lingers—one possibility is that children revert to a testimony bias under conditions of uncertainty. Here, participants (4–9-years-old) searched for rewards and received testimony that varied in reliability. We find support for testimony bias beyond preschool-age, particularly for uncertain testimony. Children were sensitive to trial-by-trial uncertainty (Experiment 1; N=102; 59M, 43F; race and/or ethnicity reported: 9 Hispanic, 93 not Hispanic; 6 Black or African American, 7 Asian American, 8 multiracial, 77 White, 4 “other”) and uncertainty defined as a one-time, unexpected change in the testimony (Experiment 3; N=129; 68M, 61F; 12 Hispanic, 117 not Hispanic; 10 Black or African American, 4 Asian American, 9 multiracial, 103 White, 3 “other”). However, the impact of the testimony bias decreased with age. These effects were specific to the testimony coming from another person as opposed to resulting from a computer glitch (Experiment 2, N=89; 52M, 37F; 5 Hispanic, 80 not Hispanic; 1 Black or African American, Asian American, 15 multiracial, 66 White, 4 “other”). Taken together, these experiments provide evidence of a disproportionate influence of testimony, even in children with more advanced reasoning skills.

Keywords: testimony, uncertainty, social learning, development


Children are powerful learners. Whereas adults can make inferences and predictions based upon coherent and structured knowledge about the world that is formulated over time, early learners must begin formulating their knowledge of the world based upon uncertain information. But what patterns of evidence do children rely upon to learn most effectively about the world? Children are not solitary learners—they attend to and follow other people, allowing them to acquire information faster than if they had to discover the same information independently (Buchsbaum et al., 2011; Harris, 2012; Harris & Koenig, 2006; Jaswal & Kondrad, 2016; Sobel & Kushnir, 2013). However, over-reliance on others can lead children to discount or ignore their own observations or hypotheses, a phenomenon termed “testimony bias” (Jaswal, 2010). It appears to be the case both that early learners have a robust capacity to accurately track and structure statistical information from their environments, but also that children will follow what other people say and do, even when input from others conflicts with their own experience and leads to worse outcomes. The testimony bias has been largely examined in children preschool-age and younger; here, we examined this phenomenon in older children who have had more experience with the world and more developed knowledge structures. Our goal was to better understand the conditions under which input from others can enhance or interfere with children’s learning.

Research on the testimony bias has primarily focused on infants through preschool-age children. In a classic example (Jaswal, 2010), 2- and 3-year-old children were tasked with locating a ball after observing it being dropped into an unfamiliar apparatus that would allow the ball to end in multiple locations. Children overwhelmingly chose the location suggested by an adult despite seeing the ball end up in another location; when asked the location of the ball in the absence of input from an adult, children named the correct location. These types of studies reveal that attending to information from another person may lead children to discount their own sensory experience, pre-empt children’s exploration of alternative hypotheses, or undermine other aspects of learning (Bonawitz et al., 2011; Gweon et al., 2014; Schulz & Bonawitz, 2007). Less is known about how development changes the balance of children’s individual learning versus learning from others.

With increasing experience over development, children become more discerning in their use of social information. By four to five years of age, children favor information provided by those who appear accurate, reliable, knowledgeable, and intelligent, as well as those who have access to relevant information (e.g., Birch et al., 2020; Buchsbaum et al., 2011; Corriveau et al., 2009; Einav & Robinson, 2011; Koenig & Harris, 2005; 2007; Kushnir et al., 2008; Pasquini et al., 2007). Additionally, when children of this age have the opportunity to evaluate another person’s reliability against their own observations, children reduce their trust in that person as an informant (Bridgers et al., 2016; Ronfard & Lane, 2018; Ronfard et al., 2017). Therefore, there is strong and consistent data supporting the view that part of what makes children good learners is that they become active in selecting and weighting social and experiential evidence in acquiring knowledge about the world (Hermes et al., 2018; Hermes et al., 2019; Shafto et al., 2012; Sobel et al., 2010; Sobel & Kushnir, 2013).

Less attention has been paid to situations in which older children—who typically demonstrate more rational reasoning than younger children—might revert to the reliance on social information evidenced by younger children. One well-known condition that affects both children and adults is when the testimony comes from a group (Asch, 1951; Haun & Tomasello, 2011). When faced with group testimony that runs counter to their own observations, individuals tend to defer, even if it means selecting a non-optimal or incorrect solution. Another more general condition in which children appear to have difficulty choosing between sources of evidence is when there is a high degree of uncertainty. For example, preschool-age children will abandon their chosen label for an ambiguous object (e.g., a pen morphed with a toothbrush) in favor of a label provided by an informant (Li & Yow, 2018). Moreover, when preschool-age children were asked to guess what color collar would appear with different dogs, they were able to use both the base rates of their own experience as well as the suggestions of an adult witness who was reliable; but when base rates and testimony conflicted, children discounted their own probabilistic reasoning and followed the informant’s recommendation (Gualtieri et al., 2019). Of note in Gualtieri and colleagues’ study is that when the informant tended to be inaccurate, children reverted to reliance on their own observations. That children show selectivity and balancing of information raises questions about developmental changes in children’s thinking.

Prior studies suggest that uncertain outcomes lead preschool-age children to revert to a testimony bias. Here, we expanded previous work by investigating whether uncertainty in social input (in addition to the outcome) influences reliance on testimony; we also examined the stability of this effect in preschool through school-aged children. To do so, we adapted a probabilistic learning task (Plate et al., 2018), in which children successfully learned the locations of rewards and maximized reward receipt from probabilistic information. In the present research, participants similarly searched differentially rewarded locations, but prior to making a selection, they received nonverbal testimony (i.e., a pointing hand) regarding the location of the reward. Although much research on selective trust has focused on verbal testimony, most commonly in the context of word learning (see Sobel & Finiasz, 2020, for a review and meta-analysis), children often experience nonverbal forms of testimony. For example, technology provides increasing opportunities for people to express advice, ideas, and opinions nonverbally via social media and online learning platforms (e.g., Antheunis et al., 2013; Freberg et al., 2011; Kaplan & Haenlein, 2010). This testimony is often signaled through a visual marker (e.g., a thumbs up) and can be provided by individuals or groups familiar or unfamiliar to the learner. Additionally, children encounter learning materials with visual markers (e.g., an image of a child who proposes a “strategy” for solving a problem in a textbook, Riggs et al., 2015; 2017) intended to convey importance or advice (Magner et al., 2013; Rey, 2012).

In Experiment 1, we varied the degree of testimony certainty by manipulating the likelihood that the testimony suggested the correct location of a reward. This manipulation allowed us to examine the degree to which participants relied on the testimony, as well as how the reliability of testimony influenced the rewards children received and the strategies children used to obtain those rewards. In Experiment 2, we asked whether a nonsocial cue would influence choices similarly to testimony. In Experiment 3, we introduced uncertainty as a one-time, unexpected change in the reliability of the testimony to explore whether bias to trust in testimony is further influenced by the type of uncertainty (i.e., persistent versus time-limited). Across experiments, we tested two hypotheses: (1) that under conditions of high uncertainty, children would show testimony bias (i.e., over-rely on the testimony at the cost of rewards), and (2) that even though children as young as 4 years of age can locate rewards in this task on their own using probabilistic reasoning, high uncertainty would elicit the testimony bias even as children age.

Experiment 1

Experiment 1 included three between-subject conditions that varied in the certainty with which a confederate’s testimony indicated a rewarded location. The conditions included: 1) reliable: the testimony frequently indicated the location that would be rewarded on each given trial, 2) unreliable: the testimony infrequently indicated the rewarded location, and 3) mixed reliability: the testimony indicated the rewarded location on half of the trials. We first measured overall agreement with the testimony as well as whether the probability of choosing the location suggested by testimony changed across trials. Then, we examined whether condition impacted participants’ ability to obtain rewards. Finally, we used a modeling approach to classify the types of strategies children employed, and then we calculated the weight attributed to these strategies.

We designed both the reliable and unreliable scenarios to present little uncertainty: either the testimony was a good indicator of the underlying reward, in which case one should follow the suggestions, or it was a poor indicator of the reward, in which case one should ignore the suggestions. However, the mixed reliability condition introduced a higher degree of uncertainty. The relation between the testimony and the reward was not predictable: On any given trial the testimony could suggest the correct location or an incorrect location. This trial-by-trial uncertainty stood in contrast to the reliable and unreliable conditions in which one could be fairly certain that the testimony would either suggest the correct location (reliable condition) or suggest an incorrect location (unreliable condition). We expected that the mixed reliability condition would present an opportunity to observe the testimony bias. We predicted that participants would continue to select the suggested locations more often than chance (i.e., the frequency with which the confederate suggested the correct location), and, as a result, garner fewer rewards. We tested children ages 4–9-years-old in order to include preschool-age participants (for whom there is already evidence of a testimony bias; Gualtieri et al., 2019; Jaswal, 2010; Jaswal et al., 2010) as well as older children.

Method

Participants were English-speaking children residing in a mid-size city in the Midwest region of the United States. Participants were recruited through a database of families who had previously expressed interest in participating in research studies or through ads placed in the community or via social media. We conducted analyses in R (version 3.6.3; R Core Team, 2019) and used the tidyverse package (Wickham et al., 2019) for data organization, lme4 package (Bates et al., 2012) for linear and mixed effects models, and ggplot2 (Wickham, 2016) for visualization. The experimental task, de-identified data, and analysis script are available on Open Science Framework: https://osf.io/ey4ut/.

Participants

The sample included 102 children ages 4–9 years (participants could report race and/or ethnicity and the sample included n = 9 Hispanic or Latinx, 93 not Hispanic or Latinx; 6 Black or African American, 7 Asian American, 8 multiracial, 77 White, 4 chose “other” or did not report race). Thirty-six children were in the reliable condition (23 male, 13 female, Mage = 6.887, SDage = 1.736, see Supplemental Materials Table S1 for demographic information across experiments), 34 children were in the unreliable condition (19 male, 15 female, Mage = 6.887, SDage= 1.723), and 32 children were in the mixed reliability condition (17 male, 15 female, Mage = 7.053, SDage= 1.866). The sample size was consistent with previous research using a similar experimental task (Plate et al., 2018). There were no differences in participant age (p = .908) or gender (p = .642) by condition. An additional 16 participants were excluded because they quit the experiment before all trials were presented (N = 13) or because of experimenter error (N = 3). The University of Wisconsin-Madison Institutional Review Board approved the research (protocol ID: 2018–0520; study title: Probabilistic Learning Across Development). Parents of child participants gave informed consent; children gave verbal assent. Parents received $20 for their time and children chose a prize after participating.

Procedure

The experimenter brought the participant to the waiting room and explained that the participant would play a computerized game where the goal was to find gold coins hidden behind rocks (adapted from Plate et al., 2018). The participant met a confederate (who, unbeknownst to the participant, was a research assistant in the lab). The majority of confederates were White and female. Participants were told that the confederate had played the game once before and would provide suggestions about which rock to choose throughout the game. The experimenter also explained that the confederate would play the game in another room and would not be able to see the participant’s choices. The participant and confederate then proceeded to two separate testing rooms situated across a hallway from each other. We did not include details about the confederate’s involvement aside from saying that she would provide “suggestions” to the participant. Our intention was for the context to be ambiguous in order to make all three conditions of the experiment plausible. For example, if the confederate had an explicit goal of “helping” the participant or had a lot of experience with the task, it may have been particularly surprising, and perhaps less believable, when the confederate frequently suggested unrewarded locations. Similarly, because children are sensitive to myriad cues related to an informant’s competence (e.g., Buchsbaum et al., 2011; Corriveau et al., 2009; Einav & Robinson, 2011; Koenig & Harris, 2005; 2007; Kushnir et al., 2008; Pasquini et al., 2007), we limited the participant’s interaction with, and information about, the confederate in order for the participant to glean reliability information from the task statistics alone.

Test Phase.

Following a practice phase (see Supplemental Materials for details), an icon meant to suggest that the participant’s computer was connecting to the confederate’s computer appeared on the screen (“Waiting for Player 2…”). Additionally, during this time, the experimenter went to the confederate’s testing room to “set up.” Specifically, the experimenter entered the confederate’s room and asked the confederate if they remembered how to play the game (a conversation that could be heard by the child across the hallway with both doors open) before closing the door to the confederate’s room. After approximately one minute, the experimenter left the confederate’s room saying “good luck” to the confederate. After 30 seconds, text that read “Player 2 ready. Please wait for experimenter” appeared on the participant’s screen. The experimenter returned to the participant’s room, wished the participant good luck, and instructed the participant to begin the game by clicking the mouse. The simple set-up proved to make the confederate’s involvement in the game believable to participants. None of the children tested indicated doubt about the confederate.

There were two blocks of 100 test trials, separated by a break. At the start of each test trial, eight rocks appeared on the screen with equal spacing along a horizontal line (see Figure 1 AD for experimental task). Before participants were allowed to respond, a pointing hand indicated the confederate’s suggestion. (The task was programmed such that participants were unable to respond prior to seeing the confederate’s suggestion.) When participants selected the correct location on a trial, a coin appeared in place of the rock they selected. When participants selected an incorrect location on a trial, a red “X” appeared in the chosen location and the coin was revealed in the correct location. After completing the task, participants answered questions about the game, the confederate who provided testimony, and the strategy they used during the game (results for participant responses are presented in Supplemental Materials).

Figure 1.

Figure 1.

Progression of computerized task: A) Display prior to participant choice, B) Pointing hand indicated confederate’s suggestion, C) Red box highlighted participant’s choice, D) Coin appeared in rewarded location. E) Illustrates the nonsocial cue. Note: Figure illustrates one instance of choice behavior (i.e., choose rock not suggested by confederate) and one possible outcome (i.e., participant fails to find the coin; confederate finds the coin).

Design

From left to right, the following probabilities defined the likelihood of a coin appearing at the eight possible rock location on any given trial: 0% - 0% - 5% - 10% - 70% - 10% - 5% - 0%. To make all participants’ experiences statistically equivalent, the outcomes were predetermined to ensure a match to the location probabilities across trial blocks (i.e., in each 100-trial block, rock five was rewarded on exactly 70 trials, rocks four and six were rewarded on exactly 10 trials, etc.). Participants were not shown the probabilities; the probabilities had to be learned via experience with the task.

The choices of the confederate (i.e., the underlying distribution of the confederate’s choices, see Supplemental Materials Figure S1) varied across three conditions, and participants were randomly assigned to one of these conditions. In the reliable condition, the confederate suggested the correct rock 90% of the time; in the unreliable condition, the confederate suggested the correct rock 10% of the time; in the mixed reliability condition, the confederate suggested the correct rock 50% of the time. Participants were not told about these probabilities. All other task specifications remained constant across conditions.

Results

Agreement with Testimony

First, we compared participants’ overall agreement with the testimony to the testimony’s true reliability (Table 1). We found evidence of a testimony bias in both the unreliable and the mixed reliability conditions, as participants relied on the testimony to a greater extent than the reliability would recommend. Next, we examined participants’ likelihood of agreeing with the testimony across condition, age, and time using a logistic mixed effects model. We regressed participant response (1 = followed suggestion, 0 = did not follow suggestion) on the interaction between condition (mixed reliability as referent), age (continuous, mean-centered), time (i.e., trial number, mean-centered and divided by ten to facilitate convergence) and all lower order effects. We included a by-participant random intercept and a by-participant slope for time. Our reported effects include odds ratios (OR), which indicate by how much the odds of agreeing with the confederate’s suggestion increase (OR > 1) or decrease (OR < 1) with each one-unit increase in the independent variable of interest, and we include 95% confidence intervals around the ORs. There was an effect of condition (X2(2) = 409.913, p < .001; Figure 2). Participants were more likely to follow the suggestions of a reliable, as compared to a mixed reliability, confederate (b = 2.446, p < .001, OR = 11.546, 95% CI [7.291, 18.285]). Participants were also more likely to follow the suggestions of a mixed reliability confederate as compared to an unreliable confederate (b = 2.276, p < .001, OR = 9.735, 95% CI [6.236, 15.197]). This effect is qualified by a condition-by-time interaction (X2(2) = 84.310, p < .001; Figure 3 top left). Participants increased their agreement over time in the reliable condition (b = 0.200, p < .001, OR = 1.221, 95% CI [1.154, 1.291]) and decreased their agreement over time in the unreliable condition (b = −0.050, p < .001, OR = 0.951, 95% CI [0.928, 0.975]); however, agreement did not change over time in the mixed reliability condition (b = 0.014, p = .395, OR = 1.014, 95% CI [0.982, 1.046]). All other effects in the model were not significant.

Table 1.

Comparison of overall participant agreement with confederate reliability

testimony reliability participant agreement t (participant agreement vs. testimony reliability) df p Cohen’s d 95% CI
Experiment 1
unreliable 10% 21% 6.696 33 <.001 1.15 [18%–24%]
mixed reliability 50% 65% 4.8 31 <.001 0.85 [59%–71%]
reliable 90% 92% 1.652 35 0.108 0.28 [90%–95%]
Experiment 2
unreliable 10% 19% 10.49 28 <.001 1.95 [17%–20%]
mixed reliability 50% 52% 0.73 28 0.472 0.14 [46%–59%]
reliable 90% 86% −1.354 30 0.186 −0.24 [79%–92%]

Note: A post-hoc sensitivity analysis using G*Power (Faul et al., 2007) for a one-tailed t-test, error probably of .05, power of .80, and sample size of 29 (i.e., the smallest group in the study) reveals a critical t-value of 1.70 and effect size d of 0.47. These values are exceeded in the significant contrasts reported above.

Figure 2.

Figure 2.

Likelihood of agreeing with the testimony/ cue by condition and Experiment: Model predictions and participant-level data. Note: Error bands represent standard error of the estimates. Points represent individual participants averaged across trials.

Figure 3.

Figure 3.

Likelihood of agreeing with the testimony/cue (top row) and the likelihood of reward (bottom row) for each Experiment 1 and 2: Model predictions and participant-level data. Note: Lines are point estimates from linear model with the interaction between condition and trial number, and lower-order effects. Error bands represent standard error of the estimates. Points are responses averaged across participants.

Reward

We next examined whether uncertain testimony impacted participants’ ability to obtain rewards. To do so, we ran the model above with likelihood of finding a gold coin as the dependent variable (1 = found coin, 0 = did not find coin; see Supplemental Materials for effects of age and time). Participants found the most coins in the reliable condition, followed by the mixed reliability condition, and participants found the least coins in the unreliable condition (X2(2) = 487.603, p < .001; reliable vs. mixed reliability: b = 1.810, p < .001, OR = 6.113, 95% CI [5.005, 7.468]; mixed reliability vs. unreliable: b = 0.239, p = .018, OR = 1.270, 95% CI [1.042, 1.547]). However, the main effects were qualified by a condition-by-time interaction (X2(2) = 21.703, p < .001; Figure 3 bottom left). Although participants in all three conditions improved their reward receipt over time (reliable: b = 0.066, p < .001, OR = 1.068, 95% CI [1.046, 1.092]; mixed reliability: b = 0.021, p < .001, OR = 1.021, 95% CI [1.011, 1.032]; unreliable: b = 0.062, p < .001, OR = 1.064, 95% CI [1.046, 1.082]), there were relatively greater gains over time in the reliable and unreliable conditions as compared to the mixed reliability condition (bs = 0.043, 0.041 respectively, ps < .001, ORs = 1.044 (95% CI [1.022, 1.066]), 1.042 (95% CI [1.021, 1.063])). These results converge with the results measuring agreement with the confederate’s suggestions. Participants are less able to increase their reward receipt over time in light of uncertain testimony.

Children’s Use of Strategies

In order to better characterize response patterns, we examined the strategies participants used. In past research, participants who completed this computer task without testimony transitioned over time from a sub-optimal strategy (i.e., probability matching = choosing each option at the rate it is rewarded) to an optimal strategy (i.e., maximizing = choosing the most highly rewarded option; Plate et al., 2018). Here, we were interested in participant strategy use in the presence of testimony. Therefore, we assessed the extent to which individual participant choices were best captured by one of five different possible models of choice behavior. In brief, the first model was a probability-matching model. Here participants were expected to choose each option in proportion to the probability that each location had been observed to be correct up to the current trial. The second model was a maximizing model. Under this model, participants were expected to choose the option that had been observed to have the highest probability of reward up to the current trial. The third model was a testimony-matching model, in which the participant’s distribution of choices was expected to be the same as the confederate’s distribution of choices. The fourth model was a testimony-following model, in which participants were expected to choose the option suggested by the confederate (this model can be thought of as maximizing on the confederate’s suggestions). The final model was a random choice model, in which there was an equal and constant probability of the participant selecting each of the eight options. (For full model details, see Supplemental Materials.)

To address what approaches children across the different conditions used, we measured participants’ use of the strategies described above. To do this, we determined the log likelihood of each participant’s set of choices across the experiment by summing the log probabilities of the choices given the probability structure across rock locations and testimony considering each participant’s past trial experience and the model being evaluated. We regressed the log likelihood on condition (mixed reliability condition set as the referent), model (testimony following as referent), age (continuous, mean-centered), and all possible interactions with a by-participant random intercept.

There was an interaction between model (probability matching, maximizing, testimony following, testimony matching, and random choice) and condition (F(8,396) = 117.557, p < .001; see Supplemental Materials Table S2 for all pairwise comparisons and Table S3 for the percent of participants best fit by each strategy). The majority of participants in the reliable condition were best fit by the testimony-following model (97%), which was the optimal strategy for obtaining rewards in this condition. The majority of participants in the unreliable condition were best fit by the probability-matching model (85%), suggesting that they were making choices based on the underlying reward distribution (although they were not using the optimal strategy of maximizing given the reward distribution). Finally, the participants in the mixed reliability condition were best fit by either the probability-matching (44%) or the testimony-following (50%) model. In the condition with the most uncertainty, participants were not using an optimal strategy (maximizing). Moreover, half of the participants were simply following testimony.

Weighting of Testimony

To better understand how participants were using each of these strategies, we examined a mixture model that included both probability matching and testimony following because these were overwhelmingly the strategies used by participants. One key aspect of the mixture model was that the mixture itself was not constrained to be a perfect average of the given two models at hand (i.e., 50% probability matching, 50% testimony following). Instead, the fitting procedure involved finding the best-fitting weighted average of the two models (e.g., if a participant’s choices were largely consistent with testimony following, but occasional choices were more consistent with probability matching, this might produce a final mixture with weights of 90% testimony following, 10% probability matching). Examining the specific weight participants attributed to the testimony-following model versus the probability-matching model thus provided critical insight into how children used the testimony and underlying reward cues.

We regressed the weight attributed to the testimony-following model on condition (mixed reliability as referent), age (continuous, mean-centered), and their interaction. When comparing the weight given to the testimony-following model against that given to the probability-matching model within the mixture model that considered both, there were effects of condition (F(2, 96) = 181.981, p < .001, ΔR2 = 0.776) and age (F(1, 96) = 9.179, p = .003, ΔR2 = 0.020), which were qualified by a marginal condition-by-age interaction (F(2, 96) = 2.994, p = .055, ΔR2 = 0.013, Figure 4). A sensitivity analysis for a linear multiple regression using G*Power (Faul et al., 2007) with alpha = .05, power = .80, sample size = 102, and number of predictors = 5 specifies a critical F value of 2.31 and effect size f2 of 0.13; the critical F value was exceeded for the effects described above, f2 was exceeded for the effect of condition (condition f2 = 3.79, age f2 = .10, condition-by-age f2 = .06). Examining simple slopes for the relation between participant age and weight attributed to the testimony-following model, participant age mattered in the mixed reliability condition: weight attributed to the testimony decreased with age (unreliable: b = −0.009, p = .629; mixed reliability: b = −0.051, p = .003; reliable: b = 0.005, p = .759). Therefore, younger participants had more difficulty disengaging from the testimony under uncertainty. In contrast, participants in the unreliable condition attributed little weight to the testimony, regardless of age. As would be expected based on strategy use, participants in the reliable condition attributed great weight to the testimony, again regardless of age.

Figure 4.

Figure 4.

Weight attributed to the confederate’s suggestions by age and condition for Experiments 1 and 2: Model predictions and participant-level data. Note: Lines are point estimates from linear model with the interaction between condition and participant age, and lower-order effects. Error bands represent standard error of the estimates. Points are individual participants’ weights.

Discussion

Experiment 1 provides evidence for testimony bias in middle childhood. Across the full sample, children over-agreed with the testimony both when the confederate was mixed in terms of reliability and when the confederate was unreliable. However, when the testimony predictably provided poor recommendations (unreliable condition), children reduced their trust over time, resulting in greater improvement of reward receipt over time. On the contrary, participants facing a higher degree of uncertainty in the testimony (mixed reliability condition), did not change their use of that cue over time, and improved relatively less in their reward receipt over time. This pattern of results points to the idea of uncertainty not just influencing overall choices, but also influencing the learner’s ability to flexibly adjust choices over time.

We did not find age differences in likelihood of agreeing with the testimony. We did however, find that children attributed less weight to the testimony-following strategy with age. That children attributed less weight to the testimony-following strategy with age provides a hint of how older children could be working to overcome a testimony bias. Namely, by first moving away from a strategy that is reliant on input from another. Nevertheless, participants across the age range continued to be disproportionately influenced—to some degree—by the testimony. One piece of supporting evidence for this idea is the finding that participants did not use the optimal, maximizing, strategy. Participants in the unreliable condition were best fit by the probability-matching model—not the maximizing model. The only condition in which children demonstrated optimal choice behavior was when receiving testimony that was highly reliable (note that because the reliable testimony was correct 90% of the time, always following this testimony was a better strategy than employing the maximizing model, which would be correct 70% of the time). Taken together, the results from Experiment 1 indicate that uncertainty affected children’s ability to disengage from testimony and subsequently enact optimal choice behavior.

One account of participants’ performance is that they were operating under a testimony bias. Another possibility, however, is that children’s performance in Experiment 1 is the result of having two cues (the testimony and the rewarded location) to track. Tracking multiple cues may create difficulty for discerning the best strategy more generally. Therefore, we conducted Experiment 2 to better understand how children consider cues that are not social.

Experiment 2

Experiment 1 provided evidence consistent with a testimony bias (i.e., participants over-relying on the suggestions from the confederate) in a condition with high uncertainty. However, in order to have confidence that the pattern of behavior was a result of receiving testimony from another person, we replicated the method of Experiment 1, with the exception that we provided “testimony” (i.e., a cue) that was not associated with a social agent. We predicted that fewer participants would continue to follow the cue in the mixed reliability condition of Experiment 2 (compared to Experiment 1). Such a finding would be in line with research showing that infants use social, but not nonsocial, cues in uncertain learning situations (Tummeltshammer et al., 2014). We were also interested in whether children would continue to probability match, instead of maximize, in the unreliable condition. It may be that children would more readily disregard a nonsocial cue, which might facilitate more time to transition to maximizing.

Method

Participants

The sample included 89 children ages 4–9 years (5 Hispanic or Latinx, 80 not Hispanic or Latinx; 1 Black or African American, Asian American, 15 multiracial, 66 White, 4 did not report race and/or ethnicity). Children were recruited from the same community as Experiment 1; none of the children had participated in Experiment 1. Thirty-one children were in the reliable condition (19 male, 12 female, Mage = 7.118, SDage = 1.779), 29 children were in the unreliable condition (17 male, 12 female, Mage = 6.898, SDage= 1.671), and 29 children were in the mixed reliability condition (16 male, 13 female, Mage = 6.986, SDage= 1.577). There were no differences in child age (p = .877) or gender (p = .891) by condition. An additional 17 participants were excluded because they ended the experiment before all trials were presented. The Institutional Review Board approved the research. Parents of child participants gave informed consent; children gave verbal assent. Children chose a prize after participating.

Procedure

The procedure was identical to Experiment 1 with the following exceptions. The cue signaling the suggested rock was an image of neutral-colored pixels (Figure 1, E) arranged to approximate the shape and size of the pointing hand from Experiment 1. Second, we revised the instructions of the task to remove any suggestion that the cue was social. We told participants, “You might notice that sometimes you can see these funny shapes. They’re from the computer, and I’m not sure why they’re showing up.”

The reasoning behind the change in the instructions was to emphasize the nonsocial nature of the cue. There are difficulties in designing paradigms to contrast social and nonsocial information, particularly in probabilistic learning tasks. Perhaps most difficult is that there is a tendency for individuals to view patterned behavior as social in nature (Heider & Simmel, 1944). Further, children have increasing knowledge about, and interactions with, technology that might bias them to see a computer cue as having some social characteristics. For example, children in kindergarten and second grade do not readily differentiate between teachers and information from the internet when seeking knowledge (Wang et al., 2019). In order to make the present paradigm convincingly nonsocial, we devised an explanation for the cue that discouraged social attributions.

Results

Agreement with the Cue

In comparing participants’ overall agreement with the cue with the cue’s reliability, participants only over-agreed with the cue in the unreliable condition (Table 1). From the logistic regression, there was an effect of condition (X2(2) = 257.195, p < .001; reliable vs. mixed reliability: b = 2.392, p < .001, OR = 10.928; 95% CI [6.623, 18.033] unreliable vs. mixed reliability: b = −1.709, p < .001, OR = 0.181, 95% CI [0.110, 0.297]; Figure 2) and an effect of time (b = 0.027, X2(1) = 3.758, p = .053, OR = 1.028, 95% CI [1.000, 1.056]), which were qualified by a condition-by-time interaction (X2(2) = 66.538, p < .001; Figure 3 top right). There was a relatively greater change in agreement over time for the reliable and unreliable conditions as compared to the mixed reliability condition (bs = 0.119, −0.058, OR = 1.126 (95% CI [1.080, 1.175], 0.944 (95% CI [0.907, 0.982]), respectively, ps < .005). Taken together, testimony bias was not present in the condition with the highest uncertainty (the mixed reliability condition), however, participants still failed to update their behavior over time in this condition.

Reward

In addition to the effects of condition (X2(2) = 206.013, p < .001; reliable vs. mixed reliability: b = 1.48, p < .001, OR = 4.392, 95% CI [3.423, 5.635]]; unreliable vs. mixed reliability: b = −0.189, p =.137, OR = 0.827, 95% CI [0.645–1.062]), time (b = 0.038, X2(1) = 20.554, p < .001, OR = 1.039, 95% CI [1.022, 1.056]), and a condition-by-time interaction (X2(2) = 6.924, p .031; reliable vs. mixed reliability: b = 0.031, p = .011, OR = 1.032, 95% CI [1.007, 1.057]; unreliable vs. mixed reliability: b = 0.007, p =.585, OR = 1.007, 95% CI [0.983, 1.030]; Figure 3 bottom right) consistent with the findings above, there was a condition-by-time-by-age interaction (X2(2) = 11.293, p = .004). Whereas children across ages improved to receive high rewards in the reliable condition, children showed more marked improvement with age in those conditions (reliable vs. mixed reliability: b = −0.018, p = .017, OR = 0.982, 95% CI [0.968, 0.997]; reliable vs. unreliable: −0.024, p = .001, OR = 0.977, 95% CI [0.963, 0.991]). Therefore, older children show less evidence of a testimony bias influencing reward receipt when a cue is not social in nature.

Children’s Use of Strategies

We assessed the same five strategies as in Experiment 1. The majority of participants in the reliable condition were best fit by the testimony-following model (81%). The majority of participants in the unreliable condition were best fit by the probability-matching model (83%). Contrary to Experiment 1, the majority of participants in the mixed reliability condition were also best fit by the probability-matching model (69%; 21% fit by the testimony-following model; omnibus interaction between model and condition: F(8,344) = 72.252, p < .001, pairwise comparisons in Supplemental Materials), again suggesting reduced reliance on the cue.

Weighting of the Cue

There was an effect of condition such that participants attributed more weight to the cue in the reliable condition and less weight to the cue in the unreliable condition compared to the mixed reliability condition (omnibus: F(2, 83) = 78.293, p < .001, ΔR2 = 0.639; low vs. moderate: b = −0.273, p < .001; low vs. high: b = −0.734, p < .001; high vs. moderate: b = −0.460, p < .001; Figure 4). There was no effect of age and no age-by-condition interaction (ps > .1). A sensitivity analysis for a linear multiple regression using G*Power (Faul et al., 2007) with alpha = .05, power = .80, sample size = 89, and number of predictors = 5 specifies a critical F value of 2.32 and effect size f2 of 0.15; the critical F value and f2 were exceeded for the effect of condition (f2 = 1.88).

Comparing Experiments 1 and 2.

To better unpack the differences in how the social testimony and nonsocial cue influenced participant behavior on the task, we directly compared Experiments 1 and 2. There were no differences in participant age (p = .795) or gender (p = 1) by experiment.

Agreement with the Testimony/Cue.

We regressed likelihood of choosing the suggested location on condition, age, and experiment (nonsocial = −.5, social = .5) with by-participant and by-time random intercepts. There was a main effect of experiment (b = 0.603 X2(1) = 8.065, p = .005, OR = 1.827, 95% CI [1.205, 2.769]) with overall greater likelihood of taking the suggestion when the cue was supposedly coming from another person (i.e., testimony in Experiment 1). There was also an effect of condition (X2(2) = 700.206, p < .001; reliable vs. mixed reliability: b = 2.053 p < .001, OR = 7.788, 95% CI [5.797, 10.464]; unreliable vs. mixed reliability: b = −1.927, p < .001, OR = 0.146, 95% CI [0.108, 1.195]) and a condition-by-age interaction (X2(2) = 8.646, p = .013). The interaction pattern shows that with increasing age, participants better distinguished between the reliable and mixed reliability conditions (b = 0.255, p = .004, OR = 1.290, 95% CI [1.084, 1.534]). Overall, these results point to a testimony bias that is stronger to social testimony (as opposed to a nonsocial cue).

Reward.

In addition to main effects of condition (X2(2) = 619.984, p < .001; reliable vs. mixed reliability: b = 1.620, p < .001, OR = 5.056, 95% CI [4.315, 5.923]; unreliable vs. mixed reliability: b = −0.211, p = .009, OR = 0.809, 95% CI [0.691, 0.948]) and age (b = 0.073, X2(1) = 4.533, p = .033, OR = 1.076, 95% CI [1.006, 1.151]), there was a condition-by-experiment interaction (X2(2) = 6.961, p = .031). Participants earned relatively more rewards in the reliable (vs. mixed reliability b = 0.346, p = .032, OR = 1.414, 95% CI [1.030, 1.940]; vs. unreliable b = 0.386, p = .016, OR = 1.471, 95% CI [1.075, 2.014]) condition in Experiment 1 (social testimony) as compared to Experiment 2 (nonsocial cue). In other words, it appears that over-reliance on the testimony (as opposed to the nonsocial cue’s placement) in the unreliable and mixed reliability conditions negatively impacted reward receipt.

Strategy Use.

We compared the proportion of participants who used the testimony-following model in each condition across experiments. Exactly zero participants used the testimony-following strategy in the unreliable condition in both experiments. Marginally, a greater proportion of participant were best fit by the testimony-following model in the reliable condition of Experiment 1 versus Experiment 2 (X2(1) = 3.281, p = .070). A significantly greater proportion of participants used the testimony-following strategy in the mixed reliability condition with testimony, compared to the nonsocial cue (X2(1) = 4.468, p = .034), providing evidence of a testimony bias under uncertainty that is particularly robust in response to a social cue.

Weighting of Testimony/Cue.

To compare the weighting of strategies, we regressed the weight of the testimony-following model (within the mixture model including testimony following and probability matching) on age, condition, experiment, and all interactions. A number of effects emerged. In addition to main effects of age (b = −0.046, F(1, 179) = 8.774, p = .003, ΔR2 = 0.013) and condition (omnibus: F(2, 179) = 232.446, p < .001, ΔR2 = 0.682; unreliable vs. mixed reliability: b = −0.355, p < .001; reliable vs. mixed reliability: b = 0.414, p < .001) that were consistent with Experiment 1, there was a main effect of experiment, again suggesting that more weight was attributed to testimony as compared to the nonsocial cue (b = 0.196, F(1, 179) = 14.191, p < .001, ΔR2 = 0.021). The main effects were qualified by a condition-by-age interaction suggesting that participants decreased the weight they attributed to the suggestions in the mixed reliability condition with age, but age did not influence behavior in the unreliable or reliable conditions (omnibus: F(2, 179) = 4.489, p = .013, ΔR2 = 0.013; simple slopes: unreliable: b = −0.008, p = .615; mixed reliability: b = −0.046 p = .003; reliable: b = −0.017, p = .239). Finally, though not statistically significant, there was a marginal condition-by-experiment interaction (F(2, 179) = 2.474, p = .087, ΔR2 = 0.007), reflecting the pattern such that there were greater differences in weight attributed to testimony versus the nonsocial cue in the mixed reliability condition. A sensitivity analysis for a linear multiple regression using G*Power (Faul et al., 2007) with alpha = .05, power = .80, sample size = 191, and number of predictors = 11 specifies a critical F value of 1.84 and effect size f2 of 0.09; the critical F value was exceeded for the effects of condition, age, and experiment and the interactions between condition and age and condition and experiment. The f2 was exceeded for the effect of condition (f2 = 2.60; age f2 = .05, experiment f2 = .08, condition-by-age f2 = .05, condition-by-experiment f2 = .03).

Discussion

In general, participants were more likely to select a suggested location in the social experiment as compared to when the cue was nonsocial. Particularly when the cue was mixed in terms of its reliability, children attributed more weight to the testimony and used a strategy that relied on this information (i.e., they followed the confederate’s suggestions). When the cue was not social, children primarily focused on the underlying rewards. Therefore, the distinction between the testimony and nonsocial cues was contingent on the uncertainty of the environment.

The idea that testimony guides behavior under uncertainty whereas nonsocial cues do not is consistent with research with infants (Tummeltshammer et al., 2014). Specifically, infants directed their attention to a never-rewarded location on a computer screen when that location was cued by a person, but not when it was cued by an arrow. Additionally, the results from the current research suggest that the ability to disengage from a nonsocial cue in order to obtain higher rewards increases with age. Notably, this age-related finding was absent in the social experiment, providing a hint that testimony’s effect may be present later across development. In all, the comparison with a nonsocial experiment provides support for the idea that there is a lingering effect of a “testimony bias” that appears to be—at least in part—unique to uncertain social information.

Experiment 3

When the testimony was characterized by trial-by-trial uncertainty, as in the mixed reliability condition of Experiment 1, weight to the testimony-following strategy decreased with age. A comparison with a nonsocial cue provided evidence that the social nature of the uncertain cue impacted strategy use. Yet, there may be more than one way in which testimony can present uncertainty, for example testimony can change unexpectedly. In Experiment 3 we presented conditions in which testimony unexpectedly changed from being reliable to being unreliable or vice versa. In general, we expected that children would be able to flexibly update their behavior in this changing social environment, particularly given how well distinguished the conditions of Experiment 1 were in terms of agreement with the testimony and because children as young as 3 years of age will decrease their trust in verbal testimony after a seeing evidence that an informant’s claim was incorrect (Hermansen et al., 2021). We were primarily interested in whether age might influence flexibility. In Experiment 1, weight to the testimony-following model decreased with age when the testimony only sometimes indicated the correct reward location. Additionally, adults were better at detecting a one-time pattern change in a probability-learning task than 4–6-year-old children (Starling et al., 2018). Further, individuals use minimal information to make trait-level attributions about social agents (Uleman et al., 2008), and children who are younger than 5 years of age are less accurate when making predictions about a social agent’s future behavior because they are not able to readily incorporate consistency cues (Boseovski & Lee, 2006). Based on preschool-age children’s difficulty adapting to changing information and their bias to trust in testimony, we reasoned that younger children may attribute more weight to testimony that is reliable early in the experiment even if that reliability changes over time. We predicted that younger children would be more likely to follow the testimony if the confederate had previously reliably indicated the rewarded location, even if that confederate currently provided unreliable suggestions. Such a difference may not be present when interacting with initially unreliable testimony. Experiment 2 suggests that children will monitor information sources, even when they have no reason to expect that they will be useful. Therefore, we predicted that, even if participants did not use the testimony in the first half of the experiment, they would be sensitive to the change in reliability and update their choices accordingly.

Method

Participants included 129 children ages 4–9 (12 Hispanic or Latinx, 117 not Hispanic or Latinx; 10 Black or African American, 4 Asian American, 9 multiracial, 103 White, 3 chose “other” or did not report race) who did not participate in Experiments 1 or 2. Nine additional participants were excluded for not completing the task (N = 6) or experimenter error (N = 3).

The design, procedure, and data analysis approach were the same as Experiment 1, with the following exceptions: First, in Experiment 3, the conditions included reliable→unreliable (i.e., testimony indicated the correct rock 90% of the time for the first 100 trials and 10% of the time for the second 100 trials, N = 63, 31 male, 32 female, Mage = 6.938, SDage = 1.691) and unreliable→reliable (i.e., testimony indicated the correct rock 10% of the time for the first 100 trials and 90% of the time for the second 100 trials, N = 66, 37 male, 29 female, Mage = 7.068, SDage = 1.630; no difference in participant age (p = .531) or gender (p = .328) by condition). There was no break or any other change in the experiment that would draw participant’s attention to the change in reliability. As in Experiments 1 and 2, we examined likelihood of choosing the suggested location, likelihood of finding a coin, participant strategy use, and weight attributed to strategies.

Results

Agreement with Testimony

We regressed whether the participant chose the rock that was indicated by the testimony on condition (unreliable→reliable = −.5, reliable→unreliable = .5), age (continuous, mean-centered), task half (first half = −.5, second half = .5), and all possible interactions. We included a by-participant random slope for task half and included a by-trial random intercept. The interaction between condition and age was significant (b = −0.126, X2(1) = 41.430, p < .001; OR = 0.881, 95% CI [0.848, 0.916]; observed power via simr (Green & MacLeod, 2016) was 100.00%, 95% CI [98.17%, 100.00%]; Figure 5A). For participants in the reliable→unreliable condition, likelihood of agreeing with the testimony decreased with age (b = −0.061, p < .001; OR = 0.941, 95% CI [0.915, 0.967]) whereas in the unreliable→reliable condition, the likelihood of agreeing with the testimony increased with age (b = 0.073, p < .001; OR = 1.075, 95% CI [1.046, 1.106]). In sum, younger participants were more strongly influenced by the initial testimony both in the reliable→unreliable and unreliable→reliable conditions.

Figure 5.

Figure 5.

A) Likelihood of agreeing with the confederate’s suggestion by age and condition in Experiment 3. B) Likelihood of agreeing with the confederate’s suggestion by task half and condition in Experiment 3. Model predictions and participant-level data. Note: Lines are point estimates from logistic mixed-effects models with the interaction between condition and participant age, and lower-order effects. Error bands represent standard error of the point estimates. Points are individual participants’ proportion of choices that followed the confederate’s suggestion by age (A) or task half (B).

In addition to the condition-by-age interaction, there was a main effect of condition (b = 0.373, X2(1) = 122.018, p < .001, OR = 1.452, 95% CI [1.359, 1.551]) and condition-by-half interaction (b = −5.805, X2(1) = 586.893, p < .001, OR = 0.003, 95% CI [0.002, 0.005]; Figure 5B), indicating greater differences between conditions in the first, as compared to the second, half of the experiment. There was also a main effect of half (b = 0.237, X2(1) = 3.877, p = .049, OR = 1.268, 95% CI [1.001, 1.606]; greater likelihood of agreement with testimony in the second half of the experiment).

Reward

There were numerous effects on reward. First, a main effect of condition (b = 0.080, X2(1) = 7.392, p = .007, OR = 1.084, 95% CI [1.023, 1.148]) indicated that participants obtained more rewards when the testimony was initially reliable. A main effect of age (b = 0.102, X2(1) = 130.898, p < .001, OR = 1.108, 95% CI [1.088, 1.127]) showed that reward receipt increased with age. The main effects were qualified by condition-by-age (b = −0.038, X2(1) = 4.523, p = .033, OR = 0.963, 95% CI [0.930, 0.997]) and condition-by-half interactions (b = −3.726, X2(1) = 390.577, p < .001, OR = 0.024, 95% CI [0.017, 0.035]). The condition-by-age interaction suggests that there was a relatively smaller difference between conditions in likelihood of finding coins across age. The condition-by-half interaction shows greater differences between conditions in the first, as compared to the second, half of the experiment.

Children’s Use of Strategies

We regressed the log likelihood fit on condition, model, task half, and possible interactions. We included by-participant random slopes for model and task half. The three-way interaction was significant (F(4, 508) = 731.109, p < .001). Prior to the switch in testimony reliability, the majority of participants who received reliable testimony were best fit by the testimony-following model (92%), while participants who received unreliable testimony were best fit by the probability-matching model (79%). Post-switch, participants changed their strategy. The majority of participants who initially received reliable testimony, but now received unreliable testimony, were best fit by the probability-matching model (68%). Participants who initially received unreliable testimony, but now received reliable testimony were best fit by the testimony-following model (80%). When comparing participants who received reliable testimony either in the first half or second half of the experiment, there was no difference in the proportion of participants who were best fit by the testimony-following model (p = .094; similarly, no difference in proportion of participants receiving unreliable testimony fit by the probability matching model based on the first or second half of the experiment, p = .247).

Weighting of Testimony with an Unexpected Change

Finally, we examined the weight attributed to the testimony-following model and tested whether weight was influenced by initial experience (i.e., receiving reliable testimony initially or receiving unreliable testimony initially) and age. There was a main effect of condition (b = 0.097, F(1, 125) = 12.568, p < .001; greater weight was attributed to the testimony-following model in the reliable→unreliable condition). This effect was qualified by a condition-by-half interaction (b = −1.20, F(1,125) = 485.318, p < .001) consistent with the pattern described above. No other effects or interactions were significant (ps > .1). Because Half is a repeated measure, we used simr (Green & MacLeod, 2016) to estimate the minimum effect size (defined as the unstandardized coefficient) to achieve 80% power, which was 0.078 (power = 83.50%, 95% CI [77.62%, 88.36%]) for the effect of condition (actual condition b = 0.097; observed power for effect of condition = 97.00%, 95% CI [93.58%, 98.89%]) and 0.16 (power = 82.50%, 95% CI [76.51%, 87.50%]) for the interaction between condition and half (actual condition-by-half b = −1.20; observed power for effect of condition-by-half = 100.00%, 95% CI [98.17%, 100.00%]).

Discussion

Here we tested uncertainty defined as an unexpected change in testimony. Whether encountering reliable testimony following a period of unreliability or vice versa, participants readily adjusted strategies to match the uncertain, changing environment. However, younger participants continued to be influenced by the initial testimony as revealed by the difference in young children’s likelihood of agreeing with the testimony by condition (an effect that decreased with age). Therefore, younger participants again appeared to be more influenced by the uncertainty—as was also seen in the mixed reliability condition of Experiment 1—and had difficulty adapting to the changing environment as compared to older children.

The influence of initially unreliable testimony also decreased with age. There is some research to support the idea that younger children are more influenced by inaccuracy, though this research suggests that children should be sensitive to the degree of inaccuracy by four years of age (Pasquini et al., 2007) and others have found children as young as 4 years to be sensitive to uncertainty (McLoughlin et al., 2020). However, it is possible that the complexity of the task (e.g., multiple probabilities to track across many trials) prolonged the developmental trajectory. Together, these results hint at the possibility that young children have a more general bias to be influenced by initial testimony—be it reliable or unreliable. However, the age findings should be interpreted with caution for two reasons. First, the effect sizes are small, warranting replication with sample sizes chosen a priori that provide sensitivity to test for the effect. Second, without a nonsocial comparison, we cannot assert that these patterns are caused by the social nature of the cue. Because age-related differences in flexible updating have been found in nonsocial contexts (Starling et al., 2018), it is possible that age-related differences are primarily driven by executive function, memory, or other cognitive skills that track with age. Nevertheless, Experiment 3 provides evidence that children can and do update their choices and strategies in the face of a one-time change in testimony.

General Discussion

The aim of the present research was to investigate two hypotheses (1) That under conditions of high uncertainty, children would show testimony bias, and (2) That even though children as young as 4 years of age can solve this task on their own using probabilistic reasoning, high uncertainty would elicit the testimony bias among older children. Consistent with our prediction, participants over-relied on testimony in a condition with high uncertainty and failed to use an optimal strategy to obtain rewards. In Experiments 1 and 3, we also find support for the idea that testimony continues to influence choice-behavior beyond the preschool years. Though, when testimony is uncertain, impact is inversely related to age. Younger children appear to be especially sensitive to initial testimony and have relatively more difficulty updating their behavior. This finding stands in contrast to other research suggesting that younger children are particularly flexible in their ability to update beliefs in light of extant evidence (e.g., Gopnik et al., 2017; Lucas et al., 2014). The developmental differences observed here may represent a special case of interference in uncertain environments when testimony diverges from another source of, also uncertain, information.

Unpacking Strategy Use

One unexpected finding was that children who received unreliable and mixed reliability testimony or nonsocial cues did not transition to maximizing (i.e., exclusively choosing the highest rewarded option, the pattern of results observed in children in Plate et al., 2018 who were tested in the absence of an additional cue). Maximizing would be the optimal strategy in both the unreliable and mixed reliability conditions. Participants failed to maximize even though they were not following the cue’s suggestions and were relying on the reward distribution (probability matching). However, previous research has also shown reduced rates of maximizing when the statistics of the task are made more difficult to distinguish (Plate et al., 2018).

Several explanations are plausible for participants’ failure to maximize. First, perhaps participants simply did not have enough trials to capture maximizing in this task. Given that children typically probability match for a period of time prior to maximizing, it is possible that the addition of a second source of information required a prolonged period of probability matching prior to transitioning to maximizing. Second, it is possible that children considered alternative strategies beyond probability matching. One such strategy could have involved taking into account how “off” the suggestions were. While there were not any deterministic trials (i.e., trials in which one could know the correct rock with 100% certainty based on the suggestion), it is possible that participants were trying to discern such a pattern. Future research, that includes trial types that systematically vary on determinism, is needed to better understand children’s approaches.

Another feature that could have influenced strategy use was that the social learning context was ambiguous: children did not receive explicit information about the social agent’s intentions, past experience, competence, or knowledge state. As these characteristics influence bias to trust in testimony (e.g., Buchsbaum et al., 2011; Corriveau et al., 2009; Einav & Robinson, 2011; Koenig & Harris, 2005; 2007; Kushnir et al., 2008; Pasquini et al., 2007), it would be useful to better understand how uncertainty affects the extent to which children rely on such information. Examining individual strategy approaches (see Supplemental Materials) can provide some insight into how children interpreted the social manipulation with regard to their own goals in the task. Finally, individual differences also have the potential to affect strategy use resulting from the learner’s testimony bias (e.g., inhibitory control is associated with more bias to trust in testimony, Jaswal et al., 2014; see also Gilbert, 1991) and/ or attention to the underlying statistics (e.g., Browning et al., 2015). Research related to individual differences would enhance our understanding of the complex interplay between the learner and the environment.

Limitations and Future Directions

We observed both similarities and differences in comparing the testimony and nonsocial cues. Differences between social and nonsocial conditions can be particularly difficult to interpret because of individuals’ inclinations to attribute agency to nonsocial cues (e.g., Heider & Simmel, 1944; Wang et al., 2019). One possibility is that children privileged testimony over nonsocial information. This view is in line with research suggesting that social cues can direct attention and action above and beyond nonsocial cues (Tummeltshammer et al., 2014; Wu & Kirkham, 2010). Another possibility is that the differences were driven by the instructions. In other words, because participants were told that the nonsocial cue was incidental, perhaps they did not have any reason to think that the cue would help them find the reward. Despite this confound, participants in the reliable condition in both experiments used the cue. Yet, children could have been more willing to discount the nonsocial cue after having been told that it was not an intended aspect of the game.

There is a body of research on pointing gestures that suggests that pointing is a particularly strong cue for preschool age children and that these children may have a bias to search behind pointed locations (Palmquist et al., 2012; Palmquist & Jaswal, 2012). Evidence suggests that preschoolers are lured by pointers because they have difficulty overcoming a bias that these individuals are knowledgeable (as opposed to the alternate hypothesis that young children have difficulty inhibiting a response to the visual point display; Palmquist et al., 2018). Considering these issues, the testimony versus nonsocial contrast could cut both ways: Following the testimony in Experiment 1 could be too alluring because it is pointer-based and children are given instructions that might prime them to use the cue, therefore inflating the differences between Experiment 1 and Experiment 2. Alternatively, Experiment 2 could be unintentionally social because the cue is not entirely random, therefore diminishing the differences between Experiment 1 and Experiment 2. Related to these distinctions is the broader consideration of what “counts” as testimony. Much of the testimony research has focused on verbal testimony (Sobel & Finiasz, 2020); however, children receive input from others via myriad avenues (e.g., on social media (Antheunis et al., 2013; Freberg et al., 2011; Kaplan & Haenlein, 2010), in textbooks (Rey, 2012), and through nonverbal signaling (e.g., a parent simply pointing at a toy for the child to put away). Features that covary with the manner in which input is conveyed could be meaningful for understanding when, why, and for how long children defer to others. For example, testimony provided in person could exert additional social pressure (e.g., to conform or follow a social norm), or it might increase the non-epistemic cues (e.g., the informant’s social identity, position of power) that intensify deference to testimony (Harris et al., 2018; Jaswal & Kondrad, 2016). It is possible that having the confederate in the room with the child and either pointing (similar to Jaswal, 2010) or providing verbal testimony would have further amplified the bias to select the suggested location both in comparison to the nonsocial condition and across development.

In contrast to instances of verbal testimony, one feature of the current study is that the suggestion remains visible both during choice selection and during the presentation of outcomes. That the testimony cue remains visible (but without an in-person informant) may present a unique opportunity to treat the testimony as a single instance of data that can be compared against other data sources (Sobel & Kushnir, 2013). The complex nature of cues is precisely why investigations are needed to make headway in contrasting the mechanisms that underlie social learning and those that underlie nonsocial learning. We believe that the present research highlights an early step in this area of research.

Conclusion

Overall, this research provides evidence regarding lingering testimony bias across childhood. Children were sensitive to the reward structure and the statistics of the testimony. However, children had difficulty adjusting behavior when testimony was uncertain, and younger children were disproportionately likely to rely on testimony under uncertainty. Younger children were also more strongly influenced by initial testimony prior to an unexpected change, an effect that decreased with age. Through these experiments, we shed light on conditions that can strengthen bias to defer to testimony even when children are at ages in which they otherwise demonstrate rational behavior.

Supplementary Material

Plate et al_supplementalMaterials_accepted

Acknowledgments

This research was supported by the National Institute of Mental Health through grant R01MH61285 to S.D.P. and in part by a core grant to the Waisman Center from the National Institute of Child Health and Human Development (U54 HD090256). R.C.P. was supported by a National Science Foundation Graduate Research Fellowship (DGE-1256259) and the Richard L. and Jeanette A. Hoffman Wisconsin Distinguished Graduate Fellowship. We thank the families who participated in this study and the research assistants who helped conduct the research. The experimental paradigm, de-identified data, and analysis scripts are available on Open Science Framework: https://osf.io/ey4ut/. The studies were not preregistered.

References

  1. Antheunis ML, Tates K, & Nieboer TE (2013). Patients’ and health professionals’ use of social media in health care: motives, barriers and expectations. Patient education and counseling, 92(3), 426–431. doi: 10.1016/j.pec.2013.06.020 [DOI] [PubMed] [Google Scholar]
  2. Asch SE, & Guetzkow H (1951). Effects of group pressure upon the modification and distortion of judgments. Organizational influence processes, 295–303. [Google Scholar]
  3. Bates D, Maechler M, Bolker B, Walker S, Christensen RHB, Singmann H, … & Scheipl F (2012). Package ‘lme4’. CRAN. R Foundation for Statistical Computing, Vienna, Austria. [Google Scholar]
  4. Birch S, Severson RL, & Baimel A (2020). Children’s understanding of when a person’s confidence and hesitancy is a cue to their credibility. PloS One, 15, e0227026. doi: 10.1371/journal.pone.0227026 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Blakemore SJ, & Mills KL (2014). Is adolescence a sensitive period for sociocultural processing?. Annual review of psychology, 65, 187–207. doi: 10.1146/annurev-psych-010213-115202 [DOI] [PubMed] [Google Scholar]
  6. Bonawitz E, Shafto P, Gweon H, Goodman ND, Spelke E, & Schulz L (2011). The double-edged sword of pedagogy: Instruction limits spontaneous exploration and discovery. Cognition, 120(3), 322–330. doi: 10.1016/j.cognition.2010.10.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Boseovski JJ, & Lee K (2006). Children’s use of frequency information for trait categorization and behavioral prediction. Developmental psychology, 42(3), 500. doi: 10.1037/0012-1649.42.3.500 [DOI] [PubMed] [Google Scholar]
  8. Bridgers S, Buchsbaum D, Seiver E, Griffiths TL, & Gopnik A (2016). Children’s causal inferences from conflicting testimony and observations. Developmental psychology, 52(1), 9. doi: 10.1037/a0039830 [DOI] [PubMed] [Google Scholar]
  9. Browning M, Behrens TE, Jocham G, O’reilly JX, & Bishop SJ (2015). Anxious individuals have difficulty learning the causal statistics of aversive environments. Nature neuroscience, 18(4), 590–596. doi: 10.1038/nn.3961 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Buchsbaum D, Gopnik A, Griffiths TL, & Shafto P (2011). Children’s imitation of causal action sequences is influenced by statistical and pedagogical evidence. Cognition, 120(3), 331–340. doi: 10.1016/j.cognition.2010.12.001 [DOI] [PubMed] [Google Scholar]
  11. Corriveau KH, Meints K, & Harris PL (2009). Early tracking of informant accuracy and inaccuracy. British Journal of Developmental Psychology, 27(2), 331–342. doi: [DOI] [PubMed] [Google Scholar]
  12. Einav S, & Robinson EJ (2011). When being right is not enough: Four-year-olds distinguish knowledgeable informants from merely accurate informants. Psychological science, 22(10), 1250–1253. doi: 10.1177/0956797611416998 [DOI] [PubMed] [Google Scholar]
  13. Faul F, Erdfelder E, Lang AG. et al. G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behavior Research Methods 39, 175–191 (2007). doi: 10.3758/BF03193146 [DOI] [PubMed] [Google Scholar]
  14. Freberg K, Graham K, McGaughey K, & Freberg LA (2011). Who are the social media influencers? A study of public perceptions of personality. Public Relations Review, 37(1), 90–92. doi: 10.1016/j.pubrev.2010.11.001 [DOI] [Google Scholar]
  15. Gilbert DT (1991). How mental systems believe. American Psychologist, 46(2), 107–119. doi: 10.1037/0003-066X.46.2.107 [DOI] [Google Scholar]
  16. Gopnik A, O’Grady S, Lucas CG, Griffiths TL, Wente A, Bridgers S, … & Dahl RE (2017). Changes in cognitive flexibility and hypothesis search across human life history from childhood to adolescence to adulthood. Proceedings of the National Academy of Sciences, 114(30), 7892–7899. doi: 10.1073/pnas.1700811114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Green P, MacLeod CJ (2016). “simr: an R package for power analysis of generalised linear mixed models by simulation.” Methods in Ecology and Evolution, 7(4), 493–498. doi: 10.1111/2041-210X.12504, https://CRAN.R-project.org/package=simr [DOI] [Google Scholar]
  18. Gweon H, Pelton H, Konopka JA, & Schulz LE (2014). Sins of omission: Children selectively explore when teachers are under-informative. Cognition, 132(3), 335–341. doi: 10.1016/j.cognition.2014.04.013 [DOI] [PubMed] [Google Scholar]
  19. Gualtieri S, Buchsbaum D, & Denison S (2019). Exploring information use in children’s decision-making: Base-rate neglect and trust in testimony. Journal of Experimental Psychology: General. doi: 10.1037/xge0000726 [DOI] [PubMed] [Google Scholar]
  20. Harris PL (2012). Trusting what you’re told: How children learn from others. Harvard University Press. [Google Scholar]
  21. Harris PL, & Koenig MA (2006). Trust in testimony: How children learn about science and religion. Child development, 77(3), 505–524. doi: 10.1111/j.1467-8624.2006.00886.x [DOI] [PubMed] [Google Scholar]
  22. Harris PL, Koenig MA, Corriveau KH, & Jaswal VK (2018). Cognitive foundations of learning from testimony. Annual Review of Psychology, 69, 251–273. doi: 10.1146/annurev-psych-122216-011710 [DOI] [PubMed] [Google Scholar]
  23. Haun DB, & Tomasello M (2011). Conformity to peer pressure in preschool children. Child development, 82(6), 1759–1767. doi: 10.1111/j.1467-8624.2011.01666.x [DOI] [PubMed] [Google Scholar]
  24. Heider F, & Simmel M (1944). An experimental study of apparent behavior. The American journal of psychology, 57(2), 243–259. doi: 10.2307/1416950 [DOI] [Google Scholar]
  25. Hermansen TK, Ronfard S, Harris PL, Pons F, & Zambrana IM (2021). Young children update their trust in an informant’s claim when experience tells them otherwise. Journal of Experimental Child Psychology, 205, 105063. doi: 10.1016/j.jecp.2020.105063 [DOI] [PubMed] [Google Scholar]
  26. Hermes J, Behne T, Bich AE, Thielert C, & Rakoczy H (2018). Children’s selective trust decisions: Rational competence and limiting performance factors. Developmental science, 21(2), e12527. doi: 10.1111/desc.12527 [DOI] [PubMed] [Google Scholar]
  27. Hermes J, Rakoczy H, & Behne T (2019). Making sense of conflicting information: A touchscreen paradigm to measure young children’s selective trust. Infant and Child Development, 28(1), e2119. doi: 10.1002/icd.2119 [DOI] [Google Scholar]
  28. Jaswal VK (2010). Believing what you’re told: Young children’s trust in unexpected testimony about the physical world. Cognitive psychology, 61(3), 248–272. doi: 10.1016/j.cogpsych.2010.06.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Jaswal VK, Croft AC, Setia AR, & Cole CA (2010). Young children have a specific, highly robust bias to trust testimony. Psychological Science, 21(10), 1541–1547. doi: 10.1177/0956797610383438 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Jaswal VK, & Kondrad RL (2016). Why children are not always epistemically vigilant: Cognitive limits and social considerations. Child Development Perspectives, 10(4), 240–244. doi: 10.1111/cdep.12187 [DOI] [Google Scholar]
  31. Jaswal VK, Pérez‐Edgar K, Kondrad RL, Palmquist CM, Cole CA, & Cole CE (2014). Can’t stop believing: Inhibitory control and resistance to misleading testimony. Developmental science, 17(6), 965–976. doi: 10.1111/desc.12187 [DOI] [PubMed] [Google Scholar]
  32. Kaplan AM, & Haenlein M (2010). Users of the world, unite! The challenges and opportunities of Social Media. Business horizons, 53(1), 59–68. doi: 10.1016/j.bushor.2009.09.003 [DOI] [Google Scholar]
  33. Koenig MA, & Harris PL (2005). Preschoolers mistrust ignorant and inaccurate speakers. Child development, 76(6), 1261–1277. doi: 10.1111/j.1467-8624.2005.00849.x [DOI] [PubMed] [Google Scholar]
  34. Koenig MA, & Harris PL (2007). The basis of epistemic trust: Reliable testimony or reliable sources?. Episteme, 4(3), 264–284. doi: 10.1353/epi.0.0017 [DOI] [Google Scholar]
  35. Kushnir T, Wellman HM, & Gelman SA (2008). The role of preschoolers’ social understanding in evaluating the informativeness of causal interventions. Cognition, 107(3), 1084–1092. doi: 10.1016/j.cognition.2007.10.004 [DOI] [PubMed] [Google Scholar]
  36. Li X, & Yow WQ (2018). Willingness to revise own testimony: 3-and 4-year-olds’ selective trust in unexpected testimony from accurate and inaccurate informants. Journal of experimental child psychology, 173, 1–15. doi: 10.1016/j.jecp.2018.03.008 [DOI] [PubMed] [Google Scholar]
  37. Lucas CG, Bridgers S, Griffiths TL, & Gopnik A (2014). When children are better (or at least more open-minded) learners than adults: Developmental differences in learning the forms of causal relationships. Cognition, 131(2), 284–299. doi: 10.1016/j.cognition.2013.12.010 [DOI] [PubMed] [Google Scholar]
  38. Magner UI, Schwonke R, Aleven V, Popescu O, & Renkl A (2014). Triggering situational interest by decorative illustrations both fosters and hinders learning in computer-based learning environments. Learning and instruction, 29, 141–152. doi: 10.1016/j.learninstruc.2012.07.002 [DOI] [Google Scholar]
  39. Mcloughlin NC, Finiasz Z, Sobel DM, & Corriveau KH (2020). Children’s developing capacity to calibrate verbal testimony with observed evidence when learning causal relations. Preprint doi: 10.31219/osf.io/zuqwm [DOI] [PubMed] [Google Scholar]
  40. Palmquist CM, Burns HE, & Jaswal VK (2012). Pointing disrupts preschoolers’ ability to discriminate between knowledgeable and ignorant informants. Cognitive Development, 27(1), 54–63. doi: 10.1016/j.cogdev.2011.07.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Palmquist CM & Jaswal VK (2012). Preschoolers expect pointers (even ignorant ones) to be knowledgeable. Psychological Science, 23(3), 230–231. doi: 10.1177/0956797611427043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Palmquist CM, Kondrad RL, & Norris MN (2018). Follow my point? Preschoolers’ expectations of veridicality disrupt their understanding of deceptive points. Cognitive Development, 48, 190–202. doi: 10.1016/j.cogdev.2018.08.009. [DOI] [Google Scholar]
  43. Pasquini ES, Corriveau KH, Koenig M, & Harris PL (2007). Preschoolers monitor the relative accuracy of informants. Developmental Psychology, 43(5), 1216. doi: 10.1037/0012-1649.43.5.1216 [DOI] [PubMed] [Google Scholar]
  44. Plate RC (2021, January 15). Testimony bias lingers across development under uncertainty. Retrieved from osf.io/ey4ut [DOI] [PMC free article] [PubMed]
  45. Plate RC, Fulvio JM, Shutts K, Green CS, & Pollak SD (2018). Probability Learning: Changes in Behavior Across Time and Development. Child development, 89(1), 205–218. doi: 10.1111/cdev.12718 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Rey GD (2012). A review of research and a meta-analysis of the seductive detail effect. Educational Research Review, 7(3), 216–237. doi: 10.1016/j.edurev.2012.05.003 [DOI] [Google Scholar]
  47. Riggs AE, Alibali MW, & Kalish CW (2015). Leave her out of it: Person-presentation of strategies is harmful for transfer. Cognitive Science, 39(8), 1965–1978. doi: 10.1111/cogs.12224 [DOI] [PubMed] [Google Scholar]
  48. Riggs AE, Alibali MW, & Kalish CW (2017). Does it matter how Molly does it? Person-presentation of strategies and transfer in mathematics. Contemporary Educational Psychology, 51, 315–320. doi: 10.1016/j.cedpsych.2017.09.001 [DOI] [Google Scholar]
  49. Ronfard S, & Lane JD (2018). Preschoolers continually adjust their epistemic trust based on an informant’s ongoing accuracy. Child development, 89(2), 414–429. doi: 10.1111/cdev.12720 [DOI] [PubMed] [Google Scholar]
  50. Ronfard S, Lane JD, Wang M, & Harris PL (2017). The impact of counter-perceptual testimony on children’s categorization after a delay. Journal of Experimental Child Psychology, 163, 151–158. doi: 10.1016/j.jecp.2017.06.006 [DOI] [PubMed] [Google Scholar]
  51. Sobel DM, & Finiasz Z (2020). How children learn from others: An analysis of selective word learning. Child Development, 91(6), e1134–e1161. doi: 10.1111/cdev.13415 [DOI] [PubMed] [Google Scholar]
  52. Schulz LE, & Bonawitz EB (2007). Serious fun: preschoolers engage in more exploratory play when evidence is confounded. Developmental Psychology, 43(4), 1045. doi: 10.1037/0012-1649.43.4.1045 [DOI] [PubMed] [Google Scholar]
  53. Shafto P, Eaves B, Navarro DJ, & Perfors A (2012). Epistemic trust: Modeling children’s reasoning about others’ knowledge and intent. Developmental Science, 15(3), 436–447. doi: 10.1111/j.1467-7687.2012.01135.x [DOI] [PubMed] [Google Scholar]
  54. Sobel DM, Buchanan DW, Butterfield J, & Jenkins OC (2010). Interactions between causal models, theories, and social cognitive development. Neural Networks, 23(8–9), 1060–1071. doi: 10.1016/j.neunet.2010.06.004 [DOI] [PubMed] [Google Scholar]
  55. Sobel DM, & Kushnir T (2013). Knowledge matters: How children evaluate the reliability of testimony as a process of rational inference. Psychological Review, 120(4), 779. doi: 10.1037/a0034191 [DOI] [PubMed] [Google Scholar]
  56. Starling SJ, Reeder PA, & Aslin RN (2018). Probability learning in an uncertain world: How children adjust to changing contingencies. Cognitive development, 48, 105–116. doi: 10.1016/j.cogdev.2018.06.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Team R. C. (2019). R: A language and environment for statistical computing (version 3.1. 2). Vienna, Austria. R Foundation for Statistical Computing; 2014. [Google Scholar]
  58. Tummeltshammer KS, Wu R, Sobel DM, & Kirkham NZ (2014). Infants track the reliability of potential informants. Psychological Science, 25(9), 1730–1738. doi: 10.1177/0956797614540178 [DOI] [PubMed] [Google Scholar]
  59. Uleman JS, Adil Saribay S, & Gonzalez CM (2008). Spontaneous inferences, implicit impressions, and implicit theories. Annu. Rev. Psychol, 59, 329–360. doi: 10.1146/annurev.psych.59.103006.093707 [DOI] [PubMed] [Google Scholar]
  60. Wang F, Tong Y, & Danovitch J (2019). Who do I believe? Children’s epistemic trust in internet, teacher, and peer informants. Cognitive Development, 50, 248–260. doi: 10.1016/j.cogdev.2019.05.006 [DOI] [Google Scholar]
  61. Wickham H (2016). ggplot2: elegant graphics for data analysis. springer. [Google Scholar]
  62. Wickham H, Averick M, Bryan J, Chang W, McGowan L, François R, Grolemund G, Hayes A, Henry L, Hester J, Kuhn M, Pedersen T, Miller E, Bache S, Müller K, Ooms J, Robinson D, Seidel D, Spinu V, … Yutani H (2019). Welcome to the Tidyverse. Journal of Open Source Software, 4(43), 1686. doi: 10.21105/joss.01686 [DOI] [Google Scholar]
  63. Wu R, & Kirkham NZ (2010). No two cues are alike: Depth of learning during infancy is dependent on what orients attention. Journal of Experimental Child Psychology, 107(2), 118–136. doi: 10.1016/j.jecp.2010.04.014 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Plate et al_supplementalMaterials_accepted

RESOURCES