Skip to main content
Proceedings of the Royal Society B: Biological Sciences logoLink to Proceedings of the Royal Society B: Biological Sciences
. 2011 Jul 27;279(1729):653–662. doi: 10.1098/rspb.2011.1172

The evolutionary basis of human social learning

T J H Morgan 1, L E Rendell 1, M Ehn 2, W Hoppitt 1, K N Laland 1,*
PMCID: PMC3248730  PMID: 21795267

Abstract

Humans are characterized by an extreme dependence on culturally transmitted information. Such dependence requires the complex integration of social and asocial information to generate effective learning and decision making. Recent formal theory predicts that natural selection should favour adaptive learning strategies, but relevant empirical work is scarce and rarely examines multiple strategies or tasks. We tested nine hypotheses derived from theoretical models, running a series of experiments investigating factors affecting when and how humans use social information, and whether such behaviour is adaptive, across several computer-based tasks. The number of demonstrators, consensus among demonstrators, confidence of subjects, task difficulty, number of sessions, cost of asocial learning, subject performance and demonstrator performance all influenced subjects' use of social information, and did so adaptively. Our analysis provides strong support for the hypothesis that human social learning is regulated by adaptive learning rules.

Keywords: social learning, asocial learning, social learning strategy, copying, conformity, consensus

1. Introduction

Rules that govern the use of social information are variously referred to as ‘social learning strategies’ [1], ‘transmission biases’ [2,3] and ‘trust’ [4]. Theory suggests that individuals ought to be selective with respect to when and whom they copy [2,5], and that natural selection will favour the deployment of adaptive social learning strategies that guide individual reliance on social information [1,2,6]. Social learning strategies have been primarily examined through theoretical work using population genetic and game theory models [2,5,714]. Such rules are also receiving attention from researchers in a wide variety of academic disciplines with interests in the experimental analysis of social learning, cultural transmission and cultural evolution [1524]. However, there remains comparatively little empirical data on human social learning with which to test the hypotheses generated by evolutionary models.

One relatively well-studied class of rules are frequency-dependent strategies, such as conformity and ‘anti-conformity’ [2,13,14,18,25], which involve individuals selectively adopting traits based on how common they are. Following Boyd & Richerson [2, p. 206], we define conformist frequency-dependent copying as the disproportionately likely adoption of the most common variant. Theoretical work on conformity has produced mixed results; some analyses suggest that conformity readily evolves under a broad range of conditions, including temporally and spatially variable environments [2,9], while other models conclude that conformity should be selected against because it hinders cumulative culture [25]. Empirical evidence of conformity has proved elusive [17,18,25,26], and some work has suggested that changes in frequency may be more salient than absolute frequencies [27].

Another class of social learning strategies is pay-off-based rules, where copying depends on the return to the observed individual [10,14]. Game-theoretic analyses have indicated that strategies where an individual's use of social information was proportional either to their own pay-off, the pay-off to demonstrators, or the difference between the two can be particularly effective [10,11]. There is good evidence that humans and other animals are sensitive to such information and do use it to direct social learning [19,20,2830]. Other types of strategy are less studied, but nonetheless, there is evidence from human populations that adaptive beliefs are transmitted via prestige [31] and kinship biases [32].

There is, of course, a long-standing interest among social psychologists in when individuals will adopt the decisions of others [3340]. Social impact theory clearly relates to social learning strategies, proposes a psychological mechanism and has been extended to consider its effect upon population-level belief patterns [37,41]. However, such work typically focuses on the adoption of arbitrary or incorrect group decisions [42], and by doing so limits consideration of the evolutionary, population-level consequences of such behaviour, and of whether the use of social information leads to adaptive behaviour. Conversely, we aim to understand not only how different factors affect decision making, but also their impact upon individual performance, leading to functional explanations for the evolution of decision-making rules [43].

A limitation of previous research into social learning strategies is the implicit assumption that individuals only employ one strategy in a given context [1], with models rarely considering more than one strategy, and experiments designed in such a way that subjects cannot use multiple strategies simultaneously. Furthermore, empirical work is typically limited to a single experimental task, raising questions about the generality of observations. Finally, there is a paucity of empirical data on whether the strategies people use are employed adaptively. Here, we present four experiments in which adult human subjects were required to solve computer-based problems by using asocial and/or social information. In total, we employed six tasks and two methods of information presentation and investigated the effect of nine different factors, often presented simultaneously, on subject reliance on social information use. This corresponds to our testing nine specific hypotheses derived from the cultural evolution literature [2,9,10,44]. Factors considered include: the number of demonstrators (+), demonstrator consensus (+), subject confidence (−), task difficulty (+), the cost of asocial learning (+), task familiarity (−), demonstrator (+) and subject (−) performance, and their difference (+); where the + and − signs signify predictions for positive or negative relationships with social information use. Our use of multiple tasks allows us to examine whether social learning strategies are deployed consistently across diverse contexts and sensory modalities, while investigating numerous factors allows us to test whether subjects are limited to using a single strategy at a time. Furthermore, by setting social information use in a task-solving context, we are able to investigate the adaptiveness of subjects' behaviour by measuring subjects' success in the tasks. Thus, the work presented here aims to provide a comprehensive empirical test of the adaptive learning strategies predicted by evolutionary models, to provide new data concerning pay-off and frequency-dependent social learning and to extend existing work by examining the simultaneous use of multiple strategies.

2. Methods

(a). General methods

Across four experiments, subjects took part in computer-based binary choice tasks, using asocial and/or social information to guide decision making. Asocial information gave subjects access to the task itself, whereas social information presented subjects with the decisions of a number of demonstrator individuals. Experimental sessions lasted for 60 min. Subjects were paid £5 for taking part, plus a bonus of up to £5 dependent on their performance.

(i). Subjects and apparatus

In total, 99 subjects took part in experiment 1, 57 in experiment 2, 38 in experiment 3 and 61 in experiment 4. Subjects ages ranged from 18 to 58 (mean = 22, s.d. = 4). Eighty three were male, 172 were female and all but nine were students of the University of St Andrews. Full details are given in the electronic supplementary material.

Subjects took part in experiments in batches of 1–11 individuals. All subjects had access to a computer and were separated by large screens such that they could not see other subjects. Subjects were provided with headphones such that they could listen to any required sounds without disturbing other participants.

(ii). Procedure

On arrival, the experimenter read subjects a briefing and gave them the opportunity to ask any questions. The experimenter then went to an adjoining room and remotely started the experiment programme on the subjects' computers. The connecting door was kept open so that the experimenter could hear if any subjects communicated, but no instances of between-subject communication were observed. When all subjects had completed the experiment, the experimenter returned to debrief the subjects who were then free to collect payment and leave. The briefing and debriefing script for each experiment are presented in the electronic supplementary material.

(iii). General design

The four experiments were designed to repeatedly test numerous factors in a range of contexts and involved two methods of information presentation, six tasks and three sources of social information. We first describe these general design components and then present details and results of each experiment. Each experiment was designed without knowledge of the results of any others. Thus, they can be viewed as independent attempts to address a set of related questions about human social learning. We, therefore, discuss the results together, rather than each in turn, as many results were repeated across experiments. Since several results were replicated across multiple tasks, this gives confidence in the generality of our findings.

(iv). Methods of information presentation

We termed the first method of information presentation the linear protocol. Here, subjects received asocial and then social information (or vice-versa). After receiving each type of information, subjects were required to make a decision and rate their confidence in their decision on a 7-point Likert scale [45]. Subjects were considered to have used social information if their final decision (following social information) differed from their initial decision (following asocial information), given that the majority of demonstrators disagreed with the subject. Here, asocial information was ‘free’ and the duration of the task stimulus was fixed.

In the second protocol, termed the parallel protocol, subjects were given the choice of either social or asocial information, before which they were informed of specifics of the trial (detailed below). After receiving their chosen source of information, subjects were required to make a single decision. Subjects were considered to have used social information when they chose to view social over asocial information. Here, asocial information was costly, and subjects determined the duration or number of presentations of the task stimulus.

(v). The tasks

Foraging task

This task involved selecting the more profitable of two virtual foraging sites and was designed to mimic the procedures deployed in published animal experiments [4648]. Subjects received 20 visual presentations of a number of apples at the two sites, 10 at each. The true values of the sites were either 8 v. 4, 6 v. 2 or vice-versa. The number of apples shown in each presentation was independently drawn from a Poisson distribution centred on the true value of the site (e.g. an image of 12 apples at the right-hand site with a true value of 8), however, further uncertainty was introduced by drawing the number of apples from the value of the other site for 1 of the 10 presentations at each site. The apple presentations were given in a random order and each lasted 500 ms.

Mental-rotation task

Subjects were required to decide whether two shapes were the same shape seen from different angles or different shapes (see the electronic supplementary material for example). This task was based on that used by Shepard & Metzler [49], and allows trials of different difficulty to be generated. In each trial, subjects received a single visual presentation of an image of two shapes, the duration of which varied across experiments.

Length-estimation task

Subjects were required to decide which of two irregular lines was longest (see the electronic supplementary material for example). In each trial, subjects received a single visual presentation of an image of two lines, the duration of which varied as detailed below. This task was chosen as it is hard to solve through visual observation alone and so all responses will be associated with some degree of uncertainty.

Audio tasks

Subjects were required to decide which of two tones was (i) higher in pitch (the ‘pitch task’), (ii) greater in intensity (the ‘intensity task’), or (iii) whether a single tone was increasing or decreasing in pitch (the ‘pitch-modulation task’). These tasks provided a test of the sensory multi-modality of learning strategies. Subjects had access to button(s) that played the relevant tone(s) and could play them as many times as they wished, but incurred a cost per playing, detailed below, that reduced their bonus payment.

(vi). Types of social information

The decisions of others were presented using two tiles labelled with the possible answers. A demonstrator's decision was represented by the relevant tile flashing for 350 ms. This information was either from a battery of previous subject responses (battery), from other subjects taking part in the same experimental session (live) or was generated by us. When social information was manipulated, we employed a conditional information lottery [50] to ensure subjects would treat the information as genuine without being deceived. Under this procedure, a minority of trials used genuine (battery) social information and only performance on these trials affected payment. Subjects were informed of this, but not of which trials used genuine information. Under this protocol, subjects have been shown to treat all information as genuine [50]. Subject feedback following the experiment was used to identify subjects who had failed to understand this procedure and their data were excluded from the analysis.

(vii). General analyses

Unless otherwise stated, we analysed the data using generalized linear mixed models with a Bernoulli error structure and logit link function. We used Markov Chain Monte Carlo methods [51,52] to fit the models in WinBUGS 1.4 [53] and to generate credible intervals for each parameter. All models included a baseline constant, standardized variables (see below for details) and a random subject effect. Minimal adequate models (detailed in the electronic supplementary material) were constructed by backwards elimination, removing variables for which the 95% central credible interval included 0 from a model containing all predictors and second-order interactions. Parameter values were estimated using a sample of at least 3000 iterations, after a suitable burn-in period and thinning to remove autocorrelations. Effects are reported on the logit scale. Preliminary analyses (detailed in the electronic supplementary material) were carried out to establish whether factors should be modelled as linear or categorical.

Unless otherwise stated, all graphs show the median of posterior samples from the fitted models, as estimates of parameter values. Plots of the posterior parameter estimates are preferred to raw data because they directly illustrate the marginal effect sizes associated with a given predictor while controlling for the effects of other predictors. See figure 1a,b for a comparison of the raw data and model estimates.

Figure 1.

Figure 1.

(a,b) A comparison of the raw data (red, average and Wilson confidence interval) and model predictions (black, median and 95% central credible interval) for (a) subject confidence and (b) the different social information conditions (where 7 : 5 means seven demonstrators make one decision while five make the other). The model closely matches the raw data, the narrower intervals for model estimates highlight the increase in precision offered by the model. (c) Median (thick lines) and 95% credible intervals (thin lines) for the probability that a subject uses social information depending on demonstrator consensus in experiments 1 and 2 (modelled as linear) and 4 (modelled as categorical). (d) The probability that subjects use social information increases with the number of demonstrators in three of four experiments, with more pronounced effects in experiments 3 and 4, which used the parallel protocol. (e) In experiments 1 and 2 the probability that a subject uses social information decreases with subject confidence in their own judgement based on asocial information (solid lines), while the probability that subject is correct increases (dashed lines). (f) The probability a subject copies a demonstrator depends on their relative performance and the absolute performance of the demonstrator. (experiment 4). Subjects respond differently to high performing demonstrators dependent on their own success, however, all subjects are equally unlikely to copy a poorly performing demonstrator.

(b). Experiment 1

(i). Design

Subjects completed 10 trials using the foraging task, linear protocol and manipulated social information, presented in a random order. We manipulated (i) the number of demonstrators and (ii) the level of consensus in the social information they provided, such that social information consisted of either 4 v. 0, 5 v. 3, 6 v. 2, 7 v. 5 or 8 v. 4 demonstrators. For 48 of the subjects, the order of information presentation was reversed (i.e. social first) to investigate commitment effects.

(ii). Analyses

We modelled the probability of social information use for subjects given asocial information first as a function of demonstrator number, demonstrator consensus (the proportion of demonstrators choosing the majority option, scaled to range between 0 and 1) and subject confidence in their initial decision. To investigate whether the use of social information was adaptive, we modelled the effect of subject confidence on the probability that a subject's initial, asocial, decision was correct.

To investigate how the order of information presentation affected our results, we pooled data from all subjects and modelled whether social information was used differently dependent on the order of information presentation.

(iii). Results

Subjects were increasingly likely to adopt a conflicting majority decision with decreasing confidence in their own decision (subject confidence 95% credible interval (CI): (−0.8737, −0.1308), median = −0.4809; figure 1e), increasing demonstrator consensus (CI: (1.226, 4.011), median = 2.497; figure 1c) and increasing numbers of demonstrators (CI: (0.0007, 0.2332), median = 0.1128; figure 1d). There were no interactions. Subject confidence predicted whether a subject was right or wrong, with low confidence predicting an incorrect answer (CI: (0.1883, 0.8387), median = 0.4988; figure 1e). There was no effect of information presentation order (social/asocial first) on social information use (CI: (−0.4924, 0.4901), median = 0.01072).

(c). Experiment 2

(i). Design

Subjects completed 24 trials using the mental-rotation task, linear protocol and manipulated social information, presented in a random order. The shape images were presented for a fixed time of 4 s. We manipulated (i) the number of demonstrators and (ii) the level of consensus in the social information as in experiment 1.

(ii). Analyses

We modelled social information use as a function of the number of demonstrators, demonstrator consensus and subject confidence in their initial decision. We modelled the probability that a subject's initial decision was correct as a function of their confidence in that decision.

We investigated conformity in two ways, neither of which limited the data to cases where the social information disagreed with the subject's initial decision. First, we modelled the probability a subject would change their decision. All predictors from the main analysis were included except demonstrator consensus, which was replaced with demonstrator disagreement (the proportion of demonstrators that disagreed with the subject's initial choice). Second, we modelled the probability that a subject's final decision would be that the shapes matched. Predictors used were the number of demonstrators, the proportion of demonstrators reporting a match (modelled separately as linear and categorical, see the electronic supplementary material for the reason), the subject's initial decision, their confidence in it, whether the shapes actually matched or not, and a random effect for each question.

(iii). Results

As in experiment 1, subjects were increasingly likely to switch to a conflicting majority decision as demonstrator consensus increased (CI: (1.633, 3.413), median = 2.512; figure 1c) and with decreasing confidence in their initial choice (CI: (−1.086, −0.5561), median = −0.8142; figure 1e). However, there was no effect of the number of demonstrators (CI: (−0.2128, 0.1384), median = −0.0168; figure 1d). Low subject confidence again predicted an incorrect answer (CI: (0.1125, 0.338), median = 0.226; figure 1e).

The first conformity analysis showed that subjects were increasingly likely to change their initial decision as the proportion of demonstrators that disagreed with them increased (CI: (4.042, 6.438), median = 5.123). There was an interaction between demonstrator number and disagreement (CI: (0.1743, 0.7608), median = 0.4415), producing an S-shaped curve when demonstrator number was high and subject confidence was low (figure 2a,b).

Figure 2.

Figure 2.

(a,b). The proportion of demonstrators that disagree with the subject's initial, asocial, choice strongly affects the probability that subjects will change their decision. (a) Decreases in subject's confidence based on asocial information (number of demonstrators = 12), and (b) increases in demonstrator number (subject confidence = low) increases the likelihood of a switch following conflicting social information. Subject behaviour, when uncertain and given many demonstrators, is conformist in that subjects are disproportionately likely to switch their decision when faced with a strong opposing majority. The black dashed lines portray the expected result of random copying. (c,d). The proportion of demonstrators reporting the shapes match strongly affects the probability that subjects' final decision will be that the shapes match. (c) In line with a conformist response to social information, when unconstrained (i.e. modelled as categorical), intermediate levels of consensus have a disproportionately large effect on decision making. The y-axis is the change to the linear predictor of the model, on which a change of magnitude four could alter the probability a subject decides the shapes match by as much as 76%. Accordingly, without other influences, social information is likely to have a dramatic effect on subject behaviour. (d) Subject behaviour, however, is also strongly affected by prior information and their confidence in it (the lines shown are for cases where subjects already believe that the shapes match). Thus, although subject behaviour may not be conformist, their response to the social information alone was conformist. The black dashed line portrays the expected result of random copying.

The second conformity analysis found the likelihood a subject's final decision was that the shapes matched covaried with the proportion of demonstrators reporting a match (CI: (6.558, 10.45), median = 8.296; figure 2d), an effect that increased with the number of demonstrators (i.e. a proportion × number interaction, CI: (0.3373, 1.017), median = 0.6625, ‘proportion’ modelled as linear). There was another interaction between a subject's initial decision and their confidence in that decision (initial decision CI: (5.897, 7.902), median = 5.795; confidence CI: (−0.2669, 0.005424), median = −0.1295, interaction CI: (0.4234, 0.9641), median = 0.6881). Subjects were also more likely to decide the shapes matched if they genuinely did (CI: (0.02914, 0.848), median = 0.4322, if not CI: (−0.7708, −0.1283), median = −0.4418). With proportion modelled as categorical results were qualitatively the same, but moderate consensus among the demonstrators had a disproportionate impact (figure 2c) and interactions between proportion and other factors could not be investigated.

(d). Experiment 3

(i). Design

Subjects completed 60 trials using the parallel protocol and battery social information. The trials were arranged into four groups of 15, the first and third using the mental-rotation task and the second and fourth the length-estimation task. This structure was adopted to avoid subjects becoming bored and not attending to trials. The order of trials within each group was randomized for each subject.

Prior to choosing between the two sources of information, subjects were informed of question difficulty (‘very easy’, ‘easy’, ‘moderate’, ‘hard’ or ‘very hard’) and the number of demonstrators (1, 3 or 5). For the mental-rotation task, difficulty was calculated as the proportion of previous subjects that got the question wrong (as an objective measure of task difficulty—the angle of rotation—failed to predict performance), while for the length-estimation task, it was proportional to the similarity in length of the lines.

Subjects who chose asocial information were free to examine the relevant image indefinitely, but at a cost proportional to the time spent observing that reduced their possible bonus payment. Subjects were informed of the cost incurred after each trial. Social information was free.

(ii). Analyses

We modelled the probability that an individual would choose to view social information as a function of demonstrator number, task difficulty, session number and previous costs incurred using asocial information. To investigate whether the use of social information was adaptive, we modelled the pay-off of subjects on trials where they used asocial information (using a normal error structure). The predictors were task difficulty, trial group and task.

(iii). Results

Subjects were more likely to choose to view social information on harder than easier trials (CI: (0.2518, 0.4402), median = 0.3463) and when they had incurred greater than lesser costs on the previous trial (CI: (0.02067, 0.1391), median = 0.07678). No effect of task was found (CI: (−0.444, 0.1752), median = −0.1349; see the electronic supplementary material for discussion of the effects of different tasks). There was an interaction between the trial group and the number of demonstrators (demonstrator number CI: (0.4586, 0.6316), median = 0.5488; figure 1d; trial group CI: (−0.3137, −0.05748), median = −0.1836, interaction CI: (0.02559, 0.1694), median = 0.09607). While subjects increasingly chose social information as the number of demonstrators increased, over groups of trials subjects became less likely to choose social information when there was only one (CI: (−0.5963, −0.1454), median = −0.3714) or three (CI: (−0.3036, −0.05188), median = −0.1765) demonstrators, but not when there were five (CI: (−0.1364, 0.1647), median = 0.01801).

Subject pay-off when using asocial information increased over groups of trials (CI: (0.4195, 0.7898), median = 0.6073) and was negatively related with question difficulty in trials using the length-estimation task (CI: (−0.3879, −0.01722), median = −0.1982), but not those using the mental-rotation task (CI: (−0.04259, 0.329), median = 0.1392).

(e). Experiment 4

(i). Methods

Subjects completed 45 trials using the parallel protocol arranged into three groups of 15 that deployed the pitch task, intensity task and pitch modulation task, respectively. Three tasks were in a single session to help maintain subject attention across the experiment. The first two trial groups used battery social information, and within each group, trial order was randomized for each subject. The third trial group used live social information and so subjects were required to complete individual trials simultaneously; trial order was, however, randomized between batches of subjects. If a subject chose to copy an individual who was copying someone themselves, the subject received only a message informing them of this, and therefore had to guess.

Prior to choosing a source of information, in the pitch trials, subjects were informed of the number of demonstrators available (1, 3 or 5) and the cost incurred each time they played a tone (10, 20 or 30% of maximum pay-off). In the intensity trials, subjects were informed of the degree (but not direction) of consensus among the demonstrators (0, 0.5 or 1), while the cost to listen to the tones was held constant and subjects were informed at the end of each trial of the cost incurred. In the pitch-modulation task, subjects were ranked on their performance, assayed by summing their pay-off across all pitch-modulation trials irrespective of the source of information, and were informed of their own rank and the rank of their single potential demonstrator at the start of each trial.

(ii). Analyses

For each of the three tasks, we modelled the probability that an individual would choose social information. In the pitch task, predictors were demonstrator number and the cost to play the sounds, in the intensity task, they were demonstrator consensus and costs incurred when using asocial information previously, and in the pitch-modulation task, they were subject rank, demonstrator rank and the difference between the two. To investigate whether the use of social information was adaptive, we investigated the effect of subject rank on the probability that a subject's asocial decision was correct.

(iii). Results

The likelihood of subjects choosing social information covaried positively with the consensus among demonstrators (figure 1c), the number of demonstrators (CI: (0.5161, 0.7778), median = 0.6428; figure 1d) and the cost to play the sounds (CI: (0.792, 1.279), median = 1.025). Costs incurred on the previous question had no effect (CI: (−0.1883, 0.4168), median = 0.1084) and neither did subject rank (CI: (−0.03578, 0.1501), median = 0.0557). There was a positive interaction between demonstrator rank and the difference between demonstrator and subject rank (demonstrator rank CI: (−0.4843, −0.2249), median = −0.3533, rank difference CI: (−0.1713, 0.04723), median = −0.06232, interaction CI: (0.001068, 0.05605), median = 0.03588; figure 1f)—successful demonstrators were most likely to be copied, particularly when offered to a poorly performing subject. A subject's rank predicted their asocial performance (CI: (−0.2099, −0.04212), median = −0.1255), showing that higher ranking demonstrators offered higher quality information.

3. Discussion

Our experiments show that several factors simultaneously influence human social information use, consistent with a number of social learning strategies, and that such behaviour leads to adaptive decision making. The findings provide confirmatory support for all nine hypotheses derived from evolutionary models [2,9,10,44].

Three of four experiments found that the rate of copying increased with the number of demonstrators (figure 1d). As majority decisions of larger groups are known to be more likely to be correct than those of smaller groups [54], this tendency is likely to prove adaptive. The impact of demonstrator number was enhanced when subjects chose between social and asocial information, as opposed to receiving both in sequence. This difference cannot be attributed to a commitment effect [55], where subjects stick with an earlier decision, as reversing the order of information presentation had no effect on social-information use. It may be that the linear protocol weakens the effect by limiting the analysis to instances where subjects disagree with the social information that they receive. However, it is also possible that the effect is enhanced in experiments using the parallel protocol as when subjects were informed of demonstrator number they were unaware of the consensus among them (in experiments using the linear protocol subjects knew of both) and hence number was all that subjects had to go on.

Demonstrator consensus strongly affected the rate of copying. In all cases, subjects were more likely to copy as consensus increased (figure 1c). Such behaviour is expected to be adaptive as we show using probability theory that the greater the consensus, the more likely that the majority opinion is correct, provided individuals make independent decisions and perform above chance levels (see the electronic supplementary material for formal proof). This effect is enhanced in larger groups, providing further support for the adaptiveness of copying larger groups of demonstrators.

Subject confidence in their own judgement had a negative effect on the probability that they would use social information (figure 1e). This finding is consistent with experimental work using non-human animals, which reports a copy-when-uncertain rule in rats and nine-spined sticklebacks [48,56,57]. This behaviour was adaptive, in the sense that it increased subject pay-off, as subject confidence genuinely predicted whether subjects were correct or not (figure 1e). Incorrect subjects typically expressed lower confidence than correct subjects, and so were more likely to use social information and change their answer to the correct choice.

We also found evidence consistent with a copy-when-asocial-learning-is-costly strategy [2]. In the third experiment, subjects using asocial information received lower pay-offs on harder than easier questions, however, subjects were also more likely to use social information when informed the upcoming question was difficult compared with easy and so used social information to avoid costly asocial information. Similarly, subjects in the third experiment were also more likely to choose social information after previously incurring high than low costs using asocial information. Finally, in the fourth experiment, subjects chose to use social information more frequently when the cost to listen to the sounds was high compared with when it was low.

There was a drop in social information use across trial groups in experiment 3, with a corresponding increase in asocial performance, showing that this was an adaptive response. With practice, subjects apparently became increasingly confident in their own abilities and subsequently decreased their reliance on social information. We also observed an interaction with demonstrator number, such that social information use decreased when there were three or less demonstrators, but remained steady when there were five. Subjects appeared aware of the risks of reliance on a small number of demonstrators [54] and selectively refined their use of social information accordingly as their own skill improved.

In experiment 4, subjects' performance rank did not affect the likelihood of copying, but demonstrators' performance did (figure 1f). This is strikingly consistent with the pattern of pay-off-based copying observed in nine-spined sticklebacks [29], which also copied in proportion to the success of the demonstrator, regardless of their own success. Thus, humans and fishes alike behave in accordance with ‘proportional observation’ [10]. Selectively copying higher performing demonstrators is likely to be adaptive, since such individuals are more likely to demonstrate behaviour associated with high returns [1,2]. In our experiment, subjects' performance rank did indeed predict their asocial performance, so top-ranking individuals were genuinely more skilled at the task and had not copied their way to the top. Subjects were also sensitive to the demonstrator's performance relative to their own and were even more likely to copy when being outperformed than otherwise. However, all subjects were equally unlikely to copy poorly performing demonstrators, irrespective of their own performance. Thus, information from strongly performing demonstrators was most valuable to poorly performing subjects such that behaviour matched ‘proportional observation’ with conditional ‘proportional imitation’ [10].

Subjects were sensitive to the frequencies of the two options in the demonstrators' decisions (figure 2). The observed relationships suggest the conditional deployment of a conformist social learning strategy [2,9] dependent on the number of demonstrators as well as the subject's confidence in their own judgement. Conformity results in the disproportionate adoption of popular traits at the expense of rare traits, producing an S-shaped relationship between trait frequency and probability of adoption. We found an S-shaped curve only when there were many demonstrators, and subjects express low confidences (figure 2a,b). Furthermore, formal models set the threshold for disproportionate copying at 0.5 [2], however in all cases, our results show a higher threshold. However, the formal models assume individuals choose between two variants without prior information, which was not the case under the linear protocol. Indeed, the effect of social information alone matches closely with conformity as its impact is symmetric, and moderate levels of consensus have a disproportionately large influence (figure 2c). Thus, our data are consistent with a conformist use of social information, but as subject's behaviour is the result of both social and asocial influences, the resultant behaviour may not be conformist, depending on the magnitude of the asocial influence (figure 2d).

The rarity of conditions under which conformist behaviour is realized suggests a bias towards asocial over social information. Increased subject confidence strengthens this bias, making subjects more willing to go against the consensus (figure 2a). Such behavioural flexibility may explain the contradictory findings of previous work. For example, data from the Asch experiments found that subjects copied at lower rates than predicted [18]; we suggest the simplicity of the task used probably leads to high subject confidence in asocial information, shifting the curve to a point where conformist behaviour would not be realized. We predict that, even against unanimous social information, subjects may be unlikely to copy on tasks that they believe they can readily solve asocially and that even on difficult tasks, subjects may not use social information if there are few demonstrators. Accordingly, we anticipate that conformist behaviour will depend on demonstrator number as well as the characteristics of the task.

A bias for asocial over social information has been documented elsewhere [48,58,59] and various explanations have been proposed for it. These include that (i) people have a distrust of socially acquired information, (ii) asocially acquired information has a greater learning impact, and (iii) people have prosocial tendencies that lead them to produce information [58]. We offer three further suggestions; that (iv) the tasks thus far deployed have been sufficiently simple that subjects have considerable confidence in their own judgement, (v) the low cost of making an incorrect answer favoured asocial information as, although potentially the poorer source of information, subjects found it more enjoyable (several individuals from our experiment reported finding ‘having-a-go’ rewarding), and (vi) that such a bias is the product of a mixed population of ‘conformists’ (who readily use social information) and ‘mavericks’(who resist social information) [18,60]. With regards to the latter, although our analysis finds subjects rely on social information to different extents, we do not find evidence of discrete personality types, instead subjects form a continuum ranging from conformist to ‘maverick’ behaviour.

In summary, our experiments provide conditional or strong support for the deployment of several social learning strategies predicted by the theoretical cultural evolution literature, including conformity, pay-off-based copying, copy-when-asocial-learning-is-costly and copy-when-uncertain [1,2,911,14]. We also show a strong influence of demonstrator consensus on the likelihood of taking advice. Importantly, these various influences appear to operate simultaneously, and interact to produce behaviour leading to effective decision making and higher pay-offs. These results are not incompatible with the notion that social learning may sometimes have maladaptive consequences [61], but nonetheless illustrate the adaptive aspects of cultural transmission necessary for it to have been favoured by selection. It is apparent that formal evolutionary models provide a framework for predicting the conditions that are likely to produce a reliance on social information on which maladaptive cultural traits can occasionally ride [2]. We see no reason to suggest that these learning biases must themselves be unlearned or under tight genetic control. Rather, we maintain, as is common in the evolutionary game theory literature, that even if the strategies were learned, culturally or otherwise, they can still fruitfully be investigated as though controlled by a simple genetic system, and can still be considered evolved [1,62]. The success of this approach is born out in the effectiveness of the cultural evolution theory in predicting human copying behaviour. Nonetheless, it remains an important topic for future studies to identify the mechanisms underlying implementation of these functional rules.

Acknowledgements

Ethical approval for this study was given by the University Teaching and Research Ethics Committee of the University of St Andrews (approval reference BL5 881).

This work was supported by an ERC Advanced Investigator grant (EVOCULTURE, Ref: 232823) and EU Framework 6 grant (NESTPATH, CULTAPTATION, Ref: 043434) to K.N.L.

References


Articles from Proceedings of the Royal Society B: Biological Sciences are provided here courtesy of The Royal Society

RESOURCES