Skip to main content
Journal of the Royal Society Interface logoLink to Journal of the Royal Society Interface
. 2021 Jun 2;18(179):20210082. doi: 10.1098/rsif.2021.0082

Optimal use of simplified social information in sequential decision-making

Richard P Mann 1,
PMCID: PMC8169205  PMID: 34062101

Abstract

Social animals can improve their decisions by attending to those made by others. The benefit of this social information must be balanced against the costs of obtaining and processing it. Previous work has focused on rational agents that respond optimally to a sequence of prior decisions. However, full decision sequences are potentially costly to perceive and process. As such, animals may rely on simpler social information, which will affect the social behaviour they exhibit. Here, I derive the optimal policy for agents responding to simplified forms of social information. I show how the behaviour of agents attending to the aggregate number of previous choices differs from those attending to just the most recent prior decision, and I propose a hybrid strategy that provides a highly accurate approximation to the optimal policy with the full sequence. Finally, I analyse the evolutionary stability of each strategy, showing that the hybrid strategy dominates when cognitive costs are low but non-zero, while attending to the most recent decision is dominant when costs are high. These results show that agents can employ highly effective social decision-making rules without requiring unrealistic cognitive capacities, and indicate likely ecological variation in the social information different animals attend to.

Keywords: agent-based model, collective behaviour, rational choice, social information

1. Introduction

Living in groups conveys many advantages for social animals [1,2]. One of these is the ability to make use of information provided by other individuals in the group. For example: a navigating bird may choose its direction of flight in part by following others in the flock [3,4]; foraging animals can use the presence of others at a site as an indicator of food availability [5,6]; fish can effectively avoid predators by following others who are seeking to do the same [7,8]; and demand for consumer items is often driven by their perceived popularity [9], especially amongst high-status individuals [10].

Substantial research effort has been directed to identifying precisely what types of social information that humans and animals attend to, and the rules that translate that information into behaviours. These studies broadly fall into two (not mutually exclusive) categories: data-driven analysis of the social correlates of behaviour (e.g. [1117]) and model-based inference of explicit rules of interaction (e.g. [1824]). The first approach is limited by the fact that correlations may result from non-causal relationships between observable social features and behaviour. For example, the orientation of one animal may be correlated with many others in the group beyond those it directly interacts with, since those individuals also interact with others. Furthermore, even the most sophisticated data-driven analysis (e.g. [17]) can only link what is measured, and choosing these measurements requires biological insight. The second methodology proposes causal models to explain observed behaviours, and selects between these by assessing some measure of model fit. To the extent that these models represent a valid causal pathway from social information to behaviour, this avoids the problem of spurious or misleading correlations. However, since the set of all possible causal models linking even one source of social information to behaviour is far too great to test exhaustively, model-based inference is limited in scope unless one can systematically reduce the set of plausible models using a priori biological reasoning.

What should that biological reasoning be? I argue that optimality should bridge the gap between information and behaviour. In social animals that are routinely exposed to the choices others make, responses to that information source will be subject to selection [25,26] and thus are likely to be optimized with respect to fitness. Agents who do not respond optimally to the available information should be out-competed by those who do, whether by evolved instinct or the capability to learn, depending on the life cycle of the species. This implies that individuals’ responses to social information should be understood by considering their adaptive value through rational decision-making theory [27]. However, advanced cognition and accurate perception impose significant costs that create a selective pressure against behaviours that require either, and favour simplified perceptual models and cognitive processes [28,29]. Therefore, optimal use of social information should be understood as a balance that can vary across species or contexts as these costs change.

Recent theoretical work on sequential decision-making identified the optimal response of an agent to observing a sequence of prior decisions [30,31]. These studies showed that optimal use of that information involves the full ordered sequence of decisions. In particular, in common with earlier models [32,33], this research shows that recent decisions should be given greater weight by the focal agent when deciding who to follow. It might be expected, therefore, that agents would adapt by learning or by evolution to use the full sequential nature of such social information if doing so carries no cost. However, observing and processing complex information does carry costs, and it is unlikely that real animals have the perceptual acuity, cognitive capacity and time to either deductively apply the complex recursive calculations necessary in these models, or to inductively learn or evolve appropriate responses to every possible sequence they may observe. Instead, it is likely they apply simpler rules based on reduced representations of the sequence information. One such possibility is to respond to aggregate information: the number of agents choosing each option, independent of sequence. This is the basis for many ‘quorum’ models of collective decision-making [34,35]. Another is indicated by a recent study by Kadak & Miller [24], which confirmed the importance of sequence-dependent decision-making but showed that a ‘follow the last choice’ heuristic was better able to predict decisions in zebrafish than more complex models. These simple rules allow animals to forgo significant cognitive and perceptual costs, but they remain subject to optimization by selection.

In this paper, I identify the optimal rules of social information use by identical agents with a restricted view of previous decisions. I analyse the micro-scale individual decisions and macro-scale collective behaviour of agents rationally responding to different forms of social information and thus assess what information is necessary in order to approximate the behaviour of agents responding to full decision sequences. While varying the cost of cognition and perception I analyse the expected rewards of different social information strategies in competition with each other and thus identify the evolutionary stable strategies under different cost regimes.

2. Results

In this section, I identify the optimal behaviour of identical rational agents attending to restricted subsets of the available social information in a binary choice scenario. Specifically, I focus on the use of aggregate information (the number of previous decisions made for each of the two available options) and dynamic information (the choice made by the most recent decision-maker). I analyse the optimal behaviour by mapping the observable social responses (the probability that a focal agent chooses a specific option, conditioned on the social context), and how well each strategy fits the behaviour of agents with full, unrestricted social information. Here, I derive the consequences of the optimal decision rule both for individual decision-makers and at the collective level. All calculations are performed assuming a fixed environmental noise-to-signal ratio of one (ε = 1, see Methods); previous work has shown that among identical rational agents, the value of the environmental noise level does not significantly affect social behaviour [30].

2.1. Aggregate and dynamic social information

The large majority of theoretical and modelling literature on sequential decision-making assumes that agents follow a social feedback rule based on the aggregated number of individuals that have previously chosen each option. The first model of social information I consider is therefore based on this aggregated information. An agent employing this ‘aggregate’ strategy attends to the number of agents that have previously chosen options A or B, and does not observe, or attend to, any information related to the order in which these decisions were made.

Standing against a rule based solely on aggregated previous decisions there is both theoretical and empirical evidence for the greater importance of more recent decisions. In particular, the most recent decision can override a large majority of aggregate decisions in favour of another option [24,30]. Hence, I also consider how agents should respond when they are only able to observe the most recent decision, and not the aggregate number of previous decisions. Given that this information can rapidly change relative to the aggregate number of decisions, I call this ‘dynamic’ social information, following the same terminology in [20].

For each of these two scenarios, I calculate how an agent should rationally respond to the social information they are presented with in order to maximize their expected utility, making the assumption that agents are aware that decisions have been made sequentially (although they cannot observe the full sequence), and that other agents are subject to the same limitation as themselves. I focus on identical agents (those sharing the same utility function and cognitive processes) as previous research has shown that agent preferences must be strongly aligned to produce significant social information use [31], and this assumption also permits a more straightforward mathematical treatment. These calculations (see Methods) result in a single number associated with each possible distinct observable social information, which denotes a critical threshold for the agent’s private information: if their private information exceeds this threshold they will choose one option, otherwise they will choose the other.

2.1.1. Observed social response

Based on these calculations, I analysed all possible outcomes for a group of 10 agents making sequential binary decisions. To analyse these outcomes, I first imagine making an empirical study of the observable decisions in the manner of previous attempts at direct data-driven inference of social interaction rules (e.g. [12,13,15]). I consider a scenario in which the two choices are (unknown to the agents) identical—that is, the true utilities of both options are the same. This reveals the effect of social information without inducing correlations between decisions due to agents responding to the same environmental signal. I record, for each decision, the aggregate number of previous decisions for options a and b, the identity of the most recent decision and the resulting proportion of times the focal agent chooses each option itself. The result of this analysis is shown in figure 1, where panels are arranged in two rows: the top row shows the probability for the focal agent to choose option A, conditioned on only the aggregate number of previous choices nA and nB, while the lower row shows the same information further conditioned on knowing that the most recent decision-maker chose option A. These results are further divided in columns according to each model, with the aggregate model shown in figure 1a,b, and the dynamic model in figure 1c,d. For comparison, the equivalent analysis performed on the full sequential model of [30] is shown in figure 1g,h. These results show a relatively close visual match between the aggregate model and the sequential model when viewed independently of the most recent decision, though the social reinforcement in the aggregate model is somewhat stronger than in the sequential model compressed into this aggregate projection. By contrast, the dynamic model shows a very different pattern: the probability that the focal agent will choose A is approximately independent of nA and nB except in the case where one of these values is zero. This result, if seen experimentally, could be interpreted as evidence that the agents are principally concerned not to be alone, and are otherwise content to choose any option that has already been chosen by at least one other agent. However, by construction it is clear this is not the case; instead, agents are attending to a different form of social information that is not captured by this aggregate viewpoint. The lower panels show how the apparent social interaction rules change if we include this dynamic social information: the aggregate model remains unchanged (again, by construction), while the dynamic model shows a strong, uniform tendency to follow the most recent decision. The full sequential model also shows a substantial impact of dynamic social information, modulated somewhat by the aggregate values of nA and nB. Therefore, not only does the type of social information an agent attends to affect the social interactions it exhibits, but additionally the way that social information is measured determines which model is apparently closest to the behaviour of agents with full sequential information.

Figure 1.

Figure 1.

Observed social responses by agents employing different social information strategies. (a,c,e,g) The probability the focal agent will choose A, conditioned on the number of previous decisions for A and B. (b,d,f,h) The probability that the focal agent will choose A, conditioned on the number of previous decisions for A and B, when the most recent decision was for A. Panels are organized by column for each strategy: (a,b) aggregate strategy; (c,d) dynamic strategy; (e,f) hybrid strategy; (g,h) full sequential strategy.

2.1.2. Model selection

Secondly, I imagine performing a model selection study, assessing how well the aggregate and dynamic models fit the behaviour of agents with access to the full sequential information at the individual and collective levels [36]. I choose to compare each strategy with decisions based on full sequential information, since these provide the best possible performance, and therefore are what an agent should seek to approximate. Again considering a hypothetical experiment in which the true utilities of A and B are the same, I first measure how closely a limited agent approximates the decisions made by one with access to the full sequential information. Assuming that agents actually attend to the full sequence of decisions (using rules as derived in [30]), I evaluated the probability assigned to their decisions by both the aggregate model and the dynamic model. This likelihood (the probability of the data, conditioned on a model), shown in figure 2a, indicates which model would be favoured by an individual-level model selection analysis (e.g. [19,20,23,24,37,38]). The results show that the dynamic model (blue triangle) is assigned a higher likelihood than the aggregate model (green square). Since an arbitrary number of hypothetical experiments could be considered, I show the likelihood per decision (i.e. the geometric mean probability assigned to each observed decision). I then evaluate the distribution of group outcomes: the total number of agents choosing one of the two options. Comparing this between the limited information models and the full sequential model shows to what degree the two produce similar collective behaviours. Figure 2b shows that agents attending to aggregate information (green squares) produce a collective outcome that is much closer to that produced from full sequence information (black circles) than that produced by those attending to the most recent decision (blue triangles). While all models show consensus as the most probable outcome, this tendency is weaker in the dynamic model, where intermediate outcomes are all more equally probable than in both the sequential and aggregate models.

Figure 2.

Figure 2.

Goodness-of-fit between each limited social information strategy and agents with full sequential information. Panel (a) shows the likelihood (probability per decision) assigned by models of each strategy to decisions made by fully informed (sequential strategy) agents. Panel (b) shows the probability distribution of different collective outcomes (number of agents choosing A) under each strategy.

2.1.3. Hybrid strategy

The results above pose a problem: the aggregate model produces a good approximation to observed collective behaviour in the sequential model and closely matches the sequential model when social information is measured in terms of aggregate quantities nA and nB, while the dynamic model does not. However, the dynamic model more closely predicts the results of individual decisions, and incorporates known facets of recent empirical research, as well as the importance of recent decisions identified in full sequential decision-making [30]. When dynamic information is incorporated in the view of the sequential model, the resulting measured social interactions incorporate features of both the aggregate and dynamic models (figure 1h). It is natural therefore to consider a hybrid model in which agents attend to both the aggregate numbers and to the most recent decision. In a hybrid model, a focal agent may respond differently to the same aggregate configuration of previous decisions depending on the choice made by the most recent decision-maker. This potentially allows the agent to respond to the most important element of the decision sequence, without needing to observe or attend to the full sequence. As shown in figure 2a, this hybrid model (red diamond) is assigned a substantially higher individual-based likelihood than either the aggregate or dynamic models, and nearly matches the maximum achievable likelihood as given to the full sequential model (black circle). On the group level, as shown in figure 2b, the hybrid model (red squares) achieves an almost perfect match to the collective outcomes from the full sequential model. Having established this superior performance of the hybrid model in matching the individual and collective behaviour of groups of agents with full sequential information, we can also return to the analysis of figure 1 and ask what the implied observable social interactions of the hybrid model are. Here too we see that the hybrid model is a substantially better match to the sequential model than either the aggregate or dynamical models, both when the most recent decision is ignored (figure 1e) or incorporated (figure 1f) in the analysis.

2.2. Evolutionary stability

In deriving the decision rules of agents attending to either full sequential information [30] or the limited social information models presented in this paper, a crucial mathematical assumption is that these decision processes are self-consistent. That is, the decision rule applied by an agent is the rational, optimal action if all other agents employ the same cognitive process and attend to the same information. However, in order for this assumption to hold, the relevant model employed by the agents must either be forced by external factors (if, for example, agents are unable to observe some feature of the social information by virtue of an environmental constraint), or the strategy must be resistant to invasion by agents employing a different cognitive model.

To assess the evolutionary stability of the different social information strategies, I created a scenario where an individual following one strategy attempts to invade a population composed of individuals following another. In such an ‘invasion’ a single individual applying the optimal policy based on one type of social information is placed in a group otherwise composed entirely of individuals following a different policy that is optimal for another type of social information. I calculate the expected reward gained by the invader and the rest of the population, integrating over the full range of possible true utility differences (x) and assuming that the invading individual is equally likely to find itself at any position within the sequence of decisions. This is equivalent to evaluating the long-run average rewards for each type of agent, assuming an infinite number of trials in which the agents’ decision order is randomly shuffled for each trial, and where the true value of x in each trial is drawn randomly according to equation (4.2). Figure 3a shows the difference in reward between the invader and the population for all pairs of strategies, as a percentage of the expected reward obtained by the population in the absence of an invader (note that this matrix need not be symmetric). Positive values (colour coded blue) indicate that the invader gains a greater reward than the population and thus a successful invasion, and negative values (red) indicate an unsuccessful invasion. As expected, the full sequential model, taking account of all information, can successfully invade all other strategies, and is universally resistant to invasion. The hybrid strategy can successfully invade both the dynamic and aggregate strategies. Notably, the dynamic strategy can invade the aggregate strategy, while the aggregate strategy can be successfully invaded by any other. This is despite the aggregate strategy having substantially greater complexity, in terms of the number of distinct social information configurations, than the dynamic strategy. Although the full sequential strategy gains higher rewards than any competing strategy, this comes at the cost of high complexity. An agent wishing to employ this strategy must either complete a complex recursive mathematical analysis [30], or otherwise learn how to respond to a large number of possible sequentially ordered social information configurations; assuming that no symmetry principles are employed to reduce the number of distinct possibilities there are 2n − 1 possible configurations they might observe. Other strategies also vary widely in complexity, with (1/2)n(n + 1) configurations for the aggregate strategy (each possible combination of nA and nB), n(n − 1) + 1 for the hybrid strategy (the same configurations with either two possible choices for the most recent decision, excluding impossible configurations) and just 3 for the dynamic strategy (two possible most recent decisions, or no social information). Algorithmic complexity imposes cognitive costs on agents, whether through the energy cost of maintaining sufficient neurological and sensory capacity to employ the algorithm or in the additional time it takes to learn a complex response. Therefore, I reevaluated the above evolutionary stability analysis for a range of hypothetical cognitive costs, defined as a fixed cost per distinct configuration an agent must learn to employ the strategy. These hypothetical costs are chosen to illustrate three distinct regimes: (i) for a low but non-zero cognitive cost, the hybrid model is dominant (figure 3b), successfully invading all other strategies and resisting invasion by all; (ii) with an intermediate cognitive cost, hybrid and dynamic strategies can coexist—each can invade the other when the other is dominant, and resist invasion by the sequential or aggregate strategies (figure 3c); (iii) for sufficiently high cognitive costs the dynamic strategy is dominant (figure 3d), as a result of its very low algorithmic complexity. There is no value for the cognitive cost that will make the aggregate strategy evolutionarily stable, since it is always out competed by the dynamic strategy.

Figure 3.

Figure 3.

Evolutionary stability analysis. Panel (a) shows the percentage difference in expected rewards for a single individual of each strategy invading a population of agents employing a different strategy, in the absence of any cognitive costs. Positive quantities (blue) show that the invader gains greater rewards than the other agents. Panels (bd) show a progression of increasing cognitive costs (defined as a fixed cost per distinct possible configuration of social information), showing a different single or mixed evolutionarily stable strategy for differing costs. These costs are given as a proportion of the typical reward scale (the standard deviation of x), and are for each case: (a) 0; (b) 4 × 10−7; (c) 2 × 10−5; (d) 4 × 10−5.

3. Discussion

I derived the optimal policy for agents attending to limited social information in a sequential decision-making setting, and analysed the resulting behaviour of the agents at the individual and collective level. All three decision rules based on limited social information are substantially simpler than the previously developed full sequential decision rule, when viewed in terms of the number of distinct configurations of social information that an agent must be in a position to respond to. This implies that while deductive calculation of the rational behaviour remains complex, inductive learning or adaptation is more straightforward than when attending to full sequential information. Although the application of symmetry principles and parametric function approximations may change the quantitative complexity of each strategy, the ordering of complexity is likely fixed, with dynamic social information (the most recent decision) being the simplest and fully ordered sequences of previous decisions the most complex.

The results presented in this paper reveal two key insights. First, the behaviour of agents attending to full sequential social information can be approximated to a very high degree by a simpler strategy that makes use of only the aggregate number of previous decisions for each option, along with the identity of only the most recent decision. That is, almost all the information contained in the specific ordering of decisions, above the aggregate number of choices, is contained within the most recent decision. Employing this ‘hybrid’ strategy allows agents to make nearly identical decisions to those with full sequential information without employing complex mathematical calculations, and to obtain greater rewards than two simpler strategies making use of only the aggregate quantities or the most recent decision alone. Second, attending to different forms of social information produced differing expected rewards for agents, and the ordering of these rewards did not correspond to the relative complexities of the forms of social information. In particular, attending to just the most recent decision produced greater rewards than attending to aggregate information when the two strategies were in competition.

While specific environmental or cognitive limitations may force individuals to attend to a particular feature of social information, in many cases agents can be expected to evolve the capacity to attend to information that increases their expected reward relative to competitors. In such cases the hybrid model, wherein an agent attends to both the aggregate numbers of previous decisions across different options and the identity of the most recent decision (but no other sequence information), provides a highly accurate approximation to the full sequential model, with a dramatically lower cognitive cost. It also provides a clear improvement over either aggregate or dynamic social information alone. Given the very high complexity of the full sequential model, we should therefore expect such a hybrid model to be favoured when the cost of cognition or perception is relatively small but non-negligible. When cognitive or perceptual costs are very high, it is likely that a dynamic social information model would be favoured if this information is available—this model has very little cognitive complexity and outperforms the aggregate model for any cognitive cost. The results presented here also suggest that there may be scenarios in which no single strategy is dominant. In this case, agents may specialize into multiple coexisting social information strategies. In the example shown here (figure 3c), agents employing a hybrid strategy would obtain a somewhat higher reward than agents employing a dynamic strategy, that balanced the higher cognitive cost they incur. This is analogous to the emergence of ‘informed’ and ‘uninformed’ agents in collective navigation [4,39]. Because the strategies explored in this paper are only optimal assuming that every agent employs the same strategy, further research is needed to determine how these strategies would adapt under coevolution.

Aggregate social information, which forms the basis of many models studied in the literature (e.g. [34,35,4044]), is outperformed by at least one other model for any cognitive cost (under the assumption that cognitive cost varies monotonically with the number of distinct social information configurations for which a threshold must be learned). Although the quantitative complexity of this strategy could be reduced by exploiting symmetries and by approximating the social response with a parametric representation, it is unlikely that this could reduce the complexity below that of the dynamic model. It is likely that in some contexts individuals may be limited to this information by environmental or other external constraints, such as ants attending to deposited pheromones as a proxy for the number of individuals making each choice [14,40,41] or audience members attending to the volume of applause as a proxy for the number of individuals clapping [21]. However, the poor performance of aggregate social information should serve as a check on the future use of such models without considering alternatives. Where an aggregate social information model is found to be the best fit to some empirical data, this must either be the result of some external constraint on the observable social information or a violation of one or more assumptions in this paper. For instance, the models presented here treat the choices of others as valuable knowledge solely through the information they transmit about the utilities of the possible options. An intrinsic benefit of aggregation, such as predation-dilution effects [1,45], could make knowledge of aggregate quantities useful beyond the direct social information value.

The results of this study have both a descriptive and a normative interpretation. They indicate how the use of social information is likely to have evolved when faced with cognitive limits on the complexity of social information that can be considered. They also suggest how we should respond to the decisions we observe others making: by attending to both aggregate and dynamic social information. We should be wary of options that have been chosen by many other individuals, but which few individuals have chosen recently; current unpopularity may indicate that new information has become available that outweighs the aggregate social information in favour of the choice. It is reasonable to expect that the most recent decisions may often be available to us when making a decision, even when the full sequence is not observed, and the results here suggest that combining these recent decisions with aggregate quantities is sufficient to gain most of the possible advantages of social information.

The set of strategies investigated in this study is not comprehensive, and many alternative variants on these four strategies can be hypothesized. For instance, agents might attend to the two most recent decisions rather than only one (cf. [46]), or to which side has the majority of previous decisions rather than the full aggregate quantities. By hybridizing these alternatives, an enormous number of strategies can ultimately be formed, with widely differing cognitive costs. Future research should examine what single or combined set of strategies is optimal across the full range of cognitive costs.

In this study, I have modelled both environmental and social information as abstractions, without considering how perception and cognition are achieved on a neurological level. Whereas I have modelled the cost of perception and cognition in simple terms, further research on the neural correlates of social decision-making may reveal specific forms of social information that are inherently more favourable for neurological representation. I have considered only strict subsets of the available social information as possible perceptions. A fascinating possibility is that individuals may evolve perceptual models that do not correspond in any direct way to physical reality, but which are instead adapted to act as ‘user interfaces’ with the world for the purpose of maximizing fitness [28,47]. Future research might investigate this possibility with a view to understanding how social information is perceived on a conscious level, in both humans and animals.

4. Methods

Following [30], I consider a group of identical agents faced with a sequential, binary decision, each choosing between option A and option B. These options have true utilities UA and UB. Here, utility refers to a numerical scale of desirable outcomes, and may represent, for example, fitness from an evolutionary perspective, or financial return from an economic one. I denote as x the true difference in utilities between these options: x = UAUB. In accordance with expected utility theory [27], each agent seeks to choose the option that maximizes its expected utility, based on its beliefs about x. That is:

P(chooseAI)={1,ifxp(xI)dx>00,otherwise, 4.1

where I represents all the information, both private and social, that an agent has.

Each agent begins with an identical prior distribution over the values of x; I assume this to be a normal distribution centred on zero (no prior bias for either option) and without loss of generality scale all utilities such that the variance of this distribution is one:

p(x)=ϕ(x), 4.2

where ϕ( · ) is the standard normal probability density function.

4.1. Private information

Each agent i receives private information Δi about the value of x, drawn from a normal distribution with variance ε2:

p(Δix)ϕ((Δix)ϵ). 4.3

Using their private information, each agent can independently update their belief about the value of x, using the Bayes rule:

p(xΔi)ϕ(x)ϕ((Δix)ϵ). 4.4

4.2. Social information

Each agent can further update their belief about the value of x by conditioning on the social information available to them, Si. The general form of this update again uses the Bayes rule:

p(xΔi,Si)p(x)p(Δix)P(Six). 4.5

The structure of Si and the resulting form of P(Si|x) differ when considering the different types of social information that an agent may attend to. For the case where an agent attends to the full sequence of previous decisions, Si = {C1, C2, …, Ci−1}, where Cj is the choice made by agent j (Cj = 1 or −1 if agent j chooses A and B, respectively). In this case, [30] derives a recursive expression for p(x| C1, …, Ci−1, Δi):

p(xC1,,Ci1,Δi)ϕ((Δix)ϵ)ϕ(x)×j=1i1Φ(Cj(xΔj)ϵ) 4.6

and

xp(xC1,,Cj1,Δj=Δj)dx=0, 4.7

where Φ() is the standard normal cumulative probability function, and Δ1=0 since the first agent has no social information and must therefore follow its private information. The values of Δj represent a series of critical thresholds; each agent j chooses A if and only if Δj>Δj. These thresholds are functions of the social information observed by agent j, and because this social information is public these thresholds can in principle be calculated by any other agent; this allows later agents to identify whether the private information of agent j was above or below this threshold. Hence, every possible sequence of previous decisions is ultimately associated with a critical threshold, which in turn determines that probability that the agent will be observed to select option A or B through the stochastic generation of Δj from the environment.

4.3. Observation

In common with previous developments of this model [30,31], I evaluate the probability of an agent choosing A or B by considering the position of an external observer, for whom the private information of all agents is unknown. In this paper, I assume that all observations are made under natural conditions, meaning that the experimental noise level is the same as the habitual noise level (see [30] for a discussion on the role of experimental noise). The external observer can calculate the critical information threshold Δ* for any agent by considering the social information it has access to, by the process laid out above. Having evaluated this threshold, the probability that the agent will be observed to choose option A is the probability that the agent’s (unseen) private information exceeds this threshold:

P(chooseA)=P(Δ>Δ)=Φ((xΔ)ϵ). 4.8

4.4. Simplified social information

When agents attend to simpler social information this represents a compression of the full sequence of decisions into a lower-dimensional projection, and hence this is a many-to-one relationship: multiple full sequences may be compatible with the observed social information. For example, if an agent sees that one agent has already selected A and one has selected B, this is compatible with the first decision being for A and the second for B, or vice versa.

Ultimately an agent must associate each distinct observable social information with a threshold, as with the distinct observable sequences above. This threshold specifies that value of private information Δi that would make the expected value of x zero. These thresholds could in principle be acquired through evolutionary adaptation or reinforcement learning, but in order to calculate them from first principles one must imagine the agent performing a complex cognitive process that depends on the precise nature of the simplified social information, as detailed in the specific cases below.

4.4.1. Aggregate and hybrid strategies

I take as the first case an agent that observes the aggregated number of decisions for A and B (nA and nB, respectively), but without any information regarding the order of those decisions. Consider the set S of all sequences of decisions that could have led to this configuration (of which there are (nA + nB)!/(nA!nB!) possibilities). Each sequence sS specifies a series of previous choices Ck ∈ {1, − 1}, k = 1, …, nA + nB, and associated aggregate configurations (previous values of nA and nB). Denoting the associated thresholds for these configurations as Δk within each sequence s, then agent i’s belief over x is a weighted mixture of the appropriate beliefs for each possible sequence

p(xnA,nB,Δi)p(xΔi)sSP(sx), 4.9

where

P(sx)=k=1nA+nBΦ((xCkΔk)ϵ) 4.10

is the probability of the specific sequence s, conditioned on the true utility difference x. Note that the threshold values within each sequence, Δk, are not the same values as for that sequence within the full sequential model (i.e. in equation (4.6)); instead, they are thresholds specific to the aggregate configurations nA, nB that will have resulted from the (k − 1)th choice along sequence s.

Similarly to equation (4.7), this then recursively specifies the appropriate threshold Δi for the configuration nA, nB:

xp(xnA,nB,Δi=Δi)dx=0. 4.11

An analogous process can be followed for the hybrid model, by considering all sequences that agree with the most-recent decision and the aggregate values of nA and nB. That is, taking S to be the set of all decision sequences consistent with nA and nB as above, with each element s in this set specifying an ordered sequence of decisions C1,,CnA+nB, identify the subset S′⊆ S such that for every sS′, CnA+nB=1 if the most recent decision is option A, or for every sS′, CnA+nB=1 if the most recent decision is option B. Then apply equation (4.9) with summation over the set S′ rather than S to obtain the appropriate belief over x.

4.4.2. Dynamic strategy

In the dynamic model, an agent attends only to the most recent decision. Therefore, the set of possible distinct configurations of social information is limited to three possible observations: a decision for A, a decision for B or no decision (if the focal agent is the first decision-maker). This creates a specific problem for deriving the appropriate critical thresholds: the social information that the focal agent observes may be the same as the previous agent to which it attends. This makes the problem self-referential in a way that the aggregate and hybrid models are not.

Consider the belief of a focal agent i about the value of x, having observed the previous decision Ci−1 = 1 (i.e. a choice of A) and having private information Δi:

p(xCi1=1,Δi)p(xΔi)P(Ci1=1x). 4.12

Similarly to equation (4.9), this can be expressed in terms of the probability P(s| x) of all sequences sS that satisfy Ci−1 = 1:

p(xCi1=1,Δi)p(xΔi)sSP(sx). 4.13

Note that these may be of differing lengths of up to n − 1 prior choices; the focal agent can only be aware that they are not the first decision-maker. Assuming that some threshold Δ* exists such that

P(chooseAΔi,Ci1=1)={1,ifΔi>Δ0,otherwise, 4.14

then the probability of a given sequence s = {C1, C2, …, Cm} is given by

P(sx)=1n1Φ(C1xϵ)k=2mΦ(Ck(xCk1Δ)/ϵ), 4.15

where the initial factor of 1/(n − 1) originates from the agent’s uncertainty over the number of prior decisions, which is equally likely to be anywhere from 1 to n − 1. Finally, in order to make the decision rule self-consistent requires that the expectation of x be zero when Δi = Δ*, thus

xp(xCi1=1,Δi=Δ)dx=0. 4.16

By symmetry, the equivalent threshold for the agent to apply when Ci−1 = −1 is −Δ*.

Acknowledgements

I am grateful to Joseph Bak-Coleman, Wolfram Barfuss and Graham Budd for helpful discussions and comments on the manuscript. Constructive feedback from the reviewers significantly improved the paper.

Data accessibility

Code to reproduce the analysis in this paper is included as electronic supplementary material.

Competing interests

I declare I have no competing interests.

Funding

This work was supported by UK Research and Innovation Future Leaders Fellowship grant no. MR/S032525/1.

References

  • 1.Krause J, Ruxton G. 2002. Living in groups. Oxford, UK: Oxford University Press. [Google Scholar]
  • 2.Sumpter DJT. 2010. Collective animal behavior. Princeton, NJ: Princeton University Press. [Google Scholar]
  • 3.Biro D, Sumpter DJT, Meade J, Guilford T. 2006. From compromise to leadership in pigeon homing. Curr. Biol. 16, 2123-2128. ( 10.1016/j.cub.2006.08.087) [DOI] [PubMed] [Google Scholar]
  • 4.Guttal V, Couzin ID. 2010. Social interactions, information use, and the evolution of collective migration. Proc. Natl Acad. Sci. USA 107, 16 172-16 177. ( 10.1073/pnas.1006874107) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bhattacharya K, Vicsek T. 2014. Collective foraging in heterogeneous landscapes. J. R. Soc. Interface 11, 20140674. ( 10.1098/rsif.2014.0674) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Aplin LM, Farine DR, Mann RP, Sheldon BC. 2014. Individual-level personality influences social foraging and collective behaviour in wild birds. Proc. R. Soc. B 281, 20141016. ( 10.1098/rspb.2014.1016) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Ward AJW, Herbert-Read JE, Sumpter DJT, Krause J. 2011. Fast and accurate decisions through collective vigilance in fish shoals. Proc. Natl Acad. Sci. USA 108, 2312-2315. ( 10.1073/pnas.1007102108) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Handegard NO, Boswell KM, Ioannou CC, Leblanc SP, Tjøstheim DB, Couzin ID. 2012. The dynamics of coordinated group hunting and collective information transfer among schooling prey. Curr. Biol. 22, 1213-1217. ( 10.1016/j.cub.2012.04.050) [DOI] [PubMed] [Google Scholar]
  • 9.Salganik M, Dodds P, Watts D. 2006. Experimental study of inequality and unpredictability in an artificial cultural market. Science 311, 854-856. ( 10.1126/science.1121066) [DOI] [PubMed] [Google Scholar]
  • 10.Wang J-S, Cheng Y-F, Chu Y-L. 2013. Effect of celebrity endorsements on consumer purchase intentions: advertising effect and advertising appeal as mediators. Hum. Factors Ergon. Manuf. Serv. Ind. 23, 357-367. ( 10.1002/hfm.20336) [DOI] [Google Scholar]
  • 11.Ballerini M et al. 2008. Interaction ruling animal collective behavior depends on topological rather than metric distance: evidence from a field study. Proc. Natl Acad. Sci. USA 105, 1232-1237. ( 10.1073/pnas.0711437105) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Katz Y, Tunstrom K, Ioannou C, Huepe C, Couzin I. 2011. Inferring the structure and dynamics of interactions in schooling fish. Proc. Natl Acad. Sci. USA 108, 18 720-18 725. ( 10.1073/pnas.1107583108) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Herbert-Read JE, Perna A, Mann RP, Schaerf TM, Sumpter DJT, Ward AJW. 2011. Inferring the rules of interaction of shoaling fish. Proc. Natl Acad. Sci. USA 108, 18 726-18 731. ( 10.1073/pnas.1109355108) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Perna A, Granovskiy B, Garnier S, Nicolis SC, Labédan M, Theraulaz G, Fourcassié V, Sumpter DJ. 2012. Individual rules for trail pattern formation in argentine ants (Linepithema humile). PLoS Comput. Biol. 8, e1002592. ( 10.1371/journal.pcbi.1002592) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Pettit B, Perna A, Biro D, Sumpter DJT. 2013. Interaction rules underlying group decisions in homing pigeons. J. R. Soc. Interface 10, 20130529. ( 10.1098/rsif.2013.0529) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rosenthal SB, Twomey CR, Hartnett AT, Wu HS, Couzin ID. 2015. Revealing the hidden networks of interaction in mobile animal groups allows prediction of complex behavioral contagion. Proc. Natl Acad. Sci. USA 112, 4690-4695. ( 10.1073/pnas.1420068112) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Heras FJ, Romero-Ferrero F, Hinz RC, de Polavieja GG. 2019. Deep attention networks reveal the rules of collective motion in zebrafish. PLoS Comput. Biol. 15, e1007354. ( 10.1371/journal.pcbi.1007354) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Gautrais J, Ginelli F, Fournier R, Blanco S, Soria M, Chaté H, Theraulaz G. 2012. Deciphering interactions in moving animal groups. PLoS Comput. Biol. 8, e1002678. ( 10.1371/journal.pcbi.1002678) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Mann RP, Perna A, Garnett R, Herbert-Read JE, Sumpter DJT, Ward AJW. 2013. Multi-scale inference of interaction rules in animal groups using Bayesian model selection. PLoS Comput. Biol. 9, e1002961. ( 10.1371/journal.pcbi.1002961) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Mann RP, Herbert-Read JE, Ma Q, Jordan LA, Sumpter DJT, Ward AJW. 2014. A model comparison reveals dynamic social information drives the movements of humbug damselfish (Dascyllus aruanus). J. R. Soc. Interface 11, 20130794. ( 10.1098/rsif.2013.0794) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Mann RP, Faria J, Sumpter DJT, Krause J. 2013. The dynamics of audience applause. J. R. Soc. Interface 10, 20130466. ( 10.1098/rsif.2013.0466) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Strandburg-Peshkin A et al. 2013. Visual sensory networks and effective information transfer in animal groups. Curr. Biol. 23, R709-R711. ( 10.1016/j.cub.2013.07.059) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Torney CJ, Lamont M, Debell L, Angohiatok RJ, Leclerc L-M, Berdahl AM. 2018. Inferring the rules of social interaction in migrating caribou. Phil. Trans. R. Soc. B 373, 20170385. ( 10.1098/rstb.2017.0385) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Kadak K, Miller N. 2020. Follow the straggler: zebrafish use a simple heuristic for collective decision-making. Proc. R. Soc. B 287, 20202690. ( 10.1098/rspb.2020.2690) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Ioannou CC, Guttal V, Couzin ID. 2012. Predatory fish select for coordinated collective motion in virtual prey. Science 337, 1212-1215. ( 10.1126/science.1218919) [DOI] [PubMed] [Google Scholar]
  • 26.Kotrschal A et al. 2020. Rapid evolution of coordinated and collective movement in response to artificial selection. Sci. Adv. 6, eaba3148. ( 10.1126/sciadv.aba3148) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Von Neumann J, Morgenstern O. 2007. Theory of games and economic behavior. Princeton, NJ: Princeton University Press. [Google Scholar]
  • 28.Mark JT, Marion BB, Hoffman DD. 2010. Natural selection and veridical perceptions. J. Theor. Biol. 266, 504-515. ( 10.1016/j.jtbi.2010.07.020) [DOI] [PubMed] [Google Scholar]
  • 29.Lieder F, Griffiths TL. 2020. Resource-rational analysis: understanding human cognition as the optimal use of limited computational resources. Behav. Brain Sci. 43, E1. ( 10.1017/s0140525x1900061x) [DOI] [PubMed] [Google Scholar]
  • 30.Mann RP. 2018. Collective decision making by rational individuals. Proc. Natl Acad. Sci. USA 115, E10387-E10396. ( 10.1073/pnas.1811964115) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Mann RP. 2020. Collective decision-making by rational agents with differing preferences. Proc. Natl Acad. Sci. USA 117, 10 388-10 396. ( 10.1073/pnas.2000840117) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Pérez-Escudero A, De Polavieja GG. 2011. Collective animal behavior from Bayesian estimation and probability matching. PLoS Comput. Biol. 7, e1002282. ( 10.1371/journal.pcbi.1002282) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Arganda S, Pérez-Escudero A, De Polavieja GG. 2012. A common rule for decision-making in animal collectives across species. Proc. Natl Acad. Sci. USA 109, 20 508-20 513. ( 10.1073/pnas.1210664109) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Ward AJW, Sumpter DJT, Couzin ID, Hart PJB, Krause J. 2008. Quorum decision-making facilitates information transfer in fish shoals. Proc. Natl Acad. Sci. USA 105, 6948-6953. ( 10.1073/pnas.0710344105) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Sumpter DJT, Krause J, James R, Couzin ID, Ward AJW. 2008. Consensus decision making by fish. Curr. Biol. 18, 1773-1777. ( 10.1016/j.cub.2008.09.064) [DOI] [PubMed] [Google Scholar]
  • 36.Sumpter DJT, Mann RP, Perna A. 2012. The modelling cycle for collective animal behaviour. Interface Focus 2, 764-773. ( 10.1098/rsfs.2012.0031) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Mann RP. 2011. Bayesian inference for identifying interaction rules in moving animal groups. PLoS ONE 6, e22827. ( 10.1371/journal.pone.0022827) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Strandburg-Peshkin A, Farine DR, Couzin ID, Crofoot MC. 2015. Shared decision-making drives collective movement in wild baboons. Science 348, 1358-1361. ( 10.1126/science.aaa5099) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Torney CJ, Levin SA, Couzin ID. 2010. Specialization and evolutionary branching within migratory populations. Proc. Natl Acad. Sci. USA 107, 20 394-20 399. ( 10.1073/pnas.1014316107) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Goss S, Aron S, Deneubourg J-L, Pasteels JM. 1989. Self-organized shortcuts in the argentine ant. Naturwissenschaften 76, 579-581. ( 10.1007/BF00462870) [DOI] [Google Scholar]
  • 41.Beckers R, Deneubourg J-L, Goss S. 1993. Modulation of trail laying in the ant Lasius niger (hymenoptera: Formicidae) and its role in the collective selection of a food source. J. Insect Behav. 6, 751-759. ( 10.1007/BF01201674) [DOI] [Google Scholar]
  • 42.Sumpter DJT, Pratt SC. 2009. Quorum responses and consensus decision making. Phil. Trans. R. Soc. B 364, 743-753. ( 10.1098/rstb.2008.0204) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Ward A, Krause J, Sumpter D. 2012. Quorum decision-making in foraging fish shoals. PLoS ONE 7, e32411. ( 10.1371/journal.pone.0032411) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Gallup AC, Hale JJ, Sumpter DJT, Garnier S, Kacelnik A, Krebs JR, Couzin ID. 2012. Visual attention and the acquisition of information in human crowds. Proc. Natl Acad. Sci. USA 109, 7245-7250. ( 10.1073/pnas.1116141109) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Hamilton WD. 1971. Geometry for the selfish herd. J. Theor. Biol. 31, 295-311. ( 10.1016/0022-5193(71)90189-5) [DOI] [PubMed] [Google Scholar]
  • 46.Anderson LR, Holt CA. 1997. Information cascades in the laboratory. Am. Econ. Rev. 87, 847-862. [Google Scholar]
  • 47.Hoffman D. 2019. The case against reality: why evolution hid the truth from our eyes. New York, NY: WW Norton & Company. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Code to reproduce the analysis in this paper is included as electronic supplementary material.


Articles from Journal of the Royal Society Interface are provided here courtesy of The Royal Society

RESOURCES