Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2021 Aug 20;118(34):e2107204118. doi: 10.1073/pnas.2107204118

On randomly changing conformity bias in cultural transmission

Kaleda K Denton a, Uri Liberman b, Marcus W Feldman a,1
PMCID: PMC8403876  PMID: 34417299

Significance

Animals, from humans to Drosophila, display conformity and anticonformity. Population dynamics under (anti)conformity may explain emergent properties of groups including fads, norms, and collective behavior. Although empirical evidence suggests that a population’s level of conformity can vary over time, most mathematical models have not included time-varying conformity coefficients. To potentially improve applicability to real-world systems, we allow conformity coefficients, numbers of sampled “role models,” and weak selection to vary stochastically in an established conformity model. Novel dynamics are possible, including simultaneous stochastic local stability of monomorphisms and polymorphism. Interpreting real-world population differences in terms of (anti)conformity may therefore not be straightforward. Under some conditions, however, the deterministic model provides a useful approximation to the stochastic model.

Keywords: random conformity, stochastic local stability, convergence, random selection

Abstract

Humans and nonhuman animals display conformist as well as anticonformist biases in cultural transmission. Whereas many previous mathematical models have incorporated constant conformity coefficients, empirical research suggests that the extent of (anti)conformity in populations can change over time. We incorporate stochastic time-varying conformity coefficients into a widely used conformity model, which assumes a fixed number n of “role models” sampled by each individual. We also allow the number of role models to vary over time (nt). Under anticonformity, nonconvergence can occur in deterministic and stochastic models with different parameter values. Even if strong anticonformity may occur, if conformity or random copying (i.e., neither conformity nor anticonformity) is expected, there is convergence to one of the three equilibria seen in previous deterministic models of conformity. Moreover, this result is robust to stochastic variation in nt. However, dynamic properties of these equilibria may be different from those in deterministic models. For example, with random conformity coefficients, all equilibria can be stochastically locally stable simultaneously. Finally, we study the effect of randomly changing weak selection. Allowing the level of conformity, the number of role models, and selection to vary stochastically may produce a more realistic representation of the wide range of group-level properties that can emerge under (anti)conformist biases. This promises to make interpretation of the effect of conformity on differences between populations, for example those connected by migration, rather difficult. Future research incorporating finite population sizes and migration would contribute added realism to these models.


Cavalli-Sforza and Feldman (1) studied the finite population dynamics of a trait whose transmission from one generation to the next depended on the mean value of that trait in the population. This “group transmission” constrained the within-group variability but could lead to increasing variance in the average trait value between groups. Other analyses of cultural transmission biases have incorporated characteristics of trait variation, such as the quality, and characteristics of transmitters, including success and prestige (2). Another class of transmission biases is couched in terms of the frequencies of the cultural variants in the population (3). These “frequency-dependent” biases include conformity and anticonformity, which occur when a more common variant is adopted at a rate greater or less than its population frequency, respectively (4).

Humans have exhibited conformity in mental rotation (5), line discrimination (6), and numerical discrimination tasks (7). Anticonformity has been exhibited by young children performing numerical discrimination (7). Unbiased frequency-dependent transmission, known as random copying (8), has been suggested to account for choices of dog breeds (9), Neolithic pottery motifs, patent citations, and baby names (10, 11). However, baby name distributions appear more consistent with frequency-dependent (8, 12) and/or other (13, 14) biases.

In nonhuman animals, conformity has been observed in nine-spined sticklebacks choosing a feeder (15) and great tits solving a puzzle box (16, 17) (but see ref. 18). Fruit flies displayed both conformist and anticonformist bias with respect to mate choice (19) (but these authors used a different definition of anticonformity from that of ref. 4, which we use, and therefore did not consider these behaviors to be anticonformist).

Asch (20, 21) used a different definition of conformity from ref. 4, namely “the overriding of personal knowledge or behavioral dispositions by countervailing options observed in others” (ref. 22, p. 34). Aschian conformity (22) has been observed in chimpanzees (23, 24), capuchin monkeys (25, 26) (but see ref. 27), vervet monkeys (28), and great tits (16). It has also been empirically tested in at least 133 studies of humans and, in the United States, has declined from the 1950s to the 1990s (29).

Temporal variation may also occur in forms of conformity other than Aschian. In ref. 12, popular US baby names from 1960 to 2010 show a concave turnover function indicative of negative frequency-dependent bias, but male baby names from earlier decades (1880 to 1930) show a convex turnover indicative of positive frequency-dependent or direct bias. However, most previous mathematical models of conformity have incorporated constant, rather than time-dependent, conformity coefficients.

Cavalli-Sforza and Feldman (ref. 3, chap. 3) and Boyd and Richerson (ref. 4, chap. 7) studied models of frequency-dependent transmission of a cultural trait with two variants. Boyd and Richerson (4) incorporated conformist and anticonformist bias through a conformity coefficient denoted by D. In their simplest model, if the frequency of variant A is p and that of variant B is 1p, then the frequency of variant A in the offspring generation, p, is

p=p+Dp(1p)(2p1), [1]

where D>0 entails conformity (A increases if its frequency is p>12), D<0 entails anticonformity, D=0 entails random copying, and 2<D<1. In this model, each offspring samples the cultural variants of n=3 members of the parental generation (hereafter, role models). Sampling n>3 role models requires different constraints and, if n>4, there are multiple conformity coefficients (Eq. 19).

Many subsequent models have built upon Boyd and Richerson’s (4) simplest model (Eq. 1). These have incorporated individual learning, information inaccuracy due to environmental change (3034), group selection (35), and other transmission biases, including payoff bias (36), direct bias, and prestige bias (37). Other models, which include a single conformity coefficient and preserve the essential features of Eq. 1, incorporate individual learning, environmental variability (32, 38), group selection (39), and multiple cultural variants (38).

In agent-based statistical physics models, the up and down spins of an electron are analogous to cultural variants A and B (40, 41). Individuals are nodes in a network and choose among a series of actions with specified probabilities, such as independently acquiring a spin, or sampling neighboring individuals and adopting the majority or minority spin in the sample. The number of sampled role models can be greater than three (42, 43). (Anti)conformity may occur if all (4247), or if at least r (40, 48), sampled individuals have the same variant. In contrast, Boyd and Richerson’s (4) general model (Eq. 19) allows, for example, stronger conformity to a 60% majority of role models and weaker conformity or anticonformity to a 95% majority (in humans, this might result from a perceived difference between “up-and-coming” and “overly popular” variants).

In Boyd and Richerson’s (4) general model, individuals sample n role models, which is more realistic than restricting n to 3 (as in Eq. 1); individuals may be able to observe more than three members of the previous generation. With n>4, different levels of (anti)conformity may occur for different samples j of n role models with one variant. In addition to the example above with 60 and 95% majorities, other relationships between the level of conformity and the sample j of n are possible. For example, the strength of conformity might increase as the number of role models with the more common variant increases. In a recent exploration of Boyd and Richerson’s (4) general model, we found dynamics that departed significantly from those of Eq. 1 (49). If conformity and anticonformity occur for different majorities j of n role models (i.e., j>n2), polymorphic equilibria may exist that were not possible with Eq. 1. In addition, strong enough anticonformity can produce nonconvergence: With as few as 5 role models, stable cycles in variant frequencies may arise, and with as few as 10 role models, chaos is possible. Such complex dynamics may occur with or without selection.

Here, we extend both Boyd and Richerson’s (4) simplest (Eq. 1) and general (Eq. 19) models to allow the conformity coefficient(s) to vary randomly across generations, by sampling them from probability distributions. Although some agent-based models allow individuals to switch between “conformist” and “non-” or “anticonformist” states over time (40, 42, 47, 50, 51), to our knowledge, random temporal variation in the conformity coefficients themselves has not been modeled previously. In reality, the degree to which groups of individuals conform may change over time, as illustrated by the finding that young children anti-conformed while older children conformed in a discrimination task (7); thus, it seems reasonable to expect that different generations may also exhibit different levels of conformity. Indeed, generational changes have occurred for Aschian conformity (29) and possibly in frequency-dependent copying of baby names (12). Our stochastic model may therefore produce more realistic population dynamics than previous deterministic models, and comparisons between the two can suggest when the latter is a reasonable approximation to the former.

We also allow the number of role models, nt, to vary over time. Agent-based conformity models have incorporated temporal (43) and individual (43, 45, 46) variation in the number of sampled individuals, whereas here, all members of generation t sample the same number nt of role models. Causes of variation in nt are not explored here, but there could be several. For instance, different generations of animals may sample different numbers of role models due to variation in population density. In humans, changes in the use of social media platforms or their features may cause temporal changes in the number of observed individuals. For example, when Facebook added the feature “People You May Know,” the rate of new Facebook connections in a New Orleans dataset nearly doubled (52).

In the stochastic model without selection, regardless of the fluctuation in the conformity coefficient(s), if there is conformity on average, the population converges to one of the three equilibria present in Boyd and Richerson’s (4) model with conformity (D(j)>0 for n2<j<n in Eq. 19). These are p*=1 (fixation on variant A), p*=0 (fixation on variant B), and p*=12 (equal representation of A and B). However, their stability properties may differ from those in the deterministic case. In Boyd and Richerson’s (4) model with random copying, every initial frequency p0 is an equilibrium. Here, with random copying expected and independent conformity coefficients, there is convergence to p*=0,12, or 1. In this case, and in the case with conformity expected, convergence to p*=0,12, or 1 also holds with stochastic variation in the number of role models, nt. With either stochastic or constant weak selection in Boyd and Richerson’s (4) simplest model (Eq. 1) and random copying expected, there is convergence to a fixation state (p*=0 or 1). Finally, with anticonformity in the deterministic model or anticonformity expected in the stochastic model, nonconvergence can occur.

Generalizing the Deterministic Model

Here we generalize Eq. 1 to allow the coefficient D to change randomly at each generation. If t denotes the generation number (t=0,1,2,), then Eq. 1 becomes

pt+1=pt+Dtpt(1pt)(2pt1). [2]

In Eq. 2 we assume that the random variables Dt for t=0,1,2, are identically distributed with 2<Dt<1 for n=3 role models, E(Dt)=d, Var(Dt)=σ2, and Dt are independent of pt in each generation t. We focus on the possible convergence of pt to equilibrium as t.

Eq. 1 entails that σ2=0, and we now summarize the known results for this model. If the conformity coefficients are constant over time, i.e., Dtd0 for all t, then

  • 1)

    There are three possible equilibria, p*=0, p*=1, and p*=12, where p*=0 and p*=1 correspond to fixation in type B and type A, respectively, and p*=12 is a polymorphic equilibrium where both A and B are equally represented.

  • 2)

    As t there is always convergence of pt to an equilibrium.

  • 3)

    If d>0, namely with conformity bias, if 0p0<12, then ptt0, whereas if 12<p01, then ptt1.

  • 4)

    If d<0, namely with anticonformity bias, if 0<p0<1, then ptt12.

In generalizing Eq. 1, we investigate whether, if Dt changes randomly over generations, there is also global convergence of the process {pt}t=0 and, if so, what can be said about the limit distribution. Our analysis is based on the following observations for the process {pt}t=0 (t=0,1,2,) specified in Eq. 2:

Observation 1.

  • 1)

    If 0<p0<12, then 0<pt<12 with probability 1.

  • 2)

    If 12<p0<1, then 12<pt<1 with probability 1.

Observation 2.

  • 1)

    If d=E(Dt)=0, then E(pt+1pt)=pt.

  • 2)

    If d=E(Dt)>0, then if 0<p0<12, E(pt+1pt)<pt and if 12<p0<1, then E(pt+1pt)>pt. If d=E(Dt)<0, then if 0<p0<12, E(pt+1pt)>pt, and if 12<p0<1, then E(pt+1pt)<pt.

Proof of Observation 1.

From Eq. 2 we see that

pt+112=pt121+2Dtpt(1pt). [3]

Since Dt satisfies 2<Dt<1, and 0x1 entails that 0x(1x)14, with probability 1 we have

1+2Dtpt(1pt)14pt(1pt)0, [4]

with equality only when pt=12. Therefore the signs of (pt12) and (pt+112) coincide.

Proof of Observation 2.

For all t=0,1,2

  • 1)

    If d=E(Dt)=0, then Eq. 2 gives E(pt+1pt)=pt.

  • 2)

    If 0<p0<12, then by Observation 1 we have 0<pt<12 and pt(1pt)(2pt1)<0. Hence, if d>0, Eq. 2 implies that E(pt+1pt)<pt and if d<0, then E(pt+1pt)>pt. If 12<p0<1, then with probability 1 we have 12<pt<1, and a similar calculation shows that E(pt+1pt)>pt if d>0 and E(pt+1pt)<pt if d<0.

Using the two observations, we have the following result.

Result 1.

For any starting0p01, the process{pt}t=0converges. That is,limtpt=pwith probability 1.

Proof:

From Observation 2 and the fact that 0pt1 with probability 1 for all t=0,1,2,, and any 0p01, the process ptt=0 is a bounded martingale, submartingale, or supermartingale. Therefore, the martingale convergence theorem ensures that there is a random variable p such that pttp with probability 1.

As the process ptt=0 is bounded by 0 and 1, the Lebesgue convergence theorem gives the following result:

Result 2.

For any0p01, limtE(pt)=E(p)for=1,2,.

The constant equilibria p*=0, p*=12, and p*=1 are also equilibria of the process {pt}t=0 determined by Eq. 2. Is there a relation between these constant equilibria and the random variable p? The following result gives the answer:

Result 3.

Ifpttp, thenp(1p)(2p1)=0with probability 1; namely,

Pp=0orp=12orp=1=1. [5]

Proof:

From Eq. 2, since Dt is independent of pt,

E(pt+1pt)=pt+E(Dt)pt(1pt)(2pt1), [6]

which implies that

E(pt+1)=E(pt)+dEpt(1pt)(2pt1). [7]

By Result 2, E(pt)tE(p) and E[pt(1pt)(2pt1)]tE[p(1p)(2p1)]. Therefore from Eq. 7

dEp(1p)(2p1)=0. [8]

If d0, then Eq. 8 implies E[p(1p)(2p1)]=0. Now if 0<p0<12, then also 0p12 with probability 1, and p(1p)(2p1)0. Thus in this case, p(1p)(2p1)=0 with probability 1. A similar argument applies if 12<p0<1.

If d=0, we use Eq. 2 to see that

E(pt+1pt)=pt,Var(pt+1pt)=Var(Dt)pt2(1pt)2(2pt1)2. [9]

Now Var(Dt)=σ2, and

Var(pt+1)=EVar(pt+1pt)+VarE(pt+1pt) [10]
=σ2Ept2(1pt)2(2pt1)2+Var(pt). [11]

Applying Result 2, Var(pt)tVar(p) and

Ept2(1pt)2(2pt1)2tEp2(1p)2(2p1)2.

Therefore, since σ2>0, Eq. 11 implies that

Ep2(1p)2(2p1)2=0. [12]

But p2(1p)2(2p1)20 with probability 1. Hence, Eq. 12 is possible if and only if p(1p)(2p1)=0 with probability 1.

Result 3 entails that the random variable p can take only the three values 0, 1, and 12 corresponding to the constant equilibria p*=0, p*=1, and p*=12. What are the probabilities that p takes each of these values?

Result 4.

Ifd=E(Dt)=0, then

  • 1)

    If0<p0<12,P(p=0)=12p0,P(p=12)=2p0.

  • 2)

    If12<p0<1,P(p=12)=2(1p0),P(p=1)=2p01.

Proof:

If d=E(Dt)=0, then Eq. 2 implies that E(pt+1pt)=pt, so E(pt+1)=E(pt) for all t=0,1,2,. Therefore E(pt)=p0 for all t=0,1,2,, and E(p)=p0. If 0<p0<12, then 0p12 and by Eq. 5, P(p=0orp=12)=1. Hence the random variable p takes two values 0 and 12, and as E(p)=p0, the probabilities of 0 and 12 are as in Eq. 13. The argument is similar if 12<p0<1.

If d=E(Dt)0, the analysis is more complicated because in this case it is not clear what E(p) is. We address this case using the notion developed by Karlin and Liberman (53) of stochastic local stability (SLS) of a constant equilibrium. Let xtt=0 be a sequence of random variables satisfying xt+1=f(xt) for t=0,1,2,. Then x* is a constant equilibrium of this sequence if x* satisfies x*=f(x*).

Definition: SLS.

A constant equilibrium x* is said to be SLS if for any ε>0 there is a δ>0 such that

|x0x*|<δPlimtxt=x*1ε. [15]

In our case, the sequence ptt=0 determined by Eq. 2 has three constant equilibria: p*=0, p*=1, and p*=12. Following ref. 53, to determine whether a constant equilibrium p* is SLS we examine the linear approximation of the transformation in Eq. 2 “near” p*. The linear approximations of Eq. 2 near p*=0 and p*=1 for small νt are both

νt+1=νt(1Dt), [16]

while that near p*=12 for small ηt is

ηt+1=ηt(1+Dt2). [17]

In Eq. 16, the frequency of a rare variant increases under anticonformity (Dt<0), whereas in Eq. 17, the frequency of a common variant (ηt>12) increases under conformity (Dt>0). Thus, we have the following:

Result 5.

  • 1)

    IfElog(1Dt)<0, then bothp*=0andp*=1are SLS. IfElog(1Dt)>0, thenPlimtpt=0=Plimtpt=1=0.

  • 2)

    IfElog(1+Dt2)<0, thenp*=12is SLS, whereas ifElog(1+Dt2)>0, thenPlimtpt=12=0.

As both functions y=log(1x) and y=log(1+x2) are concave (for x<1 and x>2, respectively), we apply Jensen’s inequality to obtain

Elog(1Dt)<logE(1Dt),Elog(1+Dt2)<logE(1+Dt2), [18]

since 2<Dt<1 with probability 1 and σ2=Var(Dt)>0.

Applying inequalities in Eq. 18 and Result 5, we have the following:

Result 6.

  • 1)

    Ifd=E(Dt)0, thenp*=0andp*=1are SLS.

  • 2)

    Ifd=E(Dt)0, thenp*=12is SLS.

Therefore, if d=E(Dt)=0, all three constant equilibria, p*=0, p*=1, and p*=12 are SLS. Hence p can take the three values as stated in Result 4.

The dynamics with n=3 role models are illustrated in Figs. 1 A, D, and G; 2A; and 3A. Dt is denoted by Dt(2) because it corresponds to the case where j=2 of 3 role models have the same variant [the reason for this Dt(j) notation becomes clear with n>4 and is described under Eq. 19]. Consistent with Observation 1, runs that start in (0,12) or (12,1) remain in (0,12) or (12,1), respectively. Equilibria are p*=0,12, and 1. It is possible that all three equilibria are SLS with E[Dt(2)]=0, E[Dt(2)]<0, or E[Dt(2)]>0 (Fig. 1 A, D, and G, respectively). For p*=12 to be unstable (Fig. 2A), it is necessary that E[Dt(2)]>0. In this case, the dynamics are similar to those of deterministic Eq. 1 with conformity (D>0). For p*=0 and 1 to be unstable (Fig. 3A), it is necessary that E[Dt(2)]<0, in which case the dynamics are similar to Eq. 1 with anticonformity (D<0). With n=3 and identically distributed conformity coefficients, there is always convergence of the population to an equilibrium.

Fig. 1.

Fig. 1.

Stochastic local stability of p*=0,12,1. Each plot shows 25 runs with initial p0=0.25 (red) and 25 runs with p0=0.75 (blue). Eighty generations are shown, although equilibria p*=0,12,1 may take longer to be reached; how often each equilibrium was reached is in SI Appendix, section A along with E[Dt(j)], the mean of log[1Dt(n1)], and the mean of log1+Dt(2)2 for n=3. (A) n=3, Dt(2)U[0,1) with probability 23 and Dt(2)U(2,0] with probability 13. (B) n=4, Dt(3)U[0,1) with probability 34 and Dt(3)U(3,0] with probability 14. (C) n=5, Dt(4)U[0,1) with probability 45 and Dt(4)U(4,0] with probability 15. Dt(3)U[0,2) with probability 35 and Dt(3)U(3,0] with probability 25. (D) n=3, Dt(2) is sampled from a truncated normal distribution on (2,1) with mean (before truncation) 0.05 and SD 0.5. (E) n=4, Dt(3) is sampled from a truncated normal distribution on (3,1) with mean (before truncation) 0.05 and SD 0.5. (F) n=5, Dt(4) is sampled from a truncated normal distribution on (4,1) with mean (before truncation) 0.05 and SD 0.5. Dt(3) is sampled from a truncated normal distribution on (3,2) with mean (before truncation) 0.05 and SD 0.5. (G) n=3, Dt(2)=0.99 with probability 23 and Dt(2)=1.5 with probability 13. (H) n=4, Dt(3)=0.99 with probability 34 and Dt(3)=2.5 with probability 14. (I) n=5, Dt(4)=0.9 with probability 45 and Dt(4)=3 with probability 15. Dt(3)=1.5 with probability 35 and Dt(3)=2.85 with probability 25.

Fig. 2.

Fig. 2.

Stochastic local stability of p*=0,1 and instability of p*=12. Each plot shows 25 runs with initial p0=0.499 (red) and 25 runs with p0=0.501 (blue). The first 80 generations are shown, although equilibria may take longer to be reached; how often each equilibrium was reached is in SI Appendix, section A, along with E[Dt(j)], the mean of log[1Dt(n1)], and the mean of log1+Dt(2)2 for n=3. (A) n=3, Dt(2)U[0,1) with probability 0.9 and Dt(2)U(2,0] with probability 0.1. (B) n=4, Dt(3)U[0,1) with probability 0.925 and Dt(3)U(3,0] with probability 0.075. (C) n=5, Dt(4)U[0,1) with probability 0.98 and Dt(4)U(4,0] with probability 0.02. Dt(3)U[0,2) with probability 0.55 and Dt(3)U(3,0] with probability 0.45. (D) n=4, Dt(3)=0.99 with probability 34 and Dt(3)=2.99 with probability 14. (E) n=5, Dt(4)=0.99 with probability 34 and Dt(4)=3.99 with probability 14. Dt(3)=0.1 with probability 34 and Dt(3)=0.1 with probability 14.

Fig. 3.

Fig. 3.

Stochastic local stability of p*=12 and instability of p*=0,1. Each plot shows 25 runs with initial p0=0.001 (red) and 25 runs with p0=0.999 (blue). The equilibrium p*=12 is reached in every case, although only the first 80 generations are shown and this equilibrium may take longer to be reached. SI Appendix, section A reports E[Dt(j)], the mean of log[1Dt(n1)], and the mean of log1+Dt(2)2 for n=3. (A) n=3, Dt(2)U[0,1) with probability 0.05 and Dt(2)U(2,0] with probability 0.95. (B) n=4, Dt(3)U[0,1) with probability 0.25 and Dt(3)U(3,0] with probability 0.75. (C) n=5, Dt(4)U[0,1) with probability 0.4 and Dt(4)U(4,0] with probability 0.6. Dt(3)U[0,2) with probability 0.1 and Dt(3)U(3,0] with probability 0.9. (D) n=5, Dt(4)=3.9 with probability 34, Dt(4)=0.9 with probability 14, Dt(3)=0.9 with probability 34, and Dt(3)=1.9 with probability 14.

If Dt(2) are not identically distributed, nonconvergence can occur with n=3 role models. In Fig. 4A, Dt(2) alternates between two fixed values, producing a stable, two-generation cycle in the frequency of A. This cycle is confined to a region in (0,12) or (12,1), depending on whether the initial frequency p0 is below or above p=12, respectively. [Recall that Observation 1 does not rely on the assumption of identically distributed Dt(2).]

Fig. 4.

Fig. 4.

Exact cycles. Each plot shows 25 runs with initial p0<12 (red) and 25 runs with p0>12 (blue). Eighty generations are shown, although simulations ran for 106 generations; no equilibria were reached. Descriptions of the dynamics, exact initial frequencies, and E[Dt(j)] are reported in SI Appendix, section A. (A) n=3, Dt(2) alternates between 1.15 and 0.9 every other generation. (B) n=4, Dt(3)=2.99 every third generation and Dt(3)=0.99 in the other generations. (C) n=5, Dt(4)=3.99 and Dt(3)=2.1 every third generation and in the other generations Dt(4)=0.99 and Dt(3)=0.5.

More Role Models.

The above results apply when the number of role models is n=3. Do such results for global convergence hold for any number n of role models? To address this, we follow Boyd and Richerson (4) and Eq. 2 above is replaced by

pt+1=pt+j=0nDt(j)nnjptj(1pt)nj. [19]

Eq. 19 with the subscript t removed from Dt(j) is the general form of Boyd and Richerson’s (4) deterministic model. The sum contains the probability of each possible sample j of n role models with variant A (a binomial, due to random sampling) multiplied by the scaled conformity coefficient for that sample, Dt(j)n. For any generation t=0,1,2, the vector Dt=Dt(0),Dt(1),,Dt(n) of conformity coefficients Dt(j) has the following properties for j=0,1,2,,n:

Dt(0)=Dt(n)=0,Dt(nj)=Dt(j), [20a]
and forj=1,2,,n1,j<Dt(j)<nj. [20b]

Eq. 20b ensures that probabilities of acquiring A and B are in (0,1). The left-hand side of Eq. 20a entails that if a sample contains A(B) at frequency 1, an individual acquires A(B) with probability 1, and the right-hand side entails “equal treatment” of A and B. If an individual samples j of type A and nj of type B, the amount by which it (anti)conforms to A, namely Dt(j), must be equal and opposite to the amount by which it (anti)conforms to B, Dt(j), for all probabilities to sum to 1. Therefore, when A rather than B is present in nj role models, its conformity coefficient Dt(nj) must equal that for B in the same scenario: Dt(j). Therefore, Eq. 19 is equivalent to

pt+1=pt+j=knDt(j)nnjptj(1pt)njptnj(1pt)j, [21]

where k=n2+1 if n is even and k=n+12 if n is odd.

Assume that the conformity coefficients change randomly at each generation t, and for any t=0,1,2,, Dt is a vector of random variables such that the Dts are identically distributed and independent of the population state pt. The entries of Dt, Dt(j), need not be identically distributed. Since probabilities of different samples j of n, with corresponding Dt(j), depend on pt, it is possible that higher or lower pt corresponds to greater or smaller differences between pt+1 and pt.

Assume first that E(Dt)=0; i.e., E(Dt(j))=0 for all t=0,1,2, and j=0,1,2,,n. In this case we have the following result:

Result 7.

  • 1)

    IfE(Dt)=0, then starting from any initial state0p01the processptt=0converges; that is, limtpt=pwith probability 1.

  • 2)

    If, in addition, Dt(k),Dt(k+1),,Dt(n)for anytare independent random variables, thenptakes the values0,12,1with probability 1.

Proof:

If Dt is independent of pt and E(Dt)=0 for t=0,1,2,, Eq. 19 implies that Ept+1pt=pt. Hence ptt=0 is a bounded martingale and the martingale convergence theorem implies that limtpt=p with probability 1.

Let σj2=VarDt(j) be the positive variance of Dt(j) for j=1,2,,n1 [σ0=σn=0 as Dt(0)=Dt(n)=0]. Then, if Dt(k),Dt(k+1),,Dt(n1) are independent random variables, Eq. 21 implies for any t=0,1,2, that

Var(pt+1pt)=j=kn1σj2n2nj2ptj(1pt)njptnj(1pt)j2. [22]

Since Var(pt+1)=VarE(pt+1pt)+EVar(pt+1pt), we have

Var(pt+1)=Var(pt)+j=kn1σj2n2nj2Eptj(1pt)njptnj(1pt)j2. [23]

Now as pttp with probability 1, by the Lebesgue convergence theorem E(pt)tE(p) for any =1,2,, and Eq. 23 implies that

j=kn1σj2n2nj2Epj(1p)njpnj(1p)j2=0. [24]

Hence pj(1p)nj=pnj(1p)j for j=k,k+1,,n1 and p=0, p=1, or p=1p in which case p=12. Thus, with probability 1, p takes values 0,12,1.

The following result concerning the SLS of p*=0 and p*=1 holds for E(Dt)=0 as well as E(Dt)0.

Result 8.

  • 1)

    IfElog1Dt(n1)<0, thenp*=0andp*=1are SLS.

  • 2)

    IfElog1Dt(n1)>0, thenPlimtpt=0=0andPlimtpt=1=0.

The proof is similar to that of Result 5 based on the linear approximation of Eq. 19 near p*=0 and p*=1 in equation B2 of Denton et al. (49) and the assumption that Dt(n1)<1 by Eq. 20b. If EDt(n1)>0, then using Jensen’s inequality, Elog1Dt(n1)<0, so p*=0 and p*=1 are SLS.

We are unable to obtain corresponding results for the SLS of p*=12. Consider the linear approximation of Eq. 19 near the constant equilibrium p*=12 given in ref. 49, equation B8:

ν=ν1+12n2j=kn1D(j)nnj(2jn)=νϕn. [25]

Ref. 49’s table S1 shows that depending on n, ϕn can be negative, in which case logϕn is not defined and conditions for SLS of p*=12 cannot be determined. Also, Observation 1 does not hold.

With n>3 role models and E(Dt)0, there need not be convergence to an equilibrium (Fig. 5). Below we determine conditions for convergence to an equilibrium with n=4 role models and, subsequently, a general number n of role models.

Fig. 5.

Fig. 5.

Stochastic fluctuation. Each plot shows 25 runs with initial p0=0.49 (red) and 25 runs with p0=0.51 (blue). Eighty generations are shown, although simulations ran for 106 generations; no equilibria were reached. Mean frequencies and E[Dt(j)] are reported in SI Appendix, section A. (A) n=5, Dt(4)=3.9 with probability 12, Dt(4)=3.4 with probability 12, Dt(3)=2.9 with probability 12, and Dt(3)=2.4 with probability 12. (B) n=5, Dt(4)=0.6 with probability 12, Dt(4)=0.8 with probability 12, Dt(3)=1.85 with probability 12, and Dt(3)=1.95 with probability 12.

Example: The Case of n=4 Role Models.

Recall that the recursion with fixed conformity coefficients is the same for n=3 and 4 (4). However, with random conformity coefficients the case n=4 exhibits different evolutionary dynamics from those with n=3 role models. When n=4, we have five conformity coefficients Dt(j) for j=0,1,,4. By Eq. 20a and inequality in Eq. 20b,

Dt(0)=Dt(2)=Dt(4)=0,Dt(1)=Dt(3),3<Dt(3)<1. [26]

Hence, from Eq. 21 with n=4 and k=3, we have

pt+1=pt+Dtpt(1pt)(2pt1), [27]

where Dt are identically distributed and correspond to Dt(3) in Eq. 26. Therefore in this case 1+2Dtpt(1pt) with 3<Dt<1 can be negative, and by Eq. 3, pt+112 and pt12 can have different signs. Therefore Observation 1, which was a key factor in the proof of global convergence when n=3, does not hold. Will there still be global convergence of {pt} when d=E(Dt)0 with n=4?

Since pt+112 and pt12 can have different signs, from Eq. 3 we focus on Qt=(pt12)2. Then

pt+1122=pt1221+2Dtpt(1pt)2. [28]

Now pt(1pt)=14Qt, and Eq. 28 is equivalent to

Qt+1=Qt1+2Dt14Qt2,t=0,1,2, [29]

The following result applies to the process Qtt=0:

Result 9.

Ifd=E(Dt)>0, then for any0Q014the processQtt=0converges with probability 1, and if Q=limtQt, then Qcan take only two values, Q=0 or Q=14.

Proof:

Rewrite Eq. 29 as

Qt+1=Qt1+4Dt14Qt+4Dt214Qt2. [30]

Since Dt is independent of pt and so also of Qt, if d=E(Dt) and σ2=Var(Dt), we have

EQt+1Qt=Qt1+4d14Qt+4σ2+d214Qt2. [31]

If d>0, then EQt+1QtQt for t=0,1,2,. Thus Qtt=0 is a submartingale and since 0Qt14, we can apply the martingale convergence theorem to deduce that with probability 1, Qtt=0 converges to Q. Also applying the Lebesgue convergence theorem, EQttEQ for any =1,2,.

Thus Eq. 31 entails that

E(Q)=E(Q)+4dEQ14Q+4σ2+d2EQ14Q2. [32]

Since d>0, we deduce from Eq. 32 that EQ14Q=0 and EQ14Q2=0. But 0Q14 with probability 1. Hence Q14Q=0 with probability 1 and Q can take only the two values Q=0 or Q=14.

If Q=0, then limt12pt2=0 and limtpt=12. If Q=14, then limtpt1pt=0. Therefore for t large, either pt is near p*=0 or pt is near p*=1. But if d>0, both p*=0 and p*=1 are SLS. Therefore, if Q=14, either limtpt=0 or limtpt=1.

Conclusion.

If EDt>0, then the process ptt=0 converges with probability 1. If p=limtpt, then p can assume only the three values 0, 1, 12.

The situation is more complicated when d=E(Dt)<0. Eq. 31 can be rewritten as EQt+1Qt=Qt(1+ut) with

ut=4d14Qt+4σ2+d214Qt2. [33]

Since 014Qt14, we have

ut414Qtd1+d4+σ24. [34]

If 4d+d2+σ20, or equivalently, if 4d+E(Dt2)0, then ut0 and EQt+1QtQt, making the process a bounded supermartingale so that QttQ with probability 1. For example, this occurs if Dt[3,0] so that σ2d23d by the Bhatia–Davis inequality (54). Therefore, if ut0, a similar argument to the proof of Result 9 (Eq. 32) gives

4dEQ14Q+4σ2+d2EQ14Q2=0. [35]

As 014Q14 and d1+d4+σ24<0, we have

04EQ14Qd1+d4+σ240. [36]

Inequalities in Eq. 36 imply that EQ14Q=0 and Q14Q=0 so that Q=0 or Q=14 with probability 1. Dynamics with n=4 role models are illustrated in Figs. 1 B, E, and H; 2 B and D; and 3B. The conformity coefficient, Dt, is denoted by Dt(3) (Eqs. 26 and 27). Because Observation 1 no longer holds with n>3, runs that start in (0,12) (red) or (12,1) (blue) can cross the p=12 line (e.g., Fig. 1B). Fig. 1 B, E, and H shows examples in which all three equilibria p*=0,12, and 1 are SLS with E[Dt(3)]=0, E[Dt(3)]<0, and E[Dt(3)]>0, respectively. Fig. 2 B and D shows that p*=12 can be unstable with either E[Dt(3)]>0 or E[Dt(3)]<0. However, E[Dt(3)]<0 is necessary for instability of p*=0 and 1 (Fig. 3B and Result 8). Here, the dynamics are similar to those of Eq. 1 with anticonformity (D<0), apart from fluctuations across p*=12 before reaching this equilibrium.

Removing the assumption that Dt(3) are identically distributed, Fig. 4B shows an example in which Dt(3) takes one fixed value every third generation and another fixed value in the other generations. This system produces an exact six-generation cycle in the frequency pt. Unlike in the case of n=3 (Fig. 4A), this cycle occurs around p=12 because Observation 1 does not hold.

Convergence for Any Number of Role Models.

Result 9 can be generalized to apply for any number n of role models. Recall that with n role models we have

pt+1=pt+j=knDt(j)nnjptj(1pt)njptnj(1pt)j, [37]

where k=n2+1 if n is even and k=n+12 if n is odd. Denton et al. (49) showed that for any jk

ptj1ptnjptnj1ptj=pt(1pt)(2pt1)Gjpt(1pt), [38]

where Gj is a polynomial in the variable pt(1pt) for n>4. Simple algebra shows that for any 0pt1

Gjpt(1pt)=pt(1pt)nj1×i=02jn11ptipt2jn1i0. [39]

Moreover, when j=k, if n is even and k=n2+1, then

ptk1ptnkptnk1ptk=pt(1pt)n21(2pt1), [40a]
while ifnis odd andk=n+12,ptk1ptnkptnk1ptk=pt(1pt)n12(2pt1). [40b]

As n3, in both Eqs. 40a and 40b the power of pt1pt is at least 1.

We are now ready to prove a convergence result for any n3 role models with identically distributed Dt.

Result 10.

IfEDt(j)>0for allkj<n, wherek=n2+1ifnis even andk=n+12ifnis odd, the processptt=0converges with probability 1. Moreover, if p=limtpt, then p can assume only the values 0, 1, and 12.

The proof of Result 10 is similar to the proof of Result 9 and is given in SI Appendix, section B.

To illustrate dynamics with n>4 role models, examples with n=5 are shown in Figs. 1 C, F, and I; 2 C and E; 3 C and D; and 5. With n=5 there are two Dt(j) parameters: Dt(4) and Dt(3) correspond to the cases where j=4 or 3, respectively, of the 5 role models have the same variant. Simultaneous SLS of p*=0,12, and 1 can occur with E[Dt(4)]=E[Dt(3)]=0, E[Dt(4)]<0 and E[Dt(3)]>0, or E[Dt(4)]>0 and E[Dt(3)]<0 (Fig. 1 C, F, and I, respectively). Note that all three equilibria can also be SLS when both E[Dt(4)],E[Dt(3)]<0 or both E[Dt(4)],E[Dt(3)]>0 (not shown in Fig. 1). Fixation states p*=0 and 1 may be SLS while p*=12 is unstable if E[Dt(4)]>0 and E[Dt(3)]<0 or if E[Dt(4)]<0 and E[Dt(3)]>0 (Fig. 2 C and E). This is also possible with both E[Dt(4)],E[Dt(3)]>0 or both E[Dt(4)],E[Dt(3)]<0 (not shown in Fig. 2). The condition E[Dt(4)]<0 is necessary but not sufficient for instability of p*=0 and 1 (Result 8). Fig. 3 C and D shows examples in which p*=0 and 1 are unstable and p*=12 is SLS with E[Dt(4)]<0 and either E[Dt(3)]<0 or E[Dt(3)]>0. These dynamics are similar to those in the deterministic model with n=5, D(4),D(3)<0, and sufficiently weak anticonformity or in some cases with D(4)<0 and D(3)>0. In other cases with D(4)<0 and D(3)>0 in the deterministic model, stable polymorphic equilibria with p*12 can occur (figure 1 of ref. 49). Fig. 5B incorporates these specific values, namely E[Dt(4)]=D(4) and E[Dt(3)]=D(3) from ref. 49’s figure 1, and shows stochastic fluctuation around frequencies p12. Finally, nonconvergence is also possible in the stochastic model with E[Dt(4)],E[Dt(3)]<0 (Fig. 5A) and in the deterministic model with strong anticonformity (49).

Stable cycles can also occur when the coefficients Dt cycle between fixed values (and are therefore not identically distributed). In Fig. 4C, Dt(4) and Dt(3) each take one fixed value every third generation and another fixed value in the other generations. As in Fig. 4B, an exact six-generation cycle is produced, but unlike in Fig. 4B, the red and blue runs cycle between different frequencies.

Random Number of Role Models.

Do the global convergence properties described in Results 7 and 10 for any fixed constant number n (n3) of role models also hold when the number of role models can change randomly at each generation? In this case, the frequency pt+1 of the variant A at generation t+1 for any t=0,1,2, depends on pt, nt, and Dnt, where

  • 1)

    For each t, the number of role models nt is chosen, independently of pt, from a probability distribution on the values 3,4,5,,N, which is the same for all t;

  • 2)

    Given nt, the conformity coefficients Dnt(0), Dnt(1), ,Dnt(nt) are random variables independent of pt. They are subject to the constraints for j = 0, 1, . . . , nt

Dnt(0)=Dnt(nt)=0,Dnt(j)=Dnt(ntj) [41a]
and forj=1,2,,nt1,j<Dnt(j)<ntj. [41b]

For any generation t=0,1,2, we then write pt+1 as (Eqs. 19 and 21)

pt+1=pt+j=0ntDnt(j)ntntjptj(1pt)ntj [42a]
=pt+j=ktnt1Dnt(j)ntntjptj(1pt)ntjptntj(1pt)j. [42b]

From Eqs. 42a and 42b we have the following results that generalize Results 7 and 10 to the case where the number of role models nt is chosen randomly at each generation.

Result 11.

  • 1)

    IfEDnt(j)nt=0for allj=0,1,,nt, thenlimtpt=pwith probability 1.

  • 2)

    If, in addition, givenntthe conformity coefficientsDnt(kt),Dnt(kt+1),,Dnt(nt1)are independent random variables (where kt=nt2+1 if nt is even and kt=nt+12 if nt is odd), then p assumes the values 0,1,12 with probability 1.

Result 12.

IfEDnt(j)nt>0forj=kt,kt+1,,nt1 (where kt=nt2+1 if nt is even and kt=nt+12 if nt is odd), then limtpt=p with probability 1 and pcan assume the values 0,1,12. The proofs of Results 11 and 12 are very similar to the proofs of Results 7 and 10, respectively, and are given in SI Appendix, section B.

Weak Random Selection.

To incorporate natural selection on variants A and B into the Boyd and Richerson (4) model with n=3, we assume that the fitness of A is 1+s relative to 1 for B. Then after selection, the frequency of A is given by

p=(1+s)p1+(1p)(2p1)D1+sp1+(1p)(2p1)D. [43]

Assume that selection is weak so that |s| is small. We can approximate Eq. 43 by

p(1+s)p1+(1p)(2p1)Dsp21+(1p)(2p1)D2. [44]

Suppose now that for all t(t=0,1,2,), st are identically distributed random variables and that, as before, Dt are identically distributed random variables. Then

pt+1(1+st)pt1+(1pt)(2pt1)Dtstpt21+(1pt)(2pt1)Dt2. [45]

For |st| small, we have the following result:

Result 13.

For anyt=0,1,2,, suppose thatstandDtare independent of each other, both are independent ofpt, E(Dt)=0, andE(st)0. Then with probability 1, limtpt=p and p=0 or p=1.

Proof:

Let E(st)=μ and Var(Dt)=σ2. Then, as

[1+(1pt)(2pt1)Dt]2=1+2(1pt)(2pt1)Dt+(1pt)2(2pt1)2Dt2, [46]

Eq. 45 implies that

Ept+1ptpt1+μ(1pt)1pt(1pt)(2pt1)2σ2. [47]

Since 2<Dt<1, we have σ2=EDt2<4, and as 0pt1 we have

pt(1pt)(2pt1)2σ2<1414=1. [48]

Thus if μ>0, then Ept+1ptpt, and if μ<0, then Ept+1ptpt, and the process ptt=0 is either a bounded submartingale or a bounded supermartingale. The martingale convergence theorem then ensures that limtpt=p with probability 1.

We can also classify p. For example, if μ=E(st)>0, then Eq. 47 implies that

E(pt+1)E(pt)+μ^Ept(1pt)1pt(1pt)(2pt1)2σ2 [49]

for any 0<μ^<μ. Since pttp and 0pt1, for all t=0,1,,E(pt)tE(p) for any =1,2,. Then from Eq. 49,

Ep(1p)1p(1p)(2p1)2σ20. [50]

But Eq. 48 entails that

p(1p)1p(1p)(2p1)2σ20, [51]

which means that p(1p)=0 with probability 1. We therefore infer that with probability 1, either p=p*=0 or p=p*=1. It should be pointed out that Result 13 also applies to the case when st=s=μ is constant over time.

Observe that both p*=0 and p*=1 are constant equilibria even though st and Dt are random variables. Following ref. 49, the SLS of these equilibria is determined as follows:

p*=0is SLS ifElog[(1+st)(1Dt)]<0p*=1is SLS ifElog1Dt1+st<0. [52]

Thus, using Jensen’s inequality, if E(Dt)0 and E(st)0, both Elog(1+st) and Elog(1Dt) are negative and p*=0 is SLS. If st are small for t=0,1,2,, then Elog(1Dt)(1+st)Elog[(1Dt)(1st)], and p*=1 is SLS if E(Dt)0 and E(st)0. If sts is constant, then if E(Dt)0,

p*=0is SLS if1s0p*=1is SLS ifs0. [53]

Discussion

Previous analytical models of conformity have included constant rather than variable conformity coefficient(s). Many of these include a single conformity coefficient and dynamics that follow Eq. 1 (4, 3033, 3537) or a similar formulation (32, 38, 39, 55). These dynamics are represented by an S-shaped curve and there are three equilibria at p*=0,12, and 1, with convergence to one of these. Others have implemented qualitatively different formulations of conformity that also include a single, constant conformity coefficient and have found more complex dynamics (56, 57).

A model of “strong anticonformity” was explored in ref. 56, where individuals acquire a variant that is sampled at a frequency of 0 with probability 1 (i.e., the left-hand side of Eq. 20a does not hold). In that model, stable cycles of variant frequencies can occur with n=3 role models. In refs. 56 and 57, a model of conformity was studied in which there is one cultural variant of interest, such as an item of learned information. Conformity or anticonformity occurs if this variant is present in the majority of sampled role models, but if the variant is present in a minority of role models, conformity or anticonformity to the opposite type (absence of information) does not occur (i.e., the right-hand side of Eq. 20a does not hold). In ref. 57, social and individual learners are in environments subject to change, and stable cycles in variant frequencies as well as chaos may occur. Finally, ref. 58 included a modified version of Eq. 1 in a susceptible, infectious, susceptible (SIS) epidemic-type model and found polymorphic equilibria with p*12. They also found that under some conditions, a polymorphic equilibrium and a fixation state could be stable simultaneously.

We previously showed that complex dynamics including cycles, chaos, and stable polymorphic equilibria with p*12 are possible under Boyd and Richerson’s (4) general model of conformity specified by Eqs. 19 and 20 (49). This model includes conformity coefficients that are fixed over time but can differ in sign and magnitude depending on the number of role models sampled that carry the same variant. For example, in ref. 19, fruit flies that observed 60% of conspecifics copulating with pink (or green) males showed a greater than 60% chance of copulating with the more commonly chosen male (i.e., conformist bias), whereas those that observed 83% of conspecifics copulating with pink (or green) males showed a less than 83% chance of copulating with the more commonly chosen male (i.e., anticonformist bias). Including both conformity and anticonformity that depend on the numbers of A and B in the sample of role models is necessary to allow polymorphic equilibria with p*12, but is not necessary to produce cycles or chaos. Cycles or chaos can occur when n5 or n10, respectively, and anticonformity is sufficiently strong (49).

The present analysis generalizes previous models by allowing the conformity coefficient(s) to vary over time. Temporal variation in conformity may account for differences in baby name turnover in recent versus older decades (12), and individuals’ tendency to exhibit a related bias, namely Aschian conformity, has decreased over time (29). Thus, conformity coefficients may be better regarded as random variables than fixed parameters. At each generation, t, conformity coefficient(s) are sampled from a continuous or discrete probability distribution. With n5 role models there are multiple conformity coefficients in the model (Eq. 19), which may be sampled from the same or from different probability distributions, whereas with n=3 or 4 there is a single conformity coefficient. All members of generation t display the same levels of conformity, Dt. We have not considered individual variation in conformity, which may be important in real populations. We assume that Dt is sampled independently of the population frequency, pt [although the probability of sampling j of n role models with conformity coefficient Dt(j) does depend on pt]. Association between Dt and pt, such as increasing all conformity coefficients Dt(j) with increasing pt, could be incorporated in future models to determine whether these relationships affect the presence or stochastic local stability of equilibria.

In addition, we analyzed the role of variation in the number of role models, nt, which is always finite and greater than or equal to 3. In agent-based models of conformity on networks, the number of sampled role models can vary by individual (43, 45, 46) and over time (43). In the present model, the number of role models can change over time, but is the same for all individuals in the population at a given time. For example, members of different generations may encounter different numbers of role models due to changes in the dispersion of individuals in a population or changes in modes of information transmission (e.g., social media).

In the deterministic model (4) with conformity [D(j)>0 for all n2<j<n in Eq. 19], the frequency of a common variant (p>12) increases to 1 and the frequency of an uncommon variant (p<12) decreases to 0. Thus, stable equilibria are p*=1 (variant A fixed) and p*=0 (variant B fixed), whereas the equilibrium p*=12 equal frequencies of A and B is unstable and separates the two domains of attraction. In the stochastic model with conformity expected [i.e., E[Dt(j)]>0 for n2<j<n and all t=0,1,], convergence to one of the equilibria p*=0,12, or 1 is also ensured. This is true whether the number of role models, nt, is fixed or stochastically variable. Unlike in the deterministic conformity model, here p*=12 need not be unstable, and thus all three equilibria can be SLS (Fig. 1 G and H). Moreover, if n>3, p*=12 need not separate the domains of attraction: A population with initial frequency p0<12(p0>12) can reach p*=1(p*=0) (Fig. 1H). On the other hand, if conformity is expected, n=3, and p*=12 is unstable (Fig. 2A), Boyd and Richerson’s (4) equation 1 with conformity (D>0) provides a reasonable approximation to the stochastic model.

In the deterministic model with random copying, where individuals do not favor a common or an uncommon variant [D(j)=0 for all j in Eq. 19], there is no change in variant frequencies and any initial p0 is an equilibrium. In the stochastic model, however, if E[Dt(j)]=0 for all j and all t=0,1,, and Dt(j) are independent random variables for j>n2, then there is convergence to one of p*=0,12, or 1 with probability 1. In this model, fluctuations in (anti)conformity can cause one variant to increase relative to the other; hence, there cannot be equilibria where the frequencies are nonzero and unequal (i.e., 0<p*<12 and 12<p*<1). As in the model with conformity expected, this convergence result holds with a fixed or time-varying number of role models. Fig. 1 AC shows that if n=3,4, or 5 (respectively) and random copying is expected, p*=0,12, and 1 are all SLS.

With n=3 role models, E(Dt)=0, and weak selection with E(st)0, convergence to p*=12 does not occur. This is intuitive, as E(st)0 entails that one variant is selectively favored, on average, relative to the other. In this case, convergence to a fixation state (p*=0 or 1) occurs with probability 1. With n=3 and either random copying or conformity expected, if weak selection is expected to favor variant A, p*=1 is SLS; if it is expected to favor variant B, p*=0 is SLS; and if E(st)=0, both p*=0 and 1 are SLS. Finally, all of these results for selection also hold if selection is constant, rather than randomly varying.

In the deterministic model with anticonformity [D(j)<0 for all n2<j<n in Eq. 19], convergence to an equilibrium is guaranteed if n=3 or n=4 (49), in which case p*=0 and 1 are unstable and p*=12 is globally stable, opposite to the deterministic dynamics with conformity. In the stochastic model with anticonformity expected, dynamics of Fig. 3 A (n=3) and B (n=4) are reasonably approximated by Boyd and Richerson’s (4) anticonformity model (apart from the fluctuations across p*=12 prior to convergence in the case n=4). On the other hand, Fig. 1 D and E shows that all three equilibria p*=0,12, and 1 can be SLS with n=3 or 4 and E[Dt(2)]<0 or E[Dt(3)]<0, respectively. Moreover, with n=4 and E[Dt(3)]<0, p*=12 can be unstable while p*=0 and 1 are SLS (Fig. 2D), a result more consistent with Boyd and Richerson’s (4) model of conformity than anticonformity.

In both deterministic and stochastic models with anticonformity, nonconvergence can also occur. A common variant with frequency pt>12 may be strongly disfavored due to anticonformity such that pt+1 is well below p=12; this rare variant may then be strongly favored such that pt+2 is well above p=12, and so on. Figure 2 in ref. 49 illustrates a two-generation cycle around p=12 with n=5. In the stochastic model with n=5, fluctuation around p=12 is shown in Fig. 5A.

Finally, with n>4, both conformity and anticonformity are possible for different samples j of n role models with one variant. In figure 1 of ref. 49, with n=5, D(4)=0.7 (anticonformity) and D(3)=1.9 (conformity), there are two stable polymorphic equilibria: p*=0.1927 and 1p*=0.8073. Here, in the analogous stochastic case E[Dt(4)]=0.7 and E[Dt(3)]=1.9, in Fig. 5B we see stochastic fluctuation around average frequencies p=0.1929 (if p0<12) or p=0.8072 (if p0>12). Thus, although polymorphic equilibria with p*12 were not found in the stochastic model, fluctuation occurred around average frequencies close to the stable polymorphic equilibria with p*12 from the constant conformity model.

Thus far, our discussion has pertained to cases with identically distributed Dt. This assumption is removed in Fig. 4, where Dt instead cycles between fixed values. Here, unlike in the case of identically distributed Dt, nonconvergence is possible with n=3 role models. Fig. 4A shows an example of an exact two-generation cycle with n=3, where if the initial frequency p0>12 or p0<12, the cycle occurs above or below p=12 (respectively). With n=4, cycles can occur around p=12 (Fig. 4B). These dynamics are more complex than in Boyd and Richerson’s (4) model with constant conformity coefficients, where nonconvergence is possible only for n5 (49).

Future empirical research could explore the level of conformity exhibited by members of a fixed age group over generations, allowing the amount of temporal variation in conformity to be estimated and hence which Dt distributions might be appropriate. Such research may also reveal deterministic trends in conformity over time, stimulating new models. For example, just as Aschian conformity in line discrimination tasks decreased over time (29), Boyd and Richerson (4) conformity might also decrease over time in similar mental discernment contexts. In much of our analysis, the vector of conformity coefficients (Dt) was sampled at each generation from the same multivariate distribution, and the assumption of identically distributed Dt was removed only in examples with fixed, cycling Dt (Fig. 4). Additional research could explore how variation in the number of role models, nt, is associated with the level of conformity. In our model, for a given nt=n, an n-dimensional conformity vector (Dt) is sampled from a probability distribution, but we do not include any known relationships between Dt and nt, which may exist and be important. For example, (anti)conformity might be stronger if 70 of 100 sampled individuals share a variant compared to 7 of 10 sampled individuals, as the smaller sample might be perceived as a less reliable indicator of population-level variation.

Other extensions of our analysis could model finite rather than infinite populations, subpopulations with migration among them, and individual-level variation in conformity. Finite populations experience greater stochastic fluctuation in variant frequencies than infinite populations, resulting in elimination of polymorphic equilibria. Migration may also greatly alter population dynamics. For example, incorporating migration into Boyd and Richerson’s (4) deterministic conformity model with infinite population sizes can produce surprising polymorphic equilibria (p*12) (49). In infinite, multipopulation models, it would be interesting to determine whether incorporating stochastic variation in conformity would eliminate or allow stochastic local stability of these polymorphic equilibria. Finally, levels of conformity need not be the same for all individuals in a population (41, 56), which we have assumed. Incorporating individual-level variation in conformity could produce more realistic dynamics, as well as enable a more direct application to empirical research on individual biases.

In conclusion, if conformity coefficients vary over time, evolutionary dynamics are seen that are not possible with constant conformity coefficients. In the constant conformity model, at least one equilibrium must be unstable (49), whereas here all equilibria can be stochastically locally stable simultaneously. We find that irrespective of whether n is constant or time varying, if conformity is expected [E[Dt(j)]>0 for all n2<j<n] or random copying is expected [E[Dt(j)]=0 for all j and Dt(j) are independent], convergence to an equilibrium p*=0,12, or 1 is ensured. If random copying is expected, n=3, and there is weak selection that favors one variant over the other on average, only fixation states p*=0 or 1 are reached as t. With anticonformity expected, nonconvergence is possible under different conditions from those in previous deterministic models (49). Finally, when conformity coefficients are not identically distributed, exact cycles can arise with as few as n=3 role models. Thus, temporal fluctuations in the patterns of conformity can cause significant departures from the conclusions that have been drawn from models with constant levels of conformity.

Supplementary Material

Supplementary File

Acknowledgments

This research was supported in part by the Morrison Institute for Population and Research Studies at Stanford University and the Stanford Center for Computational, Evolutionary, and Human Genomics.

Footnotes

The authors declare no competing interest.

This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2107204118/-/DCSupplemental.

Data Availability

There are no data underlying this work.

References

  • 1.Cavalli-Sforza L., Feldman M. W., Models for cultural inheritance. I. Group mean and within group variation. Theor. Popul. Biol. 4, 42–55 (1973). [DOI] [PubMed] [Google Scholar]
  • 2.Henrich J., McElreath R., The evolution of cultural evolution. Evol. Anthropol. 12, 123–135 (2003). [Google Scholar]
  • 3.Cavalli-Sforza L. L., Feldman M. W., Cultural Transmission and Evolution: A Quantitative Approach (Princeton University Press, 1981). [PubMed] [Google Scholar]
  • 4.Boyd R., Richerson P. J., Culture and the Evolutionary Process (University of Chicago Press, 1985). [Google Scholar]
  • 5.Morgan T. J. H., Rendell L. E., Ehn M., Hoppitt W., Laland K. N., The evolutionary basis of human social learning. Proc. Biol. Sci. 279, 653–662 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Muthukrishna M., Morgan T. J. H., Henrich J., The when and who of social learning and conformist transmission. Evol. Hum. Behav. 37, 10–20 (2016). [Google Scholar]
  • 7.Morgan T. J. H., Laland K. N., Harris P. L., The development of adaptive conformity in young children: Effects of uncertainty and consensus. Dev. Sci. 18, 511–524 (2015). [DOI] [PubMed] [Google Scholar]
  • 8.Mesoudi A., Lycett S. J., Random copying, frequency-dependent copying and culture change. Evol. Hum. Behav. 30, 41–48 (2009). [Google Scholar]
  • 9.Herzog H. A., Bentley R. A., Hahn M. W., Random drift and large shifts in popularity of dog breeds. Proc. Biol. Sci. 271 (suppl. 5), S353–S356 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Bentley R. A., Hahn M. W., Shennan S. J., Random drift and culture change. Proc. Biol. Sci. 271, 1443–1450 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Hahn M. W., Bentley R. A., Drift as a mechanism for cultural change: An example from baby names. Proc. Biol. Sci. 270 (suppl. 1), S120–S123 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Acerbi A., Bentley R. A., Biases in cultural transmission shape the turnover of popular traits. Evol. Hum. Behav. 35, 228–236 (2014). [Google Scholar]
  • 13.Gureckis T. M., Goldstone R. L., How you named your child: Understanding the relationship between individual decision making and collective outcomes. Top. Cogn. Sci. 1, 651–674 (2009). [DOI] [PubMed] [Google Scholar]
  • 14.Berger J., Bradlow E. T., Braunstein A., Zhang Y., From Karen to Katie: Using baby names to understand cultural evolution. Psychol. Sci. 23, 1067–1073 (2012). [DOI] [PubMed] [Google Scholar]
  • 15.Pike T. W., Laland K. N., Conformist learning in nine-spined sticklebacks’ foraging decisions. Biol. Lett. 6, 466–468 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Aplin L. M., et al., Experimentally induced innovations lead to persistent culture via conformity in wild birds. Nature 518, 538–541 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Aplin L. M., Sheldon B. C., McElreath R., Conformity does not perpetuate suboptimal traditions in a wild population of songbirds. Proc. Natl. Acad. Sci. U.S.A. 114, 7830–7837 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.van Leeuwen E. J. C., Kendal R. L., Tennie C., Haun D., Conformity and its look-a-likes. Anim. Behav. 110, e1–e4 (2015). [Google Scholar]
  • 19.Danchin E., et al., Cultural flies: Conformist social learning in fruitflies predicts long-lasting mate-choice traditions. Science 362, 1025–1030 (2018). [DOI] [PubMed] [Google Scholar]
  • 20.Asch S. E., Social Psychology (Prentice-Hall, Englewood Cliffs, NJ, 1952). [Google Scholar]
  • 21.Asch S. E., Studies of independence and conformity: I. A minority of one against a unanimous majority. Psychol. Monogr. 70, 1–70 (1956). [Google Scholar]
  • 22.Whiten A., “Conformity and over-imitation: An integrative review of variant forms of hyper-reliance on social learning” in Advances in the Study of Behavior, Naguib M., et al., Eds. (Academic Press, 2019), vol. 51, chap. 2, pp. 31–75. [Google Scholar]
  • 23.Whiten A., Horner V., de Waal F. B. M., Conformity to cultural norms of tool use in chimpanzees. Nature 437, 737–740 (2005). [DOI] [PubMed] [Google Scholar]
  • 24.Luncz L. V., Boesch C., Tradition over trend: Neighboring chimpanzee communities maintain differences in cultural behavior despite frequent immigration of adult females. Am. J. Primatol. 76, 649–657 (2014). [DOI] [PubMed] [Google Scholar]
  • 25.Dindo M., Thierry B., Whiten A., Social diffusion of novel foraging methods in brown capuchin monkeys (Cebus apella). Proc. Biol. Sci. 275, 187–193 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Dindo M., Whiten A., de Waal F. B. M., In-group conformity sustains different foraging traditions in capuchin monkeys (Cebus apella). PLoS One 4, e7858 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.van Leeuwen E. J. C., Haun D. B. M., Conformity without majority? The case for demarcating social from majority influences. Anim. Behav. 96, 187–194 (2014). [Google Scholar]
  • 28.van de Waal E., Borgeaud C., Whiten A., Potent social learning and conformity shape a wild primate’s foraging decisions. Science 340, 483–485 (2013). [DOI] [PubMed] [Google Scholar]
  • 29.Bond R., Smith P. B., Culture and conformity: A meta-analysis of studies using Asch’s (1952b, 1956) line judgment task. Psychol. Bull. 119, 111–137 (1996). [Google Scholar]
  • 30.Henrich J., Boyd R., The evolution of conformist transmission and the emergence of between-group differences. Evol. Hum. Behav. 19, 215–241 (1998). [Google Scholar]
  • 31.Kameda T., Nakanishi D., Cost-benefit analysis of social/cultural learning in a nonstationary uncertain environment: An evolutionary simulation and an experiment with human subjects. Evol. Hum. Behav. 23, 373–393 (2002). [Google Scholar]
  • 32.Nakahashi W., The evolution of conformist transmission in social learning when the environment changes periodically. Theor. Popul. Biol. 72, 52–66 (2007). [DOI] [PubMed] [Google Scholar]
  • 33.Wakano J. Y., Aoki K., Do social learning and conformist bias coevolve? Henrich and Boyd revisited. Theor. Popul. Biol. 72, 504–512 (2007). [DOI] [PubMed] [Google Scholar]
  • 34.Borofsky T., Feldman M. W., Static environments with limited resources select for multiple foraging strategies rather than conformity. Ecol. Monogr. [Google Scholar]
  • 35.Molleman L., Quiñones A. E., Weissing F. J., Cultural evolution of cooperation: The interplay between forms of social learning and group selection. Evol. Hum. Behav. 34, 342–349 (2013). [Google Scholar]
  • 36.Henrich J., Boyd R., Why people punish defectors. Weak conformist transmission can stabilize costly enforcement of norms in cooperative dilemmas. J. Theor. Biol. 208, 79–89 (2001). [DOI] [PubMed] [Google Scholar]
  • 37.Henrich J., Cultural transmission and the diffusion of innovations: Adoption dynamics indicate that biased cultural transmission is the predominate force in behavioral change. Am. Anthropol. 103, 992–1013 (2001). [Google Scholar]
  • 38.Nakahashi W., Wakano J. Y., Henrich J., Adaptive social learning strategies in temporally and spatially varying environments: How temporal vs. spatial variation, number of cultural traits, and costs of learning influence the evolution of conformist-biased transmission, payoff-biased transmission, and individual learning. Hum. Nat. 23, 386–418 (2012). [DOI] [PubMed] [Google Scholar]
  • 39.Lehmann L., Feldman M. W., The co-evolution of culturally inherited altruistic helping and cultural transmission under random group formation. Theor. Popul. Biol. 73, 506–516 (2008). [DOI] [PubMed] [Google Scholar]
  • 40.Nyczka P., Sznajd-Weron K., Anticonformity or independence?—Insights from statistical physics. J. Stat. Phys. 151, 174–202 (2013). [Google Scholar]
  • 41.Jedrzejewski A., Sznajd-Weron K., Statistical physics of opinion formation: Is it a SPOOF? C. R. Phys. 20, 244–261 (2019). [Google Scholar]
  • 42.Nyczka P., Sznajd-Weron K., Cisło J., Phase transitions in the q-voter model with two types of stochastic driving. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 86, 011105 (2012). [DOI] [PubMed] [Google Scholar]
  • 43.Radosz W., Mielnik-Pyszczorski A., Brzezińska M., Sznajd-Weron K., Q-voter model with nonconformity in freely forming groups: Does the size distribution matter? Phys. Rev. E 95, 062302 (2017). [DOI] [PubMed] [Google Scholar]
  • 44.Castellano C., Muñoz M. A., Pastor-Satorras R., Nonlinear q-voter model. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 80, 041129 (2009). [DOI] [PubMed] [Google Scholar]
  • 45.Mellor A., Mobilia M., Zia R. K. P., Characterization of the nonequilibrium steady state of a heterogeneous nonlinear q-voter model with zealotry. Europhys. Lett. 113, 48001 (2016). [Google Scholar]
  • 46.Mellor A., Mobilia M., Zia R. K. P., Heterogeneous out-of-equilibrium nonlinear q-voter model with zealotry. Phys. Rev. E 95, 012104 (2017). [DOI] [PubMed] [Google Scholar]
  • 47.Jedrzejewski A., Sznajd-Weron K., Person-situation debate revisited: Phase transitions with quenched and annealed disorders. Entropy (Basel) 19, 1–14 (2017). [Google Scholar]
  • 48.Vieira A. R., Anteneodo C., Threshold q-voter model. Phys. Rev. E 97, 052106 (2018). [DOI] [PubMed] [Google Scholar]
  • 49.Denton K. K., Ram Y., Liberman U., Feldman M. W., Cultural evolution of conformity and anticonformity. Proc. Natl. Acad. Sci. U.S.A. 117, 13603–13614 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Javarone M. A., Social influences in opinion dynamics: The role of conformity. Physica A 414, 19–30 (2014). [Google Scholar]
  • 51.Jędrzejewski A., Pair approximation for the q-voter model with independence on complex networks. Phys. Rev. E 95, 012307 (2017). [DOI] [PubMed] [Google Scholar]
  • 52.Malik M. M., Pfeffer J., “Identifying platform effects in social media data” in Proceedings of the Tenth International AAAI Conference on Web and Social Media (ICWSM 2016) Strohmeier M., Gummadi K.P., eds. (AAAI, in Palo Alto, CA, 2016), vol. 10, pp. 241–249.
  • 53.Karlin S., Lieberman U., Random temporal variation in selection intensities: Case of large population size. Theor. Popul. Biol. 6, 355–382 (1974). [DOI] [PubMed] [Google Scholar]
  • 54.Bhatia R., Davis C., A better bound on the variance. Am. Math. Mon. 107, 353–357 (2000). [Google Scholar]
  • 55.Lachlan R. F., Janik V. M., Slater J. B., The evolution of conformity-enforcing behaviour in cultural communication systems. Anim. Behav. 68, 561–570 (2004). [Google Scholar]
  • 56.Efferson C., Lalive R., Richerson P. J., McElreath R., Lubell M., Conformists and mavericks: The empirics of frequency-dependent cultural transmission. Evol. Hum. Behav. 29, 56–64 (2008). [Google Scholar]
  • 57.Kendal J., Giraldeau L. A., Laland K., The evolution of social learning rules: Payoff-biased and frequency-dependent biased transmission. J. Theor. Biol. 260, 210–219 (2009). [DOI] [PubMed] [Google Scholar]
  • 58.Walters C. E., Kendal J. R., An SIS model for cultural trait transmission with conformity bias. Theor. Popul. Biol. 90, 56–63 (2013). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File

Data Availability Statement

There are no data underlying this work.


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES