Skip to main content
PLOS One logoLink to PLOS One
. 2021 Aug 26;16(8):e0252540. doi: 10.1371/journal.pone.0252540

To maximize or randomize? An experimental study of probability matching in financial decision making

Andrew W Lo 1,2,3,4,#, Katherine P Marlowe 2,#, Ruixun Zhang 2,*,#
Editor: Jason Anthony Aimone5
PMCID: PMC8389367  PMID: 34437550

Abstract

Probability matching, also known as the “matching law” or Herrnstein’s Law, has long puzzled economists and psychologists because of its apparent inconsistency with basic self-interest. We conduct an experiment with real monetary payoffs in which each participant plays a computer game to guess the outcome of a binary lottery. In addition to finding strong evidence for probability matching, we document different tendencies towards randomization in different payoff environments—as predicted by models of the evolutionary origin of probability matching—after controlling for a wide range of demographic and socioeconomic variables. We also find several individual differences in the tendency to maximize or randomize, correlated with wealth and other socioeconomic factors. In particular, subjects who have taken probability and statistics classes and those who self-reported finding a pattern in the game are found to have randomized more, contrary to the common wisdom that those with better understanding of probabilistic reasoning are more likely to be rational economic maximizers. Our results provide experimental evidence that individuals—even those with experience in probability and investing—engage in randomized behavior and probability matching, underscoring the role of the environment as a driver of behavioral anomalies.

1 Introduction

Most economic theories are based on the premise that individuals maximize their self-interest and correctly incorporate the structure of their environment into their decisions. This framework has led to numerous advances, including expected utility theory [1], game theory [1, 2], rational expectations [3], the efficient markets hypothesis [4, 5], and option pricing theory [6, 7]. The influence of this paradigm goes far beyond academia; it underlies current macroeconomic and monetary policy making, becoming an integral component of the rules and regulations that govern financial markets today [8, 9].

However, there is mounting empirical and experimental evidence suggesting that humans do not always behave in the way traditional economic models predict, but often make seemingly random and suboptimal decisions [10]. These behavioral anomalies and psychological traits are especially pronounced when elements of risk and probability are involved. Examples include loss aversion [1114], overconfidence [15, 16], overreaction [17], herding [18] psychological accounting [19], miscalibration of probabilities [20], the uncertainty effect [21], and confirmation bias [22]. The spectacular rise of US stock market prices during the tech bubble in the early 2000s, and the even more spectacular crash following the 2007–2008 financial crisis, has intensified the controversy surrounding the rationality of investors.

One particularly interesting behavioral anomaly is probability matching, also known as the “matching law,” or Herrnstein’s Law [2330]—the tendency of the relative frequency of predictions of outcomes of an independent randomized event to match its underlying probability distribution. The best-known example of probability matching is the human tendency to choose randomly between heads and tails when asked to guess the outcomes of a series of biased coin tosses. When individuals are asked to guess the repeated outcomes of a biased coin, say with a bias of 70% heads, and rewarded based on whether they guessed correctly, most subjects seem to randomize their guesses at around 70% heads, instead of engaging in the economically optimal behavior of always guessing heads.

Probability matching has long puzzled economists and psychologists because of its apparent inconsistency with basic self-interest. The idea of randomizing behavioris especially difficult to reconcile with the standard economic paradigm of expected utility theory, in which individual behavior is non-stochastic and completely determined by the individual’s utility function, budget constraints, and the probability laws governing the environment. For example, Kogler and Kuhberger [31] report that, “Experimental research in simple repeated risky choices shows a striking violation of rational choice theory: the tendency to match probabilities by allocating the frequency of response in proportion to their relative probabilities”.

Nevertheless, probability matching has been observed in thousands of geographically diverse human subjects over several decades, as well as in other animal species, including ants [3235], bees [3638], fish [39, 40], pigeons [41, 42], and primates [43]. In virtually any setting where an animal is able to make a choice between A versus B in a randomized experiment, we observe probability matching.

The source of these irrational behaviors is often attributed to psychological factors, such as fear, greed, and other emotional responses. However, the fact that some of these behaviors are observed so consistently across species suggests that they may have a more fundamental and common origin, one with an evolutionary role that belies their apparent shortcomings. For example, the neurological basis of probability matching has been investigated extensively [4449]. In the context of a binary choice model, Brennan and Lo [50] show that probability matching behavior is perfectly consistent with evolution, arising purely from the forces of natural selection and population growth. Moreover, under generalized environmental conditions, i.e., broad assumptions about the conditions required for reproductive success, they derive more general types of behavior that involve randomization, but not necessarily strict probability matching.

In this paper, we present the first experimental test of the evolutionary model of Brennan and Lo [50]. We design an experiment in real-world decision making with monetary payoffs to measure the degree of probability matching among individuals, its determining factors, and its level of variation. Here by probability matching we mean the “matching law,” or Herrnstein’s Law discussed above—the tendency to choose randomly between heads and tails when asked to guess the outcomes of a series of biased-coin tosses, where the randomization frequency matches the probability of the biased coin.

We recruited a sample of 82 volunteers from the MIT Behavioral Research Laboratory to participate in our experiment. Each participant played a computer game consisting of 200 trials of a binary choice decision. In each trial, an image of either Angelina Jolie or Brad Pitt was displayed with a certain probability, and subjects were paid according to the number of trials in which they correctly guessed which image appeared.

By varying the payoff structure of the game, we were able to test whether subjects showed probability matching behavior, and whether deviations occurred as predicted by the model in Brennan and Lo [50]. Specifically, we designed several payoffs where the evolutionarily dominant behavior was either to maximize, i.e., always to choose one option, or to randomize, i.e., to choose randomly between two options. We found strong evidence for a behavioral difference between theoretical maximizers and theoretical randomizers, as predicted by Brennan and Lo [50]. After controlling for a wide range of demographic and socioeconomic variables, theoretical randomizers still engaged in randomizing behavior more often than theoretical maximizers. When facing different environments (i.e., payoffs in the experiment), our subjects responded differently by adapting to the new conditions and showing different stable behaviors.

We were also able to study individual differences in the tendency to maximize or randomize by collecting basic demographic and socioeconomic information from the anonymous participants. We found that subjects with a higher level of financial assets tended to randomize less often, while subjects with children tend to randomize more often. Moreover, subjects who had taken probability and statistics classes and those who self-reported finding a pattern in the game (none existed) also tended to randomize more often, contrary to our prior expectation that those participants with a better understanding of probability might be more likely to adopt the economically maximizing behavior. In fact, we found that those subjects engage in the exact opposite behavior. This may be due to an attempt to “beat the game,” based on the qualitative answers to our post-trial survey by participants.

From the evolutionary perspective, the key to understanding these behavioral predictions lies in the assumption of systematic reproductive risk [50, 51]. The experiment we describe in this article involves a binary choice in which the risks to the population are idiosyncratic, that is, the outcomes of one individual’s choice are independent of those of another. However, when individuals with preferences formed in response to systematic risks are placed in the different environment, there is the potential for probability matching to occur, creating what appears to be irrational behaviors for those environments.

Our results contribute to the growing literature on rationalizing the existence of probability matching. As far back as the 1950s, researchers [5254] developed statistical models that attempted to explain and predict matching behavior. Since then, several behavioral reasons have been offered, including its emergence as a consequence of pattern searching [55], through the greater utility gained from guessing the rarer event correctly [56], and by the role of diversification to avoid boredom [57]. More recently, explanations of probability matching have been proposed from an evolutionary point of view. Wolford, Miller, and Gazzaniga [45] argue that early human beings look for explanatory causal relationships as a survival strategy. Wozny, Beierholm, and Shams [58] have shown that humans match probabilities not only in cognitive tasks, but also in perceptual tasks. This implies that the human nervous system has a built-in function that samples from a distribution of hypotheses, and updates its belief after each observation.

Our results provide experimental validation for the predictions of Brennan and Lo [50], as well as additional evidence that individuals engage in randomized behavior and probability matching, even those with prior experience in probability and investing. More importantly, our results may provide an explanation for several notable departures from exact probability matching [31, 59, 60]. Randomizing behavior that matches environmental probabilities depends on the relative reproductive success of the outcomes, and the evolutionary framework proposed in Brennan and Lo [50] offers a simple and specific set of conditions for understanding and predicting such behavior.

2 Materials and methods

2.1 Evolutionary origins of probability matching

Brennan and Lo [50] proposed an evolutionary framework for the origin of several behaviors that are considered “anomalous” in economic theories based on the assumption of rational behavior. In particular, probability matching—the tendency of the relative frequency of guesses of the outcomes of a sequence of independent random events to match the underlying probability distribution of events—can be explained when the uncertainty in environment is systematic across all individuals, an example demonstrating that natural selection is able to yield behaviors that may be individually sub-optimal but are optimal for the population. For expositional convenience, we present a brief review of this framework here, and then turn to our experimental design.

We begin with a population of individuals that live for one period, produce a random number of offspring asexually, and then die. During their lives, individuals make only one decision: they choose one of two possible courses of action, denoted a and b, and this choice results in one of two corresponding random numbers of offspring, xa and xb, given by:

Prob(xa=ca1,xb=cb1)=p[0,1]Prob(xa=ca2,xb=cb2)=1-pq (1)

where p is some probability between 0 and 1.

We further assume that xa and xb are independently and identically distributed over time, and identical for all individuals in a given generation. In other words, if two individuals choose the same action a, both will produce the same number of random offspring xa. This implies that the variation in offspring due to behavior is wholly systematic, i.e., the link between action and reproductive success is the same throughout the population.

A “mindless” individual’s behavior in this world is fully specified by the probability of choosing action a. Following the notation in Brennan and Lo [50], we denote this probability as f. Each individual dies after one period, and we assume its behavior f is heritable: offspring will behave in a manner identical to their parents, i.e., they choose between the two actions according to the same probability f.

From the individual’s perspective, always choosing the action with a higher expected reproductive success (f = 0 or 1) will lead to more offspring on average. However, Brennan and Lo [50] showed that from the perspective of the population, this individually optimal behavior cannot survive. In fact, the evolutionarily dominant behavior will depend on the relationship between the probability p and the relative fecundity ratios rjcaj/cbj for each of the two possible states of the world, j = 1, 2, where f can be anywhere between 0 and 1 in general, implying randomized behavior. See Proposition 3 of Brennan and Lo [50] for more detail.

Fig 1 illustrates the evolutionarily dominant behavior f* as a function of r1 and r2. If r1 and r2 are not too different in value—i.e., the ratio of fecundity between choices a and b is not very different between the two states of the world—then random behavior yields no evolutionary advantage over deterministic choice. In this case, the individually optimal behavior (f* = 0 or 1) will prevail in the population.

Fig 1. Regions of the (r1, r2)-plane that imply deterministic (f* = 0 or 1) or randomizing (0 < f* < 1) behavior, where rj = caj/cbj measures the relative fecundities of action a to action b in the two states j = 1, 2.

Fig 1

The asymptotes of the curved boundary line occur at r1 = p and r2 = q. Values of r1 and r2 for which exact probability matching is optimal are given by the solid black curve. Source: Brennan and Lo [50, Fig 1].

However, if one of the r variables is large while the other is small, then random behavior will be more advantageous for the population than a deterministic one. In such cases, there are times in which each choice performs substantially better than the other. Under those conditions, it is evolutionarily optimal for a population to diversify between the two choices, rather than always choosing the outcome with the highest probability of progeny in a single generation.

A simple numerical example from Brennan and Lo [50] will illustrate the basic mechanism of this model. Consider a population of individuals, each facing a binary choice between one of two possible actions, a and b. 70% of the time, environmental conditions are positive, and action a leads to reproductive success, generating 3 offspring for the individual. 30% of the time, environmental conditions are negative, and action a leads to 0 offspring. This corresponds to p = 70%, ca1 = 3, cb1 = 0 in the notation of (1). Suppose action b has exactly the opposite outcomes—whenever a yields 3 offspring, b yields 0, and whenever a yields 0, b yields 3. This corresponds to ca2 = 0, cb2 = 3 in the notation of (1). From the individual’s perspective, always choosing a, which has the higher probability of reproductive success, will lead to more offspring on average. However, if all individuals in the population behaved in this “rational” manner, the first time that a negative environmental condition occurs, the entire population will become extinct. Assuming that offspring behave identically to their parents, the behavior “always choose a” cannot survive over time. For the same reason, “always choose b” is also unsustainable. In fact, one can show that in this special case, the behavior with the highest reproductive success over time is for each individual to choose a 70% of the time and b 30% of the time, matching the probabilities of reproductive success and failure. Eventually, this particular randomizing behavior will dominate the entire population.

The key to understanding these behavioral predictions lies in the assumption of systematic reproductive risk. This dependence on risk has implications that go far beyond the current setting. For example, Zhang, Brennan, and Lo [51] show that environments with a mix of systematic and idiosyncratic reproductive risks cause different risk preferences to emerge. While our risk preferences may be determined by the nature of the risks to which we and our evolutionary ancestors have been exposed, we do not necessarily have the ability to distinguish between systematic and idiosyncratic risks in our day-to-day decision making.

2.2 The binary choice game

Turning to our experimental design, we presented live human subjects with a binary choice game in which the risks to the population are idiosyncratic, that is, the outcomes of one individual’s game are independent of those of another. However, when individuals apply preferences formed in response to systematic risks to the wrong environment, there is the potential for probability matching to occur, creating what appears to be irrational behaviors for those environments.

In our experiment consisted of four particular payoff structures by varying the parameters in (1) (equivalently, four particular points in Fig 1), and observe whether participants show behaviors predicted by Brennan and Lo [50] (equivalently, behaviors indicated by different colors in Fig 1).

We recruited a sample of 82 volunteers and conducted our binary choice experiment at the MIT Behavioral Research Laboratory. Our subjects were varied in their personal and socioeconomic characteristics. We provide a summary of their statistics in Section 3.1.

The full experimental session typically lasted 45 to 60 minutes for a given participant. Each participant used a computer program that completed 200 iterations of a binary choice trial, in essence playing a lottery. On each iteration of the trial, subjects were shown an image of one of two popular film stars—Angelina Jolie or Brad Pitt—with specific fixed probabilities that were unknown to the subjects. Each participant was randomly assigned to one of four experimental designs, as shown in Table 1. In designs 1 and 2, Angelina Jolie appeared 70% of the time and Brad Pitt 30% of the time. Designs 3 and 4 used the opposite probabilities. The participant guessed which image would appear before it was revealed, and the participant would receive a certain amount of virtual dollars if their guess was correct. Fig 2 shows a screenshot of the computer interface used in the experiment.

Table 1. Experimental design.

Design Image Probability Payoff Utility Maximizing Behavior Evolutionarily Dominant Behavior [50]
1 P(Angelina)=0.7
P(Brad)=0.3
Correct: v$2
Incorrect: v$0
Always Guess Angelina f*=P(GuessAngelina)=0.7
2 P(Angelina)=0.7
P(Brad)=0.3
Correct: v$2
Incorrect: v$1
Always Guess Angelina f*=P(GuessAngelina)=1.0
3 P(Brad)=0.7
P(Angelina)=0.3
Correct: v$2
Incorrect: v$1
Always Guess Brad f*=P(GuessBrad)=1.0
4 P(Brad)=0.7
P(Angelina)=0.3
Correct: v$2
Incorrect: v$0
Always Guess Brad f*=P(GuessBrad)=0.7

Fig 2. Screenshots of the experimental program.

Fig 2

The left image shows the screen before the user submits her guess, and the right image shows the screen displaying the result of a correct guess. The pictures of Angelina Jolie and Brad Pitt are republished from Foreign, Commonwealth & Development Office under a CC BY license, with permission through creativecommons.org, original copyright 2014. We use different images of Angelina Jolie and Brad Pitt in the study, and the screenshots here are therefore for illustrative purposes only.

In designs 1 and 4, subjects received two virtual dollars when they guessed correctly, and zero virtual dollars when they guessed incorrectly. In designs 2 and 3, subjects received two virtual dollars when they guessed correctly and one virtual dollar when they guessed incorrectly. These designs correspond to four different evolutionarily dominant behaviors in Fig 1 (see also Brennan and Lo [50]), as shown in the last column of Table 1. Designs 1 and 4 are meant to yield randomized behavior according to theory, while designs 2 and 3 are meant to yield deterministic behavior. In terms of parameters in Fig 1 using p = 0.7, Design 1 corresponds to r1 = ∞ and r2 = 0, which yields the dominant behavior f* = 0.7; Design 2 corresponds to r1 = 2 and r2=12, which yields the dominant behavior f* = 1; Design 3 corresponds to r1=12 and r2 = 2, which yields the dominant behavior f* = 0; Design 4 corresponds to r1 = 0 and r2 = ∞, which yields the dominant behavior f* = 0.3.

Fig 3 shows the trial-by-trial outcome of two representative subjects. The subject in Fig 3a had a mix of guesses of Angelina Jolie and Brad Pitt throughout the duration of the experiment. We refer to these subjects as “randomizers.’ On the other hand, the subject in Fig 3b guessed Angelina Jolie for almost the entire duration of the experiment; her only Brad Pitt guesses occurred within her first few attempts, when it is likely that she was still determining the pattern. We refer to these subjects as “maximizers.”

Fig 3. Experimental outcomes for a representative randomizer (3a) and a representative maximizer (3b).

Fig 3

The highest row of triangles displays the randomly generated appearances of Angelina Jolie for each trial. The second row of triangles displays the instances when the subject’s response was Angelina Jolie. The bottom two rows of triangles represent the same information for Brad Pitt appearances and Brad Pitt responses. The middle row of red ticks represents trials that the subject guessed correctly. The diagonal line shows the cumulative payout to the subject over time.

We provided an extensive explanation of the game for all participants and answered any questions they had before the start. In order to ensure the subjects’ comprehension of the mechanics of the game, we conducted 50 iterations of a control game for each subject, either before or after the real experiment. The control game still showed the two images of Angelina Jolie and Brad Pitt at random, but the subjects were explicitly told that they only would earn a reward by always guessing one of the images, randomized between Angelina Jolie and Brad Pitt, but fixed for any one subject. Subjects who understood the payoff structure should always guess the subject with the reward. This was a trivial game, used to test the participant’s understanding of the main lottery game.

The majority of our subjects passed the control and demonstrated a clear understanding of the payoff. However, a total of 7 subjects guessed the image with zero payoff in the control game more than 40% of the time, and we concluded that they were not paying proper attention to the task, and discarded their data in our subsequent analysis. This left us with valid data from a total of 75 subjects.

3 Results

3.1 Summary statistics

After participating in the experiment, all participants completed a personal information survey form. Table 2 contains summary statistics from these surveys, including the subjects’ personal and socioeconomic characteristics, as well as their game responses. Although our sample is fairly balanced in terms of gender, age, working status, income, wealth, and investment experience, it is skewed toward single (77.3%) subjects and those without children (86.7%). In addition, 64% of our subjects have reported taking some probability and statistics classes, an unsurprising finding, given that the experiment took place at MIT.

Table 2. Participant demographics and summary statistics.

Variable Distribution (n = 75 subjects)
Personal Characteristics
Gender Male
53.3%
Female
45.3%
Has Children Yes
12.0%
No
86.7%
Age ≤23
33.3%
[24, 46]
34.7%
>46
30.7%
Marital Status Single
77.3%
Partnered
9.3%
Married
8.0%
Other
4.0%
Socioeconomic Characteristics
Taken Probability & Statistics Class Yes
64.0%
No
34.7%
Gambling Experience Yes
33.3%
No
65.3%
Housing Status Rent
88.0%
Own
10.7%
Working Status Student
40.0%
Currently Working
38.7%
Unemployed & Other
20.0%
Annual Family Income <$25,000
30.7%
$25,000-$50,000
24.0%
$50,001-$100,000
28.0%
>$100,000
16.0%
Total Financial Assets <$25,000
62.7%
$25,000-$50,000
9.3%
$50,001-$100,000
14.7%
>$100,000
12.0%
Investment Horizon I don’t invest
52.0%
Less than 1 year
9.3%
Over 1 year
37.3%
Game-related Questions
Found Pattern in Game (Self-reported) Yes
44.0%
No
48.0%
I don’t remember
6.7%
Had Strategy in Game (Self-reported) Yes
74.7%
No
22.7%
I don’t remember
1.3%
Game Outcome Percentiles
Correct Guesses (Out of 200 Total) Min
98
5%
108
50%
129
95%
143
Max%
154
Total Earnings ($) (Including $5 Base) Min
14.8
5%
15.8
50%
20.3
95%
22.0
Max%
22.2

Subjects each received $5 in base pay for showing up, and $0.05 for each virtual dollar they earned. Total dollar earnings ranged from $14.80 to $22.20. Table 2 also reports the total number of correct guesses for all subjects. The best performer guessed 154 (77%) trials correctly, while the worst performer only guessed 98 (49%) trials correctly. The median subject guessed 129 (64.5%) out of the 200 trials correctly, slightly less than the expected number of correct guesses for a perfect maximizer, who would always guess the dominant image.

The post-game survey also asked participants about their perceptions of the binary choice game. 44% of our subjects reported that they found a pattern in the game. It is clear that many participants were looking for patterns throughout the game, despite its completely random nature. This is consistent with the “representativeness heuristic” first documented by Tversky and Kahneman [61, 62]. We include quotes from two representative subjects.

“I kept losing count, but clearly the ratio of appearance of Jolie’s picture to Pitts’s kept going up until it was something 7:1, then it went down (not always in increments of one, I think) until it was 1:1, and then it went back up again.”

“70% Angelina. If we picked her too many times, Brad was introduced as a counter-pick.”

In addition, 74.7% of the subjects reported that they had a specific strategy in the game. By reading the post-study surveys, we realized that our subjects exhibited a wide range of heterogeneous strategies for the game. Here we show a few representative quotes from the two extremes of these strategies, where some subjects indicated clearly that they were always choosing one image:

“Always pick Angie.”

“Choosing Brad Pitt all the time. His image appeared more frequently and even if the probability was 50% it would not have mattered who I choose, so why not choose him all the time. Also minimizes thinking effort and time to click.”

Other subjects seemed to engage in more complicated strategies:

“Chose Brad Pitt the majority of the time—if Brad Pitt appeared at least 6 times in a row, chose Angelina Jolie.”

“‘…I was switching between one and another until I noticed some sort of pattern and then I favored Angelina Jolie’s picture for the higher number and Brad Pitt for the lower number in the pattern of 5-1-3-1-2.”

These self-reported strategies are also reflected in the wide heterogeneity in behavior when we analyze participant choices.

3.2 A model for individual behavior

Brennan and Lo [50] predict that subjects assigned to designs 1 and 4 of our binary choice game will randomize their behavior. We refer to them as “theoretical randomizers.” On the other hand, subjects assigned to designs 2 and 3 are predicted to choose the dominant image deterministically, and we refer to them as “theoretical maximizers” (see Table 1). In this section, we study whether theoretical randomizers indeed randomize more often than theoretical maximizers.

We first describe a simple model of individual behavior. Define D to be the dominant option in the game. In our experiment, D represents Angelina Jolie in designs 1 and 2, and Brad Pitt in designs 3 and 4.

Each individual i chooses the dominant option D with probability f, where f represents the individual’s (unobserved) behavior. In other words, the individual’s decision in each trial is generated by a Bernoulli random variable:

yt={1,withprobabilityf,0,withprobability1-f, (2)

where yt = 1 represents choosing the dominant option D, and t = 1, …, 200. Suppose in T trials, an individual chooses the dominant option

Nt=1Tyt (3)

times. From observed data T and N, our goal is to estimate and understand the factors which determine individual behavior f in different payoff structures. The sample average proportion

f^N/T (4)

is the obvious choice as the point estimate of behavior f.

If an individual’s decisions are independent over time, it follows from (2) and (3) that NBinomial(T, f), and f^ is approximately normally distributed with mean f and variance f(1 − f)/T. More generally, if an individuals’ decisions are not independent over time, f^ still has mean f, but its variance may be different. In Section 3.4, we estimate whether individual decisions are independent, and in Section 3.5 we discuss its implications for the variance of f^ and the hypothesis tests we carry out.

3.3 Initial learning

During the experiment, subjects required a number of trials to estimate the frequency of each image. This means that their first few guesses tended to show unstable behavior. To account for this, we divided each individual’s total number of trials into eight consecutive segments of 25 trials each, and estimated the aggregate behavior f for each segment across individuals within the same trial design. Individual behavior was too noisy for successful functional estimates over the initial trials, so we used the aggregate pattern across individuals to better understand the speed of participant learning.

Fig 4 shows the estimated aggregate behavior for theoretical maximizers (designs 2 and 3, f* = 0.7) and theoretical randomizers (designs 1 and 4, f* = 1.0), segmented into eight consecutive batches. We used the sample average proportions f^ in (4) as the point estimate of behavior f, and the normal approximation for binomial distributions to estimate its confidence interval: f^N(f,f(1-f)/T). More specifically, for a given confidence level 1 − α (e.g., α = 0.05, or 95% confidence), the (1 − α)-confidence interval is given by:

(f^-zf^(1-f^)T,f^+zf^(1-f^)T)

where z is the 1-α2 quantile of a standard normal distribution corresponding to the target error rate α. For a 95% confidence level, the error α = 1 − 0.95 = 0.05, so 1-α2=0.975 and z = 1.96. There are other approximations for confidence intervals of binomial random variables, such as the Wilson score interval and the Jeffreys interval [63, 64]. We use the normal approximation for simplicity.

Fig 4. Estimated aggregate behavior for batches of 25-trial segments.

Fig 4

We see that the estimated behavior in the first two batches (the first 50 trials) are lower than in the remaining batches, starting to stabilize around batch 3. For consistency across all individuals, we consider the responses starting from trial 51 as stable, and do not include the first 50 trials in our subsequent analysis. Our main conclusions in this paper are not sensitive to this choice.

3.4 Decision autocorrelation

In general, the distribution of our estimated behavior f^ depends on the covariance structure of an individuals’ decisions over time: {yt}t=1T. For individuals’ stable trials (trial 51 to 200), we do find evidence for autocorrelation in individual decision making. In fact, applying the Ljung–Box Q test to each individual’s sequence of choices yields 38 p-values that are smaller than 0.05 (34 after the Benjamini–Hochberg procedure to control for the False Discovery Rate in multiple testing), out of 75 subjects. This indicates that roughly half of the subjects show significant autocorrelation.

To incorporate correlation into the variance estimate of f^ in (4), we compute an autocorrelation function based on all individuals’ stable trials. Specifically, the lag-l autocorrelation, ACF(l), is estimated using the sample correlation of the following pairs:

{(yt(i),yt+l(i))}t=51,,T-1,andi=1,,n (5)

where the superscripts (i) denotes the i-th subject, which we normally omit for simplicity elsewhere in the paper.

Fig 5 shows the autocorrelation function up to lag 120. The autocorrelation is 32.4% at lag-1 and quickly stablizes around 8%-12%. We also pool together autocorrelations at all lags to yields an estimate of equicorrelation of 10.8%, which we use in the next section to test the hypothesis whether the behavior for theoretical maximizers is different from the behavior for theoretical randomizers.

Fig 5. Autocorrelation function estimated from all individuals’ stable trials.

Fig 5

3.5 Probability matching

The evolutionary model of probability matching [50] predicts that the behavior in design 2 and 3 should be fmaximizer = 1.0, while the behavior in design 1 and 4 should be frandomizer = 0.7. This of course is under the hypothetical condition that the only determinant of an individual’s reproductive success is the payoff from the game. This is obviously an extreme simplification of reality. Nonetheless, the model still provides the important insight that the presence of probability matching, and the degree at which individuals engage in probability matching, are determined by the environment, which we can specifically test through our experiment.

In particular, we are able to test the hypothesis that:

H0:fmaximizerfrandomizer,Ha:fmaximizer>frandomizer (6)

as predicted by Brennan and Lo [50], where fmaximizer is the behavior of individuals in designs 2 and 3, and frandomizer is the behavior of individuals in designs 1 and 4. As specified in (2), we observe repeated individual decisions that are, in our model, determined by the unobserved behavior fmaximizer and frandomizer. We can pool together data from all theoretical maximizers and compare with data from all theoretical randomizers. This is a standard two sample proportion test, except that decisions for the same individual might be correlated, as shown in Section 3.4.

Given a particular individual, we use vector y ≔ (y1, ⋯, yT)′ to denote her sequence of T random Bernoulli trials. For simplicity and analytical tractability, we assume the sequence has equicorrelation of ρ (estimated as 10.8% in our dataset). In other words, the covariance matrix of y is given by

Cov(y)=Var(y1)·Corr(y)=f(1-f)·(1ρρρ1ρρρ1), (7)

where the first equation follows from the fact that y1, ⋯, yT are identically distributed as specified in (2).

Therefore, the variance of the estimated behavior f^ for individual i, as defined in (4), is given by

Var(f^(i))=Var(t=1TytT)=Var(1y)T2=1Cov(y)1T2=f(1-f)T(1+(T-1)ρ)T2=f(1-f)T·(1+(T-1)ρ) (8)

where 1 is the unit vector of all 1’s, and we have omitted the superscript (i) in our derivation for notational simplicity.

Note that the first term in (8), f(1-f)T, is simply the variance of f^(i) if individual decisions are independent Bernoulli random variables. Therefore, the second term in (8), (1 + (T − 1)ρ), can be treated as an adjustment factor of f^(i)’s variance when individual decisions are correlated.

For a set of n independent individuals with T trials each, the overall estimated behavior for them is simply the average estimated behavior of each individual. Therefore the variance for their overall behavior is:

Var(f^)=Var(i=1nf^(i)n)=Var(f^(1))n=f(1-f)nT·(1+(T-1)ρ).

As a result, for an (unpaired) two sample proportion test between two groups of subjects with n1 and n2 individuals respectively, we have the test statistic:

z=(f^1-f^2)-0f*(1-f*)(1n1T+1n2T)·11+(T-1)ρ (9)

where f^1, f^2, and f* are the average behavior for individuals from group 1, group 2, and all pooled together, and they do not depend on y’s covariance structure. The first term in (9) is simply the standard z-score for the two sample proportion test, and the second term in (9) can be treated as the adjustment factor for correlation, which in our case is:

11+(T-1)ρ=11+(150-1)·10.8%0.24

for stable trials.

With this correlation adjustment, the null hypothesis in (6) is rejected with a z-statistic of 2.014 (or a p-value of 0.022), providing evidence for a difference in behavior between theoretical maximizers and theoretical randomizers. As predicted, when facing different environments (i.e., different payoffs in the experiment), theoretical randomizers indeed randomize more often than theoretical maximizers. Subjects responded differently by adapting to the environment and showing different stable behaviors.

After the game, we asked subjects in a survey whether they employed a specific strategy, and 74.7% of the subjects reported “Yes”. We perform the same hypothesis test (6) for subjects who reported “Yes” and those who reported “No” separately. We find that the effect holds strongly for individuals who reported that they used a specific strategy (adjusted z-statistic of 2.489, adjusted p-value of 0.006), but not for those who did not (adjusted z-statistic of −0.058, adjusted p-value of 0.523). This serves as another robustness check that the effect is driven by intentional behavior on the part of the subjects, and is not purely noise. This also provides empirical evidence for theories that attempt to explain probability matching through pattern seeking [55] and searching for causal relationships [45].

In principle, one can also perform the same test for different slices of the subjects across demographic, socioeconomic, and game-specific dimensions shown in Table 2. This helps to build intuition on whether the same effect holds true universally, and what types of individuals have stronger effects. However we acknowledge that the power of our study is limited due to the sample of 75 subjects, particularly after multiple-testing adjustment, and we leave this to a future study.

3.6 Individual differences

To jointly study individual differences in decision-making with the variables considered in Table 2, we consider a logistic regression model at the level of each guess by the individual subject. Specifically, for individual i, at the t-th trial in the game,

yt(i)Bernoulli(f(i))

where yt(i)=1 is a binary random variable that represents whether individual i chooses the dominant option at trial t, similar to the formulation in Eq (2). Individual i’s behavior—the probability of choosing the dominant option—is modeled by:

f(i)=P(yt(i)=1)=Logistic(IsTheoryMaximizeri+IsMalei+HasChildi+AgeBucketi+HasProbClassExpi+HasGamblingExpi+HasInvestExperiencei+IsStudenti+IsOwni+IncomeBucketi+TotalAssetBucketi+HasPatterni) (10)

where Logistic(x) = (1 + exp(−x))−1.

We have seen in Section 3.4 that individual decisions are correlated over time. Therefore, the errors for regression (10) may be autocorrelated. We group trials from the same individual together, and order their decisions chronologically. In particular, the response variable is organized as:

(y1(1),y2(1),,yT(1)1stsubjectsTtrials,y1(2),y2(2),,yT(2)2ndsubjectsTtrials,,y1(n),y2(n),,yT(n)n-thsubjectsTtrials),

and we apply the Newey-West heteroskedasticity and autocorrelation consistent (HAC) estimator for the variance of the coefficients in the following results. We adopt Newey and West’s [65] suggestion to choose the truncation parameter to be the integer part of 4(nT/100)2/9, which is 11 in our case. This indeed increases the variance estimate of our coefficients compared with the case of independent errors, and our results are not materially different with respect to the choice of the truncation parameter. In fact, we have tried an estimation with truncation parameter to be 150, the number of total valid decisions for one individual. The main variable IsTheoryMaximizer remains statistically significant at 5%.

Table 3 summarizes the independent variables in Eq (10). These variables correspond to the collected personal information of the subjects (see Table 2), categorized to make them proper binary or ordinal variables. We have dropped several variables that are highly collinear with the covariates in (10). The p-value of the log-likelihood ratio test of the full model is 4 × 10−54, implying a high degree of significance.

Table 3. Variables and coefficients for logistic regression.

Variable Type Coefficient HAC Standard Error z Statistic
IsTheoryMaximizer Dummy 0.4180 0.106 3.959***
IsMale Dummy 0.1906 0.100 1.901
HasChild Dummy -0.5497 0.126 -4.378***
AgeBucket Ordinal -0.0747 0.078 -0.962
HasProbClassExp Dummy -0.2832 0.116 -2.445*
HasGamblingExp Dummy 0.0120 0.106 0.113
HasInvestExperience Dummy 0.0393 0.108 0.365
IsStudent Dummy -0.1001 0.128 -0.78
IsOwn Dummy -0.2603 0.206 -1.266
IncomeBucket Ordinal 0.0246 0.038 0.642
TotalAssetBucket Ordinal 0.0965 0.048 2.022*
HasPattern Dummy -0.5497 0.101 -5.467***
Intercept 2.1051 0.160 13.126***

Statistics in bold are significant at the 0.1% (***), 1% (**), or 5% (*) level.

The first variable, IsTheoryMaximizer, encodes whether the individual is a theoretical maximizer, i.e., placed in design 2 or 3. The effect is both positive and strongly significant after controlling for all other socioeconomic and game-related variables, implying that theoretical maximizers tend to choose the dominant option more often. This is consistent with our analysis in Section 3.5.

Continuing down the table, a significantly positive coefficient corresponds to a dimension along which individuals tend to be a maximizer, i.e., they randomize less often. Our results show that subjects with more financial assets tend to be maximizers more often than subjects with fewer financial assets.

A significantly negative coefficient, on the other hand, corresponds to a dimension along which individuals tend to be a randomizer, i.e., they switch between two options more often. Our results show that subjects with children tend to be randomizers.

In addition, subjects who have taken probability and statistics classes and who self-reported finding a pattern in the game tend to randomize more. This is contrary to our prior expectation that those who understand probability might be more likely to always choose the dominant option, the economically maximizing behavior. In fact, they behave in the exactly opposite manner, perhaps in an attempt to “beat the game.” This is consistent with our observations of subject narratives in the post-study survey in Section 3.1.

4 Discussion

We have used a simple lottery game to test the occurrence of probability matching behavior in financial decision-making. Our study specifically uses a choice between two images rather than more financially-related tasks (such as guessing the outcome of asset-price movements) in the hopes of triggering a more primitive form of decision-making in our subjects, because financially-related tasks may be more likely to trigger economically optimizing behaviors. Despite a large degree of heterogeneity among individual behaviors, we find that different levels of probability matching occur in different environments of payoff structures. Specifically, individuals in environments with less balanced payoffs between the two random outcomes (design 1 and 4 in Table 1) show a greater degree of randomized behavior, consistent with the behavioral predictions of Brennan and Lo [50].

On the other hand, we did not observe individuals behave in exactly the way the simple theoretical model would suggest (f* = 0.7 for design 1 and 4; f* = 1.0 for design 2 and 4 in Table 1), most likely because the real effects on the reproductive success of each individual are unlikely to be affected by the payoffs provided in the game, but instead are affected by a number of heterogeneous socioeconomic factors such as income, wealth, and other variables. It is difficult, and perhaps impossible, to specify a single model that predicts behavior for every participant under all circumstances. However, we have indeed observed that at the population level, the effect that theoretical randomizers (design 1 and 4) randomize more than theoretical maximizers (design 2 and 3) is real and robust, after controlling for a wide range of such personal and socioeconomic variables.

We find significant evidence of different levels of probability matching behavior when we alter the payoff structure in the game. However, our study is also constrained by the limited magnitude of payoffs (around $20) offered to subjects, as well as the limitations of our sample size (82 subjects). It is therefore difficult to perform additional statistical tests on finer slices of the data to study whether the same behavioral mechanism applies to different demographic categories. It is also difficult to conclude that any demographic dimension that appears neutral to individual decision making would remain neutral in a larger-scale study. It remains a question for future research to confirm whether our experimental conclusions carry over across demographic groups, to other financial and non-financial contexts, and at different magnitudes of payoffs.

In addition to testing the evolutionary model of Brennan and Lo [50], our experimental results suggest that it is valuable to derive behavioral predictions and implications through an evolutionary lens. Traditional utility-based theories would yield the same maximizing behavior for all four designs in our experiment. Yet we find evidence for differences in reality, and the evolutionary framework offers a potential explanation and prediction for such behaviors: the environment matters.

More generally, financial markets—a collection of individual decision makers—can also be studied using the same principles, leading to the Adaptive Markets Hypothesis [66, 67] and its many empirical implications [6870]. In the same way that micro-level individual decision making can be better understood through an evolutionary lens, markets and societies at the system-wide and macroscopic level can also benefit an adaptive perspective.

Acknowledgments

The authors thank the MIT Behavioral Research Laboratory for hosting the experiment, and Jennifer L. Walker, Jayna Cummings, and Allison McDonough for their assistance in the experiments. Insightful comments and discussions from Jason Anthony Aimone (editor), Kim Fairley (reviewer), Taibo Li and Alex Huang are also greatly appreciated.

Data Availability

The full data is publicly available at: https://figshare.com/s/4760fc73b07ce32b5fe5.

Funding Statement

Research support from the MIT Laboratory for Financial Engineering is gratefully acknowledged. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.von Neumann J, Morgenstern O. Theory of Games and Economic Behavior. Princeton, NJ: Princeton University Press; 1944. [Google Scholar]
  • 2.Nash JF. Equilibrium points in n-person games. Proceedings of the national academy of sciences. 1950;36(1):48–49. doi: 10.1073/pnas.36.1.48 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Lucas RE Jr. Expectations and the neutrality of money. Journal of Economic Theory. 1972;4(2):103–124. doi: 10.1016/0022-0531(72)90142-1 [DOI] [Google Scholar]
  • 4.Samuelson PA. Proof that properly anticipated prices fluctuate randomly. Industrial Management Review. 1965;6(2):41–49. [Google Scholar]
  • 5.Fama EF. Efficient Capital Markets: A Review Of Theory And Empirical Work. The Journal of Finance. 1970;25(2):383–417. doi: 10.1111/j.1540-6261.1970.tb00518.x [DOI] [Google Scholar]
  • 6.Black F, Scholes M. The Pricing of Options and Corporate Liabilities. Journal of Political Economy. 1973;81(3):637–654. doi: 10.1086/260062 [DOI] [Google Scholar]
  • 7.Merton RC. Theory of rational option pricing. The Bell Journal of economics and management science. 1973;4(1):141–183. doi: 10.2307/3003143 [DOI] [Google Scholar]
  • 8.Kocherlakota NR. Modern macroeconomic models as tools for economic policy. The Region (Federal Reserve Bank of Minneapolis). 2010; p. 5–21. [Google Scholar]
  • 9.Hu HT. Efficient markets and the law: A predictable past and an uncertain future. Annual Review of Financial Economics. 2012;4(1):179–214. doi: 10.1146/annurev-financial-110311-101733 [DOI] [Google Scholar]
  • 10.Kahneman D, Slovic P, Tversky A. Judgment under Uncertainty: Heuristics and Biases. Cambridge, UK: Cambridge University Press; 1982. [Google Scholar]
  • 11.Tversky A, Kahneman D. Judgment under uncertainty: Heuristics and biases. Science. 1974;185(4157):1124–1131. doi: 10.1126/science.185.4157.1124 [DOI] [PubMed] [Google Scholar]
  • 12.Kahneman D, Tversky A. Prospect theory: An analysis of decision under risk. Econometrica. 1979;47(2):263–291. doi: 10.2307/1914185 [DOI] [Google Scholar]
  • 13.Shefrin H, Statman M. The disposition to sell winners too early and ride losers too long: Theory and evidence. The Journal of finance. 1985;40(3):777–790. doi: 10.1111/j.1540-6261.1985.tb05002.x [DOI] [Google Scholar]
  • 14.Odean T. Are investors reluctant to realize their losses? The Journal of finance. 1998;53(5):1775–1798. [Google Scholar]
  • 15.Barber BM, Odean T. Boys will be boys: Gender, overconfidence, and common stock investment. The quarterly journal of economics. 2001;116(1):261–292. doi: 10.1162/003355301556400 [DOI] [Google Scholar]
  • 16.Gervais S, Odean T. Learning to be overconfident. The Review of Financial Studies. 2001;14(1):1–27. doi: 10.1093/rfs/14.1.1 [DOI] [Google Scholar]
  • 17.De Bondt WF, Thaler R. Does the stock market overreact? The Journal of finance. 1985;40(3):793–805. [Google Scholar]
  • 18.Huberman G, Regev T. Contagious speculation and a cure for cancer: A nonevent that made stock prices soar. The Journal of Finance. 2001;56(1):387–396. doi: 10.1111/0022-1082.00330 [DOI] [Google Scholar]
  • 19.Tversky A, Kahneman D. The Framing of Decisions and the Psychology of Choice. Science. 1981;211:453–458. doi: 10.1126/science.7455683 [DOI] [PubMed] [Google Scholar]
  • 20.Lichtenstein S, Fischhoff B, Phillips LD. Calibration of probabilities: The state of the art to 1980. In: Kahneman D, Slovic P, Tversky A, editors. Judgment under uncertainty: Heuristics and biases. Cambridge, UK: Cambridge University Press; 1982. p. 306–334. [Google Scholar]
  • 21.Gneezy U, List JA, Wu G. The uncertainty effect: When a risky prospect is valued less than its worst possible outcome. The Quarterly Journal of Economics. 2006; p. 1283–1309. doi: 10.1093/qje/121.4.1283 [DOI] [Google Scholar]
  • 22.Mahoney MJ. Publication prejudices: An experimental study of confirmatory bias in the peer review system. Cognitive therapy and research. 1977;1(2):161–175. doi: 10.1007/BF01173636 [DOI] [Google Scholar]
  • 23.Grant DA, Hake HW, Hornseth JP. Acquisition and extinction of verbal conditioned responses with differing percentages of reinforcement. Journal of experimental psychology. 1951;42(1):1–5. doi: 10.1037/h0054051 [DOI] [PubMed] [Google Scholar]
  • 24.Hake HW, Hyman R. Perceptions of the Statistical Structure of a Random Series of Binary Symbols. J Exp Psychol. 1953;45:64–74. doi: 10.1037/h0060873 [DOI] [PubMed] [Google Scholar]
  • 25.Herrnstein RJ. Relative and absolute strength of responses as a function of frequency of reinforcement. Journal of the Experimental Analysis of Behaviour. 1961;4(3):267–272. doi: 10.1901/jeab.1961.4-267 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Herrnstein RJ. On the law of effect. Journal of the Experimental Analysis of Behavior. 1970;13:243–266. doi: 10.1901/jeab.1970.13-243 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Herrnstein RJ. The Matching Law. Cambridge, Massachusetts: Harvard University Press; 1997. [Google Scholar]
  • 28.Bradshaw CM, Szabadi E, Bevan P. Behavior of humans in variable-interval schedules of reinforcement. Journal of the Experimental Analysis of Behavior. 1976;26:135–141. doi: 10.1901/jeab.1976.26-135 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Davison M, McCarthy D. The matching law: A research review. Hillsdale, NJ: Lawrence Erlbaum; 1988. [Google Scholar]
  • 30.Vulcan N. An Economist’s Perspective on Probability Matching. Journal of Economic Surveys. 2000;14:101–118. doi: 10.1111/1467-6419.00106 [DOI] [Google Scholar]
  • 31.Kogler C, Kühberger A. Dual Process Theories: A Key for Understanding the Diversification Bias? J Risk Uncertainty. 2007;34:145–154. [Google Scholar]
  • 32.Deneubourg JL, Aron S, Goss S, Pasteels JM. Error, communication and learning in ant societies. European Journal of Operational Research. 1987;30(2):168–172. doi: 10.1016/0377-2217(87)90093-2 [DOI] [Google Scholar]
  • 33.Pasteels JM, Deneubourg JL, Goss S. Self-organization mechanisms in ant societies. I: Trail recruitment to newly discovered food sources. Experientia Supplementum. 1987;. [Google Scholar]
  • 34.Kirman A. Ants, rationality, and recruitment. Quarterly Journal of Economics. 1993;108(1):137–156. doi: 10.2307/2118498 [DOI] [Google Scholar]
  • 35.Hölldobler B, Wilson EO. The Ants. Cambridge, MA: Belknap Press; 1990. [Google Scholar]
  • 36.Harder LD, Real LA. Why are bumble bees risk averse? Ecology. 1987;68(4):1104–1108. [Google Scholar]
  • 37.Thuijsman F, Peleg B, Amitai M, Shmida A. Automata, matching and foraging behavior of bees. J Theor Biol. 1995;175:305–316. doi: 10.1006/jtbi.1995.0144 [DOI] [Google Scholar]
  • 38.Keasar T, Rashkovich E, Cohen D, Shmida A. Bees in two-armed bandit situations: foraging choices and possible decision mechanisms. Behavioral Ecology. 2002;13:757–765. doi: 10.1093/beheco/13.6.757 [DOI] [Google Scholar]
  • 39.Bitterman ME, Wodinsky J, Candland DK. Some Comparative Psychology. Am J Psychol. 1958;71:94–110. doi: 10.2307/1419199 [DOI] [PubMed] [Google Scholar]
  • 40.Behrend ER, Bitterman ME. Probability-Matching in the Fish. American Journal of Psychology. 1961;74:542–551. doi: 10.2307/1419664 [DOI] [Google Scholar]
  • 41.Graf V, Bullock DH, Bitterman ME. Further Experiments on Probability-Matching in the Pigeon. J Exp Anal Behav. 1964;7:151–157. doi: 10.1901/jeab.1964.7-151 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Young JS. Discrete-Trial Choice in Pigeons: Effects of Reinforcer Magnitude. J Exp Anal Behav. 1981;35:23–29. doi: 10.1901/jeab.1981.35-23 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Woolverton WL, Rowlett JK. Choice maintained by cocaine or food in monkeys: Effects of varying probability of reinforcement. Psychopharmacology. 1998;138:102–106. doi: 10.1007/s002130050651 [DOI] [PubMed] [Google Scholar]
  • 44.Elliott R, Rees GW, Dolan RJ. Ventromedial Prefrontal Cortex Mediates Guessing. Neuropsychologia. 1999;37:403–411. doi: 10.1016/S0028-3932(98)00107-9 [DOI] [PubMed] [Google Scholar]
  • 45.Wolford G, Miller MB, Gazzaniga M. The Left Hemisphere’s Role in Hypothesis Formation. Journal of Neuroscience. 2000;20(6):RC64:1–4. doi: 10.1523/JNEUROSCI.20-06-j0003.2000 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Volz KG, Schubotz RI, von Cramon DY. Predicting Events of Varying Probability: Uncertainty Investigated by fMRI. NeuroImage. 2003;19:271–280. doi: 10.1016/S1053-8119(03)00122-8 [DOI] [PubMed] [Google Scholar]
  • 47.Miller MB, Valsangkar-Smyth M, Newmanb S, Dumontc H, G W. Brain Activations Associated with Probability Matching. Neuropsychologia. 2005;43:1598–1608. doi: 10.1016/j.neuropsychologia.2005.01.021 [DOI] [PubMed] [Google Scholar]
  • 48.Miller MB, Valsangkar-Smyth M. Probability Matching in the Right Hemisphere. Brain and Cognition. 2005;57:165–167. doi: 10.1016/j.bandc.2004.08.038 [DOI] [PubMed] [Google Scholar]
  • 49.Hecht D, Walsh V, Lavidor M. Transcranial Direct Current Stimulation Facilitates Decision Making in a Probabilistic Guessing Task. J Neurosci. 2010;30(12):4241–4245. doi: 10.1523/JNEUROSCI.2924-09.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Brennan TJ, Lo AW. The origin of behavior. The Quarterly Journal of Finance. 2011;1(01):55–108. doi: 10.1142/S201013921100002X [DOI] [Google Scholar]
  • 51.Zhang R, Brennan TJ, Lo AW. The origin of risk aversion. Proceedings of the National Academy of Sciences. 2014;111(50):17777–17782. doi: 10.1073/pnas.1406755111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Estes W. Towards a Statistical Theory of Learning. Psychological Review. 1950;57(2):94–107. doi: 10.1037/h0058559 [DOI] [Google Scholar]
  • 53.Estes WK, Strughan JH. Analysis of a Verbal Conditioning Situation in terms of Statistical Learning Theory. Journal of Experimental Psychology. 1954;47:225–234. doi: 10.1037/h0060989 [DOI] [PubMed] [Google Scholar]
  • 54.Burke CJ, Estes WK, Hellyer S. Rate of Verbal Conditioning in Relation to Stimulus Variability. Journal of Experimental Psychology. 1954;48:153–161. doi: 10.1037/h0053945 [DOI] [PubMed] [Google Scholar]
  • 55.Wolford G, Newman S, Miller MB, Wig GS. Seraching for Patterns in Random Sequences. Canadian Journal of Experimental Psychology. 2004;58(4):221–228. doi: 10.1037/h0087446 [DOI] [PubMed] [Google Scholar]
  • 56.Siegel S. Decision Making and Learning under Varying Conditions of Reinforcement. Annals of the New York Academy of Sciences. 1960;89:766–783. doi: 10.1111/j.1749-6632.1961.tb20177.x [DOI] [Google Scholar]
  • 57.Rubinstein A. Irrational Diversification in Multiple Decision Problems. European Economic Review. 2002;46:1369–1378. doi: 10.1016/S0014-2921(01)00186-6 [DOI] [Google Scholar]
  • 58.Wozny DR, Beierholm UR, Shams L. Probability Matching as a Computational Strategy Used in Perception. PLoS Comput Biol. 2010;6(8):e1000871. doi: 10.1371/journal.pcbi.1000871 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Baum WM. On two types of deviation from the matching law: Bias and undermatching. Journal of the Experimental Analysis of Behavior. 1974;22:231–242. doi: 10.1901/jeab.1974.22-231 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Horne PJ, Lowe CF. Determinants of human performance on concurrent schedules. Journal of the Experimental Analysis of Behavior. 1993;59:29–60. doi: 10.1901/jeab.1993.59-29 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Tversky A, Kahneman D. Belief in the law of small numbers. Psychological bulletin. 1971;76(2):105. doi: 10.1037/h0031322 [DOI] [Google Scholar]
  • 62.Kahneman D, Tversky A. Subjective probability: A judgment of representativeness. Cognitive psychology. 1972;3(3):430–454. doi: 10.1016/0010-0285(72)90016-3 [DOI] [Google Scholar]
  • 63.Newcombe RG. Two-sided confidence intervals for the single proportion: comparison of seven methods. Statistics in medicine. 1998;17(8):857–872. doi: [DOI] [PubMed] [Google Scholar]
  • 64.Brown LD, Cai TT, DasGupta A. Interval estimation for a binomial proportion. Statistical science. 2001; p. 101–117. [Google Scholar]
  • 65.Newey WK, West KD. A simple, positive semi-definite, heteroskedasticity and autocorrelationconsistent covariance matrix. Econometrica. 1987;55(3):703–708. doi: 10.2307/1913610 [DOI] [Google Scholar]
  • 66.Lo AW. The adaptive markets hypothesis. Journal of Portfolio Management. 2004;30(5):15–29. doi: 10.3905/jpm.2004.442611 [DOI] [Google Scholar]
  • 67.Lo AW. Adaptive Markets: Financial Evolution at the Speed of Thought. Princeton, NJ: Princeton University Press; 2017. [Google Scholar]
  • 68.Neely CJ, Weller PA, Ulrich JM. The adaptive markets hypothesis: evidence from the foreign exchange market. Journal of Financial and Quantitative Analysis. 2009;44(2):467–488. doi: 10.1017/S0022109009090103 [DOI] [Google Scholar]
  • 69.Kim JH, Shamsuddin A, Lim KP. Stock return predictability and the adaptive markets hypothesis: Evidence from century-long US data. Journal of Empirical Finance. 2011;18(5):868–879. doi: 10.1016/j.jempfin.2011.08.002 [DOI] [Google Scholar]
  • 70.Khuntia S, Pattanayak J. Adaptive market hypothesis and evolving predictability of bitcoin. Economics Letters. 2018;167:26–28. doi: 10.1016/j.econlet.2018.03.005 [DOI] [Google Scholar]

Decision Letter 0

Jason Anthony Aimone

22 Feb 2021

PONE-D-20-40629

To maximize or randomize? An experimental study of probability matching in financial decision making

PLOS ONE

Dear Dr. Zhang,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

The decision to request a revision comes after consultation with a qualified reviewer who raises important points about the paper that need to be addressed. While I ask you to look at and respond to each of the points, of particular importance is ensuring that the data analysis is correctly completed. Note for instance the lack of description in the text about how repeat observations from individual subjects is treated in the regression analysis.  I gave the paper a read myself and have some points I would like you to address as well. Several of these points are dealing with the manner in which the paper is discussed and where confusion may arise with readers. There are also several statistical points of concern to be addressed.

  • P1/2: Discussion of rationality makes it sound like the factors identified that lead to differences to the things discussed in the first paragraph are “irrational”, which the lines of work discussing those things do not always indicate.  Loss aversion for instance is not necessarily “irrational” even though it goes against what could be called “traditional” theoretical models of rationality.

  • The probability matching example of coin flips does not do what it is trying to do. I’d suggest rewording or providing a more detailed example.  The intro needs a good example for those who are not familiar with the concept.

  • You say “It is interesting to note that the t-statistic is higher for those subjects who reported “Yes” than for those who reported “No”. This implies that subjects who believed there was a pattern were adapting to the environment of the game more strongly”.  Please directly test for this difference.

  • Table 3:  It is not clear how you are setting up these tests from the text. It would seem reasonable to be identifying the “f” for each participant as the probability they choose the dominant option, and thus there would be 1 data point per individual. You then would make comparisons between your four treatments. However, you say “The number of samples in the t-test equals the number of subjects multiplied by 150 stable trials.”  Are you saying here that you are treating each subject as 150 different independent observations (which would be inappropriate?)  This is confusing and muddles the whole results section.

  • Discussion:  The conclusion is written strangely and leads to the opposite conclusions as the paper. “Specifically, individuals in environments with more balanced payoffs between the two random outcomes show a greater degree of randomized behavior, consistent with the behavioral predictions in Brennan and Lo [44]” Your paper sets up behavior in treatments 1 and 4 (That pay for correct and do not pay for incorrect, thus very unbalanced payments) are more likely to result in decision randomization than treatments 2 and 3 (where people get paid relatively more equally for correct and incorrect answers.) This is opposite of the quote from the conclusion. Why? Your regression show analysis shows that the variable “Theory Maximizer” which you say is 1 if in treatments 2 or 3, has a positive coefficient which you say is interpreted as meaning they choose the dominant option more often, which would seem to be the opposite of engaging in more randomized behavior.  The conclusion continues to say that the data did not support the table 1 theory.  That data did appear to support the Evolutionarily Dominate behavior theoretical outcome.  Is there something I am missing or missed in this explanation?

Finally, I head some comments back from a second reviewer who recognized late in the process a conflict of interest and withdrew from the formal process. I will pass on a few thoughts for your revision from that reviewer though. I summarize here:

*p1, ln 13-14 should be rephrased as as overconfidence and loss aversion are less "behaviors" and more psychological traits

*p8, elaborate and clarify the position that pattern seeking might be a form of randomization.

*p6, quote starting "the experiment we describe...for those environments." you may want to emphasize this in the introduction.

Please submit your revised manuscript by Apr 08 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Jason Anthony Aimone

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2.  We note that the grant information you provided in the ‘Funding Information’ and ‘Financial Disclosure’ sections do not match.

When you resubmit, please ensure that you provide the correct grant numbers for the awards you received for your study in the ‘Funding Information’ section.

3. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: No

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: To maximize or randomize? An experimental study of probability matching in financial

decision making

By Andrew W Lo, Katherine P Marlowe, Ruixun Zhang

Reviewed on Feb 11, 2021

Summary

This study aims to experimentally test the model predictions by Brennan and Lo (2011) on probability matching. By probability matching the authors refer to a behavioural tendency to predict the outcomes of an independent randomized event to match it underlying probability distribution. In other words, participants do not pick one outcome based on maximum utility, but pick various outcomes which correspond to the underlying probability with which these outcomes occur.

Their experimental design was tested on 82 participants in a behavioural laboratory setup and consisted of 200 trials of a binary choice decision. This binary choice involved guessing if an image of Brad Pitt or Angelina Jolie would appear and with every good guess participants would accrue experimental tokens. By varying the payoff structure and the probability with which one of the two images would appear, four experimental conditions were created. Based on the evolutionary dominant strategies as laid out by Brannon and Lo (2011), experimental results indeed showed a behavioural difference between theoretical maximizers and randomizers.

Major concerns

1. Aims of this paper

The goal of this paper is to experimentally test the evolutionary model by Brannan and Lo. This aim should be made much clearer than at the end of the Introduction, where it is written now. Although the authors describe this model in some detail in 2.1, the specific environmental conditions (which leads the authors to design their payoff structure) that underlies probability matching according to the authors are not well described. As this model forms the backbone of the experimental predictions, it should be described in an intuitive clear manner.

2. Power analysis

Did the authors perform a power analysis? As many control variables are included, I wonder if there is enough power to test the numerous relationships as laid out in the results section with only 82 participants.

3. Separate t-tests

It is weird to run separate t-tests for each and every socio-demographic variable. You would need to perform a correction for multiple hypothesis testing to begin with, but also I do not see the purpose of running these t-tests. I would just stick to your logistical regression. With regard to your logistical regression, how do you control for the repeated trials at the subject level, as this probably violates independence of individual data points? This is unclear and needs to be explained better.

4. Discussion

Your discussion is a summary of your results and does not relate your findings to any other study, does not discuss any limitations of your study in much detail, or addresses the relevance and practical implications of your discussion. In short, the discussion does not read as a discussion.

Minor concerns

5. Probability matching is also a well-known term to label a technique to elicit ambiguity preferences. I would suggest you make it clear very early on what probability matching entails in your study.

6. You explain probability matching as randomizing behaviour, but also refer to randomizing and probability as two phenomena. It is unclear if these labels are synonyms or not.

7. What do we learn from the neuroimaging and tDCS studies regarding probability matching in your literature review? These insights feel disconnected from the rest of the literature review. Also, I think many have no clue what a tDSC study entails, so either explain better and embed within your framework, or leave it out.

8. Why did you pick such valanced images as Brad and Angelina and did not go for a neutral binary choice? Especially, since your aim is to ‘use a choice between two images rather than a more financially-related tasks (such as guessing the outcome of asset-price movements) in the hopes of triggering a more primitive form of decision-making in our subjects’. (r. 425-427). What is primitive about choosing between Brad and Angelina?

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Kim Fairley

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

Attachment

Submitted filename: To maximize or randomize.docx

PLoS One. 2021 Aug 26;16(8):e0252540. doi: 10.1371/journal.pone.0252540.r002

Author response to Decision Letter 0


6 Apr 2021

We thank both the editor and the reviewer for their very helpful comments on our work—they have greatly improved the manuscript. The revised manuscript incorporates all of these comments and suggestions and point-by-point responses are provided. We have also provided a revised manuscript with track changes.

Attachment

Submitted filename: 40629 R2R.pdf

Decision Letter 1

Jason Anthony Aimone

18 May 2021

To maximize or randomize? An experimental study of probability matching in financial decision making

PONE-D-20-40629R1

Dear Dr. Zhang,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Jason Anthony Aimone

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors have adequately revised their manuscript, in particular the analysis is more sound and the added explanations and examples provide better intiuition.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Kim Fairley

Acceptance letter

Jason Anthony Aimone

17 Aug 2021

PONE-D-20-40629R1

To maximize or randomize?  An experimental study of probability matching in financial decision making

Dear Dr. Zhang:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Jason Anthony Aimone

Academic Editor

PLOS ONE


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES