Abstract
Flow is a subjective state characterized by immersion and engagement in one’s current activity. The benefits of flow for productivity and health are well-documented, but a rigorous description of the flow-generating process remains elusive. Here we develop and empirically test a theory of flow’s computational substrates: the informational theory of flow. Our theory draws on the concept of mutual information, a fundamental quantity in information theory that quantifies the strength of association between two variables. We propose that the mutual information between desired end states and means of attaining them, I(M;E), gives rise to flow. We support our theory across five experiments (four preregistered) by showing, across multiple activities, that increasing I(M;E) increases flow and has important downstream benefits, including enhanced attention and enjoyment. We rule out alternative constructs including alternative metrics of associative strength, psychological constructs previously shown to predict flow, and various forms of instrumental value.
Subject terms: Decision making, Human behaviour
Flow is a desired but elusive state characterized by the subjective experience of immersion and engagement in an activity. Here, the authors develop and empirically validate a formal model that specifies variables and computations involved in the subjective experience of flow.
Introduction
Peak human performance emerges during the experience of flow—the subjective state of being immersed in one’s current task1–3. Flow enhances learning and academic achievement4–6, boosts productivity, fosters artistic talent7, and improves objective measures of athletic skill8,9. Flow also promotes subjective well-being7,10; positive affect has been found to increase with the amount of flow experienced during the workday11, and the absence of flow has been linked to depression2.
Flow is a potential wellspring of achievement and well-being, but often goes underutilized. People frequently find their most important tasks tedious rather than immersive1,2,10,12. To help people reap the benefits of flow, we must achieve a deeper understanding of the mechanisms through which flow emerges. Indeed, existing descriptions of the flow-generating process are underspecified, cast in terms of abstract concepts rather than mathematically precise computational models. Only by grounding the flow-generating process in formal theoretical structures can we identify precisely which parameters must be adjusted, and by how much, to maximize flow for particular people and contexts. We pursue this aim in the present paper by proposing and testing a computational theory of flow.
Our proposal draws inspiration from the disparate fields of social psychology and artificial intelligence, which have converged on similar ideas with relevance to flow. Social psychologists have developed the concept of means-ends fusion to explain what makes some activities more intrinsically interesting than others13–17. The idea is that intrinsic interest emerges from mental associations between desired end states (e.g., bowling a strike) and means of attaining them (e.g., rolling a ball); the more a means and end are associated—or fused—the more interest the means evokes. The concept of intrinsic interest is, if not identical to flow, closely related3, suggesting that the abstract notion of means-ends fusion could guide and constrain the search for flow-generating mechanisms.
Intriguingly, what could be interpreted as a formal definition of means-ends fusion appears in the field of artificial intelligence. It has proven useful to have artificial agents aim to maximize a quantity called empowerment: the maximum of the mutual information between the agent’s actions and end states18,19. Mutual information is a fundamental quantity from information theory that measures the degree of association between two random variables20. Accordingly, the mutual information between actions and end states can be interpreted as the associative strength, or fusion, between means and ends. Other formulations of means-ends fusion are possible, but mutual information is an especially promising candidate, in part because empowerment-maximizing agents are reminiscent of humans experiencing flow: people tend to pursue flow-inducing activities for their own sake3, and agents that maximize empowerment tend to learn, explore, and act meaningfully in the absence of external rewards and punishments21–26.
The consilience between means-ends fusion and empowerment led us to integrate these concepts into a computational theory of flow. The crux of our proposal is this: Flow is an increasing function of the mutual information between desired end states and means of attaining them. As the mutual information between a means and its end increases, so does the degree of flow. We call this the informational theory of flow. Next, we specify what we mean by “means” and “ends,” and how the mutual information between them is computed.
We equate means and ends with the random variables M and E, respectively, where M denotes a state brought about to achieve a goal, and E denotes the outcome of goal-pursuit. Most activities can be represented in multiple ways, making M and E perceiver-dependent27. Consider a dart-throwing game that rewards players for hitting a bullseye. For one person, M and E could be binary variables denoting whether the bullseye was hit or missed, and whether or not a reward was received, respectively. For someone else, M and E may be continuous variables denoting the dart’s proximity to the bullseye and the size of the reward. Also note that, as this example illustrates, M denotes a state brought about by a goal-directed motor command (e.g., hitting or missing a bullseye), not the motor command itself (e.g., the motor command that implements dart throwing). This echoes ideomotor theory28, which proposes that actions are encoded in terms of the sensory states they elicit, rather than the motor commands that generate them.
In principle, any goal-directed activity can be decomposed into means and ends, from reading a novel (where E could be “discovered the protagonist’s fate” and M could be “finished the next chapter”) to dancing a tango (where E could be “impressed my partner” and M could be “step forward with right foot passing left foot”). Indeed, the very definition of “goal-directed activity” stipulates the existence of a means (the activity) and an end (the goal to which the activity is directed).
Mutual information quantifies the dependence between two random variables as the degree to which observing the value of one variable reduces uncertainty (scored as entropy) about the value of the other. The mutual information between M and E, denoted as I(M;E), quantifies the degree to which observing M (e.g., whether the bullseye was hit or missed) is expected to reduce uncertainty over E (e.g., whether a reward will be received). I(M;E) is maximized when two conditions are met: (i) before observing M, the value of E is completely uncertain (e.g., before hitting or missing the bullseye, the probability of reward is 50%), and (ii) after observing M, the value of E is completely certain (e.g., after hitting or missing the bullseye, the probability of reward is 100% or 0%). I(M;E) is minimized when observing the value of M fails to reduce any uncertainty about the outcome of E (e.g., when the probability of reward is the same regardless of whether the bullseye is hit or missed).
To see how I(M;E) is computed, consider two probability functions: p(M) and p(E | M). p(M) specifies the subjective probability (or likelihood) of observing each possible value of M. If M has two possible values, successful and unsuccessful, p(M) specifies the subjective probability of performing the means successfully versus unsuccessfully. p(E | M) specifies the subjective probability (or likelihood) of observing each possible value of E conditional on each possible value of M. Suppose that E has two possible values: attained and unattained. In this case, p(E | M) specifies the probability of the end being attained versus unattained conditional on performing the means successfully versus unsuccessfully. Given p(M) and p(E | M), I(M;E) can be computed:
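$$I(M;E) = \sum_{m \in M} \sum_{e \in E} p(m, e) \log \frac{p(m, e)}{p(m)\, p(e)} \tag{1}$$
where p(m, e) = p(e | m) p(m) and p(e) = Σ_m p(m, e).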
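As a minimal sketch of this computation, the Python function below evaluates the mutual information between means and ends from p(M) and p(E | M) using the standard joint-probability formula. The function name, the dictionary layout, and the choice of base-2 logarithms (so that results are in bits) are assumptions made for illustration. The dart probabilities mirror the maximizing and minimizing cases described above.

```python
import math

def mutual_information(p_m, p_e_given_m):
    """Mutual information I(M;E) in bits.

    p_m: dict mapping each value m of the means to p(m).
    p_e_given_m: dict mapping each m to a dict {e: p(e | m)}.
    """
    # Marginal distribution of the end: p(e) = sum over m of p(m) * p(e | m).
    p_e = {}
    for m, pm in p_m.items():
        for e, pem in p_e_given_m[m].items():
            p_e[e] = p_e.get(e, 0.0) + pm * pem

    total = 0.0
    for m, pm in p_m.items():
        for e, pem in p_e_given_m[m].items():
            joint = pm * pem
            if joint > 0:  # terms with zero joint probability contribute nothing
                total += joint * math.log2(pem / p_e[e])
    return total

# Hypothetical dart game: the bullseye is hit half of the time.
p_m = {"hit": 0.5, "miss": 0.5}

# Reward is fully determined by hitting or missing the bullseye.
best = mutual_information(
    p_m,
    {"hit": {"reward": 1.0, "none": 0.0}, "miss": {"reward": 0.0, "none": 1.0}},
)

# Reward is equally likely whether or not the bullseye is hit.
worst = mutual_information(
    p_m,
    {"hit": {"reward": 0.5, "none": 0.5}, "miss": {"reward": 0.5, "none": 0.5}},
)

print(best, worst)
```

With two equally likely and fully diagnostic means states, the first case reaches the one-bit ceiling for a binary end, while the second case yields zero because observing the means leaves the reward probability unchanged.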
According to the informational theory of flow, flow is a monotonically increasing function of I(M;E). Evidence for this theory can be found in activities known to elicit flow. Consider slot machines. How do such simple devices develop such a powerful hold on so many players? Part of the answer, according to our theory, is that slot machines have very high levels of I(M;E): Prior to observing the symbols on the reel, M, the size of the payout, E, is extremely uncertain, but as soon as M is observed, all uncertainty is eliminated. If a slot machine’s level of I(M;E) were lowered, there is little doubt that flow would decline with it. Imagine a slot machine whose value of I(M;E) is zero. By definition, such a machine would always stop on the same symbols, or its symbols would be unrelated to the size of its payouts. In both cases, observing M would reduce zero uncertainty about E, and in both cases, flow would surely plummet.
Not all of our theory’s predictions are intuitive. As we will see, the informational theory of flow sometimes says flow should be relatively high when factors commonly linked to flow, like skill-challenge balance (the degree to which the difficulty of one’s task feels commensurate with one’s ability) and controllability (the sense of being in control over one’s outcomes)1,3, are relatively low. It also assumes that flow is insensitive to variation in instrumental value, allowing for flow to persist, and even grow stronger, in the face of diminishing rewards and increasing punishments. We tested these predictions and more across five experiments.
Results
Most experiments leveraged the “tile game,” a computer-based task designed to achieve precise control over p(M) and p(E | M) (Fig. 1a). On each trial, a tile appears at the center of the screen for a predetermined amount of time. Participants attempt to activate the tile, making it change color, by pressing their spacebar before it disappears. Whether or not the tile is activated determines the probability of receiving a jackpot on the current trial. If a player receives a jackpot, the next screen displays a pleasant image, and $0.10 is added to a bonus fund, which participants receive at the end of the study. If a jackpot is not received, the next screen displays an unpleasant image, and $0 is added to the bonus fund.
The tile game’s instructions, and the design of the game itself, encourage participants to represent both M and E as having two possible values: M = 1 if the tile is activated, M = 0 if the tile is not activated, E = 1 if a jackpot is received, and E = 0 if a jackpot is not received. For simplicity, the value of p(E = 1 | M = 0) is constrained to equal 1 − p(E = 1 | M = 1). Accordingly, I(M;E) is a function of two parameters: p(M = 1) and p(E = 1 | M = 1).
Unbeknownst to participants, the tile game includes two types of trials: miss trials, where the tile disappears after 250 ms, and hit trials, where the tile disappears after 750 ms. Responding in under 250 ms is nearly impossible, and responding in under 750 ms is trivial. Thus, the percentage of hit trials corresponds to the value of p(M = 1) (for analyses confirming the success of our manipulation of p(M = 1), see Supplementary Information). The timing manipulation is not experienced as such. It creates the illusion of responding slightly too slowly on some trials and just in time on other trials. We assume that participants tracked the value of p(M = 1) either consciously or unconsciously29,30.
To manipulate p(E = 1 | M = 1) and p(E = 1 | M = 0), we told participants the probability of attaining a jackpot conditional on activating versus not activating the tile. The true probabilities always matched these instructions. Pressing the spacebar too early produced a warning message lasting 3.5 s to disincentivize spamming the spacebar. Critically, the average amount of money obtained from the tile game is orthogonal to the value of I(M;E) (Fig. 1b).
Experiments 1 and 2
In experiments 1 and 2, participants played two versions of the tile game (order counterbalanced), each lasting 50 trials. The games were distinguished by name and appearance: in the green game, activated tiles turned green, and in the blue game, activated tiles turned blue. For each participant and version of the game, we randomly selected p(M = 1) from the set {.2, .3, .4, .5, .6, .7, .8}, and p(E = 1 | M = 1) from the set {.6, .7, .8, .9, 1}, with the constraint that neither parameter could be identical across both games. After each game, participants completed measures of flow, skill-challenge balance, and controllability (see Methods). Experiments 1 and 2 were identical except that experiment 2 included additional dependent measures, described below. Statistics are displayed in Figs. 2–5.
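The design of the tile game can be checked numerically. The sketch below rests on two assumptions that are our reading of the text rather than confirmed details: that the hit probability is drawn from {.2, ..., .8} and the jackpot probability after a hit from {.6, ..., 1}, and that the jackpot probability after a miss equals one minus the jackpot probability after a hit. Under those assumptions it computes the mutual information (in bits) and the expected jackpot rate over the full parameter grid, then correlates the two. All function names are hypothetical.

```python
import math

def binary_entropy(x):
    """Entropy in bits of a binary variable with success probability x."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)

def jackpot_rate(p_hit, q):
    """Overall jackpot probability, assuming p(jackpot | miss) = 1 - q."""
    return p_hit * q + (1 - p_hit) * (1 - q)

def tile_game_mi(p_hit, q):
    """Mutual information between tile state and jackpot state, in bits."""
    # Under the assumed constraint, the conditional entropy of the jackpot
    # is binary_entropy(q) after hits and after misses alike.
    return binary_entropy(jackpot_rate(p_hit, q)) - binary_entropy(q)

def pearson(xs, ys):
    """Pearson correlation between two equal-length lists."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

p_hit_values = [0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8]
q_values = [0.6, 0.7, 0.8, 0.9, 1.0]
grid = [(p, q) for p in p_hit_values for q in q_values]

mi = [tile_game_mi(p, q) for p, q in grid]
rate = [jackpot_rate(p, q) for p, q in grid]
r = pearson(mi, rate)

print(f"max mutual information: {max(mi):.3f} bits")
print(f"correlation with jackpot rate: {r:.6f}")
```

Because the mutual information is symmetric around a hit probability of .5 while the jackpot rate is antisymmetric around it, the correlation across this grid vanishes up to floating-point error, which is one way the two quantities can be made orthogonal.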
Flow is a positive function of I(M;E)
As predicted by the informational theory of flow, flow was a positive function of I(M;E) in both experiments (Fig. 2a). Subsequent analyses confirmed that the effect of I(M;E) on flow was positive over the full range of I(M;E). Aggregating the data from both experiments, we fit a generalized additive model (GAM), a statistical technique in which outcomes are assumed to depend on smooth, nonparametric functions of the predictors. Unlike linear regression, GAMs can discover nonlinearities that would violate the informational theory of flow. The result, however, supports our theory: The effect of I(M;E) on flow was everywhere positive (Fig. 2b).
Next, we fit a GAM that modeled flow in terms of p(M = 1) and p(E = 1 | M = 1), and generated a matrix containing the predicted value of flow for each combination of the two parameters (Fig. 2c). If flow is a positive function of I(M;E), this matrix should align with the matrix representing I(M;E) in terms of p(M = 1) and p(E = 1 | M = 1) (Fig. 1b). Consistent with this, the two matrices were correlated at r = .88 (p < .001).
We turn now to a stricter test of our theory, one that accounts for the fact that I(M;E) is a function of multiple variables, introducing the possibility that flow is not a function of I(M;E) per se, but of a subset of the terms used to compute I(M;E). Indeed, I(M;E) can be expressed as:
$$I(M;E) = H(E) - H(E \mid M) \tag{2}$$
where H is Shannon entropy. Either entropy term on its own could fully explain a positive effect of I(M;E) on flow: flow could be a positive function of H(E) only, or a negative function of H(E | M) only. To show that flow is a positive function of I(M;E) per se, we must show that it is a positive function of H(E), a negative function of H(E | M), and that both effects are equivalent in magnitude. Accordingly, we simultaneously regressed flow on H(E) and H(E | M). Equation 2 is one of several formulations of I(M;E), but two features make it uniquely suited to our analytic approach: it describes a linear function, and the correlation between its variables, H(E) and H(E | M), is small enough to avoid problematic multicollinearity (experiment 1: r = .179; experiment 2: r = .166).
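To illustrate the entropy-based decomposition of mutual information, the difference between the entropy of the end and its conditional entropy given the means, the sketch below computes both entropy terms for one hypothetical binary game and confirms that their difference matches the value obtained directly from the joint distribution. The probabilities, variable names, and base-2 logarithms are assumptions chosen for illustration.

```python
import math

def entropy(probs):
    """Shannon entropy in bits of a discrete distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

# Hypothetical game: rows index the means (miss, hit),
# columns index the end (no jackpot, jackpot).
p_m = [0.3, 0.7]
p_e_given_m = [
    [0.8, 0.2],  # p(E | miss)
    [0.2, 0.8],  # p(E | hit)
]

# Joint and marginal distributions.
joint = [[pm * pe for pe in row] for pm, row in zip(p_m, p_e_given_m)]
p_e = [sum(col) for col in zip(*joint)]

# Entropy terms that serve as separate regressors.
h_e = entropy(p_e)
h_e_given_m = sum(pm * entropy(row) for pm, row in zip(p_m, p_e_given_m))
mi_from_entropies = h_e - h_e_given_m

# Direct computation from the joint distribution, for comparison.
mi_from_joint = sum(
    pj * math.log2(pj / (p_m[i] * p_e[j]))
    for i, row in enumerate(joint)
    for j, pj in enumerate(row)
    if pj > 0
)

print(h_e, h_e_given_m, mi_from_entropies, mi_from_joint)
```

Keeping the two entropy terms separate is what makes it possible to ask whether flow tracks each term with opposite sign and equal magnitude, rather than tracking only one of them.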
Our entropy-based analyses suggest that flow is a positive function of I(M;E) per se rather than individual terms used to compute it (Fig. 2a). H(E) had positive effects, H(E | M) had negative effects, we could not reject the hypothesis that these effects are equivalent in magnitude (experiment 1: χ2(1) = 3.59, p = .058; experiment 2: χ2(1) = 1.13, p = .287), and the Bayesian information criterion (BIC) favored models that treated flow as a function of I(M;E) (experiment 1: BIC = 2731; experiment 2: BIC = 1868) over models that treated flow as a function of H(E) and H(E | M) (experiment 1: BIC = 2734; experiment 2: BIC = 1873).
The effects of I(M;E) on flow cannot be explained in terms of expected value or skill-challenge balance. I(M;E) was uncorrelated with skill-challenge balance in experiment 1, negatively correlated with skill-challenge balance in experiment 2, and uncorrelated with expected value in both experiments (Fig. 3). Adjusting for expected value or skill-challenge balance never eliminated the effect of I(M;E) on flow (see Supplementary Information).
I(M;E), enjoyment, and attention
Flow often coincides with enjoyment and improved attentional performance1,3,6,31–33, so we measured these outcomes in experiment 2. Showing that I(M;E) predicts enjoyment or attention would bolster our claim that flow increases with greater I(M;E) while demonstrating that I(M;E) can deliver benefits beyond the subjective experience of immersion.
Enjoyment
We collected two measures of enjoyment: a continuous self-report measure (administered after each game), and a binary choice measure that asked participants which game they would prefer to play again (administered at the end of the experiment; see Methods). The effect of I(M;E) on the continuous measure was positive but not significant (Fig. 4). However, I(M;E) had a significant effect on the binary choice measure such that the greater the value of I(M;E) in one game versus the other, the likelier that game was to be chosen (Fig. 4). This finding supports the idea that I(M;E) predicts enjoyment, providing converging evidence for the informational theory of flow.
Attention
In the tile game, greater attention should make responses to the tiles faster34,35 and less variable36–39. Thus, in experiment 2, we recorded response times. We operationalized attention as the average time between the onset of the gray tile and the pressing of the spacebar (RT), and the intra-individual standard deviation of response times (RTSD). RTSD reflects attentional lapses and distractibility, and has been linked to attentional impairments36–39. For example, individuals with attention deficit hyperactivity disorder (ADHD) exhibit significantly greater RTSD40. If attention increases with greater I(M;E), then RT and RTSD should decrease with greater I(M;E).
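The two attention measures can be sketched in a few lines of Python. The response times below are invented for illustration, the function name is hypothetical, and the use of the sample (n − 1) standard deviation is an assumption, since the text does not say which estimator was used.

```python
import statistics

def attention_measures(response_times_ms):
    """Return (RT, RTSD) for one participant in one game.

    RT is the mean response time; RTSD is the intra-individual
    standard deviation of response times (sample estimator assumed).
    """
    rt = statistics.mean(response_times_ms)
    rtsd = statistics.stdev(response_times_ms)
    return rt, rtsd

# Hypothetical response times (ms) from tile onset to spacebar press.
trial_rts = [310, 295, 330, 305, 410]
rt, rtsd = attention_measures(trial_rts)
print(f"RT = {rt} ms, RTSD = {rtsd:.1f} ms")
```

A single slow trial, such as the 410 ms response here, raises RTSD far more than it raises RT, which is why RTSD is treated as a marker of attentional lapses.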
Increasing I(M;E) improved attentional performance as revealed by significant, negative effects on RT and RTSD (Fig. 5a). Moreover, GAMs revealed that these effects were monotonically decreasing (Fig. 5b, c), providing further support for the idea that the effect of I(M;E) on flow is monotonically increasing. These findings provide converging evidence for our theory while demonstrating its utility for optimizing attentional performance.
As expected, both measures of attention varied not only with I(M;E), but also with p(M = 1) (Supplementary Information). Increasing p(M = 1) increases RT and RTSD by reducing the average speed with which participants must respond. Critically, the linear effect of p(M = 1) cannot account for the linear effect of I(M;E) because I(M;E) is quadratic with respect to p(M = 1) (Fig. 1a). Nonetheless, we included p(M = 1) as a covariate in all analyses of RT and RTSD. Removing this covariate had no meaningful effect on our results (Supplementary Information).
Confounds
Experiments 1 and 2 include several confounds. Both I(M;E) and flow were positively associated with the following variables: marginal value, the value of information, temporal difference prediction error, the correlation between M and E, and controllability (Figs. 2a and 3). Below, we define each variable before ruling them out in experiment 3.
Marginal value
Marginal value is the difference between the average reward obtained for activating versus not activating the tile:
$$\sum_{e \in E} p(e \mid M = 1)\, r(e) - \sum_{e \in E} p(e \mid M = 0)\, r(e) \tag{3}$$
where r(e) is the financial outcome for each value of E. Intuitively, marginal value quantifies how much better it is to perform the means successfully versus unsuccessfully.
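As a small worked example, marginal value as described above (the expected reward after activating the tile minus the expected reward after failing to activate it) can be computed as follows. The function name, the dictionary layout, and the jackpot probabilities are hypothetical; the $0.10 jackpot matches the tile game described earlier.

```python
def marginal_value(p_e_given_hit, p_e_given_miss, payoff):
    """Expected reward after a hit minus expected reward after a miss.

    p_e_given_hit, p_e_given_miss: dicts mapping each outcome to its
    probability given that the tile was or was not activated.
    payoff: dict mapping each outcome to its financial value.
    """
    ev_hit = sum(p * payoff[e] for e, p in p_e_given_hit.items())
    ev_miss = sum(p * payoff[e] for e, p in p_e_given_miss.items())
    return ev_hit - ev_miss

payoff = {"jackpot": 0.10, "no_jackpot": 0.0}

# Hypothetical game: 80% jackpot chance after a hit, 20% after a miss.
mv = marginal_value(
    {"jackpot": 0.8, "no_jackpot": 0.2},
    {"jackpot": 0.2, "no_jackpot": 0.8},
    payoff,
)
print(f"marginal value: ${mv:.2f} per trial")
```

In this example a hit is worth six cents more than a miss on average, and the quantity grows as the two conditional jackpot probabilities move apart, which is why it tends to rise together with mutual information in the tile game.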
Value of information
The value of information, or VOI, quantifies the degree to which information can be used to increase expected future rewards. For instance, on each trial of the tile game, participants obtain information about their probability of obtaining jackpots, and can use this information to decide if it is more lucrative to continue playing or to quit the experiment early to pursue different activities. A recent theoretical analysis proposed that VOI may promote flow41. Accordingly, we computed VOI for each combination of p(M = 1) and p(E = 1 | M = 1) (see Methods).
Temporal difference prediction error
According to computational models of reinforcement learning, reward-predicting stimuli elicit learning signals called temporal difference prediction errors, which humans use to estimate long-run future rewards42–44. On each trial of the tile game, the value of M acts as a reward-predicting stimulus by indicating the probability of a jackpot. Therefore, we assume that on each observation of M, participants encoded a temporal difference prediction error. This variable quantifies the degree to which observing M increases or decreases the amount of money a participant expects to earn on a given trial. We computed the temporal difference prediction error for each combination of p(M = 1) and p(E = 1 | M = 1) (see Methods).
Correlation
The correlation between M and E is quantified in terms of Cramér’s phi, and is denoted as φ.
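For readers unfamiliar with this statistic, the sketch below computes Cramér’s phi (also known as Cramér’s V) from a table of joint probabilities over the means and the end. The function name and the example tables are hypothetical. For a 2 × 2 table the result equals the absolute value of the ordinary phi coefficient.

```python
import math

def cramers_phi(joint):
    """Cramér's phi from a joint probability table.

    joint: list of rows, one per value of the means; each row lists the
    joint probability of that means value with each value of the end.
    """
    row_totals = [sum(row) for row in joint]
    col_totals = [sum(col) for col in zip(*joint)]

    # Mean-square contingency: chi-square divided by sample size,
    # written directly in terms of probabilities.
    phi_squared = 0.0
    for i, row in enumerate(joint):
        for j, p in enumerate(row):
            expected = row_totals[i] * col_totals[j]
            if expected > 0:
                phi_squared += (p - expected) ** 2 / expected

    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(phi_squared / k)

# Hits always pay and misses never pay: perfect association.
perfect = cramers_phi([[0.5, 0.0], [0.0, 0.5]])

# Jackpots are equally likely after hits and misses: no association.
independent = cramers_phi([[0.25, 0.25], [0.25, 0.25]])

print(perfect, independent)
```

Like mutual information, this statistic is zero under independence and maximal under a deterministic mapping, but the two measures scale differently in between and with the number of outcomes, so they can be pulled apart experimentally.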
Controllability
We operationalized controllability as the degree to which participants reported feeling in control of their outcomes during the tile game (see Methods). Interestingly, previous work has equated controllability with an information theoretic quantity similar to I(M;E), implying that controllability and I(M;E) may be impossible to disentangle45,46. Challenging this idea, we successfully separated controllability from I(M;E) in experiment 3.
Experiment 3
We address each of the above confounds in experiment 3. All participants played two versions of the tile game in counterbalanced order: the mixture game and either the punish game or the neutral game (Fig. 6a–c). After each game, we measured flow, enjoyment (using the continuous measure from experiment 2), attention (in terms of RT and RTSD), controllability, and skill-challenge balance.
In all three games, , and . The main difference between each game is the outcome of a miss. In the neutral game, misses always yield $0; in the punish game, misses always yield a $0.02 loss; in the mixture game, misses always result in a fifty-fifty chance of $0 or a $0.02 loss. I(M;E) is greatest in the mixture game and identical across the other games (Fig. 6d). This is not true of expected value, skill-challenge balance, or any of the confounds described above (Fig. 6d). Thus, the informational theory of flow uniquely predicts that flow should be highest in the mixture game and identical in the other games. This is a surprising prediction. The basic principle that organisms aim to avoid punishment47–49 suggests that flow should be greatest in the neutral game, where punishment is least frequent. Conversely, the principle that the attentional system prioritizes punishment-related stimuli50 suggests that flow should be greatest in the punish game, where punishment is most frequent. We know of no theory besides ours that expects flow to be greatest in the mixture game, where punishment is neither least frequent nor most frequent.
Flow is a positive function of I(M;E)
We regressed flow on game (mixture vs. punish vs. neutral). As predicted, the effect of game was significant (χ2(2) = 16.68, p < .001; Fig. 7) such that flow was greatest in the mixture game (mixture vs. punish: b = .29, SE = .1, p = .005; mixture vs. neutral: b = .33, SE = .1, p = .002), and equivalent across the punish and neutral games (b = .04, SE = .14, p = .801). These findings support the informational theory of flow, and cannot be explained in terms of marginal value, VOI, temporal difference prediction error, Cramér’s phi, controllability, skill-challenge balance, or expected value.
I(M;E) and attention
Game had no effect on attention (RT: χ2(2) = 3.37, p = .185; RTSD: χ2(2) = 2.97, p = .227), likely due to a floor effect. The smallest value of I(M;E) in experiment 3 is equal to the largest value of I(M;E) from experiments 1 and 2, and increasing I(M;E) beyond this point may have diminishing returns on RT and RTSD due to physical limitations; eventually, people cannot respond any faster or less variably51. Inspection of the RT distribution confirmed that participants were close to floor: 33% of mean RTs were 300 ms or less and the median was 320 ms. For reference, when analyzing RT data, researchers often exclude participants with mean RTs of 300 ms or less because such participants are considered extreme outliers52–54. For the lowest values of I(M;E) in experiment 2 (those in the bottom quartile), RTs were significantly slower: only 19% of mean RTs were 300 ms or less (χ2(1) = 7.69, p = .006) and the median was 336 ms (Wilcoxon rank-sum test, p = .003), providing room for faster responding not present in experiment 3.
I(M;E) and enjoyment
Converging support for the informational theory of flow comes from analyses of enjoyment. The effect of game was significant (χ2(2) = 12.5, p = .002; Fig. 7), such that participants enjoyed the mixture game significantly more than the punish game (b = .34, SE = .1, p < .001) and nonsignificantly more than the neutral game (b = .13, SE = .1, p = .195). We found no significant difference in enjoyment across the punish and neutral games (b = .21, SE = .14, p = .134). It is noteworthy that the neutral game, which does not involve punishment, was not enjoyed more than the mixture game, which does involve punishment. Apparently, the effect of I(M;E) on enjoyment is powerful enough to overcome aversion to negative outcomes.
Experiment 4
If instead of playing the tile game participants merely observed it, would the mutual information between the tile state (hit vs. miss) and jackpot state (jackpot vs. no jackpot) still predict flow55,56? Our theory says it would not. The M in I(M;E) denotes a state that someone brings about to achieve their goal, and someone who merely observes the tile game would not bring about the tile state. Thus, for observers, the tile state would not correspond to M, and the mutual information between tile state and jackpot state would not correspond to I(M;E). Since our theory says that flow is a function of I(M;E) specifically, not mutual information generally, it predicts that when the tile game is merely observed, the mutual information between the tile state and jackpot state does not predict flow.
We tested this prediction by assigning each participant to a play condition, where they played the tile game from experiments 1 and 2, or an observe condition, where they merely observed the tile game (see Methods). Participants played or observed two games following the procedures from experiments 1 and 2. Jackpots were worth 1 cent. After each game, we measured flow and enjoyment. RT-based measures of attention were collected only in the play condition, as the observe condition prohibits responding. Analyses of enjoyment and attention unanimously supported the informational theory of flow (Fig. 8).
Flow is a positive function of I(M;E) specifically, not mutual information generally
Due to a coding error, the randomly selected values of p(M = 1) and p(E = 1 | M = 1) were not recorded in the play condition. Accordingly, we computed the empirical value of I(M;E) using the values of M and E actually produced by each participant. (In experiments 1 and 2, the empirical value of I(M;E) was almost perfectly correlated with the true value at r > .98, and all analyses produced equivalent results regardless of which value we used.)
We regressed flow on condition (play vs. observe), mutual information, and their interaction term. As predicted, we found a significant interaction (b = .43, SE = .17, p = .013) such that flow was a positive function of mutual information among players but not observers (Fig. 8), suggesting that flow is a positive function of I(M;E) specifically, not mutual information generally. Further support for the informational theory of flow comes from our entropy-based analyses (Fig. 8): In the play condition, flow was a positive function of H(E) and a negative function of H(E | M), and we found no evidence that these effects differ in magnitude (χ2(1) = .74, p = .391).
Experiment 5
In experiment 5, we generalize the informational theory of flow to new tasks. Instead of creating tasks, we subjected our theory to the critical test of making correct predictions about existing activities developed without any intention of supporting our theory. The activities we chose are two of the world’s oldest games: Rock, Paper, Scissors, known as shoushiling in the time of the Chinese Han dynasty57, and Odds vs. Evens, known as ludere par impar in ancient Rome58 (Fig. 9a, b). In Rock, Paper, Scissors, , and in Odds vs. Evens, (see Methods), so our theory predicts that Rock, Paper, Scissors elicits more flow. This finding would further rule out four alternative constructs: marginal value, VOI, and temporal difference prediction error, which are higher in Odds vs. Evens, and Cramér’s phi, which is identical across both games (Fig. 9d). The games in experiment 5 bear little resemblance to the tile game. For instance, the tile game is presented as a game of skill, whereas the hand games are games of chance: players cannot benefit financially from trying harder or paying more attention. This change eliminates features that, according to prior research, promote feelings of agency and control59,60, which some consider important for flow3.
Participants played 50 rounds of both games against a computer that chose hands at random. Participants earned $0.03 for every win and lost $0.01 for every loss. Draws, which are possible only in Rock, Paper, Scissors, were worth $0. After each game, we measured flow and enjoyment.
The informational theory of flow generalizes
Rock, Paper, Scissors elicited significantly more flow (b = .59, SE = .08, p < .001) and enjoyment (b = .59, SE = .07, p < .001) than Odds vs. Evens (Fig. 9c). These findings confirm that the informational theory of flow generalizes beyond the task originally designed to test it, and further rule out marginal value, VOI, temporal difference prediction error, and Cramér’s phi as alternative constructs.
Discussion
Flow is considered a key contributor to health, productivity, and well-being1–3,10,11,31, yet a rigorous description of its computational substrates remains elusive. To understand the nature of flow, and to help people regulate flow in their daily lives, it is necessary to ground the concept of flow in formal theoretical structures. We developed such a structure, the informational theory of flow, and obtained empirical support for it across five experiments by showing that flow, along with enjoyment and attention, increases as a function of I(M;E).
I(M;E), it seems, is an important contributor to flow. But why? What do humans have to gain by becoming immersed when I(M;E) is high? We speculate that the link between I(M;E) and flow facilitates the fundamental task of learning associations between actions and desired outcomes. This task is complicated by the fact that every desired outcome (e.g., sating hunger) is associated with a relatively small number of actions (e.g., eating); most action-outcome pairs are unrelated (e.g., sating hunger and knitting). We must restrict our learning efforts to the subset of valid action-outcome pairs. Otherwise, we risk wasting resources trying to learn associations that do not exist. One way to meet this challenge is to become immersed when I(M;E) is high, and grow bored when I(M;E) is low. Indeed, the greater the value of I(M;E), the more information can be gained by learning the relationship between M and E. Accordingly, the positive effect of I(M;E) on flow may serve the function of steering us toward learning opportunities and away from epistemic dead ends.
Another open question is how human brains might compute I(M;E). When considering the complexities of many real-world activities, such as extended action sequences and hierarchical structure, it becomes clear that computing I(M;E) exactly is intractable. Accordingly, brains likely implement algorithms that quickly and efficiently approximate I(M;E). Examples of such algorithms are emerging in the AI literature21,23 and may serve as inspiration for biologically plausible implementations of the informational theory of flow.
Two caveats deserve spotlighting. First, the present work does not suggest that I(M;E) is the sole contributor to flow, nor does it suggest that I(M;E) contributes to flow across all contexts. The informational theory of flow may yet be expanded by discoveries of additional inputs to the flow-generating process, and contracted by discoveries of contexts in which I(M;E) fails to predict flow. A second caveat is that the quantity at the heart of our theory, I(M;E), is a function of variables whose properties are subjective. What M and E denote in a given task depends on how the person performing the task construes their means and end. On the one hand, the perceiver-dependence of M and E allows our theory to explain individual and situational differences in how much flow a particular activity elicits. On the other hand, it makes our theory difficult to apply to tasks with many possible means-end representations, a challenge we overcame by using tasks with clear means and ends. Expanding our theory to more ambiguous tasks hinges on the progress of ongoing research exploring how humans represent task structure27,61–64. With better theories of how humans carve activities into means and ends, the informational theory of flow will become easier for researchers to falsify and for practitioners to apply.
In addition to raising challenging new questions, the present work supports the recent movement to give computational tools a more prominent role in social psychology65. As others have argued, grounding social psychological phenomena in formal theoretical structures can help us deliver more robust and replicable solutions to societal challenges. This idea has started to take hold, leading social psychologists to import formalisms from a variety of frameworks, such as reinforcement learning, probability theory, and utility theory65. We continue this trend by importing the formalism of mutual information — a concept from information theory, which remains underutilized in social psychology. In this way, the informational theory of flow offers insights into task immersion and engagement, and expands the conceptual toolkit for social psychological theorizing and model building.
Methods
This research was approved by the Human Subjects Committee of Yale University, New Haven, CT, USA. All participants provided informed consent and were compensated for their time. We preregistered experiments 2 (https://aspredicted.org/cx6up.pdf), 3 (https://aspredicted.org/m3ez6.pdf), 4 (https://aspredicted.org/ef7g5.pdf) and 5 (https://aspredicted.org/y4wx9.pdf). Based on pilot data, we expected $I(M;E)$ to have a small-to-medium sized effect on flow (r = .2), requiring a sample of 194 participants to achieve 80% power for two-sided tests. Thus, we aimed for a final sample size of at least 200 in each experiment. We recruited participants using Prolific and recorded data using jsPsych (version 6.1.0). Unless otherwise specified, all analyses were conducted using linear mixed models (LMMs) with subject-level random intercepts fit using the lme4 package in RStudio version 1.3 using R version 3.6, and game order (first game vs. second game) was included as a nuisance regressor.
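The reported sample size can be approximated with the Fisher z-transformation of the correlation coefficient. The sketch below shows one standard approximation, not necessarily the procedure the authors used; the function name and the default significance level of .05 are assumptions.

```python
import math
from statistics import NormalDist


def n_for_correlation(r, alpha=0.05, power=0.80):
    """Approximate N needed to detect a correlation r with a two-sided test.

    Uses the Fisher z approximation: N = ((z_alpha/2 + z_power) / atanh(r))^2 + 3.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # critical value, two-sided
    z_power = NormalDist().inv_cdf(power)          # quantile for desired power
    n = ((z_alpha + z_power) / math.atanh(r)) ** 2 + 3
    return math.ceil(n)


required_n = n_for_correlation(0.2)
print(required_n)  # 194
```

Under these assumptions the approximation reproduces the figure of 194 participants stated above.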
Participants
We recruited 400 participants in experiment 1, 300 participants in experiments 2 and 3, 1000 in experiment 4, and 400 in experiment 5. Our final samples after exclusions included N = 365 in experiment 1 (62% female; mean age = 35), N = 249 in experiment 2 (68% female; mean age = 26), N = 236 in experiment 3 (52% female; mean age = 32), N = 941 in experiment 4 (59% female; mean age = 40), and N = 397 in experiment 5 (60% female; mean age = 37).
Data processing and multiverse analysis
Participants were excluded if any of the following conditions were met: (i) they failed to answer every self-report question, (ii) during at least one tile game, they activated the tile at least five times more or less than they should have given the value of , or (iii) during at least one tile game, they pressed their spacebar preemptively (i.e., before the tile appeared) at least 10 times. In experiment 4, we needed a sufficient number of hit trials to estimate , so we excluded participants who did not have at least five (N = 11). When calculating mean RT and RTSD, we excluded extremely fast responses (RT < 100 ms) and responses that came immediately after a preemptive response (preemptive responses were followed by an attention-grabbing warning message, “TOO FAST!”, which may influence responding on the following trial). All exclusion criteria were chosen prior to data analysis, but in some cases, they deviated from our preregistrations. Accordingly, we performed a multiverse analysis in which we applied all possible permutations of exclusion criteria, data transformation (i.e., log transforming versus not), and model specification (e.g., including versus not including covariates) (see Supplementary Information). Significant effects in the main text remained significant across the vast majority of the multiverse; marginal effects in the main text were, predictably, less robust. In general, the most reliable effects were those of $I(M;E)$ on flow and attention. Effects of $I(M;E)$ on enjoyment were more sensitive to data processing decisions.
Flow
We measured flow immediately after each tile game was completed. In experiments 1–3, we asked how immersive, engaging, engrossing, and addictive the game was. These items were adopted from existing measures of flow66. Participants responded on a 9-point scale from 1 = Not at all to 9 = Extremely. A principal components analysis with varimax rotation suggested that the “addictive” item did not load on the same factor as “immersive,” “engaging,” and “engrossing” (Supplementary Information, Supplementary Table 1). However, all significant results remained significant, and all non-significant results remained non-significant, when analyses were run with a three-item measure composed of the “immersive,” “engaging,” and “engrossing” items. In experiments 4 and 5, we replaced the “addictive” item with an item that asked how absorbing the game was. Cronbach’s α for the flow measure was at least .92 in all studies.
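For readers unfamiliar with the reliability statistic reported above, the following sketch computes Cronbach’s α from item-level ratings using the standard formula. The function name and the toy ratings are hypothetical and are not drawn from the study’s data.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a scale.

    items: list of k lists, one per item, each holding one rating per respondent.
    alpha = k / (k - 1) * (1 - sum of item variances / variance of total scores)
    """
    k = len(items)
    totals = [sum(ratings) for ratings in zip(*items)]  # per-respondent totals
    item_variance_sum = sum(variance(ratings) for ratings in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Hypothetical ratings from four respondents on two items.
toy_items = [[1, 2, 3, 4], [2, 1, 4, 3]]
toy_alpha = cronbach_alpha(toy_items)
print(round(toy_alpha, 2))  # 0.75
```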
Skill-challenge balance
Participants rated skill-challenge balance by answering the question, “Was the [green game / blue game] too easy, too hard, or just the right level of difficulty?” on a 9-point scale from 0 = “Way too easy” to 4 = “Just right” to 8 = “Way too hard”. Scores were computed as four minus the absolute distance of the response from “Just right” (i.e., $4 - |x - 4|$ for a response $x$), so that higher scores indicate a better balance between skill and challenge.
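The scoring rule for skill-challenge balance can be written out as a short function. This is a minimal sketch of the rule as described in the text; the function name and the input check are illustrative.

```python
def skill_challenge_score(rating, just_right=4):
    """Convert a 0-8 difficulty rating into a 0-4 balance score.

    The score is four minus the absolute distance from "Just right" (4),
    so both "Way too easy" (0) and "Way too hard" (8) score 0.
    """
    if not 0 <= rating <= 8:
        raise ValueError("rating must lie on the 0-8 response scale")
    return just_right - abs(rating - just_right)


scores = [skill_challenge_score(r) for r in range(9)]
print(scores)  # [0, 1, 2, 3, 4, 3, 2, 1, 0]
```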
Controllability
Participants rated controllability by answering the question, “While playing the [green game / blue game], how much control did you feel you had over the outcome of each trial?” on a 9-point scale from 0 = “Zero control” to 8 = “Complete control”.
Enjoyment
The continuous measure of enjoyment consisted of five items. Participants indicated how enjoyable, fun, and entertaining the game was on a 9-point scale from 1 = Not at all to 9 = Extremely, and indicated how much they liked and disliked the game on a 9-point scale from 1 = Not at all to 9 = Very much. Cronbach’s α for the continuous measure of enjoyment was at least .91 in all studies.
Response time analyses
All response time data (RT and RTSD) were log transformed to correct for skewness. In experiment 1, the tile game was not programmed with the intention of analyzing RT data. Specifically, it did not record RTs on trials where the tile was not activated (i.e., if the spacebar was pressed after the gray tile disappeared, the RT was not recorded). It did, however, record RTs on trials where the tile was activated (i.e., RTs were recorded if the spacebar was pressed before the gray tile disappeared), so we analyzed these data in an exploratory fashion. The results were consistent with the findings from experiments 2 and 4: $I(M;E)$ had significant, negative effects on RT and RTSD (see Supplementary Information).
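The response time preprocessing can be sketched as follows. This is an illustration under stated assumptions, not the authors’ pipeline: it assumes that the log transform is applied to each participant’s mean RT and RTSD, that preemptive trials themselves are dropped, and that trials arrive as (RT in ms, preemptive flag) pairs. The function name and data layout are hypothetical.

```python
import math
from statistics import mean, stdev


def rt_summary(trials, min_rt_ms=100.0):
    """Return (log mean RT, log RTSD) for one participant in one game.

    trials: ordered list of (rt_ms, preemptive) pairs; rt_ms is None when
    no RT was recorded. Assumed exclusions: RTs under min_rt_ms, preemptive
    trials, and the trial immediately following a preemptive response.
    """
    kept = []
    previous_was_preemptive = False
    for rt_ms, preemptive in trials:
        usable = (
            rt_ms is not None
            and not preemptive
            and rt_ms >= min_rt_ms
            and not previous_was_preemptive
        )
        if usable:
            kept.append(rt_ms)
        previous_was_preemptive = preemptive
    return math.log(mean(kept)), math.log(stdev(kept))


# Hypothetical trial sequence: one too-fast RT, one preemptive press.
toy_trials = [(250, False), (50, False), (None, True),
              (400, False), (350, False), (300, False)]
log_mean_rt, log_rtsd = rt_summary(toy_trials)
```

In the toy sequence, the 50 ms response, the preemptive press, and the 400 ms response that follows it are dropped, leaving 250, 350, and 300 ms.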
Temporal difference prediction error and the value of information
On each trial of the tile game, participants can choose to keep playing (i.e., they can continue the experiment) or quit (i.e., they can stop the experiment early and do something else). Let $A = \{\text{play}, \text{quit}\}$ be the set of possible actions. We assume that on each trial, $t$, participants estimate the value (in dollars) of playing and quitting, denoted as $Q_t(\text{play})$ and $Q_t(\text{quit})$, respectively. If the decision is made to quit, all money earned up to that point is lost, so we let $Q_t(\text{quit})$ be the negative of the total earnings prior to trial $t$. If the decision is made to play, one of two states is observed: “hit” (if the tile is activated) or “miss” (if the tile is not activated). Let $s_t$ be the state observed on trial $t$. The monetary value of $s_t$, denoted as $r(s_t)$, is known. In experiments 1, 2 and 4, and . In experiment 3, .1, 0, and −.02. In experiment 5, .03, 0, and −.01. Once $s_t$ is observed, $Q_t(\text{play})$ is updated by means of temporal difference:
$$Q_{t+1}(\text{play}) = Q_t(\text{play}) + \alpha \, \delta_t \tag{4}$$
where $\alpha$ is a learning rate, and $\delta_t$ is the temporal difference prediction error elicited by the observation of $s_t$:
$$\delta_t = r(s_t) - Q_t(\text{play}) \tag{5}$$
Each time the decision is made to play, information is obtained, and this information can be used to increase future reward — that is, the information has value. Let denote the value of the information associated with choosing to play on trial . It is computed as follows:
6 |
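$\pi_t(a)$ is the probability of choosing action $a$ on the current trial $t$. It is the output of the softmax choice rule applied to the estimated action values $Q_t(a)$, with inverse temperature $\beta$:
$$\pi_t(a) = \frac{\exp\left(\beta \, Q_t(a)\right)}{\sum_{a' \in A} \exp\left(\beta \, Q_t(a')\right)} \tag{7}$$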
where is an inverse temperature parameter that controls the explore-exploit tradeoff. is what the new value of action would be if the choice was made to play on trial :
8 |
is what the new probability of choosing action would be if the choice was made to play on trial :
9 |
The expectation in equation 6 is taken under the probability distribution , which is identical to in each experiment. Intuitively, quantifies the degree to which choosing to play on trial would improve the profitability of the participant’s action policy (i.e. the participant’s probability of choosing to play versus quit on the next trial).
We ran 1,000 simulations for each combination of and , with set to .9 and set to 5. For each simulation, we assumed that on all 50 trials, the choice was made to play, and computed (i) the average value of and (ii) the average of the absolute value of . For both variables, we computed the average output of each simulation to get our final estimates of VOI and .
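The simulation procedure can be illustrated with a short sketch. It is written under several assumptions that the text does not confirm: the value of information is computed as the expected policy-improvement gain, $\sum_a Q'(a)\,[\pi'(a) - \pi(a)]$, with the value of quitting held fixed in the hypothetical update; the value of playing starts at zero; and the hit probability and rewards (`p_hit`, `r_hit`, `r_miss`) are hypothetical inputs. The learning rate of .9 and inverse temperature of 5 follow the settings reported above, assuming that mapping of values to parameters.

```python
import math
import random


def softmax_play_prob(q_play, q_quit, beta):
    """Probability of choosing 'play' under a two-action softmax rule."""
    x = beta * (q_play - q_quit)
    if x >= 0:  # logistic form, written to avoid overflow
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def td_update(q_play, reward, alpha):
    """One temporal difference update; returns (new value, prediction error)."""
    delta = reward - q_play
    return q_play + alpha * delta, delta


def voi_of_playing(q_play, q_quit, p_hit, r_hit, r_miss, alpha, beta):
    """Expected policy-improvement gain from playing once (assumed VOI form)."""
    pi_play = softmax_play_prob(q_play, q_quit, beta)
    voi = 0.0
    for prob, reward in ((p_hit, r_hit), (1.0 - p_hit, r_miss)):
        q_new, _ = td_update(q_play, reward, alpha)
        pi_new = softmax_play_prob(q_new, q_quit, beta)
        # Gain = sum over actions of new value times change in choice
        # probability; Q(quit) is held fixed here (an assumption).
        gain = q_new * (pi_new - pi_play) + q_quit * (pi_play - pi_new)
        voi += prob * gain
    return voi


def simulate(p_hit, r_hit=0.10, r_miss=0.0, alpha=0.9, beta=5.0,
             n_trials=50, seed=0):
    """Play on every trial; return (mean VOI, mean absolute prediction error)."""
    rng = random.Random(seed)
    q_play, earnings = 0.0, 0.0  # initial value of playing is an assumption
    voi_sum, abs_delta_sum = 0.0, 0.0
    for _ in range(n_trials):
        q_quit = -earnings  # quitting forfeits everything earned so far
        voi_sum += voi_of_playing(q_play, q_quit, p_hit, r_hit, r_miss,
                                  alpha, beta)
        reward = r_hit if rng.random() < p_hit else r_miss
        q_play, delta = td_update(q_play, reward, alpha)
        abs_delta_sum += abs(delta)
        earnings += reward
    return voi_sum / n_trials, abs_delta_sum / n_trials


# Average over 1,000 simulated runs for one hypothetical hit probability.
runs = [simulate(p_hit=0.5, seed=s) for s in range(1000)]
mean_voi = sum(v for v, _ in runs) / len(runs)
mean_abs_delta = sum(d for _, d in runs) / len(runs)
```

Repeating the final averaging step for each parameter combination of interest would yield one VOI estimate and one absolute prediction error estimate per combination.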
Observe condition
In the observe condition of experiment 4, let denote whether the tile is activated or not, and let denote the mutual information between and . Our manipulation of in the observe condition paralleled our manipulation of in the play condition. We randomly selected the value of from the set {.2, .3, .4, .5, .6, .7, .8} and the value of from the set {.6, .7, .8, .9, 1} with the constraint that neither parameter could be identical across the two games. We let . Thus, the only difference between the play condition and the observe condition is that, in the observe condition, tile-activation was not a means. To keep participants’ eyes on the screen, we required that the spacebar be pressed at the end of each trial to advance to the next round.
$I(M;E)$ for hand games
In Rock, Paper, Scissors, $M$ can take on nine different values and $E$ can take on three different values. Each value of $M$ corresponds to a possible combination of symbols selected by the player and their opponent: {rock-rock, rock-paper, rock-scissors, paper-rock, paper-paper, paper-scissors, scissors-rock, scissors-paper, scissors-scissors}. Each value of $E$ corresponds to an outcome: {win, lose, draw}. Because the opponent chooses randomly, $P(E = e) = 1/3$ for all values of $e$ regardless of the strategy participants employ (e.g., choosing hands at random versus always choosing the same hand). Therefore, $H(E)$ always equals 1.58, where $H(E)$ is the Shannon entropy of $E$:
$$H(E) = -\sum_{e} P(e) \log_2 P(e) \tag{10}$$
The value of $E$ is fully determined by the value of $M$: $E = \text{win}$ when $M \in$ {rock-scissors, paper-rock, scissors-paper}, $E = \text{draw}$ when $M \in$ {rock-rock, paper-paper, scissors-scissors}, and $E = \text{lose}$ when $M \in$ {scissors-rock, rock-paper, paper-scissors}. Therefore, $H(E \mid M)$ always equals 0, where $H(E \mid M)$ is the Shannon entropy of $E$ conditional on $M$:
$$H(E \mid M) = -\sum_{m} \sum_{e} P(m, e) \log_2 P(e \mid m) \tag{11}$$
Subtracting $H(E \mid M)$ from $H(E)$ gives $I(M;E)$, so in Rock, Paper, Scissors, $I(M;E) = 1.58$.
In Odds vs. Evens, $M$ can take on four different values and $E$ can take on two different values. Each value of $M$ corresponds to a possible combination of numbers selected by the player and their opponent: {1–1, 1–2, 2–1, 2–2}. Each value of $E$ corresponds to an outcome: {win, lose}. The opponent chooses hands randomly, so $H(E) = 1$ regardless of the strategy participants employ. As in Rock, Paper, Scissors, the value of $E$ is fully determined by the value of $M$: for players who choose odds, $E = \text{win}$ when $M \in$ {1–2, 2–1}, and $E = \text{lose}$ when $M \in$ {1–1, 2–2}, and for players who choose evens, $E = \text{win}$ when $M \in$ {1–1, 2–2}, and $E = \text{lose}$ when $M \in$ {1–2, 2–1}. Therefore, $H(E \mid M) = 0$, and $I(M;E) = 1$.
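The mutual information values for the two hand games can be checked numerically. The sketch below builds the joint distribution over means (hand combinations) and ends (outcomes) for a uniformly random opponent and computes $I(M;E) = H(E) - H(E \mid M)$ in bits. All function and variable names are illustrative.

```python
import math
from collections import defaultdict
from itertools import product


def mutual_information(joint):
    """I(M;E) in bits from a dict mapping (m, e) pairs to probabilities."""
    p_m, p_e = defaultdict(float), defaultdict(float)
    for (m, e), p in joint.items():
        p_m[m] += p
        p_e[e] += p
    h_e = -sum(p * math.log2(p) for p in p_e.values() if p > 0)
    h_e_given_m = -sum(p * math.log2(p / p_m[m])
                       for (m, e), p in joint.items() if p > 0)
    return h_e - h_e_given_m


def joint_distribution(player_probs, opponent_hands, outcome):
    """Joint P(m, e) when the opponent picks uniformly at random."""
    joint = defaultdict(float)
    for (hand, p_hand), opp in product(player_probs.items(), opponent_hands):
        joint[((hand, opp), outcome(hand, opp))] += p_hand / len(opponent_hands)
    return dict(joint)


def rps_outcome(player, opponent):
    beats = {"rock": "scissors", "paper": "rock", "scissors": "paper"}
    if player == opponent:
        return "draw"
    return "win" if beats[player] == opponent else "lose"


def odds_outcome(player, opponent):
    # Outcome for a player who chose "odds": win when the sum is odd.
    return "win" if (player + opponent) % 2 == 1 else "lose"


hands = ["rock", "paper", "scissors"]
uniform = {h: 1 / 3 for h in hands}
i_rps_random = mutual_information(joint_distribution(uniform, hands, rps_outcome))
i_rps_rock = mutual_information(joint_distribution({"rock": 1.0}, hands, rps_outcome))
i_odds = mutual_information(joint_distribution({1: 0.5, 2: 0.5}, [1, 2], odds_outcome))

print(round(i_rps_random, 2), round(i_rps_rock, 2), round(i_odds, 2))  # 1.58 1.58 1.0
```

The value for Rock, Paper, Scissors is the same whether the player randomizes or always plays rock, consistent with the claim that it does not depend on the player’s strategy.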
Reporting Summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
This work was funded by the Yale Center for Customer Insights, the Yale School of Management Behavioral Research Lab, and the Automaticity of Cognition, Motivation, and Emotion Lab at Yale University. D.E.M. thanks Lisa Feldman Barrett and Dana Brooks for their mentorship and support.
Source data
Author contributions
The theory was conceptualized by D.E.M. and subsequently refined by all authors. The experiments were conceptualized by all authors, and coded by D.E.M. Back-end code for data collection was developed by R.W.C. Analyses were performed by P.E.S. and D.E.M., figures were created by R.W.C. and D.E.M., and guidance on data interpretation and visualization was provided by all authors. The first draft of the manuscript was written by D.E.M., and subsequent writing and editing was performed by all authors.
Peer review
Peer review information
Nature Communications thanks Valérian Chambon, George Loewenstein, Kou Murayama and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Data availability
The data generated in this study have been deposited at OSF. Source data are provided with this paper.
Code availability
Custom code used for data preprocessing, analysis, and simulation is available at OSF. Custom code used for data collection is available at GitHub.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-022-29742-2.
References
- 1.Csikszentmihalyi, M. Flow: The psychology of optimal experience. (Harper & Row, 1990).
- 2.Csikszentmihalyi, M. Beyond boredom and anxiety. (Jossey-Bass Publishers, 1975).
- 3.Nakamura, J. & Csikszentmihalyi, M. The concept of flow. In Flow and the foundations of positive psychology (ed M. Csikszentmihalyi) 239–263 (Springer, 2014).
- 4.Engeser S, Rheinberg F, Vollmeyer R, Bischoff J. Motivation, flow-experience, and performance in learning settings at universities. Z. für Pädagogische Psychologie. 2005;19:159–172.
- 5.Nakamura, J. Optimales Erleben und die Nutzung der Begabung. In Die außergewöhnliche Erfahrung im Alltag. Die Psychologie des Flow-Erlebens (eds M. Csikszentmihalyi & I. S. Csikszentmihalyi) 326–334 (Klett-Cotta, 1991).
- 6.Schüler J. Arousal of flow experience in a learning setting and its effects on exam performance and affect. Z. für Pädagogische Psychologie. 2007;21:217–227.
- 7.Csikszentmihalyi, M., Rathunde, K. R. & Whalen, S. Talented teenagers: A longitudinal study of their development. (Cambridge University Press, 1993).
- 8.Jackson SA, Thomas PR, Marsh HW, Smethurst CJ. Relationships between flow, self-concept, psychological skills, and performance. J. Appl. sport Psychol. 2001;13:129–153.
- 9.Pates J, Karageorghis CI, Fryer R, Maynard I. Effects of asynchronous music on flow states and shooting performance among netball players. Psychol. Sport Exerc. 2003;4:415–427.
- 10.Csikszentmihalyi M. If we are so rich, why aren’t we happy? Am. psychologist. 1999;54:821–827.
- 11.Csikszentmihalyi M, LeFevre J. Optimal experience in work and leisure. J. Personal. Soc. Psychol. 1989;56:815–822. doi: 10.1037//0022-3514.56.5.815.
- 12.Harter, J. U. S. employee engagement holds steady in first half of 2021. https://www.gallup.com/workplace/352949/employee-engagement-holds-steady-first-half-2021.aspx (2021).
- 13.Kruglanski AW, et al. A structural model of intrinsic motivation: On the psychology of means-ends fusion. Psychological Rev. 2018;125:165–182. doi: 10.1037/rev0000095.
- 14.Szumowska E, Kruglanski AW. Curiosity as end and means. Curr. Opin. Behav. Sci. 2020;35:35–39.
- 15.Kruglanski, A. W. et al. 69–98 (Academic Press, 2015).
- 16.Woolley, K. & Fishbach, A. When intrinsic motivation and immediate rewards overlap. In The Motivation Cognition Interface (eds C. Kopetz & A. Fishbach) (Routledge, 2017).
- 17.Woolley K, Fishbach A. It’s about time: Earlier rewards increase intrinsic motivation. J. Personal. Soc. Psychol. 2018;114:877–890. doi: 10.1037/pspa0000116.
- 18.Salge, C., Glackin, C. & Polani, D. Empowerment–an introduction. In Guided Self-Organization: Inception (ed Mikhail P.) 67-114 (Springer, 2014).
- 19.Klyubin, A. S., Polani, D. & Nehaniv, C. L. in 2005 IEEE Congress on Evolutionary Computation. 128-135 (IEEE).
- 20.Shannon CE. A mathematical theory of communication. Bell Syst. Tech. J. 1948;27:379–423.
- 21.Mohamed, S. & Rezende, D. J. in Advances in neural information processing systems. 2125-2133.
- 22.Jung T, Polani D, Stone P. Empowerment for continuous agent—environment systems. Adapt. Behav. 2011;19:16–39.
- 23.Karl M, et al. Unsupervised real-time control through variational empowerment. arXiv Prepr. arXiv. 2017;1710:05101.
- 24.Gregor, K., Rezende, D. J. & Wierstra, D. Variational intrinsic control. arXiv preprint arXiv:1611.07507 (2016).
- 25.Tiomkin S, Polani D, Tishby N. Control capacity of partially observable dynamic systems in continuous time. arXiv Prepr. arXiv. 2017;1701:04984.
- 26.Qureshi AH, Boots B, Yip MC. Adversarial imitation via variational inverse reinforcement learning. arXiv Prepr. arXiv. 2018;1809:06404.
- 27.Vallacher RR, Wegner DM. What do people think they’re doing? Action identification and human behavior. Psychological Rev. 1987;94:3–15.
- 28.Hommel B, Müsseler J, Aschersleben G, Prinz W. The theory of event coding (TEC): A framework for perception and action planning. Behav. brain Sci. 2001;24:849–878. doi: 10.1017/s0140525x01000103.
- 29.Kim R, Seitz A, Feenstra H, Shams L. Testing assumptions of statistical learning: is it long-term and implicit? Neurosci. Lett. 2009;461:145–149. doi: 10.1016/j.neulet.2009.06.030.
- 30.Turk-Browne NB, Scholl BJ, Chun MM, Johnson MK. Neural evidence of statistical learning: Efficient detection of visual regularities without awareness. J. Cogn. Neurosci. 2009;21:1934–1945. doi: 10.1162/jocn.2009.21131.
- 31.Csikszentmihalyi, M. & Csikszentmihalyi, I. S. Optimal experience: Psychological studies of flow in consciousness. (Cambridge University Press, 1992).
- 32.van der Linden D, Tops M, Bakker AB. Go with the flow: A neuroscientific view on being fully engaged. Eur. J. Neurosci. 2020;53:947–963. doi: 10.1111/ejn.15014.
- 33.Wojtowicz, Z., Chater, N. & Loewenstein, G. Boredom and Flow: An Opportunity Cost Theory of Attention-Directing Motivational States. Available at SSRN 3339123 (2019).
- 34.Posner MI. Orienting of attention. Q. J. Exp. Psychol. 1980;32:3–25. doi: 10.1080/00335558008248231.
- 35.Posner MI, Snyder CR, Davidson BJ. Attention and the detection of signals. J. Exp. Psychol.: Gen. 1980;109:160–174.
- 36.West R, Murphy KJ, Armilio ML, Craik FI, Stuss DT. Lapses of intention and performance variability reveal age-related increases in fluctuations of executive control. Brain cognition. 2002;49:402–419. doi: 10.1006/brcg.2001.1507.
- 37.Stuss DT, Murphy KJ, Binns MA, Alexander MP. Staying on the job: the frontal lobes control individual performance variability. Brain. 2003;126:2363–2380. doi: 10.1093/brain/awg237.
- 38.Sonuga-Barke EJ, Castellanos FX. Spontaneous attentional fluctuations in impaired states and pathological conditions: a neurobiological hypothesis. Neurosci. Biobehav. Rev. 2007;31:977–986. doi: 10.1016/j.neubiorev.2007.02.005.
- 39.Esterman M, Noonan SK, Rosenberg M, DeGutis J. In the zone or zoning out? Tracking behavioral and neural fluctuations during sustained attention. Cereb. cortex. 2013;23:2712–2723. doi: 10.1093/cercor/bhs261.
- 40.Castellanos FX, Sonuga-Barke EJ, Milham MP, Tannock R. Characterizing cognition in ADHD: beyond executive dysfunction. Trends Cogn. Sci. 2006;10:117–123. doi: 10.1016/j.tics.2006.01.011.
- 41.Agrawal, M., Mattar, M. G., Cohen, J. D. & Daw, N. D. The temporal dynamics of opportunity costs: A normative account of cognitive fatigue and boredom. Psychological Review (in press).
- 42.Dayan P. Improving generalization for temporal difference learning: The successor representation. Neural Comput. 1993;5:613–624.
- 43.Sutton RS. Learning to predict by the methods of temporal differences. Mach. Learn. 1988;3:9–44.
- 44.Maes EJ, et al. Causal evidence supporting the proposal that dopamine transients function as temporal difference prediction errors. Nat. Neurosci. 2020;23:176–178. doi: 10.1038/s41593-019-0574-1.
- 45.Mistry P, Liljeholm M. Instrumental divergence and the value of control. Sci. Rep. 2016;6:1–10. doi: 10.1038/srep36295.
- 46.Liljeholm M, Wang S, Zhang J, O’Doherty JP. Neural correlates of the divergence of instrumental probability distributions. J. Neurosci. 2013;33:12519–12527. doi: 10.1523/JNEUROSCI.1353-13.2013.
- 47.Tolman EC. Cognitive maps in rats and men. Psychological Rev. 1948;55:189–208. doi: 10.1037/h0061626.
- 48.Dickinson, A. & Balleine, B. The role of learning in the operation of motivational systems. In Steven’s handbook of experimental psychology: Learning, motivation, and emotion (eds H. Pashler & R. Gallistel) 497-533 (John Wiley & Sons Inc., 2002).
- 49.Thorndike EL. A proof of the law of effect. Science. 1933;77:173–175. doi: 10.1126/science.77.1989.173-a.
- 50.Watson P, Pearson D, Wiers RW, Le Pelley ME. Prioritizing pleasure and pain: Attentional capture by reward-related and punishment-related stimuli. Curr. Opin. Behav. Sci. 2019;26:107–113.
- 51.Luce, R. D. Response times: Their role in inferring elementary mental organization. (Oxford University Press, 1986).
- 52.Ratcliff R, Voskuilen C, McKoon G. Internal and external sources of variability in perceptual decision-making. Psychological Rev. 2018;125:33–46. doi: 10.1037/rev0000080.
- 53.Ratcliff R, Thapar A, McKoon G. The effects of aging on reaction time in a signal detection task. Psychol. Aging. 2001;16:323–341.
- 54.Greenwald AG, Nosek BA, Banaji MR. Understanding and using the implicit association test: I. An improved scoring algorithm. J. Personal. Soc. Psychol. 2003;85:197–216. doi: 10.1037/0022-3514.85.2.197.
- 55.Itti L, Baldi P. Bayesian surprise attracts human attention. Vis. Res. 2009;49:1295–1306. doi: 10.1016/j.visres.2008.09.007.
- 56.Itti, L. & Baldi, P. F. In Advances in neural information processing systems. 547-554 (Citeseer).
- 57.Moore, M. E. & Sward, J. Introduction to the game industry. (Pearson Prentice Hall, 2006).
- 58.Krünitz, J. G. Vol. 242 (ed Carl Otto Hoffmann) (Ernst Litfaß, Berlin, 1858).
- 59.Chambon, V. & Haggard, P. Premotor or Ideomotor: How Does the Experience of Action Come About? In Action science: Foundations of an Emerging Discipline (eds W. Prinz, M. Beisert, & A. Herwig) 359-380 (MIT Press, 2013).
- 60.Chambon V, Haggard P. Sense of control depends on fluency of action selection, not motor performance. Cognition. 2012;125:441–451. doi: 10.1016/j.cognition.2012.07.011.
- 61.Niv Y. Learning task-state representations. Nat. Neurosci. 2019;22:1544–1553. doi: 10.1038/s41593-019-0470-8.
- 62.Gershman SJ, Norman KA, Niv Y. Discovering latent causes in reinforcement learning. Curr. Opin. Behav. Sci. 2015;5:43–50.
- 63.Fujita K, Carnevale JJ. Transcending temptation through abstraction: The role of construal level in self-control. Curr. Directions Psychological Sci. 2012;21:248–252.
- 64.Silver D, et al. Mastering the game of go without human knowledge. Nature. 2017;550:354–359. doi: 10.1038/nature24270.
- 65.Cushman, F. & Gershman, S. Editors’ Introduction: Computational Approaches to Social Cognition. Topics Cognitive Sci. 11, 281–298 (2019).
- 66.Rheinberg, F., Vollmeyer, R. & Engeser, S. Die Erfassung des Flow-Erlebens [Measuring flow experiences]. In Diagnostik von Motivation und Selbstkonzept. Test und Trends Vol. 2 (J. Stiensmeier-Pelster & F. Rheinberg eds) 261–279 (Hogrefe, 2003).