Abstract
Recent research has demonstrated that mindfulness-based meditation facilitates basic aspects of cognition, including memory and attention. Further developing this line of inquiry, here we considered the possibility that similar effects may extend to another core psychological process—instrumental learning. To explore this matter, in combination with a probabilistic selection task, computational modelling (i.e., reinforcement drift diffusion model analysis) was adopted to establish whether and how brief mindfulness-based meditation influences learning under conditions of uncertainty (i.e., choices based on the perceived likelihood of positive and negative outcomes). Three effects were observed. Compared with performance in the control condition (i.e., no meditation), mindfulness-based meditation (1) accelerated the rate of learning following positive prediction errors; (2) elicited a preference for the exploration (vs. exploitation) of choice selections; and (3) increased response caution. Collectively, these findings elucidate the pathways through which brief meditative experiences impact learning and decision-making, with implications for interventions designed to debias aspects of social-cognitive functioning using mindfulness-based meditation.
Keywords: Probabilistic learning, mindfulness-based meditation, prediction errors, computational modelling
The benefits of an open mind are considerable. Receptive to the complexities of daily life, open minds display a willingness to consider competing viewpoints, entertain alternative perspectives, and process information in an even-handed manner. In contrast, when closed, minds tend to be dogmatic, cognitively inflexible, and over-reliant on pre-existing opinions, assumptions, and beliefs (Eagly et al., 1999; Nickerson, 1998; Price et al., 2015). Despite these downsides, it should be noted that closed minds can nevertheless offer consistency, cognitive economy, and clarity when multiple choice-related options are available. Practically speaking, these states of mind exert distinct effects on decisional processing (Smillie, 2017). Whereas open-minded cognition is amenable to the potential value of previously untried options/choices, closed-minded thinking is dominated by tried-and-tested decisions that have been successful in the past. As such, the primary advantage of an open mind is that it acknowledges the existence of rival possibilities, thereby optimising decision-making in various task environments. So, relatively speaking, when are minds more open than closed?
Reflecting a core dimension of personality (McCrae, 1993), people high (vs. low) in trait “openness to experience” exhibit many of the upsides of flexible cognition (DeYoung, 2013), in that they tend to be inquisitive, creative, and eager to explore the world and its myriad possibilities (Kaufman et al., 2016; McCrae & Costa, 1997; Silvia et al., 2008). For example, outside the laboratory, these individuals are more likely to visit galleries and museums, possess impressive literary collections, and have spent longer in formal education than their cognitively rigid peers (e.g., Carney et al., 2008; Chamorro-Premuzic et al., 2011; Gosling et al., 2002; Trapp & Ziegler, 2019; Van Eijck & De Graaf, 2004). In many (but not all) task settings, openness to experience is associated with curiosity, information seeking, and the development of personal knowledge (Smillie, 2017).
Pertinent to the current investigation, open-minded cognition arises from factors other than the possession of specific personality characteristics, with situational forces and temporary psychological states also playing a significant contributory role (Jach et al., 2022; Sutton & Barto, 1998). Notably, strategies that promote mindfulness have been shown to cultivate cognitive flexibility and an openness to new experiences. Practised for centuries, mindfulness is a major component of contemporary functional contextual therapies, such as the acceptance and commitment therapy (ACT; Hayes et al., 2012). Originating in eastern Buddhist philosophies (Hayes, 2002), this practice can take many forms, including guided meditation, mindful breathing, mindful movement, mindful eating, and progressive body scan meditation. Operationally, these techniques moderate the salience of internal/external experiences (Luoma et al., 2007) and have acknowledged effectiveness in reducing a range of psychological symptoms (e.g., anxiety, depression, and stress; Carmody & Baer, 2008; Chiesa & Serretti, 2009; Vøllestad et al., 2012). In addition, through the non-judgmental evaluation of present-moment thinking (Baer et al., 2006; Bishop et al., 2004; Brown & Ryan, 2003; Kabat-Zinn, 2003), mindfulness-based practices also impact core psychological processes. For example, even among novice meditators, as little as 5–10 min of experimentally induced mindfulness is sufficient to influence emotional appraisal, selective attention, action control, social perception, and self-construal (e.g., Erisman & Roemer, 2010; Farb et al., 2007; Golubickis et al., 2016, 2022; Jha et al., 2007; Papies et al., 2012, 2015; Tan et al., 2014).
By decreasing judgmental impulsivity and underscoring the value of novel experiences (Baer et al., 2006; Dixon et al., 2019; Hölzel et al., 2011), we suspect that mindfulness meditation may facilitate another fundamental psychological process—instrumental learning. 1 In settings in which competing options are readily available, decision makers are confronted with a commonplace dilemma. Is it better to make choices that have been rewarding in the past (i.e., exploitation) or instead choose novel selections of uncertain value (i.e., exploration)? For example, when deciding on a Thai restaurant for date night, is it better to visit establishments that have been delightful in the past or take a chance on an unknown (i.e., riskier) eatery that could be wonderful or disappointing (Cohen et al., 2007)? Although optimal learning (i.e., which are the tastiest Thai restaurants in one’s neighbourhood?) calls for a balance between these competing strategies, at least in the long run, exploration tends to facilitate the acquisition of knowledge (Sutton & Barto, 1998).
Together with a range of other factors, openness to new experience exerts influence on this basic decision-making dilemma. Whereas individuals high in openness are generally receptive to novel experiences and ideas (even when faced with uncertainty), which inclines them towards exploration; those lower in openness prioritise the comfort of familiar choices, thereby gravitating towards exploitation (i.e., recently rewarded outcomes dominate decision-making). If, therefore, mindfulness attunes people to the potential value of competing alternatives, an interesting possibility arises. Mindfulness may trigger a preference for exploration (vs. exploitation) when learning takes place under conditions of uncertainty (Sternberg, 2002). In addition, given that mindfulness has been demonstrated to promote response caution (i.e., careful decision-making), learning in challenging task environments may be accompanied by an increase in the evidential requirements of choice selection (Golubickis et al., 2023; van Vugt & Jha, 2011; van Vugt & van den Hurk, 2017). That is, when mindful, additional evidence may be required before a response is selected.
Given these observations, our objectives in the current investigation were twofold. First, the primary objective was to explore the effects of brief mindfulness-based meditation (vs. no meditation) on instrumental learning. Specifically, we sought to establish whether the process of adjusting one’s behaviour based on prior choice-related outcomes (i.e., learning) changes following guided meditative practice. To investigate this issue, a probabilistic selection task (PST) was adopted in which participants’ reinforcement learning (RL) abilities were probed under conditions of uncertainty (Frank et al., 2004, 2007). The PST has been widely (and successfully) used to examine how people learn from positive versus negative choice-related feedback (i.e., reinforcement). Following a brief period of mindfulness-based meditation (vs. no meditation), participants were presented with three different stimulus pairs (i.e., AB, CD, EF; see Figure 1) comprising Japanese Hiragana characters. Their task was to figure out, based on repeated choice selections, which symbol in each pairing was most likely to be correct (Frank et al., 2004). Crucially, the feedback provided after each selection was probabilistic and varied across the stimulus pairs (i.e., AB = 80%–20%, CD = 70%–30%, EF = 60%–40%). For example, on AB trials, choosing symbol A led to positive feedback (i.e., correct) on 80% of trials, whereas selecting symbol B led to positive reinforcement on only 20% of the trials. Thus, over numerous choices, participants learned which item in each stimulus pair was more likely to be correct (e.g., A, C, E rather than B, D, F). We hypothesised that a brief period of mindfulness-based meditation would enhance the rate of learning during the PST.
Figure 1.
Example of the stimulus pairs (i.e., Japanese Hiragana characters) and the probabilities of correct responses during the probabilistic selection task.
Our second objective was to discern the underlying mechanisms responsible for any observed effects of brief mindfulness meditation on learning. To identify these processes, computational modelling was undertaken on the data (Golubickis & Macrae, 2022). Despite a substantial literature demonstrating the benefits of mindfulness on cognition and behaviour, quite how these effects arise remains less certain (but see Hölzel et al., 2011; Lutz et al., 2008; Papies et al., 2015; Shapiro et al., 2006; Tang et al., 2007; Teper & Inzlicht, 2013). In this respect, computational approaches are valuable as they provide a mechanistic account of the cognitive operations through which meditative experiences modulate decisional processing (Golubickis et al., 2023; van Vugt et al., 2019; van Vugt & Jha, 2011; van Vugt & van den Hurk, 2017). Accordingly, based on recent analytical developments, a reinforcement learning drift diffusion model (RL-DDM) analysis was adopted (Fontanesi et al., 2019; Pedersen et al., 2017; Pedersen & Frank, 2020).
Integrating sequential sampling and RL models, the RL-DDM identifies the processes that underpin learning and how these are fine-tuned as learning progresses (Miletić et al., 2020; Pedersen & Frank, 2020; Ratcliff et al., 2016). While RL models are useful at predicting changes in the proportion of choice probabilities over the course of learning, they do not account for differences in response latencies, a fundamental dimension of learning (e.g., as learning takes place, decision times decrease). In this respect, sequential-sampling models (e.g., drift diffusion model [DDM]; Ratcliff et al., 2016; P. L. Smith & Ratcliff, 2004) are informative as they predict decision-making by elucidating how response selection and latencies collectively arise from a common set of latent cognitive processes (e.g., rate of evidence accumulation, response caution). In so doing, the RL-DDM is an extension of classic RL models that offers a more precise understanding of the processes through which learning unfolds over time (Fontanesi et al., 2019; Miletić et al., 2020; Pedersen et al., 2017; Pedersen & Frank, 2020).
Method
Participants and design
A total of 60 participants (42 females, 17 males, 1 other; Mage = 23.00, SD = 2.96), with normal or corrected-to-normal visual acuity, took part in the research. Of the 60 participants, 30 (14 in the control condition) reported having minimal meditative experience (e.g., used an app once or twice). In terms of educational background, 36 held bachelor’s degrees, 12 master’s degrees, 2 PhDs, and 10 reported having a high-school education. 2 Five participants (3 females, 2 males) failed to reach learning criteria (Frank et al., 2007), and thus were excluded from the analyses. Data collection was conducted online using Prolific Academic (www.prolific.co), with each participant receiving compensation at the rate of £8.00 (~US$10) per hour. Informed consent was obtained from participants prior to the commencement of the experiment and the protocol was reviewed and approved by the Ethics Committee at the School of Psychology, University of Aberdeen. The experiment had a single factor (Meditation: mindfulness or control) between-participants design. Based on prior research (Golubickis et al., 2023; Golubickis & Macrae, 2022), to detect a significant effect of meditation, a sample of 55 participants with a minimum of 60 learning trials afforded 86% power for a medium effect size (i.e., d = .50; PANGEA, v .0.2). This is a conservative estimation of the sample size as our analytic techniques (i.e., generalised estimating equations and RL-DDM) use hierarchical modelling. A key benefit of these approaches is that parameters are estimated reliably with a small number of experimental trials (Lerche et al., 2017; Wiecki et al., 2013). In addition, as models are fitted at the group-level rather than individually, larger participant numbers substantially improve parameter estimation.
Stimulus materials and procedure
The experiment was performed online via Inquisit Web. Once participants accessed the experiment through the web link, they were randomly assigned to either the mindfulness or control condition. In the mindfulness condition, using headphones, participants listened to a 5-min audio-recording of a mindful breathing exercise based on foundational mindfulness-based stress reduction programmes. During this exercise, attention was focused on the sensation of breathing, with awareness directed in a non-reactive and non-elaborate manner to the present moment (Lymeus et al., 2018; Malinowski, 2013; Zanesco et al., 2018). Participants were asked to listen attentively to the recording and to avoid distractions (Golubickis et al., 2016, 2023; Tan et al., 2014; Tan & Martin, 2013, 2015). If, however, these arose they were requested to perceive the episodes as fleeting experiences and to return attention to their breathing each time a distracting thought, emotion, or memory occurred (H. Smith & Novak, 2003). A bell chimed after 5 min to signal the end of the activity. Prior research has established the effectiveness of this brief intervention in increasing levels of mindful-attention and awareness (Tan et al., 2014). In the control condition, participants performed a 5-min Chinese puzzle task (i.e., Tangram) in which they constructed shapes using polygons (Golubickis et al., 2023).
Next, all participants performed the learning phase of a PST (Frank et al., 2004, 2007). In this task, they were required to learn which symbol was more likely to be correct across three different symbol pairs (denoted as AB, CD, and EF, see Figure 1). After each selection, feedback was provided which showed whether the response was correct or incorrect. The probabilities indicating which symbol was more likely to be correct followed the standard version of the PST (Frank et al., 2004, 2007). Specifically, for the AB pair, A was 80% likely to be correct (20% for B), for the CD pair, C was 70% likely to be correct (30% for D), and finally, for the EF pair, E was 60% likely to be correct (40% for F). Over numerous choice selections, participants learned which item in each pairing was more likely to be correct (i.e., A, C, E rather than B, D, F) based on the feedback provided. The learning task finished when participants reached sufficient levels of accuracy for each pairing (i.e., AB, 60% or above; CD, 55% or above; EF, 50% or above; Frank et al., 2004, 2007).
Each trial began with the presentation of a pair of symbols that remained on the screen until the participant made a response. After the participant selected one of the symbols, both textual (i.e., the word “correct” in green or “incorrect” in red) and auditory (i.e., a high-pitched beep for a correct response or a low-pitched beep for an incorrect response) feedback were provided for 1,000 ms, followed by a blank screen for 500 ms, after which the next trial commenced. Participants had to select a symbol by pressing the appropriate button on the keyboard (i.e., “A” for the symbol on the left side of the screen, “L” for the symbol on the right side of the screen). The symbols in each pair were equally likely to be presented on the left or right side of the screen and stimulus presentation was randomised. Following previous research, no practice or familiarisation period was provided before the PST to prevent the possibility that prior understanding of the task would influence learning (Frank et al., 2004). Participants completed blocks of 60 trials in which each of the three stimulus pairs appeared randomly, equally often, until accuracy reached a satisfactory level. The maximum number of learning blocks was set to six (i.e., 360 trials in total) if the participant did not reach satisfactory levels of accuracy earlier in the task (Frank et al., 2007). If the participant’s learning performance was not sufficient after six blocks of trials, they were excluded from the analyses. On completion of the experiment, participants were debriefed and thanked.
Computational analysis
The data from the PST were submitted to an RL-DDM analysis (Fontanesi et al., 2019; Pedersen et al., 2017; Pedersen & Frank, 2020). 3 The RL-DDM estimates latent parameters through the simultaneous hierarchical Bayesian modelling of response time (RT) and choice data. A scaling parameter (i.e., drift rate, vscaling) measures sensitivity to choice-related feedback by taking both the expected outcome and speed of evidence accumulation into account, such that higher values indicate confident learning, whereas lower values imply uncertainty regarding the anticipated outcome. Drift rate scaling is equivalent to the inverse temperature parameter in classic instrumental learning models and indicates the extent to which exploration or exploitation is favoured during decision-making (Cohen et al., 2007; Daw et al., 2006; Pedersen et al., 2017; Sutton & Barto, 1998). Sensitive to the task context (Behrens et al., 2007), a learning rate parameter (η)—ranging from zero to one—quantifies how quickly learning takes place, with larger values indicating the utilisation of current feedback (i.e., fast learning), and smaller values reflecting reduced updating from recently experienced outcomes (i.e., slow learning). In this regard, either a single learning rate (η) that captures all learning, or separate learning rates for negative and positive prediction errors (η− & η+, respectively) can be estimated (Miletić et al., 2020; Pedersen et al., 2017; Pedersen & Frank, 2020). Whereas negative prediction errors refer to outcomes that were worse than expected, positive prediction errors pertain to outcomes that were better than anticipated. In addition, the model also establishes how much evidence is needed before a decision is made (i.e., threshold separation, a), with larger (vs. smaller) values indicating greater response caution. Finally, the efficiency of non-decisional processes (e.g., stimulus encoding, response execution, t0) is also estimated.
In essence, RL models are applied to understand how feedback (i.e., reward or punishment) is utilised to update subjective expectations of associated outcomes and how, in turn, these revised beliefs guide behaviour (i.e., which response option will be selected in future trials; Fontanesi et al., 2019; Miletić et al., 2020; Pedersen & Frank, 2020). By applying the delta learning rule, the model describes the updating of the expected Q-value for a chosen option (e.g., positively reinforced symbol A) based on the scaled by learning rate (η) reward prediction error (i.e., the difference between observed and expected feedback) in the previous trial (Rescorla & Wagner, 1972; Watkins & Dayan, 1992, see Equation 1):
(1) |
Learning can occur from two sources of prediction error (i.e., positive and negative). Whereas learning from positive prediction errors (η+) happens when the reward is better than expected, negative prediction errors (η−) describe updating when the outcome is worse than expected (Frank et al., 2007; Schultz, 2016). In such instances, based on the prediction error that occurred in a given trial, the Q-value is updated using the extended delta rule formulation (Frank et al., 2007; see Equation 2):
(2) |
Two significant modifications characterise the RL-DDM compared with standard RL models. First, the typical choice rule for RL (i.e., softmax) is replaced by the Wiener diffusion process of the DDM (see Miletić et al., 2020; Pedersen et al., 2017; Pedersen & Frank, 2020). This is a crucial feature of the RL-DDM, as it affords the possibility to model choice and RT data simultaneously. Specifically, in the RL-DDM, the likelihood of each chosen option in a given trial and its associated RT are estimated using the standard DDM probability density function, the Wiener first passage time (wfpt) distribution (Navarro & Fuss, 2009; Pedersen & Frank, 2020). This function provides the probability of choosing option i in a given trial t and its observed RT rti,t (see Equation 3):
(3) |
Second, the algorithm that captures the learning of subjective expectation values (Q) from stimuli and actions (i.e., value-based approach) is integrated into the process of evidence accumulation (i.e., drift rate). Specifically, the RL-DDM formulates the drift rate (v) based on the difference between the expected value of positively (Qpositively-reinforced) and negatively (Qnegatively-reinforced) reinforced choices. To accommodate the way this knowledge is used, the RL-DDM allows an additional free scaling parameter to be estimated (i.e., drift rate scaling, vscaling). This scaling parameter is similar to the inverse temperature in the softmax choice rule and reflects the level of exploration/exploitation during learning (Pedersen & Frank, 2020), such that larger values reflect stronger exploitation of the option with the highest expected value. In the context of the current PST in which participants were presented with two stimuli on each trial and learned which one was most likely to be correct, exploration refers to the tendency to sample and consider both options, rather than consistently selecting the option that was previously associated with positive feedback (i.e., exploitation). See Equation 4:
(4) |
As with other sequential-sampling models, the RL-DDM assumes that evidence is gathered for each choice option (e.g., symbol A vs. symbol B) until a critical evidential threshold is reached, at which point a response is made. This response threshold is captured by the boundary separation (a) parameter, which reflects speed–accuracy trade-offs during decision-making. For example, if a conservative (vs. liberal) decision-making style (i.e., higher evidential requirements) is adopted, this would yield slower but more accurate responses. At the start of the PST, participants make initial guesses as the stimuli have not yet been reinforced, thus the difference in expected values between symbol pairings is extremely low (i.e., slow evidence accumulation due to high uncertainty). As participants start to receive feedback, via application of the delta learning rule (Rescorla & Wagner, 1972), the subjective Q-learning values of positively/negatively reinforced stimuli increase/decrease. The speed at which participants update the expected values is described by the learning rate (η) parameter. On a trial-by-trial basis, this knowledge (i.e., learning which symbol is correct, Q-value) is integrated into the drift rate such that over time the difference in expected values between reinforced options (ACE vs. BDF symbol pairings) increases (Pedersen & Frank, 2020). The larger the difference between positively and negatively reinforced options, the easier (i.e., faster and more accurate) choice selection becomes (i.e., fast information sampling).
Results
Behavioural analysis
Based on the summary data, all participants fully completed the mindfulness or control (i.e., 4–6 Tangram puzzles per 5 min) exercises. For the PST, the outliers, which were omitted from the analyses, were responses faster than 200 ms or slower than 4,000 ms (Frank et al., 2007) and comprised approximately 1.5% of the data. The mean latency of responses and learning performance were submitted to JASP (JASP Team, 2023) for an independent-samples (Meditation: mindfulness or control) t-test (two-tailed). No significant differences emerged on either measure, that is, decision time: Mmindfulness = 1,255 ms, SDmindfulness = 771 ms versus Mcontrol = 1,062 ms, SDcontrol = 332 ms, t(53) = −1.18, p = .243; learning performance: Mmindfulness = 73%, SDmindfulness = 7% versus Mcontrol = 74%, SDcontrol = 8%, t(53) = 0.25, p = .804.
To examine the data more closely, using generalised estimating equations (GEE), a trial-by-trial analysis was conducted on the first 30 responses in the PST to explore the effects of Trial, Meditation (i.e., mindfulness vs. control), and Stimulus Pair (i.e., AB, CD, EF) on learning performance (see Figure 2). Analyses were conducted using the R package “geepack” (Højsgaard et al., 2006). A main effect of Trial, Wald χ2(1) = 4.49, p = .034, and a significant Meditation × Stimulus Pair “CD” interaction, Wald χ2(1) = 6.11, p = .013, were observed. Follow-up analyses revealed different dynamics in the mindfulness (vs. control) condition. Specifically, the interaction with the CD pair was significant, indicating that the effect of mindfulness on learning performance was more pronounced in this stimulus pair compared with the reference AB pair, Wald χ2(1) = 6.11, p = .013. For the EF stimulus pair, Wald χ2(1) = 0.92, p = .338, and the interaction with Trial for both the CD, Wald χ2(1) = 2.64, p = .104, and EF pair, Wald χ2(1) = 0.85, p = .356, no significant effects were observed. These results indicate that the influence of brief mindfulness meditation on learning performance varied across the stimulus pairs, with a specific effect observed for the CD pairing relative to the reference pair (AB).
Figure 2.
Based on ceiling performance during the PST, learning curves over the first 30 trials as a function of Meditation and Stimulus Pair were calculated. The raw data points represent the observed learning outcomes, and the modelled curves reflect the results of the generalised estimating equations (GEE) analysis. The GEE analysis examines differences in learning trajectories across trials and experimental conditions.
Modelling analysis
To identify the processes underpinning probabilistic learning and decision-making, data were fitted to a cognitive model (i.e., RL-DDM; Pedersen et al., 2017; Pedersen & Frank, 2020). To estimate model parameters, an extension of the Bayesian hierarchical drift diffusion toolbox was adopted (Wiecki et al., 2013). Models were response-coded, such that the upper threshold corresponded to positively reinforced stimuli (i.e., symbols corresponding to the letters A, C, and E) and the lower threshold to negatively reinforced items (i.e., symbols corresponding to the letters B, D, and F; Pedersen & Frank, 2020). As calculating the likelihoods of the posterior parameter values using Bayes’ theorem would be intractable (Wiecki et al., 2013), the posterior distributions were modelled using a Markov chain Monte Carlo (MCMC) with 10,000 samples (including 1,000 burn) using the Python package PyMC (see Patil et al., 2010). The MCMC approach samples potential parameter values based on the parameter prior distributions and assesses the likelihood of each sample, such that each new iteration is compared with the previous one and sampled more frequently if it provides a better estimation for the parameter values (Geyer, 2011). Across thousands of iterations, the resulting parameter posterior distributions will converge around the true parameter values. Outliers (5% of the trials) were removed by the HDDM software (Ratcliff & Tuerlinckx, 2002; Wiecki et al., 2013). Two RL-DDM models were estimated for comparison (i.e., single vs. dual learning rate model). In the first model, only a single learning rate (η) was allowed to vary across the experimental conditions (i.e., mindfulness vs. control). This model examined whether there were differences in the speed of learning without considering the different types of prediction error. In contrast, in the second model, learning rates for negative and positive prediction errors (η− and η+, respectively) were estimated separately and were allowed to vary as a function of experimental condition. As such, this model considered whether learning in each of the conditions accelerated following positive or negative prediction errors. In both models, drift rate scaling (vscaling) and boundary separation (a) varied as a function of experimental condition.
Model comparison was performed using the deviance information criterion (DIC) as this approach is routinely adopted when comparing hierarchical Bayesian models (Spiegelhalter et al., 1998, 2002). Lower DIC values favour models with the highest likelihood and least number of parameters. This revealed better fit for the dual (DIC:16309) compared with the single (DIC: 16318) learning rate model. Examination of the posterior distributions (see Figure 3) indicated differences in learning rates for positive prediction errors (η+), drift rate scaling (vscaling), and threshold separation (a). Specifically, comparisons yielded strong evidence that learning rates were faster in the mindfulness condition for positive, pBayes(mindfulness > control) = .061, BF10 = 15, but not for negative, pBayes(mindfulness > control) = .377, BF10 = 2, prediction errors. There was also very strong evidence that drift rate scaling (vscaling) was smaller in the mindfulness condition, pBayes(mindfulness < control) = .013, BF10 = 76. Finally, for boundary separation (a), there was extremely strong evidence that additional decisional evidence was required in the mindfulness condition, pBayes(mindfulness > control) < .001, BF10 > 1,000, indicating that a brief period of meditation increased response caution. 4
Figure 3.
Posterior parameter distributions as a function of meditation for negative (η−) and positive (η+) learning rates, drift rate scaling (vscaling), and boundary separation (a).
Discussion
The current results demonstrated both the benefit of mindfulness-based meditation on probabilistic learning and the cognitive pathways through which this practice influenced decisional processing. Compared with their counterparts who performed a puzzle task (i.e., no meditation), participants who experienced a brief period of mindfulness-based meditation exhibited accelerated learning rates following positive (vs. negative) prediction errors. Interestingly, mindfulness-based meditation boosted performance for the medium probability stimulus pairing (i.e., CD), an effect that emerged during the early stages of the PST (see Figure 2). In terms of the operations underpinning decision-making, meditation increased response caution (i.e., additional evidence was required prior to response selection) and the tendency to explore (vs. exploit) choice selections (Frank et al., 2004, 2007). Collectively, these findings inform understanding of how brief mindfulness-based meditation influences instrumental learning.
Interestingly, both meditation and learning have been shown to modulate activity in the striatum, a core component of the dopaminergic system in the brain (Hagerty et al., 2013; Kjaer et al., 2002; McClure et al., 2003; Pagnoni et al., 2002). During learning, phasic bursts of dopamine act as a signal to orient people to novel or unexpected stimuli based on the discrepancy between the anticipated and actual outcome of an action (i.e., reward prediction error). Specifically, whereas dopamine levels spike following positive feedback, they correspondingly dip when feedback is negative (Schultz, 2016). In the current study, brief mindfulness-based meditation accelerated learning following positive (but not negative) prediction errors. That is, learning was enhanced when choice-related feedback was better than expected (i.e., choice selections were surprisingly correct). What this indicates is that mindfulness meditation increased participants’ responsivity to positive reward prediction errors, thereby accelerating the rate of learning during the PST.
On close inspection, the relationship between mindfulness and reward-based learning is complex, with processing influenced both by the specific requirements of the task at hand, situational constraints, and people’s levels of meditative expertise (e.g., Kirk & Montague, 2015; Kirk et al., 2019; Knytl & Opitz, 2019). Indeed, one intriguing possibility is that, at different dosages, mindfulness meditation may have the capacity either to enhance or dampen the signal value of reward prediction errors. Using a passive conditioning task, Kirk and Montague (2015) demonstrated attenuated reward prediction signals in the striatum among long-term meditators compared with matched controls. In follow-up research, this time testing participants undergoing an 8-week programme of mindfulness-based training, Kirk et al. (2019) similarly observed reduced responsivity to reward prediction errors during a conditioning task. As extensive meditative experience is associated with increased tonic dopamine levels (Kjaer et al., 2002), it is possible that the signal value of reward prediction errors is reduced for expert (vs. novice) mindfulness-based practitioners (Knytl & Opitz, 2019). In the current investigation, a brief period of mindfulness meditation accelerated the rate of learning following positive prediction errors. It remains to be seen, however, whether this effect would persist if participants experienced extended episodes of meditative training (i.e., multiple periods of brief mindfulness-based meditation may diminish the signal value of reward prediction errors). A useful task for future research will be to explore this issue.
In line with recent calls for computational approaches in psychiatry and meditation research (Hitchcock et al., 2023; Huys et al., 2016; van Vugt et al., 2019), the current investigation affirms the value of mathematical modelling. Unlike conventional statistical approaches, models of cognition (e.g., DDM) simultaneously analyse all the available data (i.e., full distributions of accurate and erroneous RTs as well as choices) to identify the latent psychological processes (e.g., response caution/bias, speed of information uptake) that underpin task performance (Evans & Wagenmakers, 2020; Miletić et al., 2020; Ratcliff et al., 2016). As such, this approach yields important insights into how mindfulness impacts cognition across different task contexts (e.g., meditation procedures, experimental paradigms). For instance, and corroborated by the current findings, an emerging observation is that, regardless of the task under consideration (e.g., attentional cueing, working memory, categorisation), mindfulness alters response caution (i.e., boundary separation, a; Golubickis et al., 2023; van Vugt & Jha, 2011; van Vugt & van den Hurk, 2017). Specifically, meditators change their decision-making style by requiring additional information to be accumulated before a judgement is made (i.e., conservative responding). Operating in this way, increased response caution may serve as one of the critical pathways through which the debiasing and de-centring effects of mindfulness-based mediation are realised (Golubickis et al., 2016, 2023; Hussain, 2015; Wells, 2005).
Importantly, the application of a computational model that combined two, characteristically independent, cognitive domains (i.e., instrumental learning and decision-making) allowed the decisional contaminants (e.g., speed–accuracy trade-off) that bias task performance to be controlled (Fontanesi et al., 2019; Miletić et al., 2020; Ratcliff et al., 2016). In so doing, the RL-DDM analysis explicated the cognitive pathways through which brief mindfulness-based mediation influenced instrumental learning. Moving forward, a similar approach could be applied to investigate other domains in which meditation impacts information processing. Take, for example, the common finding that mindfulness enhances attention (e.g., Jha et al., 2007; Semple et al., 2010; Valentine & Sweet, 1999). Paradigms that assess components of attention (e.g., selective attention, sustained vigilance, executive control) are also susceptible to a range of decisional biases (i.e., stimulus and response-driven processes; van Vugt et al., 2019). Crucially, however, such influences can be accounted for by implementing extensions of sequential sampling models that enable the specific attentional processes of interest to be estimated (e.g., shrinking spotlight diffusion model; Golubickis & Macrae, 2021; White & Curl, 2018; White et al., 2011).
The current study adopted a meditative practice that entails careful attention to one’s breathing and the cultivation of a non-judgmental awareness of thoughts and sensations (Chiesa, 2010; Hart, 1987). Notable psychological fingerprints of this practice include heightened meta-cognitive awareness, enhanced emotional regulation, and increased present-moment focus (Kok & Singer, 2017; Kropp & Sedlmeier, 2019). Our results extend these findings. Specifically, here we observed faster learning rates from positive prediction errors following brief meditation, suggesting heightened sensitivity to rewarding feedback during instrumental learning. In addition, we found that exploration was favoured by participants in the mindfulness condition, indicating their enhanced willingness to explore novel choice selections. Replicating prior research, participants also exhibited increased caution after mindfulness practice (Golubickis et al., 2023; van Vugt & Jha, 2011; van Vugt & van den Hurk, 2017). One possibility is that, via enhanced meta-cognitive awareness, diminished impulsivity increases the evidential requirements of response selection during decision-making.
Notwithstanding the reported findings, the current investigation is not without its limitations. Although revealing the effects of mindfulness on probabilistic learning, only a single meditative practice was adopted. Crucially, however, other mindfulness techniques, such as body scan and loving-kindness meditation have distinct psychological signatures, and thus may influence instrumental learning in different ways (Kok & Singer, 2017; Kropp & Sedlmeier, 2019). For instance, the body scan technique, which has been associated with increased self-compassion and enhanced emotional regulation (Kropp & Sedlmeier, 2019), may impact learning by influencing sensitivity to undesirable feedback (i.e., negative reinforcement). Relatedly, loving-kindness meditation, a practice allied with increased positivity and empathy (Kok & Singer, 2017) may influence learning through elevated responsivity to social rewards. Finally, also of interest is the issue of whether various mindfulness-based practices (e.g., guided vs. non-guided) exert comparable effects on response caution (i.e., evidential requirements of response selection). As different techniques wield sometimes divergent psychological effects, future research should explore how a range of meditative practices and dosages impact performance, potentially uncovering novel insights into the pathways through which mindfulness shapes decision-making and learning.
To establish the generalisability of the current findings, consideration should also be given to the effects of mindfulness-based meditation on other learning tasks. One such example is the probabilistic reward task in which participants learn responses that are associated with varying intermittent reinforcement schedules of monetary payoffs (Pizzagalli et al., 2005). Given alterations of the dopaminergic system following meditation (Hagerty et al., 2013; Kjaer et al., 2002; McClure et al., 2003; Pagnoni et al., 2002), it is possible that mindfulness may increase (or decrease) people’s sensitivity to reward incentives in such a task, thereby influencing the rate at which learning takes place. In addition, closer attention should be given to the characteristics of learning using paradigms which, unlike the PST, have been designed explicitly to assay the explore–exploit dilemma. For example, two/four-armed bandit tasks could be employed (Daw et al., 2006; Gershman, 2018). Finally, several psychiatric and neurological disorders, such as Parkinson’s disease, attention-deficit/hyperactivity disorder (ADHD), substance addiction, and schizophrenia, are associated with difficulties in RL (Frank et al., 2004; Huys et al., 2016; Maia & Frank, 2011). Future investigations should therefore explore whether mindfulness-based meditation can be used as an effective intervention in such cases.
In summary, probing the effects of mindfulness on RL, here we demonstrated the benefits of a brief period of mindfulness-based meditation on performance. Compared with their colleagues in the control condition, participants who underwent 5 min of mindfulness-based meditation displayed an accelerated rate of learning following positive reward prediction errors. In addition, learning was associated with greater response caution and a tendency to explore rather than exploit choice selections during decision-making (Frank et al., 2004, 2007). These findings confirm the value of computational approaches in elucidating the mechanisms through which mindfulness-based meditation influences instrumental learning.
Instrumental learning is a form of associative learning in which an individual’s behaviour is modified based on the consequences (i.e., outcomes) of prior actions (i.e., choices).
It was not possible to probe these individual differences in the current investigation. Accordingly, how exactly prior meditative experiences (and theoretically motivated individual differences) influence instrumental learning is an issue that merits future consideration.
The data for the experiment are publicly accessible at: https://osf.io/xud5c/. The code for the RL-DDM analysis was taken from: ttps://hddm.readthedocs.io/en/latest/demo_RLHDDMtutorial.html.
Bayesian p values quantify the degree to which the difference in the posterior distribution is consistent with the hypothesis. For example, a Bayesian p of .05 indicates that 95% of the posterior distribution supports the hypothesis that the parameter posteriors differ across the conditions. Bayes factors (i.e., BF10) were calculated following Marsman and Wagenmakers (2017) with values >10 indicating strong evidence for a comparison (Wagenmakers et al., 2011).
Footnotes
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding: The author(s) received no financial support for the research, authorship, and/or publication of this article.
ORCID iD: Marius Golubickis
https://orcid.org/0000-0001-6128-0331
Data accessibility statement:
The data from the present experiment are publicly available at the Open Science Framework website: https://osf.io/xud5c/
References
- Baer R. A., Smith G. T., Hopkis J., Krietemeyer J., Toney L. (2006). Using self-report assessment methods to explore facets of mindfulness. Assessment, 13, 27–45. [DOI] [PubMed] [Google Scholar]
- Behrens T. E., Woolrich M. W., Rushworth M. F. (2007). Learning the value of information in an uncertain world. Nature Neuroscience, 10, 1214–1221. [DOI] [PubMed] [Google Scholar]
- Bishop S. R., Lau M., Shapiro S., Carlson L., Anderson N. D., Carmody J., . . . Devins G. (2004). Mindfulness: A proposed operational definition. Clinical Psychology: Science and Practice, 11, 230–241. [Google Scholar]
- Brown K. W., Ryan R. M. (2003). The benefits of being present: Mindfulness and its role in psychological well-being. Journal of Personality and Social Psychology, 84, 822–848. [DOI] [PubMed] [Google Scholar]
- Carmody J., Baer R. A. (2008). Relationships between mindfulness practice and levels of mindfulness, medical and psychological symptoms and well-being in a mindfulness-based stress reduction program. Journal of Behavioral Medicine, 31, 23–33. [DOI] [PubMed] [Google Scholar]
- Carney D. R., Jost J. T., Gosling S. D., Potter J. (2008). The secret lives of liberals and conservatives: Personality profiles, interaction styles, and the things they leave behind. Political Psychology, 29, 807–840. [Google Scholar]
- Chamorro-Premuzic T., Swami V., Cermakova B. (2010). Individual differences in music consumption are predicted by uses of music and age rather than emotional intelligence, neuroticism, extraversion, or openness. Psychology of Music, 40, 285–300. [Google Scholar]
- Chiesa A. (2010). Vipassana meditation: Systematic review of current evidence. The Journal of Alternative and Complementary Medicine, 16, 37–46. [DOI] [PubMed] [Google Scholar]
- Chiesa A., Serretti A. (2009). Mindfulness-based stress reduction for stress management in healthy people: A review and meta-analysis. Journal of Alternative and Complementary Medicine, 15, 593–600. [DOI] [PubMed] [Google Scholar]
- Cohen J. D., McClure S. M., Yu A. J. (2007). Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences, 362, 933–942. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Daw N. D., O’Doherty J. P., Dayan P., Seymour B., Dolan R. J. (2006). Cortical substrates for exploratory decisions in humans. Nature, 441, 876–879. [DOI] [PMC free article] [PubMed] [Google Scholar]
- DeYoung C. G. (2013). The neuromodulator of exploration: A unifying theory of the role of dopamine in personality. Frontiers in Human Neuroscience, 7, Article 762. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dixon M. R., Paliliunas D., Belisle J., Speelman R. C., Gunnarsson K. F., Shaffer J. L. (2019). The effect of brief mindfulness training on momentary impulsivity. Journal of Contextual Behavioral Science, 11, 15–20. [Google Scholar]
- Eagly A. H., Chen S., Chaiken S., Shaw-Barnes K. (1999). The impact of attitudes on memory: An affair to remember. Psychological Bulletin, 125, 64–89. [DOI] [PubMed] [Google Scholar]
- Erisman S. M., Roemer L. (2010). A preliminary investigation of the effects of experimentally induced mindfulness on emotional responding to film clips. Emotion, 10, 72–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Evans N. J., Wagenmakers E. J. (2020). Evidence accumulation models: Current limitations and future directions. The Quantitative Methods for Psychology, 16, 73–90. [Google Scholar]
- Farb N. A., Segal Z. V., Mayberg H., Bean J., McKeon D., Fatima Z., Anderson A. K. (2007). Attending to the present: Mindfulness meditation reveals distinct neural modes of self-reference. Social Cognitive and Affective Neuroscience, 2, 313–322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fontanesi L., Gluth S., Spektor M. S., Rieskamp J. (2019). A reinforcement learning diffusion decision model for value-based decisions. Psychonomic Bulletin & Review, 26, 1099–1121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frank M. J., Moustafa A. A., Haughey H. M., Curran T., Hutchison K. E. (2007). Genetic triple dissociation reveals multiple roles for dopamine in reinforcement learning. Proceedings of the National Academy of Sciences, 104, 16311–16316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frank M. J., Seeberger L. C., O’Reilly R. C. (2004). By carrot or by stick: Cognitive reinforcement learning in parkinsonism. Science, 306, 1940–1943. [DOI] [PubMed] [Google Scholar]
- Gershman S. J. (2018). Deconstructing the human algorithms for exploration. Cognition, 173, 34–42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Geyer C. J. (2011). Introduction to Markov Chain Monte Carlo. In Brooks S., German A., Jones G., Men X.-L. (Eds.), Handbook of Markov Chain Monte Carlo (pp. 1–46). Chapman and Hall/CRC. [Google Scholar]
- Golubickis M., Macrae C. N. (2021). That’s me in the spotlight: Self-relevance modulates attentional breadth. Psychonomic Bulletin & Review, 28, 1915–1922. [DOI] [PubMed] [Google Scholar]
- Golubickis M., Macrae C. N. (2022). Sticky me: Self-relevance slows reinforcement learning. Cognition, 227, 105207. [DOI] [PubMed] [Google Scholar]
- Golubickis M., Tan L. B., Falbén J. K., Macrae C. N. (2016). The observing self: Diminishing egocentrism through brief mindfulness meditation. European Journal of Social Psychology, 46, 521–527. [Google Scholar]
- Golubickis M., Tan L. B., Saini S., Catterall K., Morozovaite A., Khasa S., Macrae C. N. (2023). Knock yourself out: Brief mindfulness-based meditation eliminates self-prioritization. Psychonomic Bulletin & Review, 30, 341–349. [DOI] [PubMed] [Google Scholar]
- Gosling S. D., Ko S. J., Mannarelli T., Morris M. E. (2002). A room with a cue: Personality judgments based on offices and bedrooms. Journal of Personality and Social Psychology, 82, 379–398. [DOI] [PubMed] [Google Scholar]
- Højsgaard S., Halekoh U., Yan J. (2006). The R package geepack for generalized estimating equations. Journal of Statistical Software, 15, 1–11. [Google Scholar]
- Hagerty M. R., Isaacs J., Brasington L., Shupe L., Fetz E. E., Cramer S. C. (2013). Case study of ecstatic meditation: fMRI and EEG evidence of self-stimulating a reward system. Neural Plasticity, 2013, 653572. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hart W. (1987). The art of living: Vipassana meditation as taught by S.N. Goenka. Pariyatti. [Google Scholar]
- Hayes S. C. (2002). Buddhism and acceptance and commitment therapy. Cognitive and Behavioral Practice, 9, 58–66. [Google Scholar]
- Hayes S. C., Strosahl K. D., Wilson K. G. (2012). Acceptance and commitment therapy (2nd ed.). The Guilford Press. [Google Scholar]
- Hitchcock P. F., Britton W. B., Mehta K. P., Frank M. J. (2023). Self-judgment dissected: A computational modeling analysis of self-referential processing and its relationship to trait mindfulness facets and depression symptoms. Cognitive, Affective, & Behavioral Neuroscience, 23(1), 171–189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hölzel B. K., Lazar S. W., Gard T., Schuman-Olivier Z., Vago D. R., Ott U. (2011). How does mindfulness meditation work? Proposing mechanisms of action from a conceptual and neural perspective. Perspectives on Psychological Science, 6, 537–559. [DOI] [PubMed] [Google Scholar]
- Hussain D. (2015). Meta-cognition in mindfulness: A conceptual analysis. Psychological Thought, 8, 132–141. [Google Scholar]
- Huys Q. J., Maia T. V., Frank M. J. (2016). Computational psychiatry as a bridge from neuroscience to clinical applications. Nature Neuroscience, 19, 404–413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jach H. K., DeYoung C. G., Smillie L. D. (2022). Why do people seek information? The role of personality traits and situation perception. Journal of Experimental Psychology: General, 151, 934–959. [DOI] [PubMed] [Google Scholar]
- JASP Team. (2023). JASP (Version 0.17.3) [Computer software]. https://jasp-stats.org/
- Jha A. P., Krompinger J., Baime M. J. (2007). Mindfulness training modifies subsystems of attention. Cognitive, Affective, & Behavioral Neuroscience, 7, 109–119. [DOI] [PubMed] [Google Scholar]
- Kabat-Zinn J. (2003). Mindfulness-based interventions in context: Past, present, and future. Clinical Psychology: Science and Practice, 10, 144–156. [Google Scholar]
- Kaufman S. B., Quilty L. C., Grazioplene R. G., Hirsh J. B., Gray J. R., Peterson J. B., DeYoung C. G. (2016). Openness to experience and intellect differentially predict creative achievement in the arts and sciences. Journal of Personality, 84, 248–258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirk U., Montague P. R. (2015). Mindfulness meditation modulates reward prediction errors in a passive conditioning task. Frontiers in Psychology, 6, 90–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirk U., Pagnoni G., Hétu S., Montague R. (2019). Short-term mindfulness practice attenuates reward prediction errors signals in the brain. Scientific Reports, 9, 1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kjaer T. W., Bertelsen C., Piccini P., Brooks D., Alving J., Lou H. C. (2002). Increased dopamine tone during meditation-induced change of consciousness. Cognitive Brain Research, 13, 255–259. [DOI] [PubMed] [Google Scholar]
- Knytl P., Opitz B. (2019). Meditation experience predicts negative reinforcement learning and is associated with attenuated FRN amplitude. Cognitive, Affective & Behavioral Neuroscience, 19, 268–282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kok B. E., Singer T. (2017). Phenomenological fingerprints of four meditations: Differential state changes in affect, mind-wandering, meta-cognition, and interoception before and after daily practice across 9 months of training. Mindfulness, 8, 218–231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kropp A., Sedlmeier P. (2019). What makes mindfulness-based interventions effective? An examination of common components. Mindfulness, 10, 2060–2072. [Google Scholar]
- Lerche V., Voss A., Nagler M. (2017). How many trials are required for parameter estimation in diffusion modeling? A comparison of different optimization criteria. Behavior Research Methods, 49, 513–537. [DOI] [PubMed] [Google Scholar]
- Luoma J. B., Hayes S. C., Walser R. D. (2007). Learning ACT. New Harbinger Publications. [Google Scholar]
- Lutz A., Slagter H. A., Dunne J. D., Davidson R. J. (2008). Attention regulation and monitoring in meditation. Trends in Cognitive Sciences, 12, 163–169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lymeus F., Lindberg P., Hartig T. (2018). Building mindfulness bottom-up: Meditation in natural settings supports open monitoring and attention restoration. Consciousness and Cognition, 59, 40–56. [DOI] [PubMed] [Google Scholar]
- Maia T. V., Frank M. J. (2011). From reinforcement learning models to psychiatric and neurological disorders. Nature Neuroscience, 14, 154–162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Malinowski P. (2013). Neural mechanisms of attentional control in mindfulness meditation. Frontiers in Neuroscience, 7, Article 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marsman M., Wagenmakers E.-J. (2017). Three insights from a Bayesian interpretation of the one-sided p value. Educational and Psychological Measurement, 77, 529–539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McClure S. M., Berns G. S., Montague P. R. (2003). Temporal prediction errors in a passive learning task activate human striatum. Neuron, 38, 339–346. [DOI] [PubMed] [Google Scholar]
- McCrae R. R. (1993). Openness to experience as a basic dimension of personality. Imagination, Cognition and Personality, 13, 39–55. [Google Scholar]
- McCrae R. R., Costa P. T., Jr. (1997). Personality trait structure as a human universal. American Psychologist, 52, 509–516. [DOI] [PubMed] [Google Scholar]
- Miletić S., Boag R. J., Forstmann B. U. (2020). Mutual benefits: Combining reinforcement learning with sequential sampling models. Neuropsychologia, 136, 107261. [DOI] [PubMed] [Google Scholar]
- Navarro D. J., Fuss I. G. (2009). Fast and accurate calculations for first-passage times in Wiener diffusion models. Journal of Mathematical Psychology, 53, 222–230. [Google Scholar]
- Nickerson R. S. (1998). Confirmation bias: A ubiquitous phenomenon in many guises. Review of General Psychology, 2, 175–220. [Google Scholar]
- Pagnoni G., Zink C. F., Montague P. R., Berns G. S. (2002). Activity in human ventral striatum locked to errors of reward prediction. Nature Neuroscience, 5, 97–98. [DOI] [PubMed] [Google Scholar]
- Papies E. K., Barsalou L. W., Custers R. (2012). Mindful attention prevents mindless impulses. Social Psychological and Personality Science, 3, 291–299. [Google Scholar]
- Papies E. K., Pronk T. M., Keesman M., Barsalou L. W. (2015). The benefits of simply observing: Mindful attention modulates the link between motivation and behavior. Journal of Personality and Social Psychology, 108, 148–170. [DOI] [PubMed] [Google Scholar]
- Patil A., Huard D., Fonnesbeck C. J. (2010). PyMC: Bayesian stochastic modelling in Python. Journal of Statistical Software, 35, 1–81. [PMC free article] [PubMed] [Google Scholar]
- Pedersen M. L., Frank M. J. (2020). Simultaneous hierarchical Bayesian parameter estimation for reinforcement learning and drift diffusion models: A tutorial and links to neural data. Computational Brain & Behavior, 3, 458–471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pedersen M. L., Frank M. J., Biele G. (2017). The drift diffusion model as the choice rule in reinforcement learning. Psychonomic Bulletin & Review, 24, 1234–1251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pizzagalli D. A., Jahn A. L., O’Shea J. P. (2005). Toward an objective characterization of an anhedonic phenotype: A signal-detection approach. Biological Psychiatry, 57, 319–327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Price E., Ottati V., Wilson C., Kim S. (2015). Open-minded cognition. Personality and Social Psychology Bulletin, 41, 1488–1504. [DOI] [PubMed] [Google Scholar]
- Ratcliff R., Smith P. L., Brown S. D., McKoon G. (2016). Diffusion decision model: Current issues and history. Trends in Cognitive Sciences, 20, 260–281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ratcliff R., Tuerlinckx F. (2002). Estimating parameters of the diffusion model: Approaches to dealing with contaminant reaction times and parameter variability. Psychonomic Bulletin & Review, 9, 438–481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rescorla R. A., Wagner A. R. (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In Black A. H., Prokasy W. F. (Eds.), Classical conditioning II: Current research and theory (pp. 64–99). Appleton-Century-Crofts. [Google Scholar]
- Schultz W. (2016). Dopamine reward prediction error coding. Dialogues in Clinical Neuroscience, 18, 23–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Semple R. J., Lee J., Rosa D., Miller L. F. (2010). A randomized trial of mindfulness-based cognitive therapy for children: Promoting mindful attention to enhance social-emotional resiliency in children. Journal of Child and Family Studies, 19, 218–229. [Google Scholar]
- Shapiro S. L., Carlson L. E., Astin J. A., Freedman B. (2006). Mechanisms of mindfulness. Journal of Clinical Psychology, 62, 373–386. [DOI] [PubMed] [Google Scholar]
- Silvia P. J., Winterstein B. P., Willse J. T., Barona C. M., Cram J. T., Hess K. I., . . . Richard C. A. (2008). Assessing creativity with divergent thinking tasks: Exploring the reliability and validity of new subjective scoring methods. Psychology of Aesthetics, Creativity, and the Arts, 2, 68–85. [Google Scholar]
- Smillie L. (2017). Openness to experience: The gates of the mind. Scientific American Mind, 28, 15–18. [Google Scholar]
- Smith H., Novak P. (2003). Buddhism: A concise introduction. Harper Collins. [Google Scholar]
- Smith P. L., Ratcliff R. (2004). Psychology and neurobiology of simple decisions. Trends in Neurosciences, 27, 161–168. [DOI] [PubMed] [Google Scholar]
- Spiegelhalter D. J., Best N. G., Carlin B. P., Van der Linde A. (1998). Bayesian deviance, the effective number of parameters, and the comparison of arbitrarily complex models (Research Report No. 98-1009). University of Minnesota. [Google Scholar]
- Spiegelhalter D. J., Best N. G., Carlin B. P., Van Der Linde A. (2002). Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B, 64, 583–639. [Google Scholar]
- Sternberg R. J. (2002). Images of mindfulness. Journal of Social Issues, 56, 11–26. [Google Scholar]
- Sutton R. S., Barto A. G. (1998). Reinforcement learning: An introduction. MIT Press. [Google Scholar]
- Tan L. B. G., Lo B. C., Macrae C. N. (2014). Brief mindfulness meditation improves mental state attribution and empathizing. PLoS ONE, 9, Article e110510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tan L., Martin G. (2013). Taming the adolescent mind: Preliminary report of a mindfulness-based psychological intervention for adolescents with clinical heterogeneous mental health diagnoses. Clinical Child Psychology and Psychiatry, 18(2), 300–312. [DOI] [PubMed] [Google Scholar]
- Tan L. B. G., Martin G. (2015). Taming the adolescent mind: A randomised controlled trial examining clinical efficacy of an adolescent mindfulness-based group programme. Child and Adolescent Mental Health, 20, 49–55. [DOI] [PubMed] [Google Scholar]
- Tang Y. Y., Ma Y., Wang J., Fan Y., Feng S., Lu Q., . . .Posner M. I. (2007). Short-term meditation training improves attention and self-regulation. Proceedings of the National Academy of Sciences, 104, 17152–17156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Teper R., Inzlicht M. (2013). Meditation, mindfulness, and executive control: The importance of emotional acceptance and brain-based performance monitoring. Social Cognitive and Affective Neuroscience, 8, 85–92. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trapp S., Ziegler M. (2019). How openness enriches the environment: Read more. Frontiers in Psychology, 10, Article 1123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vøllestad J., Nielsen M. B., Nielsen G. H. (2012). Mindfulness- and acceptance-based interventions for anxiety disorders: A systematic review and meta-analysis. British Journal of Clinical Psychology, 51, 239–260. [DOI] [PubMed] [Google Scholar]
- Valentine E. R., Sweet P. L. G. (1999). Meditation and attention: A comparison of the effects of concentrative and mindfulness meditation on sustained attention. Mental Health, Religion & Culture, 2, 59–70. [Google Scholar]
- Van Eijck K., De Graaf P. M. (2004). The Big Five at school: The impact of personality on educational attainment. Netherlands’ Journal of Social Sciences, 40, 24–40. [Google Scholar]
- van Vugt M. K., Jha A. P. (2011). Investigating the effect of mindfulness meditation training on working memory: A mathematical modelling approach. Cognitive, Affective, & Behavioral Neuroscience, 11, 344–353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Vugt M. K., Moye A., Sivakumar S. (2019). Computational modelling approaches to meditation research: Why should we care? Current Opinion in Psychology, 28, 49–53. [DOI] [PubMed] [Google Scholar]
- van Vugt M. K., van den Hurk P. M. (2017). Modeling the effects of attentional cueing on meditators. Mindfulness, 8, 38–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wagenmakers E.-J., Wetzels R., Borsboom D., van der Maas H. L. J. (2011). Why psychologists must change the way they analyze their data: The case of psi: Comment on Bem (2011). Journal of Personality and Social Psychology, 100, 426–432. [DOI] [PubMed] [Google Scholar]
- Watkins C., Dayan P. (1992). Q-learning. Machine Learning, 8, 279–292. [Google Scholar]
- Wells A. (2005). Detached mindfulness in cognitive therapy: A metacognitive analysis and ten techniques. Journal of Rational-Emotive & Cognitive-Behavior Therapy, 23, 337–355. [Google Scholar]
- White C. N., Curl R. (2018). Cueing effects in the attentional network test: A spotlight diffusion model analysis. Computational Brain & Behavior, 1, 59–68. [Google Scholar]
- White C. N., Ratcliff R., Starns J. J. (2011). Diffusion models of the flanker task: Discrete versus gradual attentional selection. Cognitive Psychology, 63, 210–238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiecki T. V., Sofer I., Frank M. J. (2013). HDDM: Hierarchical Bayesian estimation of the drift-diffusion model in Python. Frontiers in Neuroinformatics, 7, Article 14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zanesco A. P., King B. G., MacLean K. A., Saron C. D. (2018). Cognitive aging and long-term maintenance of attentional improvements following meditation training. Journal of Cognitive Enhancement, 2, 259–275. [Google Scholar]