Comparison of model fits. The table lists the proportion of variance explained (R2) and the Bayesian information criterion (BIC) for the tested models. The models were fitted to the individual data points in Fig. 4 by minimizing the least square error. In addition to the absolute BIC value, the BIC is also provided relative to the BIC for the Random model in which choices are drawn randomly. A lower BIC value indicates a more suitable model. RL: Reinforcement Learning. WSLS: Win-Stay Lose-Shift. The Updated Reinforcement Learning model (Fig. 5) is found to be the most suitable model by the BIC and accounts for most of the variance in the data.