Skip to main content
. 2020 Oct 11;46(3):614–621. doi: 10.1038/s41386-020-00881-8

Fig. 1. The 6-Armed bandit task.

Fig. 1

a. Trial structure of the 6-armed bandit task. On each trial, six bandit options were displayed. A number pad was used to select a single bandit and the selected bandit was outlined in white. Then, the reward value of the selected bandit on that trial was displayed onscreen. b. The hidden reward values during the first 20 trials of the 6-armed bandit task. On the y-axis are the values of each of the 6 bandit options per trial. Each bandit option began with an initial value of 50, and values for subsequent trials were randomly adjusted by a biased random walk. Only when a bandit is selected by the player is the bandit’s value revealed. The selections made by a single control participant are shown as black squares.