2010 Aug 26;4:170. doi: 10.3389/fnhum.2010.00170

Figure 2.

The four-armed bandit task. Participants made repeated choices between four slot machines. Unlike standard slot machines, the mean pay-offs of the four machines changed gradually and independently from trial to trial (four colored lines). Participants were encouraged to earn as many points as possible during the task. A computational model of reinforcement learning was used to classify each choice as exploitative or exploratory.
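
The task structure described in this caption can be illustrated with a minimal simulation sketch. Everything specific here is an assumption made for illustration and is not taken from the study: the Gaussian random walk for the drifting mean pay-offs, the 0 to 100 point range, the delta-rule learner, the epsilon-greedy choice rule, and all parameter values. The function name `simulate_bandit` is hypothetical. A choice is labeled "exploit" when it matches the arm with the highest current value estimate and "explore" otherwise, which is one simple way a model-based classification could work, not necessarily the one the authors used.

```python
import random


def simulate_bandit(n_trials=200, n_arms=4, drift_sd=2.0, noise_sd=4.0,
                    learning_rate=0.3, epsilon=0.1, seed=0):
    """Simulate a drifting four-armed bandit with a simple learner.

    All parameter values are illustrative assumptions, not the study's.
    Returns a list of (trial, choice, payoff, label) tuples.
    """
    rng = random.Random(seed)
    # True mean pay-off of each machine (assumed range: 0 to 100 points).
    means = [rng.uniform(20.0, 80.0) for _ in range(n_arms)]
    # The learner's value estimate for each machine.
    values = [50.0] * n_arms
    records = []

    for trial in range(n_trials):
        # Arm with the highest estimated value before this choice.
        greedy_arm = max(range(n_arms), key=lambda a: values[a])

        # Epsilon-greedy choice rule (assumed; stands in for the real model).
        if rng.random() < epsilon:
            choice = rng.randrange(n_arms)
        else:
            choice = greedy_arm

        # Classify the choice relative to the learner's current estimates.
        label = "exploit" if choice == greedy_arm else "explore"

        # Noisy pay-off drawn around the chosen machine's current mean.
        payoff = rng.gauss(means[choice], noise_sd)

        # Delta-rule update of the chosen machine's value estimate.
        values[choice] += learning_rate * (payoff - values[choice])
        records.append((trial, choice, payoff, label))

        # Each mean drifts gradually and independently, clipped to the range.
        means = [min(100.0, max(0.0, m + rng.gauss(0.0, drift_sd)))
                 for m in means]

    return records


if __name__ == "__main__":
    results = simulate_bandit()
    n_explore = sum(1 for *_, label in results if label == "explore")
    print(f"exploratory choices: {n_explore} of {len(results)}")
```

Because the means drift, value estimates for machines that have not been sampled recently go stale, which is why occasional exploratory choices can pay off in this kind of task.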