Figure 6. Tree representation of the reversal learning task, used in [27], and the behavioural results.
(A) When each trial begins, one of the two stimuli, or
, is presented in random on a screen. The subject can then choose whether to touch the screen (
action) or not (
action). The task is performed in three phases: training, reversal, and extinction. During the training phase, the subject will receive a reward if the stimulus
is presented and the action
is performed by the subject, or if the stimulus
is presented and the action
is selected (
). During the reversal phase, the reward function is reversed, meaning that the
action must be chosen when the stimulus
is presented, and vice versa (
). Finally, during the extinction phase, regardless of the presented stimulus, only the
action leads to a reward (
). (B) During both the training and reversal phases, subjects' reaction time is high at the early stages when they don't have enough experience with the new conditions yet. However, after some trials, the reaction time declines significantly. Error bars represent
.