2008 Jul 23;3(7):e2756. doi: 10.1371/journal.pone.0002756

Figure 4. Underlying state-space of the cued alternation task.

Each quadrant consists of eight states which differ only in the connections into the red and blue states and out of the yellow and magenta states. The quadrants are identified by letter pairs that correspond to the rewarded actions in each type of trial in the original task. The pair (L,R), for instance, means that if the green or cyan stimulus were presented, the agent would be rewarded for selecting the “L” or “R” action, respectively. Red arrows indicate transitions producing negative rewards; blue arrows indicate transitions with positive rewards.
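
The letter-pair labelling described in the caption can be illustrated with a minimal sketch. It is not taken from the paper: it assumes the four quadrants correspond to all ordered pairs over the actions "L" and "R", it uses hypothetical reward magnitudes of +1 and -1, and it does not model the eight states inside each quadrant. The names `QUADRANTS`, `rewarded_action`, and `reward` are illustrative only.

```python
from itertools import product

ACTIONS = ("L", "R")
STIMULI = ("green", "cyan")  # first and second entry of each letter pair

# Assumption: the four quadrants are all ordered pairs over {L, R}.
QUADRANTS = list(product(ACTIONS, repeat=2))


def rewarded_action(quadrant, stimulus):
    """Return the action that the quadrant's letter pair rewards for a stimulus."""
    return quadrant[STIMULI.index(stimulus)]


def reward(quadrant, stimulus, action):
    """Return a placeholder reward: +1.0 for the rewarded action, -1.0 otherwise.

    The magnitudes are assumptions; the caption only states the sign.
    """
    return 1.0 if action == rewarded_action(quadrant, stimulus) else -1.0
```

For the pair (L,R), `rewarded_action(("L", "R"), "green")` gives "L" and `rewarded_action(("L", "R"), "cyan")` gives "R", matching the example in the caption.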