Fig. 1.
State transition and reward structure in the two-step RL task. Each first-stage choice (black background) is predominantly associated with one or the other of the second-stage states (green and blue backgrounds) and leads there 70% of the time. These second-stage choices are probabilistically reinforced with money, whose reward probabilities change over the course of the experiment (see Results for a detailed explanation).
