Skip to main content
. 2010 Dec 14;4:184. doi: 10.3389/fnbeh.2010.00184

Figure 1.

Figure 1

A reinforcement learning state-space with an option to precommit to a larger-later reward. This is the task on which the model was run. Time is schematically portrayed along the horizontal axis. The dashed box encloses the state-space for a simple two-alternative choice. From state C, the agent could choose a large reward RL following a long delay DL, or a small reward RS following a short delay DS. Preceding this simple choice was the option to precommit. From state P, the agent could either enter the simple choice by choosing C, or to precommit to the large delayed choice by choosing N. A precommitment delay DC separated P from the subsequent state.