Skip to main content
. 2014 Jun 5;10(6):e1003640. doi: 10.1371/journal.pcbi.1003640

Figure 2. Belief and policy of an agent acting in a changing environment.

Figure 2

A The belief about the environmental state Inline graphic may influence the choice of the appetitive or aversive reaction. Only after the appetitive reaction, the agent gains new information about the true state of the environment. The belief Inline graphic and the agents knowledge about the transition probabilities of the environmental state combined with potentially new information determines the new belief Inline graphic. B The starting point of the arrows is a belief found by choosing the appetitive reaction once and receiving reward (green), punishment (red) or no reinforcement (blue). If the agent always chooses the aversive reaction thereafter, the belief drifts to the stationary state along the trajectories shown by the arrows. Possible belief states Inline graphic with Inline graphic can be represented as a point in the “belief space” (gray shaded triangle). C The regions in the belief space favoring the appetitive reaction (dark shading, upward arrow) over the aversive reaction (bright shading, downward arrow) depend on the policy and the costs of responding. The provident policy (lowest row) is biased towards the appetitive reaction. A larger cost for the aversive reaction than for the appetitive reaction (left column) decreases the region of the aversive reaction.