Skip to main content
. 2020 Nov 23;117(47):29311–29320. doi: 10.1073/pnas.1912336117

Fig. 2.

Fig. 2.

Illustration of foraging task with latent dynamics and partially observable sensory data. The reward availability in each of the two boxes evolves according to a telegraph process, switching between available (red) and unavailable (blue), and colors give the animal an ambiguous sensory cue about the reward availability. The agent may travel between the locations of the two boxes. When a button is pushed to open a box, the agent receives any available reward.