Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2021 Jan 25;376(1820):20190752. doi: 10.1098/rstb.2019.0752

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© 2021 The Author(s)

Published by the Royal Society. All rights reserved.

PMC Copyright notice

Figure 2. — An extended HRL model integrating valence. Sensory inputs from the environment (exteroceptive) are evaluated against predictions about interoceptive and exteroceptive outcomes in an integrative field, which determines valence (advantage/harm) of incoming information. Internal state regulation further integrates these inputs by calculating allostatic load relative to meeting homeostatic setpoints and the metabolic cost of current and potential action. Based on the prediction errors resulting from this HRL-like learning scheme, together with valence and the reality of metabolic constraints, a policy for action is selected. Policy selection and resulting action are implemented by genetic and epigenetic regulatory networks. Action modifies the next round of exteroceptive sensory inputs the organism receives. The rounded rectangles represent higher-order functions (sensing, information integration, decision making, implementation, behaviour), while the ovals denote processes or products that feed into or arise from the higher-order functions.