Figure 6. Description of a model selecting action intensity.
(A) Details of the algorithm. The update rules for the variance parameters can be obtained by computing the derivatives of the objective function with respect to these parameters; to simplify the resulting expressions, we scale them, giving Equations 6.5. Such scaling does not change the values to which the variance parameters converge, because the scaling factors are positive. (B) Mapping of the algorithm onto the network architecture. Notation as in Figure 5B. This network is very similar to that shown in Figure 5B, but now the projection from the habit system to the output nuclei is weighted by its precision (to reflect the weighting factor in Equation 6.2), and the rate of decay (or relaxation to baseline) in the output nuclei also needs to depend on the precision parameters. One way to ensure that the prediction error in the goal-directed system is divided by its variance is to encode this variance in the rate of decay or leak of the prediction error neurons (Bogacz, 2017). Such decay is included as the last term of orange Equation 6.7, which describes the dynamics of the prediction error neurons. A prediction error evolving according to this equation converges to the value in orange Equation 6.3 (the equilibrium value can be found by setting the left-hand side of orange Equation 6.7 to 0 and solving for the prediction error). In Equation 6.7, the total reward was replaced, according to Equation 1.1, by the sum of the instantaneous reward and the available reward computed by the valuation system. (C) Dynamics of the model.
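As an illustrative sketch of the scaling described in panel A (using generic notation in the spirit of Bogacz, 2017, rather than the exact symbols of Equations 6.5), consider a single Gaussian term of an objective $F$ with observation $x$, prediction $\mu$, variance parameter $\nu$, and prediction error $\delta$:
\[
F = -\frac{(x-\mu)^{2}}{2\nu} - \frac{1}{2}\ln\nu + \mathrm{const},
\qquad
\delta = \frac{x-\mu}{\nu}.
\]
Differentiating with respect to the variance gives $\partial F/\partial\nu = \tfrac{1}{2}\left(\delta^{2} - 1/\nu\right)$; multiplying by the positive factor $2\nu$ yields the simpler update $\Delta\nu = \alpha\left(\nu\delta^{2} - 1\right)$. Both forms vanish at the same point, $\nu = (x-\mu)^{2}$ on average, which is why scaling by a positive quantity leaves the convergence value unchanged.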
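The need for the decay rate of the output nuclei to depend on the precision parameters (panel B) can be seen from a generic rate equation for a unit receiving precision-weighted inputs; the symbols here ($y$ for the output activity, $h$ and $g$ for the habit and goal-directed inputs, $\Pi_h$ and $\Pi_g$ for their precisions, $\tau$ for a time constant) are illustrative assumptions rather than the notation of Equation 6.2:
\[
\tau\,\dot{y} = \Pi_h h + \Pi_g g - \left(\Pi_h + \Pi_g\right) y
\quad\Longrightarrow\quad
y^{*} = \frac{\Pi_h h + \Pi_g g}{\Pi_h + \Pi_g}.
\]
Only when the leak term scales with the summed precisions does the equilibrium $y^{*}$ equal a precision-weighted combination of the two systems' outputs.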
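Similarly, the trick of encoding a variance in the leak of prediction error neurons (Bogacz, 2017) can be sketched with generic symbols ($\delta$ for the prediction error neuron's activity, $x-\mu$ for its driving input, $\nu$ for the variance, $\tau$ for a time constant), which stand in for, but are not, the exact terms of orange Equation 6.7:
\[
\tau\,\dot{\delta} = (x - \mu) - \nu\,\delta
\quad\Longrightarrow\quad
\delta^{*} = \frac{x-\mu}{\nu}.
\]
Setting $\dot{\delta} = 0$ and solving for $\delta$ gives the equilibrium $\delta^{*}$, a prediction error divided by its variance (i.e. weighted by its precision), mirroring how orange Equation 6.3 is obtained from orange Equation 6.7.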