|
|
Actual environment states |
|
st ∈ S
|
Estimated/modeled environment states |
|
ot ∈ Ω |
Actual/observed sensor or outcome values |
|
ot ∈ Ω |
Estimated/modeled (usually future) sensor or outcome values. Note that the index τ instead of t often indicates an estimated future sensor value in Friston et al. (2015). |
|
ut ∈ A
|
Actions |
|
ut ∈ Υ |
Contemplated (usually future) actions |
|
|
Agent memory state |
|
π, |
action sequences |
| θ |
θ |
Generative model parameters |
| θ1
|
A |
Sensor dynamics param. |
| θ2
|
B |
Environment dynamics param. |
| θ3
|
D |
Initial environment state param. |
| ξ |
η |
Generative model hyperparam. or model parameter that subsumes all hyperparameters |
| ξ1
|
a |
sensor dynamics hyperparam. |
| ξ2
|
b |
Environment dynamics hyperparam. |
| ξ3
|
d |
Initial environment state hyperparam. |
| ξΓ
|
β |
Precision hyperparam. |
| (ϕ, ϕΓ) |
η |
Variational param. |
|
s0:T
|
Environment states variational param. |
|
|
For each sequence of actions and for each timestep there is a parameter . Since a categorical distribution is used, the parameter is a vector of probabilities whose entry êτ is equal to the probability of êτ if we set
|
| ϕ1
|
a |
Sensor dynamics variational param. |
| ϕ2
|
b |
Environment dynamics variational param. |
| ϕ3
|
d |
Initial environment state variational param. |
| π |
π |
Future action sequence variational param. |
| ϕΓ
|
β |
Precision variational param. |
|
−G(π) |
Variational action-value function. The dependence of G(π) on is omitted |
| p(s≼t, e≼t, a≺t) |
|
Our physical environment corresponds to the generative process |
|
|
The generative model for active inference |
|
|
Approximate complete posterior for active inference |
|
P(oτ) = σ(Uτ) |
Prior over future outcomes. |