Expanding the Active Inference Landscape: More Intrinsic Motivations in the Perception-Action Loop

. 2018 Aug 30;12:45. doi: 10.3389/fnbot.2018.00045

This article	Friston et al. (2016b)	Note
$e_{t} \in E$		Actual environment states
${\hat{e}}_{t} \in \hat{E}$	s_t ∈ S	Estimated/modeled environment states
$s_{t} \in S$	o_t ∈ Ω	Actual/observed sensor or outcome values
${\hat{s}}_{t} \in \hat{S} = S$	o_t ∈ Ω	Estimated/modeled (usually future) sensor or outcome values. Note that the index τ instead of t often indicates an estimated future sensor value in Friston et al. (2015).
$a_{t} \in A$	u_t ∈ A	Actions
${\hat{a}}_{t} \in \hat{A} = A$	u_t ∈ Υ	Contemplated (usually future) actions
$m_{t} \in M$		Agent memory state
${\hat{a}}_{0 : \hat{T}}$	π,	action sequences
θ	θ	Generative model parameters
θ¹	A	Sensor dynamics param.
θ²	B	Environment dynamics param.
θ³	D	Initial environment state param.
ξ	η	Generative model hyperparam. or model parameter that subsumes all hyperparameters
ξ¹	a	sensor dynamics hyperparam.
ξ²	b	Environment dynamics hyperparam.
ξ³	d	Initial environment state hyperparam.
ξ^Γ	β	Precision hyperparam.
(ϕ, ϕ^Γ)	η	Variational param.
$ϕ^{E_{0 : \hat{T}}}$	s_0:T	Environment states variational param.
$q ({\hat{e}}_{τ} \| {\hat{a}}_{t : \hat{T}}, a_{0 : t - 1}, ϕ^{E_{τ}})$	${(s_{τ}^{π})}_{{\hat{e}}_{τ}}$	For each sequence of actions and for each timestep there is a parameter $s_{τ}^{π}$ . Since a categorical distribution is used, the parameter is a vector of probabilities whose entry ê_τ is equal to the probability of ê_τ if we set $\hat{E} = {1, \dots, \| \hat{E} \|}$
ϕ¹	a	Sensor dynamics variational param.
ϕ²	b	Environment dynamics variational param.
ϕ³	d	Initial environment state variational param.
π	π	Future action sequence variational param.
ϕ^Γ	β	Precision variational param.
$\hat{Q} ({\hat{a}}_{t : \hat{T}}, ϕ)$	−G(π)	Variational action-value function. The dependence of G(π) on $s_{0 : T}^{π}$ is omitted
p(s_≼t, e_≼t, a_≺t)	$R (õ, \tilde{s}, ã)$	Our physical environment corresponds to the generative process
$q ({\hat{s}}_{≼ t}, {\hat{e}}_{0 : \hat{T}}, {\hat{a}}_{0 : \hat{T}}, γ, θ, ξ)$	$P (õ, \tilde{s}, π, γ, A, B, D \| a, b, d, β)$	The generative model for active inference
$r ({\hat{e}}_{0 : \hat{T}}, {\hat{a}}_{0 : \hat{T}}, γ, θ \| π, ϕ^{Γ}, ϕ)$	$Q (\tilde{s}, π, A, B, D, γ \| s_{0 : \hat{T}}^{π}, π, a, b, d, β)$	Approximate complete posterior for active inference
$p^{d} ({\hat{s}}_{τ})$	P(o_τ) = σ(U_τ)	Prior over future outcomes.