Figure 2:
The third step (M) of an incremental summary of active inference. In a generative model of action, state transitions are conditioned on policies $\pi$. Prior policy beliefs are informed by the baseline prior over policies ("model free," denoted $E$) and the expected free energy ($G$), which evaluates each policy-specific perception model (as in M) in terms of its expected risk and ambiguity. Risk biases the action model toward phenotype-congruent preferences ($C$). Posterior policy beliefs are informed by the fit between anticipated (policy-specific) and preferred outcomes, while at the same time minimizing the ambiguity of those outcomes.
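
The caption describes how risk and ambiguity combine into a prior over policies but does not give the functional form. The following is a minimal sketch, assuming the discrete-state form commonly used in the active inference literature: risk as the KL divergence between predicted and preferred outcomes, ambiguity as the expected entropy of the likelihood, and a policy prior proportional to exp(ln E - G). The likelihood matrix `A`, the predicted states `qs`, all numerical values, and the omission of a precision term are illustrative assumptions, not taken from the figure.

```python
import math

def predicted_outcomes(A, qs):
    # q(o | pi) = sum_s P(o | s) q(s | pi); A[o][s] is a hypothetical likelihood matrix
    return [sum(A[o][s] * qs[s] for s in range(len(qs))) for o in range(len(A))]

def kl(p, q):
    # KL[p || q]; assumes q is strictly positive wherever p is positive
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def entropy(p):
    return -sum(pi * math.log(pi) for pi in p if pi > 0)

def expected_free_energy(A, qs, C):
    # Risk: divergence between anticipated outcomes and preferred outcomes C
    risk = kl(predicted_outcomes(A, qs), C)
    # Ambiguity: expected entropy of outcomes given the predicted states
    ambiguity = sum(
        qs[s] * entropy([A[o][s] for o in range(len(A))])
        for s in range(len(qs))
    )
    return risk + ambiguity

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    z = sum(exps)
    return [e / z for e in exps]

def policy_prior(E, G):
    # Assumed form: P(pi) = softmax(ln E - G), with no precision parameter
    return softmax([math.log(e) - g for e, g in zip(E, G)])

# Illustrative values (assumptions): two outcomes, two states, two policies
A = [[0.9, 0.1],
     [0.1, 0.9]]                      # P(o | s)
C = [0.8, 0.2]                        # preferred outcomes, favoring outcome 0
E = [0.5, 0.5]                        # flat baseline prior over policies
qs_per_policy = [[1.0, 0.0],          # policy 0 is predicted to reach state 0
                 [0.0, 1.0]]          # policy 1 is predicted to reach state 1

G = [expected_free_energy(A, qs, C) for qs in qs_per_policy]
prior = policy_prior(E, G)
print("G:", G)
print("P(pi):", prior)
```

In this toy setting both policies are equally ambiguous, so the difference in expected free energy comes entirely from risk: the policy whose anticipated outcomes better match `C` receives the lower `G` and therefore the higher prior probability.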