Table 1.
Model variable | General definition | Model-specific specification |
---|---|---|
ot | Observable outcomes at time t | Outcome modalities:
|
st | Hidden states at time t | Hidden state factors:
|
π | A distribution over action policies encoding the expectation that a particular policy is most likely to generate preferred outcomes | Allowable policies included the decision to transition from the starting state to each of the 9 possible positions on the runway |
β | The prior on expected policy precision (β) is the “rate” parameter of a γ distribution, which is a standard distribution to use as a prior for expected precision. This latter term modulates the influence of expected free energy on policy selection | When β is high (reflecting low confidence about the best decision), policy selection becomes less deteriministic. Higher β values therefore encode participants’ decision uncertainty during the task (similar to the temperature parameter in a conventional softmax response function) |
A matrix P(ot | st) |
A matrix encoding beliefs about the relationship between hidden states and observable outcomes (i.e., the likelihood that specific outcomes will be observed given specific hidden states) | Encodes beliefs about the relationship between position on the runway and the probability of observing each outcome, conditional on beliefs about the task condition |
B matrix P(st + 1 | st) |
A matrix encoding beliefs about how hidden states will evolve over time (transition probabilities) | Encodes beliefs about the way participants could choose to move the avatar, as well as the belief that the task condition will not change within a trial |
C matrix In P(ot) |
A matrix encoding the degree to which some observed outcomes are preferred over others (technically modelled as prior expectations over outcomes) | Encodes stronger positive preferences for receiving higher numbers of points, and negative preferences for the aversive stimuli (both relative to an anchor value of 0 for the “safe” positive stimulus). The emotional conflict (EC) parameter in our model encoded the value of participants’ preferences against observing the aversive stimuli |
D matrix P(s1) |
A matrix encoding beliefs about (a probability distribution over) initial hidden states | The simulated agent always began in an initial starting state, and believed each task condition was stable across each trial |