All belief trajectories were generated using prior values on the HGF parameters as shown in
Table 1. We simulated performances in six agents by changing the trial-to-trial difference in IKI values across keystroke positions, thus leading to different trajectories of
(
B) and feedback scores (
A). We started with a pattern of IKI values of [0.2, 0.6, 0.2, 0.6, 0.2, 0.6, 0.2] s and iteratively prolonged the inter-keystroke interval at positions 2, 4, and 6, thereby increasing the temporal difference between IKI values, the vector norm of the total IKI pattern, and the cvIKI value across keystroke positions within the trial, termed
. In the plot, steeper and shallower slopes of change across trials in
and associated feedback scores are denoted by green and pink colored lines, respectively. In addition, lighter colors denote smoother trial-by-trial transitions in
values. Darker colors indicate noisier trial-by-trial changes in this measure, representing an agent with a more variable behavioral strategy every trial. (
C, D) Expectation on reward and log-volatility, and (
E, F) the associated variance or estimation uncertainty. (
G,H) Precision-weighted prediction error on reward,
, and volatility,
. A steeper slope of change in feedback scores and
was associated with higher log-volatility estimates and reduced uncertainty about volatility,
. For a fixed slope, increasing levels of noise in the trajectories of the feedback scores and
also contributed to higher volatility estimates and reduced
. Thus, agents either (i) introducing more fluctuations in behavior from trial to trial or (ii) observing a faster rise in scores had a higher expectation of volatility and lower uncertainty about volatility.