2019 Mar 28;26(4):1099–1121. doi: 10.3758/s13423-018-1554-2

Table 7.

Group parameter estimates of the full reinforcement learning diffusion decision model

Parameter	M	SD	2.5% percentile	97.5% percentile
$ϕ (μ_{η^{+}})$	0.07	0.02	0.03	0.12
$σ_{η^{+}}$	0.75	0.15	0.50	1.09
$ϕ (μ_{η^{-}})$	0.08	0.02	0.05	0.14
$σ_{η^{-}}$	0.58	0.13	0.37	0.87
$\exp (μ_{v_{mod}})$	0.48	0.10	0.32	0.70
$σ_{v_{mod}}$	0.85	0.14	0.59	1.14
$\exp (μ_{v_{\max}})$	3.47	0.25	2.98	3.98
$σ_{v_{\max}}$	0.31	0.07	0.20	0.47
$μ_{a_{fixed}}$	1.00	0.20	0.62	1.39
$σ_{a_{fixed}}$	0.97	0.14	0.73	1.26
$μ_{a_{mod}}$	−0.010	0.006	−0.021	0.001
$σ_{a_{mod}}$	0.027	0.004	0.020	0.037
$μ_{T_{er}}$	0.76	0.03	0.71	0.81
$σ_{T_{er}}$	0.13	0.02	0.10	0.17

Note. The full reinforcement learning diffusion decision model had separate learning rates, $η^{+}$ and $η^{-}$, for positive and negative prediction errors; two parameters describing the non-linear mapping between the difference in values and the drift rate (a scaling parameter, $v_{mod}$, and an asymptote, $v_{\max}$); one fixed threshold parameter, $a_{fixed}$; one value-modulation parameter, $a_{mod}$; and one non-decision time, $T_{er}$. Note that $μ_{η^{+}}$, $μ_{η^{-}}$, $μ_{v_{mod}}$, and $μ_{v_{\max}}$ were transformed for interpretability, as shown by the ϕ(·) and exp(·) in the Parameter column.
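As a rough illustration of how the parameters in the note fit together, the following Python sketch implements a delta-rule update with separate learning rates and one possible sigmoidal value-to-drift mapping. The function names are hypothetical, and the functional form of the drift mapping (a logistic curve scaled to lie between $-v_{\max}$ and $v_{\max}$) is an assumption made for this sketch; the table itself does not specify it. The numeric values are the group means from the M column, used only as example inputs.

```python
import math


def update_value(q, reward, eta_pos, eta_neg):
    """Delta-rule update with separate learning rates.

    eta_pos is applied when the prediction error is positive,
    eta_neg otherwise (hypothetical helper, not the authors' code).
    """
    prediction_error = reward - q
    eta = eta_pos if prediction_error > 0 else eta_neg
    return q + eta * prediction_error


def drift_rate(value_difference, v_mod, v_max):
    """ASSUMED non-linear mapping from value difference to drift rate.

    v_mod scales the value difference; v_max is the asymptote, so the
    output is bounded in (-v_max, v_max) and equals 0 at a difference of 0.
    """
    return 2.0 * v_max / (1.0 + math.exp(-v_mod * value_difference)) - v_max


# Example inputs: group means from the M column of the table.
ETA_POS, ETA_NEG = 0.07, 0.08
V_MOD, V_MAX = 0.48, 3.47

q_after_gain = update_value(0.0, 1.0, ETA_POS, ETA_NEG)   # positive prediction error
q_after_loss = update_value(1.0, 0.0, ETA_POS, ETA_NEG)   # negative prediction error
v_small = drift_rate(1.0, V_MOD, V_MAX)                   # small value difference
v_large = drift_rate(1000.0, V_MOD, V_MAX)                # saturates near V_MAX
```

Under this assumed form, the drift rate grows roughly linearly for small value differences and levels off at the asymptote for large ones, which is the role the note assigns to $v_{mod}$ and $v_{\max}$.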