. 2015 Aug 26;9:225. doi: 10.3389/fnbeh.2015.00225

Table 1.

Parameters from single-learning strategies are also present in dual-learning strategies excepts for θ, which disappears in entropy-based coordination and α which disappears in VPI-based selection.

Model	Symbol	Range	Description
Q-L only	α	0 < α < 1	Learning rate
	β	0 < β < 100	Softmax temperature
BWM only	N	1 < N < 10	Memory size
	θ	0 < θ < log\|A\|	Fixed entropy threshold
	ϵ	0 < ϵ < 0.1	Memory items decay
VPI-based selection	η	0.00001 < η < 0.001	Covariance initialization
	σ_r	0 < σ_r < 1	Reward rate update
Weight-based mixture	w₀	0 < w₀ < 1	Initial weight
Entropy-based coordination	λ₁, λ₂	0.00001 < λ_i < 1000	Sigmoide parameters
	β_final	0 < β_final < 100	Softmax temperature
	σ	0 < σ < 20	Simulated RT