Table 2.
Comparison of reinforcement-learning model fits for choice behavior under the deterministic and probabilistic schedules of reinforcement
Schedule | Age | Forgetting reinforcement learning model | Q-learning model with one learning rate | Q-learning model with two learning rates |
---|---|---|---|---|
| ||||
Deterministic | 35 | 48,893 | 53,096 | 52,131 |
55 | 46,501 | 50,553 | 49,745 | |
75 | 43,048 | 46,585 | 45,964 | |
120 | 65,197 | 70,055 | 69,159 | |
Probabilistic | 35 | 51,216 | 55,615 | 53,573 |
55 | 55,402 | 62,625 | 58,380 | |
75 | 46,257 | 49,098 | 47,983 | |
120 | 86,136 | 91,991 | 89,169 |
Values presented are the sum of the BIC. Bold values are those with the lowest BIC