Skip to main content
. 2021 Aug 2;115(4):365–382. doi: 10.1007/s00422-021-00884-8

Fig. 4.

Fig. 4

Example learning curves. One example simulation for each model: σm2=9, ση2=16, α = 0.15 (for the Cashaback19 and Dhawale19 models), reward criterion = adaptive (mean), target amplitude = 4 σm. The top row shows the movement endpoint behavior that the TTC method uses to estimate exploration. Each movement endpoint is composed of an aim point, motor noise and possibly exploration. The bottom row shows the underlying aim points that are adjusted during learning. Non-successful trials are covered largely by successful trials because following failure, the aim point is not (or only slightly: Dhawale19) updated