2012 Jul 23;6:6. doi: 10.3389/fnbot.2012.00006

Figure 5.

Normalized external reward obtained by different learning agents during training over 40 episodes (1000 primitive actions) in the chain walk task.