Skip to main content
. 2012 May 15;2:400. doi: 10.1038/srep00400

Figure 14. (a) Learning curve for different values of the associativity parameter K if the agent, by external constraints, has only a finite time available to produce an action.

Figure 14

If the simulation takes longer than Dmax, the agent will not be rewarded. In such a case, the asymptotic performance of the learning drops dramatically for large values of K. An ensemble average over 10000 games is shown.