Figure 8.
Long-term performance of agents using MAXIMIZE-EXT (T=107 trials): (a) mean gross energy gain (scaled); (b) switching rates (i.e. the mean number of switches per trial). Frames are organized as in figure 7.