2023 Jul 24;13:11945. doi: 10.1038/s41598-023-38259-7

Figure 5.

Plot of the average rewards over training steps for different setups. Each curve is averaged over three random seeds of the same experiment.
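
The per-seed averaging described in the caption can be illustrated with a minimal sketch. It assumes each run logs one reward value per training step and that all runs cover the same steps; the function name `average_over_seeds` and the reward values are hypothetical, chosen only for illustration, and are not taken from the paper.

```python
from statistics import mean


def average_over_seeds(curves):
    """Average per-step rewards across runs (one list of rewards per seed)."""
    if not curves:
        raise ValueError("need at least one curve")
    n_steps = len(curves[0])
    if any(len(c) != n_steps for c in curves):
        raise ValueError("all curves must cover the same training steps")
    # zip(*curves) groups the rewards of every seed at the same training step.
    return [mean(step_rewards) for step_rewards in zip(*curves)]


# Hypothetical reward logs for three seeds (illustrative numbers only).
seed_curves = [
    [0.0, 1.0, 2.0],
    [1.0, 2.0, 3.0],
    [2.0, 3.0, 4.0],
]
averaged = average_over_seeds(seed_curves)
print(averaged)
```

Each point of the resulting curve is the mean of the three runs at that training step, which is one plausible reading of how the plotted curves were obtained.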