Skip to main content
. 2024 Dec 12;14:30452. doi: 10.1038/s41598-024-75638-0

Fig. 8.

Fig. 8

The agent generates the rewards during training; this graph shows the rewards across steps.