Skip to main content
. 2023 Nov 28;113(5):2655–2674. doi: 10.1007/s10994-023-06481-z

Fig. 1.

Fig. 1

Overview of the reinforcement learning framework used. A dueling network architecture, with two streams to independently estimate the state-values (scalar) and advantages (vector) for each action, is shown