2018 Jan 30;9:5. doi: 10.3389/fpsyg.2018.00005

Figure 4.

For Q-NN, NFQ, and DQN: (A) maximum reward obtained for different numbers of hidden units, plotted to find the optimal structure of the Q-network; (B) reward computed and plotted at each iteration, using the optimal number of hidden units (200).