Skip to main content
. 2020 Feb;122:218–230. doi: 10.1016/j.neunet.2019.10.011

Table 2.

Hyper-parameter values used for the DNN component of DQN and CTDL in the grid world simulations.

Parameter Value Description
L 3 Number of layers
U [128, 128, 4] Number of units
C 10,000 Number of steps before updating the target network
B 32 Batch size for training
λ .00025 Learning rate for RMSProp
κ .95 Momentum for RMSProp
ϕ .01 Constant for denominator in RMSProp