. 2020 Feb;122:218–230. doi: 10.1016/j.neunet.2019.10.011

Table 2.

Hyper-parameter values used for the DNN component of DQN and CTDL in the grid world simulations.

Parameter	Value	Description
$L$	3	Number of layers
$U$	[128, 128, 4]	Number of units
$C$	10,000	Number of steps before updating the target network
$B$	32	Batch size for training
$λ$	.00025	Learning rate for RMSProp
$κ$	.95	Momentum for RMSProp
$ϕ$	.01	Constant for denominator in RMSProp