Table 2.
Parameter | Value |
---|---|
Number of layers | 3 |
Number of units | [128, 128, 4] |
Number of steps before updating the target network | 10,000 |
Batch size for training | 32 |
Learning rate for RMSProp | 0.00025 |
Momentum for RMSProp | 0.95 |
Constant for denominator in RMSProp | 0.01 |
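
As a rough illustration of how the values in Table 2 might be collected in code, the sketch below stores them in a small configuration object. The class name, field names, and helper functions are hypothetical and are not taken from the original implementation. The RMSProp step is a simplified scalar version that assumes the "momentum" is the decay rate of the running mean of squared gradients and that the constant is added inside the square root; the actual optimizer may differ on both points.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class TrainingHyperparameters:
    """Values copied from Table 2; the field names are illustrative only."""
    layer_units: Tuple[int, ...] = (128, 128, 4)
    target_update_steps: int = 10_000
    batch_size: int = 32
    learning_rate: float = 0.00025
    rmsprop_momentum: float = 0.95
    rmsprop_constant: float = 0.01

    @property
    def num_layers(self) -> int:
        # Table 2 lists 3 layers, which matches the length of the unit list.
        return len(self.layer_units)


def should_sync_target(step: int, hp: TrainingHyperparameters) -> bool:
    """Return True on steps where the target network would be refreshed."""
    return step > 0 and step % hp.target_update_steps == 0


def rmsprop_step(param: float, grad: float, mean_sq: float,
                 hp: TrainingHyperparameters) -> Tuple[float, float]:
    """One scalar RMSProp update under the assumptions stated above."""
    # Running mean of squared gradients (assumed role of the 0.95 value).
    mean_sq = hp.rmsprop_momentum * mean_sq + (1.0 - hp.rmsprop_momentum) * grad ** 2
    # Assumed placement of the denominator constant: inside the square root.
    param = param - hp.learning_rate * grad / (mean_sq + hp.rmsprop_constant) ** 0.5
    return param, mean_sq
```

With these defaults, `should_sync_target` fires every 10,000 steps, and the relatively large denominator constant (0.01) keeps the effective step size bounded when the squared-gradient average is near zero.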