Table 5.
Hyper-Parameter Set | Actor/Critic Learning Rate | Reward Discount Factor | Batch Size |
---|---|---|---|
1 | 0.001 | 0.6 | 32 |
2 | 0.001 | 0.8 | 16 |
3 | 0.001 | 0.8 | 32 |
4 | 0.001 | 0.8 | 64 |
5 | 0.001 | 0.8 | 128 |
6 | 0.001 | 0.8 | 256 |
7 | 0.001 | 0.9 | 16 |
8 | 0.001 | 0.9 | 32 |
9 | 0.001 | 0.9 | 64 |
10 | 0.001 | 0.9 | 128 |
11 | 0.001 | 0.9 | 256 |
12 | 0.005 | 0.9 | 16 |
13 | 0.005 | 0.9 | 32 |
14 | 0.005 | 0.9 | 64 |
15 | 0.005 | 0.8 | 64 |
16 | 0.005 | 0.9 | 128 |
17 | 0.005 | 0.9 | 256 |
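
For readers who want to reuse these settings programmatically, the rows of Table 5 can be transcribed into a small lookup structure. The sketch below is only an illustration: the names `HyperParams`, `TABLE_5`, and `select` are hypothetical and do not come from the original work, and it assumes, as the column header suggests, that the actor and critic share a single learning rate.

```python
from typing import NamedTuple, Optional


class HyperParams(NamedTuple):
    """One row of Table 5 (field names are illustrative, not from the source)."""
    learning_rate: float  # actor/critic learning rate (assumed shared)
    discount: float       # reward discount factor
    batch_size: int


# Rows transcribed from Table 5, keyed by the index in the first column.
TABLE_5 = {
    1: HyperParams(0.001, 0.6, 32),
    2: HyperParams(0.001, 0.8, 16),
    3: HyperParams(0.001, 0.8, 32),
    4: HyperParams(0.001, 0.8, 64),
    5: HyperParams(0.001, 0.8, 128),
    6: HyperParams(0.001, 0.8, 256),
    7: HyperParams(0.001, 0.9, 16),
    8: HyperParams(0.001, 0.9, 32),
    9: HyperParams(0.001, 0.9, 64),
    10: HyperParams(0.001, 0.9, 128),
    11: HyperParams(0.001, 0.9, 256),
    12: HyperParams(0.005, 0.9, 16),
    13: HyperParams(0.005, 0.9, 32),
    14: HyperParams(0.005, 0.9, 64),
    15: HyperParams(0.005, 0.8, 64),
    16: HyperParams(0.005, 0.9, 128),
    17: HyperParams(0.005, 0.9, 256),
}


def select(
    learning_rate: Optional[float] = None,
    discount: Optional[float] = None,
    batch_size: Optional[int] = None,
) -> list:
    """Return the indices of all rows matching every filter that is not None."""
    wanted = HyperParams(learning_rate, discount, batch_size)
    return [
        idx
        for idx, hp in TABLE_5.items()
        if all(w is None or w == v for w, v in zip(wanted, hp))
    ]
```

For example, `select(discount=0.8)` returns `[2, 3, 4, 5, 6, 15]`, the rows that vary the batch size (and, for row 15, the learning rate) at a discount factor of 0.8.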