Table 4. Algorithm setup hyper-parameters.
Hyper-parameter | Value |
---|---|
Actor/Critic learning rate | 0.001 |
Reward discount factor | 0.9 |
Soft replacement | 0.001 |
Batch size | 32 |
Running episodes | 500 |
Steps per episode | 200 |
Memory capacity | 15,000 |
Updates | episodes × steps (500 × 200 = 100,000) |
Angle factor | 0.5 |
Distance factor | 0.5 |
TSC factor | 20 |
Control cycle (s) | 0.125 |
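
The settings in Table 4 can be collected into a single configuration object, which also makes the derived quantities explicit. The following is a minimal sketch, not the authors' implementation: the class name `HyperParams`, its field names, and the helper `soft_replace` are hypothetical. It assumes that "soft replacement" denotes the usual Polyak-averaged target-network update with coefficient 0.001, and that each step lasts exactly one control cycle.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HyperParams:
    """Values copied from Table 4; field names are illustrative only."""
    learning_rate: float = 0.001      # shared by actor and critic
    discount: float = 0.9             # reward discount factor
    soft_replacement: float = 0.001   # assumed to be the target-update coefficient
    batch_size: int = 32
    episodes: int = 500
    steps_per_episode: int = 200
    memory_capacity: int = 15_000
    angle_factor: float = 0.5
    distance_factor: float = 0.5
    tsc_factor: float = 20.0
    control_cycle_s: float = 0.125

    @property
    def total_updates(self) -> int:
        # "Updates = episodes x steps" from the table.
        return self.episodes * self.steps_per_episode

    @property
    def episode_duration_s(self) -> float:
        # Assumption: one step corresponds to one control cycle.
        return self.steps_per_episode * self.control_cycle_s


def soft_replace(target, source, tau):
    """Polyak averaging: move each target weight a fraction tau toward the source."""
    return [(1.0 - tau) * t + tau * s for t, s in zip(target, source)]


cfg = HyperParams()
print(cfg.total_updates)        # 500 * 200
print(cfg.episode_duration_s)   # 200 * 0.125
```

Under these assumptions the run performs 100,000 updates, each episode covers 25 s of control time, and the 15,000-entry memory holds at most 15% of all transitions collected during training.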