2020 Jun 21;20(12):3515. doi: 10.3390/s20123515

Table 2.

Hyperparameters of the model.

DDPG Setup	Hyper-Parameters
Actor/Critic learning rate	1 × 10⁻³
Reward discount factor $γ$	0.9
Soft replacement $τ$	0.01
Batch size	32
Running episodes	300
Number of track points $K$	50
Training steps per update	200
Memory capacity	80,000
Updates	episodes × points × steps
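As a rough illustration, the values in Table 2 could be collected into a single configuration object, with the total number of updates computed from the formula in the last row. This is a minimal sketch, not the authors' code: the names `DDPGConfig`, `total_updates`, and `soft_update` are hypothetical, and the soft-replacement rule shown is the standard DDPG target-network update, which the table itself does not spell out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DDPGConfig:
    """Hyperparameters from Table 2 (field names are illustrative)."""
    actor_lr: float = 1e-3          # Actor learning rate
    critic_lr: float = 1e-3         # Critic learning rate
    gamma: float = 0.9              # Reward discount factor
    tau: float = 0.01               # Soft replacement coefficient
    batch_size: int = 32
    episodes: int = 300             # Running episodes
    track_points: int = 50          # Number of track points K
    steps_per_update: int = 200     # Training steps per update
    memory_capacity: int = 80_000   # Replay memory size

    @property
    def total_updates(self) -> int:
        # Last table row: updates = episodes x points x steps
        return self.episodes * self.track_points * self.steps_per_update


def soft_update(target, online, tau):
    """Standard DDPG soft replacement (assumed, not stated in the table):
    target <- tau * online + (1 - tau) * target, applied element-wise."""
    return [(1.0 - tau) * t + tau * o for t, o in zip(target, online)]


cfg = DDPGConfig()
print(cfg.total_updates)  # 3000000
```

With the tabulated values, the update count comes to 300 × 50 × 200 = 3,000,000. The soft-replacement coefficient of 0.01 means each call moves the target parameters 1% of the way toward the online parameters.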