Table 2.
The parameters assignment of algorithm.
| Parameter | Value | Meaning |
|---|---|---|
| M | 1000 | The number of simulation episodes. |
| T | 300 | The maximum of steps per episode. |
| K | 32 | The training episode of algorithm. |
| k | 32 | The size of training batch. |
| 5000 | The size of experiences memory D. | |
| The availability exponent of PER. | ||
| The initial exponent of IS. | ||
| The increment of exponent of IS. |