Table 6.
The parameters assignment of algorithm.
| Parameter | Value | Meaning |
|---|---|---|
| M | 6000 | The number of simulation episodes. |
| T | 1000 | The maximum of steps per episode. |
| K | 64 | The training episode of algorithm. |
| k | 64 | The size of training batch. |
| The size of experiences memory D. | ||
| The availability exponent of PER. | ||
| The initial exponent of IS. | ||
| The increment of exponent of IS. |