Table 1.
Parameter | Description | Default value |
---|---|---|
COMMON | ||
DC | Delay between P and (C or N) | 100 |
DS | Delay between C and SS | 1 |
RS | Magnitude of smaller-sooner reward | 10 |
DL | Delay between C and LL | 50 |
RL | Magnitude of larger-later reward | 50 |
α | Learning rate | 0.1 |
μAGENTS MODEL | ||
Nμ | Number of μAgents | 1000 |
K | Hyperbolic discount rate | 1 |
AVERAGE REWARD MODEL | ||
σ | Average reward update rate | 0.002 |
HDTD MODEL | ||
σ | Average reward update rate | 0.01 |
SEMI-MARKOV MODEL | ||
k | Hyperbolic discount rate | 1 |