Table 2. Free parameters of the model and their assigned values.
Value | Range | Symbol | Free Parameter |
0.02 | Updating Rate of the Average Reward | ||
0.0001 | - | Used to Determine Process Noise | |
0.05 | - | Variance of Observation Noise | |
1 | - | Rate of Exploration | |
0.1 | Update Rate of the Reward Function | ||
0.95 | Discount Factor |