Table 3.
Hyper-parameter values unique to CTDL. , , , , and were selected by using a random grid search on a single grid world.
Parameter | Value | Description |
---|---|---|
36 | Number of units in SOM | |
10 | Temperature for calculating | |
1 | Temperature for calculating | |
.1 | Standard deviation of the SOM neighbourhood function | |
.1 | Constant for denominator in SOM neighbourhood function | |
.01 | Learning rate for updating the weights of the SOM | |
.9 | Learning rate for updating the Q values of the SOM |