Table A1. DCCT hyperparameters.
| Hyperparameter | Value |
|---|---|
| Observation rendering | (100, 100) |
| Observation crop | (84, 84) |
| Hidden units (MLP) | 1024 |
| MSA | 8 |
| Optimizer | AdamW |
| | (0.9, 0.999) |
| | (0.5, 0.999) |
| Learning rate | 2 |
| Learning rate | 2 |
| Weight decay | 3 |
| Batch size | 512 |
| Convolutional layers | 2 |
| Number of filters | 36 |
| Transformer encoder layers | 6 |
| Convolutional kernel size | 3 × 3 |
| Q function EMA | 0.01 |
| Critic target update freq | 2 |
| Nonlinearity | ReLU |
| Encoder EMA | 0.05 |
| Discount | 0.99 |
| Initial temperature | 0.1 |
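
As an illustration of how the settings in Table A1 might be carried into code, the following is a minimal sketch, not the authors' implementation. The class name `DCCTConfig`, all field names, and the helper `ema_update` are hypothetical. Only rows with an unambiguous name and value are included. The update rule in `ema_update` is an assumption about how the two EMA coefficients are applied (the common soft target update); the table itself does not state the convention.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class DCCTConfig:
    """Values copied from Table A1; the field names are hypothetical."""
    render_size: Tuple[int, int] = (100, 100)   # Observation rendering
    crop_size: Tuple[int, int] = (84, 84)       # Observation crop
    mlp_hidden_units: int = 1024                # Hidden units (MLP)
    optimizer: str = "AdamW"                    # Optimizer
    batch_size: int = 512                       # Batch size
    conv_layers: int = 2                        # Convolutional layers
    conv_filters: int = 36                      # Number of filters
    conv_kernel_size: Tuple[int, int] = (3, 3)  # Convolutional kernel size
    transformer_encoder_layers: int = 6         # Transformer encoder layers
    q_function_ema: float = 0.01                # Q function EMA
    critic_target_update_freq: int = 2          # Critic target update freq
    nonlinearity: str = "ReLU"                  # Nonlinearity
    encoder_ema: float = 0.05                   # Encoder EMA
    discount: float = 0.99                      # Discount
    initial_temperature: float = 0.1            # Initial temperature


def ema_update(target: List[float], online: List[float], rate: float) -> List[float]:
    """Soft update of target parameters toward online parameters.

    Assumed convention (not stated in the table):
        target <- (1 - rate) * target + rate * online
    """
    return [(1.0 - rate) * t + rate * o for t, o in zip(target, online)]


if __name__ == "__main__":
    cfg = DCCTConfig()
    # Toy parameter vectors, only to show the call shape.
    new_target = ema_update([0.0, 1.0], [1.0, 1.0], cfg.q_function_ema)
    print(cfg.batch_size, new_target)
```

Keeping the configuration frozen means a run's settings cannot be altered by accident after construction; a variant can be created explicitly with `dataclasses.replace(cfg, batch_size=256)`.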