Skip to main content
. 2023 Jan 3;23(1):515. doi: 10.3390/s23010515

Table A1.

DCCT hyperparameters.

Hyperparameter Value
Observation rendering (100, 100)
Observation crop (84, 84)
Hidden units (MLP) 1024
MSA 8
Optimizer AdamW
(β1,β2)(fθ,πψ,Qϕ) (0.9, 0.999)
(β1,β2)(α) (0.5, 0.999)
Learning rate (fθ,πψ,Qϕ) 2 × 103
Learning rate (α) 2 × 104
Weight decay 3 × 102
Batch Size 512
Convolutional layers 2
Number of filters 36
Transformer encoder layers 6
Convolutional kernal size 3 × 3
Q function EMA T 0.01
Critic target update freq 2
Nonlinearity ReLU
Encoder EMA T 0.05
Discount γ 0.99
Initial temperature 0.1