Skip to main content

View full-text article in PMC

. 2023 Jan 3;23(1):515. doi: 10.3390/s23010515

Table A1.

DCCT hyperparameters.

Hyperparameter	Value
Observation rendering	(100, 100)
Observation crop	(84, 84)
Hidden units (MLP)	1024
MSA	8
Optimizer	AdamW
$(β_{1}, β_{2}) \to (f_{θ}, π_{ψ}, Q_{ϕ})$	(0.9, 0.999)
$(β_{1}, β_{2}) \to (α)$	(0.5, 0.999)
Learning rate $(f_{θ}, π_{ψ}, Q_{ϕ})$	2 $\times 10^{- 3}$
Learning rate $(α)$	2 $\times 10^{- 4}$
Weight decay	3 $\times 10^{- 2}$
Batch Size	512
Convolutional layers	2
Number of filters	36
Transformer encoder layers	6
Convolutional kernal size	3 × 3
Q function EMA $T$	0.01
Critic target update freq	2
Nonlinearity	ReLU
Encoder EMA $T$	0.05
Discount $γ$	0.99
Initial temperature	0.1