Skip to main content
. Author manuscript; available in PMC: 2024 Mar 8.
Published in final edited form as: Proc Mach Learn Res. 2023 Aug;219:285–307.

Table C.5:

Hyperparameter settings for Filter then Avg. across different data splits.

Filter then Avg.
Hyperparameter split1 split2 split3
Learning rate 0.003 0.001 0.003
Weight decay 0.00003 0.00001 0.00001
Learning rate schedule cosine cosine cosine