Skip to main content
. 2023 Feb 10;10:102072. doi: 10.1016/j.mex.2023.102072

Table 3.

Hyper-parameters settings for training the teacher candidates and a non-KD student.

Hyper-parameter Value
Batch Size 16
Optimizer Adam
Epochs 30
Learning Rate 0.0001