| parameter | value | explain |
| warmup_epochs | 3.0 | Number of cycles for warm-up training. During the warm-up period, the learning rate is gradually increased from a smaller value to a set initial learning rate. |
| warmup_momentum | 0.8 | Initial momentum value during warm-up. |
| warmup_bias_lr | 0.1 | Initial bias learning rate during warm-up. |