2024 Mar 14;24:75. doi: 10.1186/s12911-024-02481-8

Table 3.

Hyper-parameters of BERTSUM. Where multiple candidate values were considered for a parameter, the value ultimately chosen is displayed in bold

Parameters      Values
encoder         (classifier/transformer/rnn)
batch size      (1000/2000/3000)
train steps     10,000
dropout         0.1
learning rate   $2 \times 10^{-3} \cdot \min(\text{step}^{-0.5},\ \text{step} \cdot \text{warmup}^{-1.5})$
warmup          (1000/10,000)
optimizer       adam
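
The learning-rate entry in the table describes a warmup schedule: the rate grows linearly until `step` reaches `warmup`, then decays as the inverse square root of `step`. A minimal Python sketch of that formula is given below. The function name `bertsum_lr` and the parameter name `base_lr` are illustrative assumptions, not identifiers from the BERTSUM code, and `warmup` is passed explicitly so that either candidate value from the table can be tried.

```python
def bertsum_lr(step: int, warmup: int, base_lr: float = 2e-3) -> float:
    """Learning rate at a given training step (hypothetical helper).

    Implements base_lr * min(step^-0.5, step * warmup^-1.5), the
    schedule listed in the table. `step` is counted from 1.
    """
    if step < 1:
        raise ValueError("step must be >= 1")
    if warmup < 1:
        raise ValueError("warmup must be >= 1")
    # Before `warmup` steps the second term is smaller (linear ramp-up);
    # afterwards the first term is smaller (inverse square-root decay).
    return base_lr * min(step ** -0.5, step * warmup ** -1.5)


if __name__ == "__main__":
    # Compare the two candidate warmup values over the 10,000 train steps.
    for warmup in (1_000, 10_000):
        for step in (1, 1_000, 5_000, 10_000):
            print(f"warmup={warmup:>6} step={step:>6} lr={bertsum_lr(step, warmup):.3e}")
```

The two terms inside `min` are equal when `step == warmup`, so the rate peaks there at `base_lr * warmup ** -0.5`. With a warmup of 10,000 and 10,000 train steps, the rate would therefore still be rising when training ends.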