Skip to main content
. 2021 Jan 7;80(8):11765–11788. doi: 10.1007/s11042-020-10183-2

Table 2.

Parameters for BERT-Large

Parameter Name Value of Parameter
Number of Layers 24
Hidden Size 1024
Attention Heads 16
Number of Parameters 340M