2024 Jun 21;16:73. doi: 10.1186/s13321-024-00863-8

Table 1.

Hyperparameters used for the Llamol model

Hyperparameter                         Llamol
Number of attention heads (n_heads)    8
Number of decoder blocks               8
Dropout probability                    10%
Activation function                    SwiGLU
Positional embeddings                  RoPE
Embedding dimension (d_emb)            384
FFN hidden dimension (d_ffn)           1024
Vocabulary size (d_voc)                591
Max SMILES length                      256
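
As a reading aid, the values in Table 1 can be collected into a small configuration object. The sketch below is illustrative only: the class name LlamolConfig, its field names, and the helper rough_param_count are hypothetical and are not taken from the Llamol code base. The parameter estimate assumes a standard decoder block with four d_emb x d_emb attention projections and a three-matrix SwiGLU feed-forward layer. It ignores biases, normalization weights, and any separate output head, so it should be read as an order-of-magnitude check rather than the model's reported size.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LlamolConfig:
    """Hypothetical container for the Table 1 hyperparameters."""
    n_heads: int = 8            # number of attention heads
    n_blocks: int = 8           # number of decoder blocks
    dropout: float = 0.10       # dropout probability (10%)
    activation: str = "swiglu"  # activation function
    positional: str = "rope"    # rotary positional embeddings
    d_emb: int = 384            # embedding dimension
    d_ffn: int = 1024           # FFN hidden dimension
    d_voc: int = 591            # vocabulary size
    max_smiles_len: int = 256   # maximum SMILES length

    @property
    def head_dim(self) -> int:
        """Per-head dimension; d_emb must divide evenly across heads."""
        if self.d_emb % self.n_heads != 0:
            raise ValueError("d_emb must be divisible by n_heads")
        return self.d_emb // self.n_heads


def rough_param_count(cfg: LlamolConfig) -> int:
    """Rough weight count under the assumptions stated in the lead-in."""
    embedding = cfg.d_voc * cfg.d_emb
    attention = 4 * cfg.d_emb * cfg.d_emb  # Q, K, V and output projections
    swiglu_ffn = 3 * cfg.d_emb * cfg.d_ffn  # gate, up and down projections
    return embedding + cfg.n_blocks * (attention + swiglu_ffn)


cfg = LlamolConfig()
print("head dimension:", cfg.head_dim)
print("rough parameter count:", rough_param_count(cfg))
```

With the Table 1 values, each attention head works on 384 / 8 = 48 dimensions. Rotary embeddings rotate pairs of dimensions, so they need an even per-head dimension, which 48 satisfies.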