Table 2.
Hyperparameter | Value |
---|---|
Optimizer | Stochastic gradient descent |
Initial learning rate | 0.01 |
Learning rate decay | Exponential, decay rate 0.1 |
Epochs | 15 |
Batch size | 32 |
Feed-forward classification head | 128, 128 (ReLU activated) |
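
For illustration, the settings in Table 2 can be collected into a small configuration object, with the exponential learning-rate decay written out explicitly. This is a minimal sketch, not the authors' training code: the names `TrainConfig`, `learning_rate`, and `decay_epochs` are hypothetical, and the table does not state the interval over which the decay rate of 0.1 is applied. The sketch assumes the rate is applied once over the full 15 epochs, and it reads "128, 128" as two hidden layers of 128 units each.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class TrainConfig:
    """Hyperparameters from Table 2 (field names are illustrative)."""
    optimizer: str = "sgd"                  # stochastic gradient descent
    initial_lr: float = 0.01
    decay_rate: float = 0.1                 # exponential decay factor
    epochs: int = 15
    batch_size: int = 32
    # ASSUMPTION: "128, 128" is read as two hidden layers of 128 units.
    head_units: Tuple[int, ...] = (128, 128)
    head_activation: str = "relu"
    # ASSUMPTION: the decay interval is not given in the table; here the
    # factor of 0.1 is reached after all 15 epochs.
    decay_epochs: int = 15


def learning_rate(cfg: TrainConfig, epoch: float) -> float:
    """Exponentially decayed learning rate at a (possibly fractional) epoch."""
    return cfg.initial_lr * cfg.decay_rate ** (epoch / cfg.decay_epochs)


if __name__ == "__main__":
    cfg = TrainConfig()
    for epoch in range(cfg.epochs + 1):
        print(f"epoch {epoch:2d}: lr = {learning_rate(cfg, epoch):.6f}")
```

Under these assumptions the learning rate starts at 0.01 and falls smoothly to one tenth of that value by the end of training; a different decay interval would only require changing `decay_epochs`.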