TABLE I:
The default hyperparameters.
Hyperparamter | Value |
---|---|
GGT Network Architecture (channels) | [256, 256, 128] |
Number Neighbor nodes | 30 |
Initial Learning rate | 1e-3 |
Maximum Epochs | 100 |
Optimizer | ADAM |
Batch size (b) | 4 |
PE dimension | 32 |
weight decay | 1e-6 |