Table I.
Parameter | Search Values |
---|---|
Hidden Layers | 1–5 |
Layer Widths | 65–260 (multiples of 5) |
Learning Rate | 10−5 − 10−2 |
Learning Rate Decay | 10−8 − 10−5 |
Dropout Probability | 0–0.5 |
Weight Initializations | Normal, Uniform He Normal, He Uniform |
Parameter | Search Values |
---|---|
Hidden Layers | 1–5 |
Layer Widths | 65–260 (multiples of 5) |
Learning Rate | 10−5 − 10−2 |
Learning Rate Decay | 10−8 − 10−5 |
Dropout Probability | 0–0.5 |
Weight Initializations | Normal, Uniform He Normal, He Uniform |