Table 1.
Algorithms | Parameter Settings |
---|---|
HyAdamC | , , , |
SGD | |
RMSProp | Learning rate = , , |
Adam | , , |
AdamW | , , |
Adagrad | , , |
AdaDelta | , , |
Rprop | , , , step sizes |
Yogi | , , , |
Fromage | |
TAdam | , , , , |
diffGrad | , , |
Algorithms | Parameter Settings |
---|---|
HyAdamC | , , , |
SGD | |
RMSProp | Learning rate = , , |
Adam | , , |
AdamW | , , |
Adagrad | , , |
AdaDelta | , , |
Rprop | , , , step sizes |
Yogi | , , , |
Fromage | |
TAdam | , , , , |
diffGrad | , , |