Table 3. Comparison of the Experimental Results of Different Optimization Algorithms for the BiGRU Model.
model | running time/s | MSE of the training set | MSE of the test set |
---|---|---|---|
SGD | 3 | 0.000544 | 0.000497 |
SGDM | 3 | 0.000467 | 0.000433 |
NAG | 5 | 0.000460 | 0.000428 |
RMSProp | 3 | 0.000455 | 0.000415 |
AdaGrad | 4 | 0.000471 | 0.000432 |
AdaDelta | 4 | 0.000477 | 0.000433 |
Adam | 4 | 0.000457 | 0.000419 |
Adamax | 4 | 0.000447 | 0.000413 |