Skip to main content
. 2024 Jan 10;15:434. doi: 10.1038/s41467-023-43957-x

Fig. 2. Numerical results on ResNet as a function of step (Each step corresponds to a step of stochastic gradient descent based on the derivatives of the loss computed from 2048 randomly selected training samples).

Fig. 2

a ResNet Hessian spectra during training. b Estimated error proxy during training. c Training accuracy evolution for ResNet.