Skip to main content
. 2016 Nov 15;113(48):E7655–E7662. doi: 10.1073/pnas.1608103113

Fig. 3.

Fig. 3.

Replicated SGD on a fully connected committee machine with N=1,605 synapses and K=5 units in the second layer, comparison between the noninteracting (i.e., standard SGD) and interacting versions, using y=7 replicas and a minibatch size of 80 patterns. Each point shows averages and standard deviations on 10 samples with optimal choice of the parameters, as a function of the training set size. (Top) Minimum training error rate achieved after 104 epochs. (Bottom) Number of epochs required to find a solution. Only the cases with 100% success rate are shown (note that the interacting case at α=0.6 has 50% success rate but an error rate of just 0.07%).