Replicated SGD on a fully connected committee machine with ,605 synapses and units in the second layer: comparison between the noninteracting (i.e., standard SGD) and interacting versions, using replicas and a minibatch size of patterns. Each point shows the average and standard deviation over samples with optimal choice of the parameters, as a function of the training set size. (Top) Minimum training error rate achieved after epochs. (Bottom) Number of epochs required to find a solution. Only the cases with success rate are shown (note that the interacting case at has success rate but an error rate of just ).