Skip to main content
. Author manuscript; available in PMC: 2015 May 1.
Published in final edited form as: Artif Intell. 2014 Feb 24;210:78–122. doi: 10.1016/j.artint.2014.02.004

Figure 9.9.

Figure 9.9

Histogram of the difference between the dropout variance of Oil and its approximate upperbound Wil(Wil) in a MNIST classifier network before and after training. Histograms are obtained by taking all non-input neurons and aggregating the results over 10 random input vectors. Note that at the beginnning of learning, with random small weights, E(Oil)Wil0.5, and thus Var(Oil)0 whereas Wil(1Wil)0.25.