Skip to main content
. 2019 Aug 20;11:220. doi: 10.3389/fnagi.2019.00220

Figure 2.

Figure 2

Common activation functions used in deep learning (red) and their derivatives (blue). When the sigmoid is differentiated, the maximum value is 0.25, which becomes closer to 0 when it continues to multiply.