Figure - PMC

Skip to main content

View full-text article in PMC

. Author manuscript; available in PMC: 2022 May 9.

Published in final edited form as: Phys Med. 2021 May 9;83:242–256. doi: 10.1016/j.ejmp.2021.04.016

Figure 9. — Typical architecture for a (deep) Convolutional Neural Network (CNN). Different convolutional kernels scan the input images leading to several feature maps. Then, down-sampling operations, such as max-pooling (i.e., taking the maximum value of a block of pixels), are applied to reduce the size of the feature maps. These two operations, convolution and pooling, are applied multiple times to extract higher-level features. At the end, the feature maps are flattened and passed through fully connected layers of neurons (see Figure 5), to obtain a final prediction. The embedded (automatic and unsupervised) feature extraction (Figure 4) is what enables CNNs to remove all handcrafted operations and makes them so powerful.