Table 1.
Details of the visual geometry group (VGG)-based networks configurations with respect to number and type of layers (depth), filter size, number of filters, and the output size after each operation. The input image size and output size shown in the table are for the mathematical analysis of images (AMI) and AMI cropped (AMIC) datasets. For the West Pomeranian University of Technology (WPUT) dataset, the models use an input image size of to preserve the aspect ratio and the output after each operation is obtained similar to AMI and AMIC.
Block | Model | Filter Size | Output Size | |||
---|---|---|---|---|---|---|
VGG-11 | VGG-13 | VGG-16 | VGG-19 | |||
Input | Input Image ( RGB) | - | - | |||
Block 1 | Convolution | Convolution | Convolution | Convolution | (64) | |
Convolution | Convolution | Convolution | (64) | |||
Max-Pooling | - | |||||
Block 2 | Convolution | Convolution | Convolution | Convolution | (128) | |
Convolution | Convolution | Convolution | (128) | |||
Max-Pooling | - | |||||
Block 3 | Convolution | Convolution | Convolution | Convolution | (256) | |
Convolution | Convolution | Convolution | Convolution | (256) | ||
Convolution | Convolution | (256) | ||||
Convolution | (256) | |||||
Max-Pooling | - | |||||
Block 4 | Convolution | Convolution | Convolution | Convolution | (512) | |
Convolution | Convolution | Convolution | Convolution | (512) | ||
Convolution | Convolution | (512) | ||||
Convolution | (512) | |||||
Max-Pooling | - | |||||
Block 5 | Convolution | Convolution | Convolution | Convolution | (512) | |
Convolution | Convolution | Convolution | Convolution | (512) | ||
Convolution | Convolution | (512) | ||||
Convolution | (512) | |||||
Adaptive Average Pooling | - | |||||
Fully Connected | Fully Connected, 2048 Neurons | |||||
Dropout Chance | ||||||
Fully Connected, 2048 Neurons | ||||||
Dropout Chance | ||||||
Fully Connected, 100 Neurons | ||||||
Log-Soft-Max |