Table 7. ResNet configuration for ImageNet100 (He et al., 2016).
| Layer | Output size | Network configuration |
|---|---|---|
| conv1 | , 64, stride 1 | |
| conv2 x | ||
| None/FMAttn/CAM | ||
| conv3 x | ||
| None/FMAttn/CAM | ||
| conv4 x | ||
| None/FMAttn/CAM | ||
| conv5 x | ||
| None/FMAttn/CAM | ||
| conv6 x | ||
| None/FMAttn/CAM | ||
| Adaptive average pooling | ||
| FC-100 |
Notes:
The first block of convN x is followed by a downsample layer, except for conv2 x.
The first convolutional layer of conv3_x and conv4_x has a stride of 2. The rest are all 1.
Each convolutional layer is followed by a BN layer and ReLU layer.
[ ]: Convolution kernel size , n channels.
Output size: .