2023 Jun 9;36(5):2194–2209. doi: 10.1007/s10278-023-00832-x

Table 10.

Input size and input preprocessing function for each CNN

| CNN | Input shape | Preprocess input function |
|---|---|---|
| VGG16 [36] | 224 × 224 | Images are converted from RGB to BGR, then each color channel is zero-centered with respect to the ImageNet dataset, without scaling. |
| InceptionV3 [38] | 299 × 299 | Input pixel values are scaled between -1 and 1, sample-wise. |
| ResNet152V2 [40] | 224 × 224 | Input pixel values are scaled between -1 and 1, sample-wise. |
| InceptionResNetV2 [41] | 299 × 299 | Input pixel values are scaled between -1 and 1, sample-wise. |
| MobileNetV2 [46] | 224 × 224 | Input pixel values are scaled between -1 and 1, sample-wise. |
| DenseNet201 [43] | 224 × 224 | Input pixel values are scaled between 0 and 1, and each channel is normalized with respect to the ImageNet dataset. |
| Xception [42] | 299 × 299 | Input pixel values are scaled between -1 and 1, sample-wise. |
| NasNetLarge [44] | 331 × 331 | Input pixel values are scaled between -1 and 1, sample-wise. |
| EfficientNetV2L [49] | 480 × 480 | None (inputs are passed through unchanged). |
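
The four preprocessing styles in Table 10 reduce to simple per-channel arithmetic. The sketch below illustrates them on a single RGB pixel in plain Python. It is not the authors' code: the function names are hypothetical, and the ImageNet mean and standard-deviation constants are the commonly used values, which the table itself does not state. In practice these operations would be applied to whole image arrays by a library helper.

```python
# Illustrative per-pixel versions of the preprocessing styles in Table 10.
# Function names are hypothetical; the ImageNet constants below are the
# commonly used values and are an assumption (the table does not list them).

IMAGENET_MEAN_BGR = (103.939, 116.779, 123.68)  # 0-255 scale, BGR order
IMAGENET_MEAN_RGB = (0.485, 0.456, 0.406)       # 0-1 scale, RGB order
IMAGENET_STD_RGB = (0.229, 0.224, 0.225)        # 0-1 scale, RGB order


def bgr_zero_center(rgb):
    """VGG16 style: swap RGB to BGR, subtract the ImageNet mean, no scaling."""
    r, g, b = rgb
    return tuple(v - m for v, m in zip((b, g, r), IMAGENET_MEAN_BGR))


def scale_minus1_to_1(rgb):
    """InceptionV3 style: map [0, 255] linearly onto [-1, 1]."""
    return tuple(v / 127.5 - 1.0 for v in rgb)


def unit_scale_normalize(rgb):
    """DenseNet201 style: scale to [0, 1], then normalize each channel."""
    return tuple(
        (v / 255.0 - m) / s
        for v, m, s in zip(rgb, IMAGENET_MEAN_RGB, IMAGENET_STD_RGB)
    )


def identity(rgb):
    """EfficientNetV2L style: pixel values are passed through unchanged."""
    return tuple(float(v) for v in rgb)


# Model name -> (input height/width in pixels, preprocessing style), per Table 10.
MODEL_CONFIG = {
    "VGG16": (224, bgr_zero_center),
    "InceptionV3": (299, scale_minus1_to_1),
    "ResNet152V2": (224, scale_minus1_to_1),
    "InceptionResNetV2": (299, scale_minus1_to_1),
    "MobileNetV2": (224, scale_minus1_to_1),
    "DenseNet201": (224, unit_scale_normalize),
    "Xception": (299, scale_minus1_to_1),
    "NasNetLarge": (331, scale_minus1_to_1),
    "EfficientNetV2L": (480, identity),
}

if __name__ == "__main__":
    pixel = (255, 128, 0)  # an arbitrary orange RGB pixel
    for name, (size, preprocess) in MODEL_CONFIG.items():
        print(f"{name}: {size}x{size} input, pixel -> {preprocess(pixel)}")
```

Keeping the input size and the preprocessing function together in one lookup makes it harder to pair a network with the wrong transform, which matters because seven of the nine models share one scaling rule while VGG16, DenseNet201, and EfficientNetV2L each differ.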