. 2023 Jun 9;36(5):2194–2209. doi: 10.1007/s10278-023-00832-x

Table 10.

Input size and preprocessing input function for each CNN

CNN	Input Shape	Preprocess Input Function
VGG16 [36]	224x224	It converts RGB to BGR, The images are converted from RGB to BGR, then each color channel is zero-centered with respect to the ImageNet dataset, without scaling.
InceptionV3 [38]	299x299	The inputs pixel values are scaled between -1 and 1, sample-wise.
ResNet152V2 [40]	224x224	The inputs pixel values are scaled between -1 and 1, sample-wise.
InceptionResNetV2 [41]	299x299	The inputs pixel values are scaled between -1 and 1, sample-wise.
MobileNetV2 [46]	224x224	The inputs pixel values are scaled between -1 and 1, sample-wise.
DenseNet201 [43]	224x224	The input pixels values are scaled between 0 and 1 and each channel is normalized with respect to the ImageNet dataset.
Xception [42]	299x299	The inputs pixel values are scaled between -1 and 1, sample-wise.
NasNetLarge [44]	331x331	The inputs pixel values are scaled between -1 and 1, sample-wise.
EfficientNetV2L [49]	480x480	Nothing