Skip to main content
. 2025 Aug 23;15:31066. doi: 10.1038/s41598-025-16564-7

Table 7.

Comparative analysis of flops, average inference timey, and memory usage between diverse vision models and the proposed method.

Model FLOPs (G) Average inference time (ms/image) memory usage (MB)
VGG16 15.47 3.46 570.57
AlexNet 1.5 1.22 270.26
ResNet50 4.41 3.10 258.17
DenseNet 2.91 8.12 197.11
EfficientNetV2 2.91 7.49 243.86
Inception_V3 4.12 5.24 255.64
Vision Transformer 16.87 4.62 367.39
Swin Transformer 15.47 13.83 389.85
ConvNext 4.47 3.06 152.78
Proposed 2.10 1.14 98.46