Skip to main content
. 2025 Oct 9;12:1584657. doi: 10.3389/frobt.2025.1584657

TABLE 7.

Comparative table models used in computer vision.

Model Key applications Strengths Weaknesses
CNN (Chua, 1997) Component identification, defect detection - High accuracy in image classification and object detection
- Ability to learn complex patterns
- Requires large datasets
- High computational cost
You Only Look Once (YOLO) (Jiang et al., 2022) Real-time object detection - Real-time processing
- Single pass for object detection
- Lower accuracy for small objects
- Less effective for highly cluttered scenes
Single Shot MultiBox Detector (SSD) (Liu et al., 2016) - Faster than R-CNN
- Balances speed and accuracy
- Lower accuracy compared to R-CNN for small objects
R-CNN (Zhang et al., 2018) Object detection and segmentation - High accuracy in object detection and segmentation
- Effective for complex scenes
- Slower processing speed
- High computational requirements
Fast R-CNN (Girshick, 2015) - Faster than R-CNN- High accuracy in object detection and classification - Still slower than YOLO and SSD
- High computational cost
RetinaNet (Wang et al., 2019) - High accuracy
- Efficient in detecting objects of varying sizes
- Higher computational cost
- Complex training process
Mask R-CNN (Vuola et al., 2019) - Adds instance segmentation to object detection
- High accuracy in detecting and segmenting objects
- Slower processing speed
- High computational requirements