| EMA | Efficient Multi-scale Attention |
| PConv | Partial Convolution |
| EXPMA | Exponential Moving Average |
| EMASlideLoss | Exponential Moving Average Slide Loss |
| Faster R-CNN | Faster Region-based Convolutional Neural Networks |
| Mask R-CNN | Mask Region-based Convolutional Neural Networks |
| YOLO | You Only Look Once |
| MATLAB | Matrix Laboratory |
| CA | Coordinate Attention |
| CBAM | Convolutional Block Attention Module |
| BiFPN | Bidirectional Feature Pyramid Network |
| SOTA | state-of-the-art |
| ELAN | Efficient Layer Attention Network |
| CIoU Loss | Complete Intersection over Union Loss |
| DFL Loss | Distribution Focal Loss |
| BCE Loss | Binary Cross-Entropy Loss |
| IoU | Intersection over Union |
| Conv | Convolution |
| SiLU | Sigmoid Linear Unit |
| BN | Batch Normalization |
| SPPF | Spatial Pyramid Pooling-Fast |
| FLOPs | floating point operations |
| GFLOPs | Giga Floating-point Operations Per Second |
| P | Precision |
| R | Recall |
| F1 | F1-score |
| AP | Average Precision |
| mAP50 | mean Average Precision at IoU threshold of 0.5 |
| mAP75 | mean Average Precision at IoU threshold of 0.75 |
| mAP50-95 | mean Average Precision at a range of IoU thresholds from 0.50 to 0.95 |
| FPS | Frame Per Second |
| Grad-CAM | Gradient-weighted Class Activation Mapping |