| ANN | Artificial Neural Network |
| AP | Average Precision |
| AUC | Area Under the Curve |
| CCD | Charge-Coupled Device |
| CNN | Convolutional Neural Network |
| COCO | Common Objects in Context |
| CPU | Central Processing Unit |
| CVAT | Computer Vision Annotation Tool |
| DL | Deep Learning |
| FN | False Negatives |
| FP | False Positives |
| FPN | Feature Pyramid Network |
| GPS | Global Positioning System |
| GPU | Graphics Processing Unit |
| IoU | Intersection over Union |
| KNN | K-Nearest Neighbours |
| NMS | Non-Maximum Suppression |
| PAN | Path Aggregation Network |
| R-CNN | Regions with CNN features |
| ReLU | Rectified Linear Unit |
| R-FCN | Region-based Fully Convolutional Networks |
| ROI | Region Of Interest |
| RTK | Real-Time Kinematic |
| SLAM | Simultaneous Localization And Mapping |
| SSD | Single-Shot MultiBox Detector |
| SVM | Support Vector Machine |
| TP | True Positives |
| TPU | Tensor Processing Unit |
| UAV | Unmanned Aerial Vehicle |
| UGV | Unmanned Ground Vehicle |
| VOC | Visual Object Classes |
| XML | Extensible Markup Language |
| YOLO | You Only Look Once |