. 2025 Jan 4;14(6):11–22. doi: 10.1093/af/vfae028

Table 1.

Inputs and outputs of computer vision algorithms used in cattle with their respective functions

Input	Output	Applicable function in cattle	Research example
Image	Classification probabilities indicating different classes or situations	Classification of animal behavior (eating, resting, biting, chewing) and classification of health condition (e.g., lameness)	Rodríguez Alvarez et al. (2019)
Image	Regression of numerical values as a result of predictions (supervised learning)	Estimation of numerical values such as BW, dry matter intake, height, milk yield, or average daily gain	Gjergji et al. (2020)
Image	Segmentation probabilities for each pixel according to its corresponding class	Classification objects/items within images, such as collars, ear tags, health damages	Wu et al. (2020)
Image	Combinations of regression (e.g., bounding box coordinates) and probabilities (e.g., Mask RCNN with instance pixel masks within the bounding boxes)	A lightweight and high-precision detection model based on the YOLOv4 framework, named GG-YOLOv4, is used to automatically detect ocular surface temperatures from the thermal images of dairy cows	Wang et al. (2022)
Image or Video	3D reconstruction with point clouds	An unsupervised DBSCAN clustering algorithm was proposed to calibrate the leg region boundary based on clustering features	Li et al. (2022)
Video	Classification probability for sequential frames (e.g., drinking, estrus)	An algorithm for tracking the beef cattle’s key body parts, such as head–ear–neck position, using a state-of-the-art deep learning architecture, DeepLabCut. The extracted key points were analyzed using an extended short-term memory model to classify drinking and non-drinking periods	Islam et al. (2023)
Video	Combination of classification, regression, object tracking ID, and downstream analysis	A CNN model, which included a tensor of 4-channel matrices of data, each with 480 × 640 pixels. The model design was inspired by ResNet CNN, which achieved the best results in an ILSVRC classification and detection competition	Bezen et al. (2020)

Abbreviations: CNN, convolutional neural networks; RCNN, region-based convolutional neural networks; YOLO, you only look once.