Graphical overview of the participants’ methodologies for Task 1 as described in Section 4 (Key: - input frame; - groundtruth; - prediction). AQ-ENIB (a) proposed an ensemble of DenseNet models with Test Time Augment (TTA). BioPolimi (b) combined ResNet50 features with a Histogram of Oriented Gradients (HoG) computed on . RREB (c) proposed a multi-task for segmentation and multi-scale regression of HoG features (, , …) computed on (, , …). GRECHID (d) used 3 SEResNeXt-UNet models individually trained on each class ensembled by thresholding, where are pixels predicted with high confidence and is the empirical threshold. SANO (e) proposed a mean ensemble of Feature Pyramid Network (FPN) with ResNet152 backbone. OOF (f) used an EfficientNet UNet++, preprocessing images with contrast-limited adaptive histogram equalization (CLAHE) and median filter.