Improved predictions on the held-out test data. From left to right: input images (first column), ground truths (second column), SAM masks (third column), predictions from (fourth column), and final binary occlusion segmentation masks (fifth column). In the binary masks, white regions are classified as occlusions and black regions as non-occlusions. In the third column, different colors only distinguish image segments and carry no occlusion class, which is the main shortcoming of SAM in our task. Green boxes highlight easily visible improvements in occlusion segmentation.