Evaluation dataset. Ten images are taken from each of the 33 patients’ videos, for a total of 330 images. Four tissue patches are extracted from each image, for a total of 1320 patches covering the four tissue classes considered: healthy tissue, tissue with hypertrophic vessels, leukoplakia, and tissue with IPCL-like vessels. For a robust evaluation, the dataset is split at the patient level, so that all patches from a given patient fall in the same fold, and threefold cross-validation is performed. Each fold includes 11 patients, for a total of 110 images and 440 patches per fold, equally distributed among the four laryngeal tissue classes.
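The patient-level split described above can be illustrated with a minimal sketch. It is not the authors' procedure: the text does not say how patients were assigned to folds, so the random shuffle, the seed, and the helper name `patient_level_folds` are assumptions made for illustration. The sketch also does not enforce the per-class balance of patches, which would depend on the tissue labels.

```python
import random

# Counts taken from the text above.
N_PATIENTS = 33
IMAGES_PER_PATIENT = 10
PATCHES_PER_IMAGE = 4
N_FOLDS = 3


def patient_level_folds(patient_ids, n_folds, seed=0):
    """Hypothetical helper: shuffle patient IDs and deal them into folds.

    Whole patients are assigned to a fold, so no patient can contribute
    patches to both the training and the test side of a split.
    """
    ids = list(patient_ids)
    random.Random(seed).shuffle(ids)  # assumed: random assignment
    return [ids[i::n_folds] for i in range(n_folds)]


folds = patient_level_folds(range(N_PATIENTS), N_FOLDS)

for k, test_patients in enumerate(folds):
    # Train on the other folds, test on fold k.
    train_patients = [p for j, fold in enumerate(folds) if j != k for p in fold]
    # No patient appears on both sides of the split.
    assert not set(train_patients) & set(test_patients)

    n_images = len(test_patients) * IMAGES_PER_PATIENT
    n_patches = n_images * PATCHES_PER_IMAGE
    print(f"fold {k}: {len(test_patients)} patients, "
          f"{n_images} images, {n_patches} patches")
```

Under these assumptions each fold holds 11 patients, hence 110 images and 440 patches, which matches the counts stated above.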