Table 1: PE AUC with vision transformer (ViT)

Model | Image Size | Patch Size | Initialization | Val AUC |
---|---|---|---|---|
SeXception | 576 | NA | ImageNet | 0.9634 |
ViT-B_32 | 512 | 32 | Random | 0.8212 |
ViT-B_32 | 224 | 32 | ImageNet21k | 0.8456 |
ViT-B_32 | 512 | 32 | ImageNet21k | 0.8847 |
ViT-B_16 | 512 | 16 | Random | 0.8385 |
ViT-B_16 | 224 | 16 | ImageNet21k | 0.8826 |
ViT-B_16 | 512 | 16 | ImageNet21k | 0.9065 |
ViT-B_16 | 576 | 16 | ImageNet21k | 0.9179 |
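
As a reading aid for the table, the image and patch sizes together determine how many patch tokens each ViT configuration processes. The sketch below is a minimal illustration, assuming square inputs split into non-overlapping square patches (the standard ViT tokenization); the helper name `num_patches` is hypothetical and does not come from the original work.

```python
def num_patches(image_size: int, patch_size: int) -> int:
    """Count non-overlapping square patches in a square image.

    Assumes the image side is an exact multiple of the patch side,
    which holds for every ViT row in the table.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    per_side = image_size // patch_size
    return per_side * per_side


# (model, image size, patch size) combinations listed in the table.
configs = [
    ("ViT-B_32", 224, 32),
    ("ViT-B_32", 512, 32),
    ("ViT-B_16", 224, 16),
    ("ViT-B_16", 512, 16),
    ("ViT-B_16", 576, 16),
]

# Map each configuration to its patch-token count (class token excluded).
patch_counts = {
    (name, image): num_patches(image, patch) for name, image, patch in configs
}

for (name, image), count in patch_counts.items():
    print(f"{name} @ {image}px: {count} patches")
```

Under these assumptions the count ranges from 49 patches (ViT-B_32 at 224) to 1296 patches (ViT-B_16 at 576). In the table, the configurations with more patches generally report higher validation AUC.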