Skip to main content
. Author manuscript; available in PMC: 2023 Aug 2.
Published in final edited form as: Adv Neural Inf Process Syst. 2022 Dec;35(DB):29776–29788.

Table 2: Human Baseline:

performance of models on joint training experiments is compared to the human baseline. The analysis is restricted to the 45 tasks used for evaluating humans. ResNet 50 approaches human-level performance only after SSL pre-training and finetuning on all task rules with 1000 samples per rule. Which is 50 times higher than the number of samples needed by humans.

N training samples 20 1000
ResNet-50 28.0 0 57.9 14
ViT-small 29.3 1 32.7 3
SCL 26.4 0 44.9 11
WReN 27.5 0 42.4 10
SCL-ResNet 18 26.8 0 64.1 18

ResNet-50 SSL 45.7 7 78.3 25
ViT-small SSL 38.7 6 60.3 17

Humans 78.7 26 - -