Skip to main content
. 2025 Mar 3;6(3):101187. doi: 10.1016/j.patter.2025.101187

Table 5.

Ethical-Lens substantially enhances the bias alignment across various dimensions, mostly surpassing the performance of DALL·E 3

Methods GPT4-V evaluation
HEIM evaluation
Gender Race Age Gender Race Age
DD 1.0 0.958 0.983 0.990 0.662 0.911 0.745
+Ethical-Lens 0.256 0.504 0.584 0.215 0.590 0.504
SD 1.5 0.777 0.817 0.906 0.559 0.784 0.800
+Ethical-Lens 0.163 0.392 0.607 0.283 0.639 0.597
SD 2.0 0.674 0.752 0.877 0.622 0.865 0.854
+Ethical-Lens 0.209 0.396 0.627 0.418 0.678 0.752
SDXL 1.0 0.840 0.858 0.940 0.633 0.788 0.757
+Ethical-Lens 0.196 0.406 0.571 0.255 0.594 0.590
DALL·E 3 0.332 0.497 0.838 0.353 0.574 0.760

The table illustrates the comparison of scores across each alignment perspective within the bias dimension for different text-to-image models and our Ethical-Lens on HumanBias dataset (italics). indicates that lower scores are better.