The Apemen Faces Database (ApeFD)

Zdzisław Lewandowski; Slawomir Wacewicz; Juan Olvido Perea-García; Vojtěch Fiala; Marta Sibierska; Anna Szala; Dariusz P Danel

doi:10.1038/s41597-025-05813-z

. 2025 Aug 21;12:1458. doi: 10.1038/s41597-025-05813-z

The Apemen Faces Database (ApeFD)

Zdzisław Lewandowski ¹, Slawomir Wacewicz ^2,^✉, Juan Olvido Perea-García ^2,³, Vojtěch Fiala ², Marta Sibierska ², Anna Szala ², Dariusz P Danel ⁴

PMCID: PMC12371075 PMID: 40841720

Abstract

The Apemen Faces Database is a novel and versatile stimulus set designed for research in behavioral biology, evolutionary psychology, and related fields. The dataset comprises 620 photorealistic, artificially generated facial images of 31 generalized hominin models, available in multiple ocular coloration variants (31 hominins x 20 color variants). Each of the 31 facial portraits is paired with geometric morphometric data and norming information that includes perceptual ratings of six constructs (Threat, Sociability, Trustworthiness, Health, Age, and Masculinity). Further, editable .psd files enable easy generation of a wide spectrum of great ape eye phenotypes. The images were designed to be morphologically diverse, sufficiently humanlike to elicit social attributions, yet clearly non-human. This unique “humanlike but not human” design facilitates the study of face perception beyond the boundaries of extant human variation, offering novel opportunities for investigating cognitive and perceptual mechanisms in both humans and non-human primates.

Subject terms: Human behaviour, Biological anthropology, Evolutionary developmental biology

Background & Summary

We present the ApeFD, Apemen Faces Database¹, a highly versatile resource with applications in a broad range of research fields, in particular in evolutionary behavioral sciences. Our dataset contains photorealistic, artificial facial images of 31 generalised “hominins” (15 female, 15 male, plus one extra male) presented in different coloration versions (620 PNG files – cf. Figs. 1, 2 for examples), together with geometric morphometric landmarks and measurements, and extensive norming data. It is further extended with 31 PSD master files that allow users to easily arrive at a full range of colorations, capturing most of the extant diversity in visible great ape eye phenotype. The images have been developed to meet the following criteria: (1) represent a diverse range of facial morphologies that (2) look sufficiently humanlike to be meaningfully ascribed human attributes such as sociability or trustworthiness, but (3) be non-human, i.e., distinct from any extant human population. Such “humanlike but not human” stimuli open up possibilities to extend the study of the perception – by humans as well as other primates – of facial features to morphologies outside of the range of human variation.

Fig. 1 — Examples of facial images included in the ApeFD.

Fig. 2 — Examples of five ocular morphologies featured in the ApeFD.

The intended primary area of application of this dataset is the study of human ocular morphology – that is, studying the range of forms and colorations of the eyes that are perceived as humanlike, as distinct from other great ape species. Features such as very dark sclerae or bright-yellow irises, which are typical of chimpanzee eyes, look highly unnatural in humans; because of that, human faces edited to have dark sclerae or yellow irises cannot be easily used as stimuli, since this instantly attracts the participants’ attention to the manipulation. However, our research² suggests hominid-like stimuli pass as natural when presented with ape-like or human-like coloration. By manipulating scleral and iridal coloration beyond what is present in modern-day humans, we can check how these different morphologies affect a range of outcome variables, measured both through ratings (e.g., perceived trustworthiness), psychophysiology (e.g., heart rate, breathing rate, arousal-mediated skin conductance), or behavior (e.g., reaction times, forced choice). This, in turn, informs key hypotheses on the evolution of the peculiar human ocular coloration, and human cognitive evolution more broadly: in particular, the extremely influential but no longer empirically supported “gaze camouflage” hypothesis³ and “cooperative eye hypothesis”⁴.

We emphasise at this point that the utility of the Apemen Faces Database¹ reaches substantially beyond this principal area of interest. Firstly, it enables the study of other facial features in relation to a range of perceptual and psychological phenomena. We note the particular relevance of our research to evolutionary behavioral sciences, where the perception and evaluation of the face is one of the most – and possibly the most – extensively studied topics, and databases such as Chicago FD⁵, Face Research Lab London Set⁶, or Bogazici FD⁷ are a staple resource in a large number of experimental studies. The ApeFD presented here complements these existing databases by making it possible to investigate features, or feature configurations, that fall outside extant variability in our species while still being perceived as humanlike. This aligns with the interest of evolutionary behavioral sciences in identifying human evolved biases in perception, here specifically related to the perception of faces. For example, the database can be used to research features or feature complexes that modern humans associate with masculinity and femininity beyond stereotypes based on standard human facial morphology⁸. Likewise, it can be applied to determine the facial characteristics that predict perception of threat, dominance, and fighting ability⁹.

The need for such a facial database is especially evident considering a number of recent publications. For example, Wacewicz et al.² and Wolf et al.¹⁰ both relied on artificial human-like stimuli to circumvent limitations such as violation of expectations. While these stimuli could serve the specific purposes of the studies for which they were designed, their general use is limited. In Wacewicz et al.², the stimuli were obtained by morphing together photographs of reconstructions of hominids, which are typically copyrighted. Similarly, Perea-García et al.¹¹ employed photographs of diverse primate species to alter their appearance beyond the naturally occurring, as primates are not a familiar percept to most human participants. In Wolf et al.¹⁰, the stimuli depicted highly stylized “aliens” that were specifically designed to test children and are unlikely to appear human or believable enough for adult participants. In addition to these factors, the supplementary material, i.e., the norming data and geometric-morphometric measurements, makes the ApeFD a resource for out-of-the-box analyses with a standardized set that will enable repeatable studies.

Finally, the broader applicability of the Apemen Faces Database¹ extends beyond evolutionary behavioral sciences. In primatology, this stimulus can be feasibly used in studies with non-human primates in zoos^12,13. In ethnology and cultural anthropology, the database presented here has applications in studying the natural human tendency to alter facial appearance, as it does in the area of cosmetics and plastic surgery, and more broadly, in theatre, video game industry, and film studies^14–17. In cross-disciplinary research, the database will find applications in researching perceptual phenomena such as gestaltive face perception and the “uncanny valley”^18,19.

In sum, the Apemen Faces Database¹ constitutes a high-quality, versatile stimulus set, openly available to the academic community. Although developed primarily as a means of studying aspects of ocular appearance, it can be productively applied to other features or contexts in evolutionary psychology and, even more broadly, in human sciences.

Methods

Production of the images

The facial images used in this study were generated by an experienced graphic artist through a combination of manual editing with software tools and automatic image generation with AI-based tools.

AI-based image generation

Midjourney was employed to create novel, original hominin-like faces, which were then further processed and refined using GetImg.ai. Paid subscription plans were used to make the images eligible for open sharing under the CreativeML Open RAIL-M licence. The final images were produced using the RealVisXL V4.0 model (https://huggingface.co/SG161222/RealVisXL_V4.0) with 45 sampling steps and the Euler sampler, through standard text prompting with iteration, and with iterated admixture of previously generated images as “image reference”. The construction of the prompts revolved around several central elements, in particular the individual (“ancient hominin”, “apeman”, “Australopithecus”), parameters of the photograph (“passport shot”, “biometric passport photograph”, “en-face”, “looking straight”) and quality and realism (“ultra-realistic”, “individual imperfections”). Negative prompts emphasised avoiding artificial and rubbery look, gloss and shine, and artistic effects.

Prompt examples

Prompt: Create a passport-style, full-head portrait of a man, woman, or adolescent resembling an ancient hominin (Australopithecus, Homo habilis, caveman) with smooth matte skin featuring a few shallow wrinkles. The subject should have an intense gaze directed straight at the camera, illuminated by a single symmetrical studio key light to emphasize the raw and primitive essence of prehistoric features. The background should be light grey.

Negative prompt: Avoid rim light, catch light, dark backgrounds, blurriness, plastic or shiny textures, and silicone-like appearances.
Prompt: Create an ultra-realistic biometric passport photograph of a lifelike Australopithecus, against a white background. Capture this ancient hominin with meticulous accuracy, emphasizing finely textured skin with visible pores; nuanced, unevenly placed wrinkles; and random patches of long, dirty, dishevelled hair. Incorporate photorealistic detailing featuring asymmetries to enhance authenticity. Apply individual imperfections, such as blemishes and stains.

Negative prompt: artistic, enhanced, stylized, wax, reconstruction, shiny skin, glossy, special effects, rubber, doll, replica.
Prompt: Design a realistic biometric passport shot of an ape girl, a young female character from the “Planet of the Apes”. Juvenile, feminine face. White background, whole face visible, zoom out. Careful attention to every intricate detail: finely textured skin with noticeable pores, expressive eyes, and subtle, irregular and asymmetric wrinkles. Maintain a level of photographic realism that rivals professional, high fidelity photography of human faces.

Negative prompt: truncated, make-up, lipstick, artistic, enhanced, stylized, wax, reconstruction, shiny skin, glossy, special effects, rubber, doll, replica, reflexes, highlights, filters.

Image selection

A total of about 7000 images were generated, with a majority rejected as clearly being of insufficient quality, insufficiently realistic, overly apelike, overly humanlike, incorrectly zoomed, or incorrectly positioned. A total of 1245 images were retained to form the initial dataset.

From the initial set of 1245 images, a final set of 31 images, consisting of 15 female and 15 + 1 male representations, was selected through an internal laboratory voting process conducted by four researchers with expertise in biological anthropology, facial morphology, and image analysis (SW, VF, JOPG, DPD). The selected images then underwent a structured post-processing workflow to ensure consistency in visual presentation.

Standardized post-processing workflow

All images underwent a multi-stage enhancement and editing process by a highly experienced graphic artist (ZL; cf. Fig. 4). Topaz Photo AI v3.5 (topazlabs.com/topaz-photo-ai) was used to enhance facial detail, upscale and sharpen the images, and improve skin texture. Luminar Neo 1.23 (skylum.com/luminar-neo) provided additional skin texturing and mattification when required. Adobe Lightroom Classic v14.1 was employed for global and local adjustments, including symmetrical lighting corrections, softening of deep wrinkles, modifications to contrast, shadow, and black levels, as well as color cast corrections and hair fringing adjustments. Further advanced modifications were made in Adobe Photoshop v26.2, including removal of rim lights and stray light spots, elimination of stray hairs, corrections to hair asymmetry, and adjustments to eye dimensions to enhance a non-human appearance. Background alterations and replacements were also performed in Photoshop, and layered PSD files were created to allow dataset users to flexibly modify the appearance of the iris and sclera (see “Usage notes”).

Final images were saved in PNG format with an sRGB color profile at 300 dpi and a resolution of 1500 × 1500 pixels. Lighting and shadows were standardized across all images, and any remaining color casts or white balance inconsistencies were manually corrected. Skin textures were further mattified to remove excessive shine. Each image was upscaled using Topaz Photo AI v3.5, sharpened, and resized to its original dimensions to ensure consistency in resolution and detail.

To maintain proportionality, all faces and shoulders were aligned according to a standardized passport outline (see e.g., gov.uk/photos-for-passports/photo-requirements for details), as shown in Fig. 3. A version of the stimulus with the shoulders removed was created, but was rejected as the faces with the shoulders removed were judged as less viable (less natural and less comparable to standard face databases).

Fig. 3 — Aligning the position of the faces to a standardised passport outline.

Adjustments were made to the iris brightness to reduce inconsistencies between both eyes in an individual. Lips were retouched to ensure a neutral, closed-mouth expression, and coloration adjustments were applied where needed. On most faces, eye fissure shape was modified to achieve width-to-height proportions intermediate between those typical of humans (relatively wider) and non-human apes (relatively higher). Each face was isolated from its original background, and four different background versions were generated: pure white, mid-grey, black, and black with refined hair edges to eliminate white or grey fringing.

Further refinement was achieved through the addition of adjustment layers for iris and sclera modifications. A “diffuse” sclera version was created with corresponding layers and masks to facilitate targeted modifications – i.e., a patchily pigmented scleral area found in many bonobos and occasionally in other great apes. The final set of images underwent refinement to ensure consistency while preserving realism and precision in facial representation. While complete repeatability of all studio conditions remains unattainable with current AI-based tools, and slight variations may still be present across the final images, the whole dataset preserves a relatively uniform visual style.

Geometric-morphometrics

Human facial shape is a key to individual identity²⁰, but its variance, too, predicts first impressions systematically: studies point to a preference for more “average” and “sex-typical” facial configurations²¹. The dataset is thus provided together with a list of landmark-based facial shape measures, to allow the researchers to account for facial variance other than in the eye area. Each face was labeled with a predefined list of 36 landmarks (denoting anatomically or geometrically identical points across the specimens) and 36 semilandmarks (denoting curves between the landmarks) using an automated tool for placing landmarks on facial portraits, faceDig (www.facedig.org)²². Following correction of the automatically placed landmarks, the configurations were subjected to generalised Procrustes analysis via the ‘gpagen’ function in the R package geomorph^23,24. Subsequently, we calculated the following measures:

Distinctiveness. It measures how distant a specimen is from the corresponding mean. The lower the number, the more average the apemen are among other apemen. It was calculated separately for male and female stimuli²⁵.
Sexual Shape Dimorphism (SShD) measures the expression of morphological differences between male and female specimens²⁶. During calculation, each face is projected on an axis that connects the male and female averages and is assigned a score of SShD, corresponding to its position on the axis. During the calculation, one sex is assigned −1 and the other 1 (default indexes in a linear model).
SexTypicality Score (SST). For better interpretability, we provide SST, a scaled SShD (zero mean, variance unity), multiplied by −1 in females.
Facial asymmetry. The landmarks from the left and right parts of the specimen were mirrored along the vertical midline axis, and the paired landmarks from the left and right sides were relabelled. Subsequently, we computed Procrustes distances between the mirrored and the original configurations. Coefficients with higher values indicate less symmetrical faces, i.e., higher facial asymmetry²⁷.

Furthermore, to provide a comparison of the apemen database and contemporary human population, we used the files of facial landmarks from several scientific databases of contemporary human frontal facial photographs^5,7,26. Figure 5 below compares extant human populations and our database.

Fig. 5 — Comparison of morphometric variables in apemen (leftmost column in every plot) and selected other cultures, representatives of the contemporary human population. Symmetrised landmarks for plots in the first to third row were taken from²⁶, unsymmetrised coordinates for the last row plots come from^5,7.

Norming data

We used the Labvanced online application to collect perceptual ratings for each of our 31 facial images on six constructs: Threat, Sociability, Trustworthiness, Health, Age, and Masculinity. Participants were crowdsourced via the online platform Prolific and were remunerated for completing the task (1.8 GBP). Participants were required to have normal or corrected to normal vision and fluency in English, and could use desktop or laptop computers, but not tablets or smartphones. The task was described as studying “how we perceive the faces of prehistoric ancestors of modern humans (so-called hominins, i.e., “apemen” or “cavemen”)”. Informed consent was obtained from all participants. Ethical approval was obtained from the Research Ethics Committee of the Faculty of Philosophy and Social Sciences, Nicolaus Copernicus in Toruń, decision 40/2024.

Mindful of well-described issues with crowdsourcing participants through online platforms like Prolific, we employed a number of mechanisms to ensure the quality of our ratings:

eligibility criteria. We used custom screeners that required potential participants to have completed at least two previous Prolific studies, have a 100% approval rate on previous studies, and have filled in their Prolific profiles with information on a number of standard demographic variables. As a result, the pool of available participants accounted for ca. 29% of all active Prolific users (69,672 out of 242,201);
instructions and information on attention checks. In the study description on Prolific, potential participants were informed that the study contains attention checks, and they were instructed to complete a screen calibration procedure prior to enrollment. The instructions in the study proper began with a screen calibration test and again emphasised the presence of attention checks in the design;
sample size and variety. After piloting the study with 8 participants, we collected a further 160 complete contributions (a further 23 participants “returned”, i.e., withdrew from the study; 4 timed out). Aiming at geographical diversity, we invited participants in 4 batches, at 00:00 CET, 07:00 CET, 12:30 CET, and 18:00 CET. As some participants opened the survey repeatedly, this left us with 184 unique participant IDs;
excluding incomplete submissions. We excluded all the duplicated participants and participants who did not finish the survey completely (33) or rated all the stimuli with the same number (1). This left us with 150 participants.

Data is provided as “Complete_Individual_Ratings_Before_Excluding_Incongruent_Raters.csv”.

Of these, 79 identify as men, 68 as women, 2 are not binary, and 1 decided not to say.

Participants were born in 30 different countries, resided in 22, and possessed nationalities of 27 countries distributed all across the world. Most participants were born/resided in these nine countries: South Africa (29/34), the United States (33/36), the United Kingdom (25/25), Poland (9/9), Portugal (8/8), Canada (2/7), Mexico (3/4), Greece (4/4), and India (4/3).

The participants were, on average, 32.5 years old (SD = 12.2, range 18–73). The vast majority of participants reported English as their first language (103), followed by Portuguese (9), Spanish (8), Polish (8), and Greek (3).
screening for rating consistency within participants (test-retest). Participants rated all 31 faces twice: first in round one and then again in round two of the study, so as to monitor their rating consistency. We rejected 15 of the 150 complete contributions with relatively the highest differences between the two rating runs (see “Technical Validation” below).

Data is provided as “DataClean_Individual_Ratings.csv”. Per-face norms based on these 135 participants are provided as “Norms_average_per_face.csv”.

Rating procedure

Participants rated the faces on six dimensions—(perceived) threat, sociability, trustworthiness, health, age, and masculinity—by moving sliders on a horizontal line between the left (0) and right (100) extremes for each trait. No time limit was set for completing the ratings. The extremes of the scale had verbal labels (e.g., “very young” and “very old” for Age), but the underlying numerical values were not made visible to the participants (cf. Fig. 6). This assessment method drew on the adaptation of the visual analogue scale. It provided a large spectrum of possible responses, which allowed the detection of minute changes in the ratings while assessing multiple attributes and minimized the clustering of ratings around one value, as could be the case with categorical scales.

Fig. 6 — Layout of a rating trial, print-screen from the original survey. The blue horizontal line below the “Next” button represents the progress bar.

Participants were also asked to provide free-text answers to three (one after round 1, two after round 2) general questions about the study. Overall, these free-text comments revealed that participants found the study to be interesting and straightforward. Participants reported that their decisions were influenced primarily by facial expressions, eye characteristics, and minor changes in features like skin tone or hairstyle; two participants mentioned a non-facial feature (the shoulders). Some participants reported second-guessing their assessments due to the range of features presented, while others found certain traits, such as sex, particularly difficult to determine. The subjective nature of personality judgments was highlighted, with some mentioning the challenge of maintaining consistency.

Interrater agreement

We calculated a measure of inter-rater agreement for each scale in Task 1, Task 2, and the combined sample. For the agreement in Tasks 1 and 2 separately, we used Interclass Correlation (ICC;2,k), i.e., two-way random, average score ICC, which measures inter-rater consistency²⁸ and is recommended because raters are usually considered representative of a broader population of potential raters. All of our participants saw all of the stimuli. In both tasks, the ICC is higher than 0.96 for every scale.

Data Records

The dataset has been published in a general-purpose repository for open research data, RepOD¹ (https://repod.icm.edu.pl/), based on Dataverse, under the permanent identifier: 10.18150/L2RHIA. It consists of the following files:

IMAGES

31 .zip files, each comprising 23 files relevant to a given face:
- 1 .jpg file with a basic version of a face
- 1 .png file with a basic version of a face
- 20 .png files with different versions of a face: 5 eye colorations * 4 backgrounds
- 1 .psd file

GEOMETRIC MORPHOMETRICS

2 .tps files (geometric morphometrics measures for male and female faces: ApeFD_Males.tps, ApeFD_Females.tps)
geometric morphometrics values (ApePPL_GMM_Scales.csv)

NORMING DATA

Norming data (Norms_Average_Per_Face.tab)
Raw norming data (RawData_Labvanced.csv) and raw demographic data of the participants (DemographicData_Prolific.tab)
Processed norming data (DataClean_Individual_Ratings.csv and Complete_Individual_Ratings_Before_Excluding_Incongruent_Raters.csv)

CODE

R script for merging demographic and norming data (ApeFD_Script.R)

Technical Validation

Norming data: initially, 150 participants fulfilled the basic criteria (opened the survey just once, rated all the faces twice, submitted the survey correctly following completion, and did not use the same number for all the targets). We computed the average difference between ratings in the first and second run, and marked 10% of participants with the highest average difference. The marked participants (10% with the highest average difference) were subsequently excluded from the dataset. This left us with 135 participants.

Usage Notes

The database includes 31 layered PSD (Photoshop Document) files that allow users to change the color of the sclera and the iris of the faces, thus tailoring the stimuli to the specific requirements of their studies. Each PSD file includes multiple editable layers, organized and named according to their function. Users can control layer visibility using the eye icon next to each layer in the Layers panel.

Background selection

Each PSD file contains four interchangeable background layers, of which only one should be enabled at a time:

White: A pure white background (RGB: 255, 255, 255).
Grey: A neutral grey background (RGB: 127, 127, 127).
Black 1: A black background (RGB: 0, 0, 0) with visible hair details, which may include some edge fringing.
Black 2: A black background (RGB: 0, 0, 0) using a soft mask to reduce fringing around hair edges.

Users are encouraged to select the background most appropriate for their experimental design and ensure consistency across stimuli within a given condition.

Iris adjustment layers

Several layers are provided to allow for systematic manipulation of iris appearance:

Dark iris multiply: Applies a darkening effect to the iris using the Multiply blending mode. The effect can be adjusted using the layer’s opacity setting (0–100%). To refine the area of application, edit the associated layer mask using a black or white brush.
Bright iris screen: Brightens the iris using the Screen blending mode, similarly controlled via opacity settings. As with other layers, the mask can be manually adjusted for precision.
Hue/saturation orange iris: An adjustment layer modifying the color properties of the iris. Users may adjust the Hue, Saturation, and Lightness sliders to achieve the desired color characteristics. This layer is designed to be used in conjunction with either the Bright or Dark iris layers. The layer’s overall effect can also be modulated using opacity and mask refinements.

Sclera adjustment layers

The sclera can also be manipulated through several dedicated layers:

Sclera multiply: Darkens the sclera using the Multiply blending mode. The effect can be customized via opacity and mask editing.
Diffused sclera: Includes two separate layers corresponding to the left and right eyes. These layers use soft masks to allow for diffusion of the scleral area. Their shape and position can be manually adjusted or transformed. Additional refinement can be achieved through mask painting.
Sclera subtract: Applies a pronounced darkening effect to the sclera using the Subtract blending mode. Due to the intensity of this mode, it is recommended to reduce opacity to achieve more natural results. Edges can be softened by editing the mask range.
Hue/saturation sclera: An adjustment layer for modulating scleral color. This can be used with the above scleral layers to fine-tune Hue, Saturation, and Lightness. For optimal results, this layer should be blended with other scleral layers using varying opacity levels.

General recommendations

Researchers are encouraged to experiment with different combinations of blending modes, opacity levels, and masks to achieve the desired visual outcome. In many cases, adjusting individual layer masks using a soft brush with customized opacity and flow settings may yield a more realistic appearance, especially for subtle modifications of the eye region. This level of control is intended to accommodate diverse research contexts, such as manipulating facial expressions, gaze perception, or morphological realism. For best practices, users should document any modifications performed on the base stimuli and ensure consistency across all manipulated images within a given study.

Acknowledgements

This research was supported by the National Science Centre of Poland, grant 2023/49/B/HS6/02343.

Author contributions

S.W., J.O.P.G., D.P.D., Z.L., V.F. conceptualised the database, oversaw image editing, and wrote the manuscript. S.W. collected the norming data. Z.L., S.W. created the initial images. Z.L. carried out all steps of image editing and prepared the final .psd, .png, and .jpg files. M.S. prepared the dataset for curation. V.F. provided the geometric morphometrics description of the dataset, as well as handled the norming data. A.S. provided feedback on image editing, analyzed the norming data, and contributed to writing the manuscript.

Code availability

All data and code for this study are available in RepOD¹, under the link repod.icm.edu.pl/dataset.xhtml?persistentId = 10.18150/L2RHIA.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Wacewicz, S. et al. ApeDF: Apemen Faces Database. RepOD10.18150/L2RHIA (2025). [Google Scholar]
2.Wacewicz, S., Perea-García, J. O., Lewandowski, Z. & Danel, D. P. The adaptive significance of human scleral brightness: An experimental study. Sci. Rep.12, 20261, 10.1038/s41598-022-24403-2 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Kobayashi, H. & Kohshima, S. Unique morphology of the human eye. Nature387, 767–768 (1997). [DOI] [PubMed] [Google Scholar]
4.Tomasello, M., Hare, B., Lehmann, H. & Call, J. Reliance on head versus eyes in the gaze following of great apes and human infants: The cooperative eye hypothesis. J. Hum. Evol.52, 314–320 (2007). [DOI] [PubMed] [Google Scholar]
5.Lakshmi, A., Wittenbrink, B., Correll, J. & Ma, D. S. The India Face Set: International and cultural boundaries impact face impressions and perceptions of category membership. Front. Psychol.12, 627678, 10.3389/fpsyg.2021.627678 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
6.DeBruine, L. & Jones, B. Face research lab London set. figshare10.6084/M9.FIGSHARE.5047666.V2 (2017).
7.Saribay, S. A. et al. The Bogazici face database: Standardized photographs of Turkish faces with supporting materials. PLoS ONE13, e0192018, 10.1371/journal.pone.0192018 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Boothroyd, L. G., Jones, B. C., Burt, D. M. & Perrett, D. I. Partner characteristics associated with masculinity, health and maturity in male faces. Pers. Individ. Dif.43, 1161–1173, 10.1016/j.paid.2007.03.008 (2007). [Google Scholar]
9.Třebický, V., Stirrat, M. & Havlíček, J. Fighting assessment. in Encyclopedia of evolutionary psychological science (eds Shackelford, T. K. & Weekes-Shackelford, V. A.) 1–11. 10.1007/978-3-319-16999-6_2738-1 (Springer International Publishing, 2019).
10.Wolf, W., Thielhelm, J. & Tomasello, M. Five-year-old children show cooperative preferences for faces with white sclera. J. Exp. Child Psychol.225, 105532, 10.1016/j.jecp.2022.105532 (2023). [DOI] [PubMed] [Google Scholar]
11.Perea-García, J. O., Berris, D., Tan, J. & Kret, M. E. Pupil size and iris brightness interact to affect prosocial behaviour and affective responses. Cogn. Emot. 1–16 10.1080/02699931.2024.2427340 (2024). [DOI] [PubMed]
12.Kano, F., Kawaguchi, Y. & Hanling, Y. Experimental evidence that uniformly white sclera enhances the visibility of eye-gaze direction in humans and chimpanzees. eLife11, e74086, 10.7554/eLife.74086 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Perea-García, J. O., Szala, A., Wacewicz, S., Matzinger, T. & Szczepańska, A. Exploring reactions to eye contact with different scleral pigmentation in pygmy marmosets (C. pygmaea) in a free-viewing paradigm. OSF.10.17605/OSF.IO/E37BQ (2022). [Google Scholar]
14.Ferstl, Y., Kokkinara, E. & McDonnell, R. Facial features of non-player creatures can influence moral decisions in video games. ACM Trans. Appl. Percept.15, 1–12, 10.1145/3129561 (2018). [Google Scholar]
15.Hsu, S. H., Kao, C.-H. & Wu, M.-C. Design facial appearance for roles in video games. Expert Syst. Appl.36, 4929–4934, 10.1016/j.eswa.2008.05.049 (2009). [Google Scholar]
16.Ramirez Gomez, A. & Lankes, M. Eyesthetics: Making sense of the aesthetics of playing with gaze. Proc. ACM Hum.-Comput. Interact.5, 1–24, 10.1145/3474686 (2021).36644216 [Google Scholar]
17.Ravaja, N., Bente, G., Katsyri, J., Salminen, M. & Takala, T. Virtual character facial expressions influence human brain and facial EMG activity in a decision-making game. IEEE Trans. Affect. Comput.9, 285–298, 10.1109/TAFFC.2016.2601101 (2018). [Google Scholar]
18.Geller, T. Overcoming the Uncanny Valley. IEEE Comput. Graph. Appl. 28 (2008). [DOI] [PubMed]
19.Nightingale, S. J. & Farid, H. AI-synthesized faces are indistinguishable from real faces and more trustworthy. Proc. Natl. Acad. Sci. U.S.A.119, e2120481119, 10.1073/pnas.2120481119 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Sheehan, M. J. & Nachman, M. W. Morphological and population genomic evidence that human faces have evolved to signal individual identity. Nat. Commun.5, 4800, 10.1038/ncomms5800 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Kleisner, K. et al. Distinctiveness and femininity, rather than symmetry and masculinity, affect facial attractiveness across the world. Evolution and Human Behavior45(1), 82–90 (2024). [Google Scholar]
22.Kleisner, K., Trnka, J. & Tureček, P. FACEDIG automated tool for placing landmarks on facial portraits for geometric morphometrics users. Sci. Rep. 15. 10.1038/s41598-025-09714-4 (2025). [DOI] [PMC free article] [PubMed]
23.Baken, E. K., Collyer, M. L., Kaliontzopoulou, A. & Adams, D. C. geomorph v4.0 and gmShiny: Enhanced analytics and a new graphical interface for a comprehensive morphometric experience. Methods Ecol. Evol.12, 2355–2363, 10.1111/2041-210X.13723 (2021). [Google Scholar]
24.Adams, D., Collyer, M., Kaliontzopoulou, A. & Baken, E. geomorph: Software for geometric morphometric analyses. R package version 4.0.10. https://cran.r-project.org/package=geomorph (2025).
25.Danel, D. P., Dziedzic-Danel, A. & Kleisner, K. Does age difference really matter? Facial markers of biological quality and age difference between husband and wife. HOMO67, 337–347, 10.1016/j.jchb.2016.05.002 (2016). [DOI] [PubMed] [Google Scholar]
26.Kleisner, K. et al. How and why patterns of sexual dimorphism in human faces vary across the world. Sci. Rep.11, 5978, 10.1038/s41598-021-85402-3 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Mardia, K. Statistical assessment of bilateral symmetry of shapes. Biometrika87, 285–300, 10.1093/biomet/87.2.285 (2000). [Google Scholar]
28.Shrout, P. E. & Fleiss, J. L. Intraclass correlations: uses in assessing rater reliability. Psychol. Bull.86, 420–428 (1979). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

DeBruine, L. & Jones, B. Face research lab London set. figshare10.6084/M9.FIGSHARE.5047666.V2 (2017).

Data Availability Statement

All data and code for this study are available in RepOD¹, under the link repod.icm.edu.pl/dataset.xhtml?persistentId = 10.18150/L2RHIA.

[CR1] 1.Wacewicz, S. et al. ApeDF: Apemen Faces Database. RepOD10.18150/L2RHIA (2025). [Google Scholar]

[CR2] 2.Wacewicz, S., Perea-García, J. O., Lewandowski, Z. & Danel, D. P. The adaptive significance of human scleral brightness: An experimental study. Sci. Rep.12, 20261, 10.1038/s41598-022-24403-2 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Kobayashi, H. & Kohshima, S. Unique morphology of the human eye. Nature387, 767–768 (1997). [DOI] [PubMed] [Google Scholar]

[CR4] 4.Tomasello, M., Hare, B., Lehmann, H. & Call, J. Reliance on head versus eyes in the gaze following of great apes and human infants: The cooperative eye hypothesis. J. Hum. Evol.52, 314–320 (2007). [DOI] [PubMed] [Google Scholar]

[CR5] 5.Lakshmi, A., Wittenbrink, B., Correll, J. & Ma, D. S. The India Face Set: International and cultural boundaries impact face impressions and perceptions of category membership. Front. Psychol.12, 627678, 10.3389/fpsyg.2021.627678 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.DeBruine, L. & Jones, B. Face research lab London set. figshare10.6084/M9.FIGSHARE.5047666.V2 (2017).

[CR7] 7.Saribay, S. A. et al. The Bogazici face database: Standardized photographs of Turkish faces with supporting materials. PLoS ONE13, e0192018, 10.1371/journal.pone.0192018 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Boothroyd, L. G., Jones, B. C., Burt, D. M. & Perrett, D. I. Partner characteristics associated with masculinity, health and maturity in male faces. Pers. Individ. Dif.43, 1161–1173, 10.1016/j.paid.2007.03.008 (2007). [Google Scholar]

[CR9] 9.Třebický, V., Stirrat, M. & Havlíček, J. Fighting assessment. in Encyclopedia of evolutionary psychological science (eds Shackelford, T. K. & Weekes-Shackelford, V. A.) 1–11. 10.1007/978-3-319-16999-6_2738-1 (Springer International Publishing, 2019).

[CR10] 10.Wolf, W., Thielhelm, J. & Tomasello, M. Five-year-old children show cooperative preferences for faces with white sclera. J. Exp. Child Psychol.225, 105532, 10.1016/j.jecp.2022.105532 (2023). [DOI] [PubMed] [Google Scholar]

[CR11] 11.Perea-García, J. O., Berris, D., Tan, J. & Kret, M. E. Pupil size and iris brightness interact to affect prosocial behaviour and affective responses. Cogn. Emot. 1–16 10.1080/02699931.2024.2427340 (2024). [DOI] [PubMed]

[CR12] 12.Kano, F., Kawaguchi, Y. & Hanling, Y. Experimental evidence that uniformly white sclera enhances the visibility of eye-gaze direction in humans and chimpanzees. eLife11, e74086, 10.7554/eLife.74086 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Perea-García, J. O., Szala, A., Wacewicz, S., Matzinger, T. & Szczepańska, A. Exploring reactions to eye contact with different scleral pigmentation in pygmy marmosets (C. pygmaea) in a free-viewing paradigm. OSF.10.17605/OSF.IO/E37BQ (2022). [Google Scholar]

[CR14] 14.Ferstl, Y., Kokkinara, E. & McDonnell, R. Facial features of non-player creatures can influence moral decisions in video games. ACM Trans. Appl. Percept.15, 1–12, 10.1145/3129561 (2018). [Google Scholar]

[CR15] 15.Hsu, S. H., Kao, C.-H. & Wu, M.-C. Design facial appearance for roles in video games. Expert Syst. Appl.36, 4929–4934, 10.1016/j.eswa.2008.05.049 (2009). [Google Scholar]

[CR16] 16.Ramirez Gomez, A. & Lankes, M. Eyesthetics: Making sense of the aesthetics of playing with gaze. Proc. ACM Hum.-Comput. Interact.5, 1–24, 10.1145/3474686 (2021).36644216 [Google Scholar]

[CR17] 17.Ravaja, N., Bente, G., Katsyri, J., Salminen, M. & Takala, T. Virtual character facial expressions influence human brain and facial EMG activity in a decision-making game. IEEE Trans. Affect. Comput.9, 285–298, 10.1109/TAFFC.2016.2601101 (2018). [Google Scholar]

[CR18] 18.Geller, T. Overcoming the Uncanny Valley. IEEE Comput. Graph. Appl. 28 (2008). [DOI] [PubMed]

[CR19] 19.Nightingale, S. J. & Farid, H. AI-synthesized faces are indistinguishable from real faces and more trustworthy. Proc. Natl. Acad. Sci. U.S.A.119, e2120481119, 10.1073/pnas.2120481119 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Sheehan, M. J. & Nachman, M. W. Morphological and population genomic evidence that human faces have evolved to signal individual identity. Nat. Commun.5, 4800, 10.1038/ncomms5800 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Kleisner, K. et al. Distinctiveness and femininity, rather than symmetry and masculinity, affect facial attractiveness across the world. Evolution and Human Behavior45(1), 82–90 (2024). [Google Scholar]

[CR22] 22.Kleisner, K., Trnka, J. & Tureček, P. FACEDIG automated tool for placing landmarks on facial portraits for geometric morphometrics users. Sci. Rep. 15. 10.1038/s41598-025-09714-4 (2025). [DOI] [PMC free article] [PubMed]

[CR23] 23.Baken, E. K., Collyer, M. L., Kaliontzopoulou, A. & Adams, D. C. geomorph v4.0 and gmShiny: Enhanced analytics and a new graphical interface for a comprehensive morphometric experience. Methods Ecol. Evol.12, 2355–2363, 10.1111/2041-210X.13723 (2021). [Google Scholar]

[CR24] 24.Adams, D., Collyer, M., Kaliontzopoulou, A. & Baken, E. geomorph: Software for geometric morphometric analyses. R package version 4.0.10. https://cran.r-project.org/package=geomorph (2025).

[CR25] 25.Danel, D. P., Dziedzic-Danel, A. & Kleisner, K. Does age difference really matter? Facial markers of biological quality and age difference between husband and wife. HOMO67, 337–347, 10.1016/j.jchb.2016.05.002 (2016). [DOI] [PubMed] [Google Scholar]

[CR26] 26.Kleisner, K. et al. How and why patterns of sexual dimorphism in human faces vary across the world. Sci. Rep.11, 5978, 10.1038/s41598-021-85402-3 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Mardia, K. Statistical assessment of bilateral symmetry of shapes. Biometrika87, 285–300, 10.1093/biomet/87.2.285 (2000). [Google Scholar]

[CR28] 28.Shrout, P. E. & Fleiss, J. L. Intraclass correlations: uses in assessing rater reliability. Psychol. Bull.86, 420–428 (1979). [DOI] [PubMed] [Google Scholar]

PERMALINK

The Apemen Faces Database (ApeFD)

Zdzisław Lewandowski

Slawomir Wacewicz

Juan Olvido Perea-García

Vojtěch Fiala

Marta Sibierska

Anna Szala

Dariusz P Danel

Abstract

Background & Summary

Fig. 1.

Fig. 2.

Methods

Production of the images

AI-based image generation

Prompt examples

Image selection

Standardized post-processing workflow

Fig. 4.

Fig. 3.

Geometric-morphometrics

Fig. 5.

Norming data

Rating procedure

Fig. 6.

Interrater agreement

Data Records

Technical Validation

Usage Notes

Background selection

Iris adjustment layers

Sclera adjustment layers

General recommendations

Acknowledgements

Author contributions

Code availability

Competing interests

Footnotes

References

Associated Data

Data Citations

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases