ABSTRACT
Background
Skin colour is a prominent human trait historically used to define ethnicity, yet its validity as a classification tool remains questionable.
Materials and Methods
We quantitatively analyse over 14 000 skin reflectance measurements from eight ethnically diverse groups in the International Skin Spectra Archive (ISSA), using a standard colour space designed to model human visual perception. We assess intragroup variation and intergroup overlap through two complementary approaches: individual‐level perceptual differences and group‐level shared gamut volumes.
Results
Results show that within‐group variability in chromaticity and lightness frequently exceeds between‐group differences. At the individual level, 89.4% (95% CI: 81.5%–91.9%) of samples have perceptually indistinguishable counterparts across ethnicities. At the group level, the median shared gamut overlap is 60.5% (95% CI: 54.5%–63.6%), indicating substantial overlap in skin colour distributions. The two methods correlate strongly (r = 0.83, p < 0.001), confirming robust intergroup overlap.
Conclusion
Skin colour exhibits high within‐group dispersion and extensive between‐group overlap. These findings challenge the use of skin colour as a reliable indicator of ethnicity and underscore the need for objective, data‐driven classification frameworks. They also highlight the complex, continuous nature of human skin variation, beyond simplistic ethnic categories.
Keywords: CIELAB colour space, ethnicity, ISSA, perceptual overlap, skin appearance, skin colour, skin colour variation, skin gamut
1. Introduction
Skin colour is one of the most noticeable traits in human appearance and has been used as a primary characteristic for categorising people into racial or ethnic groups [1, 2]. The associations between skin colour and ethnicity have deep historical roots, shaped by centuries of sociopolitical contexts and scientific classification [3, 4]. However, the validity of skin colour to define ethnicity has rarely been critically quantified, largely due to technological limitations and challenges associated with accurately measuring skin colour across diverse populations. Despite the broad recognition of human skin colour diversity, empirical evidence supporting strict links between skin tones and specific ethnicities remains limited.
The widespread acceptance of race classifications based predominantly on skin colour continues to influence daily life, clinical practice, scientific research and technological applications. In daily life, individuals frequently rely on visible skin colour as a primary marker to categorise the race of others and tend to use monoracial categories even in multiracial populations [5]. In clinical settings, treatment decisions and diagnostic reasoning are often implicitly shaped by racial categorisations [6, 7]. Similarly, research across various disciplines, including dermatology, imaging, biology and sociology, regularly associates ‘Caucasian’ with lighter skin pigmentation and ‘Asian’ with more yellowish tones, perpetuating simplistic stereotypes [8, 9]. In dermatology and cosmetic science in particular, such assumptions have historically informed product development, clinical assessment scales and visual diagnostic criteria, despite growing evidence of substantial skin colour variation within conventionally defined groups [2, 10, 11, 12]. This oversimplification extends beyond skin research, impacting broader fields where skin colour implicitly encodes racial identity [13].
Without rigorous empirical validation, this linkage risks obscuring genuine phenotypic diversity and inadvertently perpetuating biases in scientific inquiry across various domains [14]. Furthermore, the reliance on these racial skin stereotypes may embed biases into more recent technological applications such as facial recognition systems, skin‐type classification tools and consumer‐facing algorithms used in artificial intelligence–generated content, consumer electronics and cosmetic product matching [15, 16]. Such persistent oversimplifications raise an essential question: To what extent can ethnic groups genuinely be distinguished by skin colour alone? Answering this question critically informs efforts toward more inclusive and scientifically accurate representations of human diversity.
Efforts have been made to understand the genetic basis of skin colour and its evolutionary adaptation, yet, influenced by earlier classification biases, the complexity of its genetic architecture has been underestimated [17]. Recent genomic research, however, reveals that human skin colour results from a complex interplay of numerous genetic variants interacting with environmental and cultural factors [18]. Anthropological studies further demonstrate that genetic diversity in skin pigmentation is atypically distributed and unsuitable for strict racial categorisation [19]. The new era of genomic research highlights the much more complex nature of skin colour than previously understood and underscores the necessity of accurately assessing skin colour as both a biological and cultural phenotype across diverse populations [3, 18].
Skin colour measurement has a long history, with early approaches relying on subjective visual assessments such as the Fitzpatrick scale [20] and the Taylor Hyperpigmentation Scale [21]. While widely used in clinical and research settings, these visual classification methods are inherently limited by categorical definitions, observer bias and restricted sensitivity to continuous variation in skin colour. The development of non‐invasive measurement technologies, combined with systematic colorimetry approaches established by the Commission Internationale de l'Eclairage (CIE) [22], has enabled objective and precise evaluation of skin colour. Among various systems, the CIELAB colour space established by CIE, has become the most widely used for representing human skin colour [23]. It defines skin tone in three perceptually uniform dimensions: lightness (L*), redness (a*) and yellowness (b*), corresponding closely to biological features such as melanin concentration, erythema and tanning responses. To systematically study global variations, CIE established Technical Committee TC 1–92 in 2013, providing guidelines for skin colour measurement and initiating the International Skin Spectra Archive (ISSA) [24]. ISSA is a publicly accessible database comprising objectively measured skin colour and spectral data collected under standardised protocols. By adhering to uniform measurement standards and instrument calibration procedures, ISSA enables robust comparisons of skin colour data collected globally and addresses measurement variability across instruments [25, 26].
Utilising objective measurements data from ISSA, the present study challenges the traditional race‐based assumptions about skin colour through rigorous quantitative analysis. We examine intragroup variation and intergroup overlap in skin colour across eight self‐defined ethnic groups using the CIELAB uniform colour space. While differences in lightness (L*) and yellowness (b*) are observed between groups, within‐group variability is greater and shows substantial overlap across ethnicities in both chromatic and luminance dimensions. Applying two complementary analytical approaches, we find extensive intergroup overlap. Specifically, at the individual level, the median overlap percentage of samples below the perceptual colour difference threshold () is 89.4%, indicating that most individuals can have perceptually indistinguishable skin colours from individuals across different ethnic groups. At the group level, the median shared gamut volume is 60.5%, reflecting considerable intersection despite differences in overall gamut size. The strong correlation between individual‐ and group‐level results (r = 0.83, p < 0.001) reinforces the conclusion that skin colour is not a reliable definition of ethnicity. By providing robust empirical evidence, this study underscores the complexity of human skin colour and reveals the significant variability within—and overlap between—ethnic groups, challenging simplistic racial classifications based solely on skin tone.
2. Methods
2.1. ISSA
To evaluate the skin colour overlap across ethnic groups, a comprehensive database with extensive, high‐quality measurements, covering variations related to ethnicity, gender, age and body location, is essential. ISSA, established under the guidelines of the International Commission on Illumination (CIE), comprises 15 256 real skin spectra measurements from 2113 human subjects collected between 2012–2024. This database adheres to a uniform measurement protocol to ensure data consistency across different global sites and covers a wide variability in skin characteristics. A more detailed description of the measurement protocols and the database specifications is available in the data descriptor paper [24, 27]. This study includes data from eight ethnic groups, summarised in Table 1.
TABLE 1.
Summary of subjects, gender, the total spectra by ethnic group: CA (Caucasian), CN (Chinese), SA (South Asian—Pakistani), AF (African), IQ (Middle Eastern—Iraqi), TH (Southeast Asian—Thai), JP (Japanese), AB (Middle Eastern—Arabian).
| Ethnic group | No. of subjects | No. of females | No. of males | No. of spectra |
|---|---|---|---|---|
| CA | 352 | 274 | 78 | 2376 |
| CN | 370 | 177 | 193 | 2736 |
| JP | 118 | 67 | 51 | 944 |
| SA | 157 | 92 | 65 | 1289 |
| AF | 145 | 85 | 60 | 929 |
| IQ | 149 | 77 | 72 | 1183 |
| TH | 708 | 494 | 214 | 4112 |
| AB | 107 | 87 | 20 | 963 |
Ethical approval for ISSA was obtained from the University of Sheffield (Reference Number: 9) and the University of Liverpool (Reference Number: IPHS‐1314‐278) and ratified by the other host institutions. All participants provided written informed consent, which included permission to share anonymised data for research purposes.
2.2. CIELAB Uniform Colour Space
To compare skin colour appearance, measurements must be expressed in a device‐independent, perceptually uniform colour space. The CIELAB colour space [22], recommended by the CIE and widely used in skin research [23], offers a three‐dimensional representation aligning closely with human colour perception. CIELAB includes three orthogonal dimensions: L* (lightness), a* (red‐green dimension) and b* (yellow‐blue dimension). For skin colour, these parameters correspond approximately to perceived skin lightness, redness and yellowness, respectively. Chroma (C*)—representing perceived chroma—is calculated as the distance from the origin in the a*b* plane (Equation 1). The perceptual colour difference is computed using the Euclidean distance between two colours (, , ) and (, , ) in CIELAB space (Equation 2).
![]() |
(1) |
![]() |
(2) |
2.3. Measures of Intragroup Skin Colour Variation
Intragroup skin colour variation was assessed by calculating the mean and standard deviation for CIELAB attributes within each ethnic group. Additionally, the mean colour difference () between each sample and the respective group mean was computed to quantify perceptually meaningful intragroup variation.
2.4. Visual Threshold for Skin Colour Differences
Visual thresholds are critical in determining perceptible colour differences, extensively applied in various fields such as dental restoration [28]. Two thresholds are commonly defined: the perceptibility threshold (PT), representing the smallest colour difference detectable visually (just‐noticeable difference, JND) and the acceptability threshold (AT), the smallest colour difference deemed acceptable. Thresholds typically follow a ‘50:50%’ criterion, meaning 50% of observers can detect or accept the difference. Previous studies on maxillofacial skin replications report a PT and AT of approximately 1.1 (SD = 0.5)/3.0 (SD = 0.4) for lighter skin, and 1.6 (SD = 0.5)/4.4 (SD = 1.0) for darker skin, expressed in the unit of [29].
In the current study, the visual threshold serves as a criterion for differentiating individual skin samples. We adopt a suprathreshold based on the PT, set at 2. This threshold ensures that colour differences for different skin types are perceivable by a majority (>50%) of observers, thereby providing a physiologically valid basis for evaluating differences among skin samples and quantifying overlap.
2.5. Measures of Intergroup Skin Colour Overlap
2.5.1. By Individual Difference
At the individual level, ethnic group differentiation based on skin colour depends on perceptible differences between samples from different groups. Using the defined PT 2), we calculated the smallest perceptual colour difference between each sample from one ethnic group and all samples from another group. We then quantified the proportion of these minimal differences falling below the threshold, providing an objective measure of individual‐level skin colour overlap.
2.5.2. By Shared Skin Gamut
At the group level, we assessed skin colour overlap based on the three‐dimensional distribution of skin samples in CIELAB colour space. Because skin colour distribution is irregular, conventional convex‐hull methods [26, 30, 31] may inaccurately represent the true gamut due to potential concavities. Instead, we segmented each group's gamut into discrete 3 × 3 × 3 unit‐cubes. This cube size ensured that the average Euclidean distance between adjacent cubes exceeded the perceptual threshold, creating meaningful colour clusters.
We then refined the segmentation by applying two criteria:
Removing isolated cubes (surrounded by six empty adjacent cubes).
Filling gaps (empty cubes completely surrounded by six filled adjacent cubes).
The overlap percentage was calculated by determining the proportion of shared cubes between pair of ethnic groups relative to the total cubes of the reference group's gamut. This approach provided an objective measure of shared skin colour gamut volumes between groups.
2.5.3. Robust Overlap Estimates (Median and Confidence Interval)
Due to skewed distributions, we summarised overlap results using medians as robust measures of central tendency. Sampling variability was addressed by employing bootstrap resampling with 1000 iterations to estimate the 95% confidence interval of the median.
3. Results
3.1. Description of Data
This study analyses 14 532 human skin reflectance measurements sourced from the ISSA [24, 27]. Participants include individuals aged 18 to 75 years across eight self‐identified ethnic groups: Caucasian (CA), Chinese (CN), Japanese (JP), South Asian–Pakistani (SA), Middle Eastern–Iraqi (IQ), African (AF), Southeast Asian–Thai (TH) and Middle Eastern–Arabian (AB). Skin colour measurements were collected from twelve body positions: back of hand, cheek, cheekbone, chin, ear lobe, forehead, inner arm, neck, nose tip, palm, ring finger and both inner and outer forearm. Due to incomplete coverage across global sampling sites, measurements from the earlobe and ring finger were excluded from further analyses, leaving measurements from ten positions. Table 1 summarises the number of subjects, gender distribution and the total spectra for each ethnic group.
To facilitate a consistent comparison of colour appearance across diverse samples, skin colour was measured using CIE XYZ coordinates and then converted into the CIELAB colour space, which models human colour perception. The CIELAB parameters, L*, a* and b*, correspond well to the perceptual dimensions of lightness, redness and yellowness, respectively. Figure 1 visualises the skin colour distribution of 14 532 spectra in the CIELAB space: the chromatic a*b* plane (a) and the chroma‐lightness L*C* plane (b). Data point colours approximately represent the actual colour appearance of the corresponding skin samples.
FIGURE 1.

Skin colour distribution (N = 14,532) in CIELAB colour space: (a) chromatic a*b* plane and (b) chroma‐lightness L*C* plane.
3.2. Intra‐Ethnicity Skin Colour Variations
Intragroup skin colour variation is assessed using the mean and standard deviation of CIELAB attributes (Table 2). The CIELAB coordinates also allow for the computation of standardised colour difference metrics (), which are derived as the Euclidean distance in the three‐dimensional space defined by L*, a* and b*. The final column of Table 2 reports the mean values and their standard deviation, calculated as the colour difference between each sample and its group mean. This estimate provides a perceptually meaningful measure of intra‐ethnicity variation. Figure 2 shows the dispersed distribution of each group, with error bars representing two standard deviations (approximately a 95% confidence interval).
TABLE 2.
Mean (± standard deviation) of the CIELAB attributes (L*, a*, b*, C*) and the mean colour differences (± standard deviation) for eight ethnic groups.
| Ethnic group | Lightness L* | Redness a* | Yellowness b* | Chroma C* | Colour difference |
|---|---|---|---|---|---|
| CA | 61.2 (5.4) | 11.0 (3.9) | 14.7 (2.7) | 18.7 (3.2) | 6.3 (3.5) |
| CN | 59.8 (4.3) | 10.1 (2.9) | 16.7 (2.4) | 19.7 (2.6) | 5.2 (2.4) |
| JP | 63.5 (4.2) | 9.8 (2.7) | 17.0 (2.6) | 19.8 (2.5) | 5.1 (2.3) |
| SA | 52.7 (6.4) | 10.4 (2.5) | 17.9 (2.3) | 20.8 (2.4) | 6.4 (3.4) |
| AF | 39.6 (6.7) | 10.2 (2.0) | 14.4 (4.3) | 17.7 (4.4) | 6.9 (4.5) |
| IQ | 57.3 (5.5) | 10.2 (3.2) | 15.8 (2.4) | 19.0 (2.7) | 6.0 (3.2) |
| TH | 56.4 (5.6) | 10.0 (2.4) | 19.0 (2.6) | 21.6 (2.7) | 5.8 (3.1) |
| AB | 60.2 (6.5) | 10.9 (3.0) | 17.9 (2.5) | 21.1 (2.7) | 6.5 (3.9) |
FIGURE 2.

Skin colour distribution of eight ethnic groups in CIELAB space: (a) chromatic a*b* plane and (b) chroma‐lightness L*C* plane. Mean chromaticity values are represented by solid diamonds, with error bars denoting 2 standard deviations. Abbreviations: CA (Caucasian), CN (Chinese), JP (Japanese), SA (South Asian—Pakistani), AF (African), IQ (Middle Eastern—Iraqi), TH (Southeast Asian—Thai), AB (Middle Eastern—Arabian).
In the a*b* chromaticity plane, despite the scattered distribution shown in Figure 1a, the mean a* values are tightly clustered across all groups, ranging from 9.8 to 11.0, indicating relative constancy in the redness (a*) dimension. In contrast, mean b* values exhibit greater variability, ranging from 14.7 to 19.0. Although group means are close, the error bars reveal substantial intragroup variation. A considerable overlap in skin chromaticity, in both redness (a*) and yellowness (b*) dimensions, across all eight ethnicities is evident.
In the L*C* lightness‐chroma plane, the distribution of all human skin data typically forms a ‘banana‐shaped’ curve characteristic (Figure 1b). The mean chroma (C*) values vary modestly between groups, ranging from 17.7 to 21.6, primarily reflecting variations along the yellowness (b*) dimension. However, the lightness (L*) level shows a broader range, with values varying from 39.6 to 63.5 among the groups. Although the African group's mean lightness (L*) is notably lower than other groups, its upper range (two standard deviations) overlaps with the range of other populations. Its chroma range also broadly encompasses the ranges observed in all other groups.
Two‐way analyses of variance (ANOVA) was conducted to evaluate the effects of ethnicity as well as gender on skin lightness (L*), redness (a*) and yellowness (b*). The results revealed significant main effects of ethnicity (p <0.001) and gender (p <0.001) for all three appearance attributes (Appendix Table A1). Across all colour parameters, statistically significant effects were observed given the large sample size. However, effect size estimates indicated that interaction effects were consistently small, and for a*, both ethnicity and gender accounted for only a minimal proportion of the total variance.
Together, these results reveal the substantial individual‐level diversity in skin colour, and the intragroup variability in skin colour is often more pronounced than intergroup differences. In the next two sections, two approaches are applied to quantify these overlaps at both the individual level and group level.
3.3. Quantification of Inter‐Ethnicity Overlap by Individual Difference
To assess the extent of inter‐ethnicity overlap at an individual level, we examined whether skin samples from different ethnic groups can be perceptually indistinguishable based on their colour. For each skin sample within an ethnic group, we identified the most similar sample from another group using the minimal values. We then quantified the degree of overlap by calculating the proportion of these minimal differences that fall below the just noticeable difference (JND) threshold, set at = 2 in the CIELAB colour space.
For example, to determine the overlap between the Caucasian (CA) and Chinese (CN) groups, we calculated the smallest perceptual colour difference for each CA sample relative to all CN samples, and vice versa. Figure 3 illustrates the distribution of these minimal colour differences: the left panel represents CA samples compared to CN samples, while the right panel shows CN samples compared to CA samples. The results indicate 92.3% of the minimal differences for CA samples and 98.4% for CN samples fall below the PT. Thus, virtually all samples from these two groups have perceptually indistinguishable counterparts in the other group, indicating a substantial overlap.
FIGURE 3.

Distribution of minimal values: (a) CA samples compared to CN samples, and (b) CN samples compared to CA samples. The red line marks the perceptual threshold ( = 2).
To generalise this analysis across all ethnic groups, we constructed a heatmap (Figure 4) illustrating the degree of perceptual overlap between every pair of groups. Rows correspond to reference groups, while columns represent comparison groups. Each cell in the heatmap indicates the percentage of samples from the reference group (row) having colour differences below the PT () compared to samples in the other group (column). A higher percentage signifies greater perceptual overlap, visualised by a gradient from white (low overlap) to blue (high overlap).
FIGURE 4.

Heatmap illustrating the percentage of perceptual overlap between ethnic groups. Each cell represents the proportion of individual skin samples from the reference group (rows) whose minimal colour differences compared to samples from the comparison group (columns) fall below the perceptual threshold (). Colour intensity ranges from white (minimal overlap) to blue (high overlap). Abbreviations: CA (Caucasian), CN (Chinese), JP (Japanese), SA (South Asian—Pakistani), AF (African), IQ (Middle Eastern—Iraqi), TH (Southeast Asian—Thai), AB (Middle Eastern—Arabian).
Given the skewed distribution of overlap percentages, we used the median as a robust central measure. To quantify sampling variability and robustness, we applied bootstrap resampling with 1000 iterations to estimate the 95% confidence interval of the median. Across all ethnic comparisons, the median proportion of perceptually indistinguishable samples was 89.4%, with a 95% confidence interval ranging from 81.5% to 91.9%. These findings highlight extensive perceptual overlap between individuals of different ethnic groups.
3.4. Quantification of Inter‐Ethnicity Overlap by Shared Gamut
Colour gamut represents the range of colours encompassed within a given colour space. To quantify intergroup overlap at the group level, we compared the skin colour gamut of each ethnic group within the three‐dimensional CIELAB colour space and measured their shared volume. Due to the irregular distribution of skin colours (Figure 1), precisely defining gamut volume is challenging. To address this, we segmented the gamut of each group into discrete 3 × 3 × 3 unit‐cubes, ensuring that the average Euclidean distance between adjacent cubes exceeds the perceptual threshold. Figure 5 visualises each ethnic group's skin colour gamut segmented into these cubes, with the number above each subplot indicating the total volume (number of cubes). Notably, the Caucasian group has the largest gamut (222 cubes), while the African group has the smallest (97 cubes).
FIGURE 5.

Visualisation of skin colour gamut segmented into 3 × 3 × 3 unit cubes within CIELAB colour space. Numbers indicate total gamut volume per ethnic group. Abbreviations: CA (Caucasian), CN (Chinese), JP (Japanese), SA (South Asian—Pakistani), AF (African), IQ (Middle Eastern—Iraqi), TH (Southeast Asian—Thai), AB (Middle Eastern—Arabian).
We quantified the overlap by calculating the proportion of cubes shared between each pair of ethnic groups relative to their total gamut volume. Figure 6 presents a heatmap illustrating these percentages, where each cell shows the proportion of the reference group's gamut (rows) overlapping with the comparison group (columns). For example, 87.3% of the Chinese (CN) gamut overlaps with the Caucasian (CA) gamut, while only 62.2% of the CA gamut overlaps with the CN gamut, highlighting the larger gamut of the CA group. Across all ethnic group comparisons, the median overlap was 60.5%, with a bootstrapped 95% confidence interval ranging from 54.5% to 63.6%.
FIGURE 6.

Heatmap depicting overlap percentages between ethnic group gamut, with colours ranging from white (low overlap) to blue (high overlap). Abbreviations: CA (Caucasian), CN (Chinese), JP (Japanese), SA (South Asian—Pakistani), AF (African), IQ (Middle Eastern—Iraqi), TH (Southeast Asian—Thai), AB (Middle Eastern—Arabian).
3.5. Consistency Between the Two Approaches
To evaluate the consistency between the two methodologies used for quantifying intergroup skin colour overlap, we calculated Pearson's correlation coefficient between the overlap matrices presented in Figures 4 and 6. This analysis yielded a strong positive correlation (r = 0.83, p < 0.001), demonstrating a high level of agreement between the individual‐level (minimal colour difference) and group‐level (shared gamut) approaches. These results confirm the robustness and reliability of both methods in quantifying skin colour overlap across ethnic groups.
4. Discussion
This study critically examines the relationship between skin colour and ethnicity through the quantitative analysis of over 14 000 skin samples from eight ethnically diverse populations. By combining a reliable database of diverse human skin shades and two complementary analytical approaches, we evaluate both intragroup variation and intergroup overlap. Our findings challenge the traditional use of skin colour as a reliable marker for ethnic or racial categorisation, revealing that within‐group variability often exceeds between‐group differences and that substantial overlap exists across ethnic groups.
Historically, the concept of race implied phenotypic similarity within geographically defined populations. Yet, recent genetic research shows that skin colour is shaped by a complex interplay of genetic diversity, environmental adaptation and sociocultural influences [18]. Understanding the complex nature requires accurate measurement data across global populations. Our study presents data from the ISSA, currently the most comprehensive public database of objectively measured human skin colour. Collected under strict uniform protocols, ISSA provides ecologically valid benchmarks for a wide range of research and applications—from genetics and dermatology to cosmetics and AI. Although not exhaustive of all global ethnicities, Figure 1 demonstrates the observed distribution and range of real human skin colours in CIELAB space, offering a valuable empirical reference. Current classification tools, such as the Fitzpatrick scale [20] and the Monk Skin Tone scale [32], primarily reflect differences in skin lightness and are inadequate for capturing the multidimensional variation in human skin colour across all three CIELAB dimensions (lightness L*, redness a* and yellowness b*).
Consistent with anthropological research suggesting that skin pigmentation is an unreliable marker for other biological traits [19], our analysis reveals substantial variation in skin colour appearance within ethnicities, making it an unreliable indicator of ethnicity. Our analysis also shows that differences between ethnic groups are primarily in lightness (L*), rather than chroma (C*), with yellowness (b*) contributing most to chromatic variation. This aligns with previous research on four ethnicities [26] suggesting that melanin predominantly influences skin lightness and yellowness [33], while factors like haemoglobin and diet have lesser impacts on skin colour modulation [34].
To quantify intergroup overlap, we employed two complementary methods that converged on similar conclusions, with a high correlation between them (r = 0.83, p < 0.001). The first method assessed individual‐level overlap by calculating the minimum perceptual colour difference between samples from different ethnic groups. Nearly 90% of these comparisons fell below the JND threshold (), meaning most individuals had skin colours indistinguishable from someone in another group. The highest overlaps were observed between Japanese and Caucasian individuals (99.0%), while the lowest occurred between African individuals and other groups—particularly those with lighter skin tones—highlighting the broader phenotypic diversity in darker skin types (Figure 4).
The second method evaluated group‐level overlap through a voxel‐based segmentation of the 3D CIELAB colour space. Unlike previous convex hull approaches [26, 30, 31]—which tend to overestimate gamut volume due to their sensitivity to outliers and inability to represent concave regions (e.g. Figure 1b the ‘banana’ shape in the L*C* plane)—our method counts only populated unit cubes, correcting for isolated outliers and unnatural gaps based on the assumption of skin colour continuity. Although the choice of unit size may affect resolution, this approach offers a more accurate estimation of actual skin colour distributions. Our analysis revealed that Caucasian individuals exhibit the largest skin colour gamut, while African and Japanese groups show smaller ranges—each less than half the size of the Caucasian gamut (Figure 5). This disparity also explains asymmetric overlaps: for instance, 90.0% of the Japanese gamut overlaps with the Caucasian gamut, while only 40.5% of the Caucasian gamut is shared with the Japanese group. Despite these differences, the median overlap across all pairwise group comparisons remained substantial at 60.5%, indicating considerable commonality in skin colour traits across populations.
While prior studies have examined skin colour variation by ethnicity, age or body region [26, 35, 36], the extent of overlap between groups has not been systematically quantified. Our findings contribute to a more nuanced understanding of phenotypic diversity and offer broad relevance across a range of fields—including genetics, medicine [37, 38], digital imaging [39, 40, 41, 42], prostheses [43, 44], cosmetics [45] and AI‐based technologies [15, 46]. Our data and analysis offer a vital reference for researchers and practitioners aiming to develop inclusive systems and products that accurately reflect human skin variation.
We advocate for skin classification systems to move away from race‐ or ethnicity‐based labels and instead focus on empirically grounded measures of skin characteristics. Objective data should be prioritised over subjective self‐reporting methods such as consumer questionnaires [4, 6, 47], especially in applications requiring precise colour matching or representation. Furthermore, research involving global populations must account for within‐group diversity. For instance, studies exploring cross‐cultural preferences for skin pigmentations need to ensure that visual stimuli—whether real facial images [48, 49] or colour‐manipulated images [50, 51]—should reflect both inter‐ and intragroup variation to avoid misrepresentation or reinforcement of stereotypes. Recognising the wide overlap in skin colour is also essential for reducing bias in both human perception and algorithmic decision‐making, particularly in technologies such as facial recognition and classification systems.
This study has several limitations. Self‐defined ethnicity was used as a grouping variable despite being a social rather than biological construct, which may mask heterogeneity within groups. The ISSA database provides standardized and objective measurements collected under controlled conditions, which may not fully reflect the variability encountered in clinical or real‐world settings. In addition, the study focuses on healthy skin samples and does not account for pathological features relevant to dermatological diagnosis. This represents an important direction for future research, as accurate skin tone assessment can influence the visual presentation and differential diagnosis of skin lesions, particularly in individuals with darker skin tones [52].
5. Conclusion
This study provides robust empirical evidence that skin colour, while visually salient, is a poor indicator of ethnicity. Through objective measurement and rigorous analysis, we demonstrate that intragroup variation is high and intergroup overlap is extensive. These findings call for a re‐evaluation of how skin colour is used in scientific, clinical and technological contexts, promoting more inclusive, accurate and bias‐aware approaches to understanding human diversity.
Conflicts of Interest
The authors declare no conflicts of interest.
1.
Table A1 Two‐Way ANOVA examining the effects of ethnicity and gender on skin lightness (L*), redness (a*), and yellowness (b*).
| Source | Df | F | p | Partial η2 | |
|---|---|---|---|---|---|
| L* | |||||
| Ethnicity | 7, 14 509 | 2333.431 | <0.001 | 0.540 | |
| Gender | 1, 14 509 | 2784.273 | <0.001 | 0.160 | |
| Ethnicity × Gender | 7, 14 509 | 11.305 | <0.001 | 0.005 | |
| a* | |||||
| Ethnicity | 7, 14 509 | 47.258 | <0.001 | 0.020 | |
| Gender | 1, 14 509 | 237.011 | <0.001 | 0.020 | |
| Ethnicity × Gender | 7, 14 509 | 15.490 | <0.001 | 0.007 | |
| b* | |||||
| Ethnicity | 7, 14 509 | 802.389 | <0.001 | 0.280 | |
| Gender | 1, 14 509 | 31.480 | <0.001 | 0.002 | |
| Ethnicity × Gender | 7, 14 509 | 71.469 | <0.001 | 0.030 |
Data Availability Statement
The data that support the findings of this study are openly available in The International Skin Spectra Archive (ISSA) at https://doi.org/10.6084/m9.figshare.28228571.v4.
References
- 1. Jablonski N. G., “The Evolution of Human Skin and Skin Color,” Annual Review of Anthropology 33 (2004): 585–623, 10.1146/annurev.anthro.33.070203.143955. [DOI] [Google Scholar]
- 2. Naik P. P. and Farrukh S. N., “Influence of Ethnicities and Skin Color Variations in Different Populations: A Review,” Skin Pharmacology and Physiology 35, no. 2 (2022): 65–76, 10.1159/000518826. [DOI] [PubMed] [Google Scholar]
- 3. Jablonski N. G., “Skin Color and Race,” American Journal of Physical Anthropology 175, no. 2 (2021): 437–447, 10.1002/ajpa.24200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Ware O. R., Dawson J. E., Shinohara M. M., and Taylor S. C., “Racial Limitations of Fitzpatrick Skin Type,” Cutis; Cutaneous Medicine for the Practitioner 105, no. 2 (2020): 77–80, http://www.ncbi.nlm.nih.gov/pubmed/32186531. [PubMed] [Google Scholar]
- 5. Feliciano C., “Shades of Race: How Phenotype and Observer Characteristics Shape Racial Classification,” American Behavioral Scientist 60, no. 4 (2016): 390–419, 10.1177/0002764215613401. [DOI] [Google Scholar]
- 6. Harvey V. M., Alexis A., Okeke C. A. V., et al., “Integrating Skin Color Assessments Into Clinical Practice and Research: A Review of Current Approaches,” Journal of the American Academy of Dermatology 91, no. 6 (2024): 1189–1198, Published online February 2024:1‐10, 10.1016/j.jaad.2024.01.067. [DOI] [PubMed] [Google Scholar]
- 7. Gao Y., Niddam J., Noel W., Hersant B., and Meningaud J. P., “Comparison of Aesthetic Facial Criteria Between Caucasian and East Asian Female Populations: An Esthetic Surgeon's Perspective,” Asian Journal of Surgery 41, no. 1 (2018): 4–11, 10.1016/j.asjsur.2016.07.007. [DOI] [PubMed] [Google Scholar]
- 8. Rawlings A. V., “Ethnic Skin Types: Are There Differences in Skin Structure and Function? 1,” International Journal of Cosmetic Science 28, no. 2 (2006): 79–93, 10.1111/j.1467-2494.2006.00302.x. [DOI] [PubMed] [Google Scholar]
- 9. Visscher M. O., “Skin Color and Pigmentation in Ethnic Skin,” Facial Plastic Surgery Clinics of North America 25, no. 1 (2017): 119–125, 10.1016/j.fsc.2016.08.011. [DOI] [PubMed] [Google Scholar]
- 10. Oliveira R., Ferreira J., Azevedo L. F., and Almeida I. F., “An Overview of Methods to Characterize Skin Type: Focus on Visual Rating Scales and Self‐Report Instruments,” Cosmetics 10, no. 1 (2023): 14, 10.3390/cosmetics10010014. [DOI] [Google Scholar]
- 11. Gupta V. and Sharma V. K., “Skin Typing: Fitzpatrick Grading and Others,” Clinics in Dermatology 37, no. 5 (2019): 430–436, 10.1016/j.clindermatol.2019.07.010. [DOI] [PubMed] [Google Scholar]
- 12. Yadav G., Rao J., Miller‐Monthrope Y., Abbarin N., Park‐Wyllie L., and Yeung J., “Terminology, Classification Systems, and Evaluation Tools to Describe Skin of Color in Psoriasis and Other Dermatological Conditions,”JAAD Reviews 2 (2024): 156–163, 10.1016/j.jdrv.2024.11.003. [DOI] [Google Scholar]
- 13. Dhawka P., AiHe H., and Willett W., “We Are the Data: Challenges and Opportunities for Creating Demographically Diverse Anthropographics,” in Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (ACM, 2023), 1–14, 10.1145/3544548.3581086. [DOI] [Google Scholar]
- 14. Bliss C., Race Decoded: The Genomic Fight for Social Justice (Stanford University Press, 2012). [Google Scholar]
- 15. Bacchini F. and Race L. L., “Race, Again: How Face Recognition Technology Reinforces Racial Discrimination,” Journal of Information, Communication and Ethics in Society 17, no. 3 (2019): 321–335, 10.1108/JICES-05-2018-0050. [DOI] [Google Scholar]
- 16. Stuart J., Stephen A., Aul K., et al., “Developing Augmented Reality Filters to Display Visual Cues on Diverse Skin Tones,” Front Virtual Real 5, no. July (2024): 1–15, 10.3389/frvir.2024.1363193. [DOI] [Google Scholar]
- 17. Deng L. and Xu S., “Adaptation of human Skin Color in Various Populations,” Hereditas 155, no. 1 (2018): 1, 10.1186/s41065-017-0036-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Quillen E. E., Norton H. L., Parra E. J., et al., “Shades of Complexity: New Perspectives on the Evolution and Genetic Architecture of human Skin,” American Journal of Physical Anthropology 168, no. S67 (2019): 4–26, 10.1002/ajpa.23737. [DOI] [PubMed] [Google Scholar]
- 19. Relethford J. H., “Apportionment of Global human Genetic Diversity Based on Craniometrics and Skin Color,” American Journal of Physical Anthropology 118, no. 4 (2002): 393–398, 10.1002/ajpa.10079. [DOI] [PubMed] [Google Scholar]
- 20. TB F., “Soleil Et Peau,” Journal de Médecine Esthétique 2 (1975): 24–33. [Google Scholar]
- 21. Taylor S. C., Arsonnaud S., and Czernielewski J., “Hyperpigmentation Scale Study Group. The Taylor Hyperpigmentation Scale: A New Visual Assessment Tool for the Evaluation of Skin Color and Pigmentation,” Cutis; Cutaneous Medicine for the Practitioner 76, no. 4 (2005): 270–274, http://www.ncbi.nlm.nih.gov/pubmed/16315565. [PubMed] [Google Scholar]
- 22. CIE . CIE 015:2018 Colorimetry, 4th Edition. Published online September 9, 2018. doi: 10.25039/TR.015.2018. [DOI]
- 23. Ly B. C. K., Dyer E. B., Feig J. L., Chien A. L., and Del Bino S., “Research Techniques Made Simple: Cutaneous Colorimetry: A Reliable Technique for Objective Skin Color Measurement,” Journal of Investigative Dermatology 140, no. 1 (2020): 3–12.e1, 10.1016/j.jid.2019.11.003. [DOI] [PubMed] [Google Scholar]
- 24. Lu Y., Xiao K., Pointer M., et al., “The International Skin Spectra Archive (ISSA): A Multicultural human Skin Phenotype and Colour Spectra Collection,” Scientific Data 12, no. 1 (2025): 487, 10.1038/s41597-025-04857-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Wang M., Xiao K., Luo M. R., Pointer M., Cheung V., and Wuerger S., “An Investigation Into the Variability of Skin Colour Measurements,” Color Research & Application 43, no. 4 (2018): 458–470, 10.1002/col.22230. [DOI] [Google Scholar]
- 26. Xiao K., Yates J. M., Zardawi F., et al., “Characterising the Variations in Ethnic Skin Colours: A New Calibrated Data Base for human Skin,” Skin Research and Technology 23, no. 1 (2017): 21–29, 10.1111/srt.12295. [DOI] [PubMed] [Google Scholar]
- 27. Lu Y., Xiao K., Pointer M., et al., “The International Skin Spectra Archive (ISSA): AMulticultural human Skin Phenotype and Colour Spectra Collection,” Published online 2025, 10.6084/m9.figshare.28228571.v4. [DOI] [PMC free article] [PubMed]
- 28. Paravina R. D., Pérez M. M., and Ghinea R., “Acceptability and Perceptibility Thresholds in Dentistry: A Comprehensive Review of Clinical and Research Applications,” Journal of Esthetic and Restorative Dentistry 31, no. 2 (2019): 103–112, 10.1111/jerd.12465. [DOI] [PubMed] [Google Scholar]
- 29. Paravina R. D., Majkic G., Del Mar Perez M., and Kiat‐amnuay S., “Color Difference Thresholds of Maxillofacial Skin Replications,” Journal of Prosthodontics 18, no. 7 (2009): 618–625, 10.1111/j.1532-849X.2009.00465.x. [DOI] [PubMed] [Google Scholar]
- 30. Crichton S., Pichat J., Mackiewicz M., Tian G.‐Y., and Hurlbert A., “Skin Chromaticity Gamuts for Illumination Recovery,” Conference on Colour in Graphics, Imaging, and Vision 6, no. 1 (2012): 266–271, 10.2352/CGIV.2012.6.1.art00046. [DOI] [Google Scholar]
- 31. Bakke A., Farup I., and Hardeberg J., “Evaluation of Algorithms for the Determination of Color Gamut Boundaries,” Journal of Imaging Science and Technology 54, no. 5 (2010): 50502–50501–50502–50511, 10.2352/J.ImagingSci.Technol.2010.54.5.050502. [DOI] [Google Scholar]
- 32. Monk E., “The Monk Skin Tone Scale,” (2023), (2019):1‐3, 10.31235/osf.io/pdf4c. [DOI]
- 33. Jimbow K., Quevedo W. C., Fitzpatrick T. B., and Szabo G., “Some Aspects of Melanin Biology: 1950–1975,” Journal of Investigative Dermatology 67, no. 1 (1976): 72–89, 10.1111/1523-1747.ep12512500. [DOI] [PubMed] [Google Scholar]
- 34. Whitehead R. D., Re D., Xiao D., Ozakinci G., and Perrett D. I., “You Are What You Eat: Within‐Subject Increases in Fruit and Vegetable Consumption Confer Beneficial Skin‐Color Changes,” PLoS ONE 7, no. 3 (2012): e32988, 10.1371/journal.pone.0032988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. de Rigal J., Des Mazis I., Diridollou S., et al., “The Effect of Age on Skin Color and Color Heterogeneity in Four Ethnic Groups,” Skin Research and Technology 16, no. 2 (2010): 168–178, 10.1111/j.1600-0846.2009.00416.x. [DOI] [PubMed] [Google Scholar]
- 36. Yun I. S., Lee W. J., Rah D. K., Kim Y. O., and Park B. Y., “Skin Color Analysis Using a Spectrophotometer in Asians,” Skin Research and Technology 16, no. 3 (2010): 311–315, Published online March 2010:311‐315, 10.1111/j.1600-0846.2010.00434.x. [DOI] [PubMed] [Google Scholar]
- 37. Hirose M., Akaho R., Maita C., Sugawara M., and Tsumura N., “Optimization of Spectral Sensitivities of Mosaic Five‐Band Camera for Estimating Chromophore Densities From Skin Images Including Shading and Surface Reflections,” Optical Review 23, no. 3 (2016): 544–548, 10.1007/s10043-016-0192-z. [DOI] [Google Scholar]
- 38. De D. R., Park S. E., Shih T., Shi V. Y., and Hsiao J. L., “Representation of Skin Color, Disease Severity, and Anatomic Location in Hidradenitis Suppurativa Images,” Ski J Cutan Med 8, no. 4 (2024): 1700–1704, 10.25251/skin.8.4.8. [DOI] [Google Scholar]
- 39. Imai F. H., Tsumura N., Haneishi H., and Miyake Y., “Principal Component Analysis of Skin Color and Its Application to Colorimetric Color Reproduction on CRT Display and Hardcopy,” Journal of Imaging Science and Technology 40, no. 5 (1996): 422–430. [Google Scholar]
- 40. Bonnier N., “Attributes of Image Quality for Color Prints,” Journal of Electronic Imaging 19, no. 1 (2010): 011016, 10.1117/1.3277145. [DOI] [Google Scholar]
- 41. Xiao K., Zhu Y., Li C., Connah D., Yates J. M., and Wuerger S., “Improved Method for Skin Reflectance Reconstruction From Camera Images,” Optics Express 24, no. 13 (2016): 14934, 10.1364/OE.24.014934. [DOI] [PubMed] [Google Scholar]
- 42. Liu Y., Liu Y., Xu S., Cheng K., Masuko S., and Tanaka J., “Comparing VR‐ and AR‐Based Try‐On Systems Using Personalized Avatars,” Electronics 9, no. 11 (2020): 1814, 10.3390/electronics9111814. [DOI] [Google Scholar]
- 43. Xiao K., Zardawi F., van Noort R., and Yates J. M., “Color Reproduction for Advanced Manufacture of Soft Tissue Prostheses,” Journal of Dentistry 41 (2013): e15–e23, 10.1016/j.jdent.2013.04.008. [DOI] [PubMed] [Google Scholar]
- 44. Xiao K., Zardawi F., van Noort R., and Yates J. M., “Developing a 3D Colour Image Reproduction System for Additive Manufacturing of Facial Prostheses,” International Journal of Advanced Manufacturing Technology 70, no. 9–12 (2014): 2043–2049, 10.1007/s00170-013-5448-1. [DOI] [Google Scholar]
- 45. Qiao Y., Atis B., Cointereau‐chardon S., et al., “Developing the most inclusive and relevant liquid foundation ranges for multicultural consumers,”: 1–8, Published online 2020.
- 46. Buolamwini J., “Gender Shades : Intersectional Accuracy Disparities in Commercial Gender Classification ∗,” in Proceedings of Machine Learning Research, 2018:1–15.
- 47. Heldreth C. M., Monk E. P., Clark A. T., Schumann C., Eyee X., and Ricco S., “Which Skin Tone Measures Are the Most Inclusive? An Investigation of Skin Tone Measures for Artificial Intelligence,” ACM Journal on Responsible Computing 1, no. 1 (2024): 1–21, 10.1145/3632120. [DOI] [Google Scholar]
- 48. Lu Y., Yang J., Xiao K., Pointer M., Li C., and Wuerger S., “Skin Coloration Is a Culturally‐Specific Cue for Attractiveness, Healthiness, and Youthfulness in Observers of Chinese and Western European Descent,” PLoS ONE 16, no. 10 (2021): e0259276, 10.1371/journal.pone.0259276. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Lu Y., Xiao K., Yang J., Pointer M., Li C., and Wuerger S., “Different Colour Predictions of Facial Preference by Caucasian and Chinese Observers,” Scientific Reports 12, no. 1 (2022): 12194, 10.1038/s41598-022-15951-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Han C., Wang H., Hahn A. C., et al., “Cultural Differences in Preferences for Facial Coloration,” Evolution and Human Behavior 39, no. 2 (2018): 154–159, 10.1016/j.evolhumbehav.2017.11.005. [DOI] [Google Scholar]
- 51. Stephen I. D., Scott I. M. L., Coetzee V., Pound N., and Perrett D. I., “Penton‐Voak IS. Cross‐cultural Effects of Color, but Not Morphological Masculinity, on Perceived Attractiveness of Men's Faces,” Evolution and Human Behavior 33, no. 4 (2012): 260–267, 10.1016/j.evolhumbehav.2011.10.003. [DOI] [Google Scholar]
- 52. Karampinis E., Georgopoulou K.‐E., Kampra E., et al., “Clinical and Dermoscopic Patterns of Basal Cell Carcinoma and Its Mimickers in Skin of Color: A Practical Summary,” Medicina (B Aires) 60, no. 9 (2024): 1386, 10.3390/medicina60091386. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data that support the findings of this study are openly available in The International Skin Spectra Archive (ISSA) at https://doi.org/10.6084/m9.figshare.28228571.v4.


