Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2024 Feb 28;121(10):e2313603121. doi: 10.1073/pnas.2313603121

History constrains the evolution of efficient color naming, enabling historical inference

Colin R Twomey a,b,1, David H Brainard c, Joshua B Plotkin b
PMCID: PMC10927505  PMID: 38416682

Significance

Languages vary widely in how they partition colors into color words. But color vocabularies are not random. Prior work has shown these vocabularies are efficient communication systems shaped by two key constraints: the way humans perceive colors and how often speakers need to talk about different colors. In this study, we show that there is another, qualitatively different constraint on color vocabularies: history. We show how a language’s ancestral color vocabulary shapes its subsequent evolution, while maintaining communicative efficiency. Knowing how history constrains color word evolution allows us to reconstruct ancestral color vocabularies, which sheds light on how color words have changed in number and meaning over time.

Keywords: color categories, language evolution, cultural evolution, collective behavior, information theory

Abstract

Color naming in natural languages is not arbitrary: It reflects efficient partitions of perceptual color space [T. Regier, P. Kay, N. Khetarpal, Proc. Natl. Acad. Sci. U.S.A. 104, 1436–1441 (2007)] modulated by the relative needs to communicate about different colors [C. Twomey, G. Roberts, D. Brainard, J. Plotkin, Proc. Natl. Acad. Sci. U.S.A. 118, e2109237118 (2021)]. These psychophysical and communicative constraints help explain why languages around the world have remarkably similar, but not identical, mappings of colors to color terms. Languages converge on a small set of efficient representations.But languages also evolve, and the number of terms in a color vocabulary may change over time. Here we show that history, i.e. the existence of an antecedent color vocabulary, acts as a nonadaptive constraint that biases the choice of efficient solution as a language transitions from a vocabulary of size n to n+1 terms. Moreover, as efficient vocabularies evolve to include more terms they explore a smaller fraction of all possible efficient vocabularies compared to equally sized vocabularies constructed de novo. This path dependence of the cultural evolution of color naming presents an opportunity. Historical constraints can be used to reconstruct ancestral color vocabularies, allowing us to answer long-standing questions about the evolutionary sequences of color words, and enabling us to draw inferences from phylogenetic patterns of language change.


Are our mental representations of the world anything like those of our ancestors? How could we ever know? On the one hand, this is a question about cognition, and it can be addressed by comparative study of extant nonhuman primates (1), at least over long evolutionary timescales. But for modern humans, over thousands rather than millions of years (2, 3), this is a question about culture and how cultures evolve over time. Color naming—a language community’s mapping of colors to color terms—has become a model system for studying the link between cognition and culture, thanks in part to extensive field work, e.g., refs. 410 and theory (1116). Far from being arbitrary, color naming follows common patterns in linguistic communities around the world (5, 17) that reflect near-optimal partitions of perceptual color space (13) modulated by cross-cultural differences in the needs to communicate about different colors (16). Color naming systems can therefore be seen as efficient allocations of terms to colors (15, 16).

Although recent work has explained extant patterns in color naming based on cognitive constraints and communicative needs (10, 13, 16), this leaves open the question of how color naming in a linguistic community is expected to change over time. Early work by Berlin and Kay (17) posited that color naming systems will follow a stereotyped evolutionary sequence in which color terms appear successively in a (nearly) fixed order. This conceptual model was subsequently revised based on a larger collection of empirical color naming data, broadening the number of potential evolutionary pathways that color vocabularies may take as they increase in size (5, 18). The overall conceptual model—that of a limited number of evolutionary transitions from one stage to the next—has remained largely the same, but with the added potential for path-dependent effects, i.e., the possibility that the next stage of evolutionary development may depend on the path taken in the prior stage.

Are path-dependent evolutionary trajectories for color vocabularies consistent with the theory of efficient color naming? Here we study this question by analyzing how history constrains color naming within an established theoretical framework that predicts color vocabularies as efficient solutions to a representation problem. Whereas the geometry of perceptual color space and communicative demands on colors determine the landscape of efficient color naming systems in this framework, there might nonetheless be path dependence—i.e., the current choice of an efficient solution may constrain the accessibilities of potential future choices. One possibility is that, depending on where in color space a new color term is introduced, any one of several equally efficient vocabularies may be reached. This would be inconsistent with the B&K conceptual model, which posits that some successor vocabularies are unreachable (or at least unlikely) given some prior vocabularies, and it would suggest that color term evolution is more flexible than previously thought. Alternatively, an extant color vocabulary may constrain where a newly introduced color term can become established, thus biasing which efficient vocabularies can be reached from an initial vocabulary. In this case, the initial vocabulary would act as a nonadaptive* constraint on cultural evolution, producing path-dependent effects in color naming systems.

Results

Color Categories.

To study the evolution of color naming systems we must first identify color categories that apply across languages. We take a quantitative approach similar to Lindsey and Brown (19), using the World Color Survey (WCS) color naming experiments to identify cross-language clusters of color terms as universal color categories (Fig. 1; see Materials and Methods: World Color Survey). In the WCS study, roughly 25 speakers from each of 110 languages were asked to name 330 standardized color stimuli. Averaging across speakers, this provides a color term map for each language—namely, the chance that a speaker will use each term to describe a given color stimulus. Using Bayes’ rule we also compute the associated color stimulus map for each term in a language—namely, the probability that each stimulus will be named by that term.

Fig. 1.

Fig. 1.

A cross-language dictionary of 15 color categories (“color words”) derived from clustering all color terms in the empirical WCS languages. (A) The 330 color stimuli shown to native language speakers participating in the WCS study (5, 17). (B) Distribution of vocabulary sizes across the WCS languages. Vocabularies of size n8 are well-represented (N7 languages); vocabularies of size 9, 11, and 12 words are rare (N=5, 3, and 2 languages, respectively). (C) The mapping of WCS color stimuli from A to each color category. An English-language description of the average color for that category is given as the title of each panel, and each mapping is displayed with the corresponding color. Specifically, each panel shows the average (across languages) conditional probability, p(x|x^), that a native speaker’s use of the term for color category x^ in their language refers to WCS color stimulus x. The conditional probability for each stimulus is shown on a zero to one scale, from transparent to the average WCS color of the category. The number, N, of WCS languages containing a term corresponding to each color category is shown in the Upper Right of each panel.

We define cross-language color categories as clusters of color terms identified via modularity maximization (20) (Materials and Methods: Dictionary of Color Categories). We apply a modularity maximization algorithm to the pairwise Earth Mover’s distance (EMD) between the stimulus maps for all terms in all languages (Materials and Methods: Earth Mover’s Distance). The EMD, or 1st Wasserstein distance, measures the minimum cost of transforming one term’s distribution over colors into another term’s distribution. Modularity maximization identified 15 term clusters, which we call color categories, across the 110 languages in the empirical WCS. These clusters include the 11 color categories previously identified by Berlin and Kay, as well as additional categories for orange (occurring in eight languages), red-orange (three languages), and light-brown (two languages), along with a distinction between “off-white” (71 languages) and “white” (38 languages). A dictionary of which WCS color stimuli (Fig. 1A) are typically mapped to each of the 15 color categories is shown in Fig. 1C).

Using this dictionary, we assigned each term in each language to one of the identified color categories by minimizing the EMD between the category-average stimulus map and the language’s stimulus map. If multiple terms in the same language are assigned to the same color category, we consider those terms to be synonyms and treat them as a single term. The resulting nonsynonymous terms we call the language’s color words, and each word is identified with a unique (cross-language) color category. We refer to the complete set of color words in a language as its color vocabulary, and the number of such words is its vocabulary size (Fig. 1B).

Evolutionary Transitions.

What happens when a new term is added to an existing vocabulary of size n? To study this question we use an established model of color naming that captures the tendency of color vocabularies to evolve toward efficient partitions of color space (13) modulated by language-specific communicative needs for colors (10, 16).

Let X be a random variable that represents observable colors in the WCS experiment, taking on values in the Fig. 1A array of 330 WCS color stimuli. Associated with each language’s color terms, denoted X^, is a probabilistic mapping of colors to terms, p(x^|x), that gives the probability that a speaker will use term x^X^ to refer to color xX. Both x and x^ are associated with coordinates in a perceptually uniform color space, x and x^, for measuring distances. While any mapping of colors to color terms is possible, only some mappings are efficient. An efficient mapping minimizes the average distortion introduced by using a color term, where distortion, dxx^, measures the perceptual dissimilarity between a color x and the color typically associated with term x^. The average distortion introduced by a mapping of colors to terms also depends on the language-specific need to communicate each color, p(x), and it is calculated as

Dpx^|x=x,x^px^|xpxdxx^. [1]

Zero average distortion can be achieved using a vocabulary with a unique term matched to every color, i.e. where |X^|=|X| and, for every x, there exists an x^ such that x^=x. But the representational cost—i.e., a rate corresponding to the minimum number of bits needed to encode X^ on average, per observation of X— increases with increasing vocabulary size and specificity. This cost is quantified by the mutual information between X and X^:

Rp(x^|x)=IX;X^=x,x^p(x^|x)pxlogp(x^|x)p(x^), [2]

where p(x^)=xp(x^|x)p(x) is the probability (i.e. frequency) that color term x^ is used by speakers.

The total cost of a choice of mapping, p(x^|x), is given by R+βD, where β[0,) parameterizes the trade-off between representational cost and average distortion (in bits per unit of distortion). As β increases, the specificity of the mapping increases and the average distortion decreases. In this way, β and the number of terms, |X^|, co-determine the representational cost of any given mapping from colors to terms for a given distribution of communicative needs, p(x) (SI Appendix, Fig. S1). A rate-distortion efficient vocabulary, for a particular choice of p(x), β, and |X^|, is a choice of mapping p(x^|x) that minimizes R[p(x^|x)]+βD[p(x^|x)]. Rate-distortion efficient vocabularies are not necessarily unique; more than one efficient solution may be possible.

For fixed communicative needs, p(x), and rate-distortion trade-off, β, the following dynamics

pt+1(x^)=xpt(x^|x)p(x), [3]
pt+1x^|xptx^eβdxx^, [4]
x^t+1=xxpt(x|x^), [5]

provably converge to locally optimal rate-distortion efficient vocabularies as t (21). Locally optimal solutions may likewise be non-unique, and moreover may vary in their relative efficiency. Rates below this efficient frontier for any given choice of allowable distortion (parameterized by β) are impossible. Illustrated diagrammatically,

graphic file with name pnas.2313603121unfig01.jpg

These dynamics move an initial choice of color term mapping, p0(x^|x), from the feasible region (shown in white) toward the efficient frontier—the boundary between feasible (white) and infeasible (gray) solutions, for any choice of trade-off between R and D, parameterized by β. (See SI Appendix, section A for an example that also varies |X^|). As D0, the minimum rate, R, of the efficient frontier is bounded from below by the entropy of X, i.e. H(X).

By systematically introducing a new term at low initial frequency (i.e., rare initial usage as a color word), varying its initial focal position over the set of 330 WCS stimuli, and running forward the vocabulary dynamics (Eqs. 3 to 5, holding p(x) and β constant), we determined the number of unique (n+1)-word vocabularies that a given n-word vocabulary may generate, as well as the relative size of the basin of attraction for each the number of focal positions that converge to the same (n+1)-word vocabulary; see (Materials and Methods: Generating Successor n + 1 Vocabularies). By assigning the terms of the resulting (n+1)-word vocabulary to the closest matching color category, we can identify when the newly introduced term results in a set of nonsynonymous (n+1) color words, as well as any changes in meaning between the n- and (n+1)-word vocabularies.

Using the WCS languages as initial vocabularies we systematically probed the introduction of new terms, and we identified the probability that a given word will transition in meaning as the vocabulary size increases (Fig. 2). When a new word is introduced, it may displace or carve out a new “niche” in color space that impacts the mappings of existing color words. For example, a term that was identified as green-blue in a language of vocabulary size n=5 may shift in meaning and become identified with the color category “green” after a newly introduced term establishes as “blue.” Or, the existing green-blue term could become “blue,” while the newly introduced term becomes “green.” We quantified the probability of a change in meaning as the frequency with which a given color word in extant n-word vocabularies (corresponding to rows in Fig. 2A) is identified with a given color word in the expected (n+1)-word vocabulary (columns in Fig. 2B) after a new term is introduced (marginalizing over both languages and initial placements of the new term).

Fig. 2.

Fig. 2.

Transitions from n- to (n+1)-word vocabularies, based on historically constrained empirical data. (A) Semantic shifts in the meaning of color terms. Each row shows the probabilities that a term in a n-word color vocabulary, originally identified with one color word from the dictionary (Fig. 1), is identified with each of the color words (columns) after the addition of a new term. For example, a term corresponding to the color category “off-white” in a 3-word vocabulary may become associated with either “off-white” or “white” in a resulting 4-word vocabulary. The rows marked + indicate the identification probabilities (columns) of the added term. (B) Sankey diagram of semantic shifts in color word meaning after the introduction of a new term. Each row shows the relative frequency of color words in WCS vocabularies of a given vocabulary size (y-axis), where box size corresponds to the frequency of words among languages and box colors correspond to Fig. 1C. Semantic mappings between stages are shown by connecting lines, where the thickness of each line corresponds to the fraction of times a transition occurs under the modeled rate-distortion dynamics.

Color words vary in their susceptibility to semantic shift under addition of a new term. Fig. 2B illustrates the transition probabilities shown in Fig. 2A diagrammatically: The thickness of the curves linking color words shows the expected fraction of each type of semantic transition. Red, black, and (once it appears) yellow are relatively stable in meaning over successive additions of new terms, whereas, e.g., green-blue and blue are more prone to semantic change. The broader off-white color category feeds into, and then is largely replaced by, a sharper white category, in a successional sequence of vocabularies from n=3 words to n=7 words.

Historical Constraints on Vocabularies.

While both n- and (n+1)-word vocabularies are rate-distortion efficient, they are often not unique. Which particular (n+1)-word vocabulary will appear after addition of a new color term depends on the precursor n-word vocabulary. For two efficient vocabularies constrained by the same communicative needs and rate-distortion tradeoff (Fig. 3A), the probabilities of likely successor vocabularies can vary widely (mean within-language SD in probability of successor vocabularies was 0.40 ± 0.005 SE; Fig. 3B). For example, an extant 6-word WCS vocabulary with words for white, black, yellow, pink, and green-blue, will most likely split green-blue into a term for green and a term for blue; whereas a similar efficient vocabulary that already has blue and green rather than green-blue, but no pink, is unlikely to next introduce pink but more likely to introduce light-green or orange (Fig. 3B).

Fig. 3.

Fig. 3.

Historical constraints influence the likelihood of alternative efficient vocabularies. (A) The rate-distortion efficient vocabulary corresponding to an empirically measured (“historical”) WCS color vocabulary (Left) can differ from a rate-distortion efficient vocabulary constructed de novo using the same language-specific communicative needs, p(x), and same need for precision, β. In this example, one vocabulary contains a term for pink, while the other contains no term for pink and distinct terms for green and for blue. (B) Adding a new term either to the de novo 6-word vocabulary (red bars) or to the historical 6-word vocabulary (blue bars) produces one of five different efficient 7-word vocabularies (rows) under the rate distortion model. The frequencies of these efficient solutions vary widely, and they depend strongly on the precursor language (red bars versus blue bars). In particular, the historical precursor vocabulary with a single word for green-blue is most likely to develop separate words for green and blue (blue bars); whereas the de novo precursor vocabulary lacking pink is unlikely to introduce pink and more likely to introduce either light-green or orange (red bars).

The strength of historical constraint on successor vocabularies increases as vocabulary size grows. To measure this difference, we used the same communicative needs and rate-distortion tradeoff inferred for each WCS language, and constructed de novo all unique locally optimal efficient vocabularies following the vocabulary dynamics in Eqs. 3 to 5. This superset of each extant WCS language vocabulary contained one or more choices of efficient vocabulary possible given the inferred constraints, with the likelihood of each choice given by its basin of attraction (number of converged solutions based on 1,000 random initial configurations). The number of unique vocabularies generated by adding a term to an extant 3-word vocabulary is almost as large as the number of unique 4-word vocabularies generated de novo. But the latter quickly outpaces the former as vocabulary size increases (Fig. 4A). This remains true when weighting by the frequency of each potential successor vocabulary (Fig. 4B)—at least up to vocabularies of size 8, after which the number of WCS languages is sparse (Fig. 1B) and uninformative—and does not reflect any systematic difference in the quality of the rate-distortion efficient solutions achieved under de novo or historical constraints (Fig. 4C). This also remains true when comparing the space spanned by the ensemble of historically constrained versus de novo successor vocabularies (Fig. 4D). The space of all efficient vocabularies is much richer for large de novo vocabularies than for those realized in extant languages constrained by an evolutionary process of successive additions of terms. For example, the average pairwise distance between de novo vocabularies of size 8 is 45% greater than for historically constrained languages. To put this succinctly, history matters.

Fig. 4.

Fig. 4.

The number of possible efficient vocabularies exceeds the number of historically constrained successor vocabularies. (A) As vocabulary size (number of words) increases, the total number of possible efficient vocabularies (constructed de novo) increases while the number of historically constrained successor vocabularies remains relatively constant. (B) The effective number of potential successor vocabularies (accounting for the relative frequency of each solution) as a function of vocabulary size. Here, the “effective number” is computed as 2HV, where HV is the entropy of the frequency distribution over the set V of distinct successor vocabularies. This quantity is largest and equal to the cardinality of V when every successor vocabulary is equally likely. (C) Historically constrained and de novo efficient vocabularies show no systematic difference in the quality of rate-distortion efficient solutions, as measured by their rate (mutual information) and average distortion. Lines show weighted (by solution frequency) LOESS regressions of rate as a function of distortion. (D) Pairwise Wasserstein distance between historically constrained (n+1)-word vocabularies and de novo vocabularies of size n+1. For small vocabulary sizes, the solution space explored is of similar size; as vocabulary size increases, de novo solutions span a larger space.

Ancestral Vocabulary Reconstruction.

Historical constraints on color vocabularies present an opportunity: They can be used to estimate likely ancestral languages. These are reconstructions of entire ancestral color naming systems, based on the rate-distortion model, as opposed to reconstructions of individual lexical items. Holding communicative needs and the community’s rate-distortion tradeoff constant, we can generate de novo (n1)-word vocabularies and compute the likelihood of their [(n1)+1]-word successor vocabularies. The likely ancestral vocabularies are those whose successor vocabularies are likely to produce the observed extant n-word vocabulary (Materials and Methods: Generating precursor n – 1 vocabularies).

The Kay, Berlin, Maffi and Merrifield (KBMM) conceptual model of color term evolution (Fig. 5A) contains a long-standing mystery (18): What are the precursor 3-word vocabularies for the observed 4-word vocabularies that contain words for white (W), red (R), black (Bk), yellow/green/blue (Y/G/Bu) (Fig. 5B lines D and E)? Using the ancestral reconstruction method described above, we computed all potential 3-word precursors that fully partition color space and are rate-distortion efficient, shown in Fig. 5C. The most likely such precursor has a combined term for red and white—a vocabulary that is not observed among the languages surveyed in the WCS. The only other efficient 3-term precursor has terms for white, red/yellow, and black/green/blue. But, despite appearing among the languages sampled in the WCS, by our analysis this candidate precursor is 39% less likely. This suggests that there may be an alternative, unobserved evolutionary pathway, which is missing from the precursors sampled in the WCS dataset. This example demonstrates the utility of directly reconstructing ancestral vocabularies: It can reveal ancestral states that may not be present in a sample of extant languages, or that may not persist to the present at all.

Fig. 5.

Fig. 5.

Ancestral language reconstructions for WCS languages, compared to hypothesized evolutionary transitions. (A) Kay et al. (18, 23) classified the color vocabularies in the WCS data for languages containing between 2 and 6 terms, and hypothesized possible evolutionary transitions between them (arrows). According to this classification, Stage I languages have two color words—grouping white, red, and yellow (W/R/Y) as one term, and grouping black, green, blue (Bk/G/Bu) as a second term. Stages I, II, and V vocabularies have one form each, while Stages III and IV WCS languages (4- and 5-word vocabularies, respectively) have three possible forms each. (Diagram adapted from Kay et al. (23) and Kay & Maffi (18)) (B) KBMM hypothesized several alternative evolutionary sequences, including the hypothesized most common “mainline” sequence, line A; two additional sequences, (lines B and C) that vary at stage III; as well as two alternative sequences, (lines D and E), that contain unknown stage-I and stage-II precursor vocabularies (Diagram adapted from Kay et al. (23) and Kay & Maffi (18)). (C) Likely rate-distortion efficient ancestral (precursor) languages for a stage III Y/G/Bu 4-word language (Top). Among the two possible reconstructions of ancestral 3-word languages, the more likely precursor (P=0.72, Middle) does not correspond to any vocabulary observed in the WCS study; whereas the less likely precursor (P=0.28, Bottom) corresponds to a stage-II language. (D) For comparison, a more typical Stage IV G/Bu vocabulary (Top) is shown along with two almost equally likely reconstructed ancestral languages that agree with the sequences hypothesized by KBMM. The most likely precursor (P=0.43, Middle) corresponds to a Stage III G/Bu vocabulary (sequence B), and the next most likely precursor (P=0.40, Bottom) corresponds to a Stage III Bk/G/Bu vocabulary (sequence A).

As a second example of this reconstruction method, we considered the possible precursors for a 5-word WCS vocabulary that falls along one of the “mainline” KBMM evolutionary sequences. We identify three potential precursor vocabularies, with the two most likely shown in Fig. 5D. These correspond to a precursor with words for white, red/yellow, green/blue, and black (mainline path A), and a precursor with words for white, red, yellow, and black/green/blue (mainline path B). These two precursors are almost equally likely, and each is 245% more likely than the third alternative. Thus the reconstructed vocabularies and their relative likelihoods allow us to distinguish among the potential evolutionary paths considered by KBMM, and can also reveal when ancestry is ambiguous.

Discussion

Systems of color naming are constrained by perception and by communicative needs (1013, 15, 16). Our study reveals a third, qualitatively different constraint on color vocabularies—namely, history. Languages evolve from precursor languages, and this alone produces quantifiable constraints on color vocabularies, introducing path dependence in color word evolution. As new terms are added to an n-word vocabulary, the existing terms will shift in meaning and extent to accommodate. We have shown that this flexibility is limited, so that not all efficient (n+1)-word vocabularies are equally likely. As color vocabularies increase in size, language communities explore only a small subset of the full range of efficient solutions that are possible.

These results establish that the theory of efficient color naming is consistent with a stereotypical set of evolutionary pathways, as hypothesized in early studies (5, 17, 18, 22, 23). Our analysis adds substantial detail to this hypothesis by quantifying the relative probabilities of different evolutionary pathways. This perspective also provides an answer to a long-standing question about ancestral precursors of particular languages in the WCS without invoking additional dynamics.

At the same time, our model is not incompatible with an “emergence hypothesis” (EH) that posits that at early stages of development some color naming systems do not fully partition all of color space Kay and Maffi (18). Rather than assuming communicative needs are constant over time, as we have done in this work, efficient naming under EH suggests that some regions of color space had no communicative need whatsoever in the ancestral language. Still, our results do suggest that color vocabularies of size 5 or larger are unlikely to emerge de novo. Fig. 4AD show that there is substantial overlap in historically constrained and de novo vocabularies only for those of size n=4 words. Thus vocabularies larger than 4 words likely developed from an already efficient precursor. This conjecture is consistent with the small vocabulary size of the few known examples of languages that cover only part of the color domain, e.g., refs. 8, 9, and 24.

Many important questions remain. For one, quantitative estimates of likely transitions in color vocabularies depend on where in color space new terms are introduced, which may not be uniform over the WCS color space as assumed in our analyses, see e.g., ref. 25. Are there biases—cognitive, communicative, or historical—on where new terms are likely to be introduced? How might within-language variation in color naming among speakers (19, 25, 26) play a role in this process? The present analysis does not attempt to explain this individual-level variability, and so it remains an open question as to its impact on the evolution of color naming systems.

Second, while the present work investigates semantic shift when new terms are added to an existing vocabulary, for simplicity this analysis has been done while holding constant the communicative needs and β, the rate-distortion tradeoff parameter controlling the need for precision. How might changes in the need for precision or the distribution of communicative needs across colors alter the course of cultural evolution?

Third, this study has focused on the addition of new terms to an existing language, but deletions might occur as well. Recent work on color vocabulary phylogenies suggests that term loss may be more common than previously thought (27). We observe one form of loss in our results: The addition of a term to a vocabulary can result in the loss of one color word in exchange for two new words. A loss of this kind is due to semantic shift: e.g., a green-blue term becomes a green term after the newly added term establishes as blue. But Haynie and Bowern (27) indicate a distinct form of loss: a deletion that strictly reduces the size of a color vocabulary, contra KBMM (18, 23). We have not treated this possibility in the present study. If such losses are common, repeated cycles of term deletion and addition could allow languages to explore a larger fraction of the efficient vocabularies that are possible. While the simplifying assumption of no deletions is a limitation of this study, our basic framework could be extended using a Markov process that allows for any number of n to n1 and n1 to n transitions, parameterized by the probability of a loss or gain at a vocabulary size of n. To implement such an approach, however, some additional consideration would need to be given to the different possible ways that a term could be lost (i.e. how a term decreases in frequency over time).

Answering these questions, and calibrating an extended model that allows for deletions, will benefit from contact between dated language phylogenies and models of efficient color naming. Color word presence/absence data of the kind used in Haynie and Bowern (27) are insufficient for this purpose: We require direct knowledge of the mappings between terms and colors in order to infer language-specific communicative needs for colors and the need for precision. Whereas the world-spanning breadth of the WCS makes it ideal for answering questions about color naming similarities and differences across cultures, its depth in any particular language family is relatively shallow, limiting its utility for linking color naming to language phylogenies. Future work could use the relatively high number of Austronesian languages within the WCS (N=8) in combination with recent dated phylogenies for the Austronesian language family (28) to study the questions of ancestral needs and term loss. An alternative is the under-explored Mesoamerican Color Survey (MCS) collected by MacLaury (6), which catalogued 116 indigenous languages spoken in Mexico and Central America. Our analysis of color term evolution gives further motivation to curate and assemble these data (29, 30) and develop dated language phylogenies for more Mesoamerican language families e.g., ref. 31 for Uto-Aztecan. Our work also underscores the utility of longitudinal studies that sample color naming in a linguistic community over time, e.g., refs. 32 and 33.

History likely constrains efficient representations in other aspects of language and culture aside from color—such as kinship and spatial relations (3437), numeral systems (38), and person systems (39). Our approach to quantifying historical constraints and inferring ancestral states could be used to study the evolution of these cultural systems as well. There can be surprising utility in understanding how the elements of culture and language change over time; for example, using the colors reported in astronomical observations made thousands of years ago to estimate the fate of nearby stars (40).

Given the backdrop of evolutionary thought, our results are not entirely surprising. The concept of phylogenetic constraint—i.e., the idea that an organism’s ancestry imposes limits on its evolutionary trajectory—is fundamental to evolutionary biology. Gould and Lewontin (41) introduced the concept of “spandrels,” or nonadaptive evolutionary by-products that arise as a consequence of selection for other traits, as an illustration of phylogenetic constraint. Phylogenetic constraint can shape the pace and direction of evolution and limit the range of phenotypes that evolve within a lineage. Quantifying these constraints has been essential for interpreting the patterns and processes of biological form and function. Our work suggests that the same may be true of cultural evolution. This supports the broader view that studying evolutionary processes can provide valuable insights into cultural systems (in color naming, e.g., refs. 42 and 43, and in general (4447)), and vice versa.

matseccnt1

Methods

World Color Survey.

Using a standardized set of 330 color stimuli based on Brown and Lenneberg (4), Kay et al. (5) catalogued color naming in 110 languages around the world. Fieldworkers presented each of the color stimuli, one at a time in a fixed, randomized order, to on average 24 native speakers. Ambient lighting was approximately controlled by presenting stimuli at noon in the shade. The results of this survey are publicly available online from the WCS Data Archives (https://www.icsi.berkeley.edu/wcs/data.html) ( 57).

There is evidence of some variability in how field workers interpreted the WCS instructions for conducting the experiment Gibson et al. (10) SI Appendix, section 1. Critically, some field workers in the WCS study ran a “free-choice” version of the experiment, in which participants could use any word of their choosing for any color, in contrast to the intended “fixed-choice” version in which participants were asked to respond using one of the words in their language previously identified by the field-worker as “basic” according to the WCS criteria. Estimates from Gibson et al. (10) suggest that roughly 18 out of the 110 languages were recorded under free-choice conditions.

To mitigate this variability, we used the same criteria for including recorded terms in our analysis of a language’s vocabulary as in Twomey et al. (16): We include all recorded terms that had an associated focal color, was used by at least two surveyed speakers, and was considered the best choice for at least one WCS color chip. Under these criteria, Twomey et al. (16) SI Appendix, section 3E showed there was no predictive relationship between the free/fixed-choice categorization from Gibson et al. (10) and the performance of the rate-distortion model of color naming used in this work and in Twomey et al. (16). These criteria, together with the identification of a language’s non-synonymous terms discussed in Section: Color categories, give the estimated WCS vocabulary sizes used in this work (Fig. 1B).

Dictionary of Color Categories.

Instead of clustering color terms across languages based on overlapping WCS speaker color maps (choices of words for a given color) as in Lindsey and Brown (19), we clustered based on inferred meanings of a given word, i.e., the probability a speaker is referring to a color given a choice of word. We computed the probability, p(x|x^), that a speaker using a language’s color term, x^, refers to a color, x, according to p(x|x^)p(x|x^)p(x), where p(x^|x) is the WCS speaker-average color map and p(x) is the inferred language-specific communicative needs from Twomey et al. (16). We then measured the dissimilarity, Dij, as the EMD between every pair of color terms across all WCS languages. Using the adjacency matrix Aij=exp[Dij], we formulated the problem of identifying cross-language groupings of color terms (color term “universals”) as a graph-theoretic “community identification” problem (a class of problems in network science concerned with identifying clusters, or “communities,” in a given network). As in Jackson et al. (48) for identifying colexification patterns across languages, we identified communities based on their modularity. Unlike Jackson et al. (48), the large size of A prohibited the use of the exponential time linear integer programming formulation of the problem given by Brandes et al. (51); instead, we used the well-known approximation method of Clause et al. (52), implemented in the R igraph package (53).

Earth Mover’s Distance.

Earth Mover’s Distance (EMD) is a measure of the difference between two probability distributions, defined as the minimum cost of transforming one distribution into another, where cost is calculated as the sum of the distances between each point in the two distributions multiplied by the amount of mass moved. In other words, the EMD measures the minimum amount of “mass transportation” required to move the points in one distribution to match the corresponding points in the other distribution. We compute this quantity using the emdist R package based on the Rubner et al. (54) implementation.

Generating Successor n + 1 Vocabularies.

We start by noting that empirical vocabularies may not be exactly at equilibrium when measured: Adaptation of a color vocabulary can be expected to take time, likely related to the population size of the language community and the frequency of color term use. These nonequilibrium effects are of interest in their own right, but here we aim to control their impact (as well as noise in empirical vocabularies due to sampling) by using the expected equilibrium form of empirically measured color vocabularies under the dynamics given by Eqs. 3 to 5. It has already been shown that the empirical WCS vocabularies are well approximated by these equilibrium solutions (16). We do this in practice through a preprocessing step that produces a rate-distortion efficient version of each language. In this step, we start with the WCS empirically measured focal color positions in CIE Lab color space, as well as the best-fit language-specific estimate of β and inferred communicative needs p(x) from ref. 16, and then run the rate-distortion vocabulary dynamics to equilibrium. We have shown previously that this step does not lead to large changes in vocabulary (16).

Next, given a rate-distortion efficient vocabulary with n words, candidate n+1 term successor vocabularies were generated by systematically introducing a new term with a focal color initialized to each WCS color stimulus (330 total) and low initial frequency (equal to 1e09), and running forward the rate-distortion dynamics given by Eqs. 3 to 5 to equilibrium, while the communicative needs, p(x), of the language, and need for precision, β, were held constant. The choice of an arbitrarily small frequency for the initial usage of the introduced term reflects the fact that any new word must start its life with a single utterance. This is analogous to the standard approach to adaptive dynamics in theoretical evolutionary biology, where the fitness of a mutant is assessed when it is initially rare in a population (55).

Candidate successor vocabularies generated in this way are not unique, and result in one or a small number of efficient solutions. Non-unique (i.e., approximately identical) candidate successor vocabularies were judged based on the average squared distance (in CIE Lab space) between the focal colors of corresponding color terms between the two vocabularies, where correspondence was determined based on the minimum distance assignment problem matching focal colors from one vocabulary to the other. Pairs below a root-mean squared distance threshold (10ΔE)§ were connected, and the connected components of the resulting graph determined the set of nonunique successor vocabularies.

Generating Precursor n − 1 Vocabularies.

Candidate (n1)-word precursor vocabularies for a given n-word rate-distortion efficient vocabulary were generated by systematically deleting each of the n words in turn, and running forward the rate-distortion dynamics (Eqs. 3 to 5) to equilibrium. Unique candidates (determined with the same procedure used when identifying successor vocabularies; see Materials and Methods: Generating Successor n + 1 Vocabularies) were retained, and were each used in turn to generate candidate [(n1)+1]-word successor vocabularies. The proportion of successor vocabularies that converge to the original n-word vocabulary give an estimate of the relative likelihood of each candidate precursor vocabulary. Distance to the original n-word vocabulary was computed as the minimum mean squared matching distance between the two vocabularies focal colors, as above. For a given language, let i index the candidate precursor vocabularies, wij denote the proportion of term additions that result in the [(n1)+1]-word vocabulary j, and Vij the distance between the original n-word vocabulary and the jth [(n1)+1]-word vocabulary. We estimated the overall likelihood, pi, of the precursor vocabulary as

piπ(σ)jwijeσVijdσ, [6]

where σ is a scaling parameter on the (n1)+1 to n distances, and π(σ) is a uniform prior on σ over an interval covering the typical variation in likelihoods across languages. Marginalizing over this prior for σ gives likelihood estimates that are similar to any particular choice of σ on this interval (SI Appendix, section B and Fig. S2).

Supplementary Material

Appendix 01 (PDF)

pnas.2313603121.sapp.pdf (206.6KB, pdf)

Acknowledgments

C.R.T. was supported in part by a postdoctoral fellowship through the University of Pennsylvania’s Data Driven Discovery Initiative. J.B.P. acknowledges funding from the John Templeton Foundation (grant number 62281).

Author contributions

C.R.T., D.H.B., and J.B.P. designed research; C.R.T. performed research; C.R.T. contributed new reagents/analytic tools; C.R.T. analyzed data; and C.R.T., D.H.B., and J.B.P. wrote the paper.

Competing interests

The authors declare no competing interest.

Footnotes

This article is a PNAS Direct Submission.

*Note that non-adaptive does not imply maladaptive. In evolutionary theory, a maladaptive trait is harmful to the fitness of its bearer, while a non-adaptive trait has neither a direct cost nor a benefit.

Colexification refers to the use of a single word to represent multiple concepts in a language. Colexification patterns across languages can be used to estimate the degree of similarity between concepts (49, 50).

The minimum distance assignment problem is a mathematical problem that involves finding the optimal assignment of objects from one set to another based on a given cost function. In this case, the cost function is the squared distance between the focal colors of corresponding color terms in the two vocabularies.

§Beyond this threshold vocabularies appeared visibly distinct in terms of their category extents.

Data, Materials, and Software Availability

All data and code used in this work are publicly available online. The data are available via the WCS Data Archives hosted at https://www.icsi.berkeley.edu/wcs/data.html (57). The code to reproduce this work is available on GitHub at https://github.com/crtwomey/twomey2024 (58), and is based in part on code and data from Twomey et al. 2021 (59). The results in this manuscript were generated using R v4.2.3 and the targets package for reproducibility (56). All empirical data used in this study is from prior, publicly available data sources: namely, the World Color Survey (5).

Supporting Information

References

  • 1.Corballis M. C., Language evolution: A changing perspective. Trends Cogn. Sci. 21, 229–236 (2017). [DOI] [PubMed] [Google Scholar]
  • 2.Powell A., Shennan S., Thomas M. G., Late pleistocene demography and the appearance of modern human behavior. Science 324, 1298–1301 (2009). [DOI] [PubMed] [Google Scholar]
  • 3.Greenhill S. J., Atkinson Q. D., Meade A., Gray R. D., The shape and tempo of language evolution. Proc. R. Soc. B 277, 2443–2450 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Brown R. W., Lenneberg E. H., A study in language and cognition. J. Abnorm. Psychol. 49, 454–462 (1954). [DOI] [PubMed] [Google Scholar]
  • 5.Kay P., Berlin B., Maaffi L., Merrifield W., Cook R., The World Color Survey (CLSI, Standford, 2009). [Google Scholar]
  • 6.MacLaury R. E., Color and Cognition in Mesoamerica: Constructing Categories as Vantages (University of Texas Press, Austin, TX, 1997). [Google Scholar]
  • 7.Heider E. R., Probabilities, sampling, and ethnographic method: The case of Dani colour names. Man 7, 448–466 (1972). [Google Scholar]
  • 8.Levinson S. C., Yélî dnye and the theory of basic color terms. J. Linguist. Anthropol. 10, 3–55 (2000). [Google Scholar]
  • 9.Lindsey D. T., Brown A. M., Brainard D. H., Apicella C. L., Hunter-gatherer color naming provides new insight into the evolution of color terms. Curr. Biol. 25, 2441–2446 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gibson E., et al. , Color naming across languages reflects color use. Proc. Natl. Acad. Sci. U.S.A. 114, 10785–10790 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.K. Jameson, R. G. D’Andrade, “It’s not really red, green, yellow, blue: An inquiry into perceptual color space” Color Categories in Thought and Language, C. L. Hardin, L. Maffi, Eds. (Cambridge University Press, Cambridge, UK, 1997).
  • 12.Yendrikhovskij S. N., Computing color categories from statistics of natural images. J. Imaging Sci. Technol. 45, 409–417 (2001). [Google Scholar]
  • 13.Regier T., Kay P., Khetarpal N., Color naming reflects optimal partitions of color space. Proc. Natl. Acad. Sci. U.S.A. 104, 1436–1441 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Jäger G., The evolution of convex categories. Linguist. Philos. 30, 551–564 (2007). [Google Scholar]
  • 15.Zaslavsky N., Kemp C., Regier T., Tishby N., Efficient compression in color naming and its evolution. Proc. Natl. Acad. Sci. U.S.A. 115, 7937–7942 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Twomey C., Roberts G., Brainard D., Plotkin J., What we talk about when we talk about colors. Proc. Natl. Acad. Sci. U.S.A. 118, e2109237118 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Berlin B., Kay P., Basic Color Terms: Their Universality and Evolution (University of California Press, Berkeley, 1969). [Google Scholar]
  • 18.Kay P., Maffi L., Color appearance and the emergence and evolution of basic color lexicons. Am. Anthropol. 101, 743–760 (1999). [Google Scholar]
  • 19.Lindsey D. T., Brown A. M., World color survey color naming reveals universal motifs and their within-language diversity. Proc. Natl. Acad. Sci. U.S.A. 106, 19785–19790 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.M. E. J. Newman, G. M, Finding and evaluating community structure in networks. Phys. Rev. E 69 (2004). 10.1103/PhysRevE.69.026113. [DOI] [PubMed]
  • 21.Banerjee A., Merugu S., Dhillon I. S., Ghosh J., Clustering with Bregman divergences. J. Mach. Learn. Res. 6, 1705–1749 (2005). [Google Scholar]
  • 22.Kay P., McDaniel C. K., The linguistic significance of the meanings of basic color terms. Language 54, 610–646 (1978). [Google Scholar]
  • 23.P. Kay, B. Berlin, L. Maffi, W. Merrifield, “Color naming across languages” in Color Categories in Thought and Language, C. L. Hardin, L. Maffi, Eds. (Cambridge University Press, Cambridge, 1997), pp. 21–56.
  • 24.D. T. Lindsey, A. M. Brown, D. H. Brainard, C. L. Apicella, Hadza color terms are sparse, diverse, and distributed, and presage the universal color categories found in other world languages. i-Perception 7 (2016). 10.1177/2041669516681807. [DOI] [PMC free article] [PubMed]
  • 25.Lindsey D. T., Brown A. M., The color lexicon of American English. J. Vision 14 (17), 1–25 (2014). [DOI] [PubMed] [Google Scholar]
  • 26.M. A. Webster, P. Kay, “Individual and population differences in focal colors” in The Anthropology of Color, R. L. MacLaury, G. Paramei, D. Dedrick, Eds. (John Benjamins, Amsterdam, 2007), pp. 29–53.
  • 27.Haynie H., Bowern C., Phylogenetic approach to the evolution of color term systems. Proc. Acad. Sci. U.S.A. 113, 13666–13671 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Gray R. D., Jordan F. M., Language trees support the express-train sequence of Austronesian expansion. Nature 405, 1052–1055 (2000). [DOI] [PubMed] [Google Scholar]
  • 29.K. A. Jameson, P. S. Deshpande, S. Tauber, S. M. Chang, S. Gago, “Using individual differences to better determine normative responses from crowdsourced transcription tasks: An application to the R. E. Maclaury color categorization archive” in 2016 IS&T International Symposium on Electronic Imaging (San Francisco, CA, USA, 2016).
  • 30.K. A. Jameson et al., Mesoamerican color survey digital archive. in Encyclopedia of Color Science and Technology, R. Luo, Ed. (Springer, New York, NY, 2015), pp. 909–921.
  • 31.S. J. Greenhill et al., A recent northern origin for the Uto-Aztecan family. Language (2023).
  • 32.Kuriki I., et al. , The modern Japanese color lexicon. J. Vision 17, 1 (2017). [DOI] [PubMed] [Google Scholar]
  • 33.N. Zaslavsky, K. Garvin, C. Kemp, N. Tishby, T. Regier, The evolution of color naming reflects pressure for efficiency: Evidence from the recent past. J. Lang. Evol. (2022). 10.1093/jole/lzac001. [DOI]
  • 34.Kemp C., Regier T., Kinship categories across languages reflect general communicative principles. Science 336, 1049–1054 (2012). [DOI] [PubMed] [Google Scholar]
  • 35.T. Regier, C. Kemp, P. Kay, “Word meanings across languages support efficient communication” in The Handbook of Language Emergence, B. MacWhinney, W. O’Grady, Eds. (Wiley, Hoboken, NJ, 2015), pp. 237–263.
  • 36.Kemp C., Xu Y., Regier T., Semantic typology and efficient communication. Annu. Rev. Linguist. 4, 109–128 (2018). [Google Scholar]
  • 37.Gibson E., et al. , How efficiency shapes human language. Trends Cogn. Sci. 23, 389–407 (2019). [DOI] [PubMed] [Google Scholar]
  • 38.Xu Y., Liu E., Regier T., Numeral systems across languages support efficient communication: From approximate numerosity to recursion. Open Mind 4, 57–70 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.N. Zaslavsky, M. Mora, J. Culbertson, “Let’s talk (efficiently) about us: person systems achieve near-optimal compression” in Proceedings of the Annual Meeting of the Cognitive Science Society (Curran Associates, Inc., Red Hook, NY, 2021).
  • 40.Neuhäuser R., et al. , Colour evolution of Betelgeuse and Antares over two millennia, derived from historical records, as a new constraint on mass and age. Mon. Not. R. Astron. Soc. 516, 693–719 (2022). [Google Scholar]
  • 41.Gould S., Lewontin R., The spandrels of san Marco and the Panglossian paradigm: A critique of the adaptationist programme. Proc. R. Soc. B 205, 581–598 (1979). [DOI] [PubMed] [Google Scholar]
  • 42.Komarova N. L., Jameson K. A., Narens L., Evolutionary models of color categorization based on discrimination. J. Math. Psychol. 51, 359–382 (2007). [Google Scholar]
  • 43.Xu J., Dowman M., Griffiths T., Cultural transmission results in convergence towards colour term universals. Proc. R. Soc. B 280 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Dunn M., Greenhill S. J., Levinson S. C., Gray R. D., Evolved structure of language shows lineage-specific trends in word-order universals. Nature 473, 79–82 (2011). [DOI] [PubMed] [Google Scholar]
  • 45.D. Dediu et al., “Cultural evolution of language” in Cultural Evolution: Society, Technology, Language, and Religion, P. J. Richerson, M. H. Christiansen, Eds. (MIT Press, Cambridge, MA, 2014), pp. 303–332.
  • 46.S. C. Levinson, “Language evolution” in The Cambridge Handbook of Linguistic Anthropology, N. J. Enfield, P. Kockelman, J. Sidnell, Eds. (Cambridge University Press, 2014), pp. 309–324.
  • 47.Creanza N., Kolodny O., Feldman M. W., Cultural evolutionary theory: How culture evolves and why it matters. Proc. Natl. Acad. Sci. U.S.A. 114, 7782–7789 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Jackson J., et al. , Emotion semantics show both cultural variation and universal structure. Science 366, 1517–1522 (2019). [DOI] [PubMed] [Google Scholar]
  • 49.A. François, “Semantic maps and the typology of colexification: Intertwining polysemous networks” in From Polysemy to Semantic Change: Towards a Typology of Lexical Semantic Associations, M. Vanhove, Ed. (John Benjamins Publishing Company, 2008), vol. 106, pp. 163–215.
  • 50.Natale A. D., Pellert M., Garcia D., Colexification networks encode affective meaning. Affect. Sci. 2, 99–111 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Brandes U., et al. , On modularity clustering. IEEE Trans. Knowl. Data Eng. 20, 172–188 (2008). [Google Scholar]
  • 52.Clause A., Newman M. E. J., Moore C., Finding community structure in very large networks. Phys. Rev. E 70, 066111 (2004). [DOI] [PubMed] [Google Scholar]
  • 53.Csardi G., Nepusz T., The igraph software package for complex network research. InterJournal Complex Syst. 1695 (2006). [Google Scholar]
  • 54.Y. Rubner, C. Tomasi, L. J. Guibas, “A metric for distributions with applications to image databases” in IEEE Sixth International Conference on Computer Vision, Bombay, India (IEEE, 1998), pp. 59–66.
  • 55.J. A. J. Metz, S. A. H. Geritz, G. Meszéna, F. J. A. Jacobs, J. S. Van Heerwaarden, “Adaptive dynamics: A geometrical study of the consequences of nearly faithful reproduction” in Stochastic and Spatial Structures of Dynamical Systems, S. J. Van Strien, S. M. Verduyn Lunel, Eds. (North-Holland, Amsterdam, 1996), pp. 182–231.
  • 56.Landau L. M., The targets R package: A dynamic make-like function-oriented pipeline toolkit for reproducibility and high-performance computing. J. Open Source Softw. 6, 2959 (2021). [Google Scholar]
  • 57.R. Cook, P. Kay, T. Regier, BK-dict.txt, BK-foci.txt, BK-term.txt, cnum-vhcm- lab-new.txt, foci-exp.txt, term.txt. WCS Data Archives. https://www1.icsi.berkeley.edu/wcs/data.html. Accessed 2 August 2017.
  • 58.C. R. Twomey, Code for Twomey et al. 2024 History constrains the evolution of efficient color naming, enabling historical inference. GitHub. https://github.com/crtwomey/twomey2024. Deposited 9 February 2024. [DOI] [PMC free article] [PubMed]
  • 59.C. R. Twomey, Code and data for Twomey et al. 2021 What we talk about when we talk about colors. GitHub. https://github.com/crtwomey/twomey2021. Deposited 9 September 2024.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix 01 (PDF)

pnas.2313603121.sapp.pdf (206.6KB, pdf)

Data Availability Statement

All data and code used in this work are publicly available online. The data are available via the WCS Data Archives hosted at https://www.icsi.berkeley.edu/wcs/data.html (57). The code to reproduce this work is available on GitHub at https://github.com/crtwomey/twomey2024 (58), and is based in part on code and data from Twomey et al. 2021 (59). The results in this manuscript were generated using R v4.2.3 and the targets package for reproducibility (56). All empirical data used in this study is from prior, publicly available data sources: namely, the World Color Survey (5).


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES