Abstract
How abstract is language? We show that abstractness pervades every corner of language, going far beyond the usual examples of freedom and justice. In the light of the ubiquity of abstract words, the need to understand where abstract meanings come from becomes ever more acute. We argue that the best source of knowledge about abstract meanings may be language itself. We then consider a seemingly unrelated question: Why isn't language more iconic? Iconicity—a resemblance between the form of words and their meanings—can be immensely useful in language learning and communication. Languages could be much more iconic than they currently are. So why aren't they? We suggest that one reason is that iconicity is inimical to abstraction because iconic forms are too connected to specific contexts and sensory depictions. Form–meaning arbitrariness may allow language to better convey abstract meanings.
This article is part of the theme issue ‘Varieties of abstract concepts: development, use and representation in the brain’.
Keywords: concepts, abstraction, iconicity, word meanings
1. Introduction
Where does abstract knowledge come from? Why isn't language more iconic? These two questions appear to be unconnected. We make the case that by considering them together, we can better understand the origins of abstract knowledge and the design principles of language.
We begin by briefly reviewing two major approaches to understanding semantic knowledge—embodied cognition, which emphasizes the importance of perceptual, motor and emotional experiences in our conceptual structure and word meanings (e.g. [1,2]), and what we gloss as amodal cognition (e.g. [3,4]), which emphasizes the role of symbolic and non-perceptual representations. Accounting for abstract knowledge has posed a challenge for both approaches, and we argue that the problem of abstract knowledge is even more acute than is often acknowledged by proponents of either view. To understand the origin of (some) abstract concepts, we argue that we need to turn to language itself. We discuss several ways that language can give rise to abstract concepts and then argue that this ability may require word forms to be arbitrarily related to their meanings, and so despite the many benefits of a more iconic language, iconicity may be a hindrance to expressing abstract meanings.
Iconicity refers to cases where a word form bears some resemblance to its meaning. This resemblance may be easy to detect, as in onomatopoetic words such as tweet, chirp, click and bang, or more subtle, as in a word like teeny, which is iconic because people have a robust association between smallness and the sound /i/ [5,6]. It is becoming increasingly clear that iconicity is widespread in both signed and spoken languages and offers advantages in both language learning and processing [7–11]. Given these advantages, one might expect languages to be more iconic than they presently are. So why aren't they? The answer, we argue, is that iconic words are too linked to specific referents and contexts, and so are less well suited for expressing abstractions. To support this view, we discuss novel empirical evidence which suggests that there is a tension between abstract meanings and iconicity such that to successfully convey abstract meanings, it is ‘better’ for a word for be arbitrary, or less iconic. Critically, to the extent that we owe many of our abstract concepts to our experiences with language (§3d), a more iconic language not only may make it more difficult to express abstract meanings, but may make it more difficult to learn abstract meanings in the first place.
2. Amodal versus modal approaches to representing semantic knowledge
(1) Lemons are yellow.
(2) Cats make meowing sounds.
(3) The USA was established as a representative democracy.
Understanding sentences like these requires having certain semantic knowledge. But where does this knowledge come from? Answers have spanned a continuum. On one end are approaches that emphasize the role of sensory, motor and affective experiences. This view, often glossed as ‘embodied cognition’, blurs the line between perception, action and cognition by positing that conceptual mental states used in understanding sentences like those above draw on (and, on some versions of embodied cognition, are identical to) mental states used in representing perceptual, motor and affective information. For example, to understand sentences (1) and (2), the comprehender would recruit visual representations of lemons and the auditory representations of cat sounds, respectively [1,12–16]. Importantly, these types of perceptual representations constitute both word meanings and conceptual knowledge itself.
Other perspectives—glossed here as amodal—reject the claim that perceptual, motor and affective knowledge is constitutive of semantic knowledge [4,17]. Of course, few would deny that people's knowledge of lemons and cats derives largely from real-world experiences with lemons and cats. Amodal approaches, however, downplay the importance of such experiences in forming the content of representations that are accessed by words. On the amodal position, words are mapped onto underlying conceptual states. Because the conceptual states are posited to be amodal, so are word meanings.
There are several strands of evidence for the embodied view. The first points to apparent ‘perceptual simulations' that appear to be formed in the course of processing language. When comprehending sentences such as The ranger saw the eagle in the sky [16] and John put the pencil in the cup [14], participants appear to form fairly specific perceptual representations of the mentioned objects, a result that is consistent with the idea that they understand the sentence through a perceptual simulation [17]. Converging evidence comes from functional magnetic resonance imaging (fMRI) and magnetoencephalography (MEG) studies which have shown rapid activation of modality-specific regions in response to modality-specific words [2]. While these results are all correlational, a growing number of studies show that perceptual, motor and affective systems are causally involved in language understanding. For example, Edmiston & Lupyan [18] showed that visual interference impaired participants' ability to respond to verbal questions probing visual knowledge such as whether alligators are green. This result shows visual representations to be causally involved in verbally accessed semantic knowledge of what things look like.
Supporters of the amodal position challenge embodied views on multiple grounds [19–22]. One major objection is that people with very different perceptual experiences can, nevertheless, have very similar conceptual content. For example, people born blind are fully capable of learning and using language and their conceptual structure appears to be very similar to that of sighted people [20,23,24], even of visual concepts such as colours [25,26]. This provides a strong challenge to the assumption that perceptual (or at least visual) experiences are central to conceptual and linguistic knowledge.
Another objection of amodal theorists is that the embodied view does not adequately explain the structure of abstract concepts like justice, or concepts for which we have no direct perceptual experience like atom, and that the meanings of ‘embodied’ concepts like grasp are more abstract than acknowledged by embodied theorists [4]. In the next section, we dwell on this objection and argue that language is indeed abstract—in fact, it is more abstract than is often acknowledged by both embodied and amodal theorists. We will then argue that the abstract nature of language may offer a solution to the problem of abstract knowledge.
3. How abstract is language?
(a). Defining abstractness
Many discussions of abstract word meanings have centred on words such as freedom, democracy and justice [4,27,28]. These abstract words are often contrasted with such concrete words as ball, dog and blinking. What does it mean to say that freedom is more abstract than ball? Abstractness is commonly defined in opposition to concreteness. A particularly clear definition of concreteness comes from Brysbaert et al. [29], who asked participants to place 40 000 English words on a concreteness/abstractness scale. Concrete words were defined as those that ‘refer to things or actions in reality, which you can experience directly through one of the five senses' ([29]; p. 906). Participants were told that if they tried to explain the meaning of a concrete word, they could point to the referent, or enact the meaning in some way:
To explain ‘sweet’ you could have someone eat sugar; To explain ‘jump’ you could simply jump up and down or show people a movie clip about someone jumping up and down; To explain ‘couch’, you could point to a couch or show a picture of a couch
Abstract words were defined as those that refer to ‘meanings that cannot be experienced directly, but which we know because the meanings can be defined by other words' ([29]; p. 906). An abstract word ‘refers to something that you cannot experience directly through your senses or actions. Its meaning depends on language. The easiest way to explain it is by using other words’.1
Suppose we wish to evaluate what people know about concepts referred to by concrete nouns (e.g. strawberry, toothbrush and leg) and concrete verbs (e.g. jump, fly and cut). We can assess such knowledge with or without language. For example, if a person knows that strawberries are red, we can show them pictures of strawberries in various colours and ask them to select the most real-looking one. Or we can hand a person a toothbrush and ask them to indicate how they would use it. Indeed, tests like this are commonly as a way of assessing semantic knowledge without the use of language, (e.g. [30]). The same option is not readily available for assessing the knowledge of abstract words. How does one probe the concept denoted by the words freedom or democracy without the use of language? We cannot simply show people a picture of these entities, precisely because they have no immediately perceptible physical manifestation. And so, although it is certainly possible that conceptual representations of such abstract meanings are entirely independent of language, it should give us pause that probing people's knowledge about these concepts seems to require language.
(b). Beyond freedom and justice: language is surprisingly abstract
According to the Brysbaert et al.'s [29] concreteness norms, the concreteness values of freedom, democracy and justice (1 = most abstract; 5 = most concrete) are, respectively, 2.34, 1.78 and 1.45. These words are clearly abstract. But the focus on such lofty meanings has, we believe, minimized the ubiquity of abstract meanings in language. Just within common nouns, which are the most concrete of all lexical classes (M = 3.53), the words fun (1.97), idea (1.61), chance (1.64) and trouble (2.25) are all judged as more abstract than freedom and are all much more frequent. Moving to other word classes further reveals the ubiquity of abstract words. While some verbs are judged as being concrete, e.g. skate (4.6) and blink (4.4), many more frequent verbs are rated as quite abstract, e.g. imagine (1.53), happen (1.78), enjoy (2.29) and agree (2.31). Everyday adverbs such as especially, maybe, already and never likewise are rated as being highly abstract (<1.60). The most concrete adjectives such as wooden (4.61) and bald (4.69) are dwarfed in number by much more abstract adjectives, such as pleasant (1.55), normal (1.40) and irrelevant (1.50).
As a further demonstration of just how much of English is abstract, suppose we select a random noun, verb or adjective weighed by its frequency.2 Using the SUBTLEX movie subtitle corpus, a widely used corpus in psycholinguistic research [31], we discover that we have a 59% chance of selecting a word that is above the median level of abstractness (M = 2.15). Example words in this part of the concrete/abstract distribution are extrovert, uncomfortable, innovating, immodest and flamboyant.
Even more striking results are obtained if we run the same analysis on a dataset in which abstractness and concreteness are operationalized in terms of sensory experience. Juhasz & Yap collected ratings that ‘reflect the extent to which a word evokes a sensory and/or perceptual experience in the mind of the reader’ [32, p. 160]. Participants rated words on a scale from 1 (no sensory experience, abstract) to 7 (maximal sensory experience, concrete). Among the words with the highest sensory experience ratings, we find garlic (6.56), walnut (6.50), music (6.0), humid (6.0) and hamster (5.6). Among the words with the lowest sensory experience, ratings are choice (1.0), though (1.09), mere (1.08), rite (1.10) and plural (1.18). These ratings are correlated with Brysbaert et al.'s [29] concreteness ratings only moderately (r = 0.4). Applying the same analysis to sensory experience ratings revealed a 73% chance of randomly picking an adjective, noun or verb with a less-than-median level of sensory experience.
We can demonstrate the ubiquity of abstract words further by extending this approach to multiple words. Figure 1 shows the cumulative probability of selecting words of various levels of abstractness in the SUBTLEX corpus. How many words before encountering a word at least as abstract as words like freedom, idea and fun? Figure 1 shows that the answer is surprisingly few. Given an utterance of only five words, there is a 73% chance of coming across a word that is as abstract as idea and 95% chance of coming across a word that is abstract as freedom.
Figure 1.
The cumulative probability of encountering a word at least as abstract as labelled by each line. The probabilities are derived from sampling from words from the SUBTLEX corpus in proportion to each word's frequency. (a) Analyses of words from Brysbaert et al. [29] concreteness norms, excluding closed-class words (n = 26 210). (b) The same analysis but excluding very frequent words such as other, here and is rather than all closed-class words (n = 28 931). (Online version in colour.)
As a final illustration of the ubiquity of abstract words, consider the following Yelp review of a Verizon store:
My fiancé upgraded his phone at the Apple store, but got a strange text on his new phone about his plan. We went over to the Verizon store to see what was up, and that was about as pleasant as having forks jammed in my eyes. Awful customer service. No one came up to ask us if we needed help. We had to tear a disgruntled man away from looking at his Instagram feed to help us. He didn't say there would be a wait or to sit or anything, just said ‘okay then’ and left us. So we stood there waiting for help or even just directions for far too long, were ignored and told him we were leaving. Awful awful awful customer service.
Removing from this paragraph all the words more abstract than the median rating according to the Brysbaert norms, we get the following:
Fiancé his phone Apple store, text on his phone his plan. We (Verizon) store see up, forks jammed in my eyes. Customer. One up us we. We tear man looking his (Instagram) feed us. He said us. We stood directions long, him we leaving. Customer.
These examples are not meant to minimize the abstractness of canonically abstract words like freedom and justice. These words are abstract, especially when compared with other nouns (freedom is more abstract than 82.8% of nouns in the Brysbaert norms; justice is more abstract than 99.5% of nouns). Rather, we wish to emphasize just how ubiquitous words with equal or greater levels of abstractness are in everyday discourse.3 Take away nouns like democracy and justice, and our language is hardly changed. Take away all the words more abstract than way, kind, think, make, easy, other, again and really (all words on the abstract end of the scale), and we lose the ability to talk about, well, most of what we talk about!
(c). Neither embodied nor amodal representations solve the problem of abstract meanings
Proponents of embodied theories have long recognized the necessity for such theories to address the representation of abstract knowledge [1], and this topic continues to be at the centre of debates about the format of semantic knowledge [33,34], as evidenced by this special issue. Nevertheless, reading the literature on embodied cognition, one gets the impression that much of linguistic communication revolves around concrete topics, the things we see, hear, feel, taste and smell in the here and now. Typical sentences used in embodied cognition experiments include sentences such as The ranger saw the eagle in the sky [16] and John put the pencil in the cup [14], or words such as kick and pick [35]. When the issue of abstract knowledge is raised, it is usually in the context of words such as freedom and democracy rather than the much more common words such as fun, idea, chance and trouble.
The amodal position faces its own version of the abstract meaning problem. Supporters of this position are right to point out that large differences in perceptual experience do not seem to have nearly the detrimental effect on semantic knowledge/word meanings that would be expected if the bulk of such knowledge was derived from perceptual experience. However, amodal theorists do not provide a compelling alternative. If semantic knowledge does not come from direct experience with the world, where does it come from? One solution is to posit innate ‘core knowledge’ systems that span various abstract domains including animacy, agency, causality and mathematics (e.g. [36–40]). But such knowledge is far too general to account for concepts that allegedly underlie the meanings of abstract words. How do we ever get from core knowledge, such as knowledge of animacy and basic event structure, to the categories ‘picked out’ by the thousands of abstract words we use every day?
(d). A possible solution
One solution to the problem of abstract meanings is to turn to language [28,41–45]. On this view, the knowledge underlying abstract concepts comes from language itself.4 What does it mean for knowledge to come from language? It is useful to distinguish between three ways that language can impact semantic knowledge:
(i). Language as a source of propositions
Most uncontroversially, language is a source of various propositions. Among these are: (1) relatively specific facts, e.g. that the mayor of Talkeetna, Alaska from 1997 to 2017, was a cat named Stubbs, (2) facts that help guide action, e.g. that sticking a fork in an electric outlet is a bad idea, and (3) more abstract knowledge, e.g. that a year is 365 days, that an even number is divisible exactly by two, and so on. No one, we think, would disagree that a sizable amount of our semantic knowledge is derived from such explicit uses of language (e.g. [48] for discussion), though a precise amount is difficult to quantify.
(ii). Language as a categorical cue
More controversially, language provides a kind of categorical overlay on the world. Rather than simply reflecting the pre-existing joints of nature, language may help to carve these joints (see [49,50] for reviews). Empirical evidence shows that learning verbal labels facilitates category formation beginning in early infancy (e.g. [51–53]) and continuing into adulthood [54]. Even for concrete meanings, language appears to have the effect of making representations more categorical and less linked to specific category exemplars [55–57]. For more abstract concepts, the role of labels is expected to grow [58]. Whereas there is perceptual information that can be used to distinguish, e.g. cats and dogs [59], such perceptual regularities simply do not exist for abstract meanings like those reviewed in the previous section. In the absence of these pre-existing joints, a learner can rely on evidence from language for guidance on what otherwise dissimilar entities should be grouped together and which similar entities should be categorically distinguished.
(iii). Language statistics as knowledge
We can learn facts, such as The sky is blue, by direct observation or because someone tells us. But there is a third possibility. It has long been recognized that the distributional structure of language provides an enormously rich source of knowledge. By the distributional structure, we have in mind Firth's dictum that ‘you shall know a word by the company it keeps' [60, p. 11]. ‘Blue’ co-occurs with ‘sky’ much more frequently than any other colour word. ‘Beard’ tends to co-occur with words related to men (including male names and pronouns). Such statistics scale in surprising ways. A basic machine-learning algorithm exposed to a corpus of English text can construct a fairly accurate map of Europe simply from observing the ways in which city names co-occur in various contexts [43]. Modern distributional models, such as word2vec [61], construct vectors representing word meanings from large corpora of text. Not only do these models yield similar vectors for words with similar meanings, but the vectors end up representing more abstract relationships such as temporal relationships between events (see [62] for discussion). Although not without limitations [63,64], the ability of such models to capture some aspects of abstract meanings simply from patterns of word use hints at the rich information conveyed by language statistics.
Showing that machines can learn certain things from the distributional statistics of language proves that the information is there, but it is a separate question whether people use this knowledge. Experimental evidence suggests that distributional patterns influence linguistic processing, showing that language users mentally represent these statistical patterns [43,65–67]. People's ability to learn from distributional patterns has some empirical support (e.g. [68–70]), but much more work is required to test the extent of such learning. The claim that exposure to language is needed to learn word meanings is hardly surprising. But what we are claiming is that when it comes to many (perhaps most) everyday meanings conveyed by abstract words, the specific category denoted by the meaning does not exist apart from language.
As an example consider the word fun. This word denotes a complex category that includes events (not reducible to the set of enjoyable events), people (a fun person, he wasn't very fun yesterday) and other complexities such as self-reference as when Dr Seuss writes that ‘It is fun to have fun, but you have to know how’ [71]. A person never exposed to the various ways that English speakers use this word would certainly lack the relevant word meaning. Would they nevertheless have the concept? We think not. Recall that on a traditional perspective, words are thought to map onto pre-existing concepts (e.g. [72]). But what is the pre-existing conceptual representation that fun would map onto? On our view, it is observing the same word used across many disparate contexts that helps create a category which otherwise does not exist. We can get a hint of the kind of information linguistic experiences with the word fun conveys by examining its semantic neighbourhood in a model of distributional semantics (word2vec trained on the Google News corpus). In the immediate neighbourhood of fun are related terms and phrases, such as wonderful, thoroughly enjoyable, awesome, laugh and exciting, as well as an eclectic collection of experiences some people might describe as fun: pumpkin carving contests, BMXing, camping, hiking, canoeing and toenail painting—activities that in the absence of a common linking word may share little in common. Recognizing the richness of linguistic input also helps to solve the otherwise puzzling observation that the language of blind people is quite normal. It is normal because blind people are exposed to approximately the same linguistic inputs as sighted people.
In the next section, we turn to the second question motivating this work: Why isn't language more iconic? We argue that while it is now increasingly recognized that language is more iconic (less arbitrary) than many have thought in the past, it could be far more iconic. This raises the possibility that iconicity is resisted by something; or conversely, that arbitrariness is ‘preferred’. We suggest that this something may be abstractness. If language is to be maximally useful for carving joints in nature—establishing categories where none exists—this function may be best filled by words whose forms have no resemblance to their referents. That is, language may be as arbitrary as it is because arbitrary words promote abstraction.
4. How iconic is language?
For centuries, iconicity in language was viewed as natural [73–75]. To the extent that a word's form can give a hint to its meaning, one can infer word meanings without having to learn each word anew. In this way, a more iconic language seems clearly superior to an arbitrary one. Indeed, the arbitrariness of contemporary languages was frequently viewed as a defect to be rectified [73]. With the dominance of structuralism, however, words were viewed as arbitrary signifiers, making arbitrariness the default in linguistic theorizing [76–78].
Recently, there has been a resurgence of interest in iconicity [7,9,79]. This new research shows iconicity to be more than a linguistic quirk limited to onomatopoeic words like buzz. Rather, it is a widespread design feature of both signed and spoken languages. For example, the ability of speakers of one language to guess the meaning of iconic words in other languages is higher than one might suppose [80–83]. The original question of whether language is predominantly arbitrary or predominantly iconic is now increasingly viewed as a false dichotomy, with researchers recognizing multiple interacting forms of iconicity that are interwoven with arbitrariness between two communicative design principles that are mutually compatible [8].
The renaissance of research on iconicity has clarified some previously reported associations between sounds and meanings, for example the tendency for many languages to use the high-front vowel /i/ in words for objects and animals of small physical size [5,6,84–86], and the tendency for languages to disproportionately use the nasal sounds /m/ and /n/ in words for ‘nose’ [79,87,88]. In addition to showing large-scale and widespread exceptions to arbitrariness, this research has also clarified the functions of iconicity.
One function is greater sensory vividness (for review, see [8,89,90]). Describing a sound with an iconic term like squealing appears to provide a more vivid impression of the described sound than paraphrasing the sound using relatively more arbitrary words, as when describing the same sound as loud and high-pitched. An increasing number of studies link iconicity to crossmodal correspondences and sensory imagery [91–93], and some initial evidence suggests that iconic words, compared with arbitrary words, may lead to greater activation in sensory brain areas [8].
Because iconic words essentially ‘give clues’ to their meanings, iconicity can facilitate word learning in children [94–97] and adults [98–101], as well as facilitate the learning of perceptual regularities underlying novel categories [102]. Using observational data, [103] showed that, controlling for numerous possible confounds, iconicity predicts age-of-acquisition: more iconic words are learned earlier by children. People also have a knack for creating iconic gestures [104,105] and vocalizations [106,107], and for understanding gestures and vocalizations created by others to express a wide range of meanings. Such advantages of iconic forms had led some to argue that iconicity played a key role in the evolution of language (e.g. [108]).
(a). Why aren't languages more iconic?
Languages are under pressure to adapt to the learning demands of their users [109,110]. Iconicity enables faster word learning, more vivid communication of sensory content and provides a means by which new word forms can be coined and (to some extent) immediately understood by others. Given these advantages, we might expect languages to not only be more iconic than they are, but to become more iconic over time. But although iconicity may play an important role in the origins of signed and spoken languages [7,9,104,106], languages appear to shed iconicity rather than increase it [111,112]. Why?
There are two common arguments for why language is not more iconic than it is. The first is that resemblance between words and meanings is only possible for a very small range of meanings, e.g. imitations of sounds in speech and imitations of shapes in gesture [113]. Although it is certainly true that some meanings can be ‘resembled’ in gesture and speech more easily than others (what would an iconic form of democracy look or sound like?), the iconic potential of language is substantially greater than what is realized in natural languages. This iconic potential is evidenced by the fact that people are highly adept at creating and interpreting novel vocalizations for expressing a wide range of meanings [107].
The second argument against more widespread iconicity is that if word forms resembled their meanings, then similar words would have similar meanings, leading to confusion. That is, widespread iconicity would lead to an increase in systematicity, which is defined as ‘a statistical relationship between the patterns of sound for a group of words and their usage’ [7, p. 604]. Systematicity is distinct from iconicity in that something can be systematic with or without being iconic [114]; however, if an iconic crossmodal correspondence (such as between /i/ and smallness) is productively used by speakers of a language, iconicity is expected to lead to systematicity. Although possibly beneficial for small vocabularies, systematicity can lead to confusion as the vocabulary grows. This argument is supported by computational simulations by Gasser [115] who presented vocabularies of different sizes to a simple connectionist network. The network was able to learn non-arbitrary form–meaning mappings more easily than arbitrary form–meaning mappings, but for large vocabulary sizes, arbitrariness became beneficial. Sidhu & Pexman [116] show that sparser semantic neighbourhoods (which are less prone to confusion) afford more iconicity. Arbitrariness has also been argued to have processing benefits [117,118] by allowing semantically similar words to be phonologically distinct, which can minimize interference and confusion. The potential for confusion, however, is not as problematic as it may appear because iconicity is not all-or-none. For example, as mentioned above, people associate vowels with physical size. This association means that one can have a system in which the names of large animals/objects contain back vowels and small animals/objects contain front vowels. Importantly, in such a system the words can contain additional phonological segments that distinguish the specific animals without reference to size. By combining iconicity and arbitrariness in the same word form, the potential for confusion can be greatly reduced.
(b). Iconicity limits abstraction and abstractness limits iconicity
We propose that one overlooked reason why languages are not more iconic is that iconicity is inimical to abstraction. To illustrate, consider again the word fun. Despite being abstract, one can imagine ways in which this word could be more iconic. In a signed modality, this could take the form of imitating a prototypical activity such as dancing (a student suggested ‘jazz hands’). In the vocal modality, the iconicity could incorporate phonological characteristics common to laughs or cheers. Note, however, that in doing so, the word form necessarily resembles a particular type of fun rather than a more abstract and generalizable idea of fun [119]. This is because iconic depiction is always selective [120,121]—only particular aspects of a word's meaning are expressed iconically.
This argument also extends to more concrete meanings. If our word for the concept ‘green’ is imitative of some typically green object or animal, then it cannot help but evoke a more specific exemplar of greenness, perhaps carrying with it other aspects of the referent that have nothing to do with colour. An arbitrary word for green, by being associated with a range of greens, can abstract away from specific shades of green [122]. And so, while iconic forms may indeed be easier to learn, to the extent that they resemble specific exemplars or a narrower range of contexts, they may make it more difficult to form more abstract representations in the first place (see also [123,124] for related observations for American and British Sign Languages).
A similar argument has been made for signed languages. Meir [119] discusses evidence from Israeli Sign Language (ISL) and American Sign Language which suggests that certain metaphoric extensions, which are associated with abstract words in English and Hebrew, are not possible in those languages because the corresponding signs are iconic. For example, in English, we can extend the primary meaning of the verb eat (consume food) to cases such as The acid ate the iron key. In ISL, this is not possible because the sign for ‘eat’ is executed at the mouth, an iconic depiction of a human eating event. This more specific type of eating is incompatible with a semantic extension to acid dissolving an iron key. As another example, Meir [119] discusses the metaphoric expression Time flies, which is impossible (or comical) in ISL as the sign for ‘flying’ iconically depicts a specific type of flying (flapping with one's wings).
The restriction of iconic phenomena with respect to metaphorical extension has also been noted for spoken languages. Speakers of English and other languages frequently use so-called synaesthetic metaphors, expressions that combine different sensory words such as rough melody or smooth taste. Research on such expressions has repeatedly found that words describing sounds are less likely to be extended to other sensory dimensions [125–127], e.g. it is possible to say dark sound and rough sound, but squealing colour and screeching feeling appear odd. While there are many possible explanations for this pattern, one possibility is that words for sounds, which are among the most iconic words in English [128] and in other languages [90], may be too iconic to allow for easy abstraction. As noted by Classen [129], ‘auditory terms are too echoic or suggestive of the sounds they represent to be used to characterize other sensory phenomena’ (p. 55). In line with this idea, Winter [130] showed that adjectives rated high on iconicity are less likely to modify nouns associated with other sensory domains. For example, loud is less iconic than squealing, and correspondingly loud colour is much more frequently attested than squealing colour.
The answer to why languages aren't more iconic may therefore be twofold: (i) iconicity, while benefiting learning, may tie a word too closely to a specific more concrete meaning, thereby preventing abstractness and generalization, and (ii) because so much of what adults talk about is abstract, iconicity is resisted even in semantic domains that lend themselves to iconic expression.
5. Evidence that abstractness resists iconicity
In this section, we provide some correlational evidence of an antagonistic relationship between iconicity and abstraction, suggesting that abstract words ‘prefer’ to be arbitrary perhaps because iconicity creates an association between a form and a more specific/more vivid meaning that limits the use of a word to certain contexts, and with it, the word's potential for abstractness.
(a). Iconic words are less abstract
The first evidence for an inverse relationship between iconicity and abstractness comes from previously reported associations between measures of concreteness [29], sensory vividness [32] and iconicity, as measured by participants' judgements of whether words sound like what they mean [103]. These analyses show that words rated high on iconicity are, on average, more concrete and have more sensory vividness than words judged to be arbitrary [116,128]. The special classes of ‘ideophones’ attested in many of the world's languages have been described as being ‘for’ the depiction of sensory imagery [90], and even ‘phonesthemes’, clusters of word–meaning correspondences such as glimmer, glitter, glitz, glisten (which all relate to particular light-reflectance patterns), often described as etymological accidents, tend to have primarily sensory meanings [130].
It is also worth noting that nearly all experimental demonstrations of iconicity have investigated concrete meanings [8], such as the famous kiki/bouba stimuli which are associated with spiky or round visual shapes [131,132]. To the extent that semantic generalizability of iconic forms has been experimentally demonstrated, this generalizability is of a limited type [83]. By pitting semantic notions of size (e.g. ‘mouse’ versus ‘elephant’) against the visual representation of size (e.g. small versus large images of mice and elephants), Auracher [133] was able to show that size iconicity goes beyond specific visual features. However, even in this case the semantic representations relate to a specific sensory characteristic, namely physical size. Another purported case of iconicity for abstract meanings comes from Maglio et al. [134], who showed that front vowels such as /i/ are related to conceptual precision, compared with back vowels such as /o/ and /u/. However, the authors note that the effect appears to stem from (concrete) differences in ‘size’ conveyed by phonological cues. Thus, almost all forms of iconicity discussed in the experimental and linguistic literature are sensory in nature.
(b). Abstract words that are iconic seem more concrete
Finding an inverse relationship between iconicity and concreteness (even controlling for such factors as part-of-speech and frequency) may simply indicate that people have a bias to judge more concrete words as being iconic. To examine whether there is more to it than that we re-analysed data from a recent study by Pexman et al. [135], who conducted a speeded classification task in which participants were shown words varying in concreteness and had to indicate, as quickly as possible, whether the word was concrete or abstract (a binary distinction in Pexman et al.'s study). We reasoned that if abstract words that are relatively more iconic activate a more specific semantic representation, then participants should make more errors classifying iconic-abstract words as concrete compared with arbitrary-abstract words.
To test this prediction, we regressed mean accuracy reported by Pexman et al. [135] on our measure of iconicity and controlling for part-of-speech, and log word-frequency as fixed factors. Overall, iconicity was not related to accuracy (t < 1). However, there was a significant (though small) concreteness-by-iconicity interaction, b = 0.95, 95% CI = [0.07, 1.84], t = 2.13, p = 0.03 (figure 2a). Accuracy for concrete words (i.e. words for which the correct answer was ‘concrete’) was unaffected by iconicity, b = −0.46, 95% CI = [−1.42, 0.51], t = −0.93, p = 0.35. Accuracy for abstract words was significantly negatively associated with iconicity, b = −3.15, 95% CI = [−4.79, –0.78], t = −3.15, p = 0.004. Abstract words with iconicity values below the 25th percentile were classified as ‘abstract’ with 86% accuracy. Words that had iconicity values above the 75th percentile (and which were rated off-line as equally as abstract as the less iconic words) were classified in the task as ‘abstract’ at an accuracy of 79%. This result provides initial evidence—in need of further confirmation—that more iconic words evoke more specific meanings than less iconic words even when controlling for previously rated abstractness.5 The code for all analyses presented here is available at https://osf.io/b9fhx/.
Figure 2.
(a) Re-analysis of data reported by Pexman et al. [135]. More iconic-abstract words are more likely to be mistaken for concrete words. The plotted accuracy measure partials out reaction times, word-frequency and part-of-speech. (b) More iconic words occur in fewer contexts. Coloured lines show slope estimates for selected parts of speech (adv., adverb; prep., preposition; inter., interjection; adj., adjective). Black line shows the overall effect for all parts of speech.
(c). Iconic words are tied to more specific contexts
If iconic words resist abstraction, a further prediction is that they should occur in a narrower range of contexts. We have already discussed this idea in the context of iconicity resticting semantic extensions in metaphorical contexts (see above, [119]). We can also test this prediction in a more general way by correlating iconicity with several measures of contextual diversity while controlling for various possible confounds.
One coarse measure of contextual diversity comes from the SUBTLEX corpus and is simply the number of movies in which a given word appears. This simple measure is more predictive of reading and lexical-decision times than word-frequency [137]. Contextual diversity (log-transformed) was significantly associated with iconicity. Controlling for concreteness and part-of-speech, more iconic words occurred in fewer contexts, b = −0.16, 95% CI = [−0.19, −0.14], t = −10.87, p < 0.00005. This negative association remained highly significant when further controlling for SUBTLEX word-frequency, b = −0.01, 95% CI = [−0.02, −0.002], t = −2.63, p = 0.009. Significant negative associations are also found when we use the (log-transformed) number of documents in which a given word occurred in the British National Corpus [138] while controlling for concreteness and word-frequency, and part-of-speech, b = −0.15, 95% CI = [−0.18, −0.11], t = −7.33, p < 0.00005. These relationships become even stronger when we exclude closed-class words.
Another measure of contextual diversity involves not simply counting contexts, but evaluating the heterogeneity of those contexts using distributional statistics. Such a measure, termed semantic distinctiveness [138], is also negatively associated with iconicity when controlling for word-frequency and part-of-speech: b = −0.02, 95% CI = [−0.03, −0.01], t = −4.02, p = 0.00006, though the association does not survive further controlling for concreteness, b = −0.005, t = −1.05. Thus, a number of independent measures (SUBTLEX contextual diversity, BNC contextual diversity and distributional statistics) point in the same direction: iconicity limits the reusability of words, tying them to a narrower range of contexts.
6. Conclusion
We began by considering two seemingly unconnected questions: (i) Where does abstract knowledge come from?, (ii) Why isn't language more iconic? Neither embodied nor amodal theories of semantic knowledge provide satisfactory answers to the first question. Despite acknowledging the need to understand where knowledge underlying these word meanings comes from, the embodied position has, we believe, focused too narrowly on concrete concepts, neglecting the extent to which the meanings we use in everyday language are abstract. Examining the distribution of abstract words in language makes it clear just how ubiquitous they are. Far from being limited to meanings like democracy and freedom—common examples in discussions of abstract meanings—everyday language is filled with abstract words such as happen, fun, sometimes and enjoy. Given an utterance of only five words, there is about a 95% chance of coming across a word as abstract as freedom.
Amodal theorists, while long criticizing the embodied view for failing to fully acknowledge abstractness, have, for their part, tended to overlook a major source of knowledge of abstract meanings: language itself. We have argued that language is a key source of guidance not just for learning how to use these English words appropriately, but for forming the conceptual representations that underlie these meanings.
In §§4 and 5, we argued that a design feature of language that may facilitate abstraction is form-to-meaning arbitrariness. Although arbitrariness in language is often taken for granted, the extent to which languages are arbitrary is surprising given the many benefits of non-arbitrary (iconic) word forms and the potential for language to be much more iconic than it is. As an explanation of why languages are not as iconic as they could be, we suggest that iconicity interferes with abstraction because to be iconic requires resembling some aspect of meaning. By moving away from iconic resemblance, words can take on a life of their own, helping to carve joints into nature.
Endnotes
This definition of abstract meanings appears to imply a causal connection between having/using certain words and being able to express (and more provocatively, to entertain in the first place) certain abstract meanings.
This analysis is based on the cumulative frequency of tokens, while the median abstractness is based on types. R Code for these and subsequent analyses can be accessed at https://osf.io/b9fhx/.
Aside from the ubiquity of highly abstract content words, languages are replete with highly abstract function words such as the, it and to.
We do not wish to diminish the likely importance of other cognitive structures, such as affective and emotional representations [17,33,34], and metaphorical connections between abstract concepts and concrete ones [46,47].
Although the concreteness-by-iconicity interaction was significant, more iconic ‘concrete’ words were not classified by participants as concrete at greater rates than less iconic ‘concrete’ words. Further investigation is necessary to determine if the iconicity effect we observed here is, in fact, limited to abstract words. In addition, it has to be noted that we are using the accuracy summary data from Pexman et al. [135] and analyse it using linear models, which has been argued to lead to anti-conservative estimates [136].
Data accessibility
The code for all analyses presented here is available at https://osf.io/b9fhx/.
Competing interests
We declare we have no competing interests.
Funding
This work was partially funded by NSF-BCS 1734260 to G.L.
References
- 1.Barsalou LW. 1999. Perceptual symbol systems. Behav. Brain Sci. 22, 577–609; discussion 610–660. [DOI] [PubMed] [Google Scholar]
- 2.Pulvermüller F. 2013. How neurons make meaning: brain mechanisms for embodied and abstract-symbolic semantics. Trends Cogn. Sci. 17, 458–470. ( 10.1016/j.tics.2013.06.004) [DOI] [PubMed] [Google Scholar]
- 3.Machery E. 2007. Concept empiricism: a methodological critique. Cognition 104, 19–46. ( 10.1016/j.cognition.2006.05.002) [DOI] [PubMed] [Google Scholar]
- 4.Mahon BZ, Caramazza A. 2008. A critical look at the embodied cognition hypothesis and a new proposal for grounding conceptual content. J. Physiol. Paris 102, 59–70. ( 10.1016/j.jphysparis.2008.03.004) [DOI] [PubMed] [Google Scholar]
- 5.Sapir E. 1929. A study in phonetic symbolism. J. Exp. Psychol. 12, 225 ( 10.1037/h0070931) [DOI] [Google Scholar]
- 6.Thompson PD, Estes Z. 2011. Sound symbolic naming of novel objects is a graded function. Q. J. Exp. Psychol. 64, 2392–2404. ( 10.1080/17470218.2011.605898) [DOI] [PubMed] [Google Scholar]
- 7.Dingemanse M, Blasi DE, Lupyan G, Christiansen MH, Monaghan P. 2015. Arbitrariness, iconicity, and systematicity in language. Trends Cogn. Sci. 19, 603–615. ( 10.1016/j.tics.2015.07.013) [DOI] [PubMed] [Google Scholar]
- 8.Lockwood G, Dingemanse M. 2015. Iconicity in the lab: a review of behavioral, developmental, and neuroimaging research into sound-symbolism. Front. Psychol. 6, 1246 ( 10.3389/fpsyg.2015.01246) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Perniss P, Thompson RL, Vigliocco G. 2010. Iconicity as a general property of language: evidence from spoken and signed languages. Front. Psychol. 1, 227 ( 10.3389/fpsyg.2010.00227) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Perry LK, Perlman M, Winter B, Massaro DW, Lupyan G. 2017. Iconicity in the speech of children and adults. Dev. Sci. 21, e12572 ( 10.1111/desc.12572) [DOI] [PubMed] [Google Scholar]
- 11.Schmidtke D, Conrad M, Jacobs AM. 2014. Phonological iconicity. Front. Psychol. 5, 80 ( 10.3389/fpsyg.2014.00080) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Glenberg AM. 1997. What memory is for. Behav. Brain Sci. 20, 1–19. [DOI] [PubMed] [Google Scholar]
- 13.Kaschak MP, Zwaan RA, Aveyard M, Yaxley RH. 2006. Perception of auditory motion affects language processing. Cogn. Sci. 30, 733–744. ( 10.1207/s15516709cog0000_54) [DOI] [PubMed] [Google Scholar]
- 14.Stanfield RA, Zwaan RA. 2001. The effect of implied orientation derived from verbal context on picture recognition. Psychol. Sci. 12, 153–156. ( 10.1111/1467-9280.00326) [DOI] [PubMed] [Google Scholar]
- 15.Winter B, Bergen B. 2012. Language comprehenders represent object distance both visually and auditorily. Lang. Cogn. 4, 1–16. ( 10.1515/langcog-2012-0001) [DOI] [Google Scholar]
- 16.Zwaan RA, Stanfield RA, Yaxley RH. 2002. Language comprehenders mentally represent the shapes of objects. Psychol. Sci. 13, 168–171. ( 10.1111/1467-9280.00430) [DOI] [PubMed] [Google Scholar]
- 17.Meteyard L, Cuadrado SR, Bahrami B, Vigliocco G. 2012. Coming of age: a review of embodiment and the neuroscience of semantics. Cortex 48, 788–804. ( 10.1016/j.cortex.2010.11.002) [DOI] [PubMed] [Google Scholar]
- 18.Edmiston P, Lupyan G. 2017. Visual interference disrupts visual knowledge. J. Mem. Lang. 92, 281–292. ( 10.1016/j.jml.2016.07.002) [DOI] [Google Scholar]
- 19.Bedny M, Caramazza A, Grossman E, Pascual-Leone A, Saxe R. 2008. Concepts are more than percepts: the case of action verbs. J. Neurosci. 28, 11 347–11 353. ( 10.1523/JNEUROSCI.3039-08.2008) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Bedny M, Saxe R. 2012. Insights into the origins of knowledge from the cognitive neuroscience of blindness. Cogn. Neuropsychol. 29, 56–84. ( 10.1080/02643294.2012.713342) [DOI] [PubMed] [Google Scholar]
- 21.Leshinskaya A, Caramazza A. 2016. For a cognitive neuroscience of concepts: moving beyond the grounding issue. Psychon. Bull. Rev. 23, 991–1001. ( 10.3758/s13423-015-0870-z) [DOI] [PubMed] [Google Scholar]
- 22.Mahon BZ, Hickok G. 2016. Arguments about the nature of concepts: symbols, embodiment, and beyond. Psychon. Bull. Rev. 23, 941–958. ( 10.3758/s13423-016-1045-2) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Connolly AC, Gleitman LR, Thompson-Schill SL. 2007. Effect of congenital blindness on the semantic representation of some everyday concepts. Proc. Natl Acad. Sci. USA 104, 8241–8246. ( 10.1073/pnas.0702812104) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kerr NH, Johnson TH. 1991. Word norms for blind and sighted subjects: familiarity, concreteness, meaningfulness, imageability, imagery modality, and word associations. Behav. Res. Methods Instrum. Comput. 23, 461–485. ( 10.3758/BF03209988) [DOI] [Google Scholar]
- 25.Marmor GS. 1978. Age at onset of blindness and the development of the semantics of color names. J. Exp. Child Psychol. 25, 267–278. ( 10.1016/0022-0965(78)90082-6) [DOI] [PubMed] [Google Scholar]
- 26.Shepard RN, Cooper LA. 1992. Representation of colors in the blind, color-blind, and normally sighted. Psychol. Sci. 3, 97–104. ( 10.1111/j.1467-9280.1992.tb00006.x) [DOI] [Google Scholar]
- 27.Barsalou LW, Wiemer-Hastings K. 2005. Situating abstract concepts. In Grounding cognition: the role of perception and action in memory, language, and thinking (eds Pecher D, Zwaan RA), pp. 129–163. Cambridge, UK: Cambridge University Press. [Google Scholar]
- 28.Borghi AM, Binkofski F. 2014. The WAT proposal and the role of language. In Words as social tools: an embodied view on abstract concepts, pp. 19–37. New York, NY: Springer. [Google Scholar]
- 29.Brysbaert M, Warriner AB, Kuperman V. 2014. Concreteness ratings for 40 thousand generally known English word lemmas. Behav. Res. Methods 46, 904–911. ( 10.3758/s13428-013-0403-5) [DOI] [PubMed] [Google Scholar]
- 30.Bozeat S, Lambon-Ralph MA, Patterson K, Garrard P, Hodges JR. 2000. Non-verbal semantic impairment in semantic dementia. Neuropsychologia 38, 1207–1215. ( 10.1016/S0028-3932(00)00034-8) [DOI] [PubMed] [Google Scholar]
- 31.Brysbaert M, New B. 2009. Moving beyond Kučera and Francis: a critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English. Behav. Res. Methods 41, 977–990. ( 10.3758/BRM.41.4.977) [DOI] [PubMed] [Google Scholar]
- 32.Juhasz BJ, Yap MJ. 2013. Sensory experience ratings for over 5,000 mono- and disyllabic words. Behav. Res. Methods 45, 160–168. ( 10.3758/s13428-012-0242-9) [DOI] [PubMed] [Google Scholar]
- 33.Kousta S-T, Vigliocco G, Vinson DP, Andrews M, Del Campo E. 2011. The representation of abstract words: why emotion matters. J. Exp. Psychol. Gen. 140, 14 ( 10.1037/a0021446) [DOI] [PubMed] [Google Scholar]
- 34.Vigliocco G, Kousta S-T, Della Rosa PA, Vinson DP, Tettamanti M, Devlin JT, Cappa SF. 2013. The neural representation of abstract words: the role of emotion. Cereb. Cortex 24, 1767–1777. ( 10.1093/cercor/bht025) [DOI] [PubMed] [Google Scholar]
- 35.Pulvermüller F. 2005. Brain mechanisms linking language and action. Nat. Rev. Neurosci. 6, 576–582. ( 10.1038/nrn1706) [DOI] [PubMed] [Google Scholar]
- 36.Dehaene S, Izard V, Pica P, Spelke E. 2006. Core knowledge of geometry in an Amazonian indigene group. Science 311, 381–384. ( 10.1126/science.1121739) [DOI] [PubMed] [Google Scholar]
- 37.Hafri A, Papafragou A, Trueswell JC. 2013. Getting the gist of events: recognition of two-participant actions from brief displays. J. Exp. Psychol. Gen. 142, 880–905. ( 10.1037/a0030045) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Spelke ES, Kinzler KD. 2007. Core knowledge. Dev. Sci. 10, 89–96. ( 10.1111/j.1467-7687.2007.00569.x) [DOI] [PubMed] [Google Scholar]
- 39.Strickland B. 2017. Language reflects ‘core’ cognition: a new theory about the origin of cross-linguistic regularities. Cogn. Sci. 41, 70–101. ( 10.1111/cogs.12332) [DOI] [PubMed] [Google Scholar]
- 40.Vallortigara G. 2012. Core knowledge of object, number, and geometry: a comparative and neural approach. Cogn. Neuropsychol. 29, 213–236. ( 10.1080/02643294.2012.654772) [DOI] [PubMed] [Google Scholar]
- 41.Borghi AM, Binkofski F, Castelfranchi C, Cimatti F, Scorolli C, Tummolini L. 2017. The challenge of abstract concepts. Psychol. Bull. 143, 263–292. ( 10.1037/bul0000089) [DOI] [PubMed] [Google Scholar]
- 42.Landauer TK. 1997. A solution to Plato's problem: the latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychol. Rev. 104, 211–240. ( 10.1037/0033-295X.104.2.211) [DOI] [Google Scholar]
- 43.Louwerse M. 2011. Symbol interdependency in symbolic and embodied cognition. Top. Cogn. Sci. 3, 273–302. ( 10.1111/j.1756-8765.2010.01106.x) [DOI] [PubMed] [Google Scholar]
- 44.Lupyan G. 2012. Linguistically modulated perception and cognition: the label-feedback hypothesis. Front. Cogn. 3, 54 ( 10.3389/fpsyg.2012.00054) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Lupyan G, Bergen B. 2016. How language programs the mind. Top. Cogn. Sci. 8, 408–424. ( 10.1111/tops.12155) [DOI] [PubMed] [Google Scholar]
- 46.Gibbs RW. 1994. The poetics of mind: figurative thought, language, and understanding. Cambridge, UK: Cambridge University Press. [Google Scholar]
- 47.Lakoff G, Johnson M. 1980. Metaphors we live by. Chicago, IL: University of Chicago Press. [Google Scholar]
- 48.Bloom P, Keil FC. 2001. Thinking through language. Mind Lang. 16, 351–367. ( 10.1111/1468-0017.00175) [DOI] [Google Scholar]
- 49.Lupyan G. 2012. What do words do? Towards a theory of language-augmented thought. In The psychology of learning and motivation (ed. Ross BH.), pp. 255–297. Waltham, MA: Academic Press. [Google Scholar]
- 50.Lupyan G. 2016. The centrality of language in human cognition. Lang. Learn. 66, 516–553. ( 10.1111/lang.12155) [DOI] [Google Scholar]
- 51.Althaus N, Mareschal D. 2014. Labels direct infants' attention to commonalities during novel category learning. PLoS ONE 9, e99670 ( 10.1371/journal.pone.0099670) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Ferry AL, Hespos SJ, Waxman SR. 2010. Categorization in 3- and 4-month-old infants: an advantage of words over tones. Child Dev. 81, 472–479. ( 10.1111/j.1467-8624.2009.01408.x) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Plunkett K, Hu J-F, Cohen LB. 2008. Labels can override perceptual categories in early infancy. Cognition 106, 665–681. ( 10.1016/j.cognition.2007.04.003) [DOI] [PubMed] [Google Scholar]
- 54.Lupyan G, Rakison DH, McClelland JL. 2007. Language is not just for talking: labels facilitate learning of novel categories. Psychol. Sci. 18, 1077–1082. ( 10.1111/j.1467-9280.2007.02028.x) [DOI] [PubMed] [Google Scholar]
- 55.Boutonnet B, Lupyan G. 2015. Words jump-start vision: a label advantage in object recognition. J. Neurosci. 32, 9329–9335. ( 10.1523/JNEUROSCI.5111-14.2015) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Edmiston P, Lupyan G. 2015. What makes words special? Words as unmotivated cues. Cognition 143, 93–100. ( 10.1016/j.cognition.2015.06.008) [DOI] [PubMed] [Google Scholar]
- 57.Lupyan G, Thompson-Schill SL. 2012. The evocative power of words: activation of concepts by verbal and nonverbal means. J. Exp. Psychol. Gen. 141, 170–186. ( 10.1037/a0024904) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Gentner D, Boroditsky L. 2001. Individuation, relational relativity and early word learning. In Language acquisition and conceptual development (eds Bowerman M, Levinson SC), pp. 215–256. Cambridge, UK: Cambridge University Press. [Google Scholar]
- 59.Quinn PC, Eimas PD, Tarr MJ. 2001. Perceptual categorization of cat and dog silhouettes by 3- to 4-month-old infants. J. Exp. Child Psychol. 79, 78–94. ( 10.1006/jecp.2000.2609) [DOI] [PubMed] [Google Scholar]
- 60.Palmer FR. (ed.) 1968. Selected Papers of J. R. Firth 1952–59, pp. 168–205. London, UK: Longmans.
- 61.Mikolov T, Chen K, Corrado G, Dean J.2013. Efficient estimation of word representations in vector space. (https://arxiv.org/pdf/1301.3781. )
- 62.Lupyan G, Lewis M. In press. From words-as-mappings to words-as-cues: the role of language in semantic knowledge. Lang. Cogn. Neurosci. 33 ( 10.1080/23273798.2017.1404114) [DOI] [Google Scholar]
- 63.Asr F, Willits J, Jones M. 2016. Comparing predictive and co-occurrence based models of lexical semantics trained on child-directed speech In Proc. 37th Meeting Cogn. Sci. Soc, Philadelphia, PA, 10–13 August 2016 (eds Papafragou A, Grodner D, Mirman D, Trueswell JC), pp. 1092–1097. Austin, TX: Cognitive Science Society. [Google Scholar]
- 64.Chen D, Peterson JC, Griffiths TL. 2017. Evaluating vector-space models of analogy (https://arxiv.org/pdf/1705.04416)
- 65.Louwerse M, Connell L. 2011. A taste of words: linguistic context and perceptual simulation predict the modality of words. Cogn. Sci. 35, 381–398. ( 10.1111/j.1551-6709.2010.01157.x) [DOI] [PubMed] [Google Scholar]
- 66.Louwerse M, Hutchinson S. 2012. Neurological evidence linguistic processes precede perceptual simulation in conceptual processing. Front. Psychol. 3, 385 ( 10.3389/fpsyg.2012.00385) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Louwerse MM, Jeuniaux P. 2010. The linguistic and embodied nature of conceptual processing. Cognition 114, 96–104. ( 10.1016/j.cognition.2009.09.002) [DOI] [PubMed] [Google Scholar]
- 68.Cabrera A, Billman D. 1996. Language-driven concept learning: deciphering Jabberwocky. J. Exp. Psychol. Learn. Mem. Cogn. 22, 539–555. ( 10.1037/0278-7393.22.2.539) [DOI] [PubMed] [Google Scholar]
- 69.Johnson MA, Goldberg AE. 2013. Evidence for automatic accessing of constructional meaning: Jabberwocky sentences prime associated verbs. Lang. Cogn. Process. 28, 1439–1452. ( 10.1080/01690965.2012.717632) [DOI] [Google Scholar]
- 70.Ouyang L, Boroditsky L, Frank MC. 2017. Semantic coherence facilitates distributional learning. Cogn. Sci. 41, 855–884. ( 10.1111/cogs.12360) [DOI] [PubMed] [Google Scholar]
- 71.Seuss D. 1957 The cat in the hat, p. 18. New York: Random House. [Google Scholar]
- 72.Gleitman L, Fisher C. 2005. Universal aspects of word learning. In The Cambridge companion to Chomsky (ed. McGilvray J.), pp. 123–142. New York, NY: Cambridge University Press. [Google Scholar]
- 73.Eco U. 1997. The search for the perfect language. Hoboken, NJ: Wiley-Blackwell. [Google Scholar]
- 74.Jakobson R. 1929. Roman Jakobson--selected writings I: phonological studies. 1st edn Berlin, Germany: Mouton. [Google Scholar]
- 75.Plato. 1999. Cratylus. (Transl. by CDC Reeve; Reprinted with corrections). Indianapolis, IN: Hackett. [Google Scholar]
- 76.de Saussure F. 1959. Course in general linguistics. New York, NY: Philosophical Library. [Google Scholar]
- 77.Newmeyer FJ. 1992. Iconicity and generative grammar. Language 68, 756–796. ( 10.1353/lan.1992.0047) [DOI] [Google Scholar]
- 78.Hockett CF. 1960. The origin of speech. Sci. Am. 203, 88–96. ( 10.1038/scientificamerican0960-88) [DOI] [PubMed] [Google Scholar]
- 79.Blasi DE, Wichmann S, Hammarström H, Stadler PF, Christiansen MH. 2016. Sound–meaning association biases evidenced across thousands of languages. Proc. Natl Acad. Sci. USA 113, 10 818–10 823. ( 10.1073/pnas.1605782113) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Brown RW, Black AH, Horowitz AE. 1955. Phonetic symbolism in natural languages. J. Abnorm. Soc. Psychol. 50, 388 ( 10.1037/h0046820) [DOI] [PubMed] [Google Scholar]
- 81.Klank LJ, Huang Y-H, Johnson RC. 1971. Determinants of success in matching word pairs in tests of phonetic symbolism. J. Verbal Learn. Verbal Behav. 10, 140–148. ( 10.1016/S0022-5371(71)80005-1) [DOI] [Google Scholar]
- 82.Patel AD, Iversen JR. 2003. Acoustic and perceptual comparison of speech and drum sounds in the north Indian tabla tradition: an empirical study of sound symbolism. In Proc. 15th Int. Congr. Phonetic Sciences (ICPhS) (eds Solé M-J, Recansens D, Romero J), pp. 925–928. Barcelona: Universitat Autònoma de Barcelona. [Google Scholar]
- 83.Tzeng CY, Nygaard LC, Namy LL. 2017. The specificity of sound symbolic correspondences in spoken language. Cogn. Sci. 41, 2191–2220. ( 10.1111/cogs.12474) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Berlin B. 2006. The First Congress of Ethnozoological Nomenclature. J. R. Anthropol. Inst. 12, S23–S44. ( 10.1111/j.1467-9655.2006.00271.x) [DOI] [Google Scholar]
- 85.Diffloth G. 1994. I: big, a: small. In Sound symbolism (eds Ohala JJ, Hinton L, Nichols J), pp. 107–113. Cambridge, UK: Cambridge University Press. [Google Scholar]
- 86.Ultan R. 1978. Size–sound symbolism. In Universals of human language (ed. Greenberg JH.), pp. 525–568. Stanford, CA: Stanford University Press. [Google Scholar]
- 87.Blust RA. 2003. The phonestheme n- in Austronesian languages. Ocean. Linguist. 42, 187–212. ( 10.1353/ol.2003.0001) [DOI] [Google Scholar]
- 88.Urban M. 2011. Conventional sound symbolism in terms for organs of speech: a cross-linguistic study. Folia Linguistica 45, 199–213. ( 10.1515/flin.2011.007) [DOI] [Google Scholar]
- 89.Dingemanse M. 2011. Ideophones and the aesthetics of everyday language in a West-African society. Senses Soc. 6, 77–85. ( 10.2752/174589311X12893982233830) [DOI] [Google Scholar]
- 90.Dingemanse M. 2012. Advances in the cross-linguistic study of ideophones. Lang. Linguist. Compass 6, 654–672. ( 10.1002/lnc3.361) [DOI] [Google Scholar]
- 91.Sidhu DM, Pexman PM. In press. Five mechanisms of sound symbolic association. Psychon. Bull. Rev. ( 10.3758/s13423-017-1361-1) [DOI] [PubMed] [Google Scholar]
- 92.Spence C. 2012. Managing sensory expectations concerning products and brands: capitalizing on the potential of sound and shape symbolism. J. Consum. Psychol. 22, 37–54. ( 10.1016/j.jcps.2011.09.004) [DOI] [Google Scholar]
- 93.Spence C, Gallace A. 2011. Tasting shapes and words. Food Qual. Prefer. 22, 290–295. ( 10.1016/j.foodqual.2010.11.005) [DOI] [Google Scholar]
- 94.Imai M, Miyazaki M, Yeung HH, Hidaka S, Kantartzis K, Okada H, Kita S. 2015. Sound symbolism facilitates word learning in 14-month-olds. PLoS ONE 10, e0116494 ( 10.1371/journal.pone.0116494) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Imai M, Kita S, Nagumo M, Okada H. 2008. Sound symbolism facilitates early verb learning. Cognition 109, 54–65. ( 10.1016/j.cognition.2008.07.015) [DOI] [PubMed] [Google Scholar]
- 96.Kantartzis K, Imai M, Kita S. 2011. Japanese sound-symbolism facilitates word learning in English-speaking children. Cogn. Sci. 35, 575–586. ( 10.1111/j.1551-6709.2010.01169.x) [DOI] [Google Scholar]
- 97.Yoshida H. 2012. A cross-linguistic study of sound symbolism in children's verb learning. J. Cogn. Dev. 13, 232–265. ( 10.1080/15248372.2011.573515) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Lockwood G, Dingemanse M, Hagoort P. 2016. Sound-symbolism boosts novel word learning. J. Exp. Psychol. Learn. Mem. Cogn. 42, 1274 ( 10.1037/xlm0000235) [DOI] [PubMed] [Google Scholar]
- 99.Lockwood G, Hagoort P, Dingemanse M. 2016. How iconicity helps people learn new words: neural correlates and individual differences in sound-symbolic bootstrapping. Collabra Psychol. 2, 7 ( 10.1525/collabra.42) [DOI] [Google Scholar]
- 100.Nielsen A, Rendall D. 2012. The source and magnitude of sound-symbolic biases in processing artificial word material and their implications for language learning and transmission. Lang. Cogn. 4, 115–125. ( 10.1515/langcog-2012-0007) [DOI] [Google Scholar]
- 101.Nygaard LC, Cook AE, Namy LL. 2009. Sound to meaning correspondences facilitate word learning. Cognition 112, 181–186. ( 10.1016/j.cognition.2009.04.001) [DOI] [PubMed] [Google Scholar]
- 102.Lupyan G, Casasanto D. 2015. Meaningless words promote meaningful categorization. Lang. Cogn. 7, 167–193. ( 10.1017/langcog.2014.21) [DOI] [Google Scholar]
- 103.Perry LK, Perlman M, Lupyan G. 2015. Iconicity in English and Spanish and its relation to lexical category and age of acquisition. PLoS ONE 10, e0137147 ( 10.1371/journal.pone.0137147) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Fay N, Arbib M, Garrod S. 2013. How to bootstrap a human communication system. Cogn. Sci. 37, 1356–1367. ( 10.1111/cogs.12048) [DOI] [PubMed] [Google Scholar]
- 105.Fay N, Lister CJ, Ellison TM, Goldin-Meadow S. 2014. Creating a communication system from scratch: gesture beats vocalization hands down. Front. Psychol. 5, 354 ( 10.3389/fpsyg.2014.00354) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Perlman M, Dale R, Lupyan G. 2015. Iconicity can ground the creation of vocal symbols. R. Soc. open sci. 2, 150152 ( 10.1098/rsos.150152) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Perlman M, Lupyan G. 2018. People can create iconic vocalizations to communicate various meanings to naïve listeners. Sci. Rep. 8, 2634 ( 10.1038/s41598-018-20961-6) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Imai M, Kita S. 2014. The sound symbolism bootstrapping hypothesis for language acquisition and language evolution. Phil. Trans. R. Soc. B 369, 20130298 ( 10.1098/rstb.2013.0298) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Christiansen MH, Chater N. 2008. Language as shaped by the brain. Behav. Brain Sci. 31, 489–509. ( 10.1017/S0140525X08004998) [DOI] [PubMed] [Google Scholar]
- 110.Kirby S, Cornish H, Smith K. 2008. Cumulative cultural evolution in the laboratory: an experimental approach to the origins of structure in human language. Proc. Natl Acad. Sci. USA 105, 10 681–10 686. ( 10.1073/pnas.0707835105) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Frishberg N. 1975. Arbitrariness and iconicity: historical change in American sign language. Language 51, 696–719. ( 10.2307/412894) [DOI] [Google Scholar]
- 112.Garrod S, Fay N, Lee J, Oberlander J, MacLeod T. 2007. Foundations of representation: where might graphical symbol systems come from? Cogn. Sci. 31, 961–987. ( 10.1080/03640210701703659) [DOI] [PubMed] [Google Scholar]
- 113.Hockett CF. 1978. In search of Jove's brow. Am. Speech 53, 243–313. ( 10.2307/455140) [DOI] [Google Scholar]
- 114.Nielsen A. 2016. Systematicity, motivatedness, and the structure of the lexicon. PhD thesis, University of Edinburgh, UK. [Google Scholar]
- 115.Gasser M. 2004. The origins of arbitrariness in language In Proc. 26th Ann. Conf. Cogn. Sci. Soc., Chicago, IL, August 5–7 August 2004 (eds K Forbus, D Gentner, T Regier), pp. 434–439. Mahwah, NJ: Erlbaum. [Google Scholar]
- 116.Sidhu DM, Pexman PM. 2017. Lonely sensational icons: semantic neighbourhood density, sensory experience and iconicity. Lang. Cogn. Neurosci. 33, 25–31. ( 10.1080/23273798.2017.1358379) [DOI] [Google Scholar]
- 117.Christiansen MH, Chater N. 2016. Creating language: integrating evolution, acquisition, and processing. Cambridge, MA: MIT Press. [Google Scholar]
- 118.Christiansen MH, Monaghan P. 2016. Division of labor in vocabulary structure: insights from corpus analyses. Top. Cogn. Sci. 8, 610–624. ( 10.1111/tops.12164) [DOI] [PubMed] [Google Scholar]
- 119.Meir I. 2010. Iconicity and metaphor: constraints on metaphorical extension of iconic forms. Language 86, 865–896. ( 10.1353/lan.2010.0044) [DOI] [Google Scholar]
- 120.Clark HH. 2016. Depicting as a method of communication. Psychol. Rev. 123, 324 ( 10.1037/rev0000026) [DOI] [PubMed] [Google Scholar]
- 121.Clark HH, Gerrig RJ. 1990. Quotations as demonstrations. Language 66, 764–805. ( 10.2307/414729) [DOI] [Google Scholar]
- 122.Forder L, Lupyan G. 2017. Hearing words changes color perception: facilitation of color discrimination by verbal and visual cues. PsyArXiv Prepr. ( 10.17605/OSF.IO/F83AU) [DOI] [PubMed] [Google Scholar]
- 123.Thompson RL, Vinson DP, Vigliocco G. 2009. The link between form and meaning in American Sign Language: lexical processing effects. J. Exp. Psychol. Learn. Mem. Cogn. 35, 550–557. ( 10.1037/a0014547) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Vinson D, Thompson RL, Skinner R, Vigliocco G. 2015. A faster path between meaning and form? Iconicity facilitates sign recognition and production in British Sign Language. J. Mem. Lang. 82, 56–85. ( 10.1016/j.jml.2015.03.002) [DOI] [Google Scholar]
- 125.Strik Lievers F. 2015. Synaesthesia: a corpus-based study of cross-modal directionality. Funct. Lang. 22, 69–95. ( 10.1075/fol.22.1.04str) [DOI] [Google Scholar]
- 126.Ullmann S. 1959. The principles of semantics. Glasgow, UK: Jackson, Son & Co. [Google Scholar]
- 127.Williams JM. 1976. Synaesthetic adjectives: a possible law of semantic change. Language 52, 461–478. ( 10.2307/412571) [DOI] [Google Scholar]
- 128.Winter B, Perlman M, Perry LK, Lupyan G. 2017. Which words are most iconic? Iconicity in English sensory words. Interact. Stud. 18, 433–454. ( 10.1075/is.18.3.07win) [DOI] [Google Scholar]
- 129.Classen C. 1993. Worlds of sense: exploring the senses in history and across cultures. London, UK: Routledge. [Google Scholar]
- 130.Winter B. 2016. The sensory structure of the English lexicon. PhD thesis, University of California, Merced. [Google Scholar]
- 131.Cuskley C, Kirby S. 2013. Synesthesia, cross-modality, and language evolution. In Oxford handbook of synesthesia (eds Simner J, Hubbard EM), pp. 869–907. Oxford, UK: Oxford University Press. [Google Scholar]
- 132.Ramachandran VS, Hubbard EM. 2001. Synaesthesia—a window into perception, thought and language. J. Conscious. Stud. 8, 3–34. [Google Scholar]
- 133.Auracher J. 2017. Sound iconicity of abstract concepts: place of articulation is implicitly associated with abstract concepts of size and social dominance. PLoS ONE 12, e0187196 ( 10.1371/journal.pone.0187196) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 134.Maglio SJ, Rabaglia CD, Feder MA, Krehm M, Trope Y. 2014. Vowel sounds in words affect mental construal and shift preferences for targets. J. Exp. Psychol. Gen. 143, 1082–1096. ( 10.1037/a0035543) [DOI] [PubMed] [Google Scholar]
- 135.Pexman PM, Heard A, Lloyd E, Yap MJ. 2017. The Calgary Semantic Decision Project: concrete/abstract decision data for 10,000 English words. Behav. Res. Methods 49, 407–417. ( 10.3758/s13428-016-0720-6) [DOI] [PubMed] [Google Scholar]
- 136.Jaeger TF. 2008. Categorical data analysis: away from ANOVAs (transformation or not) and towards logit mixed models. J. Mem. Lang. 59, 434–446. ( 10.1016/j.jml.2007.11.007) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Adelman JS, Brown GDA, Quesada JF. 2006. Contextual diversity, not word frequency, determines word-naming and lexical decision times. Psychol. Sci. 17, 814–823. ( 10.1111/j.1467-9280.2006.01787.x) [DOI] [PubMed] [Google Scholar]
- 138.Hoffman P, Ralph MAL, Rogers TT. 2013. Semantic diversity: a measure of semantic ambiguity based on variability in the contextual usage of words. Behav. Res. Methods 45, 718–730. ( 10.3758/s13428-012-0278-x) [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The code for all analyses presented here is available at https://osf.io/b9fhx/.