Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2021 Apr 9;14(1):127–142. doi: 10.1111/tops.12532

What Can Network Science Tell Us About Phonology and Language Processing?

Michael S Vitevitch 1,
PMCID: PMC9290073  PMID: 33836120

Abstract

Contemporary psycholinguistic models place significant emphasis on the cognitive processes involved in the acquisition, recognition, and production of language but neglect many issues related to the representation of language‐related information in the mental lexicon. In contrast, a central tenet of network science is that the structure of a network influences the processes that operate in that system, making process and representation inextricably connected. Here, we consider how the structure found across phonological networks of several languages from different language families may influence language processing as we age and experience diseases that affect cognition during the typical and atypical acquisition of new words, during typical perception and production of speech in adults, and during language change over time. We conclude that the network science approach may not only provide insights into specific language processes but also provide a way to connect the work from these domains, which are becoming increasingly balkanized.

Keywords: Phonology, Network science, One‐phoneme metric, Phonological neighbors, Language universals, Word recognition, Speech production, Word‐learning


To understand spoken language, the listener must first identify the specific words that are heard, then access the meaning of those words from that part of memory known as the mental lexicon. Contemporary cognitive psychology models have attempted to explain this important process known as spoken word recognition, but in doing so these models have neglected how memory for the sounds that make up a word (i.e., phonological representations) may influence this process. In the sections that follow, we will first consider evidence for the processing‐centric bias in contemporary cognitive psychology. We will then contrast that approach with an emerging view called cognitive network science, which focuses on the organization of information in memory, and how that structure influences various cognitive processes. Given the importance of spoken word recognition to the rest of language processing, we will limit our discussion to phonological networks, and how the structure of phonological networks might influence various language processes.

1. Processing and representation in contemporary cognitive psychology

Cognitive psychology has traditionally examined how information is represented in memory, and the processes used to acquire, encode, retrieve, and manipulate that information represented in memory. In the area of psycholinguistics, there is a significant amount of research focusing on the cognitive processes involved in the acquisition, recognition, and production of language. What is not as well represented in contemporary cognitive psychology is research on the representation of language‐related information in the mental lexicon. Specifically, what information is represented, how are those representations organized, and how might the structure of those representations influence cognitive processing? The bias toward processes compared to representations is prevalent in contemporary cognitive psychology.

For evidence of this bias toward processing over representation in contemporary cognitive psychology, consider the numerous journal articles that debated whether spoken word recognition was an interactive process, as exemplified in the TRACE model (McClelland & Elman, 1986) or strictly feed‐forward, as exemplified in the Shortlist (Norris, 1994) and Merge models (Norris, McQueen, & Cutler, 2000). A similar debate between interactive (Dell, 1986) versus feed‐forward (Levelt, Roelofs, & Meyer, 1999) models occurred about the processes involved in speech production, providing additional evidence for a processing‐centric bias in contemporary cognitive psychology.

An extreme example of this processing‐centric bias in contemporary cognitive psychology can be found in the computational principles of parallel distributed processing (PDP) in the late 1980s and early 1990s (Rumelhart, McClelland, & PDP Research Group, 1986; see also Rogers & McClelland, 2014). Among the central tenets of the PDP framework is the idea that knowledge “does not exist as a set of dormant data structures in a separate store but is encoded directly in the network architecture, in the values of the connection weights that allow the system to generate useful internal representations and outputs” (Rogers & McClelland, 2014, p. 1039). (Note that network in this context is referring to a type of artificial neural network, not to the structural networks described below.)

Further to this processing‐centric bias, “active representations in the mind are thought to correspond to the patterns of activation generated over a set of units” (Rogers & McClelland, 2014, p. 1038). In other words, in the PDP framework, representations are not explicitly stored in and retrieved from memory but instead are ephemeral and emerge via the processing that occurs over many distributed processing units. In contrast to this view of processing and representation in contemporary cognitive psychology, an emerging approach known as cognitive network science takes a different view of processing and representation (Siew, Wulff, Beckage, & Kenett, 2019).

2. Processing and representation in cognitive network science

Cognitive network science uses the mathematical tools of network science to examine questions commonly studied by cognitive psychologists and cognitive scientists (Vitevitch, 2019a). In this approach, nodes are used to represent entities in a system, like words in the mental lexicon, and edges are used to connect nodes that are related in some way, like words that are phonologically related (Vitevitch, 2008), or words that are semantically related (e.g., Steyvers & Tenenbaum, 2005). See Fig. 1 for a network representation of the word speech. Words that are phonologically similar to speech (based on the addition, substitution, or deletion of one phoneme in the word to speech), such as peach, speed, and speak are known as 1‐hop neighbors of speech. Words such as spike, preach, and spud that are phonologically similar to peach, speed, and speak (but not to speech) are known as 2‐hop neighbors of speech.

Fig. 1.

Fig. 1

In a phonological network, nodes are used to represent words, and edges connect words that sound similar to each other. In the present case, phonological similarity is defined by a simple computational metric (add, delete, or substitute a phoneme in a word to form another word). Phonological similarity can be defined in other ways as well.

Various measures of the resulting network structure can then be made at the level of individual nodes (known as the micro‐level), at the level of the whole network (known as the macro‐level), or of subsets of nodes in the network (known as the meso‐level). It is important to understand the structure of a network at these three levels because a central tenet of network science states that the structure of a network influences how efficiently a process will operate in that system. For example, Kleinberg (2000; see also Latora & Marchiori, 2001) found that the algorithm that enabled an efficient navigation of a type of network known as a small‐world network was less efficient when that algorithm was implemented on a network with the same number of nodes and the same number of edges, but the nodes were connected in a different pattern. In other words, how the representations are structured influences processing. Given the important influence of the structure of a network on processing, the next section will explain the structural characteristics of phonological networks.

3. The structure of phonological networks

Given the important influence that the structure of a network may have on processing, Arbesman, Strogatz, and Vitevitch (2010a) examined whether the structure of the phonological network observed by Vitevitch (2008) was unique to English or might be found in other languages as well. If the characteristics of phonological networks are found across multiple languages, it might suggest that those structural characteristics are in some way important and may influence language processing.

Arbesman et al. (2010a) compared the structure of the English phonological network to the structure of the phonological networks formed from words in Spanish, Mandarin, Hawaiian, and Basque. These languages were selected to be representative examples of different language families. Although English and Spanish are both from the Indo‐European family of languages, English is a Germanic language, whereas Spanish is a Romance language. Mandarin is a Sino‐Tibetan language, Hawaiian is an Austronesian language, and Basque (or Euskara) is a linguistic isolate, meaning that it has not (yet) been identified as a member of a given language family.

Despite these five languages differing from each other in their morphology, phonemic inventories, typical word‐length, canonical syllable shape, use of tone, etc., Arbesman et al. (2010a) found that all five languages had similar network structures characterized by (1) a giant component that was smaller than is typically observed in networks of social or technological systems, (2) a small‐world structure in the giant component, (3) assortative mixing by degree in the giant component, and (4) the degree distribution of the giant component did not follow a power‐law. Below we will discuss each of these structural characteristics and the relevance of them for language processing.

The giant component of a network refers to the largest cluster of interconnected nodes. In social networks that map friendships among people upwards of 90% of the nodes in the network are found in the giant component. However, in the phonological networks examined by Arbesman et al. (2010a), 34%–66% of the lexical nodes were found in the giant component. In addition to observing a smaller giant component, Arbesman et al. (2010a) observed that the phonological networks contained a large number of smaller components (referred to as “lexical islands”) that contained words that were connected to each other, but not to the giant component. (See Arbesman, Strogatz, and Vitevitch [2010b] for an analysis of how the lexical islands differed in English and Spanish.) Further, a large number of isolates (nodes not connected to anything, also referred to as lexical hermits) were also observed in the five phonological networks.

The lack of a large core of highly connected nodes in the phonological networks has some interesting implications. Arbesman et al. (2010a) found that the phonological networks were more resilient to targeted attacks to highly connected nodes and to failures of randomly selected nodes. (Resilient in this context means that the network remained relatively well connected despite increasing amounts of damage.) In contrast, the much larger giant component found in a network model of the Internet was resilient to failures of randomly selected nodes but was vulnerable to targeted attacks to highly connected nodes (Albert, Jeong, & Barabási, 1999). If one views targeted attacks to highly connected nodes and the failures of randomly selected nodes as being analogous to the damage caused by aging, stroke, or disease, then the structure of the phonological networks may confer upon various language processes a level of resilience not typically seen in networks of social or technological systems (see also De Domenico & Arenas, 2017).

The second network structure that was common to the five phonological networks was that the giant component exhibited a small‐world structure. Small‐world structure is typically identified by (1) the shortest average path length being comparable to the shortest average path length of a network with the same number of nodes connected randomly, and (2) by an average clustering coefficient that is several orders of magnitude larger than the average clustering coefficient of a network with the same number of nodes connected randomly. This method of identifying a small‐world network was the one initially proposed by Watts and Strogatz (1998); however, more sophisticated approaches have been developed since that initial paper describing small‐world networks (e.g., Humphries & Gurney, 2008). Recall that Kleinberg (2000) found that navigation algorithms were very efficient in small‐world networks, suggesting that the small‐world structure of the phonological networks may contribute to the rapid and efficient lexical retrieval found in healthy, typically developed language users.

The third network structure that was common to the five phonological networks was assortative mixing by degree in the giant components of the phonological networks. Assortative mixing by degree means that nodes with many connections tended to be connected to nodes that also had many connections. Similarly, nodes with few connections tended to be connected to nodes that also had few connections. In contrast, disassortative mixing by degree, often found in technological networks (e.g., the Internet), occurs when a highly connected node tends to connect to nodes that themselves have few connections (Newman, 2002). Although assortative mixing by degree is not unique to phonological networks—it is observed often in social networks (Newman, 2002)—the values of assortative mixing by degree were much higher than those values typically observed in social networks. The work of Vitevitch, Chan, and Goldstein (2014) suggests that this characteristic in phonological networks may result in the graceful degradation typically observed in instances of failed lexical retrieval. That is, you retrieve a similar sounding word rather than fail to retrieve anything from the lexicon (i.e., a catastrophic failure in lexical retrieval).

The final network structure that was common to the five phonological networks examined by Arbesman et al. (2010a) was that the degree distribution of nodes in the giant component (i.e., the number of nodes with 1 connection, the number of nodes with 2 connections, etc.) did not follow a power law. When a degree distribution follows a power law, the distribution appears as a straight line with an exponent in the range of 2–3 when the x and y axes are plotted on logarithmic scales. This relationship indicates that there are many nodes with only a few connections and a few nodes with many connections (often called hubs).

Networks with degree distributions that follow a power law are known as scale‐free networks (Barabási & Albert, 1999), meaning similar patterns in the network are observed across multiple levels of analysis. The presence of a scale‐free degree distribution may indicate that a particular growth algorithm known as preferential attachment produced the structure of that network. However, work by Keller (2005) and others has shown that other algorithms can also produce power‐law distributions. Further, work by Broido and Clauset (2019) suggests that scale‐free networks/power‐law degree distributions may not be as common as previously thought.

The deviation of the degree distribution in the phonological networks from a power law is interesting because work by Amaral, Scala, Barthélémy, and Stanley (2000) suggests that degree distributions may deviate from a power‐law when there is a cost associated with the attachment of a new node in the network. In the case of phonological networks, the phoneme inventory, canonical syllable structures, and phonotactic constraints (e.g., Vitevitch & Luce, 2016) of a given language may all impose costs that constrain the attachment of a new node in the network. The presence of more than one language in the lexicon (i.e., being bi‐ or multi‐lingual) may also impose a cost that constrains the attachment of a new node in the network (e.g., Bilson, Yoshida, Tran, Woods, & Hills, 2015; Tiv, Gullifer, Feng, & Titone, 2020). Alternatively, growth algorithms other than preferential attachment may influence the acquisition of words in the phonological network and may result in a degree distribution in the phonological networks that deviates from a power law (e.g., Hills, Maouene, Maouene, Sheya, & Smith, 2009; Siew & Vitevitch, 2020a,2020b).

As described above, computer simulations and analogy to networks in other domains suggested that the constellation of network structures observed in phonological networks may influence lexical processing in interesting ways. The possibility of certain network structures influencing lexical processing prompted additional research using traditional psycholinguistic tasks to directly determine how certain network features influenced lexical processing. By using traditional psycholinguistic tasks that are widely used and well understood, one could be confident that the results of those psycholinguistic experiments—some of which are reviewed in the next section—were actually demonstrating the influence of certain network measures on processing.

4. Psycholinguistic evidence that structure influences processing

The first characteristic of phonological networks to be examined with psycholinguistic experiments was clustering coefficient, which measures in a phonological network the proportion of phonological neighbors of a word that are also phonological neighbors of each other (see Fig. 2). The clustering coefficient, C, has a range from 0 to 1. When C = 0, none of the neighbors of a given word are neighbors of each other, and when C = 1, every neighbor of a given word is also a neighbor of all of the other neighbors of that word.

Fig. 2.

Fig. 2

An illustration of the local clustering coefficient for the words badge and log. Notice that both badge and log have the same number of phonological neighbors (i.e., the words encircling badge and log). However, the neighbors of badge are also neighbors of each other to a greater extent than are the neighbors of log.

Given the well‐studied and widespread influences of phonological neighborhood density in psycholinguistics (Vitevitch & Luce, 2016)—which refers to the number of words that are phonologically similar to a given word and corresponds to the term degree in network science—it was reasoned that if any network science measure should influence lexical processing, it would be a measure that captured something about the internal structure of the phonological neighborhood, that is, clustering coefficient. Indeed, earlier attempts to assess the internal structure of the phonological neighborhood suggested that such influences could be observed (Vitevitch, 2002, 2007), making the local clustering coefficient a reasonable network measure to examine first.

Chan and Vitevitch (2009) used a perceptual identification task and an auditory lexical decision task, two conventional psycholinguistic tasks used to study spoken word recognition, to examine how clustering coefficient might influence lexical processing. In the perceptual identification task, they found that words with low clustering coefficient were responded to more accurately than words with high clustering coefficient. In the lexical decision task, they found that words with low clustering coefficient were responded to more quickly than words with high clustering coefficient.

Reasoning from work done in other domains that also used network science, Chan and Vitevitch (2009) proposed a verbal model to account for the differences they observed in performance. They suggested that activation would spread from the target word to the phonologically related words and from those words to other words that were phonologically related, etc. For words with low clustering coefficient, the activation would tend to disperse to the rest of the network, allowing the target word to “stand out” from the background of partially activated phonological neighbors, and therefore be retrieved quickly and accurately. However, for words with high clustering coefficient, the spreading activation would tend to recirculate among the highly interconnected phonological neighbors, resulting in the target word being “buried” in the background of partially activated phonological neighbors.

To further explore how the structure of representations in the lexicon may influence processing, Chan and Vitevitch (2009) also reported the results of a computer simulation of the TRACE model of word recognition (McClelland & Elman, 1986; Strauss, Harris, & Magnuson, 2007). Given that (j)TRACE is a connectionist model of the process of spoken word recognition, and that it does not consider how representations in the lexicon are structured, or how that structure may influence processing, it is perhaps not surprising that (j)TRACE was not able to simulate how differences in the clustering coefficient might influence processing as was observed in the two psycholinguistics tasks used by Chan and Vitevitch (2009).

Additional studies demonstrated that the local clustering coefficient of words in the lexicon influenced other psychological processes, including speech production (Chan & Vitevitch, 2010), word learning (Goldstein & Vitevitch, 2014), and long‐term and short‐term memory (Vitevitch, Chan, & Roodenrys, 2012). Further, the verbal account first put forth in Chan and Vitevitch (2009) was subsequently modeled computationally by Vitevitch, Ercal, and Adagarla (2011) (see also Siew, 2019). They simulated on 2‐hop networks (like those in Fig. 1) the spreading of activation among words with high and low clustering coefficient to provide a more formal account of the influence of clustering coefficient on processing.

It is important to note that measures of the network structure at the level of individual nodes (i.e., the micro‐level)—like local clustering coefficient—are not the only network structures that influence various lexical processes. Other measures at the meso‐ and macro‐level have also been observed to influence lexical processing. Mixing by degree, described above, is considered a macro‐level measure describing a characteristic of the whole network. Recall that Vitevitch et al. (2014), also described above, examined how this structural characteristic might influence how individuals recover from instances of failed lexical retrieval (see also Vitevitch, Goldstein, & Johnson, 2016 for a study of path‐length and misperceptions of speech).

A study by Siew and Vitevitch (2016) looked at the macro‐level structure of where words were located in the phonological network—either in the giant component or in a lexical island—and how that location might influence lexical retrieval in a naming task and a lexical decision task. They also examined how the location of a word in the phonological network might influence short‐term memory processes by using a serial recall task. In all of the tasks considered, Siew and Vitevitch found that words from the lexical islands were recognized more quickly and accurately than words from the giant component despite being comparable in their frequency of occurrence, neighborhood density, word length, etc., demonstrating that macro‐level structures of the phonological network can also influence cognitive processing.

Turning to the meso‐level—which measures subsets of nodes in the network rather than a characteristic at the individual level or of the overall network (e.g., Siew, 2013)—we see that key players in a network (Borgatti, 2006) also influence the process of spoken word recognition. Key players constitute a set of nodes in a network that, when removed, result in the network fracturing into several smaller components. When compared to another set of foil words that were comparable to the key players in frequency of occurrence, neighborhood density, word length, etc., Vitevitch and Goldstein (2014) found in a perceptual identification task that the key players were responded to more quickly and accurately than the foil words. Similar results were obtained in a naming task, a lexical decision task, and in an analysis of data from the English Lexicon Project (Balota et al., 2007). Together the studies described above demonstrate that the micro‐, meso‐, and macro‐level structures of the phonological network influence various language‐related and cognitive processes.

5. Network structure and speech and language disorders

In addition to leading to insights about the typical perception and production of speech in adults (as described above), the network approach has also been used to shed light on the atypical perception and production of speech in adults and children. Consider the work of Castro, Pelczarski, and Vitevitch (2017), who found that closeness centrality influenced reaction times in a lexical decision task of adults who stutter. Closeness centrality measures the distance from one node to all other nodes in the network (following the shortest path between any two nodes being considered). Although stuttering is often viewed as a disorder that primarily affects the fluent production of speech, subtle differences—that are not yet well understood—have been observed in the phonological processing abilities of people who stutter as compared to typically fluent peers (e.g., Byrd, McGill, & Usler, 2015; Newman & Bernstein Ratner, 2007; Pelczarski & Yaruss, 2014, 2016; Sasisekaran & De Nil, 2006). The tools of network science might be able to reveal more of these subtle differences in people who stutter, which are not always revealed using traditional psycholinguistic tasks.

Another study that used network science to gain insights that could not have been observed with standard measures (of phonetic accuracy) comes from Benham, Goffman, and Schweickert (2018), who examined novel sound and syllable production in typical and atypical language learners (i.e., children with developmental language disorder [DLD], also known as specific language impairment). Benham et al. asked children to repeat specially constructed nonwords with two syllables, and they used networks in which nodes represented syllables, and links connected the first to the second syllable in the production of each child. Although a more conventional kinematic analysis could not distinguish between the two groups of children, the nonword networks revealed higher variability and more disorganized production patterns in the children with DLD, despite both typical children and those with DLD exhibiting similar patterns of learning over time. The work of Benham et al. further demonstrates how the network approach can be used to gain insights into typical and atypical language development that might not be observed using more conventional or traditional measures and techniques (see also Carlson, Sonderegger, & Bane, 2014; Siew & Vitevitch, 2020a, 2020b).

Turning to acquired language disorders in adults, work by Vitevitch and Castro (2015) showed that individuals with Wernicke's or Broca's aphasia experienced problems in speech production as measured by the Philadelphia Naming Test that correlated with differences in the closeness centrality and the location in the phonological network (giant component vs. lexical island or a lexical hermit) of the words to be named. Studies like this further demonstrate the utility of the network approach to increase our understanding of acquired language disorders like aphasia.

Although the present paper has focused primarily on phonological networks, interesting work looking at other language disorders has been done with semantic networks, where connections are placed between words that are similar in their meanings instead of similar in how they sound. Some of this work is summarized here to provide further evidence that networks can provide insight into the atypical processes and representations that may characterize certain developmental and acquired speech and language disorders.

Beckage, Smith, and Hills (2011) demonstrated that networks that grow over time could provide insight not only into word learning in typically developing children but also provide insight into “late talking” children, who have vocabulary that are smaller than age‐matched children. Importantly, Beckage et al. (2011) found that late talkers had semantic networks that were less small‐world‐like than the semantic networks of typically developing children. Further, Beckage et al. (2011) observed a bias among the late talkers to learn “oddball” words that were semantically unique, rather than new words that were well connected to other words in the lexicon as typically developing children tend to do (e.g., Hills et al., 2009). It is not clear if other analysis approaches could have provided the insights that Beckage et al. (2011) observed with the network approach.

The tools of network science have also been used to examine the semantic networks of children with cochlear implants (Kenett et al., 2013) and may hold promise for increasing our understanding of the changes that take place in the lexicon as we age and possibly experience certain diseases often associated with aging (e.g., Wulff, De Deyne, Jones, & Mata, 2019). Although many of the diseases often associated with aging, like Alzheimer's disease, typically affect the semantic rather than phonological aspects of language, the work being done on semantic networks (e.g., Zemla & Austerweil, 2019) serves at the very least as a proof of concept of how network science can be used to provide insight into other aspects of the aging lexicon, like phonology.

6. The social network influences the lexical network

Up to this point, we have considered how network science can be used to capture the structure of the linguistic knowledge that individuals represent internally, and how that internally represented linguistic knowledge is acquired and develops over time. However, language does not occur solely in the black box of the mind but is inherently social as evidenced by the need for a language learner to be immersed in a community of language users (either speakers or signers) that transmit the ambient language to the learner. By having nodes represent people and edges connecting people who are friends with each other, the tools of network science can be used to examine the influences of one's social network on language processing.

Lev‐Ari and colleagues provide examples of how the size of one's social network can influence the ability of an individual to correctly interpret semantic information (Lev‐Ari, 2016), and also to process the phonetic/phonological information used to perceive speech (Lev‐Ari, 2018). These studies demonstrate how network science can help us better understand social influences on certain language processes rather than ignore those social influences as is more often done in the typical approach of cognitive psychology to examine language processing.

Further, changes in language occur not only at the developmental time‐scale of an individual but also occur at longer and shorter time‐scales. An example of language change that occurs on a longer time‐scale is seen by the changes in language that occur across generations of language users. An example of language change that occurs on a shorter time‐scale is seen when an individual accommodates to various linguistic features—such as a foreign accent—of a newly encountered interlocutor. The tools of network science can also be used to examine the influences on processing that occur at these shorter and longer time‐scales.

A study by Iacozza, Meyer, and Lev‐Ari (2020) demonstrates that social influences on cognitive processing can be induced very quickly (i.e., on very short time‐scales). Participants in their experiment were asked to learn the names of novel objects. Crucially, the names of the objects were taught to the participants by (ostensibly) students from their own university (inducing an in‐group bias) or by students from another university (inducing an out‐group bias). Participants with stronger in‐group biases (as assessed by a perceptual matching task; Moradi, Sui, Hewstone, & Humphreys, 2015) were less accurate at identifying matching speaker‐label pairs when the speaker was from the out‐group, indicating more specific encoding of information for members of the in‐group than for members of the out‐group.

Although Iacozza et al. (2020) did not use network analysis in this study, their work does show that whether a person is in or out of one's social network (i.e., in‐group versus out‐group, see also Frable & Bem, 1985) can influence certain cognitive processes, and those influences can be induced on a relatively short time‐scale. For an example of the use of network analysis on a longer, evolutionary time‐scale see, for example, Centola and Baronchelli (2015).

Network science may not only provide insights into internal language processes often studied by cognitive psychologists and other language scientists, as described above, but it may also provide a way to connect the work from various domains of language science. The work of Lev‐Ari (2016, 2018) provides one example of how the tools of network science might connect social psychology/sociolinguistics to cognitive psychology/psycholinguistics. The next section examines other ways that network science can be used to connect various domains of language science.

7. Multilevel networks in psycholinguistics

Another way the tools of network science may connect various domains of language is through the use of multilevel networks, also known as a network of networks. In a multilevel network, for example, words in one level are connected to each other via their phonological relationship but connected to each other in another level via their semantic relationship. An additional connection would link cat at the phonological level to cat at the semantic level, for example. Other relationships among words—such as morphology or orthography (Siew & Vitevitch, 2019)—could also be included in another network level to further examine the way different types of linguistic information interact during language processing.

Much progress has already been made using multilevel networks to understand word learning (Stella, Beckage, Brede, & De Domenico, 2018) and acquired language disorders (Castro, Stella, & Siew, 2020). Continued exploration of multilevel linguistic networks may provide new insights into various types of language processes and into language disorders, especially if additional levels are incorporated in the network of networks. With the right additional levels, one might even be able to finally connect the physiological network in the brain to the cognitive network in the mind (Vitevitch, 2019b).

8. Challenges for cognitive network science

The cognitive network science approach has been criticized because “these networks do not ‘do’ anything; they have no function” (Brown et al., 2018, p. 16). On the contrary, what such networks “do” is capture certain regularities and relationships among entities in the world around us. Work by Karuza, Kahn, Thompson‐Schill, and Bassett (2017) suggests that humans are able to extract those relationships among entities from the world around us and exploit that information in subsequent tasks. This suggests that these networks indeed “do” something and “function” to internally organize representations of the world around us.

With the regularities and relationships among entities in the world around us captured in the structure of the network, a simple processing algorithm—like a random walk on the network—may be sufficient to reproduce the behavior exhibited by humans in various tasks (e.g., Abbott, Austerweil, & Griffiths, 2015; Schweickert, Xi, Viau‐Quesnel, & Zheng, 2020). Alternatively, diffusion of activation across the network may also reproduce the behavior exhibited by humans in certain tasks (e.g., Siew, 2019; Vitevitch et al., 2011).

However, simple processes such as random walks and activation diffusing across a structured network may not be sufficient to capture the richness of all of human cognition. Human cognition also has many examples of directed search through memory (e.g., Hills & Pachur, 2012). Therefore, cognitive network science may also need to consider algorithms that employ a mixture of random and directed walks such as the taxi‐drive algorithm proposed by O'Keeffe, Anjomshoaa, Strogatz, Santi, and Ratti (2019)). The taxi‐drive algorithm was inspired by real taxis serving passengers in a city. Typically, a taxi drives around randomly until a passenger flags them down for a ride to a specific location. The taxi then proceeds along the shortest route to the randomly prescribed location, deposits the passenger, and then proceeds to roam around randomly until flagged down for the next randomly selected yet directed destination. This mixture of random and directed walks may better capture certain aspects of human cognition (i.e., executive function) and navigation across a structured network of the mental lexicon.

The tools of network science may enable us to understand how many of the pieces of language and cognition (and social influences) fit together, and how that underlying structure influences the behavior of cognitive and linguistic systems (Barabási, 2009, 2012). Additional computer simulations and psycholinguistic experiments will be required to assess how cognitive network science can examine in new ways the typical and atypical perception and production of speech (in mono‐ and multi‐lingual individuals), the influences of age and disease on language processing, the typical and atypical acquisition of new words, and to language change over time. Work using multilevel networks also holds much promise for reconnecting work from various domains of language science, which have become increasingly balkanized and disconnected over the years.

Acknowledgments

I wish to thank Anisha Kodwani, Yoshiaki Fujita, Sarah Brummett, and three reviewers for helpful comments and suggestions on this work.

This article is part of the topic “Networks of the Mind: How Can Network Science Elucidate Our Understanding of Cognition?,” Thomas T. Hills and Yoed N. Kenett (Topic Editors).

Many contemporary models in cognitive psychology focus on processing, ignoring how the organization of representations in memory may influence processing. In contrast, cognitive network science focuses on the organization of information in memory, and how that structure influences various cognitive processes. Vitevitch reviews work using the cognitive network science approach to understand spoken word recognition and points to some limitations of this approach for cognitive psychology more broadly.

References

  1. Abbott, J. T. , Austerweil, J. L. , & Griffiths, T. L. (2015). Random walks on semantic networks can resemble optimal foraging. Psychological Review, 122, 558–569. [DOI] [PubMed] [Google Scholar]
  2. Albert, R. , Jeong, H. , & Barabási, A. L. (1999). Diameter of the world‐wide web. Nature, 401, 130–131. [Google Scholar]
  3. Amaral, L. A. N. , Scala, A. , Barthélémy, M. , & Stanley, H. E. (2000). Classes of small‐world networks. Proceedings of the National Academy of Sciences, 97, 11149–11152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Arbesman, S. , Strogatz, S. H. , & Vitevitch, M. S. (2010a). The structure of phonological networks across multiple languages. International Journal of Bifurcation and Chaos, 20, 679–685. [Google Scholar]
  5. Arbesman, S. , Strogatz, S. H. , & Vitevitch, M. S. (2010b). Comparative analysis of networks of phonologically similar words in English and Spanish. Entropy, 12, 327–337. [Google Scholar]
  6. Balota, D. A. , Yap, M. J. , Cortese, M. J. , Hutchison, K. A. , Kessler, B. , Loftis, B . … Treiman, R. (2007). The English Llxicon project. Behavior Research Methods, 39, 445–459. [DOI] [PubMed] [Google Scholar]
  7. Barabási, A. L. , & Albert, R. (1999). Emergence of scaling in random networks. Science, 286, 509–512. [DOI] [PubMed] [Google Scholar]
  8. Barabási, A. L. (2009). Scale‐free networks: A decade and beyond. Science, 325, 412–413. [DOI] [PubMed] [Google Scholar]
  9. Barabási, A. L. (2012). The network takeover. Nature Physics, 8, 14–16. [Google Scholar]
  10. Beckage, N. , Smith, L. , & Hills, T. (2011). Small worlds and semantic network growth in typical and late talkers. PLoS One, 6(5), E19348. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Benham S., Goffman L., Schweickert R. (2018). An Application of Network Science to Phonological Sequence Learning in Children With Developmental Language Disorder. Journal of Speech, Language, and Hearing Research. 61(9), 2275–2291. 10.1044/2018_jslhr-l-18-0036. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Bilson, S. , Yoshida, H. , Tran, C. D. , Woods, E. A. , & Hills, T. T. (2015). Semantic facilitation in bilingual first language acquisition. Cognition, 140, 122–134. [DOI] [PubMed] [Google Scholar]
  13. Borgatti, S. P. (2006). Identifying sets of key players in a network. Computational, Mathematical and Organizational Theory, 12, 21–34. [Google Scholar]
  14. Broido, A. D. , & Clauset, A. (2019). Scale‐free networks are rare. Nature Communications, 10, 1017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Brown, K. S. , Allopenna, P. D. , Hunt, W. R. , Steiner, R. , Saltzman, E. , McRae, K. , & Magnuson, J. S. (2018). Universal features in phonological neighbor networks. Entropy, 20, 526. 10.3390/e20070526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Byrd, C. T. , McGill, M. , & Usler, E. (2015). Nonword repetition and phoneme elision in adults who do and do not stutter: Vocal versus nonvocal performance differences. Journal of Fluency Disorders, 44, 17–31. [DOI] [PubMed] [Google Scholar]
  17. Carlson, M. T. , Sonderegger, M. , & Bane, M. (2014). How children explore the phonological network in child‐directed speech: A survival analysis of children's first word productions. Journal of Memory and Language, 75, 159–180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Castro, N. , Pelczarski, K. M. , & Vitevitch, M. S. (2017). Using network science measures to predict lexical decision performance of adults who stutter. Journal of Speech, Language, and Hearing Research, 60, 1911–1918. [DOI] [PubMed] [Google Scholar]
  19. Castro, N. , Stella, M. , & Siew, C. S. Q. (2020). Quantifying the interplay of semantics and phonology during failures of word retrieval by people with aphasia using a multiplex lexical network. Cognitive Science, 44, e12881. [DOI] [PubMed] [Google Scholar]
  20. Centola, D. , & Baronchelli, A. (2015). The spontaneous emergence of conventions: An experimental study of cultural evolution. Proceedings of the National Academy of Sciences, 112, 1989–1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Chan, K. Y. , & Vitevitch, M. S. (2009). The influence of the phonological neighborhood clustering‐coefficient on spoken word recognition. Journal of Experimental Psychology: Human Perception & Performance, 35, 1934–1949. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Chan, K. Y. , & Vitevitch, M. S. (2010). Network structure influences speech production. Cognitive Science, 34, 685–697. [DOI] [PubMed] [Google Scholar]
  23. De Domenico, M. , & Arenas, A. (2017). Modeling structure and resilience of the dark network. Physical Review E, 95, 022313. [DOI] [PubMed] [Google Scholar]
  24. Dell, G. S. (1986). A spreading‐activation theory of retrieval in sentence production. Psychological Review, 93, 283–321. [PubMed] [Google Scholar]
  25. Frable, D. E. , & Bem, S. L. (1985). If you are gender schematic, all members of the opposite sex look alike. Journal of Personality and Social Psychology, 49, 459−468. [Google Scholar]
  26. Goldstein, R. , & Vitevitch, M. S. (2014). The influence of clustering coefficient on word‐learning: How groups of similar sounding words facilitate acquisition. Frontiers in Language Sciences, 5, 01307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Hills, T. T. , Maouene, M. , Maouene, J. , Sheya, A. , & Smith, L. B. (2009). Longitudinal analysis of early semantic networks: Preferential attachment or preferential acquisition? Psychological Science, 20, 729–739. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Hills, T. T. , & Pachur, T. (2012). Dynamic search and working memory in social recall. Journal of Experimental Psychology: Learning, Memory, and Cognition, 38, 218–228. [DOI] [PubMed] [Google Scholar]
  29. Humphries, M. D. , & Gurney, K. (2008). Network ‘small‐world‐ness’: A quantitative method for determining canonical network equivalence. PLoS ONE, 3(4), E0002051. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Iacozza, S. , Meyer, A. S. , & Lev‐Ari, S. (2020). How in‐group bias influences the level of detail of speaker‐specific information encoded in novel lexical representations. Journal of Experimental Psychology: Learning, Memory, and Cognition, 46(5), 894–906. [DOI] [PubMed] [Google Scholar]
  31. Karuza, E. A. , Kahn, A. E. , Thompson‐Schill, S. L. , & Bassett, D. S. (2017). Process reveals structure: How a network is traversed mediates expectations about its architecture. Scientific Reports, 7, 12733. 10.1038/s41598-017-12876-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Keller, E. F. (2005). Revisiting ‘‘scale‐free’’ networks. BioEssays, 27, 1060–1068. [DOI] [PubMed] [Google Scholar]
  33. Kenett, Y. N. , Wechsler‐Kashi, D. , Kenett, D. Y. , Schwartz, R. G. , Ben‐Jacob, E. , & Faust, M. (2013). Semantic organization in children with cochlear implants: Computational analysis of verbal fluency. Frontiers in Psychology, 4, 543. 10.3389/fpsyg.2013.00543 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Kleinberg, J. M. (2000). Navigation in a small world. Nature, 406, 845. [DOI] [PubMed] [Google Scholar]
  35. Latora, V. , & Marchiori, M. (2001). Efficient behavior of small‐world networks. Physical Review Letters, 87, 198701. [DOI] [PubMed] [Google Scholar]
  36. Lev‐Ari, S. (2016). How the size of our social network influences our semantic skills. Cognitive Science, 40, 2050–2064. 10.1111/cogs.12317 [DOI] [PubMed] [Google Scholar]
  37. Lev‐Ari, S. (2018). The influence of social network size on speech perception. Quarterly Journal of Experimental Psychology, 71(10), 2249–2260. 10.1177/1747021817739865 [DOI] [PubMed] [Google Scholar]
  38. Levelt, W. J. M. , Roelofs, A. , & Meyer, A. S. (1999). A theory of lexical access in speech production. Behavioral & Brain Sciences, 22, 1–38. [DOI] [PubMed] [Google Scholar]
  39. McClelland, J. L. , & Elman, J. L. (1986). The TRACE model of speech perception. Cognitive Psychology, 18, 1–86. [DOI] [PubMed] [Google Scholar]
  40. Moradi, Z. , Sui, J. , Hewstone, M. , & Humphreys, G. W. (2015). In‐group modulation of perceptual matching. Psychonomic Bulletin & Review, 22, 1255–1277. [DOI] [PubMed] [Google Scholar]
  41. Newman, M. E. J. (2002). Assortative mixing in networks. Physical Review Letters, 89, 20889701. [DOI] [PubMed] [Google Scholar]
  42. Newman, R. S. , & Bernstein Ratner, N. B. (2007). The role of selected lexical factors on confrontation naming accuracy, speed, and fluency in adults who do and do not stutter. Journal of Speech, Language, and Hearing Research, 50, 196–213. [DOI] [PubMed] [Google Scholar]
  43. Norris, D. (1994). Shortlist: A connectionist model of continuous speech recognition. Cognition, 52, 189–234. [Google Scholar]
  44. Norris, D. , McQueen, J. M. , & Cutler, A. (2000). Merging information in speech recognition: Feedback is never necessary. Behavioral & Brain Sciences, 23, 299–370. [DOI] [PubMed] [Google Scholar]
  45. O'Keeffe, K. P. , Anjomshoaa, A. , Strogatz, H. , Santi, P. , & Ratti, C. (2019). Quantifying the sensing power of vehicle fleets. Proceedings of the National Academy of Sciences of the United States of America, 116, 12752–12757. 10.1073/pnas.1821667116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Pelczarski, K. M. , & Yaruss, J. S. (2014). Phonological encoding of young children who stutter. Journal of Fluency Disorders, 39, 12–24. [DOI] [PubMed] [Google Scholar]
  47. Pelczarski, K. M. , & Yaruss, J. S. (2016). Phonological memory in young children who stutter. Journal of Communication Disorders, 62, 54–66. [DOI] [PubMed] [Google Scholar]
  48. Rogers, T. T. , & McClelland, J. L. (2014). Parallel distributed processing at 25: Further explorations in the microstructure of cognition. Cognitive Science, 38, 1024–1077. [DOI] [PubMed] [Google Scholar]
  49. Rumelhart, D. E. , & McClelland, J. L. & PDP Research Group. (1986) Parallel distributed processing: Explorations in the microstructure of cognition. Volume I: Foundations & volume II: Psychological and biological models. Cambridge, MA: MIT Press. [Google Scholar]
  50. Sasisekaran, J. , & De Nil, L. (2006). Phoneme monitoring in silent naming and perception in adults who stutter. Journal of Fluency Disorders, 31, 284–302. [DOI] [PubMed] [Google Scholar]
  51. Schweickert, R. , Xi, Z. , Viau‐Quesnel, C. , & Zheng, X. (2020). Power law distribution of frequencies of characters in dreams explained by random walk on semantic network. International Journal of Dream Research, 13, 192–201. [Google Scholar]
  52. Siew, C. S. Q. (2013). Community structure in the phonological network. Frontiers in Psychology, 4, 553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Siew, C. S. Q. (2019). spreadr: An R package to simulate spreading activation in a network. Behavior Research Methods, 51, 910–929. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Siew, C. S. Q. , & Vitevitch, M. S. (2016). Spoken word recognition and serial recall of words from components in the phonological network. Journal of Experimental Psychology: Learning, Memory, and Cognition, 42, 394–410. [DOI] [PubMed] [Google Scholar]
  55. Siew, C. S. Q. , & Vitevitch, M. S. (2019). The phonographic language network: Using network science to investigate the phonological and orthographic similarity structure of language. Journal of Experimental Psychology: General, 148, 475–500. [DOI] [PubMed] [Google Scholar]
  56. Siew, C. S. Q. , & Vitevitch, M. S. (2020a). Investigating the influence of inverse preferential attachment on network development. Entropy, 22, 1029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Siew, C. S. Q. , & Vitevitch, M. S. (2020b). An investigation of network growth principles in the phonological language network. Journal of Experimental Psychology: General, 149, 2376‐2394. [DOI] [PubMed] [Google Scholar]
  58. Siew, C. S. Q. , Wulff, D. U. , Beckage, N. M. , & Kenett, Y. N. (2019). Cognitive network science: A review of research on cognition through the lens of representations, processes, and dynamics. Complexity. 10.1155/2019/2108423 [DOI] [Google Scholar]
  59. Stella, M. , Beckage, N. M. , Brede, M. , & De Domenico, M. (2018). Multiplex model of mental lexicon reveals explosive learning in humans. Scientific Reports, 8, 2259. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Steyvers, M. , & Tenenbaum, J. B. (2005). The large‐scale structure of semantic networks: Statistical analyses and a model of semantic growth. Cognitive Science, 29, 41–78. [DOI] [PubMed] [Google Scholar]
  61. Strauss, T. J. , Harris, H. D. , & Magnuson, J. S. (2007). jTRACE: A reimplementation and extension of the TRACE model of speech perception and spoken word recognition. Behavior Research Methods, 39, 19–30. [DOI] [PubMed] [Google Scholar]
  62. Tiv, M. , Gullifer, J. , Feng, R. , & Titone, D. (2020). Using Network Science to map what Montréal bilinguals talk about across languages and communicative contexts. Journal of Neurolinguistics, 56, 100913. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Vitevitch, M. S. (2002). Influence of onset density on spoken‐word recognition. Journal of Experimental Psychology: Human Perception and Performance, 28, 270–278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Vitevitch, M. S. (2007). The spread of the phonological neighborhood influences spoken word recognition. Memory & Cognition, 35, 166–175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Vitevitch, M. S. (2008). What can graph theory tell us about word learning and lexical retrieval? Journal of Speech Language Hearing Research, 51, 408–422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Vitevitch, M. S. (Ed.). (2019a). Network science in cognitive psychology. Abingdon, UK: Routledge. [Google Scholar]
  67. Vitevitch, M. S. (2019b). Can network science connect mind, brain and behavior? In M. S., Vitevitch (Ed.), Network science in cognitive psychology (pp. 184–197). Oxfordshire, UK: Routledge. [Google Scholar]
  68. Vitevitch, M. S. , & Castro, N. (2015). Using network science in the language sciences and clinic. International Journal of Speech‐Language Pathology, 17, 13–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Vitevitch, M. S. , Chan, K. Y. , & Goldstein, R. (2014). Insights into failed lexical retrieval from network science. Cognitive Psychology, 68, 1–32. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Vitevitch, M. S. , Chan, K. Y. , & Roodenrys, S. (2012). Complex network structure influences processing in long‐term and short‐term memory. Journal of Memory & Language, 67, 30–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Vitevitch, M. S. , Ercal, G. , & Adagarla, B. (2011). Simulating retrieval from a highly clustered network: Implications for spoken word recognition. Frontiers in Language Sciences, 2, 369. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Vitevitch, M. S. , & Goldstein, R. (2014). Keywords in the mental lexicon. Journal of Memory & Language, 73, 131–147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Vitevitch, M. S. , Goldstein, R. , & Johnson, E. (2016). Path‐length and the misperception of speech: Insights from network science and psycholinguistics. In Mehler A., Blanchard P., Job B., & Banish S. (Eds.), Towards a theoretical framework for analyzing complex linguistic networks (pp. 29−45). Berlin: Springer‐Verla; g. [Google Scholar]
  74. Vitevitch, M. S. , & Luce, P. (2016). Phonological neighborhood effects in spoken word perception and production. Annual Review of Linguistics, 2, 75–94. [Google Scholar]
  75. Watts, D. J. , & Strogatz, S. H. (1998). Collective dynamics of ‘small‐world’ networks. Nature, 393, 440–442. [DOI] [PubMed] [Google Scholar]
  76. Wulff, D. U. , De Deyne, S. , Jones, M. N. , & Mata, R. & The Aging Lexicon Consortium.(2019) New perspectives on the aging lexicon. Trends in Cognitive Science, 23, 686–698. [DOI] [PubMed] [Google Scholar]
  77. Zemla, J. C. , & Austerweil, J. L. (2019). Analyzing knowledge retrieval impairments associated with Alzheimer's disease using network analyses. Complexity. 10.1155/2019/4203158 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Topics in Cognitive Science are provided here courtesy of Wiley

RESOURCES