Abstract
Cognitive science invokes semantic networks to explain diverse phenomena, from memory retrieval to creativity. Research in these areas often assumes a single underlying semantic network that is shared across individuals. Yet, recent evidence suggests that content, size, and connectivity of semantic networks are experience-dependent, implying sizable individual and age-related differences. Here, we investigate individual and age differences in the semantic networks of younger and older adults by deriving semantic networks from both fluency and similarity rating tasks. Crucially, we use a megastudy approach to obtain thousands of similarity ratings per individual to allow us to capture the characteristics of individual semantic networks. We find that older adults possess lexical networks with smaller average degree and longer path lengths relative to those of younger adults, with older adults showing less interindividual agreement and thus more unique lexical representations relative to younger adults. Furthermore, this approach shows that individual and age differences are not evenly distributed but, rather, are related to weakly connected, peripheral parts of the networks. All in all, these results reveal the interindividual differences in both the content and the structure of semantic networks that may accumulate across the life span as a function of idiosyncratic experiences.
Subject terms: Human behaviour, Cognitive ageing, Learning and memory
Introduction
Semantic networks are a form of knowledge representation that represent relations between items, such as concepts, using a graph-type system consisting of nodes and their interconnections. Such networks have been postulated as the representational basis of our cognitive system1–3 and are, therefore, an integral part of prominent models of memory4, reasoning5, and creativity6,7. Past work has often made the simplifying assumption that a common semantic network can be used to understand human semantic cognition4,5,8–11. This assumptions is implicit, for instance, in efforts to model retrieval from memory12, judgments of relatedness13, or decision making14 using large-scale word vector spaces and free-association networks. However, general theories of learning and development15,16, as well as empirical findings17–19, suggest that semantic networks could vary considerably between individuals and across the life span. Crucially, researchers now seem to agree that understanding experience-dependent changes and individual variation in cognition is an important frontier for the science of aging20.
Aging research has made significant progress in the past decades in quantifying age-related changes in semantic cognition, including large increases in the size of the knowledge-store across adult development, perhaps best documented in the large differences in vocabulary size across the adult life span21,22. More recently, however, research suggests that individual learning and life span development can also lead to changes in the structure of human knowledge23–25. For example, recent efforts have used data from large-scale free-association studies to show that older adults’ semantic networks are less connected, efficient, and structured relative to those of younger adults12,18.
Quantifying individual and age differences in the size and structure of human knowledge is important because this can help identify the sources underlying cognitive changes in later life. Older adults tend to perform worse on a broad set of cognitive tasks, and such findings are commonly attributed to a decline in fluid cognitive abilities26,27. However, changes in the size and structure of the knowledge representation can masquerade as changes in fluid cognitive abilities, which may be sometimes difficult to disentangle9. Moreover, many have argued that changes in the underlying size and structure of representations contributes directly to age differences in cognitive performance, for example, due to activation-spreading across many targets in memory (fan effect28) or difficulties in discrimination learning between many similar items16. Consequently, it has been proposed that it is important to understand the links between the size and structure of human knowledge and individual and age differences in fluid abilities (e.g.,25).
One first step needed to understand the contribution of semantic networks to age differences in cognitive performance is to document the changes in the size and structure of semantic networks across the life span. The few existing studies comparing the semantic networks of younger and older adults have used different elicitation methods to extract only aggregate-level networks of younger and older adults, while also focusing on different parts of the semantic network. In the present study, we seek to address the shortcomings of existing efforts to document potential life span differences in the structure of semantic networks in several ways (see Fig. 1).
First, we investigate age differences in the size and semantic network structure for aggregates of younger and older groups obtained from a semantic fluency task (e.g., “Name all animals you can”). Semantic fluency tasks are typically employed to measure fluid cognitive abilities, for instance, in screening instruments for age-related cognitive pathology (e.g., Alzheimer’s disease29). We study age differences in this paradigm under different conditions: Our analyses of the semantic structure of fluency productions are the first to include age comparisons using different semantic categories (animal vs. country) and different retrieval time allowances (1 minute vs. 10 minutes). Little is known about how structural properties vary across different areas of the semantic network30,31. Nonetheless, retrieval speed represents an established difference between younger and older adults’ cognition26. Consequently, by comparing networks from semantic fluency across different time conditions, we will be able to assess the robustness of past findings with respect to semantic content and situations where older adults may catch-up or even outperform younger adults in semantic fluency when given the opportunity to search their potentially larger semantic stores. Also, past work suggests that younger and older adults differ across semantic categories32, so comparing the network structure of different categories further contributes to understanding the factors that drive such differences.
Second, we adopt a megastudy approach33 to provide the first comparison of younger and older adults’ semantic networks at the level of individuals. This involves collecting over 2,000 similarity ratings from each participant. This is crucial for two reasons. Aggregate networks likely do not accurately reflect the structure of individual level networks9,12,34. Further, similarity ratings likely recruit different retrieval processes than those underlying the two elicitation methods past studies have relied on, semantic fluency and free associations. Consequently, this will allow us to rule out the possibility that age differences in semantic networks are driven by aggregation bias or retrieval processes associated with a particular elicitation method. Furthermore, we will relate individual differences in the structure of similarity rating networks to individual differences in education and cognitive performance, and thereby assess supposed drivers and consequences of semantic network structure12.
In sum, we aim to contribute to mapping the structural differences in the semantic networks of younger and older adults by assessing whether they are robust across different semantic domains (i.e., animals, countries), different retrieval processes (e.g., fluency, similarity rating), and levels of analysis (i.e., aggregate vs. individual) so as to shed light on age differences in semantic network structure.
Results
Age-related differences in fluency networks
Semantic fluency is a neuropsychological test that requires participants to retrieve as many elements as possible from a natural category35, say, animals, within in a given amount of time. Research has begun to analyze semantic fluency data in novel ways to extract from them semantic networks36 and understand individual and age differences in semantic cognition23,37. Such approaches leverage the fact that the proximity of elements within the sequence of responses should reveal information on whether two elements are connected in an underlying semantic network. Several algorithms utilizing this principle have been proposed. To infer semantic networks of younger and older adults, we rely on a random walk plus filtering algorithm, which was recently found to predict human behavior better than other algorithms37. In the first step, this algorithm adds to a single network for each age group edges for every pair of elements that occurred less than two positions apart from each other across all semantic fluency sequences of the age group. Then, in a second step, all edges in the network that were added only once across all sequences or were less frequent than expectation derived from random behavior are removed (see Figure 1A). Previous research has found this approach to produce plausible networks that predict human behavior better than other network inference methods for fluency data37,38.
We compared semantic networks of younger and older adults on the basis of four semantic fluency data sets, stemming from published work39 and two new studies (see Methods for details). Table 1 provides an overview of the data sets. Following previous work18,39,40, we compared younger and older adults’ networks with respect to three macroscopic network measures: average degree (connectivity, ), average local clustering coefficient (structuredness, C), and average shortest path length (efficiency, L). These metrics are frequently employed to characterize the structure of cognitive networks and have been successfully linked to various measures of cognitive performance (for reviews, see11,25,41): For instance, degree has been linked to speed of retrieving words in lexical decison tasks42, clustering has been linked to retrieval success in cued recall tasks43, and shortest path length has been linked to faster information processing44. To avoid any confounding influences of network size and content, younger and older adults’ networks were compared on the largest connected, common subgraph, containing only words that were produced by both younger and older adults. Figure 2 shows the networks estimated for younger and older adults in each of the four data sets analyzed. Overall, the figures suggest similar semantic relations between items, showing that the network inference mechanism generates plausible, intuitive semantic networks. These figures do not allow, however, an easy and direct comparison of network characteristics across groups, which is typically done by relying on quantitative indices of macroscopic network characteristics, such as connectivity indices (e.g., degree, shortest path length; cf.18,45).
Table 1.
Data set | Age (Range) | N | t | ||
---|---|---|---|---|---|
Wulff et al. (2016) | 52.5 (29-65) | 142 | 1 min | 21.2 | .09 |
73.7 (66-94) | 142 | 1 min | 17.9 | .11 | |
Study 1 (Animal) | 25 (18-34) | 41 | 10 min | 90.7 | .15 |
71.1 (66-81) | 71 | 10 min | 89.6 | .18 | |
Study 1 (Country) | 25 (18-34) | 41 | 10 min | 75.3 | .08 |
71.1 (66-81) | 71 | 10 min | 69.7 | .11 | |
Study 2 (Animal) | 24 (18-32) | 36 | 10 min | 92.6 | .17 |
70 (65-78) | 36 | 10 min | 88.6 | .19 |
* Found to be significantly different between younger and older adults according to permutation tests; N = number of participants; t = time limit of the fluency task; = average number of non-duplicate, valid responses per individual; = number of unique responses u divided by the sum of number responses n for all individuals in the group.
We present a comparison of network indices in Fig. 3. Compared to older adults, the networks of younger adults showed consistently higher average degrees and lower average shortest path lengths. These differences were found to be reliable for one (Study 2—Animals) and three (all but study 2—Countries) data sets according to bootstrap tests (see Supplementary Material). However, results for the average clustering coefficient were mixed. A multiverse analysis46 evaluating the results under various implementations of our inference method suggests that inference is robust for degree and shortest path length, but not for clustering, supporting the differences found for degree and shortest path length and providing an explanation for the mixed results in the latter (see Supplementary Material). These results corroborate the existence of systematic structural differences between younger and older adults’ semantic networks in terms of connectivity and efficiency, but not clustering.
Two additional findings concerning younger and older adults’ semantic fluency data are worth noting. First, in the two studies that gave participants 10 minutes to retrieve items from semantic memory, there were no differences in the number of items produced by younger and older adults, as determined by permutation tests (Table 1). Compared with the shorter retrieval periods of previous studies (cf.47–49), the longer retrieval period of 10 minutes seems to eliminate older adults’ disadvantage of slower memory retrieval. Second, as a group, older adults produced more unique category items across all four data sets, as measured by the number of unique items relative to the number of responses produced by an age group (cf. in Table 1), as determined by a permutation test (Table 1), which is supportive of the notion that older adults possess a larger mental lexicon than younger adults21. Despite such differences, the age-related patterns in macroscopic network structure generalize across the different domains and conditions, which speaks to the generality of these findings across elicitation procedures (cf.12,18).
Age-related differences in individual-level similarity networks
A potential criticism of extant comparisons of younger and older adults’ networks is that they lump together the data of many individuals to form aggregate networks, thus obscuring individual differences. To address this limitation, we conducted a comparison of younger and older adults’ semantic networks at the level of the individual. Specifically, we elicited a large number of similarity ratings and constructed networks directly from each individual’s responses. Aside from avoiding problems of aggregation, this approach had five additional advantages: First, similarity ratings likely recruit different memory retrieval processes and may overall be less affected by such processes than semantic fluency, permitting an independent and, potentially, cleaner assessment of network structure. Second, by requiring participants to rate a common set of words, similarity ratings likely are less affected by vocabulary differences between younger and older adults. Third, similarity ratings deliver direct estimates of the connection strength between words, sidestepping the need to infer edges using complex algorithms. Fourth, similarity ratings deliver graded responses permitting the construction of networks with weighted edges. Finally, because network statistics are available for each individual, the comparison between younger and older adults’ networks can be carried out using standard methods of statistical inference.
In our study, each of 36 younger and 36 older participants provided a total of 2,253 similarity ratings, of which 1,953 were given to all possible pairs of 63 common animals and the remaining 300 to a set of repeat pairs, for which we found reliability to be high (older adults: , younger adults: ). Participants were instructed to rate similarity based on the animal’s degree of relatedness or association. We constructed networks by, first, mapping an individual’s ratings from the original scale of 1 (extremely dissimilar) to 20 (extremely similar) to the scale of 0 to 1, by setting a person’s minimum and maximum rating to 0 and 1, respectively. This was done for two reasons: to account for differences in scale use and to introduce a 0 indicating the absence of edges. Second, we placed edges between all 63 animal nodes with weights equal to the transformed ratings. Finally, we eliminated edges with weights below a threshold . This last step was necessary to be able to determine the average local clustering coefficient, which is not defined for completely connected networks, while also providing us with a means to assess the robustness of our results to the choice of threshold. Figure 4 shows the 72 networks obtained from younger and older adults under .
Across all values of , compared to older adults, the networks of younger adults showed consistently higher average degrees () and lower average shortest path lengths (L), and also higher local clustering coefficients (C) (see Fig. 5). We found the same pattern of results when the networks were analyzed as unweighted networks. For small values of , where more than 50% of all edges were retained, i.e., , moderate to large effects were observed, of which many were reliably distinct from 0 as indicated by the 95% confidence intervals. Effects for more restrictive values of , i.e., pointed in the same direction, but they were smaller in size and, due to larger variance, were mostly not reliably different from 0. These results corroborate the structural differences found for aggregate networks and demonstrate, for the first time, systematic age-related differences in the structure of semantic networks at the level of the individual obtained from a megastudy using semantic similarity ratings.
Moreover, analyses reported in the Supplementary Material confirm the existence of aggregation biases. For the average degree, the clustering coefficient, and the average shortest path length, but not the average strength, estimates based on aggregate networks, which we derived by averaging networks within age groups, were considerably higher than the majority of estimates for individual-level networks. Aggregate networks, however, still revealed group differences consistent with those observed on the individual level, suggesting some level of robustness of group comparisons obtained from aggregate data.
Locating age-related differences in semantic network structure
Past work on the development of semantic knowledge suggests that cumulative linguistic experience and general learning process combine to create specific semantic structures that allow efficient discrimination learning16. Crucially, that work proposes that such learning processes involve the strengthening of some associations while weakening others to allow differentiating between meaningful and meaningless pairs of items in memory. One important consequence of this process is that age differences in network structure may not be homogeneous across pairs of associations due to the interaction of learning and cumulative experience.
To shed light on the differences between younger and older adults’ networks, we compared their networks on the level of node pairs with respect to three metrics that directly underlie the macroscopic results in Fig. 5 and allow us to assess homogeneity of age differences across node pairs. Specifically, for each of the 1,953 node pairs, we compare the edge weight w under (corresponding to and ), the proportion with which the pair forms triangles with other nodes () under , and the path length connecting the pair () also under . Figure 6 displays these results separately for younger and older adults with node pairs ordered by the average edge weight w across both age groups. Ordering edges in this way allows direct inference-by-eye to reveal whether age-differences emerge uniformly across the network.
We observed consistent differences between younger and older adults in terms of all three metrics. Specifically, the edge weights and the proportion of triangles were consistently lower for older than younger adults, whereas path lengths were consistently larger. Crucially, we observed that the differences between older and younger adults were considerably larger for the lower half of node pairs. Thus, the differences between younger and older adults’ networks appear to be mainly due to peripheral regions in the network, where edge weights are small, triangles rare, and shortest path lengths long.
We should note that the results above do not seem to be explained by age differences in use of the scale. We observed the judgments of younger and older adults not to differ in terms of the judged minimum () or the judged maximum (). However, we did find younger and older adults to differ in terms of the ratings‘ average () and, crucially, the ratings’ skewness (), with older adults’ ratings being lower on average and more right skewed. This suggests that younger and older adults interpreted and used the end points of the scales in the same way, and differed only in how they distributed the word pairs in between the end points, as would be expected from different perceptions of similarity between judged pairs.
Assessing the links between cumulative experience, network structure and cognitive performance
The assessment of individual differences in semantic networks gives us the opportunity to assess their links to proposed drivers and consequences of semantic network structure (see,12). If, as past work has suggested, cumulative experience is responsible for structural differences in semantic memory, then differences in the level of education experienced by individuals should be related to semantic network structure. The analysis presented in Fig. 7 reveals that this link was present for older adults, but not for younger adults. Specifically, we observed that a dummy variable coding whether a person received a college education shared between 5% (shortest path length) and 27% (clustering) of variance with the structure of older adults’ networks, whereas there was practically no shared variance between education and younger adults’ networks (see Fig. 7). These results suggest that it is not education by itself, but possibly other forms of life experience that impact individuals’ semantic networks. Consistent with this assessment, results reported in the Supplementary Material show that the effects of age group on network structure are not fully accounted for by education level (or gender). Another corollary of the idea that cumulative experience drives network structure is that older adults differ more from each other as a function of their different accumulated experiences25. We tested this expectation in two ways. First, we compared the within-group variance of network measures and observed that those of older individuals was between 55% (degree) and 116% larger than those of younger adults. Second, we evaluated within age-group agreement in terms of edge weights. Specifically, we compared all pairs of individual networks using a weighted Jaccard index (JI). We found older adults’ networks to be considerably less similar to each other () than younger adults’ networks (; ). Overall, these results are compatible with the idea that cumulative exposure to linguistic and other information contributes to individual differences in the structure of semantic networks.
Past work suggests that the structure of semantic networks may drive cognitive performance, however, this link has so far not been assessed from the perspective of individual differences. To fill this gap, we calculated the shared variance between network structure and vocabulary size, working memory capacity assessed using the operation span task, the number of correct decisions in an associative recall task, and the number of retrievals in the animal fluency task. The results presented in Fig. 7 reveal two important insights. First, network structure and cognitive performance shared, on average, more variance for older than for younger adults, likely due to the larger differences between the network structures of older adults. Second, network structure and cognitive performance shared relatively little variance (median = 3%), with the exception of maybe the links between clustering and vocabulary (12% shared variance) and between clustering and associative recall (9%) among older adults. Overall, these results provide only weak support for the idea that semantic network structure drives individual differences in cognitive performance.
Discussion
We investigated differences in the networks of younger and older adults at both the group and the individual level. Our group-level analyses using semantic fluency data replicate previously observed differences between networks of younger and older adults (e.g.,18,23,40): The aggregate younger adults’ networks based on verbal fluency exhibited larger average degrees and lower average shortest path lengths than older adults’ networks, but the networks of the two age groups did not systematically differ in their average clustering coefficients. Overall, these results indicate that semantic networks may become increasingly sparse with age, with the connectivity between items decreasing with age. Importantly, we extend past work by showing that these age patterns generalize across categories (animals, countries) and time constraints (1 vs. 10 minutes), suggesting that such age-related differences are not a function of specific elicitation choices and generalize across domains.
In addition, analyses of individual networks estimated from a similarity-judgment task involving thousands of judgments from the same individuals ruled out potential problems of aggregation and confirmed the differences in average degrees and lower average shortest path lengths, while additionally revealing systematic differences in terms of average clustering coefficients, in the direction of lower clustering in older adults’ semantic networks. We found age differences were especially pronounced for weakly-related, peripheral regions of the network. Further, older adults’ networks were more variable and considerably less similar to each other than younger adults’ networks, and this variability covaried with level of education. All in all, these results provide converging evidence that the semantic networks of younger and older adults differ systematically not only in content, as has been amply suggested in past work21, but also in their structure25. Our results are congruent with the idea of a progressively idiosyncratic nature of semantic representations across the life span, leading to more distinct semantic representations between individuals over time. Further, our findings are novel in suggesting that individual and age differences may be strongest for peripheral parts of semantic representations, which emphasize the importance of investigating a large swath of individuals’ semantic representations to understand the environmental and cognitive contributions to individual differences in semantic cognition.
We should point out a number of limitations in our work. First and foremost, we must acknowledge that we cannot definitively determine to what extent the age differences described above are due to age differences in representation and/or control processes involved in searching and selecting information from memory. The type of network models we adopt here to describe lexical associations are, in principle, compatible with mechanistic explanations based on both representation and process and, therefore, cannot fully arbitrate between the two50. Our finding that results generalize across elicitation conditions (time constraints), domains (animals, countries), and tasks (semantic fluency, similarity judgement) could be indicative of age differences being due to differences in the underlying representation, but only to the extent that one can confidently assume different processes of search and comparison across the different conditions, domains, and tasks. It seems plausible that the underlying cognitive processes are perhaps not identical but at least similar, as all share aspects of controlled selection, involving the activation of concepts (e.g., “animal”) and their features (e.g., “has wings”). There are two main approaches that could be interesting to further address the role of representation and process in engendering age differences in the semantic networks estimated from lexical tasks. One approach could involve additional independent measures to statistically account for the contribution of control processes using an individual differences approach (e.g.,51). Another approach involves making use of neuroimaging techniques to directly measure mechanisms of control and memory retrieval. Past work suggests that representation and semantic control rely on distinct (but interacting) brain regions52 and this information could potentially be leveraged to provide an estimate of the role of control processes in semantic cognition.
Second, we are unable to derive strong conclusions about how the differences in semantic networks impact younger and older adults’ cognitive performance. The three properties of semantic network structure considered—size, connectedness, efficiency, and structuredness—have been linked empirically and theoretically to cognitive performance (for reviews see11,12,25), but in most cases these links were established at the level of words or word pairs, rather than individuals’ overall performance (but see31). For instance, research has found that words with higher degrees (connectedness) are more likely to be retrieved in semantic fluency or episodic memory tasks (e.g.,12). Furthermore, while it is plausible that word-level effects translate to differences in overall performance, we observed only relatively weak links between cognitive network structure and performance. One possible explanation is the lack of semantic overlap between the networks’ contents and the cognitive tasks we deployed. Alternatively, the lack of strong links between network structure and cognitive performance could underpin the possibility that the observed differences in semantic network structure reflect differences in representation, rather than fluid cognitive ability. Indeed, this was one of the motivations behind using similarity ratings as opposed to deriving networks from free association or fluency data alone. The latter approaches might confound fluid abilities and memory representation more strongly than similarity ratings9,12.
Third, on a related note, we do not detail a specific mechanism to account for the interaction between cumulative experience and network structure. Consequently, a key challenge for future research lies in developing models for the age-related changes in the structure of semantic networks reported here. One promising proposal stems from models of discriminative learning, whereby increasing experience leads weakly and strongly related contents in memory to be driven further apart from each other, resulting in a topological expansion of the network. The nature of structural differences, the observations of amplified differences for more weakly related words, as well as the lower similarity between older adults’ compared to younger adults’ networks, seem to support this notion. However, so far, discriminative learning has only been successfully employed to account for age differences in paired-associate learning15,16. Whether such a mechanism can be expanded to account for the full set of results presented here remains an open question.
Fourth, and finally, our work made use of an extreme-group comparison design by comparing groups of younger relative to older adults. This type of design is not optimal to study the role of cumulative experience that is thought to underlie age differences in the content and structure of lexical and semantic networks. Ideally, estimates of cumulative experience and associated semantic networks would be obtained longitudinally for large samples of individuals and across long spans of time involving years or decades. One major difficulty with such studies will be mapping semantic networks for specific individuals but such efforts are under way12.
Despite its limitations, our work has some important implications for understanding and modeling human cognition. In many tasks that are presumed to be linked to the structure of semantic networks, older adults are known to perform worse than younger adults26, which is often considered a consequence of declining fluid abilities26,27. Our and similar findings of systematic differences in semantic networks open up an alternative route leading to age differences in cognitive performance, whereby older adults’ cognitive performance shows apparent decline because of the consequences of learning for the size and the structure of semantic networks. In turn, our finding that age differences may be particularly pronounced in weakly connected, peripheral parts of semantic networks could have implications for future tests of theories of individual and age differences in semantic cognition that may, or may not, make predictions concerning different parts of semantic representations.
Our results may have implications beyond our theoretical understanding of healthy cognitive aging. Lacking a cure, the best way to battle the “dementia epidemic” is timely diagnosis and early treatment53,54. The diagnosis of mild cognitive impairments and early dementia is, however, still predominately based on tests of cognitive performance53. Instruments such as the short dementia screener DemTect29 or the neuropsychological battery CERAD55 involve an individual undergoing a series of standard cognitive tasks, including several of the tasks listed above. Understanding the role of age-related changes in the structure of semantic networks promises to improve our interpretation of current instruments for dementia screening and diagnosis. Further research in this direction could lead to more personalized instruments that can detect changes in cognitive performance earlier and with higher sensitivity by focusing on specific parts of semantic representations than is currently done.
In sum, we have presented converging results from semantic fluency and similarity judgment tasks concerning structural differences in the semantic networks of younger and older adults. Older adults seem to possess richer more idiosyncratic networks, characterized by smaller average degree and longer path lengths relative to those of younger adults. Our results emphasize the importance of considering how life span cognitive development and cumulative experience shape the content and structure of individuals’ semantic cognition.
Methods
Fluency data
Four data sets from three studies were used to infer networks from fluency data. The first data set was obtained from39, who jointly analyzed the data of two published studies, i.e., from Hills et al.49 and the Midlife in the United States (MIDUS3) longitudinal study. The data of Hills et al.49 contain a one-minute animal fluency task collected at Stanford University, CA, of a total of 201 participants aged 27 to 99 (Mdn = 68). The MIDUS3 data contained one-minute animal fluency data—recorded in phone interviews—from 104 individuals aged 34 to 83 (Mdn = 59). Audio recordings were transcribed by us (see Supplementary Material). Following Wulff et al.39, we eliminated 21 individuals with fewer than 10 fluency productions and mini-mental state values lower than 26, which is indicative of either low attention to the task or the onset of age-related disorders, before combining the two data sets for a total sample of 282 individuals. Groups of younger and older adults were created by splitting the data at the median age. This resulted in groups of 142 individuals each aged 29 to 65 years old and 66 to 94 years old, respectively. The original data of our first study were collected in the context of another investigation into age-difference in decision making running in the laboratories of the Max Planck Institute (MPI) for Human Development, Berlin. We collected 10-minute fluency data for both animals and countries from 71 older adults and 41 younger adults. Responses were recorded using a microphone and transcribed by us. Participants were recruited through the internal participant database of the MPI for Human Development. The older adults’ age ranged from 65 to 80 years with a median age of 70 years, the younger adults’ age ranged from 17 to 33 with a median age of 25. Participants were paid 10€/hour for participation. The second study was also conducted at the Max Planck Institute for Human Development using participants from the MPI’s internal database. We collected 10-minute fluency data for animals from 36 older adults and 36 younger adults. Responses were recorded using a microphone and transcribed by us. The older adults’ age ranged from 65 to 78 years with a median age of 70 years, the younger adults’ age ranged from 18 to 32 with a median age of 24. Participants in study 3 also completed measures of vocabulary size, working memory capacity (i.e., operation span task), and associative recall (see Supplementary Material). Participants were paid 10€/hour for participation. Study 1, 2 and 3 were approved by the internal review board of the Max Planck Institute for Human Development. All studies were performed in accordance with relevant guidelines and regulations. Participants provided informed consent at the beginning of each study.
Fluency data were subjected to minimal preprocessing. Responses were scrutinized for category membership and spelling. A lenient criterion was used to assess category membership to retain as much of the original data as possible. In the case of animals, all nonfictional entries that described entire, nonhuman, and nonfictional animals were retained. This led us to exclude a few cases from the data, such as Godzilla, cat eye, or animal trainer. Similarly, in the case of countries, we retained all existing and named territories such as Istrien, a region of Italy, Croatia and Slovenia, the desert Sahara or cities, but not nonexistent, fictional territories such as Middle-earth. Spelling was hand-corrected on the basis of the Merriam-Webster online dictionary. Overall 96.8–99% of responses were retained in the analysis.
Measures of macroscopic network structure
The average degree of a network , with nodes (or vertices) V and edges E, is defined as for unweighted networks and as , where denotes the presence of an edge between nodes i and j and the according edge weight. The average degree or strength, as it is commonly referred to for weighted networks, describes the average connectivity in the network. The average local clustering coefficient for unweighted networks is defined as with and being the degree of node i and the set of neighbors to i. For weighted networks, with being the strength of node i, the weighted analog to . The local clustering coefficient describes the degree of transitivity in the network and is related to network modularity56. It is often conceived of as an indicator of the structuredness of a network57. The average shortest path length is defined as where is the length of shortest path between nodes i and j, also known as the geodesic distance. For weighted networks, is the sum of weights rather than the length. The average shortest path length describes the average distance between nodes. Low average shortest path lengths have been associated with efficient information processes44,58.
Network inference approach
Networks were inferred from semantic fluency data based on the community model developed by Goñi et al.38 and studied by Zemla and Austerweil37. The model is based on a two-step procedure. First, nodes and edges are included for every pair of responses that occurred within a distance of l responses. For instance, for the response sequence “dog, cat, mouse, rabbit” and a criterion of , edges would be included for all pairs less than three responses apart, excluding only the pair dog and rabbit, which are three responses apart. Second, an edge is identified as a true edge if the frequency of the connected words occurring with l or fewer steps apart exceeded a frequency threshold reflecting the required minimum frequency of co-occurring within l responses to be considered in the first place, as well as a frequency threshold . The latter is derived from the probability of two words occurring within l responses by chance, which is calculated as . Furthermore, , the probability of two words to co-occur within a fluency sequence, and , the probability that two responses are no more than l responses apart, are calculated as and with , denoting the number of times two responses occur across M sequence and N denotes the average number of productions per sequence. is then defined as the quantile of the binomial distribution . Consistent with prior literature, we set , , and 37,38 for our main analyses. In addition, we evaluate the robustness of the results in a multiverse analysis46 presented in the Supplementary Material.
Similarity ratings
Similarity ratings were collected in the context of study 3 and prior to participants completing the semantic fluency task. Participants took home a tablet to provide, over the course of roughly one week, on a scale from 1 to 20, similarity ratings for 2,268 pairs of animals, consisting of each possible pair of 63 frequently occurring animals and 315 repeated pairs. The 63 animals were selected on the basis of the semantic fluency responses of study 2 in a manner that equated word frequency across younger and older adult age groups. See Supplementary Material. Reliability was found to be high in both younger and older adults with correlations of , for younger and older adults, respectively. Participants were paid 10€/hour for participation in the lab session and a flat fee of 44.1€ for providing the similarity ratings.
Statistical comparisons
Group comparisons of semantic fluency statistics were carried out using permutation and bootstrap tests. Group differences in the number of retrievals and the number of unique retrievals were tested by comparing them against a null distribution of 10,000 samples generated by randomly assigning individuals to our two age groups. Group differences in fluency network measures were tested by generating sampling distribution consisting of 10,000 samples per group by drawing from the groups’ fluency sequences with replacement. To account for the differences in group sizes in the two data sets of study 1, all study 1 analyses are on repeated random samples of matching group sizes. Group comparisons of similarity network statistics were also carried using bootstrap tests based on 10,000 bootstrap samples. Wherever possible we report the results as 95% confidence intervals rather than p-values to account for the non-confirmatory nature of our investigation.
Supplementary Information
Acknowledgements
We are grateful to Laura Wiles for editing the manuscript. This work was supported by a grant from the Swiss Science Foundation (100015_197315) to Dirk U. Wulff.
Author contributions
D.W., T.H., and R.M. designed the studies. D.W. conducted all analyses. D.W., T.H., and R.M wrote the paper.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-022-11698-4.
References
- 1.Baronchelli A, Ferrer-i-Cancho R, Pastor-Satorras R, Chater N, Christiansen MH. Networks in cognitive science. Trends Cogn. Sci. 2013;17(7):348–360. doi: 10.1016/j.tics.2013.04.010. [DOI] [PubMed] [Google Scholar]
- 2.Beer RD. Dynamical approaches to cognitive science. Trends Cogn. Sci. 2000;4(3):91–99. doi: 10.1016/S1364-6613(99)01440-0. [DOI] [PubMed] [Google Scholar]
- 3.Borge-Holthoefer J, Arenas A. Semantic networks: Structure and dynamics. Entropy. 2010;12(5):1264–1302. doi: 10.3390/e12051264. [DOI] [Google Scholar]
- 4.Anderson JR. A spreading activation theory of memory. J. Verbal Learn. Verbal Behav. 1983;22(3):261–295. doi: 10.1016/S0022-5371(83)90201-3. [DOI] [Google Scholar]
- 5.Collins AM, Loftus EF. A spreadingactivation theory of semantic processing. Psychol. Rev. 1975;82(6):407. doi: 10.1037/0033-295X.82.6.407. [DOI] [Google Scholar]
- 6.Kenett, Y. N. et al. Flexibility of thought in high creative individuals represented by percolation analysis. Proceedings of the National Academy of Sciences, 201717362 (2018). [DOI] [PMC free article] [PubMed]
- 7.Beaty, R. E. et al. Robust prediction of individual creative ability from brain functional connectivity. Proceedings of the National Academy of Sciences, 201713532 (2018). [DOI] [PMC free article] [PubMed]
- 8.Hills TT, Jones MN, Todd PM. Optimal foraging in semantic memory. Psychol. Rev. 2012;119(2):431–440. doi: 10.1037/a0027373. [DOI] [PubMed] [Google Scholar]
- 9.Hills, T. T. & Kenett, Y. N. Is the mind a network? Maps, vehicles, and skyhooks in cognitive network science. Topics in Cognitive Science (2021). [DOI] [PubMed]
- 10.Jones MN, Hills TT, Todd PM. Hidden processes in structural representations: A reply to abbott, austerweil, and griffiths (2015) Psychol. Rev. 2015;122(3):570–574. doi: 10.1037/a0039248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Siew, C. S., Wulff, D. U., Beckage, N. M. & Kenett, Y. N. Cognitive network science: A review of research on cognition through the lens of network representations, processes, and dynamics. Complexity, 2108423 (2019).
- 12.Wulff, D. U., De Deyne, S., Aeschbach, S., & Mata, R. (2021). Understanding the aging lexicon by linking individuals’ experience, semantic networks, and cognitive performance. PsyArXiv. [DOI] [PMC free article] [PubMed]
- 13.Kraemer, P. M., Wulff, D. U., & Gluth, S. (2021). A sequential sampling account of semantic relatedness decisions. PsyArXiv.
- 14.Bhatia S. Predicting risk perception: New insights from data science. Manage. Sci. 2019;65(8):3800–3823. doi: 10.1287/mnsc.2018.3121. [DOI] [Google Scholar]
- 15.Ramscar M, Hendrix P, Shaoul C, Milin P, Baayen H. The myth of cognitive decline: Non-linear dynamics of lifelong learning. Top. Cogn. Sci. 2014;6(1):5–42. doi: 10.1111/tops.12078. [DOI] [PubMed] [Google Scholar]
- 16.Ramscar M, Sun CC, Hendrix P, Baayen H. The mismeasurement of mind: Life-span changes in paired-associate-learning scores reflect the “cost” of learning, not cognitive decline. Psychol. Sci. 2017;28(8):1171–1179. doi: 10.1177/0956797617706393. [DOI] [PubMed] [Google Scholar]
- 17.Benedek M, Kenett YN, Umdasch K, Anaki D, Faust M, Neubauer AC. How semantic memory structure and intelligence contribute to creative thought: A network science approach. Think. Reason. 2017;23(2):158–183. doi: 10.1080/13546783.2016.1278034. [DOI] [Google Scholar]
- 18.Dubossarsky H, De Deyne S, Hills TT. Quantifying the structure of free association networks across the life span. Dev. Psychol. 2017;53(8):1560–1570. doi: 10.1037/dev0000347. [DOI] [PubMed] [Google Scholar]
- 19.Morais AS, Olsson H, Schooler LJ. Mapping the structure of semantic memory. Cogn. Sci. 2013;37(1):125–145. doi: 10.1111/cogs.12013. [DOI] [PubMed] [Google Scholar]
- 20.Lindenberger U. Human cognitive aging: Corriger la fortune? Science. 2014;346(6209):572–578. doi: 10.1126/science.1254403. [DOI] [PubMed] [Google Scholar]
- 21.Verhaeghen P. Aging and vocabulary scores: A meta-analysis. Psychol. Aging. 2003;18(2):332–9. doi: 10.1037/0882-7974.18.2.332. [DOI] [PubMed] [Google Scholar]
- 22.Keuleers E, Stevens M, Mandera P, Brysbaert M. Word knowledge in the crowd: Measuring vocabulary size and word prevalence in a massive online experiment. Q. J. Exp. Psychol. 2015;68(8):1665–1692. doi: 10.1080/17470218.2015.1022560. [DOI] [PubMed] [Google Scholar]
- 23.Cosgrove AL, Kenett YN, Beaty RE, Diaz MT. Quantifying flexibility in thought: The resiliency of semantic networks differs across the lifespan. Cognition. 2021;211:104631. doi: 10.1016/j.cognition.2021.104631. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Nation, K. Nurturing a lexical legacy: Reading experience is critical for the development of word reading skill. NPJ Sci. Learn.2(3) (2017). [DOI] [PMC free article] [PubMed]
- 25.Wulff DU, De Deyne S, Jones MN, Mata R. The aging lexicon consortium. New perspectives on the aging lexicon. Trends Cognit. Sci. 2019;23(8):686–698. doi: 10.1016/j.tics.2019.05.003. [DOI] [PubMed] [Google Scholar]
- 26.Salthouse TA. Selective review of cognitive aging. J. Int. Neuropsychol. Soc. 2010;16(5):754–760. doi: 10.1017/S1355617710000706. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Healey MK, Kahana MJ. A four-component model of age-related memory change. Psychol. Rev. 2016;123(1):23. doi: 10.1037/rev0000015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Buchler NE, Reder LM. Modeling age-related memory deficits: A two-parameter solution. Psychol. Aging. 2007;22(1):104. doi: 10.1037/0882-7974.22.1.104. [DOI] [PubMed] [Google Scholar]
- 29.Kalbe E, Kessler J, Calabrese P, Smith R, Passmore A, Brand Ma, Bullock R, et al. Demtect: A new, sensitive cognitive screening test to support the diagnosis of mild cognitive impairment and early dementia. Int. J. Geriatr. Psychiatry. 2004;19(2):136–143. doi: 10.1002/gps.1042. [DOI] [PubMed] [Google Scholar]
- 30.Luce PA, Pisoni DB. Recognizing spoken words: The neighborhood activation model. Ear Hear. 1998;19(1):1. doi: 10.1097/00003446-199802000-00001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Howard DV. A multidimensional scaling analysis of aging and the semantic structure of animal names. Exp. Aging Res. 1983;9(1):27–30. doi: 10.1080/03610738308258416. [DOI] [PubMed] [Google Scholar]
- 32.Verheyen S, Droeshout E, Storms G. Age- related degree and criteria differences in semantic categorization. J. Cogn. 2019;2(1):17. doi: 10.5334/joc.74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Keuleers E, Balota DA. Megastudies, crowdsourcing, and large datasets in psycholinguistics: An overview of recent developments. Q. J. Exp. Psychol. 2018;68(8):1457–1468. doi: 10.1080/17470218.2015.1051065. [DOI] [PubMed] [Google Scholar]
- 34.Wulff, D. U., Aeschbach, S., De Deyne, S., & Mata, R. (in press). Data from the myswow proof-of-concept study: Linking individual semantic networks and cognitive performance. Journal of Open Psychology Data.
- 35.Bousfield WA. The occurrence of clustering in the recall of randomly arranged associates. J. Gen. Psychol. 1953;49(2):229–240. doi: 10.1080/00221309.1953.9710088. [DOI] [Google Scholar]
- 36.Henry JD, Crawford JR, Phillips LH. Verbal fluency performance in dementia of the alzheimer’s type: A meta-analysis. Neuropsychologia. 2004;42(9):1212–1222. doi: 10.1016/j.neuropsychologia.2004.02.001. [DOI] [PubMed] [Google Scholar]
- 37.Zemla, J. C. & Austerweil, J. L. Estimating semantic networks of groups and individuals from fluency data. Computational Brain & Behavior, 1–23 (2018). [DOI] [PMC free article] [PubMed]
- 38.Goñi J, Arrondo G, Sepulcre J, Martincorena I, de Mendizábal NV, Corominas-Murtra B, Bejarano B, Ardanza-Trevijano S, Peraita H, Wall DP, et al. The semantic organization of the animal category: Evidence from semantic verbal fluency and network theory. Cogn. Process. 2011;12(2):183–196. doi: 10.1007/s10339-010-0372-x. [DOI] [PubMed] [Google Scholar]
- 39.Wulff, D. U., Hills, T. T., Lachman, M., & Mata, R. (2016). The aging lexicon: Differences in the semantic networks of younger and older adults. Proceedings of the 38th Annual Conference of the Cognitive Science Society. Austin, TX, 907-912.
- 40.Zortea M, Menegola B, Villavicencio A, Salles JFd. Graph analysis of semantic word association among children, adults, and the elderly. Psicologia: Reflexão e Crítica. 2014;27(1):90–99. [Google Scholar]
- 41.Kenett, Y. N., Beckage, N. M., Siew, C. S. & Wulff, D. U. Cognitive network science: A new frontier. Complexity, 6870278 (2020).
- 42.De Deyne S, Navarro DJ, Storms G. Better explanations of lexical and semantic cognition using networks derived from continued rather than single-word associations. Behav. Res. Methods. 2013;45(2):480–498. doi: 10.3758/s13428-012-0260-7. [DOI] [PubMed] [Google Scholar]
- 43.Nelson DL, Bennett DJ, Gee NR, Schreiber TA, McKinney VM. Implicit memory: Effects of network size and interconnectivity on cued recall. J. Exp. Psychol. Learn. Mem. Cogn. 1993;19(4):747–64. doi: 10.1037/0278-7393.19.4.747. [DOI] [PubMed] [Google Scholar]
- 44.Bullmore E, Sporns O. The economy of brain network organization. Nat. Rev. Neurosci. 2012;13(5):336. doi: 10.1038/nrn3214. [DOI] [PubMed] [Google Scholar]
- 45.Steyvers M, Tenenbaum J. The large-scale structure of semantic networks: Statistical analyses and a model of semantic growth. Cogn. Sci. 2005;29(1):41–78. doi: 10.1207/s15516709cog2901_3. [DOI] [PubMed] [Google Scholar]
- 46.Steegen S, Tuerlinckx F, Gelman A, Vanpaemel W. Increasing transparency through a multiverse analysis. Perspect. Psychol. Sci. 2016;11(5):702–712. doi: 10.1177/1745691616658637. [DOI] [PubMed] [Google Scholar]
- 47.Rosen WG. Verbal fluency in aging and dementia. J. Clin. Exp. Neuropsychol. 1980;2(2):135–146. doi: 10.1080/01688638008403788. [DOI] [Google Scholar]
- 48.Tombaugh TN, Kozak J, Rees L. Normative data stratified by age and education for two measures of verbal fluency: Fas and animal naming. Arch. Clin. Neuropsychol. 1999;14(2):167–177. [PubMed] [Google Scholar]
- 49.Hills TT, Mata R, Wilke A, Samanez-Larkin GR. Mechanisms of age-related decline in memory search across the adult life span. Dev. Psychol. 2013;49(12):2396–2404. doi: 10.1037/a0032272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Castro N, Siew CSQ. Contributions of modern network science to the cognitive sciences: Revisiting research spirals of representation and process. Proc. Royal Soc. A: Math. Phys. Eng. Sci. 2020;476(2238):20190825–25. doi: 10.1098/rspa.2019.0825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Hoffman P. An individual differences approach to semantic cognition: Divergent effects of age on representation, retrieval and selection. Sci. Rep. 2018;8:8145. doi: 10.1038/s41598-018-26569-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Ralph MAL, Jefferies E, Patterson K, Rogers TT. The neural and computational bases of semantic cognition. Nat. Rev. Neurosci. 2017;18(1):42–55. doi: 10.1038/nrn.2016.150. [DOI] [PubMed] [Google Scholar]
- 53.Robinson L, Tang E, Taylor J-P. Dementia: Timely diagnosis and early intervention. BMJ. 2015;350:h3029. doi: 10.1136/bmj.h3029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Larson EB, Yaffe K, Langa KM. New insights into the dementia epidemic. N. Engl. J. Med. 2013;369(24):2275–2277. doi: 10.1056/NEJMp1311405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Fillenbaum GG, van Belle G, Morris JC, Mohs RC, Mirra SS, Davis PC, Tariot PN, Silverman JM, Clark CM, Welsh-Bohmer KA, et al. Consortium to establish a registry for alzheimer’s disease (cerad): The first twenty years. Alzheimer’s & Dementia. 2008;4(2):96–109. doi: 10.1016/j.jalz.2007.08.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Newman ME. Modularity and community structure in networks. Proc. Natl. Acad. Sci. 2006;103(23):8577–8582. doi: 10.1073/pnas.0601602103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Barrat A, Barthelemy M, Pastor-Satorras R, Vespignani A. The architecture of complex weighted networks. Proc. Natl. Acad. Sci. 2004;101(11):3747–3752. doi: 10.1073/pnas.0400087101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Latora V, Marchiori M. Efficient behavior of small-world networks. Phys. Rev. Lett. 2001;87(19):198701. doi: 10.1103/PhysRevLett.87.198701. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.