Semantic Representations Are Updated Across the Lifespan Reflecting Diachronic Language Change

Ellis Cain; Rachel Ryskin

doi:10.1162/OPMI.a.315

. 2025 Dec 18;9:2114–2148. doi: 10.1162/OPMI.a.315

Semantic Representations Are Updated Across the Lifespan Reflecting Diachronic Language Change

Ellis Cain ^1,^*, Rachel Ryskin ¹

PMCID: PMC12768554 PMID: 41497527

Abstract

Humans learn the meanings of words from the contexts in which they are used. Patterns of language use change over time, suggesting that the contexts in which some words are experienced change across an individual’s lifespan. Here, we investigated whether language users’ semantic space changes in lockstep with changes in the language or whether it retains traces of historical language use/meanings. In two studies, we used distributional semantic word embeddings trained on corpora from different decades (HistWords) to capture meaning change at the level of the (English) language. We first compared these diachronic semantic spaces to the semantic spaces of individuals in different age cohorts (ranging from people in their 20s to people over 70) using an open dataset of associations norms (Small World of Words). Then, using HistWords, we sampled English words that have changed in meaning and words that have maintained the same meaning/usage patterns between the 1950s and the 1990s and collected relatedness judgments for those words with their nearest neighbors from each decade (1950s and 1990s) from both younger (18–33 years) and older (63–92 years) adults. Across the two studies, the semantic spaces of both older and younger adults were most strongly correlated with the semantic spaces derived from more recent corpora. We found little evidence of historical semantic spaces being differentially predictive of the semantic spaces of older adults relative to those of young adults. Our findings suggest that individuals continuously and rapidly update their lexico-semantic representations regardless of age, such that word meanings learned earlier in life are largely replaced with new meanings derived from later language experience.

Keywords: lexical semantics, language change, learning, aging

INTRODUCTION

Language can be viewed as a complex adaptive system (Five Graces Group et al., 2009), in which global patterns and regularities arise from interactions between individual language users. The behavior of speakers in a speech (or writing/signing) community—how they choose to express their intended messages—is based on social norms, cognitive pressures, and memory for past interactions. From these individual behaviors emerge the collective language usage patterns, which are recorded in corpora. Future individuals learn from past collective usage patterns, as well as from interactions with others in their community. A consequence of this complex adaptive system view is that language usage patterns may change over historical time, across (and even within) generations of language users (Bybee, 2015; Bynon, 1977). For instance, word meanings change over time: broadcast used to be a farming term (i.e., broadcast the seeds), icon a religious term (i.e., a religious icon), and google a math term, however, now their dominant meanings have changed to more media and computer-centric senses. Individual language users are able to change their lexico-semantic representations and learn these new meanings as they gain prominence. Yet, how meaning change occurs across the lifespan of an individual and how it is related to change in the language at the collective level is an open question.

Changes in Meaning at the Level of the Language

According to usage based theories of language, word meanings are inextricably linked to the context of their use (e.g., Firth, 1957; Harris, 1954; Wittgenstein, 1953). Words which appear in similar contexts will tend to have similar meanings (e.g., ‘dog’ and ‘cat’). This ‘distributional semantics’ view motivates the use of algorithms like Word2Vec (Mikolov et al., 2013) which use word-context co-occurrence statistics to embed words into a semantic space. These embeddings are known to successfully capture aspects of meaning when compared to human judgments (e.g., Ettinger & Linzen, 2016; Grand et al., 2022; Hill et al., 2015; Lewis et al., 2019).

Diachronic changes in lexical and morphological usage have been documented through corpus analyses (e.g., Davies, 2012; Hilpert & Gries, 2009; Michel et al., 2011), suggesting that the semantic space of the language as a whole may be continually changing. While new words may occasionally be created to capture a new meaning, pre-existing words are often reused and extended to include new meaning senses, leading to polysemy (Ramiro et al., 2018; Srinivasan & Rabagliati, 2015). In other words, polysemy serves as a way for a (limited) lexicon to keep up-to-date with changes in the world or new associations created by language users.

Hamilton et al. (2018) developed a methodology for quantifying semantic change through the evaluation of word embeddings against known historical changes. They trained Word2Vec on decade-level subsets of the Google Ngrams corpus (Michel et al., 2011) and aligned these decade-level embeddings using an orthogonal Procrustes transformation. Their findings suggested that less frequently used words are more likely to undergo meaning change, and secondly, more polysemous words are more likely to undergo meaning change. Polysemy may serve as a driver of semantic change, such that increased flexibility in usage and previous meaning extensions may lead to more extensions in the future. In a parallel analysis of the same corpus, Xu and Kemp (2015) found that related words undergo parallel changes in meaning, with cognitive mechanisms such as analogy preserving the patterns of relationship between words. These semantic changes at the collective level are driven by aggregate changes in individual usage patterns. Yet, it is unknown whether these reflect parallel changes within individuals or changes in population composition.

Changes in Meaning Across the Lifespan

A common way to model our representational structure of linguistic information is through lexical association networks (e.g., De Deyne et al., 2019; Kumar et al., 2022; Siew et al., 2019), where words are represented as the nodes of a network, and associations between these are represented through edges. The structure of these lexical networks have been found to differ by age. Using word association data, Dubossarsky et al. (2017) found that networks are small and sparse during language acquisition (10–18 years old), they are dense and well-connected in mid-life, and then they become sparse again (though larger overall), with a larger proportion of isolated, peripheral nodes, in late-life. Similarly, Cosgrove et al. (2021) found that the semantic networks of younger adults were more interconnected and resilient to disruption compared to those of older adults, which were sparse and segregated.

One explanation for these differences in lexical-semantic networks across the lifespan is that they are the result of age-related cognitive decline which leads to retrieval difficulties in the word association task (e.g., Hills et al., 2013). Alternatively, the changing structure of lexical semantic networks may be the result of lifelong language learning (Brysbaert, Stevens, et al., 2014; Hartshorne & Germine, 2015; Kuperman et al., 2012). It is important to note that these explanations are not mutually exclusive. Adults continue to engage in statistical learning and acquire novel word-meaning mappings based on regularities in the co-occurrences of novel words and contexts (Fitneva & Christiansen, 2011; Smith et al., 2011; Yu & Smith, 2007). And adults’ vocabularies continue to grow across the lifespan, even as other cognitive functions (e.g., executive function, working memory) begin to decline (e.g. Hartshorne & Germine, 2015). Given the skewed distribution of term frequency in natural languages (Piantadosi, 2014; Zipf, 1935), the network sparsity in older adults may be due to the fact that many individuals may only encounter certain rare words well into adulthood. Similarly, age-related declines in performance in paired associate learning and memory search tasks, previously attributed to cognitive decline, can be explained by the effects of longer learning periods and increased vocabularies in older adults (i.e., Baayen et al., 2017; Ramscar et al., 2014).

Further, Castro et al. (2021) analyzed category exemplar responses (e.g., category: ‘a bird’, exemplar: ‘robin’) across different age groups and time periods. They found that the response patterns to many categories were generally stable (i.e., exemplar frequency was highly correlated across time periods/ages), but the ordering of the responses was not always consistent between generations. Categories that underwent change were likely influenced by historical or social factors (e.g., diseases, toys, weapons, fuels), suggesting that lexical meaning changes at the level of the language may lead to differences in lexical meaning between age groups.

Language Change and Age-Related Meaning Change

Differences in lexico-semantic representations across age groups could be largely idiosyncratic. Older adults typically know more words than young adults but which additional words they know may differ across individuals, resulting in greater variability in some word meanings for older adults relative to young adults (i.e., Brysbaert et al., 2016). In contrast, it may be that the differences between age cohorts reflect shared prior experiences. Older adults’ language experience differs from that of younger adults not only in the quantity of words but also in the nature of the contexts in which (some) words have historically appeared. For instance, over the period of time during which a 70-year-old has been experiencing and continually tracking language use patterns (in 2025, a 70-year-old would have started experiencing language in the 1950s), the collective usage patterns and meanings of some words have shifted (Hamilton et al., 2018).

In order to be able to coordinate with other members of their community (Chater & Christiansen, 2010; Five Graces Group et al., 2009), individual language users must update their lexico-semantic representations to match the usage patterns of the other members (e.g., learn a new meaning for “tweet”). If the differences in lexico-semantic networks across age groups are driven primarily by changes in language use patterns, there may be systematic differences in some word meanings for older adults relative to younger adults. Specifically, those differences would reflect the historical meanings of those words.

To our knowledge, only one previous study has looked directly at the relationship between diachronic language change and differences in word meanings across age cohorts. Li and Siew (2022) found that words whose meaning changed more over time elicited slower response times in a semantic decision task, but crucially, this effect was stronger among middle-aged adults (45–55 years old) than young adults (18–25 years old). Their results are suggestive of systematic differences in the lexico-semantic representations of older and younger adults that are related to their differential experience of a changing language. However, the meanings of words were probed only indirectly (via a judgment of concreteness), and because the semantic decision task is speeded, it is unknown whether these age-related effects of meaning change impact all aspects of semantic representation and processing or only real-time aspects (e.g., lexical retrieval). Thus, whether there are systematic age-related differences in some word meanings reflecting their historical change remains an open question.

Present Research

The goals of the present work are to probe differences in lexico-semantic representations across different age groups and test how well these are explained by diachronic language change at the collective level. In two studies, we use representational similarity analysis (RSA; Kriegeskorte et al., 2008) to compare word similarities derived from historical word embeddings (HistWords; Hamilton et al., 2018) across decades to word similarity values obtained from behavioral data from different age cohorts. In the first study, we used openly-available word association data (Small World of Words; De Deyne et al., 2019) to derive word similarities for different age cohorts (from 20 to 70 years of age). In the second study, we collected semantic relatedness judgments from participants in two age groups: younger adults (18 to 33 years of age) and older adults (63 to 92 years of age).

We view the three types of data used for RSA—word embeddings, word associations, and relatedness judgments—as capturing semantic information in distinct but deeply interconnected ways. In everyday language use, speakers and writers draw on their stored representations of meaning, and distributional semantics models are trained on large quantities of aggregated usage data. To the extent that these training data are reasonably representative, the embedding space created by a distributional semantics model will capture the aggregate semantic information in the minds of language users (see Lenci, 2018). Word associations are likely the product of a memory search (i.e., Abbott et al., 2012; Hills et al., 2012, 2015), where individuals generate responses that are associated with a cue by searching through their stored semantic representations. On an individual trial level, the associations are influenced by context, memory search, and expertise (as in response chaining; De Deyne et al., 2019), but once aggregated across a large enough group of trials and people, these patterns of associations can roughly capture the semantic space of a group of language users (Reilly et al., 2025). Relatedness judgments likely require individuals to retrieve and compute the overlap between two words’ semantic representations in the same semantic space (Johns & Jones, 2022), which then must be quantified on a reference scale. Similar to word associations, particular relatedness judgments might be influenced by context and other factors, but they can also be aggregated across a large enough group to represent the relatedness of words in the semantic space of a group of language users. Since words that are related tend to be produced in similar contexts (i.e., Firth, 1957), the relatedness of two words is often tied to linguistic co-occurrence (which is what word embeddings are also based on; Hill et al., 2015; McRae et al., 2012). Therefore, the RSA conducted in the present study is comparing distinct representations of similar semantic information, which will allow us to examine the relationship between the semantic representations of different age groups and historical corpus-based representations.

The form of the relationship between similarities from different decades of corpora and similarities from different age groups will depend on how adults update their word meaning representations in response to experiencing new co-occurrence statistics. Figure 1 provides a schematic depiction of potential hypotheses about how meaning change at the level of the language could manifest in different patterns of correlations across age groups. Word meanings change over time, as shown by the turnover in nearest neighbors at different timepoints, and the time-period-specific meanings can be captured using historical word embeddings (Figure 1A). Different age groups will have experienced and learned different meanings of a word that changed (Figure 1B). Figure 1C illustrates three hypotheses about how meanings experienced at different points over the lifespan contribute to an individual’s current meaning representation: all experienced meanings are combined (averaged) and 1) those experienced earlier are weighted more heavily than recent experiences (H1: weighted mean towards early)¹, 2) all experienced meanings are weighted equally (H2: unweighted mean), and 3) recently experienced meanings are weighted more heavily (H3: weighted mean towards recent)².

Figure 1D simulates the correlations between the meaning representations of individuals from different age groups (e.g., as captured by similarity judgments for words and their neighbors) and the meanings captured in the collective usage patterns of different decades (e.g., semantic similarities of those same words and neighbors from embeddings trained on decade-specific corpora) according to the three hypotheses. According to H1, weighted mean towards early, similarity derived from behavioral data from older adults would be most strongly correlated with corpus similarities from earlier decades. According to H2, unweighted mean, the correlations for older adults would be strongest for decades in the middle of their lifespan. According to H3, weighted mean towards recent, the correlations for older adults would be strongest for the most recent decades. According to all three hypotheses, for young adults the correlations would be highest for recent decades.

STUDY 1

We aimed to quantify the extent to which diachronic changes in meaning/usage patterns explain age-related differences in lexico-semantic association networks. In order to relate diachronic corpus-based word embeddings and word association data, we used representational similarity analysis (RSA), which uses second-order isomorphic representational similarity matrices (RSMs) to abstract away from the original format of the data³. In particular, word embeddings (n-dimensional vectors) cannot be directly compared to word association data (cue-response pairs), therefore, we first compute pairwise similarities between words from each data source and represent them in RSMs, which can be directly correlated with one another (for a similar approach, see Ettinger & Linzen, 2016). The embedding-based RSMs are generated separately for each decade and are intended to capture the lexico-semantic information encoded in that decade’s language usage patterns (Harris, 1954; Lenci, 2018). RSMs were generated separately from the word association data from three age cohorts (younger, middle-age, and older adults) and are intended to capture the lexico-semantic representations of each age cohort; they are matrix equivalents of their semantic networks.

Figure 2 provides an overview of the analyses: first, we compared corpus-based RSMs across decades to quantify the extent of meaning change over time (Figure 2A); second, we compared the association-based RSMs across age groups to test the similarity of representations across different age cohorts (Figure 2B); third, we conducted RSA between the two sets of RSMs (Figure 2C).

Methods

Diachronic Word Embeddings.

We used the English diachronic word embeddings from HistWords (Hamilton et al., 2018) to construct RSMs for each decade from the 1900s to the 1990s, where each cell represents the cosine similarity between two word embeddings (Figure 2A). The embeddings in HistWords were generated using a Word2Vec model (Skip-gram negative sampling) trained on decade-level subsets of text corpora. We used the English Google N-gram All embeddings as the default because they performed best at predicting known historical changes (Hamilton et al., 2018), but we also replicated the analyses using the Google N-gram Fiction and COHA (Corpus of Historical American English; Davies, 2012) Lemma⁴ embeddings. The embeddings from different decades were aligned using an orthogonal Procrustes method, allowing for direct comparison within and between decades.

Lexical Association Networks.

We used English word association data from the Small World of Words (De Deyne et al., 2019) dataset to construct separate RSMs for each age cohort (Figure 2B). Between 2011 and 2018, De Deyne and colleagues collected up to 3 responses for 12,292 cues (such as “couple”, “plug”, “condense”). They had 88,722 participants in total (mean age = 36 years old, SD = 16 years, female = 38%). We split the data into three age cohorts (Younger adults: 20–35 years old, n = 14,346; Middle-aged adults: 35–50 years old, n = 11,361; Older adults: 50–90 years old, n = 6,035).

While the word association data in their original form (cue-response pairs) do not directly map onto similarities between words, De Deyne et al. (2019) developed techniques to estimate semantic similarity using the association data, which are used and described briefly here. First, semantic networks are generated from the group-aggregate word association data, by connecting words/nodes i and j if any person produced j in response to i as a cue. This semantic network is then converted to an adjacency matrix (where the rows and columns refer to the terms, and the values to the connection strength), which is then weighted using a positive point-wise mutual information (PPMI) transformation and row-normalized. Then, since the adjusted adjacency matrix corresponds to a random walk transition matrix, decaying random walks are used to add indirect links to a graph. This graph is then PPMI transformed and the values are normalized to conditional probabilities. Finally, cosine similarity is used on the rows from this matrix to calculate the similarity between words. The intuition behind this method is that the random walks capture the indirect relationships between words in addition to the direct links. More details can be found in De Deyne et al. (2016, 2019).

To ensure that there were enough responses per cue from the Small World of Words data, we additionally filtered each RSM to cues that had at least 15 responses per age cohort (M₂₀₋₃₅ = 55.1 responses per cue; M₃₅₋₅₀ = 43.3; M₅₀₋₉₀ = 32.3). Then, for both the HistWords-based and Small World of Words-based RSMs, we restricted the terms to those shared between both datasets, resulting in 6,886 terms⁵.

The word embeddings used for Study 1 are from Hamilton et al. (2018), and the association data are from De Deyne et al. (2019). Scripts for all analyses can be found at https://osf.io/q7j9n/overview?view_only=97f96afc9d614575af607fbc59836afe.

Results

Representational Similarity Analysis.

We first conducted RSA using Spearman rank correlation to compare the semantic spaces from HistWords across decades (Figure 3A–C), from Small World of Words across age cohorts (Figure 3D), and between the two sources for all decade-age pairs (Figure 2C; Figure 4). In these RSA plots, each cell represents the Spearman correlation between the off-diagonal upper-triangle similarity values from the two RSMs (Ritchie et al., 2017).

Figure 4. — (A) Spearman correlations (RSA) between the association-based and corpus-based RSMs over time, for each corpus-version of HistWords. Each line connects the correlation values for a specific age cohort. (B) Spearman correlation between the association-based and hypothesis RSMs, for each corpus-version of HistWords. Note that the scales differ for each facet of the graph.

Diachronic Word Meaning Change in Corpora.

For the corpus-based semantic organization from Google N-gram All (Figure 3A), the RSMs were most similar for adjacent decades (0.69 ≤ ρ ≤ 0.72; top value from each column), and as the temporal distance between decades increased, the similarity gradually decreased, with the 1900s and 1990s being the least similar (ρ = 0.5). A permutation test indicates that the temporal distance between decades is correlated with the ρ value between the RSMs of the same decades (ρ_{ngram all} = 0.63, 95% CI = [0.52, 0.72]). For HistWords embeddings trained on other corpora (N-gram Fiction and COHA Lemma), we find similar patterns of temporally close decades having higher correlation (ρ_{ngram fiction} = 0.46, 95% CI = [0.33, 0.59]; ρ_{coha lemma} = 0.59, 95% CI = [0.41, 0.71]). See Appendix A for further details.

For the N-gram Fiction embeddings (Figure 3B), the closest decades have the highest correlation except for the 1920s and 1950s, which are most correlated with the 1900s (two decades earlier) and 1920s (three decades earlier), respectively. RSMs from COHA Lemma embeddings (Figure 3C) were also more correlated when they were more temporally close, as with N-gram All, but the range of correlations was greater: the most similar RSMs were always for the adjacent decades (0.66 ≤ ρ ≤ 0.72), and the temporally distant decades were less similar, with the 1900s and 1990s also being the least similar (ρ = 0.39).

Change in Lexical-Semantic Representations Across Age Cohorts.

The results of all pairwise correlations between association-based RSMs from different age groups can be seen in Figure 3D. To measure the internal reliability and consistency of the responses for each age cohort subsets, we calculated the average correlation between two RSMs generated from five random splits (in half) of an age cohort’s association data. The internal reliability measures are along the diagonal of the heat-map. The association-based RSMs were less correlated than the HistWords-based RSMs overall (ρ ≈ 0.5) and had relatively low internal reliability (ρ ≈ 0.2–0.4). These values are likely attenuated because they require splitting the data in half. The internal reliability may be related to the quantity of data contributing to each similarity value. The 20–35 year old cohort had the highest average number of responses per cue (M = 55.1 responses per cue) and the highest internal reliability (ρ = 0.4), and the 50–90 year old cohort had the lowest average number of responses per cue (M = 32.3 responses per cue) and the lowest internal reliability (ρ = 0.24). These modest internal reliabilities place an upper bound on the potential correlations involving these data. No gradient change in similarity related to temporal distance in age cohorts was apparent; the correlation between various age groups was roughly around ρ ≈ 0.5, regardless of the age difference between the groups. A permutation test indicates that the correlation between differences in age and the correlations between RSMs (ρ_swow = 0.49) is within the 95% CI [−0.87, 0.87] for correlations obtained when age cohort labels are randomly shuffled (see Appendix A).

Similarity Between Semantic Representations From Diachronic Corpora and Age-Specific Association Data.

Figure 4A shows the results of correlations between RSMs for each decade and RSMs for each age cohort, across the different versions of HistWords. For N-gram All, RSMs from all decades were on average most strongly correlated with the youngest age group’s RSM and least correlated with the oldest age group’s RSM (ρ₂₀₋₃₅ = 0.26, ρ₅₀₋₉₀ = 0.23). The same was true for N-gram Fiction (ρ₂₀₋₃₅ = 0.17, ρ₅₀₋₉₀ = 0.15) and COHA Lemma (ρ₂₀₋₃₅ = 0.14, ρ₅₀₋₉₀ = 0.11), though the difference in correlation values between age groups was much smaller for N-gram fiction. Given that the internal reliability of a measure places an upper bound on any potential correlations with other measures, the lower correlations for the older age group RSMs may be explained by their lower internal reliability (see Figure 3D).

Further, for N-gram All, across all of the age cohorts, the correlations slightly increased as the decade of the corpus-based RSM became more recent. The association-based RSMs from all three age groups were more strongly correlated with the corpus-based RSM from the 1990s (ρ_{(20−35, 1990s)} = 0.28, ρ_{(35−50, 1990s)} = 0.27, ρ_{(50−90, 1990s)} = 0.24) than any other decade (e.g., ρ_{(20−35, 1900s)} = 0.25, ρ_{(35−50, 1900s)} = 0.24, ρ_{(50−90, 1900s)} = 0.22). The same was true for N-gram Fiction (ρ_{(20−35, 1990s)} = 0.24, ρ_{(20−35, 1900s)} = 0.15), though with greater magnitude and similar correlation values across the age groups. For COHA Lemma, as with N-gram All, the correlations only slightly increased towards more recent decades (ρ_{(20−35, 1990s)} = 0.15, ρ_{(20−35, 1900s)} ≈ 0.13), which was consistent across the age groups.

To test our hypotheses about how different meanings might be weighted over a person’s lifespan, we created three weighted combinations of corpus-based RSMs from different decades. The first hypothesis-RSM was a weighted mean towards earlier, in that the similarity values were averaged together, with the earlier decades having a higher weight. The next hypothesis-RSM was an unweighted mean, in that each decade was given equal weighting. The last hypothesis-RSM was a weighted mean towards recent, with later decades having a higher weight. The weights were calculated using weight = 0.5^x, where x is a decreasing increment for the weighted mean towards early hypothesis or an increasing increment for the weighted mean towards recent hypothesis. The weights are normalized before calculating the mean RSMs (see Figure 1C for illustration).

Figure 4B shows the RSA for the hypothesis-RSMs and the association-based RSMs from each age cohort. Overall, the averaging (regardless of weighting) seems to increase the correlation between the corpus-based and association-based representations; the association-based RSMs are more correlated with the hypothesis RSMs than any of the single decade RSMs. The 20–35 y.o. age group also has the highest correlation, followed by the 35–50 y.o. group and the 50–90 y.o. group. For the N-gram All versions of HistWords, the unweighted mean and weighted mean towards recent hypothesis-RSMs have the highest correlation with association-based RSMSs (weighted mean towards recent: ρ₂₀₋₃₅ = 0.311; unweighted mean: ρ₂₀₋₃₅ = 0.314), and the weighted mean towards early hypothesis-RSM has the lowest correlation (ρ₂₀₋₃₅ = 0.29). For the N-gram Fiction version of HistWords, the weighted mean towards recent hypothesis-RSM has the highest correlation (ρ₂₀₋₃₅ = 0.261), followed by the unweighted mean (ρ₂₀₋₃₅ = 0.257) and weighted mean towards early hypothesis-RSMs (ρ₂₀₋₃₅ = 0.23). In COHA Lemma, across age cohorts, all three hypothesis-RSMs are similarly correlated with the association-based RSM, though the recent-weighted hypothesis-RSM is slightly lower (weighted mean towards early: ρ₂₀₋₃₅ = 0.155; weighted mean towards recent: ρ₂₀₋₃₅ = 0.158; unweighted mean: ρ₂₀₋₃₅ = 0.16).

Ablation analysis of the relationship between semantic representations from diachronic corpora and age-specific association data.

To further quantify the relative predictive ability of each decade’s similarity values for each age cohort’s word similarities, we used a linear model to predict each age cohorts’ RSM as a combination of the decade-level RSMs and iteratively ablated each decade to quantify its impact via model comparison.

The equation for the full (non-ablated) model was as follows:

{sim}_{swow} = β_{0} + (β_{1} * {sim}_{1900}) + \dots + (β_{10} * {sim}_{1990}) + ε

where sim_swow refers to a similarity value from a given age cohort’s RSM, and sim_year refers to the similarity from a given decade’s RSM, derived from HistWords (from the 1900s through the 1990s).

Figure 5 shows the average difference in AIC, Log-likelihood, and R² between the full and ablated models across 5 cross-validation folds. The most recent decades have the largest impact on the models’ ability to predict the association-based similarity for all age groups and versions of HistWords. Further, the impact of the most recent decades appears largest for the youngest age cohort and less for the older age cohort.

With the exception of ablating the 1930s or 1940s for the 35–50 year old group, F-tests comparing the model fit of the full and ablated models showed that the full model always predicts the association-based similarity values significantly better than the ablated models (ps < 0.001). Appendix B includes the details for the ablation, and the tables with the AIC, log-likelihood, R² values, and F-test statistics can be found in the Supplementary Materials.

Discussion

Corpus-based semantic similarity spaces changed gradually over time. Importantly, non-trivial changes occurr within the lifespan of an older adult living in the 2000s: the correlation between similarities from the 1990s and 1980s was ρ = 0.72, whereas the correlation between the 1990s and 1940s was ρ = 0.57. Nonetheless, the minimum correlation was ρ ≈ 0.5, indicating that while there may be broad changes in linguistic meaning over time, there is substantial stability as well.

In contrast, the association-based semantic similarity spaces appeared somewhat consistent across age cohorts. This may be in part because the cue words selected for Small World of Words were intentionally chosen to be high frequency words with an early average age of acquisition − properties that make a word less likely to change over time (Hamilton et al., 2018). Similarity to the corpus-based meanings decreased with age regardless of the corpus decade, with the 20–35-year-olds having the highest correlation, and the 50–90 years old having the lowest. This reduced correlation with representations derived from corpora is consistent with more idiosyncratic knowledge of the language in older adults. However, caution is warranted in interpreting this result because the internal reliability of the RSMs from older adult data was also lower than that of RSMs from younger adult data, likely due to a smaller average number of responses per cue in the former.

Across age cohorts, association-based semantic representations were most correlated with, and best predicted by, the 1990s corpus-based semantic representations. For all age groups, lexico-semantic representations appear to be best captured by usage patterns from the most recent decades. The results were largely invariant to the choice of embeddings, though the magnitude of the temporal trend was smaller when using the N-gram All and COHA Lemma embeddings (The latter of which had the lowest performance at identifying known changes in meaning; Hamilton et al., 2018). Further, the correlation patterns of the hypothesis-RSMs differed across embedding versions, with the unweighted mean and weighted mean towards recent having the highest correlation for N-gram All and N-gram Fiction respectively, and the weighted mean towards early and unweighted mean having the highest correlation in COHA Lemma, depending on the age group.

In sum, the results are most consistent with language users continuing to update their knowledge of word meanings across the lifespan as they experience a changing language. Moreover, recently experienced meanings appear to play a larger role in an individual’s semantic representations than meanings experienced earlier in their life.

STUDY 2

The dataset of word associations used in Study 1 yielded valuable insights but was limited in multiple ways: the behavioral responses were based on associations as opposed to relatedness (which is what the similarities of HistWords embeddings are thought to represent), the sample sizes across age bins were uneven, and the internal reliabilities of the association-based RSMs were modest. Moreover, the cue words for the Small World of Words dataset were selected to maximize the number of associates produced by participants of all ages. These cues may not be optimal for studying the interaction between language change and age-related differences in lexico-semantic representations.

Therefore, in Study 2, we collected explicit word relatedness judgments (following Gerz et al., 2016; Hill et al., 2015), from both young (18–33 years old) and older adults (63–92 years old), comparing words that changed in their usage/meaning over the approximate lifetime of the older adults (1950 to 2000) to those that didn’t.