Significance
Schizophrenia is a debilitating neuropsychiatric disorder whose core clinical features are thought to reflect abnormalities in internal conceptual representations (“cognitive maps”). The current work provides a language-based computational assay of conceptual disorganization in schizophrenia and relates this to neural signatures of cognitive map representation measured using magnetoencephalography (MEG). At a behavioral level, patients with schizophrenia showed reduced semantically guided word sampling during a verbal fluency task (a marker of “looser” conceptual organization). At a neural level, between-participant variance in this effect correlated with the strength of an MEG signature of hippocampal ripple power (measured in a separate task), known to be involved in cognitive map stabilization. These findings shed light on the neural basis of semantic representation in schizophrenia.
Keywords: cognitive map, psychosis, hippocampal replay, sharp wave ripple, natural language processing
Abstract
Human cognition is underpinned by structured internal representations that encode relationships between entities in the world (cognitive maps). Clinical features of schizophrenia—from thought disorder to delusions—are proposed to reflect disorganization in such conceptual representations. Schizophrenia is also linked to abnormalities in neural processes that support cognitive map representations, including hippocampal replay and high-frequency ripple oscillations. Here, we report a computational assay of semantically guided conceptual sampling and exploit this to test a hypothesis that people with schizophrenia (PScz) exhibit abnormalities in semantically guided cognition that relate to hippocampal replay and ripples. Fifty-two participants [26 PScz (13 unmedicated) and 26 age-, gender-, and intelligence quotient (IQ)-matched nonclinical controls] completed a category- and letter-verbal fluency task, followed by a magnetoencephalography (MEG) scan involving a separate sequence-learning task. We used a pretrained word embedding model of semantic similarity, coupled to a computational model of word selection, to quantify the degree to which each participant’s verbal behavior was guided by semantic similarity. Using MEG, we indexed neural replay and ripple power in a post-task rest session. Across all participants, word selection was strongly influenced by semantic similarity. The strength of this influence showed sensitivity to task demands (category > letter fluency) and predicted performance. In line with our hypothesis, the influence of semantic similarity on behavior was reduced in schizophrenia relative to controls, predicted negative psychotic symptoms, and correlated with an MEG signature of hippocampal ripple power (but not replay). The findings bridge a gap between phenomenological and neurocomputational accounts of schizophrenia.
Schizophrenia is a common and debilitating neuropsychiatric disorder. The condition has a lifetime prevalence of 0.7% and remains a leading cause of years lived with disability (1–3). Core clinical features include conceptual disorganization, difficulties in abstract reasoning, and reduced language coherence (i.e., “formal thought disorder”) (4, 5), which together predict poor social functioning (6). A longstanding notion is that these features reflect an abnormality in association-guided cognition (7, 8), where associations reflect relationships between memories, concepts, or objects in the world (“relational knowledge”) (9–11).
The question of how abstract relational knowledge is represented in the brain has gained renewed prominence in cognitive neuroscience, with convergent evidence identifying a role for hippocampal–entorhinal (HEC) cortex. This is exemplified by rodent findings showing hippocampal place and grid cells encode relationships between spatial locations during navigation (12), as well as during rest-period neural “replay” (i.e., sequential place cell reactivation that reflects spatial proximity, expressed during high-frequency hippocampal “ripple” oscillations) (13). Of relevance to psychopathology are findings that HEC also encodes associations between states in more abstract domains (e.g., associations between pictures in a task, or semantic relationships between words) (14–19), indicating that HEC supports a more domain-general “cognitive mapping” function (14, 15) spanning conceptual and semantic “spaces.” Thus, when people perform word recall tasks, HEC theta power covaries with “semantic proximity” between sequentially recalled words (20), while the order of word selection in verbal fluency tasks can be captured using models initially devised to describe behavior in foraging animals (akin to “mental navigation” in a semantic space) (21, 22).
Given a proposed function of HEC in domain-general cognitive mapping, we hypothesized that neural replay and ripple power would relate to patterns of spontaneous word selection during verbal fluency. The underlying intuition is that the stream of consciousness can be construed as a trajectory through a “semantic space” (23, 24), where the sequential sampling of concepts is more or less constrained by some notion of semantic “proximity.” Speculatively, abnormalities in this neural process could also relate to disrupted semantic sampling patterns in patients (a candidate behavioral assay of formal thought disorder). Intriguingly, schizophrenia is associated with abnormalities in both hippocampal replay and ripple oscillations, both in genetic mouse models (25–27) and human subjects (11), but the relationship to conceptual sampling and language metrics is unknown.
Here, we use computational modeling of verbal fluency data to derive measures of semantically structured conceptual sampling in patients with a diagnosis of schizophrenia (PScz) and intelligence quotient (IQ)-matched control participants. We leverage advances in Natural Language Processing (NLP) machine learning tools to provide a quantification of semantic associations between words, an approach increasingly applied to the study of natural language and semantic processing both in psychiatry (9, 28–32) and cognitive neuroscience (20, 33–39). We then relate model-derived measures of semantic sampling to magnetoencephalography (MEG) signatures of neural replay and replay-associated ripple oscillations in the same participants, measured in a rest session following a separate task. Our hypothesis was that schizophrenia is linked to behavioral impairments in semantically guided conceptual sampling and that these relate to neural signatures of structured memory reactivations during rest (replay and ripples) considered to emanate from HEC.
Results
Trajectories through Semantic Space.
A total of 52 participants [26 PScz (13 not taking any psychiatric medication) and 26 control participants, see SI Appendix, Table S1] completed two verbal fluency tasks, wherein they were asked to name as many words as they could either “belonging to the category ‘animals’” (category fluency) or “starting with the letter ‘p’” (letter fluency) in 5 min. For each task, we used a fastText (Facebook AI Research) pretrained NLP word embedding model (40) to quantify the semantic association (proximity) between each pair of response items in terms of cosine similarity (1-cosine distance) (Fig. 1).
In the “category” task, control participants exhibited lower mean semantic distances (i.e., increased semantic similarity) between consecutively generated items compared to PScz [control: mean semantic distance = 0.53 ± 0.003 (SEM), PScz: mean semantic distance = 0.55 ± 0.005, z(50) = −2.59, P = 0.01, Wilcoxon rank sum test, two-tailed]. This group difference was not present for the “letter” task [control: mean semantic distance = 0.643 ± 0.006, PScz: mean semantic distance = 0.636 ± 0.006, t(50) = 0.87, P = 0.39, two-sample t test, two-tailed, Fig. 2]. A group * task ANOVA confirmed a significant interaction effect on mean consecutive semantic distance [F(50, 1) = 5.44, P = 0.024], in addition to a significant main effect of task [F(50, 1) = 456.5, P < 0.001], with no main effect of group [F(50, 1) = 0.20, P = 0.66].
To better characterize the influence of semantic association strength on response item selection, for each participant, we next defined measures of path optimality by comparing the observed item sequence to an “optimal” sequence. This optimal sequence was defined as the sequence which visits each item exactly once in an order that minimizes the total semantic distance traveled (i.e., the shortest Hamiltonian path defined on a complete graph where the weight of an undirected edge connecting any two items corresponds to their cosine similarity, and where this path is derived using a modified “Travelling Salesman” algorithm. Fig. 3A). We defined “global optimality divergence” as the difference between the observed and optimal total path distance. We expressed this metric as a z-score with respect to participant-specific shuffled item lists, thus controlling for variation in item sets between participants (see SI Appendix for details).
In the category task, global optimality divergence was lower for control participants compared to PScz [control: mean −7.02 ± 0.23, PScz: mean = −5.81 ± 0.40, z(50) = −2.19, P = 0.03, Wilcoxon rank sum test, two-tailed, Fig. 3B, negative values reflect increasingly optimal word selection processes and 0 indicates random word selection]. This measure correlated with task performance [total number of words generated, rho(50) = −0.49, P = 0.002, Spearman’s rank correlation, Fig. 3C], with the observed association being preserved even when controlling for an effect of group on list length in a multiple regression analysis [list length ~ group * optimality divergence multiple regression: = −2.71 ± 1.10, t(48) = −2.47, P = 0.017. = 0.41 ± 15.1, t(48) = 0.27, P = 0.98. = −0.55 ± 2.19, t(48) = −0.25, P = 0.80]. All of these results were robust when considering only the first 32 items from each participant’s list (the length of the shortest list over participants), demonstrating that effects are not explained by differences in list length between participants.
We also defined a related “local optimality divergence” metric as the degree to which optimal and observed trajectories are aligned at the level of individual item–item transitions (see SI Appendix for details). Using this measure, control participants also exhibited greater similarity to optimal path trajectories than PScz [controls: mean = −3.77 ± 0.18, PScz: mean = −2.83 ± 0.26, t(50) = −2.94, P = 0.005, two sample t test, two-tailed, Fig. 3D].
To test the sensitivity of these findings to task context, we repeated this analysis on data from the letter fluency task (“name all words beginning with ‘p’”). This task does not require explicit consideration of semantic information and performance is reported to be less affected in psychosis (42–44). In keeping with these prior findings, we found no group differences in path optimality measures nor any correlation with overall task performance (Fig. 3 E–G).
Trajectories through Semantic Communities.
Given the group differences seen in the category task, we next examined the degree to which behavior was sensitive to mesoscale structure of the semantic space (“animal space”), which we partitioned into “semantic communities” using a data-driven agglomerative clustering procedure (Fig. 4A, see SI Appendix for details). While category fluency behavior in both PScz and control participants was sensitive to the identified community partition, this sensitivity was again reduced in PScz. Specifically, compared to controls, PScz lists displayed shorter uninterrupted stretches of items within the same community [mean community “lifetime” in controls expressed as z-score with respect to participant shuffled lists = 9.25 ± 0.80, PScz = 6.32 ± 0.80, z(50) = 2.65, P = 0.008, Wilcoxon rank sum test, two-tailed, Fig. 4D] and an increased propensity to return to a community after it had already been visited [number of returns in controls expressed as z-score with respect to participant shuffled lists = −4.92 ± 0.22, PScz = −3.78 ± 0.31, t(50) = −3.00, P = 0.004, two sample t test, two-tailed, Fig. 4E].
Trajectories through Orthographic Space.
Non-semantic word relationships have also been shown to affect performance in learning, memory, and language tasks (46, 47). We therefore conducted an analogous trajectory analysis using a non-semantic distance metric sensitive to the letter-by-letter similarity between words [orthographic or “Levenshtein” distance (48)]. As expected, orthographic distances between consecutive items were lower in the letter task compared to the category task, and the degree of orthographic clustering (global optimality divergence) predicted performance in the letter task alone. However, we found no group differences in optimality divergence defined using orthographic distance in either task (see SI Appendix, Fig. S3).
Modeling Semantic Retrieval as Local Search.
To ascertain the contribution of semantic association on word selection with finer granularity we next turned to computational modeling of behavior and formally tested for a group * task interaction effect in semantic association strength. In the models considered, for response item , a single word is selected from the set of unique words generated across all participants, and the probability of emitting any given word from this set is a function of the semantic and/or orthographic proximity to response item , multiplicatively scaled by a salience ( ) parameter. The winning model (identified in formal model comparison) allowed for an arbitration between semantic and orthographic association strength in the word selection process and modeled this trade-off independently in category and letter tasks for each participant (i.e., four free parameters: a pair of semantic and orthographic salience parameters for category [ & ] and letter tasks [ & ] separately (Fig. 5A). (See SI Appendix for mathematical details, model comparison, parameter recovery and fitted parameter correlations.)
The fitted model parameters recapitulated the model-agnostic results (above), revealing main effects of task condition on the relative weighting between semantic and orthographic similarities (semantic > orthographic for category task, and orthographic > semantic for letter task). Crucially, in the category task, PScz and control participants showed a significant difference in semantic salience alone [ t(50) = 2.86, P = 0.01, two sample t test, two-tailed], with no group difference for orthographic salience [ z(50) = −0.32, P = 0.75, Wilcoxon rank sum test, two-tailed]. Groups did not differ on either salience parameter in the letter task [ t(50) = −0.50, P = 0.62 and z(50) = 1.02, P = 0.31]. Accordingly, group * task ANOVA on the semantic salience parameter confirmed a significant interaction ( : group * task P = 0.005), with no such effect for the orthographic salience parameter [ : group * task interaction P = 0.34] (Fig. 5B).
As the winning model captured a trade-off between semantic and orthographic saliencies, this enabled us to ascertain the degree to which semantic salience exceeds orthographic salience for each condition ( ), as well as quantify the degree to which this difference metric changes as a function of task context ( , which we term “goal-induced semantic modulation”). Thus, captures a tendency to modulate the associative information guiding memory search in line with task demands. was significantly correlated with both “global” and “local” semantic optimality divergence measures in the category task [global: rho(50) = −0.56, P < 0.001; local: rho(50) = −0.60, P < 0.001, Spearman’s correlation].
As expected, goal-induced semantic modulation ( ) differed between groups [controls: mean = 5.17 ± 0.37, PScz: mean = 3.63 ± 0.40, t(50) = 2.82, P = 0.01, two sample t test, two-tailed], and predicted performance [rho(50) = 0.31, P = 0.03, Spearman’s correlation, Fig. 5C], with no group difference in this latter relationship [list length ~ group* multiple regression: = 1.62 ± 0.87, t(48) = 1.86, P = 0.07. = 11.2 ± 8.56, t(48) = 1.31, P = 0.20. = −2.32 ± 1.74, t(48) = −1.33, P = 0.19].
Relationship to Symptoms and Cognitive Variables.
We found an inverse relationship between total negative symptom score (49) and goal-induced semantic modulation [ , rho(24) = −0.42, P = 0.03, Spearman’s correlation, Fig. 5D]. This was driven by symptom items that relate to “negative thought disorder” (5) (“N5: difficulty in abstract thinking,” rho = −0.47, P = 0.015, “N6: lack of spontaneity and flow of conversation,” rho = −0.43, P = 0.028), which capture difficulties with proverb understanding, semantic similarity assessments, and abstract vs. concrete modes of reasoning. We found no correlation with total positive psychotic symptoms [rho(24) = −0.08, P = 0.69, including the “P2: conceptual disorganization” item, rho(24) = −0.07, P = 0.73], nor with total depressive symptoms [rho(24) = −0.15, P = 0.46, Spearman’s correlation]. There was no monotonic relationship between and other measured cognitive variables in PScz [i.e., working memory capacity on digit span task: rho(24) = 0.18, P = 0.38; estimated full scale IQ: rho(24) = 0.12, P = 0.56, Spearman’s correlation].
Relationship to Neurophysiological Signatures of Structured Memory Replay.
The functions of HEC include a strong link to associative cognition (15, 17, 18, 20, 50–53). This motivated us to test the relationship between goal-induced semantic modulation ( ) and a neural index of associative memory reactivation, thought to originate from HEC. Here, we used resting-state MEG data collected from the same sample of participants from an experimental session conducted on another day (typically, the day after verbal fluency). The full MEG session involved a sequence learning task, wherein participants were tasked to learn the sequential relationships between eight task pictures and maintain this knowledge during a 5-min post-task resting-state scan (11). Using multivariate neural decoding, we previously showed that in this post-task resting MEG data, whole-brain activity patterns reflect spontaneous neural replay of learned task sequences (from the preceding task), where putative “replay onsets” are coincident with transient increases in high-frequency power (ripple band, 120 to 150 Hz) emanating from HEC sources (Fig. 6A) (11, 18). Preclinical studies show that these replay-associated ripple events are causally implicated in associative learning and cognitive map stabilization (54–58).
We correlated behavioral measures of semantic clustering during verbal fluency (i.e., ) and MEG-derived measures of replay and ripples across participants, reasoning that the latter might index a trait-level capacity for associatively-structured memory reactivations. Across all participants, we found a positive relationship between goal-induced semantic modulation ( ) and MEG-derived measure of replay-associated ripple power, with no group difference in the slope of this relationship [ripple ~ group * multiple regression: = 0.16 ± 0.047, t(47) = 3.39, P = 0.001. = −0.48 ± 0.47, t(47) = −1.03, P = 0.31. = −0.06 ± 0.09, t(47) = −0.67, P = 0.51]. (Here, “replay associated ripple power” is defined as the peak power increase in the range 120–150 Hz within 50 ms of a replay onset [±10 ms], averaged over all MEG sensors, as in ref. 11.) Thus, participants exhibiting the greatest ripple power at the time of putative replay showed a greater tendency to use semantic information to guide word selection in a context-sensitive manner. The correlation between ripple power and was also significant in PScz alone [rho(23) = 0.57, P = 0.003, Spearman’s correlation, Fig. 6B]. We note that this same MEG signature of ripple power was previously shown to correlate with the strength of a neural representation of learned task structure (11).
To probe this brain–behavior relationship further, we tested the association between replay-associated ripple power and model-agnostic behavioral measures (i.e., semantic and orthographic path optimality measures from both tasks). Replay-associated ripple power was linearly related to semantic path optimality in the category fluency task [global optimality divergence: = −0.24 ± 0.062, t(47) = −3.87, P = 0.0003; local optimality divergence: = −0.30 ± 0.087, t(47) = −3.44, P = 0.001; results from ripple ~ group * multiple regression across all participants]. However, there was no significant relationship between ripple power and semantic path optimality in the letter task, nor in either task when considering path optimality defined using an orthographic distance metric. This indicates that a primary contributor of the brain-behavior correlation seen with stems from variance in semantically constrained sequencing.
We found no correlation between and our previously reported measure of spontaneous replay, where the latter quantifies the degree to which resting-state MEG data manifest a temporal ordering of neural state reactivations that recapitulates an inferred task structure at 40 to 50 ms lag (11) [replay ~ group * multiple regression: = 0.05 ± 0.14, t(47) = 0.38, P = 0.71. = 3.09 ± 1.37, t(47) = 2.26, P = 0.03. = −0.54 ± 0.29, t(47) = −1.95, P = 0.06].
In a control analysis, we found no significant relationship between and replay-associated alpha power [defined identically to replay-associated ripple power, but using the alpha frequency band, 8 to 13 Hz] [alpha ~ group * multiple regression: = −0.051 ± 0.029, t(47) = −1.74, P = 0.09. = 0.041 ± 0.288, t(47) = 0.143, P = 0.89. = 0.031 ± 0.059, t(47) = 0.52, P = 0.61]. Using a similar multiple regression approach, we also found no significant main effect of on dynamical properties of ripples during MEG rest data, where the latter includes mean (and SD) duration of ripple events (defined as epochs exhibiting high ripple power) and mean (and SD) time separating ripple events (see SI Appendix for details).
Relationship to Medication Status.
PScz taking and not taking dopamine 2/3 receptor antagonist (antipsychotic) medication did not differ in [PScz on medication: 3.57 ± 0.60, off medication: 3.70 ± 0.55, z(24) = −0.51, P = 0.61, Wilcoxon rank sum test, two-tailed], total negative symptom severity [total PANSS negative symptom score, PScz on medication: 14.5 ± 1.64, off medication: 12.7 ± 1.92, z(24) = −1.11, P = 0.27, Wilcoxon rank sum test, two-tailed], mean number of response items (expressed as context-specific rank) [PScz on medication: 25.2 ± 2.94, off medication: 24.3 ± 3.64, t(24) = 0.201, P = 0.84, two sample t test, two-tailed], nor replay-associated ripple power [PScz on medication: 2.39 ± 0.18, off medication: 2.47 ± 0.19, t(24) = −0.304, P = 0.76, two sample t test, two-tailed].
Discussion
At the behavioral level, we present convergent evidence that schizophrenia is associated with reduced semantically guided memory sampling during a category fluency task, even after controlling for between-participant differences in list length or contents (e.g., vocabulary). Using computational modeling, we show that the relative weighting of semantic and orthographic information on task performance is highly sensitive to task goal (context), and that the degree of this “goal-induced semantic modulation” ( ) is reduced in schizophrenia. Strikingly, between-participant variance in correlated with an expression of symptom severity that relates to abstract conceptual reasoning and a “negative” dimension of formal thought disorder (5).
Our work avails of novel NLP tools for automated analysis of speech. Such tools have previously been applied to speech samples from patients with psychosis to derive computational metrics with diagnostic and prognostic utility (9, 28–31, 59–66). For example, measures of semantic coherence (e.g., cosine distances between consecutive words, embedded in semantic space) can potentially be used to differentiate PScz and healthy controls (30, 43, 67–70), automate identification of formal thought disorder in PScz (43, 67, 71, 72), and predict transition to psychosis in those experiencing prodromal symptoms (59, 61). Notwithstanding these proof-of-concept findings, single studies tend to test a variety of hypotheses, and the pattern of statistically significant findings is often inconsistent between studies. Studies also differ in approaches for operationalizing semantic coherence, elicitation of speech data, and clinical assessment of thought disorder, impeding knowledge synthesis (31, 73).
Our study departs from the aforementioned research in that we test a neurocognitive hypothesis where we propose that differences in verbal fluency relate to cognitive and neural processes that underpin associative (relational) cognition. Thus, in addition to our behavioral findings, we find a relationship between and an MEG measure of replay-associated ripple power, where the latter is measured during a rest session following a separate sequence learning task. We consider this intriguing given an established role for both HEC and hippocampal ripples in associative memory and reasoning across diverse task domains (e.g., spatial memory, non-spatial structure learning, semantic retrieval) (10, 14–16, 18, 19, 50, 53, 74, 75). Although our cognitive and neural measures were derived from different task paradigms (semantic memory search and non-spatial sequence learning, respectively), under an HEC “cognitive map” hypothesis they are thought to engage similar neural circuits that support reasoning about the relationships between entities (14, 15). However, we acknowledge that any claim of functional significance or cognitive specificity needs to be tempered, given the correlational and indirect nature of the findings.
We did not find any monotonic relationship between and neural replay across participants, where replay might be considered a more specific measure of sequential neural reactivation (compared to ripples). Of note, both replay and ripples were measured during a rest period with reference to a separate inference task, typically acquired on a different day to verbal fluency data. Although we caution against overinterpretation of the null replay result, one speculation is that expression of replay within an individual might be particularly sensitive to behavioral context (i.e., behavioral “state”), whereas a propensity to generate ripples might reflect a more “trait-like” participant property (related to HEC circuit organization) that underpins computations common to divergent tasks. In our context, the differing cognitive demands of verbal fluency and sequence learning tasks might engender qualitatively different sequence generation (i.e., replay) regimes in HEC (53) (“diffusive” in verbal fluency, vs. fixed sequential in sequence learning), such that mechanisms of sequence generation underlying performance of one task are not readily transferrable to those underlying another. Future experiments involving measurement of ripples and replay across multiple tasks in the same participants are required to test this hypothesis.
We found a correlation between “goal-induced semantic modulation” ( ) and negative symptoms. Some studies have reported a significant correlation between related semantic coherence measures and positive symptoms, including formal thought disorder (43, 67, 71, 72). However, such findings are not universally reported, and, as noted above, previous studies have used a variety of task contexts to elicit verbal data, complicating interpretation given the task-dependent effects on NLP speech metrics reported both in the present work and elsewhere (30, 66). Moreover, in contrast to the present work, many previous studies that use category fluency tasks (43, 67, 69, 71, 72) adopt short task durations (mean list length <20 words), have not controlled for between-subject differences in recalled items, and do not address questions of specificity for semantic associations and sensitivity to task context.
Limitations of the present work include the fact that we assume that the pre-trained word embedding model, fixed for all participants, approximates internal semantic representations in PScz and control participants to a similar degree. Here, we note that our measure of semantic similarity is equivalently able to predict inter-item retrieval times in PScz and control samples (Fig. 4B and SI Appendix). Second, our computational model, while accounting for participant differences in vocabulary, does not address potential influences of non-specific cognitive processes such as working memory or self-monitoring. Importantly, however, we find no correlation between cognitive trait variables and fitted model parameters. Our model postulates that word selection unfolds as a “local search” in semantic space. Other hypotheses include a random walk on a semantic graph (76) or a two-stage process alternating between a local search and “patch switch” policy (21, 22, 43, 69). The latter two-stage modeling approach necessitates further analytic decisions regarding how patch switches are defined, complicating interpretation. Notwithstanding this concern, in SI Appendix, we show that our behavioral findings remain significant after amending our model to mimic such a two-stage process. More generic limitations include the correlational nature of our analyses, the fact that behavioral and neural measures were acquired at different times, and a relatively small sample.
In summary, our findings offer tentative support for an hypothesis that some symptoms of schizophrenia may reflect a dysregulation of a neurocognitive process involving structured conceptual representations (cognitive maps). We anticipate that future studies, combining concurrent functional neuroimaging, language tasks, and rapidly advancing NLP word-embedding tools, will continue to shed light on the neural coding schemes that support such associative cognition.
Materials and Methods
Participants and Assessment.
The study was approved by the London Westminster NHS Research Ethics Committee (15/LO/1361). All participants provided written informed consent and were compensated for their time. Participants attended for two visits (verbal fluency behavioral session, and MEG task session, approximately 1 d apart) (11). The final sample for the verbal fluency analysis comprised 52 participants [26 PScz (6 female, range 18 to 45 y, 13 not taking any psychiatric medication) and 26 control participants (6 female, range 18 to 45 y), see SI Appendix, Table S1]. One patient participant declined MEG owing to paranoia. See SI Appendix for full details.
Verbal Fluency Task and Analysis.
On visit 1, participants completed two verbal fluency tasks (order randomized), wherein they were asked to name as many words as they could either belonging to the category “animals” (category fluency) or starting with the letter “p” (letter fluency) in 5 min. We used a fastText (Facebook AI Research) pretrained NLP word embedding model (40) to quantify the semantic association (proximity) between each pair of words in terms of cosine similarity/distance (Fig. 1A). We defined an analogous (non-semantic) measure of orthographic (letter-by-letter) association as the Levenshtein distance (48).
For both conditions, we quantified the degree to which the sequence of emitted words (the “trajectory through semantic space”) approximated the optimal sequence that minimizes total traversed semantic (or orthographic) distance [i.e., a modified Travelling Salesman optimization problem (77), see SI Appendix] (Fig. 3A). We defined “global optimality divergence” as the difference between the observed and optimal path total distance, and “local optimality divergence” as the degree to which optimal and observed trajectories are aligned at the level of individual item-item transitions. Crucially, we express both effects as a z-score with respect to a participant-specific permutation-derived null distribution, thus controlling for participant differences in vocabulary and list length [i.e., z-scores of 0 indicate a random word search selection process, which is insensitive to semantic (or orthographic) associations; negative z-scores reflect word selection processes that are predictable given such associations]. See SI Appendix for further discussion and mathematical details.
To gain insight into processes driving task performance, for each participant, we fitted a family of generative computational models to the concatenated word list data from both tasks using maximum likelihood estimation (as in ref. 21). We identified the winning model using Akaike Information Criteria (AIC) at the group level (78). See SI Appendix for mathematical details, model comparison, and parameter recovery.
MEG Task and Analysis.
On visit 2, participants completed a separate sequence learning task during MEG, as described in our previous report (11). Briefly, during MEG participants learned sequences containing eight task pictures, before completing a 5-min resting-state MEG scan. We previously identified time points in MEG data form this post-task rest scan that exhibited high evidence of sequential neural replay (i.e., spontaneous neural reactivations of task state representations that recapitulated the task transition structure, occurring at 40 ms lag) using temporally delayed linear modeling (TDLM) (79). Such time points were associated with an exuberance of high frequency (120 to 150 Hz, “ripple”) power, identified as originating from HEC sources in a beamforming analysis (11). In the present work we extract, for each participant, a single estimate of replay strength (evidence of replay at 40 to 50 ms replay time lag) and replay-associated ripple power [peak power in 120 to 150 Hz band, in the 0 to 50 ms (±10 ms) window following putative replay events], for all brain–behavior correlations. These MEG measures are identical to those reported in our previous MEG replay study (11). See SI Appendix for full details of MEG task and analysis.
Statistical Analysis.
We used parametric and non-parametric statistical tests for normally and non-normally distributed variables, respectively (normality was tested using a Shapiro–Wilk test). Effects are reported as mean ± 1 standard error of mean (SEM), and two-tailed P < 0.05 is deemed significant throughout. See SI Appendix for further details.
Supplementary Material
Acknowledgments
We thank Dr. Atheeshaan Arumuham (King’s College London) for help with patient recruitment, and Professor Zeb Kurth-Nelson (DeepMind) for discussions on task design and analysis methods.
Author contributions
M.M.N. and R.J.D. designed research; M.M.N. performed research; M.M.N., D.C.M., and Y.L. contributed new reagents/analytic tools; M.M.N. analyzed data; and M.M.N. and R.J.D. wrote the paper.
Competing interests
The authors declare no competing interest.
Footnotes
This article is a PNAS Direct Submission.
Data, Materials, and Software Availability
Analysis code and data to reproduce the results in the paper will be made available at github.com/matthewnour/verbal_fluency_trajectories and github.com/YunzheLiu/TDLM. The manuscript additionally relates behavioral measures to MEG data originally presented in a previous study: (11).
Supporting Information
References
- 1.Jauhar S., Johnstone M., McKenna P. J., Schizophrenia. Lancet 399, 473–486 (2022). [DOI] [PubMed] [Google Scholar]
- 2.Vigo D., Thornicroft G., Atun R., Estimating the true global burden of mental illness. Lancet Psychiatry 3, 171–178 (2016). [DOI] [PubMed] [Google Scholar]
- 3.Weye N., et al. , Register-based metrics of years lived with disability associated with mental and substance use disorders: a register-based cohort study in Denmark. Lancet Psychiatry 8, 310–319 (2021). [DOI] [PubMed] [Google Scholar]
- 4.Casey P., Kelly B., Fish‘s Clinical Psychopathology: Signs and Symptoms in Psychiatry (Cambridge Medicine, 3rd edition, 2007). [Google Scholar]
- 5.Kircher T., Bröhl H., Meier F., Engelen J., Formal thought disorders: From phenomenology to neurobiology. Lancet Psychiatry 5, 515–526 (2018). [DOI] [PubMed] [Google Scholar]
- 6.Marggraf M. P., Lysaker P. H., Salyers M. P., Minor K. S., The link between formal thought disorder and social functioning in schizophrenia: A meta-analysis. Eur. Psychiatry 63, e34 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bleuler E., Dementia Praecox or The Group of Schizophrenias (Dementia Praecox oder Gruppe der Schizophrenien)1911 (International University Press, 1911). [Google Scholar]
- 8.Moskowitz A., Heim G., Eugen Bleuler’s Dementia praecox or the group of schizophrenias (1911): A centenary appreciation and reconsideration. Schizophrenia Bull. 37, 471–479 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Corcoran C. M., Cecchi G. A., Using language processing and speech analysis for the identification of psychosis and other disorders. Biol. Psychiatry Cogn. Neurosci. Neuroimaging 5, 770–779 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Musa A., Khan S., Mujahid M., El-gaby M., The shallow cognitive map hypothesis: A hippocampal framework for thought disorder in schizophrenia. Schizophrenia 8, 34 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Nour M. M., Liu Y., Arumuham A., Kurth-Nelson Z., Dolan R. J., Impaired neural replay of inferred relationships in schizophrenia. Cell 184, 4315–4328.e17 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Moser E. I., Kropff E., Moser M.-B., Place cells, grid cells, and the brain’s spatial representation system. Annu. Rev. Neurosci. 31, 69–89 (2008). [DOI] [PubMed] [Google Scholar]
- 13.Foster D. J., Replay comes of age. Annu. Rev. Neurosci. 40, 581–602 (2017). [DOI] [PubMed] [Google Scholar]
- 14.Bellmund J. L. S., Gärdenfors P., Moser E. I., Doeller C. F., Navigating cognition: Spatial codes for human thinking. Science 362, eaat6766 (2018). [DOI] [PubMed] [Google Scholar]
- 15.Behrens T. E. J., et al. , What is a cognitive map? Organizing knowledge for flexible behavior. Neuron 100, 490–509 (2018). [DOI] [PubMed] [Google Scholar]
- 16.O’Keefe J., Nadel L., The Hippocampus as a Cognitive Map (Clarendon Press, 1978). [Google Scholar]
- 17.Whittington J. C., et al. , The Tolman-Eichenbaum machine: Unifying space and relational memory through generalisation in the hippocampal formation. Cell 183, 1249–1263 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Liu Y., Dolan R. J., Kurth-Nelson Z., Behrens T. E. J., Human replay spontaneously reorganizes experience. Cell 178, 640–652 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Schuck N. W., Niv Y., Sequential replay of non-spatial task states in the human hippocampus. Science 364, eaaw5181 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Solomon E. A., Lega B. C., Sperling M. R., Kahana M. J., Hippocampal theta codes for distances in semantic and temporal spaces. Proc. Natl. Acad. Sci. U.S.A. 116, 24343–24352 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Hills T. T., Jones M. N., Todd P. M., Optimal foraging in semantic memory. Psychol. Rev. 119, 431–440 (2012). [DOI] [PubMed] [Google Scholar]
- 22.Hills T. T., Todd P. M., Jones M. N., Foraging in semantic fields: How we search through memory. Topics Cogn. Sci. 7, 513–534 (2015). [DOI] [PubMed] [Google Scholar]
- 23.James W., The Principles of Psychology (CreateSpace Independent Publishing Platform, 1890). [Google Scholar]
- 24.Heusser A. C., Fitzpatrick P. C., Manning J. R., Geometric models reveal behavioural and neural signatures of transforming experiences into memories. Nat. Hum. Behav. 5, 905–919 (2021). [DOI] [PubMed] [Google Scholar]
- 25.Altimus C., Harrold J., Jaaro-Peled H., Sawa A., Foster D. J., Disordered ripples are a common feature of genetically distinct mouse models relevant to schizophrenia. Mol. Neuropsychiatry 1, 52–59 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Suh J., Foster D. J., Davoudi H., Wilson M. A., Tonegawa S., Impaired hippocampal ripple-associated replay in a mouse model of schizophrenia. Neuron 80, 484–493 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Zaremba J. D., et al. , Impaired hippocampal place cell dynamics in a mouse model of the 22q11.2 deletion. Nat. Neurosci. 20, 1612–1623 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Corcoran C. M., et al. , Language as a biomarker for psychosis: A natural language processing approach. Schizophrenia Res. 226, 158–166 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Holmlund T. B., Cheng J., Foltz P. W., Cohen A. S., Elvevåg B., Updating verbal fluency analysis for the 21st century: Applications for psychiatry. Psychiatry Res. 273, 767–769 (2019). [DOI] [PubMed] [Google Scholar]
- 30.Morgan S. E., et al. , Natural language processing markers in first episode psychosis and people at clinical high-risk. Transl. Psychiatry 11, 630 (2021). 10.1038/s41398-021-01722-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hitczenko K., Mittal V. A., Goldrick M., Understanding language abnormalities and associated clinical markers in psychosis: The promise of computational methods. Schizophrenia Bull. 47, 344–362 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Palaniyappan L., Benrimoh D., Voppel A., Rocca R., Studying psychosis using natural language generation: A review of emerging opportunities. Biol. Psychiatry Cogn. Neurosci. Neuroimag. S2451902223001040 (2023). [DOI] [PubMed] [Google Scholar]
- 33.Lake B. M., Murphy G. L., Word meaning in minds and machines. Psychol. Rev. 130, 401–431 (2021), 10.1037/rev0000297. [DOI] [PubMed] [Google Scholar]
- 34.Goldstein A., et al. , Shared computational principles for language processing in humans and deep language models. Nat. Neurosci. 25, 369–380 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Caucheteux C., Gramfort A., King J.-R., Evidence of a predictive coding hierarchy in the human brain listening to speech. Nat. Hum. Behav. 7, 430–441 (2023), 10.1038/s41562-022-01516-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Heilbron M., Armeni K., Schoffelen J.-M., Hagoort P., de Lange F. P., A hierarchy of linguistic predictions during natural language comprehension. Proc. Natl. Acad. Sci. U.S.A. 119, e2201968119 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Koskinen M., Kurimo M., Gross J., Hyvärinen A., Hari R., Brain activity reflects the predictability of word sequences in listened continuous speech. NeuroImage 219, 116936 (2020). [DOI] [PubMed] [Google Scholar]
- 38.Schrimpf M., et al. , The neural architecture of language: Integrative modeling converges on predictive processing. Proc. Natl. Acad. Sci. U.S.A. 118, e2105646118 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Binz M., Schulz E., Using cognitive psychology to understand GPT-3. Proc. Natl. Acad. Sci. U.S.A. 120, e2218523120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Mikolov T., Grave E., Bojanowski P., Puhrsch C., Joulin A., “Advances in pre-training distributed word representations” in Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan (European Language Resources Association, 2018), pp. 52–55. [Google Scholar]
- 41.McInnes L., Healy J., Melville J., UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. arXiv [Preprint] (2020). 10.48550/arXiv.1802.03426 (Accessed 16 June 2023). [DOI]
- 42.Bokat C. E., Goldberg T. E., Letter and category fluency in schizophrenic patients: A meta-analysis. Schizophrenia Res. 64, 73–78 (2003). [DOI] [PubMed] [Google Scholar]
- 43.Lundin N. B., Jones M. N., Myers E. J., Breier A., Minor K. S., Semantic and phonetic similarity of verbal fluency responses in early-stage psychosis. Psychiatry Res. 309, 114404 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Mesholam-Gately R. I., Giuliano A. J., Faraone S. V., Goff K. P., Seidman L. J., Neurocognition in first-episode schizophrenia: A meta-analytic review. Neuropsychology 23, 315–336 (2009). [DOI] [PubMed] [Google Scholar]
- 45.Blondel V. D., Guillaume J. L., Lambiotte R., Lefebvre E., Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exp. P10008, 1–12 (2008). [Google Scholar]
- 46.Siew C. S. Q., The orthographic similarity structure of English words: Insights from network science. Appl. Netw. Sci. 3, 13 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Karuza E. A., Thompson-Schill S. L., Bassett D. S., Local patterns to global architectures: Influences of network topology on human learning. Trends Cogn. Sci. 20, 629–640 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Levenshtein V. I., Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady 10, 707–710 (1966). [Google Scholar]
- 49.Kay S. R., Fiszbein A., Opler L. A., The Positive and Negative Syndrome Scale (PANSS) for schizophrenia. Schizophr. Bull. 13, 261–276 (1987). [DOI] [PubMed] [Google Scholar]
- 50.Barron H. C., et al. , Neuronal computation underlying inferential reasoning in humans and mice. Cell 183, 228–243.e21 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Garvert M. M., Dolan R. J., Behrens T. E. J., A map of abstract relational knowledge in the human hippocampal–entorhinal cortex. eLife 6, 1–20 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Constantinescu A. O., O’Reilly J. X., Behrens T. E. J., Organizing conceptual knowledge in humans with a gridlike code. Science 352, 1464–1468 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.McNamee D. C., Stachenfeld K. L., Botvinick M. M., Gershman S. J., Flexible modulation of sequence generation in the entorhinal–hippocampal system. Nat. Neurosci. 24, 851–862 (2021), 10.1038/s41593-021-00831-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Fernández-Ruiz A., et al. , Long-duration hippocampal sharp wave ripples improve memory. Science 364, 1082–1086 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Jadhav S. P., Kemere C., German P. W., Frank L. M., Awake hippocampal sharp-wave ripples support spatial memory. Science 336, 1454–1458 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Michon F., Sun J. J., Kim C. Y., Ciliberti D., Kloosterman F., Post-learning hippocampal replay selectively reinforces spatial memory for highly rewarded locations. Curr. Biol. 29, 1436–1444.e5 (2019). [DOI] [PubMed] [Google Scholar]
- 57.Papale A. E., Zielinski M. C., Frank L. M., Jadhav S. P., Redish A. D., Interplay between hippocampal sharp-wave-ripple events and vicarious trial and error behaviors in decision making. Neuron 92, 975–982 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Roux L., Hu B., Eichler R., Stark E., Buzsáki G., Sharp wave ripples during learning stabilize the hippocampal spatial map. Nat. Neurosci. 20, 845–853 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Bedi G., et al. , Automated analysis of free speech predicts psychosis onset in high-risk youths. npj Schizophrenia 1, 15030 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Çokal D., et al. , The language profile of formal thought disorder. npj Schizophrenia 4, 1–8 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Corcoran C. M., et al. , Prediction of psychosis across protocols and risk cohorts using automated language analysis. World Psychiatry 17, 67–75 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Mota N. B., Copelli M., Ribeiro S., Thought disorder measured as random speech structure classifies negative symptoms and schizophrenia diagnosis 6 months in advance. npj Schizophrenia 3, 1–10 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Spencer T. J., et al. , Lower speech connectedness linked to incidence of psychosis in people at clinical high risk. Schizophrenia Res. 228, 493–501 (2021). [DOI] [PubMed] [Google Scholar]
- 64.Voppel A. E., de Boer J. N., Brederoo S. G., Schnack H. G., Sommer I. E. C., Semantic and acoustic markers in schizophrenia-spectrum disorders: A combinatory machine learning approach. Schizophrenia Bull. 49, S163–S171 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Voleti R., et al. , Language analytics for assessment of mental health status and functional competency. Schizophrenia Bull. 49, S183–S195 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Nettekoven C. R., et al. , Semantic speech networks linked to formal thought disorder in early psychosis. Schizophrenia Bull. 49, S142–S152 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Elvevåg B., Foltz P. W., Weinberger D. R., Goldberg T. E., Quantifying incoherence in speech: An automated methodology and novel application to schizophrenia. Schizophrenia Res. 93, 304–316 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Iter D., Yoon J., Jurafsky D., “Automatic Detection of Incoherent Speech for Diagnosing Schizophrenia” in Proceedings of the Fifth Workshop on Computational Linguistics and Clinical Psychology: From Keyboard to Clinic (Association for Computational Linguistics, 2018), pp. 136–146. [Google Scholar]
- 69.Lundin N. B., et al. , Semantic search in psychosis: Modeling local exploitation and global exploration. Schizophrenia Bull. Open 1, 1–11 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Tang S. X., et al. , Natural language processing methods are sensitive to sub-clinical linguistic differences in schizophrenia spectrum disorders. npj Schizophrenia 7, 1–8 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Ku B. S., Pauselli L., Covington M. A., Compton M. T., Computational linguistic analysis applied to a semantic fluency task: A replication among first-episode psychosis patients with and without derailment and tangentiality. Psychiatry Res. 304, 114105 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Pauselli L., et al. , Computational linguistic analysis applied to a semantic fluency task to measure derailment and tangentiality in schizophrenia. Psychiatry Res. 263, 74–79 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Fradkin I., Nour M. M., Dolan R. J., Theory driven analysis of natural language processing measures of thought disorder using generative language modeling. Biol. Psychiatry Cogn. Neurosci. Neuroimaging S2451902223001258(2023). [DOI] [PubMed] [Google Scholar]
- 74.Dusek J. A., Eichenbaum H., The hippocampus and memory for orderly stimulus relations. Proc. Natl. Acad. Sci. U.S.A. 94, 7109–7114 (1997). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Bunsey M., Eichenbaum H., Conservation of hippocampal memory function in rats and humans. Nature 379, 255–257 (1996). [DOI] [PubMed] [Google Scholar]
- 76.Abbott J. T., Austerweil J. L., Griffiths T. L., Random walks on semantic networks can resemble optimal foraging. Psychol. Rev. 122, 558–569 (2015). [DOI] [PubMed] [Google Scholar]
- 77.Hoffman K., Padberg M., Rinaldi G., “Traveling salesman problem” in Encyclopedia of Operations Research and Management Science, Gass S., Fu M., Eds. (Springer, 2013). [Google Scholar]
- 78.Akaike H., A new look at the statistical model identification. IEEE Trans. Automatic Control 19, 716–723 (1974). [Google Scholar]
- 79.Liu Y., et al. , Temporally delayed linear modelling (TDLM) measures replay in both animals and humans. eLife 10, e66917 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Analysis code and data to reproduce the results in the paper will be made available at github.com/matthewnour/verbal_fluency_trajectories and github.com/YunzheLiu/TDLM. The manuscript additionally relates behavioral measures to MEG data originally presented in a previous study: (11).