Abstract
Thought disorder may be associated with subtle language abnormalities. Binomials are pairs of words of the same grammatical type that are joined by a conjunction that often have a preferred order (for example, “up and down” is more common than “down and up”). We analyzed speech transcripts from patients with first-episode psychosis and found that atypical ordering of binomial pairs was associated with thought disorder but not with other psychosis symptoms. These results illustrate the potential to generate objective, quantifiable measures of disorganized speech.
Subject terms: Psychosis, Human behaviour, Schizophrenia
Main text
Linguistic and semantic properties of spoken language are abnormal in patients with psychotic disorders. Thought disorder is associated with quantifiable and objective patterns of patient speech. Identifying speech patterns that are correlated with thought disorder may be particularly important for first-episode populations as thought disorder is highly linked with treatment failure in this population1. Neuroimaging studies suggest that patients with thought disorder have specific neurological deficits that may require targeted treatments2. Patients with disorganized speech have diminished syntactic complexity and aberrant use of acausal connective words such as “first”3,4. Other measures may even be able to detect subclinical thought disorder symptoms. Tang et al. used Bidirectional Encoder Representations from Transformers (BERT) to analyze speech from patients with psychosis who had low levels of clinician-assessed formal thought disorder5. They found that the embedding distance from an interview prompt increased across sentences in patients but decreased in controls.
Binomial pairs often have a preferred ordering, which can be demonstrated using the Google Books ngram database6,7. Speaker selection of the preferred ordering requires a complex implicit weighting of phonetic, semantic, and statistical factors, such as word length, word frequency, and temporal sequencing as well as recall of which ordering is more frequently encountered6. This makes binomial ordering a subtle and naturalistic probe of cognitive functioning. In people with disorganized speech, failure to inhibit distractors and aberrant semantic priming combine to create altered and inappropriate context for words8. We hypothesized that the same processes that lead to disorganized speech disrupt the selection of binomial orderings and that this manifests as an increase in less common orderings for binomials.
We analyzed transcripts from Structured Clinical Interview for the Positive and Negative Symptom Scale (SCI-PANSS) interviews of 28 patients with first-episode psychosis and compared disorganization factor scores to binomial ordering statistics. Participants produced a highly variable number of words (mean = 3876.1 with standard deviation = 2714.1) and binomials (mean = 8.5 with standard deviation of 8.1). Binomial count was highly correlated with total number of words spoken (Pearson’s r = 0.88). We used the histogram of all binomial ordering preferences across participants and the cumulative probability across likelihoods to select a threshold for considering a binomial ordering to be “rare” (Fig. 1a). Any ordering that occurred less than 0.33 (that is, was outscored 2 to 1) in the corpus was considered a rare binomial ordering. We expected that participants who used more binomials in their speech were more likely to produce rare binomials and therefore, for each participant, we divided the number of rare binomial orderings by the total number of binomials. The normalized proportion of rare binomial orderings was not statistically significantly different in men and women (unpaired t-test, p = 0.19).
Disorganization factor score was statistically significantly correlated with the proportion of rare binomial orderings (Fig. 1b; Spearman’s r = 0.52, padjusted = 0.046). No other factor score was associated with proportion of rare binomial orderings (all padjusted > 0.33. The Disorganization factor score is a weighted sum of three PANSS items: Conceptual Disorganization, Poor Rapport, and Poor Attention9. We then tested whether any of these measures were correlated with proportion of rare binomial orderings. We found that only the Conceptual Disorganization item was correlated with rare binomial orderings (Spearman’s r = 0.54, padjusted = 0.018; the other components had padjusted > 0.36).
We found that disorganized speech is accompanied by grammatically correct but unusual ordering of binomial pairs. This may arise from impaired cognitive control processes that are unable to reliably produce the preferred binomial ordering and/or impaired recall of the more frequently encountered ordering. Our results are consistent with recent work showing that disorganized speech is associated with idiosyncratic use of function words including conjunctions10,11.
This study has several limitations. The sample size was limited and there was no healthy control comparison group. The participants in this study were assessed using the PANSS, a general psychosis scale, rather than a more fine-grained thought-disorder-specific scale. There may be subtypes of thought disorder, such as positive thought disorder, that are associated with atypical binomial orderings and other subtypes, such as negative thought disorder, that are not8. Future work should use larger sample sizes, incorporate scales that can disaggregate thought disorder subtypes, and include directly probing participants ability to order novel binomials.
Methods
Participants and procedure
All study procedures were approved by the Institutional Review Board of Partners Healthcare/Mass General Brigham. Twenty-eight people with first-episode psychosis (within 3 years of initial diagnosis) were recruited from inpatient units and outpatient clinics at McLean Hospital. Exclusion criteria were limited to a history of head injury, neurological disorders, prior electroconvulsive therapy, and active major medical illness. Participants provided written informed consent. Demographic information for the included participants can be found in Table 1. Participants completed a SCI-PANSS interview, which was scored offline12. Factor scores (Positive, Negative, Disorganized, Excited, Depressed) were weighted sums of PANSS items with weights taken from a validated five-factor model9.
Table 1.
Participants | |
---|---|
n | 28 |
Age (years) | 22.5 ± 2.9 |
Sex (male/female) | 17/11 |
Chlorpromazine (CPZ) equivalents (mg) | 249.7 ± 178.0 |
Dx | |
SZ/SZA | 9 |
BP | 15 |
Other | 4 |
PANSS-positive factor | 7.2 ± 3.2 |
PANSS-negative factor | 10.5 ± 3.6 |
PANSS-disorganized factor | 4.6 ± 2.1 |
PANSS-excited factor | 5.5 ± 2.3 |
PANSS-depressed factor | 4.2 ± 1.6 |
Values are mean ± standard deviation.
Binomials and statistics
We identified a list of binomials linked with the word “and” from each interview transcript. Three participants used fewer than three binomials and were excluded from further analysis. For each binomial, we generated the reverse binomial, for example, for the binomial “write and paint” we generated the binomial “paint and write”. We then used the ngramr package in R to query the Google N-gram database for works published in English between 2010 and 20197,13,14. For each binomial, we calculated occurrences of the true binomial divided by the sum of the occurrences of the true binomial and the reversed binomial (the “binomial ordering proportion”). For some binomials, the reverse ordering never occurred. These binomials were not included in the analysis. Spearman correlation was used to measure the relationships between clinical features and linguistic measures. Adjusted p-values were Bonferroni-corrected.
Acknowledgements
This work was funded by National Institute of Mental Health Grant K23 MH118565 (to M.M.).
Author contributions
M.M.: project design, data collection, data analysis, and writing the manuscript. D.O.: project design and writing the manuscript.
Data availability
Binomial lists and symptom scoring data are available from the corresponding author upon reasonable request.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Roche E, Creed L, Macmahon D, Brennan D, Clarke M. The epidemiology and associated phenomenology of formal thought disorder: a systematic review. Schizophr. Bull. 2015;41:951–962. doi: 10.1093/schbul/sbu129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Cavelti M, Kircher T, Nagels A, Strik W, Homan P. Is formal thought disorder in schizophrenia related to structural and functional aberrations in the language network? A systematic review of neuroimaging findings. Schizophr. Res. 2018;199:2–16. doi: 10.1016/j.schres.2018.02.051. [DOI] [PubMed] [Google Scholar]
- 3.Mackinley M, Chan J, Ke H, Dempster K, Palaniyappan L. Linguistic determinants of formal thought disorder in first episode psychosis. Early Interv. Psychiatry. 2021;15:344–351. doi: 10.1111/eip.12948. [DOI] [PubMed] [Google Scholar]
- 4.Çokal D, et al. The language profile of formal thought disorder. npj Schizophr. 2018;4:1–8. doi: 10.1038/s41537-018-0061-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Tang SX, et al. Natural language processing methods are sensitive to sub-clinical linguistic differences in schizophrenia spectrum disorders. npj Schizophr. 2021;7:1–8. doi: 10.1038/s41537-021-00154-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Morgan E, Levy R. Abstract knowledge versus direct experience in processing of binomial expressions. Cognition. 2016;157:384–402. doi: 10.1016/j.cognition.2016.09.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Michel JB, et al. Quantitative analysis of culture using millions of digitized books. Science. 2011;331:176–182. doi: 10.1126/science.1199644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Palaniyappan, L. Dissecting the neurobiology of linguistic disorganisation and impoverishment in schizophrenia. Semin. Cell Dev. Biol. 10.1016/j.semcdb.2021.08.015 (2021). [DOI] [PubMed]
- 9.Wallwork RS, Fortgang R, Hashimoto R, Weinberger DR, Dickinson D. Searching for a consensus five-factor model of the Positive and Negative Syndrome Scale for schizophrenia. Schizophr. Res. 2012;137:246–250. doi: 10.1016/j.schres.2012.01.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Tausczik YR, Pennebaker JW. The psychological meaning of words: LIWC and computerized text analysis methods. J. Lang. Soc. Psychol. 2010;29:24–54. doi: 10.1177/0261927X09351676. [DOI] [Google Scholar]
- 11.Silva A, Limongi R, MacKinley M, Palaniyappan L. Small words that matter: linguistic style and conceptual disorganization in untreated first-episode schizophrenia. Schizophr. Bull. Open. 2021;2:1–10. doi: 10.1093/schizbullopen/sgab010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kay SR, Fiszbein A, Opler LA. The Positive and Negative Syndrome Scale (PANSS) for schizophrenia. Schizophr. Bull. 1987;13:261–276. doi: 10.1093/schbul/13.2.261. [DOI] [PubMed] [Google Scholar]
- 13.Carmody, S. ngramr: Retrieve and Plot Google n-Gram Data. R package version 1.7.4. https://CRAN.Rproject.org/package=ngramr (2021).
- 14.Team, R. C. R: A language and environment for statistical computing. R Foundation for StatisticalComputing, Vienna, Austria. https://www.R-project.org/ (2017).
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Binomial lists and symptom scoring data are available from the corresponding author upon reasonable request.