Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 Mar 28.
Published in final edited form as: Front Lang Sci. 2023 Aug 9;2:1217837. doi: 10.3389/flang.2023.1217837

Lexico-semantics obscures lexical syntax

William Matchin 1
PMCID: PMC10977931  NIHMSID: NIHMS1973501  PMID: 38549784

Abstract

A recently emerging generalization about language and the brain is that brain regions implicated in language that show syntax-related activations (e.g., increased activation for more complex sentence structures) also tend to show word-related activations, such as increased activation to reading real words (e.g. RAIN) relative to pseudowords (e.g. PHREZ) (Fedorenko et al., 2016). Fedorenko et al., (2020) generalize as follows: “…syntactic/combinatorial processing is not separable from lexico-semantic processing at the level of brain regions—or even voxel subsets—within the language network”. Based on this generalization, Fedorenko et al. have made the conclusion “…that a cognitive architecture whereby syntactic processing is not separable from the processing of individual word meanings is most likely”, arguing against “syntax-centric” views of language as promulgated by Chomsky and others. However, the notion of “lexico-semantics” articulated here obscures the fact that words are both syntactic and semantic entities. Because of this, any functional neuroimaging experiment that manipulates lexicality will almost assuredly tax both syntactic and semantic resources, and is therefore inadequate for isolating conceptual-semantic processing in the brain in exclusion to syntax. In addition, Fedorenko et al. do not satisfactorily account for robust lesion data that shows clear functional-anatomical dissociations within the language network. Finally, a “syntax-centric” view of language is perfectly compatible with Fedorenko et al.’s conclusions about syntax-selectivity in the brain because of the multiple potential mappings between linguistic theory and neurobiology beyond individual brain regions.

Words and Rules

“The way language works, then, is that each person’s brain contains a lexicon of words and the concepts they stand for (a mental dictionary) and a set of rules that combine the words to convey relationships among concepts (a mental grammar).”

(Pinker, 1995)

Pinker’s work had a major impact in popularizing the ideas of Chomsky and mainstream generative grammar (MGG). Like Chomsky, Pinker forcibly argued for the concept of an innate linguistic module of the human brain that allows us to learn language. However, his work included some simplifications that have become ensconced in cognitive science. A major one was “Words and Rules” (Pinker, 1999; Pinker & Ullman, 2002): the idea that language is fundamentally two diametrically-opposed systems, a database of form-meaning pairs (the lexicon) and a rule-based, combinatorial system (syntax), each rooted in distinct underlying brain systems (Figure 1, left). Pinker combined the traditional Saussurean notion of human language as fundamentally a system of arbitrary form-meaning pairs with the focus of generative grammar on combinatorial syntactic operations.

Figure 1.

Figure 1.

LEFT: The “Words and Rules” theory of Pinker (1999), adapted from Pinker & Ullman, (2002). Separate cognitive systems are posited, each with distinct memory mechanisms, one containing a lexicon of form-meaning pairs (“words”) and one involved in combining lexical elements together to create complex structures and semantic relations (“rules”). RIGHT: The Y-model posited in mainstream generative grammar (e.g. Chomsky, 1965; 1980; 1995). A syntactic module which combines lexical elements into complex structures is then interpreted in two separate modules, one for phonetic form (how an expression is pronounced), and one for semantic form (the meaning of an utterance). In the Y-model, there is only an indirect and loose connection between the lexicon and meaning, whereas in the Words and Rules model the lexicon is a repository of form-meaning pairs.

While many have taken the “Words and Rules” theory of Pinker to accurately summarize the MGG approach, these approaches are actually fundamentally incompatible (see also Embick & Marantz, 2005). The first models of MGG didn’t even contain a lexicon (Chomsky, 1955, 1957). Later models understood words as syntactic objects; inputs to syntactic computation, only later acquiring phonological and semantic expression (the inverted Y-model; (Chomsky, 1965, 1981, 1995b) (Figure 1, right). Even further, (Chomsky, 1995a) proposed the theory of bare phrase structure, in which the labels of phrasal projections are not traditional syntactic categories like noun and verb, but are rather derived from the lexical items themselves. E.g., instead of verb phrases there are “eat phrases”, instead of noun phrases there are “cat phrases”, etc. Some researchers have pointed to compelling evidence that words do not bear a direct mapping to meaning as Saussure claimed (Pietroski, 2018; Preminger, 2021), referring to phenomenon such as polysemy, in which the meaning of the same lexical item, e.g. “book”, is determined by syntactic context (e.g., the book’s pages were torn – physical object, the book has challenged millions of readers to re-examine their views – abstract collection of words).

I do not claim that this approach is incontrovertibly correct. However, almost every modern (psycho-)linguistic approach acknowledges that words have syntax, a point not raised or addressed by Fedorenko et al. (2020). The most popular alternative approach to MGG, the Construction Grammar/usage-based approach (Goldberg, 2003; Jackendoff, 2002), while advocating for a clearly distinct approach to language, does not abolish the distinction between syntax and semantics, but rather articulates that words and constructions are pairs of syntactic form and meaning. In addition, popular psycholinguistic models of word production involve two stages: the first stage involves going from meaning to the lemma, which includes the syntactic representation of a concept, and in the second stage from the lemma to phonological form (Dell & O’Seaghdha, 1992; Kempen & Huijbers, 1983; Levelt, 1989; Levelt et al., 1999)1. The idea that word retrieval involves access to syntactic information, even in the context of single word production, is mostly uncontroversial in this literature. Thus, the idea of a coherent lexico-semantic system entirely distinct from and diametrically opposed to syntax, is in many ways an aberration, yet it appears to have had substantial impact on cognitive neuroscience.2

Functional differentiation in the language network

Fedorenko et al.’s claims about the neurobiological implementation of language are unusually strong and are based on the problematic notion of “lexico-semantics” reviewed above. First, Fedorenko and colleagues (2020) argue against the idea of a purely syntactic system in the brain because several previous studies “found that any language-responsive brain region or electrode that shows sensitivity to syntactic structure… is at least as sensitive, and often more sensitive, to meanings of individual words.” This is simply the observation that regions implicated in syntax activate more to real words than pseudowords;3,4 there is no evidence that these activations only reflect meaning. A syntactic system in the brain should activate more to real words relative to pseudowords, reflecting access to syntactic elements as reviewed above. Thus, the “lexico-semantic” activations reported in these experiments are ambiguous between lexical-syntactic and lexical-semantic processing.

Then, Fedorenko et al. (2020) performed a series of three additional functional magnetic resonance imaging (fMRI) experiments designed to separately target “lexico-semantic” and syntactic processing, following similar experimental designs in previous research (Dapretto & Bookheimer, 1999; Friederici, 2003; Kuperberg et al., 2000, 2003; Menenti et al., 2011; Noppeney & Price, 2004; Segaert et al., 2012). Here are example stimuli from the critical, putatively “lexico-semantic” conditions in Fedorenko et al. (2020). In these experiments, the initial sentence or clause is followed up by a second sentence or clause which contrasts with the initial one:

  • Experiment 1: “Although his ears were damaged… the man could still cook” (meaning violation)

  • Experiment 2: “The protestor quoted the leader -> The striker cited the chief” (different words)

  • Experiment 3: “The scientist flattered David -> The scientist misled David” (non-synonym)

All three of these experiments conflate syntax and semantics within the “lexico-semantic” condition. In Experiment 1, the meaning violation is also a violation of the expected word, which is a syntactic as well as semantic element. In addition, it is quite possible that violations of meaning are accompanied by syntactic revision processes in order to attempt to re-interpret the sentence. In Experiment 2, different words are different syntactic elements as well as different meanings. In Experiment 3, the manipulation involves changing the final word, which is again both a syntactic and semantic element. It is therefore no surprise that brain areas thought to be potentially selective to syntax (such as the inferior frontal lobe and posterior temporal lobe as postulated by many authors, Bornkessel-Schlesewsky & Schlesewsky, 2013; Friederici, 2017; Hagoort, 2005; Matchin & Hickok, 2020; Tyler & Marslen-Wilson, 2008) show activations to both the “lexico-semantic” and syntactic conditions, because the “lexico-semantic” conditions always involve a “hidden” syntactic manipulation. That is, these experiments always manipulate words, which are intrinsically syntactic as well as semantic, which is a consensus position in linguistic theory as reviewed above.

Researchers using similar experimental conditions in brain imaging research have reported dissociations of syntactic and semantic processing (Dapretto & Bookheimer, 1999; Friederici, 2003; Kuperberg et al., 2000, 2003; Menenti et al., 2011; Noppeney & Price, 2004; Segaert et al., 2012), which seem discrepant with the results reported by Fedorenko et al. (2020). However, these previous authors reported whole-brain activation maps, whereas Fedorenko et al. do not. It is possible that the subtle spatial dissociations reported by previous authors would be replicated in the Fedorenko et al. experiments if whole-brain analyses had been reported.5 Regardless, it is more important that these previous authors operated under the same mistaken assumption as Fedorenko et al.: that “lexico-semantic” manipulations do not tax syntax. The fact that Fedorenko et al. do not (appear to) replicate the syntax-semantics dissociations reported by previous authors is more likely due to the fact that these original experimental designs were flawed to begin with.

Future functional neuroimaging studies investigating the syntax-semantics distinction should account for the dual semantic and syntactic nature of the lexicon by eschewing the conventional notion of “lexical-semantics” itself. Instead, researchers should develop more careful experiments that independently vary the richness of conceptual-semantic content and lexical-syntactic complexity, or separately model these components during sentence comprehension (see Hale et al., 2022; Pylkkänen, 2019, 2020 for reviews).

Syntax in the brain: multiple viable instantiations

Fedorenko et al. (2020) focus on the idea of a syntactic system in the brain which should not be activated by lexical manipulations, a prediction which does not follow from “syntax-centric” theories of MGG (e.g. Chomsky 1965; 1980; 1995). However, they do report a significant preference for the syntactic condition in the posterior temporal lobe for Experiment 2 when using a localizer more sensitive to syntax. It is not a coincidence that the posterior temporal lobe is strongly implicated in syntax by recent authors (Bornkessel-Schlesewsky & Schlesewsky, 2013; Matchin & Hickok, 2020; Pylkkänen, 2019). Recent lesion-symptom mapping literature supports a strong association between syntactic comprehension deficits and damage to posterior temporal-parietal areas (Matchin, Basilakos, et al., 2022; Matchin, den Ouden, et al., 2022; Pillay et al., 2017; Rogalsky et al., 2018), a similar pattern that is also emerging for paragrammatic speech production deficits (Matchin et al., 2020; Yagata et al., 2017). Residual functional activation in posterior temporal lobe after accounting for lesion effects appears to be uniquely associated with aphasia recovery (Wilson et al., 2022; Schneck, 2022). Given that lesion-symptom mapping provides much stronger causal inference than functional neuroimaging (Rorden & Karnath, 2004), such data need to be addressed together.

Finally, while I find the evidence for a hierarchical, abstract syntactic system in posterior temporal lobe to be highly compelling, a variety of multiple perspectives on this issue are possible. First, even if no brain region is selective for syntax, specific network configurations could be (Anderson, 2016; Buzsaki, 2006; Farahani et al., 2019; Schnitzler & Gross, 2005). Further, linguistic theories do not make predictions about how much cortical surface area would be needed to process syntax and semantics, or whether there must be large cortical areas dedicated to processing syntax at all (Embick & Poeppel, 2015; Poeppel & Embick, 2005). The ideas of Chomsky regarding the uniqueness and expressive power of syntax are perfectly compatible with a “slight rewiring of the brain” (Chomsky, 2005) of an evolutionarily recent hominin ancestor, augmenting a sea of brain mechanisms that resulted in the modern human language faculty (Berwick & Chomsky, 2016), regardless of whether there is clear evidence of a large swath of syntax-selective cortex.

Footnotes

1

Some have critiqued the notion of lemmas (Krauska, 2023). However, the alterative view of speech production articulated by these authors also involves two steps with an intermediate syntactic layer.

2

Many researchers use the term “lexico-semantic” to refer to a conglomeration of lexical and conceptual processing that excludes syntax. However, there is an alternative, viable usage of “lexico-semantic” that refers to monadic concepts.

3

The term “nonword” should be supplanted by “pseudoword”. It acknowledges the complexities of the potential higher-level linguistic processing that occurs when people process them (Vitevitch & Luce, 1999).

4

In many of these experiments, the word lists include morphologically complex forms and morphosyntactic features such as past tense. Thus syntax is often present in the putatively “lexico-semantic” conditions.

5

In many papers, Fedorenko et al. do not report whole-brain analyses. However, they are critical supplements to ROI analyses, potentially revealing hidden patterns in the data and allowing for better comparability across studies.

References

  1. Anderson ML (2016). Précis of after phrenology: Neural reuse and the interactive brain. Behavioral and Brain Sciences, 39, e120. [DOI] [PubMed] [Google Scholar]
  2. Berwick RC, & Chomsky N (2016). Why only us: Language and evolution. MIT press. [Google Scholar]
  3. Bornkessel-Schlesewsky I, & Schlesewsky M (2013). Reconciling time, space and function: A new dorsal–ventral stream model of sentence comprehension. Brain and Language, 125(1), 60–76. 10.1016/j.bandl.2013.01.010 [DOI] [PubMed] [Google Scholar]
  4. Buzsaki G (2006). Rhythms of the Brain. Oxford university press. [Google Scholar]
  5. Chomsky N (1955). The logical structure of linguistic theory.
  6. Chomsky N (1957). Syntactic Structures. Mouton. [Google Scholar]
  7. Chomsky N (1965). Aspects of the Theory of Syntax. MIT Press. [Google Scholar]
  8. Chomsky N (1981). Lectures on Government and Binding. Foris. [Google Scholar]
  9. Chomsky N (1995a). Bare Phrase Structure. In Campos H & Kempchinsky P (Eds.), Evolution and Revolution in Linguistic Theory. Georgetown University Press. [Google Scholar]
  10. Chomsky N (1995b). The Minimalist Program. MIT Press. [Google Scholar]
  11. Chomsky N (2005). Three Factors in Language Design. Linguistic Inquiry, 36(1), 1–22. 10.1162/0024389052993655 [DOI] [Google Scholar]
  12. Dapretto M, & Bookheimer SY (1999). Form and Content: Dissociating Syntax and Semantics in Sentence Comprehension. Neuron, 24, 427–432. [DOI] [PubMed] [Google Scholar]
  13. Dell GS, & O’Seaghdha PG (1992). States of lexical access in language production. Cognition, 42(1–3), 287–314. [DOI] [PubMed] [Google Scholar]
  14. Embick D, & Marantz A (2005). Cognitive neuroscience and the English past tense: Comments on the paper by Ullman et al.☆. Brain and Language, 93(2), 243–247. 10.1016/j.bandl.2004.10.003 [DOI] [PubMed] [Google Scholar]
  15. Embick D, & Poeppel D (2015). Towards a computational(ist) neurobiology of language: Correlational , integrated and explanatory neurolinguistics. Language, Cognition and Neuroscience, 30(4), 357–366. 10.1080/23273798.2014.980750 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Farahani FV, Karwowski W, & Lighthall NR (2019). Application of graph theory for identifying connectivity patterns in human brain networks: A systematic review. Frontiers in Neuroscience, 13, 585. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fedorenko E, Blank IA, Siegelman M, & Mineroff Z (2020). Lack of selectivity for syntax relative to word meanings throughout the language network. Cognition, 203, 104348. 10.1016/j.cognition.2020.104348 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Friederici AD (2003). The Role of Left Inferior Frontal and Superior Temporal Cortex in Sentence Comprehension: Localizing Syntactic and Semantic Processes. Cerebral Cortex, 13(2), 170–177. 10.1093/cercor/13.2.170 [DOI] [PubMed] [Google Scholar]
  19. Friederici AD (2017). Language in our brain: The origins of a uniquely human capacity. MIT Press. [Google Scholar]
  20. Gallistel CR, & King AP (2010). Memory and the computational brain: Why cognitive science will transform neuroscience. Wiley-Blackwell. [Google Scholar]
  21. Goldberg AE (2003). Constructions: A new theoretical approach to language. Trends in Cognitive Sciences, 7(5), 219–224. 10.1016/S1364-6613(03)00080-9 [DOI] [PubMed] [Google Scholar]
  22. Hagoort P (2005). On Broca, brain, and binding: A new framework. Trends in Cognitive Sciences, 9(9), 416–423. 10.1016/j.tics.2005.07.004 [DOI] [PubMed] [Google Scholar]
  23. Hale JT, Campanelli L, Li J, Bhattasali S, Pallier C, & Brennan JR (2022). Neurocomputational Models of Language Processing. Annual Review of Linguistics, 8(1), 427–446. 10.1146/annurev-linguistics-051421-020803 [DOI] [Google Scholar]
  24. Jackendoff R (2002). Foundations of language: Brain, meaning, grammar, evolution. Oxford University Press. [DOI] [PubMed] [Google Scholar]
  25. Kempen G, & Huijbers P (1983). The lexicalization process in sentence production and naming: Indirect election of words. Cognition, 14(2), 185–209. 10.1016/0010-0277(83)90029-X [DOI] [Google Scholar]
  26. Krauska A (2023). Moving away from lexicalism in psycho- and neuro-linguistics. Frontiers in Language Sciences. [Google Scholar]
  27. Kuperberg GR, Holcomb PJ, Sitnikova T, Greve D, Dale AM, & Caplan D (2003). Distinct Patterns of Neural Modulation during the Processing of Conceptual and Syntactic Anomalies. Journal of Cognitive Neuroscience, 15(2), 272–293. 10.1162/089892903321208204 [DOI] [PubMed] [Google Scholar]
  28. Kuperberg GR, McGuire PK, Bullmore ET, Brammer MJ, Rabe-Hesketh S, Wright IC, Lythgoe DJ, Williams SCR, & David AS (2000). Common and Distinct Neural Substrates for Pragmatic, Semantic, and Syntactic Processing of Spoken Sentences: An fMRI Study. Journal of Cognitive Neuroscience, 12(2), 321–341. 10.1162/089892900562138 [DOI] [PubMed] [Google Scholar]
  29. Levelt WJM (1989). Speaking: From Intention to Articulation. MIT Press. [Google Scholar]
  30. Levelt WJM, Roelofs A, & Meyer AS (1999). A theory of lexical access in speech production. BEHAVIORAL AND BRAIN SCIENCES, 76. [DOI] [PubMed] [Google Scholar]
  31. Matchin W, Basilakos A, den Ouden D-B, Stark BC, Hickok G, & Fridriksson J (2022). Functional differentiation in the language network revealed by lesion-symptom mapping. NeuroImage, 247, 118778. 10.1016/j.neuroimage.2021.118778 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Matchin W, Basilakos A, Stark BC, den Ouden D-B, Fridriksson J, & Hickok G (2020). Agrammatism and paragrammatism: A cortical double dissociation revealed by lesion-symptom mapping. Neurobiology of Language, 1–47. 10.1162/nol_a_00010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Matchin W, den Ouden D-B, Hickok G, Hillis AE, Bonilha L, & Fridriksson J (2022). The Wernicke conundrum revisited: Evidence from connectome-based lesion-symptom mapping in post-stroke aphasia. Brain, awac219. 10.1101/2021.10.25.465746 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Matchin W, & Hickok G (2020). The Cortical Organization of Syntax. Cerebral Cortex, 30(3), 1481–1498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Menenti L, Gierhan SME, Segaert K, & Hagoort P (2011). Shared Language: Overlap and Segregation of the Neuronal Infrastructure for Speaking and Listening Revealed by Functional MRI. Psychological Science, 22(9), 1173–1182. 10.1177/0956797611418347 [DOI] [PubMed] [Google Scholar]
  36. Noppeney U, & Price CJ (2004). An fMRI Study of Syntactic Adaptation. Journal of Cognitive Neuroscience, 16(4), 702–713. 10.1162/089892904323057399 [DOI] [PubMed] [Google Scholar]
  37. Pietroski PM (2018). Conjoining Meanings: Semantics Without Truth Values. Oxford University Press. [Google Scholar]
  38. Pillay SB, Binder JR, Humphries C, Gross WL, & Book DS (2017). Lesion localization of speech comprehension deficits in chronic aphasia. Neurology, 88(10), 970–975. 10.1212/WNL.0000000000003683 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Pinker S (1995). The language instinct. Penguin. [Google Scholar]
  40. Pinker S (1999). Words and rules: The ingredients of language. Basic Books. [Google Scholar]
  41. Pinker S, & Ullman MT (2002). The past and future of the past tense. Trends in Cognitive Sciences, 6(11), 456–463. 10.1016/S1364-6613(02)01990-3 [DOI] [PubMed] [Google Scholar]
  42. Poeppel D, & Embick D (2005). Defining the relation between linguistics and neuroscience. In Cutler A (Ed.), Twenty-first century psycholinguistics: Four cornerstones (pp. 103–118). Lawrence Erlbaum Associates Publishers. [Google Scholar]
  43. Preminger O (2021). Natural language without semiosis [Invited Talk]. SinFonIJA 14, University of Maryland. https://www.ff.uns.ac.rs/uploads/files/Nauka/Konferencije/2021/Sinfonija%2014/materijali/Invited%20talks/Preminger%20slides.pdf [Google Scholar]
  44. Pylkkänen L (2019). The neural basis of combinatory syntax and semantics. Science, 366(6461), 62–66. 10.1126/science.aax0050 [DOI] [PubMed] [Google Scholar]
  45. Pylkkänen L (2020). Neural basis of basic composition: What we have learned from the red–boat studies and their extensions. Philosophical Transactions of the Royal Society B: Biological Sciences, 375(1791), 20190299. 10.1098/rstb.2019.0299 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Rogalsky C, LaCroix AN, Chen K-H, Anderson SW, Damasio H, Love T, & Hickok G (2018). The Neurobiology of Agrammatic Sentence Comprehension: A Lesion Study. Journal of Cognitive Neuroscience, 30(2), 234–255. 10.1162/jocn_a_01200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Rorden C, & Karnath H-O (2004). Using human brain lesions to infer function: A relic from a past era in the fMRI age? Nature Reviews Neuroscience, 5(10), 812–819. 10.1038/nrn1521 [DOI] [PubMed] [Google Scholar]
  48. Schnitzler A, & Gross J (2005). Normal and pathological oscillatory communication in the brain. Nature Reviews Neuroscience, 6(4), 285–296. [DOI] [PubMed] [Google Scholar]
  49. Segaert K, Menenti L, Weber K, Petersson KM, & Hagoort P (2012). Shared Syntax in Language Production and Language Comprehension—An fMRI Study. Cerebral Cortex, 22(7), 1662–1670. 10.1093/cercor/bhr249 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Tyler LK, & Marslen-Wilson W (2008). Fronto-temporal brain systems supporting spoken language comprehension. Philosophical Transactions of the Royal Society B: Biological Sciences, 363(1493), 1037–1054. 10.1098/rstb.2007.2158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Vitevitch MS, & Luce PA (1999). Probabilistic Phonotactics and Neighborhood Activation in Spoken Word Recognition. Journal of Memory and Language, 40(3), 374–408. 10.1006/jmla.1998.2618 [DOI] [Google Scholar]
  52. Wilson SM, Entrup JL, Schneck SM, Onuscheck CF, Levy DF, Rahman M, Willey E, Casilio M, Yen M, Brito A, Kam W, Davis LT, de Riesthal M, & Kirshner HS (2022). Recovery from aphasia in the first year after stroke. Brain, awac129. 10.1093/brain/awac129 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Yagata SA, Yen M, McCarron A, Bautista A, Lamair-Orosco G, & Wilson SM (2017). Rapid recovery from aphasia after infarction of Wernicke’s area. Aphasiology, 31(8), 951–980. 10.1080/02687038.2016.1225276 [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES