Abstract
Hierarchical cognitive mechanisms underlie sophisticated behaviors, including language, music, mathematics, tool-use, and theory of mind. The origins of hierarchical logical reasoning have long been, and continue to be, an important puzzle for cognitive science. Prior approaches to hierarchical logical reasoning have often failed to distinguish between observable hierarchical behavior and unobservable hierarchical cognitive mechanisms. Furthermore, past research has been largely methodologically restricted to passive recognition tasks as compared to active generation tasks that are stronger tests of hierarchical rules. We argue that it is necessary to implement learning studies in humans, non-human species, and machines that are analyzed with formal models comparing the contribution of different cognitive mechanisms implicated in the generation of hierarchical behavior. These studies are critical to advance theories in the domains of recursion, rule-learning, symbolic reasoning, and the potentially uniquely human cognitive origins of hierarchical logical reasoning.
Keywords: Logic, Rule-learning, Pattern recognition, Hierarchical reasoning, Bayesian modeling
1. Introduction
Hierarchical logical reasoning is central to the organization and implementation of sophisticated behaviors (Asano, Boeckx, & Seifert, 2021; Chomsky, 1956; Friederici, 2020; Greenfield, 1991; Greenfield & Schneider, 1977; Palmer, 1977; Thibault et al., 2021), including language, mathematics, music, tool-use, and theory of mind (Corballis, 2014; Greenfield, 1991; Hauser, Chomsky, & Fitch, 2002; Hofstadter, 1979; Laland & Seed, 2021; Maclean, 2016; Pinker & Jackendoff, 2005; Tomasello & Rakoczy, 2003). Hierarchical structures are composed of lower-level units combined to form higher-level ones (Greenfield, 1991). Constituent lower-level units are building blocks for higher-level units and can be flexibly combined and moved around (Fodor, 2001; Westphal-Fitch, Huber, Gomez, & Fitch, 2012). Patricia Greenfield (1991) hypothesized that widespread hierarchical behavior arising from hierarchical cognitive mechanisms is central to domain-general abstract cognition. Here, we identify two approaches that are essential for testing this hypothesis:
Comparisons of hierarchical versus non-hierarchical models are critical tests of the domain-generality of hierarchical cognitive mechanisms.
Generation (as opposed to recognition) of hierarchical structures is the key behavior for identifying hierarchical cognitive mechanisms.
2. Behavior and cognitive mechanisms
Hierarchical behavior, that is, the perception and production of hierarchical structures, differs from hierarchical cognitive mechanisms and mental representations. The fact that hierarchical behavior is seen across multiple domains is uncontroversial (Corballis, 2014; Fischmeister, Martins, Beisteiner, & Fitch, 2017; Fitch, 2014; Hauser & Watumull, 2017; Truswell, 2017; Vyshedskiy, 2019); the claim that hierarchical cognitive mechanisms underlie this behavior is not (Frank, Bod, & Christiansen, 2012; Lobina, 2014). Indeed, hierarchical behavior may arise from non-hierarchical cognitive mechanisms like statistical learning (Camp, 2009; Rey, Perruchet, & Fagot, 2012; Santolin & Saffran, 2018) or ordinal reasoning (D’amato & Colombo, 1990; McGonigle & Chalmers, 1977; Orlov, Yakovlev, Hochstein, & Zohary, 2000; Terrace & McGonigle, 1994). Hierarchical structures abound in nature; for example, primate societies tend to be hierarchically organized (Franz, McLean, Tung, Altmann, & Alberts, 2015). Non-human primates, such as baboons, can perceive social dominance hierarchies (Cheney & Seyfarth, 2008). However, this does not imply that hierarchical cognitive mechanisms are involved; other processes like linear transitive inference may lead to the emergence of dominance hierarchies (Camp, 2009; Franz et al., 2015; Paz-y-Miño, G., A., Kamil, & Balda, 2004). Hierarchical structures can thus exist in the absence of hierarchical processes. An open puzzle is whether complex behaviors require specialized hierarchical mechanisms (Corballis, 2007, 2014; Culbertson & Adger, 2014; Dehaene, Meyniel, Wacongne, Wang, & Pallier, 2015; Fitch, 2014; Greenfield, 1991; Hauser et al., 2002). This question has been mainly asked in the domains of language and music but is applicable to a wide range of hierarchical behavior, including mathematics, tool-use, visual pattern perception, goal-directed actions, reasoning and decision-making, and complex social cognition (Dehaene, Al Roumi, Lakretz, Planton, & Sablé-Meyer, 2022; Hauser et al., 2002).
For decades, language learning studies (Fitch & Hauser, 2004; Miller, 1967; Reber, 1967) have dominated the hierarchical rule-learning literature. However, as McCoy, Culbertson, Smolensky, and Legendre (2021) describe in a recent review of artificial grammar learning paradigms, previous studies could not rule out that participants were using non-hierarchical strategies like attending to word order, bigrams, counting, or cognitive mechanisms like associative chaining or ordinal reasoning (De Vries, Monaghan, Knecht, & Zwitserlood, 2008; Ferrigno, Cheyette, Piantadosi, & Cantlon, 2020; Rey et al., 2012). Other researchers have highlighted how aspects of language comprehension are often assumed to require explicit hierarchical encoding but can be comprehended without hierarchical cognitive mechanisms or may arise from memory constraints (Cornish, Dale, Kirby, & Christiansen, 2017; Frank & Bod, 2011; Frank & Yang, 2018). A recent paper about hierarchical structure learning in infants by Shi, Emond, and Badri (2020) describes how most studies are not designed to contrast linear (i.e., non-hierarchical) versus hierarchical alternatives. In recent years, it has become clear that intelligent agents have multiple cognitive mechanisms at their disposal for solving even simple tasks. To identify what mechanisms underlie hierarchical behavior, we need a priori, ideally formal and mathematical, hypotheses about the expected behavior driven by different alternative cognitive mechanisms, viz., p(behavior ∣ cognitive mechanism). For example, two recent papers formalized competing models of associative, ordinal, and hierarchical processes to test the mechanisms underlying human and monkey behavioral performance in a hierarchical reasoning task (Ferrigno et al., 2020; Lakretz & Dehaene, 2021). Another study showed evidence that multiple mechanisms can simultaneously influence hierarchical behavior (Dedhe et al., 2022a). Formulating these predictions in Bayesian terms is helpful (Fitch, 2014) because researchers can then use formal models to calculate p(cognitive mechanism ∣ behavior)—the inferred probability that a given cognitive mechanism was involved in driving hierarchical behavior (Dedhe et al., 2022b). This approach allows for a more precise assessment of whether different cognitive mechanisms, both hierarchical and linear, interact with each other during a task. In summary, comparisons of hierarchical versus non-hierarchical models are critical tests of the domain-generality of hierarchical cognitive mechanisms.
3. Recognition and generation
Studies of natural behaviors suggest a shift in the mechanisms underlying hierarchical behavior over time. Every-day hierarchical behaviors occur in the domains of action and tool-use during routines and constructing complex objects. Children progress through a series of developmental stages from linear to hierarchical as they learn these everyday hierarchical routines (Beagles-Roos & Greenfield, 1979; Goodson & Greenfield, 1975; Greenfield, 1991; Greenfield & Childs, 1977; Greenfield & Schneider, 1977). Domains like language (Matthei, 1982) and theory of mind (Tomasello, 2018) also exhibit developmental progressions from linear to hierarchical rule use during early childhood. A major limitation of prior research about the development of hierarchical behavior is its focus on the recognition (i.e., comprehension) of hierarchical structures as opposed to their generation (i.e., production). Recognition tasks typically involve forcing participants to choose between two alternatives, and the underlying mechanisms are often ambiguous (McCoy et al., 2021). In contrast, generation tasks involve the active step-by-step production of hierarchical structures that need to be selected from a large number of non-hierarchical alternatives. The robustness of hierarchical rule use is less apparent in tasks that do not require generation. Such recognition tasks might, therefore, tap into different cognitive mechanisms than tasks that do require generation. A handful of recent studies used generation tasks to test whether hierarchical cognitive mechanisms were implicated in behavior (Ferrigno et al., 2020; Jiang et al., 2018; Lake & Piantadosi, 2020; Malassis, Dehaene, & Fagot, 2020; Rey et al., 2012). These studies of hierarchical rule generation test not only whether an individual detects hierarchical patterns, but also whether they can use hierarchical logical reasoning to produce those structures, thus providing a stronger test of learning, rule-use, and generalization. In summary, generation (as opposed to recognition) of hierarchical structures is the key behavior for identifying hierarchical cognitive mechanisms.
4. Conclusion
The hypothesis that diverse hierarchical behaviors, including language, music, mathematics, tool-use, visual pattern perception, goal-directed actions, reasoning & decision-making, and complex social cognition arise from domain-general hierarchical cognitive mechanisms, has been a mainstay of cognitive science over the past few decades. However, prior approaches to hierarchical logical reasoning have often failed to distinguish between observable hierarchical behavior and unobservable hierarchical cognitive mechanisms. Furthermore, past research has been largely methodologically restricted to passive recognition tasks as compared to active generation tasks that are stronger tests of the use of hierarchical rules. These two limitations need to be addressed to test the domain-generality of hierarchical cognitive mechanisms. We therefore argue that it is necessary to implement learning studies in humans, non-human species, and machines that are analyzed with formal models comparing the contribution of different cognitive mechanisms implicated in the generation of hierarchical behavior.
Hierarchical generation tasks are often difficult for non-humans to learn. Non-human primates generate hierarchical structures in some tasks, but only after extensive training (Ferrigno et al., 2020; Rey et al., 2012). In the wild, non-human vocal sequences typically lack hierarchical structure (Girard-Buttoz et al., 2022; Townsend, Engesser, Stoll, Zuberbühler, & Bickel, 2018), while the hierarchical sequences that are generated by songbirds and whales may be devoid of meaning and rich semantic content (Sainburg, Theilman, Thielk, & Gentner, 2019; Suzuki, Buck, & Tyack, 2006). Machines also struggle to learn hierarchical structures (Elman, 1990; Kirov & Frank, 2012; Larketz et al., 2021; though see Murty et al., 2022; Yang & Piantadosi, 2022). Furthermore, machines better approximate human-like hierarchical rules when endowed with inductive biases toward inferring hierarchies like nested-tree structures (Coopmans, De Hoop, Kaushik, Hagoort, & Martin, 2022; Lake & Piantadosi, 2020). In summary, differences between species and between natural versus artificial intelligences suggest that humans are “dendrophiliacs,” showing a widespread proclivity toward hierarchical psychological processes (Dehaene et al., 2022; Fitch, 2014). Further research is needed to shed light onto the potentially uniquely human origins of domain-general hierarchical logical reasoning, an important outstanding puzzle for cognitive science.
References
- Asano R, Boeckx C, & Seifert U (2021). Hierarchical control as a shared neurocognitive mechanism for language and music. Cognition, 216, 104847. [DOI] [PubMed] [Google Scholar]
- Beagles-Roos J, & Greenfield PM (1979). Development of structure and strategy in two-dimensional pictures. Developmental Psychology, 15(5), 483. [Google Scholar]
- Camp E. (2009). A language of baboon thought? In The philosophy of animal minds. [Google Scholar]
- Cheney DL, & Seyfarth RM (2008). Baboon metaphysics. University of Chicago Press. [Google Scholar]
- Chomsky N. (1956). Three models for the description of language. IRE Transactions on Information Theory, 2(3), 113–124. [Google Scholar]
- Coopmans CW, De Hoop H, Kaushik K, Hagoort P, & Martin AE (2022). Hierarchy in language interpretation: Evidence from behavioural experiments and computational modelling. Language, Cognition and Neuroscience, 37(4), 420–439. [Google Scholar]
- Corballis MC (2007). The uniqueness of human recursive thinking: The ability to think about thinking may be the critical attribute that distinguishes us from all other species. American Scientist, 95(3), 240–248. [Google Scholar]
- Corballis MC (2014). The recursive mind. Princeton University Press. [Google Scholar]
- Cornish H, Dale R, Kirby S, & Christiansen MH (2017). Sequence memory constraints give rise to language-like structure through iterated learning. PLoS One, 12(1), e0168532. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Culbertson J, & Adger D (2014). Language learners privilege structured meaning over surface frequency. Proceedings of the National Academy of Sciences, 111(16), 5842–5847. [DOI] [PMC free article] [PubMed] [Google Scholar]
- D’amato MR, & Colombo M (1990). The symbolic distance effect in monkeys (Cebus apella). Animal Learning & Behavior, 18(2), 133–140. [Google Scholar]
- Dedhe A, Piantadosi S, & Cantlon J (2022a). Building blocks of recursive pattern processing in human adults. Paper and talk presented at: 44th Annual Meeting of the Cognitive Science Society. Toronto. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dedhe A, Piantadosi S, & Cantlon J (2022b). Building blocks of recursive pattern processing in human adults, children, and monkeys. Poster presented at: 12th Biennial Conference of the Cognitive Development Society. Madison, WI. [Google Scholar]
- Dehaene S, Al Roumi F, Lakretz Y, Planton S, & Sablé-Meyer M (2022). Symbols and mental programs: A hypothesis about human singularity. Trends in Cognitive Sciences, 26(9), 751–766. [DOI] [PubMed] [Google Scholar]
- Dehaene S, Meyniel F, Wacongne C, Wang L, & Pallier C (2015). The neural representation of sequences: From transition probabilities to algebraic patterns and linguistic trees. Neuron, 88(1), 2–19. [DOI] [PubMed] [Google Scholar]
- De Vries MH, Monaghan P, Knecht S, & Zwitserlood P (2008). Syntactic structure and artificial grammar learning: The learnability of embedded hierarchical structures. Cognition, 107(2), 763–774. [DOI] [PubMed] [Google Scholar]
- Elman JL (1990). Finding structure in time. Cognitive Science, 14(2), 179–211. [Google Scholar]
- Ferrigno S, Cheyette SJ, Piantadosi ST, & Cantlon JF (2020). Recursive sequence generation in monkeys, children, US adults, and native Amazonians. Science Advances, 6(26), eaaz1002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fischmeister FP, Martins MJ, Beisteiner R, & Fitch WT (2017). Self-similarity and recursion as default modes in human cognition. Cortex, 97, 183–201. [DOI] [PubMed] [Google Scholar]
- Fitch WT, & Hauser MD (2004). Computational constraints on syntactic processing in a nonhuman primate. Science, 303(5656), 377–380. [DOI] [PubMed] [Google Scholar]
- Fitch WT (2014). Toward a computational framework for cognitive biology: Unifying approaches from cognitive neuroscience and comparative cognition. Physics of Life Reviews, 11(3), 329–364. [DOI] [PubMed] [Google Scholar]
- Fodor J. (2001). Language, thought and compositionality. Royal Institute of Philosophy Supplements, 48, 227–242. [Google Scholar]
- Frank SL, & Bod R (2011). Insensitivity of the human sentence-processing system to hierarchical structure. Psychological Science, 22(6), 829–834. [DOI] [PubMed] [Google Scholar]
- Frank SL, Bod R, & Christiansen MH (2012). How hierarchical is language use? Proceedings of the Royal Society B: Biological Sciences, 279(1747), 4522–4531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Franz M, McLean E, Tung J, Altmann J, & Alberts SC (2015). Self-organizing dominance hierarchies in a wild primate population. Proceedings of the Royal Society B: Biological Sciences, 282(1814), 20151512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frank SL, & Yang J (2018). Lexical representation explains cortical entrainment during speech comprehension. PLoS One, 13(5), e0197304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friederici AD (2020). Hierarchy processing in human neurobiology: How specific is it? Philosophical Transactions of the Royal Society B, 375(1789), 20180391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Girard-Buttoz C, Zaccarella E, Bortolato T, Friederici AD, Wittig RM, & Crockford C (2022). Chimpanzees produce diverse vocal sequences with ordered and recombinatorial properties. Communications Biology, 5(1), 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goodson BD, & Greenfield PM (1975). The search for structural principles in children’s manipulative play: A parallel with linguistic development. Child Development, 46(3), 734–746. [PubMed] [Google Scholar]
- Greenfield PM (1991). Language, tools and brain: The ontogeny and phylogeny of hierarchically organized sequential behavior. Behavioral and Brain Sciences, 14(4), 531–551. [Google Scholar]
- Greenfield PM, & Childs CP (1977). Weaving, color terms, and pattern representation: Cultural influences and cognitive development among the Zinacantecos of southern Mexico. Inter-American Journal of Psychology, 11, 23–48. [Google Scholar]
- Greenfield PM, & Schneider L (1977). Building a tree structure: The development of hierarchical complexity and interrupted strategies in children’s construction activity. Developmental Psychology, 13(4), 299. [Google Scholar]
- Hauser MD, Chomsky N, & Fitch WT (2002). The faculty of language: What is it, who has it, and how did it evolve? Science, 298(5598), 1569–1579. [DOI] [PubMed] [Google Scholar]
- Hauser MD, & Watumull J (2017). The Universal Generative Faculty: The source of our expressive power in language, mathematics, morality, and music. Journal of Neurolinguistics, 43, 78–94. [Google Scholar]
- Hofstadter DR (1979). Gödel, Escher, Bach (pp. 582–583). New York: Basic Books. [Google Scholar]
- Jiang X, Long T, Cao W, Li J, Dehaene S, & Wang L (2018). Production of supra-regular spatial sequences by macaque monkeys. Current Biology, 28(12), 1851–1859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirov C, & Frank R (2012). Processing of nested and cross-serial dependencies: An automaton perspective on SRN behaviour. Connection Science, 24(1), 1–24. [Google Scholar]
- Laland K, & Seed A (2021). Understanding human cognitive uniqueness. Annual Review of Psychology, 72, 689–716. [DOI] [PubMed] [Google Scholar]
- Lake BM, & Piantadosi ST (2020). People infer recursive visual concepts from just a few examples. Computational Brain & Behavior, 3(1), 54–65. [Google Scholar]
- Lakretz Y, & Dehaene S (2021). Recursive processing of nested structures in monkeys? Two alternative accounts. Science Advances, 6(26), 1002. [Google Scholar]
- Lobina DJ (2014). Probing recursion. Cognitive Processing, 15(4), 435–450. [DOI] [PubMed] [Google Scholar]
- MacLean EL (2016). Unraveling the evolution of uniquely human cognition. Proceedings of the National Academy of Sciences, 113(23), 6348–6354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Malassis R, Dehaene S, & Fagot J (2020). Baboons (Papio papio) process a context-free but not a context-sensitive grammar. Scientific Reports, 10(1), 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matthei EH (1982). The acquisition of prenominal modifier sequences. Cognition, 11(3), 301–332. [DOI] [PubMed] [Google Scholar]
- McCoy RT, Culbertson J, Smolensky P, & Legendre G (2021). Infinite use of finite means? Evaluating the generalization of center embedding learned from an artificial grammar. In Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 43, No. 43). [Google Scholar]
- McGonigle BO, & Chalmers M (1977). Are monkeys logical? Nature, 267(5613), 694–696. [DOI] [PubMed] [Google Scholar]
- Miller GA (1967). Project Grammarama. The psychology of communication. Human Resource Management, 6(3), 43. [Google Scholar]
- Murty S, Sharma P, Andreas J, & Manning CD (2022). Characterizing intrinsic compositionality in transformers with tree projections. arXiv preprint arXiv:2211.01288. [Google Scholar]
- Orlov T, Yakovlev V, Hochstein S, & Zohary E (2000). Macaque monkeys categorize images by their ordinal number. Nature, 404(6773), 77–80. [DOI] [PubMed] [Google Scholar]
- Palmer SE (1977). Hierarchical structure in perceptual representation. Cognitive Psychology, 9(4), 441–474. [Google Scholar]
- G Paz-y-Miño C, Bond AB, Kamil AC, & Balda RP (2004). Pinyon jays use transitive inference to predict social dominance. Nature, 430(7001), 778–781. [DOI] [PubMed] [Google Scholar]
- Pinker S, & Jackendoff R (2005). The faculty of language: What’s special about it? Cognition, 95(2), 201–236. [DOI] [PubMed] [Google Scholar]
- Rey A, Perruchet P, & Fagot J (2012). Centre-embedded structures are a by-product of associative learning and working memory constraints: Evidence from baboons (Papio Papio). Cognition, 123(1), 180–184. [DOI] [PubMed] [Google Scholar]
- Reber AS (1967). Implicit learning of artificial grammars. Journal of Verbal Learning and Verbal Behavior, 6(6), 855–863. [Google Scholar]
- Sainburg T, Theilman B, Thielk M, & Gentner TQ (2019). Parallels in the sequential organization of birdsong and human speech. Nature Communications, 10(1), 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Santolin C, & Saffran JR (2018). Constraints on statistical learning across species. Trends in Cognitive Sciences, 22(1), 52–63. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shi R, Emond E, & Badri S (2020). Hierarchical structure dependence in infants at the early stage of syntactic acquisition. In Proceedings of the 44th Annual Boston University Conference on Language Development (pp. 572–585). Somerville, MA: Cascadilla Press. [Google Scholar]
- Suzuki R, Buck JR, & Tyack PL (2006). Information entropy of humpback whale songs. Journal of the Acoustical Society of America, 119(3), 1849–1866. [DOI] [PubMed] [Google Scholar]
- Terrace HS, & McGonigle B (1994). Memory and representation of serial order by children, monkeys, and pigeons. Current Directions in Psychological Science, 3(6), 180–185. [Google Scholar]
- Thibault S, Py R, Gervasi AM, Salemme R, Koun E, Lövden M, Boulenger V, Roy AC, & Brozzoli C (2021). Tool use and language share syntactic processes and neural patterns in the basal ganglia. Science, 374(6569), eabe0874. [DOI] [PubMed] [Google Scholar]
- Tomasello M. (2018). How children come to understand false beliefs: A shared intentionality account. Proceedings of the National Academy of Sciences, 115(34), 8491–8498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tomasello M, & Rakoczy H (2003). What makes human cognition unique? From individual to shared to collective intentionality. Mind & Language, 18(2), 121–147. [Google Scholar]
- Townsend SW, Engesser S, Stoll S, Zuberbühler K, & Bickel B (2018). Compositionality in animals and humans. PLoS Biology, 16(8), e2006425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Truswell R. (2017). Dendrophobia in bonobo comprehension of spoken English. Mind & Language, 32(4), 395–415. [Google Scholar]
- Westphal-Fitch G, Huber L, Gomez JC, & Fitch WT (2012). Production and perception rules underlying visual patterns: Effects of symmetry and hierarchy. Philosophical Transactions of the Royal Society B: Biological Sciences, 367(1598), 2007–2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vyshedskiy A. (2019). Language evolution to revolution: The leap from rich-vocabulary non-recursive communication system to recursive language 70,000 years ago was associated with acquisition of a novel component of imagination, called Prefrontal Synthesis, enabled by a mutation that slowed down the prefrontal cortex maturation simultaneously in two or more children–The Romulus and Remus hypothesis. Research Ideas and Outcomes, 5, e38546. [Google Scholar]
- Yang Y, & Piantadosi ST (2022). One model for the learning of language. Proceedings of the National Academy of Sciences, 119(5), e2021865119. [DOI] [PMC free article] [PubMed] [Google Scholar]
