Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Feb 1.
Published in final edited form as: Psychon Bull Rev. 2017 Feb;24(1):56–63. doi: 10.3758/s13423-016-1100-z

A Cortical Circuit for Voluntary Laryngeal Control: Implications for the Evolution Language

Gregory Hickok 1
PMCID: PMC5652042  NIHMSID: NIHMS912746  PMID: 27368637

Abstract

The development of voluntary laryngeal control has been argued to be a key innovation in the evolution of language. Part of the evidence for this hypothesis comes from neuroscience. For example, comparative research has shown that humans have direct cortical innervation of motor neurons controlling the larynx whereas non-human primates do not. But research on cortical motor control circuits has shown that the frontal lobe cortical motor system does not work alone; it is dependent on sensory feedback control circuits. Thus, the human brain must have evolved not only the required efferent motor pathway but also the cortical circuit for controlling those efferent signals. To fill this gap I propose a link between the evolution of laryngeal control and neuroscience research on the human dorsal auditory-motor speech stream. Specifically, I argue that the dorsal stream Spt (Sylvian parietal-temporal) circuit evolved in step with the direct cortico-laryngeal control pathway and together represented a key advance in the evolution of speech. I suggest that a cortical laryngeal control circuit may play an important role in language by providing a prosodic frame for speech planning.

Introduction

The goal of the present article is (i) to briefly chronical the discovery, organization, and response properties of the cortical dorsal stream in the Dual Stream model of speech processing (Hickok and Poeppel, 2000, Hickok and Poeppel, 2004, Hickok and Poeppel, 2007, Rauschecker and Scott, 2009), (ii) to survey evidence suggesting that this stream is selective for laryngeal control rather than for the speech gesture control more generally, and (iii) to discuss the implications of this finding for understanding aspects of the evolution of language. We start with a brief overview of the Dual Stream model, then discuss the three issues outlined above in turn.

The Dual Stream Model

The Dual Stream model for speech processing holds that speech is processed along two cortical pathways, a ventral auditory-conceptual stream and a dorsal auditory-motor stream (Hickok and Poeppel, 2000, Hickok and Poeppel, 2004, Hickok and Poeppel, 2007) (Figure 1); see also (Rauschecker and Scott, 2009). The model has its roots in the classical aphasia models of the 19th century, in particular that of Wernicke (Wernicke, 1874/1977) and later Lichtheim (Lichtheim, 1885) who both proposed an auditory-conceptual pathway and an auditory-motor pathway. Dual pathway models generally have been proposed to explain a wide range of facts in non-speech sensory systems including audition (Deutsch and Roll, 1976, Poljak, 1926, Rauschecker, 1998, Rauschecker et al., 1995), vision (Milner and Goodale, 1995, Ungerleider and Mishkin, 1982), and somatosensation (Dijkerman and de Haan, 2007). The prevalence of these models and the empirical coverage they provide suggests that a cortical division of labor in sensory systems is a general organizational principle, likely driven by the need for distinct computational mappings onto conceptual versus motor systems (Hickok and Poeppel, 2007).

Figure 1.

Figure 1

A. Schematic diagram of the Dual-Stream model. Phonological network diverges into two streams, a dorsal sensorimotor stream supporting speech motor control and a ventral sensory-conceptual stream supporting comprehension. The relation between auditory-phonological and articulatory network on one hand and auditory-phonological and conceptual network on the other is mediated by distinct interface systems, the “sensorimotor interface” and the “lexical interface.” B. Approximate anatomical locations of the Dual-Stream model components. Regions shaded pink, represent the more bilaterally organized ventral stream. Regions shaded blue represent the dorsal stream, which is strongly left dominant. Functional area Spt (Sylvian parietal temporal) is indicated by the red circle.

The Discovery of Area Spt and the Dorsal Stream Circuit

Hickok & Poeppel (2000) predicted the existence of a cortical auditory-motor interface system:

a reasonable hypothesis is that the inferior parietal lobe contains an important (but not necessarily the only) interface system mediating between auditory and articulatory representations of speech (p. 135).

Subsequent fMRI experiments in my lab identified a region compatible with this hypothesis, which we termed area Spt to reflect its location at the back of the Sylvian fissure at the parietaltemporal boundary (Buchsbaum et al., 2001, Hickok et al., 2003, Hickok et al., 2009). Spt is part of a functionally identified larger network that includes the posterior superior temporal sulcus, the pars opercularis of Broca’s area, and a more dorsal lateral premotor cite. Spt was found to respond both during the perception and (covert) production of speech; both sensory and motor response properties are characteristic features of sensory-motor integration areas in the dorsal visual stream (Andersen, 1997, Gallese et al., 1997, Grefkes and Fink, 2005, Milner and Goodale, 1995). Spt was also found to be relatively selective for vocally-compared to manually-mediated auditory-motor interactions (Pa and Hickok, 2008), to be functionally (Buchsbaum et al., 2001, Buchsbaum et al., 2005) and anatomically (Isenberg et al., 2012) connected to premotor regions involved in speech production, and to exhibit distinct activation patterns during auditory and motor phases of the task (Hickok et al., 2009), which argues against a purely auditory or purely motor explanation for its activation pattern. Damage to this region is associated with conduction aphasia, which represents an auditory-motor integration disorder (Buchsbaum et al., 2011), and more specifically to deficits in verbatim repetition of speech including non-words, which arguably places particularly strong demands on auditory-motor integration at the phonological level (Rogalsky et al., 2015).

On the basis of these observations as well as others not reviewed here, we proposed that area Spt functions as an interface network transforming auditory-based representations of speech in the posterior superior temporal sulcus into articulatory-based representations of speech and vice versa in the service of state feedback control of speech production (Hickok, 2012, Hickok et al., 2011, Houde and Nagarajan, 2011). This hypothesis and Spt’s anatomical location in/near the posterior parietal lobe fits well with the role of this region in the visual dorsal stream and visuomotor integration (Andersen, 1997, Gallese et al., 1997, Grefkes and Fink, 2005, Milner and Goodale, 1995). Detailed hypotheses regarding the computations performed by the Spt network and the evidence behind the hypotheses are beyond the scope of the present discussion but are provided in several recent papers (Hickok, 2012, Hickok, 2014a, Hickok et al., 2011). Importantly, we conceptualized the object of this speech motor control to include both laryngeal and supralaryngeal articulators. Given that the vocal tract is composed of a number of independent articulators, some of which are under the (partial) control of somatosensory systems (Hickok, 2012, Tremblay et al., 2003), the claim that Spt is involved in the control of all of these action subcomponents may have been an over-generalization. In what follows we consider the possibility that Spt may be involved more specifically in laryngeal control.

Reconsidering the Function of the Spt Circuit

In retrospect, a clue that Spt may not be involved in control of the entire speech production system was evident in the 2003 paper that coined the term for the region where we reported that Spt activity was equally robust during covert speech production and covert humming (Hickok et al., 2003). This clearly showed that Spt was not speech specific. What is surprising is that the increased motor control complexity for speech compared to humming simple melodies did not result in any difference in fMRI signal amplitude in Spt. In fact, if anything, Spt activity was slightly greater for humming than speech (Hickok et al., 2003), which is consistent with an earlier study of the more general planum temporale region showing greater activity to tone stimuli compared to speech (Binder et al., 1996).

A more recent study directly addressed the question of which aspects of the vocal apparatus are controlled by Spt (Isenberg, 2012). We measured neural activity in healthy participants while they tracked an externalized moving sound source either with their tongue (pointing the tip of the tongue continuously in the direction of the source) or with the imagined pitch of their voice. For the latter task, subjects were trained to overtly produce a pitch that corresponded continuously to the horizontal position of the sound source (low pitch=left, high pitch=right); in the scanner subjects performed the task without voicing to avoid contamination from auditory feedback. The results in 19 subjects were robust and clear. The pitch-tracking task (red in Figure 2) activated Spt (circled) more than the tongue-tracking task (blue), which activated a somatosensory circuit more robustly. Even though the sound source that guided action was the same and both tasks involved the vocal tract, Spt activated differentially depending on whether the action involved the larynx versus the tongue. A similar finding was reported by Brown et al. (Brown et al., 2008) who localized laryngeal primary motor cortex by comparing fMRI activation during glottal stop-like movements vs. lip protrusion vs. vertical tongue movements. When the authors looked at “incidental” activations in “non-motor areas” they found that presumed Spt (it was not independently functionally localized) activated during the glottal task but not during the lip or tongue task. In that same study, Spt was also robustly activated in a singing-like task that minimized lip, jaw, and tongue movements, i.e., a task that required a varied sequence of laryngeal movements with auditory-motor feedback.

Figure 2.

Figure 2

fMRI activation contrast for “tracking” a moving sound source with the larynx versus the tongue. Red: larynx > tongue; Blue: tongue>larynx. From (Isenberg, 2012).

Implications for the Evolution of Language

Importance of laryngeal control in the evolution of speech

Speech production is dependent on the voluntary control of respiratory, laryngeal, and vocal tract musculature. Such control is present in humans but only partially present in non-human primates who appear to be able to control only supralaryngeal articulators voluntarily (Fitch and Zuberbuhler, 2013). For example, rudimentary forms of vocal learning—social group-driven acoustic modifications to natural vocalizations—have been documented in chimpanzees for sounds produced under supralaryngeal control (e.g., “raspberry” sounds), but not for sounds produced under laryngeal control (Marshall et al., 1999). And explicit training in the articulation of simple spoken words in a home-raised chimp showed that it is possible to learn to produce a small handful of intelligible words but only in a whispered form (Hayes and Hayes, 1951). Thus, the development of voluntary laryngeal control has been argued to be the “key innovation” in the evolution of speech (Jurgens, 2002, Kuypers, 1958a, Kuypers, 1958b).

Anatomical evidence is consistent with the “Kuypers/Jurgens laryngeal hypothesis.” Both humans and non-human primates have direct cortical innervation of motor neurons controlling the supralaryngeal vocal tract (Jurgens, 2002), whereas only humans appear to have direct cortical innervation of motor neurons controlling the larynx (Iwatsubo et al., 1990, Kuypers, 1958a, Kuypers, 1958b, Simonyan, 2014). Moreover, a recent diffusion tractography study in healthy humans and macaques reported a 7-fold stronger connectivity of laryngeal motor cortex (LMC) with inferior parietal and somatosensory regions in humans compared to macaques (Kumar et al., 2016). Interestingly, the center of mass for the inferior parietal target of LMC in this study is approximately 1 cm from the center of mass of area Spt calculated on the basis of an aggregate analysis of more than 100 fMRI scans using variants of our Spt localizer paradigm (Buchsbaum et al., 2011). And equally intriguing is that the same aggregate analysis of Spt localization identified another area with similar auditory-motor response properties in the pre-central sulcus, which is located also about 1 cm from the location of LMC as indicated by a meta-analysis of 19 fMRI studies (Kumar et al., 2016). The rough correspondence between (i) area Spt and the “parietal” target of LMC from Kumar et al. and (ii) LMC itself and the pre-central sulcus activation found in the broader auditory-motor Spt network raises the possibility that the independently identified Spt and LMC circuits are one and the same. This further implicates the Spt circuit in cortical laryngeal control.

The Kuypers/Jurgens largyngeal hypothesis emphasizes the importance of direct cortical control of laryngeal motor neurons in the evolution of speech. But research on cortical motor control circuits has shown that the frontal lobe cortical motor system does not work alone; it is dependent on sensory feedback control circuits (Hickok, 2014b, Shadmehr and Krakauer, 2008, Wolpert, 1997, Wolpert et al., 1995). Thus, the human brain must have evolved not only the required efferent motor pathway but also the cortical circuit for controlling those efferent signals. The hypothesis that I am advancing here is that the Spt circuit evolved in step with direct cortico-laryngeal control pathway and together represented a key advance in the evolution of speech.

Comparative analysis of the Spt region in human and non-human primates

A comparative analysis of the Spt region and its non-human primate homologues may provide important additional clues to the evolution of this circuit. For example, one might expect to see significant anatomical differences in humans compared to non-human primates in this region. Functionally, there are no doubt obvious differences in that the non-human homologue of the Spt region does not control speech, but one would expect to see some broad functional parallels nonetheless if the current organization of the region has a common evolutionary precursor. I will point out below that indeed there are both some significant anatomical differences as well as some broad functional parallels between the Spt region and its presumed non-human primate homologues.

Spt is located within the Sylvian (lateral) fissure, typically at its posterior most extent. There is individual variation in the location of the region, which is defined functionally, such that it can involve the posterior portions of the planum temporale and/or the parietal operculum, typically deep in the fissure, but which can extend laterally toward the crown of the superior temporal and/or supramarginal gyri. Cytoarchitectonically, this location corresponds to area Tpt (Galaburda and Sanides, 1980)(Figure 3). Importantly, this region is not characteristic of auditory cortex. Galaburda and Sanides emphasize that Tpt “lacks specialty features of sensory cortex” (p. 609) and so should not be considered part of auditory cortex. The homologous area in monkey, also called Tpt, has similar cytoarchitectonic features and is accordingly not considered part of auditory cortex (Smiley et al., 2007, Sweet et al., 2005).

Figure 3. Location and cytoarchitectonic organization of the human posterior Sylvian region.

Figure 3

The location of the planum temporal on the posterior supratemporal plane is indicated in red outline on an inflated representation of the brain which shows structures buried in sulci and fissures. The inset shows a close up of the planum temporale region. Colors indicate approximate location of different cytoarchitectonic fields as delineated by (Galaburda and Sanides, 1980). Note that there are four different fields within the planum temporale suggesting functional differentiation, and that these fields extend beyond the planum temporale. The area in yellow corresponds to cytoarchitectonic area Tpt which is not considered part of auditory cortex. Functional area Spt likely falls within cytoarchitectonic area Tpt, although this has never been directly demonstrated. PaAi, internal parakoniocortex; PaAe, external parakoniocortex; Tpt, temporoparietal; PaA c/d, caudodorsal parakoniocortex. (Reprinted with permission from (Hickok and Saberi, 2012))

The anatomical left > right asymmetry of the planum temporale in humans has long been a topic of discussion in the evolution of language (Geschwind and Levitsky, 1968) and much of this effect appears to be driven by Tpt asymmetry in particular (Galaburda et al., 1978). Similar asymmetries are not present in macaques (Lyn et al., 2011) although they are evident in chimpanzees (Gannon et al., 1998, Spocter et al., 2010), which significantly tempered enthusiasm about the relevance of the planum temporale asymmetry in the evolution of language. A more charitable view is that the comparative asymmetry data suggest that something indeed changed in the planum temporale region in the ape lineage sometime after its split with Old World monkeys and this may have laid the groundwork for the evolution of speech. What this change might have been is currently unclear. There has been some discussion regarding the relation between planum asymmetries and handedness in both humans and apes (Hopkins and Nir, 2010), which may be part of the story, although this is controversial (Fitch and Braccini, 2013). In this context it is worth noting that while Spt/Tpt appears to be auditory- and vocal-weighted in its sensorimotor function, it is not exclusively auditory or vocal in humans (Okada and Hickok, 2009, Pa and Hickok, 2008, Pa et al., 2008) or non-human primates (for Tpt, see below). This fact means that any observed anatomical differentiation in the planum temporale region, such as the emergence of left-right asymmetries, do not necessarily have to be attributable at each evolutionary step to vocal communication, although presumably features present in human but not non-human primates would be candidates for vocal communication-related differentiation. One such candidate is the finer-grained left-right asymmetry in planum temporale minicolumn organization (wider column spacing in left than right), which been found in humans but is absent in both macaques and chimpanzees (Buxhoeveden et al., 2001). The authors conclude that this “strongly infers a rewiring of the human PT between hemispheres” (p. 356), which is consistent with the functional asymmetry observed for Spt (Hickok et al., 2003).

From a functional standpoint relatively little is known about Tpt in non-human primates, but a few studies exist. Unit recordings in macaque Tpt have shown the region to be multisensory with auditory inputs dominating (Leinonen et al., 1980). Interestingly, a fraction of the cells responded during the monkey’s own movements (mostly head rotation) and a majority of the auditory responsive cells were modulated by sound location in head-based coordinates. Although much current theorizing on the role of the dorso-caudal auditory network including the planum temporale region broadly has focused on its spatial location sensitivity (Rauschecker, 1998, Rauschecker and Scott, 2009), the functional picture of Tpt is consistent with the region serving an auditory-weighted sensorimotor function, specifically in controlling head movements and for processing head-related sound source location. Indeed, human research has reported that head movements induce fMRI measured activation in this same general location, but which is largely distinct from auditory-responsive areas (Petit and Beauchamp, 2003). The important point here is that monkey Tpt and human Spt (assuming there is some anatomical alignment) appear to be performing similar computational functions but for different motor effector systems. Perhaps that region contains a number auditory-motor integration sub-regions organized around different motor effector systems (head, larynx, etc.), similar to the organization of the intraparietal sulcus for visuomotor integration (Grefkes and Fink, 2005). Even less is known about the function of chimpanzee Tpt although based on a PET study it appears to be responsive to conspecific vocalization (Taglialatela et al., 2009).

All of this is consistent with the existence of a cortical system already present in the primate brain that is well-suited from a computational-anatomic standpoint for a role in auditorily mediated laryngeal control. Specifically, there is a class of circuits in the posterior parietal cortex (visuomotor) and temporal-parietal junction (auditory-motor) that serve a sensory feedback control function for motor coordination (Desmurget and Grafton, 2000, Diedrichsen et al., 2010, Fogassi et al., 2001, Golfinopoulos et al., 2011, Hickok, 2012, Houde and Nagarajan, 2011, Perkell, 2012, Rizzolatti et al., 1997, Shadmehr and Krakauer, 2008). This provides a computational foundation for sensorimotor control in the primate brain. Evidence suggests that during primate evolution there has been a gradual differentiation of one functional-anatomical portion of the sensorimotor system specifically involving auditory-motor integration in the planum temporale/Tpt region with anatomical asymmetries emerging in great apes and with further microcircuit (Buxhoeveden et al., 2001) and anatomical connectivity (Kumar et al., 2016) differentiation emerging in humans. My suggestion is that human area Spt is the functional region that emerged from this differentiation and that it evolved to support voluntary laryngeal control.

Functional analysis of the utility of voluntary laryngeal control

The ability to control vocalization opens a number of fairly obvious communicative possibilities that I won’t spend much time on here. Briefly, signals can be generated or withheld as appropriate to the social and environmental situation; the ability to time the onset and offset of voicing adds a parameter to articulatory space and therefore expands the inventory of distinctive sounds that can be produced; and control of vocal pitch intensity increases the flexibility of emotional and pragmatic communication. It also may serve a higher-order computational function in speech production by providing a scaffolding for phrasal speech planning, which is worth some discussion here.

It is well established that speech is planned not syllable-by-syllable or even word-by-word but in multi-word chunks (Dell, 1986, Dell, 1995, Garrett, 1975). One open question concerns the nature of the planning frame (e.g., is it syntactic or something else?) and one interesting hypothesis in the context of the present discussion is that the frame is prosodic in nature (Shattuck-Hufnagel, 2015). A review of the behavioral evidence for this hypothesis is beyond the scope of this article, but it is worth noting that data from the effects of damage to the Spt region is broadly consistent with the claim. One complication with the previous idea that Spt serves as an auditor-motor interface for vocal tract control is that damage to the region doesn’t produce catastrophic deficits in speech production. Rather the effects are limited to an increase in the phonological error rate, which can be quite mild, and a decline in the ability repeat speech verbatim. In general, these deficits are worse when the planning unit is longer (multiple syllables or multi-word) (Buchsbaum et al., 2011, Goodglass, 1992, Rogalsky et al., 2015). These effects (relatively mild impairment, planning load dependency) might be explained if Spt supports the planning scaffolding rather than control of within word or within syllable articulation. Specifically, disruption of the planning frame will decrease efficiency, forcing planning to occur more locally (e.g., word by word), but not obliterating the ability to generate connected speech. Or, in the context of state feedback control models, disruption of the auditory-motor interface component of a prosodic planning frame would make it difficult to detect and correct errors in the planning frame, thus increasing the likelihood of phonemic errors as syllabic and segmental information is inserted into the frame.

Summary

Evidence has been mounting for laryngeal control as a key advance in the evolution of speech and language. I have argued here that the much studied human auditory-motor dorsal stream (Hickok, 2012, Hickok and Poeppel, 2007, Rauschecker and Scott, 2009) and Spt in particular comprises the cortical circuit for laryngeal control, that the evolution of this circuit is the functional basis for the evolution of anatomical asymmetries in the planum temporale region, and that the circuit plays an important role in language processes by providing a prosodic frame for speech planning. Given the important role of laryngeal control in vocal music, this circuit may also have played a central role in the evolution of music (Fitch, 2006).

References

  • 1.Andersen R. Multimodal integration for the representation of space in the posterior parietal cortex. Philosophical Transactions of the Royal Society of London B Biological Sciences. 1997;352:1421–1428. doi: 10.1098/rstb.1997.0128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Binder JT, Frost JA, Hammeke TA, Rao SM, Cox RW. Function of the left planum temporale in auditory and linguistic processing. Brain. 1996;119:1239–1247. doi: 10.1093/brain/119.4.1239. [DOI] [PubMed] [Google Scholar]
  • 3.Brown S, Ngan E, Liotti M. A larynx area in the human motor cortex. Cereb Cortex. 2008;18:837–845. doi: 10.1093/cercor/bhm131. [DOI] [PubMed] [Google Scholar]
  • 4.Buchsbaum B, Hickok G, Humphries C. Role of Left Posterior Superior Temporal Gyrus in Phonological Processing for Speech Perception and Production. Cognitive Science. 2001;25:663–678. [Google Scholar]
  • 5.Buchsbaum BR, Baldo J, Okada K, Berman KF, Dronkers N, D’Esposito M, Hickok G. Conduction aphasia, sensory-motor integration, and phonological short-term memory - an aggregate analysis of lesion and fMRI data. Brain and language. 2011;119:119–128. doi: 10.1016/j.bandl.2010.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Buchsbaum BR, Olsen RK, Koch P, Berman KF. Human dorsal and ventral auditory streams subserve rehearsal-based and echoic processes during verbal working memory. Neuron. 2005;48:687–697. doi: 10.1016/j.neuron.2005.09.029. [DOI] [PubMed] [Google Scholar]
  • 7.Buxhoeveden DP, Switala AE, Litaker M, Roy E, Casanova MF. Lateralization of minicolumns in human planum temporale is absent in nonhuman primate cortex. Brain Behav Evol. 2001;57:349–358. doi: 10.1159/000047253. [DOI] [PubMed] [Google Scholar]
  • 8.Dell GS. A spreading-activation theory of retrieval in sentence production. Psychological review. 1986;93:283–321. [PubMed] [Google Scholar]
  • 9.Dell GS. Speaking and misspeaking. In: Glietman LR, Liberman M, editors. An invitation to cognitive science: Language. 2. MIT Press; 1995. pp. 183–208. [Google Scholar]
  • 10.Desmurget M, Grafton S. Forward modeling allows feedback control for fast reaching movements. Trends in cognitive sciences. 2000;4:423–431. doi: 10.1016/s1364-6613(00)01537-0. [DOI] [PubMed] [Google Scholar]
  • 11.Deutsch D, Roll PL. Separate “what” and “where” decision mechanisms in processing a dichotic tonal sequence. Journal of experimental psychology. Human perception and performance. 1976;2:23–29. doi: 10.1037//0096-1523.2.1.23. [DOI] [PubMed] [Google Scholar]
  • 12.Diedrichsen J, Shadmehr R, Ivry RB. The coordination of movement: optimal feedback control and beyond. Trends in cognitive sciences. 2010;14:31–39. doi: 10.1016/j.tics.2009.11.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Dijkerman HC, de Haan EH. Somatosensory processes subserving perception and action. The Behavioral and brain sciences. 2007;30:189–201. doi: 10.1017/S0140525X07001392. discussion 201–139. [DOI] [PubMed] [Google Scholar]
  • 14.Fitch WT. The biology and evolution of music: a comparative perspective. Cognition. 2006;100:173–215. doi: 10.1016/j.cognition.2005.11.009. [DOI] [PubMed] [Google Scholar]
  • 15.Fitch WT, Braccini SN. Primate laterality and the biology and evolution of human handedness: a review and synthesis. Annals of the New York Academy of Sciences. 2013;1288:70–85. doi: 10.1111/nyas.12071. [DOI] [PubMed] [Google Scholar]
  • 16.Fitch WT, Zuberbuhler K. Primate precursors to human language: Beyond discontinuity. In: Altenmuller E, et al., editors. Evolution of emotional communication: from sound in nonhuman mammals to speech and music in man. Oxford University Press; 2013. pp. 26–48. [Google Scholar]
  • 17.Fogassi L, Gallese V, Buccino G, Craighero L, Fadiga L, Rizzolatti G. Cortical mechanism for the visual guidance of hand grasping movements in the monkey: A reversible inactivation study. Brain. 2001;124:571–586. doi: 10.1093/brain/124.3.571. [DOI] [PubMed] [Google Scholar]
  • 18.Galaburda A, Sanides F. Cytoarchitectonic organization of the human auditory cortex. Journal of Comparative Neurology. 1980;190:597–610. doi: 10.1002/cne.901900312. [DOI] [PubMed] [Google Scholar]
  • 19.Galaburda AM, Sanides F, Geschwind N. Human brain. Cytoarchitectonic left-right asymmetries in the temporal speech region. Arch Neurol. 1978;35:812–817. doi: 10.1001/archneur.1978.00500360036007. [DOI] [PubMed] [Google Scholar]
  • 20.Gallese V, Fadiga L, Fogassi L, Luppino G, Murata A. A parietal-frontal circuit for hand and grasping movements in the monkey: Evidence from reversible inactivation experiments. In: Thier P, Karnath H-O, editors. Parietal lobe contributions to orientation in 3D space. Springer-Verlag; 1997. pp. 255–270. [Google Scholar]
  • 21.Gannon PJ, Holloway RL, Broadfield DC, Braun AR. Asymmetry of the chimpanzee planum temporale: humanlike pattern of Wernicke’s brain language area homolog. Science. 1998;279:220–222. doi: 10.1126/science.279.5348.220. [DOI] [PubMed] [Google Scholar]
  • 22.Garrett MF. The analysis of sentence production. In: Bower GH, editor. The psychology of learning and motivation. Academic Press; 1975. pp. 133–177. [Google Scholar]
  • 23.Geschwind N, Levitsky W. Human brain: Left-right asymmetries in temporal speech region. Science. 1968;161:186–187. doi: 10.1126/science.161.3837.186. [DOI] [PubMed] [Google Scholar]
  • 24.Golfinopoulos E, Tourville JA, Bohland JW, Ghosh SS, Nieto-Castanon A, Guenther FH. fMRI investigation of unexpected somatosensory feedback perturbation during speech. Neuroimage. 2011;55:1324–1338. doi: 10.1016/j.neuroimage.2010.12.065. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Goodglass H. Diagnosis of conduction aphasia. In: Kohn SE, editor. Conduction aphasia. Lawrence Erlbaum Associates; 1992. pp. 39–49. [Google Scholar]
  • 26.Grefkes C, Fink GR. The functional organization of the intraparietal sulcus in humans and monkeys. J Anat. 2005;207:3–17. doi: 10.1111/j.1469-7580.2005.00426.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Hayes KJ, Hayes C. The intellectual development of a home-raised chimpanzee. Proceedings of the American Philosophical Society. 1951;95:105–109. [Google Scholar]
  • 28.Hickok G. Computational neuroanatomy of speech production. Nature Reviews Neuroscience. 2012;13:135–145. doi: 10.1038/nrn3158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Hickok G. The architecture of speech production and the role of the phoneme in speech processing. Language and cognitive processes. 2014a;29:2–20. doi: 10.1080/01690965.2013.834370. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Hickok G. The myth of mirror neurons: the real neuroscience of communication and cognition. W.W. Norton & Company; 2014b. [Google Scholar]
  • 31.Hickok G, Buchsbaum B, Humphries C, Muftuler T. Auditory-motor interaction revealed by fMRI: Speech, music, and working memory in area Spt. Journal of cognitive neuroscience. 2003;15:673–682. doi: 10.1162/089892903322307393. [DOI] [PubMed] [Google Scholar]
  • 32.Hickok G, Houde J, Rong F. Sensorimotor integration in speech processing: computational basis and neural organization. Neuron. 2011;69:407–422. doi: 10.1016/j.neuron.2011.01.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Hickok G, Okada K, Serences JT. Area Spt in the human planum temporale supports sensory-motor integration for speech processing. Journal of neurophysiology. 2009;101:2725–2732. doi: 10.1152/jn.91099.2008. [DOI] [PubMed] [Google Scholar]
  • 34.Hickok G, Poeppel D. Towards a functional neuroanatomy of speech perception. Trends in cognitive sciences. 2000;4:131–138. doi: 10.1016/s1364-6613(00)01463-7. [DOI] [PubMed] [Google Scholar]
  • 35.Hickok G, Poeppel D. Dorsal and ventral streams: A framework for understanding aspects of the functional anatomy of language. Cognition. 2004;92:67–99. doi: 10.1016/j.cognition.2003.10.011. [DOI] [PubMed] [Google Scholar]
  • 36.Hickok G, Poeppel D. The cortical organization of speech processing. Nature Reviews Neuroscience. 2007;8:393–402. doi: 10.1038/nrn2113. [DOI] [PubMed] [Google Scholar]
  • 37.Hickok G, Saberi K. Redefining the Functional Organization of the Planum Temporale Region: Space, Objects, and Sensory-Motor Integration. In: Poeppel D, et al., editors. The human auditory cortex. Springer; 2012. pp. 333–350. [Google Scholar]
  • 38.Hopkins WD, Nir TM. Planum temporale surface area and grey matter asymmetries in chimpanzees (Pan troglodytes): the effect of handedness and comparison with findings in humans. Behav Brain Res. 2010;208:436–443. doi: 10.1016/j.bbr.2009.12.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Houde JF, Nagarajan SS. Speech production as state feedback control. Frontiers in Human Neuroscience. 2011:5. doi: 10.3389/fnhum.2011.00082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Isenberg AL, Saberi K, Hickok G. Dorsal stream organization for spatially guided sensory-motor integration. Neurobiology of Language Conference. 2012:10. [Google Scholar]
  • 41.Isenberg AL, Vaden KI, Jr, Saberi K, Muftuler LT, Hickok G. Functionally distinct regions for spatial processing and sensory motor integration in the planum temporale. Hum Brain Mapp. 2012;33:2453–2463. doi: 10.1002/hbm.21373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Iwatsubo T, Kuzuhara S, Kanemitsu A, Shimada H, Toyokura Y. Corticofugal projections to the motor nuclei of the brainstem and spinal cord in humans. Neurology. 1990;40:309–312. doi: 10.1212/wnl.40.2.309. [DOI] [PubMed] [Google Scholar]
  • 43.Jurgens U. Neural pathways underlying vocal control. Neuroscience and biobehavioral reviews. 2002;26:235–258. doi: 10.1016/s0149-7634(01)00068-9. [DOI] [PubMed] [Google Scholar]
  • 44.Kumar V, Croxson PL, Simonyan K. Structural Organization of the Laryngeal Motor Cortical Network and Its Implication for Evolution of Speech Production. J Neurosci. 2016;36:4170–4181. doi: 10.1523/JNEUROSCI.3914-15.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Kuypers HG. Corticobular connexions to the pons and lower brain-stem in man: an anatomical study. Brain. 1958a;81:364–388. doi: 10.1093/brain/81.3.364. [DOI] [PubMed] [Google Scholar]
  • 46.Kuypers HG. Some projections from the peri-central cortex to the pons and lower brain stem in monkey and chimpanzee. J Comp Neurol. 1958b;110:221–255. doi: 10.1002/cne.901100205. [DOI] [PubMed] [Google Scholar]
  • 47.Leinonen L, Hyvarinen J, Sovijarvi AR. Functional properties of neurons in the temporo-parietal association cortex of awake monkey. Experimental brain research. Experimentelle Hirnforschung. Experimentation cerebrale. 1980;39:203–215. doi: 10.1007/BF00237551. [DOI] [PubMed] [Google Scholar]
  • 48.Lichtheim L. On aphasia. Brain. 1885;7:433–484. [Google Scholar]
  • 49.Lyn H, Pierre P, Bennett AJ, Fears S, Woods R, Hopkins WD. Planum temporale grey matter asymmetries in chimpanzees (Pan troglodytes), vervet (Chlorocebus aethiops sabaeus), rhesus (Macaca mulatta) and bonnet (Macaca radiata) monkeys. Neuropsychologia. 2011;49:2004–2012. doi: 10.1016/j.neuropsychologia.2011.03.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Marshall AJ, Wrangham RW, Arcadi AD. Does learning affect the structure of vocalizations in chimpanzees? Animal Behavior. 1999;58:825–830. doi: 10.1006/anbe.1999.1219. [DOI] [PubMed] [Google Scholar]
  • 51.Milner AD, Goodale MA. The visual brain in action. Oxford University Press; 1995. [Google Scholar]
  • 52.Okada K, Hickok G. Two cortical mechanisms support the integration of visual and auditory speech: A hypothesis and preliminary data. Neurosci Lett. 2009;452:219–223. doi: 10.1016/j.neulet.2009.01.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Pa J, Hickok G. A parietal-temporal sensory-motor integration area for the human vocal tract: Evidence from an fMRI study of skilled musicians. Neuropsychologia. 2008;46:362–368. doi: 10.1016/j.neuropsychologia.2007.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Pa J, Wilson SM, Pickell B, Bellugi U, Hickok G. Neural organization of linguistic short-term memory is sensory modality-dependent: Evidence from signed and spoken language. Journal of cognitive neuroscience. 2008;20:2198–2210. doi: 10.1162/jocn.2008.20154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Perkell JS. Movement goals and feedback and feedforward control mechanisms in speech production. Journal of neurolinguistics. 2012;25:382–407. doi: 10.1016/j.jneuroling.2010.02.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Petit L, Beauchamp MS. Neural basis of visually guided head movements studied with fMRI. Journal of neurophysiology. 2003;89:2516–2527. doi: 10.1152/jn.00988.2002. [DOI] [PubMed] [Google Scholar]
  • 57.Poljak S. The connections of the acoustic nerve. Journal of Anatomy. 1926;60:465–469. [Google Scholar]
  • 58.Rauschecker JP. Cortical processing of complex sounds. Current opinion in neurobiology. 1998;8:516–521. doi: 10.1016/s0959-4388(98)80040-8. [DOI] [PubMed] [Google Scholar]
  • 59.Rauschecker JP, Scott SK. Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nature neuroscience. 2009;12:718–724. doi: 10.1038/nn.2331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Rauschecker JP, Tian B, Hauser M. Processing of complex sounds in the macaque nonprimary auditory cortex. Science. 1995;268:111–114. doi: 10.1126/science.7701330. [DOI] [PubMed] [Google Scholar]
  • 61.Rizzolatti G, Fogassi L, Gallese V. Parietal cortex: from sight to action. Current opinion in neurobiology. 1997;7:562–567. doi: 10.1016/s0959-4388(97)80037-2. [DOI] [PubMed] [Google Scholar]
  • 62.Rogalsky C, Poppa T, Chen KH, Anderson SW, Damasio H, Love T, Hickok G. Speech repetition as a window on the neurobiology of auditory-motor integration for speech: A voxel-based lesion symptom mapping study. Neuropsychologia. 2015;71:18–27. doi: 10.1016/j.neuropsychologia.2015.03.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Shadmehr R, Krakauer JW. A computational neuroanatomy for motor control. Experimental brain research. Experimentelle Hirnforschung. Experimentation cerebrale. 2008;185:359–381. doi: 10.1007/s00221-008-1280-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Shattuck-Hufnagel S. Prosodic frames in speech production. In: Redford MA, editor. The handbook of speech production. John Wiley & Sons, Inc; 2015. pp. 419–444. [Google Scholar]
  • 65.Simonyan K. The laryngeal motor cortex: its organization and connectivity. Current opinion in neurobiology. 2014;28:15–21. doi: 10.1016/j.conb.2014.05.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Smiley JF, Hackett TA, Ulbert I, Karmas G, Lakatos P, Javitt DC, Schroeder CE. Multisensory convergence in auditory cortex, I. Cortical connections of the caudal superior temporal plane in macaque monkeys. J Comp Neurol. 2007;502:894–923. doi: 10.1002/cne.21325. [DOI] [PubMed] [Google Scholar]
  • 67.Spocter MA, Hopkins WD, Garrison AR, Bauernfeind AL, Stimpson CD, Hof PR, Sherwood CC. Wernicke’s area homologue in chimpanzees (Pan troglodytes) and its relation to the appearance of modern human language. Proceedings. Biological sciences/The Royal Society. 2010;277:2165–2174. doi: 10.1098/rspb.2010.0011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Sweet RA, Dorph-Petersen KA, Lewis DA. Mapping auditory core, lateral belt, and parabelt cortices in the human superior temporal gyrus. J Comp Neurol. 2005;491:270–289. doi: 10.1002/cne.20702. [DOI] [PubMed] [Google Scholar]
  • 69.Taglialatela JP, Russell JL, Schaeffer JA, Hopkins WD. Visualizing vocal perception in the chimpanzee brain. Cereb Cortex. 2009;19:1151–1157. doi: 10.1093/cercor/bhn157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Tremblay S, Shiller DM, Ostry DJ. Somatosensory basis of speech production. Nature. 2003;423:866–869. doi: 10.1038/nature01710. [DOI] [PubMed] [Google Scholar]
  • 71.Ungerleider LG, Mishkin M. Two cortical visual systems. In: Ingle DJ, et al., editors. Analysis of visual behavior. MIT Press; 1982. pp. 549–586. [Google Scholar]
  • 72.Wernicke C. Der aphasische symptomencomplex: Eine psychologische studie auf anatomischer basis. In: Eggert GH, editor. Wernicke’s works on aphasia: A sourcebook and review. Mouton: 1874/1977. pp. 91–145. [Google Scholar]
  • 73.Wolpert DM. Computational approaches to motor control. Trends in cognitive sciences. 1997;1:209–216. doi: 10.1016/S1364-6613(97)01070-X. [DOI] [PubMed] [Google Scholar]
  • 74.Wolpert DM, Ghahramani Z, Jordan MI. An internal model for sensorimotor integration. Science. 1995;269:1880–1882. doi: 10.1126/science.7569931. [DOI] [PubMed] [Google Scholar]

RESOURCES