With great interest we read the recent report of Guterstam et al. (1) in which they show that the direction of someone’s gaze can be decoded from blood oxygen level-dependent (BOLD) activity in areas MT+ (middle temporal complex) (and temporo-parietal junction). In preceding publications, they presented evidence that the observation of directed gaze evokes motion adaptation (2) and the experience of physical interactions with the environment as if the gaze were pushing the targeted objects (3). The authors suggested that these behavioral effects originate in a cognitive representation of another person’s gaze as a force-carrying beam emanating from the eyes, traversing space and operating on the environment. Against this backdrop, BOLD activity in MT+ is now interpreted as a representation of the motion of the imagined beam.
Undoubtedly, this intriguing concept is in line with the observations. Yet, we are convinced they do not contradict an alternative interpretation. Rather than reflecting a moving beam, the BOLD activity in MT+ might represent the observer’s expectation of motion of the other and/or of the object targeted. An agent shifting attention to an object is likely to engage in physical interactions at any moment, entailing both movements of the object and of the effectors. Guterstam et al. (1) reject this alternative by arguing that in their control condition, in which the agent was blindfolded, MT+ activity did not allow the decoding of stimulus orientation. Indeed, this could be explained by the blindfold’s preventing the beam from being imagined. However, it could as well be due to the fact that the observer may no longer expect interaction-associated motion between the agent, now blindfolded, and the tree. The authors resort to yet another argument to refute the alternative interpretation, namely the assumption that a mighty tree may not be accessible to manipulations with the agent’s unreinforced hand, and consequently that no expectations about interactions are triggered. However, a tree can be approached by means other than grasping, and moreover the stimuli presented did not allow estimates of the size of the tree relative to the agent; assuming the tree to be a tangible toy is equally possible.
Even though we find their hypothesis very inspiring, we think, moreover, that in light of previous research on area MT+ and on the modeling of the other’s intentions the alternative interpretation is more parsimonious. First, it is well established that area MT+ encodes predictions and expectations about the motion direction of objects needed to elicit object-directed eye and hand movement (4–7). Second, perceived gaze is known to be a central building block in modeling intentions of others in an attempt to predict imminent actions, thereby helping the observer to initiate reactions (8–10). Bringing these lines of research together, they provide evidence which allows the results to be explained without postulating an additional cognitive faculty like the encoding of someone’s gaze as energy beams.
Hence, in our view, more carefully controlled experiments will be needed to test the hypothesis preferred by the authors against the alternative hypothesis of interaction-related movement expectations.
Footnotes
The authors declare no competing interest.
References
- 1.Guterstam A., Wilterson A. I., Wachtell D., Graziano M. S. A., Other people’s gaze encoded as implied motion in the human brain. Proc. Natl. Acad. Sci. U.S.A. 117, 13162–13167 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Guterstam A., Graziano M. S. A., Implied motion as a possible mechanism for encoding other people’s attention. Prog. Neurobiol. 190, 101797 (2020). [DOI] [PubMed] [Google Scholar]
- 3.Guterstam A., Kean H. H., Webb T. W., Kean F. S., Graziano M. S. A., Implicit model of other people’s visual attention as an invisible, force-carrying beam projecting from the eyes. Proc. Natl. Acad. Sci. U.S.A. 116, 328–333 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Ferrera V. P., Lisberger S. G., Neuronal responses in visual areas MT and MST during smooth pursuit target selection. J. Neurophysiol. 78, 1433–1446 (1997). [DOI] [PubMed] [Google Scholar]
- 5.Shulman G. L., et al. , Areas involved in encoding and applying directional expectations to moving objects. J. Neurosci. 19, 9480–9496 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Oreja-Guevara C., et al. , The role of V5 (hMT+) in visually guided hand movements: An fMRI study. Eur. J. Neurosci. 19, 3113–3120 (2004). [DOI] [PubMed] [Google Scholar]
- 7.Ilg U. J., Schumann S., Primate area MST-l is involved in the generation of goal-directed eye and hand movements. J. Neurophysiol. 97, 761–771 (2007). [DOI] [PubMed] [Google Scholar]
- 8.Baron-Cohen S., How to build a baby that can read minds: Cognitive mechanisms in mind reading. Curr. Psychol. Cogn. 13, 513–552 (1994). [Google Scholar]
- 9.Flanagan J. R., Rotman G., Reichelt A. F., Johansson R. S., The role of observers’ gaze behaviour when watching object manipulation tasks: Predicting and evaluating the consequences of action. Philos. Trans. R. Soc. Lond. B Biol. Sci. 368, 20130063 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Pesquita A., Chapman C. S., Enns J. T., Humans are sensitive to attention control when predicting others’ actions. Proc. Natl. Acad. Sci. U.S.A. 113, 8669–8674 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
