Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Jan 25.
Published in final edited form as: Cortex. 2011 Oct 29;48(3):373–377. doi: 10.1016/j.cortex.2011.10.008

Picture-word interference and the response-exclusion hypothesis: A response to Mulatti and Coltheart

Bradford Z Mahon 1,2, Frank E Garcea 1, Eduardo Navarrete 3
PMCID: PMC4725326  NIHMSID: NIHMS752967  PMID: 22130091

Abstract

Mulatti and Coltheart (2011, this issue) review and summarize several findings from the picture-word interference paradigm that the authors argue challenge the Response Exclusion Hypothesis. However, the hypothesis they take to be the Response Exclusion Hypothesis is not that theory—it is an account developed by Mulatti and Coltheart that holds that target naming latencies in the picture-word paradigm are affected only by the process of excluding the distractor word (and by nothing else). We consider some of the background assumptions implicit in Mulatti and Coltheart’s discussion that may have led to this misattribution. Finally, we report a replication of an effect originally described by Dalrymple-Alford (1972) that serves as an empirical basis for reiterating the main points of our proposal and outlining the challenges that lie ahead.


In order for a manipulation of the speed of processing of the distractor (e.g. distractor frequency) to affect the time the response to the target accesses the buffer, it is necessary that the response to the target waits for the buffer to be purged. But this has an unwanted implication: If the response to the target has to wait for the distractor to be purged from the buffer, the benefit that derives from the picture name being of high frequency with respect to being low frequency is cancelled. … An example may illustrate the point.

Suppose that Leonardo (low frequency picture) and Michelangelo (high frequency picture) want to talk to Giotto (the buffer), who is in the Scrovegni Chapel. Leonardo and Michelangelo start from the same point to go to Giotto, but Michelangelo can walk much faster than Leonardo, and gets to the Chapel five minutes before Leonardo. If Giotto is immediately available, then Michelangelo will talk to Giotto 5 minutes before Leonardo. But if Giotto is busy, for example he is painting (that is, the buffer is occupied), then Michelangelo will have to wait, and the advantage of being able to walk much faster than Leonardo vanishes.

(Mulatti and Coltheart, 2011, this issue, p XX)

Imagine the scenario in which Michelangelo called Giotto on his cell while passing through the Prato della Valle, and said, “Hey Giotto, I’m coming to see you so finish up what you are doing and don’t keep me waiting for too long.” And then Leonardo, on his walk, made the same call to say the same thing from the same place. This could result in the situation in which the difference in speed of arrival between Michelangelo (high frequency target) and Leonardo (low frequency target) was not swallowed up by Giotto’s work (distractor exclusion), while at the same time, Giotto might make his visitors wait a bit (distractor exclusion slows down target processing).

Another possibility: Giotto does not receive any advance warning from his visitors, but he is the type who can work only when under pressure. So only when he knows that there is a visitor waiting for him does he start his work (the process of excluding the distractor only begins when a target representation has arrived at the buffer). If that were the case, then with respect to the time that he started walking, Michelangelo (high frequency target) would get to speak to Giotto five minutes sooner than would Leonardo (low frequency target).

There are a number of assumptions that Mulatti and Coltheart’s argument does not acknowledge but which must be specified. For instance, it is critical to know how long Giotto’s work takes relative to the time it takes Michelangelo and Leonardo to walk to him—How long does it take to exclude distractors? It is also important to know whether Giotto starts his work while his visitors are walking, or waits to start his work—can distractors, in principle, be excluded before the target arrives at the buffer? It is also critical to know whether Giotto communicates anything to his visitors about whether he is busy or not, and whether his visitors communicate anything to Giotto about their imminent arrival – is there any exchange of information about the state of lexicalization and distrator exclusion? Does such (potential) information exchange affect processing time?

These types of considerations mean that, at best, the gloomy forcast for the Response Exclusion Hypothesis fortold by Mulatti and Coltheart (2011, this issue) does not follow with the force of logic. This is because the authors’ arguments are built on premises of their own invention—not the assumptions that compose the Response Exclusion Hypothesis (Janssen et al., 2008; Mahon et al., 2007; Miozzo and Caramazza, 2003). Once it is assumed, as do Mulatti and Coltheart, that the time to produce the picture name depends only on the time it takes the system to exclude the distractor word, and nothing else, then it follows (by definitional fiat) that there will be no effect on naming latencies of any variables known to affect target retrieval. We know that hypothesis, outlined for the first time by Mulatti and Coltheart, is falsified by the many robust demonstrations of variables affecting target picture naming latencies (e.g., the picture frequency effect). Because Mulatti and Coltheart contend that their hypothesis is ‘entailed’ by the Response Exclusion Hypothesis they are led into the logical error of concluding that rejecting their hypothesis implies rejection of the Response Exclusion Hypothesis. Two background commitments seem to drive the authors’ approach in thinking about these issues.

The first background assumption behind Mulatti and Coltheart’s argument is that if two cognitive processes are not structurally serial, then they will run in parallel. In other words, anything that can, on structural or logical grounds, run in parallel, will run in parallel. This is too strong of an assumption. Clearing the single channel output buffer of the distractor and processing the target word up until that channel may not happen (entirely, or partly) in parallel, even though there may be no structural constraint that says they cannot happen in parallel. The reason why excluding the distractor may not (by hypothesis) take place during lexicalization of the target could have to do with other (non structural) reasons. For instance, it may be that endogenously allocated attentional resources are required for both processes, and those resources cannot be divided. Or, for instance, it could be that the process of verifying whether a given representation at the output level is the target can operate over only one word at a time, and only happens over the distractor when it becomes necessary to do so—i.e., when the target has ‘arrived.’

A second background assumption made by Mulatti and Coltheart (2011, this issue) that seems too strong is that cognitive processes operate on the basis of only information that is logically necessary, or otherwise constitutive to their operation. Consider for instance the observation that motor processes are engaged by perceptual tasks, and that perceptual judgments can be influenced by putatively motor-based processes (cf embodied cognition, for reviews and discussion see Chatterjee, 2010; Hickok, 2009; Machery, 2007; Mahon and Caramazza, 2008). Or consider, for instance, that decisions about phonological similarity among auditorily presented words can be affected by the orthographic similarity of the words (Tanenhaus et al., 1980). Those types of findings suggest a different approach for thinking about how to use the picture-word paradigm to study language production—one in which information is promiscuously exchanged among distinct systems. This leads to a different class of questions then emphasized by Mulatti and Coltheart: How is information about distractors ‘bundled’ at the level at which they are excluded as potential responses? Does the information about the provenance of a representation at that level have to be ‘looked up’ or ‘indexed’? If so, then do factors that affect the ‘speed of rejecting distractors’ exert their influence on the actual process of exclusion or rather on the ‘looking up’ of the relevant information? For instance, in the case of the distractor frequency effect, Miozzo and Caramazza (2003) interpreted the effect as having to do with how fast distractors are available for exclusion. Could it also be due, in whole or in part, to how fast the relevant information about the distractor (provenance, category, etc) can be ‘looked up’ or otherwise accessed?

Mulatti and Coltheart’s review of the picture-word literature is a wonderfully concise summary of findings; that their conclusions regarding the Response Exclusion Hypothesis do not stand is because their starting premises are only a rough caricature of that theory. To their credit, the authors predicate much of their argument on ceteris paribus provisos – and it might be that by ‘ceteris paribus’ they mean to imply a certain set of background assumptions about how the mind (ceteris paribus) must work. But it is in no way obvious how issues for instance about parallel versus serial action among ‘logically’ independent processes match up with expectations of what should be the case, ceteris paribus. Thus, while the meta-analytic component of Mulatti and Coltheart’s article clearly makes an important contribution in and of itself, revision of their initial assumptions could generate conclusions relevant to the Response Exclusion Hypothesis. As the authors frame the issues, however, the empirical findings that they summarize do not discredit or even embarrass the Response Exclusion Hypothesis. If there were to be any theoretical tension induced by those data, then we would have to explore the available hypothesis space and design the right experiments to tease apart the alternatives.

Taking a Step Back

Over and above different ways of fleshing out the Response Exclusion Hypothesis, what are the broader issues at stake? As Mulatti and Coltheart point out, the ‘REH is not a theory of lexical access per se …[but rather an] …account of performance in the PWI task.’ The theory of lexical access within which the REH was situated was one that dispensed with the need for assuming lexical selection by competition, and instead assumed that the most highly activated word is produced (without regard to the levels of activation of other words). That theory of lexical access offers an explanation of the general pattern of facilitation that is observed in the picture-word interference paradigm when distractor words are in a semantic relationship to target pictures.1 If lexical selection is facilitatory, then there are exciting new issues to explore.

For instance, it is exciting to think about how to integrate inferences derived from chronometric effects with inferences derived on the basis of errors (Oppenheim et al., 2010). On the account that we have outlined (Janssen et al., 2008; Mahon et al., 2007; Miozzo and Caramazza, 2003), there are only two places at which speech production can be slowed down – the semantic level at which a decision must be made about which concept to lexicalize, and the ‘point of no return’ at the response level. Lexicalization on this view is a ballistic process that can be sped up, but not slowed down. There is no temporal cost possible at the level of lexical selection—provided that the correct target word is initially selected as being the most highly activated. Sometimes, however, the wrong word is lexicalized, and an error is made, either internally and never overtly uttered, or actually spoken. For instance, brain damage can lead to selective impairments at the lexical level in which patients make overt semantic coordinate errors (e.g., Caramazza and Hillis, 1991), and such errors can also occur in spontaneous speech, albeit rarely (Dell and Reich, 1981). The existence of such errors are not in conflict with the idea that lexical selection is facilitatory – those errors arise (by hypothesis) because the wrong word was the most highly activated. This type of view represents a new way of thinking about how to model response time effects in speech production (e.g., Dell et al., 2008).

We see this direction of development as an important step toward bringing together two traditions that have existed somewhat side-by-side in pyscholinguistics throughout the ‘reign’ of the theory of lexical selection by competition. On the one hand, there have been models primarly designed to explain how the correct word (among the possible alternatives) is selected; that class of models has principally been evaluated in the context of error data. On the other hand, there has been a tradition of models designed to explain when the system produces the correct word, given a prior stipulation about which word will be selected at the lexical level. In other words, theories that modeled uncertainty as to which word would be the target were evaluated more or less independently of theories that modeled uncertainty about when the target word would be produced. If we are no longer compelled to model temporal uncertainty at the level of lexical selection, then we can turn our attention toward integrating these two classes of models.

An empirical example

In the classic Stroop task, participants are slower to name the ink color (e.g., “red”) of a printed incongruent word (green) compared to a printed congruent word (red). Part of this response time effect is due to slowing down in the incongruent condition and part of this response time difference is presumably due to facilitation in the congruent condition. However, the above described congruent condition is ambiguous as to what level of processing the facilitation effect might be occurring (semantic, lexical, phonological). An overlooked finding (Dalrymple-Alford, 1972) resolves the issue by changing the printed words to fire and lawn. According to the model of selection by competition, fire will compete more for saying the word “red” than will lawn, and should thus lead to slower naming latencies. However, if lexical selection is facilitatory then participants would be faster to name the ink color ‘red’ when the printed word is fire than when it is lawn. The finding, originally reported by Darymple-Alford (1972), shows that naming latencies are faster with fire as the distractor than with lawn as the distractor. Glaser and Glaser (1989) replicated the effect, although did not test the zero Stimulus Onset Asychrony (SOA) condition. We have replicated the original experiment from Dalyrmple-Alford (1972) with our own materials and obtained the same pattern (see Figure 1 for a reproduction of Dalrymple-Alford’s experiment and our replication, Table 1 for response times and error by condition, and Supplemental Online Materials for our experimental procedures).

Figure 1.

Figure 1

Naming latencies for Dalrymple-Alford’s (1972) original study and our replication. The red bar represents the RT for the ‘XXXX’ distractor condition

Table 1.

Mean latency (in ms), standard deviation (SD in ms) and error rate (%) by factor and congruency in Experiment 1.

Color Object X-type
Latency SD % Latency SD % Latency SD %
Congruent 621 94 0.01 650 93 0.01 623 87 0.01
Incongruent 732 121 0.05 669 90 0.01 - - -

The observation, in the original Stroop task, that naming latencies (“red”) are slower with incongruent color word distractors (green) than congruent color word distractors (red) is naturally explained by the Response Exclusion Hypothesis: the prepotent response engendered by the distractor (green) must be excluded as not being the intended response to the ink color, and excluding the distractor incurs a cost because the distractor satisfies response relevant criteria (its is a color name in a color naming task). However, the observation that ink naming latencies are facilitated in the congruent non-color word condition (fire) compared to the incongruent non-color word condition (lawn) is not due to an effect of distractor exclusion: neither ‘fire’ nor ‘lawn’ satisfy relevant criteria of the response ‘red’ in an ink naming task. Rather, the distractor ‘fire’ primes the response ‘red’ at the semantic level, but the distractor ‘lawn’ does not—thus, the phenomenon can be explained only if one dispenses with the idea of competitive lexical selection. This invites the prediction that under delayed naming conditions (Janssen et al., 2008), the difference between the distractors ‘red’ and ‘green’ in naming the ink color ‘red’ will be largely preserved, while the difference between the distractors ‘fire’ and ‘lawn’ will dissappear.

Where does this leave us?

We have argued that Mulatti and Coltheart’s arguments do not have the dire consequences for the Response Exclusion Hypothesis that the authors forecast. That said, their discussion is well taken and adds new dimensions to what a theory should address.

As a field, we know that the ‘old view’ of what is happening in the picture-word paradigm does not work. Further assumptions can be made to build upon the old theory in the context of challenging results, such as restrictions on which words may enter into competition (Bloem and La Heij, 2003; but see Navarrete and Costa, 2005), or the stipulation that there needs to be a critical mass of activated nontarget words for those words to compete with the target (Abdel Rahman and Melinger, 2009; but see Mahon and Caramazza, 2009). Thus, we have new versions of the selection by competition hypothesis and an alternative account – the Response Exclusion Hypothesis— and a lot of exciting future experiments in between.

Supplementary Material

1
01

Footnotes

1

IE, an integral part of our general account of behavior in the PWI task is that naming latenceis are modulated by variables that affect target picture retrieval (semantic facilitation). In fact, the motivation for pursuing the Response Exclusion Hypothesis in the first place was the prior inference that lexical retrieval is facilitatory. This makes it all the more puzzling why Mulatti and Coltheart assert that the Response Exclusion Hypothesis maintains that picture naming latencies are affected only by distractor exclusion.

References

  1. Abdel Rahman R, Melinger A. Semantic context effects in language production: A swinging lexical network proposal and a review. Language and Cognitive Processes. 2009;24:713–734. [Google Scholar]
  2. Bloem I, La Heij W. Semantic facilitation and semantic interference in word translation: Implications for models of lexical access in language production. Journal of Memory and Language. 2003;48:468–488. [Google Scholar]
  3. Caramazza A, Hillis AE. Lexical organization of nouns and verbs in the brain. Nature. 1991;349:788–790. doi: 10.1038/349788a0. [DOI] [PubMed] [Google Scholar]
  4. Chatterjee A. Disembodying Cognition. Language and Cognition. 2010;2:79–116. doi: 10.1515/LANGCOG.2010.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Dalrymple-Alford EC. Associative facilitation and interference in the Stroop color–word tests. Perception and Psychophysics. 1972;11:274–276. [Google Scholar]
  6. Dell GS, Oppenheim GM, Kittredge AK. Saying the right word at the right time: Syntagmatic and paradigmatic interference in sentence production. Language and Cognitive Processes. 2008;23:583–608. doi: 10.1080/01690960801920735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Dell GS, Reich PA. Stages in Sentence Production - an Analysis of Speech Error Data. Journal of Verbal Learning and Verbal Behavior. 1981;20:611–629. [Google Scholar]
  8. Glaser WR, Glaser MO. Context effects on Stroop-like word and picture processing. Journal of Experimental Psychology: General. 1989;118:13–42. doi: 10.1037//0096-3445.118.1.13. [DOI] [PubMed] [Google Scholar]
  9. Hickok G. Eight Problems for the mirror neuron theory of action understanding in monkeys and humans. Journal of Cognitive Neuroscience. 2009;21:1229–1243. doi: 10.1162/jocn.2009.21189. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Janssen N, Schirm W, Mahon BZ, Caramazza A. Semantic interference in a delayed naming task: evidence for the response exclusion hypothesis. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2008;34:249–256. doi: 10.1037/0278-7393.34.1.249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Machery E. Concept empiricism: A methodological critique. Cognition. 2007;104:19–46. doi: 10.1016/j.cognition.2006.05.002. [DOI] [PubMed] [Google Scholar]
  12. Mahon BZ, Caramazza A. A critical look at the embodied cognition hypothesis and a new proposal for grounding conceptual content. Journal of Physiology-Paris. 2008;102:59–70. doi: 10.1016/j.jphysparis.2008.03.004. [DOI] [PubMed] [Google Scholar]
  13. Mahon BZ, Caramazza A. Why does lexical selection have to be so hard? Comment on Abdel Rahman and Melinger’s Swinging Lexical Network Proposal. Language and Cognitive Processes. 2009 [Google Scholar]
  14. Mahon BZ, Costa A, Peterson R, Vargas KA, Caramazza A. Lexical selection is not by competition: a reinterpretation of semantic interference and facilitation effects in the picture-word interference paradigm. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2007;33:503–535. doi: 10.1037/0278-7393.33.3.503. [DOI] [PubMed] [Google Scholar]
  15. Navarrete E, Costa A. Phonological activation of ignored pictures: Further evidence for a cascade model of lexical access. Journal of Memory and Language. 2005;53:359–377. [Google Scholar]
  16. Oppenheim GM, Dell GS, Schwartz MF. The dark side of incremental learning: A model of cumulative semantic interference during lexical access in speech production. Cognition. 2010;114:227–252. doi: 10.1016/j.cognition.2009.09.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Tanenhaus MK, Flanigan HP, Seidenberg MS. Orthographic and phonological activation in auditory and visual word recognition. Memory and Cognition. 1980;8:513–520. doi: 10.3758/bf03213770. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
01

RESOURCES