Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Feb 5.
Published in final edited form as: J Cogn Neurosci. 2015 Jun 4;27(10):1936–1947. doi: 10.1162/jocn_a_00831

Early and late electrophysiological effects of distractor frequency in picture naming: Reconciling input and output accounts

Stephanie K Riès 1, Douglas Fraser 2, Katie L McMahon 3, Greig I de Zubicaray 4
PMCID: PMC6362987  NIHMSID: NIHMS994633  PMID: 26042502

Abstract

The ‘distractor-frequency effect’ refers to the finding that high-frequency (HF) distractor words slow picture naming less than low-frequency distractors (HF) in the picture-word interference (PWI) paradigm. Rival input and output accounts of this effect have been proposed. The former attributes the effect to attentional selection mechanisms operating during distractor recognition, while the latter attributes it to monitoring/decision mechanisms operating on distractor and target responses in an articulatory buffer. Using high-density (128 channel) electroencephalography (EEG), we tested hypotheses from these rival accounts. In addition to conducting stimulus- and response-locked whole-brain corrected analyses, we investigated the correct-related negativity (CRN), an event-related potential (ERP) observed on correct trials at fronto-central electrodes proposed to reflect the involvement of domain general monitoring. The whole-brain ERP analysis revealed a significant effect of distractor frequency at inferior right frontal and temporal sites between 100 ms to 300 ms post-stimulus onset, during which lexical access is thought to occur. Response-locked, region of interest (ROI) analyses of fronto-central electrodes revealed a CRN starting 121 ms before and peaking 125 ms after vocal onset on the grand averages. Slope analysis of this component revealed a significant difference between HF and LF distractor words, with the former associated with a steeper slope on the time-window spanning from 100 ms before to 100 ms after vocal onset. The finding of ERP effects in time-windows and components corresponding to both lexical processing and monitoring suggests the distractor frequency effect is most likely associated with more than one physiological mechanism.

Introduction

It is generally accepted that spoken word production involves selecting a target word from a range of activated lexical candidates. This process has been referred to as the main decision mechanism in language production (Levelt, 1989). According to most theoretical models, candidate words are activated via a process of spreading activation from conceptual to lexical representations (see Goldrick, 2007; Levelt, Roelofs, & Meyer, 1999). However, there is considerable disagreement about whether the selection of the target word is accomplished by competitive or non-competitive mechanisms. Because of the core importance of the lexical selection process in speech production, understanding the way it is performed (by competition or not) is essential. Here we attempted to shed light on this issue by studying the brain dynamics associated with picture naming in a paradigm at the centre of the debate between competitive versus non-competitive accounts of lexical selection.

Models implementing non-competitive selection typically assume a “horse race” mechanism in which the lexical candidate with the highest level of activation is produced after passing a predetermined threshold, or after a certain number of time steps (e.g., Caramazza, 1997; Dell, 1986; Mahon, Costa, Peterson, Vargas, & Caramazza, 2007). Competitive lexical selection models instead assume the time taken to produce a target is a function of the number of activated candidates, with the target selected when a critical difference in activation levels is achieved (e.g., Levelt, Roleofs, & Meyer, 1999; Starreveld & La Heij, 1996). One of the major sources of evidence for the latter type of model has come from experimental manipulations using the picture word interference (PWI) paradigm (Rosinski, 1977). In the conventional PWI paradigm, participants are asked to name a target picture and to ignore an accompanying distractor word. When distractor words are manipulated in terms of categorical relations with the target (e.g., picture FOX, distractor pig), target naming latencies are slower in comparison to unrelated distractors (e.g., FOX-pen) – an effect called semantic interference. The predominant explanation of this effect assumes the activation of the lexical representation of the distractor will spread to its semantically-related neighbours, making the selection of the target lexical representation more difficult if it is semantically-related to the distractor than if it is not. Indeed, it will take longer to reach a critical difference between the activation levels of the target and the distractor if the two are from the same semantic category. Semantic interference thus reflects the additional time taken to resolve the increased lexical competition.

However, findings using a novel manipulation in the PWI paradigm have been argued to challenge this competition account: Miozzo and Caramazza (2003) introduced a manipulation of the lexical frequency of the distractor word in the PWI paradigm in the absence of a categorical relation with the target. If lexical selection is by competition, they hypothesised that high frequency (HF) distractor words should compete more with the picture name than low frequency (LF) words because they have higher activation levels. Instead, they found LF distractor words slowed target picture naming more than HF words, a finding that has been replicated multiple times (e.g., Catling et al., 2010; de Zubicaray et al., 2012; Dhooge & Hartsuiker, 2010, 2013; Geng et al., 2013; Starreveld, La Heij, & Verdonschot, 2013). A broad framework for explaining differential naming latencies in the PWI paradigm involves processing capacity constraints: Reading a word is faster than naming a picture (Cattell, 1885, as discussed in Levelt, 2012). Thus, distractor words will be processed faster than target pictures. Therefore, delays in target naming will reflect the time required to process each type of distractor (Miozzo & Caramazza, 2003; de Zubicaray et al., 2012). However, there is disagreement about the locus of the processing delay resulting in the distractor frequency effect. Whereas the first category of explanations argue the processing delay occurs early on, before or as the name of the picture is accessed, other explanations argue for a late locus of the processing delay, after the name of the picture has been accessed.

Within one prominent competitive lexical selection account – the WEAVER++ model - the distractor frequency effect has been interpreted in terms of an attentional mechanism – distractor blocking by a condition-action rule - sensitive to the frequency of the distractor word (Roelofs, 2005; see also Roelofs, Piai, & Schriefers, 2011). The account assumes HF words will be read more quickly than LF words and will therefore be blocked more quickly. According to Roelofs et al. (2011), the speed of blocking is dependent on an initial processing response to the distractor information, and this processing may span word recognition to word form encoding. Starreveld et al. (2013) have similarly proposed an input account involving different recognition thresholds for HF and LF words. The differential threshold account (DTA) also assumes that the activation level of a distractor word decays upon recognition in a manner proportional to its level of activation (i.e., exponential decay). Thus, both of the above explanations predict that the distractor frequency effect occurs early in time, at a lexical level or earlier, while preserving a competitive lexical selection account.

Alternative accounts of the distractor frequency effect in PWI place its locus at a post-lexical stage of processing. Building on Miozzo and Caramazza’s (2003) proposal of a task-specific distractor blocking mechanism in PWI, the response exclusion account (e.g., Finkbeiner & Caramazza, 2006; Mahon et al., 2007) assumes that distractor words have a privileged relationship with the articulators as reading is considered to be more automatic than picture naming: the distractor word enters an articulatory buffer as a phonologically well-formed response before the phonological representation of the picture name (target). The resulting bottleneck is then solved by a decision mechanism. According to this account, the relative speed of entry and removal of the distractor into and out of the buffer will determine the presence or absence of an interference effect. The selection of the correct response then occurs closer to the response output rather than before accessing the name of the picture, as suggested by the input account. As HF distractors are assumed to be read more quickly and enter the buffer earlier, they are excluded more quickly than LF words. Thus, the account assumes a task-specific, post-lexical, non-competitive selection mechanism is responsible for the distractor frequency effect in PWI. As the response exclusion account was devised solely to explain PWI effects and its decision mechanism was relatively under-specified, Dhooge and Hartsuiker (2010) sought to incorporate the account within the broader framework of speech production by equating its operations with those of the verbal self-monitoring system (e.g., Hartsuiker & Kolk, 2001; Levelt, 1989).

As both input and output accounts can potentially explain the distractor frequency effect in naming latencies, other means are needed to establish its locus. Using functional magnetic resonance imaging (fMRI) with a sparse temporal sampling acquisition, de Zubicaray and colleagues (2012) tested whether the distractor frequency effect was associated with differential activity in brain regions predicted by input or output accounts. They derived hypotheses based on Indefrey’s (2011) meta-analysis of neuroimaging and electrophysiological studies of spoken word production. This meta-analysis identified reliable roles for the mid-to-posterior sections of the middle and superior temporal gyri (MTG and STG) in lexical level processes of lexical-conceptual selection and phonological word form retrieval, respectively, within a predominantly left lateralized network. According to the predictions made by this meta-analysis, during picture naming, activity in these two regions typically occurs between 100 and 400 ms following object recognition, with post-lexical processes including articulation occurring between 400 and 600 ms in left inferior frontal and premotor cortical areas, respectively (see also Strijkers & Costa, 2011, for time-course estimates). In addition, the meta-analysis identified a role for bilateral posterior STG in verbal self-monitoring. The fMRI data revealed significant differential activity in bilateral medial frontal (anterior cingulate cortex: ACC, and supplementary motor area: SMA) and lateral premotor cortices, in addition to posterior STG. However, no differential activity was observed in the mid-MTG, leading de Zubicaray et al. to conclude that the distractor frequency effect most likely had a post-lexical locus per the output account.

While spatially informative, fMRI using the sparse acquisition methodology is unable to provide time-course information. Hence, it remains possible that some or all of the regions observed in the de Zubicaray et al. study could have been activated during lexical rather than post-lexical time windows. Our main aim in the current study was therefore to use electroencephalography (EEG) to determine the time course of activity associated with the distractor frequency effect. The application of EEG for researching overt speech has increased in recent years (Ganushchak, Christoffels, & Schiller, 2011), aided by the development of analysis techniques for reducing articulation-related electromyographic artefacts which otherwise heavily pollute EEG signal (e.g., de Vos et al., 2010). In addition to the time-course of the activation of the brain regions reviewed above, several studies have indicated that spoken word production might also engage aspects of domain general monitoring systems that are sensitive to conflict during information processing (e.g., Acheson et al., 2012; de Zubicaray, 2006; Riès et al., 2011, see Nozari et al., 2011 for a detailed theoretical account). In particular, a response-locked, fronto-central negative potential peaking shortly after both erroneous and correct vocal responses has been observed reliably in production paradigms. These event related potentials (ERPs), initially observed during performance of non-linguistic tasks, have been referred to as the error- and correct-related negativities (ERN and CRN), respectively, and are proposed to have a common source in the ACC and/or SMA (e.g., Bonini et al., 2014; Debener et al., 2005; Dehaene, Posner, & Tucker, 1994; Roger et al., 2010). As this response-locked negativity is observed for both correct and incorrect trials, it has been interpreted as reflecting a general response monitoring rather than error-detection system (Riès et al., 2011; Vidal et al., 2000, 2003). The amplitude of the CRN is usually smaller than the ERN in healthy participants. Importantly, the negative potential emerges before vocal onset, i.e., before auditory feedback from an overt vocal response can be perceived, indicating it is likely to reflect monitoring of internal rather than external speech (Riès et al., 2011, 2013b). The precise nature of the representations being monitored remains a matter of debate (e.g., Acheson & Hagoort, 2014).

To date, the only EEG study to have examined the distractor frequency effect is that of Dhooge and Hartsuiker (2013). Those authors reported stimulus-locked analyses from 31 electrodes showing three separate effects with LF distractors showing more negative amplitudes: The first over left and central electrodes at 20–60 ms, the second occurring between 420 and 500 ms over all electrodes, and the third between 520 and 580 ms again over left and central electrodes. Dhooge and Hartsuiker (2013) interpreted the latter two effects as being consistent with the response exclusion account and operation of the verbal self-monitor (e.g., Mahon et al., 2007), as they occurred later than lexical-level processes that typically occur within the first 400 ms. The initial effect was interpreted as being too early for frequency-related word recognition processes that are typically reported in the 150 to 400 ms time window (for review, see Laszlo & Federmeier, 2014), and so unlikely to reflect a distractor blocking mechanism (e.g., Roelofs et al., 2011).

The present study differed from that of Dhooge and Hartsuiker (2013) for three main reasons. First, we investigated the ERP correlates of the distractor frequency effect using both stimulus and response-locked analyses of EEG data to test hypotheses from the input and output accounts. Although stimulus-locked analyses of EEG data are able to provide some information about the time-courses of processes involved in spoken word production (e.g., Blackford et al., 2012; Dhooge & Hartsuiker, 2013), response-locked analyses have been argued to be better suited to observe later effects linked to the production of the response (Ries et al., 2013a). In particular, the ERN and CRN suggested to reflect response monitoring are measured response-locked (Riès et al., 2011; Vidal et al., 2000, 2003). As speech monitoring is thought to be one the mechanisms sensitive to distractor frequency (according to the output account), we investigated this component in particular. In addition, we performed stimulus-locked analyses to test for a potential early effect of distractor frequency as postulated by input accounts. Second, we also aimed at providing some level of spatial information by using the Laplacian transformation (as in Ries et al., 2013a) and high-density EEG recording. Finally, we addressed the problem caused by articulation-related electromyographic artifacts, prominent in scalp EEG studies of overt speech production. We used a blind-source separation algorithm based on canonical correlation analysis, enabling to observe clean EEG signal both time-locked to stimulus and to vocal-onset (as shown in Ries et al., 2011, 2013a, 2013b).

Methods

Participants

Twenty undergraduate students at the University of Queensland participated in the experiment (10 women; mean age 23 years, SD = 3.63). All were right-handed and native English speakers, with no history of neurological or psychiatric disorder, substance dependence, or known hearing deficits. All had normal or corrected-to-normal vision and gave informed consent in accordance with the protocol approved by the Behavioural and Social Sciences Research Ethics Committee of the University of Queensland.

Materials

The materials were identical to those employed by Catling et al. (2010; Experiment 1) and de Zubicaray et al. (2012). Forty-eight black and white line drawings were chosen from the Snodgrass and Vanderwart (1980) corpus. High Frequency (HF) and Low frequency (LF) word distractors were also matched on a range of linguistic variables including age of acquisition (AoA; more information on the matching variables can be found in the appendix of Catling et al, 2010). Each target picture was paired with a HF and a LF word that did not share a semantic or phonological relationship with it.

The stimuli were presented on a 21-inch CRT monitor (NEC, Accusync 120, resolution 1024×768) placed 60 cm in front of the participant. The visual angle approximated 3 degrees. Black and white target pictures (300 × 300 pixels) and superimposed distractor words were presented centrally on a white background. The visual distractor words were presented in red Arial 50-point font.

Stimuli presentation, response recording and latency measurement (i.e., voice key) were accomplished online via the Cogent 2000 toolbox extension (www.vislab.ucl.ac.uk/cogent_2000.php) for MATLAB (2010a, MathWorks, Inc) using a personal computer equipped with a noise-cancelling microphone (Logitech, Inc).

Procedure

Participants were first familiarized with the set of picture stimuli and their corresponding labels below them. In two subsequent blocks they viewed the pictures without labels and were instructed to name them. The experimenter corrected erroneous naming responses.

After familiarization, participants completed three experimental blocks consisting of 96 trials each. There were 48 word-pair combinations and each pair was repeated twice per block. There were 144 trials per frequency condition (HF/LF) for a total of 288 trials. Participants were instructed to name the pictures as quickly and accurately as possible while ignoring the superimposed distractor word. They were also instructed not to correct themselves if they made an error. Stimuli were presented in the following sequence. A fixation point was shown for 250 msec, followed by a blank screen for 500 msec, and then the target-distractor pair was shown for 750 msec. The inter-trial interval was 3 seconds. Naming latencies were determined online with voice-key code implemented in the Cogent 2000 toolbox, and responses verified off-line using Audacity software (http://audacity.sourceforge.net) in case non-vocal noise triggered the voice key.

EEG Acquisition

The EEG was recorded from 128 Ag/AgCl pre-amplified electrodes using a BioSemi Active Two EEG system. The sampling rate was 1024 Hz. The vertical electrooculogram (EOG) was recorded by means of two surface electrodes just above and below the left eye, respectively. The horizontal EOG was recorded with two electrodes positioned over the two outer canthi.

Data Preprocessing

Behavioral preprocessing

Trials including incorrect or omitted naming responses and speech dysfluencies (e.g., hesitations, stuttering) were scored as errors and excluded from analyses due to their low rate (1.7%). Naming latencies faster than 350 ms and slower than 2000 ms were also excluded (1.9%).

EEG data pre-processing:

Five participants were excluded from the EEG signal-processing because their EEG signals had too many artefacts to permit useful analysis (their recordings had 50% or more of trials rejected). We report analyses performed on the remaining 15 participants (9 females, mean age 23.8 years, SD 3.6).

Channels C8, C32, D28, B1 and B9 were rejected from the data of the participants under analysis because the signal recorded at these channels contained too many artefacts in some of the participants.

Post-acquisition, the EEG data were filtered (high-pass = 0.16 Hz) and resampled at 256 Hz. Vertical eye movement artifacts were then corrected through Independent Component Analysis (ICA) as implemented in EEGLAB (Delorme and Makeig, 2004). For each subject, we manually determined the ICA component that best reflected eye blinks by looking at both their waveforms and their topographies. The waveforms of these components were compared to that of the raw EEG signal to match for eye blink location in time. We removed only the component which clearly captured the eye blinks and with a clear anterior and symmetrical topography.

Speaking induces large facial EMG activities that contaminate the EEG signal. To reduce the EMG artifacts induced by articulation, we used a Blind Source Separation algorithm based on Canonical Correlation Analysis (BSS-CCA, de Clercq et al.,2006) that separates sources based on their autocorrelation. The suitability of BSS-CCA for removing articulatory EMGbursts from EEG signal is described in detail in de Vos et al.(2010) and was used successfully to study monitoring-related components in Riès et al. (2011, 2013b). In the current study the BSS-CCA method was applied twice: First on non-overlapping consecutive windows of 30 seconds to target tonic EMG activity produced by continuous contraction of the facial or neck muscles; Second, on non-overlapping consecutive windows of 1.5 seconds (average RT = 775 ms, σ = 164 ms) enabling the targeting of local EMG bursts (this was done automatically using the EEGLAB plug-in Automatic Artifact Removal implemented by Gomez-Herrero available at http://www.cs.tut.fi/,gomezher/projects/eeg/software.htm#aar). EMG related components were selected according to their Power Spectral Density (PSD). As explained in de Vos et al.(2010), components were considered to be EMG activity if their average power in the EMG frequency band (approximated by 15–30 Hz) was at least 1/5 of the average power in the EEG frequency band (approximated by 0–15 Hz). The use of BSS-CCA was preferred over that of ICA for the separation of EEG sources from EMG sources based on previous investigations showing ICA could not separate these sources optimally and was less specific than BSS-CCA for this particular application (e.g., De Clercq et al. 2006, De Vos et al., 2010). The benefits of BSS-CCA are shown on the power spectra of the response-locked grand averages calculated over a large time-window (from 1000 ms before the vocal-onset to 500 ms post-vocal-onset, Figure S1). Before muscle artifact removal, the power spectra show a lot of high frequency activity across a broad frequency range. After BSS-CCA, this high frequency activity is clearly reduced. The topography of the difference in spectral content before versus after BSS-CCA shows the removed signal is mainly located at lateral frontal and temporal recording sites, where the muscular artifacts are most prominent.

After the BSS-CCA procedure, all remaining artifacts were manually rejected by a trial-by-trial visual inspection of the monopolar recordings. Particular attention was paid to small local artifacts to allow for the subsequent use of Laplacian transformation, which is more sensitive to local small artifacts than the more commonly used monopolar recordings. The remaining EEG recordings were averaged, individually, to stimulus presentation and to vocal onset. Laplacian transformation (i.e., current source density, C.S.D., estimation), as implemented in Brain Analyser TM (BrainProducts, Munich), was applied to each participant’s averages and on the grand averages as in Riès et al. (2011, 2013a, 2013b); (degree of spline: 3, Legendre polynomial: 15° maximum). Laplacian transformation has been shown to increase the spatial resolution of the signal providing a good estimation of the corticogram (Nuñez, 1981). Components therefore appear more focal after Laplacian transformation than on the more commonly-used monopolar recordings. We assumed a radius of 10 cm for the sphere representing the head. The resulting unit was μV/cm2. A 30-Hz low-pass and 1-Hz high-pass filters were applied off-line on the EEG data. For the purpose of cluster-based permutation testing, we also computed Laplacian transformation on the individual trials of each participant.

Analysis

Repeated measures analyses of variance (ANOVAs) were conducted with naming latencies as the dependent variable and distractor frequency as within-participants (F1) and within-items factors (F2). Errors were not subjected to analyses due to their low rate.

Three different types of analysis were performed on the EEG data to detect the differences between the ERPs for high frequency versus low frequency distractor words.

The first 2 types of analysis (referred to as whole-brain vs. region of interest, ROI) were performed using the Mass Univariate ERP toolbox (Groppe, Urbach and Kutas, 2011) to perform mass permutation tests on the Laplacian-transformed ERPs time-locked to stimulus and vocal onset. More precisely, we used repeated measure, two-tailed cluster mass permutation tests (Bullmore et al., 1999) and a family-wise alpha level of 0.05. The advantages of this type of test are that 1. cluster permutation tests are very good at detecting broad effects in time or in space, and 2., they are non-parametric and therefore allow for a straightforward way of correcting for multiple comparisons (Maris and Oostenveld, 2007). One disadvantage worth mentioning here is that this type of test is not very good at detecting short-lived effects, we return to this in the discussion.

  1. For whole-brain analyses, all 123 scalp electrodes were included in the tests. Time-locked to the stimulus, the test was performed on all time points between 0 and 500 ms (i.e., 15867 total comparisons) and any electrodes within approximately 5.44 cm of one another were considered spatial neighbors (assuming a 56 cm average head circumference). The baseline was the 200 ms time-window ranging from 200 ms before stimulus onset to stimulus onset. Repeated measures t-tests were performed for each comparison using the original data and 2500 random within-participant permutations of the data. For each permutation, all t-scores corresponding to uncorrected p-values of 0.05 or less were formed into clusters. The sum of the t-scores in each cluster is the “mass” of that cluster and the most extreme cluster mass in each of the 2501 sets of tests (derived from the 2500 permutations and from the real data) was recorded and used to estimate the distribution of the null hypothesis. We used 2500 permutations to estimate the distribution of the null hypothesis as it is over twice the number recommend by Manly (1997) for a family-wise alpha level of 0.05.

    Time-locked to the response, we performed the same analysis but on all time-points between 500 ms before vocal-onset and vocal-onset with a baseline corresponding to the 200 ms time-window between 1000 ms and 800 ms before vocal-onset. We also performed a whole-brain test on the 200 ms following vocal-onset with a baseline taken from 200 ms to 100 ms before vocal onset (i.e., 6396 comparisons).

  2. We also performed ROI-type of analyses time-locked to stimulus and vocal-onset (Figure 1). These tests were performed using the same time-windows and the same baseline corrections as the whole-brain analyses. Given the fMRI results reported by de Zubicaray et al. (2012) and our hypotheses concerning lexical-level processes and monitoring, we restricted our analysis to a fronto-central cluster of electrodes around the equivalent to FCz/Fz in a 64 electrode system (C20, C21, C22, C23, C24, C25, C12, C11), a left temporal cluster (D23, D22, D21, D26, D29, D30, D31, D24, D25) and a right temporal cluster (B26, B25, B24, B16, B15, B14, B13, B12, B11). We note the signal was not averaged over electrodes. The number of comparisons was greatly reduced in these analyses compared to whole-brain analyses (between 1152 and 416 comparisons).

  3. We focused more closely on the fronto-central component identified as the CRN and performed statistical analyses on the slope of the activity, and peak-to-peak amplitudes of Laplacian-transformed data, similarly as in previously reported studies (Ries et al., 2011, 2013a, 2013b). A negative peak could not be identified within the time-window centred around the latency of the peak on the grand-averages for 4 out of the 15 participants whose data were kept for analysis. We thus measured the surface below the curve on a 50 ms time-window around the latency of the peak as measured on the grand average in each participant. We also measure the surface below the curve on a 50 ms time-window around the latency of the preceding positive dip on the grand average in each participant. We then subtracted this surface measure from the one corresponding to the negative peak and it is this surface difference which we refer to as the peak-to-peak amplitude. This type of measurement was preferred over taking the real difference between two peaks to reduce the contribution of noise. Slopes were measured, by fitting a linear regression to the data, to attest for the statistical existence of the component by comparing it to zero. This measure was chosen because it is also independent from the baseline and it gives morphological information about the data (Carbonnell, Hasbroucq, Grapperon, and Vidal, 2004).

Figure 1:

Figure 1:

Biosemi 128-electrode system used in the present study. Electrodes included in the ROI-type of analysis are circled with the dotted black lines. Electrodes which were rejected from all analyses are blanked.

Results

Analysis of mean naming response times (RT) revealed a significant effect of distractor condition by both participants (F1 [1,19] = 15.02, MSE = 182.2, p < .001, partial η2 = .44) and items (F2 [1,47] = 8.9, MSE = 671.8, p <.005; partial η2 = .16). Pictures with LF words (mean 784 ms) were named on average 16 ms slower (95% confidence interval of difference = 9 ms) than pictures with HF distractors (mean 768 ms), replicating previous results (Catling et al., 2010; Experiment 1; de Zubicaray et al., 2012; Dhooge & Hartsuiker, 2010, 2013).

Stimulus-locked analyses:

The whole-brain analysis of the 500 ms following stimulus onset a revealed significant effect of distractor frequency at two right inferior frontal (C6 and C7) and one right temporal site (B25, Figure 2A). The Laplacian-transformed ERPs started to diverge at around 100 ms post-stimulus onset and the difference remained until around 300 ms post-stimulus (start: 86 ms, 116 ms, and 137 ms; finish: 332, 344 and 359 ms post-stimulus at C07, C06, and B25 respectively). Although the topographies of the difference-wave shows earlier left frontal and temporal foci, no significant difference was found at those sites (Figure 2B). We also note the difference is not clearly visible at B25 on the topographies although it can be seen on the waveforms (Figure 2C), this is due to the choice of baseline (−200 to −100 ms for the figures).1

Figure 2:

Figure 2:

Stimulus-locked effects of distractor frequency. A. Result of the cluster-based mass permutation test performed on all electrodes (the ROI-type of analysis revealed very similar results and is not displayed here). Significant differences between conditions are found at 3 recording sites (C6, C7 and B25) starting around 100 ms and ending around 300 ms. B: The topographies of the difference wave between 100 and 300 ms in 50 ms averages. The right frontal activity starts being visible 150 ms post-stimulus onset. C: Waveforms at the 3 electrodes showing significant effects. Low frequency distractor words are associated with more positive/less negative ERPs at these 3 sites.

Stimulus-locked ROI-type of analyses did not reveal any significant effect of distractor frequency. We note these ROIs did not include the electrodes showing significant effects in the whole-brain analysis (namely, C06, C07, and B25).

Response-locked analyses:

None of the response-locked whole-brain analyses revealed any significant effect of distractor frequency2. At the fronto-central ROI, there was an effect at electrode C21 (corresponding to Fz in the 10–20 system, Figure 3). This effect started 20 ms post-vocal onset and lasted until 121 ms post-vocal onset3. No other ROI-type of analysis revealed any significant effect of distractor frequency. We describe the effect on this fronto-central component in more detail below.

Figure 3:

Figure 3:

Fronto-central response-locked distractor frequency effect. For all measures, the baseline was taken from 200 to 100 ms before vocal onset. A: Results of the cluster-based mass permutation test on fronto-central electrodes. Significant differences between conditions are found at C21 starting 20 ms and ending 121 ms post vocal-onset. B: CRN for high and low frequency distractor words. C: Topography of the difference wave between 20 and 120 ms.

The fronto-central component started 121 ms before and peaked 125 ms after vocal onset on the grand averages, highly resembling the CRN (Figure 3B and 3C). Irrespective of distractor frequency condition, its slope was significantly different from zero on the 200 ms time-window centered on vocal onset (t[14] = −2.05, p <.05, a one-tailed Student t-test was used given the direction of the difference was expected based on previous reports by Vidal et al., 2000, 2003; Riès et al., 2011). The slope analysis also revealed a significant difference between high frequency and low frequency distractor words, where high frequency distractor words were associated with a steeper slope than low frequency distractor words on the time-window spanning from 100 before to 100 ms after vocal onset (t[14] = −2.43, p<0.05, two-tailed student t-test). The peak-to-peak amplitude measured between the negative peak and the preceding positive dip revealed high frequency distractor words were associated with a larger CRN than the low frequency distractor words (t[14] = 3.07, p<.01, two-tailed student t-test, see Figure 3C for topography of the difference wave following vocal onset).

Discussion

Using high-density EEG recordings, we contrasted input and output accounts of the distractor frequency effect in spoken word production. We replicated the distractor frequency effect in picture naming latencies using identical stimuli to previous studies (e.g., Catling et al., 2010; de Zubicaray et al., 2012). Our ERP results show significant effects of distractor frequency both time-locked to stimulus presentation and to vocal onset. Stimulus-locked effects started as early as 100 ms and lasted until around 300 ms post stimulus onset and were confined to right inferior frontal and temporal cortex recording sites. Response-locked distractor frequency effects were found closely following vocal-onset on a fronto-central component corresponding to the CRN. These results suggest that the distractor frequency effect is most likely associated with more than one physiological mechanism, and possibly reconcile rival input versus output accounts of this effect. We discuss the possible nature of these mechanisms below.

According to the competitive lexical selection – or input – account, the distractor frequency effect reflects either a differential recognition threshold for HF and LF words (e.g., Starreveld et al., 2013) or an attentional mechanism that implements reactive blocking during processing of distractors that potentially encompasses word recognition up to word form encoding (WEAVER++; Roelofs et al., 2011). The time window of the effects observed over right inferior frontal and temporal sites in the stimulus-locked analyses indicates the operation of relatively early mechanisms. A recent review of ERP studies of word recognition indicated word frequency related components are reported reliably in the 100 to 400 ms window over central, left and right hemisphere sites (see Laszlo & Federmeier, 2014). The effects observed here were predominantly right-lateralized. This lateralization could be interpreted in terms of the recruitment of right-hemisphere dominant attentional mechanisms that are known to operate during language tasks (e.g., Petersen & Posner, 2012; Roelofs, 2003; Vigneau et al., 2010). This would be in agreement with the input account postulating an early attentional mechanism involved in blocking the processing of the distractor word (Roelofs, 2005; Roelofs et al., 2011). Similarly, the effect at right inferior frontal sites could be considered consistent with the operation of an inhibitory control mechanism (e.g., Aron, Robbins & Poldrack, 2014) that would favour processing of the target picture over processing of the distractor word by reactively blocking the latter (e.g., Roelofs et al., 2011). Response inhibition is closely related to resistance to distractor interference (e.g., Friedman & Miyake, 2004). We note that differences between distractor frequency conditions were visible at other sites on the difference wave topographies (Figure 2B). Although these did not reach statistical significance, it is worth mentioning that the type of analysis we used may have missed those. Indeed, as mentioned in the methods section, cluster mass permutation tests are not very good at detecting short-lived effects. In addition, the large number of electrodes used in the whole-brain analysis yielded a large number of comparisons, which may have disadvantaged smaller effects. Some of the sites where differences could be seen on the difference waves were not included in the ROI-type of analyses (e.g., left inferior frontal sites), as these were not over brain regions identified as being sensitive to this effect in earlier studies and hence we did not have a priori reasons for targeting these regions (de Zubicaray et al., 2012). However, our point here is mainly to note the existence of early distractor frequency effects as these had not been observed previously and are in agreement with one of the hypotheses tested.

Dhooge and Hartsuiker (2013) predicted “effects of distractor frequency should be evident only after 350 ms” if the response exclusion account is correct (p. 233, emphasis added). However, the stimulus-locked activity observed in the 100–300 ms time window is clearly not consistent with this account. Could the finding of a negative going ERP sensitive to distractor frequency in the response-locked fronto-central ROI analysis be used to support an output account?

This effect was visible on the CRN which started to rise around 100 ms before vocal onset. This is in the time-window attributed to articulatory preparation according to meta-analyses (Indefrey and Levelt, 2004; Indefrey, 2011) and would thus support an output locus of the distractor frequency effect, within this theoretical framework. However, Hutson, Damian and Spalek (2013) have recently demonstrated a distractor frequency effect with manual classifications in two separate experiments, indicating the effect is unlikely to involve articulatory-motor programs, and might instead involve a relatively earlier, more abstract level of representation. As we noted in the Introduction, the fronto-central CRN has been observed in both psycholinguistic and non-psycholinguistic tasks, and is therefore interpreted as reflecting the operation of a domain general monitor (e.g., Acheson et al., 2012; Riès et al., 2011). In spoken word production, the ERN/CRN is likely to reflect monitoring of internal rather than external speech as it arises before the response is made (Riès et al., 2011, 2013b). As Hutson et al. (2013) note, there is considerable evidence that we are able to monitor our inner speech at a relatively abstract pre-articulatory level. Thus, a self-monitoring account of the distractor frequency effect might be plausible if a domain general monitor was assumed to operate on relatively abstract representations.

Thus, the input and output accounts may be reconcilable on the basis of our observations. We can speculate about how an early attentional or inhibition mechanism and speech monitoring might be linked. Monitoring is assumed to be always in place during speech production, meaning it is sensitive to the accuracy of each step of speech production (Dhooge & Hartsuiker, 2010). The normal function of the monitoring system(s) is to inspect internal and external speech for problems, and this function is assumed to operate relatively independently of the mechanism for lexical selection (e.g., Hartsuiker & Kolk, 2001; Levelt, 1989). Thus, the fronto-central CRN observed for the distractor frequency effect might reflect the ubiquitous operation of the monitor, checking the outcome of lexical selection, which may itself be facilitated by an early attentional blocking mechanism (e.g., Roelofs et al., 2011; Starreveld et al., 2013). Moreover, it is possible that speech monitoring is generally engaged more strongly in the more demanding condition, even though the early attentional blocking mechanism is usually able to block out the distractor most of the time. According to this interpretation, there might be more than one locus or physiological mechanism responsible for the distractor frequency effect. This interpretation is appealing as it has the potential of reconciling both input and output accounts (see also van Maanen, van Rijn, & Borst, 2009 for an account that assumes interference can be distributed over multiple stages of processing).

Before accepting the above interpretation, it is worth noting the consistencies and inconsistencies with prior neuroimaging and electrophysiological studies of the distractor frequency effect. Our findings of significant ERPs at right temporal and fronto-central sites are consistent with the results of a previous fMRI study that reported bilateral activity in these regions, and so provide complementary information about the time-courses of those effects (e.g., de Zubicaray et al., 2012). Despite the topographies of the difference-wave showing early left frontal and temporal foci, results at those sites did not reach statistical significance in the present study (see also limitations of the statistics used above). In addition, no right inferior frontal activity was observed in the earlier fMRI study. Although EEG and fMRI provide complementary information, it is not unusual to find effects present in one modality that are absent in the other, due to the differing nature of haemodynamic and electrophysiological signals (e.g., Geukes et al., 2013; Van Petten & Luka, 2006; Vartiainen et al., 2011).

We were unable to replicate the stimulus-locked effects reported by Dhooge and Hartsuiker (2013) in their recent lower-density EEG study with Dutch speaking participants, despite testing a subset of comparable electrode sites. The reason for this discrepancy is not immediately apparent, although could reflect differences in the way stimuli were constructed or distractors were presented across the studies. For example, the present study employed the English language stimuli created by Catling et al. (2010; and employed by de Zubicaray et al., 2012 in their fMRI study), replicating those results in terms of naming latencies. Catling et al.’s (2010) HF and LF distractor stimuli were carefully matched in terms of age of acquisition (AoA) amongst other lexical variables. Lexical frequency and AoA are usually highly confounded. This confound might influence ERP results, as frequency and AoA effects involve different neurophysiological mechanisms (see Catling et al., 2010; de Zubicaray et al., 2012; Hutson et al., 2013). In addition, distractor-target picture stimuli were presented for fixed durations of 750 ms in the present study, whereas the durations of Dhooge and Hartsuiker’s (2013) stimuli presentation were more variable, remaining on screen until the participant responded. Moreover, our study differed from the Dhooge and Hartsuiker (2013) in several methodological aspects of signal processing and analysis (including the eye-blink removal technique, articulation-related EMG artifacts removal, filtering, baseline-correction, and statistical tests used). These could also have influenced the results.

Finally, we would like to frame our results in the broader context of the cognitive architecture of language production, while acknowledging the task-specific nature of the mechanisms involved in the distractor frequency effect in PWI. The distractor frequency effect has been used to inform the process of lexical selection, a core decision mechanism in language production. Miozzo and Caramazza’s (2003) initial account of this effect was framed against the competitive account of lexical selection. Indeed, they hypothesized that if lexical selection was a competitive process between highly activated lexical representations, than the HF distractor words should compete more with the picture name than the LF distractor words as HF words are thought to be more highly activated than LF words. Instead, the slower naming latencies in the LF vs. HF distractor conditions were interpreted as inconsistent with the notion of competition at the level of lexical selection. Our results suggest the distractor frequency effect is associated firstly with an early attentional mechanism that preferentially blocks HF distractor words, as hypothesized by the WEAVER++ input account of this effect (e.g., Roelofs, Piai, and Schriefers, 2011). This attentional blocking mechanism operates within the time-window typically attributed to conceptual and lexical access. Secondly, the distractor frequency effect is also associated with a domain-general monitoring mechanism that verifies the performance of the early attentional selection mechanism. Note that this is a different monitoring mechanism to that proposed by Dhooge and Hartsuiker (2010, 2013). In their account of the distractor frequency effect, the self monitor checks the content of the output buffer, and initiates a time-consuming correction to purge the incorrect distractor response. This is an earlier process to that proposed here, which we envisage entails post-selection response verification consistent with the proposed operations of the CRN, rather than an error-detection system per se. As Dhooge and Hartsuiker (2013) themselves noted, after response selection has occurred “the only process left will be the checking of the picture’s response” before articulation is initiated (p. 233). Thus, our results point to the importance of monitoring processes at different stages of language production (see Postma, 2000 for a similar perspective). In addition, the observation of dual loci for the distractor frequency effect emphasizes the fact that multiple physiological mechanisms can be responsible for a given behavioural effect.

Conclusions

We tested rival input and output accounts of the distractor frequency effect in picture naming using both stimulus- and response-locked analyses of ERPs recorded with high-density EEG. According to input accounts, the locus of the effect should occur during processing that encompasses word recognition to form encoding, and thus be completed within the initial 400 ms following stimulus presentation. By contrast, the output account assumes a later locus during processing of articulatory representations, potentially reflecting the involvement of the self-monitoring system. Our results indicate that the distractor frequency effect most likely reflects the operation of more than one physiological mechanism. We argue that these mechanisms involve early attentional processes in addition to domain general monitoring of relatively abstract, pre-articulatory representations. If correct, this account has the potential to reconcile input and output accounts of the distractor frequency effect and point to the importance of domain-general cognitive control processes in language production.

Supplementary Material

Figure S1

Frequency content of the response-locked averages (from 1000 ms pre-vocal onset to 500 ms post vocal onset) after Laplacian transformation and baseline correction (baseline taken from 200 to 100 ms before vocal onset) before and after BSS-CCA. A. Topography of the difference in frequency power in before versus after BSS-CCA between 3.5 and 40 Hz. Lateral frontal and temporal sites contain the most difference in frequency power, these are also the sites where muscular artifacts are most prominent. B. Frequency spectra before and after BSS-CCA at the 4 electrodes showing an effect of distractor frequency. Again, the reduction is clear at lateral sites (C06, C07, and B25) but less clear at the frontal central site C21, where articulation-related EMG artifacts are less prominent (note the scale is 10 times larger for C21 than for the other electrodes).

Acknowledgments

This research was supported by a post-doctoral grant from the National Institute on Deafness and Other Communication Disorders of the National Institutes of Health under Award Number F32DC013245 to S.K.R., a University of Queensland Research Foundation grant to GZ. GZ is supported by an Australian Research Council (ARC) Future Fellowship (FT0991634). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. We are grateful to Chase Sherwell for his assistance with data acquisition and scoring, to Jonathan Mustri for his assistance with data pre-treatment, and to Vitória Piai for helpful discussion.

Footnotes

Conflict of Interest statement

We declare that the research reported here was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

1

None of these effects were observed if articulation-related EMG artifacts were not removed (i.e., before BSS-CCA). ERPs were not significantly different in one condition from the other (alpha=0.050000) at any time point/window analyzed (all p-values >= 0.412800). This underlines the impact of articulation-related EMG artifacts already at this early stimulus-locked time-window. We note there were also no stimulus-locked effect in the ROI analyses (Fronto-central ROI: all p-values>=0.847200; left temporal ROI: all p-values>=0.264000; right temporal ROI: all p-values>=0.220800).

2

The reason why the response-locked whole-brain-type of analysis did not reveal any effects whereas the ROI-type did could be linked to the fact response-locked average are often more noisy than stimulus-locked averages. This is because the detection of the voice onset, which constitutes the time-locking event response-locked, is more variable than stimulus onset.

3

This effect was also not present before BSS-CCA (all p-values>=0.418400). We note there was also no effect response-locked in the whole-brain analysis (all p-values>=0.928000) and in the other ROI analyses response-locked (left temporal ROI: all p-values>=0.117600; right temporal ROI: all p-values>=0.229600).

References

  1. Acheson DJ, Ganushchak LY, Christoffels IK, and Hagoort P (2012). Conflict monitoring in speech production: physiological evidence from bilingual picture naming. Brain and Language, 123, 131–136. [DOI] [PubMed] [Google Scholar]
  2. Acheson DJ, & Hagoort P (2014). Twisting tongues to test for conflict monitoring in speech production. Frontiers in Human Neuroscience, 8: 206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Aron AR, Robbins TW, & Poldrack RA (2014). Inhibition and the right inferior frontal cortex: one decade on. Trends in Cognitive Sciences, 18, 177–185. [DOI] [PubMed] [Google Scholar]
  4. Bonini F, Burle B, Liégeois-Chauvel C, Régis J, Chauvel P, & Vidal F (2014). Action monitoring and medial frontal cortex: leading role of supplementary motor area. Science, 343(6173): 888–91. doi: 10.1126/science.1247412. [DOI] [PubMed] [Google Scholar]
  5. Bullmore ET, Suckling J, Overmeyer S, Rabe-Hesketh S, Taylor E, & Brammer MJ (1999). Global, voxel, and cluster tests, by theory and permutation, for a difference between two groups of structural MR images of the brain. IEEE Transactions on Medical Imaging, 18(1), 32–42. [DOI] [PubMed] [Google Scholar]
  6. Caramazza A (1997). How many levels of processing are there in lexical access? Cognitive Neuropsychology, 14(1), 177–208. [Google Scholar]
  7. Carbonnell L, Hasbroucq T, Grapperon J, & Vidal F (2004). Response selection and motor areas: A behavioral and electrophysiological study. Clinical Neurophysiology, 115, 2164–2174. [DOI] [PubMed] [Google Scholar]
  8. Catling JC, Dent K, Johnston RA, & Balding R (2010). Age of acquisition, word frequency, and picture–word interference. The Quarterly Journal of Experimental Psychology, 63(7), 1304–1317. [DOI] [PubMed] [Google Scholar]
  9. Cattell IM (1885). Über die Zeit der Erkennung und Benennung von Schriftzeichen, Bildern un Farben [The time it takes to recognize and name letters, pictures, and colors]. Philosophische Studien, 2, 6354650 Cited in Fraisse, P. (1969). Why naming is longer than reading? Acta Psychologica, 30, 96–103. [Google Scholar]
  10. Debener S, Ullsperger M, Siegel M, Fiehler K, von Cramon Y, & Engel AK (2005). Trial-by-trial coupling of concurrent EEG and fMRI identifies the dynamics of performance monitoring. Journal of Neuroscience, 25, 11730–11737. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. de Clercq W, Vergult A, Vanrumste B, Van Paesschen W, & Van Huffel S (2006). Canonical correlation analysis applied to remove muscle artifacts from the electroencephalogram. IEEE Transactions on Biomedical Engineering, 53, 2583–2587. [DOI] [PubMed] [Google Scholar]
  12. Dehaene S, Posner MI, & Tucker DM (1994). Localization of a neural system for error detection and compensation. Psychological Science, 5, 303–305. [Google Scholar]
  13. Dell GS (1986). A spreading-activation theory of retrieval in sentence production. Psychological Review, 93, 283–321. [PubMed] [Google Scholar]
  14. Delorme A, & Makeig S (2004). EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. Journal of neuroscience methods, 134(1), 9–21. [DOI] [PubMed] [Google Scholar]
  15. de Vos M, Riès S, Vanderperren K, Vanrumste B, Alario F-X, Van Huffel S, & Burle B (2010).Removal of muscle artifacts from EEG recordings of spoken language production. Neuroinformatics 8, 135–150. [DOI] [PubMed] [Google Scholar]
  16. de Zubicaray GI (2006). Cognitive neuroimaging: cognitive science out of the armchair. Brain and Cognition, 60(3), 272–281. doi: 10.1016/j.bandc.2005.11.008 [DOI] [PubMed] [Google Scholar]
  17. de Zubicaray GI, Miozzo M, Johnson K, Schiller NO, & McMahon KL (2012). Independent distractor frequency and age-of-acquisition effects in picture-word interference: fMRI evidence for post lexical and lexical accounts according to distractor type. Journal of Cognitive Neuroscience, 24, 482–495. [DOI] [PubMed] [Google Scholar]
  18. Dhooge E, & Hartsuiker RJ (2010). The distractor frequency effect in picture-word interference: Evidence for response exclusion. Journal of Experimental Psychology: Learning, Memory, and Cognition, 36, 878–891. [DOI] [PubMed] [Google Scholar]
  19. Dhooge E, De Baene W, & Hartsuiker RJ (2013). A late locus of the distractor frequency effect in picture–word interference: Evidence from event-related potentials. Brain and Language, 124, 232–237. [DOI] [PubMed] [Google Scholar]
  20. Finkbeiner M, & Caramazza A (2006). Now you see it, now you don’t: On turning semantic interference into facilitation in a Stroop-like task. Cortex, 42, 790–796. [DOI] [PubMed] [Google Scholar]
  21. Friedman NP, & Miyake A (2004). The relations among inhibition and interference cognitive functions: A latent variable analysis. Journal of Experimental Psychology: General, 133, 101–135. [DOI] [PubMed] [Google Scholar]
  22. Ganushchak LY, Christoffels IK, & Schiller NO (2011). The use of electroencephalography in language production research: A review. Frontiers in Psychology, 2(208), 1–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Geng J, Schnur TT, & Janssen N (2013). Relative speed of processing affects interference in Stroop and picture–word interference paradigms: evidence from the distractor frequency effect. Language and Cognitive Processes, DOI: 10.1080/01690965.2013.846473. [DOI] [Google Scholar]
  24. Geukes S, Huster RJ, Wollbrink A, Junghöfer M, Zwitserlood P, & Dobel C (2013). A large N400 but no BOLD effect--comparing source activations of semantic priming in simultaneous EEG-fMRI. PLoS One 8(12):e84029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Goldrick M (2007). Connectionist principles in theories of speech production In Gaskell MG (Ed.), The Oxford handbook of psycholinguistics (pp. 515–530). Oxford, U.K.: Oxford University Press. [Google Scholar]
  26. Groppe DM, Urbach TP, Kutas M (2011) Mass univariate analysis of event-related brain potentials/fields I: A critical tutorial review. Psychophysiology. 48(12) pp. 1711–1725, DOI: 10.1111/j.1469-8986.2011.01273.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Hartsuiker RJ, & Kolk HHJ (2001). Error monitoring in speech production: A computational test of the perceptual loop theory. Cognitive Psychology, 42, 113–157. [DOI] [PubMed] [Google Scholar]
  28. Hutson J, & Damian MF (2013). Distractor frequency effects in picture-word interference tasks with vocal and manual responses. Language and Cognitive Processes, 28, 615–632. [Google Scholar]
  29. Indefrey P (2011). The spatial and temporal signatures of word production components: A critical update. Frontiers in Psychology, 2, 255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Laszlo S, Federmeier KD (2014) Never seem to find the time: evaluating the physiological time course of visual word recognition with regression analysis of single-item event-related potentials. Language, Cognition and Neuroscience, 29, 5642–661. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Levelt WJM (1989). Speaking: From intention to articulation. Cambridge, MA: MIT Press. [Google Scholar]
  32. Levelt WJ, Roelofs A, & Meyer AS (1999). A theory of lexical access in speech production. Behavioral and Brain Sciences, 22(1), 1–38; discussion 38–75. [DOI] [PubMed] [Google Scholar]
  33. Levelt WJM (2012). A History of Psycholinguistics: The Pre-Chomskyan Era. Oxford, UK: Oxford University Press. [Google Scholar]
  34. Mahon BZ, Costa A, Peterson R, Vargas KA, & Caramazza A (2007). Lexical selection is not by competition. Journal of Experimental Psychology: Learning, Memory, and Cognition, 33, 503–533. [DOI] [PubMed] [Google Scholar]
  35. Manly BFJ (1997). Randomization, Bootstrap, and Monte Carlo Methods in Biology (2nd ed.). London: Chapman & Hall. [Google Scholar]
  36. Maris E, & Oostenveld R (2007). Nonparametric statistical testing of EEG- and MEG-data. Journal of Neuroscience Methods, 164(1), 177–190. [DOI] [PubMed] [Google Scholar]
  37. Miozzo M, & Caramazza A (2003). When more is less: A counterintuitive effect of distractor frequency in the picture–word interference paradigm. Journal of Experimental Psychology: General, 132, 228–252. [DOI] [PubMed] [Google Scholar]
  38. Nozari N, Dell GS, & Schwartz MF (2011). Is comprehension necessary for error detection? A conflict-based account of monitoring in speech production. Cognitive Psychology, 63, 1–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Nuñez P (1981). Electric fields of the brain. Oxford, UK: Oxford University Press. [Google Scholar]
  40. Petersen SE, Posner MI (2012) The attention system of the human brain: 20 years after. Annual Review of Neuroscience, 35, 73–89. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Postma A (2000). Detection of errors during speech production: a review of speech monitoring models. Cognition 77, 97–131. doi: 10.1016/S0010-0277(00)00090-1 [DOI] [PubMed] [Google Scholar]
  42. Riès S, Janssen N, Dufau S, Alario F-X, and Burle B (2011). General purpose monitoring during speech production. J. Cognitive Neurosci. 23(6): 1419–1436. 10.1162/jocn.2010.21467 [DOI] [PubMed] [Google Scholar]
  43. Riès S, Janssen N, Alario F-X, and Burle B (2013a). Response-locked brain dynamics of word production. Plos One, 8:e58197.doi: 10.1371/journal.pone.0058197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Riès S, Xie K, Haaland K, Dronkers N and Knight RT (2013b). Role of the lateral prefrontal cortex in speech monitoring. Frontiers in Human Neuroscience. doi: 10.3389/fnhum.2013.00703 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Roelofs A (2003). Goal-referenced selection of verbal action: modelling attentional control in the Stroop task. Psychological Review, 110 (1), 88–125. [DOI] [PubMed] [Google Scholar]
  46. Roelofs A (2005). From Popper to Lakatos: A case for cumulative computational modeling In Cutler A (Ed.), Twenty-first century psycholinguistics: Four cornerstones (pp. 313–330). Hillsdale, NJ: LEA. [Google Scholar]
  47. Roelofs A, Piai V, & Schriefers H (2011). Selective attention and distractor frequency in naming performance: Comment on Dhooge and Hartsuiker (2010). Journal of Experimental Psychology: Learning, Memory, and Cognition, 37, 1032–1038. [DOI] [PubMed] [Google Scholar]
  48. Roger C, Bénar C, Vidal F, Hasbroucq T, & Burle B (2010). Rostral cingulate zone and correct response monitoring: ICA and source localization evidences for the unicity of correct- and error-negativities. Neuroimage, 51(1): 391–403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Rosinski RR (1977). Picture-Word Interference Is Semantically Based. Child Development, 48(2), 643–647. doi: 10.2307/1128667 [DOI] [Google Scholar]
  50. Snodgrass JG, & Vanderwart M (1980). A standardized set of 260 pictures: norms for name agreement, image agreement, familiarity, and visual complexity. Journal of experimental psychology: Human learning and memory, 6(2), 174. [DOI] [PubMed] [Google Scholar]
  51. Starreveld PA, & La Heij W (1996). Time-course analysis of semantic and orthographic context effects in picture naming. Journal of Experimental Psychology: Learning, Memory, and Cognition, 22(4), 896–918. [Google Scholar]
  52. Starreveld PA, La Heij W, & Verdonschot R (2013). Time course analysis of the effects of distractor frequency and categorical relatedness in picture naming: An evaluation of the response exclusion account. Language and Cognitive Processes, 28, 633–654. [Google Scholar]
  53. Strijkers K, & Costa A (2011). Riding the lexical speedway: A critical review on the time course of lexical access in speech production. Frontiers in Psychology, 2, 356. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. van Maanen L, van Rijn H, & Borst JP (2009). Stroop and picture-word interference are two sides of the same coin. Psychonomic Bulletin & Review, 16, 987–999. doi: 10.3758/PBR.16.6.987 [DOI] [PubMed] [Google Scholar]
  55. Van Petten C, & Luka B (2006). Neural localization of semantic context effects in electromagnetic and hemodynamic studies. Brain and Language, 97, 279–293. [DOI] [PubMed] [Google Scholar]
  56. Vartiainen J, Liljeström M, Koskinen M, Renvall H, & Salmelin R (2011). Functional magnetic resonance imaging blood oxygenation level-dependent signal and magnetoencephalography evoked responses yield different neural functionality in reading. J Neurosci 31: 1048–1058 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Vidal F, Hasbroucq T, Grapperon J, and Bonnet M (2000). Is the “error negativity” specific to errors? Biol. Psychol 51(2–3): 109–128. 10.1016/S0301-0511(99)00032-0 [DOI] [PubMed] [Google Scholar]
  58. Vidal F, Burle B, Bonnet M, Grapperon J, and Hasbroucq T (2003). Error negativity on correct trials: a reexamination of available data. Biol. Psychol 64(3): 265–282. 10.1016/S0301-0511(03)00097-8 [DOI] [PubMed] [Google Scholar]
  59. Vigneau M, Beaucousin V, Hervé P-Y, Jobard G, Petit L, Crivello Fabrice, Mellet E, et al. (2011). What is right-hemisphere contribution to phonological, lexico-semantic, and sentence processing? Insights from a meta-analysis. NeuroImage, 54, 577–593. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1

Frequency content of the response-locked averages (from 1000 ms pre-vocal onset to 500 ms post vocal onset) after Laplacian transformation and baseline correction (baseline taken from 200 to 100 ms before vocal onset) before and after BSS-CCA. A. Topography of the difference in frequency power in before versus after BSS-CCA between 3.5 and 40 Hz. Lateral frontal and temporal sites contain the most difference in frequency power, these are also the sites where muscular artifacts are most prominent. B. Frequency spectra before and after BSS-CCA at the 4 electrodes showing an effect of distractor frequency. Again, the reduction is clear at lateral sites (C06, C07, and B25) but less clear at the frontal central site C21, where articulation-related EMG artifacts are less prominent (note the scale is 10 times larger for C21 than for the other electrodes).

RESOURCES