Distinguishing lexical- versus discourse-level processing using event-related potentials

Yi Ting Huang; Joseph Hopfinger; Peter C Gordon

doi:10.3758/s13421-013-0356-z

. Author manuscript; available in PMC: 2015 Feb 1.

Published in final edited form as: Mem Cognit. 2014 Feb;42(2):275–291. doi: 10.3758/s13421-013-0356-z

Distinguishing lexical- versus discourse-level processing using event-related potentials

Yi Ting Huang ¹, Joseph Hopfinger ², Peter C Gordon ²

PMCID: PMC3968230 NIHMSID: NIHMS531444 PMID: 24122362

Abstract

Two experiments examine the links between neural patterns in EEG (e.g., N400s, P600s) and their corresponding cognitive processes (e.g., lexical access, discourse integration) by varying the lexical and syntactic contexts of co-referential expressions. Experiment 1 examined coreferring expressions when they occurred within the same clause as their antecedents (John/Bill warmly dressed John). Experiment 2 examined between-clause co-referencing with expressions that also varied in lexical frequency (John/Weston went to the store so that John/Weston could buy milk). Evidence of facilitated lexical processing occurred after repeated names, which elicited smaller N400s, as compared with new names. N400s were also attenuated to a greater degree for low-frequency expressions than for high-frequency ones. Repeated names also triggered evidence of postlexical processing, but this emerged as larger P600s for within-clause co-referencing and delayed N400s for between-clause co-referencing. Together, these results suggest that linguistic processes can be distinguished through distinct ERP components or distinct temporal patterns.

Keywords: Event-related potentials, Neurolinguistics, Co-reference, N400s, P600s

Introduction

Language comprehension is often characterized as rapid, incremental, and opportunistic—exploiting multiple cues from various sources to resolve ambiguity and make predictions about upcoming material (MacDonald, Pearlmutter, & Seidenberg, 1994; Tanenhaus, Spivey-Knowlton, Eberhard, & Sedivy, 1995). Yet there remains debate about how these cues are processed during real-time interpretation. One class of hypotheses argues that inputs are mapped onto separable levels of representations and that processes at preceding levels partially constrain computations at subsequent levels (Ferreira & Patson, 2007; Huang & Gordon, 2011; Tily, Federenko, & Gibson, 2010). These accounts maintain that architectural features of the cognitive system limit the concurrent processing of all information, resulting in differential time courses for different types of linguistic procedures. For example, since lexical processes logically precede discourse processes, one would expect evidence of the former to emerge prior to the latter (Huang & Snedeker, 2009a, 2009b, 2011; Ledoux, Gordon, Camblin, & Swaab, 2007). A contrasting class of hypotheses suggests that all relevant inputs are recruited by an all-purpose parser that constrains comprehension in an optimal and exhaustive manner (Grodner, Klein, Carbary, & Tanenhaus, 2010; Jurafsky, 1996; Levy, 2008). These accounts contend that any available information that can inform sentence interpretation will be used to affect processing in its earliest moments. The type of massive interactivity suggests that time course information is an unreliable indicator of underlying linguistic processes.

Much of the work motivating these theories has come from behavioral measures. Thus, the challenge remains for the field to develop a cognitive theory of language comprehension that is also consistent with evidence from neural research. Neural measures, such as event-related potentials (ERPs), have provided valuable assessments of language comprehension precisely because they provide fine-grained time course information that is sensitive to different language processes. For example, one prominent component has been the N400, a negative polarity deflection in the ERP waveform that peaks approximately 400 ms after stimulus onset. Since this component is triggered by factors affecting word recognition, such as frequency (Allen, Badecker, & Osterhout, 2003; Van Petten & Kutas, 1990), repetition (Ledoux et al., 2007; Rugg & Nagy, 1987), predictability (Federmeier & Kutas, 1999; Kutas & Hillyard, 1984), and semantic relatedness (Kutas & Hillyard, 1980; Rugg, 1985), it has traditionally been thought to index lexical-level processes. In contrast, the P600 is a positive-going component that peaks approximately 600 ms after stimulus onset. This component is triggered by mismatches in word order (Hagoort, Brown, & Groothusen, 1993; Osterhout, 1997) and gender/number morphology (Osterhout & Holcomb, 1992; Osterhout & Mobley, 1995), and thus it has traditionally been linked to syntactic processes.

Despite the appealing clarity of this division between ERP components, recent studies have uncovered notable patterns that challenge this straightforward interpretation. For example, Swaab, Camblin, and Gordon (2004; Ledoux et al., 2007) recorded ERPs to sentences like (1). Here, critical words were either new names (John following Bill) or repeated names (John following John) that were preceded by either prominent (John) or nonprominent antecedents (John and Neil).

1. PROMINENT/REPEATED: John left work after John completed the project
2. PROMINENT/NEW: Bill left work after John completed the project
3. NONPROMINENT/REPEATED: John and Neil left work after John completed the project
4. NONPROMINENT/NEW: Bill and Neil left work after John completed the project

Gordon, Swaab, and colleagues found that following nonprominent antecedents, repeated names yielded reduced N400s, as compared with new names (1c vs. 1d) (Ledoux et al., 2007; Swaab et al., 2004). This repetition priming demonstrates that prior exposure to a word facilitates subsequent processing of the same word (Liversedge et al., 2003; Raney, Therriault & Minkoff, 2000; 2003; Rugg & Nagy, 1987; Traxler, Foss, Seely, Kaup, & Morris, 2000) and is consistent with traditional interpretations of the N400 as an index of lexical access (Brown & Hagoort, 1993; Chwilla, Brown, & Hagoort, 1995; Holcomb, 1993; Lau, Phillips, & Poeppel, 2008; Rugg, Furda, & Lorist, 1988; Van Berkum, Hagoort, & Brown, 1999; Van Petten & Kutas, 1991;). However, in the same studies, repeated names also generated greater N400s when they co-referenced prominent antecedents, relative to nonprominent ones (1a vs. 1c). This repeated-name penalty demonstrates that the use of an overinformative, repeated expression interferes with discourse integration (Almor, 1999; Gordon & Hendrick, 1998).

Altogether, these findings are notable for two reasons. First, the same repeated expression generated N400 responses at both the lexical and discourse levels, with no apparent difference in the time course of the processes that generated the effects. This is at odds with behavioral evidence from reading-time studies, which demonstrated that discourse processes indexed by the repeated-name penalty occur well after the word recognition processes indexed by repetition priming (Ledoux et al., 2007). Second, when lexical and discourse effects were compared head-to-head in the prominent sentences, there were no clear differences in the ERP components generated by repeated and new names (1a vs. 1b). The absence of an N400 difference in this comparison raises questions about the exact relationship between the lexical and discourse processes in comprehension. Are linguistic inputs processed via distinct levels of representations (e.g., lexical, discourse), or are they analyzed via a single level of representation (e.g., meaning)?

Similar puzzles have emerged in studies on the interface between lexical semantics and syntax. Kuperberg et al. (2003; Kuperberg, Kreher, Sitnikova, Caplan, & Holcomb, 2007) examined sentences like (2), which varied the semantic relatedness (related vs. unrelated) and presence of a thematic-role violation (no violation vs. violation) expressed in the relationship between preceding subject nouns (boys/eggs) and subsequent verbs (eat/plant).

(2)
1. RELATED/NO VIOLATION: For breakfast the boys would eat toast and jam
2. UNRELATED/NO VIOLATION: For breakfast the boys would plant flowers in the garden
3. RELATED/VIOLATION: For breakfast the eggs would eat toast and jam
4. UNRELATED/VIOLATION: For breakfast the eggs would plant flowers in the garden

They found a greater N400 for unrelated, as compared with related, verbs when there was no thematic role violation (2b vs. 2a) (Kim & Osterhout, 2005; Kuperberg et al., 2003; Kuperberg et al., 2007). This semantic relatedness effect is consistent with accounts of the N400 as reflecting either lexical access (Kutas & Hillyard, 1980, 1984; Rugg, 1985) or postlexical integration (Brown & Hagoort, 1993; Ledoux et al., 2007; Swaab, Camblin, & Gordon, 2004). However, Kuperberg et al. (2007) also found that thematic role violations caused a P600 effect but no N400 differences regardless of whether the subject noun was semantically related to the verb (2c and 2d vs. 2a). These semantic P600s are surprising for two reasons. First, none of the sentences in (2) violated the syntactic dimensions typically associated with P600s—for example, nonconventional word order, agreement errors, or morphological mismatches (Hagoort et al., 1993; Osterhout & Holcomb, 1992). Second, the absence of a larger N400 response in sentences featuring both unrelated words and thematic role violations is puzzling, since prior research has shown evidence of both components within a single sentence (Osterhout & Nichols, 1999). Both patterns appear to be at odds with work showing that N400s and P600s are moderated by different aspects of language. One possibility is that these effects provide evidence in favor of a massively interactive comprehension system. For example, Kim and Osterhout (2005) contended that the semantic P600s demonstrate that semantic and syntactic interpretations are processed in parallel and that the former can influence the latter when they are sufficiently robust. Similarly, Kuperberg and colleagues (2007) suggested that the failure to find both N400s and P600s in the unrelated/violation sentences reflects the canceling of semantic integration in the presence of syntactic violations. Accounts such as these have been prominent in the neurolinguistics literature (Hagoort, Hald, Bastiaansen, & Petersson, 2004; Nieuwland & Van Berkum, 2006a, 2006b; Van Berkum, Brown, Zwitserlood, Kooijman, & Hagoort, 2005; Van Berkum, Zwisterlood, Hagoort, & Brown, 2003).

To summarize, previous studies have suggested clear divisions between neural patterns (e.g., N400s, P600s) and their corresponding cognitive processes (e.g., lexical processing, postlexical integration, syntactic processing). However, recent work from two different linguistic domains raises questions about these traditional mappings (Kim & Osterhout, 2005; Kuperberg et al., 2003; Kuperberg et al., 2007; Ledoux et al., 2007; Swaab et al., 2004). Thus, it remains unclear how the comprehension system can generate both distinct neural and temporal patterns for some linguistic phenomena but conflated patterns for others. One possibility is that these recent results provide definitive evidence in favor of a single level of processing. This type of massive interactivity across all relevant inputs suggests no principled distinction between lexical and postlexical effects. However, a second possibility is that levels of language processing function in separable ways, and interactions are limited to operations at the interface of related linguistic representations. For example, the use of proper names by Gordon, Swaab, and colleagues may offer an exceptional case where referring expressions are uniquely linked to discourse representations (Kripke, 1980). Similarly, the study of thematic role assignments by Kuperberg and colleagues highlight a case where animacy cues from lexical semantics are highly correlated with syntactic categories (Jackendoff, 1972; Ladusaw & Dowty, 1988).

To distinguish these possibilities, the present study examines whether highly correlated lexical and discourse processes can still be distinguished during comprehension, through the manipulation of syntactic and lexical factors. Experiment 1 includes a critical new manipulation of syntactic context, contrasting critical expressions that occur within a clause (e.g., John warmly dressed John) with those that occur between clauses (e.g., John left work after John completed the project). Experiment 2 focuses on a lexical factor, contrasting high-frequency names (e.g., John) with low-frequency ones (e.g., Earl). The goal of both experiments is to assess the neural and temporal patterns associated with discourse-level interpretations of co-referential expressions by comparing them with the reduced N400 response following repeated names, as compared with new names (Ledoux et al., 2007; Swaab et al., 2004). Repetition priming of this kind provides a useful benchmark of lexical processing with which the timing of discourse processing can be compared (Huang & Gordon, 2011). If linguistic inputs are analyzed via separable levels of interpretation, lexical and postlexical processes should correspond to distinct neural and/or temporal patterns. However, if inputs are analyzed via a single level of interpretation, overlapping patterns should continue to persist.

Experiment 1

Experiment 1 examined ERPs to a target referential expression (e.g., John in 3a and 3d) as a function of an earlier referential expression, which we will call Noun1. Noun1 could be a simple noun phrase (NP), in which case its referent was prominent (e.g., 3a and 3b), or it could be embedded as the possessor within a complex possessive NP, in which case its referent was nonprominent (e.g., 3c and 3d). This characterization of the relationship between referential prominence and syntactic structure follows the analysis of Gordon and Hendrick (1998) in which referential prominence was defined as inversely related to syntactic embedding. In addition, Noun1 could be the same as the target (the repeated condition; e.g., 3a and 3c), or it could differ from the target (the new condition; e.g., 3b and 3d).

(3)
1. PROMINENT/REPEATED: Yesterday John warmly dressed John before school
2. PROMINENT/NEW: Yesterday Bill warmly dressed John before school
3. NONPROMINENT/REPEATED: Yesterday John’s mother warmly dressed John before …
4. NONPROMINENT/NEW: Yesterday Bill’s mother warmly dressed John before school

Note that while a repeated expression can felicitously co-refer with a nonprominent antecedent (e.g., 3c), it cannot with a prominent one (e.g., 3a). Instead, a reflexive pronoun (himself) is required for co-reference (or anaphora) in such cases where the two expressions are in the same clause (Chomsky, 1981; Gordon & Hendrick, 1997).

As was discussed above, previous studies have shown that repeated expressions that affect comprehension at both the lexical and discourse levels generate N400 responses that are indistinguishable (Ledoux et al., 2007; Swaab et al., 2004). Contrary to previous behavioral research, this suggests that there are no differences in the neural or temporal patterns associated with language processing at these two different levels. However, another possibility is that lexical and discourse processes are, in fact, distinct components of comprehension but evidence of these separate generators is obscured in situations where there is complete overlap in the ERP components. However, recent work suggests that it may be possible to eliminate this overlap through a manipulation of syntactic context. Gordon, Kacinik, and Swaab (2013) found that when repetition occurred within a single clause, a prominent antecedent (3a) generated a greater P600, as compared with a nonprominent one (3c). This effect is consistent with what has been found following reflexives that do not match their prominent antecedents (Osterhout & Mobley, 1995; e.g., herself rather than himself following a stereotypically male name).

Critically, the presence of a P600 (instead of an N400) during discourse processing allows us to distinguish between different accounts of the relationship between lower-level lexical processes and higher-level discourse effects. Since the interpretation of repeated expressions triggers distinct components across different processes, a direct comparison of their neural responses within a sentence will assess whether lexical effects can co-occur with discourse effects or whether the presence of one necessarily cancels the other (Kuperberg et al., 2003; Kuperberg et al., 2007; Ledoux et al., 2007). Thus, unlike Gordon and colleagues (2013), the present study also manipulates whether prominent antecedents are co-referenced through repeated or new names (3a vs. 3b). If lexical and discourse processes are separable during comprehension, we would expect evidence of both N400 and P600 responses following a repeated expression, as compared with a new one. However, if these processes are largely overlapping, the presence of a syntactic violation may lead to a top-down cancelation of lexical processes (Kim & Osterhout, 2005; Kuperberg et al., 2007). If this were the case, we would find evidence of a P600 response, but not an N400 response.

Method

Participants

Sixteen right-handed adults participated in this study. They were recruited from the university population at the University of North Carolina at Chapel Hill and were compensated $25 for their participation. All participants were native English speakers, and none had any history of neurological impairment. Written consent was obtained from each participant prior to beginning the study.

Procedure

Participants sat in an arm chair inside a dimly lit room that was electrically shielded and sound attenuated. A computer screen was placed in front of the participants, approximately 55 cm away from their eyes. An eyetracker was placed below the screen and was used to monitor participants’ blinking and eye movements throughout the study. At the beginning of the study, the experimenter told participants that they would be asked to judge whether the sentence sounded “good” or “bad.” They were told to base these judgments on their intuitions of how they imagined most people would speak, rather than on any prescribed notions of what is proper or correct. On each trial, the words for the sentence would appear one at a time. The experimenter emphasized that it was important for participants to refrain from moving or blinking during the presentation of the sentence. Responses to the grammaticality judgments could be made by pressing one of two buttons on a video game consol. After their response, participants were given the opportunity to blink and rest their eyes. When they were ready to proceed, they could press any button to continue onto the next trial.

Each trial began with a fixation cross that appeared at the center of the screen for 1,000 ms. This cross alerted participants to the beginning of the trial and also marked the location of the subsequent words of the sentence. These words appeared in rapid serial visual presentation (RSVP) with an on-screen duration of 300 ms per word and an interstimulus interval of 200 ms. They were presented against a black background in 70-point white Tahoma font. Unlike the words for the sentences, the words for the judgment question (“Good or bad?”) appeared on the screen simultaneously, after the completion of the sentence, and remained there until a response was made.

Materials

The materials for the four critical trial types follow the example in sentence (3) and represent the cells of a 2 × 2 design. The first factor, prominence, contrasts the use of a prominent, singular NP (John) versus a nonprominent, possessive NP (John’s brother) as antecedents. The second factor, repetition, contrasts the repetition of a previously mentioned name (John … John) versus the introduction of a new one (Bill … John) as co-referring expressions. Both factors were varied within subjects.

The sentence frames were adopted from Gordon et al. (2013). These frames included locative phrases at the beginning and verb phrases at the end, to ensure that names would not appear in sentence-initial or sentence-final positions. Both first and second mentions of names appeared in the same clause. This created a situation where co-reference of an antecedent was most felicitously done through a reflexive pronoun (e.g., himself, herself). Sentences varied in length from 8 to 11 words, with a mean length of 9.9 (SD = 0.8). Examples of the critical stimuli are presented in Appendix 1. Four versions of each critical base item were used to create four presentation lists, such that each list contained 60 items in each condition and each base item appeared just once in every list.

These 240 critical trials were randomized with 80 control trials and 10 practice trials. Control items were of similar character to the critical items but used the reflexive pronoun in place of a critical name. On half the trials, congruent sentences were ones where the gender of the reflexive matched the gender of its antecedent, both prominent and nonprominent (John … himself, John’s sister … herself). On the other half, incongruent sentences were ones where the gender of the reflexive did not match its antecedent (John … herself, John’s sister … himself). Since prior work has shown that P600s are elicited by these kinds of gender mismatches between antecedents and pronouns (Osterhout & Mobley, 1995), these control trials offer an informative benchmark for effects that may emerge in the critical trials. Practice items varied the presence or absence of morphological errors (e.g., verb tense, number agreement). A total of 330 trials were divided into one practice block and eight test blocks. Each test block lasted about 10 min.

EEG recordings

The EEG was recorded from 96 electrodes (see Fig. 1 for layout) fitted into an elastic cap using an ActiveTwo EEG system with active electrodes (BioSemi; Amsterdam). Activity associated with eye movements was monitored through four additional electrodes at the suborbital region and outer canthi of the right and left eyes. Online recordings were single-ended potential measurements with respect to a common mode sense site near the vertex, and data were sampled at 256 Hz. The data were referenced, offline, to the average of the activity recorded from the left and right mastoids, and the signal was filtered with a bandpass of 0.01-30 Hz. EEGs were analyzed using the Brain Electrical Source Analysis (BESA) 3.0 software package. Initial processing screened single-trial waveforms for artifacts such as amplifier blocking and muscle and eye movements over an epoch beginning from −100 ms before the critical word to 1,000 ms after the critical word. Trial rejection rates on the basis of artifacts for individual subjects varied from 6% to 22% of critical and control trials, with an average of 11% across subjects. ERPs for each participant were calculated by averaging over artifact-free trials for the critical word in the critical and control conditions.

Results

Behavioral data

Accuracy of the grammaticality judgments ranged from 68% to 93% across subjects, with the mean performance at 83% (SD = 8%). There were no significant differences in performance across conditions, Fs < 1.00, ps > .50. This confirms that participants were able to correctly distinguish grammatical sentences (e.g., 3b-d) from ungrammatical ones (e.g., 3a).

ERP analysis

The mean amplitudes of the ERPs to the critical words were analyzed using a series of repeated measures analyses of variance (ANOVAs) during the N400 (250-500 ms) and P600 (650-800 ms) time windows, time-locked to the onset of the target name in the critical trials and to the onset of the reflexive pronoun in the control trials. The latency ranges were based on visual inspection of the waveforms within the time windows reported previously for the N400 and P600 components. In the critical trials, omnibus analyses were first conducted over three within-subjects variables: prominence (prominent vs. nonprominent), repetition (new vs. repeated), and electrode region (seven regions). The latter variable divided electrode sites on the basis of their hemisphere (left vs. right) and anteriority (frontal vs. central vs. posterior), resulting in six regions (left-frontal, left-central, left-posterior, right-frontal, right-central, right-posterior) plus the midline region. In the control trials, omnibus analyses were conducted over three within-subjects variables: prominence (prominent vs. nonprominent), congruency (incongruent vs. congruent), and electrode region. Significant interactions between the manipulated variables and region were followed up with planned comparisons, focusing on the corresponding electrodes. Greenhouse-Geisser corrections were applied to compensate for inhomogeneous variance and covariance across treatment levels (Greenhouse & Geisser, 1959). Adjusted p-values are reported below.

N400

The omnibus ANOVA of the N400 following the critical name revealed a significant two-way interaction between repetition and electrode region, F(6, 90) = 3.60, p < .05, but no additional three-way interaction between repetition, prominence, and electrode region (p > .15). Planned comparisons revealed that repetition effects were maximal over midline electrodes, where new names elicited greater negativity, as compared with repeated names, F(1, 15) = 19.40, p < .01. Figures 2 and 3a illustrate that this repetition priming was observed in both nonprominent trials, where the use of the repeated name was felicitous, F(1, 15) = 14.72, p < .01, and prominent trials, where it was infelicitous, F(1, 15) = 9.31, p < .01. There was no additional main effect of or interaction with prominence (both ps > .30).

Fig. 2 — In Experiment 1, the effects of antecedent type (prominent vs. nonprominent) on the interpretation of co-referring expressions (repeated vs. new names). The ERPs were time-locked to the critical name and reflect grand averages across all participants, recorded from frontal (F3, Fz, F4), central (C3, Cz, C4), and posterior (P3, Pz, P4) sites

Fig. 3 — In Experiment 1, the voltage maps illustrate the spatial distribution of the ERP responses. On critical trials, the a N400 response 375 ms after the critical name reflects the new names minus repeated names difference in the nonprominent condition, and the b P600 response 660 ms after the critical name reflects the prominent names minus nonprominent names difference in the repeated condition

P600

The omnibus ANOVA of the P600 following the critical name revealed a significant three-way interaction between repetition, prominence, and electrode region, F(6, 90) = 5.31, p < .05. Planned comparisons revealed that interactions between repetition and prominence were maximal over right-central electrodes, F(1, 15) = 10.45, p < .01. Figures 2 and 3b illustrate that while new and repeated names were equivalent in the nonprominent condition, F(1, 15) = 1.05, p > .30, repeated names generated greater positivity than did new names in the prominent condition, F(1, 15) = 16.10, p < .01. This pattern also led to a main effect of repetition, F(1, 15) = 16.19, p < .01. There was no additional main effect of prominence (p > .15).

Finally, the P600 effect on the critical trials resembled the P600 generated on control trials, where a gender mismatch between the antecedent and the reflexive pronoun produced a standard P600 response of the sort observed by Osterhout and Mobley (1995). The omnibus ANOVA revealed a significant two-way interaction between congruency and electrode region, F(6, 90) = 7.66, p < .01, but no additional three-way interaction between congruency, prominence, and region (p > .60). Planned comparisons revealed that incongruency effects were maximal over the left posterior electrodes, where incongruent pronouns elicited greater positivity, as compared with congruent pronouns, F(1, 15) = 24.51, p < .001. This occurred following both prominent antecedents, F(1, 15) = 12.31, p < .01, and nonprominent antecedents, F(1, 15) = 14.91, p < .01. There was no additional main effect of or interaction with prominence (both ps > .30).

Discussion

In Experiment 1, we distinguished between lexical processes and discourse processes by varying the prominence and repetition of names within a clause and measuring their effects on two neural responses, the N400 and the P600. Consistent with prior research, we found that new names were more difficult to process, generating larger N400s, as compared with repeated names (for other types of words, see Ledoux et al., 2007; Rugg & Nagy, 1987). This repetition priming demonstrates that prior recognition of a word facilitates subsequent processing of the same word. Critically, we also found evidence that the infelicitous co-reference created by using a repeated expression to refer to a prominent antecedent within the same clause resulted in a P600. This P600 effect was similar to that found in cases of gender mismatches between pronouns and their antecedents (Gordon et al., 2013; Osterhout & Holcomb, 1992; Osterhout & Mobley, 1995). Thus, the repeated name in sentence 2a not only produced a facilitative effect on the N400, as compared with new names, but also produced an inhibitory effect on the P600, as compared with the other sentence types.

Critically, the finding that the same repeated expression leads to both patterns within the same sentence demonstrates a novel finding of distinct effects on lexical and discourse processing. This sheds light on how to interpret prior evidence of interactivity. In particular, it provides new evidence suggesting that while lexical and postlexical processes may be highly correlated at linguistic interfaces (Kuperberg et al., 2003; Kuperberg et al., 2007; Ledoux et al., 2007), they can also function independently for other aspects of interpretation. As the present findings show, when lexical and discourse effects are associated with different components (N400 and P600, rather than N400 for both), evidence for both processes is revealed. Similarly, when lexical and postlexical effects are associated with different cognitive procedures (lexical access and co-reference, rather than thematic role assignment for both), evidence of two processes is revealed. We will return to the implications of these patterns on models of language comprehension in the General Discussion section.

Experiment 2

In Experiment 2, we wanted to examine more closely an important question raised by these results: What is the cause of the N400? In particular, while the P600s elicited by repeated mention within a clause suggest that neural responses are sensitive to structural features of co-reference (Gordon et al., 2013), there remains the puzzle of why N400s index both lexical and discourse effects when co-reference occurs between clauses (Ledoux et al., 2007; Swaab et al., 2004). Recall that in the original studies, repeated names that co-referenced nonprominent antecedents generated reduced N400 responses, as compared with new names (repetition priming; compare 1c to 1d). However, repeated names also generated greater N400 responses when they co-referenced prominent antecedents, relative to nonprominent antecedents (repeated name penalty; compare 1a to 1c). Yet these two cases naturally reflect very different procedures. This intuition is confirmed by behavioral studies demonstrating evidence of rapid repetition priming (Liversedge et al., 2003; Raney et al., 2000; Traxler et al., 2000) paired with delayed effects of repeated name penalty (Ledoux et al., 2007; Swaab et al., 2004).

Experiment 2 distinguishes between the neural processes associated with the N400 by examining possible interactions with lexical frequency. Like (1), critical sentences vary the prominence of antecedents and the repetition of referring expressions between two clauses of a sentence. However, unlike (1), they also recruit names that are either low or high frequency [see (4) and (5), respectively].

(4)
1. LOW/PROMINENT/REPEATED: Yesterday Earl left work after Earl completed the project
2. LOW/PROMINENT/NEW: Yesterday Wade left work after Earl completed the project
3. LOW/NONPROMINENT/REPEATED: Yesterday Earl and Neil left work after Earl …
4. LOW/NONPROMINENT/NEW: Yesterday Wade and Neil left work after Earl …

(5)
1. HIGH/PROMINENT/REPEATED: Yesterday John left work after John completed the project
2. HIGH/PROMINENT/NEW: Yesterday Bill left work after John completed the project
3. HIGH/NONPROMINENT/REPEATED: Yesterday John and Neil left work after John ….
4. HIGH/NONPROMINENT/NEW: Yesterday Bill and Neil left work after John …

At the lexical level, previous behavioral research has found that repetition priming interacts with lexical frequency: Priming is larger for words that are low in frequency and smaller for words that are high in frequency (Lowder, Choi, & Gordon, 2013; Scarborough, Cortese, & Scarborough, 1977; Young & Rugg, 1992). This pattern suggests that repeated mention facilitates word recognition the most in cases where episodic memory representations are least robust. Lexical frequency also interacts with subsequent integration but does so along a different time course, as compared with word recognition (Johnson, Lowder, & Gordon, 2011; Staub, 2011; Tily et al., 2010). For example, Tily and colleagues varied verb frequency in structurally complex object-cleft sentences (It was Vivian who Terrence lectured/chided for always being late) and found that high-frequency verbs lead to earlier processing of the syntactic ambiguity on the cleft region (Terrence lectured). However, this effect did not emerge in low-frequency verbs until the postcleft region (for always). These results suggest that delays in lexical processing have cascaded effects on postlexical integration.

These findings generate predictions for possible effects of lexical frequency on the N400 responses elicited by repetition priming and the repeated name penalty (Ledoux et al., 2007; Swaab et al., 2004). In particular, if the N400 reflects a single neural process, evidence of lexical access and integration should again both be apparent immediately after names across all frequencies. However, if the N400 reflects multiple processes, variations in lexical frequency may distinguish between access and integration. In particular, consistent with the studies above, recent ERP results have demonstrated that difficulty in integration can have downstream effects on neural processing (Hagoort et al., 1993; Kuperberg, Choi, Cohn, Paczynski, & Jackendoff, 2010; Osterhout & Holcomb, 1992). This predicts that delays in lexical processing for low-frequency names may lead to corresponding delays in lexical integration.