Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Feb 20.
Published in final edited form as: Lang Cogn Neurosci. 2014 Feb 20;29(4):512–523. doi: 10.1080/23273798.2014.892144

“hotdog”, not “hot” “dog”: The phonological planning of compound words

Cassandra L Jacobs 1,*, Gary S Dell 1
PMCID: PMC4045613  NIHMSID: NIHMS572032  PMID: 24910853

Abstract

Do we say dog when we say hotdog? In five experiments using the implicit priming paradigm, we assessed whether nominal compounds composed of two free morphemes like sawdust or fishbowl are prepared for production at the segmental level in the same way that two-syllable monomorphemic words (e.g. bandit) are, or instead as sequences of separable words (e.g. full bowl or grey dust). The experiments demonstrated that nominal compounds are planned as a single sequence, not as two sequences. Specifically, the onset of the second component of the compound (e.g. /d/ in sawdust) did not act as a primeable starting point, although comparable onsets did when that component was an independent word (grey dust). We conclude that there may be a dog in hotdog at the morpheme level, but not when phonological segments are prepared for production.

Keywords: compound words, phonological planning, morphology, serial order, language production

Introduction

Language is productive; new combinations are always being formed from existing words and parts of words. An example of productivity at the morphological level is seen in compounding. Compounds are sometimes created on the fly, such as wind table, meaning a plateau of air that is formed by strong winds. Most of the compounds that we use, though, are familiar and often written as single words, (sawdust, hotdog). The central goal of psycholinguistic research on compounds and, indeed, all morphologically complex words, has been to determine how the words’ structure (e.g. sawdust consists of the words saw and dust) affects their production and comprehension. In this paper, we focus on production and ask whether familiar compounds are produced as one or two phonological sequences.

In studies of auditory and visual word recognition, there is reason to believe that the constituent parts within a compound word function somewhat independently of each other. For example, the frequency of the first noun of a nominal compound affects the ease of word recognition and comprehension (e.g. Taft & Forster, 1976). Although production has been studied less often, the morphological structure of the compound matters there, as well. Some evidence for the existence of the morphemes within compounds (so, the saw and dust in sawdust) comes from aphasic patients’ spoken errors. Blanken (2000), Lorenz, Heide and Burchert (2014) and Ayala and Martin (2002) examined picture-naming errors made by a variety of aphasic individuals. They found that the separate parts of a compound could slip to semantically or phonologically related words; a word like birdhouse could become birdhome or finchhouse, indicating the separable contributions of the individual terms (e.g. bird and house) within the target word.

Findings from response-time studies of compound production by unimpaired speakers are, for the most part, consistent with the decompositional perspective suggested by speech error data. Bien, Levelt, and Baayen (2005) found that the frequencies of the two constituents of a compound separately affected production latencies in a task in which participants had to retrieve previously memorized compounds from a cue. Although they found a small effect of the frequency of the entire compound as well, the results generally supported the decompositional approach to compound production. Roelofs (1996; see also Roelofs & Baayen, 2002) showed that advance knowledge of the first syllable of a compound, which is a separate morpheme, speeded production more than advance knowledge of the first syllable of a monomorphemic word. Studies of compound production using the picture-word interference paradigm, in which a distractor word can be specifically related to a single morpheme of a target compound, have also demonstrated an influence of the morphological status of the compound’s components (Dohmes, Zwitserlood, & Bölte, 2004; Gumnior, Bölte, & Zwitserlood, 2006; Lüttmann, et al., 2011), as have studies using long-lag priming (e.g. Koester & Schiller, 2008; 2011).

Not all response-time studies, however, have supported the decompositional view. Janssen, Bi, and Caramazza (2008) found that only the frequency of the compound itself influenced picture-naming times for compounds and not the constituent parts. They concluded that compounds’ lexical representations are retrieved as wholes.

The bulk of the research on compound production, together with studies of the production of prefixed and suffixed words (e.g. Ferreira & Humphreys, 2001; Janssen, Roelofs, & Levelt, 2002; Melinger, 2003), supports the claim that, at least somewhere in the production process, morphologically complex words are planned differently than monomorphemic words. This conclusion is reflected in models of production, as well. For example, Levelt, Roelofs, and Meyer’s (1999) and Dell’s (1986) models both have a morphological layer in the model’s lexical network, which sits between the lemma/word layer and the layer at which phonological units are represented. The models’ claim that morphological structure is intermediate between a word and a segmental level leads to the central question addressed here. Does morphological structure impact downstream levels? For example, at the phonological level, is sawdust planned like multiple words such as grey dust or like a single word such as napkin?

Some recent work by Cohen-Goldberg (2013) suggests that morphological structure penetrates to levels responsible for the processing of the word’s phonological segments. He found that similar phonemes within a word slow production times (e.g. as in Cohen-Goldberg, 2012), but that this interference is smaller when the similar phonemes come from different morphemes. This suggests that morphological boundaries manifest themselves at levels that represent segmental similarity. Although Cohen-Goldberg (2013) did not examine the production of compounds specifically, his claim of low-level morphological influences is assumed to apply generally, potentially to compound words like sunshine, which he used as an example to illustrate the potential interactions within and across morpheme boundaries.

In contrast, Wheeldon and Lahiri (2002) reported data suggesting that the morphological complexity of compounds did not matter at the phonological and more downstream levels. Their study examined the latencies to produce short phrases after a delay. Compounds such as ooglid, (eyelid in Dutch) acted as single phonological words, unlike word sequences such as oud lid (old member).

One way to address the question of morphological structure during lower-level speech planning is to consider how serial order of phonological units in compounds is represented. If the phonological segments of sunshine are encoded without reference to the morphological boundary, then the serial order process operates over the entire word. There is just one sequence. If there is a potent morphological boundary, though, there are two sequences, one for sun and one for shine.

Let us consider the model of serial planning in language production proposed by Houghton (1990). It proposes that all words have START and END nodes and that the word’s segments are differentially associated with these. So, for the word sun, /s/ is associated with START, /n/ with END, and /ʌ/ is associated weakly with both START and END. The serial order is realized by activating the START node and then gradually turning it off while turning on the END node. The result is that first /s/, then /ʌ/ and then /n/ are retrieved.

Houghton (1990)’s serial order model has been supported in Fischer-Baum et al.’s (2010, 2011) investigations of spelling. Using spelling perseveration errors generated by dysgraphic individuals, they showed that a letter’s serial order is stored in reference to the letter’s distance from the word’s START and END positions, exactly as proposed in Houghton’s model. Fischer-Baum et al. showed that the START-END schema applies across the entire word, not separately within each orthographic syllable. So, a word like bandit is a single sequence, not ban- and then -dit.

Given this background, we can make our central question more concrete: Do compounds like sawdust or sunshine have a single START-END schema for their phonological segments, or two of them (see Figure 1)? Schiller et al. (2001) addressed the question of one versus two sequences in compounds by examining spelling-error patterns in two dysgraphic individuals. When both individuals spelled words (aloud, written, or typed), they tended to make more errors toward the end of the word. When the spelled words were compounds, though, one of the patients tended to get the first letter of the second morpheme more correct than would be expected from its overall word serial position. This result suggests that, at least for this individual, that there is second sequence being generated when a compound is spelled. Of course, spelling involves different units than speaking, and is likely responsive to strategies that do not apply in word production. Consequently, it would be useful to consider the nature of the phonological sequence in compounds specifically in spoken production. We can do so by using a paradigm that has been used to investigate serial order in language production, the implicit priming paradigm (Chen, Chen, & Dell, 2002; Meyer, 1990, 1991; Roelofs, 1996; O’Seaghdha, Chen & Chen, 2010).

Figure 1.

Figure 1

Two accounts of the serial order of phonological segments in compounds, based on start-end serial order schemas (e.g. Houghton, 1990). The production of a sequence begins by activating the start node, which differentially activates the segments with the first most strongly. After each segment is encoded, it is inhibited, making way for the next one. As the sequence is produced, the start node is gradually turned off and the end node is gradually turned on, resulting in a smooth transition through the sequence. The contrasting accounts differ on whether there are separate sequences for the two parts of the compound, or just a single sequence.

The implicit priming paradigm takes advantage of the serial nature of sub-lexical units (e.g. phonological segments, syllables, or morphemes). Participants produce words aloud in response to a semantic cue. For example, for a cue girl, participants learn to produce the target boy. The cue-target pairs are learned and tested together in blocks. In critical blocks, called homogeneous blocks, the targets are phonologically similar in some way. In our studies, as with many others (Meyer, 1991; Roelofs, 1996), the targets all began with the same phoneme. Performance in the critical blocks is compared to performance in mixed (heterogeneous) blocks, where cue-target pairs from the various homogeneous blocks are mixed to create blocks that lack this phonological similarity. Faster response times in the homogeneous blocks constitute implicit priming. Importantly, because exactly the same cue-response items are used in heterogeneous and homogeneous blocks, differences in conditions cannot be due to the strengths of the cue-response relations. This feature of the design also gives it the power to detect small priming effects.

Meyer (1990, 1991) developed the implicit priming paradigm and found that one only gets priming if the shared phonological material in homogeneous blocks is at the beginning of the response word, which we will call the starting point. The greater the shared initial material, such as from a single phoneme to a whole syllable, the greater the implicit priming. The priming effect can be explained by proposing that phonological processes common to the targets in a block can be prepared in advance, with processing suspended, until the cue to speak is given (Levelt, Roelofs, & Meyer, 1999). Because of the advance preparation, the time to begin speaking is then less than in a heterogeneous condition, where none of the initial information is shared.

The fact that implicit priming works only for shared material at the starting point tells us that, at the processing levels tapped by this task, processing is strongly ordered. It also allows us to identify starting points. It can assess whether the onset of the second part of a compound (the /d/ in dust from sawdust), is a true starting point, which, if so, suggests that it functions as its own sequence. In our first two experiments, the cue-response pairs will be familiar two-syllable compounds, where the cue is the first morpheme (e.g. saw-) and the response is the second one (e.g. –dust). In homogeneous blocks, all responses will begin with the same phoneme (e.g. /d/). If the /d/ in –dust acts as a second starting point, priming should be obtained. If, instead, compounds are represented as single words with a single starting point (e.g. at /s/), there should be no priming, suggesting that within the context of a compound, the second unit does not function as its own sequence. The three subsequent experiments test alternative explanations and predictions derived from the first experiment’s results.

Methods

Participants

10 undergraduate students were recruited from the introductory psychology course subject pool for each of the five experiments. These 50 students were all native speakers of English and were participating for course credit.

Materials

All cues and targets were only a single syllable long. Experiments 1a, 1b, 2, and 3 all used the same response items. Because these experiments varied as to whether or not they showed priming, the fact that they used the same responses makes it very unlikely that this variation in priming was due to properties of what participants produced (see column 5 in Table 1 of the Appendix for the common response items of these experiments). All response items fell into one of five mutually exclusive phonological categories, based on the phoneme at the onset of that morpheme (e.g., the /d/ in –dust). Each phonological category contained five items.

Cues for these response items were generated based on their relationship to the target in Experiments 1 through 3. In Experiments 1a and 1b, the cues were the first morpheme of a compound word whose second element is the target item (so, for a compound like sawdust, saw would be the cue, and dust would be the target production). In Experiment 2, the cues were semantically related words (e.g. sweep for dust), as done in the original paradigm (Meyer, 1990, 1991). In Experiment 3, the cues were pragmatically viable adjectival modifications of the target words (e.g. grey for dust). The adjectives should be effective retrieval cues for the responses given that the implicit priming procedure involves a learning phase prior to the test trials. Moreover, there is evidence that experienced adjective-noun pairs are represented and influence the production process (Janssen & Barber, 2012). The cues for Experiments 1–3 are in columns 2–4 of the Appendix.

The materials for Experiment 4 were composed of two-syllable, monomorphemic words, with the 25 items’ second syllables falling into one of five mutually exclusive phonological categories based on second-syllable onset phoneme. The first syllables (e.g. the ban- in bandit) are the cues, and the second syllables (e.g. dit) the targets. These are available in column 1 of the Appendix.

Procedure

An implicit priming task modeling the structure from Meyer (1990, 1991) was the basis for our task. Experiment 2 in fact used exactly the same method as Meyer (1990) in terms of number of participants, items, blocks, and trials, and in its use of semantic cues for targets. The only difference between our task and that of Meyer for our other experiments was that, instead of semantic cues, the cues initiated a sequence that the target completed; in those experiments, participants studied cue-target pairs where the cue constituted the first half of a single word or multi-word phrase, and the target was the second half of this word or phrase. By reproducing all of the quantitative aspects of Meyer’s original design, which features five different five-item sets and multiple tests of each item both in heterogeneous and homogeneous blocks for a total of 7500 trials per experiment, our experiments have more than adequate power. Implicit priming effects when the prime is a single onset phoneme (e.g. O’Seaghdha, Chen, & Chen, 2010) or syllable (e.g. Chen, Chen, & Dell, 2002) can be on the order of 10 ms. We shall see that an effect of 8 ms was detected in one of our experiments.

Results

The analyses for all experiments are based on the same principle; a maximum mixed-effects multilevel model was constructed for each of the experiments to assess the effects on production time for production context (whether blocks were heterogeneous or homogeneous). In maximal models, both participants and items are modeled as random intercepts and slopes, a conservative approach to hypothesis testing, which may be required in psycholinguistic studies to avoid Type I errors (e.g. Barr et al., 2013). The variable trial block (which of the 6 blocks the trial appears in) is also examined in the analysis as a control predictor. Although production context is the key contrast, the main effect of block is included as a check on the validity of the experimental paradigm, with lower production times across the course of the experiment indicating learning of the items and/or task. In the analyses reported here, we did not include the interaction between production context and the control predictor, block. In additional unreported analyses that did include this interaction term, it was never found to be a reliable predictor of response times in any experiment. Tests for the production context main effect are directional. Experiments of this sort either result in faster response times for the homogeneous condition, or they fail to do so. Results in the reverse direction are considered to be either spurious, the result of error, or the result of an unwelcome strategy. Tests involving the block variable are nondirectional, as each direction is an interpretable outcome. Speech onset times were determined from the stored sound files using an algorithm by Bansal, Griffin and Spieler (2001). Only correct responses where the speech onset time was less than the time out period of 1000 ms were included in the analysis of production times.

We used the package lme4 to build and evaluate the statistical models of production time (Bates & Sarkar, 2007; see also Quené & Van den Bergh, 2004, and Baayen et al., 2008). For each experiment, we will report only maximal models with random intercepts and slopes for the key variable, production context. The overall advantages of the homogeneous production context over the heterogeneous production context for all five experiments can be seen in Figure 2. All models are reported in Tables 26 in the Appendix. Error rates were very low and approximately equal between homogeneous and heterogeneous conditions in all experiments, and hence, other than the error rates themselves, no statistics for errors are reported.

Figure 2.

Figure 2

Homogeneous Production Context Advantages for Experiments 1–4.

Experiment 1a (cue=saw; response=dust)

This experiment tested whether transparent compounds that were split at the morpheme boundary were treated as two separate words. We found no effect of implicit priming, with a 0 millisecond advantage in the homogeneous production context compared to the heterogeneous context. The mean error rate was 2.5% (2.9% for the heterogeneous and 2.1% for the homogeneous conditions). Both the average homogeneous and heterogeneous production speech onset latencies were 410 ms. Participants did, however, show learning of the procedure and pairs over time, with a significant decrease in response times over blocks (p < .001).

The absence of a priming effect for compounds is most straightforwardly interpreted as support for the claim that the second part of a compound is not true starting point, and hence, compounds are represented as a single whole sequence. Notice that this null result, if it is truly null, is also evidence that the participants are treating the items as compounds, because if they were treating them instead as separate words, the homogeneous condition should foster advance preparation of the shared initial consonants of the response word and hence lead to priming.

Experiment 1b (cue = saw; response = dust)

Because Experiment 1a found no priming effect at all, and because this result was unexpected from theoretical perspectives that allow for morphological structure to influence phonological sequencing, we carried out Experiment 1b, which was simply an attempt to replicate the first experiment.

In Experiment 1b, there was again no advantage for the homogeneous context. Latencies in the homogeneous context were slightly longer than in heterogeneous contexts (426 ms for homogeneous and 422 ms for heterogeneous, with a 3.0% error rate for the heterogeneous condition, and a 2.7% rate for the homogeneous condition). Since we are only interested in an advantage for the homogeneous condition over the heterogeneous one, no test of the production context variable was conducted. As in Experiment 1a, participants got faster over the course of the experiment (p < .001).

From Experiments 1a and 1b, we conclude that cue-target items that correspond to familiar compounds create no implicit priming, and hence that the initial phoneme of the second part of the compound is not a primeable starting point. Our tentative conclusion is that compounds are represented for production as single sequences at the level where segmental order information is represented.

Experiment 2 (cue=sweep; response=dust)

Given the null effects in Experiments 1a and 1b, it is important to establish that these response items can show an onset-consonant priming effect when the onset is known to be a true starting point. That should be the case if the onset begins an independent word. In this experiment, the response words were cued with a separate word that was semantically associated with the target, as in Meyer (1991).

As expected, there was a 13-millisecond advantage for targets in homogeneous contexts over heterogeneous contexts (heterogeneous = 462 ms; homogeneous = 448 ms) with error rates at 2.8% (3.2% in the heterogeneous condition and 2.6% in the homogeneous condition). This finding is consistent with those of other implicit priming studies when the homogeneous condition involved a single onset consonant (O’Seaghdha, Chen, & Chen, 2010). This advantage was statistically significant (p < .03) as was the effect of block (p < .001).

This result allows us to conclude that the negative results from the first two experiments were not due to a peculiarity of the response items used. It appears that dust will be primed in the homogeneous condition (when all responses in the block begin with /d/), provided that dust is a separate word cued semantically, rather than the second half of a compound, cued by the first half.

Experiment 3 (cue=grey; response=dust)

The experiment tested whether the absence of an effect in Experiments 1a and 1b was due to the fact that the cue-target pairs formed a linguistic sequence, rather than the fact that the sequence is specifically a familiar compound. So, the response word in Experiment 3 was cued by a pragmatically viable adjective, making the cue-target pair a sequence, but the response itself a separate word.

Once again, there was an advantage for responses in the homogeneous contexts (heterogeneous = 465 ms; homogeneous = 457 ms) with error rates at 2.1% (2.2% for homogeneous and 1.9% for heterogeneous). The test for the production context variable yielded p = .052, which is right at the .05 rejection region. Given that we are using a conservative maximal random effects structure, and that our expected effect size was around 10 ms, we felt justified in rejecting the null hypothesis for this 8 ms difference. As before, there was a significant effect of how far participants were along in the experiment, p < .001.

Experiment 4 (cue=ban; response=dit)

The preceding analyses suggest that implicit priming for the response’s onset consonant only occurs when the response is a separate word, and specifically that compounds act as a single word. Split monomorphemic words, used in the same manner as the compounds, should then also show no effect of implicit priming. Consequently, Experiment 4 split apart two-syllable monomorphemic words, using the first syllable as the cue to produce the second syllable.

Experiment 4 yielded a negligible 3 millisecond advantage for the homogeneous production context (heterogeneous = 471 ms; homogeneous = 468 ms), p = ns, with a 2.0% error rate (2.1% for homogeneous, 1.9% for heterogeneous). This mirrored the results for the compounds in Experiment 1a and 1b. There was the typical effect of practice, p < .001.

Discussion

Over the course of our five experiments, we attempted to pinpoint the role of morphological structure in the phonological planning of compounds composed of two free morphemes like sawdust, using the implicit priming paradigm.

Experiments 2 and 3 were ancillary studies, designed to verify that the paradigm shows priming with the onsets of whole words. In Experiment 2, we replicated the results of Meyer (1991), O’Seaghdha et al. (2010), and Roelofs (1999), showing that the retrieval of a word from a semantic cue (e.g. cue sweep for the response dust) is facilitated if the response comes from a block in which all the responses begin with the same phoneme, e.g. /d/. Experiment 3 extended this result to sequential word pairs forming a partial noun phrase, with adjectives serving as the cues for nouns (e.g. cue=grey, response=dust). We conclude from the results of Experiments 2 and 3 that the onset of the second item of a cue-target pair functions as a clear starting point in the sense proposed by START-END models of serial planning (Houghton, 1990; Fischer-Baum et al., 2010), provided that the onset is the beginning of an independent word.

Experiments 1a, 1b, and 4 extended the paradigm to situations in which the cue and the target when concatenated form single words, thus allowing for a test of whether the second part of the word acts as a starting point. Experiments 1a and 1b tested compounds composed of two free morphemes where the first part of a compound was the cue for the production of the second part (cue=saw, response=dust). Experiment 4 took this one step further, splitting two-syllable monomorphemic words on the syllable boundary (cue=ban, response=dit). In both Experiments 1 and 4, we failed to find evidence for implicit priming when responses came from conditions in which the onset consonant was homogeneous (e.g. /d/).. Altogether, the results suggest that compounds are structured more like single words than like two individual words at the processing level where the sequential planning of phonological units takes place. This result is just what would be expected from Wheeldon and Lahiri’s (2002) experimental demonstration that compounds function as single prosodic words during production planning, and from the results of Janssen, Bi, and Caramazza (2008), who found that the whole-word frequency of compounds retrieved in a picture naming task has more influence on naming times than the frequencies of the compounds’ components. Before accepting this conclusion, though, it is important to consider potential limitations of the findings and some possible alternative explanations.

First, consider the fact that the key experiments finding no priming when the cue-response pair formed a compound (Experiments 1a and 1b) are “null” results. A null result can be difficult to interpret, but such difficulties arise typically when the experiment is underpowered, or when the finding was in the expected direction, or when there has been no attempt to replicate. That is not the case here. The null result of Experiment 1a was replicated in Experiment 1b, and the two findings were convincingly null (0 ms of priming in 1a, and 4 ms in the wrong direction for 1b). Because of the within-subject and within-item design and the 7500 trials per experiment, each of these experiments had the power to detect the expected priming effects. Experiments 2 and 3 had exactly the same designs as the compound experiments and these successfully detected priming effects of 13 and 8 ms (10 ms for single onset-consonant priming effects was expected). Null results can also be problematic when they cannot be easily interpreted. Again, that is not the case here, as the possibility that compounds are treated as a single phonological sequence has precedence in the literature (e.g. Wheeldon & Lahiri, 2002; Janssen et al., 2008). Moreover, we will present more specific theoretical proposals later in the discussion. Finally, null findings best become meaningful when viewed alongside contrasting positive findings. Here, we have positive findings in Experiments 2 and 3, and in a subsequent section make predictions about expected positive effects in other situations. In summary, we feel that the null results of Experiments 1a and 1b with compounds are truly theoretically constraining.

An alternative explanation for our null findings with compounds must be considered, though. Perhaps the effectiveness of an implicit prime depends inversely on the strength of the cue, which conceivably could be stronger for the compound experiments, thus predicting that priming would be weak for compounds. The cue saw may so effectively point toward dust, that there is no “room” for priming to occur in a homogeneous initial-consonant context. This account has at least two points against it. First, recall that all cue-response pairs are studied and practiced before they are tested. Such study would be expected to reduce any inherent differences in cue effectiveness. Second, the principal model of how implicit priming works (Levelt et al., 1999) specifically holds that the production context variable (homogeneous vs. heterogeneous context) should not interact with cue strength. Cue strength affects the speed of retrieval of a holistic representation of the response item. By contrast, a phonologically homogeneous production context affects the speed of subsequent phonological encoding. These are different steps in the process of word production and hence a speed up because of a strong cue should not diminish priming. Of course, one could argue that this model is not true and assert that the knowledge that the response begins with, say, /d/, could affect an earlier stage of word retrieval. This proposal, though, is unlikely because, as demonstrated by Meyer (1990; 1991), only continuous sequences from the beginning of the response function as implicit phonological primes. Rhyming, for example, creates no implicit priming, despite non-initial primes such as rhymes being very effective cues for lexical retrieval (e.g. Bower & Bolton, 1969). Consequently, the implicit priming effect cannot be the result of the implicit prime cueing lexical retrieval. To summarize the second point, the differences in the strength of the cues across the experiments (e.g. the first part of a compound may be a particularly strong cue) cannot explain differences in priming. The nature of the priming—that it only works at starting points—reveals that priming must occur at a later processing stage than the lexical retrieval stage that would be impacted by cue-response strength.

If we were to point to a weakness in our data, it would be that the priming effect in Experiment 3 (cue=grey, response=dust) was small, and just barely at the threshold for significance. Despite this, we argue that our results are internally consistent. Across five experiments, there is only priming when the response is a separate word, and never priming when it is the second half of the word that begins with the cue, even when that word is a compound. The conclusion from these studies is quite consistent with other studies of compound production that treat compounds as single sequences at the phonological level (Janssen et al., 2008; Wheeldon & Lahiri, 2002). In the following sections, we consider two additional theoretical questions: (1) What specific mechanisms prevent implicit priming for shared response-initial consonants that are not word starting points? (2) How do our results accord with the many studies supporting a role for decomposition in compound production?

Implicit priming, starting points, and representational identity

The “suspend-resume” account of implicit priming (Levelt et al., 1999) assumes that speakers can build the shared component of the representation of the response item in advance, that is, before they know the cue and hence the identity of the item. The mechanism provides an excellent account of why the shared material must be the response-initial element (or that element plus contiguous elements) to get priming. Although we believe that this is the correct account, it does not explain why we did not get priming in three experiments where the implicit prime was response-initial, but was not the starting point of the word formed by the cue-target sequence. Our amendment to the theory is simple: If you do not know how to represent the shared material in advance of knowing the cue, you cannot build a representation in advance. We propose that at starting points, it is possible for shared material to be identically represented in the phonological sequence that guides production, and hence the system can make use of the sharing. The representation of, say, /d/ in cue=grey; response=dust, and /d/ in cue=fat; response=dog, is the same. Both are assumed to be a /d/ in the context of the beginning of a word. Hence, the system knows how to build this in advance, and priming will be obtained. We also propose that seemingly identical phonological material that is not at a starting point may not be representationally identical. The /d/’s in cue=saw; response=dust; and cue=bull; response=dog, are not represented the same way. As a result, one cannot plan this /d/ in advance.

When we say that the /d/’s in sawdust and bulldog are not represented the same way, we do not mean that the system is not working with phonological segments. Rather, the way that the segments are treated is different. To illustrate, in Figure 3 we present an account of this difference based on an incremental segmental spell out model of phonological encoding (e.g. Levelt, et al, 1999). The model has representational levels corresponding to whole lexical items, morphemes, and phonological segments. In the case of a compound such as sawdust, there are morpheme units for saw, and dust, and their order is represented. Each morpheme unit is associated with its ordered segments. During word-form planning, these segments are retrieved from each morpheme in sequence and placed left-to-right in slots of a sequence store for the word. This store has a single starting point, and there is no boundary indicating the beginning of any second morpheme if the word is a compound. The figure shows a moment in the planning of sawdust. The two segments of saw are in place, as is the /d/ from dust, At this point in the representation, we assume no syllabic or phonetic organization in keeping with Levelt et al.’s (1999) approach to word-form encoding. This structure is computed later.

Figure 3.

Figure 3

An incremental account of the lack of implicit priming for compounds. Lexicalized compounds such as “sawdust” are represented at the L-level, and are decomposed into morphemes at the morpheme level. The morphemes are known to be ordered and have access to their phonological segments. The phonological segments are incrementally retrieved and placed into slots in the phonological-word sequence. This sequence has a known starting point (the first slot), but no morphological boundaries. Its slots must be filled incrementally and left to right. Implicit priming resulting from a shared initial response (e.g. /d/) is not possible when the response is the second morpheme of a compound, because the /d/ is not at the starting point in the sequence. It cannot be known in advance where to put the /d/ in the sequence because, before the cue is given, the previous segments in the sequence are not known. If the response is an independent word (e.g. cue=grey, response=dust), its first segment will always occupy the starting slot of the phonological-word sequence, and if that segment is known in advance, it can be placed in that slot in advance, creating a priming effect.

Given this model, it is easy to see why the onset of the second morpheme of a compound cannot be put in advance into the structure. The representational uncertainty is positional. One does not know where to put the second-morpheme onset until one has put in the segments of the first morpheme. After saw, the /d/ goes in the third slot. But for other compounds, the beginning of the second morpheme could be at the second slot (eyeball, assuming that the diphthong is a single unit), the third (bulldog), or even sixth (streetlight). If the response is an independent word, there is no uncertainty; the shared onset just goes in the first slot.

Of course, one could assume that there is syllabic structure at work during this segmental spell out process. If so, one would then know in advance that the /d/’s in bulldog and sawdust go in the same slot, the onset of the second syllable. Even so, other factors can create representational variability, when the shared material is not at the starting point. Suppose that each segment’s representation contains cues or links to prior segments in the word (e.g. forward chaining, Goldberg & Rapp, 2008). If so, the /d/’s in bulldog and sawdust would be contextually different, and could not be planned in advance. But if the /d/’s were word-initial, their prior context cues can be assumed to be identical: the word boundary.

This discussion, we hope, illustrates possible mechanisms for why priming may not be possible for shared material that is response-initial but does not correspond to the relevant starting point in the controlling structure. We can go beyond illustration and make a prediction: Implicit priming for response-initial shared material that is not at a starting point should be obtained only if the system knows how to situate the shared material into the representation in advance. This claim can be, and indeed has been, tested in Mandarin Chinese. Implicit priming in Mandarin only occurs for syllable-sized units (Chen et al., 2002). Specifically, robust priming occurs when the shared material is the entire first syllable of the response word. Priming also occurs, but is smaller, when the shared material consists of just the segmental content of the first syllables; for example, if all responses begin with the syllable bing, but their tone varies, there will be between 10 and 15 ms of priming. This latter case is called “toneless” syllable priming. Unlike in other languages, priming in Mandarin does not occur for shared syllable onsets. Hence, it is proposed that the sequential structure that guides Mandarin production is made up of syllable-sized units, rather than phonemic units (O’Seaghdha, et al., 2010). For two-character words, there are only two slots in the sequence, one for each syllable. Under these circumstances, if the first character is the cue and the second syllable is the response, there should be priming, provided that all of the second syllables of the response set share the syllable’s segmental content. This is because the system does know where to put the implicit prime, even though it is not at the starting point of structure. There are only two slots, and the shared syllable content is known to occupy the second slot.

Chen and Chen (in press) have carried out exactly this experiment using Mandarin two-character compounds. The cue is the compound’s first character, and the response is the second syllable. In homogeneous blocks, the responses (the second syllables), shared phonological content, but not tone; that is, the implicit prime was the toneless syllable. The results showed between 14 and 17 ms of priming, comparable to other findings when word-initial toneless syllables are the prime. This result contrasts with our findings in English, but is exactly what is expected from the account of our results presented in Figure 3. Because the toneless syllable prime can be situated in the syllabic sequence in Mandarin, priming is possible. Notice that this explanation has nothing to do with morphological structure. In fact, we would predict that the same thing would happen with two-character monomorphemic words. Chen and Chen reported this finding, as well. In sum, the procedure in which the cues are the first part of the word, and the response is the second, leads to no priming in English when the implicit prime is the onset of the second syllable, regardless of whether the words are compounds (Experiments 1a and 1b) or monomorphemic (Exp. 4). In Mandarin, the data also patterned similarly for compounds and monomorphemic words, only there, priming was obtained. The differences between Mandarin and English are fully in accord with both our claim that the morphological organization of the compound does not set up a second starting point at the level of the phonological sequence, and our claim that implicit priming only occurs when it is possible to situate shared material that is not at a starting point.

Morphological decomposition in production?

Several studies, using a variety of methods, languages, and types of participants, support the claim that the production of compounds is influenced by their morphological organization, that is, that they are composed of two distinct morphemes (Ayala & Martin, 2002; Bien et al, 2005; Blanken, 2000; Dohmes et al., 2004; Gumnior et al., 2006; Koester & Schiller, 2008; 2011; Lorenz et al., 2014; Lüttmann et al., 2011; Roelofs, 1996; Roelofs & Baayen, 2002; Schiller et al., 2001, but see Chen & Chen, 2006, for discussion of Mandarin, which may differ in this respect). These results support the inclusion of a morphemic level in production. We stress that our findings are entirely consistent with this literature. We simply showed that, when the morphological parts of a compound are spelled out into a phonological sequence, that sequence has a single starting point. Another way to say this is that there is no longer a potent morphological boundary in the sequence. At this level, sawdust and bandit look the same. But just because they look the same at this level, it does not mean that the prior processes were. In fact, our account of the data presented in Figure 3 specifically assumes morphological decomposition. The segments of sawdust are retrieved from two morpheme units, whereas for bandit, they are retrieved from one such unit. The existence of distinct nodes for the morphemes of a compound would, in a spreading activation system, have a number of effects. For example, it would explain errors on one morpheme, but not the other (e.g. Ayala & Martin, 2002) and would make words that share a morpheme more similar and hence subject to positive and negative effects associated with that similarity (e.g. as in the picture-word interference paradigm). Moreover, the effects of morphological similarity would be in excess of that expected from just phonological or semantic similarity (e.g. Koester & Schiller, 2008; Roelofs, 1996). Thus, there is little question that morphological structure impacts production generally and the production of compounds specifically. This conclusion would likely be accepted even by advocates of the view that morphological structure emerges from phonological-semantic mappings, rather than being stipulated a priori, (e.g. Plaut & Gonnerman, 2000). We are simply proposing that, at the level tapped by our experiments, a structure with two morphemes—the English nominal compound--does not compel the creation of two, rather than one, phonological sequences during production.

In summary, our experiments show that compounds are produced as one, and not two, sequences at the phonological level. Because the experiments tested only lexicalized (familiar) nominal English compounds that are commonly spelled as a single word such as ‘sawdust’, our findings properly apply only to this set of compounds. Compounds spelled with two words (“lightning bug,” as opposed to “firefly”) may in fact be phonologically planned as two sequences, either because of a true orthographic influence on phonological processes, or because the probability of a space between the words is predictive of the degree of phonological separation. Also, our results may not generalize to the production of novel compounds, which are very common, and formed spontaneously. Novel compounds, such as cat dust (the lingering dander caused by cat ownership) or thought book (similar to a diary) may not be subject to the same kinds of planning processes and may instead be produced with two unique starting points, much as might be expected when considering the semantic structure of complex noun phrases, where the morphemes are considered to be their own discrete words. It could be the case, then, that lexicalized compounds are stored as single phonological sequences, while non-lexicalized items would necessarily be derived from the component parts, with this two-part structure being reflected at the phonological level. The nature of the production of compounds generally is therefore still an open question, which merits further investigation.

Acknowledgements

We would like to acknowledge our research assistants, Kammie Sturdyvin and Nathaniel Hall, as well as Simon Fischer-Baum, Antje Meyer, Padraig O’Seaghdha, Ariel Cohen-Goldberg, Harald Baayen, Kathryn Bock, and two anonymous reviewers for their comments on this research.

Funding

The first author is supported by an NSF Graduate Research Fellowship; this research was also supported in part by NIH/NIDCD Grant [R01DC000191]

Appendix

Table 1.

Stimuli for Experiments 1–4

Cues Targets
Experiment 4
Cues
Experiment 1
(Compounds)
Cues
Experiment 2
(Semantic cues)
Cues
Experiment 3
(Noun phrases)
Targets
Experiment
1, 2, 3
bam- -boo eye- game round ball
co- -balt soy- meal red bean
mor- -bid surf- wood flat board
lim- -bo fish- food hot bowl
ro- -bot sand- crate large box

ban- -dit week- night hard day
can- -did bull- cat fat dog
ra- -dish touch- up soft down
sar- -dine saw- sweep grey dust
en- -dive out- close closed door

fur- -nish gun- boy big man
or- -nate door- feet wide mat
tur- -nip room- friend good mate
gar- -net wind- grain nice mill
har- -ness ear- cold warm muff

sham- -poo dust- wok deep pan
ram- -page tooth- choose dull pick
mag- -pie bag- flow strong pipe
car- -pet flag- climb thin pole
em- -pire air- ship old port

vor- -tex thumb- wall sharp tack
mo- -tel bob- mouse long tail
lan- -tern day- when short time
gui- -tar buck- mouth white tooth
pla- -teau wash- sink full tub

Table 2.

Mixed-effects model for Experiment 1a (Compounds)

Random effects

Groups Name Variance SD
Participants Intercept 1373.95 37.08
Production context 199.66 14.13
Items Intercept 721.42 26.86
Production context 306.57 17.51

Fixed effects

Estimate SE t
Intercept 429.465 13.18 32.58
Production context −0.039 6.07 −0.01
Block number −5.371 0.65 −8.28*

Table 3.

Mixed-effects model for Experiment 1b (Compounds)

Random effects

Groups Name Variance SD
Participants Intercept 2186.43 46.76
Production context 206.69 14.38
Items Intercept 800.81 28.30
Production context 505.01 22.47

Fixed effects

Estimate SE t
Intercept 443.260 16.12 27.51
Production context 3.047 6.80 0.44
Block number −5.675 0.69 −8.18*

Table 4.

Mixed-effects model for Experiment 2 (Semantic cues)

Random effects

Groups Name Variance SD
Participants Intercept 5098.26 71.40
Production context 312.23 17.67
Items Intercept 702.99 26.51
Production context 110.50 10.51

Fixed effects

Estimate SE t
Intercept 480.597 23.43 20.51
Production context −12.916 6.51 −1.99*
Block number −5.530 0.86 −6.40*

Table 5.

Mixed-effects model for Experiment 3 (Adjective cues)

Random effects

Groups Name Variance SD
Participants Intercept 3011.23 54.88
Production context 71.53 8.46
Items Intercept 712.52 26.69
Production context 50.36 7.10

Fixed effects

Estimate SE t
Intercept 482.17 21.80 22.13
Production context −7.79 4.79 −1.63*
Block number −4.75 0.98 −4.86*

Table 6.

Mixed-effects model for Experiment 4 (Monomorphemic words)

Random effects

Groups Name Variance SD
Participants Intercept 5640.51 75.10
Production context 194.17 13.94
Items Intercept 522.91 22.87
Production context 215.51 14.68

Fixed effects

Estimate SE t
Intercept 522.918 24.34 21.49
Production context −4.952 5.704 −0.87
Block number −8.45 0.64 −13.16*

References

  1. Ayala J, Martin N. Decompositional effects in the production of compound words in aphasia. Brain and Language. 2002;83:81–83. [Google Scholar]
  2. Baayen RH, Davidson DJ, Bates DM. Mixed-effects modeling with crossed random effects for subjects and items. Journal of Memory and Language. 2008;59:390–412. [Google Scholar]
  3. Bansal P, Griffin ZM, Spieler DH. Epd: Matlab wave file parsing software. 2001 [Online]. Available: http://homepage.psy.utexas.edu/homepage/faculty/Griffin/resources.html. [Google Scholar]
  4. Barr DJ, Levy R, Scheepers C, Tily HJ. Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language. 2013;68:255–278. doi: 10.1016/j.jml.2012.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bates D, Sarkar D. lme4: Linear mixed-effects models using S4 classes. 2007 [Google Scholar]
  6. Bien H, Levelt WJ, Baayen RH. Frequency effects in compound production. Proceedings of the National Academy of Sciences of the United States of America. 2005;102:17876–17881. doi: 10.1073/pnas.0508431102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Blanken G. The production of nominal compounds in aphasia. Brain and language. 2000;74:84–102. doi: 10.1006/brln.2000.2338. [DOI] [PubMed] [Google Scholar]
  8. Bower GH, Bolton LS. Why are rhymes easy to learn? Journal of Experimental Psychology. 1969;82:453–461. [Google Scholar]
  9. Chen TM, Chen JY. Morphological encoding in the production of compound words in Mandarin Chinese. Journal of Memory and Language. 2006;54:491–514. [Google Scholar]
  10. Chen JY, Chen TM. Annual Meeting of the Psychonomic Society. Toronto, Ontario, Canada: 2013. Phonological planning of compound words in Mandarin Chinese. Paper presented at the. [Google Scholar]
  11. Chen JY, Chen TM, Dell GS. Word-form encoding in Mandarin Chinese as assessed by the implicit priming task. Journal of Memory and Language. 2002;46:751–781. [Google Scholar]
  12. Cohen-Goldberg AM. Phonological competition within the word: Evidence from the phoneme similarity effect in spoken production. Journal of Memory and Language. 2012;67:184–198. [Google Scholar]
  13. Cohen-Goldberg AM. Towards a theory of multimorphemic word production: The heterogeneity of processing hypothesis. Language and Cognitive Processes. 2013;28:1036–1064. [Google Scholar]
  14. Dell GS. A spreading-activation theory of retrieval in sentence production. Psychological Review. 1986;93:283–321. [PubMed] [Google Scholar]
  15. Dohmes P, Zwitserlood P, Bölte J. The impact of semantic transparency of morphologically complex words on picture naming. Brain and Language. 2004;90:203–212. doi: 10.1016/S0093-934X(03)00433-4. [DOI] [PubMed] [Google Scholar]
  16. Ferreira VS, Humphreys KR. Syntactic influences on lexical and morphological processing in language production. Journal of Memory and Language. 2001;44:52–80. [Google Scholar]
  17. Fischer-Baum S, McCloskey M, Rapp B. The representation of grapheme position: Evidence from acquired dysgraphia. Cognition. 2010;115:466–490. doi: 10.1016/j.cognition.2010.03.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Fischer-Baum S, Charny J, McCloskey M. Both-edges representation of letter position in reading. Psychonomic Bulletin & Review. 2011;18:1083–1089. doi: 10.3758/s13423-011-0160-3. [DOI] [PubMed] [Google Scholar]
  19. Goldberg AM, Rapp B. Is compound chaining the serial-order mechanism of spelling? A simple recurrent network investigation. Cognitive Neuropsychology. 2008;25:218–255. doi: 10.1080/02643290701862332. [DOI] [PubMed] [Google Scholar]
  20. Gumnior H, Bölte J, Zwitserlood P. A chatterbox is a box: Morphology in German word production. Language and Cognitive Processes. 2006;21:920–944. [Google Scholar]
  21. Houghton G. The problem of serial order: A neural network model of sequence learning and recall. In: Dale R, Mellish C, Zock M, editors. Current research in natural language generation. London: Academic Press; 1990. pp. 287–319. [Google Scholar]
  22. Janssen N, Barber HA. Phrase Frequency Effects in Language Production. PLoS ONE. 2012;7:e33202. doi: 10.1371/journal.pone.0033202. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Janssen N, Bi Y, Caramazza A. A tale of two frequencies: Determining the speed of lexical access for Mandarin Chinese and English compounds. Language and Cognitive Processes. 2008;23:1191–1223. [Google Scholar]
  24. Janssen DP, Roelofs A, Levelt WJM. Inflectional frames in language production. Language and Cognitive Processes. 2002;17:209–236. [Google Scholar]
  25. Koester D, Schiller NO. Morphological priming in overt language production: Electrophysiological evidence from Dutch. Neuroimage. 2008;42:1622–1630. doi: 10.1016/j.neuroimage.2008.06.043. [DOI] [PubMed] [Google Scholar]
  26. Koester D, Schiller NO. The functional neuroanatomy of morphology in language production. Neuroimage. 2011;55:732–741. doi: 10.1016/j.neuroimage.2010.11.044. [DOI] [PubMed] [Google Scholar]
  27. Levelt WJM, Roelofs A, Meyer AS. A theory of lexical access in speech production. Behavioral and Brain Sciences. 1999;22:1–75. doi: 10.1017/s0140525x99001776. [DOI] [PubMed] [Google Scholar]
  28. Lorenz A, Heide J, Burchert F. Compound naming in aphasia: effects of complexity, part of speech, and semantic transparency. Language, Cognition and Neuroscience. 2014;29:88–106. [Google Scholar]
  29. Lüttmann H, Zwitserlood P, Böhl A, Bölte J. Evidence for morphological composition at the form level in speech production. Journal of Cognitive Psychology. 2011;23:818–836. [Google Scholar]
  30. Melinger A. Morphological structure in the lexical representation of prefixed words: Evidence from speech errors. Language and Cognitive Processes. 2003;18:335–362. [Google Scholar]
  31. Meyer AS. The time course of phonological encoding in language production: The encoding of successive syllables of a word. Journal of Memory and Language. 1990;29:524–545. [Google Scholar]
  32. Meyer AS. The time course of phonological encoding in language production: Phonological encoding inside a syllable. Journal of Memory and Language. 1991;30:69–89. [Google Scholar]
  33. O’Seaghdha P, Chen JY, Chen TM. Proximate units in word production: Phonological encoding begins with syllables in Mandarin Chinese but with segments in English. Cognition. 2010;115:282–302. doi: 10.1016/j.cognition.2010.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Plaut DC, Gonnerman LM. Are non-semantic morphological effects incompatible with a distributed connectionist approach to lexical processing. Language and Cognitive Processes. 2000;15:445–485. [Google Scholar]
  35. Quené H, Van den Bergh H. On multi-level modeling of data from repeated measures designs: A tutorial. Speech Communication. 2004;43:103–121. [Google Scholar]
  36. Roelofs A. Serial order in planning the production of successive morphemes of a word. Journal of Memory and Language. 1996;35:854–876. [Google Scholar]
  37. Roelofs A. Phonological segments and features as planning units in speech production. Language & Cognitive Processes. 1999;14:173–200. [Google Scholar]
  38. Roelofs A, Baayen H. Morphology by itself in planning the production of spoken words. Psychonomic Bulletin & Review. 2002;9:132–138. doi: 10.3758/bf03196269. [DOI] [PubMed] [Google Scholar]
  39. Schiller NO, Greenhall JA, Shelton JR, Caramazza A. Serial order effects in spelling errors: Evidence from two dysgraphic patients. Neurocase. 2001;7:1–14. doi: 10.1093/neucas/7.1.1. [DOI] [PubMed] [Google Scholar]
  40. Taft M, Forster KL. Lexical storage and retrieval of polymorphemic and polysyllabic words. Journal of Verbal Learning and Verbal Behavior. 1976;15:607–620. [Google Scholar]
  41. Wheeldon LR, Lahiri A. The minimal unit of phonological encoding: prosodic or lexical word. Cognition. 2002;85:B31–B41. doi: 10.1016/s0010-0277(02)00103-8. [DOI] [PubMed] [Google Scholar]

RESOURCES