Abstract
This study aims at examining and accounting for the scope of the temporal effect of phrase boundaries. Previous research has indicated that there is an interaction between boundary-related lengthening and prominence such that the former extends towards the nearby prominent syllable. However, it is unclear whether this interaction is due to lexical stress and/or phrasal prominence (marked by pitch accent) and how far towards the prominent syllable the effect extends. Here, we use an electromagnetic articulography (EMA) study of Greek to examine the scope of boundary-related lengthening as a function of lexical stress and pitch accent separately. Boundaries are elicited by the means of a variety of syntactic constructions.. The results show an effect of lexical stress. Phrase-final lengthening affects the articulatory gestures of the phrase-final syllable that are immediately adjacent to the boundary in words with final stress, but is initiated earlier within phrase-final words with non-final stress. Similarly, the articulatory configurations during inter-phrasal pauses reach their point of achievement later in words with final stress than in words with non-final stress. These effects of stress hold regardless of whether the phrase-final word is accented or de-accented. Phrase-initial lengthening, on the other hand, is consistently detected on the phrase-initial constriction, independently of where the stress is within the preceding, phrase-final, word. These results indicate that the lexical aspect of prominence plays a role in determining the scope of boundary-related lengthening in Greek. Based on these results, a gestural account of prosodic boundaries in Greek is proposed in which lexical and phrasal prosody interact in a systematic and coordinated fashion. The cross-linguistic dimensions of this account and its implications for prosodic structure are discussed.
Keywords: Prosodic boundaries, boundary-related lengthening, boundary-related shortening, gestural coordination, pauses, Articulatory Phonology, Greek
1.0 Introduction
1.1 Prosodic boundaries, boundary-related lengthening and its scope
The aim of this paper is to examine the scope of the temporal effect of prosodic boundaries, called boundary-related lengthening, to determine the role of lexical and phrasal prominence in it, and to develop a theoretical account that captures it. Prosodic boundaries are essential for organizing speech, chunking it into units adequate for speech processing (production and perception) and language acquisition. They emerge from grouping, one of the two main functions of prosody, which is the component of grammar that organizes speech by encoding grouping and prominence (cf. Fletcher, 2010 for a overview). Grouping groups speech units together forming larger cognitive constituents. For instance, syllables are grouped together into words, words into phrases, and phrases into larger phrases. Prosodic boundaries usually separate constituents of the same type, like phrases from each other (see Selkirk, 1996 for a proposal in which some boundaries separate constituents of different types). Prominence marks syllables within words (i.e., stressed syllables) and words within phrases (i.e., accented words) as rhythmically or conceptually important relatively to their non-prominent counterparts.
Standard phonological theories view prosody as a hierarchical structure, with grouping and prominence arising from this organization (e.g., Beckman & Pierrehumbert, 1986; Hayes, 1989; Nespor & Vogel, 1986; Selkirk, 1984). Lower-level constituents (e.g., words) are grouped together forming higher-level constituents (e.g., phrases). Although the number of levels that the prosodic hierarchy of a language has is debated, there is general agreement on the need for at least a minor and a major phrase above the word level (see Turk & Shattuck-Hugnagel, 1996 for an overview). Minor and major phrases are often referred to as Intonational Phrases (IP) and intermediate phrases (ip) respectively, adopting the terms proposed by the Autosegmental-Metrical model of intonational phonology (Beckman and Pierrehumbert, 1986; Pierrehumbert, 1980). As for prominence, lexical stress (lexical prominence) is marked for most languages at the prosodic word level, and accentuation (phrasal prominence) is marked at the intermediate phrase level mainly via specific pitch movements called pitch accents. Viewed that way, the role of prosody can be rephrased as encoding hierarchies of groupings and prominences.
Prosodic boundaries are associated with spatio-temporal, tonal and pausal events that characterize their strength, specifying therefore their prosodic level and grammatical and communicative function. On the spatio-temporal domain, boundary-adjacent articulations are temporally longer both pre- and post-boundary (often referred to as boundary-related lengthening), spatially larger, especially the first articulation post-boundary (referred to as strengthening), and with the temporal intervals corresponding to their duration overlapping less with each other across the boundary as opposed to their non boundary-adjacent counterparts (e.g., lengthening: Beckman & Edwards, 1992; Byrd & Saltzman, 1998; strengthening: Cho & Keating, 2001, 2009; Fougeron, 2001; Fougeron & Keating, 1997; Keating, Cho, Fougeron & Hsu, 2004; overlap: Byrd, 2000; Byrd, Kaun, Narayanan & Saltzman, 2000; Byrd & Saltzman, 1998). Acoustically, both phrase-final and phrase-initial segments are longer than phrase-medial ones (e.g., phrase-finally: Cooper & Paccia-Cooper, 1980; Klatt, 1975; Lehiste, 1973; Oller, 1973; Shattuck-Hufnagel & Turk, 1998; Turk, 1999; Turk & Shattuck-Hufnagel, 2007; Umeda, 1975; Wightman, Shattuck-Hufnagel, Ostendorf & Price, 1992; phrase-initially: e.g., Cho, McQueen & Cox, 2007; Klatt, 1975; Oller, 1973; Shattuck-Hufnagel & Turk, 1998; Tabain, 2003a). It is the magnitude of these spatio-temporal effects that marks boundary strength, since the effects increase cumulatively across the prosodic hierarchy, becoming larger the stronger the boundary (e.g., pre-boundary lengthening: e.g., Byrd, 2000; Byrd & Saltzman, 1998; Cambier-Langeveld, 1997; Cho, 2006; Tabain, 2003b; Tabain & Perrier, 2005; post-boundary lengthening: Byrd & Saltzman, 1998; Cho, 2006; Cho & Keating, 2001; Fougeron, 2001; Keating et al., 2004; Tabain, 2003b; pre-boundary strengthening: Fougeron & Keating, 1997; Keating, Wright & Zhang, 1999; Tabain, 2003b; post-boundary strengthening: e.g., Cho & Keating, 2001; Fougeron, 2001; Fougeron & Keating, 1997; Keating et al., 1999; Keating et al., 2004; Tabain, 2003b; overlap: Byrd, 2000; Byrd & Choi, 2006; Cho, 2004). The cumulative nature of boundary lengthening has also been detected acoustically (cf. Fougeron & Keating, 1997; Keating et al., 1999).
On the tonal domain, specific falling or rising pitch movements occur at the end of phrases belonging in high prosodic levels (cf. Silverman, Beckman, Pitrelli et al., 1992). The ones associated with intermediate phrases are called phrase accents, and the ones associated with intonational phrases are called boundary tones, using the terms introduced by Autosegmetnal Metrical Phonology. Since the end of an intonational phrase (higher in the hierarchy) always coincides with the end of an intermediate phrase (lower in the hierarchy), a boundary tone is always preceded by a phrase accent. A subset of strong, phrase-level, boundaries might also include pauses. These are heard as silences. In terms of articulation, the small number of studies on the issue indicates that grammatical pauses (contrasted to ungrammatical ones, such as hesitations) involve articulatory configurations (Gick, Wilson, Koch & Cook, 2004; Katsika, Krivokapić, Mooshammer, Tiede & Goldstein, 2014, the latter being complementary study to the current one) with stable spatial and velocity characteristics (Ramanarayanan, Byrd, Goldstein & Narayanan, 2010; Ramanarayanan, Bresch, Byrd, Goldstein & Narayanan, 2009; Ramanarayanan, Goldstein, Byrd, & Narayanan, 2013) and which are regularly timed with respect to boundary tones (Katsika et al., 2014) (see Krivokapić, 2014 for an overview).
Our focus here is on the scope of boundary-related lengthening, i.e., the temporal interval over which the effect extends pre- and post-boundary including pauses. Although the effect itself is broadly attested and its scope is considered not to vary with prosodic level (cf. Cambier-Langeveld, 1997; Katsika, 2009), the exact domain of the affected speech and whether and how this is affected by prominence (lexical or phrasal or both) is not well understood.
Pre-boundary, a large number of studies targeted phrase-final words of different number and structure of syllables in different languages and language varieties using different types of measures, such as acoustic or articulatory (e.g., acoustic studies: British English: Campbell & Isard, 1991; American English: Nakatani, O’Conor & Aston, 1981; Oller, 1973; Turk & Shattuck-Hufnagel, 2007; British English: White, 2002; Dutch: Cambier-Langeveld, 1997; Hebrew: Berkovits 1993a, 1993b, 1994; Estonian: Krull, 1997; Greek: Katsika, 2009; articulatory studies (all on American English): Beckman & Edwards, 1992; Byrd, 2000; Byrd, Krivokapić & Lee, 2006; Byrd, Lee, Riggs & Adams, 2005; Byrd & Saltzman, 1998; Edwards, Beckman & Fletcher, 1991; Krivokapić, 2007a). The largest and most reliable part of boundary-related lengthening was detected on the rhyme of the phrase-final syllable. Moreover, the duration of that rhyme was found to be more important for perceiving prosodic boundaries in comparison to other candidates, such as the phrase-final foot (Wightman et al., 1992). Turning back to speech production, additional, but smaller, systematic effects were observed earlier than the final rhyme. In particular, lengthening extended to the onset of the final syllable when the final vowel was short (not a diphthong or reduced) (Cambier-Langeveld, 1997; Oller, 1973) or when lexical stress was not final (Oller, 1973). With respect to stress, there are additional indications that boundary-related lengthening could extend further away from the boundary, reaching the stressed syllable (e.g., Berkovits, 1994; Krull, 1997; Nakatani et al., 1981), even when the stress was more than two syllables away from the boundary (e.g., acoustics: Turk & Shattuck-Hufnagel, 2007; White, 2002; articulation: Byrd & Riggs, 2008). The limited research on the effect of remotely positioned stress on the scope of phrase-final lengthening so far is inconclusive. The effect yielded one domain of boundary-related lengthening extending from the coda of the stressed syllable to the boundary in British English (White, 2002), but two separate domains, naming the rhyme of the final syllable and the rhyme of the stressed syllable (with any intervening syllables remaining unaffected), in American English (Turk & Shattuck-Hufnagel, 2007 and replication by Rusaw, 2011, 2013; see also Shattuck-Hufnagel & Turk, 1998; Turk, 1999). It needs to be noted however that the amount of lengthening detected on the syllable intervening between the stressed and final syllables in British English was substantially small (approximately 10 ms). Further research in American English via articulatory data found speaker-specific effects of stress on the scope of boundary lengthening, but not systematic boundary-related lengthening within the stressed syllable when the latter was not phrase-final (Byrd & Riggs, 2008; see also Riggs & Byrd, 2011). Given that only consonantal articulations were examined in these studies, the possibility of articulations involved in the stressed vowel being systematically lengthened across speakers remains open. Speaker idiosyncrasies were also detected via acoustic measures in Greek, in which boundary lengthening reliably affected the phrase-final syllable and extended towards the stressed syllable on a speaker-specific basis (Katsika, 2009). A systematic effect of stress was not found in German, in which the whole phrase-final word lengthened acoustically as a function of boundary regardless of stress position (Silverman, 1990; see also Kohler, 1983).
Post-boundary, the scope of the effect has not received a lot of attention. So far, a series of studies, mainly articulatory, indicate that lengthening is limited to the initial segment (Byrd et al., 2006; Byrd & Saltzman, 1998; see also Bombien, Mooshammer, Hoole & Kühnert, 2010, and Cho & Keating, 2009; see also Katsika, 2009 (acoustic)). This pattern is independent from the position of stress within the phrase-initial word, with any such effects of stress being speaker-specific (Byrd & Riggs, 2008). An interaction between stress and post-boundary lengthening, this time cross-boundary, is reported in an acoustic study of American English (Shattuck-Hufnagel & Turk, 1998). This study, which examined one speaker’s reiterant speech, found phrase-initial lengthening only when the pre-boundary final syllable was stressed. The effect was limited to the initial syllable and was larger on the onset than the rhyme. Related to the scope of post-boundary effects, articulations following the initial syllable have been found to present small, but systematic, effects of shortening, considered compensatory in nature (Byrd et al., 2006).
In addition to the general patterns and stress-related speaker idiosyncrasies described above, there is large cross-speaker variation in the scope of boundary-related lengthening (e.g., Byrd et al., 2006; Byrd et al., 2005; Krivokapić, 2007a). Furthermore, given that most studies use a restricted set of materials, the material-specific patterns of boundary-related lengthening are not yet known. Regardless of what the scope of the effect is, most evidence concurs in that it is manifested in a progressive manner, decreasing with distance from boundary (e.g., Berkovits, 1993a, b, 1994; Cambier-Langeveld, 1997; Campbell & Isard, 1991; Nakatani et al., 1981; Oller, 1973, White, 2002).
With respect to inter-phrasal pauses (for an overview see Krivokapić, 2014), their duration was found to increase with prosodic complexity or boundary strength (Krivokapić, 2007b; Krivokapić & Byrd, 2012; see also Katsika, 2009) and with the length of the surrounding phrases (Ferreira, 1991; Fuchs, Petrone, Krivokapić & Hoole, 2013; Krivokapić, 2007a, b; Wheeldon & Lahiri, 1997). Pausal duration was also affected by complexity of syntactic structure (e.g., Ferreira, 1991).
To summarize, previous research has not conclusively identified and has not yet accounted for the patterning of boundary-related lengthening. The effect is systematic immediately preceding and following the boundary, with more remote manifestations possibly being specific to the language or the speaker. Nevertheless, the position of lexical stress in the vicinity of boundaries emerges as a possible determining factor. It is yet to be determined whether this is purely an effect of lexical stress or an effect of pitch accent instead. Most previous studies did not separate the lexical from the phrasal aspect of prominence. To our knowledge, Turk and Shattuck-Hufnagel (2007) is the only study that considered lexical stress separately from pitch accent, and concluded that it is the lexical aspect of prominence that was of relevance. No decisive generalizations can be drawn though, since this part of their analysis included only disyllabic words spoken by one speaker. Furthermore, it still remains to be assessed how far the effect extends after the final pre-boundary prominent syllable. Does it cross the boundary, especially when inter-phrasal pauses are involved? Are the pauses themselves affected?
To specifically address these issues, the current study focuses on Greek, a language in which lexical stress is used contrastively. Our earlier acoustic research on the scope of boundary-related lengthening as a function of prominence position in Greek detected systematic lengthening on the boundary-adjacent segments, speaker-specific interactions between prominence and boundary-related lengthening, and longer pause durations the stronger the boundary (Katsika, 2009). However, that study did not distinguish between lexical stress and pitch accent, and did not control for intonation. Here, we consider Electromagnetic Articulography data to systematically examine across a wide range of intonational contours what the role of both lexical prominence (stress) and phrasal prominence (pitch accent) is in influencing the scope of boundary-related lengthening, including how long in advance of a phrase edge these effects are initiated and their relation to inter-phrasal pauses. Pauses were not targeted by design, but were included in our investigation since the majority of the elicited boundaries contained pauses, which were also related to specific articulatory configurations. In light of these data, we also assess whether the effect extends over a continuous domain or not, and whether it does so progressively. Discussion of these issues is bridged with results of our complementary work on the coordination of boundary tones in Greek (Katsika et al., 2014). Our ultimate goal is to develop a theoretical account of the temporal dynamics of prosodic boundaries able to accommodate both typological variation and speaker idiosyncrasies, offering a better understanding of the human speech production system, language and cognition. In doing so, we also provide one of the first set of articulatory data on prosody in Greek. The hypotheses tested along with their expected outcomes are described in Section 1.4, after Articulatory Phonology, the theoretical framework adopted here, and Greek prosody are introduced in Sections 1.2 and 1.3 respectively.
1.2 Articulatory Phonology
This work is couched within Articulatory Phonology (e.g., Browman & Goldstein, 1992; Goldstein, Byrd & Saltzman, 2006) and its Task-Dynamics computational implementation (e.g., Saltzman & Munhall, 1989; Nam & Saltzman, 2003; Saltzman, Nam, Krivokapić & Goldstein, 2008). Within this framework, phonology and phonetics are isomorphic and linguistic entities result from dynamic systems. Phonological representations are composed of linguistically relevant gestures that control the speech organs called gestures. There are three types of gestures: constriction, tone and clock-slowing gestures.
Constriction and tone gestures are specified for abstract linguistics tasks, are realized by coordinated actions of specific articulators, have spatio-temporal properties, and are triggered by internal oscillators that are coupled to each other either in-phase (synchronously) or antiphase (sequentially) (constriction gestures: e.g., Goldstein, Byrd, & Saltzman, 2006; lexical tone gestures: Gao, 2008; see also Mücke, Nam, Hermes & Goldstein, 2012 for pitch accents, and Katsika et al., 2014 for boundary tones). Constriction and tone gestures differ from each other in that they involve different articulators and are specified for different tasks. Constriction gestures involve coordinated actions of the jaw, lips, tongue, velum and glottis in order to achieve specific constriction locations and degrees (e.g., labial closure). Tone gestures, on the other hand, engage the coordination of the lungs, the trachea, the larynx and a number of muscles, such as the thyroarytenoid, cricoarytenoid and cricothyroid muscles (cf. Hirose, 2010) in order to achieve linguistically relevant variations in the frequency of vibration of the vocal folds (i.e., the task of tone gestures is specified at the F0 space, e.g., low F0 or high F0) (Gao, 2008; cf. Fougeron & Jun, 1998; McGowan & Saltzman, 1995).
Clock-slowing gestures (e.g., Byrd & Saltzman, 2003; Saltzman et al., 2008) are substantially different from constriction and tone gestures in that they are not related to specific articulators, and consequently, they lack spatial properties. Their task is to achieve linguistically relevant modulations of the spatial and temporal properties of the constriction gestures, and presumably of the tone gestures as well, that are coactive with them. The scope of their effects is determined on the basis of two dimensions: 1) the coordination of clock-slowing gestures with the other gestures, and 2) the temporal interval during which they are active. Both of these dimensions are currently not clearly defined due to the limited amount and complex nature of empirical data. The first type of clock-slowing gestures proposed was π-gestures, which are activated at prosodic boundaries, and their task is to locally slow down the central clock that determines the global speech pace (Byrd & Saltzman, 2003; see also Byrd, 2000; Byrd et al., 2000). The amount of slowing is proportional to their maximum activation level determined by the strength of the prosodic boundary, and their extent over time by their activation interval. Figure 1 schematically represents a π-gesture overlapping with the constriction gestures spanning a prosodic boundary.
Figure 1.
A schematic representation of a π-gesture (adapted from Byrd & Saltzman, 2003). The two-shade gray box represents the scope of slowing caused by the π-gesture, with darker gray corresponding to the π-gesture’s maximal level of activation. Constriction gestures co-active with the π-gesture (1 and 2 pre-boundary; 3 post-boundary) undergo slowing. Constriction gesture 4 does not overlap with the π-gesture, and is thus unaffected by boundary-related effects.
As the figure shows, π-gestures act locally at the boundary, over a continuous activation interval, and their activation shape is such that the maximal effects occur close to the boundary, and decrease with distance from it. Finally, boundaries of different strengths are not different in type of effect, only in degree. These properties capture the empirical findings that constriction gestures adjacent to boundaries become longer, larger and further apart, and that these effects extend over a continuous domain in a progressive (i.e., decreasing with distance from boundary) and cumulative (i.e., increasing with boundary strength) manner (Byrd & Saltzman, 2003). The temporal initiation of these effects depends on the coordination of π-gestures with the other gestures. The original proposal was that π-gestures are coordinated with the edge of the phrase (Figure 2a), with this coordination being flexible, meaning that “the prosodic events may overlap the segmental events and need not precisely share an edge with an individual gesture, segment, syllable or phrase” (Byrd & Saltzman, 2003, p. 161). According to Byrd & Riggs (2008), if empirical data show that prominence plays a role in the initiation time of the boundary effects, the π-gesture model will need to adjust in one of the following two ways: i) the π-gesture is shifted as a whole towards the prominent syllable (coordination shift; Figure 2b), or ii) the edge of the π-gesture extends towards the prominent syllable and its activation interval becomes longer (extension; Figure 2c).
Figure 2.
The three possible coordination alternatives for a π-gesture with the phrase-final words (adapted from Byrd & Riggs, 2008). The words used in this figure are disyllabic stressed on the penult. Syllables are represented by ‘ σ ‘, and stress by ‘ ´ ‘.
Recently, the concept of clock-slowing gestures has expanded to capture prominence-related effects as well (Saltzman et al., 2008). The newly revised clock-slowing gestures are called µ-gestures, standing for modulation gestures. For historical reasons, the boundary-related gestures are still referred to as π-gestures, and the name µ-gestures is reserved here for the prominence-related gestures. With this definition, µ-gestures are active during prominence (lexical or phrasal) and, like π-gestures, control the spatio-temporal profile of the gestures that are co-active with them. This type of clock-slowing gestures (i.e., the µ-gestures) were proposed along with a network of hierarchically nested coupled oscillators standing for prosodic levels (Saltzman et al., 2008). However, the current version of the model has not yet incorporated most of the prosodic effects, including boundary-related lengthening, as resulting from this nested hierarchy. For this reason, this network is not further discussed here.
1.3 Greek prosody
This section summarizes some aspects of Greek prosody that are relevant for this study (see Arvaniti, 2007 for an overview). Two phrasal levels have been assumed for Greek, naming intermediate phrases (ip) and Intonational Phrases (IP) (Arvaniti & Baltazani, 2005). These are associated with specific phrase accents and boundary tones respectively (see Arvaniti & Baltazani, 2005 for a complete list). They also present boundary-related lengthening in a cumulative fashion, meaning the amount of lengthening is larger in IP (higher level) than in ip (lower level) (Kainada, 2007). The current study examines the scope of that lengthening as a function of lexical stress and nuclear pitch accent (i.e., the last pitch accent of a phrase). In Greek, lexical stress is placed in one of the final three syllables of the word, and is unpredictable and contrastive (e.g., [ti' lε fɔ nɐ ] “phones, n.” – [tiε ' fɔ nɐ ] “call, 2nd person imp.” – [tilε fɔ ' nɐ ] “3rd person ind.”). The main phonetic correlates of Greek stress are duration and amplitude (see Arvaniti, 2007 and references therein). Following the Tone and Break Indices system (ToBI) notation (English: Silverman et al., 1992; Greek: Arvaniti & Baltazani, 2005), nuclear pitch accents are either singleton tones (L* or H*) or bitonal tones (L+H* or H*+L), related to the stressed syllable of the accented word. Specifically, the F0 peaks of the H* and L+H* and the F0 minimum of L* co-occur with the stressed vowel, while the F0 peak of the H*+L occurs just before the stressed syllable (Arvaniti & Baltazani, 2005; Arvaniti, Ladd & Mennen, 2006b).
1.4 Hypotheses and predictions
This Electromagnetic Articulography study examines the scope of boundary-related lengthening (including any pauses) as a function of lexical stress and pitch accent in Greek. In other words, the following questions need to be answered: 1) When is the effect initiated, 2) when is the effect terminated, 3) does the initiation and termination of the effect and/or the temporal profile of pauses depend on the position of lexical stress or pitch accent, and 4) does the effect stretch continuously between its onset and its offset?
Here, we define the initiation of boundary-related lengthening as coinciding with the most distant constriction gesture before the boundary undergoing the effect. Due to the complex nature of previous findings, it is difficult to formulate a concrete prediction as to when this initiation might occur. Grounded on evidence from our preliminary acoustic study (Katsika, 2009), it is expected that in Greek boundary-related lengthening be initiated within the boundary-adjacent syllable (see similar findings in English e.g., Byrd et al., 2006; Krivokapić, 2007a; Wightman et al., 1992), with initiation within the earlier prominent syllable of the phrase-final word being speaker-dependent (also in agreement with Byrd & Riggs, 2008; Riggs & Byrd, 2011). However, this does not preclude the possibility of the effect being initiated closer to, but possibly not within, the prominent syllable (not including the prominent syllable, with the patterns being speaker-specific: Byrd & Riggs, 2008; Riggs & Byrd, 2011; including the prominent syllable: Turk & Shattuck-Hufnagel, 2007; White, 2002). Compared to acoustic data, articulatory data allow a finer segmentation of the consonants and the vowels, and such a subtle systematic effect of prominence on boundary-related lengthening could be detected. It is not clear, however, whether this effect would be due to lexical stress or to pitch accent, despite some indications supporting the former (Turk and Shattuck-Hufnagel, 2007). It is unlikely that in Greek boundary-related lengthening is initiated upon the onset of the phrase-final word, as assumed to be the case in German (Kohler, 1983; Silverman, 1990).
The termination of the boundary-related lengthening is taken to be the last constriction gesture affected after the boundary. The majority of previous studies concur that lengthening does not stretch long after the boundary, but remains local within the phrase-initial syllable, especially within its first segment, with more remote expressions of the effect being speaker-dependent (e.g., Byrd et al., 2006; Byrd & Riggs, 2008; Byrd & Saltzman, 1998; Krivokapić, 2007a). However, there is acoustics-based evidence suggesting that boundary lengthening appears post-boundary only if the last pre-boundary prominence is on the phrase-final syllable (Shattuck-Hufnagel & Turk, 1998). If this is indeed the case, then the offset of boundary will be located pre-boundary, and its extension post-boundary will be considered conditioned by prominence. It cannot be predicted yet whether this conditioning, if detected, will be stress- or pitch accent-driven.
In order to examine whether boundary lengthening affects a continuous interval of speech, all the constriction gestures that undergo the effect need to be detected. Clear predictions cannot be drawn as to whether the effect would extend over a continuous domain (cf. White, 2002) or not (cf. Turk & Shattuck-Hufnagel, 2007; and replication of this study by Rusaw, 2011, 2013).The effect is predicted to be progressive, i.e., decreasing with distance from the boundary (e.g., Berkovits, 1993a, b, 1994). Continuity and progressiveness are also in accordance with the predictions of the π-gesture model (Byrd & Saltzman, 2003).
As for pauses, no specific predictions on their relation with stress and pitch accent can be induced due to absence of previous research on the matter.
2.0 Methods
2.1 Participants
This study had eight (5 female, 3 male) participants, ages between 19 and 31. All participants were native speakers of standard Greek, who at the time of their participation were associates of Yale University and had been in the United States of America between 1 and 6 years. They were naïve to the purpose of the study and had no self-reported speech, hearing or vision problems. Participants gave informed consent and received financial compensation for their participation. The Yale University Human Investigation Committee approved the protocols reported here.
2.2 Experimental design and stimuli
To detect the scope of boundary lengthening two Boundary conditions were compared to each other, namely intonational phrase (IP) boundaries were compared to word (W) boundaries (see Figure 3 and Table 1 for a summary of the experimental design and a list of the stimuli). The stimuli of the IP set consisted of two sequential intonational phrases, referred to as IP1 and IP2. In the W set, the stimuli consisted of single intonational phrases, used as controls for the IP set. The scope of boundary lengthening was examined over the final word of IP1 and the initial word of IP2. These words were phrase-medial and consecutive in the W condition. To examine the effect of pre-boundary lexical stress (Stress) on the scope of boundary lengthening, the IP1-final word was one of the following neologisms: MAmima, maMIma and mamiMA (capital letters stand for lexical stress). These neologisms compose a stress minimal set, consisting of identical sequence of segments but stressed contrastively in one of the following ways:
On their first syllable, i.e., the antepenult, yielding stress-initial words (S1).
On their second syllable, i.e., the penult, yielding stress-medial words (S2).
On their third syllable, i.e., the ultima, yielding stress-final words (S3).
Neologisms were used to minimize constriction gesture variability while ensuring F0 continuity and optimizing articulator traceability1. Each neologism stood for a different narcotic plant. This meaning was chosen in order to suit the context of all the types of stimuli sentences used, described in detail below. The participants learnt these meanings during the training session that preceded the experiment by one to three days by the means of example sentences. Other than the IP1-final words, the stimuli contained real words of Greek.
Figure 3.
Experimental Design
Table 1.
The stimuli for stress-initial words (MAmima). A rough translation into English of the context sentence (if present) is given first, and the target sentence in IPA along with a rough translation into English follows. The words bearing the nuclear pitch accent are marked with bold letters. Punctuation marks stand for phrase boundaries.
(1) Negative Declarative Showing Reservation (ND): What they are doing is horrible! ðε ðʝ ɐ ci' nun ' ɐ kɔ pi ' mɐ mimɐ . mε tɐ ' ksi mɐ θi' tɔ n kɐ ɾ ɐ mε ' litsε s pu' lun. It is not that they merchandize raw MAmima. It is just ‘candies’ they sell to students. |
(2) Wh-Question (WhQ): We are looking for raw MAmima pu ' psɐ xnε tε ' ʝ ɐ kɔ pi ' mɐ mimɐ ? mε tɐ ' ksi me θi' tɔ n ε ' vɾ ε ɔ s ðʝ ɐ ci' nitε . Where are you looking for raw MAmima? Usually one can find some among students. |
(3) Imperative Request (IR): You seem as if you want to ask me for a favor. ' vɾ ε smu ' liʝ i' ɐ kɔ pi' mɐ mimɐ . mε tɐ ' ksi mɐ θi' tɔ n ε ' vɾ ε ɔ s ðʝ ɐ ci' nitε . |
Find some raw MAmima for me. Usually one can find some among students. |
(4) Affirmative Declarative (AD): What are you looking for in our school? ɐ nɐ zi' tɔ ' ɐ kɔ pi' mɐ mimɐ. mε tɐ ' ksi mɐ θi' tɔ n ε ' vɾ ε ɔ s ðʝ ɐ ci' nitε . I am looking for raw MAmima. Usually one can find some among students. |
(5) Yes-no question (YNQ): ɐ nɐ zi' tɐ s ' ɐ kɔ pi' mɐ mimɐ ? mε tɐ ' ksi mɐ θi' tɔ n ε ' vɾ ε ɔ s ðʝ ɐ ci' nitε . Are you looking for raw MAmima? Usually one can find some among students. |
(6) Causative clause (CC): ɐ ' fu ' vɾ iskun ' ɐ kɔ pi' mɐ mimɐ, mε tɐ ' ksi mɐ θi' tɔ n liciu tin ðʝ ɐ ci' nun. Since it happens to have in their possession raw MAmima, they merchandize it to students. |
(7) Parenthetical Clause (PC): What do these people merchandize? fi' tɐ – ' mɐ lɔ n ' ɐ kɔ pi ' mɐ mimɐ – mε tɐ ' ksi mɐ θi' tɔ n ʝ imnɐ ' siu ðʝ ɐ ci' nun. They merchandise plants - most likely raw MAmima - to high school students. |
(8) Control Negative Declarative Showing Reservation (Control ND): What they are doing is unacceptable! ðε ðʝ ɐ ci' nun ' ɐ kɔ pi' mɐ mimɐ mε tɐ ' ksi mɐ θi' tɔ n kʝ ɐ ' nilikɔ n e' fivɔ n. It is not that they merchandize raw MAmima to students and underage teenagers. |
(9) Affirmative declarative, pre-boundary control (AD-IP1): What types of narcotic plants do they merchandize to students and underage teenagers? ðʝ ɐ ci' nunε ' ɐ kɔ pi' mɐ mimɐ mε tɐ ' ksi mɐ θi ' tɔ n kʝ ɐ ' nilikɔ n e' fivɔ n. They merchandize raw MAmima to students and underage teenagers. |
(10) Affirmative declarative, post-boundary control (AD-IP2): To whom do they merchandize raw MAmima? ðʝ ɐ ci' nunε ' ɐ kɔ pi' mɐ mimɐ mε tɐ ' ksi mɐ θi' tɔ n kʝ ɐ ' nilikɔ n e' fivɔ n. They merchandize raw MAmima to students and underage teenagers. |
To assess a possible effect of the lexical stress of the pre-boundary word on the scope of lengthening post-boundary, the trisyllabic word metaKSI (“among”, capital letters represent the lexically stressed syllable) was used IP2-initially across all stimuli. This word was selected because it is stressed on the ultima ensuring maximum distance of the stress form the preceding boundary, and due to the sequence of consonants of different place of articulation (i.e., /m/, /t/ and /ks/), which adds articulatory variability by being very different from the immediately preceding test word, while guaranteeing three clear consecutive articulatory targets (i.e., labial vs. tongue tip vs. tongue dorsum constriction). However, this implies that the V constrictions of metaKSI are formed by the same speech organ as two of its C constrictions, involving thus a great amount of coarticulation.
To examine the role of pitch accent (Accent) and disentangle it from that one of lexical stress (Stress), two sets of syntactic constructions were used for eliciting IP1, one set in which the IP1-final word was accented (A) and another one in which it was de-accented (D) (see Figure 3 and Table 1). Specifically, following Greek ToBI (Arvaniti & Baltazani, 2005), affirmative declaratives (AD), yes-no questions (YNQ), causative clauses (CC) and parenthetical clauses (PC) were used for eliciting accented IP1-final words, and negative declaratives showing reservation (ND), wh-questions (WhQ) and imperative requests (IR) were used for eliciting de-accented IP1-final words. The boundaries elicited by the accented set (A) differ both in terms of intonation and in terms of impressionistic strength. With respect to intonation, affirmative declaratives and parenthetical clauses involve a L-L% combination of phrase accent and boundary tone, yes-no questions a H-L% combination, and causative clauses a L-H% combination (see Arvaniti & Baltazani, 2005). As for boundary strength, affirmative declaratives and yes-no questions are produced with a stronger IP boundary than causative clauses and parenthetical clauses, as corroborated by the fact that the two latter types correspond to subordinate clauses and are represented in the orthography with a comma. The three de-accented constructions, on the other hand, involve the same intonation contour (L-H* L-!H%; see also Arvaniti & Baltazani, 2005) and the same strength of boundary. Three additional sets of stimuli were used to elicit the control W boundary conditions (i.e., with no IP boundary between the two test words). Specifically, a set of affirmative declaratives and a set of negative declaratives showing reservation were used to elicit the IP1-final words of the IP set in accented and de-accented phrase-medial positions respectively, while another set of affirmative declaratives were used to elicit the IP2-initial words of the IP set in accented phrase-medial positions. In all 10 constructions employed, the last pitch accent of the phrase, namely the nuclear accent, denotes broad focus.2 The experimental design is summarized in Figure 3.
Hence, ten stimuli sentences were employed for each member of the stress minimal triplet (i.e. MAmima, maMIma, and mamiMA), yielding 30 stimuli in total. In all stimuli, seven syllables preceded and thirteen syllables followed the neologisms, with the two syllables neighboring the neologism on each side being unstressed. Contextualizing sentences were employed in order to elicit the expected intonation contour for all constructions except yes-no questions and causative clauses, for which context was not necessary.
Table 1 lists the stimuli for the stress-initial IP1-final words (i.e., SI: MAmima). For stress-medial (S2: maMIma) and stress-final (S3: mamiMA) test words, the same sentence frames were used. Nine blocks of the test material were constructed, each containing one repetition of the thirty test sentences in a randomized order, summing up to 270 sentences per participant.
2.3 Apparatus and recording procedure
For the experiment, the AG500 three-dimensional electromagnetic transduction device (Carstens Medizinelektronik) at the physiology lab at Haskins Laboratories was used. Five receiver coils were attached to areas of interest, namely the tongue dorsum, tongue body, tongue tip, upper lip and lower lip. Six additional receiver coils were attached to reference points, and specifically to the upper incisor, left and front sides of the jaw, left and right ears, and nose. Each experimental session was preceded by the standard calibration procedure, as developed by Hoole, Zierdt and Geng (2003). Acoustic data were acquired along with the articulatory data using a Sennheiser shotgun microphone at a sampling rate of 16 kHz. The microphone was positioned roughly 12 inches away from the participant.
The participants received training familiarizing them with the neologisms, the targeted intonational contours, and the stimuli presentation 1–3 days before the experimental session. The training session lasted 20–30 minutes. The experimental session lasted between 2 and 3 hours, and included coil attachment, review of the instructions for the task, practice trials, data acquisition, breaks, and coil removal. Both the instructions and the speech materials were presented on a computer screen, using custom software (developed by Mark Tiede, Haskins Laboratories). The computer monitor was placed around 60 inches away from the participant. The participants were instructed to pay attention to the position of lexical stress on the test words (Greek orthography has a specific symbol (‘ ´ ‘) for stress, e.g., ‘µάµιµα’, ‘µαµίµα’, ‘µαµιµά’), the punctuation signs, which indicated the phrasing of the sentences, and the words in bold, which denoted that these words were bearing the main information of the sentence. As for the presentation of the stimuli, context sentences appeared first in green letters. Some seconds later their respective target sentence appeared in blue letters. The participants read the context sentences silently and the target ones aloud at their normal speech rate. Sentences produced with speech errors, interruptions or disfluencies were repeated. In order to minimize head movement, participants were shown a real-time display of upper incisor position.
2.4 Analysis
The TAPADM (Three-dimensional Articulographic Position and Align Determination with MATLAB™, developed by Andreas Zierdt) pre-processing procedure was applied to the data (cf. Hoole et al., 2003). This procedure smoothens, corrects and translates the data to the occlusal plane, and also functions as a checking method for the reliability of the data. In parallel, the data were subject to a preliminary acoustic inspection to determine whether they were produced with the targeted F0 contour. These two sets of analyses combined revealed that the datasets from three participants were not eligible for further analysis due to the amount of dropped or noisy data from at least one receiver coil (one speaker) or inadequate F0 production (two sepakers). The remaining five participants are referred to as F01, F02, F03, F04 and M05 (four female and one male). From the data acquired from these five participants less than 3% were eliminated from the analysis due to abnormalities in their displacement or velocity signal. The data were further examined for their prosodic boundaries using GrToBI (Arvaniti & Baltazani, 2005), on the basis of which some tokens were excluded from further analysis because they were lacking a boundary where expected. For instance, speaker F04 did not produce prosodic boundaries in the parenthetical clause condition. With the exception of F04’s parenthetical clauses, the analysis included between 5 and 15 tokens per Stress condition in each syntactic construction per speaker. Thus, some conditions included more than 9 tokens, which, as a reminder, was the number of repetitions required for each sentence by the experimental design. This is because in some cases additional repetitions were acquired for a variety of reasons (e.g., resumption of the recording after interruption due to software error or after the participant’s request for a break).
The resulting datasets were subject to semi-automatic kinematic labeling using custom software (Mark Tiede, Haskins Laboratories). Specifically, the consonant (C) and vowel (V) constriction gestures comprising the last pre-boundary word (i.e., MAmima, maMIma or mamiMA) and the C gestures of the first post-boundary word (i.e., metaKSI) of each token were labeled, where boundary is either an Intonational Phrase (IP) boundary or a word (W) boundary. Pre-boundary, both the C and the V gestures were labeled, because each type of gesture involved a different principle articulator (the C gestures used the lips while the V gestures used the tongue dorsum), and thus their targets could be clearly located. Post-boundary, only the C gestures were labeled, because the experimental design involved different articulator for each consonant (lips for /m/, tongue tip for /t/, and tongue dorsum for /ks/) but allowed a greater amount of coarticulation between the V and the non-labial C constrictions. It is thus acknowledged that the measures taken from post-boundary non-labial C constrictions include the co-articulatory contribution of the neighboring V constriction. However, this does not influence our analysis since the comparison is between the two boundary (IP and W) counterparts of the same constriction. The labial C gestures (/m/) were labeled on the lip aperture tract, the coronal C gestures (/t/) on the tongue tip vertical displacement tract, and the velar C gestures (/k/) along with the V gestures (/ɐ , i, ε /) on the tongue dorsum vertical displacement tract. The kinematic labels (shown in Figure 4) consisted of marking the following landmark timepoints for constriction formation of the C and V gestures: onset timepoint, peak-velocity timepoint, target timepoint, constriction maximum timepoint, and timepoint of release of constriction formation. Additionally, for the C gestures the constriction release phase of the gesture was also marked for the timepoint of peak velocity and timepoint of end of release (offset). These timepoints were identified on the basis of velocity criteria, i.e., velocity minima for constriction maxima and thresholds of velocity ranges between two consecutive alternating velocity extrema (i.e., one minimum and one maximum). These thresholds were set to 20%, except for the C onset and offset that were set to 10%, due to the small amplitude of lip aperture for the C constrictions (see Figure 5). The velocity of lip aperture was used for the labial consonants, and the tangential (xyz-) velocity for all the other constrictions. Using these timepoints, the duration of the formation and the release phases of each test C gesture and the duration of the formation phase of each V gesture was calculated. The duration of the release of V gestures was not calculated, because the release of a V gesture coincides with the formation of the following V gesture. As shown in Figure 4, formation is defined as the interval between the onset of the gesture and the release including the target, and release as the interval between the release and the offset of the gesture.
Figure 4.
Kinematic landmarks for constriction gestures
Figure 5.
The waveform, spectrogram, tongue dorsum vertical displacement (TDz) and lip aperture (LA) in a de-accented (WhQ) stress-medial (S2: maMIma) condition of speaker F04. The part of the token shown includes the IP1-final word, the IP2-initial word and the inter-phrasal pause. The kinematic labels for the IP1-final C (/m/) and V (/ɐ /) and for the pause posture (PP) formation are shown. For the C and V, the beginning of the empty box marks the onset (On), the beginning and end of the solid box mark the target (T) and release (R) respectively, and the vertical broken yellow line within the solid box marks the maximal constriction. The C (/m/) and V (/ε /) articulatory targets of the IP2-initial syllable are approximately located by the means of their phonetic symbols.
Figure 5 illustrates lip aperture (LA) and tongue dorsum vertical displacement (TDz) during pre- and post-boundary words. The example involves a de-accented (wh-question) phrase-final stress-medial (S2) word with the timepoints for the last pre-boundary C and V marked. The figure also contains a representative example of the articulatory configuration during acoustic pauses. It needs to be noted that this study was not designed to investigate pauses, but a large number of pauses was elicited, inspection of which revealed that they involve specific articulatory configurations, which we call here pause postures (PP). In these configurations, the tongue and the lips, after achieving the articulatory targets for the IP1-final C (/m/) and V (/ɐ /), maintain for a substantial amount of time a position within the middle range of tongue dorsum vertical displacement and lip aperture respectively, before they move to a more extreme position, from which they start their opposite advancement towards their next constriction target in IP2 (/m/ and /ε /). The fact that articulators, after their middle-range long-lasting posture, reach a more extreme point, which is also in the opposite direction from their upcoming constriction target, suggests that this posture is not just preparatory for an upcoming event, but rather, is related to the pause itself. These properties generalize over speakers, pointing to a default, and possibly language-specific, articulatory setting during pauses (cf. Gick et al., 2004). Albeit not by design, we examined the duration of the formation of these pause postures as a function of Stress and Accent. PP formation was measured at the tongue dorsum vertical displacement as the interval between the offset of the phrase-final V gesture and the onset of the long-lasting plateau related to the acoustic pause. The onset of this plateau was detected using the same method as for V maximal constrictions described above and illustrated in Figure 4.
Two sets of analyses were conducted. One set, called within-speaker, treated each speaker separately due to the large number of speaker-specific properties reported in previous research (e.g., Byrd et al., 2006; Byrd et al., 2005; Krivokapić, 2007a). The other set of analyses, called across-speaker, focused on the effects that generalize across speakers. Both sets of analyses examined the accented constructions separately from the de-accented ones, since the control sentences were different for the two groups. The set of accented constructions included the data pooled from AD, YNQ, CC and PC, and the set of de-accented constructions included the data pooled form ND, WhQ and IR. The scope of boundary-related lengthening was assessed by examining which test gestures were longer in IP than in W in each Stress. To examine speaker-specific effects, planned comparisons via pair-wise t-tests (α= 0.05) between the two types of boundary (IP and W) were conducted for each speaker. For the effects generalized across speakers, we employed repeated measures ANOVAs on the duration of each test gesture with Boundary (levels: IP and W) as the fixed factor and speaker (F01, F02, F03, F04 and M05) as the repeated factor. As for the speaker-specific properties of pauses, the durations of the PP formation of each speaker was subject to ANOVAs with Stress (levels: S1, S2, S3) as factor. In cases in which the ANOVAs showed significant effects (α = 0.05), pair-wise comparisons using the Bonferroni adjustment followed up (α = 0.05). We also report comparisons with 0.05 < α < 0.07 as marginally significant (m.s.). The generalized properties of PP formation were assessed by the means of repeated measures ANOVAs with Stress (levels: S1, S2, S3) as the fixed factor and speaker (F01, F02, F03, F04 and M05) as the repeated factor. An additional set of analysis examined the durations of each test gesture and of the PP formation as a function of Accent. For this analysis, CC (causative clauses, i.e., one of the accented conditions) and D (all the de-accented constructions with an IP boundary) were used. Of the accented constructions, only CC was selected because of its similarity in terms of boundary-marking intonation with the D. The statistical analyses were carried out in the R statistical environment (R Development Core Team, 2013).
3.0 Results
3.1 Within-speaker analysis
3.1.1 De-accented conditions
Tables 2a and 2b contain the results of the within-speaker analysis for de-accented IP1-final words and their sequential IP2-initial words respectively. Table 3 juxtaposes the pre-boundary and the post-boundary results, summarizing these comparisons. The constriction gestures are given in the order the consonants and the vowels appear in the orthography of the test words. However, during speech there is substantial temporal overlap during speech between the C and V gestures belonging in the same syllable.
Table 2a and 2b.
The planned t-test comparisons of the durations (in ms; with SD within parentheses underneath the respective durations) of the constriction gestures of the de-accented IP1-final words (2a: C1-V3) and their subsequent IP2-initial words (2b: C4-C6) across Boundaries (IP vs. W) per Stress (S1, S2, S3) for each speaker (F01, F02, F03, F04 and M05). Dark gray and light gray cells represent significantly longer (>) and shorter (<) IP than W respectively (p ≤ 0.05). White cells represent insignificant comparisons. Key: C = consonant, V = vowel, F = formation, R = release.
Table 3.
Summary of the boundary-related temporal modifications of the constriction gestures of the IP 1-final and IP2-initial words per speaker for each Stress in the de-accented conditions. The five signs within each cell correspond to speakers, who are presented in the order of their numbering (i.e., the first sign corresponds to F01, the second to F02, etc). The signs ‘>’, ‘<’ and ‘-’ stand for longer, shorter or equally long gestures between the IP and the W boundaries (p ≤ 0.05). Gray cells correspond to the gestures of the stressed syllable. Thick borders around cells point to the patterns observed. Solid and broken borders indicate longer and shorter IP than W gestures respectively.
Initiation of boundary-related lengthening
As Tables 2a and 3 explicitly depict, lexical stress has an effect on the point in time that boundary-related lengthening is initiated. When stress is final (S3), the initiation of the effect occurs within the C and V gestures that are immediately adjacent to the boundary (C3-R and V3). These two gestures are partially co-produced, and thus, the initiation point of the effect can be considered to coincide with both of them. When stress is not final, on the other hand, the effect is initiated further away from the boundary, with its exact initiation point being speaker-specific and not necessarily within the stressed syllable itself. F04 is the only speaker whose boundary-related lengthening reaches back to the stressed syllable in both non-final Stress conditions (S1 and S2). Note that Speaker F03, who initiates the lengthening slightly earlier (C3-F) within the phrase-final syllable of stress-final words (S3) as compared to the other speakers, still shows an earlier initiation of the effect in words with non-final stress (S1 and S2). The stress-affected pattern of boundary-related lengthening initiation holds for all speakers except F01, whose lengthening remains constant across Stress conditions over the release of the final C and the final V (C3-R and V3).
Termination of boundary-related lengthening
Post-boundary, lengthening systematically affects the forming gesture of the word-initial C (C4), which is the gesture immediately adjacent to the boundary. This pattern is observed for all speakers except M05, who shows post-boundary lengthening only on the release of the phrase-initial C (C4-R) in S2. Some instances of post-boundary lengthening later than the phrase-initial C are observed. These are either on the release of the initial C (C4-R) or closer to lexical stress of the word (C5-R, C6-F or C6-R). Since instances of the latter type are relatively scarce, it can be concluded that the termination of boundary-related lengthening occurs within the phrase-initial syllable.
Continuity of boundary-related lengthening
Between the effect’s initiation and termination points established above, lengthening can be considered extending over a continuous interval of speech, despite the small number of sporadic discontinuities observed. Pre-boundary, the three instances of discontinuity detected (i.e., F02’s C3-F and F04’s C2-R in S1 and F02’s V2 in S2) can be disregarded, since the skipped is co-produced with a gesture that presents boundary-related lengthening (e.g., the formation gesture of the final consonant of F02 in S1 does not lengthen, but its coproduced vowels gesture does). Post-boundary, the rare instances of discontinuity might be possibly related to the lexical stress of the phrase-initial word, since in these cases the effect skips several gestures after the ones in the initial syllable and affects either the last gesture before the stressed syllable or the first gesture of that syllable. Future research systematically varying the position of the fist post-boundary stress will assess the validity of this hypothesis. Note that speaker M05 presents a different type of discontinuity, in which the forming gesture of the initial C does not lengthen, but its subsequent release does. In this case, it is still the first C that lengthens.
Boundary-related shortening
Outside the scope of boundary-related lengthening, some gestures are also found to be shorter in IP than in W. This boundary-related shortening is rather unsystematic pre-boundary, but quite consistent post-boundary. Pre-boundary, the shortening effect is speaker-dependent, with usually no more than two speakers shortening the same gesture. Although the gestures undergoing the shortening are not identical across speakers, they are all early in the word, away from the boundary. An interesting observation is that shortening tends to either start or end closer to the boundary as the stress is later in the phrase-final word. For example, for F01 shortening ends in the initial V (V1) in S1, in the medial C release (C2-R) in S2 and in the final C formation (C3-F) in S3. This observation suggests that stress has a similar effect on both lengthening and shortening, with both temporal effects being somewhat attracted towards the stressed syllable. Post-boundary, the boundary-related shortening effect is mainly detected on the formation of the second post-boundary C (C5-F). Instances of shortening further away from the boundary are observed, but they are scarcer (e.g., F02’s C6-F in S1 and S2, F03’s C6-F and F04’s C6-R in S1).
To summarize, boundary-related lengthening is initiated within the phrase-final C and V gestures in words with final stress but earlier in words with non-final stress, and is terminated within the phrase-initial C gesture. The effect extends over a continuous interval of speech, outside of which, instances of boundary-related shortening are observed. Pre-boundary, boundary-related shortening tends to be attracted towards the stressed syllable Post-boundary, it systematically occurs one syllable away from the boundary.
3.1.2 Accented conditions
The results of the within-speaker analysis for the accented conditions are presented in Tables 4a and 4b, and summarized in Table 5 (for within speaker comparisons for each accented condition separately, the reader is referred to Appendix A).
Table 4a and 4b.
The planned t-test comparisons of the durations (in ms; with SD within parentheses underneath the respective durations) of the constriction gestures of the accented IP1-final words (a: C1-V3) and their subsequent IP2-initial words (b: C4-C6) across Boundaries (IP vs. W) per Stress (S1, S2, S3) for each speaker (F01, F02, F03, F04 and M05). Dark gray and light gray cells represent significantly longer (>) and shorter (<) IP than W respectively (p ≤ 0.05)). White cells represent insignificant comparisons. Key: C = consonant, V = vowel, F = formation, R = release.
Table 5.
Summary of the boundary-related temporal modifications of the constriction gestures of the IP1-final and IP2-initial words per speaker for each Stress in each accented condition. Within each cell the rows of five signs correspond to speakers, who are presented in the order of their numbering (i.e., the first sign corresponds to F01, the second to F02, etc). Within each cell, the signs ‘>’, ‘<’ and ‘-’ stand for longer, shorter or equally long gestures between the IP and the W boundaries (p ≤ 0.05). Gray cells correspond to the gestures of the stressed syllable. Thick borders around cells point to the patterns observed. Solid and broken borders indicate longer and shorter IP than W gestures respectively.
As these tables show, the same patterns hold for the accented constructions as for the de-accented ones with respect to all aspect of the boundary-related modifications assessed here.
Initiation of boundary-related lengthening
It is apparent from the tables (4a and 5) that the position of lexical stress plays a role in the initiation of the effect. Although the specifics are speaker-dependent, pre-boundary lengthening is initiated earlier in words with non-final stress than in words with final stress. The effect is limited to the boundary adjacent C releasing and V gestures (C3-R and V3) in words with final stress, and extends up to the releasing gesture of the onset consonant of the penultimate syllable (C2-R) in words with non-final stress. Like in the de-accented conditions, the effect of stress position does not hold for speaker F01, whose pre-boundary lengthening remains confined within the final C release and V gestures regardless of Stress.
Termination of boundary-related lengthening
As Tables 4b and 5 illustrate, post-boundary lengthening is systematically found on the forming gesture of the initial C (C4-F), less consistently on the release of that initial C (C4-R) or on one of the gestures closer or on the stressed syllable of the phrase-initial word (C5-R or C6-F). Based on the systematic effects, it can be deduced that boundary-related lengthening is terminated within the phrase-initial syllable. Similar patterns were observed and similar conclusions were drawn with respect to the de-accented conditions.
Continuity of boundary-related lengthening
In accented constructions, similarly to the de-accented ones, boundary-related lengthening extends over a continuous interval of speech, with its starting point being dependent on the position of stress within the phrase-final word and its ending point being the phrase-initial syllable. Some instances of discontinuity are observed, but they are sporadic and unsystematic.
Boundary-related shortening
Like in de-accented conditions, boundary-related shortening is detected outside the scope of lengthening. Pre-boundary, shortening is present early within the phrase-final word, albeit in a rather unsystematic way. For the speakers that present shortening, the effect is initiated within the formation of the first C (C1-F) in stress-initial words (S1), within the release of the first C (C1-R) or within the first (V1) in stress-medial (S2) and stress-final words (S3). These results, combined with similar findings in de-accented conditions, suggest that pre-boundary, the locus of the shortening effect, although not systematic, tends to be attracted to the position of stress. Post-boundary, like in the de-accented conditions, shortening is detected on the medial C gesture (C5) of the phrase-initial word across Stress conditions. Shortening is observed on other post-boundary gestures as well, but very sporadically.
In sum, the boundary-temporal modifications occurring in the context of accented phrase-final words are similar to those occurring in the context of de-accented phrase-final words. Lengthening systematically affects the boundary-adjacent gestures, namely the release of the phrase-final C, the phrase-final V, and the formation of the phrase-initial C. The position of the phrase-final lexical stress affects the scope of lengthening pre-boundary, with the effect being attracted towards the stressed syllable in words with non-final stress, but not post-boundary. In parallel, tendencies of shortening are observed. Pre-boundary, shortening seems to be stress-related, being initiated or terminated close to the stressed syllable. Post-boundary, shortening remains on the formation of C of the medial syllable, independently of the position of lexical stress on the pre-boundary phrase-final word.
3.2 Across-speaker analysis
The results of the repeated measures ANOVAs that were conducted in order to detect which of the within-speaker patterns generalize across speakers and constructions are summarized in Figure 6 (for the respective figures per speaker the reader is referred to Appendix B).
Figures 6.
Schematic illustration of the difference in duration (in ms) between the constriction gestures of the IP1-final (pre-boundary) and IP2-initial (post-boundary) words and their W counterparts for each Stress (S1, S2, S3) in de-accented and accented constructions. In each row, boxes stand for gestures presented in the orthographic order of the test words (i.e., pre-boundary order: C1-FC1-RV1, C2-FC2-RV2, C2-FC2-RV2, C3-FC3-RV3; post-boundary order: C4-FC4-RC5-FC5-RC6-F, and C6-R). Black boxes represent longer IP than W gestures (lengthening), white boxes in broken black borders shorter IP than W gestures (shortening), and light gray boxes stand for no significant difference between IP and W gestures.
As the figure illustrates, the main patterns detected by the within-speaker analysis are retained in the cross-speaker analysis, meaning the lexical stress-driven effect on lengthening and shortening pre-boundary, the consistent loci of lengthening and shortening post-boundary, and the continuous scope of the effect.
Initiation of boundary-related lengthening
In de-accented constructions, lengthening is limited to the last C and the last V gesture in stress-final words (S3: C3-R: F(1,4) = 21.98, V3: F(1,4) = 31.09; p < 0.05), but extends to the penultimate syllable in words with non-final stress ([S1: V2: F(1,4) = 6.907, p = 0.058 (m.s.); C3-F: F(1,4) = 12.33, p < 0.05; C3-R: F(1,4) = 106.2, p < 0.05; V3: F(1,4) = 77.71, p < 0.05]; [S2: C3-F: F(1,4) = 6.257, p = 0.066 (m.s.); C3-R: F(1,4) = 119, V3: F(1,4) = 12.73; p < 0.05]). In accented constructions, the effect of lexical stress on the initiation of boundary-related lengthening is preserved in stress-initial words (S1), but it disappears in stress-medial (S2) words. Specifically, in accented stress-final words (S3), boundary-related lengthening affects the boundary-adjacent C3-R (F(1,4) = 7.874, p < 0.05) and V3 (F(1,4) = 13.84, p < 0.05). The effect extends towards the stressed syllable in stress-initial words (S1) reaching the V gestures of the penultimate syllable, exactly like in the de-accented constructions (V2: F(1,4) = 7.658, p = 0.05, C3-F: F(1,4) = 9.639, C3-R: F(1,4) = 27.73, V3: F(1,4) = 8.209; p < 0.05). However, in stress-medial words (S2) boundary-related lengthening remains within the boundary-adjacent C and V gestures (C3-R: F(1,4) = 21.12, V3: F(1,4) = 15.04; p < 0.05). Presumably, this is because in stress-medial words the accented penultimate undergoes accentuation-related lengthening confounding the boundary-related effect (see Section 3.3 for details on accentuation-related lengthening).
Termination of boundary-related lengthening
Phrase-initially, boundary-related lengthening systematically affects the forming gesture of the first C (C4-F) regardless of the pre-boundary position of lexical stress and/or pitch accent ([de-accented: S1: F(1, 4) = 30.34, p < 0.05; S2: F(1,4) = 19.49, p < 0.05; S3: F(1,4) = 9.677, p = 0.05]; [accented: S1: F(1, 4) = 24.31, p < 0.05; S2: F(1, 4) = 22.68, p < 0.05; S3: F(1, 4) = 24.66, p < 0.05]). Three rather unsystematic cases of lengthening after the initial C gesture are observed. In two cases, the releasing gesture of the first C (C4-R) lengthens as well (S1 in de-accented: F(1, 4) = 35.4, p < 0.05; S2 in accented: F(1, 4) = 11.21, p = 0.05). In the third case, the forming gesture of the last C (C6-F) lengthens in S2 in the accented constructions (F(1, 4) = 5.166, p < 0.05). The latter case might reflect an interaction between the boundary-related lengthening and the position of stress in the phrase-initial word, an issue that will be explicitly assessed in future research.
Continuity of boundary-related lengthening
As is clear from Figure 6, boundary lengthening affects a continuous interval of speech, starting from either the penult or the ultima depending on the position of stress within the phrase-final word and extending to the initial syllable post-boundary.
Progressiveness of boundary-related lengthening
Figure 6 also shows that when the effect extends further away from the final C and V gesture, it is progressive. This means that the amount of lengthening decreases with distance from the boundary. (The figures in the Appendix show that this pattern generalizes across speakers and constructions.)
Boundary-related shortening
Some gestures further away from the boundary are shorter in IP as compared to W. Pre-boundary, the distribution of these gestures is related to the position of lexical stress, which is especially apparent in the accented constructions. In stress-initial words (S1), it is only the first C gesture that lengthens (C1-F: de-accented: F(1, 4) =7.569, p = 0.05; accented: F(1,4) = 11.71, p < 0.05). In words with non-initial stress, shortening is observed later in the word reaching closer to the stressed syllable ([de-accented: S2: no shortening; S3: V1: F(1,4) = 28.02, p < 0.05]; [accented: S2: C1-R: F(1,4) = 16.42, p < 0.05; V1: F(1,4) = 22.26, p < 0.05; S3: C1-F: F(1,4) = 27.18, p < 0.05, V1: F(1,4) = 21.56, p < 0.05; V2: F(1,4) = 7.919, p < 0.05]). Post-boundary, shortening is systematically detected on the forming gesture of the penult’s C (C5-F) across Stress and Accent conditions ([de-accented: S1: F(1, 4) = 7.059, p = 0.05; S2: F(1, 4) = 13.12, p < 0.05; S3: F(1, 4) = 8.31, p < 0.05]; [accented: S1: F(1,4) = 32.15, p < 0.05; S2: F(1, 4) = 5.002, p = 0.09; S3: F(1,4) = 10.41, p < 0.05]).
3.3 Accentuation-related lengthening on the phrase-final words
This section isolates the effect of pitch accent on the duration of boundary-adjacent constriction gestures by comparing the accented to de-accented phrase-final words by the means of repeated measures ANOVAs with Accent (levels: A, D) as the fixed factor and speaker as the repeated factor. Table 9 and Figure 7 summarize the results of this analysis. As a reminder, for this analysis the accented conditions are represented solely by causative clauses (CC), since this is the most similar accented construction to the de-accented ones in terms of intonation. Both CC and the de-accented constructions involve a boundary-related rising pitch movement at the end of the test words, and differ in that the test words bear a pitch accent in CC but not in the de-accented constructions. It needs to be noted, however, that in causative clauses (like in parenthetical clauses as well) this pitch accent is the nuclear accent of a subordinate clause, which might influence the amount of the accentuation-related lengthening in such clauses in comparison to main clauses, like the other accented constructions used here (affirmative declaratives and yes-no questions).
Figure 7.
Schematic illustration of the difference in duration (in ms) between the constriction gestures of accented (A; only CC) and de-accented (D; pooled ND, WhQ and IR) IP1-final words. Black boxes represent longer A than D gestures, white boxes in broken black borders represent shorter A than D gestures, and gray boxes stand for no statistically significant difference between IP and W gestures.
The analysis revealed that in words stressed on the initial syllable (S1), accentuation-related lengthening is detected mainly on that syllable (C1-F: F(1,4) = 14.37, V1: F(1,4) = 17.47; p < 0.05). A smaller amount of lengthening is also detected on constriction gestures comprising the medial syllable (C2-F: F(1, 4) = 20.42, p < 0.05; C2-R: F(1,4) = 6.429, p = 0.06 (m.s.)). Accentuation-related lengthening on the syllable following the stressed one has been described as the spillover effect in the literature (cf. Dimitrova & Turk, 2012). In stress-medial words (S2), the most amount of lengthening is found on the vowel of the stressed syllable (V2: F(1,4) = 10.89, p < 0.05), with the onset of the syllable also lengthening (C2-F: F(1,4) = 29.71, p < 0.05). The spillover effect is apparent in this case also, affecting C3-F (F(1,4) = 7.541, p = 0.05). In addition, the syllable preceding the stressed one shows small amount of lengthening (C1-F: F(1,4) = 23.29, p < 0.05; V1: F(1,4) = 6.198, p = 0.07 (m.s.)). In stress-final phrase-final words (S3), accentuation-related lengthening is small. In the stressed syllable, the releasing gesture of the onset C lengthens (C3-R: F(1,4) = 19.5, p < 0.05). A small amount of lengthening is also observed in the consonant of the preceding syllable (C2-F: F(1,4) = 5.878, p = 0.07 (m.s.); C2-R: F(1,4) = 10.19, p < 0.05).
For the purposes of this study, it is interesting to note that the most accentual lengthening is detected on the vowel of the stressed syllable in stress-medial words (V2 in S2), with the following C gesture (C3-F) undergoing a significant amount of lengthening as well. This accentual-related lengthening might account for the absence of boundary-related lengthening in these positions (see Section 3.2).
3.4 Duration of pause posture formation as a function of stress and accent
Figure 8 summarize the within-speaker (8a) and cross-speaker (8b) analyses on the duration of pause posture (PP) formation.
Figures 8.
The duration (in ms; with standard error) of pause posture formation as a function of Stress and Accent per speaker (a) and across speakers (b).
The within-speaker analysis by the means of ANOVAs for each speaker separately with Stress (levels: S1, S2, S3) and Accent (levels: accented, de-accented) as factors performed on the duration (in ms) of PP formation showed a main effect of Stress for all speakers (F01: F(2, 222) = 34442, F02: F(2, 199) = 49132, F03: F(2, 165) = 55994, F04: F(2, 169) =28545, M05: F(2, 182) = 58949; p < 0.05). According to the followed-up pairwise comparisons, PP formation is longer in stress-final (S3) than in stress-initial (S1: p < 0.05) and stress-medial words (S2: F01, F02, F03 and M05: p < 0.05; the effect is marginal for F04: p = 0.06). The ANOVAs also revealed a main effect of Accent for speaker F03 (F(1, 165) = 13.613, p < 0.05), with PP formation being longer in accented than in de-accented phrase-final words (p < 0.05). An interaction effect between the two factors was detected for speakers F01 (F(2, 222) = 20553, p < 0.05) and M05 (F(2, 182) = 8.289, p < 0.05). For these speakers, a set of pair-wise comparisons per Accent condition and a set of pair-wise comparisons per Stress condition were performed. It was shown that for speaker F01 the Stress did not have a significant effect on the duration of PP formation in de-accented phrase-final words, but it did in accented phrase-final words, with pause postures formation being longer in stress-final (S3) than stress-initial (S1) (p < 0.05) or stress-medial (S2) (p < 0.05) words. Speaker M05 on the other hand presents shorter PP formation in stress-medial words (S2) than stress-initial (S1) (p < 0.05) and stress-final (S3) words in the de-accented condition (p < 0.05), while he shows the same pattern as the other speakers in the accented condition, in which PP formation is longer in stress-final (S3) words that in either stress-initial (S1) (p < 0.05) or stress-medial (S2) (p < 0.05) words. Moreover, longer PP formation is found in the de-accented condition than in the accented one in stress-initial words (S1) for speaker M05 (p < 0.05) and in stress-medial words (S2) for speaker F01 (p < 0.05). However, in stress-final words (S3), speaker F01 shows longer PP movements in the accented than in the de-accented condition (p < 0.05).
The repeated measures ANOVAs with Stress (levels: S1, S2, S3) and Accent (levels: accented, de-accented) as the fixed factors and speaker as the repeated factor confirmed the effect of the position of stress on PP formation (F(2, 20) = 33.039, p < 0.05) such that PP formation is longer in words with final stress (S3) than in words with non-final stress (S1 and S2, p < 0.05 for both), regardless of the accentual status (accented or de-accented) of the IP1-final word.
3.4 Summary of results
In Greek, the lexical stress of the phrase-final word affects the temporal profile of that word and the following pause, regardless of the presence or not of a pitch accent. Boundary-related lengthening is initiated within the final V gesture and the overlapping release of the onset C in words with final stress, but earlier in words with non-final stress, coinciding either with the formation of the final C or with the V gesture of the penultimate syllable, depending on the speaker. Boundary-related shortening, which is observed further away from the boundary, albeit sporadic and small in amount, is initiated and/or terminated close to the stressed syllable, and the most affected gesture either belongs to or is close to the stressed syllable. Finally, pause posture formation is longer in the context of stress-final words than either stress-medial or stress-initial words. Post-boundary, the effects are not affected by either the stress or the accentual status of the final word pre-boundary. Lengthening consistently affects the initial C, especially its formation phase, and shortening the formation phase of the following C. Boundary-related lengthening extends over a continuous interval of speech, and is progressive, decreasing with distance from the boundary.
4.0 Discussion
Our data show that the most systematic effect of pre-boundary lengthening in Greek is found on the C and V gestures that immediately precede the boundary. This pattern holds for all possible positions of stress. In stress-final words, the effect is retained within these gestures, while in words with non-final stress the effect is initiated earlier, but which exactly gestures are affected differ across speakers. For instance, for some speakers lengthening extends to the formation of the final C, remaining within the limits of the phrase-final syllable, while for others it extends to the penultimate syllable. These findings point to a dual conclusion about boundary-related lengthening in Greek. First, the effect is mainly limited to phrasal edges, and second, it is attracted towards the stressed syllable.
The first part of this conclusion is reinforced by the very concept of boundary events, the function of which is to mark the edges of phrases (cf. Beckman & Pierrehumbert, 1986; White, 2014). It is also in accordance with previous experimental findings. Articulatory studies have shown that boundary lengthening is confined to boundary-adjacent constrictions (Byrd et al., 2006; Byrd & Riggs, 2008; Krivokapić, 2007a), and acoustic studies have proposed phrase-final syllable, and specifically its rhyme, as the domain of the boundary lengthening effect (Berkovits, 1993a, 1993b, 1994; Cambier-Langeveld, 1997; Campbell & Isard, 1991; Nakatani et al., 1981; Oller, 1973; Wightman et al., 1992).
The second part of this conclusion, meaning that the position of lexical stress is a decisive factor in determining when boundary-related lengthening is initiated, finds empirical parallels from American and British English, in which the stressed syllable of the phrase-final word systematically lengthens as a function of boundary (Turk & Shattuck-Hufnagel, 2007 and White, 2002 respectively). According to theses studies, in both American and British English, the domain of boundary-related lengthening includes the stressed syllable. The fact that in Greek boundary-related lengthening expands towards the stressed syllable without necessarily including it might be due to typological differences between Greek and English. Since the data is not sufficient to support this hypothesis, further research on this matter is needed. Evidence for less systematic tendencies of attraction of boundary-related lengthening towards the stress exists elsewhere in the literature and with respect to several languages, like English (Byrd & Riggs, 2008; Nakatani et al., 1981; Oller, 1973), Hebrew (Berkovits, 1994) and Estonian (Krull, 1997). The current findings further contribute to this body of research, by distinguishing lexical stress from pitch accent, and highlighting the importance of lexical stress in determining the initiation of boundary-related lengthening regardless of the presence of a pitch accent or not. Accentuation, although not significantly affecting the scope of boundary-related lengthening, plays a role in the amount of the effect, as shown in Section 3.3.
The effect of phrase-final lexical stress extends to the pause postures following the phrase-final word, with the articulatory movements into these postures being longer in words with final stress as opposed to words with non-final stress. This effect of stress does not extend to the phrase-initial word that follows the pause, with post-boundary lengthening consistently affecting only the initial C gesture. This pattern is not necessarily different from the finding reported in Shattuck-Hufnagel (1998), according to which post-boundary lengthening in American English manifests itself only when the final word of the previous phrase is prominent (Shattuck-Hufnagel & Turk, 1998). The data from American English did not contain pauses. It is thus still possible that the last lexical stress of a phrase in Greek affects the boundary-related lengthening of the following phrase in cases of weaker boundaries that do not involve pauses. With respect to post-boundary lengthening in Greek, it is constantly and systematically detected on the movement forming the first constriction of the phrase, at least in boundaries involving pauses, agreeing with previous articulatory findings of American English (e.g., Byrd et al., 2006; Byrd & Riggs, 2008; Byrd & Saltzman, 1998; Bombien et al., 2010; Krivokapić, 2007a).
In addition to the scope of pre-boundary lengthening and the duration of pause posture formation, lexical stress in Greek is related to boundary-related shortening. Albeit scarce and small, pre-boundary shortening is manifested by all speakers, indicating that lexical stress and boundaries interact in a complex manner. To our knowledge, boundary-related shortening has been reported in the literature only in relation to post-boundary material, and has been attributed to a compensatory effect, caused by the return of constriction gestures to prosodically unaffected timing (Byrd et al., 2006). The pre-boundary shortening observed in Greek cannot be related to restoring the constrictions’ temporal properties that have been perturbed by lengthening. In our view, the pre-boundary shortening effect is a byproduct of a mutual attraction between lexical stress and boundary-related lengthening. Under this scenario, the stress-related lengthening effect in phrase-final words is less than the respective effect phrase-medially, yielding gestures that are shorter in the former case than in the latter. Shortening is also observed post-boundary in our data, manifesting itself in a more systematic way than pre-boundary, regularly affecting the forming gesture of the penultimate syllable’s onset consonant. The constant position of this effect is accountable by the compensation hypothesis put forward by Byrd et al. (2006). However, on the basis of the present data (only stress-final words are examined here post-boundary), possible interaction between the domain of post-boundary shortening and the position of phrase-initial lexical stress cannot be excluded.
Our findings also indicate that boundary lengthening extends over a single continuous domain (cf. Byrd & Riggs, 2008; White, 2002; Wightman et al., 1992; but see Turk & Shattuck-Hufnagel, 2007 and its replication by Rusaw 2011, 2013). The manifestation of the effect is such that the amount of lengthening decreases with distance from the boundary (like in e.g., Berkovits, 1993a, b, 1994).
In order to put all these pieces together, we propose the following account of boundary lengthening in Greek within the framework of Articulatory Phonology and the π-gesture model. This account is schematically represented in Figure 9 (the reader is referred to Katsika et al., 2014 for the part of the account that addresses boundary tones).
Figures 9.
Schematic representation of phrase-final and phrase-initial π-gestures and their coordination in the context of stress-initial (a), stress-medial (b) and stress-final (c) phrase-final words respectively. Steadily solid lines represent in-phase coordination, broken lines represent anti-phase coordination, and lines of crosses represent currently uncertain types of coordination. Stress is represented by ‘ ´ ‘. Gray triangles and white diamonds correspond to the strength levels of π-gesture activation that triggers boundary tone gestures (BT) and pause postures (PP) respectively. The activation intervals of µ-gestures are symbolic, and should not be compared to the activation intervals of the π-gestures. This model is an extension of the proposal in Katsika et al. (2014).
We propose that in Greek the π-gesture has a dual coordination, being coordinated both with the phrase-final V gesture and with the last stress-related µ-gesture of the phrase. The coordination of the π-gesture with the phrase-final V is assumed to be anti-phase, which means that the π-gesture is initiated as the V gesture reaches its articulatory target, explaining the fact that boundary-related lengthening affects the release phase, but not the formation phase, of the last C, and presumaby the part of the vowel that overlaps with that C release. The coordination of the π-gesture with the µ-gesture is uncertain on the basis of the current data, and further research is needed to determine this relationship. However, in-phase coordination between these two prosodic gestures is conceptually a more appropriate account of the attraction of the π-gesture towards the µ-gesture of the stressed syllable. When stress is final, the phrase-final V and the µ- gestures coincide, and as a result the π-gesture, and consequently its lengthening effect as well, is initiated within the phrase-final V. When stress is not final, the µ-gesture and the phrase-final V gesture occur in different syllables, and the π-gesture is pulled towards the stressed syllable via its coordination with the µ-gesture. The fact that this movement towards the stressed syllable is partial and not complete (the π-gesture moves towards the stressed syllable without necessarily reaching it) implies that the coordination with the µ-gesture and the coordination with the phrase-final V gesture are not equal in strength. Rather, the coordination with the µ-gesture is weaker than the coordination with the final V gesture. This weak coordination causes a slight shift of the π-gesture towards the µ-gesture, initiating boundary lengthening earlier within words with non-final stress as opposed to words with final stress (compare (a) and (b) to (c) in Figure 9). As argued in the previous paragraph, the fact that a shortening effect is observed near or on the stressed syllable suggests that the µ-gesture is slightly shifted towards the π-gesture as well. As the π-gesture shifts away from the boundary in words with non-final stress towards the µ-gesture, less of the pause posture is co-active with it, and as a result the movement into achieving the pause posture is shorter in phrase-final words with non-final stress than their final-stress counterparts. Thus, the account proposed here, although significantly different from the alternatives proposed in Byrd and Riggs (2008) and illustrated in Figure 3, is similar to the ‘coordination shift’ alternative in that the π-gesture shifts as a whole in reaction to stress position, and rejects the ‘extension’ alternative, in which the π-gesture’s onset extends to the stress while its offset remains stable.
The fact that the position of phrase-final stress exerts an effect on pause postures, but not on the scope of boundary-related lengthening over the following phrase-initial word suggests that phrase-initial lengthening is caused by another π-gesture, which is coordinated with and extends over the first constriction of the phrase. Whether the phrase-initial π-gesture is also coordinated with another gesture, like for example the µ-gesture that is related to the lexical stress of the phrase-initial word, cannot be addressed by the current experimental design, and remains an open question for future research. The presence of a second, phrase-initial, π-gesture might not be necessary in all boundaries (cf. Byrd, Lee & Campos-Astorkiza, 2008 for gestural overlap spanning boundaries; see also Byrd & Saltzman, 2003), but seems needed in the following two cases: 1) when no other phrase precedes, as for instance at the very beginning of a discourse, and 2) when a grammatical pause intervenes between the current and the preceding phrase (these pauses are not to be confused with ungrammatical pauses that are related to speech disfluencies). In the case of grammatical pauses, it can be assumed that pause postures are the result of the maximum clock-slowing effect of the π-gesture, which, when reached, slows the articulators so much that they actually stop. The assumption that the articulators stop is based on the very long durations of pauses. However, the position at which the articulators stop does not seem to be random or context-dependent, but targeted (cf. Ramanarayanan et al., 2013), possibly reflecting a default articulatory setting (cf. Gick et al., 2004). That the articulatory configuration related to pauses observed in our data is targeted, and thus planned, is also supported by our recent finding that pause postures are achieved at a fixed time from the onset of boundary tones (Katsika et al., 2014). This relationship between boundary tones and pause postures in combination with the widely accepted observation that boundary tones occur at strong boundaries while grammatical pauses occur in a subset of these strong boundaries was taken as an indication that prosodic events are a function of π-gesture’s level of activation (Katsika et al., 2014). Boundary tones are activated when π-gestures reach a specific high level of activation, and pause postures are activated when π-gestures reach another specific, even higher, level of activation (see Figure 9). This proposal is congruous with the notion that lengthening increases cumulatively (e.g., pre-boundary: Byrd, 2000, Byrd & Saltzman, 1998, Cho, 2006, Tabain, 2003b, Tabain & Perrier, 2005), but its scope remains stable across the levels of prosodic hierarchy (Cambier-Langeveld, 1997). It further extends this notion to capture boundary tones and pauses in a unifying manner, since both these types of prosodic events are dependent on whether the π-gesture reaches the appropriate levels, which are in turn related to specific amounts of lengthening. If the assumption that, when achieving these pause postures, the articulators stop is valid, a second, phrase-initial π-gesture would be needed to reinitiate articulatory movement. Everything else being equal, the phrase-final and phrase-initial π-gestures might overlap depending on the length of their temporal intervals. The proposal for two π-gestures being active at pause-containing boundaries fits well with Ferreira’s suggestion that pauses consist of two parts, one related to the phrase preceding the pause and the other related to the phrase following the pause (e.g., Ferreira, 1991).
This proposal, in addition to capturing the facts of Greek, puts forward a novel account that views prosodic boundaries in an integrated fashion, since the boundary prosodic events (i.e., boundary-related lengthening, boundary tones and pauses) emerge from a single system, and allows typological extensions (outlined below). This view has two main tenets. First, boundary prosodic events (i.e., boundary-related lengthening, boundary tones and pauses) are interdependent, and second, lexical prosody does not only interact with phrasal prosody, but functions as the interface between the latter and constriction gestures. The interdependency between boundary events, by addressing their very nature, is expected to hold cross-linguistically. Given the current little knowledge on the prosodic hierarchy of most languages and on the articulatory profile of pauses, this hypothesis still awaits validation. Future research will need to examine whether phrase-final lengthening is a universal phenomenon, whether boundary tones presuppose the presence of lengthening in order to mark typical boundaries, whether, in turn, grammatical, planned, pauses presuppose the presence of boundary tones, and what the timing relationship among lengthening, boundary tones and pauses is. Future research will also need to investigate and integrate into the account prosodic boundaries that have less common communicative functions, such as careful deliberation and attentional focus, in which boundary-related lengthening has been found to occur in the absence of boundary tones and vice versa, and which have been described for that reason as containing mismatches between the two types of events (Beckman & Elam, 1997).
As for the notion of the interface, the hypothesis that lexical prosody (e.g., stress) functions as the interface between phrasal prosody and constriction gestures is compelling and could offer an account for both boundaries and prominence. It is already well accepted that in prominence accentuation-related lengthening and the accompanying pitch accents are associated with stressed syllables. We could similarly assume that at boundaries, the stress-related µ-gesture, whose coordination with the constriction gestures comprising the lexical item is by definition fixed in the lexicon, sfunction as the ‘go’ signal for the phrase-final π-gesture, which by being phrasal does not have a fixed coordination within lexical items. Assuming that lexical prosody interfaces between the boundary events and articulation at least phrase-finally makes conceptual sense, since both the stress-related µ-gesture and the π-gesture are clock-slowing gestures that phrase-finally need to occur one after the other. Without the ‘go’ signal from lexical prosody, the cognitive system would have needed ‘to look ahead’ and detect the last lexical event in advance in order to coordinate the π-gesture with it. In parallel, the assumption that phrase-initial π-gestures are coordinated only with the first constriction gesture of the phrase is also conceptually reasonable. It seems plausible for our cognitive system to be able ‘to know’ to initiate a non-lexical event, such as a phrase-initial π-gesture, coupled with a lexical event that comes first in a list of other lexical events. This account, however speculative in nature, sets a basis for formalizing hypotheses and for investigating and revealing typological dimensions. For example, in different languages the π-gesture might have a stronger coordination with the µ-gesture than with the final V gesture or the coordination with the final V gesture might be completely lacking. This could explain patterns like the ones observed in American (e.g., Turk & Shattuck-Hufnagel, 2007) or British English (White, 2002), in which boundary-related lengthening consistently affects the stressed syllables regardless of its distance from the boundary. Alternatively, other languages might have the π-gesture coordinated just with a constriction gesture, such as the first constriction gesture of the phrase-final word, which could capture the pre-boundary lengthening patterns reported for German (Kohler, 1983; Silverman, 1990). It needs to be noted though that the patterns in German might still be related to lexical stress, since in German lexical stress is mainly positioned on the first syllable of each morpheme. The possibility of the π-gesture being coordinated solely with the phrase-final V is not excluded either, but due to the look-ahead problem mentioned above, it is not the most preferred option. It is thus expected to be encountered the least across the world’s languages and to present the shortest scope of the effect. If the hypothesis that lexical prosody serves as the interface, or is the most preferred interface, between phrasal prosody and the constriction gestures of lexical items is valid, this should hold across languages with different lexical prosodies. A representative list would include, in addition to languages with lexical stress, languages that employ lexical tones (e.g., Chinese), lexical pitch accent (e.g., Japanese), both lexical tone and lexical stress (e.g., Ma’ta), both lexical stress and lexical pitch accent (e.g., Serbo-Croatian), and accentual phrases (e.g., French) (for the lexical prosody of these languages see Fletcher, 2010 and references within).
The current study and its complementary part (Katsika et al., 2014) revealed significant effects of lexical stress on boundary-related phrasal events and proposed an account of prosodic boundaries in Greek with promising cross-linguistic extensions, highlighting the importance of kinematic data and speaker-specific investigation in detecting fine-tuned effects.
Table 6.
The mean duration (in ms) and standard deviation (within parentheses) of the constriction gestures of accented (A; only CC) and de-accented (D; pooled ND, WhQ and IR) IP1-final words per Stress. Dark gray cells correspond to longer gestures in A than in D, and light gray cell to shorter gestures in A than in D. White cells contain no significant differences between A and D.
Highlights.
Lexical stress affects the initiation of phrase-final lengthening.
Pitch accent does not affect the scope of phrase-final lengthening.
Lexical stress affects the duration of pause posture formation.
Lexical stress affects the distribution of boundary-related shortening.
The domain of boundary-related lengthening is continuous.
Acknowledgments
This work was supported by NIH grant NIDCD DC 008780 to Louis Goldstein, and NIH grant NIDCD DC 002717 to Douglas Whalen. The author is grateful to Jelena Krivokapić, Louis Goldstein, Christine Mooshammer and Mark Tiede for their support and assistance. Special thanks go to Amalia Arvaniti, Hosung Nam, Elliot Saltzman, Stefanie Shattuck-Hufnagel, and Douglas Whalen for their useful feedback.
Appendix A.
Tables A1a-A8b.
The planned t-test comparisons of the durations (in ms; with SD within parentheses underneath the respective durations) of the constriction gestures of the accented IP1-final words (a: C1-V3) and their subsequent IP2-initial words (b: C4-C6) across Boundaries (IP vs. W) per Stress (S1, S2, S3) for each speaker (F01, F02, F03, F04 and M05) and construction (ADYNQCCPC). Dark gray and light gray cells represent significantly longer (>) and shorter (<) IP than W respectively (p ≤ 0.05)). White cells represent insignificant comparisons. Key: C = consonant, V = vowel, F = formation, R = release.
Appendix B.
Figures B1-B5. Schematic illustration of the difference in duration (in ms) between the constriction gestures of the IP1-final (pre-boundary) and IP2-initial (post-boundary) words and their W counterparts for each Stress (S1, S2, S3) per speaker for each syntactic construction. In each row, boxes stand for gestures presented in the orthographic order of the test words (i.e., pre-boundary order: C1-FC1-RV1, C2-FC2-RV2, C2-FC2-RV2, C3-FC3-RV3; post-boundary order: C4-FC4-RC5-FC5-RC6-F, and C6-R). Black boxes represent longer IP than W gestures (lengthening), white boxes in broken black borders shorter IP than W gestures (shortening), and gray boxes stand for no statistically significant difference between IP and W gestures.
B1) De-accented constructions (D)
B2) Affirmative declaratives (AD)
B3) Yes-no questions (YNQ)
B4) Causative clauses (CC)
B5) Parenthetical clauses (PC)
Footnotes
The neologisms used nasals as consonants to ensure F0 continuity. The labial nasal /m/ was preferred to the coronal /n/, so that the articulators responsible for the consonants and the vowels used different speech organs (lips vs. tongue). In that way, the articulators were traceable while co-articulation effects were minimized. The alternation between the high-front vowel /i/ and the mid/low-back vowel /a/ ensured that the vocalic constriction targets were as far apart as possible, and thus the articulatory movement from one vowel to the other could be clearly tracked.
A subset of the constructions involving an IP boundary was used to examine the coordination of boundary tones. These were the yes-no questions, the causative clauses, the negative declaratives, the wh-questions and the imperative requests. In these constructions, the combination of phrase accent and boundary tone involved alternating tones detectable at F0 inflection points. The results of this analysis are reported and discussed in Katsika et al. (2014).
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Reference list
- Arvaniti A. Greek phonetics: The state of the Art. Journal of Greek Linguistics. 2007;8:97–208. [Google Scholar]
- Arvaniti A, Baltazani M. Intonational analysis and prosodic annotation of Greek spoken corpora. In: Jun S-A, editor. Prosodic Typology: The Phonology of Intonation and Phrasing. Oxford, U.K: Oxford University Press; 2005. pp. 84–117. [Google Scholar]
- Arvaniti A, Ladd DR, Mennen I. Tonal association and tonal alignment: evidence from Greek polar questions and contrastive statements. Language and Speech. 2006b;49:421–450. doi: 10.1177/00238309060490040101. [DOI] [PubMed] [Google Scholar]
- Beckman ME, Edwards J. Speech Perception, Production and Linguistics Structure. Tokyo, Japan: Ohmsha; 1992. Intonational categories and the articulatory control of duration; pp. 359–375. [Google Scholar]
- Beckman ME, Elam GA. Guidelines for ToBI labelling. Manuscript and accompanying speech materials. 1997 available from ling.ohio-state.edu/tobi.
- Beckman ME, Pierrehumbert JB. Intonational structure in Japanese and English. Phonology Yearbook. 1986;3:255–309. [Google Scholar]
- Berkovits R. Progressive utterance-final lengthening in syllables with final fricatives. Language and Speech. 1993a;36:89–98. doi: 10.1177/002383099303600105. [DOI] [PubMed] [Google Scholar]
- Berkovits R. Utterance-final lengthening and the duration of final-stop closures. Journal of Phonetics. 1993b;21:476–489. [Google Scholar]
- Berkovits R. Durational effects in final lengthening, gapping, and contrastive stress. Language and Speech. 1994;37:237–250. doi: 10.1177/002383099403700302. [DOI] [PubMed] [Google Scholar]
- Bombien L, Mooshammer C, Hoole P, Kuehnert B. Prosodic and segmental effects on EPG contact patterns of word-initial German clusters. Journal of Phonetics. 2010;38:388–403. [Google Scholar]
- Browman CP, Goldstein LM. Articulatory Phonology: An overview. Phonetica. 1992;45:155–180. doi: 10.1159/000261913. [DOI] [PubMed] [Google Scholar]
- Byrd D. Articulatory vowel lengthening and coordination at phrasal junctures. Phonetica. 2000;57:3–16. doi: 10.1159/000028456. [DOI] [PubMed] [Google Scholar]
- Byrd D, Choi S. Papers in Laboratory Phonology X. Berlin, Germany: Mouton de Gruyter; 2006. At the Juncture of prosody, phonology, and phonetics - The interaction of phrasal and syllable structure in shaping the timing of consonant gestures; pp. 31–60. [Google Scholar]
- Byrd D, Kaun A, Narayanan S, Saltzman E. Phrasal signatures in articulation. In: Kingston J, Beckman ME, editors. Papers in Laboratory Phonology V: Acquisition and the Lexicon. Cambridge, U.K: Cambridge University Press; 2000. pp. 70–87. [Google Scholar]
- Byrd D, Krivokapić J, Lee S. How far, how long: On the temporal scope of phrase boundary effects. Journal of the Acoustical Society of America. 2006;123:4456–4465. doi: 10.1121/1.2912444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Byrd D, Lee S, Campos-Astorkiza R. Phrase-boundary effects on the temporal kinematics of sequential tongue tip consonants. Journal of the Acoustical Society of America. 2008;120:1589–1599. doi: 10.1121/1.2912444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Byrd D, Riggs D. Locality interactions with prominence in determining the scope of phrasal lengthening. Journal of the International Phonetic Association. 2008;38:187–202. doi: 10.1017/S0025100308003460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Byrd D, Saltzman E. Intragestural dynamics of multiple phrasal boundaries. Journal of Phonetics. 1998;26:173–199. [Google Scholar]
- Byrd D, Saltzman E. The elastic phrase: Modeling the dynamics of boundary-adjacent lengthening. Journal of Phonetics. 2003;31:149–180. [Google Scholar]
- Cambier-Langeveld T. The domain of final lengthening in the production of Dutch. In: Coerts J, de Hoop H, editors. Linguistics in the Netherlands. Amsterdam, Netherlands: John Benjamins; 1997. pp. 13–24. [Google Scholar]
- Campbell WN, Isard SD. Segment durations in a syllable frame. Journal of Phonetics. 1991;19:37–47. [Google Scholar]
- Cho T. Prosodically conditioned strengthening and vowel-to-vowel coarticulation in English. Journal of Phonetics. 2004;32:141–176. [Google Scholar]
- Cho T. Papers in Laboratory Phonology VIII: Varieties of Phonological Competence (Phonology and Phonetics) Berlin, Germany: Mouton de Gruyter; 2006. Manifestation of prosodic structure in articulatory variation: Evidence from lip kinematics in English; pp. 519–548. [Google Scholar]
- Cho T. Prosodic strengthening in transboundary V-to-V lingual movement in American English. Phonetica. 2008;65:45–61. doi: 10.1159/000130015. [DOI] [PubMed] [Google Scholar]
- Cho T, Keating P. Articulatory and acoustic studies on domain-initial strengthening in Korean. Journal of Phonetics. 2001;29:155–190. [Google Scholar]
- Cho T, Keating P. Effects of initial position versus prominence in English. Journal of Phonetics. 2009;37:466–485. [Google Scholar]
- Cho T, McQueen J, Cox E. Prosodically driven phonetic detail in speech processing: The case of domain-initial strengthening in English. Journal of Phonetics. 2007;35:210–243. [Google Scholar]
- Cooper WE, Paccia-Cooper J. Syntax and Speech. Cambridge, MA: Harvard University Press; 1980. [Google Scholar]
- Dimitrova S, Turk A. Patterns of accentual lengthening in English four-syllable words. Journal of Phonetics. 2012;40:403–418. [Google Scholar]
- Edwards J, Beckman ME, Fletcher J. The articulatory kinematics of final lengthening. Journal of the Acoustical Society of America. 1991;89:369–382. doi: 10.1121/1.400674. [DOI] [PubMed] [Google Scholar]
- Ferreira F. Effects of length and syntactic complexity on initiation times for prepared utterances. Journal of Memory and Language. 1991;30:210–233. [Google Scholar]
- Fletcher J. The Prosody of Speech: Timing and Rhythm. In: Hardcastle WJ, Laver J, Gibbon FE, editors. The Handbook of Phonetic Sciences. Hoboken, NJ: Wiley-Blackwell; 2010. pp. 523–602. [Google Scholar]
- Fougeron C. Articulatory properties of initial segments in several prosodic constituents in French. Journal of Phonetics. 2001;29:109–135. [Google Scholar]
- Fougeron C, Jun S. Rate effects on French intonation: prosodic organization and phonetic realization. Journal of Phonetics. 1998;26:45–69. [Google Scholar]
- Fougeron C, Keating P. Articulatory strengthening at edges of prosodic domains. Journal of the Acoustical Society of America. 1997;101:3728–3740. doi: 10.1121/1.418332. [DOI] [PubMed] [Google Scholar]
- Fuchs S, Petrone C, Krivokapić J, Hoole P. Acoustic and respiratory evidence for utterance planning in German. Journal of Phonetics. 2013;41:29–47. [Google Scholar]
- Gao M. PhD thesis. Yale University; 2008. Mandarin Tones: an Articulatory Phonology Account. [Google Scholar]
- Gick B, Wilson I, Kock K, Cook C. Language-specific articulatory settings: Evidence from inter-utterance rest position. Phonetica. 2004; 61:220–233. doi: 10.1159/000084159. [DOI] [PubMed] [Google Scholar]
- Goldstein LM, Byrd D, Saltzman E. The role of vocal tract gestural action units in understanding the evolution of phonology. In: Arbib M, editor. From action to language: The mirror neuron system. Cambridge, U.K: Cambridge University Press; 2006. pp. 215–249. [Google Scholar]
- Hayes B. The prosodic hierarchy in meter. In: Kiparsky P, Youmans G, editors. Phonetics and Phonology, Vol. 1: rhythm and meter. New York, N.Y: Academic Press, Inc; 1989. pp. 201–259. [Google Scholar]
- Hirose H. Investigating the Physiology of Laryngeal Structures. In: Hardcastle WJ, Laver J, Gibbon FE, editors. The Handbook of Phonetic Sciences. Hoboken, NJ: Wiley-Blackwell; 2010. pp. 130–152. [Google Scholar]
- Hoole P, Zierdt A, Geng C. Beyond 2D in articulatory data acquisition and analysis; Proceedings of The 15th International Congress of Phonetic Sciences; 2003. pp. 265–268. [Google Scholar]
- Kainada Evia Prosodic Boundary Effects on Durations and Vowel Hiatus in Modern Greek; Proceedings of the 16th International Congress of Phonetic Sciences; Saarbrücken: Uneversität des Saarlandes; 2007. pp. 1225–1228. [Google Scholar]
- Katsika A. Boundary- and prominence-related lengthening and their interaction. Journal of the Acoustical Society of America. 2009;125:2572–2572. [Google Scholar]
- Katsika A, Krivokapić J, Mooshammer C, Tiede M, Goldstein L. The coordination of boundary tones and their interaction with prominence. Journal of Phonetics. 2014;44:62–82. doi: 10.1016/j.wocn.2014.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keating P, Cho T, Fourgeron C, Hsu C. Papers in Laboratory Phonology VI: Phonetic Interpretation. Cambridge, U.K: Cambridge University Press; 2004. Domain-initial articulatory strengthening in four languages; pp. 143–161. [Google Scholar]
- Keating P, Wright R, Zhang J. World-level asymmetries in consonant articulation. UCLA Working Papers in Phonetics. 1999;97:157–173. [Google Scholar]
- Klatt DH. Vowel lengthening is syntactically determined in a connected discourse. Journal of Phonetics. 1975;3:129–140. [Google Scholar]
- Kohler K. Prosodic boundary signals in German. Phonetica. 1983;40:89–134. [Google Scholar]
- Krivokapić J. PhD thesis. University of Southern California; 2007a. The planning, production and perception of prosodic structure. [Google Scholar]
- Krivokapić J. Prosodic planning: effects of phrasal length and complexity on pause duration. Journal of Phonetics. 2007b;35:162–179. doi: 10.1016/j.wocn.2006.04.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krivokapić J. Gestural Coordiantion at prosodic boundaries and its role for prosodic structure and speech planning processes. Philosophical Translations of the Royal Society B. 2014;369:20130397. doi: 10.1098/rstb.2013.0397. http://dx.doi.org/10.1098/rstb.2013.0397. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krivokapić J, Byrd D. Prosodic boundary strength: an articulatory and perceptual study. Journal of Phonetics. 2012;40:430–442. doi: 10.1016/j.wocn.2012.02.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krull D. Prepausal lengthening in Estonian: Evidence from conversational speech. In: Lehiste I, Ross J, editors. Estonian Prosody: Papers from a Symposium. Institute of Estonian Language and Authors; 1997. [Google Scholar]
- Lehiste I. Rhythmic units and syntactic units in production and perception. Journal of the Acoustical Society of America. 1973;54:1228–1234. doi: 10.1121/1.1914379. [DOI] [PubMed] [Google Scholar]
- McGowan RS, Saltzman EL. Incorporating aerodynamic and laryngeal components into task dynamics. Journal of Phonetics. 1995;23:255–269. [Google Scholar]
- Mücke D, Nam H, Hermes A, Goldstein LM. Coupling of tone and constriction gestures in pitch accents. In: Hoole P, Bombien L, Pouplier M, Mooshammer C, Kühnert B, editors. Consonant Clusters and Structural Complexity. Berlin: Mouton de Gruyter; 2012. pp. 205–230. [Google Scholar]
- Nakatani LH, O’Connor KD, Aston CH. Prosodic aspects of American English speech rhythm. Phonetica. 1981;38:84–106. [Google Scholar]
- Nespor M, Vogel I. Prosodic phonology. Dordrecht: Foris; 1986. [Google Scholar]
- Oller KD. The effect of position in utterance on speech segment duration in English. Journal of the Acoustic Society of America. 1973;54:1235–1247. doi: 10.1121/1.1914393. [DOI] [PubMed] [Google Scholar]
- Pierrehumbert JB. PhD thesis. M.I.T; 1980. The phonology and phonetics of English intonation. [Google Scholar]
- Riggs D, Byrd D. The scope of phrasal lengthening: Articulatory and acoustic evidence. Journal of the Acoustical Society of America. 2011;130:2549–2549. [Google Scholar]
- Ramanarayanan V, Byrd D, Goldstein L, Narayanan S. Proceedings of Interspeech 2010. Makuhari, Japan: 2010. Sep, Investigating articulatory setting – pauses, ready position and rest – using real-time MRI. [Google Scholar]
- Ramanarayanan V, Bresch E, Byrd D, Goldstein L, Narayanan S. Analysis of pausing in spontaneous speech using real-time magnetic resonance imaging of articulation. Journal of the Acoustical Society of America, Express Letters. 2009;126:EL160–EL165. doi: 10.1121/1.3213452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramanarayanan V, Goldstein L, Byrd D, Narayanan S. A reat-time MRI investigation of articulatory setting across different speaking styles. Journal of the Acoustical Society of America. 2013;134:510–519. doi: 10.1121/1.4807639. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rusaw EC. A biologically inspired neural network for modeling phrase-final lengthening. Journal of the Acoustical Society of America. 2011;130:2553–2553. [Google Scholar]
- Rusaw EC. PhD thesis. University of Illinois at Urbana-Champaign; 2013. Modeling temporal coordination in speech production using an artificial central pattern generator neural network. [Google Scholar]
- Saltzman E. Task dynamic co-ordination of the speech articulators: a preliminary model. In: Heuer H, Fromm C, editors. Generation and Modulation of Action Patterns. Berlin, Germany: Springer-Verlag; 1986. pp. 129–144. [Google Scholar]
- Saltzman E, Munhall KG. A dynamical approach to gestural patterning in speech production. Ecological Psychology. 1989;1:333–382. [Google Scholar]
- Saltzman E, Nam H, Krivokapić J, Goldstein L. A task-dynamic toolkit for modeling the effects of prosodic structure on articulation; Proceedings of the Speech Prosody 2008 Conference pages; 2008. pp. 175–184. [Google Scholar]
- Selkirk E. Phonology and Syntax: The Relation Between Sound and Structure. Cambridge, MA: M.I.T. Press; 1984. [Google Scholar]
- Selkirk E. The prosodic structure of function words. In: Morgan JL, Demuth K, editors. Signal to Syntax: Bootstrapping from Speech to Grammar in Early Acquisition. Mahwah, NJ: Lawrence Erlbaum Associates; 1996. pp. 187–214. [Google Scholar]
- Shattuck-Hufnagel S, Turk A. The domain of phrase-final lengthening in English. Journal of the Acoustical Society of America. 1998;103:2889–2889. [Google Scholar]
- Silverman K. The separation of prosodies: comments on Kohler’s paper. In: Kingston J, Beckman ME, editors. Papers in Laboratory Phonology I: Between the Grammar and Physics of Speech. Cambridge, U.K.: Cambridge University Press; 1990. pp. 139–151. [Google Scholar]
- Silverman K, Beckman ME, Pitrelli J, Ostendorf M, Wightman C, Price P, Pierrehumbert J, Hirschberg J. ToBI: a standard labeling English prosody; Proceedings of the International Conference of Spoken Language Processing; 1992. pp. 867–870. [Google Scholar]
- Tabain M. Effects of prosodic boundary on /aC/ sequences: acoustic results. Journal of the Acoustical Society of America. 2003a;113:516–531. doi: 10.1121/1.1523390. [DOI] [PubMed] [Google Scholar]
- Tabain M. Effects of prosodic boundary on /aC/ sequences: articulatory results. Journal of the Acoustical Society of America. 2003b;113:2834–2849. doi: 10.1121/1.1564013. [DOI] [PubMed] [Google Scholar]
- Tabain M, Perrier P. Articulation and acoustics of /i/ at prosodic boundaries in French. Journal of Phonetics. 2005;33:77–100. [Google Scholar]
- Turk AE. Structural influences on boundary-related lengthening in English; Proceedings of the 14th International Congress of Phonetic Sciences; 1999. pp. 237–240. [Google Scholar]
- Turk AE, Shattuck-Hufnagel S. Multiple targets of phrase-final lengthening in American English words. Journal of Phonetics. 2007;35:445–472. [Google Scholar]
- Umeda N. Vowel duration in American English. Journal of the Acoustical Society of America. 1975;58:434–445. doi: 10.1121/1.380688. [DOI] [PubMed] [Google Scholar]
- White L. PhD thesis. The University of Edinburgh; 2002. English Speech Timing: A domain and locus approach. [Google Scholar]
- Wightman CW, Shattuck-Hufnagel S, Ostendorf M, Price PJ. Segmental durations in the vicinity of prosodic phrase boundaries. Journal of the Acoustical Society of America. 1992;91:1707–1717. doi: 10.1121/1.402450. [DOI] [PubMed] [Google Scholar]
- Wheeldon L, Lahiri A. Prosodic units in speech production. Journal of Memory and Language. 1997;37:356–381. [Google Scholar]
- White L. Communicative function of prosodic form in speech timing. Speech communication. 2014;63:38–54. [Google Scholar]