Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Aug 17.
Published in final edited form as: Lang Var Change. 2011 May 16;23(1):45–86. doi: 10.1017/S0954394510000219

Cross-generational vowel change in American English

Ewa Jacewicz 1, Robert Allen Fox 2, Joseph Salmons 3
PMCID: PMC4134908  NIHMSID: NIHMS526344  PMID: 25140113

Abstract

This study examines cross-generational changes in the vowel systems in central Ohio, southeastern Wisconsin and western North Carolina. Speech samples from 239 speakers, males and females, were divided into three age groups: grandparents (66–91 years old), parents (35–51) and children (8–12). Acoustic analysis of vowel dynamics (i.e., formant movement) was undertaken to explore variation in the amount of spectral change for each vowel. A robust set of cross-generational changes in /ɪ, ε, æ, ɑ/ was found within each dialect-specific vowel system, involving both their positions and dynamics. With each successive generation, /ɪ, ε, æ/ become increasingly monophthongized and /ɑ/ is diphthongized in children. These changes correspond to a general anticlockwise parallel rotation of vowels (with some exceptions in /ɪ/ and /ε/). Given the widespread occurrence of these parallel chain-like changes, we term this development the “North American Shift” which conforms to the general principles of chain shifting formulated by Labov (1994) and others.

INTRODUCTION

The goal of this paper is to contribute to a better understanding of vowel change in American English by observing how vowel variants are transmitted over three generations of speakers. Vowel change has become a central topic in the emerging field of sociophonetics worldwide and has provided a rich basis for the examination of both internal (i.e., system-driven) factors and a variety of external (i.e., contact-driven) factors in different varieties of English in the world (Boberg, 2005; Cox, 1999; Labov, 1994; Labov, Ash & Boberg, 2006; Labov, Yaeger & Steiner, 1972; Torgersen & Kerswill, 2004; Trudgill, 2004; Watson, Maclagan & Harrington, 2000; Watt, 2000, to name a few). Depending on the research focus, studies were conducted to observe changes in a selected subset of vowels or in the entire vowel system. In the present study, we use a large sample size to examine how regional vowel systems have changed across three age groups of speakers which we call here grandparents, parents and children generations. The inclusion of children (8–12 years old) allows us to analyze the latest developments in the respective vowel systems apart from developmental factors present in children’s productions of younger age.

Our approach to the study of vowel change is to explore vowel dynamics, a type of vowel internal variation also known as vowel-inherent spectral change or VISC (Nearey & Assmann, 1986). That is, while examining more global positional vowel changes in the acoustic space, we also take into account the extent to which each vowel is “diphthongal” or “monophthongal.” This is to establish whether the dialect-specific degree of diphthongization is also transmitted across generations. In particular, we ask the question of what constitutes a vowel change: Is it only a positional change found in vowel dispersion patterns in the acoustic vowel space or does vowel change also involve a change in its dynamic characteristics?

In much traditional historical phonology, sound changes including vowel shifts were formulated on the basis of descriptive phonological analyses involving distinctive vowel features high/low and front/back, which allowed for capturing directional diachronic movement of vowels such as rising, lowering, fronting or backing. Subsequent phonetic studies measured the relative positions of vowel nuclei and defined vowel rotations as positional changes in the two-dimensional F1 by F2 plane. From the phonetic perspective, the principles of vowel change (e.g., Labov, 1994) assumed that vowels are more or less “static targets” and their positional movement in the vowel space is the only change transmitted across generations. The present study draws attention to the fact that dynamic structure is inherent in the production of English vowels which must also be acquired by each successive generation. Thus, our current inquiry is to determine whether the dynamic structure remains intact in the process of sound change or whether positional vowel change also involves changes in vowel dynamics.

In this paper, we present cross-generational data which bear on both the shift in the relative positions of vowel nuclei as well as the systematic variation in the dynamic structure. While the “static” positional changes inform us about the current direction of sound change, we also examine how formant patterns change over the course of each vowel’s duration and seek to incorporate this variation into a more systematic study of vowel change. While formant dynamics have been studied previously in the field of phonetics on selected subsets of American, Canadian and Australian English vowels (e.g., Andruski & Nearey, 1992; Fox, 1983; Fox & Jacewicz, 2009; Hillenbrand, Getty, Clark & Wheeler, 1995; Watson & Harrington, 1999), the relevance of this type of variation to cross-generational vowel transmission has not been generally addressed with the notable exception of a study of /u/-fronting in standard southern British by Harrington, Kleber and Reubold (2008).

To study cross-generational vowel change, we selected three dialect areas in the United States: central Ohio (Midland), southeastern Wisconsin (Inland North) and western North Carolina (Inland South). The terms Inland North and Inland South refer to the labels given to the dialects spoken in these areas in the Atlas of North American English (widely known as ANAE, Labov et al., 2006). Each area is closely associated with distinct patterns of vowel variation and change which are discussed in depth in Labov et al.’s 2006ANAE and are evidenced in other phonetic accounts such as Thomas (2001) and Clopper, Pisoni and de Jong (2005). While we expect that at least some of these reported dialect-specific vowel changes will be reflected in our data, we are particularly interested in determining if the inherent dynamic structure also undergoes systematic variations as a function of speaker dialect and generation. In our approach, we use the reported patterns of changes as a general guide and examine more closely how the positional changes found in our sample correspond to a specific pattern of variation in formant dynamics. Below we will review the widely accepted views on the vowel changes which each of these three dialects are reportedly undergoing.

Dialectal vowel variation and change in central Ohio, southeastern Wisconsin and western North Carolina

Central Ohio, our first selected region, is typically seen as part of the Midland dialect area, although there is controversy about whether a “Midland” dialect actually exists (see Preston, 2003). It has traditionally been regarded as not being involved in any systematic chain shift. With regard to vowel patterns generally, Thomas (2001:82) characterizes all of central and southern Ohio as consisting of “several weakly differentiated dialectal areas.” Regardless of the area’s status in dialectology, it is characterized by Low Back Merger (i.e., loss of contrast between /ɑ, ɔ/) or near-merger of those vowels, cf. Labov et al. (2006:263–271) and Thomas (2001:94), among others.

With regard to /æ/, Labov et al. treat central Ohio, and Columbus in particular, as having a “nasal system,” where /æ/ is raised before nasals, but “otherwise the nucleus remains in low front position” (2006:175, elsewhere). Some other evidence suggests backing of /æ/ (Thomas, 2001:94, Thomas, 1989), and real-time historical evidence shows that the vowel has historically not been raised in the region (Thomas (2006), though Thomas (1989) found raising in some speakers). However, more recent work in the Columbus area (Durian, 2008; 2009; Durian et al., 2010) has suggested that the region is, in fact, undergoing a chain shift involving /æ/, namely a form of the “Canadian Shift” (a systematic retraction of /æ, ε, ɪ/) in addition to Low Back Merger.

Southeastern Wisconsin, the second region of our interest, has been widely regarded as the western edge of the Northern Cities Shift (NCS) area. An overview of the whole shift is given in Figure 1. In the NCS, raising of /æ/ is often supposed to be the earliest stage, while speakers also show fronting of /a/, lowering and fronting of /ɔ/, backing of /ʌ/ and /ε/, with the weakest and likely last stage being lowering of /ɪ/ (though Labov (1994) and others have treated /ʌ/ backing as a possible last stage). Other sources, like Gordon (2001:35; 2003:258–260), have given evidence for fronting of the low back vowel as the first stage, which then pushes /æ/ forward, a position now supported by McCarthy (2009). (For detailed review of the relative and absolute chronology of the NCS, see Gordon, 2001:33–36.) Labov et al. (2006) report the NCS to be fully developed in southeastern Wisconsin (showing the shift reaching to just west of Madison), illustrating “advanced” NCS with a speaker from Kenosha, Wisconsin. Strong /æ/ raising in particular appears in the ANAE as a widespread pattern across Wisconsin, reaching westward into Minnesota and Iowa (2006:82, Map 10.5, and elsewhere).

Figure 1.

Figure 1

Schematic of the Northern Cities Shift based on Gordon (2001:197; 2003:255).

More recent work within the region shows somewhat different patterns. For instance, in the Milwaukee area Purnell (2010) finds /æ/ raising and /ε/ lowering, but (using small caps for word classes, see also Labov et al. (2006:13–14)): “Movement among other vowels in the NCS is not as pronounced in southeastern Wisconsin as elsewhere in the NCS region. Bot may shift slightly forward, but not as far forward toward BACK” as it does in communities which show fuller NCS effects (2010:196). On this view, southeastern Wisconsin has a substantially more limited form of the NCS.1 The subjects in the present study were largely from the Madison area, well west of Milwaukee, and thus closer to the edge of any NCS effects.

Another complicating factor for Wisconsin is chronological. While the NCS is regularly thought of as a change in progress (Labov, 1994; Labov et al., 2006, many others), at the heart of the NCS area, new work suggests that “the earliest developments may have run their course, at least in Chicago” McCarthy (2009:110), including /æ/ raising. That would be consistent with Purnell’s findings above. A final possible account for such patterns would be based on Labov (2007), who explains the inconsistently NCS-like patterns of St. Louis as the result of dialect-learning by adults rather than children. We will not pursue that further here.

Our third region under study, far western North Carolina, is associated with an advanced stage of the Southern Shift. In the ANAE formulation (2006:128, elsewhere), this means monophthongization or glide deletion of /aɪ/ before obstruents (stage 1), reversal of position of /e/ and /ε/ (stage 2), and likely reversal of /i/ and /ɪ/ (stage 3). With the raising and fronting of the two lax vowels here, /ɪ, ε/, the Southern Shift also includes diphthongization or “breaking” (Sledd, 1966). While Labov et al. (2006:251–252) see the Southern Shift as “reasonably stable” over apparent time, they identify some recessive tendencies over generations, especially in larger cities, suggesting that it is likely more robust in less urban areas. Other results show even greater diversity within the “South,” broadly defined. Fridland (2002) finds retreat of some aspects of the shift in Memphis, Tennessee, namely /i, ɪ/ reversal. Irons (2007a) finds ongoing advance of the shift across generations in the rural and small-town areas of Kentucky’s Eastern Coalfield region. Feagin (2003: especially 136–139) already calls attention to such reported differences in research findings, suggesting that regional, methodological and urban/rural factors may account for these.

Finally, another chain-shift like pattern (called “solidarity chains” by Hock, 1991:157–158) is reported to be widespread across American English dialects, namely back vowel fronting, led by /u/ with /o/ trailing, and /ʊ/ often found to be included. Back vowel fronting appears to be advanced and pervasive in the South (Feagin, 2003), widespread across the Northeast (Labov et al., 2006:143–144) and the Midland, though less advanced. Wisconsin and areas to the west, in contrast, are regarded as showing “resistance” to this trend (Clopper et al., 1995:1664, see also Labov et al., 2006:91, Map 10.14 and elsewhere). Labov (2010: ch. 5) summarizes /u/ fronting as most advanced in the South and Midland and least in the North. At least some Midland varieties including Ohio, show fronting of /Ã/ as well (Thomas, 1989; Labov et al., 2006:266; Durian et al., 2010:175).

In short, in each of these three regions, complex patterns of vowel change are reported over real and apparent time. The reference view provided by the ANAE contrasts with other, newer research results with regard to key chain shifts: (1) whether the Southern Shift in areas like westernmost North Carolina is progressing or receding, (2) whether Midland varieties of central Ohio are involved in no chain shift or a Canadian Shift pattern, and (3) whether the NCS in southeastern Wisconsin is largely limited to /æ/ raising and /ε/ lowering or shows an advanced form of the shift, involving a full set of seven vowels.

Cross-generational sound transmission

We situate the present study of cross-generational vowel change within the general theoretical framework of Labov’s model of transmission and incrementation (Labov, 2001, chs. 13–16; also Labov, 2007). In each dialect region, we examine how directional and dynamic vowel changes are implemented in terms of transfer of features from each older to each younger generation. Following the model’s assumptions, we understand transmission as the product of the acquisition of language by children. Once the learned linguistic system stabilizes when children reach young adulthood, no structural changes to the vowel system are to be expected and it is this system that is being transmitted to the next generation. In Labov’s words, “structural patterns are not as likely to be diffused because adults do not learn and reproduce linguistic forms, rules, and constraints with the accuracy and speed that children display” (2007:349). Thus, the vowel system of young adults which will be transmitted to the successive generation should not undergo further structural changes itself if these adults stay in the same speech community.

In our study, we expect a change in the vowel system of each successive generation as a result of incrementation, that is, reorganization of the initially acquired system by young children during their preadolescent and adolescent years. In our present sample, we start with an initial system configuration in the grandparents generation, the specific configuration different in each dialect. We then expect to find a structural change in the successive parents generation brought about by incrementation. Children are expected to exhibit further reorganization in their vowel systems as a continuation of the transmission process. What we cannot predict here, however, are the connections between the transmission of systemic vowel features (including the degree of vowel inherent spectral change) and stages of a dialect-specific chain shift. That is, we do not know whether each successive generation carries on the change by “implementing” and /or “continuing” (to use Labov’s terms, 2001: chs. 15–16) a more advanced stage of the chain shift. It could be the case that the most advanced stage has already been reached by grandparents and other, more general principles of vowel change are operative in each younger generation. The present study will shed more light on this issue.

In selecting participants for the study, we were guided by their ties with their respective speech communities. We interviewed adults who were born, raised, spent most of their lives in a given speech community and who were not engaged in frequent travel out of the area. Our selected speech communities were defined in terms of relatively narrow geographic areas. In each dialect region, we recorded participants from three-to-four adjacent counties, mostly professionals, who grew up in their respective regions, attended local schools, events and had ties with the local religious and other social institutions.

In this paper, we focus on the basic, fundamental vowel systems of our speakers, using set of words that were tightly controlled for phonetic context. Using the same speech material and obtaining speech samples under the same experimental conditions is the first critical step to a rigorous comparison of vowel dynamics across generations and dialects. By minimizing the effects of the immediate consonantal context and equally large effects of prosodic variation and linguistic stress (see Fox & Jacewicz, 2009, for experimental evidence of these effects), we were able to obtain a uniform set of data and a homogenous speech sample. Further work will examine whether and how the basic vowel systems obtained in this study differ from the systems constructed on the basis of naturalistic data from a typical sociolinguistic interview, which was also conducted with each present study participant.

METHODS

Speakers, materials and procedures

Speech samples for this study were collected from 239 participants who were born, raised and spent most of their lives in one of three dialect regions discussed above: central Ohio (78 speakers, mostly from Columbus and its suburbs), southeastern Wisconsin (79 speakers, mostly from Madison and its suburbs) and western North Carolina (82 speakers, mostly from Sylva and adjacent areas). The participants fell into three age groups representing three generations of native residents of their respective speech communities which we define here as children (8–12 years old), parents (35–51) and grandparents (66–91). More details about their distribution by age and gender can be found in Table 1. Demographic information about each participant is presented in Appendix A. The present participants were not biologically related to one another (with a few exceptions in Ohio and North Carolina). Rather, the age groups brackets were selected solely on the basis of possible (and most typical) parent and grandparent ages in relation to 8–12 years-old children.

TABLE 1.

Distribution of participants into age and gender groups.

Dialect region Age group Number and gender Age range (years) Mean age (s.d) (years)
OH C 16 M, 16 F 8–12 10.6 (1.6)
P 12 M, 16 F 35–51 42.0 (4.6)
GP 9 M, 9 F 68–87 70.2 (2.3)
WI C 15 M, 16 F 8–12 9.5 (1.1)
P 14 M, 16 F 36–50 44.1 (4.5)
GP 9 M, 9 F 68–90 76.8 (6.2)
NC C 16 M, 16 F 8–12 10.5 (1.2)
P 16 M, 16 F 35–51 43.2 (4.9)
GP 9 M, 9 F 66–91 73.1 (7.1)

OH = Ohio, WI = Wisconsin, NC = North Carolina, C = children, P = parents, GP = grandparents, M = male, F = female.

The participants were recruited using flyers, bulletin board postings, email, word of mouth, announcements in radio, local newspapers, churches, schools and nursing homes. The recordings took place simultaneously in the three states in the years 2006–2008. Most participants were recorded at the university facilities in three locations: The Ohio State University, University of Wisconsin-Madison and Western Carolina University. However, research staff members in each state were available to travel to homes and places outside the university to record participants who were unable to come to the designated testing location. This happened on several occasions, mostly with the older participants. In those cases, the same recording equipment was used as at the stationary locations to ensure uniform quality of the recordings in the entire sample. All participants were paid a nominal fee for their efforts.

During a one-hour testing session, each participant was asked to produce a set of citation form words, a set of prosodically structured sentences and an unconstrained free conversation discussing life experiences, family, regional issues, hobbies and daily activities. For the present analysis of vowel systems, the data come from the first task only, i.e., from citation form words. In this task, each participant read a list of isolated hVd-tokens using the prompts heed, hid, heyd, head, had, hod, hawed, hoed, who’d, hood, hoyd, hide, howed, which contained 13 vowels of interest to the study: /i, ɪ, e, ε, æ, ɑ, ɔ, o, u, ʊ, oɪ, aɪ, aʊ/2. The citation-form vowels provided a common database for all 239 speakers because they were obtained in a uniform style free from effects of variable consonantal and prosodic contexts which, as discussed earlier, introduce variation not only in the relative positions of vowels in the acoustic vowel space but also in their formant dynamics (Fox & Jacewicz, 2009; Hillenbrand, Clark, & Nearey, 2001). Each participant read three repetitions of each token presented in a random order for a total of 9321 items (13 × 3 × 239). The prompts appeared randomly on a computer screen, one at a time, and the tokens were recorded and digitized directly onto a hard disc drive at a 44.1-kHz sampling rate. A head-mounted Shure SM10A dynamic microphone was used, positioned at a distance of about 1.5 inches from the speaker’s lips. The experiment was controlled by a custom MATLAB program and the researcher either accepted and saved the production or asked the participant for additional repetitions.3

Data analysis

Prior to acoustic analysis, the tokens were digitally filtered and downsampled to 11.025 kHz. To capture the dynamic formant change in a vowel, the F1 and F2 frequencies were sampled at five equidistant temporal locations corresponding to the 20–35–50–65–80%-points in the vowel. The measurements of vowel onsets and offsets (which were located by hand), served as input to calculations of vowel duration and formant frequencies in the time course of a vowel. F1 and F2 values (based on 14th order LPC analysis and centering a 25-ms Hanning window at each temporal point) were extracted automatically using a program written in MATLAB which displayed these values along with the FFT and LPC spectra and a wideband spectrogram of the vowel. Two reliability checks were performed on all measurements by two different researchers. All automatically extracted formants were checked using the formant tracking option in the commercially available software program TF32 (Milenkovic, 2003) and any disagreements and errors in the analysis were resolved and hand-corrected when the corrections were considered necessary.

To assess the extent of formant movement in a vowel, we derived a measure formant trajectory length (TL), which is an approximate arc length of the formant trajectory in the F1 by F2 plane. TL is a measure drawn from a family of models using multiple sample points. The most widely used measure is that of the “vector length” which estimates the distance in the F1 by F2 plane between two sample points located close to vowel onset and offset such as 20% – 80% or 20% – 70% (e.g., Hillenbrand et al., 1999; Hillenbrand et al., 2001; Ferguson & Kewley-Port, 2002). The TL used in this paper is an extension of the two-measurement model and is defined as a sum of the lengths, in Hz, of the four separate vowel sections between the 20% and 80%-point (i.e., 20–35%, 35–50%, 50–65% and 65–80%):

TL=n=14VSLn

where the length of one vowel section (VSL) is:

VSLn=(F1n-F1n+1)2+(F2n-F2n+1)2

This 5-point characterization of formant change in the F1 by F2 plane provides a more detailed estimation of the amount of formant movement than the 2-point model (see Fox & Jacewicz (2009) for further discussion of advantages of the TL measure).4 A longer TL implies more frequency change and thus greater formant movement in the plane. Using this measure, two-target diphthongs (such as /aɪ/) and one-target diphthongized vowels (such as /e/) are expected to have longer TLs than monophthongs (such as /i/). To interpret the dynamic variations for the reader, we will use in this paper the terms “monophthongization” and “diphthongization” to explicate the gradience of a speech signal without any reference to the number of the actual phonetic targets. For example, the monophthong /ɪ/ may become more monophthongal (or monophthongized) in the productions of parents compared to grandparents, which indicates simplification of its formant dynamics evident in a smaller amount of spectral change. In a similar way, we will refer to a diphthong as being more or less diphthongized or becoming more or less diphthongal in the productions of a given generation.

RESULTS

We begin the presentation of the results with cross-generational displays of mean relative positions of all 13 vowels in the two-dimensional (F1 × F2) acoustic vowel space and their dynamic formant patterns. In these plots, each vowel symbol is placed near the 80%-point in a vowel indicating the direction of formant movement. Because the vowels were produced in the so called “neutral” hVd context, the effects of consonantal environment are minimized and the dynamic change in a vowel reflects its vowel-inherent spectral change (e.g., Nearey & Assmann, 1986)5. Given the well-known effects of vocal tract length on formant values (i.e., a longer adult vocal tract results in lower formant frequencies than a shorter child’s vocal tract, ceteris paribus), if we are to directly compare the formants (and therefore the positions of the vowels in the vowel space) of these sets of speakers, we need to normalize the measured formant values. This normalization procedure should (1) eliminate variation caused by differences in vocal tract length while (2) preserving dialectal and cross-generational differences. In the current study, Lobanov’s (1971) procedure, which converts formant values in Hz to z-scores for each individual speaker, was used. This is a normalization procedure that Adank, Smits and van Hout (2004) found to be one of the most effective. However, since the resulting values are quite different from the original Hz values (and thus more difficult to interpret), we utilized the Thomas and Kendall (2007) approach to rescale the z-scores for each of the i vowels using the following formulae where F1iN and F2iN refers to the normalized F1 and F2 values for the ith vowel and FNMIN and FNMAX refers to the minimum and maximum FN values found across all vowels and all speakers (i.e., this is a speaker-extrinsic and not a speaker-intrinsic approach).

rescaledandnormalizedFRN1i=250+500(FN1i-FN1MIN)/(FN1MAX-FN1MIN)rescaledandnormalizedFRN2i=850+1400(FN2i-FN2MIN)/(FN2MAX-FN2MIN)

The rescaled and normalized F1 values will range from 250 to 750 Hz, and the rescaled and normalized F2 values will range from 850 to 2250 Hz. All formant frequencies reported in the Results section of this paper represent rescaled normalized formant values. The unnormalized mean Hz values are reported in Appendix B.

Cross-generational patterns of vowel dispersion

Figure 2 shows the vowel plots for three generations of Ohio (OH) speakers. Cross-generational changes are most evident in the vowels /ɪ, ε, æ, ɑ/. The vowels in hid, head and had show the greatest amount of spectral change in the production of grandparents (GP), becoming increasingly monophthongized with each successive generation. A striking finding is that a further reduction of formant movement in /ε/ and /æ/ in children (C) corresponds to a change in the direction of formant movement in these two vowels. In addition, /æ/ shows lowering and retraction with each successive generation. Beginning with parents (P), we also notice a relation between /æ/ and /ɑ/: As the vowel in had lowers the vowel in hod raises. The outcome of the /ɑ/-raising is its merger with /ɔ/, the Low Back Merger. As can be seen, girls show a more advanced stage of both the lowering/backing of /æ/ and the merger compared to boys. These results are consistent with the view that this part of the Midland region has near-merger or merger of the low back vowels (e.g., Thomas, 1989; 2001), and the advanced stage represented by the youngest females suggests ongoing progress of the merger.6

Figure 2.

Figure 2

Cross-generational dynamic vowel dispersion patterns for Ohio. Lines connect measurement points from 20% to 80% and vowel symbols are placed next to the 80%-point. See text for further details. GP = grandparents, P = parents.

Another set of cross-generational changes in the OH data involves the back vowels /o, oɪ/ in hoed and hoyd. The onglide (i.e., the /o/-component of the diphthong, corresponding to the 22–35–50% measurement points) raises with each successive generation. It is also articulated farther back in the oral cavity beginning with P. In parallel to the changes in /oɪ/, the onset of /o/ in hoed raises with each generation and, in C, the vowel is both more monophthongized and more back compared to both P and GP. Finally, while /o/ in hoed is backing in C, /ʊ/ in hood undergoes some fronting in their productions.

The cross-generational vowel dispersion patterns for Wisconsin (WI) are shown in Figure 3. At a first glance, the dominant feature that differentiates the WI and OH vowel systems are the positions of /u/ in who’d and /o/ in hoed, which are articulated farther back in WI speakers as compared with the fronted OH variants. The positions of these two WI vowels and the nature of their formant movement change little across generations, although some fronting can be detected in P- and C-males and some lowering of /o/ in P-and C-females. The /o/ seems to be more monophthongal in GP compared to the younger generations. These results confirm, as noted above, that this region shows resistance to back vowel fronting which is otherwise widespread across the United States and shows that the pattern is strong down to the youngest speakers.

Figure 3.

Figure 3

Cross-generational dynamic vowel dispersion patterns for Wisconsin.

Further differences between the WI and OH systems are in the dynamics of /e/ in heyd and /æ/ in had. The WI variant of /e/ is almost monophthongal in GP and becomes increasingly diphthongized with each successive generation. The monophthongal character of /e, o/ in the Upper Midwest, especially Wisconsin and Minnesota, is a salient feature of speech in the region for non-linguists. A monophthongal pronunciation in the region has also been noted occasionally in the literature, particularly in Thomas (2001:72), Allen (1973–1976, vol. II:21–22) and Labov et al. (2006:181, 183). Our results suggest that this Upper Midwestern feature loses its strength over generations and both vowels become increasingly diphthongal. The WI vowel /æ/ has not only a greater amount of formant movement compared to OH, but shows a characteristic mark of the Northern breaking, a raised /ε/-like onglide and a lowered /æ/-offglide, following general patterns reported elsewhere (e.g. Labov et al., 2006; Purnell, 2010).

In general, the vowels in hid, head and had seem to follow the same mechanism of cross-generational change as observed in OH, that is, reduction of formant movement in /ɪ, ε/ and lowering/backing of /æ/. However, it is evident that the vowel /æ/ does not lose its Northern breaking across generations and the amount of its spectral change does not seem to be reduced. In spite of this, /æ/ clearly begins to lower in P and continues to lower in C. Furthermore, its offglide, corresponding to the 65–80%-region in the vowel, is clearly backing in C, thus following the pattern of change observed in OH. This lowering and backing of /æ/ affects the position of /ɑ/ in hod, which begins to rise in P. If this trend continues in WI, the Low Back Merger seems inevitable in future generations of Wisconsin speakers. This set of vowel changes is substantially different from the NCS-rotations. Although we clearly see NCS elements in the present WI sample such as lowering and centralization of /ε/ and the Northern breaking in /æ/, the current data show notable differences. In particular, /ɪ/ seems to be raising rather than lowering and /æ/ is clearly retracting and lowering. Thus, the linchpin change of the NCS, /æ/-raising, is absent over our generations. Similarly, the low back /ɑ/ is backing rather than fronting whereas the NCS would crucially involve fronting of this vowel. In addition, along with the backing and raising of /ɑ/, we also find backing and raising of the onset of /aɪ/ in hide (i.e., at the 20–35% temporal locations) which eventually “meets” the onset of /aʊ/ in C.

Finally, the cross-generational set of changes involving the WI variants of the back vowels in hoyd and hood parallels these in OH. The diphthong /oɪ/ raises and its onset retracts and the vowel /ʊ/ in hood undergoes successive lowering and fronting, which is more evident in women than in men and is most advanced in children. Fronting of /ʊ/ has been noted, for instance, by Hock as part of a general back vowel fronting pattern in central Illinois (1991:142–158). We can also see it in the map of F2 of /ʊ/ in Labov et al. (2006: Map 10.14). Lowering of the vowel is noted by Labov et al. (2006: Map 10.13), but they expressly say that “the height of /u/ [=/ʊ/] does not play a prominent part in any of the sound changes discussed in the Atlas” (2006:90).

Turning to the North Carolina (NC) vowels displayed in Figure 4, we find a very different set of dialectal features. We will first focus on the vowel system of GP and will then observe the reorganization of the vowel space with each successive generation. First, the GP variants of /i, ɪ, e, ε, æ/ are in close proximity to one another and show a great amount of formant movement. We find a very diphthongal /e/, whose fronted nucleus “reverses” its position with /ε/. The dynamic patterns in the remaining four vowels demonstrate Southern breaking and the raised /æ/ is another characteristic mark of Southern American English.

Figure 4.

Figure 4

Cross-generational dynamic vowel dispersion patterns for North Carolina.

Second, the nominal diphthongs are produced with far less spectral change compared to both the OH and WI variants. We find a monophthongal /aɪ/ which also occupies the lowest position in the vowel space seen so far. The two remaining diphthongs, /aʊ/ and /oɪ/, exhibit reduced formant movement, especially in men. Glide weakening or loss, often referred to as monophthongization of “General American” diphthongs has long been seen as a defining characteristic of Southern speech, as reviewed for all three diphthongs by Thomas (2003).

Third, we find a raised /ɑ/ in hod and a heavily diphthongized /ɔ/ in hawed which has a very distinct character of formant movement indicating a production close to /ɑɔ/. This is what Labov et al. (2006:254, elsewhere) term the Back Upglide Shift, and which is seen as acting “to preclude the low back merger” (2006:303). The fact that the GP generation shows powerful Back Upglide, while it recedes in younger speakers, would suggest that the resistance to Low Back Merger could wane over time. Finally, the dispersion of the /o, u, ʊ/ group bears similarities with that in OH although the /u/ is even more fronted in NC and the /ʊ/ is more raised, consistent with what has been found in other studies.

In P, the following changes take place, which are more evident in women than in men: Fronting of /i, e/ and lowering and backing of /ɪ, ε/, lowering/backing of /æ/ and raising of /ɑ/, greater diphthongization of the three diphthongs /aɪ, aʊ, oɪ/, raising of /oɪ/ and further fronting of the /o, u, ʊ/ group. For this adult generation, the results indicate that the core Southern Shift is receding in this community, with regard to both glide deletion and the reversal of tense and lax front vowels. However, back vowel fronting continues unabated.

The set of changes in C include: Fronting of /i/, loss of Southern breaking and monophthongization of /ɪ, ε, æ/, further lowering/backing of /æ/ and raising of /ɑ/, changing formant pattern in /ɔ/ which, especially in girls, corresponds to both OH and WI variants, greater diphthongization of /aɪ, aʊ, oɪ/, raising of /oɪ/ and further fronting of the /o, u, ʊ/ group. The loss of Back Upgliding — clearest in girls — suggests the possibility of future Low Back Merger in the community. The present cross-generational progression toward the Low Back Merger supports the findings reported by Irons (2007b) for Kentucky English, who proposed that “the mechanism underlying this low back vowel merger is back upglide loss, that is, the erosion of a phonological feature that has been a factor in resistance to merger in the South” (2007:138). With regard to back vowel fronting, in girls, the variants of /u, o/ are clearly backing, which may indicate that fronting of these vowels reached its limit already in P-females and in boys, suggesting that no further fronting of these vowels ought to be expected without a categorical vowel change.

To summarize our observations thus far, we have found vowel dispersion patterns which are clearly dialect-specific as well as cross-generational changes which seem to be common across the three dialects. In particular, neither NCS nor Southern Shift appeared to be active in the selected regions although some elements of either shift were clearly present. Instead, the cross-generational data revealed a new anticlockwise vowel rotation which was common to all three dialect areas. As an additional feature of this new development in the three regional vowel systems, the vowels /ɪ, ε, æ/ are found to be undergoing a common process of increased monophthongization (with the exception of WI /æ/) and /ɑ/ has been developing formant dynamics approximating those in the vowel /ɔ/. Figure 5 illustrates this process in greater detail by displaying the data from Figures 24 cross-generationally, separately for each dialect.

Figure 5.

Figure 5

Crross-dialectal and crossg-enerational display of dynamic dispersion patterns for the vowels /ɪ, ε, æ, ɑ/.

The cross-generational lowering/backing of /æ/ and the corresponding /ɑ/-raising is evident in all three dialects. However, in the case of /ɪ/ and /ε/, there is more variability. While NC /ɪ, ε/ are clearly descending and retracting and WI /ε/ also shows the same trend in P (but not in C), the positional changes in /ɪ/, if any, are along the front-back dimension rather than low-high. Thus, the greatest cross-generational change in /ɪ, ε/, common across all three dialects, is the loss of formant movement and the positional changes may reflect some dialect-specific demands. We will return to this in the General Discussion. At present, we need to determine how strongly we can claim that variation in formant dynamics play a significant role in cross-generational vowel change. The statistical analysis of formant movement (TL) will help us to better understand its importance in the patterns described above.

Cross-generational patterns of formant dynamics

Mean TL values (and standard deviations) for each vowel broken down by dialect, age group and gender are listed in Table 2. The TL differences were first examined using a series of one-way ANOVAs conducted separately for each vowel for each dialect. In these analyses, age group (generation: GP, P, C) and gender were the two between-subject factors. The effects of age group on changes in dialect-specific formant dynamics were of our particular interest. Following the significance of the main effect of age, Scheffé’s multiple comparisons were used as post hoc tests. The second type of analysis used separate one-way ANOVAs to explore more global effects of dialect on TL of each vowel, disregarding the effects of age and gender. In this analysis, dialect was the only between-subject factor and Scheffé’s comparisons followed as post hoc tests.

TABLE 2.

Means (in Hz) and standard deviations for trajectory length (TL) broken down by dialect, age group and gender.

Vowel Token Age/Gender TL/OH TL/WI TL/NC
/i/ heed GP_M 132 (52) 128 (54) 147 (60)
P_M 103 (40) 94 (24) 153 (57)
C_M 146 (30) 124 (45) 151 (52)
GP_F 106 (24) 108 (26) 140 (75)
P_F 126 (59) 91 (34) 123 (34)
C_F 146 (49) 137 (62) 149 (37)
/ɪ/ hid GP_M 220 (46) 201 (58) 295 (56)
P_M 149 (42) 168 (41) 290 (65)
C_M 131 (29) 118 (26) 232 (90)
GP_F 282 (46) 255 (54) 293 (69)
P_F 193 (52) 194 (61) 369 (60)
C_F 126 (29) 108 (31) 218 (96)
/e/ heyd GP_M 216 (53) 153 (43) 315 (53)
P_M 188 (35) 150 (23) 327 (69)
C_M 222 (72) 177 (53) 332 (60)
GP_F 200 (50) 115 (42) 314 (65)
P_F 208 (53) 137 (55) 327 (61)
C_F 229 (67) 193 (61) 340 (60)
/ε/ head GP_M 180 (36) 183 (71) 261 (62)
P_M 117 (40) 120 (28) 253 (84)
C_M 115 (32) 141 (40) 179 (73)
GP_F 221 (40) 249 (53) 278 (89)
P_F 125 (35) 155 (62) 325 (82)
C_F 124 (37) 134 (35) 149 (53)
/æ/ had GP_M 192 (85) 229 (100) 237 (85)
P_M 132 (45) 196 (52) 198 (93)
C_M 146 (34) 238 (84) 169 (49)
GP_F 217 (59) 271 (47) 277 (69)
P_F 158 (39) 271 (84) 287 (87)
C_F 168 (47) 245 (45) 171 (49)
/ɑ/ hod GP_M 207 (34) 162 (55) 153 (51)
P_M 173 (96) 157 (30) 195 (66)
C_M 298 (86) 216 (48) 290 (102)
GP_F 207 (67) 173 (45) 187 (36)
P_F 233 (58) 175 (40) 235 (64)
C_F 258 (63) 234 (71) 324 (102)
/u/ who’d GP_M 163 (40) 187 (69) 172 (53)
P_M 129 (31) 153 (51) 151 (62)
C_M 197 (62) 151 (54) 201 (51)
GP_F 167 (60) 171 (42) 148 (53)
P_F 152 (53) 168 (58) 141 (58)
C_F 179 (62) 143 (47) 188 (56)
/ʊ/ hood GP_M 225 (60) 301 (68) 169 (53)
P_M 213 (44) 287 (60) 148 (53)
C_M 247 (68) 245(46) 170 (59)
GP_F 238 (71) 310 (61) 197 (91)
P_F 249 (43) 319 (53) 160 (85)
C_F 254 (58) 300 (53) 212 (49)
/o/ hoed GP_M 206 (60) 182 (55) 250 (66)
P_M 172 (34) 196 (68) 230 (57)
C_M 173 (50) 191 (47) 224 (49)
GP_F 215 (50) 161 (38) 258 (49)
P_F 202 (38) 207 (63) 242 (41)
C_F 181 (37) 190 (58) 239 (42)
/ɔ/ hawed GP_M 206 (50) 213 (67) 227 (40)
P_M 180 (84) 240 (50) 235 (110)
C_M 275 (92) 257 (55) 246 (83)
GP_F 241 (85) 229 (62) 246(73)
P_F 237 (57) 265 (51) 239 (46)
C_F 260 (86) 284 (72) 246 (86)
/aɪ/ hide GP_M 526 (65) 472 (72) 119 (33)
P_M 479 (65) 447 (93) 196 (113)
C_M 570 (58) 484 (58) 401 (143)
GP_F 634 (54) 526 (64) 224 (110)
P_F 576 (62) 516 (45) 243 (130)
C_F 523 (89) 545 (72) 528 (154)
/ɔɪ/ hoyd GP_M 733 (86) 643 (80) 539 (80)
P_M 760 (57) 695 (58) 713 (104)
C_M 821 (74) 736 (70) 792 (150)
GP_F 813 (58) 663 (70) 706 (117)
P_F 832 (100) 746 (66) 726 (104)
C_F 828 (77) 807 (50) 883 (72)
/aʊ/ howed GP_M 402 (50) 331 (34) 425 (75)
P_M 419 (62) 317 (45) 449 (84)
C_M 444 (67) 383 (47) 500 (81)
GP_F 462 (83) 288 (31) 486 (109)
P_F 427 (67) 345 (69) 477 (102)
C_F 433 (72) 416 (70) 468 (74)

The statistical results are summarized in Table 3. Listed are significance values for the effects of age group separately for each dialect, and significant differences between age groups (GP, P, C) revealed by the Scheffé procedure. The effects of dialect (from the second analysis) are listed in the last column, which also includes significant differences between the dialects (OH, WI, NC) shown in Scheffé’s post hocs. Although these post hoc comparisons were done on each vowel separately (and are thus relatively independent tests), given the large number of tests we chose a relatively conservative a of .001. The significant results are summarized in Table 3. The effects of gender were mostly not significant and are not listed in the table. In general, female TLs were longer than male and significant differences were found only for OH hid, OH and WI hide and NC hoyd.

TABLE 3.

Summary of statistical results for trajectory length (TL) for the effects of age (GP=grandparents, P=parents, C=children) and dialect (OH, NC, WI).

Vowel/token TL/Ohio F(2,72) TL/Wisconsin F(2,72) TL/North Carolina F(2,76) TL/Dialect F(2,235)
/i/ (heed) n.s. n.s. n.s. F=7.85, p=.001
NC>WI
/ɪ/ (hid) F=52.23, p<.001
GP>P, GP>C, P>C
F=37.47, p<.001
GP>P, GP>C, P>C
F=15.35, p<.001
GP>C, P>C
F=59.48, p<.001
NC>OH, NC>WI
/e/ (heyd) n.s. F=7.94, p=.001
C>P, C>GP
n.s. F=183.13, p<.001
NC>OH, NC>WI, OH>WI
/ε/ (head) F=33.17, p<.001
GP>P, GP>C
F=18.34, p<.001
GP>P, GP>C
F=24.70, p<.001
GP>C, P>C
F=41.53, p<.001
NC>OH, NC>WI
/æ/ (had) F=8.12, p=.001
GP>P, GP>C
n.s. F=11.10, p<.001
GP>C, P>C
F=23.10, p<.001
WI>OH, NC>OH
/ɑ/ (hod) F=10.01, p<.001
C>P, C>GP
F=12.07, p<.001
C>P, C>GP
F=20.38, p<.001
C>P, C>GP
F=9.93, p<.001
OH>WI, NC>WI
/u/ (who’d) n.s. n.s. n.s. n.s.
/ʊ/ (hood) n.s. n.s. n.s. F=74.57, p<.001
WI>OH, WI>NC, OH>NC
/o/ (hoed) n.s. n.s. n.s. F=24.28, p<.001
NC>OH, NC>WI
/ɔ/ (hawed) n.s. n.s. n.s. n.s.
/aɪ/ (hide) n.s. n.s. F=42.95, p<.001
C>P, C>GP
F=86.06, p<.001
OH>NC, WI>NC
/ɔɪ/ (hoyd) n.s. F=19.10, p<.001
C>P, C>GP, P>GP
F=23.74, p<.001
C>P, C>GP, P>GP
F=11.10, p<.001
OH>WI, OH>NC
/aʊ/ (howed) n.s. F=18.16, p<.001
C>P, C>GP
n.s. F=48.75, p<.001
NC>OH, NC>WI, OH>WI

As before, we begin with the analysis of the results for OH. The cross-generational changes in formant dynamics were significant only for the vowels /ɪ, ε, æ, ɑ/. For /ɪ/, all pairwise comparisons yielded significant differences between each successive generation, providing evidence that /ɪ/-monophthongization has been a continuous process spanning all three generations. The vowels /ε, æ/ change their dynamics in a similar fashion. However, the differences between P and C were not significant, suggesting that a substantial reduction of formant movement has already occurred in P and formant dynamics did not change much in C. For /ɑ/, we found a significant difference between the increased formant movement in C and either P or GP, and the latter two did not differ significantly from one another.

The results for WI show similar sets of significant cross-generational TL changes for /ɪ, ε, ɑ/. However, there were no significant differences for /æ/, confirming that the amount of formant movement in this vowel remained unchanged across the three generations. Also significant was the amount of spectral change in /e/ in C compared to either P or GP. This vowel, being known as almost monophthongal in WI, is clearly becoming diphthongized in the youngest generation. An interesting relationship can be found between the raising of the diphthong /oɪ/ in hoyd and a significant increase of its formant movement with each successive generation. This gradual and increased cross-generational diphthongization is achieved by both backing the onset of the diphthong and fronting its offset. Finally, there is a significant increase in formant movement in the diphthong /aʊ/ in howed in C, which most likely results from some lowering and fronting of its onset.

In the NC vowel system, the significant reduction of formant movement in the /ɪ, ε, æ/ set is clearly a change introduced more recently. The lack of significant TL differences between GP and P and inspection of the means in Table 2 indicate that Southern breaking is no longer present in C in /ε, æ/ and /ɪ/ has significantly reduced formant movement compared to the two older generations. Another recent change introduced in C can be seen in /ɑ/, which has a significantly greater formant movement. This change corresponds to similar patterns we have already seen in OH and WI in that children’s variants of /ɑ/ were more diphthongal compared to adults and there were no significant difference between the two adult groups. The two remaining NC vowels affected significantly by cross-generational changes are the diphthongs /oɪ/ and /aɪ/. The pattern of cross-generational change in /oɪ/ is the same as in WI in that a significantly greater diphthongization in each successive generation corresponds to the raising of the diphthong. The vowel /aɪ/ undergoes a diphthongization in C and is not produced as a monophthong as in GP and P. The amount of formant movement in the variant of /aɪ/ produced by NC girls is now comparable with that produced by OH and WI girls (see Table 2).

Overall, the statistical analysis of TL variation across age groups confirmed most of our initial observations which were based only on visual inspection of the vowel plots. It also provided firmer ground for reaching conclusions about the importance of formant dynamics and their potential contribution to the process of cross-generational vowel change. Turning to the exploration of a more global effects of dialect on formant dynamics of individual vowels (the second type of our analysis), we found significant effects of dialect for 11 vowels, except for /u/ and /ɔ/. This indicates that, despite significant cross-generational changes in each vowel system, formant dynamics are strongly affected by dialect. In most cases, NC variants had a greater formant movement compared to both or at least one dialect. The two exceptions were /ʊ/ and /aɪ/ which had the smallest amount of formant change.

GENERAL DISCUSSION

The acoustic analysis of vowel systems reported in this study revealed a robust set of cross-generational changes in /ɪ, ε, æ, ɑ/ which necessitates an extended discussion of this chain-shift-like development. We will first consider the positional vowel changes across generations and will then discuss these changes in relation to formant dynamics.

Figure 6 is a partial reproduction of the data from Figure 5 showing the location of each vowel’s midpoint only, which, more or less, corresponds to a broadly defined “vowel target” or nucleus. As can be seen, there is almost no cross-generational change in the positions of /ɪ/ in OH and WI, although some fronting in WI boys can be detected. In both dialects, the location of the vowel in the acoustic space is comparable. This indicates that in OH and WI, the cross-generational change involves primarily its monophthongization and not a positional vowel movement. In NC, the vowel is retracting from its proximity to /i/, with a tendency to lowering. This movement seems to be unrelated to formant dynamics, as is evident in the female data.

Figure 6.

Figure 6

Crross-dialectal and cross-generational display of midpoint measurements for the vowels /ɪ, ε, æ, ɑ/. All data points are redrawn from Figure 5.

The crossdialectal differences are clearly manifested in /ε/. The vowel exhibits almost no positional change in OH (although slight centralization can be detected in men) but shows lowering and centralization in P in WI (more in women than in men), which is consistent with the direction of the NCS. Interestingly, there is no further retraction in WI once the vowel became monophthongized in P and seems to undergo some fronting in children. The cross-generational changes in /ε/ are most evident in the NC variant, which retracts from its high-front position into a more centralized and lower position with each successive generation. This positional change is much greater in women than in men and seems unrelated to the degree of diphthongization. The most dramatic change is in /æ/ which, in each dialect, is descending and backing with each successive generation. The degree of this change differs and can be as large as in NC women (where the vowel was highly raised in GP and now, in girls, is almost as low and as back as in OH children) or as small as in WI men, who nevertheless show a steady progression in the same direction. Finally, the vowel /ɑ/ shows the common cross-generational trend of raising and backing, despite the crossdialectal differences in its exact position in the acoustic space. This upward movement is rather unrelated to any change in formant dynamics.

Taken together, the crossdialectal and cross-generational patterns found in this apparent-time analysis reveal a common mechanism underlying the set of changes observed here. The observed outcome is a chain-like anticlockwise rotation of all four vowels and the nature of this rotation needs to be characterized with a greater precision. This shift is manifested most clearly in NC. In particular, in the production of the oldest speakers, all three vowels /ɪ, ε, æ/ are raised and fronted as a result of the Southern Shift. In the next generation, all three vowels show retraction and lowering, which is manifested more in women than in men. This conforms to the general view of the role of women in sound change, namely, that women are leading men generationally in this development. However, while /ɪ, ε, æ/ are retracting and descending in the vowel space, female data show no change in the position of /ɑ/, indicating no involvement of this vowel at this stage. The final set of changes can be seen in children, showing no further change in /ɪ/ in girls (but retraction and lowering in boys) and further retraction and lowering of /ε, æ/. We now notice raising and some backing of /ɑ/, possibly in response to the low and back position of /æ/.

Elements of this movement across generations can be found in both OH and WI, although they do not manifest themselves as strongly and coherently as in NC. Because the vowels /ɪ/ and /ε/ were not involved in any shift in OH, their initial positions in the oldest speakers are lower and not as fronted as in NC. Apparently, there has been no further lowering of both vowels in OH and some small variation in articulation can be detected in advancement rather than height. However, the raised position of /æ/ in the oldest OH speakers, especially in males, creates a condition for its lowering and backing, which indeed takes place in subsequent generations. The downward movement of /æ/ occurs simultaneously with the raising and backing of /ɑ/. The development in WI is in accord with those in OH and NC, showing no major variation in /ɪ/ (although some lowering and centralizing of /ɪ/ can be seen in adult males), centralization of /ε/ due to the NCS (and its lowering in adult females) and lowering and backing of the raised /æ/. The WI /ɑ/, which is low and not as back as NC or even OH variants, undergoes a steady raising and backing with each successive generation.

Considering now the relation between the positional vowel change and variation in formant dynamics, the results show a coherent set of events. While the vowel /ɪ/ in hid was diphthongized in the oldest age groups in all three dialect regions, we found a significant reduction of the extent of its formant movement in OH and WI parents groups. This reduction has been delayed by one generation in NC and the change is undergoing now in NC children. Comparing TL means, especially among the female groups (see Table 2), we find NC children values similar to those in OH and WI parents a generation earlier. While NC children are “at the OH and WI parents stage,” both OH and WI now demonstrate further reduction of formant movement. The end result of these cross-generational changes is a relatively monophthongal /ɪ/, whose position in the vowel space is basically the same in OH and WI. We understand these sets of changes in /ɪ/, including both positional movement in NC and simplification of the dynamic structure in all three dialects, as cross-generational progress toward the loss of distinctiveness of the dialect-specific phonetic features. Clearly, in children, the vowel has almost lost its diphthongal character which is a most salient perceptual cue in maintaining a distinction between dialects in our oldest speakers. Results from a perceptual testing, currently under analysis in our lab, will verify our current understanding of why the vowel has become increasingly monophthongized in all three dialects.

The vowel /ε/ in head shows a very similar cross-generational development. However, given that this vowel exhibits inherently smaller amounts of spectral change compared to /ɪ/, it takes two generations in OH and WI (and not three) to reduce the formant movement sufficiently to produce a monophthong. Consequently, the /ε/ has already become a monophthong in OH and WI parents. The monophthongization of the NC variant of /ε/ takes place a generation later, in children, which is also consistent with the development of /ɪ/. We can also observe a beginning of a change in the direction of formant movement in NC girls, following the pattern in OH and WI children. Again, this increased monophthongization is most likely to reduce the phonetic contrast among the dialect specific dynamic features which will be verified by the perception data.

The cross-generational changes in /æ/ in had in OH parallel those in /ε/. This is not the case in the WI variant which remains fully diphthongized in all three generations, indicating that its Northern breaking is still a salient dialectal feature. In NC, the progressive monophthongization of /æ/ follows the same route as in /ɪ/ and /ε/, showing up in children (and not in adults). NC girls have now essentially the same variant as OH girls. Finally, across all three dialects, the vowel /ɑ/ in hod shows an increase in formant movement in children compared to either adult group, suggesting that this is a recent development. That is, the quality of this vowel is changing in children, which is now more diphthongized and close to /ɔ/ in terms of the amount of formant movement (compare Table 2).

Clearly, there seems to be a common force underlying this general anticlockwise rotation, which operates in the same way regardless of the initial dialect-specific configuration in older adults. Obviously, we can see in these front vowels the operation of Labov’s Principle II, according to which lax vowels lower in chain shifts (formulated on the basis of Sievers, 1876). However, it is unclear at present what motivates this rotation other than the possibility of perceptual loss of dialectal distinctiveness in this particular subset of vowels. Certainly, the reduction of formant dynamics would point toward likelihood of this future outcome.

In parallel to the developments in the front vowels, the back /ɑ/ begins its route of raising and backing, until reaching the state of the merger with /ɔ/. In addition, a closer inspection of the back vowel space in NC (see Figure 4) indicates its successive expansion and raising of long vowels /aʊ/ and /oɪ/ which, along with /ɑ/, appear to be moving upward along the peripheral track. This development can be understood as an operation of Principle I which, again based on Sievers (1876), predicts a rising tendency of the long back vowels in chain shifts. To complete the anticlockwise rotation in the vowel space, the back vowels /u, o, ʊ/ undergo successive fronting in accord with Principle III, although this fronting may in fact have reached its limit and is receding in NC girls. In OH, we also find the raising of /oɪ/ along with the expansion toward the back although there is no raising of /aʊ/ (compare Figure 2). We also see a slight fronting of /u, ʊ/ but not of /o/, which seems to be backing. Finally, in WI, we can also observe the progressive raising of /oɪ/ along with its expansion toward the back of the vowel space and some fronting of /u, o, ʊ/ across generations (see Figure 3). Thus, it seems that the same set of principles are operative in each dialect, although the particular elements are executed with variable strength and weight in each respective vowel system.

How does this general anticlockwise development correspond to more recent reports involving American English? The changes in the /ɪ, ε, æ, ɑ/ set have been studied in Canada and were initially referred to as the “Canadian Shift” (Clarke, Elms & Youssef, 1995). According to this first report, the Canadian /æ/ was retracting from its low-front position into low-central position, a movement triggered by the Low Back Merger (cat/caught), which opened up an empty space in the vowel system previously occupied by /ɑ/. In response to this merger and the retraction of /æ/, /ε/ lowered to the position occupied by /æ/ and /ɪ/ lowered to the position made available by /ε/. Crucially, this pull chain comes to existence in response to the Low Back Merger, which is thought to trigger the lowering and retraction of the entire front lax vowel system. Building on such earlier work, Gordon (2005) argues that retraction of the low front vowel creates a “margin of security” within the vowel space in response to Low Back Merger. Bigham’s (2009) study of southern Illinois vowels support this at the overall community level, but some individual speakers show retraction without merger, suggesting that retraction can be or become an independent sound change.

The results from 239 speakers presented in this paper do not conform to this version of the Canadian Shift because the lowering and retraction of /æ/ is not associated with the low-back vowel merger. That is, it occurs in dialects which both do exhibit the merger (OH) and do not (WI, NC). On the contrary, our data support the vowel development reported by Boberg (2005), who provided acoustic and statistical evidence against the classic chain shift proposed in the original version of the Canadian Shift. For the shift currently active in Montreal, Boberg emphasizes the parallel retractions of /ɪ, ε, æ/, calling it a “parallel shift” (2005:151; see above on the similar notion of “solidarity shift”). In this interpretation of the sound change, the retraction of /æ/ has not triggered the lowering of /ε/ as in a pull chain; rather, both /ɪ/ and /ε/ changed their positions in a parallel development with /æ/. The present study finds similar parallel developments in three very distinct varieties spoken in the United States. These results show retraction in whole communities without Low Back Merger, suggesting even more independence between those two processes than Bigham argues for.

Given the widespread occurrence of these parallel shift-like changes, we propose to term this recent and overarching development the “North American Shift,” which includes dialect regions in both Canada and the United States. An important component of this parallel anticlockwise rotation of the /ɪ, ε, æ/ set is monophthongization or reduction of dynamic spectral changes in these three vowels. At present, we await further reduction of formant movement in WI /æ/ and NC /ɪ/, the only two vowels which still show a substantial degree of diphthongization in children. In response to lowering, backing and monophthongization of /æ/, we find increased diphthongization of /ɑ/ in children, which may signal an initial stage of its progression toward the Low Back Merger, complementing its progressive raising and backing. In the present data, we find a higher-level unity across very distinct varieties such as studied here that spans the individual dialects. This unity subsumes but is not at odds with the diversity of dialects which is clearly visible in the strong presence of dialect-specific features across generations of speakers in each region.

It is yet unclear how the North American Shift corresponds to the developments reported in other English-speaking countries. For example, a similar anticlockwise rotation of /ɪ, ε, æ/ has been found in southeast England, which is viewed as a classic chain shift, originating with the lowering of /æ/ (Torgersen & Kerswill, 2004). This shift fits together with the fronting of /ʊ/ in this region, which can also be found across the North American dialects. Torgersen and Kerswill refer to Trudgill who has argued that this shift is a drag chain which started with the lowering of /æ/. In their words, “Trudgill (2004), looking only at the three front vowels, considers this to be a drag chain beginning with TRAP. We suggest that the crowding of the vowel space, perhaps caused by the lowering of TRAP, forced STRUT to move back. If this is correct, the chain shift is one beginning in the middle of the chain, initiated by the lowering of TRAP” (2004:45). More reports from other parts of the English-speaking world are needed to provide further insights about the nature of this shift-like rotation.

So far, our discussion has focused on structural factors without addressing possible social factors involved in the set of vowel changes found here. However, as claimed by Labov (2001:463), “there must be a social force that activates the shift and drives the increment.” The basic social characteristics of the present participants are listed in Appendix A. These include both occupation and their education level. All children were enrolled in elementary schools in the public school system and adults represented a variety of educational and professional backgrounds. The oldest participants were mostly retired from their professions and lived at their homes or retirement facilities. The professional background data do not indicate major divisions within and across each dialect such as urban versus rural or related to socioeconomic status. In general, we expect the analysis of this parameter (i.e., occupation) to add little to a better understanding of the “social force” in the present study. The same holds true for the level of education.

Although the cross-generational and crossdialectal patterns of variation found in this study are based on the most formal style of production, the results revealed substantial differences both in formant dynamics and vowel dispersion in the acoustic space as a function of age group and dialect. This indicates that linguistic forms associated with formal style in the production of the oldest adults differ from those in each successive generation. These results underscore the value of analyzing the citation-form speech in addressing questions about sound change. We also expect to find the same cross-generational patterns in vowel systems constructed on the basis of less formal, conversational speech. That is, the general vowel space along with the amount of formant movement in particular vowels will be considerably reduced in casual speech, especially when vowels occur in less prominent prosodic positions in the utterance (vowels in these positions will tend to centralize, cf. Agwuele, Sussman & Lindblom, 2008; Lindblom, Agwuele, Sussman & Cortes, 2007). Variation in speech tempo and consonantal context of the vowel will all contribute to reduction (or expansion) of the vowel space. Yet, within each type of variation, we expect the formant movement to be proportionally reduced (or expanded) although the basic pattern of dispersion within the vowel system itself is expected to remain the same. This aspect of cross-generational vowel change will be examined in our future work, using recordings of spontaneous speech collected from the present participants.

As for the effects of gender, another social factor controlled for in this study, we should not overlook the fact that female speakers, in general, displayed somewhat greater formant movement in vowels compared to male speakers. This was true especially for the diphthongs /aɪ/ and /ɔɪ/ and the /ɪ, ε, æ/ set (see Table 3). These vowels were more “diphthongal” in females compared to males, perhaps reflecting a somewhat more exaggerated articulation in female speech. The gender-related differences may result from self-monitoring or perhaps from differences in the perception of task demands as suggested elsewhere (e.g., Beckford Wassink, Wright & Franklin, 2007; Bell, 2001).

Supplementary Material

Appendix

Acknowledgments

This work was supported by the research grant No. R01 DC006871 from the National Institute of Deafness and Other Communication Disorders, National Institutes of Health.

Footnotes

1

An additional aspect of /æ/-raising clearly distinguishes Wisconsin from the (rest of) the Northern Cities area, “pre-velar raising.” See Bauer and Parker (2008), Purnell (2008), and Bauer (forthcoming). This body of work provides strong evidence that this process is distinct historically and phonologically from general /æ/-raising.

2

The vowel /Ã/ in hud was originally included in pilot testing and then the prompt hud was overlooked at the stage of preparation of the final stimulus material and therefore is not included in the present study. However, instances of the vowel were recorded during the spontaneous talk phase of data collection and will be included in our future analysis of vowels systems constructed from free conversations.

3

Although acceptable productions occurred most of the time, additional repetitions were obtained in case of poor quality recordings such as due to voice quality, ambient noise, “false starts,” laughing, etc.

4

The 2-sample versus 5-sample models are acoustic measures and do not imply that listener might use these linear approximations to identify the vowel. Perceptual experiments are needed to test which model provides sufficient information about vowel quality.

5

A reviewer reminds us that the dynamic change in a later portion of a vowel may be indicative of coarticulatory timing differences between the vowel and the final consonant as a function of speaker age, which is not related to dialectal or cross-generational vowel change. A detailed analysis of coarticulatory events and articulatory data would be, of course, a desirable extension of the study of the acoustic formant changes reported in this paper to shed more light on this issue. At this point, we expect that some degree of formant movement starting at about 70%-point of a citation-form token will be attributed to a transitional movement towards a /d/ locus (and thus more susceptible to individual speaker variation) and not to a vowel inherent spectral change (VISC) per se. However, the time-varying changes of the vowel tract shape that produce VISC must include at least some consonantal effects because vowels are not normally produced with static configurations of the vocal tract shape. While admitting the possibility of coarticulatory interactions with VISC, we selected the most conservative hVd-frame which, in phonetic research, is believed to introduce only marginal consonantal context effects.

6

In a perception study which used a subset of this study’s tokens as stimuli, Garea (2009) found perceptual evidence for the progress of the merger. Young OH adults in their 20s gave higher identification rates for hod produced by the parents generation (78%, confused with hawed 20% of the time), while children’s hod was identified as intended by the speakers only 58% of the time and its confusions with hawed rose to 37%.

Contributor Information

Ewa Jacewicz, The Ohio State University.

Robert Allen Fox, The Ohio State University.

Joseph Salmons, University of Wisconsin-Madison.

References

  1. Adank Patti, Smits Roel, van Hout Roeland. A comparison of vowel normalization procedures for language variation research. Journal of the Acoustical Society of America. 2004;116:3099–3107. doi: 10.1121/1.1795335. [DOI] [PubMed] [Google Scholar]
  2. Agwuele Augustine, Sussman Harvey M, Lindblom Björn. The effect of speaking rate on consonant vowel coarticulation. Phonetica. 2008;65:194–209. doi: 10.1159/000192792. [DOI] [PubMed] [Google Scholar]
  3. Allen Harold B. The linguistic atlas of the Upper Midwest. Vol. 3. Minneapolis: University of Minnesota Press; 1973–1976. [Google Scholar]
  4. Andruski Jean, Nearey Terrance. On the sufficiency of compound target specification of isolated vowels and vowels in /bVb/ syllables. Journal of the Acoustical Society of America. 1992;91:390–410. doi: 10.1121/1.402781. [DOI] [PubMed] [Google Scholar]
  5. Bauer Matt. Sound change and laryngeal height. Proceedings of the Chicago Linguistic Society’s 46th meeting; Chicago: CLS; (forthcoming) [Google Scholar]
  6. Bauer Matt, Parker Frank. /æ/-raising in Wisconsin English. American Speech. 2008;83:403–431. [Google Scholar]
  7. Beckford Wassink Alicia, Wright Richard A, Franklin Amber D. Intraspeaker variability in vowel production: An investigation of motherese, hyperspeech, and Lombard speech in Jamaican speakers. Journal of Phonetics. 2007;35:363–379. [Google Scholar]
  8. Bell Allan. Back in style: Reworking audience design. In: Eckert P, Rickford JR, editors. Style and sociolinguistic variation. Cambridge: Cambridge University Press; 2001. pp. 139–169. [Google Scholar]
  9. Bigham Douglas S. Correlation of the Low-Back Vowel Merger and TRAP-Retraction. University of Pennsylvania Working Papers in Linguistics. 2009;15(2):21–31. http://repository.upenn.edu/pwpl/vol15/iss2/4. [Google Scholar]
  10. Boberg Charles. The Canadian shift in Montreal. Language Variation and Change. 2005;17:133–154. [Google Scholar]
  11. Clarke Sandra, Elms Ford, Youssef Amani. The third dialect of English: Some Canadian evidence. Language Variation and Change. 1995;7:209–228. [Google Scholar]
  12. Clopper Cynthia G, Pisoni David, de Jong Kenneth. Acoustic characteristics of the vowel systems of six regional varieties of American English. Journal of the Acoustical Society of America. 2005;118:1661–1676. doi: 10.1121/1.2000774. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Cox Felicity. Vowel change in Australian English. Phonetica. 1999;56:1–27. doi: 10.1159/000028438. [DOI] [PubMed] [Google Scholar]
  14. Durian David. A new perspective on vowel variation over the course of the 20th Century in Columbus, OH [The Canadian Shift in Columbus, part I]. Paper presented at the Linguistic Society of America annual meeting; San Francisco, CA. 2009. [Google Scholar]
  15. Durian David. Purely a chain shift?: The Canadian Shift in the US Midland [The Canadian Shift in Columbus, part II]. Paper presented at the American Dialect Society annual meeting; Baltimore, MD. 2010. [Google Scholar]
  16. Durian David, Dodsworth Robin, Schumacher Jennifer. Convergence in Blue-Collar Columbus, Ohio, African American and White Vowel Systems? In: Yaeger-Dror M, Thomas E, editors. AAE speakers and their participation in local sound changes: A comparative study. Publications of the American Dialect Society. Vol. 94. Durham: Duke University Press; 2010. pp. 161–190. [Google Scholar]
  17. Feagin Crawford. Vowel shifting in the southern states. In: Nagle S, Sanders S, editors. English in the Southern United States. Cambridge: Cambridge University Press; 2003. pp. 126–140. [Google Scholar]
  18. Ferguson Sarah H, Kewley-Port Diane. Vowel intelligibility in clear and conversational speech for normal-hearing and hearing-impaired listeners. Journal of the Acoustical Society of America. 2002;112:259–271. doi: 10.1121/1.1482078. [DOI] [PubMed] [Google Scholar]
  19. Fox Robert A. Perceptual structure of monophthongs and diphthongs in English. Language and Speech. 1983;26:21–60. doi: 10.1177/002383098302600103. [DOI] [PubMed] [Google Scholar]
  20. Fox Robert A, Jacewicz Ewa. Cross-dialectal variation in formant dynamics of American English vowels. Journal of the Acoustical Society of America. 2009;126:2603–2618. doi: 10.1121/1.3212921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Fridland Valerie. The social dimension of the Southern Vowel Shift: Gender, age and class. Journal of Sociolinguistics. 2002;5:233–253. [Google Scholar]
  22. Garea Lauren L. BA Honors Thesis. The Ohio State University; Columbus, OH: 2009. Cross-dialectal differences in vowel identification. [Google Scholar]
  23. Gordon Matthew J. Publication of the American Dialect Society. Vol. 84. Durham: Duke University Press; 2001. Small-town values and big-city vowels: A study of the Northern Cities Shift in Michigan. [Google Scholar]
  24. Gordon Matthew J. Investigating chain shifts and mergers. In: Joseph B, Janda R, editors. Handbook of historical linguistics. Malden, MA: Blackwell; 2003. pp. 244–266. [Google Scholar]
  25. Gordon Matthew J. The Midwest and West. In: Schneider E, editor. Handbook of varieties of English: The Americas and Caribbean, Vol I: Phonology. Berlin: Mouton de Gruyter; 2005. pp. 338–350. [Google Scholar]
  26. Harrington Jonathan, Kleber Felicitas, Reubold Ulrich. Compensation for coarticualtion, /u/-fronting, and sound change in standard southern British: An acoustic and perceptual study. Journal of the Acoustical Society of America. 2008;123:2825–2835. doi: 10.1121/1.2897042. [DOI] [PubMed] [Google Scholar]
  27. Hillenbrand James, Getty Laura A, Clark Michael J, Wheeler Kimberlee. Acoustic characteristics of American English vowels. Journal of the Acoustical Society of America. 1995;97:3099–3111. doi: 10.1121/1.411872. [DOI] [PubMed] [Google Scholar]
  28. Hillenbrand James, Clark Michael, Nearey Terrance. Effects of consonantal environment on vowel formant patterns. Journal of the Acoustical Society of America. 2001;109:748–763. doi: 10.1121/1.1337959. [DOI] [PubMed] [Google Scholar]
  29. Hock Hans H. Principles of historical linguistics. 2. Berlin: Mouton de Gruyter; 1991. [Google Scholar]
  30. Irons Terry L. The Southern Shift in Appalachian English. University of Pennsylvania Working Papers in Linguistics. 2007a;13(2):121–134. http://repository.upenn.edu/pwpl. [Google Scholar]
  31. Irons Terry L. On the status of low back vowels in Kentucky English: More evidence of merger. Language Variation and Change. 2007b;19:137–180. [Google Scholar]
  32. Labov William. Principles of linguistic change. I: Internal factors. Oxford: Blackwell; 1994. [Google Scholar]
  33. Labov William. Principles of linguistic change. II: Social factors. Oxford: Blackwell; 2001. [Google Scholar]
  34. Labov William. Transmission and diffusion. Language. 2007;83:344–387. [Google Scholar]
  35. Labov William. Principles of linguistic change. III: Cognitive and cultural factors. Oxford: Blackwell; 2010. [Google Scholar]
  36. Labov William, Yaeger Malcah, Steiner Richard. A quantitative study of sound change in progress. Philadelphia: U.S. Regional Survey; 1972. [Google Scholar]
  37. Labov William, Ash Sharon, Boberg Charles. Atlas of North American English. Berlin: Mouton de Gruyter; 2006. [Google Scholar]
  38. Linblom Björn, Agwuele Augustine, Sussman Harvey M, Cortes Elisabet E. The effect of emphatic stress on consonant vowel coarticulation. Journal of the Acoustical Society of America. 2007;121:3802–3813. doi: 10.1121/1.2730622. [DOI] [PubMed] [Google Scholar]
  39. Lobanov Boris. Classification of Russian vowels spoken by different speakers. Journal of the Acoustical Society of America. 1971;49:606–608. [Google Scholar]
  40. McCarthy Corrine. The Northern Cities Shift in real time: Evidence from Chicago. University of Pennsylvania Working Papers in Linguistics. 2009;15(2):99–110. http://repository.upenn.edu/pwpl/vol15/iss2/12. [Google Scholar]
  41. Milenkovic Paul. TF32 software program. University of Wisconsin; Madison: 2003. [Google Scholar]
  42. Nearey Terrance, Assmann Peter. Modeling the role of inherent spectral change in vowel identification. Journal of the Acoustical Society of America. 1986;80:1297–1308. [Google Scholar]
  43. Preston Dennis R. Where are the dialects of American English at anyhow? American Speech. 2003;78:235–254. [Google Scholar]
  44. Purnell Thomas. Pre-velar raising and its conditioning environments: Role of labial and anterior tongue gestures. American Speech. 2008;83:373–402. [Google Scholar]
  45. Purnell Thomas C. AAE speakers and their participation in local sound changes: A comparative study. Publications of the American Dialect Society. Vol. 94. Durham: Duke University Press; 2010. The vowel phonology of urban southeastern Wisconsin; pp. 191–217. [Google Scholar]
  46. Sievers Eduard. Grundzüge der Lautphysiologie: Zur Einführung in das Studium der Lautlehre der indogermanischen Sprachen. Leipzig: Breitkopf & Härtel; 1876. (Later, beginning with the widely-cited 1881 edition, known as Grundzüge der Phonetik.) [Google Scholar]
  47. Sledd James. Breaking, umlaut, and the southern drawl. Language. 1966;42:18–41. [Google Scholar]
  48. Thomas Erik R. Vowel changes in Columbus, Ohio. Journal of English Linguistics. 1989;22:205–215. [Google Scholar]
  49. Thomas Erik R. An acoustic analysis of vowel variation in New World English. Durham, NC: Duke University Press; 2001. [Google Scholar]
  50. Thomas Erik R. Secrets revealed by Southern vowel shifting. American Speech. 2003;78:150–170. [Google Scholar]
  51. Thomas Erik R. Evidence from Ohio on the evolution of /æ/ In: Murray TE, Simon BL, editors. Language variation and change in the American Midland: A new look at “heartland” English. Amsterdam & Philadelphia: John Benjamins; 2006. pp. 69–89. [Google Scholar]
  52. Thomas Erik, Kendall Tyler. NORM: The vowel normalization and plotting suite. 2007 [Online resource: http://ncslaap.lib.ncsu.edu/tools/norm/]
  53. Torgersen Eivind, Kerswill Paul. Internal and external motivation in phonetic change: dialect levelling outcomes for an English vowel shift. Journal of Sociolinguistics. 2004;8:23–53. [Google Scholar]
  54. Trudgill Peter. New-dialect formation: The inevitability of colonial Englishes. Edinburgh: Edinburgh University Press; 2004. [Google Scholar]
  55. Watson Catherine I, Harrington Jonathan. Acoustic evidence for dynamic formant trajectories in Australian English vowels. Journal of the Acoustical Society of America. 1999;106:458–468. doi: 10.1121/1.427069. [DOI] [PubMed] [Google Scholar]
  56. Watson Catherine I, Maclagan Margaret, Harrington Jonathan. Acoustic evidence for vowel change in New Zealand English. Language Variation and Change. 2000;12:51–68. [Google Scholar]
  57. Watt Dominic. Phonetic parallels between the close-mid vowels of Tyneside English: Are the internally or externally motivated? Language Variation and Change. 2000;12:69–101. [Google Scholar]
  58. Wells John C. Accents of English. Cambridge: Cambridge University Press; 1982. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Appendix

RESOURCES