Skip to main content
American Journal of Audiology logoLink to American Journal of Audiology
. 2015 Jun;24(2):80–83. doi: 10.1044/2015_AJA-14-0056

Competing Speech Perception in Middle Age

Karen S Helfer a,
PMCID: PMC4567969  NIHMSID: NIHMS681712  PMID: 25768264

Abstract

Purpose

This research forum article summarizes research from our laboratory that assessed middle-aged adults' ability to understand speech in the presence of competing talkers.

Method

The performance of middle-aged adults on laboratory-based speech understanding tasks was compared to that of younger and older adults.

Results

Decline in the ability to understand speech in complex listening environments can be demonstrated in midlife. The specific auditory and cognitive contributors to these problems have yet to be established.

Conclusion

There is evidence that the ability to understand a target speech message in the presence of competing speech messages changes relatively early in the aging process. The nature and impact of these changes warrant further investigation.


Older adults often experience substantial problems understanding speech when listening conditions deteriorate. Background noise makes communication in many situations difficult for older listeners, especially when this noise consists of voices of other talkers (e.g., Desjardins & Doherty, 2013; Helfer, Chevalier, & Freyman, 2010; Helfer & Freyman, 2008, 2014; Humes & Coughlin, 2009; Humes, Lee, & Coughlin, 2006; Lee & Humes, 2012; Rossi-Katz & Arehart, 2009; Tun & Wingfield, 1999). Competing speech situations require individuals to use both peripheral and cognitive resources in order to communicate successfully, as the multiple speech sources must be segregated, messages not of interest must be ignored, and attention often must be switched in midconversation.

When do these challenges begin? For most individuals, problems coping in complex listening situations come on gradually. It is not unusual for middle-aged adults to self-refer for hearing assessment, and in fact, people in this age range (even those who have normal pure-tone thresholds) report more hearing-related problems than do younger adults (Demeester et al., 2012). Results of large-scale speech-in-noise screening programs confirm these reports, as a relatively large proportion of middle-aged adults need more advantageous signal-to-noise ratios to achieve criterion levels of performance (Nachtegaal et al., 2009).

The source of functional midlife hearing problems has yet to be determined. Pure-tone threshold elevation often begins in middle age (e.g., Morrell, Gordon-Salant, Pearson, Brant, & Fozard, 1996), which undoubtedly contributes to problems understanding speech in any type of background noise. Changes in suprathreshold peripheral processing of sounds also could contribute to problems understanding speech. For example, suboptimal temporal processing may cause listeners to be less able to take advantage of fluctuations in a competing speech signal, leading to reduced ability to glimpse portions of the to-be-attended message (e.g., George, Festen, & Houtgast, 2006; Takahashi & Bacon, 1992). In fact, several studies have noted abnormal temporal processing in middle-aged listeners (e.g. Grose & Mamo, 2010; Leigh-Paffenroth & Elangovan, 2011; Lister, Besing, & Koehnke, 2002). The ability to segregate sounds (of obvious importance in competing speech situations) also may change by midlife (e.g., Alain & McDonald, 2007; Lewald & Hausmann, 2013). Additionally, cognitive changes that could influence the ability to attend to one message while ignoring another have been documented in middle-aged individuals (e.g., Gunstad et al., 2006).

Speech Perception in Middle-Aged Adults: Results From Our Laboratory

Study 1: Speech Perception and Temporal Processing

The first exploration of speech perception in middle age from our laboratory examined speech understanding and temporal processing in groups of younger (19–22 years) and middle-aged (45–55 years) women (Helfer & Vargo, 2009). Speech understanding was measured in the presence of two types of maskers: a steady-state speech-shaped noise and a competing speech masker consisting of two same-sex talkers. The target and masking signals were presented in both spatially coincident and spatially separated conditions. The Gaps-in-Noise test (Musiek et al., 2005) was used to assess temporal processing. Although performance between the two groups of participants was equivalent in the presence of steady-state noise and when target and masking speech were spatially separated, middle-aged individuals performed significantly more poorly than younger adults in the spatially coincident speech-in-speech condition. Notably, speech recognition scores obtained in that condition were significantly associated with performance on the Gaps-in-Noise test. While high-frequency pure-tone thresholds (which were slightly elevated in the middle-aged listeners, with an average threshold at 4–8 kHz of 17 dB HL) were significantly correlated with speech understanding in steady-state noise, they were not related to speech understanding in competing speech or to Gaps-in-Noise test performance.

Study 2: Hearing, Cognition, and Speech Perception

Another study from our laboratory was designed to examine connections among hearing thresholds, cognitive ability, and speech understanding in younger, middle-aged, and older listeners (Helfer & Freyman, 2014). We manipulated the number of same-sex masking talkers (one or two) as well as confusability of the target speech and masking speech (via altering the syntactic similarity between the to-be-attended and to-be-ignored speech streams). Cognitive abilities measured included auditory short-term and working memory, inhibitory ability, and processing speed. The middle-aged participants in this study performed more similarly to older participants for a single competing talker and more similarly to younger listeners in the two-talker masker condition. However, significant differences were found between middle-aged and younger listeners in all speech-masking conditions, indicating that age-related changes for this type of speech perception task occurred early in senescence. The effect of syntactic similarity was comparable across groups, suggesting that aging does not bring out increased susceptibility to this aspect of confusability between target and masker. Elevated high-frequency pure-tone thresholds, smaller auditory short-term memory spans, and slower processing speed were associated with poorer speech recognition performance among the middle-aged and older participants. Unexpectedly, the proportion of masker errors (that is, when participants responded with a word in the to-be-ignored sentence instead of one from the to-be-attended sentence) was greater for participants with better hearing, longer short-term memory spans, and better inhibitory ability. Of note was that self-perceived hearing problems (measured with selected questions from the Speech, Spatial and Qualities of Hearing Scale; Gatehouse & Noble, 2004) were greater for the middle-aged participants in this study as compared to the older participants.

Study 3: Lexical Factors in Competing Speech Perception

Our most recent work in this area (Helfer & Jesse, 2015) explored how lexical characteristics (specifically, neighborhood density and frequency of word occurrence) of to-be-attended speech (target) and to-be-ignored speech (masker) affected performance of younger, middle-aged, and older adults. Individuals in this study heard pairs of sentences in which the frequency or neighborhood density of words in the target or masking speech stream was manipulated. To-be-attended and to-be-ignored sentences were presented from a single loudspeaker located directly in front of the listener. Participants repeated back the target sentence, which was cued by both the first word (the target sentence always started with the name “Theo”; the masking sentence began with “Victor” or “Michael”) and the target talker's voice, which was consistent throughout the experiment. Participants also completed a battery of cognitive tests similar to the one described earlier for Study 2.

Results of this experiment confirmed that to-be-attended words that occurred frequently or were from sparse lexical neighborhoods were easier to understand than were rare words or those from dense lexical neighborhoods, even in the presence of a competing speech message. Previous studies (Dirks, Takayanagi, Moshfegh, Noffsinger, & Fausti, 2001; Revill & Spieler, 2012; Sommers & Danielson, 1999; Spieler & Balota, 2000; Taler, Aaron, Steinmetz, & Pisoni, 2010) have found that lexical frequency in to-be-attended speech is more important for older than for younger listeners. However, in our study, the effect of word frequency in the to-be-attended speech stream was most influential for middle-aged participants (as compared to either older or younger participants) and was related to selected cognitive abilities (specifically, inhibitory ability and processing speed). Remarkably, lexical characteristics of the masking speech stream also influenced performance, as speech recognition ability was poorer in the presence of to-be-ignored words that were easy to recognize (that is, that came from sparse lexical neighborhoods or that occurred frequently in the lexicon) as compared to performance with hard-to-recognize masking words.

Stimuli in this study had a consistent target talker and a consistent masking talker; hence, we expected that participants would be able to incidentally learn the two voices, leading to improved performance over the course of the study (e.g., Felty, Buchwald, & Pisoni, 2009; Johnsrude et al., 2013). Figure 1 shows percent correct performance as a function of trial, with data averaged over the first 50 trials, the middle 50 trials, and the last 50 trials. It can be observed that some learning occurred for all listener groups, particularly over the first 50 trials. Of note is that, during the initial trials of the study, the speech recognition ability of middle-aged listeners was inferior to that of younger participants listening at a 3-dB poorer signal-to-noise ratio; this difference disappeared as the experiment progressed. However, younger adults listening at the same signal-to-noise ratio as middle-aged participants scored consistently better throughout the experiment.

Figure 1.

Figure 1.

Speech recognition performance in the presence of a single-talker speech masker, as a function of trial order. Older and middle-aged listeners had a signal-to-noise ratio of −1 dB, as did one group of younger participants (Y −1). The other group of younger participants had a signal-to-noise ratio of −4 dB (Y −4). Data are from Helfer and Jesse (2015).

Conclusion

Results of studies described in this research forum article, as well as work from other researchers (e.g., Başkent, van Engelshoven, & Galvin, 2014; Dubno et al., 2008; Ruggles, Bharadwaj, & Shinn-Cunningham, 2012), provide evidence that substantial changes in speech recognition ability can occur in midlife. These problems are most easily observed in difficult listening situations. The reasons behind this early aging effect have yet to be determined but are likely a combination of lack of audibility due to initial stages of presbycusis, changes in suprathreshold processing of speech sounds, and subtle cognitive decline.

Acknowledgments

Work discussed in this research forum article was supported by National Institute on Deafness and Other Communication Disorders Grants R01 01625 (awarded to Richard Freyman) and R01 012057 (awarded to Karen S. Helfer).

Funding Statement

Work discussed in this research forum article was supported by National Institute on Deafness and Other Communication Disorders Grants R01 01625 (awarded to Richard Freyman) and R01 012057 (awarded to Karen S. Helfer).

References

  1. Alain C., & McDonald K. L. (2007). Age-related differences in neuromagnetic brain activity underlying concurrent sound perception. The Journal of Neuroscience, 27, 1308–1314. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Başkent D., van Engelshoven S., & Galvin J. J. III (2014). Susceptibility to interference by music and speech maskers in middle-aged adults. The Journal of the Acoustical Society of America, 135, EL147–EL153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Demeester K., Topsakal V., Hendrickx J.-J., Fransen E., van Laer L., Van Camp G., … van Wieringen A. (2012). Hearing disability measured by the Speech, Spatial, and Qualities of Hearing Scale in clinically normal-hearing and hearing-impaired middle-aged persons, and disability screening by means of a reduced SSQ (the SSQ5). Ear and Hearing, 33, 615–616. [DOI] [PubMed] [Google Scholar]
  4. Desjardins J. L., & Doherty K. A. (2013). Age-related changes in listening effort for various types of masker noises. Ear and Hearing, 34, 261–272. [DOI] [PubMed] [Google Scholar]
  5. Dirks D. D., Takayanagi S., Moshfegh A., Noffsinger P. D., & Fausti S. A. (2001). Examination of the neighborhood activation theory in normal and hearing-impaired listeners. Ear and Hearing, 22, 1–13. [DOI] [PubMed] [Google Scholar]
  6. Dubno J. R., Lee F.-S., Matthews L. J., Ahlstrom J. B., Horwitz A. R., & Mills J. H. (2008). Longitudinal changes in speech recognition in older persons. The Journal of the Acoustical Society of America, 123, 462–475. [DOI] [PubMed] [Google Scholar]
  7. Felty R. A., Buchwald A., & Pisoni D. B. (2009). Adaptation to frozen babble in spoken word recognition. The Journal of the Acoustical Society of America, 125, EL93–EL97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Gatehouse S., & Noble W. (2004). The Speech, Spatial and Qualities of Hearing Scale (SSQ). International Journal of Audiology, 43, 85–99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. George E. L. J., Festen J. M., & Houtgast T. (2006). Factors affecting masking release for speech in modulated noise for normal-hearing and hearing-impaired listeners. The Journal of the Acoustical Society of America, 120, 2295–2311. [DOI] [PubMed] [Google Scholar]
  10. Grose J. H., & Mamo S. K. (2010). Processing of temporal fine structure as a function of age. Ear and Hearing, 31, 755–760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Gunstad J., Paul R. H., Brickman A. M., Cohen R. A., Arns M., Roe D., … Gordon E. (2006). Patterns of cognitive performance in middle-aged and older adults: A cluster analytic examination. Journal of Geriatric Psychiatry and Neurology, 19, 59–64. [DOI] [PubMed] [Google Scholar]
  12. Helfer K. S., Chevalier J., & Freyman R. L. (2010). Aging, spatial cues, and single- versus dual-task performance in competing speech perception. The Journal of the Acoustical Society of America, 128, 3625–3633. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Helfer K. S., & Freyman R. L. (2008). Aging and speech-on-speech masking. Ear and Hearing, 29, 87–98. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Helfer K. S., & Freyman R. L. (2014). Stimulus and listener factors affecting age-related changes in competing speech perception. The Journal of the Acoustical Society of America, 136, 748–759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Helfer K. S., & Jesse A. (2015). Lexical influences on competing speech perception in younger, middle-aged, and older adults. Manuscript submitted for publication. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Helfer K. S., & Vargo M. (2009). Speech recognition and temporal processing in middle-aged women. Journal of the American Academy of Audiology, 20, 264–271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Humes L. E., & Coughlin M. P. (2009). Aided speech-identification performance in single-talker competition by older adults with impaired hearing. Scandinavian Journal of Psychology, 50, 485–494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Humes L. E., Lee J. H., & Coughlin M. P. (2006). Auditory measures of selective and divided attention in young and older adults using single-talker competition. The Journal of the Acoustical Society of America, 120, 2926–2937. [DOI] [PubMed] [Google Scholar]
  19. Johnsrude I. S., Mackey A., Hakyemez H., Alexander E., Trang H. P., & Carlyon R. P. (2013). Swinging at a cocktail party: Voice familiarity aids speech perception in the presence of a competing voice. Psychological Science, 24, 1995–2004. [DOI] [PubMed] [Google Scholar]
  20. Lee J. H., & Humes L. E. (2012). Effect of fundamental-frequency and sentence-onset differences on speech-identification performance of young and older adults in a competing-talker background. The Journal of the Acoustical Society of America, 132, 1700–1717. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Leigh-Paffenroth E. D., & Elangovan S. (2011). Temporal processing in low-frequency channels: Effects of age and hearing loss in middle-aged listeners. Journal of the American Academy of Audiology, 22, 393–404. [DOI] [PubMed] [Google Scholar]
  22. Lewald J., & Hausmann M. (2013). Effects of sex and age on auditory spatial scene analysis. Hearing Research, 299, 46–52. [DOI] [PubMed] [Google Scholar]
  23. Lister J., Besing J., & Koehnke J. (2002). Effects of age and frequency disparity on gap discrimination. The Journal of the Acoustical Society of America, 111, 2793–2800. [DOI] [PubMed] [Google Scholar]
  24. Morrell C. H., Gordon-Salant S., Pearson J. D., Brant L. J., & Fozard J. L. (1996). Age- and gender-specific reference ranges for hearing level and longitudinal changes in hearing level. The Journal of the Acoustical Society of America, 100, 1949–1967. [DOI] [PubMed] [Google Scholar]
  25. Musiek F. E., Shinn J. B., Jirsa R., Bamiou D.-E., Baran J. A., & Zaida E. (2005). GIN (Gaps-In-Noise) test performance in subjects with confirmed central auditory nervous system involvement. Ear and Hearing, 26, 608–618. [DOI] [PubMed] [Google Scholar]
  26. Nachtegaal J., Smit J. H., Smits C., Bezemer P. D., van Beek J. H. M., Festen J. M., & Kramer S. E. (2009). The association between hearing status and psychosocial health before the age of 70 years: Results from an Internet-based national survey on hearing. Ear and Hearing, 30, 302–312. [DOI] [PubMed] [Google Scholar]
  27. Revill K. P., & Spieler D. H. (2012). The effect of lexical frequency on spoken word recognition in young and older listeners. Psychology and Aging, 27, 80–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Rossi-Katz J., & Arehart K. H. (2009). Message and talker identification in older adults: Effects of task, distinctiveness of the talkers' voices, and meaningfulness of the competing message. Journal of Speech, Language, and Hearing Research, 52, 435–453. [DOI] [PubMed] [Google Scholar]
  29. Ruggles D., Bharadwaj H., & Shinn-Cunningham B. G. (2012). Why middle-aged listeners have trouble hearing in everyday situations. Current Biology, 22, 1417–1422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Sommers M. S., & Danielson S. M. (1999). Inhibitory processes and spoken word recognition in young and older adults: The interaction of lexical competition and semantic context. Psychology and Aging, 14, 458–472. [DOI] [PubMed] [Google Scholar]
  31. Spieler D. H., & Balota D. A. (2000). Factors influencing word naming in younger and older adults. Psychology and Aging, 15, 225–231. [DOI] [PubMed] [Google Scholar]
  32. Takahashi G. A., & Bacon S. P. (1992). Modulation detection, modulation masking, and speech understanding in noise in the elderly. Journal of Speech and Hearing Research, 35, 1410–1421. [DOI] [PubMed] [Google Scholar]
  33. Taler V., Aaron G. P., Steinmetz L. G., & Pisoni D. B. (2010). Lexical neighborhood density effects on spoken word recognition and production in healthy aging. Journals of Gerontology: Series B: Psychological Sciences and Social Sciences, 65, P551–P560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Tun P. A., & Wingfield A. (1999). One voice too many: Adult age differences in language processing with different types of distracting sound. Journals of Gerontology: Series B: Psychological Sciences and Social Sciences, 54, P317–P327. [DOI] [PubMed] [Google Scholar]

Articles from American Journal of Audiology are provided here courtesy of American Speech-Language-Hearing Association

RESOURCES