Abstract
A recent study in individuals with hyperfunctional voice disorders (HVDs) reported poorer fo discrimination compared to matched controls. In that study, fo discrimination was measured using each individual's own voice as the stimulus, which may have introduced confounds given the differences in vocal quality between the groups. Here, this paper addressed this possibility by using an fo discrimination task where participants with and without HVDs judged the same external voice. In line with the previous study, the HVD group exhibited poorer fo discrimination than controls, providing convergent evidence for a pitch-related auditory deficit in this population.
1. Introduction
Hyperfunctional voice disorders (HVDs) are the most frequently occurring type of voice disorder (Bhattacharyya, 2014) and account for up to 40% of referrals to multidisciplinary voice clinics (Roy, 2003). Traditionally HVDs have been associated with increased or imbalanced muscle tension (Hillman et al., 2020), consistent with the “hyperfunctional” nomenclature, but the mechanisms behind this impaired voice production are not fully understood. Voice use (Altman et al., 2005; Van Houtte et al., 2011), psychosocial factors and personality (Roy et al., 2000), and compensation for pre-existing pathologies (Hillman et al., 2020) have all been implicated. There is also growing evidence suggesting auditory function as a contributing factor in HVDs (Abur et al., 2021; Abur et al., 2023; Nagy et al., 2020; Nguyen et al., 2022; Ramos et al., 2018). Current models of vocal motor control, in which auditory feedback is a key component, illustrate how auditory deficits may result in persistent improper vocal control (Houde et al., 2014; Weerathunge et al., 2022). However, the empirical data supporting this idea are rather limited. In addition to its theoretical importance, a better understanding of the role of auditory acuity in HVDs could motivate new treatment approaches that target voice perception in addition to voice production.
The most compelling evidence to date for a role for auditory acuity in HVDs comes from a recent study by Abur et al. (2021). They examined the ability to use auditory feedback to update feedforward voice control in a large population of individuals with HVDs and reported a greater frequency of atypical responses relative to control participants. They also assessed fundamental frequency (fo) discrimination, using each individual's own voice as the stimulus, and found that participants in the HVD group needed larger differences in fo to reliably notice a difference between two vowels (i.e., they had worse acuity) than controls. Although the use of own-voice stimuli ensured that the auditory task was directly related to each participant's auditory feedback, it also introduced a confound given the overall differences in vocal quality between the groups. Thus, this methodological choice may have influenced the results and/or increased variability in the data. In the current study, we revisited this issue in a new large sample of participants, this time using the same external vowel stimulus for all participants. In doing so, we sought to provide convergent evidence for a pitch-related auditory deficit in individuals with HVDs. Following Abur et al. (2021), our recruitment and analysis approach considered whether or not participants were singers. Singers have an increased risk of developing HVDs due to high voice use (Roy et al., 2005) and are also more likely to have better pitch sensitivity as a result of vocal training compared to non-singers (Watts et al., 2005). Thus, it is important to account for the influence of singing experience on pitch discrimination to avoid this factor obscuring differences between groups with and without HVDs.
2. Methods
2.1. Participants
A total of 120 individuals participated. The HVD group consisted of 17 non-singers (5 male,1 12 female; age 39 ± 13 years) and 43 singers (5 male, 38 female; age 26 ± 10 years). The control group consisted of 17 non-singers (5 male, 12 female; age 39 ± 15 years) and 43 singers (5 male, 38 female; age 23 ± 5 years). Participants were classified as singers based on self-report (with their singing experience spanning professional, semiprofessional, student and teaching capacities).
All individuals with HVDs were diagnosed by a laryngologist based on a comprehensive voice evaluation in collaboration with speech-language pathology at the Massachusetts General Hospital Voice Center. None of the recruited individuals with HVDs had a history of neurological disorders or other speech, language, and hearing disorders. Control participants reported no history of neurological, voice, speech, language, or hearing disorders.
All participants were screened to ensure that they had normal (≤25 dB hearing level) pure-tone thresholds at standard audiometric frequencies from 500 to 4000 Hz. Screening was done using a model 109 or model 119 audiometer (Beltone Electronics Corp., Chicago, IL).
2.2. Procedures
These data were collected as part of a larger study focused on ambulatory monitoring of voice production in individuals with and without HVDs. The pitch discrimination task was conducted in a sound-attenuated booth. Stimuli were presented via a Komplete Audio 6 soundcard (Native Instruments GmbH, Berlin, Germany) and processed with an Eventide Eclipse V4 Harmonizer (Eventide, Inc., Little Ferry, NJ) before delivery via Sennheiser HD 280 Pro headphones (Sennheiser, Wedemark, Germany).
Participants were presented with pairs of vowels and asked to judge whether the two stimuli sounded the “same” or “different” in terms of their pitch. The reference stimulus was a 500-ms segment of a steady vowel (/ɑ/) produced by a female talker with a mean fo of 239 Hz. Prior to the start of the task, participants were played samples and asked to set the volume to a comfortable listening level; this level was not adjusted after the start of the task. Each trial consisted of one reference stimulus (the original recording) and one comparison stimulus with a shift in fo (see Fig. 1). A cosine taper was applied to both stimuli such that the first and last 10 ms of each sample were equal to parts of a phase-shifted cosine. The order of the two stimuli was randomized, and the delay between them was 500 ms.
Fig. 1.
Schematic showing the stimulus design and trial structure.
The fo shift on the comparison stimulus was varied according to a standard adaptive procedure (García-Pérez, 1998), designed to find the fo discrimination “threshold,” or the fo shift at which a participant can reliably judge whether the stimuli are the same or different. Following Abur et al. (2021), the initial shift was +60 cents, with a 4-cent change following two correct responses (decreasing) or one incorrect response (increasing). The adaptive track was terminated after ten reversals or 60 trials, whichever occurred first. Thresholds were estimated by calculating the average fo shift values in cents across the last four reversals, where the adaptive track had stabilized. Intermixed with the adaptive staircase were “catch trials” in which the reference stimulus was played twice to ensure attention to the task. Catch trials made up 20% of all trials, and responses to them were analyzed at the completion of the adaptive track. Catch trial accuracy was high in both groups (averages of 94% and 95% in the control and HVD groups, respectively).
3. Results
Figure 2 shows individual and mean discrimination thresholds for the two groups (controls and HVD), separately for non-singers and singers. The first point to note is that there is substantial individual variability within each group and substantial overlap in the distributions across the four groups. A two-way analysis of covariance was conducted with a dependent variable of fo discrimination threshold, independent variables of group (control, HVD) and singer status (non-singers, singers) as well as their interaction, and a covariate of age. This analysis revealed significant effects of group [F(1,115) = 20.49, p < 0.001, η2 = 0.15], with the HVD group having poorer thresholds than controls, and singer status [F(1,115) = 17.33, p < 0.001, η2 = 0.13], with non-singers having poorer thresholds than singers. There was also a significant interaction [F(1, 115) = 5.42, p = 0.022, η2 = 0.045], with the non-singers showing a larger mean difference between control and HVD groups than the singers. Age did not have a significant relationship with thresholds (p > 0.05).
Fig. 2.
Individual and mean fundamental frequency (fo) discrimination thresholds (in cents) for non-singers (left) and singers (right) for the group with hyperfunctional voice disorders (HVDs) and the control group. Error bars show 95% confidence intervals.
The significant interaction between group and singer status reflects a difference in the magnitude of the group effect, which is also apparent in Fig. 2. In non-singers, the average fo discrimination thresholds were 25.5 cents [standard deviation (SD) = 15.7 cents] in controls and 45.1 cents (SD = 24.4 cents) in speakers with HVDs, for an overall group effect of 19.6 cents. In singers, the average fo discrimination thresholds were 18.9 cents (SD = 7.5 cents) in controls and 25.0 cents (SD = 12.8 cents) in speakers with HVDs, for an overall group effect of 6.1 cents. However, follow-up Student's t tests with a Bonferroni adjustment found significant differences between control and HVD groups for both non-singers [T(58) = 2.70, adjusted p value (padj) = 0.009, d = 0.59] and singers [T(58) = 2.67, padj = 0.01, d = 0.98]. These analyses reveal that, despite the larger absolute difference found in non-singers, the effect size was in fact smaller in non-singers than in singers, attributable to the much higher variability in non-singers.
4. Discussion
The purpose of the current study was to compare fo discrimination for an external voice stimulus in listeners with and without HVDs. The results revealed a robust fo discrimination deficit in individuals with HVDs, in line with the findings of Abur et al. (2021). The current study provides convergent evidence for a pitch-related deficit in individuals with HVDs. An important modification made to the task in the current study was to employ the same external vowel stimulus for all participants, instead of using each participant's own production, to remove a potential source of bias and minimize stimulus-related variability. Discrimination thresholds were broadly consistent with those reported in Abur et al. (2021). In our control group, average values were 19 cents for singers and 26 cents for non-singers. These values are slightly lower than in the previous study (25 and 43 cents for singers and non-singers, respectively). In our HVD group, average thresholds were 25 cents for singers and 45 cents for non-singers. These values are again slightly lower than in the previous study (31 and 60 cents for singers and non-singers, respectively). Lower thresholds in the current study may reflect both differences in the stimuli and different cohorts of participants. Regarding participants, for example, we note that inclusion criteria related to hearing thresholds were slightly stricter in the current study. SDs were generally smaller in the current study than in the previous study, which we attribute primarily to the use of a consistent stimulus across participants. These overall results align with those of Abur et al. (2021) in that singers had substantially better pitch perception than non-singers overall, yet within singer and non-singer groups, individuals with HVDs had worse acuity than their respective controls.
Together, the current results and those of Abur et al. (2021) provide striking evidence that HVDs are associated with deficits in auditory processing (specifically fo discrimination) regardless of whether it is a self-produced or externally produced voice. These findings bolster a small number of other observations in the literature of pitch-related deficits in individuals with voice disorders (Nguyen et al., 2022; Ramos et al., 2018). Importantly, in our sample all participants passed a hearing screening, potentially ruling out peripheral hearing loss as an underlying cause (as postulated previously by Nagy et al., 2020). However, due to this inclusion criterion, we were unable to compare differences in audiometric thresholds between HVD and control groups; additionally, the typical hearing screening did not include 8 kHz or “extended high frequencies” (>8 kHz), which have shown promise at identifying subclinical hearing loss (Lough and Plack, 2022). Thus, it is currently unclear if the auditory profile of individuals with HVDs included peripheral hearing loss. Likewise, we do not know if the deficit is specific to pitch discrimination or extends to other suprathreshold auditory abilities such as loudness discrimination, modulation detection, and gap detection. A broader assessment of peripheral and central auditory function in this population is needed to understand the nature and scope of any deficit [see, for example, Naunheim et al. (2019)]. A more detailed auditory profile would also be valuable for interpreting auditory feedback-based vocal control in HVDs in terms of current theoretical models (Parrell et al., 2019).
Further investigations are also needed to link pitch discrimination abilities to specific speech motor control processes. In the study of Abur et al. (2021), participants also completed pitch perturbation tasks intended to assess sensorimotor vocal learning. The results demonstrated that impaired sensorimotor learning was more common in the HVD group and also that this impaired sensorimotor learning was associated with pitch discrimination thresholds. They concluded that both atypical sensorimotor vocal learning and impaired voice production in HVDs may have an underlying basis in auditory function. Understanding the specificity of the auditory deficits in HVDs is an important next step to developing and testing a theoretical model for why these deficits result in the core symptoms of increased vocal effort and poor voice quality.
Acknowledgments
These data were collected by Katherine Marks, Laura Toles, and Jameson Cooper at the Center for Laryngeal Surgery and Voice Rehabilitation at the Massachusetts General Hospital. The authors would like to thank them and the larger team, including Robert E. Hillman, Daryush Mehta, and Jarrad Van Stan. This work was supported by Grant Nos. DC013017 (C.E.S.), DC015760 (V.B.), and DC015446 (R. E. Hillman) from the National Institute on Deafness and Other Communication Disorders.
Footnotes
Gender information was not collected.
Author Declarations
Conflict of Interest
The authors have no conflicts to disclose.
Ethics Approval
All procedures were approved by the Mass General Brigham and Boston University Institutional Review Boards. Participants provided informed consent and were paid for their participation.
Data Availability
These data were collected at Mass General Brigham, which is not allowed to give access to data without the principal investigator for the human studies protocol first submitting a protocol amendment to request permission to share the data with a specific collaborator on a case-by-case basis. This policy is based on strict rules dealing with the protection of patient data and information. Anyone wishing to request access to the data used for this study must contact Sarah DeRosa (sederosa@partners.org), Program Coordinator for Research and Clinical Speech-Language Pathology, Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital.
References
- 1.Abur, D., Hillman, R. E., and Stepp, C. E. (2023). “Auditory-motor function pre- and post-therapy in hyperfunctional voice disorders: A case series,” J. Voice. (published online). 10.1016/j.jvoice.2023.08.017 [DOI] [PubMed] [Google Scholar]
- 2.Abur, D., Subaciute, A., Kapsner-Smith, M., Segina, R. K., Tracy, L. F., Noordzij, J. P., and Stepp, C. E. (2021). “Impaired auditory discrimination and auditory-motor integration in hyperfunctional voice disorders,” Sci. Rep. 11(1), 13123. 10.1038/s41598-021-92250-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Altman, K. W., Atkinson, C., and Lazarus, C. (2005). “Current and emerging concepts in muscle tension dysphonia: A 30-month review,” J. Voice 19(2), 261–267. 10.1016/j.jvoice.2004.03.007 [DOI] [PubMed] [Google Scholar]
- 4.Bhattacharyya, N. (2014). “The prevalence of voice problems among adults in the United States,” Laryngoscope 124(10), 2359–2362. 10.1002/lary.24740 [DOI] [PubMed] [Google Scholar]
- 5.García-Pérez, M. A. (1998). “Forced-choice staircases with fixed step sizes: Asymptotic and small-sample properties,” Vis. Res. 38(12), 1861–1881. 10.1016/S0042-6989(97)00340-4 [DOI] [PubMed] [Google Scholar]
- 6.Hillman, R. E., Stepp, C. E., Van Stan, J. H., Zañartu, M., and Mehta, D. D. (2020). “An updated theoretical framework for vocal hyperfunction,” Am. J. Speech Lang. Pathol. 29(4), 2254–2260. 10.1044/2020_AJSLP-20-00104 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Houde, J., Niziolek, C., Kort, N., Agnew, Z., and Nagarajan, S. (2014). “Simulating a state feedback model of speaking,” in 10th International Seminar on Speech Production (10th ISSP), Cologne, Germany, pp. 202–205, available at http://www.issp2014.uni-koeln.de/. [Google Scholar]
- 8.Lough, M., and Plack, C. J. (2022). “Extended high-frequency audiometry in research and clinical practice,” J. Acoust. Soc. Am. 151(3), 1944–1955. 10.1121/10.0009766 [DOI] [PubMed] [Google Scholar]
- 9.Nagy, A., Elshafei, R., and Mahmoud, S. (2020). “Correlating undiagnosed hearing impairment with hyperfunctional dysphonia,” J. Voice 34(4), 616–621. 10.1016/j.jvoice.2019.02.002 [DOI] [PubMed] [Google Scholar]
- 10.Naunheim, M. L., Yung, K. C., Schneider, S. L., Henderson‐Sabes, J., Kothare, H., Mizuiri, D., Klein, D. J., Houde, J. F., Nagarajan, S. S., and Cheung, S. W. (2019). “Vocal motor control and central auditory impairments in unilateral vocal fold paralysis,” Laryngoscope 129(9), 2112–2117. 10.1002/lary.27680 [DOI] [PubMed] [Google Scholar]
- 11.Nguyen, D. D., Chacon, A. M., Novakovic, D., Hodges, N. J., Carding, P. N., and Madill, C. (2022). “Pitch discrimination testing in patients with a voice disorder,” J. Clin. Med. 11(3), 584. 10.3390/jcm11030584 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Parrell, B., Lammert, A. C., Ciccarelli, G., and Quatieri, T. F. (2019). “Current models of speech motor control: A control-theoretic overview of architectures and properties,” J. Acoust. Soc. Am. 145(3), 1456–1481. 10.1121/1.5092807 [DOI] [PubMed] [Google Scholar]
- 13.Ramos, J. S., Feniman, M. R., Gielow, I., and Silverio, K. C. A. (2018). “Correlation between voice and auditory processing,” J. Voice 32(6), 771.E25–771.E36. 10.1016/j.jvoice.2017.08.011 [DOI] [PubMed] [Google Scholar]
- 14.Roy, N. (2003). “Functional dysphonia,” Curr. Opin. Otolaryngol. Head Neck Surg. 11(3), 144–148. 10.1097/00020840-200306000-00002 [DOI] [PubMed] [Google Scholar]
- 15.Roy, N., Bless, D. M., and Heisey, D. (2000). “Personality and voice disorders: A multitrait-multidisorder analysis,” J. Voice 14(4), 521–548. 10.1016/S0892-1997(00)80009-0 [DOI] [PubMed] [Google Scholar]
- 16.Roy, N., Merrill, R. M., Gray, S. D., and Smith, E. M. (2005). “Voice disorders in the general population: Prevalence, risk factors, and occupational impact,” Laryngoscope 115(11), 1988–1995. 10.1097/01.mlg.0000179174.32345.41 [DOI] [PubMed] [Google Scholar]
- 17.Van Houtte, E., Van Lierde, K., and Claeys, S. (2011). “Pathophysiology and treatment of muscle tension dysphonia: A review of the current knowledge,” J. Voice 25(2), 202–207. 10.1016/j.jvoice.2009.10.009 [DOI] [PubMed] [Google Scholar]
- 18.Watts, C., Moore, R., and McCaghren, K. (2005). “The relationship between vocal pitch-matching skills and pitch discrimination skills in untrained accurate and inaccurate singers,” J. Voice 19(4), 534–543. 10.1016/j.jvoice.2004.09.001 [DOI] [PubMed] [Google Scholar]
- 19.Weerathunge, H. R., Alzamendi, G. A., Cler, G. J., Guenther, F. H., Stepp, C. E., and Zañartu, M. (2022). “LaDIVA: A neurocomputational model providing laryngeal motor control for speech acquisition and production,” PLoS Comput. Biol. 18(6), e1010159. 10.1371/journal.pcbi.1010159 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
These data were collected at Mass General Brigham, which is not allowed to give access to data without the principal investigator for the human studies protocol first submitting a protocol amendment to request permission to share the data with a specific collaborator on a case-by-case basis. This policy is based on strict rules dealing with the protection of patient data and information. Anyone wishing to request access to the data used for this study must contact Sarah DeRosa (sederosa@partners.org), Program Coordinator for Research and Clinical Speech-Language Pathology, Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital.


