Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Dec 13.
Published in final edited form as: Appl Sci (Basel). 2019 Jul 28;9(15):3042. doi: 10.3390/app9153042

Vocal Pitch Discrimination in Children with and without Vocal Fold Nodules

Elizabeth S Heller Murray 1,2,*, Anne F Hseu 2, Roger C Nuss 2, Geralyn Harvey Woodnorth 2, Cara E Stepp 1,3,4
PMCID: PMC6910133  NIHMSID: NIHMS1054767  PMID: 31840003

Abstract

Vocal pitch discrimination abilities were compared in sixteen children with vocal fold nodules (CwVN) and sixteen matched controls with typical voices (CwTV). Vocal pitch discrimination was also evaluated in thirty-five vocally healthy children and twenty adults to examine potential changes as a function of maturation. CwTV were categorized as either younger (N = 15, 5.6–7.7 years) or older (N = 20, 8.2–11.7 years). Participants completed two-alternative, forced choice listening tasks in which they judged whether pairs of sustained /α/ tokens were different in pitch. Each pair consisted of a base token with a fundamental frequency fo) of 216.2 Hz and a test token with a fo that was adaptively modified, according to the participant’s prior judgments. There were no significant differences in pitch discrimination abilities between CwVN and CwTV. Pitch discrimination abilities were significantly poorer in younger and older CwTV as compared to adults. Additionally, younger CwTV had significantly poorer discrimination abilities than older CwTV. Findings from this study suggest that CwVN do not have differences in pitch discrimination abilities, yet, therapies designed for CwVN should consider this developmental trend in perceptual abilities.

Keywords: pitch, voice, vocal fold nodules, auditory discrimination

1. Introduction

The presence of a dysphonic, or altered, vocal quality can have a significant negative impact on a child [1-3]. The most common cause of dysphonia in children is vocal fold nodules, benign, callous-like lesions on the vocal folds [4-11]. Vocal fold nodules are suggested to be related to vocal use, vocal misuse, and/or vocal hyperfunction [6,12-17]. Voice therapy is considered the best practice for the treatment of vocal fold nodules in children [18-22], yet it yields varied success rates [23,24]. This is likely due to multiple factors, including: Lack of transparency with regard to therapy, differing outcome measures, lack of understanding of the exact etiology of vocal fold nodules’ development, and an overall shortage of pediatric-focused research within the greater field of voice disorders [25-27].

A recent study has provided a more comprehensive view of pediatric-focused voice therapy by conducting a randomized clinical trial with children with vocal fold nodules (CwVN) ages 6–10 [28]. Authors selected two therapy approaches to compare: (1) ‘Indirect’ therapy, focusing on vocal hygiene and discussion of desired vocal productions, and (2) ‘direct’ therapy, focusing on motor learning tasks. Results indicated that both therapy approaches resulted in improvement of the primary outcome measure, the Pediatric Voice-Related Quality of Life (PVRQOL) [29]. Improvements of secondary outcome measures, including auditory-perceptual evaluation of voice, vocal fold nodule size, acoustic measures, and aerodynamic measures were also shown. Although no significant differences were found between the two therapy types, post hoc analyses indicated that older children (8–10 years) had a larger improvement in their PVRQOL scores for the ‘direct’ therapy, as compared to ‘indirect’ therapy. Authors suggested that these older children may be more cognitively able to learn the new motor patterns during the ‘direct’ therapy sessions and, thus, displayed greater improvements [28].

An alternate (or perhaps, extended) hypothesis for differences seen between older and younger children [28] may be that older children have more developed auditory-perceptual abilities, thus allowing them to better detect, learn, and implement vocal changes. Explicit examination of auditory-perceptual discrimination has not been examined in CwVN, yet previous works suggest that CwVN may have perceptual differences as compared to children without voice disorders [30,31]. Historically, methods of voice therapy for children suggested spending extensive time in therapy on attention and awareness (e.g., [32]), as the supposition was these children were not noticing their own vocal characteristics. More recently, a study examining the parental perception of personality traits suggested that CwVN may have difficulties using detection of small changes as a self-regulation tool, as compared to children without diagnosed speech disorders and children who stutter [31]. Finally, emerging evidence in adults with dysphonic voices indicates that they have poorer pitch discrimination than vocally healthy controls [33]. Overall, these previous studies suggest that explicit examination of auditory-perceptual abilities in CwVN is needed to determine if there are perceptual differences in this population.

Studies examining auditory-perceptual discrimination, typically using pure tones, show that most children do not reach adult-like maturity until at least eight years of age [34-40]. Yet it remains unknown whether mature auditory discrimination is required to efficiently learn and apply new therapy tasks and strategies. Additionally, the ecological validity of evaluating auditory discrimination abilities of pure tones is low, making it difficult to translate these findings to auditory discrimination of changes in the voice. Specifically, pure tones are comprised of a single frequency, whereas the voice has a complex harmonic structure, containing a fundamental frequency as well as multiples of that frequency (i.e., harmonics). To date, only a few studies have examined children’s abilities to discriminate pitch stimuli with more complex stimuli, with findings indicating variable results as to when children become adult-like [40,41]. As such, continued and well-controlled work is necessary to further understand whether discrimination of complex stimuli, such as the voice, changes throughout maturation.

Previous work has discussed the optimal methods of testing auditory processing across the lifespan, with researchers noting that an ideal task is one that requires minimal memory and attentional demands, as those can vary greatly in certain age and/or clinical populations [42]. Additionally, the language used to elicit a response can impact the results: Complex relational language, such as ‘higher’ or ‘lower’ has been shown to result in poorer discrimination as compared to less complex language, such as ‘same’ and ‘different’ [43]. Additionally, attentional abilities of the participant may impact discrimination abilities [35], especially in procedures that have increased attentional demands [36,41,44]. The potential effects of attention on auditory discrimination abilities are especially important to consider when evaluating CwVN, given their personality factors and characteristics discussed in the literature. Specifically, CwVN may also have differences in awareness and attention [45,46]. Lee and colleagues [47] conducted a systematic review to examine personality traits, psychological factors, and behavioral factors that may be related to CwVN. Differing results were found regarding inattentive behaviors;some studies noted that children with voice disorders had higher scores of distractibility [45] and higher parental and teaching ratings of inattentiveness [46]; other studies showed no differences from matched controls [48,49]. Thus, when examining auditory discrimination abilities in children, the experimental methods must be carefully chosen so that potential discrimination differences are not overshadowed by attentional differences.

The current study examined the ability to discriminate vocal pitch using an experimental paradigm with low memory, attention, and language demands. Vocal pitch was selected because it is a salient feature of voice that does not require extensive or complex training to understand. The first hypothesis was that school-age CwVN would have poorer discrimination abilities compared to age- and sex-matched children with typical voices (CwTV), suggestive of a perceptual deficit in CwVN. The second hypothesis was that children under eight years of age would have poorer auditory discrimination abilities compared to older children and adults, similar to trends noted in previous work performed with pure tone stimuli [34-39].

2. Materials and Methods

2.1. Participants

Fifty-one children (5.6–12.4 years) and 20 adults (19–28 years) participated in the current study. Children over 7.0 years of age provided verbal assent, and dissent was respected for children under 7.0 years of age. For all children in the study, guardians provided written consent. All adult participants provided written consent. Consent and assent were obtained in compliance with the Boston University Institutional Review Board (protocol number: 2625) and were approved by the Boston Children’s Hospital Institutional Review Board. All participants were compensated for participation and spoke English as their primary language. Additionally, as the current study was examining pitch discrimination abilities, none of the included participants spoke or heard tonal languages at home.

2.2. Children with Vocal Fold Nodules (CwVN)

Sixteen male CwVN (mean (M) = 9.6 years; range 7.2–12.5 years) were diagnosed with vocal fold nodules using endoscopic examination by a board-certified laryngologist at Boston Children’s Hospital. The typical population that arrives at the clinic for a voice evaluation is male [50]; therefore, this cohort was selected for the current study. CwVN had an average ‘overall severity’ score on the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) [51] of 47.6 (range: 34.2–60.4) as rated by a speech-language pathologist blinded to the participants’ group. Two participants were receiving ongoing speech or language services, one was receiving voice therapy, and one had received speech or language services in the past. At the time of their clinical visits, all participants also received a complete visual examination of their ears by a medical professional as part of routine care to evaluate for potential otological issues. No participants reported having hearing concerns at home or at school, no participants were found to have middle ear disease, and no participants were referred for an audiology follow-up after their appointment. Data collection for 14 of the participants was completed at Boston Children’s Hospital clinic in a quiet examination room. The two remaining participants completed the experiment in a sound-attenuated booth at Boston University and passed an audiometric hearing screening at a threshold of 25 dB hearing level (HL) or better, at the frequencies 0.25 kHz, 0.5 kHz, 1 kHz, 2 kHz, 4 kHz, 6 kHz, and 8 kHz.

2.3. Children with Typical Voices (CwTV)

Sixteen male age-matched CwTV (M = 8.6; range = 6.8–11.7 years; average difference from CwVN = 12.5 months) without voice complaints were recruited to test our first hypothesis that CwVN would have poor discrimination abilities than CwTV. Voice recordings were available for seventeen of the matched CwTV; participants had an average ‘overall severity’ score on the CAPE-V of 16.1 (range: 0–50.7) rated by a speech-language pathologist blinded to the participant’s group. To address the second hypothesis that younger CwTV would have poorer auditory discrimination abilities than older CwTV, nineteen additional CwTV were recruited (for a total 35 CwTV). The 35 CwTV were subsequently divided into a younger (N = 15; M = 6.9 years; range 5.6–7.7 years; 10 male, 5 female) and an older (N = 20; M = 9.5 years; range 8.2–11.7 years; 9 male, 6 female) group. No participants in the CwTV group had received speech or language services within the past year. Four participants had a previous history of speech or language services. Twenty-two participants were recorded at Boston University in a sound-treated booth and passed an audiometric hearing screening at a threshold of 25 dB HL or better, at the frequencies 0.25 kHz, 0.5 kHz, 1 kHz, 2 kHz, 4 kHz, 6 kHz, and 8 kHz. The remaining thirteen participants were recorded offsite in a quiet room at the participants’ homes, and they were unable to participate in a hearing screening. Two children had relevant surgical histories; one received ear tubes and one had an ear wax removal procedure. Parents of all participants reported no current hearing concerns.

2.4. Adults

Twenty vocally healthy adults (M = 21.2 years, range 19–28 years, 10 male, 10 female) comprised an adult control group. One participant reported receiving speech or language services as a child; no other participants reported a history of speech, language, or hearing disorders. Participants were recorded at Boston University in a sound-treated booth. All participants passed an audiometric hearing screening at a threshold of 25 dB HL or better at the frequencies 0.25 kHz, 0.5 kHz, and 1 kHz in both ears. For the frequencies 2 kHz, 4 kHz, 6 kHz, and 8 kHz, 13 participants passed a hearing screening at 25 dB HL or better, while the remaining two participants passed at 30 dB HL or better.

2.5. Stimuli Development and Presentation

The base token used in the study was constructed from an /α/ production by a single male child (9.9 years old) who was not involved in the current study. Acoustic analysis of the selected token was completed in Praat [52] and indicated that average fundamental frequency (fo: 216.2 Hz), jitter (local, 0.37%), shimmer (local, 3.92%) and cepstral peak prominence (17.45 dB) were within the normative range for age and gender [53,54]. A certified speech-language pathologist, blinded to the source of the sample, provided an ‘overall severity’ score of 3 on the CAPE-V, indicating normal vocal quality. The sample was cropped to be 500 milliseconds (ms) in duration from the middle of the /α/. A cosine window with a rise and fall time of 50 ms was added to the beginning and end of the stimulus to avoid the perception of unnatural clicking. The token was played to participants through Sennheiser HD 280 Pro headphones (Sennheiser electronic GmbH and Co., KG, Wedemark, Germany). Participants recorded at Boston University completed their study in a sound-treated booth, with experimental manipulation of pitch performed in near real-time by the Eventide Eclipse hardware (Eventide Inc., Little Ferry, NJ, USA). The Eventide Eclipse performed a full-spectrum shift of the sustained /α/, in which the values and spacing of the harmonics were shifted up [55]. This shift changes the fo of the production, thereby changing the perception of pitch. Participants recorded offsite were presented with previously created pitch shifted stimuli, created with the Eventide Eclipse hardware, on a laptop computer (Lenovo ThinkPad X1 Carbon). Regardless of the equipment used, stimuli were presented at approximately 60–70 dB SPL using a custom MATLAB script (version 8.2; The MathWorks, Inc., Natick, MA, USA).

2.6. Experimental Design

2.6.1. Training

Prior to the experiment, all participants underwent training to ensure they understood the paradigm. Participants were presented with two pairs of sequential pure tones; one pair had two identical 440 Hz tones, and the other pair had one tone at 440 Hz and the second tone at 800 Hz (a difference of 10 semitones (ST) between tones). After each pair, participants were asked if the tones were the same or different. If participants answered incorrectly, they did not proceed on to the experimental portion and were excluded from the study. If they successfully completed the training, participants continued on to the experiment.

2.6.2. Experimental Paradigm

Participants completed a two-alternative forced choice (TAFC) [56] perceptual task in which they judged whether the fo of two tokens was the ‘same’ or ‘different’ for each trial. The average length of a usable task, hereafter called a run, for children was 5.16 min and the average length for adults was 4.17 min. Tokens presented during each trial were developed from a single /α/ production described in the Stimuli Development and Presentation section. During a single trial, participants heard a 500 ms, unmodified base token (average fo of 216.2 Hz) and a 500 ms test token that was either (1) the same unmodified base token (216.2 Hz) or (2) a token with a modifiedfo. The timing between the two tokens in a single trial was 500 ms. Following the presentation of both tokens, the participant stated whether the two tokens were the same or different and the experimenter advanced to the next trial. Token order (i.e., whether the base token was presented first or second) was randomly determined for each trial. Approximately one-third of the trials had two tokens with the samefo, called ‘catch’ trials. These ‘catch’ trials were used to ensure data validity, attention to the task, and understanding of task instructions. The remaining trials were ‘different’ trials, in which the fo value of the test token was adaptively altered based on the participant’s performance, using the stair-step procedure outlined by Levitt [57]. All adults began with a test token that was 0.5 ST, that is, 6.34 Hz, higher than the base token. Based on the known developmental trends reported in the literature for pitch discrimination with pure tones (e.g., 34, 37–40]), the test token was set between 0.5–3 ST (6.34–40.91 Hz) higher than the base token for the children. For each child, the experimenter determined the starting value of the test token according to the child’s age as well as compliance during the training. Prior to implementing the flexible starting position, pilot testing indicated that using a variable starting place did not preclude an individual from reaching an estimate of their pitch discrimination abilities (Figure 1).

Figure 1.

Figure 1.

Experimental runs from two participants. The first participant started both run #1 (solid black line) and run #2 at 1 semitone (ST, dotted black line). The second participant started run #1 at 1 ST (solid blue line) and started run #2 at 0.5 ST (dotted blue line). Red arrows indicate the last four reversals that were calculated for the just-noticeable-difference (JND) for participant #1, run #1.

Step-size magnitude was defined as the magnitude of change in fo during the adaptive procedure. The first 10 trials had a step-size of 0.1 ST and the remaining trials had a smaller step-size of 0.06 ST, regardless of the direction of the change. The experimental paradigm began with a 1-down-1-up TAFC paradigm, wherein the fo of the test token was moved closer to the fo of the base token by a step-size after every correct answer. This was an effort to avoid an extensive number of trials in which the difference in fo between the tokens was either too easy or too difficult for participants to discriminate. Following the first incorrect judgment, the procedure changed to a 2-down-1-up TAFC paradigm, in which two correct judgments in a row were required to elicit a reduction in the difference between the base and test token fo values; a single incorrect judgment still resulted in an increase in the difference between the base and test token fo values. The run ended after either 10 reversals (i.e., a change in direction of the test token, caused by two correct judgments followed by an incorrect judgment, or an incorrect judgment followed by two correct judgments. See four reversals marked with the red arrows in Figure 1), or when 60 trials had been completed. Using this 2-down-1-up TAFC paradigm allowed for estimation of the participant’s discrimination abilities, located at 70.7% accuracy on the psychometric function [57], hereafter, referred to as the just-noticeable-difference (JND) value. Participants completed either one (CwVN, N = 8; CwTV, N = 19; adults, N = 3) or two (CwVN, N = 8; CwTV, N = 16; adults, N = 17) runs, depending on compliance.

2.7. Data Analysis

All statistical analyses were conducted in Minitab [58]. JND values were calculated by averaging the test token fo values of the last four reversals, reported as the difference in ST of the reversal average referenced to the base token (216.2 Hz) using Equation (1).

Semitoneconversion=12log10(Reversalaverage(Hz)216.2Hz216.2Hz)log102 (1)

If a participant completed two runs, the JND values from both runs were averaged together. Two estimates of attention were included: The percent of ‘catch’ trials that were judged correctly, and variability of the reversals, defined as the standard deviation of the last four reversals [35,36]. JND values from a given run were not included if the participant answered less than 60% of the catch trials correctly or if they had less than 6 reversals. A total of sixteen participants (12 CwTV, 1 CwVN, and 3 adults) who were included in the analyses had at least one run that was removed due to having less than 60% correct for the catch trials (N = 10), having less than 6 reversals (N = 5), or equipment error during acquisition (N = 1). Of note, an additional ten children that were run on the experimental paradigm were completely excluded from the analysis (and thus, not described in group descriptions, results, or discussion) because they did not have any usable runs. These participants were either younger CwTV children (N = 8, mean 6.5 years, 5.5–8.0 years; two participants had less than sixty percent correct on catch trials, five participants had less than six reversals, and one participant did not understand the training and therefore did not continue to the task), or CwVN (N = 2, mean 7.8 years, 7.1–8.5 years; two participants had less than six reversals; the parent of one of these participants informed the experimenter after the run that the participant had perfect pitch).

2.7.1. Comparison between CwTV and CwVN Groups

To examine hypothesis one, 16 age-matched male CwTV and CwVN were examined for potential differences in pitch discrimination abilities (JND values) and two estimates of attention (the percent of ‘catch’ trials that were judged correctly and the standard deviation of the last four reversals). Three linear regressions were conducted to evaluate whether there were main effects of age or group (CwTV, CwVN), or an interaction of age and group for either pitch discrimination abilities or estimates of attention

2.7.2. Comparison among Younger CwTV, Older CwTV, and Adult Groups

To evaluate hypothesis two, potential group (younger CwTV, older CwTV, adult) differences among pitch discrimination abilities (JND values) and two estimates of attention (the percent of ‘catch’ trials that were judged correctly and the standard deviation of the last four reversals) were evaluated with three non-parametric Kruskal-Wallis tests. Dunn’s multiple comparison test was used to examine significant group differences, with a family alpha level of 0.2 and a Bonferroni individual alpha level of 0.067.

3. Results

3.1. Comparison between CwTV and CwVN Groups

JND values as a function of age and group are displayed in Figure 2. There were no significant main effects of age or group, nor were there any significant interactions of age × group on JND values, the percent of ‘catch’ trials that were judged correctly, or the standard deviation of the last four reversals (all p > 0.05). Additionally, the participant with a 2.98 ST JND value (Figure 2) was removed as a potential outlier, and all three regressions were repeated. All factors and interactions continued to be not significant (all p > 0.05), thus that subject remained in the dataset.

Figure 2.

Figure 2.

Just-noticeable-difference (JND) values in semitones (ST) for the children with vocal fold nodules (CwVN, green circles) and the age-matched children with typical voices (CwTV, blue circles).

3.2. Comparison among Younger CwTV, Older CwTV, and Adult Groups

Median JND values were significantly different between groups (younger CwTV, older CwTV, and adult, p < 0.001). Post hoc pairwise comparisons indicated the median JND value for the younger CwTV group (0.96 ST) was significantly larger than both the older CwTV (0 .54 ST) and adult (0.25 ST, both p < 0.004) groups (Figure 3). The older CwTV group was also significantly larger than the adult group (p = 0.04). Neither estimate of attention (percent coreecf of ‘catch’ trials, standard deviation of last four reversals) was significantly different between groups (p > 0.05).

Figure 3.

Figure 3.

Medians and 95% confidence intervals for the just-noticeable-difference (JND) values in semitones (ST). Markers indicate median JND values; groups that have significantly different medians at p < 0.05 are indicated by asterisk. Median confidence interval boxes indicated for the younger children with typical voices (CwTV, light blue), older CwTV (dark blue), and adult (black with white stripes) groups.

4. Discussion

4.1. Comparison between CwTV and CwVN Groups

We initially hypothesized that CwVN would have poorer auditory discrimination abilities than CwTV; however, this was not supported by our data. Examination of age-matched CwVN and CwTV indicated that these two groups did not differ based on pitch discrimination abilities. Additionally, estimates of attention were examined in children with and without vocal fold nodules, and there were no significant differences for either estimate of attention between CwTV and CwVN. Thus, although previous work has suggested that CwVN may have attentional differences [31,45,46], these differences appear to manifest at a larger personality-based level that does not translate to the perceptual discrimination of voice. The lack of difference in pitch discrimination abilities is different than what has been noted in adults with dysphonia [33], suggesting potential differences between CwVN and adults with vocal hyperfunction. Heng Tam and colleagues [33] hypothesized that adults with hyperfunction had differences in their discrimination abilities due to an adaptation effect. This suggests that an individual who is consistently listening to their own dysphonic voice may rely less on auditory feedback, as the consistent vocal source they hear is variable. Thereby, this adaptation to a variable mechanism could result in impaired discrimination abilities. The current study hypothesized that CwVN would have poorer discrimination abilities; however, the voices of the CwVN in the current study were not recorded. Therefore, direct relationships cannot be drawn between their dysphonia severity and pitch discrimination abilities. However, it is likely that many CwVN have been dysphonic for less time than an adult with a voice disorder and, thus, may have less of an adaptation effect. Future work examining the relationship between dysphonia severity in CwVN and pitch discrimination abilities is needed to further elucidate this potential connection.

4.2. Comparison among Younger CwTV, Older CwTV, and Adult Groups

Pitch discrimination abilities are frequently examined using pure tone stimuli [34-39], with only a few studies examining children’s abilities to discriminate pitch stimuli with complex harmonic structures [40,41]. Results from the current study are consistent with our second hypothesis, as well as previous literature, demonstrating that pitch discrimination abilities improve, and become more adult-like as a function of development [34-39]. Specifically, younger CwTV (under 8 years of age) had significantly higher JND values than both older CwTV and adults, suggesting poorer pitch discrimination abilities in younger CwTV. Additionally, although older CwTV’s pitch discrimination abilities were improved as compared to younger CwTV, they were still significantly poorer than the adult group. Some children from the younger and older CwTV groups were able to achieve adult-like JND values, suggesting JND is not solely related to age. However, at the group level, both younger and older CwTV had immature perceptual abilities, indicating that auditory discrimination of vocal pitch continues to develop in this time-period.

In addition to pitch discrimination abilities, attention was also considered in the current study, as previous researchers have noted that inattention can have meaningful effects on discrimination abilities [35,40]. To address this, the current paradigm was designed to have: (1) A maximum of 60 trials, (2) flexible starting points in order to begin the run closer to an individual’s anticipated discrimination ability, and (3) a 1-down-1-up procedure to assist participants in quickly moving through the initial portion of the paradigm. Additionally, two estimates of attention were examined during the analysis. The first was the percent correct of ‘catch’ trials, wherein the base and test tokens had the same fo; a lower percent correct value in these trials was considered indicative of decreased attention. The second measure was the standard deviation of the reversals. It has been proposed that when an adaptive stair-step procedure is accurately measuring an individual’s discrimination abilities, there will be lower variability around the JND value; this is contrasted with an individual who is guessing, resulting in higher variability around the JND value (see [35] for additional details). When examining these two estimates of attention in the analysis of JND values, no significant differences between the groups were found for either estimate of attention. Although these findings suggest that inattention was not a significant factor in the increased JND values noted in both younger CwTV and older CwTV, these measures are not a direct measure of attention, but rather estimates tailored to the current study design. Further work would need to examine direct measures of attention, to conclusively determine if different levels of attention impacted the JND values.

4.3. Clinical Implications for Voice Therapy in CwVN

Findings from the current study about the pitch discrimination abilities of CwVN as well as CwTV have implications for conducting voice therapy with children. Many classical voice therapy techniques suggest spending extensive time in therapy on attention and awareness (e.g., [32]); however, results from this study and others [1,3] suggest that this may be not a productive use of time. Instead, current models (i.e., [13]) suggest focusing on motor learning principles and ‘direct’ therapy. In a randomized clinical trial that evaluated the ‘direct’ versus ‘indirect’ voice therapy in CwVN, improvements were seen from both therapies [28]. Yet, when evaluating the difference in outcome measures for ‘direct’ as compared to ‘indirect’ therapy across age, larger effect sizes were seen in children over eight years of age. The authors proposed that this difference may be related to the increased maturity and cognitive development in children over the age of eight. Consistent with this timeframe, the current study also showed differences between younger and older children in auditory discrimination abilities. Therefore, if children in the clinical trial [28] over the age of eight had adult-like discrimination abilities, this may have provided them with an increased ability to hear subtle differences in voices. This ability could subsequently allow them to benefit more from direct therapy. As such, when designing voice therapies for children, especially under the age of eight, speech-language pathologists may want to consider the development of auditory discrimination abilities. Future work should continue to examine children with and without vocal fold nodules to understand the development of perceptual, cognitive, and motor abilities in order to apply these directly to the design and implementation of voice therapy tasks.

4.4. Limitations and Future Directions

Although this study provided insight into pitch discrimination abilities in children, the small number of total participants means that findings should be interpreted with caution. Furthermore, while the CwTV group did not have vocal complaints, they were not endoscopically screened for the absence of a voice disorder, and some participant’s voices were rated as dysphonic; thus we cannot definitively rule out the presence of a voice disorder. Future work is needed with a larger number of children in the CwVN and CwTV groups to confirm the absence of differences in auditory discrimination abilities. Additionally, hearing screenings were not conducted on the majority of the CwVN group, as well as some participants in the CwTV group. Therefore, we cannot rule out potential auditory threshold differences between the groups in the current study. Although there is some emerging evidence of hearing differences between CwVN and children with voice disorders [59,60], these studies examine a relatively small sample size of children with multiple voice disorders. The hearing differences outlined included both temporary conductive hearing loss and sensory neural hearing loss; therefore, further work in a larger group is needed to elucidate whether CwVN have a higher incidence of hearing difficulties than age-matched peers. Some participants in the current study (in both the CwVN and the CwTV groups) had speech or language deficits. Although including these participants makes this group more representative as a sample of the population, it resulted in less homogenous participant groups. Without directly examining speech, language, and cognition, we cannot say whether these groups were distinct in these areas.

Additionally, participants were recorded in different locations; although all participants were recorded in a quiet area, there may have been an unwanted effect based on participant comfort. Specifically, many CwVN were recorded before a medical examination, potentially increasing their anxiety or distractibility. Future work should control these external and participant factors when examining children with and without vocal fold nodules. Due to the nature of the task, it is possible that younger participants could become fatigued towards the end of a run, which could impact their JND scores. This was unlikely to be an issue given the average run times for both children (5.16 min) and adults (4.17 min); however, future work should evaluate JND values for this groups on a shorter paradigm. Finally, the current work focused on discrimination abilities of vocal pitch, and therefore does not provide information on the perception of other vocal percepts common in CwVN, such as breathiness and strain. Vocal pitch was selected, as it is a salient feature of voice that would not require extensive training to learn, thereby reducing the complexity of the task, and making it appropriate for a range of children. Future work should examine whether discrimination abilities for additional vocal percepts differ between CwVN and CwTV.

5. Conclusions

Pitch discrimination abilities were significantly poorer in CwTV younger than eight years of age as compared to older CwTV and adults. Additionally, pitch discrimination abilities continue to be poorer in older children compared to adults. These results suggest that pitch discrimination abilities are still developing, and although the older children were approaching adult-like values, both older and young children continue to have immature perceptual abilities. There were no differences in pitch discrimination abilities between CwTV and CwVN. Additionally, estimates of attention did not differ by age in CwTV, or when comparing CwTV and CwVN groups. Results from this study indicate that CwVN do not have a deficit in auditory discrimination abilities that would preclude them from learning targets during voice therapy. However, the developmental pattern noted for the maturation of pitch discrimination abilities should be considered when designing voice therapy tasks and targets for CwVN.

Acknowledgments:

Thanks to Roxanne Segina, Katharine KoLin, Roseanne Clark, Liza Catallozzi, Karen Thompson, Yeraldi Geronimo, Sydney Kagan, and Victoria McKenna for their assistance.

Funding: This research was funded by two NIH grants from the National Institute of Deafness and Other Communication Disorders, NIH grants DC016197 (PI: Heller Murray) and DC015446 (PI: Hillman).

Footnotes

Conflicts of Interest: The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  • 1.Connor NP; Cohen SB; Theis SM; Thibeault SL; Heatley DG; Bless DM Attitudes of children with dysphonia. J. Voice 2008, 22, 197–209. [DOI] [PubMed] [Google Scholar]
  • 2.Carroll LM; Mudd P; Zur KB Severity of voice handicap in children diagnosed with elevated lesions. Otolaryngol. Head Neck Surg 2013, 149, 628–632. [DOI] [PubMed] [Google Scholar]
  • 3.Verduyckt I; Remacle M; Jamart J; Benderitter C; Morsomme D Voice-related complaints in the pediatric population. J. Voice 2011, 25, 373–380. [DOI] [PubMed] [Google Scholar]
  • 4.McMurray JS Benign lesions of the pediatric vocal folds: Nodules, webs, and cysts In Clinical Management of Children’s Voice Disorders; Hartnick C, Boseley M, Eds.; Plural Publishing: San Diego, CA, USA, 2010; pp. 145–163. [Google Scholar]
  • 5.Shearer WM Diagnosis and treatment of voice disorders in school children. J. Speech Hear. Disord 1972, 37, 215–221. [DOI] [PubMed] [Google Scholar]
  • 6.Shah RK; Harvey Woodnorth G; Glynn A; Nuss RC Pediatric vocal nodules: Correlation with perceptual voice analysis. Int. J. Pediatr. Otorhinolaryngol 2005, 69, 903–909. [DOI] [PubMed] [Google Scholar]
  • 7.Angelillo IF; Di Costanzo B; Costa G; Barillari M; Barillari U Epidemiological study on vocal disorders in paediatric age. J. Prev. Med. Hyg 2008, 49, 1–5. [PubMed] [Google Scholar]
  • 8.Leeper HA Jr.; Leonard JE; Iverson RL Otorhinolaryngologic screening of children with vocal quality disturbances. Int. J. Pediatr. Otorhinolaryngol 1980, 2, 123–131. [DOI] [PubMed] [Google Scholar]
  • 9.Miller SQ; Madison CL Public school voice clinics, part II: Diagnosis and recommendations—A 10-year review. Lang. Speech Hear. Serv. Sch 1984, 15, 58–64. [Google Scholar]
  • 10.Mortensen M; Schaberg M; Woo P Diagnostic contributions of videolaryngostroboscopy in the pediatric population. Arch. Otolaryngol. Head Neck Surg 2010, 136, 75–79. [DOI] [PubMed] [Google Scholar]
  • 11.Kiliç MA; Okur E; Yildirim I; Güzelsoy S The prevalence of vocal fold nodules in school age children. Int. J. Pediatr. Otorhinolaryngol 2004, 68, 409–412. [DOI] [PubMed] [Google Scholar]
  • 12.Watts CR Behavioral voice therapy in school-age children with vocal fold nodules. EBP Briefs 2012, 6, 1–9. [Google Scholar]
  • 13.Verdolini Abbott K Some guiding principles in emerging models of voice therapy for children. Semin. Speech Lang 2013, 34, 80–93. [DOI] [PubMed] [Google Scholar]
  • 14.Verdolini Abbott K; Yee-Key Li N; Hersan R; Kessler L Voice therapy for children In Clinical Management of Children’s Voice Disorders; Hartnick C, Boseley M, Eds.; Plural Publishing: San Diego, CA, USA, 2010; pp. 111–133. [Google Scholar]
  • 15.Woodnorth GH; Nuss RC Pediatric benign vocal fold lesions: A team approach. Perspect. Voice Dis 2009, 19, 105–112. [Google Scholar]
  • 16.Wilson DK Voice Problems of Children; Williams & Wilkins: Philadelphia, PA, USA, 1987. [Google Scholar]
  • 17.Lee E-K; Son Y-I Muscle tension dysphonia in children: Voice characteristics and outcome of voice therapy. Int. J. Pediatr. Otorhinolaryngol 2005, 69, 911–917. [DOI] [PubMed] [Google Scholar]
  • 18.Connelly A; Clement W; Kubba H Management of dysphonia in children. J. Laryngol. Otol 2009, 123, 642–647. [DOI] [PubMed] [Google Scholar]
  • 19.Allen MS; Pettit JM; Sherblom JC Management of vocal nodules: A regional survey of otolaryngologists and speech-language pathologists. J. Speech Lang. Hear. Res 1991, 34, 229–235. [PubMed] [Google Scholar]
  • 20.Signorelli ME; Madill CJ; McCabe P The management of vocal fold nodules in children: A national survey of speech-language pathologists. Int. J. Speech Lang. Pathol 2011, 13, 227–238. [DOI] [PubMed] [Google Scholar]
  • 21.Moran MJ; Pentz AL Otolaryngologists’ opinions of voice therapy for vocal nodules in children. Lang. Speech Hear. Serv. Sch 1987, 18, 172–178. [Google Scholar]
  • 22.Ongkasuwan J; Friedman EM Is voice therapy effective in the management of vocal fold nodules in children? Laryngoscope 2013, 123, 2930–2931. [DOI] [PubMed] [Google Scholar]
  • 23.Deal RE; McClain B; Sudderth JF Identification, evaluation, therapy, and follow-up for children with vocal nodules in a public school setting. J. Speech Hear. Disord 1976, 41, 390–397. [DOI] [PubMed] [Google Scholar]
  • 24.Trani M; Ghidini A; Bergamini G; Presutti L Voice therapy in pediatric functional dysphonia: A prospective study. Int. J. Pediatr. Otorhinolaryngol 2007, 71, 379–384. [DOI] [PubMed] [Google Scholar]
  • 25.Benninger MS Levels of evidence in the voice literature. J. Voice 2011, 25, 653–656. [DOI] [PubMed] [Google Scholar]
  • 26.Benninger MS Quality of the voice literature: What is there and what is missing. J. Voice 2011, 25, 647–652. [DOI] [PubMed] [Google Scholar]
  • 27.Pedersen M; McGlashan J Surgical versus non-surgical interventions for vocal cord nodules. Cochrane Database Syst. Rev 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Hartnick C; Ballif C; De Guzman V; Sataloff R; Campisi P; Kerschner J; Shembel A; Reda D; Shi H; Zacny ES Indirect vs. direct voice therapy for children with vocal nodules: A randomized clinical trial. Arch. Otolaryngol. Head Neck Surg 2018, 144, 156–163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Boseley ME; Cunningham MJ; Volk MS; Hartnick CJ Validation of the pediatric voice-related quality-of-life survey. Arch. Otolaryngol. Head Neck Surg 2006, 132, 717–720. [DOI] [PubMed] [Google Scholar]
  • 30.Andrews ML; Summers AC Voice Treatment for Children and Adolescents; United Nations Publications: New York, NY, USA, 2002. [Google Scholar]
  • 31.Eggers K; Luc F; van den Bergh BR Factorial temperament structure in stuttering, voice-disordered, and typically developing children. J. Speech Lang. Hear. Res 2009, 52, 1610–1622. [DOI] [PubMed] [Google Scholar]
  • 32.Andrews ML Voice Therapy for Children: The Elementary School Years; Longman Publishing Group: Harlow, UK, 1986. [Google Scholar]
  • 33.Heng Tam K; Carding P; Madill CJ The relationship between voice quality and pitch discrimination ability in a population with features of mild vocal hyperfunction. In Proceedings of the 47th Annual Symposium Care of the Pofessional Voice, Philadelphia, PA, USA, 30 May-3 June 2018; Available online: https://voicefoundation.org/wp-content/uploads/2018/04/All-Abstracts_2018.pdf (accessed on 1 September 2018). [Google Scholar]
  • 34.Halliday LF; Taylor JL; Edmondson-Jones AM; Moore DR Frequency discrimination learning in children. J. Am. Acad. Audiol 2008, 123, 4393–4402. [DOI] [PubMed] [Google Scholar]
  • 35.Moore DR; Ferguson MA; Halliday LF; Riley A Frequency discrimination in children: Perception, learning and attention. Hear. Res 2008, 238, 147–154. [DOI] [PubMed] [Google Scholar]
  • 36.Banai K Auditory frequency discrimination development depends on the assessment procedure. J. Basic Clin. Physiol. Pharmacol 2008, 19, 209–222. [DOI] [PubMed] [Google Scholar]
  • 37.Thompson NC; Cranford JL; Hoyer E Brief-tone frequency discrimination by children. J. Speech Lang. Hear. Res 1999, 42, 1061–1068. [DOI] [PubMed] [Google Scholar]
  • 38.Jensen JK; Neff DL Development of basic auditory discrimination in preschool children. Psychol. Sci 1993, 4, 104–107. [Google Scholar]
  • 39.Maxon AB; Hochberg I Development of psychoacoustic behavior: Sensitivity and discrimination. Ear. Hear 1982, 3, 301–308. [DOI] [PubMed] [Google Scholar]
  • 40.Buss E; Flaherty MM; Leibold LJ Development of frequency discrimination at 250 Hz is similar for tone and/ba/stimuli. J. Acoust. Soc. Am 2017, 142, EL150–EL154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Deroche ML; Zion DJ; Schurman JR; Chatterjee M Sensitivity of school-aged children to pitch-related cues. J. Acoust. Soc. Am 2012, 131, 2938–2947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Cacace AT; McFarland DJ Factors influencing tests of auditory processing: A perspective on current issues and relevant concerns. J. Am. Acad. Audiol 2013, 24, 572–589. [DOI] [PubMed] [Google Scholar]
  • 43.Andrews ML; Madeira SS The assessment of pitch discrimination ability in young children. J. Speech Hear. Disord 1977, 42, 279–286. [DOI] [PubMed] [Google Scholar]
  • 44.Sutcliffe P; Bishop D Psychophysical design influences frequency discrimination performance in young children. J. Exp. Child. Psychol 2005, 91, 249–270. [DOI] [PubMed] [Google Scholar]
  • 45.Green G Psycho-behavioral characteristics of children with vocal nodules: WPBIC ratings. J. Speech Hear. Disord 1989, 54, 306–312. [DOI] [PubMed] [Google Scholar]
  • 46.D’Alatri L; Petrelli L; Calò L; Picciotti PM; Marchese MR; Bussu F Vocal fold nodules in school age children: Attention deficit hyperactivity disorder as a potential risk factor. J. Voice 2015, 29, 287–291. [DOI] [PubMed] [Google Scholar]
  • 47.Lee JM; Roy N; Dietrich M Personality, psychological factors, and behavioral tendencies in children with vocal nodules: A systematic review. J. Voice 2018. [DOI] [PubMed] [Google Scholar]
  • 48.Roy N; Holt KI; Redmond S; Muntz H Behavioral characteristics of children with vocal fold nodules. J. Voice 2007, 21, 157–168. [DOI] [PubMed] [Google Scholar]
  • 49.Erdur O; Herguner A; Ozturk K; Kibar E; Elsurer C; Bozkurt MK; Herguner S Attention deficit hyperactivity disorder symptoms in children with vocal fold nodules. Int. J. Pediatr. Otorhinolaryngol 2016, 85, 5–7. [DOI] [PubMed] [Google Scholar]
  • 50.Dobres R; Lee L; Stemple JC; Kummer AW; Kretschmer LW Description of laryngeal pathologies in children evaluated by otolaryngologists. J. Speech Hear. Disord 1990, 55, 526–532. [DOI] [PubMed] [Google Scholar]
  • 51.Kempster GB; Gerratt BR; Abbott KV; Barkmeier-Kraemer J; Hillman RE Consensus auditory-perceptual evaluation of voice: Development of a standardized clinical protocol. Am. J. Speech. Lang. Pathol 2009, 18, 124–132. [DOI] [PubMed] [Google Scholar]
  • 52.Boersma P; Weenink D [Computer program]. In Praat: Doing Phonetics by Computer; Phonetic Sciences: Amsterdam, The Netherlands, 2014. [Google Scholar]
  • 53.Infusino SA; Diercks GR; Rogers DJ; Garcia J; Ojha S; Maurer R; Bunting G; Hartnick CJ Establishment of a normative cepstral pediatric acoustic database. Arch. Otolaryngol. Head Neck Surg 2015, 141, 358–363. [DOI] [PubMed] [Google Scholar]
  • 54.Maturo S; Hill C; Bunting G; Ballif C; Maurer R; Hartnick C Establishment of a normative pediatric acoustic database. Arch. Otolaryngol. Head Neck Surg 2012, 138, 956–961. [DOI] [PubMed] [Google Scholar]
  • 55.Heller Murray ES; Lupiani AA; Kolin KR; Segina RK; Stepp CE Pitch shifting with the commercially available eventide eclipse: Intended and unintended changes to the speech signal. J. Speech Lang. Hear. Res 2019, 62, 2270–2279. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Macmillan NA; Creelman CD Adaptive Methods for Estimating Empirical Thresholds In Detection Theory: A User’s Guide; Cambridge University Press: New York, NY, USA, 1991; pp. 183–208. [Google Scholar]
  • 57.Levitt H Transformed up-down methods in psychoacoustics. J. Am. Acad. Audiol 1971, 49, 467–477. [PubMed] [Google Scholar]
  • 58.Minitab Inc. Minitab Statistical Software; Minitab Inc.: State College, PA, USA, 2012. [Google Scholar]
  • 59.Nagy A; Elshafei R; Mahmoud S Correlating undiagnosed hearing impairment with hyperfunctional sysphonia. J. Voice 2019. [DOI] [PubMed] [Google Scholar]
  • 60.Baker BM; Baker CD; Le HT Vocal quality, articulation and audiological characteristics of children and young adults with diagnosed allergies. Ann. Otol. Rhinol. Laryngol 1982, 91, 277–280. [DOI] [PubMed] [Google Scholar]

RESOURCES