Abstract
Objective
High voice users (individuals who demonstrate excessive or loud vocal use) are at risk for developing voice disorders. The objective of this study was to examine, both acoustically and perceptually, vocal changes in healthy speakers following an acute period of high voice use.
Methods
Members of a university women’s volleyball team (N=12) were recorded a week prior (Pre) and week following (Post) the 10-week spring season; N=6 control speakers were recorded over the same time period for comparison. Speakers read four sentences, which were analyzed for relative fundamental frequency (RFF). Eight naïve listeners participated in an auditory-perceptual visual sort and rate (VSR) task, in which they rated each voice sample’s overall severity and strain.
Results
No significant differences were found as a function of time point in the VSR ratings for the volleyball group. Onset cycle 1 RFF values were significantly lower (p = 0.04) in the Post recordings of the volleyball participants compared to Pre recordings, but there was no significant difference (p = 0.20) in offset cycle 10 RFF values. Receiver operating characteristic analyses indicated moderate sensitivity and specificity of onset cycle 1 RFF for discrimination between the volleyball and control participants. Changes were not apparent in the control group as a function of time for either, onset cycle 1 RFF, offset cycle 10 FF, or either vocal attribute.
Conclusion
Onset cycle 1 RFF may be an effective marker for detecting vocal changes over an acute high voice use period of time before perceptual changes are noted.
Introduction
Voice disorders, which are often caused or exacerbated by improper voice use, can have negative social and emotional consequences.1 Many individuals who present at voice clinics due to vocal misuse are diagnosed with vocal hyperfunction.2 Vocal hyperfunction is a “condition of abuse and/or misuse of the vocal mechanism due to excessive and/or imbalanced muscular forces.”3(p373) This presence of heightened muscle tension often causes the voice to be perceived as strained3 and can be especially problematic for individuals who rely heavily on their voices throughout the day, such as teachers, singers, aerobics instructors, and lawyers.1,2,4–6 In fact, approximately 30% of the working population in the United States has an occupation that requires a substantial amount of voice use and approximately 3% of the population has an occupation in which adequate vocal abilities are important for public safety7. High voice users are at an increased risk for developing a voice disorder. 1,2,4–6 Much of the previous work has focused on identifying individuals who are at high risk for developing voice disorders, but less work has examined the vocal changes after acute periods of such high voice use (i.e. excessive or loud vocal use over a period of time). The goal of this study is to examine acoustic and perceptual changes in the vocal characteristics of members of a university women’s volleyball team, who display a unique model of high voice use over a relatively short period of time.
Teaching is a high voice use profession that has received a large amount of attention with respect to the risk of developing voice disorders.1,2,4,5,8–12 Specifically, teachers have been shown to have a higher risk for experiencing voice disorders than non-teachers4 and many teachers report that their profession has an adverse impact on their voices.12 In addition to the high vocal load required for teaching, the suboptimal acoustics of the classroom can also have a detrimental impact on teachers’ voices. An increase in environmental noise has been associated with an increase in vocal symptoms, as teachers are required to speak louder in order to be heard over the noise.8,10 Not surprisingly, in comparison to classroom teachers, physical education teachers have a further increased probability of developing a voice disorder, even independent of gender, age, and daily number of teaching hours.12 The indoor physical education settings have a substantial amount of environmental noise, over which physical education teachers are required to constantly project their voices in order to conduct their classes.13 One study following physical education student teachers across a semester found that ratings of voice quality and vocal fatigue were both increased in the middle and the end of the semester compared to the beginning.14 Like physical education teachers, aerobics instructors also conduct classes in loud environments and have reported experiencing both acute and chronic vocal difficulties.15,16 A survey of aerobics and group fitness instructors revealed an association between duration of exercise instruction and vocal problems.17 It is hypothesized that these instructors are at risk for developing a voice disorders due to the necessity to raise their voices, potentially in an inefficient manner, while teaching their classes.18
The participants in sporting events are a group of high voice users who have received less attention in the literature. Like teachers and instructors, members of athletic teams must regularly communicate in high-noise environments in order to coordinate game-play. In addition, some sporting activities, such as a tennis serve or a golf drive, are often accompanied by repetitive yelling and/or grunting, and this forceful adduction of the vocal folds places a large demand on the vocal mechanism that occurs over an acute period of time in a repetitive fashion.1,19 For example, club tennis players may have a higher concentration of voice use during their tennis seasons than they do during other parts of the year. Due to their risk for developing voice disorders, cheerleaders are one group of athletes that have received attention in the literature. Surveys of high school cheerleaders noted that 32% – 86% reported difficulty with their voices, ranging from mild dysphonia to aphonia at the time of the cheerleading event.20 Relative to their peers, cheerleaders are more likely to experience chronic vocal problems and to be diagnosed with voice disorders.21
The benefit of examining athletes’ vocal changes rests in the acute nature of their vocal trauma. Athletes who utilize their voices during sporting events have temporary, but consistent patterns of voicing. This provides an advantage over examining teachers, who may exhibit more chronic patterns of potential vocal misuse and/or abuse. The present study examined members of a university club sports team before and after an athletic season, allowing examination of potential changes in acoustic and perceptual measures over a short 10-week time period. Acoustic analysis focused on changes in relative fundamental frequency (RFF), which has been shown to correlate with subjective ratings of strain,22,23 and has been implicated in individuals with voice disorders to be associated with vocal hyperfunction.24 RFF is measured from a vowel-voiceless consonant-vowel speech sample, focusing on the 10 cycles of the vowel waveform directly preceding the voiceless consonant (offset) and the 10 cycles of the vowel waveform directly following the voiceless consonant (onset). Previous research has indicated that the tenth offset cycle (offset cycle 10) and first onset cycle (onset cycle 1) RFF values in individuals with vocal hyperfunction are lower than healthy controls.23,24 However, previous work has not examined the utilization of RFF to detect small changes in the voices of participants undergoing high voice use. Perceptual assessment will focus on whether listeners can detect changes in the overall severity and strain. Thus, this study will examine both RFF and subjective impressions of overall severity and strain in individuals before (Pre) and after (Post) a 10-week indoor volleyball season.
We hypothesized that, due to the high voice use that occurred during the volleyball season, 1) the offset cycle 10 and onset cycle 1 of players would have a lower RFF in the recordings taken Post season than in Pre season, and 2) listeners would rate the vocal samples recorded from players Post season as having higher overall severity and strain than those recorded Pre season.
Methods
Participants
All participants completed written consent in compliance with the Boston University Institutional Review Board and were compensated for their participation.
Speakers
Participants were 18 young adults (15 female); all were native speakers of American English. Twelve participants (volleyball group) were members of the Boston University women’s volleyball club team (M = 20.0 years, SD = 1.5 years; 11 players and 1 coach). One speaker reported seeing a speech-language pathologist as a child because of difficulty saying the phoneme /r/. Another speaker reported that it was previously recommended that she receive an evaluation for a possible voice disorder, but as her voice did not bother her, she did not follow the recommendation. As neither of these participants had a diagnosed voice disorder, they were included in the analyses. No other participants reported any speech, language, or hearing disorders. All participants in the volleyball group were recorded the week prior to the beginning of the spring volleyball season (Pre) and returned approximately one week following the completion of the 10-week season (Post; M = 81.8 days, SD = 9.2 days). Although many speakers in the volleyball group reported experiencing episodes of acute phonotrauma (e.g., severe hoarseness and/or aphonia) during the period between the Pre and Post recordings, no speakers were experiencing these symptoms at the time of either recording. For qualitative comparison, six additional individuals (three female; M = 20.8 years, SD = 1.5 years) who were not participants in club sports formed a control group and were also recorded over the same time period as the volleyball season. They returned for a Post recording an average of 85.0 days after the Pre recording (SD = 17.4 days).
Listeners
Eight young adults (four female; M = 19.8 years, SD = 6.4 years) acted as listeners. The listeners were native speakers of American English and reported no prior history of speech, language, or hearing disorders. Clinical training for perceptual judgment of vocal qualities can result in listeners who have their own approaches to rating voices due to their own personal experiences and training with previous research indicating that the perceptual judgments between naïve raters are more consistent as compared to raters who are experienced listeners.25 Therefore, listeners who had no prior experience with or coursework in voice disorders, formal exposure to individuals with voice disorders, or experience using rating scales for judging dysphonia were recruited. All listeners passed a hearing screening at 25 dB SPL for the frequencies 500 Hz, 1000 Hz, 2000 Hz, and 4000 Hz in both ears.
Stimuli collection and preparation
The speakers were recorded reading four sentences at two time points (Pre and Post), for a total of 144 speech samples (18 speakers × 4 sentences × 2 time points = 144 speech samples). The speech samples were recorded in a sound-treated room using a headset microphone (Shure model WH20, Niles, IL) placed at a 45° angle from the mouth, with one of two soundcards, a Presonus Firepod FP10 (PreSonus Audio Electronics, Inc., Baton Rouge, LA) or a Komplete Audio 6 (Native Instruments, Los Angeles, CA). All samples were recorded at a sampling frequency of 44,100 Hz using Audacity (Audacity Software, Pittsburgh, PA).26 Following the acquisition of all the speech samples, nine speech samples were discarded due to misarticulations. The remaining 135 speech samples were normalized by peak amplitude using MATLAB (Mathworks, Natick, MA)27 and used as stimuli for the listening task.
Listening task
Listeners participated in an auditory-perceptual task, rating overall severity of dysphonia and strain using a visual sort and rate paradigm (VSR).28 During the VSR task, listeners were shown sets of eight or nine dots on a screen, with each dot corresponding to a speaker saying one sentence (see Fig. 1, upper). Participants were instructed to listen to all of the speech samples and drag the dots vertically along a scale from 0 to 100, with 0 representing the least severe (or strained) voice they could imagine and 100 representing the most severe (or strained) voice they could imagine. Listeners were encouraged to first sort all of the dots into the general area they believed the speech sample belonged, facilitating a natural grouping of speech samples with similar severities. Then listeners were asked to rate all of the speech samples by moving the dots vertically along the 0 to 100 scale (see Fig. 1, lower).
In preparation for the auditory-perceptual task, four pilot listeners who had experience listening to disordered voices rated the overall severity and strain for each speech sample using the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V).29 These preliminary ratings ranged from 0.5 to 18.6 for overall severity and 0 to 15.3 for strain. Using these preliminary ratings for overall severity, the speech samples were divided into three groups, characterized qualitatively as 1) 67 mild overall severity samples (range 0.5–3.9), 2) 61 mild-moderate overall severity samples (range 4.0–9.8), and 3) 34 moderate overall severity samples (range 10.0–18.6). To assess intra-rater reliability in the perceptual task, 20% of the speech samples were duplicated, resulting in a total of 162 speech samples (135 speech samples + 27 speech samples repeated for reliability). The 162 speech samples were then divided into 20 sets to be used for the overall severity listening task: 18 sets of 8 speech samples and 2 sets of 9 speech samples. Sets were designed so that each one would mirror the qualitative overall severity distributions (mild, mild-moderate, moderate) of the entire group. This same procedure was followed using the pilot strain ratings in order to obtain 20 sets for the strain listening task. Similar to overall severity, the strain speech samples were divided into three groups, characterized qualitatively as 1) 79 mild strain samples (range 0–1.9), 2) 56 mild-moderate strain samples (range 2.0–5.9), and 3) 27 moderate strain samples (range 6.3–15.3).
Upon constructing of the sets, the perceptual experiment was conducted. Eight listeners were given operational definitions of the two vocal attributes overall severity of dysphonia and strain. Overall severity was defined as ‘a comprehensive measure of how good or poor the voice is’30 (p3017) and strain was defined as ‘the amount of vocal effort perceived in the voice sample.’31 (p1001) To familiarize participants to the two vocal attributes prior to the listening task, listeners were provided with exemplars32 of mild and severe speech samples for both overall severity and strain. All exemplars and the VSR stimuli were presented at a comfortable loudness through headphones (Senneheiser model HD280 pro, Old Lyme, CT). After listening to the exemplars, listeners participated in the VSR task28 using the sets created from the pilot listeners. Participants could listen to the speech samples as many times as they felt necessary and could take a break at any point if they needed to. The presentation of the overall severity sets and the strain sets were counterbalanced, with each group of sets taking approximately 30–45 minutes. Overall severity ratings and strain ratings for each speech sample were averaged across all eight listeners, resulting in one overall severity rating and one strain rating per speech sample. Pearson’s correlation coefficients were calculated to determine a measure of listener intra-rater reliability for both overall severity (average r = 0.67, range r = 0.37–0.93) and for strain (average r = 0.55, range r = 0.23–0.79). A type (2, k) interclass correlation (ICC)33 was calculated as a measure of inter-rater reliability; it was 0.98 for overall severity and 0.99 for strain.
Acoustic analysis
The relative fundamental frequency (RFF) was measured for each speech sample. An RFF instance is defined as a vowel, followed by a voiceless consonant, followed by a vowel (see Fig. 2). RFF analysis can be conducted on a variety of voiceless consonants, however previous work has indicated that sentences with the /ʃ/ and /f/ phonemes have the lowest level of intra-speaker variability and therefore those phonemes are the most appropriate to use for RFF estimation.34 Thus, the four sentences that were chosen as stimuli each contained three RFF instances of interest: two sentences had RFF instances with the center phoneme /ʃ/ and two had RFF instances with the center phoneme /f/ (see Table 1). Averaging across a minimum of six RFF instances is necessary to provide a stable estimation of RFF.34 Therefore this study examined 12 RFF instances per speaker at each time point (three RFF instances from each of the four sentences).22,21 Analysis of RFF was consistent with previous work34 and was conducted by the first author. The acoustic waveform of the RFF instance was viewed in Praat (Praat Software, Amsterdam, Netherlands)35,34 and the 10 vocal cycles of the vowel preceding the voiceless consonant (offset) and the 10 vocal cycles of the vowel following the voiceless consonant (onset) were selected (see Fig. 2).
Table 1.
Stimuli | International Phonetic Alphabet (IPA) transliteration |
---|---|
We showed Nell my shiny new shoe bin | /iʃoʊ/, /ɑɪʃɑɪ/, /uʃu/ |
The dew shimmered over my shiny blue shell again | /uʃI/, /ɑɪʃɑɪ/, /uʃε/ |
Nelly found new fabric while Ray fell down | /ifɑʊ/, /ufæ/, /eɪfε/ |
Only we feel you do fail in new fallen dew | /ifi/, /ufe/, /ufɔ/ |
For each of those 20 vocal cycles, the instantaneous fundamental frequency was calculated by taking the inverse of the period and converting it to semitones (ST) relative to the reference frequency. This conversion to ST was accomplished utilizing Equation 1,36 in which f is the frequency of the measured cycle and fref is the reference frequency. As in previous studies, the reference frequency selected was the cycle furthest away from the voiceless consonant (e.g. offset cycle 1 for all offset cycles or onset cycle 10 for all onset cycles), as those cycles were most likely to capture a steady state portion of the vowel. These reference frequencies were utilized to normalize the values, in order to allow for comparison across individuals with different base fundamental frequencies. RFF instances were excluded if there were glottalizations or less than 10 vocal cycles. Additionally, RFF instances were excluded if the RFF values for offset cycle 2 and onset cycle 9 (the vocal cycles directly adjacent to the reference cycle) had an absolute value greater than 0.8 ST, indicating that the reference cycle was not located in a steady state portion of the surrounding voicing. After exclusions there were an average of 9.8 RFF instances per participant, per time period, which were utilized for data analysis. In order to calculate a Pearson’s correlation coefficient for both intra-rater and inter-rater reliability of RFF, the first author repeated 20% of the samples (intra-rater, r = 0.97) one month later, and the third author analyzed the same speech samples (inter-rater, r = 0.91).
Equation 1 |
Statistical Analysis
Statistical analyses were conducted in Minitab (Minitab Inc., State College, PA).37 RFF changes in the control group, which were obtained over the same time period as the volleyball group, were examined qualitatively. To examine whether listeners perceived changes in vocal quality from the Pre recordings to the Post recordings for the speakers in the volleyball group, two paired-sample t-tests were conducted on the VSR ratings, one for overall severity and one for strain. Similarly, to examine the changes in RFF values from Pre recordings to Post recordings in the members of the volleyball group, two paired-sample t-tests were conducted on RFF offset cycle 10 and onset cycle 1 values, the two cycles furthest away from the reference cycles (the steady state portion of the vowel), which have been previously shown to differentiate between individuals with vocal hyperfunction and healthy controls.23,24
Receiver operating characteristic (ROC) curves were generated from four different measures to look at how effectively changes (Post-Pre) could be utilized to correctly classify participants as members of either the volleyball group and the control group: 1) RFF values for offset cycle 10, 2) RFF values for onset cycle 1, 3) VSR ratings for overall severity, and 4) VSR ratings for strain. ROC curves for RFF measures were generated by finding the sensitivity and specificity for thresholds of RFF changes (Post-Pre) between −4 ST and 3 ST at every 0.001 ST. The ROC curves that utilized VSR ratings were generated by finding the sensitivity and specificity thresholds of perceptual changes (Post-Pre) between −8 to 13, using a step size of 0.001. Sensitivity was defined as the true-positive rate, or the proportion of people in the volleyball group correctly identified as belonging to the volleyball group. Specificity was defined as the true-negative rate, or the proportion of people in the control group who were correctly identified as not belonging to the volleyball group. Using sensitivity and specificity values, the maximum positive likelihood ratio (LR+) and its associated negative likelihood ratio (LR−) were calculated to determine the potential for using changes in the four different measures to differentiate speakers as members of the volleyball or control group. A positive likelihood ratio (LR+), calculated as , is the confidence that a change in a measure across a given threshold indicates a member of the volleyball group. A negative likelihood ratio (LR−), calculated as , is the confidence that a change in a measure across a given threshold indicates a member of the control group. Finally, calculation of ROC curves resulted in area under the curve (AUC) values, providing additional information about how effectively each measure could discriminate between individuals in the volleyball group and individuals in the control group, with an AUC of 0.5 reflecting chance.38
Results
Onset cycle 1 RFF values from the volleyball group were significantly lower (T = 2.3, p = 0.04) in the Post recordings (M = 2.5 ST, SD = 1.6 ST) compared to Pre recordings (M = 3.2 ST, SD = 1.6 ST; see Fig 3). However, there was no significant difference (T = −1.4, p = 0.20) in RFF offset cycle 10 values between Post (M = −0.6 ST, SD = 1.0 ST) and Pre (M = −0.9 ST, SD = 1.2 ST). The control group participants displayed very minimal changes as function of time point in RFF for both onset cycle 1 (Pre: M = 2.9 ST, SD = 0.7 ST; Post: M = 2.9 ST, SD = 0.4 ST) and offset cycle 10 (Pre: M = −1.0 ST, SD = 0.8 ST; Post: M = −0.7 ST, SD = 1.1 ST; see Fig. 3).
Perceptual ratings of overall severity and strain were highly correlated (r = 0.84). Paired sample t-tests were conducted to examine differences as function of time point in the perceptual ratings of overall severity and strain in the volleyball group. There was no significant difference between Pre (M = 40.3, SD = 8.5) and Post (M = 44.0, SD = 6.7) for overall severity (T = −1.7, p = 0.12) or between Pre (M = 41.0, SD = 6.0) and Post (M = 42.6, SD = 5.5) for strain (T = −1.3, p = 0.29). Although no statistical comparisons were made, both overall severity and strain in the control group displayed very minimal changes as a function of time point (see Fig. 4).
ROC curves were generated for changes (Post-Pre) in onset cycle 1 and offset cycle 10 RFF (see Fig. 5, upper). For changes in onset cycle 1 RFF, the AUC was 0.71, which is greater than chance (0.50). The maximum LR+ was 3.5, (occurring at −0.68 ST, with 0.83% specificity and 0.58% sensitivity) and the associated LR− was 0.50. Therefore a RFF change (decrease) in onset cycle 1 of −0.68 ST from Pre to Post was the optimal threshold to differentiate members of the volleyball and control groups. The ROC curve generated for offset cycle 10 RFF changes indicated weaker discrimination (see Fig. 5, upper). The AUC was approximately chance (0.56), with a LR+ of 2.5 (occurring at −0.11 ST, with 0.83% specificity and 0.42% sensitivity) and an associated LR− of 0.70. ROC curves were also generated for changes (Post-Pre) in VSR ratings for the vocal attributes of overall severity and strain (see Fig. 5, lower). The AUC for both vocal attributes was just over chance, with an AUC of 0.61 for strain and 0.65 for overall severity. The maximum LR+ for strain was 3.0, (occurring at 2.60, with 0.83% specificity and 0.50% sensitivity) and the associated LR− was 0.06. The maximum LR+ for overall severity was 3.5, (occurring at 3.55, with 0.83% specificity and 0.58% sensitivity) and the associated LR− was 0.50.
Discussion
Overall, this study demonstrated that onset cycle 1 RFF values decreased after an acute period of high voice use in healthy individuals. These acoustic changes were noted in the absence of clear perceptual changes. Results from this study suggest that onset cycle 1 RFF may be used as a marker for high voice use over an acute period of time and could potentially indicate individuals are misusing and/or abusing their voices even before perceptual changes are noted.
Comparison with previous literature for onset cycle 1
Onset cycle 1 RFF values from Pre and Post recordings from the control group and Pre recordings of the volleyball group were comparable to previously reported RFF values for healthy participants (2.8 ST to 3.8 ST; See Fig 6). 23,24 However, the onset cycle 1 RFF values from the Post recordings of the volleyball group (M = 2.5 ST) were lower and more similar to those previously found in individuals with vocal hyperfunction (1.9 ST to 2.5 ST).23,24,39 The changes in onset cycle 1 RFF values in the volleyball group’s Post recordings suggest that there may have been increased vocal strain in the volleyball group that did not occur in the control group. These changes may be attributed to the high voice use that took place during the volleyball season. Potentially, individuals who engage in high voice activities have an increased opportunity to, and therefore increased risk of, abuse and/or misuse of their voices, especially if they are not utilizing their voices effectively and efficiently. This study provides evidence that individuals who are high voice users may have voicing patterns similar to individuals with vocal hyperfunction: onset cycle 1 RFF values from the volleyball group were lower in the Post recordings, with values that were similar to previously reported data from individuals with vocal hyperfunction.
Onset cycle 1 as a marker for high voice use in a healthy population
Although, this is one of the first papers to discuss RFF changes over an acute period of high voice use, there is a precedent for examining RFF over a short time period. Over a relatively brief period of voice therapy (averaging 10 sessions), RFF values from individuals with vocal hyperfunction have been shown to increase in both offset cycle 10 and onset cycle 1.39 Although onset cycle 1 values increased, they were still lower than those of healthy participants, an effect suggested to be due to the detection of a lingering hyperfunctional pattern.39 The current study offers further evidence that onset cycle 1 RFF can change over an acute period of time.
The ROC curve for changes (Post-Pre) in onset cycle 1 RFF had an AUC of 0.71, indicating that utilizing RFF onset cycle 1 values as a measure of change could differentiate between individuals in the volleyball and control groups at a better than chance rate (an AUC of 0.50). The measure of change in RFF onset cycle 1 had an LR+ score of 3.5, indicating moderate confidence that individuals with a decrease in RFF values of at least 0.68 ST between Pre and Post assessments were a member of the volleyball group. However, the LR− score for the change in RFF onset cycle 1 was 0.5, which indicates insufficient evidence to be confident that an individual with a decreased RFF onset cycle was not a member of the control group.40 Although the confidence levels were not strong, the ability to differentiate healthy individuals (control group) from healthy individuals with high voice use (volleyball group) provides potentially valuable information. Specifically, even in a group of healthy individuals, onset cycle 1 RFF may be a marker for high voice use over short time periods and could potentially indicate that these individuals are misusing and/or abusing their voices and therefore are at risk for developing voice disorders.
Overall severity and strain judgments over an acute period of voice use
Overall severity and strain perceptual judgments were highly correlated (r = .84), which indicates that listeners’ perceptions of overall severity were heavily related to the strain perceived in the samples. There were no significant differences found between Pre and Post recordings for either vocal attribute. However, although no statistical tests were conducted, qualitative differences were noted between the VSR ratings of the control group and the volleyball group, with higher (more severe) average VSR ratings for both vocal attributes for the volleyball group relative to the control group. ROC curves for the perception of changes for both overall severity and strain resulted in AUC curves around 0.6, close to chance. This result indicates that the changes in the perception of overall severity and strain were not effective measures of differentiating between individuals in the volleyball and control groups. Interestingly, significant changes in onset cycle 1 RFF values were noted despite this lack of detection for qualitative differences in vocal quality.
Although perceptual measures are the gold standard for judgments of vocal quality,41 they may not be sensitive to the changes that occur during acute periods of high voice use and may potentially fail to capture vocal abuse and/or misuse behaviors during these periods. Early detection of these vocally abusive behaviors is extremely beneficial for individuals who demonstrate vocal patterns of abuse and/or misuse, as the earlier behavioral patterns are detected, the easier they are to change during therapeutic interventions.19 Additionally, if a high voice user does develop a pathology from her vocal misuse and/or abuse, the earlier it is detected, the more pliable and responsive the pathology is to therapy.19 Therefore, the ability of onset cycle 1 RFF to detect small changes over an acute period of high voice use could have a substantial positive effect on therapeutic outcomes.
Limitations and future work
During the course of this study, the amount of voice use of each participant over the 10-week period between the Pre and Post recordings was not controlled. Although the volleyball participants reported increased vocal use during games and practices, voice use was not measured during these or other activities. Individuals in the volleyball group may have coincidentally misused and/or abused their voices in typical daily use in addition to the voice use related to volleyball, although no such behavior was reported. Future work will examine changes in onset cycle 1 RFF longitudinally in a group of self-reported high voice users and a group of self-reported low voice users utilizing ambulatory monitoring devices (e.g.42,43). RFF has been shown to be calculated accurately from a neck placed accelerometer44 and the use of ambulatory monitoring could provide a non-invasive method of documenting vocal loads from both groups of individuals, potentially providing further insight into the relationship between voice use and changes in onset cycle 1 RFF.
This study also unexpectedly demonstrated that offset cycle 10 increased from the Pre recordings to the Post recordings. However, this pattern was noted in both groups and results from the ROC analysis revealed an AUC of 0.56, indicating that the changes (Post-Pre) of offset cycle 10 was not a viable measure to differentiate between individuals in the volleyball and control groups. Given that changes in offset cycle 10 were noted in both the volleyball and control group, it is unlikely to be related to the differences in voice use between the two, but rather may be due to potential environmental or social changes that could have affected all participants. For instance, the Pre recordings were conducted in winter, when there is generally higher heat and less humidity indoors due to heating systems, and the Post recordings were conducted in spring. It is possible that the changes in offset cycle 10 RFF are mirroring the differences in hydration level based on the season of the recording, with overall lower RFF values noted in the dryer climates associated with indoors during the winter season. These external drying agents in winter can be dehydrating, which can have negative phonatory effects.31,45,46 Specifically, dehydration and exposure to low relative humidity have been shown to raise phonation threshold pressure, which is a measure of the subglottal pressure that is required for vocal fold vibration during phonation. 31,45–48 Additionally, hydration has been shown to have an effect on perceived phonatory effort, with an increase in hydration corresponding to a decrease in perceived phonatory effort.31,48 Future research to examine potential changes in offset cycle 10 RFF at varying hydration levels in both individuals with voice disorders and healthy participants will facilitate better characterization of any potential relationship between hydration and RFF.
Conclusion
This study was the first to characterize RFF changes over an acute period of high voice use. Results do not indicate that offset cycle 10 is an effective maker for vocal misuse and/or abuse over a relatively short period of time. However, small changes in onset cycle 1 were detected in the absence of perceived qualitative changes of overall severity or strain by listeners. Group data as well as ROC analysis at the individual level suggest the possibility of onset cycle1 acting as an effective marker for vocal misuse and/or abuse over an acute high voice use time period. As individuals with vocal hyperfunction often display vocal abuse and /or misuse3 RFF is a promising measure to identify individuals who may be at increased risk for developing a voice disorder.
Acknowledgments
The authors wish to thank Kate Girouard, Yu-An Stephanie Lien, and Timothy Streeter. This work was supported by grants DC012651 and DC04663 from the National Institute on Deafness and Other Communication Disorders (NIDCD).
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Ramig LO, Verdolini K. Treatment efficacy: voice disorders. J Speech Lang Hear Res. 1998;41(1):S101–S116. doi: 10.1044/jslhr.4101.s101. [DOI] [PubMed] [Google Scholar]
- 2.Roy N, Merrill RM, Gray SD, Smith EM. Voice disorders in the general population: prevalence, risk factors, and occupational impact. Laryngoscope. 2005;115(11):1988–1995. doi: 10.1097/01.mlg.0000179174.32345.41. [DOI] [PubMed] [Google Scholar]
- 3.Hillman RE, Holmberg EB, Perkell JS, Walsh M, Vaughan C. Objective assessment of vocal hyperfunction: an experimental framework and initial results. J Speech Lang Hear Res. 1989;32(2):373–392. doi: 10.1044/jshr.3202.373. [DOI] [PubMed] [Google Scholar]
- 4.Roy N, Merrill RM, Thibeault S, Gray SD, Smith EM. Voice disorders in teachers and the general population: effects on work performance, attendance, and future career choices. J Speech Lang Hear Res. 2004;47(3):542–551. doi: 10.1044/1092-4388(2004/042). [DOI] [PubMed] [Google Scholar]
- 5.Verdolini K, Ramig LO. Review: occupational risks for voice problems. Log Phon Vocol. 2001;26(1):37–46. [PubMed] [Google Scholar]
- 6.Williams NR. Occupational groups at risk of voice disorders: a review of the literature. Occup Med. 2003;53(7):456–460. doi: 10.1093/occmed/kqg113. [DOI] [PubMed] [Google Scholar]
- 7.Titze IR, Lemke J, Montequin D. Populations in the US workforce who rely on voice as a primary tool of trade: a preliminary report. J Voice. 1997;11(3):254–259. doi: 10.1016/s0892-1997(97)80002-1. [DOI] [PubMed] [Google Scholar]
- 8.Kristiansen J, Lund SP, Persson R, Shibuya H, Nielsen PM, Scholz M. A study of classroom acoustics and school teachers’ noise exposure, voice load and speaking time during teaching, and the effects on vocal and mental fatigue development. Int Arch Occup Environ Health. 2014;87(8):1–10. doi: 10.1007/s00420-014-0927-8. [DOI] [PubMed] [Google Scholar]
- 9.Martins RH, Pereira ER, Hidalgo CB, Tavares EL. Voice disorders in teachers. a review. J Voice. 2014;28(6):716–724. doi: 10.1016/j.jvoice.2014.02.008. [DOI] [PubMed] [Google Scholar]
- 10.Simberg S, Sala E, Vehmas K, Laine A. Changes in the prevalence of vocal symptoms among teachers during a twelve-year period. J Voice. 2005;19(1):95–102. doi: 10.1016/j.jvoice.2004.02.009. [DOI] [PubMed] [Google Scholar]
- 11.Thibeault SL, Merrill RM, Roy N, Gray SD, Smith EM. Occupational risk factors associated with voice disorders among teachers. Ann Epidemiol. 2004;14(10):786–792. doi: 10.1016/j.annepidem.2004.03.004. [DOI] [PubMed] [Google Scholar]
- 12.Smith E, Kirchner HL, Taylor M, Hoffman H, Lemke JH. Voice problems among teachers: differences by gender and teaching characteristics. J Voice. 1998;12(3):328–334. doi: 10.1016/s0892-1997(98)80022-2. [DOI] [PubMed] [Google Scholar]
- 13.Ryan S, Mendel LL. Acoustics in physical education settings: the learning roadblock. Phys Educ Sport Pedag. 2010;15(1):71–83. [Google Scholar]
- 14.Grillo EU, Fugowski J. Voice characteristics of female physical education student teachers. J Voice. 2011;25(3):e149–e157. doi: 10.1016/j.jvoice.2009.12.001. [DOI] [PubMed] [Google Scholar]
- 15.Newman C, Kersner M. Voice problems of aerobics instructors: implications for preventative training. Log Phon Vocol. 1998;23(4):177–180. [Google Scholar]
- 16.Rumbach AF. Vocal problems of group fitness instructors: prevalence of self-reported sensory and auditory-perceptual voice symptoms and the need for preventative education and training. J Voice. 2013;27(4):524.e511–524. e521. doi: 10.1016/j.jvoice.2013.01.016. [DOI] [PubMed] [Google Scholar]
- 17.Heidel SE, Torgerson JK. Vocal problems among aerobic instructors and aerobic participants. J Commun Disord. 1993;26(3):179–191. doi: 10.1016/0021-9924(93)90007-w. [DOI] [PubMed] [Google Scholar]
- 18.Long J, Williford HN, Olson MS, Wolfe V. Voice problems and risk factors among aerobics instructors. J Voice. 1998;12(2):197–207. doi: 10.1016/s0892-1997(98)80039-8. [DOI] [PubMed] [Google Scholar]
- 19.Colton RH, Casper JK, Leonard R. Understanding Voice Problems: A Physiological Perspective for Diagnosis and Treatment. 4. Baltimore, MD: Lippincott Williams & Wilkins; 2011. [Google Scholar]
- 20.Reich A, McHenry M, Keaton A. A survey of dysphonic episodes in high-school cheerleaders. Lang Speech Hear Serv Sch. 1986;17(1):63–71. [Google Scholar]
- 21.Andrews M, Shank KH. Some observations concerning the cheering behavior of school-girl cheerleaders. Lang Speech Hear Serv Sch. 1983;14(3):150–156. [Google Scholar]
- 22.Eadie TL, Stepp CE. Acoustic correlate of vocal effort in spasmodic dysphonia. Ann Otol Rhinol Laryngol. 2013;122:169–176. doi: 10.1177/000348941312200305. [DOI] [PubMed] [Google Scholar]
- 23.Stepp CE, Sawin DE, Eadie TL. The relationship between perception of vocal effort and relative fundamental frequency during voicing offset and onset. J Speech Lang Hear Res. 2012;55(6):1887–1896. doi: 10.1044/1092-4388(2012/11-0294). [DOI] [PubMed] [Google Scholar]
- 24.Stepp CE, Hillman RE, Heaton JT. The impact of vocal hyperfunction on relative fundamental frequency during voicing offset and onset. J Speech Lang Hear Res. 2010;53(5):1220–1226. doi: 10.1044/1092-4388(2010/09-0234). [DOI] [PubMed] [Google Scholar]
- 25.Kreiman J, Gerratt BR, Precoda K, Berke GS. Individual differences in voice quality perception. J Speech Lang Hear Res. 1992;35(3):512–520. doi: 10.1044/jshr.3503.512. [DOI] [PubMed] [Google Scholar]
- 26.Audactiy, the Free, Cross-Platform Sound Editor (version 2.0.3) [computer program] 2013 [Google Scholar]
- 27.Matlab R2013a [computer program]. Version 8.1.0.604. Natick, MA: The Mathworks, Inc; 2013. [Google Scholar]
- 28.Granqvist S. The visual sort and rate method for perceptual evaluation in listening tests. Log Phon Vocol. 2003;28(3):109–116. doi: 10.1080/14015430310015255. [DOI] [PubMed] [Google Scholar]
- 29.Kempster GB, Gerratt BR, Abbott KV, Barkmeier-Kraemer J, Hillman RE. Consensus auditory-perceptual evaluation of voice: development of a standardized clinical protocol. Am J Speech Lang Pathol. 2009;18(2):124–132. doi: 10.1044/1058-0360(2008/08-0017). [DOI] [PubMed] [Google Scholar]
- 30.Eadie TL, Doyle PC. Direct magnitude estimation and interval scaling of pleasantness and severity in dysphonic and normal speakers. J Acoust Soc Am. 2002;112(6):3014–3021. doi: 10.1121/1.1518983. [DOI] [PubMed] [Google Scholar]
- 31.Verdolini K, Titze IR, Fennell A. Dependence of phonatory effort on hydration level. J Speech Lang Hear Res. 1994;37(5):1001–1007. doi: 10.1044/jshr.3705.1001. [DOI] [PubMed] [Google Scholar]
- 32.Voice Disorders: Simulations and Games. Board of Regents of the University of Wisconsin System; 2008. [Accessed March 1, 2013]. http://csd.wisc.edu/slpgames/ [Google Scholar]
- 33.Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86(2):420. doi: 10.1037//0033-2909.86.2.420. [DOI] [PubMed] [Google Scholar]
- 34.Lien Y-AS, Gattuccio CI, Stepp CE. Effects of phonetic context on relative fundamental frequency. J Speech Lang Hear Res. 2014;57(4):1259–1267. doi: 10.1044/2014_JSLHR-S-13-0158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Praat: doing phonetic by computer(Version 5.3.82) [computer program] Retrieved from http://www.praat.org/2014.
- 36.Baken RJ. Clinical measurement of speech and voice. Austin, Texas: Pro-Ed; 1987. [Google Scholar]
- 37.Minitab Statistical Software [computer program]. Version 16.2.3. State College: Pennsylvania State University; 2012. [Google Scholar]
- 38.Portney L, Watkins M. Foundations of clinical research: applications to practice. 3. Upper Saddle River, New Jersey: Peason Education; 2009. [Google Scholar]
- 39.Stepp CE, Merchant GR, Heaton JT, Hillman RE. Effects of voice therapy on relative fundamental frequency during voicing offset and onset in patients with vocal hyperfunction. J Speech Lang Hear Res. 2011;54(5):1260–1266. doi: 10.1044/1092-4388(2011/10-0274). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Dollaghan C. The handbook for evidence-based practice in communication disorders. Baltimore, Maryland: Paul H. Brookes Publishing Co., Inc; 2007. [Google Scholar]
- 41.Kreiman J, Gerratt BR, Kempster GB, Erman A, Berke GS. Perceptual evaluation of voice quality: review, tutorial, and a framework for future research. J Speech Hear Res. 1993;36(1):21–40. doi: 10.1044/jshr.3601.21. [DOI] [PubMed] [Google Scholar]
- 42.Hillman RE, Heaton JT, Masaki A, Zeitels SM, Cheyne HA. Ambulatory monitoring of disordered voices. Ann Oto Rhinol Laryngol. 2006;115(11):795–801. doi: 10.1177/000348940611501101. [DOI] [PubMed] [Google Scholar]
- 43.Bottalico P, Pavese L, Astolfi A, Hunter EJ. Voice accumulation and voice disorders in primary school teachers. J Acoust Soc Am. 2014;136(4):2294–2294. [Google Scholar]
- 44.Lien Y-AS, Stepp CE. Comparison of voice relative fundamental frequency estimates derived from an accelerometer signal and low-pass filtered and unprocessed microphone signals. J Acoust Soc Am. 2014;135(5):2977–2985. doi: 10.1121/1.4870488. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Levendoski EE, Sundarrajan A, Sivasankar MP. Reducing the negative vocal effects of superficial laryngeal dehydration with humidification. Ann Otol Rhinol Laryngol. 2014;123(7):475–781. doi: 10.1177/0003489414527230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Sivasankar M, Erickson E, Schneider S, Hawes A. Phonatory effects of airway dehydration: preliminary evidence for impaired compensation to oral breathing in individuals with a history of vocal fatigue. J Speech Lang Hear Res. 2008;51(6):1494–1506. doi: 10.1044/1092-4388(2008/07-0181). [DOI] [PubMed] [Google Scholar]
- 47.Tanner K, Roy N, Merrill RM, Elstad M. The effects of three nebulized osmotic agents in the dry larynx. J Speech Lang Hear Res. 2007;50(3):635–646. doi: 10.1044/1092-4388(2007/045). [DOI] [PubMed] [Google Scholar]
- 48.Verdolini-Marston K, Sandage M, Titze IR. Effect of hydration treatments on laryngeal nodules and polyps and related voice measures. J Voice. 1994;8(1):30–47. doi: 10.1016/s0892-1997(05)80317-0. [DOI] [PubMed] [Google Scholar]