Abstract
Introduction
In speech and singing, the intrinsic laryngeal muscles set the pre-phonatory posture prior to the onset of phonation. The timing and shape of the pre-phonatory glottal posture can directly affect the resulting phonation type. We investigated the dynamics of human laryngeal phonatory posturing.
Methods
Onset of vocal fold adduction to phonation was observed in 27 normal subjects using high-speed video recording. Subjects were asked to utter a variety of phonation types (modal, breathy, pressed, /i/ following sniff). Digital videokymography (DVK) with concurrent acoustic signal was analyzed to assess the timing of the following: onset of adduction to final phonatory posture (FPT), phonation onset time (POT), and phonatory posture time (PPT). Final phonatory posture time was determined as the moment at which the laryngeal configuration used in phonation was first achieved.
Results
33 audio-visual recordings met inclusion criteria. Average FPT, PPT, and POT were as follows: 303, 106, and 409 ms for modal; 430, 104, and 534 ms for breathy; 483, 213, and 696 ms for pressed; and 278, 98 and 376 ms for sniff-/i/. The following posturing features were observed: (1) Pressed phonation: increased speed of closure just prior to final posture, complete glottal closure, and increased supraglottic hyperactivity (SGH), and (2) Breathy phonation: decreased speed of closure prior to final posture, increased posterior glottal gap (PGG), and increased mid-membranous gap (MMG).
Conclusions
Phonation onset latency was shortest for modal, and longest for pressed voice. These findings are likely explained by glottal resistance and subglottal pressure requirements.
Keywords: Speech Production, Phonation Onset, Digital Video Kymogram, Laryngeal Posture
INTRODUCTION
The human larynx must reliably execute multiple tasks including airway maintenance for respiration, airway protection during swallowing, execution of the cough reflex to clear the airway of secretions and foreign particles, and phonation for speech and other forms of communication. All of these complex functions are dependent on coordinated contraction of the intrinsic laryngeal muscles (ILMs), and are often dysfunctional in disease states such as neurodegenerative diseases (e.g. Parkinson's), laryngeal hypofunction (e.g. vocal fold paralysis), and hyperfunction (e.g. spasmodic dysphonia)1-6. Thus, investigations of the dynamics of laryngeal posturing further our understanding of all laryngeal functions.
Laryngeal posturing for voice production begins with neuromuscular activation that sets the pre-phonatory posture (glottal adduction, length, width, stiffness, and tension)6-7, followed by the attack phase (rise in subglottal pressure and onset of vocal fold oscillation that is perceived as sound)7. The speed of vocal fold adduction is an important variable in the etiology of some voice disorders8. Dysfunction in the coordination of final phonatory posturing and phonation onset have been connected to impaired vocal efficiency and quality9-10 and may be an indicator of neural dysfunction11-12. Proper glottal adduction and phonatory posturing are critical to achieve the desired pitch, intensity and efficiency of phonation13-14.
The interactions of glottal posturing and aerodynamic energy from the lungs during voice production are highly coordinated events. It is generally thought that the vocal folds start rapid adduction about 50-100 ms before the expiratory airflow reaches them and that the pre-phonatory posture is set before audible voice is generated15-16. Mean phonation onset latency time (POT), defined as the time from onset of glottal adduction to production of the acoustic signal, was reported by De Biase et al. as 203 ms (SD 72ms)17, and Hillel as 309 ms (SD 59 ms)6. Both of these studies used laryngeal EMG to record glottal adduction onset combined with acoustic analysis to calculate POT. The events and postures occurring during this adduction period in modal and pathologic phonation remain of great interest.
The present study evaluates the dynamics of laryngeal posturing at phonation onset during the utterance of a variety of phonation types in normal human subjects. Using ultra-high-speed video endoscopy we examined glottal closure patterns and, using concurrently recorded acoustic signal, determined the timing of glottal closure from onset of adduction to voice production. The results reveal variations in pre-phonatory adjustments depending upon phonation type, contributing new information on laryngeal physiology that may also assist in furthering the evaluation and understanding of speech disorders of vocal onset.
MATERIALS AND METHODS
Subjects and recording procedures
This study was approved by the Institutional Review Board. 27 phonetically knowledgeable untrained subjects with perceptually normal voices were directed to repeatedly utter the vowel /i/ in a variety of onset vocal qualities (modal, breathy, pressed) at comfortable loudness and duration. An expert with a background in linguistics demonstrated each phonation type to the subjects. A high-speed video camera (Phantom v210; Vision Research Inc., Wayne, NJ) recorded laryngeal dynamics during each utterance at 10,000 or 20,000 frames per second with a resolution of 480×360 or 360×240 pixels per frame. Larynx was visualized transorally using a KayPentax 70-degree rigid laryngoscope and a 300-watt halogen light source. The acoustic signal was concurrently recorded using a hi-fidelity microphone at 50 kHz and synchronized with the video recording using the same reference clock for six-second intervals.
Of the 243 phonation samples, 48 were excluded due to poor audio quality. The remaining 195 samples were rated for accuracy of phonatory type by two otolaryngologists, and 65 were eliminated because both raters did not agree that the targeted phonation type was achieved. This rating was done blind to the intent of the subject by de-identifying samples. Inter-rater reliability scores were calculated. The video quality of the remaining 130 samples was reviewed, and a majority were excluded due to (1) excessive anterior-posterior motion of the larynx during phonation precluding accurate assessment of medial vocal fold movement by DVK, (2) phonation not starting with a fully abducted vocal fold, or (3) inability to see the entire movement of the vocal processes from onset of adduction to phonation. Subject breakdown of the final 33 samples included for this study were as follows: breathy (N=13), from 10 unique subjects; modal (N=9), 8 unique subjects; and pressed (N=11), 9 unique subjects.
As the level of “breathiness” can vary, the breathy samples were further rated by two otolaryngologists and a speech pathologist using the rank and sort method18 at a single sitting on a breathiness scale: 0 = not breathy (N=3); 1 = mildly breathy (N=4); 2 = moderately breathy (N=7); and 3 = severely breathy (N=2). There was 88% exact concordance among the raters between the ratings of the breathy samples. In two instances the discrepancies were within 1 point on the scale and involved ratings of mildly breathy versus moderately breathy. Ratings for these two were determined by forced consensus among the raters.
We also analyzed dynamics of 24 samples of /i/ phonation that followed a sniffing maneuver because this represents another type of posturing for phonation onset. For this task, the subjects were instructed to inhale sharply through the nose and then phonate /i/ while laryngeal posturing was visualized and recorded with a transoral laryngoscope19.
Measurements of Phonatory Posture Dynamics
Frame by frame analysis of video was performed using the Phantom Camera Control Application software (PCC 1.3; Vision Research Inc., Wayne, NJ) and by making a digital videokymogram (DVK) with the kymogram line at the level of the vocal processes. Use of the DVK to follow movement of the vocal fold medial edge facilitated easy temporal localization of the video frame at the onset and offset of adduction as well as overall vocal fold closure pattern (Figure 1-3). The video frame number at the onset of glottal adduction at the vocal process (first hint of glottal closure), end of glottal adduction (final phonatory posture), and phonation onset (vocal fold vibration with acoustic output) were recorded. Observations were made for presence or absence of supraglottal hyperactivity (SGH), mid-membranous glottal gap (MMG) and posterior glottal gap (PGG). Vocal fold closure patterns were noted as follows: accelerations (transition to an increased slope of vocal fold closure as noted in DVK), decelerations (transition to a decreased slope of closure as noted in the DVK), and hesitations (plateaus or reversals of the slope of closure as noted in the DVK).
Figure 1. Phonation Kymograms and Final Phonatory Posture: Modal.
Digital videokymogram of closure pattern (A) and final phonatory posture (B) for modal phonation. Concurrent acoustic signal is overlaid (black line) with the kymogram. FPT = final phonatory posture time; PPT= phonatory posture time; and POT= phonation onset time measurements are marked with brackets. DKG = digital videokymography line
Figure 3. Phonation Kymograms and Final Phonatory Posture: Pressed.
Digital videokymogram of closure pattern (A) and final phonatory posture (B) for pressed phonation. Concurrent acoustic signal is overlaid (black line) with the kymogram. FPT = final phonatory posture time; PPT= phonatory posture time; and POT= phonation onset time measurements are marked with brackets. In this phonation type FPT is bracketed distal to area of complete glottal closure as there was continued glottal posturing activity beyond closure. DKG = digital videokymography line
The timing of the following phonatory postures were measured: Final Phonatory Posture Time (FPT) = the time to reach final phonatory posture from onset of adduction; Phonatory Posture Time (PPT) = time from FPT to phonation onset; and Phonation Onset Time (POT) = time from onset of adduction to phonation onset. FPT was determined as the moment at which the laryngeal configuration used in phonation was first achieved, which was typically the end of glottal medial movement for breathy phonation, or vocal process contact for the other phonatory types. Posturing mean duration and variances were compared with a Student's t-test between each group, with p <0.05 considered significant.
Acoustic Analysis
The fundamental frequency (F0) for each phonatory sample was assessed using Praat software20. The accuracy was also manually confirmed using Sound Forge (Sonic Foundry Sound Forge Version 6.0, Sonic Foundry, Inc., Madison WI) to measure the average F0 over four glottal cycles. From phonation onset, F0 varied for a short period until it stabilized (determined as 8 continuous glottic cycles with calculated F0 within 10%). Thus, the time to F0 stabilization (frequency stabilization time, FST) was also noted. Comparisons of means and variances were made with a Student's t-test or Pearson's correlation calculation, significance at p<0.05.
RESULTS
Phonatory type rating and targeting
Overall inter-rater reliability for the blinded phonatory rating was 82% (159/195) and overall concordance of both raters with the attempted target phonation was 67% (130/195). Dual concordance (both raters agreed) for breathy voice was 74% (50/68), for modal voice 79% (53/67), and for pressed voice 45% (27/60). As mentioned in the methods section, the final 33 samples analyzed for this study required dual concordance for targeted phonation.
Posture characteristics at phonation onset
Representative DVKs of phonation onset for modal, breathy and pressed are shown in Figure 1A through 3A. Final phonatory posture images, shown in Figure 1B through 3B, represent the laryngeal configuration used just prior to phonation (i.e. pre-phonatory posture) for each phonation type. The observed incidences of closure patterns and final phonatory posture characteristics are illustrated in Figure 4. In modal phonation, 1/9 (11%) samples demonstrated more than one inflection point (change in closure speed) during adduction, 4/9 (44%) demonstrated change to a slower rate of closure immediately preceding final phonatory posture, 2/9 (22%) PGG, 4/9 (44%) MMG, and 1/9 (11%) SGH. In breathy phonation, 4/13 (31%) demonstrated more than one inflection point, 13/13 (100%) demonstrated slowed rate of closure immediately preceding final phonatory posture, 13/13 (100%) PGG, 13/13 (100%) MMG, and 0/13 (0%) SGH. In pressed phonation, no samples demonstrated inflection points, a change to a slower rate of closure, or PGG. However, MMG was present in 3/11 (28%) and SGH in 7/11 (64%). In sniff-/i/ phonation, no samples demonstrated more than one inflection point, and 3/24 (12.5%) demonstrated a change to a slower rate of closure immediately preceding final posture.
Figure 4. Closure Patterns and Phonatory Posture by Phonation Type.
Pre-phonatory closure (A) and final posture (B) for the various phonation types. Slow closure is defined as presence of flatter closing slope following a steeper closing slope on kymography just prior to final phonatory posture. Inflection points are transitions between closing phenotypes. 2nd inflection points, also called hesitations, were most common in breathy phonation. PGG = posterior glottic gap at final phonatory posture. MMG = mid membranous gap at final phonatory posture. SGH = supraglottic hyperactivity at final phonatory posture.
Posturing Dynamics
Posturing and phonation onset times by phonation types are shown in figure 5. Average FPT, PPT, and POT were as follows: 278, 98, and 376 ms for sniff-/i/, 303, 106, and 409 ms for modal; 430, 104, and 534 ms for breathy; and 483, 213, and 696 ms for pressed. Detailed descriptive statistics and comparisons are provided in Table 1.
Figure 5. Adduction and Phonation Times by phonation type.
Mean FPT, PPT and POT for each phonation type. Error bars show 95% confidence interval for POT on two tailed student t test. (ms = milliseconds) where the margin error for sniff, modal, breathy and pressed POT are 56, 66, 127 and 175 ms, respectively. FPT = final phonatory posture time; PPT= phonatory posture time; and POT= phonation onset time
Table 1.
Descriptive Statistics
| p-values of mean difference between groups | ||||
|---|---|---|---|---|
| POT | Mean (ms) | Sniff | Modal | Breathy |
| Sniff | 375.8 | - | ||
| Modal | 409.1 | 0.26 | - | |
| Breathy | 534.2 | <.01 | 0.07 | - |
| Pressed | 696.5 | <.001 | <.01 | 0.07 |
| PPT | Mean (ms) | Sniff | Modal | Breathy |
|---|---|---|---|---|
| Sniff | 97.8 | - | ||
| Modal | 105.7 | 0.43 | - | |
| Breathy | 104.0 | 0.44 | 0.49 | - |
| Pressed | 213.3 | <0.05 | 0.06 | 0.05 |
| FPT | Mean (ms) | Sniff | Modal | Breathy |
|---|---|---|---|---|
| Sniff | 278.0 | - | ||
| Modal | 303.4 | 0.29 | - | |
| Breathy | 430.2 | <0.05 | 0.06 | - |
| Pressed | 483.3 | <0.05 | <0.05 | 0.31 |
| FST | Mean (ms) | Sniff | Modal | Breathy |
|---|---|---|---|---|
| Sniff | 54.8 | - | ||
| Modal | 91.8 | <0.05 | - | |
| Breathy | 183.1 | <0.0001 | <0.01 | - |
| Pressed | 58.6 | 0.34 | 0.06 | <0.0001 |
Descriptive statistics of phonation and frequency stabilization times by student t test comparison. Bolded values represent p ≤.05 when comparing the means between groups. FPT = final phonatory posture time; PPT= phonatory posture time; and POT=phonation onset time; FST = Frequency stabilization times
Acoustic Features
Frequency stabilization times (FST) were 55 ms for the sniff-/i/ phonation, 59ms for pressed, 92 ms for modal, and 183 ms for breathy. FST was significantly longer for breathy than for sniff (p<0.000001), pressed (p<0.0001), or modal (p =.035). No significant differences in FST were noted between sniff and pressed (p=0.34). A Higher breathy index rating was positively correlated with FST (Figure 6).
Figure 6. Frequency Stabiliation Times by phonation type and degree of breathiness.
Frequency stabilization times (FST) by phonation type. A trend of increasing frequency stabilization times from pressed to modal to breathy was noted (A). FST was significantly correlated with breathy index (Pearson correlation r = 0.77, df = 16, p =<0.0005) (B).
DISCUSSION
Laryngeal posturing to set up the glottal stiffness and shape of the glottal channel is a critical event in voice production and affects phonation type. In our study the dynamics of this event were evaluated using concurrent high-speed video and acoustic recording of the larynx from 27 human subjects. This study provides a detailed high-resolution analysis of glottal posturing at phonation onset that has not been reported to date using high-speed photography and the largest number of subjects. The phonation types assessed, breathy, modal (normal), and pressed (hard), are common types of vocal onset described by voice clinicians, voice teachers, and linguists21-23 and incorporated into vocal quality assessment tools24-25. The differences between these phonation types have been attributed to velocity and duration of glottal adduction13.
Sniff-/i/ can possibly be used as a reference for smooth voluntary adduction and phonation onset. Faster closure and phonation onset times were noted with fewer hesitations. Having just performed a sniff maneuver and “spring-loaded” the glottis and pulmonary system, targeting /i/ phonation is straightforward and quick. Modal voice also appears to be achieved in a similarly fluid manner and the use of modal voice in most laryngeal electromyography studies appears justified. The findings in this study on modal voice are consistent with LEMG findings on calculated POT by Hillel and others6-7,26.
Longer phonation onset latencies were found for breathy and pressed phonations. Hillel similarly found longer latencies in dysphonia that could be categorized as pathologically pressed or breathy: abductor spasmodic dysphonia subjects averaged 500ms and adductor spasmodic dysphonia subjects averaged 530ms. He concluded that latencies over 400ms were abnormal6. From a clinical perspective, breathy voice is a very common symptom in a variety of voice disorders, including Parkinson's Disease8, 27-29 and speech language pathologists have successfully used techniques to encourage a more forceful closure30.
In our study, breathy voice was always associated with a glottal gap. The closure pattern was characterized by a transition from fast-to-slow glottal closure speed immediately preceding final phonatory posture. In addition, increased rate of hesitations but relatively normal PPT were found. This implies a more cognitively complex task to accurately target the laryngeal posture for a breathy voice quality. It is also possible that the deceleration of glottal closure and maintenance of the glottal gap in breathy voice involves the PCA, as EMG studies demonstrate marked PCA activity in production of voiceless sounds and connected speech that included voiceless sounds6,31. If the PCA is involved in regulating the glottic aperture for breathy phonation, then it seems logical that a “tug-o-war” between PCA and LCA to control the posterior glottic gap would lead to hesitations and slowing in obtaining the final phonatory posture.
Pressed voice is described by increased speed of vocal fold adduction, leading to increased vocal fold impact stress32-34 and high intraglottal contact pressures35, leading to phonotrauma36-37. In addition, supraglottic hyperactivity is associated with pressed voice13. Our results do not support higher speed of closure for pressed voice, as FPT was longest in this category. However, intraglottal contact pressures and glottal resistance are likely increased, as it took significantly longer to phonation onset after glottal contact. We also observed supraglottic hyperactivity in 64% pressed voice, compared to 11% for modal and 0% for breathy. These movements are common in hyperfunctional voices, but are also seen in normal subjects after glottal stops38-40. While this activity alone cannot be considered a precursor to developing vocal fold nodules, there is a higher incidence of supraglottic activity in patients with vocal fold nodules38-40. While it took longer to reach the final phonatory posture in pressed phonation than the other phonatory types, the most significant contribution to the longer phonation onset time was the longer time from final posture to acoustic output. This is likely due to more time needed to achieve the increase in phonation onset pressure required for this phonation type. Interestingly, pressed phonations were the least successfully targeted (40%) based on perceptual ratings.
The frequency stabilization time (FST) after phonation onset was notably longer for breathy phonation, followed by modal, and shortest for pressed phonation and sniff-/i/. The onset fundamental frequency and the stable fundamental frequency demonstrated no notable correlations or trends. FST differs from vocal rise time (the time interval from phonation onset to steady sound intensity)41 as the endpoint is not stable amplitude, but stable frequency. Vocal rise times for breathy and pressed voices are reportedly about 150 and 30 ms respectively41 whereas our frequency stabilization times for breathy and pressed are 183 and 59 ms respectively. The control of F0 is thought to be primarily regulated by ILMs and secondarily by the subglottic pressure14,42-44. Interestingly, post-phonatory pitch shift latencies are reported to be approximately 130-150ms45-46, and this is within the breathy phonation frequency stabilization times found in our study, signifying that breathy phonation may utilize a sensory feedback loop.
Cooke13 and Munhall47 demonstrated, using video (30 fps) and ultrasound, respectively, that there is a difference in vocal fold kinematics between pressed, normal and breathy onsets. They also noted that the pressed (hard) onset was difficult for their subjects to learn, and this may, in part, explain the increased rate of inflection points and the longer PPT that we describe. However, Cooke et al. could not demonstrate a significant difference in total adduction times between gesture types48, due to their inability to assess the vocal adduction onset and offset. In this study we were able to analyze phonatory posturing and phonation onset time at high resolution, and differentiate between times to achieve final phonatory posture (FPT) versus time spent in the phonatory posture (PPT) prior to phonation onset, revealing some interesting details. For example, our analysis reveals similar PPT durations between modal and breathy phonation, and suggests that breathy phonations may be phenotypically closer to modal phenotype in coordination of respiratory effort than previously believed13.
This study has several limitations. The total number of samples included for final analysis was limited by the need to visualize the vocal folds from onset of glottal adduction until acoustic output, and for the subjects to accurately target the phonation type. Therefore, the various phonation types (breathy, modal, and pressed) came from unique subjects. However, we do not see this as a potential source of bias in the interpretation of results. In addition, we did not record concurrent electromyographic activities from laryngeal muscles or measure the subglottal pressure. Access to that data would have further illuminate the neuromuscular and aerodynamic interactions underlying the reported findings. Nevertheless, these findings further the understanding of phonation onset dynamics, and may help future evaluations of phonation onset abnormalities in laryngeal pathologies.
CONCLUSIONS
Via the complex interactions of the intrinsic muscles of the larynx, vocal folds efficiently adduct for airway protection, deglutition, cough and phonation. Many disease states can alter the dynamics of vocal fold adduction, leading to dysphonia and dysphagia. This study provides the first high-resolution assessment of phonatory types and characteristic laryngeal dynamics. The patterns and differences seen here, as normal subjects attempt to create breathy or pressed voices, may help explain the pathophysiology of neurocognitive disorders of the voice. Future high resolution evaluation of laryngeal posturing in patients suffering from neurodegenerative voice disorders may reveal key targets for therapeutic intervention.
Figure 2. Phonation Kymograms and Final Phonatory Posture: Breathy.
Digital videokymogram of closure pattern (A) and final phonatory posture (B) for breathy phonation. Concurrent acoustic signal is overlaid (black line) with the kymogram.* = single inflection point marking the transition from a fast closure to a slow closure speed. FPT = final phonatory posture time; PPT= phonatory posture time; and POT= phonation onset time measurements are marked with brackets. DKG = digital videokymography line
ACKNOWLEDGMENTS
The authors would like to thank Juergen Neubauer, Ph.D., for technical assistance with high-speed video recording.
This study was supported by Grant No. RO1 DC011300 from the National Institutes of Health
Footnotes
Conflict of Interest: None
Financial Disclosure:
The authors have no other financial disclosures to make.
This article was presented as a poster presentation at the 136th Annual Meeting of the American Laryngological Association (ALA), April 22-23, 2015, Boston, Massachusetts, USA.
REFERENCES
- 1.Smith ME, Ramig LO, Dromey C, et al. Intensive voice treatment in Parkinson disease: laryngostroboscopic findings. J Voice. 1995;9:453–9. doi: 10.1016/s0892-1997(05)80210-3. [DOI] [PubMed] [Google Scholar]
- 2.Baylor CR, Yorkston KM, Eadie TL, et al. The psychosocial consequences of BOTOX injections for spasmodic dysphonia: a qualitative study of patients’ experiences. J. Voice. 2007;21:231–247. doi: 10.1016/j.jvoice.2006.01.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Paniello RC, Barlow J, Serna JS. Longitudinal follow-up of adductor spasmodic dysphonia patients after Botulinum injection: quality of life results. Laryngoscope. 2008;118:564–568. doi: 10.1097/MLG.0b013e31815e8be0. [DOI] [PubMed] [Google Scholar]
- 4.Morrison MD, Gore-Hickman P. Voice Disorders in the elderly. J Otolaryngol. 1985;15:231–234. [PubMed] [Google Scholar]
- 5.Mehanna R, Jankovic J. Respiratory problems in neurologic movement disorders. Parkinsonism Relat Disord. 2010;16:628–638. doi: 10.1016/j.parkreldis.2010.07.004. [DOI] [PubMed] [Google Scholar]
- 6.Hillel AD. The study of laryngeal muscle activity in normal human subjects and in patients with laryngeal dystonia using multiple fine-wire electromyography. Laryngoscope. 2001;111(4 pt 2 suppl 97):1–47. doi: 10.1097/00005537-200104001-00001. [DOI] [PubMed] [Google Scholar]
- 7.Faaborg-Andersen K. Current Problems in Phoniatrics and Logopedics. Vol. 3. Karger; Basel, Switzerland: 1965. Electromyography of laryngeal muscles in humans: Technics and results. [PubMed] [Google Scholar]
- 8.Boone DR. The voice and voice therapy. 2nd ed. Prentice-Hall, Inc.; Englewood Cliffs, New Jersey: pp. 2–67. [Google Scholar]
- 9.Koike Y, Hirano M, von Leden H. Vocal initiation: acoustic and aerodynamic investigations of normal subjects. Folia Phoniatr. 1967;19:173–182. doi: 10.1159/000263142. [DOI] [PubMed] [Google Scholar]
- 10.Steinhauer K, Grayhack JP, Smiley-Oyen AL, et al. The relationship among voice onset, voice quality and fundamental frequency: a dynamical perspective. J Voice. 2004;18:432–442. doi: 10.1016/j.jvoice.2004.01.006. [DOI] [PubMed] [Google Scholar]
- 11.Stepp CE, Hillman RE, Heaton JT. A virtual trajectory model predicts differences in vocal fold kinematics in individuals with vocal hyperfunction. J Acoust Soc Am. 2010;127(5):3166–3176. doi: 10.1121/1.3365257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Andrade DF, Heuer R, Hockstein NE, et al. The frequency of hard glottal attacks in patients with muscle tension dysphonia, unilateral benign masses, and bilateral benign masses. J Voice. 2000;14:240–246. doi: 10.1016/s0892-1997(00)80032-6. [DOI] [PubMed] [Google Scholar]
- 13.Cooke A, Ludlow CL, Hallett N, Selbie WS. Characteristics of vocal fold adduction related to voice onset. J Voice. 1997;11:12–22. doi: 10.1016/s0892-1997(97)80019-7. [DOI] [PubMed] [Google Scholar]
- 14.Titze IR. The physics of small-amplitude oscillation of the vocal folds. J Acoust Soc Am. 1988;83:1536–1552. doi: 10.1121/1.395910. [DOI] [PubMed] [Google Scholar]
- 15.Hunger EJ, Titze IR, Alipour F. A three-dimensional model of vocal fold abduction/adduction. J Acoust Soc Am. 2004;115(4):1747–1759. doi: 10.1121/1.1652033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Wyke BD. Laryngeal neuromuscular control systems in singing: a review of current concepts. Folia Phoniatrica. 1974;26:295–306. doi: 10.1159/000263791. [DOI] [PubMed] [Google Scholar]
- 17.De Biase NG, Korn GP, Lorenzon P, et al. Dysphonia severity degree and phonation onset latency in laryngeal adductor dystonia. J Voice. 2010;24(4):406–9. doi: 10.1016/j.jvoice.2008.10.012. [DOI] [PubMed] [Google Scholar]
- 18.Zhang Z, Kreiman J, Gerrat BR, et al. Acoustic and perceptual effects of change in body layer stiffness in symmetric and asymmetric vocal fold models. J Acoust Soc Am. 2013;133(1):453–62. doi: 10.1121/1.4770235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Chhetri DK, Neubauer J, Sofer E. Posterior Cricoarytenoid Muscle Dynamics in Canines and Humans. Laryngoscope. 2014;124:2363–67. doi: 10.1002/lary.24742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Boersma P, Weenink D. [2014];Praat: doing phonetics by computer. 2013 http://www.praat.org.
- 21.Orlikoff RF, Deliyski DD, Baken RJ, et al. Validation of a glottographic measure of vocal attack. Journal of Voice. 2009;23(2):164–168. doi: 10.1016/j.jvoice.2007.08.004. [DOI] [PubMed] [Google Scholar]
- 22.Werner-Kukuk E, von Leden H. Vocal initiation: high speed cinematographic studies on normal subjects. Folia Phoniatr. 1970;22:107–116. [PubMed] [Google Scholar]
- 23.Orlikoff RF, Kahane JC. Laryngeal structure and function. In: Lass NJ, editor. Principles of Experiemtnal Phonetics. Mosby; St Louis, MO: 1996. pp. 112–181. [Google Scholar]
- 24.Giovani A, Revis J, Triglia JM. Objective aerodynamic and acoustic measurement of voice improvement after phonosurgery. Laryngoscope. 1999;109:656–660. doi: 10.1097/00005537-199904000-00026. [DOI] [PubMed] [Google Scholar]
- 25.Petrovic-Lazic M, Javonovic N, Kulic M, et al. Acoustic and Perceptual Characteristics of the voice in patients with vocal polyps after surgery and voice therapy. J Voice. 2015;29(2):241–6. doi: 10.1016/j.jvoice.2014.07.009. [DOI] [PubMed] [Google Scholar]
- 26.Thumfart WF. From larynx to vocal ability. Acta Otolaryngol. 1988;105:425–431. doi: 10.3109/00016488809119496. [DOI] [PubMed] [Google Scholar]
- 27.De Krom G. Some spectral correlates of pathologic breathy and rough voice quality for different types of vowel fragments. J Speech Hear Res. 1995;38:794–811. doi: 10.1044/jshr.3804.794. [DOI] [PubMed] [Google Scholar]
- 28.Aronson AE. Early motor unit disease masquerading as psychogenic breathy dysphonia: a clinical case presentation. J Speech Hear Disord. 1971;36:115–124. doi: 10.1044/jshd.3601.115. [DOI] [PubMed] [Google Scholar]
- 29.Wolfe VI, Martin DP, Palmer CI. Perception of dysphonic voice quality by naïve listeners. J Speech Lang Hear Res. 2000;43:697–705. doi: 10.1044/jslhr.4303.697. [DOI] [PubMed] [Google Scholar]
- 30.Roussel NC, Lobdell M. The clinical utility of the soft phonation index. Clinical Linguistics and Phonetics. 2006;20(2):181–186. doi: 10.1080/02699200400026942. [DOI] [PubMed] [Google Scholar]
- 31.Hirose H. Posterior cricoarytenoid as a speech muscle. Ann Otol Rhinol Laryngol. 1976;85(3 pt 1):335–342. [PubMed] [Google Scholar]
- 32.Alku P, Vilkman E. A comparison of glottal voice source quantification parameters in breathy, normal, and pressed phonation of female and male speakers. Folia Phoniatr Logop. 1996;48:240–254. doi: 10.1159/000266415. [DOI] [PubMed] [Google Scholar]
- 33.Davis PJ, Bartlett D, Luschei ES. Vocal Fold Physiology: Frontiers in Basic Science. Singular Publishing Group; San Diego, CA: 1993. Coordination of the respiratory and laryngeal systems in breathing and vocalization. pp. 189–226. [Google Scholar]
- 34.Stevens KN. Vocal Fold Physiology. University of Tokyo Press; Tokyo: 1981. Vibration modes in relation to model parameters. pp. 291–304. [Google Scholar]
- 35.Hess MM, Verdolini K, Bierhals W, et al. Endolaryngeal contact pressures. J Voice. 1998;12:50–67. doi: 10.1016/s0892-1997(98)80075-1. [DOI] [PubMed] [Google Scholar]
- 36.Berry DA, Verdolini K, Montequin DW, et al. A quantitative output-cost ratio in voice production. J Speech Lang Hear Res. 2001;44:29–37. doi: 10.1044/1092-4388(2001/003). [DOI] [PubMed] [Google Scholar]
- 37.Jiang JJ, Titze IR. Measurement of vocal fold intraglottal pressure and impact stress. J Voice. 1994;8:132–144. doi: 10.1016/s0892-1997(05)80305-4. [DOI] [PubMed] [Google Scholar]
- 38.Stager SV, Bielamovicz S, Gupta A, et al. Supraglottic activity: evidence of vocal hyperfunction or laryngeal articulation? J Speech Lang Hear Res. 2000;43:229–38. doi: 10.1044/jslhr.4301.229. [DOI] [PubMed] [Google Scholar]
- 39.Stager SV, Bielamovicz S, Regnell JR, et al. Quantification of static and dynamic supraglottic activity. J Speech Lang Hear Res. 2001;44(6):1245–56. doi: 10.1044/1092-4388(2001/097). [DOI] [PubMed] [Google Scholar]
- 40.Behrman A, Dahl LD, Abramson AL, Schutte HK. Anterior-posterior and medial compression of the supraglottis: signs of nonorganic dysphonia or normal postures? J Voice. 2003;17(3):403–10. doi: 10.1067/s0892-1997(03)00018-3. [DOI] [PubMed] [Google Scholar]
- 41.Koiko Y. Experimental Studies on vocal attack. Practica Otologica Kyoto. 1967;60:663–688. [Google Scholar]
- 42.Ohala JJ. Aspects of the control and production of speech. UCLA Working papers in phonetics. 1970 No. 15. [Google Scholar]
- 43.Titze IR. Principles of Voice Production. Prentice Hall, New Jersey: 1994. pp. 1–354. [Google Scholar]
- 44.Chhetri DK, Neubauer J, Berry DA. Neuromusscular control of fundamental frequency and glottal posture at phonation onset. J Acoust. Soc. Am. 2012;131(2):1401–1412. doi: 10.1121/1.3672686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Bauer JJ, Mittal J, Larson CR, Hain TC. Vocal responses to unanticipated perturbations in voice loudness feedback: an automatic mechanism for stabilizing voice amplitude. J Acoust Soc Am. 2006;119(4):2363–71. doi: 10.1121/1.2173513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Heinks-Maldonado TH, Houde JF. Compensatory responses to brief perturbations of speech amplitude. ARLO. 2005;6:131–137. [Google Scholar]
- 47.Munhall H, Ostry D. Vocal fold physiology: biomechanics, acoustics and phonatory control. Denver Center for the Performing Arts; Denver: 1983. Ultrasonic measurement of laryngeal kinematics. pp. 145–162. [Google Scholar]
- 48.Hunger EJ, Titze IR, Alipour F. A three-dimensional model of vocal fold abduction/adduction. J Acoust Soc Am. 2004;115(4):1747–1759. doi: 10.1121/1.1652033. [DOI] [PMC free article] [PubMed] [Google Scholar]








