Abstract
Many cases of disturbed voice signals can be attributed to incomplete glottal closure, vocal fold oscillation asymmetries, and aperiodicity. Often these phenomena occur simultaneously and interact with each other, making a systematic, isolated investigation challenging. Therefore, ex vivo porcine experiments were performed which enable direct control of glottal configurations. Different pre-phonatory glottal gap sizes, adduction levels, and flow rates were adjusted. The resulting glottal closure types were identified in a post-processing step. Finally, the acoustic quality, aerodynamic parameters, and the characteristics of vocal fold oscillation were analyzed in reference to the glottal closure types. Results show that complete glottal closure stabilizes the phonation process indicated through a reduced left-right phase asymmetry, increased amplitude and time periodicity, and an increase in the acoustic quality. Although asymmetry and periodicity parameter variation covers only a small range of absolute values, these small variations have a remarkable influence on the acoustic quality. Due to the fact that these parameters cannot be influenced directly, the authors suggest that the (surgical) reduction of the glottal gap seems to be a promising method to stabilize the phonatory process, which has to be confirmed in future studies.
I. INTRODUCTION
Voice is an important medium of human communication. Impairment of voice affects not only one's quality of life, but has also a massive impact on the economy, since more and more employees work in the service sector. At-risk individuals span many types of professions, e.g., teachers, telemarketers, vocalists, etc.1 Hence, an effective diagnosis and treatment of voice disorders is of primary importance, which must be based on a fundamental understanding of voice production.
The primary signal of voice is produced in the larynx by a self-sustained oscillation of the vocal folds (e.g., phonation), which is driven by a constant airstream from the lungs. Subsequently, the primary signal is modulated by the vocal tract. Examination of the phonatory process is executed using in vivo experiments and by utilization of ex vivo, numeric, and synthetic larynx models.
Glottal closure (GC) plays an essential role in the phonatory process. In most cases, a complete closure of the vocal folds is desirable. Incomplete closure results in an increased broadband noise which is thought to be one of the main factors causing hoarseness.2–5 Therefore, several techniques are applied for reducing the vocal fold gap and thereby improving voice quality, e.g., material injection and implantation or arytenoid adduction.6–9 On the other hand, several publications contradict these findings by implying that a vocal fold gap does not necessarily lead to dysphonic or pathological voice production. Glottal gaps in the posterior glottis can be found in vocally healthy subjects, primarily in women3,10–12 and children.10
Besides the closure of the vocal folds, their oscillation properties (e.g., left-right asymmetry and periodicity) are of interest due to their influence on the acoustic signal. Recording technologies like high-speed video endoscopy allows for a detailed investigation of these parameters.13 Healthy voice is considered to be characterized by almost complete left-right symmetry and periodic vocal fold oscillations.14 However, the reported effects of such conditions on the acoustic signal are inconsistent.
Regarding asymmetric vocal fold vibration, some in vivo investigations suggest that this property is related to a rough voice signal.3,15,16 On others, it is reported that asymmetries were also observed in healthy patients.17–19 Experiments with synthetic vocal fold models support the assumption that an asymmetric oscillation does not necessarily lead to a reduction in voice quality.20 Numerical investigations report only little influence of asymmetry on objective acoustic measures.21,22 Furthermore, ex vivo investigations suggest that glottal gap exerts more influence on jitter and shimmer than asymmetry.4
Due to the fact that quantitative analysis of high-speed video endoscopy is still not widely distributed in the clinical environment,23 extensive investigations of periodicity in phonation and the influence on acoustic are rare. Mehta et al.24 reported a positive correlation between periodicity in vocal fold vibration and cepstral peak magnitude displaying the quality of the acoustic signal. In contrast, Biever and Bless25 found aperiodicity in 30% of 20 healthy women. These diverse findings of the influence of phonation characteristics on the acoustic quality require further systematic investigations.
Ex vivo experiments offer the possibility of adjusting glottal settings through a systematic variation of parameters, such as the pre-phonatory glottal gap or glottal flow. When this is done, the measurement of fundamental data like subglottal pressure, high-speed video, and the acoustic signal provides a basis for a systematic investigation of relationships between the phonation characteristics and the acoustic outcome. Therefore, this study has the following goals:
Goal 1: Investigate the influence of different GC types on the phonatory process by inducing various pre-phonatory glottal gap sizes and vocal fold adduction levels in an ex vivo porcine larynx model. Objective measures displaying the acoustic quality [Cepstral Peak Prominence (CPP)], vocal fold oscillation characteristics [left-right phase asymmetry (PA), amplitude, and time aperiodicity (AP, TP)], and aerodynamic properties (glottal flow resistance) are analyzed to gain quantitative results.
Goal 2: Examine the impact of the different vocal fold oscillation characteristics and aerodynamic properties, resulting from different GC types, on the acoustic quality.
The aim of this study is to systematically investigate the different factors which may influence the phonatory process and/or the acoustic output through utilization of ex vivo experiments. Therefore, it is hoped that such information about the phonatory process will help improve the diagnosis and treatment of pathological voices.
II. METHODS
A. Experimental setup and measurement procedure
Nine ex vivo porcine larynges were obtained from the local slaughter house. This species has proven to be an adequate model for comparison with human phonation.26,27 Ex vivo studies were utilized, despite the fact that the ventricular folds are active oscillators in normal phonation compared to human phonation.26 After their dissection, the larynges were quick frozen with 2-methylbutan (−150 °C) and stored at −80 °C in order to preserve the tissue properties.28 The larynx was slowly thawed in a refrigerator and soaked in a NaCl solution 15 min before the experiment. The supra-glottal structures were removed to the level of the ventricular folds to gain an optimal view of their movement during phonation. Subsequently, the larynx was mounted on an artificial tracheal tube of stainless steel with a diameter of 20 mm.
Figure 1 shows the experimental setup including the larynx prepared for the experiment, the electro-mechanic devices for cartilage posturing, and the measurement equipment. The experiment was conducted using the setup described in Ref. 29 and is summarized below.
FIG. 1.
(a) Experimental setup for inducing different vocal fold adduction levels, and (b) induced pre-phonatory glottal gap.
Vocal fold elongation was achieved using a 50 g weight fixed to the thyroid cartilage by a surgical suture according to ex vivo investigations with porcine larynges.27 Two electro-mechanic devices were used to adjust variable vocal fold adduction levels by rotating the arytenoid cartilages simulating lateral cricoarytenoid muscle contraction. The induced torque T was measured by a TD70 (ME Meßsysteme GmbH, Hennigsdorf, Germany) sensor quantifying the adduction level. Several metal shims (from 0 up to 2, each 1 mm thick) were inserted between the two arytenoid cartilages to induce different pre-phonatory glottal gap sizes, see Fig. 1(b).
Mass flow Q was adjusted using a 4000B (MKS, Andover, MA) digital power supply driving a 1579A/B (MKS) mass flow controller. The air was physiologically heated and humidified by an Ultrasonat 810 (Hico, Hirtz & Co. KG, Köln, Germany) ultrasound nebulizer preventing tissue dehydration. The time resolved subglottal pressure signal was captured by an XCS-93‐5PSISG (Kulite Semiconductor Products, Inc., Leonia, NJ) pressure sensor driven by a PXIe-4330 (National Instruments, Austin, TX) bridge module and was flush-mounted to the internal wall of the artificial trachea. The acoustic signal was captured by a 4189 (Brüel&Kjaer, 2850 Nærum, Denmark) ½-in free-field microphone mounted with an inclination of 45° and a distance of 30 cm toward the glottis. The acoustic signal was amplified by a Nexus 2690 microphone conditioning amplifier (Brüel&Kjaer) and captured by a 4492 (National Instruments) dynamic signal acquisition module. Subglottal and acoustic pressure signals were recorded with a sampling rate of 96 kHz and duration of 2 s. The ventricular and vocal fold motion was captured by a Phantom V2511 (Vision Research, Wayne, NJ) high-speed camera with a frame rate of 4000 fps, a spatial resolution of 768 px × 768 px, and a duration of 600 ms. For the measurement series, three different pre-phonatory gap sizes (g1 = 0 mm, g2 = 1 mm, g3 = 2 mm) were adjusted in sequence. For each gap, three successive symmetric vocal fold adduction levels were applied (T1 = 5 mNm, T2 = 15 mNm, T3 = 25 mNm). After adjusting the phonation posture, glottal airflow was increased manually until sustained phonation occurred. Afterward, glottal flow was increased stepwise 6 times by 5 standard liters per minute (slm). Nine different glottal configurations (3 gaps × 3 adduction levels = 9), each including 7 flow steps, were applied yielding 63 runs for each ex vivo larynx. In total, 567 runs were recorded, whereas 103 of them did not show proper or even absence of phonation. Due to the massive noise component, these data sets had to be removed from the study, since no fundamental frequency could be identified. Hence, 464 data sets were chosen to be suitable for further processing.
B. Data analysis
For analysis of the aerodynamic properties and the vocal fold oscillation characteristics, the following parameters were calculated.
Glottal flow resistance:
| (1) |
R was computed according to literature,30 with mean glottal flow and mean subglottal pressure .
The high-speed videos were analyzed with our in-house software entitled, “Glottis Analysis Tools.” The analysis involved glottal segmentation and subsequently the computation of the glottal parameters on the basis of 50 sequential oscillation cycles from each high-speed video. This number of cycles has proven to be appropriate based on previous studies.31–33 The following glottal parameters ranging from 0 to 1 were derived based on the glottal area (GA).
Glottis Gap Index (GGI) is the minimum GA divided by the maximum GA during one cycle:32
| (2) |
with N being the number of oscillation cycles and GAi the glottal area for the ith cycle. GGI values close to 1 indicate small changes in GA during one vibrational cycle whereas values close to 0 indicate large movement of vocal folds. Complete closure of the glottis is expressed by GGI = 0.
Open Quotient (OQ) is the duration of the glottis being open divided by the total cycle duration:34
| (3) |
with being the duration of glottis being open and Ti the total duration of the ith cycle. OQ values of 1 are assumed when the vocal folds do not close completely during one cycle (also indicated by GGI > 0). Hence, the duration of GC can only be quantified by OQ if no gap remains. PA is the time difference between the left and right part of the GA reaching their maximum normalized by the total cycle duration:35
| (4) |
with being the time at which the GA of the left/right side in the ith cycle is a maximum.
PA of 0 indicates total temporal left-right symmetric vocal fold oscillation, whereas the value 1 signifies a phase shift of 180° between the left and right vocal fold oscillation. Amplitude periodicity (AP) is the quotient of the minimal and maximum GA amplitude Ai between two successive cycles35
| (5) |
with
| (6) |
Time Periodicity (TP) is the quotient of the minimal and the maximal cycle duration between two successive cycles35
| (7) |
Periodicity values close to 1 indicate a high cycle-to-cycle periodicity and 0 indicate large changes from one cycle to the next.
The acoustic signal is evaluated by CPP displaying the quality of voice. Besides CPP, further acoustic parameters have been established for the assessment of dysphonic voice, e.g., H1–H2.36 As CPP has proven to be a reliable indicator for breathy voice37 and seems to be sensitive to glottal noise generation, vocal fold oscillation periodicity,24 and asymmetry,38 CPP is used in this study to quantify the influence of glottal parameters on the acoustic signal quality. CPP was calculated based on the definition introduced by Hillenbrand et al.39 with a rectangular window with a length of 213 sample points (corresponding to 42.67 s).
Table I gives an overview of all calculated parameters, their value range, and the meaning of a minimum and maximum parameter value.
TABLE I.
Overview of the calculated parameters, their value range, and meaning.
| Parameter | Range | Meaning |
|---|---|---|
| GC | [0,4] | 1: complete closure, 2: posterior gap |
| 3: partial vocal fold contact, 4: no contact | ||
| R | no range provided | low-high flow resistance |
| GGI | [0,1] | 0: glottis closes entirely |
| 1: glottis does not move | ||
| OQ | [0,1] | 0: glottis does not open |
| 0.5: glottis is open half of the cycle duration | ||
| 1: glottis does not close | ||
| PA | [0,1] | 0: total temporal left-right symmetric vocal fold oscillation |
| 1: phase shift of 180° | ||
| AP | [0,1] | 0: large changes of maximum GA from one cycle to the next |
| 1: complete cycle-to-cycle periodicity | ||
| TP | [0,1] | 0: large changes of period duration from one cycle to the next |
| 1: complete cycle-to-cycle periodicity | ||
| CPP | no range provided | low: low periodicity of the acoustic signal |
| high: high periodicity of the acoustic signal |
According to the formulated goals, the analysis contains two parts. First, the impact of GC on the phonatory process is investigated. Therefore, GC is categorized into four groups by visually evaluating the video data. Second, the effect of aerodynamic and glottal dynamic parameters on the acoustic quality is examined. Therefore, an objective clustering method for partitioning the parameter values into three groups is applied. The detailed procedure is explained below.
1. Partitioning of GC types
For the investigation of GC, the high speed video data were assessed subjectively and partitioned into four types of GC on the basis of Inwald et al.3 The resulting GC groups and the procedure of classification of GC are described below. The number of data sets in each group is given in brackets.
GC1: Complete GC (103).
GC2: Glottal gap in the posterior part (one-third of the vocal fold length) (201).
GC3: Partial vocal fold contact (less than two-thirds of the vocal fold length) (128).
GC4: No vocal fold contact (32).
Figure 2 shows representative images of the four GC groups. GC1 displays vocal fold contact with complete closure of the glottis which is associated with the phonation of healthy men. GC2 shows a posterior gap up to one-third of the vocal fold length and is predominantly observed in healthy women. GC3 shows partial vocal fold contact of less than two-thirds of the vocal fold length and can be located anterior or medial (in this case). It is predominantly observed in cases of dysphonia, but was also occasionally found in healthy women. GC4 displays the case without vocal fold contact. Solely, the vocal fold edges vibrate with small amplitude. It is observed almost exclusively in women and men with dysphonia.3
FIG. 2.

Partitioning of GC types into 4 groups. GC1: complete closure, GC2: glottal gap in the posterior third. GC3: partial vocal fold contact, GC4: no vocal fold contact. Each figure shows the time point in the oscillation cycle at which the glottal gap reaches its minimum.
2. Partitioning of glottal and aerodynamic parameters
Although the classification of GC types is based on a visual evaluation method which can be assessed quickly in daily clinical practice, this procedure is very time consuming for comprehensive studies. Therefore, objective parameters (e.g., GGI, PA, AP) indicating glottal gap size or vocal fold oscillation characteristics should be established. To investigate the influence of these parameters, the computed values must be grouped. Unlike the case of GC partitioning, this procedure cannot be done subjectively. Hence, an objective clustering method, K-means, is used to subdivide the values into three groups. This enables a comparison between a low, medium, and high range of the parameter, although the variance of the values is small compared to the total range of the parameters. The cluster centers and the number of data sets per center are given in Table II.
TABLE II.
Cluster centers (built with K-means) and number of runs per resulting group.
| Cluster center | 1 | 2 | 3 | |||
|---|---|---|---|---|---|---|
| mean | #runs | mean | #runs | mean | #runs | |
| Gap parameters | ||||||
| 0.04 | 276 | 0.20 | 130 | 0.43 | 58 | |
| 0.32 | 41 | 0.56 | 33 | 0.82 | 29 | |
| Symmetry parameters | ||||||
| 0.03 | 272 | 0.09 | 139 | 0.17 | 53 | |
| Periodicity parameters | ||||||
| 0.97 | 359 | 0.83 | 75 | 0.59 | 30 | |
| 0.97 | 301 | 0.90 | 112 | 0.82 | 51 | |
| Aerodynamic parameter | ||||||
| 99.3 | 18 | 51.6 | 82 | 20.8 | 364 | |
Clustering of OQ was executed solely for cases in which . This was the case for complete closure of the vocal folds, i.e., GC1. Furthermore, OQ4 contains all cases of GC2; OQ5 all cases of GC3 and GC4. To arrange the parameter values according to its influence on phonation (mild…severe), we ordered the cluster centers as follows. Cluster centers of GGI, OQ, PA, and R are arranged in ascending order, AP and TP cluster centers in descending order.
3. Statistical analysis
Since the group values were not normally-distributed, the Kruskal-Wallis test was used for multiple group comparison. A significance level of was chosen. The Dunn-Bonferroni correction was applied for post hoc tests (Mann-Whitney-U). The correction factor was selected according to the number of tests n = 3, 6, 10 executed for the individual number of groups [3 (GGI, PA, AP, TP, R), 4 (GC), 5(OQ)]. The corrected significance levels are given in Table III in the Appendix.
The clustering and statistical analysis were performed using IBM SPSS Statistics 21 software (IBM, Armonk, NY). Detailed results are given in the Appendix.
III. RESULTS
A. Aerodynamic parameters
All larynges showed nearly linear dependence between and . Pearson's correlation coefficient was determined for each larynx for all 9 glottal configurations (). ranged from 8–155 slm and from 0.2–4 kPa. Average fundamental frequency was f0 = 126 ± 69 Hz. Regarding the individual GC groups, a frequency jump from to is observable (see Fig. 3). Statistical analysis yielded significant differences except for the comparison of and (for details see Table IV in the Appendix).
FIG. 3.
Fundamental frequency f0 as a function of the four GC groups .
B. Influence of GC
CPP decreases continuously for increasing GC groups (see Fig. 4). From cases with complete closure (GC1) to cases with minimal vocal fold contact (GC3), the CPP value decreases by one-third of its maximum, whereas the drop from GC2 to GC3 (approx. 22%) is more distinct than the decrease from GC1 to GC2 (approx. 10%). From GC3 to GC4, CPP remains largely constant. The Kruskal-Wallis test shows statistically significant differences between all groups, except for (for details see Table IV in the Appendix).
FIG. 4.
CPP as a function of the four GC groups .
GGI increases with increasing GC group (see Fig. 5), whereas from GC1 to GC3, the increase covers 41% of the possibly attainable values. From GC3 to GC4, GGI increases steeper which makes up the greater part of change in value. Group comparison yields statistically significant differences between all GC groups.
FIG. 5.
(Color online) Glottal parameters (GGI, PA, AP, TP) as a function of the four GC groups .
OQ for complete closure (GC1) is 0.54 and increases to 1 for . Comparisons between GC1 and the remaining groups are significant (p = 0.000). All other comparisons yield a significance value of 1.
PA decreases slightly, but is statistically significant from GC1 to GC2 and increases distinctly from GC2 to GC3. PA does not increase significantly from GC3 to GC4. The Kruskal-Wallis test shows significant differences between all groups except for .
Regarding the periodicity parameters (AP, TP), the values for complete closure (GC1) and posterior gap (GC2) are not significantly different. A significant decrease to GC3 is observed for both periodicity parameters. For GC3 to GC4, the values increase slightly, whereas group differences are only significant for TP.
R decreases by approx. 48% from GC1 to GC2 (see Fig. 6) and further from GC2 to GC4, but less distinctly (by approx. 24%). The Kruskal-Wallis test shows significant differences between all groups.
FIG. 6.
Glottal flow resistance R as a function of the four GC groups .
C. Influence of glottal and aerodynamic parameters on acoustic quality
CPP shows a decreasing tendency for increasing OQ values (OQ1–OQ3) (see Fig. 7). The Kruskal-Wallis test does not show significant differences for or . However, for comparison between OQ1 and OQ3, statistical analysis yields p = 0.002. From OQ3 to OQ4, the CPP value is almost constant and statistical testing does not show a significant difference. The decrease of CPP is more distinctive for OQ4 to OQ5 in which OQ has the value 1 and the groups differ in terms of the vocal fold contact area. OQ5 differs from all remaining groups significantly (p = 0.000). For detailed results of group comparison, see Table V in the Appendix.
FIG. 7.
CPP as a function of OQ.
Influences of GGI, PA, AP, TP, and R groups on CPP are summarized in Fig. 8. The individual cluster centers of each parameter group are listed in Table II. All parameters exert a negative influence on CPP and thus on the acoustic quality. For the groups of and , CPP decreases almost linearly. CPP decreases slightly from PA1 to PA2 and drops more rapidly toward PA3. The reduction of CPP is steeper for and than for and .
FIG. 8.
(Color online) CPP as a function of glottal and aerodynamic parameter groups (GGI, PA, AP, TP, and R).
All Kruskal-Wallis tests show significant differences between the groups, except for and . For detailed results of statistical tests, see Table VI in the Appendix.
IV. DISCUSSION
A. Phonatory parameter values
Results show a wider range of values in glottal flow rate ( = 8…155 slm) and subglottal pressure (…4 kPa) than other ex vivo porcine experiments. Alipour and Jaiswal27 reported values of = 36…78 slm and …2.5 kPa using ex vivo porcine larynges at different vocal fold adduction levels. Furthermore, they reported a non-linear relationship between and with some linear portions. Our results show an almost linear behavior with a Pearson correlation coefficient of r = 0.99 ± 0.03. The fundamental frequency ( Hz) lies below the findings of Alipour and Jaiswal ( Hz;27 Hz26).
This discrepancy could be due to the difference in the procedure of vocal fold adduction control. However, control of fundamental frequency is achieved primarily by vocal fold elongation (by cricothyroid muscle contraction) rather than by vocal fold adduction (through lateral cricoarytenoid and inter arytenoid muscle contraction), as reported in literature.40
Unfortunately, Alipour and Jaiswal26,27 do not provide detailed information on the vocal fold elongation technique in their study. Therefore, this discrepancy cannot be further explained.
A significant increase in was found in comparing “healthy phonation” () with “pathologic phonation” () (see Fig. 3). Fundamental frequency is a function of the length, stress, and density of the vocal folds and is also dependent on the subglottal pressure.41,42 These dependencies were reproduced in ex vivo investigations applying the present setup.29 However, GC during phonation is a complex interplay of vocal fold adduction, elongation, pre-phonatory glottal gap, and the subglottal pressure, which has to be investigated in further studies.
Nonetheless, the relationship between GC and fundamental frequency was also reported by Yamauchi et al.43 who investigated healthy subjects and patients with vocal fold paralysis (VFP). The authors reported more frequent incomplete closure combined with an increase in fundamental frequency in cases with VFP. Furthermore, they reported an increased oscillatory frequency of the atrophied vocal fold compared to the normal vocal fold, which could be an explanation for the computation of increased f0.
Therefore, it is supposed that fundamental frequency can serve as an additional indicator for distinguishing between healthy and pathologic voice. This was reported by Wolfe et al.44 who found an improved correlation between jitter and the degree of dysphonia by considering the mean fundamental frequency in a multiple regression analysis.
However, a detailed investigation of the correlation between fundamental frequency and the left-right separated oscillation frequencies of the vocal folds should be implemented to investigate the cause of frequency increase for reduced GC.
CPP values ( dB) cover the range of in vivo investigations of Hillenbrand et al.,37,39 who examined breathy and non-breathy voices, and reported a resulting difference in of 8.5 dB.
Glottal gap was induced by inserting several metal shims between the arytenoid cartilages. GGI ranges from 0 for complete closure (GC1) up to 0.46 ± 0.14 for the case without vocal fold contact (GC4). For GC2 (posterior glottal gap) which is associated with the phonation of healthy women, GGI is 0.1 ± 0.08. This lies within the range of Patel et al. and Kunduk et al., who reported values for healthy women of 0.05 ± 0.07 (Ref. 32) and 0.17 ± 0.31.45
OQ was evaluated solely for GC1 and was found to be 0.54 ± 0.21. For , the values were 1 due to no complete closure of the vocal folds. OQ values for healthy subjects were found to be 0.78 ± 0.18.46 Disaggregated by gender, OQ values were reported for men: 0.70 ± 0.16,47 0.74 ± 0.19;33 for women: 0.85 ± 0.13,47 0.86 ± 0.17.33 Baken and Orlikoff34 reported values for healthy subjects from 0.47 to 0.82 depending on f0 (120 Hz…325 Hz) and vocal intensity (low…high). OQ values for GC1 lie within the range of the findings in literature that are calculated on the basis of the GA.
Despite a symmetric stimulation of vocal fold adduction in our experiment, PA values up to 0.11 ± 0.06 for GC4 are observed. The values lie within the range of normal phonation with a maximal asymmetry of 20% for healthy subjects reported by Bonilha et al.48 and Mehta et al.14
Both periodicity values (AP, TP) lie in the range of the values for normal phonation reported in literature. Patel et al.32 reported values for men (AP = 0.99 ± 0.003, TP = 0.97 ± 0.01) and women (AP = 0.98 ± 0.01, TP = 0.95± 0.02). Corresponding values of GC1 (“healthy men”) were AP = 0.96 ± 0.05 and TP = 0.95 ± 0.04, of GC2 (“healthy women”) values were AP = 0.97 ± 0.04 and TP = 0.95± 0.04. Due to the lack of high-speed investigations in the clinical environment, quantitative values for vocal fold periodicity, especially for pathological phonation, are rare.
In summary, the aerodynamic parameter values in this study cover a wider range than comparable investigations with ex vivo porcine larynges. Acoustic measures displayed the difference between healthy and pathological phonation according to in vivo investigations. Glottal gap and dynamic measures lie in the range of the values reported in literature of in vivo investigations. Despite the fact that the ventricular folds are active oscillators in normal phonation of the porcine larynx26 this model seems to be suitable for the investigation of human phonation.
B. Influence of GC
With regard to goal number 1, the influence of GC on glottal and aerodynamic parameters and the acoustic output is discussed.
Vocal fold contact has a distinct influence on the acoustic output, see Fig. 4. CPP decreases from complete closure (GC1) to cases with posterior gap (GC2) and decreases further to cases exhibiting partial vocal fold contact (GC3) and no contact (GC4). This indicates that complete GC is most beneficial for the acoustic output.
The assigned GC groups are based on the classification of GC in Inwald et al.3 The authors report that a subjective evaluation of GC is a valuable predictor for distinguishing between healthy and pathological subjects. This corresponds to the major decrease in CPP from “healthy men and women” () to “pathological phonation” (). Mehta et al.24 reported an increase in cepstral peak magnitude due to the reduction of glottal gap by phonomicrosurgical treatment of organic lesions. The authors assume the variation of turbulent noise to be the main factor of decrease in cepstral measures. This confirms the decrease of CPP from GC1 to GC2.
Despite the increased glottal gap from GC3 to GC4, CPP does not show significant differences. This indicates that the curve tends toward a limiting value, although the reason is not evident. Hence, a further evaluation of the factors affecting the acoustic output is executed in the following.
An increase in GGI and OQ for increasing GC groups confirm the visual classification of the GC types (see Fig. 5). For GC1 the GGI value is 0 and OQ less than 1, which indicates a complete closure of the vocal folds during each oscillation cycle. For , the OQ value is 1 and GGI larger than 0 indicating a remaining gap during each cycle. A major increase in GGI is found for GC3 to GC4, since the minimal GA in one cycle increases dramatically for cases without vocal fold contact.
Comparing (healthy) with (dysphonic), a significant increase in PA is observed. The phonation seems to be destabilized by an increase in glottal gap. These findings are supported by Yamauchi et al.,49 who reported that increased vocal fold asymmetries are combined with a poorer GC in in vivo investigations. Isshiki et al.50 reported that a complete closure reduces vibrational vocal fold asymmetry in canine and human ex vivo larynges with induced asymmetrical vocal fold tension.
A slight but significant decrease of PA is observed for GC1 (PA = 0.06 ± 0.04) to GC2 (PA = 0.04 ± 0.03), which indicates a stabilization of the phonation for cases with posterior gap. This was also observed in Patel et al.,32 who reported values for men of PA = 0.05 ± 0.04 and for women of PA = 0.03 ± 0.02 in vocally healthy subjects. This decrease is in the same range (2%) of the investigations in this study.
The destabilization from to is also indicated by a decrease of AP and TP values. This is confirmed by Kobayashi et al.,51 who reported a stabilizing effect of reduced pre-phonatory glottal gap resulting in an increased periodicity at unilateral paralyzed vocal folds in ex vivo canine larynges. While the periodicity parameters do not change for GC1 to GC2, a slight increase from GC3 to GC4, which is only significant for TP, is observed. The results suggest that a partial contact decreases the oscillation periodicity of the vocal folds. The increased periodicity for cases without vocal fold contact was also observed by Isshiki et al.50 in ex vivo canine and human larynx experiments supported by a theoretical model.
Generally, the variation of absolute values of PA, AP, and TP are small compared to the total value range of the parameters. Hence, it is questionable if these parameters are suitable for voice evaluation in the clinical environment.
Overall, the increase in R is significant, whereas major variation is present from GC1 to GC2. This was also reported in Döllinger et al.,52 who investigated the dependency of R on the vocal fold adduction level, which also results in larger amplitudes of oscillation. Zañartu et al.53 reported a reduction of energy transfer from the glottal airstream to the vocal folds up to 80% for an increase of glottal gap size to 0.1 cm in a numerical model. The results suggest that small gaps between the vocal folds decrease the glottal flow resistance dramatically and thus the energy transfer. Hence, a complete closure of the vocal folds yields to a large energy transfer since the maximum value of R was found for GC1. Comparatively small changes in R for indicate that variation in gap size does not lead to a considerable contribution to the energy transfer in cases with pre-phonatory glottal gap.
In summary, a complete GC (GC1) but also cases with posterior gap (GC2) stabilize the phonatory process indicated by decreased PA and increased AP and TP. The visual classification of GC was confirmed by the objective parameters quantifying the GC. The energy transfer from the glottal airstream to the vocal folds is strongly dependent on the glottal gap size. This fact and the destabilizing effect of the glottal gap are also reflected by the decrease of acoustic quality. The stagnation of CPP from cases with partial vocal fold contact (GC3) to no contact (GC4) raises questions which require further investigations. Increased periodicity values for GC4 compared to GC3 might provide an explanation for this phenomenon. Our hypothesis is that periodicity is a function of contact area between the vocal folds with a minimum value even lower than the periodicity for GC4. Thus, at high GGI values, vocal fold contact seems to further destabilize the vocal fold oscillation compared to the commonly suggested worst case of no contact.
C. Influence of glottal and aerodynamic parameters on acoustic quality
With regard to goal number 2, the influence of the glottal and aerodynamic parameters on the acoustic quality is discussed.
GGI yields a negative influence on CPP (see Fig. 8). This is consistent with the findings of Chen et al.,54 who reported that CPP is affected by glottal gap area in in vivo investigations. Being a valuable predictor for glottal gap, GGI can serve as indicator for proper phonation. However, the absolute value of GGI does not provide information about the location of the gap and the type of vocal fold contact.
OQ reflects the duration of complete GC. A decrease of CPP from OQ1 to OQ3 is observable (see Fig. 7), whereas solely the group comparison between OQ1 and OQ3 yields statistical significant results. This indicates that, besides the fact that closure is present, the duration of complete GC has an influence on the acoustic quality. OQ3 and OQ4 exhibit similar CPP values. This refers to the fact that a phonatory sample which includes a posterior gap may yield comparable acoustic output with cases involving complete closure, but a short period of vocal fold contact. It may be assumed that a posterior gap would not necessarily lead to a poor acoustic signal. This coincides with the findings of Kreiman et al.,55 who reported that the cause of breathiness is dependent on the speaker. In fact, breathiness is the result of a combination of several factors and therefore cannot be inferred solely from OQ. The severe drop of the curve toward OQ5 confirms the findings from above that partial and no vocal fold contact (corresponding to GC3 and GC4) decreases the acoustic quality significantly compared to cases with complete vocal fold contact and posterior gap (GC1 and GC2). Due to the fact that OQ assumes the value of 1 as soon as a gap remains during phonation (in this study about 78%), this parameter can only provide information when the glottis closes completely and is therewith limited in application.
Glottal dynamic parameters contain detailed information about vocal fold oscillation. Low PA as well as high AP and TP are indicators of a stable phonatory process. Despite the fact that the absolute values show a small variation and PA lies within the range of normal phonation,48 a significant influence on the acoustic quality is still noticeable (see Fig. 8). For increasing PA, the CPP value decreases. This was also reported by Yamauchi et al.,49 who found asymmetries of vocal fold vibration in patients with laryngeal pathologies. Samlan and Story38 found an impact of different types of asymmetry on the acoustic quality in a numerical model.
Furthermore, a decreased periodicity (ascending AP and TP groups) of vocal fold oscillation leads to a reduced acoustic quality. This was also reported by Mehta et al.,24 who attributes an increase in cepstral peak magnitude to an attenuation of vocal fold aperiodicity.
Despite the fact that the cluster centers of TP (see Table II) cover a smaller range than the ones for AP, the comparison between TP groups yields overall significant results whereas for AP only the comparisons between and are significant. This indicates that TP, in particular, plays an essential role in voice quality.
An increasing glottal flow resistance R (and thus a higher energy transfer from the glottal flow to the vocal folds) yields an improved acoustic quality, see Fig. 8, with overall significance for all group comparisons. This is supported by Rosenthal et al.,56 who reported an increase in CPP by increasing glottal flow resistance estimated in vocally healthy patients producing speech at minimal, medium, and maximum vocal effort. In in vivo investigations, R can only be measured directly using invasive methods or assumed using indirect measuring methods. Hence, integration in the clinical environment is a critical process.
In summary, glottal parameters confirmed that GC and the duration of closure are essential for proper phonation. Despite only a small variation of glottal oscillation parameters, their influence on CPP was very clear. Glottal flow resistance is strongly dependent on the glottal gap and correlates with acoustic quality. Thus, it is potentially a reliable predictor for healthy voice.
V. SUMMARY AND CONCLUSION
This study systematically investigates the factors impacting the phonatory process and the resulting acoustic quality. Ex vivo porcine larynx experiments were used, which offer the advantage of direct parameter control as compared to in vivo investigations.
The resulting absolute values of glottal parameters quantifying glottal gap and vocal fold oscillation characteristics lie in the range of the results reported in in vivo investigations.
With respect to goal number 1 (investigation of GC types on acoustic quality, vocal fold oscillation characteristics, and aerodynamic properties), the results indicate that complete GC (GC1) but also cases with posterior gap (GC2), stabilize the phonatory process. This was indicated by decreased PA and increased AP and TP. This stabilization was also observed in an increase of the acoustic quality indicated by CPP especially for complete closure (GC1).
Further examination of asymmetry and periodicity parameters (goal number 2) showed that even small changes of absolute values exhibited a remarkable influence on the acoustic quality. Hence, detailed investigations of these parameters on the acoustic output would be desirable, e.g., by isolated variation of left-right asymmetry or AP and TP in numerical models. Furthermore, standard values regarding glottal dynamic parameters (especially periodicity) for healthy and pathological subjects need to be established. However, these small variations in dynamic parameter values question their applicability for diagnosis of voice disorders in the clinical environment. Furthermore, a direct influence of therapy on these parameters is doubtful. Therefore, it is suggested that the focus be on the optimization of surgical methods for reducing the glottal gap which have proven to be beneficial for voice quality.
Besides these main effects the following results were observed:
The energy transfer from the glottal airstream to the vocal folds, as indicated by the glottal resistance, was strongly dependent on GC and had an essential influence on the acoustic quality. This parameter seems to be a good predictor of a healthy voice, but is limited in the direct measurement in in vivo investigations.
Aside from the GC itself, the duration of complete closure, as indicated by the OQ was beneficial for the acoustic quality. However, this parameter is only applicable for cases with complete GC (in this study in only 22% of the data). A calculation using a combination of multiline kymogram and GA might help overcome this limitation.
Results indicated that the glottal gap size only exerts an influence on the acoustic quality up to a certain degree. For cases without vocal fold contact (GC4) compared to cases with partial vocal fold contact (GC3), CPP did not significantly change. A possible explanation could be found in the increased periodicity values for cases without vocal fold contact compared to cases with partial contact. To examine this effect, further studies have to be undertaken to investigate the influence of total glottal gap size and periodicity on acoustic quality.
ACKNOWLEDGMENTS
This research was supported by Deutsche Forschungsgemeinschaft Grant No. DO1247/6-1. The effort of D.A.B. on this project was supported by National Institutes of Health/National Institute on Deafness and Other Communication Disorders (NIH/NIDCD) Grant No. R01 DC013323.
APPENDIX: STATISTICAL ANALYSIS
TABLE III.
Corrected significance levels (Dunn-Bonferroni) for post hoc tests (Mann-Whitney-U).
| Parameter group | |
|---|---|
| 0.008 | |
| 0.005 | |
| 0.017 |
TABLE IV.
P-values (Kruskal-Wallis test) for comparison of acoustic, glottal, and aerodynamic parameters between the four GC groups.
| post-hoc test 0.008) | Kruskal-Wallis | ||||||
|---|---|---|---|---|---|---|---|
| Parameters | ( 0.05) | ||||||
| f0 | 0.113 | 0.000 | 0.841 | 0.000 | 0.000 | 0.000 | 0.000 |
| CPP | 0.000 | 0.000 | 0.279 | 0.000 | 0.000 | 0.000 | 0.000 |
| GGI | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| OQ | 0.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| PA | 0.000 | 0.000 | 0.156 | 0.000 | 0.000 | 0.000 | 0.000 |
| AP | 0.012 | 0.000 | 0.415 | 0.000 | 0.000 | 0.000 | 0.000 |
| TP | 0.820 | 0.000 | 0.002 | 0.000 | 0.498 | 0.485 | 0.000 |
| R | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
TABLE V.
P-values (Kruskal-Wallis test) for comparison of CPP between the five OQ groups.
| Parameter | post-hoc test 0.005) | Kruskal-Wallis ( 0.05) | ||||
|---|---|---|---|---|---|---|
| CPP | 0.000 | |||||
| 0.461 | 0.046 | 0.536 | 0.000 | 0.002 | ||
| 0.000 | 0.000 | 0.013 | 0.000 | 0.000 | ||
TABLE VI.
P-values (Kruskal-Wallis test) for comparison of CPP between the three glottal and aerodynamic parameter groups.
| Parameter | post hoc test | Kruskal-Wallis | ||
|---|---|---|---|---|
| 0.000 | 0.000 | 0.000 | 0.000 | |
| 0.028 | 0.000 | 0.000 | 0.000 | |
| CPP | ||||
| 0.000 | 0.133 | 0.000 | 0.000 | |
| 0.000 | 0.004 | 0.000 | 0.000 | |
| 0.011 | 0.000 | 0.000 | 0.000 | |
References
- 1. Ruben R. J., “ Redefining the survival of the fittest: Communication disorders in the 21st century,” Laryngoscope 110, 241–245 (2000). 10.1097/00005537-200002010-00010 [DOI] [PubMed] [Google Scholar]
- 2. Isshiki N., Morita H., Okamura H., and Hiramoto M., “ Thyroplasty as a new phonosurgical technique, Acta Oto-Laryngol. 78, 451–457 (1974). 10.3109/00016487409126379 [DOI] [PubMed] [Google Scholar]
- 3. Inwald E. C., Döllinger M.,” Schuster M., Eysholdt U., and Bohr C., “ Multiparametric analysis of vocal fold vibrations in healthy and disordered voices in high-speed imaging,” J. Voice 25, 576–590 (2011). 10.1016/j.jvoice.2010.04.004 [DOI] [PubMed] [Google Scholar]
- 4. Hoffman M. R., Surender K., Devine E. E., and Jiang J. J., “ Classification of glottic insufficiency and tension asymmetry using a multilayer perceptron,” Laryngoscope 122, 2773–2780 (2012). 10.1002/lary.23549 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Park J. B. and Mongeau L., “ Experimental investigation of the influence of a posterior gap on glottal flow and sound,” J. Acoust. Soc. Am. 124, 1171–1179 (2008). 10.1121/1.2945116 [DOI] [PubMed] [Google Scholar]
- 6. Sittel C., Bosch N., and Plinkert P. K., “ Surgical voice rehabilitation in unilateral vocal fold paralysis,” Chirurg. 79, 1055–1064 (2008) (in German). 10.1007/s00104-008-1550-6 [DOI] [PubMed] [Google Scholar]
- 7. Mallur P. S. and Rosen C. A., “ Vocal fold injection: Review of indications, techniques, and materials for augmentation,” Clin. Exp. Otorhinolar. 3, 177–182 (2010). 10.3342/ceo.2010.3.4.177 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Inagi K., Connor N. P., Suzuki T., Bless D. M., and Kamijo T., “ Visual observations of glottal configuration and vocal outcomes in arytenoid adduction,” Am. J. Otolaryng. 24, 290–296 (2003). 10.1016/S0196-0709(03)00054-1 [DOI] [PubMed] [Google Scholar]
- 9. Siu J., Tam S., and Fung K., “ A comparison of outcomes in interventions for unilateral vocal fold paralysis: A systematic review,” Laryngoscope 126, 1616–1624 (2016). 10.1002/lary.25739 [DOI] [PubMed] [Google Scholar]
- 10. Patel R. R., Dixon A., Richmond A., and Donohue K. D., “ Pediatric high speed digital imaging of vocal fold vibration: A normative pilot study of glottal closure and phase closure characteristics,” Int. J. Pediatr. Otorhinolaryngol. 76, 954–959 (2012). 10.1016/j.ijporl.2012.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Södersten M. and Lindestad P., “ Glottal closure and perceived breathiness during phonation in normally speaking subjects,” J. Speech Lang. Hear. Res. 33, 601–611 (1990). 10.1044/jshr.3303.601 [DOI] [PubMed] [Google Scholar]
- 12. Sördersten M., Hertegard S., and Hammarberg B., “ Glottal closure, transglottal airflow, and voice quality in healthy middle-aged women,” J. Voice 9, 182–197 (1995). 10.1016/S0892-1997(05)80252-8 [DOI] [PubMed] [Google Scholar]
- 13. Kendall K. A., “ High-speed digital imaging of the larynx: Recent advances,” Curr. Opin. Otolaryngol. Head Neck Surg. 20, 466–471 (2012). 10.1097/MOO.0b013e328359840d [DOI] [PubMed] [Google Scholar]
- 14. Mehta D. D., Deliyski D. D., Quatieri T. F., and Hillman R. E., “ Automated measurement of vocal fold vibratory asymmetry from high-speed videoendoscopy recordings,” J. Speech Lang. Hear. Res. 54, 47–54 (2011). 10.1044/1092-4388(2010/10-0026) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Verdonck-de Leeuw I. M., Festen J. M., and Mahieu H. F., “ Deviant vocal fold vibration as observed during videokymography: The effect on voice quality,” J. Voice 15, 313–322 (2001). 10.1016/S0892-1997(01)00033-9 [DOI] [PubMed] [Google Scholar]
- 16. Eysholdt U., Rosanowski F., and Hoppe U., “ Vocal fold vibration irregularities caused by different types of laryngeal asymmetry,” Eur. Arch. Otorhinolaryngol. 260, 412–417 (2003). 10.1007/s00405-003-0606-y [DOI] [PubMed] [Google Scholar]
- 17. Yamauchi A., Imagawa H., Sakakibara K., Yokonishi H., Nito T., Yamasoba T., and Tayama N., “ Phase difference of vocally healthy subjects in high-speed digital imaging analyzed with laryngotopography,” J. Voice 27, 39–45 (2013). 10.1016/j.jvoice.2012.08.002 [DOI] [PubMed] [Google Scholar]
- 18. Krenmayr A., Wöllner T., Supper N., and Zorowka P., “ Visualizing phase relations of the vocal folds by means of high-speed videoendoscopy,” J. Voice 26, 471–479 (2012). 10.1016/j.jvoice.2011.05.005 [DOI] [PubMed] [Google Scholar]
- 19. Lindestad P., Hertegard S., and Björck G., “ Laryngeal adduction asymmetries in normal speaking subjects,” Logoped. Phoniatr. Vocol. 29, 128–134 (2004). 10.1080/14015430410017009 [DOI] [PubMed] [Google Scholar]
- 20. Zhang Z., Kreiman J., Gerratt B. R., and Garellek M., “ Acoustic and perceptual effects of changes in body layer stiffness in symmetric and asymmetric vocal fold models,” J. Acoust. Soc. Am. 133, 453–462 (2013). 10.1121/1.4770235 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Samlan R. A., Story B. H., Lotto A. J., and Bunton K., “ Acoustic and perceptual effects of left-right laryngeal asymmetries based on computational modeling,” J. Speech. Lang. Hear. Res. 57, 1619–1637 (2014). 10.1044/2014_JSLHR-S-12-0405 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Mehta D. D., Zañartu M., Quatieri T. F., Deliyski D. D., and Hillman R. E., “ Investigating acoustic correlates of human vocal fold vibratory phase asymmetry through modeling and laryngeal high-speed videoendoscopy,” J. Acoust. Soc. Am. 130, 3999–4009 (2011). 10.1121/1.3658441 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Deliyski D. D., Petrushev P. P., Bonilha H. S., Gerlach T. T., Martin-Harris B., and Hillman R. E., “ Clinical implementation of laryngeal high-speed videoendoscopy: Challenges and evolution,” Folia. Phoniatr. Logop. 60, 33–44 (2008). 10.1159/000111802 [DOI] [PubMed] [Google Scholar]
- 24. Mehta D. D., Zeitels S. M., Burns J. A., Friedman A. D., Deliyski D. D., and Hillman R. E., “ High-speed videoendoscopic analysis of relationships between cepstral-based acoustic measures and voice production mechanisms in patients undergoing phonomicrosurgery,” Ann. Otol. Rhinol. Laryngol. 121, 341–347 (2012). 10.1177/000348941212100510 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Biever D. M. and Bless D. M., “ Vibratory characteristics of the vocal folds in young adult and geriatric women,” J. Voice 3, 120–131 (1989). 10.1016/S0892-1997(89)80138-9 [DOI] [Google Scholar]
- 26. Alipour F. and Jaiswal S., “ Phonatory characteristics of excised pig, sheep, and cow larynges,” J. Acoust. Soc. Am. 123, 4572–4581 (2008). 10.1121/1.2908289 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Alipour F. and Jaiswal S., “ Glottal airflow resistance in excised pig, sheep, and cow larynges,” J. Voice 23, 40–50 (2009). 10.1016/j.jvoice.2007.03.007 [DOI] [PubMed] [Google Scholar]
- 28. Chan R. W. and Titze I. R., “ Effect of postmortem changes and freezing on the viscoelastic properties of vocal fold tissues,” Ann. Biomed. Eng. 31, 482–491 (2003). 10.1114/1.1561287 [DOI] [PubMed] [Google Scholar]
- 29. Birk V., Döllinger M., Sutor A., Berry D. A., Gedeon D., Traxdorf M., Wendler O., Bohr C., and Kniesburges S., “ Automated setup for ex vivo larynx experiments,” J. Acoust. Soc. Am. 141, 1349–1359 (2017). 10.1121/1.4976085 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. van den Berg Jw., Zantema J. T., and Doornenbal P., “ On the air resistance and the Bernoulli effect of the human larynx,” J. Acoust. Soc. Am. 29, 626–631 (1957). 10.1121/1.1908987 [DOI] [Google Scholar]
- 31. Döllinger M., Kunduk M., Kaltenbacher M., Vondenhoff S., Ziethe A., Eysholdt U., and Bohr C., “ Analysis of vocal fold function from acoustic data simultaneously recorded with high-speed endoscopy,” J. Voice 26, 726–733 (2012). 10.1016/j.jvoice.2012.02.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Patel R., Dubrovskiy D., and Döllinger M., “ Characterizing vibratory kinematics in children and adults with high-speed digital imaging,” J. Speech Lang. Hear. Res. 57, 674–686 (2014). 10.1044/2014_JSLHR-S-12-0278 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Patel R., Dubrovskiy D., and Döllinger M., “ Measurement of glottal cycle characteristics between children and adults: Physiological variations,” J. Voice 28, 476–486 (2014). 10.1016/j.jvoice.2013.12.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Baken R. J. and Orlikoff R. F., “ Laryngeal function,” in Clinical Measurement of Speech and Voice, 2nd ed. ( Cengage Learning, Clifton Park, NY, 1999), Chap. 10, pp. 407–413. [Google Scholar]
- 35. Qiu Q., Schutte H. K., Gu L., and Yu Q., “ An automatic method to quantify the vibration properties of human vocal folds via videokymography,” Folia Phoniatr. Logo. 55, 128–136 (2003). 10.1159/000070724 [DOI] [PubMed] [Google Scholar]
- 36. Kreiman J. and Gerratt B. R., “ Perceptual sensitivity to first harmonic amplitude in the voice source,” J. Acoust. Soc. Am. 128, 2085–2089 (2010). 10.1121/1.3478784 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Hillenbrand J. and Houde R. A., “ Acoustic correlates of breathy vocal quality: Dysphonic voices and continuous speech,” J. Speech Hear. Res. 39, 311–321 (1996). 10.1044/jshr.3902.311 [DOI] [PubMed] [Google Scholar]
- 38. Samlan R. A. and Story B. H., “ Influence of left-right asymmetries on voice quality in simulated paramedian vocal fold paralysis,” J. Speech Lang. Hear. Res. 60, 306–321 (2017). 10.1044/2016_JSLHR-S-16-0076 [DOI] [PubMed] [Google Scholar]
- 39. Hillenbrand J., Cleveland R. A., and Erickson R. L., “ Acoustic correlates of breathy vocal quality,” J. Speech Hear. Res. 37, 769–778 (1994). 10.1044/jshr.3704.769 [DOI] [PubMed] [Google Scholar]
- 40. Chhetri D. K., Neubauer J., Sofer E., and Berry D. A., “ Influence and interactions of laryngeal adductors and cricothyroid muscles on fundamental frequency and glottal posture control,” J. Acoust. Soc. Am. 135, 2052–2064 (2014). 10.1121/1.4865918 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Titze I., “ On the relation between subglottal pressure and fundamental frequency in phonation,” J. Acoust. Soc. Am. 85, 901–906 (1989). 10.1121/1.397562 [DOI] [PubMed] [Google Scholar]
- 42. Titze I., Riede T., and Mau T., “ Predicting achievable fundamental frequency ranges in vocalization across species,” PLoS Comput. Biol. 12, e1004907 (2016). 10.1371/journal.pcbi.1004907 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Yamauchi A., Yokonishi H., Imagawa H., Sakakibara K., Nito T., and Tayama N., “ Quantitative analysis of vocal fold vibration in vocal fold paralysis with the use of high-speed digital imaging,” J. Voice. 30, 766.e13–766.e22 (2016). 10.1016/j.jvoice.2015.10.015 [DOI] [PubMed] [Google Scholar]
- 44. Wolfe V., Fitch J., and Cornell R., “ Acoustic prediction of severity in commonly occurring voice problems,” J. Speech Hear. Res. 38, 273–279 (1994). 10.1044/jshr.3802.273 [DOI] [PubMed] [Google Scholar]
- 45. Kunduk M., Döllinger M., McWorther A. J., and Lohscheller J., “ Assessment of the variability of vocal fold dynamics within and between recordings with high-speed imaging and by phonovibrogram,” Laryngoscope 120, 981–987 (2010) 10.1002/lary.20832. [DOI] [PubMed] [Google Scholar]
- 46. Yamauchi A., Yokonishi H., Imagawa H., Sakakibara K., Nito T., Tayama N., and Yamasoba T., “ Quantitative analysis of digital videokymography: A preliminary study on age- and gender-related difference of vocal fold vibration in normal speakers,” J. Voice 29, 109–119 (2015). 10.1016/j.jvoice.2014.05.006 [DOI] [PubMed] [Google Scholar]
- 47. Tsutsumi M., Isotani S., Pimenta R. A., Dajer M. E., Hachjya A., Tsuji D. H., Tayama N., Yokonishi H., Imagawa H., Yamauchi A., Takano S., Sakakibara K., and Montagnoli A. N., “ High-speed videolaryngoscopy: Quantitative parameters of glottal area waveforms and high-speed kymography in healthy individuals,” J. Voice 31, 282–290 (2017). 10.1016/j.jvoice.2016.09.026 [DOI] [PubMed] [Google Scholar]
- 48. Bonilha H. S., Deliyski D. D., and Gerlach T. T., “ Phase asymmetries in normophonic speakers: Visual judgments and objective findings,” Am. J. Speech. Lang. Pathol. 17, 367–376 (2008). 10.1044/1058-0360(2008/07-0059) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Yamauchi A., Yokonishi H., Imagawa H., Sakakibara K., Nito T., Tayama N., and Yamasoba T., “ Quantification of vocal fold vibration in various laryngeal disorders using high-speed digital imaging,” J. Voice 30, 205–214 (2016). 10.1016/j.jvoice.2015.04.016 [DOI] [PubMed] [Google Scholar]
- 50. Isshiki N., Tanabe M., Ishizaka K., and Broad D., “ Clinical significance of asymmetrical vocal cord tension,” Ann. Oto. Rhinol. Laran. 86, 58–66 (1977). 10.1177/000348947708600109 [DOI] [PubMed] [Google Scholar]
- 51. Kobayashi J., Yumoto E., Hyodo M., and Gyo K., “ Two-dimensional analysis of vocal fold vibration in unilaterally atrophied larynges,” Laryngoscope 110, 440–446 (2000). 10.1097/00005537-200003000-00022 [DOI] [PubMed] [Google Scholar]
- 52. Döllinger M., Berry D. A., and Kniesburges S., “ Dynamic vocal fold parameters with changing adduction in ex-vivo hemilarynx experiments,” J. Acoust. Soc. Am. 139, 2372–2385 (2016). 10.1121/1.4947044 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Zañartu M., Galindo G. E., Erath B. D., Peterson S. D., Wodicka G. R., and Hillman R. E., “ Modeling the effects of a posterior glottal opening on vocal fold dynamics with implications for vocal hyperfunction,” J. Acoust. Soc. Am. 136, 3262–3271 (2014). 10.1121/1.4901714 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Chen G., Kreiman J., Shue Y., Alwan A., and Australia D., “ Acoustic correlates of glottal gaps,” in 2011 Interspeech, Florence, Italy, pp. 2673–2676. [Google Scholar]
- 55. Kreiman J., Shue Y., Chen G., Iseli M., Gerratt B. R., Neubauer J., and Alwan A., “ Variability in the relationships among voice quality, harmonic amplitudes, open quotient, and glottal area waveform shape in sustained phonation,” J. Acoust. Soc. Am. 132, 2625–2632 (2012). 10.1121/1.4747007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Rosenthal A. L., Lowell S. Y., and Colton R. H., “ Aerodynamic and acoustic features of vocal effort,” J. Voice 28, 144–153 (2014). 10.1016/j.jvoice.2013.09.007 [DOI] [PubMed] [Google Scholar]







