Abstract
Normal voice is characterized by periodic oscillations of the vocal folds. On the other hand, disordered voice dynamics (e.g., subharmonic and aperiodic oscillations) are often associated with voice pathologies and dysphonia. Unfortunately, not all investigations may be conducted on human subjects; hence animal laryngeal studies have been performed for many years to better understand human phonation. The rabbit larynx has been shown to be a potential model of the human larynx. Despite this fact, only a few studies regarding the phonatory parameters of rabbit larynges have been performed. Further, to the best of our knowledge, no ex vivo study has systematically investigated phonatory parameters from high-speed, audio and subglottal pressure data with irregular oscillations. To remedy this, the present study analyzes experiments with sustained phonation in 11 ex vivo rabbit larynges for 51 conditions of disordered vocal fold dynamics. (1) The results of this study support previous findings on non-disordered data, that the stronger the glottal closure insufficiency is during phonation, the worse the phonatory characteristics are; (2) aperiodic oscillations showed worse phonatory results than subharmonic oscillations; (3) in the presence of both types of irregular vibrations, the voice quality (i.e., cepstral peak prominence) of the audio and subglottal signal greatly deteriorated compared to normal/periodic vibrations. In summary, our results suggest that the presence of both types of irregular vibration have a major impact on voice quality and should be considered along with glottal closure measures in medical diagnosis and treatment.
Keywords: ex vivo phonation, rabbit model, aperiodic dynamics, subharmonic dynamics, high-speed digital imaging
1. Introduction
Voice plays an essential role in interpersonal communication. Therefore, healthy voice production is indispensable and pathologies are associated with a significant loss of quality of life. Various professions deal with this subject, including phoniatricians and speech language pathologists, whose tasks include the exploration and treatment of voice, speech and language disorders. This area of responsibility also includes gaining a profound understanding of the phonation mechanism that has been researched for many decades [1,2]. An in-depth understanding of these mechanisms is important to derive new findings about speech system dysfunctions from which new behavioral, medical, and surgical approaches may be developed.
The voice originates in the larynx (Figure 1a). Through the core muscles, pressure builds up in the lungs and air is directed through the trachea to the larynx, which causes the vocal folds to oscillate (Figure 1b). By opening and closing of the vocal folds, a fundamental tone is generated (in humans usually between 100–300 Hz for normal phonation [3,4]), which is modulated in the vocal tract and then emitted through the mouth [5].
Figure 1.
(a) Simplified way of air for voice production in overview; (b) Schematic longitudinal section through the larynx with laminar airflow under and turbulent airflow above the vocal folds.
Many years of research have made it possible to define certain parameters from which it is feasible to draw conclusions about the histological structures, the voice acoustics and the biomechanical components of the vocal folds. These include a wide variety of aerodynamic and acoustic parameters, as well as parameters computed from high-speed digital imaging (HSI) data [6–8].
The technical progress and possibility to record high-speed images of the vocal folds during the high frequency phonatory process was a revolution in clinical phoniatric diagnostics. Since the first analog high-speed recording of human vocal folds in 1940 by Farnsworth [9], technology has evolved steadily. In recent years, the cameras have become smaller and faster and the spatial resolution has improved enormously. Particularly when used in the clinical diagnosis of dysphonia, the HSI technique is superior to stroboscopy [10] and it is impossible to imagine phonatory clinical research today without HSI [10–16]. As long as the frame rate of the camera teaches at least 4000 frames per second, the technique enables one to capture an accurate representatlon of the vibratory cycle of the vocol folds [17]. From these images and computed parameters, conclusions can tie drawn regarding dynamic changes [18] and voice disorders and pathologies [10,12,19–23].
The detected vocal fold vibrations can be subdivided into three classifications [24]. These are type 1 (periodic), type 2 (subharmonic) and type it (aperiodic or chaotic) vibrations. Since healthy voice production is assumed to show periodic vibrations [25], the type 1 vibrations in this study are also celled normal or non-disordered vibrations. The type 2 and type 3 vibrations are called irregular or disordered vibrations and are often related to voice pathologies [26]. These three types of vibrations have been systematically investigated in various studies over the years [27–29]. However, not all data can be collected easily in everyday clinical practice from humans. In particular, subglottal measurements cannot (or only under severe conditions) be obtained in vivo; ex vivo experiments on larynges are necessary [30].
Voice production in humans and mammals is very similar [31]; therefore, animal laryngeal studies have been carried out from which conclusions are drawn regarding human physiology. In the literature, voice production of animals and the comparison with humans has already been examined many times. Sheep, canines [32,33], porcines [34,35], and rabbits [36] are particularly suitable for comparison with humans. The first three are used, especially because of the similar laryngeal dimensions to humans, whereas the comparison with rabbit larynges is suitable, because the tissue properties are similar to those of the human larynx [36,37]. Despite this fact, only a few studies of rabbit phonation have been performed. Maytag et al. (2013) [38] suggested a method for reliable extraction of phonatory, acoustic and videokymographic data of ex vivo rabbit larynges, as this form of data collection from ex vivo rabbit larynges was not yet sufficiently investigated. They found that the rabbit data was similar in intralaryngeal variability to canine laryngeal data. In a study published in 2017, Mills et al. [7] made adaptations to an excised booth, primarily used for canine larynges, to examine the phonatory range of ex vivo rabbit larynges. It was found that increasing airflow and elongation affected subglottal pressure, fundamental frequency, sound pressure level (SPL) and the vibratory amplitude. In 2018, Döllinger et al. [39] performed the first systematic study on ex vivo rabbit larynges, analyzing HSI, audio and subglottal pressure data and the influence of glottal gap characteristics on the phonatory process, as such comprehensive analyses were missing in the literature. Significant influences of applied airflow and the vocal fold elongation level on vocal fold closure insufficiency were detected [40,41]. In the study of Döllinger et al. [39], only the periodic type 1 vibration data was analyzed. In the context of data collection, however, additional subharmonic type 2, as well as aperiodic type 3 vibration data were observed and recorded. This subharmonic type 2 and aperiodic type 3 vibration data are subject to systematic investigation in the present work. To the best of our knowledge, no ex vivo study has been previously reported that systematically investigated phonatory parameters from high-speed, audio and subglottal pressure data for disordered (subharmonic type 2 and aperiodic type 3) vocal fold oscillations. Therefore, the aim of this study is to close existing research gaps by (1) investigating the impact of glottis closure insufficiency on phonatory parameters also in the range of type 2 and type 3 vibrations; (2) compare the results with those of Döllinger et al. [39] who reported on periodic type 1 oscillations; (3) compare which of both vocal fold dynamics (subharmonic type 2 or aperiodic type 3) result in worse parameter values; (4) investigate whether the regularity (subharmonic type 2, aperiodic type 3 oscillations) or the glottis closure has a stronger impact on the fluid-structure-acoustic interaction (i.e., interaction of airflow—vocal folds—resulting sound) of the phonatory process.
2. Materials and Methods
The data acquisition was already previously described in detail [39]; hence the following is only a short overview of the experimental setup and data collection. For detailed information, we refer to the reference study on type 1 oscillations [39]. Data from 11 ex vivo rabbit larynges (New Zealand White, 4–5 kg body weight, ages 14–118 weeks) were used for this study. Since the rabbits were already sacrificed for another study with buprenorphine, no further approval of the ethics committee was required. Ethical acceptability was approved for the previous study (approval number 54-2532.1-54/12). In preparation for the experiments, the larynges were harvested from the sacrificed rabbits. Then the tissue just above and 30 mm below the larynges was surgically removed. To preserve the tissue characteristics, the prepared larynges were quickly frozen at −150 °C in liquid nitrogen and then stored at −80 °C [42]. Before the experiments, the larynges were slowly thawed at 6 °C in a refrigerator. They were then fixated on a 4 mm inside diameter stainless steel tube, which functioned as an artificial trachea. For fixation, a stainless adjustable steel ring was used to prevent air leakage. Rods and positioning screws were used to maintain stable larynx positions, see Figure 1 in [39]. At a distance of 100 mm below the larynx, a hole was drilled in the stainless-steel tube and the pressure sensor was placed there to measure the subglottal pressure. Figure 2 shows the experimental setup.
Figure 2.
Overview of the experimental setup with inscriptions.
The subglottal pressure data were measured with an XCS-93-5PSISG pressure sensor (Kulite Semiconductor Products, Inc., Leonia, NJ, USA), which was positioned 100 mm below the larynx on the inside of the stainless-steel tube. The pressure sensor was connected to a PXIe-4330 bridge module (National Instruments, Austin, TX, USA).
The acoustic pressure data was measured using a 4189 1/2” free-field microphone (Brüel & Kjaer, 2850 Nærum, Denmark) mounted above the glottis at a distance of 200 mm and a tilt angle of 45°. The microphone output signal from a Nexus 2690 microphone was then further processed by a PXIe-4492 dynamic signal acquisition module (National Instruments, Austin, TX, USA). Both signals were resolved at 24 bits.
The vocal fold vibrations were recorded by a Phantom V2511 high-speed camera (Vision Research, Wayne, NJ, USA) at 8000 frames per second (fps) with a spatial resolution of 768 × 768 pixels. The videos were recorded at 16 bits. A Canon EF 180 mm 1: 3.5 L USM macro lens (Canon, Ōta, Tokyo, Japan) was used.
The subglottal pressure signal as well as the acoustic signal were synchronously sampled at a rate of fs = 96 kHz. For this purpose, a PXIe-6356 multifunctional data acquisition module (National Instruments, Austin, TX, USA) was used. The setup was controlled using the software LabView (National Instruments, Austin, TX, USA). A detailed description of the setup and its control is given in Birk et al. (2017) [43].
Different levels of vocal fold pre-stress were induced by three different weights applied anteriorly to the thyroid cartilage (w1 = 1 g, W2 = 2 g, W3 = 5 g). The weights were sutured to the thyroid cartilage and used to shift it forward, thereby simulating contraction of the cricothyroid muscle in three different phonatory positions. An MF1 mass flow controller (MKS Instruments, Andover, MA, USA) powered by a PR4000B digital power supply (MKS Instruments, Andover, MA, USA) was used to generate an airflow passing through the artificial stainless-steel trachea and larynx. That air was humified with water vapor from a Neptune Heated Humidifier (Teleflex, Morrisville, NC, USA), and heated to 37 °C to simulate physiological in vivo conditions.
The HSI technique allows the computation of quantitative, dynamic-based parameters. Its usefulness has been demonstrated in several previous studies [39,43–46]. The HSI data analysis was performed using the in-house software Glottis-Analysis-Tools (GAT) (University Hospital Erlangen, Erlangen, Germany). The basis for all HSI analyses is the segmentation of the change in the area between the vocal folds over time (i.e., glottis area; given in pixels), which is referred to as the Glottal-Area-Waveform (GAW). A characteristic GAW with the corresponding glottal images is shown in Figure 3. The GAW is used to compute various phonation parameters that describe the phonatory process at the vocal fold level and to obtain information about vocal fold oscillations [47] such as periodicity, glottis closure and left-right symmetry of the vocal folds.
Figure 3.
Phonation cycle and typical Glottal-Area-Waveform (GAW) with example images of the glottis during each cycle phase.
Each experimental run was recorded by the HSI camera for a length of 125ms with sustained phonation (i.e., >42 vibratory cycles, meeting the criteria >20 cycles) as suggested by [48] while the subglottal and audio data were recorded for 500 ms [39].
The computed parameters are subdivided into the groups “GAW parameters”, “Aerodynamic parameters” and “Harmonic measures” according to their data sources; see Table 1.
Table 1.
Computed parameters with explanations. 1
Abbreviation, Unit, References | Parameter | Meaning and Interpretation |
---|---|---|
(A) GAW Parameter | ||
GGI (a.u.) [5] | Glottal gap index | Minimum glottal area/maximum glottal area: [0–0.01] glottis entirely closed ]0.01–0.4[ glottis partially closed [0.4–1] little movement and no contact of vocal folds |
ALR (a.u.) [49] | Amplitude to length ratio | Dynamic range of GAW (max–min)/glottis length: the larger the more deformable the vocal folds |
STIFFNESS (1/frames) [50] | Maximum absolute value of 1st derivative/dynamic range: the higher the value the stiffer the tissue | |
ASQ (a.u.) [51] | Asymmetry quotient | Speed quotient/(Speed quotient + 1) |
CQ (a.u.) [52] | Closing quotient | Glottis closing time/cycle duration 0: glottis does not close at all 0,5: glottis is closed 50% of the cycle duration 1,0: glottis is closed all the time, does not open |
OQ (a.u.) [53] | Open quotient | Glottis open time/cycle duration 0: glottis does not open at all 0,5: glottis is open 50% of the cycle duration 1,0: glottis is open all the time, does not close |
SQ (a.u.) [53] | Speed quotient | Opening time/closing time |
ASI (a.u.) [54] | Amplitude symmetry index | Spatial symmetry of GAW: rate between maximum center and right glottal area, the closer to 1 the more symmetric values are by definition between [0;1] |
PAI (a.u.) [55] | Phase asymmetry index | Symmetry in time: deviation in time between center and right GAW amplitude: the closer to 0 the higher the symmetry values are by definition between [0;1] |
(B) Aerodynamic parameters | ||
RB (Pa s−1) [56] | Laryngeal flow resistance | low-high flow resistance, ratio between the transglottal pressure difference and the mean glottal flow rate. A high flow resistance is desired in phonation |
SPL (dB) | Sound pressure level | Intensity of acoustic signal |
PS (Pa) | Subglottal pressure | Averaged air pressure measured below the vocal folds |
(C) Harmonic measures | ||
CPPA,P (dB) [57] | Cepstral peak prominence | Development of harmonics, the higher the better low: low periodicity of the acoustic signal high: high periodicity of the acoustic signal (computed from the audio “A” and the subglottal pressure “P” signal) |
Parameters are separated into (A) GAW parameters, (B)/(C) audio, flow and subglottal measurement data.
In the study by Döllinger et al. [39], only periodic vocal fold vibrations containing exactly one harmonic were considered; i.e., these vibrations are considered normal [39]. In contrast, only (1) periodic vibrations with two harmonics and (2) aperiodic vocal fold vibrations are examined in this present study; i.e., these vibrations are considered as disordered. Figures 4–6 show typical examples of the GAW, audio and subglottal frequencies for periodic/normal (Figure 4), subharmonic (Figure 5), and aperiodic vocal fold vibrations (Figure 6). In these figures, the audio signal was shifted by 0.58 ms to correct for the time delay of the acoustic signal; i.e., distance from the vocal folds to the microphone (200 mm) × acoustic wave propagation in air (343 m/s).
Figure 4.
GAW, audio signal, subglottal frequencies in periodic vocal fold vibrations.
Figure 6.
GAW, audio signal, subglottal frequencies in aperiodic vocal fold vibrations i.e., no periodicity recognizable in the GAW.
Figure 5.
GAW, audio signal, subglottal frequencies in subharmonic vocal fold vibrations with two harmonics in the GAW.
Question 1: How does glottal closure insufficiency, expressed by GGI (Table 1), influence the fluid-structure-acoustic interaction within disordered vocal fold dynamics?
The GGI groups were chosen equivalent to Döllinger et al. [39] in order to ensure comparability of the two studies. The three GGI groups are categorized as follows: Best—GGI1 ([0; 0.01], desired complete glottis closure during vibration), Medium—GGI2 (]0.01, 0.4[, partial closure of the vocal folds), Worst—GGI3 ([0.4; 1], no contact of the vocal folds during phonation). Based on the small group sizes, Kruskal-Wallis tests were applied for multiple group comparisons. For subsequent post hoc tests, Mann-Whitney-U tests with a Bonferroni correction (0.05/3 = 0.017) with a resulting significance level of p = 0.017 were used
Question 2: a) Do the subharmonic dynamics influence the phonatory fluid-structure-acoustic interaction process differently compared to aperiodic dynamics? b) Which of both dynamics result in worse phonatory parameter values?
First, the data was examined for the presence of a normal distribution (Shapiro-Wilk). The group differences of the normally distributed data were tested for statistical significance with the parametric t-test, those of the not normally distributed data with the Mann-Whitney-U-test. All statistical analyses were performed using IBM SPSS Statistics 22 software (IBM, Amonk, NY, USA).
3. Results
3.1. Fundamental Phonatory Parameters
Table 2 provides an overview of the various fundamental phonatory parameters such as fundamental frequency f0 (Hz), subglottal pressure PS (Pa), glottal air flow (mL s−1), flow resistance RB (Pa s−1), and intensity SPL (dB) with the corresponding mean values, minimum and maximum values.
Table 2.
Mean values, minimum and maximum values of the fundamental phonatory parameters.
f0 (Hz) | PS (Pa) | Flow (mL s−1) | RB (Pa s−1) | SPL (dB) | |
---|---|---|---|---|---|
Mean ± std | 655 ±147 | 1324 ± 798 | 120 ± 42 | 11428 ± 5631 | 74.3 ± 8.9 |
Minimum values | 343 | 196 | 42 | 2587 | 54.7 |
Maximum values | 895 | 3318 | 175 | 21557 | 90.7 |
In this study, data of 51 test runs were used. Of these, 35 test runs contained subharmonic (GroupS) and 16 test runs aperiodic oscillations (GroupA). As Table 3 shows, the subharmonic and aperiodic oscillations occurred at all airflow intensities (different airflow levels) and elongation levels (different weight levels), but tended to increase with higher air flow and higher elongation.
Table 3.
Frequencies of the test-runs divided into the different airflow levels and elongation levels.
Airflow levels | Σ = 51 |
---|---|
1 = onset | 3 |
2–6 = low | 15 |
7–11 = medium | 11 |
12–16 = high | 22 |
Weight levels | Σ = 51 |
w1 = 1 g—low | 12 |
w2 = 2 g—medium | 11 |
w3 = 5 g—high | 28 |
Figure 7a shows the fundamental frequencies of the GAW, subglottal pressure signal and acoustic signal for the subharmonic oscillations. Figure 7b shows this correspondingly for the aperiodic oscillations. The GAW signals show the lowest mean fundamental frequency; see Figure 7a (353 Hz) and Figure 7b (482 Hz). This is followed by the mean fundamental frequency of the subglottal pressure data at 696 Hz (Figure 7a) and 635 Hz (Figure 7b). The highest mean fundamental frequency was found for the audio measurements at 850 Hz (Figure 7a) and 978 Hz (Figure 7b). Despite that increasing tendency, the frequencies show considerable fluctuations; see Figure 7a,b.
Figure 7.
Fundamental frequencies f0 subdivided into GAW, subglottal, audio measurements for (a) Subharmonic oscillations (b) Aperiodic oscillations.
3.2. Influence, of the Glottal Gap on the Phonatory Process
Question 1: How does glottal closure insufficiency, expressed by GGI, influence the fluid-structure-acoustic interaction within disordered dynamics?
The results of the statistical tests are presented in Table 4. The group differences of all phonation parameters used were tested, which occurred firstly between the three GGI groups and secondly between the two vibration characteristics (GroupS and GroupA). There were 8 out of 13 tests (61.5%) between all three GGI groups and 10 out of 24 post hoc tests (41.7%) that showed statistically significant differences. Statistically significant differences were found in 6 out of the 8 GAW parameters. Most differences in the direct comparison of two GGI groups were found between GGI1 and GGI2 with 4 statistically significant differences. Between GGI1 and GGI3, there were 2 statistically significant group differences (Table 4) and 2 of the p-values were slightly above the corrected p-value of 0.017; both with p-values of 0.018. Regarding aerodynamic parameters, the SPL values between the three GGI groups were statistically significantly. For the harmonic measures, CPPA was statistically significantly different between GGI1 and GGI3.
Table 4.
Statistical results for the GAW, aerodynamic, and harmonic parameters. 1
Parameters | Kruskal-Wallis-test | Post hoc tests (corrected significance level p = 0.017) | Mann-Whitney-U-/t-test | ||
---|---|---|---|---|---|
GGI1,2,3 | GGI1,2 | GGI1,3 | GGI2,3 | GroupS,A | |
(A) GAW Measures | |||||
ALR (a.u.) | 0.040 | 0.776 | 0.001 | 0.003 | 0.000 |
STIFFNESS (frames−1) | 0.014 | 0.014 | 0.018 | 0.300 | 0.085 |
ASQ (a.u.) | 0.054 | - | - | - | 0.040 |
CQ (a.u.) | 0.005 | 0.002 | 0.018 | 0.511 | 0.002 |
OQ (a.u.) | 0.000 | 0.000 | 0.000 | 0.124 | 0.870 |
SQ (a.u.) | 0.113 | - | - | - | 0.187 |
ASI (a.u.) | 0.017 | 0.016 | 0.825 | 0.036 | 0.429 |
PAI (a.u.) | 0.017 | 0.027 | 0.606 | 0.020 | 0.009 |
(B) Aerodynamic parameters | |||||
RB (Pa s−1) | 0.038 | - | - | - | 0.440 |
SPL (dB) | 0.004 | 0.981 | 0.002 | 0.014 | 0.000 |
PS (Pa) | 0.086 | - | - | - | 0.715 |
(C) Harmonic measures | |||||
CPPA | 0.020 | 0.129 | 0.005 | 0.066 | 0.677 |
CPPP | 0.123 | - | - | - | 0.231 |
First four columns: Calculated p-values between the three GGI groups. Last column: Calculated p-values for two vibrational characteristics (GroupS and GroupA). Significant p-values are highlighted in bold type.
3.3. Influence of the Vibrational Characteristics Subharmonic and Aperiodic on the Phonatory Process
Question 2: a) Do the subharmonic dynamics influence the fluid-structure-acoustic interaction differently compared to aperiodic dynamics? b) Which of both dynamics result in worse parameter values?
The parameters ALR, Stiffness, SPL and CPPA were normally distributed. All other parameters were not normally distributed, so the nonparametric Mann-Whitney-U-test was used for direct group comparison. The corresponding p-values are given in Table 4 in the last column. When comparing the mean values of aperiodic and subharmonic values, statistically significant differences were found in 4 of 8 GAW parameters (ALR, ASQ, CQ, PAI). Of the aerodynamic parameters, only the SPL values were statistically significantly different (p < 0.001). There were no statistically significant differences in the harmonic measures.
3.4. Descriptive Statistics
Table 5 shows the means and standard deviations for all parameters of the three GGI groups (GGI1 (N = 16), GGI2 (N = 29), GGI3 (N = 6)). The GAW parameters CQ and OQ increase steadily from GGI1 to GGI3, while for the parameters ASI and PAI no clear trend is discernible. The remaining parameters decrease from GGI1 to GGI3. With the exception of PS, the values of the other two aerodynamic parameters RB and SPL and those of the harmonic parameters CPPA and CPPP decrease from GGI1 to GGI3.
Table 5.
Mean values and standard deviations for the GAW, aerodynamic and harmonic measures of the three different GGI groups with tendencies.
Parameters | Mean ± standard deviations |
Tendency for GGI1-3 | ||
---|---|---|---|---|
GGI1 | GGI2 | GGI3 | ||
(A) GAW Measures | ||||
ALR (a.u.) | 16.5 ± 4.5 | 16.1 ± 6.7 | 4.9 ± 5.1 | decrease |
STIFFNESS (frames−1) | 0.33 ± 0.06 | 0.28 ± 0.05 | 0.26 ± 0.06 | decrease |
ASQ (a.u.) | 0.60 ± 0.15 | 0.49 ± 0.14 | 0.52 ± 0.05 | decrease |
CQ (a.u.) | 0.34 ± 0.14 | 0.50 ± 0.15 | 0.48 ± 0.05 | increase |
OQ (a.u.) | 0.87 ± 0.13 | 0.99 ± 0.0 | 1.00 ± 0.00 | increase |
SQ (a.u.) | 2.47 ± 2.50 | 1.34 ± 0.97 | 1.25 ± 0.31 | decrease |
ASI (a.u.) | 0.74 ± 0.15 | 0.85 ± 0.08 | 0.74 ± 0.13 | - |
PAI (a.u.) | 0.14 ± 0.10 | 0.10 ± 0.12 | 0.14 ± 0.06 | - |
(B) Aerodynamic parameters | ||||
RB (Pa s−1) | 14376 ± 5090 | 9992 ± 5759 | 10510 ± 4414 | decrease |
SPL (dB) | 76.7 ± 6.5 | 76.0 ± 7.6 | 59.5 ± 7.5 | decrease |
PS (Pa) | 1783 ± 1073 | 1050 ± 514 | 1423 ± 662 | - |
(C) Harmonic measures | ||||
CPPA (dB) | 17.9 ± 4.3 | 15.8 ± 6.5 | 11.0 ± 3.4 | decrease |
CPPP (dB) | 19.4 ± 5.8 | 16.9 ± 5.6 | 14.0 ± 2.9 | decrease |
The mean and standard deviations of all parameters for the two different vibrational characteristics are given in Table 6. From GroupS to GroupA, all GAW parameters except for ASQ, SQ and PAI decrease; see Table 6. Of the aerodynamic parameters, SPL and PS decrease, while RB increases from GroupS to GroupA. Both CPPA and CPPP decrease from GroupS to GroupA.
Table 6.
Mean values and standard deviations for the GAW, aerodynamic and harmonic parameters of the two different vibrational characteristics with tendencies.
Parameter | Mean standard deviation |
Tendency for GroupS,A | |
---|---|---|---|
GroupS | GroupA | ||
(A) GAW measures | |||
ALR (a.u.) | 18.0 ± 5.5 | 8.1 ± 4.4 | decrease |
STIFFNESS (frames−1) | 0.30 ± 0.06 | 0.28 ± 0.05 | decrease |
ASQ (a.u.) | 0.50 ± 0.14 | 0.59 ± 0.14 | increase |
CQ (a.u.) | 0.49 ± 0.15 | 0.35 ± 0.11 | decrease |
OQ (a.u.) | 0.96 ± 0.06 | 0.93 ± 0.15 | decrease |
SQ (a.u.) | 1.46 ± 1.11 | 2.18 ± 2.40 | increase |
ASI (a.u.) | 0.81 ± 0.13 | 0.79 ± 0.11 | decrease |
PAI (a.u.) | 0.11 ± 0.13 | 0.13 ± 0.05 | increase |
(B) Aerodynamic parameters | |||
RB (Pa s−1) | 11001 ± 5846 | 12363 ± 5387 | increase |
SPL (dB) | 78.1 ± 6.8 | 65.9 ± 7.3 | decrease |
PS (Pa) | 1370 ± 824 | 1223 ± 781 | decrease |
(C) Harmonic measures | |||
CPPA (dB) | 16.1 ± 6.2 | 15.4 ± 5.3 | decrease |
CPPP (dB) | 17.9 ± 6.0 | 16.0 ± 4.5 | decrease |
4. Discussion
A comparison of the computed parameters with other ex vivo rabbit studies has already been reported in Döllinger et al. [39], hence we refer to this work for the interested reader. The main focus of this work is the comparison of periodic type 1 vibrations from [39] with the type 2 and type 3 oscillations in this study, as well as on the comparison within the irregular vibration data.
4.1. Fundamental Phonatory Parameters
In comparison with the mean values of Döllinger et al. [39], it is noticeable that the fundamental frequency f0 shows higher mean values (+8%) as do the minimum (+9%) and maximum values (+4%); see Table 2. This may be an indication for pathological voice, since Yamauchi et al. [58] found a relationship between incomplete glottis closure and an increase of the fundamental frequency, while investigating data of patients with vocal fold paralysis. Also, similar findings were reported by Wolfe et al. [59] who found a correlation between jitter and the degree of dysphonia by considering the mean fundamental frequency. Hence, an increased fundamental frequency may be seen as an indicator of a pathological or disordered voice [60]. Air flow strength here (120 mL/s) is close to the value of Döllinger et al. [39] of 117 mL/s, which was to be expected since the data were collected during the same series of experiments.
Figure 7 shows the fundamental frequencies (GAW, audio and subglottal). This figure shows that the frequency of the GAW is the lowest and that of the subglottal and audio measurements tend to be higher. However, due to the aperiodicity of the signals, this tendency is not necessarily a causal interrelation. Hence, an interpretation of this effect is difficult.
4.2. Phonation Parameters
In comparison to Döllinger et al. [39] not all parameters were computed. The parameters Jitter, Shimmer [61] and Harmonic-To-Noise-Ratio (HNR) [62] are computed in the time domain of the signal and need quasi-periodicity of the signal [63]. Since the aperiodic data show reduced or no periodicity, it was not reasonable to compute and consider these parameters.
4.2.1. GAW Parameters
Eight GAW parameters were considered. The ALR parameter differs statistically significantly between GGI1,3 as well as between GGI2,3. Higher ALR values indicate greater deformability and are desirable [39]. In this study, as well as in Döllinger et al. [39]. ALR decreases from GGI1 to GGI3. Since GGI3 has no vocal fold contact, a lower dynamical deformability and consequently a lower value of ALR is plausible compared to GGI1 and GGI2. It is noticeable that the mean ALR for GGI3 from this study is 37% lower compared to the normal ALR at GGI3 in [39]; see Table 7. These worse results suggest less deformability for the disordered oscillation data. In comparison of the two vibrational characteristics, the values of the ALR parameters were statistically significantly worse for GroupA compared with GroupS (Table 4). The difference between the two mean values was −55% (Table 6). Comparing the lowest mean ALR values of GGI (Table 5) and the vibrational characteristics (Table 6), mean ALR was considerably lower for GGI3 at a mean value of 4.92, compared with the mean ALR value for GroupA (8.14). This suggests that the combination of disordered oscillations (GroupS or GroupA) and GGI3 is particularly unfavorable with respect to the oscillatory characteristics of the vocal folds.
Table 7.
Percentage change of this present data to normal data in corresponding GGI groups, Döllinger et al. (2018) [39]. 1
Parameters | GGI1 | GGI2 | GGI3 |
---|---|---|---|
(A) GAW Measures | |||
ALR (a.u.) | +8% | +5% | −37% |
STIFFNESS (frames−1) | −3% | +0% | +0% |
ASQ(a.u.) | −3% | −20% | −12% |
CQ (a.u.) | +6% | +32% | +17% |
OQ (a.u.) | +5% | +0% | +0% |
SQ (a.u.) | +43% | −22% | −18% |
ASI (a.u.) | −6% | +2% | +1% |
PAI (a.u.) | +8% | +0% | +8% |
(B) Aerodynamic parameters | |||
RB (Pa s−1) | −4% | −15% | +42% |
SPL (dB) | −3% | −0% | −14% |
PS (Pa) | +6% | −25% | +60% |
(C) Harmonic measures | |||
CPPA (dB) | −25% | −30% | −43% |
CPPP (dB) | −29% | −36% | −45% |
Positive deviation means that values increased; negative deviations that our data decreased compared to Döllinger et al. [39].
The two GAW parameters ASI and PAI, which reflect the dynamic left-right symmetry of the vocal folds, are associated with voice pathologies [58,64]. Surprisingly in this study, as well as in Döllinger et al. [39] the ASI and PAI values had better values for GGI2 compared to GGI1 and GGI3. Regarding the vibrational characteristics, it should be noted that the values both for ASI and PAI showed worse values for GroupA compared to GroupS (Table 6). The PAI values were statistically significantly worse for the aperiodic vibrations (p = 0.009). This may be an indication that aperiodic oscillations are related to voice pathologies [58,64]. The ASQ values drop for GGI1,2 from 0.60 to 0.49 and then increases to 0.52 for GGI23. In comparison, in Döllinger et al. [39], the ASQ values decreased continuously from 0.62 to 0.59 with a glottal gap increase from GGI1 to GGI3. Unsurprisingly, there are statistically significant differences in the CQ parameter between the GGI groups, because GGI1 has a complete glottis closure and GGI2,3 show no or only partial glottis closure. The difference in the CQ is also statistically significant between the two vibrational characteristics (GroupS and GroupA). Regarding the GAW data, it is further noticeable that the parameters ASQ, CQ and SQ show strong fluctuations in the mean values of the three GGI groups in comparison to normal data (Table 7). These fluctuations without obvious trends lead to the assumption that these GAW parameters do not seem appropriate to indicate disordered voice.
4.2.2. Aerodynamic Parameters
The energy transfer flow-tissue (RB) between the two vibrational characteristics (GroupS,A) increased by 12% to 12363 (Pa s−1). In Döllinger et al. [39] RB decreased continuously for increasing glottal gap (GGI1-3) by almost 50%. In contrast, here for GGI1-2 RB decreased by 30% and then increased by 5% for GGI2-3. Compared to the data of Döllinger et al. [39] the value of RB for GGI3 in this study was 42% higher; see Table 7. For GGI1-2 our findings support other studies that higher energy transfer RB from the glottal flow to vocal folds yields an improved acoustic quality (i.e., higher CPP values) [39,60,65]. Surprisingly, we found contrary results for GGI2-3, that RB was increasing/improving while CPP decreased/deteriorated even further; see Table 5. This discrepancy could be due to the fact that none of the other studies were investigating data containing solely disordered oscillations. Further investigations on disordered dynamics are required.
The aerodynamic parameter SPL (dB) differs statistically significantly between the GGI groups, as well as between the two vibrational characteristics (GroupS, GroupA); see Table 4. The intensity of the voice signal SPL (dB) decreases from GroupS to GroupA by 16% from 78.1 dB to 65.9 dB, indicating that aperiodic oscillations (GroupA) are worse compared to subharmonic oscillations (GroupS) in terms of efficiency/loudness of voice production.
SPL also decreases from GGI1 to GGI3 by 22% from 76.7 to 59.5 dB (Table 5). This deterioration of SPL from GGI1 to GGI3 is stronger than for normal oscillations [39] from 79.1 to 69.4 (−12%). The general decrease in the intensity of the acoustic signal (SPL) may be explained by the fact that with a higher glottal gap index there is a limited oscillatory mobility of the vocal folds as well as an inadequate glottis closure and therefore sound is produced less efficiently.
It is noteworthy that for GGI3 60% higher subglottal pressure than in Döllinger et al. [39] was needed to produce the intensity of 59.5 dB, which is 14% lower than in the data of Döllinger et al. [39]; see Table 7.
Various studies investigated the influence of high subglottal pressure and irregular oscillation (i.e., disordered; GroupS and GroupA) in voice and rough phonation [28,66,67]. We cannot confirm these findings as the overall subglottal pressure regarding the irregular oscillations (GroupS and GroupA) here (Table 2) is 8% lower (1324 Pa) than that of the normal oscillations of Döllinger et al. [39] (1436 Pa).
However, we did find an interrelationship between high subglottal pressure and glottal closure insufficiency. The subglottal pressure PS for GGI1,2 was relatively similar comparing both studies while PS for GGI3 in this study was 60% higher compared to the normal oscillation data [39]; see Table 7. We assume that the combination of strong glottis closure insufficiency (GGI3) and disordered voices could be directly linked to high subglottal pressure. Unfortunately, previous studies on phonation data [28,66,67] did not examine the glottis closure insufficiency (i.e., GGI), which is why further investigations are needed to verify this assumption. Since the subglottal pressure data cannot be easily measured in humans, more ex vivo studies are hereby necessary.
4.2.3. Harmonic Measures
The parameter CPP is currently one of the most applied parameters for characterizing dysphonia [68–73]. It is a measure of the voice quality that reflects the degree of occurring harmonics in a voice sample [57,74]. In addition, due to its calculation method, it is suitable for application to aperiodic signals, which is why it is of particular importance in the present study [75].
Just like in Döllinger et al. [39] the values of CPPA and CPPP decreased from GGI1-3, and thus became worse (Tables 5 and 7), as the CPP describes the quality of the voice signal [57,68,74,76,77]. Compared to Döllinger et al. [39] it is noticeable that the CPP values for the disordered GGI1,2 are about 30% below the normal CPP values for GGI1,2, while the disordered CPP value for GGI3 is about 45% below the normal CPP values for GGI3; see Table 7. Meaning that these deteriorations of CPP for GGI1-3 are considerably stronger for the disordered oscillations. These findings apply both to CPPA and CPPP. Deteriorations of CPPA and CPPP were also found from GroupS to GroupA, indicating again that aperiodic oscillations (GroupA) are worse for the acoustic quality compared to subharmonic oscillations (GroupS). This deterioration was however not as strong as for GGI1-3 (Tables 5 and 6).
As Table 8 shows, GGI3 is mainly composed of GroupA. So, the poor CPPi values for GGI3 may be based on this over proportional composition. However, since the mean CPPA (15.4 dB) and CPPP (16.0 dB) values of GroupA (Table 6) are above the CPPA (11.0 dB) and CPPP (14.0 dB) values of GGI3 (Table 5), it is the combination of GGI3 and irregularity that seems to be crucial for these poor CPP scores (i.e., reduced harmonics in the acoustic and subglottal signals).
Table 8.
Contingency table with the frequencies of GGII1-3 and GroupS,A.
GGI1 | GGI2 | GGI3 | Σ | |
---|---|---|---|---|
GroupS | 13 | 21 | 1 | 35 |
GroupA | 3 | 8 | 5 | 16 |
Σ | 16 | 29 | 6 | 51 |
4.3. Summary
Only five of the 13 investigated parameters show the same tendencies for both GroupS to GroupA and GGI1 to GGI3; see Tables 5 and 6. With increasing glottis closure insufficiency (i.e., GGI1-3) nine of the 13 investigated phonatory parameters show similar tendencies compared to the periodic data of Döllinger et al. [39]. The worst mean values of the 4 parameters (ALR = 4.9, SPL = 59.5, CPPA = 11.0, CPPP = 14.0) that are directly linked to voice pathologies, voice disorders and dysphonia such as breathiness and roughness of voice [57,68,74,76,77] were found for GGI3. Therefore, we can confirm the findings of Döllinger et al. [39] that the more complete the glottis closure during phonation, the better the acoustic output.
Table 7 shows the percentage change of the (disordered) phonatory parameters compared to (normal/non-disordered) Döllinger et al. [39]. Although GAW and aerodynamic parameters differed compared to [39], no obvious tendency was recognizable. A clear change was observable in the harmonic measures expressed by CPPA and CPPP. Both the quality of the subglottal pressure signal (CPPP) and of the audio signal (CPPA) were noticeably worse compared to [39]. This deterioration (decrease of CPPA,P) became stronger with increasing glottis closure insufficiency (i.e., GGI1-3), see Table 7.
5. Shortcomings
A limiting factor could be the length of time the larynges were frozen. The freezing, storing and thawing was performed according to the study of Chan and Titze [42]. Chan and Titze found that the vocal fold mucosa did not seem to change significantly after 24 h of storage in saline solution at room temperature, nor after one month of frozen storage following quick freezing. Therefore, their findings support the feasibility of using quick freezing to preserve laryngeal tissues for excised larynx experiments, as we performed. However, our larynges were frozen for a maximum of 13 months while the larynges of Chan and Titze were frozen for only one month. This longer time could have an effect on the tissue properties; however, there are no references to this assumption in the literature.
6. Conclusions
Since severe deterioration of the subglottal (CPPP) and audio (CPPA) signal quality occurred regardless which of the two irregular vibration types (subharmonic and aperiodic) were present, we suggest that these two parameters should be considered in medical diagnosis and treatment, alongside with glottal closure characteristics.
As Table 3 shows, disordered oscillations seemed to occur more often under high airflow levels (43%) and high elongation levels (55%). This may suggest that high or over stimulation of laryngeal parameters facilitates disordered phonatory behavior.
To further confirm these assumptions, we suggest further studies investigating disordered laryngeal dynamics with glottis closure insufficiency at different stimulation levels and their impact on the phonatory process, using ex vivo larynx experiments.
Acknowledgments:
The present work was performed in fulfillment of the requirements for obtaining the degree “Dr. med.” at the Friedrich-Alexander-University Erlangen-Nürnberg. We acknowledge support by Deutsche Forschungsgemeinschaft and Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU) within the funding program Open Access Publishing.
Funding: This study was supported by the Deutsche Krebshilfe (http://www.krebshilfe.de/nc/startseite.html) under grant no. 111332. Dr. Berry’s work on this project was funded by the National Institutes of Health/National Institute on Deafness and Other Communication Disorders (NIH/NIDCD) Grant No. R01 DC013323.
Footnotes
Conflicts of Interest: The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, and in the decision to publish the results.
References
- 1.Liu K; Ge P; Sheng X; Jiang J; Qin H Survival in Vivo Canine Phonation Model Without Stimulation. Ann. Otol. Rhinol. Laryngol 2018,127,178–184. [DOI] [PubMed] [Google Scholar]
- 2.Döllinger M; Kniesburges S; Kaltenbacher M; Echternach M Current methods for modelling voice production. HNO 2016, 64, 82–90. [DOI] [PubMed] [Google Scholar]
- 3.Döllinger M The Next Step in Voice Assessment: High-Speed Digital Endoscopy and Objective Evaluation. Curr. Bioinform 2009, 4,101–111. [Google Scholar]
- 4.Ziethe A; Patel R; Kunduk M; Eysholdt U; Graf S Clinical Analysis Methods of Voice Disorders. Curr. Bioinform 2011, 6, 270–285. [Google Scholar]
- 5.Patel R; Dubrovskiy D; Döllinger M Characterizing vibratory kinematics in children and adults with high-speed digital imaging. J. Speech Lang. Hear. Res 2014, 57, S674–S686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Mau T; Muhlestein J; Callahan S; Weinheimer KT; Chan RW Phonation threshold pressure and flow in excised human larynges. Laryngoscope 2011,121,1743–1751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Mills RD; Dodd K; Ablavsky A; Devine E; Jiang JJ Parameters from the Complete Phonatory Range of an Excised Rabbit Larynx. J. Voice 2017, 31, 517.e9–517.e17. [DOI] [PubMed] [Google Scholar]
- 8.Xue C; Pulvermacher A; Calawerts W; Devine E; Jiang J An Excised Canine Model of Anterior Glottic Web and Its Acoustic, Aerodynamic, and High-speed Measurements. J. Voice 2017, 31, 246.e21–246.e32. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Farnsworth DW High-speed motion pictures of the human vocal cords. Bell Lab. Rec 1940,18, 203–208. [Google Scholar]
- 10.Patel R; Dailey S; Bless D Comparison of high-speed digital imaging with stroboscopy for laryngeal imaging of glottal disorders. Ann. Otol. Rhinol. Laryngol 2008,117,413–424. [DOI] [PubMed] [Google Scholar]
- 11.Krausert CR; Olszewski AE; Taylor LN; McMurray JS; Dailey SH; Jiang JJ Mucosal wave measurement and visualization techniques. J. Voice 2011,25, 395–405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Patel RR; Liu L; Galatsanos N; Bless DM Differential vibratory characteristics of adductor spasmodic dysphonia and muscle tension dysphonia on high-speed digital imaging. Ann. Otol. Rhinol. Laryngol 2011, 120, 21–32. [DOI] [PubMed] [Google Scholar]
- 13.Naghibolhosseini M; Deliyski DD; Zacharias SRC; de Alarcon A; Orlikoff RF Temporal Segmentation for Laryngeal High-Speed Videoendoscopy in Connected Speech. J. Voice 2018,32, 256.e1–256.e12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Zacharias SRC; Deliyski DD; Gerlach TT Utility of Laryngeal High-speed Videoendoscopy in Clinical Voice Assessment. J. Voice 2018, 32, 216–220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Rasp O; Lohscheller J; Döllinger M; Eysholdt U; Hoppe U The pitch rise paradigm: A new task for real-time endoscopy of non-stationary phonation. Folia Phoniatr. Logop 2006, 58, 175–185. [DOI] [PubMed] [Google Scholar]
- 16.Semmler M; Kniesburges S; Parchent J; JakubaR B; Zimmermann M; Bohr C; Schutzenberger A; Döllinger M Endoscopic Laser-Based 3D Imaging for Functional Voice Diagnostics. Appl. Sci 2017, 7, 600. [Google Scholar]
- 17.Deliyski DD; Powell ME; Zacharias SR; Gerlach TT; de Alarcon A Experimental Investigation on Minimum Frame Rate Requirements of High-Speed Videoendoscopy for Clinical Voice Assessment. Biomed. Signal Process. Control 2015,17, 21–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Döllinger M; Lohscheller J; McWhorter A; Kunduk M Variability of normal vocal fold dynamics for different vocal loading in one healthy subject investigated by phonovibrograms. J. Voice 2009, 23,175–181. [DOI] [PubMed] [Google Scholar]
- 19.Ahmad K; Yan Y; Bless DM Vocal fold vibratory characteristics in normal female speakers from high-speed digital imaging. J. Voice 2012, 26, 239–253. [DOI] [PubMed] [Google Scholar]
- 20.Yamauchi A; Imagawa H; Yokonishi H; Nito T; Yamasoba T; Goto T; Takano S; Sakakibara K; Tayama N Evaluation of vocal fold vibration with an assessment form for high-speed digital imaging: Comparative study between healthy young and elderly subjects. J. Voice 2012, 26, 742–750. [DOI] [PubMed] [Google Scholar]
- 21.Yan Y; Damrose E; Bless D Functional analysis of voice using simultaneous high-speed imaging and acoustic recordings. J. Voice 2007, 21, 604–616. [DOI] [PubMed] [Google Scholar]
- 22.Kosztyla-Hojna B; Moskal D; Falkowski D; Lobaczuk-Sitnik A; Kraszewska A; Skorupa M; Kita J; Kuryliszyn-Moskal A High-speed digital imaging in the diagnosis of voice pathologies. Pol. Merkur. Lek. Organ Pol. Tow. Lek 2016, 41,26–31. [PubMed] [Google Scholar]
- 23.Voigt D; Döllinger M; Yang A; Eysholdt U; Lohscheller J Automatic diagnosis of vocal fold paresis by employing Phonovibrogram features and machine learning methods. Comput. Methods Programs Biomed 2010, 99, 275–288. [DOI] [PubMed] [Google Scholar]
- 24.Titze IR Workshop on Acoustic Voice Production: Summary Statement; National Center for Voice and Speech: Iowa City, IA, USA, 1995. [Google Scholar]
- 25.Mehta DD; Deliyski DD; Quatieri TF; Hillman RE Automated measurement of vocal fold vibratory asymmetry from high-speed videoendoscopy recordings. J. Speech Lang. Hear. Res 2011,54,47–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Deliyski DD; Petrushev PP; Bonilha HS; Gerlach TT; Martin-Harris B; Hillman RE Clinical implementation of laryngeal high-speed videoendoscopy: Challenges and evolution. Folia Phoniatr. Logop 2008, 60, 33–44. [DOI] [PubMed] [Google Scholar]
- 27.Zhang Y; Jiang JJ; Wallace SM; Zhou L Comparison of nonlinear dynamic methods and perturbation methods for voice analysis. J. Acoust. Soc. Am 2005,118,2551–2560. [DOI] [PubMed] [Google Scholar]
- 28.Jiang JJ; Zhang Y; Ford CN Nonlinear dynamics of phonations in excised larynx experiments. J. Acoust. Soc. Am 2003,114, 2198–2205. [DOI] [PubMed] [Google Scholar]
- 29.Behrman A; Agresti CJ; Blumstein E; Lee N Microphone and electroglottographic data from dysphonic patients: Type 1, 2 and 3 signals. J. Voice 1998,12, 249–260. [DOI] [PubMed] [Google Scholar]
- 30.Döllinger M; Kobler J; Berry DA; Mehta DD; Luegmair G; Bohr C Experiments on Analysing Voice Production: Excised (Human, Animal) and In Vivo (Animal) Approaches. Curr. Bioinform 2011, 6, 286–304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Titze IR Human Speech: A Restricted Use of the Mammalian Larynx. J. Voice 2017, 31,135–141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Herbst CT; Lohscheller J; Svec JG; Henrich N; Weissengruber G; Fitch WT Glottal opening and closing events investigated by electroglottography and super-high-speed video recordings. J. Exp. Biol 2014, 217 Pt 6, 955–963. [DOI] [PubMed] [Google Scholar]
- 33.Ling C; Li Q; Brown ME; Kishimoto Y; Toya Y; Devine EE; Choi K-O; Nishimoto K; Norman IG; Tsegyal T; et al. Bioengineered vocal fold mucosa for voice restoration. Sci. Transl. Med 2015, 7, 314ra187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bohr C; Döllinger M; Kniesburges S; Traxdorf M 3D visualization and analysis of vocal fold dynamics. HNO 2016, 64, 254–261. [DOI] [PubMed] [Google Scholar]
- 35.Luegmair G; Kniesburges S; Zimmermann M; Sutor A; Eysholdt U; Döllinger M Optical Reconstruction of High-Speed Surface Dynamics in an Uncontrollable Environment. IEEE Trans. Med. Imaging 2010, 29, 1979–1991. [DOI] [PubMed] [Google Scholar]
- 36.Thibeault SL; Gray SD; Bless DM; Chan RW; Ford CN Histologic and rheologic characterization of vocal fold scarring. J. Voice 2002,16, 96–104. [DOI] [PubMed] [Google Scholar]
- 37.Hertegard S; Larsson H; Nagubothu SS; Tolf A; Svensson B Elasticity measurements in scarred rabbit vocal folds using air pulse stimulation. Logop. Phoniatr. Vocol 2009, 34,190–195. [DOI] [PubMed] [Google Scholar]
- 38.Maytag AL; Robitaille MJ; Rieves AL; Madsen J; Smith BL; Jiang JJ Use of the rabbit larynx in an excised larynx setup. J. Voice 2013, 27, 24–28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Döllinger M; Kniesburges S; Berry DA; Birk V; Wendler O; Durr S; Alexiou C; Schutzenberger A Investigation of phonatory characteristics using ex vivo rabbit larynges. J. Acoust. Soc. Am 2018,144,142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Schneider-Stickler B; Gaechter J; Bigenzahn W Long-term results after external vocal fold medialization thyroplasty with titanium vocal fold medialization implant (TVFMI). Eur. Arch. Otorhinolaryngol 2013,270, 1689–1694. [DOI] [PubMed] [Google Scholar]
- 41.Giraldez-Rodriguez LA; Johns M 3rd. Glottal insufficiency with aspiration risk in dysphagia. Otolaryngol. Clin. N. Am 2013, 46,1113–1121. [DOI] [PubMed] [Google Scholar]
- 42.Chan RW; Titze IR Effect of postmortem changes and freezing on the viscoelastic properties of vocal fold tissues. Ann. Biomed. Eng 2003, 31,482–491. [DOI] [PubMed] [Google Scholar]
- 43.Birk V; Döllinger M; Sutor A; Berry DA; Gedeon D; Traxdorf M; Wendler O; Bohr C; Kniesburges S Automated setup for ex vivo larynx experiments. J. Acoust. Soc. Am 2017,141,1349. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Chen G; Kreiman J; Gerratt BR; Neubauer J; Shue YL; Alwan A Development of a glottal area index that integrates glottal gap size and open quotient. J. Acoust. Soc. Am 2013, 133, 1656–1666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Dippold S; Voigt D; Richter B; Echternach M High-Speed Imaging Analysis of Register Transitions in Classically and Jazz-Trained Male Voices. Folia Phoniatr. Logop 2015, 67, 21–28. [DOI] [PubMed] [Google Scholar]
- 46.Patel RR; Walker R; Sivasankar PM Spatiotemporal Quantification of Vocal Fold Vibration after Exposure to Superficial Laryngeal Dehydration: A Preliminary Study. J. Voice 2016, 30, 427–433. [DOI] [PubMed] [Google Scholar]
- 47.Schützenberger A; Kunduk M; Döllinger M; Alexiou C; Dubrovskiy D; Semmler M; Seger A; Bohr C Laryngeal High-Speed Videoendoscopy: Sensitivity of Objective Parameters towards Recording Frame Rate. BioMed Res. Int 2016, 2016,4575437. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Schlegel P; Semmler M; Kunduk M; Döllinger M; Bohr C; Schutzenberger A Influence of Analyzed Sequence Length on Parameters in Laryngeal High-Speed Videoendoscopy. Appl. Sci 2018, 8, 2666. [Google Scholar]
- 49.Titze IR Principles of Voice Production; Prentice Hall: Englewood Cliff, NJ, USA, 1994. [Google Scholar]
- 50.Munhall KG; Ostry DJ; Parush A Characteristics of velocity profiles of speech movements. J. Exp. Psychol. Hum. Percept. Perform 1985,11,457–474. [DOI] [PubMed] [Google Scholar]
- 51.Henrich N; Sundin G; Ambroise D; d’Alessandro C; Castellengo M; Doval B Just noticeable differences of open quotient and asymmetry coefficient in singing voice. J. Voice 2003,17, 481–494. [DOI] [PubMed] [Google Scholar]
- 52.Holmberg EB; Hillman RE; Perkell JS Glottal airflow and transglottal air pressure measurements for male and female speakers in soft, normal, and loud voice. J. Acoust. Soc. Am 1988, 84, 511–529. [DOI] [PubMed] [Google Scholar]
- 53.Baken RJ; Orlikoff RF Clinical Measurement of Speech and Voice, 2nd ed.; Cengage Learning: Clifton Park, NY, USA, 1985. [Google Scholar]
- 54.Wang SG; Park HJ; Lee BJ; Lee SM; Ko B; Lee SM; Park YM A new videokymography system for evaluation of the vibration pattern of entire vocal folds. Auris Nasus Larynx 2016, 43, 315–321. [DOI] [PubMed] [Google Scholar]
- 55.Qiu Q; Schutte HK; Gu L; Yu Q An automatic method to quantify the vibration properties of human vocal folds via videokymography. Folia Phoniatr. Logop 2003, 55,128–136. [DOI] [PubMed] [Google Scholar]
- 56.Van den Berg JW; Zantema JT; Doornenbal P Jr. On the Air Resistance and the Bernoulli Effect of the Human Larynx. J. Acoust. Soc. Am 1957,29,626–631. [Google Scholar]
- 57.Hillenbrand J; Cleveland RA; Erickson RL Acoustic Correlates of Breathy Vocal Quality. J. Speech Lang. Hear. Res 1994, 37, 769–778. [DOI] [PubMed] [Google Scholar]
- 58.Yamauchi A; Yokonishi H; Imagawa H; Sakakibara KI; Nito T; Tayama N Quantitative Analysis of Vocal Fold Vibration in Vocal Fold Paralysis with the Use of High-speed Digital Imaging. J. Voice 2016, 30, 766.el3–766.e22. [DOI] [PubMed] [Google Scholar]
- 59.Wolfe V; Fitch J; Cornell R Acoustic Prediction of Severity in Commonly Occurring Voice Problems. J. Speech Lang. Hear. Res 1995, 38, 273–279. [DOI] [PubMed] [Google Scholar]
- 60.Birk V; Kniesburges S; Semmler M; Berry DA; Bohr C; Döllinger M; Schutzenberger A Influence of glottal closure on the phonatory process in ex vivo porcine larynges. J. Acoust. Soc. Am 2017, 142, 2197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Bielamowicz S; Kreiman J; Gerratt BR; Dauer MS; Berke GS Comparison of voice analysis systems for perturbation measurement. J. Speech Lang. Hear. Res 1996, 39,126–134. [DOI] [PubMed] [Google Scholar]
- 62.Yumoto E; Gould WJ; Baer T Harmonics-to-noise ratio as an index of the degree of hoarseness. J. Acoust. Soc. Am 1982, 71,1544–1549. [DOI] [PubMed] [Google Scholar]
- 63.Davenport PB; Carter KF; Echternach JM; Tuck CR Integrating High-Reliability Principles to Transform Access and Throughput by Creating a Centralized Operations Center. J. Nurs. Adm 2018, 48, 93–99. [DOI] [PubMed] [Google Scholar]
- 64.Samlan RA; Story BH Influence of Left-Right Asymmetries on Voice Quality in Simulated Paramedian Vocal Fold Paralysis. J. Speech Lang. Hear. Res 2017, 60, 306–321. [DOI] [PubMed] [Google Scholar]
- 65.Rosenthal AL; Lowell SY; Colton RH Aerodynamic and acoustic features of vocal effort. J. Voice 2014, 28,144–153. [DOI] [PubMed] [Google Scholar]
- 66.Jiang JJ; Zhang Y; McGilligan C Chaos in voice, from modeling to measurement. J. Voice 2006, 20, 2–17. [DOI] [PubMed] [Google Scholar]
- 67.Jiang JJ; Titze IR A methodological study of hemilaryngeal phonation. Laryngoscope 1993,103, 872–882. [DOI] [PubMed] [Google Scholar]
- 68.Heman-Ackah YD; Michael DD; Goding GS Jr. The relationship between cepstral peak prominence and selected parameters of dysphonia. J. Voice 2002,16, 20–27. [DOI] [PubMed] [Google Scholar]
- 69.Heman-Ackah YD; Michael DD; Baroody MM; Ostrowski R; Hillenbrand J; Heuer RJ; Horman M; Sataloff RT Cepstral peak prominence: A more reliable measure of dysphonia. Ann. Otol. Rhinol. Laryngol 2003,112,324–333. [DOI] [PubMed] [Google Scholar]
- 70.Halberstam B Acoustic and perceptual parameters relating to connected speech are more reliable measures of hoarseness than parameters relating to sustained vowels. ORL 2004, 66, 70–73. [DOI] [PubMed] [Google Scholar]
- 71.Eadie TL; Baylor CR The effect of perceptual training on inexperienced listeners’ judgments of dysphonic voice. J. Voice 2006, 20, 527–544. [DOI] [PubMed] [Google Scholar]
- 72.Maryn Y; Dick C; Vandenbruaene C; Vauterin T; Jacobs T Spectral, cepstral, and multivariate exploration of tracheoesophageal voice quality in continuous speech and sustained vowels. Laryngoscope 2009, 119, 2384–2394. [DOI] [PubMed] [Google Scholar]
- 73.Balasubramanium RK; Bhat JS; Fahim S 3rd; Raju R 3rd. Cepstral analysis of voice in unilateral adductor vocal fold palsy. J. Voice 2011, 25, 326–329. [DOI] [PubMed] [Google Scholar]
- 74.Hillenbrand J; Houde RA Acoustic correlates of breathy vocal quality: Dysphonic voices and continuous speech. J. Speech Lang. Hear. Res 1996, 39, 311–321. [DOI] [PubMed] [Google Scholar]
- 75.Heman-Ackah YD; Sataloff RT; Laureyns G; Lurie D; Michael DD; Heuer R; Rubin A; Eller R; Chandran S; Abaza M; et al. Quantifying the cepstral peak prominence, a measure of dysphonia. J. Voice 2014,28, 783–788. [DOI] [PubMed] [Google Scholar]
- 76.Samlan RA; Kunduk M; Ikuma T; Black M; Lane C Vocal Fold Vibration in Older Adults with and without Age-Related Dysphonia. Am. J. Speech-Lang. Pathol 2018, 27,1039–1050. [DOI] [PubMed] [Google Scholar]
- 77.Awan SN; Roy N; Dromey C Estimating dysphonia severity in continuous speech: Application of a multi-parameter spectral/cepstral model. Clin. Linguist. Phon 2009, 23,825–841. [DOI] [PubMed] [Google Scholar]