Abstract
Quantitative analysis of phonatory characteristics of rabbits has been widely neglected. However, preliminary studies established the rabbit larynx as a potential model of human phonation. This study reports quantitative data on phonation using ex vivo rabbit larynx models to achieve more insight into dependencies of three main components of the phonation process, including airflow, vocal fold dynamics, and the acoustic output. Sustained phonation was induced in 11 ex vivo rabbit larynges. For 414 phonatory conditions, vocal fold vibrations, acoustic, and aerodynamic parameters were analyzed as functions of longitudinal vocal fold pre-stress, applied air flow, and glottal closure insufficiency. Dimensions of the vocal folds were measured and histological data were analyzed. Glottal closure characteristics improved for increasing longitudinal pre-stress and applied airflow. For the subglottal pressure signal only the cepstral peak prominence showed dependency on glottal closure. In contrast, vibrational, acoustic, and aerodynamic parameters were found to be highly dependent on the degree of glottal closure: The more complete the glottal closure during phonation, the better the aerodynamic and acoustic characteristics. Hence, complete or at least partial glottal closure appears to enhance acoustic signal quality. Finally, results validate the ex vivo rabbit larynx as an effective model for analyzing the phonatory process.
I. INTRODUCTION
Investigation of the oral communication process in mammals and humans is within the scope of various scientific fields and motivated by a variety of different reasons. Studies can basically be separated into two categories. For the first group, vocal fold physiology is the primary scientific interest. The second group focuses on the progress and development of medical treatments to facilitate phonation. The principles of voice production or phonation are similar for many mammals and humans (Titze, 2017). An overview of principal vocalization mechanisms in mammals can be found in Elemans et al. (2015). The primary acoustic signal is generated in the larynx by two opposing vocal folds. During vocalization or phonation, vocal fold vibrations are induced by airflow coming from the lungs. During vibration, the vocal folds open and close, producing the primary acoustic signal. This signal is modulated in the vocal tract and then emitted from the mouth or nose. For mammals, investigations of the oscillation frequencies of the vocal folds have exhibited a wide range between 15 Hz for an elephant (Herbst et al., 2013) and 2100 Hz for elk calls (Titze and Riede, 2010). Even higher vocal fold oscillation frequencies were found for other mammals, as reported in Titze et al. (2016). For humans, dependent on age and gender, the fundamental frequency typically ranges between 100 Hz and 350 Hz for normal phonation (Patel et al., 2014). However, during singing, vocal fold vibration frequencies of up to 1568 Hz were reported (Echternach et al., 2013).
Different mammalian species have been investigated when studying the physiology of the voice source (i.e., oscillating vocal folds). The largest mammals considered were elephants (Herbst et al., 2012), elk (Riede and Titze, 2008), bovines (Regner et al., 2010), tigers (Titze et al., 2010), and lions (Klemuk et al., 2011). Mammalian species exhibiting similar laryngeal dimensions to humans were ovines (Alipour and Jaiswal, 2009), porcines (Bohr et al., 2016), and canines (Herbst et al., 2014; Ling et al., 2015). These latter models were studied as potential models of human vocal fold vibration, and their vocal fold dynamics were compared with human laryngeal dynamics (Döllinger et al., 2011). For smaller larynges, dynamical studies on rat (Welham et al., 2009) and rabbit (Novaleski et al., 2016) models have also been reported.
The rabbit larynx model seems specifically suitable for mimicking human voice production as the histology of the rabbit lamina propria (Thibeault et al., 2002) and the layer structure (Hertegard et al., 2003) were found to be similar to those of humans. However, to date, experiments analyzing the phonatory process using rabbit larynges are limited. One study focused on establishing the ex vivo, and another the in vivo, rabbit model as a feasible model for phonatory analysis (Maytag et al., 2013; Ge et al., 2009). Awan et al. (2014) analyzed in vivo nonlinear dynamic phenomena by varying flow rate and adduction levels. However, only one ex vivo study performed detailed parameter studies (Mills et al., 2017). They determined aerodynamic parameters [airflow, subglottal pressure (PS), sound pressure level (SPL)], the fundamental frequency (f0), and the vibrational amplitude as a function of vocal fold elongation. They showed that at phonation onset PS, SPL, and f0 increase as a function of elongation. Further studies were medically oriented and focused on surgical outcome (Kojima et al., 2014; Novaleski et al., 2016) and scarring (Hertegard et al., 2009; Mau et al., 2014). However, despite these previous studies on rabbit phonation, an in-depth, quantitative analysis reporting vocal fold vibratory parameters, the time-resolved acoustic and subglottal pressure signals, has not been performed yet. Hence, thus far, the correlation of vibratory characteristics with aerodynamic and acoustic parameters has largely been neglected for the rabbit model. To overcome this shortcoming, this study has the following two goals:
-
(1)
To present quantitative data and parameters computed from high-speed imaging (HSI) data, and time-resolved subglottal pressure and acoustic signals. A wide range of phonatory conditions is applied to investigate the phonatory range of rabbit larynges.
-
(2)
To identify and quantify dependencies between glottal closure insufficiency and phonatory parameters computed from the dynamic, acoustic, and pressure signals. The motivation and hypothesis for analyzing such dependencies is that glottis closure insufficiency is thought to be accountable for a variety of swallowing and breathing problems, as well as for breathy and weak voice (Schneider-Stickler et al., 2013; Giraldez-Rodriguez and Johns, 2013). When pathological glottis closure insufficiency is detected in humans, it is always the goal to restore glottis closure through vocal fold medialization by applying material injection (e.g., calcium hydroxyapatite) or material implantation (e.g., silicone blocks); see, e.g., Almohizea et al. (2016) and Dumberger et al. (2017). Despite the known negative effect of glottis closure insufficiency on the phonatory process, a quantitative analysis based on detailed parameter extraction is still missing.
II. MATERIALS AND METHODS
A. Specimen
Altogether 12 rabbit ex vivo larynges (New Zealand White female rabbits, 4–5 kg weight, age 14–118 weeks) were available for investigation. The larynges were harvested right after the rabbits were sacrificed using Buprenorphin. Since the rabbits were primarily sacrificed for use in other scientific experiments (local ethical approval number 54-2532.1-54/12), no separate ethical approval was necessary. The larynges were dissected to expose the vocal folds, Fig. 1(A). The trachea was cut 3 cm below the larynx. Afterward the larynges were quick-frozen in liquid nitrogen and stored in a deep freezer at −80 °C, thus preserving the tissue characteristics (Chan and Titze, 2003). The night before the experiments, the larynges were slowly thawed in a refrigerator at 6 °C. One larynx showed tissue hemorrhage, and was therefore excluded from further experiments. The remaining 11 larynges showed no visible anatomical or other potential alterations.
B. Data acquisition
Figure 1 shows the experimental setup. The applied tools and data acquisition devices are explained in the following. The ex vivo larynges were mounted on an artificial, stainless steel, trachea with a diameter of 6 mm, dimensioned for the rabbit larynges. A hole was drilled in the artificial trachea for placement of the subglottal pressure sensor, 100 mm below the larynx. A custom-made support tool prevented the unintentional lateral displacement of the larynx. It consisted of a Polyvinylchloride (PVC) tube and three screws fixing the cricoid cartilages. The entire experimental setup is given in Fig. 1(B) and a close-up of the mounted larynx, as seen through the camera, is given in Fig. 1(A).
The time-resolved subglottal pressure was captured by a XCS-93-5PSISG pressure sensor (Kulite Semiconductor Products, Inc., Leonia, NJ), which was flush-mounted to the internal wall of the artificial trachea. The pressure sensor was driven by a PXIe-4330 bridge module (National Instruments, Austin, TX), offering a 24 bit resolution. The time-resolved acoustic pressure signal was captured by a 4189 1/2-inch free-field microphone (Brüel&Kjaer, 2850 Nærum, Denmark) mounted in coronal plane of the larynx with a 45° inclination toward the sagittal plane at a distance of 20 cm to the glottis. The microphone was driven by a Nexus 2690 microphone conditioning amplifier (Brüel&Kjaer, 2850 Nærum, Denmark). The amplified signal was captured by a PXIe-4492 dynamic signal acquisition module (National Instruments) with a 24 bit resolution. The vocal fold oscillations were recorded by a Phantom V2511 high-speed camera (Vision Research, Wayne, NJ) at 8000 fps and a spatial resolution of 768 × 768 pixel. To correlate the high-speed recordings with the acoustic and subglottal pressure signals, the camera state signals were captured by a PXIe-6356 multifunctional data acquisition module (National Instruments) with 16 bit resolution. The three National Instruments modules were integrated in a PXIe-1073 express (National Instruments). Both pressure signals and the camera state signals were synchronously sampled with a sampling rate of fs = 96 kHz. The measurement was started with the start trigger of the high-speed camera that simultaneously started the PXIe system. The whole setup was controlled by a standard personal computer (PC) via scripts using the software LabView (National Instruments) as described in Birk et al. (2017a).
C. Experimental procedure for phonation
To modify the degree of longitudinal tension in the vocal folds, and thereby induce different phonatory conditions, a suture with three different weights (w1 = 1 g, w2 = 2 g, w3 = 5 g) was attached anteriorly at the thyroid cartilage; see Fig. 1(A). The purpose of this suture was to simulate the action of the cricothyroid muscle by tilting the thyroid cartilage forward. Self-sustained vibrations were induced by an airflow generated and controlled by a 4000B digital power supply driving a MF1 mass flow controller (MKS Instruments, Andover, MA). The humidified (ConchaTherm Neptune, Teleflex, Morrisville, NC) and heated (37 °C) airflow passed through the artificial trachea and then through the glottis (i.e., the area between the two oscillating vocal folds).
The dynamic experiments were based on former ex vivo studies (Alipour and Jaiswal, 2009; Maytag et al., 2013) and were performed as follows: (1) A weight was attached to induce longitudinal tension. (2) To keep the larynx in position, three screws were tightened until they touched the larynx. (3) Two rods were symmetrically tightened posteriorly, simulating the adduction process, to bring the larynx in phonatory position, i.e., until the glottis was entirely closed. Then the phonatory threshold pressure (PTP) level was determined by manually and continuously increasing the air flow through the larynx until sustained phonation occurred (Birk et al., 2017b), i.e., PTP was determined as the subglottal pressure level at which the vocal folds initiated sustained vibration. From the PTP, the airflow was increased 14 times successively in steps of 0.5 lm−1 (8.33 mls−1). This was performed for each of the three weights wi yielding a total of 45 experimental runs of sustained phonation per larynx. For each run, 125 ms of sustained phonation were recorded by the high-speed camera. This corresponded to 37–112 oscillation cycles depending on the oscillation frequency of the vocal folds. For each run, 35 oscillation cycles were considered for further analysis. For the acoustic and subglottal pressure signals, an additional 375 ms were recorded, yielding 500 ms of sustained oscillation or 148–448 cycles, to achieve the recommended number of cycles (i.e., at least 100 cycles) for reliable acoustic analysis (Titze, 1995; Karnell et al., 1995). To eliminate potential environmental noise, the acoustic and subglottal pressure signals were low pass filtered with a cut off frequency of 5 kHz. Due to the extremely large storage space for the high-speed data, a recording time of 500 ms was not practicable. However, the 35 cycles considered in our study constitutes a much larger data sample than used in similar ex vivo quantitative HSI studies (Herbst et al., 2012; Luegmair et al., 2015).
D. Data analysis of phonatory experiments
Before extracting parameters from the vocal fold dynamics, image processing was performed. The vocal fold dynamics are represented by the opening and closing of the vocal folds. Hence, the glottal area between the vocal folds was segmented within the HSI footage yielding the glottal area waveform (GAW) in pixels. The GAW is the function of the glottal area (number of pixels) over time. Figure 2 shows the GAW behavior over a typical oscillation cycle of length T. When the glottis is entirely closed at time step t0, the glottal area contents contain zero pixels. During the opening process, the glottis, and therefore the number of pixels, increases until the maximum opening state is reached at time tM. Then the glottis closes and the GAW becomes smaller again. Glottal segmentation was performed with a software tool developed in-house, known as Glottis Analysis Tools (GAT); the GAT tool is available upon request. This software has proven its functionality in previous quantitative HSI studies and is also used by international colleagues (Chen et al., 2013; Dippold et al., 2015; Patel et al., 2016).
From the GAW signal, parameters were computed reflecting main components of the glottal dynamics, as shown in Table I: (A) glottal closure insufficiency, (B) tissue characteristics, (C) opening and closing behavior, and (D) dynamic left-right symmetry. For the acoustic and time-resolved subglottal pressure signal, parameters reflecting harmonic components and perturbation were computed, as shown in Table II. Within Tables I and II, references are given to show where the parameters were derived.
TABLE I.
Parameter (unit) and references | Abbreviation | Parameter description |
---|---|---|
(A) Glottis closure characteristic | ||
Glottis gap index (a.u.; Patel et al., 2014) | GGI | Minimum glottal area/maximum glottal area: |
Glottis entirely closed [0–0.01]; glottis partially closed] 0.01–0.4[; no contact of vocal fold [0.4–1] | ||
(B) Parameters on tissue characteristics | ||
Amplitude-to-length-ratio (a.u.; Titze, 1994) | ALR | Dynamic range of GAW (max - min)/glottis length: the larger the more deformable the vocal folds |
Stiffness (1/frames; Munhall et al., 1985) | Stiffness | Maximum absolute value of first derivative/dynamic range: the higher the value the stiffer the tissue |
(C) Parameters on opening and closing behavior | ||
Asymmetry Quotient (a.u.; Henrich et al., 2003) | ASQ | Speed quotient/(speed quotient + 1) |
Closing quotient (a.u.; Holmberg et al., 1988) | CQ | Closing time/cycle duration |
Open Quotient (a.u.; Baken and Orlikoff, 1999) | OQ | Glottis open time/cycle duration |
Maximum area declination rate (pixel/frames; Titze, 2006) | MADR | Absolute value of negative peak in the first derivative of GAW, i.e., maximum GAW closing velocity |
Speed quotient (a.u.; Baken and Orlikoff, 1999) | SQ | Opening time/closing time |
(D) Parameters on left-right symmetry | ||
Amplitude symmetry index (a.u.; Wang et al., 2016) | ASI | Spatial symmetry of GAW: rate between maximum left and right glottal area, the closer to 1 the more symmetric |
Phase asymmetry index (a.u.; Qiu et al., 2003) | PAI | Symmetry in time: deviation in time between left and right GAW amplitude; the closer to 0 the higher the symmetry |
TABLE II.
Parameter (unit) and references | Abbreviation | Description/formula |
---|---|---|
Perturbation measures | ||
Jitter (%; Bielamowicz et al., 1996) | JT | Time periodicity of the signal, the lower the better |
Shimmer (%;-(Bielamowicz et al., 1996) | SH | Amplitude periodicity of the signal, the lower the better |
Harmonics components | ||
Harmonic to noise ratio (dB; Yumoto et al., 1982) | HNR | Ratio between energies of harmonics-based signal and noise, the higher the better |
Cepstral peak prominence (dB; Hillenbrand et al., 1994) | CPP | Development of harmonics, the higher the better |
Furthermore, fundamental frequencies (f0) of vocal fold oscillation were computed on the basis of the HSI data (GAW). Applied airflow (lm−1) was measured. Averaged subglottal pressure (PS) was computed from the time-resolved pressure signal. SPL (dB) was computed on the basis of the acoustic signal. The laryngeal flow resistance RB (i.e., the ratio between the transglottal pressure difference and the mean glottal flow rate) was determined using the definition of van den Berg et al. (1957). In contrast to typical aerodynamic applications, a high flow resistance RB is desired in phonation, yielding a high energy transfer from the glottal flow to the vocal fold tissues (Döllinger et al., 2016).
E. Statistical analysis
Question 1.
Do the glottal gap characteristics change for increasing pre-tension levels wi and increasing applied airflow? The glottal gap occurrence during vibration was represented by the glottal gap index (GGI); see Table I. Analysis of the dependency of the GGI on elongation levels (w1, w2, w3) and applied airflow level was performed using a univariate general linear model approach: The dependent variable was GGI, and the 2 fixed effect factors were pre-tension (3 levels) and airflow (15 levels). The random effect factor contained the individual larynges (11 levels). The Levene-test was not significant [F(44,369) = 1.269, p = 0.126]; hence, homogeneity of GGI for the different groups was assumed. Tukey's technique for post hoc comparison was applied. The significance level of p = 0.05 was used.
Question 2.
Is there a direct influence of the dynamic glottal gap (GGI) on other phonatory parameters? The effect of the glottal gap or glottal closure insufficiency during phonation on glottal dynamics, aerodynamic, and acoustic parameters was computed as follows:
-
(1)
All HSI recordings were visually inspected and divided into three visually easy to differentiate groups by one examiner: vocal folds completely closed, vocal folds partially closed, and vocal folds do not collide during phonation. Computing the GGI values for these subjectively allocated groups yielded the following intervals for GGI: GGI1 ([0;0.01], entire closure during vibration), GGI2 (]0.01;0.4[, partial closure), GGI3 ([0.4;1], no visually recognizable contact of vocal folds). The visual inspection and group division was executed as reported before (Birk et al., 2017b). The only difference to that previous study was that, in the current work, only one group for partial glottal closure (GGI2) was used due to reduced sample size. Representative examples for the three GGI groups are given in three mp4 format movie clips (Mm. 1, Mm. 2, and Mm. 3).
-
(2)
To investigate how the parameters differ between the three main GGI groups, the following steps were performed. Shapiro-Wilk tests showed that none of the vibrational, aerodynamic, harmonic, and noise parameters were normally distributed. Hence, Kruskal-Wallis tests were performed followed by post hoc tests (Mann-Whitney-U-test) with Bonferroni adjusted significance level with p = 0.05/3 = 0.017.
All statistics were performed using IBM SPSS version 21 (IBM, Armonk, NY).
F. Histological analysis
To confirm the use of healthy vocal folds, histological analysis was performed after the phonation experiments. The larynges were fixed for 24 h in 4% buffered formaldehyde and treated by the standard procedure of alcoholic dehydration and paraffin embedding. Serial horizontal cross sections (5 μm) were performed using a microtome. Two larynges were ruined during the cutting procedure and could not be further analyzed. For the remaining nine larynges, the sections were alternately stained with both van Gieson for collagen distribution and Gomori for reticular fiber distribution (Mulisch and Welsch, 2010). Van Gieson staining was performed in accordance with the manufacturer's instructions (Roth, Karlsruhe, Germany).
The stained sections were examined with a digital BZ-9000 microscope (Keyence, Neu-Isenburg, Germany) with the software BZ-II-Analyzer. For semi-quantitative analysis, the intensity of collagen and reticular staining from three typical regions within the lamina propria bordering the epithelium (Fig. 3) was assessed through densitometry (three measurements on each of three different slices, size: 200 μm × 150 μm). We collected the data from the nine larynges and calculated the average values and standard deviation in arbitrary units (a.u.). The length of the vocal folds and thicknesses of the epithelium and lamina propria were measured as the average over nine measurements (i.e., three measurements on each of three slices). The length of the vocal fold was measured from the arytenoid cartilage up to the thyroid cartilage.
III. RESULTS
Altogether 414 (84%) of the 495 performed experimental runs could be analyzed. Experiments were excluded when the fundamental frequency could not be determined due to too high aperiodic vibrations (50 runs) and failure or insufficient accuracy of the image processing (31 runs). Hence, it must be noted that the presented results were restricted to periodic vocal fold oscillations, i.e., oscillations where the fundamental frequency was clearly defined, and could be determined reliably and consistently. An overview of the range of the fundamental phonatory parameters is given in Table III.
TABLE III.
F0 (Hz) | PS (Pa) | Flow (mls−1) | RB (Pa s l−1) | SPL (dB) | |
---|---|---|---|---|---|
Phonation onset | 494 ± 112 | 707 ± 346 | 61 ± 11 | 11967 ± 6293 | 64.8 ± 7.0 |
All recordings | 605 ± 105 | 1436 ± 687 | 117 ± 34 | 12365 ± 4845 | 76.3 ± 7.6 |
Minimum values | 315 | 123 | 42 | 1850 | 53.8 |
Maximum values | 858 | 3287 | 180 | 21852 | 91.7 |
Question 1 (Is GGI influenced by pre-stress and airflow?).
The estimated marginal means of the GGI separated for the two fixed effect factors, pre-stress and airflow, are given in Fig. 4. The statistics showed that pre-stress had significant influence on GGI [F(2,369) = 5.973, p = 0.009, partial η2 = 0.372]. GGI clearly decreased from low pre-stress w1 (blue) to w2 (green) and then only slightly decreased to w3 (red); see Fig. 4. Airflow also had a significant influence on GGI [F(14,369) = 6.203, p = 0.000, partial η2 = 0.389]. GGI decreased over all three pre-stress levels for increasing airflow. Further, there was a significant interaction between pre-stress and airflow regarding GGI [F(28,369) = 1.604, p = 0.034, partial η2 = 0.179]. The higher the flow, the less GGI changed for increasing pre-stress: The distance between the wi curves became smaller for increasing airflow. The effect sizes f (Cohen, 1988) were large (fpre-stress = 0.77) and medium (fflow = 0.68, finter = 0.47).
Question 2 (Does GGI influence phonatory parameters?).
The subdivision in the three GGI groups resulted in the following distribution of the experimental runs: GGI1 (N = 148), GGI2 (N = 216), GGI3 (N = 50). The means and standard deviations for all parameters, separated for the three GGI conditions, are given in Table IV.
TABLE IV.
Parameters | GGI1 0.00 ± 0.00 | GGI2 0.13 ± 0.11 | GGI3 0.53 ± 0.09 |
---|---|---|---|
Glottal dynamic parameters | |||
ALR (a.u.) | 15.2 ± 3.4 | 15.4 ± 3.9 | 7.8 ± 2.5 |
Stiffness (frames−1) | 0.34 ± 0.05 | 0.28 ± 0.06 | 0.26 ± 0.05 |
ASQ (a.u.) | 0.62 ± 0.07 | 0.61 ± 0.07 | 0.59 ± 0.05 |
CQ (a.u.) | 0.32 ± 0.08 | 0.38 ± 0.07 | 0.41 ± 0.05 |
OQ (a.u.) | 0.83 ± 0.10 | 0.99 ± 0.02 | 1.00 ± 0.00 |
MADR (px frames−1) | 520 ± 220 | 520 ± 219 | 249 ± 117 |
SQ (a.u.) | 1.73 ± 0.52 | 1.71 ± 0.57 | 1.52 ± 0.35 |
ASI (a.u.) | 0.79 ± 0.13 | 0.83 ± 0.11 | 0.73 ± 0.09 |
PAI (a.u.) | 0.13 ± 0.07 | 0.10 ± 0.06 | 0.13 ± 0.07 |
Aerodynamic parameters | |||
RB (Pa s l−1) | 15024 ± 3714 | 11683 ± 4791 | 7417 ± 2710 |
SPL (dB) | 79.1 ± 6.4 | 76.1 ± 7.1 | 69.4 ± 7.5 |
PS (Pa) | 1679 ± 646 | 1394 ± 691 | 897 ± 387 |
Acoustic signal: Harmonic and perturbation components | |||
JTA (%) | 2.7 ± 4.0 | 2.8 ± 3.6 | 5.7 ± 7.1 |
SHA (%) | 23.2 ± 27.0 | 30.4 ± 30.4 | 55.7 ± 42.5 |
HNRA (dB) | 15.6 ± 6.4 | 14.1 ± 6.7 | 10.8 ± 9.0 |
CPPA (dB) | 24.0 ± 4.8 | 22.8 ± 4.8 | 19.4 ± 4.9 |
Subglottal pressure signal: Harmonic and perturbation components | |||
JTP (%) | 2.0 ± 2.9 | 2.1 ± 3.1 | 2.1 ± 2.9 |
SHP (%) | 9.2 ± 14.9 | 10.0 ± 16.3 | 11.3 ± 17.9 |
HNRP (dB) | 20.4 ± 6.6 | 21.1 ± 6.1 | 20.1 ± 6.4 |
CPPP (dB) | 27.5 ± 3.9 | 26.4 ± 3.9 | 25.6 ± 3.2 |
The p-values of the statistics on GGI group differences for all parameters are given in Table V. All nine GAW parameters showed differences among the GGI groups. The most differences (eight) were found between complete glottis closure (GGI1) and no vocal fold contact (GGI3). The three main aerodynamic parameters (RB, SPL, PS) showed statistical significant differences between all three groups. In 9 out of 12 tests, the acoustic parameters showed significant differences between GGI groups. In contrast, the same parameters computed on the subglottal pressure signal only showed statistically significant differences for 2 out of 12 tests. These two statistically significant differences were both found for cepstral peak prominence (CPPP).
TABLE V.
Parameters | Post hoc tests (Mann-Whitney-U-test; p < 0.017) | Kruskal-Wallis (p < 0.05) | ||
---|---|---|---|---|
GGI1 vs GGI2 | GGI1 vs GGI3 | GGI2 vs GGI3 | ||
GAW parameters | ||||
ALR | 0.227 | 0.000 | 0.000 | 0.000 |
Stiffness | 0.000 | 0.000 | 0.004 | 0.000 |
ASQ | 0.301 | 0.009 | 0.074 | 0.044 |
CQ | 0.000 | 0.000 | 0.042 | 0.000 |
OQ | 0.000 | 0.000 | 0.001 | 0.000 |
MADR | 0.519 | 0.000 | 0.000 | 0.000 |
SQ | 0.242 | 0.006 | 0.078 | 0.034 |
ASI | 0.000 | 0.000 | 0.000 | 0.000 |
PAI | 0.000 | 0.939 | 0.002 | 0.000 |
Aerodynamic parameters | ||||
RB | 0.000 | 0.000 | 0.000 | 0.000 |
SPL | 0.000 | 0.000 | 0.000 | 0.000 |
PS | 0.000 | 0.000 | 0.000 | 0.000 |
Acoustic signal: Harmonic and perturbation components | ||||
JTA | 0.273 | 0.000 | 0.001 | 0.000 |
SHA | 0.012 | 0.000 | 0.000 | 0.000 |
HNRA | 0.021 | 0.000 | 0.027 | 0.001 |
CPPA | 0.007 | 0.000 | 0.000 | 0.000 |
Subglottal pressure signal: Harmonic and perturbation components | ||||
JTP | — | — | — | 0.335 |
SHP | — | — | — | 0.980 |
HNRP | — | — | — | 0.301 |
CPPP | 0.009 | 0.000 | 0.046 | 0.001 |
Histology.
Figure 3(A) shows a histological slice with the positions of the densitometric analysis; positions for length, depth, and epithelial thickness measurements are indicated [Fig. 3(B)]. Histological analysis of the microstructure of the vocal fold showed no lesions, scarring, or bleeding. Figures 3(C) and 3(D) show the reticular and collagen staining, respectively. Both stainings showed a loose network of stained fibers. The average level of collagen was 849 ± 250 a.u. and of reticular fibers was 1975 ± 1236 a.u. Both fiber types were homogeneously distributed for all measurements over all larynges. The mean epithelial layer thickness was measured at 8.1 μm ± 1.2 μm and the mean lamina propria thickness was 602 μm ± 90 μm. The mean vocal fold lengths were measured at 4111 μm ± 426 μm. Elastin was not detected in the upper layer of the lamina propria. Only in the deep layer close to the vocalis muscle was possible to detect elastin bundles of longitudinal aligned fibers.
IV. DISCUSSION
Basic phonatory relationships were analyzed by performing 11 ex vivo rabbit experiments. It was shown that the GGI can be reduced by increasing vocal fold longitudinal pre-tension and applied airflow, yielding improved aerodynamic characteristics (i.e., higher RB and SPL values) and an improved acoustical signal quality [i.e., increased harmonics-to-noise ratio (HNRA) and CPPA; decreased JTA and SHA].
A. Histology
Histologic analysis confirmed intact laryngeal tissue. Similar anatomical dimensions and similar distribution of two main extracellular matrix components (collagen and reticular fibers) were observed in all tested larynges. The measured thickness of the lamina propria was slightly thicker than previously described (Valerie et al., 2016), possibly depending on the age and sex of the animals, but also on the exact place of measurement position. The collagen and reticular fiber distribution looked similar to other investigations of the rabbit vocal fold extracellular matrix (Pitman et al., 2018; Rousseau et al., 2004).
However, the third main component, elastin, was not detected in the superficial region of the lamina propria; neither immunohistological nor Verhoeff staining exhibited any appearance of elastin. Small quantities of short, parallel elastin fibers in the longitudinal direction were found in the deeper region of lamina propria. This is in contrast with other publications in which elastin has been found in cross sections, also in the superficial layer of the vocal fold (Thibeault et al., 2002). One reason might be that we performed longitudinal cuts in which we may have missed the non-homogeneously distributed elastin fibers. Another reason might be that our vocal folds actually contained very little elastin, which could happen due to degradation as a result of reduced vocalization or due to the normal aging process (Roberts et al., 2011).
B. Fundamental phonatory parameters (Table III)
Mean PTP with 707 Pa ± 346 Pa (Table III) was found 57% lower than previously reported (1650 Pa ± 124 Pa) by Maytag et al. (2013). Accordingly, the averaged phonation threshold flow rate was also lower by 21% compared to their experimental data (76.8 mls−1 ± 6.8 mls−1). Mills et al. (2017) reported even lower phonation threshold flow rates down to 27.5 mls−1, which are only 45% of ours, whereas their PTPs (647 Pa–839 Pa) were similar to the mean PTP from our experiments.
Similarly, Novaleski et al. (2016) reported lower mean flow rate (85.9 mls−1) and subglottal pressure (900 Pa) compared to our overall mean values (Table III). Regarding the PS range, our experimental results exhibited lower and higher values (123 Pa–3287 Pa) than given by Mills et al. (2017; approximately 500 Pa–2000 Pa). However, Mills et al. (2017) achieved phonation at lower flow rates starting from ∼17 mls−1. Also the experiments reported by Awan et al. (2014) showed good consistency with our experiments regarding flow rates (85 mls−1–144 mls−1). However, we could apply flow rates of up to 180 mls−1 (25% higher) yielding regular vibrations.
The SPLs (56.2 dB–68.5 dB) reported by Novaleski et al. (2016) were in the same range as those by Mills et al. (2017; 49.1 dB–59.3 dB). In contrast, we measured values between 53.8 and 91.7 dB; see Table III. Similar high SPLs (up to 85 dB) were also reported in in vivo studies by Ge et al. (2009).
The mean fundamental frequency f0 was found higher than in Maytag et al. (2013; 451 Hz). Furthermore, f0 achieved a higher range in our study (Table III) than previous investigations by Novaleski et al. (2016; 419 Hz and 728 Hz), but shows good accordance with the f0 range found by Mills et al. (2017; approximately 300 Hz–850 Hz). Finally, Awan et al. (2014) reported potential fundamental frequencies of around 1000 Hz, which is higher than the maximum value (858 Hz) computed in our study.
To the best of our knowledge, no study has previously reported transglottal flow resistance RB for rabbits; see Table III. Here the reported values of up to 21 852 Pa s l−1 are nearly a factor of 3 higher than maximum values for other mammals such as porcines, ovines, and bovines, which had a maximum value of ∼7770 Pa s l−1 (Alipour and Jaiswal, 2009) and ex vivo human models (Döllinger et al., 2016), which had a maximum value of ∼3000 Pa s l−1. Thus, these high RB values in rabbits suggest a highly efficient energy transfer from the airflow to the vocal fold tissues during phonation (Döllinger et al., 2016; Kniesburges et al., 2017).
Tangentially, our study also confirmed previous observations that with increasing subglottal pressure, the airflow, f0, and SPL increase (Mills et al., 2017); see Fig. 5.
However, we could not confirm whether, at phonation onset, parameters PS and f0 increased for increasing elongation (i.e., longitudinal pre-stress), as reported by Mills et al. (2017); see Table VI. However, we did find increasing flow as a function of increasing elongation (not reported by Mills et al., 2017). In contrast to Mills et al. (2017), our data showed decreasing SPL; see Table VI. However, it must be noted that the absolute parameter variations at phonation onset were rather small, so interpretation was somewhat limited. Differences between both studies may also be based on different experimental setups.
TABLE VI.
Pre-stress | F0 (Hz) | PS (Pa) | Flow (mls-1) | SPL (dB) |
---|---|---|---|---|
w1 | 483 ± 137 | 683 ± 350 | 63 ± 4 | 64.6 ± 10.7 |
w2 | 477 ± 112 | 668 ± 384 | 64 ± 5 | 61.1 ± 11.8 |
w3 | 520 ± 97 | 764 ± 342 | 67 ± 10 | 58.3 ± 11.8 |
In summary, we could show that the dynamic range of the rabbit larynx is greater than previously reported: Normal phonation was observed (1) for lower PS and for higher PS and (2) for higher flow rates than reported in previous studies. Higher SPL values were measured than before, probably due to the greater PS levels, as compared to previous studies. These differences may also be based on possible different ages and sexes of the animals, as compared to other studies. However, this is speculative, since none of the studies reported information on sex and age. Finally, the first data on transglottal flow resistance RB were presented, suggesting a highly effective phonatory process for rabbit larynges. The high SPL and f0 values in combination with large glottal resistance show that the rabbit phonation process is extremely efficient. This high efficiency is necessary because rabbits mainly phonate/vocalize at these high frequencies and loudness in extremely dangerous situations with the aim of baffling predators and warning conspecifics (Quesenberry and Carpenter, 2011; Bays et al., 2006).
C. Phonatory correlations and parameter dependencies
Statistics confirmed that the GGI can be reduced by increasing vocal fold longitudinal pre-tension and applied airflow (i.e., subglottal pressure); see Fig. 4. The glottal closure insufficiency represented by GGI highly influences glottal dynamic GAW parameters, quantitatively (Table IV) and statistically (Table V). Despite the fact that 19 out of 27 statistical post hoc tests were statistically significant for the GGIi groups, the mean parameter value differences between Asymmetry Quotient (ASQ), SQ, and ASI only varied between 5% and 14%. However for stiffness, closing quotient (CQ), open quotient (OQ), and phase asymmetry index (PAI), the mean values varied between 20% and 31% and varied even more for amplitude-to-length-ratio (ALR) (97%) and maximumun area declination rate (MADR) (109%).
In our study, spatial dynamic left-right symmetry (ASI) seemed to be influenced more consistently by GGI than asymmetry in time (PAI). Whereas ASI showed statistically significant differences between all three GGIi groups, interestingly, no statistically significant differences were found for PAI between GGI1 vs GGI3. Despite statistically significant differences, but due to the actual low range of the glottal asymmetry parameters ASI and PAI as compared to the rather high variation in acoustic parameters (Table IV), we encourage further studies to analyze the dependencies between dynamic vocal fold asymmetries and voice quality since this topic is still under discussion (Mehta et al., 2011). Nevertheless, dynamic asymmetry is often reported for voice disorders (Yamauchi et al., 2016; Samlan and Story, 2017).
According to its definition, the GGI is an appropriate measure to describe the degree of glottal closure insufficiency, which is a major cause of dysphonic voice (Inwald et al., 2011; Vaca et al., 2017). Thus, its influence on the acoustic parameters is obvious. The acoustic parameters JTA, SHA, HNRA, and CPPA are statistically different among the three GGI groups (Table V) and show a deterioration of the acoustic output signal with increasing degree of glottal closure insufficiency, as displayed in Table IV.
Smaller GGIs enhanced the quality of the acoustic signal; harmonic and perturbation components showed consistently better values for the entire closed glottis (GGI1) compared to partial closure (GGI2) or no vocal fold contact (GGI3). Whereas the differences between GGI1 and GGI2 were statistically present but not as high, the deterioration of these two groups toward GGI3 was quantitatively large.
Most obvious, this acoustic deterioration is caused by massive decay of the glottal flow resistance RB (Table IV), which is well-correlated with the energy transfer between the glottal air flow and the vocal folds. As a consequence, vocal fold oscillation, which is the primary acoustic source for phonation, is disturbed (i.e., see the change of dynamic glottal parameters as shown in Table IV).
For aerodynamic parameters, it could be shown that with increasing glottal closure insufficiency, the energy transfer flow-tissue was reduced (RB), and the intensity (SPL) of the acoustic outcome signal and the subglottal pressure (PS) also decreased.
The influence of GGI on parameters computed on the time varying subglottal pressure signal was only verified for CPPP. There only statistical significant differences were present between full glottal closure and the other two groups. The CPP is a measure for the periodicity of the signal, i.e., development of the fundamental frequency and its harmonics (Hillenbrand et al., 1994; Birk et al., 2016). In contrast, JT and SH only considered disturbances within the fundamental frequency. This means that glottal closure insufficiency (groups GGI2 and GG3) mainly disturbed the development of the harmonics within the subglottal pressure signal but not amplitude (SHP) or time variation (JTP) of the fundamental frequency within the subglottal pressure signal. The harmonics-to-noise ratio (HNRP) exhibited no statistical differences among GGI groups, although HNR also considers harmonics. However, HNR is computed in the time domain and involves the averaged signal, whereas the CPP is computed in the frequency domain. This suggests that HNR is not as sensitive to changes of the harmonics, as compared to CPP.
Hence, for the fluid–structure–acoustic interaction of the phonatory process, our data suggest that the structure–fluid interaction was detected in changing CPPP values, but not in JTP and SHP for the three different GGI groups, i.e., little structure–fluid interaction. This means that the glottal closure gap induced slight disturbances in the subglottal signal harmonics that also resulted in great acoustic parameter changes, i.e., strong fluid–acoustic interaction. Hence, our data suggest that the harmonics within the subglottal pressure (airflow) are highly important for the acoustic signal quality. Also the results showed that the influence of glottal closure characteristics on the fundamental frequency of the subglottal signal was rather minor.
Regarding general relations within the phonation process found in our study (Tables IV and V), the data suggest the following. First of all, it was difficult to predict acoustic output quality based on the subglottal pressure signal. Whereas the acoustic signal showed high variations in the computed parameters, the subglottal signal seemed to vary only in the harmonic components. Second, glottal closure insufficiency highly influenced laryngeal dynamics, reducing the energy transfer from the airflow to the tissue vibrations, and yielding a reduced quality of the acoustic output signal. In turn, this suggested, third, that full glottis closure was necessary for an optimal interaction of fluid–dynamics and acoustics and therefore for good voice quality. Partial vocal fold closure (GGI2) already interrupted this interaction. However, no contact of the vocal folds (GGI3) during phonation resulted in highly reduced aerodynamic parameter values and acoustic quality.
V. CONCLUSIONS
Glottal closure characteristics appeared to exhibit a major influence on the phonatory process since all dynamic glottal, acoustic, and aerodynamic parameters were statistically and significantly influenced by GGI; see Table V. Complete glottal closure during phonation was desirable. While partial glottal closure during phonation already began to reduce aerodynamic and acoustic characteristics, contact-free oscillations of the vocal folds highly decreased acoustic and aerodynamic characteristics. The influence of glottal closure on the subglottal pressure signal appeared to be minor, but was recognizable in the CPP. Glottal closure insufficiency could be reduced by increasing the pre-tension of the vocal folds, the applied airflow, or both. Finally, the phonatory process in rabbits seemed more effective (i.e., in terms of higher RB values) than in other mammalian species investigated so far and the phonatory range in rabbits was found to be larger than previously reported.
ACKNOWLEDGMENTS
This research was supported by Deutsche Krebshilfe (DKH) under Grant No. 111332. D.A.B.'s work on this project was funded by the National Institutes of Health/National Institute on Deafness and Other Communication Disorders (NIH/NIDCD) Grant No. R01 DC013323.
References
- 1. Alipour, F. , and Jaiswal, S. (2009). “ Glottal airflow resistance in excised pig, sheep, and cow larynges,” J. Voice 23(1), 40–50. 10.1016/j.jvoice.2007.03.007 [DOI] [PubMed] [Google Scholar]
- 2. Almohizea, M. I. , Prasad, V. M. , Fakhoury, R. , Bihin, B. , and Remacle, M. (2016). “ Using peak direct subglottic pressure level as an objective measure during medialization thyroplasty: A prospective study,” Eur. Arch. Otorhinolaryngol. 273(9), 2607–2611. 10.1007/s00405-016-4059-5 [DOI] [PubMed] [Google Scholar]
- 3. Awan, S. N. , Novaleski, C. K. , and Rousseau, B. (2014). “ Nonlinear analysis of elicited modal, raised, and pressed rabbit phonation,” J. Voice 28(5), 538–547. 10.1016/j.jvoice.2014.01.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Baken, R. J. , and Orlikoff, R. F. (1999). Clinical Measurement of Speech and Voice, 2nd ed. ( Cengage Learning, Clifton Park, NY: ). [Google Scholar]
- 5. Bays, T. B. , Lightfoot, T. , and Mayer, J. (2006). Exotic Pet Behavior: Birds, Reptiles, and Small Mammals ( Saunders Elsevier, St. Louis, MO: ). [Google Scholar]
- 6. Bielamowicz, S. , Kreiman, J. , Gerratt, B. R. , Dauer, M. S. , and Berke, G. S. (1996).“ Comparison of voice analysis systems for perturbation measurement,” J. Speech Hear. Res. 39(1), 126–134. 10.1044/jshr.3901.126 [DOI] [PubMed] [Google Scholar]
- 7. Birk, V. , Döllinger, M. , Sutor, A. , Berry, D. A. , Gedeon, D. , Traxdorf, M. , Wendler, O. , and Kniesburges, S. (2017a). “ Automated setup for ex vivo larynx experiments,” J. Acoust. Soc. Am. 141(3), 1349–1359. 10.1121/1.4976085 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Birk, V. , Kniesburges, S. , Semmler, M. , Berry, D. A. , Bohr, C. , Döllinger, M. , and Schützenberger, A. (2017b). “ Influence of glottal closure on the phonatory process in ex vivo porcine larynges,” J. Acoust. Soc. Am. 142(4), 2197–2207. 10.1121/1.5007952 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Birk, V. , Sutor, A. , Döllinger, M. , Bohr, C. , and Kniesburges, S. (2016). “ Acoustic impact of ventricular folds on phonation studied in ex vivo human larynx models,” Acta. Acust. Acust. 102(2), 244–256. 10.3813/AAA.918941 [DOI] [Google Scholar]
- 10. Bohr, C. , Döllinger, M. , Kniesburges, S. , and Traxdorf, M. (2016). “ 3D visualization and analysis of vocal fold dynamics,” HNO 64(4), 254–261. 10.1007/s00106-016-0122-1 [DOI] [PubMed] [Google Scholar]
- 11. Chan, R. W. , and Titze, I. R. (2003). “ Effect of postmortem changes and freezing on the viscoelastic properties of vocal fold tissues,” Ann. Biomed. Eng. 31, 482–491. 10.1114/1.1561287 [DOI] [PubMed] [Google Scholar]
- 12. Chen, G. , Kreiman, J. , Gerratt, B. R. , Neubauer, J. , Shue, Y. L. , and Alwan, A. (2013). “ Development of a glottal area index that integrates glottal gap size and open quotient,” J. Acoust. Soc. Am. 133, 1656–1666. 10.1121/1.4789931 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Cohen, J. (1988). Statistical Power Analysis for the Behavioral Sciences, 2nd ed. ( Erlbaum, Mahwah, NJ: ). [Google Scholar]
- 14. Dippold, S. , Voigt, D. , Richter, B. , and Echternach, M. (2015). “ High-speed imaging analysis of register transitions in classically and jazz-trained male voices,” Folia. Phoniatr. Logop. 67(1), 21–28. 10.1159/000381095 [DOI] [PubMed] [Google Scholar]
- 15. Döllinger, M. , Berry, D. A. , and Kniesburges, S. (2016). “ Dynamic vocal fold parameters with changing adduction in ex-vivo hemilarynx experiments,” J. Acoust. Soc. Am. 139(5), 2372–2385. 10.1121/1.4947044 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Döllinger, M. , Kobler, J. , Berry, D. A. , Mehta, D. D. , Luegmair, G. , and Bohr, C. (2011). “ Experiments on analysing voice production: Excised (human, animal) and in vivo (animal) approaches,” Curr. Bioinform. 6(3), 286–304. 10.2174/157489311796904673 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Dumberger, L. D. , Overton, L. , Buckmire, R. A. , and Shah, R. N. (2017). “ Trial vocal fold injection predicts thyroplasty outcomes in nonparalytic glottic incompetence,” Ann. Otol. Rhinol. Laryngol. 126(4), 279–283. 10.1177/0003489416688479 [DOI] [PubMed] [Google Scholar]
- 18. Echternach, M. , Döllinger, M. , Sundberg, J. , Traser, L. , and Richter, B. (2013). “ Vocal fold vibration at high soprano fundamental frequencies,” J. Acoust. Soc. Am. 133(2), EL82–EL87. 10.1121/1.4773200 [DOI] [PubMed] [Google Scholar]
- 19. Elemans, C. P. H. , Rasmussen, J. H. , Herbst, C. T. , Dürng, D. N. , Zollinger, S. A. , Brumm, H. , Srivastava, K. , Svane, N. , Ding, M. , Larsen, O. N. , Sober, S. J. , and Svec, J. G. (2015). “ Universal mechanisms of sound production and control in birds and mammals,” Nat. Commun. 6, 8978. 10.1038/ncomms9978 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Ge, P. J. , French, L. C. , Ohno, T. , Zealear, D. L. , and Rousseau, B. (2009). “ Model of evoked rabbit phonation,” Ann. Otol. Rhinol. Laryngol. 118(1), 51–55. 10.1177/000348940911800109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Giraldez-Rodriguez, L. A. , and Johns, M., 3rd (2013). “ Glottal insufficiency with aspiration risk in dysphagia,” Otolaryngol. Clin. North. Am. 46(6), 1113–1121. 10.1016/j.otc.2013.09.004 [DOI] [PubMed] [Google Scholar]
- 22. Henrich, N. , Sundin, G. , Ambroise, D. , d'Alessandro, C. , Castellengo, M. , and Doval, B. (2003). “ Just noticeable differences of open quotient and asymmetry coefficient in singing voice,” J. Voice 17(4), 481–494. 10.1067/S0892-1997(03)00005-5 [DOI] [PubMed] [Google Scholar]
- 23. Herbst, C. T. , Lohscheller, J. , Svec, J. G. , Henrich, N. , Weissengruber, G. , and Fitch, W. T. (2014). “ Glottal opening and closing events investigated by electroglottography and super-high-speed video recordings,” J. Exp. Biol. 217(6), 955–963. 10.1242/jeb.093203 [DOI] [PubMed] [Google Scholar]
- 24. Herbst, C. T. , Stoeger, A. S. , Frey, R. , Lohscheller, J. , Titze, I. R. , Gumpenberger, M. , and Fitch, W. T. (2012). “ How low can you go? Physical production mechanism of elephant infrasonic vocalizations,” Science 337(6094), 595–599. 10.1126/science.1219712 [DOI] [PubMed] [Google Scholar]
- 25. Herbst, C. T. , Svec, J. G. , Lohscheller, J. , Frey, R. , Gumpenberger, M. , Stoeger, A. S. , and Fitch, W. T. (2013). “ Complex vibratory patterns in an elephant larynx,” J. Exp. Biol. 216(21), 4054–4064. 10.1242/jeb.091009 [DOI] [PubMed] [Google Scholar]
- 26. Hertegard, S. , Dahlqvist, A. , Laurent, C. , Borzacchiello, A. , and Ambrosio, L. (2003). “ Viscolelastic properties of rabbit vocal folds after augmentation,” Otolaryngol Head Neck Surg. 128(3), 401–406. 10.1067/mhn.2003.96 [DOI] [PubMed] [Google Scholar]
- 27. Hertegard, S. , Larsson, H. , Nagubothu, S. S. , Tolf, A. , and Svensson, B. (2009). “ Elasticity measurements in scarred rabbit vocal folds using air pulse stimulation,” Logoped. Phoniatr. Vocol. 34(4), 190–195. 10.3109/14015430902913527 [DOI] [PubMed] [Google Scholar]
- 28. Hillenbrand, J. , Cleveland, R. A. , and Erickson, R. L. (1994). “ Acoustic correlates of breathy vocal quality,” J. Speech. Hear. Res. 37(4), 769–778. 10.1044/jshr.3704.769 [DOI] [PubMed] [Google Scholar]
- 29. Holmberg, E. B. , Hillman, R. E. , and Perkell, J. S. (1988). “ Glottal airflow and transglottal air pressure measurements for male and female speakers in soft, normal, and loud voice,” J. Acoust. Soc. Am. 84(2), 511–529. 10.1121/1.396829 [DOI] [PubMed] [Google Scholar]
- 30. Inwald, E. C. , Döllinger, M. , Schuster, M. , Eysholdt, U. , and Bohr, C. (2011).“ Multiparametric analysis of vocal fold vibrations in healthy and disordered voices in high-speed imaging,” J. Voice 25(5), 576–590. 10.1016/j.jvoice.2010.04.004 [DOI] [PubMed] [Google Scholar]
- 31. Karnell, M. P. , Hall, K. D. , and Landahl, K. L. (1995). “ Comparison of fundamental frequency and perturbation measurements among three analysis systems,” J. Voice 9(4), 383–393. 10.1016/S0892-1997(05)80200-0 [DOI] [PubMed] [Google Scholar]
- 32. Klemuk, S. A. , Riede, T. , Walsh, E. J. , and Titze, I. R. (2011). “ Adapted to roar: Functional morphology of tiger and lion vocal folds,” PLoS One 6(11), e27029. 10.1371/journal.pone.0027029 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Kniesburges, S. , Birk, V. , Lodermeyer, A. , Schützenberger, A. , Bohr, C. , and Becker, S. (2017). “ Effect of the ventricular folds in a synthetic larynx model,” J. Biomech. 55, 128–133. 10.1016/j.jbiomech.2017.02.021 [DOI] [PubMed] [Google Scholar]
- 34. Kojima, T. , Mitchell, J. R. , Garrett, C. G. , and Rousseau, B. (2014). “ Recovery of vibratory function after vocal fold microflap in a rabbit model,” Laryngoscope 124(2), 481–486. 10.1002/lary.24324 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Ling, C. , Li, Q. , Brown, M. E. , Kishimoto, Y. , Toya, Y. , Devine, E. E. , Choi, K. O. , Nishimoto, K. , Norman, I. G. , Tsegyal, T. , Jiang, J. J. , Burlingham, W. J. , Gunasekaran, S. , Smith, L. M. , Frey, B. L. , and Welham, N. V. (2015). “ Bioengineered vocal fold mucosa for voice restoration,” Sci. Transl. Med. 7(314), 314ra187. 10.1126/scitranslmed.aab4014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Luegmair, G. , Mehta, D. D. , Kobler, J. , and Döllinger, M. (2015). “ Three-dimensional optical reconstruction of vocal fold kinematics using high-speed video with a laser projection system,” IEEE T. Med. Imaging 34(12), 2572–2582. 10.1109/TMI.2015.2445921 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Mau, T. , Du, M. , and Xu, C. C. (2014). “ A rabbit vocal fold laser scarring model for testing lamina propria tissue-engineering therapies,” Laryngoscope 124(10), 2321–2326. 10.1002/lary.24707 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Maytag, A. L. , Robitaille, M. J. , Rieves, A. L. , Madsen, J. , Smith, B. L. , and Jiang, J. J. (2013). “ Use of the rabbit larynx in an excised larynx setup,” J. Voice 27(1), 24–28. 10.1016/j.jvoice.2012.08.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Mehta, D. D. , Zanartu, M. , Quatieri, T. F. , Deliyski, D. D. , and Hillman, R. E. (2011). “ Investigating acoustic correlates of human vocal fold vibratory phase asymmetry through modeling and laryngeal high-speed videoendoscopy,” J. Acoust. Soc. Am. 130(6), 3999–4009. 10.1121/1.3658441 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Mills, R. D. , Dodd, K. , Ablavsky, A. , Devine, E. , and Jiang, J. J. (2017). “ Parameters from the complete phonatory range of an excised rabbit larynx,” J. Voice 31(4), 517.e9–517.e17. 10.1016/j.jvoice.2016.12.018 [DOI] [PubMed] [Google Scholar]
- 41. Mulisch, M. , and Welsch, U. (2010). Romeis—Mikroskopische Technik ( Springer Spektrum, Heidelberg, Germany: ). [Google Scholar]
- 42. Munhall, K. G. , Ostry, D. J. , and Parush, A. (1985). “ Characteristics of velocity profiles of speech movements,” J. Exp. Psychol. Hum. Percept. Perform. 11(4), 457–474. 10.1037/0096-1523.11.4.457 [DOI] [PubMed] [Google Scholar]
- 43. Novaleski, C. K. , Kojima, T. , Chang, S. , Luo, H. , Valenzuela, C. V. , and Rousseau, B. (2016). “ Nonstimulated rabbit phonation model: Cricothyroid approximation,” Laryngoscope 126(7), 1589–1594. 10.1002/lary.25559 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Patel, R. R. , Dubrovskiy, D. , and Döllinger, M. (2014). “ Characterizing vibratory kinematics in children and adults with high-speed digital imaging,” J. Speech. Lang. Hear. Res. 57(2), S674–S686. 10.1044/2014_JSLHR-S-12-0278 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Patel, R. R. , Walker, R. , and Sivasankar, P. M. (2016). “ Spatiotemporal quantification of vocal fold vibration after exposure to superficial laryngeal dehydration: A preliminary study,” J. Voice 30(4), 427–433. 10.1016/j.jvoice.2015.07.009 [DOI] [PubMed] [Google Scholar]
- 46. Pitman, M. J. , Kurita, T. , Powell, M. E. , Kimball, E. E. , Mizuta, M. , Chang, S. , Garrett, C. G. , and Rousseau, B. (2018). “ Vibratory function and healing outcomes after small intestinal submucosa biomaterial implantation for chronic vocal fold scar,” Laryngoscope 128(4), 901–908. 10.1002/lary.26883 [DOI] [PubMed] [Google Scholar]
- 47. Qiu, Q. , Schutte, H. K. , Gu, L. , and Yu, Q. (2003). “ An automatic method to quantify the vibration properties of human vocal folds via videokymography,” Folia Phoniatr. Logop. 55(3), 128–136. 10.1159/000070724 [DOI] [PubMed] [Google Scholar]
- 48. Quesenberry, K. , and Carpenter, J. W. (2011). Ferrets, Rabbits, and Rodents—Clinical Medicine and Surgery, 3rd ed. ( Elsevier Saunders, St. Louis, MO: ). [Google Scholar]
- 49. Regner, M. F. , Robitaille, M. J. , and Jiang, J. J. (2010). “ Interspecies comparison of mucosal wave properties using high-speed digital imaging,” Laryngoscope 120, 1188–1194. 10.1002/lary.20884 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Riede, T. , and Titze, I. R. (2008). “ Vocal fold elasticity of the Rocky Mountain elk (Cervus elaphus nelsoni)—Producing high fundamental frequency vocalization with a very long vocal fold,” J. Exp. Biol. 211(13), 2144–2154. 10.1242/jeb.017004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Roberts, T. , Morton, R. , and Al-Ali, S. (2011). “ Microstructure of the vocal fold in elderly humans,” Clin. Anat. 24(5), 544–551. 10.1002/ca.21114 [DOI] [PubMed] [Google Scholar]
- 52. Rousseau, B. , Hirano, S. , Chan, R. W. , Welham, N. V. , Thibeault, S. L. , Ford, C. N. , and Bless, D. M. (2004). “ Characterization of chronic vocal fold scarring in a rabbit model,” J. Voice 18(1), 116–124. 10.1016/j.jvoice.2003.06.001 [DOI] [PubMed] [Google Scholar]
- 53. Samlan, R. A. , and Story, B. H. (2017). “ Influence of left-right asymmetries on voice quality in simulated paramedian vocal fold paralysis,” J. Speech. Lang. Hear. Res. 60(2), 306–321. 10.1044/2016_JSLHR-S-16-0076 [DOI] [PubMed] [Google Scholar]
- 54. Schneider-Stickler, B. , Gaechter, J. , and Bigenzahn, W. (2013). “ Long-term results after external vocal fold medialization thyroplasty with titanium vocal fold medialization implant (TVFMI),” Eur. Arch. Otorhinolaryngol. 270(5), 1689–1694. 10.1007/s00405-013-2416-1 [DOI] [PubMed] [Google Scholar]
- 55. Thibeault, S. L. , Gray, S. D. , Bless, D. M. , Chan, R. W. , and Ford, C. N. (2002). “ Histologic and rheologic characterization of vocal fold scarring,” J. Voice 16(1), 96–104. 10.1016/S0892-1997(02)00078-4 [DOI] [PubMed] [Google Scholar]
- 56. Titze, I. R. (1994). Principles of Voice Production ( Prentice Hall, Englewood Cliffs, NJ). [Google Scholar]
- 57. Titze, I. R. (1995). Workshop on Acoustic Voice Analysis: Summary Statement ( NCVS, Iowa City, IA: ). [Google Scholar]
- 58. Titze, I. R. (2006). “ Theoretical analysis of maximum flow declination rate versus maximum area declination rate in phonation,” J. Speech. Lang. Hear. Res. 49(2), 439–447. 10.1044/1092-4388(2006/034) [DOI] [PubMed] [Google Scholar]
- 59. Titze, I. R. (2017). “ Human speech: A restricted use of the mammalian larynx,” J. Voice 31(2), 135–141. 10.1016/j.jvoice.2016.06.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Titze, I. R. , Fitch, W. T. , Hunter, E. J. , Alipour, F. , Montequin, D. , Armstrong, D. L. , McGee, J. A. , and Walsh, E. J. (2010). “ Vocal power and pressure-flow relationships in excised tiger larynges,” J. Exp. Biol. 213(22), 3866–3873. 10.1242/jeb.044982 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Titze, I. R. , and Riede, T. (2010). “ A cervid vocal fold model suggests greater glottal efficiency in calling at high frequencies,” PLoS Comput. Biol. 6(8), e1000897. 10.1371/journal.pcbi.1000897 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Titze, I. R. , Riede, T. , and Mau, T. (2016). “ Predicting achievable fundamental frequency ranges in vocalization across species,” PLoS Comput. Biol. 12(6), e1004907. 10.1371/journal.pcbi.1004907 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Vaca, M. , Cobeta, I. , Mora, E. , and Reyes, P. (2017). “ Clinical assessment of glottal insufficiency in age-related dysphonia,” J. Voice 31(1), 128.e1–128.e5. 10.1016/j.jvoice.2015.12.010 [DOI] [PubMed] [Google Scholar]
- 64. Valerie, A. , Vassiliki, K. , Irini, M. , Nikolaos, P. , Karampela, E. , and Apostolos, P. (2016). “ Adipose-derived mesenchymal stem cells in the regeneration of vocal folds: A study on a chronic vocal fold scar,” Stem Cells Int. 2016, 9010279. 10.1155/2016/9010279 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. van den Berg, J. , Zantema, J. T. , and Doornenbal, P. (1957). “ On the air resistance and the Bernoulli effect of the human larynx,” J. Acoust. Soc. Am. 29(5), 626–631. 10.1121/1.1908987 [DOI] [Google Scholar]
- 66. Wang, S. G. , Park, H. J. , Lee, S. M. , Ko, B. , Lee, S. M. , and Park, Y. M. (2016). “ A new videokymography system for evaluation of the vibration pattern of entire vocal folds,” Auris. Nasus. Larynx. 43(3), 315–321. 10.1016/j.anl.2015.10.002 [DOI] [PubMed] [Google Scholar]
- 67. Welham, N. V. , Montequin, D. W. , Tateya, I. , Choi, S. H. , and Bless, D. M. (2009). “ A rat excised larynx model of vocal fold scar,” J. Speech. Hear. Res. 52(4), 1008–1020. 10.1044/1092-4388(2009/08-0049) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Yamauchi, A. , Yokonishi, H. , Imagawa, H. , Sakakibara, K. I. , Nito, T. , and Tayama, N. (2016). “ Quantitative analysis of vocal fold vibration in vocal fold paralysis with the use of high-speed digital imaging,” J. Voice 30(6), 766.e13–766.e22. 10.1016/j.jvoice.2015.10.015 [DOI] [PubMed] [Google Scholar]
- 69. Yumoto, E. , Gould, W. J. , and Baer, T. (1982). “ Harmonics-to-noise ratio as an index of the degree of hoarseness,” J. Acoust. Soc. Am. 71(6), 1544–1550. 10.1121/1.387808 [DOI] [PubMed] [Google Scholar]