Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Sep 1.
Published in final edited form as: J Voice. 2022 Apr 5;38(5):1008–1022. doi: 10.1016/j.jvoice.2022.03.007

Effects of Age and Parkinson’s disease on the Relationship between Vocal Fold Abductory Kinematics and Relative Fundamental Frequency

Jennifer M Vojtech 1,2,3,4, Cara E Stepp 1,2,5
PMCID: PMC9532464  NIHMSID: NIHMS1790158  PMID: 35393167

Abstract

Purpose:

This study reports on two experiments to examine vocal fold abduction and its relationship with relative fundamental frequency (RFF), considering two attributes that have been shown to elicit group differences in RFF: age (Experiment 1) and Parkinson’s disease (PD; Experiment 2).

Methods:

For both experiments, simultaneous acoustic and nasendoscopic recordings were collected as participants produced the utterance, /ifi/. RFF values were computed from the acoustic signal, whereas abduction duration and glottic angle at voicing offset were identified from the laryngoscopic images. In Experiment 1, 50 speakers with typical voices (18–83 years) were analyzed to examine (1A) the effects of speaker age on individual outcome measures (RFF, abduction duration, glottic angle) via Pearson’s correlation coefficients, and (1B) the effects of abductory measures and age on RFF via an analysis of covariance. In Experiment 2, 20 speakers with PD and 20 matched controls were analyzed to examine (2A) the effects of group (with/without PD) on outcome measures via an analysis of variance, and (2B) the relationship of RFF with abduction duration, glottic angle, and age when considering group via an analysis of covariance.

Results:

Age demonstrated a significant, negative relationship with glottic angle (1A) but was not a significant factor when examining the relationship of vocal fold abduction and RFF (1B). Speaker group (with/without PD) demonstrated a significant effect on measures of RFF and abduction duration (2A) but was not a significant factor when examining the relationship of vocal fold abduction and RFF (2B).

Conclusions:

RFF is sensitive to changes in vocal fold abductory patterns during devoicing, irrespective of speaker age or PD status.

Keywords: Relative fundamental frequency, vocal fold abduction Parkinson’s disease, voice, age

I. Introduction

A. Background

Laryngeal muscle tension and vocal fold abductory kinematics are thought to play key roles in the regulation of intervocalic offsets. Increased vocal fold tension2,3 and abduction4 have been observed during vowels that precede voiceless consonants, suggesting that these mechanisms work in combination to cease vocal fold vibration. Numerous mechanisms have been postulated to describe the physiological underpinnings of increased vocal fold tension during voicing offset, including increased cricothyroid activity,5 thyroarytenoid activity,6 and vocal fold stiffness from passive stretching that occurs as the result of changes in laryngeal height.5,7 On the other hand, vocal fold abduction has been attributed to the posterior cricoarytenoid muscle, which acts as an antagonist to the cricothyroid and adductor muscles (i.e., thyroarytenoid, interarytenoid, lateral cricoarytenoid)8,911 to reduce the duration of vocal fold contact12 and, thus, inhibit vocal fold vibration. Based on these mechanisms, the interplay of laryngeal muscle tension and vocal fold abduction may counteract each other during voicing offset to cease vocal fold vibration, wherein increased tension increases voice fundamental frequency (fo)—or an increase in the frequency of vocal fold vibration—and vocal fold abduction lowers fo13. These mechanisms may be captured by a non-invasive, objective measure called relative fundamental frequency (RFF).

RFF has been proposed as an acoustic estimate of the degree of baseline laryngeal muscle tension. As excessive and/or imbalanced laryngeal muscle forces have been implicated in a large proportion of voice disorders14, RFF may be useful for non-invasively assessing and tracking changes in laryngeal tension in the clinic. Reflecting short-term changes in instantaneous fo during intervocalic offsets and onsets, RFF is calculated from the fo values of the 10 voicing cycles immediately preceding and 10 voicing cycles immediately following the voiceless consonant in a vowel–voiceless consonant–vowel (VCV) production (see Fig. 1). These 20 instantaneous fo values are then normalized to the steady-state fo value of the corresponding vowel—as shown in Eq. 1—to enable comparisons of changes in fo in units of semitones (ST) within and across speakers. This pattern of the 10 RFF values during voicing offset is thought to reflect the interplay of tension and abduction as these mechanisms act to slow down and eventually cease vocal fold vibration. Interestingly, the RFF values of young adults are characteristically stable or slightly decreasing during voicing offset,13,15 whereas those of older adults are significantly lower in magnitude.13,16

RFF(ST)=12*log2(foforef) (1)

Figure 1.

Figure 1.

Acoustic waveform of the vowel–voiceless consonant–vowel production, /ifi/. Voicing cycles preceding the voiceless consonant, /f/, are marked as voicing offset (teal) whereas those following the /f/ are marked as voicing onset (purple).

The dissimilarity in offset RFF trends between young and older adults suggests a potential difference in the mechanisms used for devoicing. Watson13 proposed that older adults may not be able to produce transient increases in vocal fold tension to assist in devoicing, possibly as a byproduct of age-related vocal fold atrophy frequently observed in older adults (e.g., breathiness, weakness, hoarseness, inability to sustain phonation, and/or atrophy of the vocalis muscle).17 Under this hypothesis, older adults may rely on a prolonged abductory gesture rather than the combined efforts of tension and abduction. This would imply that vocal fold abduction begins earlier in the preceding vowel for older adults and may include an increased glottic angle at the time of voicing offset (i.e., due to the vocal folds opening to cease vibration).4 In spite of the theoretical backing to support this contribution of vocal fold abduction to devoicing and, in turn, to measures of RFF, no study to date has physiologically examined the role of abductory kinematics in intervocalic offsets.

The theoretical implications of a prolonged abductory gesture in older adults is also interesting to consider when comparing RFF values between adults with Parkinson’s disease (PD) and age-matched controls.15,16 PD is a progressive neurodegenerative disease that typically develops in middle to late life (with incidence rates rising rapidly after 60 years of age)18 and affects the central and peripheral nervous systems.19,20 Thought to be a product of neural processes and morphological changes to the muscles, increased muscle rigidity is one of the hallmark motor symptoms exhibited in PD.2125 Since the regulation of laryngeal muscle tension is a crucial component of phonation, it is no surprise that excessive intrinsic laryngeal muscle activity has been reported in adults with PD when compared to age-matched controls.26,27 RFF is thought to reflect this disparity in baseline tension, wherein adults with PD exhibit even lower RFF values compared to controls.15,16 Although it follows that the comparably lower offset RFF values in speakers with PD reflect increased levels of muscle rigidity that arise with PD, no study has provided evidence to support this notion. As vocal fold abduction is implicated alongside laryngeal muscle tension during voicing offset, it is possible that the RFF values observed in those with PD are the result of an even greater reliance on vocal fold abduction rather than from increased baseline levels of laryngeal muscle activity. Since the contribution of vocal fold abduction to RFF has not been physiologically examined, however, it is thus unclear how tension and abduction play a role in RFF values in older adults with PD.

B. Purpose of the Current Study

Although RFF shows promise as an acoustic estimate of the degree of baseline laryngeal muscle tension, the specific contribution of vocal fold abduction to RFF has not been physiologically assessed. The purpose of this study was to therefore characterize the relationship between vocal fold abduction and RFF at intervocalic offsets. The motivation for this work stems from prior speculations of a prolonged abductory gesture in older adults and concurrently increased levels of baseline laryngeal muscle tension in adults with PD. Because RFF is captured using an acoustic signal—which is useful for non-invasive clinical voice assessments, yet only indirectly reflects the glottal source—these hypotheses yet to be physiologically confirmed. As such, vocal fold vibratory and abductory kinematics during devoicing cannot be characterized in relation to RFF when only examining the acoustic signal. To carry out this investigation, measures of RFF at voicing offset cycle 10 were examined relative to duration of vocal fold abduction and glottic angle at voicing offset. These three measures were also investigated in the presence of two factors that have been previously hypothesized to impact the relationship between vocal fold abduction and RFF: speaker age and a diagnosis of idiopathic PD (i.e., a disorder characterized by excessive laryngeal muscle tension).

Following previous work from Watson,13 we first sought to characterize the relationship between speaker age and vocal fold abduction during intervocalic offsets, as well as the extent to which this relationship contributes to measures of RFF in vocally healthy speakers. It was hypothesized that vocal fold abduction (abduction duration, offset angle) would be significantly, positively related to speaker age, whereas RFF (at voicing offset cycle 10) would be significantly, negatively related to speaker age. Measures of vocal fold abduction were also hypothesized to exhibit a significant, negative relation with RFF at voicing offset (i.e., greater abduction durations and larger glottic angles at voicing offset will be associated with lower RFF values at voicing offset cycle 10). These hypotheses were specifically tested in Experiment 1.

We then assessed the effects of idiopathic PD on vocal fold abduction, and how this relationship contributes to RFF at intervocalic offsets. It was hypothesized that measures of vocal fold abduction (abduction duration, offset angle) would not significantly differ between speakers with idiopathic PD and age-/sex-matched controls, but that speakers with idiopathic PD would exhibit statistically significantly lower RFF values at voicing offset. Furthermore, it was hypothesized that both vocal fold abduction and speaker group (control, PD) would be significantly predictive of RFF at voicing offset. These hypotheses were specifically tested in Experiment 2.

II. Experiment 1

A. Method

1. Participants

Fifty adults with typical voices (25 cisgender females, 25 cisgender males) aged 18–83 years (M = 43.5 years, SD = 21.8 years) were enrolled in the study. The Montreal Cognitive Assessment (MoCA) was administered to participants over the age of 50 to ensure all enrolled participants had the capacity to consent to study tasks via a priori cut-off of ≥21;28 for the 24 participants over 50 years of age, the average MoCA score was 28.3 (SD = 1.3 years, range = 26–30). All participants were native English speakers, non-smokers, and had no history of speech, language, hearing, neurological, or voice problems. Participants with trained singing experience beyond grade school were excluded to minimize variability in phonatory behaviors that may occur when differentiating between singers and non-singers.29 A certified voice-specializing speech-language pathologist screened all participants with typical voices for healthy vocal function via auditory-perceptual assessment and flexible nasendoscopic laryngeal imaging. A hearing screening of pulsed pure tones30 was administered to each participant at frequencies of 125, 250, 500, 1000, 2000, and 4000 Hz. Of this group, 38 adults passed the hearing screening under 25 dB HL at both ears (protocol based on American Speech-Language-Hearing Association31). The remaining 12 adults—all over 50 years of age—passed this hearing screening at frequencies of 125, 250, 500, and 1000 under 25 dB HL and at 2000 and 4000 Hz under 40 dB HL in at least one ear.32 All participants provided informed, written consent in compliance with the Boston University Institutional Review Board. Detailed demographic information for the control speakers is reported in Table 1.A of Appendix A.

2. Recording Procedures

Participants were trained to produce eight iterations of the VCV utterance, /ifi/, with equal stress (same pitch and duration) on both vowels. This VCV utterance was selected to visualize the vocal folds under endoscopy, wherein the voiceless consonant, /f/, has been shown to minimize within-speaker variations in the resulting acoustic signal.33 Each participant was instructed to produce four /ifi/ utterances, take a breath, then produce the remaining four /ifi/ utterances.

Participants were then seated in a sound-attenuated booth and instrumented with a directional headset microphone (Shure SM35 XLR) placed 45° from the midline and 7 cm from the lips. Microphone signals were pre-amplified (Xenyx Behringer 802 Preamplifier) and digitized at 30 kHz (National Instruments 6312 USB). A flexible routine endoscope (Pentax, Model FNL-10RP3, 3.5-mm) was passed transnasally over the soft palate and into the hypopharynx for laryngeal visualization using a steady xenon light source (300 W KayPENTAX Model 7162B). In the event that participant anatomy or comfort interfered with image acquisition using the routine endoscope, a flexible slim endoscope (Pentax, Model FNL-7RP3, 2.4-mm) was used instead. A nasal decongestant was offered to minimize participant discomfort as the endoscope was passed through the nasal cavity. Laryngeal images were captured at 1 kHz via a camera (FASTCAM Mini AX100l; Model 540K-C-16GB; 40-mm optical lens adapter) attached to the endoscope and were recorded using Photron Fastcam Viewer software (v.3.6.6). A frame rate of 1 kHz was chosen to sufficiently track the fundamental frequency of vocal fold vibration (85–255 Hz in adults34) and capture gross abductory gestures (104–227 ms in adults35) while permitting a reasonable spatial resolution (256 × 352 pixels with the 3.5-mm endoscope and 256 × 256 pixels with the 2.4-mm endoscope) and an appropriate video duration to successfully record all eight /ifi/ repetitions.

During the endoscopic procedure, participants were cued to produce the eight /ifi/ repetitions. This number of repetitions was selected based on the recording limitations of setup: the high-speed imaging and synchronized microphone recordings were restricted in duration to 7.940 seconds when the 3.5-mm endoscope was used and 8.734 seconds when the 2.4-mm endoscope was used. To account for trials in which productions at the end of the recording were incompletely captured or in cases where less than eight /ifi/ utterances were produced, each condition was repeated a minimum of two times. Additional trials were also recorded in the event that the vocal folds were not sufficiently captured (e.g., due to obstruction from the epiglottis) or if the participant produced unequally stressed vowels. The length of the endoscopic procedure lasted approximately 5–10 minutes. A total of 740 productions were captured across the 50 speakers, with an average of 14.8 (SD = 3.2) /ifi/ productions captured per speaker.

3. Data Analysis

a. High-speed Video Processing
(1). Technician Training

A series of nine technicians were trained in glottic angle identification at a conventional framerate of 30 frames per second (fps) using methodology described by Diaz-Cadiz, McKenna, Vojtech and Stepp.36 In brief, the technicians identified glottic angles extending from the anterior commissure along the medial vocal fold edge to the vocal process. These angles were then compared to angle markings made previously by a gold-standard technician. The technicians were required to meet a training standard of .80 via two-way mixed-effects intraclass correlation coefficients (ICC) for consistency of agreement. Using this methodology, the resulting average reliability for the nine technicians was ICC(3,1) = .89 (SD = .01, range = .88–.91).

From here, the technicians were trained to use a semi-automated glottic angle extraction algorithm to identify glottic angles at 1000 fps. The algorithm was developed in MATLAB (version 9.3; The MathWorks, Natick, MA) to track the glottic angle over time within a VCV production, and is described by Diaz-Cadiz et al.36 The algorithm carries out an automated glottic angle extraction process to process to identify the glottis, segment vocal fold edges, and estimate the glottic angle over time. The result of this three-step process is a glottic angle waveform for the VCV production, which is shown in a graphical user interface (GUI) alongside the time-aligned video frames and microphone signal (see Fig. 2). In the event that the technician did not agree with the results of the automated algorithm, the technician would manually mark glottic angles for the production at a downsampled rate of 50 Hz. The automated glottic angle extraction procedure would then run again, this time using the manual glottic angle data as a reference. Within the glottic angle tracking training, technicians were required to meet reliability standards of ICC(3,1) ≥ .80 compared to a gold-standard technician.36 The resulting average reliability of the nine technicians was ICC(3,1) = .85 (SD = .04, range = .80–.91). Following the training, the technicians analyzed the experimental data.

Figure 2.

Figure 2.

(a) View of the vocal folds under flexible nasendoscopy, with the glottic angle marked from the anterior commissure to the vocal processes, and (b) Upper panel: Raw glottic angle waveform (gray) with smoothed data overlay (black), and bottom panel: Filtered quick vibratory profile (QVP). Solid lines indicate the start of vocal fold abduction (green) and time of voicing offset (purple). Abduction duration (Tabd) and glottic angle at voicing offset (θoff) are identified.

(2). Video Usability

Trained technicians manually inspected the video images comprising each /ifi/ production to determine whether the videos effectively captured the vocal folds during the transition into and out of the /f/. In the event that the vocal folds were obstructed (e.g., by the epiglottis) or were not visible (e.g., due to poor image contrast) during the recording, the production was regarded as “unusable” and removed from further analysis. Usable videos were then processed using a semi-automated glottic angle extraction algorithm36 that tracks the glottic angle over time. If the vocal folds were not appropriately tracked, the technician could intervene by manually extracting vocal fold angles to inform the algorithm before running again. If errors still persisted following manual intervention, the technician marked the /ifi/ production as unusable.

The aforementioned technicians used the MATLAB-based algorithm to extract the glottic angle waveform for each /ifi/ production (N = 740). A single technician determined whether the /ifi/ production was usable and, if so, extracted the glottic angle waveform for the production. Of the 740 total /ifi/ productions, technicians accepted the automated algorithmic results in 68.4% of productions (N = 506), and accepted the algorithmic results after performing manual extraction techniques on 16.4% of productions (N = 121). The remaining 113 productions were removed from subsequent analyses due to demonstrated problems in video usability (9.7%; N = 72) or from errors in algorithmic estimation (5.5%; N = 41). Algorithmic reliability was not assessed since prior work indicates that the algorithm yields good reliability ICC ≥ .80.36 The result of this analysis produced 627 usable /ifi/ productions for subsequent processing.

(3). Vocal Fold Abduction

Vocal fold abduction was measured via abduction duration (milliseconds) and glottic angle at voicing offset (degrees). Technicians were presented with a MATLAB GUI that depicted time-aligned video frames, microphone signal, glottic angle waveform, and quick vibratory profile (QVP). In this analysis, the QVP was included as an alternative to the glottic angle waveform—extracted during the prior video usability processing steps—to assist technicians in discriminating the vibrating glottis in video frames of poor resolution, as well as to identify the window of time that contained the transition between the vowel, /i/, and voiceless consonant, /f/. The QVP was estimated using methodology from Ikuma, Kunduk, and McWhorter37 to quantify the vibratory motion of the vocal folds via capturing changes in light intensity of the video frame. The video frame was first centered over the glottis; changes in light intensity were then calculated across the vertical and horizontal directions of the video frame.36 The average of the minimum pixel intensity per row (for vertical profile) and column (for horizontal profile) were then calculated and summed to produce the QVP. From here, the QVP was high-pass filtered using a 7th order Butterworth filter using a cut-off frequency of 50 Hz to attenuate signal noise below a minimum fo of 50 Hz.

Within the MATLAB GUI, three technicians used the glottic angle waveform and QVP to identify two timepoints: abduction onset and voicing offset. In this analysis, abduction onset was defined as the last full or maximum contact of the vocal folds during voicing offset, whereas voicing offset was measured as the termination of the last vibratory cycle before the voiceless consonant. If the arytenoid cartilages obstructed the view of the vocal folds during devoicing (e.g., due to supraglottic constriction), abduction onset was considered as the time in which the arytenoid cartilages began to move away from one another. The technicians were instructed to corroborate the selected indices for abduction onset and voicing offset using the raw video frames in order to minimize errors that may occur if the glottic angle waveform failed to capture small glottal gaps during vibratory cycle phases and/or the QVP was confounded by lighting artifacts (e.g., intensity saturation due to the epiglottis coming into view). The microphone signal was included within the GUI in the event that the glottic angle waveform and QVP both failed to properly track the vibrations of the vocal folds; in such cases, the technicians were able to use the GUI to indicate that the production needed to be rejected or reprocessed.

The technicians each reanalyzed 10% of participants in a separate sitting to ensure adequate intrarater reliability. When assessed via two-way mixed-effects ICCs for absolute agreement, intrarater reliability ranged from moderate to excellent (.70–.99) with an overall mean reliability of .98 (95% CI = .97–1.0). The three technicians also analyzed the high-speed video images of the same participant to assess interrater reliability. Interrater reliability was computed using two-way mixed-effects ICCs for consistency of agreement (single measures), producing an average reliability of .92 (95% CI = .83 −1.0) for abduction onset and voicing offset.

The duration of the abductory gesture (Tabd) was then measured by subtracting the timepoint corresponding to abduction onset from that of voicing offset. The abductory gesture was also characterized by extracting the glottic angle at toff from the glottic angle waveform, called θoff.

b. Acoustic Signal Processing

Semi-automated RFF estimated was performed on the microphone signals of the 627 usable /ifi/ productions via the aRFF-AP algorithm38 in MATLAB (version 9.3). The algorithm required a technician to confirm the location of the midpoint of the voiceless consonant, /f/, in each /ifi/ production. Manual intervention was carried out in the event that the algorithm incorrectly located the /f/. RFF values were then automatically calculated from the vocal cycles closest to the /f/. RFF at voicing offset cycle 10 (RFFoff_10) was retained to examine the association of RFF with group and vocal fold abductory kinematics, as RFFoff_10 has been reported to be reduced when speakers without voice disorders increase their vocal effort39,40 and when speakers have voice disorders characterized by increased muscle tension (e.g., vocal hyperfunction, laryngeal dystonia),41,42 as well as being robust against varying abduction initiation times.43 Voicing offset instances that were rejected during algorithmic processing (e.g., due to voicing during the voiceless consonant, glottalization, or misarticulation) were removed from further analysis (185 /ifi/ productions).

c. Statistical Analyses

The high-speed video and acoustic signal analyses resulted in one value for Tabd, θoff, and RFFoff_10 for each of the 50 speakers. Due to the large amount of rejected data, Pearson’s product-moment correlation coefficients were first calculated to detect possible relationships between the number of missing or unusable /ifi/ productions and speaker age. Similarly, an independent t-test was used to examine the relationship between missing or unusable /ifi/ productions and speaker sex. Following, the impact of speaker age on vocal fold abduction was quantified by computing Pearson’s product-moment correlation coefficients between speaker age and Tabd, θoff, and RFFoff_10. The extent to which speaker age and vocal fold abduction contribute to measures of RFF in adults with typical voices was then assessed via an analysis of covariance (ANCOVA) model. To construct this model, speaker age, Tabd, and θoff were included as covariates. Since older adult men have been shown to exhibit lower RFF values than older adult women,16 speaker sex was included as a factor in the current ANCOVA to minimize confounding effects. RFFoff_10 was implemented as the response variable. Significance was set a priori to p < .05 and partial eta squared (ηp 2) values were calculated as effect sizes.

B. Results

Although no statistically significant relationship was observed between the number of missing or unusable /ifi/ productions and speaker age (p = .386), a significant relationship was observed for speaker sex (p = .027), wherein more /ifi/ productions were missing or unusable for female speakers (M = 4.6, SD = 3.5) than male speakers (M = 2.5, SD = 2.9). Fig. 3 depicts the trends in RFFoff_10, Tabd, and θoff across speaker age. Speaker age exhibited a significant, moderate relationship with θoff (r = −.31, p = .027; Fig. 3c), wherein older speakers tended to have smaller glottic angles during intervocalic offsets. Age was not significantly correlated with RFFoff_10 (r = −.09, p = .538; Fig. 3a) or Tabd (r = −.14, p = .346; Fig. 3b).

Figure 3.

Figure 3.

Scatter plots of speaker values (teal) for (a) relative fundamental frequency (RFF) at offset cycle 10, (b) abduction duration, and (c) glottic angle at voicing offset relative to speaker age. Lines of best fit are shown in purple.

Table 1 summarizes the results of the ANCOVA examining relationship between age, sex, and measures of vocal fold abduction (i.e., Tabd, θoff) on RFFoff_10. The model accounted for 19.2% of the variance in the data for RFFoff_ (adjusted R2 = 12.0%). Whereas age, sex, and Tabd did not exhibit significant effects on RFFoff_10, θoff demonstrated a significant, medium effect on RFF (p = .019, ηp2 = 0.12).

Table 1.

Results of the analysis of covariance model examining the effects of speaker age, speaker sex, abduction duration, and glottic angle at voicing offset on RFF offset cycle 10.

Effect df F p η p 2 Effect Size Interpretation
Speaker Age 1 2.38 .130
Speaker Sex 1 3.01 .090
Abduction Duration (Tabd) 1 0.02 .895
Glottic Angle at Voicing Offset (θoff) 1 5.89 .019 0.12 Medium

Note. Effect size interpretations based on criteria from Witte and Witte.1 Dashes (–) indicate non-significant findings (p > .05).

III. Experiment 2

A. Method

1. Participants

Twenty adults with idiopathic PD (6 cisgender females, 14 cisgender males) aged 50–75 years (M = 65.0 years, SD = 7.4 years) and twenty age- and sex-matched controls (6 cisgender females, 14 cisgender males) aged 47–81 years (M = 65.1 years, SD = 8.8 years) were enrolled in the study. This sex distribution is consistent with the higher incidence of PD in men compared to women18,44. As in Experiment 1, all participants were native English speakers, non-smokers, and did not report trained singing experience beyond grade school. Of note, 14 of 20 of the control speakers constituted a subset of the participant sample examined in Experiment 1. The Montreal Cognitive Assessment (MoCA) was administered to each of the 40 participants to determine cognitive status and an a priori cut-off of ≥21 was set to ensure all included participants had the capacity to consent to the study tasks.28 All participants provided informed, written consent in compliance with the Boston University Institutional Review Board.

a. Characteristics of Speakers with PD

Within the current study, all adults with PD were diagnosed with idiopathic PD by a neurologist and were recorded while on their usual carbidopa/levodopa medication schedule to preserve typical vocal function. Individuals who used deep brain stimulation devices (N = 4) were requested to turn their device off for the duration of data collection to minimize the potential impacts of deep brain stimulation on laryngeal function.

A speech-language pathologist specializing in voice disorders assessed the overall severity of dysphonia of each participant using the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V).45 The average overall severity of dysphonia score for speakers with PD was 16.8 (SD = 10.6, range = 4.0–33.5). Additionally, the Movement Disorder Society-Sponsored Revision of the Unified Parkinson’s Disease Rating Scale (MDS-UPDRS) was administered to each participant with PD to determine the extent of both motor and non-motor complications; each examination was administered and scored per protocol by a certified MDS-UPDRS administrator. The severity of motor complications was, on average, moderate (M = 43.7, SD = 17.7) and ranged from mild to severe (range = 13–91).46 The average Hoehn-Yahr score was 2.0 (SD = 1.1) and ranged from 0 (no disability) to 4 (severe disability).47,48 Table 2 shows demographic information for the 20 individuals with PD.

Table 2.

Demographic information of participants with Parkinson’s disease (PD).

Participant Sex Age CAPE-VOS Years Post-Dx PD Characteristics MDS-UPDRS-III Hoehn-Yahr Scale
PD1 M 60 30.1 7 54 2
PD2 F 62 5.6 9 49 3
PD3 F 70 14.7 6 77 4
PD4 M 50 7.1 0 17 0
PD5 M 55 18.4 21 49 3
PD6 M 62 10.0 3 50 2
PD7 F 74 30.6 24 59 2
PD8 M 73 33.6 9 19 1
PD9 M 67 6.4 4 63 3
PD10 M 67 19.4 2 38 2
PD11 M 62 27.9 13 47 2
PD12 M 59 4.0 2 23 2
PD13 M 73 6.8 3 23 1
PD14 M 68 5.0 6 38 2
PD15 F 73 33.3 8 52 2
PD16 M 75 22.1 1.5 68 2
PD17 F 65 15.4 10 48 3
PD18 M 68 8.5 1 52 3
PD19 M 65 28.3 1 35 0
PD20 F 51 9.7 5 13 0

Note. Dx = Diagnosis, CAPE-V OS = Consensus of Auditory-Perceptual Evaluation of Voice, Overall Severity of Dysphonia, MDS-UPDRS-III = Movement Disorder Society-sponsored revision of the Unified Parkinson’s Disease Rating Scale: Part III, Motor Examination.

b. Characteristics of Control Speakers

Age- and sex-matched control speakers reported no history of speech, language, hearing, neurological, or voice problems. All control speakers were screened by a certified voice-specializing speech-language pathologist for healthy vocal function via auditory-perceptual assessment (CAPE-V) and flexible nasendoscopic laryngeal imaging. The average score for overall severity of dysphonia of the control group was 10.6 (SD = 7.9, range = 1.7–34.2). Demographic information for the control speakers is reported in Table 2.A of Appendix A.

c. Hearing Status

A hearing screening of pulsed pure tones30 was administered to each participant at frequencies of 125, 250, 500, 1000, 2000, and 4000 Hz (protocol based on American Speech-Language-Hearing Association31). Nineteen control speakers and fourteen speakers with PD passed the hearing screening under at frequencies of 125, 250, 500, and 1000 under 25 dB HL and at 2000 and 4000 Hz under 40 dB HL in at least one ear.32 The control speaker that did not meet these criteria exhibited a threshold of 40 dB HL at 125 and 250 Hz, as well as a threshold of 30 dB HL at 500 Hz. Of the six speakers with PD that did not meet these criteria, one speaker (PD14 in Table 2) demonstrated a threshold of 45 dB HL at 2000 Hz, two speakers (PD16, PD19) exhibited a threshold of 45 dB HL at 4000 Hz, two speakers (PD11, PD15) demonstrated a threshold of 50 dB HL at 4000 Hz, and one speaker (PD10) wore hearing aids during the course of the study and demonstrated thresholds of 45 dB HL at 125 Hz and 30 dB HL at 1000 Hz.

2. Recording Procedures & Data Analysis

All data analyzed in the current experiment were collected during a nasendoscopic examination as described in Experiment 1. In total, there were 104 instances in which 16 full /ifi/ productions were not captured for a speaker; this process resulted in total of 536 /ifi/ productions ([40 participants × 16 /ifi/ productions] – 104 incomplete or missing /ifi/ productions) across the two speaker groups. In using the semi-automated MATLAB algorithm from Experiment 136 to extract the glottic angle waveform for each /ifi/ production, technicians accepted the automated algorithmic results in 61.2% of productions (N = 328) and accepted the algorithmic results after performing manual extraction techniques on 22.2% of productions (N = 119). Of the remaining productions, 10.4% were considered unusable (N = 56) and a further 6.2% were rejected due to errors in algorithmic estimation (N = 33). Finally, 124 voicing offset instances that were rejected during algorithmic processing of the microphone signal (e.g., due to voicing during the voiceless consonant, glottalization, or misarticulation) were removed from further analysis. This process resulted in a total of 323 /ifi/ productions.

Using methodology described in Experiment 1, the analyses performed on the high-speed video images and microphone signals resulted in the following measures for each /i/-to-/f/ transition of the 323 /ifi/ productions: (1) Tabd, (2) θoff, and (3) RFFoff_10. The resulting measures were then evaluated with a series of statistical models to determine the relationship between speaker group, RFF values, and vocal fold abductory kinematics. First, Pearson’s product-moment correlation coefficients were calculated to detect possible relationships between the number of missing or unusable /ifi/ productions and speaker age. Independent t-tests were used to examine the relationship between missing or unusable /ifi/ productions and binary speaker variables of sex and group. Following, three separate analysis of variance (ANOVA) models were constructed to examine the effect of group (speakers with PD versus age- and sex-matched controls) on voicing offset measures of Tabd, θoff, and RFFoff_10. In these models, each voicing offset measure was set as the response variable, group was set as a fixed factor, significance was set a priori to p < .05, and partial eta squared (ηp2) values were calculated as effect sizes. An ANCOVA model was then constructed to examine the effects of covariates of speaker age, Tabd, and θoff, as well as the fixed factors of speaker group, sex, and relevant interactions of group × sex, group × Tabd, and group × θoff on RFFoff_10 (response variable). Significance was set a priori to p < .05 and ηp2 values were calculated as effect sizes.

B. Results

There were no statistically significant relationships between the number of missing or unusable /ifi/ productions and speaker variables of age (p = .290), sex (p = .835) or group (p = .221). Table 3 shows the results for the models examining the effects of group on each of the three voicing offset measures. Group was not a significant factor in the model for θoff (p = .476, ηp2 = 0.01), but was a significant factor in the models for both Tabd (p = .034, ηp2= 0.11) and RFFoff_10 (p = .021, ηp2 = 0.13).

Table 3.

Results of the analysis of variance models examining the effects of speaker group on RFF at offset cycle 10, abduction duration, and glottic angle at voicing offset.

Model Effect df F p η p 2 Effect Size Interpretation
RFF at Voicing Offset Cycle 10 (RFFoff 10) Group 1 5.80 .021 0.13 Medium
Abduction Duration (Tabd) Group 1 4.84 .034 0.11 Medium
Glottic Angle at Voicing Offset (θoff) Group 1 0.52 .476

Note. Effect size interpretations based on criteria from Witte and Witte.1 Dashes (–) indicate non-significant findings (p >.05).

Figure 4 shows individual values for these three variables based on speaker group. Speakers with PD exhibited larger, but less variable, values of Tabd (M = 67.4 ms, SD = 17.3 ms) compared to age- and sex-matched controls (M = 53.9 ms, SD = 21.2 ms). RFF values at voicing offset cycle 10 were substantially more negative, but also less variable for speakers with PD (M = −2.04 ST, SD = 0.92 ST) than controls (M = −1.27 ST, SD = 1.09 ST). Consistent with the lack of a group effect for θoff, values for speakers with PD (M = 15.7 degrees, SD = 5.0 degrees were similar to those from age- and sex-matched controls (M = 14.5 degrees, SD = 5.8 degrees).

Figure 4.

Figure 4.

Individual speaker (purple squares) and mean (blue circles) values for (a) RFF at offset cycle 10, (b) glottic angle at voicing offset, and (c) abduction duration based on speaker group (speakers with PD: left; age-and sex-matched control speakers: right). Error bars show 95% confidence intervals.

Table 4 summarizes the ANCOVA model examining the effects of θoff, Tabd, age, group, sex, and relevant group interactions (group × sex, group × Tabd, group × θoff) on RFFoff_10. The model accounted for 43.4% of the variance in the data for RFFoff_10 (adjusted R2 = 28.8%), with only θoff (p = .003, ηp2 = 0.25) and Tabd (p = .039, ηp2 = 0.13) demonstrating significant effects on RFF.

Table 4.

Results of the analysis of covariance model examining the effects of speaker age, abduction duration, glottis angle at voicing offset, group, and relevant interactions on RFF offset cycle 10.

Effect df F p η p 2 Effect Size Interpretation
Abduction Duration (Tabd) 1 4.65 .039 0.13 Medium
Glottic Angle at Voicing Offset (θoff) 1 10.25 .003 0.25 Large
Speaker Age 1 0.682 .415
Speaker Group 1 0.669 .420
Speaker Sex 1 2.47 .126
Speaker Group × Speaker Sex 1 0.12 .733
Speaker Group × Tabd 1 0.14 .714
Speaker Group × θoff 1 1.96 .172

Note. Effect size interpretations based on criteria from Witte and Witte.1 Dashes (–) indicate non-significant findings (p > .05).

IV. Discussion

The goal of this work was to investigate the relationship between speaker age, vocal fold abductory kinematics, and RFF. Two experiments were carried out to assess these relationships. The first experiment examined the effects of speaker age on both vocal fold abduction and RFF in individuals with typical voices. Simultaneous acoustic recordings and laryngeal images were captured via a microphone and flexible nasendoscope, respectively, as speakers with typical voices produced the VCV utterance, /ifi/. RFF was extracted from the acoustic signal, whereas abduction duration and glottic angle at voicing offset were computed from the laryngeal images to characterize vocal fold abduction. Vocal fold abduction measures and RFF at voicing offset cycle 10 were then analyzed with respect to speaker age within a cohort of speakers with typical voices. The aim of the second experiment was to examine how PD—which is characterized by excessive laryngeal muscle tension—affects vocal fold abduction and to what extent this relationship affects RFF. As such, abduction duration, glottic angle at voicing offset, and RFF at voicing offset cycle 10 were calculated as in the first experiment but were examined with respect to speaker group (speakers with PD versus age- and sex-matched control speakers).

A. Comparison of Outcome Metrics to the Literature

In the current study, glottic angle at voicing offset was measured relative to the physiological cessation of vocal fold vibration. The mean glottic angle at voicing offset for the 50 speakers with typical voices in Experiment 1 was 14.9 degrees, and ranged from 8.6 to 27.8 degrees. In Experiment 2, mean glottic angle was 14.5 degrees (range = 5.6–27.8 degrees) for speakers with typical voices and 15.7 degrees (range = 7.6–27.0 degrees) for speakers with PD. Although numerous works have examined glottic angles during voicing offset, most values reported in the literature focus on maximum abduction angles during non-speech tasks (e.g., sniff or cough)35,49,50 and are thus not comparable to those of the current study. The values obtained in this study are similar to those reported for speakers without voice disorders, which have been shown for speakers without voice disorders to range between 6 and 24 degrees during /ifi/ productions and between 5 and 18 degrees during /iti/ productions.51

As with glottic angle, abduction duration was measured relative to the termination of vocal fold vibration in the current work. Mean abduction duration in Experiment 1 was 50.6 ms for 50 speakers with typical voices. The speakers with typical voices in Experiment 2 exhibited a mean abduction duration of 53.9 ms, whereas speakers with PD demonstrated a mean duration of 67.4 ms. These values are greater than those reported in the literature. In determining the relationship between the acoustic signal and phonatory oscillatory events, Patel, Forrest and Hedges52 reported a mean abduction duration of approximately 40 ms between the time of vocal fold cessation and the first incomplete vocal fold closure. However, the authors examined vocal behaviors during repetitions of the syllable /hi/ rather than /ifi/, as examined here. Differences in mean abduction duration may be a result of dissimilarities in the voicing mechanisms needed to produce a CV sequence (as in /hi/) compared to a VCV sequence (as in /ifi/), as well as possible mechanistic differences for producing /h/ compared to /f/. In addition, the authors used rigid laryngoscopic techniques to visualize the vocal folds, whereas the current study implemented flexible laryngoscopy. Since rigid laryngoscopy requires the tongue to be restricted during the endoscopic examination, it is possible that differences in laryngoscope type may have also led to divergent vocal behaviors—and, in turn, abduction durations—when producing the speech stimuli.

Measures of vocal fold abduction duration have also been obtained using electroglottography. For instance, Watson, Roark and Baken53 simultaneously acquired acoustic and electroglottographic signals during sustained /ɑ/ productions. The authors estimated the physiological time of voicing offset via tracking the contact of the vocal folds through the electroglottographic recordings. Physiological times of voicing offset were compared to those obtained via the acoustic signal the authors used electroglottography to track the contact of the vocal folds. The average duration of the abductory gesture for 112 speakers without voice disorders (57 female, 55 male) was identified as 20.0 ms from the time of acoustic voicing offset. However, Watson, Roark and Baken53 examined abduction duration in a linguistically unconstrained context as participants produced sustained /ɑ/ vowels as well as the words “hallways” and “always.” The devoicing gesture does not require that the vocal mechanism be optimized in linguistically unconstrained contexts to meet the linguistic goals necessary to devoice and then rapidly reinitiate voice (as is required to produce the intervocalic /f/ of /ifi/ examined in the current study). Linguistically unconstrained voicing offsets are instead thought to facilitate the transition into tidal breathing.53 It is therefore possible that the voicing mechanisms required to execute the devoicing gesture to produce /ifi/ require different abductory durations than that of a sustained /ɑ/ or the words “hallways” and “always.”

In addition to differences in the stimuli acquired to assess voicing offset, the definition of abduction duration inherently differed between studies. Whereas the current study measured the start of abduction as the last maximum contact of the vocal folds as observed in the laryngoscopic images, Watson, Roark and Baken53 identified this time from the cross-correlation of the amplitude of an electroglottograph signal that was band-pass filtered ±40% of the speaker’s fo. The current study considered the termination of the abduction gesture to be the cessation of vocal fold vibration, as determined from the laryngoscopic images. Watson, Roark and Baken53 computed this time point using acoustic voicing offset; as it is not uncommon for voicing offset to occur earlier in the acoustic signal than the true cessation of vocal fold vibrations,52 it is unsurprising that the absolute durations of the abductory gesture differ between studies.

The values obtained for RFF at voicing offset cycle 10 are comparable to those observed in the literature. In Experiment 1, the average RFF value at offset cycle 10 was −0.95 ST across the 50 speakers with typical voices. The speakers with typical voices in Experiment 2 exhibited a mean RFF of −1.23 ST, whereas speakers with PD demonstrated an average RFF of −2.04 ST. The mean values reported for the typical speakers are well within range of those reported in the literature for offset cycle 10.13,15,5456 Likewise, the observed mean RFF values at voicing offset cycle 10 in speakers with PD are similar to those reported in the literature: Goberman and Blomgren15 reported an RFF value of −2.20 ST at offset cycle 10 for speakers with PD while on medication and Stepp16 saw a mean offset RFF value of −1.90 ST for speakers with PD while on medication.

B. Effects of Speaker Age on Vocal Fold Abduction and RFF

Prior work examining RFF proposed a relationship between vocal fold abduction and RFF, wherein abductory mechanisms during intervocalic offsets differ between young and older adults.13 In particular, it has been hypothesized that older adults primarily rely on a prolonged vocal fold abductory gesture to devoice rather than laryngeal muscle tension as in young adults. Yet few studies have physiologically examined the supposed relationship between vocal fold abduction and RFF, or the impacts of speaker age on this relationship. Based on this prior work, it was hypothesized that vocal fold abduction (via metrics of abduction duration and glottic angle at voicing offset) would be positively related to speaker age whereas RFF at voicing offset cycle 10 would be negatively related to speaker age.

Although there were no significant correlations between age and either abduction duration or RFF, a significant negative relationship was observed between glottic angle and age. These results lend support to the notion that vocal fold abduction during intervocalic offsets differs with age, though not in the expected direction: larger glottic angles were expected with increasing speaker age due to the hypothesized “prolonged abductory gesture” in older adults, but current findings demonstrate the opposite. These results are not in line with speculations by Watson13 that older speakers rely more than young speakers on vocal fold abduction (via longer abduction durations and/or glottic angles) to achieve devoicing. However, it must be considered that prior work such as that of Watson13 examined voicing offsets relative to speaker age as a categorical factor comprising broad age groupings of “young” versus “older” adults. Speaker age was introduced in this study as a covariate to provide more informative interpretations of the underpinnings of voicing offset across the continuous spectrum of age. In particular, these results suggest that speakers may not be able to perform the quick changes in glottic angle necessary to assist in devoicing with increasing age, perhaps due to the effects of age-related vocal fold atrophy that present as vocal fold bowing, spindle-shaped glottal gap, prominent vocal processes, and thinning of the vocal fold mucosa.17,5762 Additional work is therefore needed to comprehensively assess the effects of vocal fold atrophy on vocal fold abductory kinematics.

C. Effects of Parkinson’s Disease on Vocal Fold Abduction and RFF

It was hypothesized that speakers with PD would exhibit significantly lower RFF values at voicing offset in the absence of significant differences in vocal fold abduction when compared to age- and sex-matched control speakers. These hypotheses were based on the findings of prior work demonstrating increased levels of laryngeal muscle activity in PD.26,27 As hypothesized, speakers with PD exhibited lower average RFF values for voicing offset cycle 10 (M = −2.04 ST) compared to age- and sex-matched control speakers (M = −1.27 ST). However, speakers with PD also demonstrated significantly longer abduction durations (M = 67.4 ms) on average when compared to controls (M = 53.9 ms).

Although these findings do not support the hypothesis that older speakers with typical voices leverage a prolonged abductory gesture for devoicing, it is possible that speaker with PD require such a gesture to effectively terminate vocal fold vibration during intervocalic voicing offsets. One mechanism that could account for the observed trends is a prolonged abductory gesture to counteract increased baseline laryngeal muscle tension observed in PD.26,27 It may not be possible for speakers with PD to further increase laryngeal activity from baseline to control the rapid changes in vocal fold vibration at voicing offset; instead, a prolonged vocal fold abductory gesture may be necessary to cease vocal fold vibration. An alternative mechanism that may contribute to longer abduction durations and lower RFF values in speakers with PD may be an overall slowness of movement in voice mechanisms. Specifically, bradykinesia (slower movements) and hypokinesia (slower, smaller movements) have been observed in PD, with many studies noting smaller and/or slower speech articulatory movements and impairments in the monitoring of speech movement timing at the segmental level.6369 A possible explanation for the observed prolonged abductory gesture may follow these observations in the articulatory domain, wherein voluntary movements of the voicing mechanism are slower (and for some speakers, smaller) in PD during linguistically constrained voicing offsets. Yet additional work is needed to address these speculations and, moreover, elucidate the mechanisms contributed to the observed prolonged abductory gesture in speakers with PD.

It was also hypothesized that both vocal fold abduction and speaker group (control, PD) would be significantly predictive of RFF at voicing offset. Abduction duration and glottic angle at voicing offset demonstrated medium- and large-sized significant effects, respectively, indicating that RFF is related to vocal fold abductory patterns during devoicing and—moreover—that changes in RFF during voicing offset may be captured in part via changes in abduction duration and glottic angle. These results support findings from Serry, Stepp and Peterson43 in which computational models were used to demonstrate that RFF is sensitive to the duration of the abduction period. The authors suggested that this sensitivity, in combination with a dependency on abduction initiation time, leads to variations in RFF values observed clinically. Interestingly, neither speaker group nor its interactions (group × sex, group × Tabd, and group × θoff) were significant predictors in the model for RFF. Taken together with findings from Serry, Stepp and Peterson,43 our results suggest that—although vocal fold abductory kinematics may differ between some speakers with and without PD—the observed differences in RFF values at voicing offset may not be attributed to group differences in vocal fold abduction. These differences may instead be related to increased levels of baseline laryngeal muscle tension that have been observed in speakers with PD; however, it is also possible that another mechanism is at play (e.g., laryngeal height). Thus, although these findings support a relationship between vocal fold abduction and RFF, additional work is needed to comprehensively investigate how laryngeal muscle tension and vocal fold abduction mechanisms differentially contribute to measures of RFF at intervocalic offsets.

D. Limitations and Future Directions

The experiments conducted in this study sought to determine the contribution of vocal fold abduction to RFF during intervocalic voicing offsets in speakers with typical voices and speakers with idiopathic PD. The identified relationships between vocal fold abduction and RFF may not be generalizable to speakers outside of these groups, particularly as speakers with idiopathic PD are only a subset of individuals with disorders that have has been associated with increased levels of intrinsic laryngeal muscle tension.26,27 For instance, speakers with vocal hyperfunction may exhibit excessive or imbalanced laryngeal muscle forces.70 However, the manifestation of vocal hyperfunction is broad, wherein hyperfunctional vocal behaviors may occur in the presence or absence of organic pathology (e.g., vocal nodules), and may be the primary cause of a voice disorder or as a compensatory adaptation to glottal insufficiency. It is therefore unclear whether speakers who exhibit signs of vocal hyperfunction would demonstrate similar trends in vocal fold abductory kinematics as speakers with PD. Although this work attempts to provide insight about the relationship between vocal fold abduction and RFF, future work should aim to expand upon the patterns described here.

Within this vein, the current study was limited to examining the duration and magnitude of speaker abductory gestures during linguistically constrained voicing offsets. Future work should leverage a more comprehensive set of metrics to describe vocal fold abductory kinematics. For instance, timings corresponding to the start of the abductory gesture or the rate of change of the glottic angle could be examined to detect possible deficits in voice initiation or freezing associated with PD or voice disorders such as laryngeal dystonia. Such an investigation should examine additional phonetic contexts (e.g., /iti/) to elucidate inter- and intra-group differences that may not be fully characterized when examining the VCV utterance, /ifi/.

A methodological limitation of this study is in using semi-automated algorithms to examine voicing offsets. In particular, two algorithms were used to extract metrics of vocal fold abductory kinematics from high-speed video images and estimates of RFF from the microphone signal; these algorithms were used to process data at a faster rate than would be possible manually. As a result, however, a large portion of data was rejected due to the conservative decision-making criteria of the algorithms (40.3% in Experiment 1, 39.7% in Experiment 2). Despite this loss of data, subsequent analyses were conducted on the full dataset of 50 speakers in Experiment 1 and 40 speakers in Experiment 2, wherein each speaker had an average of 8.8 and 8.8 usable /ifi/ productions for statistical analysis, respectively. This is notably larger than the recommended number of VCV productions for reliable RFF analysis41. Nonetheless, a significant relationship was observed between the number of missing or unusable /ifi/ productions and speaker sex within the first experiment. Although it is outside the scope of this study, future work should be carried out to identify and mitigate factors that may have led to more female /ifi/ productions being rejected during processing and/or less productions being collected during high-speed videoendoscopic recording (e.g., discomfort during scoping).

Finally, further investigation is needed to characterize the relationship between vocal fold abduction and RFF in the presence of excessive and/or imbalanced laryngeal muscle forces, as tension was not precisely quantified in this work (e.g., via laryngeal electromyography). Characterizing abductory kinematics in speakers with other voice disorders associated with excessive tension may also be a useful step toward isolating the differential contributions of tension and abduction in intervocalic voicing offsets.

V. Conclusion

The current study sought to examine the relationship between vocal fold abduction and RFF during intervocalic offsets. The results of this work support vocal fold abductory patterns as a potential mechanism of RFF, wherein changes in RFF during voicing offset may be captured in part via glottic angle. Despite previous speculations of a prolonged abductory gesture in older adults, however, vocal fold abductory patterns did not significantly affect RFF when considering speaker age. Additionally, vocal fold abductory patterns did not significantly affect RFF when examining speakers with and without idiopathic Parkinson’s disease, a neurodegenerative disease for which excessive intrinsic laryngeal muscle forces have been observed. These findings suggest that differences in RFF values at intervocalic offsets are likely not due to differences in vocal fold abduction across groups but may instead be related to increased levels of baseline laryngeal muscle tension observed in PD or some other mechanism (e.g., laryngeal height). Future work is needed to comprehensively investigate how laryngeal muscle tension and vocal fold abduction mechanisms differentially contribute to measures of RFF at intervocalic offsets.

Acknowledgements

This work was supported by the National Science Foundation under Grant No. 1247312 (J.M.V.) and the National Institutes of Health under Grant No. DC015570 (C.E.S.). The authors would like to acknowledge Victoria McKenna, Daniel Buckley, Dante Cilento, Austin Luong, Jacob Noordzij, Jr., Yeonggwang Park, and Manuel Díaz-Cadíz for their technical assistance with data acquisition and analysis.

Disclosure:

J.M. Vojtech receives financial compensation through employment by Delsys, Inc., a commercial company that manufactures, markets, and sells Electromyographic Sensors and other Physiological Measurement Systems. C.E. Stepp has received consulting fees from Altec, Inc. and Delsys, Inc., companies focused on developing and commercializing technologies related to human movement. Stepp’s interests were reviewed and are managed by Boston University in accordance with their conflict of interest policies.

VII. Appendix A

Table A.1.

Demographic information of participants examined in Experiment 1.

Participants 1–25 Participants 26–50
ID Sex Age CAPE-V OS ID Sex Age CAPE-V OS
C1 F 18 8.8 C26 M 50 5.6
C2 M 21 3.3 C27 F 59 9.8
C3 F 21 7.8 C28 F 30 0.6
C4 M 18 5.6 C29 M 18 6.8
C5 F 29 3.3 C30 F 21 23.5
C6 F 20 4.1 C31 M 25 6.4
C7 F 18 3.3 C32 F 25 3.4
C8 F 63 10.0 C33 M 41 3.8
C9 M 53 6.3 C34 M 22 5.4
C10 M 67 18.1 C35 M 26 6.1
C11 M 46 15.7 C36 M 61 3.0
C12 F 26 4.4 C37 M 62 4.8
C13 F 19 6.3 C38 M 21 2.7
C14 M 54 6.6 C39 M 67 15.4
C15 F 66 7.4 C40 F 67 10.7
C16 M 20 2.1 C41 F 54 6.0
C17 F 80 22.5 C42 M 76 34.2
C18 M 29 3.0 C43 F 61 6.1
C19 M 31 1.9 C44 F 68 113
C20 F 25 6.6 C45 F 61 8.4
C21 F 73 4.6 C46 F 83 25.6
C22 M 81 24.2 C47 M 24 2.4
C23 F 20 3.4 C48 F 57 6.3
C24 M 47 7.8 C49 F 70 5.6
C25 M 21 5.8 C50 M 64 9.7

Note. CAPE-V OS = Consensus of Auditory-Perceptual Evaluation of Voice, Overall Severity of Dysphonia

Table A.2.

Demographic information of participants without Parkinson’s disease examined in Experiment 2.

Participant Sex Age CAPE-V OS
C1 M 67 1.7
C2 M 53 6.3
C3 M 67 18.1
C4 M 54 6.6
C5 F 73 12.5
C6 F 73 4.6
C7 M 81 24.2
C8 M 47 7.8
C9 F 59 9.8
C10 M 67 7.1
C11 M 61 3.0
C12 M 62 4.8
C13 M 67 15.4
C14 F 54 6.0
C15 M 76 34.2
C16 F 62 17.7
C17 M 66 10.7
C18 F 70 5.6
C19 M 67 6.7
C20 M 64 9.7

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  • 1.Witte RS, Witte JS. Statistics. Wiley; 2010. [Google Scholar]
  • 2.Boone DR, McFarlane SC, Von Berg SL, Zraick RI. The Voice and Voice Therapy. Allyn & Bacon Communication Sciences and Disorders. Pearson; 2014. [Google Scholar]
  • 3.Löfqvist A, Baer T, McGarr NS, Story RS. The cricothyroid muscle in voicing control. Journal of the Acoustical Society of America. 1989;85(3):1314–1321. [DOI] [PubMed] [Google Scholar]
  • 4.Fukui N, Hirose H. Laryngeal adjustments in Danish voiceless obstruent production. Annual Bulletin Research Institute of Logopedics and Phoniatrics, University of Copenhagen. 1983;17:61–71. [Google Scholar]
  • 5.Stevens KN. Physics of laryngeal behavior and larynx modes. Phonetica. 1977;34(4):264–79. [DOI] [PubMed] [Google Scholar]
  • 6.Hirano M Morphological structure of the vocal cord as a vibrator and its variations. Folia Phoniatrica et Logopaedica. 1974;26(2):89–94. [DOI] [PubMed] [Google Scholar]
  • 7.Sonninen A, Hurme P, Laukkanen AM. The external frame function in the control of pitch, register, and singing mode: Radiographic observations of a female singer. Journal of Voice. 1999;13(3):319–40. [DOI] [PubMed] [Google Scholar]
  • 8.Choi HS, Berke GS, Ye M, Kreiman J. Function of the posterior cricoarytenoid muscle in phonation: In vivo laryngeal model. Otolaryngology–Head and Neck Surgery. 1993;109(6):1043–51. [DOI] [PubMed] [Google Scholar]
  • 9.Faaborg-Andersen K Electromyographic investigation of intrinsic laryngeal muscles in humans. Acta Physiol Scand. 1957;41(140):1–150.13497754 [Google Scholar]
  • 10.Fujita M, Ludlow CL, Woodson GE, Naunton RF. A new surface electrode for recording from the posterior cricoarytenoid muscle. The Laryngoscope. 1989;99(3):316–20. [DOI] [PubMed] [Google Scholar]
  • 11.Hirano M Vocal mechanisms in singing: Laryngological and phoniatric aspects. Journal of Voice. 1988;2(1):51–69. [Google Scholar]
  • 12.Rothenberg M, Mahshie JJ. Monitoring vocal fold abduction through vocal fold contact area. Journal of Speech, Language, and Hearing Research. 1988;31(3):338–351. [DOI] [PubMed] [Google Scholar]
  • 13.Watson BC. Fundamental frequency during phonetically governed devoicing in normal young and aged speakers. Journal of the Acoustical Society of America. 1998;103(6):3642–3647. [DOI] [PubMed] [Google Scholar]
  • 14.Ramig LO, Verdolini K. Treatment efficacy: Voice disorders. Journal of Speech, Language, and Hearing Research. Feb 1998;41(1):S101–16. [DOI] [PubMed] [Google Scholar]
  • 15.Goberman AE, Blomgren M. Fundamental frequency change during offset and onset of voicing in individuals with Parkinson disease. Journal of Voice. 2008;22(2):178–191. [DOI] [PubMed] [Google Scholar]
  • 16.Stepp CE. Relative fundamental frequency during vocal onset and offset in older speakers with and without Parkinson’s disease. Journal of the Acoustical Society of America. 2013;133(3):1637–43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Takano S, Kimura M, Nito T, Imagawa H, Sakakibara K-I, Tayama N. Clinical analysis of presbylarynx—vocal fold atrophy in elderly individuals. Auris Nasus Larynx. 2010;37(4):461–464. [DOI] [PubMed] [Google Scholar]
  • 18.Van Den Eeden SK, Tanner CM, Bernstein AL, et al. Incidence of Parkinson’s disease: Variation by age, gender, and race/ethnicity. American Journal of Epidemiology. 2003;157(11):1015–1022. [DOI] [PubMed] [Google Scholar]
  • 19.Schapira AHV, Chaudhuri KR, Jenner P. Non-motor features of Parkinson disease. Nature Reviews Neuroscience. Jul 2017;18(7):435–450. [DOI] [PubMed] [Google Scholar]
  • 20.Braak H, Del Tredici K, Rub U, de Vos RA, Jansen Steur EN, Braak E. Staging of brain pathology related to sporadic Parkinson’s disease. Neurobiol Aging. 2003;24(2):197–211. [DOI] [PubMed] [Google Scholar]
  • 21.Dietz V, Quintern J, Berger W. Electrophysiological studies of gait in spasticity and rigidity: Evidence that altered mechanical properties of muscle contribute to hypertonia. Brain. 1981;104(3):431–49. [DOI] [PubMed] [Google Scholar]
  • 22.Edstrom L Histochemical changes in upper motor lesions, Parkinsonism and disuse: Differential effect on white and red muscle fibres. Experientia. 1968;24(9):916–7. [DOI] [PubMed] [Google Scholar]
  • 23.Mu L, Sobotka S, Chen J, et al. Altered pharyngeal muscles in Parkinson disease. J Neuropathol Exp Neurol. 2012;71(6):520–530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Rossi B, Siciliano G, Carboncini MC, et al. Muscle modifications in Parkinson’s disease: Myoelectric manifestations. Electroencephalogr Clin Neurophysiol. 1996;101(3):211–8. [DOI] [PubMed] [Google Scholar]
  • 25.Watts RL, Wiegner AW, Young RR. Elastic properties of muscles measured at the elbow in man: II. Patients with Parkinsonian rigidity. Journal of Neurology, Neurosurgery, and Psychiatry. 1986;49(10):1177–81. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Zarzur AP, Duprat AC, Cataldo BO, Ciampi D, Fonoff E. Laryngeal electromyography as a diagnostic tool for Parkinson’s disease. The Laryngoscope. 2013;124(3):725–729. [DOI] [PubMed] [Google Scholar]
  • 27.Zarzur AP, Duprat AC, Shinzato G, Eckley CA. Laryngeal electromyography in adults with Parkinson’s disease and voice complaints. The Laryngoscope. 2007;117(5):831–834. [DOI] [PubMed] [Google Scholar]
  • 28.Dalrymple-Alford JC, MacAskill MR, Nakas CT, et al. The MOCA. Neurology. 2010;75(19):1717. [DOI] [PubMed] [Google Scholar]
  • 29.Stepp CE, Heaton JT, Stadelman-Cohen TK, Braden MN, Jette ME, Hillman RE. Characteristics of phonatory function in singers and nonsingers with vocal fold nodules. Journal of Voice. 2011;25(6):714–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Burk MH, Wiley TL. Continuous versus pulsed tones in audiometry. American Journal of Audiology. 2004;13(1):54–61. [DOI] [PubMed] [Google Scholar]
  • 31.American Speech-Language-Hearing Association. Guidelines for manual pure-tone threshold audiometry. Rockville, MD: 2005. [Google Scholar]
  • 32.Schow RL. Considerations in selecting and validating an adult/elderly hearing screening protocol. Ear and Hearing. 1991;12(5):337–48. [DOI] [PubMed] [Google Scholar]
  • 33.Lien YS, Gattuccio CI, Stepp CE. Effects of phonetic context on relative fundamental frequency. Journal of Speech, Language, and Hearing Research. 2014;57:1259–1267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Baken RJ, Orlikoff RF. Clinical Measurement of Speech and Voice. Singular Thomson Learning; 2000. [Google Scholar]
  • 35.Dailey SH, Kobler JB, Hillman RE, et al. Endoscopic measurement of vocal fold movement during adduction and abduction. The Laryngoscope. 2005;115(1):178–83. [DOI] [PubMed] [Google Scholar]
  • 36.Diaz-Cadiz M, McKenna VS, Vojtech JM, Stepp CE. Adductory vocal fold kinematic trajectories during conventional versus high-speed videoendoscopy. Journal of Speech, Language, and Hearing Research. 2019;62(6):1685–1706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Ikuma T, Kunduk M, McWhorter AJ. Preprocessing techniques for high-speed videoendoscopy analysis. Journal of Voice. 2013;27(4):500–5. [DOI] [PubMed] [Google Scholar]
  • 38.Vojtech JM, Segina RK, Buckley DP, et al. Refining algorithmic estimation of relative fundamental frequency: Accounting for sample characteristics and fundamental frequency estimation method. Journal of the Acoustical Society of America. 2019;146(5):3184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Lien YS, Michener CM, Eadie TL, Stepp CE. Individual monitoring of vocal effort with relative fundamental frequency: Relationships with aerodynamics and listener perception. Journal of Speech, Language, and Hearing Research. 2015;58(3):566–575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.McKenna VS, Heller Murray ES, Lien YS, Stepp CE. The relationship between relative fundamental frequency and a kinematic estimate of laryngeal stiffness in healthy adults. Journal of Speech, Language, and Hearing Research. 2016;59(6):1283–1294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Eadie TL, Stepp CE. Acoustic correlate of vocal effort in spasmodic dysphonia. Annals of Otology, Rhinology, and Laryngology. 2013;122(3):169–176. [DOI] [PubMed] [Google Scholar]
  • 42.Heller Murray ES, Lien YS, Van Stan JH, et al. Relative fundamental frequency distinguishes between phonotraumatic and non-phonotraumatic vocal hyperfunction. Journal of Speech, Language, and Hearing Research. 2017;60(6):1507–1515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Serry MA, Stepp CE, Peterson SD. Physics of phonation offset: Towards understanding relative fundamental frequency observations. The Journal of the Acoustical Society of America. 2021;149(5):3654–3664. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Gillies GE, Pienaar IS, Vohra S, Qamhawi Z. Sex differences in Parkinson’s disease. Front Neuroendocrinol. 2014;35(3):370–384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Kempster GB, Gerratt BR, Verdolini Abbott K, Barkmeier-Kraemer JM, Hillman RE. Consensus auditory-perceptual evaluation of voice: Development of a standardized clinical protocol. American Journal of Speech-Language Pathology. 2009;18(2):124–132. [DOI] [PubMed] [Google Scholar]
  • 46.Martínez-Martín P, Rodríguez-Blázquez C, Mario A, et al. Parkinson’s disease severity levels and MDS-Unified Parkinson’s Disease Rating Scale. Parkinsonism and Related Disorders. 2015;21(1):50–54. [DOI] [PubMed] [Google Scholar]
  • 47.Goetz CG, Poewe W, Rascol O, et al. Movement Disorder Society task force report on the hoehn and yahr staging scale: Status and recommendations the Movement Disorder Society task force on rating scales for Parkinson’s disease. Movement Disorders. 2004;19(9):1020–1028. [DOI] [PubMed] [Google Scholar]
  • 48.Hoehn MM, Yahr MD. Parkinsonism: Onset, progression and mortality. Neurology. 1967;17(5):427–442. [DOI] [PubMed] [Google Scholar]
  • 49.Iwahashi T, Ogawa M, Hosokawa K, Kato C, Inohara H. A detailed motion analysis of the angular velocity between the vocal folds during throat clearing using high-speed digital imaging. Journal of Voice. 2016;30(6):770.e1–8. [DOI] [PubMed] [Google Scholar]
  • 50.Poletto CJ, Verdun LP, Strominger R, Ludlow CL. Correspondence between laryngeal vocal fold movement and muscle activity during speech and nonspeech gestures. Journal of Applied Physiology. 2004;97(3):858–66. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Park Y, Wang F, Díaz-Cádiz ME, Vojtech JM, Groll MD, Stepp CE. Vocal fold kinematics and relative fundamental frequency as a function of obstruent type and speaker age. Journal of the Acoustical Society of America. 2021;149(4):2189–2199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Patel RR, Forrest K, Hedges D. Relationship between acoustic voice onset and offset and selected instances of oscillatory onset and offset in young healthy men and women. Journal of Voice. 2017;31(3):389.e9–389.e17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Watson BC, Roark RM, Baken RJ. Vocal release time: A quantification of vocal offset. Journal of Voice. 2012;26(6):682–7. [DOI] [PubMed] [Google Scholar]
  • 54.Robb MP, Smith AB. Fundamental frequency onset and offset behavior: A comparative study of children and adults. Journal of Speech, Language, and Hearing Research. 2002;45(3):446–456. [DOI] [PubMed] [Google Scholar]
  • 55.Stepp CE, Hillman RE, Heaton JT. The impact of vocal hyperfunction on relative fundamental frequency during voicing offset and onset. Journal of Speech, Language, and Hearing Research. 2010;53:1220–1226. [DOI] [PubMed] [Google Scholar]
  • 56.Stepp CE, Sawin DE, Eadie TL. The relationship between perception of vocal effort and relative fundamental frequency during voicing offset and onset. Journal of Speech, Language, and Hearing Research. 2012;55(6):1887–96. [DOI] [PubMed] [Google Scholar]
  • 57.Omori K, Slavit DH, Matos C, Kojima H, Kacker A, Blaugrund SM. Vocal fold atrophy: Quantitative glottic measurement and vocal function. Annals of Otology, Rhinology, and Laryngology. 1997;106(7):544–51. [DOI] [PubMed] [Google Scholar]
  • 58.Bloch I, Behrman A. Quantitative analysis of videostroboscopic images in presbylarynges. The Laryngoscope. 2001;111(11):2022–2027. [DOI] [PubMed] [Google Scholar]
  • 59.Isshiki N, Shoji K, Kojima H, Hirano S. Vocal fold atrophy and its surgical treatment. Annals of Otology, Rhinology, and Laryngology. 1996;105(3):182–188. [DOI] [PubMed] [Google Scholar]
  • 60.Angerstein W Vocal changes and laryngeal modifications in the elderly (presbyphonia and presbylarynx). Laryngo-Rhino-Otologie. 2018;97(11):772–776. [DOI] [PubMed] [Google Scholar]
  • 61.Pontes P, Brasolotto A, Behlau M. Glottic characteristics and voice complaint in the elderly. Journal of Voice. 2005;19(1):84–94. [DOI] [PubMed] [Google Scholar]
  • 62.Rodeño MT, Sánchez-Fernández JM, Rivera-Pomar JM. Histochemical and morphometrical ageing changes in human vocal cord muscles. Acta Otolaryngol. 1993;113(3):445–449. [DOI] [PubMed] [Google Scholar]
  • 63.Ackermann H, Gröne BF, Hoch G, Schönle PW. Speech Freezing in Parkinson’s Disease: A Kinematic Analysis of Orofacial Movements by Means of Electromagnetic Articulography. Folia Phoniatrica et Logopaedica. 1993;45(2):84–89. [DOI] [PubMed] [Google Scholar]
  • 64.Ackermann H, Konczak J, Hertrich I. The Temporal Control of Repetitive Articulatory Movements in Parkinson’s Disease. Brain Lang. 1997;56(2):312–319. [DOI] [PubMed] [Google Scholar]
  • 65.Connor NP, Abbs JH, Cole KJ, Gracco VL. Parkinsonian Deficits in Serial Multiarticulate Movements for Speech. Brain. 1989;112(4):997–1009. [DOI] [PubMed] [Google Scholar]
  • 66.Forrest K, Weismer G. Dynamic Aspects of Lower Lip Movement in Parkinsonian and Neurologically Normal Geriatric Speakers’ Production of Stress. Journal of Speech, Language, and Hearing Research. 1995;38(2):260–272. [DOI] [PubMed] [Google Scholar]
  • 67.Forrest K, Weismer G, Turner GS. Kinematic, acoustic, and perceptual analyses of connected speech produced by Parkinsonian and normal geriatric adults. The Journal of the Acoustical Society of America. 11989;85(6):2608–2622. [DOI] [PubMed] [Google Scholar]
  • 68.Yunusova Y, Weismer G, Kent Ray D, Rusche Nicole M. Breath-Group Intelligibility in Dysarthria. Journal of Speech, Language, and Hearing Research. 2005;48(6):1294–1310. [DOI] [PubMed] [Google Scholar]
  • 69.Kearney E, Giles R, Haworth B, Faloutsos P, Baljko M, Yunusova Y. Sentence-Level Movements in Parkinson’s Disease: Loud, Clear, and Slow Speech. Journal of Speech, Language, and Hearing Research. 2017;60(12):3426–3440. [DOI] [PubMed] [Google Scholar]
  • 70.Hillman RE, Holmberg EB, Perkell JS, Walsh M, Vaughan C. Objective assessment of vocal hyperfunction: An experimental framework and initial results. Journal of Speech and Hearing Research. 1989;32(2):373–92. [DOI] [PubMed] [Google Scholar]

RESOURCES