Skip to main content
Journal of Speech, Language, and Hearing Research : JSLHR logoLink to Journal of Speech, Language, and Hearing Research : JSLHR
. 2025 May 5;68(6):2721–2740. doi: 10.1044/2025_JSLHR-24-00467

Reliability and Diagnostic Accuracy of Semi-Automated and Automated Acoustic Quantification of Vocal Tremor Characteristics

Youri Maryn a,b,c,d,, Kaitlyn Dwenger e,f, Sidney Kaufmann e, Julie Barkmeier-Kraemer e,f
PMCID: PMC12173159  PMID: 40324153

Abstract

Purpose:

This study compared three methods of acoustic algorithm-supported extraction and analysis of vocal tremor properties (i.e., rate, extent, and regularity of intensity level and fundamental frequency modulation): (a) visual perception and manual data extraction, (b) semi-automated data extraction, and (c) fully automated data extraction.

Method:

Forty-five midvowel sustained [a:] and [i:] audio recordings were collected as part of a scientific project to learn about the physiologic substrates of vocal tremor. This convenience data set contained vowels with a representative variety in vocal tremor severity. First, the vocal tremor properties in intensity level and fundamental frequency tracks were visually inspected and manually measured using Praat software. Second, the vocal tremor properties were determined using two Praat scripts: automated with the script of Maryn et al. (2019) and semi-automated with an adjusted version of this script to enable the user to intervene with the signal processing. The reliability of manual vocal tremor property measurement was assessed using the intraclass correlation coefficient. The properties as measured with the two scripts (automated vs. semi-automated) were compared with the manually determined properties using correlation and diagnostic accuracy statistical methods.

Results:

With intraclass correlation coefficients between .770 and .914, the reliability of the manual method was acceptable. The semi-automated method correlated with manual property measures better and was more accurate in diagnosing vocal tremor than the automated method.

Discussion:

Manual acoustic measurement of vocal tremor properties can be laborious and time-consuming. Automated or semi-automated acoustic methods may improve efficiency in vocal tremor property measurement in clinical as well as research settings. Although both Praat script-supported methods in this study yielded acceptable validity with the manual data measurements as a referent, the semi-automated method showed the best outcomes.

Supplemental Material:

https://doi.org/10.23641/asha.28873088


Tremor is a neurological movement disorder typified by involuntary and approximately rhythmic movements of one or more body parts (Crawford & Zimmerman, 2011; Elble, 2016; Louis, 2016). When tremor affects muscles of the speech system, it can result in a vocal tremor. Vocal tremor is a neurological voice disorder that results from tremor observed within speech structures and is typically associated with other neurological disorders, such as essential tremor, Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, multiple sclerosis, dystonia, multiple system atrophy, cerebellar degeneration, progressive supranuclear palsy, and stroke (Hlavnička et al., 2020; Sulica & Louis, 2010; Wolraich et al., 2010). Torrecillas et al. (2021) characterized the clinical phenotypes for the various types of vocal tremor using combined tremor classification criteria from the International Parkinson and Movement Disorder Society (IPMDS; Bhatia et al., 2018) and laryngeal features from the American Academy of Otolaryngology–Head and Neck Surgery (AAO-HNS; Merati et al., 2005). Several studies have documented the association of tremulous voice modulation with oscillatory movements of the tongue (Bove et al., 2006; Gamboa et al., 1998; Jiang et al., 2000; Lester et al., 2013; Sulica & Louis, 2010), larynx (Ackermann & Ziegler, 1991; Bove et al., 2006; Finnegan et al., 2003; Gamboa et al., 1998; Koda & Ludlow, 1992; Sulica & Louis, 2010; Tomoda et al., 1987), pharyngeal constrictors (Bove et al., 2006; Lester et al., 2013; Sulica & Louis, 2010), soft palate (Bove et al., 2006; Gillivan-Murphy et al., 2016; Panda et al., 2013; Sulica & Louis, 2010), and respiratory musculature (Hachinski et al., 1975; Tomoda et al., 1987). Vocal tremor has been shown to significantly affect communication, participation, quality of life, and well-being (Dromey et al., 2002; Gillivan-Murphy et al., 2019; Mendoza & Carballo, 1999; Merati et al., 2005; Sulica & Louis, 2010).

To document vocal tremor based on the acoustic speech signal, the signal first needs to be decomposed into vocal tremor–related features, which typically are fundamental frequency (f0) and sound intensity level (IL). Once isolated, these time-varying features can be inspected for the presence of a more or less cyclic modulation with a rate between 3 and 12 Hz (Duffy, 1995). Subsequently, the modulation properties rate (i.e., frequency), extent (i.e., depth or amplitude), and regularity (i.e., stability or periodicity) can be described as measurements of vocal tremor. Maryn et al. (2019) explored the concurrent and diagnostic validity of software to automatically decompose the time-varying features of speech signals and determine tremor modulation properties. Such measurement is regarded as pertinent and clinically useful for multiple reasons. That is, measuring vocal tremor modulation properties has potential for use in comparisons and differential diagnosis of patient groups, such as healthy individuals, and those with Parkinson's disease, essential vocal tremor, laryngeal dystonia, Huntington's disease, and myotonic dystrophy (Barkmeier-Kraemer et al., 2018; Dominguez et al., 2018; Gillivan-Murphy et al., 2019; Ramig et al., 1988; Tanaka et al., 2011). Measuring vocal tremor properties also enables clinicians to track the severity of tremor affecting the voice over time during clinical treatment. Current treatment approaches for vocal tremor include (a) injection of botulinum toxin into laryngeal or strap musculature (Adler et al., 2004; Bove et al., 2006; Gurey et al., 2013; Kendall & Leonard, 2011; Nelson et al., 2019; Sulica & Louis, 2010; Warrick et al., 2000); (b) intake of systemic medication (Julius & Longfellow, 2016; Khoury & Randall, 2022; Ondo, 2016), such as primidone (Hartman & Vishwanat, 1984; Nida et al., 2016), propanolol (Koller, 1985; Koller et al., 1985), methazolamide (Busenbark et al., 1996; Muenter et al., 1991), sodium oxybate (Simonyan & Frucht, 2013), octanoic acid (Lowell et al., 2019, 2021), and cannabinoids (Millman et al., 2023); (c) deep-brain stimulation, with described beneficial effect when applied on ventro-intermediate thalamic nucleus (Erickson-DiRenzo, Kuijper, et al., 2020; Erickson-DiRenzo, Sung, et al., 2020; Ho et al., 2019; Lu et al., 2023; Ruckart et al., 2023), caudal zona incerta (Hägglund et al., 2016), and ventro-latero-posterior thalamic nucleus (Matsumoto et al., 2016); and (d) behavioral speech/voice therapy with strategies for reducing perceived vocal tremor (Barkmeier-Kraemer, 2013, 2016, 2019; Barkmeier-Kraemer et al., 2011; Ghosh et al., 2023). In addition, preliminary findings have shown that the voice symptoms in those with essential vocal tremor may be altered during sensorimotor modulation, such as transcutaneous laryngeal vibrotactile stimulation (Dwenger, 2022, 2024). Regardless of the treatment approach, however, it is important to quantify the changes in the properties and severity of vocal tremor and to follow up on its treatment outcomes.

Various clinical and scientific methods have been applied to the differential diagnosis and documentation of vocal tremor (Maryn et al., 2019), including auditory–perceptual, visual–perceptual/spectrographic, acoustic, aerodynamic, and laryngoscopic methods. However, acoustic methods are of particular interest, considering its noninvasiveness, relatively low cost, ease of application, quantitative output, mostly objective protocol, and potential for automatic or semi-automatic extraction and processing of different speech signal properties. Furthermore, clinical application of acoustic methods is relevant because it addresses the speech signal associated with the symptoms or perceived abnormal speaking patterns. Anand et al. (2012), for example, conducted an acoustic-perceptual study of vocal tremor and explored how perceived vocal tremor severity (PVTS) varies with f0, rate and extent of modulation in f0, and signal-to-noise ratio as an indicator of dysphonia severity. Their findings confirmed that the perceived severity of vocal tremor increases with extent of frequency modulation. They also found that, above a modulation rate of 8 Hz, the interaction between extent and rate of frequency modulation became relevant. However, only four tremorous speakers were studied, and IL modulation was not evaluated. Erickson-DiRenzo, Sung, et al. (2020) is another example of a recently published study in which an acoustic interface was used for the manual extraction of f0 and IL modulation properties and measures, in this case for tracking seven patients' vocal tremor before, during, and after placement of a ventral intermediate nucleus stimulator. Their data also confirmed that perceived severity of vocal tremor increases predominantly with extent of f0 modulation, as well as with extent of IL modulation. However, only seven tremorous speakers were studied. Finally, Maryn et al. (2019) studied the correlation between PVTS and vocal tremor properties (i.e., rate, extent, and regularity of f0 and IL modulation) in 56 subjects with various degrees of vocal tremor. They, too, found extent of f0 and IL modulation to provide the highest correlation with PVTS. Furthermore, via multivariate linear regression analysis including all properties, Maryn et al. (2019) constructed the Acoustic Phonatory Tremor Index (APTI), which showed a highly acceptable rSpearman = .848 with PVTS.

The acoustic method as described in detail in Maryn et al. (2019) consisted of a completely automated set of algorithms to (a) isolate the time-varying f0 and IL trajectories, (b) determine the properties of these trajectories, and (c) present graphically the trajectories, measured properties, and APTI within the Praat program. This tool is considered as objective as possible given it is entirely automated in which the user cannot interfere with the measurements once the algorithm is started. However, vocal tremor properties measured from manually delineated modulation cycles as, for example, in Erickson-DiRenzo, Sung, et al. (2020), can still be considered most trustworthy and, therefore, gold standard. Thus, the reliability of the output of any automated or semi-automated acoustic phonatory tremor analysis tool must be confirmed by comparing it with the output of manual methods. In other words, before clinically implementing an acoustic method for vocal tremor measurement, it is crucial to establish its accuracy as a more objective supplement or complement to a manual method. As such, a preliminary comparison between the fully automated tool from Maryn et al. (2019) and manual determination of vocal tremor properties in a few vowel recordings was completed; however, some discrepancies emerged. Figure 1 illustrates the output of a fully automated and accurate acoustic analysis. Rate of modulation was correctly estimated in the f0 and IL traces with all modulation cycles accurately indicated. Thus, tremor extent and regularity were also accurately measured. However, Figure 2 shows the output of a fully automated yet inaccurate acoustic analysis due to an erroneous tremor rate in the irregular IL trace. Instead of 5.31 Hz as manually indicated (2), the program spuriously selected the spectral peak at 3.92 Hz, leading to faulty modulation cycle detection. Despite this error in the measurement of the IL trace, the tremor rate was accurate in the f0 trace (1). Figure 3 displays the output of a fully automated but also inaccurate acoustic analysis due to erroneous modulation cycle choices. Based on manual annotation (1), the tremor rate indeed is circa 5.4 Hz (at least in the first second of the recording). However, some of the tremor cycles were erroneously located (2), resulting in an inaccurate tremor extent measurement. From these examples, it can be observed that errors occurred in small-ranged and/or irregular modulations: The IL contours in Figures 2 and 3 only had a range of 3.6 dB and 2.5 dB, respectively, and the f0 trace in Figure 3 only had a range of 5.3 Hz. In contrast, the f0 traces in Figures 1 and 2 had a range of 61.3 Hz and 21.0 Hz, respectively. The range of the IL trace in Figure 1 was not much larger with 4.6 dB but was relatively regular and, therefore, more accurate in detecting rate and modulation cycles. Because of these exemplary discrepancies, it was concluded that the completely automated method for acoustic vocal tremor analysis may not be sufficiently correct for clinical use and decision making. Thus, a semi-automated tool may be required and more appropriate, especially in cases where there is some irregularity within f0 and IL modulation cycles. This conclusion initiated adjustments to the acoustic phonatory tremor analysis tool from Maryn et al. (2019), as described in the Methods section. Overall, the adjustments enable the clinician/user to intervene when required, for example, to omit erroneous cycles from the f0 or IL modulations or to specify the tremor rate so it only includes the actual rate and not a period doubling-based rate.

Figure 1.

An acoustic phonatory tremor analysis with five graphs. The top panel shows an acoustic spectrogram with frequency in hertz on the vertical axis ranging from 0 to 2000 and time in seconds on the horizontal axis ranging from 0 to 2000. The spectrogram shows a series of wavy lines representing the frequency of the tremor over time. The middle left panel shows line graphs for the fundamental frequency in hertz on the vertical axis versus time in seconds on the horizontal axis for median extent equals 13.05%, absolute and relative rectangular perturbations with 0.44 hertz and 8.91%, respectively. The lines plot irregular square waves and sinusoidal waveforms between 153.7 and 215.0. The bottom left panel shows line graphs for the intensity level in decibels on the vertical axis versus time in seconds on the horizontal axis for median extent equals 2.56%, absolute and relative rectangular perturbations with 0.08 decibels and 29.47%, respectively. The lines plot irregular square waves and sinusoidal waveforms between 49.1 and 53.7. The middle right panel shows a line graph for amplitude in decibels per hertz on the vertical axis ranging from 51.3 to 114.9 versus frequency in hertz on the horizontal axis ranging from 0 to 25. The line plots a decreasing trend with fluctuations and has a spectral peak frequency of 5.08 hertz and spectral peak prominence of 21.46 decibels. The bottom right panel shows a line graph of the amplitude in decibels per hertz on the vertical axis ranging from 25.7 to 89.3 versus frequency in hertz on the horizontal axis ranging from 0 to 25. The line plots a decreasing trend with fluctuations and has a spectral peak frequency of 5.10 hertz and spectral peak prominence of 18.29 decibels.

Illustration of correct vocal tremor analysis by the script of Maryn et al. (2019). f0 = fundamental frequency; IL = intensity level.

Figure 2.

An acoustic phonatory tremor analysis with five graphs. The top panel shows an acoustic spectrogram with frequency in hertz on the vertical axis ranging from 0 to 2000 and time in seconds on the horizontal axis ranging from 0 to 2.845. The spectrogram shows a series of wavy lines representing the frequency of the tremor over time, which has two bands of tremor rates, the first with 0.1756 seconds, 5.69 hertz in the fundamental frequency and the second with 0.1882 seconds, 5.31 hertz in the intensity level. The middle left panel shows line graphs for the fundamental frequency in hertz on the vertical axis ranging from 146.2 to 167.2 versus time in seconds on the horizontal axis for median extent equals 2.01%, absolute and relative rectangular perturbations with 0.51 hertz and 86.23%, respectively. The lines plot irregular square waves and sinusoidal waveforms. The bottom left panel shows line graphs for the intensity level in decibels on the vertical axis ranging from 62.7 to 66.3 versus time in seconds on the horizontal axis for median extent equals 0.95%, absolute and relative rectangular perturbations with 0.10 decibels and 59.96%, respectively. The lines plot irregular square waves and sinusoidal waveforms. The middle right panel shows a line graph for amplitude in decibels per hertz on the vertical axis ranging from 44.2 to 97.5 versus frequency in hertz on the horizontal axis ranging from 0 to 25. The line plots a decreasing trend with fluctuations and has a spectral peak frequency of 5.60 hertz and spectral peak prominence of 17.00 decibels. The bottom right panel shows a line graph of the amplitude in decibels per hertz on the vertical axis ranging from 11.6 to 77.2 versus frequency in hertz on the horizontal axis ranging from 0 to 25. The line plots a decreasing trend with fluctuations and has a spectral peak frequency of 3.92 hertz and spectral peak prominence of 10.73 decibels.

Illustration of erroneous tremor rate in the IL trace by the script of Maryn et al. (2019) in the first [i:] of Subject 2. (1) Accurate tremor rate in the f0 trace. (2) Inaccurate tremor rate in the IL trace. f0 = fundamental frequency; IL = intensity level.

Figure 3.

An acoustic phonatory tremor analysis with five graphs. The top panel shows an acoustic spectrogram with frequency in hertz on the vertical axis ranging from 0 to 2000 and time in seconds on the horizontal axis ranging from 0 to 2.000. The spectrogram shows a series of wavy lines representing the frequency of the tremor over time, which has a band of tremor rates, with 0.1848 seconds, 5.41 hertz in the fundamental frequency and the intensity level. The middle left panel shows line graphs for the fundamental frequency in hertz on the vertical axis ranging from 116.8 to 122.1 versus time in seconds on the horizontal axis for median extent equals 1.13%, absolute and relative rectangular perturbations with 0.13 hertz and 51.03%, respectively. The lines plot irregular square waves and fluctuations, and have four tremor cycles, with three facing upwards and one facing downward. The bottom left panel shows line graphs for the intensity level in decibels on the vertical axis ranging from 57.7 to 60.2 versus time in seconds on the horizontal axis for median extent equals 0.46%, absolute and relative rectangular perturbations with 0.09 decibels and 107.38%, respectively. The lines plot irregular square waves and fluctuations. The middle right panel shows a line graph for amplitude in decibels per hertz on the vertical axis ranging from 49.4 to 86.0 versus frequency in hertz on the horizontal axis ranging from 0 to 25. The line plots a decreasing trend with fluctuations and has a spectral peak frequency of 5.40 hertz and spectral peak prominence of 9.29 decibels. The bottom right panel shows a line graph of the amplitude in decibels per hertz on the vertical axis ranging from 20.3 to 72.4 versus frequency in hertz on the horizontal axis ranging from 0 to 25. The line plots a decreasing trend with fluctuations and has a spectral peak frequency of 5.48 hertz and spectral peak prominence of 9.03 decibels.

Illustration of erroneously located tremor cycles by the script of Maryn et al. (2019) in the first [i:] of Subject 6. (1) Accurate tremor rate. (2) Inaccurate location of four tremor cycles in the f0 trace. f0 = fundamental frequency; IL = intensity level.

The present study had two goals. First, the reliability of the automated and semi-automated acoustic tools in measuring the f0 and IL modulation properties in vocal tremor was assessed based on correlation with manual acoustic measurement as the standard referent. Second, these tools' diagnostic/clinical validity in differentiating between manually determined absence or presence of vocal tremor was investigated using the receiver operating curve analysis.

Method

Experimental Ethics

Institutional review board approval for this study was obtained at the University of Utah (IRB Protocol 00118469).

Subjects

Fifteen adults enrolled in this institutional review board–approved research study at the University of Utah. Demographic and clinical characteristics are in Table 1. The subjects were identified for participation based on a diagnosis of essential tremor as previously documented in their medical records. Following informed consent to participate, subjects underwent a standard neurologic examination by a movement disorder specialist consisting of a physical examination for Parkinsonism using the Unified Parkinson's Disease Rating Scale, Part III (UPDRS Part III; Movement Disorder Society Task Force on Rating Scales for Parkinson's Disease, 2003), and an assessment of tremor using the Fahn–Tolosa–Marin Tremor Rating Scale (Fahn et al., 1988) to determine if criteria for the diagnosis of essential tremor or essential tremor plus were met. All 15 subjects in this study met criteria for a diagnosis of essential tremor as determined by the movement disorder specialist. The criteria used were based on the updated consensus statement by the IPMDS (Bhatia et al., 2018) and includes:

Table 1.

Demographic and clinical characteristics of the 15 participants in this study.

Characteristic ET + VT ET – VT Total
n 10 5 15
Age in years, M (SD) 71 (5) 47 (4) 65 (12)
Gender, n (%)
 Male 7 (70%) 5 (100%) 12 (80%)
 Female 3 (30%) 0 (0%) 3 (20%)
Race, n (%)
 White 7 (70%) 5 (100%) 12 (80%)
 Asian 1 (10%) 0 (0%) 1 (7%)
 African American 1 (10%) 0 (0%) 1 (7%)
 American Indian/Alaska Native 0 (0%) 0 (0%) 0 (0%)
 Native Hawaiian 0 (0%) 0 (0%) 0 (0%)
 Did not disclose 1 (10%) 0 (0%) 1 (7%)
Ethnicity, n (%)
 Hispanic or Latino 0 (0%) 0 (0%) 0 (0%)
 Non-Hispanic 10 (100%) 4 (80%) 14 (93%)
 Did not disclose 0 (0%) 1 (20%) 1 (7%)
Disease duration in years, M (SD) 27 (22) 33 (13) 28 (22)
Progression in disease, n (%) 5 (50%) 0 (0%) 5 (33%)
VHI 30-item total score, M (SD) 45 (33) 24 (34) 39 (37)
QUEST overall health status, M (SD) 81 (6) 63 (31) 76 (19)
QUEST quality of life, M (SD) 82 (13) 65 (29) 78 (21)
QUEST communication score, M (SD) 45 (29) 21 (22) 38 (32)
QUEST physical score, M (SD) 44 (20) 57 (33) 48 (27)

Note. ET = essential tremor; VT = vocal tremor; VHI = Voice Handicap Index; QUEST = Quality of Life in Essential Tremor Questionnaire.

  1. bilateral upper limb action (kinetic and postural) tremor;

  2. at least 3 years in duration;

  3. with or without tremor in other locations (e.g., head, voice, or lower limbs);

  4. absence of other neurologic signs, such as dystonia, ataxia, or Parkinsonism; and

  5. essential tremor plus: tremor with the characteristics of essential tremor and additional neurological signs of uncertain significance (such as impaired tandem gait, questionable dystonic posturing, memory impairment, or other mild neurologic signs of unknown significance) that do not make an additional syndrome classification or diagnosis (including essential tremor with tremor at rest).

In addition, participants underwent a standard clinical voice assessment including patient questionnaires (e.g., 30-item Voice Handicap Index [VHI] and Quality of Life in Essential Tremor [QUEST]), acoustic and aerodynamic recordings, and nasoendoscopy to assess laryngeal structure and function and as part of the cranial nerve examination to determine the presence or absence of vocal tremor affecting speech structures associated with essential tremor. Ten (67%) subjects were classified with vocal tremor. Guidance for classification of tremor affecting the upper airway structures was taken from the Neurolaryngology Committee of the American Academy of Otolaryngology—Head and Neck Surgery report by Merati et al. (2005). This report included the following:

  1. idiopathic action tremor affecting the voice as part of essential tremor;

  2. tremor affecting muscles of the larynx, pharynx, palate, other articulators, and/or respiratory musculature; and

  3. tremor is not task specific (occurs during phonation and respiration).

Additional inclusion criteria for the present study were:

  1. no botulinum toxin injections within 3 months of participation with a return of at least 50% of symptoms at the time of participation;

  2. English as their primary communication language;

  3. normal hearing or no more than mild–moderate unilateral hearing loss;

  4. no prior history of radiation treatment to the head and neck structures resulting in functional changes to the voice, breathing, or swallowing;

  5. no other neuromotor disorder other than essential tremor that affects voice or speech;

  6. no speech abnormalities due to a stroke or traumatic brain injury;

  7. no psychiatric disorder that may affect the ability to participate; and

  8. no history of surgical treatment with deep-brain stimulation or focused ultrasound.

Vowel Samples

Speech sounds were captured with an AKG C520 head-mounted condenser microphone positioned at ~5 cm from the mouth and at an azimuthal angle of circa 45°, in accordance to Titze and Winholtz (1993) and Švec and Granqvist (2010, 2018). This microphone was connected to a Zoom H6 Pro audio recorder (Zoom corporation) for signal recording with a sampling rate of 44.1 kHz and saved in WAV format. To obtain appropriate sound levels, sound pressure calibration for head-mounted microphones were conducted as described in Švec and Granqvist (2018, p. 456) with a Class 1 SoundTrack LxT Sound Level Meter (Larson Davis, PCB Piezotronics, Inc.) as criterion.

Prior research demonstrated that sustained phonation is the optimal speech context for identifying the presence of vocal tremor through acoustic measurement (Barkmeier et al., 2001; Brown & Simonson, 1963; Dromey et al., 2002; Lederle et al., 2012; Lester et al., 2013). In addition, sustained [a:] and [i:] were previously used to measure acoustic rate and extent of f0 and IL modulations across differing pitch and loudness levels (Barkmeier-Kraemer et al., 2011; Erickson-DiRenzo, Sung, et al., 2020; Lederle et al., 2012; Shao et al., 2010). Therefore, the participants were instructed to produce [a:] and [i:] for 5 s at a comfortable pitch and loudness. From these audio recordings, midvowel portions of 2 s of [a:] and [i:] with optimal representation of tremor modulation were extracted using the program Praat (Paul Boersma and David Weenink, Institute for Phonetic Sciences), deidentified, and coded before processing with the following algorithm for measurements of rate and extent of acoustic modulation because of vocal tremor.

Manual Measurements of Vocal Tremor

The senior author (J.B.K.) provided the training to manual measurement raters. One co-author (K.D.) with 4 years of experience collecting these samples and measures completed the manual tremor measurements. Another co-author (S.K.) with 1 year of experience collecting these samples and measures completed measurements on 30 (33%) of the files for interrater reliability. To assess interrater reliability, intraclass correlation coefficients (ICCs; ICC model: two-way random effects; ICC type: single rater; ICC definition: absolute agreement; Koo & Li, 2016) between the manual measurements of the two raters were calculated and listed in Table 2. With single-rater ICCs between .770 and .914 across the four manually determined modulation properties, interrater reliability was considered acceptable. Upon post hoc review, there were no particular samples that explained instances of disagreement between the two raters. Disagreement predominantly pertained to deciding on presence/absence of vocal tremor modulation and was associated with small extent of modulation in f0 or IL, measured on average as 1.7% (±0.9%). The manual tremor property measurements on all vowel samples by the first rater were taken as the standard referent in the further study.

Table 2.

Intraclass correlation coefficients, with their 95% confidence intervals (CIs) and statistical significance, as indicators of interrater reliability of the manual measurement of vocal tremor properties by two raters.

Manual modulation property Single-rater ICC 95% CI
p Interpretation
Lower bound Upper bound
f 0ModRate .803 .582 .914 < .001 Moderate–excellent
f0ModMeanExtent .914 .807 .963 < .001 Good–excellent
ILModRate .770 .478 .910 < .001 Poor–excellent
ILModMeanExtent .887 .715 .958 < .001 Moderate–excellent

Note. Interpretation is based on Portney and Watkins (2000). ICC = intraclass correlation coefficient.

For the manual measurements of vocal tremor, the f0 (in Praat: “pitch”) and IL (in Praat: “intensity”) contours of each coded acoustic file were visually inspected using Praat (Version 6.1.38). Before raters completed the measurements, each file's f0 and IL contours were adjusted individually to assure adequate visualization of modulations using a consistent method across speaker recordings. The f0 range was adjusted to be ±100 Hz of the mean f0 of each file. Similarly, the IL settings ranged within ±10 dB of the mean IL of each file. The following measures were obtained manually for each file with calculations completed within Excel.

For f0 and IL modulation rate, the total number of visible modulation cycles was identified in the 2-s segment analyzed, divided by 2 s. The number of cycles was determined by counting the number of peak-to-peak or valley-to-valley cyclic modulations. The following equation was set in Excel for each file: rateHz=Ncyclesdurations.

For f0 and IL modulation extent, each cycle's extent was calculated by finding the maximum and minimum values for that cycle. For the extent of IL modulation, values were first converted from IL (in dB) into sound pressure level (SPL, in pascal or Pa). The extent of each cycle in the 2-s segment was averaged to determine the average magnitude of cyclic modulation: f0extent%=f0maximumf0minimumf0maximum+f0minimum×100 and ILextent%=SPLmaximumSPLminimumSPLmaximum+SPLminimum×100.

Acoustic Analyses of Vocal Tremor

All acoustic signal processing was achieved through custom programming/scripting in the program Praat by the first author (Y.M.). An automated analysis was first run using the tool as described in Maryn et al. (2019) followed by a semi-automated analysis. The steps in this semi-automated acoustic method were similar to Maryn et al. (2019), but to overcome the above-described difficulties, an interface was built to enable the researcher/clinician/user to intervene when required (i.e., to adjust the tremor's frequency range so it contains the user-approved f0 and/or IL modulation rate, delete signal parts that were erroneously indicated as modulation cycles, or state that there is no tremor-based modulation in f0 and/or IL). Information about the rate, extent, and regularity of f0 modulation and IL modulation in their respective traces was extracted via fast Fourier transformation (i.e., FFT, or spectral analysis) in Praat, as outlined in the automatic and user-intervened steps in the Appendix and the figures in Supplemental Material S1. The final graphical and numerical output of this semi-automated Praat tool resulted from this program.

This study investigated the congruence between the manual and the automated/semi-automated measurements. Given that the manual method only provided measures of IL modulation extent and rate, and f0 modulation extent and rate, this study continued with the four analogous automated/semi-automated measures:

  • ILModMeanExtent : marker of the mean extent of the IL modulation, as measured in the IL modulation trace after spectral processing.

  • ILModRate: frequency of the peak of the Fourier spectrum of the IL modulation trace, as a spectral marker of rate of IL modulation.

  • f0ModMeanExtent : marker of the mean extent of the f0 modulation, as measured in the f0 modulation trace after spectral processing.

  • f 0ModRate: frequency of the peak of the Fourier spectrum of the f0 modulation trace; as a spectral marker of rate of f0 modulation.

Results

Comparison Between Measurement Methods

Table 3 summarizes the descriptive data for the 12 variables (four tremor modulation properties × three measurement methods) in this study. Because analysis with the one-sample Kolmogorov–Smirnov test revealed that most of the variables (all except: automated f0ModRate with p = .052, manual ILModRate with p = .181, manual ILModMeanExtent with p = .191, and semi-automated ILModMeanExtent with p = .082) were not normally distributed, nonparametric analysis of variance with Friedman repeated-measures test for three related and dependent samples was used to compare modulation properties between the three methods. These overall comparisons showed significant differences for all four modulation properties. Therefore, post hoc Wilcoxon signed-rank tests were used to juxtapose pairs of related samples after Bonferroni correction (i.e., results were considered statistically significant at p ≤ .017). For f0ModRate, mean of measurements differed slightly (only 0.15 Hz between manual and automated, and only 0.32 Hz between manual and semi-automated) but significantly between manual and the other two methods. For ILModRate, mean of measurements also differed slightly (only 0.39 Hz between manual and automated, and only 0.32 Hz between manual and semi-automated) but significantly between manual and the other two methods. For ILModMeanExtent , mean of measurements differed significantly between all three methods. The difference was small (only 0.19%) between automated and semi-automated. However, the differences were large (10.24% between manual and automated, and 10.05% between manual and semi-automated) between manual and the other two methods. The latter can be explained mostly by the difference in scale (in pascal for the manual method, and in dB for the automated/semi-automated methods). For f0ModMeanExtent , there were no significant differences between the methods.

Table 3.

Minimum, maximum, mean, and standard deviation of the 12 variables (four tremor modulation properties × three measurement methods) of this study.

Method f0ModRate (Hz)
f0ModMeanExtent (%)
Min Max M SD Min Max M SD
Manual 3.10 10.00 4.40 1.03 0.50 30.80 6.35 7.09
Automated 3.25 5.78 4.55 0.73 0.79 47.39 7.82 9.99
Semi-automated
3.25
10.47
4.72
1.03
0.66
31.52
6.47
6.93
ILMod Rate (Hz)
ILModMeanExtent (%)
Method Min Max M SD Min Max M SD

Manual

2.90

6.60

4.26

0.83

1.00

33.80

12.08

7.52
Automated 3.11 10.26 4.65 1.32 0.25 4.91 1.84 1.21
Semi-automated 3.08 7.18 4.58 0.88 0.37 7.10 2.03 1.37

Correlations Between Manually and Acoustically Determined Tremor Properties

Table 4 lists the Pearson correlation coefficients (rPearson) between the manual measures and the automated measures with the tool from Maryn et al. (2019), as well as between the manual measures and the semi-automated measures. These correlations are also illustrated by the scatterplots in Figure 4. The raters considered tremulous modulation to be absent in 32 of 90 (36%) f0 traces and in 38 of 90 (42%) IL traces; therefore, rPearson was only calculated on the data retrieved from the cases where tremor modulation was determined to be present (i.e., 58 f0 and 52 IL traces). All Pearson correlations were statistically significant. The strength of rPearson values between manual and automated methods varied from low for f0ModRate to strong for ILModMeanExtent . However, the strength of rPearson values between manual and semi-automated methods was strong for all four measured tremor properties. Using Fisher r-to-z transformation, rPearson with manual markers was significantly higher for the semi-automated markers than for the automated markers, except for ILModMeanExtent . However, although rPearson for ILModMeanExtent from the automated method was highest, its difference with the rPearson for ILModMeanExtent from the semi-automated method was only 0.031 and not statistically significant. In general, Table 4 shows that the semi-automated method outperformed the automated method in terms of their correlations with the manual method across all rate and extent of tremor modulation measures.

Table 4.

Pearson correlation coefficients and their statistical significance between manual and automated tremor measures as well as between manual and semi-automated tremor measures, together with the statistical significance of the difference between the paired rPearson values.

Acoustic modulation property Manual and automated
Manual and semi-automated
Between-rPearson differences
N r Pearson p N r Pearson p z p
f 0ModRate 58 .502 < .001 58 .933 < .001 −5.92 < .001
f0ModMeanExtent 58 .811 < .001 58 .995 < .001 −8.47 < .001
ILModRate 52 .709 < .001 52 .918 < .001 −3.42 < .001
ILModMeanExtent 52 .957 < .001 52 .926 < .001 1.38 .168

Note. N = number of vowel samples; rPearson = Pearson correlation coefficient; p = significance level; z = value after Fisher's r-to-z transformation.

Figure 4.

Four scatter plots show the correlation between manual and automated, semi-automated tremor measurements. A. The top left plot shows the correlation between the fundamental frequency tremor rate for automated, semi-automated versus manual. The vertical axis represents the automated and semi-automated tremor rate, ranging from 0 to 11 Hertz, and the horizontal axis represents the manual tremor rate, ranging from 0 to 11 Hertz. A1. The automated FO tremor rate and manual have a mild positive correlation with the best-fit line y equals 0.3526x plus 2.9995 and R squared equals 0.2515. A2. The semi-automated FO tremor rate and manual have a strong positive correlation with the best-fit line y equals 0.9296x plus 0.6288 and R squared equals 0.8713. B. The bottom left plot shows the correlation between the fundamental frequency tremor extent for automated, semi-automated versus manual. The vertical axis represents the automated and semi-automated tremor extent, ranging from 0 to 50, and the horizontal axis represents the manual tremor extent, ranging from 0 to 50. B1. The automated FO tremor extent and manual have a strong positive correlation with the best-fit line y equals 1.1434x plus 0.5521 and R squared equals 0.658. B2. The semi-automated FO tremor extent and manual have a strong positive correlation with the best-fit line y equals 0.9724x plus 0.2901 and R squared equals 0.9899. C. The top right plot shows the correlation between the intensity level and tremor rate for automated, semi-automated versus manual. The vertical axis represents the automated and semi-automated tremor rate, ranging from 0 to 11 Hertz, and the horizontal axis represents the manual tremor rate, ranging from 0 to 11 Hertz. C1. The automated IL tremor rate and manual have a strong positive correlation with the best-fit line y equals 1.1219x minus 0.1369 and R squared equals 0.5029. C2. The semi-automated IL tremor rate and manual have a strong positive correlation with the best-fit line y equals 0.9743x plus 0.4247 and R squared equals 0.8429. D. The bottom right plot shows the correlation between the intensity level and tremor extent for automated, semi-automated versus manual. The vertical axis represents the automated and semi-automated tremor extent, ranging from 0 to 8, and the horizontal axis represents the manual tremor extent, ranging from 0 to 35. D1. The automated IL tremor extent and manual have a strong positive correlation with the best-fit line y equals 0.1534x minus 0.0098 and R squared equals 0.9155. D2. The semi-automated IL tremor extent and manual have a strong positive correlation with the best-fit line y equals 0.1689x minus 0.0.0081 and R squared equals 0.8571.

Scatter plots to illustrate the correlations between manually and acoustically (automated or semi-automated) determined rate and extent of modulations in the f0 and IL traces. f0 = fundamental frequency; IL = intensity level.

Incongruency Between Manual and Automated Measurements

Visual inspection of the automated measures-based scatter plots of Figure 4 indicated six signals for which one or more outlier data (marked by an arrow) located away from the linear regression line. These were signals for which there was obvious incongruence between manually and automated acoustically determined modulation measurements. Table 5 presents the erroneous and corrected measurements on these signals and description of the phenomena that may have led to the errors. From this analysis, main signal characteristics that may have induced error were multiplophonic voice break (see Figure 5), unvoiced interval (by the software interpreted as f0 = 0 Hz; see also Figure 5), and varying modulation rates (see Figure 6).

Table 5.

Description of characteristics of signals for which manual and automated tremor property analysis disagreed.

Subject Recording Modulation Property Measurement
Signal characteristics
Manual Automated Semi-automated (corrected)
1 [i:], 1 f 0 Rate, Hz 5.70 3.34 5.44 Varying modulation rates with the script choosing the lower rate
5 [a:], 3 f 0 Rate, Hz 10.00 4.42 10.47 Varying modulation rates with the script choosing the lower rate
2 [i:], 1 A Rate, Hz 5.30 8.45 5.51 Varying modulation rates with the script choosing the higher rate
2 [a:], 1 f 0 Extent, % 6.00 25.66 5.38 Instances with no period detection leading to erroneous f0 = 0 Hz in the script
7 [a:], 3 A Rate, Hz 4.80 10.26 5.12 Varying modulation rates with the script choosing the higher rate
f 0 Extent, % 11.50 36.67 12.37 Short voice breaks with diplophonia, subsequent drops in f0 and then erroneously high extent measures
Instances with no period detection leading to erroneous f0 = 0 Hz in the script
12 [a:], 1 f 0 Extent, % 14.50 47.39 12.31 Instances with no period detection leading to erroneous f0 = 0 Hz in the script

Note. f 0 = fundamental frequency.

Figure 5.

Five waveforms showing sound pressure, frequency, and fundamental frequency variations over time with five intervals and two multiplophonic intervals. A. The top graph shows sound pressure on vertical axis from negative 0.0150 to 0.0126 pascals versus time in seconds on horizontal axis ranging from 0.000 to 2.000. The graph plots a spectrogram which fluctuates between 0.0126 and -0.0150 Pascals over two seconds. B. The second graph shows frequency in hertz on the vertical axis from negative 0 to 2000 versus time in seconds on the horizontal axis ranging from 0.000 to 2.000. The graph plots a series of waveforms. C. The third graph shows fundamental frequency on the vertical axis from 50 to 300 hertz versus time in seconds on the horizontal axis ranging from 0.000 to 2.000. The graph plots an irregular sinusoidal waveform with disconnection at 5 intervals marked U1, U2, U3, U4, and U5 and two shaded areas of multiplophonic intervals M1 between U1 and U2 and M2 between U4 and U5. D. The fourth graph shows sound pressure on vertical axis from negative 0.0137 to 0.0112 pascals versus time in seconds on horizontal axis ranging from 0.0256 to 0.190. The graph plots fluctuations with peaks and troughs and a shaded region labeled multiplophonic voice break. E. The fifth graph shows fundamental frequency versus time. The graph shows fundamental frequency in hertz on vertical axis ranging from 0.000 to 216.883 and time on horizontal axis with 5 intervals marked at U1, U2, U3, U4, and U5 with two shaded regions between U1 and U2 and U4 and U5. The waveform spikes down to 0.000 at every interval of U1 to U5.

Graphs illustrating five intervals not detected as voiced and thus considered unvoiced by the script (U1, U2, U3, U4, and U5) and two multiplophonic interval (M1 and M2). All these intervals result in clear drops in the f0 modulation trace, resulting in erroneous f0 modulation extent measurements. f0 = fundamental frequency.

Figure 6.

Four graphs of varying modulation rates. A. The first graph shows sound pressure in Pascals on the vertical axis ranging from negative 0.1198 to 0.0823 versus time in seconds on the horizontal axis ranging from 0.000 to 2.161. The graph plots a spectrogram constant along a horizontal line. The second graph shows frequency in Hertz on the vertical axis ranging from 0 to 2000 and time in seconds on the horizontal axis. The frequency plots a series of waveforms. The third graph shows fundamental frequency in Hertz on the vertical axis and time in seconds on the horizontal axis. The fundamental frequency on the vertical axis ranges from 100 to 200. The line fluctuates approximately along 160 hertz. The fourth graph shows f0 modulation rate in Hertz on the vertical axis ranging from 3 to 12 and time in seconds on the horizontal axis. There are several data points marked from 0.000 seconds to 2.161 seconds labeled 6.273, 5.991, 6.273, 6.215, 6.456, 3.500, 3.463, 4.819, 6.855, and 6.519 hertz respectively.

Graphs illustrating variance in period of 10 modulation cycles (MCs) and consequently also in modulation rate (between 3.482 Hz and 6.369 Hz, as averages of respectively the two lowest and seven highest points).

Vocal Tremor Diagnosis Accuracy

To determine the accuracy in diagnosing vocal tremor of the acoustic methods relative to the manual methods of tremor measurement, positive cases were considered those in which tremor modulation was determined as present (i.e., 58/90 f0 traces and 52/90 IL traces), whereas negative cases were those where tremor modulation was absent (i.e., 32/90 f0 traces and 38/90 IL traces). This accuracy of the automated and semi-automated acoustic methods was investigated using the receiver operating characteristics curve (ROC). Table 6 lists the areas under ROC (AROC), as well as the threshold scores that yielded the best combination of sensitivity and specificity as expressed in Youden's index. Compared to manual acoustic measurements, vocal tremor diagnosis accuracy was determined acceptable for all automated and semi-automated acoustic measures of modulation extent with AROC values ranging between 0.888 for automated ILModMeanExtent and 0.920 for semi-automated ILModMeanExtent .

Table 6.

Diagnostic accuracy statistics of automatically and semi-automatically deciding on the presence/absence of tremorous modulation, with manual decision as the referent measure.

Metric Automated
Semi-automated
f0ModMeanExtent ILModMeanExtent f0ModMeanExtent ILModMeanExtent
AROC 0.904 0.888 0.904 0.920
Sensitivity 0.724 0.827 0.741 0.865
Specificity 0.969 0.921 0.937 0.921
Threshold score 1.594% 0.833% 1.690% 0.730%

Note. AROC = area under receiver operating characteristics curve.

Discussion

Vocal tremor is a neurogenic voice disorder that can significantly impact a person's communication and quality of life. It is important to have a readily available methodology to quantify the distinct vocal tremor acoustic properties, for example, for tracking treatment outcomes and changes in the severity of vocal tremor over time. This study investigated the concurrent validity of the two algorithms (automated or semi-automated) for measuring acoustic vocal tremor properties compared to manual extraction of these measures. Specifically, the correlations between manually and acoustically determined vocal tremor properties were gathered and compared for the strength of association. Pearson correlation coefficients between manual and automated acoustic measurements as well as between manual and semi-automated acoustic measurements were all significant. However, for the automated algorithm rPearson ranged from .502 (i.e., moderate) to .957 (i.e., strong) and was thus variable across the four markers, whereas for the semi-automated algorithm, all rPearson exceeded .9 (i.e., strong). With the semi-automated method, enabling the user to correct errors from the automated signal processing, this finding substantiated our hypothesis that it may be better to use a semi-automated algorithm. The semi-automated method was similar to the manual method, with f0 and IL traces and added information shown in Praat's graphical output window and consequently visually processed. It was noted that there are significant differences in modulation properties between measurement methods. In general, these differences were small. Only in ILModMeanExtent there were large differences. This, however, can be explained by the difference in unit upon which modulation extent was calculated: pascal in the manual method versus dB in the automated/semi-automated. In future studies, differences can be assessed with ILModMeanExtent measurements based on variables with the same unit.

This study also investigated the reliability of the manual measurements of vocal tremor properties when executed by different trained clinicians. To date, the reliability of visual-acoustically determined modulation properties in f0 and IL traces as standard in a perceptual-acoustic study has not yet been studied. In the present study, interrater reliability was considered good to excellent. A similar methodology has been used by Lederle et al. (2012) and Lester et al. (2013) for visual inspection and measurement of f0 and IL acoustic modulation using the program Praat. However, visual–perceptual assessment of other graphical outputs in the field of voice disorders may be more common and, therefore, their reliability more studied. For example, reliability analysis of narrow-band spectrographic voice sound assessment showed moderate to excellent agreement, depending on the feature considered (Bastilha et al., 2021). Agreement between neurolaryngologists in their visual assessment/interpretation of graphical laryngeal-electromyographic traces has also been shown to be between moderate and high (Ho et al., 2019). Our visual trace manual reliability measurements were comparable to other established methods. It was interesting, however, to see in post hoc review that there may be some difficulty in visually perceiving vocal tremor acoustic modulation at lower extents of modulation, particularly around 1.7%. Nevertheless, the manual measurements were used as the referent upon which the validity of the automated and semi-automated acoustic algorithms was assessed in the present study.

Another goal of the present study was to determine how accurate the two acoustic (automated and semi-automated) algorithms are in differentiating between tremorous and nontremorous phonation. Upon visual/manual inspection of the f0 and IL traces in the program Praat, the minimum extent of modulation of the traces from tremorous samples was found to be 0.5% for f0ModMeanExtent and 1.0% for ILModMeanExtent . ROC analysis indicated threshold values for the automated and semi-automated measures of extent of f0 and IL modulation (see Table 6). For example, semi-automated ILModMeanExtent < 0.730% has a strong chance of corresponding with absence of vocal tremor. However, confirmation from additional study with more tremorous as well as nontremorous participants is required to substantiate general use of these thresholds. Based on ROC analysis, diagnostic precision was equal between automated and semi-automated measurements of f0ModMeanExtent . However, for ILModMeanExtent , vocal tremor diagnosis precision of the semi-automated measurements was higher than that of the automated measurements. Therefore, as expected, the semi-automated measurements appeared to perform superiorly in terms of correlation with manual measurements and diagnostic precision. This similarity between manual and semi-automated methods is most likely explained by including visual inspections before calculating the vocal tremor properties from the individual acoustic modulation cycles.

Zooming in to signal characteristics that may lead to incongruency between manual and automated measurements of vocal tremor modulation properties, as presented in Table 5, has shown that signal intervals without voiced period detection, multiplophonic intervals, and varying modulation rate may induce erroneous modulation rate and/or extent measurements. Semi-automated analysis enables the user to correct for such phenomena. Furthermore, future Praat scripts may incorporate signal processing steps to reduce their influence, such as interpolation to bridge unvoiced intervals and period detection algorithms that help reducing unwanted f0 drops because of multiplophonia.

Limitations and Future Directions

There are a few limitations of this study that guide future investigations regarding the measurement and quantification of vocal tremor acoustic properties. First, only 15 subjects with essential tremor and various vocal tremor severity levels were included. This limits the clinical representativity of this study's outcomes. Second, only two raters visually inspected the f0 and IL traces. Although their reliability was determined acceptable, it would be interesting to investigate the reliability of such inspection by more raters and to search more in depth for causes of disagreement within and between raters. Furthermore, using the semi-automated tool will require the user to (a) understand how vocal tremor properties are analyzed and measured, (b) to recognize when the program-generated measures differ from their manual measures, and (c) to identify signal characteristics, such as multiplophonic intervals or varying modulation rates resulting in incongruence between acoustic and manual measurements. As only one user applied the semi-automated tool, future work will also need to assess intra-user and inter-user reliability across a sample representative of vocal tremor types and severity levels. In case of insufficient reliability, training of semi-automated acoustic measurement of vocal tremor properties and its impact on reliability could be evaluated. Third, depending on the speech structures involved, vocal tremor may not only affect f0 (predominantly related to the phonatory part of the speech system) and IL (predominantly related to the respiratory part of the speech system, although glottal width modulation during horizontal oscillation of the vocal folds can also affect IL), it may also, for example, involve the first and second formants (both related to the vocal tract due to tremor affecting the articulators comprising the speech system; Barkmeier-Kraemer & Story, 2010). It would, therefore, be important to add analyses of the properties of modulation of formant traces to the semi-automated algorithm. This could help to facilitate the evaluation of the relative contribution of these supralaryngeal tremor properties to vocal tremor, or the analysis of co-occurrence of f0 and formant modulations in cases with oscillating laryngeal position/height. Fourth, vibrato in the singing voice has been studied as a surrogate model of vocal tremor (Lester-Smith & Story, 2016). Thus, it would be interesting to apply the acoustic methods in this study to characterize vibrato modulation properties. Finally, the tools developed and tested in this study can also be applied for identification and characterization of unique acoustic modulation features and classification of vocal tremor between various medical conditions (e.g., essential vocal tremor, vocal tremor associated with laryngeal dystonia, or vocal tremor associated with Parkinson's disease).

Data Availability Statement

The data sets generated and/or analyzed during the current study are available from the corresponding author on reasonable request.

Supplementary Material

Supplemental Material S1. Information about the rate, extent, and regularity of f0 modulation and IL modulation in their respective traces extracted via fast Fourier transformation (i.e., FFT, or spectral analysis) in Praat.
JSLHR-68-2721-s001.jpg (5.3MB, jpg)

Appendix

Algorithm Steps Programmed in the Program Praat to Find Rate, Extent, and Regularity of Vocal Tremor Modulations in the Fundamental Frequency (f0) and Sound Intensity Level (IL) Traces

Number Type Process
1 Au Drawing a narrow-band spectrogram to enable a visual–perceptual interpretation of possible modulation in the speech signals: window length = 0.03 s, maximum frequency = 2000 Hz, time step = 0.002 s, frequency step = 20 Hz, window shape = sine-squared (Hanning)
2 Au Application of the following ranges: range of tremor rate = from 3 to 12 Hz, range of tremor search rate = from 2 to 24 Hz
f 0 tremor: Repeat until user's approval of correct analysis
3 Au Extraction and drawing of f0 trace with Praat's cross-correlation method: minimum f0 = 60 Hz, maximum f0 = 400 Hz, time step = 0.02 s, silence threshold = 0.3, voicing threshold = 0.45, octave cost = 0.01, octave jump cost = 0.8, voiced/unvoiced cost = 0.14
4 Au Setting the mean value of the f0 trace to 0 to remove zero offset
5 Au Windowing of the f0 trace with a Hanning function to remove spectral artefacts resulting from truncation of the time series
6 Au Fast Fourier transformation (FFT) of the f0 trace to yield its FFT spectrum
7 Au One-to-one translation of the f0 trace's FFT spectrum into a long-term average spectrum (LTAS) for further processing of the spectral data. Drawing of the LTAS
8 Au Determination of frequency (and its corresponding period as 1frequency) and amplitude corresponding with the f0-based LTAS maximum/peak within the predefined range of tremor search rate
9 Au Computation of the linear trend line through the f0-based LTAS over the range of tremor search rate
10 Au Determination of the spectral peak's prominence above the linear trend line
11 Au Determination of minimal f0 and maximum f0 in every period of the f0 trace
12 Au Calculation of f0 tremor extent for every period as maximumf0minimumf0maximumf0+minimumf0×100 (Lederle et al., 2012; Lester et al., 2013)
13 Au Calculation of mean and median extents of f0 tremor
14 Au Determination and drawing of a rectangle derived from x (i.e., time) and y (i.e., f0) coordinates of the minimum and maximum data points in every period. These rectangles thus incorporated half duration and extent of every tremor period in a single metric and were therefore hypothesized to summarize elemental tremor cycle geometry.
15 Au Calculation of absolute perturbation in rectangle surfaces between adjacent f0 tremor cycles as i=1N1surfaceisurfacei+1N1
16 Au Calculation of relative perturbation in rectangle surfaces between adjacent f0 tremor cycles as i=1N1surfaceisurfacei+1N1i=1NsurfaceiN×100
17 UI Decision on visual presence of tremor in f0 trace
18 UI Decision on correctness of f0 tremor rate
19 UI Removal of unfit f0 tremor modulations
20 UI User approval if f0 tremor is present, f0 tremor rate is correct and there are no more unfit f0 tremor modulations.
IL tremor: Repeat until user's approval of correct analysis
21 Au Extraction and drawing of IL trace: minimum f0 = 60 Hz, time step = default
22 Au Setting the mean value of the IL trace to 0 to remove zero offset
23 Au Windowing of the IL trace with a Hanning function to remove spectral artefacts resulting from truncation of the time series
24 Au Fast Fourier transformation of the IL trace to yield its FFT spectrum
25 Au One-to-one translation of the IL trace's FFT spectrum into an LTAS for further processing of the spectral data. Drawing of the LTAS
26 Au Determination of frequency (and its corresponding period as 1frequency) and amplitude corresponding with the IL-based LTAS maximum/peak within the predefined range of tremor search rate
27 Au Computation of the linear trend line through the IL-based LTAS over the range of tremor search rate
28 Au Determination of the spectral peak's prominence above the linear trend line
29 Au Determination of minimal amplitude and maximum amplitude in every period of the IL trace
30 Au Calculation of IL tremor extent for every period as maximumILminimumILmaximumIL+minimumIL×100 (Lederle et al., 2012; Lester et al., 2013)
31 Au Calculation of mean and median extents of IL tremor
32 Au Determination and drawing of a rectangle derived from x (i.e., time) and y (i.e., IL) coordinates of the minimum and maximum data points in every period. These rectangles thus incorporated half duration and extent of every tremor period in a single metric and were therefore hypothesized to summarize elemental tremor cycle geometry.
33 Au Calculation of absolute perturbation in rectangle surfaces between adjacent tremor cycles as i=1N1surfaceisurfacei+1N1
34 Au Calculation of relative perturbation in rectangle surfaces between adjacent tremor cycles as i=1N1surfaceisurfacei+1N1i=1NsurfaceiN×100
35 UI Decision on visual presence of tremor in IL trace
36 UI Decision on correctness of IL tremor rate
37 UI Removal of unfit IL tremor modulations
38 UI User approval if IL tremor is present, IL tremor rate is correct, and there are no more unfit IL tremor modulations
39 Au Final graphical and numerical output with addition of the Acoustic Phonatory Tremor Index (APTI) based on Maryn et al. (2019)

Note.  Au = automatic; UI = user interference. Adapted from Maryn et al. (2019).

Funding Statement

This work was funded by the National Institute on Deafness and Other Communication Disorders (R01DC016838, PI: Barkmeier-Kraemer; P50DC019900, PI: Simonyan) with partial support from the University of Utah Voice, Airway, Swallowing Translational (VAST) Research Lab.

Acknowledgments

This work was funded by the National Institute on Deafness and Other Communication Disorders (R01DC016838, PI: Barkmeier-Kraemer; P50DC019900, PI: Simonyan) with partial support from the University of Utah Voice, Airway, Swallowing Translational (VAST) Research Lab.

References

  1. Ackermann, H., & Ziegler, W. (1991). Cerebellar voice tremor: An acoustic analysis. Journal of Neurology, Neurosurgery, & Psychiatry, 54(1), 74–76. 10.1136/jnnp.54.1.74 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Adler, C. H., Bansberg, S. F., Hentz, J. G., Ramig, L. O., Buder, E. H., Witt, K., Edwards, B. W., Krein-Jones, K., & Caviness, J. N. (2004). Botulinum toxin type a for treating voice tremor. Archives of Neurology, 61(9), 1416–1420. 10.1001/archneur.61.9.1416 [DOI] [PubMed] [Google Scholar]
  3. Anand, S., Shrivastav, R., Wingate, J. M., & Chheda, N. N. (2012). An acoustic-perceptual study of vocal tremor. Journal of Voice, 26(6), 811.e1–811.e7. 10.1016/j.jvoice.2012.02.007 [DOI] [PubMed] [Google Scholar]
  4. Barkmeier, J., Case, J. L., & Ludlow C. L. (2001). Identification of symptoms for spasmodic dysphonia and vocal tremor: A comparison of expert and nonexpert judges. Journal of Communication Disorders, 34(1-2), 21–37. 10.1016/S0021-9924(00)00039-3 [DOI] [PubMed] [Google Scholar]
  5. Barkmeier-Kraemer, J. (2013). Special cases: Hiding vocal tremor. In A. Behrman & J. Haskell (Eds.), Exercises for voice therapy (2nd ed., pp. 196–199). Plural. [Google Scholar]
  6. Barkmeier-Kraemer, J. (2016). Evaluation and treatment of vocal tremor. In Y. Maryn & M. De Bodt (Eds.), StEMINARIES 2015–2016 nr. 4, Neurolaryngology (pp. 12–25). Vlaamse Vereniging voor Logopedisten. [Google Scholar]
  7. Barkmeier-Kraemer, J. (2019). The use of reduced voicing duration to treat vocal tremor. In J. C. Stemple & E. Hapner (Eds.), Voice therapy clinical case studies (5th ed., pp. 263–273). Plural. [Google Scholar]
  8. Barkmeier-Kraemer, J., Lato, A., & Wiley, K. (2011). Development of a speech treatment program for a client with essential vocal tremor. Seminars in Speech and Language, 32(1), 043–057. 10.1055/s-0031-1271974 [DOI] [PubMed] [Google Scholar]
  9. Barkmeier-Kraemer, J., & Story, B. (2010). Conceptual and clinical updates on vocal tremor. The ASHA Leader, 15(14), 16–19. 10.1044/leader.FTR2.15142010.16 [DOI] [Google Scholar]
  10. Barkmeier-Kraemer, J., Tanner, K., & Maryn, Y. (2018, October). Distinguishing features of vocal tremor associated with Parkinson disease, spasmodic dysphonia, and essential tremor [Paper presentation]. The Fall Voice Conference, Seattle, WA. [Google Scholar]
  11. Bastilha, G. R., Pagliarin, K. C., Moraes, D. A. O., & Cielo, C. A. (2021). Spectrographic Vocal Assessment Protocol (SVAP): Reliability and criterion validity. Journal of Voice, 35(6), 931.e1–931.e14. 10.1016/j.jvoice.2020.02.017 [DOI] [PubMed] [Google Scholar]
  12. Bhatia, K. P., Bain, P., Bajaj, N., Elble, R. J., Hallett, M., Louis, E. D., Raethjen, J., Stamelou, M., Testa, C. M., Deuschl, G., & Tremor Task Force of the International Parkinson and Movement Disorder Society. (2018). Consensus statement on the classification of tremors. from the task force on tremor of the International Parkinson and Movement Disorder Society. Movement Disorders, 33(1), 75–87. 10.1002/mds.27121 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bove, M., Daamen, N., Rosen, C., Wang, C. C., Sulica, L., & Gartner-Schmidt, J. (2006). Development and validation of the Vocal Tremor Scoring System. The Laryngoscope, 116(9), 1662–1667. 10.1097/01.mlg.0000233255.57425.36 [DOI] [PubMed] [Google Scholar]
  14. Brown, J. R., & Simonson, J. (1963). Organic voice tremor: A tremor of phonation. Neurology, 13(6), 520–525. 10.1212/WNL.13.6.520 [DOI] [PubMed] [Google Scholar]
  15. Busenbark, K., Ramig, L., Dromey, C., & Koller, W. C. (1996). Methazolamide for essential voice tremor. Neurology, 47(5), 1331–1332. 10.1212/WNL.47.5.1331 [DOI] [PubMed] [Google Scholar]
  16. Crawford, P., & Zimmerman, E. E. (2011). Differentiation and diagnosis of tremor. American Family Physician, 83(6), 697–702. https://www.aafp.org/pubs/afp/issues/2011/0315/p697.html [PubMed] [Google Scholar]
  17. Dominguez, L. M., Krysinski, M., & Simpson, B. C. (2018, October). Acoustic characteristics of essential vocal tremor versus Parkinsonian tremor [Paper presentation]. The Fall Voice Conference, Seattle, WA. [Google Scholar]
  18. Dromey, C., Warrick, P., & Irish, J. (2002). The influence of pitch and loudness changes on the acoustics of vocal tremor. Journal of Speech, Language, and Hearing Research, 45(5), 879–890. 10.1044/1092-4388(2002/071) [DOI] [PubMed] [Google Scholar]
  19. Duffy, J. R. (1995). Motor speech disorders: Substrates, differential diagnosis, and management. Mosby. [Google Scholar]
  20. Dwenger, K. M. (2022). Evaluation of the contribution of sensory tricks to changes in symptomatology in individuals with laryngeal dystonia in comparison to individuals with essential vocal tremor [Master's thesis, University of Utah]. ProQuest Dissertations and Thesis Global. https://www.proquest.com/openview/63dff3276c70097b8d763936c6cbd549/1 [Google Scholar]
  21. Dwenger, K. M. (2024). The effects of transcutaneous vibrotactile stimulation rates on laryngeal dystonia symptomatology [Doctoral dissertation, University of Utah]. ProQuest Dissertations and Thesis Global. [Google Scholar]
  22. Elble, R. J. (2016). The essential tremor syndromes. Current Opinion in Neurology, 29(4), 507–512. 10.1097/WCO.0000000000000347 [DOI] [PubMed] [Google Scholar]
  23. Erickson-DiRenzo, E., Kuijper, F. M., Barbosa, D. A. N., Lim, E. A., Lin, P. T., Lising, M. A., Huang, Y., Sung, C. K., & Halpern, C. H. (2020). Multiparametric laryngeal assessment of the effect of thalamic deep brain stimulation on essential vocal tremor. Parkinsonism and Related Disorders, 81, 106–112. 10.1016/j.parkreldis.2020.10.026 [DOI] [PubMed] [Google Scholar]
  24. Erickson-DiRenzo, E., Sung, C. K., Ho, A. L., & Halpern, C. H. (2020). Intraoperative evaluation of essential vocal tremor in deep brain stimulation surgery. American Journal of Speech-Language Pathology, 29(2), 851–863. 10.1044/2019_AJSLP-19-00079 [DOI] [PubMed] [Google Scholar]
  25. Fahn, S., Tolosa, E., & Marin, C. (1988). Clinical rating scale for tremor. In J. Jankovic & E. Tolosa (Eds.), Parkinson's disease and movement disorders (pp. 225–234). Urban & Schwarzenberg. [Google Scholar]
  26. Finnegan, E. M., Luschei, E. S., Barkmeier, J., & Hoffman, H. T. (2003). Synchrony of laryngeal muscle activity in persons with vocal tremor. Archives of Otolaryngology—Head & Neck Surgery, 129(3), 313–318. 10.1001/archotol.129.3.313 [DOI] [PubMed] [Google Scholar]
  27. Gamboa, J., Jiménez-Jiménez, F. J., Nieto, A., Cobeta, I., Vegas, A., Ortí-Pareja, M., Gasalla, T., Molina, J. A., & García-Albea, E. (1998). Acoustic voice analysis in patients with essential tremor. Journal of Voice, 12(4), 444–452. 10.1016/S0892-1997(98)80053-2 [DOI] [PubMed] [Google Scholar]
  28. Ghosh, N., Eidson, E., Derrick, E., & Lester-Smith, R. A. (2023). Breathy voice as a compensatory strategy for essential vocal tremor: A single case experiment across participants. Journal of Voice. Advance online publication. 10.1016/j.jvoice.2023.10.003 [DOI] [PubMed] [Google Scholar]
  29. Gillivan-Murphy, P., Carding, P., & Miller, N. (2016). Vocal tract characteristics in Parkinson's disease. Current Opinion in Otolaryngology & Head and Neck Surgery, 24(3), 175–182. 10.1097/MOO.0000000000000252 [DOI] [PubMed] [Google Scholar]
  30. Gillivan-Murphy, P., Miller, N., & Carding, P. (2019). Voice tremor in Parkinson's disease: An acoustic study. Journal of Voice, 33(4), 526–535. 10.1016/j.jvoice.2017.12.010 [DOI] [PubMed] [Google Scholar]
  31. Gurey, L. E., Sinclair, C. F., & Blitzer, A. (2013). A new paradigm for the management of essential vocal tremor with botulinum toxin. The Laryngoscope, 123(10), 2497–2501. 10.1002/lary.24073 [DOI] [PubMed] [Google Scholar]
  32. Hachinski, V. C., Thomsen, I. V., & Buch, N. H. (1975). The nature of primary vocal tremor. Canadian Journal of Neurological Science, 2(3), 195–197. 10.1017/S0317167100020254 [DOI] [PubMed] [Google Scholar]
  33. Hägglund, P., Sandström, L., Blomstedt, P., & Karlsson, F. (2016). Voice tremor in patients with essential tremor: Effects of deep brain stimulation of caudal zona incerta. Journal of Voice, 30(2), 228–233. 10.1016/j.jvoice.2015.04.003 [DOI] [PubMed] [Google Scholar]
  34. Hartman, D. E., & Vishwanat, B. (1984). Spastic dysphonia and essential (voice) tremor treated with primidone. Archives of Otolaryngology, 110(6), 394–397. 10.1001/archotol.1984.00800320048011 [DOI] [PubMed] [Google Scholar]
  35. Hlavnička, J., Tykalová, T., Ulmanová, O., Dušek, P., Horáková, D., Růžička, E., Klempíř, J., & Rusz, J. (2020). Characterizing vocal tremor in progressive neurological diseases via automated acoustic analyses. Clinical Neurophysiology, 131(5), 1155–1165. 10.1016/j.clinph.2020.02.005 [DOI] [PubMed] [Google Scholar]
  36. Ho, G. Y., Leonhard, M., Volk, G. F., Foerster, G., Pototschnig, C., Klinge, K., Granitzka, T., Zienau, A. K., & Schneider-Stickler, B. (2019). Inter-rater reliability of seven neurolaryngologists in laryngeal EMG signal interpretation. European Archives of Otorhinolaryngology, 276(10), 2849–2856. 10.1007/s00405-019-05553-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Jiang, J., Lin, E., & Hanson, D. G. (2000). Acoustic and airflow spectral analysis of voice tremor. Journal of Speech, Language, and Hearing Research, 43(1), 191–204. 10.1044/jslhr.4301.191 [DOI] [PubMed] [Google Scholar]
  38. Julius, A., & Longfellow, K. (2016). Movement disorders: A brief guide in medication management. Medical Clinics of North America, 100(4), 733–761. 10.1016/j.mcna.2016.03.002 [DOI] [PubMed] [Google Scholar]
  39. Kendall, K. A., & Leonard, R. J. (2011). Interarytenoid muscle botox injection for treatment of adductor spasmodic dysphonia with vocal tremor. Journal of Voice, 25(1), 114–119. 10.1016/j.jvoice.2009.08.003 [DOI] [PubMed] [Google Scholar]
  40. Khoury, S., & Randall, D. R. (2022). Treatment of essential vocal tremor: A scoping review of evidence-based therapeutic modalities. Journal of Voice, 38, 922–930. 10.1016/j.jvoice.2021.12.009 [DOI] [PubMed] [Google Scholar]
  41. Koda, J., & Ludlow, C. L. (1992). An evaluation of laryngeal muscle activation in patients with voice tremor. Otolaryngology—Head & Neck Surgery, 107(5), 684–696. 10.1177/019459989210700510 [DOI] [PubMed] [Google Scholar]
  42. Koller, W. (1985). Long-acting propranolol in essential tremor. Neurology, 35(1), 108–110. 10.1212/WNL.35.1.108 [DOI] [PubMed] [Google Scholar]
  43. Koller, W., Graner, D., & Mlcoch, A. (1985). Essential voice tremor: Treatment with propranolol. Neurology, 35(1), 106–108. 10.1212/WNL.35.1.106 [DOI] [PubMed] [Google Scholar]
  44. Koo, T. K., & Li, M. Y. (2016). A guideline of selecting and reporting intraclass correlation coefficients for reliability research. Journal of Chiropractic Medicine, 15(2), 155–163. 10.1016/j.jcm.2016.02.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Lederle, A., Barkmeier-Kraemer, J., & Finnegan, E. (2012). Perception of vocal tremor during sustained phonation compared with sentence context. Journal of Voice, 26(5), 668.e1–668.e9. 10.1016/j.jvoice.2011.11.001 [DOI] [PubMed] [Google Scholar]
  46. Lester, R. A., Barkmeier-Kraemer, J., & Story, B. H. (2013). Physiologic and acoustic patterns of essential vocal tremor. Journal of Voice, 27(4), 422–432. 10.1016/j.jvoice.2013.01.002 [DOI] [PubMed] [Google Scholar]
  47. Lester-Smith, R. A., & Story, B. H. (2016). The effects of physiological adjustments on the perceptual and acoustical characteristics of vibrato as a model of vocal tremor. The Journal of the Acoustical Society of America, 140(5), 3827–3833. 10.1121/1.4967454 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Louis, E. D. (2016). Non-motor symptoms in essential tremor: A review of the current data and state of the field. Parkinsonism & Related Disorders, 22(Suppl. 1), S115–S118. 10.1016/j.parkreldis.2015.08.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Lowell, S. Y., Kelley, R. T., Dischinat, N., Monahan, M., Hosbach-Cannon, C. J., Colton, R. H., & Mihaila, D. (2021). Clinical features of essential voice tremor and associations with tremor severity and response to octanoic acid treatment. The Laryngoscope, 131(11), E2792–E2801. 10.1002/lary.29558 [DOI] [PubMed] [Google Scholar]
  50. Lowell, S. Y., Kelley, R. T., Monahan, M., Hosbach-Cannon, C. J., Colton, R. H., & Mihaila, D. (2019). The effect of octanoic acid on essential voice tremor: A double-blind, placebo-controlled study. The Laryngoscope, 129(8), 1882–1890. 10.1002/lary.27695 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Lu, F., Zhao, K., Wu, Y., Kong, Y., Gao, Y., & Zhang, L. (2023). Voice-related outcomes in deep brain stimulation in patients with vocal tremor: A systematic review and meta-analysis. Journal of Voice. Advance online publication. 10.1016/j.jvoice.2023.09.027 [DOI] [PubMed] [Google Scholar]
  52. Maryn, Y., Leblans, M., Zarowski, A., & Barkmeier-Kraemer, J. (2019). Objective acoustic quantification of perceived voice tremor severity. Journal of Speech, Language, and Hearing Research, 62(10), 3689–3705. 10.1044/2019_JSLHR-S-19-0024 [DOI] [PubMed] [Google Scholar]
  53. Matsumoto, J. Y., Fossett, T., Kim, M., Duffy, J. R., Strand, E., McKeon, A., Lee, K. H., Stead, M., Burnett, M., Adams, A. C., & Klassen, B. T. (2016). Precise stimulation location optimizes speech outcomes in essential tremor. Parkinsonism & Related Disorders, 32, 60–65. 10.1016/j.parkreldis.2016.08.017 [DOI] [PubMed] [Google Scholar]
  54. Mendoza, E., & Carballo, G. (1999). Vocal tremor and psychological stress. Journal of Voice, 13(1), 105–112. 10.1016/S0892-1997(99)80064-2 [DOI] [PubMed] [Google Scholar]
  55. Merati, A. L., Heman-Ackah, Y. D., Abaza, M., Altman, K. W., Sulica, L., & Bielamowicz, S. (2005). Common movement disorders affecting the larynx: A report from the neurolaryngology committee of the AAO-HNS. Otolaryngology—Head & Neck Surgery, 133(5), 654–665. 10.1016/j.otohns.2005.05.003 [DOI] [PubMed] [Google Scholar]
  56. Millman, N., van der Woerd, B., Timmons Sund, L., & Johns, M. (2023). Cannabinoid use in the treatment of laryngeal dystonia and vocal tremor: A pilot investigation. Journal of Voice. Advance online publication. 10.1016/j.jvoice.2023.05.006 [DOI] [PubMed] [Google Scholar]
  57. Movement Disorder Society Task Force on Rating Scales for Parkinson's Disease. (2003). The Unified Parkinson's Disease Rating Scale (UPDRS): Status and recommendations. Movement Disorders, 18(7), 738–750. 10.1002/mds.10473 [DOI] [PubMed] [Google Scholar]
  58. Muenter, M. D., Daube, J. R., Caviness, J. N., & Miller, P. M. (1991). Treatment of essential tremor with methazolamide. Mayo Clinic Proceedings, 66(10), 991–997. 10.1016/S0025-6196(12)61721-2 [DOI] [PubMed] [Google Scholar]
  59. Nelson, R. C., Silva Merea, V., Tierney, W. S., Milstein, C., Benninger, M. S., & Bryson, P. C. (2019). Laryngeal botulinum toxin injection for vocal tremor: Utility of concurrent strap muscle injection. The Laryngoscope, 129(6), 1433–1437. 10.1002/lary.27631 [DOI] [PubMed] [Google Scholar]
  60. Nida, A., Alston, J., & Schweinfurth, J. (2016). Primidone therapy for essential vocal tremor. JAMA Otolaryngology – Head & Neck Surgery, 142(2), 117–121. 10.1001/jamaoto.2015.2849 [DOI] [PubMed] [Google Scholar]
  61. Ondo, W. (2016). Essential tremor: What we can learn from current pharmacotherapy. Tremor and Other Hyperkinetic Movements, 6, Article 356. 10.5334/tohm.318 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Panda, A. K., Kushwaha, S., & Kaur, M. (2013). Essential palatal tremor with hemifacial and vocal cord tremor. BMJ Case Reports, 2013, Article bcr2013201327. 10.1136/bcr-2013-201327 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Portney, L. G., & Watkins, M. P. (2000). Foundations of clinical research: Applications to practice (2nd ed.). Prentice Hall. [Google Scholar]
  64. Ramig, L. A., Scherer, R. C., Titze, I. R., & Ringel, S. P. (1988). Acoustic analysis of voices of patients with neurologic disease: Rationale and preliminary data. Annals of Otology, Rhinology & Laryngology, 97(2), 164–172. 10.1177/000348948809700214 [DOI] [PubMed] [Google Scholar]
  65. Ruckart, K. W., Wilson, C., Moya-Mendez, M. E., Madden, L. L., Laxton, A., & Siddiqui, M. S. (2023). Effect of ventral intermediate nucleus deep brain stimulation on vocal tremor in essential tremor. Tremor and Other Hyperkinetic Movements, 13(1), Article 13. 10.5334/tohm.757 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Shao, J., MacCallum, J. K., Zhang, Y., Sprecher, A., & Jiang, J. J. (2010). Acoustic analysis of the tremulous voice: Assessing the utility of the correlation dimension and perturbation parameters. Journal of Communication Disorders, 43(1), 35–44. 10.1016/j.jcomdis.2009.09.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Simonyan, K., & Frucht, S. J. (2013). Long-term effect of sodium oxybate (Xyrem®) in spasmodic dysphonia with vocal tremor. Tremor and Other Hyperkinetic Movements, 3. 10.5334/tohm.156 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Sulica, L., & Louis, E. D. (2010). Clinical characteristics of essential voice tremor: A study of 34 cases. The Laryngoscope, 120(3), 516–528. 10.1002/lary.20702 [DOI] [PubMed] [Google Scholar]
  69. Švec, J. G., & Granqvist, S. (2010). Guidelines for selecting microphones for human voice production research. American Journal of Speech-Language Pathology, 19(4), 356–368. 10.1044/1058-0360(2010/09-0091) [DOI] [PubMed] [Google Scholar]
  70. Švec, J. G., & Granqvist, S. (2018). Tutorial and guidelines on measurement of sound pressure level in voice and speech. Journal of Speech, Language, and Hearing Research, 61(93), 441–461. 10.1044/2017_JSLHR-S-17-0095 [DOI] [PubMed] [Google Scholar]
  71. Tanaka, Y., Nishio, M., & Niimi, S. (2011). Vocal acoustic characteristics of patients with Parkinson's disease. Folia Phoniatrica et Logopaedica, 63(5), 223–230. 10.1159/000322059 [DOI] [PubMed] [Google Scholar]
  72. Titze, I. R., & Winholtz, W. S. (1993). Effect of microphone type and placement on voice perturbation measurements. Journal of Speech and Hearing Research, 36(6), 1177–1190. 10.1044/jshr.3606.1177 [DOI] [PubMed] [Google Scholar]
  73. Tomoda, H., Shibasaki, H., Kuroda, Y., & Shin, T. (1987). Voice tremor: Dysregulation of voluntary expiratory muscles. Neurology, 37(1), 117–122. 10.1212/WNL.37.1.117 [DOI] [PubMed] [Google Scholar]
  74. Torrecillas, V., Dwenger, K., & Barkmeier-Kraemer, J. (2021). Classification of vocal tremor using updated consensus-based tremor classification criteria. Laryngoscope Investigative Otolaryngology, 6(2), 261–276. 10.1002/lio2.544 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Warrick, P., Dromey, C., Irish, J., & Durkin, L. (2000). The treatment of essential voice tremor with botulinum toxin: A longitudinal case report. Journal of Voice, 14(3), 410–421. 10.1016/S0892-1997(00)80086-7 [DOI] [PubMed] [Google Scholar]
  76. Wolraich, D., Marchis-Crisan, V. C., Redding, N., Khella, S. L., & Mirza, N. (2010). Laryngeal tremor: Co-occurrence with other movement disorders. ORL, 72(5), 291–294. 10.1159/000317032 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Material S1. Information about the rate, extent, and regularity of f0 modulation and IL modulation in their respective traces extracted via fast Fourier transformation (i.e., FFT, or spectral analysis) in Praat.
JSLHR-68-2721-s001.jpg (5.3MB, jpg)

Data Availability Statement

The data sets generated and/or analyzed during the current study are available from the corresponding author on reasonable request.


Articles from Journal of Speech, Language, and Hearing Research : JSLHR are provided here courtesy of American Speech-Language-Hearing Association

RESOURCES