Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2020 Oct 1;36(6):879.e13–879.e19. doi: 10.1016/j.jvoice.2020.09.024

Voice Quality Evaluation in Patients With COVID-19: An Acoustic Analysis

Maral Asiaee *, Amir Vahedian-azimi , Seyed Shahab Atashi , Abdalsamad Keramatfar §, Mandana Nourbakhsh ║,1
PMCID: PMC7528943  PMID: 33051108

Abstract

Objectives

With the COVID-19 outbreak around the globe and its potential effect on infected patients’ voice, this study set out to evaluate and compare the acoustic parameters of voice between healthy and infected people in an objective manner.

Methods

Voice samples of 64 COVID-19 patients and 70 healthy Persian speakers who produced a sustained vowel /a/ were evaluated. Between-group comparisons of the data were performed using the two-way ANOVA and Wilcoxon's rank-sum test.

Results

The results revealed significant differences in CPP, HNR, H1H2, F0SD, jitter, shimmer, and MPT values between COVID-19 patients and the healthy participants. There were also significant differences between the male and female participants in all the acoustic parameters, except jitter, shimmer and MPT. No interaction was observed between gender and health status in any of the acoustic parameters.

Conclusion

The statistical analysis of the data revealed significant differences between the experimental and control groups in this study. Changes in the acoustic parameters of voice are caused by the insufficient airflow, and increased aperiodicity, irregularity, signal perturbation and level of noise, which are the consequences of pulmonary and laryngological involvements in patients with COVID-19.

Key words: Coronavirus disease, Covid-19 patients, Voice quality, Acoustic analysis

INTRODUCTION

In early March 2020, the World Health Organization (WHO) declared COVID-19 as a pandemic. Since then, with more than 17,000,000 confirmed cases worldwide and counting, coronavirus has become one of the most challenging issues with which the world is faced. As indicated by Rothan and Byrareddy,1 COVID-19 principally affects the respiratory system, and people infected with the disease may experience pneumonia and acute respiratory distress syndrome. Symptoms of acute upper respiratory tract infections, such as cough, sore throat, rhinorrhea and sneezing, as well as digestion symptoms, like vomiting, are associated with mild cases of this disease, and symptoms like pneumonia and acute respiratory distress are primarily seen in severe and critical cases, respectively.2 “Respiratory tract infections affect the same system and structure that are used for voice production.”3 Although the primary function of the respiratory tract is to provide oxygen to the body, its secondary and equally important function is to provide energy and produce phonation for the purpose of speech communication

Voice production is a 3-stage process: Respiration, phonation, and resonating system. In the respiratory stage, the force that is needed for generating sound is provided by the air expelled from the lungs. A person who has contracted the coronavirus not only may experience shortness of breath but may also have difficulty exhaling, which results in the lack of energy to produce sound and hence a disruption in the speech production cycle. In the phonation stage, the subglottal pressure must reach a certain point to set the vocal folds into a vibratory position. If the first stage of speech mechanism is disrupted, the phonation function of the larynx will be accordingly impaired. Other symptoms of coronavirus, such as recurrent dry coughs, may cause changes in the vocal folds and will consequently give rise to modifications in the acoustic cues related to voice quality. As shown in a study using self-assessment questionnaires, 28.6% of those infected with COVID-19, showed symptoms of dysphonia.4 Nonetheless, using only self-assessed results are subjective and prone to error.

The acoustic analysis of voice quality is of great interest among phoneticians and voice clinicians due to its noninvasive nature, low costs, and ease of application.5 , 6 , 7 In 2001, the European Laryngological Society (ELS) put forward a basic protocol for assessment of voice-related diseases in which they recommended using the acoustic analysis of speech as a diagnostic tool.7 In the ELS protocol, fundamental frequency (F0), perturbation measures of pitch (jitter), and amplitude (shimmer) as well as harmonic-to-noise ratio (HNR) were noted as relevant parameters when evaluating voice quality.7 Along with these parameters, which are the most frequently-used ones6 , 8 , 9 , cepstral peak prominence (CPP),9, 10, 11, 12, 13 harmonic amplitude measures14 , 15 and the aerodynamic parameter of voicing, that is, maximum phonation time (MPT)16 , 17 are among the most-studied characteristics of voice. In addition to all the aforementioned parameters, this study also employed fundamental frequency variation (F0 standard deviation) and number of voice breaks (NVB) for accomplishing the study objective.

Fundamental frequency is the rate at which vocal folds vibrate per second and is expressed in Hertz (Hz). Changes in mass, length and tension of the vocal folds modify the fundamental frequency.18, 19, 20 Variation in fundamental frequency is either normal (eg, difference between F0 in children, women and men) or may occur because of vocal fold pathologies.21 Asymmetrical changes in the mass and tension of the vocal folds caused by a laryngeal pathology such as tumors or paralysis lead to a deviant vibration and consequently changes the fundamental frequency.22 F0SD depicts the amount of variation in the frequency of vocal fold vibrartion.23

Jitter and shimmer are defined as the cycle-to-cycle variation in frequency and amplitude during phonation, respectively.21 Normally, in healthy speakers, vibration of vocal folds show a low-level jitter, and higher levels of jitter are observed in pathological voices.24 Variation in shimmer is mostly detected when there is a mass lesion in the vocal folds such as edema, polyps or carcinomas.25 Values above 1.04% for jitter and above 3.81% for shimmer in adult speakers are considered pathological.26

HNR, also known as signal-to-noise ratio, depicts the degree of periodicity in a signal.27 It is an estimate of energy in the harmonics of voice signal and the noise energy in the signal.21 HNR values are usually higher in normal voice than in pathological voice, since normal voices are more sonorant than pathological ones. HNR values below 7dB are stated to be pathological.28 , 29

CPP is another measure that corresponds to the degree of regularity and periodicity of the voice signal.27 Derived via linear regression, CPP gauges “the relative amplitude of the CPP in relation to the expected amplitude.”14 , 30 Dysphonic voices show lower CPP values in comparison to normal voices, which exhibit higher level of CPP.10

Difference between the amplitude of the first and second harmonics (H1-H2) is indicative of the degree of glottal adduction in different voices.31 As this parameter reflects changes in the open quotient, the higher is the value of (H1-H2), the greater is the open quotient. The (H1-H2) measure is associated with breathiness in the voice. Breathy voices show higher (H1-H2) values; however, strained voices exhibit lower (H1-H2) values.32

In Praat,33 the number of voice breaks (NVB) is defined as “the number of distances between consecutive pulses that are longer than 1.25 divided by the pitch floor.” It is believed that normal voices show a lower number of voice breaks than pathological voices.

MPT is an aerodynamic parameter that describes the maximum length with which a vowel can be vocalized continuously and is expressed in second.16 Generally, phonation time less than 10 seconds is considered abnormal.34

Since COVID-19 is mostly considered as a respiratory disease, and many of its symptoms are associated with the larynx and the lungs infections, those acoustic parameters that represent these organs’ functions, are chosen to be analyzed. Differences in the acoustic parameters of voice between patients and healthy participants could be considered as one of the diagnostic tools, which depict the involvement of the larynx and the other respiratory organs; hence, this study compares COVID-19 patients with healthy individuals to evaluate the effect of this disease on the noted acoustical parameters without resorting to invasive methods.

MATERIAL AND METHODS

The National Institute for Medical Research Development (NIMAD) Ethics Committee in Iran approved the study protocol under the code IR.NIMAD.REC.1399.056. All the participants had given their informed consent to use their speech samples for research purposes.

The present study was an observational case-control study.

Participants

Simple random sampling method was employed to choose the healthy participants. Patients were chosen from those people who were hospitalized at the Baqiyatallah hospital. Diagnosis of participants with COVID-19 was carried out using the World Health Organization (WHO) interim guidance.35 Upon admission, chest computed tomographic (CT) scan and swab test were performed for patients. Since the scan results are readily available (compared with the swab test which takes at least 24 hours), diagnosis was made based on the CT results; Moreover, positive results on a reverse-transcriptase polymerase chain reaction (PT-PCR) assay of a specimen obtained on a nasopharyngeal swab, indicated the confirmation of COVID-19. Therefore, all patients in this study were positive based on the two methods.

At the initial stage of sampling, data from 100 healthy participants and 100 individuals with COVID-19 was collected. All these participants completed a questionnaire. The questionnaire contains questions about participants’ gender, age, health background (including questions about any history of asthma, COPD, laryngitis, and chronic bronchitis), their smoking habits, whether they have any history of substance abuse, whether the participants recently traveled during the COVID-19 pandemic and whether the participants were/are in contact with someone who was/is tested positive for COVID-19.

The inclusion criteria for the healthy participants was to be a nonsmoker and nondrug addict, someone who did not travel during the pandemic and had/has no contact with a person who has contracted COVID-19, in addition to having no prior voice disorder or any kind of respiratory disease. The inclusion criteria for patients were the same as healthy participants except for the traveling and being in contact with a COVID-19 patient.

The final number of participants who met the inclusion criteria was 147. Participants were then divided into an experimental and a control group. In the experimental group, speech samples of 77 speakers who had the disease were initially collected, but, after the first recording session, 13 patients were either transferred to the ICU or passed away. The final number of participants whose data were used in the experimental group was thus 64 (38 male, 26 female). Their age ranged between 16 and 77 (mean = 52.3 years, SD = 12.89).

The control group comprised of 70 (33 male and 37 female) healthy Persian speakers who were aged 18 to 70 (mean = 42.35 years, SD = 10.01).

Voice recordings

All recordings were obtained using ZOOM H5 handy recorder with a sampling rate of 44100 Hz and 16 bit quantization. During the recordings, the recorder was held at the distance of 20 cm with a 45° angle from the participants’ mouth. Before starting the main recording sessions, the examiners demonstrated the task individually for each participant. To minimize the effect of intonational changes and any irregularity caused by the coarticulation effect, only a prolonged vowel was used.36 All the participants were asked to produce a vowel, namely /a/, in as long a time as they could, at their comfortable pitch and with a flat tone and a constant amplitude. For the MPT assessment, the participants were asked to take a deep breath before producing the /a/ sound.

Two sessions of recording were carried out for each participant. In the control group, the recordings were carried out on two different sessions. In the experimental group, the interval between recordings was two days. The first recording was recorded on the day the participants were hospitalized and the second one, 2 days after their first recording session.

Voice recordings were done by two hospital nurses who were trained to do the speech data collection. All safety measures were taken by these nurses while recording; they wore face mask and face shield, disposable gloves and suits. The recorder was also sterilized before and after each recording session, using alcohol pads.

Parameter extraction

Two different methods were used for extracting the acoustical parameters related to voice quality. A Praat33 script was used to extract the local values of jitter, shimmer, MPT, and the number of voice breaks. The measurements were performed using the default settings in Praat. Fundamental frequency, CPP, HNRs, and (H1-H2) were automatically extracted using VoiceSauce,37 a freeware for voice analysis. F0 measurement was done using the default algorithm of VoiceSauce, that is, STRAIGHT.38 By default, VoiceSauce detects F0 at 1-ms intervals and computes the harmonic spectra magnitude, pitch-synchronously over a three-cycle window; however, the default was changed into 5-ms intervals. In VoiceSauce, CPP is calculated using the algorithm proposed by Hillenbrand et al.14 HNR values are gauged using a variable window of five pitch periods by de Krom's algorithm.39 HNR05, HNR15, HNR25, HNR35 measure HNR form 0-500 Hz, 0-1500 Hz, 0-2500 Hz and 0-3500 Hz, respectively. This study used HNR35 (henceforth HNR). Finally, H1*-H2* was used for (H1-H2) assessment, which is the H1-H2 corrected for the effect of formants based on the algorithm proposed by Iseli et al40 and used in VoiceSauce.

Statistical analysis

Data were analyzed in R (R Core Team 2020) version 4.0.0.41 Due to the large number of tokens extracted from the acoustic analysis of the parameters in VoiceSauce (F0, CPP, HNR, and H1-H2), the average of the results for each parameter and each participant was first calculated. Two values were thus obtained for each parameter, each representing the value of that parameter in a repetition for each participant. Another averaging process was then carried out on the results obtained from all the parameters’ repeated recordings; thus, for each participant and each parameter, only one value was obtained.

A two-way ANOVA was performed to evaluate the effects of participants’ health status (healthy/sick [infected]) and gender (male/female) on the different acoustic parameters studied in this research. The data collected on the F0, CPP, HNR, H1H2 parameters were normally distributed according to Shapiro-Wilk's test of normality (P > 0.05). To evaluate the homogeneity of variances, Levene's test of homogeneity of variances was run. The assumption of homogeneity of variances was met (P > 0.05). Since data in jitter, shimmer, MPT and F0SD did not show a normal distribution, the nonparametric Wilcoxon rank-sum test was used. As for the number of voice breaks, only a descriptive analysis will be reported.

Figure 1 exhibits the data distribution for all the parameters across the different genders and for the different health status.

FIGURE 1.

FIGURE 1

Violin plot of the distribution of different acoustic parameters of voice in male and female participants categorized by their health status. The parameters represented in Figures A-D in the first row were normally distributed, and the parameters represented in Figures E-H in the second row were not normally distributed.

CPP, cepstral peak prominence; F0, fundamental frequency; HNR, harmonics-to-noise ratio; H1H2, difference between the first and second harmonic amplitude; MPT, maximum phonation time; F0SD, standard deviation of fundamental frequency.

RESULTS

Table 1 summarizes the total number of voice breaks (NVB) in the female and male participants grouped by their health status and across the two repetitions.

TABLE 1.

The Descriptive Analysis of the Number of Voice Breaks

Gender Health Status Voice Break Number Percent Summary
Female Healthy 0 57 77 77%
1–2 17 23 23%
3–5 0 0
>6 0 0
Patient 0 25 48.1 48.1%
1–2 19 36.5 51.9%
3–5 6 11.5
>6 2 3.9
Male Healthy 0 47 71.6 71.2%
1–2 17 25.8 28.8%
3–5 1 1.5
>6 1 1.5
Patient 0 39 51.3 51.3%
1–2 27 35.5 48.7%
3–5 5 6.6
>6 5 6.6

Based on the data in Table 1, there was no voice break in 77% of the voice samples from the healthy female participants; as for the healthy male participants, the rate was 71.2%.

In the healthy groups, at least one voice break was observed in 23% of the women's and 28.8.9% of the men's phonation. The percentage of nonoccurrence of voice breaks dropped to 48.1% for the female and 51.3% for the male patients in the COVID-19 group. A total of 51.9 % of the data form the women and 48.7% of the data from the men in the patients’ group showed at least one voice break during the articulation of the sustained vowel /a/. Overall, the percentage of NVB was higher in them in comparison with the healthy participants.

Table 2 presents the mean, standard deviation and range of all the acoustic parameters in the healthy and infected male participants.

TABLE 2.

Descriptive Data Pertaining to the Acoustic Parameters Grouped by Health Status in the Male Participants

Parameters Healthy
Patient
Mean (SD) Range Mean (SD) Range
F0 (Hz) 140.61 (24.72) 107.93–206.55 146.11 (26.62) 109.51–210.80
F0 SD (Hz) 8.19 (7.08) 1.96–28.10 10.73 (8.76) 2.20–37.31
Jitter% 0.43 (0.15) 0.22–1.04 0.95 (0.96) 0.36–5.44
Shimmer% 3.28 (1.13) 1.17–6.59 4.66 (2.52) 1.57–13.20
HNR (dB) 46.74 (6.42) 28.89–58.89 26.79 (6.27) 16.41–38.60
CPP (dB) 26.55 (2.69) 19.51–31.88 24.17 (3.26) 17.89–30.98
H1H2 (dB) 3.38 (3.79) −2.5–11.19 7.23 (2.89) 2.11–13.19
MPT (s) 14.69 (4.35) 6.69–24.92 7.02 (2.05) 3.95–13.60

Table 3 presents the mean, standard deviation, and range of all the acoustic parameters in the healthy and infected female participants.

TABLE 3.

Descriptive Data Pertaining to the Acoustic Parameters Grouped by Health Status in the Female Participants

Parameters Healthy
Patient
Mean (SD) Range Mean (SD) Range
F0 (Hz) 209.99 (23.20) 152.79–261.19 207.41 (31.73) 166.55–327.50
F0 SD (Hz) 8.87 (7.67) 2.17–37.20 16.70 (9.04) 4.43–43.33
Jitter% 0.41 (0.14) 0.16–0.84 1.181 (1.18) 0.35–5.58
Shimmer% 3.78 (1.68) 1.39–9.82 5.12 (3.67) 1.43–18.51
HNR (dB) 53.42 (4.77) 43.83–64.99 30.36 (6.73) 17.05–41.47
CPP (dB) 25.13 (2.67) 20.33–31.09 22.22 (2.96) 15.85–29.09
H1H2 (dB) 8.27 (3.68) 1.72–14.96 10.06 (2.76) 4.65–14.18
MPT (s) 13.55 (4.02) 5.47–26.52 6.21 (2.50) 2.15–10.97

A two-way ANOVA was performed to separately examine the effect of status and gender on F0, CPP, HNR and H1H2. The analysis showed that gender has a significant effect on F0 values (F1, 130 = 203.532; P < 0.001; ηg 2 = 0.610); however, there was neither a significant difference between F0 values in the healthy and infected participants (F1, 130 = 0.146; P > 0.001; ηg 2= 0.001), nor a significant interaction between status and gender (F1, 130 = 0.771; P > 0.001; ηg 2= 0.006). When CPP was considered the dependent variable and status and gender the independent variables, the analysis revealed a significant dependence on status (F1, 130 = 26.741; P < 0.001; ηg 2= 0.171), and gender (F1, 130 = 10.772; P < 0.001; ηg 2= 0.077). Nonetheless, no significant interaction was observed between status and gender (F1, 130 = 0.273; P > 0.001; ηg 2 = 0.002). The analysis displayed a significant difference in HNR values between the healthy and infected participants (F1, 130 = 414.303; P < 0.001; ηg 2 = 0.761) and between the female and male participants (F1, 130 = 24.637; P < 0.001; ηg 2 = 0.159). No interaction was observed between status and gender (F1, 130 = 2.170; P > 0.001). The analysis also showed that H1H2 values were significantly affected by participants’ health status (F1, 130 = 23.794; P < 0.001; ηg 2 = 0.155), and gender (F1, 130 = 44.180; P < 0.001; ηg 2 = 0.254). No significant interaction was found between status and gender (F1, 130 = 2.825; P < 0.001; ηg 2 = 0.021).

Wilcoxon's rank-sum test for jitter (P = 0.39 > 0.05; r = 0.07), shimmer (P= 0.68 > 0.05; r = 0.035) and MPT (P = 0.90 > 0.05; r = 0.010) showed that men's median jitter, shimmer, and MPT were not significantly different from women's median jitter, shimmer and MPT, respectively. Nonetheless, the difference in median jitter (P < 0.05; r = 0.657), shimmer (P < 0.05; r = 0.233), and MPT (P < 0.05; r = 0.778) between the healthy participants and the infected was significant. The results of Wilcoxon's rank-sum test for F0SD showed a significant difference between men's and women's median F0SD (P < 0.05; r = 0.175) and also a difference in the median F0SD between the healthy and infected participants (P < 0.05; r = 0.292).

Based on the results, MPT (r = 0.778), HNR (ηg 2 = 0.761), and jitter (r = 0.657) had the largest effect sizes among all the other parameters. That is, 77.8% of variability in MPT values, 76.1% of the change in HNR values and 65.7% of the change in jitter values can be accounted for the participants’ health status. As expected, 61% of the variability in F0 and 29.2% of the variability in F0SD can be interpreted as the effect of gender on these parameters’ values. According to Cohen,42 , 43 r values varying more than 0.5 indicate a large effect.

DISCUSSION

The aim of this study was to investigate whether acoustic parameters of voice differ significantly between covid-19 patients and healthy participants. Fundamental frequency (F0) and its variations (F0SD), fundamental frequency perturbation measures (ie, jitter and shimmer), harmonics-to-noise ratio (HNR), difference between the first two harmonic amplitudes (H1-H2), maximum phonation time (MPT), and CPP were thus measured. These parameters can delineate different aspects of vocal apparatus dysfunction in voice production including irregularity and aperiodicity in vocal fold vibration, airflow insufficiency, increased noise, and signal perturbations.23 , 44 , 45

Except F0, all the other acoustic parameters were significantly different between the experimental and control groups.

The results obtained in this study showed a notable difference in fundamental frequency variation (F0SD) between the healthy and infected participants, which could stem from tremor and insufficient control over laryngeal muscles in the experimental group.23 , 46 An increase in jitter and shimmer was also observed in both female and male patients. The uneven weighting of the vocal folds, which occurs due to inflammation or degeneration of the vocal fold tissues47 as a result of recurrent dry coughs, could explain the higher values of jitter and shimmer in the experimental group.

The present study also revealed a decrease in both HNR and CPP values in COVID-19 patients. A decline in these parameters is an indication of increased spectral noise in patients’ voices, which consequently led to breathier voice in the experimental group.12 , 48 Moreover, many studies have shown that an increase in H1H2 could be considered one of the acoustic indicators of breathiness,49 , 50 especially in pathological voices.51 The considerable growth in the value of the H1H2 parameter is in line with these findings. Air leakage and incomplete vocal fold closure, which may result from the trauma of vocal folds52 during recurrent coughing, may have contributed to the lowered values of HNR and CPP and breathiness in patients with COVID-19. During coughing, the mechanical forces of contact pressure are remarkably larger that in normal phonation,53 which may cause vocal fold injuries. Vomiting, as another symptom of coronavirus, can also give rise to injuries of the vocal folds because of the mechanical force of the gag reflex54 and the acidity of the gastric content, which rises up to the throat and irritated the tissues.54

Previous studies have shown more aperiodicity in pathological voices with an increase in voice break numbers.23 , 16 According to the present findings, the occurrence of voice break is almost rare in healthy participants, but in the experimental group, it had an increased incidence. This finding also confirms the voice dysfunction and the possible injury in vocal folds.

As shown by the results, the MPT is significantly below the normative data in the experimental group. The phonation duration is strongly correlated with lung volume. As noted earlier, this disease has certain effects on the lungs, which accordingly cause the airflow insufficiency for continuation of voice. Moreover, the inadequate closure of vocal folds in pathological larynx generally reduces MPT due to the leakage of through rima glottidis.55 , 44 , 16

CONCLUSION

This study revealed significantly higher values of F0SD, jitter, shimmer, H1H2, and voice break numbers in COVID-19 patients in comparison with the control group. The values of HNR, CPP, and MPT were significantly lower in the experimental group. Changes in MPT demonstrated that these patients suffer mostly from airflow insufficiency due to the involvement of the lungs, which was not far from expectation. The other changes demonstrated some laryngological involvement in patients, since they showed higher aperiodicity, irregularity, and signal perturbation and also increased levels of noise in the patients’ voice in comparison with the control group.

References

  • 1.Rothan HA, Byrareddy SN. The epidemiology and pathogenesis of coronavirus disease (COVID-19) outbreak. J Autoimmun. 2020 doi: 10.1016/j.jaut.2020.102433. February):102433. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Yuki K, Fujiogi M, Koutsogiannaki S. COVID-19 pathophysiology : a review. Clin Immunol. 2020;215 doi: 10.1016/j.clim.2020.108427. January. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kallvik E, Toivonen L, Peltola V, et al. Respiratory tract infections and voice quality in 4-year-old children in the STEPS study. J Voice. 2019;33 doi: 10.1016/j.jvoice.2018.01.021. 801.e21-801.e25. [DOI] [PubMed] [Google Scholar]
  • 4.Lechien JR, Chiesa-Estomba CM, Cabaraux P, et al. Features of mild-to-moderate COVID-19 patients with dysphonia. J Voice. 2020 doi: 10.1016/j.jvoice.2020.05.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Parsa V, Jamieson DG. Acoustic discrimination of pathological voice. J Speech Lang Hear Res. 2001;44:327–339. doi: 10.1044/1092-4388(2001/027). [DOI] [PubMed] [Google Scholar]
  • 6.Brockmann M, Drinnan MJ, Storck C, et al. Reliable jitter and shimmer measurements in voice clinics: the relevance of vowel, gender, vocal intensity, and fundamental frequency effects in a typical clinical task. J Voice. 2011;25:44–53. doi: 10.1016/j.jvoice.2009.07.002. [DOI] [PubMed] [Google Scholar]
  • 7.Dejonckere PH, Bradley P, Clemente P, et al. A basic protocol for functional assessment of voice pathology, especially for investigating the efficacy of (phonosurgical) treatments and evaluating new assessment techniques: guideline elaborated by the Committee on Phoniatrics of the European Laryngolo. Eur Arch Oto-Rhino-Laryngol. 2001;258:77–82. doi: 10.1007/s004050000299. [DOI] [PubMed] [Google Scholar]
  • 8.Lovato A, De Colle W, Giacomelli L, et al. Multi-Dimensional Voice Program (MDVP) vs Praat for assessing euphonic subjects: a preliminary study on the gender-discriminating power of acoustic analysis software. J Voice. 2016;30:765.e1–765.e5. doi: 10.1016/j.jvoice.2015.10.012. [DOI] [PubMed] [Google Scholar]
  • 9.Heman-Ackah YD, Heuer RJ, Michael DD, et al. Cepstral peak prominence: a more reliable measure of dysphonia. Ann Otol Rhinol Laryngol. 2003;112:324–333. doi: 10.1177/000348940311200406. [DOI] [PubMed] [Google Scholar]
  • 10.Watts CR, Awan SN, Maryn Y. A comparison of cepstral peak prominence measures from two acoustic analysis programs. J Voice. 2017;31:387.e1–387.e10. doi: 10.1016/j.jvoice.2016.09.012. [DOI] [PubMed] [Google Scholar]
  • 11.Watts CR, Awan SN, Lambert E. Spectral/cepstral acoustic measures differentiate hypofunctional from normal speakers purpose.
  • 12.Fraile R, Godino-llorente JI. Cepstral peak prominence : a comprehensive analysis. Biomed Signal Process Contorl. 2014;14:42–54. doi: 10.1016/j.bspc.2014.07.001. [DOI] [Google Scholar]
  • 13.Awan SN, Solomon NP, Helou LB, et al. Spectral-cepstral estimation of dysphonia severity : external validation. Ann Otol Rhinol Laryngol. 2013;122:40–48. doi: 10.1177/000348941312200108. [DOI] [PubMed] [Google Scholar]
  • 14.Hillenbrand J, Cleveland RA, Erickson RL. Acoustic correlates of breathy vocal quality. J Speech Hear Res. 1994;37:769–778. doi: 10.1044/jshr.3704.769. [DOI] [PubMed] [Google Scholar]
  • 15.Keating P, Garellek M, Kreiman J. Acoustic properties of different kinds of creaky voice. ICPhS 2015. 2015;(1):2–7. [Google Scholar]
  • 16.Karlsen T, Sandvik L, Heimdal JH, et al. Acoustic voice analysis and maximum phonation time in relation to voice handicap index score and larynx disease. J Voice. 2020;34:161.e27–161.e35. doi: 10.1016/j.jvoice.2018.07.002. [DOI] [PubMed] [Google Scholar]
  • 17.Schindler A, Mozzanica F, Vedrody M, et al. Correlation between the voice handicap index and voice measurements in four groups of patients with dysphonia. Otolaryngol - Head Neck Surg. 2009;141:762–769. doi: 10.1016/j.otohns.2009.08.021. [DOI] [PubMed] [Google Scholar]
  • 18.Hollien H. Some laryngeal correlates of vocal pitch. J Speech Hear Res. 1960;3:52–58. doi: 10.1044/jshr.0301.52. [DOI] [PubMed] [Google Scholar]
  • 19.Titze IR. Prentice Hall; 1994. Principles of Voice Production.https://books.google.com/books?id=m48JAQAAMAAJ [Google Scholar]
  • 20.Seikel JA, Drumright DG, Seikel P. Delmar Cengage Learning; 2013. Essentials of Anatomy & Physiology for Communication Disorders.https://books.google.com/books?id=wD-9MgEACAAJ [Google Scholar]
  • 21.Patel RR, Harris MS, Halum SL. In: Sataloff's Comprehensive Textbook of Otolaryngology Head & Necck Surgery: Laryngology (Vol. 4) Sataloff RT, editor. Jaypee Brothers Medical Publishers (P) Ltd: The Health Science Publisher; New Delhi, London, Philaldelphia, Panama: 2016. Objective voice assessment; pp. 155–168. [Google Scholar]
  • 22.Davis SB. Acoustic characteristics of normal and pathological voices. Speech Lang. 1979;1:271–335. doi: 10.1016/b978-0-12-608601-0.50010-3. [DOI] [Google Scholar]
  • 23.Rusz J, Klempíř J, Baborová E, et al. Objective acoustic quantification of phonatory dysfunction in Huntington's disease. PLoS One. 2013;8 doi: 10.1371/journal.pone.0065881. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Behrman A. 3rd ed. Plural Publishing; San Diego, CA: 2018. Speech and Voice Science. [Google Scholar]
  • 25.Rosa IS. Analise acústica da voz de indivíduos na terceira idade. 2005.
  • 26.Deliyski DD. Third European Conference on Speech Communication and Technology, EUROSPEECH. Berlin; 1993. Acoustic model and evaluation of pathological voice production. pp. 22–25. [Google Scholar]
  • 27.Hillenbrand JM. Acoustic analysis of voice: a tutorial. Perspect Speech Sci Orofac Disord. 2011;21:31. doi: 10.1044/ssod21.2.31. [DOI] [Google Scholar]
  • 28.Boersma P. Acurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. IFA Proc 17. 1993;17:97–110. http://www.fon.hum.uva.nl/paul/papers/Proceedings_1993.pdf [Google Scholar]
  • 29.Teixeira JP, Oliveira C, Lopes C. Vocal acoustic analysis – jitter, shimmer and HNR parameters. Procedia Technol. 2013;9:1112–1122. doi: 10.1016/j.protcy.2013.12.124. [DOI] [Google Scholar]
  • 30.Hillenbrand JM, Houde RA. Acoustic correlates of breathy vocal quality : dysphonic voices and continuous speech. J Speech Hear Res. 1996;39:311–321. doi: 10.1044/jshr.3902.311. [DOI] [PubMed] [Google Scholar]
  • 31.Hanson HM, Chuang ES. Glottal characteristics of male speakers: acoustic correlates and comparison with female data. J Acoust Soc Am. 1999;106:1064–1077. doi: 10.1121/1.427116. [DOI] [PubMed] [Google Scholar]
  • 32.Sapienza C, Hoffman-Ruddy B. 3rd ed. Plural Publishing; San Diego, CA: 2018. Voice Disorders. [Google Scholar]
  • 33.Boersma P, Weenink D. Praat: doing phonetics by computer. 2020. www.praat.org.
  • 34.Omori K. Diagnosis of voice disorders. Japan Med Assoc J. 2011;54:248–253. [Google Scholar]
  • 35.Organization WH. World Health Organization; Geneva: 2020. Clinical Management of Severe Acute Respiratory Infection When Novel Coronavirus (2019-NCoV) Infection Is Suspected: Interim Guidance, 28 January 2020. [Google Scholar]
  • 36.Laver J, Hiller S, Beck JM. Acoustic waveform perturbations and voice disorders. J Voice. 1992;6:115–126. doi: 10.1016/S0892-1997(05)80125-0. [DOI] [Google Scholar]
  • 37.Shue Y-L, Keating P, Vicenik C. Voicesauce: a program for voice analysis. J Acoust Soc Am. 2009;126:2221. doi: 10.1121/1.3248865. [DOI] [Google Scholar]
  • 38.Kawahara H, Masuda-Katsuse I, de Cheveigné A. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds. Speech Commun. 1999;27(3-4):187–207. doi: 10.1016/S0167-6393(98)00085-5. [DOI] [Google Scholar]
  • 39.Krom G De. A cepstrum-based technique for determining a harmonics-to-noise ratio in speech signals. J Speech Hear Res. 1993;36:254–266. doi: 10.1044/jshr.3602.254. [DOI] [PubMed] [Google Scholar]
  • 40.Iseli M, Shue Y, Alwan A. Age, sex, and vowel dependencies of acoustic measures related to the voice source a) J Acoust Soc Am. 2007;121:2283–2295. doi: 10.1121/1.2697522. [DOI] [PubMed] [Google Scholar]
  • 41.R Core Team. R: A Language and Environment for Statistical Computing. 2020. https://www.r-project.org/.
  • 42.Cohen J. Statistical power analysis. Curr Dir Psychol Sci. 1992;1:98–101. doi: 10.1111/1467-8721.ep10768783. [DOI] [Google Scholar]
  • 43.Cohen J. 2nd ed. Lawrence Erlbaum Associates; Hillside, NJ: 1988. Statistical Power Analysis for the Behavioral Sciences. [Google Scholar]
  • 44.Yanagihara N, Koike Y, Von Leden H. Phonation and respiration. Function study in normal subjects. Folia Phoniatr (Basel) 1966;18:323–340. http://europepmc.org/abstract/MED/5959236 [PubMed] [Google Scholar]
  • 45.Ptacek PH, Sander EK. Breathiness and phonation length. J Speech Hear Disord. 1963;28:267–272. doi: 10.1044/jshd.2803.267. [DOI] [PubMed] [Google Scholar]
  • 46.Zwirner P, Murry T, Woodson GE. Phonatory function of neurologically impaired patients. J Commun Disord. 1991;24:287–300. doi: 10.1016/0021-9924(91)90004-3. [DOI] [PubMed] [Google Scholar]
  • 47.Ferrand CT. Allyn and Bacon; Boston: 2012. Voice Disorders: Scope of Theory and Practice. [Google Scholar]
  • 48.Hartl DM, Hans S, Vaissière J, et al. Objective voice quality analysis before and after onset of unilateral vocal fold paralysis. J Voice. 2001;15:351–361. doi: 10.1016/S0892-1997(01)00037-6. [DOI] [PubMed] [Google Scholar]
  • 49.Klatt DH, Klatt LC. Analysis, Synthesis, and Perception of Voice Quality Variations among Female and Male Talkers. Vol 87.; 2014. [DOI] [PubMed]
  • 50.Bickley C. Acoustic analysis and perception of breathy vowels. MIT Res Lab Electron Speech Commun Gr Work Pap. 1982;1:71–81. [Google Scholar]
  • 51.De Krom G. Some spectral correlates of pathological breathy and rough voice quality for different types of vowel fragments. J Speech Hear Res. 1995;38:794–811. doi: 10.1044/jshr.3804.794. [DOI] [PubMed] [Google Scholar]
  • 52.Dogan M, Eryuksel E, Kocak I. Subjective and objective evaluation of voice quality in patients with asthma. 2007;21:224-230. doi:10.1016/j.jvoice.2005.11.003 [DOI] [PubMed]
  • 53.Hess MM, Verdolini K, Bierhals W, et al. Endolaryngeal contact pressures. J Voice. 1998;12:50–67. doi: 10.1016/S0892-1997(98)80075-1. [DOI] [PubMed] [Google Scholar]
  • 54.Datta R, Datta K, Venkatesh MD. Laryngopharyngeal reflux : larynx on fire. Med J Armed Forces India. 2010;66:245–248. doi: 10.1016/S0377-1237(10)80049-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Ortega J, Cassinello N, Dorcaratto D. Computerized acoustic voice analysis and subjective scaled evaluation of the voice can avoid the need for laryngoscopy after thyroid surgery. Surgery. 145(3):265-271. doi:10.1016/j.surg.2008.11.002 [DOI] [PubMed]

Articles from Journal of Voice are provided here courtesy of Elsevier

RESOURCES