Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Feb 1.
Published in final edited form as: Laryngoscope. 2021 Jul 21;132(2):391–397. doi: 10.1002/lary.29763

The impact of face masks on speech acoustics and vocal effort in healthcare professionals

Victoria S McKenna a,b,§, Courtney L Kendall a, Tulsi H Patel a, Rebecca J Howell c, Renee L Gustin c
PMCID: PMC8742743  NIHMSID: NIHMS1723956  PMID: 34287933

Abstract

Objective:

We investigated speech acoustics and self-reported vocal symptoms in mask-wearing healthcare professionals. We hypothesized that there would be an attenuation of spectral energies and increase in vocal effort during masked speech compared to unmasked speech.

Methods:

We prospectively enrolled 21 healthcare providers (13 cisgender female, 8 cisgender male; M=32.9 years; SD=7.9 years) and assessed acoustics and perceptual measures with and without a face mask in place. Measurements included: i) acoustic vowel articulation index (VAI), ii) cepstral and spectral acoustic measures, iii) traditional vocal measures (e.g., fundamental frequency, intensity), iv) relative fundamental frequency (RFF), and v) self-reported ratings of vocal effort and dyspnea.

Results:

During masked speech, there was a significant reduction in VAI, high-frequency information (>4 kHz), and RFF offset 10, as well as a significant increase in cepstral peak prominence and perceived vocal effort. Further investigation showed that high frequency attenuation was more pronounced when wearing an N95 mask compared to a simple mask.

Conclusion:

Face masks pose an additional barrier to effective communication that primarily impacts spectral characteristics, vowel space measures, and vocal effort. Future work should evaluate how long-term mask use impacts vocal health and may contribute to vocal problems.

Level of Evidence:

Level 3

Keywords: Speech-language pathology, acoustics, voice disorders

INTRODUCTION

Healthcare professionals represent a subset of individuals who must communicate verbally to meet their work demands, otherwise known as occupational voice users.1 It is well established that occupational voice users develop vocal problems at higher rates than non-occupational voice users, often leading to negative socioeconomic and emotional consequences.18 In 2014, a survey of 362 healthcare workers (e.g., physicians, speech-language pathologists) reported that nearly 50% frequently spoke in a loud voice.9 Furthermore, 15% of those healthcare workers who experienced a voice disorder in the past year attributed their voice issues to a high vocal load. Despite the high demand for communication in the workplace, few studies have examined vocal risk factors pertaining to healthcare professionals.

Over the past year, healthcare professionals have been required to wear face masks to reduce the transmission of COVID-19.10 By obscuring the nose and mouth of the speaker, face masks pose a unique challenge to effective communication exchange for the listener; this includes inhibiting one’s ability to lip-read, see facial expressions, and extract non-verbal pragmatic intent. Wong et al.11 reported that mask-wearing healthcare professionals have more difficulty establishing patient rapport compared to unmasked professionals.

Although there are several studies that describe the impact of face masks on the listener,1214 there is a need to understand how the speaker may be impacted during communication exchanges. An analysis of artificial speech stimuli demonstrated that masks attenuate speech frequencies between 2–7 kHz at a range of 3–12 dB, depending on mask type.15 Consistent with this finding, Nguyen et al.16 reported attenuation of speech frequencies between 1–8 kHz with a greater impact from N95 masks compared to surgical masks in a group of 16 speakers. Investigation into specific spectral bands (1 kHz bins up to 10 kHz) revealed that N95 masks attenuate frequencies from 3–10kHz, whereas simple masks do not impact speech frequencies until about 5 kHz.17 As high frequency spectral information is critical for speech intelligibility,18 it is possible that speakers may be making compensatory articulatory adjustments to maintain effective communication. To our knowledge, no study has investigated the effect of masks on speech articulation.

In addition to the paucity of research on speech acoustics, there is a general lack of understanding of how masks impact vocal function. A recent survey of over 400 participants found that mask-wearing adults report increased vocal effort as well as difficulty coordinating speaking and breathing.19 However, it does not seem that face masks affect vocal perturbation or breathiness correlates (e.g., jitter, shimmer),17,20 though these studies did not investigate perceived vocal effort and vocal symptoms. To date, an analysis detailing the impacts of mask-wearing on acoustic and perceptual measures has not been undertaken in the health field.

The aim of the current study was to examine speech acoustics and self-reported vocal symptoms during masked and unmasked communication in working healthcare professionals. A secondary objective was to investigate whether additional factors (type of mask, sex) impacted these measures. We hypothesized that masked communication would attenuate spectral information16,17 and subsequently elicit compensatory speaking behaviors, such as increased vocal volume and articulatory space, in order to overcome the acoustic changes. We further hypothesized that participants would report greater amounts of vocal effort and dyspnea along with concurrent increases in voice acoustics indicative of vocal effort during masked communication.

MATERIALS and METHODS

Participants

Twenty-one healthcare professionals (8 cisgender male, 13 cisgender female) aged 23–49 years (M=32.9 years; SD=7.9 years) were prospectively enrolled in the study. Participants were speakers of Standard American English, reported no history of neurological disease or head/neck cancer, and were free from speech, language, hearing, and voice problems at the time of the study. Participants were non-smokers and non-vapers. Some participants (N=8) reported health histories of gastroesophageal reflux and/or asthma, but were not excluded based on these diagnoses.

Occupations primarily included speech-language pathologists (N=7), physicians (N=5), physical therapists (N=3), and respiratory therapists (N=3). The types of masks that participants wore at work included simple disposable masks (referred to as “simple masks”) and N95 respirators, often with the addition of simple masks over them. Table 1 provides demographic information about each participant. Participants were recruited between July and September of 2020 and were free from COVID-19 symptoms (e.g., cough, fever) at the time of the study. All methodology described in this study was approved by the Institutional Review Board at the University of Cincinnati. All participants provided informed, written consent prior to participation.

Table 1.

Participant demographic information.

Subject Age (years) Sex Mask Type Occupation Relevant Health History
01 39 F N95 + Simple Physician GERD
02 36 F N95 + Simple Physician
03 30 F N95 + Simple SLP
04 33 F N95 + Simple SLP Asthma; GERD
05 27 F N95 + Simple SLP
06 31 F N95 + Simple SLP
07 24 F N95 SLP
08 25 F Simple SLP GERD
09 24 F Simple RT
10 24 F Simple RT Childhood asthma
11 35 F Simple Nurse
12 43 F Simple Medical Admin.
13 49 F Simple PT
14 41 M N95 + Simple Physician Childhood asthma
15 44 M N95 + Simple Physician
16 45 M Simple Physician Childhood asthma; GERD
17 35 M Simple PT
18 27 M Simple PT Exercise induced asthma
19 26 M Simple Medical Sales
20 29 M Simple SLP Asthma; GERD
21 23 M Simple RT

Note. SLP = speech-language pathologist; PT = physical therapist; RT = Respiratory therapist; Admin = administrator.

Protocol

Recordings were completed in a quiet room at the participant’s place of employment prior to starting their work shift. Participants donned a headset microphone (MicroMic C555L) attached to a handheld recorder (Zoom HN4) set to acquire acoustic data at a sampling rate of 44.1 kHz with a 16-bit resolution. The microphone was placed at a 45° angle from the midline of the lips at a distance of 8.5 cm; the non-standard microphone distance was to allow for the placement and removal of face masks. Microphone recordings were calibrated to sound pressure level (dB SPL) using a sound level meter (Extech; dB A). To calculate vocal intensity (dB SPL) from the speech recordings, pure tones (500 Hz) were played at varying intensity levels from a mobile phonea that was held at the lips, while the intensity was measured with the sound level meter held at the microphone.

Participants were instructed to read aloud a series of vowels, words, and sentences with corner vowel targets (/i/, /u/, /ɑ/) in order to calculate a vowel-space metric as an indirect measure of articulatory movements in the oral cavity. Participants also read the first paragraph of the Rainbow Passage to acquire a sample of running speech for cepstral and spectral acoustical analyses.21 Finally, participants produced 12 repetitions of vowel-voiceless consonant-vowel (VCV) productions (e.g., /ifi/) to calculate relative fundamental frequency (RFF) for characterizing fundamental frequency (fo) during voicing transitions. Please see Table 2 for complete list of speech tasks. All tasks were completed with (masked) and without a face mask (unmasked).

Table 2.

Speech Stimuli

Sustained vowels × 5 seconds in duration
  • /i/, /u/, /ɑ/

Single words repeated × 3 each
  • Heed, who’d, hod

Sentences (bolded word was analyzed)
  • I wish he would heed my advice.

  • I asked myself, “Who’d do that?”

  • A brick hod is a three-sided box.

  • My father hid food to feed the cat on Tuesday morning.

  • The cat happened to see the food, my father had hid in her pod, at noon time.

  • The fat cat was hot from her sleep in the noon sun beams.

Rainbow Passage (paragraph 1)
VCV utterances, repeated × 4 each
  • /ɑfɑ/, /ifi/, /ufu/

Participants made perceptual ratings of their dyspnea and vocal effort following masked and unmasked readings. The modified Borg Scale22 for dyspnea is a category-ratio scale from 0–10. The prompt asks the participant to rate: “How much difficulty is your breathing causing you right now?” and was slightly modified to be indicative of their breathing difficulty during the speech readings. A self-rating of “0” indicated that the participant’s breathing was not causing any difficulty at all, whereas a score of 10 indicated “Maximal” difficulty. Next, participants rated the amount of vocal effort on a 100-mm visual-analog scale (VAS) in which effort was described as “an exertion of the voice or how hard you have to try to make a voice.” A rating on the left-side of the scale represented “no effort” and would be indicative of a lower score, whereas a rating on the right was anchored as “most effort” and measured as a higher rating.23,24 Participants were instructed to place a single line on the scale to indicate their perceived vocal effort.

Data Extraction

Vowel Analysis.

Acoustics measures were manually extracted from vowel segments using Praat software.25 Praat pitch settings were modified for the self-reported sex of each participant, wherein a female range was 90–500 Hz and a male participant range was set from 60–300 Hz. A trained researcherb extracted the middle portion of the sustained vowel (~3 seconds in duration), and midsegment of the vowel (~100 ms in duration) during single word and sentence stimuli. The midsegment was identified to exclude vocalic offset or onset behavior prior to subsequent perturbation analyses. From these segments, the researcher used the Voice Report tool to extract measures of mean fo, standard deviation (SD) of fo, jitter, shimmer, and harmonics-to-noise ratio (HNR). Praat was also used to extract the vocal intensity level (dB SPL) that was then adjusted to the actual intensity levels from the SPL calibration procedure described earlier.

First (F1) and second (F2) formant values were extracted using a wide-band spectrogram over the same selected vowel segments. Formant values were then averaged for each vowel of interest (i.e., /i/, /u/, /ɑ/) and Vowel Articulation Index (VAI) was calculated (Eq. 1). The VAI is an estimate of vowel space that minimizes the effects of inter-speaker variability27 and instead normalizes the relationships between vowel space across speakers.28 A smaller VAI value indicates vowel centralization.29

VAI=F2/i/+F1/ɑ/F1/i/+F1/u/+F2/u/+F2/ɑ/ (1)

Spectral and Cepstral Analysis.

The second and third sentences of the Rainbow Passage were analyzed21 using the Analysis in Dysphonia in Speech and Voice (ADSV) software (ver. 4.0). The following steps were completed: 1) removal of pauses >150ms, with most pauses occurring at punctuation boundaries, 2) down-sampling the acoustic signal to 22.5 kHz, and 3) setting the software to the “Rainbow passage” analysis setting. This set the spectral window size to 1024 with 75% overlap and a cepstral time averaging of 7. The cepstral peak extraction range was modified for each participant based on their self-reported sex, with 90–500 Hz for females and 60–300 Hz for males. Vocalic detection was applied to improve accuracy of extraction. The low-to-high spectral ratio (L/H ratio) cut-off frequency was set to 4 kHz, which is consistent with previous analyses of effort and dysphonia.23 From here, the cepstral peak prominence (CPP) and its SD (CPP SD) and the L/H ratio and its SD (L/H ratio SD) were automatically calculated for each recording.

RFF.

RFF values were extracted from each VCV production using a semi-automated MATLAB algorithm,30 which calculated the instantaneous fo of the voicing cycles during the transition into/out of the voiceless consonant, normalized cycle fo to the steady-state fo of the closest vowel, and converted resulting values into semitones (ST; Eq. 2). We analyzed the RFF values of the voicing cycles closest to the voiceless consonant (offset cycle 10, onset cycle 1). These cycles been shown to be the most sensitive to changes in the physiological mechanisms underlying RFF—including vocal fold abduction, increased laryngeal muscle tension to cease vocal fold vibration, and aerodynamic forces to reinitiate vocal fold oscillation31—and are significantly lower in people with voice disorders characterized by increased laryngeal tension3133 and in those reporting elevated vocal effort.23,34 As a result of the algorithmic processing, the average number of usable offset and onset RFF instances was 7.0 and 7.5, respectively, out of 12 repetitions. Usable instances were averaged within-participant35 prior to subsequent analyses, resulting in one offset 10 cycles and one onset cycle 1 value per participant.

RFF(ST)=12×log2(foforef) (2)

Self-perceptual Analysis.

Self-perceptual ratings were extracted from paper datasheets and transferred to a Microsoft Excel document (Microsoft Office ver. 2016). The VAS ratings were measured and reported in millimeter units by one rater, then re-checked by a second rater who was blinded to the first rating. Because all re-checked effort ratings were within 1 mm of the original measurement, the original measurement was used for analysis. All category-ratio scale ratings of dyspnea were deemed 100% reliable and identical to the original transfer.

Statistical Analysis

The aims of the study were 1) to investigate acoustic and perceptual measures during masked and unmasked conditions, and 2) to examine the impact of additional factors (mask type, participant sex) on these measures. Therefore, we completed separate mixed-effects analyses of variance (ANOVA) models for each acoustical and perceptual measure. The fixed effects were condition (masked, unmasked), mask type (N95, simple), and sex (male, female), as well as their two- and three-way interactions. Participant was input as a random factor. Significance was set to α<.05. Mixed-effect models were assessed for normality and homoscedasticity, resulting in the appropriate model fit for the data.

Post hoc analyses were completed via Tukey’s simultaneous tests, which automatically adjusted for family-wise error at the time of analysis (i.e., to reduce type 1 error). Subsequently, the α criterion of padj<.05 was established for all pairwise comparisons. Cohen’s d, calculated as (μ12)/pooled SD, was reported to quantify the effect sizes of significant findings. Reliability analyses were completed in R software (ver. 4.0.2), whereas all other analyses were completed in Minitab statistical software (ver. 19).

RESULTS

Acoustic Measures

There were several significant differences between masked and unmasked conditions (Table 3). Our results showed significant increases in HNR (p=.002, d=.39), CPP (p=.001, d=.59), and L/H ratio (p<.001, d=1.18) during masked speech compared to unmasked. Conversely, L/H SD (p=.006, d=.61), RFF offset 10 (p=0.34, d=.19) and VAI (p=.039, d=.69) all significantly decreased during the masked condition.

Table 3.

Results of significant statistical findings for acoustic and perceptual measures during masked and unmasked conditions. Effect size interpretations are based on criteria from Cohen.43

Measure Unmasked Mean (SD) Masked Mean (SD) p-value d Effect Size Interpretation
HNR (dB) 18.55 (3.69) 20.04 (4.05) .002 .39 Small
CPP (dB) 7.37 (1.24) 8.04 (0.99) .001 .59 Medium
L/H Ratio (dB) 39.86 (4.01) 44.39 (3.69) <.001 1.18 Large
L/H Ratio SD (dB) 8.47 (1.67) 7.69 (0.70) .006 .61 Medium
RFF offset 10 (ST) −0.77 (0.90) −0.96 (1.05) .034 .19 Small
VAI 0.91 (0.06) 0.87 (0.06) .039 .69 Medium
Dyspnea 0.18 (0.33) 1.45(1.08) .002 1.59 Large
Vocal Effort (mm) 11.6 (12.05) 41.2 (21.67) <.001 1.69 Large

Note. HNR=harmonics-to-noise ratio; CPP=cepstral peak prominence; L/H Ratio=Low-to-high spectral ratio; RFF=relative fundamental frequency; VAI=vowel articulation index. Effect size calculation= (Mean1-Mean2)/SDpooled

Female participants exhibited significantly greater mean fo (p<.001, d=4.04) and CPP SD (p=.001, d=1.65) compared to male participants. There was also a significant interaction effect of condition × sex for CPP (p<.012). Post hoc comparisons revealed that male participants had significantly higher CPP values (padj=.003, d=1.39) in the masked condition (M=8.13 dB) compared to the unmasked condition (M=6.76 dB), with no other significant pairwise comparisons between males and females by condition.

There was no impact of the main effect of mask type on any acoustic measure. However, there was a significant interaction effect of condition × mask type for L/H ratio (p<.001) and L/H ratio SD (p=.009). L/H ratio was significantly greater in the masked condition for those who were wearing N95 masks, compared to the unmasked N95 condition (padj<.001, d=1.43), unmasked simple condition (padj=.003, d=2.02), and masked simple condition (padj=.032, d=1.38). L/H ratio SD was significantly greater in unmasked N95 wearers compared to all other condition/type combinations. Finally, there were no main or interaction effects for fo SD, jitter, shimmer, vocal intensity, or RFF onset 1.

Perceptual Measures

One participant did not complete self-perceptual ratings, resulting in an analysis of twenty datasets. There was a significant increase in self-reported vocal effort (p<.001, d=1.69) and dyspnea (p=.002, d=1.59) during masked communication. There were no impacts of mask type or participant sex as well as no interaction effects found.

DISCUSSION

We hypothesized that VAI would increase during masked speech as a compensatory strategy; however, the opposite effect was found: VAI significantly decreased during masked speech. We theorize that this effect occurred because masks reduce or restrict movement of the lips and jaw (and subsequently lingual excursions) during masked speech. The significant reduction in articulatory range could lead to a “mumbling effect” and therefore impede speech intelligibility and comprehension during communicative exchanges.

We did not find a statistically significant increase in vocal intensity across masked/unmasked conditions. Participants maintained their vocal intensity in both conditions, increasing slightly from 83.0 dB SPL (unmasked) to 83.39 dB SPL (masked). It is important to note that vocal intensity was measured from a microphone placed outside of the mask. Previous work using an artificial speech signal through simple and N95 masks showed that masks cause an intensity attenuation of approximately 3–12 dB across various frequencies, depending on the type of mask used.15 Therefore, in order for the microphone signal to show consistent vocal intensity values, there must have been an increase in vocal intensity during masked speech. These findings provide evidence that speakers are, indeed, speaking louder in order to maintain their same vocal intensity target during masked speech. Speaking loudly for long periods of time has been associated with higher levels of vocal effort and fatigue,36,37 as well as increased risk for developing phonotraumatic lesions38 and voice disorders.39 As such, it is possible that healthcare professionals may be at increased risk for developing vocal issues due to their occupational vocal load in combination with face masks.

We found a significant increase in both CPP and HNR in the masked condition compared to unmasked. These findings are in accordance with suspected elevation in vocal intensity levels during masked speech, as previous literature has shown that CPP and HNR measures depend upon speaker intensity.40,41 Nguyen et al.16 also reported significant increases in HNR during masked speech, but the authors hypothesized that the change could be due to the mask’s attenuating effects on high frequency content—where aperiodic glottal noise often resides. The vocal parameters of jitter and shimmer (correlates to vocal perturbation and breathiness), did not change in comparisons between masked and unmasked conditions in our study, nor in other literature.17,20 Therefore, we suspect that the significant increase in CPP and HNR are likely due to an increase in vocal intensity and/or high frequency attenuation, and not reflective of an improvement in vocal quality during masked speech.

As expected, we found a significant attenuation of high-frequency information (>4 kHz), exhibited by an increase in L/H ratio values during masked speech. Interaction analyses showed that participants wearing N95s were driving the increase, with significantly greater L/H ratios (indicating high frequency attenuation) compared to both simple masks and unmasked conditions. Investigation into the comparisons yielded large effect size differences (Cohen’s d > 1.0), meaning that N95 masks may pose greater communication challenges than simple masks. The study by Nguyen et al.16 also found a significant attenuation in frequency information from 1–8 kHz with greater attenuation noted with N95 masks (5.2 dB), and surgical masks (2.0 dB), compared to unmasked speech. Besides these findings, there were few main effects of mask type, which may have been due to our relatively small subsets of participants for the comparison (N95: N=9; Simple: N=12). Subsequently, the impact of different face mask types (e.g., face-shields) should be investigated further in a larger cohort.

In line with our hypothesis that participants would exhibit acoustic effects consistent with vocal effort, we found a significant reduction in RFF offset cycle 10 during masked speech. RFF has shown promise as an acoustic indicator of laryngeal tension; lower values are reported in individuals with hyperfunctional voice disorders32,33 and adults with healthy voices who were simulating vocal effort and strain.23,34,42 Although our participants showed lower values during masked speech compared the unmasked, the effect size difference was small and is not easily predictive of the clinical risk of developing a voice disorder. Currently, there is no established clinical guideline for a meaningful change in RFF values, nor is there a cut-off criterion or critical range to indicate a higher risk for developing acute or long-term voice problems. Acoustical measures such as RFF should be interpreted in the context of additional clinical factors such as self-reported effort, peri-laryngeal tension, and fatigue.

In our study, participants reported significantly greater amounts of vocal effort—with a statistically large effect size—in the masked condition. Vocal effort values increased from 11.6 mm to 41.2 mm, which is consistent with levels of “moderate effort” (i.e., 43.8 mm).23 We suspect that mask wearers were trying to overcome the degradation of the speech signal via increases in vocal intensity and laryngeal tension, which they perceived as increased vocal effort. This suggests that mask-wearing healthcare professionals have an added risk factor for developing vocal problems beyond those presented by their occupational vocal demands. An investigation into how daily mask use impacts vocal symptoms and voice acoustics is warranted.

Finally, participants reported a significant increase in perceived dyspnea during masked speech. Although significant with a large effect size, this increase in dyspnea is less likely to be clinically meaningful. The modified Borg Scale for dyspnea provides descriptive anchors at each interval. Using these anchors to understand the degree of change perceived by the speaker, participants reported feeling “nothing at all” (score=0.18) while unmasked, to “very slight dyspnea” (score=1.45) while masked. With that in mind, this statistically significant increase is likely not directly impacting their speech or communication function.

Limitations and Future Directions

Our work is limited because we used a simple spectral cut-off frequency of 4 kHz. It may be beneficial to examine more specific frequency bands (e.g., 2–5 kHz, 5–8 kHz) to understand how frequency attenuation may impact speech formants. We showed evidence for a reduction in vowel articulation based on F1 and F2 formants, but additional information on consonant articulation and high frequency phonemic information (e.g., /f/ vs. /s/) is needed. Furthermore, we did not acquire information about the laryngeal source in our study. Electroglottography or neck-surface accelerometry could provide important information about vocal fold contact (timing, duration) during masked speech. A laryngeal microphone could also be used to measure amplitudes that are not affected by the attenuating characteristics of the mask.

Although this work provides information about the effects of wearing masks on vocal function, healthcare professionals experience numerous additional communication challenges, including face-shields, plexiglass barriers, and loud environments. Future work should aim to comprehensively evaluate speech and voice function in these “real-life” work environments, including the cumulative effects of occupational voice demands, speaking environment, and type of personal protective equipment that have yet to be examined. A longitudinal study would be beneficial to understand how these factors may increase the risk of developing voice problems over time. Finally, an investigation into compensatory techniques (e.g., clear speech) is needed to understand how training may help healthcare professionals improve comprehension and reduce vocal effort during masked communication.

CONCLUSION

Face masks are a barrier to communication that impact speech acoustics and result in speaker’s perception of increased vocal effort. Healthcare professionals may be at increased risk for developing vocal issues due to their occupational vocal load in combination with face masks. Further work is needed to understand how long-term mask use may increase the risk for developing voice problems and whether vocal health education may help to offset these effects.

ACKNOWLEDGMENTS

The project was supported by the National Center for Advancing Translational Sciences of the National Institutes of Health, under Award Number 2UL1TR001425-05A1. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. The authors would like to thank Teresa Hollenkamp, M.A., CCC-SLP for her assistance with participant recruitment.

Footnotes

COIs: The authors declare no financial or non-financial conflicts at this time.

Presentations: Portions of this work was presented in a poster at the Boston Speech Motor Control Symposium in June of 2021.

a

The application used to generate the pure tones was called “Frequency Generator” in Android or “Frequency” in iOS.

b

Three researchers (authors V.M., C.K., and T.P.) were trained to perform acoustic extraction on a sample of acoustic recordings captured with and without a mask. Following experimental data extraction, inter-rater reliability was calculated for one randomly selected participant, with researchers blinded to other values. An intraclass correlation coefficient (ICC; two-way, consistency) found moderate-to-excellent reliability26 with M=.93 (range=.68–.99) across all acoustic measures. Intra-rater reliability was completed approximately one month after the original acoustic extraction on a randomly selected participant for each researcher, blinded to previous extracted values. Pearson correlations between the original and new extraction values were M=.94 (r=.90–.97), M=.94 (r=.83–.99) and M=.91 (r=.77–.99) for each researcher, respectively.

REFERENCES

  • 1.Titze IR, Lemke J, Montequin D. Populations in the U.S. workforce who rely on voice as a primary tool of trade: a preliminary report. J Voice. 1997;11(3):254–259. [DOI] [PubMed] [Google Scholar]
  • 2.Heidel SE, Torgerson JK. Vocal problems among aerobic instructors and aerobic participants. J Commun Disord. 1993;26(3):179–191. doi: 10.1016/0021-9924(93)90007-W [DOI] [PubMed] [Google Scholar]
  • 3.Mori MC, Francis DO, Song PC. Identifying Occupations at Risk for Laryngeal Disorders Requiring Specialty Voice Care. Otolaryngol Neck Surg. 2017;157(4):670–675. doi: 10.1177/0194599817726528 [DOI] [PubMed] [Google Scholar]
  • 4.Munier C, Kinsella R. The prevalence and impact of voice problems in primary school teachers. Occup Med. 2008;58(1):74–76. doi: 10.1093/occmed/kqm104 [DOI] [PubMed] [Google Scholar]
  • 5.Roy N, Merrill RM, Thibeault S, Parsa RA, Gray SD, Smith EM. Prevalence of Voice Disorders in Teachers and the General Population. J Speech Lang Hear Res. 2004;47(2):281–293. [DOI] [PubMed] [Google Scholar]
  • 6.Roy N, Merrill RM, Gray SD, Smith EM. Voice Disorders in the General Population: Prevalence, Risk Factors, and Occupational Impact: The Laryngoscope. 2005;115(11):1988–1995. [DOI] [PubMed] [Google Scholar]
  • 7.Sapir S, Keidar A, Mathers—schmidt B. Vocal attrition in teachers: survey findings. Int J Lang Commun Disord. 1993;28(2):177–185. doi: 10.3109/13682829309041465 [DOI] [PubMed] [Google Scholar]
  • 8.Williams NR. Occupational groups at risk of voice disorders: a review of the literature. Occup Med. 2003;53(7):456–460. doi: 10.1093/occmed/kqg113 [DOI] [PubMed] [Google Scholar]
  • 9.Boltežar L, Šereg Bahar M. Voice Disorders in Occupations with Vocal Load in Slovenia. Slov J Public Health. 2014;53(4):304–310. doi: 10.2478/sjph-2014-0033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Bartoszko JJ, Farooqi MAM, Alhazzani W, Loeb M. Medical masks vs N95 respirators for preventing COVID‐19 in healthcare workers: A systematic review and meta‐analysis of randomized trials. Influenza Other Respir Viruses. 2020;14(4):365–373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Wong CKM, Yip BHK, Mercer S, et al. Effect of facemasks on empathy and relational continuity: a randomised controlled trial in primary care. BMC Fam Pract. 2013;14(1):200. doi: 10.1186/1471-2296-14-200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Mendel LL, Gardino JA, Atcherson SR. Speech Understanding Using Surgical Masks: A Problem in Health Care? J Am Acad Audiol. 2008;19(9):686–695. [DOI] [PubMed] [Google Scholar]
  • 13.Champion J, Holt R. Dental care for children and young people who have a hearing impairment. Br Dent J. 2000;189(3):155–159. [DOI] [PubMed] [Google Scholar]
  • 14.Radonovich LJ, Yanke R, Cheng J, Bender B. Diminished Speech Intelligibility Associated with Certain Types of Respirators Worn by Healthcare Workers. J Occup Environ Hyg. 2009;7(1):63–70. [DOI] [PubMed] [Google Scholar]
  • 15.Goldin A, Weinstein BE, Shiman N. How do medical masks degrade speech perception? 2020;27(5):8–9. [Google Scholar]
  • 16.Nguyen DD, McCabe P, Thomas D, et al. Acoustic voice characteristics with and without wearing a facemask. Sci Rep. 2021;11(1):5651. doi: 10.1038/s41598-021-85130-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Magee M, Lewis C, Noffs G, et al. Effects of face masks on acoustic analysis and speech perception: Implications for peri-pandemic protocols. J Acoust Soc Am. 2020;148(6):3562. doi: 10.1121/10.0002873 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Monson BB, Hunter EJ, Lotto AJ, Story BH. The perceptual significance of high-frequency energy in the human voice. Front Psychol. 2014;5:1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ribeiro VV, Dassie-Leite AP, Pereira EC, Santos ADN, Martins P, Irineu R de A. Effect of Wearing a Face Mask on Vocal Self-Perception during a Pandemic. J Voice. Published online October 2020:S0892199720303568. doi: 10.1016/j.jvoice.2020.09.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Fiorella ML, Cavallaro G, Di Nicola V, Quaranta N. Voice Differences When Wearing and Not Wearing a Surgical Mask. J Voice. Published online March 2021:S0892199721000709. doi: 10.1016/j.jvoice.2021.01.026 [DOI] [PubMed] [Google Scholar]
  • 21.Awan SN, Roy N, Dromey C. Estimating dysphonia severity in continuous speech: Application of a multi-parameter spectral/cepstral model. Clin Linguist Phon. 2009;23(11):825–841. [DOI] [PubMed] [Google Scholar]
  • 22.Borg G Psychophysical bases of perceived exertion. Med Sci Sports Exerc. 1982;14(5):377–281. [PubMed] [Google Scholar]
  • 23.McKenna VS, Stepp CE. The relationship between acoustical and perceptual measures of vocal effort. J Acoust Soc Am. 2018;144(3):1643–1658. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Sandage MJ, Patel R. Passive Upper Airway Thermoregulation and High-Speed Assessment for Conventional versus Menthol Cigarette: Implications for Laryngeal Physiology. J Voice. Published online September 2018:S0892199718301760. doi: 10.1016/j.jvoice.2018.07.022 [DOI] [PubMed] [Google Scholar]
  • 25.Boersma P. Praat, a system for doing phonetics by computer. Glot Int. 2001;5:341–345. [Google Scholar]
  • 26.Koo TK, Li MY. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J Chiropr Med. 2016;15(2):155–163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Sapir S, Ramig L, Spielman J, Fox C. Acoustic metrics of vowel articulation in Parkinsons’s disease: Vowel space area (VSA) vs. vowel articulation index (VAI). In: Models of Analysis of Vocal Emissions for Biomedical Applications: 7th International Workshop. Firenze University Press; 2011:173–175. [Google Scholar]
  • 28.Caverlé MWJ, Vogel AP. Stability, reliability, and sensitivity of acoustic measures of vowel space: A comparison of vowel space area, formant centralization ratio, and vowel articulation index. J Acoust Soc Am. 2020;148(3):1436–1444. doi: 10.1121/10.0001931 [DOI] [PubMed] [Google Scholar]
  • 29.Skodda S, Visser W, Schlegel U. Vowel articulation in Parkinson’s disease. J Voice Off J Voice Found. 2011;25(4):467–472. doi: 10.1016/j.jvoice.2010.01.009 [DOI] [PubMed] [Google Scholar]
  • 30.Lien Y-AS, Heller Murray ES, Calabrese CR, et al. Validation of an Algorithm for Semi-automated Estimation of Voice Relative Fundamental Frequency. Ann Otol Rhinol Laryngol. 2017;126(10):712–716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Stepp CE, Merchant GR, Heaton JT, Hillman RE. Effects of Voice Therapy on Relative Fundamental Frequency During Voicing Offset and Onset in Patients With Vocal Hyperfunction. J Speech Lang Hear Res. 2011;54(5):1260–1266. doi: 10.1044/1092-4388(2011/10-0274) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Heller Murray ES, Lien Y-AS, Van Stan JH, et al. Relative Fundamental Frequency Distinguishes Between Phonotraumatic and Non-Phonotraumatic Vocal Hyperfunction. J Speech Lang Hear Res. 2017;60(6):1507–1515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Stepp CE, Hillman RE, Heaton JT. The Impact of Vocal Hyperfunction on Relative Fundamental Frequency During Voicing Offset and Onset. J Speech Lang Hear Res. 2010;53(5):1220–1226. doi: 10.1044/1092-4388(2010/09-0234) [DOI] [PubMed] [Google Scholar]
  • 34.Lien Y-AS, Michener CM, Eadie TL, Stepp CE. Individual Monitoring of Vocal Effort With Relative Fundamental Frequency: Relationships With Aerodynamics and Listener Perception. J Speech Lang Hear Res. 2015;58(3):566–575. doi: 10.1044/2015_JSLHR-S-14-0194 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Eadie TL, Stepp CE. Acoustic Correlate of Vocal Effort in Spasmodic Dysphonia. Ann Otol Rhinol Laryngol. 2013;122(3):169–176. [DOI] [PubMed] [Google Scholar]
  • 36.Bottalico P Speech Adjustments for Room Acoustics and Their Effects on Vocal Effort. J Voice. 2017;31(3):392.e1–392.e12. doi: 10.1016/j.jvoice.2016.10.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Bottalico P, Graetzer S, Hunter EJ. Effects of speech style, room acoustics, and vocal fatigue on vocal effort. J Acoust Soc Am. 2016;139(5):2870–2879. doi: 10.1121/1.4950812 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Bastian RW, Thomas JP. Do Talkativeness and Vocal Loudness Correlate With Laryngeal Pathology? A Study of the Vocal Overdoer/Underdoer Continuum. J Voice. 2016;30(5):557–562. doi: 10.1016/j.jvoice.2015.06.012 [DOI] [PubMed] [Google Scholar]
  • 39.Byeon H The Risk Factors Related to Voice Disorder in Teachers: A Systematic Review and Meta-Analysis. Int J Environ Res Public Health. 2019;16(19):3675. doi: 10.3390/ijerph16193675 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Awan SN, Giovinco A, Owens J. Effects of Vocal Intensity and Vowel Type on Cepstral Analysis of Voice. J Voice. 2012;26(5):670.E15–670.E20. doi: 10.1016/j.jvoice.2011.12.001 [DOI] [PubMed] [Google Scholar]
  • 41.Brockmann-Bauser M, Bohlender JE, Mehta DD. Acoustic Pertubation Measures Improve with Increasing Vocal Intensity in Individuals With and Without Voice Disorders. J Voice. 2018;32(2):162–168. doi: 10.1016/j.jvoice.2017.04.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.McKenna VS, Heller Murray ES, Lien Y-AS, Stepp CE. The Relationship Between Relative Fundamental Frequency and a Kinematic Estimate of Laryngeal Stiffness in Healthy Adults. JSLHR. 2016;59(6):1283–1294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Cohen J Statistical Power Analysis for the Behavioral Sciences. Second. Lawrence Erlbaum Associates; 1988. [Google Scholar]

RESOURCES