Abstract
Purpose:
Neurodegenerative motor diseases (NMDs) have devastating effects on the lives of patients and their loved ones, in part due to the impact of neurologic abnormalities on speech, which significantly limits functional communication. Clinical speech researchers have thus spent decades investigating speech features in populations suffering from NMDs. Features of impaired articulatory function are of particular interest given their detrimental impact on intelligibility, their ability to encode a variety of distinct movement disorders, and their potential as diagnostic indicators of neurodegenerative diseases. The objectives of this scoping review were to identify (1) which components of articulation are the most represented in the acoustic literature on NMDs; (2) which acoustic articulatory features demonstrate the most potential for detecting speech motor dysfunction in NMDs; and (3) which articulatory components are the most impaired within each NMD.
Method:
This review examined literature published between 1976 and 2020. Studies were identified from six electronic databases using predefined key search terms. The first research objective was addressed using a frequency count of studies investigating each articulatory component, while the second and third objectives were addressed using meta-analyses.
Results:
Findings from 126 studies revealed a considerable emphasis on precision. Of the 24 features included in the meta-analyses, vowel dispersion/distance and stop gap duration exhibited the largest effects when comparing the NMD population to controls. The meta-analyses also revealed divergent patterns of articulatory performance across disease types, providing evidence of unique profiles of articulatory impairment.
Conclusions:
This review illustrates the current state of the literature on acoustic articulatory features in NMDs. By highlighting the areas of need within each articulatory component and disease group, this work provides a foundation on which clinical researchers, speech scientists, neurologists, and computer engineers can develop research questions that will both broaden and deepen the understanding of articulatory impairments in NMDs.
Keywords: speech acoustics, dysarthria, articulatory impairment, neurodegenerative disease, domain knowledge, Acoustics, Neurodegenerative diseases, Articulation
Introduction
Neurodegenerative motor diseases (NMDs) are a heterogeneous group of diseases characterized by progressive neuronal loss in the central and peripheral nervous systems, affecting motor function (Duffy, 2013). Dysarthria, which is a speech motor disorder resulting from loss of articulator muscle strength or control, is a common symptom of NMDs (Duffy, 2013), and the consequent communication disturbances significantly impact patient quality of life (Hartelius et al., 2008). The type and severity of dysarthria is, therefore, often examined in the assessment and differential diagnosis of neurologic disorders (Duffy, 2013; Kent et al., 2000). Furthermore, a growing number of studies suggest that speech impairments may be among the most informative motor abnormalities for detecting the onset of neurological dysfunction or tracking the rate of disease progression (Robin et al., 2020). Exploring the efficacy of candidate speech acoustic biomarkers has thus become an active area of research for the purposes of improving early detection, disease classification, and progress monitoring during clinical trials (Carmichael, 2014; Gunduz, 2019; Gutz et al., 2019; Orozco-Arroyave et al., 2016; Tsanas et al., 2012; Wang et al., 2016).
Aberrant acoustic speech features are loosely linked to their physiologic origins, which involve dysfunction of one or several of the four speech subsystems (i.e., articulatory, phonatory, respiratory, or resonatory). While phonatory, resonatory, respiratory, and prosodic deficits can significantly limit communicative capacity, impairments to articulatory function have a highly detrimental impact on speech intelligibility (De Bodt et al., 2002; Lee et al., 2014; Rong et al., 2015; Kent et al., 1989; Weismer et al., 2001; Turner et al., 1995). Articulatory abnormalities are also thought to encode unique motor symptoms (e.g., hypokinesia, hyperkinesia, ataxia) that may correspond with underlying neuropathologies of movement disorders (Guenther, 2016). For instance, damage to the cerebellum, which is often found in speakers with ataxic dysarthria, has been associated with articulatory discoordination (Ackermann, 2008). Damage to the basal ganglia in speakers with hypokinetic dysarthria, on the other hand, can affect the size and speed of articulatory movement (Hirose, 1986). These presumed linkages between neuropathology and speech motor behavior make articulatory features an appealing diagnostic target to further explore for neurologic disease. Indeed, prior work has already demonstrated the efficacy of articulatory movement features as diagnostic indicators of movement disorders such as bulbar amyotrophic lateral sclerosis (Green et al., 2013), apraxia of speech in neurodegenerative conditions (Takakura et al., 2019), and the nonfluent variant of primary progressive aphasia (Cordella et al., 2017).
Given their diagnostic potential, speech features are increasingly being used to train machine learning models for detecting the presence of NMDs (Robin et al., 2020). Data-driven feature extractors (e.g., OpenSmile; Eyben et al., 2010) are capable of extracting thousands of features—many of which are uninterpretable—from a single speech recording and, therefore, might be viewed as the preferred method for feature discovery. However, domain knowledge (i.e., expert knowledge of a specific discipline) and feature interpretability have been increasingly recognized for reducing model bias, refining diagnostic model performance, and furthering scientific understanding (Childs and Washburn, 2019; Gunning, 2019; Tu et al., 2017). Moreover, recent work has found automatically extracted speech features to have low reliability for dysarthria assessment (Stegmann et al., 2020). To that end, the rationale for the current review paper is in the context of identifying domain-specific features that provide inference ability. Knowledge regarding impaired articulatory function has the potential to advance diagnostic model development by (1) providing guidance regarding features that may be the most likely candidates for detecting speech abnormalities in NMDs (Anand et al., 1995; Berisha et al., 2013; Gupta et al., 2016) and (2) ensuring the inclusion of participants with characteristics that represent the full range of diversity in the target population. Taken together, there is a critical need to identify acoustic articulatory features to improve the diagnostic applications of artificial intelligence and scientific understanding of NMDs.
A proposed framework to guide the investigation of interpretable articulatory features
To guide the organization of articulatory features for this review, we grouped features into five components: coordination, consistency, speed, precision, and repetition rate (Rowe and Green, 2019; Rowe et al., 2020; Rowe et al., 2021). These groupings are intended to be comprehensive but are not exhaustive (i.e., they do not characterize all the variations of articulatory motor impairments) nor mutually exclusive (i.e., one articulatory feature could presumably represent multiple components). Our affiliation of a particular articulatory feature with a particular component was aligned with each author's stated intention in the reviewed source articles. For example, Tykalova and colleagues (2017) measured voice onset time as a measure of coordination and Lam and Tjaden (2016) measured vowel space area as a measure of articulatory precision (see Figure 1). The clinical validity of these components is supported by their widespread use in movement disorders clinics to characterize a wide diversity of movement disorders. As such, they have informed one of the most established speech motor impairment classification systems developed by Darley, Aronson, and Brown (DAB) (Darley et al.,1969a and 1969b)—a system based on the hypothesized relationships between atypical speech patterns and distinct neurological and/or pathophysiological deficits (Ackermann, 2008; Darley et al., 1969a and 1969b; Duffy, 2013; Kent et al., 2000; Guenther, 2016; Hirose, 1986). With the increasing amount of research on acoustic measures of speech function in dysarthria, the proposed five-component scheme for characterizing articulatory motor impairments can help guide the interpretation of new and existing features within the context of common clinical descriptors for movement disorders.
Figure 1.
Dysarthria subtypes for each neurodegenerative motor diseases in the current review, their perceptual articulatory characteristics based on a widely used taxonomy of speech motor disorders (ordered from most severe to least severe, with * indicating more severely impaired than other dysarthria subtypes), and the corresponding areas of impairment based on our framework of articulatory components.
In this scoping review, we aimed to summarize four decades of literature on acoustic features of articulatory performance with the goals of identifying (1) which components of articulation (i.e., coordination, consistency, speed, precision, or repetition rate) are the most represented in the speech acoustic literature on NMDs; (2) which acoustic articulatory features demonstrate the most potential for detecting speech motor dysfunction in NMDs; and (3) which articulatory components are the most impaired within each NMD. Based on recommendations made by Munn and colleagues (2018), a scoping review is an appropriate review method for this paper because we sought to identify the available evidence on acoustic features in NMDs, clarify key concepts underlying acoustic-based articulatory features in the literature by categorizing them into different components of articulation, and determine where the gaps are in the acoustic literature base in NMDs (Munn et al., 2018).
Methods
Using a five-stage framework, Arksey and O'Malley (2005) outline the process of a scoping review that offers transparency and permits the replication of the search strategy used in the review. This scoping review employs the five stages of the framework: A. Identifying the Research Questions; B. Identifying Relevant Studies; C. Study/Feature Selection; D. Charting the Data; and E. Collating, Summarizing, and Reporting the Results (Arksey and O’Malley, 2005). This review also followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) Extension for Scoping Reviews guidelines (Tricco et al., 2018).
Identifying the Research Questions
The primary focus of this review was to evaluate the range of acoustic articulatory features that has been used to investigate the most common NMDs. Our first research question focused on the articulatory components that are most represented in the acoustic literature, our second research question focused on the specific features that demonstrate the most potential for detecting speech motor dysfunction in NMDs, and our third research question focused on the articulatory components that are the most impaired within each NMD and, therefore, may be important for differential diagnosis.
Identifying Relevant Studies
Studies were identified by searching the following six electronic databases: MEDLINE, CINAHL, PsycINFO, PubMed, IEEE Xplore, and Google Scholar (to identify sources within gray literature). The most recent search was conducted on December 1, 2020. The search strategy was based on the methodology (i.e., acoustic analysis), speech disorder (i.e., dysarthria), and population (i.e., neurodegenerative motor disease) of interest for this review. Thus, the main search terms and corresponding operational definitions included: acoustics / speech acoustics, defined as any analysis of the speech signal waveform recorded with a microphone, including amplitude, duration, or spectral properties of the waveform; dysarthria / dysarthric speech, defined as speech difficulties due to weakened muscles; articulation, defined as features that represent imprecise consonants, distorted vowels, incoordination, irregular articulatory breakdowns, and reduced speed and range of articulatory movement; and neurodegenerative disease, defined as a heterogeneous group of disorders characterized by the progressive degeneration of the central or peripheral nervous systems related to movement generation.
When appropriate, the Medical Subject Heading (MeSH) terms were “exploded” (i.e., made broader than the scope of the review) to retrieve citations that carried more specific MeSH terms. Individual search terms were adapted for each database to accommodate differences in subject headings. Key search terms included: (‘acoustics’ OR ‘speech acoustics’) AND (‘dysarthria’ OR ‘dysarthric speech’) AND (‘neurodegenerative diseases’) AND (‘articulation’). Reliability of the abstract selection (indexed by percent agreement) was based on two independent raters on 50% of the abstracts identified in the primary search. Percent agreement was 92.6% for the subset of abstracts. Discrepancies were resolved by consensus.
Study/Feature Selection
Using the search terms described above, a total of 1230 articles were identified. Since the terms were exploded and included conditions other than neurodegenerative disease, such as stroke or TBI, stringent selection criteria were determined and applied a priori. Studies were excluded if they (1) had no abstract; (2) were not human studies; (3) were not focused on adults; (4) were not peer-reviewed; (5) were not in English; (6) did not use acoustic analyses (e.g., only used perceptual analyses); (7) did not investigate the articulatory subsystem (e.g., only examined phonatory features); (8) did not investigate NMDs (e.g., only investigated acquired disorders, non-progressive diseases, or diseases with initial symptoms that are not motor-based, such as dementia or Alzheimer’s disease); (9) were classified as a review, report, or book; or (10) were a methodology study (e.g., assessed the accuracy of a speech recognition algorithm). While studies were excluded if they were not in English, we included studies with speakers of languages other than English, as recent work has reported similar deficits across multiple languages in individuals with dysarthria (Rusz, Hlavnicka et al., 2021). Furthermore, including studies with languages other than English expands the generalizability of our findings and provides a wider representation of the current literature.
As mentioned previously, the results of our review were framed within the five proposed components of articulatory motor control (i.e., coordination, consistency, speed, precision, or repetition rate) (see Figure 2). Importantly, features were first identified as being from the articulatory subsystem (as opposed to the respiratory, resonatory, phonatory, or prosodic subsystem). Each feature was then assigned to a component based on the intended use by the author. If the specific terms in our classification scheme were not explicitly used in a source article, we relied on the presence of terms with similar meaning, such as “articulatory variability” for consistency, or “articulatory distinctiveness” for precision. Several notable distinctions were made regarding whether features were primarily characteristic of the articulatory motor subsystem or another subsystem: (1) variability across repeated utterances (articulatory consistency) versus variability in continuous speech (often involved in rhythm) (Grabe and Low, 2008); (2) formant duration of consonant-vowel or diphthong transitions (articulatory speed) versus vowel duration in continuous speech (often involved in stress) (Crystal and House, 1982); and (3) rate of repeated utterances (repetition rate) versus speaking/articulatory rate (can be confounded by pausing or rhythm, respectively) (Allen, 1975; Grosjean and Collins, 1979). Speech acoustics are, nevertheless, at the confluence of multiple neural and biomechanic systems and codependency among performance measures is inevitable. For example, spectral coarticulation was classified as a coordination feature because it represents the appropriate temporal alignment of articulatory movements, but deficits in spectral coarticulation also have implications for precision. However, the extent to which a speech measure represents the status of one or more speech subsystem or divergent impairments within a subsystem (e.g., breathy versus strained voice) is a critical question for advancing speech diagnostics as well as explanatory models of speech. For the purposes of this review, the proposed framework offers an organizational structure for the literature on articulatory impairments. Additional research is needed to better understand the validity of specific acoustic features and their value for identifying distinct movement disorders and the underlying neurological and/or pathophysiological deficits.
Figure 2.
Definitions of the five framework components of articulatory motor and examples of acoustic features that represent each component.
Based on our exclusion and inclusion criteria, 165 studies were identified as relevant to our research questions and were subsequently brought to full review. Since our first research question was focused on the frequency of articulatory features in the literature, articles were excluded if they used feature sets that did not allow for assessment of (1) the individual features of interest (e.g., “blackbox” classification studies) or (2) the population of interest (e.g., aggregate data that included individuals with multiple diagnoses, undiagnosed speakers, or speakers of an unknown diagnosis). Since our second and third questions were focused on the level of impairment of individual features within and across NMDs, additional studies/features were excluded if they (3) did not have a control group; (4) did not include sufficient statistics to calculate an effect size (e.g., no means and standard deviations); (5) only included features that had been investigated in only one population; or (6) only included features that had been investigated in one study. The process of article selection followed the PRISMA Extension for Scoping Reviews guidelines (Tricco et al., 2018) (see Figure 3). Reliability of article selection (indexed by percent agreement) was based on two independent raters on 50% of the full articles. Percent agreement was 94.2% for the subset of articles. Discrepancies were resolved by consensus. Reliability of feature categorization into the five components (indexed by percent agreement) was based on two independent raters on 100% of the features included in the study. Percent agreement was 90.6%. Discrepancies were resolved by consensus.
Figure 3.
PRISMA chart illustrating the article search procedures and selection of included studies.
Charting the Data
For each study, a brief quality assessment was performed by the first author. The key areas for quality assessment included specifying demographics (i.e., age and sex), describing clinical history (i.e., disease duration and speech severity), addressing incomplete outcome data (e.g., providing reasons for study attrition), documenting comorbidities (e.g., cognitive impairments or aphasia), and conducting appropriate statistical analyses (e.g., omnibus testing and correcting for multiple comparisons).
Collating, Summarizing, and Reporting the Results
The fifth and final stage of the five-stage scoping review framework involves collating, summarizing, and reporting the results. Demographic and clinical characteristics (i.e., medical diagnosis and speech diagnosis of the participants, sample size, age, sex, disease duration, and speech severity) were extracted from all 126 studies included for RQ1 and in the subset of 89 studies included for RQs 2 and 3, which can be found in Table 1. The data extraction for RQ1 included (1) diagnosis and (2) acoustic features investigated in study. The additional data extraction for RQs 2 and 3 included (1) statistics necessary for effect size calculations (i.e., means and standard deviations, t values, F values, and number per group); (2) speech stimuli used to derive acoustic features; (3) brief methodology used to extract acoustic features; and (4) pathophysiological significance of acoustic features, when available. Effect sizes were calculated for all comparisons regardless of significance. All effect sizes were then combined regardless of directionality. For most comparisons, speakers in the disorder groups were more impaired than controls. There was, however, a subset of instances where the mean values for the disorder group indicated that they were less impaired than controls, but these effect sizes were very small and were not statistically significant.
Table 1.
Demographics and clinical characteristics of participants across studies for RQ1 and RQs 2 and 3. Data for RQs 2 and 3 are italicized, while a star indicates that data for RQ1 is same as data for RQs 2 and 3. All values are mean with one standard deviation in parentheses.
| ALS | AT | HD | MS | MSA | PD | PSP | |
|---|---|---|---|---|---|---|---|
| Dysarthria Type |
Flaccid- Spastic |
Ataxic | Hyperkinetic | Ataxic- Spastic |
Ataxic- Hypokinetic- Spastic |
Hypokinetic | Hypokinetic- Spastic |
| Global Prevalence | 320,000-650,000a | 430,000b | 200,000 c | 2,800,000d | 340,000e | 10,000,000f | 230,000-460,000g |
| Number | 12.94 (7.38) 14.43 (7.46) |
15.00 (11.78) 15.75 (10.70) |
20.5 (10.84)* | 36.94 (39.96) 44.18 (45.60) |
21.80 (10.59) 17.25 (3.40) |
19.36 (19.21) 19.45 (14.20) |
19.80 (9.39)* |
| Percent Female | 44.23% 39.73% |
54.59% 53.61% |
48.12%* | 59.61% 60.42% |
52.02% 53.15% |
35.76% 36.63% |
27.46%* |
| Years of Age | 56.13 (10.74) 56.37 (10.63) |
47.74 (12.80) 49.56 (12.00) |
48.31 (12.74)* | 47.21 (9.66) 45.17 (10.08) |
61.62 (6.26) 61.24 (6.03) |
65.23 (8.54) 65.25 (8.55) |
66.67 (5.79)* |
| Years Since Onset | 2.57 (2.70) 2.86 (2.90) |
12.61 (6.91) 13.86 (7.84) |
5.37 (3.06)* | 10.97 (6.86) 10.49 (6.98) |
3.95 (2.33) 3.99 (2.48) |
8.54 (4.92) 8.45 (4.57) |
3.56 (1.83)* |
| Speech Severity |
ALSFRS bulbar (0-12): 10.66 (1.71) 11.74 (1.44) Intell (% words correct): 83.16 (15.87) 84.44 (13.24) |
SARA speech (0-6): 1.96 (.93)* Intell (% words correct): 88.17 (9.74) 94.68 (4.43) |
UHDRS dysarthria (0-4):
.84 (.47)* |
Intell (% words correct): 91.83 (2.84) 91.00 (3.00) |
UPDRS III speech (0-4): 1.79 (.73) 1.80 (.70) |
UPDRS III speech (0-4): .99 (.57) .94 (.60) Intell (% words correct): 91.44 (4.62) 91.62 (5.53) |
UPDRS III speech (0-4): 1.97 (.78)* |
ALSFRS bulbar = Amyotrophic Lateral Sclerosis Functional Rating Scale bulbar subscore (0 = total loss of bulbar function to 12 = normal); SARA speech = Scale in the Assessment and Rating of Ataxia speech subscore (0 = normal to 6 = severe); UHDRS speech: Unified Huntington’s Disease Rating Scale dysarthria subscore (0 = normal to 4 = severe); UPDRS III speech = Unified Parkinson’s Disease Rating Scale speech subscore (0 = normal to 4 = unintelligible); intell = intelligibility.
Results
This scoping review investigated a total of 126 studies for RQ1. See Supplementary Material for a complete list of the studies investigated for the review. Performance on a total of 24 acoustic features from 89 articles (a subset of the 126 included for RQ1) was analyzed for the meta-analyses in RQs 2 and 3. Within the accepted studies, an initial total of nine NMDs were examined: ALS; Huntington’s disease (HD); cerebellar ataxia (CA); Friedreich’s ataxia (FA); spinocerebellar ataxia (SCA) variants 2/3/6/7/8; multiple sclerosis (MS); multiple system atrophy (MSA), both the unified syndrome as well as the parkinsonism variant; PD; and progressive supranuclear palsy (PSP). Because CA, FA, and the SCA variants were so infrequently studied, they were combined in this paper and are herein referred to as ataxia (AT). These seven disorders are known to be the most common NMDs worldwide (Erkkinen et al., 2018; Schaffert and Carter, 2020) and have been shown to result in different perceptual clusters of articulatory characteristics (Darley et al., 1969a and 1969b) (see Figure 1).
Consistent with the high prevalence of PD (Parkinson’s Foundation, 2020), speakers with PD were the most investigated, while speakers with MSA, which has one of the lowest prevalence rates (Schrag et al., 1999), were the least commonly investigated (see Table 1 and Figure 4). The findings for each research question are discussed below.
Figure 4.
Pie chart illustrating the number of studies investigating each neurodegenerative motor disease. Studies were counted more than once if they investigated more than one disease. The number in parentheses represents the percentage of the 126 studies that investigated that specific disease.
Research Question 1. Which articulatory components (i.e., coordination, consistency, speed, precision, or repetition rate) are the most represented in the acoustic literature on NMDs?
Based on the 126 articles included for RQ1, precision was the most represented articulatory component in the acoustic literature on NMDs (190 occurrences) and consistency was the least represented (37 occurrences), with coordination, repetition rate, and speed similarly underrepresented (41, 48, and 48 occurrences, respectively) (see Figure 5). Importantly, precision was also represented by the greatest number of unique features (46), while the other components consisted of significantly fewer features (6 for coordination, 4 for consistency, 6 for speed, and 2 for repetition rate) (see Tables 2a, 2b, and 3 in Supplementary Material). Additionally, the results demonstrated that the representation of articulatory components differed based on the NMD investigated. Indeed, precision was the most represented for the majority of NMDs (19 occurrences for ALS, 26 occurrences for MS, 4 occurrences for MSA, 125 occurrences for PD, and 5 occurrences for PSP) (see Figure 5). However, consistency and repetition rate were most frequently studied in individuals with AT (11 and 12 occurrences, respectively), and consistency and precision were most frequently studied, though only by 1 occurrence, in HD (3 occurrences each) (see Figure 5).
Figure 5.
Bar graph illustrating the articulatory components investigated across all studies in individuals with neurodegenerative motor diseases. The frequency of features investigated in each disease is displayed in the corresponding box.
Research Question 2. Which acoustic articulatory features demonstrate the most potential for detecting speech motor dysfunction in NMDs?
A total of 63 unique acoustic articulatory features were investigated in this study, but 39 of those features did not meet the inclusion criteria for the meta-analysis (see Table 2a in Supplementary Material). Thus, 24 unique features from 89 articles were included for RQs 2 and 3 (see Table 2b and Table 3 in Supplementary Material). Based on the features included in the meta-analysis, vowel dispersion/distance and stop gap duration exhibited a mean Cohen’s d effect size (across all disorder groups) greater than 1.5 (d = 1.95 and d = 1.80, respectively) (see Figure 6). In contrast, second formant duration and first spectral moment coefficient exhibited a mean effect size smaller than .6 (d = .26 and d = .43, respectively) (see Figure 6).
Figure 6.
Forest plot illustrating effect sizes (disease group compared to healthy controls) for each articulatory feature. The number of participants investigated in each disease is displayed above the corresponding dot. The total number of participants across all diseases is displayed in brackets to the right of the features. Positive effect sizes indicate that the disease group exhibited poorer performance compared to controls, while negative effect sizes indicate that the disease group performed better than controls.
Research Question 3. For each NMD, which articulatory components are the most impaired?
It should be noted that due to limited datapoints (e.g., only one study looked at speed features in speakers with MSA), we were unable to calculate confidence intervals for every effect size mean. However, based on inspection of the mean effect sizes and confidence intervals we were able to calculate, the findings revealed divergent patterns of articulatory performance within and across NMDs (see Figure 7). We considered the articulatory performance of each NMD population compared to both (1) the other components within the specific diagnosis group and (2) the articulatory performance of the other NMD populations. We found that speakers with ALS exhibited the greatest impairments in precision, speakers with AT exhibited the greatest impairments in repetition rate, speakers with HD exhibited the greatest impairments in consistency, and speakers with MS exhibited the greatest impairments in coordination. We also noted that speakers with MSA exhibited equal impairments in coordination, consistency, precision, and repetition rate; speakers with PD exhibited equal (and minimal) impairments across all components; and, compared to the other components within PSP, speakers with PSP exhibited primarily impaired consistency and precision.
Figure 7.
Forest plot illustrating divergent patterns of articulatory performance across neurodegenerative motor diseases. Each dot represents the mean effect size (disease group compared to healthy controls) for all acoustic features within each articulatory component.
Discussion
This study reviewed the current state of the literature on acoustic articulatory features in NMDs, with the overarching goals of determining (1) the articulatory components that are the most frequently investigated in NMDs, (2) the articulatory features that are the most effective in differentiating NMDs from healthy controls, and (3) the articulatory components that are most impaired in each NMD. Our findings reveal the depth and breadth of the present research base and lay the groundwork for identifying the domain knowledge necessary for improving diagnostic models for individuals with NMDs.
Comprehensively sampling across different articulatory components in NMDs (Research Question 1)
Identifying robust markers of abnormal articulatory function is a high research priority because of their (1) strong influence on speech intelligibility (De Bodt et al., 2002; Kent et al., 1989; Lee et al., 2014; Rong et al., 2015; Turner et al., 1995; Weismer et al., 2001), (2) hypothesized links to distinct neurological and/or pathophysiological deficits (Ackermann, 2008; Guenther, 2016; Hirose, 1986), and (3) previously demonstrated diagnostic efficacy for a number of movement disorders (Cordella et al., 2017; Green et al., 2013; Takakura et al., 2019). These markers thus have the potential to improve diagnostic models for speakers with NMDs. Our review, however, revealed expansive gaps in the literature on all of the articulatory components except precision.
The focus on precision in the acoustic literature is not unfounded, as deficits in precision have been documented in nearly every dysarthria subtype (Darley et al.,1969a and 1969b). Furthermore, in contrast to the small subset of features representing other articulatory components, precision is represented by a plethora of features assessing the distinctiveness of speech sounds within three types of targets: vowels, consonants, and consonant-vowel transitions. Within each type of target, the same articulatory configuration can be acoustically measured in multiple ways. For example, for consonants, the inability to maintain appropriate vocal tract constriction can be indexed by length of frication, length of stop gap, or spectral moment differences. It is important to note that despite—or perhaps in light of—the abundance of precision features, many of them lack standardization, with the exception of the vowel space features. More research is, therefore, needed to test the reliability and validity of existing features. Particular focus should be given to consonant features, as previous work has suggested that consonants may have a disproportionate impact on intelligibility (Owren and Cardillo, 2006).
Compared to precision, the other four articulatory components are far less represented in the acoustic literature. The reasons for this underrepresentation may differ depending on the component. The paucity of investigation of coordination, for example, may arise from the many-to-one mapping of articulatory movements to acoustic output (e.g., uncoordinated lip movement during the production of /p/ will not necessarily alter the acoustic result) (Bernstein, 1967), which limits feature interpretation and impedes the generation of valid acoustic features. Feature development for coordination may also be hindered by inadequate conceptual and operational definitions across and even within disciplines such as computer science, speech science, neurology, and linguistics.
While there are also far fewer features representing consistency, speed, and repetition rate, the lack of studies examining these articulatory components was surprising given their relative ease of measurement (e.g., between-repetition variability in syllable length, second formant slope, and syllables per second, respectively) and their diagnostic utility for different neurologic diseases. Prior work has shown that repetition rate, in particular, can distinguish speakers across a variety of etiologies (e.g., ALS, AT, PD) from healthy controls with high sensitivity (Nishio and Nimi, 2006; Schalling et al., 2007; Tjaden and Watling, 2003; Ziegler and Wessel, 1996).
The disproportionate attention on precision is problematic because it precludes the development of full articulatory profiles or phenotypes that are helpful for distinguishing diseases. Developing phenotypes that elucidate the articulatory components that are impaired and those that are spared may be critical for improving diagnostic accuracy. For example, the dopamine deficiency that underlies hypokinetic dysarthria secondary to PD may result in decreased range of motion and rigidity due to poor control over muscle contractions (Hirose, 1986). However, while this “articulatory undershoot” (or reduced movement excursion) impacts speed and precision, speakers with PD are often able to maintain a normal repetition rate through truncated movements, resulting in a speed-rate dissociation (Blanchet and Snyder, 2009; Kim et al., 2009). Consistent with this proposed pathophysiology, recent work has shown that speakers with PD demonstrate deficits in speed and precision but preserved repetition rate (Rowe et al., 2020). Identifying such phenotypes for other NMDs requires broadening and deepening our exploration of articulatory components across all NMD populations.
Identifying potential markers of speech motor dysfunction in NMDs (Research Question 2)
Our findings revealed several promising features of articulatory dysfunction within the existing literature on acoustic articulatory features. When examining NMDs as a whole, the meta-analysis for RQ2 demonstrated large effect sizes for a subset of features (e.g., vowel dispersion/distance, stop gap duration, alternating motion rate, unvoiced stop duration) (illustrated by the black diamonds in Figure 6), indicating their potential to detect articulatory deficits in dysarthria in general. When examining NMDs individually, the meta-analysis demonstrated large effect sizes for a different subset of features (e.g., vowel space area for ALS, sequential motion rate for AT) (illustrated by the colored dots in Figure 6), which suggests that articulation may be differentially impaired across diseases.
Among the five components, precision exhibited the greatest range in effect size values across all acoustic features, in part because there were significantly more features representing precision compared to any other component. As previously mentioned, features of precision assess the distinctiveness of vowels, consonants, and consonant-vowel transitions. For vowels, reductions in vowel space (e.g., vowel articulation index or vowel space area) reflect the inability to achieve the full range of movement required for distinct vowel production. For consonants, reductions in duration (e.g., unvoiced fricative duration or unvoiced stop duration) or intensity (e.g., first spectral moment coefficient or spectral change range) of spectral energy reflect the inability to maintain sufficient articulatory closure and/or, in the case of spectral moments, inaccurate place and degree of constriction. For consonant-vowel transitions, reductions in formant “movement” (e.g., second formant extent or second formant interquartile range) reflect the inability to achieve appropriate vocal tract movement between sounds. The concept of “articulatory imprecision” is, therefore, employed broadly across different features and articulatory gestures. The overall findings suggest that features reflecting changes in vowel space and in duration of spectral energy are promising for detecting articulatory imprecision across disease types. Further research is, nevertheless, needed to validate acoustic features of precision using kinematic approaches and to determine whether our findings hold with a larger and more diverse sample.
A wide range of effect sizes was also noted across NMDs for nearly every feature, which implied that some features may be uniquely sensitive to specific disease types (see Figure 6). For example, our findings revealed that alternating motion rate is most noticeably impaired in speakers with AT, while vowel space area is particularly reduced in speakers with ALS. However, many features (i.e., voice onset time ratio, second formant standard deviation, spectral change range, resonant frequency attenuation, first spectral moment coefficient, first spectral moment difference, and formant centralization ratio) have only been studied in two populations. Furthermore, based on our literature search, there are several features that, to our knowledge, have never been studied in certain diseases, such as second formant extent in individuals with AT. Additional work is, therefore, needed to examine performance on these features across more disease types.
Aside from further investigation into existing features, more focus is needed on developing new acoustic features in components other than precision. For instance, while second formant slope has been shown to be highly sensitive to several NMD populations (Kent and Kim, 2003; Kim et al., 2009; Yunusova et al., 2012), other features representing speed that were not included in the meta-analysis, such as spectral change rate, may be more sensitive to speed in certain other populations. Dense sampling across multiple different features and disease types is integral to guiding engineers and speech scientists in selecting the features that best detect speech impairment in specific NMDs. Importantly, most findings are based on cross-sectional rather than longitudinal data. Thus, further work is needed to examine the stability of articulatory features over time, as such information is essential for determining the utility of speech as a diagnostic tool or outcome measure in a clinical trial.
Considering divergent articulatory patterns in the differential diagnosis of NMDs (Research Question 3)
The DAB model posits that each disease type has a different presumed neurological and/or pathophysiological basis (Darley et al.,1969a and 1969b), which hypothetically results in a unique articulatory phenotype. Evidence for such phenotypes emerged, to some degree, upon collapsing the meta-analysis results from RQ2 across features within each component for each NMD group (see Figure 7). While findings are preliminary since we were unable to adequately control for severity, our analyses revealed divergent patterns of articulatory performance based on disease type.
Many of the NMD populations exhibited articulatory characteristics consistent with those identified in the DAB model for their corresponding dysarthria subtype (see Figure 1 and Figure 7). Primarily impaired precision in ALS is consistent with increased muscle tone and weakness secondary to mixed spastic-flaccid dysarthria, which often result in labored and imprecise movements. Furthermore, primarily impaired consistency in HD corresponds with the excess movement secondary to hyperkinetic dysarthria, which has been related to irregular articulatory breakdowns (Darley et al., 1969a and 1969b). In addition, given the loss of muscle and control associated with ataxic dysarthria, a typical characteristic seen in this dysarthria subtype is incoordination (Darley et al., 1969a and 1969b). Indeed, speakers with MS, who typically exhibit a mixed spastic-ataxic dysarthria, demonstrated primarily impaired coordination. While it was surprising that speakers with AT did not also exhibit prominent impairments in coordination, perhaps they have coordinative deficits beyond abnormalities in voice onset time that contributed to their remarkably impaired repetition rate. For speakers with PSP, primary impairments were noted in consistency and precision, which may be related to the variable repetition rate and reduced movement extent, respectively, seen in mixed spastic-hypokinetic dysarthria (Darley et al., 1969a and 1969b). Similarly, as speakers with MSA tend to present with a mixed spastic-ataxic-hypokinetic dysarthria, impaired consistency and precision may reflect the spastic and hypokinetic components, while impaired coordination and repetition rate may reflect the ataxic component (Darley et al., 1969a and 1969b). Lastly, our findings of equal (and minimal) impairments across all components for speakers with PD were surprising, as we would have expected greater impairments in speed or precision (Darley et al., 1969a and 1969b). However, the PD group in our meta-analysis demonstrated a mild impairment on average, and prior work has found that in speakers with PD, phonatory features—which were not examined in this study—tend to be the leading deficit in the earlier stages of disease progression, with articulatory features becoming more impaired in the later stages (Ho et al., 1998).
Overall, these findings can only be viewed as very preliminary given the plethora of factors that were not controlled for (e.g., severity). Thus, further work is needed to investigate the presence and validity of divergent articulatory impairment phenotypes across NMD populations. Nonetheless, our findings tentatively suggest that different speech motor subtypes may present with distinct articulatory patterns, which underscores the necessity to explore articulation more broadly across and within NMD populations.
Limitations and Conclusions
Several limitations should be noted in this review. Namely, we did not control for sex, medication use, dysarthria severity, or task used in the study. While most of the studies included in the meta-analysis had age- and sex-matched control groups, we included studies with unequal sex ratios because a subset of the diseases examined do not occur equally across sexes. Indeed, many studies used sex ratios in their sample that represented the ratio of occurrence in the larger population. The greater number of males with ALS, PD, and PSP seen in Table 1 is, therefore, relatively consistent with the higher global incidence of ALS, PD, and PSP in males than in females (Manjaly et al., 2010; Miller and Cronin-Golomb, 2010; Steele et al., 2014).
Additionally, we included studies that did not control for medication use. Of the 86 studies that examined PD, the vast majority (54 of the studies) reported that the participants were taking Levodopa (with 13 studies reporting no medication and 19 studies not reporting medication use). However, recent work has demonstrated little effect of Levodopa on speech performance for speakers with PD, in both the early (Tykalova et al., 2022) and later stages of disease progression (Cavallieri et al., 2021; Fabbri et al., 2017; Rusz, Tykalova et al., 2021). Moreover, including participants regardless of medication time, type, or dosage provides a more representative sample of dysarthric characteristics in PD.
It is also important to note that certain parameters cannot be assessed in some speakers or can be invalidated due to the speaker’s speech severity (e.g., voice onset time cannot be assessed if the speaker has incomplete stop closures due to severe resonance problems). Consequently, a number of features may be underrepresented in speakers with conditions that tend to result in more resonatory dysfunction, for example. Additional research is critically needed to investigate the impacts of hypernasality and phonatory dysfunction on the acoustic signal and to identify more measures that are robust across disease severities.
In addition, we did not control for speech severity across all the disorder groups (i.e., by only accepting articles with speakers in a specific severity range). While the range of intelligibility (i.e., 83-94%) and speech severity illustrated in Table 1 indicates that the average severity level of all speakers was in the mild range, intelligibility and speech severity subscales were not recorded in every study or for all disorders. The findings from our meta-analyses must, therefore, be interpreted with the knowledge that different effect sizes for any two disorder groups (compared to controls) may be due to differences related to severity rather than disease. Finally, we included all tasks (e.g., word, sentence, passage, etc.) to extract the different acoustic features for the meta-analysis. Thus, we were not able to capture the task-specific manifestations of motor disease (e.g., impaired coordination may be more evident during the sequential motion rate task than during passage reading given the rapid sequencing required for this task).
Lastly, this work only considered features from the articulatory subsystem. Our focus on articulation was not intended to dismiss the diagnostic potential of other speech subsystems but rather to bring coherence to an inarguably important topic. Because our focus was on the articulatory subsystem, features representing consistency were derived from repetition tasks (as opposed to connected speech tasks, for which authors would often refer to the feature as a prosodic measure). Future work should, therefore, examine these features in the context of other tasks and review the landscape of resonatory, phonatory, respiratory, and prosodic features in individuals with NMDs.
Overall, our findings revealed a strong focus in the speech motor literature on acoustic features that represent precision and an underrepresentation of studies on features that represent consistency, coordination, repetition rate, and speed. In light of the need for research across all articulatory components to elucidate articulatory phenotypes, the restricted focus on precision is problematic. Furthermore, while the limited data in our meta-analysis precluded us from making specific recommendations regarding the most promising feature for each population, our results revealed phenotypic variability in articulatory impairments across speech motor subtypes. This finding motivates the need to employ more impairment-specific knowledge in algorithm development, which will significantly extend the impact such models have for individuals with NMDs. However, there remains a significant need to broaden and deepen our understanding of the articulatory phenotypes underlying NMDs. Further investigation across a more diverse representation of NMDs and in articulatory components that distinguish speech motor subtypes will be crucial to identifying the most effective features for detecting diagnosis-specific impairments.
Supplementary Material
Funding
This work was supported by the National Institute on Deafness and Other Communication Disorders (NIDCD), part of the National Institutes of Health (NIH) [grant numbers R01DC013547, R01DC017291, K24DC016312, F31DC019556].
Footnotes
Declaration of Interest
The authors report no conflicts of interest. The authors alone are responsible for the content and writing of the paper.
References
- Ackermann H (2008). Cerebellar contributions to speech production and speech perception: psycholinguistic and neurobiological perspectives. Trends in Neurosciences, 31, 265–272. [DOI] [PubMed] [Google Scholar]
- Allen GD (1975). Speech rhythm: Its relation to performance universals and articulatory timing. Journal of Phonetics, 3, 75–86. [Google Scholar]
- Anand SS, Bell DA, & Hughes JG (1995). The role of domain knowledge in data mining. Proceedings of the Conference on Information and Knowledge Management, 37–43. [Google Scholar]
- Arksey H & O'Malley L (2005). Scoping studies: Towards a methodological framework. International Journal of Social Research Methodology, 8(1), 19–32. [Google Scholar]
- Atlas of Multiple Sclerosis. Mapping multiple sclerosis around the world. Accessed May 2020 at https://www.msif.org/.
- Berisha V, Sandoval S, Utianski R, Liss J, & Spanias A (2013). Selecting disorder-specific features for speech pathology fingerprinting. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 7562–7566. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bernstein N (1967). The Coordination and Regulation of Movements. Pergamon Press. [Google Scholar]
- Blanchet PG & Snyder GJ (2009). Speech rate deficits in individuals with Parkinson's disease: A review of the literature. Journal of Medical Speech Language Pathology, 17(1), 1–7. [Google Scholar]
- Carmichael J (2014). Diagnosis of dysarthria subtype via spectral and waveform analysis. Computer Systems Science and Engineering, 1, 33–42. [Google Scholar]
- Cavallieri F, Budriesi C, Gessani A, Contardi S, Fioravanti V, Menozzi E, Pinto S, Moro E, Valzania F, & Antonelli F (2021). Dopaminergic treatment effects on dysarthric speech: Acoustic analysis in a cohort of patients with advanced Parkinson’s disease. Frontiers in Neurology, 11, 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Childs CM & Washburn NR (2019). Embedding domain knowledge for machine learning of complex material systems. MRS Communications, 9, 806–820. [Google Scholar]
- Cordella C, Dickerson BC, Quimby M, Yunusova Y, & Green JR (2017) Slowed articulation rate is a sensitive diagnostic marker for identifying non-fluent primary progressive aphasia. Aphasiology, 31(2), 241–260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Crystal TH & House AS (1982). Segmental durations in connected speech signals: Preliminary results. Journal of the Acoustical Society of America, 72(3), 705–16. [DOI] [PubMed] [Google Scholar]
- Darley FL, Aronson AE, & Brown JR (1969a). Cluster of deviant speech dimensions in the dysarthrias. Journal of Speech, Language, and Hearing Research, 12, 462–496. [DOI] [PubMed] [Google Scholar]
- Darley FL, Aronson AE, & Brown JR (1969b). Differential diagnostic patterns of dysarthria. Journal of Speech, Language, and Hearing Research, 12, 246–269. [DOI] [PubMed] [Google Scholar]
- De Bodt MS, Hernandez-Diaz Huici ME, & van De Heyning PH (2002). Intelligibility as a linear combination of dimensions in dysarthric speech. Journal of Communication Disorders, 35(3), 283–292. [DOI] [PubMed] [Google Scholar]
- Duffy JR (2013). Motor Speech Disorders: Substrates, Differential Diagnosis, and Management. Elsevier Health Sciences. [Google Scholar]
- Erkkinen MG, Kim M-O, & Geschwind MD (2018). Clinical neurology and epidemiology of the major neurodegenerative diseases. Cold Spring Harbor Perspectives in Biology, 10(4), 1–44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eyben F, Wollmer M, & Schuller BW (2010). Opensmile: The munich versatile and fast open-source audio feature extractor. Proceedings of the IEEE International Conference on Multimedia, 1459–1462. [Google Scholar]
- Fabbri M, Guimaraes I, Cardoso R, Coelho M, Correia Guedes L, Rosa MM, Godinho C, Abreu D, Goncalves N, Antonini A, & Ferreira JJ (2017). Speech and voice response to a Levodopa challenge in late-stage Parkinson’s disease. Frontiers in Neurology, 8, 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grabe E & Low EL (2008). Durational variability in speech and the rhythm class hypothesis. In Gussenhoven C & Warner N (Ed.), Laboratory phonology (pp. 515–546). De Gruyter Mouton. [Google Scholar]
- Green JR, Yunusova Y, Kuruvilla MS, Wang W, Pattee GL, Synhorst L, Zinman L, & Berry JD (2013) Bulbar and speech motor assessment in ALS: Challenges and future directions, Amyotrophic Lateral Sclerosis and Frontotemporal Degeneration, 14(7-8), 494–500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grosjean F & Collins M (1979). Breathing, pausing and reading. Phonetica, 36, 98–114. [DOI] [PubMed] [Google Scholar]
- Guenther F (2016). Neural control of speech. The MIT Press: Cambridge, Massachusetts. [Google Scholar]
- Gunduz H (2019). Deep learning-based Parkinson’s disease classification using vocal feature sets. IEEE Access, 7, 115540–115551. [Google Scholar]
- Gunning D (2017). Explainable artificial intelligence (XAI). Defense Advanced Research Projects Agency (DARPA). [Google Scholar]
- Gupta R, Chaspari T, Kim J, Kumar N, Bone D, & Narayanan S (2016). Pathological speech processing: State-of-the-art, current challenges, and future directions. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 6470–6474. [Google Scholar]
- Gutz SE, Wang J, Yunusova Y, & Green J (2019). Early identification of speech changes due to amyotrophic lateral sclerosis using machine classification. Interspeech, 604–608. [Google Scholar]
- Hartelius L, Elmberg M, Holm R, Lovberg AS, & Nikolaidis S (2008). Living with dysarthria: Evaluation of a self-report questionnaire. Folia Phoniatrica et Logopaedica, 60(1), 11–19. [DOI] [PubMed] [Google Scholar]
- Hirose H (1986). Pathophysiology of motor speech disorders (dysarthria). Folia Phoniatrica et Logopaedica, 38, 61–88. [DOI] [PubMed] [Google Scholar]
- Ho AK, Iansek R, Marigliani C, Bradshaw JL, & Gates S (1998). Speech impairment in a large sample of patients with Parkinson's disease. Behavioural Neurology, 11, 131–137. [PubMed] [Google Scholar]
- Kent RD, Kent JF, Weismer G, & Duffy JR (2000). What dysarthrias can tell us about the neural control of speech. Journal of Phonetics, 28(3), 273–302. [Google Scholar]
- Kent RD, Kent JF, Weismer G, Martin RE, Sufit RL, Brooks BR, & Rosenbek JC (1989). Relationships between speech intelligibility and the slope of the second-formant transitions in dysarthric subjects. Clinical Linguistics and Phonetics, 3(4), 347–358. [Google Scholar]
- Kent RD & Kim Y (2003). Toward an acoustic typology of motor speech disorders. Clinical Linguistics and Phonetics, 16(6), 427–445. [DOI] [PubMed] [Google Scholar]
- Kim Y, Weismer G, Kent RD, & Duffy JR (2009). Statistical models of F2 slope in relation to severity of dysarthria. Folia Phoniatrica et Logopaedica, 51(6), 329–335. (Included in Review) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lam J & Tjaden K (2016). Clear speech variants: An acoustic study in Parkinson’s disease. Journal of Speech, Language, and Hearing Research, 59, 631–646. (Included in Review) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lee J, Hustad KC, & Weismer G (2014). Predicting speech intelligibility with a multiple speech subsystems approach in children with cerebral palsy. Journal of Speech, Language, and Hearing Research, 57(5), 1666–1678. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Longinetti E & Fang F (2019). Epidemiology of amyotrophic lateral sclerosis: An update of recent literature. Current Opinion in Neurology, 32(50), 771–776. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Manjaly ZR, Scott KM, Abhinav K, Wijesekera L, Ganesalingam J, Goldstein LH, Janssen A, Dougherty A, Willey E, Stanton BR, Turner MR, Ampong M-A, Sakel M, Orrell RW, Howard R, Shaw CE, Leigh PN, & Al-Chalabi A (2010). The sex ratio in amyotrophic lateral sclerosis: A population based study. Amyotrophic Lateral Sclerosis and Frontotemporal Degeneration, 11(5), 439–442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miller IN & Cronin-Golomb A (2010). Gender differences in Parkinson’s disease: Clinical characteristics and cognition. Movement Disorders, 25(16), 2695–2703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Munn Z, Peters MDJ, Stern C, Tufanaru C, McArthur A, & Aromataris E (2018). Systematic review or scoping review? Guidance for authors when choosing between a systematic or scoping review approach. BMC Medical Research Methodology, 18(143), 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- National Institute of Neurologic Disorders and Stroke. Progressive supranuclear palsy fact sheet. Accessed May 2020 at https://www.ninds.nih.gov/.
- Nishio M & Nimi S (2006). Comparison of speaking rate, articulation rate and alternating motion rate in dysarthric speakers. Folia Phoniatrica et Logopaedica, 58, 114–131. [DOI] [PubMed] [Google Scholar]
- Orozco-Arroyave JR, Honig F, Arias-Londono JD, Vargas-Bonilla JF, Daqrouq K, Skodda S, Rusz J, & Noth E (2016). Automatic detection of Parkinson’s disease in running speech spoken in three different languages. Journal of the Acoustical Society of America, 139(1), 481–500. [DOI] [PubMed] [Google Scholar]
- Owren MJ & Cardillo GC (2006). The relative roles of vowels and consonants in discriminating talker identity versus word meaning. Journal of the Acoustical Society of America, 119(3), 1727–1739. [DOI] [PubMed] [Google Scholar]
- Parkinson’s Foundation. Statistics. Accessed May 2020 at https://www.parkinson.org/.
- Pringsheim T, Wiltshire K, Day L, Dykeman J, Steeves T, & Jette N (2012). The incidence and prevalence of Huntington’s disease: A systematic review and meta-analysis. Movement Disorders, 27(9), 1083–1091. [DOI] [PubMed] [Google Scholar]
- Robin J, Harrison JE, Kaufman LD, Rudzicz F, Simpson W, & Yancheva M (2020). Evaluation of speech-based digital biomarkers: Review and recommendations. Digital Biomarkers, 4, 99–108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rong P, Yunusova Y, Wang J, & Green JR (2015). Predicting early bulbar decline in amyotrophic lateral sclerosis: A speech subsystem approach. Behavioural Neurology, 1, 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rowe HP & Green JR (2019). Profiling speech motor impairments in persons with amyotrophic lateral sclerosis: An acoustic-based approach. Interspeech, 4509–4513. (Included in Review) [Google Scholar]
- Rowe HP, Gutz SE, Maffei M, & Green JR (2020). Acoustic-based articulatory phenotypes of amyotrophic lateral sclerosis and Parkinson's disease: Towards an interpretable, hypothesis-driven framework of motor control. Interspeech, 4816–4820. (Included in Review) [Google Scholar]
- Rowe HP, Stipancic KL, Lammert AC, & Green JR (2021). Validation of an acoustic-based framework of speech motor control: Assessing criterion and construct validity using kinematic and perceptual measures. Journal of Speech, Language, and Hearing Research, 1–18. (Included in Review) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ruano L, Melo C, Silva MC, & Coutinho P (2014). The global epidemiology of hereditary ataxia and spastic paraplegia: A systematic review of prevalence studies. Neuroepidemiology, 42(3), 174–183. [DOI] [PubMed] [Google Scholar]
- Rusz J, Hlavnicka J, Novotny M, Tykalova T, Pelletier A, Montplaisir J, Gagnon J-F, Dusek P, Galbiati A, Marelli S, Timm PC, Teigen LN, Janzen A, Habibi M, Stefani A, Holzknecht E, Seppi K, Evangelista E, Rassu AL, … Sonka K (2021). Speech biomarkers in rapid eye movement sleep behavior disorder and Parkinson disease. Annals of Neurology, 90(1), 62–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rusz J, Tykalova T, Novotny M, Zogala D, Sonka K, Ruzicka E, & Dusek P (2021). Defining speech subtypes in de novo Parkinson disease: Response to long-term Levodopa therapy, 97(21), 2124–2135. [DOI] [PubMed] [Google Scholar]
- Schaffert L-N & Carter WG (2020). Do post-translational modifications influence protein aggregation in neurodegenerative diseases: A systematic review. Brain Sciences, 10(232), 1–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schalling E, Hammarberg B, & Hartelius L (2007). Perceptual and acoustic analysis of speech in individuals with spinocerebellar ataxia (SCA). Logopedics Phoniatrics Vocology, 32(1), 31–46. (Included in Review) [DOI] [PubMed] [Google Scholar]
- Schrag A, Ben-Shlomo Y, & Quinn NP (1999). Prevalence of progressive supranuclear palsy and multiple system atrophy: A cross-sectional study. The Lancet, 354(9192), 1771–1775. [DOI] [PubMed] [Google Scholar]
- Steele JC, Richardson JC, & Olszewski J (2014). Progressive supranuclear palsy: A heterogeneous degeneration involving the brain stem, basal ganglia and cerebellum with vertical gaze and pseudobulbar palsy, nuchal dystonia and dementia. Seminars of Neurology, 34(2), 129–150. [DOI] [PubMed] [Google Scholar]
- Stegmann GM, Hahna S, Liss J, Shefner J, Rutkov SB, Kawabata K, Bhandari S, Shelton K, Duncan CJ, & Berisha V (2020). Repeatability of commonly used speech and language features for clinical applications. Digital Biomarkers, 4(3), 109–122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takakura Y, Otsuki M, Sakai S, Tajima Y, Mito Y, Ogata A, Koshimizu S, Yoshino M, Uemori G, Takakura S, & Nakagawa Y (2019). Sub-classification of apraxia of speech in patients with cerebrovascular and neurodegenerative diseases. Brain and Cognition, 130(2019), 1–10. [DOI] [PubMed] [Google Scholar]
- Tjaden K & Watling E (2003). Characteristics of diadochokinesis in multiple sclerosis and Parkinson's disease. Folia Phoniatrica et Logopaedica, 55(5), 241–59. (Included in Review) [DOI] [PubMed] [Google Scholar]
- Tricco AC , Lillie E, Zarin W, O'Brien KK, Colquhoun H, Levac D, Moher D, Peters MDJ, Horsley T, Weeks L, Hempel S, Akl EA, Chang C, McGowan J, Stewart L, Hartling L, Aldcroft A, Wilson MG, Garritty C, … Straus SE (2018). PRISMA extension for scoping reviews (PRISMA-ScR): Checklist and explanation. Annals of Internal Medicine, 169(7), 467–473. [DOI] [PubMed] [Google Scholar]
- Tsanas A, Little MA, McSharry PE, Spielman J, & Ramig LO (2012). Novel speech signal processing algorithms for high-accuracy classification of Parkinson’s disease. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 59(5), 1264–1271. [DOI] [PubMed] [Google Scholar]
- Tu M, Berisha V, & Liss J (2017). Interpretable objective assessment of dysarthric speech based on deep neural networks. Interspeech, 1849–1853. [Google Scholar]
- Turner GS, Tjaden K, & Weismer G (1995). The influence of speaking rate on vowel space and speech intelligibility for individuals with amyotrophic lateral sclerosis. Journal of Speech, Language, and Hearing Research, 38, 1001–1013. (Included in Review) [DOI] [PubMed] [Google Scholar]
- Tykalova T, Novotny M, Ruzicka E, Dusek P, & Rusz J (2022). Short-term effect of dopaminergic medication on speech in early-stage Parkinson’s disease. NPJ Parkinson’s Disease, 8(22), 1–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang J, Kothalkar PV, Cao B, & Heitzman D (2016). Towards automatic detection of amyotrophic lateral sclerosis from speech acoustic and articulatory samples. Interspeech, 1195–1199. [Google Scholar]
- Weismer G, Jeng J-Y, Laures JS, Kent RD, & Kent JF (2001). Acoustic and intelligibility characteristics of sentence production in neurogenic speech disorders. Folia Phoniatrica et Logopaedica, 53(1), 1–18. (Included in Review) [DOI] [PubMed] [Google Scholar]
- Yunusova Y, Green JR, Greenwood L, Wang J, Pattee GL, & Zinman L (2012). Tongue movements and their acoustic consequences in amyotrophic lateral sclerosis. Folia Phoniatrica et Logopaedica, 64(2), 94–102. (Included in Review) [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ziegler W & Wessel K (1996). Speech timing in ataxic disorders: Sentence production and rapid repetitive articulation. Neurology, 47, 208–214. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.







