Skip to main content
. Author manuscript; available in PMC: 2024 Sep 1.
Published in final edited form as: Artif Intell Med. 2023 Jul 17;143:102624. doi: 10.1016/j.artmed.2023.102624

Table 5. Features selected via JMIM for building the highest-performing machine learning algorithm (Model I), ranked by importance.

Component (Com)- 1: phonetic motor planning; 2: Semantic and syntactic levels of language organization; 3: Psycholinguistic cues

Linguistic and acoustic features Com Linguistic and acoustic features Com
linregc2 of voice probability 2 Part-of-Speech rate 3
Common verbs 4 linregerrQ of simple moving average of LSP Frequency 2
quartile 1 of MFCC 2 Functional words 3
Words cannot be found in Dictionary in LIWC 4 Textual Lexical Diversity 3
Article 4 Silence time for VERB within clauses 3
Std of LSP frequency 2 Content words 3
Quartile2-Quartile1 LSP frequency 2 Silence time for ADJ/ADV within clauses 2
Voiced Segments Per Second 2 Average of similarity score between clauses without stop word 3
Pause rate 2 Brunet’s Index 3
Total average silence duration in initial clauses 2 Indefinites articles 3
LinregerrQ of MFCC 2 Rate of negative adverbs 2
Words that are longer than six letters. 4 Root type-token ratio 3
Definite articles 2 Interquartile range of the 3rd MFCC coefficient 2
Content Density 3 Analytical thinking (summary variables in LIWC that measure cognitive language style) 4
Lexical frequency 3 Corrected type-token ratio 3
Std of MFCC 2 Linregc2 of simple moving average of LSP Frequency 2
Average Length of Unvoiced Segments 2 linregc1 of perceived loudness 4
Proportion of clauses with a similarity score of zero with stop word 3 Honor’s Statistic 3
Cognitive processes 4 pitch 4
Skewness of LSP frequency 2 MaxPos of Simple Moving Average (sma) of LSP 2
Std of local Shimmer 2 Normalized standard deviation of simple moving average of F2 4
Silence time for NOUNs within clauses 3 Normalized Std of simple moving average of the amplitude of F1 relative to F0 4
Mean of Local Jitter 2 Determiners 3
Standard deviation of similarity score between clauses with stop word 3 80th percentile of Frequency of 27.5Hz 2
Relative pronouns rate 3 Ratio of standardized mean amplitude of F3 and F0 4
Mean F0 Envelope 2 Std of Length of Unvoiced Segment 2
Total average silence duration per word within clauses 3 80th Percentile of Loudness 2
Pronouns 3 Reference Rate to Reality 3
Std of rising slope of loudness 2 Average of similarity score between clauses with stop word 3
std local Jitter 2 Std of harmonic noise ratio 4
Mean ratio energy spectral harmonic 2 Unique word count 3
Speech rate 2 Proportion of clauses with a similarity score of zero without stop word 3
Nouns 3 Hypergeometric Distribution Diversity 3
Quartile2-Quartile3 of F0 2 Word count 4
Long term average spectrum 2 Consecutive repeated clauses 3
Normalized standard deviation of the amplitude of F2 to F0 2 Type-token ratio 3
*

LSP are used to represent linear prediction coefficients (LPC) for transmission over a channel. LSPs have several properties such as smaller sensitivity to quantization noise that make them superior to direct quantization of LPCs.

**

LinregerrQ: The quadratic error computed as the difference of linear approximation and the actual contour.

***

Linregc2: The offset (t) of a linear approximation of the contour.