Skip to main content
. 2020 Nov 14;20(22):6512. doi: 10.3390/s20226512

Table 6.

Mathematical Description of the selected features for classification of LS signal L[n].

Feature Mathematical Representation
Standard Deviation (SD) SD=i=1N(LiLMean)2n where L:L[n]
Peak to Peak (PP) PP=LmaxLmin Where Lmax and Lmin is the minimum and maximum value in the time domain
Log Energy (LE) LE=log[i=1N(|Li|)2]
Spectral Standard Deviation (SSD) σ=i=1N(Li|LMean|)2n where L:L[ω]
Spectral Skewness (SSkw) SSkew=2n[i=1n/21(|Li||LMean|)3σ3] where L:L[ω]
Spectral Kurtosis(SK) SK=2n[i=1n/21(|Li||LMean|)2σ4]3  where L:L[ω]
Spectral Flux (SF) SK=2ii=0i/21(|Ln,i||Ln, i1|)2
Spectral Roll Off (SRO) If mth DFT coefficient corresponds to the spectral roll-off of the kth frame, then i=1nLk(i)=Ci=1FLLk(i) C is the adapted percentage: 95% and  L: L[ω]
Spectral Decrease (SDec) SDec=i=1i/211i(|Li||L0|)i=11/21|Li| where L:L[ω]
Mel frequency cepstral coefficient (MFCC) In MFCC, (i) Frame blocking or windowing to get 50 to 60ms. (ii) Performing a discrete Fourier transform (iii) computing logarithm of the signal. (iv) Deforming the frequencies on a Mel scale, followed by applying the discrete cosine transform (DCT). Mel scale is calculated as follows:
Mel Scale=2595 log10(1+f700)
‘f’ refers to frequency ranges from 0 to fs.
Gammatone Frequency Cepstral Coefficient (GFCC) In GCC, (i) Firstly, the signal is passed through gammatone filter bank which consists of 64 Channels. (ii) Take the absolute value at each channel and reduce it to 100 Hz as a way of time windowing. (iii) Take cubic root on the time-frequency representation. (iv) Deforming the frequencies on an equivalent rectangular bandwidth (ERB) scale Apply DCT to derive cepstral features. ERB scale is calculated as follows.
ERB=Alog10 (1+hz(0.00437)) where
A = 1000loge(10)(24.7)(4.37) ‘hz’ refers to frequency ranges i.e. 0-fs.