Refined multiscale fuzzy entropy based on standard deviation for biomedical signal analysis

Hamed Azami; Alberto Fernández; Javier Escudero

doi:10.1007/s11517-017-1647-5

. 2017 May 2;55(11):2037–2052. doi: 10.1007/s11517-017-1647-5

Refined multiscale fuzzy entropy based on standard deviation for biomedical signal analysis

Hamed Azami ^1,^✉, Alberto Fernández ², Javier Escudero ¹

PMCID: PMC5644759 PMID: 28462498

Abstract

Multiscale entropy (MSE) has been a prevalent algorithm to quantify the complexity of biomedical time series. Recent developments in the field have tried to alleviate the problem of undefined MSE values for short signals. Moreover, there has been a recent interest in using other statistical moments than the mean, i.e., variance, in the coarse-graining step of the MSE. Building on these trends, here we introduce the so-called refined composite multiscale fuzzy entropy based on the standard deviation (RCMFE_σ) and mean (RCMFE_μ) to quantify the dynamical properties of spread and mean, respectively, over multiple time scales. We demonstrate the dependency of the RCMFE_σ and RCMFE_μ, in comparison with other multiscale approaches, on several straightforward signal processing concepts using a set of synthetic signals. The results evidenced that the RCMFE_σ and RCMFE_μ values are more stable and reliable than the classical multiscale entropy ones. We also inspect the ability of using the standard deviation as well as the mean in the coarse-graining process using magnetoencephalograms in Alzheimer’s disease and publicly available electroencephalograms recorded from focal and non-focal areas in epilepsy. Our results indicated that when the RCMFE_μ cannot distinguish different types of dynamics of a particular time series at some scale factors, the RCMFE_σ may do so, and vice versa. The results showed that RCMFE_σ-based features lead to higher classification accuracies in comparison with the RCMFE_μ-based ones. We also made freely available all the Matlab codes used in this study at 10.7488/ds/1477.

Keywords: Complexity, Multiscale entropy, Sample entropy, Fuzzy entropy, Biomedical signal, Statistical moments

Introduction

An important challenge in signal processing is to quantify the dynamical irregularity of time series [1]. To this end, there are a number of approaches, such as entropies and fractal dimensions. Entropy is an appealing and powerful tool that has been widely used in physiological signal analysis [1, 2]. One of the most popular entropy-based approaches is sample entropy (SampEn), which is relatively robust to noise [2]. Another widely used entropy method is fuzzy entropy (FuzEn) [3]. These two entropy approaches have attracted a great deal of attention recently [4–7]. Although SampEn is slightly faster than FuzEn, the latter is more consistent and less dependent on the data length [3, 7].

The traditional methods to quantifying the complexity of biomedical recordings may fail to account for the multiple time scales inherent in such time series and may yield contradictory and misleading results. For instance, even though the SampEn of white Gaussian noise (WGN) time series is higher than that of 1/f noise, showing that WGN is more irregular than 1/f noise, the latter has more complex structures than WGN due to the presence of long-range correlations [8, 9]. To address this problem, Costa et al. introduced the multiscale (sample) entropy (MSE), which is based on assessing the entropy of signals at multiple time scales [8]. In the MSE method, the original signal is first divided into non-overlapping segments of length τ, termed the scale factor. Next, the mean of each segment is estimated to derive the coarse-grained signals. Finally, the entropy measure, using SampEn, is calculated for each coarse-grained sequence [8].

The complexity evaluation of time series with MSE is rooted in the concept that complexity is associated with “meaningful structural richness,” which may be in contrast with regularity measures defined from classical entropy algorithms [8, 10]. This is because the output of entropy-based metrics grows monotonically with the degree of randomness of the analyzed time series. Therefore, these measures assign the highest entropy values to uncorrelated random signals like white noise, which are highly unpredictable but not structurally “complex,” and, at a global level, permit a very simple description. Thus, when applied to biomedical signals, traditional entropy-based methods may lead to misleading outputs. For instance, they assign high entropy values to certain pathologic cardiac rhythms that generate erratic outputs whereas healthy cardiac rhythms that are exquisitely regulated by multiple interacting control mechanisms are given low values of entropy. In this context, the complexity of biomedical signals reflects their ability to adapt and function in an ever-changing environment because physiological signals require to operate across multiple temporal and spatial scales. Thus, substantial attention has been concentrated on defining a quantitative measurement of complexity, i.e., MSE, that vanishes for both deterministic/predictable and uncorrelated random/unpredictable time series [8, 9]. Extensive analyses have shown that abnormal and disease states, which decrease the adaptive capacity of the subject, appear to degrade the multiscale entropy metrics [8, 9]. A recent review about multiscale entropy-based methods can be seen in [11].

Costa and Goldberger have very recently introduced a new MSE approach using the variance, instead of the mean, in the coarse-graining process of MSE. This was named MSE_σ ² [12]. Note that, in order to discriminate MSE_σ ² and basic MSE, we will denote the latter as MSE_μ. MSE_σ ² revealed that the dynamics of the volatility (variance) of heartbeat signals obtained from healthy young subjects are highly complex [12].

Nonetheless, since the standard deviation (σ) has the same dimension as the signal and its mean values (MSE_μ), we propose to use σ in the coarse-graining process, as an alternative to MSE_μ and MSE_σ ². Furthermore, one of the most important problems of MSE_μ is that, when applied to short biological signals, the results may be undefined and inaccurate [13, 14]. To alleviate this problem, the refined composite MSE_μ (RCMSE_μ) has been recently introduced [13] using the average of the SampEn values of several coarse-grained signals in each scale factor. Although simulation results showed that the RCMSE_μ had better stability for all temporal scales than MSE_μ, the problem of undefined values for short signal still exists [13]. We build on these recent developments to combine their advantages, and propose the refined composite multiscale fuzzy entropy (RCMFE) based on μ and σ: RCMFE_μ and RCMFE_σ, respectively. We hypothesize that these measures will be more accurate, robust, and stable than previous entropy metrics. Furthermore, we exemplify the behavior of these measures for different kinds of classical signal concepts (e.g., frequency, non-linearity) to demonstrate the dependency of RCMFE_σ and RCMFE_μ on them. Finally, we illustrate their application to two clinical datasets: focal and non-focal electroencephalograms (EEGs) and resting-state magnetoencephalogram (MEG) activity in Alzheimer’s disease (AD).

Methods

Entropy approaches

Sample entropy

Assume we have a real-valued discrete time series of length N: y = {y ₁, y ₂, ... , y _N}. At each time t of y, a vector including the m-th subsequent values is constructed as $Y_{t}^{m} = \{y_{t} y_{t + 1} ... y_{t + m - 2} y_{t + m - 1}\}$ for t = 1,2,…,N−(m−1), where m, termed embedding dimension, determines how many samples are contained in each vector. Define the distance between such vectors as the maximum difference of their corresponding scalar components, $d [Y_{t_{1}}^{m}, Y_{t_{2}}^{m}] = max \{|Y_{t_{1} + k}^{m} - Y_{t_{2} + k}^{m}| : 0 \leq k \leq m ‐ 1 and t_{1} \neq t_{2}\}$ . A match happens when the distance $d [Y_{t_{1}}^{m}, Y_{t_{2}}^{m}]$ is smaller than a predefined tolerance r. The probability B ^m(r) shows the total number of m-dimensional matched vectors [2]. Similarly, B ^m + 1(r) is defined for embedding dimension of m + 1. Finally, the SampEn is defined as follows [2]:

SampEn (y, m, r) = - ln (B^{m + 1} (r) / B^{m} (r))

Fuzzy entropy (FuzEn)

In this case, for the time series y = {y ₁, y ₂, ... , y _N}, embedding dimension m, and tolerance r, $U_{t}^{m} = \{y_{t} y_{t + 1} ... y_{t + m - 1}\} - y 0_{t}$ is formed where $y 0_{t} = \sum_{j = 0}^{m - 1} \frac{y_{t + j}}{m}$ . The distance between each of $U_{t_{1}}^{m}$ and $U_{t_{2}}^{m}$ is defined as $d_{t_{1} t_{2}} = d [U_{t_{1}}^{m}, U_{t_{2}}^{m}] = max \{|U_{t_{1} + k}^{m} - U_{t_{2} + k}^{m}| : 0 \leq k \leq m - 1 and t_{1} \neq t_{2}\}$ . Given FuzEn power n and tolerance r, the similarity degree $d_{t_{1} t_{2}}$ is calculated through a fuzzy function $μ (d_{t_{1} t_{2}}, n, r)$ as $exp (- {(d_{t_{1} t_{2}})}^{n} / r) .$ The function ϕ ^m is then defined as

ϕ^{m} (y, n, r) = \frac{1}{N - m} \sum_{t_{1} = 1}^{N - m} \frac{1}{N - m - 1} \sum_{t_{2} = 1, t_{1} \neq t_{2}}^{N - m} exp (- {(d_{t_{1} t_{2}})}^{n} / r)

Finally, the FuzEn of the signal is defined as the negative natural logarithm of the ratio of ϕ ^m and ϕ ^m + 1 (computed following the same procedure for embedding dimension m + 1) [3]:

FuzEn (y, m, n, r) = - ln (\frac{ϕ^{m + 1}}{ϕ^{m}})

Coarse-graining for multiscale entropy

A “coarse-graining” process is applied to a time series {x ₁, x ₂, ... , x _b, ... , x _C} where C is the length of the signal. Each element of the coarse-grained time series for MSE_μ/MFE_μ is defined as

{{}^{μ}y}_{i}^{(τ)} = \frac{1}{τ} \sum_{b = (i - 1) τ + 1}^{iτ} x_{b} 1 \leq i \leq ⌊\frac{C}{τ}⌋ = N

where τ is the time scale factor [9]. This means that these coarse-grained sequences are computed as the average of consecutive samples. Costa et al. [12] also have recently proposed to use the variance, instead of the mean value, as follows:

\begin{array}{c} {{}^{σ^{2}}y}_{i}^{(τ)} = \frac{1}{τ} \sum_{b = (i - 1) τ + 1}^{iτ} {(x_{b} - {{}^{μ}y}_{i}^{(τ)})}^{2}, & 1 \leq i \leq ⌊\frac{C}{τ}⌋ = N \end{array}

The dimension of variance is not the same as the samples of the original signal, and the quadratic behavior of variance causes the differences between the data points and their corresponding average to become larger and smaller, respectively, for those differences which are larger and smaller than 1. To alleviate these shortcomings, we propose to use σ in the coarse-graining process as a measure of spread via

\begin{array}{c} {{}^{σ}y}_{i}^{(τ)} = \sqrt{\frac{1}{τ} \sum_{b = (i - 1) τ + 1}^{iτ} {(x_{b} - {{}^{μ}y}_{i}^{(τ)})}^{2}}, & 1 \leq i \leq ⌊\frac{C}{τ}⌋ = N \end{array}

Refined composite multiscale fuzzy entropy

The traditional application of the coarse-graining procedure in MSE_μ leads to two main shortcomings. First, the MSE_μ is not symmetric in its dependency on the samples of the original time series. For example, in scale 3, we could rationally expect the measure to behave the same for x ₃ and x ₄, in comparison with x ₂ and x ₃. However, at scale 3, x ₁, x ₂, and x ₃ are separated from x ₄, x ₅, and x ₆. This phenomenon is illustrated in [15]. The second shortcoming is the variability of the entropy results for high-scale factors. When the MSE_μ is computed, the number of samples of the resulting coarse-grained sequence is ⌊C/τ⌋ = N. When the scale factor τ is high, the number of time points in the coarse-grained sequence decreases. This may yield an unstable measure of entropy.

To alleviate these drawbacks, the improved multiscale permutation entropy and RCMSE_μ algorithm were proposed [13, 15]. Here, considering the advantages of FuzEn over SampEn, and RCMSE_μ over MSE_μ, we introduce RCMFE_σ and RCMFE_μ.

The RCMFE_σ is calculated in two main steps:

First, $z_{u}^{(τ)} = \{{y_{u, 1}}^{(τ)} {y_{u, 2}}^{(τ)} ...\}$ , 1 ≤ u ≤ τ are generated, where ${{}^{σ}y}_{u, j}^{(τ)} = \sqrt{\frac{1}{τ} \sum_{b = u + τ (j - 1)}^{u + τj - 1} {(x_{b} - {{}^{μ}y}_{u, j}^{(τ)})}^{2}}$ , where ${{}^{μ}y}_{u, j}^{(τ)} = \frac{\sum_{b = u + τ (j - 1)}^{u + τj - 1} x_{b}}{τ}$ . In the RCMFE_σ algorithm, for each scale factor τ, we have τ different time series $z_{u}^{(τ)} | (u = 1 ... τ)$ , while in the MSE/MFE methods, only $z_{1}^{(τ)}$ is considered [15].

For a defined scale factor τ and embedding dimension m, ϕ _τ , k ^m|(k = 1, ... , τ) and ϕ _τ , k ^m + 1|(k = 1, ... , τ) for each of $z_{k}^{(τ)} | (k = 1 ... τ)$ are separately calculated. Next, the average of values of ϕ _τ , k ^m and ϕ _τ , k ^m + 1 on 1 ≤ k ≤ τ are computed, respectively. Finally, the RCMFE_σ is computed as follows:

{RCMFE}_{σ} (x, τ, m, n, r) = - ln (\frac{{\bar{ϕ}}_{τ}^{m + 1}}{{\bar{ϕ}}_{τ}^{m}})

It should be mentioned that the difference between RCMFE_σ and RCMFE_μ is that the latter one uses ${{}^{μ}y}_{u, j}^{(τ)} = \frac{\sum_{b = u + τ (j - 1)}^{u + τj - 1} x_{b}}{τ}$ , whereas the first one uses Eq. 6 in their first step of algorithm. The embedding dimension m, FuzEn power n, and tolerance r for all of the approaches were respectively chosen as 2, 2, and 0.15 multiplied by the standard deviation of the original time series [2, 3, 16].

Evaluation signals

Noise and synthetic signals

In this subsection, the signals used to study the mentioned multiscale approaches and their interpretability in terms of classical signal processing concepts are described.

First, we consider the performance of the multiscale entropy metrics on WGN and 1/f noise. The number of sample points of each of the WGN and 1/f noise was 40,000. In addition, we consider other synthetic signals with a sampling frequency (f _s) of 150 Hz and a length of 100 s (15,000 sample points). The time plots of these synthetic signals, and their corresponding spectrograms, and two zooms (for each kind of signal) on their start and end, to show the changes in their characteristics, are illustrated in Fig. 1. All of them have been employed to inspect the Lempel-Ziv complexity measure, improved permutation entropy, or auto-mutual information function rate of decrease and have been described in [15, 17, 18], respectively, where additional details can be found.

RCMFE_σ and RCMFE_μ versus noise: The dependency between the abovementioned multiscale entropy-based methods and 1/f noise and WGN is considered in this paper. WGN has a constant power spectral density as WGN is a signal whose samples are randomly drawn from a Gaussian distribution and uncorrelated [19]. The power spectral density of a stochastic process appropriate to model evolutionary or developmental systems is characterized by equal energy per octave as 1/f noise [20].
RCMFE_σ and RCMFE_μ versus frequency: In order to clarify how the RCMFE_σ/RCMFE_μ changes when the frequency of sinusoidal signals varies, a constant amplitude chirp signal whose frequency is swept logarithmically from 0.1 to 30 Hz in 100 s is considered [15, 17]. RCMFE_σ and the other multiscale entropy methods were applied to this signal using a moving window of 2000 samples (13.33 s) with 90% overlap. Fig. 1a demonstrates the constant chirp signal.
RCMFE_σ and RCMFE_μ versus spectral content of colored noise: In order to find the relationship between the RCMFE_σ or RCMFE_μ and the spectral content of colored noise, an autoregressive (AR) process of order 1, AR(1), was generated varying the model parameter, ρ, linearly from +0.9 to −0.9. Its energy hence moved from low to high frequencies. In case of ρ = 0, the sequence corresponded to WGN, in the center of the synthetic time series. Fig. 1b shows the corresponding spectrogram, time plot, and zoom views.
RCMFE_σ and RCMFE_μ versus changes from randomness to orderliness: In order to consider how the RCMFE_σ and RCMFE_μ change when a stochastic sequence progressively turns into a periodic deterministic time series, we created a MIX process employed in [18, 21, 22]. This is defined as follows:

Fig. 1 — Spectrograms, time plots, and zoom views on the first and last time intervals of the synthetic signals used in this study. a Chirp signal with constant amplitude. b AR(1) process with variable parameter ρ. c MIX process evolving from randomness to periodic oscillations. d Logistic map signal. e Lorenz system with two different non-linear dynamics. *Red* corresponds to high power and *blue* corresponds to low power (color figure online)

MIX = (1 - z) x + zy

where z denotes a random variable which is equal to 1 with probability p and is equal to 0 with probability 1 − p, x depicts a periodic synthetic signal as $x_{k} = \sqrt{2} sin (2 πk / 12)$ , and y is a uniformly distributed variable on $[- \sqrt{3}, \sqrt{3}]$ [18, 21]. Thus, the lower p is selected, the more regular or periodic the time series is, while higher p leads to more irregular signal. In this sense, to show the evolution from randomness to orderliness, p is linearly changed from 0.01 to 0.99. This signal is depicted in Fig. 1c.

5.
RCMFE_σ and RCMFE_μ versus changes from periodicity to non-periodic non-linearity: In order to clarify the dependence of the multiscale entropies on these changes, the logistic map is employed. This analysis is dependent on the model parameter α [18, 21] as follows:

x_{k} = α x_{k - 1} (1 - x_{k - 1})

The synthetic signal x was created varying the parameter α linearly from 3.5 to 3.99. With α = 3.5, the signal oscillated among four values. For α between 3.5 and 3.57, the signal is periodic and the number of values doubles progressively. For 3.57 ≤ α ≤ 3.99, the time series is chaotic, although it has windows of periodic behavior (e.g., α ≈ 3.8, as seen in Fig. 1d) [23].

6.
RCMFE_σ and RCMFE_μ versus different non-linear regimes: In order to investigate the changes in the behavior of a non-linear system, the Lorenz attractor is used here as

\begin{array}{l} \dot{x} = λ (y - x) \\ \dot{y} = x (ρ - z) - y \\ \dot{z} = xy - βz \end{array}

where λ, β, and ρ denote the system parameters [23, 24]. The first segment of this time series has a length of 7500 sample points, and it was created with λ = 10, β = 8/3, and ρ = 28. Therefore, it has a chaotic behavior. The second segment, which has 7500 sample points, was generated with λ = 10, β = 8/3, and ρ = 99.96. It exhibits a torus knot [17, 23]. Both segments were created by the use of a fixed step-size first-order integration technique without pre-integration and with the step size set to 1/f _s. It should be noted that these two segments were normalized with standard deviation (SD) of 1, after these segments had been generated. The coordinate x, which is the signal analyzed in this article, appears in Fig. 1e.

Clinical datasets

The ability of the newly proposed RCMFE_μ and RCMFE_σ to distinguish different types of physiological activity was tested on the following clinical datasets: MEG resting state activity in AD and EEG signals of focal and non-focal origin in epilepsy.

The MEG signals were acquired utilizing a 148-channel whole-head magnetometer (Magnes 2500 WH, 4D Neuroimaging) located in a magnetically shielded room at the “Centro de Magnetoencefalografia Dr. Perez-Modrego,” Spain. Resting-state MEG activity was recorded from 36 patients with probable AD [25] (24 women; age = 74.06 ± 6.95 years, mean ± standard deviation; MMSE score = 18.06 ± 3.36) and 26 age-matched controls (17 women; age = 71.77 ± 6.38 years; MMSE score = 28.88 ± 1.18). The subjects laid on a hospital bed in a relaxed state with eyes closed. For each participant, 5 min of MEG resting-state activity was recorded at a sampling frequency (f _s) of 169.54 Hz. The signals were divided into segments of 10s (1695 samples per channel) and visually inspected using an automated thresholding procedure to discard segments significantly contaminated with artifacts [26]. The effect of cardiac artifact was reduced from the recordings using a constraint blind source separation procedure. Finally, a band-pass FIR filter with cutoffs at 1.5 and 40 Hz was applied to the data. For more information about the dataset, please refer to [27]. For each subject and each channel, we analyzed each epoch of 10s individually and the average of results is reported. Note that all control subjects and AD patients’ caregivers gave informed consent for participation in the study, which was approved by the local Ethics Committee [27].

The intracranial EEG signals were recorded from five patients suffering from pharmacoresistant focal-onset epilepsy leading to two main separate sets of signals. The first one was recorded from brain regions where the primarily ictal EEG recording changes were detected as judged by expert visual inspection (“focal signals”). The second set of signals was recorded from brain regions not involved at seizure onset (“non-focal signals”). Each set includes five patients. Each patient consists of 750 pair signals, and the length of each of them was 10,240 sample points or 20 s. The sampling frequency was 512 Hz. Each pair includes two EEG time series which are recorded from adjacent channels which here we consider the first time series. They also provided a subset of the recordings containing the first 50 signals for each set. We use this subset to evaluate the proposed methods. For more information about the dataset, please refer to [28]. Before computing the multiscale entropy approaches, all signals were digitally filtered employing an FIR band-pass filter with cutoff frequencies at 0.5 and 40 Hz. Note that retrospective EEG data analysis has been approved by the ethics committee of the Kanton of Bern. Moreover, all patients gave written informed consent that the obtained signals from long-term EEG might be utilized for research purposes [28].

Results

Noise signals

First, we consider WGN and 1/f noise as two widely used signals tested in multiscale entropy methods [8, 13]. The results for MSE_μ, MFE_μ, RCMSE_μ, RCMFE_μ, MSE_σ, MSE_σ ², MFE_σ, MFE_σ ², RCMSE_σ, RCMSE_σ ², RCMFE_σ, and RCMFE_σ ² are depicted in Fig. 2a–l, respectively. As it can be observed in Fig. 2, for WGN, the entropy values of all multiscale approaches, except MSE_σ ² and RCMSE_σ ², decrease monotonically with scale factor τ. However, for 1/f noise, the entropy values become approximately constant over larger-scale factors. These facts are in agreement with WGN which only has structure in the shortest temporal scale, whereas 1/f noise has structure across all scales [8, 13]. Note that each error bar of each scale factor τ depicts the SD of the results of 40 signals for each WGN or 1/f noise.

Comparing results obtained by MSE_μ (Fig. 2a) and MFE_μ (Fig. 2b) shows, as expected theoretically, that the MFE_μ leads to a smaller variability in the results. Statistical tests confirmed the smaller variability of the MFE_μ results (p value ≤0.05) as assessed with Levene’s test at τ = 60. In addition, the RCMSE_μ/RCMFE_μ profiles have smaller SDs than MSE_μ/MFE_μ.

Although the MSE_σ ² values for WGN are larger than 1/f noise for scale factors 1 to 60, according to Fig. 2f, it is predicted that this measure for WGN will become smaller than those of 1/f noise for large enough scale factors. For MSE_σ and for scale factors 1 to 37, the larger entropy values are assigned to WGN signal in comparison with 1/f noise, while for scale factors larger than 37, the SampEn values for 1/f noise are larger than those of WGN, in agreement with the fact that 1/f noise is considered more structurally complex across multiple scales [9, 29]. Comparing the results shows that crossing between WGN and 1/f noise does not happen at short levels of scale factor for the coarse-graining process based on variance and standard deviation, unlike the mean.

It should be added that the results obtained for parameter r, used in [12], are similar to our results with r = 0.15 multiplied by the SD of that time series, employed in [16].

In order to understand the importance of refined composite technique on the basic multiscale entropy methods, we employed the coefficient of variation (CV) defined as the SD divided by the mean [30]. The main purpose to employ such a measure is that the SDs of data may increase or decrease proportionally to the mean. Thus, the CV, as a standardization of the SD, permits comparison of variability estimates regardless of the magnitude of the variable [30]. We study the results for 1/f noise and WGN signals at scale factor 20. As can be seen in Table 1, the refined composite technique decreases the CV values of the basic multiscale approaches, leading to more stable results.

Table 1.

The CV values of the proposed and classical multiscale entropy-based analyses at scale factor 20 for 1/f noise and WGN

Signals	MSE_μ	MFE_μ	RCMSE_μ	RCMFE_μ
1/f	0.015	0.013	0.011	0.011
WGN	0.019	0.019	0.011	0.010
	MSE_σ	MFE_σ	RCMSE_σ	RCMFE_σ
1/f	0.023	0.023	0.017	0.016
WGN	0.022	0.020	0.020	0.015
	MSE_σ ²	MFE_σ ²	RCMSE_σ ²	RCMFE_σ ²
1/f	0.026	0.025	0.017	0.016
WGN	0.015	0.018	0.010	0.010

Open in a new tab

The computation times of the conventional and proposed multiscale sample and fuzzy entropy approaches with the maximum scale factor 60 for the WGN signals with the length of 40,000 sample points are demonstrated in Table 2. The simulations have been carried out using a PC with Intel® Xeon® CPU, E5420, 2.5 GHz, and 8-GB RAM by MATLAB R2010a. The results show that FuzEn-based methods are slower than SampEn-based ones and the refined composite technique increases the computation time significantly. The running times of the variance-based methods are similar to those of the standard deviation-based algorithms. Moreover, since the MSE_σ ², MSE_σ MFE_σ ², MFE_σ RCMSE_σ ², RCMSE_σ, RCMFE_σ ², and RCMFE_σ start from scale factor 2 and the computation cost of SampEn and FuzEn is O(N ²) [31], the running times of these kinds of algorithms are noticeably smaller than those of the algorithms based on coarse-graining with regard to the mean.

Table 2.

Computation time of the classical and proposed multiscale sample and fuzzy entropy methods

MSE_μ	MFE_μ	RCMSE_μ	RCMFE_μ
49.08 s	73.21 s	253.61 s	364.73 s
MSE_σ ²	MFE_σ ²	RCMSE_σ ²	RCMFE_σ ²
23.08	35.79 s	186.99 s	299.03 s
MSE_σ	MFE_σ	RCMSE_σ	RCMFE_σ
22.94 s	34.92 s	189.24 s	282.62 s

Open in a new tab

Sensitivity of multiscale methods to signal length

To evaluate the sensitivity of multiscale methods to the signal length, we consider WGN and 1/f noise signals as functions of sample points size C. Figures 3, 4, 5, and 6 respectively depict the MSE_μ, RCMSE_μ, MFE_μ, and RCMFE_μ values for the signal length 100, 300, 1000, 3000, 10,000, and 30,000 computed from 40 different realizations of WGN and 1/f noise. The results show that the greater the value of C, the more robust the multiscale entropy estimations, as seen from the error bars.

Fig. 3 — MSE_μ as a function of data length C, a C = 100, b C = 300, c C = 1000, d C = 3000, e C = 10,000, and f C = 30,000 computed from 40 different WGN and 1/f noise signals. The entropy values are undefined for noise signals with the length of 100 and 300 at all and large-scale factors, respectively. *Red* and *blue* demonstrate 1/f noise and WGN results, respectively (color figure online)

Fig. 4 — RCMSE_μ as a function of data length C, a C = 100, b C = 300, c C = 1000, d C = 3000, e C = 10,000, and f C = 30,000 computed from 40 different WGN and 1/f noise signals. The entropy values are undefined for noise signals with the length of 100 and 300 at all and large-scale factors, respectively. *Red* and *blue* demonstrate 1/f noise and WGN results, respectively (color figure online)

Fig. 5 — MFE_μ as a function of data length C, a C = 100, b C = 300, c C = 1000, d C = 3000, e C = 10,000, and f C = 30,000 computed from 40 different WGN and 1/f noise signals. *Red* and *blue* demonstrate 1/f noise and WGN results, respectively (color figure online)

Fig. 6 — RCMFE_μ as a function of data length C, a C = 100, b C = 300, c C = 1000, d C = 3000, e C = 10,000, and f C = 30,000 computed from 40 different WGN and 1/f noise signals. *Red* and *blue* demonstrate 1/f noise and WGN results, respectively (color figure online)

It has been suggested that the number of sample points is at least 10^m, or preferably at least 30^m, to robustly estimate approximate entropy or SampEn in time series [32]. Because the coarse-graining step reduces the times series length by the scale factor τ, and here we have τ _max = 10 and m = 2, the original signal should have at least 1000 samples. As mentioned before, in SampEn, the number of instances where $d [Y_{t_{1}}^{m}, Y_{t_{2}}^{m}]$ is smaller than a predefined tolerance r is counted. If the length of a time series is too small, this number may be 0, leading to an undefined entropy measure. According to this fact, the results obtained by MSE_μ for C = 100 and 300, respectively depicted in Fig. 3a, b, are undefined.

For RCMSE_μ at scale factor τ, although the length of the signal decreases τ times, we take into account τ time coarse-grained signals, instead of only one signal as in conventional multiscale entropy approaches [13]. Therefore, in refined composite-based algorithms, we have τ times more number of instances in comparison with their corresponding basic versions, leading to more reliable results, especially for short signals. This fact can be seen in Fig. 4 in comparison with Fig. 3. Although RCMSE_μ outperforms MSE_μ in terms of reliability for short signals, RCMSE_μ values for C = 100 and C = 300 (Fig. 4a, b) are still undefined at some scale factors.

However, the FuzEn-based algorithms do not count matches, yet consider all possible range of distances between any two composite vectors. Therefore, MFE_μ and RCMFE_μ avoid resulting in undefined entropy values in such situations. The results obtained by the RCMFE_μ (Fig. 6) have considerably smaller SD values, especially for short signals, than those obtained by MFE_μ (Fig. 5).

Synthetic signals

To understand the effect of frequency on multiscale entropy-based methods, we employed a sliding window moving along each of the abovementioned synthetic signals. Then, for each scale factor, the multiscale entropy-based method of that part of the signal was computed. Because the length of the window is 2000 sample points, we consider the scale factor from 1 to 15, to ensure the length of the coarse-grained signals is enough for m = 2 [33].

For chirp signal with constant amplitude, the RCMFE_σ, RCMFE_μ, MSE_σ ², and MSE_μ results are respectively shown in Fig. 7a–d. When the time window is occupied at the beginning of the signal, which has smaller frequency, the FuzEn and SampEn values are low across all τ. As expected theoretically, all the RCMFE_σ, RCMFE_μ, MSE_σ ², and MSE_μ values increase with higher frequencies, which happens in later temporal windows (TWs). It is worth noting that since the SD/variance, unlike the mean value, of one single number is 0, the entropy measure in the first scale factor is undefined. This fact can be seen in Fig. 7a, c in comparison with Fig. 7b, d.

In Fig. 7e–h, it can be observed generally, using an AR(1) process with variable parameter, that the entropy measures of RCMFE_σ, MFE_σ ², and MFE_σ, unlike RCMFE_μ, increase in higher TWs in every scale factor.

Figure 7i–l respectively shows the results obtained by RCMFE_σ, RCMFE_μ, MFE_σ, and MFE_μ using the abovementioned MIX process. The entropy measures of all of them decrease in higher TWs in every scale factor, showing the evolution from randomness to periodic oscillations.

Figure 7m–p illustrates the results obtained by RCMFE_σ, RCMFE_μ, MSE_σ, and MSE_μ, respectively, using the logistic map which the parameter α changes linearly from 3.5 to 3.99. The entropy measures, obtained by all of them, generally increase along the signal, at each scale factor, except for the downward spikes in the windows of periodic behavior. This fact is in agreement with Fig. 4.10 (page 87 in [23]). It is also supported by Fig. 1d which shows that the frequency of the signal for t = 70–75 s is lower than for its adjacent time samples. In case of increasing scale factor, the RCMFE_σ and MSE_σ results decrease, whereas the RCMFE_μ and MSE_μ results first increase respectively until τ = 2 and τ = 4 then decrease. It shows that mean- and standard deviation-based multiscale approaches, extracting different kinds of dynamical properties of, respectively, mean and spread over multiple time scales, lead to different kinds of features.

Using the Lorentz system, we find that RCMFE_σ, RCMFE_μ, MSE_μ, and RCMSE_μ respectively shown in Fig. 7q–t can distinguish two different non-linear dynamics.