Abstract
Multiscale entropy (MSE) has been a prevalent algorithm to quantify the complexity of biomedical time series. Recent developments in the field have tried to alleviate the problem of undefined MSE values for short signals. Moreover, there has been a recent interest in using other statistical moments than the mean, i.e., variance, in the coarse-graining step of the MSE. Building on these trends, here we introduce the so-called refined composite multiscale fuzzy entropy based on the standard deviation (RCMFEσ) and mean (RCMFEμ) to quantify the dynamical properties of spread and mean, respectively, over multiple time scales. We demonstrate the dependency of the RCMFEσ and RCMFEμ, in comparison with other multiscale approaches, on several straightforward signal processing concepts using a set of synthetic signals. The results evidenced that the RCMFEσ and RCMFEμ values are more stable and reliable than the classical multiscale entropy ones. We also inspect the ability of using the standard deviation as well as the mean in the coarse-graining process using magnetoencephalograms in Alzheimer’s disease and publicly available electroencephalograms recorded from focal and non-focal areas in epilepsy. Our results indicated that when the RCMFEμ cannot distinguish different types of dynamics of a particular time series at some scale factors, the RCMFEσ may do so, and vice versa. The results showed that RCMFEσ-based features lead to higher classification accuracies in comparison with the RCMFEμ-based ones. We also made freely available all the Matlab codes used in this study at 10.7488/ds/1477.
Keywords: Complexity, Multiscale entropy, Sample entropy, Fuzzy entropy, Biomedical signal, Statistical moments
Introduction
An important challenge in signal processing is to quantify the dynamical irregularity of time series [1]. To this end, there are a number of approaches, such as entropies and fractal dimensions. Entropy is an appealing and powerful tool that has been widely used in physiological signal analysis [1, 2]. One of the most popular entropy-based approaches is sample entropy (SampEn), which is relatively robust to noise [2]. Another widely used entropy method is fuzzy entropy (FuzEn) [3]. These two entropy approaches have attracted a great deal of attention recently [4–7]. Although SampEn is slightly faster than FuzEn, the latter is more consistent and less dependent on the data length [3, 7].
The traditional methods to quantifying the complexity of biomedical recordings may fail to account for the multiple time scales inherent in such time series and may yield contradictory and misleading results. For instance, even though the SampEn of white Gaussian noise (WGN) time series is higher than that of 1/f noise, showing that WGN is more irregular than 1/f noise, the latter has more complex structures than WGN due to the presence of long-range correlations [8, 9]. To address this problem, Costa et al. introduced the multiscale (sample) entropy (MSE), which is based on assessing the entropy of signals at multiple time scales [8]. In the MSE method, the original signal is first divided into non-overlapping segments of length τ, termed the scale factor. Next, the mean of each segment is estimated to derive the coarse-grained signals. Finally, the entropy measure, using SampEn, is calculated for each coarse-grained sequence [8].
The complexity evaluation of time series with MSE is rooted in the concept that complexity is associated with “meaningful structural richness,” which may be in contrast with regularity measures defined from classical entropy algorithms [8, 10]. This is because the output of entropy-based metrics grows monotonically with the degree of randomness of the analyzed time series. Therefore, these measures assign the highest entropy values to uncorrelated random signals like white noise, which are highly unpredictable but not structurally “complex,” and, at a global level, permit a very simple description. Thus, when applied to biomedical signals, traditional entropy-based methods may lead to misleading outputs. For instance, they assign high entropy values to certain pathologic cardiac rhythms that generate erratic outputs whereas healthy cardiac rhythms that are exquisitely regulated by multiple interacting control mechanisms are given low values of entropy. In this context, the complexity of biomedical signals reflects their ability to adapt and function in an ever-changing environment because physiological signals require to operate across multiple temporal and spatial scales. Thus, substantial attention has been concentrated on defining a quantitative measurement of complexity, i.e., MSE, that vanishes for both deterministic/predictable and uncorrelated random/unpredictable time series [8, 9]. Extensive analyses have shown that abnormal and disease states, which decrease the adaptive capacity of the subject, appear to degrade the multiscale entropy metrics [8, 9]. A recent review about multiscale entropy-based methods can be seen in [11].
Costa and Goldberger have very recently introduced a new MSE approach using the variance, instead of the mean, in the coarse-graining process of MSE. This was named MSEσ 2 [12]. Note that, in order to discriminate MSEσ 2 and basic MSE, we will denote the latter as MSEμ. MSEσ 2 revealed that the dynamics of the volatility (variance) of heartbeat signals obtained from healthy young subjects are highly complex [12].
Nonetheless, since the standard deviation (σ) has the same dimension as the signal and its mean values (MSEμ), we propose to use σ in the coarse-graining process, as an alternative to MSEμ and MSEσ 2. Furthermore, one of the most important problems of MSEμ is that, when applied to short biological signals, the results may be undefined and inaccurate [13, 14]. To alleviate this problem, the refined composite MSEμ (RCMSEμ) has been recently introduced [13] using the average of the SampEn values of several coarse-grained signals in each scale factor. Although simulation results showed that the RCMSEμ had better stability for all temporal scales than MSEμ, the problem of undefined values for short signal still exists [13]. We build on these recent developments to combine their advantages, and propose the refined composite multiscale fuzzy entropy (RCMFE) based on μ and σ: RCMFEμ and RCMFEσ, respectively. We hypothesize that these measures will be more accurate, robust, and stable than previous entropy metrics. Furthermore, we exemplify the behavior of these measures for different kinds of classical signal concepts (e.g., frequency, non-linearity) to demonstrate the dependency of RCMFEσ and RCMFEμ on them. Finally, we illustrate their application to two clinical datasets: focal and non-focal electroencephalograms (EEGs) and resting-state magnetoencephalogram (MEG) activity in Alzheimer’s disease (AD).
Methods
Entropy approaches
Sample entropy
Assume we have a real-valued discrete time series of length N: y = {y 1, y 2, ... , y N}. At each time t of y, a vector including the m-th subsequent values is constructed as for t = 1,2,…,N−(m−1), where m, termed embedding dimension, determines how many samples are contained in each vector. Define the distance between such vectors as the maximum difference of their corresponding scalar components, . A match happens when the distance is smaller than a predefined tolerance r. The probability B m(r) shows the total number of m-dimensional matched vectors [2]. Similarly, B m + 1(r) is defined for embedding dimension of m + 1. Finally, the SampEn is defined as follows [2]:
1 |
Fuzzy entropy (FuzEn)
In this case, for the time series y = {y 1, y 2, ... , y N}, embedding dimension m, and tolerance r, is formed where . The distance between each of and is defined as . Given FuzEn power n and tolerance r, the similarity degree is calculated through a fuzzy function as The function ϕ m is then defined as
2 |
Finally, the FuzEn of the signal is defined as the negative natural logarithm of the ratio of ϕ m and ϕ m + 1 (computed following the same procedure for embedding dimension m + 1) [3]:
3 |
Coarse-graining for multiscale entropy
A “coarse-graining” process is applied to a time series {x 1, x 2, ... , x b, ... , x C} where C is the length of the signal. Each element of the coarse-grained time series for MSEμ/MFEμ is defined as
4 |
where τ is the time scale factor [9]. This means that these coarse-grained sequences are computed as the average of consecutive samples. Costa et al. [12] also have recently proposed to use the variance, instead of the mean value, as follows:
5 |
The dimension of variance is not the same as the samples of the original signal, and the quadratic behavior of variance causes the differences between the data points and their corresponding average to become larger and smaller, respectively, for those differences which are larger and smaller than 1. To alleviate these shortcomings, we propose to use σ in the coarse-graining process as a measure of spread via
6 |
Refined composite multiscale fuzzy entropy
The traditional application of the coarse-graining procedure in MSEμ leads to two main shortcomings. First, the MSEμ is not symmetric in its dependency on the samples of the original time series. For example, in scale 3, we could rationally expect the measure to behave the same for x 3 and x 4, in comparison with x 2 and x 3. However, at scale 3, x 1, x 2, and x 3 are separated from x 4, x 5, and x 6. This phenomenon is illustrated in [15]. The second shortcoming is the variability of the entropy results for high-scale factors. When the MSEμ is computed, the number of samples of the resulting coarse-grained sequence is ⌊C/τ⌋ = N. When the scale factor τ is high, the number of time points in the coarse-grained sequence decreases. This may yield an unstable measure of entropy.
To alleviate these drawbacks, the improved multiscale permutation entropy and RCMSEμ algorithm were proposed [13, 15]. Here, considering the advantages of FuzEn over SampEn, and RCMSEμ over MSEμ, we introduce RCMFEσ and RCMFEμ.
The RCMFEσ is calculated in two main steps:
First, , 1 ≤ u ≤ τ are generated, where , where . In the RCMFEσ algorithm, for each scale factor τ, we have τ different time series , while in the MSE/MFE methods, only is considered [15].
For a defined scale factor τ and embedding dimension m, ϕ τ , k m|(k = 1, ... , τ) and ϕ τ , k m + 1|(k = 1, ... , τ) for each of are separately calculated. Next, the average of values of ϕ τ , k m and ϕ τ , k m + 1 on 1 ≤ k ≤ τ are computed, respectively. Finally, the RCMFEσ is computed as follows:
7 |
It should be mentioned that the difference between RCMFEσ and RCMFEμ is that the latter one uses , whereas the first one uses Eq. 6 in their first step of algorithm. The embedding dimension m, FuzEn power n, and tolerance r for all of the approaches were respectively chosen as 2, 2, and 0.15 multiplied by the standard deviation of the original time series [2, 3, 16].
Evaluation signals
Noise and synthetic signals
In this subsection, the signals used to study the mentioned multiscale approaches and their interpretability in terms of classical signal processing concepts are described.
First, we consider the performance of the multiscale entropy metrics on WGN and 1/f noise. The number of sample points of each of the WGN and 1/f noise was 40,000. In addition, we consider other synthetic signals with a sampling frequency (f s) of 150 Hz and a length of 100 s (15,000 sample points). The time plots of these synthetic signals, and their corresponding spectrograms, and two zooms (for each kind of signal) on their start and end, to show the changes in their characteristics, are illustrated in Fig. 1. All of them have been employed to inspect the Lempel-Ziv complexity measure, improved permutation entropy, or auto-mutual information function rate of decrease and have been described in [15, 17, 18], respectively, where additional details can be found.
RCMFEσ and RCMFEμ versus noise: The dependency between the abovementioned multiscale entropy-based methods and 1/f noise and WGN is considered in this paper. WGN has a constant power spectral density as WGN is a signal whose samples are randomly drawn from a Gaussian distribution and uncorrelated [19]. The power spectral density of a stochastic process appropriate to model evolutionary or developmental systems is characterized by equal energy per octave as 1/f noise [20].
RCMFEσ and RCMFEμ versus frequency: In order to clarify how the RCMFEσ/RCMFEμ changes when the frequency of sinusoidal signals varies, a constant amplitude chirp signal whose frequency is swept logarithmically from 0.1 to 30 Hz in 100 s is considered [15, 17]. RCMFEσ and the other multiscale entropy methods were applied to this signal using a moving window of 2000 samples (13.33 s) with 90% overlap. Fig. 1a demonstrates the constant chirp signal.
RCMFEσ and RCMFEμ versus spectral content of colored noise: In order to find the relationship between the RCMFEσ or RCMFEμ and the spectral content of colored noise, an autoregressive (AR) process of order 1, AR(1), was generated varying the model parameter, ρ, linearly from +0.9 to −0.9. Its energy hence moved from low to high frequencies. In case of ρ = 0, the sequence corresponded to WGN, in the center of the synthetic time series. Fig. 1b shows the corresponding spectrogram, time plot, and zoom views.
RCMFEσ and RCMFEμ versus changes from randomness to orderliness: In order to consider how the RCMFEσ and RCMFEμ change when a stochastic sequence progressively turns into a periodic deterministic time series, we created a MIX process employed in [18, 21, 22]. This is defined as follows:
8 |
where z denotes a random variable which is equal to 1 with probability p and is equal to 0 with probability 1 − p, x depicts a periodic synthetic signal as , and y is a uniformly distributed variable on [18, 21]. Thus, the lower p is selected, the more regular or periodic the time series is, while higher p leads to more irregular signal. In this sense, to show the evolution from randomness to orderliness, p is linearly changed from 0.01 to 0.99. This signal is depicted in Fig. 1c.
-
5.
RCMFEσ and RCMFEμ versus changes from periodicity to non-periodic non-linearity: In order to clarify the dependence of the multiscale entropies on these changes, the logistic map is employed. This analysis is dependent on the model parameter α [18, 21] as follows:
9 |
The synthetic signal x was created varying the parameter α linearly from 3.5 to 3.99. With α = 3.5, the signal oscillated among four values. For α between 3.5 and 3.57, the signal is periodic and the number of values doubles progressively. For 3.57 ≤ α ≤ 3.99, the time series is chaotic, although it has windows of periodic behavior (e.g., α ≈ 3.8, as seen in Fig. 1d) [23].
-
6.
RCMFEσ and RCMFEμ versus different non-linear regimes: In order to investigate the changes in the behavior of a non-linear system, the Lorenz attractor is used here as
10 |
where λ, β, and ρ denote the system parameters [23, 24]. The first segment of this time series has a length of 7500 sample points, and it was created with λ = 10, β = 8/3, and ρ = 28. Therefore, it has a chaotic behavior. The second segment, which has 7500 sample points, was generated with λ = 10, β = 8/3, and ρ = 99.96. It exhibits a torus knot [17, 23]. Both segments were created by the use of a fixed step-size first-order integration technique without pre-integration and with the step size set to 1/f s. It should be noted that these two segments were normalized with standard deviation (SD) of 1, after these segments had been generated. The coordinate x, which is the signal analyzed in this article, appears in Fig. 1e.
Clinical datasets
The ability of the newly proposed RCMFEμ and RCMFEσ to distinguish different types of physiological activity was tested on the following clinical datasets: MEG resting state activity in AD and EEG signals of focal and non-focal origin in epilepsy.
The MEG signals were acquired utilizing a 148-channel whole-head magnetometer (Magnes 2500 WH, 4D Neuroimaging) located in a magnetically shielded room at the “Centro de Magnetoencefalografia Dr. Perez-Modrego,” Spain. Resting-state MEG activity was recorded from 36 patients with probable AD [25] (24 women; age = 74.06 ± 6.95 years, mean ± standard deviation; MMSE score = 18.06 ± 3.36) and 26 age-matched controls (17 women; age = 71.77 ± 6.38 years; MMSE score = 28.88 ± 1.18). The subjects laid on a hospital bed in a relaxed state with eyes closed. For each participant, 5 min of MEG resting-state activity was recorded at a sampling frequency (f s) of 169.54 Hz. The signals were divided into segments of 10s (1695 samples per channel) and visually inspected using an automated thresholding procedure to discard segments significantly contaminated with artifacts [26]. The effect of cardiac artifact was reduced from the recordings using a constraint blind source separation procedure. Finally, a band-pass FIR filter with cutoffs at 1.5 and 40 Hz was applied to the data. For more information about the dataset, please refer to [27]. For each subject and each channel, we analyzed each epoch of 10s individually and the average of results is reported. Note that all control subjects and AD patients’ caregivers gave informed consent for participation in the study, which was approved by the local Ethics Committee [27].
The intracranial EEG signals were recorded from five patients suffering from pharmacoresistant focal-onset epilepsy leading to two main separate sets of signals. The first one was recorded from brain regions where the primarily ictal EEG recording changes were detected as judged by expert visual inspection (“focal signals”). The second set of signals was recorded from brain regions not involved at seizure onset (“non-focal signals”). Each set includes five patients. Each patient consists of 750 pair signals, and the length of each of them was 10,240 sample points or 20 s. The sampling frequency was 512 Hz. Each pair includes two EEG time series which are recorded from adjacent channels which here we consider the first time series. They also provided a subset of the recordings containing the first 50 signals for each set. We use this subset to evaluate the proposed methods. For more information about the dataset, please refer to [28]. Before computing the multiscale entropy approaches, all signals were digitally filtered employing an FIR band-pass filter with cutoff frequencies at 0.5 and 40 Hz. Note that retrospective EEG data analysis has been approved by the ethics committee of the Kanton of Bern. Moreover, all patients gave written informed consent that the obtained signals from long-term EEG might be utilized for research purposes [28].
Results
Noise signals
First, we consider WGN and 1/f noise as two widely used signals tested in multiscale entropy methods [8, 13]. The results for MSEμ, MFEμ, RCMSEμ, RCMFEμ, MSEσ, MSEσ 2, MFEσ, MFEσ 2, RCMSEσ, RCMSEσ 2, RCMFEσ, and RCMFEσ 2 are depicted in Fig. 2a–l, respectively. As it can be observed in Fig. 2, for WGN, the entropy values of all multiscale approaches, except MSEσ 2 and RCMSEσ 2, decrease monotonically with scale factor τ. However, for 1/f noise, the entropy values become approximately constant over larger-scale factors. These facts are in agreement with WGN which only has structure in the shortest temporal scale, whereas 1/f noise has structure across all scales [8, 13]. Note that each error bar of each scale factor τ depicts the SD of the results of 40 signals for each WGN or 1/f noise.
Comparing results obtained by MSEμ (Fig. 2a) and MFEμ (Fig. 2b) shows, as expected theoretically, that the MFEμ leads to a smaller variability in the results. Statistical tests confirmed the smaller variability of the MFEμ results (p value ≤0.05) as assessed with Levene’s test at τ = 60. In addition, the RCMSEμ/RCMFEμ profiles have smaller SDs than MSEμ/MFEμ.
Although the MSEσ 2 values for WGN are larger than 1/f noise for scale factors 1 to 60, according to Fig. 2f, it is predicted that this measure for WGN will become smaller than those of 1/f noise for large enough scale factors. For MSEσ and for scale factors 1 to 37, the larger entropy values are assigned to WGN signal in comparison with 1/f noise, while for scale factors larger than 37, the SampEn values for 1/f noise are larger than those of WGN, in agreement with the fact that 1/f noise is considered more structurally complex across multiple scales [9, 29]. Comparing the results shows that crossing between WGN and 1/f noise does not happen at short levels of scale factor for the coarse-graining process based on variance and standard deviation, unlike the mean.
It should be added that the results obtained for parameter r, used in [12], are similar to our results with r = 0.15 multiplied by the SD of that time series, employed in [16].
In order to understand the importance of refined composite technique on the basic multiscale entropy methods, we employed the coefficient of variation (CV) defined as the SD divided by the mean [30]. The main purpose to employ such a measure is that the SDs of data may increase or decrease proportionally to the mean. Thus, the CV, as a standardization of the SD, permits comparison of variability estimates regardless of the magnitude of the variable [30]. We study the results for 1/f noise and WGN signals at scale factor 20. As can be seen in Table 1, the refined composite technique decreases the CV values of the basic multiscale approaches, leading to more stable results.
Table 1.
Signals | MSEμ | MFEμ | RCMSEμ | RCMFEμ |
---|---|---|---|---|
1/f | 0.015 | 0.013 | 0.011 | 0.011 |
WGN | 0.019 | 0.019 | 0.011 | 0.010 |
MSEσ | MFEσ | RCMSEσ | RCMFEσ | |
1/f | 0.023 | 0.023 | 0.017 | 0.016 |
WGN | 0.022 | 0.020 | 0.020 | 0.015 |
MSEσ 2 | MFEσ 2 | RCMSEσ 2 | RCMFEσ 2 | |
1/f | 0.026 | 0.025 | 0.017 | 0.016 |
WGN | 0.015 | 0.018 | 0.010 | 0.010 |
The computation times of the conventional and proposed multiscale sample and fuzzy entropy approaches with the maximum scale factor 60 for the WGN signals with the length of 40,000 sample points are demonstrated in Table 2. The simulations have been carried out using a PC with Intel® Xeon® CPU, E5420, 2.5 GHz, and 8-GB RAM by MATLAB R2010a. The results show that FuzEn-based methods are slower than SampEn-based ones and the refined composite technique increases the computation time significantly. The running times of the variance-based methods are similar to those of the standard deviation-based algorithms. Moreover, since the MSEσ 2, MSEσ MFEσ 2, MFEσ RCMSEσ 2, RCMSEσ, RCMFEσ 2, and RCMFEσ start from scale factor 2 and the computation cost of SampEn and FuzEn is O(N 2) [31], the running times of these kinds of algorithms are noticeably smaller than those of the algorithms based on coarse-graining with regard to the mean.
Table 2.
MSEμ | MFEμ | RCMSEμ | RCMFEμ |
49.08 s | 73.21 s | 253.61 s | 364.73 s |
MSEσ 2 | MFEσ 2 | RCMSEσ 2 | RCMFEσ 2 |
23.08 | 35.79 s | 186.99 s | 299.03 s |
MSEσ | MFEσ | RCMSEσ | RCMFEσ |
22.94 s | 34.92 s | 189.24 s | 282.62 s |
Sensitivity of multiscale methods to signal length
To evaluate the sensitivity of multiscale methods to the signal length, we consider WGN and 1/f noise signals as functions of sample points size C. Figures 3, 4, 5, and 6 respectively depict the MSEμ, RCMSEμ, MFEμ, and RCMFEμ values for the signal length 100, 300, 1000, 3000, 10,000, and 30,000 computed from 40 different realizations of WGN and 1/f noise. The results show that the greater the value of C, the more robust the multiscale entropy estimations, as seen from the error bars.
It has been suggested that the number of sample points is at least 10m, or preferably at least 30m, to robustly estimate approximate entropy or SampEn in time series [32]. Because the coarse-graining step reduces the times series length by the scale factor τ, and here we have τ max = 10 and m = 2, the original signal should have at least 1000 samples. As mentioned before, in SampEn, the number of instances where is smaller than a predefined tolerance r is counted. If the length of a time series is too small, this number may be 0, leading to an undefined entropy measure. According to this fact, the results obtained by MSEμ for C = 100 and 300, respectively depicted in Fig. 3a, b, are undefined.
For RCMSEμ at scale factor τ, although the length of the signal decreases τ times, we take into account τ time coarse-grained signals, instead of only one signal as in conventional multiscale entropy approaches [13]. Therefore, in refined composite-based algorithms, we have τ times more number of instances in comparison with their corresponding basic versions, leading to more reliable results, especially for short signals. This fact can be seen in Fig. 4 in comparison with Fig. 3. Although RCMSEμ outperforms MSEμ in terms of reliability for short signals, RCMSEμ values for C = 100 and C = 300 (Fig. 4a, b) are still undefined at some scale factors.
However, the FuzEn-based algorithms do not count matches, yet consider all possible range of distances between any two composite vectors. Therefore, MFEμ and RCMFEμ avoid resulting in undefined entropy values in such situations. The results obtained by the RCMFEμ (Fig. 6) have considerably smaller SD values, especially for short signals, than those obtained by MFEμ (Fig. 5).
Synthetic signals
To understand the effect of frequency on multiscale entropy-based methods, we employed a sliding window moving along each of the abovementioned synthetic signals. Then, for each scale factor, the multiscale entropy-based method of that part of the signal was computed. Because the length of the window is 2000 sample points, we consider the scale factor from 1 to 15, to ensure the length of the coarse-grained signals is enough for m = 2 [33].
For chirp signal with constant amplitude, the RCMFEσ, RCMFEμ, MSEσ 2, and MSEμ results are respectively shown in Fig. 7a–d. When the time window is occupied at the beginning of the signal, which has smaller frequency, the FuzEn and SampEn values are low across all τ. As expected theoretically, all the RCMFEσ, RCMFEμ, MSEσ 2, and MSEμ values increase with higher frequencies, which happens in later temporal windows (TWs). It is worth noting that since the SD/variance, unlike the mean value, of one single number is 0, the entropy measure in the first scale factor is undefined. This fact can be seen in Fig. 7a, c in comparison with Fig. 7b, d.
In Fig. 7e–h, it can be observed generally, using an AR(1) process with variable parameter, that the entropy measures of RCMFEσ, MFEσ 2, and MFEσ, unlike RCMFEμ, increase in higher TWs in every scale factor.
Figure 7i–l respectively shows the results obtained by RCMFEσ, RCMFEμ, MFEσ, and MFEμ using the abovementioned MIX process. The entropy measures of all of them decrease in higher TWs in every scale factor, showing the evolution from randomness to periodic oscillations.
Figure 7m–p illustrates the results obtained by RCMFEσ, RCMFEμ, MSEσ, and MSEμ, respectively, using the logistic map which the parameter α changes linearly from 3.5 to 3.99. The entropy measures, obtained by all of them, generally increase along the signal, at each scale factor, except for the downward spikes in the windows of periodic behavior. This fact is in agreement with Fig. 4.10 (page 87 in [23]). It is also supported by Fig. 1d which shows that the frequency of the signal for t = 70–75 s is lower than for its adjacent time samples. In case of increasing scale factor, the RCMFEσ and MSEσ results decrease, whereas the RCMFEμ and MSEμ results first increase respectively until τ = 2 and τ = 4 then decrease. It shows that mean- and standard deviation-based multiscale approaches, extracting different kinds of dynamical properties of, respectively, mean and spread over multiple time scales, lead to different kinds of features.
Using the Lorentz system, we find that RCMFEσ, RCMFEμ, MSEμ, and RCMSEμ respectively shown in Fig. 7q–t can distinguish two different non-linear dynamics.
Clinical datasets
We also assess the suitability of the RCMFEμ and RCMFEσ methods to characterize AD in MEG signals. The profiles are shown in Fig. 8. The average of RCMFEσ values for AD patients is smaller than that for controls at all scale factors. This is in agreement with [5, 34]. In contrast, the average of RCMFEμ values for AD patients is smaller than that for controls for only 1 ≤ τ ≤ 3.
False discovery rate (FDR)-adjusted [35] p values of a Student t test assuming unequal variances for each MEG channel and temporal scale factor to evaluate the differences between the values of entropy for AD patients and controls are shown in Fig. 8 in a logarithmic scale. The FDR-adjusted p values obtained by RCMFEμ, unlike those of RCMFEσ, initially increase and then decrease along the temporal scale factor for almost all channels.
We also classify the AD subjects and controls using a naive Bayes classifier [36]. For each individual, 15 and 14 features (temporal scale factors) are extracted by averaging the RCMFEμ and RCMFEσ results across all channels, respectively. We ran 200 repetitions of a tenfold cross-validation. The average classification accuracies were 72.81 and 78.22%, respectively, for RCMFEμ and RCMFEσ. This shows that, in this case, RCMFEσ features lead to higher classification accuracy than RCMFEμ ones. The classification was done with the WEKA data mining software [37].
We also study the behavior of RCMFEμ and RCMFEσ in focal and non-focal EEG time series. The error bars illustrating the distributions of the RCMFEμ and RCMFEσ values computed from focal and non-focal EEG signals are shown in Fig. 9a, b until scale factor 30. For each scale factor, the average of entropy values of focal EEG signals is smaller than that of non-focal ones. It illustrates that the non-focal EEG recordings are generally more complex than the focal ones, and it is in agreement with [28] and [38].
We adjusted the FDR independently for each of RCMFEμ and RCMFEσ. The adjusted p values are depicted in Fig. 9c, d for RCMFEμ and RCMFEσ, respectively. The results show that the RCMFEμ method achieves smaller adjusted p values at scale factors 1–9, whereas the RCMFEσ algorithm leads to smaller adjusted p values at scale factors 10–30, demonstrating that when (RC)MFEμ at specific scale factors cannot distinguish different kinds of dynamics, the (RC)MFEσ may do so and vice versa.
We also applied the same classification scheme to distinguish the focal and non-focal signals. The average classification accuracies were 71.58 and 79.62%, respectively, for RCMFEμ and RCMFEσ. It again shows that the RCMFEσ may lead to different or sometimes more useful information for characterization of signals.
To compare the existing and proposed univariate multiscale methods, we use FDR-adjusted p values for focal versus non-focal signals as well as AD patients’ versus controls’ recordings. The results for scale factor 10 are shown in Table 3. The results demonstrate that standard deviation-based methods discriminate two groups for both the datasets better than variance- and mean-based multiscale algorithms. The adjusted p values show that for clinical filtered data, unlike noisy time series, the refined composite technique does not improve the performance of the basic multiscale approaches noticeably. As the refined composite algorithm significantly increases the computation times for these two clinical datasets, the basic versions of multiscale methods are preferable in this case.
Table 3.
Dataset | MSEμ | MFEμ | RCMSEμ | RCMFEμ |
---|---|---|---|---|
EEG data | 2.33∙10−8 | 1.18∙10−8 | 2.21∙10−8 | 1.10∙10−8 |
MEG data | 0.6601 | 0.1321 | 0.6464 | 0.1208 |
MSEσ | MFEσ | RCMSEσ | RCMFEσ | |
EEG data | 2.33∙10−10 | 1.70∙10−10 | 1.99∙10−10 | 1.38∙10−10 |
MEG data | 0.0037 | 0.0035 | 0.0036 | 0.0028 |
MSEσ 2 | MFEσ 2 | RCMSEσ 2 | RCMFEσ 2 | |
EEG data | 2.24∙10−9 | 1.07∙10−9 | 1.39∙10−9 | 1.77∙10−9 |
MEG data | 0.0045 | 0.0052 | 0.0045 | 0.0051 |
Discussions
In this section, we discuss the results obtained by the existing and proposed multiscale methods for noise and synthetic signals and clinical datasets.
Noise signals
The patterns for MFEσ and MFEσ 2 are similar to MSEσ and MSEσ 2, respectively. However, as expected theoretically, the SD of MFEσ and MFEσ 2 values for each scale is comparatively smaller than that of MSEσ and MSEσ 2 measures, respectively. The FuzEn and SampEn for 1/f noise are larger than those of WGN when 33 < τ and 42 < τ, respectively. It shows another relative advantage of MFEσ over MSEσ. Although the curves for RCMSEσ and RCMSEσ 2 have smaller SDs than MSEσ and MSEσ 2, respectively, for each scale factor, these have larger SDs in comparison with RCMFEσ and RCMFEσ 2. This fact confirms our theoretical expectation about RCMFEσ and RCMFEμ producing the most stable results among these 12 multiscale entropy methods. In brief, FuzEn-based multiscale methods are more stable than SampEn-based algorithms. Furthermore, the refined composite coarse-graining techniques improve the stability of MSE or MFE. In addition, for 1/f noise and WGN time series, the multiscale methods based on standard deviation may have better performance in shorter temporal scales than those based on variance.
Sensitivity of multiscale methods to signal length
Using the fuzzy membership function and/or refined composite technique causes the RCMFEμ to become more reliable and stable for short signals in comparison with the other mean-based multiscale methods. Note that the results obtained by variance- and standard deviation-complexity measures are similar to Figs. 3, 4, 5, and 6, although like Fig. 2, the crossing points are different. That is, we have similar advantages of RCMFE methods based on standard deviation or variance over their MSE, MFE, and RCMSE counterparts.
Synthetic signals
For the chirp signal with constant amplitude, the refined composite multiscale entropy-based approaches, i.e., RCMFEσ and RCMFEμ, are more stable than their basic counterparts (MSEσ 2 and MSEμ). For the abovementioned AR(1) process, results obtained by MFEσ and MFEσ 2 have similar patterns, although MFEσ 2 is relatively more variable than MFEσ. As expected theoretically, refined composite technique reduces the variability of the results. For the aforementioned MIX process, when the TW moves from a stochastic signal to periodic deterministic sequence, the entropy measures for all these methods decrease. In addition, moving from τ = 2 to τ = 15, the entropy measures decrease. Although all these approaches generally demonstrate the same behavior, the RCMFEσ and RCMFEμ results are more stable than their corresponding basic counterparts.
For the abovementioned logistic map, the results again demonstrate that mean- and standard deviation-based multiscale approaches, extracting different kinds of dynamical properties of, respectively, mean and spread over multiple time scales, lead to different kinds of features.
The results obtained using the Lorentz system show that although at smaller-scale factors the entropy measures for RCMFEσ, RCMFEμ, MSEμ, and RCMSEμ are very low, two different segments are distinguishable in larger-scale factors. This fact depicts the importance of multiscale entropy methods and temporal scales, in comparison with basic entropy approaches having only scale factor 1, in signal processing. As can be seen in Fig. 7q, t, the results obtained by the RCMFEμ are more stable than RCMSEμ, and RCMSEμ results are more stable than MSEμ ones. It demonstrates the importance of fuzzy entropy and refined composite algorithm to improve the stability of the results.
Clinical datasets
For MEG dataset, the adjusted p values illustrate that the most significant differences are seen around temporal scales 7–14 and 3–9 using RCMFEμ and RCMFEσ, respectively. It shows that if a mean-based multiscale entropy cannot discriminate two groups at specific scale factors, its corresponding standard deviation-based one may be able to do so, and vice versa.
The profiles in Fig. 8 show increases in entropy for RCMFEμ and RCMFEσ at scales 1–3 and 2–6, respectively. This emphasizes the suitability of multiscale evaluations for the assessment of biomedical data as these approaches managed to reveal different dynamics associated with pathology (AD in this case), despite previous claims by some authors that the coarse-graining procedure in MSE had the shortcoming that it tended to artificially decrease the entropy values as a function of the time scale [39].
For focal and non-focal EEG dataset, since all EEG signals were band-pass filtered between 0.5 and 40 Hz, there is no relevant information left for analysis of frequencies higher than 40 Hz. However, the frequency that corresponds to the analysis of scale 1 is 512/2 Hz. This may be the reason why SampEn is so low for short time scales.
It should be added that the entropy parameters used for biomedical signals are exactly similar to those mentioned for synthetic time series. We tested different r values from 0.05 to 0.2 for this kind of signals, and for all of them, the results had similar patterns and the conclusions do not change when the parameters are varied.
Note that in the MSE algorithm, we kept the value of r fixed across temporal scales. Other authors suggested recalculating the tolerance r at each scale factor separately [14]. Using several physiological datasets, they found that recalculating r produced similar results to those obtained by not recomputing r, as in the original description of MSE proposed by Costa et al. [9]. Considering that there was no evidence of the fact that recomputing r for each scale improved the results, we decided to keep r fixed so that we retained the advantages of the original formulation of multiscale entropy by which the entropy of WGN decreases with τ. (Note that renormalizing r for each scale will lead to flatter MSE curves for WGN, contrary to theoretical expectations.)
Conclusions
In this paper, we introduced the RCMFEσ and RCMFEμ, extracting different kinds of dynamical properties (or features) of spread and mean, respectively, over multiple time scales. We illustrated the behavior of these multiscale entropy-based approaches versus WGN, 1/f noise, several straightforward concepts in signal processing, and two clinical datasets. The results showed that MSEσ and MFEσ had better performance to show the concept of complexity than, respectively, MSEσ 2 and MFEσ 2 for 1/f noise and WGN time series. The FuzEn-based multiscale methods were more stable than SampEn-based algorithms, and furthermore, the refined composite technique noticeably improved the stability of the basic MSE and MFE methods. The proposed methods alleviated the problem of undefined MSE and RCMSE values for short signals. The classification results, obtained using simple classification methods, showed that RCMFEσ-based features lead to higher classification accuracies in comparison with the RCMFEμ-based ones. The results also illustrated that when the (RC)MFEμ, as a signal-dependent method, cannot distinguish different types of dynamics of a particular signal, the (RC)MFEσ may do so, and vice versa. We expect that our developments will find applications in physiologic and non-physiologic studies to distinguish different kinds of dynamics.
Acknowledgements
The authors would like to thank Dr. Madalena Costa from the Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, USA, for very useful suggestions and comments. They extend thanks to Dr. Andrzejak from the Department of Information and Communication Technologies, Universitat Pompeu Fabra, Barcelona, Spain, for providing the EEG data described in Sect. 2.
Biographies
Hamed Azami
is a PhD student at The University of Edinburgh, UK. His research interest is developing signal processing and pattern recognition techniques with major applications in biomedical data and neuroscience.
Alberto Fernández
is “Profesor Titular” in the Department of Psychiatry, Complutense University of Madrid. His research interests include the application of magnetoencephalography to the diagnosis of Alzheimer’s disease.
Javier Escudero
is a tenured faculty member (Chancellor’s Fellow) at the University of Edinburgh, UK. His research interests include biomedical signal processing and pattern recognition in clinical applications.
Appendix
The codes for our analysis, including SampEn, FuzEn, MSEμ, MFEμ, RCMSEμ, RCMFEμ, MSEσ 2, MFEσ 2, RCMSEσ 2, RCMFEσ 2, MSEσ, MFEσ, RCMSEσ, and RCMFEσ, are available at 10.7488/ds/1477.
Compliance with ethical standards
All control subjects and AD patients’ caregivers gave informed consent for participation in the study, which was approved by the local Ethics Committee. Retrospective EEG data analysis has been approved by the ethics committee of the Kanton of Bern. Moreover, all patients gave written informed consent that the obtained signals from long-term EEG might be utilized for research purposes
Contributor Information
Hamed Azami, Email: hamed.azami@ed.ac.uk.
Alberto Fernández, Email: aferlucas@med.ucm.es.
Javier Escudero, Email: javier.escudero@ed.ac.uk.
References
- 1.Bandt C, Pompe B. Permutation entropy: a natural complexity measure for time series. Phys Rev Lett. 2002;88:1–4. doi: 10.1103/PhysRevLett.88.174102. [DOI] [PubMed] [Google Scholar]
- 2.Richman JS, Moorman JR. Physiological time-series analysis using approximate entropy and sample entropy. Am J Phys Heart Circ Phys. 2000;278:H2039–H2049. doi: 10.1152/ajpheart.2000.278.6.H2039. [DOI] [PubMed] [Google Scholar]
- 3.Chen W, Wang Z, Xie H, Yu W. Characterization of surface EMG signal based on fuzzy entropy. Neural Syst Rehabil Eng IEEE Trans. 2007;15:266–272. doi: 10.1109/TNSRE.2007.897025. [DOI] [PubMed] [Google Scholar]
- 4.Liu C, Li K, Zhao L, Liu F, Zheng D, Liu C, Liu S. Analysis of heart rate variability using fuzzy measure entropy. Comput Biol Med. 2013;43:100–108. doi: 10.1016/j.compbiomed.2012.11.005. [DOI] [PubMed] [Google Scholar]
- 5.Labate D, La Foresta F, Morabito G, Palamara I, Morabito FC. Entropic measures of EEG complexity in Alzheimer’s disease through a multivariate multiscale approach. Sensors J IEEE. 2013;13:3284–3292. doi: 10.1109/JSEN.2013.2271735. [DOI] [Google Scholar]
- 6.Micó P, Mora M, Cuesta-Frau D, Aboy M. Automatic segmentation of long-term ECG signals corrupted with broadband noise based on sample entropy. Comput Methods Prog Biomed. 2010;98:118–129. doi: 10.1016/j.cmpb.2009.08.010. [DOI] [PubMed] [Google Scholar]
- 7.Chen W, Zhuang J, Yu W, Wang Z. Measuring complexity using FuzzyEn, ApEn, and SampEn. Med Eng Phys. 2009;31:61–68. doi: 10.1016/j.medengphy.2008.04.005. [DOI] [PubMed] [Google Scholar]
- 8.Costa M, Goldberger AL, Peng C-K. Multiscale entropy analysis of biological signals. Phys Rev E. 2005;71:021906. doi: 10.1103/PhysRevE.71.021906. [DOI] [PubMed] [Google Scholar]
- 9.Costa M, Goldberger AL, Peng C-K. Multiscale entropy analysis of complex physiologic time series. Phys Rev Lett. 2002;89:1–4. doi: 10.1103/PhysRevLett.89.068102. [DOI] [PubMed] [Google Scholar]
- 10.Fogedby H. On the phase space approach to complexity. J Stat Phys. 1992;69:411–425. doi: 10.1007/BF01053799. [DOI] [Google Scholar]
- 11.Humeau-Heurtier A. The multiscale entropy algorithm and its variants: a review. Entropy. 2015;17:3110–3123. doi: 10.3390/e17053110. [DOI] [Google Scholar]
- 12.Costa MD, Goldberger AL. Generalized multiscale entropy analysis: application to quantifying the complex volatility of human heartbeat time series. Entropy. 2015;17:1197–1203. doi: 10.3390/e17031197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Wu S-D, Wu C-W, Lin S-G, Lee K-Y, Peng C-K. Analysis of complex time series using refined composite multiscale entropy. Phys Lett A. 2014;378:1369–1374. doi: 10.1016/j.physleta.2014.03.034. [DOI] [Google Scholar]
- 14.Angelini L, Maestri R, Marinazzo D, Nitti L, Pellicoro M, Pinna GD, Stramaglia S, Tupputi SA. Multiscale analysis of short term heart beat interval, arterial blood pressure, and instantaneous lung volume time series. Artif Intell Med. 2007;41:237–250. doi: 10.1016/j.artmed.2007.07.012. [DOI] [PubMed] [Google Scholar]
- 15.Azami H, Escudero J. Improved multiscale permutation entropy for biomedical signal analysis: interpretation and application to electroencephalogram recordings. Biomed Signal Process Control. 2016;23:28–41. doi: 10.1016/j.bspc.2015.08.004. [DOI] [Google Scholar]
- 16.Ahmed MU, Mandic DP. Multivariate multiscale entropy: a tool for complexity analysis of multichannel data. Phys Rev E. 2011;84:061918. doi: 10.1103/PhysRevE.84.061918. [DOI] [PubMed] [Google Scholar]
- 17.Aboy M, Hornero R, Abasolo D, Alvarez D. Interpretation of the Lempel-Ziv complexity measure in the context of biomedical signal analysis. Biomed Eng IEEE Trans. 2006;53:2282–2288. doi: 10.1109/TBME.2006.883696. [DOI] [PubMed] [Google Scholar]
- 18.Escudero J, Hornero R, Abásolo D. Interpretation of the auto-mutual information rate of decrease in the context of biomedical signal analysis. Application to electroencephalogram recordings. Physiol Meas. 2009;30:187. doi: 10.1088/0967-3334/30/2/006. [DOI] [PubMed] [Google Scholar]
- 19.Diebold F. Elements of forecasting. Boston: Cengage Learning; 1998. [Google Scholar]
- 20.Sejdić E, Lipsitz LA. Necessity of noise in physiology and medicine. Comput Methods Prog Biomed. 2013;111:459–470. doi: 10.1016/j.cmpb.2013.03.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ferrario M, Signorini MG, Magenes G, Cerutti S. Comparison of entropy-based regularity estimators: application to the fetal heart rate signal for the identification of fetal distress. Biomed Eng IEEE Trans. 2006;53:119–125. doi: 10.1109/TBME.2005.859809. [DOI] [PubMed] [Google Scholar]
- 22.Pincus SM. Approximate entropy as a measure of system complexity. Proc Natl Acad Sci. 1991;88:2297–2301. doi: 10.1073/pnas.88.6.2297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Baker GL, Gollub JP. Chaotic dynamics: an introduction. Cambridge: Cambridge University Press; 1996. [Google Scholar]
- 24.Thuraisingham RA, Gottwald GA. On multiscale entropy analysis for physiological data. Physica A: Stat Mech Appl. 2006;366:323–332. doi: 10.1016/j.physa.2005.10.008. [DOI] [Google Scholar]
- 25.McKhann G, Drachman D, Folstein M, Katzman R, Price D, Stadlan EM. Clinical diagnosis of Alzheimer’s disease report of the NINCDS-ADRDA Work Group* under the auspices of Department of Health and Human Services Task Force on Alzheimer’s disease. Neurology. 1984;34:939–939. doi: 10.1212/WNL.34.7.939. [DOI] [PubMed] [Google Scholar]
- 26.Escudero J, Sanei S, Jarchi D, Abásolo D, Hornero R. Regional coherence evaluation in mild cognitive impairment and Alzheimer’s disease based on adaptively extracted magnetoencephalogram rhythms. Physiol Meas. 2011;32:1163. doi: 10.1088/0967-3334/32/8/011. [DOI] [PubMed] [Google Scholar]
- 27.Escudero J, Hornero R, Abásolo D, Fernández A. Quantitative evaluation of artifact removal in real magnetoencephalogram signals with blind source separation. Ann Biomed Eng. 2011;39:2274–2286. doi: 10.1007/s10439-011-0312-7. [DOI] [PubMed] [Google Scholar]
- 28.Andrzejak RG, Schindler K, Rummel C. Nonrandomness, nonlinear dependence, and nonstationarity of electroencephalographic recordings from epilepsy patients. Phys Rev E. 2012;86:046206. doi: 10.1103/PhysRevE.86.046206. [DOI] [PubMed] [Google Scholar]
- 29.Zhang Y-C. Complexity and 1/f noise. A phase space approach. J Phys I. 1991;1:971–977. [Google Scholar]
- 30.Reed GF, Lynn F, Meade BD. Use of coefficient of variation in assessing variability of quantitative assays. Clin Diagn Lab Immunol. 2002;9:1235–1239. doi: 10.1128/CDLI.9.6.1235-1239.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Jiang Y, Mao D, Xu Y. A fast algorithm for computing sample entropy. Adv Adapt Data Anal. 2011;3:167–186. doi: 10.1142/S1793536911000775. [DOI] [Google Scholar]
- 32.Pincus SM, Goldberger AL. Physiological time-series analysis: what does regularity quantify? Am J Phys Heart Circ Phys. 1994;266:H1643–H1656. doi: 10.1152/ajpheart.1994.266.4.H1643. [DOI] [PubMed] [Google Scholar]
- 33.Ahmed MU, Mandic DP. Multivariate multiscale entropy analysis. Signal Process Lett IEEE. 2012;19:91–94. doi: 10.1109/LSP.2011.2180713. [DOI] [Google Scholar]
- 34.Ferlazzo E, Mammone N, Cianci V, Gasparini S, Gambardella A, Labate A, Latella MA, Sofia V, Elia M, Morabito FC, Aguglia U. Permutation entropy of scalp EEG: a tool to investigate epilepsies: suggestions from absence epilepsies. Clin Neurophysiol. 2014;125:13–20. doi: 10.1016/j.clinph.2013.06.023. [DOI] [PubMed] [Google Scholar]
- 35.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol. 1995;57:289–300. [Google Scholar]
- 36.Rish I. IJCAI 2001 workshop on empirical methods in artificial intelligence. New York: IBM; 2001. An empirical study of the naive Bayes classifier; pp. 41–46. [Google Scholar]
- 37.Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH. The WEKA data mining software: an update. ACM SIGKDD Explorations Newsl. 2009;11:10–18. doi: 10.1145/1656274.1656278. [DOI] [Google Scholar]
- 38.Sharma R, Pachori RB, Acharya UR. Application of entropy measures on intrinsic mode functions for the automated identification of focal electroencephalogram signals. Entropy. 2015;17:669–691. doi: 10.3390/e17020669. [DOI] [Google Scholar]
- 39.Valencia JF, Porta A, Vallverdú M, Claria F, Baranowski R, Orlowska-Baranowska E, Caminal P. Refined multiscale entropy: application to 24-h holter recordings of heart period variability in healthy and aortic stenosis subjects. Biomed Eng IEEE Trans. 2009;56:2202–2213. doi: 10.1109/TBME.2009.2021986. [DOI] [PubMed] [Google Scholar]