Abstract
Blind Source Separation (BSS) problems, under the assumption of static mixture, were extensively explored from the theoretical point of view. Powerful algorithms are now at hand to deal with many concrete BSS applications. Nevertheless, the performances of BSS methods, for a given biomedical application, are rarely investigated. The aim of this paper is to perform quantitative comparisons between various well-known BSS techniques. To do so, synthetic data, reproducing real polysomnographic recordings, are considered.
Keywords: Independent Component Analysis, ICA, BSS, ECG, EEG, Polysomnography, Modeling, Source, Mixing matrix
I. Introduction
In the last decade Blind Source Separation (BSS) methods have been widely used in the field of biomedical engineering. Noise reduction and useful signal extraction are among the most significant applications of BSS. More particularly, assume that one realization of N-dimensional random vector process {x[m]m∈ℕ} with values in the real field is available, such as:
| (1) |
where s[m] = [s1[m] … sP[m]]T is a P source vector with statistically independent components, v[m] = [v1[m] … vN[m]]T is a N noise vector which is independent from the source vector and A = [a1,…, aP] is a (P×N) matrix, called mixing matrix. The BSS problem consists then in finding, only from the data, a (N×P) matrix W, called separator, such that:
| (2) |
is an estimate of s[m] to within a trivial matrix [5], i.e. a matrix of the form ΛΠ where Λ is a diagonal matrix and Π is a permutation matrix.
Up to now, the BSS problem has been extensively explored from the theoretical point of view and many techniques are available to deal with numerous concrete applications. Then a fundamental question needs to be addressed: which BSS method, or class of BSS methods to use for a given biomedical application? In this paper we address this question for a particular problem of electrophysiological signal separation encountered in ambulatory polysomnography [11]. To do so, we have conducted performance comparisons on synthetic data, reproducing real polysomnographic observations, of seven BSS approaches. In the following, these hypotheses are assumed to be fulfilled:
H1) The vector random process {s[m]m∈ℕ} is stationary and ergodic;
H2) For any integer m, the components of s[m] are mutually independent;
H3) At most one source is Gaussian;
H4) For any integer m, the components of s[m] are mutually independent from the components of v[m];
H5) The vector random process {v[m]m∈ℕ} is stationary, ergodic and Gaussian;
H6) The matrix A has a full column rank equal to P.
II. Data generation
The main goal is to obtain synthetic but realistic data for comparing the above BSS methods. To do so, we have used our sleep database (14 patients) [11] and an EEG model [9] for generating background brain activities.
A-. Generation of the source and noise data
The simulated sources are denoted by EEGS, EOGRS, EOGLS and ECGS (figure 1(a)). They represent brain activity, rapid eye movements, slow eye movements and cardiac activity, respectively. More precisely, the EEGS source is simulated using the model of Jansen [9] where parameters were selected to derive a cerebral background activity. Note that the statistical distribution of this signal is quasi-Gaussian. This is in agreement with real background EEG data. The other sources are derived from our sleep recordings database presented in [11]. More precisely, the EOGRS source is issued from a band-pass filtering (between 1 Hz and 8 Hz]) of the derivation FPZ-CZ of the standard 10–20 system. The EOGLS source corresponds to a low-pass filtering of the classical derivation E1–E2 with the cut-off frequency of 4HZ in order to reduce the effect of EEG and EMG interference. Finally, the ECGS source corresponds to a cardiac signal recorded on patients during their sleep. In reference to the additif noise {v[m]m∈ℕ}, it is modeled as a spatially correlated Gaussian noise with the spatial correlation equal to 0.5.
Fig. 1.

An example of 20 secondes portion of original sources, observations and estimated sources
B-. Generation of the mixing matrix
To derive the mixing matrix associated with the brain and ocular activities, a three concentric sphere head model is used. Four dipoles located at four fronto-parietal positions and a patch of two hundred dipoles (uniformly located in the cortex) characterize the eye movements (rapid and slow) and the background EEG sources, respectively. The EEG recording system contains four electrodes plus one reference electrode: two temporal sensors, in front of the higher part of the ears, denoted by F7m and F8m (where m stands for modified), two frontal sensors, above the eyes, denoted by FP1m and FP2m, and the reference electrode CZ located at the top of the head [11]. Then the transfer formula [12] describing the relationship between current dipoles and surface observation is used to obtain the (4 × 3) mixing matrix A′. Since the heart contribution to the data is assumed to be non-uniform on all the channels, we decide to add to the (4 × 3) mixing matrix A′ a fourth column vector a4 with different components. Note that the obtained mixing matrix A = [A′ a4] is slightly ill-conditioned.
III. Study of seven classical BSS methods
We have focused in this section on seven well-know BSS algorithms, namely SOBI [1], TFBSS [7], COM2 [5], JADE [3], COM1 [6], INFOMAX [10] and FastICA [8]. Nevertheless, before presenting these methods, we first formulate the criterion allows for quantitative comparison of the performance given by each BSS method.
A-. Performance criterion
In order to compare quantitatively the ability of two separators, W1 and W2 to extract a source we used the criterion introduced by Chevalier [4]. Chevalier has shown that the quality of the retrieved source is directly related to the Signal to Interference-Plus-Noise Ratio (SINR) of this source after separation. More precisely, the SINR of the p-th source at the i-th output of the separator W = [w1,…, wP] is defined by:
| (3) |
where πp represents the power of the p-th source, wi the i-th column of separator W and Rvp is the total noise covariance matrix for the p-th source, corresponding to the data covariance matrix Rx, in the absence of the source P. On the basis of these definitions, the restitution quality of the p-th source at the output of separator W is evaluated by computing the maximum of SINRP [wi] with respect to i (1 ≤ i ≤ P). This quantity is denoted by SINRMp. The performance of a source separator W is defined by the line vector SINRM(W) given by:
| (4) |
in a given context, a separator W1 is better than a separator W2 for retrieving the source p, if SINRMp[W1] > SINRMP [W2].
B-. Optimal source separator
The criterion given by (4) allows for a quantification of the source separation performed by BSS algorithms. However, besides this criterion it is necessary to know its upper bound, which is achieved by the optimal source separator, in order to completely evaluate the performance of a given BSS method. It is shown [4] that the optimal source separator corresponds to the separator WSMF whose columns are the Spatial Matched Filters (SMF) associated with the different sources. It is defined to within a trivial matrix by:
| (5) |
where Rx, is the covariance of {x[m]m∈ℕ}.
C-. Presentation of the considered BSS methods
The BSS methods considered in the paper aim at implementing, to within a trivial matrix, an estimate of the optimal separator (5) from both an estimate, Rx, of the data covariance matrix Rx common to all the methods, and the estimate, Â, of A, computed by each BSS method. More precisely, in order to estimate A, SOBI (Second Order Blind Identification) [1] exploits a set of delayed covariance matrices of the data. TFBSS (Time-Frequency Blind Source Separation) [7] can be considered as an extension of the SOBI algorithm in order to process non-stationary signals. COM2 (COntrast Maximization 2) [5], JADE (Joint Approximated Diagonalization of Eigen-matrices) [3] and COM1 (COntrast Maximization 1) [6] rely on the maximization of a contrast criterion based on fourth order cumulants. INFOMAX (INFOrmation MAXimization) [10] maximizes a criterion based on information theory and FastICA (Fast Independent Component Analysis) [8] uses the fixed-point algorithm to maximize a negentropy-based criterion. Note that we have considered two versions of SOBI, denoted by SOBI and SOBIR in the sequel, and two versions of FastICA denoted by FastICADO and FastICASO-The difference between SOBI and SOBIR concerns the whitening step. Indeed, while SOBI uses the classical whitening [1], SOBIR exploits the robust whitening [2] which is not affected by a spatially correlated noise provided that this noise is temporally white. Concerning FastICA, DO refers to the Deflation Orthogonal approach (the sources are extracted one by one) while SO refers to the Symmetric Orthogonal approach (the sources are simultaneously extracted).
IV. Simulation results
A comparative performance study on synthesized sleep recording data, of the considered seven algorithms is proposed in this section. To do so, two studies are envisaged. For each one, the performance criterion is averaged over two hundred runs. Figures 1(b) and 1(c) illustrate an example of the observations obtained from the generated mixing matrix and the sources presented in figure 1(a) (with a high SNR) and the results of the separation using FastICADO, of the EEGS, EOGRS, EOGLS and ECGS signals, respectively. Clearly, we observe that cerebral activity (see y4), and the cardiac activity (see y2) are well recovered. Regarding eye movements, y1 and y3 show rapid eye movements and slow eye movements, respectively. However, a few rapid eye mouvements also appear on y3, which implies that the separation is not perfect.
A-. Influence of the data length for a fixed SNR
In this experiment, we set the Signal to Noise Ratio (SNR) to 15 dB for each source. The SINRMp criterion 1 ≤ p ≤ 4 at the output of the SOBI, SOBIR, TFBSS, COM2, COM1, JADE, FastICADO, FastICASO and INFOMAX, is computed as a function of the number of samples M (with a sampling rate of 256 Hz). Figure 2 shows the separation results of the EEGS, EOGRS, EOGLS and ECGS. COM2, COM1, JADE, FastICADO and FastlCASO lead to good behaviors, for EEGS, EOGRS, EOGLS and ECGS. Moreover, the ability of these methods to separate sources increases with the data length M. Nevertheless, we noticed that COM1 is slightly less effective for separating EOGLS sources when the data length is low. The INFOMAX algorithm is slightly less effective for separating EEGS, EOGRS and EOGLS, but its performances are equivalent to those obtained by COM2, COM1, JADE, FastICADO and FastICASO, in the case of ECGS source. Regarding TFBSS algorithm, it appears that its behavior is slightly lower for all the sources, except for the ECGS source whose extraction is quasi-optimal. Finally, the two versions of SOBI algorithm lead to different results. Indeed, using the classical whitening, SOBI algorithm is not able to separate the cerebral activity EEGS and has difficulty for separating the three other sources. On the other hand, we note the very good behavior of SOBIR (using the robust spatial whitening) except for the extraction of the EOGRS which will need more than 1500 samples to be well-separated.
Fig. 2.

Variations (in dB) of each source SINRM for the sources as a function of the data length.
B-. Influence of the SNR
In this section we study the behavior of the seven BSS methods as a function of the SNR, which is assumed to be the same for each source. The data length has been set to 5120 in agreement with the practical issues in the field of polysomnography. Indeed, in polysomnography epochs of 20 seconds of data are considered. Then for a sampling rate of 256 Hz each epoch contains 5120 samples. Figure 3 shows that, when varying the SNR from −20 dB to 25 dB, COM1, COM2, JADE, TFBSS, FastICADO and FastICASO exhibit quasi-optimal performances for all the sources. However, for SNR values higher than 25 dB, their performances deviate from the optimal ones (SMF). More precisely, in the case of EEGS and ECGS the six separators have the same deviation. Regarding the eye mouvements, it appears that EOGRS is slightly well extracted using COM1, JADE, TFBSS and FastICASO and EOGLS is well extracted by COM2 and FastICADO. The INFOMAX algorithm behaves more or less like the six previous methods but its performances slightly decrease when dealing with EOGLS source for an SNR lower than 0 dB and with EEGS for a SNR higher than 30 dB. SOBI and SOBIR present nearly optimal performances for the four sources when the SNR is lower than −5 dB. In addition, the convergence of the SINRM at the output of SOBI and SOBIR is also stopped as soon as the SNR increases beyond 30 dB. Note that for a SNR higher than −5 dB, SOBI and SOBIR provide a poor separation of the ocular sources.
Fig. 3.

Variations of each source SINRM (in dB) as a function of a SNR.
V. Discussion and Conclusion
The two studies have shown that, globally, the SINRM of the studied methods are lower than the optimal SMF minus 2 dB, except for the ECG source (it appears on figure 3 provided that a zoom of the figure is performed). This result is due to the fact that the additive noise is spatially correlated and all the studied methods require a prior spatial whitening based on second order statistics. This stage theoretically needs the perfect knowledge of the noise covariance. Now, if we compare the performances provided by each method, it appears that COM2, COM1, JADE, FastICADO and FastICASO globally (when all the sources are considered) gives the best results. However, COM1, which requires that all the sources have kurtosis (normalized marginal fourth order cumulants) with the same sign, is slightly affected by the presence of the EORLS source when only few samples are available. Computer results have shown that the probability of the EOGLS source to have an estimated kurtosis with a different sign is inversely proportional to the number of snapshots. The INFOMAX and TFBSS algorithms are slightly less effective for separating the EEGS, EOGRS and EOGLS sources but their performances are still acceptable comparing to those obtained by other methods. Regarding the SOBI algorithm, its performances are poor for all the sources. One of the reasons could be the fact that the noise is spatially correlated. Indeed, the obtained results show that for EEGS and ECGS sources, SOBIR, which is not affected by the spatially correlated noise, has a better behavior. Another reason is the fact that some sources, especially EOGRS and EQGLS, seem to be poorly temporally correlated. In fact, for these two sources SOBIR also provides lower performances in comparison to those obtained by COM2, COM1, JADE, FastICADO and FastICASO. To conclude, the selection of a BSS method should be driven by hypotheses and considerations issued from application objectives such some statistical/physiological prior information on the sources (temporal color, non-gaussianity, sign of kurtosis, …) and the additive noise.
References
- 1.BELOUCHRANI A, ABED-MERAIM K, CARDOSO J-F, MOULINES E. A blind source separation technique using second-order statistics. IEEE Transactions on Signal Processing. 1997 Feb;45(2):434–444. [Google Scholar]
- 2.BELOUCHRANI A, CICHOCJI A. Robust whitening procedure in blind source separation context. Electronics Letters. 2000 Novemeber;23(24):2050–2051. [Google Scholar]
- 3.CARDOSO J-F, SOULOUMIAC A. Blind beamforming for non gaussian signals. IEE Proceedings F. 1993 Dec;140(6):362–370. [Google Scholar]
- 4.CHEVALIER P. Optimal separation of independent narrow-band sources: Concept and Performances. Signal Processing, Elsevier. 1999;73:27–47. [Google Scholar]
- 5.COMON P. Independent component analysis, a new concept? Signal Processing, Elsevier. 1994 Apr;36(3):287–314. [Google Scholar]
- 6.COMON P. From source separation to blind equalization, contrast-based approaches. ICISP 01, International Conference on Image and Signal Processing; Agadir, Morocco. May 3–5, 2001; pp. 20–32. [Google Scholar]
- 7.FEVOTTE C, DONCARLI C. Two contributions to blind sources separation using time-frequency distributions. IEEE Signal processing Letters. 2004 Mar;11(3) [Google Scholar]
- 8.HYVARINEN A, KARHUNEN J, OJA P. Independent component analysis, ser. Wiley interscience; John Wiley and Sons: 2001. [Google Scholar]
- 9.JANSEN B-H, RIT V-G. Electroencephalogram and visual evoked potential generation in a mathematical model of coupled cortical columns? Biological Cybernetics. 1995;73:357–366. doi: 10.1007/BF00199471. [DOI] [PubMed] [Google Scholar]
- 10.LEE TW, GIROLAMI M, SEJNOWSKI TJ. Indepenfent component analysis using an extended infomax algorithm for mixed sub-gaussian end super-gaussian sources. Neural Computation. 1999;11(2):417–441. doi: 10.1162/089976699300016719. [DOI] [PubMed] [Google Scholar]
- 11.POREE F, KACHENOURA A, GAUVRIT H, MORVAN C, CARRAULT G, SENHADJI L. blin source separation for ambulatory sleep recording. Transactions on Information Technology in Biomedicine. 2006;10(2):293–301. doi: 10.1109/TITB.2005.859878. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.ZHANG Z. A fast method to compute surface potentials generated by dipoles within multilayer anisotropic spheres. Physics in Medicine and Biology. 1995 Mar;40(3):335–349. doi: 10.1088/0031-9155/40/3/001. [DOI] [PubMed] [Google Scholar]
