Analytical fuzzy approach to biological data analysis

Weiping Zhang; Jingzhi Yang; Yanling Fang; Huanyu Chen; Yihua Mao; Mohit Kumar

doi:10.1016/j.sjbs.2017.01.027

. 2017 Jan 25;24(3):563–573. doi: 10.1016/j.sjbs.2017.01.027

Analytical fuzzy approach to biological data analysis

Weiping Zhang ^a, Jingzhi Yang ^b, Yanling Fang ^c, Huanyu Chen ^c, Yihua Mao ^d,^⁎, Mohit Kumar ^c,^⁎

PMCID: PMC5372457 PMID: 28386181

Abstract

The assessment of the physiological state of an individual requires an objective evaluation of biological data while taking into account both measurement noise and uncertainties arising from individual factors. We suggest to represent multi-dimensional medical data by means of an optimal fuzzy membership function. A carefully designed data model is introduced in a completely deterministic framework where uncertain variables are characterized by fuzzy membership functions. The study derives the analytical expressions of fuzzy membership functions on variables of the multivariate data model by maximizing the over-uncertainties-averaged-log-membership values of data samples around an initial guess. The analytical solution lends itself to a practical modeling algorithm facilitating the data classification. The experiments performed on the heartbeat interval data of 20 subjects verified that the proposed method is competing alternative to typically used pattern recognition and machine learning algorithms.

Keywords: Modeling, Fuzzy membership functions, Variational optimization

1. Introduction

Data mining is increasingly motivating area of research due to an abundance of data facilitated by modern era of information technology. Data mining techniques such as classification and clustering play a vital role in the development of medical decision support systems contributing to improved healthcare quality. The medical decision making problems inherently involve complexities and uncertainties and thus the researchers have advocated the integration of fuzzy methodologies in medical data interpretation. The handling of uncertainties by capturing of knowledge using fuzzy sets and rules together with an interpretability offered by simple linguistic if-then rules are two most important features of fuzzy methodologies. The fuzzy approaches are commonly applied to medical data classification problems (Fan et al., 2011, Gadaras and Mikhailov, 2009, Nguyen et al., 2015, Papageorgiou, 2011, Seera and Lim, 2014). The mathematical analysis of biomedical signals is performed to construct models identifying the mappings between signal features and the patient’s state. The mathematical relationship between signal features and the patient’s state is affected by uncertainties arising from individual factors (e.g. related to body conditions) that can’t be mathematically taken into account. The fuzzy filters have been previously proposed to alleviate the effect of uncertainties on medical data analysis (Kumar et al., 2007a, Kumar et al., 2007b, Kumar et al., 2008, Kumar et al., 2010a, Kumar et al., 2010b) wherein robust estimation algorithms have been applied to design a fuzzy model that identifies the functional relation between physiological parameters and subjective rating scores. Also, stochastic fuzzy modeling and analysis techniques have been introduced to take simultaneously the advantages of Bayesian analysis and fuzzy theory for a mathematical handling of the uncertainties in biomedical signal analysis (Kumar et al., 2010a, Kumar et al., 2010b, Kumar et al., 2012a, Kumar et al., 2012b). A recent work (Kumar et al., 2016a, Kumar et al., 2016b) introduced in a rigorous manner a stochastic framework for robust fuzzy filtering and analysis of signals. Although Kumar et al., 2016a, Kumar et al., 2016b introduced modeling and analysis framework is general and rests on strong mathematical foundations, it considers only the signal and thus can’t be directly applied to nonsignal multivariate data samples. There remains the need of automated design methods to fully exploit the uncertain handling capabilities of fuzzy systems. The typically used approaches to design the fuzzy sets and systems include evolutionary algorithms (Alcala et al., 2009, Antonelli et al., 2012, Cococcioni et al., 2011, Gacto et al., 2010, Pulkkinen and Koivisto, 2010, Robles et al., 2009), data clustering (Celikyilmaz and Turksen, 2008, Chen and Chen, 2007, Liao et al., 2003, Oh et al., 2003), adaptive filtering (Aliasghary and Arghavani, 2012, Kumar et al., 2006, Kumar et al., 2009a, Kumar et al., 2009b, Mottaghi-Kashtiban et al., 2008, Simon, 2005), and information theoretic concepts (Aliasghary and Arghavani, 2012, Au et al., 2006, Makrehchi et al., 2003). The determination of fuzzy membership functions remains a challenge as membership functions, due to the nonlinearity of the problem, can’t be optimized analytically. Thus, most design methods of fuzzy membership functions lack in mathematical theory and are based on numerical algorithms which might be slow and inexact. Recently, (Kumar et al., 2016a, Kumar et al., 2016b) introduced an analytical approach for the determination of fuzzy membership functions using the variational optimization method. The proposed analytical approach of (Kumar et al., 2016a, Kumar et al., 2016b) allows to mathematically incorporate the given modeling scenario in fuzzy membership functions’ design problem and thus can be potentially extended to medical data modeling scenario. The authors observe that the application of fuzzy paradigm in medicine, despite being an extensively studied area, doesn’t provide a rigorous analytically derived methodology or approach to interpret medical data while taking mathematically into account the measurement noise as well as the individuality.

The medical data are multi-dimensional whose good representation by means of fuzzy membership functions is the aim of the mathematical theory presented in this study. This text introduces a data model that takes into account both measurement noise and uncertainties arising from individuality related factors. A multivariate data sample, represented as y = [y₁ ⋯ y_P]T ∈ RP, is assumed to be generated by an uncertain signal model displayed in Fig. 1. It is assumed an uncertain signal model for a scalar y_j. Here, y_j is the observed value of an unknown scalar m_j being affected by measurement noise v_j and uncertainty u_j. The uncertainty u_j (equal to the dot product of Gj ∈ RK and α ∈ RK) is being generated by a linear combination of K different sources: (α₁, ⋯ ,α_K) that the jth element of y is generated as

y_{j} = m_{j} + u_{j} + v_{j}

where vj is the measurement noise, u_j is the uncertainty affecting the model, and mj is an unobserved scalar variable. The uncertainties are assumed to be generated by linearly transforming a K-dimensional (K ⩽ P) vector $α = α = {[α_{1} \dots α_{K}]}^{T} \in R^{K}$ as follows:

[\begin{matrix} u_{1} \\ ⋮ \\ u_{P} \end{matrix}] = [\begin{matrix} G_{11} & \dots & G_{1 K} \\ ⋮ & ⋮ \\ G_{P 1} & \dots & G_{PK} \end{matrix}] [\begin{matrix} α_{1} \\ ⋮ \\ α_{K} \end{matrix}] .

An uncertain signal model for a scalar *y_j*.

Defining $G_{j} = {[G_{j 1} \dots G_{jK}]}^{T} \in R^{K}$ , $u_{j}$ can be expressed as the dot product of $G_{j}$ and $α$ , i.e.,

u_{j} = {(G_{j})}^{T} α .

Our approach is of

1.
treating all the variables (appearing in the uncertain signal model of Fig.1) as uncertain being characterized by fuzzy membership functions.
2.
assuming that medical data, under the given status of a patient, is generated by a finite mixture of uncertain signal models of the type that of Fig. 1.
3.
determining the fuzzy membership functions on variables with the help of experimentally measured data samples in an analytical manner using variational optimization (Kumar et al., 2016a, Kumar et al., 2016b).

The approach results in a tractable solution to model the multivariate data samples by means of fuzzy membership functions and thus medical decision support systems can be built up on the top of the data models.

The modeling of data using a finite mixture of signal models of the type of Fig. 1 is typically considered in a stochastic setting assuming variables as random (i.e. characterized by probability distribution functions) and Bayesian framework is commonly used for the inference of posterior distributions. The originality of this study lies in solving the modeling problem in a completely deterministic framework where fuzzy membership functions are defined over variables to characterize uncertainties about their values. The optimal shapes of fuzzy membership functions are determined via analytically maximizing the “over uncertainties averaged log membership” values of data samples around an initial guess. The maximization problem is analytically solved using variational optimization as suggested initially in Kumar et al., 2016a, Kumar et al., 2016b. The contribution of this study is to derive the analytical expressions of fuzzy membership functions on variables of the multivariate data model leading to the development of a classification algorithm. It is demonstrated through experimental data that our approach is competing alternative to typically used classification algorithms including “k-nearest neighbors”, “support vector machines”, “decision tree”, “random forest”, “AdaBoost”, “Gaussian naive Bayes”, “linear discriminant analysis”, and “quadratic discriminant analysis”. The better classification performance of our approach is attributed to the efficient modeling of the data distribution in multi-parametric space. The significance of this work is that the analytically derived expressions for fuzzy membership functions for representing uncertainties associated with medical data would facilitate a system theoretic approach to mathematically design the medical expert systems. This would provide researchers, unlike typically used ad-hoc numerical algorithms, a mathematical theory on fuzzy membership functions’ applications in medicine.

This text is organized into sections. Section 2 introduces an uncertain model of multivariate data and an analytical solution for optimizing the data model is provided in Section 3. A practical algorithm, based on the derived analytical solution, is stated in Section4 4 for the modeling of multivariate data samples. Section 5 applies the proposed approach on the experimental heartbeat interval data of 20 subjects followed by concluding remarks in Section 6.

2. An uncertain model of multivariate data

By an uncertain model, it is meant that system variables are characterized by fuzzy membership functions. Despite the availability of a wide range of fuzzy membership function types, only following two types of fuzzy membership functions are chosen to model the variables for keeping the analysis in its most basic form:

Definition 1 Gaussian’s membership function (Kumar et al., 2016a, Kumar et al., 2016b) —

The Gaussian membership function on a vector x ∈ Rn, with mean equal to mx and precision equal to Λx, is defined as

$μ (x; m_{x}, Λ_{x}) = \exp (- \frac{1}{2} {(x - m_{x})}^{T} Λ_{x} (x - m_{x})), m_{x} \in R^{n}, Λ_{x}^{- 1} > 0 .$

Definition 2 Gamma membership function (Kumar et al., 2016a, Kumar et al., 2016b) —

The Gamma membership function on a non-negative scalar z can be defined as

$μ (z; a, b) = {(\frac{b}{a - 1})}^{a - 1} \exp (a - 1) {(z)}^{a - 1} \exp (- bz), a ⩾ 1, b > 0 .$

A few examples of this type of membership functions for different values of a and b are provided in Fig. 2. The parameter a is referred to as the shape parameter and b is referred to as the rate parameter (i.e. the reciprocal of the scale parameter). The peak of the membership function is given at (a − 1)/b. The skewness of the membership function is inversely proportional to the value of a. The Gamma membership function can alternatively be represented as

μ (z; r, s) = {(s)}^{r} \exp (r) {(z)}^{r} \exp (- srz), r ⩾ 0, s > 0

A few examples of Gamma membership functions (Kumar et al., 2016a, Kumar et al., 2016b).

The relations between the parameters of two forms of Gamma membership functions are as follows:

r = a - 1, s = b / (a - 1) .

All of the variables, appearing in Fig. 1, are assigned carefully either of Gaussian or Gamma membership function in Definition 3, Definition 4, Definition 5, Definition 6, Definition 7, Definition 8.

Definition 3 Fuzzy membership function on v_j —

The fuzzy membership function on v_j ∈ R is defined as zero-mean Gaussian with scaled precisions as

$μ (v_{j}; λ_{y}, z_{y_{j}}) = \exp (- \frac{λ_{y} z_{y_{j}}}{2} v_{j}^{2})$ (1)

where $λ_{y} > 0$ is the precision scaled by $z_{y_{j}} > 0$ . The uncertainties of $λ_{y}$ and $z_{y_{j}}$ are characterized by the following Gamma membership functions:

$μ (λ_{y}; a_{λ_{y}}, b_{λ_{y}}) = {(\frac{b_{λ_{y}}}{a_{λ_{y}} - 1})}^{a_{λ_{y}} - 1} \exp (a_{λ_{y}} - 1) {(λ_{y})}^{a_{λ_{y}} - 1} \exp (- b_{λ_{y}} λ_{y}), a_{λ_{y}} ⩾ 1, b_{λ_{y}} > 0 >$

$μ (z_{y_{j}}; r_{y}, s_{y}) = {(s_{y})}^{r_{y}} \exp (r_{y}) {(z_{y_{j}})}^{r_{y}} \exp (- r_{y} s_{y} z_{y_{j}}), r_{y} ⩾ 0, s_{y} > 0 .$

Here, $r_{y} > 0$ , and $s_{y} > 0$ are uncertain as well as characterized by the following Gamma membership functions:

$μ (r_{y}; a_{r_{y} y}, b_{r_{y}}) = {(\frac{b_{r_{y}}}{a_{r_{y}} - 1})}^{a_{r_{y}} - 1} \exp (a_{r_{y}} - 1) {(r_{y})}^{a_{r_{y}} - 1} \exp (- b_{r_{y}} r_{y}), a_{r_{y}} ⩾ 1, b_{r_{y}} > 0$

$μ (s_{y}; a_{s_{y} y}, b_{s_{y}}) = {(\frac{b_{s_{y}}}{a_{s_{y}} - 1})}^{a_{s_{y}} - 1} \exp (a_{s_{y}} - 1) {(s_{y})}^{a_{s_{y}} - 1} \exp (- b_{s_{y}} s_{y}), a_{s_{y}} ⩾ 1, b_{s_{y}} > 0$

Definition 4 Fuzzy membership function on y_j —

The fuzzy membership function on $y_{j} \in R$ , for a given $(m_{j}, G_{j}, α, λ_{y}, z_{y_{j}})$ , is defined as

$μ (y_{j}; m_{j}, G_{j}, α, λ_{y}, z_{y_{j}}) = \exp (- \frac{λ_{y} z_{y_{j}}}{2} {(y_{j} - m_{j} - {(G_{j})}^{T} α)}^{2}) .$

The membership function on $y_{j}$ is derived by replacing $v_{j}$ in (1) by $y_{j} - m_{j} - {(G_{j})}^{T} α$ .

Definition 5 Fuzzy membership function on y —

The multivariate fuzzy membership function on $y \in R^{P}$ , for a given $({m_{j}}_{j = 1}^{P}, {G_{j}}_{j = 1}^{P}, α, λ_{y}, {z_{y_{j}}}_{j = 1}^{P})$ , is defined as the product of its individual elements’ membership functions as

$\begin{matrix} μ (y; {m_{j}}_{j = 1}^{P}, {G_{j}}_{j = 1}^{P}, α, λ_{y}, {z_{y_{j}}}_{j = 1}^{P}) & = \prod_{j = 1}^{P} μ (y_{j}; m_{j}, G_{j}, α, λ_{y}, z_{y_{j}}) \\ = \exp (- \frac{λ_{y}}{2} \sum_{j = 1}^{P} z_{y_{j}} {(y_{j} - m_{j} - {(G_{j})}^{T} α)}^{2}) \end{matrix}$

Definition 6 Fuzzy membership function on m —

The multivariate fuzzy membership function on m = ${[m_{1} \dots m_{P}]}^{T} \in R^{P}$ is defined as Gaussian as

$μ (m; m_{o}, Λ_{o}) = \exp (- \frac{1}{2} {(m - m_{o})}^{T} Λ_{o} (m - m_{o})), m_{o} \in R^{P}, Λ_{o} > 0 .$

Definition 7 Fuzzy membership function on $α$ —

The multivariate fuzzy membership function on $α \in R^{K}$ is defined as zero-mean Gaussian with precision equal to unity matrix as

$μ (α) = \exp (- \frac{1}{2} {(α)}^{T} α) .$

Definition 8 Fuzzy membership function on $G_{j}$ —

The multivariate fuzzy membership function on $G_{j} = {[G_{j 1} \dots G_{jK}]}^{T} \in R^{K}$ is defined as zero-mean Gaussian as

$μ (G_{j}; {ϕ_{k}}_{k = 1}^{K}) = \exp (- \frac{1}{2} \sum_{k = 1}^{K} {(G_{jk})}^{2} ϕ_{k})$

where $ϕ_{k} > 0$ is the precision of kth element of $G_{j}$ and is uncertain characterized by the following Gamma membership function:

$μ (ϕ_{k}; a_{ϕ}, b_{ϕ}) = {(\frac{b_{ϕ}}{a_{ϕ} - 1})}^{a_{ϕ} - 1} \exp (a_{ϕ} - 1) {(ϕ_{k})}^{a_{ϕ} - 1} \exp (- b_{ϕ} ϕ_{k}), a_{ϕ} ⩾ 1, b_{ϕ} > 0 .$

To model the multivariate data sample distributed arbitrarily in P-dimensional data space, a mixture of finite number of uncertain signal models is considered in Definition 9.

Definition 9 Fuzzy membership of y as a finite mixture of uncertain signal models —

The fuzzy membership function on $y = {[y_{1} \dots y_{P}]}^{T} \in R^{P}$ , for a given $({π_{i}}_{i = 1}^{C}, Ω)$ , is defined as a mixture of $C$ different uncertain signal models as

$μ (y; {π_{i}}_{i = 1}^{C}, Ω)$

$= \exp (- \frac{π_{1}}{2} λ_{y}^{1} \sum_{j = 1}^{P} z_{y_{j}}^{1} {(y_{j} - m_{j}^{1} - {(G_{j})}^{T} α)}^{2} \dots - \frac{π_{C}}{2} λ_{y}^{C} \sum_{j = 1}^{P} z_{y_{j}}^{C} {(y_{j} - m_{j}^{C} - {(G_{j})}^{T} α^{C})}^{2})$

where $π_{i} \in [0, 1]$ is the mixing proportion of the ith uncertain signal model with $\sum_{i = 1}^{C} π_{i} = 1$ , and $Ω$ is a set of parameters defined as

$Ω = {{α^{i}}_{i = 1}^{C}, {G_{j}}_{j = 1}^{P}, {ϕ_{k}}_{k = 1}^{K}, {m^{i}}_{i = 1}^{C}, {{z_{y_{j}}^{i}}_{j = 1}^{P}}_{i = 1}^{C}, r_{y}, s_{y}, {λ_{y}^{i}}_{i = 1}^{C}}$

where $α^{i} \in R^{K}$ $(K ⩽ P)$ is uncertain characterized by the following Gaussian membership function

$μ (α^{i}) = \exp (- \frac{1}{2} {(α^{i})}^{T} α^{i});$

$G_{j} = {[G_{j 1} \dots G_{jK}]}^{T} \in R^{K}$ is uncertain characterized by the following Gaussian membership function

$μ (G_{j}; {\emptyset_{k}}_{k = 1}^{K}) = \exp (- \frac{1}{2} \sum_{k = 1}^{K} {(G_{jk})}^{2} \emptyset_{k}), \emptyset_{k} > 0$

$\emptyset_{k} > 0$ is uncertain characterized by the following Gamma membership function:

$μ (\emptyset_{k}; α_{\emptyset}, b_{\emptyset}) = {(\frac{b_{\emptyset}}{α_{\emptyset} - 1})}^{α_{\emptyset} - 1} \exp (α_{\emptyset} - 1) {(\emptyset_{k})}^{α_{\emptyset} - 1} \exp (- b_{\emptyset} \emptyset_{k}), α_{\emptyset} ⩾ 1, b_{\emptyset} > 0;$

$m^{i} = {[m_{1}^{i} \dots m_{P}^{i}]}^{T} \in R^{P}$ is uncertain characterized by the following Gaussian membership function:

$μ (m^{i}; m_{o}^{i}, Λ_{o}^{i}) = \exp (- \frac{1}{2} {(m^{i} - m_{o}^{i})}^{T} Λ_{o}^{i} (m^{i} - m_{o}^{i})), m_{o}^{i} \in R^{K}, Λ_{o}^{i} > 0;$

$z_{yj}^{i} > 0$ is uncertain scalar characterized by the following Gamma membership function:

$μ (z_{yj}^{i}; r_{y}, s_{y}) = {(s_{y})}^{r_{y}} \exp (r_{y}) {(z_{yj}^{i})}^{r_{y}} \exp (- r_{y} s_{y} z_{yj}^{i}), r_{y} ⩾ 1, s_{y} > 0;$

$r_{y}$ is uncertain characterized by the following Gamma membership function:

$μ (r_{y}; a_{ry}, b_{ry}) = {(\frac{b_{r_{y}}}{a_{r_{y}} - 1})}^{a_{ry} - 1} \exp (a_{r_{y}} - 1) {(r_{y})}^{a_{r_{y}} - 1} \exp (- b_{r_{y}} r_{y}), a_{r_{y}} ⩾ 1, b_{r_{y}} > 0;$

$s_{y}$ is uncertain characterized by the following Gamma membership function:

$μ (s_{y}; a_{s_{y}}, b_{s_{y}}) = {(\frac{b_{s_{y}}}{a_{s_{y}} - 1})}^{a_{ry} - 1} \exp (a_{s_{y}} - 1) {(S_{y})}^{a_{s_{y}} - 1} \exp (- b_{s_{y}} s_{y}), a_{s_{y}} ⩾ 1, b_{s_{y}} > 0;$

$λ_{y}^{i} > 0$ is uncertain scalar characterized by the following Gamma membership function:

$μ (λ_{y}^{i}; a_{λ_{y}}, b_{λ_{y}}) = {(\frac{b_{λ_{y}}}{a_{λ_{y}} - 1})}^{a_{λ y} - 1} \exp (a_{λ_{y}} - 1) {(λ_{y}^{i})}^{a_{λ_{y}} - 1} \exp (- b_{λ_{y}} λ_{y}^{i}), a_{λ_{y}} ⩾ 1, b_{λ_{y}} > 0 .$

3. Analytical optimization of mixture of uncertain signal models

Given N data samples, ${y^{n}}_{n = 1}^{N}$ , the aim is to define the multivariate fuzzy membership function on y in an “optimal” manner. The approach is to optimize the fuzzy membership function (defined on y by Definition 1) with respect to ${π_{i}}_{i = 1}^{C}$ while taking into account the uncertainties of the parameters represented $z_{yj}^{i}$ by set Ω. To take into account the uncertainties of the parameters represented by the set Ω, the “optimal” membership functions on the parameters must be first determined. For this, assume that $q (α^{i})$ , $q (G_{j})$ , $q (\emptyset_{k})$ , $q (m^{i})$ , $q (z_{yj}^{i})$ , $q (r_{y})$ , $q (s_{y})$ , and $q (λ_{y}^{i})$ are arbitrary fuzzy membership functions on $α^{i}$ , $G_{j}$ , $\emptyset_{k}$ , $m^{i}$ ,, $r_{y}$ , $s_{y}$ and $λ_{y}^{i}$ respectively. Define a function, q(Ω), as follows

q (Ω) = \{\prod_{i = 1}^{C} q (α^{i})\} \{\prod_{j = 1}^{P} q (G_{j})\} \{\prod_{k = 1}^{K} q (\emptyset_{k})\} \{\prod_{i = 1}^{C} q (m^{i})\} \{\prod_{i = 1}^{C} \prod_{j = 1}^{P} q (z_{yj}^{i})\} q (r_{y}) q (s_{y}) \{\prod_{i = 1}^{C} q (λ_{y}^{i})\}

Define a differential functional, $\partial Ω$ , as follows

\partial Ω = \{\prod_{i = 1}^{C} \partial α^{i}\} \{\prod_{j = 1}^{P} \partial G_{j}\} \{\prod_{k = 1}^{K} \partial \emptyset_{k}\} \{\prod_{i = 1}^{C} \partial m^{i}\} \{\prod_{i = 1}^{C} \prod_{j = 1}^{P} \partial z_{yj}^{i}\} \partial (r_{y}) \partial (s_{y}) {\prod_{i = 1}^{C} \partial λ_{y}^{i}}

Define a differential functional, $μ (Ω)$ , as follows

μ (Ω) = \{\prod_{i = 1}^{C} μ (α^{i})\} \{\prod_{j = 1}^{P} μ (G_{j}); {\emptyset_{k}}_{k = 1}^{K}\} \{\prod_{K = 1}^{K} μ (\emptyset_{K}; a_{\emptyset}, b_{\emptyset})\} \{\prod_{i = 1}^{C} μ (m^{i}; m_{o}, Λ_{o})\} \times \{\prod_{i = 1}^{C} \prod_{j = 1}^{P} μ (z_{yj}^{i}; r_{y}, s_{y})\} μ (r_{y}; a_{r_{y}}, b_{r_{y}}) μ (s_{y}; a_{s_{y}}, b_{s_{y}}) {\prod_{i = 1}^{C} μ (λ_{y}^{i}; a_{λ_{y}}, b_{λ_{y}})}

The optimization process maximizes an objective functional, $F$ , defined as

F ({{π_{i}^{n}}_{i = 1}^{C}}_{n = 1}^{N}), q (Ω) = \frac{1}{\int \partial Ω q (Ω)} \int \partial Ω q (Ω) \frac{\sum_{n = 1}^{N} \log (μ (y^{n}; {π_{i}^{n}}_{i = 1}^{C}, Ω))}{N} - \frac{1}{\int \partial Ω q (Ω)} \int \partial Ω q (Ω) \log (\frac{q (Ω)}{μ (Ω)}) - \frac{1}{N} \sum_{n = 1}^{N} \sum_{i = 1}^{C} π_{i}^{n} \log (\frac{π_{i}^{n}}{π_{i}^{o}})

(2)

$F$ is maximized with respect to $q (α^{i})$ , $q (G_{j})$ , $q (\emptyset_{k})$ , $q (m^{i})$ , $q (z_{yj}^{i})$ , $q (r_{y})$ , $q (s_{y})$ , and $q (λ_{y}^{i})$ and ${π_{i = 1}^{n}}_{i = 1}^{C}$ under the following constraints:

1.
Fixed Integral Constraints on Membership Functions: $\int \partial α^{i} q (α^{i} = k_{α^{i}} > 0),$
$\int \partial G_{j} q (G_{j}) = k_{G_{j}} > 0, \int \partial \emptyset_{k} q (\emptyset_{k}) = k_{\emptyset_{k}} > 0, \int \partial m^{i} q (m^{i}) = k_{m^{i}} > 0,$

$\int \partial z_{yj}^{i} q (z_{yj}^{i}) = k_{z_{yj}^{i}} > 0, \int \partial r_{y} q (r_{y}) = k_{r_{y}} > 0, \int \partial s_{y} q (s_{y}) = k_{s_{y}} > 0, \int \partial λ_{y}^{i} q (λ_{y}^{i}) = k_{λ_{y}^{i}} > 0 .$
2.
Unity Maximum Value Constraints on Membership Functions: The values of $k_{α^{i}}, k_{G_{j}}, k_{\emptyset_{k}}, k_{m^{i}}, k_{z_{yj}^{i}}, k_{r_{y}}, k_{s_{y}}$ , and $k_{λ_{y}^{i}}$ are so chosen such that maximum value of $q (α^{i})$ , $q (G_{j})$ , $q (\emptyset_{k})$ , $q (m^{i})$ , $q (z_{yj}^{i})$ , $q (r_{y})$ , $q (s_{y})$ , and $q (λ_{y}^{i})$ is equal to one.
3.
Unity Sum Constraint on Mixing Proportions: $\sum_{i = 1}^{C} π_{i}^{n} = 1, π_{i}^{n} \in [0, (1])$ .

The first term of $F$ computes the averaged log-membership value of data samples when the average is taken over uncertain parameters Ω being modeled by membership function $q (Ω)$ . The second term of $F$ regularizes the maximization problem toward initial guess $μ (Ω)$ . The third term of $F$ regularizes the estimation of $π_{i}^{n}$ toward initial guess $π_{i}^{o}$ .

Result 1

The analytical expressions for variational membership functions, that maximize $F$ under Fixed Integral and Unity Maximum Value Constrains, are

$\begin{matrix} q^{*} (α^{i}) = \exp (- \frac{1}{2} {(α^{i} - {\hat{m}}_{α^{i}})}^{T} {\hat{Λ}}_{α^{i}} (α^{i} - {\hat{m}}_{α^{i}})), \\ {\hat{Λ}}_{α^{i}} = I + \sum_{n = 1}^{N} \sum_{j = 1}^{P} \frac{{\hat{π}}_{i}^{n}}{N} \frac{{\hat{a}}_{λ_{y}^{i}}}{{\hat{b}}_{λ_{y}^{i}}} \frac{{\hat{a}}_{z_{y_{j}}^{i}}}{{\hat{b}}_{z_{y_{j}}^{i}}} ({\hat{m}}_{G_{j}} {({\hat{m}}_{G_{j}})}^{T} + {({\hat{Λ}}_{G_{j}})}^{- 1}) \end{matrix}$ (3)

$\begin{matrix} {\hat{m}}_{α^{i}} = {({\hat{Λ}}_{α^{i}})}^{- 1} \{\sum_{n = 1}^{N} \sum_{j = 1}^{P} \frac{{\hat{π}}_{i}^{n}}{N} \frac{{\hat{a}}_{λ_{y}^{i}}}{{\hat{b}}_{λ_{y}^{i}}} \frac{{\hat{a}}_{z_{y_{j}}^{i}}}{{\hat{b}}_{z_{y_{j}}^{i}}} (y_{j}^{n} - I_{j}^{P} {\hat{m}}_{m^{i}}) {\hat{m}}_{G_{j}}\} \\ q^{*} (G_{j}) = \exp (- \frac{1}{2} {(G_{j} - {\hat{m}}_{G_{j}})}^{T} {\hat{Λ}}_{G_{j}} (G_{j} - {\hat{m}}_{G_{j}})), \end{matrix}$ (4)

Open in a new tab
$\begin{matrix} q^{*} (ϕ_{k}) = {(\frac{{\hat{b}}_{ϕ_{k}}}{{\hat{a}}_{ϕ_{k}} - 1})}^{{\hat{a}}_{ϕ_{k}} - 1} \exp ({\hat{a}}_{ϕ_{k}} - 1) {(ϕ_{k})}^{{\hat{a}}_{ϕ_{k}} - 1} \exp (- {\hat{b}}_{ϕ_{k}} ϕ_{k}), \\ {\hat{a}}_{ϕ_{k}} = a_{ϕ} \end{matrix}$ (7)

$\begin{matrix} {\hat{b}}_{ϕ_{k}} = b_{ϕ} + \frac{1}{2} \sum_{j = 1}^{P} {{(I_{k}^{K} {\hat{m}}_{G_{j}})}^{2} + Tr ({({\hat{Λ}}_{G_{j}})}^{- 1} {(I_{k}^{K})}^{T} I_{k}^{K})} \\ q^{*} (m^{i}) = \exp (- \frac{1}{2} {(m^{i} - {\hat{m}}_{m^{i}})}^{T} {\hat{Λ}}_{m^{i}} (m^{i} - {\hat{m}}_{m^{i}})), \end{matrix}$ (8)

Open in a new tab
$\begin{matrix} q^{*} (λ_{y}^{i}) = {(\frac{{\hat{b}}_{λ_{y}^{i}}}{{\hat{a}}_{λ_{y}^{i}} - 1})}^{{\hat{a}}_{λ_{y}^{i}} - 1} \exp ({\hat{a}}_{λ_{y}^{i}} - 1) {(λ_{y}^{i})}^{{\hat{a}}_{λ_{y}^{i}} - 1} \exp (- {\hat{b}}_{λ_{y}^{i}} λ_{y}^{i}) . \\ {\hat{a}}_{λ_{y}^{i}} = a_{λ_{y}} \end{matrix}$ (11)

$\begin{matrix} {\hat{b}}_{λ_{y}^{i}} = b_{λ_{y}} + \frac{1}{2} \sum_{n = 1}^{N} \sum_{j = 1}^{P} \frac{{\hat{π}}_{i}^{n}}{N} \frac{{\hat{a}}_{z_{y_{j}}^{i}}}{{\hat{b}}_{z_{y_{j}}^{i}}} ({(y_{j}^{n} - I_{j}^{P} {\hat{m}}_{m^{i}} - {({\hat{m}}_{G_{j}})}^{T} {\hat{m}}_{α^{i}})}^{2} + Tr ({({\hat{Λ}}_{m^{i}})}^{- 1} {(I_{j}^{P})}^{T} I_{j}^{P}) + {({\hat{m}}_{α^{i}})}^{T} {({\hat{Λ}}_{G_{j}})}^{- 1} {\hat{m}}_{α^{i}} + {({\hat{m}}_{G_{j}})}^{T} {({\hat{Λ}}_{α^{i}})}^{- 1} {\hat{m}}_{G_{j}} + Tr ({({\hat{Λ}}_{G_{j}})}^{- 1} {({\hat{Λ}}_{α^{i}})}^{- 1})) \end{matrix}$ (12)

$\begin{matrix} q^{*} (z_{y_{j}}^{i}) = {(\frac{{\hat{b}}_{z_{y_{j}}^{i}}}{{\hat{a}}_{z_{y_{j}}^{i}} - 1})}^{{\hat{a}}_{z_{y_{j}}^{i}} - 1} \exp ({\hat{a}}_{z_{y_{j}}^{i}} - 1) {(z_{y_{j}}^{i})}^{{\hat{a}}_{z_{y_{j}}^{i}} - 1} \exp (- {\hat{b}}_{z_{y_{j}}^{i}} z_{y_{j}}^{i}) . \\ {\hat{a}}_{z_{y_{j}}^{i}} = \frac{{\hat{a}}_{r_{y}}}{{\hat{b}}_{r_{y}}} + 1 \end{matrix}$ (13)

$\begin{matrix} {\hat{b}}_{z_{y_{j}}^{i}} = \frac{{\hat{a}}_{r_{y}}}{{\hat{b}}_{r_{y}}} \frac{{\hat{a}}_{s_{y}}}{{\hat{b}}_{s_{y}}} + \frac{1}{2} \sum_{n = 1}^{N} \frac{{\hat{π}}_{i}^{n}}{N} \frac{{\hat{a}}_{λ_{y}^{i}}}{{\hat{b}}_{λ_{y}^{i}}} ({(y_{j}^{n} - I_{j}^{P} {\hat{m}}_{m^{i}} - {({\hat{m}}_{G_{j}})}^{T} {\hat{m}}_{α^{i}})}^{2} + Tr ({({\hat{Λ}}_{m^{i}})}^{- 1} {(I_{j}^{P})}^{T} I_{j}^{P}) + {({\hat{m}}_{α^{i}})}^{T} {({\hat{Λ}}_{G_{j}})}^{- 1} {\hat{m}}_{α^{i}} + {({\hat{m}}_{G_{j}})}^{T} {({\hat{Λ}}_{α^{i}})}^{- 1} {\hat{m}}_{G_{j}} + Tr ({({\hat{Λ}}_{G_{j}})}^{- 1} {({\hat{Λ}}_{α^{i}})}^{- 1})) \end{matrix}$ (14)

$f_{i}^{n} = - \frac{1}{2} \sum_{j = 1}^{P} \frac{{\hat{a}}_{λ_{y}^{i}}}{{\hat{b}}_{λ_{y}^{i}}} \frac{{\hat{a}}_{z_{y_{j}}^{i}}}{{\hat{b}}_{z_{y_{j}}^{i}}} ({(y_{j}^{n} - I_{j}^{P} {\hat{m}}_{m^{i}} - {({\hat{m}}_{G_{j}})}^{T} {\hat{m}}_{α^{i}})}^{2} + Tr ({({\hat{Λ}}_{m^{i}})}^{- 1} {(I_{j}^{P})}^{T} I_{j}^{P}) + {({\hat{m}}_{α^{i}})}^{T} {({\hat{Λ}}_{G_{j}})}^{- 1} {\hat{m}}_{α^{i}} + {({\hat{m}}_{G_{j}})}^{T} {({\hat{Λ}}_{α^{i}})}^{- 1} {\hat{m}}_{G_{j}} + Tr ({({\hat{Λ}}_{G_{j}})}^{- 1} {({\hat{Λ}}_{α^{i}})}^{- 1}))$ (15)

${\hat{π}}_{i}^{n} = \frac{π_{i}^{o} \exp (f_{i}^{n})}{\sum_{i = 1}^{C} π_{i}^{o} \exp (f_{i}^{n})} .$ (16)

$\begin{matrix} q^{*} (r_{y}) = {(\frac{{\hat{b}}_{r_{y}}}{{\hat{a}}_{r_{y}} - 1})}^{{\hat{a}}_{r_{y}} - 1} \exp ({\hat{a}}_{r_{y}} - 1) {(r_{y})}^{{\hat{a}}_{r_{y}} - 1} \exp (- {\hat{b}}_{r_{y}} r_{y}), \\ {\hat{a}}_{r_{y}} = a_{r_{y}} \end{matrix}$ (17)

${\hat{b}}_{r_{y}} = b_{r_{y}} + \frac{{\hat{a}}_{s_{y}}}{{\hat{b}}_{s_{y}}} \sum_{i = 1}^{C} \sum_{j = 1}^{P} \frac{{\hat{a}}_{z_{y_{j}}^{i}}}{{\hat{b}}_{z_{y_{j}}^{i}}} - CP {ψ ({\hat{a}}_{s_{y}}) - \log ({\hat{b}}_{s_{y}})} - CP - \sum_{i = 1}^{C} \sum_{j = 1}^{P} {ψ ({\hat{a}}_{z_{y_{j}}^{i}}) - \log ({\hat{b}}_{z_{y_{j}}^{i}})}$ (18)

$\begin{matrix} q^{*} (s_{y}) = {(\frac{{\hat{b}}_{s_{y}}}{{\hat{a}}_{s_{y}} - 1})}^{{\hat{a}}_{s_{y}} - 1} \exp ({\hat{a}}_{s_{y}} - 1) {(s_{y})}^{{\hat{a}}_{s_{y}} - 1} \exp (- {\hat{b}}_{s_{y}} s_{y}), \\ {\hat{a}}_{s_{y}} = a_{s_{y}} + CP \frac{{\hat{a}}_{r_{y}}}{{\hat{b}}_{r_{y}}} \end{matrix}$ (19)

${\hat{b}}_{s_{y}} = b_{s_{y}} + \frac{{\hat{a}}_{r_{y}}}{{\hat{b}}_{r_{y}}} \sum_{i = 1}^{C} \sum_{j = 1}^{P} \frac{{\hat{a}}_{z_{y_{j}}^{i}}}{{\hat{b}}_{z_{y_{j}}^{i}}}$ (20)

Once the membership functions representing the uncertainties on the parameters have been optimally determined, the optimal multivariate fuzzy membership function on y = [y₁ ⋯ y_P]^T ∈ RP is defined by averaging over the uncertainties such that

$μ^{*} (y) \propto \exp < \log (μ (y; {π_{i}}_{i = 1}^{C}, Ω)) >_{q^{*} (Ω)}$

where

$π_{i} = \frac{π_{i}^{o} \exp (f_{i})}{\sum_{i = 1}^{C} π_{i}^{o} \exp (f_{i})}$

$f_{i} = - \frac{1}{2} \sum_{j = 1}^{P} \frac{{\hat{a}}_{λ_{y}^{i}}}{{\hat{b}}_{λ_{y}^{i}}} \frac{{\hat{a}}_{z_{y_{j}}^{i}}}{{\hat{b}}_{z_{y_{j}}^{i}}} ({(y_{j} - I_{j}^{P} {\hat{m}}_{m^{i}} - {({\hat{m}}_{G_{j}})}^{T} {\hat{m}}_{α^{i}})}^{2} + Tr ({({\hat{Λ}}_{m^{i}})}^{- 1} {(I_{j}^{P})}^{T} I_{j}^{P}) + {({\hat{m}}_{α^{i}})}^{T} {({\hat{Λ}}_{G_{j}})}^{- 1} {\hat{m}}_{α^{i}} + {({\hat{m}}_{G_{j}})}^{T} {({\hat{Λ}}_{α^{i}})}^{- 1} {\hat{m}}_{G_{j}} + Tr ({({\hat{Λ}}_{G_{j}})}^{- 1} {({\hat{Λ}}_{α^{i}})}^{- 1})) .$

After evaluating the integral, $〈 \log (μ (y; {π_{i}}_{i = 1}^{C}, Ω)) 〉_{q^{*} (Ω)}$ , the expression of the optimal membership function on y is as follows:

$\log (μ^{*} (y)) \propto - \frac{1}{2} \sum_{i = 1}^{C} \sum_{j = 1}^{P} π_{i} \frac{{\hat{a}}_{λ_{y}^{i}}}{{\hat{b}}_{λ_{y}^{i}}} \frac{{\hat{a}}_{z_{y_{j}}^{i}}}{{\hat{b}}_{z_{y_{j}}^{i}}} ({(y_{j} - I_{j}^{P} {\hat{m}}_{m^{i}} - {({\hat{m}}_{G_{j}})}^{T} {\hat{m}}_{α^{i}})}^{2} + Tr ({({\hat{Λ}}_{m^{i}})}^{- 1} {(I_{j}^{P})}^{T} I_{j}^{P}) + {({\hat{m}}_{α^{i}})}^{T} {({\hat{Λ}}_{G_{j}})}^{- 1} {\hat{m}}_{α^{i}} + {({\hat{m}}_{G_{j}})}^{T} {({\hat{Λ}}_{α^{i}})}^{- 1} {\hat{m}}_{G_{j}} + Tr ({({\hat{Λ}}_{G_{j}})}^{- 1} {({\hat{Λ}}_{α^{i}})}^{- 1})) .$

Finally, the constant of proportionality is chosen equal to one resulting in

$μ^{*} (y) = \exp (- \frac{1}{2} \sum_{i = 1}^{C} \sum_{j = 1}^{P} π_{i} \frac{{\hat{a}}_{λ_{y}^{i}}}{{\hat{b}}_{λ_{y}^{i}}} \frac{{\hat{a}}_{z_{y_{j}}^{i}}}{{\hat{b}}_{z_{y_{j}}^{i}}} \{{(y_{j} - I_{j}^{P} {\hat{m}}_{m^{i}} - {({\hat{m}}_{G_{j}})}^{T} {\hat{m}}_{α^{i}})}^{2} + Tr ({({\hat{Λ}}_{m^{i}})}^{- 1} {(I_{j}^{P})}^{T} I_{j}^{P}))) ((+ {({\hat{m}}_{α^{i}})}^{T} {({\hat{Λ}}_{G_{j}})}^{- 1} {\hat{m}}_{α^{i}} + {({\hat{m}}_{G_{j}})}^{T} {({\hat{Λ}}_{α^{i}})}^{- 1} {\hat{m}}_{G_{j}} + Tr ({({\hat{Λ}}_{G_{j}})}^{- 1} {({\hat{Λ}}_{α^{i}})}^{- 1})\}) .$ (21)

4. An Algorithm for multivariate data modeling

4.1. Algorithm

The analytical solution to mixture of uncertain signal models, derived in section (3), lends itself to Algorithm 1 for the modeling of multivariate data samples by determining membership functions on all of the variables and parameters. Algorithm 1 suggests to choose initial values of parameters based on k-means clustering and eigenvalue decomposition of sample covariance matrix.

Remark 1 (Complexity and Iterations) Algorithm 1 is based on the invoking of parameters updating rules (3–20). The time complexity of the algorithm, as a result of computing the inverse of a P × P sized matrix in update rule (10), is O(P 3). Algorithm 1, after initializing the parameters, invokes a single iteration of parameters updating rules. Thanks to the analytically derived solution due to which a single iteration is sufficient for parameters to nearly converge after initializing the parameters carefully. However, the optimal values of C and K are determined by maximizing the average fuzzy membership value of the data samples through repeated application of update rules.

Remark 2 (Free parameter β in Algorithm 1) Algorithm 1 has only single free parameter, β ∈ [0, 0.5], to be chosen by the user. The maximum possible number of signal models in the mixture, Cmax, depends on the value of β. It will be demonstrated through experiments that algorithm’s performance is not highly sensitive to the choice of β.

4.2. Data distribution modeling

The application of Algorithm 1 on given data samples ${y^{n}}_{n = 1}^{N}$ results in the determination of Copt different fuzzy membership functions on unobserved variable m which (membership functions) are defined as

μ^{i} (m; {\hat{m}}_{m^{i}}, {\hat{Λ}}_{m^{i}}) = \exp (- \frac{1}{2} {(m - {\hat{m}}_{m^{i}})}^{T} {\hat{Λ}}_{m^{i}} (m - {\hat{m}}_{m^{i}})), \forall i \in {1, \dots, C_{opt}} .

Let $M$ be the set of parameters returned by Algorithm, i.e., $M = {({\hat{m}}_{m^{i}}, {\hat{Λ}}_{m^{i}})}_{i = 1}^{C_{opt}}$ . Finally, a data model, constructed from ${y^{n}}_{n = 1}^{N}$ using Algorithm, is represented by a fuzzy membership function defined as

μ (y; M) = \max_{1 ⩽ i ⩽ C_{opt}} \{\exp (- \frac{1}{2} {(y - {\hat{m}}_{m^{i}})}^{T} {\hat{Λ}}_{m^{i}} (y - {\hat{m}}_{m^{i}}))\} .

(22)

4.3. Classification

The data modeling capability of functional $μ (m; M)$ can be exploited for the classification purpose. If $M^{1}, \dots, M^{S}$ are S different sets returned by Algorithm corresponding to the data samples of S different classes, then the class-label associated to a vector y could be predicted as

pred_label (y) = \arg \max_{1 ⩽ s ⩽ S} μ (y; M^{s})

(23)

4.4. Demonstrations on Toy data sets

Fig.3 shows an example of the 2-dimensional data samples and a display of the fuzzy membership function $μ (y; M)$ (calculated using (22)) over the data space. As depicted in Fig.3, the distribution of the samples ${y^{n}}_{n = 1}^{N}$ in P-dimensional space is modeled by the fuzzy membership function $μ (y; M)$ . Stochastic mixture models have been extensively studied in the literature and are typically used to learn data distributions. The most commonly used Gaussian mixture models(GMM) fit the given data samples by assuming that each data sample has been generated by a stochastic mixture of a finite number of the Gaussian distributions. “Expectation Maximization” algorithm is typically used for the learning of the Gaussian mixture models from data samples where the number of components in the mixture can be efficiently selected using the Bayesian information criterion (BIC). There may arise the situations when GMM don’t give favorable results. Fig.4(a) is an example of data samples where better performance of Algorithm 1 than GMM (together with BIC) is observed. A comparison between color plots of GMM based likelihood (displayed inFig.4(b)) andAlgorithm 1 based fuzzy membership function (displayed in Fig.4(c)) demonstrates the effectiveness of Algorithm 1 in modeling the distribution of data samples.

An example of the model learned from 2-dimensional data samples using Algorithm 1 (with β = 0.5).

An example of the comparison between the Gaussian mixture models and Algorithm 1 (with β = 0.5).

5. Heartbeat intervals classification

The section applies the proposed methodology on the experimentally recorded heartbeat intervals (referred to as the R-R intervals) of 20 different subjects while they were performing two different types of tasks in a chemical laboratory of Zhejiang University. One task involved manual pipetting of the chemical solutions while the other task involved working with the computer. The aim is to classify heartbeat intervals of a subject between the two tasks. The P-dimensional data samples were created from the sequence of R-R intervals as(see Table 1)

Y^{i} = {[{RR}_{i - P + 1} \dots {RR}_{i}]}^{T}

where RR_i is ith heartbeat interval. The R-R intervals corresponding to the first half of the task duration serve as the training data and that of second half as testing data. Table 2 lists the median of classification accuracy over 20 subjects, obtained on testing data by different classification methods, for different values of data dimension P. The better classification accuracy of the analytical fuzzy approach in Table 2 supports the arguments that proposed approach could be an effective tool for modeling and analysis of biomedical data.

Table 1.

A comparison of different classification algorithms with the proposed method in term of classification accuracy on testing data.

Method	Dataset 1	Dataset 2	Dataset 3
Nearest neighbors	100%	100%	75%
Linear SVM	91%	46%	51%
RBF SVM	90%	100%	59%
Decision tree	98%	100%	80%
Random forest	98%	100%	73%
AdaBoost	93%	97%	80%
Naive Bayes	92%	97%	57%
LDA	90%	29%	52%
QDA	90%	96%	57%

Analytical fuzzy (β = 0.5)	100%	100%	82%

Open in a new tab

Table 2.

A The median accuracy (in %) of different algorithms in classifying the testing heartbeat intervals between two tasks performed by subjects.

Method	Median of % accuracy (P = 2) % accuracy (P = 2)	Median of % accuracy (P = 4)	Median of % accuracy (P = 6)	Median of % accuracy (P = 8)
Nearest neighbors	87.11	90.33	91.08	92.65
Linear SVM	87.11	89.24	90.64	91.58
RBF SVM	84.07	84.17	86.99	90.11
Decision tree	84.95	87.22	88.83	89.57
Random forest	86.75	88.93	90.84	92.51
AdaBoost	88.36	90.72	91.87	92.60
Naive Bayes	87.40	89.27	91.05	92.18
LDA	88.67	90.70	91.59	92.99
QDA	88.04	88.46	90.08	90.97

Analytical fuzzy (β = 0)	88.75	91.16	92.14	93.14

Open in a new tab

6. Concluding remarks

The theoretical contribution of this work is to propose an analytical fuzzy approach that provides a principled basis for determining the fuzzy membership functions to handle uncertainties in a modeling problem. The theoretical results form the basis for designing an algorithm that results in an efficient modeling of the data distribution in multi-parametric space. The analytically derived expressions for fuzzy membership functions for representing uncertainties associated with biomedical data should facilitate a system theoretic approach to mathematically design the medical expert systems.

Footnotes

Peer review under responsibility of King Saud University.

Contributor Information

Yihua Mao, Email: maoyihua@zjubh.com.

Mohit Kumar, Email: mohit.kumar@zjubh.com.

References

Alcala R., Ducange P., Herrera F., Lazzerini B., Marcelloni F. A multiobjective evolutionary approach to concurrently learn rule and data bases of linguistic fuzzy-rule- based systems. IEEE Trans. Fuzzy Syst. 2009;17(5):1106–1122. [Google Scholar]
Aliasghary M., Arghavani N. 2012. H∞ estimation for optimization of rational-powered membership functions; pp. 251–256. (2012 IEEE 13th International Symposium on Computational Intelligence and Informatics (CINTI)). [Google Scholar]
Antonelli M., Ducange P., Marcelloni F. Genetic training instance selection in multiobjective evolutionary fuzzy systems: a coevolutionary approach. IEEE Trans. Fuzzy Syst. 2012;20(2):276–290. [Google Scholar]
Au W.H., Chan K., Wong A.K. A fuzzy approach to partitioning continuous attributes for classification. IEEE Trans. Knowl. Data Eng. 2006;18(5):715–719. [Google Scholar]
Celikyilmaz A., Turksen I. Enhanced fuzzy system models with improved fuzzy clustering algorithm. IEEE Trans. Fuzzy Syst. 2008;16(3):779–794. [Google Scholar]
Chen L., Chen C. ISIC; 2007. Pre-shaped fuzzy c-means algorithm (pfcm) for transparent membership function generation; pp. 789–794. (IEEE International Conference on Systems, Man and Cybernetics). [Google Scholar]
Cococcioni M., Lazzerini B., Marcelloni F. On reducing computational overhead in multi-objective genetic takagi-sugeno fuzzy systems. Appl. Soft Comput. 2011;11(1):675–688. [Google Scholar]
Fan C.Y., Chang P.C., Lin J.J., Hsieh J. A hybrid model combining case-based reasoning and fuzzy decision tree for medical data classification. Appl. Soft Comput. 2011;11(1):632–644. [Google Scholar]
Gacto M., Alcala R., Herrera F. Integration of an index to preserve the semantic interpretability in the multiobjective evolutionary rule selection and tuning of linguistic fuzzy systems. IEEE Trans. Fuzzy Syst. 2010;18(3):515–531. [Google Scholar]
Gadaras I., Mikhailov L. An interpretable fuzzy rule-based classification methodology for medical diagnosis. Artif. Intell. Med. 2009;47(1):25–41. doi: 10.1016/j.artmed.2009.05.003. [DOI] [PubMed] [Google Scholar]
Kumar M., Stoll R., Stoll N. Deterministic approach to robust adaptive learning offuzzy models. IEEE Trans. Syst. Man Cybern. B Cybern. 2006;36(4):767–780. doi: 10.1109/tsmcb.2006.870625. [DOI] [PubMed] [Google Scholar]
Kumar M., Stoll N., Kaber D., Thurow K., Stoll R. 2007. Fuzzy filtering for an intelligent interpretation of medical data; pp. 225–230. (Proc. IEEE International Conference on Automation Science and Engineering (CASE 2007), Scottsdale, Arizona USA). [Google Scholar]
Kumar M., Weippert M., Vilbrandt R., Kreuzfeld S., Stoll R. Fuzzy evaluation of heart rate signals for mental stress assessment. IEEE Trans. Fuzzy Syst. 2007;15(5):791–808. [Google Scholar]
Kumar M., Arndt D., Kreuzfeld S., Thurow K., Stoll N., Stoll R. Fuzzy techniques for subjective workload score modelling under uncertainties. IEEE Trans. Syst. Man Cybern. Part B Cybern. 2008;38(6):1449–1464. doi: 10.1109/TSMCB.2008.927712. [DOI] [PubMed] [Google Scholar]
Kumar M., Stoll N., Stoll R. Adaptive fuzzy filtering in a deterministic setting. IEEE Trans. Fuzzy Syst. 2009;17(4):763–776. [Google Scholar]
Kumar M., Stoll N., Stoll R. On the estimation of parameters of takagi-sugeno fuzzy filters. IEEE Trans. Fuzzy Syst. 2009;17(1):150–166. [Google Scholar]
Kumar M., Weippert M., Arndt D., Kreuzfeld S., Thurow K., Stoll N., Stoll R. Fuzzy filtering for physiological signal analysis. IEEE Trans. Fuzzy Syst. 2010;18(1):208–216. [Google Scholar]
Kumar M., Weippert M., Stoll N., Stoll R. A mixture of fuzzy filters applied to the analysis of heartbeat intervals. Fuzzy Optim. Decis. Making. 2010;9(4):383–412. [Google Scholar]
Kumar M., Neubert S., Behrendt S., Rieger A., Weippert M., Stoll N., Thurow K., Stoll R. Stress monitoring based on stochastic fuzzy analysis of heartbeat intervals. IEEE Trans. Fuzzy Syst. 2012;20(4):746–759. [Google Scholar]
Kumar M., Stoll N., Thurow K., Stoll R. 2012. Physiological signals to individual assessment for application in wireless health systems; pp. 1–6. (Proc. 9th International Multi-Conference on Systems, Signals and Devices (SSD)). [Google Scholar]
Kumar M., Stoll N., Stoll R., Thurow K. A stochastic framework for robust fuzzy filtering and analysis of signals-part i. IEEE Trans. Cybern. 2016;46(5):1118–1131. doi: 10.1109/TCYB.2015.2423657. [DOI] [PubMed] [Google Scholar]
Kumar M., Stoll N., Stoll R., Thurow K. Variational optimization of fuzzy membership functions. Artif. Intell. Under-Rev. 2016 [Google Scholar]
Liao T.W., Celmins A.K., Hammell R.J. A fuzzy c-means variant for the generation of fuzzy term sets. Fuzzy Sets Syst. 2003;135(2):241–257. [Google Scholar]
Makrehchi M., Basir O., Kamel M. Generation of fuzzy membership function using information theory measures and genetic algorithm. In: Bilgiç T., De Baets B., Kaynak O., editors. vol. 2715. Springer; Berlin Heidelberg: 2003. pp. 603–610. (Fuzzy Sets and Systems – IFSA 2003, Lecture Notes in Computer Science). [Google Scholar]
Mottaghi-Kashtiban M., Khoei A., Hadidi K. Optimization of rational-powered membership functions using extended kalman filter. Fuzzy Sets Syst. 2008;159(23):3232–3244. [Google Scholar]
Nguyen T., Khosravi A., Creighton D., Nahavandi S. Medical data classification using interval type-2 fuzzy logic system and wavelets. Appl. Soft Comput. 2015;30:812–822. [Google Scholar]
Oh S.K., Pedrycz W., Park H.S. Hybrid identification in fuzzy-neural networks. Fuzzy Sets Syst. 2003;138(2):399–426. [Google Scholar]
Papageorgiou E.I. A new methodology for decisions in medical informatics using fuzzy cognitive maps based on fuzzy rule-extraction techniques. Appl. Soft Comput. 2011;11(1):500–513. [Google Scholar]
Pulkkinen P., Koivisto H. A dynamically constrained multiobjective genetic fuzzy system for regression problems. IEEE Trans. Fuzzy Syst. 2010;18(1):161–177. [Google Scholar]
Robles I., Alcalá R., Benítez J.M., Herrera F. Evolutionary parallel and gradually distributed lateral tuning of fuzzy rule-based systems. Evol. Intell. 2009;2(1–2):5–19. [Google Scholar]
Seera M., Lim C.P. A hybrid intelligent system for medical data classification. Expert Syst. Appl. 2014;41(5):2239–2249. [Google Scholar]
Simon D. H∞ estimation for fuzzy membership function optimization. Int. J. Approximate Reasoning. 2005;40(3):224–242. [Google Scholar]

[b0005] Alcala R., Ducange P., Herrera F., Lazzerini B., Marcelloni F. A multiobjective evolutionary approach to concurrently learn rule and data bases of linguistic fuzzy-rule- based systems. IEEE Trans. Fuzzy Syst. 2009;17(5):1106–1122. [Google Scholar]

[b0010] Aliasghary M., Arghavani N. 2012. H∞ estimation for optimization of rational-powered membership functions; pp. 251–256. (2012 IEEE 13th International Symposium on Computational Intelligence and Informatics (CINTI)). [Google Scholar]

[b0015] Antonelli M., Ducange P., Marcelloni F. Genetic training instance selection in multiobjective evolutionary fuzzy systems: a coevolutionary approach. IEEE Trans. Fuzzy Syst. 2012;20(2):276–290. [Google Scholar]

[b0020] Au W.H., Chan K., Wong A.K. A fuzzy approach to partitioning continuous attributes for classification. IEEE Trans. Knowl. Data Eng. 2006;18(5):715–719. [Google Scholar]

[b0025] Celikyilmaz A., Turksen I. Enhanced fuzzy system models with improved fuzzy clustering algorithm. IEEE Trans. Fuzzy Syst. 2008;16(3):779–794. [Google Scholar]

[b0030] Chen L., Chen C. ISIC; 2007. Pre-shaped fuzzy c-means algorithm (pfcm) for transparent membership function generation; pp. 789–794. (IEEE International Conference on Systems, Man and Cybernetics). [Google Scholar]

[b0035] Cococcioni M., Lazzerini B., Marcelloni F. On reducing computational overhead in multi-objective genetic takagi-sugeno fuzzy systems. Appl. Soft Comput. 2011;11(1):675–688. [Google Scholar]

[b0040] Fan C.Y., Chang P.C., Lin J.J., Hsieh J. A hybrid model combining case-based reasoning and fuzzy decision tree for medical data classification. Appl. Soft Comput. 2011;11(1):632–644. [Google Scholar]

[b0045] Gacto M., Alcala R., Herrera F. Integration of an index to preserve the semantic interpretability in the multiobjective evolutionary rule selection and tuning of linguistic fuzzy systems. IEEE Trans. Fuzzy Syst. 2010;18(3):515–531. [Google Scholar]

[b0050] Gadaras I., Mikhailov L. An interpretable fuzzy rule-based classification methodology for medical diagnosis. Artif. Intell. Med. 2009;47(1):25–41. doi: 10.1016/j.artmed.2009.05.003. [DOI] [PubMed] [Google Scholar]

[b0055] Kumar M., Stoll R., Stoll N. Deterministic approach to robust adaptive learning offuzzy models. IEEE Trans. Syst. Man Cybern. B Cybern. 2006;36(4):767–780. doi: 10.1109/tsmcb.2006.870625. [DOI] [PubMed] [Google Scholar]

[b0060] Kumar M., Stoll N., Kaber D., Thurow K., Stoll R. 2007. Fuzzy filtering for an intelligent interpretation of medical data; pp. 225–230. (Proc. IEEE International Conference on Automation Science and Engineering (CASE 2007), Scottsdale, Arizona USA). [Google Scholar]

[b0065] Kumar M., Weippert M., Vilbrandt R., Kreuzfeld S., Stoll R. Fuzzy evaluation of heart rate signals for mental stress assessment. IEEE Trans. Fuzzy Syst. 2007;15(5):791–808. [Google Scholar]

[b0070] Kumar M., Arndt D., Kreuzfeld S., Thurow K., Stoll N., Stoll R. Fuzzy techniques for subjective workload score modelling under uncertainties. IEEE Trans. Syst. Man Cybern. Part B Cybern. 2008;38(6):1449–1464. doi: 10.1109/TSMCB.2008.927712. [DOI] [PubMed] [Google Scholar]

[b0075] Kumar M., Stoll N., Stoll R. Adaptive fuzzy filtering in a deterministic setting. IEEE Trans. Fuzzy Syst. 2009;17(4):763–776. [Google Scholar]

[b0080] Kumar M., Stoll N., Stoll R. On the estimation of parameters of takagi-sugeno fuzzy filters. IEEE Trans. Fuzzy Syst. 2009;17(1):150–166. [Google Scholar]

[b0085] Kumar M., Weippert M., Arndt D., Kreuzfeld S., Thurow K., Stoll N., Stoll R. Fuzzy filtering for physiological signal analysis. IEEE Trans. Fuzzy Syst. 2010;18(1):208–216. [Google Scholar]

[b0090] Kumar M., Weippert M., Stoll N., Stoll R. A mixture of fuzzy filters applied to the analysis of heartbeat intervals. Fuzzy Optim. Decis. Making. 2010;9(4):383–412. [Google Scholar]

[b0095] Kumar M., Neubert S., Behrendt S., Rieger A., Weippert M., Stoll N., Thurow K., Stoll R. Stress monitoring based on stochastic fuzzy analysis of heartbeat intervals. IEEE Trans. Fuzzy Syst. 2012;20(4):746–759. [Google Scholar]

[b0100] Kumar M., Stoll N., Thurow K., Stoll R. 2012. Physiological signals to individual assessment for application in wireless health systems; pp. 1–6. (Proc. 9th International Multi-Conference on Systems, Signals and Devices (SSD)). [Google Scholar]

[b0105] Kumar M., Stoll N., Stoll R., Thurow K. A stochastic framework for robust fuzzy filtering and analysis of signals-part i. IEEE Trans. Cybern. 2016;46(5):1118–1131. doi: 10.1109/TCYB.2015.2423657. [DOI] [PubMed] [Google Scholar]

[b0110] Kumar M., Stoll N., Stoll R., Thurow K. Variational optimization of fuzzy membership functions. Artif. Intell. Under-Rev. 2016 [Google Scholar]

[b0115] Liao T.W., Celmins A.K., Hammell R.J. A fuzzy c-means variant for the generation of fuzzy term sets. Fuzzy Sets Syst. 2003;135(2):241–257. [Google Scholar]

[b0120] Makrehchi M., Basir O., Kamel M. Generation of fuzzy membership function using information theory measures and genetic algorithm. In: Bilgiç T., De Baets B., Kaynak O., editors. vol. 2715. Springer; Berlin Heidelberg: 2003. pp. 603–610. (Fuzzy Sets and Systems – IFSA 2003, Lecture Notes in Computer Science). [Google Scholar]

[b0125] Mottaghi-Kashtiban M., Khoei A., Hadidi K. Optimization of rational-powered membership functions using extended kalman filter. Fuzzy Sets Syst. 2008;159(23):3232–3244. [Google Scholar]

[b0130] Nguyen T., Khosravi A., Creighton D., Nahavandi S. Medical data classification using interval type-2 fuzzy logic system and wavelets. Appl. Soft Comput. 2015;30:812–822. [Google Scholar]

[b0135] Oh S.K., Pedrycz W., Park H.S. Hybrid identification in fuzzy-neural networks. Fuzzy Sets Syst. 2003;138(2):399–426. [Google Scholar]

[b0140] Papageorgiou E.I. A new methodology for decisions in medical informatics using fuzzy cognitive maps based on fuzzy rule-extraction techniques. Appl. Soft Comput. 2011;11(1):500–513. [Google Scholar]

[b0145] Pulkkinen P., Koivisto H. A dynamically constrained multiobjective genetic fuzzy system for regression problems. IEEE Trans. Fuzzy Syst. 2010;18(1):161–177. [Google Scholar]

[b0150] Robles I., Alcalá R., Benítez J.M., Herrera F. Evolutionary parallel and gradually distributed lateral tuning of fuzzy rule-based systems. Evol. Intell. 2009;2(1–2):5–19. [Google Scholar]

[b0155] Seera M., Lim C.P. A hybrid intelligent system for medical data classification. Expert Syst. Appl. 2014;41(5):2239–2249. [Google Scholar]

[b0160] Simon D. H∞ estimation for fuzzy membership function optimization. Int. J. Approximate Reasoning. 2005;40(3):224–242. [Google Scholar]

PERMALINK

Analytical fuzzy approach to biological data analysis

Weiping Zhang

Jingzhi Yang

Yanling Fang

Huanyu Chen

Yihua Mao

Mohit Kumar

Abstract

1. Introduction

Figure 1.

2. An uncertain model of multivariate data

Definition 1 Gaussian’s membership function (Kumar et al., 2016a, Kumar et al., 2016b) —

Definition 2 Gamma membership function (Kumar et al., 2016a, Kumar et al., 2016b) —

Figure 2.

Definition 3 Fuzzy membership function on vj —

Definition 4 Fuzzy membership function on yj —

Definition 5 Fuzzy membership function on y —

Definition 6 Fuzzy membership function on m —

Definition 7 Fuzzy membership function on α —

Definition 8 Fuzzy membership function on Gj —

Definition 9 Fuzzy membership of y as a finite mixture of uncertain signal models —

3. Analytical optimization of mixture of uncertain signal models

Result 1

4. An Algorithm for multivariate data modeling

4.1. Algorithm

4.2. Data distribution modeling

4.3. Classification

4.4. Demonstrations on Toy data sets

Figure 3.

Figure 4.

5. Heartbeat intervals classification

Table 1.

Table 2.

6. Concluding remarks

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Definition 3 Fuzzy membership function on v_j —

Definition 4 Fuzzy membership function on y_j —

Definition 7 Fuzzy membership function on $α$ —

Definition 8 Fuzzy membership function on $G_{j}$ —