Abstract
Fuzzy c-means (FCM) clustering has been extensively studied and widely applied in the tissue classification of biomedical images. Previous enhancements to FCM have accounted for intensity shading, membership smoothness, and variable cluster sizes. In this paper, we introduce a new parameter called “compactness” which captures additional information of the underlying clusters. We then propose a new classification algorithm, FCM with variable compactness (FCMVC), to classify three major tissues in brain MRIs by incorporating the compactness terms into a previously reported improvement to FCM. Experiments on both simulated phantoms and real magnetic resonance brain images show that the new method improves the repeatability of the tissue classification for the same subject with different acquisition protocols.
Index Terms: Biomedical image processing, fuzzy sets, image segmentation, magnetic resonance imaging
1. INTRODUCTION
The fuzzy c-means (FCM) algorithm [1] has been extensively used in medical image segmentation. It has been especially successfully at classifying major tissues from magnetic resonance images of the human brain [2,3]. FCM converges readily, is scale and shift invariant, and allows for the straightforward incorporation of multichannel data. Furthermore, FCM directly yields soft segmentations (in the form of member-ship functions) that are typically desirable intermediate data structures (as opposed to hard classifications) for further analysis [4]. The original algorithm has also been adapted to account for image shading [5], membership smoothness [6], and variable cluster size [7, 8].
Recently, Clark et. al. [9] observed that there exists inconsistency in tissue segmentations arising from the choice of pulse sequence in data acquisition. We have had similar observations in our experiments. The inconsistency is caused by the fact that the relative size of clusters can be considerably different among multi-parametric scans of the same subject. This is an undesirable property as we prefer the performance of the segmentation algorithm to be independent of the data acquisition process. In order to achieve consistent segmentations, we introduce a new parameter, called compactness, into the FCM framework to obtain a novel variational formulation that has one compactness parameter per class predetermined for different acquisitions. We name the new algorithm FCM with variable compactness (FCMVC). The same idea is also incorporated into an enhanced variant of FCM — Fuzzy and Noise Tolerant Adaptive Segmentation Method (FANTASM) [6], which yields FANTASM with Variable Compactness (FVC).
In this paper, we describe an optimization process to solve the proposed FVC formulation. To validate the proposed method, we compare the performance of FVC against several existing methods, including FANTASM, FCM with a fuzzy covariance matrix [7] (FCMV), and Gaussian Mixture Model (GMM). Experiments show that FVC achieves improved consistency in segmentations when magnetic resonance brain images are acquired using two different pulse sequences (spoiled gradient (SPGR) and magnetization prepared rapid gradient echo (MP-RAGE)) from the same subject.
2. BACKGROUND
Mathematically, FCM is the solution of the following energy function:
| (1) |
where yj is the observed image intensity at the jth pixel, C is the number of classes, υk’s are class centroids, Ω is image domain, and ujk is the membership function value of the jth pixel for the kth class. Membership functions must be non-negative, and satisfy the constraint . The energy function is minimized if high membership values are assigned to observations close to centroids and low membership values to observations away from centroids. The parameter q is a weighting exponent and is constrained by q > 1. When q = 1, FCM reduces to the hard K-means algorithm with the membership functions taking binary values. As the value of q increases, the fuzziness of the membership functions also increase.
Since we are interested in segmenting three major tissues in the human brain – the cerebrospinal fluid (CSF), the gray matter (GM), and the white matter (WM), we consider a three class problem in the following manner. Let υ1, υ2, and υ3 be the class centroids with υ1 < υ2 < υ3. Let yj vary over (−∞,+∞) and assume υk ’s are fixed. Recalling that , we can represent the membership functions [uj1 uj3 uj2], for a particular yj, as a point on the plane x + y + z = 1 (Fig. 1(a)) by taking uj1 = x, uj2 = z, uj3 = y. We plot the membership points as a continuous function of yj on this plane and show the 2D projection of the plane in Fig. 1(b) for the choices of q = 2 and q = 3. The three vertices in Fig. 1(b) starting from bottom left in clockwise direction are υ1, υ2, and υ3, respectively. When y ≪ υ1 or y ≫ υ3, the curve tends to , i.e, the center of the plane. In our case, for yj ∈ [υ1, υ2], it is expected that yj is a mixture of CSF and GM, with uj3 being small. Similarly uj1 is expected to be small for yj ∈[υ2, υ3], as it is a mixture of GM and WM. Considering the effect of q on the memberships, we observe that, if q increases, WM membership u j3 for yj ∈ [υ1, υ2]increases, and so does the CSF membership uj1 for yj ∈[υ2, υ3] This demonstrates that the parameter q captures information about the variability of clusters. Based on this interpretation, we assume that if the WM cluster has large variance, then u j3 for yj ∈[υ1, υ2] should increase. And similarly, if CSF cluster has large variance, then uj1 for yj ∈[υ2, υ3] should increase.
Fig. 1.
(a)uj1 + uj2 + uj3 = 1 plane. (b) Plot of u jk(yj ; q) on the plane. The left red line is u j3 = 0 and the right red line is uj1 = 0.
Several modifications of FCM have been proposed in the literature. A generalized Lp norm has been introduced in [10] to include variability of classes into energy function. Another way to take into account cluster variability is to incorporate a covariance matrix between classes, such as proposed with FCMV [7]. A simplified single-channel formulation is given as follows:
| (2) |
where σk is the variance of the kth class. In the next section, we will describe a new approach to model the variability of clusters.
3. FCM WITH VARIABLE COMPACTNESS (FCMVC)
3.1. Problem Formulation
We introduce a fixed parameter pk, k = 1, … ,C, in the FCM framework as minimization of the following function,
| (3) |
Minimization of JFCMVC gives the membership functions as,
We note that p = [p1 … pC] = [1 … 1] is the same as FCM with q = 2. The plot of memberships ūjk for a fixed set of υk and pk are shown in Fig. 2. We observe that for ∀yj ∈[υ1, υ2], ūj3 with p1 > p2 > p3 has increased from u j3 with p = [1 1 1] (Fig. 2(a)). Similarly ∀yj ∈[υ2, υ3], u¯ j1 with p1 < p2 < p3, has increased from u j1 (Fig. 2(b)). It shows that with decreasing pk, memberships increase for a fixed |yj − υ k|. So we infer that pk is a measure of the compactness of cluster k. It is chosen to be large for small classes and small for large classes.
Fig. 2.
Plot of u¯jk(yj ; p), for different parameters pk.
Based on the proposed method FCMVC, we now modify the FANTASM energy function [6] to FVC,
Ω, ujk, pk, yj, υk have been described earlier. Here, gj is a scalar gain field to account for image inhomogeneity, Dr and Ds are first order finite difference operators, R = 2 denotes vertical and horizontal directions, λ1 and λ 2 are regularizations on gain field, Nj is the set of first order neighbors of pixel j and β is a smoothing coefficient. λ1, λ2 and β are determined empirically.
Minimization of JFVC can be performed using the following algorithm.
Choose suitable compactness parameters pk (discussed in the following section).
Obtain initial estimate of υk.
Assume gj = 1, ∀j ∈ Ω.
- Compute membership functions,
- Update the centroids,
- Update gain field coefficients by solving the spatially varying equation,
If the algorithm converges, stop; otherwise go to step 4.
4. RESULTS
The compactness parameters are estimated based on a set of SPGR training data [4], for which we have manually selected landmarks on the cortical boundaries [11]. An exhaustive search was performed to estimate p, such that the segmentation isocontours generated by FVC were close to the landmarks. The compactness parameters estimated on the training data are pS = [0.9, 1.00, 1.06]. Next we estimated the compactness parameters for MP-RAGE using a training set of SPGR and MP-RAGE data of the same subject. The parameters for MP-RAGE are estimated based on an exhaustive search so that the isocontours of MP-RAGE segmentation line up with the isocontours of SPGR using pS. The estimated compactness for MP-RAGE was pM = [1.25, 1.0, 0.95]. Variances from Eq. 2, were also estimated in exactly the same way. The estimated variances were found to be σS 2 = [3.1, 1.44, 1] for SPGR and σM 2 = [1, 1.52, 2.55] for MP-RAGE. We used these values of pS, pM, σS, σM on the test images.
We conducted two experiments to verify that FVC can improve the consistency of the segmentation of the same subject under the two acquisition protocols. Our first experiment involves two synthetic data sets which we used to simulate the same object imaged with MP-RAGE and SPGR imaging parameters. We generated the phantoms from a fuzzy classification truth model using the statistical model outlined in [12]. The phantoms possessed 5% noise and did not have any inhomogeneity. The misclassification rate, defined as the ratio of the # of correctly classified voxels against the total # of non-background voxels, is reported in Tab. 1 for classifying the MP-RAGE and SPGR phantoms using GMM (with variable priors and variances), FANTASM, FCM with Fuzzy Covariance matrix (FCMV) and FVC. Our second experiment involved a pair of real SPGR and MP-RAGE T1 images for five subjects. We computed the hard segmentations using each of the methods used in the first experiment and report the Jaccard coefficient between the hard segmentations, averaged over all three classes. We used smoothing and inhomogeneity correction while using GMM and FCMV. The results are in Tab. 2, while Fig. 3 shows one of the data sets.
Table 1.
Percent Misclassification rate for SPGR/MP-RAGE with Ground Truth for two simulated phantoms. M is MPRAGE and S is SPGR. FCMV and FVC represent FCM with Fuzzy Covariance matrix and FANTASM with Variable Compactness. The two rows correspond to two different phantoms.
| GMM | FANTASM | FCMV | FVC | ||||
|---|---|---|---|---|---|---|---|
| S | M | S | M | S | M | S | M |
| 5.64 | 12.78 | 3.62 | 5.66 | 4.45 | 5.11 | 3.55 | 3.51 |
| 7.51 | 9.54 | 6.32 | 8.43 | 5.32 | 6.11 | 4.72 | 4.86 |
Table 2.
Jaccard Coefficients, averaged over all three classes, between SPGR and MP-RAGE hard segmentation of five subjects. See Tab. 1 for abbreviations.
| # | GMM | FANTASM | FCMV | FVC |
|---|---|---|---|---|
| 1 | 0.7383 | 0.5611 | 0.7781 | 0.7952 |
| 2 | 0.7047 | 0.4764 | 0.7501 | 0.7538 |
| 3 | 0.8134 | 0.6670 | 0.8391 | 0.8559 |
| 4 | 0.7707 | 0.5209 | 0.6363 | 0.7921 |
| 5 | 0.7377 | 0.4205 | 0.4173 | 0.7588 |
Fig. 3.
Comparison between hard segmentation of real MP-RAGE and SPGR images, subject #1. See Tab. 1 for abbreviations.
Tab. 1 shows that FANTASM has nearly the same misclassification rate as FVC on SPGR images, but fails to do as well on MP-RAGE images. From Fig. 3, we observe that GMM tends to over-estimate gray matter in SPGR, while FANTASM tends to over-estimate CSF in MP-RAGE. We also observe that FANTASM produces visually good results on SPGR data. It is also observed from Fig. 3 that FCMV performs better than FANTASM on MP-RAGE, but fails to do so on SPGR. Finally we can see that FVC performs better on both SPGR and MP-RAGE.
5. DISCUSSION AND CONCLUSION
A new compactness parameter in the FCM framework is introduced and we outlined an automatic fuzzy segmentation method based on the parameter. We have satisfied our primary goal of obtaining more similar segmentations from the same data acquired under differing protocols, SPGR and MP-RAGE. The proposed approach is currently designed for single channel data, it can be readily extended to a multichannel algorithm. In the future, we would like to develop a robust and fully automated framework for estimating the compactness parameters, as opposed to our current practice of estimation based on training.
Acknowledgments
This work was supported by the NIH/NINDS under grant 5R01NS037747.
Contributor Information
Snehashis Roy, Email: snehashisr@jhu.edu.
Harsh Agarwal, Email: harsh@jhu.edu.
Aaron Carass, Email: aaron_carass@jhu.edu.
Ying Bai, Email: ybai1@jhu.edu.
Dzung L. Pham, Email: pham@jhu.edu.
Jerry L. Prince, Email: prince@jhu.edu.
REFERENCES
- 1.Bezdek JC. A convergence theorem for the fuzzy ISO-DATA clustering algorithms. IEEE Trans. on Pattern Anal. Machine Intell.; Jan. 1980; [DOI] [PubMed] [Google Scholar]
- 2.Bezdek JC, Hall LO, Clarke LP. Review of MR image segmentation techniques using pattern recognition. Medical Physics. 1993;vol. 20:1033–1048. doi: 10.1118/1.597000. [DOI] [PubMed] [Google Scholar]
- 3.Pham TD. Image segmentation using probabilistic Fuzzy C-Means clustering. Intl. Conf. on Image Proc. 2001;vol. 1:722–725. [Google Scholar]
- 4.Han X, Pham DL, Tosun D, Rettmann ME, Xu C, Prince JL. CRUISE: Cortical Reconstruction Using Implicit Surface Evolution. NeuroImage. 2004 November;vol. 23:997–1012. doi: 10.1016/j.neuroimage.2004.06.043. [DOI] [PubMed] [Google Scholar]
- 5.Pham DL, Prince JL. An adaptive Fuzzy C-Means algorithm for image segmentation in the presence of intensity inhomogeneities. Pattern Recognition Letters. 1999 January;vol. 20:57–68. [Google Scholar]
- 6.Pham DL. Robust fuzzy segmentation of magnetic resonance images. 14th IEEE Symposium on Computer-Based Medical Systems; 2001. pp. 127–131. [Google Scholar]
- 7.Gustafson DE, Kessel WC. Fuzzy clustering with a fuzzy covariance matrix. Proc. IEEE Conf. Decision Contr; 1979. [Google Scholar]
- 8.Cavalcanti N, de A F, de Carvalho T. An adaptive Fuzzy C-Means algorithm with the L2 norm. Australian Conf. on Artificial Intel; 2005. pp. 1138–1141. [Google Scholar]
- 9.Clark KA, Woods RP, Rottenber DA, Toga AW, Mazziotta JC. Impact of acquisition protocols and processing streams on tissue segmentation of T1 weighted MR images. NeuroImage. 2006;vol. 29:185–202. doi: 10.1016/j.neuroimage.2005.07.035. [DOI] [PubMed] [Google Scholar]
- 10.Hathaway RJ, Bezdek JC, Hu Y. Generalized Fuzzy C-Means clustering strategies using Lp norm distances. IEEE Trans. on Fuzzy Systems. 2000 October;vol. 8(no 5):576–582. [Google Scholar]
- 11.Tosun D, Rettmann ME, Naiman DQ, Resnick SM, Kraut MA, Prince JL. Cortical Reconstruction Using Implicit Surface Evolution: A Landmark Validation Study. MICCAI. 2004;vol. 3216:384–392. [Google Scholar]
- 12.Pham DL, Prince JL, Xu C, Dagher AP. An automated technique for statistical characterization of brain tissues in magnetic resonance imaging. Intl. Journal of Patt. Recog. and Artificial Intell. 1997;vol. 11(no 8):1189–1211. [Google Scholar]



