Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Jul 9.
Published in final edited form as: Proc IEEE Int Symp Biomed Imaging. 2015 Apr;2015:1535–1539. doi: 10.1109/ISBI.2015.7164170

DISJUNCTIVE NORMAL SHAPE MODELS

Nisha Ramesh †,, Fitsum Mesadi †,, Mujdat Cetin *, Tolga Tasdizen †,⋆,*
PMCID: PMC4939085  NIHMSID: NIHMS797653  PMID: 27403233

Abstract

A novel implicit parametric shape model is proposed for segmentation and analysis of medical images. Functions representing the shape of an object can be approximated as a union of N polytopes. Each polytope is obtained by the intersection of M half-spaces. The shape function can be approximated as a disjunction of conjunctions, using the disjunctive normal form. The shape model is initialized using seed points defined by the user. We define a cost function based on the Chan-Vese energy functional. The model is differentiable, hence, gradient based optimization algorithms are used to find the model parameters.

Keywords: implicit, parametric, shape model, disjunctive normal form, Chan-Vese

1. INTRODUCTION

Shape models play an important role in many problems in biomedical imaging such as segmentation and analysis of variability in populations. Shape models can be categorized into several broad categories. First, shape models are either explicit where points on the curve/surface being modeled are directly represented or they are implicit where points on the curve/surface are embedded as a level set of a function. Second, shape models can also be categorized as parametric or non-parametric. A list of points on a 3D surface would be considered a non-parametric explicit model whereas snakes [1] and B-splines [2] are parametric explicit models. The most common implicit shape representation is the level set method [3, 4, 5], which is non-parametric. Parametric, implicit models are rarer and include algebraic curves and surfaces [6, 7]. In this paper, we propose a novel parametric, implicit shape model which we call the Disjunctive Normal Shape Model (DNSM). We approximate the characteristic function of a shape as a union of convex polytopes which themselves are represented as intersections of half-spaces in 2D or 3D. This type of representation of a Boolean function is known as the disjunctive normal form [8]. Next, we convert the disjunctive normal form into a differentiable model by: 1) using DeMorgan’s laws [8] to replace unions with intersections and complements, 2) representing intersections of half spaces as a product of perceptron equations and 3) relaxing the perceptrons used in representing the half-spaces to logistic sigmoid functions. We also take a variational approach and propose a simple cost function based on the Chan-Vese energy that can be used to drive the proposed model for segmenting objects in biomedical images. In this paper, we demonstrate the experimental results of segmentation for different modalities, retinal cells from confocal microscopy images in zebrafish, cardiac CT image, knee MR image, tumors in multimodal MRI brain images. While we focus on the mathematical foundation of the proposed model and its application to data-driven, region-based image segmentation in this paper, it is possible to extend its use to other segmentation scenarios. For instance, given a set of training shapes, prior distributions for the model parameters can be learned and used in segmenting new images. Along with such prior distributions, one can also use DNSMs in conjunction with atlas-based initializations. Finally, the statistics of the model parameters can also be useful in analyzing shape variability.

2. RELATED WORK

The pioneering work of Mumford and Shah [9] exemplifies variational approaches to image segmentation without any explicit ties to a shape model. Methods such as snakes [1] also employ energy minimization in conjunction with a specific shape model. Among such methods, variational image segmentation with level-sets has been a popular choice due to properties such as adaptive topology of level sets which can naturally change during evolution [10, 11]. However, due to their non-parametric nature level-set propagation always has to include a regularization term such as a penalty on curve length/surface area or curvature [12]. On the other hand, regularization is inherent in our model due to the limited amount of representation power afforded by its parametric nature. The proposed DNSM model is implicit and parametric. It has the advantage of being parametric which will can allow us to easily learn statistics and place regularizing priors on the shape model and it has the advantage of being an implicit representation which allows the model to naturally change topology during its evolution if needed. Graph-cut methods have become a popular alternative to level-set based segmentation [13, 14]. The use of interactive segmentation methods have been favored in scenarios when we need to segment a variety of regions. The GrabCut algorithm [15] uses iterative graph-cuts in an interactive fashion. Another popular interactive segmentation method is the random walks [16]. Our work is motivated by the recently proposed logistic disjunctive normal classifier [17].

3. METHODS

Shapes can be represented with their characteristic function f : RnB where B = {0, 1}. Let Ωf = {x ∈ Rn : f(x) = 1} represent the foreground region. The foreground region Ωf can be approximated as the union of N convex polytopes Ω~f=i=1NPi where the ith polytope is the intersection Pi=j=1MHij of M half-spaces Hij = {x ∈ Rn : hij(x)}. The half-space Hij, in arbitrary dimensions is defined using the perceptron equation

hij(x)={1,k=0nwijkxk00,otherwise} (1)

where wijk are the weights. Using Boolean logic any function b : BnB can be represented as a disjunction of conjunctions, known as the disjunctive normal form [8]. Hence, we can formulate the characteristic function for Ω~f as

f~(x)=i=1N(j=1Mhij(x))di(x) (2)

such that Ω~f={xRn:f~(x)=1}. We would like to convert the disjunctive normal form of the function to a differentiable model. First, the conjunction of binary variables j=1Mhij(x) is equivalent to the product j=1Mhij(x). Next, using De Morgan’s laws, we can express the disjunction i=1Ndi(x) as negation of conjunctions, ¬i=1N¬di(x), which in turn can be replaced by 1i=1N(1di(x)). Finally, we relax the binary perceptrons hij(x) to logistic sigmoid functions,

σij(x)=11+ek=0nwijkxk (3)

The resulting approximation to the shape characteristic functions is then given as

f^(x)=1i=1N(1j=1Mσij(x))gi(x) (4)

3.1. Parameter initialization

The parameters are initialized interactively using inputs from the user. The user defines a set of N seed points, Ci, i = 1 to N, for the foreground object such that they are well distributed in the region of interest. Using these seed points, we initialize the shape model with N polytopes and M = 32 logistic sigmoids per polytope. The polytopes are approximated as spheres with a fixed radius. This approximation is obtained by choosing the parameters as

wijk={cosθpsinϕq,k=0sinθpsinϕq,k=1cosϕq,k=2(r+Cixcosθpsinϕq+Ciysinθpsinϕq+Cizcosϕq),k=3}

for varying values of θp, ϕq. We choose θp=π4p and ϕq=π4q for p = [1 ⋯ 8] and q = [1 ⋯ 4]. By using different combinations of θp and ϕq, we get parameters representing different planes.

3.2. Energy Minimization

The cost function based on the Chan-Vese energy segments the image into foreground and background regions.

E(W)=Ωf(I(x)cf)2dx+Ω0(I(x)c0)2dx=Ω(I(x)cf)2f(x)+(I(x)c0)2(1f(x))dx (5)

where cf and c0 are the average intensities in the foreground and background region and Ω0 represents the background region. We fit the model to the data by minimizing this energy with respect to the weights W, using gradient descent. The gradient of the energy function with respect to the weights wijk is evaluated as follows:

Ewijk=[(I(x)cf)2(I(x)c0)2]f(x)f(x)=wijk(1r=1N(1gr(x)))=(ri(1gr(x)))gi(x)wijk=(ri(1gr(x)))(ljMσil(x))σij(x)wijk=(ri(1gr(x)))gi(x)(1σij(x))xk

The update equation is given as wijkwijkηEwijk, where η is the step-size which needs to be tuned for every dataset.

4. RESULTS

We present three experimental setups. We first experimented with confocal images to learn representations of retinal cells in zebrafish embryos. The underlying cell shapes are obtained by smoothing and thresholding the actual image. Segmentation of a cell with increasingly complex models is shown in Figure 1. We first start the segmentation with a single polytope, N = 1 and then refine the segmentation by increasing the number of polytopes. By comparing the segmentations for N = {1 ⋯ 5} polytopes, we see that the DNSM can capture complex boundaries. In this case, N = 3 captures most of the shape information while N = 5 provides almost a perfect representation.

Fig. 1.

Fig. 1

Segmentation of retinal cell in zebrafish embryo for N = 1 ⋯ 5.

Next, to demonstrate the general applicability of our segmentation, we segment images of different organs from different modalities. The objects of interest in these images are varying in size, shape, and contrast. We need to determine the number of polytopes needed to segment a region. We do this by varying the value of N and calculating the optimized energy of the DNSM, as seen in Figure 2. We see that the energy defined in Equation (5) decreases initially and stabilizes at N = 3, giving us the minimum number of polytopes needed to represent this object. We see in Figure 3(a), that the chosen N fits the knee nicely in terms of complexity. Since our algorithm is interactive, we also studied how sensitive the results are to the selection of seed points. The sensitivity of the placement of seed points is studied for a knee MR image and a CT cardiac image. The user defines a set of N seed points. These seed points are randomly shifted, the displacements are drawn from a normal distribution N(0, σ2). The segmentation is recomputed for the shifted seed points. The change in segmentation is calculated as the ratio of pixels that switched labels to the number of pixels originally labeled as foreground. The original images, segmentations and experimental results are shown in Figure 3. The experimental results are averaged over 20 trials. From the results, we observe that the algorithm is stable and produces a small change in segmentation for small displacements of the seed points.

Fig. 2.

Fig. 2

Optimized energy of the shape model for varying N (knee MR image).

Fig. 3.

Fig. 3

Sensitivity analysis of segmentation over 20 trials. Left: Seed initialization in green on original 2D image, the segmented region boundary in green, Right: Experimental results. The x-axis represents the standard deviation in pixels for the shift in the centroid positions. The y-axis represents the change in segmentation. The plotted line represents the mean, with error bar indicating the standard deviation.

While the previous results focused on demonstrating how the techniques works and sensitivity analysis, the last experiment is designed to obtain a quantitative assessment of the segmentation accuracy provided by our method. We applied our method to segment tumors in multimodal brain images in the BRATS-2012 dataset. We used the training set images which consists of 20 high grade images and 10 low grade images. The manual segmentations given had three intensity levels: 1 for edema, 2 for active tumor, and 0 for everything else. We only segmented the active tumor regions in this paper and used N = 5, M = 32. The visual segmentation results for the high grade and low grade real images are shown in Figure 4. Quantitative results for comparing the segmentation with automatic [18, 19] and semiautomatic [20] methods in [21] is given in Table 1. We use the DICE coefficients to compute the similarity between the segmented shape and the manual segmentations. We see that the performance of our algorithm is comparable to the methods in the challenge for high grade images, while it outperforms them for low grade images.

Fig. 4.

Fig. 4

Segmentation results for the BRATS 2012 dataset. Left: input weighted image (α T2 + β T1), manual segmentation (gray - edema, white - tumor), segmentation overlay on input image, Right: segmentation overlay on manual segmentation. The Top row shows an example of a high grade (HG) image and the bottom row shows an example of a low grade (LG) image.

Table 1.

Quantitative comparison of DICE coefficients for segmentation of the BRATS-2012 dataset (HG - high grade images, LG - low grade images).

Method HG LG
Zikic et al. [18] 0.71 0.62
Bauer et al. [19] (with std. dev.) 0.62±0.27 0.49±0.26
Hamamci et al.[20] 0.73 0.71
Our method 0.73 0.74

5. CONCLUSION

We proposed a novel implicit parametric shape model and an associated energy to segment objects of interest in medical images. The proposed method provides good segmentations even on low grade tumor images. We examined the sensitivity of the algorithm to the placement of seeds needed to represent a region. We also illustrated how we can capture complex boundaries by increasing the number of polytopes in the model for segmentation. A local Chan-Vese model can be used to further improve our segmentation results. One direction for future work is fully automated segmentation using the proposed model and atlas based initializations for tasks such as prostate or hippocampus segmentation. While we have used conjunctions of half-spaces in our current work, more application specific shape primitives will be considered in future work. Finally, given a set of training shapes, prior distributions for the model parameters can be learned and used as a regularization term in segmenting new images.

Acknowledgments

This work is supported by NIH grant 1R01-GM098151-01, “Fluorender: An Imaging Tool for Visualization and Analysis of Confocal Data as Applied to Zebrafish Research” and TUBITAK-2221 Fellowships for Visiting Scientists and Scientists on Sabbatical Leave.

REFERENCES

  • [1].Kass M, Witkin A, Terzopoulos D. Snakes: Active contour models. International Journal of Computer Vision. 1988;1(4):321–331. [Google Scholar]
  • [2].Piegl L, Tiller W. The NURBS Book. Springer; 1997. [Google Scholar]
  • [3].Osher S, Sethian JA. Fronts propagating with curvature-dependent speed: Algorithms based on Hamilton-Jacobi formulations. J. Comput. Phys. 1988;79(1):12–49. [Google Scholar]
  • [4].Sethian JA. Level Set Methods and Fast Marching Methods: Evolving Interfaces in Computational Geometry, Fluid Mechanics, Computer Vision and Materials Science. Cambridge University Press; 1999. [Google Scholar]
  • [5].Osher S, Fedkiw RP. Level Set Methods and Dynamic Implicit Surfaces. Springer-Verlag; 2002. [Google Scholar]
  • [6].Taubin G, et al. Parameterized families of polynomials for bounded algebraic curve and surface fitting. IEEE Trans Pattern Analysis and Machine Intelligence. 1994 Mar;16(3):287–303. [Google Scholar]
  • [7].Tasdizen T, et al. Improving the stability of algebraic curves for applications. IEEE Trans Image Processing. 2000 Mar;9(3):405–416. doi: 10.1109/83.826778. [DOI] [PubMed] [Google Scholar]
  • [8].Hazewinkel M, editor. Encylopedia of Mathematics. Springer; 2001. [Google Scholar]
  • [9].Mumford D, Shah J. Optimal approximation by piecewise-smooth functions and associated variational problems. Commun. Pure Appl. Math. 1989;42(5):577–685. [Google Scholar]
  • [10].Osher S, Paragios N. Geometric Level Set Methods in Imaging Vision and Graphics. Springer-Verlag; 2003. [Google Scholar]
  • [11].Brox T, Weickert J. vol. 3175 of Lecture Notes in Computer Science. 2004. Level set based image segmentation with multiple regions; pp. 415–423. [Google Scholar]
  • [12].Cremers D. Shape priors in variational image segmentation: Convexity, lipschitz continuity and globally optimal solutions; CVPR; June 2008.pp. 1–6. [Google Scholar]
  • [13].Boykov Y, et al. Fast approximate energy minimization via graph cuts. IEEE Trans Pattern Analysis and Machine Intelligence. 2001 Nov;23(11):1222–1239. [Google Scholar]
  • [14].Grady L, Alvino C. The piecewise smooth Mumford-Shah functional on an arbitrary graph. IEEE Trans Image Processing. 2009 Nov;18(11):2547–2561. doi: 10.1109/TIP.2009.2028258. [DOI] [PubMed] [Google Scholar]
  • [15].Rother C, et al. Grabcut: Interactive foreground extraction using iterated graph cuts. ACM Trans on Graphics. 2004 Aug;23(3):309–314. [Google Scholar]
  • [16].Grady L. Random walks for image segmentation. IEEE Trans Pattern Analysis and Machine Intelligence. 2006 Nov;28(11):1768–1783. doi: 10.1109/TPAMI.2006.233. [DOI] [PubMed] [Google Scholar]
  • [17].Seyedhosseini M, Sajjadi M, Tasdizen T. Image segmentation with cascaded hierarchical models and logistic disjunctive normal networks; IEEE International Conference on Computer Vision (ICCV); Dec 2013; pp. 2168–2175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Zikic D, et al. MICCAI, vol. 7512 of Lecture Notes in Computer Science(LNCS) 2012. Decision forests for tissue-specific segmentation of high-grade gliomas in multi-channel MR; pp. 369–376. [DOI] [PubMed] [Google Scholar]
  • [19].Bauer S, et al. MICCAI, vol. 6893 of Lecture Notes in Computer Science (LNCS) 2011. Fully automatic segmentation of brain tumor images using support vector machine classification in combination with hierarchical conditional random field regularization; pp. 354–361. [DOI] [PubMed] [Google Scholar]
  • [20].Hamamci A, et al. Tumor-cut: Segmentation of brain tumors on contrast enhanced MR images for radiosurgery applications. IEEE Trans Medical Imaging. 2012 Mar;31(3):790–804. doi: 10.1109/TMI.2011.2181857. [DOI] [PubMed] [Google Scholar]
  • [21].Menze B, et al. The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS) 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [22].Ramesh N, Tasdizen T. Cell tracking using particle filters with implicit convex shape model in 4D confocal microscopy images; IEEE International Conference on Image Processing (ICIP); 2014; [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Tasdizen T, Cooper DB. Boundary estimation from intensity/color images with algebraic curve models; IEEE International Conference on Pattern Recognition (ICPR); 2000.pp. 225–228. [Google Scholar]

RESOURCES