Abstract
In this paper, we propose a novel segmentation method for cells in histopathologic images based on a sparse shape prior guided variational level set framework. We automate the cell contour initialization by detecting seeds and deform contours by minimizing a new energy functional that incorporates a shape term involving sparse shape priors, an adaptive contour occlusion penalty term, and a boundary term encouraging contours to converge to strong edges. As a result, our approach is able to accommodate mutual occlusions and detect contours of multiple intersected cells. We apply our algorithm to a set of whole-slide histopathologic images of brain tumor sections. The proposed method is compared with other popular methods, and demonstrates good accuracy for cell segmentation by quantitative measures, suggesting its promise to support biomedical image-based investigations.
Index Terms: Cell Segmentation, Level Set, Shape Priors, Sparse Representation
1. INTRODUCTION
Image segmentation is a fundamental problem in computer vision and image analysis. In this community, level set-based approaches are important tools, because they are able to handle cell shapes with complex topological variations [1, 2, 3, 4]. However, this set of methods still faces several challenges. 1) It is difficult to segment cells without appropriate cell initializations. 2) Methods using shape prior models take as a reference a predefined set including shape priors insufficient to represent shape variations in most cases. 3) Segmentation of mutually occluded cells remains challenging. To address these problems, we propose a variational level set model to segment multiple cells with mutual occlusion and demonstrate its efficacy with application to pathologic images of brain tumor tissues. Although overlapped cells may take a small fraction of the cell population of interest, the absolute number is large in a whole-slide microscopy image where the total cell number is commonly on the scale of million [5]. Additionally, some tumor regions tend to have very high tumor cell density, presenting much more overlapped cells of interest than other regions. Thus, it would lead to too much information loss if we excluded such overlapped cells from clinically meaningful tissue areas for morphometry analysis [6]. The main contributions of this work are summarized as follows: 1) We propose a two-stage framework to effectively detect seeds and segment cells. 2) A sparse representation based shape term is introduced to better incorporate shape prior information. 3) We introduce a dynamic cell occlusion term to deal with occlusion events involving a variable number of cells. 4) Our approach is successfully applied to brain histopathological images for cell segmentation task. Our framework, as shown in Figure 1, consists of a seed detection algorithm for cell contour initialization and an integrated contour deformable model that incorporates region, shape and boundary information.
2. METHOD
2.1. Shape representation
We locate cells and initialize cell contours by a previously developed seed detection algorithm that utilizes joint information of spatial connectivity, distance constraint, image edge map, and a shape-based voting map derived from eigenvalue analysis of Hessian matrix across multiple scales [7]. Given an image with detected seeds of N cells , , …, , we initialize each cell as a small circle centered at a seed. Each cell contour is a closed and bounded shape contour in image domain . The level set framework implicitly represents as a zero level set of a Lipschitz function where ϕi(x) has positive and negative value when x is inside and outside , respectively. A distance map function Γ(ϕi(x)) represents the shortest distance from a current pixel x in the image domain Ω to a given cell contour . In this way, we can represent N cells in a image as N distance map functions. Instead of partitioning an image into disjoint regions, we allow an image pixel to be associated with multiple cells with intersecting contours.
We use a set of 27,000 shape priors to cover the complex shape variation observed in most histopathology images. Shape priors are annotated manually and aligned by the generalized Procrustes analysis first [8]. Next, they are clustered with spectral clustering algorithm into M clusters and the average shape of each resulting cluster is computed [9]. We denote Ψ = {ψ1, ψ2, …, ψM} as the set of two-dimensional distance maps derived from average shape priors with unit Frobenius norm. We align the cell contour to shape priors Ψ by the similarity transformation: , where x = [x1, x2]T and are a pixel location before and after alignment; {si}, {θi}, and {ti = [ti1, ti2]T} are scale, rotation and translation coefficient, respectively. As our shape priors Ψ have nearly same scales, we set si = 1, ∀i to achieve better computational efficiency.
After mapping the i-th cell to the shape prior set, we assume that a cell with the associated distance map function Γ(ϕi(x)) can be approximately represented as a linear composition of aligned sparse shape priors in . We vectorize two dimensional distance maps and denote vectorized distance map of as . Additionally, we compose the mapped shape prior matrix with vectorized shape prior distance maps by column and denote it as . Note that each column has a unit norm. By the linearity assumption, we have , where αi is a coefficient vector representing the weights of shape priors in shape reconstruction and ei is the error vector.
Given a large enough training set, a test shape can be sufficiently represented by a small number of shape priors [10][11]. Dissimilar prior shapes are suppressed due to large penalty measured by the reconstruction error. To ensure sparsity and solvability, we apply the relaxed norm regularization in the formulation [12]. Moreover, we reduce dimensionality to accommodate computational cost using a non-zero projection matrix R ∈ ℝd×m, d ≪ m with entries randomly generated from standard Gaussian distribution and subject to unit length for all rows. Note the choice of matrix R does not critically affect the ability to find the optimal sparse solution by -minimization [10]. The resulting sparse representation problem is formulated as:
(1) |
where , , , , and is an identity matrix.
2.2. Level set functional formulation
Distance map Γ(ϕi) can be encoded as a sparse linear superposition of shape priors, with coefficients obtained by optimizing equation (1). Thus, we define a shape term for the fitting shape error between individual deformable contours and linear combination of sparsely selected shape priors:
(2) |
where is the j-th entry of . With minimization of this term, the distance map function Γ(ϕi) best fits to the feature space defined by the shape priors. Note when λ is sufficiently large, the sparse shape term is determined by a single shape prior and our model is, therefore, degenerated to a single shape prior segmentation.
In two dimensional (2D) histopathologic images, it is common to have mutually occluded cells since 2D images represent projections of objects in 3D space. It is a challenging task to segment mutually occluded cells and identify separating boundaries. To address this problem, we introduce an adaptive occlusion penalty term to dynamically suppress cell intersection events. This occlusion penalty is designed to depend on the number of cells that are overlapped:
(3) |
where H(·) is the Heaviside function. The first term in E2 is a superposition of the Heaviside functions of all cells and the second term is a binary function indicating inside of any cell/outside of all cells. This term represents a dynamic occlusion penalty adaptive to the number of intersecting cells.
We solve cell segmentation problem by minimizing the following functional within a variational level set framework:
(4) |
where ci and cb are scalars that define the average pixel intensities in cell regions and background, respectively. is an exponential function monotonously decreasing as the magnitude of gradient of x increases. R(·) is defined as a double-well potential function [13]. |∇H(ϕi)| is non-zero only on cell boundaries. We denote Φ = {ϕi}, C = {ci, cb}, Θ = {θi} and T = {ti1, ti2}, i = 1, 2, …, N as sets of variables of the energy functional. Parameters {λo, λb, μ, ξ, ω, ν} are weights of different terms. The last two terms in equation (4) are used to encourage contours to converge to strong boundaries and retain signed distance property for stable level set function computation respectively.
2.3. Numerical computations
We optimize the established functional iteratively by updating functions Φ and variables {C, Θ, T} alternatively. Parameterizing iteration as an artificial time variable t > 0, we minimize the functional by solving Euler-Lagrange equation:
(5) |
where δ(·) denotes the Dirac delta function.
Next, we fix Φ and update transformation parameters. We derive updating equations for transformation parameters by computing gradient descent of functional E(Θ, T):
(6) |
(7) |
where k = 1, 2. Finally, we fix Φ, and {Θ, T}, and update ci and cb.
(8) |
(9) |
3. EXPERIMENTAL RESULTS
We verify our algorithm with whole-slide histopathologic images of brain tumors. For computational efficiency, 512×512 pixel image patches are used for test, containing 1857 cells in aggregation. We set parameters empirically as follows: λo = λb = 1, μ = 5000, ξ = 2, ω = 2000, ν = 3000.
We cluster the shape priors into 100 groups by spectral clustering and use group mean shapes to construct the shape prior dictionary. Next, we apply the seed detection algorithm to search for seed points [7]. We present a typical cell seed detection result in Figure 2(a), where cyan circles are used to represent detected cell seeds. We solve the cell segmentation problem by the variational level set framework. The level set functions do not stop evolving until they either reach convergence or exceed iteration number limit. We present the evolution results of zero level sets at iteration 10, 20, and 30 in Figure 2 where the detected cell shapes are well defined. Notably, those overlapping cells are also correctly segmented, with occluded boundaries recovered. It is also observed that most zero level sets can rapidly converge to true cell boundaries within 10 iterations. After that, zero level sets fine tune themselves to better fit to cell contour details. This is especially noticeable for those overlapped cells.
We also investigate the segmentation contribution from distinct terms in our variational model by comparing their associated segmentation results. The segmentation result with our default parameter setting is shown in Figure 3(a). When ν = 0, the shape prior fitting term E1 does not take effect in the contour deformation process. The resulting segmentation outcome is presented in Figure 3(b), where finalized cell contours are less regulated. Similarly, we can remove the occlusion penalty term E2 from the variational model by setting ω = 0. The associated result is illustrated in Figure 3(c). Under this setting, the detected cells present a strong inclination to overlap with each other. When the shape prior fitting term E1, the dynamic occlusion penalty term E2, and the evolutionary stability term are all removed (ν = ω = ξ = 0), the resulting cell contours become significantly degraded as shown in Figure 3(d). Note that shapes appear to be less regulated in Figure 3(b–d) without one or more terms in the variational model. In addition to the final results with only a subset of terms in Figure 3(b–d), we also investigate the sensitivity of parameters to final results. Our investigations with parameter deviation from our proposed value set suggest that results remain similar even when we change μ by 22% (μ = 3900), 10% (μ = 4500), and ν by 67% (ν = 5000), as presented in Figure 3(e–g). Figure 3(h) presents results when μ and ν are simultaneously changed by 22% and 67%. Overall, a larger μ leads to a better contour convergence to true cell boundary; A larger ν forces contours to look more similar to the reconstructed sparse shape priors by E1; A larger ω tends to prevent cells more from overlapping with each other by E2. The other three parameters, i.e. λo = λb = 1, ξ = 2, do not need to be tuned in general. As a result, this method can be applied to other datasets without much parameter tuning.
To assess our method quantitatively, we collect human expert annotations on testing images and use human annotations as ground truth. By comparing the experimental result B with ground truth annotation A, we evaluate our method with multiple metrics[17, 18]: Jaccard coefficient J(A, B), precision rate P(A, B), recall rate R(A, B), F1 score F1(A, B) and Hausdorff distance H(A, B). We present in Table 1 performances of the proposed method and three other popular methods, including MS [14], ISO [15], and SBGFRLS [16]. A qualitative method performance comparison is shown in Figure 4. Note our proposed method is able to capture cell boundaries better. In particular, only our method can recover boundaries of overlapped cells by comparisons. This property makes our method superior to the other methods when analytics of occluded cells is crucial in investigations.
Table 1.
Methods | J | P | R | F1 | H | |||||
---|---|---|---|---|---|---|---|---|---|---|
Mean | Std Dev | Mean | Std Dev | Mean | Std Dev | Mean | Std Dev | Mean | Std Dev | |
MS [14] | 0.49 | 0.25 | 0.61 | 0.32 | 0.80 | 0.17 | 0.62 | 0.25 | 27.97 | 30.07 |
ISO [15] | 0.69 | 0.22 | 0.82 | 0.27 | 0.83 | 0.14 | 0.79 | 0.21 | 12.77 | 24.39 |
SBGFRLS [16] | 0.75 | 0.12 | 0.91 | 0.15 | 0.82 | 0.09 | 0.85 | 0.10 | 6.84 | 8.46 |
Ours | 0.76 | 0.11 | 0.93 | 0.11 | 0.82 | 0.11 | 0.86 | 0.09 | 4.96 | 4.76 |
4. CONCLUSION
We propose a new cell segmentation method based on the level set framework, aiming to identify contours of multiple cells with mutual occlusion in histopathologic images. For cell contour deformation, we construct a shape dictionary including typical cell prior shapes by spectral clustering. An adaptive cell occlusion penalty term is designed to dynamically penalize cell contour occlusion based on the number of overlapped cells. Experiments on brain tumor images produce promising results as compared with other methods, suggesting the effectiveness of our approach on cell segmentation for histopathologic images.
Acknowledgments
This research is supported in part by grants from National Institute of Health K25CA181503 and R01CA176659, The Emory University Research Committee, National Science Foundation ACI 1443054 and IIS 1350885, and CNPq.
References
- 1.Chan TF, Vese LA. Active contours without edges. IEEE Transactions on Image Processing. 2001;10(2):266–277. doi: 10.1109/83.902291. [DOI] [PubMed] [Google Scholar]
- 2.Chan T, Zhu W. IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR) Vol. 2. IEEE; 2005. Level set based shape prior segmentation; pp. 1164–1170. [Google Scholar]
- 3.Qi X, Xing F, Foran DJ, Yang L. Robust segmentation of overlapping cells in histopathology specimens using parallel seed detection and repulsive level set. IEEE Transactions on Biomedical Engineering. 2012;59(3):754–765. doi: 10.1109/TBME.2011.2179298. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bergeest J, Rohr K. Efficient globally optimal segmentation of cells in fluorescence microscopy images using level sets and convex energy functionals. Medical image analysis. 2012;16(7):1436–1444. doi: 10.1016/j.media.2012.05.012. [DOI] [PubMed] [Google Scholar]
- 5.Kong J, Cooper L, Wang F, Gao J, Teodoro G, Scarpace L, Mikkelsen T, Moreno C, Saltz J, Brat D. Generic, computer-based morphometric human disease classification using large pathology images uncovers signature molecular correlates. PLoS One. 2013;8(11):e81049. doi: 10.1371/journal.pone.0081049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Gupta M, Djalilvand A, Brat D. Clarifying the diffuse gliomas. American Journal of Clinical Pathology. 2005;124(5):755–768. doi: 10.1309/6JNX-4PA6-0TQ5-U5VG. [DOI] [PubMed] [Google Scholar]
- 7.Kong J, Zhang P, Liang Y, Teodoro G, Brat DJ, Wang F. IEEE International Symposium on Biomedical Imaging (ISBI) IEEE; 2016. Robust cell segmentation for histological images of glioblastoma; pp. 1041–1045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Goodall C. Procrustes methods in the statistical analysis of shape. Journal of the Royal Statistical Society Series B (Methodological) 1991:285–339. [Google Scholar]
- 9.Von Luxburg U. A tutorial on spectral clustering. Statistics and Computing. 2007;17(4):395–416. [Google Scholar]
- 10.Wright J, Yang A, Ganesh A, Sastry S, Ma Y. Robust face recognition via sparse representation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2009;31(2):210–227. doi: 10.1109/TPAMI.2008.79. [DOI] [PubMed] [Google Scholar]
- 11.Zhang S, Zhan Y, Dewan M, Huang J, Metaxas DN, Zhou XS. IEEE Conference on Computer Vision and Pattern Recognition (CVPR) IEEE; 2011. Sparse shape composition: A new framework for shape prior modeling; pp. 1025–1032. [Google Scholar]
- 12.Donoho D. For most large underdetermined systems of linear equations the minimal l1-norm solution is also the sparsest solution. Communications on Pure and Applied Mathematics. 2006;59(6):797–829. [Google Scholar]
- 13.Li C, Xu C, Gui C, Fox M. Distance regularized level set evolution and its application to image segmentation. IEEE Transactions on Image Processing. 2010;19(12):3243–3254. doi: 10.1109/TIP.2010.2069690. [DOI] [PubMed] [Google Scholar]
- 14.Comaniciu D, Meer P. Mean shift: A robust approach toward feature space analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2002;24(5):603–619. [Google Scholar]
- 15.Grady L, Schwartz E. Isoperimetric graph partitioning for image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2006;28(3):469–475. doi: 10.1109/TPAMI.2006.57. [DOI] [PubMed] [Google Scholar]
- 16.Zhang K, Zhang L, Song H, Zhou W. Active contours with selective local or global segmentation: a new formulation and level set method. Image and Vision Computing. 2010;28(4):668–676. [Google Scholar]
- 17.Fawcett T. An introduction to roc analysis. Pattern Recognition Letters. 2006;27(8):861–874. [Google Scholar]
- 18.Huttenlocher D, Klanderman G, Rucklidge W. Comparing images using the hausdorff distance. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1993;15(9):850–863. [Google Scholar]