Abstract
This paper presents a novel segmentation approach featuring shape constraints of multiple structures. A framework is developed combining statistical shape modeling with a maximum a posteriori segmentation problem. The shape is characterized by signed distance maps and its modes of variation are generated through principal component analysis. To solve the maximum a posteriori segmentation problem, a robust Expectation Maximization implementation is used. The Expectation Maximization segmenter generates a label map, calculates image intensity inhomogeneities, and considers shape constraints for each structure of interest. Our approach enables high quality segmentations of structures with weak image boundaries, which is demonstrated by automatically segmenting 32 brain MRIs into right and left thalami.
1 Introduction
For many age- or disease-related brain studies, large quantities of Magnetic Resonance Images (MRI) have to be accurately segmented into anatomical regions. Achieving high quality brain MRI segmentation is quite challenging for automatic methods, so researchers often have to rely on labor-intensive manual delineation. The task is challenging because some structures have very similar intensity characteristics, such as substructures in the cortical gray matter, while others have only weakly visible boundaries (e.g. thalamus). Recent methods using enhanced anatomical knowledge have greatly improved the quality of automatically generated results.
We briefly summarize methods that incorporate shape constraints into the segmentation process. A promising approach [1–3] is based on level set functions. It characterizes shape by signed distance maps in combination with Principal Component Analysis (PCA) [4]. Generally, PCA finds the largest modes of variation among the signed distance maps. Besides level sets, deformable model methods have used many different shape representations, such as spherical harmonics [5], point based models [4], skeleton or medial representations [6], and finite element models [7].
The novel approach presented in this paper is most closely related to work by Tsai and Leventon [1, 2]. While PCA based segmentation methods are very robust, they are also constrained in the degrees of freedom of the shape variations allowed. We therefore couple the PCA based shape modeling with a maximum a posteriori estimation problem, which is solved through an Expectation Maximization (EM) implementation developed by Pohl et al. [8]. This allows the system to accommodate shapes that differ somewhat from those modeled by the PCA. Additionally, the method can segment multiple objects and estimate intensity inhomogeneities in the image.
2 Method
This section discusses the integration of shape constraints into an EM segmentation algorithm. First, the shape variations across subjects are captured through PCA [9]. Afterwards, the shape constraints are added to the parameter space of an EM-based segmentation algorithm [8].
2.1 Shape Representation
Various shape representations have been explored in medical imaging. For our work, we chose signed distance maps due to their robustness. The structure’s shape variations are captured by PCA. To apply PCA to the training data, we first align all training sets using the affine registration method developed by Warfield [10]. Then, each data set i is transformed into structure-specific signed distance maps 𝒟i,a, where a represents the structure of interest (see also Figure 1). In these distance maps, positive values are assigned to voxels within the boundary of the object, while negative values indicate voxels outside the object. By taking the average over all these distance maps we define the mean distance map 𝒟̄a and the mean corrected signed distance maps 𝒟i,a − 𝒟̄a. The input for PCA is the vector defined by the mean corrected signed distance maps of the N structures of interest. Therefore, PCA is applied to all structures at once. This analysis defines the shape constraints of the entire image, which are represented by the eigenvector (or modes of variation) matrix U, the eigenvalue matrix Λ, and the mean distance map 𝒟̄ (see also Figure 2). To reduce the computational complexity for the EM implementation, U and Λ are defined only by the first K eigenvectors and eigenvalues, where K is chosen so that they capture 99% of the eigenvalues’ energy.
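As a rough illustration of the two ingredients above, the following Python sketch builds a signed distance map for a one-dimensional binary mask (positive inside the object, negative outside) and picks the number of modes K whose eigenvalues reach a given fraction of the total eigenvalue energy. The function names, the brute-force distance computation, and the 1D setting are assumptions made for illustration only; the paper works on aligned 3D volumes and does not describe its implementation.

```python
def signed_distance_1d(mask):
    """Signed distance to the nearest voxel of the opposite label.

    Positive inside the object (mask value 1), negative outside,
    following the sign convention stated in the text.
    Assumes the mask contains both labels.
    """
    distances = []
    for i, label in enumerate(mask):
        # Brute force: distance to the closest voxel carrying the other label.
        nearest = min(abs(i - j) for j, other in enumerate(mask) if other != label)
        distances.append(nearest if label else -nearest)
    return distances


def num_modes_for_energy(eigenvalues, fraction=0.99):
    """Smallest K such that the K largest eigenvalues reach `fraction` of the total."""
    ordered = sorted(eigenvalues, reverse=True)
    total = sum(ordered)
    cumulative = 0.0
    for k, value in enumerate(ordered, start=1):
        cumulative += value
        if cumulative >= fraction * total:
            return k
    return len(ordered)


mask = [0, 0, 1, 1, 1, 0]
distance_map = signed_distance_1d(mask)
k = num_modes_for_energy([50.0, 30.0, 15.0, 4.5, 0.5])
```

In practice a dedicated distance transform would replace the brute-force loop, which is quadratic in the number of voxels.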
The shapes in a specific brain image will be captured by the expansion coefficients of the eigenvector representation which we call shape parameters 𝒮 = (𝒮1, ⋯, 𝒮K). 𝒮 relates to the distance maps by 𝒟𝒮 = 𝒟̄ + U·𝒮. We will refer to the shape parameter generated distance map of a specific structure a as 𝒟𝒮,a = 𝒟̄a + Ua · 𝒮, where Ua are just the entries in U that refer to structure a.
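The relation 𝒟𝒮 = 𝒟̄ + U·𝒮 is a plain matrix-vector expansion. A minimal Python sketch, assuming the distance map is flattened into a list of V voxels and U is stored as V rows of K entries (both storage choices and the function name are hypothetical):

```python
def distance_map_from_shape(mean_map, modes, shape_params):
    """Evaluate D_S = D_mean + U * S for a flattened distance map.

    mean_map:     list of length V (mean signed distance per voxel)
    modes:        V rows, each holding the K eigenvector entries of one voxel (U)
    shape_params: list of length K (the shape parameters S)
    """
    if any(len(row) != len(shape_params) for row in modes):
        raise ValueError("each row of U needs one entry per shape parameter")
    return [
        mean + sum(u * s for u, s in zip(row, shape_params))
        for mean, row in zip(mean_map, modes)
    ]


mean_map = [1.0, -1.0, 0.5]
modes = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
reconstructed = distance_map_from_shape(mean_map, modes, [0.5, -2.0])
```

Setting all shape parameters to zero returns the mean distance map, and restricting the rows of `modes` to the voxels of one structure gives the structure-specific map 𝒟𝒮,a.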
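The probability distribution over the shape parameters p(𝒮) is now defined by the Gaussian distribution
$$p(\mathcal{S}) = \frac{1}{\sqrt{(2\pi)^K |\Lambda|}} \exp\left(-\frac{1}{2}\, \mathcal{S}^T \Lambda^{-1} \mathcal{S}\right),$$
where K is the dimension of eigenvalue matrix Λ.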
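To make the shape prior concrete, the sketch below evaluates the logarithm of a zero-mean Gaussian over the shape parameters. It assumes that Λ is diagonal with the K retained eigenvalues on its diagonal, which is the usual outcome of PCA; the function name is hypothetical.

```python
import math


def log_shape_prior(shape_params, eigenvalues):
    """Log of a zero-mean Gaussian density with diagonal covariance.

    Assumes Lambda = diag(eigenvalues) with strictly positive entries, so that
    log p(S) = -0.5 * sum(S_i^2 / lambda_i) - 0.5 * sum(log(2 * pi * lambda_i)).
    """
    if len(shape_params) != len(eigenvalues):
        raise ValueError("need one eigenvalue per shape parameter")
    quadratic = sum(s * s / lam for s, lam in zip(shape_params, eigenvalues))
    log_norm = sum(math.log(2.0 * math.pi * lam) for lam in eigenvalues)
    return -0.5 * quadratic - 0.5 * log_norm


eigenvalues = [4.0, 1.0]
at_mean = log_shape_prior([0.0, 0.0], eigenvalues)
one_sigma = log_shape_prior([2.0, 0.0], eigenvalues)  # one standard deviation along mode 1
```

Moving one standard deviation along a mode lowers the log prior by 0.5, so shapes far from the mean are penalized in proportion to how unusual they are relative to the training variation.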
2.2 Estimating Intensity Inhomogeneities and Shape
The algorithm proposed in this section is based on an EM-based segmentation algorithm by Pohl et al. [8], which uses probability atlases to define the spatial distribution of structures. Expanding this approach, we will not only approximate the maximum a posteriori (MAP) estimate of the image intensity inhomogeneities ℬ but also the MAP estimate of the shape parameters 𝒮. In this framework the MAP estimates of the parameter space, i.e. ℬ and 𝒮, depend on the partition of the image into anatomical regions 𝒯 (the hidden data), the log intensities of the input image ℐ (the observed data), and previous estimates of the inhomogeneities ℬ′ as well as the shape parameters 𝒮′. Therefore, our approach tries to solve the following problem:
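$$(\mathcal{B}'', \mathcal{S}'') = \arg\max_{\mathcal{B},\mathcal{S}} Q((\mathcal{B},\mathcal{S}),(\mathcal{B}',\mathcal{S}')) := \arg\max_{\mathcal{B},\mathcal{S}} E_{\mathcal{T}|\mathcal{I},\mathcal{B}',\mathcal{S}'}\big(\log p(\mathcal{B},\mathcal{S}|\mathcal{T},\mathcal{I})\big) = \arg\max_{\mathcal{B},\mathcal{S}} E_{\mathcal{T}|\mathcal{I},\mathcal{B}',\mathcal{S}'}\big(\log p(\mathcal{I}|\mathcal{T},\mathcal{B}) + \log p(\mathcal{B},\mathcal{S}|\mathcal{T})\big) \tag{1}$$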
where E𝒯|ℐ,ℬ′,𝒮′(log p(ℬ,𝒮|𝒯,ℐ)) := Σ𝒯 p(𝒯 |ℐ,ℬ′,𝒮′) · log p(ℬ,𝒮|𝒯,ℐ) and we assume independence of 𝒮 in p(ℐ|𝒯,𝒮,ℬ). If we further assume independence between ℬ and 𝒮, and between ℬ and 𝒯, then the maximization problem can be simplified to (see footnote 1):
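$$\mathcal{B}'' = \arg\max_{\mathcal{B}} E_{\mathcal{T}|\mathcal{I},\mathcal{B}',\mathcal{S}'}\big(\log p(\mathcal{I}|\mathcal{T},\mathcal{B})\big) + \log p(\mathcal{B}) \tag{2}$$
$$\mathcal{S}'' = \arg\max_{\mathcal{S}} E_{\mathcal{T}|\mathcal{I},\mathcal{B}',\mathcal{S}'}\big(\log p(\mathcal{S}|\mathcal{T})\big) = \arg\max_{\mathcal{S}} E_{\mathcal{T}|\mathcal{I},\mathcal{B}',\mathcal{S}'}\big(\log p(\mathcal{T}|\mathcal{S})\big) + \log p(\mathcal{S}) \tag{3}$$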
To solve these two equations the EM algorithm iterates between the Expectation Step (E-Step) and the Maximization Step (M-Step). The E-Step first updates ℬ′ and 𝒮′ with ℬ″ and 𝒮″. Then it calculates the expected value of the two functions based on ℬ′ and 𝒮′. The M-Step approximates separately the MAP estimates ℬ″ and 𝒮″ based on the results of the E-Step. For a general overview of EM we refer the reader to [11].
In the remainder of this section we will first discuss the two MAP estimation problems separately and then integrate these two MAP estimation problems into the EM framework.
Estimating the Intensity Inhomogeneities
To find the MAP estimate of ℬ we assume statistical independence of the voxel location x for ℬ and ℐ. Therefore, Equation (2) simplifies to:
(4) |
The conditional intensity distribution is modeled by a Gaussian distribution:
where n is the number of input channels, and (μa, σa) define the intensity distribution of structure a. A footnote marker x in an equation refers to footnote x for further explanation. Let’s define
and the weights 𝒲x(a) := E𝒯|ℐ,ℬ′,𝒮′(𝒯x(a)) so that Equation (4) turns into
As Wells et al. show [12], the above problem can be approximated by a low pass filter H applied to the weighted residual R̄: ℬ ≈ HR̄. Now, we will explicitly define the weights 𝒲x(a) := E𝒯|ℐ,ℬ′,𝒮′(𝒯x(a)):
(5) |
We will model p(𝒯x(a) = 1|𝒮) as a measure of agreement between the shape 𝒮 and the label map 𝒯. This is achieved by transforming the distance maps 𝒟𝒮 produced by 𝒮 into binary maps through ℋ:
where ℋ𝒮(x,a) is the Heaviside function for structure a. p(𝒯|𝒮) penalizes any disagreement between 𝒯x and ℋ𝒮(x) = (ℋ𝒮(x,1),⋯,ℋ𝒮(x,N))T:
where d is a correlation metric between 𝒯x and ℋ𝒮(x). Here d(v1, v2) := (v1 − v2)T (v1 − v2), which means d is zero when v1 and v2 agree, and 1 or greater when they disagree. f(𝒯x) represents a prior probability on 𝒯x defined by a probability atlas [8]. We therefore can ignore f in the normalizing function Z, with m being the number of voxels in the image
If defines the local conditional probability, then
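The binarization ℋ and the disagreement measure d used in this subsection can be sketched in a few lines of Python. The sketch assumes that a strictly positive signed distance means "inside the structure" (the treatment of exactly zero is not stated in the text) and the function names are hypothetical.

```python
def heaviside_labels(distance_maps_at_x):
    """Binary vector H_S(x): 1 where the structure's distance map is positive."""
    # Assumption: a strictly positive distance means "inside the structure".
    return [1 if value > 0 else 0 for value in distance_maps_at_x]


def disagreement(v1, v2):
    """d(v1, v2) = (v1 - v2)^T (v1 - v2), the squared Euclidean distance."""
    return sum((a - b) ** 2 for a, b in zip(v1, v2))


# Voxel x lies inside structure 1 and outside structure 2 according to the shape model.
shape_labels = heaviside_labels([1.5, -0.7])
agree = disagreement([1, 0], shape_labels)     # label map assigns x to structure 1
disagree = disagreement([0, 1], shape_labels)  # label map assigns x to structure 2
```

For binary indicator vectors, d counts the entries in which the label map and the binarized shape differ, so it is zero for agreement and at least 1 otherwise, as stated above.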
Estimating the Shape Parameters 𝒮
As mentioned in Section 2.1, statistical independence among the coefficients of 𝒮 = (𝒮1, ⋯, 𝒮K)T is assumed. Therefore, Equation (3) is solved for each component of 𝒮:
(6) |
where
is zero unless 𝒯x(a) ≠ ℋ𝒮(a) for a structure a and voxel x is located at the border of the shape of a. Thus, if Ω is the set of voxels at the boundaries of ℋ𝒮, Equation (6) simplifies to:
From the above equation, the updated shape parameter 𝒮i is defined by the weighted sum of its eigenvector values located at borders and scaled by the ith eigenvalue. In other words, the eigenvector values Ui(x) define the ‘direction of change’ for parameter 𝒮i and the eigenvalue λi controls the ‘speed of change’.
2.3 The Shape Constraint EM Algorithm
The EM algorithm is now defined by the E-Step, which generates the structure posterior probabilities 𝒲, called weights, based on the constraints imposed by shape, intensity, image inhomogeneities, and location (see Equation (5)).
The M-Step calculates the image inhomogeneities ℬ and shape parameters 𝒮 based on the newly updated weights 𝒲. ℬ = H·R̄ is approximated by a simple low pass filter H applied to the weighted residual R̄ (see also [12]).
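The inhomogeneity update can be illustrated with a one-dimensional, single-channel Python sketch. It assumes a moving average as the low pass filter and normalizes the smoothed weighted residual by the smoothed mean inverse variance, a form commonly used for this kind of filter; both choices, and all function names, are assumptions for illustration and not details taken from the paper.

```python
def moving_average(values, radius):
    """Simple low pass filter: mean over a window truncated at the borders."""
    smoothed = []
    for i in range(len(values)):
        window = values[max(0, i - radius): i + radius + 1]
        smoothed.append(sum(window) / len(window))
    return smoothed


def estimate_bias(log_intensity, weights, means, variances, radius=2):
    """Approximate the bias field as a low-pass-filtered weighted residual.

    log_intensity:    observed log intensities, one per voxel
    weights:          weights[x][a], posterior of structure a at voxel x
    means, variances: Gaussian intensity parameters per structure
    """
    residual, inverse_variance = [], []
    for value, w_x in zip(log_intensity, weights):
        # Weighted residual and mean inverse variance at this voxel.
        residual.append(sum(w * (value - mu) / var
                            for w, mu, var in zip(w_x, means, variances)))
        inverse_variance.append(sum(w / var for w, var in zip(w_x, variances)))
    smooth_residual = moving_average(residual, radius)
    smooth_inverse = moving_average(inverse_variance, radius)
    return [r / c for r, c in zip(smooth_residual, smooth_inverse)]


# Two structures, a constant bias of 0.5 added to every voxel, and hard weights.
means, variances = [1.0, 3.0], [0.25, 0.5]
labels = [0, 0, 1, 1, 0]
observed = [means[a] + 0.5 for a in labels]
hard_weights = [[1.0 if a == b else 0.0 for b in range(2)] for a in labels]
bias = estimate_bias(observed, hard_weights, means, variances)
```

With a constant offset and perfectly known labels, the estimate recovers the offset at every voxel; with soft weights the filter radius trades smoothness of the field against its ability to follow local intensity drift.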
The shape parameters 𝒮 = (𝒮1, ⋯, 𝒮K)T are updated in the M-Step by:
The EM algorithm iterates between E- and M-Step until the cost function Q((ℬ,𝒮),(ℬ′,𝒮′)) of Equation (1) converges to a local maximum, which is guaranteed by the EM framework if the iteration sequence has an upper bound [11].
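The alternation between E- and M-Step and the convergence check on the cost function can be seen in a deliberately simplified Python example. It is not the shape-constrained segmenter: it is a toy two-class, one-dimensional Gaussian mixture with equal class priors and a fixed standard deviation, where the class means stand in for the parameters (ℬ, 𝒮) and the log-likelihood stands in for Q. All names and values are hypothetical.

```python
import math


def e_step(data, means, sigma):
    """Posterior weight of each class at each sample (equal class priors)."""
    weights = []
    for x in data:
        likelihoods = [math.exp(-0.5 * ((x - mu) / sigma) ** 2) for mu in means]
        total = sum(likelihoods)
        weights.append([value / total for value in likelihoods])
    return weights


def m_step(data, weights, n_classes):
    """Weighted mean per class, the maximizer for fixed weights."""
    return [
        sum(w[a] * x for w, x in zip(weights, data)) / sum(w[a] for w in weights)
        for a in range(n_classes)
    ]


def log_likelihood(data, means, sigma):
    """Log-likelihood of the data under the equal-prior mixture."""
    norm = 1.0 / (len(means) * sigma * math.sqrt(2.0 * math.pi))
    return sum(
        math.log(norm * sum(math.exp(-0.5 * ((x - mu) / sigma) ** 2) for mu in means))
        for x in data
    )


def run_em(data, means, sigma=1.0, tol=1e-9, max_iter=100):
    """Iterate E- and M-Step until the cost stops increasing by more than tol."""
    history = [log_likelihood(data, means, sigma)]
    for _ in range(max_iter):
        weights = e_step(data, means, sigma)
        means = m_step(data, weights, len(means))
        history.append(log_likelihood(data, means, sigma))
        if history[-1] - history[-2] < tol:
            break
    return means, history


data = [0.0, 0.2, -0.2, 4.0, 4.2, 3.8]
means, history = run_em(data, [1.0, 3.0])
```

The recorded cost never decreases from one iteration to the next, which is the property the stopping rule relies on; the loop ends at a local maximum that depends on the initial values.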
3 Validation
We validate our approach by segmenting 32 test cases into white matter, grey matter, cerebrospinal fluid, and the left and right thalamus. As gold standard, the study uses segmentations from one expert, which are restricted to the right and left thalamus. To introduce no bias into the segmentation approach, we only generated shape atlases for those two structures (see also Section 2.1). The shape atlases are produced for each test case by applying PCA to the remaining 31 cases. From the analysis we use the first five modes of variation, which correspond to 99% of the eigenvalues’ energy. Furthermore, we manually calibrate the EM segmentations by comparing one automatic segmentation result to an expert’s segmentation. Especially for structures like the thalamus, where borders are not clearly visible, experts’ opinions about the structure’s boundary vary widely. Therefore, this manual calibration is essential so that automatically generated results meet the experts’ expectations.
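The leave-one-out construction of the shape atlases described above amounts to the following split, sketched in Python with hypothetical integer case identifiers:

```python
def leave_one_out_splits(case_ids):
    """Yield (test_case, training_cases) pairs, holding out one case at a time."""
    for held_out in case_ids:
        training = [case for case in case_ids if case != held_out]
        yield held_out, training


cases = list(range(1, 33))  # hypothetical identifiers for the 32 cases
splits = list(leave_one_out_splits(cases))
```

Each of the 32 splits pairs one test case with the 31 cases used to build its shape atlas, so no test case contributes to its own shape model.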
To measure the robustness of the method (EM-Shape), we compare the automatic with the expert segmentations using the Dice volume overlap measure [13]. We then compare the expert’s segmentations to the results of two other EM implementations. The first algorithm (EM-Rigid) uses rigid alignment of atlas information and no shape constraints. The second implementation (EM-NonRigid) also does not incorporate shape constraints but uses non-rigid registration for the initial alignment and models neighborhood relationships through a Markov Random Field approximation [8].
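For reference, the Dice measure of two binary segmentations A and B is 2|A ∩ B| / (|A| + |B|). A minimal Python sketch, assuming both segmentations are given as flattened binary masks of equal length (the function name and the convention for two empty masks are choices made here):

```python
def dice(segmentation_a, segmentation_b):
    """Dice overlap 2|A ∩ B| / (|A| + |B|) for two binary masks of equal length."""
    if len(segmentation_a) != len(segmentation_b):
        raise ValueError("masks must have the same number of voxels")
    intersection = sum(1 for a, b in zip(segmentation_a, segmentation_b) if a and b)
    size_sum = sum(1 for a in segmentation_a if a) + sum(1 for b in segmentation_b if b)
    if size_sum == 0:
        return 1.0  # convention chosen here: two empty masks agree perfectly
    return 2.0 * intersection / size_sum


automatic = [1, 1, 0, 0, 1]
expert = [1, 0, 1, 0, 1]
score = dice(automatic, expert)
```

The measure ranges from 0 (no overlap) to 1 (identical segmentations); in the example, two of the three voxels in each mask coincide, giving a value of 2/3.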
Generally, EM-Shape outperformed the other two methods (see also Table 1). It had the highest mean Dice measure, the lowest variance, the highest minimum Dice measure over all cases, and the highest maximum Dice measure. Of the three methods, EM-Shape relies the least on the initial registration of the atlas to the patient. The new shape constraints allow a better adjustment of the EM parameters to the specific brain images during the segmentation process. It can capture subtle differences in shape, such as the hypothalamus, which is underrepresented in both EM-Rigid and EM-NonRigid (see 3D images in Figure 3).
Table 1. Dice measure over 32 cases.
Method | Mean | Variance | Minimum | Maximum
---|---|---|---|---
EM-Rigid | 0.755 | 0.0221 | 0.449 | 0.883
EM-NonRigid | 0.715 | 0.0149 | 0.34 | 0.883
EM-Shape | 0.82 | 0.0117 | 0.625 | 0.909
EM-NonRigid relies heavily on the initial non-rigid registration. Even though it produced excellent results for the superior temporal gyrus [14], it performed worse on the thalamus, because the initial alignment process cannot detect the thalamus’ weakly visible boundaries. It produces very smooth segmentations due to the Mean Field approximation, which models neighborhood dependencies within an image. On the downside, it also smooths over subtle differences within small gyri and the thalamus, which are better captured by EM-Shape and EM-Rigid.
4 Discussion
A novel shape-constrained segmentation approach has been presented. Embedded in an EM segmentation framework, the algorithm handles multiple brain structures and estimates the intensity inhomogeneities. It generates high quality segmentations of structures with weakly visible boundaries. The approach is not restricted to the modes of variation present in the shape model but can also model patient-specific abnormalities. Furthermore, we have documented its robustness by segmenting 32 different cases and comparing the results to other EM-like methods as well as manual segmentations.
In the future we would like to include more complex conditional probabilities that better model the dependencies between label maps and the shape of the object. We also would like to couple pose and labeling of the objects because their solutions depend on each other.
Acknowledgments
This investigation was supported by a research grant from the Whitaker Foundation, by NIH grants (R21 MH67054, R01 LM007861, P41 RR13218, P01 CA67165) and by NSF ERC 8810-27499. We would like to thank Katherine Long, Florent Segonne, Lilla Zollei, Polina Golland, Samson Timoner, and Monica Vantoch for their valuable contributions to this paper.
Footnotes
and p(ℬ|𝒯) = p(ℬ).
ea has a 1 at position a and 0 otherwise
Bayes’ rule: Σ𝒯(i,aj) p(𝒯(1,a1),⋯,𝒯(n,am)|ℐ,ℬ′) · 𝒯x(a) = p(𝒯x(a) = 1|ℐ,ℬ′)
Based on previous independence assumption
If we assume each voxel is part of only one shape, then |ℋ𝒮(x)| = 1 and
where δ is the Dirac’s delta function and the Eigenvector matrix Ua was defined in Section 2.1
where δ0 is the null function with δ0(0) = 1, δ0(x) = 0 for x ≠ 0, and δ0(X) := (δ0(X(1)),⋯ δ0(X(n)))T for a vector X
References
- 1. Leventon M, Grimson W, Faugeras O. Statistical shape influence in geodesic active contours. IEEE Conference on Computer Vision and Pattern Recognition; 2000. pp. 1316–1323.
- 2. Tsai A, Yezzi A, Wells W, Tempany C, Tucker D, Fan A, Grimson W, Willsky A. A shape-based approach to the segmentation of medical imagery using level sets. IEEE Transactions on Medical Imaging. 2003;22(2):137–154. doi: 10.1109/TMI.2002.808355.
- 3. Rousson M, Paragios N, Deriche R. Tech Rep 4984. Institut National de Recherche en Informatique et en Automatique; Sophia-Antipolis: 2003. Active shape models from a level set perspective. ftp://ftp.inria.fr/INRIA/publication/publi-pdf/RR/RR-4984.pdf.
- 4. Cootes T, Hill A, Taylor C, Haslam J. The use of active shape models for locating structures in medical imaging. Image and Vision Computing. 1994;12(6):335–366.
- 5. Kelemen A, Szekely G, Gerig G. Elastic model-based segmentation of 3-d neuroradiological data sets. IEEE Transactions on Medical Imaging. 1999;18:828–839. doi: 10.1109/42.811260.
- 6. Pizer SM, Gerig G, Joshi S, Aylward SR. Multiscale medial shape-based analysis of image objects. Proceedings of the IEEE, Special Issue on: Emerging Medical Imaging Technology. 2003;91:670–679.
- 7. Papademetris X, Sinusas AJ, Dione DP, Constable RT, Duncan JS. Estimation of 3-d left ventricular deformation from medical images using biomechanical models. IEEE Transactions on Medical Imaging. 2002;21:786–800. doi: 10.1109/TMI.2002.801163.
- 8. Pohl K, Bouix S, Kikinis R, Grimson W. Anatomical guided segmentation with non-stationary tissue class distributions in an expectation-maximization framework. IEEE International Symposium on Biomedical Imaging. 2004:81–84. doi: 10.1109/ISBI.2004.1398479.
- 9. Cootes T, Edwards G, Taylor C. Active appearance model. European Conference on Computer Vision (ECCV) 1998;2:484–498.
- 10. Warfield S, Rexilius J, Huppi P, Inder T, Miller E, Wells W, Zientara G, Jolesz F, Kikinis R. A binary entropy measure to assess nonrigid registration algorithm. Medical Image Computing and Computer-Assisted Intervention. 2001 Oct:266–274.
- 11. McLachlan GJ, Krishnan T. The EM Algorithm and Extensions. John Wiley and Sons, Inc; 1997.
- 12. Wells W, Grimson W, Kikinis R, Jolesz F. Adaptive segmentation of MRI data. IEEE Transactions on Medical Imaging. 1996;15:429–442. doi: 10.1109/42.511747.
- 13. Dice LR. Measure of the amount of ecological association between species. Ecology. 1945;26:297–302.
- 14. Pohl K, Wells W, Guimond A, Kasai K, Shenton M, Kikinis R, Grimson W, Warfield S. Incorporating non-rigid registration into expectation maximization algorithm to segment MR images. Medical Image Computing and Computer-Assisted Intervention. 2002:564–572. doi: 10.1007/3-540-45786-0_70.