Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2011 Mar 8.
Published in final edited form as: Med Image Comput Comput Assist Interv. 2010;13(Pt 2):151–159. doi: 10.1007/978-3-642-15745-5_19

A Generative Model for Brain Tumor Segmentation in Multi-Modal Images

Bjoern H Menze 1,2, Koen Van Leemput 1,3,4, Danial Lashkari 1, Marc-André Weber 5, Nicholas Ayache 2, Polina Golland 1
PMCID: PMC3050038  NIHMSID: NIHMS272389  PMID: 20879310

Abstract

We introduce a generative probabilistic model for segmentation of tumors in multi-dimensional images. The model allows for different tumor boundaries in each channel, reflecting difference in tumor appearance across modalities. We augment a probabilistic atlas of healthy tissue priors with a latent atlas of the lesion and derive the estimation algorithm to extract tumor boundaries and the latent atlas from the image data. We present experiments on 25 glioma patient data sets, demonstrating significant improvement over the traditional multivariate tumor segmentation.

1 Introduction

Limited therapy options require a careful diagnostic for patients with brain tumors. A multitude of available brain imaging sequences gives rise to patient data sets that include multi-parametric, multi-modal, and multi-temporal volumes even in standard clinical settings. Quantitative analysis of a lesion in these data poses a challenging computational problem. In this paper, we present a fully automated method for channel-specific tumor segmentation in such multidimensional images.

Generative probabilistic models of spatial tissue distribution and appearance have enjoyed popularity for tissue classification as they exhibit good generalization to unseen images [1,2,3]. Encoding spatial prior knowledge for a lesion, however, is difficult. Tumors may be modeled as outliers relative to the expected shape [4,5] or image signal of healthy tissues [2,6]. In [2], for example, a criterion for detecting outliers is used to generate a tumor prior in a subsequent EM segmentation which is treating tumor as an additional tissue class. Alternatively, the spatial prior for the tumor can be derived from the appearance of tumor-specific bio-markers [7,8]. The tumor classification methods can be augmented with spatial regularization using a Markov Random Field prior [9] or a boundary finding step [2,10] to ensure spatial contiguity of the segmentation results.

Discriminative approaches directly learn the difference between the appearance of the lesion and other tissues and do not rely on spatial priors [11,12,13,14,15,16]. They do, however, often require substantial amounts of training data and typically come at the cost of manual interaction for initialization and postprocessing. Most require the imaging protocol to be exactly the same in the training set and in the novel images to be segmented. Discriminative approaches proposed for tumor segmentation may use specific anatomical [13], but also generic image features (e.g., wavelets [11]) as input to the classifier. A spatial regularization via boundary modeling [11,12,13] or Markov Random Fields [14,15,16] has proved useful when used with discriminative methods as well.

Both generative and discriminative models face significant challenges when applied to multi-modal data. Automatic discriminative approaches are limited to the image modalities of the training set and are sensitive to missing data. Generative models may generalize straightforwardly to multi-channel observations [8,7], but do not allow for modeling differences between the biological processes observed in different modalities. By assuming the same shape and extend of pathology in all modalities, the standard multi-channel segmentation may ignore much of the information potentially available in images. Examples include differences in tissue water (T2, Flair-MRI), enhancement of contrast agents (post-Gadolinium T1-MRI), diffusion (DTI, DCE-MRI), or relative concentrations of selected metabolites (MRSI). Delineating the tumor area in each of these modalities individually is highly preferred for subsequent quantitative analysis of tumor shape and evolution.

We present a tumor appearance model for such multi-dimensional sequences and derive an algorithm for a channel-specific segmentation of the tumor. The method shares information about the spatial location of the lesion among channels while making full use of the highly specific multi-modal signal of the healthy tissue classes for segmenting normal tissues in the brain. In addition to tissue types, the model includes a latent variable for each voxel encoding the probability of observing tumor at that voxel. We derive an estimation algorithm for this model that generalizes the standard atlas-based EM segmentation. In our experiment with 25 multi-modal image volumes, the proposed approach performs significantly better than the traditional multivariate tissue classification method that assumes a single tumor segmentation that is shared by all channels.

2 Generative Tumor Model

We use a generative modeling approach, in which we first build an explicit statistical model of image formation and subsequently use this model to derive a fully automatic segmentation algorithm. Fig. 1 illustrates our generative model.

Fig. 1.

Fig. 1

Graphical model for the proposed segmentation approach. Voxels are indexed with i, the channels are indexed with c. The known prior πk determines the label k of the normal, healthy tissue. The latent atlas α determines the channel-specific presence of tumor t. Normal state k, tumor state t, and intensity distribution parameters θ jointly determine the multi-modal image observations y. Observed (known) quantities are shaded. The tumor segmentation aims to estimate p(ticy), along with the segmentation of healthy tissue p(ki|y).

We model the normal state of the healthy brain using a spatially varying probabilistic prior πk for each of the K tissue classes (Fig. 1, blue). This prior (atlas) is estimated from prior examples and is assumed to be known. At each voxel i, the atlas defines a multinomial distribution for the tissue label ki:

p(ki=k)=πki. (1)

The normal state ki is shared among all C channels at voxel i. In our experiments we assume K = 3, representing gray matter, white matter and cerebrospinal fluid (CSF).

We model the tumor state using a spatially varying “latent” probabilistic atlas α, similar to [10] (Fig. 1, red). At each voxel i, this atlas provides a scalar parameter αi that defines the probability of observing tumor at that voxel. Parameter αi is unknown and is estimated as part of the segmentation process. We define a latent tumor state tic{0,1} that indicates the presence of tumor in channel c at voxel i and model it as a Bernoulli random variable with parameter αi. We form a binary tumor state vector ti=[ti1,,tiC]T indicating the tumor presence for all c observations at voxel i, with probability

p(ti;αi)=cp(tic;αi)=cαitic·(1αi)1tic. (2)

Image observations yic are generated by Gaussian intensity distributions for each of the K tissue classes and the C channels, with mean μkc and variance vkc, respectively (Fig. 1, purple). In tumor tissue (i.e., if tic=1) the normal observations are replaced by intensities from another set of channel-specific Gaussian distributions with mean μK+1c and variance vK+1c, representing the tumor class. Letting θ denote the set of all mean and variance parameters, and yi=[yi1,,yiC]T denote the vector of the intensity observations at voxel i, we define the data likelihood:

p(yiti,ki;θ)=cp(yictic,ki;θ)=c[N(yic;μkic,vkic)1tic·N(yic;μK+1c,vK+1c)tic], (3)

where Inline graphic (·; μ, v) is the Gaussian distribution with mean μ and variance v.

Finally, the joint probability of the the latent atlas and the observed variables

p(yi,ti,ki;θ,αi)=p(yiti,ki;θ)·p(ti;αi)·p(ki) (4)

is the product of the components defined in Eqs. (13).

3 Maximum Likelihood Parameter Estimation

We seek Maximum Likelihood estimates of the model parameters {θ, α}:

θ^,α^=argmaxθ,αp(y1,,yN;θ,α)=argmaxθ,αi=1Np(yi;θ,α),

where N is the number of voxels in the volume and

p(yi;θ,α)=tikip(yi,ti,ki;θ,α).

Observing that evaluating the objective function involves summing over values of ti and ki in Eq. (4), we use Jensen’s inequality to perform the optimization using an iterative, EM-style minorization technique [17]. Letting {θ̃, α̃} denote the current parameter estimates, we can compute the posterior probability of any of the 2C tumor state vectors ti, writing out the components of Eq. (4):

qi(ti)p(tiki,yi;θ,α)kip(yiti,ki;θ)p(ti;αi)p(ki), (5)

and Σtiqi(ti) = 1. Based only on the intensity channels that do not show tumor ( tic=0), we also compute the posterior probability of tissue k at voxel i:

wik(ti)p(kiti,yi;θ,α)πkicN(yic;μkc,vkc)1tic,

and Σk wik (ti) = 1 for all ti. Using qi(·) and wik(·), we arrive at closed-form update expressions that guarantee increasingly better estimates of the model parameters. The updates are intuitive: the latent tumor prior is an average of the corresponding posterior estimates

αitiqi(ti)(1Cctic)

and the intensity parameters are updated with the weighted statistics of the data for the healthy tissues (k = 1, …, K)

μkcitiqi(ti)wik(ti)(1tic)yicitiqi(ti)wik(ti)(1tic),vkcitiqi(ti)wik(ti)(1tic)(yicμkc)2itiqi(ti)wik(ti)(1tic)

and for the tumor class:

μK+1citiqi(ti)ticyicitiqi(ti)tic,vK+1citiqi(ti)tic(yicμK+1c)2itiqi(ti)tic.

We iterate the estimation of the parameters {θ̃, α̃} and the computation of the posterior probabilities {qi(·), wik (·)} until convergence.

4 Tumor Segmentation

Once we have an estimate of the model parameters {θ̂, α̂}, we can evaluate the probability that tumor is visible in channel c of voxel i by summing over all the configurations ti for which tic=1:

p(tic=1yi;θ^,α^)=titicp(tiyi;θ^,α^)=titicqi(ti). (6)

We then assign channel c of voxel i to tumor if p(tic=1yi;θ^,α^)>0.5.

5 Extensions

To augment the generative model outlined above with further physiological knowledge, we derive and implement extensions considering the expected shape, multivariate signal and structural appearance of the tumor.

Little spatial context is used in the basic model, as we assume the tumor state ti in each voxel to be independent from the state of other voxels (Eq. 6 and Eq. 3). It is only the atlas πk that encourages smooth classification for the healthy tissue classes by imposing similar priors in neighboring voxels. To encourage a similar smoothness of the tumor labels, we extend the latent atlas α to include a Markov Random Field (MRF) prior:

p(t1,,tN;β,α)ciαitic(1αi)1ticexp[β2jNitic(1tjc)+tjc(1tic)].

Here, Inline graphic denotes the set of the six nearest neighbors of voxel i, and β is a parameter governing how similar the tumor states tend to be in neighboring voxels. When β = 0, there is no interaction between voxels and the model reduces to the one described in Section 2. For β ≠ 0, the posteriors qi(ti) are no longer given by Eq. (5), and their exact computation becomes infeasible. However, relaxing the MRF to a mean-field approximation [18] we derive an efficient approximate algorithm. We let

nic=jNitjtjcqj(tj)

denote the currently estimated “soft” count of neighbors that show tumor in channel c. The mean-field approximation implies

p(tiαi)cγitic(1γi)(1tic),whereγi=αiαi+(1αi)exp[β(2nic6)]

to replace the previously defined p(ti|αi) in Eq. (4), leading to smoothed estimates of the tumor segmentations.

Moreover, we want to account for the non-homogeneity in the appearance of the tumor class, as gliomas show characteristic substructures such as active and necrotic areas and edema. We model this via a straightforward extensions of the tissue classes to include more than one class for tumor in a second modification to our approach. Finally, to consider higher-order interactions in the multivariate biological signal yi of healthy tissue, we can relax the conditional independence of observations across channels by using multivariate Gaussians in the data likelihood in Eq. (3). We report tests of these three extensions in the next section.

6 Experiments

We evaluate our approach on a data set of 25 patients with glioma. The data set comprises T1, T2, FLAIR-, and post-Gadolinium T1 MR images. Tumors were outlined by a rater in three planes intersecting with the tumor center. We register all images of a patient to the FLAIR volume by using affine registration and segment the volume into the three healthy and an outlier class using a freely available implementation of the EM segmentation with bias correction [1]. Outliers are defined as being more than three standard deviations away from the centroid of any of the three normal tissue classes.

We apply our algorithm to the bias field corrected volumes and initialize intensity parameters with values estimated in the initial segmentation. When using multivariate distributions Inline graphic (·; μ; V), we initialize off-diagonal element in V to zero. When modeling the tumor class with multiple Gaussian densities we initialize the means of additional subclasses to random values. We use outliers in the initial segmentation to initialize the latent atlas α, setting αi for pixels of the outlier class to 0.7 and otherwise to 0.3. We typically observe convergence after 10 to 15 steps. For comparison, we also implement an EM segmentation treating tumor as one of the tissue classes, with a weak MRF prior for spatial regularization, similar to [2]. We use the same data and initializations as above, but augment the atlas by a tumor prior obtained by smoothing the outlier class of the initial segmentation with a 3cm kernel. This alternative segmentation is applied to every single image volume individually in a first experiment, and to the multivariate features of the whole multi-modal volume in a second experiment. To evaluate the classification maps we calculate Dice scores for both methods [19].

Fig. 2 illustrates results for two different subjects. We note that the tumor boundaries change across different modalities and the proposed method captures this variation well, even in post-therapeutic images. The method produces few false positives which can be easily eliminated in a post-processing step. We evaluate the robustness and accuracy of our method in a series of experiments. First, we test the sensitivity of the performance to the MRF parameter β that governs the smoothness of the resulting segmentations. It is the only parameter to be adjusted in our model. We find the performance of the algorithm to be relatively stable for a wide range of β values (Fig. 3), irrespective of size, location or shape of the tumor (i.e., β ∈ [1; 50]). For simplicity, we set β = 1 in all further experiments. In the second experiment, we test different model options for normal tissue and tumor classes. We find little differences between approaches that use non-zero off-diagonal elements in the covariance matrix of the intensity likelihood and those that do not. Modeling tumor by three Gaussians improves the result for some cases, but leads to a somewhat lower performance on average. In a third experiment, we compare our approach to the two alternative EM segmentation methods (Fig. 4). Here, we find the proposed channel-specific segmentation to always perform significantly better (p < 5 · 10−4, paired Cox-Wilcoxon test); it improves the absolute value of the Dice score over all four modalities for nearly all data sets by 0.1 to 0.2 (Fig. 4, right).

Fig. 2.

Fig. 2

Examples of channel-specific segmentation results for four different modalities, in two patients. The outlines of regions with p(tic=1yi;θ^,α^)>0.5 are shown in red. The proposed method localizes the tumor reliably in post-therapeutic images (below), where surgery has led to significant deviations from normalcy for healthy tissues.

Fig. 3.

Fig. 3

Sensitivity to the MRF parameter β. Indicated are the median (solid line) and the interquartile ranges of the average Dice scores of all 25 data set. While some regularization is beneficial, the segmentation performance is relatively insensitive to the choice of the only model parameter β.

Fig. 4.

Fig. 4

Benefits of the channel-specific segmentation. Boxplots show median, quartiles and outliers for the Dice scores of all 25 subjects, for all four modalities. Our channel-wise segmentation (c, green) improves over both multiple univariate (u, blue) and multivariate (m, red) segmentation, both in the absolute terms (left) and with respect to patient-specific differences (right). The right figure shows cu and cm.

7 Conclusions

We present a generative model for tumor appearance in multi-modal image volumes of the brain that provides channel-specific segmentations. We derive an estimation algorithm and demonstrate superior performance over standard multivariate EM segmentation. Unlike discriminative tumor segmentation methods, our model is applicable to any set of multi-modal image volumes, and is fully automatic. Further extensions of the model may consider structure of the tumor, or temporal evolution in longitudinal data sets.

Acknowledgments

This work was supported by the German Academy of Sciences Leopoldina (Fellowship Programme LPDS 2009-10), the Academy of Fin-land (133611), INRIA CompuTumor, NIH NIBIB NAMIC U54-EB005149, NIH NCRR NAC P41-RR13218, NIH NINDS R01-NS051826, NIH R01-NS052585, NIH R01-EB006758, NIH R01-EB009051, NIH P41-RR014075 and the NSF CAREER Award 0642971.

References

  • 1.Van Leemput K, Maes F, Vandermeulen D, Suetens P. Automated model-based tissue classification of MR images of the brain. IEEE TMI. 1999;18:897–908. doi: 10.1109/42.811270. [DOI] [PubMed] [Google Scholar]
  • 2.Prastawa M, Bullitt E, Ho S, Gerig G. A brain tumor segmentation framework based on outlier detection. MedI A. 2004;8:275–283. doi: 10.1016/j.media.2004.06.007. [DOI] [PubMed] [Google Scholar]
  • 3.Pohl KM, Fisher J, Levitt JJ, Shenton ME, Kikinis R, Grimson WEL, Wells WM. A unifying approach to registration, segmentation, and intensity correction. In: Duncan JS, Gerig G, editors. LNCS; MICCAI 2005; Heidelberg: Springer; 2005. pp. 310–318. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Zacharaki EI, Shen D, Davatzikos C. Orbit: A multiresolution framework for deformable registration of brain tumor images. IEEE TMI. 2008;27:1003–1017. doi: 10.1109/TMI.2008.916954. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bach Cuadra B, Pollo C, Bardera A, Cuisenaire O, Thiran JP. Atlas-based segmentation of pathological brain MR images using a model of lesion growth. IEEE TMI. 2004;23:1301–1314. doi: 10.1109/TMI.2004.834618. [DOI] [PubMed] [Google Scholar]
  • 6.Gering DT, Grimson WEL, Kikinis R. Recognizing deviations from normalcy for brain tumor segmentation. In: Dohi T, Kikinis R, editors. LNCS; MICCAI 2002; Heidelberg: Springer; 2002. pp. 388–395. [Google Scholar]
  • 7.Moon N, Bullitt E, Van Leemput K, Gerig G. Model-based brain and tumor segmentation. Proc ICPR; 2002. pp. 528–531. [Google Scholar]
  • 8.Prastawa M, Bullitt E, Moon N, Van Leemput K, Gerig G. Automatic brain tumor segmentation by subject specific modification of atlas priors. Acad Radiol. 2003;10:1341–1348. doi: 10.1016/s1076-6332(03)00506-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Van Leemput K, Maes F, Vandermeulen D, Colchester A, Suetens P. Automated segmentation of multiple sclerosis lesions by model outlier detection. IEEE TMI. 2001;20:677–688. doi: 10.1109/42.938237. [DOI] [PubMed] [Google Scholar]
  • 10.Riklin-Raviv T, Menze BH, Van Leemput K, Stieltjes B, Weber MA, Ayache N, Wells W, III, Golland P. Joint segmentation via patient-specific latent anatomy model. Proc. MICCAI-PMMIA; 2009. [Google Scholar]
  • 11.Cobzas D, Birkbeck N, Schmidt M, Jagersand M, Murtha A. 3D variational brain tumor segmentation using a high dimensional feature set. Proc. ICCV; 2007. pp. 1–8. [Google Scholar]
  • 12.Lefohn A, Cates J, Whitaker R. Interactive, GPU-based level sets for 3D brain tumor segmentation. In: Ellis RE, Peters TM, editors. LNCS; MICCAI 2003; Heidelberg: Springer; 2003. pp. 568–572. [Google Scholar]
  • 13.Ho S, Bullitt E, Gerig G. Level-set evolution with region competition: automatic 3D segmentation of brain tumors. Proc. ICPR; 2002. pp. 532–535. [Google Scholar]
  • 14.Gorlitz L, Menze BH, Weber MA, Kelm BM, Hamprecht FA. Semi-supervised tumor detection in magnetic resonance spectroscopic images using discriminative random fields. In: Hamprecht FA, Schnörr C, Jähne B, editors. LNCS; DAGM 2007; Heidelberg: Springer; 2007. pp. 224–233. [Google Scholar]
  • 15.Lee C, Wang S, Murtha A, Greiner R. Segmenting brain tumors using pseudo-conditional random fields. In: Metaxas D, Axel L, Fichtinger G, Székely G, editors. LNCS; MICCAI 2008, Part II; Heidelberg: Springer; 2008. pp. 359–366. [PubMed] [Google Scholar]
  • 16.Wels M, Carneiro G, Aplas A, Huber M, Hornegger J, Comaniciu D. A discriminative model-constrained graph cuts approach to fully automated pediatric brain tumor segmentation in 3D MRI. In: Metaxas D, Axel L, Fichtinger G, Székely G, editors. LNCS; MICCAI 2008, Part I; Heidelberg: Springer; 2008. pp. 67–75. [DOI] [PubMed] [Google Scholar]
  • 17.Hunter D, Lange K. A tutorial on MM algorithms. American Statistician. 2004;58:30–37. [Google Scholar]
  • 18.Jordan MI, Gaharamani Z, Jaakola TS, Saul LK. An introduction to variational methods for graphical models. Machine Learning. 1999;37:183–233. [Google Scholar]
  • 19.Dice L. Measure of the amount of ecological association between species. Ecology. 1945;26:297–302. [Google Scholar]

RESOURCES