Abstract
Image-guided radiotherapy (IGRT) requires fast and accurate localization of the prostate in 3-D treatment images, which is challenging due to low tissue contrast and large anatomical variation across patients. On the other hand, the IGRT workflow involves collecting a series of computed tomography (CT) images from the same patient under treatment. These images contain valuable patient-specific information yet are often neglected by previous works. In this paper, we propose a novel learning framework, namely incremental learning with selective memory (ILSM), to effectively learn the patient-specific appearance characteristics from these patient-specific images. Specifically, starting with a population-based discriminative appearance model, ILSM aims to “personalize” the model to fit the patient-specific appearance characteristics. The model is personalized in two steps: backward pruning, which discards obsolete population-based knowledge, and forward learning, which incorporates patient-specific characteristics. By effectively combining the patient-specific characteristics with the general population statistics, the incrementally learned appearance model can localize the prostate of a specific patient much more accurately. This work has three contributions: 1) the proposed incremental learning framework can capture patient-specific characteristics more effectively than traditional learning schemes, such as pure patient-specific learning, population-based learning, and mixture learning with patient-specific and population data; 2) the learning framework does not make any parametric model assumption, hence allowing the adoption of any discriminative classifier; and 3) using ILSM, we can localize the prostate in treatment CTs accurately (DSC ∼0.89) and quickly (∼4 s), satisfying the real-world clinical requirements of IGRT.
Index Terms: Anatomy detection, image-guided radiotherapy (IGRT), incremental learning, machine learning, prostate segmentation
I. Introduction
IMAGE-guided radiotherapy (IGRT) is a newly developed technology for prostate cancer radiation treatment. It is usually recommended when patients are diagnosed with early-stage prostate cancer by biopsy [1], [2]. IGRT consists of a planning stage followed by a treatment stage (Fig. 1). In the planning stage, a planning computed tomography (CT) scan is acquired from the patient, and radiation oncologists then manually delineate the prostate for treatment planning. These steps are the same as in conventional radiotherapy. The novelty of IGRT lies in the treatment stage. To account for daily prostate motion, a CT scan called the treatment image is acquired on each treatment day right before the radiation therapy. Since the treatment image captures a current snapshot of the patient's anatomy, radiation oncologists are able to adapt the treatment plan to precisely target the radiation dose to the current positions of tumors while avoiding neighboring healthy tissues. Consequently, IGRT increases the probability of tumor control and typically shortens radiation therapy schedules [3], [4]. In order to effectively adapt the treatment plan, it is critical to localize the prostate in the daily treatment images quickly and accurately. Thus, an automatic prostate localization algorithm would be a valuable asset in IGRT.
However, prostate localization in treatment CT images is quite challenging for three reasons. First, unlike the planning CT image, the treatment CT images are typically acquired with low-dose protocols in order to reduce unnecessary radiation exposure to patients during treatment. As a result, the image contrast of treatment CT is relatively low compared to other modalities (e.g., MR and regular CT). Fig. 1 shows several typical treatment CTs and their prostate contours (red). Second, due to the presence of bowel gas and filling (indicated by red arrows in Fig. 1), the image appearance of treatment CTs can change dramatically. Third, unpredictable daily prostate motion [5] further complicates the prostate localization procedure.
Many methods have been proposed to address the aforementioned challenges (e.g., deformable models [6], [7] and deformable registration [8]). While such methods exhibit some effectiveness in CT prostate localization, their localization accuracy is often limited because they overlook a remarkable opportunity inherent in the IGRT workflow. In fact, at each treatment day, several CT scans of the patient have already been acquired and segmented on the planning day and previous treatment days. If the prostate appearance characteristics of this specific patient can be learned from these patient-specific images, an algorithm could exploit this information to localize the prostate much more effectively. Recently, Li et al. [9], Liao et al. [10], and Gao et al. [11] proposed different methods that use patient-specific information for CT prostate localization and achieved promising results. However, their methods require at least three manually segmented patient-specific images for patient-specific training, which imposes two major limitations: 1) there may not be sufficient patient-specific data available, especially in the early treatment days when only the planning CT is available; and 2) manual segmentation of patient-specific images is time consuming (∼11 min per image) even for experienced physicians. Additionally, these methods typically need minutes or even longer to localize the prostate due to the computationally expensive methodologies adopted (e.g., sparse coding, iterative voxel-wise classification, and deformable registration). If the prostate unexpectedly moves during such a long localization procedure, the localization result might become meaningless for IGRT.
To this end, we propose a novel learning scheme, namely incremental learning with selective memory (ILSM), for fast and accurate localization of the prostate in treatment CTs. Compared with previous prostate localization methods, the contributions of our work are two-fold: 1) by leveraging the large amount of population data (i.e., CT scans of other patients) together with the very limited amount of patient-specific data, ILSM is able to learn patient-specific characteristics from as few as one image of the patient and apply the learned model to localize the prostate even in the earliest treatment CTs; 2) our method obtains localization accuracy comparable (if not superior) to the state-of-the-art methods while substantially reducing the computational time to 4 s. To the best of our knowledge, this is the first prostate localization method that satisfies both the accuracy and efficiency requirements of the IGRT workflow. Also, compared to previous methods [9]–[11] that require manual annotation of the entire prostate on the patient-specific training images, our method only needs the annotations of seven prostate anatomical landmarks, thus significantly reducing the labor required for manual annotation.
To leverage both population and patient-specific data, our learning framework (ILSM) starts with learning a population-based discriminative appearance model. This model is then “personalized” according to the appearance information from CTs of the specific patient under treatment. Instead of either preserving or discarding all knowledge learned from the population, our method selectively inherits the part of population-based knowledge that is in accordance with the current patient, and at the same time incrementally learns the patient-specific characteristics. This is where the name “incremental learning with selective memory” comes from. Once the population-based discriminative appearance model is personalized, it can be used to detect distinctive anatomical landmarks in new treatment images of the same patient for fast prostate localization. Compared with traditional learning schemes, such as pure patient-specific learning, population-based learning, and mixture learning with patient-specific and population data, ILSM exhibits better capability to capture the patient-specific characteristics embedded in the data. We note that the preliminary version of our method was previously reported in a conference paper [12]. The present paper extends the method by using multi-atlas RANSAC for prostate localization, and further evaluates the performance with much more comprehensive experiments and on a larger dataset.
The rest of the paper is organized as follows. Section II gives an overview of related methods on both CT prostate localization and incremental learning. Section III presents our ILSM framework and the prostate localization procedure. The experimental results are provided in Section IV. Finally, Section V presents the conclusion.
II. Related Works
As mentioned, we employ incremental learning to localize the prostate in CT images. The following literature review will cover CT prostate localization and incremental learning, respectively.
A. CT Prostate Localization
Many methods have been proposed to address the challenging prostate localization problem in CT images. Most of them can be categorized into three groups: deformable models, deformable registration, and pixel-wise classification/labeling.
Deformable models are popular in medical image segmentation [13], [14], and are widely adopted in CT prostate localization. For example, Pizer et al. [15] proposed a medial shape model named M-reps for the joint segmentation of the bladder, rectum, and prostate. Freedman et al. [16] proposed to segment the prostate in CT by matching the probability distributions of photometric variables. Costa et al. [6] proposed coupled 3-D deformable models that consider the nonoverlapping constraint with the bladder. Feng et al. [17] proposed to selectively combine gradient profile features and region-based features to guide the deformable segmentation. Although deformable models have shown their robustness in many medical image segmentation problems, their performance highly depends on a good initialization of the model, which is difficult to obtain in CT prostate localization since the daily prostate motion is unpredictable and can sometimes be very large due to bowel gas and filling.
Deformable registration [18]–[21] has been investigated in the community for many years as a way to align corresponding structures between two images. It can also be used to localize the CT prostate by warping the previous treatment CTs (with the prostate segmented) of the same patient to the current treatment CT. For example, Foskey et al. [8] proposed a deflation method to explicitly eliminate bowel gas before 3-D deformable registration. Liao et al. [22] proposed a feature-guided deformable registration method by exploiting patient-specific information. Compared to deformable models, deformable registration takes into account global appearance information and is thus more robust to prostate motion. However, the nonrigid registration procedure is often time-consuming and typically takes minutes or even longer to localize the prostate, which is problematic if the prostate moves during the long localization procedure.
Pixel-wise classification/labeling has recently been proposed for precise prostate segmentation. The basic idea is to enhance the indistinct prostate in CT scans through pixel-wise labeling. Li et al. [9] proposed to utilize image context information to assist the pixel-wise classification, and a level-set was used to segment the prostate based on the classification response map. Gao et al. [23] proposed a sparse-representation-based classifier with a discriminatively learned dictionary and further employed multi-atlas labeling for prostate segmentation. Liao et al. [10] proposed a sparse patch-based label propagation framework that effectively transfers the labels from previous treatment CTs of the same patient for pixel-wise labeling. Shi et al. [24] proposed a semi-automated prostate segmentation method based on a spatially constrained transductive lasso for multi-atlas label fusion. Despite the high accuracy of these methods (mean DSC ≈ 0.9), they generally suffer from two limitations: 1) as with deformable registration, pixel-wise labeling is usually time consuming; and 2) in order to learn statistically reliable patient-specific appearance information, these methods require a sufficient number of manually segmented patient-specific images (typically at least three). In practice, one may not be able to collect enough patient-specific images, especially at the beginning of radiotherapy when only the planning CT image is available.
Besides the aforementioned methods, Haas et al. [25] used 2-D flood fill with shape guidance to localize the prostate in CT images. Ghosh et al. [26] proposed a genetic algorithm with prior knowledge in the form of texture and shape. Although these approaches adopted novel methodologies, their localization accuracies were limited.
There are also many methods published for prostate segmentation in other modalities (e.g., MR [27], [28] and ultrasound [29]–[31]). However, for various reasons, most of these methods cannot be readily adapted for fast prostate localization in CT images. For example, conventional multi-atlas methods (e.g., [27]) often require nonrigid registration to align each atlas with the image to be segmented. This procedure is quite time consuming (e.g., 15 min per registration) and thus not suitable for fast prostate localization. Methods such as [29], [30] utilized the presence of the diagnostic probe in ultrasound images for feature design and prostate localization; due to the lack of such structures in CT images, these methods are not applicable. Other methods (e.g., [28], [30]) are not considered because they are either semi-automatic or only applicable to 2-D segmentation.
B. Incremental Learning
Incremental learning has been extensively investigated in machine learning. The key objective of incremental learning is to adapt previously learned models (e.g., classifiers) to new data without retraining from scratch. Polikar et al. [32] proposed Learn++, an algorithm for incremental training of neural network (NN) classifiers. By assembling previously learned classifiers with incrementally trained classifiers, Learn++ is able to adapt the trained classifiers to incoming data. Diehl et al. [33] proposed an incremental learning algorithm for adapting the support vector machine (SVM) classifier. Ross et al. [34] proposed an incremental learning algorithm for online updating of a Gaussian appearance model for visual tracking. While these methods share some similarities with our framework in terms of incremental learning, there are three main differences between previous incremental learning approaches and our method: 1) instead of preserving all previously learned knowledge [32], ILSM selectively discards learned population characteristics that are no longer applicable to the patient-specific data; 2) in contrast to [34], which assumes that image appearance follows a Gaussian distribution, ILSM does not impose any assumption on the appearance distribution, as prostate appearance often follows a complex non-Gaussian distribution (demonstrated in Fig. 3); 3) different from [33], which only focuses on the incremental learning of a specific classifier (i.e., SVM), ILSM provides a general learning framework for effectively combining large population data and limited patient-specific data, hence allowing the adoption of any classifier. To the best of our knowledge, this is the first work that employs the concept of incremental learning to effectively combine the appearance statistics of large population data and limited patient-specific data.
III. Methodology
Our method aims to localize the prostate in daily treatment images via learning a set of local discriminative appearance models. Specifically, these models are used as anatomy detectors to detect distinctive prostate anatomical landmarks as shown in Fig. 2. Based on the detected landmarks, multiple patient-specific shape atlases (i.e., prostate shapes in planning and previous treatment stages) can be aligned onto the treatment image space by RANSAC [35]. Finally, majority voting is adopted to fuse the labels from different shape atlases.
As shown in Fig. 4, our method consists of three components: 1) cascade detector learning, 2) incremental learning with selective memory, and 3) robust prostate localization by multi-atlas RANSAC, which are detailed in the following subsections.
A. Cascade Learning for Anatomy Detection
Our prostate localization method relies on several anatomical landmarks of the prostate. Inspired by Viola's face detection work [36], we adopt a learning-based detection method, which formulates landmark detection as a classification problem. Specifically, in each image, the voxel at the specific landmark is treated as a positive sample and all other voxels as negative samples. In the training stage, we employ a cascade learning framework that aims to learn a sequence of classifiers to gradually separate negatives from positives (Fig. 5). Compared to learning only a single classifier, cascade learning has shown better classification accuracy and runtime efficiency [36], [37]. Mathematically, cascade learning can be formulated as follows.
Input: Positive voxel set X^+, negative voxel set X^−, and label set ℒ = {+1, −1}.
Classifier: C(x): f(x) → ℒ, where f(x) denotes the appearance features of a voxel x.
Initial Set: X0 = X^+ ∪ X^−.
Objective: Optimize Ck, k = 1, 2, …, K, such that ‖XK ∩ X^−‖ ≤ τ ‖X^+‖,
where Xk = {x | x ∈ Xk−1 and Ck(x) = +1}, and τ controls the tolerance ratio of false positives.
The cascade classifiers Ck, k = 1, 2,…, K, are optimized sequentially. As shown in (1), Ck is optimized to minimize the false positives left over by the previous k − 1 classifiers
Ck = argmin_C ‖{x | x ∈ Xk−1 ∩ X^− and C(x) = +1}‖   s.t.   C(x) = +1, ∀ x ∈ X^+     (1)
where ‖·‖ denotes the cardinality of a set. It is worth noting that the constraint in (1) can be simply satisfied by adjusting the threshold of classifier Ck [36] to make sure that all positive training samples are correctly classified. This cascade learning framework is general to any image feature and classifier. Extended Haar wavelets [38], [39] and the Adaboost [36] classifier are employed in our study.
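For concreteness, a minimal Python sketch of this cascade training loop is given below. It assumes a generic `train_classifier` routine (e.g., an AdaBoost learner exposing a `decision_function` method) and precomputed feature matrices; all names are illustrative placeholders rather than the exact implementation used in this work.

```python
import numpy as np

def train_cascade(pos_feats, neg_feats, train_classifier, tau=0.2, max_levels=20):
    """Cascade learning sketch following Eq. (1): each level is trained on the
    positives plus the negatives that survived all previous levels, and its
    threshold is shifted so that no positive is rejected (zero false negatives).
    `train_classifier(X, y)` stands for any discriminative learner (e.g., AdaBoost)
    returning an object with a decision_function(X) method."""
    cascade = []
    pos_feats = np.asarray(pos_feats)
    surviving_neg = np.asarray(neg_feats)              # X_{k-1} ∩ X^-
    for _ in range(max_levels):
        # stop when the remaining false positives are few relative to the positives
        if len(surviving_neg) <= tau * len(pos_feats):
            break
        X = np.vstack([pos_feats, surviving_neg])
        y = np.hstack([np.ones(len(pos_feats)), -np.ones(len(surviving_neg))])
        clf = train_classifier(X, y)
        # adjust the threshold so every positive training sample is accepted,
        # which satisfies the constraint in Eq. (1)
        threshold = clf.decision_function(pos_feats).min()
        cascade.append((clf, threshold))
        scores_neg = clf.decision_function(surviving_neg)
        surviving_neg = surviving_neg[scores_neg >= threshold]   # passed to the next level
    return cascade

def apply_cascade(cascade, feature_vec):
    """Return the final-level score if the voxel passes every level, else None."""
    score = None
    for clf, threshold in cascade:
        score = clf.decision_function(np.asarray(feature_vec).reshape(1, -1))[0]
        if score < threshold:
            return None
    return score
```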
Once the cascade classifiers {Ck(x)} are learned, they have captured the appearance characteristics of the specific anatomical landmark. Given a testing image, the learned cascade is applied to each voxel. The voxel with the highest classification score after going through the entire cascade is selected as the detected landmark. To increase the efficiency and robustness of the detection procedure, a multi-scale scheme is further adopted. Specifically, the landmark detected at the coarse resolution serves as the initialization for landmark detection at the subsequent finer resolution, in which the landmark is only searched within a local neighborhood centered at the initialization. In this way, the search space is greatly reduced and the detection procedure is more robust to local minima.
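The coarse-to-fine search can be sketched as follows, reusing `apply_cascade` from the previous snippet; `feature_fn` and the search radius are illustrative assumptions, not the exact values used in this work.

```python
import numpy as np
from itertools import product

def detect_landmark_multiscale(pyramid, feature_fn, search_radius_vox=10):
    """Coarse-to-fine landmark detection sketch. `pyramid` is a list of
    (image, cascade, spacing_mm) tuples ordered from coarse to fine;
    `feature_fn(image, v)` returns the feature vector of voxel v."""
    best = None
    for level, (image, cascade, spacing) in enumerate(pyramid):
        if best is None:
            # exhaustive scan at the coarsest scale
            candidates = product(*[range(s) for s in image.shape])
        else:
            # map the previous detection to the current scale and search locally
            prev_spacing = pyramid[level - 1][2]
            center = tuple(int(round(c * prev_spacing / spacing)) for c in best)
            best = center   # fall back to the rescaled estimate if nothing passes
            candidates = product(*[
                range(max(0, c - search_radius_vox), min(s, c + search_radius_vox + 1))
                for c, s in zip(center, image.shape)])
        best_score = -np.inf
        for v in candidates:
            score = apply_cascade(cascade, feature_fn(image, v))
            if score is not None and score > best_score:
                best_score, best = score, v
    return best   # detected landmark voxel at the finest resolution
```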
B. Incremental Learning With Selective Memory (ILSM)
1) Motivation
Using cascade learning, one can learn anatomy detectors from training images of different patients (population-based learning). However, since intra-patient anatomy variations are much less pronounced than inter-patient variations (Fig. 6), patient-specific appearance information available in the IGRT workflow should be exploited in order to improve the detection accuracy for an individual patient. Unfortunately, the number of patient-specific images is often very limited, especially in the beginning of IGRT. To overcome this problem, one may apply random spatial/intensity transformations to produce more “synthetic” training samples with larger variability. However, these artificially created transformations may not capture the real intra-patient variations, e.g., the uncertainty of bowel gas and filling (Fig. 6). As a result, cascade learning using only patient-specific data (pure patient-specific learning) often suffers from overfitting. One can also mix population and patient-specific images for training (mixture learning). However, since patient-specific images are the “minority” in the training samples, detectors trained by mixed samples might not capture patient-specific characteristics very well. To address this problem, we propose a new learning scheme, ILSM, to combine the general information in the population images with the personal information in the patient-specific images. Specifically, population-based anatomy detectors serve as an initial appearance model and are subsequently “personalized” by the limited patient-specific data. ILSM consists of backward pruning to discard obsolete population appearance information and forward learning to incorporate the online-learned patient-specific appearance characteristics.
2) Notations
Denote Dpop = {C^pop_1, C^pop_2, …, C^pop_Kpop} as the population-based anatomy detector (learned as outlined in Section III-A), which contains a cascade of Kpop classifiers. X^+_pat and X^−_pat are the positives and negatives from the patient-specific training images, respectively. D(x) denotes the class label (landmark versus nonlandmark) of voxel x predicted by detector D.
3) Backward Pruning
The general appearance model learned from the population is not necessarily applicable to the specific patient. More specifically, the anatomical landmarks in the patient-specific images (positives) may be classified as negatives by the population-based anatomy detectors, i.e., ∃ x ∈ X^+_pat such that Dpop(x) = −1. In order to discard the parts of the population appearance model that do not fit the patient-specific characteristics, we propose backward pruning to tailor the population-based detector. As shown in Algorithm 1, in backward pruning, the cascade is pruned from the last level until all patient-specific positives successfully pass through the cascade. This is equivalent to searching for the maximum number of cascade levels that can be preserved from the population-based anatomy detector, as formalized in (2).
Algorithm 1 Backward pruning algorithm.

Input:
  Dpop = {C^pop_1, …, C^pop_Kpop} - the population-based detector
  X^+_pat - patient-specific positive samples
Output: Dbk - the tailored population-based detector
Init: k = Kpop, Dbk = Dpop
while ∃ x ∈ X^+_pat such that Dbk(x) = −1 do
  remove the last classifier C^pop_k from Dbk
  k = k − 1
end while
Kbk = k
return Dbk = {C^pop_1, …, C^pop_Kbk}
Algorithm 2 Forward learning algorithm.

Input:
  Dbk = {C^pop_1, …, C^pop_Kbk} - the tailored population-based detector
  X^+_pat - patient-specific positive samples
  X^−_pat - patient-specific negative samples
Output: Dpat - the patient-specific detector
Init: k = 1, Dpat = Dbk
while ‖{x | x ∈ X^−_pat and Dpat(x) = +1}‖ / ‖X^+_pat‖ > τ do
  Train the classifier C^pat_k by minimizing Eq. (3) below:
    C^pat_k = argmin_C ‖{x | x ∈ X^−_pat, Dpat(x) = +1 and C(x) = +1}‖   s.t.   C(x) = +1, ∀ x ∈ X^+_pat     (3)
  Append C^pat_k to Dpat
  k = k + 1
end while
Kpat = k − 1
return Dpat = {C^pop_1, …, C^pop_Kbk, C^pat_1, …, C^pat_Kpat}

‖·‖ denotes the cardinality of a set. τ is the parameter controlling the tolerance ratio of false positives.
Kbk = max{k ≤ Kpop | C^pop_i(x) = +1, ∀ x ∈ X^+_pat, ∀ i = 1, …, k}     (2)
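A minimal sketch of backward pruning is given below, under the assumption of a generic `apply_level(level, x)` helper that returns +1 or −1 for a single cascade level; it mirrors Algorithm 1 and (2) but is not the exact implementation used in this work.

```python
def backward_prune(pop_cascade, pos_pat, apply_level):
    """Backward pruning sketch (Algorithm 1, Eq. (2)): drop trailing population
    classifiers until every patient-specific positive passes the remaining cascade.
    `apply_level(level, x)` returns +1 or -1 for one cascade level; names are illustrative."""
    k = len(pop_cascade)
    while k > 0:
        rejected = any(
            any(apply_level(pop_cascade[i], x) == -1 for i in range(k))
            for x in pos_pat)
        if not rejected:
            break
        k -= 1                 # discard the last (deepest) population classifier
    return pop_cascade[:k]     # D_bk, with K_bk = k levels retained
```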
4) Forward Learning
Once the population cascade has been tailored, the remaining cascade of classifiers {C^pop_1, …, C^pop_Kbk} encodes the population appearance information that is consistent with the patient-specific characteristics. Yet, until now, no real patient-specific information has been incorporated into the cascade. More specifically, false positives might still exist in the patient-specific samples, i.e., ∃ x ∈ X^−_pat such that Dbk(x) = +1. In the forward learning stage, we use the remaining cascade from the backward pruning algorithm as an initialization and reapply cascade learning to eliminate the patient-specific false positives left over by the inherited population classifiers. As shown in Algorithm 2, a greedy strategy is adopted to sequentially optimize a set of additional patient-specific classifiers {C^pat_1, …, C^pat_Kpat}.
After backward pruning and forward learning, the personalized anatomy detector Dpat includes two groups of classifiers (Fig. 7). While the patient-specific classifiers {C^pat_k} encode patient-specific characteristics, the inherited population classifiers {C^pop_k, k ≤ Kbk} contain population information that is applicable to this specific patient. This information effectively compensates for the limited variability available from the small number of patient-specific training images.
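Forward learning can be sketched analogously, assuming hypothetical `train_level` and `apply_level` helpers (e.g., one AdaBoost level trained as in the cascade-learning sketch of Section III-A, with its threshold shifted so every positive passes):

```python
def forward_learn(pruned_cascade, pos_pat, neg_pat, train_level, apply_level,
                  tau=0.2, max_new_levels=10):
    """Forward learning sketch (Algorithm 2): append patient-specific cascade levels
    until the patient-specific false positives drop below the tolerance ratio tau.
    `train_level(pos, neg)` trains one level as in Eq. (3); `apply_level(level, x)`
    returns +1 or -1. All names are illustrative."""
    personalized = list(pruned_cascade)
    # patient-specific negatives that still pass every inherited population level
    surviving_neg = [x for x in neg_pat
                     if all(apply_level(lvl, x) == +1 for lvl in personalized)]
    while (len(surviving_neg) > tau * len(pos_pat)
           and len(personalized) - len(pruned_cascade) < max_new_levels):
        level = train_level(pos_pat, surviving_neg)
        personalized.append(level)
        surviving_neg = [x for x in surviving_neg if apply_level(level, x) == +1]
    return personalized   # D_pat: inherited population levels + new patient-specific levels
```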
5) Insight of ILSM
In fact, pure patient-specific learning (PPAT) and traditional incremental learning (IL) can also be employed to incorporate the patient-specific information. It is interesting to compare ILSM with PPAT and IL. PPAT only uses patient-specific data for training. In other words, it completely discards all knowledge learned from population, which is known as “catastrophic forgetting” [32]. The method is prone to over-fitting if the patient-specific data is very limited. On the other hand, IL aims to gradually adapt the classifiers with new data. It assumes the previously learned knowledge is always applicable for the new incoming data and tries to “remember” all of them. Consequently, the incrementally learned patient-specific knowledge can be impaired by incompatible population-based knowledge. In fact, in the context of cascade learning, IL can be regarded as the proposed method without backward pruning. In contrast to PPAT and IL, ILSM aims to “selectively” remember the subset of pre-learned knowledge consistent with the characteristics in the new data. ILSM's “selective memory” helps to overcome the limitations of the other two methods.
Fig. 8 schematically explains the differences among PPAT, IL, and ILSM from the perspective of decision boundary refinement. Fig. 8(a) shows the sample distribution in a 2-D feature space. Star and circle represent positive and negative samples, respectively. Blue stars/circles are population training samples, and green ones denote patient-specific samples. The orange star is a testing sample.
As shown in Fig. 8(b), since PPAT only uses patient-specific samples (stars/circles in green), the generated decision boundaries (green lines) closely encompass the positive patient-specific training samples (green stars). These decision boundaries might overfit the very limited number of patient-specific samples. As a result, a testing sample [the orange star in Fig. 8(b)], having slight differences from these training samples, is misclassified.
IL derives the decision boundaries in two steps. First, as shown in Fig. 8(c), it learns the decision boundaries using population samples (blue stars/circles). Second, these boundaries are adapted to accommodate patient-specific samples. For example, in Fig. 8(d), an additional purple line is generated to separate patient-specific positives (green stars) and negatives (green circles). Since IL aims to preserve all pre-learned population-based boundaries (blue and red lines), some patient-specific data [circled in red in Fig. 8(d)] are still misclassified due to the “unforgettable” decision boundary [the red line in Fig. 8(d)].
Similar to IL, ILSM also starts from population-based learning [Fig. 8(c)]. However, when adapting the decision boundaries to patient-specific samples, it is able to "forget" some pre-learned knowledge that is not applicable to the patient-specific data. Specifically, the obsolete decision boundary [red line in Fig. 8(d)] can be discarded in the "backward pruning" step of ILSM. Hence, ILSM can correctly classify all patient-specific data [Fig. 8(e)]. In addition, by reusing the applicable population-based decision boundaries (blue lines), the risk of overfitting is also greatly reduced. In this way, ILSM addresses the limitations of both PPAT and IL.
In fact, ILSM can be considered a more general learning framework, of which IL and PPAT are just two special cases. If all positive samples from the patient-specific images can be correctly classified by Dpop, the backward pruning (Algorithm 1) will stop immediately, i.e., Kbk = Kpop. The learned patient-specific detector will then preserve all population characteristics, which is the same as IL. At the other extreme, if the population-based detector is completely incompatible with the patient-specific samples, the backward pruning will not stop until Dbk = ∅, which means all population-based classifiers are discarded. In such cases, the forward learning starts from scratch with only patient-specific samples, and ILSM becomes equivalent to PPAT. In practice, this situation rarely happens. We manually checked all such cases among our trained detectors and found that they all occurred when sufficient patient-specific images (≥ 5) had already been collected; in this situation, PPAT is able to obtain performance similar to ILSM.
C. Robust Prostate Localization by Multi-Atlas RANSAC
Once the population-based anatomy detectors are “personalized” by ILSM, they are used to detect the corresponding prostate anatomical landmarks (Fig. 2) in new treatment images. Based on the detected landmarks, any patient-specific prostate shape model (e.g., the prostate shape delineated in the planning stage) can be aligned onto the treatment image space for fast localization. For robust performance against wrongly detected landmarks, the RANSAC algorithm [35] is used to estimate the optimal transformation that fits the shape model onto the detected landmarks (Algorithm 3). Considering the limited number of anatomical landmarks (seven), as well as in the interest of computational efficiency, rigid transformation is used in our work.
One can simply align the planning prostate shape onto the treatment image for localization, which is referred to as single-atlas RANSAC. However, due to the daily shape variations during radiotherapy, the performance of using a single shape model is usually limited. To overcome this limitation, we propose multi-atlas RANSAC for robust prostate localization. Instead of using a single shape model, we utilize all patient-specific shape models available from the planning and previous treatment stages for multi-atlas labeling of a new treatment image. In other words, each patient-specific shape model is treated as a shape atlas. Once anatomical landmarks are detected in the new treatment image, all available shape atlases are independently aligned onto the new treatment image space by RANSAC (Algorithm 3). Then, majority voting is adopted to fuse the labels from the different shape atlases. Thus, by integrating all patient-specific shape information into a multi-atlas scheme, the localization procedure is more robust to daily shape variations than single-atlas RANSAC. Fig. 9 illustrates the multi-atlas model fitting process.
Algorithm 3 Robust Surface Transformation by RANSAC

Definition: ℕ = 7 - number of anatomical landmarks
Input:
  pk, k = 1, 2, ⋯, ℕ - landmarks in one patient-specific training image Ipat
  mk, k = 1, 2, ⋯, ℕ - detected landmarks in the treatment image Itreat
  ℳ - minimum number of landmarks required for transformation estimation
  η - threshold to determine whether a landmark agrees on the transformation
Output: Topt - optimal transformation between prostate shapes in Ipat and Itreat
Init: Topt = nil, εopt = ∞
for each landmark subset S of {1, 2, ⋯, ℕ} with ‖S‖ ≥ ℳ do
  Estimate the rigid transformation TS from {pk | k ∈ S} to {mk | k ∈ S}
  for any k not in S do
    if ‖TS(pk) − mk‖ < η then
      add k into S
    end if
  end for
  Re-estimate TS on the consensus set S and compute its fitting error εS
  if εS < εopt then
    Topt = TS, εopt = εS
  end if
end for
return Topt
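For illustration, a compact Python sketch of the landmark-based rigid fitting and label fusion is given below. The Kabsch-based `fit_rigid` solver and the consensus-growing loop mirror Algorithm 3, while `majority_vote` fuses binary atlas masks that have already been warped with the estimated transforms; this is a simplified rendering under these assumptions, not the exact implementation.

```python
import itertools
import numpy as np

def fit_rigid(src, dst):
    """Least-squares rigid (rotation + translation) alignment of two 3-D point sets
    via the Kabsch algorithm; returns a function mapping src-space points to dst-space."""
    src, dst = np.asarray(src, float), np.asarray(dst, float)
    src_c, dst_c = src.mean(0), dst.mean(0)
    H = (src - src_c).T @ (dst - dst_c)
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))        # handle reflections
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_c - R @ src_c
    return lambda pts: np.asarray(pts, float) @ R.T + t

def ransac_rigid(atlas_pts, detected_pts, min_set=3, eta_mm=5.0):
    """RANSAC sketch following Algorithm 3: enumerate landmark subsets, fit a rigid
    transform, grow the consensus set with landmarks within eta_mm of their
    detections, and keep the transform with the lowest fitting error."""
    atlas_pts, detected_pts = np.asarray(atlas_pts), np.asarray(detected_pts)
    n = len(atlas_pts)
    best_T, best_err = None, np.inf
    for size in range(min_set, n + 1):
        for subset in itertools.combinations(range(n), size):
            S = list(subset)
            T = fit_rigid(atlas_pts[S], detected_pts[S])
            for k in range(n):                     # grow the consensus set
                if k not in S and np.linalg.norm(T(atlas_pts[k]) - detected_pts[k]) < eta_mm:
                    S.append(k)
            T = fit_rigid(atlas_pts[S], detected_pts[S])      # refit on all inliers
            err = np.mean(np.linalg.norm(T(atlas_pts[S]) - detected_pts[S], axis=1))
            if err < best_err:
                best_T, best_err = T, err
    return best_T

def majority_vote(warped_masks):
    """Fuse already-warped binary atlas masks: a voxel is labelled prostate if
    more than half of the atlases vote for it."""
    votes = np.sum(np.stack(warped_masks).astype(np.int32), axis=0)
    return votes > len(warped_masks) / 2
```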
IV. Experimental Results
A. Data Description
Our experimental data consist of two datasets acquired at the University of North Carolina Cancer Hospital: in total, 32 patients with 478 images. Every patient has one planning scan and multiple treatment scans. The prostates in all CT images were manually delineated by an experienced expert to serve as ground truth. The planning images in dataset A were scanned on a Siemens Somatom CT scanner, while the planning images in dataset B were scanned on a Philips Big Bore scanner. The treatment images in both dataset A and dataset B were collected on a Siemens Somatom CT-on-rails scanner. The typical dose was 200–300 cGy for planning imaging and less than 2 cGy for treatment imaging. The field-of-view (FOV) is 50 cm for planning images in dataset A, 60 cm for planning images in dataset B, and 40 cm for treatment images in both datasets. For other information (e.g., spacing, image size), please refer to Table I. Fig. 10 shows the histogram of the number of treatment scans per patient.
Table I. Parameters of Two CT Prostate Datasets.
| | Dataset A | Dataset B |
|---|---|---|
| Planning resolution (mm) | 0.98 × 0.98 × 3 | 1.24 × 1.24 × 3 |
| Treatment resolution (mm) | 0.98 × 0.98 × 3 | 0.98 × 0.98 × 3 |
| Image size | 512 × 512 × (30–120) | 512 × 512 × (30–120) |
| Number of patients | 25 | 7 |
| Number of images | 349 | 129 |
B. Accuracy Measurements
To quantitatively evaluate the proposed method, we adopt the following four measurements.
Dice Similarity Coefficient (DSC) measures the overlap ratio between automatic and manual segmentation. It is defined as (2 ×‖Vs ∩ Vm‖)/(‖Vs‖ + ‖Vm‖), where Vs and Vm are the prostate-labelled voxel sets automatically segmented by our method and manually segmented by a clinical expert, respectively, and ‖·‖ denotes the cardinality of a set.
Average Surface Distance (ASD) measures the surface distance between automatically and manually segmented prostate volumes. To compute this measure, we first sample a number of directions (i.e., 360 × 180 = 64800 directions) from the uniform spherical distribution. Then, for each sampled direction, we cast a ray originating from the ground truth centroid and compute the surface distance along this ray. Finally, the surface distances over all sampled directions are averaged to obtain the average surface distance.
Sensitivity (Sen.) is defined as ‖Vs ∩ Vm‖/‖Vm‖, which measures the percentage of the manual segmentation that overlaps with the automatic segmentation.
Positive Predictive Value (PPV.) is defined as ‖Vs ∩ Vm‖/‖Vs‖, which measures the percentage of the automatic segmentation that overlaps with the manual segmentation.
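These three overlap measures can be computed directly from the binary masks, as in the following sketch (the ray-casting ASD described above is omitted for brevity):

```python
import numpy as np

def overlap_metrics(auto_mask, manual_mask):
    """DSC, sensitivity, and PPV for two binary prostate masks (boolean numpy
    arrays of the same shape), following the definitions given above."""
    vs = np.asarray(auto_mask, dtype=bool)     # V_s: automatic segmentation
    vm = np.asarray(manual_mask, dtype=bool)   # V_m: manual segmentation
    inter = np.logical_and(vs, vm).sum()
    dsc = 2.0 * inter / (vs.sum() + vm.sum())
    sensitivity = inter / vm.sum()
    ppv = inter / vs.sum()
    return dsc, sensitivity, ppv
```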
C. Accuracy and Efficiency Requirement for Image Guided Radiation Therapy
As indicated by an experienced clinician, a localization algorithm with an average surface distance of less than 3 mm, a DSC greater than 0.80, and a runtime of less than 2 min would be acceptable for conventional radiation therapy. For Stereotactic Body Radiation Therapy (SBRT), which delivers a much higher dose per fraction (800 cGy) than conventional radiation therapy, it is desirable to track the intra-fraction prostate motion during the radiation treatment to reduce the chance of missing the target. Thus, a localization algorithm with higher efficiency is often required. According to the clinician, in order to track intra-fraction prostate motion in SBRT, the time for the entire prostate localization procedure should be kept within 1 min, including the time for review and manual adjustment. Since the time for manual adjustment heavily depends on the segmentation quality, it is difficult to give a quantitative threshold for acceptable algorithm speed. In principle, a faster algorithm leaves more time for quality control and meanwhile minimizes the discomfort of patients while they are immobilized on the treatment bed.
D. Parameter and Experimental Setting
We use three scales (coarse, middle, and fine) in population-based learning. Table II lists the training parameters of landmark detection at different scales, which will be elaborated in the following paragraphs.
Table II. Training Parameters for Multi-Scale Landmark Detection.
| Scale | Spacing (mm) | W (mm) | dn (mm) |
|---|---|---|---|
| Coarse | 4 | 80 | 400 |
| Middle | 2 | 40 | 200 |
| Fine | 1 | 20 | 100 |
In the training of each cascade, the positive training samples X^+ are the voxels annotated as landmarks. The negative training sample set X^− consists of all voxels whose distances from the annotated landmarks are within dn. At every cascade level k, if ‖X^+‖/‖Xk−1 ∩ X^−‖ < τ, we randomly sample a portion of negatives from Xk−1 ∩ X^− such that the positive/negative ratio equals τ (in this paper, τ = 1/5). Otherwise, if ‖X^+‖/‖Xk−1 ∩ X^−‖ ≥ τ, we use all samples in Xk−1 ∩ X^− as negatives. τ is also used as the relative threshold for stopping cascade learning and forward learning, i.e., training stops when the false positive/positive ratio falls below τ. In this way, we can restrict the positive/negative ratio between τ and 1/τ at every cascade level, thus avoiding the problems introduced by an imbalanced training dataset.
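The negative subsampling rule above can be sketched as follows; the array-based encoding of the surviving negatives is an illustrative assumption.

```python
import numpy as np

def sample_negatives(num_pos, surviving_neg, tau=0.2, rng=None):
    """Sketch of the negative-subsampling rule. `surviving_neg` is an array of
    negative feature vectors that passed the previous cascade levels (X_{k-1} ∩ X^-).
    If positives are fewer than tau times the surviving negatives, negatives are
    subsampled so that the positive/negative ratio equals tau; otherwise all are kept."""
    rng = rng or np.random.default_rng(0)
    if num_pos / len(surviving_neg) < tau:
        target = int(round(num_pos / tau))          # keep pos:neg = tau
        idx = rng.choice(len(surviving_neg), size=target, replace=False)
        return surviving_neg[idx]
    return surviving_neg
```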
Each training voxel is represented by a set of extended Haar wavelet features [38], which are computed by convolving the Haar-like kernels with the original image. The Haar-like kernels are generated by scaling predefined Haar-like templates. Each Haar-like template consists of one or more 3-D rectangle functions with different polarities
H(x) = Σ_{i=1}^{Z} p_i R(x − a_i)     (4)

R(x) = 1 if x ∈ [0, 1]³, and 0 otherwise     (5)
where Z is the number of 3-D rectangle functions (in our case, Z ∈ {1, 2}), and pi ∈ {−1, 1} and ai are the polarity and translation of the ith 3-D rectangle function, respectively. Fig. 11 shows the 14 Haar-like templates used in this paper. The coefficients for scaling the Haar-like templates are 3 and 5. For each training voxel, we compute the extended Haar wavelet features in its W × W × W local neighborhood. Then, all computed features are concatenated to form a patch-based feature representation of the voxel. The training parameters are listed in Table II. In the cascade learning step, we employ the Adaboost classifier as the cascade classifier. The Adaboost classifier training stops when 20 weak classifiers are obtained.
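In practice, such box-filter responses are usually evaluated with a summed-volume (integral-image) table rather than explicit convolution; the sketch below illustrates this standard trick, with an illustrative (not the paper's) encoding of the templates as (polarity, offset, size) triples.

```python
import numpy as np
from itertools import product

def integral_image(volume):
    """Zero-padded 3-D summed-volume table: ii[x, y, z] = sum of volume[:x, :y, :z]."""
    ii = np.zeros(tuple(s + 1 for s in volume.shape), dtype=np.float64)
    ii[1:, 1:, 1:] = volume.cumsum(0).cumsum(1).cumsum(2)
    return ii

def box_sum(ii, lo, hi):
    """Sum of voxels inside the axis-aligned box [lo, hi) via 3-D inclusion-exclusion."""
    total = 0.0
    for choose_hi in product((0, 1), repeat=3):
        corner = tuple(h if c else l for c, l, h in zip(choose_hi, lo, hi))
        sign = (-1) ** (3 - sum(choose_hi))   # +1 when all upper corners are chosen
        total += sign * ii[corner]
    return total

def haar_response(ii, center, rects):
    """Response of one Haar-like template centered at `center`. `rects` is a list of
    (polarity, offset, size) triples standing in for p_i and a_i above; this encoding
    is illustrative and omits bounds checking for brevity."""
    response = 0.0
    for polarity, offset, size in rects:
        lo = tuple(c + o for c, o in zip(center, offset))
        hi = tuple(l + s for l, s in zip(lo, size))
        response += polarity * box_sum(ii, lo, hi)
    return response
```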
In the multi-atlas RANSAC, ℳ (the minimum number of landmarks required for transformation estimation) is set to 3 since only 3-D rigid transformation needs to be estimated, and η (the threshold to determine whether a landmark agrees on the transformation) is set to 5 mm. In the remainder of this section, all results from ILSM are generated with the same parameter settings.
We used five-fold cross-validation on dataset A to evaluate our method. In this approach, the population-based landmark detectors for one fold are trained using the CT scans of patients from the other four folds. For each fold, about 250 CT images were used to train the population-based detectors. For the experiments on dataset B, the population-based detectors were trained using all CT scans of dataset A. In this way, we can validate the generalization of our method to data acquired with different scanners and protocols.
To emulate the real clinical setting, for prostate localization on treatment day N + 1, we use the previous N treatment images and the planning image as patient-specific training data (Fig. 1). From our observations, once N reaches 4, additional ILSM updates yield negligible accuracy gains. Therefore, after treatment day 4, we do not perform ILSM to further refine the patient-specific landmark detectors; instead, we directly adopt the existing detectors for prostate localization. If not explicitly mentioned, all reported ILSM results are computed using up to five patient-specific training images (four treatment images + one planning image).
E. Number of Cascade Classifiers
To gain insight into the number of cascade classifiers retained after backward pruning and appended during forward learning, we summarize the statistics of Kpop, Kbk, and Kpat in Table III. As we can see, the majority of population cascade classifiers at the coarse and middle scales are retained after backward pruning (i.e., Kbk is close to Kpop). However, at the fine scale, many population cascade classifiers are discarded. A likely reason is that individual differences manifest mainly at the fine scale and are less evident at the coarse and middle scales. Finally, regarding the number of patient-specific cascade classifiers appended in the forward learning stage, experimental results show that 2–3 classifiers are usually sufficient.
Table III. Statistics of Numbers of Cascade Classifiers.
| Scale | Kpop | Kbk | Kpat |
|---|---|---|---|
| Coarse | 13.1 ± 1.2 | 12.6 ± 3.4 | 2.1 ± 1.3 |
| Middle | 13.5 ± 0.6 | 12.7 ± 2.8 | 2.5 ± 1.6 |
| Fine | 15.5 ± 2.1 | 6.3 ± 2.8 | 2.9 ± 0.1 |
F. Comparison Studies
1) Comparison With Traditional Learning-Based Approaches
To illustrate the effectiveness of our learning framework, we compared ILSM with four other learning-based approaches on dataset A. All of these methods localize the prostate through learning-based anatomy detection with the same features, classifiers, and cascade framework (as described in Section III-A). Their differences lie in the training images and learning strategies used, as shown in Table IV. Note that for all patient-specific training images, artificial transformations are applied to increase the variability.
Table IV. Differences Between ILSM and Four Learning-Based Methods. (POP: Population-Based Learning; PPAT: Pure Patient-Specific Learning; MIX: Population and Patient-Specific Mixture Learning; IL: Incremental Learning Without Backward Pruning; ILSM: Proposed Incremental Learning With Selective Memory).
| | | POP | PPAT | MIX | IL | ILSM |
|---|---|---|---|---|---|---|
| Training images | Population | ✓ | | ✓ | ✓ | ✓ |
| | Patient-specific | | ✓ | ✓ | ✓ | ✓ |
| Learning strategies | Cascade Learning | ✓ | ✓ | ✓ | ✓ | ✓ |
| | Backward Pruning | | | | | ✓ |
| | Forward Learning | | | | ✓ | ✓ |
Table V compares the four learning-based approaches with ILSM in terms of landmark detection error. To exclude the influence of other components of our method, the reported landmark detection error is measured directly, without using RANSAC for outlier detection and correction. We can see that ILSM outperforms the other four learning-based approaches on all seven anatomical landmarks. To better interpret the landmark detection accuracy of our method, we further conducted an experiment to assess the inter-operator annotation variability of CT prostate landmarks. Specifically, we asked four different operators to independently annotate the seven anatomical landmarks on 19 CT scans of one patient. Then, the landmark annotation differences between every pair of operators were calculated. Finally, all pair-wise landmark annotation differences were averaged to obtain the inter-operator annotation variability (listed in Table VI). From Table VI, we can see that, on average, ILSM achieves accuracy comparable (if not superior) to the inter-operator annotation variability, exhibiting a better mean error but a slightly larger standard deviation. A two-sample t-test shows that the difference between ILSM and the inter-operator variability is statistically significant (p < 0.05).
Table V. Quantitative Comparisons on Landmark Detection Error (mm) Between ILSM and Four Learning-Based Methods on Dataset A. Landmark Error Reported Here is Calculated Without Using RANSAC for Outlier Detection and Correction. The Last Row Shows the P-Values of Two-Sample T-Test When Comparing Landmark Errors of Four Learning-Based Methods With That of ILSM.
| | POP | PPAT | MIX | IL | ILSM |
|---|---|---|---|---|---|
| PC | 6.69 ± 3.65 | 4.89 ± 5.64 | 6.03 ± 3.03 | 5.87 ± 4.01 | 4.73 ± 2.69 |
| RT | 7.85 ± 8.44 | 6.09 ± 9.00 | 5.72 ± 4.04 | 6.33 ± 4.82 | 3.76 ± 2.80 |
| LF | 6.89 ± 4.63 | 5.39 ± 7.61 | 5.61 ± 3.63 | 5.90 ± 4.54 | 3.69 ± 2.69 |
| PT | 7.04 ± 5.04 | 8.66 ± 13.75 | 6.18 ± 4.76 | 6.74 ± 5.05 | 4.78 ± 4.90 |
| AT | 6.60 ± 4.97 | 4.54 ± 5.06 | 5.38 ± 4.55 | 5.68 ± 4.97 | 3.54 ± 2.19 |
| BS | 6.12 ± 2.97 | 5.63 ± 7.44 | 6.63 ± 3.98 | 5.61 ± 2.94 | 4.68 ± 2.71 |
| AP | 10.42 ± 6.03 | 8.94 ± 16.07 | 8.77 ± 5.00 | 9.50 ± 7.17 | 6.28 ± 4.60 |
| Average | 7.37 ± 5.52 | 6.31 ± 10.13 | 6.33 ± 4.32 | 6.52 ± 5.09 | 4.49 ± 3.49 |
| p-value | < 10⁻⁵ | < 10⁻⁵ | < 10⁻⁵ | < 10⁻⁵ | n/a |
Table VI. Quantitative Comparison Between Landmark Detection Error (mm) of ILSM and Inter-Rater Annotation Variability on 19 Treatment Scans of One Patient. Landmark Error Reported Here Is Calculated Without Using RANSAC for Outlier Detection and Correction. The P-Value Reported Here Is Computed by Two-Sample T-Test.
| | PC | RT | LF | PT | AT | BS | AP | Average | p-value |
|---|---|---|---|---|---|---|---|---|---|
| ILSM | 4.72 ± 1.42 | 3.03 ± 1.75 | 3.17 ± 1.61 | 2.45 ± 1.00 | 3.24 ± 1.28 | 5.57 ± 1.98 | 7.18 ± 4.17 | 4.20 ± 2.65 | n/a |
| Inter-rater | 4.50 ± 1.22 | 5.25 ± 1.27 | 5.77 ± 1.49 | 5.71 ± 2.85 | 4.44 ± 3.09 | 4.63 ± 1.32 | 4.44 ± 1.05 | 4.96 ± 2.00 | 0.01 |
Table VII compares the four learning-based approaches with ILSM on overlap ratios (DSC). To exclude the influence of multi-atlas RANSAC, only a single shape atlas (i.e., the planning prostate shape) is used for localization. Here, “Acceptance Rate” denotes the percentage of images where an algorithm performs with higher accuracy than inter-operator variability (DSC = 0.81) [8]. According to our experienced clinician, these results can be accepted without manual editing. We can see that ILSM achieves the best localization accuracy among all methods. Not surprisingly, by utilizing patient-specific information, all three methods (i.e., PPAT, MIX, and IL) outperform POP. However, their performances are still inferior to ILSM, which shows the effectiveness of ILSM in combining both population and patient-specific characteristics.
Table VII. Quantitative Comparisons on Overlap Ratios (DSC) Between ILSM and Four Learning-Based Methods in Dataset A. (S) and (M) Indicate Single-Atlas and Multi-Atlas RANSAC, Respectively.
| | POP (S) | PPAT (S) | MIX (S) | IL (S) | ILSM (S) | ILSM (M) |
|---|---|---|---|---|---|---|
| Mean DSC | 0.81 ± 0.10 | 0.84 ± 0.15 | 0.83 ± 0.09 | 0.83 ± 0.09 | 0.87 ± 0.06 | 0.88 ± 0.06 |
| Acceptance Rate | 66% | 85% | 74% | 77% | 90% | 91% |
Fig. 12 shows the differences in localization accuracy between ILSM and PPAT with respect to the number of patient-specific training images. We can see that when the number of patient-specific training images is limited (< 3), the performance of PPAT is very poor, even with artificial transformations applied to increase the variability of the training samples. This is especially the case when only one patient-specific training image is used. Due to the limited patient-specific patterns observed, PPAT suffers from severe overfitting, resulting in high failure rates for some patients. In such cases, ILSM significantly outperforms PPAT by 40%–70% DSC, as shown in Fig. 12(a). The main reason why simple artificial transformations (e.g., rotation, translation) fail to improve the performance of PPAT is that they generally cannot capture real intra-patient anatomical appearance variations, such as bowel gas and filling (Fig. 1). This also explains why previous pure patient-specific learning algorithms [9], [10], [23] often start with three patient-specific training images. By leveraging both population and patient-specific data, ILSM achieves a DSC of 0.85 ± 0.06 on the first two treatment images using only the planning CT as patient-specific training data, while in the same setting PPAT only obtains a DSC of 0.79 ± 0.15. As the number of patient-specific training images increases, the performance difference between ILSM and PPAT gradually decreases. Ideally, when sufficient patient-specific data are collected, the performances of ILSM and PPAT should converge. However, even with up to 13 patient-specific training images, we still observe that ILSM is slightly better than PPAT (1.5% DSC difference), which implies the effectiveness of the general appearance characteristics learned from the population.
2) Comparison With Single-Atlas RANSAC
Fig. 13 shows the average DSCs of all 25 patients in Dataset A with single-atlas RANSAC and multi-atlas RANSAC. For single-atlas RANSAC, we use the planning prostate shape as the shape atlas. For multi-atlas RANSAC, we use not only the planning prostate shape but also previously segmented prostate shapes of the patient as shape atlases. We can see that in almost all patients, multi-atlas RANSAC achieves better localization accuracy than single-atlas RANSAC. Table VII also compares single-atlas and multi-atlas RANSAC on average DSC and acceptance rate. It shows that the localization accuracy of ILSM can be further boosted by using multi-atlas RANSAC (1% improvement on both average DSC and acceptance rate).
3) Comparison With Traditional Bone Alignment
Bone alignment is usually adopted as a standard preprocessing step in many prostate localization methods [8]–[10], [23]. The basic idea is to register the current treatment CT scan with a previous CT of the same patient by aligning the pelvic bones. The prostate mask in the previous CT can thereby be transformed onto the current treatment CT. In bone alignment, the pelvic bones in the two CT scans are first segmented by thresholding. Based on the segmented binary bone images, the optimal rigid transformation is estimated and used to co-align the two scans. Since the prostate is very close to the pelvic bone, bone alignment usually achieves satisfactory prostate overlap ratios. For a fair comparison, we adopted the same multi-atlas scheme as described in Section III-C to evaluate the performance gain of the proposed method over bone alignment. As in previous methods, we used the FLIRT toolkit [41] for bone alignment. Fig. 14 visually shows the degree of prostate overlap after bone alignment for 12 typical patients. The DSC obtained by bone alignment on our dataset is 0.78 ± 0.12, which is significantly lower than the DSC achieved by the proposed method (0.89 ± 0.06). In addition, bone alignment takes more computational time than the proposed method: to align two CT scans of image size 512 × 512 × 60, bone alignment typically takes 5 min, while the proposed method takes only 4 s on the same image size.
To consider the local intensity information around the prostate in the alignment procedure, we further conducted an experiment comparing a local intensity-based rigid registration method with the proposed method. In this registration method, bone alignment is first performed to align a previous CT scan with the current treatment CT based on the pelvic bone. Then, a tight bounding box is determined using the prostate mask of the previous CT scan. Based on this bounding box, the two CT scans are further registered using the intensity-based rigid registration implemented in FLIRT, with the correlation ratio as the cost function. Finally, given the estimated rigid transformation, the prostate mask in the previous CT scan is transformed onto the current treatment CT for localization. Following the same multi-atlas scheme (Section III-C), we found that, compared to bone alignment, local intensity-based rigid registration improves the localization accuracy from a mean DSC of 0.78 to 0.80. However, the standard deviation of the DSC also increases from 0.12 to 0.14 due to failure cases caused by poor initialization from bone alignment. In contrast, the proposed method achieves much higher accuracy (0.89 ± 0.06) with faster localization speed (4 s).
4) Comparison With Other CT Prostate Localization Methods on the Same Dataset
Our method achieves a localization accuracy of DSC 0.89 ± 0.06 and average surface distance 1.72 ± 1.00 mm on 446 treatment CT scans of 32 patients. Table VIII quantitatively compares the performance of our method with five other state-of-the-art methods on the same dataset, which employ a deformable model [17], registration [22], multi-atlas based segmentation [10], and classification [9], [23] to localize the prostate in treatment CTs. We can see that our method achieves accuracy comparable to the state-of-the-art methods, while substantially reducing the localization time to just 4 s. This fast localization speed helps overcome a limitation of previous localization methods: if the prostate unexpectedly moves during a long localization procedure, the localization has to be performed again. It is also worth noting that [9], [10], [23] require at least three patient-specific training images for initialization due to the nature of pure patient-specific learning, which means such methods cannot be adopted to segment the first two treatment CTs. By effectively combining both population and patient-specific information, even with only the planning CT, our method can still achieve reasonably accurate localization results on the first two treatment CTs (DSC 0.85 ± 0.06).
Table VIII. Quantitative Comparison With Other CT Prostate Localization Methods on the Same Dataset (DSC: Dice Similarity Coefficient, ASD: Average Surface Distance, Sen.: Sensitivity, PPV.: Positive Predictive Value).
| Method | Deformable (Feng [17]) | Registration (Liao [22]) | Multi-atlas (Liao [10]) | Classification (Li [9]) | Classification (Gao [23]) | ILSM |
|---|---|---|---|---|---|---|
| Automaticity | Fully | Fully | Fully | Fully | Semi | Fully |
| Mean DSC | 0.89 | 0.90 | 0.91 | 0.91 | 0.91 | 0.89 |
| Mean ASD (mm) | 2.08 | 1.08 | 0.97 | 1.40 | 1.24 | 1.72 |
| Median Sen. | n/a | 0.89 | 0.90 | 0.90 | 0.92 | 0.89 |
| Median PPV. | n/a | 0.89 | 0.92 | 0.90 | 0.92 | 0.92 |
| Speed (s) | 96 | 228 | 156 | 180 | 600 | 4 |
5) Comparison With Other CT Prostate Localization Methods on Different Datasets
Table IX lists the performance of other CT prostate localization methods for reference. Because neither the data nor the source code of these methods is publicly available, we only cite the numbers reported in their publications. Based on the reported numbers, we can see that our method has been evaluated on the largest dataset and achieves the best localization accuracy.
Table IX. Comparison With Other CT Prostate Localization Methods on Different Datasets for Reference (DSC: Dice Similarity Coefficient, Sen.: Sensitivity, PPV.: Positive Predictive Value).
G. Algorithm Performance
In this section, we report the performance of the proposed algorithm in terms of localization accuracy, robustness to unsupervised annotation, generalization, sensitivity to landmark selection, temporal accuracy, and speed.
1) Accuracy
Table X shows the localization accuracy of our method on dataset A and dataset B. We can see that our method achieves more consistent and accurate localizations (DSC 0.89 ± 0.06) than the inter-operator variability (DSC 0.81 ± 0.06) [8]. This indicates that our method well satisfies the accuracy requirement of IGRT and can be adopted in the clinical setting. To assess the lower and upper bounds of the accuracy of our localization method, we further conducted two experiments. In the first experiment, we detected only one anatomical landmark (the prostate center) and used only one shape atlas (the planning prostate shape) for localization. The performance under this setting is regarded as the lower bound of the accuracy of landmark-based prostate localization. In the second experiment, we performed landmark-based prostate localization using manually annotated landmarks and multiple shape atlases (prostate shapes in the planning and previous treatment images). This accuracy represents the upper bound of landmark-based prostate localization. Table XI lists the lower- and upper-bound accuracies on different quantitative measures. It should be noted that the only difference between the upper-bound accuracy (Table XI) and the reported accuracy of our method (Table X) lies in the landmark localization: the upper bound is calculated using the manually annotated landmarks, whereas the performance of our method is obtained using automatically detected landmarks. Comparing the two, we can see that the performance of our method is quite close to the upper bound, which indicates that ILSM achieves accurate automatic landmark detection. On the other hand, even using only one anatomical landmark and a single shape atlas, the localization accuracy is still comparable to the inter-operator variability (DSC 0.81 ± 0.06), which shows the effectiveness of ILSM in CT prostate localization.
Table X. Localization Accuracy of ILSM on Two Datasets (DSC: Dice Similarity Coefficient, ASD: Average Surface Distance, Sen: Sensitivity, PPV: Positive Predictive Value).
| | DSC | ASD (mm) | Sen. | PPV. |
|---|---|---|---|---|
| Dataset A | 0.88 ± 0.06 | 1.89 ± 0.98 | 0.87 ± 0.06 | 0.89 ± 0.06 |
| Dataset B | 0.91 ± 0.05 | 1.27 ± 0.90 | 0.88 ± 0.05 | 0.93 ± 0.06 |
| All | 0.89 ± 0.06 | 1.72 ± 1.00 | 0.88 ± 0.06 | 0.90 ± 0.06 |
Table XI. Lower and Upper Bound Accuracy of ILSM in CT Prostate Localization. Reported Values Are Calculated on Both Dataset A and Dataset B.
| | DSC | ASD (mm) | Sen. | PPV. |
|---|---|---|---|---|
| Upper bound | 0.92 ± 0.03 | 1.00 ± 0.60 | 0.91 ± 0.05 | 0.94 ± 0.03 |
| Lower bound | 0.81 ± 0.09 | 3.01 ± 2.01 | 0.80 ± 0.10 | 0.83 ± 0.10 |
2) Robustness to Unsupervised Annotation
As shown in Fig. 4, in order to incorporate patient-specific characteristics, ILSM requires annotations in the planning and previous treatment images. Annotations in planning images are always provided by physicians. Afterward, there are two ways to obtain annotations in the treatment images. 1) Supervised annotation: detectors trained on the planning and previous treatment images are applied to localize the landmarks in the current treatment image, and the detection results are reviewed and corrected by physicians before being used to train the detectors for the following treatment days. 2) Unsupervised annotation: the automatically detected landmarks are treated as ground truth and used to train the detectors for the following treatment days without manual review or correction. Although the first scenario guarantees that all training data are correctly annotated, the second scenario has the advantage of requiring fewer manual operations (in fact, none except the annotation of the planning CT), as long as the uncorrected annotation errors do not significantly degrade the localization accuracy.
We validated ILSM in both scenarios on Dataset A. To simulate supervised annotation, we directly used the manually annotated landmarks as the corrected landmarks for training. Compared with the average DSC of 0.88 ± 0.06 achieved with supervised annotation, our method achieves an average DSC of 0.85 ± 0.06 with unsupervised annotation. This is still more accurate than the inter-operator variability (DSC 0.81 ± 0.06). Therefore, if a specific IGRT workflow permits only very limited manual operations, our method can be employed in the unsupervised annotation mode while retaining acceptable accuracy.
It is worth noting that, compared with previous methods [9], [10], [23], which require precise manual segmentation of the entire prostate in the training treatment images, our method only requires the annotation of at most seven anatomical landmarks, which dramatically reduces the physicians' manual annotation effort. To quantify this, we recorded the annotation time of an experienced radiation oncologist on the 19 treatment scans of one patient: manually segmenting the entire prostate takes 11.7 ± 2.5 min per scan, while annotating the seven anatomical landmarks takes only 1.2 ± 0.3 min. If the proposed method is used to automatically detect the seven landmarks and the radiation oncologist is only asked to verify and edit the detected landmarks, the landmark annotation time is further reduced to 8.3 ± 1.3 s.
3) Generalization
A learning-based algorithm must generalize well in order to be applicable to data from different institutions and scanners. To evaluate the generalization of ILSM, we tested our localization algorithm on Dataset B, which was acquired with a different scanner than Dataset A. We used all CT scans of Dataset A (349 scans) to train the population-based landmark detectors for Dataset B. The localization accuracy is shown in Table X and indicates good generalization of our method. This is mainly due to the “selective memory” nature of ILSM: even when the population landmark detectors are trained on a dataset with slightly different scanning protocols, after being “personalized” by ILSM, the portion of the population-based appearance knowledge that does not accord with the current patient-specific characteristics is discarded. By preserving only the applicable knowledge learned from the population data, the generalization of the learned detectors is improved. Table X summarizes the overall performance of our method on all 32 patients.
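To make the “selective memory” idea concrete, the sketch below shows one simplified way such pruning and patient-specific refinement could be realized with a boosted ensemble of weak classifiers. This is an illustrative stand-in (decision stumps, a better-than-chance pruning criterion, AdaBoost-style reweighting), not the actual cascade implementation used in this paper:

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def backward_prune(weak_classifiers, X_patient, y_patient, min_accuracy=0.5):
    """Discard population-learned weak classifiers that contradict the
    patient-specific samples (X_patient: features, y_patient: labels in {-1, +1})."""
    kept = []
    for clf, alpha in weak_classifiers:            # (classifier, boosting weight) pairs
        acc = np.mean(clf.predict(X_patient) == y_patient)
        if acc >= min_accuracy:                    # obsolete knowledge is dropped
            kept.append((clf, alpha))
    return kept

def forward_learn(weak_classifiers, X_patient, y_patient, n_new=10):
    """Append new weak classifiers trained on the patient-specific data,
    using a simplified AdaBoost-style reweighting of the samples."""
    ensemble = list(weak_classifiers)
    for _ in range(n_new):
        scores = (np.sum([a * clf.predict(X_patient) for clf, a in ensemble], axis=0)
                  if ensemble else np.zeros(len(y_patient)))
        weights = np.exp(-y_patient * scores)      # emphasize currently misclassified samples
        weights /= weights.sum()
        stump = DecisionTreeClassifier(max_depth=1).fit(
            X_patient, y_patient, sample_weight=weights)
        err = np.clip(weights[stump.predict(X_patient) != y_patient].sum(), 1e-6, 1 - 1e-6)
        alpha = 0.5 * np.log((1 - err) / err)
        ensemble.append((stump, alpha))
    return ensemble
```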
4) Sensitivity to Landmark Selection
To assess the sensitivity of our method to landmark selection, we tested the performance of our algorithm by excluding each of the seven landmarks in turn. Table XII lists the DSCs of our method on Dataset A when each landmark is excluded. Overall, the performance is quite consistent no matter which subset of six landmarks is used. Interestingly, excluding any of the landmarks in {PC, BS, AP} actually increases the localization accuracy slightly, compared to the DSC of 0.88±0.06 obtained using all seven landmarks. The reason can be inferred from Table V: compared with the other landmarks, the detections of PC, BS, and AP are less accurate owing to the indistinct image appearance in those regions, so removing any of them helps the overall localization accuracy. For a more thorough discussion of landmark selection, please refer to Section V.
Table XII. Sensitivity to Landmark Selection. The Table Below Shows the Localization Accuracy Obtained When Each of the Seven Landmarks Used in the Paper Is Excluded in Turn. Reported Values Are Computed on Dataset A.
| Excluded | PC | RT | LF | PT | AT | BS | AP |
|---|---|---|---|---|---|---|---|
| DSC | 0.89±0.05 | 0.88±0.06 | 0.88±0.06 | 0.88±0.06 | 0.88±0.05 | 0.89±0.05 | 0.89±0.05 |
5) Temporal Analysis of Localization Accuracy
Fig. 15 shows the localization accuracy as a function of the number of patient-specific training images used. Not surprisingly, the localization accuracy of the proposed method increases as more patient-specific training data (i.e., images and shapes) become available. The most significant improvement occurs when the number of patient-specific training images increases from 1 to 3. As the number of patient-specific training images reaches 5, the localization accuracy levels off, which indicates that after the fourth treatment day the patient-specific landmark detectors are sufficiently accurate and no further incremental learning is needed; the existing landmark detectors can be directly applied to localize the prostate in the subsequent treatment images. In practice, considering that a course of radiation treatment typically takes 35 days, the ILSM procedure is only needed in the first four treatment days (about 4/35 ≈ 11% of the entire treatment course).
6) Speed
The typical runtime of our method to localize the prostate is around 4 s (on an Intel Q6600 2.4 GHz desktop with 4 GB memory), which is near real-time and substantially faster than previous methods. Thanks to incremental learning, the training time is reduced from 3–4 h (traditional population-based training) to about 30 min per landmark detector. Each landmark detector is independent and can therefore be trained in parallel, as sketched below. Moreover, the incremental learning can be completed overnight before the treatment day, so the learning step adds no extra time while the patient is being treated.
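A minimal sketch of this per-landmark parallelization using the Python standard library is shown below; `train_detector_for_landmark` is a hypothetical callable standing in for the incremental training of a single landmark detector:

```python
from concurrent.futures import ProcessPoolExecutor

def train_all_detectors(landmark_names, train_detector_for_landmark, max_workers=7):
    """Train one incremental landmark detector per anatomical landmark in
    parallel processes (the detectors are mutually independent)."""
    with ProcessPoolExecutor(max_workers=max_workers) as pool:
        trained = list(pool.map(train_detector_for_landmark, landmark_names))
    return dict(zip(landmark_names, trained))
```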
H. Experiment Summary
In summary, our experiments show the following.
- Compared to traditional learning schemes, ILSM achieves better landmark detection and prostate localization accuracy.
- Compared to other state-of-the-art methods, 1) our method achieves comparable accuracy at a much faster speed; 2) our method can be applied on any treatment day of radiotherapy, since it remains reasonably accurate (DSC ∼0.85) even with only one patient-specific training image (i.e., the planning CT); and 3) our method only requires annotations of seven anatomical landmarks, thus significantly reducing physicians' manual efforts (from roughly 11 min to 1 min).
- Validated on 446 treatment CTs, our method achieves an average DSC of 0.89±0.06 in about 4 s, which satisfies both the accuracy and speed requirements of IGRT.
V. Conclusion and Discussion
In this paper, we propose a novel learning scheme, ILSM, which takes both generalization and specificity into account by leveraging a large amount of population data together with the limited amount of patient-specific data. It is applied to extract patient-specific appearance information for anatomical landmark detection in IGRT. Once the anatomical landmarks in the new treatment CT are localized, a multi-atlas RANSAC procedure aligns the previous patient-specific shape atlases for prostate localization. Validated on a large dataset (446 CT scans), ILSM shows accuracy (DSC 0.89 ± 0.06) comparable to the state-of-the-art methods while reducing the runtime to about 4 s. Moreover, in comparison with traditional learning schemes (e.g., population learning, pure patient-specific learning, and mixture learning with population and patient-specific data), ILSM captures patient-specific appearance characteristics from limited patient-specific data more effectively.
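For illustration, the sketch below shows the core of a RANSAC-style rigid alignment [35] between the landmarks detected in a treatment CT and the corresponding landmarks of a single shape atlas; the paper fuses several patient-specific atlases, and the subset size and inlier threshold here are illustrative assumptions:

```python
import itertools
import numpy as np

def rigid_fit(src, dst):
    """Least-squares rigid transform (R, t) mapping src points onto dst points."""
    src_c, dst_c = src.mean(axis=0), dst.mean(axis=0)
    U, _, Vt = np.linalg.svd((src - src_c).T @ (dst - dst_c))
    R = Vt.T @ U.T
    if np.linalg.det(R) < 0:               # avoid reflections
        Vt[-1] *= -1
        R = Vt.T @ U.T
    return R, dst_c - R @ src_c

def ransac_align(atlas_pts, detected_pts, subset_size=4, inlier_mm=3.0):
    """Choose the rigid transform supported by the most detected landmarks,
    so that a few mis-detected landmarks cannot corrupt the alignment.
    With only seven landmarks, all subsets can be enumerated exhaustively."""
    n = len(atlas_pts)
    best_transform, best_inliers = None, -1
    for idx in itertools.combinations(range(n), subset_size):
        R, t = rigid_fit(atlas_pts[list(idx)], detected_pts[list(idx)])
        residual = np.linalg.norm(atlas_pts @ R.T + t - detected_pts, axis=1)
        inliers = int((residual < inlier_mm).sum())
        if inliers > best_inliers:
            best_transform, best_inliers = (R, t), inliers
    return best_transform
```

The atlas shapes aligned in this way can then be fused (e.g., by simple shape averaging) to produce the final prostate localization.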
To boost the performance, our method could be combined with more sophisticated segmentation methods (e.g., deformable models [14], [42]) for better accuracy. However, such methods gain accuracy at the expense of runtime efficiency. Since our method already localizes the prostate much more accurately than the inter-operator variability, it satisfies both the efficiency and accuracy requirements of IGRT. The comparison with conventional bone alignment-based prostate localization also suggests that our landmark-based alignment is better in terms of both efficiency and accuracy. Therefore, our method can replace bone alignment, the standard preprocessing step of most CT prostate segmentation methods, and thereby further improve their performance.
Another interesting problem to investigate is landmark selection. In the current application, selecting landmarks is relatively easy owing to the ellipsoid-like shape of the prostate. However, in applications dealing with organs of complex shapes (e.g., the rectum), it may not be intuitive to decide which landmarks are good for organ localization. In general, we suggest two basic principles for selecting a good set of landmarks: 1) the image appearance of a selected landmark should be distinct within its neighborhood and consistent across the training images; and 2) the selected landmarks should be sparsely distributed over the organ boundary. The distinctness and consistency in principle 1) aim at accurate landmark detection, while principle 2) ensures that the selected landmarks can represent the overall organ shape. In practice, given a set of training images with manual labels of the organ of interest, we can first generate a surface mesh from the label map of each image. Surface registration can then be used to build correspondences between any two surface meshes. Once the correspondences are known, we can select a set of landmarks that satisfy the two principles, as sketched below. The quantification of these principles and the formulation of landmark selection as an optimization problem will be the focus of future work.
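One simple way such a selection could be operationalized, assuming vertex-wise correspondences are available and an appearance distinctness/consistency score has been precomputed per vertex, is a greedy trade-off between distinctness and spatial spread; both the score and the weighting below are hypothetical:

```python
import numpy as np

def select_landmarks(vertices, distinctness, n_landmarks=7, spread_weight=1.0):
    """Greedily pick boundary vertices that are appearance-distinct (principle 1)
    and far from the already selected landmarks (principle 2).

    vertices:      (N, 3) corresponding boundary vertex coordinates (e.g., mean shape)
    distinctness:  (N,)   precomputed appearance distinctness/consistency score
    """
    selected = [int(np.argmax(distinctness))]            # most distinct vertex first
    for _ in range(n_landmarks - 1):
        dist_to_selected = np.min(
            np.linalg.norm(vertices[:, None, :] - vertices[selected][None, :, :], axis=2),
            axis=1)
        score = np.asarray(distinctness, dtype=float) + spread_weight * dist_to_selected
        score[selected] = -np.inf                        # never re-pick a vertex
        selected.append(int(np.argmax(score)))
    return selected
```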
To apply our method in clinical practice, we suggest three workflows that satisfy different speed and accuracy requirements of prostate localization, as illustrated in Fig. 16. Workflow 1 has the advantage of full automation but only moderate localization accuracy (DSC 0.85 ± 0.06). Workflow 2 involves minimal manual effort to adjust the automatically detected landmarks after treatment (8.3 ± 1.3 s for quick verification) but achieves high localization accuracy (DSC 0.89 ± 0.06). Compared with workflows 1 and 2, workflow 3 achieves near-optimal localization accuracy (DSC 0.92 ± 0.03) but requires clinicians to verify the landmarks during the treatment period. In both workflow 2 and workflow 3, clinicians are given the option to edit the automatic localization results; however, since both workflows already achieve high localization accuracy, manual editing is rarely needed.
Acknowledgments
This work was supported in part by the National Institutes of Health (NIH) under Grant CA140413 and Grant DE022676. This work was also supported in part by the National Research Foundation under Grant 2012-005741, funded by the Korean government.
Footnotes
Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/TMI.2013.2291495
Contributor Information
Yaozong Gao, Email: yzgao@cs.unc.edu, Department of Computer Science and the Department of Radiology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599 USA.
Yiqiang Zhan, Email: yiqiang@gmail.com, SYNGO Division, Siemens Medical Solutions, Malvern, PA 19355 USA.
Dinggang Shen, Email: dgshen@med.unc.edu, Biomedical Research Imaging Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599 USA, and also with the Department of Brain and Cognitive Engineering, Korea University, Seoul 136-701, Korea.
References
- 1. Shen D, Lao Z, Zeng J, Zhang W, Sesterhenn IA, Sun L, Moul JW, Herskovits EH, Fichtinger G, Davatzikos C. Optimized prostate biopsy via a statistical atlas of cancer spatial distribution. Med Image Anal. 2004;8(2):139–150. doi: 10.1016/j.media.2003.11.002.
- 2. Zhan Y, Shen D, Zeng J, Sun L, Fichtinger G, Moul J, Davatzikos C. Targeted prostate biopsy using statistical image analysis. IEEE Trans Med Imag. 2007 Jun;26(6):779–788. doi: 10.1109/TMI.2006.891497.
- 3. Xing L, Thorndyke B, Schreibmann E, Yang Y, Li TF, Kim GY, Luxton G, Koong A. Overview of image-guided radiation therapy. Med Dosimetry. 2006;31(2):91–112. doi: 10.1016/j.meddos.2005.12.004.
- 4. Dawson L, Jaffray D. Advances in image-guided radiation therapy. J Clin Oncol. 2007;25(8):938–946. doi: 10.1200/JCO.2006.09.9515.
- 5. Liu W, Qian J, Hancock SL, Xing L, Luxton G. Clinical development of a failure detection-based online repositioning strategy for prostate IMRT—Experiments, simulation, and dosimetry study. Med Phys. 2010;37(10):5287–5297. doi: 10.1118/1.3488887.
- 6. Costa MJ, Delingette H, Novellas S, Ayache N. Automatic segmentation of bladder and prostate using coupled 3-D deformable models. Med Image Comput Comput Assist Interv. 2007;10(Pt 1):252–260. doi: 10.1007/978-3-540-75757-3_31.
- 7. Chen S, Lovelock DM, Radke RJ. Segmenting the prostate and rectum in CT imagery using anatomical constraints. Med Image Anal. 2011;15(1):1–11. doi: 10.1016/j.media.2010.06.004.
- 8. Foskey M, Davis B, Goyal L, Chang S, Chaney E, Strehl N, Tomei S, Rosenman J, Joshi S. Large deformation three-dimensional image registration in image-guided radiation therapy. Phys Med Biol. 2005;50(24):5869. doi: 10.1088/0031-9155/50/24/008.
- 9. Li W, Liao S, Feng Q, Chen W, Shen D. Learning image context for segmentation of prostate in CT-guided radiotherapy. 2011:570–578. doi: 10.1007/978-3-642-23626-6_70.
- 10. Liao S, Gao Y, Lian J, Shen D. Sparse patch-based label propagation for accurate prostate localization in CT images. IEEE Trans Med Imag. 2013 Feb;32(2):419–434. doi: 10.1109/TMI.2012.2230018.
- 11. Gao Y, Liao S, Shen D. Prostate segmentation by sparse representation based classification. In: Ayache N, Delingette H, Golland P, Mori K, editors. Medical Image Computing and Computer-Assisted Intervention MICCAI 2012. Vol. 7512, Lecture Notes in Computer Science. Berlin, Germany: Springer; 2012. pp. 451–458.
- 12. Gao Y, Zhan Y, Shen D. Incremental learning with selective memory (ILSM): Towards fast prostate localization for image guided radiotherapy. In: Mori K, Sakuma I, Sato Y, Barillot C, Navab N, editors. Medical Image Computing and Computer-Assisted Intervention MICCAI 2013. Vol. 8150, Lecture Notes in Computer Science. Berlin, Germany: Springer; 2013. pp. 378–386.
- 13. Shi Y, Qi F, Xue Z, Chen L, Ito K, Matsuo H, Shen D. Segmenting lung fields in serial chest radiographs using both population-based and patient-specific shape statistics. IEEE Trans Med Imag. 2008 Apr;27(4):481–494. doi: 10.1109/TMI.2007.908130.
- 14. Zhang S, Zhan Y, Dewan M, Huang J, Metaxas DN, Zhou XS. Towards robust and effective shape modeling: Sparse shape composition. Med Image Anal. 2012;16(1):265–277. doi: 10.1016/j.media.2011.08.004.
- 15. Pizer S, Fletcher P, Joshi S, Gash A, Stough J, Thall A, Tracton G, Chaney E. A method and software for segmentation of anatomic object ensembles by deformable m-reps. Med Phys. 2005;32(5):1335–1345. doi: 10.1118/1.1869872.
- 16. Freedman D, Radke RJ, Tao Z, Yongwon J, Lovelock DM, Chen GTY. Model-based segmentation of medical imagery by matching distributions. IEEE Trans Med Imag. 2005 Mar;24(3):281–292. doi: 10.1109/tmi.2004.841228.
- 17. Feng Q, Foskey M, Tang S, Chen W, Shen D. Segmenting CT prostate images using population and patient-specific statistics for radiotherapy. 2009:282–285. doi: 10.1109/ISBI.2009.5193039.
- 18. Wu G, Qi F, Shen D. Learning-based deformable registration of MR brain images. IEEE Trans Med Imag. 2006 Sep;25(9):1145–1157. doi: 10.1109/tmi.2006.879320.
- 19. Yang J, Shen D, Davatzikos C, Verma R. Diffusion tensor image registration using tensor geometry and orientation features. In: Metaxas D, Axel L, Fichtinger G, Szkely G, editors. Medical Image Computing and Computer-Assisted Intervention MICCAI 2008. Vol. 5242, Lecture Notes in Computer Science. Berlin, Germany: Springer; 2008. pp. 905–913.
- 20. Shen D, Wong W, Ip H. Affine-invariant image retrieval by correspondence matching of shapes. 1999 May:489–499.
- 21. Zhan Y, Ou Y, Feldman M, Tomaszeweski J, Davatzikos C, Shen D. Registering histologic and MR images of prostate for image based cancer detection. Acad Radiol. 2007;14(11):1367–1381. doi: 10.1016/j.acra.2007.07.018.
- 22. Liao S, Shen D. A feature-based learning framework for accurate prostate localization in CT images. IEEE Trans Image Process. 2012 Aug;21(8):3546–3559. doi: 10.1109/TIP.2012.2194296.
- 23. Gao Y, Liao S, Shen D. Prostate segmentation by sparse representation based classification. Med Phys. 2012;39(10):6372–6387. doi: 10.1118/1.4754304.
- 24. Shi Y, Liao S, Gao Y, Zhang D, Gao Y, Shen D. Prostate segmentation in CT images via spatial-constrained transductive lasso. Proc 2013 IEEE Conf Comput Vis Pattern Recognit. 2013:2227–2234. doi: 10.1109/CVPR.2013.289.
- 25. Haas B, Coradi T, Scholz M, Kunz P, Huber M, Oppitz U, Andr L, Lengkeek V, Huyskens D, Esch AV, Reddick R. Automatic segmentation of thoracic and pelvic CT images for radiotherapy planning using implicit anatomic knowledge and organ-specific segmentation strategies. Phys Med Biol. 2008;53(6):1751. doi: 10.1088/0031-9155/53/6/017.
- 26. Ghosh P, Mitchell M. Segmentation of medical images using a genetic algorithm. Proc 8th Annu Conf Genetic Evolut Computat. 2006:1171–1178.
- 27. Klein S, van der Heide UA, Lips IM, van Vulpen M, Staring M, Pluim JPW. Automatic segmentation of the prostate in 3-D MR images by atlas matching using localized mutual information. Med Phys. 2008;35(4):1407–1417. doi: 10.1118/1.2842076.
- 28. Toth R, Madabhushi A. Multifeature landmark-free active appearance models: Application to prostate MRI segmentation. IEEE Trans Med Imag. 2012 Aug;31(8):1638–1650. doi: 10.1109/TMI.2012.2201498.
- 29. Zhan Y, Shen D. Deformable segmentation of 3-D ultrasound prostate images using statistical texture matching method. IEEE Trans Med Imag. 2006 Mar;25(3):256–272. doi: 10.1109/TMI.2005.862744.
- 30. Yan P, Xu S, Turkbey B, Kruecker J. Adaptively learning local shape statistics for prostate segmentation in ultrasound. IEEE Trans Biomed Eng. 2011 Mar;58(3):633–641. doi: 10.1109/TBME.2010.2094195.
- 31. Zhan Y, Shen D. Automated segmentation of 3-D US prostate images using statistical texture-based matching method. In: Ellis R, Peters T, editors. Medical Image Computing and Computer-Assisted Intervention MICCAI 2003. Vol. 2878, Lecture Notes in Computer Science. Berlin, Germany: Springer; 2003. pp. 688–696.
- 32. Polikar R, Upda L, Upda SS, Honavar V. Learn++: An incremental learning algorithm for supervised neural networks. IEEE Trans Syst, Man, Cybern Part C: Appl Rev. 2001 Nov;31(4):497–508.
- 33. Diehl CP, Cauwenberghs G. SVM incremental learning, adaptation and optimization. Proc Int Joint Conf Neural Netw. 2003;4:2685–2690.
- 34. Ross D, Lim J, Lin RS, Yang MH. Incremental learning for robust visual tracking. Int J Comput Vis. 2008;77(1):125–141.
- 35. Fischler M, Bolles R. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Commun ACM. 1981;24(6):381–395.
- 36. Viola P, Jones M. Robust real-time face detection. Int J Comput Vis. 2004;57(2):137–154.
- 37. Zhan Y, Dewan M, Zhou XS. Auto-alignment of knee MR scout scans through redundant, adaptive and hierarchical anatomy detection. Proc 22nd Int Conf Inf Process Med Imag. 2011:111–122. doi: 10.1007/978-3-642-22092-0_10.
- 38. Zhan Y, Dewan M, Harder M, Krishnan A, Zhou XS. Robust automatic knee MR slice positioning through redundant and hierarchical anatomy detection. IEEE Trans Med Imag. 2011 Dec;30(12):14. doi: 10.1109/TMI.2011.2162634.
- 39. Zhan Y, Zhou X, Peng Z, Krishnan A. Active scheduling of organ detection and segmentation in whole-body medical images. Vol. 5241, Lecture Notes in Computer Science, ch. 38. Berlin, Germany: Springer; 2008. pp. 313–321.
- 40. Sun Y, Genton M. Functional boxplots. J Computat Graph Stat. 2011;20(2):316–334.
- 41. Fischer B, Modersitzki J. FLIRT: A flexible image registration toolbox. In: Biomedical Image Registration. Vol. 2717, Lecture Notes in Computer Science. Berlin, Germany: Springer; 2003. pp. 261–270.
- 42. Zhang S, Zhan Y, Metaxas DN. Deformable segmentation via sparse representation and dictionary learning. Med Image Anal. 2012;16(7):1385–1396. doi: 10.1016/j.media.2012.07.007.