Abstract
Purpose
To develop an accurate, fast, and robust algorithm for registering portal and computed tomographic (CT) images for radiotherapy using a combination of sparse and dense field data that complement each other.
Methods and Materials
Gradient Feature Weighted Minimax (GFW Minimax) method was developed to register multiple portal images to three-dimensional CT images. Its performance was compared with that of three others: Minimax, Mutual Information, and Gilhuijs' method. Phantom and prostate cancer patient images were used. Effects of registration errors on tumor control probability (TCP) and normal tissue complication probability (NTCP) were investigated as a relative measure.
Results
Registration of four portals to CTs resulted in 30% lower error when compared with registration with two portals. Computation time increased by nearly 50%. GFW Minimax performed the best, followed by Gilhuijs' method, the Minimax method, and Mutual Information.
Conclusions
Using four portals instead of two lowered the registration error. Reduced fields of view images with full feature sets gave similar results in shorter times as full fields of view images. In clinical situations where soft tissue targets are of importance, GFW Minimax algorithm was significantly more accurate and robust. With registration errors lower than 1 mm, margins may be scaled down to 4 mm without adversely affecting TCP and NTCP.
Keywords: Multimodality image registration, Prostate radiotherapy, Patient positioning, Tumor control probability, NTCP
INTRODUCTION
The aim of three-dimensional conformal radiotherapy (3DCRT) is to deliver a precise dose of radiation to the tumor, while sparing adjacent normal tissues. To achieve maximum benefit, radiation beams from multiple directions are shaped to conform to the projected outline of the tumor. Recent developments such as intensity-modulated radiotherapy (IMRT) modulate the intensities within each beam using volumetric imaging data (1). Treatment planning is usually done using three-dimensional computed tomographic (3DCT) images and computerized dose calculations to realize an intensity pattern that best corresponds to the tumor shape. As the dose is delivered in multiple fractions, in both 3DCRT and IMRT, correct patient positioning is crucial to realizing the maximum potential of the therapy. Traditionally, patient positioning parameters are estimated by a visual comparison of portal and simulator images. Investigators have reported that in 22% of the cases, errors between 5 and 10 mm are common (2). Immobilization devices and surgically implanted gold fiducials have minimized errors to some extent (3), but have often proved to be inadequate because of patient discomfort, cost, and internal organ motion between and during treatment fractions (4, 5). To overcome these limitations, the planning margins around the target are chosen to be large enough to include maximum excursions outside the targeted volume. This approach compromises normal tissue safety. Currently, the target volume is delineated by a clinician through the use of a 3D data set such as CT or another 3D imaging modality. A plan is then developed that takes into account microscopic tumor spread, organ motion, and patient positioning uncertainty that will deliver radiation to the tumor while causing tolerable harm to normal tissues. To confirm that the treatment is being administered as prescribed, digitally reconstructed radiographs (DRRs) from the 3DCT data set or kilovoltage radiographs (simulator films) are used as templates to guide corrections to the patient position as documented on the linear accelerator with megavoltage energy radiographs. Unless gold fiducial markers have been placed in a soft tissue target such as the prostate, skeletal bone is used as a surrogate structure for assessing positioning accuracy. All these manual matching or registration methods are laborious and time-consuming. Since the early 1990s, considerable research has been devoted to automating the process of patient positioning resulting in a multitude of methods aimed at registering 3DCT data to portal images.
The most popular registration methods may be classified as dense field (intensity-based) and sparse field (feature-based) techniques. Dense field methods are robust to noise and blur, but are computationally expensive. Lemieux utilized an automated pixel intensity correlation-based method to register 3DCT data to two portal images (6). Berger and Gerig used a least-squares template matching algorithm to register DRRs to portals (7). Hristov and Fallone proposed a correlation-based registration method to align portal and simulator images (8). Dong and Boyer registered portal images to megavolt DRRs by optimizing the cross-correlation coefficient in regions of interest (9). Weese et al. employed a pattern intensity–based similarity measure for voxel-based registration of 3DCT images and intratreatment X-ray fluoroscopy images (10). All these methods are quite robust but are computationally expensive.
Sparse field methods extract homologous features and use them to estimate the alignment. Some methods extract natural features such as edges, ridges, and contours in the images, whereas others locate fiducial markers that have been implanted before imaging. Fritsch et al. introduced a multiscale method based on cores that extracts salient anatomic features for registration (11). Balter et al. used manually digitized open curve segments for registering two-dimensional projection radiographs (12). Gilhuijs et al. extracted ridge-like features from portal images and matched them to projections through CT data (13). In earlier attempts, Gilhuijs and van Herk used the arithmetic mean of a neighborhood of pixels sampled from a distance transformed image as the cost function for registration (14). Leszczynski et al. extended this cost function to include a root mean square distance (15). In 1995, Gilhuijs described a method that extracts bony ridges from images, to automatically register 3DCT data to orthogonal portal images for patient setup verification (16). Stoeckel et al. introduced tensor-based scaling parameters to suppress spurious ridges in portal images before registration (17). All these algorithms act on sparser data and are computationally less expensive, but are limited by the accuracy of feature extraction.
In this article, we present the results of registering multiple (n ≥ 2) two-dimensional portal images to 3DCT volumetric data using an iterative procedure called the Minimax entropy algorithm, which was initially a dense field metric. The initial algorithm registered 3DCT of a phantom to two orthogonal full field of view (FOV) portals (anterior-posterior [AP] and left-lateral [LL]) (18). The Minimax algorithm was extended to use a mixture of dense and sparse field information, and is termed the Gradient Feature Weighted Minimax (GFW Minimax) method. It has the flexibility to use up to four portal images with full and reduced fields of view (FFOV and RFOV). The RFOV portals are representative of views projected by 3DCRT or IMRT beams. The portals were acquired such that no two images represented the same views. The results of the algorithm were compared with the values estimated by two other popular algorithms: Mutual Information (MI) (19) and Gilhuijs' feature-based method (13, 14). All algorithms were tested on both phantom and patient data. We also present work done to increase the computational efficiency of the algorithm. This was done by estimating the probability density functions (pdfs) using a Support Vector Machine (SVM) based method instead of the initial Parzen Windows (PW) method.
As the issue of prostate positioning relative to the dose plan continues to be of interest, we investigated the effects of the errors in algorithm-derived patient setup parameters on the tumor control probability (TCP) and the normal tissue complication probability (NTCP). TCP is a measure of the effectiveness of the administered therapy in destroying the malignant cells within the target volume, and NTCP is an estimate of the risk to the neighboring normal organs. These two parameters can potentially be used to measure the success of a treatment plan (20, 21). The standard procedure is to manually contour the prostate on CT images and generate a planning target volume (PTV) by providing a margin of nearly 10 mm around it to ensure dose delivery to a clinical target volume (CTV) that encompasses the entire possible range of prostate locations. This results in normal tissue complications as large volumes of adjacent tissues receive high doses. The current focus is on treating tumors with escalated doses (>75 Gy) with highly accurate prostate localization and substantially smaller planning margins. Our efforts have been directed toward relating the uncertainty in prostate location to variations in TCP and NTCP after registration with the Minimax and GFW Minimax algorithms.
METHODS AND MATERIALS
The Minimax algorithm was discussed extensively in another paper (22). We shall introduce the essential steps of the algorithm and discuss the changes and extensions that were made since it was first introduced.
The Minimax algorithm
The algorithm has two steps that run iteratively to determine portal image segmentation and the registration parameters. Portal images are segmented into two regions: bone and background. The intensities in these regions are assumed to be drawn from two different distributions. The Max Step estimates the segmentation of the portal image into “bone” and “no-bone” (background) based on a posteriori estimates of the pdf. The Min Step then estimates the joint densities by using the segmentation labels computed in the Max Step. This (Max, Min) process continues until convergence. Within the framework, we use a Markov Random Field (MRF) strategy incorporating line processes [as in (23)] to model pixel-to-pixel correlations and also the boundaries between the “bone” and “no-bone” regions.
The mathematical setup is as follows (24): All images (3D and 2D) are converted to a vectorized form where each individual pixel or voxel can be indexed by a one-dimensional parameter i. First, let G = {gi, ∀i = (1, … , (I × J × K))} be a homogeneous random field with pdf, pG(G) [abbreviated as p(G)] with each outcome G being a 3DCT image of size I × J × K. Next, if T is a 6-parameter vector representing values for the three possible translations and three possible rotations in a rigid 3D transformation, we can let each outcome Y(T) of the random field Y(T) = {yi (T), ∀i = (1, … , I × J)} be an image of size I × J pixels that is formed by taking a projection through the 3DCT image and is referred to as a DRR. When T is conditioned on the image information, it is also a random vector (T). Finally, we let X = {xi, ∀i = (1, … , I × J)} be a homogeneous random field also with pdf p(X), whose each vectorized outcome X is a particular view MV portal image taken of the same patient that contains I × J pixels. The portal image and the DRR are usually of the same size, but if they are not, we evaluate the various entropy terms [P(.)ln P(.) in Eq. 2] on the overlapping regions of the two images. We assume that the pixels in each of the random fields are identically and independently distributed (iid). Thus, the pdf of the random field X can be factored as . For notational simplicity, we shall now write x(i) as xi and y(i, T) as yi. Here, segmentation information is incorporated into the problem of matching each portal image to a related DRR by considering the joint density function p(xi, yi) as a mixture density. Indicator variables with values of 0 or 1 signifying no-bone and bone classes are introduced at each pixel position to classify the content. Line processes based on edge information and intensity correlations between neighboring pixels are introduced at this stage to constrain the usually ill-posed problem of segmentation and to achieve robustness with respect to noise. Consequently, for any portal image p, we can realize a segmentation matrix that can be considered to be one outcome of a N × N random field Mp = {m(i)}, for i = 1, … , N2. For our purposes, at each spatial position i each random variable m is labeled as either “bone” or “no-bone”. We iteratively solve for pdfs related to the segmentation of any and all portal images p first and then use them to estimate a best overall transformation T between the planning CT space and the treatment space using a Minimax entropy strategy. This can be summarized as follows:
The Max Step:
(1) |
(2) |
where Pk (Mp) is the pdf for a segmentation Mp which is sampled from a possible set of segmentations for each portal image, given the current rigid transformation set: T. The iteration number is represented by k, and p is the index over the portals being segmented (typically 1, 2, or 4). X and Y are the one-dimensional MRFs drawn from the portal image and the 3DCT projected DRR related to it respectively. This inference assumes that pixel intensities are not independent and identically distributed (iid), but are correlated to their neighbors. The solution to Eq. 2 is as follows:
(3) |
Equation 3 estimates the probability that the segmentation class of the ith pixel is either “bone” or “no-bone”. The class labels are represented by a (or b) and are drawn from A which is the complete classification set. The density functions for the class labels are represented by P k–1a(xi, yi), and are estimated using the Parzen Windows method.
The Min Step
Each portal image's registration with its corresponding DRR, at every iteration step k, produces a distinct transformation parameter set . The global transformation set is a statistical union of these individual transformation sets:
(4) |
where each parameter set is a vector of its weighted components: , 0 ≤ αj ≤ 1, tj : {tx, ty, tz, θx, θy, θz}. Weighting was introduced because a particular pose estimates some parameters better than others. For instance, on registering the AP portal to its corresponding DRR, the parameters best estimated are tx, ty, and θz. The individual Ti values are computed iteratively as follows:
(5) |
where Ha (x, y) is the joint entropy (of the DRR and portal) for class a, and N represents the dimension (N × N) of the random field from which the portal images are sampled and F (T) is the cost function to be minimized. We follow the optimization technique of stochastic gradient descent (19) adapted for inclusion of segmentation information. Transformation parameters are updated according to the following strategy until convergence, or until a preset number of iterations are reached.
(6) |
where λ is the learning factor and I and J are randomly sampled sets of pixels used to estimate both the component density functions and joint entropy terms. Overall, in two iterative steps, this algorithm uses mixture densities to estimate pixel labels of bone and no-bone and arrives at an optimum transformation parameter set T.
Extensions to the algorithm
An efficient probability density estimation method
A crucial step in the Minimax algorithm is the computation of probability densities. In most imaging modalities pdfs are not easily predicted, but are estimated. The standard estimation methods are grouped under parametric, semiparametric, and nonparametric methods. A Gaussian is a standard example of a parametric pdf, whereas a finite mixture model such as a mixture of Gaussians is an example of a semiparametric pdf (25). However, in many cases, nonparametric methods are the models of choice because no prior assumptions about the forms of the pdfs are required. Examples are histograms, K-nearest neighbor method, and kernel-based estimators such as the Parzen Windows (26). Viola and Wells used the Parzen method in the context of their MI metric because it provides a very smooth estimate (19). However, smoothness is often achieved at a great computational expense because this method uses the full data set to compute estimates. Component density functions, , in the Max Step are estimated by the Parzen method as follows:
(7) |
where N is the sample size, and Gψ (.,.) is a Gaussian kernel with variance ψ and (x, y) is the reference point at which the density is estimated. The function interpolates between the sample points, with each point contributing an amount proportional to its distance from (x, y). It is apparent that the computation tends to be intensive if the sample set is large. Another constraint when using the Parzen method is that the sample set used to represent the density should not be the set from which the reference points are sampled. If N1 and N2 are the two sets, then at every point in set N2 we need to calculate the linear combination of N1 window functions, making the total N1 × N2 computations. This computation becomes prohibitively large with increasing N1 and N2.
In the current project, registering 512 × 512 × 400 sized 3DCT datasets to multiple portal images has proved to be highly time-consuming and takes approximately 10 min on a personal computer (2.4 GHz). This prompted us to look at computational speedup strategies. We replaced the Parzen method with the Support Vector Method (SVM) as suggested by Vapnik and Mukherjee (27) for estimating the density. The SVM method recasts this as a standard optimization problem which is then solved with less computational expense. We set up the optimization problem as follows: Let be an approximation to the distribution function corresponding to p(x), where θ (.) is the Heaviside function. As density estimation problems are known to be ill-posed (27), we introduce a regularization function ψ (p), such that
(8) |
We minimize this functional subject to the following constraints:
(9) |
where εl is a regularization parameter, Kh(.,.) is a non-negative kernel of width h, and εl are nonuniform weighting coefficients. Only a few of the βi are nonzero and the xi corresponding to those nonzero βi are called the support vectors (27). A judiciously chosen kernel function will have the smallest number of support vectors. Consequently, the support vector method of density estimation is analogous to the Parzen Windows method, but with a sparser data set, and consequently, a faster convergence. We implemented this in a MI framework earlier and discovered that the SVM method used approximately one-third the data and was approximately 2.5 times faster than the Parzen Windows method (28). We later included it as part of our Minimax algorithm.
Registration with multiple variable FOV portal images
In 3DCRT, the therapy beam is shaped to conform to the projected prostate shape, and portal images acquired on the electronic portal imaging device (EPID) do not represent a full FOV with all bony structures used to assess positioning accuracy. In IMRT, an incident beam may only deliver treatment to a subset of the target volume, further decreasing the value for registration purposes of transmission EPID images acquired with the therapy beam. In such cases, it is imperative that a registration algorithm should detect discernable structures within this truncated window (RFOV images) and accurately perform the registration. We introduce a window function W (Xi, ω(r, l)) which defines a subset of the portal image, with r and l representing the size and location of the window. Let S represent a segmentation map in the window. This segmentation map in the portal window must be registered to one of a set of similar windows in a projected DRR containing a similar segmentation distribution. The conditional probability distribution of the portal window is:
(10) |
where Ψ (.;.,.) is a distribution of the intensities in the window with a mean μ and a standard deviation σ, and as in the previous section, “The Minimax algorithm,” M is a particular overall segmentation map of which S is a subset. It follows from Bayes' theorem
(11) |
The modified Max Step may be expressed as follows:
(12) |
where k is the iteration number. This is similar to Eq. 2 with modifications to accommodate the reduced information in truncated windows. The function estimated in the previous step is used in calculating the transformation T in the Min Step. Mathematically,
(13) |
In our modified algorithm we used RFOV portals with sparser sets of data points, but this drawback was offset by the addition of extra portal images (n = 4).
Gradient feature weighted intensity matching: GFW Minimax
Sparse field registration is accurate but susceptible to noise, whereas dense field registration is more robust to noise and other factors, but can be computationally expensive and sometimes less precise. A key development in our efforts to enhance the Minimax registration was to combine the sparse and dense field approaches. The advantages of each could be seized, while avoiding the liabilities. In our efforts we were guided by some earlier approaches (29, 30). The implementation of this method proceeds along the lines described earlier in the section, “The Minimax algorithm,” but with the incorporation of sparse field information. At a given transformation parameter set T, each portal image and its corresponding DRR are resized to a preset scale γ and contrast enhanced. Gradient operators (∇=∂/∂x + ∂/∂y) are applied to both images: ∇Xp(γ) and ∇Yp(T(k−1),γ) and the edges thus recovered are sampled uniformly, depending upon the scale factor γ. At corresponding points in both images, angles (α) between these gradient vectors are computed as inverse cosines of the normalized scalar products of the vectors:
(14) |
The angles are incorporated into weighting factors as w(γ) = (cos(2α) + 1) / 2. The factors are subsequently multiplied with the minimum of the two gradient magnitudes: min{|∇Xp(γ)|, |∇Yp(T(k−1),γ)|}, to create a scalar variable Γ(Xp(γ), Yp(T(k−1)γ)). Each contributing sample point's intensity is weighted with Γ(.,.). After incorporating sparse field information, the new transformation parameter set in the Min Step is computed. Thus strong gradients have a greater emphasis and the corresponding points (typically edges and ridges of bony segments) drive the registration process. This mixed approach was particularly helpful with low-contrast portal images.
Influence of registration errors on TCP and NTCP
The algorithm that we found to be most accurate (GFW Minimax) was used to compensate for setup variations in patient positioning. The planned dose profiles were blurred and overlaid on the reference 3DCT image. The blurring was done to simulate positioning and organ (bladder, prostate, and rectum) location uncertainties between successive fractions, as was earlier done by van Herk et al. (31). Dose–volume histograms (DVH) which represent the cumulative absorbed radiation by a segmented organ were computed. Investigators have attempted to reduce DVH information to two simple quantities: the TCP and the NTCP (20, 21). Current models that compute TCP and NTCP mainly rely on a factor called tumor clonogen radiosensitivity (SF2) (21, 32). SF2 is defined as the fraction of tumor cells that survive a 2 Gy dose (33, 34). In our computation of TCP and NTCP, we also included algorithm-derived variations in prostate position (35, 36).
Implementation
Testing was done on both phantom and patient data sets.
Phantom data
Four 3DCT-to-portal registration algorithms were compared and contrasted using data obtained by imaging a bone-embedded-in-Plexiglas pelvic phantom. CT images of the phantom were acquired on a GE Lightspeed Spiral CT scanner (GE Healthcare, Waukesha, WI). Voxels were 1.0 mm in all dimensions. A volume rendering of the pelvis obtained from these data is shown in Fig. 1. Portal images (6 MV) were acquired on an amorphous silicon (aSi500, PortalVision; Varian Medical Systems, Palo Alto, CA) electronic portal imager (EPID) in the AP and LL directions. The sensitive active imaging area of the detector (array dimension) is 40 × 30 cm2. It has a 512 × 384 pixel (0.784 mm pitch) matrix. The portal images of the phantoms were acquired in 14 different combinations of translations or rotations relative to a fixed origin. This was performed by placing the phantom on the treatment couch, and moving the calibrated couch and gantry and the EPID imager accordingly. The calibrated movements consisting of 6 parameters (3 translations and 3 rotations) were recorded as ground truth. The details of the combinations are given in Table 1.
Table 1.
Param | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
tx | 0 | 2 | 0 | 0 | 20 | 0 | 0 | 10 | 0 | 0 | 0 | 0 | 0 | −4 |
ty | 0 | 0 | 2 | 0 | 0 | 20 | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 6 |
tz | 0 | 0 | 0 | 2 | 0 | 0 | 20 | 10 | 0 | 0 | 0 | 0 | 0 | −3 |
θxy | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 5 | 0 | 0 | 0 | 0 |
θyz | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 0 | 0 | 0 |
θzx | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 5 | −5 |
The four different automated or semiautomated 3DCT-to-2-portal rigid image registration approaches used to estimate the setup variation from the planning data (3DCT) to the treatment environment were:
Patient data
An important goal was to apply our fully automated algorithm to estimate actual patient setup variations. We acquired unique sets of data for 8 subjects that included a 3DCT reference image (with 1 mm3 voxels) at the initial planning day using our GE Lightspeed Spiral CT scanner. Subsequently, for 7 weeks, EPID images (AP and LL) were acquired weekly using a Varian EPID system or our film-based system. On these days, volumetric 3DCT images were also acquired to develop gold standard plans. For all acquisitions, the patient was constrained in a soft foam immobilization system, with the toes aligned vertically, in accordance with our clinical protocol and using the same parameter settings as Day 1. The methodologies for phantom and patient testing were essentially similar, except that Gilhuijs' method was not used on patient data. The MI algorithm served as a benchmark for comparison.
Dosimetry data
The 3DCRT patients received a dose of 75.6 Gy in 42 fractions, each nominally at 1.8 Gy with a dose margin of 10 mm around the prostate. CT images were acquired at resolutions of 0.9 mm × 0.9 mm × 1.0 mm. The prostate, rectum, and bladder were segmented and the CTV and PTV were drawn by a radiation oncologist. The dose distributions were determined for six 18-MV photon beams. The plans were generated at 2.0 mm resolution in all directions using a 3D planning system (Theraplan Plus 3.7, Nucletron, Veenendaal, The Netherlands). Once the prostate location uncertainties were known, the pdfs for the prostate dose (p(D)) were computed by interpolation for each location and the results archived on the computer. Dose plans were spatially scaled around the prostate to simulate narrower margins between 4 mm and 10 mm, and the process of determining p(D) for various positions was repeated and finally the DVH, TCP, and NTCP outcomes were computed.
RESULTS
The data used in this study consisted of phantom and patient data. When registering 3DCT data with four portals, the latter were simulated from phantom data. When comparing various segmentation/registration algorithms, we used 3DCTs and megavoltage portals that were obtained from patients and a phantom.
Registration using four portal images
A set of four simulated portal images were registered to 3DCT phantom data. The portal images of pixel size 1 mm were created by projecting through the phantom 3DCT. Two were the AP and LL portals (0° and 90°), and the other two were posterior oblique (135° and 215°). Accordingly, there is no duplication of information in any direction. Figure 2(a–d) shows the final segmentation in all four directions.
Figures 3a and 3b show results of registering two and four portals with initial misalignments of 10 mm and 10° in the x-y plane, in the presence of increasing Gaussian noise. Using four portals resulted in an error reduction of 30–50%. In both figures, the abscissae show noise in terms of the standard deviation. The addition of Gaussian noise of standard deviation σ implies that 34.13% of the additive noise will have gray level values in the range [0,σ]% of the maximum intensity of the image (8 bits, 255 gray levels in this case) (38). To estimate the effect of noise in portals, varying amounts of it were added to the images and signal-to-noise (SNR) ratios were computed. These portals were then registered to the 3DCT data and the absolute errors were noted. With no initial misalignment, an error-free run should converge to 0 mm. Figure 4 is a plot of SNR vs. the absolute translation error for registration with two and four portals. In practice, SNR values vary widely. Hilt et al. cited a value of 0.97 for a pelvic portal image which was then enhanced to 42.84 after intensity correction and noise reduction (39). In this experiment, using four portals resulted in a reduction of registration error. The major problem in using four portals was the computation time which was almost 1.5 times that required for two portals.
Registration using variable FOV portal images
Four portal images (256 × 256 pixels, pixel size: 1 mm) with varying FOV (RFOV) were simulated by placing a window centered at the prostate and then sequentially shrinking it. The resulting images captured a range of structures from the center of the image to the full feature set. Figure 5 displays plots of absolute translation and rotation errors vs. different window sizes. At a window size in the vicinity of 175 pixels (6 cm FOV), both plots make a transition beyond which the reduction error is not significant. We assume that a window of 175 pixels captured most of the tangible features in the images and a wider window did not necessarily add more information. However, we need to test this hypothesis more rigorously on a larger data set. The registration with four RFOV (35% reduction in size) portal images has converged to the true transformation values in a time frame comparable to the case when two FFOV portals were used. However, when four RFOV portals were used, the registration time was approximately 1.5 times longer than the situation when two FFOV portals were used, as previously noted.
Comparison of various 3DCT to multiple portal image registration algorithms using phantom data
Data consisting of 3DCT images and AP and LL portal images were acquired by imaging a bone-embedded-in-Plexiglas pelvic phantom. In this experiment, the ground truth data (Tgold) is assembled from the true estimates of the position settings as given in Table 1. The estimates of the six rigid mapping parameters using previously described algorithms are recorded as setup variation vectors Talg. For statistical comparisons, we used two factors: Algorithm Type consisting of four different possibilities, Setup Variation Parameters consisting of 3 translations (tx, ty, tz in voxels) and 3 rotations (θxy, θyz, θzx, in degrees). Dependent variables are error magnitudes between any one of the gold standard parameters and the matching algorithm-predicted parameters (e.g., ). A sample set of errors for one position setting is presented in Table 2. The results of using MI and the GFW Minimax algorithms to map several regions visualized and marked by a radiologist on the AP portal images to the candidate matched AP DRR are illustrated in Fig. 6a. A chart summarizing the results from applying the four algorithms to all 14 positions listed in Table 1 is shown in Fig. 6b. It displays the percentage improvement in error ((εMI − εalg) / εMI) for each algorithm when compared with MI. Of all metrics, MI's registration showed the largest deviation from the gold standard parameters. The dense field Minimax method and the bony ridge based methods (Gilhuijs' technique) fared better, but the GFW Minimax method performed the best by a significant margin for all translations and rotations (p < 0.05 corrected for multiple comparisons). The number of iterations (series of Max and Min steps) was constrained to be within 10. If the algorithm did not converge by this number, it would imply that it was not properly initialized, and one or more conditions such as the initial transformation parameters, source to patient or patient to EPID distances were not correctly set. However, in all the successful instances, the algorithm converged between 3 to 7 iterations for both versions of the Minimax method.
Table 2.
Param | MM1 | MM2 | MI | G |
---|---|---|---|---|
tx | 1.5 | −1.2 | 1.8 | −1.3 |
ty | 1.8 | −1.6 | −2.0 | 1.8 |
tz | 1.4 | 1.2 | 1.9 | 1.5 |
θxy | 1.6 | 1.1 | −1.9 | 1.3 |
θyz | 1.7 | 1.2 | −1.8 | −1.4 |
θzx | −1.6 | 1.2 | 1.8 | 1.3 |
The metrics used were Minimax (MM1), GFW Minimax (MM2), Mutual Information (MI), and Gilhuijs'method (G). Translations (in mm) in the x, y, and z directions are denoted by tx, ty, and tz and rotations (in degrees) in the xy, yz, and zx planes are denoted by θxy, θyz, and θzx respectively. The values in the table represent deviations from true values, which were tx = ty = tz = 10 mm and θxy = θyz = θzx = 0°. MM2 resulted in the smallest deviations from true values.
Comparison of various 3DCT to multiple portal image registration algorithms using patient data
In this experiment, we estimated the vector Tgold by performing 3D-to-3D registration of planning and treatment Day 3DCT images using normalized MI. To compare candidate 3DCT-to-dual-portal registrations with this gold standard, two 3DCT-to-2-portal rigid registrations were performed in sequence for each of the above algorithms for each patient:
A registration of the planning Day 3DCT to the two treatment day portals, from which the forward setup variation parameters Talg–setup are estimated.
A registration of treatment day portals to treatment Day 3DCT data, which estimates the position variation between the treatment couch and the 3DCT scan (performed in a separate room).
Composing the forward setup variation Talg–setup with the inverse of the treatment day position variation allows us to form an algorithm-specific rigid registration error from the gold standard that we use as our dependent variable for testing:
(15) |
where ○ indicates matrix multiplication. To compare algorithms, we followed the same procedure used in the subsection “Phantom data.” In this experiment, we did not use Gilhuijs' method because the bony landmarks in the portal images were not clearly delineated, and consequently, a considerable amount of contrast adjustment of the images would be required which detracts from the automated nature of the algorithms under comparison. For each of the two new, nonstandard algorithms (Minimax, GFW Minimax) we computed the percentage improvement of the registration compared with that produced by the MI method as: ((εMI − εT–alg) / εMI). The individual ε's were obtained via comparison to the gold standard. Statistically significant differences were found between the algorithms with MI showing the most error. Improvement was seen with the Minimax approach, and even greater gains were seen when using GFW Minimax. Figure 7a shows example results from 1 patient comparing traced bony ridges transferred from a projection through the 3DCT data to the AP portal image, using MI and GFW Minimax approach, to a tracing of the same structure on the portal image. Figure 7b summarizes the key findings here by plotting the percentage improvement over MI (see above) for the Minimax and GFW Minimax algorithms for 8 patients, compared across 4 (out of 6) weeks of treatment. This plot clearly indicates the improvement of the GFW Minimax algorithm over the other approaches (p < 0.05). The number of iterations when using the GFW Minimax method ranged between 4 to 7.
Effect of setup variations on treatment plan delivery
The uncertainties (standard deviation σ mm) in patient setup were computed after performing registration with the algorithm that gave the lowest errors: GFW Minimax. To understand the impact of these errors on therapy, parameters such as the TCP and NTCP were calculated using a standard linear-quadratic (LQ) model (21). Different margins (M), ranging from ⨆ 1–10 mm were simulated by shrinking the original dose plan about its isocenter. TCP and NTCP were calculated for the patients described in the subsection “Patient data” where weekly portals and CT scans (Day 7, Day 14, etc.) had been acquired in addition to the typical Day 0 CT. Bony structures as well as the segmented prostate, bladder, and rectum can be identified in each of the CTs shown in Fig. 8. Such information is potentially useful to compensate for the setup variation, and to better center the dose field on the prostate. The observations provided here are illustrative of the setup uncertainty occurring over all the fractions of therapy. The effect of such uncertainty is to make the dose distribution diffuse (40).
Contour plots of prostate TCP and rectum and bladder NTCP values in the (M, σ) domain are shown for one of these patients in Fig. 9. As illustrated in the Day 0, Day 7 CTs of Fig. 8, the prostate is shifted relative to the PTV during every dose fraction (or stage). The positions of the prostate from one stage to the next, related to patient positioning and internal organ motion issues, are generally independent from stage to stage. TCP is relatively insensitive to σ for the M = 10 mm currently used for external beam radiation therapy. However, with margin reduction the TCP becomes quite sensitive to setup error. Rectal NTCP and bladder NTCP are relatively insensitive to σ, because these organs can move either toward, or away from, the high dose region during the fractions of external beam radiation therapy. Thus NTCP is most sensitive to margin M. Although the maps are essentially the same for all the patients in current M and σ ranges, for tighter M and improved σ (i.e., reduced setup error) features of the maps become more patient-specific.
DISCUSSION
Although conformally planned escalated dose treatments can be beneficial for organ-confined prostate cancer (41, 42), the poor quality of portals in the treatment environment as well as the lack of robust, automatic registration of reference to treatment images has hindered treatment setup verification. Accurate assessment of treatment plan delivery is thus difficult. Neglecting such problems could lead to rectal or bladder injury or insufficient dose delivered to the prostate cancer. This is particularly critical for implementation of IMRT plans which have steep dose gradients at the beam edges. Hence, a crucial first step for effective radiotherapy is correct patient positioning. Toward this goal we introduced and improved upon a novel automated iterative registration and segmentation method called the Minimax algorithm.
Initially, the Minimax algorithm was a purely intensity-based metric. We extended it to include a combination of intensity and feature-based measures (GFW Minimax). We then compared our method to two other popular methods: MI and Gilhuijs' method. Our mixed approach gave the best results when registering patient data, whereas a purely feature-based approach was not very successful because of limited contrast in the portal images. However, with phantom data, both feature-based and mixed approaches worked very well, with the latter outperforming the former by a slight margin. This was due to the fact that in a phantom image formed from the pelvic bones, the only noticeable aspects were the edges and ridges. Differences in tissue contrast were negligible, which resulted in the purely intensity-based methods (MI and intensity-based Minimax) not doing all that well.
The other aim of this study was to effectively and efficiently register multiple portal images to 3DCT volume data. We did so using data drawn from simulations, phantom images, and patient images. Using simulations we verified that using four vs. two portal images reduces registration error by 30–50%, albeit at some computational expense. It would be instructive to correlate the reduction of error to the number of portal images used for registration. Although not entirely analogous, in a fully 3D-to-3D registration the number of views employed for registration is not a constraint. However, instead of pushing the computation to its limits, the registration process can be terminated once the point of diminishing returns is reached. In this case, that would be the maximum number of projections, beyond which the accuracy would cease to improve.
We also experimented with RFOV images, as in many cases the clinical images do not contain a full set of features. As is expected, registration with images with wider FOV results in lower errors. However, there was a transition point beyond which the reduction of error was not significant. In the current case, that was around a window size of 175 pixels (FOV of 6 cm). We believe that a stage was reached wherein most of the significant details in the images are now present, and consequently, any further widening of the window will not bring in more features. This conclusion needs to be tested on a larger set of images.
Once we compared algorithms and chose the optimum one (GFW Minimax), we decided to explore the correlation between registration errors or uncertainty in prostate position and the variability in TCP and NTCP. We should note that although registration errors contribute significantly to the uncertainty in prostate location, other sources of error are internal organ motion and inherent errors in the therapy machine. Our algorithm does not account for prostate motion as it only utilizes rigid pelvic bony landmarks. Our results demonstrate that if the position uncertainty is reduced to ~1 mm, the dose margins can be scaled down substantially from the current 10 mm to 4 mm, without adversely affecting TCP and NTCP. This emphasizes the crucial role played by imaging techniques in the administration of radiotherapy.
A logical extension to the work presented in this paper is to incorporate fully 3D-to-3D nonrigid registration into our algorithm. Our improvements and extensions were all based on rigid registration that cannot accommodate internal motion relative to the pelvic structure. This requires images superior in quality to the portal images. The evolving trend of daily conebeam 3DCT data acquisition would provide images with good soft tissue contrast, but will carry its own problems and challenges and will amplify the need for a fast 3D automated segmentation and registration metrics.
Acknowledgments
The authors thank Dr. Richard Peschel for providing patient data.
This work was primarily supported by NIH Grant R01EB002164 (formerly R01-CA80894). The support provided by Varian Medical Systems and Nucletron is also acknowledged.
REFERENCES
- 1.King CR, Lehmann J, Adler JR, et al. CyberKnife radiotherapy for localized prostate cancer: Rationale and technical feasibility. Technol Cancer Res Treat. 2003;2:25–30. doi: 10.1177/153303460300200104. [DOI] [PubMed] [Google Scholar]
- 2.Clippe S, Sarrut D, Malet C, et al. Patient setup error measurement using 3D intensity-based image registration techniques. Int J Radiat Oncol Biol Phys. 2003;56:259–265. doi: 10.1016/s0360-3016(03)00083-x. [DOI] [PubMed] [Google Scholar]
- 3.Song PY, Washington M, Vaida F, et al. A comparison of four patient immobilization devices in the treatment of prostate cancer patients with three dimensional conformal radiotherapy. Int J Radiat Oncol Biol Phys. 1996;34:213–219. doi: 10.1016/0360-3016(95)02094-2. [DOI] [PubMed] [Google Scholar]
- 4.van Herk M, Bruce A, Kroes AG, et al. Quantification of organ motion during conformal radiotherapy of the prostate by three-dimensional image registration. Int J Radiat Oncol Biol Phys. 1995;33:1311–1320. doi: 10.1016/0360-3016(95)00116-6. [DOI] [PubMed] [Google Scholar]
- 5.Stroom J, Koper P, Korevaar G, et al. Internal organ motion in prostate cancer patients treated in prone and supine treatment position. Radiother Oncol. 1999;51:237–248. doi: 10.1016/s0167-8140(99)00061-4. [DOI] [PubMed] [Google Scholar]
- 6.Lemieux L. A patient-to-computed-tomography image registration method based on digitally reconstructed radiographs. Med Phys. 1994;21:1749–1760. doi: 10.1118/1.597276. [DOI] [PubMed] [Google Scholar]
- 7.Berger M, Gerig G. Motion measurements in low-contrast X-ray imagery. In: Wells WM, Colchester A, Delp S, editors. Motion Measurements in Low-Contrast X-Ray Imagery. Vol. 1496. Springer; New York: 1998. pp. 832–841. [Google Scholar]
- 8.Hristov DH, Fallone BG. A gray level image alignment algorithm for registration of portal images and digitally reconstructed radiographs. Med Phys. 1996;23:75–84. doi: 10.1118/1.597743. [DOI] [PubMed] [Google Scholar]
- 9.Dong L, Boyer AL. An image correlation procedure for digitally reconstructed radiographs and electronic portal images. Int J Radiat Oncol Biol Phys. 1995;33:1053–1060. doi: 10.1016/0360-3016(95)02082-9. [DOI] [PubMed] [Google Scholar]
- 10.Weese J, Penney GP, Buzug TM, et al. CARS'97 Computer Assisted Radiology and Surgery. Berlin, Germany: Jun 25-28, 1997. 2D/3D registration of pre-operative CT images and intra-operative X-ray projections for image guided surgery. [Google Scholar]
- 11.Fritsch DS, Chaney EL, Boxwala A, et al. Core-based portal image registration for automatic radiotherapy treatment verification. Int J Radiat Oncol Biol Phys. 1995;33:1287–1300. doi: 10.1016/0360-3016(95)02092-6. [DOI] [PubMed] [Google Scholar]
- 12.Balter JM, Pelizzari CA, Chen GTY. Correlation of projection radiographs in radiation therapy using open curve segments and points. Med Phys. 1992;19:329–334. doi: 10.1118/1.596863. [DOI] [PubMed] [Google Scholar]
- 13.Gilhuijs KGA, van deVen PJH, van Herk M. Automatic three-dimensional inspection of patient setup in radiation therapy using portal images, simulator images, and computed tomography data. Med Phys. 1996;23:389–399. doi: 10.1118/1.597801. [DOI] [PubMed] [Google Scholar]
- 14.Gilhuijs KGA, van Herk M. Automatic on-line inspection of patient setup in radiation therapy using digital portal images. Med Phys. 1993;20:667–677. doi: 10.1118/1.597016. [DOI] [PubMed] [Google Scholar]
- 15.Leszczynski KW, Loosea S, Boyko S. An image registration scheme applied to verification of radiation therapy. Br J Radiother. 1998;71:413–426. doi: 10.1259/bjr.71.844.9659135. [DOI] [PubMed] [Google Scholar]
- 16.Gilhuijs K. Automated verification of radiation treatment geometry. Department of Radiotherapy, University of Amsterdam; Amsterdam: 1995. [Google Scholar]
- 17.Stoeckel J, Vos FM, Vos PH, et al. An evaluation of ridge extraction methods for portal imaging. In: Sanfelin A, Villanueva JJ, editors. Proceedings of the Third International Conference on Medical Robotics; Barcelona, Spain. September 3–7, 2000.pp. 433–436. [Google Scholar]
- 18.Bansal R, Staib L, Chen Z, et al. A Minimax entropy registration framework for patient setup verification in radiotherapy. Computer Aided Surgery. 1999;4:287–304. doi: 10.1002/(SICI)1097-0150(1999)4:6<287::AID-IGS1>3.0.CO;2-9. [DOI] [PubMed] [Google Scholar]
- 19.Viola P, Wells W. Alignment by maximization of mutual information. Int J Comp Vision. 1997;24:137–154. [Google Scholar]
- 20.Lyman JT. Complication probability as assessed from dose–volume histograms. Radiat Res. 1985;104:S13–S19. [PubMed] [Google Scholar]
- 21.Niemierko A, Goitein M. Implementation of a model for estimating tumor control probability for an inhomogeneously irradiated tumor. Radiat Oncol. 1993;29:140–147. doi: 10.1016/0167-8140(93)90239-5. [DOI] [PubMed] [Google Scholar]
- 22.Bansal R, Staib L, Chen Z, et al. A novel approach for the registration of 2D portal and 3DCT images for treatment setup verification in radiotherapy. MICCAI'98. 1998;1496:1075–1086. [Google Scholar]
- 23.Geman S, Geman D. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans Pattern Analysis and Machine Intelligence. 1984;6:721–741. doi: 10.1109/tpami.1984.4767596. [DOI] [PubMed] [Google Scholar]
- 24.Bansal R, Staib L, Chen Z, et al. Entropy-based dual-portal-to-3-DCT registration incorporating pixel correlation. IEEE Trans Med Imaging. 2003;22:29–49. doi: 10.1109/TMI.2002.806430. [DOI] [PubMed] [Google Scholar]
- 25.McLachlan GJ, Peel D. Finite mixture models. John Wiley & Sons; New York: 2000. [Google Scholar]
- 26.Izenman AJ. Recent developments in nonparametric density estimation. J Am Stat Assoc. 1991;86:205–224. [Google Scholar]
- 27.Vapnik N, Mukherjee S. Support vector method for multivariate density estimation. In: Solla SA, Leer TK, Muller KR, editors. Advances in neural information processing systems. MIT Press; Cambridge, MA: 2000. pp. 659–665. [Google Scholar]
- 28.Chelikani S, Purushothaman K, Duncan J. Support vector machine density estimator as a generalized Parzen windows estimator for mutual information based image registration. In: Ellis RE, Peters TM, editors. Medical image computing and computer-assisted intervention. Vol. 2879. Springer Verlag; New York: 2003. pp. 854–861. Lecture Notes in Computer Science. [Google Scholar]
- 29.Pluim J, Maintz J, Viergever M. Image registration by maximization of combined mutual information and gradient information. IEEE Trans Medical Imaging. 2000;19:809–814. doi: 10.1109/42.876307. [DOI] [PubMed] [Google Scholar]
- 30.Roche A, Pennec X, Malandain G, et al. Rigid registration of 3-D ultrasound with MR images: A new approach combining intensity and gradient information. IEEE Trans Medical Imaging. 2001;20:1038–1049. doi: 10.1109/42.959301. [DOI] [PubMed] [Google Scholar]
- 31.van Herk M, Remeijer P, Lebesque JV. Inclusion of geometric uncertainties in treatment plan evaluation. Int J Radiat Oncol Biol Phys. 2002;52:1407–1422. doi: 10.1016/s0360-3016(01)02805-x. [DOI] [PubMed] [Google Scholar]
- 32.Levegrun S, Jackson A, Zelefsky M, et al. Analysis of biopsy outcome after three-dimensional conformal radiation therapy of prostate cancer using dose distribution variables and tumor control probability models. Int J Radiat Oncol Biol Phys. 2000;47:1245–1260. doi: 10.1016/s0360-3016(00)00572-1. [DOI] [PubMed] [Google Scholar]
- 33.Roberts SA, Hendry JH. A realistic closed-form radiobiological model of clinical tumor-control data incorporating intertumor heterogeneity. Int J Radiat Oncol Biol Phys. 1998;41:689–699. doi: 10.1016/s0360-3016(98)00100-x. [DOI] [PubMed] [Google Scholar]
- 34.Scott SL, Gumerlock PH, Beckett L, et al. Survival and cell cycle kinetics of human prostate cancer cell lines after single- and multifraction exposures to ionizing radiation. Int J Radiat Oncol Biol Phys. 2004;59:219–227. doi: 10.1016/j.ijrobp.2004.01.027. [DOI] [PubMed] [Google Scholar]
- 35.Purushothaman K, Chelikani S, Chen Z, et al. Influence of position errors on tumor control in external beam therapy of prostate cancer: Concepts of TCP maps and probability densities [Abstract] Int J Radiat Oncol Biol Phys. 2003;57(Suppl):S338. [Google Scholar]
- 36.Purushothaman K, Chelikani S, Chen Z, et al. Variability of tumor control under localization uncertainty in external beam radiation therapy (EBRT) of prostate cancer [Abstract] Int J Radiat Oncol Biol Phys. 2004;60(Suppl):S267–S268. [Google Scholar]
- 37.Studholme C, Hill DLG, Hawkes DJ. An overlap invariant entropy measure of 3D medical image alignment. Pattern Recognition. 1999;32:71–86. [Google Scholar]
- 38.Gonzalez RC, Woods RE. Digital image processing. Prentice Hall; New York: 2002. [Google Scholar]
- 39.Hilt H, Wolf D, Aletti P. Preprocessing of control portal images for patient setup verification during the treatments in external radiotherapy. Med Phys. 1999;26:2539–2549. doi: 10.1118/1.598791. [DOI] [PubMed] [Google Scholar]
- 40.Leong J. Implementation of random positioning error in computerized radiation treatment planning systems as a result of fractionation. Phys Med Biol. 1987;32:327–334. doi: 10.1088/0031-9155/32/3/002. [DOI] [PubMed] [Google Scholar]
- 41.Dearnaley D, Khoo V, Norman A, et al. Comparison of radiation side-effects of conformal and conventional radiotherapy in prostate cancer: A randomised trial. Lancet. 1999;353:267–272. doi: 10.1016/S0140-6736(98)05180-0. [DOI] [PubMed] [Google Scholar]
- 42.Jacob R, Hanlon A, Horwitz E, et al. The relationship of increasing radiotherapy dose to reduced distant metastases and mortality in men with prostate cancer. Cancer. 2004;100:538–543. doi: 10.1002/cncr.11927. [DOI] [PubMed] [Google Scholar]