Abstract
Four-dimensional computed tomography (4D-CT) imaging technology has been developed for radiation therapy to provide tumor and organ images at the different breathing phases. In this work, a procedure is proposed for estimating and modeling the respiratory motion field from acquired 4D-CT imaging data and predicting tissue motion at the different breathing phases. The 4D-CT image data consist of series of multislice CT volume segments acquired in ciné mode. A modified optical flow deformable image registration algorithm is used to compute the image motion from the CT segments to a common full volume 3D-CT reference. This reference volume is reconstructed using the acquired 4D-CT data at the end-of-exhalation phase. The segments are optimally aligned to the reference volume according to a proposed a priori alignment procedure. The registration is applied using a multigrid approach and a feature-preserving image downsampling maxfilter to achieve better computational speed and higher registration accuracy. The registration accuracy is about 1.1±0.8 mm for the lung region according to our verification using manually selected landmarks and artificially deformed CT volumes. The estimated motion fields are fitted to two 5D (spatial 3D+tidal volume+airflow rate) motion models: forward model and inverse model. The forward model predicts tissue movements and the inverse model predicts CT density changes as a function of tidal volume and airflow rate. A leave-one-out procedure is used to validate these motion models. The estimated modeling prediction errors are about 0.3 mm for the forward model and 0.4 mm for the inverse model.
Keywords: 4D-CT, deformable image registration, optical flow, respiratory motion modeling
INTRODUCTION
Recent developments in computerized cancer treatment planning techniques such as intensity-modulated radiation therapy (IMRT) require precise and efficient calculation of cumulative dose distributions to irradiate the tumor while avoiding surrounding normal structures.1 It has been long understood that respiratory motion causes significant problems in the treatment planning and irradiation delivery of lung cancer patients. Four-dimensional computed tomography (4D-CT) imaging has been, therefore, proposed to provide valuable information on the shapes and trajectories of tumor and normal structures to guide treatment planning and improve the accuracy of tumor targeting.2, 3
The 4D-CT procedure is usually carried out by acquiring CT data in either ciné or helical modes for multiple times at different couch positions over multiple respiratory cycles, while internal or external respiratory surrogates are measured simultaneously.4, 5, 6, 7 The acquired CT data are then mapped to the measured breathing data, and binned into different respiratory phases and reconstructed into 3D-CT volumes. The current protocol used in our institute (Low et al. in 2003) is to scan the patient using a multislice CT scanner in ciné mode and bin the acquired 3D-CT segments into the nearest neighbor phase using amplitude plus breathing direction (inhale or exhale) sorting.4, 8 There are several other 4D-CT protocols in the literature (Rietzel et al. in 2005, Keall et al. in 2004, and Vedam et al. in 2003), which acquire CT images or sinogram projection data in either ciné or helical modes.2, 6, 7 The acquired 4D-CT data can be sorted by the phase angle or by the respiratory amplitude derived from measured respiratory data, or even by internal organ positions.2, 8, 9, 10, 11
The primary focus of this paper is to use deformable image registration methods to compute the breathing motion from the 4D-CT image data and to construct motion models to approximate the estimated motion field. Our work is related to the following published literature. McClelland et al.12 reported using a B-spline based deformable image registration method to compute respiratory motion and to build a motion model by approximating the computed motion fields. A breath-holding CT was used as the reference volume for registration. The 4D-CT acquisition protocol used is similar to Low et al.4 Sarrut et al.13 reported using an optical flow-based deformable registration algorithm to register two breath-holding lung CT volumes acquired at end-of-exhalation and end-of-inhalation phases. An a priori lung density modification (APLDM) algorithm was used to account for changes in lung density due to respiration. Boldea et al.14 further applied the same method and computed respiratory motion between the 4D-CT images during the entire breathing cycle. This information was used to analyze associated nonlinearities and hysteresis of lung breathing motion. Low et al.15 in 2005 proposed a 5D (3D+tidal volume+air flow rate) respiratory motion model. The objective of this model was to account for breathing hysteresis effect.15, 16
The following shifts have been made in this work from the current literature: (1) a full volume reference image is reconstructed from the acquired 4D-CT image data and used for deformable image registration; (2) deformable image registration is carried out from each individual multislice CT segment to the reference volume. CT segments of the adjacent couch positions are not involved in the registration computations; therefore, potential artifacts caused by slice misalignments are avoided; (3) the Horn–Schunck optical flow method17 is used for motion computation. This is because it was more efficient than either the B-spline method or the demons method according to our experiments; (4) a feature preserving maxfilter is applied for image downsampling in the multigrid approach, and the prior knowledge gathered from the acquired 4D-CT data is further used to optimally align the CT segments to the reference volume. The overall registration accuracy is significantly improved by these two procedures; (5) both manually selected landmark points and an artificially deformed CT volume are used to validate the registration accuracy; therefore, the registration accuracy is validated for both selected feature points and the entire lung volume; (6) in addition to the 5D model suggested by Low et al.,15 an inverse 5D model is proposed to predict the inverse motion fields for different radiation therapy applications.
MATERIALS AND METHODS
4D-CT data acquisition
The 4D-CT data were acquired as 16-slice 3D-CT volume segments using a Philips CT scanner (Brilliance, Philips Medical Systems, Cleveland, OH) operated in the ciné mode. A 3D-CT volume segment, also referred to hereafter as a scan, is 512×512 pixels×16 slices with voxel dimension of 0.9375×0.9375×1.5 mm3. The segments were acquired while the patient breathed freely. Twenty-five segments were acquired at each couch table position and a number of (17 to 19) adjacent couch positions were imaged to contain both lungs and upper abdominal organs. It took the CT scanner 0.42 s to acquire the projection data for a scan (a complete gantry rotation) and 0.25 to 0.3 s idle time between scans. The reconstructed CT volume segments were saved for later offline processing. It took about 18 s to perform 25 scans for one couch table position and about 7 s to transport the couch table to the next couch table position.
During CT data acquisition, the patient spirometry and abdominal circumference (pneumatic bellows) were continuously measured using an independent data acquisition workstation. The measured spirometry data were processed to correct for drifting. The corrected spirometry data were used to linearly calibrate and convert the measured bellow values to tidal volumes for subsequent sorting.18 The converted tidal volume values were offset to 0 ml at the fifth percentile tidal volume, which was used as the reference for the end-of-exhalation phase. The workstation also recorded x-ray on∕off signal from the CT scanner to provide synchronization between CT scans and the bellow data. The starting of an x-ray ON signal was synchronized to the starting of a CT scan in time. Because each CT scan took 0.42 s to finish, the bellow values measured at the centers of the 0.42 s periods were used as the bellow values for the CT scans. An example of the measured bellow values and the CT scanning versus time is shown in Fig. 1. Each set of 25 scans typically covered more than two respiratory cycles.
Reconstructing the reference 3D-CT volume
An amplitude sorting algorithm was used to reconstruct the reference 3D-CT volume from the acquired CT segments at the end-of-exhalation phase. Such a reference volume is used because: (a) the end-of-exhalation phase is typically the most reproducible phase, (b) respiration motion is minimized, and (c) the tissues are at their most superior positions, so that the reference volume would contain all tissues necessary to register CT segments from all couch positions.
Motion estimation
Challenges and solutions for lung CT image registration
There are a few issues that may complicate accurate registration in lung images, especially for the low contrast regions of the lungs. First, motion magnitude of the inferior parts of the lungs can be over 30 mm between the inhaled phases to the exhaled phases.16 Second, the acquired CT segments have only 16 slices. Third, the lung tissue is sliding along the chest wall during breathing. The motion of the lung tissue and motion of the chest wall are discontinuous. Finally, the lung density changes from the inhaled phases to the exhaled phases.13 These density changes are greatest at the inferior parts of the lungs.
Because of these issues and because the end-of-exhalation phase was used as the reference volume, it is generally difficult to register the inhaled inferior lung segments to the reference volume with satisfactory accuracy using most deformable registration algorithms.13 A multigrid approach is typically adopted (to sequentially register the two images from the lower resolutions to the higher resolutions) to account for large motions. However, this approach is affected by the limited number of slices per segments and the image smearing problem (e.g., smearing bronchi in the downsampled images). Registration of these smeared images at lower resolution stages may result in larger registration errors, which may not be recovered in subsequent higher image resolution stages. To mitigate these issues, a midrespiration phase might be selected as the reference instead of end of exhalation. However, it is difficult to reconstruct a slice-by-slice aligned 3D-CT volume of the entire lung for the midrespiration phases given the current 4D-CT acquisition protocols.19 Therefore, the end-of-exhalation reference volume remains the best choice for our purposes.
Our proposed solutions for these issues are the following. The prior knowledge (slice positions of lung apex and diaphragm apex) were derived from the acquired 4D-CT image data and were used to optimally align the CT segments to the reference volume. Thus, the magnitude of the residual misalignment in the superior-inferior (SI) direction is greatly reduced. Nevertheless, the multigrid approach is still needed to account for residual motion in the SI direction and in the axial plane. Therefore, a feature preserving maxfilter was used to replace the commonly used Laplacian pyramid filter20 for image downsampling. The maxfilter (discussed below) preserves small image features, thus, can potentially achieve better registration accuracy across the multiple image resolution stages. The optimal slice alignment procedure and the maxfilter can also help in reducing the problems caused by lung density changes.
The deformable image registration algorithm
Several deformable image registration algorithms have been evaluated in this study. These included the optical flow method by Horn and Schunk,17 the least mean square (LMS) optical flow method by Lucas and Kanada,21 the level set algorithm by Vemuri et al.,22 and the demons algorithm by Thirion.23 The evaluation results based on 34 manually identified landmarks inside the lungs are summarized in Table 1. This dataset was provided by Princess Margaret Hospital (PMH) and had been previously used in a deformable registration accuracy study as described in Sec. 2D4.24 Accordingly, the Horn and Schunck’s algorithm was chosen due to its computational efficiency and comparable accuracy to other algorithms.
Table 1.
Algorithms | Registration error (mm) | Computation time (s) | ||
---|---|---|---|---|
Mean | Standard deviation | Maximum | ||
Horn–Schunck optical flow | 1.57 | 1.46 | 7.19 | 2036 |
Demons | 1.96 | 1.26 | 4.77 | 2906 |
Lucas–Kanade optical flow | 1.55 | 1.53 | 8.18 | 3843 |
Levelset | 3.49 | 2.06 | 8.84 | 3059 |
Horn–Schunck optical flow (without CT number limiting) | 1.88 | 1.47 | 6.89 | 2040 |
Preprocessing
To improve the computation speed, the patient’s body was segmented using a simple thresholding method and morphological operators. Voxels outside the patient’s body were set to −1000 (CT number of air). All the CT segments were cropped to exclude most of the zero value space.
In order to improve the overall accuracy of image registration, the maximal CT number was limited to 100 (i.e., if a CT number of a voxel was greater than 100, it would be set to 100). The image intensity was then normalized to [0,1] range as required by the Horn–Schunck optical flow algorithm. Truncating larger CT numbers (bony structures) reduces the range of the image intensity before normalization. This resulted in a relative increase of the image gradient values of lower contrast regions (soft tissues) compared to higher contrast regions (bony structures). Consequently, the overall registration accuracy was improved. Table 1 shows preliminary results comparing the registration accuracies achieved with and without such a CT number truncation step.
A priori alignment (APA)
Initially, CT segments were aligned to the reference volume according to their couch table position. A major problem with this simple approach was that the segments of the inferior lung could not be registered accurately because the image motion magnitude (with respect to the reference 3D-CT volume) in these areas was too large, even with a multigrid approach in place. Therefore, slice position parameters derived from the acquired image data were used to achieve an optimal alignment so that the CT segment could be registered more easily and accurately. This procedure is denoted as a priori alignment (APA).
In the APA procedure, the following parameters were derived from the acquired 4D-CT images: the couch table positions of the lung apex and the diaphragm apex for both the end-of-inhalation phase and the end-of-exhalation phase, the tidal volumes of the inhaled phase, the exhaled phase, and every CT segment. The lung was assumed to expand in the SI direction from the exhaled phase to the inhaled phase as a simple linear function of tidal volume. An approximate SI position z where the middle of the CT segment was in the reference volume could then be computed according to simple geometric rules as
(1) |
where zA is the lung apex position (same for both the exhaled phase and the inhaled phase), zE and zI are the positions of the diaphragm apex in the exhaled phase and the inhaled phase, zS is the couch position of the middle of the CT segment, VE, VI, and VS are the tidal volumes of the exhaled phase, the inhaled phase, and the CT segment, respectively.
The CT segment was then aligned to the reference volume according to the computed z value. The APA procedure was similar to a SI direction translation-only-rigid-registration procedure. It reduced the SI direction relative motion between the CT segments to the reference volume. Because the lung respiratory motion in the SI motion was usually the largest in magnitude, APA reduced the overall motion magnitude and, thus, made the deformable registration easier. On the other hand, APA also helped to reduce the problems caused by the lung-chest wall sliding issue because the CT segments were aligned optimally, regardless of the position of the chest wall tissues.
Reference volume cropping
After applying the APA procedure, the relative maximal SI motion was reduced from 30 to 9 mm according to our estimation. Therefore, only a subsection of the reference volume extending to a 9 mm range of image motion beyond the target 16-slice segment was needed for the deformable registration computation. To ensure adequate coverage, the reference volume subsection was extended from the target segment by 16 slices, equal to 24 mm, in both superior and inferior directions, resulting in a 48 (3×16) slice volume. Using a subsection of the reference volume instead of the full reference volume for computation helped to improve the overall computational speed.
The multigrid approach and the multiple pass approach
A common limitation of the optical flow methods is that only small motions can be recovered as implied by the Taylor expansion. To allow registration of large motions, the multigrid approach22 was used to perform the optical flow computation sequentially in multiple image resolution stages, from low resolution to high resolution. At each image resolution stage, the multiple pass approach was used25 to perform the computation in multiple passes of a smaller number of iterations. This also allows better and faster computation of the motion fields.
Maxfilter for feature preserving downsampling
A maxfilter was used in our work for image downsampling operations in the multigrid approach instead of the commonly used Laplacian pyramid filter23 in order to preserve small image features of the lung (e.g., bronchi and nodes). These features appear brighter than the surrounding air content in the lung. When downsampling the lung CT images with the Laplacian pyramid filter or the Gaussian low-pass filter, smaller bronchi and nodes are often smeared by the darker air content background if the images are half-sampled for two or more times. This problem is illustrated in Figs. 2b, 2c, 2d. Loss of these image features in the downsampled images often causes larger registration errors. A comparison of registration accuracy is shown in Table 2 for using the maxfilter and using the Laplacian pyramid filter.
Table 2.
Down-sampling filter | Laplacian pyramid filter | Maxfilter | ||
---|---|---|---|---|
Inferior | Middle | Inferior | Middle | |
Registration error (mm) | 5.8±3.8 | 4.5±2.8 | 2.2±1.8 | 2.3±2.5 |
Other considerations
The reference 3D-CT volume is larger than the CT segments and encloses them completely; therefore, the reference volume was chosen as the moving image and the CT segments as the fixed images for deformable registration. One 16-slice motion vector field was computed for every CT segment. The resulting motion fields were defined on the voxels of the CT segments.
Registration accuracy validation
Validation using artificially deformed CT volume
The reference 3D-CT volume of one patient was artificially deformed with the following synthesized motion fields that simulate regular breathing motion:
(2) |
(3) |
(4) |
where XXY is the spatial vector on the X-Y axial plane, CL and CR are the manually determined centers of the left lung and the right lung in the X-Y plane, z1∕4 is the z coordinate of the top 1∕4 lung, G is a 2D Gaussian low-pass filter with sigma=40 pixel, V is the 3D motion vector, VXY is the projection of V into the X-Y plane, VZ is the projection of V on the z axis.
Equation 2 was computed for every pixel on the X-Y plane, excluding the pixels outside the patient’s body. The equation consists of two parts. The first part mainly determines the direction of with the 2D Gaussian filter to ensure the continuity of the directions. Both parts together determine the magnitude of . Equation 3 was computed for the entire X-Y plane after Eq. 2. It rescales the magnitude of to a maximum of 5 pixels. The maximal Z motion computed by Eq. 4 is 20 pixels (equal to 30 mm). The values of 5 pixels and 20 pixels were determined empirically. Figure 3 shows a coronal view and a transverse view of the original reference CT volume, the deformed CT volume, and the difference overlaid with the motion field vectors.
The dimensions of both the reference volume and the generated new volume were 274×373×192. The generated volume was cut into 12 16-slice segments to emulate the actual 4D-CT data. The segments were registered to the original reference volume individually. The computed motion fields for every segment were then compared to the synthesized motion fields to determine the registration errors for the lung region. The advantage of this validation is the ability to measure registration accuracy for the entire lung and not only for corresponding feature points.
Validation of the maxfilter for image downsampling
The effectiveness of the maxfilter was validated by using the artificially deformed CT volume. Two generated 16-slice segments for the middle lung and the inferior lung were selected. The two segments were registered to the reference volume by using the maxfilter and the Laplacian pyramid filter. The estimated motion fields were compared to the ground truth motion fields.
Validation using patient 4D-CT data
For each patient, validation was carried out by using the reference volume and multiple CT segments randomly selected from multiple and different couch table positions. The CT segments were selected from the middle to the inferior lung of the patients with higher corresponding tidal volumes so that the image motions were at their largest values with respect to the reference volume. Corresponding landmarks were manually selected at visually tractable lung bifurcation points in the reference volume and in the CT segments.
Validation using full volume 4D-CT images
Another 4D-CT lung image dataset was used for independent validations. The image dataset was provided by PMH, Toronto, Canada.24 The dataset contained two 512×512×152 4D-CT volumes from the inhaled and the exhaled phases. Voxel dimension was 0.9766×0.9766×2.5 mm3. There were 38 landmarks defined, 17 for the left lung, 17 for the right lung, two for the aorta, and two for the heart. The same registration parameters and settings were applied in this validation as the parameters and settings used to register the individual CT segment to the reference volume. In this way, the validation results were comparable to the CT segment to reference volume registration validation results.
Inverse motion modeling
The 5D inverse motion field prediction model
The following 5D inverse motion model was proposed to approximate the inverse motion fields that were directly computed by the deformable registration method:
(5) |
where T is the tidal volume (converted from the measured bellow values), F is the flow rate (computed from T by differentiation), X is the imaging space position vector, defined for the imaging device, the parameters a, b, and c are per voxel fitting constant vectors, and MR is the inverse motion vector similar to the motion fields computed for each CT segment. It is defined on the grid of CT segments and represents how each voxel in a CT segment maps to a position in the reference volume.
This 5D inverse motion model is based on the 5D motion model concept proposed by Low et al.15 McClelland et al. had also proposed a similar model at the AAPM 2007 conference.26 In this model, the image motion is determined by two variables, the tidal volume and the air flow rate. Time is not a variable, unlike other linear or spline-based temporal fitting models. This 5D model is also able to predict motion hysteresis.15, 16 The constant parameters a, b, and c were estimated voxel by voxel using the computed motion fields for the 25 scans of every couch table position by a LMS regression fitting algorithm.
Evaluation of the 5D inverse motion prediction model
To evaluate the accuracy of the 5D model, modelling error (ME), target modeling error (TME), and model prediction error (MPE) were computed.
ME is defined as the error between the predicted motion field and the motion fields used to estimate the fitting parameters. For each CT segment, the predicted motion fields were computed for the same tidal volume and flow rate. ME was then computed between the predicted motion fields and the motion fields computed by deformable image registration.
TME is defined as the error from the transformed reference volume landmark points to the manually identified corresponding landmark points in the CT segments. The reference volume landmark points were transformed using the motion fields predicted for the same tidal volumes and flow rates of the CT segments.
MPE is defined as the error between the predicted motion fields to the known motion field while the known motion field is not used to estimate the modeling parameters. MPE was computed using a leave-one-out procedure. For each of the 25 segments at a couch position, one segment was left out and the modeling parameters were estimated using the motion fields of the other 24 segments. The motion field for the tidal volume and the airflow rate of the left-out segment was predicted using the estimated modeling parameters, and the errors were computed between the predicted motion fields to the motion field of the left-out segment. This procedure was then repeated for all 25 segments for the couch position.
Evaluation of the discontinuity at the segment boundaries
Because image registrations for different CT segments are performed separately, the computed motion fields of CT segments with adjacent couch positions are typically discontinuous at superior and inferior boundaries of these segments. On the other hand, adjacent CT segments are acquired at different time, at different tidal volume, and air flow rate, therefore, the content of the CT segments may partially overlap with each other and motion fields computed of adjacent CT segments need not be continuous. Furthermore, the motion model parameters, which are computed voxel by voxel, are usually continuous within the 16-slice segments because the motion fields are continuous within the 16 slices; however, the model parameters could have discontinuities at the segment boundaries. Please be advised that discontinuities in such cases are not equivalent to having errors because continuous motion fields on discrete voxels also appear discontinuous.
To analyze the boundary discontinuity of the motion field predicted by the model, we computed MR for inhaled lung using Eq. 5, with T=90th percentile of the measured tidal volume and F=0. The inhaled phase is used because the segments will have the largest motion with respect to the reference volume, and subsequently will have the largest boundary discontinuities. MR was computed for every pair of adjacent slices (the last slice of a segment, and the first slice of the segment at the next couch position). The absolute magnitude difference of MR was then computed for each slice pair and its descriptive statistics (mean, standard deviation, and the maximum values) were recorded.
Forward motion modeling
The 5D motion model proposed by Low et al.15 is
(6) |
where Xr is the image position vector in the reference volume, MF is the forward motion vector, α, β, and γ are the per voxel fitting constant vectors.
The forward motion model is similar to the inverse motion model in Eq. 5. The difference is that Xr is defined on the patient position (the reference volume), and MF indicates where voxels in the reference volume are moving to.
Because the motion fields computed by the optical flow registration algorithm27 are the inverse motion fields, these motion fields have to be inverted to obtain the forward motion fields. A motion field is invertible only if the motion field is well behaved, i.e., smooth with no folding.
The spm_invdef code from Statistical Parametric Mapping28 version 5 package was used for motion field inversion. The method is based on the algorithm by Ashburner et al.29 In this study, the voxels in a 16-slice CT segment were from the voxels in the reference volume outside the 16 slice, especially in the SI direction. In order to accurately invert such a 16-slice motion field, it was extended in both the positive and negative SI directions for the number of slices it reaches. The motion fields of the most superior and inferior boundary slices were used to fill the values for the extended slices. The inversion was then performed on the slice-extended motion field. The resulting forward motion field was considered valid at any voxel if its field vector was pointing into the 16-slice CT segment, otherwise it is invalid. The accuracy of the motion field inversion computation was good. This was verified by computing the inverted motion field for the 16-slice motion fields, then inverting the slice-extended inverted motion fields back, and comparing with the 16-slice motion fields. The estimated error was 0.03±0.05 pixel.
In a similar fashion to the inverse 5D motion model, the computed forward motion fields were used to estimate the modeling parameters α, β, and γ and their accuracy was evaluated by computing ME, TME, MPE and the boundary discontinuities metrics.
Implementation
All the algorithms were implemented in MATLAB (version 7, Mathworks) on a multiprocessor workstation, running a RedHat 4.0 Linux operating system with 8 AMD 1.8 GHz dual core Opteron 865 processors and 16 GB RAM. However, only one thread∕core was used at a time.
To register a CT segment to a reference volume, three multigrid stages were used in the computation. Four, three, and two passes were used for multigrid stage 1, 2, and 3 (stage 3 is the final high resolution stage) with 30, 20, and 10 iterations per pass, respectively. The regularization term in the Horn and Schunck’s algorithm was set to 0.2. After every pass in a stage, and after every stage, a Gaussian low pass filter with sigma of 0.5 pixels was applied to the resulting motion field. A trilinear filter was used for all volumetric interpolations.
RESULTS
The results of the preliminary comparison among four deformable image registration algorithms are listed in Table 1. All registrations were done on the same image data set, with the same or similar parameters and settings. 4D-CT lung images with manually selected landmarks were used for comparison. These preliminary results show that the Horn–Schunck optical flow algorithm is 60 to 90% faster than the other three algorithms and its accuracy is better than or at least similar to the other tested algorithms. Table 1 also shows the comparison of the results with and without using the CT number truncation procedure. One can see that using CT number truncation as a preprocessing step would reduce the overall registration error from 1.88 to 1.57 mm.
Table 2 shows a comparison of the mean and the standard deviation of registration errors between using the maxfilter and the Laplacian pyramid filter. Two CT segments from the artificially deformed CT (Sect. 2D1) were selected for this comparison representing the middle and the inferior lung regions. These results indicate that using the maxfilter for image downsampling in the multigrid approach significantly improves the overall registration accuracy.
Examples of registration results achieved using the Laplacian pyramid filter and using the maxfilter are shown in Fig. 4. These results suggest that using the maxfilter would lead to better accuracy for lower contrast image regions.
Registration accuracy test results using the artificially deformed deformed CT images are summarized in Table 3. The registration errors were 1.55 to 3.23 mm without the APA procedure and were reduced to 0.92 to 1.03 mm using the APA procedure, reflecting significant improvement in registration accuracy. Note that the registration errors in this case were calculated for lung tissues only. Body and chest wall were excluded.
Table 3.
Lung position | Maximum axial motion (mm) | Without APA | With APA | ||
---|---|---|---|---|---|
SI motion (mm) | Registration errors (mm) | Maximum SI motion after APA (mm) | Registration errors (mm) | ||
Superior | 4.68 | −9.2 to 0.4 | 1.55±1.11 | 3.14 | 0.92±0.54 |
Middle | 4.68 | 0.6 to 13.6 | 2.48±2.32 | 2.01 | 0.77±0.42 |
Inferior | 4.68 | 13.8 to 30 | 3.23±3.78 | 4.97 | 1.03±1.08 |
Entire lung | 4.68 | −9.2 to 30 | 2.55±2.63 | 4.97 | 0.92±0.73 |
Examples of the APA procedure application are shown in Fig. 5. Significant improvement in registration accuracy is noted, especially in regions bounded by the dashed lines.
The registration accuracy test results on the manually selected landmarks are summarized in Table 4. The test was carried out according to the procedure outlined in Sect. 2D3 for images of three patients. The mean registration error was about 1.2 mm, which is smaller than the 1.5 mm voxel size in the SI direction. We further applied the test to the full 4D-CT images according to the description in Sect. 2D4. The mean registration error in this case was about 1.5 mm. This is again below the 2.5 mm mark corresponding to the voxel size in the SI direction of this dataset.
Table 4.
Landmark pairs | Mean (mm) | Standard deviation (mm) | Maximum (mm) | |
---|---|---|---|---|
Patient 1 | 137 | 0.98 | 0.47 | 2.78 |
Patient 2 | 43 | 1.47 | 0.95 | 3.72 |
Patient 3 | 43 | 0.96 | 0.62 | 3.08 |
Full volume 4D-CT | 34 | 1.48 | 1.34 | 5.87 |
Figure 6 shows LMS parameter fitting examples of the inverse 5D motion model. The fitting was performed for one voxel in the left lung of a selected patient. Figure 7 shows parameter fitting examples of the forward 5D motion model at the same voxel. It is noticed that the parameter fitting process is working well for motions in all directions for both the inverse model and the forward model.
The 5D motion model validation results are listed in Table 5. For the inverse 5D model, the averaged values for TME, ME, and MPE were about 1.2, 0.5, and 0.5 mm. For the forward 5D model, the average values for TME, ME, and MPE were about 0.4, 0.3, and 0.2 mm. It is noticed that the TME values were typically larger than ME values because: (1) there might have been some inconsistencies during the landmark manual selecting procedure, (2) the ME values were averaged for all the CT segments of the patients, including the CT segments without significant motion, whereas the landmarks were always selected at the points that have large motions. The MPE values demonstrate that both 5D models are able to predict motion with averaged errors of about 0.5 and 0.2 mm (magnitude of error in 3D), respectively. For both 5D models, the mean value of boundary discontinuity was in the range of 0.5 to 1.5 mm. This indicates that the predicted motion field was generally continuous at the segment boundaries. By contrasting the validation results of both 5D models, one can notice that the forward 5D model worked generally better than the reverse 5D model. This issue is further discussed in Sect. 4F.
Table 5.
Patient | Inverse 5D model | Forward 5D model | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
LP | TME (mm) | ME (mm) | MPE (mm) | BD (mm) | LP | TME (mm) | ME (mm) | MPE (mm) | BD (mm) | |
1 | 131 | 1.02±0.52 (2.6) | 0.40±0.31 | 0.40±0.37 | 1.0±0.8 (6.1) | 2025 | 0.52±0.32 (2.8) | 0.19±0.36 | 0.24±0.39 | 0.9±0.8 (6.7) |
2 | 37 | 1.71±1.04 (4.4) | 0.38±0.29 | 0.27±0.35 | 0.7±0.6 (4.5) | 929 | 0.59±0.36 (3.0) | 0.25±0.42 | 0.27±0.48 | 0.7±0.5 (5.1) |
3 | 42 | 1.04±0.49 (2.3) | 0.77±0.63 | 0.55±0.82 | 1.5±0.9 (6.1) | 1198 | 0.67±0.61 (4.9) | 0.37±0.62 | 0.47±0.77 | 1.4±1.1 (5.2) |
In Fig. 8, we show representative examples of the deformed reference volume using the predicted inverse motion fields to generate 3D-CT volumes at selected breathing phases. The generated 3D-CT volumes in row (b) are compared to the reference volume and the differences are shown in row (c). The generated 3D-CT volumes are also compared to the 3D-CT volume reconstructed by the amplitude sorting method as shown in row (a). The differences are presented in row (d). It can be seen that the differences shown in row (d) are small, especially for the lung tissues. This is a qualitative validation of the deformable registration accuracy and the modeling accuracy of the inverse 5D motion model.
Examples of the estimated fitting parameters of the 5D forward motion model for a selected patient are shown in Fig. 9. Further information can be quantitatively derived from these parameter maps as follows. Given the tidal volume difference between the exhaled phase to the inhaled phase, the respiratory motion between the two phases can be estimated from the α parameter maps (Fig. 9, column 1). For instance, if the tidal volume difference was 0.5 L, and α in SI direction for inferior lung was about 40 mm∕L (Fig. 9, column 1), then the SI motion of an inferior lung can be estimated to be about 20 mm. Similarly, if the air flow rates were known, the β parameter maps could be used to estimate the motion hysteresis. Please be advised that the γ parameter maps (Fig. 9, column 3) are expected to be 0 only if the reference volume is reconstructed from CT segments with tidal volume=0 and air flow rate=0 for all couch table positions. The γ parameter maps are not 0 in this example because such a condition was not met under the current 4D acquisition protocol.
The average computation time for the most critical procedures is summarized in Table 6. Computation of the fitting parameters for the forward 5D model takes a longer time than the parameter fitting computation for the inverse 5D model. This is because the forward motion fields from the segments of the adjacent couch positions are required in the forward 5D model parameter fitting procedure.
Table 6.
Task | Average computation time (s) |
---|---|
Deformable registration of a scan to the reference volume | 170 |
Motion estimation for 25 scans | 4250=170×25 |
Invert a 304×448×16-slice motion field | 8.2 |
Compute the fitting parameters for the inverse 5D model for one couch position | 140 |
Compute the fitting parameters for the forward 5D model for one couch position | 230 |
DISCUSSION
Deformable registration accuracy
The deformable image registration accuracy in our implementation is generally good after applying the a priori alignment (APA) procedure for slice alignment and the maxfilter for image downsampling. The absolute registration error is about 1 to 2 mm, less than the voxel vertical size. However, sometimes relatively larger registration errors have been noticed for certain inferior lung CT segments. This is due to the fact that the lung motion in the SI direction is not uniform transversely. Even though the APA procedure can successfully align most parts of the lung vertically for a given CT segment, it may sometimes fail to align other parts. According to our observation, the multigrid approach by itself is not sufficient to capture large motions for unaligned parts of the lung. For these reasons, a better alignment procedure (or rigid registration procedure) should probably be developed in order to achieve better alignment from the CT segments to the reference volume.
Another problem associated with APA is that it might decrease the registration accuracy for the chest wall. Practically, this error could be tolerated if respiratory motion of lung is the purpose of the study.
The problem of lung density changes due to respiration is generally overcome by applying the APA procedure and the maxfilter for image downsampling. There are other alternative solutions that could be incorporated into our framework, such as the a priori lung density modification (APLDM) method reported by Sarrut et al.,13 and mass conservation approaches for optical flow algorithms, reported by Wildes et al.,30 and by Bereziat et al..31
Slice misalignments in the reference volumes
The segments in the reference volumes are usually aligned fairly well because the reference volume was reconstructed for the end-of-exhalation phase, even though minor slice misalignments may still be present at segment boundaries because of the temporal discontinuities in the 4D acquisition protocol. The minor slice misalignments have not caused major registration errors in this study. McClelland et al. avoided the possible slice misalignment problem by using a separately acquired breath-holding helical CT scan as the reference volume.12 In our future work, we plan to acquire breath-holding CT and compare against this approach.
Quality of the respiration data
The measured bellows data was used in this study as the respiration surrogate. It was assumed in our method that the respiratory motion could be fully reflected by these measured respiration data. However, the quality of the measured bellows data could be affected by many artifacts, including nonrespiratory patient motion and highly irregular breathing patterns (e.g., coughing). Our method can tolerate minor artifacts in the measured respiration surrogate data. In fact, the deformable image registration step is less affected by such artifacts and the model parameter fitting step can smooth out these minor artifacts through the LMS fitting procedure. The problems with the measured bellows data may be detected by checking the curve shapes and the histories of the bellow values. Moreover, problematic CT segments can be excluded from motion estimation and modeling parameters computation.
Motion computation efficiency
Currently, it takes about 3 min to register one CT segment to the reference volume. The total computation time is still relatively long. There are several possible ways to improve the computation speed: (1) the number of slices for the cropped reference volume could be optimally reduced according to the breathing phase (inhalation versus exhalation) of the segment. Such a step can improve the speed by a factor of 2; (2) the image motion could be computed on the 1∕2 X-Y resolution and full Z resolution, and this step can improve the speed by a factor of 4. We are planning to validate the accuracy of such registration schemes versus the full resolution registration results in our future work, (3) the software can be programmed in C∕C++ instead of MTLAB. Graphic processing unit (GPU) acceleration technology and multiprocessors parallel computation technologies can also be utilized.
Applications of the inverse motion model
At a given tidal volume T and a flow rate F, the model in Eq. 5 could be applied to predict the inverse motion fields. These inverse motion fields can then be used to deform the information defined on the reference volume onto the target CT volume such as image intensity, dose, contour information, etc. Moreover, this model could be applied to generate 4D-CT images for any target breathing phase. To do so, the inverse motion fields for the target breathing phase are computed for every cough position to form a full volume motion field. This full volume motion field could be smoothed by a Gaussian low pass filter in order to reduce discontinuities at the 16-slice segment boundaries. The reference volume is then deformed by the smoothed full volume motion field to generate the target 4D-CT image. Compared to the 4D-CT image reconstructed by the amplitude or phase angle sorting algorithms, the 4D-CT images generated in this way are usually more continuous, with less slice misalignment problems27 as shown in Fig. 8 [(a1) to (a3)].
Another potential application is to deform the structures defined on the treatment planning CT to any breathing phase. If the structures were defined at the end-of-exhalation reference volume, then for every structure, the corresponding binary mask volume could be generated. This binary mask volume is of the same dimensions as the reference volume, and has a value of 1 inside the structure and a value of 0 outside the structure. The binary mask volume is deformed using the predicted motion fields for a selected target breathing phase. The deformed binary mask volume can then be converted into slice by slice contours corresponding to the moved structure of the target breathing phase. If these structures were defined on another treatment planning 3D-CT volume instead of the reference volume, then structures need to be deformed into the reference volume before they can be used.
The model predicts the 4D motion using three parameter vector maps a, b, and c. The parameter maps could be further parametrized. For instance, the parameters can be approximated by certain polynomial equations of X. Thus, the model will be more compact and efficient to use. This will be the subject of future research.
Relation and differences between the two 5D motion models
The forward motion model predicts the tissue and tumor motion. However, it is not suitable to deform the entire reference volume because such deformation procedure requires that the inverse motion fields be defined on the target coordinates. The forward motion fields could be inverted again to get such inverse motion fields, but this requires more computation and may introduce more errors. The inverse 5D model is recommended for such applications that require inverse motion fields.
Our model evaluation results show that the forward 5D model is more accurate than the reverse 5D model in almost every measurement. This is not intuitive because the forward 5D model needs an extra motion field inversion step. An extra step would usually result in more uncertainties rather than less uncertainties. However, we think that this difference of the modeling accuracy could be attributed to the difference in these motion field definitions. In both 5D models, the model parameters are computed voxel-by-voxel by fitting the 25 motion vectors. In the forward 5D model, the 25 motion vectors at any voxel position define how the tissue piece in the reference volume moves. In contrast, in the reverse 5D model, the 25 motion vectors define how the voxels at the position of the 25 CT segments move; however, the 25 voxels at the same spatial position represent different tissue pieces for the 25 CT segments. For example, a piece of tissue in the exhaled inferior lung could move over 30 mm further in the inferior direction, while at the same voxel position a tissue piece of inhaled lung could be 20 to 30 mm away in the superior direction. Because the forward motion vectors at any voxel position describe the motion of the same piece of tissue with respect to different tidal volume T and flow rate F, the motion vectors tend to be more correlated. The 25 backward motion vectors at any voxel position describe the motion of the 25 different pieces of tissue, therefore, the motion vectors are less correlated. Therefore, the forward motion vectors could provide better fit to T and F compared to the backward motion vectors and would result in less ME, TME, and MPE error values.
Different 4D-CT acquisition protocols
The 4D-CT data used in this paper were acquired as multislice volume segments according to the protocol developed by Low et al.4 The acquisition of the CT volume segments was discontinuous in time and consequently the computed motion fields for every couch table position were also discontinuous in time. The modeling parameters were computed by LMS fitting over 25 motion fields and temporal discontinuity was smoothed by the LMS fitting process. In general, our motion computation and modeling method could be applied to other 4D-CT acquisition protocols. If the acquired 4D-CT data are in sinogram domain, then they need to be reconstructed into 3D-CT images before deformable image registration is applied.
CONCLUSION
In this paper, a new method based on the optical flow deformable image registration algorithm has been presented to compute image motion from acquired 4D-CT data (ciné mode multislice volume segments). The motion fields were computed for each acquired CT segment with respect to a reference volume, which was reconstructed by an amplitude sorting method at the end-of-exhalation phase. To achieve higher image registration accuracy, a priori alignment (APA) procedure was applied to optimally align the CT segments to the reference volume and a feature preserving maxfilter was used instead of the commonly used Laplacian pyramid filter for image downsampling. Two 5D motion models were proposed to approximate the computed image motion in order to predict the motion at arbitrary breathing phases. The inverse motion prediction model was able to directly predict the inverse motion fields. The forward 5D motion model was able to predict tissue movements of the reference volume. Both motion models are potentially useful in image-guided radiotherapy applications.
ACKNOWLEDGMENTS
This research is partially supported by a grant from the American Cancer Society No. IRG-58-010-50, by NIH No. R01 CA 96679, and NIH No. R01 CA116712. Part of this work was presented at the Laughlin Science Council Research Symposium, AAPM, 2008.
References
- Webb S., Intensity-Modulated Radiation Therapy, 1st ed. (Taylor & Francis, London, 2001). [Google Scholar]
- Rietzel E., Pan T., and Chen G. T. Y., “Four-dimensional computed tomography: Image formation and clinical protocol,” Med. Phys. 10.1118/1.1869852 32, 874–889 (2005). [DOI] [PubMed] [Google Scholar]
- Keall P., “4-dimensional computed tomography imaging and treatment planning,” Semin. Radiat. Oncol. 14, 81–90 (2004). [DOI] [PubMed] [Google Scholar]
- Low D. A., Nystrom M., Kalinin E., Parikh P., Dempsey J. F., Bradley J. D., Mutic S., Wahab S. H., Islam T., Christensen G., Politte D. G., and Whiting B. R., “A method for the reconstruction of four-dimensional synchronized CT scans acquired during free breathing,” Med. Phys. 10.1118/1.1576230 30, 1254–1263 (2003). [DOI] [PubMed] [Google Scholar]
- Ford E. C., Mageras G. S., Yorke E., and Ling C. C., “Respiration-correlated spiral CT: A method of measuring respiratory-induced anatomic motion for radiation treatment planning,” Med. Phys. 10.1118/1.1531177 30, 88–97 (2003). [DOI] [PubMed] [Google Scholar]
- Keall P. J., Starkschall G., Shukla H., Forster K. M., Ortiz V., Stevens C. W., Vedam S. S., George R., Guerrero T., and Mohan R., “Acquiring 4D thoracic CT scans using a multislice helical method,” Phys. Med. Biol. 10.1088/0031-9155/49/10/015 49, 2053–2067 (2004). [DOI] [PubMed] [Google Scholar]
- Vedam S. S., Keall P. J., Kini V. R., Mostafavi H., Shukla H. P., and Mohan R., “Acquiring a four-dimensional computed tomography dataset using an external respiratory signal,” Phys. Med. Biol. 10.1088/0031-9155/48/1/304 48, 45–62 (2003). [DOI] [PubMed] [Google Scholar]
- Lu W., Parikh P. J., Hubenschmidt J. P., Bradley J. D., and Low D. A., “A comparison between amplitude sorting and phase-angle sorting using external respiratory measurement for 4D CT,” Med. Phys. 10.1118/1.2219772 33, 2964–2974 (2006). [DOI] [PubMed] [Google Scholar]
- Wink N., Panknin C., and Solberg T. D., “Phase versus amplitude sorting of 4D-CT data,” J. Appl. Clin. Med. Phys. 10.1120/jacmp.2027.25373 7, 77–85 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pan T., “Comparison of helical and cine acquisitions for 4D-CT imaging with multislice CT,” Med. Phys. 10.1118/1.1855013 32, 627–634 (2005). [DOI] [PubMed] [Google Scholar]
- Pan T., Lee T.-Y., Rietzel E., and Chen G. T. Y., “4D-CT imaging of a volume influenced by respiratory motion on multi-slice CT,” Med. Phys. 10.1118/1.1639993 31, 333–340 (2004). [DOI] [PubMed] [Google Scholar]
- McClelland J. R., Blackall J. M., Tarte S., Chandler A. C., Hughes S., Ahmad S., Landau D. B., and Hawkes D. J., “A continuous 4D motion model from multiple respiratory cycles for use in lung radiotherapy,” Med. Phys. 10.1118/1.2222079 33, 3348–3358 (2006). [DOI] [PubMed] [Google Scholar]
- Sarrut D., Boldea V., Miguet S., and Ginestet C., “Simulation of four-dimensional CT images from deformable registration between inhale and exhale breath-hold CT scans,” Med. Phys. 10.1118/1.2161409 33, 605–617 (2006). [DOI] [PubMed] [Google Scholar]
- Boldea V., Sharp G. C., Jiang S. B., and Sarrut D., “4D-CT lung motion estimation with deformable registration: Quantification of motion nonlinearity and hysteresis,” Med. Phys. 10.1118/1.2839103 35, 1008–1018 (2008). [DOI] [PubMed] [Google Scholar]
- Low D. A., Parikh P. J., Lu W., Dempsey J. F., Wahab S. H., Hubenschmidt J. P., Nystrom M. M., Handoko M., and Bradley J. D., “Novel breathing motion model for radiotherapy,” Int. J. Radiat. Oncol., Biol., Phys. 10.1016/j.ijrobp.2005.03.070 63, 921–929 (2005). [DOI] [PubMed] [Google Scholar]
- Seppenwoolde Y., Shirato H., Kitamura K., Shimizu S., van Herk M., Lebesque J. V., and Miyasaka K., “Precise and real-time measurement of 3D tumor motion in lung due to breathing and heartbeat, measured during radiotherapy,” Int. J. Radiat. Oncol., Biol., Phys. 10.1016/S0360-3016(02)02803-1 53, 822–834 (2002). [DOI] [PubMed] [Google Scholar]
- Horn B. K. P. and Schunck B. G., “Determining optical flow,” Artif. Intell. 10.1016/0004-3702(81)90024-2 17, 185–203 (1981). [DOI] [Google Scholar]
- Lu W., Low D. A., Parikh P. J., Nystrom M. M., Naqa I. M. E., Wahab S. H., Handoko M., Fooshee D., and Bradley J. D., “Comparison of spirometry and abdominal height as four-dimensional computed tomography metrics in lung,” Med. Phys. 10.1118/1.1935776 32, 2351–2357 (2005). [DOI] [PubMed] [Google Scholar]
- Wolthaus J. W. H., Schneider C., Sonke J.-J., van Herk M., Belderbos J. S. A., Rossi M. M. G., Lebesque J. V., and Damen E. M. F., “Mid-ventilation CT scan construction from four-dimensional respiration-correlated CT scans for radiotherapy planning of lung cancer patients,” Int. J. Radiat. Oncol., Biol., Phys. 10.1016/j.ijrobp.2006.04.031 65, 1560–1571 (2006). [DOI] [PubMed] [Google Scholar]
- Burt P. J. and Adelson E. H., “The Laplacian pyramid as a compact image code,” IEEE Trans. Commun. 10.1109/TCOM.1983.1095851 31, 532–540 (1983). [DOI] [Google Scholar]
- Lucas B. D. and Kanade T., “An iterative image registration technique with an application to stereo vision,” Proceedings of the 7th International Joint Conference on Artificial Intelligence, pp. 674–679 (1981).
- Vemuri B. C., Ye J., Chen Y., and Leonard C. M., “Image registration via level-set motion: Applications to atlas-based segmentation,” Med. Image Anal. 10.1016/S1361-8415(02)00063-4 7, 1–20 (2003). [DOI] [PubMed] [Google Scholar]
- Thirion J. P., “Image matching as a diffusion process: An analogy with Maxwell’s demons,” Med. Image Anal. 10.1016/S1361-8415(98)80022-4 2, 243–260 (1998). [DOI] [PubMed] [Google Scholar]
- Brock K. K., “A multi-institution deformable registration accuracy study,” Int. J. Radiat. Oncol., Biol., Phys. 69, S44–S44 (2007). [DOI] [PubMed] [Google Scholar]
- Iu S.-L. and Lin Y.-T., “Re-examining the optical flow constraint. A new optical flow algorithm with outlier rejection,” International Conference on Image Processing, Vol. 3, pp. 727–731 (1999).
- McClelland J., Blackall J., Tarte S., Hughes S., and Hawkes D., “MO-D-L100J-04: Non-rigid registration based respiratory motion models of the lung using two parameters,” Med. Phys. 34, 2516 (2007). [Google Scholar]
- Ehrhardt J., Werner R., Saring D., Frenzel T., Lu W., Low D., and Handels H., “An optical flow based method for improved reconstruction of 4D CT data sets acquired during free breathing,” Med. Phys. 10.1118/1.2431245 34, 711–721 (2007). [DOI] [PubMed] [Google Scholar]
- Friston K. J., Statistical Parametric Mapping: The Analysis of Functional Brain Images, 1st ed. (Academic Press, New York, 2006). [Google Scholar]
- Ashburner J., Andersson J. L. R., and Friston K. J., “Image registration using a symmetric prior—In three dimensions,” Hum. Brain Mapp 9, 212–225 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wildes R. P., Amabile M. J., Lanzillotto A. M., and Leu T. S., “Recovering estimates of fluid flow from image sequence data,” Comput. Vis. Image Underst. 10.1006/cviu.2000.0874 80, 246–266 (2000). [DOI] [Google Scholar]
- Bereziat D., Herlin I., and Younes L., “A generalized optical flow constraint and its physical interpretation,” IEEE 2000 Conference on Computer Vision and Pattern Recognition (2000).