Skip to main content
Medical Physics logoLink to Medical Physics
. 2012 Oct 25;39(11):6943–6946. doi: 10.1118/1.4762288

4D cone beam CT via spatiotemporal tensor framelet

Hao Gao 1,a), Ruijiang Li 2, Yuting Lin 3, Lei Xing 4
PMCID: PMC3494730  PMID: 23127087

Abstract

Purpose: On-board 4D cone beam CT (4DCBCT) offers respiratory phase-resolved volumetric imaging, and improves the accuracy of target localization in image guided radiation therapy. However, the clinical utility of this technique has been greatly impeded by its degraded image quality, prolonged imaging time, and increased imaging dose. The purpose of this letter is to develop a novel iterative 4DCBCT reconstruction method for improved image quality, increased imaging speed, and reduced imaging dose.

Methods: The essence of this work is to introduce the spatiotemporal tensor framelet (STF), a high-dimensional tensor generalization of the 1D framelet for 4DCBCT, to effectively take into account of highly correlated and redundant features of the patient anatomy during respiration, in a multilevel fashion with multibasis sparsifying transform. The STF-based algorithm is implemented on a GPU platform for improved computational efficiency. To evaluate the method, 4DCBCT full-fan scans were acquired within 30 s, with a gantry rotation of 200°; STF is also compared with a state-of-art reconstruction method via spatiotemporal total variation regularization.

Results: Both the simulation and experimental results demonstrate that STF-based reconstruction achieved superior image quality. The reconstruction of 20 respiratory phases took less than 10 min on an NVIDIA Tesla C2070 GPU card. The STF codes are available at https://sites.google.com/site/spatiotemporaltensorframelet.

Conclusions: By effectively utilizing the spatiotemporal coherence of the patient anatomy among different respiratory phases in a multilevel fashion with multibasis sparsifying transform, the proposed STF method potentially enables fast and low-dose 4DCBCT with improved image quality.

Keywords: low-dose, 4D cone beam CT, tensor framelet, GPU, reconstruction

INTRODUCTION

On-board 4D cone beam CT (4DCBCT) offers respiratory phase-resolved volumetric imaging,1, 2, 3, 4 and improves the accuracy of target localization in image guided radiation therapy.5 However, the clinical utility of 4DCBCT has been greatly impeded by its degraded image quality, prolonged imaging time, and increased imaging dose. Conventional approaches for respiration-correlated or 4DCBCT imaging reconstruct each respiratory phase independently, and thus often have a very limited amount of projection data available for image reconstruction. Although some progress has been made to improve the 4DCBCT image quality,6, 7, 8 there is clearly an unmet need for improving the current 4DCBCT imaging technique.

Considering the fact that the patient anatomy during respiration is highly correlated and redundant, the efficient 4DCBCT reconstruction should utilize the spatiotemporal coherence of the patient anatomy among different respiratory phases,9, 10, 11, 12, 13 which is inherently superior to the conventional phase-independent reconstructions. In this letter, we present a novel method that effectively utilizes such spatiotemporal coherence in a multilevel fashion with multibasis sparsifying transform, with the attempt for fast and low-dose 4DCBCT with improved image quality.

METHODS

Specifically, a new sparse representation method–the spatiotemporal tensor framelet (STF)—is proposed to characterize the spatiotemporal coherence of the 4D images. This new method goes as follows. Let us consider a 4D image as a tensor X = {xijkm, i ≤ Nx, j ≤ Ny, k ≤ Nz, m ≤ Nt} and define Xx, Xy, Xz, Xt as the unfolded matrices of X along x, y, z, t dimension, respectively. For example, Xy is a Ny by Nx×Nz×Nt matrix with each column Xyikm = {xikm, j ≤ Ny}. Let W1 be the standard 1D framelet transform (see the Appendix). Then the STF transform W and its adjoint WT are defined by

Y=WX={W1Xjkmx,W1Xikmy,W1Xijmz,W1Xijkt,iNx,jNy,kNz,mNt}, (1)
WTY=(jkmW1TYjkmx+ikmW1TYikmy+ijmW1TYijmz+ijkW1TYijkt)/4. (2)

The STF via Eqs. 1, 2 is a natural generalization of 1D framelet to the high-dimensional space through the tensor approach and it is easy to verify that many nice features of the framelet are kept, such as WT(WX) = X. STF has mainly two advantages: STF takes into account the temporal similarity of the 4D images by the 1D framelet operator along the temporal dimension (e.g., W1Xt) in addition to the spatial sparsity (e.g., W1Xx); in terms of the required memory, STF (e.g., ∼4 nN) is more feasible for high-dimensional problems (e.g., 4DCBCT) than the framelet (e.g., ∼n4N). Here n is the number of framelet bases (refinement masks) and N = Nx × Ny × Nz × Nt.

With STF, 4DCBCT is formulated as a L1 minimization problem

X=argminXAXY22+λWX1. (3)

In Eq. 3, AX is the x-ray transform of X with the scan geometry specified later, Y represents the 4DCBCT data, and ∥WX1 is the L1 norm of the STF of X, i.e.,

WX1=j,k,mW1Xjkmx1+i,k,mW1Xikmy1+i,j,mW1Xijmz1+i,j,kW1Xijkt1. (4)

The detail of ∥W1·∥1 is in the Appendix, which clearly indicates that STF is a generalization of the first-order differencing operator, i.e., spatiotemporal total variation (STV),12, 13 which will be compared with STF next as a state-of-art reconstruction method.

This typical nondifferentiable L1 problem 3 can be solved through the split Bregman method.14, 15 Here, we implement a GPU-based parallel solver based on our prior algorithms10, 11 for both STF and STV, in which the recently proposed fast and highly parallelizable algorithms16 are used to accelerate the x-ray transform AX and its adjoint ATX that are two most time-consuming components. Here l is scaled with the L1 norm of A and the data are scaled so that the image intensity is roughly between 0 and 1, for which l = 0.1 serves as a good choice for image reconstruction. Please note that STF is slightly faster than STV due to the left-inverse property WTW = 1, since otherwise WTW needs to be computed repeatedly for STV.

MATERIALS

The 4DCBCT reconstruction was evaluated first using a digital NCAT phantom,17 and then with experimental data. To generate the 30-s simulation data for the NCAT phantom, 40 respiratory phases were utilized, which corresponded to the maximal temporal resolution (100 ms) that was allowed by the simulation setting. For physical experiments, the CIRS dynamic thorax phantom was used, which provides known, accurate (<0.1 mm), and reproducible target motion inside a tissue equivalent phantom.18 The phantom body represents an average human thorax in shape, proportion, and composition. A lung equivalent rod containing a spherical target was inserted into the phantom. The TrueBeamTM LINAC (Varian Medical Systems, Palo Alto, CA) equipped with an on-board kV imaging system was used to scan the phantom. The kV detector has a physical size of 39.7 × 29.8 cm2. The kV imaging parameters were set to be 100 kVp, 80 mA, and 10 ms pulse width.

In both cases, the distance from source to the isocenter was 100 cm, the distance from the isocenter to the detector plane was 50 cm. The projection dataset was a 256 × 64 array (central slices of the measurement data) with the pixel size 0.16 × 0.16 cm2. The respiratory signal during scanning was a sinusoidal curve with a period of 4 s. The full-fan scan was performed with a single 200° gantry rotation that takes ∼30 s. The total number of projections was 400 and 372 for the digital and physical phantom, respectively.

For reconstruction, the field of view was 33 × 33 × 8 cm3. The reconstructed image had the spatial dimension 256 × 256 × 64 with the spatial resolution 0.13 × 0.13 × 0.13 cm3. A total of 20 respiratory phases were reconstructed to provide a temporal resolution of ∼200 ms. The stopping criterions for both STF and STV were based on the minimal data fidelity discrepancy. There were 18–20 projections of data corresponding to each reconstructed phase.

RESULTS

With the simulation data, the phantom images, the STV results, and the STF results are displayed in Fig. 1 (the exhale phase) and Fig. 2 (the inhale phase). The relative differences between the reconstructed image X and the phantom image X0 through ijkm|X-X0|/∑ijkm|X0|, were 8.0% for STV and 3.5% for STF. With the experimental data, the STV results and the STF results are displayed in Fig. 3 (Phase 8) and Fig. 4 (Phase 16). Both the simulation and experimental results (Fig. 1234) suggest that STF enables the 30-s 200° full-fan 4DCBCT scan, and it offers the superior image quality than STV.

Figure 1.

Figure 1

Simulation results (the exhale phase). Sagittal view: (a) Phantom; (b) STV; (c) STF. Coronal view: (d) Phantom; (e) STV; (f) STF. Transverse view: (g) Phantom; (h) STV; (i) STF.

Figure 2.

Figure 2

Simulation results (the inhale phase). Sagittal view: (a) Phantom; (b) STV; (c) STF. Coronal view: (d) Phantom; (e) STV; (f) STF. Transverse view: (g) Phantom; (h) STV; (i) STF.

Figure 3.

Figure 3

Experimental results (Phase 8). Sagittal view: (a) STV; (b) STF. Coronal view: (c) STV; (d) STF. Transverse view: (e) STV; (f) STF.

Figure 4.

Figure 4

Experimental results (Phase 16). Sagittal view: (a) STV; (b) STF. Coronal view: (c) STV; (d) STF. Transverse view: (e) STV; (f) STF.

In terms of the temporal resolution, the reconstructed 4D images had 20 phases, while the maximally allowed phases for the setting were roughly 40. On the other hand, the conventional 4DCBCT reconstruction often has roughly ten phases. In this sense, STF is able to improve the temporal resolution of the reconstructed 4D image, or equivalently reduce the number of projections for the same temporal resolution. The further improvement of the 4D reconstruction with the maximal phases or the minimal number of projections will be explored.

The GPU-based reconstruction was implemented with a NVIDIA Tesla C2070 GPU card (448 cores and 5.25 GB device memory). The required device memory was less than 1 GB, while the total computer memory was roughly 5 GB (mainly due to STF). Two most time-consuming components are the x-ray transform (∼2 s each) and its adjoint (∼13 s each). Since the image reconstruction takes roughly 30 iterations and each iteration involves one pair of the x-ray transform and its adjoint, the total computational time for a 4D reconstruction via STF is currently less than 10 min. Further acceleration is possible with the GPU card with more cores or the multi-GPU parallelization, since the computational cost of the used x-ray transform and its adjoint per parallel thread is O(1).16

CONCLUSION

We have proposed a novel 4DCBCT image reconstruction technique based on STF, a high-dimensional tensor generalization of the 1D framelet that effectively utilizes the spatiotemporal coherence of 4DCBCT images in a multilevel and fashion with multibasis sparsifying transform. The proposed STF method potentially enables fast- and low-dose 4DCBCT with improved image quality. The STF codes are available online.19

ACKNOWLEDGMENT

This work is partially supported by National Institutes of Health (NIH) Grant No. EB013387 and NIH Grant No. CA166186.

APPENDIX: 1D FRAMELET TRANSFORM AND ITS ADJOINT

In this study, W1 is the 1D piecewise-linear framelet or tight frame transform (TF) constructed by three following refinement masks:

w0=14[121],w1=24[101],w2=14[121]. (A1)

Please note that w0, w1, and w2 correspond to the averaging operator, the first-order differencing operator, and the second-order differencing operator, respectively. Therefore, the TF transform contains the total variation (TV) transform on multilevels.

Let x be a 1D vector. The multilevel 1D TF transform of x up to L levels is

W1x=[W1x]01[W1x]11[W1x]21 Level :1[W1x]0l[W1x]1l[W1x]2l Level :l[W1x]0L[W1x]1L[W1x]2L Level :LxL, (A2)

where the fine-to-coarse convolutions are defined by the following with x0 = x:

xl=w0l*xl1,[W1x]0l=w0l*xl,[W1x]1l=w1l*xl,[W1x]2l=w2l*xl,1lL. (A3)

Here for the efficient multilevel decomposition, instead of downsampling xl, we dilute the masks A1 to wli, e.g.,

w0l=14[1002l12002l11]. (A4)

And its transpose W1T is defined as

W1T(W1x)=xL+l=1L[W1x]0l+[W1x]1l+[W1x]2l. (A5)

Consequently, the 1D TF norm is the sum of the L1 norm of the TF terms at multilevels with multifilters, i.e.,

W1x1=xL1+l=1L{[W1x]0l1+[W1x]1l1+[W1x]2l1}, (A6)

Also, it is worthy to mention that, in terms of the computation speed, the algorithm with TF regularization is in general faster than using the transform without left inverse, such as TV, since it would require additional computation of WTW otherwise.

References

  1. Sonke J.-J., Zijp L., Remeijer P., and van Herk M., “Respiratory correlated cone beam CT,” Med. Phys. 32, 1176–1186 (2005). 10.1118/1.1869074 [DOI] [PubMed] [Google Scholar]
  2. Kriminski S., Mitschke M., Sorensen S., Wink N., Chow P., Tenn S., and Solberg T., “Respiratory correlated cone-beam computed tomography on an isocentric C-arm,” Phys. Med. Biol. 50, 5263–5280 (2005). 10.1088/0031-9155/50/22/004 [DOI] [PubMed] [Google Scholar]
  3. Dietrich L., Jetter S., Tücking T., Nill S., and Oelfke U., “Linac-integrated 4D cone beam CT: First experimental results,” Phys. Med. Biol. 51, 2939–2952 (2006). 10.1088/0031-9155/51/11/017 [DOI] [PubMed] [Google Scholar]
  4. Li T., Xing L., Munro P., McGuinness C., Chao M., Yang Y., Loo B., and Koong A., “Four-dimensional cone-beam computed tomography using an on-board imager,” Med. Phys. 33, 3825–3833 (2006). 10.1118/1.2349692 [DOI] [PubMed] [Google Scholar]
  5. Xing L., Thorndyke B., Schreibmann E., Yang Y., Li T. F., Kim G. Y., Luxton G., and Koong A., “Overview of image-guided radiation therapy,” Med. Dosim. 31, 91–112 (2006). 10.1016/j.meddos.2005.12.004 [DOI] [PubMed] [Google Scholar]
  6. Li T., Koong A., and Xing L., “Enhanced 4D cone-beam CT with interphase motion model,” Med. Phys. 34, 3688–3695 (2007). 10.1118/1.2767144 [DOI] [PubMed] [Google Scholar]
  7. Leng S., Tang J., Zambelli J., Nett B., Tolakanahalli R., and Chen G., “High temporal resolution and streak-free four-dimensional cone-beam computed tomography,” Phys. Med. Biol. 53, 5653–5673 (2008). 10.1088/0031-9155/53/20/006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Zhang Q., Hu Y., Liu F., Goodman K., Rosenzweig K. E., and Mageras G. S., “Correction of motion artifacts in cone-beam CT using a patient-specific respiratory motion model,” Med. Phys. 37, 2901–2909 (2010). 10.1118/1.3397460 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Jia X., Lou Y., Dong B., Tian Z., and Jiang S., “4D computed tomography reconstruction from few-projection data via temporal non-local regularization,” Lect. Notes Comput. Sci. 6361, 143–150 (2010). 10.1007/978-3-642-15705-9_18 [DOI] [PubMed] [Google Scholar]
  10. Gao H., Cai J. F., Shen Z. and Zhao H., “Robust principal component analysis-based four-dimensional computed tomography,” Phys. Med. Biol. 56, 3181–3198 (2011). 10.1088/0031-9155/56/11/002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Gao H., Yu H., Osher S., and Wang G., “Multi-energy CT based on a prior rank, intensity and sparsity model (PRISM),” Inverse Probl. 27, 115012 (2011). 10.1088/0266-5611/27/11/115012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Ritschl L., Sawall S., Knaup M., Hess A., and Kachelrieß M., “Iterative 4D cardiac micro-CT image reconstruction using an adaptive spatio-temporal sparsity prior,” Phys. Med. Biol. 57, 1517–1525 (2012). 10.1088/0031-9155/57/6/1517 [DOI] [PubMed] [Google Scholar]
  13. Wu H., Maier A., Fahrig R., and Hornegger J., “Spatial-temporal total variation regularization (STTVR) for 4D-CT reconstruction,” Proc. SPIE 8313, (2012). [Google Scholar]
  14. Goldstein T. and Osher S., “The split Bregman algorithm for l1 regularized problems,” SIAM J. Imaging Sci. 2, 323–343 (2009). 10.1137/080725891 [DOI] [Google Scholar]
  15. Cai J. F., Osher S., and Shen Z., “Split Bregman methods and frame based image restoration,” Multiscale Model. Simul. 8,337–369 (2009). 10.1137/090753504 [DOI] [Google Scholar]
  16. Gao H., “Fast parallel algorithms for the x-ray transform and its adjoint,” Med. Phys. (in press). [DOI] [PMC free article] [PubMed]
  17. Segars W. P., Lalush D. S., and Tsui B. M. W., “A realistic spline-based dynamic heart phantom,” IEEE Trans. Nucl. Sci. 46, 503–506 (1999). 10.1109/23.775570 [DOI] [Google Scholar]
  18. http://www.cirsinc.com/products/modality/18/dynamic-thorax-phantom/.
  19. https://sites.google.com/site/spatiotemporaltensorframelet.

Articles from Medical Physics are provided here courtesy of American Association of Physicists in Medicine

RESOURCES