Abstract
Respiration-correlated CBCT, commonly called 4DCBCT, provides respiratory phase-resolved CBCT images. A typical 4DCBCT represents averaged patient images over one breathing cycle and the fourth dimension is actually breathing phase instead of time. In many clinical applications, it is desirable to obtain true 4DCBCT with the fourth dimension being time, i.e., each constituent CBCT image corresponds to an instantaneous projection. Theoretically it is impossible to reconstruct a CBCT image from a single projection. However, if all the constituent CBCT images of a 4DCBCT scan share a lot of redundant information, it might be possible to make a good reconstruction of these images by exploring their sparsity and coherence/redundancy. Though these CBCT images are not completely time resolved, they can exploit both local and global temporal coherence of the patient anatomy automatically and contain much more temporal variation information of the patient geometry than the conventional 4DCBCT. We propose in this work a computational model and algorithms for the reconstruction of this type of semi-time-resolved CBCT, called cine-CBCT, based on low rank approximation that can utilize the underlying temporal coherence both locally and globally, such as slow variation, periodicity or repetition, in those cine-CBCT images.
Index Terms: Cine cone beam computed tomography (CBCT), low-rank matrix, reconstruction
I. Introduction
When cone beam computed tomography (CBCT) is applied to thorax or upper abdomen regions, motion-induced artifacts, such as blurring or distortion, become a serious problem, because different X-ray projections correspond to different volumetric CBCT images due to patient respiratory motion. To overcome this problem, four-dimensional CBCT (4DCBCT) [1]–[3] has been developed. In such a modality, all X-ray projections are first retrospectively grouped into different respiratory phase bins according to a breathing signal tagged on every projection image. A set of CBCT images are then reconstructed, each at a breathing phase, under the assumption that the projections placed into each bin correspond to the same or similar CBCT image. The number of phase bins is usually empirically chosen by a user based on the consideration of balancing temporal resolution and image quality. On one hand, a high temporal resolution requires a large number of phase bins, which leads to insufficient number of projections available to each phase and hence degraded quality of reconstructed CBCT images. On the other, a small number of phase bins results in low temporal resolution, as well as a relatively large bin width and the associated residual motion artifacts in the reconstructed images. To maintain clinically acceptable temporal resolution and image quality, 4DCBCT acquisition protocols such as slow gantry rotations and multiple gantry rotations have been proposed [3]–[5], and novel reconstruction or image processing techniques have been employed [6]–[9].
Conventional 4DCBCT reconstruction approach reconstructs CBCT images at different phases independent of each other [1]. This straightforward method neglects the temporal correlations of CBCT images at different phases. In contrast, a collaborative reconstruction scheme has been recently proposed [10]–[15]. Among them, [11] and [13] proposed a temporal nonlocal means method to constraint that the reconstructed CBCT images at neighboring phases must contain repetitive anatomical features. The locations of these features are allowed to vary among phases. The work of [10] and [14] first reconstructs an average CBCT image using all projections at all phases and impose a similarity constraint between this average image and the CBCT at each phase. These methods explicitly enforces the similarity among reconstructed CBCT images. Gao et al. [12] utilizes robust PCA techniques for the 4DCBCT reconstruction problem. This method restores a matrix whose columns are the 4DCBCT images at all phases under the assumption that, after a proper transformation, this matrix can be decomposed as the sum of a low-rank matrix corresponding to an almost static background and a sparsematrix corresponding to a moving foreground. The low-rank constraint achieved by minimizing the nuclear norm [16] of the associated matrix implicitly imposes the inter-phase similarity of the background. However, in the context of anatomical motion in thorax or upper abdomen, there is no clear distinction between foreground and background. Besides, as pointed out in [17], the nuclear norm minimization always finds a matrix factorization with mutually orthogonal factors. In this sense, it may have some unfavourable bias on the resulting low-rank matrices.
Although commonly called 4DCBCT, respiration-correlated CBCT [18] is a more accurate name for this imaging technique, since its fourth dimension is actually the respiratory phase rather than time. Clinically, it is more preferable to reconstruct true 4DCBCT with the fourth dimension being time, i.e., one CBCT image is reconstructed based on the corresponding instantaneous projection, if possible. This time-resolved CBCT, called cine-CBCT in this work to avoid confusion, encounters an apparent technical barrier of reconstructing a CBCT image based on only one projection where the information needed for a good reconstruction is severely insufficient. One way to overcome this barrier is to borrow some useful information of a particular patient’s anatomy from prior images. Previously we have tried to utilize prior 4DCT images of the same patient [19], [20]. Particularly, we have built a lung motion model for the patient by performing a principle component analysis (PCA) on the motion vector fields obtained from deformable registration on prior 4DCT images. We discovered that only a few principle components are sufficient to represent the lung motion to a satisfactory degree of accuracy. The reconstruction of a volumetric image based on one acquired X-ray projection is then achieved by finding those PCA coefficients, so that the projection of the corresponding CBCT matches the acquired one. However, this method heavily depends on how similar the current CBCT images are to the prior 4DCT images and how accurately the PCA parameters can be determined using one projection.
In this paper, we will propose another way to overcome this technical barrier without using any prior images. We will show the principle of proof of a computational model and numerical algorithms for cine-CBCT reconstruction based on a low rank assumption of the matrix which includes all cine-CBCT images as its columns. The low rank approximation, which uses a form of matrix factorization, maximally and automatically exploits the temporal coherence of the patient anatomy locally and globally without using any prior knowledges. Moreover, sparsity condition is also enforced under appropriate transforms, i.e., wavelet transform in space, to satisfy desired properties for the sequence of CBCT images. Simulation studies have demonstrated promising results for our method for this challenging problem.
II. Methods and Materials
Assumption
Let us denote a patient CBCT image at time t by x(t), which can be viewed as a point embedded in a high dimensional space RN with its pixel values being the coordinate, where N is the total number of pixels in each CBCT image. x(t) travels along a smooth trajectory as patient breathes. A cine-CBCT reconstruction problem tries to recover points on this trajectory, each corresponding to a moment at an X-ray projection measurement. The key assumption in our approach is that the trajectory stays piecewisely close to a low dimensional subspace in the very high dimensional embedding space. Equivalently, when we form a matrix U with each column being an CBCT image, the resulting matrix is approximately of low-rank. We believe this assumption is valid due to the fact that body motion is far from a free form motion, which is verified by the fact that only a few principle components are sufficient to represent the lung motion to a satisfactory degree of accuracy as studied in [19], [20]. Based on this assumption, we use a matrix factorization formulation U ≈ LR. The number of columns for the matrix L constraints the rank of approximation. We further impose desired image properties by enforcing sparsity of the tight wavelet frame transform of L. In this matrix factorization formulation both the low rank approximation and the sparse representation in wavelet transform domain reduces the true degree of freedom of U significantly. In all previous approaches mentioned in the Section of Introduction, reduction of degrees of freedom are exploited explicitly or implicitly from redundancy and coherence in space and time as well as sparse representation. In particular, 4DCBCT based on phase binning explicitly used the low rank assumption, i.e., the rank is equal to the number of phases. However, as mentioned before, the number of phases, i.e., the rank, is usually empirically chosen and more importantly coherence in time or among different phases are not explored at all.
Another important practical issue is how well the small number of image basis for the low dimensional subspace and the factorization U ≈ LR can be recovered from the projection data. Intuitively, this requires each basis image or equivalently all cine-CBCT images collectively have a sufficient number of projections covering a sufficiently large scanning angle, which is also required for the construction of a single CBCT image. As shown in experiments in Section IV-C, reconstruction from data using different projections can produce different results for the same ground truth.
A. Model
In this proof-of-principle study, we consider the reconstruction of a 2-D slice of the CBCT to illustrate the principles of the new algorithm. The basic ideas, however, can be easily generalized into 3-D contexts. Let the unknown 2-D image be a function u defined in ℝ2. When the X-ray source is placed with angle θ, the projection measured by the imager at location z is
(1) |
where xθ ∈ ℝ2 is the coordinate of the X-ray source, and rz ∈ 𝕊2 and ℓz are respectively the direction and the length of the line connecting the X-ray source and the location z on the imager. The operator ℘ in (1) is also known as Radon transform. If fθ(z) is sampled with respect to z, the resulting projection data can be written as a vector fθ ∈ ℝM that obeys
(2) |
Here Pθ ∈ ℝM×N and u ∈ ℝN are the discretization of ℘θ and u in (1) respectively. Suppose that we have T projections where the X-ray source is placed with angels θ1, θ2, …, θT, respectively. Let Pi and fi stand for Pθi and fθi respectively. Then, by putting (2) with different angles together, the CT projection can be written into a system of linear equations
(3) |
where P = [P1; P2; …; PT] ∈ ℝMT×N and f = [f1; f2; …; fT] ∈ ℝMT. In other words, the CT image reconstruction problem is to recover image u from its partial Radon transform.
In the context of cine-CBCT reconstruction, instead of only one unknown image u as in the CBCT problem, there is a set of unknown images, each associated with a projection, denoted by {u1, u2, …, uT} and the projection condition in (3) may be modified to
(4) |
If we write all the unknown images in a matrix form U = [u1, u2, …, uT], (4) can be rewritten into a compact matrix equation
(5) |
where ℘U = [P1u1, …, PTuT] and F = [f1, …, fT].
With low rank approximation assumption for U as discussed in Section II-A, the number of intrinsic unknowns of (5) are possibly less than the number of measurements, and therefore it is possible to reconstruct T CBCT images in U from T projections. To incorporate the underlying low-rank assumption into the cine-CBCT reconstruction process, we would like to explore matrix factorization based low-rank models in this study. More specifically, we enforce a decomposition form of the unknown matrix U of images as U = LR where L ∈ ℝN×K and R ∈ ℝK×T for a small integer K. From basic linear algebra knowledge, any matrix U of rank K can be represented in U = LR and conversely, the rank of U is at most K given the factorization form. In our algorithm, the value of K is specified by the user and our method regarding the selection of K will be presented later. We would like to point out that there is another popular approach for imposing the low rank condition that penalizes the nuclear norm of the matrix U, namely the sum of the singular values. Compare these two methods, the latter always finds orthogonal basis because of the involved singular value decomposition process. In contrast, our factorization method does not require this implicit orthogonality and hence attains the advantage of avoiding unfavourable bias on the resulting low-rank matrices [17]. In some sense, the approach of penalizing the nuclear norm of U is like principle component analysis and our approach is like independent component analysis. As a consequence, each of our basis, i.e., each column of L, represents more meaningful images and hence we can enforce some desired image properties on L, such as sparsity under wavelet transform.
This matrix factorization approach also allows for the additional regularizations on L and R, so that U = LR carries desirable physical properties. First, the columns of the matrix L, corresponding to images, can be interpreted as a basis that efficiently represents all the columns of U. Notice that images usually have sparse coefficients under suitable transforms such as wavelet tight frames [21], [22]. Let 𝒟 be such a transform. Since the columns of L are images, we want 𝒟L to be sparse. It is well-known that ℓ1-norm minimization leads to sparse solutions. Therefore, the ℓ1-norm ||𝒟L||1 is penalized. Second, the rows of R are the coefficients of U under the basis L, which reflect the respiratory motion. As the respiratory motion varies and could be irregular sometimes, there is no universal regularization that is suitable for all kinds of respiratory motion. We simply penalize its Frobenius norm to balance the energy of L and R. By enforcing sparse representation of L in wavelet domain, the intrinsic degrees of freedom are further reduced. Both low rank and sparse conditions can be regarded as some kind of regularization that makes the whole problem less under-determined or ill-posed.
Altogether, we propose to reconstruct the cine-CBCT by solving an optimization problem
(6) |
In practice, (5) is barely satisfied, because there is always unavoidable noise in the measurements F. Moreover, in reality the matrix U is only approximately low-rank and explicitly enforcing a low-rank representation leads to error. For these considerations, we solve
(7) |
where ||·||F is the Frobenius norm, namely for a matrix A. In (7), σ is a parameter to control to what extent the violation of (5) is allowed.
Algorithm
Let us first consider the algorithm for solving (6). We use a split Bregman method [23], [24] (also known as augmented Lagrangian method) to solve this problem. The augmented Lagrangian of (6) is
(8) |
where 〈·, ·〉 is the inner product, and Z is Lagrange multipliers. With appropriate fixed Z, an optimal L, R can be found by simply minimizing E(L, R, Z) with respect to (L, R). Therefore, the trick is to determine Z. In the augmented Lagrangian algorithm, we use
(9) |
Algorithm 1.
1: | Repeat for sout times.
|
2: | If is small enough, then return. |
3: | Update Z ← Z + (℘(LR − F). |
4: | Goto Step 1. |
The two subproblems in (9) are solved by split Bregman algorithm again in a similar fashion to (8) and (9). Let us take the first one as an example. By introducing an auxiliary variable C, the first subproblem in (9) is equivalent to
(10) |
and the associated augmented Lagrangian is
Then, a split Bregman method for solving the first subproblem in (9) is as follows:
(11) |
All steps involved in (11) can be solved easily by either entrywise soft-thresholding (for C) and linear equation solvers (for L), e.g., conjugate gradient method. We use the same method for solving the second subproblem in (9). The full algorithm is summarized in Algorithm 1, where 𝒯 is the soft-thresholding operator defined by [𝒯α(A)]ij = sign([A]ij) · max{|[A]ij| − α, 0}.
For any true or semi time resolved CBCT reconstruction, both storage and computational cost is a challenge. An important feature of our formulation and computational algorithm is to explore the low rank property. In our numerical computation, the huge matrix is stored and manipulated in a low-rank decomposition form, which can significantly reduce the memory size and computational cost. In particular, the computational cost of Algorithm 1 is mainly on solving linear equations in Step 1(a)(i) and 1(b)(i), where the conjugate gradient (CG) algorithm is employed. For moderately large μ1/μ and μ2/μ, the condition number of these linear equations is small, and therefore only a few iterations are necessary for the convergence of the CG algorithm. Consequently, each step in Algorithm 1 can be done in several matrix-vector products. The computational time for the reconstruction of images with 128×128 pixels and 360 projections is typically a few hours, on a computer with Core i7–2600 CPU at 3.40 GHz, 8 GB memory, and MATLAB R2013a.
One complexity of this problem comes from the nonconvexity of the objective function in (6). Since there may exist local minima, we have to make sure our algorithm get a desired solution. Yet, in our numerical experiments, this nonconvexity issue is not found to be a problem for the following two reasons. First of all, our algorithm may find the global minimum of (6). As pointed out in [25], the augmented Lagrangian algorithm attains the global minimum for nonconvex objectives in low-rank factorization form under suitable assumptions. Similar algorithms to ours has been used in, e.g., SDPLR (semi-definite programming via low rank factorization) [26]. Secondly, we have also chosen the initial guess to the iterative algorithm in the Algorithm 1 carefully as following. We first solve a convex minimization problem
(12) |
where ||U||* is the nuclear norm, i.e., the summation of the singular values of U. The nuclear norm minimization is able to find a lowest-rank solution U* of (5) within a precision [16]. Then, the initial guess in the Algorithm 1 is chosen to be the best rank-K approximation to U*. Let U* = WΣVT be a singular value decomposition. If the rank of U* is less than K, we choose L(0) = WΣ1/2 and R(0) = Σ1/2VT as the initial guesses. If the rank of U* exceeds K, we choose and as the initial guesses, where WK, VK are the first K columns of W and V respectively, and ΣK is the K×K principle submatrix of Σ. By this way, L(0) R(0) is the best rank-K approximation of U*.
As for the problem in (7), we still use the Algorithm 1, but we stop the iteration as soon as . By this way, we can get a quite good approximate solution to (9); see [27] for a detailed discussion. This approach has been previously used and discussed in similar mathematical problems but in other contexts, e.g., [24], [27], [28].
B. Choice of K
One practical issue in our algorithm is the selection of the parameter K to control the matrix rank. In practice, we achieve this goal by the following procedure.
We first set K to be a large enough number, e.g., K = 20, and run our algorithm. Recall that the columns of L forms a basis to represent the images in U. Because of the sufficiently large K value in this trial run, the algorithm is forced to generate a set of K basis vectors containing those good ones for U, as well as the unnecessary bad ones. This is reflected by the fact that some columns of L in the solution make significant contributions to U, while the other columns contribute little. For those unnecessary columns in L, their presence introduces a small but observable amount of signals into the finally reconstructed images in U and compromises its quality. It is therefore desirable to eliminate them from the first place by setting the value of K to be the number of the significant columns. A second reconstruction is then performed with this properly chosen K value.
It remains to determine which columns are significant and which ones are negligible. Since the contribution of the ith column of L to U is L(:, i)R(i, :) any norm of this project reflects the importance of this basis vector L(:, i). The larger norm is, the more significant L(:,i) is. In the experiments, we use the ∞-norm to identify the significance, which is the maximum of row absolute sums of a matrix, or equivalently, the pixel that has the largest absolute sum over time. The column L(:, i) is considered to be negligible, if ||L(:, i)R(i, :)||∞ is close to zero, and significant otherwise.
C. Experiments
We tested our cine-CBCT reconstruction algorithm on a digital NURBS-based cardiac-torso (NCAT) phantom [29], which generates a patient body in thorax region with detailed anatomical features and a realistic motion pattern. The patient respiratory period is 4 s. The CBCT gantry rotates about the patient at a constant speed for a full rotation in 59 s, in which 360 X-ray projections are acquired. At each X-ray projection acquisition, we compute the NCAT phantom image at the specific time point with a resolution of 128 × 128, and the projection is then computed using a ray-tracing algorithm at the associated projection angle with a detector resolution of 256 bins. The patient breathing period and the gantry rotation period are chosen to be incommensurate deliberately, so that all the 360 patient images are distinct, although some of them visually look close. Some of the underlying true images are shown in Fig. 1. Under this setup, the size of U is 16384 × 360, and the size F of is 256 × 360.
Two experiments are performed in this feasibility study, namely the measured projections F generated from a full-fan and half-fan scan. In each case, we will first present the results for the selection of the parameter K and then the reconstruction results. Apart from visual inspection of the reconstructed cine-CBCT images, we have also quantitatively assessed the restored image quality using relative error as a metric defined by ||LR − Utrue||F/||Utrue||F, where L, R are the outputs of the proposed method, and Utrue is the matrix consisting of ground truth NCAT images. Unless specified, the parameters used throughout our experiments are λ = 4000, μ = 1, μ1 = μ2 = 105, and sin = sout = 10. We stop the algorithm when ||℘(LR) − F||F/||F||F ≤ ε and ε is chosen case by case.
III. Experimental Results
A. Full-Fan Scan
For this case, the plot of ||L(:, i)R(i, :)||∞ in a descending order as a function of the column index i is first depicted in Fig. 2, with a trial run of K = 20. Clearly, the eighth column and beyond contribute little to the reconstructed images and hence should be removed. As such, K = 7 is selected and the corresponding results are shown in Figs. 3 and 4. Comparing the restored images in Fig. 3 with the corresponding ground truth images in Fig. 1, it is found that our algorithm is able to capture the motions of the anatomy and restore the structures, even those small ones inside the lung. Meanwhile, observable artifacts inside the heart and at its boundary also exist. Quantitatively, the relative error of the restored cine-CBCT images is 3.89%.
To further look into the reconstruction results, we plot the columns of L in Fig. 4(a), where each column is reshaped into a 128 × 128 image. These images form the basis to represent all the reconstructed cine-CBCT images. It is observed that the first one is similar to an image averaged over all the cine-CBCT images. Its presence provides an overall structure that is common to all the images in the cine-CBCT. Meanwhile, other basis images represent differences between images of U. We also plot the corresponding coefficients in R in Fig. 4(b), which attain a periodically variation pattern, indicating the patient respiratory motion.
B. Noisy Data From a Full-Fan Scan
To demonstrate the robustness of our algorithm to noise, we test our algorithm with noisy data. We add noise at 0.5 mAs/projection to the projection data from the full-fan scan in Section III-A. In this case, the parameter K is selected as K = 4 according to Fig. 5. Fig. 6(a) depicts the reconstructed images. The relative error of the restored cine-CBCT images is 6.81%. The columns of L and the rows of R are plotted in Fig. 6(b) and (c), respectively. Again, we see that the columns of L are meaningful basis corresponding to the average image as well as the variations between them, and the rows of R oscillate periodically.
C. Half-Fan Scan
For the case with noiseless data from a half-fan scan, the parameter K is selected as K = 7 according to Fig. 7. In Fig. 8, we plot the reconstructed images, the columns of L, and the rows of R. The relative error of the restored cine-CBCT images is 6.62%, and we see again that the columns of L are meaningful basis corresponding to the average image as well as the variations between them, and the rows of R oscillate periodically.
IV. Validations on Our Model
A. Validation on the Low-Rank Assumption
To demonstrate validity of the low rank assumption, we have conducted a study on a digital NCAT phantom, where typical images of such a phantom are in Fig. 1. A sequence of T images are generated at a transverse slice during a breathing cycle. Each of these images is represented by a vector of length N, where N is the number of pixels in one image. We then form a N × T matrix using these vectors as columns. Singular value decomposition of this matrix is conducted. For a case with T = 100 per breathing cycle, there are in total 100 nonzero singular values (SVs), indicating that the matrix is strictly speaking of full rank. Yet, only the first few SVs dominate, as illustrated in the left panel of Fig. 9. This implies that the matrix is approximately of low rank. In particular, the eight largest SVs accounts for over 97% of the sum of all the SVs. Moreover, this ratio, namely the required number of leading SVs to account over 97% of the sum of all SVs to the total number of SVs (or images), decreases dramatically as the number of images T sampled in a breathing cycle increases, as shown in the right panel of Fig. 9. This simple numerical experiment demonstrates the validity of our assumption to a certain extent and the rest of our reconstruction work is based on this assumption. However, we remark that whether this assumption holds in real clinical cases is subject to further validations.
B. Exactly Low-Rank Ground Truth
We have also conducted one experiment where the ground truth 4DCT images are indeed forms a low-rank matrix. This experiments show that our model works better for exactly low-rank ground truths than with approximately low-rank ones.
We generate the true images Û1 by Û1 = L̂1R̂1, where L̂1 and R̂1 are from Fig. 4(a) and (b), respectively. By this way, the true cine-CBCT image Û1 is an approximation to the cine-CBCT image in Section II-E whose respiratory motion is rather regular. Clearly, the rank of Û1 is 7 and the dimension is 16384 × 360, as L̂1 consists of 7 basis images of size 128 × 128 and R̂1 of 7 coefficients of length 360. Then, we restore L and R from F = ℘Û1 by Algorithm 1. The restored basis images and the ground truth images are compared in Fig. 10. The relative error of the restored cine-CBCT image is 1.35%, which is smaller than that in Section III-A, where the ground truth is not exactly but approximately low-rank. This shows the low-rankness of the ground truth 4DCT images can help reduce the reconstruction error.
C. One Cycle Scan
Our model does not depend on the periodicity of the respiratory motion. To validate this fact, we tested our model on cine-CBCT images generated in one breathe cycle, where no breathe phase is repeated. We use the true cine-CBCT image from the one described in Section II-E with a scanning period 4.1 s and still T = 360, and frames of this true cine-CBCT image are shown in Fig. 11. Therefore, there is only one full cycle of respiratory motion in the true cine-CBCT images. The results are plotted in Figs. 12 and 13. Our method can still give a satisfactory result, and the relative error of the restored cine-CBCT images is 9.78%.
For the one cycle scan, we performed two more experiments to see how the projection operator ℘ affects the quality of the reconstructed images. Firstly, we find that a random permutation on the projection angels could improve the quality of the reconstructed image. This can be seen from the results presented in Fig. 14, and the relative error of the restored cine-CBCT images is 5.39%. The reason is that, when we randomly permute the projection angels, the projection angels for similar images are random, making the reconstruction more accurate and robust especially when using one cycle scan. Secondly, increasing the temporal resolution could improve the quality of reconstructed images. To see this, we use the true cine-CBCT image from the one described in the previous paragraph with a scanning period 4.1 s and T = 720. The results are depicted in Fig. 15. The relative error of the restored cine-CBCT images is 7.42%. This is related to the validation test presented in Section IV-A, i.e., the ration of required number of leading SVs to account over a certain percentage of the sum of all SVs to the total number of SVs (or images), decreases dramatically as the number of images T sampled in a breathing cycle increases, as shown in the right panel of Fig. 9.
D. High Resolution Images
Since the image video is stored and manipulated in a low-rank decomposition form, our method can reconstruct images with high resolution images. To demonstrate this, we test our algorithm on a high resolution correspondence of the datasets in Section III-A. In particular, we test images with 512 × 512 pixels and 360 projections, and we still use a detector resolution of 256 bins. The results are depicted in Fig. 16. The relative error of the restored cine-CBCT images is 10.11%. The error is larger than that for low-resolution since the linear (5) is more under-determined than the low resolution case.
V. Conclusion and Discussion
In this work, we have proposed an computational algorithm to reconstruct cine-CBCT to provide a set of time-dependent images in the thorax region under the assumption that the matrix formed by all of these images to be reconstructed has a low rank approximation. In contrast to the currently used 4DCBCT, where only respiratory phase-resolved imaging is achieved, the cine-CBCT modality retrieves the instantaneous patient anatomy corresponding to each CBCT projection. Cine-CBCT reconstruction is apparently a challenging problem due to the very limited projection information. Yet, by effectively incorporating the underlying local and global temporal coherence of cine-CBCT images into the reconstruction process, we demonstrate the feasibility of this modality. In particular, the low rank approximation is based on matrix factorization form U = LR for the matrix U whose columns are the images to be reconstructed. The dimension of the matrix L constraints the rank of U and hence maximally and automatically exploits a coherence condition among all the images in cine-CBCT. Simulation studies on an NCAT phantom serve as a very preliminary test to demonstrate the feasibility of our approach, where successive reconstructions have been observed in various experiments.
In addition to the apparent advantage of improved temporal resolution in the cine-CBCT over 4DCBCT, the reconstruction of cine-CBCT is not as demanding regarding the amount of projection data as 4DCBCT, at least based on our simulation studies. Because of the phase binning in 4DCBCT, a certain number of projections are required in each bin to yield a CBCT image for the bin with a satisfactory quality. Hence, it is usually challenging to get a 4DCBCT with only one-minute scan and in practice, protocols of multiple rotations or a slow rotation is utilized. In contrast, the cine-CBCT effectively incorporates the inter-image correlations into the reconstruction and a set of images with visible major anatomical features are obtained even with a super-fast scan protocol corresponding to only a single breathing period, i.e., a 4-s scan if the patient’s breathing period is 4 s, as demonstrated in our studies. Another advantage is that our method can deal with respiratory motion quite well in the sense that irregularity does not affect the construction of the regular images too much as shown in the experiments.
Although the fourth dimension in cine-CBCT is more than breathing phase as in conventional 4DCBCT, we would like to point out that cine-CBCT is not completely time resolved in general. A good cine-CBCT reconstruction is based on two key assumptions. First, all cine-CBCT images can be represented (approximately) by linear combinations of a small number of basis images, meaning certain irregular patient anatomy variation may not be captured by cine-CBCT if they cannot be approximated well by the basis images. Second, all cine-CBCT images collectively correspond to a sufficient number of projections covering a sufficiently large scanning angle, meaning our algorithm will not work if the scanning angle is smaller than that required by the regular CBCT reconstruction. Nor will other methods do since no reconstruction method can recover information beyond the data. That being said, cine-CBCT images reconstructed using our algorithm still contain clinically valuable temporal information and represent a quantum leap from the conventional 4DCBCT images with the same setup and measurements.
Despite the success in this feasibility study, there are a few practical issues as for the clinical applications of cine-CBCT. First of all, the low rank assumption has to be verified using clinical data. Second, only one slice of the patient anatomy is reconstructed in this proof-of-principle study. When cone-beam geometry is considered to reconstruct volumetric CBCT images, although the fundamental principles in our algorithm can be easily applied, one has to address challenges for both storage and computational time for any true or semi-time resolved CBCT reconstruction in clinical practice. Currently, it takes a few hours to reconstruct those images shown in our study with 128 × 128 pixels and 360 projections on a typical computer using MATLAB. For real clinical cases with a cone beam geometry and a high resolution, e.g., voxels, the computation time would be estimated too long under the current computation platform. However, more powerful computational platforms, such as graphics processing unit (GPU) [30], or multi-GPU [31], are expected to greatly accelerate the reconstructions. GPU is essentially a parallel processing platform with a large number of processing cores. For example, in Algorithm 1, the most computation intensive part is computing forward projection ℘ and its transpose ℘T when solving the least-square problems. For problems of large sizes, these matrices are too large to be stored. But these operations can be greatly accelerated on the GPU platform by splitting the operations according to projection ray-lines, or image voxels [32]. Other computation intensive tasks, such as matrix-matrix or matrix-vector operations, can be easily parallelized to achieve significant speed up. Another challenge is large memory requirement which can not fit into single GPU. In this case, splitting calculations across multiple GPUs will be the solution, at the cost of inter-GPU communications [31]. It will be our future research project to implement the reconstruction algorithm on GPU. Over the years, GPU-based computing has demonstrated the feasibility of greatly accelerating CBCT reconstructions, such that the computation time of a high-resolution 3-D case on GPU is comparable to that for a 2-D case using MATLAB on CPU [33]. It is expected that these advanced approaches can eventually render the computation efficiency to a practically acceptable level.
Moreover, even in the current form, namely reconstructing only a 2-D slice in a time-resolve fashion, our algorithm will still have some important applications and attain advantages over those methods employed in current clinical practice. One context is lung tumor motion management in advanced radiation therapy. At present, this is typically achieved by monitoring external surrogate motion, e.g., chest wall or diaphragm, and inferring tumor motions via a certain motion model. This indirect method suffers from problems such as motion model accuracy and reliability caused by poor internal-external correlations and temporal variations of motion patterns. Measuring motions of makers that are implanted to the lung tumor is another practical way. Yet, the invasive nature makes it less clinically preferred. Compared to these approaches, our method is capable of directly imaging motions, although in a 2-D slice in this initial study due to computational costs. It will hence provide valuable information to facilitate a number tasks in radiotherapy motion managements, e.g., tumor tracking or gated treatment delivery. In addition, because our algorithm does not rely on motion periodicity, its application can be generalized to tumor sites other than lung, such as prostate, which are affected by motions of nearby organs. For these sites, 4DCBCT cannot be applied due to the lack of motion periodicity. With limited correlations between the organ of interest and other surrogates, it is not possible to infer organ motions via surrogate motions. Tracking implanted markers is employed currently if needed. Our method could be another method to retrieve motion information for these sites and therefore help motion managements in this context.
Finally, we would also like to remark that, we propose a new modality of cine-CBCT in this study and a reconstruction algorithm that can be potentially used to achieve it. It is not our claim that the algorithm is practical enough for cine-CBCT reconstruction, as only a 2-D slice is reconstructed in the studies here and 3-D cases are apparently computationally challenging for the moment. However, the problem of interest here is a temporally resolved imaging modality on the cone beam CT hardware system. Hence, the experimental designs in this study have been focused on the CBCT context, such that parameters in the simulation cases, e.g., gantry rotation speed and projection geometry, are set according to those in a CBCT scanner. It is also our study in near future to test the algorithm in real data acquired in a CBCT machine. The X-ray source will be collimated, so that only the central slice projection data will be used for reconstructing only one transverse plane. We also noticed that our algorithm can be applied to cases with a fan-beam projection geometry, i.e., conventional CT. Yet, cine-CT has been already achieved there [34]–[36]. Due to the ultra fast source rotation speed, cine-CT reconstruction is not a difficult problem with modern CT scanners, as enough many projections covering a large angular range can be acquired with in a short time window for a time point. Therefore, we focus our study on a much more challenging context with the current CBCT scanner, which hence leads to the name of cine-CBCT.
Acknowledgments
The work of J.-F. Cai was supported by the Simons Foundation under Grant 281384. The work of S. Jiang and X. Jia was supported in part by the National Institutes of Health (NIH) under Grant 1R01CA154747-01, in part by the Master Research Agreement from Varian Medical Systems, Inc., and in part by the Early Career Award from Thrasher Research Fund. The work of H. Gao was supported by the NIH-National Institute of Biomedical Imaging and Bioengineering under Grant R21EB013387. The work of H. Zhao was supported in part by the Office of Naval Research under Grant N00014-11-1-0602 and in part by the National Science Foundation under Grant DMS-1115698.
Contributor Information
Jian-Feng Cai, Department of Mathematics, The University of Iowa, Iowa City, IA 52242 USA.
Xun Jia, Department of Radiation Oncology, University of Texas Southwestern Medical Center, Dallas, TX 75235 USA.
Hao Gao, School of Biomedical Engineering and Department of Mathematics, Shanghai Jiao Tong University, Shanghai 200240, China.
Steve B. Jiang, Department of Radiation Oncology, University of Texas Southwestern Medical Center, Dallas, TX 75235 USA
Zuowei Shen, Department of Mathematics, National University of Singapore, Singapore 119076, Singapore.
Hongkai Zhao, Department of Mathematics, University of California, Irvine, CA 92697 USA.
References
- 1.Sonke JJ, et al. Respiratory correlated cone beam CT. Med Phys. 2005;32:1176–1186. doi: 10.1118/1.1869074. [DOI] [PubMed] [Google Scholar]
- 2.Kriminski S, et al. Respiratory correlated cone-beam computed tomography on an isocentric C-arm. Phys Med Biol. 2005;50:5263–5280. doi: 10.1088/0031-9155/50/22/004. [DOI] [PubMed] [Google Scholar]
- 3.Li TF, et al. Four-dimensional cone-beam computed tomography using an on-board imager. Med Phys. 2006;33:3825–3833. doi: 10.1118/1.2349692. [DOI] [PubMed] [Google Scholar]
- 4.Lu J, Guerrero TM, Munro P, Jeung A, Chi PCM, Balter P, Zhu XR, Mohan R, Pan T. Four-dimensional cone beam CT with adaptive gantry rotation and adaptive data sampling. Med Phys. 2007;34(9):3520–3529. doi: 10.1118/1.2767145. [DOI] [PubMed] [Google Scholar]
- 5.Li T, Xing L. Optimizing 4D cone-beam CT acquisition protocol for external beam radiotherapy. Int J Radiat Oncol Biol Phys. 2007 Mar;67(4):1211–1219. doi: 10.1016/j.ijrobp.2006.10.024. [DOI] [PubMed] [Google Scholar]
- 6.Bergner F, Berkus T, Oelhafen M, Kunz P, Pan T, Kachelriess M. Autoadaptive phase-correlated (AAPC) reconstruction for 4D CBCT. Med Phys. 2009 Dec;36(12):5695–5706. doi: 10.1118/1.3260919. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Rit S, Wolthaus JWH, Herk MV, Sonke JJ. On-the-fly motion-compensated cone-beam CT using an a priori model of the respiratory motion. Med Phys. 2009 Jun;36(6):2283–2296. doi: 10.1118/1.3115691. [DOI] [PubMed] [Google Scholar]
- 8.Bergner F, Berkus T, Oelhafen M, Kunz P, Pan T, Grimmer R, Ritschl L, Kachelriess M. An investigation of 4D cone-beam CT algorithms for slowly rotating scanners. Med Phys. 2010 Sep;37(9):5044–5053. doi: 10.1118/1.3480986. [DOI] [PubMed] [Google Scholar]
- 9.Zhang Q, Hu YC, Liu F, Goodman K, Rosenzweig K, Mageras G. Correction of motion artifacts in cone-beam CT using a patient-specific respiratory motion model. Med Phys. 2010 Jun;37:2901–2909. doi: 10.1118/1.3397460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Leng S, Tang J, Zambelli J, Nett B, Tolakanahalli R, Chen GH. High temporal resolution and streak-free four-dimensional cone-beam computed tomography. Phys Med Biol. 2008 Oct;53:5653–5673. doi: 10.1088/0031-9155/53/20/006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Jia X, Lou Y, Dong B, Tian Z, Jiang S. 4D computed tomography reconstruction from few-projection data via temporal non-local regularization. Proc MICCAI. 2010:143–150. doi: 10.1007/978-3-642-15705-9_18. [DOI] [PubMed] [Google Scholar]
- 12.Gao H, Cai JF, Shen Z, Zhao H. Robust principle component analysis based four-dimensional computed tomography. Phys Med Biol. 2011;56(11):3181–3198. doi: 10.1088/0031-9155/56/11/002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Tian Z, Jia X, Dong B, Lou Y, Jiang S. Low-dose 4DCT reconstruction via temporal nonlocal means. Med Phys. 2011;38:1359–1365. doi: 10.1118/1.3547724. [DOI] [PubMed] [Google Scholar]
- 14.Qi Z, Chen GH. Performance studies of four-dimensional cone beam computed tomography. Phys Med Biol. 2011;56(20):6709–6721. doi: 10.1088/0031-9155/56/20/013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Jia X, Tian Z, Lou Y, Sonke J, Jiang S. Four-dimensional cone beam CT reconstruction and enhancement using a temporal non-local means method. Med Phys. 2012;39:5592–5602. doi: 10.1118/1.4745559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Cai J-F, Candès EJ, Shen Z. A singular value thresholding algorithm for matrix completion. SIAM J Optimiz. 2010;20(4):1956–1982. [Google Scholar]
- 17.Chandrasekaran V, Recht B, Parrilo P, Willsky A. The convex geometry of linear inverse problems. Foundat Computat Math. 2012;12(6):805–849. [Google Scholar]
- 18.Ford E, Mageras G, Yorke E, Ling C. Respiration-correlated spiral CT: A method of measuring respiratory-induced anatomic motion for radiation treatment planning. Med Phys. 2003;30(1):88–97. doi: 10.1118/1.1531177. [DOI] [PubMed] [Google Scholar]
- 19.Li R, Jia X, Lewis JH, Gu X, Folkerts M, Men C, Jiang SB. Real-time volumetric image reconstruction and 3-D tumor localization based on a single X-ray projection image for lung cancer radiotherapy. Med Phys. 2010;37(6):2822–2826. doi: 10.1118/1.3426002. [DOI] [PubMed] [Google Scholar]
- 20.Li R, Lewis JH, Jia X, Gu X, Folkerts M, Men C, Song WY, Jiang SB. 3-D tumor localization through real-time volumetric X-ray imaging for lung cancer radiotherapy. Med Phys. 2011;38(5):2783–2794. doi: 10.1118/1.3582693. [DOI] [PubMed] [Google Scholar]
- 21.Ron A, Shen Z. Affine systems in L2(Rd): the analysis of the analysis operator. J Funct Anal. 1997;148(2):408–447. [Google Scholar]
- 22.Cai J-F, Dong B, Osher S, Shen Z. Image restoration: Total variation, wavelet frames, and beyond. J Am Math Soc. 2012;25(4):1033–1089. [Google Scholar]
- 23.Goldstein T, Osher S. The split Bregman method for L1 regularized problems. SIAM J Imag Sci. 2009;2(2):323–343. [Google Scholar]
- 24.Cai JF, Osher S, Shen Z. Split Bregman methods and frame based image restoration. Multiscale Model Simulat. 2009;8:337–369. [Google Scholar]
- 25.Burer S, Monteiro RDC. Local minima and convergence in low-rank semidefinite programming. Math Program, ser A. 2005;103(3):427–444. [Google Scholar]
- 26.Burer S, Monteiro RDC. A nonlinear programming algorithm for solving semidefinite programs via low-rank factorization. Math Program, ser B. 2003;95(2):329–357. [Google Scholar]
- 27.Osher S, Burger M, Goldfarb D, Xu J, Yin W. An iterative regularization method for total variation-based image restoration. Multi-scale Model Simul. 2005;4(2):460–489. electronic. [Google Scholar]
- 28.Cai JF, Osher S, Shen Z. Linearized Bregman iterations for frame-based image deblurring. SIAM J Imag Sci. 2009;2(1):226–252. [Google Scholar]
- 29.Segars W, Mahesh M, Beck T, Frey E, Tsui B. Realistic CT simulation using the 4D XCAT phantom. Med Phys. 2008;35:3800. doi: 10.1118/1.2955743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Jia X, Lou Y, Li R, Song WY, Jiang SB. Gpu-based fast cone beam CT reconstruction from undersampled and noisy projection data via total variation. Med Phys. 2010;37(4):1757–1760. doi: 10.1118/1.3371691. [DOI] [PubMed] [Google Scholar]
- 31.Yan H, Wang X, Cervino L, Jiang SB, Jia X. Towards the clinical implementation of iterative cone beam CT reconstruction for radiation therapy using a multi-GPU system. Med Phys. 2014 doi: 10.1118/1.4898324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Jia X, Dong B, Lou Y, Jiang S. GPU-based iterative cone-beam CT reconstruction using tight frame regularization. Phys Med Biol. 2011;56:3787–3807. doi: 10.1088/0031-9155/56/13/004. [DOI] [PubMed] [Google Scholar]
- 33.Jia X, Ziegenhein P, Jiang SB. GPU-based high-performance computing for radiation therapy. Phys Med Biol. 2014;59(4):R151. doi: 10.1088/0031-9155/59/4/R151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Ford E, Mageras G, Yorke E, Ling C. Respiration-correlated spiral CT: A method of measuring respiratory-induced anatomic motion for radiation treatment planning. Med Phys. 2003;30:88–97. doi: 10.1118/1.1531177. [DOI] [PubMed] [Google Scholar]
- 35.Keall P, Starkschall G, Shukla H, Forster K, Ortiz V, Stevens C, Vedam S, George R, Guerrero T, Mohan R. Acquiring 4D thoracic CT scans using a multislice helical method. Phys Med Biol. 2004;49:2053–2067. doi: 10.1088/0031-9155/49/10/015. [DOI] [PubMed] [Google Scholar]
- 36.Low D, Nystrom M, Kalinin E, Parikh P, Dempsey J, Bradley J, Mutic S, Wahab S, Islam T, Christensen G. A method for the reconstruction of four-dimensional synchronized CT scans acquired during free breathing. Med Phys. 2003;30:1254–1263. doi: 10.1118/1.1576230. [DOI] [PubMed] [Google Scholar]