Abstract
The PET-enabled dual-energy CT method allows dual-energy CT imaging on PET/CT scanners without the need for a second x-ray CT scan. A 511 keV γ-ray attenuation image can be reconstructed from time-of-flight PET emission data using the maximum-likelihood attenuation and activity (MLAA) algorithm. However, the attenuation image reconstructed by standard MLAA is commonly noisy. To suppress noise, we propose a neural-network approach for MLAA reconstruction. The PET attenuation image is described as a function of kernelized neural networks with the flexibility of incorporating the available x-ray CT image as anatomical prior. By applying optimization transfer, the complex optimization problem of the proposed neural MLAA reconstruction is solved by a modularized iterative algorithm with each iteration decomposed into three steps: PET activity image update, attenuation image update, and neural-network learning using a weighed mean squared-error loss. The optimization algorithm is guaranteed to monotonically increase the data likelihood. The results from computer simulations demonstrated the neural MLAA algorithm can achieve a significant improvement on the γ-ray CT image quality as compared to other algorithms.
Keywords: Dual energy CT, PET/CT, neural network, kernel methods, MLAA
1. INTRODUCTION
Conventionally, dual-energy computed tomography (DECT) uses two different x-ray energies to obtain energy-dependent tissue attenuation information to allow quantitative material decomposition.1 DECT can be implemented with simultaneous data acquisition using a dedicated DECT scanner or sequential data acquisition on conventional single-energy CT scanners. Combined use of DECT and positron emission tomography (PET) functional imaging provides a multi-parametric characterization of disease states in cancer and other diseases. It will be valuable to enable simultaneous PET/DECT imaging for clinical applications. However, the integration of DECT with PET would not be trivial, either requiring costly CT hardware upgrade or significantly increasing CT radiation dose.
We have proposed a new dual-energy CT imaging method that is enabled using standard time-of-flight PET/CT scans without change of scanner hardware or adding additional radiation dose or scan time.2,3 Instead of using two different x-ray energies, our PET-enabled dual-energy CT method combines a high-energy “γ-ray CT (GCT)” at 511 keV with the low-energy x-ray CT (usually ≤ 140 keV) to produce a pair of dual-energy CT images for multi-material decomposition. Note that the GCT is not acquired using an external radiation source but the internal γ-rays generated by annihilation radiation of PET radiotracer decays in a subject. The reconstruction of GCT can be achieved by the maximum likelihood attenuation and activity (MLAA) estimation from the PET emission scan.4
However, the GCT image reconstructed by standard MLAA is commonly noisy. The noise may affect the quantitative accuracy of GCT for multi-material decomposition. To suppress noise, the kernel MLAA approach has been developed by use of x-ray CT as image prior and has demonstrated substantial improvements,3 but can be further improved. It is worth noting that all the existing kernel methods for image reconstruction (e.g.,5–7) follow an unregularized formulation, while the kernel methods in machine learning for classification or regression commonly use a regularized framework.8 Adding a penalty function in the kernel method for image reconstruction may be helpful to stabilize the solution but will make the reconstruction formulation more complicated and result in a more challenging optimization problem. A regularization-based method also often requires a convergent solution to achieve the optimal solution, which can be computationally challenging in the MLAA framework.
Instead of using any explicit penalty function, in this paper we propose to use a neural network representation to impose an implicit regularization to the kernel method for MLAA reconstruction. The ML estimation of kernel coefficients then becomes the ML estimation of neural network weights from raw projection data. This usage of neural networks shares the same spirit as the work of Gong et al.9 for PET activity image reconstruction based on the deep image prior method.10
One challenge with this kind of neural network reconstruction problem is the complex optimization because the forward model is now nonlinear. The neural network learning and tomographic reconstruction are also coupled with each other in the projection domain. Gong et al.9 used the alternating direction of multiplier method (ADMM) to decouple the network training and image reconstruction steps. However, the hyper parameters associated with an ADMM algorithm is challenging to tune in practice. In this work, we apply the optimization transfer principle to obtain a modularized optimization algorithm that can be easily implemented and does not involve hyper-parameter tuning, in the same spirit as the work of Wang and Qi on parametric PET image reconstruction.11
As the kernel representation in this work can also be deemed as a neural network layer with predetermined weights (see Fig. 1 for illustration), we call the proposed algorithm the neural MLAA to feature its nature of neural network representation and differentiate it from the original kernel MLAA.3
Figure 1:

Graphical illustration of a neural network representation of the attenuation image μ for MLAA reconstruction.
2. PET-ENABLED γ-RAY CT RECONSTRUCTION BY MLAA
2.1. Statistical Model
In time-of-flight (TOF) PET, the expectation of the PET projection data is related to the radiotracer activity image λ and object attenuation image μ at 511 keV via
| (1) |
where m denotes the mth TOF bin and Gm is the PET detection probability matrix. rm accounts for the expectation of random and scattered events. nm(μ) is the normalization factor with the ith element being
| (2) |
where ci,m represents the multiplicative factor excluding the attenuation correction factor and A is the system matrix for transmission imaging.
The TOF PET measurement y can be well modeled as independent Poisson random variables using the log-likelihood function,
| (3) |
where i denotes the index of PET detector pair and Nd is the total number of detector pairs. Nt is the number of TOF bins.
2.2. MLAA Reconstruction
The maximum-likelihood attenuation and activity (MLAA4) reconstruction jointly estimates the attenuation image μ and the activity image λ from the projection data y by maximizing the Poisson loglikelihood,
| (4) |
An iterative interleaved updating strategy is commonly used to seek the solution. In each iteration of the algorithm, λ is first updated with fixed attenuation image ,
| (5) |
which can be solved by the maximum-likelihood expectation maximization (MLEM) algorithm.12 Next, μ is updated with fixed λ using the following kernel maximum-likelihood transmission reconstruction (MLTR) method,
| (6) |
which can be solved by the separable paraboloidal surrogate (SPS) algorithm.13
In the PET-enabled dual-energy CT method,2,3 the estimated GCT image μ is combined with the x-ray CT image to form a pair of dual-energy CT images for multi-material decomposition.
3. PROPOSED NEURAL MLAA
3.1. Image Representation Using Kernelized Neural Networks
Following the kernel method for image reconstruction,5 the attenuation image μ can be described as a linear function of kernels,
| (7) |
where α is the kernel coefficient image of the same size as μ. The kernel matrix K is sparse and generated from image prior, e.g., the x-ray CT image x.3 Substituting the kernel model into Eq. 4 leads to the kernel MLAA algorithm.3 It is also possible to add an explicit penalty function to stabilize the estimation of α as indicated in the original kernel method,5 which however may result in a more challenging optimization problem and add one more hyperparameter to tune.
Inspired by the work of Gong et. al.9 using the deep image prior (DIP),10 we further describe the kernel coefficient image α as a function of neural networks,
| (8) |
where f denotes the neural network mapping from an input image z to the coefficient image α with θ the parameters of the neural network.
Combining the neural network model and kernel model together, we have a kernelized neural network representation to describe the GCT image,
| (9) |
Fig. 1(a) shows a graphical illustration of the overall network model, of which the last layer has fixed weights as determined by the sparse matrix K. If the sub network f(θ|z) is an identity mapping, i.e., α = z, then the neural MLAA proposed in this paper is the same as the kernel MLAA.3 Note that f(θ|z) can be a more complex neural network model such as the residual U-net9 as shown in Fig. 1(b), which in turn introduces an implicit regularization to stabilize the estimation of the kernel coefficient image α.
It is worth noting that f(θ|z) can be a pretrained model by which θ is learned and fixed and z is to be estimated in image reconstruction. It can also be used without pretraining in the same way as the DIP model.9,10 In this work, we focus on the latter. z is fixed and θ is to be estimated.
3.2. Incorporation into MLAA
Substituting Eq. (9) into the MLAA formulation gives the following neural MLAA formulation,
| (10) |
with z either an image of random noise or the x-ray CT image x. Similar to the standard MLAA4 and kernel MLAA,3 an interleaved updating strategy is used to update λ and θ.
In each iteration, the λ-step remains the same as other MLAAs given an known . The θ-step is a neural MLTR step that finds the ML solution of the neural network weights by
| (11) |
where we use a simpler notation to refer to the objective function.
Once is estimated, the GCT image is obtained by
| (12) |
3.3. Neural Network Reconstruction Using Optimization Transfer
The optimization problem defined by Eq. (11) for neural network reconstruction is challenging to solve because the unknown θ is non-linearly involved in the projection domain. Here we develop an optimization transfer algorithm using a similar concept as used for parametric PET image reconstruction.11
Our method transfers the complex optimization into a learning-friendly weighted mean squared-error (wMSE) formulation:
| (13) |
where
| (14) |
with
| (15) |
and αn ≜ f(θn|z). The gradient image gn and weight image ωn are calculated based on αn and y.
Note that the surrogate function wMSE(θ;θn) is not arbitrarily defined but minorizes the original likelihood function :
| (16) |
| (17) |
where ∇θ denotes the gradient with respect to θ.
Thus, each iteration of the neural MLTR optimization can be decomposed into two separable steps:
Step 1: Obtain an intermediate image update from the projection data y using Eq. (15);
Step 2: Perform a DIP-like neural network learning of using the wMSE loss.
Step 1 can be implemented in a conventional image reconstruction platform, while Step 2 is fairly easy to implement in a deep-learning platform such as the PyTorch.
The algorithm is guaranteed to monotonically increase the original MLAA likelihood function, i.e.,
| (18) |
Compared with a possible ADMM optimization algorithm as used in,9 the proposed optimization transfer algorithm has the advantage that does not involve hyper parameter tuning.
4. SIMULATION RESULTS
4.1. Simulation Setting
We simulated a GE Discovery 690 PET/CT scanner in 2D. The TOF timing resolution of this PET scanner was 550 ps. The true PET activity image and 511 keV attenuation image are shown in Fig. 2(a) and (b), respectively. The images were first forward projected to generate noise-free sinogram of 11 TOF bins. A 40% uniform background was included to simulate random and scattered events. Poisson noise was then generated using 5 million expected events. The x-ray CT image at a low-energy 80 keV is shown in Fig. 2(c).
Figure 2:

The digital phantom used in the PET/CT computer simulation. (a) PET activity image in Bq/cc; (b) PET attenuation image at 511 keV in cm−1; (c) x-ray CT image at 80 keV.
4.2. Reconstruction Methods for Comparison
Four types of MLAA reconstruction algorithms were compared, which includes (1) standard MLAA, (2) kernel MLAA with x-ray CT as image prior, and (3–4) proposed neural MLAA with and without kernels. Note that the neural MLAA without kernel (i.e., when K is an identity matrix) applies a DIP model to the GCT image μ directly. The kernel matrix was built using using image patches of size 3 × 3 extracted from the x-ray CT image and 50 nearest neighbors. The reconstruction steps was implemented in MATLAB and the neural network learning step was implemented in PyTorch. All the reconstructions were run for 150 iterations. Different MLAA methods were compared for the image quality of GCT using the mean squared error (MSE) defined by
| (19) |
where represents the reconstructed GCT image by each MLAA method and μtrue denotes the ground truth.
4.3. Comparison Results for GCT Image Quality
Fig. 3 shows the comparison of GCT images reconstructed by different algorithms. The images were chosen according to the best image MSE in each method. Fig. 3(b) shows the standard MLAA with a uniform image initial for μ and Fig. 3(c) shows the MLAA with the μ initial set to the 511 keV linear attenuation coefficient map converted from the the x-ray CT using bilinear scaling. Clearly, a good initial from x-ray CT provided a better image quality. Hereafter we mainly present the results that use the CT initial. The results by the kernel MLAA and the neural MLAA without kernel are shown in Fig. 3(d) and 3(e), respectively. Both reconstructions had a better MSE than the standard MLAA but also demonstrated some artifacts. In comparison, the neural MLAA with kernel, as shown in Fig. 3(f), demonstrated a better visual quality with a higher image MSE.
Figure 3:

True and reconstructed GCT images from PET emission data by different algorithms with the best image MSE. (a) Ground truth; (b-c) standard MLAA with (b) uniform initial or (c) CT initial for μ; (d) Kernel MLAA; (e) Neural MLAA without kernel; (f) Neural MLAA with kernel.
Fig. 4 further shows image MSE as a function of iteration number for different algorithms. As compared to the kernel MLAA, the two neural MLAA approaches had a significant acceleration to reach their own best MSE. Among different algorithms, the neural MLAA with kernel achieved the lowest image MSE.
Figure 4:

Plots of image MSE for different algorithms.
5. CONCLUSION AND FUTURE WORK
In this paper, we have developed a neural MLAA approach that combines the kernel model and neural network-based deep image prior model for reconstruction of γ-ray CT image from PET emission data. The neural networks are used to regularize the estimation of kernel coefficients without adding an explicit regularization term. An optimization transfer algorithm is also developed for efficient neural network reconstruction. Computer simulation results have demonstrated the improvement of the neural MLAA over conventional MLAA algorithms. Our future work will evaluate the impact of the improved image quality for PET-enabled dual-energy CT multi-material decomposition. The proposed method will also be combined with the design of neural kernels14 to further improve the MLAA reconstruction for PET-enabled dual-energy CT imaging.
ACKNOWLEDGMENTS
This work is supported in part by National Institutes of Health (NIH) under grant no. R21 EB027346.
REFERENCES
- [1].McCollough CH, Leng SA, Yu LF, et al. , “Dual-and Multi-Energy CT: Principles, Technical Approaches, and Clinical Applications,” Radiology, 276: 637–653, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Wang G, “PET-enabled Dual-energy CT: A Proof-of-Concept Simulation Study,” 2018 IEEE Nuclear Science Symposium and Medical Imaging Conference Proceedings (NSS/MIC), Nov. 2018. [Google Scholar]
- [3].Wang G, “PET-enabled Dual-Energy CT: Image Reconstruction and A Proof-of-Concept Computer Simulation Study,” Physics in Medicine and Biology, 65(24): 245028, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [4].Rezaei A, Defrise M, Bal G, et al. , “Simultaneous reconstruction of activity and attenuation in time-of-flight PET,” IEEE Transations on Medical Imaging, 31: 2224–2233, 2012. [DOI] [PubMed] [Google Scholar]
- [5].Wang G, Qi J, “PET image reconstruction using kernel method,” IEEE Transactions on Medical Imaging, vol. 34, no. 1, pp. 61–71, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Hutchcroft W, Wang G, Chen K, Catana C, Qi J, “Anatomically-aided PET reconstruction using the kernel method,” Physics in Medicine and Biology, 61(18): 6668–6683, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Novosad P, Reader AJ, “MR-guided dynamic PET reconstruction with the kernel method and spectral temporal basis functions,” Physics in Medicine and Biology, 61(12): 4624–4645, 2016. [DOI] [PubMed] [Google Scholar]
- [8].Hofmann T, Scholkopf B, Smola AJ, “Kernel methods in machine learning,” The Annals of Statistics, vol. 36, no. 3, pp. 1171–1220, 2008. [Google Scholar]
- [9].Gong K, Catana C, Qi J, and Li Q, “PET Image Reconstruction Using Deep Image Prior,” IEEE Transactions on Medical Imaging, vol. 38, no. 7, pp. 1655–1665, Jul. 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Ulyanov D, Vedaldi A, and Lempitsky V, “Deep Image Prior,” International Journal of Computer Vision, vol. 128, no. 7, pp. 1867–1888, Mar. 2020. [Google Scholar]
- [11].Wang G and Qi J, “Generalized Algorithms for Direct Reconstruction of Parametric Images From Dynamic PET Data,” IEEE Transactions on Medical Imaging, vol. 28, no. 11, pp. 1717–1726, Nov. 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [12].Shepp LS and Vardi Y, “Maximum likelihood reconstruction for emission tomography,” IEEE Transations on Medical Imaging, vol. 1, no. 2, pp. 113–122, Oct. 1982. [DOI] [PubMed] [Google Scholar]
- [13].Erdogan H, Fessler JA, “Monotonic algorithms for transmission tomography,” IEEE Transactions on Medical Imaging, vol. 18, no. 9, pp. 801–814, April 1999. [DOI] [PubMed] [Google Scholar]
- [14].Li S, Wang G, “ Modified Kernel MLAA Using Autoencoder for PET-enabled Dual-Energy CT,” arXiv, 2010.07484. October 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
