Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Jun 1.
Published in final edited form as: IEEE Trans Radiat Plasma Med Sci. 2017 Nov 17;2(1):27–32. doi: 10.1109/trpms.2017.2774834

Estimation of the Initial Image’s Contributions to the Iterative Landweber Reconstruction

Gengsheng L Zeng 1
PMCID: PMC8168974  NIHMSID: NIHMS1581529  PMID: 34079920

Abstract

This paper develops a closed-form frequency-domain transfer function to predict the initial image’s contributions in an iterative Landweber reconstruction for a given number of iterations. The transfer function shows that the initial image’s high frequency components play an important role when the iterative algorithm is stopped before convergence. The contributions from the initial image diminish as the iteration number increases if the imaging matrix has full rank.

Index Terms—: Fourier transform, image reconstruction, iterative algorithm, linear filter, numeric analysis, tomography

I. Introduction

Initial image (or initial condition) can influence the result of the iterative image reconstruction, especially when the number of iterations is low. The basic principles of iterative image reconstruction are classic and well covered in textbooks [1]–[4].

Two methods are commonly practiced in our medical imaging community for iterative image reconstruction. The first method is to use a constant as the initial image. If the algorithm uses the additive update scheme, the initial image is set as a zero image. If the algorithm is multiplicative, the initial image is chosen as an image with a constant ONE (or any positive constant) for its pixels. The motivation of using a constant initial image is to avoid any undesired structures that are in the initial image and may propagate into the reconstruction.

The other method is based on the strategy of speeding up the iterative reconstruction by using an initial image that is close to the expected reconstruction. The filtered backprojection (FBP) algorithm [5] is fast but suffers from reconstruction noise if the noise is not properly controlled. The FBP image can be used as the initially image in an iterative algorithm for a faster convergence [6]–[8].

If the projections are not sufficiently measured, for example, due to data truncation, special initial condition, or image support are considered [9], [10]. This paper only considers the situation where the projections are sufficiently measured.

To gain more insights of how the initial image influences the reconstruction, this paper derives a frequency-domain transfer function that relates the initial image to the reconstructed image. This transfer function is a high-pass filter in nature and its maximum gain diminishes as the iteration number becomes higher.

The derivation of the transfer function is based on our previous work [11]–[14]. For sake of making this paper self-contained, the entire derivation is given in Section II. Computer simulations are provided.

II. Methods

Let us consider an iterative Landweber algorithm that minimizes the following quadratic objective function:

f=(AXP)T(AXP)+βXTX (1)

where X is the image array expressed as a vector, P is the projection array expressed as a vector, A is the imaging matrix, and β is a parameter controlling influence of the minimum-norm regularization term on the objective function f. If β = 0, the minimum-norm regularization is not effective. The Landweber algorithm is a gradient descent algorithm, and the gradient of the objective function (1) is given as [15], [16]

f=2ATAX2ATP+2βX. (2)

The Landweber algorithm that minimizes (1) is given as

X(k)=X(k1)+α(ATPATAX(k1)βX(k1)) (3)

where X(k) is the result of X at the kth iteration and the relaxation parameter α controls the step size. For a quadratic objective function (1), the Landweber algorithm will converge if the relaxation parameter α > 0 is small enough. The recursive expression (3) can be transformed to a non-recursive form as follows:

X(k)=X(k1)+αAT(PAX(k1))αβX(k1)=αATP+(IαATAαβI)X(k1)=αATP+(IαATAαβI)[αATP+(IαATAαβI)×X(k2)]=[n=0k1(IαATAαβI)n]αATP+(IαATAαβI)k×X(0). (4)

If the square matrix (IM) is nonsingular, we have the identity

n=0k1Mn=(IM)1(IMk) (5)

which can be readily verified by premultiplying (IM) on both sides. In fact, premultiplying the left-hand-side of (5) by (IM), we obtain

(IM)n=0k1Mn=n=0k1(MnMn+1)=IM+M+Mk1Mk=IMk. (6)

Thus the nonrecursive form (4) can be further written as a closed form without the Σ sign

X(k)=(αATA+αβI)1[I(IαATAαβI)k]αATP+(IαATAαβI)kX(0)=(ATA+βI)1[I(IαATAαβI)k]ATP+(IαATAαβI)kX(0). (7)

Expression (7) can be treated as a linear system with two separate inputs: ATP and X(0). In this paper, we focus on X(k)’s dependency on X(0).

In tomography, ATA is a projection-backprojection operator in the matrix form. When it operates upon an image X, it equivalently convolves the image X by a 2-D 1/r kernel, where r is the distance to the origin [4]. In the Fourier domain, this 1/r convolution kernel corresponds to the function 1/ω, where ω is the frequency vector in the 2-D Fourier domain.

We thus obtain the equivalence between the matrix ATA and the Fourier-domain function 1/ω. We also notice that the identity matrix I corresponds to the function 1 in the Fourier domain. Multiplication with a matrix k times is equivalent to multiplying the filter function k times in the frequency domain.

The operator of the second term on the right-hand-side of (7) in the matrix form is (IαATAαβI)k, and in the frequency domain, it becomes

F(ω)=(1α/ωαβ)k, if ω0 (8)

The transfer function (8) is undefined when ω ≠ 0. If we require that the transfer function be smooth (as seen in Figs. 1 and 2), we must have F(0) = 0. The parameter α is chosen such that negative values of F are not allowed. Thus, we define the transfer function (8) to be 0 when ω=0.

Fig. 1.

Fig. 1.

Transfer function (8) with α = 0.001, β = 0, and 4 different k values.

Fig. 2.

Fig. 2.

Transfer function (8) with α = 0.001, β = 10, and 4 different k values.

According to [19], the correct way to obtain the discrete frequency-domain transfer function is to take the discrete Fourier transform of its discrete spatial-domain convolution kernel. If the convolution kernel is not available (as in our case), discretization of the continuous frequency-domain transfer function can cause errors at very low frequencies. However, these errors can be made arbitrarily small by decreasing the frequency-domain sampling interval, which is equivalent to pad enough zeros in the spatial domain.

Some examples of this transfer function are shown in Fig. 1 with α = 0.001, β = 0, and some values of parameter k. Similar examples are shown in Fig. 2 for β = 10.

We must point out that the parameter α in the spatial-domain expressions and in the discrete frequency-domain expressions are different. They follow different rules. In the spatial domain, the convergence requires ∥IαATAαβ< 1, in which α depends on the maximum eigenvalue of ATA. The matrix A is different for different applications. Usually the value of α is chosen by trial-and-error.

On the other hand, in the discrete frequency domain, we require that |1α/ωαβ|<1, which is the same as 0α<2/(β+1/ω). If one assumes the discrete frequencies are between −0.5 and 0.5 with a frequency sampling interval Δ, the lowest non-zero frequency is 1/Δ and our requirement becomes 0 < α < 2/(β +1/Δ). For example, if the 512-point discrete Fourier transform is used, Δ is 1/512. In this case, parameter α can be chosen anywhere in 0 < α < 2/(β + 512).

In the frequency-domain implementation, the selection of the value of α is independent from the imaging matrix A. The upper-bound of α is approximately 2/(the number of frequency-domain samples). In matrix implementation, α is approximately upper bounded by 2max(A), where σmax(A) is the maximum singular value of the matrix A (as known as the L2 norm of the matrix A). If the projection matrix A and backprojection matrix AT are normalized, the selection of α in matrix implementation does not depend on matrix A.

Some observations can be obtained from the curves in Figs. 1 and 2, where the parameter k is proportional to the iteration number in an iterative algorithm.

  1. When k is very small, almost all frequency components are passed to the reconstruction except for the DC (i.e., ω=0 component.

  2. As k increases, the low-frequency components are gradually attenuated. The high-frequency components are dominating.

  3. The overall contributions from the initial image to the reconstruction are suppressed as an exponential function of k. When k is very large, the contributions from the initial image to the reconstruction gradually diminish.

  4. The parameter β makes the contributions from the initial image diminish faster.

  5. The contributions of the initial image are independent from the noise in the projection data.

The above observations from the frequency-domain transfer function can be used to predict the effects of the initial image in iterative reconstruction. In the next section, the results are presented from two iterative algorithms: 1) the iterative Landweber algorithm and 2) iterative ML-EM (maximum likelihood expectation maximization) algorithm [17].

III. Results

The computer simulations were carried out using MATLAB (The MathWorks, Inc., Natick, MA, USA). The image size was 180 × 180. The number of view angles was 180 over 180°. The image pixel size was the same as the detector bin size. The computer generated phantom had an elliptical shape, containing two brighter discs and two darker discs. The initial image was a square, which looked very different from the elliptical phantom. In this section, the parameter β is set to zero except for the studies in Fig. 4.

Fig. 4.

Fig. 4.

Predicted contributions from the (left) square initial image and (right) associated central horizontal line profiles with α = 0.001 and β = 10. Row 1: k = 1. Row 2: k = 10. Row 3: k = 100. Row 4: k = 1000.

A. Prediction by the Frequency-Domain 2-D Filter

First, the frequency-domain transfer function (8) was implemented as a 2-D filter and applied to the initial image. If a uniform square is chosen as the initial image, some resultant images and central horizontal line (at row 90) profiles are shown in Fig. 3 with α = 0.001 and β = 0. The maximum values in the images are 0.8742, 0.7241, 0.5724, and 0.1390 for k = 1, 10, 100, and 1000, respectively.

Fig. 3.

Fig. 3.

Predicted contributions from the (left) square initial image and (right) associated central horizontal line profiles with α = 0.001 and β = 0. Row 1: k = 1. Row 2: k = 10. Row 3: k = 100. Row 4: k = 1000.

The similar results for β = 10 are shown in Fig. 4. The maximum value in the images are 0.8560, 0.6476, 0.2070, and 5.8601 ×10−6 for k = 1, 10, 100, and 1000, respectively. The results from Figs. 3 (β = 0) and 4 (β = 10) are similar. In Parts B and C of this section, studies with β = 0 only will be presented.

If the 180 × 180 FBP reconstruction is chosen as the initial image, some resultant images and central horizontal line (at row 90) profiles are shown in Fig. 5. The maximum value in the images are 18.6018, 12.3005, 3.7197, and 0.3874 for k = 1, 10, 100, and 1000, respectively.

Fig. 5.

Fig. 5.

Predicted contributions from the (left) FBP-reconstructed initial image and (right) associated central horizontal line profiles with α = 0.001 and β = 0. Row 1: k = 1. Row 2: k = 10. Row 3: k = 100. Row 4: k = 1000.

Each image is displayed using the linear gray scale from its own minimum value to its own maximum value.

B. Iterative Landweber Reconstructions

To avoid committing the inverse-problem crime, the projection matrix A in reconstruction algorithms and that in projection data generation were different. In data generation, the image array was up-sampled ten times, and the detection array was also up-sampled ten times. The detection array was then down-sampled ten times, and Poisson noise was incorporated in the down-sampled detection array.

The iterative Landweber algorithm is expressed as

X(k)=X(k1)+αAT(PAX(k1)). (9)

We remind the reader that the parameters α and k in the spatial-domain iterative algorithm are not the same as in the frequency domain, even when they produce similar effects. The following cases with the square initial image are shown in Fig. 6 with α = 0.001: k = 5, 15, 105, and 1005, respectively. As the iteration number k increases, the phantom image gets sharper and the contributions from the initial image gradually diminish. The low frequency components of the initial image disappear first. The high frequency edges of the initial image stay in the reconstruction a little longer.

Fig. 6.

Fig. 6.

Iterative Landweber reconstructions using the (left) square initial image with α = 0.001 and (right) associated central horizontal line profiles. Row 1: k = 5. Row 2: k = 15. Row 3: k = 105. Row 4: k = 1005.

C. Iterative ML-EM Reconstructions

The iterative ML-EM algorithm is popular in emission tomography image reconstruction. It can be symbolically expressed as

X(k)={AT[P./(AX(k1))]}./[AT1] (10)

where 1 is the image (formatted as a vector) with all its pixel values being the constant 1 and “./” is the element-wise division as in MATLAB. There is no parameter α in this algorithm. The ML-EM algorithm has a built-in weighting for the Poisson noise for the projections P. However, we did not put the noise weighting in Part B for the iterative Landweber algorithm. Some computer simulation results are shown in Fig. 7 with the initial images as a square.

Fig. 7.

Fig. 7.

Iterative ML-EM reconstructions using the (left) square initial image and (right) associated central horizontal line profiles. Row 1: k = 1. Row 2: k = 10. Row 3: k = 100. Row 4: k = 1000.

For the iterative Landweber algorithm, the image value in the square is 1 and the background value is 0. The initial image value cannot be set to 0 for the MLEM algorithm, because the zero-valued image pixels will remain zero forever. For the iterative MLEM algorithm, the image value in the square is 2 and the background value is 1.

The ML-EM algorithm has a shift-variant Poisson noise weighting and a non-negativity constraint. These two facts prevent the ML-EM algorithm from being presented in the frequency domain. In the image regions where the image values are above zero and the Poisson noise variance does not change rapidly, the iterative Landweber algorithm can be a good approximation for the ML-EM when the ML-EM algorithm is written in the additive form [3], [4] and the proposed initial image effect model can be applied.

In our derivation of the frequency-domain transfer function (8), no noise weighing was considered. Even though noise weighting is considered in the ML-EM algorithm, the contributions from the initial image as a function of the iteration number k still follow the general trend as discussed at the end of the Section II.

IV. Discussion and Conclusion

The traditional treatment of the initial condition effects uses the null-space theory and assumes the convergence of the algorithm [18]. If a non-zero image, I0, belongs to the null-space of matrix A (that is, A I0 = 0) and image I is a solution of the converged iterative algorithm (that is, A I = P), then I + I0 is also a solution. If the image I0 is used as the initial image, then I0 will remain in final solution. The classic treatment of the initial image effect only considers two extreme cases of the iteration numbers: 1) zero and 2) infinity. On the other hand, we have a closed-form expression for each iteration number.

This paper investigates the initial image effects in a nontraditional way—by using frequency-domain analysis. It shows how the contributions from the initial image to the reconstruction change with the iteration number k. As the parameter k increases, the propagation of the frequency components into the reconstruction gets attenuated more and more. Eventually, the influence of the initial image diminishes. The low frequency components disappear sooner than the high frequency components. This novel frequency-domain analysis perspective gives some new insights about the iteration number dependency of the initial image influence on the reconstruction. Our computer simulations support the following.

The contributions from the initial image to the reconstructed image have a trend that the entire initial image contributes at very low iterations, the lower frequency components then get attenuated, and eventually the influences vanish.

The traditional null-space theory only gives a “binary” conclusion whether the initial image 100% remains in the final image. On the other hand, our theory presents a “relative attenuation” (less than 100%) concept and its dependency on the iteration number.

The reader may ask: Does the frequency-domain model (8) always hold? The answer is no. The foundation of the result (8) is the equivalence of the matrix ATA and the convolution kernel 1/r. In reality, this relationship is only an approximation due to the finite pixel size, finite image array size, and weighting factors W when ATA is replaced by ATWA. This relationship is further damaged if the number of views is not sufficient or the projections are truncated due to a relative small detector and a large object. In the situation of limited-angle tomography, where the view angles do not cover the required 180°, ATWA cannot be approximated by convolution with the convolution kernel 1/r. The bottom line is that if ATWA does not deviate from 1/r too much, the model (8) is still useful to indicate the trend of the initial image’s influence on the reconstruction.

In practice, the iterative algorithm is usually stopped early before its convergence. The initial image can provide some information about the sharp edges. The DC values in the initial image are almost lost while propagating into the reconstruction. When using the FBP image as the initial image, the iterative algorithm can converge much faster than using the constant as the initial condition. However, one must be aware that the high-frequency noise in the FBP reconstruction may affect the iterative reconstruction in a negative way. It may be helpful that a denoising filter is applied to the FBP image before it is used as the initial image. It is desirable that this denoising filter is nonlinear and can preserve the edges. It is beyond the scope of this paper to suggest an optimal initial image.

Acknowledgments

This work was supported by NIH under Grant R01HL108350.

References

  • [1].Kak AC and Slaney M, Principles of Computerized Tomographic Imaging. New York, NY, USA: IEEE Press, 1988. [Google Scholar]
  • [2].Herman GT, Fundamentals of Computerized Tomography: Image Reconstruction From Projections, 2nd ed. London, U.K.: Springer, 2009. [Google Scholar]
  • [3].Zeng GL, Medical Imaging Reconstruction, A Tutorial. Berlin, Germany: Springer, 2010. [Google Scholar]
  • [4].Zeng GL, Image Reconstruction: Applications in Medical Sciences. Boston, MA, USA: Walter de Gruyter GmbH, 2017. [Google Scholar]
  • [5].Shepp LA and Logan BF, “The Fourier reconstruction of a head section,” IEEE Trans. Nucl. Sci, vol. 21, no. 3, pp. 21–43, June. 1974. [Google Scholar]
  • [6].Guo J et al. , “Iterative image reconstruction for limited-angle CT using optimized initial image,” Comput. Math. Methods Med, vol. 2016, 2016, Art. no. 5836410, doi: 10.1155/2016/5836410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [7].Shi H, Luo S, Yang Z, and Wu G, “A novel Iterative CT reconstruction approach based on FBP algorithm,” PLoS ONE, vol. 10, no. 9, 2015, Art. no. e0138498, doi: 10.1371/journal.pone.0138498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Jensen K, Martinsen ACT, Tingberg A, Aaløkken TM, and Fosse E, “Comparing five different iterative reconstruction algorithms for computed tomography in an ROC study,” Eur. Radiol, vol. 24, no. 12, pp. 2989–3002, 2014, doi: 10.1007/s00330-014-3333-4. [DOI] [PubMed] [Google Scholar]
  • [9].Pan T-S, Tsui BMW, and Byrne CL, “Choice of initial conditions in the ML reconstruction of fan-beam transmission with truncated projection data,” IEEE Trans. Med. Imag, vol. 16, no. 4, pp. 426–438, August. 1997. [DOI] [PubMed] [Google Scholar]
  • [10].Zeng GL and Gullberg GT, “An SVD study of truncated transmission data in SPECT,” IEEE Trans. Nucl. Sci, vol. 44, no. 1, pp. 107–111, February. 1997. [Google Scholar]
  • [11].Zeng GL, “A filtered backprojection algorithm with characteristics of the iterative Landweber algorithm,” Med. Phys, vol. 39, no. 2, pp. 603–607, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Zeng GL, “A filtered backprojection MAP algorithm with nonuniform sampling and noise modeling,” Med. Phys, vol. 39, no. 4, pp. 2170–2178, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Zeng GL and Zamyatin A, “A filtered backprojection algorithm with ray-by-ray noise weighting,” Med. Phys, vol. 40, no. 3, March. 2013, Art. no. 031113, doi: 10.1118/1.4790696. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Zeng GL, “Model based filtered backprojection algorithm: A tutorial,” Biomed. Eng. Lett, vol. 4, no. 1, pp. 3–18, March. 2014. [Online]. Available: http://link.springer.com/article/10.1007/s13534-014-0121-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Landweber L, “An iteration formula for Fredholm integral equations of the first kind,” Amer. J. Math, vol. 73, no. 3, pp. 615–624, 1951. [Google Scholar]
  • [16].Bertero M and Boccacci P, Introduction to Inverse Problems in Imaging, Bristol, U.K.: Inst. Phys., 1998, ch. 6. [Google Scholar]
  • [17].Shepp LA and Vardi Y, “Maximum likelihood reconstruction for emission tomography,” IEEE Trans. Med. Imag, vol. 1, no. 2, pp. 113–122, October. 1982. [DOI] [PubMed] [Google Scholar]
  • [18].Hoffman KM and Kunze R, Linear Algebra, 2nd ed. Singapore: Pearson Educ., 1971. [Google Scholar]
  • [19].Zeng GL, “Revisit of the ramp filter,” IEEE Trans. Nucl. Sci, vol. 62, no. 1, pp. 131–136, February. 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES