PLOS ONE. 2019 Sep 13;14(9):e0222406. doi: 10.1371/journal.pone.0222406

Development of a denoising convolutional neural network-based algorithm for metal artifact reduction in digital tomosynthesis for arthroplasty: A phantom study

Tsutomu Gomi 1,*, Rina Sakai 1, Hidetake Hara 1, Yusuke Watanabe 1, Shinya Mizukami 1
Editor: Li Zeng2
PMCID: PMC6743787  PMID: 31518374

Abstract

The present study aimed to develop a denoising convolutional neural network metal artifact reduction hybrid reconstruction (DnCNN-MARHR) algorithm for reducing artifacts caused by metal objects in digital tomosynthesis (DT) for arthroplasty using projection data. For metal artifact reduction (MAR), we implemented a DnCNN-MARHR algorithm based on a training network (mini-batch stochastic gradient descent algorithm with momentum) that estimates the residual between reference (140 keV virtual monochromatic [VM]) and object (70 kV with metal artifacts) projection images; the estimated residual images are subtracted from the object images, and reconstruction proceeds with hybrid use of reconstructed images (back projection and maximum likelihood expectation maximization [MLEM]). The DnCNN-MARHR algorithm was compared with the dual-energy material decomposition reconstruction algorithm (DEMDRA), VM imaging, MLEM, the established and commonly used filtered back projection (FBP), and a simultaneous algebraic reconstruction technique-total variation (SART-TV) approach with MAR processing. MAR was compared using the artifact index (AI) and texture analysis. Artifact spread functions (ASFs) were evaluated for out-of-plane and in-focus images using a prosthesis phantom. The overall performance of the DnCNN-MARHR algorithm was adequate with regard to the ASF, and the derived images showed better results regardless of the metal type (the AI was almost equal to the best value, obtained with the DEMDRA). In the ASF analysis, the DnCNN-MARHR algorithm generated better MAR than the conventional reconstruction algorithms with MAR processing. In addition, the difference (mean square error) between DnCNN-MARHR and each conventional algorithm was smallest for VM. The DnCNN-MARHR algorithm showed the best performance with regard to image homogeneity in the texture analysis. The proposed algorithm is particularly useful for reducing artifacts in the longitudinal direction, and it is not affected by tissue misclassification.

Introduction

Cementless hip arthroplasty has recently gained popularity in the clinic. Reliable biological fixation procedures are essential for achieving success with this technique [1]. Medical imaging plays an important role in assessing the proper placement of hip arthroplasty components postoperatively and in evaluating potential long-term complications [2]. Digital tomosynthesis (DT), a recently developed technique, provides three-dimensional (3D) structural information to a limited extent by combining the advantages of computed tomography (CT) with those of digital imaging [1–8]. Another advantage of DT is that it can be implemented easily with radiography equipment and can help reduce radiation doses. However, image reconstruction in DT is unpredictable and is restricted by low signal-to-noise ratios related to the superposition of multiple low-exposure projection images.

Metal objects reduce image quality by decreasing contrast and masking specific features, obstructing the observation of relevant anatomy and potentially leading to incorrect diagnoses. Before imaging, it is necessary to ascertain that there is no hematoma or inflammation in the tissues surrounding the target area and to evaluate any potential interaction of osteosynthetic materials, metallic joint prostheses, or implants with nearby tissues and radiation.

Artifacts in DT imaging appear along the sweep direction as zones of much lower signal surrounding the edges of highly attenuating metal prostheses and osteosynthetic materials. This is mostly due to discrepancies between reality (i.e., a wide spectral range) and the assumptions of the reconstruction algorithm (i.e., an ideal monochromatic beam). The limited sweep angle also makes a relatively minor contribution to these artifacts.

The efficiency of iterative reconstruction (IR) was investigated in earlier studies on DT for arthroplasty [1, 6, 7]. Compared with the filtered back projection (FBP) [5] technique, IR provided superior image quality and a better balance between low- and high-frequency features [1, 7]. Many earlier studies quantitatively compared the radiation doses and image quality of the prevailing DT algorithms for arthroplasty [7, 9] and noted that IR effectively lowers both radiation exposure and quantum noise. In a recent study [9], maximum likelihood expectation maximization (MLEM) [14] was found to be the reconstruction algorithm with the best effect for reducing metal artifacts in DT imaging among IR algorithms, including total variation (TV)-based compressive sensing [10–13].

Previous reports have evaluated metal artifacts and developed methods for metal artifact reduction (MAR) [15–18], such as adaptive filtering (combined IR and shift-and-add) using polychromatic X-rays [15] and combined material decomposition and adaptive filtering using dual-energy (DE) X-rays [16]. Among these reported MAR methods, the most effective at present is the DE material decomposition reconstruction algorithm (DEMDRA) [16]. Although the DEMDRA is particularly effective at reducing the high-frequency component of metal artifacts, its drawback is a mechanical limitation (DE X-ray exposure), because it requires material decomposition processing using DE X-rays. Therefore, further studies are required to generalize the benefits of MAR.

Deep learning approaches have recently been employed successfully in pattern recognition and image processing, including image denoising [19], image super-resolution [20], and low-dose CT reconstruction [21, 22]. For instance, convolutional neural networks (CNNs) have been implemented for artifact reduction in medical imaging [23, 24] and have been used to remove residual errors remaining after MAR. Although these previous studies showed that a CNN can enhance MAR effectively, no study has addressed MAR in DT. A CNN-based modification, the denoising convolutional neural network (DnCNN), was presented by Zhang et al. [19]; its design incorporates advances in learning algorithms, very deep architectures, and regularization methods for image denoising. The reference image used in the DnCNN training workflow is important for enhancing MAR. Metal artifacts arise from several physical factors, including beam hardening, photon starvation, and X-ray scattering. Beam hardening results when an X-ray beam consisting of polychromatic photons passes through a medium. It has been suggested that DE virtual monochromatic (VM) spectral imaging can potentially reduce beam hardening-induced metal artifacts [16, 25–29]. We therefore consider the VM approach appropriate for providing the reference image for MAR. By performing denoising at the projection data level with DnCNN processing, a reduction in metal artifacts after reconstruction can be expected. The DnCNN is designed primarily to remove noise from an image and comprises a deep feed-forward CNN. However, because the DnCNN uses a residual learning method, the architecture can also be trained to reduce artifacts: the residual image is estimated by learning in the DnCNN network, so an MAR effect can be expected.

The DnCNN algorithm could therefore provide a superior solution to these intrinsic problems. In addition, a decrease in metal artifacts can be achieved by reconstruction employing denoised projection data for each material (e.g., titanium and bone) and adaptive filtering [15]. The novelty of this study is the reduction of metal artifacts by combining DnCNN processing with adaptive filtering [15] at the projection data level. In the present study, we developed a hybrid reconstruction method based on a projection space approach that combines the DnCNN and adaptive filtering [15], with a focus on reducing metal artifacts in DT (DnCNN MAR hybrid reconstruction [DnCNN-MARHR] algorithm). The developmental process of the method and its basic evaluation are presented in this study.

Materials and methods

Phantom specifications

A prosthetic phantom consisting of an artificial bone and implant (Table 1) was immersed in the center of a water-filled polymethyl methacrylate case (case dimensions, φ 200 × 300 mm) for evaluating MAR. The prosthetic phantom contained a simulated proximal humeral fracture (internal fracture fixation via retrograde intramedullary nail fixation). For DE-DT acquisition, the phantom was positioned parallel to the detector plane (Fig 1).

Table 1. Specifications of prosthetic phantom employed in this study.

Artificial bone a (foam cortical shell); local density, 0.48 g/cm3
    Hydrogen (H): 7.9192%
    Carbon (C): 40.4437%
    Nitrogen (N): 15.7213%
    Oxygen (O): 35.9157%

Implant b (titanium alloy); local density, 4.43 g/cm3
    Titanium (Ti): 90.255%
    Nitrogen (N): 0.05%
    Carbon (C): 0.08%
    Hydrogen (H): 0.015%
    Iron (Fe): 0.40%
    Oxygen (O): 0.20%
    Aluminum (Al): 5.5%
    Vanadium (V): 3.5%

Specifications and chemical composition of the prosthetic phantom employed in this study.

aOrthopedic Humerus Normal Anatomy (Model 1013, Sawbones, Inc., WA, USA)

bProximal Retrograde Humeral Nail® (PRHN, Mizuho Inc., Tokyo, Japan)

Fig 1. The experimental geometric placement adopted to assess metal artifact reduction employing the new denoising convolutional neural network metal artifact reduction hybrid reconstruction (DnCNN-MARHR) algorithm and conventional algorithms.


A prosthetic phantom consisting of an artificial joint and implant positioned parallel to the detector plane was employed for the experiments.

DE-DT system

The DE-DT system (SonialVision Safire II; Shimadzu Co., Kyoto, Japan) contained an X-ray tube (anode, tungsten with rhenium and molybdenum; filtration: inherent, 1.1 mm aluminum; additional, 0.9 mm aluminum and 0.1 mm copper) with a 0.4-mm focal spot and an amorphous selenium digital flat-panel detector (362.88 × 362.88 mm; detector element, 0.15 × 0.15 mm). The source (focal point)-to-isocenter and source-to-detector distances were 924 and 1100 mm, respectively (anti-scatter grid: focused type; grid ratio, 12:1). We selected the tube voltages (low, 70 kV; high, 140 kV) because the focus of this study was MAR improvement [16].

A 40° swing angle and linear system movement were employed during tomography, and 37 projection images (1024 × 1024 matrix) were obtained at each of the low and high voltages during a single tomographic pass. In DE-DT imaging, pulsed X-ray exposures were used with rapid switching between low and high energies. Although low voltage is generally used in clinical applications (e.g., prosthesis assessment), all low-voltage projection images were acquired at 187 mA with a 22-ms exposure time, and all high-voltage projection images were acquired at 260 mA with a 5-ms exposure time. To produce reconstructed tomograms of the required height, we employed a 1024 × 1024 matrix with 32 bits (single-precision floating point) per image (pixel size, 0.279 mm/pixel; reconstruction interval, 1 mm; total slice number, 50; starting height of the reconstruction from the detector surface, 150 mm). The plane locations were the in-focus plane and out-of-plane locations relative to the object. An image reconstructed at the in-focus plane shows the object faithfully reconstructed on the focal plane (Fig 2).

Fig 2. The schematic diagram to illustrate the relationship of the in-focus plane and the out-of-plane in the Z-axis direction.


The in-focus plane is not affected by blur, whereas out-of-plane images contain blur.

Generation of reference projection images

One approach for MAR is the use of VM X-rays. A previous study on DT reported that a VM X-ray image is more useful for MAR than a polychromatic X-ray image [16]. Considering these results, we decided to use a VM X-ray image as the reference image for the training workflow. The proposed algorithm was developed to work at the projection data level, following the approach reported by Gomi et al. [16]. For MAR, it has been reported that the DEMDRA, which applies material decomposition, is the most effective method [16]. However, applying the DnCNN to the DEMDRA would require processing projection data separated into multiple material decompositions, and it may then be difficult to maintain residual accuracy with respect to the polychromatic projection data to be compared. Conversely, VM X-ray imaging can learn residuals with high accuracy from a single projection datum; thus, effective MAR can be expected. Therefore, the VM X-ray image was used as the reference image in this study. Thirty-seven reference images (VM X-ray projection images) corresponding to the corrected input image pairs were randomly selected as the training set from the original projection data set (total original projection data set: 74). In the present study, we used a simple projection space (pre-reconstruction) decomposition method to assess the material fractions (Fn) of the artificial bone (foam cortical shell, Ff), soft tissue (water, Fw), and implant (titanium alloy, Ft) in the phantom.

The linear attenuation coefficient can be expressed as a linear combination of the contributions of the three basis materials as follows:

\mu(r,E) = (\mu/\rho)_t(E)\,\rho_t(r) + (\mu/\rho)_w(E)\,\rho_w(r) + (\mu/\rho)_f(E)\,\rho_f(r) \qquad (1)

where the basis materials exhibit different photoelectric and Compton characteristics; (μ/ρ)_i(E), i = t (titanium alloy), w (water), and f (foam cortical shell), is the mass attenuation coefficient of each basis material; and ρ_i(r) is the corresponding local density (g/cm3) of each basis material at location r.

In DE acquisition, the detected image intensities can be expressed as:

P_L = \int I_L(E)\, \exp\{-(\mu/\rho)_t(E)K_t - (\mu/\rho)_w(E)K_w - (\mu/\rho)_f(E)K_f\}\, dE \qquad (2)
P_H = \int I_H(E)\, \exp\{-(\mu/\rho)_t(E)K_t - (\mu/\rho)_w(E)K_w - (\mu/\rho)_f(E)K_f\}\, dE \qquad (3)
K_t * K_w * K_f = 1.0 \qquad (4)

where I_L(E) and I_H(E) are the primary intensities at low and high energy, respectively, and P_L and P_H are the attenuated intensities at low and high energy, respectively. Each X-ray spectrum is shown in Fig 3 (measurement tool: RAMTEC 413, Toyo Medic Co., Tokyo, Japan; detector: CdTe; channels: 1024 [0.2 keV/channel]; measuring method: Compton-scattering measurement [30]).

Fig 3. Spectra of the Sonial Vision Safire II tube at 70 and 140 kV potentials.


The peaks represent the characteristic lines of the tungsten with rhenium and molybdenum anode, and the continuous spectrum results from Bremsstrahlung. The mean photon energies are 49 and 80 keV, respectively (filtration: inherent, 1.1 mm aluminum; additional, 0.9 mm aluminum and 0.1 mm copper).

The equivalent densities (g/cm3) K_t, K_w, and K_f of the three basis materials must be calculated for each ray path. Eqs (2), (3), and (4) can be solved for the equivalent area densities, where K_t, K_w, and K_f are the unknowns. Basis material decomposition can thus be performed by solving the simultaneous equations to determine K_t, K_w, and K_f from the quantified projection pixel values [31]. Using the density corresponding to each area of the three basis materials, the linear attenuation coefficient μ(r,E) can be determined for any photon energy. The theoretical mass attenuation coefficient and linear attenuation coefficient curves shown in Fig 4 were calculated using the local density and area density of each material; they were generated by inputting the chemical compositions of the titanium alloy, foam cortical shell, and water shown in Table 1 into the XCOM program developed by Berger and Hubbell [32]. Finally, for the projection space decomposition approach, the following process was used to generate material decomposition images for the titanium alloy, foam cortical shell, and water.

Fig 4. The linear attenuation and mass attenuation coefficients of the foam cortical shell, titanium alloy, and water as a function of photon energy.


Based on the linear attenuation coefficient map, a virtual monochromatic X-ray image was created for each energy.

Eqs (2) and (3) were used to calculate values for PL_t, PL_w, PL_f, PH_t, PH_w, and PH_f as simulated attenuation intensities of these materials at the two energy levels. These values were then used to construct a sensitivity matrix, and the material fractions (material decomposition images; Ft, Fw, Ff) were obtained from the inverse of this matrix, as shown in Eq (5):

\begin{bmatrix} F_t \\ F_w \\ F_f \end{bmatrix} = \begin{bmatrix} P_{L\_t} & P_{L\_w} & P_{L\_f} \\ P_{H\_t} & P_{H\_w} & P_{H\_f} \\ 1.0 & 1.0 & 1.0 \end{bmatrix}^{-1} \begin{bmatrix} X_{DTSE\_L} \\ X_{DTSE\_H} \\ 1.0 \end{bmatrix} \qquad (5)

F_t P_{L\_t} + F_w P_{L\_w} + F_f P_{L\_f} = X_{DTSE\_L}
F_t P_{H\_t} + F_w P_{H\_w} + F_f P_{H\_f} = X_{DTSE\_H}
F_t + F_w + F_f = 1.0

where X_DTSE_L and X_DTSE_H are the two DT projection image sets, each acquired at a different energy (i.e., 70 and 140 kV, respectively).

The inverse of this matrix was used to obtain the material fractions. For the decomposition by matrix inversion, the "inv" function available in MATLAB (MathWorks, Natick, MA, USA) was employed; the possible fractions were limited to [0, 1] while imposing a sum of 1. Thus, three material fractions arise from the processing pipeline, corresponding to water, the foam cortical shell, and the titanium alloy [16]. The VM projection image is obtained using Eq (6):

VM_{pimg} = F_t\,(\mu/\rho)_t(E) + F_w\,(\mu/\rho)_w(E) + F_f\,(\mu/\rho)_f(E) \qquad (6)

where VMpimg is the VM projection image, and (μ/ρ)t(E), (μ/ρ)w(E), and (μ/ρ)f(E) are each material’s corresponding mass attenuation coefficients. The energy of the VM X-ray was selected as 140 keV, which is effective for reducing metal artifacts [16].
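As a concrete illustration of this decomposition and VM synthesis, the following MATLAB sketch applies Eqs (5) and (6) pixel by pixel. It is a minimal sketch rather than the authors' implementation; the variable names (PL_t … PH_f for the simulated attenuation intensities, mu_t, mu_w, and mu_f for the 140 keV mass attenuation coefficients, and XDTSE_L and XDTSE_H for the measured projections) are assumptions.

```matlab
% Minimal sketch of Eqs (5)-(6); variable names are illustrative assumptions.
A = [PL_t, PL_w, PL_f;
     PH_t, PH_w, PH_f;
     1.0,  1.0,  1.0];                          % sensitivity matrix of Eq (5)
Ainv = inv(A);

VMpimg = zeros(size(XDTSE_L));
for i = 1:size(XDTSE_L, 1)
    for j = 1:size(XDTSE_L, 2)
        F = Ainv * [XDTSE_L(i,j); XDTSE_H(i,j); 1.0];    % [Ft; Fw; Ff]
        F = min(max(F, 0), 1);                           % limit fractions to [0, 1]
        F = F / max(sum(F), eps);                        % impose a sum of 1
        VMpimg(i,j) = F(1)*mu_t + F(2)*mu_w + F(3)*mu_f; % Eq (6) at 140 keV
    end
end
```

Applying this to each of the 37 high/low projection pairs yields the 37 VM reference projections used in the training set.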

DnCNN-MARHR

Zhang et al. investigated the construction of a feed-forward DnCNN that incorporates advances in learning algorithms, very deep architectures, and regularization methods for image denoising [19]. In particular, residual learning and batch normalization were used to accelerate training and to enhance denoising performance [19]. This network is primarily designed to remove noise from an image (residual learning method); however, the DnCNN architecture can also be trained to remove artifacts and increase image resolution. Therefore, by applying this algorithm as an image quality improvement method, effective MAR can be expected. The DnCNN training workflow can be realized using the mini-batch stochastic gradient descent algorithm with momentum (SGDM) [33]. The SGDM has been employed extensively for training CNN models. Although the mini-batch SGDM is simple and effective, its training efficiency is largely compromised by internal covariate shift, i.e., changes in the distributions of internal non-linearity inputs during training [34]. The main algorithm we propose involves processing and correction at the projection data level using the DnCNN. The input image to be corrected was a low-energy projection image (P_L; tube voltage, 70 kV), in consideration of the tube voltages used clinically and of metal artifacts in polychromatic X-ray imaging [16]. At high energy, the difference in the linear attenuation coefficients of normal tissues narrows and contrast tends to decrease; accordingly, low energy was selected to retain contrast in normal tissue as well as for MAR. The average mean squared error between the required residual projection images and the images estimated from the artifact-containing (noisy) input projections was implemented as the loss function q to learn the trainable parameters δ (SGDM with a weight decay of 0.0001, a momentum of 0.9, an initial learning rate of 0.1, and the hyper-parameters mini-batch size and epochs) in the DnCNN [19]. With regard to δ, the mini-batch size and epochs are hyper-parameters that affect MAR [23, 24]. We evaluated the optimal parameters and applied them as described in the "Evaluation" section below.

q(\delta, \varphi) = \frac{1}{2N} \sum_{j=1}^{N} \left\| U(P_L^{\,j}; \delta) - \left(P_L^{\,j} - VM_{pimg}^{\,j}\right) \right\|_F^{2} \qquad (7)

where U(P_L) is the learned residual mapping, {(P_L^j, VM_pimg^j)}, j = 1, …, N, represents the N training projection image (patch: N = 32) pairs, and φ is the output trained network.

With regard to the deep architecture [19], for a DnCNN of depth D there are three types of layers: (1) convolution + rectified linear unit (ReLU) [35] (the first layer), in which 64 filters of size 3 × 3 × 1 generate 64 feature maps and the ReLU (max[0, ⋅]) is then applied for non-linearity; (2) convolution + batch normalization + ReLU (layers 2 to D−1), in which 64 filters of size 3 × 3 × 64 are employed and batch normalization [34] is added between convolution and ReLU; and (3) convolution (the last layer), in which filters of size 3 × 3 × 64 are used to rebuild the output. In this study, the network depth was set to 20, the scanning step size (horizontal and vertical directions) was set to 1, and the zero padding size was set to 1 at each position (upper, lower, left, and right) in the DnCNN [19].
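A minimal sketch of this architecture and of the SGDM settings listed above, written with MATLAB's Deep Learning Toolbox, is shown below. The layer and option names are the toolbox's own, but the assembly and the patch variables (patchesIn, patchesRes) are illustrative assumptions, not the authors' code; regressionLayer applies a half-mean-squared-error loss, which matches the 1/(2N) factor in Eq (7).

```matlab
% Minimal sketch of the 20-layer DnCNN and SGDM settings (assumed, not the authors' code).
depth = 20;
layers = [
    imageInputLayer([32 32 1])                  % 32 x 32 projection patches
    convolution2dLayer(3, 64, 'Padding', 1)     % layer 1: convolution + ReLU
    reluLayer
];
for d = 2:depth-1                               % layers 2 ... D-1
    layers = [layers
        convolution2dLayer(3, 64, 'Padding', 1)
        batchNormalizationLayer
        reluLayer];
end
layers = [layers
    convolution2dLayer(3, 1, 'Padding', 1)      % last layer: rebuild the residual image
    regressionLayer];                           % half-mean-squared-error loss (cf. Eq 7)

options = trainingOptions('sgdm', ...
    'Momentum', 0.9, ...
    'InitialLearnRate', 0.1, ...
    'L2Regularization', 1e-4, ...               % weight decay of 0.0001
    'MiniBatchSize', 544, ...
    'MaxEpochs', 60);

% patchesIn: 32x32x1xN artifact-containing 70 kV patches; patchesRes: the
% corresponding residual targets (70 kV patch minus 140 keV VM patch).
net = trainNetwork(patchesIn, patchesRes, layers, options);
```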

The loss function q in Eq (7) is deployed to learn the residual mapping U(PL) for residual prediction. The trained network φ is activated to estimate the residual projection image.

Img_{Res} = \beta \varepsilon \sum_{z=1}^{Z} \left( \bar{f}_z * \varphi_z\!\left(f_z * P_L\right) \right) \qquad (8)

where Img_Res is the estimated residual of VM_pimg with respect to P_L, β is the scanning step size, ε is the regularization parameter, f_z is the z-th convolution filter kernel, and \bar{f}_z is the adjoint filter of f_z. The influence function φ_z(⋅) can be regarded as a pointwise non-linearity applied to the convolution feature map in Eq (8). Eq (8) is a two-layer feed-forward CNN. The CNN architecture further generalizes one-stage trainable nonlinear reaction diffusion (TNRD) [36, 37] in three aspects:

(1) replacing the influence function with the ReLU to ease CNN training;

(2) increasing the CNN depth to improve the capacity for modeling image characteristics; and

(3) incorporating batch normalization to boost performance.

The connection with one-stage TNRD provides insights into the use of residual learning for CNN-based image restoration [19].

The estimated residual projection image is subtracted from the original projection image to obtain an artifact-reduced projection image.

cor_{img} = P_L - Img_{Res} \qquad (9)
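A minimal sketch of this correction step follows. Because MATLAB's predict expects inputs that match the network's input layer, each projection is assumed here to have been tiled into 32 × 32 patches (patch extraction and reassembly are omitted); the variable names are illustrative.

```matlab
% Minimal sketch of Eqs (8)-(9) applied to one 70 kV projection tiled into patches.
% patches: 32 x 32 x 1 x P array of patches extracted from the projection PL.
residuals  = predict(net, patches);      % estimated residual patches, cf. Eq (8)
corPatches = patches - residuals;        % artifact-reduced patches, Eq (9)
% Reassembling corPatches at their original positions yields cor_img for Eqs (10)-(11).
```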

Different reconstruction algorithms have been examined for metal artifact reduction in arthroplasty images, and among these, superior performance was obtained with the MLEM method [9]. A two-step MLEM algorithm, comprising a forward step (modeling the DT acquisition process) and a backward step (updating the reconstructed object) per iteration, has also been proposed [14]. The MLEM algorithm is applied iteratively so that the projections of the reconstructed volume, computed from an image formation model, resemble the experimental projections. Thus, we used the MLEM algorithm for reconstructing the projection images with much lower artifacts. In addition, because noise tends to be higher in artifact-reduced projection data, we applied MAR (adaptive filtering) processing [15] during this reconstruction process. We anticipate a further decrease in both noise and metal artifacts from MAR processing (Fig 5).

Fig 5. Flowchart of the denoising convolutional neural network metal artifact reduction hybrid reconstruction (DnCNN-MARHR) algorithm.


The DnCNN-MARHR algorithm was employed to decrease metal artifacts in weighted hybrid reconstructed images (maximum likelihood expectation maximization [MLEM] and back projection). This was achieved using a training network (mini-batch stochastic gradient descent algorithm with momentum) to estimate the residual between reference and object projection images and subtracting the estimated residual images from the object images. Abbreviations: DnCNN = denoising convolutional neural network, SGDM = mini-batch stochastic gradient descent algorithm with momentum, BP = back projection.

The following reconstruction algorithm and adaptive filtering processing [15] in real space (post-reconstruction processing) were employed:

MLEM_{DnCNN}^{\,u+1} \leftarrow \frac{MLEM_{DnCNN}^{\,u}}{X^{T}1}\, X^{T}\!\left[\frac{cor_{img}}{X\, MLEM_{DnCNN}^{\,u}}\right] \qquad (10)
BP_{DnCNN} \leftarrow X^{T}(cor_{img}) \qquad (11)
MAR_{DnCNN} = \left[MLEM_{DnCNN} * (1 - w)\right] + \left[BP_{DnCNN} * w\right] \qquad (12)

The respective parameter definitions used in Eqs (10), (11) and (12) are shown below.

Initialize: MLEM_{DnCNN}^{0}; all voxels are initialized to 1.0.

X^T is back projection, X is multiplication by the system matrix, T (superscript) denotes the matrix transpose, u is the number of iterations, MLEM_DnCNN is the MLEM [iterations (convergence): 30] image from the artifact-reduced projections, BP_DnCNN is the back-projection image from the artifact-reduced projections, MAR_DnCNN is the DnCNN-MARHR image, and w is the weighting coefficient.
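The reconstruction and blending steps can be summarized by the following MATLAB sketch, assuming that forward and back projection operators for the DT geometry are available; forwardProject and backProject are hypothetical helpers standing in for X and X^T, and volSize is the assumed size of the reconstructed volume.

```matlab
% Minimal sketch of Eqs (10)-(12); forwardProject/backProject are hypothetical helpers.
w     = 0.7;                                    % weighting coefficient
nIter = 30;                                     % MLEM iterations (convergence)

normVol  = backProject(ones(size(corimg)));     % X^T * 1
MLEM_rec = ones(volSize);                       % initialize all voxels to 1.0
for u = 1:nIter
    ratio    = corimg ./ max(forwardProject(MLEM_rec), eps);        % cor_img / (X * MLEM^u)
    MLEM_rec = MLEM_rec .* backProject(ratio) ./ max(normVol, eps); % Eq (10)
end

BP_rec  = backProject(corimg);                  % Eq (11)
MAR_rec = MLEM_rec .* (1 - w) + BP_rec .* w;    % Eq (12): DnCNN-MARHR volume
```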

Evaluation

Optimization parameters for mini-batch size, epochs, and weighting coefficient (w)

According to Gomi et al. [9, 15], a weighting coefficient of 0.7 is optimal for effectively processing MAR; considering these results, the initial weighting coefficient (assumption) was set to 0.7. In the study by Zhang et al. [19], evaluations were performed with the number of epochs set to 50, and it has been reported that effective MAR can be realized by increasing the number of epochs [23, 24]. Accordingly, the initial number of epochs (assumption) was set to 60, considering reports of values of 50 or more for MAR [19, 23, 24]. With the initial values (assumptions) set to 60 epochs and a weighting coefficient (w) of 0.7, the mini-batch size was optimized and the validity of the assumed initial values (epochs and weighting coefficient) was evaluated. We set the patch size to 32 × 32 and cropped [mini-batch size] × [maximum number of iterations: epochs × number of projections (37)] patches to train the model.

Reconstruction was accomplished using real projection data derived from the DT system. MATLAB (MathWorks) was employed for image reconstruction and processing. The artifact index (AI) was used to evaluate the effect of MAR on each image in the in-focus plane [38]. Optimization was evaluated using the AI, and the parameters giving the lowest AI value and standard error were selected as the optimum parameters.
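The parameter search itself can be summarized by the following sketch, in which trainAndReconstruct and computeAI are hypothetical helpers wrapping the training/reconstruction pipeline of Fig 5 and the AI measurement of Fig 6(a); the candidate mini-batch sizes are those examined in the Results section.

```matlab
% Minimal sketch of the mini-batch size search with the assumed initial values
% (60 epochs, w = 0.7); trainAndReconstruct and computeAI are hypothetical helpers.
batchSizes = [128 256 384 464 512 544 552 576 640];
meanAI = zeros(size(batchSizes));
for b = 1:numel(batchSizes)
    recVol    = trainAndReconstruct(batchSizes(b), 60, 0.7);  % mini-batch, epochs, w
    meanAI(b) = mean(computeAI(recVol));                      % AI over the 10 ROIs
end
[~, best]     = min(meanAI);
bestBatchSize = batchSizes(best);                             % 544 in this study
```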

Evaluation of MAR

We calculated the AI to assess the effect of MAR on each in-focus plane image. To compare the DnCNN-MARHR and the conventional algorithms, the differences were evaluated using difference images and the mean square error (MSE) in the in-focus plane. We further calculated the artifact spread function (ASF) to determine the influence of metal artifacts on the features of reconstructed images in the neighboring out-of-plane region [14]. Texture analysis [39] was used to evaluate the homogeneity of the entire image, including metal artifacts. The DnCNN-MARHR algorithm was compared with algorithms reported to be effective for MAR by Gomi et al. [16] (DEMDRA; VM [140 keV]; polychromatic MLEM-MAR [70 kV]; polychromatic FBP [kernel: Shepp & Logan]-MAR [70 kV]; and polychromatic simultaneous algebraic reconstruction technique-TV [SART-TV] [13]-MAR [140 kV]). The previously reported iteration numbers (DEMDRA, VM, and MLEM: 30; SART-TV: 10) were applied to these conventional algorithms for MAR [9]. An iteration number for TV minimization of 100 and a length per gradient-descent step of 50 were considered the optimal parameters for SART-TV [9]. The DnCNN was evaluated using the image obtained with the optimized parameters.

Artifact index (AI)

Quantification of the degree of metal artifacts was done using the AI, which permits low-frequency artifact determination. The AI of the identified metal artifacts was calculated as follows:

AI_n = \sqrt{\left| Artifact\_ROI_n - BG\_ROI \right|} \qquad (13)

where n = 1, 2,…, 10 indexes Artifact_ROI_n; Artifact_ROI_1, Artifact_ROI_2,…, Artifact_ROI_10 represent the relative standard deviations (SDs) of the corresponding regions of interest (ROIs) containing real features (metal artifacts) in the in-focus plane; and BG_ROI is the relative SD of the background in the in-focus plane [Fig 6(a)]. For evaluation of each feature (metal artifact) and the background, the ROI was set at 4 × 14 pixels.
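The AI measurement can be sketched as follows; artifactROIs (a cell array of the ten 4 × 14 pixel artifact ROIs) and bgROI (the background ROI) are assumed to have been extracted from the in-focus plane beforehand.

```matlab
% Minimal sketch of Eq (13); ROI extraction is omitted and variable names are assumed.
sdBG = std(bgROI(:));
AI   = zeros(1, numel(artifactROIs));
for n = 1:numel(artifactROIs)
    sdArt = std(artifactROIs{n}(:));
    AI(n) = sqrt(abs(sdArt - sdBG));          % Eq (13)
end
meanAI   = mean(AI);
stderrAI = std(AI) / sqrt(numel(AI));         % standard error, as reported in Fig 9
```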

Fig 6. ROI setting diagram for AI calculation and optimization verification results.


(a) Metal artifacts assessed using the artifact index (AI) of the selected features: the in-focus plane image displays the metal artifact and background areas used for the AI measurements. The AIs resulting from differences in the mini-batch size (b), epochs (c), and weighting coefficient (w) (d) in the denoising convolutional neural network metal artifact reduction hybrid reconstruction (DnCNN-MARHR) images are shown. The settings of a mini-batch size of 544, 60 epochs, and a weighting coefficient (w) of 0.7 generated the maximum artifact-decreasing effect. The error bars represent the standard error.

Mean square error (MSE)

The MSE for the in-focus plane image is given as follows:

MSE = \frac{1}{mn} \sum_{i=0}^{m-1} \sum_{j=0}^{n-1} \left[ K_{ref}(i,j) - V_{obj}(i,j) \right]^{2} \qquad (14)

where K_ref(i,j) is the (i,j)-th entry of the DnCNN-MARHR image and V_obj(i,j) is the (i,j)-th entry of each conventional algorithm image.
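In MATLAB this comparison reduces to a single call; imgMARHR and imgConv are assumed in-focus plane images from the DnCNN-MARHR and a conventional algorithm, respectively.

```matlab
% Minimal sketch of Eq (14) using the Image Processing Toolbox.
mseVal = immse(imgMARHR, imgConv);   % mean square error between the two images
```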

Artifact spread function (ASF)

An ASF metric has been proposed by Wu et al. that takes into account not only the objects in the focus plane that cause artifacts but also the out-of-plane effects arising from these objects [14]. The resulting ASF measure reveals the ability of DT to distinguish superimposed features along the direction of the tomographic slice. The ASF of artifacts in the image plane is given as follows:

ASF = \frac{Artifact\_ROI(z) - BG\_ROI(z)}{Artifact\_ROI(z_0) - BG\_ROI(z_0)} \qquad (15)

where z0 and z represent the positions of the real features (i.e., metal artifacts) in the in-focus and out-of-plane images, respectively; the ROIs for the mean pixel intensities of the features and background in the in-focus-plane image are shown as Artifact_ROI (z0) and BG_ROI (z0), respectively; and in the out-of-plane image, the corresponding ROIs are Artifact_ROI (z) and BG_ROI (z). The size of ROI for assessing the background and features (i.e., metal artifacts) was 4 × 14 pixels.
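A sketch of the ASF computation over the reconstructed slices follows; meanArt and meanBG are assumed vectors holding the mean pixel intensities of the artifact and background ROIs in each slice, and z0 is the index of the in-focus slice.

```matlab
% Minimal sketch of Eq (15); ROI extraction is omitted and variable names are assumed.
ASF = (meanArt - meanBG) ./ (meanArt(z0) - meanBG(z0));   % one value per slice
plot(1:numel(ASF), ASF); xlabel('Slice number'); ylabel('ASF');
```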

Texture analysis

The gray-level co-occurrence matrix (GLCM) is a statistical texture inspection method that takes the spatial relationship of pixels into consideration. The GLCM function creates a GLCM by calculating how often a pair of pixels with a specified value and a specified spatial relationship occurs in an image. By extracting statistical information from this matrix, the features of the texture of the image can be obtained. Texture analysis can quantitatively analyze the variation of image intensity in an image [39]. Before computation of texture features, pixel intensities were discretized to 16 gray levels [39]. In addition, pixel values were rescaled between the mean ± SD. The statistical properties of the images derived from the GLCM in this study were evaluated using “inverse difference moment (homogeneity),” “contrast (dissimilarity),” and “correlation,” which are defined as follows:

Inverse\_Difference\_Moment = \sum_{i,j} \frac{p(i,j)}{1 + |i - j|} \qquad (16)
Contrast = \sum_{i,j} |i - j|^{2}\, p(i,j) \qquad (17)
Correlation = \sum_{i,j} \frac{(i - \mu_i)(j - \mu_j)\, p(i,j)}{\sigma_i \sigma_j} \qquad (18)

where p(i,j) is the (i,j)-th entry of the normalized gray-level spatial dependence matrix, and μ_i, μ_j, σ_i, and σ_j are the means and SDs of p_i and p_j, respectively.
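The texture measures can be obtained with the Image Processing Toolbox as sketched below; img is an assumed in-focus plane image, and the Homogeneity property of graycoprops corresponds to the inverse difference moment of Eq (16).

```matlab
% Minimal sketch of the GLCM texture analysis (img is an assumed in-focus image).
lims  = [mean(img(:)) - std(img(:)), mean(img(:)) + std(img(:))];  % rescale to mean +/- SD
glcm  = graycomatrix(img, 'NumLevels', 16, 'GrayLimits', lims);    % 16 gray levels
stats = graycoprops(glcm, {'Homogeneity', 'Contrast', 'Correlation'});
fprintf('IDM %.3f, contrast %.3f, correlation %.3f\n', ...
        stats.Homogeneity, stats.Contrast, stats.Correlation);
```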

Results

The initial values (assumptions) were set as follows: epochs, 60; weighting coefficient (w), 0.7; and the mini-batch size was varied over 128, 256, 384, 464, 512, 544, 552, 576, and 640. The AI and standard error were lowest for a mini-batch size of 544 [Fig 6(b)]. Next, to verify the optimization of the assumed initial values (epochs, 60; weighting coefficient [w], 0.7) with a mini-batch size of 544, the AI and standard error were measured with 10, 20, 40, 60, 80, and 100 epochs [Fig 6(c)] and with weighting coefficients of 0.5 to 0.9 [Fig 6(d)]. For 60 epochs and a weighting coefficient (w) of 0.7, the AI value and standard error were the lowest. Therefore, learning was performed with the mini-batch size set to 544, the epochs set to 60 (hyper-parameters), and the weighting coefficient (w) of blended image processing set to 0.7 in the DnCNN-MARHR algorithm (Fig 7).

Fig 7. AI surface plot over epochs and weighting coefficients with the mini-batch size set to 544.


The surface plot was processed by cubic linear interpolation.

Fig 8 shows the reconstructed images of the prosthetic phantom obtained with the DnCNN-MARHR algorithm and with each established reconstruction algorithm with MAR processing. Remarkably, DT images produced with the DnCNN-MARHR algorithm showed decreased metal artifacts in the X-ray sweep direction (i.e., the vertical direction), specifically in the peripheral regions of the prosthetic phantom. In contrast, images produced with FBP-MAR demonstrated more noise and metal artifacts. The difference (MSE) between the DnCNN-MARHR and the conventional algorithms was smallest for VM (Table 2).

Fig 8. Comparisons among the denoising convolutional neural network metal artifact reduction hybrid reconstruction (DnCNN-MARHR) algorithm and the traditional reconstruction algorithms with metal artifact reduction (MAR) processing in the in-focus plane.


(DnCNN-MARHR (a), 0.8029–0.9119; DEMDRA (b), 0.9597–0.9914; dual-energy virtual monochromatic [VM]-MAR [140 keV] (c), 0.9487–0.9934; maximum likelihood expectation maximization [MLEM]-MAR [70 kV] (d), 0.9411–0.9983; filtered back projection [FBP, kernel: Shepp & Logan]-MAR [70 kV] (e), 0.5676–0.6117; simultaneous algebraic reconstruction technique-total variation [SART-TV]-MAR [140 kV] (f), 0.7577–0.8184; difference between DnCNN-MARHR and DEMDRA (g), 0–0.1047; difference between DnCNN-MARHR and VM-MAR [140 keV] (h), 0–0.1047; difference between DnCNN-MARHR and MLEM-MAR [70 kV] (i), 0–0.1047; difference between DnCNN-MARHR and SART-TV-MAR [140 kV] (j), 0–0.1047; difference between DnCNN-MARHR and FBP [kernel: Shepp & Logan]-MAR [70 kV] (k), 0–0.1047.) The display range of the prosthetic phantom is changed to allow visual comparison of the contrast and background gray levels. The X-ray source moves vertically along the image. The artifact indices were determined in the displayed areas.

Table 2. Mean square error (MSE) between each reconstruction algorithm.

Comparison: MSE
DnCNN-MARHR vs. DEMDRA: 3.8048e-05
DnCNN-MARHR vs. VM with MAR (140 keV): 3.5683e-05
DnCNN-MARHR vs. MLEM with MAR (70 kV): 3.7185e-04
DnCNN-MARHR vs. SART-TV with MAR (140 kV): 9.6985e-05
DnCNN-MARHR vs. FBP with MAR (70 kV): 1.0318e-04

Results of the comparison of the mean square error (MSE) between the DnCNN-MARHR image and each MAR image. (The comparison images are in the in-focus plane.)

Fig 9 shows the positioning of the ROIs in the prosthetic phantom and a graph of the AI results. The DEMDRA gave rise to the smallest metal artifact values, irrespective of MAR processing status (mean AI ± standard error, 0.01426 ± 0.0022). The difference in AI values between the DnCNN-MARHR algorithm and the DEMDRA was small, and the DnCNN-MARHR value was lower than that for VM (140 keV), which was used as the reference image in the training network, confirming the usefulness of MAR (DnCNN-MARHR, 0.01557 ± 0.0017; VM-MAR, 0.01794 ± 0.0025). Metal artifact production depended on the type of reconstruction algorithm for polychromatic imaging algorithms using MAR processing (MLEM-MAR [70 kV], 0.0195 ± 0.0033; SART-TV-MAR [140 kV], 0.0207 ± 0.0029; FBP-MAR [70 kV], 0.0333 ± 0.0069).

Fig 9. Comparisons of the artifact indices (AIs) determined for in-focus plane images procured via the denoising convolutional neural network metal artifact reduction hybrid reconstruction (DnCNN-MARHR) algorithm and the traditional reconstruction algorithms with metal artifact reduction (MAR) processing.


The AIs were computed from 10 selected metal artifact areas (features) and one background area in the in-focus plane. The results are presented as mean ± standard error.

Fig 10 shows a plot of the ASF results and the ROIs in the prosthetic phantom. The DnCNN-MARHR algorithm produced the maximum decrease in metal artifacts. In contrast, the FBP-MAR (70 kV) algorithm gave rise to elevated metal artifacts. The DEMDRA, VM-MAR (140 keV), MLEM-MAR (70 kV), and SART-TV-MAR (140 kV) algorithms did not produce any significant alterations in the artifact level.

Fig 10. ROI setting diagram and ASF calculation results.


(Figure) Metal artifacts derived using the artifact spread function (ASF) of the selected features. The in-focus plane image displays the metal artifact and background areas of the measurements of ASF. (Chart) Plots of the ASF vs. the slice numbers from the in-focus planes of the denoising convolutional neural network metal artifact reduction hybrid reconstruction (DnCNN-MARHR) algorithm and the conventional reconstruction algorithms with metal artifact reduction (MAR) processing.

Fig 11 shows the results of the GLCM-based texture analysis, comparing the feature quantities "inverse difference moment (homogeneity)" and "contrast (dissimilarity)." The DnCNN-MARHR algorithm showed the best performance regarding low noise variation and image homogeneity in the texture analysis (inverse difference moment: DnCNN-MARHR, 0.965; DEMDRA, 0.952; VM-MAR [140 keV], 0.958; MLEM-MAR [70 kV], 0.955; FBP-MAR [70 kV], 0.905; SART-TV-MAR [140 kV], 0.955; and contrast: DnCNN-MARHR, 0.070; DEMDRA, 0.099; VM-MAR [140 keV], 0.085; MLEM-MAR [70 kV], 0.097; FBP-MAR [70 kV], 0.315; SART-TV-MAR [140 kV], 0.104). For the whole image, the "correlation" between a pixel and its neighboring area was as follows: DnCNN-MARHR, 0.993; DEMDRA, 0.992; VM-MAR (140 keV), 0.993; MLEM-MAR (70 kV), 0.992; FBP-MAR (70 kV), 0.926; SART-TV-MAR (140 kV), 0.990.

Fig 11. Texture analysis results.


Comparisons of the inverse difference moment (homogeneity) (a) and contrast (dissimilarity) (b) of in-focus plane images obtained via the denoising convolutional neural network metal artifact reduction hybrid reconstruction (DnCNN-MARHR) algorithm and the conventional reconstruction algorithms with metal artifact reduction (MAR) processing.

Training was implemented in MATLAB on a system with two CPUs (Intel(R) Xeon(R) E5-2620 v4, 2.10 GHz) and one GPU (NVIDIA Tesla K40c, 12 GB). The network required approximately 15 h for training. The reconstruction time was approximately 30 minutes for the DnCNN-MARHR (excluding network training) and the DEMDRA, approximately 20 minutes for MLEM and SART-TV, and approximately 2 minutes for FBP.

Discussion and conclusions

In the present prosthetic phantom study, we compared our DnCNN-MARHR algorithm and different traditional DT reconstruction algorithms without and with MAR processing.

We found that our newly developed DnCNN-MARHR algorithm, which uses the training network, had adequate overall performance. The DnCNN-MARHR images showed better results that were unaffected by the metal type present in the prosthetic phantom. In addition, this algorithm was efficient at removing noise artifacts from images, specifically at greater distances from metal objects. In particular, the algorithm was useful for decreasing artifacts related to out-of-plane effects of objects causing artifacts in the focus plane. This algorithm might be a favorable new choice for prosthesis imaging, as it yielded 3D visualizations with reduced artifacts that were much better than those in images processed with traditional algorithms. The versatility of selecting imaging parameters in the DnCNN-MARHR algorithm, depending on the required final images and the conditions of prosthetic imaging, might also be useful to users. Regarding the success or failure of our proposed method when it is applied to other cases, the key points are the accuracy of the VM X-ray process and the ability to extract the residual image with high accuracy during learning. For example, effective MAR may not be expected in cases where normal tissues and artificial bones have a complicated spatial arrangement.

The proposed DnCNN method should help find a suitable correction map q: P_L → Img_Res for MAR such that U(P_L) ≈ Img_Res, where P_L is the attenuation distribution at the fixed energy L [19]. The map q should consider VM_pimg, as VM_pimg is regarded as prior information for the DT projection images. Therefore, q(P_L) should be determined not only by P_L but also by VM_pimg. Owing to the highly nonlinear and complicated structure of VM_pimg, it could be quite difficult to determine q without using DnCNN methods.

With regard to the DnCNN step, its strength is the fusion of helpful information from various sources to prevent profound artifacts, whereas its limitation is that not all artifacts can be removed, with mild artifacts typically remaining [23]. With regard to adaptive filtering processing in prior image-based MAR methods, moderate artifacts can be removed and a satisfactory prior image can be generated. The success of adaptive filtering processing still needs to be established by imaging, and any observed artifacts might be due to the absence of the large normal contributions of artifact-free voxels. Although normal contributions are initially made by these voxels, their values decline slightly after the largest normal contribution is removed. This means that, for each voxel, one abnormal contribution is rejected, while the remaining contributions, along with the largest normal contribution, are used. Thus, artifact-containing voxels are likely to have elevated values compared with the surrounding artifact-free voxels. However, when there are major artifacts, the prior image generally suffers from tissue misclassification. By combining adaptive filtering processing with the DnCNN, DnCNN training can be stopped after fewer epochs, and the resulting prior DnCNN is not influenced by tissue misclassification (Fig 11).

The following two factors are important for the exceptional performance of the DnCNN-MARHR algorithm: choosing the right MAR method and preparing the training data. The appropriately selected MAR method provides sufficient data for the DnCNN to discriminate artifacts from tissue structures. The preparation of training data ensures the general applicability of the trained DnCNN by including as many types of metal artifacts as feasible. Thus, it can be considered that the effect of MAR was useful in the longitudinal direction (Fig 10).

TV minimization assumes that a true image is relatively uniform and piecewise constant. TV minimization approaches can effectively prevent inadmissible solutions [24], but noise and artifacts appear as deviations, valleys, and peaks, which have comparatively large TV values because TV is defined as the sum of the first-order derivative magnitudes [13]; therefore, their applicability to clinical imaging is limited. The DnCNN has the capability to learn nonlinear regression for different sources of artifacts because it efficiently employs complicated prior knowledge of artifacts and DT images.

In the AI analysis, the metal artifacts showed an MAR effect almost equal to that of the conventional DEMDRA. The SGDM method employed in the DnCNN-MARHR algorithm uses a subset of the training set (mini-batch) to evaluate gradients and update parameters. In general, the DnCNN (or a CNN) is trained iteratively using multiple image batches, and learning can be accelerated without losing accuracy by increasing the mini-batch size. It is considered that the reduction of high-frequency metal artifacts cannot be significantly improved owing to the influence of the decomposition/restoration process accompanying the mini-batch and convolution processing.

Wolterink et al. [22] reported that CNN-based generative adversarial networks (GANs) were useful for noise reduction in low-dose CT. Their results demonstrate that training with adversarial feedback from a discriminator CNN can produce images that more closely resemble normal-dose CT than training without a discriminator CNN [22]. A similar trend was found in our results. We think that such feedback avoids smoothing in the image and permits more accurate quantification of objects, including metal artifacts, in DT scans.

Our DnCNN-MARHR algorithm has some limitations. First, the algorithm does not take into account metal artifacts arising from photon starvation. The functioning of the learning-based projection data correction method could be improved by refining the forward model so that it precisely represents different realistic artifacts. Second, the proposed learning model is intended for a particular type of phantom implant, and it does not work efficiently when the trained network is applied to projection data correction for totally different scanning geometries. Future studies are required to develop a learning model that can be used in general cases.

We successfully developed a DnCNN-based algorithm for MAR in DT for arthroplasty. Our DnCNN-MARHR algorithm is particularly useful for reducing artifacts associated with out-of-plane effects on artifact-causing objects in the focus plane, and it is not affected by tissue misclassification.

Acknowledgments

We wish to thank Mr. Kazuaki Suwa and Mr. Yuuki Watanabe of the Department of Radiology, Dokkyo Medical University Koshigaya Hospital, for their support with the experiment.

Data Availability

All relevant data are within the manuscript.

Funding Statement

The author(s) received no specific funding for this work.

References

  • 1.Tang H, Yang D, Guo S, Tang J, Liu J, Wang D, et al. Digital tomosynthesis with metal artifact reduction for assessing cementless hip arthroplasty: a diagnostic cohort study of 48 patients. Skeletal Radiol. 2016;45(11):1523–32. Epub 2016/09/04. 10.1007/s00256-016-2466-8 . [DOI] [PubMed] [Google Scholar]
  • 2.Gothlin JH, Geijer M. The utility of digital linear tomosynthesis imaging of total hip joint arthroplasty with suspicion of loosening: a prospective study in 40 patients. Biomed Res Int. 2013;2013:594631 Epub 2013/10/01. 10.1155/2013/594631 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Machida H, Yuhara T, Mori T, Ueno E, Moribe Y, Sabol JM. Optimizing parameters for flat-panel detector digital tomosynthesis. Radiographics. 2010;30(2):549–62. Epub 2010/03/17. 10.1148/rg.302095097 . [DOI] [PubMed] [Google Scholar]
  • 4.Duryea J, Dobbins JT 3rd, Lynch JA. Digital tomosynthesis of hand joints for arthritis assessment. Med Phys. 2003;30(3):325–33. Epub 2003/04/04. 10.1118/1.1543573 . [DOI] [PubMed] [Google Scholar]
  • 5.Dobbins JT 3rd, Godfrey DJ. Digital x-ray tomosynthesis: current state of the art and clinical potential. Phys Med Biol. 2003;48(19):R65–106. Epub 2003/10/29. 10.1088/0031-9155/48/19/r01 . [DOI] [PubMed] [Google Scholar]
  • 6.Gomi T, Hirano H. Clinical potential of digital linear tomosynthesis imaging of total joint arthroplasty. J Digit Imaging. 2008;21(3):312–22. Epub 2007/06/09. 10.1007/s10278-007-9040-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Gomi T, Sakai R, Goto M, Watanabe Y, Takeda T, Umeda T. Comparison of Reconstruction Algorithms for Decreasing the Exposure Dose During Digital Tomosynthesis for Arthroplasty: a Phantom Study. J Digit Imaging. 2016;29(4):488–95. Epub 2016/03/05. 10.1007/s10278-016-9876-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Becker AS, Martini K, Higashigaito K, Guggenberger R, Andreisek G, Frauenfelder T. Dose Reduction in Tomosynthesis of the Wrist. AJR Am J Roentgenol. 2017;208(1):159–64. Epub 2016/10/21. 10.2214/AJR.16.16729 . [DOI] [PubMed] [Google Scholar]
  • 9.Gomi T, Sakai R, Goto M, Hara H, Watanabe Y, Umeda T. Evaluation of digital tomosynthesis reconstruction algorithms used to reduce metal artifacts for arthroplasty: A phantom study. Phys Med. 2017;42:28–38. Epub 2017/11/28. 10.1016/j.ejmp.2017.07.023 . [DOI] [PubMed] [Google Scholar]
  • 10.Candes EJ, Romberg J, Tao T. Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information. IEEE Trans Inf Theory. 2006;52:489–509. 10.1109/TIT.2005.862083 [DOI] [Google Scholar]
  • 11.Sidky EY, Pan X. Image reconstruction in circular cone-beam computed tomography by constrained, total-variation minimization. Phys Med Biol. 2008;53(17):4777–807. Epub 2008/08/15. 10.1088/0031-9155/53/17/021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Aharon M, Elad M, Bruckstein A. K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans Signal Process. 2006;54:4311–22. 10.1109/TSP.2006.881199 [DOI] [Google Scholar]
  • 13.Du Y, Wang X, Xiang X, Wei Z. Evaluation of hybrid SART + OS + TV iterative reconstruction algorithm for optical-CT gel dosimeter imaging. Phys Med Biol. 2016;61(24):8425–39. Epub 2016/11/16. 10.1088/0031-9155/61/24/8425 . [DOI] [PubMed] [Google Scholar]
  • 14.Wu T, Moore RH, Rafferty EA, Kopans DB. A comparison of reconstruction algorithms for breast tomosynthesis. Med Phys. 2004;31(9):2636–47. Epub 2004/10/19. 10.1118/1.1786692 . [DOI] [PubMed] [Google Scholar]
  • 15.Gomi T, Hirano H, Umeda T. Evaluation of the X-ray digital linear tomosynthesis reconstruction processing method for metal artifact reduction. Comput Med Imaging Graph. 2009;33(4):267–74. Epub 2009/02/25. 10.1016/j.compmedimag.2009.01.004 . [DOI] [PubMed] [Google Scholar]
  • 16.Gomi T, Sakai R, Goto M, Hara H, Watanabe Y. Development of a novel algorithm for metal artifact reduction in digital tomosynthesis using projection-based dual-energy material decomposition for arthroplasty: A phantom study. Phys Med. 2018;53:4–16. Epub 2018/09/23. 10.1016/j.ejmp.2018.07.011 . [DOI] [PubMed] [Google Scholar]
  • 17.Wellenberg RH, Boomsma MF, van Osch JA, Vlassenbroek A, Milles J, Edens MA, et al. Low-dose CT imaging of a total hip arthroplasty phantom using model-based iterative reconstruction and orthopedic metal artifact reduction. Skeletal Radiol. 2017;46(5):623–32. Epub 2017/02/17. 10.1007/s00256-017-2580-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Funama Y, Taguchi K, Utsunomiya D, Oda S, Hirata K, Yuki H, et al. A newly-developed metal artifact reduction algorithm improves the visibility of oral cavity lesions on 320-MDCT volume scans. Phys Med. 2015;31(1):66–71. Epub 2014/12/03. 10.1016/j.ejmp.2014.10.003 . [DOI] [PubMed] [Google Scholar]
  • 19.Zhang K, Zuo W, Chen Y, Meng D, Zhang L. Beyond a Gaussian denoiser: residual learning of deep CNN for image denoising. IEEE Trans Image Process. 2017;26(7):3142–55. 10.1109/TIP.2017.2662206 [DOI] [PubMed] [Google Scholar]
  • 20.Yoon Y, Jeon HG, Yoo D, Lee JY, Kweon IS. Learning a deep convolutional network for light-field image super-resolution IEEE international conference on computer vision workshop. 2015:57–65. 10.1109/ICCVW.2015.17 [DOI]
  • 21.Chen H, Zhang Y, Kalra MK, Lin F, Chen Y, Liao P, et al. Low-Dose CT With a Residual Encoder-Decoder Convolutional Neural Network. IEEE Trans Med Imaging. 2017;36(12):2524–35. Epub 2017/06/18. 10.1109/TMI.2017.2715284 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Wolterink JM, Leiner T, Viergever MA, Isgum I. Generative Adversarial Networks for Noise Reduction in Low-Dose CT. IEEE Trans Med Imaging. 2017;36(12):2536–45. Epub 2017/06/03. 10.1109/TMI.2017.2708987 . [DOI] [PubMed] [Google Scholar]
  • 23.Zhang Y, Yu H. Convolutional Neural Network Based Metal Artifact Reduction in X-Ray Computed Tomography. IEEE Trans Med Imaging. 2018;37(6):1370–81. Epub 2018/06/06. 10.1109/TMI.2018.2823083 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Park HS, Lee SM, Kim HP, Seo JK, Chung YE. CT sinogram-consistency learning for metal-induced beam hardening correction. Med Phys. 2018;45(12):5376–84. Epub 2018/09/22. 10.1002/mp.13199 . [DOI] [PubMed] [Google Scholar]
  • 25.Pessis E, Campagna R, Sverzut JM, Bach F, Rodallec M, Guerini H, et al. Virtual monochromatic spectral imaging with fast kilovoltage switching: reduction of metal artifacts at CT. Radiographics. 2013;33(2):573–83. Epub 2013/03/13. 10.1148/rg.332125124 . [DOI] [PubMed] [Google Scholar]
  • 26.Kuchenbecker S, Faby S, Sawall S, Lell M, Kachelriess M. Dual energy CT: how well can pseudo-monochromatic imaging reduce metal artifacts? Med Phys. 2015;42(2):1023–36. Epub 2015/02/06. 10.1118/1.4905106 . [DOI] [PubMed] [Google Scholar]
  • 27.Yue D, Fan Rong C, Ning C, Liang H, Ai Lian L, Ru Xin W, et al. Reduction of metal artifacts from unilateral hip arthroplasty on dual-energy CT with metal artifact reduction software. Acta Radiol. 2018;59(7):853–60. Epub 2017/09/14. 10.1177/0284185117731475 . [DOI] [PubMed] [Google Scholar]
  • 28.Hegazy MAA, Eldib ME, Hernandez D, Cho MH, Cho MH, Lee SY. Dual-energy-based metal segmentation for metal artifact reduction in dental computed tomography. Med Phys. 2018;45(2):714–24. Epub 2017/12/09. 10.1002/mp.12719 . [DOI] [PubMed] [Google Scholar]
  • 29.Katsura M, Sato J, Akahane M, Kunimatsu A, Abe O. Current and Novel Techniques for Metal Artifact Reduction at CT: Practical Guide for Radiologists. Radiographics. 2018;38(2):450–61. Epub 2018/03/13. 10.1148/rg.2018170102 . [DOI] [PubMed] [Google Scholar]
  • 30.Maeda K, Matsumoto M, Taniguchi A. Compton-scattering measurement of diagnostic x-ray spectrum using high-resolution Schottky CdTe detector. Med Phys. 2005;32(6):1542–7. Epub 2005/07/15. 10.1118/1.1921647 . [DOI] [PubMed] [Google Scholar]
  • 31.Alvarez RE, Macovski A. Energy-selective reconstructions in X-ray computerized tomography. Phys Med Biol. 1976;21(5):733–44. Epub 1976/09/01. 10.1088/0031-9155/21/5/002 . [DOI] [PubMed] [Google Scholar]
  • 32.Berger M, Hubbell J. Photon cross sections on a personal computer. Gent Radiat Res. 1987:1–28. 10.2172/6016002 [DOI] [Google Scholar]
  • 33.Sutskever I, Martens J, Dahl G, Hinton G. On the importance of initialization and momentum in deep learning. Proceedings of the 30th international conference on machine learning. 2013;PMLR 28(3):1139–47.
  • 34.Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. International conference on machine learning. 2015:448–56.
  • 35.Krizhevsky A, Sutskever I, Hinton GE. Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems. 2012:1097–105. [Google Scholar]
  • 36.Chen Y, Yu W, Pock T. On learning optimized reaction diffusion processes for effective image restoration. IEEE Conference on Computer Vision and Pattern Recognition. 2015:5261–9. 10.1109/CVPR.2015.7299163 [DOI]
  • 37.Chen Y, Pock T. Trainable nonlinear reaction diffusion: A flexible framework for fast and effective image restoration. IEEE transactions on Pattern Analysis and Machine Intelligence. 2017;39(6):1256–72. 10.1109/TPAMI.2016.2596743 [DOI] [PubMed] [Google Scholar]
  • 38.Wang Y, Qian B, Li B, Qin G, Zhou Z, Qiu Y, et al. Metal artifacts reduction using monochromatic images from spectral CT: evaluation of pedicle screws in patients with scoliosis. Eur J Radiol. 2013;82(8):e360–6. Epub 2013/03/23. 10.1016/j.ejrad.2013.02.024 . [DOI] [PubMed] [Google Scholar]
  • 39.Haralick RM, Shanmugam K, Dinstein I. Textural features for image classification. IEEE Trans Syst Man Cybern. 1973;SMC-3(6):610–21. 10.1109/TSMC.1973.4309314 [DOI] [Google Scholar]

Decision Letter 0

Li Zeng

26 Jul 2019

PONE-D-19-15441

Development of a novel denoising convolutional neural network-based algorithm for metal artifact reduction in digital tomosynthesis for arthroplasty: A phantom study

PLOS ONE

Dear Prof Gomi,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

We would appreciate receiving your revised manuscript by Sep 09 2019 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter.

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as a separate file and labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as a separate file and labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. This file should be uploaded as a separate file and labeled 'Manuscript'.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

We look forward to receiving your revised manuscript.

Kind regards,

Li Zeng

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

http://www.journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and http://www.journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please ensure that you have fully discussed how the present study advances on your previous work in this area. Please ensure that you discuss how your work relates to, and advances upon, the following publication:

"Development of a novel algorithm for metal artifact reduction in digital tomosynthesis using projection-based dual-energy material decomposition for arthroplasty: A phantom study"

https://www.physicamedica.com/article/S1120-1797(18)31136-0/fulltext


Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors are developing a novel denoising convolutional neural network metal artifact reduction hybrid reconstruction (DnCNN-MARHR) algorithm for decreasing metal objects in digital tomosynthesis (DT) for arthroplasty employing projection data. The DnCNN-MARHR algorithm is based on a training network (mini-batch stochastic gradient descent algorithm with momentum) that estimates residual reference and object images using projection data and subtracts the estimated residual images from the object images, involving hybrid and subjectively reconstructed image usage (back projection and maximum likelihood expectation maximization).

However, the proposed methodology is not new. The paper simply combines several preexisting and widely available techniques, such as convolutional neural networks, residual learning, the mini-batch stochastic gradient descent algorithm, back projection, and maximum likelihood expectation maximization. In this context, what are the methodological and algorithmic contributions of the work to the research community?
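For orientation, a minimal sketch of the projection-domain residual-learning step summarized above might look as follows. It assumes NumPy arrays and a placeholder `dncnn` callable standing in for the trained network; it is an illustrative sketch under those assumptions, not the authors' implementation.

```python
import numpy as np

def residual_target(object_proj: np.ndarray, reference_proj: np.ndarray) -> np.ndarray:
    """Training target in residual learning: the difference between the 70 kV
    object projection (with metal artifacts) and the 140 keV virtual
    monochromatic reference projection."""
    return object_proj - reference_proj

def correct_projection(object_proj: np.ndarray, dncnn) -> np.ndarray:
    """Inference step: subtract the network-estimated residual from the object
    projection to obtain an artifact-reduced projection."""
    return object_proj - dncnn(object_proj)

# Usage example with a dummy 'network' that predicts a zero residual:
if __name__ == "__main__":
    proj = np.ones((64, 64))
    corrected = correct_projection(proj, dncnn=lambda p: np.zeros_like(p))
```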

#Comments

1. Overall, I am not sure whether the purpose of this paper is denoising or metal artifact reduction.

2. The structure of the article is very confusing. For instance, the equation for one evaluation criterion (artifact index) is in the Optimization parameters section, whereas the other two evaluation criteria are in the Evaluation section.

3. The Methods section is badly written. Many important aspects of the methodology are left unexplained in the article. The authors state that they use the low-energy projection image (PL) as the input image and the VM X-ray image as the reference image for the training workflow (Page 8, Lines 159-160). There is no information on how many total data sets were used and how many data sets were allocated to training, validation, and testing.

4. It is very confusing how the reference projection images (VM X-ray images) are generated. What is the meaning of Ft, Fw, and Ff in Equation (5), and how are the values of Ft, Fw, and Ff determined? And why can the VM X-ray image serve as the reference projection image for training the DnCNN? In my opinion, there are still artifacts in this image.

5. In Fig 2, I cannot see a difference between the original image (PL) and the artifact-reduced projection image (Cor_img) from the DnCNN step. I suggest that the authors directly use the original image (PL) with Equation (11) to obtain a final MAR image and compare that MAR image with the DnCNN-MARHR image; I suspect the difference in image quality between the two is small.

Reviewer #2: The authors proposed a novel denoising convolutional neural network metal artifact reduction hybrid reconstruction (DnCNN-MARHR) algorithm to reduce metal artifacts in digital tomosynthesis. The proposed method is based on a training network that estimates the residual between reference and object images using projections; the estimated residual images are then subtracted from the object images to obtain artifact-reduced projections. Finally, a hybrid reconstruction algorithm is used to obtain the reconstructed tomosynthesis image with metal artifact reduction. The authors also performed a phantom study to compare the performance of the different methods. Overall, the paper includes extensive work and the comparison studies are very clear. However, there are a few limitations, and the authors are encouraged to address the following concerns:

1. As far as I know, a material decomposition algorithm using the dual-energy imaging technique needs a known-material phantom to build the relationship between the projection (attenuation) and the material properties in the calibration process. In the testing process, the low- and high-energy projections of the unknown object are converted to basis images of known materials in the projection domain. The virtual monochromatic images (VMIs) are then generated from the basis images and the attenuation coefficients, according to the linear combination of attenuation coefficients. Do you use a calibration phantom? Please provide more information on this point.
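To make the decomposition-and-synthesis workflow described in this comment concrete, a hedged sketch follows. The two-basis-material assumption, the function names, and the calibration-derived effective attenuation coefficients are illustrative assumptions, not details taken from the manuscript.

```python
import numpy as np

def decompose(p_low, p_high, mu_low, mu_high):
    """Per-pixel solution of the 2x2 system
        p_low  = mu_low[0]  * t1 + mu_low[1]  * t2
        p_high = mu_high[0] * t1 + mu_high[1] * t2
    for the basis-material equivalent thicknesses t1 and t2.
    mu_low / mu_high hold the effective attenuation coefficients of the two
    basis materials at the low- and high-energy settings (from calibration)."""
    a = np.array([[mu_low[0], mu_low[1]],
                  [mu_high[0], mu_high[1]]], dtype=float)
    a_inv = np.linalg.inv(a)
    t1 = a_inv[0, 0] * p_low + a_inv[0, 1] * p_high
    t2 = a_inv[1, 0] * p_low + a_inv[1, 1] * p_high
    return t1, t2

def synthesize_vmi(t1, t2, mu1_at_e, mu2_at_e):
    """Virtual monochromatic line integral at the chosen energy E, formed as a
    linear combination of the basis thickness maps."""
    return mu1_at_e * t1 + mu2_at_e * t2
```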

2. The DnCNN-MARHR algorithm is the major development of the study, and Eq. (7) is the core used to calculate the residual projection image. Please add more explanations or references.

3. What is the specification of the computing environment? How long do the training and testing processes take? What are the iteration numbers of the different image reconstructions? What are the differences in computation time between the DnCNN-MARHR algorithm and the other methods?

4. What is the image size of the projections? Is any image pre-processing applied to the projections?

5. The DEMDRA has the lowest AI value. Why not use the DEMDRA image as the reference image for the training network?

6. In general, metal artifacts can be reduced by increasing the tube voltage in 3D X-ray imaging (M.-J. Lee et al., “Overcoming artifacts from metallic orthopedic implants at high-field-strength MR imaging and multi-detector CT,” Radiographics, vol. 27, no. 3, pp. 791–803, 2007). Why do you choose the projection with the lower tube voltage as the input image for the DnCNN-MARHR algorithm?

7. Do you use any additional filtration at the low and high tube voltages? Could you provide the energy spectra for the low and high tube voltage settings?

8. How do you perform the data acquisition for dual-energy tomosynthesis imaging? Are the acquisitions with different imaging parameters performed sequentially?

9. What is the total number of slices in the reconstructed tomosynthesis image? What is the starting height of the reconstruction above the detector surface? Where is the focal point during tomosynthesis imaging?

10. Lines 313-315: Could you provide a 2D AI surface plot over epochs and weighting coefficients when the mini-batch size is 544?

11. Lines 334-335: In Fig. 2, is the MLEM_DnCNN image the same as the polychromatic MLEM-MAR [70 kV] image? And is the BP_DnCNN image the same as the polychromatic filtered back projection (FBP) MAR [70 kV] image? Why do you combine the MLEM_DnCNN and BP_DnCNN images? Is this based on empirical knowledge?
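As an illustration of the hybrid step this comment asks about, the two reconstructions of the DnCNN-corrected projections could be merged with a weighting coefficient, for example as sketched below; the weight value and function name are illustrative assumptions, not the manuscript's setting.

```python
import numpy as np

def hybrid_combine(recon_bp, recon_mlem, w=0.5):
    """Weighted combination of the back-projection and MLEM reconstructions of
    the DnCNN-corrected projections; w = 0.5 is only an example value."""
    return w * np.asarray(recon_bp) + (1.0 - w) * np.asarray(recon_mlem)
```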

12. Could you provide a schematic diagram illustrating the relationship between the in-focus plane and the out-of-plane regions in the Z-axis direction?

13. What is the execution time of the DnCNN-MARHR algorithm (not including training of the network)?

14. Lines 447-448: Are the imaging parameters set case by case? Can the proposed algorithm deal with objects in different regions or with various kinds of objects (e.g., materials, …)?

Minor corrections:

1. Lines 137-138: Please normalize the expression of units (mm vs. μm).

2. Lines 175-176: Please double-check the correctness of Eq. (2) and Eq. (3). Is the exponential term missing (cf. the generic attenuation form sketched after this list)?

3. Line 238 “Equation (6)”

4. Line 245 “Equation (7)”
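For reference, the generic polychromatic attenuation relation whose exponential form minor correction 2 alludes to is the Beer–Lambert law; the expression below is an illustrative generic form, not a reproduction of the manuscript's Eq. (2) or Eq. (3).

```latex
% Generic Beer–Lambert attenuation along a ray; illustrative only.
I(E) = I_0(E)\,\exp\!\left(-\int \mu(E, l)\,\mathrm{d}l\right)
```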

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Chia-Hao Chang

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step.

Decision Letter 1

Li Zeng

29 Aug 2019

[EXSCINDED]

Development of a denoising convolutional neural network-based algorithm for metal artifact reduction in digital tomosynthesis for arthroplasty: A phantom study

PONE-D-19-15441R1

Dear Dr. Gomi,

We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements.

Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication.

Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

With kind regards,

Li Zeng

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #2: Thank you for all your efforts to include new figures and add more explanations in the manuscript. The current version is much clearer and well supported by the further information provided. However, I suggest adding definitions of the material fractions (F_t, F_w, F_f) and of XDTS_EL and XDTS_EH in Equation (5). That would help readers understand the physical meaning of this equation.
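For orientation only, one generic form that a three-material dual-energy relation involving these quantities could take is sketched below; this is an assumed illustration, not a reproduction of the manuscript's Equation (5), and the basis-material subscripts are placeholders.

```latex
% Assumed illustration only; not the manuscript's Equation (5).
% X_{DTS,E_L}, X_{DTS,E_H}: low- and high-energy DT projection signals
% F_t, F_w, F_f: fractions of three basis materials; \mu(E): attenuation coefficients
\begin{aligned}
X_{\mathrm{DTS},E_L} &= F_t\,\mu_t(E_L) + F_w\,\mu_w(E_L) + F_f\,\mu_f(E_L)\\
X_{\mathrm{DTS},E_H} &= F_t\,\mu_t(E_H) + F_w\,\mu_w(E_H) + F_f\,\mu_f(E_H)\\
F_t + F_w + F_f &= 1
\end{aligned}
```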

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Acceptance letter

Li Zeng

4 Sep 2019

PONE-D-19-15441R1

Development of a denoising convolutional neural network-based algorithm for metal artifact reduction in digital tomosynthesis for arthroplasty: A phantom study

Dear Dr. Gomi:

I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

For any other questions or concerns, please email plosone@plos.org.

Thank you for submitting your work to PLOS ONE.

With kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Professor Li Zeng

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Attachment

    Submitted filename: rebuttal_letter.pdf

    Data Availability Statement

    All relevant data are within the manuscript.

