Abstract
Objective(s):
This study aimed to create a deep learning (DL)-based denoising model using a residual neural network (Res-Net) trained to reduce noise in ring-type dedicated breast positron emission tomography (dbPET) images acquired in about half the emission time, and to evaluate the feasibility and the effectiveness of the model in terms of its noise reduction performance and preservation of quantitative values compared to conventional post-image filtering techniques.
Methods:
Low-count (LC) and full-count (FC) PET images with acquisition durations of 3 and 7 minutes, respectively, were reconstructed. A Res-Net was trained to create a noise reduction model using fifteen patients’ data. The inputs to the network were LC images and its outputs were denoised PET (LC + DL) images, which should resemble FC images. To evaluate the LC + DL images, Gaussian and non-local mean (NLM) filters were applied to the LC images (LC + Gaussian and LC + NLM, respectively). To create reference images, a Gaussian filter was applied to the FC images (FC + Gaussian). The usefulness of our denoising model was objectively and visually evaluated using test data set of thirteen patients. The coefficient of variation (CV) of background fibroglandular tissue or fat tissue were measured to evaluate the performance of the noise reduction. The SUVmax and SUVpeak of lesions were also measured. The agreement of the SUV measurements was evaluated by Bland–Altman plots.
Results:
The CV of background fibroglandular tissue in the LC + DL images was significantly lower (9.102.76) than the CVs in the LC (13.60 3.66) and LC + Gaussian images (11.51 3.56). No significant difference was observed in both SUVmax and SUVpeak of lesions between LC + DL and reference images. For the visual assessment, the smoothness rating for the LC + DL images was significantly better than that for the other images except for the reference images.
Conclusion:
Our model reduced the noise in dbPET images acquired in about half the emission time while preserving quantitative values of lesions. This study demonstrates that machine learning is feasible and potentially performs better than conventional post-image filtering in dbPET denoising.
Key Words: Deep neural networks, db PET, Noise reduction, Image quality
Introduction
Malignant tumors are often detected and staged using [18F]fluorodeoxyglucose ([18F]FDG) positron emission tomography (PET) (1). [18F]FDG PET is also used for staging or re-staging breast cancer as well as predicting the response to therapy and diagnosing recurrence in breast cancer patients (2-5). However, whole body PET (WB-PET) may have a reduced ability to detect small (<10 mm) breast cancer tumors (6, 7). A recent study also reported that the sensitivity of dbPET was higher for tumors smaller than 10 mm compared to WB-PET (8). In addition, partial volume effects (PVE) in PET image occurs whenever the tumor size is less than three times the full width half maximum (FWHM) of the spatial resolution (9). The FWHM of the spatial resolution of commercially available WB-PET systems is reported to be 4.0-5.0 mm (10, 11). Because of the limited spatial resolution of WB-PET, the quantitative values of small tumors are affected by PVE (12).
To overcome those limitations, high-resolution breast PET scanners have been developed. These systems have been used to detect breast cancer lesions, diagnose intra-mammary spread, assess the morphological details of tumors, and metabolic information (13). There are two types of high-resolution breast PET scanners: positron emission mammography (PEM) (14) and ring-type dedicated breast PET (dbPET) (15). PEM provides limited-angle tomographic images using two planar or curved detectors, whereas dbPET provides fully tomographic images of the breast with a ring-shaped detector (16). dbPET can provide PET images with higher spatial resolution than WB-PET because of the small size of the crystals, the proximity of the detectors to the breast and the reduction of non-collinearity effects due to smaller ring diameters. Miyake et al reported that the FWHM of the spatial resolution of the dbPET system is 0.8-1.3 mm when reconstructed with a clinically used reconstruction method (17). Because of the high spatial resolution of dbPET, it has been reported that the ability of dbPET to detect breast cancers smaller than 10 mm is better than that of WB-PET (8). A phantom study using microspheres less than 10 mm in diameter has also reported higher detectability with dbPET compared to WB-PET (18). In addition, Berg et al. previously reported that PEM had improved specificity compared with MRI (19). Furthermore, the usefulness of dbPET for evaluating the breast cancer response to neoadjuvant chemotherapy using the standardized uptake value (SUV) has also been reported (20).
Despite the high maximal sensitivity at the center of the axial field of view of the dbPET system, the dbPET images often have a high level of noise, especially around the edge of the detector, due to a decrease in effective counts near the edge (16). A reconstruction method that prioritizes improvement of specificity of detected uptake patterns can also increase image noise. The noise in the dbPET images may lead to the detection of a larger number of non-pathologic uptake foci, and result in false-positive diagnoses (21). Several methods can be used to suppress the noise of PET images, including increasing acquisition time, post-image filtering, such as a Gaussian filter or non-local mean (NLM) filter (22), and applying Bayesian penalized likelihood reconstruction algorithms (23). Longer scan times increase the probability of motion artifacts and physical burden on a patient, especially in dbPET, which scans the patient in a prone position. A Gaussian filter is sometimes used for dbPET image denoising, but it can remove details of the tumor structure (24). An NLM filter can reduce image noise while preserving image details, but it requires some parameters to be optimized. Bayesian penalized likelihood reconstruction algorithms also need a regularization parameter to be set to control noise and preserve edges, and a bad optimization of this parameter leads to over-smoothed images.
Recently, machine learning methods for PET denoising, such as those based on convolutional neural networks and U-Net (25, 26), have achieved improvements in both objective and subjective assessment. However, to our knowledge, no study has used machine learning for noise reduction in dbPET images, which have a higher spatial resolution than WB-PET. U-Net has occasionally been used for PET image denoising, but this architecture may cause blurred images due to the down-sampling and up-sampling, despite the use of skip connections (25-27). Blur in images is a problem, especially for dbPET, which requires high spatial resolution. Convolution filters with large kernel size increase the receptive field size without down-sampling and up-sampling, thus avoid blurring (28). A residual neural network (Res-Net) also prevents the blurring of images in machine learning-based denoising (29, 30).
In this study, we created a deep learning (DL)-based denoising model using a Res-Net with large kernel size of convolution filters that was trained to reduce noise in dbPET images acquired in about half the emission time. We evaluated the usefulness of the model in terms of its noise reduction performance and preservation of quantitative values by comparing it with conventional post-image filtering.
Materials
Patient data
A total of twenty-eight consecutive patients with known or suspected breast cancers who underwent dbPET scan from February 2021 to December 2021 were included in this study. Patients fasted at least 4 h prior to administration of [18F]FDG (3.5 MBq/kg) and were scanned 90 minutes after administration. PET data were acquired in three-dimensional (3D) list mode for 7 minutes per breast using a dbPET scanner (Elmammo Avant Class, Shimadzu Corp., Kyoto, Japan). We also acquired 3 minutes of PET data from the list data. Low-count (LC) and full-count (FC) PET images with acquisition durations of 3 and 7 minutes, respectively, were reconstructed with the 3D list mode dynamic row-action maximum-likelihood algorithm (DRAMA) using one iteration, 128 subsets, and a relaxation control parameter of β =20 for all data sets. No post-smoothing was applied to the PET images. The matrix size was 236×132 with pixel sizes of 0.78×0.78 mm, and the slice thickness was 0.78 mm. Scatter correction was conducted using the convolution-subtraction method. Attenuation correction was performed using calculated uniform attenuation maps created from tissue boundaries estimated from the emission data. This study was performed in line with the principles of the Declaration of Helsinki. Approval was granted by the Ethics Committee of Kyoto University Graduate School and Faculty of Medicine (Approval number, R3034). Informed consent was waived by the Ethics Committee of Kyoto University Graduate School and Faculty of Medicine due to the retrospective design.
Network architecture
Our network structure was similar to the Res-Net used in a prior study (30). Skip connection from input-end to output-end was used in this architecture to compensate the lost details and to perform residual learning simultaneously. The network architecture is shown in Figure 1 and is composed of convolutional layers with a 15×15 kernel size, batch normalization (BN) (31), and parametric rectified linear unit (PReLU) activation (32). BN was added between the convolution and PReLU layers. The number of filters for each convolutional layer was 128, and the spatial size of the network input was 236×132. A larger receptive field size can make use of context information in a larger image region, and hence we adopted the large kernel size of 15×15. To avoid the problem of vanishing gradients that occurs the rectified linear unit (ReLU) activation function, we used PReLU as the activation function of the network.
Network training
Fifteen patients’ data were used for training (6,372 images) and validation (708 images). We randomly reserved 10% of the training samples for validation data to monitor the performance of the network during training. The inputs for the network were two-dimensional (2D) LC images. The outputs were denoised 2D PET (LC + DL) images, which should resemble the FC images. The network was trained for 100 epochs and optimized using the Adam optimizer (33) to minimize the mean squared error (MSE). The batch size was 16, and a learning rate of 0.001 was used. The network was implemented using Keras with a TensorFlow backend (Google, Mountain View, California), and trained used a single NVIDIA GeForce RTX 2080Ti GPU (NVIDIA Corporation, Santa Clara, California).
Comparison with conventional post-image filtering
To evaluate the incremental value of our denoising model, LC + DL images were compared to dbPET images denoised by conventional post-image filtering. Gaussian filter and an NLM filter were applied to the LC images (LC+Gaussian and LC+NLM, respectively). To create reference images, a Gaussian filter was also applied to the FC images to obtain the same reconstruction conditions as in our clinical setting (FC+Gaussian). The full width at half maximum of the Gaussian filter was 1.17 mm. For the NLM method, the patch size was 3×3, and the search window size was 5×5. The standard deviation of the Gaussian kernel used in the NLM method was set to be the standard deviation of the background fibroglandular tissue in breasts measured with the LC images. The other parameters of the NLM filter were determined based on previous reports (34, 35).
Quantitative analysis
The performance of the noise reduction model was objectively evaluated using a test data set of thirteen patients with and without breast lesions. The SUVmean and coefficient of variation (CV) of the background fibroglandular tissue without lesions were measured to assess the noise level of the images. In addition, the CV of the background fibroglandular tissue or fat tissue at the edge of the FOV was also measured. For visually FDG-avid breast lesions that were histologically confirmed or highly suspicious on imaging modalities other than dbPET, the SUVmax and SUVpeak were measured to evaluate the effect of this model on the lesion uptake values.
The SUVmean and CV of the background fibroglandular tissue were obtained from five 2D regions of interest (2D-ROIs) with a diameter of 8 mm per breast placed on background fibroglandular tissue. The CV of the background fibroglandular tissue or fat tissue at the edge of the FOV were obtained from five 2D-ROIs of 10×30 pixel rectangles placed at 5 pixels from the FOV edge. Each ROI was placed in 5 different slices, which are at least 5 slices apart from each other to include as wide range of background fibroglandular tissue or fat tissue as possible (Figure 2). The SUVmean is an average of SUV within the 2D-ROI, and the CV was calculated using the following equation.
Here, is the standard deviation (SD) within the 2D-ROI.
The SUVmax of the lesions was obtained from a 3D volume of interest (3D-VOI). The SUVpeak was defined as the average SUV, which was measured in a 2D-ROI with a fixed diameter of 10 mm centered at the maximum value of the lesions. In addition, the agreement of the SUV measurements of the reference (FC + Gaussian) and each target image (LC, LC + Gaussian, LC + NLM, and LC + DL) was assessed. Relative differences were calculated for the SUVmax and SUVpeak using the FC + Gaussian images as reference, and the agreement was evaluated using Bland–Altman plots. The relative difference (d) between the reference and the target images was defined as the following equation.
Here, andare the SUV measurements obtained in the target and reference images, respectively. The bias and variance of the relative differences in the SUV measurements were defined as the mean and 1.96×SD of d, respectively.
The lesions were classified into three types of uptake: focus, mass uptake (MU), and non-mass uptake (NMU), based on the 3D morphologic features with reference to a previous report (15).
All analyses were performed using MATLAB 2021a (The MathWorks, Inc., Natick, Massa-chusetts).
Visual assessment
For the visual assessment, craniocaudal (CC) and mediolateral (ML) maximum intensity projection (MIP) images of test data sets were visually evaluated for smoothness (degree of the image noise) and lesion contrast between the mammary gland and lesions using a four-point scale (0, not acceptable for diagnosis; 1, acceptable; 2, good; and 3, excellent) by an experienced nuclear medicine physician and an experienced PET technologist blinded to the reconstruction settings. For lesion contrast, FDG-avid breast lesions that were histologically confirmed or highly suspicious on imaging modalities other than dbPET were visually evaluated on CC or ML MIP images. The MIP images were displayed on an inverse grayscale with a standardized uptake range of 0–4.
Statistical analysis
The SUVmean and CV of background fibro-glandular tissue or fat tissue obtained for all image sets were compared using the paired t-test with Bonferroni correction. The SUVmax and SUVpeak of the lesions in the reference images and target images were compared using the Wilcoxon signed rank test with Bonferroni correction. Differences in patient characteristics in the data sets was examined using the Mann–Whitney U test and Fisher's exact test. Visual scores for all image sets were compared using the Wilcoxon paired ranked-sum test with Bonferroni correction. Inter-reader agreement was evaluated using Cohen’s kappa test. A p value of less than 0.05 was considered statistically significant for each analysis. The statistical analysis was performed using JMP® 16.1.0 (SAS Institute Inc., Cary, NC, USA).
Results
Table 1 summarizes the characteristics of fifteen patients (age range, 34–81 years; mean age, 65 years; number of lesions, 20) in the training and validation data sets and thirteen patients (age range, 49–82 years; mean age, 63 years; number of lesions, 22) in the test data set. There was no significant difference between the two sets in terms of age and treatment prior to dbPET examinations.
Table 1.
Characteristics | Training + validation (n=15) | Test (n=13) | p value |
---|---|---|---|
Age (mean±SD) | 65±14.5 | 63±11.6 | 0.53 |
Number of lesions | 20 | 22 | |
Treatment prior to dbPET | 1.00 | ||
No treatment | 12 | 11 | |
Neoadjuvant chemotherapy | 2 | 1 | |
Surgery | 1 | 1 |
For the reference and each of the target images, Figure 3 shows trans-axial images and Figure 4 shows CC MIP images. A subjective visual inspection revealed that the LC + DL image has lower noise levels than the LC image. The SUVmean of the background fibroglandular tissue in the LC + DL images was slightly higher (mean±SD [95%confidence interval (CI)], 1.18±0.33 [1.11–1.27]) than that in the LC, LC + Gaussian, and LC + NLM images (mean±SD [95% CI], 1.18±0.34 [1.10-1.27], 1.18±0.34 [1.10–1.27] and 1.18±0.33 [1.10-1.26] respectively), whereas no significant difference in SUVmean was observed between the LC + DL and reference images (mean±SD [95% CI], 1.19±0.33 [1.11-1.28]). The CV of the background fibroglandular tissue in the LC + DL images was significantly lower (mean±SD [95% CI], 9.10±2.76 [8.42–9.79]) than that in the LC, LC + Gaussian, and LC + NLM images (mean±SD [95% CI], 13.60±3.66 [12.69–14.51], 11.51±3.56 [10.63–12.39]and 10.40±3.92 [9.43–11.38], respectively), whereas no significant difference in CV was observed between the LC + DL and reference images (mean±SD [95% CI], 8.64±2.75 [7.96–9.32]). Figure 5 and Figure 6 show the SUVmean and CV of the background fibro-glandular tissue. The CV of the background fibroglandular tissue or fat tissue at the edge of the FOV in the LC + DL images was significantly lower (mean±SD [95% CI], 19.60±6.32 [18.03–21.16]) than that in the LC, LC + Gaussian, LC + NLM, and reference images (mean±SD [95% CI], 24.33±5.33 [23.00–25.65], 22.22±5.55 [20.84–23.60], 21.24±5.78 [19.81–22.67] and 21.88±6.33 [22.28–25.33], respectively). Figure 7 shows the CV of the background fibroglandular tissue or fat tissue at the edge of the FOV.
Quantitative assessment was performed for a total of twenty-two lesions in the test data set. The details of the lesions are listed in Table 2. Six lesions were focus lesions and sixteen were mass uptake lesions.
Table 2.
Characteristics | Number ( %) or value |
---|---|
Diagnosis | |
Invasive ductal carcinoma* | 14 (64.6) |
Ductal carcinoma in situ* | 6 (27.3) |
Paget disease* | 1 (4.5) |
Lymph node metastasis** | 1 (4.5) |
U ptake type | |
Focus | 6 (27.3) |
Mass uptake | 16 (72.7) |
Non- mass uptake | 0 (0) |
S UV max (mean±SD ) * * * | |
All | 9.958.15 |
Focus | 2.84±0.85 |
Mass uptake | 12.61±8.05 |
* Diagnosed based on the histopathological findings
** Diagnosed based on imaging studies
*** SUVmax measured on the reference (FC + Gaussian) images
The SUVmax of the lesions for the LC (median, 2.48; interquartile range (IQR), 4.39 to 12.51) and LC + NLM (median, 2.39; IQR, 4.39 to 12.51) images were significantly higher than those for
the reference images (median, 1.70; IQR, 4.11 to 12.02) (p<0.001, p<0.001, respectively), and no significant differences in SUVmax were observed for the LC + Gaussian (median, 2.20; IQR, 4.00 to 11.93) and LC + DL images (median, 1.91; IQR, 3.80 to 11.65) when compared with the reference images. The SUVpeak for the LC (median, 0.74; IQR, 1.69 to 8.22), LC + Gaussian (median, 0.73; IQR, 1.71 to 8.17), and LC + NLM (median, 0.72; IQR, 1.69 to 8.22) images were significantly higher than those for the reference images (median, 0.84; IQR, 1.67 to 8.72) (p<0.033, p<0.042, p<0.033, respectively), and no significant differences in SUVpeak were observed for the LC + DL images (median, 0.76; IQR, 1.70 to 8.75) when compared with the reference images. Table3 shows the results of the SUVmax and SUVpeak of the lesions. The relative differences for SUVmax and SUVpeak are shown in Bland– Altman plots in Figure 8 and Figure 9, respectively. The Bland–Altman plots show the lowest mean bias of the relative differences for SUVmax and SUVpeak (−0.07 % and 0.80 %) in the LC + DL images. In the LC + DL images, the variance of the relative difference in SUVmax was the smallest, while that for SUVpeak was the largest.
Table 3.
Image sets |
Median (Interquartile range)
SUV max SUV peak |
|
---|---|---|
FC + Gaussian | 1.70 (4.11-12.02) | 0.84 (1.67-8.72) |
LC | 2.48 (4.39-12.51) | 0.74 (1.69-8.22) |
LC + Gaussian | 2.20 (4.00-11.93) | 0.73 (1.71-8.17) |
LC+NML | 2.39 (4.39-12.51) | 0.74 (1.69-8.22) |
LC + DL | 1.91 (3.80-11.65) | 0.76 (1.70-8.75) |
Data are presented as median and interquartile range
Table 4 and 5 shows the results of the visual evaluation of smoothness and contrast for all image sets, respectively. The smoothness for the LC + DL images was significantly better than that for the other images except for the reference images (p<0.001). Furthermore, the smoothness for the reference images was significantly better than that for the other images except for the LC + DL images (p<0.001). The smoothness for the LC + NLM images was also significantly better than that for the LC + Gaussian images (p=0.003). No significant differences in the lesion contrast were observed in all image sets.
Table 4.
Visual score | |||||
---|---|---|---|---|---|
FC + Gaussian | LC | LC + Gaussian | LC+NML | LC + DL | |
0 | 8 (6.7) | 16 (13.3) | 14 (11.7) | 22 (18.3) | 4 (3.3) |
1 | 20 (16.7) | 83 (69.2) | 77 (64.2) | 81 (67.5) | 37 (30.8) |
2 | 87 (72.5) | 21 (17.5) | 29 (24.2) | 17 (14.2) | 73 (60.8) |
3 | 5 (4.2) | 0 (0) | 0 (0) | 0 (0) | 6 (5.0) |
kappa | 0.15 | 0.20 | 0.22 | 0.18 | 0.29 |
Data are presented as number (%)
Table 5.
Visual score | |||||
---|---|---|---|---|---|
FC + Gaussian | LC | LC + Gaussian | LC+NML | LC + DL | |
0 | 1 (2.3) | 0 (0) | 0 (0) | 0 (0) | 0 (0) |
1 | 1 (2.3) | 1 (2.3) | 1 (2.3) | 1 (2.3) | 2 (4.5) |
2 | 1 (2.3) | 2 (4.5) | 2 (4.5) | 2 (4.5) | 1 (2.3) |
3 | 41 (93.2) | 41 (93.2) | 41 (93.2) | 41 (93.2) | 41 (93.2) |
kappa | 0.31 | 0.31 | 0.31 | 0.31 | 0.65 |
Data are presented as number (%)
Discussion
The present study showed that the quantitative values of lesions could be preserved at less than half of the emission time when using the Res-Net model to reduce noise.
In [18F]FDG PET examinations, it is desirable to reduce the acquisition time or injected activity; however, an insufficient count will lead to an increase in image noise. The noise can influence diagnosis, decrease the detectability of small lesions, and affect the SUV measurement. In the present study, our model significantly reduced noise due to reduced acquisition time when compared with the use of a Gaussian and an NLM filter. A post-smoothing filter is usually adopted to reduce noise in PET images, but its performance is limited because it is designed to reduce Gaussian random noise, which is distinct from the noise in PET images (36, 37). Our model can effectively reduce the noise in PET images, which is characterized by a complex noise distribution. An NLM filter is mainly used to remove Gaussian noise and speckle noise. In addition, the use of this filter requires the standard deviation of the noise to be set, and an improper setting will lead to blur in the images (29). The dbPET system has the characteristic of low sensitivity at the edge of the detector (17), and noise distributions in the trans-axial images used as the input of our model vary with respect to the location in the plane. Therefore, we believe it is challenging to determine the optimal parameters for the NLM filter. The present study showed that our model, which uses a larger filter size than the model in a previous report (30), was able to capture more context information in a larger image region and efficiently reduce the levels of location-dependent noise (38, 39).
Our model slightly changed the SUVmean of the background fibroglandular tissue compared to the reference images, as well as the use of the NLM filter. Non-linear image processing (e.g., the NLM filter or deep learning based denoising) may result in a slight shift the mean value of the image, but we consider this to have minimal clinical impact.
Semiquantitative analysis using the SUV is used to diagnose malignancy as well as monitor the response to therapy of a breast tumor (2). In the present study, our model obtained the lowest bias of the relative differences for SUVmax and SUVpeak. We believe that noise reduction using our model removed the variability in quantitative values of the lesions, while maintaining the SUVmean of the background fibroglandular tissue. These results are consistent with those of a previous study on noise reduction for low-dose [18F]FDG PET images using a supervised deep learning model (40). Using our model, the variance of the relative difference for SUVmax was the smallest, whereas that for SUVpeak was the largest. The SUVpeak was measured in an ROI centered at the maximum value of the lesions; therefore, the position of the ROI was not always identical among the images. In some cases, noise reduction with our model may have caused the position of the ROI to change with respect to the ROI in the reference images, which also has a degree of noise, resulting in an increase in the variability of the relative difference in SUVpeak.
A Gaussian filter leads to a slight decrease in SUVmax of lesions due to blurring. In this study, our model was trained using the image sets without a Gaussian filter, but denoised images obtained using our model exhibited comparable quantitative values to the reference images with a Gaussian filter. The network used in this study was trained with the MSE as a loss function, which is known to introduce slight blurring in the network output (25). In medical imaging, there are some reports that using the structural similarity index and perceptual loss as a loss function may improve the result and should be considered in future (41- 43).
The model used in this study consisted of fewer layers, whereas the filter size was larger than that used in a prior study (30), resulting in increased computational cost. The use of dilated convolution is expected to improve efficiency while maintaining performance (38).
There were several limitations in this study. First, the detectability of small lesions was not evaluated in this study. In general, the choice of a post-smoothing filter is concerned about loss of details in tumor structure and the reduced detectability of small lesions. Furthermore, there are some reports that an NLM filter leads to blurring and loss of the details of high-contrast small lesions especially in images with high noise levels (44, 45). Because PET images tend to have more noise in regions of high uptake (46), an NLM filter may cause reduced detectability of small lesions in the higher accumulation of the background fibroglandular tissue. Therefore, we believe it is important to compare the detectability of small lesions in images using our model and other post-smoothing filters. However, the visualization of the six focus lesions were maintained in our study. Furthermore, there were no non-mass uptake lesions in the test data, and future studies are needed to assess the influence of our
denoising model on non-mass uptake lesions. Second, our sample size was relatively small. Only twenty-eight patients were included in this study population and there were only twenty-two lesions. Data augmentation is one way to increase training data, but excessive data augmentation can lead to unpredictable results. Moreover, the noise distributions in the dbPET images vary with respect to the location in the plane. A post-smoothing filter denoise the image uniformly, but our model could have been efficiently reducing the location-dependent noise, especially on the chest wall side. However, the amount or distribution of the noise on the chest wall side varies among patients, so it takes a large number of sample size to investigate the detectability of lesions on the chest wall side. Thus, a larger sample size is required to create models and evaluate the detection performance. In addition, further consideration should be given to how the network is trained, particularly with respect to input images for the network. 2D images were used as the input of the network in this study, but previous reports have shown good performance of PET image denoising using 3D or 2.5D images as the inputs of the network (27, 28). The use of 3D or 2.5D images as the inputs may be possible to improve image quality and should be investigated in the future. Furthermore, we did not eliminate slices that include only air in this study, and it may be possible to improve the result by using only the slices include the breast.
The present study reduced the noise from dbPET images obtained in about half the emission time, and further reduction of emission time could be possible by training or evaluating models with fewer counts images.
Conclusion
The present study showed that the use of the Res-Net model reduced the noise in dbPET images acquired in about half the emission time while preserving quantitative values of lesions. The machine learning is feasible in the noise reduction in dbPET images and potentially performs better than conventional post-image filtering.
Competing interests
Author Masakazu Toi has received research support from Shimadzu Corporation.
Funding
No funding was received for conducting this study.
Acknowledgments
We would like to thank Koji Fujimoto, MD, PhD, for his expertise and assistance with our study. We thank Kimberly Moravec, PhD, from Edanz (https://jp.edanz.com/ac) for editing a draft of this manuscript.
References
- 1.Fletcher JW, Djulbegovic B, Soares HP, Siegel BA, Lowe VJ, Lyman GH, et al. Recommendations on the use of 18F-FDG PET in oncology. J Nucl Med. 2008;49(3):480–508. doi: 10.2967/jnumed.107.047787. [DOI] [PubMed] [Google Scholar]
- 2.Groheux D, Espié M, Giacchetti S, Hindié E. Performance of FDG PET/CT in the clinical management of breast cancer. Radiology. 2013;266(2):388–405. doi: 10.1148/radiol.12110853. [DOI] [PubMed] [Google Scholar]
- 3.Groheux D, Cochet A, Humbert O, Alberini JL, Hindié E, Mankoff D. 18F-FDG PET/CT for staging and restaging of breast cancer. J Nucl Med. 2016;57(Suppl 1):17S–26S. doi: 10.2967/jnumed.115.157859. [DOI] [PubMed] [Google Scholar]
- 4.Lee H, Lee DE, Park S, Kim TS, Jung SY, Lee S, et al. Predicting response to neoadjuvant chemotherapy in patients with breast cancer: Combined statistical modeling using clinicopathological factors and FDG PET/CT texture parameters. Clin Nucl Med. 2019;44(1):21–9. doi: 10.1097/RLU.0000000000002348. [DOI] [PubMed] [Google Scholar]
- 5.Jo J, Chung HW, So Y, Yoo YB, Park KS, Nam SE, et al. FDG PET/CT to predict recurrence of early breast invasive ductal carcinoma. Diagnostics. 2022;12(3):694. doi: 10.3390/diagnostics12030694. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Avril N, Rosé CA, Schelling M, Dose J, Kuhn W, Bense S, et al. Breast imaging with positron emission tomography and fluorine-18 fluorodeoxyglucose: Use and limitations. J Clin Oncol. 2000;18(20):3495–502. doi: 10.1200/JCO.2000.18.20.3495. [DOI] [PubMed] [Google Scholar]
- 7.Kumar R, Chauhan A, Zhuang H, Chandra P, Schnall M, Alavi A. Clinicopathologic factors associated with false negative FDG-PET in primary breast cancer. Breast Cancer Res Treat. 2006;98:267–74. doi: 10.1007/s10549-006-9159-2. [DOI] [PubMed] [Google Scholar]
- 8.Sueoka S, Sasada S, Masumoto N, Emi A, Kadoya T, Okada M. Performance of dedicated breast positron emission tomography in the detection of small and low-grade breast cancer. Breast Cancer Res Treat. 2021;187:125–33. doi: 10.1007/s10549-020-06088-1. [DOI] [PubMed] [Google Scholar]
- 9.Nahmias C, Wahl LM. Reproducibility of standardized uptake value measurements determined by 18F-FDG PET in malignant tumors. J Nucl Med. 2008;49(11):1804–8. doi: 10.2967/jnumed.108.054239. [DOI] [PubMed] [Google Scholar]
- 10.Pan T, Einstein SA, Kappadath SC, Grogg KS, Lois Gomez C, Alessio AM, et al. Performance evaluation of the 5-Ring GE Discovery MI PET/CT system using the national electrical manufacturers association NU 2-2012 Standard. Med Phys. 2019;46(7):3025–33. doi: 10.1002/mp.13576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Karlberg AM, Sæther O, Eikenes L, Goa PE. Quantitative comparison of PET performance-siemens biograph mCT and mMR. EJNMMI Phys. EJNMMI Physics. 2016; 3(1):1–4. doi: 10.1186/s40658-016-0142-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Yano F, Itoh M, Hirakawa H, Yamamoto S, Yoshikawa A, Hatazawa J. Diagnostic Accuracy of Positron Emission Mammo-graphy with 18F-fluorodeoxyglucose in Breast Cancer Tumor of Less than 20 mm in Size. Asia Ocean J Nucl Med Biol. 2019;7(1):13–21. doi: 10.22038/AOJNMB.2018.31101.1213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Satoh Y, Kawamoto M, Kubota K, Murakami K, Hosono M, Senda M, et al. Clinical practice guidelines for high-resolution breast PET, 2019 edition. Ann Nucl Med. 2021;35(3):406–14. doi: 10.1007/s12149-021-01582-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Thompson CJ, Murthy K, Picard Y, Weinberg IN, Mako R. Positron emission mammography (PEM): A promising technique for detecting breast cancer. IEEE Trans Nucl Sci. 1995;42(4):1012–7. [Google Scholar]
- 15.Miyake KK, Kataoka M, Ishimori T, Matsumoto Y, Torii M, Takada M, et al. A proposed dedicated breast pet lexicon: Standardization of description and reporting of radiotracer uptake in the breast. Diagnostics. 2021;11(7):1267. doi: 10.3390/diagnostics11071267. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Miyake KK, Nakamoto Y, Togashi K. Current status of dedicated breast PET imaging. Curr Radiol Rep. 2016;4:16. [Google Scholar]
- 17.Miyake KK, Matsumoto K, Inoue M, Nakamoto Y, Kanao S, Oishi T, et al. Performance evaluation of a new dedicated breast PET scanner using NEMA NU4-2008 standards. J Nucl Med. 2014;55(7):1198–203. doi: 10.2967/jnumed.113.131565. [DOI] [PubMed] [Google Scholar]
- 18.Satoh Y, Motosugi U, Imai M, Onishi H. Comparison of dedicated breast positron emission tomography and whole-body positron emission tomography/computed tomography images: a common phantom study. Ann Nucl Med. 2020;34:119–27. doi: 10.1007/s12149-019-01422-0. [DOI] [PubMed] [Google Scholar]
- 19.Berg WA, Madsen KS, Schilling K, Tartar M, Pisano ED, Larsen LH, et al. Breast cancer: Comparative effectiveness of positron emission mammography and MR imaging in presurgical planning for the ipsilateral breast. Radiology. 2011;258(1):59–72. doi: 10.1148/radiol.10100454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Sasada S, Masumoto N, Goda N, Kajitani K, Emi A, Kadoya T, et al. Dedicated breast PET for detecting residual disease after neoadjuvant chemotherapy in operable breast cancer: A prospective cohort study. Eur J Surg Oncol. 2018;44(4):444–8. doi: 10.1016/j.ejso.2018.01.014. [DOI] [PubMed] [Google Scholar]
- 21.Yuge S, Miyake KK, Ishimori T, Kataoka M, Matsumoto Y, Fujimoto K, et al. Reproducibility assessment of uptake on dedicated breast PET for noise discrimination. Ann Nucl Med. . 2023; 37:121 –30. doi: 10.1007/s12149-022-01809-6. [DOI] [PubMed] [Google Scholar]
- 22.Buades A, Coll B, Morel JM. A non-local algorithm for image denoising. Proc - 2005 IEEE;2005; 2:60–5. [Google Scholar]
- 23.Teoh EJ, McGowan DR, Macpherson RE, Bradley KM, Gleeson F V. Phantom and clinical evaluation of the Bayesian penalized likelihood reconstruction algorithm Q Clear on an LYSO PET/CT system. J Nucl Med. 2015;56(9):1447–52. doi: 10.2967/jnumed.115.159301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Satoh Y, Imai M, Hirata K, Asakawa Y, Ikegawa C, Onishi H. Optimal relaxation parameters of dynamic row-action maximum likelihood algorithm and post-smoothing filter for image reconstruction of dedicated breast PET. Ann Nucl Med. 2021;35:608–16. doi: 10.1007/s12149-021-01604-9. [DOI] [PubMed] [Google Scholar]
- 25.Schaefferkoetter J, Yan J, Ortega C, Sertic A, Lechtman E, Eshet Y, et al. Convolutional neural networks for improving image quality with noisy PET data. EJNMMI Res. 2020;10(1):105. doi: 10.1186/s13550-020-00695-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Weyts K, Lasnon C, Ciappuccini R, Lequesne J, Corroyer-Dulmont A, Quak E, et al. Artificial intelligence-based PET denoising could allow a two-fold reduction in [18F]FDG PET acquisition time in digital PET/CT. Eur J Nucl Med Mol Imaging. 2022;49(11):3750–60. doi: 10.1007/s00259-022-05800-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Mehranian A, Wollenweber SD, Walker MD, Bradley KM, Fielding PA, Su KH, et al. Image enhancement of whole-body oncology [18F]-FDG PET scans using deep neural networks to reduce noise. Eur J Nucl Med Mol Imaging. 2022;49(2):539–49. doi: 10.1007/s00259-021-05478-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Spuhler K, Serrano-Sosa M, Cattell R, DeLorenzo C, Huang C. Full-count PET recovery from low-count image using a dilated convolutional neural network. Med Phys. 2020;47(10):4928–38. doi: 10.1002/mp.14402. [DOI] [PubMed] [Google Scholar]
- 29.Zhang K, Zuo W, Chen Y, Meng D, Zhang L. Beyond a Gaussian denoiser: Residual learning of deep CNN for image denoising. IEEE Trans Image Process. 2017;26(7):3142–55. doi: 10.1109/TIP.2017.2662206. [DOI] [PubMed] [Google Scholar]
- 30.Liu P, Fang R. Wide Inference Network for Image denoising via learning pixel-distribution Prior. arXiv. 2017:1707. [Google Scholar]
- 31.Ioffe S, Szegedy C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. 32nd Int Conf Mach Learn ICML. 2015:448–56. [Google Scholar]
- 32.He K, Zhang X, Ren S, Sun J. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. Proc IEEE Int Conf Comput Vis. 2015:1026–34. [Google Scholar]
- 33.Kingma DP, Ba JL. Adam: A method for stochastic optimization. 3rd Int Conf Learn Represent ICLR. 2015: 1–15. [Google Scholar]
- 34.Watanabe M, Nakamoto Y, Nakamoto R, Ishimori T, Saga T, Togashi K. Qualitative and quantitative assessment of nonlocal means reconstruction algorithm in a flexible PET scanner. Am J Roentgenol. 2021;216(2):486–93. doi: 10.2214/AJR.19.22245. [DOI] [PubMed] [Google Scholar]
- 35.Gong K, Guan J, Liu CC, Qi J. PET image denoising using a deep neural network through fine tuning. IEEE Trans Radiat Plasma Med Sci. 2018;3(2):153–61. doi: 10.1109/TRPMS.2018.2877644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Teymurazyan A, Riauka T, Jans HS, Robinson D. Properties of noise in positron emission tomography images recon-structed with filtered-backprojection and row-action maximum likelihood algorithm. J Digit Imaging. 2013;26:447–56. doi: 10.1007/s10278-012-9511-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kim JH, Ahn IJ, Nam WH, Chang Y, Ra JB. Post-filtering of PET image based on noise characteristic and spatial sensitivity distribution. IEEE Nucl Sci Symp Conf Rec. 2013:1–3. [Google Scholar]
- 38.Wang T, Sun M, Hu K. Dilated deep residual network for image denoising. Proc - Int Conf Tools with Artif Intell ICTAI. 2017:1272–9. [Google Scholar]
- 39.Tian C, Xu Y, Fei L, Wang J, Wen J, Luo N. Enhanced CNN for image denoising. CAAI Trans Intell Technol. 2019;49(1):17–23. [Google Scholar]
- 40.Zhou L, Schaefferkoetter JD, Tham IWK, Huang G, Yan J. Supervised learning with cyclegan for low-dose FDG PET image denoising. Med Image Anal. 2020;65:101770. doi: 10.1016/j.media.2020.101770. [DOI] [PubMed] [Google Scholar]
- 41.Panda A, Naskar R, Rajbans S, Pal S. A 3D wide residual network with perceptual loss for brain MRI image denoising In2019 10th International Conference on Computing, Communication and Networking Techno-logies (ICCCNT) IEEE. 2019:1– 7. [Google Scholar]
- 42.Gholizadeh-Ansari M, Alirezaie J, Babyn P. Deep learning for low-dose CT denoising using perceptual loss and edge detection layer. J Digit Imaging. 2020;;33:504–15. doi: 10.1007/s10278-019-00274-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Ataei S, Alirezaie J, Babyn P. Low dose ct denoising using dilated residual learning with perceptual loss and structural dissimilarity In2020 IEEE 5th Middle East and Africa Conference on Biomedical Engineering (MECBME) IEEE. 2020:1–5. [Google Scholar]
- 44.Shreyamsha Kumar BK. Image denoising based on non-local means filter and its method noise thresholding. Signal, Image Video Process. 2013;7:1211–27. [Google Scholar]
- 45.Zhang X, Hou G, Jianhua M, Yang W, Lin B, Xu Y, et al. Denoising MR images using non-local means filter with combined patch and pixel similarity. PLoS One. 2014;9(6):e100240. doi: 10.1371/journal.pone.0100240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Barrett HH, Wilson DW, Tsui BMW. Noise properties of the EM algorithm. I. Theory. Phys Med Biol. 1994;39(5):833–46. doi: 10.1088/0031-9155/39/5/004. [DOI] [PubMed] [Google Scholar]