Skip to main content
PLOS One logoLink to PLOS One
. 2017 Apr 5;12(4):e0175106. doi: 10.1371/journal.pone.0175106

Image denoising substantially improves accuracy and precision of intravoxel incoherent motion parameter estimates

Carolin Reischauer 1,2,*, Andreas Gutzeit 1,3
Editor: Yuanquan Wang4
PMCID: PMC5381911  PMID: 28380018

Abstract

Applicability of intravoxel incoherent motion (IVIM) imaging in the clinical setting is hampered by the limited reliability in particular of the perfusion-related parameter estimates. To alleviate this problem, various advanced postprocessing methods have been introduced. However, the underlying algorithms are not readily available and generally suffer from an increased computational burden. Contrary, several computationally fast image denoising methods have recently been proposed which are accessible online and may improve reliability of IVIM parameter estimates. The objective of the present work is to investigate the impact of image denoising on accuracy and precision of IVIM parameter estimates using comprehensive in-silico and in-vivo experiments. Image denoising is performed with four different algorithms that work on magnitude data: two algorithms which are based on nonlocal means (NLM) filtering, one algorithm that relies on local principal component analysis (LPCA) of the diffusion-weighted images, and another algorithms that exploits joint rank and edge constraints (JREC). Accuracy and precision of IVIM parameter estimates is investigated in an in-silico brain phantom and an in-vivo ground truth as a function of the signal-to-noise ratio for spatially homogenous and inhomogenous levels of Rician noise. Moreover, precision is evaluated using bootstrap analysis of in-vivo measurements. In the experiments, IVIM parameters are computed a) by using a segmented fit method and b) by performing a biexponential fit of the entire attenuation curve based on nonlinear least squares estimates. Irrespective of the fit method, the results demonstrate that reliability of IVIM parameter estimates is substantially improved by image denoising. The experiments show that the LPCA and the JREC algorithms perform in a similar manner and outperform the NLM-related methods. Relative to noisy data, accuracy of the IVIM parameters in the in-silico phantom improves after image denoising by 76–79%, 79–81%, 84–99% and precision by 74–80%, 80–83%, 84–95% for the perfusion fraction, the diffusion coefficient, and the pseudodiffusion coefficient, respectively, when the segmented fit method is used. Beyond that, the simulations reveal that denoising performance is not impeded by spatially inhomogeneous levels of Rician noise in the image. Since all investigated algorithms are freely available and work on magnitude data they can be readily applied in the clinical setting which may foster transition of IVIM imaging into clinical practice.

Introduction

Using diffusion-weighted imaging (DWI), the apparent diffusion coefficient (ADC) can be calculated which is a measure of tissue diffusivity and has been shown to be a viable biomarker for various pathological conditions. For instance, the ADC shows great promise for characterizing tumor masses and evaluating response to therapy at an early stage in head and neck tumors [1, 2] and brain cancers [3]. However, it has long been recognized that the ADC integrates the effects of diffusion and perfusion due to the pseudorandom organization of the capillary network at the voxel level [4, 5]. For this reason, Le Bihan et al. proposed the concept of intravoxel incoherent motion (IVIM) imaging. Signal attenuation due to diffusion weighting is thereby modelled as [4, 5]:

S(b)=S0((1f)exp(bD)+fexp(bD*)), (1)

where S0 relates to the signal without diffusion weighting, f denotes the perfusion fraction, D is the diffusion coefficient, and D* corresponds to the pseudodiffusion coefficient. The first term describes signal decay due to diffusion in the intra- and extracellular tissue compartments and the second term relates to the so-called pseudodiffusion phenomenon. Due to the pseudodiffusion coefficient D* typically being an order of magnitude greater than the diffusion coefficient D, both compartments can be separated. Le Bihan and Turner established a link between the product of the perfusion fraction and the pseudodiffusion coefficient and the relative perfusion or blood flow [6]. In this manner, IVIM imaging permits separating the effects of diffusion and perfusion and may lead to a more comprehensive and differentiated understanding of the underlying tissue pathology and of alterations that occur in response to treatment.

However, clinical applicability of IVIM imaging is hampered by the limited reliability in particular of the perfusion-related parameter estimates if the biexponential fit is performed using iterative nonlinear least squares methods [7, 8]. One possibility to increase robustness of the results is to compute averaged values over regions of interest rather than individual voxels [7, 8]. Another approach that preserves spatial resolution is to use a segmented fit method as proposed by Pekar et al. [7]. It is likely the most frequently used algorithm in IVIM analysis. It relies on the fact that the decay rate resulting from pseudodiffusion is usually an order of magnitude greater than that stemming from tissue diffusion. For this reason, pseudodiffusion dominantly affects signal attenuation at lower b-values while it accounts for only a small proportion of the measured signal at higher b-values. Thus, the diffusion coefficient can be derived from a monoexponential fit of the high b-value images (typically > 200 s/mm2). Thereafter, the perfusion fraction is determined using the intercept obtained in the fit and the actual measurement without diffusion weighting. Finally, the pseudodiffusion coefficient is derived from a biexponential fit using the previously calculated values of the diffusion coefficient and the perfusion fraction.

Alternatively, it was shown that estimation uncertainty may be reduced relative to nonlinear least squares methods when a Bayesian probability approach is used for model fitting [912]. Furthermore, Freiman et al. have shown that combining a spatially-constrained incoherent motion model with an iterative fusion bootstrap solver results in more precise estimates of IVIM parameters [13]. However, the underlying algorithms are not readily available and suffer from an increased computational burden. Contrary, several computationally fast image denoising methods have recently been proposed which are accessible online (see Materials and Methods section) and may improve reliability of IVIM parameter estimates [1419]. Up to now, image denoising as a means to increase accuracy and precision of IVIM modeling has not been evaluated and will be investigated in the current work.

Image denoising can be applied to either complex or magnitude diffusion-weighted (DW) images. Even though denoising of complex images allows for an easier modeling of the noise characteristics, it is preferable from a practical point of view to denoise magnitude images since they are widely available and require less storage. In many cases, noise in DW magnitude images follows a Rician distribution such as in images from a single coil [2022] and in multiple-coil images reconstructed with sensitivity encoding (SENSE) [23]. Based on this assumption several algorithms have been proposed [2426]. To improve denoising performance, a variety of constraints that exploit prior information can be incorporated. A well-known approach uses nonlocal similarities within the image [27]. The high computational burden that is associated with the method can be diminished by an optimized blockwise implementation of the algorithm [14]. The original version of this algorithm relied on the assumption of a Gaussian distribution of the noise in the image. To facilitate applicability to DW images, Wiest-Dasslé et al. proposed an adapted version which incorporates a Rician distribution of the noise [15]. In the following, this algorithm will be referred to as the nonlocal means (NLM) algorithm. The NLM algorithm assumes spatially homogenous levels of Rician noise. In practice, however, DWI acquisition is typically combined with parallel imaging to diminish susceptibility-induced image artifacts and T2* blurring [28, 29] which leads to varying levels of Rician noise across the image. For this reason, Manjón et al. proposed the adaptive nonlocal means (ANLM) algorithm [16]. As part of the denoising procedure, the ANLM algorithm inherently determines estimates of the noise variance on a voxelwise basis.

Alternative approaches exploit the multidirectional nature of DW images for image denoising. For instance, Manjón et al. demonstrated that magnitude DW images can be effectively denoised using local principal component analysis (LPCA) [17]. The LPCA algorithm assumes a Rician distribution of the noise in the image and accounts for spatially varying noise patterns. Beyond that, Lam et al. showed that the signal-to-noise ratio (SNR) of DW images can be effectively improved using joint rank and edge constraints (JREC) [18, 19]. Contrary to the aforementioned algorithms, the noisy magnitude images are modeled by a noncentral χ distribution of which the Rician distribution is a special case.

The aim of the present study is to evaluate the impact of image denoising on accuracy and precision of IVIM parameter estimates using comprehensive in-silico and in-vivo experiments. Image denoising is performed using the NLM, the ANLM, the JREC, and the LPCA algorithms. All investigated algorithms are freely available and work on magnitude data. Thus, they can be readily applied in the clinical setting which may foster transition of IVIM imaging into clinical practice.

Materials and methods

Denoising performance of the NLM, the ANLM, the LPCA (available for download at: https://sites.google.com/site/pierrickcoupe/softwares/denoising-for-medical-imaging/mri-denoising/mri-denoising-software, accessed 23 May 2016), and the JREC (available for download at: http://mri.beckman.uiuc.edu/software.html, accessed 23 May 2016) algorithms was assessed with regard to accuracy and precision of the IVIM parameter estimates. IVIM parameters were computed twofold a) by using the segmented fit method and b) by performing a biexponential fit of the entire attenuation curve based on nonlinear least squares estimates. The latter will hereafter be referred to as a full biexponential fit.

In-silico simulations

Denoising performance of the algorithms was investigated in a three-dimensional brain phantom (matrix = 144 x 144 x 10) based on the discrete version of the digital brain phantom created by Collins et al. (available for download at: http://brainweb.bic.mni.mcgill.ca/brainweb/anatomic_normal.html, accessed 23 May 2016) [30]. The IVIM parameters in the phantom were set to recent literature values (Table 1) [31]. Based on Eq 1, DW images of the brain phantom were computed with b-values of 0, 15, 30, 45, 60, 100, 250, 400, 550, 700, 850, 1000 s/mm2. Different levels of spatially homogeneous and inhomogeneous Rician noise were simulated by adding white Gaussian noise to the real and imaginary parts of the DW images resulting in a SNR range of = 20–100 in the non-DW images. Spatially varying noise distributions were generated similar to those described by Tabelow et al. [32]. In the simulations, the noise standard deviation was set relative to the joint average of the signals of gray and white matter in the non-DW images of the brain phantom. In clinical practice, DW images are usually acquired along three orthogonal directions to compensate for the possible influence of the relative orientation between tissue and imaging system [33]. For this reason, three noisy DW images were computed at each b-value and subsequently combined into a trace image.

Table 1. IVIM Parameters in the In-Silico Brain Phantom.

f D D*
[no units] [x10-3 mm2/s] [x10-3 mm2/s]
Gray matter 0.14 0.84 8.2
White matter 0.07 0.77 7.9

An estimate of the noise variance of each DW image series was determined using the median absolute deviation estimator adapted for Rician noise [34] and used as input parameter for the NLM and the JREC algorithms. The rank model order and the regularization parameter in the JREC algorithm were optimized at each SNR such that accuracy of the IVIM parameter estimates was maximized compared to the gold standard. The other image denoising algorithms were run with their default parameters.

Each noisy DW image series was denoised using all investigated algorithms and the IVIM parameters calculated. For comparison with previously published data, using the segmented fit method, the diffusion coefficients D were computed from the DW images with b > 250 s/mm2 [31]. In addition, IVIM parameters were computed from the noisy and noise-free DW image series, respectively. The latter were treated as gold standard for determining accuracy of parameter estimation. For each SNR, the simulation was repeated 50 times. Thereafter, accuracy and precision were computed jointly over gray and white matter.

The simulations were implemented and run in MATLAB (MathWorks, Natick, MA, USA, Release 2015a). To decrease computation time, the code was parallelized and distributed using MATLAB’s Parallel Computing Toolbox and Distributed Computing Server for Amazon EC2 (MathWorks, Natick, MA, USA, Release 2015a). Nonlinear least squares fitting for parameter estimation was implemented using the levmar package (Version 2.5) which comprises a C/C++ implementation of the Levenberg-Marquardt algorithm [35].

In-vivo experiments

DWI measurements

For in-vivo comparison of the image denoising algorithms, DWI data of the brain of a healthy volunteer were acquired on a 3 T MR scanner (Achieva, Release 3.2.1, Philips Healthcare, Best, the Netherlands) using a 32-channel receive-only head coil array (Philips Healthcare, Best, the Netherlands). Data acquisition was approved by the Cantonal Research Ethics Committee and written informed consent was obtained from the volunteer. A standard DW spin-echo echo planar imaging (EPI) sequence with 12 b-values (b-values of 0, 15, 30, 45, 60, 100, 250, 400, 550, 700, 850, 1000 s/mm2) separately applied along three orthogonal directions was used. The scan parameters were as follows: field of view = 220 x 220 mm2, image matrix = 140 x 140, slices = 16, slice thickness = 5 mm, no slice gap, partial Fourier encoding = 75%, SENSE factor = 2, TR = 2000 ms, TE = 64 ms, number of signal averages (NSA) = 1, scan duration = 72 s. The scan was repeated 50 times to permit evaluating precision of IVIM parameter estimation using the bootstrap method [36]. Eddy current-induced image warping and motion were corrected and all data sets coregistered using a correlation-based affine registration algorithm [37]. The SNRs in a single measurement were determined according to the National Electrical Manufacturers Association [38]. Thereby, the SNR was calculated on a voxelwise basis using the ratio of the mean signal intensity of a given voxel over all scan repetitions divided by the standard deviation around that mean.

Bootstrap analysis of in-vivo data

The non-DW images were averaged and calculated for the probability of belonging to either gray matter, white matter or cerebrospinal fluid on a voxelwise basis using SPM (Statistical Parametric Mapping, SPM8, Wellcome Department of Cognitive Neurology, London, UK). The white matter probability mask was smoothed using a three-dimensional Gaussian kernel with a full width at half maximum of 3 mm to mitigate partial volume effects [31, 39]. A joint gray and white matter mask was thereafter created at a threshold of 0.9.

Bootstrapping was used to assess in-vivo precision of IVIM parameter estimates as a function of the NSA (NSA = 2–25). For each NSA, the corresponding number of scan repetitions was randomly selected out of the 50 repetitions and averaged to create a bootstrap estimate of the DW image series. The procedure was repeated to generate a total of 50 bootstrap estimates with replacement at each NSA. It should be noted that scan repetitions rather than individual b-values were randomly picked and averaged to account for signal drifts as a results of temporal scanner instability [40]. Each bootstrap estimate was denoised using all evaluated algorithms and the IVIM parameters calculated. In addition, IVIM parameter estimates were also derived directly from the noisy bootstrap estimates of the DW image series. In the analysis, the rank model order in the JREC algorithm was set to 4 and the regularization parameter was chosen by visual inspection of the IVIM parameter maps to avoid oversmoothing [18, 19].

Surrogate in-vivo ground truth

Due to the absence of a gold standard, accuracy of IVIM estimation cannot be determined in vivo. For comparison with the in-silico simulations, a ground truth was generated from the in-vivo data using a similar approach as proposed by Zhou et al. [41]. In the process, a) the DW image series were eddy-current corrected, motion compensated and coregistered, b) the DW image series were denoised using the JREC algorithm and subsequently averaged, and c) the IVIM parameter maps were calculated. The JREC algorithm was chosen for post processing since it allows a tradeoff between denoising performance and image smoothing. For computation of this ground truth, a relatively large regularization parameter was chosen to maximize noise removal while accepting some loss of image detail. Based on the thus generated in-vivo ground truth, the same simulations as in the in-silico phantom were run, i.e. the same b-values were assumed and different levels of spatially homogeneous and inhomogeneous Rician noise were added resulting in a SNR range of = 20–100 in the non-DW images. It should be noted that smaller regularization parameters were used when the JREC algorithm was used for image denoising in the simulations than when the ground truth was created.

Measures of algorithm performance

In addition to a visual comparison of noisy and denoised IVIM parameter maps, two error metrics were computed to assess accuracy and precision of IVIM parameter estimation. To quantify parameter estimation accuracy, the root-mean-square error (RMSE) of the IVIM parameter estimates was computed:

RMSE=m=1M(pm-p^m)2M, (2)

where pm and p^m correspond to the IVIM parameter estimates in voxel m computed from the gold standard and the noisy/denoised DW images, respectively. M corresponds to the total number of gray and white matter voxels in the data set. For each SNR/NSA, the RMSE was calculated jointly over gray and white matter and subsequently averaged across all simulations (n = 50).

At each SNR/NSA, precision was assessed by computation of the coefficient of variation (CV):

CV=100%m=1Mσm/μmM, (3)

where μm denotes the average value and σm the corresponding standard deviation of the IVIM parameter estimate in voxel m across all simulations (n = 50) and M relates to the total number of gray and white matter voxels in the data set.

Results

Visual comparison of IVIM parameter maps

For visual comparison, Fig 1 depicts IVIM parameter maps computed from noisy and denoised DW image series when the segmented fit method is used. Quality of the IVIM parameter maps notably improves after image denoising. With regard to the pseudodiffusion coefficient, quality of the parameter map is highest when the JREC algorithm is utilized. However, it should be noted that relative to the gold standard the pseudodiffusion coefficient appears to be systematically underestimated in brain white matter (Fig 1, third row).

Fig 1. Parameter maps of the perfusion fraction f (first row), the diffusion coefficient D (second row), and the pseudodiffusion coefficient D* (third row) in the in-silico brain phantom computed from noisy (first column) and denoised DW images using the NLM (second column), the ANLM (third column), the JREC (fourth column), and the LPCA algorithms (fifth column).

Fig 1

Stationary homogeneous Rician noise was added to obtain a SNR of 40 and the IVIM parameters were computed using the segmented fit method. The gold standard is depicted for reference.

By way of example, Fig 2 shows IVIM parameter maps computed from noisy and denoised DW image series when a full biexponential fit is performed. Visual inspection reveals that the highest image quality is achieved when using the LPCA algorithm for image denoising. However, even then image quality of the pseudodiffusion coefficient is relatively poor. Furthermore, it can be observed that both NLM-related algorithms lead to errors in the parameter estimates at transitions between different types of tissue.

Fig 2. Parameter maps of the perfusion fraction f (first row), the diffusion coefficient D (second row), and the pseudodiffusion coefficient D* (third row) in the in-silico brain phantom computed from noisy (first column) and denoised DW images using the NLM (second column), the ANLM (third column), the JREC (fourth column), and the LPCA algorithms (fifth column).

Fig 2

Stationary homogeneous Rician noise was added to obtain a SNR of 60 and the IVIM parameters were computed using a full biexponential fit. The gold standard is depicted for reference.

Accuracy of IVIM parameters after image denoising

In-silico brain phantom simulations

Fig 3 depicts the RMSEs of the IVIM parameter estimates in the in-silico brain phantom as a function of the SNR. It can be appreciated that independent of the fit method, accuracy of IVIM parameter estimation considerably improves after image denoising, in particular at low and moderate SNRs. The JREC and the LPCA algorithms result in similar RMSEs and generally perform better than the NLM-related methods. Relative to the noisy DW image series, accuracy improves by 76–79%, 79–81%, and 84–99% for the perfusion fraction f, the diffusion coefficient D, and the pseudodiffusion coefficient D*, respectively, across the simulated SNR range, if the DW image series is denoised with the LPCA algorithm prior to IVIM parameter estimationusing the segmented fit method (Fig 3A). Thereby, denoising performance is not impeded by spatially varying levels of Rician noise (Fig 3B). Despite image denoising, accuracy of IVIM parameter estimation based on a full biexponential fit is much lower than when the segmented fit method is used. Both fit methods achieve comparable results only at high SNRs (Fig 3A and 3C). When the JREC algorithm is used for image denoising it should be noted with regard to the pseudodiffusion coefficient that a constant RMSE is observed that persists at high SNRs (Fig 3, third row). This finding is a consequence of the systematic error in the parameter estimates in brain white matter, as discussed further up. For the pseudodiffusion coefficient D*, a critical SNR can be observed below which the RMSE increases strongly (Fig 3, third row), this is likely caused by the Levenberg-Marquardt algorithm converging towards a local instead of the global minimum.

Fig 3. RMSEs of the perfusion fraction f (first column), the diffusion coefficient D (second column), and the pseudodiffusion coefficient D* (third column) in the in-silico brain phantom as a function of the SNR.

Fig 3

Results of the simulations when (a) stationary and (b) spatially varying Rician noise is added and the segmented fit method is utilized, and (c) when stationary Rician noise is added and a full biexponential fit is performed. Please note that the RMSEs of the pseudodiffusion coefficient (third column) are displayed on a logarithmic scale for clarity. At each SNR, the RMSEs were calculated as the average across all simulations (n = 50).

In-vivo ground truth simulations

Due to the lack of a gold standard, accuracy of IVIM parameter estimation cannot be determined in vivo. As noted above, a surrogate in-vivo ground truth was generated from the in-vivo measurements to assess transferability of the results of the in-silico simulations to the in-vivo situation. Table 2 summarizes the IVIM parameters in the in-vivo ground.

Table 2. IVIM Parameters in the In-Vivo Ground Truth.
f D D*
[no units] [x10-3 mm2/s] [x10-3 mm2/s]
Gray matter 0.11 ± 0.06 0.76 ± 0.20 7.7 ± 4.2
White matter 0.07 ± 0.03 0.72 ± 0.20 7.2 ± 4.5

Please note, values in the in-vivo ground truth are expressed as mean ± standard deviation.

With regard to accuracy, the results of the in-silico experiments are corroborated in the in-vivo ground truth. By way of example, Fig 4 depicts the RMSEs of the IVIM parameter estimates in the in-vivo ground truth when spatially homogeneous Rician noise is added and the segmented fit method is utilized. The results agree well with those of the in-silico brain phantom (Figs 3A and 4). However, it should be noted that the RMSEs in the in-vivo ground truth were generally somewhat higher than in the in-silico brain phantom.

Fig 4. RMSEs of the perfusion fraction f (first column), the diffusion coefficient D (second column), and the pseudodiffusion coefficient D* (third column) in the in-vivo ground truth as a function of the SNR when spatially homogeneous Rician noise is added and a segmented fit is performed.

Fig 4

Please note that the RMSEs of the pseudodiffusion coefficient (third column) are displayed on a logarithmic scale for clarity. At each SNR, the RMSEs were calculated as the average across all simulations (n = 50).

Precision of IVIM parameters after image denoising

In-silico brain phantom simulations

Fig 5 depicts the CVs of the IVIM parameter estimates in the in-silico brain phantom as a function of the SNR. As can be seen, image denoising considerably improves precision of the estimates. In agreement with the observations made with regard to accuracy, the JREC and the LPCA algorithms outperform the NLM-related methods. Relative to the noisy DW image series, precision is improved by 74–80%, 80–83%, and 84–95% for the perfusion fraction f, the diffusion coefficient D, and the pseudodiffusion coefficient D*, respectively, across the simulated SNR range, if the DW image series is denoised with the LPCA algorithm prior to IVIM parameter estimation using the segmented fit method (Fig 5A). Furthermore, denoising performance is not impeded when spatially varying levels of Rician noise are added (Fig 5B). Similar to before, precision of IVIM parameter estimation based on a full biexponential fit is much lower than when the segmented fit method is applied. Both fit methods achieve comparable results only at high SNRs (Fig 5A and 5C).

Fig 5. CVs of the perfusion fraction f (first column), the diffusion coefficient D (second column), and the pseudodiffusion coefficient D* (third column) in the in-silico brain phantom as a function of the SNR.

Fig 5

Results of the simulations when (a) stationary and (b) spatially varying Rician noise is added and the segmented fit method is utilized, and (c) when stationary Rician noise is added and a full biexponential fit is performed. Please note that the CVs of the pseudodiffusion coefficient (third column) are displayed on a logarithmic scale for clarity.

In-vivo ground truth simulations

With regard to precision, the results of the in-silico experiments are corroborated in the in-vivo ground truth. By way of example, Fig 6 depicts the CVs of the IVIM parameter estimates in the in-vivo ground truth when spatially homogeneous Rician noise is added and the segmented fit method is utilized. The results are in agreement with those of the in-silico brain phantom (Figs 5A and 6). However, it should be noted that precision of the IVIM parameter estimates after image denoising with the JREC algorithm was generally higher in the in-silico brain phantom than in the in-vivo ground truth (Figs 5A and 6).

Fig 6. CVs of the perfusion fraction f (first column), the diffusion coefficient D (second column), and the pseudodiffusion coefficient D* (third column) in the in-vivo ground truth as a function of the SNR when spatially homogeneous Rician noise is added and a segmented fit is performed.

Fig 6

Please note that the CVs of the pseudodiffusion coefficient (third column) are displayed on a logarithmic scale for clarity.

Bootstrap analysis of the in-vivo measurements

For visual comparison, Fig 7 depicts IVIM parameter maps computed from noisy and denoised in-vivo DW images when the segmented fit method is applied. The parameter maps were computed from a single measurement (NSA = 1, scan duration = 72s). Quality of the parameters maps clearly improves after image denoising. As before, visual inspection reveals that the JREC and the LPCA algorithms perform better that the NLM-related methods. The artifact that is apparent on the pseudodiffusion coefficient maps corresponds to a residual aliasing artifact after SENSE reconstruction likely caused by inaccurate estimation of the coil sensitivity maps.

Fig 7. Parameter maps of the perfusion fraction f (first row), the diffusion coefficient D (second row), and the pseudodiffusion coefficient D∗ (third row) computed from noisy (first column) and denoised in-vivo DW images using the NLM (second column), the ANLM (third column), the JREC (fourth column), and the LPCA algorithms (fifth column).

Fig 7

The results were computed from a single measurement (NSA = 1, scan duration = 72 s).

The SNR computed jointly over gray and white matter amounted on average to 22.92 ± 4.12 in the non-DW images (NSA = 1). Please note that when neglecting diffusion, the SNR in the trace DW images is increased by a factor of square root of three relative to the non-DW images since diffusion weighting was sequentially applied along three orthogonal directions. Taking into consideration that the SNR scales with the square root of the NSA, the NSA range (NSA = 2–25) in the bootstrap analysis of the in-vivo measurements translates into a SNR range that agrees well with the in-silico simulations. By way of example, S1 Fig of the Supporting Information shows in-vivo SNR maps and S1 Table summarizes the results of the in-vivo SNR measurements.

Finally, Fig 8 depicts the results of the bootstrap analysis. In-vivo precision of the IVIM parameter estimates is slightly lower but generally agrees well with the in-silico simulations (Figs 5A, 5B and 8). Precision of the perfusion fraction f considerably increases after image denoising with either the LPCA or the JREC algorithm (CVs ≤ 20% for NSA ≥ 5). Moreover, it should be stressed that the CVs of the diffusion coefficient D were smaller than 5% across the entire NSA range after image denoising using the aforementioned algorithms.

Fig 8. Results of the bootstrap analysis of the in-vivo data: CVs of the perfusion fraction f (first column), the diffusion coefficient D (second column), and the pseudodiffusion coefficient D∗ (third column) as a function of the NSA when IVIM parameter estimation is performed using the segmented fit method.

Fig 8

Please note that the CVs of the pseudodiffusion coefficient (third column) are displayed on a logarithmic scale for clarity.

Discussion

In the present study, the impact of image denoising on the reliability of IVIM parameter estimates was investigated using comprehensive in-silico and in-vivo experiments. IVIM parameters were computed by using either the segmented fit method or by performing a biexponential fit of the entire attenuation curve. Irrespective of the fit method, the results demonstrate that both accuracy and precision are substantially improved after image denoising. It was shown that the LPCA and the JREC algorithms perform in a similar manner and outperform the NLM-related methods. Despite image denoising, reliability of IVIM parameter estimates is generally markedly lower when a full biexponential rather than a segmented fit is utilized. Both methods achieve similar results only at high SNRs which are unlikely to be obtained in most clinical applications.

In the present study, precision of IVIM parameter estimates was evaluated in an in-silico brain phantom. The results of these simulations were verified by bootstrap analysis of in-vivo measurements. Due to the lack of a gold standard, accuracy of the IVIM parameter estimates could not be determined in vivo. To investigate transferability of the results of the in-silico simulations to the in-vivo situation, a ground truth was generated from the in-vivo measurements in a similar manner as proposed by Zhou et al. [41]. The IVIM parameters in the resulting in-vivo ground truth agreed well with literature values (see Tables 1 and 2) [31, 42]. The simulations that were run in the thus generated in-vivo ground truth corroborated the findings in the in-silico experiments. Finally, the results of the present work with regard to accuracy of IVIM parameter derived from noisy data agree well with recently published data [31].

In the simulations, accuracy was determined relative to the IVIM parameters computed from the noise-free images. It should be noted that there is a systematic error relative to the gold standard when the segmented fit method is used. This inaccuracy stems from a residual amount of signal due to perfusion even at higher b-values and is inherent to the method [7].

Performance of the image denoising algorithms relies on an accurate estimate of the noise variance. The NLM and the JREC algorithms require this estimate as an input parameter. For this purpose, an object-based method was utilized in the present work. While background-based noise estimation methods are easily applicable they rely on reasonably large background regions which are representative of the noise level in the object. This assumption might not be met in DWI since image acquisition is typically performed using EPI which is prone to ghosting artifacts due to the misalignment of even and odd echoes. These artifacts may lead to erroneous results if background-based noise estimation methods are used.

As noted above, DWI is usually performed in conjunction with parallel imaging to diminish susceptibility-induced image artifacts and T2* blurring [28, 29] which leads to spatially varying levels of Rician noise in the image. Among the algorithms studied, only the ANLM and the LPCA algorithms permit estimating the noise variance on a voxelwise basis. However, the JREC algorithm could be readily adapted to allow for spatially varying levels of noise by introducing a voxel-dependent noise variance in combination with a noise estimation method that allows for a voxelwise determination of the noise level [18]. However, the present results indicate that denoising performance is not diminished by spatially inhomogeneous levels of Rician noise.

In many cases, noise in magnitude MR images follows a Rician distribution such as in images from a single coil [2022] and in multiple-coil images reconstructed with SENSE [23]. However, for instance generalized autocalibrating partially parallel acquisition reconstruction with sum-of-squares combination follows a noncentral χ distribution. In this case, images can either be directly denoised using the JREC algorithm or image denoising may be performed prior to coil combination using the other evaluated denoising methods.

Reliability of the IVIM parameter estimates was evaluated assuming a DWI sequence with 12 b-values as described in Wu et al. [31]. While accuracy and precision may be increased by acquiring more b-values, the maximum number of b-values is in practice limited by time constraints in the clinical setting. Nevertheless, it should be noted that the simulations lead to comparable results when a large number of b-values (number of b-values = 25, data not shown) was assumed. For this reason, it is recommended to use image denoising as a postprocessing step independent of the number of acquired b-values.

The present work demonstrates that image denoising based on either the LPCA or the JREC algorithm results in more robust IVIM parameter estimates than when the NLM-related methods are used. Both algorithms make use of the multidirectional nature of DW images. Thus, misregistration among DW images with different b-values will lead to erroneous results. For this reason, it is important to correct in-vivo data for eddy current-induced image distortions and motion. To this end, a correlation-based affine registration algorithm was utilized in the present work [37].

In the in-vivo experiment, the parameters of the JREC algorithm were chosen by visual inspection of the IVIM parameter maps to avoid oversmoothing. In situations where a DW data set features particularly low SNRs, a higher regularization parameter may be chosen to increase noise removal at the expense of image detail. The NLM-related algorithms were run with their default parameters which were optimized in a previous work [14]. To increase denoising performance of the NLM-related methods Coupé et al. proposed a multiresolution framework with wavelet subband mixing [42]. However, preliminary simulations (data not shown) indicated only minor improvements in denoising performance at the cost of double the computational burden. For this reason, image denoising was performed without subband mixing in the present work.

Finally, it should be noted that image denoising could in principle be easily combined with parameter estimation using a Bayesian probability approach for model fitting. Previous work has shown that in this manner reliability of the IVIM parameters estimates may potentially be increased [912]. However, IVIM parameter estimation was in the present work performed by nonlinear least-squares fitting due to its computational efficiency and its widespread use.

In conclusion, the present work evaluated the impact of image denoising on the reliability of IVIM parameter estimates. Irrespective of the fit method, it was shown that both accuracy and precision are substantially improved when DW images are denoised prior to IVIM parameter fitting. It was observed that the LPCA and the JREC algorithms in conjunction with the segmented fit method result in the highest robustness of the IVIM parameter estimates. Both algorithms work on magnitude data and can thus be readily applied in the clinical setting which may foster transition of IVIM imaging into clinical practice.

Supporting information

S1 Fig. In-vivo SNR maps of three slices in the brain at b-values of 0, 100, 550, 1000 s/mm2.

Please note that diffusion weighting at each b-value was applied along three orthogonal directions and the images were subsequently combined into a trace image.

(DOCX)

S1 Table. Mean values and standard deviations of the SNRs in the in-vivo measurements jointly computed across gray and white matter (NSA = 1).

(DOCX)

S2 Fig. Flow chart depicting the simulation pipeline used in the present work.

In addition, links to the locations of the source code that is used in the simulations are provided.

(PDF)

Acknowledgments

The authors would like to thank Dr. Fan Lam from the Beckman Institute for Advanced Science and Technology at the University of Illinois at Urbana-Champaign for helpful discussions on image denoising. Moreover, the authors would like to thank the MR-Center of the University Hospital for Psychiatry and the Department of Child and Adolescent Psychiatry and Psychotherapy at the University of Zurich and in particular Dr. Philipp Staempfli for the opportunity to acquire the in-vivo DWI data.

Data Availability

All relevant data are within the paper and its Supporting Information files.

Funding Statement

The authors received no specific funding for this work.

References

  • 1.Chawla S, Kim S, Wang S, Poptani H. Diffusion-weighted imaging in head and neck cancers. Future Oncol. 2009;5(7):959–75. PubMed Central PMCID: PMCPMC2791671. 10.2217/fon.09.77 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Thoeny HC. Diffusion-weighted MRI in head and neck radiology: applications in oncology. Cancer Imaging. 2011;10:209–14. PubMed Central PMCID: PMCPMC3080114. 10.1102/1470-7330.2010.0030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Svolos P, Kousi E, Kapsalaki E, Theodorou K, Fezoulidis I, Kappas C, et al. The role of diffusion and perfusion weighted imaging in the differential diagnosis of cerebral tumors: a review and future perspectives. Cancer Imaging. 2014;14:20 PubMed Central PMCID: PMCPMC4331825. 10.1186/1470-7330-14-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Le Bihan D, Breton E, Lallemand D, Grenier P, Cabanis E, Laval-Jeantet M. MR imaging of intravoxel incoherent motions: application to diffusion and perfusion in neurologic disorders. Radiology. 1986;161(2):401–7. 10.1148/radiology.161.2.3763909 [DOI] [PubMed] [Google Scholar]
  • 5.Le Bihan D, Breton E, Lallemand D, Aubin ML, Vignaud J, Laval-Jeantet M. Separation of diffusion and perfusion in intravoxel incoherent motion MR imaging. Radiology. 1988;168(2):497–505. 10.1148/radiology.168.2.3393671 [DOI] [PubMed] [Google Scholar]
  • 6.Le Bihan D, Turner R. The capillary network: a link between IVIM and classical perfusion. Magn Reson Med. 1992;27(1):171–8. [DOI] [PubMed] [Google Scholar]
  • 7.Pekar J, Moonen CT, van Zijl PC. On the precision of diffusion/perfusion imaging by gradient sensitization. Magn Reson Med. 1992;23(1):122–9. [DOI] [PubMed] [Google Scholar]
  • 8.Le Bihan D, Turner R, Moonen CT, Pekar J. Imaging of diffusion and microcirculation with gradient sensitization: design, strategy, and significance. J Magn Reson Imaging. 1991;1(1):7–28. [DOI] [PubMed] [Google Scholar]
  • 9.Bretthorst GL, Hutton WC, Garbow JR, Ackerman JJH. Exponential parameter estimation (in NMR) using Bayesian probability theory. Concepts Magn Reson Part A. 2005;27(2):55. [Google Scholar]
  • 10.Dyvorne HA, Galea N, Nevers T, Fiel MI, Carpenter D, Wong E, et al. Diffusion-weighted imaging of the liver with multiple b values: effect of diffusion gradient polarity and breathing acquisition on image quality and intravoxel incoherent motion parameters-a pilot study. Radiology. 2013;266(3):920–9. 10.1148/radiol.12120686 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Neil JJ, Bretthorst GL. On the use of Bayesian probability theory for analysis of exponential decay date: an example taken from intravoxel incoherent motion experiments. Magn Reson Med. 1993;29(5):642–7. [DOI] [PubMed] [Google Scholar]
  • 12.Orton MR, Collins DJ, Koh D-M, Leach MO. Improved intravoxel incoherent motion analysis of diffusion weighted imaging by data driven Bayesian modeling. Magn Reson Med. 2014;71(1):411–20. 10.1002/mrm.24649 [DOI] [PubMed] [Google Scholar]
  • 13.Freiman M, Perez-Rossello JM, Callahan MJ, Voss SD, Ecklund K, Mulkern RV, et al. Reliable estimation of incoherent motion parametric maps from diffusion-weighted MRI using fusion bootstrap moves. Med Image Anal. 2013;17(3):325–36. 10.1016/j.media.2012.12.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Coupé P, Yger P, Prima S, Hellier P, Kervrann C, Barillot C. An optimized blockwise nonlocal means denoising filter for 3-D magnetic resonance images. IEEE Trans Med Imaging. 2008;27(4):425–41. 10.1109/TMI.2007.906087 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Wiest-Daesslé N, Prima S, Coupé P, Morrissey SP, Barillot C. Non-local means variants for denoising of diffusion-weighted and diffusion tensor MRI. Med Image Comput Comput Assist Interv. 2007;10:344–51. 10.1007/978-3-540-75759-7_42 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Manjón JV, Coupé P, Martí BL, Collins DL, Robles M. Adaptive non-local means denoising of MR images with spatially varying noise levels. J Magn Reson Imaging. 2010;31(1):192–203. 10.1002/jmri.22003 [DOI] [PubMed] [Google Scholar]
  • 17.Manjón JV, Coupé P, Concha L, Buades A, Collins DL, Robles M. Diffusion weighted image denoising using overcomplete local PCA. PLoS One. 2013;8(9):e73021 10.1371/journal.pone.0073021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Lam F, Babacan SD, Haldar JP, Weiner MW, Schuff N, Liang Z-P. Denoising diffusion-weighted magnitude MR images using rank and edge constraints. Magn Reson Med. 2014;71(3):1272–84. 10.1002/mrm.24728 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Lam F, Liu D, Song Z, Schuff N, Liang Z-P. A fast algorithm for denoising magnitude diffusion-weighted images with rank and edge constraints. Magn Reson Med. 2016;75(1):433–40. 10.1002/mrm.25643 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Henkelman RM. Measurement of signal intensities in the presence of noise in MR images. Med Phys. 1985;12(2):232–3. 10.1118/1.595711 [DOI] [PubMed] [Google Scholar]
  • 21.Bernstein MA, Thomasson DM, Perman WH. Improved detectability in low signal-to-noise ratio magnetic resonance images by means of a phase-corrected real reconstruction. Med Phys. 1989;16(5):813–7. 10.1118/1.596304 [DOI] [PubMed] [Google Scholar]
  • 22.Gudbjartsson H, Patz S. The Rician distribution of noisy MRI data. Magn Reson Med. 1995;34(6):910–4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Dietrich O, Raya JG, Reeder SB, Ingrisch M, Reiser MF, Schoenberg SO. Influence of multichannel combination, parallel imaging and other reconstruction techniques on MRI noise characteristics. Magn Reson Imaging. 2008;26(6):754–62. 10.1016/j.mri.2008.02.001 [DOI] [PubMed] [Google Scholar]
  • 24.Nowak RD. Wavelet-based Rician noise removal for magnetic resonance imaging. IEEE Trans Image Process. 1999;8(10):1408–19. 10.1109/83.791966 [DOI] [PubMed] [Google Scholar]
  • 25.Aja-Fernandez S, Niethammer M, Kubicki M, Shenton ME, Westin C-F. Restoration of DWI data using a Rician LMMSE estimator. IEEE Trans Med Imaging. 2008;27(10):1389–403. 10.1109/TMI.2008.920609 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Koay CG, Ozarslan E, Basser PJ. A signal transformational framework for breaking the noise floor and its applications in MRI. J Magn Reson. 2009;197(2):108–19. 10.1016/j.jmr.2008.11.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Buades A, Coll B, Morel J-M. A review of image denoising algorithms, with a new one. Multiscale Model Simul. 2005;4(2):490–530. [Google Scholar]
  • 28.Griswold MA, Jakob PM, Chen Q, Goldfarb JW, Manning WJ, Edelman RR, et al. Resolution enhancement in single-shot imaging using simultaneous acquisition of spatial harmonics (SMASH). Magn Reson Med. 1999;41(6):1236–45. Epub 1999/06/17. [DOI] [PubMed] [Google Scholar]
  • 29.Bammer R, Auer M, Keeling SL, Augustin M, Stables LA, Prokesch RW, et al. Diffusion tensor imaging using single-shot SENSE-EPI. Magn Reson Med. 2002;48(1):128–36. 10.1002/mrm.10184 [DOI] [PubMed] [Google Scholar]
  • 30.Collins DL, Zijdenbos AP, Kollokian V, Sled JG, Kabani NJ, Holmes CJ, et al. Design and construction of a realistic digital brain phantom. IEEE Trans Med Imaging. 1998;17(3):463–8. 10.1109/42.712135 [DOI] [PubMed] [Google Scholar]
  • 31.Wu W-C, Chen Y-F, Tseng H-M, Yang S-C. Caveat of measuring perfusion indexes using intravoxel incoherent motion magnetic resonance imaging in the human brain. European Radiology. 2015;25(8):2485–92. 10.1007/s00330-015-3655-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Tabelow K, Voss HU, Polzehl J. Local estimation of the noise level in MRI using structural adaptation. Med Image Anal. 2015;20(1):76–86. 10.1016/j.media.2014.10.008 [DOI] [PubMed] [Google Scholar]
  • 33.Padhani AR, Liu G, Koh DM, Chenevert TL, Thoeny HC, Takahara T, et al. Diffusion-weighted magnetic resonance imaging as a cancer biomarker: consensus and recommendations. Neoplasia. 2009;11(2):102–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Coupé P, Manjón JV, Gedamu E, Arnold D, Robles M, Collins DL. Robust Rician noise estimation for MR images. Med Image Anal. 2010;14(4):483–93. 10.1016/j.media.2010.03.001 [DOI] [PubMed] [Google Scholar]
  • 35.Lourakis MIA. Levmar: Levenberg-Marquardt nonlinear least squares algorithms in C/C++. www.ics.forth.gr/~lourakis/levmar/; 2004.
  • 36.Efron B, Tibshirani R. An introduction to the bootstrap. New York: Chapman & Hall; 1994. [Google Scholar]
  • 37.Netsch T, Van Muiswinkel A. Quantitative evaluation of image-based distortion correction in diffusion tensor imaging. IEEE Trans Med Imaging. 2004;23(7):789–98. 10.1109/TMI.2004.827479 [DOI] [PubMed] [Google Scholar]
  • 38.Clark H. Characterization of special purpose coils for diagnostic magnetic resonance images. National Electrical Manufacturers Association: NEMA Standards and Products Guide. 1991;Report nr MS 6–1991 (R1991, R2000).
  • 39.Wu W-C, Lin S-C, Wang DJ, Chen K-L, Li Y-D. Measurement of Cerebral White Matter Perfusion Using Pseudocontinuous Arterial Spin Labeling 3T Magnetic Resonance Imaging—an Experimental and Theoretical Investigation of Feasibility. PLoS One. 2013;8(12):e82679 10.1371/journal.pone.0082679 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Vos SB, Tax CM, Luijten PR, Ourselin S, Leemans A, Froeling M. The importance of correcting for signal drift in diffusion MRI. Magn Reson Med. 2016. [DOI] [PubMed] [Google Scholar]
  • 41.Zhou M-X, Yan X, Xie H-B, Zheng H, Xu D, Yang G. Evaluation of Non-Local Means Based Denoising Filters for Diffusion Kurtosis Imaging Using a New Phantom. PLoS One. 2015;10(2):e0116986 10.1371/journal.pone.0116986 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Coupé P, Hellier P, Prima S, Kervrann C, Barillot C. 3D wavelet subbands mixing for image denoising. Int J Biomed Imaging. 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Fig. In-vivo SNR maps of three slices in the brain at b-values of 0, 100, 550, 1000 s/mm2.

Please note that diffusion weighting at each b-value was applied along three orthogonal directions and the images were subsequently combined into a trace image.

(DOCX)

S1 Table. Mean values and standard deviations of the SNRs in the in-vivo measurements jointly computed across gray and white matter (NSA = 1).

(DOCX)

S2 Fig. Flow chart depicting the simulation pipeline used in the present work.

In addition, links to the locations of the source code that is used in the simulations are provided.

(PDF)

Data Availability Statement

All relevant data are within the paper and its Supporting Information files.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES