Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2026 May 11.
Published in final edited form as: Magn Reson Imaging. 2025 Jan 17;117:110326. doi: 10.1016/j.mri.2025.110326

Deep learning-based free-water correction for single-shell diffusion MRI

Tianyuan Yao a, Derek B Archer b, Praitayini Kanakaraj a, Nancy Newlin a, Shunxing Bao c, Daniel Moyer a, Kurt Schilling d, Bennett A Landman a,b,c,d, Yuankai Huo a,d,*
PMCID: PMC13157656  NIHMSID: NIHMS2170154  PMID: 39827997

Abstract

Free-water elimination (FWE) modeling in diffusion magnetic resonance imaging (dMRI) is crucial for accurate estimation of diffusion properties by mitigating the partial volume effects caused by free water, particularly at the interface between white matter and cerebrospinal fluid. The presence of free water partial volume effects leads to biases in estimating diffusion properties. Additionally, the existing mathematical FWE model is a two-compartment model, which can be well posed for multi-shell data. However, single-shell acquisitions are more common in clinical cohorts due to time constraints. To overcome these problems, we proposed a deep-learning framework that focuses on mapping and correcting free-water partial volume contamination in DWI. It utilizes data-driven techniques to infer plausible free-water volumes across different diffusion MRI acquisition schemes, including single-shell acquisitions. In this work, we study the Human Connectome Project Young Adults (HCP-ya), the HCP Aging dataset (HCP-a) as well as Brain Tumor Connectomics Data (BTC). The evaluation demonstrates that it produces more plausible results compared to previous single-shell free water estimation approaches. The proposed method is generalizable through model fine-tuning and b-value re-mapping when dealing with new data. The results have demonstrated improved consistency of properties estimation between scan/rescan data and accuracy in identifying neural pathways, as well as enhanced clarity in the visualization of white matter tracts.

Keywords: Diffusion MRI, Free water elimination, Deep learning, Reproducibility

1. Introduction

Diffusion magnetic resonance imaging (dMRI) is a noninvasive technique that provides unique in vivo microstructural information, particularly for studying white matter structure and brain connectivity [13]. One common application of dMRI is Diffusion Tensor Imaging (DTI), which is clinically utilized for surgical planning [4]. DTI quantifies the three-dimensional movement of water molecules by assuming Gaussian diffusion within a voxel [5]. Key metrics derived from DTI, such as Fractional Anisotropy (FA) and Mean Diffusivity (MD), provide valuable insights into brain microstructure. FA measures the coherence and directionality of white matter tracts, with higher values indicating greater structural integrity and alignment of fibers. In contrast, MD captures the average rate of water diffusion, offering a non-directional measure of tissue density and health. Elevated MD values often reflect pathological changes, such as increased water content from edema or tissue degradation. These metrics enable DTI to support the delineation of neuronal fibers and assessment of brain microstructure, making it a cornerstone in neuroimaging [6].

The single-tensor DTI model assumes a single tissue compartment per voxel, which leads to biased DTI metrics in voxels containing a mixture of white matter and freely diffusing extracellular water molecules [7]. Free water refers to water molecules that are not restricted by cellular barriers and do not experience flow. In the human brain, cerebrospinal fluid (CSF) in the ventricles, around the brain parenchyma, or within lesion edema is considered free water. In the context of dMRI, signals from free water and brain tissue can combine within a voxel due to partial volume effects. When these signals are not explicitly modeled, free water introduces inaccuracies—referred to as “contamination”—that distort the derived diffusion metrics and lead to potential misinterpretations [8]. (Illustrated in Fig. 1.)

Fig. 1.

Fig. 1.

Partial volume effects and free water contamination. In the upper panel, the Free Water partial volume effects (PVE) manifest at the boundary between brain tissue and ventricles filled with cerebrospinal fluid (CSF). The image to the right presents a pristine slice where the distinction between the brain tissue and CSF is sharp and clear. In contrast, the central image reveals partial volume effects in voxels encompassing both CSF and tissue, as indicated by the yellow arrows. In the bottom panel, water molecules diffusing freely (as in edema) tend to reduce the anisotropy measurements because their movement is less restricted and more isotropic compared to water molecules in healthy brain tissue, where diffusion is more anisotropic due to barriers like cell membranes and axonal fibers. The free water contamination can make it challenging to accurately assess white matter integrity and to differentiate pathological changes from changes in water content. The primary objective of the desired algorithm is to extract parameter estimates that are exclusive to the brain tissue within these voxels. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

This contamination has significant implications for diffusion tractography, where free water can obscure the structural pathways of white matter, and for studying neurodegenerative diseases such as Alzheimer’s disease, Multiple Sclerosis, Parkinsonism, and Schizophrenia. Subtle changes in water diffusion within these conditions often serve as key indicators of disease progression [9]. Accurately addressing these partial volume effects is crucial for reliable interpretation of dMRI data and for isolating tissue-specific diffusion properties.

The Free Water Elimination (FWE) model, as proposed by Pasternak et al. [10], aims to mitigate the adverse impact of CSF partial volume effects on DTI metrics [6]. This model leverages a bi-tensor approach to separate isotropic (free water) and anisotropic (tissue) diffusion components. The initial FWE model requires multiple b-values to distinguish tissue types at the sub-voxel level because water in free and tissue-bound compartments exhibits distinct diffusion behaviors that are more easily disentangled across a range of diffusion weightings [11,12]. Without these multi-shell constraints, the model often becomes ill-posed, making it difficult to estimate free water fractions.

Although mathematical models’ dependence on multi-shell data limits their applicability—since multi-shell acquisitions are not commonly available in clinical practice due to longer scan times [4,12]—in clinical scenarios where single-shell data is more prevalent, direct application of these models is not feasible. To address this limitation, alternative strategies such as spatial regularization methods [9,10] have been proposed to impose smooth constraints and stabilize estimation. Deep learning approaches, like those introduced by Molina-Romero et al. [7], leverage data-driven methods for adapting FWE techniques to single-shell data using multi-layer perceptron. However, traditional deep learning methods [7,37], follow a case-by-case approach and lack generalizability across datasets with diverse acquisition schemes.

In this study, we propose a novel deep learning method [13] for mapping and correcting free-water partial volume contamination in DWI to address these limitations. As compared with the deep learning state-of-the-art (SOTA) method, the proposed deep learning framework can be fed with data simultaneously across a range of acquisition settings (different combinations of b-values within 1000, 2000, 3000 s/mm2), while targeting the free water fractions generated from Human Connectome Project young adults (HCP-ya) dataset [3,14]. With the combination of dynamic head and spherical convolution, this approach leverages data-driven techniques to reliably infer plausible free-water volumes across different diffusion MRI acquisition schemes, including single-shell acquisitions. During the testing stage, we used only 1000 s/mm2 shell data (including b0) to evaluate the free water fraction estimation across methods. For extension, we also applied our method to the HCP aging dataset [15] as well as the Brain Tumor Connectomics Data [16,17] (both pre-operative and post-operative). For further evaluation, the test-retest group from HCP-ya is employed for the reproducibility assessment of the proposed free-water elimination method. We also performed tractography and streamline analysis to further demonstrate the plausibility of the proposed method.

2. Related work

2.1. Free water elimination DTI

Pasternak et al. [10] introduced a bi-tensor model to segregate free water from brain tissue in diffusion MRI. This approach accounts for isotropic (free water) and anisotropic (tissue) components, significantly improving the interpretation of DTI data, particularly in areas with high free water content, such as periventricular zones.

In diffusion MRI, the measured signal is the combined contribution of CSF and tissue (parenchyma) components, where the relative signal contribution of the fast-diffusing component is described by f. The free water elimination signal model is expressed as:

S=(1-f)Stissue+fSCSF (1)

and can be expanded as:

S=S0(1f)expbgiTDtissuegi+fexpbDiso, (2)

where Diso is the free water diffusivity, Dtissue is the tissue diffusion tensor, and gi represents the diffusion gradient direction. Since Dtissue<Diso, the distinct diffusion characteristics of tissue and free water are better disentangled with multi-shell data acquired at multiple b-values. However, disentangling these volume fractions becomes ill-posed in single-shell diffusion MRI, where diffusion-sensitizing gradients are applied at a single b-value. This limitation renders traditional free water elimination techniques inapplicable without additional constraints [6,12].

2.2. Free water elimination for single-shell DWI

In clinical settings, multi-shell datasets are less common due to longer acquisition times, necessitating the development of methods for single-shell data. To address this, Pasternak et al. [10] proposed single-shell strategies, which incorporate spatial constraints and utilize Gradient Descent (GD) to help solve the ill-posed problem. A regularized gradient descent (RGD) method [10,18] has since been implemented as the standard approach for single-shell free water elimination.

While some studies indicate that plausible free water fractions can be obtained using these methods [9], others argue that the resulting maps are unreliable due to the degenerate nature of the model in single-shell acquisitions [6,19,20]. This controversy underscores the challenges of accurately quantifying free water in single-shell data without introducing artifacts or biases that misrepresent tissue properties. Consequently, advanced techniques or complementary strategies are needed to improve the robustness of free water estimation in such scenarios.

2.3. Learning-based method

Molina-Romero et al. [7] presented a deep learning framework in their study. This method developed a deep learning methodology for eliminating cerebrospinal fluid (CSF) contamination in diffusion MRI data. This method employs a multi-layer perceptron trained on a synthetic dataset of DWIs to estimate the free water fractions. The network architecture is optimized for processing DWI data, incorporating techniques like dropout and batch normalization to enhance model performance. This approach is particularly significant as it demonstrates the capability of deep learning models to interpret complex medical imaging data, providing a new avenue for more accurate and robust analysis in diffusion MRI, especially in clinical contexts where precise tissue characterization is crucial.

However, these deep learning frameworks are not generalizable to new acquisition schemes (as shown in Fig. 2. This complicates applying a deep learning (DL) model to data acquired in different acquisition settings. Our model aims to train a DL framework that can be adapted to an arbitrary number of available dMRI sequences (including multi-shell and single-shell sequences.) By regarding the multi-shell sequences as multi-modality data, a dynamic deep learning framework shall enable input with multiple configurations (single shell, two shells, and three shells, e.g.). During the training stage, the network shall transfer the knowledge between the full shell dMRI with the drop-out self-version, thus enabling the framework to estimate plausible free water fractions for single shell signal input during the inference stage.

Fig. 2.

Fig. 2.

Comparison of methods for free water elimination (FWE) in dMRI. Conventional methods rely on spatially regularized gradient descent (RGD) to impose constraints for solving the FWE problem in single-shell data. However, these methods are often ill-posed for single-shell dMRI and require additional regularization to achieve a stable solution. Previous deep learning-based methods provide accurate model fitting for single-shell and multi-shell data but are typically limited in generalizability due to their case-specific designs (F1, and F2 represent different network design). These approaches often fail when presented with dMRI data from unseen acquisition schemes. In contrast, the proposed method introduces a unified model (represented as F) capable of handling various shell configurations.

3. Method

To serve the above motivation, we employ a dynamic head (DH) design [13] to handle the multi-shell problem on the three most common b-values: 1000, 2000, and 3000 s/mm2. Additionally, spherical CNNs were employed to tackle the problem of varying gradient directions on each shell. In this study, we employ free water fraction estimation as our chosen task to perform assessments on both conventional/mathematical models and other deep learning frameworks. The model is shown in Fig. 3.

Fig. 3.

Fig. 3.

In our study, we aim to design a deep learning framework that integrates signals from both the q-space and radial space as input. This will allow the framework to take in both single-shell and multi-shell diffusion signals (two shells, three shells) that could utilize the powerful data-driven capabilities of deep learning, thus yielding improved accuracy over conventional fitting or modeling of diffusion MRI signals such as free water fractions estimation, ODF estimation and other diffusion measures, like fractional anisotropy, mean diffusivity, etc. we introduce a dynamic head network built on spherical convolutional neural network. The dynamic headings offer a flexible way to handle diverse data types within a single model, adapting its behavior to best suit the input configuration. Using the spherical convolution network serves as a feature extractor to get crucial information from the diffusion MRI signals. Our evaluation of this method focuses on its application to estimate the free water fraction for single-shell DWIs.

3.1. Dynamic head

A dynamic head design (illustrated in Fig. 4) in multi-modality deep learning [21] offers a flexible way to handle diverse data types within a single model, adapting its behavior to best suit the input modality. In our scenario, we intend to use a dynamic head that allows the neural network to effectively deal with diverse inputs from different shells by adapting its processing mechanism accordingly.

Fig. 4.

Fig. 4.

To mitigate the large input variation caused by artificially zero-ed channels, we use the dynamic head to generate the parameters for the first convolutional layer. In our scenario, this feature scaling matrix will adjust the S2 convolutional kernel in the spherical convolution part.

The input is normalized data expanded using spherical harmonics up to degree eight. Note that with K shells in our scheme, there are 2K – 1 configurations. To improve the network expressiveness, we devise a dynamic head to adaptively generate model parameters conditioned on the availability of input shells. We use a binary code mRK indicates that m is a vector with K real-valued entries or components. K ∈ [0, 1] that 0/1 represent the absence/presence of each shell. To mitigate the large input variation caused by artificially zero-ed channels, we use the dynamic head to generate the parameters for the first convolutional layer. For a combination of 1000, 2000, and 3000 s/mm2 in our training configuration, K was set to 3 in our study.

3.2. Spherical convolution

Spherical CNNs are employed to handle variations in gradient schemes and achieve rotational equivariance [22]. After the dynamic filtering, the following architecture (illustrated in Fig. 5) consists of an S2 convolutional layer and a SO(3) convolutional layer. The S2 layer performs directional convolution on the sphere, lifting the spherical input to the rotation group, while the SO(3) layer performs convolution within the rotation group. These layers are followed by three channel-wise activations and two restricted generalized convolutions [22], culminating in a final restricted generalized convolution [22] that maps the features to a rotationally invariant representation. The extracted features are then concatenated and passed through fully connected layers to estimate the free water fraction ffw.

Fig. 5.

Fig. 5.

The architecture used for the convolutional base in our spherical-CNN models. The input to the first convolutional layer is a signal on the sphere.

3.3. B-value mapping

Due to differences in the b-values between datasets, we will first match the lower b-values and then proceed to feed into the deep learning framework. Previous work [23] illustrated that diffusion signal has linear decay for values 500s/mm2<bvalue<1500s/mm2. The equation for the new diffusion signal Sbnew is as follows:

Sbnew=S0*expbnew/boriginal*logSboriginal/S0 (3)

For shells less than or equal to 1500s/mm2, the lowest shell value shall be re-calibrated to 1000s/mm2 to match the training scheme.

3.4. DWI estimation from estimated free water fraction

After the ffw is estimated, the desired FW eliminated signal (interpreted as S^ ) shall be calculated (derived from Eq. (2)) by the following equation:

S^:=S-S0*ffw*exp-bDiso (4)

Additional parameter adjustments were applied during the estimation process. A mean diffusivity regularization threshold of 2.7e-3 mm2. s−1 was used. This threshold corresponds to the approximate value of free water diffusivity in physiological conditions. If the mean diffusivity exceeds this threshold, the diffusion signal is assumed to originate entirely from free water (i.e., the free water volume fraction is set to 1, and the tissue’s diffusion parameters are set to zero).

4. Experiments

4.1. Data

We have chosen DW-MRI from the HCP-ya dataset [3,14], which consists of healthy adults aged 22–35 years. A total of 220 subjects were included in this study. The acquisitions had b-values of 1000, 2000, and 3000 s/mm2 with 90 gradient directions on each shell. The data was preprocessed by the PreQual [26] pipeline with the default settings and Gibbs de-ringing to remove artifacts, including denoising using the Marchenko-Pastur PCA (MP-PCA) technique, intensity normalization, and distortion corrections. Distortion corrections included eddy current[SP]-induced distortion correction, intervolume motion correction, and slice-wise signal dropout imputation. A T1 volume of the same subject was used for brain masks segmented using SLANT [24]. All HCP-ya dMRIs were distortion corrected with top-up and eddy [25]. Of the 220 subjects, 200 were used as training data, 10 as validation, and 10 as testing data. The validation dataset was used during model training to tune hyperparameters and monitor performance, while the testing dataset was kept separate and used only for final performance assessment to ensure unbiased evaluation. To simulate different shell configurations, the data was reorganized to generate two-shell and single-shell datasets (b0 shells included). Another cohort from HCP-ya, the test-retest group, includes 10 subjects with scan/rescan DWIs and was employed for test-retest reproducibility assessment. For single-shell free water estimation, only the 1000 s/mm2 shell from the testing cohort was used.

The diffusion and T1-weighted MRI data of 50 unrelated healthy adults aged 50+ from the Lifespan HCP in Aging (HCP-A) [15] study was used to evaluate the effectiveness of the proposed FWE method on healthy elderly subjects. This dataset spans a wide age range (36–100+years), capturing age-related variability in brain structure and function. Diffusion encoding was acquired with two shells of 1500 and 3000 s/mm2 (98–99 directions per shell) and 28 b-value = 0 s/mm2 images interleaved. All datasets were acquired with two phase encoding directions: anterior to posterior (AP) and reversed phase encoding direction (PA). The data was preprocessed by the PreQual [26] pipeline with the default settings and Gibbs de-ringing. The T1 image was registered to the mean b0 image and segmented using SLANT [24]. Of the 50 subjects, 40 were used to fine-tune the model, 5 as validation, and 5 as testing data. The 1500 s/mm2 shell was processed with a b-value re-mapped to 1000 s/mm2 before feeding to the deep learning framework.

The third dataset studied is the Brain Tumor Connectomics Data [16,17], which includes both pre-operative and post-operative data. The pre-operative data consists of 11 glioma patients, 14 meningioma patients, and 11 control subjects, while the post-operative data consists of 7 glioma patients, 12 meningioma patients, and 10 control subjects. This dataset provides a clinical perspective, focusing on tumor-related structural changes. The DWI scans were acquired using a multi-shell high-angular resolution protocol at 700, 1200, and 2800 s/mm2. The data was preprocessed by the PreQual [26] pipeline with the default settings and Gibbs de-ringing. The T1 image was registered to the mean b0 image and segmented using SLANT [24]. Only the 700 s/mm2 shell was involved in the deep learning experiment and re-mapped to 1000 s/mm2 before feeding to the deep learning framework. Three DWIs from a subset of unique subjects (patients and controls) were withheld as the testing cohort, ensuring no overlap with subjects whose data was included in the training or fine-tuning datasets. The remaining DWIs from pre-operative and post-operative data were used exclusively for training and fine-tuning.

The target free water fractions are generated using the Free Water Elimination model described in [6,10]. The implementation details of the free water DTI model are provided in [20] and made available through the DIPY library (version 1.70) with its default settings [27]. Since ground truth mappings of free water volume in human data are not available, we use the free water fractions estimated from the full configuration of multi-shell data as the silver standard in this study. This approach assumes that multi-shell HARDI protocols provide a more accurate estimation of free water. Our deep learning-based model is designed to bridge the gap between single-shell data and this silver standard, effectively learning to approximate a more complex method from simpler data inputs.

The target free water fractions are generated using the Free Water Elimination model described in [6,10]. The implementation details of the free water DTI model are provided in [20] and made available through the DIPY library (version 1.70) with its default settings [27]. Since ground truth mappings of free water volume in human data are not available, we use the free water fractions estimated from the full configuration of multi-shell data as the silver standard in this study. This approach assumes that multi-shell HARDI protocols provide a more accurate estimation of free water. While the silver standard is consistent across subjects in terms of processing and assumptions, its accuracy is limited in regions with low signal-to-noise ratio (SNR) or complex microstructural environments where multi-shell models themselves may face challenges. Despite these limitations, the silver standard serves as a practical benchmark for evaluating our model, and its reliability is further supported by the high quality of the datasets and rigorous preprocessing. Our deep learning-based model is designed to bridge the gap between single-shell data and this silver standard, effectively learning to approximate a more complex method from simpler data inputs.

4.2. Experimental settings

4.2.1. Deep learning estimation on free water fraction

4.2.1.1. Model-based baseline methods.

The method of Free Water Elimination from Pasternak et al. [10] was implemented as the baseline for the conventional FWE model. When dealing with single shell data, spatially regularized gradient descent (RGD) algorithms [9] applied constraints that are imposed via the time evolution of a gradient flow on a Riemannian manifold [28] to get a unique solution.

4.2.1.2. Deep learning baseline methods.

The ANN proposed by Molina-Romero et al. [7] is performed as a deep learning-based method to extract the free water fraction. We followed their settings by initializing a 4-layer FCN (two hidden layers with Nb/2 and Nb/4 respectively). In our case, Nb is 96. The input size is 96 and one single output unit yields the estimate of free water fractions. We trained the ANN architectures with the training Cohort.

Additionally, the shore basis function [2931] has been shown to capture the representation of multi-shell dMRI with minimal representation error and ensure the same when modeling single-shell dMRI. Thus, we utilize the shore representation as an additional baseline to assess the representation extracted from the spherical convolution network. We used 6th order and regularization constants: 1e – 8. The scaling factor is carefully calculated by ζ defined in units of mm-2 as ζ=1/8π2τMD is calculated based on the mean diffusivity (MD) obtained from the data. We also applied a 4-layer FCN to fit the FW fraction. The input size is 50 (SHORE-based ODF estimates 50 coefficients at 6th) and the sizes of the hidden layer are 100, and 50 respectively.

For the implementation of the proposed method, we used a spherical convolution model with the architecture [32]. The first block includes a directional convolution on the sphere that lifts the spherical input onto the rotation group, all parameters for this first convolution block are dynamically generated based on a given prior (dynamic headings). The second block includes a convolution on the rotation group, hence its input and output both live on the rotation group. We then apply a restricted generalized convolution. The same type is used for the following three channel-wise tensor-product activations and two restricted generalized convolutions until the final restricted generalized convolution maps down to a rotationally invariant representation. As is traditional in convolution networks we gradually decrease the resolution, with the spatial spherical harmonic bandlimits/degree [32] (L0, L1 , L2, L3, L4,L5 = (20, 10, 10, 6, 3, 1) and increase the number of channels, with (1, 20, 22, 24, 26, 28). The detail of the efficient generalized layer can be found in [33]. By using L1 regularization (regularization strength 10−5) and applying a restricted batch normalization across fragments (irreducible representations of the rotation group [32,34]), where the fragments are only scaled by their average and not translated (to preserve equivariance).

The RMSE loss was applied as the criterion function during the training process for all deep learning methods, we chose the best-performing models on the validation cohort and compared them against the Pasternak’s with the RGD method on the HCP-ya dataset. After free water elimination, the fractional anisotropy and mean diffusivity are calculated around the whole brain on the test cohort. Histograms of such diffusion properties are generated to assess the effect of regularization on free water estimation across methods.

Additionally, we compared the deep learning models to evaluate the generalizability across different shell configurations. The ANN method for different configurations is trained separately and the input size for the first layer of FCN is adjusted to the corresponding number of gradient directions of input.

4.2.2. Tractography and bundle analysis on free water corrected DWI

For each test subject, we used the MRTrix (version-3.0.3) default probabilistic tracking algorithm of second-order integration over fiber orientation distributions (FODs) for tractography [35]. We generated 10 million streamlines to build each tractogram, seeding, and termination using the five-tissue-type mask. We allowed backtracking. Tracts and binary segmentation masks for each bundle are generated using Tract-Seg [36]. Further specific bundle analyses were performed for different datasets to assess the effectiveness of free water correction of the proposed method.

5. Results

5.1. Deep learning estimation on single shell DWIs

The RMSE with generated free water fractions and the silver standard is calculated and shown in Table 1. From the table, the deep learning methods outperformed other methods on single-shell data in terms of RMSE for predicting ffw. Notably, the RMSE of our proposed method is even lower than the scan-rescan variability observed in the high-quality HCP dataset, which demonstrated an RMSE of 0.0354. This highlights the robustness and precision of our model compared to both conventional methods and other deep learning approaches.

Table 1.

Comparison of metrics between methods. Both Deep SHORE and our proposed method are trained on a variety of shell configurations to improve the capability of the model. ANN is trained only with single-shell dMRI. The RMSE of ffw is applied to test the precision of different methods. A Wilcoxon signed-rank test was performed to validate the significance and consistency of model performance, the reference is marked with ★, and all the deep learning method results yielded a p-value of less than 0.001.

Category Method RMSE of ffw
Conventional★ Pasternak et al. [6,10,20] w. RGD [28] 3.62 ± 0.44
Deep-learning ANN [7,37] 3.08 ± 0.31
Deep SHORE [30] 2.89 ± 0.19
Proposed 1.97 ± 0.21

All metric values are in units of 10−2.

5.2. Deep learning estimation on heterogeneous configuration DWIs

In Table 2, we evaluated different deep learning pipelines’ generalizability across different shell configurations. To be specific, the ANN method has 7 independent models for different subsets of the 1000, 2000, 3000 s/mm2 scheme. It is worth noting that the SHORE representation has decent performances in multi-shell configuration with our proposed method which proved to enable multi-shell b-value generalizability of the data-driven deep learning approach. Our proposed method outperforms Deep SHORE in a single-shell scheme proving the dynamic head strategy can produce similar feature representations when the information of multiple shells is absent.

Table 2.

Comparison of RMSE values ffw for different methods across varying shell configurations. The proposed method demonstrates consistent improvements in free-water fraction estimation over other deep learning frameworks, with tighter variance, indicating higher precision.

Method Single shell Two shells Three shells
ANN [7,37] 2.99 ± 0.35 2.79 ± 0.12 2.41 ± 0.28
Deep SHORE [30] 2.94 ± 0.24 1.82 ± 0.18 1.77 ± 0.19
Proposed 1.89 ± 0.19 1.85 ± 0.20 1.63 ± 0.17

All metric values are in units of 10−2

Additional experiments have been conducted to evaluate the proposed framework when applying the model (trained only with HCP-YA data) to the HCP-Aging dataset, as shown in Table 4. During fine-tuning, only the parameters of the fully connected layer are trainable, allowing the model to adapt to new datasets while retaining its core structure. The model was assessed on both single-shell and multi-shell configurations, revealing a significant improvement in performance following fine-tuning. This is evidenced by the reduction in RMSE values across all tested configurations. For example, the RMSE decreased from 3.61 to 2.22 for the 1000 (1500) and 3000 s/mm2 configuration, highlighting the efficacy of fine-tuning in enhancing model accuracy and generalizability.

Table 4.

Evaluation of model generalizability on the HCP-Aging dataset with direct application to unseen data and fine-tuning on both single-shell and multi-shell configurations. A Wilcoxon signed-rank test validated the significance and consistency of model performance. The reference is marked with ★, and all deep learning method results yielded a p-value of less than 0.001.

Method Training data Test configuration *RMSE
Proposed HCP-YA 1000 (1500), 3000 3.61 ± 0.38
Proposed HCP-YA 1000 (1500) 3.26 ± 0.36
Proposed (Fine-tuned) HCP-YA, HCP-A 1000 (1500), 3000 2.22 ± 0.18
Proposed (Fine-tuned) HCP-YA, HCP-A 1000 (1500) 2.34 ± 0.19
Pasternak et al. w. RGD [28] N/A 1500 2.89 ± 0.24
Pasternak et al. ★ (Silver Standard) N/A 1500, 3000 N/A
*

All metric values are in units of 10−2

Notably, the inclusion of high b-value data (e.g., b3000 s/mm2) offers a potential advantage, as it significantly attenuates the free water signal, leading to clearer tissue characterization. However, high b-value data is known to suffer from reduced SNR, which can negatively impact model performance in less optimal datasets. While the HCP datasets used in this study provide high-quality data with minimal noise, the generalizability of the proposed method to other datasets with high b-value acquisitions (e.g., b3000 s/mm2) warrants further exploration.

5.3. Diffusion properties estimation after FWE on single shell DWIs

To further estimate the effectiveness of free water elimination to microstructural inspection. A cubic patch, in the central of the volume, which shall include the cerebrospinal fluid confined ventricles (50 × 50 × 50) is selected as the region of interest(ROI)to calculate the change of FA in the HCP-ya test cohort as shown in Table 3.

Table 3.

Comparison of FA (fractional anisotropy) in ROI patches to assess the correction effects of different methods in the CSF-confined ventricles. Values are reported as mean ± standard deviation.

Category Method FA (ROI)
Before FWE 0.453 ± 0.124
Conventional Method Pasternak et al. [10] w. RGD [28] 0.498 ± 0.125
Deep-Learning Methods ANN [7,37]
Deep SHORE [30]
Proposed Method
0.493 ± 0.119
0.501 ± 0.118
0.508 ± 0.127
Silver Standard (Upper Bound) 0.517 ± 0.124

Both our proposed method and the baselines showed similar performance (a boost of FA) as shown in the histogram of Fig. 6. From the table mentioned above, the deep learning methods outperformed other methods on single-shell data on both RMSE of the prediction error of ffw and the average FA on ROI patches, a visualization of the error with the silver standard is shown in the bottom panel of Fig. 6. Our proposed framework has also reached an average FA of 0.508 on a selected ROI as compared with 0.517 on the silver standard.

Fig. 6.

Fig. 6.

The histogram of the FA is shown before and after FWE from different methods respectively in the top panel. To further assess the transform/prediction, the FA error map with the silver standard is depicted in the bottom panel.

CSF contribution was removed from all the MD maps as shown in the left panel of Fig. 7. However, from the histogram of the right panel of Fig. 7, both ANNs and the Pasternak et al. w. RGD method suffered from over-regularization of MD. Our proposed method has the closest distribution with the MD of the silver standard.

Fig. 7.

Fig. 7.

The mean diffusivity of before/after different free water correction methods is qualitatively depicted in (a). To further assess the correction effect, the histogram of MD is shown in (b).

Previous work has shown free water elimination improves test–retest reproducibility of diffusion tensor imaging indices in the brain [38], we also assessed the test-retest reproducibility on the HCP-ya test-retest group dataset. Binary segmentation of 72 tracts are generated using TractSeg [36]. The scan/rescan images are registered and the joint voxels for the 72 tracts are selected to calculate the FA, MD voxel-wise and average over regions before and after free water elimination. The differences between the scan and rescan data were quantified and subsequently converted into percentages to facilitate a more nuanced analysis. As shown in Figs. 8 and 9, the grouped dot plot with error bars illustrate that deep learning FWE significantly improves test-retest reproducibility on DTI metrics in brain. The corpus callosum (CC), the corticospinal tract (CST) and the superior longitudinal fascicle (SLF) were selected as ROI and performances metrics on such region has been calculated with an improvement of agreement (reduction of difference) by 4.1 %, 4.3 %, 5.2 % in FA and 6.1 %, 6.3 %, 4.7 % in MD.

Fig. 8.

Fig. 8.

Voxel-wise fractional anisotropy (FA) differences for each of the 72 bundles from the TractSeg bundle map, averaged across regions and HCP-YA test-retest subjects. The FA differences between scan and rescan data were quantified and expressed as percentages. The mean FA differences and standard deviations are presented in a grouped dot plot with error bars. Statistical significance was assessed using a Wilcoxon signed-rank test, yielding a p-value of less than 0.05.

Fig. 9.

Fig. 9.

Voxel-wise mean diffusivity (MD) differences for each of the 72 bundles from the TractSeg bundle map, averaged across regions and HCP-YA test-retest subjects. The MD differences between scan and rescan data were quantified and expressed as percentages. The mean MD differences and standard deviations are presented in a grouped dot plot with error bars. Statistical significance was assessed using a Wilcoxon signed-rank test, yielding a p-value of less than 0.05.

5.4. Bundle analysis on aging subjects

The proposed FWE model is applied to the BTC pre-operative test cohort data after fine tuning with the training cohort. Only the 700 s/mm2 shell (re-mapping to 1000 s/mm2) is involved in this section. We performed tractography on both the DWI and FW-corrected DWI, a visualization of the tracts is shown in Figs. 10 and 11 where the bundles around the tumor region are highlighted. The findings demonstrate that most patients exhibit an increase in the extent of the peritumoral region covered by recognizable tracks when free water elimination is employed.

Fig. 10.

Fig. 10.

Comparison of tractography results before and after free water elimination for two representative patients: Sub-03, a 60-year-old female with meningioma type I in the parietal brain (tumor size: 78.44 cm3), and Sub-22, a 53-year-old male with oligodendroglioma type II in the parietal brain (tumor size: 15.16 cm3). The images display the original DWI (left), FW-corrected DWI (middle), and tumor mask (right). Red boxes highlight regions with notable differences, predominantly in areas affected by tumors, as indicated by the tumor mask. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Fig. 11.

Fig. 11.

Tractography results for a 61-year-old male patient with anaplastic astrocytoma type III in the temporal brain (tumor size: 59.21 cm3). The streamlines around the tumor region are displayed, showing a 4.2 % increase in the number of streamlines after free water elimination.

6. Discussion

The elimination of free water in Diffusion Weighted Imaging (DWI) analysis is crucial for enhancing the accuracy, specificity, and inter-pretability of imaging results. This process is essential as it allows for a more precise representation of tissue microstructure by mitigating the confounding effects of free water, such as in cerebrospinal fluid or edema, which can otherwise lead to overestimation of tissue diffusion and misinterpretation of pathological states. By employing sophisticated post-processing techniques to separate free water from tissue-bound water signals, free water elimination improves tissue characterization, enhances quantitative analysis, reduces imaging artifacts, and increases the sensitivity of DWI to subtle pathological changes. This refinement is particularly vital in neurological applications, aiding in the accurate diagnosis, monitoring, and research of brain disorders, thereby significantly contributing to the advancement of medical imaging and patient care. Our proposed method is more generalizable when dealing with multi-acquisition scheme DWIs compared with previous deep learning-based free water elimination methods.

However, the proposed method is also limited to a restricted number of shell configurations, as it depends on the shell configuration of the training data scheme. In our study, we utilize higher b-values of 3000 s/mm2 as it is used in specific research settings to investigate microscopic features of tissue, such as in studies of white matter microstructure or in more advanced diffusion models. High b-values can be advantageous for capturing complex fiber configurations, like crossing or kissing fibers, and may provide additional information about tissue microstructure. However, these come with challenges like reduced SNR and increased sensitivity to artifacts, necessitating advanced acquisition and postprocessing techniques to mitigate these issues. Typical b-values for clinical applications are usually much lower, ranging from about 500 to 1000 s/mm2. Besides, not unheard of in specialized research studies aiming to capture finer details of tissue architecture from only 3000 s/mm2 shells. The high resolution and high SNR properties of the HCP dataset have enabled us to conduct this study. The gap between clinically sufficient and research-oriented datasets can be challenging to the further application of the framework. The introduction of 700 and 1500 s/mm2 shells will be imperative under the clinical applicable situation.

Additionally, differences in other acquisition parameters, such as spatial resolution, echo time (TE), and signal-to-noise ratio (SNR), can influence the generalizability and performance of the proposed framework. For example, lower resolutions and higher echo times can lead to significant signal attenuation, potentially affecting the free water fraction estimation. Future work should explore approaches to account for these variations systematically, such as through adaptive learning strategies or data harmonization techniques, to extend the applicability of the method across a broader range of clinical and research settings.

The linear decay model employed for b-value mapping offers a simple and computationally efficient solution for harmonizing datasets with different b-value shells. However, this model may not accurately capture the complexity of signal attenuation in certain tissue types, particularly in regions where diffusion is influenced by non-Gaussian behaviors or microstructural heterogeneities. Such simplifications could result in suboptimal calibration, especially when dealing with highly complex tissues. Future investigations could address these issues by employing advanced radial space modeling instead of simple remapping. More potential improvements could include designing an adaptive head instead of the binary coding of shell absence/presence, as introducing more b-values to our proposed method would significantly increase the model parameters.Additionally, incorporating weighted loss functions could improve the model’s ability to handle tissue-specific variations by emphasizing regions with higher free water contamination.

7. Conclusions

This study demonstrates that the proposed deep-learning method, trained with heterogeneous multishell data, is capable of estimating tissue volume fractions from measured single-shell diffusion signals. The proposed framework outperforms the conventional method by Pasternak et al., with regularized gradient descent, in correcting single-shell diffusion MRI data. This approach leverages dynamic spherical convolution with data-driven techniques to reliably infer plausible free-water volumes across diverse diffusion MRI acquisition schemes, highlighting its potential for broad applicability in clinical and research settings.

Acknowledgments

This work was supported by the National Institutes of Health under award numbers R01EB017230, 1R01DK135597, T32EB001628, K01AG073584, and 5T32GM007347, and in part by the National Center for Research Resources and Grant UL1 RR024975-01. This study was also supported by the National Science Foundation (1452485, 1660816, and 1750213). The Vanderbilt Institute for Clinical and Translational Research (VICTR) is funded by the National Center for Advancing Translational Sciences (NCATS) Clinical Translational Science Award (CTSA) Program, Award Number 5UL1TR002243-03. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH or NSF. This work was also supported by Vanderbilt Seed Success Grant, Vanderbilt Discovery Grant, and VISE Seed Grant. We extend gratitude to NVIDIA for their support by means of the NVIDIA hardware grant.

Footnotes

CRediT authorship contribution statement

Tianyuan Yao: Writing – original draft, Project administration, Methodology, Data curation, Conceptualization. Derek B. Archer: Writing – review & editing, Data curation. Praitayini Kanakaraj: Writing – review & editing, Formal analysis. Nancy Newlin: Writing – review & editing. Shunxing Bao: Writing – review & editing. Daniel Moyer: Writing – review & editing. Kurt Schilling: Writing – review & editing. Bennett A. Landman: Writing – review & editing, Supervision. Yuankai Huo: Writing – review & editing, Supervision, Conceptualization.

Declaration of generative AI and AI-assisted technologies in the writing process

During the preparation of this work, the author(s) used ChatGPT4/ChatGPT4o in order to check for writing errors and perform appropriate editing and refinement. After using this tool, the author(s) reviewed and edited the content as needed and take(s) full responsibility for the content of the published article.

References

  • [1].Basser PJ, Mattiello J, LeBihan D. Estimation of the effective self-diffusion tensor from the nmr spin echo. J Magn Reson B 1994;103(3):247–54. [DOI] [PubMed] [Google Scholar]
  • [2].Van Essen DC, Ugurbil K, Auerbach E, Barch D, Behrens TE, Bucholz R, et al. The human connectome project: a data acquisition perspective. Neuroimage 2012;62 (4):2222–31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [3].Glasser MF, Sotiropoulos SN, Wilson JA, Coalson TS, Fischl B, Andersson JL, et al. The minimal preprocessing pipelines for the human connectome project. Neuroimage 2013;80:105–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Weninger L, Na C-H, Jütten K, Merhof D. Analyzing the effects of free water modeling by deep learning on diffusion mri structural connectivity estimates in glioma patients. PloS One 2020;15(9):e0239475. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5].Basser PJ, Mattiello J, LeBihan D. Mr diffusion tensor spectroscopy and imaging. Biophys J 1994;66(1):259–67. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].Hoy AR, Koay CG, Kecskemeti SR, Alexander AL. Optimization of a free water elimination two-compartment model for diffusion tensor imaging. Neuroimage 2014;103:323–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [7].Molina-Romero M, Gomez PA, Albarqouni S, Sperl JI, Menzel MI, Menze BH. Deep learning with synthetic data for free water elimination in diffusion mri. Proc Intl Soc Mag Reson Med 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Arezza NJ, Santini T, Omer M, Baron CA. Estimation of free water-corrected microscopic fractional anisotropy. Front Neurosci 2023;17:1074730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Golub M, Neto Henriques R, Gouveia Nunes R. Free-water dti estimates from single b-value data might seem plausible but must be interpreted with care. Magn Reson Med 2021;85(5):2537–51. [DOI] [PubMed] [Google Scholar]
  • [10].Pasternak O, Sochen N, Gur Y, Intrator N, Assaf Y. Free water elimination and mapping from diffusion mri. Magn Reson Med 2009;62(3):717–30. [DOI] [PubMed] [Google Scholar]
  • [11].Bergmann Ø, Henriques R, Westin C-F, Pasternak O. Fast and accurate initialization of the free-water imaging model parameters from multi-shell diffusion mri. NMR Biomed 2020;33(3):e4219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Scherrer B, Warfield SK. Why multiple b-values are required for multi-tensor models. Evaluation with a constrained log-euclidean model. In: 2010 IEEE international symposium on biomedical imaging: from nano to macro. IEEE; 2010. p. 1389–92. [Google Scholar]
  • [13].Yao T, Newlin N, Kanakaraj P, Nath V, Cai LY, Ramadass K, et al. A unified learning model for estimating fiber orientation distribution functions on heterogeneous multi-shell diffusion-weighted mri. In: International workshop on computational diffusion MRI. Springer; 2023. p. 13–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Van Essen DC, Smith SM, Barch DM, Behrens TE, Yacoub E, Ugurbil K, et al. The wu-minn human connectome project: an overview. Neuroimage 2013;80:62–79. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Bookheimer SY, Salat DH, Terpstra M, Ances BM, Barch DM, Buckner RL, et al. The lifespan human connectome project in aging: an overview. Neuroimage 2019;185:335–48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Aerts H, Schirner M, Jeurissen B, Van Roost D, Achten E, Ritter P, et al. Modeling brain dynamics in brain tumor patients using the virtual brain. Eneuro 2018;5(3). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Aerts H, Schirner M, Dhollander T, Jeurissen B, Achten E, Van Roost D, et al. Modeling brain dynamics after tumor resection using the virtual brain. NeuroImage 2020;213:116738. [DOI] [PubMed] [Google Scholar]
  • [18].Gur Y, Pasternak O, Sochen N. Fast gl (n)-invariant framework for tensors regularization. Int J Comput Vis 2009;85:211–22. [Google Scholar]
  • [19].Correia MM, Henriques RN, Golub M, Winzeck S, Nunes RG. The trouble with freewater elimination using single-shell diffusion mri data: a case study in ageing. Imaging Neurosci 2024;2:1–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [20].Henriques RN, Rokem A, Garyfallidis E, St-Jean S, Peterson ET, Correia MM. [re] optimization of a free water elimination two-compartment model for diffusion tensor imaging. BioRxiv 2017:108795. preprint. [Google Scholar]
  • [21].Liu H, Fan Y, Li H, Wang J, Hu D, Cui C, et al. Moddrop++: a dynamic filter network with intra-subject co-training for multiple sclerosis lesion segmentation with missing modalities. In: International conference on medical image computing and computer-assisted intervention. Springer; 2022. p. 444–53. [Google Scholar]
  • [22].Goodwin-Allcock T, McEwen J, Gray R, Nachev P, Zhang H. How can spherical cnns benefit ml-based diffusion mri parameter estimation?. In: Computational diffusion MRI: 13th international workshop, CDMRI 2022, held in conjunction with MICCAI 2022, Singapore, Singapore, September 22, 2022, proceedings. Springer; 2022. p. 101–12. [Google Scholar]
  • [23].Karayumak SC, Bouix S, Ning L, James A, Crow T, Shenton M, et al. Retrospective harmonization of multi-site diffusion mri data acquired with different acquisition parameters. Neuroimage 2019;184:180–200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [24].Huo Y, Xu Z, Xiong Y, Aboud K, Parvathaneni P, Bao S, et al. 3d whole brain segmentation using spatially localized atlas network tiles. NeuroImage 2019;194: 105–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [25].Jenkinson M, Beckmann CF, Behrens TE, Woolrich MW, Smith SM. Fsl. Neuroimage 2012;62(2):782–90. [DOI] [PubMed] [Google Scholar]
  • [26].Cai LY, Yang Q, Hansen CB, Nath V, Ramadass K, Johnson GW, et al. Prequal: an automated pipeline for integrated preprocessing and quality assurance of diffusion weighted mri images. Magn Reson Med 2021;86(1):456–70. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [27].Garyfallidis E, Brett M, Amirbekian B, Rokem A, Van Der Walt S, Descoteaux M, et al. Dipy, a library for the analysis of diffusion mri data. Front Neuroinform 2014; 8:8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].Lee JM. Introduction to Riemannian manifoldsvol. 2. Springer; 2018. [Google Scholar]
  • [29].Ozarslan E, Koay C, Shepherd TM, Blackband SJ, Basser PJ. Simple harmonic oscillator based reconstruction and estimation for three-dimensional q-space mri. Proc Intl Soc Mag Reson Med 2009;17:1396. Citeseer. [Google Scholar]
  • [30].Nath V, Schilling KG, Parvathaneni P, Hansen CB, Hainline AE, Huo Y, et al. Deep learning reveals untapped information for local white-matter fiber reconstruction in diffusion-weighted mri. Magn Reson Imaging 2019;62:220–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [31].Cheng J, Jiang T, Deriche R. Theoretical analysis and practical insights on eap estimation via a unified hardi framework. In: MICCAI workshop on computational diffusion MRI (CDMRI); 2011. [Google Scholar]
  • [32].Cobb OJ, Wallis CG, Mavor-Parker AN, Marignier A, Price MA, d’Avezac M, et al. Efficient generalized spherical cnns. arXiv preprint; 2020. arXiv:2010.11661. [Google Scholar]
  • [33].Kondor R, Trivedi S. On the generalization of equivariance and convolution in neural networks to the action of compact groups. In: International conference on machine learning. PMLR; 2018. p. 2747–55. [Google Scholar]
  • [34].Kondor I, Trivedi S, Lin Z. A fully fourier space spherical convolutional neural network based on clebsch-gordan transforms. US Patent App; Sep. 2, 2021. 17/253,840. [Google Scholar]
  • [35].Tournier JD, Calamante F, Connelly A, et al. Improved probabilistic streamlines tractography by 2nd order integration over fibre orientation distributions. Proc Int Soc Magn Reson Med 2010;1670. John Wiley & Sons, Inc. New Jersey, USA. [Google Scholar]
  • [36].Wasserthal J, Neher P, Maier-Hein KH. Tractseg-fast and accurate white matter tract segmentation. NeuroImage 2018;183:239–53. [DOI] [PubMed] [Google Scholar]
  • [37].Molina-Romero M, Wiestler B, Gomez PA, Menzel MI, Menze BH. Deep learning with synthetic diffusion mri data for free-water elimination in glioblastoma cases. In: International conference on medical image computing and computer-assisted intervention. Springer; 2018. p. 98–106. [Google Scholar]
  • [38].Albi A, Pasternak O, Minati L, Marizzoni M, Bartres-Faz D, Bargallo N, Jovicich J. Free water elimination improves test–retest reproducibility of diffusion tensor imaging indices in the brain: A longitudinal multisite study of healthy elderly subjects. Human Brain Mapping 2016;38:12–26. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES