Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2025 Jul 28:2024.11.11.622925. [Version 3] doi: 10.1101/2024.11.11.622925

Simulation-Based Inference at the Theoretical Limit: Fast, Accurate Microstructural MRI with Minimal diffusion MRI Data

Maximilian F Eggl 1,2,, Silvia De Santis 1,
PMCID: PMC12324183  PMID: 40766636

Abstract

Diffusion-weighted magnetic resonance imaging (dMRI) is an essential tool in neuroscience, providing non-invasive insights into brain microstructure. However, obtaining accurate maps demands long acquisitions, due to oversampling of the parameter space. Leveraging simulation-based inference (SBI) with neural networks, we directly approximate posterior distributions of diffusion parameters from experimental measurements without requiring real-data training. SBI achieves accurate parameter estimation using up to 90% fewer acquisitions and outperforms standard nonlinear least squares under noisy, sparse sampling. We demonstrate improvements across diffusion tensor imaging, diffusion kurtosis imaging, and biophysical models estimating axonal density and calibre. Validation on simulated and real datasets from healthy and pathological brains shows SBI’s robustness and generalizability. This approach can expand dMRI access, e.g. for paediatric and other time-sensitive patients, enable advanced microstructure-sensitive protocols, and rescue legacy data with suboptimal quality. By reducing scan times and preserving privacy, SBI-integrated dMRI workflows promise faster, more comfortable exams, MRI-based virtual tissue biopsy, and have potential to substantially impact radiology workflows.

1. Introduction

By sensitizing magnetic resonance imaging (MRI) signals to water’s random motion, diffusion-weighted MRI (dMRI) has become a pivotal neuroimaging tool, providing unique insights into tissue microstructure in vivo and noninvasively [1, 2]. When combined with mathematical representations based on tensors like diffusion tensor imaging (DTI) [3] or diffusion kurtosis imaging (DKI) [4], dMRI has significantly impacted clinical applications [5, 6]. More recently, advances aiming at dissecting the MRI signal coming from distinct brain tissue compartments has led to the development of biophysical representations, which notably allow for direct measurement of fine cell properties like soma size, neurite morphology, permeability and axon radius [713]. These insights, thus, provide a non-invasive way of deriving meaningful biomarkers of health and disease, effectively positioning dMRI as a possible tool in achieving virtual tissue biopsy. However, noisy acquisitions substantially affect these methods, compromising the accuracy and reliability of fitted diffusion parameters [14, 15]. While denoising techniques exist (e.g., [1619]), fitting methods still demand extensive oversampling— lengthening scan times by up to 10 times the theoretical minimum or more, excluding some patients, and precluding richer multimodal protocols [2022]. In fact, the vast majority of toolboxes and software used to extract parameter maps from dMRI data utilize least square approaches, statistical frameworks which determine the optimal parameters of a model by minimizing the sum of the squared differences between observed and predicted values. However, these methods, including non-linear least squares (NLLS) and weighted least squares (WLS) [3, 19, 23], besides requiring massive oversampling, are additionally sensitive to initial parameter guesses and may not fully capture uncertainties in noisy, low signal-to-noise ratio (SNR) conditions [14]. More modern approaches like Rician-unbiased or variational Bayesian [24, 25] have been introduced to mitigate these issues. However, their high computational cost, reliance on precise noise-variance estimates under a strict Rician model, and tendency toward overconfident posteriors in low-SNR regions have so far limited their practical adoption. Due to these limitations, even basic diffusion models remain underused in clinical practice, and more insightful methods rarely reach clinical researchers. A huge opportunity to translate recent MRI advances into clinical practice is lost in the process.

Simulation-based inference (SBI) sidesteps these issues by using simulated acquisitions to learn posterior distributions of diffusion parameters [2629]. Neural posterior estimation (NPE) [30, 31] trains a deep network on simulated data, yielding robust fits even in low-SNR regimes. NPE has been successfully applied in fields like cosmology [32], neuroscience [33, 34], and imaging neuroscience [3538]. Nonetheless, in the context of dMRI this framework has not been used previously to explore the idea of achieving accurate parameter maps with the minimal amount of theoretically required information. Importantly, SBI relies on simulations, contrasting with other deep learning approaches that require training on real data [39, 40], making it data- and privacy-friendly.

Since the diffusion-based representations we study commonly have straightforward simulation paradigms, SBI, and particularly NPE, is a natural fit for enhancing fitting accuracy. This work demonstrates how applying SBI to different tensor and non-tensor based dMRI representations allows direct fitting of the models and extraction of accurate parameter maps with minimal or significantly reduced acquisitions. Here, we focus on four different established models: DTI, DKI, Composite Hindered And Restricted ModEL of Diffusion (CHARMED) and AxCaliber, but the approach is inherently flexible and readily extendable to emerging or future models. By validating the capabilities of SBI and dMRI using simulated and real scans from healthy and pathological datasets and across different sampling schemes [41, 42], we believe this approach can substantially impact radiology in research and clinical settings, providing a framework to extract the maximum amount of microstructural information with minimum acquisition time and revolutionize accessibility to both basic and advanced dMRI.

2. Methods

2.1. Tensor-based frameworks

In DTI, diffusion is modelled as a Gaussian process using the diffusion tensor D, a 3 × 3 symmetric positive-definite matrix representing diffusion coefficients in three-dimensional space. The signal attenuation is given by:

S(b,g)=S0expbgDg, (1)

where S(b,g) is the measured signal with diffusion weighting factor b and gradient orientation g, and S0 is the non-diffusion-weighted signal. This framework allows derivation of metrics related to the diffusion tensor like mean diffusivity (MD) and fractional anisotropy (FA) (Fig. 1a), widely employed for diagnostic purposes (e.g. [43]; review: [44]). However, the Gaussian assumption underlying DTI limits its sensitivity to more complex diffusion phenomena.

Fig. 1: Overview of the dMRI data processing and analysis pipeline.

Fig. 1:

a) dMRI data from the brain captures the diffusion of water molecules as they interact with the underlying brain microstructure. This data can be analysed with various models, for example, mathematical signal representations such as DTI or DKI or more advanced multi-compartment models such as CHARMED and AxCaliber. These models then provide interpretable biomarkers which give insight into the underlying health of the brain parenchyma. b) When sampling different gradient strengths, gradient directions, and diffusion times, the signal acquires a fingerprint of the underlying microstructure. By using a simulation-based inference approach, this signal is provided to a network pre-trained on simulated signals generated by a model (e.g., DTI). This network then generates the posterior distribution over the model parameters given the measured diffusion signal.

To overcome this limitation, DKI extends the DTI model by incorporating non-Gaussian effects, capturing the kurtosis of water diffusion. The signal attenuation in DKI is modelled as:

S(b,g)=S0expbgDg+16b2gDg2K(g) (2)

where K(g) is the kurtosis coefficient in orientation g [4]. DKI captures more complex diffusion dynamics, providing metrics like mean kurtosis (MK) for deeper microstructural insights. Other DKI metrics include axial kurtosis (AK), radial kurtosis (RK), the mean kurtosis tensor (MKT), and kurtosis fractional anisotropy (KFA). MKT and KFA, though less common, specifically assess the accuracy of kurtosis tensor estimates [45, 46].

2.2. The AxCaliber model

Here, we briefly introduce the AxCaliber framework, an extension of the CHARMED formulation (for full derivations, see [12, 47]). AxCaliber models the signal as arising from two compartments: a hindered component (denoted as ()h) and a restricted component (denoted ()r), as illustrated in Fig. 1a. For the hindered component, we use the standard DTI framework with Dh as the diffusion tensor. The restricted compartment, representing white matter microstructure, is modelled as radially symmetric cylindrical structures (i.e., no radial diffusion). Its signal is decomposed into parallel and perpendicular components. Assuming the cylinder is long relative to the diffusion distance, the parallel component simplifies to a simple 1-D exponential decay with D as the scalar diffusion coefficient. The perpendicular component, influenced by the cylinder’s radius, R, is significantly more complex, and the full expression is omitted here but can be found in [48]. A key aspect of AxCaliber is recognizing that axon bundles feature a distribution of radii. Originally a gamma distribution was used, but difficulties in fitting both shape and scale motivated our use of a Poisson model [49]. This model requires fitting only the mean radius λ, simplifying interpretation.

Thus, the complete expression of the diffusion signal becomes

S(b,g)=fhSh(b,g)+fri=0NwRiSrb,g,Ri, (3)

where fh, fr are the fractions of the hindered and restricted components, Sh, Sr are the diffusion signals of those compartments and wRi are the normalized weights from the Poisson distribution. Importantly, the AxCaliber formulation posits that varying the diffusion time Δ (with other parameters adjusted to keep the same b-value) yields different signals, as shown in Fig. 1b. Intriguingly, the simplified Poisson formulation requires fitting only a single parameter (similar to the ActiveAx implementation [50], which only fits the mean axon diameter), theoretically enabling axon diameter estimation from a single diffusion time.

2.3. Simulation-based inference fitting framework

In this work, we employed NPE [31, 51], an approach in the family of SBI-methods to fit the parameters of a given diffusion model. NPE leverages normalizing flows [52], flexible probabilistic models constructed by applying invertible transformations to simple distributions, such as Gaussians. By utilising neural networks to learn the normalizing-flow transformations, NPE can approximate the posterior over parameters even when the likelihood is intractable.

Fig. 1b provides a schematic overview of the NPE procedure. Briefly, during training, candidate parameters are sampled from a prior distribution, and synthetic signals are generated by applying these parameters to our forward DTI, DKI or AxCaliber models. The NPE network is optimized by minimizing the negative log probability of the sampled parameters under its predicted posterior, conditioned on each synthetic signal. After convergence, we input experimental data into the trained network to obtain the posterior distribution of model parameters. Since the network outputs a full posterior distribution given the input data, it naturally provides a measure of uncertainty that reflects the confidence in our parameter estimates. To obtain a single parameter estimate from the posterior, several approaches can be adopted, such as taking the mean (when the posterior is well-behaved), the mode (histogram mode), or employing more advanced techniques like Maximum a Posteriori (MAP) estimation [53]. In this work, we used the posterior mean, computed efficiently by averaging a large batch of parallel samples, eliminating additional voxel-wise optimization and scaling easily to whole-slice inference.

2.4. In silico data

As SBI relies on simulated data for training, realistic simulations are crucial for accurate results [54, 55]. Therefore, for the three investigated models, the priors used to sample the relevant parameters were designed to mimic features of real human data [28, 56].

For DTI simulations, parameters were sampled from uniform distributions and adjusted to ensure the diffusion tensor was positive definite and symmetric. It was found that the uniform distribution for the diagonal entries needed to be roughly twice as wide as that for the off-diagonal entries. With this approach, we were able to effectively replicate the HCP data features without prior knowledge (see Supplementary Fig. S1).

DKI simulations required a more sophisticated approach due to the non-linear relationship between DTI and DKI terms and challenges in defining suitable priors. Generating stable signals was crucial because DKI includes a positive b2 term, which can lead to unstable simulations if D and K are not carefully chosen. To address this, prior distributions were obtained from a single subject of the HCP dataset (excluded from further analysis) to generate diffusion and kurtosis tensors that reflect real-world non-linear relationships while remaining biologically plausible. More precisely, where appropriate, we fit log-normal or normal distributions to each of the entries of the diffusion and kurtosis tensors (see Supplementary Figs. S2b-d). Approximately 15% of tensors yielded unstable signals due to mathematically invalid tensor–kurtosis combinations (not biological variability) and were resampled.

Finally, the AxCaliber framework models a combination of diffusion tensors (for the hindered component) and biologically driven parameters. Diffusion tensors followed the same procedure as above, while priors for biological parameters were uniform distributions informed by literature (see for example axon size [57, 58]).

Simulated signals were generated using a Monte Carlo approach and corrupted with Rician noise at varying signal-to-noise ratios [59]. Simulations were split into training and testing sets; neural networks were trained on the training set to estimate model parameters, and the testing set was used to compare fitted parameters of both SBI and NLLS against the available ground truth. All simulations and tensor-specific metric calculations were performed using Dipy functions [60], while we implemented an in-house Python implementation of the AxCaliber framework.

We generated 300,000 training samples for each model. Unless stated otherwise, in silico inference results were generated using 500 simulated signals. DKI priors were fitted using scipy.stats, and NPE was performed with the sbi-toolbox [29].

2.5. Human dMRI data

To validate the performance of the SBI framework for fitting DTI and DKI parameters using real-world data, we selected 32 individuals (17 females and 15 males, ages 20–55) from the Human Connectome Project (HCP) database. Initial pre-processing was performed by the HCP [Minimal Preprocessing Pipeline 61], while the study-specific pre-processing consisted of downsampling the images from 1.5 mm/voxel to 2.5 mm/voxel. This downsampling was performed for the sake of computational efficiency, but all results in this work also apply to the higher 1.5 mm/voxel resolution (see supplemental fig. S3). The datasets consisted of two sets of b-values, b=1000 and b=3000, with 69 gradient acquisitions per b-shell.

To validate the DKI fits on a different dataset as well as study the AxCaliber model, which requires multiple diffusion times, we selected 8 additional individuals (6 females and 2 male, ages 26–55, 5 with MS and 3 healthy subjects) from the dataset of [42]. This data, available through the public repository DIGITAL.CSIC and preprocessed with the same HCP pipeline, consisted of two sets of b-values, b=2000 and b=4000, with 30 and 59 gradient acquisitions, respectively. Diffusion times (Δ) were utilised in the AxCaliber setting and were set to 7, 35, and 61 ms. Images were also downsampled to 2.5 mm/voxel as above.

2.6. Number of acquisitions used

A key focus of this work is evaluating how SBI and NLLS performance varies with the number of acquisitions used under DTI, DKI, and AxCaliber models. To this end, we defined the following acquisition subsets:

  • DTI full set: 69 acquisitions (5 B0, and 64 with b=1000)

  • DTI medium set: 20 acquisitions (1 B0, and 19 with b=1000) which, according to [21], represents a realistic and near-optimal set of 20 orientations sufficient for the DTI framework.

  • DTI minimum set: 7 acquisitions (1 B0, and 6 with b=1000, theoretical minimum)

  • DKI full set: 138 acquisitions (10 B0, 64 with b=1000, 64 with b=3000)

  • DKI medium set: 35 acquisitions (1 B0, 6 with b=1000, 28 with b=3000)

  • DKI minimum set: 22 acquisitions (1 B0, 6 with b=1000, 15 with b=3000)

  • AxCaliber full set: 271 acquisitions (1 B0, 30 with b=2000, 60 with b=4000, repeated for three diffusion times)

  • AxCaliber reduced set: 19 acquisitions (1 B0, 2 with b=2000, 3 with b=4000, repeated for three diffusion times)

Gradient orientations in all simulated acquisition sets were selected using electrostatic repulsion principles [20] to ensure optimal angular coverage, primarily to support NLLS. Notably, SBI remained robust even under suboptimal orientation and b-value configurations, underscoring its suitability in data-limited settings (see Supplemental Fig. S4).

2.7. Metrics to evaluate SBI and NLLS performance

In our in-silico simulations, access to the ground truth allows direct comparison between estimated and true metrics. We quantified error as the absolute difference between true and estimated metrics.

For human data, lacking ground truth, we compared results from minimal and medium acquisition sets against the full set. By working with images rather than isolated signals, we leveraged structural information to assess performance. As a proxy for accuracy, we used the Structural Similarity Index (SSIM) [62], which measures similarity based on luminance, contrast, and structure. SSIM values closer to 1 indicate higher similarity; we set a minimum desired performance threshold of 66% based on clinical interpretability criteria, as preliminary analyses showed that SSIM values above this level preserved structural features necessary for clinical interpretation. Such features include lesion boundaries and tissue interfaces, ensuring minimal perceptible differences for clinicians. However, we note that most of our SBI results reach a much higher benchmark (often > 90%), meaning that the results become virtually indistinguishable from the ground truth versions.

To assess precision, we calculated the standard deviation of the fitted metric within white matter. We considered performance acceptable when the interquartile range of minimum or reduced acquisition fits matched that of full acquisitions.

3. Results

3.1. Validating SBI in DTI and DKI

3.1.1. In-silico analysis

For DTI, we began by generating a single signal and adding noise at varying SNR levels (Fig.2a). Estimating MD and FA from the 200 noisy versions, we found that NLLS was accurate at SNR > 10 but declined at lower SNR, while SBI remained robust down to SNR = 2 with lower variance (Fig.2b).

Fig. 2: SBI applied to simulated data of DTI and DKI models shows more accurate performance than standard NLLS.

Fig. 2:

a) dMRI data is affected by varying levels of noise, which impacts signal clarity. The effect of different signal-to-noise ratios (SNR = 2–20; dashed gray) is shown in comparison to the true signal (black). b) Accuracy of estimates of DTI metrics (MD - top and FA - bottom) obtained using the SBI (teal) and NLLS (orange) approach. The black dashed line indicates the true parameter values while the box plots represent median values, with boxes showing IQR and whiskers extending to 1.5 times the IQR. c-d) Using either the full set of acquisitions (c) or the minimum (d), both NLLS and SBI are fit to 500 simulated signals with the absolute difference from the true values in MD and FA being shown. Box plots as in (b). e) Under the DKI framework, SBI and NLLS are used to infer model parameters and reconstruct an example DKI signal from the full acquisition set (SNR = 20). f) To evaluate performance across a range of simulated signals, 500 example DKI signals were generated and subjected to four SNRs (20, 10, 5, and 2). The resulting parameter fits were compared to the true parameters. Box plots as in (b). g-h) Repeating the analysis in panels e–f with the minimum DKI set of acquisitions.

Across 500 noisy signals, both methods performed similarly at high SNR with full acquisitions (Fig.2c). At lower SNR, SBI’s errors remained lower. With minimum acquisitions (Fig.2d), MD estimates remained stable, but FA from NLLS degraded substantially, while SBI retained better accuracy. Supplemental Fig. S5 confirms that SBI reconstructs the true signal more faithfully, while NLLS tends to overfit the noise (higher correlation with the noisy signal than SBI in a iv) and vi) of the supplemental figure).

Using simulated DKI data based on HCP characteristics (Supplementary Fig. S6), we first compared SBI and NLLS using the full set of acquisitions. With a single signal at SNR = 20, both methods reconstructed the signal effectively (Fig. 2e). Across 500 noisy signals, SBI outperformed NLLS at all SNR levels, maintaining higher robustness (Fig. 2f).

Next, we used the minimum set of acquisitions for the DKI inference. In reconstructing the signal (Fig. 2g), SBI achieved more accurate results despite limited data, especially at higher b-values where NLLS deviated noticeably from the true signal. When comparing across 500 signals (Fig. 2h), in general both methods performed worse with minimal data than with full measurements due to reduced information. Interestingly, as SNR decreased, NLLS performance sometimes improved with fewer measurements—consistent with DKI’s known sensitivity to noise, where additional noisy data can degrade fits. In contrast, SBI maintained stable performance across SNR levels and acquisition sizes, effectively extracting reliable information even from minimal data.

3.1.2. In-vivo analysis

To mimic clinical application, we examined MD and FA maps from a medial axial slice of one HCP subject, comparing SBI and NLLS (Figs. 3a,c). Both methods yielded similar structures, though NLLS had lower MD values in the lateral ventricles. Based on trends from in-silico data — where NLLS tended to underestimate MD — we argue that SBI likely provides a more accurate reflection of the true tensor metrics. For FA, both methods yielded similar results, with differences driven primarily by noise.

Fig. 3: Even with minimal information SBI is able to get meaningful estimates on real-world data where NLLS fails.

Fig. 3:

a-b) Comparison of MD values obtained with SBI and NLLS using the full and minimal acquisition sets, respectively (differences below). c-d) Similar to (a–b), but showing FA instead of MD. e-f) MD and FA were fit with SBI (single network, teal) and NLLS (orange) on medial-axial slices from 31 HCP subjects, using full, medium, and minimal acquisition sets. We report absolute errors in MD and FA, SSIM, and the standard deviation of each metric in white matter. Given a lack of an objective ground truth, the full NLLS and full SBI results are used for their respective subsets. A comparison between the full SBI and full NLLS is shown in the dark gray boxplot. Boxplots represent the median values while the boxes represent the IQR and the whiskers 1.5 the IQR. The dashed black line in the SSIM plots refers to our acceptable minimum accuracy. The shaded areas in the standard deviation plots refer to the interquartile range of the full-set result. g-h) The same analysis as in e, f but performed for 8 individuals (3 control and 5 with MS) of the [42] dataset.

With minimal measurements (Figs. 3b,d), SBI’s advantage becomes clear. SBI’s MD map closely matches the full version (SSIM = 0.97), whereas NLLS appears noisier (SSIM = 0.59). Even the more complex FA metric is effectively reproduced by SBI, whereas NLLS loses structural information.

We then trained a generalist network to simultaneously evaluate 31 HCP subjects. Without ground truth, we compared each subset against its own full fit. The results of the absolute error, SSIM, and white matter standard deviation of the metrics (when compared against the respective full fit) are shown in Fig. 3e and f. For SBI, the accuracy of both MD and FA remains well above the 66% minimum accuracy we set (with the minimum SBI MD only losing approximately 5.6% SSIM accuracy with less than 10% of the available information) and far outperforms the minimum NLLS results (65% average SSIM MD accuracy loss). SBI maintained consistent precision in white matter; NLLS deviated from the full acquisition IQR under minimal conditions and showed artificially low FA variability in the minimum acquisition setting due to producing uniform (and incorrect) values. We note that a similar picture appears in the [42] dataset that includes both healthy and pathological individuals (Fig. 3g-h). Here both NLLS and SBI perform well for the MD metric, even under the sparsest conditions. However, NLLS is unable to obtain accurate FA measurements for this case, which SBI is still able to do. Importantly, we note that the b-values of that dataset (b=2000) are not optimal for the purposes of DTI. Nonetheless, SBI is able to obtain reasonable parameter estimates even with this sub-optimal paradigm.

To ascertain if applying denoising (in this case MP-PCA[63] to the dataset before fitting NLLS would improve its performance, we took an individual from the HCP dataset and added noise at varying strengths (SNR = 50 to 10) and then denoised the image and fit NLLS - on the other hand SBI was directly applied to the corrupted images. The results of this are seen in supplemental figure S7. We note that SBI is able to perform the fit on the noisy results where NLLS still fails.

Next, we repeated this analysis for the DKI model. With the full acquisition set, both SBI and NLLS captured similar structural features across kurtosis metrics (Fig. 4a-b). Importantly, the kurtosis-specific metrics, such as MKT and KFA, were well resolved by both methods (see Fig. S8a). However, SBI showed greater stability, while NLLS produced gaps from unrealistic or negative values, visible as black dots.

Fig. 4: Even with minimum acquisitions, SBI can resolve kurtosis metrics on real-world data where NLLS fails.

Fig. 4:

a-b) SBI (a) and NLLS (b) were applied to the axial slice of a held-out individual using full and minimum acquisition sets. This enables qualitative comparison of fitted kurtosis tensor metrics, with black dots marking large negative or invalid values. Differences between full and minimum fits (for both SBI and NLLS) are shown as heat maps (darker red = larger error). c-e) SBI (teal) and NLLS (orange) were fit to medial-axial slices from 31 HCP subjects using full, medium, and minimum acquisition sets (via a single network). Panels show absolute error, SSIM, and standard deviation of each metric in white matter, analogous to the DTI analysis. Full-set SBI and NLLS results serve as references for their corresponding reduced-set fits; gray boxplots compare full SBI vs full NLLS for error and SSIM. Boxplots show medians; boxes, IQR; whiskers, 1.5× IQR. Orange denotes NLLS; teal, SBI. Dashed black lines in SSIM plots mark our minimum-acceptable accuracy. Shaded areas in standard deviation plots indicate the IQR of the full acquisition result. f-h) The same analysis as in c-e but performed for 8 individuals (3 control and 5 with MS) of the [42] dataset.

Considering the minimum acquisition setting, SBI maintained high-quality reconstructions across DKI metrics. In contrast, NLLS failed under these conditions, producing implausibly negative values. Compared to SBI, NLLS exhibited greater structural discrepancies between full and minimum results, indicating NLLS’s inability to recreate accurate tensor metrics with limited data.

As before, we trained a network capable of fitting DKI to 31 HCP individuals simultaneously, with the resulting absolute error, SSIM, and standard deviations of the metrics in Fig. 4c-e. The increase in absolute error from medium to minimum acquisitions was substantially smaller for SBI than for NLLS. Additionally, SBI achieved higher SSIM scores than its NLLS counterpart. SBI also maintained stable precision, unlike NLLS, which degraded with fewer measurements. To address potential bias from using a single HCP subject for simulation priors, we tested generalizability using a different dataset[42] in Fig. 4f-h. In particular, we applied the same protocol to both healthy individuals and those exhibiting MS. Across the board, SBI was able to get accurate DKI metrics (> 0.85 SSIM score for the minimum set for MK and RK) that mirrored the results of the HCP result.

Importantly, due to the non-linearities of the DKI models, we used it to run a more extensive benchmarking of SBI against other fitting strategies, less commonly adopted than NLLS, but potentially more stable at low SNRs, namely weighted least square, Maximum likelihood estimation (MLE) and a variational Bayesian approaches. We compared the performance of all approaches in both simulations and the HCP dataset. In all cases, SBI was able to outperform these methods, both for the full and minimum sets (see Supplemental Fig. S9).

3.2. Application of SBI to AxCaliber

We evaluated AxCaliber performance (and by extension, CHARMED) using both in-silico simulations and in-vivo data by first testing the SBI framework on synthetic signals with known ground truth, followed by application to real-world data from the [42] dataset. To start, we generated a set of 500 noisy signals and then fitted AxCaliber to each with SBI and NLLS. The average difference between the inferred angles of the restricted component and the true angles at SNR = 30 is shown in Fig. 5a. Here, the red arrow represents the true axon orientation. Overall, SBI (both full and reduced) yields more accurate axon angle estimates than NLLS (tighter circles).

Fig. 5: Performance of SBI and NLLS for in-silico and in-vivo AxCaliber data.

Fig. 5:

a) For 20 simulated signals with SNR = 30, we calculated the average difference in the angle of the restricted compartment for each fitting routine. The red arrow indicates zero error. Increasing circle sizes denote errors (in radians) of 0.065, 0.24, 0.34, and 0.65 for full SBI, reduced SBI, full NLLS, and reduced NLLS, respectively. b-e) Absolute differences between fitted and true parameters for (b) mean radius, (c) restricted fraction, (d) perpendicular diffusion, and (e) hindered fractional anisotropy, using SBI (teal) and NLLS (orange). Black lines show medians; boxes, IQR; whiskers, 1.5× IQR. f) Example restricted signal fraction fits for one axial slice, comparing SBI and NLLS. g) Example coronal slice of the same individual, with corpus callosum–average axonal radius overlay.h) Similarly, segmenting the corpus callosum in two more individuals permits quantitative comparison of reduced versus full acquisition sets (ground truth). Green triangles indicate means; middle lines, medians; boxes, IQR; whiskers, 1.5 ×IQR. i)-k) Structural Similarity Index of (i) restricted fraction, (l) Hindered Mean diffusivity, and (l) rest. perpendicular diffusion, comparing reduced to full acquisitions for SBI (teal) and NLLS (orange).

We then extended our analysis to assess the effects of varying SNR levels on parameter estimation. Signals were generated and corrupted at SNR levels of 2, 10, 20, and 30. For each noise level, we computed the absolute differences between the true and fitted parameters, focusing on the mean radius, restricted signal fraction, perpendicular restricted diffusion, and hindered FA (Figs. 5be). In most cases, SBI substantially outperformed NLLS. As expected, increased SNR generally led to reduced errors, indicating improved accuracy with lower noise. However, while full SBI consistently yielded the best performance, NLLS errors did not decrease in the same way. This confirms that for certain parameters — such as mean radius, whose contribution may be small relative to the noise — NLLS struggles to produce accurate estimates, even at SNR = 30.

We next applied the AxCaliber framework to the real-world data. Examples of parameter fits for one individual are shown in Fig. 5f (restricted fraction) and Fig. 5g (average axon radius). Both methods yield similar restricted fraction maps under the full acquisition setting. With reduced data, however, SBI retains more structural detail and exhibits lower noise levels than NLLS.

As illustrated in Fig. 5g, SBI successfully captures the known structure of the axon sizes of the corpus callosum — revealing larger fibres in the body relative to the genu and splenium— for both the full and reduced acquisition sets. Although the reduced acquisition introduces some noise in the axon size fit, it consistently recapitulates the overall structure with the full set and previous findings [58]. In contrast, NLLS shows substantially more noise in its reconstruction and less detailed structure (both full and reduced sets).

We repeated this analysis for the other two healthy individuals and quantified the absolute differences between the inferred axon radius in the corpus callosum and the ground truth (defined by their respective full fit). The results, depicted in Fig. 5h, show that SBI reduced acquisition fits (teal) yield both smaller average and range of errors than NLLS (orange). Importantly, the NLLS fits show a large amount of voxels with the maximum possible error (clustering around 5 μm) implying that the reduced NLLS is unable to consistently obtain the correct axon size.

We next also evaluated SBI on additional key AxCaliber metrics for the full set of 8 individuals to demonstrate that the framework works both for healthy and pathological settings. For this, an axial slice from each individual was selected, and the SSIM computed as a proxy for image accuracy. The results (Figs. 5hk) compare the reduced acquisition results to their full parameter counterparts. We note that SBI outperforms NLLS across all metrics. Particularly in the SSIM of the restricted fraction, while NLLS breaks down, SBI achieves average SSIM values above 0.8, implying a strong resemblance in the results. Given that all the other average SSIM values do not go below 0.7, even with only 10% of the available information for the fit, substantial structural insight is preserved.

4. Discussion

In this work, we evaluated how SBI, specifically NPE, performs across mathematical and biophysical models (DTI, DKI, and CHARMED/AxCaliber) under significant noise and limited acquisitions. We trained neural networks on large simulated datasets derived from the mathematical formulations of these models. This approach enhances parameter estimation accuracy while minimizing acquisition time, offering a robust and flexible tool for diffusion MRI analysis. Importantly, our results have the potential of solving a long-standing issue in the field: the huge gap between researcher’s and clinicians’ MRI, stemming from the fact that advanced methods are too time-consuming to be included in the clinical practice. As a consequence, clinical diagnosis in the daily practice of radiology services is still carried out with basic approaches. For example, in the current registered clinical trials in EU, only 3 out of more than 4000 using MRI employ an approach that goes beyond DTI [64]. SBI-based fitting can facilitate more widespread adoption of advanced dMRI approaches like DKI or AxCaliber in clinical applications, providing deeper insights than basic protocols typically used. Additionally, this framework enables the recovery of previously acquired data with short acquisition protocols, like those employed in valuable longitudinal initiatives started decades ago, or affected by noise [65]. Critically, all parameter estimates are obtained without relying on real data, avoiding concerns related to data availability and ethics [36, 37, 54, 55]. Our results could facilitate the use of advanced dMRI approaches like DKI or AxCaliber in clinical applications, providing deeper insights than basic protocols typically used. While we demonstrated this work only on one biophysical model, this approach is inherently flexible and readily extendable to emerging or future models [36].

This work fits within a broader trend of applying neural networks to dMRI tasks, including combining Bayesian frameworks, reconstructing data from undersampled acquisitions, fiber tracking, and enhancing resolution [39, 6669]. Some works [40, 70] considered using fewer acquisitions but i) only considered simpler models like DTI and ii) relied on real data for training, which can introduce issues with data availability, bias, and generalizability. In terms of SBI, one example of this approach is [36] who demonstrated that SBI is effective for parameter estimation in dMRI, providing quantifiable measures of model degeneracy not available with traditional methods. Other studies using NPE include [71] for epilepsy models, [72, 73] for intravoxel incoherent motion (IVIM) models and [74] who used it to accurately estimate white matter fibre orientations.

Our approach distinguishes itself by combining NPE with dMRI to study AxCaliber and DKI, beyond the linear DTI framework, assessing different noise levels and evaluating minimum acquisition protocols without requiring real data for training. This offers a flexible and efficient framework for accurate parameter estimation when acquisition time is limited. Unlike previous NPE approaches that use summary statistics via multi-layer perceptrons (MLPs) [35, 36], we directly input the diffusion signal to the network, eliminating the need for additional network training and reducing complexity. Two caveats arise: i) our neural networks are specific to the acquisition scheme, and ii) the input dimension can be large with full acquisitions. We addressed i) by providing an identifier for the acquisition scheme (via an integer flag), and for ii) noted that even full acquisition sets were computationally feasible on a personal computer. Future work will aim to incorporate summary statistics to retain simplicity and interpretability while gaining generalization over acquisition numbers and directions.

Nonetheless, our approach may have limitations that warrant further investigation. For example, the use of choice of prior may influence the result of our inference. For DTI, we used simple uniform distributions needing no pre-tuning while for DKI, priors initially required knowledge of parameter distributions from one individual. However, we showed these priors generalized across other individuals, datasets and pathological subjects [42], even with different b-values and protocols, suggesting minimal prior knowledge may suffice. Importantly, in the AxCaliber framework, using pre-reported values from existing literature for the prior distributions of the biological parameters proved sufficient to obtain robust model estimates. Additionally, we acknowledge that NLLS may be a weak baseline to compare SBI to - as its sensitivity to noise is a well-known problem. While NLLS remains one of the most widely used estimators in major toolboxes (for example, DIPY) and clinical studies because of its simplicity, interpretability, and low computational cost, it is limited by noise sensitivity and lack of regularization. Our goal with SBI was to match NLLS’s efficiency while improving robustness. However, we further validated our work by benchmarking the efficacy of SBI in reconstructing kurtosis markers (arguably the most unstable due to non-linearities in their formulation) against three other methods: weighted least squares, a Rician-unbiased approach, and a variational Bayesian approach.

In summary, this work demonstrates how data-friendly SBI methods can substantially minimize acquisition times in both research and clinical settings. By reducing acquisition times by up to 90% in DTI, 85% in DKI and 93% in AxCaliber while maintaining high accuracy, SBI addresses the key limitation of long scan durations in MRI. SBI’s robustness to noise and effectiveness with fewer data points make it especially valuable where high-quality data acquisition is challenging. Demonstrating efficacy in DTI, DKI and AxCaliber, complex models that are more sensitive to noise, opens avenues for impactful clinical research and real-world deployment in dMRI that, after more than 30 years since its first introduction, is still lacking.

Supplementary Material

Supplement 1
media-1.pdf (3MB, pdf)

Acknowledgements

We acknowledge the support of the Institute of Neuroscience, CSIC-UMH. We thank Pedro Gonçalves and Caterina Mainero for fruitful discussions and guidance. This research was funded by ”la Caixa” Foundation with Project-ID LCF/BQ/PI23/11970039 (MFE). SDS was supported by the the Spanish Ministerio de Ciencia e Innovación, Agencia Estatal de Investigación (PID2021-128909NA-I00 and CNS2023-14488 ), by the Programs for Centres of Excellence in R&D Severo Ochoa (CEX2021-001165-S), by the Generalitat Valenciana through a Subvencion para la contratación de investigadoras e investigadores doctores de excelencia 2021 (CIDEGENT/2021/015), and by the Fundacion Pasqual Maragall (ref. 2023-1296).

Footnotes

Competing Interests

The authors declare no competing interests.

Data and code availability

All data and code is available in the github repository: https://github.com/TIB-Lab/SBIDTI. Within that repository we also provide a link to the trained networks that were used in this work. The MS Data used in the preparation of this work were obtained at the Martinos Center for Biomedical Imaging at Massachusetts General Hospital (MGH) for the NIH 1R21NS123419 Project (Principal Investigator: Caterina Mainero, M.D., Ph.D.). Full MS data can be found at https://digital.csic.es/handle/10261/340684.

References

  • [1].Stejskal E. O. & Tanner J. E. Spin diffusion measurements: spin echoes in the presence of a time-dependent field gradient. The journal of chemical physics 42, 288–292 (1965). [Google Scholar]
  • [2].Le Bihan D. et al. Diffusion tensor imaging: concepts and applications. Journal of Magnetic Resonance Imaging: An Official Journal of the International Society for Magnetic Resonance in Medicine 13, 534–546 (2001). [DOI] [PubMed] [Google Scholar]
  • [3].Basser P. J., Mattiello J. & LeBihan D. Estimation of the effective self-diffusion tensor from the nmr spin echo. Journal of Magnetic Resonance, Series B 103, 247–254 (1994). [DOI] [PubMed] [Google Scholar]
  • [4].Jensen J. H., Helpern J. A., Ramani A., Lu H. & Kaczynski K. Diffusional kurtosis imaging: the quantification of non-gaussian water diffusion by means of magnetic resonance imaging. Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine 53, 1432–1440 (2005). [DOI] [PubMed] [Google Scholar]
  • [5].Grand S. et al. Perfusion imaging in brain disease. Diagnostic and Interventional Imaging 94, 1241–1257 (2013). [DOI] [PubMed] [Google Scholar]
  • [6].Koh D.-M. & Collins D. J. Diffusion-weighted mri in the body: applications and challenges in oncology. American Journal of Roentgenology 188, 1622–1635 (2007). [DOI] [PubMed] [Google Scholar]
  • [7].Barazany D., Basser P. J. & Assaf Y. In vivo measurement of axon diameter distribution in the corpus callosum of rat brain. Brain 132, 1210–1220 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Yi S. Y. et al. Detecting microglial density with quantitative multi-compartment diffusion mri. Frontiers in neuroscience 13, 81 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Garcia-Hernandez R. et al. Mapping microglia and astrocyte activation in vivo using diffusion mri. Science Advances 8, eabq2923 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [10].Palombo M. et al. Sandi: A compartment-based model for non-invasive apparent soma and neurite imaging by diffusion mri. Neuroimage 215, 116835 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [11].Assaf Y. & Basser P. J. Composite hindered and restricted model of diffusion (charmed) mr imaging of the human brain. Neuroimage 27, 48–58 (2005). [DOI] [PubMed] [Google Scholar]
  • [12].Assaf Y., Blumenfeld-Katzir T., Yovel Y. & Basser P. J. Axcaliber: a method for measuring axon diameter distribution from diffusion mri. Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine 59, 1347–1354 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Zhang H., Schneider T., Wheeler-Kingshott C. A. & Alexander D. C. Noddi: practical in vivo neurite orientation dispersion and density imaging of the human brain. Neuroimage 61, 1000–1016 (2012). [DOI] [PubMed] [Google Scholar]
  • [14].Jones D. K. & Basser P. J. “squashing peanuts and smashing pumpkins”: how noise distorts diffusion-weighted mr data. Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine 52, 979–993 (2004). [DOI] [PubMed] [Google Scholar]
  • [15].Farrell J. A. et al. Effects of signal-to-noise ratio on the accuracy and reproducibility of diffusion tensor imaging–derived fractional anisotropy, mean diffusivity, and principal eigenvector measurements at 1.5 t. Journal of Magnetic Resonance Imaging: An Official Journal of the International Society for Magnetic Resonance in Medicine 26, 756–767 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Coupé P. et al. An optimized blockwise nonlocal means denoising filter for 3-d magnetic resonance images. IEEE transactions on medical imaging 27, 425–441 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Manjón J. V. et al. Diffusion weighted image denoising using overcomplete local pca. PloS one 8, e73021 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Fadnavis S., Batson J. & Garyfallidis E. Patch2self: Denoising diffusion mri with self-supervised learning. Advances in Neural Information Processing Systems 33, 16293–16303 (2020). [Google Scholar]
  • [19].Veraart J., Sijbers J., Sunaert S., Leemans A. & Jeurissen B. Weighted linear least squares estimation of diffusion mri parameters: strengths, limitations, and pitfalls. Neuroimage 81, 335–346 (2013). [DOI] [PubMed] [Google Scholar]
  • [20].Jones D. K., Horsfield M. A. & Simmons A. Optimal strategies for measuring diffusion in anisotropic systems by magnetic resonance imaging. Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine 42, 515–525 (1999). [PubMed] [Google Scholar]
  • [21].Jones D. K. The effect of gradient sampling schemes on measures derived from diffusion tensor mri: a monte carlo study. Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine 51, 807–815 (2004). [DOI] [PubMed] [Google Scholar]
  • [22].Landman B. A. et al. Effects of diffusion weighting schemes on the reproducibility of dti-derived fractional anisotropy, mean diffusivity, and principal eigenvector measurements at 1.5 t. Neuroimage 36, 1123–1138 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Koay C. G. & Basser P. J. Analytically exact correction scheme for signal extraction from noisy magnitude mr signals. Journal of magnetic resonance 179, 317–322 (2006). [DOI] [PubMed] [Google Scholar]
  • [24].Chappell M. A., Groves A. R., Whitcher B. & Woolrich M. W. Variational bayesian inference for a nonlinear forward model. IEEE Transactions on Signal Processing 57, 223–236 (2008). [Google Scholar]
  • [25].Goldsmith J., Wand M. P. & Crainiceanu C. Functional regression via variational bayes. Electronic journal of statistics 5, 572 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [26].Brehmer J. Simulation-based inference in particle physics. Nature Reviews Physics 3, 305–305 (2021). [Google Scholar]
  • [27].Boelts J., Lueckmann J.-M., Gao R. & Macke J. H. Flexible and efficient simulation-based inference for models of decision-making. Elife 11, e77220 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].Cranmer K., Brehmer J. & Louppe G. The frontier of simulation-based inference. Proceedings of the National Academy of Sciences 117, 30055–30062 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [29].Tejero-Cantero A. et al. Sbi–a toolkit for simulation-based inference. arXiv preprint arXiv:2007.09114 (2020).
  • [30].Papamakarios G., Sterratt D. & Murray I. Sequential Neural Likelihood: Fast Likelihood-Free Inference with Autoregressive Flows (2019).
  • [31].Lueckmann J.-M. et al. Flexible statistical inference for mechanistic models of neural dynamics. Advances in neural information processing systems 30 (2017). [Google Scholar]
  • [32].Alsing J., Charnock T., Feeney S. & Wandelt B. Fast likelihood-free cosmology with neural density estimators and active learning. Monthly Notices of the Royal Astronomical Society 488, 4440–4458 (2019). [Google Scholar]
  • [33].Greenberg D., Nonnenmacher M. & Macke J. Automatic posterior transformation for likelihood-free inference (2019).
  • [34].Gonçalves P. J. et al. Training deep neural density estimators to identify mechanistic models of neural dynamics. Elife 9, e56261 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [35].Jallais M., Rodrigues P. L. C., Gramfort A. & Wassermann D. Inverting brain grey matter models with likelihood-free inference: a tool for trustable cytoarchitecture measurements. Journal of Machine Learning for Biomedical Imaging 1–27 (2022).
  • [36].Jallais M. & Palombo M. μguide: a framework for quantitative imaging via generalized uncertainty-driven inference using deep learning. eLife 13 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [37].de Almeida Martins J. P. et al. Neural networks for parameter estimation in microstructural mri: application to a diffusion-relaxation model of white matter. NeuroImage 244, 118601 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [38].Consagra W., Ning L. & Rathi Y. A deep learning approach to multi-fiber parameter estimation and uncertainty quantification in diffusion mri. arXiv preprint arXiv:2405.13655 (2024). [DOI] [PubMed]
  • [39].Golkov V. et al. Q-space deep learning: twelve-fold shorter and model-free diffusion mri scans. IEEE transactions on medical imaging 35, 1344–1351 (2016). [DOI] [PubMed] [Google Scholar]
  • [40].Tiwari A., Singh R. K. & Shigwan S. J. Swindti: swin transformer-based generalized fast estimation of diffusion tensor parameters from sparse data. Neural Computing and Applications 36, 3179–3196 (2024). [Google Scholar]
  • [41].Van Essen D. C. et al. The human connectome project: a data acquisition perspective. Neuroimage 62, 2222–2231 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [42].Cerdá A. C. et al. A translational mri approach to validate acute axonal damage detection as an early event in multiple sclerosis. Elife 13, e79169 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [43].Beaulieu C., Does M. D., Snyder R. E. & Allen P. S. Changes in water diffusion due to wallerian degeneration in peripheral nerve. Magnetic resonance in medicine 36, 627–631 (1996). [DOI] [PubMed] [Google Scholar]
  • [44].Dahlberg L. S., Becerra L., Borsook D. & Linnman C. Brain changes after spinal cord injury, a quantitative meta-analysis and review. Neuroscience & Biobehavioral Reviews 90, 272–293 (2018). [DOI] [PubMed] [Google Scholar]
  • [45].Hansen B. & Jespersen S. N. Data for evaluation of fast kurtosis strategies, b-value optimization and exploration of diffusion mri contrast. Scientific data 3, 1–5 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [46].Glenn G. R., Helpern J. A., Tabesh A. & Jensen J. H. Quantitative assessment of diffusional kurtosis anisotropy. NMR in Biomedicine 28, 448–459 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [47].Assaf Y., Freidlin R. Z., Rohde G. K. & Basser P. J. New modeling and experimental framework to characterize hindered and restricted water diffusion in brain white matter. Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine 52, 965–978 (2004). [DOI] [PubMed] [Google Scholar]
  • [48].Vangelderen P., DesPres D., Vanzijl P. & Moonen C. Evaluation of restricted diffusion in cylinders. phosphocreatine in rabbit leg muscle. Journal of Magnetic Resonance, Series B 103, 255–260 (1994). [DOI] [PubMed] [Google Scholar]
  • [49].De Santis S., Jones D. K. & Roebroeck A. Including diffusion time dependence in the extra-axonal space improves in vivo estimates of axonal diameter and density in human white matter. NeuroImage 130, 91–103 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [50].Sepehrband F., Alexander D. C., Kurniawan N. D., Reutens D. C. & Yang Z. Towards higher sensitivity and stability of axon diameter estimation with diffusion-weighted mri. NMR in Biomedicine 29, 293–308 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [51].Lueckmann J.-M., Boelts J., Greenberg D., Goncalves P. & Macke J. Benchmarking simulation-based inference (2021).
  • [52].Trippe B. L. & Turner R. E. Conditional density estimation with bayesian normalising flows. arXiv preprint arXiv:1802.04908 (2018).
  • [53].Bassett R. & Deride J. Maximum a posteriori estimators as a limit of bayes estimators. Mathematical Programming 174, 129–144 (2019). [Google Scholar]
  • [54].Gyori N. G., Palombo M., Clark C. A., Zhang H. & Alexander D. C. Training data distribution significantly impacts the estimation of tissue microstructure with machine learning. Magnetic resonance in medicine 87, 932–947 (2022). [DOI] [PubMed] [Google Scholar]
  • [55].Guerreri M., Epstein S., Azadbakht H. & Zhang H. Resolving quantitative mri model degeneracy with machine learning via training data distribution design (2023).
  • [56].Durkan C., Murray I. & Papamakarios G. On contrastive learning for likelihood-free inference (2020).
  • [57].Aboitiz F., Scheibel A. B., Fisher R. S. & Zaidel E. Fiber composition of the human corpus callosum. Brain research 598, 143–153 (1992). [DOI] [PubMed] [Google Scholar]
  • [58].Huang S. Y. et al. Corpus callosum axon diameter relates to cognitive impairment in multiple sclerosis. Annals of clinical and translational neurology 6, 882–892 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [59].Gudbjartsson H. & Patz S. The rician distribution of noisy mri data. Magnetic resonance in medicine 34, 910–914 (1995). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [60].Garyfallidis E. et al. Dipy, a library for the analysis of diffusion mri data. Frontiers in neuroinformatics 8, 8 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [61].Glasser M. F. et al. The minimal preprocessing pipelines for the human connectome project. Neuroimage 80, 105–124 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [62].Wang Z., Bovik A. C., Sheikh H. R. & Simoncelli E. P. Image quality assessment: from error visibility to structural similarity. IEEE transactions on image processing 13, 600–612 (2004). [DOI] [PubMed] [Google Scholar]
  • [63].Veraart J., Fieremans E. & Novikov D. S. Diffusion mri noise mapping using random matrix theory. Magnetic resonance in medicine 76, 1582–1593 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [64].European Union Clinical Trials Register. Search results for “mri”. https://www.clinicaltrialsregister.eu/ctr-search/search?query=MRI. Accessed: 2025-07-27.
  • [65].Schumann G. et al. The imagen study: reinforcement-related behaviour in normal brain function and psychopathology. Molecular Psychiatry 15, 1128–1139 (2010). See Supplementary Information for DTI protocol: six non-collinear diffusion directions (b=1000 s/mm2) + one b=0 volume. [DOI] [PubMed] [Google Scholar]
  • [66].Girard G., Whittingstall K., Deriche R. & Descoteaux M. Towards quantitative connectivity analysis: reducing tractography biases. Neuroimage 98, 266–278 (2014). [DOI] [PubMed] [Google Scholar]
  • [67].Reisert M., Kellner E., Dhital B., Hennig J. & Kiselev V. G. Disentangling micro from mesostructure by diffusion mri: a bayesian approach. Neuroimage 147, 964–975 (2017). [DOI] [PubMed] [Google Scholar]
  • [68].Benou I. & Riklin Raviv T. Deeptract: A probabilistic deep learning framework for white matter fiber tractography (2019).
  • [69].Abramian D., Eklund A. & Özarslan E. Super-resolution mapping of anisotropic tissue structure with diffusion mri and deep learning. bioRxiv 2023–04 (2023). [DOI] [PMC free article] [PubMed]
  • [70].Aliotta E., Nourzadeh H. & Patel S. H. Extracting diffusion tensor fractional anisotropy and mean diffusivity from 3-direction dwi scans using deep learning. Magnetic Resonance in Medicine 85, 845–854 (2021). [DOI] [PubMed] [Google Scholar]
  • [71].Hashemi M. et al. Amortized bayesian inference on generative dynamical network models of epilepsy using deep neural density estimators. Neural Networks 163, 178–194 (2023). [DOI] [PubMed] [Google Scholar]
  • [72].Lanzarone E., Mastropietro A., Scalco E., Vidiri A. & Rizzo G. A novel bayesian approach with conditional autoregressive specification for intravoxel incoherent motion diffusion-weighted mri. NMR in Biomedicine 33, e4201 (2020). [DOI] [PubMed] [Google Scholar]
  • [73].Vasylechko S. D., Warfield S. K., Afacan O. & Kurugol S. Self-supervised ivim dwi parameter estimation with a physics based forward model. Magnetic resonance in medicine 87, 904–914 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [74].Manzano-Patrón J. et al. Uncertainty mapping and probabilistic tractography using simulation-based inference in diffusion mri: A comparison with classical bayes. bioRxiv 2024–11 (2024). [DOI] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1
media-1.pdf (3MB, pdf)

Data Availability Statement

All data and code is available in the github repository: https://github.com/TIB-Lab/SBIDTI. Within that repository we also provide a link to the trained networks that were used in this work. The MS Data used in the preparation of this work were obtained at the Martinos Center for Biomedical Imaging at Massachusetts General Hospital (MGH) for the NIH 1R21NS123419 Project (Principal Investigator: Caterina Mainero, M.D., Ph.D.). Full MS data can be found at https://digital.csic.es/handle/10261/340684.


Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES