Model-based multi-parameter mapping

Yaël Balbastre; Mikael Brudfors; Michela Azzarito; Christian Lambert; Martina F Callaghan; John Ashburner

doi:10.1016/j.media.2021.102149

. 2021 Oct;73:102149. doi: 10.1016/j.media.2021.102149

Model-based multi-parameter mapping

Yaël Balbastre ^a,^b,^⁎,^⁎, Mikael Brudfors ^a,^c, Michela Azzarito ^d, Christian Lambert ^a, Martina F Callaghan ^a, John Ashburner ^a

PMCID: PMC8505752 PMID: 34271531

Highlights

•
Model-based estimation of quantitative multi-parameter maps.
•
Maximum-likelihood or Maximum a posteriori solutions.
•
Embedded denoising using a joint total-variation prior.
•
Stable second-order solver using a novel approximate Hessian.

Graphical abstract

Abstract

Quantitative MR imaging is increasingly favoured for its richer information content and standardised measures. However, computing quantitative parameter maps, such as those encoding longitudinal relaxation rate ( $R_{1}$ ), apparent transverse relaxation rate ( $R_{2}^{*}$ ) or magnetisation-transfer saturation (MT_sat), involves inverting a highly non-linear function. Many methods for deriving parameter maps assume perfect measurements and do not consider how noise is propagated through the estimation procedure, resulting in needlessly noisy maps. Instead, we propose a probabilistic generative (forward) model of the entire dataset, which is formulated and inverted to jointly recover (log) parameter maps with a well-defined probabilistic interpretation (e.g., maximum likelihood or maximum a posteriori). The second order optimisation we propose for model fitting achieves rapid and stable convergence thanks to a novel approximate Hessian. We demonstrate the utility of our flexible framework in the context of recovering more accurate maps from data acquired using the popular multi-parameter mapping protocol. We also show how to incorporate a joint total variation prior to further decrease the noise in the maps, noting that the probabilistic formulation allows the uncertainty on the recovered parameter maps to be estimated. Our implementation uses a PyTorch backend and benefits from GPU acceleration. It is available at https://github.com/balbasty/nitorch.

1. Introduction

The magnetic resonance imaging (MRI) signal is governed by a number of tissue-specific parameters. While many common MR sequences only aim to maximise the contrast between tissues of interest, the field of quantitative MRI (qMRI) is concerned with the extraction of the original parameters (Tofts, 2003). This interest stems from the fundamental relationship that exists between the magnetic parameters and the tissue microstructure: the longitudinal relaxation rate $R_{1} = 1 / T_{1}$ is sensitive to myelin content (Sigalovsky, Fischl, Melcher, 2006, Dick, Tierney, Lutti, Josephs, Sereno, Weiskopf, 2012, Sereno, Lutti, Weiskopf, Dick, 2013, Lutti, Dick, Sereno, Weiskopf, 2014); the apparent transverse relaxation rate $R_{2}^{*} = 1 / T_{2}^{*}$ can be used to probe iron content (Ordidge, Gorell, Deniau, Knight, Helpern, 1994, Ogg, Langston, Haacke, Steen, Taylor, 1999, Hasan, Walimuni, Kramer, Narayana, 2012, Langkammer, Krebs, Goessler, Scheurer, Ebner, Yen, Fazekas, Ropele, 2010); the magnetization-transfer saturation ratio (MT_sat) is related to protons bound to macromolecules (in contrast to free water) and offers another metric to investigate myelination (Tofts, Steens, van Buchem, 2003, Helms, Dathe, Kallenberg, Dechent, 2008b, Campbell, Leppert, Narayanan, Boudreau, Duval, Cohen-Adad, Pike, Stikov, 2018). Additionally, quantitative MRI allows many of the scanner- and centre-specific effects to be factored out, making measures more comparable across sites (Tofts, Steens, Cercignani, Admiraal-Behloul, Hofman, van Osch, Teeuwisse, Tozer, van Waesberghe, Yeung, Barker, van Buchem, 2006, Deoni, Williams, Jezzard, Suckling, Murphy, Jones, 2008b, Bauer, Jara, Killiany, 2010, Weiskopf, Suckling, Williams, Correia, Inkster, Tait, Ooi, Bullmore, Lutti, 2013).

Multi-parameter mapping

Estimating quantitative parameters involves acquiring a collection of MR images while varying the sequence parameters (repetition time, echo time, flip angle) that are known to interact with the quantitative parameters, so that the function that governs the weighted signal intensity can be inverted. This function emerges from the integration of ordinary differential equations (Bloch, 1946) and is therefore nonlinear and can be difficult to invert in closed-form. Therefore, early attempts at quantitative MRI targeted a single parameter at a time and used sequences with parameters carefully chosen so that other terms would disappear with a few algebraic manipulations (Gupta, 1977). However, as quantitative mapping moved from pure MR research to clinical applications, it became important to design sequences that allowed a maximum number of parameters to be estimated from a minimum number of acquisitions.

The multi-parameter mapping (MPM) protocol stems from this need: it uses multi-echo spoiled gradient echo (SPGR) acquisitions with variable flip angles and allows the quantification of $R_{1}$ , $R_{2}^{*}$ , MT_sat and proton density (PD) at submillimetric resolution and in a clinically acceptable scan time (Weiskopf et al., 2013). In this context, a wider range of sequence parameters can be used, at the price of rational approximations of the signal (Helms et al., 2008b). Images acquired with a SPGR sequence are parameterised by three parameters (PD, $R_{1}$ , $R_{2}^{*}$ ) plus an eventual fourth one if MT weighting is used. The ESTATICS forward model reparameterises the SPGR signal equation using one intercept per contrast (i.e., weighted volumes extrapolated to the theoretical zero echo time) and a shared $R_{2}^{*}$ decay (Weiskopf, Callaghan, Josephs, Lutti, Mohammadi, 2014, Mohammadi, D’Alonzo, Ruthotto, Polzehl, Ellerbrock, Callaghan, Weiskopf, Tabelow, 2017). While ESTATICS provides a model-based estimation of $R_{2}^{*}$ , the intercepts remain weighted images and another model must be inverted to recover the quantitative parameter maps: $R_{1}$ , PD and MT_sat.¹ While this inversion has an analytical solution when all contrasts have the same repetition time (Wang, Riederer, Lee, 1987, Liberman, Louzoun, Bashat, 2014, Mohammadi, D’Alonzo, Ruthotto, Polzehl, Ellerbrock, Callaghan, Weiskopf, Tabelow, 2017), rational approximations must be used as soon as different repetition times are used (Helms et al., 2008a).

Non-linear fitting in MR relaxometry

Many quantitative mapping methods discard the probabilistic nature of the acquired signal, which randomly deviates from its theoretical value due to thermal and physiological noise. This noise propagates through every computational step and ends up in the maps with a non-trivial distribution (Polders, Leemans, Luijten, Hoogduin, 2012, Cercignani, Alexander, 2006) and can even induce bias in the estimators (Sijbers, Dekker, Raman, Dyck, 1999, Chang, Koay, Basser, Pierpaoli, 2008). An alternative is to estimate well defined probabilistic quantities conditioned on the observed data, such as posterior distributions or maximum-likelihood estimators (Hurley, Yarnykh, Johnson, Field, Alexander, Samsonov, 2012, Tisdall, van der Kouwe, 2016) – although they can still be biased (Tisdall, 2017). The use of qMRI estimators that directly optimise a probability distribution have remained elusive, in part due to the non-linear and non-convex nature of the corresponding log-likelihood. Still, several groups have proposed to fit quantitative maps using iterative methods, either from magnitude images (Hurley, Yarnykh, Johnson, Field, Alexander, Samsonov, 2012, Tisdall, 2017) or directly from k-space measurements (Chen, Prato, McKenzie, 1998, Sumpf, Uecker, Boretius, Frahm, 2011, Sumpf, Petrovic, Uecker, Knoll, Frahm, 2014, Zhao, Lam, Liang, 2014). In order to solve the non-linear system, Tisdall (2017) used Brent’s method, which does not require derivatives; Chen et al. (1998) used a Levenberg–Marquardt algorithm, Sumpf, Uecker, Boretius, Frahm, 2011, Sumpf, Petrovic, Uecker, Knoll, Frahm, 2014 used a conjugate gradient algorithm, while Zhao et al. (2014) used a quasi-Newton Broyden–Flecher–Goldfarb–Shannon (BFGS) optimiser, which is not ensured to converge in general (Mascarenhas, 2014) and is highly sensitive to initialisation. Scholand et al. (2020) pushed this idea further by modelling the entire Bloch equations and spatial encoding. In their work, the problem is solved using Gauss-Newton optimisation (which corresponds to Fisher’s scoring of the Hessian). However, for this much more complex inverse-problem to be well-posed, spiral trajectories are needed so that each shot acquires as many spatial frequencies as possible. In multi-exponential $T_{2}$ mapping, the most common procedure involves non-negative least-squares fits (Whittall and MacKay, 1989), which can be very sensitive to noise and optimisation parameters (Wiggermann et al., 2020). The use of non-linear least-squares algorithms is also common and revolves around (constrained) Gauss-Newton or Levenberg-Marquardt (Milford, Rosbach, Bendszus, Heiland, 2015, Gong, Tong, He, Sun, Zhong, Zhang, 2020).

Denoising

The push to higher resolutions and the use of parallel imaging reduces the signal-to-noise ratio (SNR) of the acquired images and consequently the precision of the computed parameter maps. Papp et al. (2016) found a scan-rescan root mean squared error of about 7.5% for $R_{1}$ at 1mm, in the absence of inter-scan movement. Smoothing can be used to improve SNR, but at the cost of lower spatial specificity. In this context, providing mapping methods that are robust to high levels of noise, both in terms of bias and variance, is of major importance.

Denoising methods aim to separate signal from noise. They take advantage of the fact that signal and noise have intrinsically different spatial profiles: thermal noise is spatially independent and often has a characteristic distribution while the signal is highly structured. Denoising methods originate from partial differential equations, adaptive filtering, variational optimisation or Markov random fields, and many connections exist between them. Two main families emerge:

1.
Inversion of a generative model²:
$\hat{Y} = \arg \min_{Y} E_{l} (X, F (Y)) + E_{p} (G (Y)),$
where $X$ is the observed data, $Y$ is the unknown noise-free data, $F$ is an arbitrary forward transformation (e.g., spatial transformation, downsampling, smoothing) mapping from the reconstructed to the observed data and $G$ is a linear transformation (e.g., spatial gradients, Fourier transform, wavelet transform) that extracts features of interest from the reconstruction. $E_{l}$ and $E_{p}$ are two log-probability distributions that correspond to the likelihood of the parameters and their prior.
2.
Application of an adaptive nonlocal filter:
${\hat{Y}}_{i} = \sum_{j \in N_{i}} w (P_{i} (X), P_{j} (X)) X_{j},$
where the reconstruction of a given voxel $i$ is a weighted average over all observed voxels $j$ in a given (possibly infinite) neighbourhood $N_{i}$ , with weights reflecting similarity between patches $P$ centred about these voxels. Convolutional neural networks naturally extend this family by stacking many such filters and making their weights learnable rather than fixed.

For the first family of methods, it was found that the denoising effect is stronger when $E_{p}$ is an absolute norm (or sum of), rather than a squared norm, because the solution is implicitly sparse in the feature domain (Bach, 2011). This family of methods includes total variation (TV) regularisation (Rudin et al., 1992) and wavelet soft-thresholding (Donoho, 1995). The second family also leverages sparsity in the form of redundancy in the spatial domain; that is, the dictionary of patches necessary to reconstruct the noise-free images is smaller than the actual number of patches in the image. Several such methods have been developed specifically for MRI, with the aim of finding an optimal, voxel-wise weighting based on the noise distribution (Coupe, Yger, Prima, Hellier, Kervrann, Barillot, 2008, Manjón, Coupé, Martí-Bonmatí, Collins, Robles, 2010, Coupé, Manjón, Robles, Collins, 2012, Manjón, Coupé, Buades, Louis Collins, Robles, 2012).

Optimisation methods can naturally be interpreted as a maximum a posteriori (MAP) solution in a generative model, which eases their interpretation and extension. This feature is especially important in qMRI, where we possess a well-defined (nonlinear) forward function and log-likelihood, and wish to regularise a small number of maps. Joint total variation (JTV) (Sapiro and Ringach, 1996) is an extension of TV to multi-contrast images, where the absolute norm is defined across contrasts, introducing an implicit correlation between them. TV and JTV have been used before in MR reconstruction, e.g., in compressed-sensing (Huang et al., 2012), quantitative susceptibility mapping (Liu et al., 2011) and super-resolution (Brudfors et al., 2018). Recently, Varadarajan et al. (2020) applied JTV to multi-echo SPGR data for $B_{0}$ distortion correction, with the joint prior defined across echoes. However, these problems are linear and (therefore) convex, while the MR signal equations are not. This non-convexity has limited the use of iterative methods in this context, as algorithms either have a slow convergence – in the case of first order methods – or are not ensured to converge – in the case of second order methods.

Non-linear MPM fitting

In Balbastre et al. (2020), we introduced a non-linear, spatially regularised version of ESTATICS that was fit using Gauss-Newton optimisation. However, since regularisation did not apply directly to the parameter maps but to these weighted intercepts, the denoising of the quantitative parameters was merely a by-product of the denoising of the intercepts. In this work, we instead propose a generic framework that directly finds MAP quantitative (log) parameters from all SPGR data at hand. To this end, we extend our optimisation framework to work directly with the SPGR signal equation, and jointly regularise the log-parameter maps. Working with log-parameters $\log PD$ , $\log R_{1}$ , $\log R_{2}^{*}$ , $logit {MT}_{sat}$ ) is advantageous as their spatial gradients are independent from the choice of unit and representation (time, in $s$ or $m s$ , or rate, in $s^{- 1}$ or ${ms}^{- 1}$ ), although it introduces an additional degree of nonlinearity. Another advantage is that posterior distributions are often better approximated by Laplace’s method when a log basis is used (MacKay, 1998). We name this extended model SPGR+JTV when it uses regularisation and SPGR-ML when it does not. In Gauss–Newton, positive-definiteness of the Hessian is enforced by the use of Fisher’s scoring, which discards terms that involve second derivatives of the SPGR signal function. In this new work, we found that a more stable algorithm can be derived by keeping the (absolute) diagonal of this second-order term. We illustrate the improved convergence on a toy problem, and experimentally show that it generalises to least-squares ESTATICS and SPGR. Additionally, our implementation uses a quadratic upper bound of the JTV functional, allowing the surrogate problem to be efficiently solved using second-order optimisation.

We validated our method on a unique dataset – five repeats of the MPM protocol acquired, within a single session, on a healthy subject – by leave-one-out cross-validation: maps were estimated on one repeat and used to predict the weighted images from the other repeats. Our methods (SPGR+JTV and SPGR-ML) were compared to algorithms that rely on different flavours of the ESTATICS model: LOGLIN uses a log-linear fit to recover the intercepts and decay (Weiskopf et al., 2014), whereas NONLIN, and NONLIN+JTV use a (regularised) non-linear fit (Balbastre et al., 2020). We also applied ESTATICS to echoes that had been previously denoised using the adaptive optimized nonlocal means (AONLM) method (Manjón et al., 2010), as a baseline for the denoising component of our method. Quantitative maps were then computed analytically from the intercepts obtained from all ESTATICS methods. SPGR+JTV consistently outperformed all baselines. Besides, we probed the effect of regularisation by reconstructing maps with a varying number of echoes, and explored the use of our generative model to estimate uncertainty about the inferred quantitative values.

2. Theory

2.1. Forward model: spoiled gradient echo signal equation

The steady state signal intensity of a spoiled gradient echo acquired with flip angle $α$ , repetition time TR and echo time TE is³

S = A \sin α \frac{1 - \exp (- R_{1} TR)}{1 - \cos α \exp (- R_{1} TR)} \exp (- R_{2}^{*} TE)

(1)

where $A$ is proportional to the proton density (with a multiplicative factor related to the sensitivity of the receiver coil).

Multi-echo techniques acquire multiple echoes, allowing the $R_{2}^{*}$ decay rate to be mapped, while variable flip-angle (VFA) techniques acquire images with multiple flip-angles to map the $R_{1}$ relaxation rate. Multi-parameter mapping protocols combine these two techniques in a single imaging session.

In addition, a similar sequence can be used to measure the magnetisation transfer saturation: in a two-pool model, protons are separated between the free pool – protons in free water that can diffuse freely, which cause the MR signal – and bound pool – non-acqueous protons, e.g., in proteins and lipids of the myelin sheath encasing axons. Following selective saturation of the bound pool via an off-resonance pulse, the process of magnetisation transfer leads to partial saturation of the observable free pool. By acquiring images with and without this off-resonance pulse, the proportion of saturated protons can be measured, giving a surrogate measure of the bound pool. The resulting signal (which we write at TE=0 for brevity) can be modelled by adding an extra excitation pulse, with repetition time TR $_{2}$ and flip angle $α_{2}$ (Helms et al., 2008b):

S_{TE = 0} = A \sin α \frac{1 - \cos α_{2} e^{- R_{1} TR} - (1 - \cos α_{2}) e^{- R_{1} {TR}_{2}}}{1 - \cos α \cos α_{2} e^{- R_{1} TR}}

(2)

Here, $α_{2}$ is unknown (it depends on the degree of magnetisation transfer saturation) and, following Helms et al. (2008b), we name $δ = 1 - \cos α_{2}$ the MT saturation. Furthermore, we assume that TR $_{2} = 0$ , and the signal equation simplifies to:

S_{TE = 0} = A \sin α \frac{(1 - δ) (1 - \exp (- R_{1} TR))}{1 - (1 - δ) \cos α \exp (- R_{1} TR)}

(3)

In practice, the MR environment is not perfect and multiple phenomena cause deviations from this theoretical model and introduce artefacts:

•
The excitation field, $B_{1}^{+}$ , is not perfectly homogeneous, causing the effective flip-angle to be location-dependent and to deviate from its nominal value. Multiple techniques allow this field to be measured (Jiru, Klose, 2006, Yarnykh, 2007, Sacolick, Wiesinger, Hancu, Vogel, 2010).
•
The receive field, $B_{1}^{-}$ , suffers from the same issue, especially when high-density phased-array coils are used. This is particularly problematic when between-scan motion happens, as the receive field modulation then has a different value in the different contrasts (Papp et al., 2016).
•
The main magnetic field, $B_{0}$ , is not perfectly flat either, due to changes in magnetic susceptibility at air-tissue interfaces and imperfect shimming. As odd and even echoes are acquired with a readout of inverse polarity, between-echo distorsion is possible in regions where field inhomogeneity is high (Varadarajan et al., 2020).
•
Even with the use of gradient and RF spoiling, the assumption that no transverse magnetisation persists across TRs is rarely met and can lead to additional dependencies, on both tissue properties and hardware, in the $R_{1}$ estimates (Corbin and Callaghan, 2021, Preibisch, Deichmann, 2009).

2.2. Noise model: Gaussian

The measured signal is hampered by thermal noise, which is uniform and Gaussian in each complex coil image and becomes Rician or non-central Chi distributed in the sum-of-square magnitude image. In the high SNR regime, this noise is approximately Gaussian, and we make that approximation in this work. This means that the fitting procedure – which consists of inferring the unknown parameters from noisy measurements – is effectively a non-linear least squares problem.

2.3. Optimisation: non-linear least squares

A non-linear least square problem is one of the form:

\min_{y} {ℓ (y) : = \frac{1}{2} \sum_{i} {(f_{i} (y) - x_{i})}^{2}},

(4)

where all $f_{i}$ are non-linear functions. In general, there is no closed-form solution (when there is a solution at all), and the problem can be non-convex. Therefore, carefully crafted iterative methods must be used. A method of choice is Gauss–Newton optimisation. Gauss–Newton emerges from Newton’s method, that is, from a second order Taylor expansion of the objective function. Let us write the gradient of function $f$ in $y$ as $g_{f} (y)$ and its Hessian as $H_{f} (y)$ . The Taylor expansion of Eq. (4) about $y_{0}$ gives:

ℓ (y) \approx ℓ (y_{0}) + {(y - y_{0})}^{T} g_{ℓ} (y_{0}) + \frac{1}{2} {(y - y_{0})}^{T} H_{ℓ} (y_{0}) (y - y_{0})

(5)

g_{ℓ} (y_{0}) = \sum_{i} (f_{i} (y_{0}) - x_{i}) g_{f_{i}} (y_{0})

(6)

H_{ℓ} (y_{0}) = \sum_{i} g_{f_{i}} (y_{0}) g_{f_{i}} {(y_{0})}^{T} + (f_{i} (y_{0}) - x_{i}) H_{f_{i}} (y_{0})

(7)

Under the assumption that $H_{ℓ} (y_{0})$ is positive-definite, Newton’s method solves the surrogate quadratic problem by finding its stationary point:

y_{1} = y_{0} - H_{ℓ} {(y_{0})}^{- 1} g_{ℓ} (y_{0})

(8)

However, as previously stated, the problem is not necessarily convex and therefore the Hessian not necessarily positive-definite. Gauss-Newton circumvents this issue using Fisher’s scoring, i.e., by assuming – in the Hessian – that residuals are all zero:

P_{ℓ} (y_{0}) = \sum_{i} g_{f_{i}} (y_{0}) g_{f_{i}} {(y_{0})}^{T}

(9)

We denote this alternative Hessian $P$ for preconditioner. The resulting quantity is (under mild conditions) positive-definite, and the surrogate quadratic problem now has a minimum. When it converges, the Gauss-Newton iteration converges to a stationary point of the objective function. However, convergence is not guaranteed (Mascarenhas, 2014). In the following subsections, we introduce a sufficient condition for monotonic convergence in Newton-like methods and show simple examples related to MR relaxometry where Gauss-Newton does not converge monotonically or even diverges. We then introduce a new preconditioner that is shown (experimentally) to enforce the condition for monotonic convergence. This preconditioner will be used to optimise the non-linear SPGR least square problem.

A sufficient condition for monotonic convergence

Let us focus on the simpler 1D case. Let $ℓ : R \to R$ be a twice-differentiable non-linear function over the real line with a single local (and therefore global) minimum $y^{*}$ and no local maximum⁴. This function does not need to be convex but, by construction, its first derivative $g : R \to R$ is negative everywhere on the left of its optimum and positive everywhere on its right. Let $h : R \to R_{*}^{+}$ be a strictly positive function that we will use as a preconditioner (e.g., the absolute value of the curvature). Applying a Newton-like step at point $y_{n}$ gives an update of the form $y_{n + 1} = y_{n} - g (y_{n}) / h (y_{n})$ . Since $h$ is positive, the step is ensured to be in the direction of the optimum; a sufficient condition for this step to improve the objective function ( $ℓ (y_{n + 1}) < ℓ (y_{n})$ ) is for the updated point to fall between $y_{n}$ and the optimum $y^{*}$ . Finally, this condition can be formalised as

\frac{| g (y_{n}) |}{h (y_{n})} ⩽ | y_{n} - y^{*} |

(10)

While this condition seems difficult to enforce or check, note that it is always true at the optimum, where $g (y^{*}) = 0$ . Let us write $r (y) = g (y) / h (y)$ and add the condition that $h$ is differentiable; then condition (10) is enforced if the (absolute value of the) derivative of $r$ is everywhere equal or below 1. On the right (positive) side, $r^{'} (y) ⩽ 1$ on $[y^{*}, y_{n}]$ implies

| r (y_{n}) | = r (y_{n}) = \int_{y^{*}}^{y_{n}} r^{'} (y) d y ⩽ \int_{y^{*}}^{y_{n}} 1 d y = y_{n} - y^{*} = | y_{n} - y^{*} |

The same thing can be shown on the left (negative) side.

In the multivariate case, we want the update to stay in the same “quadrant” (that is, all coordinates fall between their original value and the optimum), which we formalise as:

| H {(y_{n})}^{- 1} g (y_{n}) | ⩽ | y_{n} - y^{*} |

(11)

where $| \cdot |$ is the element-wise absolute value. However, we can apply any arbitrary rotation to the problem and this condition stays true. We can therefore relax it to:

∥ H {(y_{n})}^{- 1} g (y_{n}) ∥ ⩽ ∥ y_{n} - y^{*} ∥

(12)

where $∥ \cdot ∥$ is the Euclidean norm.

Toy problem 1: exponential least square

Let us now discuss two toy problems that are closely related to MR relaxometry. The first one consists of finding the least square solution when the non-linear function is an exponential and there is a single data point:

ℓ (y) = \frac{1}{2} {(e^{- y} - x)}^{2}

(13)

The corresponding energy landscape, shown as a solid black line in Fig. 1, is non-convex on its left. When $x > 0$ , there is an obvious analytical solution ( $y^{*} = - \log (x)$ ), but we will use an iterative method to get to the same conclusion. Using the same notations as before, we have:

g_{ℓ} (y) = - (e^{- y} - x) e^{- y}

(14)

h_{ℓ} (y) = e^{- 2 y} + (e^{- y} - x) e^{- y}

(15)

p_{ℓ} (y) = e^{- 2 y}

(16)

If we compute the derivative of $r_{ℓ} (y) = g_{ℓ} (y) / p_{ℓ} (y)$ , we find

r_{ℓ}^{'} (y) = - x e^{y}

(17)

whose absolute value is not bounded by 1. We therefore introduce an alternative pre-conditioner that uses the absolute value of the second term rather than discarding it:

{\tilde{p}}_{ℓ} (y) = e^{- 2 y} + | e^{- y} - x | e^{- y}

(18)

Although we do not prove it, we find experimentally that this preconditioner enforces condition (10). The corresponding quadratic approximations are plotted in red (Gauss–Newton) and blue (proposed preconditioner) in Fig. 1. Experimental convergence with both preconditioners is also shown. Conversely to Gauss-Newton, the proposed version is monotonic when attacking from the non-convex side and converges almost as fast as Gauss-Newton when attacking from the convex side.

Fig. 1 — Qualitative analysis of two toy problems related to MR relaxometry. The first toy problem (first row) has an objective function of the form ${(x - \exp (- y))}^{2}$ , while the other toy problem (second row) as one of the form ${(x - \exp (- \exp (y)))}^{2}$ . In each case, the first column shows the objective function in black along with quadratic approximations corresponding to Gauss-Newton (red) and the proposed second order method (blue). The second column shows the magnitude of the gradient-hessian ratio. The black line corresponds to a purely quadratic objective function, whose optimum is found in one step ( $| g (x) | / h (x) = | x - x^{*} |$ ). When the ratio is under this black line, monotonic convergence is ensured. The third column shows convergence of the Gauss-Newton (red) and proposed (blue) iterative schemes when attacking from either a convex or non-convex lobe, with the optimum shown in dashed black. In each case, convergence is shown both in terms of objective value per iteration and parameter value per iteration. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Toy problem 2: nested exponential least square

Our second toy problem involves nested exponentials:

ℓ (y) = \frac{1}{2} {(x - e^{- e^{y}})}^{2} .

(19)

The corresponding energy landscape is shown in Fig. 1: its curvature now tends towards zero on both sides of the real line. Its gradient, Hessian and preconditioners are:

g_{ℓ} (y_{0}) = - e^{y_{0} - e^{y_{0}}} (e^{- e^{y_{0}}} - x)

(20)

h_{ℓ} (y_{0}) = {(e^{y_{0} - e^{y_{0}}})}^{2} + (e^{2 y_{0}} - e^{y_{0}}) e^{- e^{y_{0}}} (e^{y_{0}} - x)

(21)

p_{ℓ} (y_{0}) = {(e^{y_{0} - e^{y_{0}}})}^{2}

(22)

{\tilde{p}}_{ℓ} (y_{0}) = {(e^{y_{0} - e^{y_{0}}})}^{2} + | (e^{2 y_{0}} - e^{y_{0}}) e^{- e^{y_{0}}} (e^{y_{0}} - x) |

(23)

Again, $| r_{ℓ}^{'} |$ is not bounded, and the Gauss-Newton preconditioner fails to enforce the convergence condition. Experimental convergence is shown in Fig. 1, where the proposed method is shown to monotonically converge while Gauss-Newton can diverge when starting in a non-convex section of the energy landscape.

Generalisation to the multivariate case

The toy problems investigated in the previous section are highly relevant to MR relaxometry: the first case is related to the exponential fit of the signal decay, where each data point is seen as the exponential of a linear combination of parameters, while the second case is an extension of the first, where positive parameters are encoded by their log. Given the apparent robustness of the proposed preconditioner on these toy problems, we propose two extensions to the general multivariate case by replacing the absolute value of the scalar Hessian in the one-dimensional case with either: (1) the absolute value of the diagonal of the Hessian, $| H_{f_{i}} (y_{0}) | ⊙ I$ ; (2) the sum of the absolute value of the Hessian across columns, $diag (| H_{f_{i}} (y_{0}) | 1)$ . Only the second one is ensured to majorise the true Hessian (Chun and Fessler, 2018), but we show in our experiments that the first one yields a stable convergence in the MPM context. We therefore use the preconditioner:

{\tilde{P}}_{ℓ} (y_{0}) = \sum_{i} g_{f_{i}} (y_{0}) g_{f_{i}} {(y_{0})}^{T} + | f_{i} (y_{0}) - x_{i} | | H_{f_{i}} (y_{0}) | ⊙ I .

(24)

The impact of this loading is illustrated with a pair of two-dimensional toy problems that are closely related to the one-dimensional problems presented before. The first problem involves an objective function of the form $ℓ (y, z) = \frac{1}{2} {(x_{0} - e^{z - t_{0} y})}^{2} + \frac{1}{2} {(x_{1} - e^{z - t_{1} y})}^{2}$ . In the second problem, the decay parameter is encoded by its log: $ℓ (y, z) = \frac{1}{2} {(x_{0} - e^{z - t_{0} e^{y}})}^{2} + \frac{1}{2} {(x_{1} - e^{z - t_{1} e^{y}})}^{2}$ . Step sizes and optimisation trajectories are depicted in Fig. 2. When the Gauss-Newton preconditioner is used, a vast part of the parameter space has very large step sizes that lead to overshooting. When the proposed preconditioners are used, the entire parameter space (or most of it) lies below the monotonic convergence threshold.

Fig. 2 — Qualitative analysis of two-dimensional toy problems related to MR relaxometry. The first toy problem (top half) has an objective function of the form $ℓ (y, z) = {(x_{0} - e^{z - t_{0} y})}^{2} + {(x_{1} - e^{z - t_{1} y})}^{2}$ , while the other toy problem (bottom half) has one of the form $ℓ (y, z) = {(x_{0} - e^{z - t_{0} e^{y}})}^{2} + {(x_{1} - e^{z - t_{1} e^{y}})}^{2}$ . In each case, the first column shows the objective function along each of the two dimensions (in black) along with quadratic approximations corresponding to Gauss-Newton (red) and the proposed second order methods (blue and green). The second column shows the normalised step size for each method, color-coded such that values below one are blue and values above one are red. The optimum is marked by a red star. Convergence paths from two random initial coordinates (solid and dashed curves) are overlaid. In the second toy problem, two different loading matrices are used: *Proposed* uses the diagonal of the absolute Hessian $diag (| H |)$ whereas *Proposed+* uses its sum across columns $| H | 1$ . (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

2.4. Likelihood: spoiled gradient echo

The main contribution of this paper lies in the direct optimisation of $A$ , $R_{1}$ $R_{2}^{*}$ and $δ$ . First, each parameter is encoded by its log (or logit in the case of $δ$ ). Let us switch from a physics to a mathematical notation, where scalars and vectors are written in lowercase while capital letters are reserved for matrices and tensors. We use a tilde to differentiate the log-parameters from their exponentiated version, so that in each voxel:

\begin{matrix} a & = \exp (\tilde{a}), r_{1} = \exp ({\tilde{r}}_{1}), \\ r_{2} & = \exp ({\tilde{r}}_{2}), δ = \frac{1}{1 + \exp (- \tilde{δ})} \end{matrix}

Let us assume that all images are defined on the same grid with $N$ voxels. We write the set of log-parameters as $Y = [\tilde{a}, {\tilde{r}}_{1}, {\tilde{r}}_{2}, \tilde{δ}] \in R^{N \times 4}$ and the set of observed 3D images as $X \in R^{N \times I}$ . Observed images come with a set of fixed parameters $Θ \in R^{4 \times I}$ , where each quadruplet $θ_{i} = (α_{i}, t_{r i}, t_{e i}, σ_{i}^{2}) \in R^{4}$ contains the flip angle, repetition time, echo time and noise variance of an image. The likelihood term of the objective function involves a sum across images followed by a sum across voxels:

ℓ (Y) = \sum_{i} \sum_{n} \frac{1}{2 σ_{i}^{2}} {(x_{i n} - s (y_{n} ∣ θ_{i}))}^{2}

(25)

where $s$ is the intensity of a spoiled gradient echo⁵:

s (\tilde{a}, {\tilde{r}}_{1}, {\tilde{r}}_{2}, \tilde{δ} ∣ α, t_{r}, t_{e}) = a \sin α \frac{(1 - δ) (1 - e^{- r_{1} t_{r}})}{1 - (1 - δ) \cos α e^{- r_{1} t_{r}}} e^{- r_{2} t_{e}}

(26)

For contrasts that do not include an MT pulse, $δ = 0$ and gradients with respect to $\tilde{δ}$ are zero. We recognise a non-linear least squares problem, and propose to use the preconditioner from equation (24), which depends on the gradient and Hessian of $s$ . Let us focus on a single voxel and drop the $(i, n)$ subscripts for clarity. After differentiation, the gradient is:

\frac{\partial s}{\partial \tilde{a}} = s

(27)

\frac{\partial s}{\partial {\tilde{r}}_{1}} = - t_{r} r_{1} \frac{((1 - δ) \cos α - 1) e^{- r_{1} t_{r}}}{(1 - (1 - δ) \cos α e^{- r_{1} t_{r}}) (1 - e^{- r_{1} t_{r}})} s

(28)

\frac{\partial s}{\partial {\tilde{r}}_{2}} = - t_{e} r_{2} s

(29)

\frac{\partial s}{\partial \tilde{δ}} = \frac{δ}{(1 - δ) \cos α e^{- r_{1} t_{r}} - 1} s

(30)

and the Hessian is:

\frac{\partial^{2} s}{\partial {\tilde{a}}^{2}} = \frac{\partial s}{\partial \tilde{a}}

(31)

\frac{\partial^{2} s}{\partial {\tilde{r}}_{1}^{2}} = (1 - t_{r} r_{1} \frac{1 + (1 - δ) \cos α e^{- r_{1} t_{r}}}{1 - (1 - δ) \cos α e^{- r_{1} t_{r}}}) \frac{\partial s}{\partial {\tilde{r}}_{1}}

(32)

\frac{\partial^{2} s}{\partial {\tilde{r}}_{2}^{2}} = (1 - t_{e} r_{2}) \frac{\partial s}{\partial {\tilde{r}}_{2}}

(33)

\frac{\partial^{2} s}{\partial {\tilde{δ}}^{2}} = (\frac{2 (1 - δ) (1 - \cos α e^{- r_{1} t_{r}})}{1 - (1 - δ) \cos α e^{- r_{1} t_{r}}} - 1) \frac{\partial s}{\partial \tilde{δ}}

(34)

The gradient $g_{ℓ}$ and preconditioner ${\tilde{P}}_{ℓ}$ of the least-square objective function are then computed as in equations (6) and (24) and summed across echo times and contrasts. The log-likelihood of one observation with respect to each parameter – keeping the others fixed – is shown in Fig. 3, along with the step size function, which is consistently under the monotonic convergence bound. Although this figure only shows the step size function for one given set of parameters, the convergence property is consistently enforced when sampling random parameters and observed values.

Fig. 3 — Negative log-likelihood and step size function of the nonlinear SPGR model. The first column shows the log-likelihood of a single observation with respect to each log-parameter, while keeping the others fixed. The second column shows the step size function (red) obtained with the proposed approximate Hessian, compared with the theoretical perfect step size function $f (x) = | x - x^{*} |$ . As long as the actual step size is below this theoretical function, monotonic convergence is ensured. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Spatial projection

In practice, it is common for people to move their head between scans, making inter-scan registration a mandatory initial step. While reslicing all scans in the same space is a possibility, we prefer to keep the raw data untouched and integrate the spatial projection from the reconstruction space to the different acquisition spaces in the forward model, modifying slightly the objective function. Let us write all log parameter maps as $y = vec (Y) \in R^{N K}$ , where $N$ is the number of voxels in reconstruction space and $K$ the number of maps, and $Ψ \in R^{M K \times N K}$ a matrix that encodes linear reslicing from reconstruction space to one acquisition space⁶ The new objective function is $ℓ_{ψ} (y) = ℓ (Ψ y)$ . Application of the chain rule yields:

\frac{\partial ℓ_{ψ}}{\partial y} = Ψ^{T} (\frac{\partial ℓ}{\partial y} (Ψ y))

(35)

\frac{\partial^{2} ℓ_{ψ}}{\partial y \partial y^{T}} = Ψ^{T} (\frac{\partial^{2} ℓ}{\partial y \partial y^{T}} (Ψ y)) Ψ

(36)

However, this new Hessian is much less sparse: it cannot be seen as block diagonal anymore. It was shown in Ashburner et al. (2018) that this matrix can be majorised (in the Löwner order sense) by a block-diagonal matrix:

\frac{\partial^{2} ℓ_{ψ}}{\partial y \partial y^{T}} ⪯ diag (Ψ^{T} (\frac{\partial^{2} ℓ}{\partial y \partial y^{T}} (Ψ y)))

(37)

In other words, for each contrast, the log parameter maps are resliced (pulled) to the acquisition space, where the gradient and Hessian of the least square function are computed and sent back to recon space using the adjoint of the reslicing operation (pushed).

2.5. Prior: joint total variation

Multiple types of spatial regularisation exist, with different properties. Here, we focus on joint total-variation, equivalent to an $ℓ_{1, 2}$ norm over spatial gradients, which benefits from the same sparsity-inducing properties as other $ℓ_{1}$ norms, except that sparsification is shared across contrasts. This property is particularly interesting when the same object is seen through different contrasts, as is the case in the context of MPMs, since edges are shared across parameter maps. Again, let $Y = [\tilde{a}, {\tilde{r}}_{1}, {\tilde{r}}_{2}, \tilde{δ}] \in R^{N \times 4}$ be the set of all parameter maps, the regulariser has the form:

JTV (Y) = \sum_{n = 1}^{N} \sqrt{\sum_{k = 1}^{4} λ_{k} {(G_{n} y_{k})}^{T} (G_{n} y_{k})}

(38)

where $G_{n}$ extracts all 6 forward and backward finite-differences (in 3D) about the $n$ th voxel and $λ_{k}$ is a parameter-specific regularisation factor.

This regulariser is convex but non-smooth, complicating its use in optimisation. However, several methods have been developed over the years to solve this problem; e.g., ISTA (Daubechies et al., 2004), FISTA (Beck and Teboulle, 2009), ADMM (Boyd et al., 2011) and variations thereof. In this paper, we use an iteratively reweighted least squares (IRLS) scheme (Daubechies, DeVore, Fornasier, Güntürk, 2010, Bach, 2011), which relies on the bound:

\sqrt{\sum_{k} λ_{k} {(G_{n} y_{k})}^{T} (G_{n} y_{k})} = \min_{w_{n} > 0} {\frac{1}{2 w_{n}} + \frac{w_{n}}{2} \sum_{k} λ_{k} {(G_{n} y_{k})}^{T} (G_{n} y_{k})} .

(39)

When the weight map $w$ is fixed, this bound can be seen as a Tikhonov (i.e., $ℓ_{2}$ ) prior with nonstationary regularisation, which is a quadratic prior that factorises across parameters. Therefore, the between-parameters correlations induced by the JTV prior are entirely captured by the weights. Conversely, when the parameter maps are fixed, the weights can be updated in closed-form:

w_{n} = {(\sum_{k} λ_{k} {(G_{n} y_{k})}^{T} (G_{n} y_{k}))}^{- \frac{1}{2}}

(40)

Let us now vectorise all log-parameter maps: $y = vec (Y)$ . The quadratic term in Eq. (39) – when summed over all voxels – can be written as $y^{T} L y$ with $L = G^{T} W G \otimes diag (λ)$ , where $G \in R^{6 N \times N}$ extracts all 6 forward and backward finite-differences about all voxels and $W = diag (w) \otimes I_{6}$ . Note further that $G^{T} G$ is Toeplitz and can be implemented as a convolution by a small kernel (Ashburner, 2007). Similarly, $diag (G^{T} W G)$ can be obtained by convolving the weight map $w$ with a small “cross” kernel.

2.6. Optimisation: regularised spoiled gradient echo

Optimisation follows an IRLS scheme, where each iteration consists of alternately solving for the parameter maps $y$ and the JTV weights $w$ . At each iteration, the objective function in $y$ is of the form:

ℓ (y) = \sum_{c, e} \frac{1}{2 σ_{c}^{2}} {∥ x_{c e} - s_{c e} (y) ∥}^{2} + \frac{1}{2} y^{T} L y

(41)

where $c$ and $e$ are contrasts and echoes. This function is non-linear and solved using the proposed second order method, from which we get a gradient $g \in R^{4 N}$ and an approximate Hessian $H \in R^{4 N \times 4 N}$ . This matrix is actually block diagonal as it consists of $N$ small $4 \times 4$ matrices. Performing the Newton update step involves solving the linear system

{(H + L)}^{- 1} (g + L y)

(42)

which is done using the preconditioned conjugate gradient method. The preconditioner that we use is

H + (G^{T} W G ⊙ I) \otimes diag (λ)

2.7. Implementation

In practice, 10 IRLS iterations were used, with 5 inner Newton iterations and 32 conjugate gradient iterations per Newton step. A tolerance threshold on the gain between two subsequent iterations was set for early stopping (IRLS: $10^{- 5}$ , Newton: $10^{- 5}$ , CG: $10^{- 3}$ ). The variance of the thermal noise was estimated by fitting a two-class Rician mixture to each acquired volume (Ashburner and Ridgway, 2013); the class with lowest mean was considered as the background. Variances were then averaged across echo times using the geometric mean. Initial (constant) parameter maps were estimated using the mean of the other (non-background) class in each volume.

This algorithm, along with baseline methods, is implemented in NiTorch, a Python library with a PyTorch backend dedicated to neuroimaging. It is publicly available at https://github.com/balbasty/nitorch. The processing of a 0.8 mm MPM dataset – from which $A$ , $R_{1}$ , $R_{2}^{*}$ and MT_sat can be estimated – fits in 12 GB of memory and takes about 10 minutes on a Nvidia Titan V GPU for SPGR+JTV (5 minutes for NONLIN+JTV, $<$ 1 minute for LOGLIN).

3. Experiments

3.1. Convergence analysis

This experiment was based on simulated values. One thousand single-voxel datasets were simulated, each with their own relaxation parameters, TRs, TEs and flip angles. Each voxel had three contrasts (i.e., three different TRs and flip angles) – including one with an MT pulse – and five echo times per contrast. All log parameters ( $\log A$ , $\log R_{1}$ , $\log R_{2}^{*}$ , logit MT_sat, log TR, log TE) were uniformly sampled in $[- 5, 5]$ , while flip angles were uniformly sampled in $[0, \frac{π}{4}]$ and the noise variance was fixed at $σ^{2} = 1$ . The (unregularised) SPGR model was then fitted using 10,000 iterations of the proposed second-order optimisation, and the negative log-likelihood was computed after each iteration. Two baseline algorithms were used as well: Gauss-Newton with a backtracking line-search (at each step, the descent direction was modulated by an Armijo factor) and Levenberg-Marquardt (LM). Following Press et al. (2007), updates were only accepted if they improved the objective-function. On failure, the GN Armijo factor (respectively the LM damping parameter) was divided (resp. multiplied) by 10, while it was multiplied (resp. divided) by 10 on success. One Armijo or damping factor was defined in each virtual voxel.

A similar experiment was conducted in order to compare the use of a log, rate or time representation during optimisation. Two alternative optimisation schemes were derived such that either ( $A$ , $R_{1}$ , $R_{2}^{*}$ , MT $_{sat}$ ) or ( $A$ , $T_{1}$ , $T_{2}^{*}$ , MT $_{sat}$ ) are optimised instead of ( $\log A$ , $\log R_{1}$ , $\log R_{2}^{*}$ , $logit {MT}_{sat}$ ). These two representations required an even more stable Hessian, were $| H | 1$ is used to load the diagonal instead of $diag (| H |)$ .

3.2. Datasets

Two dataset were used in this study:

1.
The first dataset used in this study is a single-subject, single-session reproducibility dataset: a single participant was scanned five times in a single session with a 0.8 mm MPM protocol. This protocol consists of three multi-echo gradient echo contrasts (flip angle: 21/6/6 deg, number of echoes: 8/8/6 with 2.3 ms echo spacing, off-resonance pulse: No/No/Yes, TR: 25 ms, resolution: 0.8 mm, FOV: $256 \times 224 \times 179$ mm $^{3}$ , GRAPPA acceleration: $2 \times 2$ with 40 reference lines). Calibration data were acquired to additionally account for contrast-specific receive field (Papp et al., 2016) and participant-specific transmit field inohomogeneities (Lutti et al., 2010) using the protocols described in Callaghan et al. (2019). Signal-to-noise ratios were estimated by fitting a two-class Rician mixture model to the image histograms. SNRs in the first echo were 29 (PDw), 25 (T1w) and 28 (MTw). SNRs in the last echo were 16 (PDw), 15 (T1w) and 16 (MTw).
2.
The second dataset is the demonstration dataset from the hMRI toolbox (Callaghan et al., 2019), which consists of the same 0.8 mm MPM protocol acquired in a single subject, with motion between the MT-weighted scan and the PD- and T1-weighted scans.

3.3. Algorithms

We compared two variants of our method (ML and MAP) to methods based on a two-step process described in following Tabelow et al. (2019) (ESTATICS to estimate $R_{2}^{*}$ and $T_{E} = 0$ intercepts, followed by an analytical fit of $R_{1}$ , $A$ and MT_sat from these intercepts). These baseline methods use either a log-linear fit or a non-linear fit of ESTATICS, and use different denoising methods (pre-processing with a non-local means filter or MAP with a JTV prior):

•
LOGLIN performs a log-linear fit of $R_{2}^{*}$ from which weighted images extrapolated to TE=0 are obtained. This is equivalent to ESTATICS (Weiskopf et al., 2013), except that the spatial projection is integrated in the model. The Hessian matrix that is used is a (slight) majoriser of the true Hessian, so three Newton-like iterations are needed to reach the optimum (compared to a single one, had the true Hessian been used). Since all TRs are identical, $A$ , $R_{1}$ and MT_sat are then computed analytically according to Mohammadi et al. (2017), without requiring rational approximations.
•
LOGLIN+AONLM applies an anisotropic non-local mean filter (Manjón et al., 2010) to all individual echoes before processing them with LOGLIN.
•
NONLIN performs a non-linear fit or $R_{2}^{*}$ instead of a log-linear fit, but does not use any regularisation. This method was not used in Balbastre et al. (2020), because Gauss-Newton was not stable enough without regularisation. The new Hessian used in this work makes its optimisation possible.
•
NONLIN+JTV adds JTV-regularisation to NONLIN (Balbastre et al., 2020).
•
SPGR-ML performs a non-linear maximum-likelihood fit of the log-parameters $\log R_{1}$ , $\log R_{2}^{*}$ , $\log A$ and logit MT_sat.
•
SPGR+JTV adds JTV regularisation to SPGR-ML, and effectively performs a non-linear maximum a posteriori fit.

For each dataset, as a preprocessing step, all contrasts across all repeats (and their $B_{1}^{+ / -}$ maps) were registered towards the PD-weighted series of the first repeat with SPM12. This co-registration step only modified the orientation matrices in the header of the NIfTI files and did not involve any reslicing.

3.4. Cross-validation

This experiment was based on the single-session dataset. First, one of the repeats was used to estimate optimal regularisation factors for LOGLIN+AONLM, NONLIN+JTV, and SPGR+JTV by cross-validation across echoes. Five folds were constructed, with the first six echo times randomly permuted and split between a training set – used to estimate parameter maps – and a testing set – used to evaluate predictions. In each fold, the training echoes were used to estimate parameters with multiple regularisation values; these parameters were then used to simulate the left-out echoes and compared against the true (acquired) volumes. The mean-squared error (MSE) between the true and predicted echoes was computed and z-normalised across folds, contrast and echo times. Finally, the median z-normalised MSE was used as a metric to select the most adequate regularisation factor.

The remaining four repeats were used to compare all (optimised) methods by cross-validation across repeats. Parameter maps were computed for each repeat and used to predict individual echoes from all other repeats.

3.5. Echo decimation

This experiment was based on the reproducibility dataset. First, reference parameter maps were obtained by computing the maximum-likelihood solution (SPGR-ML) of combined data from all four test runs. Parameter maps of one of the runs were then reconstructed by either SPGR+JTV or SPGR-ML using the first 2, 4 or 6 echoes. The root mean squared error (RMSE) – within a brain ROI – was computed between each map and its reference.

3.6. Uncertainty mapping

The inverse of the Hessian of the objective function at the optimum can be used, under the Laplace approximation, to estimate the posterior uncertainty of the log-parameters (MacKay, 2003). Since it is easier to invert, we rather use the inverse of the block-diagonal preconditioner and extract its diagonal:

cov [y | X] \approx {(H + diag (G^{T} W G) \otimes λ)}^{- 1}

(43)

where $X$ contains all observed volumes used to estimate the maps. Note that under the Laplace approximation, the posterior of the log-parameter maps is Normal-distributed, making the parameters themselves Log-Normal-distributed. If the parameter maps themselves are of interest, it might be more interesting to return their expected value and variance, which in the case of $R_{1}$ gives:

E [r_{1}] = \sqrt{\exp ({\tilde{σ}}^{2})} \exp ({\tilde{r}}_{1})

(44)

V [r_{1}] = (\exp ({\tilde{σ}}^{2}) - 1) \exp ({\tilde{σ}}^{2}) \exp {({\tilde{r}}_{1})}^{2}

(45)

Conversely, if $T_{1} = 1 / R_{1}$ is of interest:

E [t_{1}] = \sqrt{\exp ({\tilde{σ}}^{2})} \exp (- {\tilde{r}}_{1})

(46)

V [t_{1}] = (\exp ({\tilde{σ}}^{2}) - 1) \exp ({\tilde{σ}}^{2}) \exp {(- {\tilde{r}}_{1})}^{2}

(47)

In this experiment, we used three echoes from the hMRI demonstration dataset to reconstruct maps using SPGR+JTV and SPGR-ML and computed the posterior uncertainty according to the above formula. This allowed us to map the posterior mean and standard deviation of log- $R_{1}$ , $R_{1}$ and $T_{1}$ . The ratio between the expected value of $R_{1}$ under the Laplace approximation and under a point estimate is given by $\sqrt{\exp ({\tilde{σ}}^{2})}$ .

4. Results

4.1. Convergence

The negative log-likelihood and gain after each fitting iteration in 50 randomly selected voxels are shown in Fig. 4. These plots show that all voxels converge monotonically with our approach (the gain is never negative) despite the very wide range of parameters ( $e . g .$ , SNR, observed relaxation times) covered by the simulations. Conversely, Gauss-Newton and Levenberg-Marquardt fail to converge to the optimum in a large number of voxels, despite the fact that this simulation context is very favourable to them: they are allowed to define one regularisation parameter per voxel, whereas in a spatially regularised case, voxels would not be independent and would have to share the same parameter.

Note that under this setup (3 contrasts, 5 echoes per contrasts), the vast majority of these maximum-likelihood fits yield optimal negative log-likelihood that are lower than the expected negative log-likelihood of the true parameters. This shows that ML likely over-fits the data and is strong evidence that regularisation is needed.

Fig. 5 shows results for the same experiment, this time comparing different optimisation basis (log, rate or time). While all methods converge to the same optimum, the rate of convergence is clearly higher when log-parameters are optimised.

4.2. Hyper-parameter selection

SPGR+JTV used the same regularisation factor ( $λ$ ) for all contrasts. Values ${1, 5, 10, 15, 20}$ were tried and the smallest MSE was obtained with $λ = 10$ . Because they have different dynamic scales, NONLIN+JTV used a different factor for the $R_{2}^{*}$ map ( $λ_{r}$ ) and for the log-intercepts ( $λ_{i}$ ). We tried values ${0.001, 0.01, 0.1, 1.0, 10}$ and ${100, 500, 1000, 5000, 10000}$ respectively and found an optimal combination with $λ_{r} = 0.1$ and $λ_{i} = 1000$ . For LOGLIN+AONLM, we used the optimal value $β = 0.2$ found in Balbastre et al. (2020).

4.3. Methods cross-validation

Quantitative maps obtained with all methods are displayed in Fig. 6, along with MSE. As expected, directly fitting the parameters is beneficial over the use of sequential parameter estimation. SPGR+JTV (median MSE = 1459) outperform non-regularised approaches such as SPGR-ML (median MSE = 1706). Methods that make use of rational approximations all trail behind, with the regularised NONLIN+JTV (median 1716) again improving over the non-regularised NONLIN (median 2467). The log-linear version trails far behind (median MSE = 2873), with AONLM preprocessing only slighlty improving it (median MSE = 2799).

4.4. Echo decimation

Fig. 7 displays all four parameter maps ( $R_{1}$ , $R_{2}^{*}$ , PD, MT_sat) reconstructed using a growing number of echoes (2, 4, 6) with a regularised algorithm (SPGR+JTV) and a non-regularised one (SPGR-ML). The RMSE between all maps and a reference map (the combined ML solution of all 4 runs) was computed and the regularised maps show a much lower RMSE than the non-regularised ones, even when fewer echoes are used. Noise amplification in the center of the brain appears clearly in the SPGR-ML maps when only two echoes are used, whereas SPGR+JTV manages to preserve anatomical details in this region, despite some additional smoothing. Note that in our implementation, the noise variance and regularisation are two different parameters, even though they share a single degree of freedom in the optimisation. The noise variance is estimated on a volume-by-volume basis while the regularisation factor is kept a priori fixed. Consequently, the balance between the likelihood and the prior appears clearly in the maps displayed here, as more smoothing happens when less information (e.g., fewer echoes) is present in the data.

4.5. Uncertainty mapping

The capacity to estimate uncertainty is exemplified via $R_{1}$ : posterior expected $R_{1}$ and $T_{1}$ maps and their uncertainties are shown in Fig. 8. Uncertainty after a SPGR-ML fit is mostly driven by intensities: white matter has a higher uncertainty because its MR signal, and therefore SNR, is generally lower than the grey matter. Conversely, uncertainty after a SPGR+JTV fit is mostly driven by edges, since less local signal averaging is possible in their vicinity.

5. Discussion & conclusion

In this paper, we introduced an optimisation-based framework for multi-parameter mapping that directly fits the signal equations to all available SPGR data, and can work with (or without) a wide range of regularisers. The key innovation of our method is a second-order optimiser that makes use of a bespoke approximate Hessian that shows excellent experimental convergence. We showed on toy problems that our update steps have theoretical properties that ensure monotonic convergence even when the quadratic approximation is not a majoriser of the objective function. Although we did not offer analytical proof that this monotonic convergence extends to the multivariate case, our solution is at least as good as Gauss-Newton, which is known to converge to the optimum when it converges. Our optimiser therefore benefits from the same property and, experimentally, was never found to diverge.

We also introduced a novel encoding of the relaxation parameters, based on their log, that makes regularisation insensitive to the choice of unit. This encoding has the additional advantage that parameters live on the entire real line, making the optimisation problem unconstrained.

We evaluated the use of a joint total variation regulariser, an edge-preserving prior that introduces implicit correlations between channels, increasing its denoising power over parameter-wise total variation. We show that this prior yields the best parameter inference, under the metric that inferred parameters best predict unseen data. Although this paper focuses on relatively uninformative spatial regularisers, our method opens the door to the use of informative learned priors. One possibility, inspired by Brudfors, Ashburner, Nachev, Balbastre, 2019, Brudfors, Balbastre, Flandin, Nachev, Ashburner, 2020, is to include a multivariate Gaussian mixture model of the log-parameter maps, with learned tissue parameters (means, covariances, tissue probability maps). Alternatively, priors based on learned dictionaries of patches could be used (Dalca et al., 2017).

Although this was not the topic of this paper, it is important to investigate some properties of the maximum-likelihood estimator. While our second-order scheme may output an approximate posterior distribution under the Laplace approximation, it is centered about the mode of the posterior, which is certainly not aligned with the posterior expected value. Different corrections could be devised depending on the map of interest ( $e . g .$ , $R_{1}$ , $T_{1}$ or $\pm \log R_{1}$ ).

The Laplace approximation allowed posterior uncertainty to be estimated. These uncertainty maps are far from perfect: they only contain aleatory uncertainty (i.e., uncertainty that stems from the probabilistic nature of the variables in play, that is known and can be modelled), and are based on the assumption that the $B_{1}^{+ / -}$ fields and thermal noise variance are known, thereby discarding any uncertainty about their values. Furthermore, it relies on a Gaussian assumption that could be very wrong. Nonetheless, all models are wrong but some are useful, and – while flawed – these uncertainty maps are better than blind confidence. They could be used to e.g. down-weight low-quality data points in downstream regression or classification tasks.

Finally, transmit and receive field maps are assumed pre-computed and fixed in this work and spoiling is assumed perfect. However, it has been shown that key sources of variance in $T_{1}$ mapping come from imperfect spoiling and errors in the estimate of the $B_{1}^{+}$ field (Stikov et al., 2015). In theory, transmit field map estimation could be integrated in the optimisation framework as long as the corresponding imaging data is included as well (Hurley et al., 2012). However, $B_{0}$ and $B_{1}^{+}$ mapping protocols based on phase differences would be difficult to fit using second-order methods because of the circular support of phase values (Fessler and Noll, 2004). On the other hand, imperfect spoiling is challenging to integrate in the generative model because, in its current form, it is based on full numerical Bloch simulations and a posteriori transformation of apparent $T_{1}$ maps under a polynomial whose coefficients depend on the $B_{1}^{+}$ (Preibisch and Deichmann, 2009). Of course, those post hoc corrections can still be applied to the parameters estimated here.

Some remaining approximations could also be removed: (1) our Gaussian noise model could be replaced by a more accurate non-central Chi or Rice model that can be fitted using a majorisation-minimisation framework similar to the one used for optimising the JTV functional in this work (Varadarajan and Haldar, 2015); (2) the true TR of the off-resonance pulse could be used instead of being set to zero (Mohammadi et al., 2017), which may remove some residual bias in the $R_{1}$ and MT_sat maps. Furthermore, the generative model could be extended to more complex biophysical models, such as multi-compartment models (Deoni et al., 2008a), and to additional sequences, such as spin echo or inversion recovery. Diffusion data could also be integrated and modelled (Jian et al., 2007), as joint diffusion-relaxometry datasets are increasingly being used for accurate microstructure modelling (Hutter et al., 2018). Note that the diffusion signal can also be described using exponential-linear models, which are likely to benefit from our improved Hessian.

On the computational side, this method would benefit from any progress made on TV optimisation. Our implementation uses a relatively simple conjugate-gradient optimiser with Jacobi preconditioning, and it is certain that using preconditioners better tailored to TV would improve its convergence. Additionally, although we experimentally show the stability of our approximate Hessian, we did not provide a proof of convergence in this work. A formal proof that builds on our analysis would surely help move the field forward.

In conclusion, we have presented a flexible probabilistic generative modelling framework, which capitalises on a novel approximate Hessian to enable the robust joint estimation of multiple quantitative MRI parameters. The framework can be adapted to incorporate spatial projections, denoising schemes and additional priors. With this framework we have demonstrated enhanced performance relative to the consecutive estimation techniques commonly used in multi-parameter mapping, with additional benefits such as uncertainty mapping.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

YB, MFC and JA were funded by the MRC and Spinal Research Charity through the ERA-NET Neuron joint call (MR/R000050/1). YB was partially supported by the National Institutes of Health under award numbers U01MH117023, R01AG064027 and P41EB030006. CL is supported by an MRC Clinician Scientist award (MR/R006504/1). The Wellcome Centre for Human Neuroimaging is supported by core funding from the Wellcome [203147/Z/16/Z].

This procedure neglects noise propagation and assumes that successive quantities are exact – even though they are not.

A spoiled gradient echo acquisition is obtained by playing an excitation pulse that causes the bulk magnetization to rotate by an angle $α$ (the flip angle), followed by a series of dephasing and rephasing gradients that create an echo train.

⁴

This proof could be easily extended to local-convergence when more than one minimum exists, but we focus on the simpler case for clarity

⁵

When the receive and transmit fields are known from e.g. field mapping, Eq. (26) is slightly changed. The flip angle $α$ takes a voxel-dependent value $α_{n} = α_{nom} b_{n}^{+}$ , where the nominal flip angle is modulated by the transmit field efficiency; and the whole signal is modulated voxel-wise by the net receive field $b_{n}^{-}$ .

⁶

Since the same transformation is applied to all maps, $Ψ$ really is the Kronecker product of an $M \times N$ reslicing matrix and a $K \times K$ identity matrix.

Generative model should be understood in the classical sense: a (probabilistic) function that mimics the true data-generating process.

References

Ashburner J. A fast diffeomorphic image registration algorithm. Neuroimage. 2007;38(1):95–113. doi: 10.1016/j.neuroimage.2007.07.007. [DOI] [PubMed] [Google Scholar]
Ashburner J., Brudfors M., Bronik K., Balbastre Y. An algorithm for learning shape and appearance models without annotations. Neuroimage. 2018;55:197–215. doi: 10.1016/j.media.2019.04.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ashburner J., Ridgway G.R. Symmetric diffeomorphic modeling of longitudinal structural MRI. Front. Neurosci. 2013;6:197. doi: 10.3389/fnins.2012.00197. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bach F. Optimization with sparsity-inducing penalties. FNT Mach. Learn. 2011;4(1):1–106. [Google Scholar]
Balbastre Y., Brudfors M., Azzarito M., Lambert C., Callaghan M.F., Ashburner J. In: Medical Image Computing and Computer Assisted Intervention MICCAI 2020. Martel A.L., Abolmaesumi P., Stoyanov D., Mateus D., Zuluaga M.A., Zhou S.K., Racoceanu D., Joskowicz L., editors. Springer International Publishing; Cham: 2020. Joint total variation ESTATICS for robust multi-parameter mapping; pp. 53–63. [Google Scholar]
Bauer C.M., Jara H., Killiany R. Whole brain quantitative T2 MRI across multiple scanners with dual echo FSE: applications to AD, MCI, and normal aging. Neuroimage. 2010;52(2):508–514. doi: 10.1016/j.neuroimage.2010.04.255. [DOI] [PMC free article] [PubMed] [Google Scholar]
Beck A., Teboulle M. Fast gradient-based algorithms for constrained total variation image denoising and deblurring problems. IEEE Trans. Image Process. 2009;18(11):2419–2434. doi: 10.1109/TIP.2009.2028250. [DOI] [PubMed] [Google Scholar]; Conference Name: IEEE Transactions on Image Processing
Bloch F. Nuclear induction. Phys. Rev. 1946;70(7–8):460–474. [Google Scholar]; 07663
Boyd S., Parikh N., Chu E., Peleato B., Eckstein J. Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends Mach. Learn. 2011;3(1):1–122. [Google Scholar]
Brudfors M., Ashburner J., Nachev P., Balbastre Y. In: Simulation and Synthesis in Medical Imaging. Burgos N., Gooya A., Svoboda D., editors. Springer International Publishing; Cham: 2019. Empirical Bayesian mixture models for medical image translation; pp. 1–12. [Google Scholar]
Brudfors M., Balbastre Y., Flandin G., Nachev P., Ashburner J. In: Medical Image Computing and Computer Assisted Intervention MICCAI 2020. Martel A.L., Abolmaesumi P., Stoyanov D., Mateus D., Zuluaga M.A., Zhou S.K., Racoceanu D., Joskowicz L., editors. Springer International Publishing; Cham: 2020. Flexible Bayesian modelling for nonlinear image registration; pp. 253–263. [Google Scholar]
Brudfors M., Balbastre Y., Nachev P., Ashburner J. Proceedings of the 22nd Conference on Medical Image Understanding and Analysis. Southampton, UK; 2018. MRI super-resolution using multi-channel total variation. [Google Scholar]; 00000
Callaghan M.F., Lutti A., Ashburner J., Balteau E., Corbin N., Draganski B., Helms G., Kherif F., Leutritz T., Mohammadi S., Phillips C., Reimer E., Ruthotto L., Seif M., Tabelow K., Ziegler G., Weiskopf N. Example dataset for the hMRI toolbox. Data Brief. 2019;25:104132. doi: 10.1016/j.dib.2019.104132. [DOI] [PMC free article] [PubMed] [Google Scholar]
Campbell J.S.W., Leppert I.R., Narayanan S., Boudreau M., Duval T., Cohen-Adad J., Pike G.B., Stikov N. Promise and pitfalls of g-ratio estimation with MRI. Neuroimage. 2018;182:80–96. doi: 10.1016/j.neuroimage.2017.08.038. [DOI] [PubMed] [Google Scholar]
Cercignani M., Alexander D.C. Optimal acquisition schemes for in vivo quantitative magnetization transfer MRI. Magn. Reson. Med. 2006;56(4):803–810. doi: 10.1002/mrm.21003. [DOI] [PubMed] [Google Scholar]
Chang L.-C., Koay C.G., Basser P.J., Pierpaoli C. Linear least-squares method for unbiased estimation of T1 from SPGR signals. Magn. Reson. Med. 2008;60(2):496–501. doi: 10.1002/mrm.21669. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chen Z., Prato F.S., McKenzie C. T1 fast acquisition relaxation mapping (T1/-FARM): an optimized reconstruction. IEEE Trans. Med. Imaging. 1998;17(2):155–160. doi: 10.1109/42.700728. [DOI] [PubMed] [Google Scholar]; Conference Name: IEEE Transactions on Medical Imaging
Chun I.Y., Fessler J.A. Convolutional dictionary learning: acceleration and convergence. IEEE Trans. Image Process. 2018;27(4):1697–1712. doi: 10.1109/TIP.2017.2761545. [DOI] [PubMed] [Google Scholar]
Corbin N., Callaghan M.F. Imperfect spoiling in variable flip angle T1 mapping at 7T: quantifying and minimizing impact. Magn. Reson. Med. 2021;86(2):693–708. doi: 10.1002/mrm.28720. [DOI] [PMC free article] [PubMed] [Google Scholar]
Coupé P., Manjón J., Robles M., Collins D. Adaptive multiresolution non-local means filter for three-dimensional magnetic resonance image denoising. IET Image Process. 2012;6(5):558–568. [Google Scholar]
Coupe P., Yger P., Prima S., Hellier P., Kervrann C., Barillot C. An optimized blockwise nonlocal means denoising filter for 3-D magnetic resonance images. IEEE Trans. Med. Imaging. 2008;27(4):425–441. doi: 10.1109/TMI.2007.906087. [DOI] [PMC free article] [PubMed] [Google Scholar]
Dalca A.V., Bouman K.L., Freeman W.T., Rost N.S., Sabuncu M.R., Golland P. In: Information Processing in Medical Imaging. Niethammer M., Styner M., Aylward S., Zhu H., Oguz I., Yap P.-T., Shen D., editors. Springer International Publishing; Cham: 2017. Population based image imputation; pp. 659–671. [DOI] [PMC free article] [PubMed] [Google Scholar]
Daubechies I., Defrise M., Mol C.D. An iterative thresholding algorithm for linear inverse problems with a sparsity constraint. Commun. Pure Appl. Math. 2004;57(11):1413–1457. [Google Scholar]
Daubechies I., DeVore R., Fornasier M., Güntürk C.S. Iteratively reweighted least squares minimization for sparse recovery. Commun. Pure Appl. Math. 2010;63(1):1–38. [Google Scholar]
Deoni S.C.L., Rutt B.K., Arun T., Pierpaoli C., Jones D.K. Gleaning multicomponent T1 and T2 information from steady-state imaging data. Magn. Reson. Med. 2008;60(6):1372–1387. doi: 10.1002/mrm.21704. [DOI] [PubMed] [Google Scholar]
Deoni S.C.L., Williams S.C.R., Jezzard P., Suckling J., Murphy D.G.M., Jones D.K. Standardized structural magnetic resonance imaging in multicentre studies using quantitative T1 and T2 imaging at 1.5 t. Neuroimage. 2008;40(2):662–671. doi: 10.1016/j.neuroimage.2007.11.052. [DOI] [PubMed] [Google Scholar]
Dick F., Tierney A.T., Lutti A., Josephs O., Sereno M.I., Weiskopf N. In vivo functional and myeloarchitectonic mapping of human primary auditory areas. J. Neurosci. 2012;32(46):16095–16105. doi: 10.1523/JNEUROSCI.1712-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
Donoho D. De-noising by soft-thresholding. IEEE Trans. Inf. Theory. 1995;41(3):613–627. [Google Scholar]
Fessler J.A., Noll D.C. Proceedings of the 2nd IEEE International Symposium on Biomedical Imaging: Nano to Macro (IEEE Cat No. 04EX821) 2004. Iterative image reconstruction in MRI with separate magnitude and phase regularization; pp. 209–212Vol. 1. [Google Scholar]
Gong T., Tong Q., He H., Sun Y., Zhong J., Zhang H. MTE-NODDI: multi-TE NODDI for disentangling non-T2-weighted signal fractions from compartment-specific T2 relaxation times. Neuroimage. 2020;217:116906. doi: 10.1016/j.neuroimage.2020.116906. [DOI] [PubMed] [Google Scholar]
Gupta R.K. A new look at the method of variable nutation angle for the measurement of spin-lattice relaxation times using fourier transform NMR. J. Magn. Reson. (1969) 1977;25(1):231–235. [Google Scholar]
Hasan K.M., Walimuni I.S., Kramer L.A., Narayana P.A. Human brain iron mapping using atlas-based T2 relaxometry. Magn. Reson. Med. 2012;67(3):731–739. doi: 10.1002/mrm.23054. [DOI] [PMC free article] [PubMed] [Google Scholar]
Helms G., Dathe H., Dechent P. Quantitative FLASH MRI at 3T using a rational approximation of the ernst equation. Magn. Reson. Med. 2008;59(3):667–672. doi: 10.1002/mrm.21542. [DOI] [PubMed] [Google Scholar]
Helms G., Dathe H., Kallenberg K., Dechent P. High-resolution maps of magnetization transfer with inherent correction for RF inhomogeneity and T1 relaxation obtained from 3D FLASH MRI. Magn. Reson. Med. 2008;60(6):1396–1407. doi: 10.1002/mrm.21732. [DOI] [PubMed] [Google Scholar]
Huang J., Chen C., Axel L. In: Proceedings of the MICCAI 2012. Ayache N., Delingette H., Golland P., Mori K., editors. Springer; Berlin, Heidelberg: 2012. Fast multi-contrast MRI reconstruction; pp. 281–288. [Google Scholar]
Hurley S.A., Yarnykh V.L., Johnson K.M., Field A.S., Alexander A.L., Samsonov A.A. Simultaneous variable flip angle actual flip angle imaging method for improved accuracy and precision of three-dimensional T1 and B1 measurements. Magn. Reson. Med. 2012;68(1):54–64. doi: 10.1002/mrm.23199. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hutter J., Slator P.J., Christiaens D., Teixeira R.P.A.G., Roberts T., Jackson L., Price A.N., Malik S., Hajnal J.V. Integrated and efficient diffusion-relaxometry using ZEBRA. Sci. Rep. 2018;8(1):15138. doi: 10.1038/s41598-018-33463-2. [DOI] [PMC free article] [PubMed] [Google Scholar]; Number: 1 Publisher: Nature Publishing Group
Jian B., Vemuri B.C., zarslan E., Carney P.R., Mareci T.H. A novel tensor distribution model for the diffusion-weighted MR signal. Neuroimage. 2007;37(1):164–176. doi: 10.1016/j.neuroimage.2007.03.074. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jiru F., Klose U. Fast 3D radiofrequency field mapping using echo-planar imaging. Magn. Reson. Med. 2006;56(6):1375–1379. doi: 10.1002/mrm.21083. [DOI] [PubMed] [Google Scholar]
Langkammer C., Krebs N., Goessler W., Scheurer E., Ebner F., Yen K., Fazekas F., Ropele S. Quantitative MR imaging of brain iron: a postmortem validation study. Radiology. 2010;257(2):455–462. doi: 10.1148/radiol.10100495. [DOI] [PubMed] [Google Scholar]
Liberman G., Louzoun Y., Bashat D.B. T1 Mapping using variable flip angle SPGR data with flip angle correction. J. Magn. Reson. Imaging. 2014;40(1):171–180. doi: 10.1002/jmri.24373. [DOI] [PubMed] [Google Scholar]
Liu T., Liu J., de Rochefort L., Spincemaille P., Khalidov I., Ledoux J.R., Wang Y. Morphology enabled dipole inversion (MEDI) from a single-angle acquisition: comparison with COSMOS in human brain imaging. Magn. Reson. Med. 2011;66(3):777–783. doi: 10.1002/mrm.22816. [DOI] [PubMed] [Google Scholar]
Lutti A., Dick F., Sereno M.I., Weiskopf N. Using high-resolution quantitative mapping of R1 as an index of cortical myelination. Neuroimage. 2014;93:176–188. doi: 10.1016/j.neuroimage.2013.06.005. [DOI] [PubMed] [Google Scholar]
Lutti A., Hutton C., Finsterbusch J., Helms G., Weiskopf N. Optimization and validation of methods for mapping of the radiofrequency transmit field at 3T. Magn. Reson. Med. 2010;64(1):229–238. doi: 10.1002/mrm.22421. [DOI] [PMC free article] [PubMed] [Google Scholar]
MacKay D.J. Choice of basis for laplace approximation. Mach. Learn. 1998;33(1):77–86. [Google Scholar]
MacKay D.J.C. sixth printing 2007 edition. Cambridge University Press; Cambridge, UK ; New York: 2003. Information Theory, Inference and Learning Algorithms. [Google Scholar]
Manjón J.V., Coupé P., Buades A., Louis Collins D., Robles M. New methods for MRI denoising based on sparseness and self-similarity. Med. Image Anal. 2012;16(1):18–27. doi: 10.1016/j.media.2011.04.003. [DOI] [PubMed] [Google Scholar]
Manjón J.V., Coupé P., Martí-Bonmatí L., Collins D.L., Robles M. Adaptive non-local means denoising of MR images with spatially varying noise levels. J. Magn. Reson. Imaging. 2010;31(1):192–203. doi: 10.1002/jmri.22003. [DOI] [PubMed] [Google Scholar]
Mascarenhas W.F. The divergence of the BFGS and gauss newton methods. Math. Program. 2014;147(1):253–276. [Google Scholar]
Milford D., Rosbach N., Bendszus M., Heiland S. Mono-exponential fitting in T2-relaxometry: relevance of offset and first echo. PLoS ONE. 2015;10(12):e0145255. doi: 10.1371/journal.pone.0145255. [DOI] [PMC free article] [PubMed] [Google Scholar]; Publisher: Public Library of Science
Mohammadi S., D’Alonzo C., Ruthotto L., Polzehl J., Ellerbrock I., Callaghan M.F., Weiskopf N., Tabelow K. Simultaneous adaptive smoothing of relaxometry and quantitative magnetization transfer mapping. Weierstrass Inst. Appl. Anal. Stoch. Preprint 2432. 2017 doi: 10.20347/WIAS.PREPRINT.2432. [DOI] [Google Scholar]
Ogg R.J., Langston J.W., Haacke E.M., Steen R.G., Taylor J.S. The correlation between phase shifts in gradient-echo MR images and regional brain iron concentration. Magn. Reson. Imaging. 1999;17(8):1141–1148. doi: 10.1016/s0730-725x(99)00017-x. [DOI] [PubMed] [Google Scholar]
Ordidge R.J., Gorell J.M., Deniau J.C., Knight R.A., Helpern J.A. Assessment of relative brain iron concentrations using T2-weighted and T2*-weighted MRI at 3 tesla. Magn. Reson. Med. 1994;32(3):335–341. doi: 10.1002/mrm.1910320309. [DOI] [PubMed] [Google Scholar]
Papp D., Callaghan M.F., Meyer H., Buckley C., Weiskopf N. Correction of inter-scan motion artifacts in quantitative R1 mapping by accounting for receive coil sensitivity effects. Magn. Reson. Med. 2016;76(5):1478–1485. doi: 10.1002/mrm.26058. [DOI] [PMC free article] [PubMed] [Google Scholar]
Polders D.L., Leemans A., Luijten P.R., Hoogduin H. Uncertainty estimations for quantitative in vivo MRI T1 mapping. J. Magn. Reson. 2012;224:53–60. doi: 10.1016/j.jmr.2012.08.017. [DOI] [PubMed] [Google Scholar]
Preibisch C., Deichmann R. Influence of RF spoiling on the stability and accuracy of T1 mapping based on spoiled FLASH with varying flip angles. Magn. Reson. Med. 2009;61(1):125–135. doi: 10.1002/mrm.21776. [DOI] [PubMed] [Google Scholar]
Press W.H., Teukolsky S.A., Vetterling W.T., Flannery B.P. 3 edition. Cambridge University Press; Cambridge, UK ; New York: 2007. Numerical Recipes 3rd Edition: The Art of Scientific Computing. [Google Scholar]
Rudin L.I., Osher S., Fatemi E. Nonlinear total variation based noise removal algorithms. Phys. D. 1992;60(1):259–268. [Google Scholar]
Sacolick L.I., Wiesinger F., Hancu I., Vogel M.W. B1 mapping by Bloch-Siegert shift. Magn. Reson. Med. 2010;63(5):1315–1322. doi: 10.1002/mrm.22357. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sapiro G., Ringach D.L. Anisotropic diffusion of multivalued images with applications to color filtering. IEEE Trans. Image Process. 1996;5(11):1582–1586. doi: 10.1109/83.541429. [DOI] [PubMed] [Google Scholar]
Scholand N., Wang X., Rosenzweig S., Holme H.C.M., Uecker M. Generic quantitative MRI using model-based reconstruction with the Bloch equations. Proc. Intl. Soc. Mag. Reson. Med. 2020;Vol. 28 [Google Scholar]
Sereno M.I., Lutti A., Weiskopf N., Dick F. Mapping the human cortical surface by combining quantitative T1 with retinotopy. Cereb. Cortex. 2013;23(9):2261–2268. doi: 10.1093/cercor/bhs213. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sigalovsky I.S., Fischl B., Melcher J.R. Mapping an intrinsic MR property of gray matter in auditory cortex of living humans: a possible marker for primary cortex and hemispheric differences. Neuroimage. 2006;32(4):1524–1537. doi: 10.1016/j.neuroimage.2006.05.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sijbers J., Dekker A.J.d., Raman E., Dyck D.V. Parameter estimation from magnitude MR images. Int. J. Imaging Syst. Technol. 1999;10(2):109–114. [Google Scholar]
Stikov N., Boudreau M., Levesque I.R., Tardif C.L., Barral J.K., Pike G.B. On the accuracy of T1 mapping: searching for common ground. Magn. Reson. Med. 2015;73(2):514–522. doi: 10.1002/mrm.25135. [DOI] [PubMed] [Google Scholar]
Sumpf T.J., Petrovic A., Uecker M., Knoll F., Frahm J. Fast T2 mapping with improved accuracy using undersampled spin-echo MRI and model-based reconstructions with a generating function. IEEE Trans. Med. Imaging. 2014;33(12):2213–2222. doi: 10.1109/TMI.2014.2333370. [DOI] [PMC free article] [PubMed] [Google Scholar]; Conference Name: IEEE Transactions on Medical Imaging
Sumpf T.J., Uecker M., Boretius S., Frahm J. Model-based nonlinear inverse reconstruction for T2 mapping using highly undersampled spin-echo MRI. J. Magn. Reson. Imaging. 2011;34(2):420–428. doi: 10.1002/jmri.22634. [DOI] [PubMed] [Google Scholar]
Tabelow K., Balteau E., Ashburner J., Callaghan M.F., Draganski B., Helms G., Kherif F., Leutritz T., Lutti A., Phillips C., Reimer E., Ruthotto L., Seif M., Weiskopf N., Ziegler G., Mohammadi S. hMRI – a toolbox for quantitative MRI in neuroscience and clinical research. Neuroimage. 2019;194:191–210. doi: 10.1016/j.neuroimage.2019.01.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tisdall M.D. Bias and SNR of T1 estimates derived from joint fitting of actual flip-angle and FLASH imaging data with variable flip angles. Proc. Intl. Soc. Mag. Reson. Med. 2017;25:1445. [PMC free article] [PubMed] [Google Scholar]
Tisdall M.D., van der Kouwe A.J.W. Efficient algorithm for maximum likelihood estimate and confidence intervals of T1 from multi-flip, multi-echo FLASH. Proc. Intl. Soc. Mag. Reson. Med. 2016;Vol. 24 [Google Scholar]
Tofts P.S. 1. John Wiley & Sons, Ltd; 2003. Quantitative MRI of the Brain. [Google Scholar]
Tofts P.S., Steens S.C.A., van Buchem M.A. Quantitative MRI of the Brain. John Wiley & Sons, Ltd; 2003. MT: magnetization transfer; pp. 257–298. [Google Scholar]
Tofts P.S., Steens S.C.A., Cercignani M., Admiraal-Behloul F., Hofman P.A.M., van Osch M.J.P., Teeuwisse W.M., Tozer D.J., van Waesberghe J.H.T.M., Yeung R., Barker G.J., van Buchem M.A. Sources of variation in multi-centre brain MTR histogram studies: body-coil transmission eliminates inter-centre differences. Magn. Reson. Mater. Phys. 2006;19(4):209–222. doi: 10.1007/s10334-006-0049-8. [DOI] [PubMed] [Google Scholar]
Varadarajan D., Frost R., van der Kouwe A., Morgan L., Diamond B., Boyd E., Fogarty M., Stevens A., Fischl B., Polimeni J.R. Edge-preserving B0 inhomogeneity distortion correction for high-resolution multi-echo ex vivo MRI at 7T. Proc. Intl. Soc. Mag. Reson. Med. 2020;Vol. 28 [Google Scholar]
Varadarajan D., Haldar J.P. A majorize-minimize framework for Rician and non-central chi MR images. IEEE Trans. Med. Imaging. 2015;34(10):2191–2202. doi: 10.1109/TMI.2015.2427157. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wang H.Z., Riederer S.J., Lee J.N. Optimizing the precision in T1 relaxation estimation using limited flip angles. Magn. Reson. Med. 1987;5(5):399–416. doi: 10.1002/mrm.1910050502. [DOI] [PubMed] [Google Scholar]
Weiskopf N., Callaghan M.F., Josephs O., Lutti A., Mohammadi S. Estimating the apparent transverse relaxation time (R2*) from images with different contrasts (ESTATICS) reduces motion artifacts. Front. Neurosci. 2014;8 doi: 10.3389/fnins.2014.00278. [DOI] [PMC free article] [PubMed] [Google Scholar]
Weiskopf N., Suckling J., Williams G., Correia M.M., Inkster B., Tait R., Ooi C., Bullmore E.T., Lutti A. Quantitative multi-parameter mapping of R1, PD*, MT, and R2* at 3T: a multi-center validation. Front. Neurosci. 2013;7 doi: 10.3389/fnins.2013.00095. [DOI] [PMC free article] [PubMed] [Google Scholar]
Whittall K.P., MacKay A.L. Quantitative interpretation of NMR relaxation data. J. Magn. Reson. (1969) 1989;84(1):134–152. [Google Scholar]
Wiggermann V., Vavasour I.M., Kolind S.H., MacKay A.L., Helms G., Rauscher A. Non-negative least squares computation for in vivo myelin mapping using simulated multi-echo spin-echo T2 decay data. NMR Biomed. 2020;33(12):e4277. doi: 10.1002/nbm.4277. [DOI] [PubMed] [Google Scholar]
Yarnykh V.L. Actual flip-angle imaging in the pulsed steady state: a method for rapid three-dimensional mapping of the transmitted radiofrequency field. Magn. Reson. Med. 2007;57(1):192–200. doi: 10.1002/mrm.21120. [DOI] [PubMed] [Google Scholar]
Zhao B., Lam F., Liang Z. Model-based MR parameter mapping with sparsity constraints: parameter estimation and performance bounds. IEEE Trans. Med. Imaging. 2014;33(9):1832–1844. doi: 10.1109/TMI.2014.2322815. [DOI] [PMC free article] [PubMed] [Google Scholar]; Conference Name: IEEE Transactions on Medical Imaging

[bib0001] Ashburner J. A fast diffeomorphic image registration algorithm. Neuroimage. 2007;38(1):95–113. doi: 10.1016/j.neuroimage.2007.07.007. [DOI] [PubMed] [Google Scholar]

[bib0002] Ashburner J., Brudfors M., Bronik K., Balbastre Y. An algorithm for learning shape and appearance models without annotations. Neuroimage. 2018;55:197–215. doi: 10.1016/j.media.2019.04.008. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0003] Ashburner J., Ridgway G.R. Symmetric diffeomorphic modeling of longitudinal structural MRI. Front. Neurosci. 2013;6:197. doi: 10.3389/fnins.2012.00197. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0004] Bach F. Optimization with sparsity-inducing penalties. FNT Mach. Learn. 2011;4(1):1–106. [Google Scholar]

[bib0005] Balbastre Y., Brudfors M., Azzarito M., Lambert C., Callaghan M.F., Ashburner J. In: Medical Image Computing and Computer Assisted Intervention MICCAI 2020. Martel A.L., Abolmaesumi P., Stoyanov D., Mateus D., Zuluaga M.A., Zhou S.K., Racoceanu D., Joskowicz L., editors. Springer International Publishing; Cham: 2020. Joint total variation ESTATICS for robust multi-parameter mapping; pp. 53–63. [Google Scholar]

[bib0006] Bauer C.M., Jara H., Killiany R. Whole brain quantitative T2 MRI across multiple scanners with dual echo FSE: applications to AD, MCI, and normal aging. Neuroimage. 2010;52(2):508–514. doi: 10.1016/j.neuroimage.2010.04.255. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0007] Beck A., Teboulle M. Fast gradient-based algorithms for constrained total variation image denoising and deblurring problems. IEEE Trans. Image Process. 2009;18(11):2419–2434. doi: 10.1109/TIP.2009.2028250. [DOI] [PubMed] [Google Scholar]; Conference Name: IEEE Transactions on Image Processing

[bib0008] Bloch F. Nuclear induction. Phys. Rev. 1946;70(7–8):460–474. [Google Scholar]; 07663

[bib0009] Boyd S., Parikh N., Chu E., Peleato B., Eckstein J. Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends Mach. Learn. 2011;3(1):1–122. [Google Scholar]

[bib0010] Brudfors M., Ashburner J., Nachev P., Balbastre Y. In: Simulation and Synthesis in Medical Imaging. Burgos N., Gooya A., Svoboda D., editors. Springer International Publishing; Cham: 2019. Empirical Bayesian mixture models for medical image translation; pp. 1–12. [Google Scholar]

[bib0011] Brudfors M., Balbastre Y., Flandin G., Nachev P., Ashburner J. In: Medical Image Computing and Computer Assisted Intervention MICCAI 2020. Martel A.L., Abolmaesumi P., Stoyanov D., Mateus D., Zuluaga M.A., Zhou S.K., Racoceanu D., Joskowicz L., editors. Springer International Publishing; Cham: 2020. Flexible Bayesian modelling for nonlinear image registration; pp. 253–263. [Google Scholar]

[bib0012] Brudfors M., Balbastre Y., Nachev P., Ashburner J. Proceedings of the 22nd Conference on Medical Image Understanding and Analysis. Southampton, UK; 2018. MRI super-resolution using multi-channel total variation. [Google Scholar]; 00000

[bib0013] Callaghan M.F., Lutti A., Ashburner J., Balteau E., Corbin N., Draganski B., Helms G., Kherif F., Leutritz T., Mohammadi S., Phillips C., Reimer E., Ruthotto L., Seif M., Tabelow K., Ziegler G., Weiskopf N. Example dataset for the hMRI toolbox. Data Brief. 2019;25:104132. doi: 10.1016/j.dib.2019.104132. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0014] Campbell J.S.W., Leppert I.R., Narayanan S., Boudreau M., Duval T., Cohen-Adad J., Pike G.B., Stikov N. Promise and pitfalls of g-ratio estimation with MRI. Neuroimage. 2018;182:80–96. doi: 10.1016/j.neuroimage.2017.08.038. [DOI] [PubMed] [Google Scholar]

[bib0015] Cercignani M., Alexander D.C. Optimal acquisition schemes for in vivo quantitative magnetization transfer MRI. Magn. Reson. Med. 2006;56(4):803–810. doi: 10.1002/mrm.21003. [DOI] [PubMed] [Google Scholar]

[bib0016] Chang L.-C., Koay C.G., Basser P.J., Pierpaoli C. Linear least-squares method for unbiased estimation of T1 from SPGR signals. Magn. Reson. Med. 2008;60(2):496–501. doi: 10.1002/mrm.21669. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0017] Chen Z., Prato F.S., McKenzie C. T1 fast acquisition relaxation mapping (T1/-FARM): an optimized reconstruction. IEEE Trans. Med. Imaging. 1998;17(2):155–160. doi: 10.1109/42.700728. [DOI] [PubMed] [Google Scholar]; Conference Name: IEEE Transactions on Medical Imaging

[bib0018] Chun I.Y., Fessler J.A. Convolutional dictionary learning: acceleration and convergence. IEEE Trans. Image Process. 2018;27(4):1697–1712. doi: 10.1109/TIP.2017.2761545. [DOI] [PubMed] [Google Scholar]

[bib0019] Corbin N., Callaghan M.F. Imperfect spoiling in variable flip angle T1 mapping at 7T: quantifying and minimizing impact. Magn. Reson. Med. 2021;86(2):693–708. doi: 10.1002/mrm.28720. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0020] Coupé P., Manjón J., Robles M., Collins D. Adaptive multiresolution non-local means filter for three-dimensional magnetic resonance image denoising. IET Image Process. 2012;6(5):558–568. [Google Scholar]

[bib0021] Coupe P., Yger P., Prima S., Hellier P., Kervrann C., Barillot C. An optimized blockwise nonlocal means denoising filter for 3-D magnetic resonance images. IEEE Trans. Med. Imaging. 2008;27(4):425–441. doi: 10.1109/TMI.2007.906087. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0022] Dalca A.V., Bouman K.L., Freeman W.T., Rost N.S., Sabuncu M.R., Golland P. In: Information Processing in Medical Imaging. Niethammer M., Styner M., Aylward S., Zhu H., Oguz I., Yap P.-T., Shen D., editors. Springer International Publishing; Cham: 2017. Population based image imputation; pp. 659–671. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0023] Daubechies I., Defrise M., Mol C.D. An iterative thresholding algorithm for linear inverse problems with a sparsity constraint. Commun. Pure Appl. Math. 2004;57(11):1413–1457. [Google Scholar]

[bib0024] Daubechies I., DeVore R., Fornasier M., Güntürk C.S. Iteratively reweighted least squares minimization for sparse recovery. Commun. Pure Appl. Math. 2010;63(1):1–38. [Google Scholar]

[bib0025] Deoni S.C.L., Rutt B.K., Arun T., Pierpaoli C., Jones D.K. Gleaning multicomponent T1 and T2 information from steady-state imaging data. Magn. Reson. Med. 2008;60(6):1372–1387. doi: 10.1002/mrm.21704. [DOI] [PubMed] [Google Scholar]

[bib0026] Deoni S.C.L., Williams S.C.R., Jezzard P., Suckling J., Murphy D.G.M., Jones D.K. Standardized structural magnetic resonance imaging in multicentre studies using quantitative T1 and T2 imaging at 1.5 t. Neuroimage. 2008;40(2):662–671. doi: 10.1016/j.neuroimage.2007.11.052. [DOI] [PubMed] [Google Scholar]

[bib0027] Dick F., Tierney A.T., Lutti A., Josephs O., Sereno M.I., Weiskopf N. In vivo functional and myeloarchitectonic mapping of human primary auditory areas. J. Neurosci. 2012;32(46):16095–16105. doi: 10.1523/JNEUROSCI.1712-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0028] Donoho D. De-noising by soft-thresholding. IEEE Trans. Inf. Theory. 1995;41(3):613–627. [Google Scholar]

[bib0029] Fessler J.A., Noll D.C. Proceedings of the 2nd IEEE International Symposium on Biomedical Imaging: Nano to Macro (IEEE Cat No. 04EX821) 2004. Iterative image reconstruction in MRI with separate magnitude and phase regularization; pp. 209–212Vol. 1. [Google Scholar]

[bib0030] Gong T., Tong Q., He H., Sun Y., Zhong J., Zhang H. MTE-NODDI: multi-TE NODDI for disentangling non-T2-weighted signal fractions from compartment-specific T2 relaxation times. Neuroimage. 2020;217:116906. doi: 10.1016/j.neuroimage.2020.116906. [DOI] [PubMed] [Google Scholar]

[bib0031] Gupta R.K. A new look at the method of variable nutation angle for the measurement of spin-lattice relaxation times using fourier transform NMR. J. Magn. Reson. (1969) 1977;25(1):231–235. [Google Scholar]

[bib0032] Hasan K.M., Walimuni I.S., Kramer L.A., Narayana P.A. Human brain iron mapping using atlas-based T2 relaxometry. Magn. Reson. Med. 2012;67(3):731–739. doi: 10.1002/mrm.23054. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0033] Helms G., Dathe H., Dechent P. Quantitative FLASH MRI at 3T using a rational approximation of the ernst equation. Magn. Reson. Med. 2008;59(3):667–672. doi: 10.1002/mrm.21542. [DOI] [PubMed] [Google Scholar]

[bib0034] Helms G., Dathe H., Kallenberg K., Dechent P. High-resolution maps of magnetization transfer with inherent correction for RF inhomogeneity and T1 relaxation obtained from 3D FLASH MRI. Magn. Reson. Med. 2008;60(6):1396–1407. doi: 10.1002/mrm.21732. [DOI] [PubMed] [Google Scholar]

[bib0035] Huang J., Chen C., Axel L. In: Proceedings of the MICCAI 2012. Ayache N., Delingette H., Golland P., Mori K., editors. Springer; Berlin, Heidelberg: 2012. Fast multi-contrast MRI reconstruction; pp. 281–288. [Google Scholar]

[bib0036] Hurley S.A., Yarnykh V.L., Johnson K.M., Field A.S., Alexander A.L., Samsonov A.A. Simultaneous variable flip angle actual flip angle imaging method for improved accuracy and precision of three-dimensional T1 and B1 measurements. Magn. Reson. Med. 2012;68(1):54–64. doi: 10.1002/mrm.23199. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0037] Hutter J., Slator P.J., Christiaens D., Teixeira R.P.A.G., Roberts T., Jackson L., Price A.N., Malik S., Hajnal J.V. Integrated and efficient diffusion-relaxometry using ZEBRA. Sci. Rep. 2018;8(1):15138. doi: 10.1038/s41598-018-33463-2. [DOI] [PMC free article] [PubMed] [Google Scholar]; Number: 1 Publisher: Nature Publishing Group

[bib0038] Jian B., Vemuri B.C., zarslan E., Carney P.R., Mareci T.H. A novel tensor distribution model for the diffusion-weighted MR signal. Neuroimage. 2007;37(1):164–176. doi: 10.1016/j.neuroimage.2007.03.074. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0039] Jiru F., Klose U. Fast 3D radiofrequency field mapping using echo-planar imaging. Magn. Reson. Med. 2006;56(6):1375–1379. doi: 10.1002/mrm.21083. [DOI] [PubMed] [Google Scholar]

[bib0040] Langkammer C., Krebs N., Goessler W., Scheurer E., Ebner F., Yen K., Fazekas F., Ropele S. Quantitative MR imaging of brain iron: a postmortem validation study. Radiology. 2010;257(2):455–462. doi: 10.1148/radiol.10100495. [DOI] [PubMed] [Google Scholar]

[bib0041] Liberman G., Louzoun Y., Bashat D.B. T1 Mapping using variable flip angle SPGR data with flip angle correction. J. Magn. Reson. Imaging. 2014;40(1):171–180. doi: 10.1002/jmri.24373. [DOI] [PubMed] [Google Scholar]

[bib0042] Liu T., Liu J., de Rochefort L., Spincemaille P., Khalidov I., Ledoux J.R., Wang Y. Morphology enabled dipole inversion (MEDI) from a single-angle acquisition: comparison with COSMOS in human brain imaging. Magn. Reson. Med. 2011;66(3):777–783. doi: 10.1002/mrm.22816. [DOI] [PubMed] [Google Scholar]

[bib0043] Lutti A., Dick F., Sereno M.I., Weiskopf N. Using high-resolution quantitative mapping of R1 as an index of cortical myelination. Neuroimage. 2014;93:176–188. doi: 10.1016/j.neuroimage.2013.06.005. [DOI] [PubMed] [Google Scholar]

[bib0044] Lutti A., Hutton C., Finsterbusch J., Helms G., Weiskopf N. Optimization and validation of methods for mapping of the radiofrequency transmit field at 3T. Magn. Reson. Med. 2010;64(1):229–238. doi: 10.1002/mrm.22421. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0045] MacKay D.J. Choice of basis for laplace approximation. Mach. Learn. 1998;33(1):77–86. [Google Scholar]

[bib0046] MacKay D.J.C. sixth printing 2007 edition. Cambridge University Press; Cambridge, UK ; New York: 2003. Information Theory, Inference and Learning Algorithms. [Google Scholar]

[bib0047] Manjón J.V., Coupé P., Buades A., Louis Collins D., Robles M. New methods for MRI denoising based on sparseness and self-similarity. Med. Image Anal. 2012;16(1):18–27. doi: 10.1016/j.media.2011.04.003. [DOI] [PubMed] [Google Scholar]

[bib0048] Manjón J.V., Coupé P., Martí-Bonmatí L., Collins D.L., Robles M. Adaptive non-local means denoising of MR images with spatially varying noise levels. J. Magn. Reson. Imaging. 2010;31(1):192–203. doi: 10.1002/jmri.22003. [DOI] [PubMed] [Google Scholar]

[bib0049] Mascarenhas W.F. The divergence of the BFGS and gauss newton methods. Math. Program. 2014;147(1):253–276. [Google Scholar]

[bib0050] Milford D., Rosbach N., Bendszus M., Heiland S. Mono-exponential fitting in T2-relaxometry: relevance of offset and first echo. PLoS ONE. 2015;10(12):e0145255. doi: 10.1371/journal.pone.0145255. [DOI] [PMC free article] [PubMed] [Google Scholar]; Publisher: Public Library of Science

[bib0051] Mohammadi S., D’Alonzo C., Ruthotto L., Polzehl J., Ellerbrock I., Callaghan M.F., Weiskopf N., Tabelow K. Simultaneous adaptive smoothing of relaxometry and quantitative magnetization transfer mapping. Weierstrass Inst. Appl. Anal. Stoch. Preprint 2432. 2017 doi: 10.20347/WIAS.PREPRINT.2432. [DOI] [Google Scholar]

[bib0052] Ogg R.J., Langston J.W., Haacke E.M., Steen R.G., Taylor J.S. The correlation between phase shifts in gradient-echo MR images and regional brain iron concentration. Magn. Reson. Imaging. 1999;17(8):1141–1148. doi: 10.1016/s0730-725x(99)00017-x. [DOI] [PubMed] [Google Scholar]

[bib0053] Ordidge R.J., Gorell J.M., Deniau J.C., Knight R.A., Helpern J.A. Assessment of relative brain iron concentrations using T2-weighted and T2*-weighted MRI at 3 tesla. Magn. Reson. Med. 1994;32(3):335–341. doi: 10.1002/mrm.1910320309. [DOI] [PubMed] [Google Scholar]

[bib0054] Papp D., Callaghan M.F., Meyer H., Buckley C., Weiskopf N. Correction of inter-scan motion artifacts in quantitative R1 mapping by accounting for receive coil sensitivity effects. Magn. Reson. Med. 2016;76(5):1478–1485. doi: 10.1002/mrm.26058. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0055] Polders D.L., Leemans A., Luijten P.R., Hoogduin H. Uncertainty estimations for quantitative in vivo MRI T1 mapping. J. Magn. Reson. 2012;224:53–60. doi: 10.1016/j.jmr.2012.08.017. [DOI] [PubMed] [Google Scholar]

[bib0056] Preibisch C., Deichmann R. Influence of RF spoiling on the stability and accuracy of T1 mapping based on spoiled FLASH with varying flip angles. Magn. Reson. Med. 2009;61(1):125–135. doi: 10.1002/mrm.21776. [DOI] [PubMed] [Google Scholar]

[bib0057] Press W.H., Teukolsky S.A., Vetterling W.T., Flannery B.P. 3 edition. Cambridge University Press; Cambridge, UK ; New York: 2007. Numerical Recipes 3rd Edition: The Art of Scientific Computing. [Google Scholar]

[bib0058] Rudin L.I., Osher S., Fatemi E. Nonlinear total variation based noise removal algorithms. Phys. D. 1992;60(1):259–268. [Google Scholar]

[bib0059] Sacolick L.I., Wiesinger F., Hancu I., Vogel M.W. B1 mapping by Bloch-Siegert shift. Magn. Reson. Med. 2010;63(5):1315–1322. doi: 10.1002/mrm.22357. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0060] Sapiro G., Ringach D.L. Anisotropic diffusion of multivalued images with applications to color filtering. IEEE Trans. Image Process. 1996;5(11):1582–1586. doi: 10.1109/83.541429. [DOI] [PubMed] [Google Scholar]

[bib0061] Scholand N., Wang X., Rosenzweig S., Holme H.C.M., Uecker M. Generic quantitative MRI using model-based reconstruction with the Bloch equations. Proc. Intl. Soc. Mag. Reson. Med. 2020;Vol. 28 [Google Scholar]

[bib0062] Sereno M.I., Lutti A., Weiskopf N., Dick F. Mapping the human cortical surface by combining quantitative T1 with retinotopy. Cereb. Cortex. 2013;23(9):2261–2268. doi: 10.1093/cercor/bhs213. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0063] Sigalovsky I.S., Fischl B., Melcher J.R. Mapping an intrinsic MR property of gray matter in auditory cortex of living humans: a possible marker for primary cortex and hemispheric differences. Neuroimage. 2006;32(4):1524–1537. doi: 10.1016/j.neuroimage.2006.05.023. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0064] Sijbers J., Dekker A.J.d., Raman E., Dyck D.V. Parameter estimation from magnitude MR images. Int. J. Imaging Syst. Technol. 1999;10(2):109–114. [Google Scholar]

[bib0065] Stikov N., Boudreau M., Levesque I.R., Tardif C.L., Barral J.K., Pike G.B. On the accuracy of T1 mapping: searching for common ground. Magn. Reson. Med. 2015;73(2):514–522. doi: 10.1002/mrm.25135. [DOI] [PubMed] [Google Scholar]

[bib0066] Sumpf T.J., Petrovic A., Uecker M., Knoll F., Frahm J. Fast T2 mapping with improved accuracy using undersampled spin-echo MRI and model-based reconstructions with a generating function. IEEE Trans. Med. Imaging. 2014;33(12):2213–2222. doi: 10.1109/TMI.2014.2333370. [DOI] [PMC free article] [PubMed] [Google Scholar]; Conference Name: IEEE Transactions on Medical Imaging

[bib0067] Sumpf T.J., Uecker M., Boretius S., Frahm J. Model-based nonlinear inverse reconstruction for T2 mapping using highly undersampled spin-echo MRI. J. Magn. Reson. Imaging. 2011;34(2):420–428. doi: 10.1002/jmri.22634. [DOI] [PubMed] [Google Scholar]

[bib0068] Tabelow K., Balteau E., Ashburner J., Callaghan M.F., Draganski B., Helms G., Kherif F., Leutritz T., Lutti A., Phillips C., Reimer E., Ruthotto L., Seif M., Weiskopf N., Ziegler G., Mohammadi S. hMRI – a toolbox for quantitative MRI in neuroscience and clinical research. Neuroimage. 2019;194:191–210. doi: 10.1016/j.neuroimage.2019.01.029. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0069] Tisdall M.D. Bias and SNR of T1 estimates derived from joint fitting of actual flip-angle and FLASH imaging data with variable flip angles. Proc. Intl. Soc. Mag. Reson. Med. 2017;25:1445. [PMC free article] [PubMed] [Google Scholar]

[bib0070] Tisdall M.D., van der Kouwe A.J.W. Efficient algorithm for maximum likelihood estimate and confidence intervals of T1 from multi-flip, multi-echo FLASH. Proc. Intl. Soc. Mag. Reson. Med. 2016;Vol. 24 [Google Scholar]

[bib0071] Tofts P.S. 1. John Wiley & Sons, Ltd; 2003. Quantitative MRI of the Brain. [Google Scholar]

[bib0072] Tofts P.S., Steens S.C.A., van Buchem M.A. Quantitative MRI of the Brain. John Wiley & Sons, Ltd; 2003. MT: magnetization transfer; pp. 257–298. [Google Scholar]

[bib0073] Tofts P.S., Steens S.C.A., Cercignani M., Admiraal-Behloul F., Hofman P.A.M., van Osch M.J.P., Teeuwisse W.M., Tozer D.J., van Waesberghe J.H.T.M., Yeung R., Barker G.J., van Buchem M.A. Sources of variation in multi-centre brain MTR histogram studies: body-coil transmission eliminates inter-centre differences. Magn. Reson. Mater. Phys. 2006;19(4):209–222. doi: 10.1007/s10334-006-0049-8. [DOI] [PubMed] [Google Scholar]

[bib0074] Varadarajan D., Frost R., van der Kouwe A., Morgan L., Diamond B., Boyd E., Fogarty M., Stevens A., Fischl B., Polimeni J.R. Edge-preserving B0 inhomogeneity distortion correction for high-resolution multi-echo ex vivo MRI at 7T. Proc. Intl. Soc. Mag. Reson. Med. 2020;Vol. 28 [Google Scholar]

[bib0075] Varadarajan D., Haldar J.P. A majorize-minimize framework for Rician and non-central chi MR images. IEEE Trans. Med. Imaging. 2015;34(10):2191–2202. doi: 10.1109/TMI.2015.2427157. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0076] Wang H.Z., Riederer S.J., Lee J.N. Optimizing the precision in T1 relaxation estimation using limited flip angles. Magn. Reson. Med. 1987;5(5):399–416. doi: 10.1002/mrm.1910050502. [DOI] [PubMed] [Google Scholar]

[bib0077] Weiskopf N., Callaghan M.F., Josephs O., Lutti A., Mohammadi S. Estimating the apparent transverse relaxation time (R2*) from images with different contrasts (ESTATICS) reduces motion artifacts. Front. Neurosci. 2014;8 doi: 10.3389/fnins.2014.00278. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0078] Weiskopf N., Suckling J., Williams G., Correia M.M., Inkster B., Tait R., Ooi C., Bullmore E.T., Lutti A. Quantitative multi-parameter mapping of R1, PD*, MT, and R2* at 3T: a multi-center validation. Front. Neurosci. 2013;7 doi: 10.3389/fnins.2013.00095. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0079] Whittall K.P., MacKay A.L. Quantitative interpretation of NMR relaxation data. J. Magn. Reson. (1969) 1989;84(1):134–152. [Google Scholar]

[bib0080] Wiggermann V., Vavasour I.M., Kolind S.H., MacKay A.L., Helms G., Rauscher A. Non-negative least squares computation for in vivo myelin mapping using simulated multi-echo spin-echo T2 decay data. NMR Biomed. 2020;33(12):e4277. doi: 10.1002/nbm.4277. [DOI] [PubMed] [Google Scholar]

[bib0081] Yarnykh V.L. Actual flip-angle imaging in the pulsed steady state: a method for rapid three-dimensional mapping of the transmitted radiofrequency field. Magn. Reson. Med. 2007;57(1):192–200. doi: 10.1002/mrm.21120. [DOI] [PubMed] [Google Scholar]

[bib0082] Zhao B., Lam F., Liang Z. Model-based MR parameter mapping with sparsity constraints: parameter estimation and performance bounds. IEEE Trans. Med. Imaging. 2014;33(9):1832–1844. doi: 10.1109/TMI.2014.2322815. [DOI] [PMC free article] [PubMed] [Google Scholar]; Conference Name: IEEE Transactions on Medical Imaging

PERMALINK

Model-based multi-parameter mapping

Yaël Balbastre

Mikael Brudfors

Michela Azzarito

Christian Lambert

Martina F Callaghan

John Ashburner

Highlights

Graphical abstract

Abstract

1. Introduction

Multi-parameter mapping

Non-linear fitting in MR relaxometry

Denoising

Non-linear MPM fitting

2. Theory

2.1. Forward model: spoiled gradient echo signal equation

2.2. Noise model: Gaussian

2.3. Optimisation: non-linear least squares

A sufficient condition for monotonic convergence

Toy problem 1: exponential least square

Fig. 1.

Toy problem 2: nested exponential least square

Generalisation to the multivariate case

Fig. 2.

2.4. Likelihood: spoiled gradient echo

Fig. 3.

Spatial projection

2.5. Prior: joint total variation

2.6. Optimisation: regularised spoiled gradient echo

2.7. Implementation

3. Experiments

3.1. Convergence analysis

3.2. Datasets

3.3. Algorithms

3.4. Cross-validation

3.5. Echo decimation

3.6. Uncertainty mapping

4. Results

4.1. Convergence

Fig. 4.

Fig. 5.

4.2. Hyper-parameter selection

4.3. Methods cross-validation

Fig. 6.

4.4. Echo decimation

Fig. 7.

4.5. Uncertainty mapping

Fig. 8.

5. Discussion & conclusion

Declaration of Competing Interest

Acknowledgments

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases