MoDL-MUSSELS: Model-Based Deep Learning for Multishot Sensitivity-Encoded Diffusion MRI

Hemant K Aggarwal; Merry P Mani; Mathews Jacob

doi:10.1109/TMI.2019.2946501

. Author manuscript; available in PMC: 2021 Apr 1.

Published in final edited form as: IEEE Trans Med Imaging. 2019 Oct 9;39(4):1268–1277. doi: 10.1109/TMI.2019.2946501

MoDL-MUSSELS: Model-Based Deep Learning for Multishot Sensitivity-Encoded Diffusion MRI

Hemant K Aggarwal ¹, Merry P Mani ², Mathews Jacob ³

PMCID: PMC7894613 NIHMSID: NIHMS1582099 PMID: 31603819

Abstract

We introduce a model-based deep learning architecture termed MoDL-MUSSELS for the correction of phase errors in multishot diffusion-weighted echo-planar MR images. The proposed algorithm is a generalization of the existing MUSSELS algorithm with similar performance but significantly reduced computational complexity. In this work, we show that an iterative re-weighted least-squares implementation of MUSSELS alternates between a multichannel filter bank and the enforcement of data consistency. The multichannel filter bank projects the data to the signal subspace, thus exploiting the annihilation relations between shots. Due to the high computational complexity of the self-learned filter bank, we propose replacing it with a convolutional neural network (CNN) whose parameters are learned from exemplary data. The proposed CNN is a hybrid model involving a multichannel CNN in the k-space and another CNN in the image space. The k-space CNN exploits the annihilation relations between the shot images, while the image domain network is used to project the data to an image manifold. The experiments show that the proposed scheme can yield reconstructions that are comparable to state-of-the-art methods while offering several orders of magnitude reduction in run-time.

Keywords: Diffusion MRI, Echo Planar Imaging, Deep Learning, convolutional neural network

I. INTRODUCTION

Diffusion MRI (DMRI), which is sensitive to anisotropic diffusion processes in the brain tissue, has the potential to provide rich information on white matter anatomy [1]. It has several applications, including the studies of neurological disorders [2], the aging process [3], and acute stroke [4]. Diffusion MRI relies on large bipolar directional gradients to encode water diffusion, which attenuates the signals from diffusing molecules in the direction of the gradient. The diffusion-sensitized signal is often spatially encoded using single-shot echo-planar imaging (ssEPI), which allows the acquisition of the entire k-space in a single excitation and readout. While such acquisitions can offer high sampling efficiency, the longer readout makes the acquisition vulnerable to distortions induced by B0 inhomogeneity. Specifically, the recovered images often exhibit geometric distortions [5]. These artefacts, resulting from the long readouts, essentially limit the extent of k-space coverage and thereby the spatial resolution that ssEPI sequences can achieve.

Multishot echo-planar imaging (msEPI) methods were introduced to minimize the distortions related to the long readouts in ssEPI. This scheme splits the k-space sampling over multiple excitations and readouts, resulting in shorter readout lengths for each shot, as shown in Fig. 1. While multishot imaging can offer high resolution, a challenge associated with this scheme is its vulnerability to inter-shot motion in the diffusion setting. Specifically, subtle physiological motion during the large bipolar gradients manifests as phase differences between different shots. The direct combination of the k-space data from these shots results in artefacts in the diffusion-weighted images (DWI) arising from phase inconsistencies.

Fig. 1. — Demonstration of multishot EPI acquisition employing multiple excitations and readouts. The first radio-frequency (RF) excitation and diffusion sensitization are followed by a k-space readout by shot 1 that samples k-space lines 1, 3, and 5. The second RF excitation and diffusion sensitization are followed by a k-space readout by shot 2 capturing lines 2, 4, and 6. The combined data corresponds to the fully sampled k-space.

We recently introduced a multishot sensitivity-encoded diffusion data recovery algorithm using structured low-rank matrix completion (MUSSELS) [6], which allows the reconstruction of DWI that are immune to the motion-induced phase artefacts. The method exploits the redundancy between the Fourier samples of the shots to jointly recover the missing k-space samples in each of the shots [7]. The k-space data recovery is then posed as a matrix completion problem that utilizes a structured low-rank algorithm and parallel imaging to recover the missing k-space data in each shot. While this scheme can offer state-of-the-art results, the challenge is the high computational complexity. The large data size and the need for matrix lifting make it challenging to reconstruct the high-resolution data from different directions and slices despite the existence of fast structured low-rank algorithms [7], [8].

In this paper, we introduce a novel deep learning framework to minimize the computational complexity of MUSSELS [6]. This work is inspired by the convolutional network structure of MUSSELS and is formulated in k-space to exploit the convolutional relations between the Fourier samples of the shots. The proposed scheme is also motivated by our recent work on model-based deep learning (MoDL) [9] and similar algorithms that rely on the unrolling of iterative algorithms [10]–[12]. The main benefit of MoDL is the ability to exploit the physics of the acquisition scheme and the ability to incorporate multiple regularization priors [13], in a deep learning setting, to achieve improved performance. The use of the conjugate-gradient algorithm within the network to enforce data consistency in MoDL provides improved performance for a specified number of iterations. The sharing of network parameters across iterations enables MoDL to keep the number of learned parameters decoupled from the number of iterations, thus providing good convergence without increasing the number of trainable parameters. A smaller number of trainable parameters translates to significantly reduced training data demands, which is particularly attractive for data-scarce medical-imaging applications.

We first introduce an approach based on the iterative reweighted least-squares algorithm (IRLS) [14] to solve the MUSSELS cost function [6]. The MUSSELS algorithm [6], which is based on iterative singular value shrinkage, alternates between a data-consistency block and a low-rank matrix recovery block. By contrast, the IRLS-MUSSELS algorithm [7] alternates between a data-consistency block and a residual multichannel¹ convolution block. The multichannel convolution block can be viewed as the projection of the data to the nullspace of the multichannel signals; the subtraction of the result from the original ones, induced by the residual structure, projects the data to the signal subspace, thus removing the artefacts in the signal. The IRLS-MUSSELS algorithm learns the parameters of the denoising filter from the data itself, which requires several iterations. Motivated by our earlier work [9], we propose replacing the multichannel linear convolution block in IRLS-MUSSELS with a convolutional neural network (CNN). Unlike the self-learning strategy in IRLS-MUSSELS, where the filter parameters are learned from the measured data itself, we propose learning the parameters of the non-linear CNN from exemplar data. We hypothesize that the non-linear structure of the CNN will enable us to learn and generalize from examples. The learned CNN will facilitate the projection of each test dataset to the associated signal subspace. While the architecture is conceptually similar to MoDL, the main difference is the extension to multishot settings and the learning in the Fourier domain (k-space) enabled by the IRLS-MUSSELS reformulation.

The proposed framework has similarities to recent k-space deep learning strategies [16]–[19], which also exploit the convolution relations in the Fourier domain. The main distinction of the proposed scheme with these methods is the model-based framework, along with the training of the unrolled network. Many of the current schemes [18] are not designed for the parallel imaging setting. The use of the conjugate gradient steps in our network allows us to account for parallel imaging efficiently, requiring few iterations. We also note the relation of the proposed work with that of Akcakaya et al. [20], which uses a self-learned network to recover parallel MRI data. The weights of the network are estimated from the measured data itself. Since we estimate the weights from exemplar data, the proposed scheme is significantly faster.

II. BACKGROUND

A. Problem formulation

The high-resolution DMRI requires long-duration EPI readouts that are vulnerable to field-inhomogeneity-induced spatial distortions. Also, the large rewinder gradients make the achievable echo-time rather long, resulting in lower signal-to-noise ratio (SNR). To minimize these distortions, it is common practice to acquire the data using msEPI schemes for high-resolution applications. These schemes acquire a highly undersampled subset of the k-space at each shot. Since the subsets are complementary, the data from all these shots can be combined to obtain the final image. The image acquisition of the i^th shot and the j^th coil can be expressed as

y_{i, j} [k] = \int_{ℝ^{2}} ρ (r) s_{j} (r) \exp ({i k}^{T} r) d r + n_{i, j} [k]; \forall k \in Θ_{i} .

(1)

Here, s_j(r) denotes the coil sensitivity of the j^th coil and Θ_i, i = 1,.., N, denotes the subset of the k-space that is acquired at the i^th-shot. Note that the sampling indices of the different shots are complementary, implying that the combination of the data from the different shots will result in a fully sampled image. Specifically, we have $\cup_{i = 1}^{N} Θ_{i} = Θ$ , where Θ is the Fourier grid corresponding to the fully sampled image. The above relation, to acquire the desired image ρ(r) from N shots, can be compactly represented as

y_{i} = A_{i} (ρ (r)) + n, i = 1, \dots, N

(2)

in the absence of phase errors. Here, y_i represents the under-sampled multichannel measurements of the i^th shot acquired using the acquisition operator A_i, and n represents the additive Gaussian noise that may corrupt the samples during acquisition.

Diffusion MRI uses large bipolar diffusion gradients to encode the diffusion motion of water molecules. Unfortunately, subtle physiological motion between the bipolar gradients often manifests as phase errors in the acquisition. With the addition of the unknown phase function ϕ_i(r), |ϕ_i(r)| = 1 introduced by physiological motion, the forward model is modified as

y_{i} = A_{i} \underset{ρ_{i} (r)}{\underset{︸}{(ρ (r) ϕ_{i} (r))}} + n, i = 1, .., N .

(3)

If the phase errors ϕ_i(r), i = 1, .., N, are uncompensated, the image obtained by the combination of y_i, i = 1, .., N will show artefacts arising from the inconsistent phase. The widely used multishot method, termed MUSE [21], [22], relies on the independent estimation of ϕ_i(r) from low-resolution reconstructions of the phase-corrupted images ρ_i(r). The forward model can be compactly written as y = A(ρ), where $ρ = {[\begin{matrix} ρ_{1}^{T}, \dots ρ_{N}^{T} \end{matrix}]}^{T}$ is the vector of multishot images. Once the phases are estimated, the reconstruction is posed as a phase-aware reconstruction [21], [22].

B. Brief Review of MUSSELS

The MUSSELS algorithm [6] relies on a structured low-rank formulation to jointly recover the phase-corrupted images ρ_i from their under-sampled multi-coil measurements. The MUSSELS algorithm capitalizes on the multi-coil nature of the measurements as well as annihilation relations between the phase-corrupted images. The key observation is that these phase-corrupted images satisfy an image domain annihilation relation [23]

ρ_{i} (r) ϕ_{j} (r) - ρ_{j} (r) ϕ_{i} (r) = 0, \forall r .

(4)

This multiplicative annihilation relation, resulting from phase inconsistencies, translates to convolution relations in the Fourier domain:

\hat{ρ_{i}} (k) * \hat{ϕ_{j}} (k) - \hat{ρ_{j}} (k) * {\hat{ϕ}}_{i} (k) = 0 \forall k,

(5)

where $\hat{x}$ denotes the Fourier transform of x. Since the phase images ϕ_j(r) are smooth, their Fourier coefficients $\hat{ϕ_{j}} (k)$ can be assumed to be support-limited to a region Λ in the Fourier domain. This allows us to rewrite the convolution relations in (5) in a matrix form using block-Hankel convolution matrices $H_{Λ}^{Γ} (ρ)$ . The matrix product $H_{Λ}^{Γ} (ρ)$ s corresponds to the 2D convolution between a signal ρ supported on a grid Γ and the filter s of size Λ. Thus, the Fourier domain convolution relations can be compactly expressed using matrices [6] as

[H_{Λ}^{Γ} ({\hat{ρ}}_{i}) ∣ H_{Λ}^{Γ} ({\hat{ρ}}_{j})] \underset{s}{\underset{︸}{[\frac{{\hat{ϕ}}_{j}}{- {\hat{ϕ}}_{i}}]}} = 0.

(6)

We note that there exists a similar annihilation relation between each pair of shots, which implies that the structured matrix

T (\hat{ρ}) = [H_{Λ}^{Γ} ({\hat{ρ}}_{1}) | \dots | H_{Λ}^{Γ} ({\hat{ρ}}_{N})]

(7)

is low-rank. The MUSSELS algorithm [6] recovers the multishot images from their undersampled k-space measurements by solving

\tilde{ρ} = \underset{ρ}{\arg \min} ∥ A (ρ) - y ∥_{2}^{2} + λ ∥ T (\hat{ρ}) ∥_{*},

(8)

where ∥ · ∥_∗ denotes the nuclear norm. The above problem is solved in earlier work [6] using an iterative shrinkage algorithm.

III. DEEP LEARNED MUSSELS

A. IRLS reformulation of MUSSELS

To bring the MUSSELS framework to the MoDL setting, we first introduce an IRLS reformulation [14] of the MUSSELS. Using an auxiliary variable z, we rewrite (8) as

\underset{ρ, z}{\arg \min} ∥ A (ρ) - y ∥_{2}^{2} + β ∥ \hat{ρ} - z ∥_{F}^{2} + λ ∥ T (z) ∥_{*} .

(9)

We observe that (9) is equivalent to (8) as β → ∞. An alternating minimization algorithm to solve the above problem yields the following steps:

ρ_{n + 1} = \underset{ρ}{\arg \min} ∥ A (ρ) - y ∥_{2}^{2} + β {‖ \hat{ρ} - z_{n} ‖}_{F}^{2}

(10)

z_{n + 1} = \underset{z}{\arg \min ∥} {\hat{ρ}}_{n + 1} - {z ∥}_{F}^{2} + \frac{λ}{β} ∥ T (z) ∥_{*} .

(11)

We now borrow from the literature [24], [25] and majorize the nuclear norm term in (11) as

∥ T (z) ∥_{*} \leq ∥ T (z) Q ∥_{F}^{2},

(12)

where the weight matrix is specified by

Q = {[T^{H} (z) T (z) + ϵ I]}^{- 1 / 4}

(13)

Here, I is the identity matrix. Similar majorization strategies were used in the work [8]. With the majorization in (12), the z-subproblem in (11) would involve the alternation between

z_{n + 1} = \underset{z}{\arg \min} ∥ {\hat{ρ}}_{n + 1} - {z ∥}_{F}^{2} + \frac{λ}{β} ∥ T (z) Q ∥_{F}^{2}

(14)

and the update of the Q using (13). Thus the IRLS reformulation of the MUSSELS scheme would alternate between (10), (14), and (13) as summarized in Algorithm 1. The matrix Q may be viewed as a surrogate for the nullspace of T(z) as shown in the work [8]. The Q matrix at each iteration is estimated based on the previous iterate of z. The update step (14) can be interpreted as finding an approximation of ${\hat{ρ}}_{n + 1}$ from the signal subspace.

Algorithm 1.

Summary of the IRLS-MUSSELS algorithm

Require: ρ₀, z₀

Ensure: ρ_n+1

1: for n = 1 to max_Iterations do

2: ρ_n+1 = solve (10) using conjugate gradient

3: z_n+1 = solve (14) using conjugate gradient.

4: Q_n+1 = [T^H (z_n+1)T(z_n+1) + ϵI]^−1/4

5: end for

6: return ρ_n+1

Open in a new tab

B. Interpretation of IRLS- MUSSELS as an iterative denoiser

We now focus on the term ∥T(z)Q∥² = ∑_i ∥T(z)q_i∥² in (14); q_i are the columns of Q representing nullspace vectors of T(z) similar vector s in Eq. (6). We note that the matrix-vector product T(z)q_i corresponds to the multichannel convolution of z with the columns of Q, specified by q_i. We split each column q_i into sub-filters q_ij to obtain

Q = [\begin{matrix} \underset{q_{1}}{\underset{︸}{[\begin{matrix} \underline{q_{11}} \\ \frac{⋮}{q_{1 N}} \end{matrix}]}} & \underset{q_{2}}{\underset{︸}{[\begin{matrix} \underline{q_{21}} \\ \frac{⋮}{q_{2 N}} \end{matrix}]}} & \underset{q_{N}}{\underset{︸}{[\begin{matrix} \underline{q_{11}} \\ \frac{⋮}{q_{1 N}} \end{matrix}]}} \end{matrix}]

(15)

where each q_ij is of length |Λ|. Note that z = [z₁| _… |z_N] is the multishot data. This allows us to rewrite the multichannel convolution

T (z) q_{i} = H_{Λ}^{Γ} (z_{1}) q_{i 1} + .. H_{Λ}^{Γ} (z_{N}) q_{i N} ....

(16)

as the sum of convolutions of z_j with q_i,j. Due to the commutativity of convolution h ∗ g = g ∗ h, each term in (16) can be re-expressed as

H_{Λ}^{Γ} (g) h = S (h) g,

(17)

where S(h) is an appropriately² sized block Hankel matrix constructed from the zero-filled entries of h. We use this relation to rewrite

T (z) Q = \underset{G (Q)}{\underset{︸}{[\begin{matrix} S (q_{11}) & S (q_{12}) & \dots & S (q_{1 N}) \\ ⋮ & \dots & ⋮ \\ S (q_{N 1}) & S (q_{12}) & \dots & S (q_{N N}) \end{matrix}]}} \underset{z}{\underset{︸}{[\begin{matrix} z_{1} \\ ⋮ \\ z_{N} \end{matrix}]}} .

We note that G(Q)z corresponds to the multichannel convolution of z₁, ..., z_N with the filterbank having filters q_i,j. With this reformulation, (14) is simplified as

z_{n + 1} = \underset{z}{\arg \min} ∥ {\hat{ρ}}_{n} - {z ∥}_{F}^{2} + \frac{λ}{β} ∥ G (Q) z ∥_{F}^{2} .

(18)

Differentiating the above expression and setting it equal to zero, we get

z_{n + 1} = {(I + \frac{λ}{β} G {(Q)}^{H} G (Q))}^{- 1} ρ_{n + 1} .

One may use a numerical solver to determine z_n+1. An alternative is to solve this step approximately using the matrix inversion lemma, assuming λ << β:

\begin{array}{l} z_{n + 1} \approx [I - \frac{λ}{β} G {(Q)}^{H} G (Q)] {\hat{ρ}}_{n + 1} \\ = {\hat{ρ}}_{n + 1} - \frac{λ}{β} G {(Q)}^{H} G (Q) {\hat{ρ}}_{n + 1} . \end{array}

(19)

We note that G(Q) can be viewed as a single layer convolutional filter bank, while multiplication by G(Q)^H can be viewed as flipped convolutions (deconvolutions in deep learning context) with matching boundary conditions. Note that neither of the above layers have any non-linearities. Thus, (19) can be thought of as a residual block, which involves the convolution of the multishot signals ${\hat{ρ}}_{n}$ with the columns of Q, followed by deconvolution as shown in Fig. 2(a). As discussed before, the filters specified by the columns of Q are surrogates for the nullspace of $T (\hat{ρ})$ . Thus, the update (19) can be thought of as removing the components of ${\hat{ρ}}_{n}$ in the nullspace and projecting the data to the signal subspace, which may be viewed as a sophisticated denoiser, as shown in Fig. 2(a).

The IRLS-MUSSELS scheme [7], as summarized in Fig. 2, provides state-of-the-art results. However, note that the filters specified by the columns of Q are estimated for each diffusion direction by using Algorithm 1, which has high computational complexity, especially in the context of diffusion-weighted imaging, where several directions need to be estimated for each slice.

C. MoDL-MUSSELS Formulation

To minimize the computational complexity of the IRLS-MUSSELS, we propose learning a non-linear denoiser from exemplar data rather than learning a custom denoising block specified by $[I - \frac{λ}{β} G {(Q)}^{H} G (Q)]$ for each direction and slice. We hypothesize that the non-linearities in the network, as well as the larger number of filter layers, can facilitate the learning of a generalizable model from the exemplar data. This framework may be viewed as a multishot extension of the MoDL [9] approach. The cost function associated with the network is

\underset{ρ}{\arg \min} ∥ A (ρ) - y ∥_{2}^{2} + λ_{1} ∥ N_{k} {(ρ) ∥}_{2}^{2} .

(20)

Here, N_k(ρ) is a non-linear residual convolutional filterbank working in the Fourier domain, with

N_{k} (ρ) = ρ - D_{k} (ρ) .

(21)

D_k(ρ) can be thought of as a multichannel CNN in the Fourier domain. The image domain input ρ is first transformed to k-space as $\hat{ρ}$ , then passes through the k-space model, and, then transformed back to the image domain. Figure 3(a) shows the proposed M-layer CNN architecture. The overall k-space MoDL-MUSSELS network architecture is shown in Fig. 3(b), which solves Eq. (20). Unlike IRLS-MUSSELS in Fig. 2, the parameters of this network are not updated within the iterations and are learned from the exemplar data.

Fig. 3. — The block diagram of the proposed k-space network architecture to solve Eq. (20). (a) The N_w block represents the deep learned noise predictor, and D_w is a residual learning block. (b) Here, the denoiser D_k is the M-layer network D_w that performs the k-space denoising.

D. Hybrid MoDL-MUSSELS Regularization

A key benefit of the MoDL framework over direct inversion methods is the ability to exploit different kinds of priors, as shown in our prior work [13]. The IRLS-MUSSELS and the MoDL-MUSSELS schemes exploit the multichannel convolution relations between the k-space data. By contrast, we relied on an image domain convolutional neural network in earlier work [9] to exploit the structure of patches in the image domain. Note that this structure is completely complementary to the multichannel convolution relations. We now propose to jointly exploit both the priors as follows:

\underset{ρ}{\arg \min} ∥ A (ρ) - y ∥_{2}^{2} + λ_{1} ∥ N_{k} {(ρ) ∥}_{2}^{2} + λ_{2} ∥ N_{I} {(ρ) ∥}_{2}^{2},

(22)

where N_k is the same prior as in (20), while N_I is an image space residual network of the form N_I(ρ) = ρ − D_I(ρ). Here, D_I is an image domain CNN as in earlier work [9]. The problem (22) can be rewritten as

\underset{ρ}{\arg \min} ∥ A (ρ) - y ∥_{2}^{2} + λ_{1} ∥ ρ - D_{k} {(ρ) ∥}_{2}^{2} + λ_{2} ∥ ρ - D_{I} {(ρ) ∥}_{2}^{2} .

By substituting η = D_k(ρ), and ζ = D_I(ρ), an alternating minimization-based solution to the above problem iterates between the following steps:

ρ_{n + 1} = {(A^{H} A + λ_{1} I + λ_{2} I)}^{- 1} (A^{H} y + λ_{1} η + λ_{2} ζ)

(23)

η_{n + 1} = D_{k} (ρ_{n + 1})

(24)

ζ_{n + 1} = D_{I} (ρ_{n + 1}) .

(25)

The above solution results in the hybrid MoDL-MUSSELS architecture shown in Fig. 4. Note that this alternating minimization scheme is similar to the plug-and-play priors [26] widely used in inverse problems. The main exception is that we train the resulting network in an end-to-end fashion. Note that, unlike the plug-and-plug denoisers that learn the image manifold, the network D_k is designed to exploit the redundancies between the multiple shots resulting from the annihilation relations. This non-linear network is expected to project the multichannel k-space data orthogonal to the nullspaces of the multichannel Hankel matrices. The regularization parameters λ₁ and λ₂ control the contribution of the k-space network and the image-domain network, respectively. During experiments we kept the values of λ₁ = 0,01, λ₂ = 0.05 fixed. However, it can be noted that these values can be made trainable as in the MoDL [9].

IV. EXPERIMENTS

We perform several experiments to validate different aspects of the proposed model, such as, the benefits of the recursive network, the impact of regularization, robustness to outliers, comparison with existing deep learning models such as U-NET [27], and comparison with a model-based technique P-MUSE [22].

A. Dataset Description

In vivo data were collected from healthy volunteers at the University of Iowa in accordance with the Institutional Review Board recommendations. The imaging was performed on a GE MR750W 3T scanner using a 32-channel head coil. A Stejskal-Tanner spin-echo diffusion imaging sequence was used with a 4-shot EPI readout. A total of 60 diffusion gradient direction measurements were taken with a b-value of 700 s/mm². The relevant imaging parameters were FOV = 210 × 210 mm, matrix size = 256 × 152 with partial Fourier oversampling of 24 lines, slice thickness = 4 mm and TE = 84 ms. Data were collected from 7 subjects.

The training dataset constituted a total of 68 slices, each having 60 directions and 4 shots, from 5 subjects. The validation was performed on 6 slices of the 6th subject, whereas testing was carried out on 5 slices of the 7th subject. Thus, a total of 4080, 360, and 300 complex images each having size 256×256×4 (rows×columns×shots) were used for training, validation, and testing, respectively.

To perform quantitative comparisons, we also made use of simulated data with high SNR. For this purpose, we utilized a subset of pre-processed, relatively high-SNR diffusion dataset from the human connectome project [28]. We extracted 15 volumes and 20 slices from 100 subjects, which resulted in 30,000 magnitude images of size 145 × 174. We prepared a dataset of 23,000 training images, 3,000 validation images, and 3,000 test images. We simulated the sensitivity maps using Walsh algorithm [29]. To simulate the multishot data with phase errors, we multiplied each magnitude image with synthetically generated random bandlimited phase errors using the image formation model in Eq. (3). Gaussian noise of varying amounts of standard deviation σ was added to the phase-corrupted k-space data. The k-space data was under-sampled to generate the multishot data.

B. Multi-coil forward model

All of the model-based schemes used in this study (MUSE, MUSSELS, MoDL-MUSSELS) rely on a forward model that mimics the image formation. We implement this forward model as described in (1) and (3). The raw dataset consists of 32 channels. We reduce the data to four virtual channels using singular value decomposition (SVD) of the non-diffusion weighted (b0) image. The coil sensitivity maps of these four virtual channels were estimated using ESPIRIT [30]. The same channel combination weights were used to reduce the diffusion-weighted MRI data to four coils.

C. Quantitative metrics used in experiments

The reconstruction quality is measured using the structure similarity index (SSIM) [31] and peak signal-to-noise ratio (PSNR). The PSNR is defined as

PSNR (x, y) = 10 * \log_{10} (\frac{max {(x)}^{2}}{MSE (x, y)})

where MSE is the mean-square-error between x and y. The final PSNR/SSIM value is estimated by the average of the PSNR/SSIM of individual shots.

D. Algorithms used for comparison

We compare the proposed scheme against IRLS-MUSSELS [7],P-MUSE [22], and a solution based on U-NET [27]. The IRLS-MUSSELS is a modification of the MUSSELS algorithm [6]. Specifically, the modification involve an IRLS based implementation instead of iterative shirnkage algorithm in [6], which results in a faster implementation. Moreover, it also includes an additional conjugate symmetry constraint in addition to the annihilation relations between the shots that is exploited in the MUSSELS method [6]. We refer the readers to [7], which shows that the addition of the conjugate symmetry constraint reduces blurring and results in sharper images compared to the original MUSSELS method [6] for partial Fourier acquisitions. In the results section, the IRLS-MUSSELS is referred to as simply IRLS-M.

P-MUSE [22] is a two-step algorithm that first estimates the motion-induced phase using the SENSE [15] reconstruction and the total-variation denoising. With the knowledge of the phase errors, it recovers the images using a regularized optimization with (3) as the forward model. The P-MUSE algorithm [22] has three parameters λ₁ = 0.01, λ₂ = 0.01, and the number of iterations = 40. The parameters λ₁ and λ₂ control the total variation regularization during phase estimation and reconstruction, respectively. We searched over the parameters to yield the best possible reconstruction.

We extended the U-NET [27] model for the multishot diffusion MR image reconstruction. The number of convolution layers, the feature maps in each layer, and the filter size were kept the same as in [27]. The input to the extended U-NET model was the concatenation of the real and imaginary parts of phase-corrupted coil-combined complex 4 shots images. The IRLS-MUSSELS [7] reconstructions were used as the ground truth for the training of the deep learning models on experimental data. We trained the network in the image domain with 1,000 epochs for 13 hours using the Adam [32] optimizer.

We also performed a comparison between the k-space MoDL-MUSSELS formulation in section III-C and the hybrid MoDL-MUSSELS formulation in section III-D. We refer to the former as the k-space network and the latter as the hybrid network. To perform a fair comparison between hybrid and k-space networks, the number of parameters in the k-space network was kept the same as that of the hybrid model by increasing the number of feature maps in the convolution layers.

E. Network architecture and training

In this work, we trained an 8-layer CNN having convolution filters of size 3 × 3 in each layer. Each layer comprises a convolution, followed by ReLU, except the last layer, which consists of a 1 × 1 convolution as shown in Fig. 5. The real and imaginary components of the complex 4 shots data were considered as channels in the residual learning CNN architecure, whereas the data-consistency block worked explicitly with complex data.

Fig. 5. — The specific M=8 layer residual learning CNN architecture used as D_k and D_I blocks in the experiments. The 4 shot complex data are the input and output of the network. The first layer concatenates the real and imaginary parts as 8 input features. The numbers on top of each layer represent the number of feature maps learned at that layer. We learn 3 × 3 filters at each layer except the last, where we learn 1 × 1 filter.

The proposed network architecture, as shown in Fig. 4, was unfolded for three iterations, and the end-to-end training was performed for 100 epochs. The input to the unfolded network is the zero-filled complex data from the four shots, which corresponds to A^H y, while the network outputs the fully sampled complex data for the four shots. The proposed MoDL-MUSSELS architecture combines the data from the four shots using the sum-of-squares approach. The network weights were randomly initialized using Xavier initialization and shared between iterations. The network was implemented using the TensorFlow library in Python 3.6 and trained using the NVIDIA P100 GPU. The conjugate-gradient optimization in the DC step was implemented as a layer operation in the TensorFlow library as described in the work [9]. We utilized the mean-square-error as the loss function during training. The total network training time of the network was around 37 hours.

The plot in Fig. 6 shows training loss decays smoothly with epochs. It can be noted that the loss on the validation dataset also has overall decaying behavior, which implies that the trained model did not over-fit the dataset. The model-based framework has considerably fewer parameters than direct inverse methods and hence requires far fewer training data to achieve good performance, as seen from the experiments in the previous work [9].

V. RESULTS

A. Comparisons using simulated data

Table I summarizes the quantitative results (PSNR and SSIM values) obtained from the simulated data in Section IV-A. Specifically, we quantitatively compare the reconstructions provided by the five algorithms, while varying the noise levels. We did not perform the training of the deep learning methods (k-space, U-NET, and hybrid) for different noise levels but instead utilized the same model trained for a single noise level (σ = 0.001). We adjusted the parameters of the P-MUSE and IRLS-MUSSELS algorithms for different noise levels to get the best average results. The average performance of the U-NET is lower than all other methods since the U-NET does not have an explicit data-consistency term like the other methods. It is evident from the graphs in Fig. 7 that proposed hybrid method performs better than other methods on all individual slices and directions of a test subject.

TABLE I.

The PSNR (dB) and SSIM values obtained by five methods on the testing dataset with simulated phases and added Gaussian noise of varying standard deviation σ. The values are reported as mean ± standard deviation.

Peak signal to noise ratio (dB)

Noise (std)	σ = 0.001	σ = 0.002	σ = 0.003

U-NET	32.15 ± 2.12	29.98 ± 1.19	27.63 ± 0.82
P-MUSE	34.08 ± 2.31	31.68 ± 2.21	29.19 ± 1.84
IRLS-M	38.81 ± 1.98	36.21 ± 1.32	32.43 ± 1.33
K-space	40.02 ± 1.18	36.92 ± 0.96	34.69 ± 1.38
Hybrid	40.59 ± 1.87	37.37 ± 1.56	35.40 ± 1.36

Structural similarity index

U-NET	0.89 ± 0.01	0.82 ± 0.02	0.73 ± 0.03
P-MUSE	0.79 ± 0.03	0.69 ± 0.04	0.63 ± 0.05
IRLS-M	0.88 ± 0.01	0.83 ± 0.01	0.72 ± 0.03
K-space	0.94 ± 0.01	0.89 ± 0.02	0.84 ± 0.03
Hybrid	0.96 ± 0.00	0.94 ± 0.01	0.92 ± 0.01

Open in a new tab

Fig. 7. — This plot compares the variation of the SSIM on all the slices and directions of one test subject from simulation dataset. The vertical lines seperate the different slices, i.e., first fifteen images are the directions corresponding to the first slice, and so on. There is a total of 20 slices, each having 15 different directions, resulting in a total of 300 images.

Figure 8 shows an example set of images reconstructed using the five methods for this simulated data. For comparison, the uncorrected image and the error maps for all the reconstructions, compared to the ground truth image are also provided. It is evident from the error maps that the proposed hybrid model has the least error among the methods compared. Figure 8(e) shows that the k-space network is able to compensate for phase errors of multishot data. The addition of image-domain regularization in the hybrid model further improves the reconstructions in Fig. 8(f). We note that the image domain network exploits the manifold structure of patches, which serves as a strong prior that the k-space network has difficulty capturing.

Fig. 8. — Simulation results using the high-SNR data obtained from HCP. The reconstructed images and the corresponding error maps from five different algorithms are shown. Here, the ground truth is an image from the test dataset that was corrupted with phase errors of bandwidth 3×3 and noise standard deviation σ = 0.001 to simulate 4 shot acquisition. (g) shows the uncorrected image if we do not correct the phase errors during reconstruction. The numbers in the sub-captions represent the SSIM values.

B. Robustness to outliers

We further performed an experiment to determine the robustness of the proposed MoDL-MUSSELS approach against outliers. In particular, we simulated a lesion image by increasing the intensity at a few pixels as indicated by an arrow in Fig. 9(a). This image was passed through the existing trained model. It is observed from Fig. 9(b) that simulated lesion was preserved by the proposed method. Note that the training dataset did not have any lesion images and we did not simulate the lesion images during training. The robustness of the algorithm to such outliers can be attributed to the fact that the algorithm relies on k-space and q-space deep-learning networks with small receptive fields, unlike direct inversions methods that rely on large receptive fields. Hence, the proposed scheme learns only to exploit local redundancies in k-space and q-space and does not memorize whole images.

Fig. 9. — Lesion experiment. Arrow points to the location of Lesion. The proposed MoDL-MUSSELS method preserves the lesion.

C. Comparison of reconstruction time

Table II compares the time taken to reconstruct the entire testing dataset for various methods. It is noted that the computational complexity of the MoDL-MUSSELS is around 28-fold lower than the IRLS-MUSSELS. Note that IRLS-MUSSELS estimates the optimal linear filter bank from the measurements itself, which requires significantly many iterations. By contrast, MoDL-MUSSELS pre-learns non-linear network weights. The quite significant speed increase directly follows from the significantly fewer number of iterations. Note that we rely on a conjugate-gradient algorithm to enforce data consistency specified by (23). Also note that solving (23) exactly as opposed to the use of steepest gradient steps at each iteration would require more unrolling steps, thus diminishing the gain in speedup. The greatly reduced runtime is expected to facilitate the deployment of the proposed algorithm on clinical scanners.

TABLE II.

Time to reconstruct all five slices of the test subject. Each slice had 60 directions, 4 shots, and 32 coils. IRLS-MUSSELS and P-MUSE were runs on CPU with parallel processing.

Algorithm:	U-NET	P-MUSE	IRLS-M	MoDL-MUSSELS

Time (sec) :	7	632	1386	49

Open in a new tab

D. Impact of iterations on image quality

Figure 10 shows the impact of the number of iterations in the iterative algorithm described in (23)–(25). Specifically, we unrolled the iterative algorithm for the different numbers of iterations and compared the performance of the resulting networks. We used the hybrid model due to its improved performance. The parameters of both the k-space and image-space networks were shared across iterations. Specifically, MoDL-MUSSELS uses three iterations of alternating-minimization, with five iterations of CG within each alternating step. The IRLS-MUSSELS uses five iterations of both outer loop as well as CG step. The images in Fig. 10 each correspond to a specific direction and slice in the testing dataset. We note that the contrast and details in the image improved with iterations, as did the visualization of some features, as shown in the zoomed portions.

Fig. 10. — Effect of iterations on image quality. We observe that the quality of the reconstructions with the proposed MoDL-MUSSELS scheme improve with iterations. Specifically, the sharpness of the image and the contrast seem to improve with more iterations.

E. Comparisons on experimental data

Next we compare the performance of the proposed method to reconstruct experimental data. Figure 11 shows the reconstructions offered by the different algorithms. A separate network was trained with the experimental data utilizing the IRLS-MUSSELS as the ground truth. While this comparison may not be fair to P-MUSE, we used this approach since the main goal is to validate the MoDL-MUSSELS and U-NET which relied on IRLS-MUSSELS results for training. As evident from the error maps, the U-NET reconstructions appear less blurred, but it seems to miss some key features highlighted by yellow boxes. The hybrid method provides good results comparable to that of IRLS-MUSSELS.

Fig. 11. — Reconstructions obtained using different algorithms on experimental partial-Fourier data. Row 1 and Row 3 shows reconstruction results from two different slices. We generated the error maps in rows 2 and 4 by considering MUSSELS reconstructions as ground truth. The yellow boxes highlight the differences.

To further validate the reconstruction accuracy of all the DWIs corresponding to the test slice, we performed a tensor fitting using all the DWIs and compared the resulting fractional anisotropy (FA) maps and the fiber orientation maps. For this purpose, the DWIs reconstructed using various methods were fed to a tensor fitting routine (FDT Toolbox, FSL). The FA maps were computed from the fitted tensors, and the direction of the primary eigenvectors of the tensors was used to estimate the fiber orientation. The FA maps generated using the various reconstruction methods are shown in Fig. 12, which has been color-coded based on the fiber direction. It is noted that these fiber directions reconstructed by the IRLS-MUSSELS method and the MoDL-MUSSELS match the true anatomy known for this brain region from a diffusion tensor imaging (DTI) white matter atlas (http://www.dtiatlas.org).

Fig. 12. — The fractional-anisotropy maps on a test dataset slice. These images are computed from the sixty directions of the slices, recovered using the respective algorithms. We note the proposed scheme provides less blurred reconstructions than P-MUSE, which are comparable with IRLS-MUSSELS.

VI. DISCUSSION

We observe that the IRLS-MUSSELS reconstructions on experimental data are noisy. This noisy ground truth training data causes fuzziness in the training loss, which translates to the slight blurring in MoDL-MUSSELS reconstructions in Fig. 11. Note that the MoDL-MUSSELS reconstructions in the simulated data experiments in Fig. 8 are less blurred. The dependence of the final image quality on the training data is a limitation of the current work, especially in the multishot diffusion setting where noise-free training data is difficult to acquire. We plan to experiment with denoising strategies as well as the acquisition of training data with multiple averages to mitigate these problems. Further, we note from Fig. 11 that the reconstructions provided by the MoDL-MUSSELS appear less noisy and are visually more appealing than the noisy ground truth obtained using the IRLS-MUSSELS. This behavior may be attributed to the convolutional structure of the network, which is known to offer implicit regularization [33].

In this work, we utilized an eight-layer neural network, as shown in Fig. 5. However, the proposed MoDL-MUSSELS architecture in Fig. 4 is not constrained by choice of network. Any network architecture (e.g., U-NET) may be used instead. It is possible that the results can improve by utilizing more sophisticated network architecture. Further, it can be noted that the proposed model architecture is flexible to allow different network architectures for image-space and k-space models. However, for the proof of concept, we used the same network architecture for both k-space and image space.

To avoid overfitting the model and reduce the training time, the proposed network in Fig. 4 was unfolded for three iterations before performing the joint training. The sharing of network parameters allows the network to be unfolded for any number of iterations without increasing the number of trainable parameters. In this work, we restricted our implementation to a three iteration setting. We note that the results may improve with more outer iterations. However, increasing the outer iterations require more GPU memory during network training.

The deep learning blocks used in the proposed scheme map the noisy/artefact-prone N-shot data to the noise-free N-shot data. The size of the filters in the first and last layers of the deep learning blocks depend on the number of shots. Hence, the network needs to be retrained if the number of shots changes. Since the filters capture the annihilation relations between the shots, we do not anticipate the need to retrain the network if other parameters (e.g., image size, TR, TE, etc.) change. Finally, we note that the current method depends on the estimation of the coil sensitivities to recover the multishot data.

VII. CONCLUSIONS

We introduced a model-based deep learning framework termed MoDL-MUSSELS for the compensation of phase errors in multishot diffusion-weighted MRI data. The proposed algorithm alternates between a conjugate gradient optimization algorithm to enforce data consistency and multichannel convolutional neural networks (CNN) to project the data to appropriate subspaces. We rely on a hybrid approach involving a multichannel CNN in the k-space and another one in the image space. The k-space CNN exploits the annihilation relations between the shot images, while the image domain network is used to project the data to an image manifold. The weights of the deep network, obtained by unrolling the iterations in the iterative optimization scheme, are learned from exemplary data in an end-to-end fashion. The experiments show that the proposed scheme can yield reconstructions that are comparable to state-of-the-art methods while offering several orders of magnitude reduction in run-time.

VIII. ACKNOWLEDGMENT

Data were provided [in part] by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University.

This work is supported by 1R01EB01996101A1. This work was conducted on an MRI instrument funded by 1S10OD025025–01.

Footnotes

¹ The term multichannel is used in a traditional signal processing sense to refer to multichannel convolution using a multichannel filter. The different channels corresponds to the images from different shots of the multishot data. Since we rely on the SENSE forward model [15], the channels do not refer to multi-coil data in parallel MRI.

² The size of the matrix is |Γ| − |Γ ⊖ Λ| × |Γ| such that (17) holds. Here, Γ is the size of the image, and Λ is the size of the filter. ⊖ refers to the set erosion operator as defined in the work [8].

Contributor Information

Hemant K. Aggarwal, department of electrical and computer engineering, University of Iowa, Iowa, USA..

Merry P. Mani, Division of neuroradiology, University of Iowa, Iowa, USA.

Mathews Jacob, department of electrical and computer engineering, University of Iowa, Iowa, USA..

REFERENCES

[1].Le Bihan D, “Looking into the functional architecture of the brain with diffusion MRI,” Nature Reviews Neuroscience, vol. 4, no. 6, p. 469, 2003. [DOI] [PubMed] [Google Scholar]
[2].Ciccarelli O, Catani M, Johansen-Berg H, Clark C, and Thompson A, “Diffusion-based tractography in neurological disorders: concepts, applications, and future developments,” The Lancet Neurology, vol. 7, no. 8, pp. 715–727, 2008. [DOI] [PubMed] [Google Scholar]
[3].Charlton RA, Barrick T, McIntyre D, Shen Y, O’sullivan M, Howe F, Clark C, Morris R, and Markus H, “White matter damage on diffusion tensor imaging correlates with age-related cognitive decline,” Neurology, vol. 66, no. 2, pp. 217–222, 2006. [DOI] [PubMed] [Google Scholar]
[4].Moseley M, Cohen Y, Mintorovitch J, Chileuitt L, Shimizu H, Kucharczyk J, Wendland M, and Weinstein P, “Early detection of regional cerebral ischemia in cats: comparison of diffusion-and T2-weighted MRI and spectroscopy,” Magnetic resonance in medicine, vol. 14, no. 2, pp. 330–346, 1990. [DOI] [PubMed] [Google Scholar]
[5].Wu W and Miller KL, “Image formation in diffusion MRI: a review of recent technical developments,” Journal of Magnetic Resonance Imaging, vol. 46, no. 3, pp. 646–662, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
[6].Mani M, Jacob M, Kelley D, and Magnotta V, “Multi-shot sensitivity-encoded diffusion data recovery using structured low-rank matrix completion (MUSSELS),” Magnetic Resonance in Medicine, vol. 78, no. 2, pp. 494–507, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
[7].Mani M, Aggarwal HK, Magnotta V, and Jacob M, “Improved reconstruction for high-resolution multi-shot diffusion weighted imaging,” in https://arxiv.org/abs/1906.10178, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
[8].Ongie G and Jacob M, “A fast algorithm for convolutional structured low-rank matrix recovery,” IEEE transactions on computational imaging, vol. 3, no. 4, pp. 535–550, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
[9].Aggarwal HK, Mani MP, and Jacob M, “MoDL: Model based deep learning architecture for inverse problems,” IEEE Trans. Med. Imag, vol. 38, no. 2, pp. 394–405, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
[10].Adler J and Öktem O, “Learned primal-dual reconstruction,” IEEE Trans. Med. Imag, vol. 37, no. 6, pp. 1322–1332, 2018. [DOI] [PubMed] [Google Scholar]
[11].Yang Y, Sun J, Li H, and Xu Z, “Deep ADMM-Net for compressive sensing mri,” in Advances in Neural Information Processing Systems 29, 2016, pp. 10–18. [Google Scholar]
[12].Hammernik K, Klatzer T, Kobler E, Recht MP, Sodickson DK, Pock T, and Knoll F, “Learning a Variational Network for Reconstruction of Accelerated MRI Data,” Magnetic resonance in Medicine, vol. 79, no. 6, pp. 3055–3071, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
[13].Biswas S, Aggarwal HK, and Jacob M, “Dynamic MRI using model-based deep learning and storm priors: MoDL-SToRM,” Magnetic Resonance in Medicine, vol. 82, no. 1, pp. 485–494, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
[14].Chartrand R and Yin W, “Iteratively reweighted algorithms for compressive sensing,” in IEEE International Conference on Acoustics, Speech and Signal Processing, March 2008, pp. 3869–3872. [Google Scholar]
[15].Pruessmann KP, Weiger M, Scheidegger MB, and Boesiger P, “SENSE: sensitivity encoding for fast MRI,” Magnetic resonance in medicine, vol. 42, no. 5, pp. 952–962, 1999. [PubMed] [Google Scholar]
[16].Lee J, Han Y, and Ye JC, “k-space deep learning for reference-free EPI ghost correction,” arXiv preprint arXiv:1806.00153, 2018. [DOI] [PubMed] [Google Scholar]
[17].Souza R and Frayne R, “A hybrid frequency-domain/image-domain deep network for magnetic resonance image reconstruction,” arXiv preprint arXiv:1810.12473, 2018. [Google Scholar]
[18].Wang S, Ke Z, Cheng H, Jia S, Leslie Y, Zheng H, and Liang D, “DIMENSION: Dynamic MR imaging with both k-space and spatial prior knowledge obtained via multi-supervised network training,” arXiv preprint arXiv:1810.00302, 2018. [DOI] [PubMed] [Google Scholar]
[19].Eo T, Jun Y, Kim T, Jang J, Lee H-J, and Hwang D, “KIKI-net: cross-domain convolutional neural networks for reconstructing under-sampled magnetic resonance images,” Magnetic resonance in medicine, vol. 80, no. 5, pp. 2188–2201, 2018. [DOI] [PubMed] [Google Scholar]
[20].Akcakaya M, Moeller S, Weingartner S, and Ugurbil K, “Scan-specific robust artificial-neural-networks for k-space interpolation (RAKI) reconstruction: Database-free deep learning for fast imaging,” Magnetic Resonance in Medicine, vol. 81, no. April 2018, pp. 439–453, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
[21].Chen N.-k., Guidon A, Chang H-C, and Song AW, “A robust multishot scan strategy for high-resolution diffusion weighted MRI enabled by multiplexed sensitivity-encoding (MUSE),” Neuroimage, vol. 72, pp. 41–47, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
[22].Chu M-L, Chang H-C, Chung H-W, Truong T-K, Bashir MR, and Chen N.-k., “POCS-based reconstruction of multiplexed sensitivity encoded mri (POCSMUSE): A general algorithm for reducing motionrelated artifacts,” Magnetic Resonance in Medicine, vol. 74, no. 5, pp. 1336–1348, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
[23].Morrison RL, Jacob M, and Do MN, “Multichannel estimation of coil sensitivities in parallel MRI,” in IEEE Int. Symp. Bio. Imag. IEEE, 2007, pp. 117–120. [Google Scholar]
[24].Mohan K and Fazel M, “Iterative reweighted algorithms for matrix rank minimization,” Journal of Machine Learning Research, vol. 13, no. Nov, pp. 3441–3473, 2012. [Google Scholar]
[25].Fornasier M, Rauhut H, and Ward R, “Low-rank matrix recovery via iteratively reweighted least squares minimization,” SIAM Journal on Optimization, vol. 21, no. 4, pp. 1614–1640, 2011. [Google Scholar]
[26].Chan SH, Wang X, and Elgendy OA, “Plug-and-Play ADMM for Image Restoration: Fixed Point Convergence and Applications,” IEEE Trans. on Comp. Imag, vol. 3, no. 1, pp. 84–98, 2017. [Google Scholar]
[27].Ronneberger O, Fischer P, and Brox T, “U-NET: Convolutional networks for biomedical image segmentation,” in International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI) Springer, 2015, pp. 234–241. [Google Scholar]
[28].Van Essen DC, Smith SM, Barch DM, Behrens TE, Yacoub E, Ugurbil K, Consortium W-MH et al. , “The wu-minn human connectome project: an overview,” Neuroimage, vol. 80, pp. 62–79, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
[29].Walsh DO, Gmitro AF, and Marcellin MW, “Adaptive reconstruction of phased array mr imagery,” Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine, vol. 43, no. 5, pp. 682–690, 2000. [DOI] [PubMed] [Google Scholar]
[30].Uecker M, Lai P, Murphy MJ, Virtue P, Elad M, Pauly JM, Vasanawala SS, and Lustig M, “ESPIRiT - An eigenvalue approach to autocalibrating parallel MRI: Where SENSE meets GRAPPA,” Magnetic Resonance in Medicine, vol. 71, no. 3, pp. 990–1001, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
[31].Wang Z, Bovik AC, Sheikh HR, Simoncelli EP et al. , “Image quality assessment: from error visibility to structural similarity,” IEEE Trans. Image Process, vol. 13, no. 4, pp. 600–612, 2004. [DOI] [PubMed] [Google Scholar]
[32].Kingma DP and Ba JL, “Adam: a Method for Stochastic Optimization,” International Conference on Learning Representations 2015, pp. 1–15, 2015. [Google Scholar]
[33].Ulyanov D, Vedaldi A, and Lempitsky V, “Deep image prior,” in IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 9446–9454. [Google Scholar]

[R1] [1].Le Bihan D, “Looking into the functional architecture of the brain with diffusion MRI,” Nature Reviews Neuroscience, vol. 4, no. 6, p. 469, 2003. [DOI] [PubMed] [Google Scholar]

[R2] [2].Ciccarelli O, Catani M, Johansen-Berg H, Clark C, and Thompson A, “Diffusion-based tractography in neurological disorders: concepts, applications, and future developments,” The Lancet Neurology, vol. 7, no. 8, pp. 715–727, 2008. [DOI] [PubMed] [Google Scholar]

[R3] [3].Charlton RA, Barrick T, McIntyre D, Shen Y, O’sullivan M, Howe F, Clark C, Morris R, and Markus H, “White matter damage on diffusion tensor imaging correlates with age-related cognitive decline,” Neurology, vol. 66, no. 2, pp. 217–222, 2006. [DOI] [PubMed] [Google Scholar]

[R4] [4].Moseley M, Cohen Y, Mintorovitch J, Chileuitt L, Shimizu H, Kucharczyk J, Wendland M, and Weinstein P, “Early detection of regional cerebral ischemia in cats: comparison of diffusion-and T2-weighted MRI and spectroscopy,” Magnetic resonance in medicine, vol. 14, no. 2, pp. 330–346, 1990. [DOI] [PubMed] [Google Scholar]

[R5] [5].Wu W and Miller KL, “Image formation in diffusion MRI: a review of recent technical developments,” Journal of Magnetic Resonance Imaging, vol. 46, no. 3, pp. 646–662, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] [6].Mani M, Jacob M, Kelley D, and Magnotta V, “Multi-shot sensitivity-encoded diffusion data recovery using structured low-rank matrix completion (MUSSELS),” Magnetic Resonance in Medicine, vol. 78, no. 2, pp. 494–507, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] [7].Mani M, Aggarwal HK, Magnotta V, and Jacob M, “Improved reconstruction for high-resolution multi-shot diffusion weighted imaging,” in https://arxiv.org/abs/1906.10178, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] [8].Ongie G and Jacob M, “A fast algorithm for convolutional structured low-rank matrix recovery,” IEEE transactions on computational imaging, vol. 3, no. 4, pp. 535–550, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] [9].Aggarwal HK, Mani MP, and Jacob M, “MoDL: Model based deep learning architecture for inverse problems,” IEEE Trans. Med. Imag, vol. 38, no. 2, pp. 394–405, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] [10].Adler J and Öktem O, “Learned primal-dual reconstruction,” IEEE Trans. Med. Imag, vol. 37, no. 6, pp. 1322–1332, 2018. [DOI] [PubMed] [Google Scholar]

[R11] [11].Yang Y, Sun J, Li H, and Xu Z, “Deep ADMM-Net for compressive sensing mri,” in Advances in Neural Information Processing Systems 29, 2016, pp. 10–18. [Google Scholar]

[R12] [12].Hammernik K, Klatzer T, Kobler E, Recht MP, Sodickson DK, Pock T, and Knoll F, “Learning a Variational Network for Reconstruction of Accelerated MRI Data,” Magnetic resonance in Medicine, vol. 79, no. 6, pp. 3055–3071, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] [13].Biswas S, Aggarwal HK, and Jacob M, “Dynamic MRI using model-based deep learning and storm priors: MoDL-SToRM,” Magnetic Resonance in Medicine, vol. 82, no. 1, pp. 485–494, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] [14].Chartrand R and Yin W, “Iteratively reweighted algorithms for compressive sensing,” in IEEE International Conference on Acoustics, Speech and Signal Processing, March 2008, pp. 3869–3872. [Google Scholar]

[R15] [15].Pruessmann KP, Weiger M, Scheidegger MB, and Boesiger P, “SENSE: sensitivity encoding for fast MRI,” Magnetic resonance in medicine, vol. 42, no. 5, pp. 952–962, 1999. [PubMed] [Google Scholar]

[R16] [16].Lee J, Han Y, and Ye JC, “k-space deep learning for reference-free EPI ghost correction,” arXiv preprint arXiv:1806.00153, 2018. [DOI] [PubMed] [Google Scholar]

[R17] [17].Souza R and Frayne R, “A hybrid frequency-domain/image-domain deep network for magnetic resonance image reconstruction,” arXiv preprint arXiv:1810.12473, 2018. [Google Scholar]

[R18] [18].Wang S, Ke Z, Cheng H, Jia S, Leslie Y, Zheng H, and Liang D, “DIMENSION: Dynamic MR imaging with both k-space and spatial prior knowledge obtained via multi-supervised network training,” arXiv preprint arXiv:1810.00302, 2018. [DOI] [PubMed] [Google Scholar]

[R19] [19].Eo T, Jun Y, Kim T, Jang J, Lee H-J, and Hwang D, “KIKI-net: cross-domain convolutional neural networks for reconstructing under-sampled magnetic resonance images,” Magnetic resonance in medicine, vol. 80, no. 5, pp. 2188–2201, 2018. [DOI] [PubMed] [Google Scholar]

[R20] [20].Akcakaya M, Moeller S, Weingartner S, and Ugurbil K, “Scan-specific robust artificial-neural-networks for k-space interpolation (RAKI) reconstruction: Database-free deep learning for fast imaging,” Magnetic Resonance in Medicine, vol. 81, no. April 2018, pp. 439–453, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] [21].Chen N.-k., Guidon A, Chang H-C, and Song AW, “A robust multishot scan strategy for high-resolution diffusion weighted MRI enabled by multiplexed sensitivity-encoding (MUSE),” Neuroimage, vol. 72, pp. 41–47, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] [22].Chu M-L, Chang H-C, Chung H-W, Truong T-K, Bashir MR, and Chen N.-k., “POCS-based reconstruction of multiplexed sensitivity encoded mri (POCSMUSE): A general algorithm for reducing motionrelated artifacts,” Magnetic Resonance in Medicine, vol. 74, no. 5, pp. 1336–1348, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] [23].Morrison RL, Jacob M, and Do MN, “Multichannel estimation of coil sensitivities in parallel MRI,” in IEEE Int. Symp. Bio. Imag. IEEE, 2007, pp. 117–120. [Google Scholar]

[R24] [24].Mohan K and Fazel M, “Iterative reweighted algorithms for matrix rank minimization,” Journal of Machine Learning Research, vol. 13, no. Nov, pp. 3441–3473, 2012. [Google Scholar]

[R25] [25].Fornasier M, Rauhut H, and Ward R, “Low-rank matrix recovery via iteratively reweighted least squares minimization,” SIAM Journal on Optimization, vol. 21, no. 4, pp. 1614–1640, 2011. [Google Scholar]

[R26] [26].Chan SH, Wang X, and Elgendy OA, “Plug-and-Play ADMM for Image Restoration: Fixed Point Convergence and Applications,” IEEE Trans. on Comp. Imag, vol. 3, no. 1, pp. 84–98, 2017. [Google Scholar]

[R27] [27].Ronneberger O, Fischer P, and Brox T, “U-NET: Convolutional networks for biomedical image segmentation,” in International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI) Springer, 2015, pp. 234–241. [Google Scholar]

[R28] [28].Van Essen DC, Smith SM, Barch DM, Behrens TE, Yacoub E, Ugurbil K, Consortium W-MH et al. , “The wu-minn human connectome project: an overview,” Neuroimage, vol. 80, pp. 62–79, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] [29].Walsh DO, Gmitro AF, and Marcellin MW, “Adaptive reconstruction of phased array mr imagery,” Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine, vol. 43, no. 5, pp. 682–690, 2000. [DOI] [PubMed] [Google Scholar]

[R30] [30].Uecker M, Lai P, Murphy MJ, Virtue P, Elad M, Pauly JM, Vasanawala SS, and Lustig M, “ESPIRiT - An eigenvalue approach to autocalibrating parallel MRI: Where SENSE meets GRAPPA,” Magnetic Resonance in Medicine, vol. 71, no. 3, pp. 990–1001, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] [31].Wang Z, Bovik AC, Sheikh HR, Simoncelli EP et al. , “Image quality assessment: from error visibility to structural similarity,” IEEE Trans. Image Process, vol. 13, no. 4, pp. 600–612, 2004. [DOI] [PubMed] [Google Scholar]

[R32] [32].Kingma DP and Ba JL, “Adam: a Method for Stochastic Optimization,” International Conference on Learning Representations 2015, pp. 1–15, 2015. [Google Scholar]

[R33] [33].Ulyanov D, Vedaldi A, and Lempitsky V, “Deep image prior,” in IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 9446–9454. [Google Scholar]

PERMALINK

MoDL-MUSSELS: Model-Based Deep Learning for Multishot Sensitivity-Encoded Diffusion MRI

Hemant K Aggarwal, IEEE

Merry P Mani

Mathews Jacob, IEEE

Roles

Abstract

I. INTRODUCTION

Fig. 1.

II. BACKGROUND

A. Problem formulation

B. Brief Review of MUSSELS

III. DEEP LEARNED MUSSELS

A. IRLS reformulation of MUSSELS

Algorithm 1.

B. Interpretation of IRLS- MUSSELS as an iterative denoiser

Fig. 2.

C. MoDL-MUSSELS Formulation

Fig. 3.

D. Hybrid MoDL-MUSSELS Regularization

Fig. 4.

IV. EXPERIMENTS

A. Dataset Description

B. Multi-coil forward model

C. Quantitative metrics used in experiments

D. Algorithms used for comparison

E. Network architecture and training

Fig. 5.

Fig. 6.

V. RESULTS

A. Comparisons using simulated data

TABLE I.

Fig. 7.

Fig. 8.

B. Robustness to outliers

Fig. 9.

C. Comparison of reconstruction time

TABLE II.

D. Impact of iterations on image quality

Fig. 10.

E. Comparisons on experimental data

Fig. 11.

Fig. 12.

VI. DISCUSSION

VII. CONCLUSIONS

VIII. ACKNOWLEDGMENT

Footnotes

Contributor Information

REFERENCES

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases