Abstract
In diffusion MRI, the outcome of estimation problems can often be improved by taking into account the correlation of diffusion-weighted images scanned with neighboring wavevectors in q-space. For this purpose, we propose in this paper to employ tight wavelet frames constructed on non-flat domains for multi-scale sparse representation of diffusion signals. This representation is well suited for signals sampled regularly or irregularly, such as on a grid or on multiple shells, in q-space. Using spectral graph theory, the frames are constructed based on quasi-affine systems (i.e., generalized dilations and shifts of a finite collection of wavelet functions) defined on graphs, which can be seen as a discrete representation of manifolds. The associated wavelet analysis and synthesis transforms can be computed efficiently and accurately without the need for explicit eigen-decomposition of the graph Laplacian, allowing scalability to very large problems. We demonstrate the effectiveness of this representation, generated using what we call tight graph framelets, in two specific applications: denoising and super-resolution in q-space using ℓ0 regularization. The associated optimization problem involves only thresholding and solving a trivial inverse problem in an iterative manner. The effectiveness of graph framelets is confirmed via evaluation using synthetic data with noncentral chi noise and real data with repeated scans.
1 Introduction
Diffusion-weighted MR signals, in addition to the diffusion time, are controlled by a quantity called wavevector. The wavevector is dependent on the length, strength, and orientation of the gradient pulses during the measurement sequence. It is often denoted as vector q, which can be separated into a scalar wavenumber |q| and a diffusion encoding direction q̂ = q/|q|. The effects of both diffusion time, t, and wavenumber are summarized using a quantity called b-value, which is defined as b = t|q|2. A diffusion MRI protocol normally acquires multiple diffusion-weighted images, each corresponding to a wavevector q in q-space. For example, in shell acquisition schemes, b is fixed by fixing both t and |q| and only the gradient direction varies among measurements. This can be extended to multi-shell acquisition, where the measurements are collected at shells of different b-values. These images afford microstructural information that can be harnessed for tissue characterization and axonal tracing.
Due to the noisy nature of diffusion MRI, it is often beneficial to leverage the correlation between diffusion-weighted images acquired with neighboring wavevectors in q-space for improving estimation robustness in applications such as denoising and compressed-sensing reconstruction. To achieve this, a common practice is by fitting a diffusion model to the data, concurrently taking into account all q-space measurements at each voxel. Such approach, however, is dependent on specific assumptions the model makes regarding the data. For example, the diffusion tensor model assumes that the signals at each voxel reflect only axon bundles that are aligned in one coherent direction, hence falling short in capturing the complexity of the water diffusion patterns in the human brain.
In this paper, we take a wavelet-based approach, which is less dependent on specific biophysical assumptions, to constructing sparse representations for diffusion signals. To cater to the fact that q-space sampling schemes can differ significantly, ranging from Cartesian sampling in diffusion spectrum imaging (DSI) to multi-shell sampling with multiple distinct b-values, we propose to represent the q-space signals using tight wavelet frames defined on graphs [1, 2], which can be flexibly adapted to different sampling domains. The power of tight wavelet frames lies in their ability to sparsely approximate piecewise smooth functions and the existence of fast decomposition and reconstruction algorithms associated with them. A wide range of tight wavelet frames can be generated conveniently using the unitary extension principle (UEP) [3]. We use the term tight graph framelets to denote basic functions that, when dilated and shifted, form tight wavelet frames on graphs. Tight graph framelets have compact support, are not restricted to signals of a certain shape (e.g., the “donut” [4]), and provide a natural bridge between continuous and discrete representations [1].
We demonstrate the utility and effectiveness of tight graph framelets in two specific applications: denoising and super-resolution reconstruction in q-space. To take advantage of the high redundancy of graph framelets, we regularize the associated problems using a sparse-inducing norm. Instead of the more conventional ℓ1 regularization, which has been shown in the theory of compressed sensing to produce sparse solutions, we opted to use ℓ0 regularization. In [5], both iterative soft and hard thresholding algorithms were adopted and the latter was found to achieve better image quality. In [6], wavelet frame based ℓ0 regularization also shows better edge-preserving quality compared with the conventional ℓ1 regularization. Evaluation performed using synthetic data with noncentral chi signal distribution as well as real data with repeated scans indicates that the proposed method is superior to denoising and interpolation using spherical radial basis functions (sRBF) [7].
2 Approach
2.1 Graph Framelets
In q-space, sampling points and their relationships can be encoded, respectively, using vertices and edges of a graph. We denote a graph by 𝒢 ≔ {ℰ, 𝒱, w}, where 𝒱 ≔ {υk ∈ ℳ : k = 1, …, K} is a set of vertices representing points on a manifold ℳ, ℰ ⊂ 𝒱 × 𝒱 is a set of edges relating the vertices, and w : ℰ ↦ ℝ+ is a weight function. The associated adjacency matrix 𝒜 ≔ (wk,k′) is symmetric with wk,k′ > 0 if υk and υk′ are connected by an edge in ℰ; otherwise wk,k′ = 0. Given the degree matrix D ≔ diag{d[1], d[2], …, d[K]}, where d[k] ≔ ∑k′ wk,k′, the graph Laplacian, defined as ℒ ≔ D − W, is consistent with the Laplace-Beltrami operator of the manifold [8]. Denote by the pairs of eigenvalues and eigenvectors of ℒ with 0 = λ0 ≤ λ1 ≤ λ2 ≤ … ≤ λK−1 = λmax. The eigenvectors form an orthonormal basis for all functions on the graph: . The Fourier transform of a function f𝒢 : 𝒱 ↦ ℝ on the graph 𝒢 is given by .
The key idea involved in constructing wavelet frames on a graph is to view eigenvectors of the graph Laplacian as Fourier basis on graphs and the associated eigenvalues as frequency components [1,2]. One then slices the frequency spectrum in a multi-scale fashion by using a set of masks {âr(·) : r = 0, …, R}, where â0(·) acts as a low-pass filter and âr(·) with 0 < r ≤ R as band-pass or high-pass filters. More specifically, the graph framelet analysis transform is defined as W f𝒢 ≔ {Wl,r f𝒢 : (l, r) ∈ ℬL}, with ℬL ≔ {(1, 1), (1, 2), …, (1, R), (2, 1), …, (L, R)} ∪ {(L, 0)} and
(1) |
where λ̃k = (λk/λmax)π and γ > 1 is the dilation factor. Noting the eigen-decomposition ℒ = UΛU⊤, where Λ = diag{λ0, λ1, …, λK−1}, we have
(2) |
where Λ̃ = diag{λ̃0, λ̃1, …, λ̃K−1} and
(3) |
Letting α ≔ W f𝒢 ≔ {αr,l ≔ Wl,r f𝒢 : (l, r) ∈ ℬL} and if the masks satisfy , which is one of the requirements of the unitary extension principle (UEP) [1, 3]), it is easy to show that the synthesis transform W⊤α gives W⊤α = W⊤W f𝒢 = I f𝒢 = f𝒢. To avoid explicit decomposition of a potentially large Laplacian matrix, we approximate the masks using Chebyshev polynomials, i.e., , where with Tk(ξ) being the k-th order Chebyshev polynomial and
(4) |
Equation (2) can be shown to be equivalent to
(5) |
Note the interchange between scalar-valued and matrix-valued polynomials. See [1, 2] for more details. Some examples of framelet masks are shown in Table 1.
Table 1.
Haar | Linear | Quadratic | ||
---|---|---|---|---|
â0(ξ) = cos(ξ/2) | â0(ξ) = cos2(ξ/2) | â0(ξ) = cos3(ξ/2) | ||
â1(ξ) = sin(ξ/2) | ||||
â2(ξ) = sin2(ξ/2) | ||||
â3(ξ) = sin3(ξ/2) |
2.2 Sparse Recovery
Given the observation vector f ∈ ℝK′, we are interested in recovering u ∈ ℝK by solving the following problem:
(6) |
where and with ℐ(z) being an indicator function that returns 1 when z ≠ 0 or 0 otherwise. Here, u is defined on 𝒢 with degree matrix D ≔ diag{d[1], d[2], …, d[K]}. The observation vector f is defined in relation to a subset of vertices of 𝒢, indexed by {ki : i = 1, …, K′}. We let D′ ≔ diag{d[k1], d[k2], …, d[kK′]}. K′ and K are in general not necessarily equal. We require K′ ≤ K.
The problem (6) can be solved effectively using penalty decomposition (PD) [6, 9]. Defining auxiliary variables υ ≔ (υl,r) ≔ (Wl,ru), this amounts to minimizing the following objective function with respect to u and υ:
(7) |
In PD, we (i) alternate between solving for u and υ using block coordinate descent (BCD). Once this converges, we (ii) increase μ > 0 by a multiplicative factor that is greater than 1 and repeat step (i). This is repeated until increasing μ does not result in further changes to the solution [6, 9].
First Subproblem
We solve for υ in the first problem, i.e., minυ Lμ(u, υ):
(8) |
The solution can be obtained via hard thresholding [10, 11]:
(9) |
Second Subproblem
By taking the partial derivative of Lμ(u, υ) with respect to u, the solution to the second subproblem, i.e., minu Lμ(u, υ), is
(10) |
Since , the problem can be simplified to become
(11) |
which is a set of linear equations that can be solved, for example, using the conjugate gradient method.
2.3 Denoising and Super-Resolution
We are interested in the super-resolution problem of recovering signals when given only a subset of noisy measurements. We partition u as u = [u1; u2] ∈ ℝK, where u1 ∈ ℝK′ is defined on vertices whose observed values are given by f ∈ ℝK′. No observed values are available for u2. To recover u, we set in (6) A = [IK′ × K′ 0] ∈ ℝK′ × K, where IK′ × K′ is an identity matrix of size K′ × K′. When K′ = K, i.e., u = u1 and A = IK′ × K′, the problem reduces to a denoising problem, where we are interested in recovering the noiseless version of f. Since
(12) |
the resulting matrix on the left of (11) is diagonal and the solution can hence be obtained easily without matrix inversion.
3 Experiments
3.1 Parameter Settings
For all experiments, we used the quadratic masks, set the decomposition level to L = 5, and set the maximum order of Chebyshev polynomials to n = 8. No significant gain in improvement was obtained beyond these values. The dilation factor was set to a standard value γ = 2. The tuning parameters for the sparse recovery problem were set as
(13) |
with η = 0.75 and (i.e., the universal penalty level [12]). We define the adjacency matrix 𝒜 ≔ (wk,k′) of the q-space samples by letting
(14) |
where σb = 10 and θ is set to the maximum of the angles between neighboring gradient directions.
3.2 Results
Synthetic Data
We generated a synthetic dataset consisting of a set of voxels with white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF) signal profiles. Combinations of these signal profiles were also included to simulate partial volume effects. To generate the WM signal profiles, mixtures of up to two tensor models were used. The diffusivities for WM, GM, and CSF are respectively [1.7, 0.3, 0.3] × 10−3 mm2/s, 0.9 × 10−3 mm2/s, and 2.5 × 10−3 mm2/s. The number of non-collinear gradient directions were 21, 81, and 321, each generated by subdividing the faces of an icosahedron a number of times. Three shells were generated, i.e., b = 1000, 2000, 3000 s/mm2. For evaluation, two levels of 32-channel noncentral chi noise [13] was added, corresponding to SNR = 20 and 30 with respect to the non-diffusion-weighted signal of WM.
Results for denoising (21 − 21, 81 − 81, 321 − 321) and super-resolution (21 − 81, 81 − 321), averaged over 10 realizations of noise, are shown in Fig. 1. The comparison baseline is the spherical radial basis function (sRBF) method described in [7]. This basically amounts to weighted averaging of signals on the shells using the weights given by (14). The results confirm that our method yields consistent PSNR improvements. Note that, for both synthetic and real data, noncentral chi bias was removed from the solution of (6) using the method described in [13].
Real Data
The real datasets were acquired using Siemens 3T TRIO MR scanner with b = 2000 s/mm2 and 48 non-collinear gradient directions. The imaging protocol is as follows: 128 × 96 imaging matrix, voxel size of 2 × 2 × 2 mm3, TE = 97 ms, TR = 11,300 ms. Imaging acquisition was repeatedly performed on the same subject for 8 times. We averaged the 8 sets of diffusion-weighted images and removed the bias caused by the noncentral chi noise to obtain the ground truth for evaluation.
The performance statistics for the denoising of the 8 sets of diffusion-weighted images are shown in Fig. 2, again confirming that our method yields results that are in close agreement with the ground truth with higher PSNRs. We also show in Fig. 3 the white matter fiber orientation distribution functions (ODFs) to demonstrate the benefit of q-space super-resolution. We used the measurements from the original 48 gradient directions to generate the data for 81 directions. It can be observed from the figure that increasing the angular resolution helps resolve some fiber crossings.
4 Conclusion
In this paper, we present preliminary results confirming that tight graph framelets can be employed for sparse representation of diffusion signals in q-space. The effectiveness of this representation is demonstrated with applications involving denoising and super-resolution. Future work will be directed to incorporating this representation with spatial framelet representation to obtain a joint representation for both spatial and wavevector dimensions.
Acknowledgments
This work was supported in part by NIH grants (NS093842, EB006733, EB008374, EB009634, AG041721, and MH100217).
References
- 1.Dong B. Sparse representation on graphs by tight wavelet frames and applications. Applied and Computational Harmonic Analysis. 2015 [Google Scholar]
- 2.Hammond DK, Vandergheynst P, Gribonval R. Wavelets on graphs via spectral graph theory. Applied and Computational Harmonic Analysis. 2011;30(2):129–150. [Google Scholar]
- 3.Ron A, Shen Z. Affine systems in L2(ℝd): The analysis of the analysis operator. Journal of Functional Analysis. 1997;148(2):408–447. [Google Scholar]
- 4.Michailovich O, Rathi Y. On approximation of orientation distributions by means of spherical ridgelets. IEEE Transactions on Image Processing. 2010;19(2):461–476. doi: 10.1109/TIP.2009.2035886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Chan RH, Chan TF, Shen L, Shen Z. Wavelet algorithms for high-resolution image reconstruction. SIAM Journal on Scientific Computing. 2003;24(4):1408–1432. [Google Scholar]
- 6.Zhang Y, Dong B, Lu Z. ℓ0 minimization for wavelet frame based image restoration. Mathematics of Computation. 2013;82:995–1015. [Google Scholar]
- 7.Tuch DS. Q-ball imaging. Magnetic Resonance in Medicine. 2004;52:1358–1372. doi: 10.1002/mrm.20279. [DOI] [PubMed] [Google Scholar]
- 8.Belkin M, Niyogi P. Towards a theoretical foundation for Laplacian-based manifold methods. Learning Theory. 2005;(486–500) [Google Scholar]
- 9.Lu Z, Zhang Y. Sparse approximation via penalty decomposition methods. SIAM Journal on Optimization. 2013;23(4):2448–2478. [Google Scholar]
- 10.Lu Z. Iterative hard thresholding methods for l0 regularized convex cone programming. Mathematical Programming: Series A and B. 2014;147(1–2):125–154. [Google Scholar]
- 11.Yap PT, Zhang Y, Shen D. Diffusion compartmentalization using response function groups with cardinality penalization. Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2015;9349:183–190. doi: 10.1007/978-3-319-24553-9_23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Donoho DL. De-noising by soft-thresholding. IEEE Transactions on Information Theory. 1995;41(3):613–627. [Google Scholar]
- 13.Koay CG, Özarslan E, Basser PJ. A signal transformational framework for breaking the noise floor and its applications in MRI. Journal of Magnetic Resonance. 2009;197(108–119) doi: 10.1016/j.jmr.2008.11.015. [DOI] [PMC free article] [PubMed] [Google Scholar]