Abstract
This paper establishes a modeling framework for data located onto or close to (unknown) smooth manifolds, embedded in Euclidean spaces, and considers its application to dynamic magnetic resonance imaging (dMRI). The framework comprises several modules: First, a set of landmark points is identified to describe concisely a data cloud formed by highly under-sampled dMRI data, and second, low-dimensional renditions of the landmark points are computed. Searching for the linear operator that decompresses low-dimensional data to high-dimensional ones, and for those combinations of landmark points which approximate the manifold data by affine patches, leads to a bi-linear model of the dMRI data, cognizant of the intrinsic data geometry. Preliminary numerical tests on synthetically generated dMRI phantoms, and comparisons with state-of-the-art reconstruction techniques, underline the rich potential of the proposed method for the recovery of highly under-sampled dMRI data.
I. INTRODUCTION
Current medical research and diagnosis rely heavily on magnetic resonance imaging (MRI); a non-invasive and non-ionizing technology for high fidelity visualization of anatomical structures and physiological functions [9]. Collecting a sufficiently large number of MRI data to guarantee high-quality image reconstruction is an inherently slow process due to patient discomfort as well as to physical and physiological constraints imposed on the scanning speed [9]. Such obstacles arise prominently in dynamic (d)MRI, where data acquisition needs to abide also by the inescapable constraints imposed by the monitored dynamical process, e.g., a beating heart [8].
Under-sampling the dMRI data is an efficient way to speed up scanning times, at the price of aliasing effects in the reconstructed images. To surmount aliasing given the limited number of data, spatio-temporal correlations and any available prior knowledge regarding the dMRI mechanism have to be taken into account. Along these lines, compressed sensing [6], [11], [15] and the principal-component-analysis (PCA) or low-rank-approximation paradigm [5], [7], [10], [12], [26] are modeling frameworks that have shown a rich potential in incorporating successfully prior information and data dependencies into dMRI-recovery algorithms. For example, [26] extracts a low-dimensional subspace, via the singular-value-decomposition of a highly under-sampled data matrix, prior to forming a convex inverse problem whose solution yields high-quality reconstructed images. Elaborate dictionary learning (DL) techniques have also contributed largely to the dMRI literature [1], [14], [23], [24], [27]. Remarkably, in all of the previous DL techniques, etre is no utilization of the (presumably low-dimensional) data geometry, other than the popular algebraic tool of matrix factorization.
Driven by the recently successful machine-learning paradigm, manifold-learning techniques have been also implemented in dMRI recovery [13], [16], [22], [28]. Local linear embedding (LLE) [18] is used in [22] to achieve dimensionality reduction of the high-dimensional dMRI data, prior to the application of a reconstruction algorithm. Study [16] formulates a convex minimization recovery task which is penalized by a Laplacian-matrix-based quadratic term that quantifies all the available knowledge about the underlying smooth data manifold. Further, to exploit any potential non-linear dependencies between data, [28] maps the observed data to even higher dimensional functional spaces, via kernel mappings, and solves an inverse problem to recover the dependencies in the original input space. Very recently, a joint manifold-learning and sparsity-cognizant framework has been introduced in [13]. Prior to forming a convex minimization recovery task, the intrinsic low-dimensionality geometry of the data is learned from their highly under-sampled renditions via an LLE-motivated framework [19]. Extensive experimentation has showed that learning, first, any (potentially non-linear) data geometry helps the solutions of convex inverse problems to produce higher quality reconstruction images than state-of-the-art PCA-based techniques [13].
The present study takes [13] a step forward into explicitly modeling the low-dimensional dMRI data geometry. Built again on an unknown smooth-manifold hypothesis for the data geometry, a small set of data representatives, called landmark points, are extracted from the under-sampled dMRI data cloud. These landmark points provide a concise description of the observed data, promoting low-dimensional and parsimonious data representations as well as efficient data storage. Motivated by the concept of tangent spaces of smooth manifolds, data are approximated via affine combinations of the extracted landmark points. Similarly to [13] and along the lines of [19], a dimensionality-reduction module is applied to compute the low-dimensional rendition, or, “compressed” versions of the landmark points. Having the previous affine combinations, as well as a linear decompression operator, as unknowns of the data-modeling hypothesis, a bi-linear model is derived [cf. (2)], cognizant of the underlying low-dimensional nature of the observed dMRI data. Exploiting also the fact that dMRI data usually capture periodic time series, e.g., a beating heart, a non-convex, bi-linear recovery task, penalized by terms which account for the periodicity of the recorded time series, is formulated to achieve high-fidelity data reconstruction. Subsequently, recently developed convex and non-convex minimization techniques are employed to solve the resultant recovery tasks. Preliminary numerical tests on synthetically generated high-dimensional dMRI phantom data and comparisons against state-of-the-art methodologies underline the rich potential of the method for high-quality data recovery.
II. PROPOSED FRAMEWORK
MRI data are observed at the discrete k-space, or frequency domain, which admits complex-valued data and spans an area of size Np × Nf, where Np stands for the number of phase-encoding lines and Nf for the number of frequency-encoding ones [9]. In dMRI, an additional dimension is added to the MRI k-space to account for the time horizon (the axis vertical on the paper in Fig. 1a), resulting in the augmented (k,t)-space. In other words, the dMRI (k,t)-space can be viewed as the Nfr-fold Cartesian product of the (Np × Nf)-sized MRI k-space, where Nfr represents the number of collected MRI frames over time. K-space data ( is the set of all complex-valued numbers), j ∈ {1, …, Nfr }, can be also viewed as the two-dimensional (discrete) Fourier transform of the Np × Nf image-domain data , i.e., Without any loss of generality, this study assumes that the “low-frequency” part of is located around the center of the Np × Nf area. Availability of a large number of dMRI data is infeasible in practice; the (k,t)-space of Fig. 1a is usually severely under-sampled [8]. To extract useful information from (k,t)-space data, the present framework assumes full availability of a part of the low-frequency region of the (k,t)-space. For simplicity and illustration reasons, it is assumed that a small number ν (≪ Np ) of phase-encoding lines of (coined “navigator”) data are always available (the gray-colored area in Fig. 1a). These navigator data will be exploited to learn the intrinsic low-dimensional structure of the dMRI data. Other than the navigator data, highly (pseudo-randomly) under-sampled (k,t)-space data are considered to be also available.
Fig. 1.
(a) The Np × Nf × Nfr (k,t)-space. Data geometry is learned from the “navigator” data, i.e., the data which lie in the gray-colored ν × Nf × Nfr area of the (k,t)-space (ν ≪ Np ). (b) Landmark points are affinely combined to describe . All possible affine combinations of are denoted by the gray-colored area.
To facilitate processing and data representations, the (k,t)-space data are vectorized. More specifically, the operation stacks one column of below the other to yield the complex-valued Np Nf × 1 vector , ∀j ∈ {1, …, Nfr}. To avoid notation clutter, stands also for the two-dimensional (discrete) Fourier transform even when applied to vectorized versions of image frames: . All vectorized k-space frames are gathered in the Np Nf × Nfr matrix , so that the vectorized original image-domain data are . In a similar way, per jth k-space frame, the “navigator” data yield a νNf × 1 vector and thus the νNf × Nfr matrix . Clearly, Ynav = ΩY, where Ω is a submatrix of the identity matrix INpNf that selects the rows of Y which correspond to the navigator data. Recovering Y from its partial Ynav is viable only under errors, e.g., an approximation can be obtained via Ω†Ynav, where Ω† stands for the Moore-Penrose pseudoinverse of Ω. The following modeling hypothesis generalizes the previous argument.
Assumption 1. There exist an Np Nf ×νNf matrix G1 and an Np Nf × Nfr matrix E1, which gathers all approximation errors, such that (s.t.) Y = G1Ynav + E1.
A. Landmark points
The high-dimensional navigator data carry useful information about spatio-temporal dependencies in the (k,t)-space. To promote parsimonious data representations, especially in cases where Nfr attains large values, it is desirable to extract a subset , called landmark points, which provide a “concise description,” in a user-defined sense, of the data cloud . To this end, the following modeling assumption imposes structure on .
Assumption 2. Data lie on a smooth low-dimensional manifold [21] embedded in the high-dimensional Euclidean space (cf. Fig. 1b).
For example, the most commonly met case of a smooth manifold in theory and practice is a linear subspace. Based on As. 2 and the concept of the tangent space of a smooth manifold, it is conceivable that neighboring landmark points cooperate affinely to describe vector (the gray-colored area in Fig. 1b depicts all possible affine combinations of ). Upon defining the νNf × Nl matrix , it is assumed that there exists an Nl ×1 vector bj that renders the approximation error small, where denotes the standard Euclidean norm of space . Since affine combinations are desirable, bj is constrained to satisfy , where stands for the all-one Nl × 1 vector and superscipt ⊤ denotes vector/matrix transposition. Moreover, motivated by the low-dimensional nature of , according to As. 2, it is envisioned that only a few landmark points cooperate into representing , i.e., bj is sparse. The previous arguments are summarized into the following modeling hypothesis.
Assumption 3. There exist a sparse Nl ×Nfr matrix B, with , and a νNf ×Nfr matrix E2, which gathers approximation errors, s.t. Ynav = ΛB + E2.
The previous modeling assumption holds true in the prototypical case where is a linear subspace. In such a case, any linearly independent subset of , which spans the column (range) space of Ynav, can be selected as the columns of Λ. Notice also that under such a choice for Λ, the coefficient matrix B satisfies , since the column (range) space of Ynav, being a linear subspace, is also the affine hull [17] of the columns of Λ.
Although several strategies may be implemented to identify the landmark points Λ of a data cloud that lies onto or close to a general manifold , a greedy optimization methodology, introduced in [2], is adopted here. In short, at every step of the algorithm, a landmark point is selected from by maximizing, over all un-selected , the minimum distance to the landmark points which have been already selected up to the previous step of the algorithm. The algorithm of [2] scores a computational complexity of order , which is naturally heavier than that of the naive random-selection algorithm that chooses randomly from .
B. Reducing the dimension of the landmark points
The landmark points Λ, obtained in the previous section, are still high-dimensional. To meet restrictions imposed by finite computational resources, it is desirable to reduce the dimensionality of Λ. To this end, the methodology of [19], which is motivated by [3], [18], is employed. The approach comprises two steps.
1) Given Λ and a user-defined λW > 0 solve the convex minimization task
| (1) |
where stands for the Frobenius norm of a matrix. A few comments on the choice of the loss function and constraints follow. Since lie on the manifold , then according to As. 2, Fig. 1b and the discussion regarding , any point taken from may be faithfully approximated by an affine combination of the rest of the landmark points. In other words, there exists a matrix W s.t. Λ ≈ ΛW. With manifesting the previous desire for affine combinations, the constraint diag(W) = 0 is used to exclude the trivial solution of the identity matrix for W. Task (1) is an affinely constrained composite convex minimization task, and, hence, the framework of [20] can be employed to solve it, due to the flexibility by which [20] deals with affine constraints when compared with state-of-the-art convex optimization techniques.
2) Once W has been obtained from the previous step and for a user-defined integer number d ≪ Np Nf, solve
where the constraint is used to exclude the trivial solution of . It is not difficult to verify that the solution of the previous task is nothing but the complex conjugate transpose of the matrix (denoted by the * superscipt) which comprises the d minimal eigenvectors of .
The following hypothesis establishes a linear relation between Λ and its low-dimensional rendition .
Assumption 4. There exist an Np Nf × d matrix G3 and an Np Nf × Nl matrix E3, which gathers all approximation errors, so that .
Matrix G3 can be viewed as the “decompression” operator which reconstructs the “full” Λ from its low-dimensional representation .
C. Data recovery task
Putting modeling assumptions 1, 3 and 4 together, it can be verified that there exist matrices G and E s.t.. Upon defining , and since due to the definition of (cf. Sec. II), the following bi-linear model between Y and the unknowns (U, B) is established:
| (2) |
Bi-linearity means that whenever one of the block of variables (U, B) is fixed to a specific value, then the dependence between Y and the other block of variables is linear (modulo the error E term). Interestingly, the linearity of the inverse Fourier transform suggests that the previous modeling hypothesis holds true also in the image domain:
In practice, only few (k,t)-space data are known. To explicitly take account of the limited number of data, a (linear) sampling operator is introduced where keeps only those entries of Y that are collected by the measurement system. It is also often in dMRI that image frames capture a periodic process, e.g., heart movement, other than the static background. In other words, it is reasonable to assume that the one-dimensional Fourier transform of the time profile of every MR pixel, i.e., every row of the matrix , is a sparse vector. Since the DC Fourier coefficients do not provide any useful information on periodic time series, the linear operator is also applied to to remove the first column of , which gathers the DC Fourier coefficients of all the MR-pixel time profiles.
To summarize, given parameters λ1, λ2, λ3, CU > 0, the bi-linear data-recovery task is formulated as
| (3) |
where ei denotes the ith column of the identity matrix Id and the auxiliary variable Z is used to incorporate the sparsity of into the design. Notice that the CU bound is used to prevent unbounded solutions for U due to the bi-linearity in . To solve the non-convex task (3), the successive-convex-approximation framework of [4] is employed and presented in a concise form in Alg. 1.
Step 7 of Alg. 1 comprises convex minimization sub-tasks. More specifically, at every step of the algorithm, given (Un, Bn, Zn), the following estimates are required (for τU , τB > 0):
| (4) |
and
| (5) |
Both (4) and (5) can be viewed as affinely constrained composite convex minimization tasks, so that [20] can be used to solve them. From a computational complexity perspective, it is important to point out that the proposed scheme relies on minimization sub-tasks for three block variables, whose computational complexities depend on the solver adopted for the optimization. It is also worth noting that those three minimization sub-tasks are independent of each other, hence they can be solved in parallel. Details on the implementation issues of parallel optimization techniques, which speed up execution time, and of [20], as the optimization module which solves the minimization sub-tasks, are deferred to an upcoming journal publication.
III. NUMERICAL TESTS
Following [13], an MRXCAT phantom [25], based on extended cardiac torso (XCAT), was used to generate breath-hold cardiac cine data of size (Np , Nf , Nfr ) = (408, 408, 360). Data are dynamic, since they include 15 cardiac cycles and 24 cardiac phases, and under-sampled: i) A number of ν := 4 navigator lines are considered (cf. Fig. 1); ii) Only 8 samples, picked randomly, out of the Nfr -length time profile of each k-space pixel are considered. The locations of the chosen samples are distributed independently over all time profiles. Overall, the under-sampling factor turns out to be (approximately) equal to 32. The proposed method is tested vs. the PCA-based PS-sparse scheme [26] and [13]. Validation is performed via the normalized-root-mean-square error where X is the original image-domain data (cf. Sec. II) and stands for an estimate of X, computed via any of the employed reconstruction schemes. Parameters of each method are tuned to achieve the best possible NRMSE performance. With regards to the proposed method, the dimensionality of the compressed data is d := 4, while the number of landmark points is set to Nl := 60.
Figs. 2 and 3 depict results of the numerical tests for all employed algorithms. Fig. 2 demonstrates reconstruction results for image frame #351 of the dMRI data. Further, Fig. 3 compares the NMRSE errors across all 360 frames. It can be observed that i) the errors for the proposed scheme are uniform across all 360 frames, as opposed to the fluctuations observed for the other two schemes, and ii) the proposed scheme achieves the lowest NMRSE across all frames. Quantitative results on the whole data set (408 × 408 × 360) show an NRMSE of 0.051 for the proposed scheme, which is lower in comparison to the value of 0.081 for PS-Sparse [26] and 0.079 for [13].
Fig. 2.

Original frame #351, of size 408 × 408, and its reconstructed versions. NRMSE values per method are: 0.059 for PS-sparse [26], 0.06 for [13] and 0.052 for the proposed method.
Fig. 3.
Frame-wise NRMSE: Comparing the proposed scheme (yellow curve), [13] (blue curve, ‘MLS’) and [26] (red curve, ‘PS’) over all 360 MRXCAT frames of size 408 × 408. The proposed method achieves the lowest NRMSE uniformly across all frames.
IV. CONCLUSIONS AND THE ROAD AHEAD
A modeling framework of manifold data and its application to dMRI were introduced. Given a cloud of highly under-sampled dMRI data, landmark points were computed to capture the underlying low-dimensional manifold-data geometry. Low-dimensional renditions of the landmark points were also computed to promote parsimonious representations and efficient storage of data. Overall, the dMRI reconstruction task was viewed as an inverse problem where the data fidelity is modeled via a bi-linear term, and the objective function is penalized by a term which quantifies the a-priori information on the periodicity of the observed time series. Preliminary numerical tests on synthetically generated phantom dMRI data underline the rich potential of the proposed framework against state-of-the-art dMRI reconstruction techniques. On-going research includes tests on real data, comparisons with additional state-of-the-art techniques and several extensions of the advocated model, which will be presented during CAMSAP 2017.
ACKNOWLEDGMENT
The authors would like to thank the Institute of Biomedical Eng., ETH Zurich, and the Cardiovascular Magnetic Resonance Society for making the data, used in Sec. III, available online. Moreover, this work was partially supported by the NSF grants 1514056, 1718796 and the NIH grant R21EB020861.
Contributor Information
Konstantinos Slavakis, Email: kslavaki@buffalo.edu.
Gaurav N. Shetty, Email: gauravna@buffalo.edu.
Abhishek Bose, Email: abose3@buffalo.edu.
Ukash Nakarmi, Email: ukashnak@buffalo.edu.
Leslie Ying, Email: leiying@buffalo.edu.
REFERENCES
- [1].Awate SP and DiBella EVR, “Spatiotemporal dictionary learning for undersampled dynamic MRI reconstruction via joint frame-based and dictionary-based sparsity,” in Proc. Intern. Symp. Biomedical Imag, 2012, pp. 318–321. [Google Scholar]
- [2].De Silva V and Tenenbaum JB, “Sparse multidimensional scaling using landmark points,” Stanford University, Tech. Rep, 2004. [Google Scholar]
- [3].Elhamifar E and Vidal R, “Sparse manifold clustering and embedding,” in Proc. NIPS, Granada: Spain, December 2011. [Google Scholar]
- [4].Facchinei F, Scutari G, and Sagratella S, “Parallel selective algorithms for nonconvex big data optimization,” IEEE Trans. Signal Process, vol. 63, no. 7, pp. 1874–1889, 2015. [Google Scholar]
- [5].Jung H, Ye JC, and Kim EY, “Improved k-t BLAST and k-t SENSE using FOCUSS,” Physics in Medicine and Biology, vol. 52, no. 11, pp. 3201–3226, 2007. [DOI] [PubMed] [Google Scholar]
- [6].Liang D, DiBella EVR, Chen R-R, and Ying L, “k-t ISD: Dynamic cardiac MR imaging using compressed sensing with iterative support detection,” Magnetic Resonance in Medicine, vol. 68, no. 1, pp. 41–53, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Liang Z-P, “Spatiotemporal imaging with partially separable functions,” in Proc. Intern. Symp. Biomedical Imag, 2007, pp. 988–991. [Google Scholar]
- [8].Liang Z-P and Lauterbur PC, “An efficient method for dynamic magnetic resonance imaging,” IEEE Trans. Medical Imag, vol. 13, no. 4, pp. 677–686, 1994. [DOI] [PubMed] [Google Scholar]
- [9].Liang Z-P, Principles of Magnetic Resonance Imaging: A Signal Processing Perspective IEEE Press, 2000. [Google Scholar]
- [10].Lingala S, Hu Y, DiBella EVR, and Jacob M, “Accelerated dynamic MRI exploiting sparsity and low-rank structure: k-t SLR,” IEEE Trans. Medical Imag, vol. 30, no. 5, pp. 1042–1054, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Lustig M, Donoho D, and Pauly JM, “Sparse MRI: The application of compressed sensing for rapid MR imaging,” Magnetic Resonance in Medicine, vol. 58, no. 6, pp. 1182–1195, 2007. [DOI] [PubMed] [Google Scholar]
- [12].Mardani M, Ying L, Scutari G, Slavakis K, and Giannakis GB, “Dynamic MRI using subspace tensor tracking,” in Proc. Engineering in Medicine & Biology Conf, Chicago: USA, August 2014. [Google Scholar]
- [13].Nakarmi U, Slavakis K, Lyu J, and Ying L, “M-MRI: A manifold-based framework to highly accelerated dynamic magnetic resonance imaging,” in Proc. Intern. Symp. Biomedical Imag, Melbourne: Australia, April 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Nakarmi U, Zhou Y, Lyu J, Slavakis K, and Ying L, “Accelerating dynamic magnetic resonance imaging by nonlinear sparse coding,” in Proc. Intern. Symp. Biomedical Imag, 2016, pp. 510–513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Otazo R, Kim D, Axel L, and Sodickson DK, “Combination of compressed sensing and parallel imaging for highly accelerated first-pass cardiac perfusion MRI,” Magnetic Resonance in Medicine, vol. 64, no. 3, pp. 767–776, 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Poddar S and Jacob M, “Dynamic MRI using smoothness regularization on manifolds (SToRM),” IEEE Trans. Medical Imag, vol. 35, no. 4, pp. 1106–1115, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [17].Rockafellar RT, Convex Analysis Princeton, NJ: Princeton University Press, 1970. [Google Scholar]
- [18].Saul LK and Roweis ST, “Think globally, fit locally: Unsupervised learning of low dimensional manifolds,” J. Machine Learning Research, vol. 4, pp. 119–155, 2003. [Google Scholar]
- [19].Slavakis K, Giannakis GB, and Leus G, “Robust sparse embedding and reconstruction via dictionary learning,” in Proc. Conf. Inform. Sciences and Systems, Baltimore: USA, March 2013. [Google Scholar]
- [20].Slavakis K and Yamada I, “Fejér-monotone hybrid steepest descent method for affinely constrained and composite convex minimization tasks,” arXiv e-prints, 2017. [Online]. Available: http://arxiv.org/abs/1608.02500 [Google Scholar]
- [21].Tu LW, An Introduction to Manifolds New York: Springer, 2008. [Google Scholar]
- [22].Usman M, Atkinson D, Kolbitsch C, Schaeffter T, and Prieto C, “Manifold learning based ECG-free free-breathing cardiac CINE MRI,” J. Magnetic Resonance Imag, vol. 41, no. 6, pp. 1521–1527, 2015. [DOI] [PubMed] [Google Scholar]
- [23].Wang Y, Cao N, Liu Z, and Zhang Y, “Real-time dynamic MRI using parallel dictionary learning and dynamic total variation,” Neurocomputing, vol. 238, pp. 410–419, 2017. [Google Scholar]
- [24].Wang Y, Zhou Y, and Ying L, “Undersampled dynamic magnetic resonance imaging using patch-based spatiotemporal dictionaries,” in Proc. Intern. Symp. Biomedical Imag, 2013, pp. 294–297. [Google Scholar]
- [25].Wissmann L, Santelli C, Segars WP, and Kozerke S, “MRXCAT: Realistic numerical phantoms for cardiovascular magnetic resonance,” J. Cardiovascular Magnetic Resonance, vol. 16, no. 1, p. 63, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Zhao B, Haldar JP, Christodoulou AG, and Liang Z-P, “Image reconstruction from highly undersampled (k,t)-space data with joint partial separability and sparsity constraints,” IEEE Trans. Medical Imag, vol. 31, no. 9, pp. 1809–1820, 2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Zhong W, Li D, Wang L, and Zhang M, “Low-rank plus sparse reconstruction using dictionary learning for 3D-MRI,” in Proc. Intern. Congress Image and Signal Process., BioMedical Eng. and Informatics, 2016, pp. 1407–1411. [Google Scholar]
- [28].Zhou Y, Shi C, Ren F, Lyu J, Liang D, and Ying L, “Accelerating MR parameter mapping using nonlinear manifold learning and supervised pre-imaging,” in Proc. Intern. Symp. Biomedical Imag, 2015, pp. 897–900. [Google Scholar]


