Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Apr 15.
Published in final edited form as: IEEE Trans Med Imaging. 2021 Apr 1;40(4):1157–1167. doi: 10.1109/TMI.2020.3048933

Separation of Metabolites and Macromolecules for Short-TE 1H-MRSI Using Learned Component-Specific Representations

Yahang Li 1, Zepeng Wang 1, Ruoyu Sun 2, Fan Lam 3,4
PMCID: PMC8049099  NIHMSID: NIHMS1689914  PMID: 33395390

Abstract

Short-echo-time (TE) proton magnetic resonance spectroscopic imaging (MRSI) allows for simultaneously mapping a number of molecules in the brain, and has been recognized as an important tool for studying in vivo biochemistry in various neuroscience and disease applications. However, separation of the metabolite and macromolecule (MM) signals present in the short-TE data with significant spectral overlaps remains a major technical challenge. This work introduces a new approach to solve this problem by integrating imaging physics and representation learning. Specifically, a mixed unsupervised and supervised learning-based strategy was developed to learn the metabolite and MM-specific low-dimensional representations using deep autoencoders. A constrained reconstruction formulation is proposed to integrate the MRSI spatiospectral encoding model and the learned representations as effective constraints for signal separation. An efficient algorithm was developed to solve the resulting optimization problem with provable convergence. Simulation and experimental results have been obtained to demonstrate the component-specific representation power of the learned models and the capability of the proposed method in separating metabolite and MM signals for practical short-TE 1H-MRSI data.

IndexTerms—: Proton (1H) magnetic resonance spectroscopic imaging, short TE, signal separation, deep learning, deep autoencoder, low-dimensional models

I. Introduction

PROTON MRSI (1H-MRSI) is a unique molecular imaging modality that can noninvasively map various endogenous metabolites in the brain. This molecular-level information has been demonstrated useful in different neuroscience and clinical applications, including brain tumors [1], [2], metabolic disorders [3], and neurodegenerative diseases [4], [5]. Short-echo-time (TE) 1H-MRSI, in particular, offers several unique advantages compared to the more commonly used long-TE acquisitions, such as higher signal-to-noise ratio (SNR) due to less relaxation-induced signal loss and improved detection and quantification of molecules with short T2’s and/or J-coupled spins, e.g., myo-inositol (mI), glutamate (Glu) and glutamine (Gln) [6]–[10]. However, the applications of short-TE 1H-MRSI have been limited by several technical challenges. One of these major challenges is the presence of macromolecule (MM) signals that overlap with the metabolite signals across the entire spectrum. This makes accurate and reproducible metabolite quantification difficult. It has been demonstrated that metabolite quantification can be substantially improved with better characterization and separation of MM signals from the short-TE data [11]–[14]. Moreover, the separated MM components may also provide additional biomarkers for various disease applications [13], [15]–[17].

A number of methods have been proposed to separate the metabolite and MM signals in short-TE MRSI data. One approach is to suppress the metabolite or MM signals during the data acquisition stage by exploiting their longitudinal relaxation (T1) or diffusion property differences [18]. Examples include the most commonly used inversion recovery (IR) based excitation strategies, which are designed to null either the metabolite (with longer T1’s) or MM (with shorter T1’s) signals to measure the other [11], [19]–[22]. Methods that use two acquisitions to obtain both metabolites and MMs have also been proposed [13], [23]. Due to the variable ranges of T1 values for different molecules in vivo, complete nulling of metabolites or MMs is impossible and additional processing are usually needed to further remove the residual spectral components. If both metabolites and MMs are desired, two acquisitions are needed which will inherently increase the imaging time.

An alternative signal processing based approach is to model the overall short-TE data using parametric models of metabolites and MMs individually. The separation is then achieved by estimating the model parameters for each component from the data, e.g., solving a constrained nonlinear least-square problem [11], [24]–[29]. Improved fitting strategies have been proposed to take advantage of the fast decaying nature of the MM signals for better separation. Specifically, one can fit and back-extrapolate the metabolites using a truncated FID with negligible MM contributions and estimate the MM component by subtracting the extrapolated metabolite fits from the original signal [27], [30], [31]. Iterative subtraction and refitting can be done for improved separation performance. However, these methods are sensitive to model mismatch and noise, and often lead to substantial voxel-to-voxel estimation variations for practical MRSI data. Nonetheless, inspired by these parametric models, we recognized that the metabolites and MMs have their distinct spectral patterns specified by just a few spectral parameters, e.g., concentrations, resonance frequencies, and lineshapes, thus should reside in their own nonlinear low-dimensional manifolds embedded in the original high-dimensional space [32]. We hypothesized that these manifolds could be learned from specially designed training data and used as effective constraints for metabolite and MM separation.

While learning nonlinear low-dimensional models from high-dimensional heterogeneous data has been a major challenge in machine learning, recent breakthroughs in deep learning have enabled excellent solutions to such problems [33]–[35]. Leveraging this progress, deep neural networks have been successfully adapted to process and quantify short-TE MRSI data with MM separation/removal capability [36]–[39]. The initial attempts have been focused on training an end-to-end network that learns the inverse function to directly map the noisy and artifact containing data to the desired spectral parameters [36], [39], spectra with MM components removed [37], or even a group of networks to extract individual metabolites [38]. These methods require the complicated networks to simultaneously capture the physical model, metabolite and MM spectral variations, and all other nuances related to noise, artifacts, and acquisition designs. Recently, an alternative approach has been proposed to use deep networks to learn a low-dimensional representation of general MR spectra and use this as a prior in a constrained reconstruction formalism instead of a direct inverse mapping. This approach not only simplifies the learning problem but also allows for more flexible integration with the physical forward models for different acquisition designs. And it has been successfully applied to 31P-MRSI reconstruction [40].

Inspired by this approach, we proposed here a novel method to separate metabolite and MM signals for short-TE 1H-MRSI by learning their distinct nonlinear low-dimensional models and incorporating the learned models into a constrained reconstruction formulation. Specifically, we proposed a new strategy that combines supervised and unsupervised learning to train two special deep autoencoders (DAEs) to learn efficient low-dimensional representations that are specific to metabolites and MMs, respectively. We devised a formulation to integrate the learned representations as effective constraints with a spatiospectral encoding model for joint reconstruction as well as signal separation. An efficient algorithm was developed to solve the resulting optimization problem. We demonstrated the efficient and component-specific low-dimensional representations learned by our DAEs for metabolites and MMs, respectively. Numerical simulations and in vivo experiments were performed to illustrate the superior separation performance achieved by the proposed method over the standard parametric fitting method. Theoretical convergence analysis for the proposed algorithm was also provided. The following sections describe the proposed model learning strategy, reconstruction formulation, and numerical algorithm in details.

II. Background

A. Metabolite and Macromolecule Signal Separation

In a short-TE 1H-MRSI acquisition, the data will contain non-negligible contributions from both a metabolite component ρmet(r, t) and a macromolecule component ρMM(r, t). The goal of separating these two signal components can be mathematically defined as estimating ρmet(r, t) and ρMM(r, t) from their summation:

ρ(r,t)=ρmet(r,t)+ρMM(r,t), (1)

which is an ill-posed problem. Solving this problem requires effective constraints. One of the most common approaches is to impose parametric models on each component, i.e., ρmet(r, t) = fmet(t; α(r)) and ρMM(r, t) = fMM(t; β(r)) where α(r) and β(r) contain spectral parameters for the metabolites and MMs, respectively. While fmet(t; α(r)) usually incorporates resonance structures/metabolite basis generated by quantum mechanical simulations or phantom measurements, models for fMM(t; β(r)) are generally considered to be less molecule-specific (some MM peaks can be attributed to specific amino-acid residues in various proteins which may help to generate potentially stronger spectral priors [19]). As a result, different models including polynomials [27], [41], splines [24], [28], wavelets [26], and Gaussian lineshapes [13], [21], [22], [29] have been considered for fMM. Gaussian lineshape based models with a priori determined chemical shift frequencies (from extensive in vivo and in vitro experiments) have shown a great balance between model complexity and fitting accuracy.

Specifically, a widely used model for both metabolites and MMs for an individual FID in short-TE 1H-MRSI data can be written as follows [14], [37], [42]

ρ(t)=m=1Mcmei(ϕ0+ϕm)vm(t)et/T2,m*+i2πδfmt+l=1Lblei(ϕ0+ψl)et2π2Wl24ln(2)+i2πδflt, (2)

where the first summation represents the metabolite signals with cm, T2,m*, and δfm denoting the concentrations coefficients, physiology/experiment-dependent lineshapes and frequency shifts for individual molecules, respectively, and {vm(t)} corresponds to the metabolite basis. The second term captures the MM signals where Wl and δfl denote the Gaussian linewidths and resonance frequencies for each MM group. The variables ϕ0, ϕm, and ψl are a global zeroth-order phase and molecule-dependent phases. While directly estimating all these parameters from a single FID or spectrum can be rather challenging and lead to large estimation variances, these models imply that the metabolite and MM signals may reside in their own nonlinear low-dimensional manifolds, which we believe can be learned and then used as effective constraints for metabolite and MM separation.

III. Proposed Method

A. Learning Component-Specific Low-Dimensional Models

Learning a single low-dimensional model for MR spectra by treating the entire spectrum as a point in a high-dimensional space has been investigated in [40], [43], [44]. While these learned models are powerful for spatiospectral reconstruction or denoising, they are not suited to address the signal separation problem, which requires component-specific constraints. The metabolite and MM signals have their own unique signal characteristics (e.g., distinct spectral features and parameter distributions); thus, separate models can be learned to capture the low-dimensional manifolds where they reside on or close to. However, a straightforward application of the previously described DAE [40] trained using metabolites and MMs individually may not be effective since they are not optimized to differentiate the two spectrally overlapping signal components. Therefore, we proposed here a mixed supervised and unsupervised learning strategy to address this issue. Specifically, we seek to train a DAE that can extract an accurate and efficient low-dimensional representation of the metabolite (or MM) signals with simultaneously minimal representation capability of the other component. Figure 1 provides a graphical illustration of this special model learning strategy. Mathematically, the learning problems are formulated as follows, for metabolites:

{θ^met}=argminθmet 1N[n=1Nϵ(xmet nNmet (xmet n;θmet ))+Nmet(xmmn;θmet)22], (3)

and for macromolecules:

{θ^mm}=argminθmm1N[n=1Nϵ(xmmnNmm(xmmn;θmm))+Nmm(xmetn;θmm)22] (4)

{xmetn}n=1N and {xmmn}n=1N are training sets (FIDs) for the metabolites and MMs, respectively, with each xnR2T being a sample with real and imaginary parts concatenated. N is the number of training samples and T is the length of the FID. Nmet(.,.) and Nmm(.,.) denote the metabolite and MM component-specific neural networks (NNs) parameterized by θmet and θmm, respectively. Here in the first term of Eq. (3), ϵ represents the loss for training that measures the error between the original metabolite input and the network approximation. This term enforces the NN to learn an accurate low-dimensional representation of the metabolite signals. The second term of Eq. (3), appearing as a regularization, is designed to minimize the output of the metabolite NN that corresponds to the MM data (xmmn). This can also be viewed as using zeros as the labels for the MM input. As a result, the metabolite NN is trained to learn a representation specific to metabolite signal features with minimal representation power for MMs. Likewise, the two terms of Eq. (4) serve a similar purpose (to capture MM-specific low-dimensional features while minimizing metabolite representation). We hypothesized that DAEs trained separately in this fashion would have not only the ability to extract accurate nonlinear low-dimensional representations of metabolites and MMs individually but also the desired property that inaccurately models the other component. The specific network has an embedded “bottleneck” encoding-decoding structure that encodes the high-dimensional data into a set of L-dimensional features that can recover the original data, where L is referred to as the model order below. More details on the network are provided in the supplementary materials (Fig. S4).

Fig. 1.

Fig. 1.

Illustration of the proposed model learning strategy with a mixture of supervised and unsupervised learning. Specifically, two DAEs are designed to capture the metabolite and MM-specific low-dimensional representations. For metabolite DAE (metabolite-specific NN), the unsupervised part enforces the network to extract a set of low-dimensional features that can approximate the metabolite signals accurately while the supervised part uses zeros as labels for the corresponding MM inputs. This enforces the network to learn to minimize its representation power of the MM signals, which will be useful for signal separation. A similar training strategy is applied to MM-specific DAE (macromolecule-specific NN) with the roles of metabolites and MMs exchanged. The mathematical formulations of this training strategy are provided in Eqs. (3) and (4).

B. Signal Separation Using the Learned Models

With the trained DAEs Nmet(.,.) and Nmm(.,.) capturing learned metabolite and MM-specific models, one remaining challenge is to effectively utilize the learned models for signal separation from practical 1H-MRSI data. To this end, we proposed a regularized reconstruction formulation that integrates the forward spatiospectral encoding model with B0 inhomogeneity correction capability and the two learned models for metabolite and MM separated reconstruction. Specifically, we formulated the separation problem as:

minXmetX1,XmmX2dΩ{FB(Xmet+Xmm)}22+λ1n=1NNmet(Xmetn)Xmetn22+λ2n=1NNmm(Xmmn)Xmmn22+λ3Dw(Xmet+Xmm)F2, (5)

where XmetCN×T and XmmCN×T are matrix representations of the spatiotemporal functions of interest for the metabolite and MM components, with each row being a T-point FID and N the number of voxels. The feasible sets X1, X2 are balls with large enough radius that contain ground-truth representations.1 B models the linear phases induced by B0 inhomogeneity, ⊙ represents a point-wise multiplication, F denotes the Fourier transform, Ω is a (k, t)-space sampling operator (allows for flexible sampling designs), and d is a vector containing the noisy measured data. The first term enforces the imaging model and data consistency. The next two terms impose the priors that FID signals of metabolites and MMs belong to their own low-dimensional manifolds captured by the learned DAEs. The last term is a spatial smoothness constraint with Dw being a weighted finite-difference operator [45], and ‖·‖F denoting the Frobenius norm. Eq. (5) results in a high-dimensional optimization problem, which is challenging to solve due to the presence of both nonlinear functions related to the DAEs and quadratic functions of Xmet and Xmm.

C. Optimization Algorithm

We developed an efficient algorithm to address the computational challenges associated with the problem in Eq. (5). Specifically, we introduced an auxiliary variable

S=B(Xmet+Xmm) (6)

and reformulated the problem as:

{X^met,X^mm,S^}=argminXmet,Xmm,SdΩ{FS}22+λ1n=1NNmet(Xmetn)Xmetn22+λ2n=1NNmm(Xmmn)Xmmn22+λ3DwB¯SF2 s.t. B(Xmet+Xmm)=S,XmetX1,XmmX2, (7)

where B¯ denotes element-wise conjugate of B. Then, the alternating direction method of multipliers (ADMM) was adapted to solve this equivalent problem [46], in which it was decomposed into simpler linear least-squares problems and nonlinear problems that can be solved in a parallel fashion. More specifically, the following subproblems were solved iteratively:

  1. Update Xmet with fixed Xmm(i) and S(i) as follows (i is the iteration index)
    Xmet(i+1)=argminXmetX1λ1n=1NNmet(Xmetn)Xmetn22+ρ2B(Xmet+Xmm(i))S(i)+Y(i)ρF2 (8)
    where Y(i) is the Lagrangian multiplier and ρ is the penalty parameter.
  2. Update Xmm with fixed Xmet(i+1) and S(i) as
    Xmm(i+1)=argminXmmX2λ2n=1NNmm(Xmmn)Xmmn22+ρ2B(Xmet(i+1)+Xmm)S(i)+Y(i)ρF2 (9)
  3. Update S with X(i+1)=Xmet(i+1)+Xmm(i+1) by solving
    S(i+1)=argminSdΩ{FS}22+λ3DwB¯SF2+ρ2BX(i+1)S+Y(i)ρF2 (10)
  4. Update Y as
    Y(i+1)=Y(i)+ρ(BX(i+1)S(i+1)) (11)

Subproblems (I) and (II) contain both terms associated with the nonlinear networks, Nmet(.,.) and Nmm(.,.), and can be solved using a generic nonlinear optimization solver. Subproblem (III) is a typical linear least-squares problem with a quadratic regularization. Note that although directly minimizing Eq. (8) and Eq. (9) are very high-dimensional problems for which computing the gradient is very demanding, it can be solved in a voxel-by-voxel fashion since the Frobenius norm term is separable for all the voxels (i.e., individual rows in X). Based on the autoencoder design, the gradients for individual voxels can be efficiently calculated through backpropagation. More specifically, denote

fn(Xmetn,Xmmn)=λ1Nmet(Xmetn)Xmetn22+λ2Nmm(Xmmn)Xmmn22+ρ2[BXS(i)+Y(i)ρ]n22 (12)

as the cost function for the nth voxel, the then gradients for the metabolite component can be written as:

fn(Xmetn)=2λ1(JNmetI)T(Nmet(Xmetn)Xmetn)+ρB(n)H(B(n)XnSn(i)+Yn(i)ρ) (13)

where JNmetRT×T is the Jacobian of the metabolite network, I is a T × T identity matrix, and B(n) represents a diagonal matrix formed by the nth row of B. The gradients for MM component can be derived similarly (omitted due to space constraint). And the Jacobians JNmet and JNmm can be calculated through backpropagation described in [40].

Subproblem (III) is equivalent to solving a system of linear equations with a spatial regularization on the overall spatiotemporal function (due to the way the auxiliary variable is introduced). The iteration is terminated until a specified iteration number is reached (e.g., 20) or the relative change between Xmet(i) and Xmet(i+1) is below a threshold (e.g., 10−4).

D. Training Data Generation

One common issue of training deep neural networks is the requirement of a large number of high-quality training data. Strategies that combine spectral fitting models, quantum mechanical (QM) simulations and experimental data have been described in several literature to address this issue for spectral model learning [37], [40], [47]. Utilizing a similar strategy here, we generated metabolite and MM training data separately using the model in Eq. (2). For metabolites, the basis vm(t) were generated from QM simulations using the NMRScopeB software [48], for both the FID (pulse and acquire) and semi-LASER excitation schemes with different TEs [7]. These molecule specific resonance structures can be assumed to be invariant with respect to different subjects. Meanwhile, the empirical distributions of the spectral parameters, i.e., cm, T2,m*, and δfm were estimated from literature values as well as fitting experimental high-SNR, low-resolution MRSI data from healthy volunteers [37], [43], [47]. The empirical distributions were fitted to parametric Gaussian distributions to allow for generating more randomly distributed parameters. The global zeroth-order phase was generated from a Gaussian distribution with mean zero and standard deviation of 25 degrees, and Gaussian distributed molecule dependent phases were also introduced to simulate more realistic signal variations (with mean zero and standard deviation of 10 degrees). Finally, the metabolite basis and parameters randomly sampled from these distributions were combined using the model in Eq. (2) to generate 300,000 1H MR spectra. Metabolites commonly observed and quantified in 1H-MRSI experiments are considered, i.e., N-acetylaspartate (NAA), creatine (Cr), choline (Cho), glutamate (Glu), glutamine (Gln), myo-inositol (mI), gamma-Aminobutyric acid (GABA), glutathione (GSH) and lactate (Lac). For MMs, another 300,000 training samples were generated using a similar procedure and the model in Eq. (2). 13 commonly reported MM resonances with mean δfl’s equal to 0.9, 1.21, 1.38, 1.63, 2.01, 2.09, 2.25, 2.61, 2.96, 3.11, 3.67, 3.8, and 3.96 ppm were included. The parameters bl (concentration coefficients) and Wl (linewidths) for different peak groups were assumed to follow Gaussian distributions. The mean values were acquired from [37] with standard deviations specified as 20% of the means to introduce relative peak variations. A global scaling factor was introduced to the MM coefficients to reflect experimentally observed metabolite-to-MM signal ratios. Finally, all data were normalized to the range of −1 to 1 for training.

E. Other Implementation Details

Among the 300,000 training data, 200,000 were used for training and 100,000 for testing. All the model parameters were upper and lower bounded based on our own and other published 1H-MRSI data [37], [38]. Specifically, the T2,m* values were lower bounded by 5 ms and upper bounded by 200 ms, and the cm values were bounded between 0 and 2 with the mean NAA concentration being 1. The linewidth Wl values were bounded within the range of 5 to 70 Hz, and bl’s were lower bounded by 0. The parameters were first generated, and the values outside these ranges were excluded. The spectral bandwidth (BW) was fixed at 2000 Hz. Similar to the DAE used in [40], the metabolite and MM networks have a fully-connected structure of 2T − 1000 − 250 − 100 − L − 100 − 250 − 1000 − 2T. Both Tanh and ReLu units can be used in the nonlinear hidden layers except the middle linear layer (with similar performances). The results shown below were from ReLu. The learned network models were first evaluated with a range of L’s. The exact L for phantom and in vivo data processing was chosen, such that the NNs for metabolites and MMs have similar approximation errors (around 5%). Note that the training only needs to be performed once for a fixed excitation scheme (with specific choices of RF pulses, TE, and field strength). All the networks were implemented in PyTorch and trained using an NVIDIA RTX Titan graphics processing unit on Windows 10 using the Adam optimizer [49] with a batch size of 500, an initial learning rate of 0.001, and 300 epochs while the other parameters remained as default. The Broyden–Fletcher–Goldfarb–Shanno (BFGS) algorithm was used to solve the optimization problem for individual voxels in Eqs. (8) and (9), and the linear conjugate gradient was used to solve Eq. (10) [50].

IV. Simulation and Experimental Settings

A. Numerical Simulations

The component-specific representation power of our learned models was first evaluated. Specifically, we validated the approximation accuracy of the trained DAE-based metabolite and MM-specific low-dimensional models with comparison to linear subspace models (estimated from the same training data) [43], at different model orders. Testing metabolite and MM data were generated and passed through the trained metabolite and MM-specific networks, respectively. The errors of the same test data projected onto the metabolite and MM subspaces were also calculated. More specifically, two Casorati matrices were first constructed by stacking all the training data for metabolites or MMs, respectively. Then the component-specific subspace was obtained by SVD with a rank truncation (L, the model order). Finally, the testing data were projected onto the two subspaces separately to evaluate approximation accuracy [40], [43]. The approximation performance was evaluated quantitatively using a relative 2 error defined as:

err=XtrueX^FXtrueF (14)

where Xtrue denotes the original data (each column being an FID), and X^ represents the model approximation or reconstructed data (see below).

A numerical phantom was constructed to evaluate the signal separation performance using the learned DAE-based nonlinear models (details of the phantom generation process can be found in the supplementary materials). In short, brain tissue fraction maps for gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF) were first obtained from an in vivo anatomical T1-weighted image. Then regional spectral parameters for different 1H metabolites and MM components described in the Training Data Generation section were assigned based on literature values [14], [37], [51]. The constant parameters in each region were subsequently combined using tissue fraction maps as weightings to simulate continuously varying parameters across the brain. Finally, the parameters at different voxels along with metabolite basis {vm} were fed into Eq. (2) to synthesize spatially localized FIDs. To simulate a more realistic scenario, voxel-dependent random frequency shifts (mean zero and standard deviation of 5 Hz) for different molecules, as well as B0 inhomogeneity (mean zero and standard deviation of 10 Hz), were also introduced. A lesion-like feature with significantly altered metabolite ratios (e.g., a factor of three higher Cho and lower concentrations for other metabolites) and a higher MM level was included. Noisy data were generated by adding complex white Gaussian noise with different SNRs to the simulated (x, t)-space data. The SNR is defined with respect to the maximum NAA peak amplitude within the FOV. After the proposed reconstruction, the separated metabolite and MM components were fitted individually using a metabolite-only and a MM-only parametric model from Eq. (2) in a voxel-by-voxel fashion to produce molecular maps. In comparison, a direct parametric fitting was also performed to the original data without the proposed separation. An FID truncation was performed to fit the metabolites first which were then back-extrapolated for subtraction to fit the MMs. A metabolite refitting was done after subtracting the fitted MMs from the original data. All fittings were done using in-house implementations which have been validated against the time-domain fitting using the jMRUI package [27], [48] (The customized implementations provided more flexibility for further optimizations).

B. In Vivo Experiments

We have evaluated the performance of the proposed method using practical in vivo data acquired from five healthy volunteers with approval from the local Institutional Review Board. Experimental brain MRSI data were acquired on a 3T Prisma scanner equipped with a 20-channel head coil using both an FID-MRSI (pulse and acquire) sequence and a semi-LASER MRSI short-TE sequence (sLASER). We chose these two sequences because they are among the most commonly used short-TE data acquisition schemes and require rather different metabolite basis sets which serves to demonstrate that the proposed method can be flexibly adapted to work with any sequences. The parameters for the FID-MRSI sequence were as follows: TR/TE = 800/4 ms, field-of-view (FOV) = 230 × 230 mm2, slice thickness = 10 mm, matrix size = 36 × 36, spectral bandwidth (BW) = 2000 Hz and 512 FID samples. The total acquisition time was about 13.5 minutes with elliptical sampling. The parameters for the sLASER sequence were: TR/TE = 1600/40 ms, FOV = 180×190 mm2, slice thickness = 15 mm, matrix size = 24×24, 2000 Hz BW and 1024 FID samples. The total acquisition time was about 16 minutes. A 60 Hz weak water suppression and carefully placed outer volume suppression bands were used for all the scans. Before reconstruction, the nuisance water and lipid signals were first removed using the method in [52] followed by coil combination of the water/lipid-removed data.

V. Results

A. Simulation Results

Figure 2 compares the representation powers of our learned component-specific nonlinear low-dimensional models and the linear subspace models. As shown, the learned metabolite (Fig. 2a) and MM (Fig. 2b) DAE-based models achieved higher accuracy with lower relative 2 errors than the subspace models for their respective components. With the same model order (L), the MM DAE has higher accuracy than the metabolite DAE. The approximations for two representative testing metabolite and MM spectra are also shown in Fig. 2cf to further demonstrate the accuracy and specificity of the learned models. With a fixed model order L = 24 for metabolite and L = 8 for MM, the metabolite DAE can accurately capture the metabolite spectral features (Fig. 2c), exhibiting a higher accuracy than the subspace model with the same dimension. Similar results can be observed for the MM test spectrum (Fig. 2d). More importantly, the metabolite DAE offers a poor approximation of the MM spectrum as we designed it to, while the linear metabolite subspace can still capture a decent amount of MM spectral energy (Fig. 2e), implying a weaker capability for signal separation. This component-specific representation can also be observed for the MM DAE (Fig. 2f), which does not capture the metabolite spectral features, while the MM subspace can again capture a large portion of metabolite signal energy. These validate the desirable component-specific representation capability of the learned models and imply their unique potential for improved metabolite and MM separation.

Fig. 2.

Fig. 2.

Representation capability of the learned nonlinear models: a) Approximation errors (relative 2) of the trained metabolite DAE (orange curve) compared to a linear subspace model (blue curve) for the metabolite data at different L’s; b) Approximation errors of the MM DAE (orange curve) with comparison to a linear subspace model (blue curve); c) and e) Representative metabolite and MM spectra (black), and the approximations of each signal by the metabolite DAE (orange) and metabolite linear subspace (blue) both with L = 24; d) and f) Representative MM and metabolite spectra (black), and the approximations by the MM DAE (orange) and subspace (blue) with L = 8. It is evident that the learned DAEs have more accurate and component-specific representation than the linear subspaces.

A set of metabolite and MM signal separation results from the numerical phantom (SNR = 30) are shown in Figs. 3 and 4. Here the resulting spectra were shown in magnitude for visualization purpose (the real parts of reconstructed spectra can be found in the supplementary materials Fig. S1). The model order (L) was chosen as 24 for the metabolite DAE and 16 for the MM DAE with similar approximation errors (~5% error), which achieved a good balance between model complexity and approximation accuracy. The regularization parameters λ1 and λ2 were chosen based on a single voxel separation performance, and λ3 was chosen using the discrepancy principle and then fine-tuned by minimizing the 2 errors of the final spatiospectral reconstructions. A time-domain direct parametric model-based fitting with back-extrapolation was also performed as described above, and the results were compared. The separated metabolite and MM spatiotemporal distributions from the proposed method were subject to parametric fitting (using the metabolite-only and MM-only parametric models, respectively). As can be seen, both the parametric fitting and the proposed method achieved similar estimates of the overall spectra (Fig. 3c), but the proposed method produced significantly more accurate separated metabolite and MM components (Figs. 3d and e). The metabolite maps obtained by direct parametric fitting of the overall data (Parametric Fitting) and the proposed method (separate fitting) are compared in Fig. 4. The results demonstrate the benefits of the proposed signal separation. More specifically, the metabolite maps from fitting the separated signals exhibit significantly less spatially dependent estimation variances and higher accuracy than those produced by a direct parametric fitting of the combined signals. An additional set of results from less noisy data (SNR = 60) are shown in the supplementary materials (Fig. S3).

Fig. 3.

Fig. 3.

Simulation results: a) Spatial variations of the overall metabolite and MM signals in the phantom (2 integral along the FID dimension); b) A sampled voxel spectrum and its noisy counterpart (SNR = 30); c)-e) Separation results from the proposed method (orange curves) and the direct parametric fitting (blue curves, without the proposed separation) for three different voxels (in GM, WM, and Lesion). The voxel locations are indicated by different shapes in (a). Similar overall spectra (c) were produced by both methods. But the proposed method yielded more accurate separated metabolite (d) and MM (e) components. The black arrows identify some spectral features better recovered by the proposed method.

Fig. 4.

Fig. 4.

Simulation results: molecular maps of NAA, Cr, Cho, and MM from the ground truth (Gold Standard, column 1), the direct parametric fitting method (column 2) and the proposed method (column 3) are compared. For the proposed method, the maps were obtained by fitting the separated metabolite and MM components individually. The first MM peak group (located at ~0.9 ppm) is shown [37]. Note that the MM maps were normalized separately, thus having a different scaling compared to the metabolite maps. Relative 2 errors for the separated metabolite and MM signals are also calculated (shown in the images). The improved signal separation offered by the proposed method lead to significantly improved molecular quantification.

B. In Vivo Results

A set of spatially-resolved spectral reconstruction obtained by the proposed method from the in vivo FID-MRSI data are shown in Fig. 5. As can be seen, the proposed method was able to separate the metabolite and MM spectral components with a similar overall spectrum to the direct parametric fitting method. Furthermore, the metabolite and MM maps from the proposed method exhibited a higher quality with less spatial estimation variances. More specifically, the molecule maps produced by the direct parametric fitting had apparent artifacts, e.g., locally dark/bright areas and sudden discontinuities (indicated by the white arrows in Fig. 5b). These artifacts were effectively reduced in the maps from the proposed method. The relative peak intensities may not appear the same as those from standard IR-based MM measurements due to different T1 weightings (effects of no IR and the shorter TR used), e. g., a strong 0.9 ppm MM peak even before separation. Hence, results from another dataset acquired with a longer TR (1500 ms) are included in the supplementary materials (Fig. S2) to further illustrate the TR effects. We have also performed a reconstruction of the same FID-MRSI data with the first 36 time points truncated (i.e., 18 ms) to evaluate the robustness of the proposed method. As shown in Fig. 6, significantly reduced MM signals were obtained, indicating that the learned model is not overfitting.

Fig. 5.

Fig. 5.

Experimental results from the in vivo FID-MRSI data: a) and b) Maps of NAA, Cr, Cho, Glu and MM estimated from the separated signals produced by the proposed method (a) and from the direct parametric fitting method (b). The molecular maps are overlaid on an anatomical image for the matched slice; c) Spatially-resolved spectra from the voxel marked by the blue symbol, with the first and second rows showing the results from the proposed method (orange curves) and parametric fitting (blue curves), respectively. The original spectra are shown in black. The overall reconstruction, as well as the separated metabolite and MM spectra, are compared. The white arrows indicate some artifact-like features generated by the direct parametric fitting which are not present in the maps produced by the proposed method.

Fig. 6.

Fig. 6.

Results from the same data in Fig. 5 but with the first 36 FID points truncated (~18 ms) and zero-padded to the original length. Two representative spatially-resolved spectra from the locations marked by the corresponding symbols are shown, including the original spectra (black), the overall reconstruction, and the separated metabolite and MM spectra (orange). Significantly reduced MM signals were observed, indicating that the proposed method is not overfitting.

Figure 7 shows the spatially-resolved spectral reconstruction from the sLASER data to further demonstrate the utility of the proposed method. Signals can only be observed from the central region of the brain due to the volume selective excitation. The proposed method again produced visually better separation, which can be observed in the metabolite and MM maps (e.g., better gray/white matter contrast and fewer artifacts) as well as the selective voxel spectra. The proposed method effectively reduced the over and underestimation of some metabolite and MM spectral components in the parametric fitting method (Figs. 7b and d). Additional metabolite maps can be found in the supplementary materials (Fig. S5). Our approach should work for any excitation as long as the corresponding metabolite basis can be obtained for training data generation.

Fig. 7.

Fig. 7.

Experimental results from the in vivo sLASER data: a) Maps of NAA, Cr and MM from the proposed method (overlaid on anatomical images); b) Two representative spatially-resolved spectra (voxel locations marked by the corresponding symbols) with the original spectra (black, column 1), the overall reconstruction (column 1), and the separated metabolite (column 2) and MM (column 3) spectra; c) and d) The corresponding results from the parametric fitting method with the same arrangement. The spectra from the two methods (b and d) are from the same voxels. The arrows indicate some over and underestimation of metabolite or MM components from the parametric fitting that were mitigated by the proposed method.

C. Convergence Analysis

We have also performed convergence analysis of the proposed algorithm. Figure 8 shows the relative changes between iterates (‖X(i+1)X(i) ‖/‖X(i)) and relative 2 errors for the metabolite and MM estimates w.r.t. the iteration number. Empirical convergence can be observed. Furthermore, our problem formulation and the ADMM-based algorithm allow us to theoretically characterize its convergence.

Fig. 8.

Fig. 8.

Convergence analysis of the algorithm: (a) Relative changes (in terms of 2 errors) between different iterates; (b) Relative 2 errors for the metabolite (orange) and MM (blue) components w.r.t. different iterations. As can be seen, the result changes minimally after 10 iterations.

Theorem 1:

There exists a constant ρ0 such that if ρρ0, every limit point of the sequence (X(i),S(i)) generated by the algorithm described in (8), (9), (10), and (11) is a stationary solution of the optimization problem (7) (i.e. a solution that satisfies the KKT condition).

This theorem states that for a proper choice of penalty parameter ρ, the sequence generated by our algorithm is guaranteed to converge to stationary points. For non-convex problems, convergence to the global minimum is often very difficult. Thus, we follow the common practice to prove a result of convergence to stationary points. We remark that even the convergence to a stationary solution is a non-trivial property because a general convergence result of ADMM for non-convex problems is still an open question. This theoretical characterization is enabled by the unique structure of our problem formulation: we show that it is a special case of the non-convex sharing problem [53], for which the convergence results have been established.

Proof Sketch:

If we denote X1 = vec(Xmet), X2 = vec(Xmm), X0 = vec(S), then λ1n=1NNmet(Xmetn)Xmetn22 is a function of X1, which we denote as g1(X1). Similarly, λ2n=1NNmm(Xmmn)Xmmn22, is a function of X2, which we denote as g2(X2). The remaining terms, dΩ{FX0}22+λ3DwB¯X0F2, can be written as a function of the vectorized variable X0, which we denote as (X0). The constraint B ⊙ (Xmet + Xmm) = S can be rewritten as AX1 + AX2 = X0 for a certain matrix A. Then our optimization problem in Eq. (7) can be written in the general form of

minX1X1,X2X2,X0g1(X1)+g2(X2)+l(X0) s.t. AX1+AX2=X0. (15)

Recognizing that this is a special case of the sharing problem in [53], we apply the convergence result provided in this work. The detailed proof is provided in the supplementary materials.

VI. Discussion

We have successfully combined the physics-based data acquisition model with learned low-dimensional models for effective metabolite and macromolecule separation. Our unique strategy of combining supervised and unsupervised learning to discover component-specific low-dimensional manifolds is a novel attempt motivated by the nature of spectroscopic data. The proposed reconstruction represents a rigorous approach to leverage deep learning to solve this long-standing challenge. In contrast to the existing methods that learn end-to-end mappings, the proposed method allows the use of a general (k, t)-space sampling operator with high flexibility in the choices of sampling designs and SNR levels, and the ability to account for B0 inhomogeneity. While significant noise reduction can be observed in the separated metabolite and MM spectra due to the inherent denoising capability of the low-dimensional models used, it should be noted that the proposed method is focused on addressing the signal separation problem and not a substitute for spectral quantification. Our hypothesis is that a better separation will lead to improved spectral quantification of different components of interest, which has been supported by both simulation and experimental results. Meanwhile, we expect that the proposed method can be readily integrated with other more sophisticated parametric models for data generation and representation learning (both metabolites and MMs) as well as advanced quantification strategies for the separated signal components. This is beyond the scope of this work but will be investigated in future research.

While in this work we have only considered nine metabolites that are the common molecules of interest in most brain MRSI studies, especially at 3T, the proposed model learning and reconstruction methodologies are not limited by the number of metabolites considered. Adding more metabolites into the model will increase the model order to achieve the same approximation accuracy but will not cause substantially higher computation burden. For the other 1H metabolites, it will be very challenging to quantify them reliably given the SNR level and ignoring them has a minimal bias (due to their weak signals), hence we did not include them. But more metabolites can be considered when we adapt our method for data with higher SNRs from higher field strengths (e.g., 7T). One additional thing to note is that the learned MM-specific model should be able to capture potential residual lipids spectrally overlapping with the MM peaks (if sufficient lipid removal can be achieved), because of the lineshape and frequency variations introduced when generating the MM training data. Thus, small lipid residuals will not affect the model’s representation capability or metabolite separation.

One important issue with the proposed method is the choice of regularization parameters. In our current implementation for practical data, we first performed a single voxel separation for the selection of λ1 and λ2 using the parameter values from the phantom studies. The third parameter λ3 was then initialized based on the simulation studies and adjusted according to the discrepancy principle. Some minor fine-tuning together with λ1 and λ2 was performed using visual inspection of the separation reconstruction to balance SNR improvement and smoothing effects (the relative ratios between the three parameters remained the same during this step). More sophisticated parameter selection strategies can be explored in future research. Moreover, the formulation can readily be extended to incorporate other spatiospectral constraints for improved reconstruction.

The current fully-connected network-based DAE structures and the way of handling complex-valued data may be limited in scalability. We have investigated convolutional structures (with reduced numbers of training parameters) for both FID and spectral data and found that they were not as effective as our current DAEs in terms of dimensionality reduction. Various combinations of fully-connected and convolutional feature extraction layers, as well as choices of activation functions, are currently being explored. The current training data generation processes used relatively simplified spectral parameter distributions. While producing strong performance, this strategy does not fully exploit the information available from experimental 1H spectroscopy data. Estimation of more sophisticated distributions using such data will be studied in future work, e.g., using kernel density estimation [54].

Although Cartesian k-space sampling has been used to demonstrate the utility of the proposed method, other sampling trajectories can be considered by generalizing the forward encoding operator without having to retrain the models (another unique advantage of our approach). The most computationally expensive step in the current algorithm is solving Eq. (8) and Eq. (9) that involves backpropagation. This is, however, a highly parallelizable process that can significantly benefit from translating the current implementation to parallel computing platforms.

VII. Conclusion

We have presented a new method to reconstruct and separate metabolite and MM signals for short-TE 1H-MRSI by learning the two signal components’ distinct nonlinear low-dimensional models and using the learned models as priors for reconstruction. The models were learned using two deep autoencoder based neural networks to accurately capture metabolite and MM-specific low-dimensional manifolds of their high-dimensional spectral variations. A constrained spatiospectral reconstruction formulation that exploits the learned models for signal separation was proposed and solved by an efficient ADMM-based algorithm. Significantly improved separation over the standard parametric fitting approach has been demonstrated using both simulated and experimental short-TE brain 1H-MRSI data. Theoretical analysis of the proposed formulation and algorithm was also provided.

Supplementary Material

supp1-3048933

Acknowledgments

This work was supported in part by NSF under Grant CBET-1944249 and Grant CCF-1755847 and in part by NIH, under Grant 1R21EB029076A.

Footnotes

1

For the empirical computation, we often set X1, X2 to be CN×T; but due to technical consideration, in the convergence results, we set them to be bounded sets.

This article has supplementary downloadable material available at https://doi.org/10.1109/TMI.2020.3048933, provided by the authors.

Contributor Information

Ruoyu Sun, Department of Industrial and Enterprise Systems Engineering, University of Illinois Urbana–Champaign, Urbana, IL 61801 USA..

Fan Lam, Department of Bioengineering, University of Illinois Urbana–Champaign, Urbana, IL 61801 USA; Beckman Institute for Advanced Science and Technology, University of Illinois Urbana–Champaign, Urbana, IL 61801 USA.

References

  • [1].Wilson M et al. , “Magnetic resonance spectroscopy metabolite profiles predict survival in paediatric brain tumours,” Eur. J. Cancer, vol. 49, no. 2, pp. 457–464, January. 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [2].Luyten PR et al. , “Metabolic imaging of patients with intracranial tumors: H-1 MR spectroscopic imaging and PET,” Radiology, vol. 176, no. 3, pp. 791–799, September. 1990. [DOI] [PubMed] [Google Scholar]
  • [3].Davison JE et al. , “MR spectroscopy-based brain metabolite profiling in propionic acidaemia: Metabolic changes in the basal ganglia during acute decompensation and effect of liver transplantation,” Orphanet J. Rare Diseases, vol. 6, no. 1, p. 19, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Colla M et al. , “MR spectroscopy in Alzheimer’s disease: Gender differences in probabilistic learning capacity,” Neurobiol. Aging, vol. 24, no. 4, pp. 545–552, July. 2003. [DOI] [PubMed] [Google Scholar]
  • [5].Su L, Blamire AM, Watson R, He J, Hayes L, and O’Brien JT, “Whole-brain patterns of 1H-magnetic resonance spectroscopy imaging in Alzheimer’s disease and dementia with Lewy bodies,” Transl. Psychiatry, vol. 6, no. 8, p. e877, August. 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].Mlynárik V, Gambarota G, Frenkel H, and Gruetter R, “Localized short-echo-time proton MR spectroscopy with full signal-intensity acquisition,” Magn. Reson. Med, vol. 56, no. 5, pp. 965–970, November. 2006. [DOI] [PubMed] [Google Scholar]
  • [7].Scheenen TWJ, Klomp DWJ, Wijnen JP, and Heerschap A, “Short echo time 1H-MRSI of the human brain at 3 T with minimal chemical shift displacement errors using adiabatic refocusing pulses,” Magn. Reson. Med, vol. 59, no. 1, pp. 1–6, January. 2008. [DOI] [PubMed] [Google Scholar]
  • [8].Henning A, Fuchs A, Murdoch JB, and Boesiger P, “Slice-selective FID acquisition, localized by outer volume suppression (FIDLOVS) for 1H-MRSI of the human brain at 7 T with minimal signal loss,” NMR Biomed, vol. 22, no. 7, pp. 683–696, August. 2009. [DOI] [PubMed] [Google Scholar]
  • [9].Bogner W, Gruber S, Trattnig S, and Chmelik M, “High-resolution mapping of human brain metabolites by free induction decay 1H MRSI at 7 T,” NMR Biomed, vol. 25, no. 6, pp. 873–882, June. 2012. [DOI] [PubMed] [Google Scholar]
  • [10].Gasparovic C et al. , “Test-retest reliability and reproducibility of short-echo-time spectroscopic imaging of human brain at 3 T,” Magn. Reson. Med, vol. 66, no. 2, pp. 324–332, August. 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [11].Cudalbu C, Mlynárik V, and Gruetter R, “Handling macromolecule signals in the quantification of the neurochemical profile,” J. Alzheimer’s Disease, vol. 31, no. 3, pp. S101–S115, September. 2012. [DOI] [PubMed] [Google Scholar]
  • [12].Penner J and Bartha R, “Semi-LASER 1H MR spectroscopy at 7 Tesla in human brain: Metabolite quantification incorporating subject-specific macromolecule removal,” Magn. Reson. Med, vol. 74, no. 1, pp. 4–12, July. 2015. [DOI] [PubMed] [Google Scholar]
  • [13].Povavzan M et al. , “Mapping of brain macromolecules and their use for spectral processing of 1H-MRSI data with an ultra-short acquisition delay at 7 T,” NeuroImage, vol. 121, pp. 126–135, November. 2015. [DOI] [PubMed] [Google Scholar]
  • [14].Birch R, Peet AC, Dehghani H, and Wilson M, “Influence of macromolecule baseline on 1H MR spectroscopic imaging reproducibility,” Magn. Reson. Med, vol. 77, no. 1, pp. 34–43, January. 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Seeger U, Klose U, Mader I, Grodd W, and Nägele T, “Parameterized evaluation of macromolecules and lipids in proton MR spectroscopy of brain diseases,” Magn. Reson. Med, vol. 49, no. 1, pp. 19–28, January. 2003. [DOI] [PubMed] [Google Scholar]
  • [16].Graham GD, Hwang J-H, Rothman DL, and Prichard JW, “Spectroscopic assessment of alterations in macromolecule and small-molecule metabolites in human brain after stroke,” Stroke, vol. 32, no. 12, pp. 2797–2802, December. 2001. [DOI] [PubMed] [Google Scholar]
  • [17].Craveiro M, Clement-Schatlo V, Marino D, Gruetter R, and Cudalbu C, “In vivo brain macromolecule signals in healthy and glioblastoma mouse models: 1H magnetic resonance spectroscopy, post-processing and metabolite quantification at 14.1 T,” J. Neurochem, vol. 129, pp. 806–815, June. 2014. [DOI] [PubMed] [Google Scholar]
  • [18].Kunz N, Cudalbu C, Mlynarik V, Hüppi PS, Sizonenko SV, and Gruetter R, “Diffusion-weighted spectroscopy: A novel approach to determine macromolecule resonances in short-echo time 1H-MRS,” Magn. Reson. Med, vol. 64, no. 4, pp. 939–946, June. 2010. [DOI] [PubMed] [Google Scholar]
  • [19].Behar KL, Rothman DL, Spencer DD, and Petroff OAC, “Analysis of macromolecule resonances in 1H NMR spectra of human brain,” Magn. Reson. Med, vol. 32, no. 3, pp. 294–302, September. 1994. [DOI] [PubMed] [Google Scholar]
  • [20].Mader I, Seeger U, Karitzky J, Erb M, Schick F, and Klose U, “Proton magnetic resonance spectroscopy with metabolite nulling reveals regional differences of macromolecules in normal human brain,” J. Magn. Reson. Imag, vol. 16, no. 5, pp. 538–546, November. 2002. [DOI] [PubMed] [Google Scholar]
  • [21].Snoussi K et al. , “Comparison of brain gray and white matter macromolecule resonances at 3 and 7 Tesla,” Magn. Reson. Med, vol. 74, no. 3, pp. 607–613, September. 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [22].Giapitzakis I, Kreis R, and Henning A, “Characterization of the macro-molecular baseline with a metabolite-cycled double-inversion recovery sequence in the human brain at 9.4 T,” in Proc. Annu. Conf. ISMRM, 2016, p. 16. [Google Scholar]
  • [23].Gottschalk M, Troprès I, Lamalle L, Grand S, Bas J-FL, and Segebarth C, “Refined modelling of the short-T2 signal component and ensuing detection of glutamate and glutamine in short-TE, localised, 1H MR spectra of human glioma measured at 3 T,” NMR Biomed, vol. 29, no. 7, pp. 943–951, July. 2016. [DOI] [PubMed] [Google Scholar]
  • [24].Provencher SW, “Estimation of metabolite concentrations from localized in vivo proton NMR spectra,” Magn. Reson. Med, vol. 30, pp. 672–679, December. 1993. [DOI] [PubMed] [Google Scholar]
  • [25].Provencher SW, “Automatic quantitation of localized in vivo 1H spectra with LCModel,” NMR Biomed, vol. 14, no. 4, pp. 260–264, 2001. [DOI] [PubMed] [Google Scholar]
  • [26].Young K, Soher BJ, and Maudsley AA, “Automated spectral analysis II: Application of wavelet shrinkage for characterization of non-parameterized signals,” Magn. Reson. Med, vol. 40, no. 6, pp. 816–821, December. 1998. [DOI] [PubMed] [Google Scholar]
  • [27].Ratiney H, Coenradie Y, Cavassila S, van Ormondt D, and Graveron-Demilly D, “Time-domain quantitation of 1H short echo-time signals: Background accommodation,” MAGMA Magn. Reson. Mater. Phys., Biol. Med, vol. 16, no. 6, pp. 284–296, May 2004. [DOI] [PubMed] [Google Scholar]
  • [28].Zhang Y and Shen J, “Smoothness of in vivo spectral baseline determined by mean-square error,” Magn. Reson. Med, vol. 72, no. 4, pp. 913–922, October. 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [29].Lee HH and Kim H, “Parameterization of spectral baseline directly from short echo time full spectra in 1H-MRS,” Magn. Reson. Med, vol. 78, no. 3, pp. 836–847, 2017. [DOI] [PubMed] [Google Scholar]
  • [30].Lam F, Li Y, Clifford B, and Liang Z, “Macromolecule mapping of the brain using ultrashort-TE acquisition and reference-based metabolite removal,” Magn. Reson. Med, vol. 79, no. 5, pp. 2460–2469, May 2018. [DOI] [PubMed] [Google Scholar]
  • [31].Ma C, Lam F, Ning Q, Johnson CL, and Liang Z-P, “High-resolution 1H-MRSI of the brain using short-TE SPICE,” Magn. Reson. Med, vol. 77, no. 2, pp. 467–479, February. 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [32].Peyré G, “Manifold models for signals and images,” Comput. Vis. Image Understand, vol. 113, no. 2, pp. 249–260, February. 2009. [Google Scholar]
  • [33].Hinton GE and Salakhutdinov RR, “Reducing the dimensionality of data with neural networks,” Science, vol. 313, no. 5786, pp. 504–507, 2006. [DOI] [PubMed] [Google Scholar]
  • [34].Masci J, Meier U, Ciresan D, and Schmidhuber J, “Stacked convolutional auto-encoders for hierarchical feature extraction,” in Artificial Neural Networks and Machine Learning—ICANN 2011, Honkela T, Duch W, Girolami M, and Kaski S, Eds. Berlin, Germany: Springer, 2011, pp. 52–59. [Google Scholar]
  • [35].Bengio Y, Courville A, and Vincent P, “Representation learning: A review and new perspectives,” IEEE Trans. Pattern Anal. Mach. Intell, vol. 35, no. 8, pp. 1798–1828, August. 2013. [DOI] [PubMed] [Google Scholar]
  • [36].Gurbani SS, Sheriff S, Maudsley AA, Shim H, and Cooper LAD, “Incorporation of a spectral model in a convolutional neural network for accelerated spectral fitting,” Magn. Reson. Med, vol. 81, no. 5, pp. 3346–3357, May 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [37].Lee HH and Kim H, “Intact metabolite spectrum mining by deep learning in proton magnetic resonance spectroscopy of the brain,” Magn. Reson. Med, vol. 82, no. 1, pp. 33–48, July. 2019. [DOI] [PubMed] [Google Scholar]
  • [38].Lee HH and Kim H, “Deep learning-based target metabolite isolation and big data-driven measurement uncertainty estimation in proton magnetic resonance spectroscopy of the brain,” Magn. Reson. Med, vol. 84, no. 4, pp. 1689–1706, October. 2020. [DOI] [PubMed] [Google Scholar]
  • [39].Hatami N, Sdika M, and Ratiney H, “Magnetic resonance spectroscopy quantification using deep learning,” in Proc. Med. Image. Comput. Comput. Assist. Interv, 2018, pp. 467–475. [Google Scholar]
  • [40].Lam F, Li Y, and Peng X, “Constrained magnetic resonance spectroscopic imaging by learning nonlinear low-dimensional models,” IEEE Trans. Med. Imag, vol. 39, no. 3, pp. 545–555, March. 2020. [DOI] [PubMed] [Google Scholar]
  • [41].Eslami R and Jacob M, “Robust reconstruction of MRSI data using a sparse spectral model and high resolution MRI priors,” IEEE Trans. Med. Imag, vol. 29, no. 6, pp. 1297–1309, June. 2010. [DOI] [PubMed] [Google Scholar]
  • [42].Vanhamme L, van den Boogaart A, and Van Huffel S, “Improved method for accurate and efficient quantification of MRS data with use of prior knowledge,” J. Magn. Reson, vol. 129, no. 1, pp. 35–43, November. 1997. [DOI] [PubMed] [Google Scholar]
  • [43].Lam F, Li Y, Guo R, Clifford B, and Liang Z, “Ultrafast magnetic resonance spectroscopic imaging using SPICE with learned subspaces,” Magn. Reson. Med, vol. 83, no. 2, pp. 377–390, February. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [44].Qu X et al. , “Accelerated nuclear magnetic resonance spectroscopy with deep learning,” Angew. Chem. Int. Ed, vol. 59, no. 56, pp. 10297–10300, 2020. [DOI] [PubMed] [Google Scholar]
  • [45].Haldar JP, Hernando D, Song S-K, and Liang Z-P, “Anatomically constrained reconstruction from noisy data,” Magn. Reson. Med, vol. 59, no. 4, pp. 810–818, 2008. [DOI] [PubMed] [Google Scholar]
  • [46].Yang J and Zhang Y, “Alternating direction algorithms for 1-problems in compressive sensing,” SIAM J. Sci. Comput, vol. 33, no. 1, pp. 250–278, January. 2011. [Google Scholar]
  • [47].Li Y, Lam F, Clifford B, and Liang Z-P, “A subspace approach to spectral quantification for MR spectroscopic imaging,” IEEE Trans. Biomed. Eng, vol. 64, no. 10, pp. 2486–2489, October. 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [48].Stefan D et al. , “Quantitation of magnetic resonance spectroscopy signals: The jMRUI software package,” Meas. Sci. Technol, vol. 20, no. 10, October. 2009, Art. no. 104035. [Google Scholar]
  • [49].Kingma DP and Ba J, “Adam: A method for stochastic optimization,” 2014, arXiv:1412.6980. [Online]. Available: http://arxiv.org/abs/1412.6980 [Google Scholar]
  • [50].Nocedal J and Wright SJ, Numerical Optimization. Berlin, Germany: Springer, 2006. [Google Scholar]
  • [51].Graaf RAD, In vivo NMR Spectroscopy: Principles and Techniques. Hoboken, NJ, USA: Wiley, 2019. [Google Scholar]
  • [52].Ma C, Lam F, Johnson CL, and Liang Z-P, “Removal of nuisance signals from limited and sparse 1H MRSI data using a union-of-subspaces model,” Magn. Reson. Med, vol. 75, no. 2, pp. 488–497, February. 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [53].Hong M, Luo Z-Q, and Razaviyayn M, “Convergence analysis of alternating direction method of multipliers for a family of nonconvex problems,” SIAM J. Optim, vol. 26, no. 1, pp. 337–364, January. 2016. [Google Scholar]
  • [54].Vapnik V and Mukherjee S, “Support vector method for multivariate density estimation,” in Proc. Adv. Neural. Inf. Process. Syst, 1999, pp. 659–665. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supp1-3048933

RESOURCES