Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Oct 20.
Published in final edited form as: IEEE Trans Biomed Eng. 2012 Oct 9;60(1):78–89. doi: 10.1109/TBME.2012.2223466

Denoising MR Spectroscopic Imaging Data With Low-Rank Approximations

Hien M Nguyen 1,, Xi Peng 2, Minh N Do 3, Zhi-Pei Liang 4
PMCID: PMC3800688  NIHMSID: NIHMS508541  PMID: 23070291

Abstract

This paper addresses the denoising problem associated with magnetic resonance spectroscopic imaging (MRSI), where signal-to-noise ratio (SNR) has been a critical problem. A new scheme is proposed, which exploits two low-rank structures that exist in MRSI data, one due to partial separability and the other due to linear predictability. Denoising is performed by arranging the measured data in in appropriate matrix forms (i.e., Casorati and Hankel) and applying low-rank approximations by singular value decomposition (SVD). The proposed method has been validated using simulated and experimental data, producing encouraging results. Specifically, the method can effectively denoise MRSI data in a wide range of SNR values while preserving spatial-spectral features. The method could prove useful for denoising MRSI data and other spatial-spectral and spatial-temporal imaging data as well.

Index Terms: Denoising, linear prediction, low-rank approximation, MR spectroscopic imaging (MRSI), MR spectroscopy, partially separable functions

I. Introduction

The acquired magnetic resonance spectroscopic (MRS) signal in (k, t)-space can be expressed as

s(k,t)=ρ(r,f)ei2πk·rei2πftdrdf+ξ(k,t) (1)

where ρ(r, f) denotes the desired spatial-spectral function and ξ(k, t) is the measurement noise often modeled as a complex white Gaussian process. The function ρ(r, f) provides valuable information on the spatial-spectral distribution of metabolites, and is useful for noninvasive metabolite imaging of living systems. For example, 13C magnetic resonance spectroscopic imaging (MRSI) can be used to study glucose metabolism [1]; 31P MRSI is capable of detecting metabolites participating in tissue energy metabolism [1]; 1H MRSI can map out the spatial distributions of N-Acetylaspartate (NAA), creatine, choline, and lactate metabolites that are useful for the investigation of neurological disorders [2]. However, considerable practical challenges remain in obtaining ρ(r, f) in both high spatial-spectral resolution and high signal-to-noise ratio (SNR). These difficulties are due to acquisition time limitations and low concentrations of metabolites (typically thousands fold below that of tissue water [3]). This paper addresses the low SNR problem.

A straightforward way to improve the SNR of MRSI data is to acquire multiple sets of measurements for signal averaging, but at the expense of lengthening the already long data acquisition time. Another approach is to apply a linear shift-invariant filter such as a Gaussian smoothing filter. However, such an approach often has poor tradeoff between spatial-spectral resolution and SNR. For a better tradeoff between SNR and resolution, many advanced denoising methods proposed for general signal processing applications can be used. Most notable in this class are transform-based methods (e.g., wavelet shrinkage [4], singular value decomposition (SVD) truncation [5], [6], etc.) and PDE-based methods [7]. These methods utilize known signal properties such as piecewise smoothness for feature-preserving denoising. They are effective when the SNR is beyond some threshold, but can not well separate signal from noise in the presence of severe noise contamination, as it is often the case with practical MRSI data.

There are also several denoising methods specifically designed for MRSI signals. In [8], for instance, MRS signals are denoised by consecutive projection on to different domains, represented by a set of linear time-frequency transforms. Explicit parametric models have also been used for denoising MRSI data either prior to or during metabolite quantitation [9]–[13]. These methods are very effective when the models are correct. However, generic parametric models that fully account for all spectral features are not yet available and incorrect models often create significant bias which can be very problematic in practical applications. For example, a popular denoising method described in [9] and originally proposed in [14] makes a strict use of the Lorentzian lineshape of the spectral peaks and is very sensitive to noise. Less-constrained approaches were proposed in [15] and [16].

In this study, we propose a new scheme for MRSI denoising, coined low rank approximations (LORA), by utilizing the low-rank structures of the spatial-spectral data. Specifically, we incorporate low-rank approximation in (k, t) domain by assuming that spatial variations are separable from temporal variations to some low order [17]. In addition, we exploit the low-rank structure of the temporal signal due to its linear predictability. The use of these two low-rank properties provides efficient spatial-spectral filtering. Our preliminary results were reported in [18]. This paper presents a more detailed description and a more thorough evaluation of the proposed method.

The rest of this paper is organized as follows. Section II presents the proposed method, including a description of the low-rank properties and the associated denoising algorithm. Section III discusses algorithm considerations. Section IV shows simulation and experimental results illustrating denoising performance of the proposed algorithm, followed by the conclusion in Section V. For easy reference, the following is a list of the key symbols used in this paper:

ρ(r, f) spatial-spectral function of a spin system;
ρ(f) MRSI spectrum at an arbitrary spatial location;
s0 (k, t) ideal noiseless MRSI signals;
s(k, t) measured MRSI signals;
ξ(k, t) noise in the measured MRSI signals;
s(t) MRSI signal at an arbitrary spatial location;
L1 order of the partially separable (PS) model;
L2 order of the linear predictive (LP) model;
cl (r) spatial basis of the PS model;
ψl (f) spectral basis of the PS model;
ĉl (k) Fourier transform of cl (r);
ψ̂l (t) inverse Fourier transform of ψl (f);
αl coefficients of the mixture of the Lorentzians model;
φl Lorentzian spectral function;
C Casorati matrix;
H Hankel matrix;
A a general matrix;
λ singular values of a matrix.

II. Proposed Method

The proposed method is based on low-rank approximations of MRSI data. Specifically, it exploits two low-rank properties (one due to partial separability and the other due to linear predictability) for noise removal.

A. Low Rankness Due to Spatiotemporal Partial Separability

The spatial-spectral distribution function ρ(r, f) can be expressed as

ρ(r,f)=l=1L1cl(r)ψl(f) (2)

where cl (r) can be viewed as the spatial basis and ψl (f) as the spectral basis for ρ(r, f). This model is called L1 th-order partially separable (between space and frequency). Equivalently, the noiseless MRSI data s0 (k, t) can be expressed as partially separable between k and t to the L1 th order

s0(k,t)=l=1L1ĉl(k)ψ̂l(t) (3)

where ĉl (k) and ψ̂l (t) are related to cl (r) and ψl (f) in (2) by the Fourier transform, respectively. In (3), we assume that field inhomogeneity effects have been previously removed. This task can be achieved using a range of field-correction techniques, such as those in [19], [20], and [21].

Validity of the PS model (2) can be justified as follows. Noting that ρ(r, f) is an ℒ2-function, model (2) is always valid for L1 = ∞. In practice, the PS model is valid for a finite (small) L1 because there is a finite number of resonances (spectral components) in any practical MRS experiment. In this case, function ψl (f) can be viewed as the spectral function of the lth resonance component and cl (r) is its corresponding spatial distribution.

Model (2) also arises when ρ(r, f) can be decomposed into a summation of compartmental spectral functions. This simplified form of the PS model was previously used in [22] for spectroscopic imaging. In this case, L1 is the number of compartments, cl (r) = 1, and

ψl(f)={1Vlρ(r,f)dr,r𝒟l0,otherwise (4)

where 𝒟l represents the lth compartment and Vl is its corresponding volume. While compartmental spatial-spectral functions are partially separable, the PS model in (2) represents a much broader class of functions and does not impose compartmental homogeneity as in [22], which is often problematic for practical MRSI data [23].

An important property of model (3) is that the Casorati matrix formed from s0 (k, t) samples is low rank. Specifically, let

C0=[s0(k1,t1)s0(k1,t2)s0(k1,tM)s0(k2,t1)s0(k2,t2)s0(k2,tM)s0(kN,t1)s0(kN,t2)s0(kN,tM)] (5)

for any point set {s0(kn,tm)}n=1,m=1N,M. Then, model (3) implies that C0 has at most rank L1 [17].

In practice, L1 is much smaller than min (N,M) due to the small number of spectral components (about five to ten commonly MR-observable metabolites, four resonance components from macromolecules, and five components from lipids in the human brain [24], [25]). This enables the use of the low-rank property for effective denoising.

B. Low Rankness Due to Linear Predictability

The time-domain signal of a spin system with L2 spectral components resonating at frequency fl with damping factor γl can be expressed as [3]

s0(t)=l=1L2αle(γl+j2πfl)t. (6)

Equivalently in the frequency domain, the spectrum consists of L2 Lorentzian resonance lines φl (f):

ρ(f)=l=1L2αlφl(f)

where

φl(f)=1/γl1+4π2(f+fl)2/γl2j2π(f+fl)/γl21+4π2(f+fl)2/γl2. (7)

In practice, resonance lines φl (f) may deviate from the Lorentzian lineshape due to magnetic field inhomogeneity, limited spatial resolution, etc. So, an associated practical question is whether the observed resonance lines can be well represented using (7). This question was previously raised in [3] and [26], and it was found that any observed lines can be fitted using a linear combination of Lorentzian basis functions. For example, the broad resonance from residual water was found to be well represented using 3–10 Lorentzians [3]. Therefore, (7) is valid (in a mathematical sense), although in this case L2 does not physically represent the number of spectral peaks. In practice, L2 is larger than the number of spectral peaks but much smaller than the number of available samples M.

An important property of model (6) is that the discrete timedomain signal is linearly predictable, i.e.,

s0[m]=l=1L2βls0[ml] (8)

where s0 [m] = s0 (mΔt) with Δt being the sampling interval. A Hankel matrix formed from s0 [m] is low rank. Specifically, {s0[m]}m=1M is L2th-order linearly predictable as defined in (8) if and only if the following Hankel matrix

H0=[s0[1]s0[K]s0[2]s0[K+1]s0[MK+1]s0[M]] (9)

has rank L2 [27].

C. Proposed Algorithm

For the measured noisy data s(k, t) = s0 (k, t) + ξ(k, t), the corresponding Casorati and Hankel matrices in (5) and (9) can be expressed, respectively, as C = C0 + EC and H = H0 + EH, where EC and EH denote noise matrices. The proposed algorithm reduces ξ(k, t) by performing low-rank approximation on C and H, respectively.

1) Low-rank matrix approximation

Assume that an arbitrary rank-L matrix A0 is perturbed by noise (denoted by a random matrix E) as

A=A0+E. (10)

In the case when rank L is known a priori, the low-rank approximation of A is given by

Ā=argminrank(Â)=LAÂ (11)

where the norm ∥ · ∥denotes either spectral or Frobenius norm. It is well known that Ā can be obtained using singular value decomposition (SVD) as

Ā=l=1Lλl(A)ulυlH (12)

where λl ,ul, and vl are the singular values, left singular vectors, and right singular vectors of A, respectively.

2) Rank determination

Case 1 (A0 is a general N × M low-rank matrix and E is an N × M random Gaussian matrix): In this case, L can be determined using the results from random matrix theory. Specifically, we first estimate noise standard deviation σ0 from the measured data s(kn, tm) and use σ0 to estimate ∥E2 based on the Marchenko–Pastur distribution of the eigenvalues of EHE [28]. We then choose rank so that

λ+1(A)E2λ(A). (13)

Case 2 (A0 is a low-rank Hankel matrix and E is a Gaussian Hankel matrix) : The rank selection problem in this case has been well studied and is related to the model-order estimation problem for autoregressive models. The rank can be determined using various criteria such as the akaike information criterion (AIC) [29]. Specifically, is chosen to yield a minimum change in AIC() − AIC( + 1), where

AIC()=Mloge()+2 (14)

and e() is the error of a least-squares fit of model (6) to the noisy data, given a particular candidate value .

3) Denoising by Low-Rank Approximations (LORA)

The proposed denoising algorithm consists of two key steps: PS-based low-rank filtering and LP-based low-rank filtering, which are summarized next.

  1. Given noisy data s(kn, tm), construct matrix C according to (5) and solve the following optimization problem:
    =argminrank(Ĉ)=L1CĈ (15)
    by performing rank-L1 approximation on C using SVD.
  2. Take the 2-D discrete Fourier transform (DFT) of each column of matrix to obtain (rn, tm).

  3. For each voxel rn, construct matrix H from (rn, tm) according to (9) and solve the following optimization problem:
    =argminrank(Ĥ)=L2HĤ (16)
    by performing rank-L2 approximation using SVD.
  4. Denoised data at each voxel rn are obtained by extracting the elements from the first row and last column of .

III. Algorithm Considerations

A. Improved LP-Based Low-Rank Filtering

Our empirical results indicate that PS-based low-rank filtering can be applied in a wide range of SNRs, while the performance of LP-based low-rank filtering is rather sensitive to the noise level. One reason for the high sensitivity of LP-based low-rank filtering is that the signal subspace in the Hankel matrix H is usually not well conditioned and is not well separated from the noise subspace. This is a well-known limitation with LP-based analysis of MRSI data [9], [14]. We have observed that for the low SNR values in typical in vivo MRSI data, the LP-based low-rank filtering step alone often resulted in spectral artifacts such as loss of signal components and introduction of spurious peaks. This is especially the case if both low-rank and Hankel structures are imposed on Ĥas it is done in the Cadzow signal enhancement algorithm [14], which has been used for denoising [4], [9], [10]. By not imposing the Hankel structure on Ĥ in step 3 of the proposed algorithm, the “rank” constraint is actually weakly imposed. This “softening” of the LP-based low-rank filtering seems to enable the algorithm to accommodate non-Lorentzian lines and avoid spectral artifacts in low SNR cases. In addition, it is important that the PS-based lowrank filtering is applied first, which is followed by the LP-based low-rank filtering. In this way, the data SNR is improved before LP-based low-rank filtering is applied. Another step that can further help to reduce the noise sensitivity of LP-based low-rank filtering is to perform “segmented” low-rank filtering. Specifically, instead of forming a single Hankel matrix using the entire signal, we can divide a given signal {s[m]}m=0N1 into two (or multiple) segments, say {s[m]}m=0N/21 and {s[m]}m=N/2N1, and then form two (or multiple) Hankel matrices, each of which will be processed using the proposed LP-based low-rank filtering.

Fig. 1 shows a set of simulation results to illustrate the sensitivity (to noise and model order) of LP-based low-rank filtering. Fig. 1(a) shows ground truth (red) and noisy (black) spectra at two SNR levels. The denoised spectra in Fig. 1(b)–(e) were obtained using LP-based low-rank filtering with the Hankel constraint (as is done in Cadzow [14]) and without the Hankel constraint (step 2 of LORA), respectively, for two model orders (L2 = 20 and 8). Note that in the high SNR case, the Hankel constraint helped to produce almost “perfect” denoised result with a correct model order [top row, (b)] but lost some spectral components with an underestimated model order [top row, (d)]. In the low SNR case, the Hankel constraint introduced spurious spectral features [bottom row, (b)] even with the correct model order. The problem can be alleviated with a small model order, but at the expense of missing spectral components [such as the loss of the smallest peak at 6.05 ppm and oversmoothing of the non-Lorentzian, closely spaced spectral peaks in the regions from 2.2 to 2.8 ppm and from 3.3 to 3.8 ppm; see the bottom row in (d)]. Both problems (spurious peaks and missing peaks) were alleviated by not imposing the Hankel structure [see (e)].

Fig. 1.

Fig. 1

Simulation results of LP-based low-rank filtering with and without Hankel constraint: (a) original (red) and noisy (black) spectra at low (I) and high (II) noise levels; (b) and (d) denoised spectra with Hankel constraint (Cadzow) for L2 = 20 and 8, respectively; (c) and (e) denoised spectra without Hankel constraint (step 2 of LORA) for L2 = 20 and 8, respectively. Note that the Hankel constraint helped reduce noise but at the expense of spectral artifacts.

B. Rank Selection

For PS-based low-rank filtering, in the case of high/moderate SNR (ideally as long as ∥E2 ≤ λL (A0) or in the range above 18 dB for the simulated 1H MRSI dataset described in Section IV), we choose rank L1 of C0 according to (13). To evaluate the performance of the described rank selection method, we performed rank selection on the simulated dataset with true rank L1 = 8. Specifically, we performed a Monte Carlo study with 9 different noise levels, 32 noise realizations per noise level, and estimated ∥E2 using the Marchenko–Pastur distribution ([28]). The effective rank 1 was selected according to (13). Fig. 7(a) shows some representative noisy spectra at the highest, medium, and lowest SNRs used in the experiment. For this experiment and the rest of this paper, we define SNR in terms of signal energy and signal amplitude as

SNRe=10log10s02ss02SNRp=amplitude of the largest peakstandard deviation of the noise (17)

where s and s0 are the noisy and noiseless signals, respectively.

Fig. 7.

Fig. 7

Monte Carlo study of low-rank filtering based on partial separability: (a) representative noisy and denoised spectra at three SNR levels; (b) error distributions calculated as the histograms of e and ē according to (21) at a particular location (r0, f0). For this illustrative example, r0 is a representative voxel in the white matter region marked as “2” in Fig. 3(e) and f0 is the frequency point marked in Fig. 7(a). Notice the reduced noise variances achieved by denoising.

Table I shows the mean effective rank 1, averaged over all the noise realizations. Notice that ∥E2 is closely estimated using the Marchenko–Pastur distribution at every SNR level, as can be seen from the reported relative error E2Ê2E2. The results also show that as the SNR decreases, the discussed approach underestimates L1. In practice, the choice of L1 can also be guided by the known number of spectral components present in the spectrum [30].

TABLE 1.

Monte Carlo Experiment on Choosing Effective Rank 1 at Nine Different SNR Levels

SNRe (s(k,t)) 23.72 22.23 20.94 19.83 18.85 17.97 17.17 16.45 15.78
SNRe (s(f)) at voxel 1 6.42 5.17 4.45 3.83 3.26 2.83 2.27 1.99 1.86
SNRe (s(f)) at voxel 2 32.46 31.31 29.96 28.52 27.82 27.01 26.22 25.41 24.83
E2Ê2E2
0.0057 0.0051 0.0053 0.0058 0.0046 0.0050 0.0054 0.0050 0.0058
L1 8 8 7.03 6.63 6.06 6.00 5.25 5.03 4.97

True rank L1 = 8. Voxel 1 is in the region of low SNR, while voxel 2 is in the region of high SNR. All SNR values are reported in decibels.

For LP-based low-rank filtering, the choice of rank L2 of matrix H0 is based on the AIC criteria as specified in (14). We performed a Monte Carlo study on rank L2 selection with the simulated dataset at 3 different noise levels, for each noise level 256 noise realizations were considered. The corresponding representative noisy spectra are shown in Fig. 8(a). Fig. 2 shows the computed mean AIC values (averaged over the noise realizations) as a function of candidate rank values. The effective rank 2 is typically chosen to minimize AIC. However, this usually leads to an overestimated rank [29]. To achieve a closer estimate, we suggest choosing the rank to be the threshold point at which there is no significant reduction in the AIC value when the rank is further increased. In practice, the choice of L2 can also be guided by the known number of spectral peaks present in the spectrum.

Fig. 8.

Fig. 8

Monte Carlo study of low-rank filtering based on linear predictability: (a) representative noisy and denoised spectra at three noise levels; (b)–(d) error distributions calculated as the histograms of e and ē according to (21) at particular locations (r0, f1), (r0, f2), and (r0, f3), respectively. For this illustrative example, r0 is a representative voxel in the white matter region marked as “2” in Fig. 3(f) and f1, f2, and f3 are the frequency points marked in Fig. 8(a). Notice that the denoising performance is frequency dependent.

Fig. 2.

Fig. 2

Monte Carlo study of rank L2 selection based on AIC: top row shows the mean AIC value (averaged over 256 noise realizations) as a function of 2 at (a) low noise level with SNRe=26.61 dB and SNRp=94.4; (b) medium noise level with SNRe=17.03 dB and SNRp=30.4; (c) high noise level with SNRe=12.76 dB and SNRp=17.9. The true rank L2 in this experiment was 20.

Inaccurate rank estimation is more problematic for LP-based low-rank filtering than for PS-based low-rank filtering. Specifically, an overestimated rank L2 (2 > L2) can result in spurious peaks. Using an underestimated rank (without imposing the Hankel structure) can alleviate the problem. However, a highly underestimated rank 2 can lead to a loss of useful spectral information, which is undesirable. Typically, peaks with small amplitudes are distorted or lost first [31]. Possible peak loss due to rank underestimation can be alleviated by not imposing the Hankel constraint as discussed in Section III-A. For PS-based low-rank filtering, severe rank underestimation can lead to spatial blurring and spectral line broadening. An overestimated rank L1 in this case does not directly introduce bias and spurious peaks, but would reduce its filtering effectiveness.

IV. Results and Discussions

In this section, results from simulated and experimental data are presented to demonstrate the denoising performance of the proposed technique.

A. Simulations

We simulate a spatial-spectral distribution function based on literature values of 1H metabolites and experimental data. We consider five commonly MR-observable metabolites in the human brain [24], [30]: N-acetylaspartate (NAA), creatine (Cr), choline (Cho), glutamate/glutamine (Glx), and myo-inositol (m-Ins). For each metabolite, a spectral profile was obtained from quantum mechanical simulations of a spin-echo MR experiment [32]. The spatial distribution of each metabolite was created based on commonly reported literature values from three segmented regions of cerebrospinal fluid, gray matter (GM), and white matter (WM) [30], [33], [34]. To evaluate the ability of the proposed method to handle nonideal conditions, Gaussian (instead of pure Lorentzian) lineshape was used with an additional baseline signal. Lineshape parameters were chosen based on commonly reported values in the literature [35], [36]. The baseline signal was extracted from single-voxel CSI PRESS experimental data of the brain with TE=30 ms. MRSI data were simulated at the magnetic field strength of 3 T with a spectral bandwidth of 1200 Hz and complex Gaussian noise was added.

Figs. 3 and 4 show the spatial distribution of the spatialspectral function at a high SNR frequency of 3.94 ppm and a low SNR frequency of 6.66 ppm, both indicated in Fig. 5(f) as “f1” and “f2 ”, respectively. We compare the proposed approach with 3-D wavelet soft shrinkage (Daubechies 4-tap kernel filter, 4 levels), sparse 3-D transform-domain collaborative filtering [37] (code is available at http://www.cs.tut.fi/foi/GCFBM3D/), and conventional Gaussian apodization. The wavelet shrinkage threshold and apodization constant were chosen to yield the same level of the residual noise variance as that for the proposed approach (threshold T = 2.28σ0). It can be clearly seen that the corresponding wavelet-denoised spatial-spectral function in Figs. 3(c) and 4(c) has an improved SNR at the expense of blurring the spatial features significantly, while the proposed denoising preserves the spatial features better. Collaborative filtering performed well at a high SNR frequency such as the one shown in Fig. 3, but at a lower SNR frequency shown in Fig. 4 the spatial features such as edges in the GM region were smoothed out noticeably by the collaborative filtering while the residual noise variance in the denoised spectrum was larger than that of obtained from LORA [see Fig. 5(d)]. Gaussian apodization shown in Figs. 3(e) and 4(e) has the poorest tradeoff between SNR and resolution among all methods, as expected.

Fig. 3.

Fig. 3

Denoising results—spatial distributions ρ(r, f1) at a SNR frequency [3.94 ppm; see Fig. 5(f)] obtained from (a) noisy data; (b) LORA denoising; (c) wavelet denoising; (d) sparse 3-D transform-domain collaborative filtering; (e) Gaussian apodization; and (f) noiseless data. Corresponding errors of (b)–(e) are shown in (g)–(j), respectively.

Fig. 4.

Fig. 4

Denoising results—spatial distributions ρ(r, f2) at a low-SNR frequency [6.66 ppm; see Fig. 5(f)], obtained from (a) noisy data; (b) LORA denoising; (c) wavelet denoising; (d) sparse 3-D transform-domain collaborative filtering; (e) Gaussian apodization; and (f) noiseless data. Corresponding errors of (b)–(e) are shown in (g)–(j), respectively.

Fig. 5.

Fig. 5

Denoising results—spectra at a particular voxel, marked as “1” in Fig. 3(f), obtained from (a) noisy data; (b) LORA denoising; (c) wavelet shrinkage; (d) sparse 3-D transform-domain collaborative filtering; (e) Gaussian apodization; and (f) noiseless data. Corresponding errors of (b)–(e) are shown in (g)–(j), respectively. All spectra are shown in the real-valued mode.

Fig. 5 shows representative spectra from a particular voxel marked as “1” in Fig. 3(f). Both Gaussian apodization and wavelet denoising improve the SNR at an expense of line broadening and loosing metabolite amplitudes, with Gaussian apodization performing the worst. This amplitude loss can be seen from the corresponding residual spectra, which are not centered at zero, as shown in Fig. 5(h) and (j).

B. Denoising Performance

To analyze the denoising performance of the proposed low-rank filtering, we rewrite C and as C = C0 + E and = C0 +Δ, where C0 is an N × M data matrix with rank L1 as described in (5), E is the noise matrix, is formed from the denoised data as obtained from (15), and Δ is the residual error, which can contain both residual noise and possible signal loss due to low-rank filtering.Without loss of generality, we assume M < N. The denoised matrix is obtained according to (15). We define the noise reduction factor as

g=EFΔF. (18)

By defining the SNR before and after denoising as 20log10PsEF and 20log10PsΔF, where Ps denotes the signal power, the SNR gain can be calculated as 20 log10 g. Ideally, one would want g = ∞. In the full-rank case when L1 = M and no denoising is achieved, g is equal to 1.

Computing g as in (18) requires knowing the ground truth in order to compute ∥ΔF. However, notice that we only need to estimate the norm ∥ΔF rather than knowing Δ completely. Thus, one would hope that there is a signal-independent expression to predict ∥ΔF closely. In general, it is hard to accurately characterize singular values of C0 ,C, and . However, under assumptions that:

Cond1: the signal and noise (before and after denoising) subspaces are uncorrelated in the sense that C0H E = 0 and C0HΔ = 0;

Cond2: the noise is complex white, i.e., EHE and ΔH Δ are scalar multiples of identity matrix [38]; we can obtain the following approximate formula based on the theoretical results of SVD of noisy matrices in [38]:

g=i=1Mλi2(E)i=11λi2(E),if1L1. (19)

Expression (19) provides a signal-independent prediction of g. It suggests that 1) in the case of accurate rank estimation (1 = L1) and λi(E) are all equal, the noise reduction factor is of the order of M1; 2) in the case of overestimated rank (1 > L1), the residual noise level is larger than the residual noise level with correct rank determination. In practice, the conditions (Cond1) and (Cond2) which are required for (19) to hold are never satisfied exactly. However, we observed that a mild violation of these conditions may still give a rough guess of g, as shown in our simulation study discussed below.

Based on the simulated 1H MRSI dataset described in Section IV-A, we performed a Monte Carlo study to compute g according to (18), averaged over 32 noise realizations. The noise reduction factor g is plotted as a function of the estimated rank 1 in Fig. 6.We observe that when the rank is properly chosen, g reaches its maximum value (about 7.6 for the simulated dataset). As the rank is overestimated (1 > L1) and approaches the full rank, g decreases to 1, as expected. The reduction of g reflects the residual noise variance in the overestimated rank case and the signal loss in the underestimated rank case. A nice feature of this plot is that it shows how much of the noise reduction we can achieve even in the case when the rank is incorrectly estimated. For example, if the rank was overestimated to be 32, while the true rank is 8, then according to this plot, we still achieve a factor of about 3 (or 9.5 dB) of improvement in the SNR. Thus, for the sake of improving SNR, estimating rank precisely is not a critical point, as long as we do not underestimate the rank causing the signal loss. Note that with the considered dataset in Fig. 6, the relative error EHEMσ02IFMσ0=0.99 for condition (Cond1) and C0HEFC0HC0FEHEF=0.15 for condition (Cond2). These numbers did not significantly change as the SNR was varied from low to high [see Fig. 7(a)] for representative SNR levels. Conditions (Cond1) and (Cond2) for Δ are satisfied closer as approaches M. Predicted value from (19) is plotted in the same figure as the empirical value of g in Fig. 6. Notice that in the region 1 > L1 the theoretically predicted values of g approach its empirical value as 1 tends to the full-rank case.

Fig. 6.

Fig. 6

Noise reduction factor g for low-rank filtering based on partial separability as a function of the estimated rank 1. Empirical values of g (black) and corresponding theoretical approximations (red) were computed according to (18) and (19), respectively. The true rank L1 = 8. Notice that in the region 1 > L1, the theoretically predicted values of g tend to its empirical values and both approach 1 in the full-rank case.

We have performed a similar analysis on the noise reduction factor g, for the case of low-rank filtering of linear prediction matrix. However, to evaluate the level of SNR improvement for low-rank filtering of linear prediction matrix, we rather define the following factor:

h=ξ2η2 (20)

where ξ and η are vectors extracted from the first row and last column of E and Δ, respectively. We performed a Monte Carlo study with 256 noise realizations to compute h at different noise levels shown in Fig. 8(a). The reduction factor h depends on the conditioning of the Hankel matrix H0. For the case considered here, we observed that h achieves the value of around 1.7. However, for better-conditioned cases with well-separated spectral peaks, h can be significantly bigger (in the range of 5 for a spectrum with three peaks, separated by 0.57 ppm). This experiment suggests that LP-based low-rank filtering can be less effective than PS-based low-rank filtering, although both are used in LORA.

We performed a Monte Carlo study to further evaluate the denoising effectiveness of low-rank filtering with 2048 noise realizations to obtain histograms of the errors before and after denoising as

e=ss0ē=s0 (21)

where denotes the denoised data. Fig. 7 shows some representative results or the low-rank filtering based on partial separability. It is easy to see that a significant reduction in the noise variance was achieved at every SNR level. Note that low-rank filtering is a “biased” estimator. However, the resulting bias is negligible once the rank is correctly chosen and SNR is not extremely low. In the case of extremely low SNR, using an overestimated rank reduces bias, but at the expense of reduced filtering effect.

Fig. 8 shows some representative results from a Monte Carlo study of low-rank filtering based on linear predictability. Notice that the denoising performance is frequency dependent. Histogram at a low SNR frequency f2 in Fig. 8(c) shows that a significant noise variance reduction is still achieved; however, a larger bias is introduced, compared to the case of a higher SNR frequency f1 in Fig. 8(b). For the frequency regions that contain mostly noise (such as the frequency f3 in [Fig. 8(d)], we observe significant noise variance reduction with no bias.

C. In Vivo Experiments

We have applied LORA to denoise in vivo MRSI data of the mouse brain, acquired using a Varian INOVA 11.74 T (500 MHz) MRI scanner at Washington University in St. Louis. This dataset was previously described and used in [39]. A craniectomy was performed on the mouse and the middle cerebral artery (MCA) was electrocoagulated. In vivo MRSI data were acquired using a CSI sequence with chemical shift selective suppression (CHESS) water suppression, TE=270 ms, TR=1500 ms, bandwidth=6000 Hz, 1024 FID data points, and 8 averages. The CSI dataset was preprocessed to compensate for field inhomogeneity and remove water resonance using HSVD [3]. Notice that the residual water and artifacts from field inhomogeneity were not completely removed. For a detailed description of the dataset, see [39]. Before performing LORA denoising, data were preprocessed with spatial denoising using anatomical reconstruction [40] to protect spatial features from field inhomogeneity artifacts during the low-rank filtering due to spatiotemporal partial separability. NAA, lactate (Lac), and glutamate/glutamine (Glx) spatial maps were obtained by integrating the complex spectrum at 2.02 ppm, 1.33 ppm, and in the region from 2.7 to 2.8 ppm, respectively. Fig. 9 shows NAA and Lac maps obtained from the measured, spatially denoised, and LORA-denoised data after each individual low-rank filtering. Fig. 10 shows spectra obtained from these reconstructions at voxels outside of electrocogualation area (voxels marked as “1” and “3” in [Fig. 9(a)]) and in the region of electrocogualation (voxel marked as “2” in [Fig. 9(a)]). It can be clearly seen that the noise has been significantly suppressed. Electrocoagulation blocked MCA, resulting in a stoppage of blood supply and lack of oxygen. Thus, we expect elevated Lac and reduced NAA concentrations in the electrocogualation region, which can be seen from the LORA-denoised reconstruction in Fig. 9(d).

Fig. 9.

Fig. 9

Denoising results from in vivo experimental data. Spatial distributions corresponding to NAA in the region from 1.9 to 2.1 ppm (top row), Lac in the region from 1.2 to1.37 ppm (middle row), and Glx in the region from 2.7 to 2.8 ppm (bottom row) obtained from (a) noisy data; (b) spatial denoising; (c) PS-based low-rank filtering; (d) LP-based low-rank filtering of the results in (c) (the final output of LORA).

Fig. 10.

Fig. 10

Denoising results from in vivo experimental dataset: spectra at three voxels in the region outside of electrocogualation (voxel 1, top row, and voxel 3, bottom row) and inside of electrocogualation area (voxel 2, middle row), obtained from (a) noisy data; (b) spatial denoising; (c) PS-based low-rank filtering of the spatial filtering results; (d) LP-based low-rank filtering of PS-based low-rank filtering results (which is the final output of LORA). All spectra are shown in absolute mode and on the same scale.

It is also interesting to observe the filtering results of each individual filtering step. While spatial filtering is useful, its output is often still too noisy for practical application. The results were significantly improved by subsequent PS-based and LP-based low-rank filterings. Note that while PS-based low-rank filtering performs spatial-spectral denoising, LP-based low-rank filtering is just a spectral filter. These filtering characteristics can be clearly observed from Figs. 9 and 10. Note also that the denoised spectra in Fig. 10(d) show minimal spectral distortions with impressive SNR improvement as compared to the results in Fig. 10(a).

V. Conclusion

This paper has presented a new method (LORA) for spatialspectral filtering of MR spectroscopic imaging data. LORA exploits the low-rank properties of MRSI data, one due to partial separability and the other due to linear predictability. The combined use of partial separability and linear predictability provides a new principled way to improve SNR for spatiotemporal imaging. Simulation and experimental results demonstrate that LORA can effectively denoise MRSI data with very low SNRs. It should prove useful for practical MRSI applications.

Acknowledgments

This work was supported in part by Grant NIHP41-EB-001977, GrantNIH-P41-EB015904, GrantNIH-R01-EB013695, Grant NSF-CBET-07-30623, Grant NSF-CCF-06-35234, Grant NSF-CCF-09-16953, and in part by the VEF fellowship.

Biographies

graphic file with name nihms508541b1.gif

Hien M. Nguyen received the B.S. degree in computer science from the Hanoi University of Technology, Hanoi, Vietnam, in 2002, and the M.S. and Ph.D. degrees in electrical and computer engineering from the University of Illinois at Urbana-Champaign, Champaign, IL, in 2007 and 2011, respectively.

She is currently a Postdoctoral Researcher in the Department of Radiology, Richard M. Lucas Center for Imaging, Stanford University, Stanford, CA. Her research interests include multidimensional signal processing, parameter estimation, and inverse problems, with particular emphasis on biomedical imaging applications.

graphic file with name nihms508541b2.gif

Xi Peng received the B.Eng. and M.S. degrees in measurement, control, and instruments, and the Ph.D. degree in electrical engineering all from Wuhan University, Hubei, China, in 2006, 2008, and 2012, respectively.

During the doctoral process, he was a Visiting Scholar in Beckman Institute, University of Illinois at Urbana-Champaign. He is currently an Assistant Professor in the Paul C. Lauterbur Research Center for Biomedical Imaging, Institute of Biomedical and Health Engineering, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China. His research interests include image reconstruction, parameter estimation, and signal processing for biomedical applications.

graphic file with name nihms508541b3.gif

Minh N. Do received the Dr.Sci. degree in communication systems from the Swiss Federal Institute of Technology Lausanne (EPFL), Zurich, Switzerland, in 2001.

He is currently an Associate Professor in the Department of Electrical and Computer Engineering, University of Illinois at Urbana, Champaign, Urbana. His research interests include image and multidimensional signal processing, wavelets and multiscale geometric analysis, computational imaging, augmented reality, and visual information representation.

Prof. Do received a Silver Medal from the 32nd International Mathematical Olympiad in 1991, a University Medal from the University of Canberra in 1997, a Doctorate Award from the EPFL in 2001, a CAREER Award from the National Science Foundation in 2003, and a Young Author Best Paper Award from the IEEE in 2008. He is a member of the IEEE Signal Processing Theory and Methods and Image, Video, and Multidimensional Signal Processing Technical Committees, and an Associate Editor of the IEEE Transactions on Image Processing.

graphic file with name nihms508541b4.gif

Zhi-Pei Liang (M’92–SM’98–F’06) received the Ph.D. degree in biomedical engineering from Case Western Reserve University, Cleveland, OH, in 1989.

He is currently the Franklin W. Woeltge Professor of Electrical and Computer Engineering at the University of Illinois at Urbana-Champaign (UIUC), Champaign, IL. His research interests include image formation theory, algorithms, and biomedical applications.

Dr. Liang is a recipient of the 1990 Sylvia Sorkin Greenfield Best Paper Award (Medical Physics), an National Science Foundation’s CAREER Award (1995), and the 1999 IEEEEMBS Early Career Achievement Award, and the 2012 International Federation for Medical and Biological Engineering Otto Schmitt Award. He was named Henry Magnuski Scholar (1999–2001), and University Scholar (2001–2004) at UIUC. He is a fellow of the International Society for Magnetic Resonance in Medicine (2010), and was elected to the American Institute for Medical and Biological Engineering in 2005 and to the International Academy of Medical and Biological Engineering in 2012. He served as President of the IEEE Engineering in Medicine and Biology Society from 2011 to 2012.

Contributor Information

Hien M. Nguyen, Lucas Center for MR Spectroscopy and Imaging, Department of Radiology, Stanford University, Palo Alto, CA 94304 USA (hnguyen6@stanford.edu)

Xi Peng, Paul C. Lauterbur Research Center for Biomedical Imaging, Institute of Biomedical and Health Engineering, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong 518055, China (xi.peng@siat.ac.cn).

Minh N. Do, Department of Electrical and Computer Engineering and the Coordinated Science Laboratory, University of Illinois at Urbana-Champaign, Urbana, IL 61801 USA (minhdo@illinois.edu)

Zhi-Pei Liang, Department of Electrical and Computer Engineering and the Beckman Institute, University of Illinois at Urbana-Champaign, Urbana, IL 61801 USA (z-liang@illinois.edu).

References

  • 1.de Graaf RA. In Vivo NMR Spectroscopy. 2nd ed. New York: Wiley; 2007. [Google Scholar]
  • 2.Costanzo AD, Trojsi F, Tosetti M, Schirmer T, Lechner SM, Popolizio T, Scarabino T. Proton MR spectroscopy of the brain at 3 T: An update. Eur. Radiol. 2007;vol. 17:1651–1662. doi: 10.1007/s00330-006-0546-1. [DOI] [PubMed] [Google Scholar]
  • 3.van den Boogaart A, van Ormondt D, Pijnappel WWF, de Beer R, Ala-Korpela M. Removal of the water resonance from 1H magnetic resonance spectra. In: McWhirter JG, editor. Mathematical Signal Processing III. Oxford: Clarendon Press; 1994. pp. 175–195. [Google Scholar]
  • 4.de Greiff HFC, Ramos-Garcia R, Lorenzo-Ginori JV. Signal denoising in magnetic resonance spectroscopy using wavelet transforms. Concepts Magn. Reson. 2002;vol. 14:388–401. [Google Scholar]
  • 5.Andrews HC, Patterson CL. Singular value decompositions and digital image processing. IEEE Trans. Acoust. Speech Sign. Process. 1976 Feb;vol. 24(no. 1):26–53. [Google Scholar]
  • 6.Kumaresan R, Tufts DW. Estimating the parameters of exponentially damped sinusoids and pole-zero modeling in noise. IEEE Trans. Acoust., Speech, Signal Process. 1982 Dec;vol. 30(no. 6):833–840. [Google Scholar]
  • 7.Buades A, Coll B, Morel JM. A review of image denoising algorithms, with a new one. Multiscale Model. Simul. 2005;vol. 4:490–530. [Google Scholar]
  • 8.Ahmed OA. New denoising scheme for magnetic resonance spectroscopy signals. IEEE Trans. Med. Imag. 2005 Jun;vol. 24(no. 6):809–816. doi: 10.1109/TMI.2004.828350. [DOI] [PubMed] [Google Scholar]
  • 9.Diop A, Zaim-Wadghiri Y, Briguet A, Graveron-Demilly D. Improvements of quantitation by using the Cadzow enhancement procedure prior to any linear-prediction methods. J. Magn. Reson. 1994;vol. 105:17–24. [Google Scholar]
  • 10.Beer RD. Quantitative in vivo NMR. Univ. Technol. Delft Lect. Notes; 1994. [Online]. Available: http://www.intermnet.ua.es/inteRMNet/c59/index.html. [Google Scholar]
  • 11.Sava AC, Sima D, Poullet J, Huffel SV. Exploiting spatial information for estimating metabolite concentration in MRSI. Proc. 17th Int. Soc. Mag. Reson. Med. 2009 [Google Scholar]
  • 12.Bao Y, Maudsley AA. Improved reconstruction for MR spectroscopic imaging. IEEE Trans. Med. Imag. 2007 May;vol. 26(no. 5):686–695. doi: 10.1109/TMI.2007.895482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kornak J, Young K, Schuff N, Maudsley AA, Weiner MW. Bayesian reconstruction of low resolution magnetic resonance imaging modalities. Proc. Leeds Annu. Stat. Res. Workshop—Bioinformat. Imag. Wavelets. 2004:89–92. [Google Scholar]
  • 14.Cadzow JA. Signal enhancement—A composite property mapping algorithm. IEEE Trans. Acoust. Speech Sign. 1988 Jan;vol. 36(no. 1):49–62. [Google Scholar]
  • 15.Eslami R, Jacob M. Reduction of distortions in MRSI using a new signal model. Proc. IEEE Int Symp. Biomed. Imag. 2009 Jun-Jul;:438–441. [Google Scholar]
  • 16.Nguyen HM, Haldar JP, Do MN, Liang Z-P. Denoising of MR spectroscopic imaging data with spatial-spectral regularization. Proc. IEEE Int. Symp. Biomed. Imag. 2010 Apr;:720–723. [Google Scholar]
  • 17.Liang Z-P. Spatiotemporal imaging with partially separable functions. Proc. IEEE Int. Symp. Biomed. Imag. 2007 Apr;:988–991. [Google Scholar]
  • 18.Nguyen HM, Peng X, Do MN, Liang Z-P. Spatiotemporal filtering of MR spectroscopic imaging data by low-rank approximations. Proc. IEEE Int. Symp. Biomed. Imag. 2011 Mar-Apr;:857–860. [Google Scholar]
  • 19.Peng X, Nguyen H, Haldar J, Hernando D, Wang X-P, Liang Z-P. Correction of field inhomogeneity effects in limited k-space coverage MRSI data with anatomical constraint. Proc. Int. Soc. Mag. Reson. Med. 2010:883–886. doi: 10.1109/IEMBS.2010.5627873. [DOI] [PubMed] [Google Scholar]
  • 20.Bashir A, Yablonskiy DA. Natural linewidth chemical shift imaging (NL-CSI) Mag. Res. Imag. 2006 Jul;vol. 56:7–18. doi: 10.1002/mrm.20917. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Khalidov I, Ville DVD, Jacob M, Lazeyras F, Unser M. BSLIM: Spectral localization by imaging with explicit B0 field inhomogeneity compensation. IEEE Trans. Med. Imag. 2007 Jul;vol. 26(no. 7):990–1000. doi: 10.1109/TMI.2007.897385. [DOI] [PubMed] [Google Scholar]
  • 22.Hu X, Levin DN, Lauterbur PC, Spraggins T. SLIM: Spectral localization by imaging. Magn. Reson. Med. 1988;vol. 8:314–322. doi: 10.1002/mrm.1910080308. [DOI] [PubMed] [Google Scholar]
  • 23.Liang Z-P, Lauterbur PC. A theoretical analysis of the SLIM technique. J. Magn. Reson. 1993;vol. 102:54–60. [Google Scholar]
  • 24.Barker PB, Lin DDM. In vivo proton MR spectroscopy of the human brain. Prog. NMR Spect. 2006;vol. 49:99–128. [Google Scholar]
  • 25.Seeger U, Klose U. Parameterized evaluation of macromolecules and lipids in proton MR spectroscopy of brain diseases. Magn. Reson. Med. 2003;vol. 49:19–28. doi: 10.1002/mrm.10332. [DOI] [PubMed] [Google Scholar]
  • 26.Pijnappel WWF, van den Boogaart A, de Beer R, van Ormondt D. SVD-based quantification of magnetic resonance signals. J. Magn. Reson. 1992;vol. 97:122–134. [Google Scholar]
  • 27.Iohvidov IS. Hankel and Toeplitz Matrices and Forms. Birkhauser; 1982. [Google Scholar]
  • 28.Marcenko VA, Pastur LA. Distribution of eigenvalues for some sets of random matrices. Math. USSR—Sb. 1967;vol. 1:457–483. [Google Scholar]
  • 29.Shibata R. Selection of the order of an autoregressive model by Akaike’s information criterion. Biometrika. 1976;vol. 63:117–126. [Google Scholar]
  • 30.Govindaraju V, Young K, Maudsley AA. Proton NMR chemical shifts and coupling constants for brain metabolites. NMR Biomed. 2000;vol. 13:129–153. doi: 10.1002/1099-1492(200005)13:3<129::aid-nbm619>3.0.co;2-v. [DOI] [PubMed] [Google Scholar]
  • 31.Bazan FSV, Toint PL. Conditioning of infinite Hankel matrices of finite rank. Syst. Control Lett. 2000;vol. 41:347–359. [Google Scholar]
  • 32.Soher BJ, Young K, Bernstein A, Aygula Z, Maudsley AA. GAVA: Spectral simulation for in vivoMRS applications. J. Magn. Reson. 2007;vol. 185:291–299. doi: 10.1016/j.jmr.2007.01.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Srinivasan R, Sailasuta N, Hurd R, Nelson S, Pelletier D. Evidence of elevated glutamate in multiple sclerosis usingmagnetic resonance spectroscopy at 3 T. Brain. 2005;vol. 128:1016–1025. doi: 10.1093/brain/awh467. [DOI] [PubMed] [Google Scholar]
  • 34.Wang Y, Li S-J. Differentiation of metabolic concentrations between gray matter and white matter of human brain by in vivo1 H magnetic resonance spectroscopy. Magn. Reson. Med. 1998;vol. 39:28–33. doi: 10.1002/mrm.1910390107. [DOI] [PubMed] [Google Scholar]
  • 35.Traber F, Block W, Lamerichs R, Gieseke J, Schild HH. 1H metabolite relaxation times at 3.0 Tesla: Measurements of T1 and T2 values in normal brain and determination of regional differences in transverse relaxation. J. Magn. Reson. Imag. 2004;vol. 19:537–545. doi: 10.1002/jmri.20053. [DOI] [PubMed] [Google Scholar]
  • 36.Choi C, Coupland NJ, Bhardwaj PP, Kalra S, Casault CA, Reid K, Allen PS. T2 measurement and quantification of glutamate in human brainin vivo. Magn. Reson. Med. 2006;vol. 56:971–977. doi: 10.1002/mrm.21055. [DOI] [PubMed] [Google Scholar]
  • 37.Dabov K, Foi A, Katkovnik V, Egiazarian K. Image denoising by sparse 3D transform-domain collaborative filtering. IEEE Trans. Image Process. 2007 Aug;vol. 16(no. 8):2080–2095. doi: 10.1109/tip.2007.901238. [DOI] [PubMed] [Google Scholar]
  • 38.Moor BD. The singular value decomposition and long and short spaces of noisy matrices. IEEE Sign. Process. 1993 Sep;vol. 41(no. 9):2826–2838. [Google Scholar]
  • 39.Haldar JP, WU T-H, Wang Q, Chen C-I, Song S-K, Liang Z-P. Further development in anatomically constrained MR image reconstruction: Application to multimodal imaging of mouse stroke. Proc. Int. IEEE Eng. Med. Biol. Soc. 2008 Aug;:422–425. doi: 10.1109/IEMBS.2008.4649180. [DOI] [PubMed] [Google Scholar]
  • 40.Haldar JP, Hernando D, Song S-K, Liang Z-P. Anatomically constrained reconstruction from noisy data. Magn. Reson. Med. 2008;vol. 59:810–818. doi: 10.1002/mrm.21536. [DOI] [PubMed] [Google Scholar]

RESOURCES