Abstract
Neural networks are potentially valuable for many of the challenges associated with MRS data. The purpose of this manuscript is to describe the AGNOSTIC dataset, which contains 259,200 synthetic 1H MRS examples for training and testing neural networks. AGNOSTIC was intended to be acquisition-agnostic by using 270 basis sets that were simulated across 18 field strengths, 15 echo times, and a range of dwell times. The synthetic examples were produced to resemble healthy and clinical in vivo brain data with combinations of metabolite, macromolecule, residual water signals, and noise. To demonstrate the utility, we apply AGNOSTIC to train two Convolutional Neural Networks (CNNs) to address out-of-voxel (OOV) echoes. A Detection Network was trained to identify the point-wise presence of OOV echoes, providing proof of concept for real-time detection. A Prediction Network was trained to reconstruct OOV echoes, allowing subtraction during post-processing. Complex OOV signals were mixed into 85% of synthetic examples to train two separate CNNs for the detection and prediction of OOV signals. AGNOSTIC is available through Dryad and all Python 3 code is available through GitHub. The Detection network was shown to perform well, identifying 95% of OOV echoes. Traditional modeling of these detected OOV signals was evaluated and may prove to be an effective method during linear-combination modeling. The Prediction Network greatly reduces OOV echoes within FIDs and achieved a median log10 normed-MSE of −1.79, an improvement of almost two orders of magnitude.
Keywords: Magnetic Resonance Spectroscopy, Synthetic Data, Simulation, Deep Learning, Out-of-voxel Artifacts, Human Brain
1. Introduction:
Proton (1H) magnetic resonance spectroscopy (MRS) non-invasively measures levels of endogenous neurometabolites. MRS-visible metabolites are present at millimolar concentrations in the brain, yielding detectable signals with relatively low signal-to-noise ratio (SNR) which mutually overlap. In vivo spectra suffer from several artifacts that complicate modeling and interpretation of the data, including eddy current effects and out-of-voxel (OOV) echoes (Kreis, 2004). While there is some degree of standardization and consensus around pre-processing, modeling, and quantification of MRS data (Maudsley et al., 2021; Near et al., 2021; Öz et al., 2021; Wilson et al., 2019), this is an evolving field without a single ideal solution due to the complexity of the problem, and which therefore is likely to benefit from recent advances in machine learning.
Deep learning (DL) uses a network consisting of a series of computational layers to process information (Lecun et al., 2015). Iterative training allows features of the data to be identified and weighted to estimate a final function which predicts a desired output based on a given input (Goodfellow et al., 2016). Supervised learning involves training the network based on a pre-defined target, associating ground-truth parameters with each input. An extensive, balanced, and diverse dataset is preferred to increase the generalizability of the DL outcome. High-dimensional data, such as medical images or time series, are demonstrated to be the most beneficial set of data for several computer vision tasks, such as classification, registration, segmentation, reconstruction, and object detection (Gassenmaier, Küstner, et al., 2021; Lundervold & Lundervold, 2019).
DL has been developed for MRS data as a proof-of-concept in many applications, including metabolite quantification (Chandler et al., 2019; Hatami et al., 2018; H. H. Lee & Kim, 2019, 2020; Rizzo et al., 2023; Shamaei et al., 2023; Zhang & Shen, 2023), signal separation (Li et al., 2020), phase and frequency correction (Ma et al., 2022; Shamaei et al., 2023; Tapper et al., 2021), reconstruction of missing data (H. Lee et al., 2020), accelerated post-processing (Gurbani et al., 2019; Iqbal et al., 2021), denoising (Chen et al., 2023; Dziadosz et al., 2023; Lam et al., 2020), super-resolution (Gassenmaier, Afat, et al., 2021; Iqbal et al., 2019), artifact removal (Gurbani et al., 2018; Kyathanahally et al., 2018), and anomaly detection (Jang et al., 2021). Despite the potential, these methods have yet to be shown to generalize outside of small datasets with a single fixed acquisition protocol. Whereas ‘classical’ methods for post-processing are often driven by an understanding of the problem to be solved, and therefore can often be applied broadly, deep learning methods cannot be assumed to function well outside of the specific datasets used for training and testing. Broadly applicable deep learning methods will only arise from broad training and testing. A key barrier is the lack of a generalized benchmark dataset for training and testing, to play the role that MNIST, ImageNet, and COCO have played in the field of Computer Vision (Fei-Fei et al., 2010; Li Deng, 2012; T.-Y. Lin et al., 2014). Such a dataset lowers the barrier to entry for neural network development in MRS and facilitates performance comparisons between models. The Synthetic Data Working Group of the MRS study group of the International Society for Magnetic Resonance in Medicine’s Synthetic Data Working Group has recently highlighted the MRS community’s need for such a resource.
OOV echoes, which represent a subset of the artifacts often referred to as ‘spurious’ or ‘ghost’ echoes (Kreis, 2004), are a substantial issue for in vivo MRS, and an under-studied potential DL application. MRS voxel localization is achieved via a combination of RF pulses and magnetic field gradients, with the intended coherence transfer pathway selected both by phase cycling and dephasing “crusher” gradient scheme (Bodenhausen, 2011). OOV signals arise from gradient echoes – signals from outside the shimmed voxel of interest are refocused by evolution in local field gradients that are either inherent (from air-tissue-bone interfaces) or arising from second-order shim terms (Starck et al., 2009). Therefore, brain regions close to air cavities (e.g., medial prefrontal cortex) or which require stronger shim gradients (e.g., thalamus, hippocampus, etc.) most commonly exhibit OOV artifacts (Starck et al., 2009). OOV echoes seldom occur at the time of the primary echo, so they manifest in the spectrum as broad peaks with strong first-order phase “ripple” that can obscure metabolite resonances. While acquisition strategies can mitigate OOV echoes to some extent, by careful consideration of crusher schemes or voxel orientation (Ernst & Chang, 1996; Landheer & Juchem, 2019; Song et al., 2023), post-processing strategies remain valuable where complete elimination is not possible.
This manuscript develops Adaptable Generalized Neural-Network Open-source Spectroscopy Training dataset of Individual Components (AGNOSTIC), a dataset consisting of 259,200 synthetic MRS examples. AGNOSTIC spans a range of field strengths, echo times, and clinical profiles, representing metabolite signals, macromolecule (MM) background signals, residual water signals, and Gaussian noise as separate components. To date, DL applications to MRS have relied upon narrow in-house-generated training datasets that limit the generalizability of the solutions developed and comparisons between tools; AGNOSTIC is proposed as a benchmark dataset to fill this gap. In order to demonstrate the utility of this resource, we then illustrate a specific augmentation of the AGNOSTIC dataset to train neural networks for the detection and prediction of OOV echoes.
2. Methods:
2.1. AGNOSTIC Synthetic Dataset
The parameter space that AGNOSTIC spans is deliberately broad, comprising: 18 field strengths; 15 echo times; broad distributions of metabolite, MM, and water amplitudes; and densely sampled time-domain to allow down-sampling. Calculations were carried out using an in-house Python 3 (Van Rossum & Drake, 2009) programming script using NumPy (Harris et al., 2020). The dataset is structured as a zipped NumPy archive file (.npz) and can be opened as a Python 3 dictionary object. This dictionary contains complex-valued NumPy arrays of time-domain data corresponding to the metabolite, macromolecule, water, and noise components which can be combined in different ways depending on the application. For instance, a denoising model may want to target the combined metabolite, MM, and water signal without noise. Within the file, all the acquisition parameters (field strength, echo time, spectral width, etc.), simulation parameters (signal to noise, full-width half-max, concentrations, T2 relaxation, etc.), and data augmentation options are specified as detailed below.
2.1.1. Basis Set Simulation:
Metabolite spectra are based upon density-matrix-simulated basis functions (Blum, 1981; Fano, 1957; Farrar, 1990; O. W. Sørensen et al., 1984). A total of 270 basis sets were created across 18 field strengths (1.4 T – 3.1 T in steps of 0.1 T) and 15 echo times (10 ms – 80 ms in steps of 5 ms). The Point RESolved Spectroscopy (PRESS) pulse sequence (Bottomley, 1982) was simulated using ideal pulses with TE1 = TE2. The simulated spectral width, centered on 4.7 ppm, was 63.62 ppm for all field strengths (e.g., 8 kHz at 3 T; 4 kHz at 1.5 T). The simulated “acquisition window” was started immediately after the last pulse to generate points before the echo. Each metabolite signal was output as a N x 16684 NumPy array, where N is the number of spins for a given metabolite and 16684 is the fixed length of complex time points (300 points before the echo maximum, with an appropriate padding number of zeros and followed by the simulated pre-echo signal, and 16384 points after the echo).
39 brain metabolite basis functions were simulated: Adenosine Triphosphate (ATP); Acetate (Ace); Alanine (Ala); Ascorbate (Asc); Aspartate (Asp); β-hydroxybutyrate (bHB); β-hydroxyglutarate (2HG); Citrate (Cit); Cysteine (Cys); Ethanolamine (EA); Ethanol (EtOH); Creatine (Cr); y-Amino-Butyric Acid (GABA); Glucose (Glc); Glutamine (Gln); Glutamate (Glu); Glycerophosphocholine (GPC); Glutathione (GSH); Glycerol (Glyce); Glycine (Gly); Water (H2O); Homocarnosine (HCar); Histamine (Hist); Histidine (His); Lactate (Lac); Myo-Inositol (mI); N-Acetyl-Aspartate (NAA); N-Acetyl-Aspartate-Glutamate (NAAG); Phenylalanine (Phenyl); Phosphocholine (PCho); Phosphocreatine (PCr); Phosphoethanolamine (PE); Scyllo-Inositol (sI); Serine (Ser); Taurine (Tau); Threonine (Thr); Tryptophan (Tryp); Tyrosine (Tyr); and Valine (Val). GABA was separately simulated using two different spin-system enumerations (Govindaraju et al., 2000; Near et al., 2012). Both α-glucose and beta-glucose were simulated.
2.1.2. Assembly of Metabolite Component:
Individual metabolite basis functions were linearly combined to give a metabolite spectral component, weighted by metabolite concentrations sampled from distributions defined by our recent meta-analysis (Gudmundson et al., 2023), including both healthy and clinical cohort ranges. From the full basis sets, 22 metabolites were selected which had defined concentration ranges available in our recent meta-analysis (Gudmundson et al., 2023). Concentrations were selected with equal probability from a range defined by ±2.5 standard deviations from the meta-analysis mean of each cohort (Gudmundson et al., 2023).
T2* relaxation decay of time-domain data was simulated with an exponential and Gaussian component to produce a Voigt lineshape (Marshall et al., 1997) in the frequency domain. The exponential component represents the pure T2 arising from dipole-dipole interactions, paramagnetic interaction, etc., while the Gaussian component represents the transverse dephasing from diffusion and exchange of spins in an inhomogeneous field (Koch et al., 2009; Marshall et al., 1997; Michaeli et al., 2002; Yablonskiy & Haacke, 1994). While pure T2 is understood to be field-independent (Bloembergen et al., 1948; Carr & Purcell, 1954; Held et al., 1973; Michaeli et al., 2002), the dominant Gaussian decay (Marshall et al., 1997) increases with increasing static field strength and is attributed to greater microscopic (Michaeli et al., 2002) and macroscopic (Juchem et al., 2021; Tkáč et al., 2001) susceptibility gradients. Here, the pure Lorentzian T2 component is based upon the relaxation times at 1.5 T from a relaxation meta-regression (Gudmundson et al., 2023), which are assumed to be the least impacted by susceptibility gradients that scale with B0 (Bloembergen et al., 1948; De Graaf et al., 2006; Michaeli et al., 2002). Once the Lorentzian T2 component was applied, the additional T2* contributions were modeled by applying appropriate amounts of Gaussian broadening, to achieve a frequency-domain full-width half-maximum (FWHM) linewidth of the NAA singlet between 3 Hz and 18 Hz with a uniform distribution. A small amount of jitter (between 20 s−2 and 100 s−2) was added to the Gaussian decay rate so that each metabolite would undergo a similar, but not identical, amount of Gaussian decay to better replicate the variability observed for in vivo data.
2.1.3. Macromolecular Component:
Fourteen MM signals were modeled at: 0.92 ppm; 1.21 ppm; 1.39 ppm; 1.67 ppm; 2.04 ppm; 2.26 ppm; 2.56 ppm; 2.70 ppm; 2.99 ppm; 3.21 ppm; 3.62 ppm; 3.75 ppm; 3.86 ppm; and 4.03 ppm (Cudalbu et al., 2021; Giapitzakis et al., 2018). MM chemical shifts were jittered by ±0.03 ppm to both account for observed differences in MM designations and provide further dataset augmentation. Each MM signal was simulated as a singlet with exponential decay rate sampled uniformly from a range specified by literature of MM T2 time constants (Murali-Manohar et al., 2020) and additional Gaussian decay to reach published linewidths (Giapitzakis et al., 2018; Murali-Manohar et al., 2020). MM amplitudes were sampled uniformly from within published ranges (Giapitzakis et al., 2018; Murali-Manohar et al., 2020).
2.1.4. Noise Component:
Noise was generated from a normal distribution, with independent random real and imaginary points. The noise was scaled such that the signal-to-noise ratio of the NAA singlet (SNRNAA, defined as NAA height divided by the standard deviation of the noise) was between 5 and 80, uniformly sampled. The noise amplitude values are also stored within the archive file.
2.1.5. Residual Water Component:
The residual water basis signal was simulated as a singlet at 4.7 ppm. In order to simulate imperfect water suppression, this residual water signal was used to model up to 5 unique water signals with variable ppm locations, phases, and amplitudes (L. Lin et al., 2019). The ranges for these parameters are listed in Table 1. The final water component was scaled to be between 1× and 20× the maximum value of the frequency-domain metabolite spectrum. The water components used, along with their corresponding parametrizations, are stored within the NumPy archive file.
Table 1.
Parametrization of the residual water signal within AGNOSTIC.
| Component | Location | Phase / deg | Amplitude | |||
|---|---|---|---|---|---|---|
| 1 | 4.679 | 4.711 | −10 | 10 | 1.00 | |
| 2 | 4.599 | 4.641 | 15 | 45 | .35 | .55 |
| 3 | 4.801 | 4.759 | −60 | −30 | .35 | .55 |
| 4 | 4.449 | 4.541 | 45 | −70 | .10 | .25 |
| 5 | 4.859 | 4.901 | 105 | 135 | .10 | .25 |
2.1.6. Frequency and Phase Shifts:
Within the NumPy archive file, frequency and phase shifts are specified for each entry in the dataset, but not applied to the time-domain components. Frequency shifts were sampled uniformly from the range −0.313 ppm to +0.313 ppm. Zero-order phase shifts were sampled uniformly from the range −1.57 radians to +1.57 radians. First-order phase shifts were sampled uniformly from the range −0.34 to +0.34 radians per ppm. Users may choose to omit phase and frequency shifts, use the provided shifts, or specify their own.
2.2. Exemplar Application to AGNOSTIC: Machine Learning for Out-Of-Voxel Artifacts:
The primary motivation for the AGNOSTIC dataset is as a training resource for the development of processing, modeling, and analysis tools for MRS. Synthetic spectra with known ground truths are valuable in a range of applications, from the development and validation of traditional linear combination modeling algorithms to training DL models.
In order to demonstrate the utility of the dataset, an exemplar application is presented, in which the AGNOSTIC dataset is supplemented by simulated artifacts (in this case out-of-voxel OOV echoes) and used to train DL models to detect and predict the artifact signals. The AGNOSTIC dataset was developed as building blocks which can be combined to train a variety of different models. A strength of this dataset is that custom user-defined components can be utilized. We demonstrate this point here by building an OOV dataset to train and evaluate a DL model to identify and suppress OOV artifacts.
2.2.1. Simulation of Out-Of-Voxel Echoes:
OOV artifacts were defined as complex time-domain signals with a time point (τOOV), width (WOOV), frequency (ωOOV), phase (fOOV) and amplitude (aOOV) as shown in Figure 1. τOOV describes the timepoint of the top of the OOV echo and was sampled randomly from a uniform distribution between 10 ms and 400 ms. WOOV describes the Gaussian decay rate and was sampled randomly from a uniform distribution between 500 s−2 and 8000 s−2, resulting in a FWHM echo duration between 18 ms and 74 ms. ωOOV describes the offset in the frequency domain, and was sampled randomly from a uniform distribution in order to produce OOVs that occur between 1 ppm and 4 ppm.
Figure 1.
Simulation of OOV echoes and OOV-corrupted synthetic data. OOV echoes were simulated as complex time-domain signals with a center timepoint (τOOV), width (WOOV), frequency (ωOOV), phase (fOOV), amplitude (aOOV). OOV echoes were added to 85% of synthetic data to create datasets for training and evaluation.
| [1] |
2.2.2. Integration of OOV Echoes into AGNOSTIC for the Training Dataset:
To build the training dataset, we combined metabolite, water, MM, and noise components from the AGNOSTIC dataset. We then added OOV signals to 85% of the dataset and a complex zeros array in the remaining 15%. Finally, we applied the included frequency and phase shifts specified within the AGNOSTIC dataset. The network input consisted of the combined metabolite, water, MM, noise, and OOV signals as a complex time-domain signal. This input was normalized so that the absolute maximum among the real and imaginary values was 1. Finally, training data were converted to a TensorFlow Dataset (Abadi et al., 2015).
2.2.3. Detection Network:
The first exemplar network is designed to detect OOV echoes within time-domain data by identifying the points in the spectra that have been contaminated by OOV echoes. This Detection Network is a fully Convolutional Neural Network (CNN) designed using TensorFlow2 with Keras (Chollet & others, 2015) in a Python 3 environment. The network consists of contracting encoding layers and expanding decoding layers with a total of 1.543 million parameters, as shown in Figure 2. Each layer was initialized (kernel_initializer) with “he_normal” (He et al., 2015). Each convolutional layer (except the output layer which uses a sigmoid activation) includes batch normalization and a leaky rectified linear unit (ReLu) activation function (Agarap, 2018). The network is designed to receive a time-domain input signal and return a binary mask of the same size as the input with ones placed in OOV-detected regions and zeros elsewhere. A ground-truth binary mask was determined as the 5% level of the Gaussian OOV kernel. For training, the input and output of this network is a 60 x 2048 x 2 x 1 tensor, where 60 is the batch number of input examples, 2048 is the number of time points, 2 is the real/imaginary dimension, and 1 is the channel dimension.
Figure 2.
Convolutional Neural Network Architecture, Input, and Output: A) Fully convolutional neural network architecture used for both the Detection and Prediction Network. Convolutional strides, batch normalization, and Leaky ReLu activation functions are denoted by a colored line. Dark gray blocks represent complex data with the 2nd dimension representing real and imaginary components, while white blocks represent the network abstracted single dimension. Arrows show residual connections. Note, inputs and outputs are all time-domain signals; Frequency-domain is shown for convenient visualization. B) OOV-corrupted synthetic example and the isolated OOV. The complex OOV-corrupted data was used as the Detection and Prediction Network input. The target Output is the isolated OOV.
The Dice coefficient (Carass et al., 2020; Dice, 1945; T. Sørensen, 1948) of the overlap between the network output and the correct binary OOV location vector was used as a training loss function, calculated as 2x the intersection divided by the union plus 1; where 1 was used to avoid division by 0. The Adam (Kingma & Ba, 2015) optimizer was used with a learning rate of 0.0003. Training was performed using an 8 GB NVIDIA GeForce RTX 3070 GPU. A clustering algorithm was applied to the final network output, which zeroed any group of time points in which the network detected OOV echo that was smaller than 5 consecutive time points, to dampen spurious output.
2.2.4. Modeling:
Modeling of the OOV echoes was performed as an optimization problem and solved with SciPy (Virtanen et al., 2020) minimization routines. Here, the non-gradient Powell (Powell, 1964, 1994) optimizer was used to determine the five OOV parameters (τOOV, WOOV, ωOOV, fOOV, and aOOV), minimizing the mean squared error (MSE) between the model and the data within the time window identified by the Detection Network. Initial values for τOOV, WOOV, and the aOOV are inferred from the Detection Network’s output center timepoint, the detection duration, and the standard deviation of the target signal within the detected region.
Optimization was performed as three sequential optimization steps. The first optimization is used to determine τOOV, WOOV, and the aOOV by minimizing the MSE between the absolute values of the model and the data (i.e., removing frequency and phase from the model) in the time domain. The second optimization determines ωOOV by absolute-mode minimization in the frequency domain. The third optimization refines the values determined by optimizations 1 and 2 and determines fOOV by complex optimization in the time domain.
2.2.5. Prediction Network:
The second exemplar network is designed to predict the OOV echoes found within time-domain data. This prediction network is also a fully CNN designed using TensorFlow2 with Keras in a Python 3 environment, with the same architecture as the detection network (as shown in Figure 2). The network is designed to receive a time-domain input signal containing a combination of the ground-truth time-domain signal and the OOV artifact and return a time-domain output signal that only contains the OOV signal, amplified 10x, where the amplification served to globally weight the entire echo. For training, a weighted mean squared error was used as a loss function with an ADAM (Kingma & Ba, 2015) optimizer and learning rate of 0.0003. Training was again performed on an 8 GB NVIDIA GeForce RTX 3070 GPU.
2.2.6. Evaluating the Performance of Networks and Modeling:
In the final testing set, OOV artifacts were present in 6,137 of the total 7,200 examples (85.2%). The detection network was evaluated using the Dice coefficient (Carass et al., 2020; Dice, 1945; Powell, 1964), the overlap between the ground-truth binary OOV mask and the cluster-thresholded network output. As well as computing global success, the dependence of detection success on various attributes of the OOV echo and the underlying spectrum were also investigated.
Both modeling and the prediction network return a pure OOV signal, and in both cases, the MSE between the prediction/model and the ground-truth OOV echo is used for evaluation. If the ground-truth echo datapoints are Ei and the model or echo prediction is Mi, we calculate the fractional remaining OOV amplitude as:
| [2] |
where the bars represent the complex amplitude. The sum is taken over the ground-truth range of the OOV echo. In order to visualize a wide range of success and failure, we take the log10 of this quantity for plotting (i.e., a log10 value of 1 is no change, anything positive is a manipulation that is worse than doing nothing, and a negative value show the order of magnitude of improvement). Note that is the ground-truth echo signal, not the signal from which the echo is being removed which also contains metabolite, macromolecule, and noise components.
The timing of the OOV was found to be a key parameter determining the success of detection and prediction, and as a result, the evaluation metrics were calculated for the following time-bins (based on the known value of tOOV): 10-20 ms; 20-40 ms; 40-60 ms; 60-80 ms; 80-120 ms; 120-200 ms; 200-300 ms; 300-400 ms.
2.2.7. In Vivo Proof-of-Principle
As a proof-of-principle demonstration of this exemplar use of the AGNOSTIC dataset, the network was applied to 256 transients of in vivo data, selected because they contain prominent OOV echoes and were excluded during quality assessment in a recent study (Zöllner et al., 2023). These data were collected on a 2.89 T Siemens scanner using the MEGA-PRESS (Mescher et al., 1996, 1998) pulse sequence with a TE of 68 ms and TR of 1.75 s, and a spectral width of 2.4 kHz. Note that this challenges the generality of the training because the network has never seen data acquired at 2.89 T, nor at 2.4 kHz spectral width, nor at TE 68 ms, nor with MEGA-Editing, nor with actual real RF pulses. Raw data from a 25 x 25 x 25 mm3 voxel in the cerebellum were loaded and coil combined in Osprey (Oeltzschner et al., 2020). Time-domain data were saved as a MATLAB (The MathWorks Inc., 2022) .mat file and loaded as a Python 3 object using SciPy. The data were normalized (as above with training data) to be used as input for the neural networks.
One challenge of in vivo data (and the reason that this network demonstration focuses substantially on synthetic data) is that no ground truth is available. Therefore, the degree of success in removing OOV echo signals from time-domain data is:
| [3] |
where s denotes the standard deviation. Note that, in contrast to the metric used for synthetic data in Equation 1, only is available, not the ground truth , which substantially changes the ceiling of success. It is still expected that substantial signal variance remains after OOV removal, since contains metabolite signals and noise. The range over which this standard deviation is calculated is the 50% level of the normalized histogram of the detection network’s output across the 128 averages.
3. Results:
3.1. AGNOSTIC Synthetic Dataset:
The AGNOSTIC dataset contains 259,200 examples, consisting of 960 examples from each of the eighteen field strengths and fifteen echo times (i.e., 960x18x15=259,200). A representative set of ten spectra are shown in Figure 3, illustrating the diversity of field strengths, TEs, SNR, and linewidth within the dataset.
Figure 3.

AGNOSTIC synthetic dataset. 10 representative spectra from the AGNOSTIC dataset. The 10 examples show the diversity of field strength, TE, linewidths, and residual water signal present among the data. Note, examples are shown here in the frequency-domain to better illustrate the heterogeneity, but the dataset provides time-domain examples.
One challenge with making this dataset available is its size — 75 GB — but we do make it freely available on Dryad. The basis sets from which these are constructed are more manageable — 9 GB — and can also be accessed through Dryad. Code for generating the AGNOSTIC dataset locally is available at: https://github.com/agudmundson/agnostic.
3.2. Exemplar Application to AGNOSTIC: Machine Learning for Out-Of-Voxel Artifacts:
3.2.1. Detection Network:
Of the 6,137 examples where OOV artifacts were present, the Detection Network correctly identified 5,827 (94.9%) with a median Dice score of 0.974 (0.941–0.985 interquartile range) and missed 310 (5.05%) with a Dice score of 0.00. In the 1063 examples that did not include OOV artifacts, the network correctly ignored 912 (85.8%) and falsely detected OOV echoes in 151 (14.2%). Figure 4 shows the Detection Network’s output for a synthetic OOV-corrupted example.
Figure 4.
OOV-corrupted example: OOV-corrupted synthetic example and the isolated OOV. Results from Detection Network (green), Model (orange), and Prediction Network (blue) are shown below the ground truth OOV-corrupted and OOV. OOV residuals are shown for the Model (orange) and Prediction Network (blue) demonstrating remaining signal after subtraction. Note, frequency-domain is shown for convenient visualization, but the Detection Network, Modeling, and Prediction Network all operate on time-domain signals.
Analysis of the factors that determined success indicated that the time at which OOV signals occur is most critical. Therefore, OOV echoes were further broken down into eight time-bins, and the Dice score plotted in Figure 5. The median Dice scores — 0.165, 0.858, 0.892, 0.934, 0.960, 0.974, 0.978, and 0.978 — are poor in the first bin and improve thereafter. Note that these bins are not spaced equally to emphasize poor performance extremely early. The number of examples in each bin is 161, 289, 282, 329, 622, 1256, 1565, and 1633, respectively.
Figure 5.
Evaluation of Detection Network, Modeling, and Prediction Network. A testing set with 7200 (2400 examples with 3 different OOV echoes) unseen examples was used to evaluate the A) Detection Network and B) Modeling and Prediction Network. Performance across the whole test set is shown on the left-hand side. Performance across the binned center timepoint (τOOV) is shown across the right-hand side.
3.2.2. Modeling
The modeling optimization converged in 5,824 of the 5,827 examples where the detection network detected OOV artifacts and provided initial values. Across this subset of the examples, the modeling achieved a median log10 (fractional OOV remaining) of −2.19 (−2.90 – −1.19 inter-quartile range), i.e., a median reduction of more than two orders of magnitude. Figure 5 shows the resulting model for a synthetic OOV-corrupted example.
These values — broken down into 8 time-bins — are shown in Figure 5. The median log10(fractional OOV remaining) decreases across the time bins: 1.663, 1.324, 0.680, 0.223, −1.586, −2.276, −2.491, and −2.567.
3.2.3. Prediction Network:
In the 6,137 examples where OOV artifacts were present, the prediction network achieved a median log10 normed-MSE of −1.79 (−2.21 – −1.11 inter-quartile range). In the 5,824 examples where OOV artifacts were successfully modeled, the prediction network achieved a median log10 normed-MSE of −1.85 (−2.24 – −1.24 inter-quartile range). Figure 5 shows the Prediction Network’s output for a synthetic OOV-corrupted example.
OOVs were further broken down into 8 time-bins (Figure 5) early — the number of examples in each bin is 86, 226, 261, 312, 592, 1208, 1538, 1601. The median log10(fractional OOV remaining) decreases across the time bins: −0.207, −0.583, −0.862, −1.250, −1.577, −1.878, −2.005, and −2.052.
3.2.4. In Vivo Proof-of-Principle:
The Detection Network identified an OOV in 243 of 256 transients (94.9%). In these 243 OOV-detected transients, the modeling achieved a median reduction in standard deviation of 71.0 % (60.2 −75.3% inter-quartile range). The Prediction Network achieved a median reduction in standard deviation of 69.65% (66.33 %/72.7 % inter-quartile range) in this subset. In the full set of 256 transients, the Prediction Network achieved a median 69.4 % (65.3 – 72.6 % inter-quartile range) reduction in standard deviation. The standard deviation of the noise floor was found to account for a median of 10.3% (9.35–11.6 % inter-quartile range) of the standard deviation of signal within the time window for the 256 averages. A representative in vivo example is shown in Figure 6.
Figure 6.
In vivo MEGA-PRESS OOV-corrupted example. Results from Detection Network (green), Model (orange), and Prediction Network (blue) are shown below. Detection and Prediction CNNs identified and reconstructed the OOV echo, despite having never seen data acquired with 2.89 T, 2.4 kHz spectral width, 68 ms, editing, nor real RF pulses. Note, frequency-domain is shown for convenient visualization, but the Detection Network, Modeling, and Prediction Network all operate on time-domain signals.
4. Discussion:
AGNOSTIC is a benchmark MRS dataset for training and evaluating performance across various models. In order to make these synthetic data representative of in vivo brain MRS datasets, a total of 22 brain metabolites and 14 MM peaks were simulated within 270 basis sets, spanning field strengths from 1.4 T to 3.1 T and TEs from 10 to 80 ms. Parameterized water residual and noise were included. SNR and linewidths were assigned at random, independent of B0 or TE. The broad span of the dataset is key in training networks that generalize. While AGNOSTIC is broad in these dimensions, it does only represent simulated data for PRESS (Bottomley, 1982) acquisitions, and may benefit from expansion to include other pulse sequences, such as STEAM (Frahm et al., 1987), SPECIAL (Mekle et al., 2009; Mlynarik et al., 2006), LASER (Garwood & DelaBarre, 2001), and semi-LASER (Scheenen, Heerschap, et al., 2008; Scheenen, Klomp, et al., 2008), and edited versions including MEGA (Mescher et al., 1996, 1998) and Hadamard-encoded (Chan et al., 2016, 2019; Oeltzschner et al., 2019; Saleh et al., 2016) schemes. AGNOSTIC is limited by simulations that used ideal pulses, a calculated trade-off to emphasize generalizability across field strength, echo time, and spectral width, and thus fail to capture effects associated with spatially heterogeneous coupling evolution. The extent to which these limitations matter will depend on the applications that AGNOSTIC synthetic data are being used for.
The Detection network was highly successful, identifying 94.9% of the testing set where OOV artifacts were present. The precise value of this success metric is obviously impacted by the parameters of the OOVs – a later minimum OOV time would tend to increase performance, and earlier would degrade it. It is noteworthy that, although the training datasets never contained more than one OOV echo, the detection and prediction networks were able to handle more than one OOV echo in vivo data, presumably because CNNs operate locally within the FID. It is also encouraging that the networks generalized well to the in vivo data (Figure 6), which was collected with unseen acquisition parameters, i.e., edited MEGA-PRESS (Mescher et al., 1996, 1998) data acquired at 2.89 T with a TE of 68 ms, and 2.4 kHz spectral width.
In the exemplar OOV application, the success of the networks depended heavily on the timing of the OOV signal. The earliest OOV echoes were most challenging, unsurprisingly since such signals are broad Gaussian resonances that are indistinguishable from within-voxel MM and baseline signals. Indeed, the only feature that differentiates OOV signals from other broad components of the model is timing. It is conceptually helpful to consider this in the Fourier domain, even though all network processing is performed in the time domain. In the frequency domain, a mismatch between the echo-top and the acquisition start is represented as a first-order phase error of the signal associated with that echo. Where insufficient first-order phase exists to be represented within the linewidth of the signal in question, (which in the time domain corresponds to substantial truncation of the lefthand side of the echo), the network struggles to identify OOV signals.
In the context of this study, modeling and prediction are treated as two alternative approaches to OOV characterization. For early OOV signals, the modeling approach tended to mis-attribute non-artifact signal as OOV signal, a result that the metric scored as worse than no intervention. The median performance of the Prediction network, even for very early OOV signals, was close to zero. Both modeling and prediction performance improve as the OOV moves later in the acquired signal, with modeling improving faster than the network, and performing better than prediction beyond 120 ms. This strong performance of the model at least in part reflects the exact match between the generative model of the synthetic OOV artifacts and the model that is being used to extract them. More moderate performance might be expected for real in vivo examples – but the same may also be true for networks which have been trained with the same synthetic data and may have learned specifically to identify OOV signals that have a Gaussian kernel.
One key difference between most DL applications and applications in MRS, is the strict requirement to preserve amplitude fidelity in network outputs. A common approach to artifacts in DL is to return an artifact-free version of the network input. In contrast, the approach taken here is to return the artifact, which has the following benefits: it avoids networks over-learning the formulaic pattern of typical spectra; it reduces the impact of the lack of sequence diversity within the AGNOSTIC dataset; and it is less likely to impact the amplitudes of metabolite signals.
The ultimate goal of this work is to extract metabolite levels from MRS data that are not impacted by OOV artifacts. This problem can be addressed at several points: either by not acquiring data that contain OOV artifacts; by removing OOV artifacts post-acquisition; and by incorporating appropriate OOV model components into quantification model so that the impact of OOV is minimized. While the work presented here focuses primarily on the second context, it raises important potential applications in the other contexts. One motivator for developing the Detection network is the possibility of real-time deployment during sequence acquisition to trigger sequence changes when OOV artifacts are detected. The modeling applied here was time-restricted to a given window and ignored other components of the spectrum, but demonstrates potential for future integration within a full linear-combination model.
5. Conclusion:
In conclusion, we have presented the AGNOSTIC benchmark dataset which can be used for training and testing brain-specific 1H MRS deep learning models. This large synthetic dataset is open-source and encompasses a range of field strengths, TEs, and dwell times to ensure networks are robust to a variety of in vivo data acquisitions protocols. Using this dataset, we have demonstrated an exemplar use case to develop CNNs to detect and predict out-of-voxel artifacts.
Highlights:
AGNOSTIC is an open-source benchmark dataset for training and testing brain a wide range of 1H MRS deep learning models.
AGNOSTIC is intended to be step towards developing acquisition-agnostic deep learning models by including synthetic examples from 18 field strengths, 15 echo times, multiple dwell times, healthy and clinical brain concentrations, and a large range of spectral quality (i.e., SNR and linewidth).
Two deep learning models were trained using AGNOSTIC to detect and predict out-of-voxel echoes where the models generalized to in vivo data collected with unseen acquisition protocols.
Acknowledgments:
This work has been supported by The Henry L. Guenther Foundation, Sonderforschungsbereich (SFB) 974 (TP B07) of the German Research foundation, and the National Institute of Health, grants T32 AG00096, R00 AG062230, R21 EB033516, R01 EB016089, R01 EB023963, K00AG068440, P30 AG066519, R21 AG053040, R01 AG076942, P30 AG066519 and P41 EB031771.
Abbreviations:
- 1H
proton
- 2HG
β-hydroxyglutarate
- Ace
acetate
- AGNOSTIC
adaptable generalized neural-network open-source spectroscopy training dataset of individual components
- Ala
alanine
- Asc
ascorbate
- Asp
aspartate
- ATP
adenosine triphosphate
- bHB
β-hydroxybutyrate
- Cho
choline-containing compounds
- Cit
citrate
- CNN
Convolutional Neural Networks
- Cr
creatine
- Cys
Cysteine
- DL
deep learning
- EA
Ethanolamine
- EtOH
ethanol
- FID
free induction decay
- FWHM
full-width half-maximum
- GABA
gamma-aminobutyric acid
- Glc
glucose
- Gln
glutamine
- Glu
glutamate
- Glx
sum of glutamate and glutamine
- Gly
glycine
- Glyce
glycerol
- GM
gray matter
- GPC
glycerophosphocholine
- GSH
glutathione
- H2O
water
- HCar
homocarnosine
- Hist
histamine
- His
histidine
- ISMRM
international society for magnetic resonance in medicine
- Lac
lactate
- LASER
localization by adiabatic selective refocusing
- MEGA
Mescher-Garwood
- mI
myo-inositol
- MM
macromolecule
- MRS
magnetic resonance spectroscopy
- MSE
mean-squared error
- NAA
N-acetylaspartate
- NAAG
N-acetyl-aspartyl-glutamate
- OOV
out-of-voxel
- PCho
phosphocholine
- PCr
phosphocreatine
- PE
phosphoethanolamine
- Phenyl
phenylalanine
- PRESS
point resolved spectroscopy
- ReLu
rectified linear unit
- Ser
serine
- sI
scyllo-inositol
- sLASER
semi-adiabatic localization by adiabatic selective refocusing
- SPECIAL
spin echo full intensity acquired localiezed
- STEAM
stimulated echo acquisition mode
- SNR
signal-to-noise ratio
- T2
spin-spin relaxation time
- Tau
taurine
- tCho
sum of choline-containing metabolites
- tCr
sum of creatine and phosphocreatine
- TE
echo-time
- Thr
threonine
- tNAA
sum of N-acetyl-aspartate and N-acetyl-aspartyl-glutamate
- Tryp
Tryptophan
- Tyr
Tyrosine
- Val
Valine
Footnotes
CRediT authorship contribution statement:
Aaron T. Gudmundson: Conceptualization, Data Curation, Formal Analysis, Investigation, Methodology, Software, Visualization, Writing-original draft, Writing-Review & Editing. Christopher W. Davies-Jenkins: Data Curation, Resources, Writing-original draft, Writing-Review & Editing. İpek Özdemir: Data Curation, Writing-original draft, Resources, Writing-Review & Editing. Saipavitra Murali-Manohar: Data Curation, Writing-original draft, Writing-Review & Editing. Helge J. Zöllner: Data Curation, Resources, Writing-original draft, Writing-Review & Editing. Yulu Song: Data Curation, Resources, Writing-original draft, Writing-Review & Editing. Kathleen E. Hupfeld: Writing-original draft, Writing-Review & Editing. Alfons Schnitzler: Data Curation, Resources, Writing-Review & Editing. Georg Oeltzschner: Conceptualization, Supervision, Writing-original draft, Writing-Review & Editing. Craig Stark: Conceptualization, Funding acquisition, Project Administration, Resources, Supervision, Writing-original draft, Writing-Review & Editing. Richard A.E. Edden: Conceptualization, Funding acquisition, Project Administration, Resources, Supervision, Visualization, Writing-original draft, Writing-Review & Editing.
References:
- Abadi M., Agarwal A., Barham P., Brevdo E., Chen Z., Citro C., Corrado G. S., Davis A., Dean J., Devin M., Ghemawat S., Goodfellow I., Harp A., Irving G., Isard M., Jia Y., Jozefowicz R., Kaiser L., Kudlur M., … Zheng X. (2015). Tensorflow: Large-scale Machine Learning on Heterogeneous Distributed Systems. 10.5281/zenodo.4724125 [DOI] [Google Scholar]
- Agarap A. F. (2018). Deep learning using rectified linear units (relu). ArXiv Preprint ArXiv:1803.08375. [Google Scholar]
- Bloembergen N., Purcell E. M., & Pound R. V. (1948). Relaxation effects in nuclear magnetic resonance absorption. Physical Review, 73(7), 679–712. 10.1103/PhysRev.73.679 [DOI] [Google Scholar]
- Blum K. (1981). Density Matrix Theory and Applications (1st ed.). Springer US. 10.1007/978-1-4615-6808-7 [DOI] [Google Scholar]
- Bodenhausen G. (2011). Reflections of pathways: A short perspective on ‘Selection of coherence transfer pathways in NMR pulse experiments’. Journal of Magnetic Resonance, 213(2), 295–297. 10.1016/j.jmr.2011.08.004 [DOI] [PubMed] [Google Scholar]
- Bottomley P. A. (1982). Selective volume method for performing localized NMR spectroscopy. 19. https://patents.google.com/patent/US4480228A/en [Google Scholar]
- Carass A., Roy S., Gherman A., Reinhold J. C., Jesson A., Arbel T., Maier O., Handels H., Ghafoorian M., Platel B., Birenbaum A., Greenspan H., Pham D. L., Crainiceanu C. M., Calabresi P. A., Prince J. L., Roncal W. R. G., Shinohara R. T., & Oguz I. (2020). Evaluating White Matter Lesion Segmentations with Refined Sørensen-Dice Analysis. Scientific Reports, 10(1), 1–19. 10.1038/s41598-020-64803-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carr H. Y., & Purcell E. M. (1954). Effects of Diffusion on Free Precession in Nuclear Magnetic Resonance Experiments*t. [Google Scholar]
- Chan K. L., Edden R. A. E., & Barker P. B. (2019). Simultaneous editing of GABA and GSH with Hadamard - encoded MR spectroscopic imaging. July 2018, 21–32. 10.1002/mrm.27702 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chan K. L., Puts N. A. J., Schär M., Barker P. B., & Edden R. A. E. (2016). HERMES: Hadamard encoding and reconstruction of MEGA-edited spectroscopy. Magnetic Resonance in Medicine, 76(1), 11–19. 10.1002/mrm.26233 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chandler M., Jenkins C., Shermer S. M., & Langbein F. C. (2019). MRSNet: Metabolite Quantification from Edited Magnetic Resonance Spectra With Convolutional Neural Networks. 1–12. http://arxiv.org/abs/1909.03836 [Google Scholar]
- Chen D., Hu W., Liu H., Zhou Y., Qiu T., Huang Y., Wang Z., Lin M., Lin L., Wu Z., Wang J., Chen H., Chen X., Yan G., Guo D., Lin J., & Qu X. (2023). Magnetic Resonance Spectroscopy Deep Learning Denoising Using Few In Vivo Data. IEEE Transactions on Computational Imaging, 1–12. 10.1109/TCI.2023.3267623 [DOI] [Google Scholar]
- Chollet F., & others. (2015). Keras. [Google Scholar]
- Cudalbu C., Behar K. L., Bhattacharyya P. K., Bogner W., Borbath T., de Graaf R. A., Gruetter R., Henning A., Juchem C., Kreis R., Lee P., Lei H., Marjańska M., Mekle R., Murali-Manohar S., Považan M., Rackayová V., Simicic D., Slotboom J., … Mlynárik V. (2021). Contribution of macromolecules to brain 1H MR spectra: Experts’ consensus recommendations. NMR in Biomedicine, 34(5), 1–24. 10.1002/nbm.4393 [DOI] [PMC free article] [PubMed] [Google Scholar]
- De Graaf R. A., Brown P. B., McIntyre S., Nixon T. W., Behar K. L., & Rothman D. L. (2006). High magnetic field water and metabolite proton T1 and T 2 relaxation in rat brain in vivo. Magnetic Resonance in Medicine, 56(2), 386–394. 10.1002/mrm.20946 [DOI] [PubMed] [Google Scholar]
- Dice L. R. (1945). Measures of the Amount of Ecologic Association Between Species Author (s): Dice Lee R. Published by : Ecological Society of America Stable; URL : http://www.jstor.org/stable/1932409. Ecology, 26(3), 297–302. [Google Scholar]
- Dziadosz M., Rizzo R., Kyathanahally S. P., & Kreis R. (2023). Denoising single MR spectra by deep learning: Miracle or mirage? Magnetic Resonance in Medicine. 10.1002/mrm.29762 [DOI] [PubMed] [Google Scholar]
- Ernst T., & Chang L. (1996). Elimination of artifacts in short echo time1H MR spectroscopy of the frontal lobe. Magnetic Resonance in Medicine, 36(3), 462–468. 10.1002/mrm.1910360320 [DOI] [PubMed] [Google Scholar]
- Fano U. (1957). Description of States in Quantum Mechanics by Density Matrix and Operator Techniques. Reviews of Modern Physics, 29(1), 74–93. 10.1103/RevModPhys.29.74 [DOI] [Google Scholar]
- Farrar T. (1990). Density matrices in NMR spectroscopy: Part I. Concepts in Magnetic Resonance, 2, 1–12. [Google Scholar]
- Fei-Fei L., Deng J., & Li K. (2010). ImageNet: Constructing a large-scale image database. Journal of Vision, 9(8), 1037–1037. 10.1167/9.8.1037 [DOI] [Google Scholar]
- Frahm J., Merboldt K. D., & Hänicke W. (1987). Localized proton spectroscopy using stimulated echoes. Journal of Magnetic Resonance (1969), 72(3), 502–508. 10.1016/0022-2364(87)90154-5 [DOI] [Google Scholar]
- Garwood M., & DelaBarre L. (2001). The return of the frequency sweep: Designing adiabatic pulses for contemporary NMR. Journal of Magnetic Resonance. 10.1006/jmre.2001.2340 [DOI] [PubMed] [Google Scholar]
- Gassenmaier S., Afat S., Nickel D., Kannengiesser S., Herrmann J., Hoffmann R., & Othman A. E. (2021). Application of a Novel Iterative Denoising and Image Enhancement Technique in T1-Weighted Precontrast and Postcontrast Gradient Echo Imaging of the Abdomen. Investigative Radiology, 56(5), 328–334. 10.1097/RLI.0000000000000746 [DOI] [PubMed] [Google Scholar]
- Gassenmaier S., Küstner T., Nickel D., Herrmann J., Hoffmann R., Almansour H., Afat S., Nikolaou K., & Othman A. E. (2021). Deep Learning Applications in Magnetic Resonance Imaging: Has the Future Become Present? Diagnostics, 11(12), 2181. 10.3390/diagnostics11122181 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Giapitzakis I. A., Avdievich N., & Henning A. (2018). Characterization of macromolecular baseline of human brain using metabolite cycled semi-LASER at 9.4T. Magnetic Resonance in Medicine, 80(2), 462–473. 10.1002/mrm.27070 [DOI] [PubMed] [Google Scholar]
- Goodfellow I., Bengio Y., & Courville A. (2016). Deep Learning. The MIT Press. http://www.deeplearningbook.org/ [Google Scholar]
- Govindaraju V., Young K., & Maudsley A. A. (2000). Proton NMR chemical shifts and coupling constants for brain metabolites. NMR in Biomedicine, 13(3), 129–153. [DOI] [PubMed] [Google Scholar]
- Gudmundson A. T., Koo A., Virovka A., Amirault A. L., Soo M., Cho J. H., Oeltzschner G., Edden R. A. E., & Stark C. E. L. (2023). Meta-analysis and Open-source Database for In Vivo Brain Magnetic Resonance Spectroscopy Studies of Health and Disease. 10.1101/2023.02.10.528046 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gurbani S. S., Schreibmann E., Maudsley A. A., Cordova J. S., Soher B. J., Poptani H., Verma G., Barker P. B., Shim H., & Cooper L. A. D. (2018). A convolutional neural network to filter artifacts in spectroscopic MRI. Magnetic Resonance in Medicine, 80(5), 1765–1775. 10.1002/mrm.27166 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gurbani S. S., Sheriff S., Maudsley A. A., Shim H., & Cooper L. A. D. (2019). Incorporation of a spectral model in a convolutional neural network for accelerated spectral fitting. Magnetic Resonance in Medicine, 81(5), 3346–3357. 10.1002/mrm.27641 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harris C. R., Millman K. J., van der Walt S. J., Gommers R., Virtanen P., Cournapeau D., Wieser E., Taylor J., Berg S., Smith N. J., Kern R., Picus M., Hoyer S., van Kerkwijk M. H., Brett M., Haldane A., del Río J. F., Wiebe M., Peterson P., … Oliphant T. E. (2020). Array programming with NumPy. Nature, 585(7825), 357–362. 10.1038/s41586-020-2649-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hatami N., Sdika M., & Ratiney H. (2018). Magnetic resonance spectroscopy quantification using deep learning. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 11070 LNCS, 467–475. 10.1007/978-3-030-00928-1_53 [DOI] [Google Scholar]
- He K., Zhang X., Ren S., & Sun J. (2015). Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. 2015 IEEE International Conference on Computer Vision (ICCV), 2015 Inter, 1026–1034. 10.1109/ICCV.2015.123 [DOI] [Google Scholar]
- Held G., Noack F., Pollak V., & Melton B. (1973). Protonenspinrelaxation und Wasserbeweglichkeit in Muskelgewebe / Proton Spin Relaxation and Mobility of Water in Muscle Tissue. Zeitschrift Für Naturforschung C, 28(1–2), 59–62. 10.1515/znc-1973-1-209 [DOI] [PubMed] [Google Scholar]
- Iqbal Z., Nguyen D., Hangel G., Motyka S., Bogner W., & Jiang S. (2019). Super-Resolution 1H Magnetic Resonance Spectroscopic Imaging Utilizing Deep Learning. Frontiers in Oncology, 9. 10.3389/fonc.2019.01010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Iqbal Z., Nguyen D., Thomas M. A., & Jiang S. (2021). Deep learning can accelerate and quantify simulated localized correlated spectroscopy. Scientific Reports, 11(1), 8727. 10.1038/s41598-021-88158-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jang J., Lee H. H., Park J.-A., & Kim H. (2021). Unsupervised anomaly detection using generative adversarial networks in 1H-MRS of the brain. Journal of Magnetic Resonance, 325, 106936. 10.1016/j.jmr.2021.106936 [DOI] [PubMed] [Google Scholar]
- Juchem C., Cudalbu C., Graaf R. A., Gruetter R., Henning A., Hetherington H. P., & Boer V. O. (2021). B 0 shimming for in vivo magnetic resonance spectroscopy: Experts’ consensus recommendations. NMR in Biomedicine, 34(5), 1–20. 10.1002/nbm.4350 [DOI] [PubMed] [Google Scholar]
- Kingma D. P., & Ba J. L. (2015). Adam: A method for stochastic optimization. 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings, 1–15. [Google Scholar]
- Koch K. M., Rothman D. L., & de Graaf R. A. (2009). Optimization of static magnetic field homogeneity in the human and animal brain in vivo. In Progress in Nuclear Magnetic Resonance Spectroscopy (Vol. 54, Issue 2, pp. 69–96). 10.1016/j.pnmrs.2008.04.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kreis R. (2004). Issues of spectral quality in clinical1H-magnetic resonance spectroscopy and a gallery of artifacts. NMR in Biomedicine, 17(6), 361–381. 10.1002/nbm.891 [DOI] [PubMed] [Google Scholar]
- Kyathanahally S. P., Döring A., & Kreis R. (2018). Deep learning approaches for detection and removal of ghosting artifacts in MR spectroscopy. Magnetic Resonance in Medicine, 80(3), 851–863. 10.1002/mrm.27096 [DOI] [PubMed] [Google Scholar]
- Lam F., Li Y., & Peng X. (2020). Constrained Magnetic Resonance Spectroscopic Imaging by Learning Nonlinear Low-Dimensional Models. IEEE Transactions on Medical Imaging, 39(3), 545–555. 10.1109/TMI.2019.2930586 [DOI] [PubMed] [Google Scholar]
- Landheer K., & Juchem C. (2019). Dephasing optimization through coherence order pathway selection (DOTCOPS) for improved crusher schemes in MR spectroscopy. Magnetic Resonance in Medicine, 81(4), 2209–2222. 10.1002/mrm.27587 [DOI] [PubMed] [Google Scholar]
- Lecun Y., Bengio Y., & Hinton G. (2015). Deep learning. Nature, 521, 436–444. 10.1038/nature14539 [DOI] [PubMed] [Google Scholar]
- Lee H. H., & Kim H. (2019). Intact metabolite spectrum mining by deep learning in proton magnetic resonance spectroscopy of the brain. Magnetic Resonance in Medicine, 82(1), 33–48. 10.1002/mrm.27727 [DOI] [PubMed] [Google Scholar]
- Lee H. H., & Kim H. (2020). Deep learning-based target metabolite isolation and big data-driven measurement uncertainty estimation in proton magnetic resonance spectroscopy of the brain. Magnetic Resonance in Medicine, 84(4), 1689–1706. 10.1002/mrm.28234 [DOI] [PubMed] [Google Scholar]
- Lee H., Lee H. H., & Kim H. (2020). Reconstruction of spectra from truncated free induction decays by deep learning in proton magnetic resonance spectroscopy. Magnetic Resonance in Medicine, 84(2), 559–568. 10.1002/mrm.28164 [DOI] [PubMed] [Google Scholar]
- Deng Li. (2012). The MNIST Database of Handwritten Digit Images for Machine Learning Research [Best of the Web]. IEEE Signal Processing Magazine, 29(6), 141–142. 10.1109/MSP.2012.2211477 [DOI] [Google Scholar]
- Li Y., Wang Z., & Lam F. (2020). Separation of Metabolite and Macromolecule Signals for 1 H-Mrsi Using Learned Nonlinear Models. Proceedings - International Symposium on Biomedical Imaging, 2020-April, 1725–1728. 10.1109/ISBI45749.2020.9098365 [DOI] [Google Scholar]
- Lin L., Považan M., Berrington A., Chen Z., & Barker P. B. (2019). Water removal in MR spectroscopic imaging with L2 regularization. Magnetic Resonance in Medicine, 82(4), 1278–1287. 10.1002/mrm.27824 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin T.-Y., Maire M., Belongie S., Bourdev L., Girshick R., Hays J., Perona P., Ramanan D., Zitnick C. L., & Dollár P. (2014). Microsoft COCO: Common Objects in Context. http://arxiv.org/abs/1405.0312 [Google Scholar]
- Lundervold A. S., & Lundervold A. (2019). An overview of deep learning in medical imaging focusing on MRI. Zeitschrift Für Medizinische Physik, 29(2), 102–127. 10.1016/j.zemedi.2018.11.002 [DOI] [PubMed] [Google Scholar]
- Ma D. J., Le H. A. M., Ye Y., Laine A. F., Lieberman J. A., Rothman D. L., Small S. A., & Guo J. (2022). MR spectroscopy frequency and phase correction using convolutional neural networks. Magnetic Resonance in Medicine, 87(4), 1700–1710. 10.1002/mrm.29103 [DOI] [PubMed] [Google Scholar]
- Marshall I., Higinbotham J., Bruce S., & Freise A. (1997). Use of Voigt lineshape for quantification of in vivo 1H spectra. Magnetic Resonance in Medicine, 37(5), 651–657. 10.1002/mrm.1910370504 [DOI] [PubMed] [Google Scholar]
- Maudsley A. A., Andronesi O. C., Barker P. B., Bizzi A., Bogner W., Henning A., Nelson S. J., Posse S., Shungu D. C., & Soher B. J. (2021). Advanced magnetic resonance spectroscopic neuroimaging: Experts’ consensus recommendations. NMR in Biomedicine, 34(5), 1–22. 10.1002/nbm.4309 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mekle R., Mlynárik V., Gambarota G., Hergt M., Krueger G., & Gruetter R. (2009). MR spectroscopy of the human brain with enhanced signal intensity at ultrashort echo times on a clinical platform at 3T and 7T. Magnetic Resonance in Medicine, 61(6), 1279–1285. 10.1002/mrm.21961 [DOI] [PubMed] [Google Scholar]
- Mescher M., Merkle H., Kirsch J., Garwood M., & Gruetter R. (1998). Simultaneous in vivo spectral editing and water suppression. NMR in Biomedicine, 11(6), 266–272. [DOI] [PubMed] [Google Scholar]
- Mescher M., Tannus A., O’Neil Johnson M., & Garwood M. (1996). Solvent suppression using selective echo dephasing. Journal of Magnetic Resonance - Series A, 123(2), 226–229. 10.1006/jmra.1996.0242 [DOI] [Google Scholar]
- Michaeli S., Garwood M., Zhu X.-H., DelaBarre L., Andersen P., Adriany G., Merkle H., Ugurbil K., & Chen W. (2002). ProtonT2 relaxation study of water, N-acetylaspartate, and creatine in human brain using Hahn and Carr-Purcell spin echoes at 4T and 7T. Magnetic Resonance in Medicine, 47(4), 629–633. 10.1002/mrm.10135 [DOI] [PubMed] [Google Scholar]
- Mlynarik V., Gambarota G., Frenkel H., & Gruetter R. (2006). Localized short-echo-time proton MR spectroscopy with full signal-intensity acquisition. MAGNETIC RESONANCE IN MEDICINE, 56(5), 965–970. 10.1002/mrm.21043 [DOI] [PubMed] [Google Scholar]
- Murali-Manohar S., Borbath T., Wright A. M., Soher B., Mekle R., & Henning A. (2020). T2 relaxation times of macromolecules and metabolites in the human brain at 9.4 T. Magnetic Resonance in Medicine, 84(2), 542–558. 10.1002/mrm.28174 [DOI] [PubMed] [Google Scholar]
- Near J., Harris A. D., Juchem C., Kreis R., Marjańska M., Öz G., Slotboom J., Wilson M., & Gasparovic C. (2021). Preprocessing, analysis and quantification in single-voxel magnetic resonance spectroscopy: experts’ consensus recommendations. NMR in Biomedicine, 34(5), 1–23. 10.1002/nbm.4257 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Near J., Leung I., Claridge T., Cowen P., & Jezzard P. (2012). Chemical shifts and coupling constants of the GABA spin system. Proc. Intl. Soc. Mag. Reson. Med., 20(1993). [Google Scholar]
- Oeltzschner G., Saleh M. G., Rimbault D., Mikkelsen M., Chan K. L., Puts N. A. J., & Edden R. A. E. (2019). Advanced Hadamard-encoded editing of seven low-concentration brain metabolites: Principles of HERCULES. NeuroImage, 185(September 2018), 181–190. 10.1016/j.neuroimage.2018.10.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oeltzschner G., Zöllner H. J., Hui S. C. N., Mikkelsen M., Saleh M. G., Tapper S., & Edden R. A. E. (2020). Osprey: Open-source processing, reconstruction & estimation of magnetic resonance spectroscopy data. Journal of Neuroscience Methods, 343(June), 108827. 10.1016/j.jneumeth.2020.108827 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Öz G., Deelchand D. K., Wijnen J. P., Mlynárik V., Xin L., Mekle R., Noeske R., Scheenen T. W. J., Tkáč I., Andronesi O., Barker P. B., Bartha R., Berrington A., Boer V., Cudalbu C., Emir U. E., Ernst T., Fillmer A., Heerschap A., … Wilson M. (2021). Advanced single voxel 1H magnetic resonance spectroscopy techniques in humans: Experts’ consensus recommendations. NMR in Biomedicine, 34(5), 1–18. 10.1002/nbm.4236 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Powell M. J. D. (1964). An efficient method for finding the minimum of a function of several variables without calculating derivatives. The Computer Journal, 7(2), 155–162. 10.1093/comjnl/7.2.155 [DOI] [Google Scholar]
- Powell M. J. D. (1994). A Direct Search Optimization Method That Models the Objective and Constraint Functions by Linear Interpolation. In Gomez S. & Hennart J.-P. (Eds.), Advances in Optimization and Numerical Analysis (pp. 51–67). Springer Netherlands. 10.1007/978-94-015-8330-5_4 [DOI] [Google Scholar]
- Rizzo R., Dziadosz M., Kyathanahally S. P., Shamaei A., & Kreis R. (2023). Quantification of MR spectra by deep learning in an idealized setting: Investigation of forms of input, network architectures, optimization by ensembles of networks, and training bias. Magnetic Resonance in Medicine, 89(5), 1707–1727. 10.1002/mrm.29561 [DOI] [PubMed] [Google Scholar]
- Saleh M. G., Oeltzschner G., Chan K. L., Puts N. A. J., Mikkelsen M., Schär M., Harris A. D., & Edden R. A. E. (2016). Simultaneous edited MRS of GABA and glutathione. NeuroImage, 142, 576–582. 10.1016/j.neuroimage.2016.07.056 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scheenen T. W. J., Heerschap A., & Klomp D. W. J. (2008). Towards 1H-MRSI of the human brain at 7T with slice-selective adiabatic refocusing pulses. Magnetic Resonance Materials in Physics, Biology and Medicine, 21(1–2), 95–101. 10.1007/s10334-007-0094-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scheenen T. W. J., Klomp D. W. J., Wijnen J. P., & Heerschap A. (2008). Short echo time1H-MRSI of the human brain at 3T with minimal chemical shift displacement errors using adiabatic refocusing pulses. Magnetic Resonance in Medicine, 59(1), 1–6. 10.1002/mrm.21302 [DOI] [PubMed] [Google Scholar]
- Shamaei A., Starcukova J., Pavlova I., & Starcuk Z. (2023). Model-informed unsupervised deep learning approaches to frequency and phase correction of MRS signals. Magnetic Resonance in Medicine, 89(3), 1221–1236. 10.1002/mrm.29498 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Song Y., Zöllner H. J., Hui S. C. N., Hupfeld K. E., Oeltzschner G., & Edden R. A. E. (2023). Impact of gradient scheme and non-linear shimming on out-of-voxel echo artifacts in edited MRS. NMR in Biomedicine, 36(2). 10.1002/nbm.4839 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sørensen O. W., Eich G. W., Levitt M. H., Bodenhausen G., & Ernst R. R. (1984). Product operator formalism for the description of NMR pulse experiments. Progress in Nuclear Magnetic Resonance Spectroscopy, 16, 163–192. 10.1016/0079-6565(84)80005-9 [DOI] [Google Scholar]
- Sørensen T. (1948). A method of establishing groups of equal amplitude in plant sociology based on similarity of species and its application to analyses of the vegetation on Danish commons. Kongelige Danske Videnskabernes Selskab, 5(4), 1–34. [Google Scholar]
- Starck G., Carlsson A., Ljungberg M., & Forssell-Aronsson E. (2009). k-space analysis of point-resolved spectroscopy (PRESS) with regard to spurious echoes in in vivo (1)H MRS. NMR IN BIOMEDICINE, 22(2), 137–147. 10.1002/nbm.1289 [DOI] [PubMed] [Google Scholar]
- Tapper S., Mikkelsen M., Dewey B. E., Zöllner H. J., Hui S. C. N., Oeltzschner G., & Edden R. A. E. (2021). Frequency and phase correction of J-difference edited MR spectra using deep learning. Magnetic Resonance in Medicine, 85(4), 1755–1765. 10.1002/mrm.28525 [DOI] [PMC free article] [PubMed] [Google Scholar]
- The MathWorks Inc. (2022). MATLAB version: 9.13.0 (R2022b). The MathWorks Inc. https://www.mathworks.com [Google Scholar]
- Tkáč I., Andersen P., Adriany G., Merkle H., Uǧurbil K., & Gruetter R. (2001). In vivo 1 H NMR spectroscopy of the human brain at 7 T. Magnetic Resonance in Medicine, 46(3), 451–456. 10.1002/mrm.1213 [DOI] [PubMed] [Google Scholar]
- Van Rossum G., & Drake F. L. (2009). Python 3 Reference Manual. CreateSpace. [Google Scholar]
- Virtanen P., Gommers R., Oliphant T. E., Haberland M., Reddy T., Cournapeau D., Burovski E., Peterson P., Weckesser W., Bright J., van der Walt S. J., Brett M., Wilson J., Millman K. J., Mayorov N., Nelson A. R. J., Jones E., Kern R., Larson E., … Vázquez-Baeza Y. (2020). SciPy 1.0: fundamental algorithms for scientific computing in Python. Nature Methods, 17(3), 261–272. 10.1038/s41592-019-0686-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilson M., Andronesi O., Barker P. B., Bartha R., Bizzi A., Bolan P. J., Brindle K. M., Choi I., Cudalbu C., Dydak U., Emir U. E., Gonzalez R. G., Gruber S., Gruetter R., Gupta R. K., Heerschap A., Henning A., Hetherington H. P., Huppi P. S., … Howe F. A. (2019). Methodological consensus on clinical proton MRS of the brain: Review and recommendations. Magnetic Resonance in Medicine, 82(2), 527–550. 10.1002/mrm.27742 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yablonskiy D. A., & Haacke E. M. (1994). Theory of NMR signal behavior in magnetically inhomogeneous tissues: The static dephasing regime. Magnetic Resonance in Medicine, 32(6), 749–763. 10.1002/mrm.1910320610 [DOI] [PubMed] [Google Scholar]
- Zhang Y., & Shen J. (2023). Quantification of spatially localized MRS by a novel deep learning approach without spectral fitting. Magnetic Resonance in Medicine. 10.1002/mrm.29711 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zöllner H. J., Thiel T. A., Füllenbach N.-D., Jördens M. S., Ahn S., Wilms L. M., Ljimani A., Häussinger D., Butz M., Wittsack H.-J., Schnitzler A., & Oeltzschner G. (2023). J-difference GABA-edited MRS reveals altered cerebello-thalamo-cortical metabolism in patients with hepatic encephalopathy. Metabolic Brain Disease, 38(4), 1221–1238. 10.1007/s11011-023-01174-x [DOI] [PMC free article] [PubMed] [Google Scholar]





