Abstract
J-difference-edited MRS is widely used to study GABA in the human brain. Editing for low-concentration target molecules (such as GABA) typically exhibits lower signal-to-noise ratio (SNR) than conventional non-edited MRS, varying with acquisition region, volume and duration. Moreover, spectral lineshape may be influenced by age-, pathology-, or brain-region-specific effects of metabolite T2, or by task-related blood-oxygen level dependent (BOLD) changes in functional MRS contexts. Differences in both SNR and lineshape may have systematic effects on concentration estimates derived from spectral modelling.
The present study characterises the impact of lineshape and SNR on GABA+ estimates from different modelling algorithms: FSL-MRS, Gannet, LCModel, Osprey, spant and Tarquin. Publicly available multi-site GABA-edited data (222 healthy subjects from 20 sites; conventional MEGA-PRESS editing; TE = 68 ms) were pre-processed with a standardised pipeline, then filtered to apply controlled levels of Lorentzian and Gaussian linebroadening and SNR reduction.
Increased Lorentzian linewidth was associated with a 2–5% decrease in GABA+ estimates per Hz, observed consistently (albeit to varying degrees) across datasets and most algorithms. Weaker, often opposing effects were observed for Gaussian linebroadening. Variations are likely caused by differing baseline parametrization and lineshape constraints between models. Effects of linewidth on other metabolites (e.g., Glx and tCr) varied, suggesting that a linewidth confound may persist after scaling to an internal reference.
These findings indicate a potentially significant confound for studies where linewidth may differ systematically between groups or experimental conditions, e.g. due to T2 differences between brain regions, age, or pathology, or varying T2* due to BOLD-related changes. We conclude that linewidth effects need to be rigorously considered during experimental design and data processing, for example by incorporating linewidth into statistical analysis of modelling outcomes or development of appropriate lineshape matching algorithms.
Keywords: MRS, GABA, spectral editing, MEGA-PRESS, quantification, linewidth, functional spectroscopy
Graphical Abstract

To assess the degree to which aspects of linewidth, lineshape and SNR may confound GABA+ estimates, a collection of in-vivo datasets were quantified with six modelling algorithms, with linebroadening and SNR varied experimentally. Most algorithms showed a strong association between GABA+ estimate and Lorentzian linebroadening (2–5% decrease per Hz), with weaker effects for Gaussian broadening. This indicates a potentially significant confound in cases of differing relaxation parameters between groups or experimental conditions.
1. Introduction
The impact of spectral quality on metabolite concentration estimates from MR spectroscopy (MRS) data is a vital yet under-investigated topic, particularly in the context of specialised editing sequences such as MEGA-PRESS1,2, frequently used for the assessment of γ-aminobutyric acid (GABA). The linewidth, lineshape3,4 and signal-to-noise ratio (SNR) of MRS data can vary substantially depending on acquisition parameters, anatomical region of the voxels, and subject groups. This may impact fitting outcomes and therefore compromise the reproducibility and reliability of metabolite estimation5–9. Prior studies have investigated these factors for non-edited PRESS10, JPRESS11, LASER12,13, PEPSI14 and STEAM15,16 data, and outcomes from a multi-site fitting challenge of synthetic non-edited PRESS data17 identified linewidth-related trends in certain conditions. Another recent study18 specifically assessed the impact of SNR and linewidth on a synthetic MEGA-PRESS dataset for two of the algorithms considered herein. However, many existing studies report reliability using absolute (unsigned) metrics, reflecting the magnitude of estimation errors without regard to direction – perhaps leading to the fallacious assumption that estimation error is normally distributed, without appreciable bias. Although informative, these metrics mask a potentially more severe issue: the risk of systematic differences in GABA+ estimation as a function of signal quality, which may bias outcomes and lead to spurious findings if not adequately considered. Many studies also fail to differentiate between Lorentzian and Gaussian broadening mechanisms.
Factors of signal quality become particularly critical in the case of time-resolved or functional MR spectroscopy (fMRS), where metabolic dynamics and design of the functional paradigm may further constrain the number of transients contributing to the resultant spectra, and hence the achievable SNR. Moreover, local variations in effective transverse relaxation rate (T2*) associated with blood oxygen level dependent (BOLD) changes are known to have an impact on spectral linewidth. In this context, linewidth-related biases have been demonstrated previously19,20, albeit limited to Lorentzian effects on unedited MRS at higher field strengths.
In non-functional contexts, the competing demands of increased anatomical specificity and reduced scan times lead to the question of just how much of a compromise in SNR may be acceptable. Within the same subject, tissue composition and homogeneity, achievable shim quality and hence spectral lineshape will vary between different brain regions. Between subjects or sessions, factors such as age, medication and pathology may also have an impact, as will hardware and sequence parameters between different sites and different study protocols. In all these scenarios, the ability to robustly compare concentration estimates requires rigorous characterisation of any associated biases, and strategies to account for these.
The present study focuses on the impact of linewidth and SNR variation on concentration estimates from GABA-edited MRS data, obtained using the MEGA-PRESS sequence1,2. A number of modelling algorithms are available for the processing and quantification of this data, adopting different strategies for separating the signals of interest from overlapping metabolite and background signal. Variability between algorithms has been investigated previously for short-echo-time data 9,15,21,22, and for GABA-edited MRS data 23. However, for many of these algorithms a comprehensive investigation of the impact of signal quality is currently lacking.
This study therefore aims to characterise the nature and extent of linewidth- and SNR-related effects on concentration estimates obtained from GABA-edited MRS data, and the degree to which these effects vary across sites or according to choice of modelling algorithm. The study compares estimates of GABA+ (GABA modelled together with underlying co-edited macromolecule signals) quantified using six modelling algorithms: FSL-MRS24, Gannet25, LCModel26, Osprey27, spant28 and Tarquin29,30, building on earlier work23 comparing several of these algorithms. In-vivo data were filtered to simulate acquisitions of lower SNR (by reducing the number of transients contributing to the averaged spectra for modelling) and increased linewidth (both by Lorentzian and Gaussian linebroadening); results are primarily considered in relation to the unfiltered case. Complementary to the in-vivo findings, a series of synthetic datasets were generated with controlled metabolite linewidth and nominal GABA+ content, progressively incorporating additional confounding factors to independently assess the impact of background/noise signal and co-edited macromolecules on the observed outcomes.
2. Methods
2.1. In-vivo datasets
The in-vivo datasets underlying the present analysis were obtained from the Big GABA 31,32 repository on NITRC, https://www.nitrc.org/projects/biggaba; these datasets have been extensively characterised in previous studies23,31–33. Data originated from twenty 3T MRI scanners, each at a different site, covering three major manufacturers (GE, Philips, Siemens). Full details on the hardware and software configurations at each site are described in the previous studies 23,31–33, along with details on sample composition and acquisition protocol; an MRSinMRS checklist34 and other key details are reproduced in Supplementary Tables 1 and 2.
The datasets include GABA-edited spectra (TR/TE = 2000/68 ms, 320 averages, with editing pulses at 1.9/7.46 ppm for edit-ON/-OFF respectively) acquired from a 3 × 3 × 3 cm3 voxel in the posterior cingulate region, along with water-unsuppressed reference data (8 or 16 averages) from the same region. Tissue fractions were derived previously31 from T1-weighted structural images. The present sample consists of data from 222 adult volunteers in the 18–36 years age range, with approximately even female/male split and having no known neurological or psychiatric illness. Datasets were acquired in accordance with ethical standards of the respective local institutional review boards, with those subjects examined herein having consented to sharing of deidentified data for further study.
2.1.1. In-vivo data preparation
After initially loading with import functions and coil combination from Gannet25, in-vivo spectral data were processed using a common pipeline built on FID-A35 functionality and described fully in an earlier study23. The basic pipeline includes rejection of motion-corrupted transients, alignment of individual transients by spectral registration36, averaging, eddy-current correction37, zero-order phase adjustment of each sub-spectrum according to a dual-Lorentzian model for creatine (Cr) and choline (Cho), before final alignment between edit-ON and edit-OFF sub-spectra, again using spectral registration36.
Datasets were subsequently filtered to simulate spectra of varying quality. After initial loading, metabolite data were sub-sampled to reduce the effective number of averages, and thereby the SNR. Random sampling of transients was performed after spectral registration, immediately before averaging, retaining adjacent edit-ON/edit-OFF pairs but without attention to phase cycling. Randomly sampled subsets proportional to [all, 1/2, 1/4, 1/8, 1/16, 1/32] of the available data (i.e., [320, 160, 80, 40, 20, 10] averages) were selected.
To simulate acquisitions of varying linewidth, Lorentzian linebroadening () was applied to the resultant spectra from each of the randomized subsets, by multiplication of the time-domain FID by an exponential decay function (, where ). LB was applied at 0–2 Hz in steps of 0.2 Hz, 2–5 Hz in steps of 0.5 Hz, and 5–10 Hz in steps of 1 Hz. A similar approach was used to separately assess Gaussian line-broadening (LBgauss), multiplying the FID by (with )38,39. Filtered data were saved at the same resolution as the input data, without zero-fill. No filtering was performed on water reference data.
2.1.2. Quality Control
Quality of the incoming in-vivo data and initial processing was evaluated using only the unfiltered spectra – that is, spectra processed using the full set of available transients (after rejecting motion-corrupted transients) and without additional linebroadening, before fitting. Two rejection criteria (adopted from an earlier study23) were applied on a per-dataset basis, designated R1 and R2 in subsequent usage; acquisitions which failed either of these criteria were excluded from further analysis:
R1 captures spectra having strongly aberrant features in the fit range: processing was deemed to have failed if the 0-lag cross-correlation of the normalized, reconstructed frequency domain difference spectrum in the metabolite range (2.6–4.2 ppm) with the normalized mean of all other difference spectra was below 0.5 or differed from the group mean by more than three standard deviations.
R2 establishes thresholds on basic signal quality metrics: linewidth (FWHM > 10 Hz40, measured from the negative n-acetyl aspartate (NAA) peak in the difference spectrum (NAAdiff)), and SNR (< 80, defined by maximum negative peak amplitude of NAAdiff in the [1.8, 2.2] ppm interval, over standard deviation across the [−2, 0] ppm range.
The modelling algorithms tested vary greatly in their reported quality-of-fit metrics (%SD, CRLB, FitError, etc); in the context of experimentally manipulated SNR and linewidth, these may present different biases41 and limitations. Since our analysis involves deliberately degrading SNR and linewidth (in some cases beyond the range which would ordinarily be considered acceptable), no further rejection criteria were applied to the filtered spectra. Extreme outliers in the modelled GABA+/H2O estimates (see section 2.4) were removed according to a third criterion:
R3 rejects extreme outliers from the modelled GABA+/H2O estimates. Individual estimates diverging from the median by more than five times the median absolute deviation (MAD) for that algorithm were rejected from subsequent analysis, with median/MAD parameters evaluated on a “higher quality” subset: estimates modelled on the full set of transients with 0–6 Hz linebroadening.
For the simulated datasets (section 2.3), only R3 was applied, separately for each of the nominal simulated GABA+ concentrations. Outlier-robust methods were also adopted in subsequent analyses, wherever feasible.
2.2. Basis Set Preparation
All the assessed fitting algorithms (except Gannet) require prior knowledge in the form of a basis set. For each algorithm, the same vendor-specific basis sets were used, in a format appropriate to the tool. These had been previously created23,27 using fast spatially resolved 2D density-matrix simulations42 implemented in FID-A35, with ideal excitation pulses and vendor-specific refocusing pulses and timings, and using chemical shifts and J-coupling coefficients from Kaiser et al.43. Basis sets included components for GABA, Glutamate (Glu), Glutamine (Gln), glutathione (GSH), NAA, n-acetylaspartylglutamate (NAAG), and parametrized Gaussian components representing co-edited macromolecules around 0.91 ppm (denoted MM09ex; FWHM=10.9 Hz), and co-edited macromolecule signal underlying the GABA peak at 3.0 ppm (denoted MM3co) with FWHM = 14 Hz and scaling equivalent to two protons44. The sum of GABA and MM3co fit components is reported herein as GABA+.
2.3. Synthetic datasets
To independently assess the contribution of overlapping metabolite signal, co-edited macromolecule signal and background signals (baseline, noise, artefactual components) to any observed effects, a series of hybrid spectra were synthesized, combining simulated metabolite components with realistic background signal derived as follows from the in-vivo data. For each of the unfiltered in-vivo spectra, the modelled baseline and fit residuals from each of the basic modelling algorithms in an earlier analysis23 were extracted. For each point across the fit range (0.2 – 4.2 ppm) of the frequency domain spectrum, the median baseline and median fit residual across algorithms was evaluated to yield a “consensus” background signal for each dataset. This signal represents features of the in-vivo data which were consistently not covered by the prior knowledge metabolite models of the algorithms investigated: broad baseline fluctuations, co-edited macromolecule or metabolite contributions not captured by the model, artefactual components, and random noise. By considering background derived by multiple different fitting models, we reduce the risk that the background signal thereby derived, in combination with simulated basis spectra, is perfectly conformant to any one of the algorithms assessed45. The across-algorithms median frequency domain baseline model signal will hereafter be referred to as the “consensus baseline”, describing broad undulations in the in-vivo spectrum, while the across-algorithms median fit residual will be referred to as the “consensus residual”, describing signals in the fit range not captured by existing metabolite and baseline models. The combination of these will be referred to as the “consensus background”. These parts are illustrated for a representative subject in Figure 1 (A).
Figure 1.
(A) Derivation of consensus background, residual and baseline signal: median across algorithms from a single representative in-vivo subject, subsequently combined with simulated metabolite components (B) to yield in-vivo-like simulated datasets.
Background signals from thirty subjects were selected for further analysis: ten typical, representative backgrounds from each of the three hardware manufacturers, randomly sampled from the upper 70-percentile correlation of the subjects’ consensus background signal with the group mean for the corresponding manufacturer. For each of these thirty background signals, simulated metabolite signal components were added from the corresponding manufacturer-specific basis set. Signal components were combined incrementally in the following schemes, illustrated in Figure 1 (B):
G: GABA only (no background), in simulated concentrations of [2.5, 3.2, 3.9] mM.
GM: GABA as above, with a standard set of simulated Metabolites: 12.5 mM NAA, 2.5 mM NAAG, 6 mM Cr, 4 mM phosphocreatine (PCr), 1.2 mM glycerophosphorylcholine (GPC), 1.8 mM phosphorylcholine (PCh), 7.5 mM myo-inositol (mI), 5 mM lactate (Lac), 9 mM Glu, 3.5 mM Gln, 1 mM GSH.
G+M : GABA+, simulated as GABA plus a fixed 1.5 mM MM3co, underlying 3.0 ppm macromolecule signal, in total concentrations of [2.5, 3.2, 3.9] mM, with a standard set of Metabolites as above.
GMB: GABA with a standard set of Metabolites (per GM case), with consensus Baseline signal included.
GMR: GABA with a standard set of Metabolites (per GM case), with consensus Residual signal included.
GMBR: GABA with a standard set of Metabolites (per GM case), with both consensus Baseline and consensus Residual signals included.
G+MBR: A complete in-vivo-like signal: GABA+ (GABA with a fixed 1.5 mM MM3co), in total concentrations of [2.5, 3.2, 3.9] mM, with standard set of Metabolites as above; consensus Baseline and consensus Residual signals included.
For each of these cases, metabolite linewidth was varied from 2 – 12 Hz in steps of 0.5 Hz (Lorentzian linebroadening); background and reference signals were not affected. Hence: 21 linewidths for each of three GABA(+) concentrations were quantified using seven different combinations of metabolite and background: a total of 441 simulated spectra for each of the thirty selected subject backgrounds.
2.4. Processing and Quantification
Processed and filtered spectra (both in-vivo and simulated) were passed into each algorithm in an appropriate format. In general, the algorithms were invoked using the developer-supplied default or recommended configuration parameters for GABA-edited MEGA-PRESS data, as detailed in a prior study23 and summarised in Supplementary Section H, with basis sets as detailed in section 2.2. Some adjustments to baseline modelling were made for Osprey and LCModel (0.6 ppm spline knot spacing adopted for each), and for spant (see Supplementary Section E, Supplementary Table 7 and Supplementary Figure 10). Built-in processing steps for the various implementations are bypassed; this is particularly significant for Gannet, which would ordinarily perform linebroadening and zero-fill before modelling. Batch processing was automated in Matlab (v2021a, MathWorks Inc., Natick, MA, USA), except for spant which was scripted externally in R 46 (v3.5.2). This processing and modelling workflow is summarised in Figure 2.
Figure 2:
Data (A), processing (B) and modelling (C) workflow, summarising applied filtering and configuration of the various algorithms assessed. Figure derived from Craven et al, 202223.
The raw ratio of metabolite to water signal intensities was scaled with tissue correction47 using previously derived tissue fractions31; full details on this re-scaling have been documented previously23. Tissue corrected molar concentrations scaled to the unfiltered water reference are denoted “/H2O”. While the bulk of our analysis focuses on GABA+ from the edited difference spectra, filtered edit-OFF sub-spectra are also modelled for the in-vivo datasets, to allow the efficacy of referencing to total creatine from the edit-OFF sub-spectrum (tCroff) to be discussed; associated modelling parameters are supplied in the Supplementary Material, section H.2.
2.5. Numerical and Statistical Analysis of Modelling Outcomes
Outcomes of the batch modelling were collated and analysed using locally developed scripts written in Python (v3.9.2), leveraging the pandas 48 (v1.5.2) data analysis framework, with numeric methods from NumPy 49 (v1.23.5) and statistical methods from the SciPy 50 (v1.9.3), pingouin 51 (v0.5.2) and statsmodels 52 (v0.13.5) libraries. Visualisation was performed using matplotlib 53 (v3.3.4) and seaborn 54 (v0.12.1).
Where appropriate, normality was assessed with the Shapiro-Wilk method 55, and comparability of variance using Fligner-Killeen’s test 56; t-tests were performed using Welch’s method57. In sub-analyses where Holm-Bonferroni 58,59 correction was applied, adjusted p-values are denoted pholm, with a corrected significance threshold defined as pholm<0.05; uncorrected p-values are denoted punc..
The dataset described herein has previously been assessed with respect to demographics, underlying signal quality and with respect to broad differences with certain combinations of vendor and algorithm 23,31. To harmonise estimates across sites (and hence, across vendors), the ComBat approach60 was used; individual harmonisation factors were determined from the unfiltered spectra, then applied to the full set of filtered spectra (prior to rejection of outlier estimates, R3). For subsequent analyses, the impact of inter-individual variation was mitigated by expressing in-vivo estimates for each subject and algorithm as percentage differences relative to the estimate from the corresponding unfiltered spectra. For each filtered spectrum, achieved SNR and linewidth were measured using the negative NAAdiff peak (as described in section 2.1.2), expressed both as absolute values and as changes relative to the unfiltered spectrum.
Variance Partition Coefficients (VPCs) were derived for subject, SNR and linewidth factors on filtered spectra at various applied SNR and LB levels, using an unconditional linear mixed-effects model implemented in R46 with the lme461 package. Within each group (site), a series of paired t-tests was performed, evaluating estimates at each combination of SNR and LB against those obtained from the unfiltered data (uncorrected, significance threshold p=0.05); this allowed an indicative rate of spurious findings (type I errors) associated with a particular change in linewidth to be identified.
One limitation of applied line-broadening (essentially a smoothing function) is that this will also alter the SNR of the data; in the present study, numerical methods were used to separate these factors. For each algorithm, a two-dimensional mesh representation of obtained GABA+ values as a function of both the nominal, applied LB and the achieved (measured) SNR was generated. Estimates from each algorithm were distributed evenly into 14 bins according to LB, then within each of these into seven even bins according to relative SNR. Mesh coordinates were defined as the median of SNR and linewidth parameters within each bin. Uniform mesh refinement with cubic interpolation allowed the expected GABA+ estimate at arbitrary combinations of SNR and LW within the experimental range to be approximated – hence, the impact of linewidth to be assessed independently of SNR. The same mesh was used to generate contour maps describing relative change in concentration estimates (presented in Figure 4 and Figure 5), for visual inspection to guide subsequent analysis steps.
Figure 4.
GABA+ estimate (relative to unfiltered case) as a function of Lorentzian linewidth and SNR, for each of the algorithms assessed.
Figure 5.
GABA+ estimate (relative to unfiltered case) as a function of Gaussian linewidth and SNR, for each of the algorithms assessed.
The effect of small changes in linewidth on GABA+ estimates, as a function of starting linewidth, was quantified with a piecewise linear description. This was derived by taking the linear least-squares regression of GABA+ on linewidth, as a function of linewidth (, with LW measured after filtering) over a five-point sliding window across the range of linewidths for each individual subject, for each algorithm assessed. The median of these factors is calculated across a limited range of linewidths: the first 2.12 Hz of broadening for in-vivo data, and nominal linewidth of 5.26 ± 1.06 Hz for the simulated data (guided by the linewidths observed in unfiltered in-vivo data). This factor, which quantifies the approximated linear relation between measured concentration estimate and linewidth (valid over a limited range of linewidths), will hereafter by referred to as the linear linewidth factor, LLWF. This factor forms the basis for statistical comparisons across modelling algorithms and vendors (see section 3.1.2). While the overarching focus of the present analysis remains on GABA+, similar factors were assessed for Glx and tNAA (both from the edited difference spectrum) and tCroff (from the edit-OFF sub-spectrum).
2.5.1. Synthetic Data
For each of the incremental synthetic models, for each algorithm and at each simulated linewidth, Standard Deviation (SD), Mean Signed Difference (MSD) and Mean Absolute Error (MAE) of GABA+ estimates were evaluated, relative to the nominal (simulated) concentration. The Pearson correlation between nominal and estimated concentration was also assessed.
3. Results
3.1. In-vivo data
Four datasets were rejected according to criterion R1; no further rejections according to criterion R2. The global mean estimate for GABA+/H2O across all algorithms and subjects was found to be 3.2 ± 0.8 i.u.. Rejection rate according to R3 is summarised in Supplementary Figures 2A and 3A. Linewidth of the incoming data (after pre-processing, without filtering) was measured as [4.25, 5.33, 7.61] Hz for the [5, 50, 95]-percentiles; per-site SNR and linewidth are presented in Supplementary Table 3.
3.1.1. Fit Outcomes
Mean filtered data and fits by each algorithm for low and high linewidth and SNR cases are presented in Figure 3 and Supplementary Figure 1 (the latter covering the full fit range). Particularly of note here is that the right side of the GABA peak is clearly elevated in broader linewidth cases, as it increasingly merges with broadened Glx signals in the 2.3–2.6 ppm range; for several algorithms, the baseline model is seen to distort upwards in this area. Baseline estimates after Gaussian LB are less variable than after Lorentzian LB case for several algorithms. Also of interest is behaviour around a characteristic hump at 3.2 ppm (not directly attributable to GABA+); most algorithms manage to reject this in the lower linewidth cases (even at lower SNR), but appear less successful at doing so with broader linewidths – potentially dragging the GABA model slightly to the left in the process; LCModel in particular appears to have enveloped essentially all the 3.2 ppm signal. The observed baseline behaviour prompted further exploratory analysis on LCModel baseline knot spacing, described in Supplementary Section G, Supplementary Figure 14 and Supplementary Table 9.
Figure 3.
Mean spectra, GABA+ fits and modelled baseline for each algorithm, illustrating the unfiltered original data alongside filtered variants having lower SNR (1/32 subset), higher linewidth (10 Hz of Lorentzian and Gaussian broadening, separately), and a combination of SNR and LB factors. Vertical scaling is normalised. Outcomes over the full fit range are presented in Supplementary Figure 1.
VPCs for subject, LB and SNR factors are presented in Supplementary Table 4; applied Lorentzian LB accounted for 12.8 ± 10.8 % of total variance in GABA+ estimates, depending on algorithm. Applied Gaussian LB accounted for substantially less variance, 2.8 ± 1.6 %, as did the applied SNR factor: 1.0 ± 1.0 %.
Paired t-tests on the obtained GABA+/H2O estimates for each site, comparing each SNR/LB step against the same data in the unfiltered case, allowed assessing the likelihood of spurious (false positive) findings given certain SNR/LW differences. These outcomes are presented in Supplementary Figure 2C and Supplementary Figure 3C. Several algorithms (LCModel, spant, Tarquin) yielded statistically significant differences (punc < 0.05) between filtered and unfiltered cases in over half the sites assessed after less than 0.5 Hz of LBlorentz; other algorithms did so at a lower rate, with FSL-MRS and Osprey appearing somewhat more robust (but still far from ideal) in this regard, reporting < 20% statistically significant differences within the first 1 Hz of linewidth difference.
3.1.2. LLWF: GABA+ estimate as a function of Linewidth
GABA+ estimate (relative to the unfiltered case) as a function of relative SNR and linewidth is presented in Figure 4 and Figure 5 for Lorentzian and Gaussian LB respectively. All of the algorithms except FSL-MRS show a clear trend towards lower GABA+ estimates with increasing Lorentzian linewidth; visual inspection suggests GABA+ estimates decrease by roughly 10% with 3 Hz of linebroadening for all algorithms except FSL-MRS. The SNR factor does not present a clear trend, although several algorithms (FSL-MRS, Gannet, LCModel, spant) yield slightly higher estimates for the lowest SNR cases. Relative estimates for co-edited Glx are presented for interest in Supplementary Figures 4 and 5, but are not analysed further.
Conversely, Gaussian linebroadening shows a more moderate effect on GABA+ estimates, often opposing the Lorentzian effects: there were slight positive associations for moderate LBgauss factors (<1% per Hz LB, for LBgauss<4 Hz), becoming more substantial for larger broadening factors). SNR effects are also more apparent in the higher LBgauss cases.
The linewidth relation is explored further in Figure 6. Estimates modelled at a fixed SNR (i.e., with LB and SNR factors separated as described in section 2.5) showed the same trends across all algorithms. These estimates are also presented in Figure 6A, having been sampled across the fixed SNR values (relative SNR 100%, indicated by a dashed vertical line in Figure 4 and Figure 5). Changes in GABA+ estimates for spant (up to 6 Hz) and Tarquin (2–6 Hz) were slightly moderated when separated from SNR effects; conversely, LCModel (over 5 Hz) and Gannet (Gaussian LB) showed greater changes when SNR effects were removed.
Figure 6.
Showing the relative obtained GABA+ estimate as a function of applied Lorenzian (A) and Gaussian (C) linebroadening, and (B) the piecewise linear rate-of-change in GABA+ estimate for the Lorenzian case. Dashed vertical line indicates approximate range of the LLWF metric evaluation.
The LLWF metric (defined in section 2.5) is illustrated in Figure 6B, as the per-subject median over the first 2.12 Hz, and indicates the degree to which estimates from a given algorithm are likely to be affected by a moderate change in linewidth. A comparison of this metric across different algorithms (grouped by vendor) is presented in Supplementary Figure 6, with numerical and statistical outcomes (unpaired t-tests) examining algorithm-specific effects in Supplementary Table 5, and vendor-specific effects (within algorithm) in Supplementary Table 6.
While most algorithms presented a similar general trend (lower GABA+ estimates as linewidth increased), the magnitude varied significantly (pholm < 0.01) between all pairs of algorithms, except between LCModel, Gannet and Osprey and between spant and Tarquin, which happened to behave similarly in the defined range. Although the present analysis and discussion is focussed on GABA+, the interested reader may find similar plots for Glxdiff, tNAAdiff, and tCroff in Supplementary Figure 7.
3.2. Synthetic Data
Quantification outcomes from the synthetic data are presented in Figure 7 for the GABA-only case and for the complete model (G and G+MBR respectively), with outcomes from intermediate model steps presented in Supplementary Figure 8, and fits in Supplementary Figure 9. Remarkably, the basic trend towards lower estimates at higher Lorentzian linewidth presents already in the trivial GABA-only case for most algorithms, although LCModel in particular maintained a flat response up to 7.5 Hz LW.
Figure 7.
Quantification outcomes for each algorithm, modelling synthetic data of varying complexity: GABA only (G), GABA and additional metabolite components (GM), GABA+ with additional metabolite components and background signal (G+MBR)
FSL-MRS exhibited some variability in behaviour at lower linewidths in the intermediate model steps. This was largely (but not entirely) resolved in the full (G+MBR) model, and is explored further in Supplementary Section F (Supplementary Table 8 and Supplementary Figures 11, 12 and 13). Gannet also exhibited greater variability at lower (<5 Hz) linewidths, particularly for the lower simulated concentration – most likely an interaction between the GABA+ and baseline models. Once again, this variability was not observed with the full synthetic model. We note that Gannet by default applies 3 Hz linebroadening on top of the data’s inherent linewidth, so this is unlikely to affect in-vivo data in general usage.
Some algorithms, most notably Osprey, show distinct inflection points after stepping from the GABA-only synthetic data (G) to the GM model. The response of the Osprey model turns after 8 Hz, which reflects the behaviour seen in-vivo (see Figure 6). Tarquin exhibited a steep slope between 5–6 Hz, but was virtually flat between 7 and 11 Hz. These features became apparent only after stepping from the trivial GABA-only synthetic data (G) to data with other metabolites included (GM). LCModel presented a slight deviation at 6 Hz linewidth, most visible in the lower nominal GABA case; this is also reflected in a markedly reduced R2 at that point.
For most algorithms, error metrics %MAE and %MSD degrade progressively with linewidth, albeit to differing degrees. However, the %SD of estimates were not seen to degrade appreciably over this range, and most algorithms maintained their sensitivity to the nominal differences (assessed by R2) for a broad range of linewidths. Apparent variability in estimates constrained the sensitivity of FSL-MRS at lower linewidths.
4. Discussion and Conclusions
4.1. Limitations
4.1.1. Scope and generalisability
While many of the key findings in this study present across several of the algorithms examined, they do not necessarily generalise further. A number of other algorithms and implementations are available to the MRS community62–71, each potentially differing in modelling strategy, baseline handling and constraint models and therefore likely to present unique characteristics. Most implementations are also highly configurable, with even subtle differences in modelling parameters or prior knowledge having potentially substantial impact on the behaviour, as observed in previous work23,44 and also here in the supplementary analyses (supplementary sections E, F, G). Since the configurations tested herein generally remain close to the developer-supplied default or recommended configurations, our findings are more representative of “typical use”, rather than the best achievable results with extensive local optimisation.
4.1.2. In-vivo limitations
The in-vivo analysis is constrained by the absence of a ground truth; it is not possible to assess the absolute accuracy of measurements, only outcomes relative to the higher-quality unfiltered spectra. Taking random subsets of transients for the reduced SNR cases may over-estimate variance relative to contiguous acquisition of the same number of transients, since the longer scan may be subject to greater confounds associated with subject motion, thermal drift72, and perhaps real physiological variations.
The linebroadening operations applied to in-vivo data entail broadening (smoothing) across the entire frequency range, which also affects SNR. It is not straightforward to disentangle these factors. A common approach is to add noise back into the broadened spectra – however, while simple additive noise (Gaussian white noise for example) may yield similar SNR estimates, the characteristics of such noise are not necessarily representative of a true spectrum acquired with lower SNR. In the present analysis we instead separate these factors by numeric modelling, interpolating across a 2D grid of SNR and LB factors to estimate variation associated with each factor independently (see section 2.5). Nonetheless, the independence of noise samples contributing to the fixed-SNR estimates may be reduced on account of the linebroadening.
4.1.3. Synthetic Data limitations
The lack of ground truth in the in-vivo analysis is addressed by additional analyses using carefully-prepared synthetic data73. However, any study involving simulated data is limited by the knowledge on which the simulation was based, and on simplifications and assumptions inherent in the simulation model – often the same knowledge which guides design of the modelling algorithms. The present study is no exception: the simulated metabolite signals come from the same basis set with which the data was subsequently fit. While we attempt to mitigate any gaps in our prior knowledge basis sets through incorporation of residual signals derived from in-vivo data fits, the simulation is nonetheless limited in other aspects, including potential asymmetric/non-Lorentzian/non-Gaussian lineshape changes, shifts in frequency or phase and other factors which may be expected to arise in in-vivo data.
4.2. Macromolecule Fitting, Baseline and Artefact Rejection around 3.2 ppm
Previous studies have identified a number of factors potentially affecting the accuracy of quantification of GABA-edited MRS data and worthy of further attention. Key amongst these is the handling of co-edited macromolecule and metabolite signal underlying the 3.0 ppm GABA peak 44,74–78. Although broadly targeted in the simulation study, the reliability of estimates from simulated data was not substantially degraded by inclusion of this component in the simulation model, suggesting that the fitting models as configured perform reasonably well at modelling the aggregate signal (GABA+). Similarly, other baseline distortions in the data were not found to drive the linewidth-related bias. This is compatible with recent findings on simulated PRESS data, where most of the packages considered performed equally well on datasets with and without macromolecules17. We emphasize that the current study looks only at GABA+, and studies attempting to quantify GABA alone are likely to be affected differently.
Another commonly observed artefactual feature in GABA-edited MEGA-PRESS spectra is a small hump around 3.2 ppm, which may arise due to incomplete subtraction of choline 79, or contributions from other co-edited signals (perhaps including valine-containing macromolecules 80 and arginine 81). Although not explicitly modelled in this study, this component is subtly present in the consensus background signals applied in the simulation study. Once again, it does not appear to drive the linewidth-related effects observed here.
Although these factors were not found to be significant specifically in relation to the linewidth-related bias, they remain items of interest in further refinement of fitting algorithms.
4.3. Factors contributing to the observed changes
The simulation analysis described in section 2.3 was designed to identify specific features giving rise to the observed effects in the in-vivo data. Surprisingly, several algorithms presented a linewidth-related effect even for the most trivial case of a single metabolite signal on a flat background. This suggests that some of the observed effects relate to the basic lineshape model, or are intrinsic to the optimisation techniques applied and not solely dependent on any particular confounding signal. This may indicate that modelling optimization is not sufficiently responsive to changing linewidth, instead preferring to ascribe some part of the signal to other model components (perhaps including the baseline model). It is possible that further tuning of cost factors associated with linebroadening may mitigate this effect.
The influence of lineshape model should not be underestimated; while basis set peaks and applied linebroadening herein exhibit a basic Lorentzian lineshape, measured in-vivo metabolite peaks will present a more complex profile – closer to a Voigt profile45. Discrepancies between the real and modelled lineshape have been shown to introduce systematic errors in quantification3,4. This may be a contributing factor particularly for algorithms having only a basic lineshape model (e.g., Gannet which by default uses a Gaussian fit). Even where a more nuanced lineshape model is used, this is often dominated by a small number of major peaks (such as Cho and Cr for short TE PRESS, or NAA for GABA-edited data. It is not certain that an identical model will be applicable to all peaks – particularly when transferring from strong singlets to complex, edited multiplets. Hence, a mismatch in lineshape between the major peaks dominating the lineshape model, and the peaks to which that model is applied, may account for some of the variation seen herein. This may be mitigated by allowing refined per-peak lineshape modelling – although such modelling may be less robust to noise in the case of smaller peaks. Indeed, the exploratory analysis for FSL-MRS (supplementary section F) showed, for that algorithm, having the same lineshape and shift parameters across all metabolite components (including the strong negative NAA peak) appeared to mitigate linewidth-related changes in GABA+ estimate. In practice, a well-tuned, firm but not rigid constraint may prove most suitable.
Another major factor differentiating the algorithms is baseline parameterization. LCModel, Osprey and spant all adopt a spline baseline (with adaptive spacing in the case of spant); in each of these cases, the baseline in higher linewidth cases is seen to be pulled substantially upwards (see Figure 3), through the apparent GABA peak – likely contributing to the underestimation in those cases. Simpler baseline models, such as the polynomial baseline of FSL-MRS, or Gannet’s sinusoidal baseline model, appear more robust in this regard – although the Gannet implementation becomes problematic in low SNR cases, leading to higher rejection rates.
4.4. Accounting for lineshape
For several of the algorithms tested, even a small difference in Lorentzian linewidth (0.5 Hz) between otherwise identical datasets could yield a statistically significant difference in obtained concentrations; this is concerning when considering comparisons across different regions, or between subject groups where spectral quality may be expected to differ, even slightly. Somewhat counterintuitively, the risk of false positive outcomes actually increases with “better” datasets: higher SNR data is likely to give reduced overall variance, and larger subject groups increase the statistical power to detect these subtle differences. With different metabolites exhibiting differing degrees of variation according to linewidth (see Supplementary Figure 7), scaling to an internal concentration reference (such as tCroff) subject to similar linewidth changes is not likely to be effective at mitigating these effects.
Since Lorentzian and Gaussian linebroadening factors exhibit contrasting behaviours, basic linewidth matching with a pure Lorentzian function (as frequently seen in the fMRS literature) may not be effective if this does not reflect the shape of the effects to be corrected, and may in fact exacerbate quantitative differences. If linewidth matching is to be performed, it is essential that an appropriate broadening function be chosen. If the nature of prospective differences is unknown, lineshape matching (for example, by a deconvolution approach) may be more appropriate. We note however that matching approaches have limitations and complexities, some of which may be addressed by matching individual basis set components82 rather than the complete metabolite spectrum.
A more generalisable approach would be to routinely consider estimated linewidth as a covariate within statistical analysis of metabolite estimates, in any situation where signal quality may vary between groups. However, this may give rise to an unacceptable loss of statistical power, especially in cases where the linewidth correlates strongly with other (real) measurable effects.
4.5. Implications for Quality Control
Although the overall accuracy of estimates (in terms of %MAE, %MSD) deteriorated with increasing linewidth for most algorithms, the sensitivity to changes in concentrations (as reflected by %SD and moreover R2) was largely maintained across the range of linewidths assessed in the synthetic datasets; therefore, within the investigated range, accounting for lineshape as described in section 4.4 may be more advantageous than strict rejection criteria based on linewidth.
Similarly, the in-vivo study showed no strong group-level effects associated with SNR, except in high Gaussian LB cases, suggesting that estimates from lower-SNR data may remain usable in certain contexts18. We therefore re-iterate the need for quality control procedures which look beyond these basic signal metrics: close attention to the shape of the data, fit and residuals, and to the distribution of estimates themselves, may be more meaningful (and perhaps less prone to bias) than strict thresholding based on these metrics alone.
4.6. Key Recommendations
Based on our findings, we make the following recommendations:
In any contexts where acquired spectral linewidth may vary between experimental groups, this must be rigorously controlled.
Linewidth matching may be problematic if the broadening function applied does not accurately reflect the broadening mechanisms in the data; a more refined lineshape matching approach may be preferable.
Inclusion of linewidth as a covariate in statistical modelling may be more generalisable, and avoids pitfalls associated with matching.
Controlling for differences in linewidth should be prioritised over simply rejecting datasets with linewidth over a certain threshold.
Additionally, we propose two areas of interest for potential refinement of these modelling algorithms:
Broadly, further development of modelling algorithms should pay close attention to cost factors associated with linewidth, and how the model optimization may behave with respect to changes in linewidth of the source data.
More specifically, revisiting the manner in which the lineshape model is derived, and whether this is appropriate to all the peaks being modelled. Lineshape variation is likely to differ between metabolite and macromolecule components.
While the effects described herein present across multiple tested configurations, the exact extent can vary significantly – we therefore re-iterate the need for consistency in modelling and reporting, and the value of converging on consensus best practices in this regard21.
4.7. Conclusions
We show that several contemporary fitting algorithms exhibit strong associations between linewidth and concentration estimate in response to LBlorentz (T2 differences), and more moderate, contrasting associations in relation to LBgauss (T2*/inhomogeneity effects).
The finding of reduced concentration estimates as a function of linewidth is consistent with much earlier findings15,16 for short TE STEAM and simulated datasets, where similar trends may be observed for several metabolites in certain conditions, using Hankel-Lanczos singular value decomposition (HLSVD)83,84, LCModel and AMARES algorithms for modelling. This is also consistent with more recent findings17 for major metabolites in simulated PRESS data, for several algorithms. That these trends persist after more than twenty years of algorithmic evolution leads to two conclusions: first, such patterns deserve further investigation and concerted effort to mitigate within the design of the modelling algorithms. At the same time, such trends must not be overlooked by users of the algorithms in analysing and interpreting their results.
Our findings highlight the ongoing need for rigorous consideration of potential linewidth differences between samples or conditions – and furthermore highlight the importance of adopting appropriate strategies to account for this.
Supplementary Material
Highlights:
In-vivo GABA-edited 1H-MRS data from 222 subjects were filtered to simulate varying linewidth and SNR conditions
Filtered datasets were quantified with six different modelling algorithms to assess the impact of linewidth and SNR on the metabolite level estimates.
Synthetic spectra with controlled GABA+ levels and in-vivo-like background signals (applied incrementally) were also assessed.
For both in-vivo and synthetic datasets, GABA+ estimates showed a significant association with Lorentzian linewidth across most algorithms, even for small changes in linewidth.
Weaker, often opposing associations were observed for Gaussian linebroadening.
This indicates a potentially significant confound for studies where linewidth or lineshape may be expected to differ, even slightly, between groups.
The need for appropriate strategies to account for lineshape differences is highlighted.
Acknowledgements
Analysis was performed within a project funded by ERC grant #249516, which additionally supported the contributions of ARC, LE, KH. Data used in this analysis were previously collected through an international collaborative study funded under NIH grant R01 EB016089. TKB acknowledges support from a One Child Every Child Postdoctoral Fellowship and a Cumming School of Medicine Postdoctoral Fellowship. ADH holds a Canada Research Chair in Magnetic Resonance Spectroscopy in Brain Injury and acknowledges funding support from the Natural Sciences and Engineering Research Council (NSERC) of Canada. GO acknowledges funding support from R00 AG062230, R21 EB033516, and P41 EB031771.
List of Abbreviations
- BOLD
blood oxygen level dependent
- Cho
Choline
- Cr
Creatine
- diff
difference (edited) spectrum
- FD
frequency domain
- FID
Free Induction Decay (time-domain MRS signal)
- FWHM
linewidth: full width at half maximum
- G(+)[MBR]
Synthetic spectral models, see section 2.3
- GABA
γ-aminobutyric acid
- GABA+
total edited signal at 3 ppm; GABA with underlying coedited signals
- Gln
Glutamine
- Glu
Glutamate
- GPC
glycerophosphorylcholine
- GSH
glutathione
- HLSVD
Hankel-Lanczos singular value decomposition
- i.u.
institutional units
- Lac
Lactate
- LBgauss
Gaussian linebroadening factor
- LBlorentz
Lorentzian linebroadening factor
- LCM
linear combination modelling
- LLWF
Linear line-width factor, see section 2.5
- MAD
Median Absolute Deviation
- MAE
Mean Absolute Error
- MEGA-PRESS
Mescher–Garwood point-resolved spectroscopy
- mI
myo-inositol
- MRS
Magnetic Resonance Spectroscopy
- MSD
Mean Signed Difference
- NAA
N-acetyl aspartate
- NAAdiff
NAA measured from the difference spectrum
- NAAG
n-acetylaspartylglutamate
- PCh
phosphorylcholine
- PCr
phosphocreatine
- pholm
Holm-Bonferroni adjusted p-value
- punc
Uncorrected p-value
- R1–3
adopted rejection criteria; see methods section 2.1.2
- SD
Standard Deviation
- SNR
Signal-to-Noise Ratio
- T2*
Effective transverse relaxation rate
- tCroff
total creatine measured from the edit-OFF sub-spectrum
- TD
time domain
- VPC
Variance Partition Coefficients
Footnotes
Declaration of Interest
The authors declare no conflicting interests.
Data Availability Statement
Scripts used for the present analysis are publicly available here; further dependencies are described within: https://git.app.uib.no/bergen-fmri/gaba-linewidth
Spectra analysed in this manuscript were obtained from the publicly available Big GABA repository on NITRC, https://www.nitrc.org/projects/biggaba
Basis sets used in the primary analysis were obtained from the publicly available Osprey package, https://schorschinho.github.io/osprey
References
- 1.Mescher M, Merkle H, Kirsch J, Garwood M, Gruetter R. Simultaneous in vivo spectral editing and water suppression. NMR Biomed. 1998;11(6):266–272. doi: [DOI] [PubMed] [Google Scholar]
- 2.Rothman DL, Petroff OA, Behar KL, Mattson RH. Localized 1H NMR measurements of gamma-aminobutyric acid in human brain in vivo. Proc Natl Acad Sci. 1993;90(12):5662–5666. doi: 10.1073/pnas.90.12.5662 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Marshall I, Higinbotham J, Bruce S, Freise A. Use of voigt lineshape for quantification ofin vivo1H spectra. Magn Reson Med. 1997;37(5):651–657. doi: 10.1002/mrm.1910370504 [DOI] [PubMed] [Google Scholar]
- 4.Marshall I, Bruce SD, Higinbotham J, et al. Choice of spectroscopic lineshape model affects metabolite peak areas and area ratios. Magn Reson Med. 2000;44(4):646–649. doi: [DOI] [PubMed] [Google Scholar]
- 5.Brix MK, Ersland L, Hugdahl K, et al. Within- and between-session reproducibility of GABA measurements with MR spectroscopy: Reproducibility of MRS GABA Measurements. J Magn Reson Imaging. 2017;46(2):421–430. doi: 10.1002/jmri.25588 [DOI] [PubMed] [Google Scholar]
- 6.Duda JM, Moser AD, Zuo CS, et al. Repeatability and reliability of GABA measurements with magnetic resonance spectroscopy in healthy young adults. Magn Reson Med. 2021;85(5):2359–2369. doi: 10.1002/mrm.28587 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.O’Gorman RL, Michels L, Edden RA, Murdoch JB, Martin E. In vivo detection of GABA and glutamate with MEGA-PRESS: Reproducibility and gender effects. J Magn Reson Imaging. 2011;33(5):1262–1267. doi: 10.1002/jmri.22520 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Baeshen A, Wyss PO, Henning A, et al. Test–Retest Reliability of the Brain Metabolites GABA and Glx With JPRESS, PRESS, and MEGA-PRESS MRS Sequences in vivo at 3T. J Magn Reson Imaging. 2020;51(4):1181–1191. doi: 10.1002/jmri.26921 [DOI] [PubMed] [Google Scholar]
- 9.Mullins PG, Rowland L, Bustillo J, Bedrick EJ, Lauriello J, Brooks WM. Reproducibility of 1H-MRS measurements in schizophrenic patients. Magn Reson Med. 2003;50(4):704–707. doi: 10.1002/mrm.10598 [DOI] [PubMed] [Google Scholar]
- 10.Wijtenburg S, Knight-Scott J. The impact of SNR on the reliability of LCModel and QUEST quantitation in 1H-MRS. In: Proc Int Soc Magn Reson Med. Vol 17.; 2009:2401–2401. [Google Scholar]
- 11.Zhang Y, Shen J. Effects of noise and linewidth on in vivo analysis of glutamate at 3 T. J Magn Reson. 2020;314:106732. doi: 10.1016/j.jmr.2020.106732 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Bartha R. Effect of signal-to-noise ratio and spectral linewidth on metabolite quantification at 4 T. NMR Biomed. 2007;20(5):512–521. doi: 10.1002/nbm.1122 [DOI] [PubMed] [Google Scholar]
- 13.Hong D, van Asten JJA, Rankouhi SR, Thielen JW, Norris DG. Effect of linewidth on estimation of metabolic concentration when using water lineshape spectral model fitting for single voxel proton spectroscopy at 7 T. J Magn Reson. 2019;304:53–61. doi: 10.1016/j.jmr.2019.05.002 [DOI] [PubMed] [Google Scholar]
- 14.Otazo R, Mueller B, Ugurbil K, Wald L, Posse S. Signal-to-noise ratio and spectral linewidth improvements between 1.5 and 7 Tesla in proton echo-planar spectroscopic imaging. Magn Reson Med. 2006;56(6):1200–1210. doi: 10.1002/mrm.21067 [DOI] [PubMed] [Google Scholar]
- 15.Kanowski M, Kaufmann J, Braun J, Bernarding J, Tempelmann C. Quantitation of simulated short echo time1H human brain spectra by LCModel and AMARES. Magn Reson Med. 2004;51(5):904–912. doi: 10.1002/mrm.20063 [DOI] [PubMed] [Google Scholar]
- 16.Bartha R, Drost DJ, Williamson PC. Factors affecting the quantification of short echoin-vivo1H MR spectra: prior knowledge, peak elimination, and filtering. NMR Biomed. 1999;12(4):205–216. doi: [DOI] [PubMed] [Google Scholar]
- 17.Marjańska M, Deelchand DK, Kreis R, et al. Results and interpretation of a fitting challenge for MR spectroscopy set up by the MRS study group of ISMRM. Magn Reson Med. 2022;87(1):11–32. doi: 10.1002/mrm.28942 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Zöllner HJ, Oeltzschner G, Schnitzler A, Wittsack HJ. In silico GABA+ MEGA-PRESS: Effects of signal-to-noise ratio and linewidth on modeling the 3 ppm GABA+ resonance. NMR Biomed. Published online September 28, 2020:e4410. doi: 10.1002/nbm.4410 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Mangia S, Tkáč I, Gruetter R, et al. Sensitivity of single-voxel 1H-MRS in investigating the metabolism of the activated human visual cortex at 7 T. Magn Reson Imaging. 2006;24(4):343–348. doi: 10.1016/j.mri.2005.12.023 [DOI] [PubMed] [Google Scholar]
- 20.Zhu XH, Chen W. Observed BOLD effects on cerebral metabolite resonances in human visual cortex during visual stimulation: A functional1H MRS study at 4 T. Magn Reson Med. 2001;46(5):841–847. doi: 10.1002/mrm.1267 [DOI] [PubMed] [Google Scholar]
- 21.Bhogal AA, Schür RR, Houtepen LC, et al. 1 H-MRS processing parameters affect metabolite quantification: The urgent need for uniform and transparent standardization. NMR Biomed. 2017;30(11):e3804. doi: 10.1002/nbm.3804 [DOI] [PubMed] [Google Scholar]
- 22.Zöllner HJ, Považan M, Hui SCN, Tapper S, Edden RAE, Oeltzschner G. Comparison of different linear-combination modeling algorithms for short-TE proton spectra. NMR Biomed. 2021;34(4). doi: 10.1002/nbm.4482 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Craven AR, Bhattacharyya PK, Clarke WT, et al. Comparison of seven modelling algorithms for γ-aminobutyric acid–edited proton magnetic resonance spectroscopy. NMR Biomed. Published online February 23, 2022. doi: 10.1002/nbm.4702 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Clarke WT, Stagg CJ, Jbabdi S. FSL-MRS: An end-to-end spectroscopy analysis package. Magn Reson Med. 2021;85(6):2950–2964. doi: 10.1002/mrm.28630 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Edden RAE, Puts NAJ, Harris AD, Barker PB, Evans CJ. Gannet: A batch-processing tool for the quantitative analysis of gamma-aminobutyric acid-edited MR spectroscopy spectra: Gannet: GABA Analysis Toolkit. J Magn Reson Imaging. 2014;40(6):1445–1452. doi: 10.1002/jmri.24478 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Provencher SW. Estimation of metabolite concentrations from localizedin vivo proton NMR spectra. Magn Reson Med. 1993;30(6):672–679. doi: 10.1002/mrm.1910300604 [DOI] [PubMed] [Google Scholar]
- 27.Oeltzschner G, Zöllner HJ, Hui SCN, et al. Osprey: Open-source processing, reconstruction & estimation of magnetic resonance spectroscopy data. J Neurosci Methods. 2020;343:108827. doi: 10.1016/j.jneumeth.2020.108827 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Wilson M. Adaptive baseline fitting for MR spectroscopy analysis. Magn Reson Med. 2021;85(1):13–29. doi: 10.1002/mrm.28385 [DOI] [PubMed] [Google Scholar]
- 29.Reynolds G, Wilson M, Peet A, Arvanitis TN. An algorithm for the automated quantitation of metabolites in in vitro NMR signals. Magn Reson Med. 2006;56(6):1211–1219. doi: 10.1002/mrm.21081 [DOI] [PubMed] [Google Scholar]
- 30.Wilson M, Reynolds G, Kauppinen RA, Arvanitis TN, Peet AC. A constrained least-squares approach to the automated quantitation of in vivo 1 H magnetic resonance spectroscopy data: Automated Quantitation of In Vivo 1 H MRS Data. Magn Reson Med. 2011;65(1):1–12. doi: 10.1002/mrm.22579 [DOI] [PubMed] [Google Scholar]
- 31.Mikkelsen M, Rimbault DL, Barker PB, et al. Big GABA II: Water-referenced edited MR spectroscopy at 25 research sites. NeuroImage. 2019;191:537–548. doi: 10.1016/j.neuroimage.2019.02.059 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Mikkelsen M, Barker PB, Bhattacharyya PK, et al. Big GABA: Edited MR spectroscopy at 24 research sites. NeuroImage. 2017;159:32–45. doi: 10.1016/j.neuroimage.2017.07.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Považan M, Mikkelsen M, Berrington A, et al. Comparison of Multivendor Single-Voxel MR Spectroscopy Data Acquired in Healthy Brain at 26 Sites. Radiology. 2020;295(1):171–180. doi: 10.1148/radiol.2020191037 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Lin A, Andronesi O, Bogner W, et al. Minimum Reporting Standards for in vivo Magnetic Resonance Spectroscopy (MRSinMRS): Experts’ consensus recommendations. NMR Biomed. Published online February 9, 2021. doi: 10.1002/nbm.4484 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Simpson R, Devenyi GA, Jezzard P, Hennessy TJ, Near J. Advanced processing and simulation of MRS data using the FID appliance (FID-A)—An open source, MATLAB - based toolkit. Magn Reson Med. 2017;77(1):23–33. doi: 10.1002/mrm.26091 [DOI] [PubMed] [Google Scholar]
- 36.Near J, Edden R, Evans CJ, Paquin R, Harris A, Jezzard P. Frequency and phase drift correction of magnetic resonance spectroscopy data by spectral registration in the time domain: MRS Drift Correction Using Spectral Registration. Magn Reson Med. 2015;73(1):44–50. doi: 10.1002/mrm.25094 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Klose U. In vivo proton spectroscopy in presence of eddy currents. Magn Reson Med. 1990;14(1):26–30. doi: 10.1002/mrm.1910140104 [DOI] [PubMed] [Google Scholar]
- 38.Bottomley PA, Griffiths JR, eds. Handbook of Magnetic Resonance Spectroscopy in Vivo: MRS Theory, Practice and Applications. Wiley; 2016. [Google Scholar]
- 39.Abramowitz M, Stegun IA. Handbook of Mathematical Functions: With Formulas, Graphs and Mathematical Tables [Conference under the Auspices of the National Science Foundation and the Massachussetts Institute of Technology]. Unabridged, unaltered and corr. republ. of the 1964 ed. Dover publ; 1972. https://personal.math.ubc.ca/~cbm/aands/ [Google Scholar]
- 40.Kreis R. Issues of spectral quality in clinical1H-magnetic resonance spectroscopy and a gallery of artifacts. NMR Biomed. 2004;17(6):361–381. doi: 10.1002/nbm.891 [DOI] [PubMed] [Google Scholar]
- 41.Kreis R. The trouble with quality filtering based on relative Cramér-Rao lower bounds: The Trouble with Quality Filtering Based on Relative CRLB. Magn Reson Med. 2016;75(1):15–18. doi: 10.1002/mrm.25568 [DOI] [PubMed] [Google Scholar]
- 42.Zhang Y, An L, Shen J. Fast computation of full density matrix of multispin systems for spatially localized in vivo magnetic resonance spectroscopy. Med Phys. 2017;44(8):4169–4178. doi: 10.1002/mp.12375 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Kaiser LG, Young K, Meyerhoff DJ, Mueller SG, Matson GB. A detailed analysis of localized J-difference GABA editing: theoretical and experimental study at 4 T. NMR Biomed. 2008;21(1):22–32. doi: 10.1002/nbm.1150 [DOI] [PubMed] [Google Scholar]
- 44.Zöllner HJ, Tapper S, Hui SCN, Barker PB, Edden RAE, Oeltzschner G. Comparison of linear combination modeling strategies for edited magnetic resonance spectroscopy at 3 T. NMR Biomed. 2022;35(1). doi: 10.1002/nbm.4618 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Elster C, Link A, Schubert F, Seifert F, Walzel M, Rinneberg H. Quantitative MRS: comparison of time domain and time domain frequency domain methods using a novel test procedure. Magn Reson Imaging. 2000;18(5):597–606. doi: 10.1016/S0730-725X(00)00140-5 [DOI] [PubMed] [Google Scholar]
- 46.R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing; 2018. https://www.R-project.org/ [Google Scholar]
- 47.Gasparovic C, Song T, Devier D, et al. Use of tissue water as a concentration reference for proton spectroscopic imaging. Magn Reson Med. 2006;55(6):1219–1226. doi: 10.1002/mrm.20901 [DOI] [PubMed] [Google Scholar]
- 48.McKinney W. Data Structures for Statistical Computing in Python. In: van der Walt S, Millman J, eds.; 2010:56–61. doi: 10.25080/Majora-92bf1922-00a [DOI] [Google Scholar]
- 49.Harris CR, Millman KJ, van der Walt SJ, et al. Array programming with NumPy. Nature. 2020;585(7825):357–362. doi: 10.1038/s41586-020-2649-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.SciPy 1.0 Contributors, Virtanen P, Gommers R, et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods. 2020;17(3):261–272. doi: 10.1038/s41592-019-0686-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Vallat R. Pingouin: statistics in Python. J Open Source Softw. 2018;3(31):1026. doi: 10.21105/joss.01026 [DOI] [Google Scholar]
- 52.Seabold S, Perktold J. statsmodels: Econometric and statistical modeling with python. In:; 2010. https://www.statsmodels.org/ [Google Scholar]
- 53.Hunter JD. Matplotlib: A 2D Graphics Environment. Comput Sci Eng. 2007;9(3):90–95. doi: 10.1109/MCSE.2007.55 [DOI] [Google Scholar]
- 54.Waskom M. seaborn: statistical data visualization. J Open Source Softw. 2021;6(60):3021. doi: 10.21105/joss.03021 [DOI] [Google Scholar]
- 55.Shapiro SS, Wilk MB. An Analysis of Variance Test for Normality (Complete Samples). Biometrika. 1965;52(3/4):591. doi: 10.2307/2333709 [DOI] [Google Scholar]
- 56.Fligner MA, Killeen TJ. Distribution-Free Two-Sample Tests for Scale. J Am Stat Assoc. 1976;71(353):210–213. doi: 10.1080/01621459.1976.10481517 [DOI] [Google Scholar]
- 57.Welch BL. The generalisation of student’s problems when several different population variances are involved. Biometrika. 1947;34(1–2):28–35. doi: 10.1093/biomet/34.1-2.28 [DOI] [PubMed] [Google Scholar]
- 58.Holm S. A Simple Sequentially Rejective Multiple Test Procedure. Scand J Stat. 1979;6(2):65–70. [Google Scholar]
- 59.Bonferroni CE. Il calcolo delle assicurazioni su gruppi di teste. In: Studi in Onore Del Professore Salvatore Ortu Carboni. Tipografia del Senato; 1935:13–60. [Google Scholar]
- 60.Bell TK, Godfrey KJ, Ware AL, Yeates KO, Harris AD. Harmonization of multi-site MRS data with ComBat. NeuroImage. 2022;257:119330. doi: 10.1016/j.neuroimage.2022.119330 [DOI] [PubMed] [Google Scholar]
- 61.Bates D, Mächler M, Bolker B, Walker S. Fitting Linear Mixed-Effects Models Using lme4. J Stat Softw. 2015;67(1). doi: 10.18637/jss.v067.i01 [DOI] [Google Scholar]
- 62.Vanhamme L, van den Boogaart A, Van Huffel S. Improved Method for Accurate and Efficient Quantification of MRS Data with Use of Prior Knowledge. J Magn Reson. 1997;129(1):35–43. doi: 10.1006/jmre.1997.1244 [DOI] [PubMed] [Google Scholar]
- 63.Poullet JB, Sima DM, Simonetti AW, et al. An automated quantitation of short echo time MRS spectra in an open source software environment: AQSES. NMR Biomed. 2007;20(5):493–504. doi: 10.1002/nbm.1112 [DOI] [PubMed] [Google Scholar]
- 64.Gajdošík M, Landheer K, Swanberg KM, Juchem C. INSPECTOR: free software for magnetic resonance spectroscopy data inspection, processing, simulation and analysis. Sci Rep. 2021;11(1):2094. doi: 10.1038/s41598-021-81193-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Naressi A, Couturier C, Devos JM, et al. Java-based graphical user interface for the MRUI quantitation package. Magma Magn Reson Mater Phys Biol Med. 2001;12(2–3):141–152. doi: 10.1007/BF02668096 [DOI] [PubMed] [Google Scholar]
- 66.Stefan D, Cesare FD, Andrasescu A, et al. Quantitation of magnetic resonance spectroscopy signals: the jMRUI software package. Meas Sci Technol. 2009;20(10):104035. doi: 10.1088/0957-0233/20/10/104035 [DOI] [Google Scholar]
- 67.Mandal PK, Shukla D. KALPANA: Advanced Spectroscopic Signal Processing Platform for Improved Accuracy to Aid in Early Diagnosis of Brain Disorders in Clinical Setting. J Alzheimers Dis. 2020;75(2):397–402. doi: 10.3233/JAD-191351 [DOI] [PubMed] [Google Scholar]
- 68.Purvis LAB, Clarke WT, Biasiolli L, Valkovič L, Robson MD, Rodgers CT. OXSA: An open-source magnetic resonance spectroscopy analysis toolbox in MATLAB. Motta A, ed. PLOS ONE. 2017;12(9):e0185356. doi: 10.1371/journal.pone.0185356 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Graveron-Demilly D. Quantification in magnetic resonance spectroscopy based on semi-parametric approaches. Magn Reson Mater Phys Biol Med. 2014;27(2):113–130. doi: 10.1007/s10334-013-0393-4 [DOI] [PubMed] [Google Scholar]
- 70.Ratiney H, Coenradie Y, Cavassila S, van Ormondt D, Graveron-Demilly D. Time-domain quantitation of 1 H short echo-time signals: background accommodation. MAGMA Magn Reson Mater Phys Biol Med. 2004;16(6):284–296. doi: 10.1007/s10334-004-0037-9 [DOI] [PubMed] [Google Scholar]
- 71.Soher B, Semanchuk P, Todd D, Steinberg J, Young K. VeSPA: integrated applications for RF pulse design, spectral simulation and MRS data analysis. In: Proc Int Soc Magn Reson Med. Vol 19.; 2011:1410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Hui SCN, Mikkelsen M, Zöllner HJ, et al. Frequency drift in MR spectroscopy at 3T. NeuroImage. 2021;241:118430. doi: 10.1016/j.neuroimage.2021.118430 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Henning A. Advanced Spectral Quantification: Parameter Handling, Nonparametric Pattern Modeling, and Multidimensional Fitting. In: Harris RK, Wasylishen RL, eds. eMagRes. John Wiley & Sons, Ltd; 2016:981–994. doi: 10.1002/9780470034590.emrstm1472 [DOI] [Google Scholar]
- 74.Deelchand DK, Marjańska M, Henry P, Terpstra M. MEGA-PRESS of GABA+: Influences of acquisition parameters. NMR Biomed. 2021;34(5). doi: 10.1002/nbm.4199 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Henry PG, Dautry C, Hantraye P, Bloch G. Brain GABA editing without macromolecule contamination. Magn Reson Med. 2001;45(3):517–520. doi: [DOI] [PubMed] [Google Scholar]
- 76.Murdoch JB, Dydak U. Modeling MEGA-PRESS macromolecules for a better grasp of GABA. In:; 2011:1394. https://cds.ismrm.org/protected/11MProceedings/files/1394.pdf [Google Scholar]
- 77.Bhagwagar Z, Wylezinska M, Jezzard P, et al. Reduction in Occipital Cortex γ-Aminobutyric Acid Concentrations in Medication-Free Recovered Unipolar Depressed and Bipolar Subjects. Biol Psychiatry. 2007;61(6):806–812. doi: 10.1016/j.biopsych.2006.08.048 [DOI] [PubMed] [Google Scholar]
- 78.Dydak U, Jiang YM, Long LL, et al. In vivo measurement of brain GABA concentrations by magnetic resonance spectroscopy in smelters occupationally exposed to manganese. Environ Health Perspect. 2011;119(2):219–224. doi: 10.1289/ehp.1002192 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Evans CJ, Puts NAJ, Robson SE, et al. Subtraction artifacts and frequency (Mis-)alignment in J-difference GABA editing: J-Difference GABA Editing. J Magn Reson Imaging. 2013;38(4):970–975. doi: 10.1002/jmri.23923 [DOI] [PubMed] [Google Scholar]
- 80.Behar KL, Ogino T. Characterization of macromolecule resonances in the 1H NMR spectrum of rat brain. Magn Reson Med. 1993;30(1):38–44. doi: 10.1002/mrm.1910300107 [DOI] [PubMed] [Google Scholar]
- 81.Jofre F, Anderson ME, Markley JL. L_arginine. Published online 2006. doi: 10.13018/BMSE000029 [DOI] [Google Scholar]
- 82.Xiao Y, Lanz B, Lim S, Tkáč I, Xin L. Improved reproducibility of γ-aminobutyric acid measurement from short-echo-time proton MR spectroscopy by linewidth-matched basis sets in LCModel. NMR Biomed. Published online October 15, 2023:e5056. doi: 10.1002/nbm.5056 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.van den Boogaart A, Ala-Korpela M, Jokisaari J, Griffiths JR. Time and frequency domain analysis of NMR data compared: An application to 1D1H spectra of lipoproteins. Magn Reson Med. 1994;31(4):347–358. doi: 10.1002/mrm.1910310402 [DOI] [PubMed] [Google Scholar]
- 84.Barkhuijsen H, de Beer R, van Ormondt D. Improved algorithm for noniterative time-domain model fitting to exponentially damped magnetic resonance signals. J Magn Reson 1969. 1987;73(3):553–557. doi: 10.1016/0022-2364(87)90023-0 [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Scripts used for the present analysis are publicly available here; further dependencies are described within: https://git.app.uib.no/bergen-fmri/gaba-linewidth
Spectra analysed in this manuscript were obtained from the publicly available Big GABA repository on NITRC, https://www.nitrc.org/projects/biggaba
Basis sets used in the primary analysis were obtained from the publicly available Osprey package, https://schorschinho.github.io/osprey







