Abstract
The GNAT (General NMR Analysis Toolbox) is a free and open‐source software package for processing, visualising, and analysing NMR data. It supersedes the popular DOSY Toolbox, which has a narrower focus on diffusion NMR. Data import of most common formats from the major NMR platforms is supported, as well as a GNAT generic format. Key basic processing of NMR data (e.g., Fourier transformation, baseline correction, and phasing) is catered for within the program, as well as more advanced techniques (e.g., reference deconvolution and pure shift FID reconstruction). Analysis tools include DOSY and SCORE for diffusion data, ROSY T 1/T 2 estimation for relaxation data, and PARAFAC for multilinear analysis. The GNAT is written for the MATLAB® language and comes with a user‐friendly graphical user interface. The standard version is intended to run with a MATLAB installation, but completely free‐standing compiled versions for Windows, Mac, and Linux are also freely available.
Keywords: analysis, NMR, diffusion, DOSY, MATLAB, multiway, processing, relaxation, software, toolbox
1. INTRODUCTION
A variety of software packages are available for processing high‐resolution NMR data. They can be classified into three main categories: (a) supplied by spectrometer manufacturers,1, 2, 3, 4, 5 (b) commercial packages,6, 7, 8, 9, 10, 11 and (c) free software.12, 13, 14, 15, 16 In recent years, an increasing number of free software packages, often produced by individual researchers or groups to cover specific needs, have been produced.17, 18, 19, 20, 21, 22, 23, 24, 25 One example is the popular DOSY (Diffusion‐Ordered SpectroscopY) Toolbox,23 a free and open‐source software package typically intended for processing NMR diffusion (a.k.a. DOSY26, 27) data, although sometimes used for other purposes. The first official version was released in 2009, and it has since become a useful tool for many scientists, complementing other NMR software.
Here, we describe the GNAT (General NMR Analysis Toolbox), a free and open‐source platform, released under the General Public License, for processing and analysing NMR data. This new toolbox is based on and replaces the DOSY Toolbox (although existing versions of the DOSY Toolbox will remain available). The intention is to provide a more general tool for analysing NMR data, independent of acquisition platform, to complement the manufacturers' software offerings. As open‐source software, it allows users to implement their own algorithms. The GNAT works within the MATLAB® environment, which has extensive libraries of mathematical computation and visualisation routines, making the GNAT flexible and easily extendable. The aim of this publication is to present the major features of the GNAT program to the wider NMR community and to give an overview of its general features. This publication is intended in part as a short introductory manual, highlighting the most important features and illustrating them with real experimental data. The GNAT can be downloaded from our website (http://nmr.chemistry.manchester.ac.uk/), where we intend to publish updates as well as making more detailed and updated documentation and test data available. All the example data used in this publication can also be found at the DOI: http://dx.doi.org/10.17632/pyr9688wvb.1.
2. FEATURES
The GNAT is intended to provide a user‐friendly tool for analysing NMR data, primarily high resolution, independent of source. As such, it currently supports import of data from Bruker, Varian, and JEOL instruments, but it can easily be extended to support other formats. Many of the analysis tools in GNAT are tailored for use with series of NMR spectra, whether as a function of gradient level (typical for diffusion data), of an incremented delay (typical for relaxation data), or of some other variable, such as spectra from a time‐series. Array structures as a function of up to two independent variables are supported (e.g., spectra as a function of both diffusion and relaxation). Data import therefore supports the standard Varian array structure and Bruker “ser” files, as well as a series of consecutively numbered 1D datasets in either format. Import of processed 1D spectra is also supported for Bruker data. All of these options are easily accessible from the “File” menu. Data can subsequently be saved in the internal GNAT file format, either in binary or in human‐readable ASCII format. Whole data structures can also be saved in standard MATLAB format as *.mat, and read back separately into MATLAB using the “load” command. Data can be saved as raw unprocessed data, or in the form of “FIDs” (Free Induction Decays) produced by inverse Fourier transform of the complex or real processed spectra. The latter option irreversibly saves any processing, such as baseline correction or reference deconvolution, in the form of a FID that can then of course be reopened and processed further. Access to some important processing parameters is available from the “Edit” menu.
The graphical user interface (GUI) consists of a main window (Figure 1), displaying spectra or FIDs, from which access to most processing and analysis features is available. When more advanced functions are used, for example, diffusion or relaxation 2D plots (DOSY and ROSY [Relaxation‐Ordered SpectroscopY], respectively), separate windows are opened, in which method‐specific features are available. The GUI is divided between basic processing and features in the left tab group (e.g., Fourier transformation and plot control) and more advanced analysis (e.g., DOSY and PARAFAC [PARAllel FACtor]) in the right tab group. Table 1 lists the most important processing and analysis features implemented in the current version of the GNAT.
Table 1.
Processing | Basic | Fourier transformation | |
Zero‐filling | |||
Apodisation | |||
Referencing | |||
Phase correction28 | |||
Baseline correction29 | |||
Advanced | Reference deconvolution30 | ||
Pure shift interferogram construction31, 32, 33, 34 | |||
Analysis | Basic | Peak width | |
Frequency shift | |||
Resolution change | |||
Amplitude change | |||
Temperature change | |||
Phase change35 | |||
Integral change | |||
Advanced | Univariate methods | DOSY26, 27, 36 | |
ROSY37, 38, 39 | |||
Multivariate methods | SCORE/RSCORE40, 41 | ||
OUTSCORE42 | |||
LOCODOSY43 | |||
ILT44, 45 | |||
DECRA46, 47, 48, 49 | |||
FDM/RRT50, 51, 52 | |||
ICA53, 54, 55 | |||
MCR56, 57, 58 | |||
PARAFAC59, 60, 61, 62, 63, 64, 65 | |||
Slicing66 |
3. PROCESSING AND ANALYSING NMR DATA
The GNAT software is mainly designed for analysing series of spectra such as diffusion and relaxation data, but also caters for processing of single FIDs. In order to optimise the information extractable from spectra, a variety of tools are included. Many of these are standard for most NMR processing software, and only a short description will be included here.
3.1. Data processing (left tab group)
The “Plot” tab contains the controls for display; these includes setting of spectral limits and scale units (ppm or Hz). Separate plots can be produced for inclusion in reports and publications.
The “Phase” tab contains the controls for phase correction. Manual correction is performed by setting a pivot point and varying the zeroth and first order phases. The automatic phase correction algorithm is based on the simple and effective early work in this field.28, 67 Phase correction can be performed either for a whole array of spectra simultaneously, or for individual spectra, by switching between “Global” and “Individual” mode.
The “FT” tab implements Fourier transformation of FIDs (Figure 2). The number of data points in the spectrum is determined by the Fourier number (fn), the total number of complex data points, to allow for zero‐filling or truncation to arbitrary size. Window functions are used to multiply the FID, typically to enhance sensitivity or resolution in the resultant spectrum and/or to suppress so‐called sinc wiggles where FID data are truncated. Figure 3 shows an example where a line‐broadening window function is applied in order to enhance sensitivity and suppress sinc wiggles. The interface allows display of both the window function and the weighted FID. In the GNAT, window functions are implemented with Lorentzian and Gaussian parameters by multiplying the FID by the function below, where t is time within the FID and lw and gw are the additional Lorentzian and Gaussian linewidths at half height, respectively.
Thus, using gw = 0 convolutes the raw spectrum with a Lorentzian line of width lw Hz, whereas using lw = 0 convolutes the raw spectrum with a Gaussian line of width gw Hz.
Using a negative value for lw and positive for gw allows resolution enhancement by Lorentz–Gauss transformation, whereas using both values positive gives a Voigt lineshape. In the same tab, the user can also set a chemical shift reference.
The “Correct” tab implements various forms of correction of the data (Figure 4). Manual baseline correction is achieved by selecting the regions of the spectrum that contain signal; the remaining baseline is then fitted to a polynomial of user‐defined order and subtracted from the spectrum. The automatic baseline correction29 iteratively adjusts the definitions of the baseline regions to achieve the best baseline correction for a specified order of polynomial.
A very powerful, but still underused, tool for correcting systematic errors is reference deconvolution.30 This works by using the differences between an ideal reference signal and a resolved experimental signal to correct for systematic errors (e.g., lineshape distortions, phase errors, and frequency shifts) in the whole spectrum. In the GNAT, the reference signal can be a simple singlet (for spectra of any nucleus), or tetramethylsilane (TMS ) or TSP‐d 4 (trimethylsilylpropanoate) for 1H spectra. In the latter two cases, the known 29Si and 13C satellite patterns of the reference signals are included in the calculation. Reference deconvolution is particularly valuable for multivariate analysis methods, because these rely heavily on the linearity of the data (which requires that the spectral shapes for individual resonances remain the same in all increments of a dataset). In our laboratory, reference deconvolution is used routinely, and we and others have seen significant improvements in many investigations.41, 49, 62, 63, 68, 69, 70, 71, 72, 73, 74, 75, 76 The implementation of reference deconvolution processing in the GUI is shown in Figure 4. The user simply uses two cursors to define the spectral limits of the reference peak, typically the whole peak plus a small piece of baseline either side, the type of reference signal, and the target lineshape (the shape required for the experimental reference peak after correction, as specified by lw and gw in the weighting function above). The correction is executed by pressing the FIDDLE (Free Induction Decay Deconvolution for Lineshape Enhancement) button.
The method is demonstrated here (Figure 5) on data from a proton experiment on a quinine sample in which the homogeneity of the static field was deliberately perturbed by poor shimming. The quinine methoxy group at 3.9 ppm (Figure 4) was used as reference, with a 1 Hz Gaussian target lineshape to correct the severe lineshape distortions in the rest of the spectrum (Figure 5).
The “Array” tab contains an interface for plotting spectra from an array; see Section 3.2.1 below.
The “Prune” tab contains an interface to remove (prune) part of the data before further analysis. Individual array elements and/or spectral regions can be removed as desired.
The “Pure shift” tab (Figure 6) contains an interface for constructing a pure shift FID from an interferogram experiment (including pure shift DOSY77, 78, 79). Pure shift methods,31, 32, 33, 34 where the resolution of NMR spectra is improved by suppressing the multiplet structure caused by homonuclear couplings (Figure 7), can be classified into two main groups on the basis of the way the data are acquired: pseudo‐2D (interferogram)80, 81, 82, 83 or real‐time.84, 85, 86, 87, 88 Real‐time experiments directly generate a single FID that after standard Fourier transformation gives a homodecoupled 1D NMR spectrum. In interferogram experiments, a synthetic 1D homodecoupled FID (or “interferogram”) is constructed by concatenating data chunks extracted from individual time‐domain datasets acquired in 2D mode. The GNAT includes the post‐processing needed for interferogram experiments to reconstruct the 1D pure shift FID; the parameters required are read in with the raw data but can also be adjusted by the user if necessary. “Chunks” refers to the number of increments in the dataset; “Chunk duration” the duration of a single FID chunk; “Chunk points” the number of data points per chunk; “First chunk” the number of data points in the first chunk; and “Drop points” the number of data points to be discarded from the start of each chunk.
The “Misc” tab currently provides buttons to save and recall the state of the current data (these functions can also be found in the shortcuts in the top left corner of the main GUI) and an interface to listen to your FIDs (for the musically inclined).
The “Info” tab displays information about the data, for example, import information and error messages. A log file (GNATlog.txt in the current working directory) keeps a running copy of the GNAT information displayed in this tab. It can be saved separately, for example, for efficient identification and reporting of bugs.
3.2. Data analysis (right tab group)
In the data analysis section of the interface, various methods have been implemented for specific data analyses. The first level of the tab group is divided into different categories of data: (general) Analysis, Diffusion, Relaxation, Multiway, and Misc; each in turn contains a range of methods. The analysis functions typically use the spectral region displayed in the main window of the GUI (active spectral window). Further regions can be excluded from analysis in the “Prune tab” (see above).
3.2.1. The “Analysis” tab group
Here, the user can find some general analysis tools, which are mainly intended for analysis of a series of spectra.
The “Analyse” tab contains an interface for the analysis of individual and arrayed experiments (see Figure 1). The user can make simple measurements of linewidth (“Shape” function) and frequency difference (“Delta” function). Using “Spectra array,” the user can choose how to plot an array, by choosing start, stop, and step points and vertical and horizontal offsets. These settings are also used to determine the array elements to use in the different analysis methods in this tab. “Freq” plots the change in frequency (for the highest peak in the display window that is above a user‐set threshold) as a function of spectrum number; “Resol” the peak width at half height; “Ampl” the peak amplitude; “Temp” the temperature estimated from the peak position (vide infra); “Phase” the absolute signal phase deviation from pure absorption mode (the actual calculation uses the dispersion mode; see below); and “Int” the integral (for a defined integral region). Typical outputs are shown below (Figure 8) for resolution and phase, showing the stability of one of our spectrometers over an 8‐hr period.
The change in phase is determined by comparing the peak excursions of a near‐dispersion mode signal.35 This relies on having a (close to) Lorentzian lineshape, which can be achieved in practice by applying a Lorentzian window function that is broad compared with the experimental linewidth, and phasing the first spectrum in a series approximately to absorption mode. The algorithm then temporarily shifts the phase by 90° and uses the positive and negative peak excursions to calculate the phase deviation.
The estimated change in temperature with spectrum number assumes that the sample is dissolved in D2O and that the dominant peak is either of residual water (HDO) or of a resonance with a negligible temperature coefficient of chemical shift. In the former case, the temperature dependence is that of the primary isotope effect of hydrogen in D2O; in the latter, it is dominated by the (relatively large) temperature sensitivity of the deuterium chemical shift of the D2O lock material. The calibration used is based on experimental measurements of the temperature dependences of the HDO and TSP chemical shifts, as reported by Topspin software, for a standard Bruker test sample of HDO and TSP in D2O. This yielded temperature coefficients of 2.97 × 10−4 ppm/K for HOD and 1.14 × 10−2 ppm/K for TSP (Figure 9). When performing temperature analysis, which relies on the temperature dependence of the chemical shift, it is important to ensure that the TSP/HOD peak remains within the active spectral window for all spectra of the array.
The “Align” tab contains an interface to allow a series of spectra to be manually aligned in frequency space.
The “Integrate” tab contains an interface to allow integration of spectral peaks, including correction of baseline offset and tilt. Integral regions can be picked manually or automatically and normalised if desired. Integral values and/or regions can be exported in a human‐readable ASCII format, and regions can be reimported from an export file.
3.2.2. The “Diffusion” tab group
Here, the user can find various options for analysing diffusion NMR data (Figure 10). The general description of this topic is beyond the scope of this publication, and the reader is referred, as a starting point, to selected review and application articles.26, 27, 89, 90, 91, 92, 93, 94, 95, 96, 97 Many of the methods in this and other tabs are highly sensitive to the quality of the input data, because of the assumptions on which the algorithms rely. It is therefore prudent to take extra care in the preparation of data, with, for example, careful baseline correction and phasing. We find it particularly helpful to use reference deconvolution, where a suitable reference signal is available, to correct for the many systematic errors present in experimental data. The GNAT has some automatic settings, for example, from parameters imported with the raw data, but user‐defined parameters and parameter adjustments are possible using “Edit > Settings > Diffusion.” (e.g., when the diffusion data are imported but the class of pulse sequence used cannot be determined automatically, the GNAT will assume a bipolar sequence type; this can be changed here.)
The “DOSY” tab contains controls for typical DOSY26, 27, 36 processing. The default processing is HR‐DOSY98 in which each peak is assumed to originate from a single species, and therefore a monoexponential fit is performed to some version of the Stejskal–Tanner equation:99, 100
(1) |
where I 0 is the signal intensity in the absence of diffusion, γ is the magnetogyric ratio of the diffusion‐encoded spins, δ is the gradient pulse duration, g is the gradient amplitude, D is the diffusion coefficient, and Δ′ is the corrected diffusion‐encoding time. An empirical correction of Equation (1) for the effects of spatially non‐uniform pulsed field gradients (NUG) can also be used but requires calibration of the spectrometer/probe.101 An attempt to fit to an arbitrary number of exponentials102, 103 (e.g., when a signal contains contributions from several components) can be made by selecting “Multiexp” and choosing the number of exponentials. The fitting routine will reduce the number of components until statistically significant results are obtained. Any fit can be performed using either automated peak picking, all points in the spectrum, or pre‐set integral regions.
The resulting DOSY spectrum is displayed in a separate DOSY GUI with individual controls for plotting and analysing the data. These include plots of fits and residuals, projections, and separate plots intended for reports. A separate text file with the relevant fit statistics is also accessible via the DOSY GUI. Figure 11 shows an example of a pure shift DOSY spectrum obtained with the GNAT for a mixture of provitamin and vitamin D3 in DMSO‐d 6, using monoexponential fitting.79
The “SCORE” tab contains the controls for SCORE (Speedy COmponent REsolution) analysis. SCORE40, 41, 104, 105 is a type of multivariate processing in which entire component spectra are produced by a successful analysis. The OUTSCORE (Optimized Unmixing of True Spectra for COmponent REsolution)42 variant, in which the separation criterion maximises spectral differences rather than minimising residuals, as in SCORE, is also available. The user chooses the number of components to fit and has the options to use a pure exponential (Equation (1)) or a NUG‐corrected101 decay and to do the fit with or without non‐negativity constraints (i.e., with only positive values of the spectra and diffusion decays allowed). The result is presented as fitted spectra and diffusion decays together with an estimate of the relative signal integrals. In Figure 12, the fitted SCORE components from a mixture of maltotriose and glucose are shown, together with the mixture spectrum. As an option, diagnostic plots of residuals, leverages, and residuals versus leverages can be plotted. These plots, inspired by the N‐way toolbox,106 are useful for identifying spectral regions and/or gradient levels that are behaving in an unexpected way and could therefore usefully be excluded from the fitting, or for indicating that the number of components chosen was incorrect.
The “LOCODOSY” tab contains the controls for LOCODOSY (LOcal COvariance Diffusion‐Ordered SpectroscopY) fitting. LOCODOSY43 is a hybrid between univariate (e.g., DOSY) and multivariate (e.g., SCORE) data processing. It takes advantage of the enhanced statistical leverage of multivariate methods while seeking to alleviate one of their disadvantages, that typically only a small number of components can be resolved by such methods. The spectrum is divided into individual segments that are each assumed to contain signals from only a small number of components. For each region, the maximum number of components is specified (either manually or automatically), and a multivariate fit is performed. A full DOSY spectrum is then assembled from the individual segments (for more detail, see the original publication43). Automated analysis is performed by pressing the “Auto” button in the “Segment” part and pressing “Run.” LOCODOSY can provide very good results but is strongly dependent on high quality data and appropriate choice of segmentation. The current options for the multivariate method to use are SCORE, OUTSCORE, and DECRA (Direct Exponential Curve Resolution Algorithm; see below).
The “ILT” tab contains controls for an implementation of what is commonly known as the inverse Laplace transform (ILT).44 The decay of signal with gradient amplitude in a diffusion NMR experiment contains information on the complete distribution of compounds of different sizes, which in principle is accessible via the ILT. In practice, however, this is an ill‐posed problem with an infinite number of solutions. There are a number of methods available for diffusion NMR that constrain the mathematical problem in different ways to allow unique solutions to be found.36, 45, 72, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119 They all have different advantages and disadvantages and require careful interpretation, but this is beyond the scope of this publication. (In principle, the HR‐DOSY approach is a very strict, but useful, constraint on the ILT, although it is not normally described as such in the context of diffusion NMR data). The current implementation of the ILT in the GNAT is based on the MATLAB Regularisation Tools,120, 121 as described by Day.114 The default values chosen should give a sensible result in many cases, but the most important parameters are all under user control.
The “DECRA” tab contains the controls for DECRA fitting. DECRA46, 47, 48, 49 is a very fast multivariate processing method that exploits the fact that diffusion NMR data ideally show pure exponential behaviour with increasing gradient amplitude squared. The only user input is the number of components to be fitted.
The “FDM/RRT” tab contains controls for the FDM (Filter Diagonalization Method)50 and RRT (Regularized Resolvent Transform)51 for analysing diffusion NMR data.52 This method performs a type of ILT (as defined above) and displays the results in a DOSY plot.
The “ICA” tab contains the controls for ICA (Independent Component Analysis)54 processing of diffusion NMR data. ICA is a multivariate method that separates components based on assumptions about their statistical independence and the non‐Gaussian behaviour of signals. It can be very effective for diffusion NMR analysis of mixtures where there is little spectral overlap.55 In the GNAT, this processing is implemented using the fast‐ICA algorithm.53
The “MCR” tab contains the controls for MCR (Multivariate Curve Resolution)56, 58 analysis of diffusion NMR data. Results here are highly dependent on starting guesses, and are primarily useful for refining fitted solutions subject to sensible constraints. In the GNAT implementation, the starting guesses are implemented using PCA‐VARIMAX57 and DECRA.46 These starting points can then be refined by imposing a non‐negativity constraint and/or by forcing the decay to be either a pure exponential or of a shape determined by a NUG calibration.101
3.2.3. The “Relaxation” tab group
Here, the user can find various ways to analyse relaxation data (Figure 13), a common and important part of NMR.122 The general description of this topic is beyond the scope of this publication, and the reader is referred, as a starting point, to selected books.123, 124 The GNAT imports relaxation data in standard format from Varian or Bruker, but user‐defined parameters and changes are possible using “Edit > Settings > Relaxation.” For Bruker data, delay values are imported from a vclist or vdlist file.
The “ROSY” tab includes controls for relaxation processing with ROSY9, 38 display, as shown in Figure 14. (Various other names for such analyses have been suggested in the literature.37, 125) A ROSY display is analogous to DOSY in that it plots relaxation time or rate versus chemical shift in a pseudo‐2D plot. As with DOSY, the user controls whether the fitting is done using automatic peak picking, for all data points, or for pre‐set integral regions (see above). There is a choice between T 1 inversion recovery,126 T 1 saturation recovery,127 and T 2 analysis.128, 129, 130 The resulting ROSY spectrum is displayed in a separate ROSY GUI, with individual controls for plotting and analysing the data. These includes plots of fits and residuals, projections, and separate plots intended for reports. A separate text file with the relevant fit statistics is also accessible via the ROSY GUI. A ROSY display of REST2 (Relaxation‐Encoded Selective TOCSY using T 2 weighting)39 data is shown in Figure 14.
The “RSCORE” tab includes an implementation of SCORE41 and OUTSCORE42 analysis for relaxation data. The user can choose between T 2 and T 1 fitting functions and whether or not to use a non‐negativity constraint.
3.2.4. The “Multiway” tab group
Multiway analysis is a superset of multivariate analysis in which the data have three or more dimensions, and is of increasing interest in chemistry131 and NMR spectroscopy.62, 63, 64, 70, 73, 74, 132, 133, 134, 135
The “PARAFAC” tab incorporates an interface to the PARAFAC functionality in the N‐way toolbox106 (Figure 15). The PARAFAC model assumes that the data dimensions (or modes) are mathematically independent (i.e., the data are multilinear).59, 60, 61, 64 When this assumption holds, PARAFAC typically yields results for each mode that are directly physically relevant, for example, corresponding to individual chemical components rather than to arbitrary linear combinations as in many other forms of multivariate analysis. One example is when diffusion NMR data are acquired during the course of a chemical reaction.63, 70 The resultant data then vary with the three independent dimensions of Larmor frequency, gradient amplitude (causing signal decay as a result of diffusion), and time (causing signal amplitude changes as a result, e.g., of chemical reaction) for the species involved. Figure 16 shows the result of a PARAFAC analysis of the hydrolysis of maltotriose.63 Such PARAFAC analyses can be very powerful, but they are very sensitive to deviations from linearity such as changes in the NMR spectrum due to variation in, for example, shimming or temperature. In some cases, these variations can be corrected with reference deconvolution.30, 75 In the interface, the user can choose to constrain the algorithm (e.g., by non‐negativity), to try different initialisation methods, and to produce a series of diagnostic plots.
The “Slicing” tab contains an implementation of PowerSlicing for further analysis with the PARAFAC algorithm.61, 106 PowerSlicing66 and Slicing132 are directly related to DECRA46 in that they take advantage of exponential behaviour, for example, in some diffusion or relaxation NMR data, to produce an artificially trilinear system. PowerSlicing also shares a disadvantage of DECRA in that it fails when data deviate from pure exponential behaviour.49 By analysing data with the PARAFAC algorithm rather than just with DECRA, the user has the possibility to incorporate constraints in order to improve results.
3.2.5. The “Misc” tab group
Here, we have implemented some tools that do not easily fit in the other categories but may well still be useful.
The “Sim DOSY” tab contains controls for simulating DOSY data, for further analysis in the GNAT or for export. Typical parameters for a pulsed field gradient NMR experiment are available, and the user can decide, for example, the diffusion coefficients and T 2 values of the individual simulated peaks in the spectrum.
The “Bin” tab incudes a facility to bin NMR spectra for further analysis, for example, by PCA (principal components analysis). The binned data can be exported in MATLAB (*.mat) or *.csv formats.
The “ICOSHIFT” tab incorporates an interface to the ICOSHIFT (interval correlation optimized shifting) algorithm136 for peak alignment, for example, in metabolomics data.
4. CONCLUSION
The GNAT is a free and open‐source tool for the analysis of NMR data. It aspires to be a platform for easy and fast implementation of useful processing methods. Revised versions will be downloadable from our website (http://nmr.chemistry.manchester.ac.uk/); our intention is to continuously support the software with new implementations, improvements, and bug fixes.
4.1. Experimental section
Experimental spectra were recorded on three different spectrometers: 500 MHz Bruker Avance II+, 500 MHz Bruker Avance Neo, and 400 MHz Varian INOVA. Eight different samples were used: sample 1 contained 100 mM quinine and TMS in DMSO‐d 6; sample 2 contained 50 mM quinine in DMSO‐d 6; sample 3 contained 1% vol/vol acetone in D2O; sample 4 contained TSP in D2O; sample 5 contained an equimolar mixture (50 mM) of vitamin D3 and provitamin D3 in acetone‐d 6 (more information in original publication79); sample 6 contained an equimolar mixture (100 mM) of glucose and maltotriose in D2O; sample 7 contained an equimolar mixture (100 mM) of lactose and melibose in D2O (more information in original publication39); and sample 8 initially contained 18 mM maltotriose, 25 mM pivalic acid (as a reference), and 90 mM sulphuric acid in D2O (more information in original publication63).
ACKNOWLEDGEMENTS
Over the years, many people have been involved in developing the GNAT, whether in the form of implementing code in the DOSY Toolbox or pointing out potential improvements and reporting bugs. They include, but are by no means limited to, Adolfo Botana, Iain Swan, Johannes Björnerås, Cláudio Tormena, Thais Mendonça Barbosa, Parvaneh Ebrahimi, Peter Kiraly, Mohammadali Foroozandeh, Pinelopi Moutzouri, Ralph Adams, Davy Sinnaeve, and Rob Evans.
This work was supported by the Engineering and Physical Sciences Research Council (Grants EP/E057888/1; EP/E05899X/1; EP/H024336/1; EP/I007989/1; EP/L018500/1; EP/M013820/1; and EP/N033949/1) and by a studentship to GDP from Science without Borders – Brazil (Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) reference number 233163/2014‐0).
Castañar L, Dal Poggetto G, Colbourne AA, Morris GA, Nilsson M. The GNAT: A new tool for processing NMR data. Magn Reson Chem. 2018;56:546–558. https://doi.org/10.1002/mrc.4717
REFERENCES
- 1.TopSpin ,http://www.bruker.com/service/support-upgrades/software-downloads/nmr
- 2.Delta ,http://www.jeol.co.jp/en/products/detail/Delta5
- 3.TNMR ,http://www.tecmag.com/softwaretnmr
- 4. Güntert P., Dötsch V., Wider G., Wüthrich K., J. Biomol. NMR 1992, 2, 619. [Google Scholar]
- 5.Prospa ,http://www.magritek.com/prospa
- 6. Cobas J. C., Sardina F. J., Concepts in Magnetic Resonance Part A 2003, 19A, 80. [Google Scholar]
- 7.MNova ,http://www.mestrelab.com/download/mnova
- 8.ACD ,http://www.acdlabs.com/products/adh/nmr
- 9.iNMR ,http://www.inmr.net
- 10.NMRTEC ,http://www.nmrtec.com
- 11.Felix NMR ,http://www.felixnmr.com
- 12.VnmrJ ,http://openvnmrj.org
- 13.RNMRTK ,http://rnmrtk.uchc.edu/rnmrtk/RNMRTK.html
- 14. ONMR , http://www.originlab.com/index.aspx?go=Support&pid=932
- 15.RELAX ,http://www.nmr-relax.com/manual/Relaxation_curve_fitting.html
- 16.NPNMR ,http://www.npnmr.com
- 17. Balacco G., J. Chem. Inf. Comput. Sci. 1994, 34, 1235. [Google Scholar]
- 18. Delaglio F., Grzesiek S., Vuister G. W., Zhu G., Pfeifer J., Bax A., J. Biomol. NMR 1995, 6, 277. [DOI] [PubMed] [Google Scholar]
- 19. Pons J.‐L., Malliavin T. E., Delsuc M. A., J. Biomol. NMR 1996, 8, 445. [DOI] [PubMed] [Google Scholar]
- 20. Günther U. L., Ludwig C., Rüterjans H., J. Magn. Reson. 2000, 145, 201. [DOI] [PubMed] [Google Scholar]
- 21. Zhao Q., Stoyanova R., Du S., Sajda P., Brown T. R., Bioinformatics 2006, 22, 2562. [DOI] [PubMed] [Google Scholar]
- 22. van Beek J. D., J. Magn. Reson. 2007, 187, 19. [DOI] [PubMed] [Google Scholar]
- 23. Nilsson M., J. Magn. Reson. 2009, 200, 296. [DOI] [PubMed] [Google Scholar]
- 24. Lewis I. A., Schommer S. C., Markley J. L., Magn. Reson. Chem. 2009, 47, S123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.SpinWorks ,https://home.cc.umanitoba.ca/~wolowiec/spinworks
- 26. Johnson C. S., Prog. Nucl. Magn. Reson. Spectrosc. 1999, 34, 203. [Google Scholar]
- 27. Morris G. A., in Encyclopedia of Magnetic Resonance, John Wiley & Sons, Ltd, Chichester: 2009, pp. https://doi.org/10.1002/9780470034590.emrstm0119.pub2 [Google Scholar]
- 28. Craig E. C., Marshall A. G., J. Magn. Reson. 1988, 76, 458. [Google Scholar]
- 29. Pearson G. A., J. Magn. Reson. 1977, 27, 265. [Google Scholar]
- 30. Morris G. A., Barjat H., Home T. J., Prog. Nucl. Magn. Reson. Spectrosc. 1997, 31, 197. [Google Scholar]
- 31. Adams R. W., eMagRes 2014, 3, 295. [Google Scholar]
- 32. Castañar L., Parella T., Magn. Reson. Chem. 2015, 53, 399. [DOI] [PubMed] [Google Scholar]
- 33. Zangger K., Prog. Nucl. Magn. Reson. Spectrosc. 2015, 86‐87, 1. [DOI] [PubMed] [Google Scholar]
- 34. Castañar L., Magn. Reson. Chem. 2017, 55, 47. [DOI] [PubMed] [Google Scholar]
- 35. Morris G. A., J. Magn. Reson. 1988, 78, 281. [Google Scholar]
- 36. Morris K. F., Johnson C. S., J. Am. Chem. Soc. 1992, 114, 3139. [Google Scholar]
- 37. Lupulescu A., Kotecha M., Frydman L., J. Am. Chem. Soc. 2003, 125, 3376. [DOI] [PubMed] [Google Scholar]
- 38. Nishiyama Y., Frey M. H., Mukasa S., Utsumi H., J. Magn. Reson. 2010, 202, 135. [DOI] [PubMed] [Google Scholar]
- 39. Dal Poggetto G., Castañar L., Adams R. W., Morris G. A., Nilsson M., Chem. Commun. 2017, 53, 7461. [DOI] [PubMed] [Google Scholar]
- 40. Stilbs P., Paulsen K., Rev. Sci. Instrum. 1996, 67, 4380. [Google Scholar]
- 41. Nilsson M., Morris G. A., Anal. Chem. 2008, 80, 3777. [DOI] [PubMed] [Google Scholar]
- 42. Colbourne A. A., Meier S., Morris G. A., Nilsson M., Chem. Commun. 2013, 49, 10510. [DOI] [PubMed] [Google Scholar]
- 43. Colbourne A. A., Morris G. A., Nilsson M., J. Am. Chem. Soc. 2011, 133, 7640. [DOI] [PubMed] [Google Scholar]
- 44. Istratov A. A., Vyvenko O. F., Rev. Sci. Instrum. 1999, 70, 1233. [Google Scholar]
- 45. Yuan B., Ding Y. M., Kamal G. M., Shao L. M., Zhou Z. M., Jiang B., Sun P., Zhang X., Liu M. L., J. Magn. Reson. 2017, 278, 1. [DOI] [PubMed] [Google Scholar]
- 46. Windig W., Antalek B., Intell C., Lab, Syst. 1997, 37, 241. [Google Scholar]
- 47. Antalek B., Hewitt J. M., Windig W., Yacobucci P. D., Mourey T., Le K., Magn. Reson. Chem. 2002, 40, S60. [Google Scholar]
- 48. Alam T. M., Alam M. K., J. Chemom. 2003, 17, 583. [Google Scholar]
- 49. Nilsson M., Morris G. A., Magn. Reson. Chem. 2007, 45, 656. [DOI] [PubMed] [Google Scholar]
- 50. Mandelshtam V. A., Prog. Nucl. Magn. Reson. Spectrosc. 2001, 38, 159. [Google Scholar]
- 51. Armstrong G. S., Chen J., Cano K. E., Shaka A. J., Mandelshtam V. A., J. Magn. Reson. 2003, 164, 136. [DOI] [PubMed] [Google Scholar]
- 52. Martini B. R., Mandelshtam V. A., Morris G. A., Colbourne A. A., Nilsson M., J. Magn. Reson. 2013, 234, 125. [DOI] [PubMed] [Google Scholar]
- 53. Hyvarinen A., IEEE Trans. Neural Netw. 1999, 10, 626. [DOI] [PubMed] [Google Scholar]
- 54. Hyvarinen A., Oja E., Neural Networks 2000, 13, 411. [DOI] [PubMed] [Google Scholar]
- 55. Zhong J., DiDonato N., Hatcher P. G., J. Chemom. 2012, 26, 150. [Google Scholar]
- 56. Van Gorkom L. C. M., Hancewicz T. M., J. Magn. Reson. 1998, 130, 125. [DOI] [PubMed] [Google Scholar]
- 57. Kayser J., Tenke C. E., Clin. Neurophysiol. 2003, 114, 2307. [DOI] [PubMed] [Google Scholar]
- 58. Huo R., Wehrens R., Buydens L. M. C., J. Magn. Reson. 2004, 169, 257. [DOI] [PubMed] [Google Scholar]
- 59. Harshman R., UCLA Working Papers in Phonetics. 1970, 16, 1. [Google Scholar]
- 60. Harshman R. A., Lundy M. E., Comput. Stat. Data An. 1994, 18, 39. [Google Scholar]
- 61. Bro R., Chemometrics Intell. Lab. Syst. 1997, 38, 149. [Google Scholar]
- 62. Nilsson M., Botana A., Morris G. A., Anal. Chem. 2009, 81, 8119. [DOI] [PubMed] [Google Scholar]
- 63. Khajeh M., Botana A., Bernstein M. A., Nilsson M., Morris G. A., Anal. Chem. 2010, 82, 2102. [DOI] [PubMed] [Google Scholar]
- 64. Shinzawa H., Nishida M., Kanematsu W., Tanaka T., Suzuki K., Noda I., Analyst 2012, 137, 1913. [DOI] [PubMed] [Google Scholar]
- 65. Björnerås J., Botana A., Morris G. A., Nilsson M., J. Biomol. NMR 2013, 58, 251. [DOI] [PubMed] [Google Scholar]
- 66. Engelsen S. B., Bro R., J. Magn. Reson. 2003, 163, 192. [DOI] [PubMed] [Google Scholar]
- 67. Ernst R. R., J. Magn. Reson. 1969, 1, 7. [Google Scholar]
- 68. Nilsson M., Morris G. A., Magn. Reson. Chem. 2006, 44, 655. [DOI] [PubMed] [Google Scholar]
- 69. Mobli M., Nilsson M., Almond A., Glycoconjugate J. 2008, 25, 401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Nilsson M., Khajeh M., Botana A., Bernstein M. A., Morris G. A., Chem. Commun. 2009, 1252. [DOI] [PubMed] [Google Scholar]
- 71. Codling D. J., Zheng G., Stait‐Gardner T., Yang S., Nilsson M., Price W. S., J. Phys. Chem. B 2013, 117, 2734. [DOI] [PubMed] [Google Scholar]
- 72. Toumi I., Torrésani B., Caldarelli S., Anal. Chem. 2013, 85, 11344. [DOI] [PubMed] [Google Scholar]
- 73. Acar E., Papalexakis E. E., Gurdeniz G., Rasmussen M. A., Lawaetz A. J., Nilsson M., Bro R., BMC Bioinformatics 2014, 15, 239. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Björnerås J., Botana A., Morris G. A., Nilsson M., J. Biomol. NMR 2014, 58, 251. [DOI] [PubMed] [Google Scholar]
- 75. Ebrahimi P., Nilsson M., Morris G. A., Jensen H. M., Engelsen S. B., J. Chemom. 2014, 28, 656. [Google Scholar]
- 76. Björnerås J., Nilsson M., Maler L., Biochim. Biophys. Acta 2015, 1848, 2910. [DOI] [PubMed] [Google Scholar]
- 77. Nilsson M., Morris G. A., Chem. Commun. 2007, 933. [DOI] [PubMed] [Google Scholar]
- 78. Aguilar J. A., Faulkner S., Nilsson M., Morris G. A., Angew. Chem. Int. Ed. 2010, 49, 3901. [DOI] [PubMed] [Google Scholar]
- 79. Foroozandeh M., Castañar L., Martins L. G., Sinnaeve D., Dal Poggetto G., Tormena C. F., Adams R. W., Morris G. A., Nilsson M., Angew. Chem. Int. Ed. 2016, 55, 15579. [DOI] [PubMed] [Google Scholar]
- 80. Zangger K., Sterk H., J. Magn. Reson. 1997, 124, 486. [Google Scholar]
- 81. Aguilar J. A., Nilsson M., Morris G. A., Angew. Chem. Int. Ed. 2011, 50, 9716. [DOI] [PubMed] [Google Scholar]
- 82. Foroozandeh M., Adams R. W., Meharry N. J., Jeannerat D., Nilsson M., Morris G. A., Angew. Chem. Int. Ed. 2014, 53, 6990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83. Dal Poggetto G., Castañar L., Morris G. A., Nilsson M., RSC Adv. 2016, 6, 100063. [Google Scholar]
- 84. Lupulescu A., Olsen G. L., Frydman L., J. Magn. Reson. 2012, 218, 141. [DOI] [PubMed] [Google Scholar]
- 85. Meyer N. H., Zangger K., Angew. Chem. Int. Ed. 2013, 52, 7143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86. Castañar L., Nolis P., Virgili A., Parella T., Chem. – Eur. J. 2013, 19, 17283. [DOI] [PubMed] [Google Scholar]
- 87. Ying J., Roche J., Bax A., J. Magn. Reson. 2014, 241, 97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88. Adams R. W., Byrne L., Kiraly P., Foroozandeh M., Paudel L., Nilsson M., Clayden J., Morris G. A., Chem. Commun. 2014, 50, 2512. [DOI] [PubMed] [Google Scholar]
- 89. Stilbs P., Prog. Nucl. Magn. Reson. Spectrosc. 1987, 19, 1. [Google Scholar]
- 90. Price W. S., Concepts Magn. Reson. 1997, 9, 299. [Google Scholar]
- 91. Price W. S., Concepts Magn. Reson. 1998, 10, 197. [Google Scholar]
- 92. Cohen Y., Avram L., Frish L., Angew. Chem. Int. Ed. 2005, 44, 520. [DOI] [PubMed] [Google Scholar]
- 93. Liebau J., Ye W. H., Maler L., Magn. Reson. Chem. 2017, 55, 395. [DOI] [PubMed] [Google Scholar]
- 94. Pregosin P. S., Magn. Reson. Chem. 2017, 55, 405. [DOI] [PubMed] [Google Scholar]
- 95. Stilbs P., Magn. Reson. Chem. 2017, 55, 386. [DOI] [PubMed] [Google Scholar]
- 96. Zubkov M., Dennis G. R., Stait‐Gardner T., Torres A. M., Willis S. A., Zheng G., Price W. S., Magn. Reson. Chem. 2017, 55, 414. [DOI] [PubMed] [Google Scholar]
- 97. Antalek B., Concepts Magn. Reson. 2002, 14, 225. [Google Scholar]
- 98. Barjat H., Morris G. A., Smart S., Swanson A. G., Williams S. C. R., J. Magn. Reson. Ser. B 1995, 108, 170. [Google Scholar]
- 99. Stejskal E. O., Tanner J. E., J. Chem. Phys. 1965, 42, 288. [Google Scholar]
- 100. Sinnaeve D., Concepts Magn. Reson. A 2012, 40A, 39. [Google Scholar]
- 101. Connell M. A., Bowyer P. J., Bone P. A., Davis A. L., Swanson A. G., Nilsson M., Morris G. A., J. Magn. Reson. 2009, 198, 121. [DOI] [PubMed] [Google Scholar]
- 102. Morris K. F., Johnson C. S., J. Am. Chem. Soc. 1993, 115, 4291. [Google Scholar]
- 103. Nilsson M., Connell M. A., Davis A. L., Morris G. A., Anal. Chem. 2006, 78, 3040. [DOI] [PubMed] [Google Scholar]
- 104. Stilbs P., Paulsen K., Griffiths P. C., J. Phys. Chem. 1996, 100, 8180. [Google Scholar]
- 105. Stilbs P., Eur. Biophys. J, Biophys. 2013, 42, 25. [DOI] [PubMed] [Google Scholar]
- 106. Andersson C. A., Bro R., Chemometrics Intell. Lab. Syst. 2000, 52, 1. [Google Scholar]
- 107. Provencher S. W., Comput. Phys. Commun. 1982, 27, 229. [Google Scholar]
- 108. Hinton D. P., Johnson C. S., J. Phys. Chem. 1993, 97, 9064. [Google Scholar]
- 109. Chen A., Wu D. H., Johnson C. S., J. Am. Chem. Soc. 1995, 117, 7965. [Google Scholar]
- 110. Delsuc M. A., Malliavin T. E., Anal. Chem. 1998, 70, 2146. [Google Scholar]
- 111. Ambrosone L., Ceglie A., Colafemmina G., Palazzo G., Phys. Chem. Chem. Phys. 2002, 4, 3040. [Google Scholar]
- 112. Armstrong G. S., Loening N. M., Curtis J. E., Shaka A. J., Mandelshtam V. A., J. Magn. Reson. 2003, 163, 139. [DOI] [PubMed] [Google Scholar]
- 113. Sebastiao R. C. O., Pacheco C. N., Braga J. P., Pilo‐Veloso D., J. Magn. Reson. 2006, 182, 22. [DOI] [PubMed] [Google Scholar]
- 114. Day I. J., J. Magn. Reson. 2011, 211, 178. [DOI] [PubMed] [Google Scholar]
- 115. Urbańczyk M., Bernin D., Koźmiński W., Kazimierczuk K., Anal. Chem. 2013, 85, 1828. [DOI] [PubMed] [Google Scholar]
- 116. Xu K. P., Zhang S. M., Anal. Chem. 2014, 86, 592. [DOI] [PubMed] [Google Scholar]
- 117. Roding M., Williamson N. H., Nyden M., J. Magn. Reson. 2015, 261, 6. [DOI] [PubMed] [Google Scholar]
- 118. Zhou X. Y., Xu K. P., Zhang S. M., J. Magn. Reson. 2015, 252, 114. [DOI] [PubMed] [Google Scholar]
- 119. Urbańczyk M., Bernin D., Czuron A., Kazimierczuk K., Analyst 2016, 141, 1745. [DOI] [PubMed] [Google Scholar]
- 120. Hansen P. C., Numer. Algorithms 1994, 6, 1. [Google Scholar]
- 121. Hansen P. C., Numer. Algorithms 2007, 46, 189. [Google Scholar]
- 122. Kowalewski J. M., Maler L., Nuclear Spin Relaxation in Liquids: Theory, Experiments, and Applications, CRC Press, Boca Raton, FL: 2006. [Google Scholar]
- 123. Levitt M. H., Spin Dynamics: Basics of Nuclear Magnetic Resonance, 2nd ed., Wiley, Chichester, UK: 2008. [Google Scholar]
- 124. Keeler J., Understanding NMR Spectroscopy, 2nd ed., Wiley, Chichester, UK: 2010. [Google Scholar]
- 125. Gilard V., Trefi S., Balayssac S., Delsuc M. A., Gostan T., Malet‐Martino M., Martino R., Prigent Y., Taulelle F., in NMR Spectroscopy in Pharmaceutical Analysis, (Eds: Holzgrabe U., Wawer I., Diehl B.), Elsevier Science Bv, Amsterdam: 2008. 269. [Google Scholar]
- 126. Vold R. L., Waugh J. S., Klein M. P., Phelps D. E., J. Chem. Phys. 1968, 48, 3831. [Google Scholar]
- 127. Markley J. L., Horsley W. J., Klein M. P., J. Chem. Phys. 1971, 55, 3604. [Google Scholar]
- 128. Carr H. Y., Purcell E. M., Phys. Rev. 1954, 94, 630. [Google Scholar]
- 129. Meiboom S., Gill D., Rev. Sci. Instrum. 1958, 29, 688. [Google Scholar]
- 130. Aguilar J. A., Nilsson M., Bodenhausen G., Morris G. A., Chem. Commun. 2012, 48, 811. [DOI] [PubMed] [Google Scholar]
- 131. Smilde A., Bro R., Geladi P., Multi‐way Analysis: Applications in the Chemical Sciences, John Wiley & Sons, Chichester, UK: 2004. [Google Scholar]
- 132. Pedersen H. T., Bro R., Engelsen S. B., J. Magn. Reson. 2002, 157, 141. [DOI] [PubMed] [Google Scholar]
- 133. Dyrby M., Petersen M., Whittaker A. K., Lambert L., Norgaard L., Bro R., Engelsen S. B., Anal. Chim. Acta 2005, 531, 209. [Google Scholar]
- 134. Pedersen H. T., Dyrby M., Engelsen S. B., Bro R., Ann. Rep. NMR Spectro. 2006, 59, 207. [Google Scholar]
- 135. Acar E., Nilsson M., Saunders M., in Signal Processing Conference (EUSIPCO), 2014 Proceedings of the 22nd European, 2014, pp. 111. [Google Scholar]
- 136. Savorani F., Tomasi G., Engelsen S. B., J. Magn. Reson. 2010, 202, 190. [DOI] [PubMed] [Google Scholar]