Abstract
Metabolomics is a rapidly growing multidisciplinary field with ever increasing demand and usability, which is attracting a surge of new researchers. While their varied skill sets, scientific questions, and approaches enrich the field with fresh perspectives and innovation, individual investigators also bring wide-ranging levels of metabolomics-specific experience and diverse areas of interest. These factors introduce considerable variability and inconsistency in both the methodology and reporting. A recent comparative literature review of nuclear magnetic resonance (NMR) metabolomics from studies published in 2010 and 2020 revealed significant shortcomings in the reporting of experimental details necessary for evaluating both the scientific rigor and the reproducibility of NMR-based metabolomics experiments. Each stage of metabolomics research contains multiple methodological choices and various optimization parameters, all of which can introduce experimental bias and alter the study results. This emphasizes the need for proper reporting to enhance reproducibility, data reusability, and study comparability. To address these concerns, the NMR Special Interest Group within the Metabolomics Association of North America presents reporting recommendations focused on fundamental aspects of NMR metabolomics research identified from the detailed literature review report. These include specifics with respect to study design, sample preparation, data acquisition, data processing and analysis, data accessibility, and comparability to previous studies. Also presented is a complementary list of seminal papers in the field to guide the study design and implementation of NMR metabolomics experiments. This initiative seeks to enhance the long-term impact of NMR metabolomics by supporting high-quality, reproducible, and impactful data collected from well-executed and thoroughly reported studies.
1. Introduction
Over the course of the past 25 years, the field of metabolomics has expanded in conjunction with many analytical and methodological advancements. These advancements, along with the establishment of numerous academic societies and consortia, − and metabolomics databases and repositories have led to the rapid growth of metabolomics-based research. Although this growth underscores the broad utility of metabolomics, it has also brought together researchers of various subdisciplines and training backgrounds, increasing methodological variability within the field. , These factors necessitate the establishment of standardized best practices, particularly for the standardization of reporting metrics, to improve interpretability and comparability across the field of metabolomics.
Efforts to establish reporting standards in both nuclear magnetic resonance (NMR)- and mass spectrometry (MS)-based metabolomics have been initiated by several organizations, including Metabolomics Standards Initiative (MSI), Coordination of Standards in Metabolomics (COSMOS), Metabolomics Association of North America (MANA), and Metabolomics Quality Assurance & Quality Control (QC) Consortium (mQACC). ,− The adaptation of recommended reporting standards, however, has been inconsistent. , For instance, a large-scale, NMR-specific literature review that quantitatively assessed methodological reporting patterns in the metabolomics literature revealed minimal or near-absent reporting of many fundamental parameters needed to properly describe NMR-based metabolomic studies. The challenge lies in the fact that metabolomics is a rapidly evolving field that requires a diversity of technical expertise and analytical skills. These are often difficult to acquire within a single research group. Further, the low barrier to entry in the field and the absence of widely distributed and well-vetted best practices have contributed to laboratory-specific methodologies, which limit the reproducibility and reuse of metabolomics data for other investigators. Metabolomics necessitates continuous, collaborative communication among researchers to promote and adopt best practices including reporting norms embraced by the research community. For example, enforcement of specific reporting criteria (e.g., by journals, funding agencies, or academic societies) has proven difficult in such a dynamic field where the diversity of metabolomics applications that span various biological contexts require standard practices tailored to individual uses. The inconvenience of detailed reporting leads to over-reliance on citing previous publications for methods without providing full experimental details. This practice often omits critical information, further challenging the widespread adoption of reporting guidelines. , The lack of properly validated and widely adopted best practices is likely contributing to the reproducibility crisis in metabolomics. − Thus, the establishment of community-adopted best practices and reporting standards would help to address reproducibility concerns and enhance the value of metabolomics studies.
NMR-based metabolomics data can be analyzed in two fundamentally different ways. The first approach is a purely quantitative method that uses spectral deconvolution to quantify a predefined set of metabolites in the sample prior to downstream statistical analysis. The second approach employs statistical methods (e.g., principal component analysis, partial least squares discriminant analysis (PLS-DA), and statistical total correlation spectroscopy) to identify spectral features and utilizes relative quantification to highlight metabolites that exhibit distinct differences between classes or cases, which may be considered semiquantitative. Quantitative studies typically produce lists of positively identified and accurately quantified compounds and are usually done using commercial programs or company services (e.g., Chenomx, FoodScreener, In Vitro Diagnostics Research (IVDr), and Nightingale Health). These methods use internally added standards and strictly defined sample preparation/acquisition protocols and are limited to a select number of sample types (serum, plasma, urine, and wine).
The term “profiling” has been used variably in the metabolomics literature, which highlights the further need for standardized terminology usage in the field. For example, profiling and “fingerprinting” are routinely and incorrectly used interchangeably. To avoid ambiguity and promote consistent usage of terminology, profiling is defined herein as an NMR workflow that uses a single internal standard for chemical shift reference and then applies various multivariate statistical techniques for feature selection and individual metabolite identification, which is commonly followed by relative or absolute quantification and comparison of these individual metabolites through univariate analysis. Conversely, fingerprinting simply relies on the entire NMR spectral data set with multivariate statistical analysis to discriminate between two or more biological groups without the identification of individual metabolite changes. As quantitative profiling and semiquantitative profiling or fingerprinting approaches follow different workflows, their reporting requirements also differ. Therefore, our reporting recommendations consider the specific methodologies used in the NMR-based metabolomic studies.
This manuscript serves as a direct follow-up to a previously published review of the NMR metabolomics literature that evaluated both differences and scope in reporting in 2010 versus 2020. The text of this paper highlights and discusses some significant reporting deficiencies identified from that 2024 literature review and expands on additional topics that are insufficiently reported in the literature. Accompanying this paper is a reporting recommendations table that can be utilized as a guide for designing experiments as well as for writing and reviewing metabolomics manuscripts. We also provide a list of seminal references in the Supporting Information that demonstrates the current consensus on the implementation and reporting of experimental parameters in a metabolomics study. The table, text, and Supporting Information have been arranged into five reporting categories that we believe are integral for evaluating study integrity and repeatability: (1) study design, (2) sample preparation, (3) data acquisition, (4) data processing and analysis, and (5) data accessibility and comparability to previous studies. With these reporting categories in mind, researchers can use the table and the references as a template for developing a thorough, well-planned, and properly implemented NMR metabolomics experiment. The overarching goal of this work is to establish minimum reporting criteria in NMR-based metabolomics studies, which will help generate well-executed studies that have the potential to significantly enhance the long-term value of reproducible and meaningful NMR-based metabolomic data. Ultimately, this will help propel progress and discovery in the field of metabolomics research.
2. Reporting Recommendations and Guidelines
2.1. Study Design
Comprehensive reporting of the experimental design is crucial for providing the necessary context that allows readers to accurately interpret a study, making it essential for the dissemination of research results. A complete description of the overarching goals and specific objectives should form the foundation of a properly reported study, yet fewer than 50% of studies published in 2010 and 2020 reported a clearly stated research hypothesis. This low reporting rate stems in part from field-specific factors, highlighting the ongoing challenges in establishing hypothesis-driven research in NMR metabolomics.
The nature of hypotheses that can be reported in NMR metabolomics studies is determined by the goals set during the planning and development of the study, which occurs prior to the setup of experiments. For example, when experimental questions focus on predetermined sets of metabolites or metabolic pathways (i.e., targeted design), all research hypotheses to be tested should be reported. In contrast, when well-defined phenotypes of interest are compared without prior knowledge of relevant metabolic interactions, a general metabolome assessment can be conducted to form broad conclusions (e.g., compare metabolomes of clinical phenotypes) with the goal of generating specific, testable hypotheses (e.g., about cellular mechanisms), which were not previously feasible (i.e., untargeted design). − In such cases, these exploratory aims should be clearly reported in the absence of more traditional hypotheses to avoid any ambiguity.
Due to their complementary nature, untargeted studies should ideally be followed by targeted studies in sequence as a study system becomes more resolved. ,, Although exploratory studies bring about identification and discovery of undescribed metabolites, researchers should be cautious about designing exploratory studies that simply aim to report detectable metabolites in understudied systems. Such efforts do not necessarily lead to insightful research and often provide minimal utility to the wider research community, especially as the field transitions from hypothesis generation to mechanistic insights into metabolic functions. , Quantitative NMR studies almost exclusively utilize targeted designs due to methodological constraints (e.g., selection of internally added standards), while profile NMR metabolomics studies can be either targeted or untargeted depending on study design. Additional study designs that may lack traditional hypotheses, such as method and technical papers, should clearly state their objectives similar to the recommendations for untargeted designs described above.
Sample size is a critical factor in a study design, representing the total number of data points in the evaluation of the study objective. Sample numbers greatly impact the ensuing protocols. For example, there is a vast difference between processing a few dozen samples compared with processing hundreds or thousands of samples. A larger number of samples are always desirable to achieve the required statistical power and to obtain meaningful outcomes, especially considering the high-dimensionality of metabolomics data (i.e., observation of more experimental features than samples). − The type of sample also dictates the number required; therefore, there are no clear rules; only guidelines exist. Researchers should assess the expected biological variability of their samples before determining the appropriate sample size. For example, well-defined samples with modest biological variability (e.g., cell cultures, plant tissues, food and beverage products) require fewer biological replicates compared to more complex biological samples (e.g., animal- and human-derived materials), especially in cases where confounding factors (e.g., local environment, diet, age) cannot be rigorously controlled (see Supporting Information Table S2).
Despite its intrinsic value, the use of analytical replicates was rarely reported in the NMR metabolomics studies previously reviewed, likely due to the high innate reproducibility (i.e., coefficients of variance, CVs ≤ 5%) of NMR data, yet analytical replicates can assess variance in sample preparation and instrumental analysis. It is difficult to justify the value of including analytical replicates due to the added cost and time, especially for studies involving hundreds to thousands of biological replicates. Of course, numerous practical considerations tend to limit the number of samples that are either available for a metabolomics study or can be properly handled and processed. Based on the total number of samples used (median of 40) and the number of biological replicates per group (median of 10) reported in the 2024 literature review, most metabolomics studies examined, especially clinical studies, seemed to rely on too few biological replicates to obtain statistical significance. This lack of statistical power may contribute to a perceived reproducibility problem in the field. − , In response, the metabolomics community is working toward establishing recommendations on the minimum number of biological replicate samples for a reliable metabolomics study (Supporting Information Table S2). These minimum recommendations must take into account the type of samples being analyzed and the study goals.
The experimental design itself largely determines the overall quality of a study. It is thus crucial to clearly report the study objectives and/or hypotheses, and to critically assess these goals to ensure that the proposed work and subsequent methods not only support one another but are also achievable. , The rapid response of the metabolome to both internal and external factors is part of the widespread allure of metabolomics; however, as a fundamentally comparative experimental design, it also necessitates appropriate and well-defined biological controls and a randomized sample-run order to minimize bias in both sample preparation and data acquisition steps, regardless of the study size. The most rigorous checkpoint for the experimental design should occur prior to the initiation of the study, as afterward, many controllable study variables become entrenched. When reporting findings, it is important for researchers to explain their experimental design in detail (see Tables and Supporting Information S1) so readers can accurately assess the ability of the study to achieve the stated experimental goals.
1. Reporting Recommendations for NMR Metabolomics.
| Section | Category | Subcategory | Information to report |
|---|---|---|---|
| Introduction | 1. study design | objectives | scientific hypothesis and study goals clearly stated |
| Methods | 2. sample preparation | sample information | appropriate metadata for reproducing study |
| sample pretreatment | collection and storage conditions | ||
| sample processing | extraction or treatment of samples clearly detailed | ||
| sample scheme | order in which samples were extracted; number of batches; how treatments were divided among batches | ||
| QC samples | use and treatment of blanks, technical replicates, pooled QCs, and SRMs | ||
| experimental parameters | sample pH; sample temperature; buffer; number of samples, groups, and replicates per group (biological and analytical); reference compound; spectrometer vendor, model, and frequency; probe type/vendor; experiment conducted with specific pulse sequence; inclusion of presaturation pulse; sample order and acquisition on any QC samples (buffer blanks, extraction blanks, pooled QC, SRMs) | ||
| 3. data acquisition | spectral data processing parameters | software used for processing spectra; baseline correction method; phasing (automated or manual); spectral region removal of solvent/buffer peaks; choice of window function; application of zero-filling; line broadening (or any other raw data manipulation); bucketing | |
| 4. data processing and analysis | data manipulation | normalization; centering; transformation; scaling | |
| statistical methods | statistical software(s) used; unsupervised and supervised multivariate statistics utilized; univariate statistics utilized | ||
| annotation | type of NMR experiments used for metabolite identification and validation, especially in complex mixtures; samples used for metabolite assignment | ||
| spectrometer performance | stability of 90° pulses; water suppression efficiency; signal-to-noise values on standard samples; system suitability test on standard sample (i.e., sucrose) | ||
| Results | 3. data acquisition | metabolite assignment and quantification | number and list of identified and quantified metabolites; level of assignment for each metabolite |
| 4. data analysis | statistical validation | minimal fold change; Student’s t statistic; p-value; degrees of freedom; false discovery rate or other multiple hypothesis correction; quality values R 2 and Q 2 of supervised multivariate models; multivariate model validation (e.g., CV-ANOVA, permutation test); receiver operating characteristic (ROC) curve-cross validation, sensitivity, and specificity | |
| QA/QC validation | performance results of the system suitability test; QC precision within and across batches (spectral median RSD; RSDs for each compound); SRM performance metrics; variability | ||
| 5. data accessibility | data accessibility | metabolomics data deposition; metadata availability | |
| Discussion | 6. data comparability | study comparison and reporting | compared identified metabolites with prior literature, where applicable |
2.2. Sample Preparation
Due to the labile nature of metabolites, the pretreatment of samples, including sample collection and storage protocols, must be appropriately designed and implemented to minimize technical variability and to avoid nonbiologically relevant changes in metabolite patterns. These considerations remain true across the metabolomics field and are not analytical platform specific. Rather, the type and number of unique specimens and the research question should drive sample collection, storage, and handling approaches. It is well-known that a combination of collection, handling, and storage parameters will impact the chemical composition of the sample. Pretreatment and postprocessing considerations range from spatial location of the specimen to time of day for collection and include various sample transfer and storage options, such as temperature, duration, and chemical treatment. Failure to disclose such vital information is a barrier to successfully reproducing metabolomics experiments. For example, metabolites in blood, which is the most widely collected biofluid, are influenced by the type of collection tube (e.g., no additive, 3.2% sodium citrate, potassium EDTA, lithium-heparin, etc.) used during sampling, , and the sample processing method (e.g., centrifugation, mixing, temperature, standing time, number of freeze–thaw cycles, etc.) postcollection. , Challenges also arise with spatial and temporal sample collections. For example, variations may occur among different cells within a leaf or across the whole plant (stem, root, leaf) depending on the time of day and photosynthesis stage.
Thoroughly documenting and reporting the pretreatment of samples is critical for repeatability, study comparison, and future growth in the field because sample collection and handling protocols are not universally agreed upon and remain undefined for many nonmodel species. Recommendations on sample collection and handling protocols for each type of biospecimen are beyond the scope of this article and will require additional efforts to recognize and disseminate best practices for specific sample types. We refer readers to a preanalytical and biobanking review and other relevant studies. − for detailed insights into sample collection and biobanking. Additional references to protocols regularly utilized by the community for the collection and handling of various biofluids and tissue samples are also provided in Supporting Information Table S2.
A lack of unified protocols also holds true for sample processing and metabolite extraction steps, which must be carefully defined prior to undertaking the project since the study goals and research questions should define the methodological approach. It is well-known that the method used for metabolite extraction can greatly influence the results by modulating the metabolomes of interest via the chemical environment. For example, the type, number, and polarity of the solvent(s) used for metabolite extraction can affect both metabolome coverage and data quality. When multistep extractions and multiple solvents are used, the order and steps in which the solvents are added can affect metabolite partitioning between solvent phases, ultimately affecting both reproducibility and overall extraction yield. There are many metabolite extraction protocols reported in the scientific literature that cover a variety of biological sample types commonly encountered in metabolomics studies (Supporting Information Table S2). Of course, the availability of several distinct extraction protocols per sample type can contribute to the confusion and lack of standardized methods. Implementing metabolite extraction protocols across laboratories may enable harmonization of metabolomics data.
Methodological details of the protocol used should be clearly defined within the paper rather than referencing a protocol for the whole methodology. In the 2024 review of NMR metabolomics papers, a common outcome was an inability to locate the original protocols. Instead, it was found that readers were often directed to references within references, which led to ambiguities or the complete absence of knowledge about the methodology used. Software tools, such as protocols.io, can greatly simplify the reporting, sharing, and updating of protocols by associating a protocol with a digital object identifier. Protocols can then be used directly by other researchers or “forked” to modify a given protocol while maintaining its history. We encourage metabolomics researchers to adopt a method to help standardize NMR metabolomics sample preparation, study design, and even data collection and processing steps.
The inclusion of QCs, especially in MS-based metabolomics workflows, is widely recognized as important, as evident by a majority of the community (83%) self-reporting the use of pooled materials in their metabolomics projects in a 2017 questionnaire. Despite their importance in MS-based metabolomics, QC methods for NMR were reported at an extremely low rate in the literature (2% of 2010 papers, 17% of 2020 papers), including multiple specific categories (e.g., pooled QCs, buffer, and extraction blanks) that were nearly entirely absent from the 2010 literature. It is likely that the discrepancy between the 2017 questionnaire and the 2024 NMR literature review can partly be explained by the fact that most NMR facilities perform routine quality checks (temperature calibration, shimming, signal-to-noise ratio (S/N) testing, and tuning on external reference materials) that are not included in the final manuscripts. The quantitative approach does not require QC samples due to the strict protocols and inclusion of specific internal standards, so laboratories that use quantitative NMR methods are unlikely to report QC data. Given that approximately 50% or more of NMR metabolomic studies fall into this category, this alone may explain the dearth of QC reporting for NMR as noted in the 2024 literature review. On the other hand, profile NMR metabolomics studies do require QC samples to help address variation from experimental (e.g., identification of contaminants, alignment of peaks) and instrumental (e.g., correct for instrument drift and changes in instrument sensitivity or probe tuning) sources. Therefore, these types of NMR studies require rigorous QA/QC and should report QC methods and data. Certainly, if this was done more frequently, then the quality of data being published by the metabolite profiling NMR community would be greatly enhanced.
For profile NMR metabolomics studies, reporting the use of QCs during sample pretreatment demonstrates that measures were taken to ensure the quality of the study and resulting data. Better yet, if these types of NMR studies reported the scope of use of their QCs (e.g., extraction efficiency, contamination, batch monitoring/correcting, two-dimensional (2D) annotation, etc.) based on definitions accepted by the community, it would prevent readers and reviewers from having to interpret and assume how the QC samples were being used in the study. Members of the metabolomics Quality Assurance and QC Consortium (mQACC) have defined the roles for each QC in a profile NMR metabolomics workflow, especially for MS-based metabolomics. These include the minimal and best practices on the reporting of QCs, and how to predefine the scope of use for all QC samples. This includes analysis of blanks, buffer blanks, pooled QCs, and certified reference materials (CRMs). , To this point, the validity of a metabolite profiling study can be determined only if QC methods and data are demonstrated or presented in the paper or associated repository. Reporting QC results is discussed further in the Section Data Processing and Analysis.
2.3. Data Acquisition
NMR spectrometer specifications, reconstitution buffers, and experimental parameters largely influence the quality of the resulting NMR spectra. Thus, these variables must be prominently defined to allow for proper interpretation and comparison of metabolomics data between studies. In fact, properly describing the data acquisition parameters is so fundamental to disseminating metabolomics research results that any reporting rates below 100% are a cause for concern. Yet, even a straightforward parameter like sample temperature was severely underreported across the literature in the 2024 literature review (reported in <75% of 2020 papers) despite its importance in influencing the chemical shift values of NMR signals. Again, some of this underreporting may be due to the widespread use of quantitative NMR metabolomics methods, where the protocols are strictly defined and largely identical from one study to the next.
Reporting rates for the inclusion of one-dimensional (1D) or 2D NMR experimental parameters were more encouraging (97.5% and 89%, respectively, in 2010 papers; 96% and 93%, respectively, in 2020 papers); however, these metrics do not describe the extent to which experimental details were included. For example, specific NMR pulse sequences used for 1D and 2D NMR experiments were reported less frequently than reported inclusion rates in both 2010 papers (86% and 76%, respectively) and 2020 papers (85% and 79%, respectively), which is of concern for profile NMR metabolomics studies. Due to the wide array of potentially suitable 1D and 2D NMR experiments for metabolite profiling, reporting specific and complete experimental details is required to fully convey the proper and reproducible detection of spectral data. On the other hand, because quantitative NMR studies use only a small number of very specific pulse sequences (1D nuclear Overhauser effect spectroscopy or variants thereof), the issue of pulse sequence variability is less of a concern. Reported compositions of the NMR buffers presented a similar issue. For example, the chemical shift reference compound included in the NMR buffer was very frequently reported (99% in 2010 papers; 92% in 2020 papers), while notably fewer studies reported the buffer type (81% in 2010 papers; 74% in 2020 papers), pH (56% in 2010 and 2020 papers), and concentration of buffer components (49% in 2010 papers; 52% in 2020 papers). Sample pH, osmolality, buffer and solvent type, and temperature all influence NMR chemical shifts, line widths, and spectral splitting patterns (especially for compounds with ionizable functional groups). Thus, providing a complete description of the chemical shift standard, buffer composition and sample conditions is necessary to achieve accurate annotations, harmonize data, and to generate reliable results enabling comparisons across studies. ,
The reviewed metabolomics papers well reported the total number of samples analyzed (96% in 2010 papers; 83% in 2020 papers) but was not as effective in fully describing the complete set of samples used (e.g., number of replicates per group, QC samples, etc.). Based on these findings, we reinforce and concur with previous recommendations , and urge NMR metabolomics studies to include a table or a figure that describes the complete set of experimental and QC samples (if required) and the cohort composition. Simply put, it should not be the responsibility of the reader to ascertain the overall study design and sample size by counting the number of symbols in the data plots, for example. In addition to the low reporting rates of QC samples (described in detail in Section ), the results necessary for demonstrating optimal spectral quality for metabolite profiling studies were also underreported. For example, metrics related to spectrometer performance and reproducibility (e.g., water suppression efficiency, spectral relative standard deviation of pooled QCs, etc.) were not assessed in the literature review, nevertheless, they should be reported to provide transparency regarding data quality. ,
Some parameters (e.g., buffer type, chemical shift reference, number of samples, etc.) were reported less frequently in 2020 compared to 2010 papers. This decrease over time suggests that reporting expectations for basic data acquisition parameters may have eased in the past decade, perhaps from a tendency for methods sections to focus on the increasingly elaborate and diverse data processing and analysis procedures used in NMR metabolomics. In this context, we urge the field of NMR metabolomics to re-emphasize the importance of reporting of foundational data acquisition parameters outlined in Tables and Supporting Information S1, thus allowing readers to confidently interpret the presented results. In specific instances where one of the recommended parameters listed in Tables and Supporting Information S1 may not be relevant or not applicable (NA) (e.g., sample pH when using organic solvents), it should be explicitly stated as such in the text and/or in relevant tables and figures rather than being omitted. It is also important to recognize that a growing amount of metabolomics research is being generated by commercial vendors, instead of individual investigators. In such cases, we recommend that the companies that provide such services clearly report all relevant experimental details needed to accurately and reliably reproduce results, such as the types of NMR experiments performed and the acquisition and processing parameters.
2.4. Data Processing and Analysis
NMR data collected from an instrument can be processed and manipulated using various mathematical functions to enhance S/N, resolution, and sensitivity and to correct baselines and phase spectra. For quantitative NMR studies, these spectral transformations are strictly defined and are usually part of a well-defined protocol or menu of functions offered by data processing software, such as Chenomx NMRSuite, IVDr, and FoodScreener; as well as academic programs such as BATMAN, Bayesil, and SAND. Ideally these processing details should be provided in the method descriptions to ensure broad reproducibility. However, for some programs and certain commercial providers, this information is not available to the user, thereby limiting the reproducibility of the result.
For NMR metabolite profiling studies, the variability of data processing methods is much greater, and therefore much more information about data processing must be reported. For example, ensembles of spectra must be aligned and integrated, and solvent or buffer peaks are often removed from spectral regions to avoid their impact on downstream data analysis. When this is done, it is crucial to report the type of mathematical functions used in transforming the NMR spectra into a format employed to quantify and identify the metabolites. Failure to do so might compromise reproducibility, data utilization, and data preservation. For instance, studies show the approaches used in processing NMR spectra, such as line broadening factors, zero filling and window functions, baseline correction methods, and removal of noise affecting data interpretation. Therefore, it is important to report all of the processing methods that have been applied to the NMR spectra.
Regardless of whether the data are acquired from a quantitative NMR study or a profiling NMR metabolomics study, normalization, transformation, and scaling are critical data processing steps to prepare metabolomics data for statistical analysis. However, the 2024 literature review found that only about 60% of papers published in 2010 and 2020 reported a normalization method, a critical step for making data comparable across samples. Reporting on data scaling was even lower, appearing in only about 45% of papers. This may stem from confusion regarding the need for both data normalization and data scaling and the inherent difference between these two methods. To clarify, normalization corrects for differences in overall concentration (e.g., dilution factor) between samples. On the other hand, data scaling accounts for large dynamic range differences between individual spectral signals and prevents intense signals from dominating statistical models and negating biologically important changes in lower intensity peaks. There are various approaches available for the manipulation of metabolomics data to enhance feature selection. Different methods are recommended depending on the specific sample types. For instance, probabilistic quotient normalization, which estimates the most probable dilution factor by comparing the distribution of intensity quotients to a reference spectrum, is considered robust for analyzing urine samples. Similarly, different scaling approaches have been shown to affect the outcome of the analysis. The primary consideration in choosing an appropriate scaling or normalization technique is to ensure an accurate classification of the specific sample type into different metabophenotypes, a distinct pattern of metabolite levels and types in a sample that correlates with a particular phenotype, and to achieve the reproducible identification of spectral features that distinguish sample groups. Since different combinations of scaling and normalization methods can yield varied results, it is essential for researchers to report each step in the data processing pipeline to facilitate the harmonization of data retrieved from public databases and to ensure study reproducibility and reliability.
Many researchers in the NMR metabolomics community do not have extensive experience in biostatistics or statistical modeling, which can result in an overreliance on default software settings or adopting incorrect literature protocols because of a false impression of an established precedent. Thus, a common occurrence across many metabolomics studies is the failure to thoroughly validate models. For instance, supervised analysis, such as PLS-DA and orthogonal PLS-DA (OPLS-DA), is widely used for multivariate analysis of metabolomics data sets. These techniques, however, require validation before interpreting the variables that contribute to the observed group differences. Encouragingly, there has been a positive trend in reporting R 2 and Q 2 values, with reporting rates increasing from 30% in 2010 to 63% in 2020. Unfortunately, this increase may have been driven, in part, by a common misunderstanding of the meaning of R 2 and Q 2 values. These parameters are often incorrectly used to infer validity of a model. Instead, R 2 only provides a measure of the quality of fit, and Q 2 identifies the inherent variability or stability of the model following a leave-one-out or, preferably, leave-n-out analysis. For model validation and to avoid an overfitted model, it is essential to report metrics such as a permutation test or cross-validation analysis of variance (CV-ANOVA) p-values. Adhering to best practices also includes ensuring R 2 > Q 2, Q 2 ≥ 0.4, and Q 2 is within 20% of R 2 to minimize an overfitted model. Further, negative R 2 and Q 2 values indicate an invalid model, and the resulting model should never be presented as useful or analyzed for biological significance. Multivariate statistical models are routinely interpreted by a visual inspection of the relative group clustering in a scores plot, which can be inaccurate and lead to bias outcomes. Instead, a few simple quantitative methods can be applied to scores plots to avoid any such errors. Group membership can be defined by ellipses that correspond to the 95% confidence limit of the normal distribution of each cluster. Group overlap or separation can then be statistically defined by within-group and between-group Mahalanobis distances and a univariate analysis (i.e., p-value) to assess the relative similarity of these distances.
Univariate analyses, including t-tests and ANOVA, are also frequently applied to compare group means and variances. Given the large number of metabolites analyzed in metabolomics studies, corrections for multiple hypothesis testing are vital to control the false discovery rate (FDR). Findings from the 2024 literature review paper indicate that FDR reporting remains inadequate, with only 14% of studies in 2010 and 34% in 2020 addressing this crucial requirement. Researchers should routinely report FDR-adjusted p-values to minimize false positives (type I errors). It is also important for researchers to explicitly note the method used for such corrections to improve the reliability and reproducibility of their results. , Reporting both uncorrected and FDR-corrected p-values to address concerns with false negative rates (type II errors) is acceptable.
In addition to reporting proper metrics and correcting for multiple hypothesis testing, NMR metabolomics studies that rely on metabolite profiling must also report QC results. These must be assessed to appropriately define the technical variability within a study and to validate the biological significance of a model. For QCs, reporting their use alone is not sufficient. Kirwan et al. (2022), on behalf of mQACC, provided valuable guidance on acceptance criteria and precision of QC analysis for NMR metabolite profiling studies. Specifically for these types of studies, reporting should ideally include the spectral median standard deviation for CRMs, pooled QCs, and experimental groups. Any assessment between batch precision and batch precision, especially if corrections are applied, should also be reported. Demonstrating the extent of reproducibility through reporting is necessary for all NMR metabolite profiling studies, especially when conclusions are drawn about the relative concentration changes of specific metabolites in different phenotypic or health states.
The manner in which metabolite assignment is achieved and reported can also impact accuracy, rigor, translation to other studies, and adoption of new methods. A recent human urine metabolome study clearly illustrated this ongoing metabolite annotation challenge when it demonstrated assignment errors that resulted in metabolites being mistaken for other compounds. These inaccuracies were evident as the reported human concentrations were well below the detection limit of NMR. In addition, some of these inaccurate annotations were apparent due to metabolite insolubility in water. Reporting the extent of metabolite identification and describing previously cited examples in the literature may help to prevent compound misidentifications. Unfortunately, the level of assignment for each metabolite proposed by the MSI Chemical Analysis Working Group , does not appear to be well-documented in the literature and was also not recorded in the previous literature review, likely due to too few instances of reporting. Along these lines, the 2D NMR experiments necessary to reach the optimal annotation level (MSI level 1) were only utilized about 45% of the time and did not improve from 2010 to 2020. This low level of reporting may also reflect the frequency of quantitative NMR studies, which do not require 2D NMR experiments to confirm their metabolite identifications. Journals requiring information concerning the level of metabolite assignments combined with a clearer definition of MSI levels specific to NMR (especially for NMR metabolite profiling studies) would likely encourage wider adoption of the practice by metabolomics investigators. More importantly, it would lead to a greater level of metabolite annotation confidence.
While quantitative NMR studies follow very consistent practices in compound nomenclature (because of the uniformity of the software), the same nomenclature consistency is not seen in NMR metabolite profiling studies. Nomenclature standardization is currently an urgent need that requires serious consideration by the metabolomics community. Broad use of a standard nomenclature would facilitate the cross-referencing of results across multiple NMR metabolite profiling studies. Unfortunately, the 2024 literature review did not record whether the studies reported more than one name per metabolite or if the study leveraged standard metabolite identifiers. Ambiguous nomenclature continues to occur in interlaboratory studies and greatly impedes attempts to compile and harmonize metabolomics data.
As previously stated, efforts in nomenclature standardization suggest a minimum of two types of naming conventions to be included in a metabolite annotation list. , Ideally, this would include one structural code (e.g., InChI, SMILES) and one chemical name (e.g., IUPAC, common name). Among identifiers derived from structural information, the InChIKey is generally the preferred form for translation purposes due to its length and suitability for indexing and searching as it is a unique ID. Canonical SMILES is also often used to both define structure and act as a unique ID. In contrast, InChI and SMILES strings can be written in numerous ways to represent the same molecule. However, none provide unique naming for protons, which is essential for NMR studies. d-Glucose alone has 25 synonyms for common names in the Human Metabolomics Database (HMDB, https://hmdb.ca/), clearly illustrating the challenge and need for more accurate structural identifiers. Chemical formula and database ID numbers (e.g., HMDB, Kyoto Encyclopedia of Genes and Genomes [KEGG, https://www.genome.jp/kegg/], etc.) provide additional sources of information that can improve data searching and harmonization. The inclusion of database ID numbers can not only aid in quick database searches but also provide further resolution of potentially ambiguous chemical structures and names. Chemical taxonomy-based tools, such as ClassyFire and ChemFOnt, enable standardized chemical structure and function classification, and this information also helps with automated comparisons with previous studies. ,
An additional challenge in nomenclature is the reporting of stereochemistry without conducting further experiments to confirm the specific conformation of the metabolite as there are currently no clear guidelines for the NMR metabolomics community. NMR can identify and distinguish structural isomers, but depending on the complexity of the molecular structure, additional experiments are often necessary to confirm stereochemistry. Metabolite stereochemistry is often assumed without verification, leading to inaccurate reporting and challenges for data harmonization. In such cases, we recommend avoiding differentiating between stereoisomers and instead recommend the reporting of the nonstereoisomeric form. The NMR metabolomics community would greatly benefit from clearly defining standardized criteria for confidently reporting structural isomers.
Of all parameters examined in the literature, including study design, sample preparation, data acquisition, and data processing/analysis, the choice of statistical software and tools for data analysis showed the most notable variation in NMR metabolomics research. Researchers used a wide range of software and databases to process and analyze NMR metabolomics data sets, with over 110 distinct programs being used by the community, together with in-house R or Python scripts. However, many publications reviewed in the 2024 paper failed to provide essential details such as software names, versions, and availability.
To improve reproducibility, researchers are encouraged to include detailed reports of their data analysis methods. For instance, when using MetaboAnalyst for statistical analysis, the R code generated by the platform can be shared in data repositories (e.g., GitHub) or as Supporting Information, which could enable others to replicate the analysis and support the metabolomics field’s progress toward standardization. At a minimum, reporting of the software and version used to process, analyze, and annotate NMR data is needed for data reuse and study reproducibility. Further, to improve transparency and confidence in published metabolomics data, especially results generated by groups with limited prior involvement in the field, it is strongly recommended that reproducibility assessments be reported for a given project. For guidance, several studies have provided practical approaches to include reproducibility assessments in NMR metabolomics that should be broadly followed by the community. − This would include intra- and/or interlaboratory validation efforts, overall instrument and biological variability measurements, and, critically, acknowledgment and comparison to previous results reported in the literature. Unfortunately, it is all too common for metabolomics manuscripts to exclude a proper comparison of their results to prior studies, even though such an analysis would be critical to assess the true reliability and reproducibility of metabolomics data.
2.5. Data Accessibility and Comparability
Ideally, acquired spectra along with sufficient metadata would be deposited in a publicly accessible repository appropriate for NMR metabolomics data to enable the reproducibility of results. The inclusion of QC and reference material spectra allows for data quality assessment, especially in consideration of data reuse and comparability across laboratories even as technology advances. Most commonly used repositories for metabolomics data sets are currently Metabolights (https://www.ebi.ac.uk/metabolights/), the NIH Common Fund’s National Metabolomics Data Repository, formerly known as Metabolomics Workbench (https://www.metabolomicsworkbench.org/), and the Biological Magnetic Resonance Data Bank (http://bmrb.io). According to the literature review, there were no reports of metabolomics data being deposited in an appropriate repository in 2010. This figure increased to a mere 11% by 2020.
The landscape of data sharing has shifted dramatically since 2010, when few journals enforced data deposition. Presently, journals such as Metabolites and Metabolomics strongly encourage this crucial step, and the lack of deposition requires a statement regarding why the data have not been deposited. Despite an overall shift toward FAIR (findability, accessibility, interoperability, and reusability) data principles (https://www.go-fair.org/fair-principles/), the 2020 11% deposition rate leaves vast room for improvement. The lack of data deposition, except in cases of proprietary information and health insurance portability and accountability act regulations, may be attributed to several factors, including the length of time required to gather and deposit the associated metadata, the challenges in learning how to deposit data (or new data format required) into a new repository system, the loss of experimental details, or limitations of currently available repositories. There is also a significant discrepancy between what is expected and required for the deposition of quantitative NMR metabolomics data versus profiling NMR metabolomics data. Technically, quantitative NMR metabolomics produces lists of unambiguously identified compounds and accurately measured concentrations, which are publicly shared upon publication. Most quantitative NMR metabolomics methods are rigorously validated (externally and internally) and require adherence to strict operational protocols. Therefore, for most analytical and clinical chemistry journals, lists of compounds and their concentrations are all that is normally required for public, FAIR-compliant reporting. It is still an open question as to whether the raw data from a quantitative NMR metabolomics study is of general use given the strong dependency on commercial software providers or commercial entities to generate the quantitative data sets. Clearly, the availability of free, open source, or open access deconvolution software would help address this issue and would make the deposition of raw metabolomics data from quantitative studies more meaningful.
On the other hand, because NMR metabolite profiling data is so variable in nature, the need to access raw data for secondary confirmation and validation is a near-absolute requirement, and these types of data must be deposited in the public repositories. To address these challenges and ease data deposition, the NMR metabolite profiling community would greatly benefit from developing or enhancing data reporting tools that can be implemented with NMR metabolite profiling studies in real-time. Such tools would allow metadata to be systematically collected and organized during the study, thus, simplifying the deposition process. A recent perspective outlined recommendations for the use and reuse of metabolomics data by highlighting both the progress in the field and the existing needs to achieve the laudable goal of routine and ease of data depositions.
Comparisons of study outcomes with the existing scientific literature and the reporting of previously identified metabolites, together with noting possible inconsistencies, should be included as a fundamental component of all published metabolomics manuscripts. These issues tend to be less significant for quantitative NMR metabolomics studies, which tend to be highly consistent and uniform, but can be quite significant for profiling NMR metabolomics studies given the extreme variability in sample processing, data analysis, and interpretation. Even when experimental designs differ, comparisons of results provide valuable insights, especially if the procedural differences are highlighted. Reporting a study as “groundbreaking” while failing to contextualize findings within the existing body of research can impede scientific progress. For instance, metabolomic studies that identify a significant change in the concentration of a certain metabolite without acknowledging the prior findings could lead to redundancy or missed opportunities to explore novel aspects of the metabolite(s) of interest. Doing so will also neglect the opportunity to verify the reproducibility of outcomes across multiple laboratories and studies.
3. Summary
First and foremost, the intent of this paper, particularly Table and Supporting Information, is to serve as a useful template for reporting fundamental study parameters for the successful dissemination of NMR metabolomics data. Second, this paper underscores the importance of comprehensive reporting to ensure study reliability, reproducibility, and study comparability. We aim to encourage thoughtful reflection on the reasons why experimental reporting in NMR metabolomics, especially NMR metabolite profiling metabolomics, is currently inadequate, strategies to make metabolomics data reporting easier, areas where current guidelines fall short, and the need to establish best practices for reporting reliable findings to the scientific community.
We also provide Supporting Information which points to seminal papers in the field to serve as guidance on how to best design and implement quantitative and profiling NMR metabolomics studies. While this paper does not propose best practices for specific biological systems or NMR techniques, we suggest follow-up efforts to establish guidelines in collaboration with the respective research communities. Similarly, establishing best practices for study design, methodology, and data analysis should be a high priority and the logical next step for the NMR metabolomics community. These efforts should include the adoption of standardized formatting in conjunction with the mQACC best practices for profiling NMR metabolomics and a living guidance document to ensure consistency, accessibility, and interoperability as the field of NMR metabolomics grows. A link to the mQACC living guidance document, including a glossary of terms, is forthcoming, and mQACC efforts in establishing a framework for best practices are documented in a study by Mosley et al. (2024). Additionally, the Best Practices for Non-Targeted Analysis (BP4NTA) working group developed a MS-based Non-Targeted Analysis Study Reporting Tool and a Study Planning Tool, which have aided study design, evaluation, and peer review, serving as excellent examples for our intended efforts.
An extensive list of items to report in an NMR-based metabolomics study is provided in Tables and Supporting Information S1, and some of the details necessary for a thorough report on a few critical topics are discussed throughout the text. We acknowledge that the efforts required to include this level of detail in a manuscript and the fact that journal word count limitations commonly lead to the exclusion of experimental details from the main text. Therefore, we recommend incorporating many of these important experimental details into Supporting Information text and tables.
Supplementary Material
Acknowledgments
This piece was conceptualized and written within the scope of MANA working groups, specifically with the NMR Special Interest Group. The graphical abstract was created using Canva. This work was supported with funding from the Nebraska Center for Integrated Biomolecular Communication (P20 GM113126, NIGMS). A.S.E. and J.H. were supported by the NSF-funded Network for Advanced NMR (1946970). V.C. has been supported by NIH grant R01 DK117473-01A1. C.C.F. is supported by the NIH Office of the Director and the National Institute of Neurological Disorders and Stroke DP2NS127704. T.G. is supported by funding from NSERC-Discovery Grant (RGPIN-2025-06578 and DGECR-2025-00277)
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.analchem.5c03274.
Reporting recommendations for NMR metabolomics and compilation of references for study design and implementation of robust NMR metabolomics studies (PDF)
The manuscript draft was prepared by E.R.A., A.L.B., and T.G., and all authors commented on and edited the manuscript and approved submission. E.R.A. and A.L.B. contributed equally to this work and have the right to list their name first on their CV.
The authors declare no competing financial interest.
References
- Alonso A., Marsal S., Julià A.. Analytical Methods in Untargeted Metabolomics: State of the Art in 2015. Front. Bioeng. Biotechnol. 2015;3:23. doi: 10.3389/fbioe.2015.00023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu B., Zanetti K. A., Temprosa M., Albanes D., Appel N., Barrera C. B., Ben-Shlomo Y., Boerwinkle E., Casas J. P., Clish C., Dale C., Dehghan A., Derkach A., Eliassen A. H., Elliott P., Fahy E., Gieger C., Gunter M. J., Harada S., Harris T., Herr D. R., Herrington D., Hirschhorn J. N., Hoover E., Hsing A. W., Johansson M., Kelly R. S., Khoo C. M., Kivimäki M., Kristal B. S., Langenberg C., Lasky-Su J., Lawlor D. A., Lotta L. A., Mangino M., Le Marchand L., Mathé E., Matthews C. E., Menni C., Mucci L. A., Murphy R., Oresic M., Orwoll E., Ose J., Pereira A. C., Playdon M. C., Poston L., Price J., Qi Q., Rexrode K., Risch A., Sampson J., Seow W. J., Sesso H. D., Shah S. H., Shu X.-O., Smith G. C. S., Sovio U., Stevens V. L., Stolzenberg-Solomon R., Takebayashi T., Tillin T., Travis R., Tzoulaki I., Ulrich C. M., Vasan R. S., Verma M., Wang Y., Wareham N. J., Wong A., Younes N., Zhao H., Zheng W., Moore S. C.. The Consortium of Metabolomics Studies (COMETS): Metabolomics in 47 Prospective Cohort Studies. Am. J. Epidemiol. 2019;188(6):991–1012. doi: 10.1093/aje/kwz028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zanetti K. A., Hall R. D., Griffin J. L., Putri S., Salek R. M., Styczynski M. P., Tugizimana F., van der Hooft J. J. J.. The Metabolomics Society-Current State of the Membership and Future Directions. Metabolites. 2019;9(5):89. doi: 10.3390/metabo9050089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beger R. D., Dunn W. B., Bandukwala A., Bethan B., Broadhurst D., Clish C. B., Dasari S., Derr L., Evans A., Fischer S., Flynn T., Hartung T., Herrington D., Higashi R., Hsu P.-C., Jones C., Kachman M., Karuso H., Kruppa G., Lippa K., Maruvada P., Mosley J., Ntai I., O’Donovan C., Playdon M., Raftery D., Shaughnessy D., Souza A., Spaeder T., Spalholz B., Tayyari F., Ubhi B., Verma M., Walk T., Wilson I., Witkin K., Bearden D. W., Zanetti K. A.. Towards Quality Assurance and Quality Control in Untargeted Metabolomics Studies. Metabolomics. 2019;15(1):4. doi: 10.1007/s11306-018-1460-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haug K., Salek R. M., Steinbeck C.. Global Open Data Management in Metabolomics. Curr. Opin. Chem. Biol. 2017;36:58–63. doi: 10.1016/j.cbpa.2016.12.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiehn O., Robertson D., Griffin J., Van Der Werf M., Nikolau B., Morrison N., Sumner L. W., Goodacre R., Hardy N. W., Taylor C., Fostel J., Kristal B., Kaddurah-Daouk R., Mendes P., Van Ommen B., Lindon J. C., Sansone S.-A.. The Metabolomics Standards Initiative ({MSI}) Metabolomics. 2007;3(3):175–178. doi: 10.1007/s11306-007-0070-6. [DOI] [Google Scholar]
- Rocca-Serra P., Salek R. M., Arita M., Correa E., Dayalan S., Gonzalez-Beltran A., Ebbels T., Goodacre R., Hastings J., Haug K., Koulman A., Nikolski M., Oresic M., Sansone S.-A., Schober D., Smith J., Steinbeck C., Viant M. R., Neumann S.. Data Standards Can Boost Metabolomics Research, and If There Is a Will, There Is a Way. Metabolomics. 2016;12:14. doi: 10.1007/s11306-015-0879-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rubtsov D. V., Jenkins H., Ludwig C., Easton J., Viant M. R., Günther U., Griffin J. L., Hardy N.. Proposed Reporting Requirements for the Description of NMR-Based Metabolomics Experiments. Metabolomics. 2007;3(3):223–229. doi: 10.1007/s11306-006-0040-4. [DOI] [Google Scholar]
- Sumner L. W., Amberg A., Barrett D., Beale M. H., Beger R., Daykin C. A., Fan T. W.-M., Fiehn O., Goodacre R., Griffin J. L., Hankemeier T., Hardy N., Harnly J., Higashi R., Kopka J., Lane A. N., Lindon J. C., Marriott P., Nicholls A. W., Reily M. D., Thaden J. J., Viant M. R.. Proposed Minimum Reporting Standards for Chemical Analysis Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI) Metabolomics. 2007;3(3):211–221. doi: 10.1007/s11306-007-0082-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goodacre R., Broadhurst D., Smilde A. K., Kristal B. S., Baker J. D., Beger R., Bessant C., Connor S., Capuani G., Craig A., Ebbels T., Kell D. B., Manetti C., Newton J., Paternostro G., Somorjai R., Sjöström M., Trygg J., Wulfert F.. Proposed Minimum Reporting Standards for Data Analysis in Metabolomics. Metabolomics. 2007;3(3):231–241. doi: 10.1007/s11306-007-0081-3. [DOI] [Google Scholar]
- Taylor C. F., Binz P.-A., Aebersold R., Affolter M., Barkovich R., Deutsch E. W., Horn D. M., Hühmer A., Kussmann M., Lilley K., Macht M., Mann M., Müller D., Neubert T. A., Nickson J., Patterson S. D., Raso R., Resing K., Seymour S. L., Tsugita A., Xenarios I., Zeng R., Julian R. K.. Guidelines for Reporting the Use of Mass Spectrometry in Proteomics. Nat. Biotechnol. 2008;26(8):860–861. doi: 10.1038/nbt0808-860. [DOI] [PubMed] [Google Scholar]
- Goodacre R.. Water, Water, Every Where, but Rarely Any Drop to Drink. Metabolomics. 2014;10(1):5–7. doi: 10.1007/s11306-013-0618-6. [DOI] [Google Scholar]
- Salek R. M., Neumann S., Schober D., Hummel J., Billiau K., Kopka J., Correa E., Reijmers T., Rosato A., Tenori L., Turano P., Marin S., Deborde C., Jacob D., Rolin D., Dartigues B., Conesa P., Haug K., Rocca-Serra P., O’Hagan S., Hao J., van Vliet M., Sysi-Aho M., Ludwig C., Bouwman J., Cascante M., Ebbels T., Griffin J. L., Moing A., Nikolski M., Oresic M., Sansone S.-A., Viant M. R., Goodacre R., Günther U. L., Hankemeier T., Luchinat C., Walther D., Steinbeck C.. COordination of Standards in MetabOlomicS (COSMOS): Facilitating Integrated Metabolomics Data Access. Metabolomics. 2015;11(6):1587–1597. doi: 10.1007/s11306-015-0810-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirwan J. A., Gika H., Beger R. D., Bearden D., Dunn W. B., Goodacre R., Theodoridis G., Witting M., Yu L.-R., Wilson I. D.. metabolomics Quality Assurance and Quality Control Consortium (mQACC) Quality Assurance and Quality Control Reporting in Untargeted Metabolic Phenotyping: mQACC Recommendations for Analytical Quality Management. Metabolomics. 2022;18(9):70. doi: 10.1007/s11306-022-01926-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gouveia G. J., Head T., Cheng L. L., Clendinen C. S., Cort J. R., Du X., Edison A. S., Fleischer C. C., Hoch J., Mercaldo N., Pathmasiri W., Raftery D., Schock T. B., Sumner L. W., Takis P. G., Copié V., Eghbalnia H. R., Powers R.. Perspective: Use and Reuse of NMR-Based Metabolomics Data: What Works and What Remains Challenging. Metabolomics. 2024;20(2):41. doi: 10.1007/s11306-024-02090-6. [DOI] [PubMed] [Google Scholar]
- Considine E. C., Thomas G., Boulesteix A. L., Khashan A. S., Kenny L. C.. Critical Review of Reporting of the Data Analysis Step in Metabolomics. Metabolomics. 2018;14(1):7. doi: 10.1007/s11306-017-1299-3. [DOI] [PubMed] [Google Scholar]
- Spicer R. A., Salek R., Steinbeck C.. A Decade after the Metabolomics Standards Initiative It’s Time for a Revision. Sci. Data. 2017;4:170138. doi: 10.1038/sdata.2017.138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Powers R., Andersson E. R., Bayless A. L., Brua R. B., Chang M. C., Cheng L. L., Clendinen C. S., Cochran D., Copié V., Cort J. R., Crook A. A., Eghbalnia H. R., Giacalone A., Gouveia G. J., Hoch J. C., Jeppesen M. J., Maroli A. S., Merritt M. E., Pathmasiri W., Roth H. E., Rushin A., Sakallioglu I. T., Sarma S., Schock T. B., Sumner L. W., Takis P., Uchimiya M., Wishart D. S.. Best Practices in NMR Metabolomics: Current State. TrAC, Trends Anal. Chem. 2024;171:117478. doi: 10.1016/j.trac.2023.117478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roth H. E., Powers R.. Meta-Analysis Reveals Both the Promises and the Challenges of Clinical Metabolomics. Cancers. 2022;14(16):3992. doi: 10.3390/cancers14163992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goveia J., Pircher A., Conradi L.-C., Kalucka J., Lagani V., Dewerchin M., Eelen G., DeBerardinis R. J., Wilson I. D., Carmeliet P.. Meta-Analysis of Clinical Metabolic Profiling Studies in Cancer: Challenges and Opportunities. EMBO Mol. Med. 2016;8(10):1134–1142. doi: 10.15252/emmm.201606798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smits A. J., Botros L., Mol M. A. E., Ziesemer K. A., Wilkins M. R., Vonk Noordegraaf A., Bogaard H. J., Aman J.. A Systematic Review with Meta-Analysis of Biomarkers for Detection of Pulmonary Arterial Hypertension. ERJ. Open Res. 2022;8(2):00009–02022. doi: 10.1183/23120541.00009-2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cochran D., Noureldein M., Bezdeková D., Schram A., Howard R., Powers R.. A Reproducibility Crisis for Clinical Metabolomics Studies. TrAC, Trends Anal. Chem. 2024;180:117918. doi: 10.1016/j.trac.2024.117918. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chenomx NMR Suite. https://www.chenomx.com (accessed Mar 27, 2025).
- FoodScreener®: Platform for NMR-Based Food Analysis. https://www.bruker.com (accessed Mar 27, 2025).
- In Vitro Diagnostics Research (IVDr) Platform for NMR-Based Metabolomics. https://www.bruker.com/(accessed Mar 27, 2025).
- Nightingale Health. https://nightingalehealth.com (accessed Mar 27, 2025).
- Ghini V., Meoni G., Vignoli A., Di Cesare F., Tenori L., Turano P., Luchinat C.. Fingerprinting and Profiling in Metabolomics of Biosamples. Prog. Nucl. Magn. Reson. Spectrosc. 2023;138–139:105–135. doi: 10.1016/j.pnmrs.2023.10.002. [DOI] [PubMed] [Google Scholar]
- Dunn W. B., Broadhurst D. I., Atherton H. J., Goodacre R., Griffin J. L.. Systems Level Studies of Mammalian Metabolomes: The Roles of Mass Spectrometry and Nuclear Magnetic Resonance Spectroscopy. Chem. Soc. Rev. 2011;40(1):387–426. doi: 10.1039/B906712B. [DOI] [PubMed] [Google Scholar]
- Semmar, N. Metabotype Concept: Flexibility, Usefulness and Meaning in Different Biological Populations. In Metabolomics; Roessner, U. , Ed.; InTech, 2012. [Google Scholar]
- Kell D. B., Oliver S. G.. Here Is the Evidence, Now What Is the Hypothesis? The Complementary Roles of Inductive and Hypothesis-Driven Science in the Post-Genomic Era. Bioessays. 2004;26(1):99–105. doi: 10.1002/bies.10385. [DOI] [PubMed] [Google Scholar]
- Sévin D. C., Kuehne A., Zamboni N., Sauer U.. Biological Insights through Nontargeted Metabolomics. Curr. Opin. Biotechnol. 2015;34:1–8. doi: 10.1016/j.copbio.2014.10.001. [DOI] [PubMed] [Google Scholar]
- Rosato A., Tenori L., Cascante M., De Atauri Carulla P. R., Dos Santos V. A. P. M., Saccenti E.. From Correlation to Causation: Analysis of Metabolomics Data Using Systems Biology Approaches. Metabolomics. 2018;14(4):37. doi: 10.1007/s11306-018-1335-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Broadhurst D. I., Kell D. B.. Statistical Strategies for Avoiding False Discoveries in Metabolomics and Related Experiments. Metabolomics. 2007;2(4):171–196. doi: 10.1007/s11306-006-0037-z. [DOI] [Google Scholar]
- Saccenti E., Hoefsloot H. C. J., Smilde A. K., Westerhuis J. A., Hendriks M. M. W. B.. Reflections on Univariate and Multivariate Analysis of Metabolomics Data. Metabolomics. 2014;10(3):361–374. doi: 10.1007/s11306-013-0598-6. [DOI] [Google Scholar]
- Westerhuis, J. A. ; Van Velzen, E. J. J. ; Jansen, J. J. ; Hoefsloot, H. C. J. ; Smilde, A. K. . Analysis of High-Dimensional Data from Designed Metabolomics Studies. In Issues in Toxicology; Grootveld, M. , Ed.; Royal Society of Chemistry: Cambridge, 2014; Chapter 4, pp 117–136. [Google Scholar]
- Eghbalnia H. R., Romero P. R., Westler W. M., Baskaran K., Ulrich E. L., Markley J. L.. Increasing Rigor in NMR-Based Metabolomics through Validated and Open Source Tools. Curr. Opin. Biotechnol. 2017;43:56–61. doi: 10.1016/j.copbio.2016.08.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bhinderwala, F. ; Powers, R. . NMR Metabolomics Protocols for Drug Discovery. NMR-Based Metabolomics: Methods and Protocol; Springer, 2019; Vol. 2037, pp 265–311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Álvarez-Sánchez B., Priego-Capote F., De Castro M. D. L.. Metabolomics Analysis I. Selection of Biological Samples and Practical Aspects Preceding Sample Preparation. TrAC, Trends Anal. Chem. 2010;29(2):111–119. doi: 10.1016/j.trac.2009.12.003. [DOI] [Google Scholar]
- Yin P., Lehmann R., Xu G.. Effects of Pre-Analytical Processes on Blood Samples Used in Metabolomics Studies. Anal. Bioanal. Chem. 2015;407(17):4879–4892. doi: 10.1007/s00216-015-8565-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vignoli A., Tenori L., Morsiani C., Turano P., Capri M., Luchinat C.. Serum or Plasma (and Which Plasma), That Is the Question. J. Proteome Res. 2022;21(4):1061–1072. doi: 10.1021/acs.jproteome.1c00935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sotelo-Orozco J., Chen S.-Y., Hertz-Picciotto I., Slupsky C. M.. A Comparison of Serum and Plasma Blood Collection Tubes for the Integration of Epidemiological and Metabolomics Data. Front. Mol. Biosci. 2021;8:682134. doi: 10.3389/fmolb.2021.682134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cochran D., Takis P. G., Alexander J. L., Mullish B. H., Powell N., Marchesi J. R., Powers R.. Evaluating Protocols for Reproducible Targeted Metabolomics by NMR. Analyst. 2024;149(22):5423–5432. doi: 10.1039/D4AN01015A. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sumner, L. W. ; Yang, D. S. ; Bench, B. J. ; Watson, B. S. ; Li, C. ; Jones, A. D. . Spatially Resolved Plant Metabolomics. In Annual Plant Reviews Vol. 43; Hall, R. D. , Ed.; Wiley, 2011; pp 343–366. [Google Scholar]
- Kirwan J. A., Brennan L., Broadhurst D., Fiehn O., Cascante M., Dunn W. B., Schmidt M. A., Velagapudi V.. Preanalytical Processing and Biobanking Procedures of Biological Samples for Metabolomics Research: A White Paper, Community Perspective (for “Precision Medicine and Pharmacometabolomics Task Group”-The Metabolomics Society Initiative) Clin. Chem. 2018;64(8):1158–1182. doi: 10.1373/clinchem.2018.287045. [DOI] [PubMed] [Google Scholar]
- Ghini V., Quaglio D., Luchinat C., Turano P.. NMR for Sample Quality Assessment in Metabolomics. New Biotechnol. 2019;52:25–34. doi: 10.1016/j.nbt.2019.04.004. [DOI] [PubMed] [Google Scholar]
- Ghini V., Abuja P. M., Polasek O., Kozera L., Laiho P., Anton G., Zins M., Klovins J., Metspalu A., Wichmann H.-E., Gieger C., Luchinat C., Zatloukal K., Turano P.. Metabolomic Fingerprints in Large Population Cohorts: Impact of Preanalytical Heterogeneity. Clin. Chem. 2021;67(8):1153–1155. doi: 10.1093/clinchem/hvab092. [DOI] [PubMed] [Google Scholar]
- Ghini V., Abuja P. M., Polasek O., Kozera L., Laiho P., Anton G., Zins M., Klovins J., Metspalu A., Wichmann H.-E., Gieger C., Luchinat C., Zatloukal K., Turano P.. Impact of the Pre-Examination Phase on Multicenter Metabolomic Studies. New Biotechnol. 2022;68:37–47. doi: 10.1016/j.nbt.2022.01.006. [DOI] [PubMed] [Google Scholar]
- Lin C. Y., Wu H., Tjeerdema R. S., Viant M. R.. Evaluation of Metabolite Extraction Strategies from Tissue Samples Using NMR Metabolomics. Metabolomics. 2007;3(1):55–67. doi: 10.1007/s11306-006-0043-1. [DOI] [Google Scholar]
- Wu H., Southam A. D., Hines A., Viant M. R.. High-Throughput Tissue Extraction Protocol for NMR- and MS-Based Metabolomics. Anal. Biochem. 2008;372(2):204–212. doi: 10.1016/j.ab.2007.10.002. [DOI] [PubMed] [Google Scholar]
- Teytelman L., Stoliartchouk A., Kindler L., Hurwitz B. L.. Protocols.Io: Virtual Communities for Protocol Development and Discussion. PLoS Biol. 2016;14(8):e1002538. doi: 10.1371/journal.pbio.1002538. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mosley J. D., Schock T. B., Beecher C. W., Dunn W. B., Kuligowski J., Lewis M. R., Theodoridis G., Ulmer Holland C. Z., Vuckovic D., Wilson I. D., Zanetti K. A.. Establishing a Framework for Best Practices for Quality Assurance and Quality Control in Untargeted Metabolomics. Metabolomics. 2024;20(2):20. doi: 10.1007/s11306-023-02080-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keeler, J. J. Understanding NMR Spectroscopy, 2nd ed.; Wiley: Chichester, 2011. [Google Scholar]
- Emwas A.-H., Roy R., McKay R. T., Tenori L., Saccenti E., Gowda G. A. N., Raftery D., Alahmari F., Jaremko L., Jaremko M., Wishart D. S.. NMR Spectroscopy for Metabolomics Research. Metabolites. 2019;9(7):123. doi: 10.3390/metabo9070123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xiao C., Hao F., Qin X., Wang Y., Tang H.. An Optimized Buffer System for NMR-Based Urinary Metabonomics with Effective pH Control, Chemical Shift Consistency and Dilution Minimization. Analyst. 2009;134(5):916–925. doi: 10.1039/b818802e. [DOI] [PubMed] [Google Scholar]
- Cloarec O., Dumas M. E., Trygg J., Craig A., Barton R. H., Lindon J. C., Nicholson J. K., Holmes E.. Evaluation of the Orthogonal Projection on Latent Structure Model Limitations Caused by Chemical Shift Variability and Improved Visualization of Biomarker Changes in 1H NMR Spectroscopic Metabonomic Studies. Anal. Chem. 2005;77(2):517–526. doi: 10.1021/ac048803i. [DOI] [PubMed] [Google Scholar]
- Parsons H. M., Ekman D. R., Collette T. W., Viant M. R.. Spectral Relative Standard Deviation: A Practical Benchmark in Metabolomics. Analyst. 2009;134(3):478–485. doi: 10.1039/B808986H. [DOI] [PubMed] [Google Scholar]
- Giraudeau P., Silvestre V., Akoka S.. Optimizing Water Suppression for Quantitative NMR-Based Metabolomics: A Tutorial Review. Metabolomics. 2015;11(5):1041–1055. doi: 10.1007/s11306-015-0794-7. [DOI] [Google Scholar]
- Hao J., Liebeke M., Astle W., De Iorio M., Bundy J. G., Ebbels T. M. D.. Bayesian Deconvolution and Quantification of Metabolites in Complex 1D NMR Spectra Using BATMAN. Nat. Protoc. 2014;9(6):1416–1427. doi: 10.1038/nprot.2014.090. [DOI] [PubMed] [Google Scholar]
- Ravanbakhsh S., Liu P., Bjordahl T. C., Mandal R., Grant J. R., Wilson M., Eisner R., Sinelnikov I., Hu X., Luchinat C., Greiner R., Wishart D. S.. Accurate, Fully-Automated NMR Spectral Profiling for Metabolomics. PloS One. 2015;10(5):e0124219. doi: 10.1371/journal.pone.0124219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu Y., Sanati O., Uchimiya M., Krishnamurthy K., Wedell J., Hoch J. C., Edison A. S., Delaglio F.. SAND: Automated Time-Domain Modeling of NMR Spectra Applied to Metabolite Quantification. Anal. Chem. 2024;96(5):1843–1851. doi: 10.1021/acs.analchem.3c03078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang B., Maldonado-Devincci A. M., Jiang L.. Evaluating Line-Broadening Factors on a Reference Spectrum as a Bucketing Method for NMR Based Metabolomics. Anal. Biochem. 2020;606:113872. doi: 10.1016/j.ab.2020.113872. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ebel A., Dreher W., Leibfritz D.. Effects of Zero-Filling and Apodization on Spectral Integrals in Discrete Fourier-Transform Spectroscopy of Noisy Data. J. Magn. Reson. 2006;182(2):330–338. doi: 10.1016/j.jmr.2006.06.026. [DOI] [PubMed] [Google Scholar]
- Wang K.-C., Wang S.-Y., Kuo C., Tseng Y. J.. Distribution-Based Classification Method for Baseline Correction of Metabolomic 1D Proton Nuclear Magnetic Resonance Spectra. Anal. Chem. 2013;85(2):1231–1239. doi: 10.1021/ac303233c. [DOI] [PubMed] [Google Scholar]
- Halouska S., Powers R.. Negative Impact of Noise on the Principal Component Analysis of NMR Data. J. Magn. Reson. 2006;178(1):88–95. doi: 10.1016/j.jmr.2005.08.016. [DOI] [PubMed] [Google Scholar]
- Craig A., Cloarec O., Holmes E., Nicholson J. K., Lindon J. C.. Scaling and Normalization Effects in NMR Spectroscopic Metabonomic Data Sets. Anal. Chem. 2006;78(7):2262–2267. doi: 10.1021/ac0519312. [DOI] [PubMed] [Google Scholar]
- Dieterle F., Ross A., Schlotterbeck G., Senn H.. Probabilistic Quotient Normalization as Robust Method to Account for Dilution of Complex Biological Mixtures. Application in 1H NMR Metabonomics. Anal. Chem. 2006;78(13):4281–4290. doi: 10.1021/ac051632c. [DOI] [PubMed] [Google Scholar]
- van den Berg R. A., Hoefsloot H. C. J., Westerhuis J. A., Smilde A. K., van der Werf M. J., Centering M. J.. Centering, scaling, and transformations: improving the biological information content of metabolomics data. BMC Genom. 2006;7:142. doi: 10.1186/1471-2164-7-142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walach J., Filzmoser P., Hron K.. Data Normalization and Scaling: Consequences for the Analysis in Omics Sciences. Compr. Anal. Chem. 2018;82:165–196. doi: 10.1016/bs.coac.2018.06.004. [DOI] [Google Scholar]
- Worley B., Halouska S., Powers R.. Utilities for Quantifying Separation in PCA/PLS-DA Scores Plots. Anal. Biochem. 2013;433(2):102–104. doi: 10.1016/j.ab.2012.10.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peluso A., Glen R., Ebbels T. M. D.. Multiple-Testing Correction in Metabolome-Wide Association Studies. BMC Bioinf. 2021;22(1):67. doi: 10.1186/s12859-021-03975-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bouatra S., Aziat F., Mandal R., Guo A. C., Wilson M. R., Knox C., Bjorndahl T. C., Krishnamurthy R., Saleem F., Liu P., Dame Z. T., Poelzer J., Huynh J., Yallou F. S., Psychogios N., Dong E., Bogumil R., Roehring C., Wishart D. S.. The Human Urine Metabolome. PloS One. 2013;8(9):e73076. doi: 10.1371/journal.pone.0073076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pham N., van Heck R. G. A., van Dam J. C. J., Schaap P. J., Saccenti E., Suarez-Diez M.. Consistency, Inconsistency, and Ambiguity of Metabolite Names in Biochemical Databases Used for Genome-Scale Metabolic Modelling. Metabolites. 2019;9(2):28. doi: 10.3390/metabo9020028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wohlgemuth G., Haldiya P. K., Willighagen E., Kind T., Fiehn O.. The Chemical Translation Servicea Web-Based Tool to Improve Standardization of Metabolomic Reports. Bioinformatics. 2010;26(20):2647–2648. doi: 10.1093/bioinformatics/btq476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dashti H., Westler W. M., Markley J. L., Eghbalnia H. R.. Unique Identifiers for Small Molecules Enable Rigorous Labeling of Their Atoms. Sci. Data. 2017;4(1):170073. doi: 10.1038/sdata.2017.73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Djoumbou Feunang Y., Eisner R., Knox C., Chepelev L., Hastings J., Owen G., Fahy E., Steinbeck C., Subramanian S., Bolton E., Greiner R., Wishart D. S.. ClassyFire: Automated Chemical Classification with a Comprehensive, Computable Taxonomy. J. Cheminf. 2016;8:61. doi: 10.1186/s13321-016-0174-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wishart D. S., Girod S., Peters H., Oler E., Jovel J., Budinski Z., Milford R., Lui V. W., Sayeeda Z., Mah R., Wei W., Badran H., Lo E., Yamamoto M., Djoumbou-Feunang Y., Karu N., Gautam V.. ChemFOnt: The Chemical Functional Ontology Resource. Nucleic Acids Res. 2023;51(D1):D1220–D1229. doi: 10.1093/nar/gkac919. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pang Z., Chong J., Zhou G., de Lima Morais D. A., Chang L., Barrette M., Gauthier C., Jacques P.-E. ´., Li S., Xia J.. MetaboAnalyst 5.0: Narrowing the Gap between Raw Spectra and Functional Insights. Nucleic Acids Res. 2021;49(W1):W388–W396. doi: 10.1093/nar/gkab382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stavarache C., Nicolescu A., Duduianu C., Ailiesei G. L., Balan-Porcăraşu M., Cristea M., Macsim A.-M., Popa O., Stavarache C., Hîrtopeanu A., Barbeş L., Stan R., Iovu H., Deleanu C.. A Real-Life Reproducibility Assessment for NMR Metabolomics. Diagnostics. 2022;12(3):559. doi: 10.3390/diagnostics12030559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zailer E., Holzgrabe U., Diehl B. W. K.. Interlaboratory Comparison Test as an Evaluation of Applicability of an Alternative Edible Oil Analysis by 1H NMR Spectroscopy. J. AOAC Int. 2017;100(6):1819–1830. doi: 10.5740/jaoacint.17-0063. [DOI] [PubMed] [Google Scholar]
- Musio B., Ragone R., Todisco S., Rizzuti A., Latronico M., Mastrorilli P., Pontrelli S., Intini N., Scapicchio P., Triggiani M., Di Noia T., Acquotti D., Airoldi C., Assfalg M., Barge A., Bateman L., Benevelli F., Bertelli D., Bertocchi F., Bieliauskas A., Borioni A., Caligiani A., Callone E., Čamra A., Cesare Marincola F., Chalasani D., Consonni R., Dambruoso P., Davalli S., David T., Diehl B., Donarski J., Gil A. M., Gobetto R., Goldoni L., Hamon E., Harwood J. S., Kobrlová A., Longobardi F., Luisi R., Mallamace D., Mammi S., Martin-Biran M., Mazzei P., Mele A., Milone S., Vilchez D. M., Mulder R. J., Napoli C., Ragno D., Randazzo A., Rossi M. C., Rotondo A., Šačkus A., Sáez Barajas E., Schievano E., Sitaram B., Stevanato L., Takis P. G., Teipel J., Thomas F., Torregiani E., Valensin D., Veronesi M., Warren J., Wist J., Zailer-Hafer E., Zuccaccia C., Gallo V.. A Community-Built Calibration System: The Case Study of Quantification of Metabolites in Grape Juice by qNMR Spectroscopy. Talanta. 2020;214:120855. doi: 10.1016/j.talanta.2020.120855. [DOI] [PubMed] [Google Scholar]
- Ragone R., Todisco S., Triggiani M., Pontrelli S., Latronico M., Mastrorilli P., Intini N., Ferroni C., Musio B., Gallo V.. Development of a Food Class-Discrimination System by Non-Targeted NMR Analyses Using Different Magnetic Field Strengths. Food Chem. 2020;332:127339. doi: 10.1016/j.foodchem.2020.127339. [DOI] [PubMed] [Google Scholar]
- Da Silva L., Godejohann M., Martin F.-P. J., Collino S., Bürkle A., Moreno-Villanueva M., Bernhardt J., Toussaint O., Grubeck-Loebenstein B., Gonos E. S., Sikora E., Grune T., Breusing N., Franceschi C., Hervonen A., Spraul M., Moco S.. High-Resolution Quantitative Metabolome Analysis of Urine by Automated Flow Injection NMR. Anal. Chem. 2013;85(12):5801–5809. doi: 10.1021/ac4004776. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiménez B., Holmes E., Heude C., Tolson R. F., Harvey N., Lodge S. L., Chetwynd A. J., Cannet C., Fang F., Pearce J. T. M., Lewis M. R., Viant M. R., Lindon J. C., Spraul M., Schäfer H., Nicholson J. K.. Quantitative Lipoprotein Subclass and Low Molecular Weight Metabolite Analysis in Human Serum and Plasma by 1H NMR Spectroscopy in a Multilaboratory Trial. Anal. Chem. 2018;90(20):11962–11971. doi: 10.1021/acs.analchem.8b02412. [DOI] [PubMed] [Google Scholar]
- Centelles S. M., Hoefsloot H. C. J., Khakimov B., Ebrahimi P., Lind M. V., Kristensen M., de Roo N., Jacobs D. M., van Duynhoven J., Cannet C., Fang F., Humpfer E., Schäfer H., Spraul M., Engelsen S. B., Smilde A. K.. Toward Reliable Lipoprotein Particle Predictions from NMR Spectra of Human Blood: An Interlaboratory Ring Test. Anal. Chem. 2017;89(15):8004–8012. doi: 10.1021/acs.analchem.7b01329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tynkkynen T., Wang Q., Ekholm J., Anufrieva O., Ohukainen P., Vepsäläinen J., Männikkö M., Keinänen-Kiukaanniemi S., Holmes M. V., Goodwin M., Ring S., Chambers J. C., Kooner J., Järvelin M.-R., Kettunen J., Hill M., Smith G. D., Ala-Korpela M.. Proof of Concept for Quantitative Urine NMR Metabolomics Pipeline for Large-Scale Epidemiology and Genetics. Int. J. Epidemiol. 2019;48(3):978–993. doi: 10.1093/ije/dyy287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoch J. C., Baskaran K., Burr H., Chin J., Eghbalnia H. R., Fujiwara T., Gryk M. R., Iwata T., Kojima C., Kurisu G., Maziuk D., Miyanoiri Y., Wedell J. R., Wilburn C., Yao H., Yokochi M.. Biological Magnetic Resonance Data Bank. Nucleic Acids Res. 2023;51(D1):D368–D376. doi: 10.1093/nar/gkac1050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peter K. T., Phillips A. L., Knolhoff A. M., Gardinali P. R., Manzano C. A., Miller K. E., Pristner M., Sabourin L., Sumarah M. W., Warth B., Sobus J. R.. Nontargeted Analysis Study Reporting Tool: A Framework to Improve Research Transparency and Reproducibility. Anal. Chem. 2021;93(41):13870–13879. doi: 10.1021/acs.analchem.1c02621. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Ideally, acquired spectra along with sufficient metadata would be deposited in a publicly accessible repository appropriate for NMR metabolomics data to enable the reproducibility of results. The inclusion of QC and reference material spectra allows for data quality assessment, especially in consideration of data reuse and comparability across laboratories even as technology advances. Most commonly used repositories for metabolomics data sets are currently Metabolights (https://www.ebi.ac.uk/metabolights/), the NIH Common Fund’s National Metabolomics Data Repository, formerly known as Metabolomics Workbench (https://www.metabolomicsworkbench.org/), and the Biological Magnetic Resonance Data Bank (http://bmrb.io). According to the literature review, there were no reports of metabolomics data being deposited in an appropriate repository in 2010. This figure increased to a mere 11% by 2020.
The landscape of data sharing has shifted dramatically since 2010, when few journals enforced data deposition. Presently, journals such as Metabolites and Metabolomics strongly encourage this crucial step, and the lack of deposition requires a statement regarding why the data have not been deposited. Despite an overall shift toward FAIR (findability, accessibility, interoperability, and reusability) data principles (https://www.go-fair.org/fair-principles/), the 2020 11% deposition rate leaves vast room for improvement. The lack of data deposition, except in cases of proprietary information and health insurance portability and accountability act regulations, may be attributed to several factors, including the length of time required to gather and deposit the associated metadata, the challenges in learning how to deposit data (or new data format required) into a new repository system, the loss of experimental details, or limitations of currently available repositories. There is also a significant discrepancy between what is expected and required for the deposition of quantitative NMR metabolomics data versus profiling NMR metabolomics data. Technically, quantitative NMR metabolomics produces lists of unambiguously identified compounds and accurately measured concentrations, which are publicly shared upon publication. Most quantitative NMR metabolomics methods are rigorously validated (externally and internally) and require adherence to strict operational protocols. Therefore, for most analytical and clinical chemistry journals, lists of compounds and their concentrations are all that is normally required for public, FAIR-compliant reporting. It is still an open question as to whether the raw data from a quantitative NMR metabolomics study is of general use given the strong dependency on commercial software providers or commercial entities to generate the quantitative data sets. Clearly, the availability of free, open source, or open access deconvolution software would help address this issue and would make the deposition of raw metabolomics data from quantitative studies more meaningful.
On the other hand, because NMR metabolite profiling data is so variable in nature, the need to access raw data for secondary confirmation and validation is a near-absolute requirement, and these types of data must be deposited in the public repositories. To address these challenges and ease data deposition, the NMR metabolite profiling community would greatly benefit from developing or enhancing data reporting tools that can be implemented with NMR metabolite profiling studies in real-time. Such tools would allow metadata to be systematically collected and organized during the study, thus, simplifying the deposition process. A recent perspective outlined recommendations for the use and reuse of metabolomics data by highlighting both the progress in the field and the existing needs to achieve the laudable goal of routine and ease of data depositions.
Comparisons of study outcomes with the existing scientific literature and the reporting of previously identified metabolites, together with noting possible inconsistencies, should be included as a fundamental component of all published metabolomics manuscripts. These issues tend to be less significant for quantitative NMR metabolomics studies, which tend to be highly consistent and uniform, but can be quite significant for profiling NMR metabolomics studies given the extreme variability in sample processing, data analysis, and interpretation. Even when experimental designs differ, comparisons of results provide valuable insights, especially if the procedural differences are highlighted. Reporting a study as “groundbreaking” while failing to contextualize findings within the existing body of research can impede scientific progress. For instance, metabolomic studies that identify a significant change in the concentration of a certain metabolite without acknowledging the prior findings could lead to redundancy or missed opportunities to explore novel aspects of the metabolite(s) of interest. Doing so will also neglect the opportunity to verify the reproducibility of outcomes across multiple laboratories and studies.


