Abstract
Metabolomics commonly relies on using one-dimensional (1D) 1H nuclear magnetic resonance (NMR) spectroscopy or liquid chromatography-mass spectrometry (LC-MS) to derive scientific insights from large collections of biological samples. NMR and MS approaches to metabolomics require, among other issues, a data processing pipeline. Quantitative assessment of the performance of these software platforms is challenged by a lack of standardized datasets with “known” outcomes. To resolve this issue, we created a novel simulated LC-MS dataset with known peak locations and intensities, defined metabolite differences between groups (i.e., fold change > 2, coefficient of variation ≤ 25%), and different amounts of added Gaussian noise (0%, 5%, or 10%) and missing features (0%, 10%, or 20%). This dataset was developed to improve benchmarking of existing LC-MS metabolomics software and to validate the updated version of our MVAPACK software, which added gas chromatography (GC)-MS and LC-MS functionality to its existing 1D and two-dimensional (2D) NMR data processing capabilities. We also included two experimental LC-MS datasets acquired from a standard mixture and Mycobacterium smegmatis cell lysates since a simulated dataset alone may not capture all the unique characteristics and variability of real spectra needed to assess software performance properly. Our simulated and experimental LC-MS datasets were processed with the MS-DIAL and XCMSOnline software packages and our MVAPACK toolkit to showcase the utility of our datasets to benchmark MVAPACK against community standards. Our results demonstrate the enhanced objectivity and clarity of software assessment that can be achieved when both simulated and experimental data are employed since distinctly different software performances were observed with the simulated and experimental LC-MS datasets. We also demonstrate that the performance of MVAPACK is equivalent to or exceeds existing LC-MS software programs while providing a single platform for processing and analyzing both NMR and MS datasets.
Keywords: simulated LC-MS dataset, MVAPACK, Metabolomics, Software, NMR, liquid chromatography-mass spectrometry
Introduction
Metabolomics is the quantitative analysis of small molecule metabolites present in biological samples.1 These metabolites include co-factors, substrates, and end products of enzymatic reactions, which play pivotal roles in cellular processes and are found in diverse tissues, cell lysates, and biofluids such as serum, saliva, and urine.2–4 Collectively, these metabolites constitute an organism’s metabolome, which can comprise thousands of distinct compounds.5 Given the sheer diversity and abundance of metabolites in various biological samples, the metabolome serves as a robust proxy for understanding a wide variety of biological processes and disease states.6 For instance, the abundance of specific metabolite biomarkers has been correlated with Parkinson’s disease, oral cancers, diabetes, heart disease, and an assortment of other human ailments.7–10 Thus, quantifying metabolic biomarkers is critical for deciphering disease mechanisms, formulating diagnostic and prognostic tests, and propelling drug discovery and development.11–15 Achieving these and other laudable goals is critically dependent on the complete and thorough characterization of the metabolome for a biological system. However, realizing these objectives hinges on comprehensively characterizing an organism’s metabolome. A significant challenge in this endeavor is identifying and quantifying every detectable metabolite in a sample. This task demands a synergy of advanced analytical techniques and specialized software for data processing and statistical analysis.
Liquid chromatography-mass spectrometry (LC-MS) is the most widely used analytical platform for metabolomics studies because of its high sensitivity and rapid throughput.16–18 Despite the popularity of LC-MS, spectral processing, and metabolite annotation follow a complex procedure of spectral alignment, batch correction, peak picking, and metabolite feature grouping, selection, and analysis. Spectral feature selection requires further investigator decision points such as identifying an acceptable minimal fold change (FC ≥ 2–3),19–21 coefficient of variance (CV < 30%),19, 22 p-values from pairwise comparisons (p < 0.05 – 0.01),19, 23 importance of each variable (VIP ≥ 1),24–27 and a missing value threshold (≥ 80%).28, 29 Further processing of the data matrix requires missing value imputation, normalization, and scaling. The scientific literature contains multiple algorithms for each processing step, creating numerous decision forks and a complex array of possible data processing pipelines dependent on the data type. Consequently, most metabolomics software has only been developed for a single data type, like MS-DIAL30 and XCMSOnline31 for LC-MS, which hinders the multiplatform approach needed to improve the coverage of the metabolome.32
The diversity and proliferation of metabolomics-related software is a result of the complexity of the data and the data processing steps. Still, it may also result from insufficient guidance from community-certified protocols. For example, despite on-going efforts,33–35 benchmark data and community accepted performance standards are lacking, which impedes the further development, assessment, validation, and adoption of a specific data processing pipeline and its underlying software. Benchmarking performance is critical for developers to fine-tune an algorithm’s behavior, optimize its outcomes, and for end-users to decide if a given software package fits their analytical needs and expectations and performs reliably. Despite the straightforward utility and need, the metabolomics community currently lacks standardized benchmarks that are universally employed. This is especially problematic for LC-MS software development given the large assortment of instrument vendors, experimental protocols, and chromatography column parameters, in addition to known issues with batch variability, high baseline noise, and missing peaks. These data imperfections originate from biological and technical sources and are independent of resolution and sensitivity.
Existing metabolomics datasets are often utilized with previously established results to benchmark novel software performance. Reanalyzing these datasets with new software can directly compare them with earlier findings. However, this approach is inherently flawed since it assumes the original analysis was correct and complete. Since the data’s ground truth is unknown, it is impossible to determine the actual accuracy and precision of a new metabolomics software package. Furthermore, it is unlikely the benchmark dataset is broadly accessible by the entire scientific community. More often, separate datasets are used by different groups to evaluate software, which prevents an accurate comparison of performance across groups or software packages. Broadly accessible simulated metabolomics data would directly address these issues and facilitate software development. Simulated data has ground truth, with each peak’s waveform, spectral noise, metabolite identity, and concentration fully defined.5
The complete knowledge of simulated spectra properties enables a straightforward assessment of algorithmic performance. Spectral features selected by an algorithm can be directly compared to the known composition of metabolites, allowing calculations of false positive and false negative sensitivity, selectivity, and other measures of accuracy and precision. Ground truth understanding becomes especially useful when simulated datasets incorporate noise and missing values to mimic the qualities of real spectra. Reducing spectral quality enables stress testing of peak picking, imputation, normalization, and other methods commonly employed in metabolomics data processing. In this manner, simulated data provide feedback on an algorithm’s sensitivity to these quality factors.8 These laudable goals cannot be achieved with experimental data since missing values and noise are not easily tunable factors. While there are clear advantages to simulated data for assessing software performance, there is still inherent value in using experimental data. It is challenging for simulated datasets to capture all the unique characteristics and variability of a real spectrum, especially distributed over a dataset comprised of multiple replicate spectra. Instead, as illustrated herein, a preferred approach incorporates both simulated and experimental data in the software evaluation, capitalizing on the strength of both techniques. Despite the clear advantages of using simulated and experimental LC-MS data to assess and validate metabolomics software, the field lacks a dataset developed for benchmarking.
In this work, we created LC-MS datasets that combine both simulated and experimental data for algorithm validation and to assess the performance of our MVAPACK toolkit relative to the existing MS-based metabolomics software packages MS-DIAL and XCMSOnline. We have created nine simulated LC-MS datasets with varying quality and noise levels using metabolite feature locations derived from the VIMMS software package https://github.com/glasgowcompbio/vimms20 and its associated database. We also acquired an experimental LC-MS dataset using a standard mixture that captures real spectral quality while minimizing data complexity and simplifying analysis. Both datasets have a set of known peaks that enables an assessment of the performance of the MVAPACK, MS-DIAL, and XCMSOnline software. The LC-MS dataset also contains a second experimental LC-MS dataset collected on Mycobacterium smegmatis cell lysates. This dataset captures the realistic characteristics of a typical LC-MS metabolomics experiment and enables an evaluation of relative software performance without ground truth. We anticipate that our simulated and experimental LC-MS dataset will assist and facilitate further software and method development in metabolomics. We also present an updated MVAPACK version to analyze NMR and LC-MS data. We identified several standard algorithms for LC-MS peak processing from the scientific literature and developed functions to execute these routines within the MVAPACK software platform. In this manner, MVAPACK is the first software toolkit for processing 1D 1H NMR, 2D NMR, and LC-MS datasets.
Materials and Methods
Summary of Study Design.
A synthetic LC-MS dataset was created to simulate a metabolomics study of two groups (G1 and G2) with ten replicates per group (Table 1) and 7,540 individual peaks with well-defined and high-resolution waveforms. Gaussian curves were fitted to each peak to idealize the data further and enable later manipulation (Figure 1A). Metabolites that differentiate G1 from G2 had a fold change (FC) greater than two and a coefficient of variation (CV) less than or equal to 25% (Figure 1B). Different amounts of added noise (0%, 5%, or 10%) (Figure 1C) and missing features (0%, 10%, or 20%) (Figure 1D) were applied to the individual spectra to create a final dataset comprising a total of 9 sets of LC-MS spectra. Two additional experimental LC-MS datasets were collected on a Waters (Milford, MA) nanoACQUITY UPLC and XEVO G2-XS QTof system from a standard mixture dataset (Table 2) and a biological dataset derived from M. smegmatis cell lysates. The simulated and experimental LC-MS datasets were then used to validate and benchmark the GC/LC-MS data processing pipeline (Table S1) implemented into our MVAPACK36 metabolomics toolkit (http://bionmr.unl.edu/mvapack.php) and to compare the performance of MVAPACK to MS-DIAL 4.7030 and XCMSOnline,31 which are popular metabolomics software packages. Peaks were further filtered by intensity (> 100000 counts) for the standard dataset. A detailed description of the experimental protocols can be found in the Supplemental Information and on the project’s git page (https://git.unl.edu/powers-group/mvapack-lcms-supplemental).
Table 1.
Summary of Raw Simulated Datasetsa
| Dataset Name | Feature Countb | Significant Featuresb | Non-Significant Featuresb | Median FC | Mean %CV | %Noise | Number Missing Featuresc | %Missing Featuresc |
|---|---|---|---|---|---|---|---|---|
| SD1 | 2673 | 935 | 1738 | 8.32 | 31.42% | 0% | 0 | 0% |
| SD2 | 2673 | 935 | 1738 | 8.32 | 31.42% | 5% | 0 | 0% |
| SD3 | 2673 | 935 | 1738 | 8.32 | 31.42% | 10% | 0 | 0% |
| SD4 | 2673 | 935 | 1738 | 8.43 | 40.09% | 0% | 1870 | 10% |
| SD5 | 2673 | 935 | 1738 | 8.26 | 40.06% | 5% | 1870 | 10% |
| SD6 | 2673 | 935 | 1738 | 8.40 | 40.09% | 10% | 1870 | 10% |
| SD7 | 2673 | 935 | 1738 | 8.37 | 46.81% | 0% | 3740 | 20% |
| SD8 | 2673 | 935 | 1738 | 8.32 | 46.81% | 5% | 3740 | 20% |
| SD9 | 2673 | 935 | 1738 | 8.27 | 46.81% | 10% | 3740 | 20% |
Figure 1.

An overview of the construction of the simulated dataset. A) An example of idealized, isotope waveforms. B) The application of statistically significant multipliers to create two groups of metabolite features, G1 (red) and G2 (green), which vary significantly between groups but have low variance within each group. C) The application of 5% (dark grey) and 10% (black) baseline noise to the original waveform (light grey). D) Illustration of the removal of features across G1 (red) and G2 (green) replicates to approximate data imperfections common to experimental datasets.
Table 2.
Summary of Standard Mixture Dataset
| compound | Stock (μg/ml) | Group 1 (ng/μl) | Group 2 (ng/μl) | Group 3 (ng/μl) |
|---|---|---|---|---|
| acetaminophen | 10 | 1 | 0.5 | 0.25 |
| caffeine | 1.5 | 0.15 | 0.075 | 0.0375 |
| sulfaguanidine | 5 | 0.5 | 0.25 | 0.125 |
| sulfadimethoxine | 1 | 0.1 | 0.05 | 0.025 |
| val-tyr-val | 2.5 | 0.25 | 0.125 | 0.0625 |
| verapamil | 0.2 | 0.02 | 0.01 | 0.005 |
| terfenadine | 0.2 | 0.02 | 0.01 | 0.005 |
| leucine-enkephalin | 2.5 | 0.25 | 0.125 | 0.0625 |
| reserpine | 0.6 | 0.06 | 0.03 | 0.015 |
Results
LC-MS data processing pipeline added to MVAPACK.
LC-MS processing functionality was successfully implemented into MVAPACK, which enables end-to-end analysis of MS-metabolomics data. In keeping with the existing MVAPACK package, LC-MS functionality is modular and provides users with multiple options for each processing step. 40 Octave functions (Table S1) were added to MVAPACK to process LC-MS datasets. The functions comprise the following general capabilities: (i) data input/output (11 functions), (ii) peak alignment (1 function), (iii) feature identification (11 functions), (iv) normalization (4 functions), (v) imputation (4 functions), and (vi) general data processing (9 functions). Acceptable input formats include mzML, mzXML, and proteoWizard.txt formats. Peaks are identified using Gaussian wavelet or Savitzky-Golay filters, and alignment is achieved via an RMSD approach or the ObiWarp method.31, 37 De-isotoping is performed to aggregate the individual peaks into metabolite features. Aggregated matrices are normalized through maximum intensity, quantile, or p-norm methods, and chemically interesting metabolite features are identified through ANOVA, maximum fold change, and maximum variance filtration methods. Individual LC-MS processing steps combine to form an analysis pipeline that interfaces with existing MVAPACK modeling functionality. MVAPACK provides processing capabilities for LC-MS, 1D 1H NMR, 2D 1H-13C/15N/31P NMR, and multiplatform datasets. The process of generating an LC-MS feature matrix and an example MVAPACK script are detailed in Figures 2A and 2B, respectively. MVAPACK was used to produce PCA models from the simulated (Figure 3A), standard mixture (Figure 4A), and M. smegmatis metabolomics (Figure 5A) LC-MS datasets. The MVAPACK PCA score plots are shown in Figures 4A and 5A compared to plots generated by MS-DIAL (Figures 4B, 5B) and XCMSOnline (Figures 4C, 5C).
Figure 2.

An overviewillustration of the MVAPACK LC-MS processing workflowa A) generic workflow for LC-MS spectra data processing and the corresponding B) MVAPACK Octave commands. Each processing step is handled with only a few lines of Octave code and the example script can handle a large volume of experimental replicates (see supplemental information).
Figure 3.

Comparison of software performances based on PCA models created from the simulated LC-MS metabolomics dataset. A) An example of a PCA scores plot generated by the new LC-MS-based functions implemented into the MVAPACK toolkit. The R2 and Q2 scores are 0.75477 and 0.72571, respectively. B) A compilation of the cumulative R2 scores for the first two components of the PCA models produced by MVAPACK, MS-DIALMSDial, and XCMSOnline as a function of increased noise and the percentage of missing features. The first bar in each plot corresponds to the cumulative R2 scores for the ideal feature matrix representing the ideal result for each dataset. Please note that, the cumulative R2 score decreases for the ideal matrix according to the increase in the number of missing features. C) The cumulative R2 scores for the first two components of the PCA models produced by MVAPACK, MS-DIALMSDial, and XCMSOnline are plotted versus sensitivity.
Figure 4.

Comparison of software performances based on PCA models created from the standard mixture LC-MS dataset. PCA scores plot generated by A) MVAPACK (R2 = 0.88298, Q2 = 0.84791), B) MS-DIALMSDial (R2 = 0.92499, Q2 = 0.89594), and C) XCMSOnline (R2 = 0.63398, Q2 = 0.51479). D) Venn diagram summarizing the number of features picked by each software platform picks. The identified features were counted as matching between two or more software programs if the m/z and retention times were within 0.1 Da and 10 seconds, respectively. E) Bar plot comparing the percentage of true positives, false positives, and false negative features compared to the total number of peaks picked by MVAPACK, MS-DIALMSDIAL, and XCMSOnline.
Figure 5.

Comparison of software performances based on PCA models created from the M. smegmatis LC-MS dataset. PCA scores plot generated by A) MVAPACK (R2 = 0.50773, Q2 = 0.43665), B) MS-DIALMSDial (R2 = 0.74270, Q2 = 0.70561), and C) XCMSOnline (R2 = 0.59050, Q2 = 0.54036). D) Venn diagram summarizing the number of features picked by each software platform picks.. Insert shows an expanded view of the Venn diagram highlighting the MVAPACK and MSDial results. The identified features were counted as matching between two or more software programs if the m/z and retention times were within 0.1 Da and 10 seconds, respectively.
Validation of LC-MS data processing with simulated dataset.
The LC-MS functionality added to MVAPACK was benchmarked against XCMSOnline and MS-DIAL using the simulated LC-MS dataset with various levels of added noise and missing metabolite features. The missing features corresponded to 0%, 10%, or 20%. Similarly, the added noise amounted to 0%, 5%, or 10%. The simulated spectra provided a ground truth corresponding to the 935 known statistically distinct metabolite peak features and an idealized PCA model for comparison. Each software platform performed only the initial feature matrix generation due to potential differences in matrix processing algorithms. The PCA algorithm implemented into MVAPACK was previously shown to agree with the SIMCA-P+ equivalent.36 The raw feature matrix was independently generated for the nine conditions using the three software platforms and each ground truth matrix. To ensure a fair comparison of software performance, feature normalization, filtration, and PCA model calculations were executed by the same MVAPACK script regardless of the software (see Supplemental Information).
Software performance was evaluated by comparing the metabolite features identified by MS-DIAL, MVAPACK, or XCMSOnline to the ground truth. Selected metabolites can be either true positives (TP) when the features exist in the ideal matrix or false positives (FP) when they are absent. Additionally, a false negative (FN) is an ideal metabolite feature with no counterpart in the matrix selected by the software platform. The TP, FP, and FN results were then used to calculate sensitivity (SI eqn. 3), positive predictive value (PPV) (SI eqn. 4), and F1 score (SI eqn. 5) values. Overall, higher TP rates, sensitivity and PPV percentages, and F1 scores are desirable, while lower FP and FN rates are desirable. The total number of metabolite features identified, the number of TP, FP, and FN peaks, and the calculated sensitivity, PPV, and F1 score values are listed in Table 3 for each simulated dataset and software platform. Initially, we used the default peak-picking parameters when processing the synthetic data with XCMSOnline and MS-DIAL (Table S2–S5). However, we observed reduced peak picking performance for both software compared to MVAPACK. We performed a simple grid search for the optimal peak processing parameters of XCMSOnline and MS-DIAL on the 0% missing and noise dataset to improve their performance on the synthetic data set. The parameter set with the highest sensitivity was then applied to the other eight versions of the simulated dataset (Table S6–S7). For XCMSOnline, the ppm peak width and signal-to-noise threshold were optimized to 30 and 12, respectively. For MS-DIAL, the minimum peak height and mass width were optimized to 100,000 counts and 0.05 Da, respectively. We compared the performance of both XCMSOnline and MS-DIAL using their respective optimized parameters for the simulated dataset.
Table 3.
Summary of Performance Metrics for the Simulated Dataset Using Optimized Parameters
| Software | Missing Featuresa | Noiseb | Real Featuresc | Features Identifiedd | True Positivese | False Positivesf | False Negativesg | Sensitivityh | PPVi | F1 Scorej | Mahalanobis Distancek | RV Coefficientl | R 2 m |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MS-DIAL | 0% | 0% | 935 | 1037 | 374 | 663 | 561 | 40% | 36% | 38% | 1412 | 0.9988 | 0.64 |
| MS-DIAL | 0% | 5% | 935 | 601 | 228 | 373 | 707 | 24% | 38% | 30% | 1599 | 0.9985 | 0.74 |
| MS-DIAL | 0% | 10% | 935 | 693 | 228 | 465 | 707 | 24% | 33% | 28% | 1950 | 0.9983 | 0.73 |
| MS-DIAL | 10% | 0% | 935 | 537 | 180 | 357 | 755 | 19% | 34% | 24% | 757 | 0.9980 | 0.55 |
| MS-DIAL | 10% | 5% | 935 | 468 | 166 | 302 | 769 | 18% | 35% | 24% | 666 | 0.9979 | 0.56 |
| MS-DIAL | 10% | 10% | 935 | 468 | 166 | 302 | 769 | 18% | 35% | 24% | 543 | 0.9973 | 0.57 |
| MS-DIAL | 20% | 0% | 935 | 579 | 205 | 374 | 730 | 22% | 35% | 27% | 105 | 0.9953 | 0.39 |
| MS-DIAL | 20% | 5% | 935 | 471 | 156 | 315 | 779 | 17% | 33% | 22% | 170 | 0.9953 | 0.40 |
| MS-DIAL | 20% | 10% | 935 | 424 | 151 | 273 | 784 | 16% | 36% | 22% | 157 | 0.9952 | 0.41 |
| Avg.(STD) | 586 (189) | 206 (69) | 380 (120) | 729 (69) | (22) 7 | (35) 2 | 27 (5) | 818 (681) | 0.9972 (0.015) | 0.55 (0.13) | |||
| MVAPACK | 0% | 0% | 935 | 691 | 528 | 163 | 407 | 56% | 76% | 64% | 5759 | 0.9285 | 0.76 |
| MVAPACK | 0% | 5% | 935 | 692 | 528 | 164 | 407 | 56% | 76% | 64% | 5674 | 0.9283 | 0.76 |
| MVAPACK | 0% | 10% | 935 | 686 | 526 | 160 | 409 | 56% | 77% | 66% | 5904 | 0.9494 | 0.76 |
| MVAPACK | 10% | 0% | 935 | 682 | 506 | 176 | 429 | 54% | 74% | 62% | 3094 | 0.8866 | |
| MVAPACK | 10% | 5% | 935 | 689 | 513 | 176 | 422 | 55% | 74% | 64% | 2520 | 0.8926 | 0.64 |
| MVAPACK | 10% | 10% | 935 | 682 | 501 | 181 | 434 | 54% | 73% | 62% | 3968 | 0.9556 | 0.65 |
| MVAPACK | 20% | 0% | 935 | 668 | 494 | 174 | 441 | 53% | 74% | 62% | 1684 | 0.9235 | 0.49 |
| MVAPACK | 20% | 5% | 935 | 686 | 500 | 186 | 435 | 53% | 73% | 61% | 1523 | 0.8899 | 0.50 |
| MVAPACK | 20% | 10% | 935 | 668 | 487 | 181 | 448 | 52% | 73% | 60% | 1601 | 0.9095 | 0.49 |
| Avg.(STD) | 683 (9) | 509 (15) | 173 (9) | 426 (15) | 54% (2) | 74% (2) | 63% (2) | 3525 (1863) | 0.9182(0.025) | 0.63 (0.12) | |||
| XCMSOnline | 0% | 0% | 935 | 889 | 735 | 154 | 200 | 79% | 83% | 81% | 2567 | 0.9995 | 0.61 |
| XCMSOnline | 0% | 5% | 935 | 740 | 612 | 128 | 323 | 65% | 83% | 73% | 1332 | 0.9994 | 0.60 |
| XCMSOnline | 0% | 10% | 935 | 630 | 520 | 110 | 415 | 56% | 83% | 66% | 1569 | 0.9993 | 0.61 |
| XCMSOnline | 10% | 0% | 935 | 758 | 580 | 178 | 355 | 62% | 77% | 69% | 2217 | 0.9993 | 0.45 |
| XCMSOnline | 10% | 5% | 935 | 634 | 489 | 145 | 446 | 52% | 77% | 62% | 1106 | 0.9992 | 0.46 |
| XCMSOnline | 10% | 10% | 935 | 550 | 420 | 130 | 515 | 45% | 76% | 57% | 912 | 0.9991 | 0.47 |
| XCMSOnline | 20% | 0% | 935 | 569 | 411 | 158 | 524 | 44% | 72% | 55% | 439 | 0.9990 | 0.37 |
| XCMSOnline | 20% | 5% | 935 | 494 | 369 | 125 | 566 | 39% | 75% | 52% | 356 | 0.9988 | 0.38 |
| XCMSOnline | 20% | 10% | 935 | 472 | 347 | 125 | 588 | 37% | 74% | 49% | 376 | 0.9988 | 0.38 |
| Avg.(STD) | 637 (136) | 498 (127) | 139 (21) | 437 (127) | 53% (14) | 78% (4) | 63% (10) | 1208 (754) | 0.9991 (0.0002) | 0.48 (0.10) | |||
| Total Avg. (Std)m | 635 (135) | 404 (164) | 231 (128) | 531 (164) | 43% (18) | 62% (20) | 51% (19) | 1013 (727) | 0.9981 (0.0014) | 0.52 (0.12) |
Percentage of data metabolite features missing among 10 data files.
Percentage of noise synthetically added through waveform functions.
Absolute number of real features present in the dataset.
Number of features identified by specified software.
Number of true features identified by the software.
Number of false features identified by the software.
Number of real metabolite features not identified by the software.
Sensitivity as defined by SI eqn. (3).
Positive predictive value (PPV) of the software as defined by SI eqn. (4).
F1 score of the software as defined by SI eqn. (5).
Mahalanobis distance between the two groups in the PCA scores plots.
RV coefficient42 calculated between the PCA scores from the idealized PCA model and each individual PCA model calculated for the simulated datasets.
Cumulative R2 values calculated for each PCA model.
Averages and standard deviations include unoptimized MVAPACK values from Table 3.
Overall, MVAPACK compared well, and in some cases outperformed, both MS-DIAL and XCMSOnline in the analysis of the simulated datasets. None of the software platforms correctly identified all the 935 known and statistically distinct metabolite features in the simulated dataset. The best outcome was 735 out of 935 features produced by XCMSOnline when the missing features and added noise were 0%. The worst outcome was 151 out of 935 peaks produced by optimized MS-DIAL parameters at 20% missing peaks and 10% added noise. The average number of true positive metabolites detected was 404 ± 164, in which XCMSOnline and MVAPACK were consistently above the average at 498 ± 127 and 509 ± 15, respectively. MS-DIAL was consistently below the average at 206 ± 69. Of course, the outcomes for the false negative features were simply the inverse of the true positives. The average number of false negatives was 531 ± 164, with both XCMSOnline and MVAPACK consistently below this average at 437±127 and 426 ± 15, respectively. Again, MS-DIAL was consistently above the average at 729 ± 69.
The average number of false positive features detected was 231 ± 128, in which XCMSOnline and MVAPACK were consistently below the average at 139 ± 121 and 173 ± 9, respectively. MS-DIAL was significantly above the average at 380 ± 120. Thus, XCMSOnline and MVAPACK performed similarly, having overall stronger performances than MS-DIAL in false positive, true positive, and false negative counts. Despite a lack of optimization, MVAPACK generally performed above the average, slightly better than XCMSOnline. MVAPACK’s performance remained consistent despite any noise increase or missing metabolite features. This is evident by the relatively small standard deviations observed for all the values summarized in Table 3. Under all these conditions, the performance of MS-DIAL and XCMSOnline deteriorated proportional to the decay in the quality of the simulated data.
The difference in MS-DIAL, MVAPACK, and XCMSOnline performance was further evident by comparing the sensitivity and PPV values in Table 3. The average sensitivity and PPV values were 43% ± 18% and 62% ± 20%, respectively. Again, MVAPACK performed better than average in both categories, with an average sensitivity of 54% ± 2% and an average PPV of 74% ± 2%. Conversely, MS-DIAL yielded both a low PPV of 35% ± 2% and a low sensitivity of 22% ± 7%, and XCMSOnline produced a better-than-average sensitivity of 53% but a significantly higher PPV of 78% ± 4%. Given the large variance between the sensitivity and PPV values for the three software platforms and the difficulty in discerning which parameter is more important, F1 scores (SI eqn. 5) were also calculated. The average F1 score was 51% ± 19%. MVAPACK had the highest F1 score of 63% ± 2% compared to 27% ± 5% for MS-DIAL and 63% ± 10% for XCMSOnline. This comparison suggests MVAPACK had the best overall performance in the analysis of the simulated dataset, although it is rivaled by the performance of XCMSOnline.
Software performance was also assessed by comparing the quality of each PCA model relative to an idealized PCA model calculated from the true matrix of 935 known statistically distinct metabolite features (Figure 3). One method to assess the overall quality of a PCA model is by comparing cumulative R2 values, where a higher R2 value indicates a better fit (Table 3). The R2 values for the first two principal components are plotted in Figure 3B for each dataset condition (i.e., different noise levels and percent missing peaks) and the three metabolomics software platforms. Each graph also contains, as the first bar, the idealized results for the true matrix. Under all conditions, MVAPACK performed the best, followed by MS-DIAL and then XCMSOnline. The decrease in performance of MS-DIAL relative to MVAPACK was generally equal to or greater than that of MVAPACK relative to the true matrix. Similarly, the decrease in the performance of XCMSOnline relative to MS-DIAL was often greater than that of MS-DIAL relative to MVAPACK. As expected, the performance of the three metabolomics platforms decreased as the number of missing features and noise levels increased. The performance for the true matrix also decreased as the dataset’s quality deteriorated. Interestingly, in all cases, the software performance was more sensitive to an increase in the percentage of missing features relative to an increase in noise. The software performance was relatively consistent despite increasing noise at a given percentage of missing peaks (Figure 3B). Similarly, additional false peaks added to the data matrix are added noise that may minimize group differences by diminishing the impact of real peak differences.
A further analysis of the relationship between the cumulative R2 values and the other performance metrics listed in Table 3 identified significant correlations (Table S8). An average correlation of 0.67 ± 0.17 was observed between sensitivity and the cumulative R2 values for the three metabolomics software packages (Figure 3C). Sensitivity (SI eqn. S3) accounts for true positives and false negatives. A similar positive correlation was observed between the cumulative R2 values and the total number of features identified (0.66 ± 0.15), true positives (0.75± 0.22), and F1 Scores (0.69 ± 0.14). A negative correlation was observed between missing features (−0.99 ± 0.01) and false negatives (−0.75 ± 0.22). Conversely, there was no clear correlation with added noise, PPV, or false positives. Note that PPV (SI eqn. 4) accounts for false positives.
The Mahalanobis distance was measured between the two groups in each PCA scores plot and compared to the results obtained from the idealized PCA model (Table 3). A maximal group separation with minimal within-group variation would result in a large Mahalanobis distance where the idealized PCA model yielded the largest possible Mahalanobis distance for the simulated datasets. We previously demonstrated the utility of the Mahalanobis distance for benchmarking PCA quality and validity.38 Furthermore, we ;38 and have identified that the inclusion and amount of noise in the data matrix are important contributors to the quality of a multivariate statistical model.38–41 MVAPACK produced the largest group separation in the PCA score plots with an average Mahalanobis distance of 3525 ± 1863 and a maximum distance of 5904, indicating the best overall group separation. The average group separation in the MVAPACK PCA model was nearly three times as great as both MS-DIAL and XCMSOnline. XCMSOnline performed slightly better than MS-DIAL with an average and maximum Mahalanobis distance of 1208 ± 754 and 2567, respectively, compared to 818 ± 681 and 1412. These differences are not significant. Not surprisingly, the Mahalanobis distance decreased as more noise was added, true positive features were removed, and more false negative peaks were detected. Like other statistical parameters, the number of missing metabolite features had the greatest detrimental effect on group separation in the PCA scores plots. The increase in missing features was typically correlated with a decrease in true positives and an increase in false negatives. Again, MVAPACK performed better than MS-DIAL and XCMSOnline, maintaining a larger group separation despite the deterioration in the dataset’s quality.
The idealized PCA models represent the best possible outcomes from the analysis of the simulated dataset. In this regard, the quality of the MS-DIAL, MVAPACK, and XCMSOnline PCA models can be further assessed by comparing the scores to this idealized PCA model. The similarities in the PCA scores were determined by calculating the RV coefficients42 and the squared Pearson correlation. Accordingly, the RV coefficient ranges from 0 to 1, where a score of 1 indicates the two sets of scores are identical. When using default parameters, MVAPACK had the highest average RV coefficient of 0.92 ± 0.03and MS-DIAL and XCMSOnline had slightly lower and similar results with average RV coefficients of 0.86 ± 0.03 and 0.83 ± 0.04, respectively (Table S5). This trend was inverted with optimized parameters, where MS-DIAL and XCMSOnline had much higher RV coefficients of 0.9972 ± 0.015 and 0.9991 ± 0.0002, respectively. Surprisingly, unlike the other statistical parameters, the RV coefficient did not vary as a function of added noise or missing metabolite features.
Validation of LC-MS data processing with the standard mixture dataset.
All three metabolomics software platforms were used to independently analyze an LC-MS dataset derived from a nine-compound standard mixture. The identical protocol was followed as described above for the simulated LC-MS dataset. There were nine mzML files comprising three groups, each containing three replicates. The three groups differed by a simple dilution factor. The LC-MS dataset was interrogated manually to confirm 97 real spectral features, including eight of the nine standards (Table 5). Reserpine was not identified manually or by any software. Due to poor initial performance with default values, a minimum intensity filter of 1×106 counts was added to the peak picking protocols for the three programs. The total number of metabolite features identified, the number of TP, FP, and FN peaks, and the calculated sensitivity, PPV, and F1 score values are listed in Table 5. Analysis was initially done using default parameters for each package, but the resulting metrics demonstrated a need for further refinement (Table S9). Further XCMSOnline and MS-DIAL parameter optimization was performed through a grid search of various parameters on the three replicates in the QC group. Picked peaks were compared to the manually curated list of 97 features. The parameter set with the highest sensitivity was then applied to the other two groups in the standard dataset (Table S10–S11). All tables and figures in the main text use these optimized parameters for XCMSOnline and MS-DIAL.
Surprisingly, MVAPACK, MS-DIAL, and XCMSOnline performed worse in the analysis of the standard mixture dataset compared to the simulated datasets. The decrease in performance may be attributed to fundamental differences between simulated and real experimental datasets. The presence of batch drift, spectral artifacts, chemical noise, larger variance and dynamic range in peak intensities, and non-uniform noise all contribute to a more challenging analysis of an experimental dataset that is difficult to reproduce with a simulated spectrum. Overall, MVAPACK, MS-DIAL, and XCMSOnline identified 287, 94, and 89 peaks, respectively (Table 4, Figure 4D). Of these 470 peaks, only 21 were shared by all three software, while 68 peaks were identified by two different programs.
Table 4.
Summary of Performance Metrics for the Standard Mixture Dataset Using Optimized Parameters
| Software | Real Featuresa | Features Identifiedb | True Positivesc | False Positivesd | False Negativese | Sensitivityf | PPVg | F1 Score h | Mahalanobis distance (G1-G2)i | Mahalanobis distance (G1-G3)j | Mahalanobis distance (G2-G3)k | RV coefficientl | R 2 m |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MS-DIAL | 97 | 94 | 41 | 53 | 56 | 42% | 44% | 43% | 553 | 874 | 712 | 0.9340 | 0.97 |
| MVAPACK | 97 | 287 | 62 | 225 | 35 | 64% | 22% | 32% | 277 | 660 | 519 | 0.7527 | 0.94 |
| XCMSOnline | 97 | 89 | 30 | 59 | 67 | 31% | 34% | 32% | 1662 | 5080 | 211 | 0.9900 | 0.85 |
| Average | 157 | 44 | 112 | 53 | 46% | 33% | 36% | 831 | 2205 | 481 | 0.892 | 0.92 | |
| Standard Deviation | 112 | 16 | 98 | 16 | 17% | 11% | 6% | 733 | 2492 | 253 | 0.124 | 0.06 |
Number of real peaks manually identified in the LC-MS spectra.
Number of peaks identified by each software.,
Number of peaks correctly identified by each software.
Number of additional peaks identified by the software not assigned as real peaks in the LC-MS spectra.
Number of peaks not identified by the software that were present in the spectra.
Sensitivity as defined by SI eqn. (3).
Positive predictive value (PPV) of the software as defined by SI eqn. (4).
F1 score of the software as defined by SI eqn. (5).
Mahalanobis distance between the first two groups of PCA scores.
Mahalanobis distance between the first and third groups of PCA scores.
Mahalanobis distance between the second and third groups of PCA scores.
RV coefficient42 calculated between the PCA scores from manually determined matrix and the automated processes.
Cumulative R2 values calculated for each PCA model.
None of the software platforms correctly identified all 97 of the manually verified spectral features associated with the nine-compound standard mixture (Table 4, Figure 4E). The best outcome was 62 out of 97 features produced by MVAPACK. MS-DIAL second with the correct identification of 41 features. XCMSOnline was the worst performer, with only 30 identified features out of 97. This is somewhat consistent with the performance observed with the simulated LC-MS dataset in which MVAPACK showed the strongest performance, although MS-DIAL now slightly edges out XCMSOnline. The average number of true positive metabolites detected was 44 ± 16. Of course, the outcomes for the false negative features were simply the inverse of the true positives. The average number of false negatives was 53 ± 16, with XCMSOnline above this average at 67.
Regarding false positives, the trend was again the reverse of the results obtained with the simulated datasets. MS-DIAL identified the fewest false positives, with the best outcome of 53 false positive features. The worst outcome was 225 false positives produced by MVAPACK, with XCMSOnline second with 59 false positives. The average number of false positives detected was 112 ± 98. Thus, both MS-DIAL and XCMSOnline sacrificed identifying true positives to reduce the number of false positives. Conversely, MVAPACK prioritized the identification of true positive peaks at the expense of a higher false positive rate.
The difference in the performance of MS-DIAL, MVAPACK, and XCMSOnline in the analysis of the standard mixture dataset was further evident by comparing the sensitivity and PPV values in Table 4. The average sensitivity and PPV values were 46% ± 17% and 33% ± 12%, respectively. The standard mixture analysis obtained PPV values of 34, 44, and 22% for XCMSOnline, MS-DIAL, and MVAPACK. This disparity in scoring also extends to recalls, where XCMSOnline, MS-DIAL, and MVAPACK have sensitivities of 31%, 42%, and 64%, respectively. Notably, the average recalls, 43% ± 18% and 46% ± 17%, obtained for the simulated and standard mixture datasets were statistically equivalent. Conversely, the PPV of 33% ± 11% for the standard mixture dataset was notably worse and statistically different than the 62% ± 20% obtained with the simulated dataset. F1 scores were again calculated further to clarify performance differences between the three software platforms. The average F1 score was 36% ± 6%, again statistically worse than the 51% ± 19% obtained with the simulated dataset. MS-DIAL had the highest F1 score of 43% compared to 32% for MVAPACK and XCMSOnline. This comparison somewhat flips the order of MS-DIAL and MVAPACK compared to the outcome with the simulated datasets.
Software performance was also assessed by comparing the quality of each PCA model relative to an idealized PCA model calculated from the true matrix of 97 known spectral features. A simple visual inspection of the PCA scores plots shows a clear separation between the three groups (Figures 4A–C). The overall appearance of the MVAPACK and MS-DIAL PCA scores plots are similar, consistent with the overall similarity in global feature identification. Conversely, XCMSOnline is noticeably different, primarily due to the limited displacement along PC2 between the three groups. Also, the magnitude of the PC1 variance (20) is somewhat higher when compared to MVAPACK (15) and MS-DIAL (10). Again, this is consistent with the difference in the performance of MS-DIAL in its feature identification. To further quantify group separations, the Mahalanobis distance was measured between the three groups in each PCA scores plot and compared to the results obtained from the idealized PCA model. The XCMSOnline PCA model produced the largest Mahalanobis distances, ranging from 211 to 5080. MVAPACK was second, with distances ranging from 277 to 660, close to the averages for the three software platforms. Again, the MS-DIAL and MVAPACK ranking was flipped relative to the simulated datasets. XCMSOnline exhibited the smallest distances, an order of magnitude smaller than MS-DIAL and consistent with the limited number of identified features. PCA score plots were also assessed by calculating the RV coefficients relative to the idealized PCA model. XCMSOnline had the highest RV coefficient of 0.9900 and MVAPACK was a close second with an RV coefficient of 0.7527. MS-DIAL had the lowest RV coefficient of 0.9340.
Validation of LC-MS data processing with biological dataset.
To further assess software performance, we used a realistic biological dataset derived from M. smegmatis cell lysates with or without treatment with D-Cycloserine (DCS), a second-line drug for tuberculosis. The dataset consists of two groups, controls versus treatment, each comprising ten biological replicates for 20 LC-MS spectra. We previously reported the major metabolic changes a DCS treatment induced into the M. smegmatis metabolome using NMR.43, 44 However, this information cannot be directly correlated to an LC-MS dataset.43, 44 Unlike the two synthetic datasets, this biological dataset has an unknown ground truth since the dataset was too complex for manual analysis to achieve a reliable ground truth. Accordingly, it was not possible to determine the number of TP, FP, and FN peaks or to calculate sensitivity, PPV, F1 score, and RV coefficients. Instead, we can only provide a limited assessment of the quality of PCA models and the similarities of the data matrices for each software platform (Figure 5). All analyses of the M. smegmatis dataset utilized the XCMSOnline and MS-DIAL parameters optimized for the standard dataset described in the previous section.
The three PCA scores plots appear similar, exhibiting good group separation and comparable within-group variance (Figures 5A–C). A closer examination does highlight a few differences. While the PC2 separation is nearly identical for all packages on the range (−20, 20). PC1 separation is similar for MS-DIAL and XCMSOnline (−40, 40) but less for MVAPACK (−30, 30). The differences in the R2 and Q2 quality factors were more relevant. The R2 values were 66.5%, 85.8%, and 69.0%, and the Q2 values were 54.6%, 78.2%, and 55.6% for MVAPACK, MS-DIAL, and XCMSOnline, respectively. Again, MVAPACK and XCMSOnline performed similarly and worse than MS-DIAL. The selected features show a further divergence between the MVAPACK, MS-DIAL, and XCMSOnline data matrix. A total of 120, 115, and 235 features (Figure 5D) were identified by MVAPACK, MS-DIAL, and XCMSOnline, respectively. A set of 20 features were identified by all three software platforms, and another 78 features were commonly selected by two programs. XCMSOnline selected the most peaks, of which 60.4% had no match with a data matrix from the other two programs. A similar outcome was seen with the simulated LC-MS dataset. MS-DIAL and MVAPACK had lower and comparable relative exclusion feature counts with 22.6% and 71.7% of their peaks, respectively, not identified by another software package.
Discussion
A novel LC-MS metabolomics dataset comprising simulated and experimental data was presented as a valuable and important asset for systematically benchmarking new functionality implemented into our MVAPACK toolkit.36 The LC-MS dataset also serves as a performance comparison tool, which allowed us to evaluate the capabilities and reliability of MVAPACK relative to other software packages routinely used by the metabolomics community. While new LC-MS packages are frequently evaluated with experimental data, a true validation of performance and an assessment of accuracy and reproducibility is difficult or impossible to achieve without ground truth. Furthermore, a negative or biased performance outcome may result from a manually curated LC-MS dataset. The inherent complexity of the data may easily lead to the inclusion of any number of false positive and false negative features identified as true. In this context, a new algorithm or software could produce a correct result and a better analysis of the dataset but, nevertheless, be heavily penalized concerning existing software due to these annotation errors that are incorrectly perceived as being correct. Simply put, the algorithm may accurately identify these features as false negatives or false positives, but the evaluation metrics would score these as incorrect classifications, downgrading the overall performance of the software. Instead, a simulated dataset provides full control over the construction, structure, and content of the dataset to establish a ground truth. The user has full control over the number of biological replicates per group, the number of groups, the spectral signal-to-noise, the present or absent peaks, and relative peak intensities and peak shapes. Furthermore, any statistical significance between peaks, spectra, or groups can be easily specified in the design of a simulated dataset. An experimental data set derived from a standard mixture may also provide a similar ground truth, but, importantly, under real spectral conditions. A lower level of spectral complexity than a simulated is achieved due to a limited number of metabolites in the standard mixtures and a corresponding decrease in spectral features. Further, not all the tunable parameters in simulated spectra can be similarly adjusted in an experimental spectrum of a known mixture. Thus, a simulated and experimental data set can provide useful, complementary information when assessing software performance.
Knowing which peaks and metabolite features exist within the simulated or experimental spectra allows for accurately calculating true positive, false positive, and false negative rates. Peak heights can also be easily defined and varied across replicates to establish statistically distinct features that are known to differentiate between the defined groups. A large, non-statistically valid variance in peak heights can be assigned to other spectral features to establish a known complex background and complicate the search for differential features. The group-independent variance in random features can be seen as surrogates for chemical noise, impurities, or simply metabolites that do not respond to the external stressor or genetic mutation. Further, the precise control over noise levels and missing spectral features provides a mechanism to stress-test any algorithm as a function of spectral quality. This is easily accomplished with a high level of control in a simulated data set by simply changing peak intensity values. Still, an experimental data set of a standard mixture requires careful and precise adjustments of metabolite concentrations. Accordingly, our simulated LC-MS metabolomics dataset consists of 2,673 metabolite features, of which 935 are statistically distinct between the two defined groups, comprising 10 replicates each. The 935 statistically significant features have an FC greater than 2 and a CV less than or equal to 25%.19–22 Our dataset consists of 9 sets of spectra (Table 1), where the quality of each subsequent set of spectra is decreased by the addition of an increasing amount of noise (0%, 5%, 10%) and missing spectral features (0, 10, 20%). Creating a simulated dataset with nine distinct quality levels enables granular analysis of algorithm sensitivity to input replicate quality. Likewise, the small size of the files (approximately 50 megabytes per mzML file) enabled rapid analysis of the files by each algorithm used. In addition to the simulated dataset, our LC-MS dataset for assessing software performance includes an experimental LC-MS dataset consisting of a standard mixture of 9 metabolites (Table 2) that yielded 97 annotated features from the manual analysis. The dataset contained three groups of ten replicates where the three groups were differentiated by serial dilutions, 1:1, 1:2, and 1:4. A second LC-MS experimental dataset is also included that consisted of a polar extract of lysed M. smegmatis cells, a non-pathogenic surrogate for tuberculosis. The dataset contained two groups of ten replicates where one set of cell cultures was treated with a sub-lethal dose of DCS, a second-line treatment of TB.45 Our simulated and experimental LC-MS metabolomics datasets are freely accessible to benchmark the performance of new or existing software or algorithms (https://git.unl.edu/powers-group).
The initial implementation of our MVAPACK software provided a complete processing pipeline from raw 1D or 2D NMR metabolomics data to validated statistical models.36 Accordingly, MVAPACK provided a diversity of existing functions for the inputting, pre-processing (alignment, normalization, scaling, denoising, etc.), statistical modeling (PCA, OPLS, LDA, etc.), visualization (scores plots, backscaled-loadings plots, S-plots, volcano plots, etc.), and validation (CV-ANOVA, cross-permutation, tree diagrams, etc.) of a data matrix. Thus, incorporating the complete analysis and processing of GC/LC-MS metabolomics data into MVAPACK only required adding new functions to convert standard GC/LC-MS metabolomics data files (e.g., mzML, mzML, proteoWizard.txt) into a data matrix compatible with existing MVAPACK functions. This was achieved by adding 40 new functions written in GNU Octave to the current distribution of MVAPACK (Table S1). Accordingly, MVAPACK now provides a complete data processing pipeline for LC-MS metabolomics data consisting of input/output, peak alignment, feature identification, normalization, imputation, and general data processing. It is important to note that the functions added to MVAPACK were all previously described in the scientific literature and correspond to established and widely used algorithms. No new algorithms were developed for this implementation of the LC-MS processing pipeline. To our knowledge, MVAPACK is now the only metabolomics toolkit capable of providing a complete data processing pipeline for GC/LC-MS, 1D, and 2D NMR metabolomics datasets. MVAPACK can also mix multiple datasets from distinct analytical sources to create a unified statistical model by using multi-block versions of PCA, PLS, and OPLS methods.46, 47 Figures 3A, 4A, and 5A provide representative PCA score plots calculated by MVAPACK utilizing the simulated, standard mixture, and M. smegmatis LC-MS metabolomics datasets as diagramed in Figure 2.
MVAPACK users can rely on the package as an all-in-one toolbox for building high-level statistical models from raw spectra. The diversity of supported input data types is a major asset for MVAPACK as researchers have significantly lower technical overheads when conducting metabolomics data analyses. Analysis of MVAPACK, XCMSOnline, and MS-DIAL parameters (Tables S2–S4) show that MVAPACK’s default settings are consistent with those in other packages (Table S12), indicating it is similarly suited to general-purpose use. LC-MS analysis with MVAPACK can be performed in less than an hour, regardless of input data size. Accessibility to the underlying functions allows users to work up mzML files in parallel as peak picking relies only on data in a single file. Dozens of input files can be handled in parallel across batch jobs on UNIX-based clusters, which are commonplace in academic research environments. Datasets containing tens of thousands of peaks, metabolites feature matrix refinement, and selection are performed in minutes. This is in stark contrast to desktop and webserver implementations of LC-MS metabolomics processing, which analyze data in serial, leading to analyses taking hours or upwards of a day or more to complete. MVAPACK can also be run on a single processor but still provides versatility for researchers.
An exhaustive benchmarking of MVAPACK was undertaken to ensure our software’s accurate and reliable performance by comparing its output with that of other popular software in the field. MS-DIAL and XCMSOnline were selected as standards, given their prominence in the metabolomics community and free accessibility. The performance of the three metabolomics software platforms was compared using our LC-MS simulated dataset and two experimental datasets. These three LC-MS datasets provided performance benchmarking across various data quality levels. The simulated dataset is small, relatively clean, and provides a known ground truth for assessing peak picking performance and metabolite feature identification. The simulated dataset has increased levels of real noise and missing peaks but still represents a relatively simple composition to analyze compared to true experimental spectra. The LC-MS spectra collected with real biological samples contain chemical noise, spectral noise, spectral artifacts, and batch-order drift commonly encountered with experimental metabolomics datasets. Accordingly, the experimental datasets provide a more severe stress test for the software packages, but at the expense of limited or no-known ground truths. Overall software performance was quantified and compared by measuring sensitivity, PPV, and F1 scores calculated from the true positives, false positives, and false negatives. PCA models were also collected from the data matrix produced by the three software platforms and compared to a PCA model created from an ideal matrix. The ideal data matrix could only be defined from the simulated and standard mixture datasets.
Our benchmarking (Tables 3, 4, and Figure 3B) demonstrates that MVAPACK has a similar or better performance relative to XCMSOnline and MS-DIAL. MVAPACK’s performance was less sensitive to increased noise or metabolite feature removal than the other two packages. Not surprisingly given their similar peak identification algorithms, MVAPACK’s overall performance was closest to XCMSOnline, where the relative performance ranking was somewhat reversed between the simulated and standard mixture datasets. Conversely, MS-DIAL exhibited notably worse performance on the synthetic dataset but improved significantly on the standard and biological datasets. Again, this grouping in performance can be partially explained by the fact that the peak picking used by MVAPACK was based on the Gaussian derivative waveform used by XCMSOnline. XCMSOnline identified significantly more features and fewer false positives than MVAPACK and MS-DIAL with the simulated LC-MS dataset. However, this performance decreased considerably as the spectral quality was reduced, a trend reversed for the standard mixture dataset. All three software programs performed significantly worse with the standard mixture dataset than the simulated dataset in all facets except PCA model quality. The simulated dataset demonstrated that a decrease in the overall quality of the LC-MS dataset based on an increase in noise and missing peaks lead to a pronounced decrease in software performance. Thus, the decrease in software performance with the standard mixture dataset represents a continuation of this trend. The level of noise and artifacts was higher in the experimental datasets relative to the simulated dataset. the software performance factors obtained with the lowest quality simulated dataset (10% noise, 20% missing peaks) were on par with the results obtained with the standard mixture dataset. Differences in performance are difficult to explain through parameter comparison alone as all three packages share few analogous parameters, and those shared values are generally comparable and were optimized when different (Table S2–S4, S6–S8, S10–12). Our results show the software packages may be marginally suited to different LC-MS analysis tasks as each package exposes a mostly unique set of parameters that cannot be readily accessed in comparable packages. It should also be noted that results for any software platform may be improved via parameter tuning, and our results suggest that parameter tuning can largely remove differences, though some will still exist.
In general, the PCA models created by all three software platforms for either of the three datasets were quite visually comparable despite clear differences in the data matrices used to create the multivariate statistical models (Figures 4A–C, 5A–C). The resulting PCA scores plots showed excellent group separation and reliable models. The cumulative R2 values in Figure 3B provided the best comparative evaluation of software performance based on the PCA model. MVAPACK performed notably better than MS-DIAL and XCMSOnline and was comparable to the ideal model. PCA models generated by XCMSOnline had the lowest cumulative R2 across all spectral quality settings. Notably, the cumulative R2 values were shown to correlate with most of the performance metrics listed in Table 3, indicating this composite metric robustly described the overall quality of the data matrix created by MVAPACK, MS-DIAL, and XCMSOnline (Table S8). The cumulative R2 values nicely capture the overall decay in software performance as a function of missing features and added noise, which leads to the observed increase in false positives and false negatives peaks and a decrease in true positive peaks. An interesting observation was the significantly higher sensitivity by all three software platforms to missing features relative to increased added noise to the data matrix. The cumulative R2 values exhibited no correlation with the amount of added Gaussian noise, while the correlation with missing features closely approached the unit in magnitude (−0.98 ± 0.01). Nevertheless, missing true features or additional false features can be considered as alternative sources of noise added to the data matrix. It may be viewed as an extreme version of added Gaussian noise (i.e., mask true peaks).
Quantifying differences between the PCA models by other means, such as measuring Mahalanobis distances between groups, was not particularly informative given similar large group separations. The only notable exception was observed with the XCMSOnline analysis of the standard mixture dataset. The relative group separation and PC1 values were significantly higher for XCMSOnline than both MVAPACK and MS-DIAL (Figure 4A–C). Surprisingly, the RV coefficients calculated by comparing the principal components between an ideal PCA model and each calculated model were completely uninformative. It was not platform, noise, or missing feature dependent and seemed to provide random results. Thus, despite the data matrices calculated by each software platform being unique and containing different sets of selected and missed features relative to a perfect data matrix, the RV coefficients were quite similar. An average RV coefficient for the simulated and standard mixture databases were calculated as 0.99 ± 0.00 and 0.89 ± 0.12, respectively. A modest but small difference was observed between the two datasets. Overall, our results suggest a PCA model provides minimal utility in evaluating software performance. This is likely a result of the large, inherent data reduction that is the intended outcome of PCA. The data reduction likely masks intrinsic differences in the original data matrices.
The observation that the PCA model essentially hides underlying differences in the raw data matrices was quite apparent in the performance comparison using the M. smegmatis LC-MS metabolomics dataset. Visually, (Figure 5) the three PCA scores plots were quite similar and yielded the expected outcome of a large group separation between the drug-treated and untreated M. smegmatis cell cultures. The addition of DCS, a known antibiotic used as a second line treatment of TB, would be expected to induce a significant metabolic response. Although we observed reasonable overall.43, 44 More troubling was the low similarity between the data matrices produced by the three software platforms used to generate the PCA models, the resulting PCA models showed much tighter agreement (Figure 5D). As observed with the simulated LC-MS dataset, XCMSOnline and MVAPACK both identified more spectral features than , with XCMSOnline finding the most overall. Out of the 470 total features identified, a set of 20 were identified by the three metabolomics software platforms, but 78.2%, 22.6%, and 60.4% of the remaining features were uniquely defined by MS-DIAL, MVAPACK, and XCMSOnline, respectively. In total, 254 spectral features were identified by a single software program. While these software programs and the underlying algorithms are in wide use, these outcomes raise some serious questions about the robustness and reliability of the feature selection protocols. We observed a similar inconsistency in identified lipids when comparing different LC methods for an LC-MS lipidomics study despite all other experimental parameters being identical.48 Furthermore, a recent article by Li et al. (2023) describing their new Asari LC-MS metabolomics data processing software observed similar discrepancies when comparing its performance to XCMS, MZmine, and MS-DIAL using two experimental LC-MS datasets.49 Our and other results strongly suggest that further development and understanding of overall performance and reproducibility of metabolomics software is an exciting new direction for research in the community that will stand to benefit the field at large. Our simulated LC-MS dataset may provide a valuable asset for these future endeavors.
Herein, we described the creation of a unique metabolomics package capable of analyzing both NMR and LC-MS datasets. We additionally created a simulated dataset for high fidelity benchmarking and performed validation across three distinct datasets as well as a comparison of the results created by MVAPACK versus both MS-DIAL and XCMSOnline. We anticipate that this work will serve as a blueprint for other researchers in the field to perform robust validation of metabolomics packages and algorithms with ours or other similar simulated datasets to advance the field and identify areas of need. While we introduced quality variations, further refinements could be made to optimize the simulated data set by incorporating other commonly encountered LC-MS spectral issues. Future efforts may include adding differences in spectral resolution, peak intensities, and drift times, and the presence of different adducts, oligomers, batch variation, chemical noise, and other spectral artifacts. MVAPACK is freely available for download by interested users at https://bionmr.unl.edu/mvapack.php. Similarly, the corresponding documentation, simulated dataset, and all associated scripts can be downloaded from https://git.unl.edu/powers-group to both replicate these results and further develop simulated datasets.
Supplementary Material
Acknowledgements
This material is based upon work supported by the National Science Foundation under Grant Number 1660921 to RP and by the National Institute of General Medical Sciences (NIH R35 MIRA 1R35GM147706) to JDY. This work was also supported in part by funding from the Nebraska Center for Integrated Biomolecular Communication (P20GM113126, NIGMS). The research was performed in facilities renovated with support from the National Institutes of Health (RR015468-01).
Footnotes
Associated Content
The Supporting Information is available free of charge on the ACS Publications website at DOI: Materials and methods related to generation of the simulated dataset, the standard mixture, and the standard Mycobacterium smegmatis metabolomics sample; LC-MS data acquisition, software implementation, and validation and benchmarking; and examples of Octave/MVAPACK scripts used to process each LC-MS dataset. Table S1 lists the LC-MS functions added to MVAPACK. Tables S2–S4 list the MVAPACK, MS-DIAL, and XCMSOnline processing parameters. Table S8 lists the cumulative R2 values correlated with performance metrics presented in Table 3. Table S12 compares the analysis parameters MVAPACK, MS-DIAL, and XCMSOnline used. Table S13 lists the compound statistics used to generate the simulated LC-MS data set.
The authors declare no competing financial interest.
References
- (1).Idle JR; Gonzalez FJ Metabolomics. Cell Metabolism 2007, 6 (5), 348–351. DOI: 10.1016/j.cmet.2007.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (2).Luo X; Li L Metabolomics of Small Numbers of Cells: Metabolomic Profiling of 100, 1000, and 10000 Human Breast Cancer Cells. Analytical Chemistry 2017, 89 (21), 11664–11671. DOI: 10.1021/acs.analchem.7b03100. [DOI] [PubMed] [Google Scholar]
- (3).Ramautar R; Mayboroda OA; Somsen GW; de Jong GJ CE-MS for metabolomics: Developments and applications in the period 2008–2010. ELECTROPHORESIS 2011, 32 (1), 52–65. DOI: 10.1002/elps.201000378. [DOI] [PubMed] [Google Scholar]
- (4).Huang Q; Tan Y; Yin P; Ye G; Gao P; Lu X; Wang H; Xu G Metabolic Characterization of Hepatocellular Carcinoma Using Nontargeted Tissue Metabolomics. Cancer Research 2013, 73 (16), 4992–5002. DOI: 10.1158/0008-5472.Can-13-0308 (acccessed 2/5/2023). [DOI] [PubMed] [Google Scholar]
- (5).Nobeli I; Thornton JM A bioinformatician’s view of the metabolome. BioEssays 2006, 28 (5), 534–545. DOI: 10.1002/bies.20414. [DOI] [PubMed] [Google Scholar]
- (6).Wishart DS Metabolomics for Investigating Physiological and Pathophysiological Processes. Physiological Reviews 2019, 99 (4), 1819–1875. DOI: 10.1152/physrev.00035.2018. [DOI] [PubMed] [Google Scholar]
- (7).Crotty GF; Maciuca R; Macklin EA; Wang J; Montalban M; Davis SS; Alkabsh JI; Bakshi R; Chen X; Ascherio A; et al. Association of caffeine and related analytes with resistance to Parkinson disease among LRRK2 mutation carriers. A metabolomic study 2020, 95 (24), e3428–e3437. DOI: 10.1212/wnl.0000000000010863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (8).Arneth B; Arneth R; Shams M Metabolomics of Type 1 and Type 2 Diabetes. International Journal of Molecular Sciences 2019, 20 (10), 2467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (9).Huang M; Zhao H; Gao S; Liu Y; Liu Y; Zhang T; Cai X; Li Z; Li L; Li Y; et al. Identification of coronary heart disease biomarkers with different severities of coronary stenosis in human urine using non-targeted metabolomics based on UPLC-Q-TOF/MS. Clinica Chimica Acta 2019, 497, 95–103. DOI: 10.1016/j.cca.2019.07.017. [DOI] [PubMed] [Google Scholar]
- (10).Chen X; Yu D Metabolomics study of oral cancers. Metabolomics 2019, 15 (2), 22. DOI: 10.1007/s11306-019-1483-8. [DOI] [PubMed] [Google Scholar]
- (11).Castelli FA; Rosati G; Moguet C; Fuentes C; Marrugo-Ramírez J; Lefebvre T; Volland H; Merkoçi A; Simon S; Fenaille F; et al. Metabolomics for personalized medicine: the input of analytical chemistry from biomarker discovery to point-of-care tests. Analytical and Bioanalytical Chemistry 2022, 414 (2), 759–789. DOI: 10.1007/s00216-021-03586-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (12).Powers R The Current State of Drug Discovery and a Potential Role for NMR Metabolomics. Journal of Medicinal Chemistry 2014, 57 (14), 5860–5870. DOI: 10.1021/jm401803b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (13).Chen C-J; Lee D-Y; Yu J; Lin Y-N; Lin T-M Recent advances in LC-MS-based metabolomics for clinical biomarker discovery. Mass Spectrometry Reviews 2022, n/a (n/a), e21785. DOI: 10.1002/mas.21785. [DOI] [PubMed] [Google Scholar]
- (14).Kaspy MS; Semnani-Azad Z; Malik VS; Jenkins DJA; Hanley AJ Metabolomic profile of combined healthy lifestyle behaviours in humans: A systematic review. PROTEOMICS 2022, 22 (18), 2100388. DOI: 10.1002/pmic.202100388. [DOI] [PubMed] [Google Scholar]
- (15).Kumar A; Misra BB Challenges and Opportunities in Cancer Metabolomics. PROTEOMICS 2019, 19 (21–22), 1900042. DOI: 10.1002/pmic.201900042. [DOI] [PubMed] [Google Scholar]
- (16).Plumb RS; Gethings LA; Rainville PD; Isaac G; Trengove R; King AM; Wilson ID Advances in high throughput LC/MS based metabolomics: A review. TrAC, Trends Anal. Chem 2023, 160, 116954. DOI: 10.1016/j.trac.2023.116954. [DOI] [Google Scholar]
- (17).Schrimpe-Rutledge AC; Codreanu SG; Sherrod SD; McLean JA Untargeted Metabolomics Strategies-Challenges and Emerging Directions. J. Am. Soc. Mass Spectrom 2016, 27 (12), 1897–1905. DOI: 10.1007/s13361-016-1469-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (18).Dettmer K; Aronov PA; Hammock BD Mass spectrometry-based metabolomics. Mass Spectrom. Rev 2007, 26 (1), 51–78, 10.1002/mas.20108. DOI: 10.1002/mas.20108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (19).Schiffman C; Petrick L; Perttula K; Yano Y; Carlsson H; Whitehead T; Metayer C; Hayes J; Rappaport S; Dudoit S Filtering procedures for untargeted LC-MS metabolomics data. BMC Bioinformatics 2019, 20 (1), 334. DOI: 10.1186/s12859-019-2871-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (20).Wandy J; Davies V; Van Der Hooft JJJ; Weidt S; Daly R; Rogers S In silico optimization of mass spectrometry fragmentation strategies in metabolomics. Metabolites 2019, 9 (10), 219. DOI: 10.3390/metabo9100219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (21).Ortmayr K; Charwat V; Kasper C; Hann S; Koellensperger G Uncertainty budgeting in fold change determination and implications for non-targeted metabolomics studies in model systems. Analyst 2017, 142 (1), 80–90, 10.1039/C6AN01342B. DOI: 10.1039/C6AN01342B. [DOI] [PubMed] [Google Scholar]
- (22).Xia J; Mandal R; Sinelnikov IV; Broadhurst D; Wishart DS MetaboAnalyst 2.0—a comprehensive server for metabolomic data analysis. Nucleic Acids Research 2012, 40 (W1), W127–W133. DOI: 10.1093/nar/gks374 (acccessed 3/10/2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- (23).Bourgon R; Gentleman R; Huber W Independent filtering increases detection power for high-throughput experiments. Proc Natl Acad Sci U S A 2010, 107 (21), 9546–9551. DOI: 10.1073/pnas.0914005107 From NLM. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (24).Gorrochategui E; Jaumot J; Lacorte S; Tauler R Data analysis strategies for targeted and untargeted LC-MS metabolomic studies: Overview and workflow. TrAC Trends in Analytical Chemistry 2016, 82, 425–442. DOI: 10.1016/j.trac.2016.07.004. [DOI] [Google Scholar]
- (25).Cao M; Liu Y; Jiang W; Meng X; Zhang W; Chen W; Peng D; Xing S UPLC/MS-based untargeted metabolomics reveals the changes of metabolites profile of Salvia miltiorrhiza bunge during Sweating processing. Scientific Reports 2020, 10 (1), 19524. DOI: 10.1038/s41598-020-76650-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (26).Chong WPK; Thng SH; Hiu AP; Lee D-Y; Chan ECY; Ho YS LC-MS-based metabolic characterization of high monoclonal antibody-producing Chinese hamster ovary cells. Biotechnology and Bioengineering 2012, 109 (12), 3103–3111. DOI: 10.1002/bit.24580. [DOI] [PubMed] [Google Scholar]
- (27).Wold S; Sjöström M; Eriksson L PLS-regression: a basic tool of chemometrics. Chemometrics and Intelligent Laboratory Systems 2001, 58 (2), 109–130. DOI: 10.1016/S0169-7439(01)00155-1. [DOI] [Google Scholar]
- (28).Bijlsma S; Bobeldijk I; Verheij ER; Ramaker R; Kochhar S; Macdonald IA; van Ommen B; Smilde AK Large-Scale Human Metabolomics Studies: A Strategy for Data (Pre-) Processing and Validation. Analytical Chemistry 2006, 78 (2), 567–574. DOI: 10.1021/ac051495j. [DOI] [PubMed] [Google Scholar]
- (29).Yin P; Xu G Current state-of-the-art of nontargeted metabolomics based on liquid chromatography–mass spectrometry with special emphasis in clinical applications. Journal of Chromatography A 2014, 1374, 1–13. DOI: 10.1016/j.chroma.2014.11.050. [DOI] [PubMed] [Google Scholar]
- (30).Tsugawa H; Cajka T; Kind T; Ma Y; Higgins B; Ikeda K; Kanazawa M; VanderGheynst J; Fiehn O; Arita M MS-DIAL: data-independent MS/MS deconvolution for comprehensive metabolome analysis. Nature Methods 2015, 12 (6), 523–526. DOI: 10.1038/nmeth.3393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (31).Smith CA; Want EJ; O’Maille G; Abagyan R; Siuzdak G XCMS: Processing Mass Spectrometry Data for Metabolite Profiling Using Nonlinear Peak Alignment, Matching, and Identification. Analytical Chemistry 2006, 78 (3), 779–787. DOI: 10.1021/ac051437y. [DOI] [PubMed] [Google Scholar]
- (32).Jeppesen MJ; Powers R Multiplatform untargeted metabolomics. Magn Reson Chem 2023. DOI: 10.1002/mrc.5350 From NLM. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (33).Sansone S-A; Fan T; Goodacre R; Griffin JL; Hardy NW; Kaddurah-Daouk R; Kristal BS; Lindon J; Mendes P; Morrison N; et al. The Metabolomics Standards Initiative. Nat. Biotechnol 2007, 25 (8), 846–848. DOI: 10.1038/nbt0807-846b. [DOI] [PubMed] [Google Scholar]
- (34).Salek RM; Neumann S; Schober D; Hummel J; Billiau K; Kopka J; Correa E; Reijmers T; Rosato A; Tenori L; et al. COordination of Standards in MetabOlomicS (COSMOS): facilitating integrated metabolomics data access. Metabolomics 2015, 11 (6), 1587–1597. DOI: 10.1007/s11306-015-0810-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (35).Beger RD; Dunn WB; Bandukwala A; Bethan B; Broadhurst D; Clish CB; Dasari S; Derr L; Evans A; Fischer S; et al. Towards quality assurance and quality control in untargeted metabolomics studies. Metabolomics 2019, 15 (1), 1–5. DOI: 10.1007/s11306-018-1453-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (36).Worley B; Powers R MVAPACK: A Complete Data Handling Package for NMR Metabolomics. ACS Chemical Biology 2014, 9 (5), 1138–1144. DOI: 10.1021/cb4008937. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (37).Sturm M; Bertsch A; Gröpl C; Hildebrandt A; Hussong R; Lange E; Pfeifer N; Schulz-Trieglaff O; Zerck A; Reinert K; et al. OpenMS – An open-source software framework for mass spectrometry. BMC Bioinformatics 2008, 9 (1), 163. DOI: 10.1186/1471-2105-9-163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (38).Worley B; Powers R PCA as a Practical Indicator of OPLS-DA Model Reliability. Curr. Metabolomics 2016, 4 (2), 97–103, 10.2174/2213235X04666160613122429. DOI: . [DOI] [PMC free article] [PubMed] [Google Scholar]
- (39).Halouska S; Powers R Negative impact of noise on the principal component analysis of NMR data. J. Magn. Reson 2006, 178 (1), 88–95, 10.1016/j.jmr.2005.08.016. DOI: 10.1016/j.jmr.2005.08.016. [DOI] [PubMed] [Google Scholar]
- (40).Vu T; Riekeberg E; Qiu Y; Powers R Comparing normalization methods and the impact of noise. Metabolomics 2018, 14, 108, 10.1007/s11306–018-1400–6. DOI: 10.1007/s11306-018-1400-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (41).Vu T; Siemek P; Bhinderwala F; Xu Y; Powers R Evaluation of Multivariate Classification Models for Analyzing NMR Metabolomics Data. J. Proteome Res 2019, 18 (9), 3282–3294, 10.1021/acs.jproteome.9b00227. DOI: 10.1021/acs.jproteome.9b00227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (42).Robert P; Escoufier Y A Unifying Tool for Linear Multivariate Statistical Methods: The RV- Coefficient. Journal of the Royal Statistical Society. Series C (Applied Statistics) 1976, 25 (3), 257–265. DOI: 10.2307/2347233 (acccessed 2023/04/13/).JSTOR. [DOI] [Google Scholar]
- (43).Halouska S; Chacon O; Fenton RJ; Zinniel DK; Barletta RG; Powers R Use of NMR Metabolomics To Analyze the Targets of D-Cycloserine in Mycobacteria: Role of D-Alanine Racemase. J. Proteome Res 2007, 6 (12), 4608–4614, 10.1021/pr0704332. DOI: 10.1021/pr0704332. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (44).Halouska S; Fenton RJ; Zinniel DK; Marshall DD; Barletta RG; Powers R Metabolomics analysis identifies D-Alanine-D-Alanine ligase as the primary lethal target of D-cycloserine in mycobacteria. J. Proteome Res 2014, 13 (2), 1065–1076, 10.1021/pr4010579. DOI: 10.1021/pr4010579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (45).Caminero JA; Sotgiu G; Zumla A; Migliori GB Best drug treatment for multidrug-resistant and extensively drug-resistant tuberculosis. Lancet Infect. Dis 2010, 10 (9), 621–629. DOI: 10.1016/s1473-3099(10)70139-0. [DOI] [PubMed] [Google Scholar]
- (46).Xu Y; Goodacre R Multiblock principal component analysis: an efficient tool for analyzing metabolomics data which contain two influential factors. Metabolomics 2012, 8 (1), 37–51. DOI: 10.1007/s11306-011-0361-9. [DOI] [Google Scholar]
- (47).Worley B; Powers R A sequential algorithm for multiblock orthogonal projections to latent structures. Chemom. Intell. Lab. Syst 2015, 149 (Part B), 33–39, 10.1016/j.chemolab.2015.10.018. DOI: 10.1016/j.chemolab.2015.10.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (48).Sakallioglu IT; Maroli AS; Leite ADL; Powers R A reversed phase ultra-high-performance liquid chromatography-data independent mass spectrometry method for the rapid identification of mycobacterial lipids. J. Chromatogr. A 2022, 1662, 462739. DOI: 10.1016/j.chroma.2021.462739. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (49).Li S; Siddiqa A; Thapa M; Chi Y; Zheng S Trackable and scalable LC-MS metabolomics data processing using asari. Nature Communications 2023, 14 (1), 4113. DOI: 10.1038/s41467-023-39889-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
