Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2022 Jan 24;17(1):e0262545. doi: 10.1371/journal.pone.0262545

Molecular perturbations in pulmonary tuberculosis patients identified by pathway-level analysis of plasma metabolic features

Nguyen Phuoc Long 1,2, Da Young Heo 2, Seongoh Park 3, Nguyen Thi Hai Yen 1,2, Yong-Soon Cho 1,2, Jae-Gook Shin 1,2,4, Jee Youn Oh 5,*, Dong-Hyun Kim 2,*
Editor: Selvakumar Subbian6
PMCID: PMC8786114  PMID: 35073339

Abstract

Insight into the metabolic biosignature of tuberculosis (TB) may inform clinical care, reduce adverse effects, and facilitate metabolism-informed therapeutic development. However, studies often yield inconsistent findings regarding the metabolic profiles of TB. Herein, we conducted an untargeted metabolomics study using plasma from 63 Korean TB patients and 50 controls. Metabolic features were integrated with the data of another cohort from China (35 TB patients and 35 controls) for a global functional meta-analysis. Specifically, all features were matched to a known biological network to identify potential endogenous metabolites. Next, a pathway-level gene set enrichment analysis-based analysis was conducted for each study and the resulting p-values from the pathways of two studies were combined. The meta-analysis revealed both known metabolic alterations and novel processes. For instance, retinol metabolism and cholecalciferol metabolism, which are associated with TB risk and outcome, were altered in plasma from TB patients; proinflammatory lipid mediators were significantly enriched. Furthermore, metabolic processes linked to the innate immune responses and possible interactions between the host and the bacillus showed altered signals. In conclusion, our proof-of-concept study indicated that a pathway-level meta-analysis directly from metabolic features enables accurate interpretation of TB molecular profiles.

Introduction

Tuberculosis (TB) is a devastating infectious disease, and an estimated 1.7 billion people are latently infected globally [1]. Despite extensive efforts, TB remains a leading cause of mortality worldwide, especially in countries where it is endemic. According to the World Health Organization Global Report (2020), there were around 10 million newly diagnosed TB patients in 2019, and approximately 1.4 million deaths [2]. TB has a broad pathophysiological spectrum, hampering eradication efforts [3]. A holistic model based on high-dimensional data is required to describe host-response endotype characteristics in general, and the TB immune endotype in particular. Specifically, -omics technologies have facilitated the discovery of clinically useful biomarkers for risk assessment, diagnosis, and prediction of clinical events. For instance, after performing a comprehensive analysis of plasma pulmonary TB samples, and samples from community-acquired pneumonia patients, lung cancer patients and normal controls, Huang et al. introduced xanthine, 4-pyridoxate, and d-glutamic acid as potential biomarkers [4]. Sweeney3 (GBP5, DUSP3, KLF2), a host-response-based gene signature, met the criteria of the World Health Organization/Foundation for Innovative New Diagnostics target product profile for a non-sputum-based triage test [5]. Comprehensive -omics data and appropriate analytical methods enable investigation of drug efficacy, personalized dosing, prediction of relapse-free cure, and phenotypic drug susceptibility testing, as aspects of personalized precision medicine [68].

Studies of host-response transcriptome biosignatures have achieved considerable success in terms of stratifying TB patients for the purposes of risk prediction [9], diagnosis [10], treatment monitoring, outcome prediction [11], and recurrence prediction [12]. Blood metabolic responses have also been tracked based on the “blood metabolic signature,” which partially reflects the interaction between the human body and Mycobacterium tuberculosis (M. tuberculosis) bacilli. The metabolic responses of TB patients may aid predictions of risk, diagnosis, and outcomes, as well as treatment monitoring [13]. Integrating multi-omics data with clinical information could facilitate host-directed therapy for TB; for example, TB meningitis [14]. However, the usefulness of the serum and plasma metabolomic analysis for tracking the blood metabolic signature of TB across populations is unclear. Moreover, the variability in study designs and limited guidelines for the use of omics technologies in clinical research could lead to less reliable data, complicated analyses, and missed biological signals [15]. Therefore, rigorous designs are required for the reproducibility of–omics studies.

In computationally functional interpretation, a set of genes or metabolites associated with a phenotype of interest is typically identified by a statistical test. Next, it is compared with a predefined database of biological functions, which returns enriched scores, for which p-values and/or q-values are calculated. Its fundamental principle comprises over-representation analyses. Gene set enrichment analysis (GSEA) utilizes a metric representing the overall ranks of features (e.g., t-score or fold-change) to find “significantly coordinated reposition” of the association strength based on a database of genes or metabolites sharing biological functions [16]. GSEA has been used extensively in transcriptomics studies, but comparatively infrequently in metabolomics. Despite its ability to obtain profound information from samples, untargeted metabolomics has not met expectations in terms of providing mechanistic insight into the metabolic alterations of phenotypes of interest. This is primarily due to the difficulty of compound annotation and identification. Generally, a tiny fraction of ions can be assigned to metabolites with an acceptable level of confidence. The limited ability to define metabolites hampers subsequent functional interpretation. The situation has gradually improved since the introduction of the Mummichog algorithm [17]. Overall, Mummichog leverages known metabolic networks to map all potentially molecular relevant metabolites. Hence, it allows a test of representation in which potentially valid metabolites are over-represented in a pathway, whereas others are randomly distributed to a metabolic network. This allows rapid assessment of potential alterations in a phenotype of interest in a hypothesis-generating study using untargeted metabolomics data [18].

Meta-analysis for pathway enrichment analysis or pathway-level meta-analysis is a powerful approach for capturing the biological signatures of a particular phenotype of interest across studies with heterogeneous settings [19]. Pathway-level meta-analysis of metabolomics data using GSEA has recently become feasible [20]. Using a computational method to predict functional activities from metabolic features and a pathway-level enrichment meta-analysis using GSEA may provide insight into the metabolic biosignature of phenotypes of interest.

Problematic reproducibility and minimal overlap of metabolic features across studies have hampered the investigation of the metabolic alterations in TB. There is an urgent need to develop a strategy to reliably capture the global metabolic biosignature of TB. Herein, we conducted a pathway-level GSEA-based meta-analysis of two pulmonary TB untargeted metabolomics data sets from South Korea and China. The analysis is a proof-of-concept of the ability of metabolomics meta-analysis using metabolic features to identify metabolic alterations in pulmonary TB.

Results

Data exploration reveals considerable metabolic feature changes in TB patients

We performed principal component analysis to examine and visualize the untargeted metabolomics data of the two studies in positive and negative ion modes. The three-dimensional score plots of cPMTb (positive ion mode), ST001231 (positive ion mode), cPMTb (negative ion mode), and ST001231 (negative ion mode) suggested comparatively clear separation of TB patients from their counterparts (Fig 1A–1D). However, the TB patients and cPMTb controls (NC) had a higher level of metabolome similarity than in ST001231.

Fig 1. Data exploration and visualization.

Fig 1

Principal component analysis of (A) cPMTb positive ion mode, (B) ST001231 positive ion mode, (C) cPMTb negative ion mode, and (D) ST001231 negative ion mode. TB, tuberculosis; NC, normal control.

Unpaired t-tests were also conducted, and the features were visualized using volcano plots. At a significance level of 0.05, only 218 features were upregulated, while 142 were downregulated in the TB group of cPMTb (positive ion mode, S1A Fig in S1 File). By contrast, 549 upregulated features and 412 downregulated features were found in the TB group in ST001231 (positive ion mode, S1B Fig in S1 File). Notably, few features had a high fold change in cPMTb, whereas many features had a high fold change in ST001231. Similar patterns were observed in negative ion mode (S1C and S1D Fig in S1 File).

TB patients have distinct metabolome profiles

Partial least-squares discriminant analysis and random forest analysis were used to examine whether the metabolic profiles could be used to classify TB patients and controls. In positive ion mode, the partial least-squares discriminant analysis models possessed excellent discriminatory performance. In particular, the optimal model in the cPMTb study contained five principal components with an accuracy, goodness-of-fit (R2), and goodness-of-prediction (Q2) of 0.90, 0.93, and 0.63, respectively (Fig 2A). Likewise, the optimal model in ST001231, which had four principal components, had an accuracy, R2, and Q2 of 1.00, 1.00, and 0.96, respectively (Fig 2B). Similar performance was observed in negative ion mode: cPMTb (accuracy, 0.94; R2, 0.96; Q2, 0.70) and ST001231 (accuracy, 1.00; R2, 1.00; Q2, 0.91) (Fig 2C and 2D). Remarkably, there were marked differences in metabolic profiles between TB patients and NCs in ST001231. The out-of-bag errors of the four random forest models were 0.12, 0.00, 0.13, and 0.00 for cPMTb (positive ion mode), ST001231 (positive ion mode), cPMTb (negative ion mode), and ST001231 (negative ion mode), respectively (S2 Fig in S1 File). These analyses collectively indicated that TB patients possess a distinct metabolome profile, compared with controls.

Fig 2. Partial least-squares discriminant analysis.

Fig 2

(A) cPMTb positive ion mode. (B) ST001231 positive ion mode. (C) cPMTb negative ion mode. (D) ST001231 negative ion mode. * optimal value of Q2; TB, tuberculosis; NC, normal control.

Profound plasma metabolic alterations of pulmonary TB patients

Pathway-level meta-analysis was conducted separately in positive and negative ion modes. In positive ion mode, the meta-analysis revealed that 15 pathways had a combined p-value of < 0.05. They belonged to metabolic homeostasis, proinflammatory processes, and vitamin metabolism. The five pathways with the lowest combined p-values were “carnitine shuttle,” “vitamin A (retinol) metabolism,” “pentose phosphate pathway,” “purine metabolism,” and “pentose and glucuronate interconversions” (Fig 3A). Notably, only two pathways were significant in both individual studies among the significant pathways in the meta-analysis: “carnitine shuttle” and “vitamin A (retinol) metabolism.” Several pathways, including “pentose and glucuronate interconversions,” “hyaluronan metabolism,” and “fructose and mannose metabolism”—were enriched only in the cPMTb study (A1pos). By contrast, pathways, such as “sialic acid metabolism,” “purine metabolism,” and “androgen and estrogen biosynthesis and metabolism”—were significant only in ST001231 (B1pos). The heterogeneity of significant pathways among studies might be due to their relatively small sample sizes, sample heterogeneity, and use of different LC-MS platforms. More details are shown in S1 Table in S2 File.

Fig 3. Pathway meta-analysis by gene set enrichment analysis.

Fig 3

(A) Positive ion mode. (B) Negative ion mode. The enrichment factor of a pathway was calculated by dividing its number of significant hits by the expected number of hits.

Analysis of the data in negative ion mode yielded a greater number of significant pathways. Indeed, 24 pathways had a combined p-value of < 0.05 in the meta-analysis. Similar to the enriched pathways in positive ion mode, these belonged to proinflammatory processes, vitamin metabolism, metabolic homeostasis, amino acid-related metabolism, and some potentially novel pathways. Five of the pathways with the lowest combined p-values were “glycolysis and gluconeogenesis,” “pyruvate metabolism,” “fructose and mannose metabolism,” “vitamin D3 (cholecalciferol) metabolism,” and “de novo fatty acid biosynthesis” (Fig 3B). Among the significant pathways in the meta-analysis, eight were significantly enriched in both studies—the above-mentioned five pathways and “bile acid biosynthesis,” “arachidonic acid metabolism,” and “vitamin A (retinol) metabolism.” Seven pathways were significantly enriched only in cPMTb (A1neg), including “leukotriene metabolism,” “galactose metabolism,” “C21-steroid hormone biosynthesis and metabolism,” and “sialic acid metabolism.” In contrast, eight pathways—including “propanoate metabolism,” several amino acid-related pathways and “vitamin B3 (nicotinate and nicotinamide) metabolism”—were enriched only in ST001231 (B1neg). More details are shown in S2 Table in S2 File.

Discussion

Meta-analysis enhances statistical power, reliability, and generalizability, especially in high-throughput data settings [21]. A feature-level meta-analysis provides more comprehensive information than secondary pooled analyses of a limited number of identified metabolites. Moreover, as mentioned above, metabolite identification remains a fundamental issue in metabolomics [22]. An analysis that forgoes metabolite identification significantly reduces the time (i.e., from days to hours) required to obtain valuable insights and derive actionable targets for the phenotype of interest. Therefore, we could focus more on the validation of potential biomarkers and the performance of experiments to delineate molecular mechanisms of disease.

In this study, a pathway-level GSEA-based meta-analysis of two pulmonary TB untargeted metabolomics data sets was conducted. The two included data sets had a significant degree of heterogeneity in clinical characteristics, which might affect the number of enriched pathways. Nevertheless, the meta-analysis provided considerable insight into global metabolic alterations in plasma from pulmonary TB patients. The results are pathophysiologically comparable with previous findings using conventional targeted methods in addition to novel metabolic alterations. The analysis is capable of suggesting biological processes that may be significantly influenced by the clinical characteristics of a cohort. Furthermore, the findings suggested that functional interpretation of metabolomics data at the pathway level can provide insights into the molecular signatures of TB patients. Importantly, biological speculations at the level of individual metabolites exhibit human-centric bias [23]. Below we discuss some of the most important findings.

“Vitamin A (retinol) metabolism” and “vitamin D3 (cholecalciferol) metabolism” were altered in the meta-analysis. Vitamin A deficiency is reportedly associated with an increased risk of incident TB among household contacts [24]. Vitamin A supplementation may boost immunity against TB [25], and vitamin A and zinc co-supplementation may improve outcomes [26]. Vitamin D3 deficiency is a risk factor for TB. Vitamin D3 supplementation may be associated with immune activation, and thus should improve treatment outcomes; however, this requires validation [26]. In addition, the “bile acid biosynthesis” and “purine metabolism” pathways were significantly altered in our TB patients compared to controls. These pathways may also be involved in host defense. Indeed, some bile acids inhibit the in vitro growth of M. tuberculosis [27]. Bile acid derivatives are also potential anti-TB agents [28], and purine metabolism in M. tuberculosis is a target for drug development [29, 30]. Furthermore, together with lipid metabolism, these pathways are reportedly linked to anti-TB drug-induced hepatotoxicity [31].

Notably, we observed significant systemic changes in the host (i.e., disease phenotype) due to TB infection. Proinflammatory lipid mediators and pro-resolving lipid mediators are associated with TB and strongly associated with TB comorbid type 2 diabetes. The arachidonic acid-derived leukotriene and prostaglandin families were reported to be the most abundant proinflammatory lipid mediators [32]. Our pathway analysis revealed significant enrichment of “arachidonic acid metabolism,” “leukotriene metabolism,” and “prostaglandin formation from arachidonate.” We also found various processes related to nutrients and oxidative stress, including “pyruvate metabolism,” “fructose and mannose metabolism,” “glycolysis and gluconeogenesis,” “de novo fatty acid biosynthesis,” and the metabolism of several amino acids. These findings are concur with a previous report that metabolic processes are involved in adaptations and/or interactions of the host and microbe during infection [33]. Medium-chain fatty acids are involved in protective immunity against M. tuberculosis [34]. Additionally, alteration of “pyruvate metabolism” might be linked to the increased catabolism and/or energy consumption observed in TB patients [35]. “Fructose and mannose metabolism” and “glycolysis and gluconeogenesis” in M. tuberculosis are reportedly affected by nutrient starvation. In addition, they are linked to central carbon metabolism, which is essential for the maintenance of metabolic homeostasis in M. tuberculosis [36]. For example, mycobacteria in phagosomes took up exogenous pyruvate more efficiently than glucose and the pyruvate was used as a carbon source for intracellular growth [37].

We also found some potentially important pathways associated with TB. In a study of the innate immune responses to M. tuberculosis using macrophages, Blischak et al. found a subset of genes specifically involved in infection, including protein-coding genes related to the regulation of sialic acid synthesis [38]. We found that “sialic acid metabolism” was altered in plasma from TB patients, and Isa et al. [39] reported an altered level of sialic acid in urine. Further studies are warranted to explore the role of sialic acid metabolism and the associated glycoproteins in the immune response, to understand the susceptibility of TB and potential therapeutic targets. “C21-steroid hormone biosynthesis and metabolism” was significantly changed in TB patients, and may be associated with pathological processes (e.g., host defense against TB infection) [40]. Finally, the roles of other pathways showing alterations in TB patients, such as “Vitamin B3 (nicotinate and nicotinamide) metabolism,” “Propanoate metabolism,” and “androgen and estrogen biosynthesis and metabolism,” remain to be elucidated.

This study had some limitations. First, the analysis was conducted with only two untargeted metabolomics data sets. The lack of data might impede the identification of subtle TB-associated metabolic disturbances. Second, similar to a recent study [41], the pathway annotations require validation. Nonetheless, the analysis validated the available pathological and biological evidence, suggesting its reliability. Third, blood-derived metabolomics studies cannot directly elucidate in vivo growth mechanisms or the mode of action of anti-TB drugs [13]. Instead, they are more suitable for applications related to host systematic molecular alterations. Finally, post hoc metabolite identification and individual quantification are required to evaluate the associations of metabolites with clinical TB manifestations.

Conclusions

We showed that pathway meta-analysis of several studies can overcome cross-study inconsistency by increasing the power and generalizability of the results. In addition, pathologically comparable and novel metabolic alterations in plasma from pulmonary TB patients were described. Subsequent studies are needed to leverage these findings to discover novel diagnostic biomarkers, metabolism-informed clinical care, and metabolism-informed therapeutic development.

Materials and methods

Institutional review board statement

This study was approved by the Institutional Review Board of Korea University Guro Hospital (2017GR0012). All investigations were conducted in accordance with the principles of the Declaration of Helsinki. Informed consent was obtained from all subjects involved in the cPMTb study. Patients provided written informed consent for analysis of their blood and clinical data.

Korean tuberculosis cohort characteristics

The samples used in this study were part of a multi-center TB cohort entitled Center for Precision Medicine for Tuberculosis (cPMTb). The biospecimens and data used for this study were provided by the Biobank of Korea University Guro Hospital, a member of Korea Biobank. Individuals with human immunodeficiency virus infection, chronic renal disease, chronic liver diseases, chronic lung diseases, and malignant diseases were excluded from the analysis. Eventually, plasma samples from 63 clinically diagnosed pulmonary TB patients and 50 normal controls were collected.

In the TB group, the mean age (± standard deviation) was 55 (± 16) years and 27% of the patients were women. Forty-eight patients (76%) had positive sputum smears and 14 patients (22%) had a chest cavity image on x-ray. In the controls, the mean age (± standard deviation) was 60 (± 10) years, and 58% of the controls were women.

Chinese tuberculosis cohort characteristics

We downloaded data from TB patients and NC (Metabolomics Workbench, study ID ST001231) for the pathway-level meta-analysis with the cPMTb cohort to elucidate the metabolic profiles of pulmonary TB. In brief, the study involved 70 plasma samples of pulmonary TB (35 samples) and NC (35 samples). In the TB group, the age ranged from 18 to 64 years and 49% of the patients were women; of the patients, 86% had positive sputum smears and 17% had a chest cavity image on x-ray. The age of the NC group ranged from 23 to 60 years, and 31% of the controls were women. The untargeted metabolomics study was carried out by ultra-high-performance liquid chromatography coupled with Q Exactive mass spectrometer in positive and negative ion modes. More details are provided in the original publication [4].

Chemicals and reagents

High-performance liquid chromatography-grade water, methanol, and acetonitrile (ACN) were from J.T. Baker (Phillipsburg, NJ, USA). Analytical-grade formic acid and ammonium acetate and the internal standard (cholic acid-d5), were purchased from Toronto Research Chemicals (Toronto, Canada). Authentic chemicals for establishing the in-house database were purchased from Sigma-Aldrich (St. Louis, MO, USA).

Sample preparation

Blood samples were collected routinely on the day of enrollment in the overnight-fasted and medication-free state before treatment. Plasma was prepared by centrifuging the whole blood for 10 min at 4,500 rpm and stored at -80°C until analysis.

The extraction of metabolites from plasma was conducted in accordance with our established protocol [42]. In brief, 50 μL of plasma were mixed with 150 μL of ACN containing 5 μg/mL cholic acid-d5 in a microcentrifuge Eppendorf tube. The mixture was vigorously vortexed for 5 min and centrifuged for 10 min at 13,000 rpm at 4°C; the supernatant was collected. An equal amount of each sample was collected and mixed to create a pooled quality control (QC) sample. All extracts were stored at -20°C and subsequently analyzed using a high-performance liquid chromatography quadrupole time of flight mass spectrometer.

Instrumental conditions for untargeted metabolomics

The analysis was conducted as described previously with an Agilent 1200 series high-performance liquid chromatography (Agilent Technologies, Santa Clara, CA, USA) coupled to a 6530 Q-TOF mass spectrometer (Agilent Technologies) [42]. The autosampler was set at 4°C for all procedures. In positive ion mode, ACQUITY UPLC BEH C18 (100 × 2.1 mm, 1.7 μm; Waters) was maintained at 40°C, and metabolite separation was conducted by binary gradient elution with a flow rate of 0.4 mL/min. Mobile phase A was water with 0.1% formic acid; mobile phase B was ACN with 0.1% formic acid. The gradient was 0 min, 2% B; 1 min, 2% B; 3 min, 20% B; 8 min, 90% B; 14 min, 90% B; 14.5 min, 2% B; 18 min, 2% B. Essential mass spectrometer parameters are given in S3 Table in S2 File. In negative ion mode, the ZIC-HILIC column (100 × 2.1 mm, 3.5 μm; Merck, Darmstadt, Germany) was maintained at 35°C, and metabolite separation was conducted by binary gradient elution with a flow rate of 0.5 mL/min. Mobile phase A was ACN/water (5:95, v/v) with 10 mM ammonium acetate; mobile phase B was ACN/water (95:5, v/v) with 10 mM ammonium acetate. The gradient was 0 min, 99% B; 1 min, 99% B; 15 min, 50% B; 17 min, 50% B; 17.1 min, 99% B; and 22 min, 99% B. The mass spectrometer was operated using equivalent conditions to positive ion mode.

Data preprocessing and alignment

The generated *.d raw files were converted to mzML files using ProteoWizard [43]. The mzML files were then submitted to MS-DIAL (version 4.60) [44] for peak detection, alignment, and annotation. Essential data processing parameters are given in S4 Table in S2 File. Features with sample average signals lower than fivefold above the blank average were removed. LOWESS signal correction across batches was applied to the aligned data set. Before subsequent statistical analyses, features with a relative standard deviation of ≥ 20% in QC samples were removed. Features with missing values in ≥ 50% of samples were also removed, otherwise imputed using feature-wise k-nearest neighbors. Finally, normalized and filtered features were log-transformed and Pareto scaled. Post-processing data treatment were conducted using MetaboAnalyst 5.0 [20].

The *.raw files from ST001231 were submitted directly to MS-DIAL (version 4.60) for peak detection, alignment, and annotation. Data processing parameters are given in S4 Table in S2 File. Because there were no blank samples, no feature removal based on blank information was applied. Features with missing values in ≥ 50% of the samples were removed, otherwise imputed using feature-wise k-nearest neighbors. Features with a relative standard deviation of ≥ 20% in QC samples were removed. Quantile normalization was employed for cross-sample normalization. Finally, the data were log-transformed and Pareto-scaled before subsequent analyses.

Normalized data are provided in S5-S8 Tables in S2 File.

Data exploration and visualization

Principal component analysis was conducted to reduce data dimensionality, thus facilitating exploration and visualization of the data. The principal component analysis aims to find an orthogonal basis (or new axes) that can explain data variability and project observations onto a smaller subspace. In our study, e1, e2, e3 were the new axes (or eigenvectors corresponding to the three largest eigenvalues of the sample covariance matrix) and each observation xRp was converted to a vector (xTe1, xTe2, xTe3) and plotted in three-dimensional space.

Statistical analysis

Multiple statistical methods were used to analyze the untargeted metabolomics data. For univariate analysis, unpaired t-tests were used. The adjusted p-value following the Benjamini-Hochberg procedure (i.e., a false discovery rate of 0.05) was used as the significance level. Partial least-squares discriminant analysis and random forest analysis (number of trees, 500; number of predictors, 50) were used to examine the class discrimination (i.e., TB and the counterpart) using metabolomics data. A 10-fold cross-validation procedure was used to measure classification performance.

Pathway-level meta-analysis of metabolic features

The normalized and transformed data that contained m/z values, retention time (in seconds), and peak intensity for each ion mode were subjected to pathway-level meta-analysis. Before the pathway-level integration, the following calculations were performed: individual m/z statistics (i.e., t-test); putative metabolite annotation (mass tolerance, 10 ppm); and pathway prediction. Next, the p-values from individual studies were combined using Fisher’s method. Given individual p-values pi from the ith hypothesis i = 1, …, n, the method aggregated them by:

X2=-2i=1nlogpi,

which follows the chi-square distribution with degrees of freedom 2n under the null hypotheses.

GSEA was used for the pathway-level enrichment algorithm. In brief, the algorithm ranked all genes in data based on t-statistics and compared them to a prespecified gene set (or pathway), termed S. If top-ranked genes (i.e., large t-statistics) had many overlaps with S, such that the enrichment score increased, then S was regarded as an active pathway. The Homo sapiens (human) [MFN] (combined KEGG, BiGG, and Edinburgh) was used as the pathway library for analysis. A pathway with a combined p-value of < 0.05 was considered statistically significant.

Supporting information

S1 File. Volcano plots of metabolic features and the random forest classification of the two studies.

(DOCX)

S2 File. Supplementary method and materials and data.

(XLSX)

Data Availability

All relevant data are within the manuscript and its Supporting information files.

Funding Statement

This research was supported in part by the Bio & Medical Technology Development Program of the National Research Foundation (NRF) funded by the Korean government (MSIT) (No. 2019M3E5D1A01068994), and the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT) (grant No. 2018R1A5A2021242). The funders did not influence study design, data collection, data analysis and interpretation, and the manuscriptșs content.

References

  • 1.Houben RMGJ, Dodd PJ. The Global Burden of Latent Tuberculosis Infection: A Re-estimation Using Mathematical Modelling. PLOS Medicine. 2016;13(10):e1002152. doi: 10.1371/journal.pmed.1002152 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Organization WH. Global tuberculosis report2020.
  • 3.Singhania A, Verma R, Graham CM, Lee J, Tran T, Richardson M, et al. A modular transcriptional signature identifies phenotypic heterogeneity of human tuberculosis infection. Nature Communications. 2018;9(1):2308. doi: 10.1038/s41467-018-04579-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Huang H, Shi L-Y, Wei L-L, Han Y-S, Yi W-J, Pan Z-W, et al. Plasma metabolites Xanthine, 4-Pyridoxate, and D-Glutamic acid as novel potential biomarkers for pulmonary tuberculosis. Clinica Chimica Acta. 2019;498:135–42. doi: 10.1016/j.cca.2019.08.017 [DOI] [PubMed] [Google Scholar]
  • 5.Warsinske H, Vashisht R, Khatri P. Host-response-based gene signatures for tuberculosis diagnosis: A systematic comparison of 16 signatures. PLOS Medicine. 2019;16(4):e1002786. doi: 10.1371/journal.pmed.1002786 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Lange C, Aarnoutse R, Chesov D, van Crevel R, Gillespie SH, Grobbel H-P, et al. Perspective for Precision Medicine for Tuberculosis. Frontiers in Immunology. 2020;11(2442). doi: 10.3389/fimmu.2020.566608 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Seyhan AA, Carini C. Are innovation and new technologies in precision medicine paving a new era in patients centric care? Journal of Translational Medicine. 2019;17(1):114. doi: 10.1186/s12967-019-1864-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Karczewski KJ, Snyder MP. Integrative omics for health and disease. Nature Reviews Genetics. 2018;19(5):299–310. doi: 10.1038/nrg.2018.4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Roe J, Venturini C, Gupta RK, Gurry C, Chain BM, Sun Y, et al. Blood Transcriptomic Stratification of Short-term Risk in Contacts of Tuberculosis. Clinical Infectious Diseases. 2020;70(5):731–7. doi: 10.1093/cid/ciz252 [DOI] [PubMed] [Google Scholar]
  • 10.Sambarey A, Devaprasad A, Mohan A, Ahmed A, Nayak S, Swaminathan S, et al. Unbiased Identification of Blood-based Biomarkers for Pulmonary Tuberculosis by Modeling and Mining Molecular Interaction Networks. EBioMedicine. 2017;15:112–26. doi: 10.1016/j.ebiom.2016.12.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Penn-Nicholson A, Mbandi SK, Thompson E, Mendelsohn SC, Suliman S, Chegou NN, et al. RISK6, a 6-gene transcriptomic signature of TB disease risk, diagnosis and treatment response. Scientific Reports. 2020;10(1):8629. doi: 10.1038/s41598-020-65043-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Darboe F, Mbandi SK, Naidoo K, Yende-Zuma N, Lewis L, Thompson EG, et al. Detection of Tuberculosis Recurrence, Diagnosis and Treatment Response by a Blood Transcriptomic Risk Signature in HIV-Infected Persons on Antiretroviral Therapy. Frontiers in Microbiology. 2019;10(1441). doi: 10.3389/fmicb.2019.01441 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.du Preez I, Luies L, Loots DT. The application of metabolomics toward pulmonary tuberculosis research. Tuberculosis. 2019;115:126–39. doi: 10.1016/j.tube.2019.03.003 [DOI] [PubMed] [Google Scholar]
  • 14.van Crevel R, Avila-Pacheco J, Thuong NTT, Ganiem AR, Imran D, Hamers RL, et al. Improving host-directed therapy for tuberculous meningitis by linking clinical and multi-omics data. Tuberculosis. 2021:102085. doi: 10.1016/j.tube.2021.102085 [DOI] [PubMed] [Google Scholar]
  • 15.Krassowski M, Das V, Sahu SK, Misra BB. State of the Field in Multi-Omics Research: From Computational Needs to Data Mining and Sharing. Frontiers in Genetics. 2020;11(1598). doi: 10.3389/fgene.2020.610798 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proceedings of the National Academy of Sciences. 2005;102(43):15545. doi: 10.1073/pnas.0506580102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Li S, Park Y, Duraisingham S, Strobel FH, Khan N, Soltow QA, et al. Predicting Network Activity from High Throughput Metabolomics. PLOS Computational Biology. 2013;9(7):e1003123. doi: 10.1371/journal.pcbi.1003123 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Kim SJ, Long NP, Jung CW, Anh NH, Min JE, Kim HM, et al. Exposure of nano-polystyrene induces metabolic alteration on lipid homeostasis in Caco-2. Environmental Science: Nano. 2021. doi: 10.1039/D1EN00145K [DOI] [Google Scholar]
  • 19.Shen K, Tseng GC. Meta-analysis for pathway enrichment analysis when combining multiple genomic studies. Bioinformatics. 2010;26(10):1316–23. doi: 10.1093/bioinformatics/btq148 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Chong J, Soufan O, Li C, Caraus I, Li S, Bourque G, et al. MetaboAnalyst 4.0: towards more transparent and integrative metabolomics analysis. Nucleic Acids Research. 2018;46(W1):W486–W94. doi: 10.1093/nar/gky310 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Phuoc Long N, Hee Jung K, Jun Yoon S, Hoang Anh N, Diem Nghi T, Pyo Kang Y, et al. Systematic assessment of cervical cancer initiation and progression uncovers genetic panels for deep learning-based early diagnosis and proposes novel diagnostic and prognostic biomarkers. Oncotarget. 2017;8(65). doi: 10.18632/oncotarget.22689 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Ivanisevic J, Want EJ. From Samples to Insights into Metabolism: Uncovering Biologically Relevant Information in LC-HRMS Metabolomics Data. Metabolites. 2019;9(12):308. doi: 10.3390/metabo9120308 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Booth SC, Weljie AM, Turner RJ. Computational tools for the secondary analysis of metabolomics experiments. Computational and Structural Biotechnology Journal. 2013;4(5):e201301003. doi: 10.5936/csbj.201301003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Aibana O, Franke MF, Huang C-C, Galea JT, Calderon R, Zhang Z, et al. Impact of Vitamin A and Carotenoids on the Risk of Tuberculosis Progression. Clinical Infectious Diseases. 2017;65(6):900–9. doi: 10.1093/cid/cix476 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Qrafli M, El Kari K, Aguenaou H, Bourkadi JE, Sadki K, El Mzibri M. Low plasma vitamin A concentration is associated with tuberculosis in Moroccan population: a preliminary case control study. BMC Research Notes. 2017;10(1):421. doi: 10.1186/s13104-017-2737-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Tiberi S, du Plessis N, Walzl G, Vjecha MJ, Rao M, Ntoumi F, et al. Tuberculosis: progress and advances in development of new drugs, treatment regimens, and host-directed therapies. The Lancet Infectious Diseases. 2018;18(7):e183–e98. doi: 10.1016/S1473-3099(18)30110-5 [DOI] [PubMed] [Google Scholar]
  • 27.Epstein D, Mistry K, Whitelaw A, Watermeyer G, Pettengell K. The effect of physiological concentrations of bile acids on in vitro growth of Mycobacterium tuberculosis2012. [DOI] [PubMed]
  • 28.Pore VS, Divse JM, Charolkar CR, Nawale LU, Khedkar VM, Sarkar D. Design and synthesis of 11α-substituted bile acid derivatives as potential anti-tuberculosis agents. Bioorganic & Medicinal Chemistry Letters. 2015;25(19):4185–90. doi: 10.1016/j.bmcl.2015.08.006 [DOI] [PubMed] [Google Scholar]
  • 29.Beena Rawat DS. Antituberculosis Drug Research: A Critical Overview. Medicinal Research Reviews. 2013;33(4):693–764. doi: 10.1002/med.21262 [DOI] [PubMed] [Google Scholar]
  • 30.William BP, Mary CL. Purine Metabolism in Mycobacterium tuberculosis as a Target for Drug Development. Current Pharmaceutical Design. 2007;13(6):599–608. doi: 10.2174/138161207780162863 [DOI] [PubMed] [Google Scholar]
  • 31.Liu L, Li X, Huang C, Bian Y, Liu X, Cao J, et al. Bile acids, lipid and purine metabolism involved in hepatotoxicity of first-line anti-tuberculosis drugs. Expert Opinion on Drug Metabolism & Toxicology. 2020;16(6):527–37. doi: 10.1080/17425255.2020.1758060 [DOI] [PubMed] [Google Scholar]
  • 32.Shivakoti R, Dalli J, Kadam D, Gaikwad S, Barthwal M, Colas RA, et al. Lipid mediators of inflammation and Resolution in individuals with tuberculosis and tuberculosis-Diabetes. Prostaglandins & Other Lipid Mediators. 2020;147:106398. doi: 10.1016/j.prostaglandins.2019.106398 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Luier L, Loots DT. Tuberculosis metabolomics reveals adaptations of man and microbe in order to outcompete and survive. Metabolomics. 2016;12(3):40. doi: 10.1007/s11306-016-0969-x [DOI] [Google Scholar]
  • 34.Weiner J 3rd, Parida SK, Maertzdorf J, Black GF, Repsilber D, Telaar A, et al. Biomarkers of Inflammation, Immunosuppression and Stress Are Revealed by Metabolomic Profiling of Tuberculosis Patients. PLOS ONE. 2012;7(7):e40221. doi: 10.1371/journal.pone.0040221 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Zhou A, Ni J, Xu Z, Wang Y, Lu S, Sha W, et al. Application of 1H NMR Spectroscopy-Based Metabolomics to Sera of Tuberculosis Patients. Journal of Proteome Research. 2013;12(10):4642–9. doi: 10.1021/pr4007359 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Rizvi A, Shankar A, Chatterjee A, More TH, Bose T, Dutta A, et al. Rewiring of Metabolic Network in Mycobacterium tuberculosis During Adaptation to Different Stresses. Frontiers in Microbiology. 2019;10(2417). doi: 10.3389/fmicb.2019.02417 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Osada-Oka M, Goda N, Saiga H, Yamamoto M, Takeda K, Ozeki Y, et al. Metabolic adaptation to glycolysis is a basic defense mechanism of macrophages for Mycobacterium tuberculosis infection. International Immunology. 2019;31(12):781–93. doi: 10.1093/intimm/dxz048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Blischak JD, Tailleux L, Mitrano A, Barreiro LB, Gilad Y. Mycobacterial infection induces a specific human innate immune response. Scientific Reports. 2015;5(1):16882. doi: 10.1038/srep16882 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Isa F, Collins S, Lee MH, Decome D, Dorvil N, Joseph P, et al. Mass Spectrometric Identification of Urinary Biomarkers of Pulmonary Tuberculosis. EBioMedicine. 2018;31:157–65. doi: 10.1016/j.ebiom.2018.04.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Wipperman MF, Sampson NS, Thomas ST. Pathogen roid rage: Cholesterol utilization by Mycobacterium tuberculosis. Critical Reviews in Biochemistry and Molecular Biology. 2014;49(4):269–93. doi: 10.3109/10409238.2014.895700 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Pang Z, Zhou G, Chong J, Xia J. Comprehensive Meta-Analysis of COVID-19 Global Metabolomics Datasets. Metabolites. 2021;11(1):44. doi: 10.3390/metabo11010044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Park J, Shin Y, Kim TH, Kim D-H, Lee A. Plasma metabolites as possible biomarkers for diagnosis of breast cancer. PLOS ONE. 2019;14(12):e0225129. doi: 10.1371/journal.pone.0225129 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Chambers MC, Maclean B, Burke R, Amodei D, Ruderman DL, Neumann S, et al. A cross-platform toolkit for mass spectrometry and proteomics. Nature Biotechnology. 2012;30(10):918–20. doi: 10.1038/nbt.2377 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Tsugawa H, Cajka T, Kind T, Ma Y, Higgins B, Ikeda K, et al. MS-DIAL: data-independent MS/MS deconvolution for comprehensive metabolome analysis. Nature Methods. 2015;12(6):523–6. doi: 10.1038/nmeth.3393 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 File. Volcano plots of metabolic features and the random forest classification of the two studies.

(DOCX)

S2 File. Supplementary method and materials and data.

(XLSX)

Data Availability Statement

All relevant data are within the manuscript and its Supporting information files.


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES