Abstract
The mammalian gut harbors a complex and dynamic microbial ecosystem: the microbiota. While emerging studies support that microbiota regulates brain function with a few molecular cues suggested, the overall biochemical landscape of the “microbiota-gut-brain axis” remains largely unclear. Here we use high-coverage metabolomics to comparatively profile feces, blood sera, and cerebral cortical brain tissues of germ-free C57BL/6 mice and their age-matched conventionally raised counterparts. Results revealed for all three matrices metabolomic signatures owing to microbiota, yielding hundreds of identified metabolites including 533 altered for feces, 231 for sera, and 58 for brain with numerous significantly enriched pathways involving aromatic amino acids and neurotransmitters. Multicompartmental comparative analyses single out microbiota-derived metabolites potentially implicated in interorgan transport and the gut-brain axis, as exemplified by indoxyl sulfate and trimethylamine-N-oxide. Gender-specific characteristics of these landscapes are discussed. Our findings may be valuable for future research probing microbial influences on host metabolism and gut-brain communication.
Subject terms: Metabolomics, Microbiology, Biochemical networks, Biomarkers
The gut microbiota harbours neuroactive potential with links to neurological disorders. Here, the authors apply global metabolomics with an integrated annotation strategy to comparatively profile fecal, blood serum and cerebral cortical brain tissues of eight-week-old germ-free mice vs. age-matched specific-pathogen-free mice, providing a snapshot of the metabolome status linked to the gut-brain axis.
Introduction
The mammalian body and particularly the gastrointestinal (GI) tract is inhabited by hundreds of trillions of microbes, collectively termed the microbiota1. Complex, dynamic, and metabolically active by nature, these commensal microbes have been discovered to constantly interact with the host as a crucial mediator for physiological processes spanning energy harvest2, immune cell development3, and gut epithelial homeostasis4. Interestingly, recent studies support that microbiota also harbors novel neuroactive potential with links to neurological and/or psychiatric disorders, as further encapsulated as the “microbiota-gut-brain axis”5. For example, using maternal immune activation (MIA) mouse, a model of autism spectrum disorder (ASD), Hsiao and colleagues discovered6 that both ASD-mimicking GI barrier defects and behavioral abnormalities MIA offspring exhibited were restored through colonizing human commensal Bacteroidetes fragilis, supporting a gut-microbiota-brain connection for autism. In a recent work by Valles-Colomer and colleagues7, analyses of a large human cohort correlating fecal metagenomic features with indicators of quality of life and depression identified microbial strains, pathways, and metabolites pertaining to mental health and gut-brain interaction, providing the first population-scale evidence linking microbiota to mental health outcomes.
Despite the emerging data, whether and how microbiota controls brain function remains largely undefined. It has been postulated that at least two routes are involved8, namely (i) the vagus nerves (neuronal) that connect the central nervous system (CNS) and enteric nervous system (ENS, the “second brain”) and (ii) the circulatory system (humoral) encompassing blood and lymphatic circulation. Gut microbes, in close proximity to numerous local neurons and immune cells, may either act on the ENS in situ to signal the CNS remotely, or more likely, they synthesize or transform molecular cues that can translocate from gut lumen to systemic circulation, and possibly cross the blood-brain barrier (BBB) and affect CNS directly9. Despite such interest, surprisingly, molecular underpinnings for such microbiota-gut-brain axis are unclear. Although there have been sporadic studies targeting a few microbial molecules or chemical classes in this regard, as represented by α-synuclein10, 3,4-dihydroxyphenylacetate7, and bile acids11, many of these still have yet to be validated as a gut-brain mediator.
In this work, we use high-coverage comparative metabolomics analyses combining targeted and untargeted annotation strategies to address this (Supplementary Fig. 1)12,13. We profile fecal, blood sera, and cerebral cortical brain tissues of 8-week-old germ-free (GF) C57BL/6 mice and their age-matched conventionally raised (CONV-R) specific-pathogen-free counterparts using high-resolution mass spectrometry (HRMS). We assess group patterns using univariate and multivariate statistics, annotate chemical structures of all distinct ion features through an integrated cheminformatic approach, and leverage a suite of statistical and data visualization tools for systematic comparisons across sample compartments. Here, we report 701 unique metabolites of differentiated GF/CONV-R profiles; potential mechanistic links of resident microbiota to the humoral gut-brain axis are examined through enrichment analyses, random forest classification, and metabolomic network analysis. This work presents datasets from a unique multi-metabolome perspective for probing microbial influences on mammalian interorgan transport and gut-blood-brain interaction.
Results
High-coverage metabolomics of GF vs. CONV-R mice
To uncover metabolites underlying microbiota-host interaction in light of gut-brain signaling, we conducted metabolomics, targeted and untargeted, on fecal matter, blood sera, and brain tissues (cerebral cortex slices) for 12 GF and 12 CONV-R C57BL/6 mice (8-week-old) (Fig. 1a). Using UHPLC-HESI-HRMS, we detected and aligned ion features into master peak tables for each sample type based on MS1 full-scan data, generating total ion feature numbers ranging from highest 17,386 for feces (heated ESI+) to lowest 6,334 for brain tissues (heated ESI+) (Fig. 1b). We statistically assessed these tables to screen for features of GF/CONV-R difference (fold change ≥ 1.5, p value < 0.01, Welch’s t-test) (Fig. 1b, c), yielding a total of 20,939 significant ion features including 16,001 for feces, 3,977 for sera, and 961 for brain tissues. Tandem mass spectra were acquired for all these features, and the highest metabolome coverage possible was achieved through a cheminformatic pipeline incorporating targeted and untargeted annotation procedures (Fig. 1a; Supplementary Fig. 1). The targeted procedure matched the unknown against an in-house spectral library containing retention time, accurate mass, and MS2 characteristic fragments for 423 authentic chemical standards of canonical pathway metabolites and 46 literature-reported microbial by-products including neurotransmitters (Supplementary Data 1). Meanwhile, untargeted analyses entailed validated rules and tools for formula generation14, structural dereplication15, and machine learning-based retention time prediction16. We manually curated and combined identification results from both procedures, yielding confidently annotated 533 fecal metabolites, 231 serum metabolites, and 58 metabolites of either level 1 confirmed structure or level 2 probable structure17 (Fig. 1d; Supplementary Data 2, 4).
Microbiota as a master regulator of gastrointestinal metabolism
Because the gut is the largest niche for commensal microbes and houses most critical sites of digestion, homeostasis, and immunity, gut lumen metabolomes represent a unique avenue for probing microbiota-host interaction18. Our metabolomics revealed distinct fecal patterns due to microbiota, as illustrated in principal component analysis (PCA) (Fig. 2a, b) and metabolomic total ion chromatogram cloudplot (Fig. 2c). Note that the 16,001 distinct fecal features accounted for 49.0% of the total detected. High-coverage cheminformatics successfully resolved 533 fecal metabolites from these features.
To gain a “landscape” view, chemical similarity enrichment analysis (ChemRICH)19 was performed. Seventy chemical classes were clustered, covering a wide lipophilicity range with varied compound numbers (node size) and overall trends of change (node color); for the color spectrum of node, red indicated that among the metabolites of GF/CONV-R difference, GF-enriched ones outnumbered those increased in CONV-R and/or overall embraced larger fold changes (Fig. 2d; Supplementary Data 5, 6). Most significantly enriched were cholic acids, oligopeptides, glutamates, indoles, and nucleosides, followed by other amino acids (e.g., sulfur, aromatic), lipids (e.g., acylcarnitines, fatty acids), and neurotransmitter families including taurine and catecholamines. To interpret in the context of biologic pathways, we conducted quantitative metabolite sets enrichment analysis (qMSEA) based on 99 a priori defined sets of metabolites in the Small Molecule Pathway Database (SMPDB)20,21. We identified 71 microbiota-perturbed pathways in the GI tract (adjusted p < 0.05) (Supplementary Data 7) with top 50 plotted (Fig. 2e). Our analyses enriched purines, tryptophan metabolism, bile acid biosynthesis, porphyrin metabolism, and fatty acid metabolism, alongside numerous other amino acids (e.g., arginine, proline, tyrosine, phenylalanine, aspartate, glycine, serine, lysine, and branched-chain amino acids), lipid metabolism (e.g., phospholipids, carnitines, and sex steroid hormones), cofactors (e.g., vitamin K, vitamin B5, biotin, vitamin B3, and vitamin B6), and polyamine metabolism (e.g., spermidine), etc. Both analyses confirmed for microbiota as a master regulator of gut metabolism, in protein metabolism (over 100 oligopeptides altered), energy harvest (e.g., fatty acid β-oxidation, carnitine shuttle, citric acid cycle, and gluconeogenesis), and numerous signaling pathways that involved amino acids and lipids as substrates or ligands.
We turned to individual fecal metabolites and select pathways for in-depth analyses. Using a random forest model, altered fecal metabolites were ranked by their relative contribution to group separation with top 50 shown in a variable importance plot (VIP) (Fig. 3a). Top-ranked members were structurally diverse, indicating a multifaceted and complex nature of the microbiome-metabolome network. We specifically focused on aromatic amino acid pathways because they have confirmed microbial involvement and neuromodulatory activities22–24, enriched with tryptophan metabolism being the second most significant (Fig. 2e), represented major compound classes perturbed (in addition to those for energy harvest and protein metabolism) (Fig. 2d), and included multiple metabolite members highly ranked in the random forest classification model, including kynurenine (4th), shikimate (12th), and serotonin (21st) (Fig. 3a). In the integrated view (Fig. 3b), all three tryptophan catabolic fluxes were enhanced when microbiota was present, as characterized by (i) reduced tryptophan pools (1.4-fold), (ii) decrease levels of kynurenine, 5-hydroxy-L-tryptophan, and indole, and (iii) increased levels of serotonin (5-HT) (5.7-fold), kynurenate (4.1-fold), and indole derivatives of 1.8–36.3-fold changes including indole-3-propionate (IPA), indole-3-acetate (IAA), indole-3-lactate (ILA) and indole-3-carboxaldehyde (I3A) (Fig. 3b, c). We also observed distinct bile acid patterns with fold changes as large as three orders of magnitude (Fig. 3d; Supplementary Data 2). Note that free-form primary bile acids such as chenodeoxycholate, cholate, muricholates (αMCA+βMCA) were markedly higher in CONV-R feces, while their taurine- or glycine- conjugates had an opposite trend. Correspondingly, secondary bile acids were found to be much enriched in CONV-R feces than in GF’s (Fig. 3d; Supplementary Data 2).
Microbiota extensively mediates gut neurotransmitter production and transformation
The discovery that our commensal microbiota comprises innumerable neurotransmitter producers has recently led to exciting hypotheses questioning their effects on host neurotransmission, gut-brain signaling, and mental health outcomes25,26. Here, we offer a complete analysis of fecal neurotransmitter profiles comparing GF and CONV-R mice. As selectively shown (Fig. 4a–d), we found for a number of neurotransmitters much-elevated levels in the presence of microbiota, spanning Class A (Rhodopsin-like) G-protein coupled receptor pathway (GPCR) amines including acetylcholine, epinephrine, histamine, serotonin, and tyramine (Fig. 4a), Class C metabotropic neurotransmitters L-glutamate and GABA (Fig. 4b) to neurotransmitters (or their precursors) of other channels or receptors, e.g., taurine, L-homocysteate, L-3,4-dihydrophenylalanine (L-DOPA) (Fig. 4c). Many neurotransmitters were virtually absent in GF feces, as indicated by peak intensities as low as detection limits even on ultra-sensitive mass spectrometry assays of ours. This suggests an essential role of resident microbiota for neurotransmitter production. Several neuroendocrine signaling molecules, namely glycine, cortisol, adenosine, and 2-aminoadipate embraced an opposite trend (Fig. 4d), suggesting endogenous sources other than influences from microbiota.
We took turns to specifically examine two major neurotransmitter pathways, namely catecholamine biosynthesis (Fig. 4e) and the glutamine/glutamate-GABA cycle (Fig. 4f), since both have been suggested recently as potential routes for microbes to modulate host neurotransmission extending beyond the GI tract to the entire body. Catecholamines, namely dopamine, norepinephrine, and epinephrine, are biogenic amine neurotransmitters with crucial functions in motivation, reward, and hedonistic regulation27. Consistent with previous data28, our results showed markedly higher fecal levels of norepinephrine (2.9-fold), epinephrine (3.2-fold), as well as two dopamine precursors tyramine (5.7-fold) and L-DOPA (4.7-fold). Though fecal dopamine levels manifested no statistical difference, we found two related sulfate-conjugates (dopamine-3-O-sulfate and dopamine-4-sulfate) substantially elevated when microbiota was present (Fig. 4e), warranting future analyses. Glutamine/glutamate-GABA metabolism, coupling with much altered tricarboxylic acid (TCA) cycle, was also perturbed in the gut, as represented by decreased L-glutamine (2.3-fold) and elevated L-glutamate (2.5-fold) and GABA (6.8-fold) with the presence of microbiota (Fig. 4f).
Unexpectedly, we observed a realm of conjugated compounds altered owing to microbiota, involving glucuronides and sulfates of neurotransmitters (Fig. 4g). We constructed a heatmap of them, with 28 enriched in CONV-R feces and five with opposite trends (Fig. 4g). Based on extensive literature search, we highlighted those with neuromodulatory properties in bold texts, including 11 compounds much diminished (e.g., N-acetylserotonin glucuronide) and three enriched (e.g., dopamine 4-sulfate) when microbiota was present, alongside other conjugates of diet-derived flavonoids (e.g., genistein 4’-O-glucuronide), bile acids (e.g., taurocholate 3-sulfate), and indole derivatives (e.g., indole-3-carboxylate-O-sulfate, indole-3-acetate O-glucuronide) (Fig. 4g). These data together support that the present microbiota plays an integral role in neurotransmitter metabolism in the local gut lumen in terms of production, transformation, and bioavailability, warranting future efforts to delineate microbial species-level contributions in a time- and space-specific manner.
Microbial impacts on circulating blood metabolism and fecal-blood exchange
With detailing of microbiota’s control over gut metabolism and neurotransmitter profiles, the question arises as to whether these local effects will propagate from gut to peripheral organs. To investigate this, we profiled the circulating blood sera of GF and CONV-R mice. We observed systemic perturbations, though not as striking as the fecal data, for serum metabolome due to microbiota (Fig. 5a–c), with 3,977 significant ion features detected that accounted for 21.0% of total aligned features. We successfully resolved 231 unique structures from these altered features (Supplementary Data 3).
To infer, we performed chemical similarity enrichment and quantitative pathway enrichment analyses. ChemRICH plot clustered the serum metabolites into 27 chemical classes, spanning carnitines, dipeptides, indoles alongside lipid species, amino acids, and organic acids (Fig. 5d; Supplementary Data 8, 9). We discovered that 175 out of the total 231 metabolites (75.8%) were downregulated in the presence of microbiota, also indicated by the prevailing purple-to-red colors (Fig. 5d). qMSEA analysis revealed for circulating blood 61 significantly enriched pathways (adjusted p < 0.05) (Supplementary Data 10), with top 50 shown (Fig. 5e). In line with fecal data, top enriched serum pathways spanned from porphyrin metabolism, ketone body metabolism, bile acid biosynthesis to numerous amino acid metabolism, including the aromatic amino acid family (Fig. 5e). The results show that microbiota has systemic effects and can modulate peripheral blood circulation, raising possibilities of a microbiota-gut-brain axis through humoral transport of molecular cues in vivo.
Similar to fecal data, we focused on individual serum metabolites for in-depth analyses to delineate potential gut-blood network and neuroendocrine pathways. Random forest analyses ranked all altered 231 serum metabolites by contribution to group separation, with top 50 shown (Fig. 6a). In addition to metabolites for energy processes (e.g., acylcarnitines, oligopeptides, ketone bodies), we noted a realm of microbial products in sera. For example, the top-ranked 4-ethylphenol, is a known tyrosine metabolite synthesized by bacteria29,30. Phenyl sulfate ranked third with surging levels in CONV-R sera (173.33-fold), is a confirmed gut-microbiota-derived uremic toxin contributing to albuminuria in diabetic kidney disease31 (Fig. 6a; Supplementary Data 3). Another phenolic derivative N-(2-phenylacetyl)glycine (PAG), with 10.5-fold elevation with microbiota, was ranked the fifth place. In the VIP chart, we also observed several tryptophan indole derivatives enriched in CONV-R serum. These included indoxyl sulfate (4,351.6-fold), IPA (7.3-fold), methyl indole-3-acetate (4.3-fold) alongside other indoles (Supplementary Fig. 2a). The results together support that microbiota’s systemic effects on the host involve humoral transport of microbial molecular cues.
To detail the microbiota-serum metabolome network, we focused on aromatic amino acid pathways (Fig. 6b) with multiple top-ranked phenolics and indoles on the VIP chart (Fig. 6a). Compared with fecal data (Fig. 3b), we observed for CONV-R sera overall a consistent pattern, such as enhanced fluxes of tryptophan catabolism featuring decreased kynurenine and enriched indoles. That said, interestingly, we noted that serum serotonin levels, though at high peak intensity (area, 108), were not statistically different between the CONV-R and GF mice, which was in conflict with studies illuminating microbiota’s regulation of peripheral serotonin levels32. We re-evaluated our data and discovered that serum serotonin levels were markedly higher in male CONV-R mice than in male GF mice (p = 0.0022, pairwise Wilcoxon rank-sum test, with a CONV-R/GF fold change of 1.6), while female mice exhibited no statistical difference (p = 0.18) (Supplementary Fig. 2b), suggesting gender as a potential factor in gut microbial effects on mammalian circulating serotonin levels. We also examined serum neurotransmitter and neuromodulatory compounds (Supplementary Fig. 2c) and noted that glycine, cortisol, and 2-aminoadipate shared a similar trend with fecal data (decreased in CONV-R mice). Whereas histamine and GABA embraced an opposite trend with markedly lower levels in CONV-R than in GF sera. Surprisingly, CONV-R-gut-enriched neurotransmitters such as acetylcholine, epinephrine, and taurine were either below detection levels in serum or exhibited no statistical difference between groups. The results together show that microbiota’s modulation of peripheral circulating neurotransmitters is strictly compartmentalized.
To further delineate gut-blood exchange, we focused on metabolites of GF/CONV-R difference shared by fecal and serum matrices. In total, 88 metabolite pairs are shown in MetaMapp network graphs33 for a side-by-side comparison (Fig. 6c, d). The network analyses generated nine compound clusters, including fatty acids, amino acids, and carnitines/choline. The overall change in serum was smaller than in feces, as indicated by generally smaller node sizes and larger adjusted p-values of Welch’s t-test (Fig. 6c, d). We noted that fatty acid pairs, including microbiota-derived adipate34 and sebacate35 embraced a contrasting gut-blood pattern, with 20 elevated in feces but lowered in serum for microbiota-harboring mice. Conversely, phenolics such as hydroxyphenyllactate and 4-hydroxy-3-methoxycinnamate were lower in feces but higher in blood sera for CONV-R mice. Apart from these two, other compound classes showed similar fecal-blood patterns of change, spanning amino acid derivatives (including indoles), nucleotides, steroids, and carnitines/choline. The results support that microbiota’s impacts on local gut epithelial homeostasis, energy harvest, and signaling can translocate and propagate to peripheral blood circulation, raising further possibilities of humoral transport and effects on host CNS.
Integrated analyses of cerebral cortical brain, feces, and blood serum metabolomes
To further determine whether commensal microbiota modulates brain biochemistry, we profiled the metabolome of cerebral cortical brain tissues comparing GF and CONV-R mice. We focused on the cerebral cortex region because it is the largest site of neural integration (occupying over two-thirds of mammalian brain mass), carries pivotal roles in brain functions such as thinking, memory, language, and consciousness36,37, and importantly, cerebral cortical injuries are often linked to onset of mental disorders such as depression38 among which many also are common comorbidities to gut dysbiosis. In total, we identified 58 altered metabolites in cerebral cortical brain tissues (Supplementary Data 4). To interpret these, we constructed a network graph of all metabolites and ranked them based on a random forest model (Fig. 7a). The network showed changes in diverse compounds, as represented by indoles, amino acids, nucleotides, and fatty acids. On the VIP chart, we discovered an array of markers and pathways related to oxidative stress and reactive oxygen species (ROS) reaction. For example, the methionine/glutathione transsulfuration pathway was extensively perturbed in brain, as characterized by elevated levels of L-methionine (1.47-fold), S-(5’-adenosyl)-L-homocysteine (SAH, 1.30-fold), L-glutathione (GSH, 15.1-fold), and 5’-deoxy-5’-methylthioadenosine (1.62-fold) when microbiota was present (Supplementary Fig. 3)39,40. We also noted for CONV-R brain elevated levels of two eicosanoids prostaglandin B2 (1.8-fold) and E2 (1.7-fold), α-ketoglutarate (AKG, 1.4-fold) (a sensitive oxidative stress indicator) as well as the antioxidative ketone body 3-hydroxybutyrate (2.0-fold). In addition to generic markers, we discovered a range of microbiota-derived metabolites implicated in redox homeostasis. These spanned short-chain fatty acids (e.g., butyrate, caproate), indoles (e.g., indoxyl sulfate, indole-3-lactate), trimethylamine-N-oxide (TMAO), shikimate and phenylalanine derivatives (e.g., hydroxyphenyllactate, PAG), all elevated in CONV-R brain tissues compared with GF mice (Fig. 7a). The results support that commensal microbiota can largely mediate, directly or indirectly, redox homeostasis, energy metabolism, as well as neuronal signaling inside the mammalian CNS.
To further interpret in light of the gut-brain axis, we compared multiple matrices for an integrated analysis, with examples shown in boxplots (Fig. 7b). We noted under heated ESI- analysis mode, ion feature m/z 212.002 at the retention time of 4.9 minutes was much elevated in CONV-R mice while virtually absent in GF mice, with highest fold changes in both sera (4,351.6-fold) and brain tissues (26.8-fold) (Fig. 7b). In silico formula analyses determined the elemental composition of this ion feature to be C8H6NO4S ([M-H]-), based on which the neutral structure was dereplicated and confirmed as indoxyl sulfate (Fig. 7c). Indoxyl sulfate was a known uremic toxin and cardiotoxin that originated from host-microbiota metabolism of dietary tryptophan. Our results indicated that microbiota-derived indoxyl sulfate may cross the BBB and reach the CNS, which was consistent with two emerging studies41,42. Similar to indoxyl sulfate which has microbial relevance and shared trends across compartmental matrices, we also noted two other aromatic amino acid derivatives indole-3-lactate and PAG that were enriched in CONV-R mice. Other potential humoral pathways indicated in this study were exemplified for CONV-R mice by elevated levels of TMAO, methionine, vitamin B5 in all three matrices and decreased levels of allantoin and corticosterone in serum and brain tissues (Fig. 7b; Supplementary Data 1–4).
Microbial rewiring of host biochemistry has gender-specific characteristics
Sex dimorphism has been increasingly identified in host-microbiota interaction43. Here, to assess the gender-specificity of microbiota-metabolome signatures in light of the humoral gut-brain axis, we revisited aligned feature tables and performed two-way analysis of variance (ANOVA) for all ion features considering both variables of microbiota (GF/COVN-R) and gender (male/female). For features with significant main effects of microbiota, a considerable proportion also exhibited microbiota×gender interaction in feces (32.6% out of 63.0%, i.e., 51.7%, HESI+), while less for sera (36.5%, HESI+) and least for the cortical brain (10%, HESI+) (adjusted p < 0.05) (Fig. 8a); for a more general check, a main effect of gender has been identified for a number of features in all three matrices, among which fecal features embraced the largest proportion with microbiota×gender interaction (82.8%, HESI+), while less for sera (51.3%, HESI+) and least for brain tissues (11.0%, HESI+) (Fig. 8b). Grouping data by gender, post hoc Tukey’s honestly significant difference (HSD) test was conducted comparing GF and CONV-R (adjusted p < 0.05) to examine whether individual features of significant main effects of microbiota exhibited sex-specificity (i.e., only of GF/CONV-R difference either in male or female) with the gender-specific distribution summarized in Venn diagrams (Supplementary Fig. 4) and pie charts (Fig. 8c–e). In parallel, we revisited the annotation tables to examine gender specificity of the altered metabolites, yielding PieDonut distributions (Fig. 8f–h) that embraced a consistent trend with ion feature pie chart (i.e., feces < sera ≈ brain) in terms of the proportion of sex-specific molecular signatures. Beyond GF/CONV-R comparison and for a more general check, we also performed Tukey’s HSD test on GF/CONV-R grouped data to compare separately for male and female, with ion feature distribution summarized in Venn diagrams and pie charts (Supplementary Fig. 5).
To examine sex-specific biochemical details, MetaMapp network graphs (Supplementary Fig. 6–8) and MSEA charts (Supplementary Fig. 9) were generated from sex-specific metabolites of GF/CONV-R difference (Supplementary Data 11). We noted for fecal metabolomes amino acids, oligopeptides, oxylipins, bile acids, and nucleobases that were either only altered in males (67 compounds) or in females (60 compounds) (Supplementary Fig. 6). Specifically, tryptophan catabolites kynurenate, xanthurenate, indole-3-carboxaldehyde, and 4-(2-aminophenyl)-2,4-dioxobutanoate were elevated exclusively in male CONV-R feces compared to GF males (Supplementary Fig. 6a). Known microbial products also exhibited alterations selective to gender, as represented by sebacate and adipate only for male feces (Supplementary Fig. 6a) and by 2,3-butadione, a known bacterial toxin linked to health conditions, such as inflammatory bowel diseases44, cystic fibrosis45, and nonalcoholic fatty liver disease46 that was enriched only in the female fecal metabolome when microbiota was present (Supplementary Fig. 6b). For sera, we observed a realm of lipids, organic acids, and amino acids that exhibited gender-specific qualities (Supplementary Fig. 7). Specifically, for male blood sera, multiple acylcarnitines had differentiated levels between GF and CONV-R (Supplementary Fig. 7a), while female sera involved an array of phospholipids, oxylipins, and long-chain fatty acids that were all lower in CONV-R than in GF (Supplementary Fig. 7b). We also noted several aromatic amino acid derivatives that were selectively altered in females but not in males, including tryptophan catabolites (e.g., indole-3-carboxaldehyde, indole-3-lactate) and ferulate (a neuroprotectant)47. As for metabolites in the cortical brain, we discovered 13 altered specific to male mice and nine to females, with multiple compounds involved in redox homeostasis and energy metabolism (Supplementary Fig. 8). Of note, prostaglandins (B2 and E2), GSH, AKG, and nicotinamide were only altered in male cortical brains (Supplementary Fig. 8a), while SAH, indole-3-lactate, caproate, 2-hydroxybutyrate, and eicosapentaenoate (EPA) underwent changes only in females (Supplementary Fig. 8b). Taken together, the microbiota-metabolome-brain axis exhibited gender-specific characteristics that need to be addressed in future studies of microbiota’s functional roles in the given pathophenotypes.
Discussion
Recent data support that commensal microbiota regulates mammalian brain function, opening doors of targeting microbiota for neuroprotective causes. However, biochemical underpinnings for such microbial effects in light of the gut-brain axis remain largely unclear. Metabolomics can address this but often encounters low compound identification rates due to limited references and tools, such as commercially available chemical standards and mass spectral databases. To expand metabolome coverages, recent upgrade of accurate-mass measurement and cheminformatic tools shows promise for enabling high-throughput in silico prediction of elemental composition, molecular structures, and chromatographic retention time with integrated and validated procedures. In this light, we sought to apply a complete suite of cheminformatic rules and tools (untargeted annotation procedures), alongside the conventional use of an in-house library based on hundreds of reference chemical standards (targeted annotation procedures), to conduct high coverage mapping for the biochemical landscape of the “microbiota-gut-brain axis.” Fecal matter, blood sera, and cerebral cortical brain tissues of GF and CONV-R C57BL/6 mice of 8-week-old were harvested, extracted, and analyzed using high-resolution LC-HESI-HRMS. Using statistics and cheminformatics, we screened 20,939 distinct ion features combining both HESI modes and identified hundreds of altered metabolites, including 533 for fecal, 231 for serum, and 58 for brain tissues. To our knowledge, these datasets represent the first high-coverage metabolome characterization for microbiota’s effects on host biochemistry in light of humoral routes for interorgan transport and gut-brain communication.
We gained a “landscape” view of these microbiota-mediated biochemical changes through both the chemical and pathway enrichment analyses. For metabolism in gut lumen where trillions of microbes reside and constant host-microbiota interaction occurs and converges, a total of 70 chemical class clusters and 71 pathways were significantly perturbed, represented by 14 amino acid pathways (aromatic amino acids in particular), purines, bile acids, porphyrins, vitamins (e.g., B and K), and polyamines alongside massive protein catabolism (over 100 oligopeptides altered) and energy harvest events (e.g., TCA cycle, acylcarnitine shuttle). In compliance with past studies22–24, fecal aromatic amino acids, especially the tryptophan and tyrosine metabolism, stood out with high statistical significance and in the 50 multiple top-ranked metabolite members in random forest classification. We offered an integrated view of fecal aromatic amino acid changes and observed for microbiota-harboring mice (i) enhanced fluxes of tryptophan pathways featuring decreased tryptophan and enriched indoles, serotonin, xanthurenate, and kynurenate, (ii) altered shikimate pathway with lowered shikimate and increased levels of chorismate derivatives, (iii) suppressed folate biosynthesis, and (iv) enriched tyrosine and downstream by-products including catecholamine neurotransmitters. Of significance, we also identified over 30 altered fecal bile acids with the free-form bile acids (primary and secondary) much elevated while conjugated primary bile acids significantly lowered in the presence of microbiota, including those associated with neurodegenerative disease outcomes, such as Alzheimer’s disease11,48.
Systematic effects of microbiota on blood circulation were characterized, as represented by 231 altered metabolites of 27 chemical classes and 61 significantly perturbed pathways spanning porphyrin metabolism, ketone bodies, bile acid biosynthesis, and a range of amino acids, etc. Note that many of the serum pathways were also perturbed in the gut, with the latter generally embracing a larger number of altered metabolites in the given pathway in parallel with compartment-specific details to note. For instance, similar to intestinal changes, microbiota-harboring mice exhibited distinct serum aromatic amino acids featuring (i) enriched levels of microbiota-derived indoles such as indoxyl sulfate and IPA (an important ligand to Pregnane X receptors49,50), (ii) altered shikimate pathway with decreased levels in shikimate-3-phosphate and quinate, and (iii) enriched phenolic metabolites phenyllactae, hippurate, salicylate, and phenyl sulfate, while no differences were detected for tyrosine and associated catecholamine products as observed in the gut. Notably, the shikimate pathway, which mammalian cells lack but heavily involved in microorganisms, were detected to be perturbed by microbiota with effects on other aromatic amino acids and associated neurotransmitter generation (e.g., serotonin, tyramine). This suggests a need to evaluate the microbiota-shikimate-host network and associated health risks, for example, of using shikimate pathway-inhibiting neurotoxic herbicides (e.g., glyphosate) that were long believed to be innocuous for mammals51. On the molecular level, for instance, random forest classification ranked 4-ethylphenol, a microbial metabolite of tyrosine29,30 and the precursor of the microbial 4-ethylphenylsulfate6, as a top metabolite variable to CONV-R/GF group separation. Together, these microbiota-driven systemic changes coupling to gut metabolic alteration may serve as a valuable reference for future studies probing microbial influences on host physiology and health, warranting efforts to further delineate genes, enzymes, and transporters involved as well as specific microbial members in play.
We also applied high-coverage metabolomics searching for brain signatures owing to microbiota (Supplementary Fig. 1). Targeting the cerebral cortical brain region, we sought to find proof-of-principle of the microbiota-brain network. In total, 58 altered brain metabolites were identified, spanning oxidative stress makers from methionine/glutathione metabolites, eicosanoid prostaglandins, ketone bodies, to microbiota-derived metabolites such as butyrate, indoxyl sulfate, TMAO, and shikimate/phenylalanine derivatives hydroxyphenyllactate and PAG. Overall, the results were consistent with recently emerging studies, featuring specific metabolites (e.g., 2-hydroxybutyrate, TMAO, glutathione, and acetylneuraminate)13,52 and pathways (e.g., oxidative stress, transsulfuration)13,53,54, while complementing well with others that touched upon short-chain fatty acids55, microbe-associated molecular patterns (e.g., lipopolysaccharides)56 and phenolics57. Our new data support microbiota’s control over physiological conditions of the CNS, particularly in transsulfuration, redox homeostasis, and neuroinflammation.
One central theme of the microbiome field is to probe the neuromodulatory activities of resident microbiota in light of gut-brain signaling, given their large capacity to produce neurotransmitters and associations with many mental and/or neurologic disease outcomes such as depression7, autism spectrum disorder6,58, Alzheimer’s disease48, and Parkinson’s disease10. However, only a few molecular cues underlying such microbiota-gut-brain axis have been reported to date, leaving the mechanistic underpinnings largely unclear13. Our high-coverage approach incorporating targeted and untargeted annotation procedures allowed us to address this; the metabolomes of GF/CONV-R difference, either with common or opposite trends among sample matrices, indicate possibilities of direct or indirect humoral pathways, as partially supported by reported knowledge concerning interorgan transport of gut microbial metabolites and in turn, their effects on permeabilities of the BBB22,59,60.
We discovered for gut microbiota as a master regulator of neurotransmitter production in the GI tract, as demonstrated in (i) massive depletion of over 15 “classical” neurotransmitters without microbiota, (ii) integrated mapping of changes in catecholamine biosynthesis and glutamine/glutamate-GABA metabolism, as well as (iii) tryptophan pathway-derived neurotransmitters (e.g., serotonin, kynurenate). Further, we found extensive control of microbiota over their transformation as well, particularly through deconjugation. The results complement well recent studies confirming gut bacteria as neurotransmitter producers while supporting local gut microbial capacities to reactivate phase II metabolites from their conjugated forms (e.g., glucuronides, sulfates), likely through hydrolytic enzymes such as β-glucuronidases (GUSs) and/or β-glucosidases61,62. Interestingly, except for histamine, glycine, aminoadipate, and cortisol, the majority of neurotransmitters with distinct fecal patterns were either not detected, exhibited no statistical difference, or showed opposite trends (e.g., GABA, histamine) in the serum metabolome; of note, serum serotonin levels were significantly elevated in male CONV-R mice (compared with male GF) but were not significantly different between female groups. These together show that despite the large metabolic activities of the microbe-neurotransmitter network in the intestine, humoral transport and/or systemic effects of these microbiota-derived neurotransmitters are relatively confined in gut and can be affected by gender. Such stringent organ compartmentation and gender-specific characteristics of microbial neurotransmitters need to be studied in the future. That said, we nevertheless discovered a range of compounds potentially involved in interorgan transport and gut-brain communication, for which the presence of microbiota contributed surging levels extending from gut and/or blood to the cerebral cortex. These spanned indoles (e.g., indoxyl sulfate, indole-3-lactate), phenolics (e.g., 4-ethylphenol, phenyl sulfate, PAG), choline derivatives (e.g., TMAO), vitamin B (e.g., pantothenate), butyrate, and methionine, with some virtually absent in GF mice. Of note, indoxyl sulfate, a known uremic toxin and an oxidative stress marker originated from gut bacterial indoxyl63,64, embraced the highest fold changes of elevation in both circulating blood (4,351.6-fold) and brain tissues (26.8-fold) with the presence of microbiota. Likewise, TMAO was ranked the third for highest fold changes in both serum (267.3-fold) and brain (7.32-fold) alongside elevated levels in gut lumen (5.90-fold) owing to microbiota. TMAO, also a uremic toxin and an oxidative stress marker, is an oxidation product of gut bacterial trimethylamine through host hepatic flavin monooxygenase with demonstrated adverse effects in cardiovascular diseases65. On the contrary, antioxidant agents such as pantothenate and Bifidobacterium longum product indole-3-lactate66, were also much elevated in both serum and brain when microbiota was present. Together, these findings buttress the notion that commensal microbiota can mediate the CNS (especially redox homeostasis and neuroinflammation) through direct humoral pathways, but the effects may vary depending on multiple factors such as specific microbiota composition and their metabolites, as well as gender, age, and the health status of mammalian hosts29.
The strength of this study lies in the use of germ-free mouse model, the sensitive and high-coverage metabolomics approach, and a suite of statistics and visualization tools that allows complete profiling and mining of microbiota-specific molecular signatures as well as their multicompartmental comparison for microbiota-gut-brain interpretation. Cautions and limitations of the present study include: (i) the metabolomics results are by nature descriptive, necessitating more focused efforts testing specific hypotheses of humoral interorgan connections through, for example, fecal transplantation experiments and stable isotope tracer-assisted flux analysis of specific pathways in question; (ii) the metabolome changes observed in this study should be considered as molecular phenotypic representations of host-microbiota interaction rather than microbial effects alone35; (iii) use of peak area for statistics instead of absolute molar concentrations essential for inferring actual metabolic activities at given organ matrices; (iv) level 2 annotations confirmed with accurate-mass MS/MS analyses and machine learning-predicted retention time in this study may still fall short for an unambiguous stereoisomeric assignment; (v) biochemical pathways were interpreted for breadth and mostly from a reductionist angle, warranting studies to further delineate at bacteria- and/or pathway-specific levels and to investigate interaction between seemingly distinct chemical classes. In summary, we present here a high-coverage metabolomics approach for characterizing microbiota’s effects on host metabolome in light of interorgan transport and gut-brain crosstalk in mice; the novel findings and insights from this study may prove valuable for future microbiome research.
Methods
Chemicals and reagents
LC/MS-grade (OptimaTM) solvents including methanol (MeOH; catalog #A456-4), acetonitrile (ACN; catalog #A955-4), water (catalog #W6-4) and formic acid (catalog #A117-50), were obtained from Fisher Scientific (Waltham, WA, USA). Stable-isotope-labeling internal standards for a range of classical neurotransmitters and tryptophan catabolites, including acetylcholine-d13 (N,N,N-trimethyl-d9; 1,1,2,2-d4) (d13-ACh; catalog #D-1780), γ-aminobutyric acid-d2 (d2-GABA; catalog #D-1731), L-glutamine-2,3,3,4,4-d5 (d5-Gln; catalog #D-2532), L-tryptophan-2,3,3-d3 (d3-Trp; catalog #D-7419), serotonin-α,α,β,β-d4 (d4-5HT; catalog #D-1550), L-kynurenine (ring-d4, 3,3-d2) (d6-Kyn; catalog #DLM-7842), indole-3-acetic-2,2-d2 acid (d2-IAA; catalog #D-1709) and indole-3-propionic-2,2-d2 acid (d2-IPA; catalog #D-7686) were obtained from CDN Isotopes (Pointe-Claire, Quebec, Canada) and Cambridge Isotopes (Tewksbury, MA, USA). For quality control, these internal standards were spiked in sample pretreatment for monitoring sample recovery and analytical variability.
Animals
All animal procedures were approved by the Institutional Animal Care & Use Committee (IACUC) at The University of North Carolina at Chapel Hill (UNC) under study protocol number 19-235.0. Conventionally raised (CONV-R) wild-type C57BL/6 mice were obtained from the Jackson Laboratory (Bar harbor, ME, USA) and housed under specific-pathogen-free (SPF) conditions at the UNC animal facility for multiple generations; germ-free (GF) mice were generated and housed in stringently germ-free chambers for generations at the National Gnotobiotic Rodent Resource Center of UNC in Association for Assessment and Accreditation of Laboratory Animal Care International (AAALAC)-accredited facilities (Chapel Hill, NC, USA). C57BL/6 mouse littermates of ~7 weeks old raised under GF and conventional conditions were age-matched upon selection for sacrifice and analyses, resulting in a final total of 24 mice, including 12 GF mice and 12 SPF CONV-R mice with 6 males and 6 females included in each. The animals were raised under the following conditions: 22 °C, 40–70% humidity, and a daily 12:12 h light-dark cycle. For dietary administration, we consistently fed all mice the same purified (and sterile) standard Prolab RHM 3000 pelleted rodent diets (St. Louis, MO, USA) and provided tap water ad libitum. Prior to sacrifice and sample harvest, all mice were observed under their original housing conditions for one week; animals were not considered if they exhibited significant signs of serious injury or morbidity (e.g., malocclusion or fight wounds). Upon euthanization in CO2 chambers, feces, brain, and blood sera were harvested, snap-frozen, and stored in -80 °C freezer before analysis.
Sample extraction
Fecal matter, blood sera, and brain tissues (of the cerebral cortical region) were aliquoted and extracted for metabolome analyses. To maximize coverage and throughput, extraction procedures were kept as simple and nonselective as can be. For feces, ~20 mg thawed samples were aliquoted to a 1.5-mL Eppendorf tube (Hamburg, Germany) with ~30 mg acid-washed glass beads added (Sigma-Aldrich, St. Louis, MO). For every 10 mg feces aliquots, 300 μL of ice-cold MeOH:water (50:50, v/v) with spiked ISDs were added for extraction. The samples were homogenized on a TissueLyzer (Qiagen, Hilden, Germany) at 50 Hz for 10 min and centrifuged at 12,000 × g for 10 min (Eppendorf, Hamburg, Germany). A 100-μL aliquot of the supernatant was transferred and dried in a CentriVap vacuum evaporator (Labconco, MO, USA). For serum, 20 μL was aliquoted from each sample in a 1.5-mL Eppendorf tube (Hamburg, Germany). A total of 180 μL ice-cold ISD-containing MeOH was added, vortexed, and incubated at -20 °C freezer for 30 min. The extracts were centrifuged at 15,000 × g for 10 min for precipitation of particulates and proteins. Aliquoted 100-μL of supernatants were SpeedVac-dried as described above. For cerebral cortical brain tissues, ~20 mg was aliquoted from each sample to a 2-mL screw cap microcentrifuge tubes (VWR, Radnor, PA, USA) that contained clean stainless-steel beads of 5-mm i.d. To every 20 mg tissue, 400 μL of the ice-cold ISD-containing MeOH was added for extraction. The samples were homogenized on a TissueLyzer (Qiagen, Hilden, Germany) at 50 Hz for 2 min and further incubated at -20 °C for 1 h prior to centrifugation at 18,000 × g for 10 min. One hundred-μL of the extractant supernatants were SpeedVac-dried as described before. Upon instrumental analysis, all dried extracts, including feces, sera, and brain tissue, were resuspended in 50 μL ACN:water (2:98, v/v) for injection.
Instrumental analysis
Global metabolome profiling was performed on a Thermo Fisher Scientific Vanquish UHPLC coupling to a high-resolution accurate-mass (HRAM) Q-Exactive mass spectrometer interfacing with a heated electrospray ionization (HESI) source and a hybrid quadrupole-orbitrap mass analyzer (Waltham, MA, USA). For chromatographic separation, a Waters Acquity UPLC HSS T3 column (100Å, 1.8 μm, 2.1 mm × 100 mm) was used (Milford, MA, USA). The mobile phases consisted of 0.1% formic acid in water (A) and 0.1% formic acid in ACN (B), flowing at a rate of 0.4 mL/min under 40 °C with a 15-min gradient: 2% B, 0–1 min; 2 to 15% B, 1–3 min; 15 to 50% B, 3–6 min; 50 to 98% B, 6–7.5 min; 98% B, 7.5–11.5 min; 98 to 2% B, 11.5–11.6 min; 2% B, 11.6–15 min. HESI positive and negative ionization were both performed, with following ion source setting: sheath gas 60 L/min, aux gas 10 L/min, sweep gas 1 L/min, spray voltage 2.75 kV, capillary 325 °C, and aux gas heater 400 °C. All mass spectral data (in *.RAW format) were acquired through the Thermo XCalibur program (version 4.1) (Waltham, MA, USA). For MS1 full scan profiling (Q not used) aiming at examining global metabolomic profiles while screening for distinct ion features, the mass spectrometer was operated scanning mass range of 80-1,000 Da with a mass resolution of 70,000 full width at half height (FWHM) at 200 Da. For MS/MS spectra acquisition in later unknown structural annotation stages (with both Q and orbitrap running), the mass spectrometer was operated scanning for 80-1,000 Da in full-scan MS1 mode (FWHM 70,000; AGC target 3e6; max IT 200 ms) alternating with parallel reaction monitoring (PRM) MS/MS mode with normalized collision energy averaging 10, 50 and 100 (FWHM 17 500; AGC target 2e5; max IT 50 ms).
Data processing, statistics, and visualization
MS1 full-scan *.RAW data were converted to *.mzXML files in ProteoWizards MS Convert (Palo Alto, CA, USA) and uploaded to Scripps XCMS67 (La Jolla, CA, USA) for data processing, resulting in master peak tables aligning all samples separately from three sample types acquired under two HESI polarity modes. The dataset was analyzed in R by Welch’s t-test with adjusted p-values (i.e., q values) calculated based on the Benjamin-Hochberg procedure to compare GF and CONV-R groups. To further determine the effect of gender, two-way analysis of variance (ANOVA) was conducted. Raw peak areas for each ion feature were pareto-scaled and log-transformed to meet the test assumptions of ANOVA (e.g., no significant outliers, normality, and homogeneity of variance). Then, two-way ANOVA and post hoc Tukey’s (honestly significant difference) HSD test were performed to determine the main effects of microbiota and gender and to test for within-group statistical difference, respectively. Plots and analyses including Box and Whisker plot, barplot, principal component analysis score and scree plot, variable importance plot, PieDonut chart, as well as heatmaps were generated in R (version 4.0.1) (Vienna, Austria) using packages including readxl (version 1.3.1), dplyr (version 1.0.6), tidyverse (version 1.3.1), ggplot2 (version 3.3.3), ggpubr (version 0.4.0), ggthemes (version 4.2.4), ggsignif (version 0.6.1), RColorBrewer (version 1.1.2), dendextend (version 1.15.1), randomForest (version 4.6.14), FactoMineR (version 2.4), factoextra (version 1.0.7), pheatmap (version 1.0.12), wesanderson (version 0.3.6), and webr (version 0.1.5)68. Specifically for principal component analysis, we performed Pareto scaling first and included all ion features from the alignment table for the given sample matrix under specific heated ESI mode; for a showcase, the PCA score plots were generated using all 17,386 ion features for feces (HESI+) (Fig. 2a, b) and 10,012 features in total for blood sera (HESI+) (Fig. 5a, b). Random forest classification was conducted using the randomForest (version 4.6.14) and rfPermute (version 2.2) packages of R (Vienna, Austria) to generate variable importance plots to rank individual metabolites, with hyperparameters and permutation results provided (Supplementary Information)69. Metabolite networks were constructed using a MetaMapp approach33 (web-based portal, version 2020) that calculated biological pathways relevance (KEGG reactant pairs) and chemical structural similarity (Tanimoto coefficient > 0.7) which were further visualized in CytoScape with version 2.7.2 for Window 10 OS and version 3.8.0 for macOS (Seattle, WA, USA). Chemical similarity enrichment analysis (ChemRICH) plots were constructed to visualize changing metabolites based on chemical classes (Davis, CA, USA)19. Quantitative metabolite set enrichment analysis (qMSEA) was conducted on altered metabolites in MetaboAnalyst 4.0 (Montreal, QC, Canada) using their PubChem CID identifiers and log-transformed peak area pairs as data input to compute based on 99 a priori defined sets of metabolites in pathway-associated metabolites sets (SMPDB); metabolite sets that contained at least two compounds were used.
Compound identification
Since the millennium, untargeted/global metabolomics has been fast evolving with significant development in gas/liquid chromatography-mass spectrometry technologies, featuring the rise of accurate-mass measurement that has since opened doors of confident de novo determination of elemental composition and chemical structure in a high-throughput, sensitive, and nonselective manner70. The conceptual differences between the untargeted analysis, targeted analysis, and the like in metabolomics necessitates clear elucidation. The untargeted/global metabolomics analysis aims to detect as many features as possible in a single run and after a differential statistical comparison, the significant ones are sent to compound identification for hypothesis generation; annotation strategies can be either knowledge-based/targeted (matching against a predetermined in-house spectral library or experimental databases) or data-driven/untargeted (inferring directly from data, e.g., using streamlined in silico cheminformatic algorithms and scoring). While, in the case of targeted metabolomics, one has previous information of what he/she will look for, such as precursor/fragment transitions, and usually seeks a quantitative analysis to determine the absolute concentration levels using commercial labeled standards. In this study, we conducted global full-scan profiling of all ion features, performed statistics for those aligned across all samples, and leveraged both targeted (with known reference) and untargeted (infer from data directly) cheminformatic strategies for annotation with goals to identify/annotate all ion features with statistical significance (Supplementary Fig. 1). We refer to our approach as “high-coverage annotation,” since both the knowledge-based/targeted and data-driven/untargeted annotation strategies were performed on all significant ion features in a streamlined manner, allowing for integrated scoring, cross-validation, and postretention time prediction and filtering that helped achieve highest metabolome coverage possible.
Specifically, we collected a total of 20,939 tandem mass spectra of distinct ion features (pertaining to the microbiota-gut-brain connection) for structural annotation. These included 16,001 for feces, 3,977 for sera, and 961 for cerebral tissues combining HESI positive and negative modes. The *.RAW data were converted into *.abf format using the Reifycs Abf Converter (https://www.reifycs.com/AbfConverter/), uploaded to MS-DIAL 4.16 (Riken, Japan)71 to obtain *.MAT files that contained MS1 accurate mass, MS1 isotopic abundances, MS/MS spectra, and retention time for each ion feature with an acquired tandem mass spectrum. The targeted annotation procedure exploits an in-house experimental library established from 423 authentic chemical standards of canonical pathways, neurotransmitters, and literature-reported microbial metabolites; the local library stores chromatographic retention time, accurate mass, and characteristic MS2 fragment ions. For untargeted analyses, a cutting-edge cheminformatic pipeline was used integrating machine learning-based retention time predication (Retip)16, embedded experimental mass spectral database search (GNPS72, MassBank73, and ReSpect74, totaling 28,293 spectra), and rule-based in silico cheminformatic analysis (MS-FINDER, version 3.30)15 which integrates a number of cheminformatic algorithms while querying major biomolecule chemical databases, such as HMDB, LipidMAPS and PubChem Biomolecules adding up to over 100 million structures. The detailed settings of MS-FINDER are provided in Supplementary Information (Page 12). Importantly, the elemental composition was predicted (based on MS1 accurate mass, MS1 isotopic ratios, and MS/MS product ions) and filtered based on the Seven Golden Rules14, which refers to an algorithm consisting of seven heuristic rules, i.e., (i) restriction for element numbers, (ii) LEWIS and SENIOR rules, (iii) isotopic patterns, (iv) H/C ratios, (v) element ratios of N, O, P, S vs. C, (vi) elemental ratio probabilities, and (vii) presence of trimethylsilylation (for GC/derivatization platforms if applicable). Afterwards, the top five candidates were sent to structural dereplication, and chromatographic retention times were predicted afterward for these structures using the Retip (version 0.5.4) and h2o (version 3.32.1.3) package of R (Vienna, Austria) and compared with true values for lipophilicity validation and annotation cleanup. The resultant top-score structures were manually assessed and curated carefully to boost annotation confidence; the postcuration strategies included, but not limited to, chemical inference, biological relevance, comparison of targeted and untargeted results, and the use of additional experimental mass spectral database search including Metlin (La Jolla, CA, USA), MoNA (Davis, CA, USA), HMDB (Edmonton, AB, Canada), and mzCloud (Waltham, MA, USA).
The application of specific quality criteria to filter out low-quality MS/MS data and annotations is not only common but essential in untargeted metabolomics practices. In this study, however, we did not exclude any ion features of low abundance (under MS1 full-scan) for MS/MS acquisition, nor did we apply thresholds of minimum/noise of ion intensity for filtering out low-quality/noisy tandem MS/MS. This is because (i) we aimed for untargeted annotation in an unbiased manner for all ion features regardless of low abundances, and (ii) our integrated and streamlined scoring strategy carried a penalty mechanism itself (Supplementary Fig. 1c), meaning, if the tandem mass spectrum per se is too noisy or the quality is poor, the resultant scores will be correspondingly low, and the annotations will be naturally filtered out from the final annotation list (Supplementary Fig. 1d). For the finalized tables of the changed fecal metabolome (Supplementary Data 2), serum metabolome (Supplementary Data 3), and brain metabolome (Supplementary Data 4), we only included identification/annotation results of highest confidence, i.e. level 1 confirmed structure and level 2 probable structure according to the Metabolomics Standard Initiative (MSI) to best inform future studies on microbiota in health and diseases17,75,76.
Supplementary information
Acknowledgements
This study was funded by National Institute of Environmental Health Sciences (NIEHS) through award nos. R01ES024950, R03ES032067, R35ES028366, P42ES031007, and P30ES010126. We especially thank the instrumentation support from the Chemistry and Analytical Core (CAC) of the UNC Superfund Research Program (P42ES031007). The first author also acknowledges partial funding from the Chen-Yu Yen & Whay-Ray C. Yen Graduate Fellowship awarded by UNC Gillings School of Global Public Health (2018-2019). The authors wish to thank Dr. Oliver Fiehn, Dr. Tobias Kind, and Sili Fan (UC Davis, CA, USA), Dr. Yan Ma (NIBS, Beijing, China), Dr. Hiroshi Tsugawa (Riken, Saitama, Japan), Dr. Dinesh Barupal (Icahn School of Medicine at Mount Sinai, NY, USA) and the team for the dedicated development, guidance, and technical support of the many essential cheminformatic tools and databases used in this study.
Source data
Author contributions
K.L., Y.L. and H.R. conceived the study. Y.L. and C.-W.L. performed instrumental analyses. Y.L. conducted statistical analyses, compound identification, and data visualization. Y.Y., Y.-C.H. and Y.L. raised animals and harvested samples. Y.L. wrote the original manuscript; Y.L. and K.L. revised the manuscript. All contributing authors have agreed to the submission of this manuscript for publication.
Data availability
The relevant metabolomics raw data and master alignment ion feature tables generated for this study have been deposited in the National Metabolomics Data Repository (accession IDs: ST001756, ST001757, and ST001758) with Project No. PR001126 (10.21228/M8569X). All source data for figures and tables of this work are available online at Zenodo (10.5281/zenodo.5016278) and/or GitHub (https://github.com/darciliz/nc_hcm_code); requests for additional information or data can be addressed to the corresponding author. Experimental and in silico mass spectral databases used for postcuration/validation purposes included MassBank of North America (MoNA) (https://mona.fiehnlab.ucdavis.edu/), Metlin (https://metlin.scripps.edu/), HMDB (https://hmdb.ca/), and mzCloud (https://www.mzcloud.org/). Source data are provided with this paper.
Code availability
R Codes and associated data for reproducing part of the figures in this study have been deposited at GitHub (https://github.com/darciliz/nc_hcm_code). Unless noted otherwise, all other analyses including ChemRICH and MetaMapp were done using web-based portal and visualized in standalone GUI software such as Adobe Illustrator (version 2020; 2021) and CytoScape (version 2.7.2 for Window 10 OS and version 3.8.0 for macOS).
Competing interests
The authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks Alma Villaseñor and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-021-26209-8.
References
- 1.Turnbaugh PJ, et al. The human microbiome project. Nature. 2007;449:804–810. doi: 10.1038/nature06244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Turnbaugh PJ, et al. An obesity-associated gut microbiome with increased capacity for energy harvest. Nature. 2006;444:1027–1031. doi: 10.1038/nature05414. [DOI] [PubMed] [Google Scholar]
- 3.Hooper LV, Littman DR, Macpherson AJ. Interactions between the microbiota and the immune system. Science. 2012;336:1268–1273. doi: 10.1126/science.1223490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Wells JM, Rossi O, Meijerink M, van Baarlen P. Epithelial crosstalk at the microbiota–mucosal interface. Proc. Natl Acad. Sci. 2011;108:4607–4614. doi: 10.1073/pnas.1000092107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Cryan JF, O’Riordan KJ, Sandhu K, Peterson V, Dinan TG. The gut microbiome in neurological disorders. Lancet Neurol. 2020;19:179–194. doi: 10.1016/S1474-4422(19)30356-4. [DOI] [PubMed] [Google Scholar]
- 6.Hsiao ElaineY, et al. Microbiota modulate behavioral and physiological abnormalities associated with neurodevelopmental disorders. Cell. 2013;155:1451–1463. doi: 10.1016/j.cell.2013.11.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Valles-Colomer M, et al. The neuroactive potential of the human gut microbiota in quality of life and depression. Nat. Microbiol. 2019;4:623–632. doi: 10.1038/s41564-018-0337-x. [DOI] [PubMed] [Google Scholar]
- 8.de la Fuente-Nunez C, Meneguetti BT, Franco OL, Lu TK. Neuromicrobiology: How microbes influence the brain. ACS Chem. Neurosci. 2018;9:141–150. doi: 10.1021/acschemneuro.7b00373. [DOI] [PubMed] [Google Scholar]
- 9.Needham BD, Kaddurah-Daouk R, Mazmanian SK. Gut microbial molecules in behavioural and neurodegenerative conditions. Nat. Rev. Neurosci. 2020;21:717–731. doi: 10.1038/s41583-020-00381-0. [DOI] [PubMed] [Google Scholar]
- 10.Kim S, et al. Transneuronal propagation of pathologic α-synuclein from the gut to the brain models Parkinson’s disease. Neuron. 2019;103:627–641. doi: 10.1016/j.neuron.2019.05.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.MahmoudianDehkordi S, et al. Altered bile acid profile associates with cognitive impairment in Alzheimer’s disease-An emerging role for gut microbiome. Alzheimers Dement. 2019;15:76–92. doi: 10.1016/j.jalz.2018.07.217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Luan H, Wang X, Cai Z. Mass spectrometry-based metabolomics: targeting the crosstalk between gut microbiota and brain in neurodegenerative disorders. Mass Spectrom. Rev. 2019;38:22–33. doi: 10.1002/mas.21553. [DOI] [PubMed] [Google Scholar]
- 13.Konjevod M, et al. Metabolomics analysis of microbiota-gut-brain axis in neurodegenerative and psychiatric diseases. J. Pharm. Biomed. Anal. 2021;194:113681. doi: 10.1016/j.jpba.2020.113681. [DOI] [PubMed] [Google Scholar]
- 14.Kind T, Fiehn O. Seven Golden Rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry. BMC Bioinforma. 2007;8:105. doi: 10.1186/1471-2105-8-105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Tsugawa H, et al. Hydrogen rearrangement rules: computational MS/MS fragmentation and structure elucidation using MS-FINDER software. Anal. Chem. 2016;88:7946–7958. doi: 10.1021/acs.analchem.6b00770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Bonini P, Kind T, Tsugawa H, Barupal DK, Fiehn O. Retip: retention time prediction for compound annotation in untargeted metabolomics. Anal. Chem. 2020;92:7515–7522. doi: 10.1021/acs.analchem.9b05765. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Schymanski EL, et al. Identifying small molecules via high resolution mass spectrometry: communicating confidence. Environ. Sci. Technol. 2014;48:2097–2098. doi: 10.1021/es5002105. [DOI] [PubMed] [Google Scholar]
- 18.Powell N, Walker MM, Talley NJ. The mucosal immune system: master regulator of bidirectional gut–brain communications. Nat. Rev. Gastroenterol. Hepatol. 2017;14:143–159. doi: 10.1038/nrgastro.2016.191. [DOI] [PubMed] [Google Scholar]
- 19.Barupal DK, Fiehn O. Chemical similarity enrichment analysis (ChemRICH) as alternative to biochemical pathway mapping for metabolomic datasets. Sci. Rep. 2017;7:14567. doi: 10.1038/s41598-017-15231-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Frolkis A, et al. SMPDB: the small molecule pathway database. Nucleic Acids Res. 2010;38:D480–D487. doi: 10.1093/nar/gkp1002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Chong J, et al. MetaboAnalyst 4.0: towards more transparent and integrative metabolomics analysis. Nucleic Acids Res. 2018;46:W486–W494. doi: 10.1093/nar/gky310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cervenka, I., Agudelo, L. Z. & Ruas, J. L. Kynurenines: Tryptophan’s metabolites in exercise, inflammation, and mental health. Science357, eaaf9794 (2017). [DOI] [PubMed]
- 23.Platten M, Nollen EAA, Rohrig UF, Fallarino F, Opitz CA. Tryptophan metabolism as a common therapeutic target in cancer, neurodegeneration and beyond. Nat. Rev. Drug Disco. 2019;18:379–401. doi: 10.1038/s41573-019-0016-5. [DOI] [PubMed] [Google Scholar]
- 24.Dodd D, et al. A gut bacterial pathway metabolizes aromatic amino acids into nine circulating metabolites. Nature. 2017;551:648–652. doi: 10.1038/nature24661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Strandwitz P. Neurotransmitter modulation by the gut microbiota. Brain Res. 2018;1693:128–133. doi: 10.1016/j.brainres.2018.03.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Mittal R, et al. Neurotransmitters: The Critical Modulators Regulating Gut-Brain Axis. J. Cell Physiol. 2017;232:2359–2372. doi: 10.1002/jcp.25518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Volkow ND, Wise RA, Baler R. The dopamine motive system: implications for drug and food addiction. Nat. Rev. Neurosci. 2017;18:741–752. doi: 10.1038/nrn.2017.130. [DOI] [PubMed] [Google Scholar]
- 28.Asano Y, et al. Critical role of gut microbiota in the production of biologically active, free catecholamines in the gut lumen of mice. Am. J. Physiol.-Gastrointest. Liver Physiol. 2012;303:G1288–G1295. doi: 10.1152/ajpgi.00341.2012. [DOI] [PubMed] [Google Scholar]
- 29.Nicholson JK, et al. Host-gut microbiota metabolic interactions. Science. 2012;336:1262–1267. doi: 10.1126/science.1223813. [DOI] [PubMed] [Google Scholar]
- 30.Bone E, Tamm A, Hill M. The production of urinary phenols by gut bacteria and their possible role in the causation of large bowel cancer. Am. J. Clin. Nutr. 1976;29:1448–1454. doi: 10.1093/ajcn/29.12.1448. [DOI] [PubMed] [Google Scholar]
- 31.Kikuchi K, et al. Gut microbiome-derived phenyl sulfate contributes to albuminuria in diabetic kidney disease. Nat. Commun. 2019;10:1835. doi: 10.1038/s41467-019-09735-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Yano JM, et al. Indigenous bacteria from the gut microbiota regulate host serotonin biosynthesis. Cell. 2015;161:264–276. doi: 10.1016/j.cell.2015.02.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Barupal DK, et al. MetaMapp: mapping and visualizing metabolomic data by integrating information from biochemical pathways and chemical and mass spectral similarity. BMC Bioinforma. 2012;13:99. doi: 10.1186/1471-2105-13-99. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Karlsson E, Mapelli V, Olsson L. Adipic acid tolerance screening for potential adipic acid production hosts. Microb. Cell Factories. 2017;16:20. doi: 10.1186/s12934-017-0636-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Visconti A, et al. Interplay between the human gut microbiome and host metabolism. Nat. Commun. 2019;10:4505. doi: 10.1038/s41467-019-12476-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Fernández V, Llinares-Benadero C, Borrell V. Cerebral cortex expansion and folding: what have we learned? EMBO J. 2016;35:1021–1044. doi: 10.15252/embj.201593701. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Friedland RP. Mechanisms of molecular mimicry involving the microbiota in neurodegeneration. J. Alzheimers Dis. 2015;45:349–362. doi: 10.3233/JAD-142841. [DOI] [PubMed] [Google Scholar]
- 38.Pandya M, Altinay M, Malone DA, Anand A. Where in the brain is depression? Curr. Psychiatry Rep. 2012;14:634–642. doi: 10.1007/s11920-012-0322-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hughes AN, Oxford JT. A lipid-rich gestational diet predisposes offspring to nonalcoholic fatty liver disease: a potential sequence of events. Hepat. Med. 2014;6:15–23. doi: 10.2147/HMER.S57500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Belalcázar AD, Ball JG, Frost LM, Valentovic MA, Wilkinson JT. Transsulfuration is a significant source of sulfur for glutathione production in human mammary epithelial cells. ISRN Biochem. 2014;2013:637897. doi: 10.1155/2013/637897. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Chu C, et al. The microbiota regulate neuronal function and fear extinction learning. Nature. 2019;574:543–548. doi: 10.1038/s41586-019-1644-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Swann JR, Spitzer SO, Diaz Heijtz R. Developmental signatures of microbiota-derived metabolites in the mouse brain. Metabolites. 2020;10:172. doi: 10.3390/metabo10050172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Santos-Marcos JA, et al. Sex differences in the gut microbiota as potential determinants of gender predisposition to disease. Mol. Nutr. Food Res. 2019;63:e1800870. doi: 10.1002/mnfr.201800870. [DOI] [PubMed] [Google Scholar]
- 44.Ahmed I, Greenwood R, Costello B, Ratcliffe N, Probert CS. Investigation of faecal volatile organic metabolites as novel diagnostic biomarkers in inflammatory bowel disease. Aliment Pharm. Ther. 2016;43:596–611. doi: 10.1111/apt.13522. [DOI] [PubMed] [Google Scholar]
- 45.Whiteson KL, et al. Breath gas metabolites and bacterial metagenomes from cystic fibrosis airways indicate active pH neutral 2,3-butanedione fermentation. ISME J. 2014;8:1247–1258. doi: 10.1038/ismej.2013.229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Raman M, et al. Fecal microbiome and volatile organic compound metabolome in obese humans with nonalcoholic fatty liver disease. Clin. Gastroenterol. Hepatol. 2013;11:868–875. doi: 10.1016/j.cgh.2013.02.015. [DOI] [PubMed] [Google Scholar]
- 47.Liu Z, et al. Ferulic acid increases intestinal Lactobacillus and improves cardiac function in TAC mice. Biomedicine Pharmacother. 2019;120:109482. doi: 10.1016/j.biopha.2019.109482. [DOI] [PubMed] [Google Scholar]
- 48.Nho K, et al. Altered bile acid profile in mild cognitive impairment and Alzheimer’s disease: Relationship to neuroimaging and CSF biomarkers. Alzheimers Dement. 2019;15:232–244. doi: 10.1016/j.jalz.2018.08.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Alexeev EE, et al. Microbiota-Derived Indole Metabolites Promote Human and Murine Intestinal Homeostasis through Regulation of Interleukin-10 Receptor. Am. J. Pathol. 2018;188:1183–1194. doi: 10.1016/j.ajpath.2018.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Venkatesh M, et al. Symbiotic bacterial metabolites regulate gastrointestinal barrier function via the xenobiotic sensor PXR and Toll-like receptor 4. Immunity. 2014;41:296–310. doi: 10.1016/j.immuni.2014.06.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Mesnage, R. et al. Shotgun metagenomics and metabolomics reveal glyphosate alters the gut microbiome of Sprague-Dawley rats by inhibiting the shikimate pathway. bioRxiv, 870105, 10.1101/870105 (2019).
- 52.Matsumoto M, et al. Cerebral low-molecular metabolites influenced by intestinal microbiota: a pilot study. Front Syst. Neurosci. 2013;7:9. doi: 10.3389/fnsys.2013.00009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Hertel J, et al. Integrated analyses of microbiome and longitudinal metabolome data reveal microbial-host interactions on sulfur metabolism in Parkinson’s disease. Cell Rep. 2019;29:1767–1777. doi: 10.1016/j.celrep.2019.10.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Luan H, et al. LC-MS-based urinary metabolite signatures in idiopathic Parkinson’s disease. J. Proteome Res. 2015;14:467–478. doi: 10.1021/pr500807t. [DOI] [PubMed] [Google Scholar]
- 55.Erny D, et al. Host microbiota constantly control maturation and function of microglia in the CNS. Nat. Neurosci. 2015;18:965–977. doi: 10.1038/nn.4030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.O’Connor J, et al. Lipopolysaccharide-induced depressive-like behavior is mediated by indoleamine 2,3-dioxygenase activation in mice. Mol. Psychiatry. 2009;14:511–522. doi: 10.1038/sj.mp.4002148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Valdés L, et al. The relationship between phenolic compounds from diet and microbiota: impact on human health. Food Funct. 2015;6:2424–2439. doi: 10.1039/C5FO00322A. [DOI] [PubMed] [Google Scholar]
- 58.De Angelis M, et al. Fecal microbiota and metabolome of children with autism and pervasive developmental disorder not otherwise specified. PLOS ONE. 2013;8:e76993. doi: 10.1371/journal.pone.0076993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Braniste V, et al. The gut microbiota influences blood-brain barrier permeability in mice. Sci. Transl. Med. 2014;6:263ra158–263ra158. doi: 10.1126/scitranslmed.3009759. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Angelino, D. et al. 5-(Hydroxyphenyl)-γ-Valerolactone-Sulfate, a Key Microbial Metabolite of Flavan-3-ols, Is Able to Reach the Brain: Evidence from Different in Silico, In Vitro and In Vivo Experimental Models. Nutrients11, 2678 (2019). [DOI] [PMC free article] [PubMed]
- 61.Bhatt AP, Redinbo MR, Bultman SJ. The role of the microbiome in cancer development and therapy. CA: A Cancer J. Clinicians. 2017;67:326–344. doi: 10.3322/caac.21398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Biernat KA, Li B, Redinbo MR. Microbial Unmasking of Plant Glycosides. mBio. 2018;9:e02433–02417. doi: 10.1128/mBio.02433-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Wikoff WR, et al. Metabolomics analysis reveals large effects of gut microflora on mammalian blood metabolites. Proc. Natl Acad. Sci. 2009;106:3698–3703. doi: 10.1073/pnas.0812874106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Evenepoel, P., Meijers, B. K., Bammens, B. R. & Verbeke, K. Uremic toxins originating from colonic microbial metabolism. Kidney Int Suppl.114, S12–S19 (2009). [DOI] [PubMed]
- 65.Wang Z, et al. Gut flora metabolism of phosphatidylcholine promotes cardiovascular disease. Nature. 2011;472:57–63. doi: 10.1038/nature09922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Meng D, et al. Indole-3-lactic acid, a metabolite of tryptophan, secreted by Bifidobacterium longum subspecies infantis is anti-inflammatory in the immature intestine. Pediatr. Res. 2020;88:209–217. doi: 10.1038/s41390-019-0740-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Tautenhahn R, Patti GJ, Rinehart D, Siuzdak G. XCMS online: a web-based platform to process untargeted metabolomic data. Anal. Chem. 2012;84:5035–5039. doi: 10.1021/ac300698c. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.R: A language and environment for statistical computing (R Foundation for Statistical Computing, Vienna, Austria, 2010).
- 69.Bruce, P., Bruce, A. & Gedeck, P. Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python. (O’Reilly Media, 2020).
- 70.Junot C, Fenaille F, Colsch B, Bécher F. High resolution mass spectrometry based techniques at the crossroads of metabolic pathways. Mass Spectrom. Rev. 2014;33:471–500. doi: 10.1002/mas.21401. [DOI] [PubMed] [Google Scholar]
- 71.Tsugawa H, et al. MS-DIAL: data-independent MS/MS deconvolution for comprehensive metabolome analysis. Nat. Methods. 2015;12:523–526. doi: 10.1038/nmeth.3393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Wang M, et al. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat. Biotechnol. 2016;34:828–837. doi: 10.1038/nbt.3597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Horai H, et al. MassBank: a public repository for sharing mass spectral data for life sciences. J. Mass Spectrom. 2010;45:703–714. doi: 10.1002/jms.1777. [DOI] [PubMed] [Google Scholar]
- 74.Sawada Y, et al. RIKEN tandem mass spectral database (ReSpect) for phytochemicals: a plant-specific MS/MS-based data resource and database. Phytochemistry. 2012;82:38–45. doi: 10.1016/j.phytochem.2012.07.007. [DOI] [PubMed] [Google Scholar]
- 75.Sumner LW, et al. Proposed minimum reporting standards for chemical analysis Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI) Metabolomics. 2007;3:211–221. doi: 10.1007/s11306-007-0082-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Schrimpe-Rutledge AC, Codreanu SG, Sherrod SD, McLean JA. Untargeted Metabolomics Strategies-Challenges and Emerging Directions. J. Am. Soc. Mass Spectrom. 2016;27:1897–1905. doi: 10.1007/s13361-016-1469-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The relevant metabolomics raw data and master alignment ion feature tables generated for this study have been deposited in the National Metabolomics Data Repository (accession IDs: ST001756, ST001757, and ST001758) with Project No. PR001126 (10.21228/M8569X). All source data for figures and tables of this work are available online at Zenodo (10.5281/zenodo.5016278) and/or GitHub (https://github.com/darciliz/nc_hcm_code); requests for additional information or data can be addressed to the corresponding author. Experimental and in silico mass spectral databases used for postcuration/validation purposes included MassBank of North America (MoNA) (https://mona.fiehnlab.ucdavis.edu/), Metlin (https://metlin.scripps.edu/), HMDB (https://hmdb.ca/), and mzCloud (https://www.mzcloud.org/). Source data are provided with this paper.
R Codes and associated data for reproducing part of the figures in this study have been deposited at GitHub (https://github.com/darciliz/nc_hcm_code). Unless noted otherwise, all other analyses including ChemRICH and MetaMapp were done using web-based portal and visualized in standalone GUI software such as Adobe Illustrator (version 2020; 2021) and CytoScape (version 2.7.2 for Window 10 OS and version 3.8.0 for macOS).