ABSTRACT
Phytoplankton transform inorganic carbon into thousands of biomolecules that represent an important pool of fixed carbon, nitrogen, and sulfur in the surface ocean. Metabolite production differs between phytoplankton, and the flux of these molecules through the microbial food web depends on compound-specific bioavailability to members of a wider microbial community. Yet relatively little is known about the diversity or concentration of metabolites within marine plankton. Here, we compare 313 polar metabolites in 21 cultured phytoplankton species and in natural planktonic communities across environmental gradients to show that bulk community metabolomes reflect the chemical composition of the phytoplankton community. We also show that groups of compounds have similar patterns across space and taxonomy, suggesting that the concentrations of these compounds in the environment are controlled by similar sources and sinks. We quantify several compounds in the surface ocean that represent substantial understudied pools of labile carbon. For example, the N-containing metabolite homarine was up to 3% of particulate carbon and is produced in high concentrations by cultured Synechococcus, and S-containing gonyol accumulated up to 2.5 nM in surface particles and likely originates from dinoflagellates or haptophytes. Our results show that phytoplankton composition directly shapes the carbon composition of the surface ocean. Our findings suggest that in order to access these pools of bioavailable carbon, the wider microbial community must be adapted to phytoplankton community composition.
IMPORTANCE Microscopic phytoplankton transform 100 million tons of inorganic carbon into thousands of different organic compounds each day. The structure of each chemical is critical to its biological and ecosystem function, yet the diversity of biomolecules produced by marine microbial communities remained mainly unexplored, especially small polar molecules which are often considered the currency of the microbial loop. Here, we explore the abundance and diversity of small biomolecules in planktonic communities across ecological gradients in the North Pacific and within 21 cultured phytoplankton species. Our work demonstrates that phytoplankton diversity is an important determinant of the chemical composition of the highly bioavailable pool of organic carbon in the ocean, and we highlight understudied yet abundant compounds in both the environment and cultured organisms. These findings add to understanding of how the chemical makeup of phytoplankton shapes marine microbial communities where the ability to sense and use biomolecules depends on the chemical structure.
KEYWORDS: phytoplankton, metabolomics, North Pacific, HILIC, homarine, trigonelline, gonyol, diatoms, microbial ecology, microbial loop
INTRODUCTION
In the ocean, the molecular makeup of organic carbon shapes its journey through the global carbon cycle. Phytoplankton fix approximately 100 million tons of carbon on a daily basis (1), roughly equivalent to half the total biomass of humans on earth (2). Each day, the microbial community respires about half of this carbon through the microbial loop (3). Approaches analyzing gene expression suggest freshly fixed small biomolecules, or metabolites, are among the most bioavailable in the surface ocean and represent a substantial conduit of carbon and energy flux. Much of the chemical complexity in phytoplankton-derived organic matter remains poorly described both qualitatively and quantitatively, particularly the highly labile portion of organic matter encompassing small polar metabolites. Here, we characterize the small molecules within particulate organic matter in natural marine microbial communities in the North Pacific and cultures of 21 phytoplankton species to show that the chemical character of the bulk carbon pool in the ocean reflects the taxonomy of the primary producers present.
Small polar metabolites can be major carbon, nutrient, and/or energy sources for heterotrophs (4, 5) and are often considered the currency of the microbial loop in the ocean. Beyond this, they can maintain phytoplankton-bacterial interactions (6, 7), serve as micronutrients (8–10), manage redox stress (11), fuel nitrogen fixation (12), act as chemical defenses (13, 14), and more. The comprehensive analysis of the metabolites in a system (metabolomics) is a nascent field and analytically challenging in environmental settings (15–17). Metabolomic studies are being used to investigate physiological changes in marine organisms under laboratory conditions (4, 18–22), though the same techniques have only recently been applied to whole communities in natural environments (22). Existing community marine metabolomic studies have employed targeted approaches in which the compounds detected are chosen by the analyst (5, 12, 21, 23, 24) or the analytical techniques employed preclude the observation of small, highly polar compounds (25, 26).
The chemical makeup of small polar compounds in freshly fixed organic matter influences the flux of carbon and energy through the microbial loop in the surface ocean. Here, we determine the metabolite pools in natural marine communities across space to explore the distributions of both known and unknown compounds. We compare our field observations to metabolomes of cultured marine primary producers from a broad taxonomic range and show how primary producers play an active role in shaping the chemical environment of the surface ocean. Finally, we highlight small polar compounds that may serve as potentially significant conduits of energy and nutrients in marine systems.
RESULTS AND DISCUSSION
Patterns of metabolites across space and taxonomy.
Using an established untargeted metabolomics approach, we obtained a list of mass features (liquid chromatography-mass spectrometry [LC-MS]-derived peaks of a particular mass and time) from marine particles. We dereplicated these mass features for adducts and isotopes, performed extensive quality control and blank comparisons, and visually inspected the resulting mass features. Through this process, we obtained a curated list of mass features, which we refer to as metabolites throughout the paper. Before statistical analyses, peak areas of each metabolite were normalized for instrument-derived obscuring variability and volume sampled as noted in Materials and Methods. To examine abundance patterns of compounds, we normalized peak areas to the total observed peak area in each sample set. See Materials and Methods for details on each step.
We explored the patterns of metabolite abundances in three sample sets of marine particles in the North Pacific Ocean: one surface meridional transect and two depth profiles (Fig. 1A; see also Table S1 in the supplemental material). In the transect sample set, seven general patterns emerged across latitude using a k-medoids clustering approach of 313 metabolites (Fig. 1B). The most common pattern (40% of compounds) showed a modest increase in concentration with latitude (mode a in Fig. 1B). This is likely related to the general increase in biomass with latitude (Fig. S1A and B and as seen in the increase of chlorophyll in Fig. 1A). Many compounds (30%) had their highest concentration in samples from 33 or 34°N (Fig. 1B, modes c to f). About 19% of metabolites did not have a clear pattern with latitude (Fig. 1B, mode b), while 8% of the compounds were generally more abundant in the southern samples than the northern samples (Fig. 1B, mode g).
In a nonmetric multidimensional scaling (NMDS) analysis where each metabolite was treated with equal weight, there was a distinct shift in metabolite patterns in the samples on either side of approximately 30°N (Fig. S2A, analysis of similarity [ANOSIM] stat = 0.316, P = 0.005). This corresponds well with the southern boundary of the North Pacific Transition Zone (NPTZ), a well described oceanographic feature which extends from Japan to North America and arises from large-scale ocean circulation (27, 28). The northern and southern edges of this transition zone comprise rapid changes in thermohaline structure and biological species composition (27, 29, 30). We saw a similar stark transition within the metabolite pools that reiterate the transition from the warm, oligotrophic North Pacific Subtropical Gyre (NPSG) into the colder, more nutrient-replete North Pacific Transition Zone (NPTZ) where chlorophyll concentrations, Synechococcus, and picoeukaryote assemblages flourish (Fig. 1 and Fig. S1C to E; see Table S2 for oceanographic conditions). Interestingly, the difference between metabolite profiles within the northern samples encompasses a much wider range in multivariate space, even within samples collected at the same time and location (biological replicates, Fig. S2A). This suggests the NPTZ is more heterogeneous in its metabolite profiles than the NPSG and is supported by the observed high variability in particulate carbon (PC) at northern sampling sites (Fig. S1A).
We performed the same clustering technique (k-medoids) on the same metabolites (when observed) within two depth profiles: one from the NPSG and one from the NPTZ (Fig. 1A for locations). Most of the metabolite concentrations decreased with depth, again corresponding with a decrease of PC (Fig. 1C, modes a to f, and Fig. 1D, modes a to c). The extent of decrease in concentrations varied among metabolites, exemplified in the NPTZ depth profile by comparing the sharply attenuating mode c to the gentler attenuation of mode a (Fig. 1C). Modes a in both NPTZ and NPSG depth profiles follow the PC pattern closely, but the other modes do not follow the same trend as bulk PC with depth (Fig. 1C and D; data in Table S9 at https://doi.org/10.5061/dryad.brv15dv8s). A minority of compounds in both of these depth profiles either had a subsurface maximum or had no clear relationship with depth (Fig. 1C, mode g, and Fig. 1D, mode d).
Using the 313 metabolites from the transect sample set as a template, we searched for the same compounds within metabolomes of 21 species of axenic phytoplankton grown under controlled conditions and analyzed on the same instrumental setup (5) (Table S3; also see Table S7 at https://doi.org/10.5061/dryad.brv15dv8s). Phytoplankton are the primary source of fixed carbon to the surface ocean, and our cultures were grown under conditions that support autotrophic growth so that we could interrogate the metabolite pools these organisms produce de novo from inorganic components. The cultures explored here encompass a wide taxonomic range from picocyanobacteria that dominate much of our transect (Fig. S1) to members of ubiquitous eukaryotic phytoplankton lineages like diatoms and coccolithophores. The taxonomic groups were recapitulated after a multivariate analysis of the metabolites across this data set in a semiquantitative manner, using both NMDS (Fig. S2B) and k-medoids clustering (Fig. 2A).
Overall, we saw that 17% (52) of the 313 metabolites were present in most of the cultured organisms with 44 of the metabolites within this mode observed in over 80% of the phytoplankton species (mode a in Fig. 2A). This suggests a set of compounds observable in most phytoplankton when analyzed under our analytical conditions. We were able to identify most of these compounds (33/52, 64%), which include many amino acids, primary metabolites, and nucleic acids (see Tables S5 and S6 at https://doi.org/10.5061/dryad.brv15dv8s). The remaining 36% of compounds within this mode could not be identified, demonstrating that even the compounds critical to the physiology and biochemistry of a broad swath of marine primary producers remain elusive. Another 39% (123) of metabolites were seen primarily in subsets of organisms, separating into five modes (modes b to f in Fig. 2A). Finally, about 44% of the 313 metabolites were either rarely or never observed in our cultures (mode g and “not observed” in Fig. 2A).
The patterns of metabolites across the cultures suggest suites of compounds that are closely associated with taxonomic groups of organisms. Several identified metabolites in these groups corroborate previous work showing that certain types of organisms produce high concentrations of particular small molecules. For instance, 2,3-dihydroxypropane-1-sulfonate (DHPS) and isethionate are within the mode of metabolites associated with diatoms (mode d) (5, 7), taurine is associated with dinoflagellates and haptophytes (mode f) (5), and glucosylglycerol is associated with cyanobacteria (31) (mode c, Fig. 2A; see also Table S6 at https://doi.org/10.5061/dryad.brv15dv8s). Most of the taxon-associated metabolites (72% of metabolites in modes b to f in Fig. 2A) are still unidentified and offer possible future taxon-specific biomarkers in the polar organic carbon pools.
Primary producers leave a metabolite signature in the environment.
Compounds with similar patterns across these data sets would suggest shared sources and sinks. To assess this, we tested whether each k-medoids-derived mode (within each sample set) was enriched in metabolites from a given mode from a different sample set, beyond what would be expected with a random assignment (assessed by a Monte Carlo-based bootstrapping approach, P value < 0.05). For example, of the 52 metabolites that were observed in most of our cultured phytoplankton (mode a in Fig. 2A), there was a robust enrichment of compounds from the meridional transect mode a (in Fig. 2C; general increase with latitude, P < 0.01; Fig. 2B). This may reflect a general increase in phytoplankton biomass with latitude as supported by the increase in PC and chlorophyll (Fig. S1A and Fig. 1A).
We capitalized on our results to search for enrichment between each mode in each sample set and visualized enriched connections among all sample sets in a network analysis where each connection is a statistically significant enrichment between two modes (P < 0.05 by Monte Carlo permutation, Fig. 3). This analysis revealed a few metaclusters, or groups of compounds that have similar patterns as each other across different spatial and taxonomic ranges (Fig. 3). These metaclusters suggest compounds have similar sources across taxonomy that persist across both latitudinal and depth gradients in the environment. Taxonomy here is represented by the modes derived from the culture data set, presented in Fig. 2A, and corresponding modes are depicted as dark blue markers in Fig. 3. Building on the observation of overenrichment of metabolites between transect data set mode a and culture data set mode a, we also saw that these modes share metabolites well beyond random assignment with the modes of metabolites in both depth profiles that attenuated in close proportion to PC with depth (modes a and b in NPTZ depth profile, mode a in NPSG depth profile, P < 0.05, Fig. 3). Identified compounds within this “core metabolome” metacluster included the amino acid glutamic acid, the nucleoside adenosine, the amino acid precursor homoserine, and several other primary metabolites (see Table S6 at https://doi.org/10.5061/dryad.brv15dv8s).
Beyond the “core metabolome,” 77% of the 30 compounds associated tightly with diatoms (mode d in Fig. 2A) were also overrepresented within the group of compounds with a general increase with latitude (P < 0.01, Fig. 2B). This pattern corresponded with an increase in fucoxanthin, a diatom biomarker observed to increase with latitude in a separate analysis from the same sampling period (Fig. S1F). The diatom-associated metabolites were also overrepresented in the medium-attenuating metabolites from the NPTZ depth profiles (mode b in Fig. 1C and Fig. 3). DHPS (a probable osmolyte produced in high concentrations by diatoms [5, 21]) and glycerophosphocholine (a headgroup of phosphatidylcholine lipids commonly produced by eukaryotic phytoplankton in marine systems [32]) sat within this pattern space as did 11 other unidentified compounds (see Table S6 at https://doi.org/10.5061/dryad.brv15dv8s).
Surprisingly, compounds tightly associated with dinoflagellates (mode e in Fig. 2A) showed a significant overrepresentation with an environmental distribution showing a distinct increase in concentration at 34.5°N (Fig. 2). Metabolites displaying these patterns were overrepresented in the sharply attenuating modes in the two depth profiles (dinoflagellate-associated metacluster in Fig. 3), in contrast to the metabolites found associated with diatoms. None of the compounds that reside in this interaction space could be identified, leaving room for future work to identify and leverage these compounds as possible biomarkers for dinoflagellates that are easily observable in the environment.
The group of compounds observed in our environmental samples but not observed in our culture data set (“not observed” mode in Fig. 2A) were overrepresented in compounds that were more abundant in the NPSG than the NPTZ in the transect (mode g in Fig. 2C) or increased with depth in the two depth profiles (rare metabolites metacluster in Fig. 3). We analyzed phytoplankton only in our initial analysis; therefore, it is likely that a subset of these compounds were produced by organisms we did not survey. For instance, the compound β-glutamic acid was found to be more abundant at depth than in the surface waters in both of our depth profiles, in contrast to the majority of compounds observed (Fig. S4 inset) and was absent from our phytoplankton cultures (see Table S5 at https://doi.org/10.5061/dryad.brv15dv8s). β-Glutamic acid is a major osmolyte in methanogenic archaea (33, 34), prompting us to search for this compound in Nitrosopumilus maritimus strain SCM1, a model species of Marine Group I Thaumarchaeota that are abundant in the ocean’s subsurface (35). We grew N. maritimus, analyzed its metabolome, and found β-glutamic acid as the most abundant identified metabolite, present at an intracellular concentration of 730 mM (see Table S6 at https://doi.org/10.5061/dryad.brv15dv8s).
It is likely that some compounds in this group were not made “freshly” by primary producers like phytoplankton or ammonia-oxidizing archaea but were rather a signature of reworked particulate matter. For example, the compound arsenobetaine, which we detected in all of our environmental samples, similarly to β-glutamic acid, generally increased with depth in the depth profiles (Fig. S4 inset). This compound is a by-product of heterotrophic degradation of phytoplankton-produced arsenometabolites (36) and would therefore necessitate a co-culture in order to be observed in a laboratory setting (as well as a growth medium with arsenic). Finally, it is likely that the cultures explored here were not producing all the compounds they are genetically able to produce—in previous laboratory experiments certain metabolites accumulate in cultures under specific environmental conditions and are not detectable under other environmental conditions (20, 21). If the production of certain compounds is variable or at rates below detection, we may not have seen them on our culture data.
Metabolites as a quantitative component of the bulk carbon pool.
We obtained absolute concentrations of the identified compounds to better understand the quantitative importance of these different metabolites within the particulate carbon landscape. The combined concentration of the identified metabolites (85 of the 313 total) ranged from 68 to 234 nM particulate carbon in the surface transect samples (Fig. S1B and Fig. S4; see Table S9 at https://doi.org/10.5061/dryad.brv15dv8s). This corresponds to 2.9% (±1.0%) to 5.2% (±1.4%) of the particulate carbon pool and 2.6% (±1.0%) to 8.2% (±2.4%) of the particulate nitrogen pool across this transect (see Table S9 at https://doi.org/10.5061/dryad.brv15dv8s). There was no clear pattern in the percentage of particulate carbon or nitrogen characterized by the quantifiable metabolites with latitude; this is likely confounded by the high variability in the particulate carbon and nitrogen measurements and the low geographical resolution of the metabolite sampling. In the NPTZ depth profile, we quantified 17 to 966 nM particulate carbon in the metabolite pool, corresponding to a rough estimate of 10% of the particulate carbon and nitrogen pools in the surface sample (see Table S9 at https://doi.org/10.5061/dryad.brv15dv8s). In the NPSG profile, we quantified approximately 3.7% of the total carbon pool in the surface sample (see Table S9 at https://doi.org/10.5061/dryad.brv15dv8s). The concentration of surface particulate metabolites was approximately two times higher than what we observed a year later in the NPSG (during the transect sampling; see Table S9 at https://doi.org/10.5061/dryad.brv15dv8s), likely due to the fact that the NPSG depth profile was sampled within an anticyclonic eddy with high surface primary productivity and particulate carbon (37).
Quantitatively, the environmental metabolite pools were dominated by a few abundant compounds, similar to previous work (12, 24). There were obvious differences in metabolite composition between the three environmental samplings (Fig. 4). For example, on a molar basis, glycine betaine (GBT) contributed to up to 17% of the quantified metabolite pool in samples below 125 m in the NPTZ, substantially more (on a mole fraction basis) than the other data sets. In contrast, the NPSG depth profile had high contributions from gonyol in the surface and guanine at depth.
We quantified the same molecules in the 21 species of phytoplankton and one species of Thaumarchaeota (Fig. 5, Fig. S5, and Table S3). Most of the abundant compounds in the environment were also found in high abundance in at least some of our cultures, though many of the most abundant compounds were not ubiquitously observed across the cultures (e.g., glycine betaine and sucrose; Fig. S5). We estimated the contribution of each metabolite to the carbon pool within each organism and compared this value to the surface samples of particulate metabolites in the field (Fig. S3). This comparison yielded a consistency suggesting most of the surface particles contain compounds that have not been heavily reworked, corroborating previous work looking at macromolecule pools (38), particularly in the compounds found within the “core metabolome” metacluster in Fig. 3 (Fig. S3). Comparing our environmental data sets to the culture data sets highlights compounds that were overrepresented in either the culture data sets or the environmental data set in a quantitative sense (Fig. S5). For example, common compounds guanine and creatine and less well studied compounds like isethionic acid and dimethylsulfonioacetate (DMS-Ac) were all higher on a per-carbon basis in the environment than in any of our cultures.
Homarine, an understudied metabolite of high abundance.
The metabolite homarine (N-methylpicolinic acid) was present at 0.6 to 67 nM in marine particles, represented up to 3% of the total PC pool in our transect samples, and was the most abundant compound measured in our data sets (Fig. 4 and 6 and Fig. S5; see Table S10 at https://doi.org/10.5061/dryad.brv15dv8s). We found these concentrations surprising both in their absolute abundance and compared to other more commonly studied polar metabolites known to accumulate in marine phytoplankton. For example, other studies have shown that homarine in marine particles is less abundant than the compatible solute glycine betaine (GBT) (12, 39), contrasting with our findings. Both homarine and GBT are zwitterionic nitrogenous betaines that likely serve (at least in part) as compatible solutes. We also detected trigonelline (N-methylnicotinic acid), an isomer of homarine, albeit at much lower concentrations (1 to 300 pM in transect samples; Fig. 6 and Fig. S5; see Table S10 at https://doi.org/10.5061/dryad.brv15dv8s). To our knowledge, trigonelline has not been previously detected in any marine samples, though it has been highlighted as an important component of labile carbon in terrestrial ecosystems due to its accumulation in higher plants (40).
In our cultured isolates, we detected homarine in both Synechococcus strains (intracellular concentration up to 400 mM), four of six surveyed diatoms (0.5 to 57 mM), and one strain of Emiliania huxleyi (a haptophyte, at 3.8 mM; Fig. 6D; see also Table S8 at https://doi.org/10.5061/dryad.brv15dv8s). Homarine has been observed in diatoms and E. huxleyi in previous studies (20, 41–45) but has not been associated with the ubiquitous marine cyanobacterium Synechococcus. We estimated that homarine was 4.8% of the particulate carbon within Synechococcus strain WH8102. Synechococcus has been estimated to contribute 10 to 20% of global ocean net primary production at approximately 8 Gt C per year (46); by extrapolation this suggests up to 0.5 to 1% global ocean net primary production could be attributed to Synechococcus through this one molecule, with potential for more production from diatoms. A caveat to this calculation is that homarine production is not quantitatively consistent among different strains of Synechococcus; Synechococcus WH7803 produced nearly 100 times less homarine (4 to 5 mM) under the same growing conditions (Fig. 6; see also Table S8 at https://doi.org/10.5061/dryad.brv15dv8s). This estimation is a first pass with the limited data at hand, and the sizable standing stock of homarine in our northern samples (about 2% of the PC) far exceeds what we would expect from the observed Synechococcus standing stock, which contributes less than 10% of the total PC pool (Fig. S1). Homarine had a clear attenuation with depth in both of our depth profiles (within mode b in the NPTZ and mode a in the NPSG in Fig. 1, shown in detail in Fig. 6B and C). All together, these data support active production and cycling of this compound in the surface ocean that has been previously unnoticed.
Homarine showed a clear spatial pattern along our transect with a nearly 10-times-higher abundance in the NPTZ (average 14.3 nM) than the NPSG (average 1.85 nM) (Fig. 6A), which we hypothesize is a result of the changing phytoplankton community and increasing prevalence of Synechococcus around 32°N (Fig. S1) and diatoms further north (indicated by increasing fucoxanthin around 34°N, Fig. S1F). Since Synechococcus standing stock cannot explain the observed homarine concentrations, we hypothesize that this compound may transfer to and accumulate in organisms beyond Synechococcus, which has been observed for osmolytes in other systems (45). Trigonelline followed a similar pattern along the transect, but with a less pronounced increase in concentration from the NPSG (average 0.07 nM) to the NPTZ (average 0.14 nM) that was shifted more northward (Fig. 6A). Homarine decreased sharply with depth in both depth profiles while trigonelline did not show appreciable attenuation in the NPSG profile (Fig. 6B and C).
Biochemically, the sources and sinks for homarine and trigonelline are likely distinct. Trigonelline is produced from nicotinic acid (47), while homarine is decarboxylated from quinolinic acid, which is produced from tryptophan (48), though the exact enzyme that performs the decarboxylation has not been characterized. The first step of bacterial trigonelline degradation is the opening of the aromatic ring by the TgnA/TgnB oxygenase system (40). This enzymatic machinery is unlikely to operate on homarine due to steric hindrance in the ring-opening step. Supporting the differential catabolism of homarine and trigonelline, we saw that the model marine heterotrophic bacterium Ruegeria pomeroyi DSS-3 was not able to grow on homarine as effectively as trigonelline (Fig. S6). Without characterized biosynthetic or degradation pathways for homarine, it is not surprising that this metabolite has not been identified as an important component of the labile organic carbon and nitrogen pools using gene-based techniques. Our spatial patterns and divergent observations of these compounds in our cultured organisms (Fig. 6D) support distinct biological sources for these structurally similar compounds, demonstrating the intricate networks that exist in microbial communities rooted in the substrate-matched metabolisms.
Our observations of trigonelline and homarine were possible because of the chromatography methodology we employed (hydrophilic interaction liquid chromatography [HILIC])—the compounds would not be resolved in time in more commonly employed reversed-phase (RP) chromatography (24, 26) due to their high polarity and same empirical formula (and therefore exact mass, Fig. 6E). This is also true for many sets of isomers of known compounds (e.g., sarcosine, β-alanine, and alanine; homoserine and threonine; β-glutamic acid and glutamic acid) as well as unknowns (e.g., inosine and another unidentified metabolite with the same m/z; two metabolites with an m/z of 236.1492; see Table S5 at https://doi.org/10.5061/dryad.brv15dv8s). We bring attention to this detail to highlight the power of incorporating cutting-edge analytical capabilities to study microbial ecology—without HILIC, we would not have been able to accurately measure many of the most abundant polar compounds.
Organic sulfur compounds.
Six of the top 30 most abundant compounds in our environmental samples were organic sulfur compounds. These compounds fall into two general categories: sulfoniums ([SR3]+) and sulfonates ([RSO3]–). We detected the well-studied sulfonium compound dimethylsulfoniopropionate (DMSP), though our methods likely underestimated the concentration due to compound instability in methanol-based extractions (49). Using our untargeted approach, we putatively identified two additional sulfonium compounds, dimethylsulfonioacetate (DMS-Ac) and 3-5-dimethylsulfonio-3-hydroxypentanoate (gonyol), as prominent peaks in our environmental samples. We later obtained standards that confirmed these identifications and enabled quantification that revealed gonyol as among our most abundant compounds with a particularly high concentration (up to 2.5 nM) in the NPSG depth profile (Fig. S7). Gonyol was named after the dinoflagellate Gonyaulax polyedra (50), and gonyol was present in high concentrations in all four dinoflagellate strains (81 to 196 mM) and in lower concentrations in the haptophytes (23 to 61 mM, Fig. S5 and S7; see also Table S6 at https://doi.org/10.5061/dryad.brv15dv8s). The environmental samples contained more DMS-Ac per unit carbon than culture samples, suggesting a source of this compound in the environment not reflected in the cultured phytoplankton (Fig. S3). Both of these compounds share structural similarity with DMSP and disrupt bacterial DMSP degradation pathways (51). Thus, predicting marine dimethyl sulfide (DMS) production from DMSP may be complicated by these highly abundant compounds. Although marine organic sulfur has gained much attention with regard to its massive inventory (52) and role in microbial processes (53), ours are the first observations of these understudied sulfoniums in natural marine systems.
Remaining unidentified compounds.
Many of the metabolites with interesting patterns across space and taxonomy could not be identified. For example, the mass feature I121.0684R10.7 has a m/z of 121.0684 and major peak in its MS2 fragmentation spectra of m/z 63.02703 (see Table S5 at https://doi.org/10.5061/dryad.brv15dv8s). This metabolite likely has the empirical formula of C5H12OS and was observed in 19 of the 21 phytoplankton species, attenuated with depth, and had a distinct maximum from 32 to 34°N in the meridional transect (see Table S5 at https://doi.org/10.5061/dryad.brv15dv8s). Unfortunately, none of the possible matches to these compounds have fragmentation data in the major mass spectral databases; without an identification, we cannot quantify this compound. It is very likely that within these unidentified compounds are more underappreciated compounds involved in the microbial loop—a fruitful endeavor for future oceanographers, mass spectrometrists, and biochemists alike.
Conclusions.
The work here explores the small molecules within marine particulate organic matter which contribute to the dissolved organic matter pool after excretion, cell lysis, or sloppy feeding. Once in a dissolved form, other organisms in the environment may be able to use these compounds as the substrates as sources for carbon, nutrients, and energy (54, 55), if they have the required enzymatic machinery to access these resources. These small molecules may also act as chemical attractants or deterrents for organisms and therefore assist in shaping microbial communities. By directly observing small molecules in both field particulate material and cultured phytoplankton, we show that small molecules in natural marine systems are determined in part by the taxonomy of the phytoplankton community. This suggests that to access these pools of labile organic carbon, the wider microbial community must be adapted to phytoplankton community composition. By quantitatively contextualizing our metabolomics data sets, we uncover a rich set of compounds that likely fuel the microbial loop that have been previously overlooked. Cycling of organic matter thus depends both on the amount of primary productivity and on phytoplankton composition—“who” matters on a chemical level.
MATERIALS AND METHODS
Environmental sample collection.
Samples were collected for environmental metabolomics of particulate material at locations shown in Fig. 1. Samples for the NPSG depth profile were collected aboard the R.V. Kilo Moana cruise KM1513 on 31 July 2015 from four depths (15, 45, 75, and 125 m); we reported on these samples in a previous publication (16). Samples for the meridional transect were collected on cruise KOK1606 aboard the R.V. Ka’imikai-O-Kanoloa from 20 April to 2 May 2016, all at approximately 15 m. Samples for the NPTZ depth profile were collected during MGL1704 aboard the R.V. Marcus Langseth at seven depths between 30 and 250 m on 3 June 2017. At each sampling location and depth, single, duplicate, or triplicate filters were collected for environmental metabolomics, as previously described (16), using either Niskin bottles or the uncontaminated underway seawater intake. Table S3 in the supplemental material has summarized descriptions of the samples collected for metabolomics, with full description of each sample (including time of collection) in Table S4 at https://doi.org/10.5061/dryad.brv15dv8s. In short, samples (4 to 15 liters each) were collected into polycarbonate carboys, filtered onto 147-mm 0.2-μm polytetrafluoroethylene (PTFE) filters using a peristaltic pump, flash frozen in liquid N2, and stored at −80°C until extraction. In addition to our samples, we filtered duplicates of methodological blanks by filtering seawater through two 0.2-μm PTFE filters in series and used the second filter as the blank. This blank is especially important to parse metabolite signals from contaminants as well as compounds within the residual dissolved pool and salt matrix adsorbed during filtration.
Pure cultures and sampling.
In addition to environmental samples, we analyzed metabolomes of cultured representatives of marine phytoplankton that were grown and analyzed on the same LC-MS system as previously presented (5). Media, light, and temperature were chosen for optimal growth of each species and are reported in reference 5. In short, axenic phytoplankton were cultured in controlled laboratory settings and harvested under exponential growth using a gentle vacuum filtration onto 47-mm Durapore filters (pore size, 0.2 μm). Samples were flash frozen in liquid N2 and stored at −80°C until extraction. In addition to samples, medium blanks corresponding to each medium type were harvested and served as matrix blank to each corresponding phytoplankton sample. In order to estimate intracellular concentrations of metabolites, we used biovolume estimates from reference 5.
We also grew Nitrosopumilus maritimus strain SCM1 and harvested it under exponential growth. Pure culture of Marine Group I Thaumarchaeota Nitrosopumilus maritimus strain SCM1 was maintained in liquid mineral medium with 1 mM ammonia (56) at 30°C in the dark without shaking. The growth of N. maritimus was monitored by measuring nitrite production and cell abundance. Nitrite concentration was determined spectrophotometrically using the Griess reagent (57). Cell counts were determined using Moviol-SYBR green I staining protocol as previously reported (58) with a Zeiss epifluorescence microscope to count 15 random fields of view for each sample with 30 to 200 cells per field. Mid-exponential-phase cells were harvested using a gentle vacuum filtration on 0.22-μm Durapore membrane filters (Millipore Co., MA, USA) and stored at −80°C until metabolite extractions. These archaea have a biovolume of approximately 0.023 μm3 (59).
We estimated carbon contents for all the cultures from cellular volume (60), using an empirical relationship between flow cytometry-based cell size and PC (61), or using previous direct measurements (61–63). An abbreviated sample description is given in Table S3; full sample descriptions are in Table S7 (at https://doi.org/10.5061/dryad.brv15dv8s, including carbon estimates and the method used for each species).
Additional oceanographic data.
Samples for particulate carbon were sampled and processed as in reference 64. Underway flow cytometry data were acquired and processed as in reference 61. Samples for pigment analysis were filtered onto GF/F filters (Whatman), stored in snap-cap tubes, wrapped in aluminum foil, and flash-frozen. Samples were analyzed for high-performance liquid chromatography (HPLC)-based measurements of total chlorophyll (monovinyl + divinyl), fucoxanthin, and other photosynthetic and photoprotective pigments. These analyses were made in the Oregon State University HPLC facility via a Waters 996 absorbance photodiode array detector in combination with a Waters 2475 fluorescence detector according to the protocol of reference 65.
Homarine bioavailability experiment.
To test if homarine was as bioavailable as trigonelline in marine systems, we cultured the model marine heterotrophic bacterium Ruegeria pomeroyi DSS-3 under different primary carbon sources and observed its cell density. DSS-3 was streaked to isolation on 1/2 YTSS agar plates (1.25 g tryptone, 2 g yeast extract, 10 g sea salts, 8 g agar per 500 ml MQ water) from frozen glycerol stocks at room temperature for 3 days. A single colony was inoculated into artificial L1-bac seawater medium (described below) supplemented with acetate (final concentration of 50 nM). This culture was grown overnight at room temperature at 200 rpm at 30°C. Next, a 96-well plate was prepared with 90 μl of fresh medium described above (without additional carbon) in all wells. In 8 wells, we added 5 μl of the overnight inoculum and 10 μl of water (no additional carbon treatment). In 8 wells, we added 15 μl of water and no inoculum (negative control). In the remaining wells, we added 5 μl of the overnight inoculum and 10 μl of either acetate, homarine, or trigonelline (all at 100 nM carbon, n = 8 for each, acetate serving as positive control). Plates were covered in a breathable sealing membrane (Breathe-Easy) and placed into a plate reader (BioTek Synergy H1MF). Cultures were grown at 30°C, shaken every 2 min for 3 s, and monitored via absorbance at 600 nm every 2 min (immediately after shaking).
Artificial L1-bac seawater medium was prepared using MQ water with 28 g Sigma sea salts, trace and macronutrients based on the recipe from the National Center for Marine Algae and Microbiota (without silica), nitrogen, and vitamins as in reference 18; Sigma M5550 MEM essential amino acids (1:1,000 dilution); and Sigma M7145 MEM nonessential amino acids (1:2,000 dilution). Salt water was autoclaved in combusted borosilicate glass containers, and all additions were made from filter-sterilized stocks. Final medium was filter sterilized using a 0.22-μm polyvinylidene difluoride (PVDF) membrane bottle top filter.
Metabolite data acquisition.
Metabolites were extracted as previously described (16). Briefly, filters were bead-beaten three times in 30-s bursts over 30 min (kept at −20°C between bursts) in 1:1:2 methanol-water-dichloromethane and separated into two fractions: a polar aqueous extract (methanol and water extractable) and an organic extract (dichloromethane extractable). We used the same internal standard suite at the same injection concentrations as in the work of Boysen et al. (16) to train normalization and monitor instrument stability. After drying under clean N2, all samples were reconstituted in 400 μl water.
The polar fraction of this extract was analyzed on both reversed-phase (RP) and hydrophilic interaction chromatography (HILIC) using the same solvents, columns, and gradients as previously reported (16). We diluted the KOK1606 samples (1 part sample to 2 parts water) and MGL1704 samples (1 part sample to 1 part water), which helped with signal stability over the course of the runs. Internal standards were added during the dilution step and were the same concentrations in all analyzed samples to aid in quantitative comparisons between sample sets. We injected 2 μl of sample onto the column for HILIC analysis and 5 μl (for environmental samples) or 15 μl (for culture samples) for RP analysis.
Both LC configurations (RP and HILIC) were analyzed on a Thermo Q-Exactive (QE) mass spectrometer in full scan mode for quantitative data or data-dependent acquisition (DDA) for fragmentation. Full scan analyses were conducted as in the work of Boysen et al. (16); pooled samples were run in DDA mode for MS2 fragmentation as described in the work of Heal et al. (20).
Metabolomic data processing.
To compare our field data with our culture data, we used our untargeted data from the meridional transect (36 surface samples) as our template to examine the other sample sets. To do this, we used an established untargeted metabolomics approach (detailed below) to acquire a list of curated, dereplicated, and high-quality mass features. With this curated list, we then searched for the same mass features in the remaining field and culture sample sets. This allowed us to compare relative abundances of these mass features within each sample set, with high confidence in the shared identity of these compounds between sample sets.
Untargeted metabolomics data from transect samples were converted with MS Convert (66) and processed through XMCS (67–69), using the same parameters for XCMS and methodological blank filtering as previously reported (20). Next, we normalized for obscuring variation (nonbiological variability inherent to LC-MS analysis) using B-MIS normalization (16). As in the work of Heal et al. (20), we disregarded peaks that did not demonstrate acceptable replicability in the pooled samples (coefficient of variance > 30%); we also removed peaks that showed greater average variability between biological replicates than over the whole sample set as in previous work (4).
In untargeted metabolomics, multiple mass features can correspond to one metabolite due to natural abundance isotopes, adducts, or multiply charged ions. As in the work of Heal et al. (20), to avoid putting extra statistical weight on these isotopes and adducts, we identified mass features that were likely 13C, 15N, or 34S isotopologues of other mass features. We extended this search to include adducts of Na+, NH4+, K+ (for positive ionization), and Cl– (for negative ionization), as well as for doubly charged ions of mass features whose M+H ion was present. We performed these searches within each 3-s (for RP) or 6-s (for HILIC) corrected retention time window and discarded these mass features from downstream statistical analyses.
For the largest 200 peaks in our HILIC analysis (positive and negative analyzed separately) and RP analysis, we exported the m/z and retention time information to Skyline (70) for closer inspection. XCMS peak picking algorithms assume a normal Gaussian shape for peaks (67–69), which often results in poor integrations for compounds that do not achieve this shape during chromatographic separation; these peaks are often removed during our coefficient of variance (CV) filter or manual peak quality verification. Therefore, we also imported a list of compounds we regularly target (see reference 16 for full list of standards) and manually integrated these compounds in each of the samples (first removing compounds that were picked during the peak picking step). In Skyline (70), we integrated these peaks (both the untargeted and known compounds) in the transect data set since XCMS often results in imperfect integrations that can introduce nonbiological variability to metabolite abundances (71). Next, we eliminated mass features that were not present in at least 50% of the transect samples and also removed peaks that were not (on average) three times larger than the matrix blank. These stringent filters in the transect data set allowed us to use a culled number of high-quality mass features that are common in surface seawater particles as a fingerprint of metabolite pools. In all, we obtained 149, 74, and 90 high-quality, manually integrated peaks in HILIC-positive ionization (HILICPos), HILIC-negative ionization (HILICNeg), and RP, respectively. For these quality mass features, we searched for corresponding MS2 scans in the data-dependent acquisition (DDA) files and applied a filter to remove low-abundance fragments in the exact manner as reported in reference 20.
With this list of high-quality mass features (referred to in the text as metabolites), we extracted the exact masses and integrated peaks at the same retention times in the two other environmental data sets (NPSG and NPTZ depth profiles) and the culture data sets. We also integrated our internal standards (in exact concentrations as in reference 16) and performed B-MIS normalization (16) across our environmental data sets, which minimizes the variability present during analysis (not biological variability). This resulted in adjusted areas of each compound in each sample that are quantitatively comparable within each sample set (but not between). Since the phytoplankton data sets are not in a consistent matrix and were analyzed in several different batches, we did not attempt to use B-MIS to normalize across the organisms. Instead, we kept the raw peak area, normalized it to the biovolume analyzed, and made semiquantitative comparisons on the log10-transformed biovolume-normalized peak areas. The log10 transformations ensure that only large differences are evaluated as contributing to variability between samples, well beyond matrix variability or instrument performance.
As in reference 20, we used the ranking system outlined in reference 72, to attempt to identify the quality mass features present in these sample sets in an automated fashion. We searched an internal database of compounds with known exact m/z and retention time on the LC-MS configurations used in the lab (found at https://github.com/IngallsLabUW/Ingalls_Standards), publicly available MS/MS2 spectral databases (73–77), and compounds in the KEGG database (78, 79) (based only on m/z).
Calculating concentrations.
Commercially available standards were analyzed in the same batch as each of the three environmental data sets. For this subset of compounds, we calculated absolute concentrations, similar to previous work (12, 21, 41). In short, we applied the following calculation for each analyte:
where RF is the response factor of each compound at known concentration in water. Standards are run before and after each run on each instrument; therefore, an RF for each compound is obtained within each batch. Volreconst is the volume into which the samples were reconstituted; Volfiltered is the volume filtered in the field (for environmental samples) or the total estimated biovolume collected (for culture samples); RFratio is the of these compounds in a matrix of marine particulates (as described in the work of Boysen et al. [16]); we calculated RFratio using samples from the transect data set which were applied throughout. We calculated an RFratio as in reference 16 using a representative environmental matrix sample. Values for Volfiltered for each sample are reported in Tables S4 and S7 (both at https://doi.org/10.5061/dryad.brv15dv8s); Volreconst was 400 μl for each sample.
Several compounds were identified in the transect data set and purchased and analyzed using the same LC-MS method at a later date, which we quantified using the same approach as in reference 41. Because the RF for each compound can vary substantially between analytical runs, we used a relative response factor (RFrelative) to estimate RF and calculate the concentrations of these compounds in earlier runs. To calculate RFrelative, we matched compounds with a standard that had been analyzed in all sample sets that share the same column, ionization state, and some structural similarity (matched standard). For instance, for the compound DMS-Ac, we matched it to another dimethylated sulfonium zwitterion, DMSP. After the samples were analyzed, we analyzed these new standards and the other standards on the same LC-MS setup as our sample set and calculated RFrelative using the following formula:
Then, we used this RFrelative and the RFmatched standard to calculate the concentration of the analyte from earlier runs. For a full explanation for how each compound was quantified in each sample set and for the matched standard used for each compound (when necessary), see Table S11 at https://doi.org/10.5061/dryad.brv15dv8s.
Statistical approaches.
For multivariate statistics on environmental samples, peak areas (adjusted via B-MIS for instrumental variability and normalized to water volume filtered) were standardized to the total peak area observed for each mass feature across each sample set. For each mass feature in the cultured organisms, log10-transformed peak areas were standardized to the maximum log10 peak areas observed across all cultured organisms. We used two different multidimensional approaches on these data sets, both nonmetric to accommodate for the high variable-to-sample ratio and nonnormal distribution of peak areas in our data sets. This prevents overfitting, which can be a problem in other multidimensional approaches in metabolomics (80). We used a nonmetric dimensional scaling (NMDS) analysis (81) based on a Euclidean distance matrix of standardized peak areas to visualize overall metabolic differences between samples along our transect samples. We assessed dimensionality of the NMDS by examining a scree plot and calculated the probability with a Monte Carlo permutation which resulted in a low-stress ordination. We accompanied this with an analysis of similarities (ANOSIMS) (82) to discern differences between the oceanographic regimes we sampled as well as time of sampling. Data transformation, standardization, NMDS, and ANOSIMS statistics were performed in R using the vegdist (v2.4-2) or vegan (v2.4-2) packages.
Next, we employed a k-medoids-based clustering approach (83), which aggregates metabolites based on patterns across samples. We performed this clustering on the combined culture data sets and on the three environmental sample sets separately (four total k-medoids analyses) using the clara function in the cluster package (v2.1.0) in R. This nonsupervised clustering technique is exclusive and nonhierarchical and assigns each mass feature into one cluster, or mode. The metabolites within each mode have similar patterns of abundance across samples. We chose the appropriate number of modes for each sample set by selecting the mode number that resulted in a local maximum average silhouette width between samples.
Finally, we investigated whether the resulting modes of metabolites from the four separate k-medoids analyses shared metabolites beyond a random assignment. Essentially, we asked if the patterns in the 2016 transect samples could be explained in part by patterns in metabolites across the available culture data or could be recapitulated in the depth profile sample sets. To test for overrepresented sharing of metabolites between modes, we used a Monte Carlo resampling technique to simulate the random frequency of shared metabolites using 1,000 permutations. We then compared the observed frequency of shared metabolites to the permutations to estimate the P value of our observed shared metabolites. To assign a compound to a metacluster (as noted in Table S6 at https://doi.org/10.5061/dryad.brv15dv8s), an individual compound must be assigned to the organism mode rooting each cluster and to at least 2 of the modes from the environmental samples within the metacluster.
Data availability.
For all data analysis, we used R v4.0.0. Codes for figures, tables, and data analysis are found at https://github.com/kheal/Gradients1_SemiTargeted3. Raw data for metabolomics samples are deposited at Metabolomics Workbench (https://www.metabolomicsworkbench.org/): cultures are project ID ST001514, meridional transect samples are project ID ST001410, NPSG depth profile is project ID ST001372, and NPTZ depth profile is project ID ST001409. Project IDs for environmental samples also listed in Table S4 at https://doi.org/10.5061/dryad.brv15dv8s.
ACKNOWLEDGMENTS
We thank Grace Workman, Natalie Kellogg, Regina Lionheart, Rhonda Morales, and Alexa Wied for assistance in growing or enumerating cultures, extracting environmental metabolomics samples, and preliminary data processing. We acknowledge the science team and crew of KM1314, KOK1606, and MGL1704 cruises (particularly Rachelle Lim and Ryan Grossmann) for sample collection. We thank G. Pohnert for the generous donation of gonyol and DMS-acetate standards. Cultures were provided by D. Stahl, S. Chisholm, A. Coe, M. Moran, and M. Saito.
This work was supported by grants from the Simons Foundation (385428 to A.E.I.; 426570 to E.V.A., A.E.I., A.E.W., and R.M.B.; 548565 to W.Q.; 598819 to K.R.H.) and the National Science Foundation (NSF GRFP to A.K.B. and K.R.H.; NSF OCE-PRF 1521564 to B.P.D., NSF IGERT Program on Ocean Change to A.K.B.).
Contributor Information
Anitra E. Ingalls, Email: aingalls@uw.edu.
Manuel Liebeke, Max Planck Institute for Marine Microbiology.
REFERENCES
- 1.Falkowski PG, Wilson C. 1992. Phytoplankton productivity in the North Pacific ocean since 1900 and implications for absorption of anthropogenic CO2. Nature 358:741–743. doi: 10.1038/358741a0. [DOI] [Google Scholar]
- 2.Walpole SC, Prieto-Merino D, Edwards P, Cleland J, Stevens G, Roberts I. 2012. The weight of nations: an estimation of adult human biomass. BMC Public Health 12:439. doi: 10.1186/1471-2458-12-439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Azam F, Fenchel T, Field JG, Gray JS, Meyer-Reil LA, Thingstad F. 1983. The ecological role of water-column microbes in the sea. Mar Ecol Prog Ser 10:257–263. doi: 10.3354/meps010257. [DOI] [Google Scholar]
- 4.Johnson WM, Kido Soule MC, Kujawinski EB. 2016. Evidence for quorum sensing and differential metabolite production by a marine bacterium in response to DMSP. ISME J 10:2304–2316. doi: 10.1038/ismej.2016.6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Durham BP, Boysen AK, Carlson LT, Groussman RD, Heal KR, Cain KR, Morales RL, Coesel SN, Morris RM, Ingalls AE, Armbrust EV. 2019. Sulfonate-based networks between eukaryotic phytoplankton and heterotrophic bacteria in the surface ocean. Nat Microbiol 4:1706–1715. doi: 10.1038/s41564-019-0507-5. [DOI] [PubMed] [Google Scholar]
- 6.Amin SA, Hmelo LR, van Tol HM, Durham BP, Carlson LT, Heal KR, Morales RL, Berthiaume CT, Parker MS, Djunaedi B, Ingalls AE, Parsek MR, Moran MA, Armbrust EV. 2015. Interaction and signalling between a cosmopolitan phytoplankton and associated bacteria. Nature 522:98–101. doi: 10.1038/nature14488. [DOI] [PubMed] [Google Scholar]
- 7.Durham BP, Sharma S, Luo H, Smith CB, Amin SA, Bender SJ, Dearth SP, Van Mooy BAS, Campagna SR, Kujawinski EB, Armbrust EV, Moran MA. 2015. Cryptic carbon and sulfur cycling between surface ocean plankton. Proc Natl Acad Sci U S A 112:453–457. doi: 10.1073/pnas.1413137112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bertrand EM, McCrow JP, Moustafa A, Zheng H, McQuaid JB, Delmont TO, Post AF, Sipler RE, Spackeen JL, Xu K, Bronk DA, Hutchins D, Allen AE. 2015. Phytoplankton-bacterial interactions mediate micronutrient colimitation at the coastal Antarctic sea ice edge. Proc Natl Acad Sci U S A 112:9938–9943. doi: 10.1073/pnas.1501615112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Heal KR, Qin W, Ribalet F, Bertagnolli AD, Coyote-Maestas W, Hmelo LR, Moffett JW, Devol AH, Armbrust EV, Stahl DA, Ingalls AE. 2017. Two distinct pools of B12 analogs reveal community interdependencies in the ocean. Proc Natl Acad Sci U S A 114:364–369. doi: 10.1073/pnas.1608462114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Paerl RW, Sundh J, Tan D, Svenningsen SL, Hylander S, Pinhassi J, Andersson AF, Riemann L. 2018. Prevalent reliance of bacterioplankton on exogenous vitamin B1 and precursor availability. Proc Natl Acad Sci U S A 115:E10447–E10456. doi: 10.1073/pnas.1806425115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Sunda W, Kieber DJ, Kiene RP, Huntsman S. 2002. An antioxidant function of DMSP and DMS in marine algae oceanic dimethylsulfide (DMS) photolysis. Nature 418:317–320. doi: 10.1038/nature00851. [DOI] [PubMed] [Google Scholar]
- 12.Boysen AK, Carlson LT, Durham BP, Groussman RD, Aylward FO, Ribalet F, Heal KR, DeLong EF, Armbrust EV, Ingalls AE. 2020. Diel oscillations of particulate metabolites reflect synchronized microbial activity in the North Pacific Subtropical Gyre. bioRxiv doi: 10.1101/2020.05.09.086173. [DOI]
- 13.Slattery M, Hamann MT, McClintock JB, Perry TL, Puglisi MP, Yoshida WY. 1997. Ecological roles for water-borne metabolites from Antarctic soft corals. Mar Ecol Prog Ser 161:133–144. doi: 10.3354/meps161133. [DOI] [Google Scholar]
- 14.Sheng J, Malkiel E, Katz J, Adolf JE, Place AR. 2010. A dinoflagellate exploits toxins to immobilize prey prior to ingestion. Proc Natl Acad Sci U S A 107:2082–2087. doi: 10.1073/pnas.0912254107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Kido Soule MC, Longnecker K, Johnson WM, Kujawinski EB. 2015. Environmental metabolomics: analytical strategies. Mar Chem 177:374–387. doi: 10.1016/j.marchem.2015.06.029. [DOI] [Google Scholar]
- 16.Boysen A, Heal K, Carlson L, Ingalls A. 2018. Best-matched internal standard normalization in liquid chromatography-mass spectrometry metabolomics applied to environmental samples. Anal Chem 90:1363–1369. doi: 10.1021/acs.analchem.7b04400. [DOI] [PubMed] [Google Scholar]
- 17.Fassbender AJ, Palevsky HI, Martz T, Ingalls AE, Gledhill M, Fawcett S, Brandes JA, Aluwihare LI, Aluwihare L, Anderson R, Bender S, Boyle E, Brandes J, Bronk D, Buesseler K, Burdige D, Casciotti K, Close H, Conte M, Cutter G, Estapa M, Fassbender A, Fennel K, Ferron S, Glazer B, Goni M, Grand M, Guay C, Hatta M, Hayes C, Horner T, Ingall E, Ingalls A, Johnson K, Juranek L, Knapp AN, Lam P, Luther G, Matrai P, Measures C, Nicholson D, Paytan A, Pellenbarg R, Popendorf K, Reddy C, Ruttenberg K, Sabine C, Sansone F, Shaltout N, Sikes E, Sundquist E, Valentine D, Wang ZA, Wilson S, Barrett P, Behrens M, Belcher A, Biermann L, Boiteau R, Clarke J, Collins J, Coppola A, Ebling A, Garcia-Tigreros F, Goldman J, Guallart EF, Haskell W, Hurley S, Janssen D, Johnson W, Lennartz S, Liu S, Palevsky H, Rahman S, Ray D, Sarkar A, Steiner Z, Widner B, Yang B. 2017. Perspectives on chemical oceanography in the 21st century: participants of the COME ABOARD Meeting examine aspects of the field in the context of 40 years of DISCO. Mar Chem 196:181–190. doi: 10.1016/j.marchem.2017.09.002. [DOI] [Google Scholar]
- 18.Durham BP, Dearth SP, Sharma S, Amin SA, Smith CB, Campagna SR, Armbrust EV, Moran MA. 2017. Recognition cascade and metabolite transfer in a marine bacteria-phytoplankton model system. Environ Microbiol 19:3500–3513. doi: 10.1111/1462-2920.13834. [DOI] [PubMed] [Google Scholar]
- 19.Kujawinski EB, Longnecker K, Alexander H, Dyhrman ST, Fiore CL, Haley ST, Johnson WM. 2017. Phosphorus availability regulates intracellular nucleotides in marine eukaryotic phytoplankton. Limnol Oceanogr 2:119–129. doi: 10.1002/lol2.10043. [DOI] [Google Scholar]
- 20.Heal KR, Kellogg NA, Carlson LT, Lionheart RM, Ingalls AE. 2019. Metabolic consequences of cobalamin scarcity in the diatom Thalassiosira pseudonana as revealed through metabolomics. Protist 170:328–348. doi: 10.1016/j.protis.2019.05.004. [DOI] [PubMed] [Google Scholar]
- 21.Dawson HM, Heal KR, Boysen AK, Carlson LT, Ingalls AE, Young JN. 2020. Potential of temperature- and salinity-driven shifts in diatom compatible solute concentrations to impact biogeochemical cycling within sea ice. Elementa Sci Anthropocene 8:25. doi: 10.1525/elementa.421. [DOI] [Google Scholar]
- 22.Kuhlisch C, Althammer J, Sazhin AF, Jakobsen HH, Nejstgaard JC, Pohnert G. 2020. Metabolomics-derived marker metabolites to characterize Phaeocystis pouchetii physiology in natural plankton communities. Sci Rep 10:20444. doi: 10.1038/s41598-020-77169-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Muratore D, Boysen AK, Harke MJ, Becker KW, Casey JR, Coesel SN, Mende DR, Wilson ST, Aylward FO, Eppley JM, Visolova A, Peng S, Rodriguez-Gonzalez RA, Beckett SJ, Armbrust EV, DeLong EF, Karl DM, White AE, Zehr JP, Van Mooy BAS, Dyhrman ST, Ingalls AE, Weitz JS. 2020. Community-scale synchronization and temporal partitioning of gene expression, metabolism, and lipid biosynthesis in oligotrophic ocean surface waters. bioRxiv doi: 10.1101/2020.05.15.098020. [DOI]
- 24.Johnson WM, Longnecker K, Kido Soule MC, Arnold WA, Bhatia MP, Hallam SJ, Van Mooy BAS, Kujawinski EB. 2020. Metabolite composition of sinking particles differs from surface suspended particles across a latitudinal transect in the South Atlantic. Limnol Oceanogr 65:111–127. doi: 10.1002/lno.11255. [DOI] [Google Scholar]
- 25.Llewellyn CA, Sommer U, Dupont CL, Allen AE, Viant MR. 2015. Using community metabolomics as a new approach to discriminate marine microbial particulate organic matter in the western English Channel. Prog Oceanogr 137:421–433. doi: 10.1016/j.pocean.2015.04.022. [DOI] [Google Scholar]
- 26.Petras D, Koester I, Da Silva R, Stephens BM, Haas AF, Nelson CE, Kelly LW, Aluwihare LI, Dorrestein PC. 2017. High-resolution liquid chromatography tandem mass spectrometry enables large scale molecular characterization of dissolved organic matter. Front Mar Sci 4:405. doi: 10.3389/fmars.2017.00405. [DOI] [Google Scholar]
- 27.Roden GI. 1991. Subarctic-subtropical transition zone of the North Pacific: large-scale aspects and mesoscale structure, p 1–38. In Wetherall JA (ed), NOAA technical report NMFS 105: biology, oceanography, and fisheries of the North Pacific transition zone and subarctic frontal zone. National Oceanic and Atmospheric Administration, Silver Spring, MD. [Google Scholar]
- 28.Gradoville MR, Farnelid H, White AE, Turk-Kubo KA, Stewart B, Ribalet F, Ferrón S, Pinedo-Gonzalez P, Armbrust EV, Karl DM, John S, Zehr JP. 2020. Latitudinal constraints on the abundance and activity of the cyanobacterium UCYN-A and other marine diazotrophs in the North Pacific. Limnol Oceanogr 65:1858–1875. doi: 10.1002/lno.11423. [DOI] [Google Scholar]
- 29.Polovina JJ, Howell EA, Kobayashi DR, Seki MP. 2017. The transition zone chlorophyll front updated: advances from a decade of research. Prog Oceanogr 150:79–85. doi: 10.1016/j.pocean.2015.01.006. [DOI] [Google Scholar]
- 30.Juranek LW, Quay PD, Feely RA, Lockwood D, Karl DM, Church MJ. 2012. Biological production in the NE Pacific and its influence on air-sea CO2 flux: evidence from dissolved oxygen isotopes and O2/Ar. J Geophys Res 117:C05022. doi: 10.1029/2011JC007450. [DOI] [Google Scholar]
- 31.Klähn S, Steglich C, Hess WR, Hagemann M. 2010. Glucosylglycerate: a secondary compatible solute common to marine cyanobacteria from nitrogen-poor environments. Environ Microbiol 12:83–94. doi: 10.1111/j.1462-2920.2009.02045.x. [DOI] [PubMed] [Google Scholar]
- 32.Van Mooy BAS, Fredricks HF. 2010. Bacterial and eukaryotic intact polar lipids in the eastern subtropical South Pacific: water-column distribution, planktonic sources, and fatty acid composition. Geochim Cosmochim Acta 74:6499–6516. doi: 10.1016/j.gca.2010.08.026. [DOI] [Google Scholar]
- 33.Robertson DE, Roberts MF, Belay N, Stetter KO, Boone DR. 1990. Occurrence of β-glutamate, a novel osmolyte, in marine methanogenic bacteria. Appl Environ Microbiol 56:1504–1508. doi: 10.1128/AEM.56.5.1504-1508.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Robertson DE, Lesage S, Roberts MF. 1989. β-Aminoglutaric acid is a major soluble component of Methanococcus thermolithotrophicus. Biochim Biophys Acta 992:320–326. doi: 10.1016/0304-4165(89)90091-3. [DOI] [PubMed] [Google Scholar]
- 35.Karner MB, DeLong EF, Karl DM. 2001. Archaeal dominance in the mesopelagic zone of the Pacific Ocean. Nature 409:507–510. doi: 10.1038/35054051. [DOI] [PubMed] [Google Scholar]
- 36.Duncan EG, Maher WA, Foster SD. 2015. Contribution of arsenic species in unicellular algae to the cycling of arsenic in marine ecosystems. Environ Sci Technol 49:33–50. doi: 10.1021/es504074z. [DOI] [PubMed] [Google Scholar]
- 37.Wilson ST, Aylward FO, Ribalet F, Barone B, Casey JR, Connell PE, Eppley JM, Ferron S, Fitzsimmons JN, Hayes CT, Romano AE, Turk-Kubo KA, Vislova A, Armbrust EV, Caron DA, Church MJ, Zehr JP, Karl DM, De Long EF. 2017. Coordinated regulation of growth, activity and transcription in natural populations of the unicellular nitrogen-fixing cyanobacterium Crocosphaera. Nat Microbiol 2:17118. doi: 10.1038/nmicrobiol.2017.118. [DOI] [PubMed] [Google Scholar]
- 38.Lee C, Wakeham S, Arnosti C. 2004. Particulate organic matter in the sea: the composition conundrum. Ambio 33:565–575. doi: 10.1579/0044-7447-33.8.565. [DOI] [PubMed] [Google Scholar]
- 39.Keller MD, Matrai PA, Kiene RP, Bellows WK. 2004. Responses of coastal phytoplankton populations to nitrogen additions: dynamics of cell-associated dimethylsulfoniopropionate (DMSP), glycine betaine (GBT), and homarine. Can J Fish Aquat Sci 61:685–699. doi: 10.1139/f04-058. [DOI] [Google Scholar]
- 40.Perchat N, Saaidi PL, Darii E, Pellé C, Petit JL, Besnard-Gonnet M, de Berardinis V, Dupont M, Gimbernat A, Salanoubat M, Fischer C, Perret A. 2018. Elucidation of the trigonelline degradation pathway reveals previously undescribed enzymes and metabolites. Proc Natl Acad Sci U S A 115:E4358–E4367. doi: 10.1073/pnas.1722368115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Dawson HM, Heal KR, Torstensson A, Carlson LT, Ingalls A, Young JN. 2020. Large diversity in nitrogen- and sulfur-containing compatible solute profiles in polar and temperate diatoms. Integr Comp Biol 60:1401–1413. doi: 10.1093/icb/icaa133. [DOI] [PubMed] [Google Scholar]
- 42.Keller MD, Kiene RP, Matrai PA, Bellows WK. 1999. Production of glycine betaine and dimethylsulfoniopropionate in marine phytoplankton. I. Batch cultures. Mar Biol 135:237–248. doi: 10.1007/s002270050621. [DOI] [Google Scholar]
- 43.Keller MD, Kiene RP, Matrai PA, Bellows WK. 1999. Production of glycine betaine and dimethylsulfoniopropionate in marine phytoplankton. II. N-limited chemostat cultures. Mar Biol 135:249–257. doi: 10.1007/s002270050622. [DOI] [Google Scholar]
- 44.Gebser B, Pohnert G. 2013. Synchronized regulation of different zwitterionic metabolites in the osmoadaption of phytoplankton. Mar Drugs 11:2168–2182. doi: 10.3390/md11062168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Fenizia S, Thume K, Wirgenings M, Pohnert G. 2020. Ectoine from bacterial and algal origin is a compatible solute in microalgae. Mar Drugs 18:42. doi: 10.3390/md18010042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Flombaum P, Gallegos JL, Gordillo RA, Rincón J, Zabala LL, Jiao N, Karl DM, Li WKW, Lomas MW, Veneziano D, Vera CS, Vrugt JA, Martiny AC. 2013. Present and future global distributions of the marine Cyanobacteria Prochlorococcus and Synechococcus. Proc Natl Acad Sci U S A 110:9824–9829. doi: 10.1073/pnas.1307701110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Ashihara H. 2008. Trigonelline (N-methylnicotinic acid) biosynthesis and its biological role in plants. Nat Prod Commun 3:1423–1428. doi: 10.1177/1934578X0800300906. [DOI] [Google Scholar]
- 48.Hall ER, Gurin S. 1975. Experiments in marine biochemistry: homarine metabolism in Penaeus duorarum. J Biol Chem 250:6943–6946. doi: 10.1016/S0021-9258(19)41023-5. [DOI] [PubMed] [Google Scholar]
- 49.Spielmeyer A, Pohnert G. 2010. Direct quantification of dimethylsulfoniopropionate (DMSP) with hydrophilic interaction liquid chromatography/mass spectrometry. J Chromatogr B Analyt Technol Biomed Life Sci 878:3238–3242. doi: 10.1016/j.jchromb.2010.09.031. [DOI] [PubMed] [Google Scholar]
- 50.Nakamura H, Fujimaki K, Sampei O, Murai A. 1993. Gonyol: methionine-induced sulfonium accumulation in a dinoflagellate Gonyaulax polyedra. Tetrahedron Lett 34:8481–8484. doi: 10.1016/S0040-4039(00)61364-3. [DOI] [Google Scholar]
- 51.Gebser B, Thume K, Schiller F, Steinke M, Pohnert G. 2020. Phytoplankton-derived zwitterionic gonyol and dimethylsulfonioacetate interfere with microbial dimethylsulfoniopropionate sulfur cycling. MicrobiologyOpen 9:e1014. doi: 10.1002/mbo3.1014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Ksionzek KB, Lechtenfeld OJ, McCallister SL, Schmitt-Kopplin P, Geuer JK, Geibert W, Koch BP. 2016. Dissolved organic sulfur in the ocean: biogeochemistry of a petagram inventory. Science 354:456–459. doi: 10.1126/science.aaf7796. [DOI] [PubMed] [Google Scholar]
- 53.Moran MA, Durham BP. 2019. Sulfur metabolites in the pelagic ocean. Nat Rev Microbiol 17:665–678. doi: 10.1038/s41579-019-0250-1. [DOI] [PubMed] [Google Scholar]
- 54.Azam F. 1998. Microbial control of oceanic carbon flux: the plot thickens. Science 280:694–696. doi: 10.1126/science.280.5364.694. [DOI] [Google Scholar]
- 55.Azam F, Malfatti F. 2007. Microbial structuring of marine ecosystems. Nat Rev Microbiol 5:782–791. doi: 10.1038/nrmicro1747. [DOI] [PubMed] [Google Scholar]
- 56.Qin W, Heal KR, Ramdasi R, Kobelt JN, Martens-Habbena W, Bertagnolli AD, Amin SA, Walker CB, Urakawa H, Könneke M, Devol AH, Moffett JW, Armbrust EV, Jensen GJ, Ingalls AE, Stahl DA. 2017. Nitrosopumilus maritimus gen. nov., sp. nov., Nitrosopumilus cobalaminogenes gen. nov., sp. nov., Nitrosopumilus oxyclinae gen. nov., sp. nov., and Nitrosopumilus ureaphilus gen. nov., sp. nov., four marine ammonia-oxidizing archaea of the phylum Thaumarc. Int J Syst Evol Microbiol 67:5067–5079. doi: 10.1099/ijsem.0.002416. [DOI] [PubMed] [Google Scholar]
- 57.Strickland JDH, Parsons TR. 1972. A practical handbook of seawater analysis. Fisheries Research Board of Canada, Ottawa, Canada. [Google Scholar]
- 58.Qin W, Amin SA, Martens-Habbena W, Walker CB, Urakawa H, Devol AH, Ingalls AE, Moffett JW, Armbrust EV, Stahl DA. 2014. Marine ammonia-oxidizing archaeal isolates display obligate mixotrophy and wide ecotypic variation. Proc Natl Acad Sci U S A 111:12504–12509. doi: 10.1073/pnas.1324115111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Urakawa H, Martens-Habbena W, Stahl DA. 2011. Physiology and genomics of ammonia-oxidizing archaea, p 115–155. In Ward BB, Arp DJ, Klotz MG (ed), Nitrification. ASM Press, Washington, DC. [Google Scholar]
- 60.Menden-Deuer S, Lessard EJ. 2000. Carbon to volume relationships for dinoflagellates, diatoms, and other protist plankton. Limnol Oceanogr 45:569–579. doi: 10.4319/lo.2000.45.3.0569. [DOI] [Google Scholar]
- 61.Ribalet F, Berthiaume C, Hynes A, Swalwell J, Carlson M, Clayton S, Hennon G, Poirier C, Shimabukuro E, White A, Armbrust EV. 2019. SeaFlow data v1, high-resolution abundance, size and biomass of small phytoplankton in the North Pacific. Sci Data 6:277. doi: 10.1038/s41597-019-0292-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Dron A, Rabouille S, Claquin P, Le Roy B, Talec A, Sciandra A. 2012. Light-dark (12:12) cycle of carbon and nitrogen metabolism in Crocosphaera watsonii WH8501: relation to the cell cycle. Environ Microbiol 14:967–981. doi: 10.1111/j.1462-2920.2011.02675.x. [DOI] [PubMed] [Google Scholar]
- 63.Zhang Y, Qin W, Hou L, Zakem EJ, Wan X, Zhao Z, Liu L, Hunt KA, Jiao N, Kao SJ, Tang K, Xie X, Shen J, Li Y, Chen M, Dai X, Liu C, Deng W, Dai M, Ingalls AE, Stahl DA, Herndl GJ. 2020. Nitrifier adaptation to low energy flux controls inventory of reduced nitrogen in the dark ocean. Proc Natl Acad Sci U S A 117:4823–4830. doi: 10.1073/pnas.1912367117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Juranek LW, White AE, Dugenne M, Henderikx Freitas F, Dutkiewicz S, Ribalet F, Ferrón S, Armbrust EV, Karl DM. 2020. The importance of the phytoplankton ‘middle class’ to ocean net community production. Global Biogeochem Cycles 34:e2020GB006702. doi: 10.1029/2020GB006702. [DOI] [Google Scholar]
- 65.Bidigare RR, Van Heukelem L, Trees CC. 2005. Analysis of algal pigments by high-performance liquid chromatography, p 327–345. In Andersen RA (ed), Algal culturing techniques. Academic Press, New York, NY. [Google Scholar]
- 66.Chambers MC, Maclean B, Burke R, Amodei D, Ruderman DL, Neumann S, Gatto L, Fischer B, Pratt B, Egertson J, Hoff K, Kessner D, Tasman N, Shulman N, Frewen B, Baker TA, Brusniak MY, Paulse C, Creasy D, Flashner L, Kani K, Moulding C, Seymour SL, Nuwaysir LM, Lefebvre B, Kuhlmann F, Roark J, Rainer P, Detlev S, Hemenway T, Huhmer A, Langridge J, Connolly B, Chadick T, Holly K, Eckels J, Deutsch EW, Moritz RL, Katz JE, Agus DB, MacCoss M, Tabb DL, Mallick P. 2012. A cross-platform toolkit for mass spectrometry and proteomics. Nat Biotechnol 30:918–920. doi: 10.1038/nbt.2377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Benton HP, Want EJ, Ebbels TMD. 2010. Correction of mass calibration gaps in liquid chromatography–mass spectrometry metabolomics data. Bioinformatics 26:2488–2489. doi: 10.1093/bioinformatics/btq441. [DOI] [PubMed] [Google Scholar]
- 68.Smith CA, Want EJ, O’Maille G, Abagyan R, Siuzdak G. 2006. XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. Anal Chem 78:779–787. doi: 10.1021/ac051437y. [DOI] [PubMed] [Google Scholar]
- 69.Tautenhahn R, Böttcher C, Neumann S. 2008. Highly sensitive feature detection for high resolution LC/MS. BMC Bioinformatics 9:504. doi: 10.1186/1471-2105-9-504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Pino LK, Searle BC, Bollinger JG, Nunn B, Maclean B, Maccoss MJ. 2020. The Skyline ecosystem: informatics for quantitative mass spectrometry proteomics. Mass Spectrom Rev 39:229–244. doi: 10.1002/mas.21540. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Schiffman C, Petrick L, Perttula K, Yano Y, Carlsson H, Whitehead T, Metayer C, Hayes J, Rappaport S, Dudoit S. 2019. Filtering procedures for untargeted LC-MS metabolomics data. BMC Bioinformatics 20:334. doi: 10.1186/s12859-019-2871-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Sumner LW, Amberg A, Barrett D, Beale MH, Beger R, Daykin CA, Fan TW-M, Fiehn O, Goodacre R, Griffin JL, Hankemeier T, Hardy N, Harnly J, Higashi R, Kopka J, Lane AN, Lindon JC, Marriott P, Nicholls AW, Reily MD, Thaden JJ, Viant MR. 2007. Proposed minimum reporting standards for chemical analysis Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI). Metabolomics 3:211–221. doi: 10.1007/s11306-007-0082-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Horai H, Arita M, Kanaya S, Nihei Y, Ikeda T, Suwa K, Ojima Y, Tanaka K, Tanaka S, Aoshima K, Oda Y, Kakazu Y, Kusano M, Tohge T, Matsuda F, Sawada Y, Hirai MY, Nakanishi H, Ikeda K, Akimoto N, Maoka T, Takahashi H, Ara T, Sakurai N, Suzuki H, Shibata D, Neumann S, Iida T, Tanaka K, Funatsu K, Matsuura F, Soga T, Taguchi R, Saito K, Nishioka T. 2010. MassBank: a public repository for sharing mass spectral data for life sciences. J Mass Spectrom 45:703–714. doi: 10.1002/jms.1777. [DOI] [PubMed] [Google Scholar]
- 74.Wang M, Carver JJ, Phelan VV, Sanchez LM, Garg N, Peng Y, Nguyen DD, Watrous J, Kapono CA, Luzzatto-Knaan T, Porto C, Bouslimani A, Melnik AV, Meehan MJ, Liu WT, Crüsemann M, Boudreau PD, Esquenazi E, Sandoval-Calderón M, Kersten RD, Pace LA, Quinn RA, Duncan KR, Hsu CC, Floros DJ, Gavilan RG, Kleigrewe K, Northen T, Dutton RJ, Parrot D, Carlson EE, Aigle B, Michelsen CF, Jelsbak L, Sohlenkamp C, Pevzner P, Edlund A, McLean J, Piel J, Murphy BT, Gerwick L, Liaw CC, Yang YL, Humpf HU, Maansson M, Keyzers RA, Sims AC, Johnson AR, Sidebottom AM, Sedio BE, Klitgaard A, Larson CB, Boya CAP, Torres-Mendoza D, Gonzalez DJ, Silva DB, Marques LM, Demarque DP, Pociute E, O’Neill EC, Briand E, Helfrich EJN, Granatosky EA, Glukhov E, Ryffel F, Houson H, Mohimani H, Kharbush JJ, Zeng Y, Vorholt JA, Kurita KL, Charusanti P, McPhail KL, Nielsen KF, Vuong L, Elfeki M, Traxler MF, Engene N, Koyama N, Vining OB, Baric R, Silva RR, Mascuch SJ, Tomasi S, Jenkins S, Macherla V, Hoffman T, Agarwal V, Williams PG, Dai J, Neupane R, Gurr J, Rodríguez AMC, Lamsa A, Zhang C, Dorrestein K, Duggan BM, Almaliti J, Allard PM, Phapale P, Nothias LF, Alexandrov T, Litaudon M, Wolfender JL, Kyle JE, Metz TO, Peryea T, Nguyen DT, VanLeer D, Shinn P, Jadhav A, Müller R, Waters KM, Shi W, Liu X, Zhang L, Knight R, Jensen PR, Palsson B, Pogliano K, Linington RG, Gutiérrez M, Lopes NP, Gerwick WH, Moore BS, Dorrestein PC, Bandeira N. 2016. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat Biotechnol 34:828–837. doi: 10.1038/nbt.3597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Vaniya A, Fiehn O. 2015. Using fragmentation trees and mass spectral trees for identifying unknown compounds in metabolomics. Trends Analyt Chem 69:52–61. doi: 10.1016/j.trac.2015.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Sawada Y, Nakabayashi R, Yamada Y, Suzuki M, Sato M, Sakata A, Akiyama K, Sakurai T, Matsuda F, Aoki T, Hirai MY, Saito K. 2012. RIKEN tandem mass spectral database (ReSpect) for phytochemicals: a plant-specific MS/MS-based data resource and database. Phytochemistry 82:38–45. doi: 10.1016/j.phytochem.2012.07.007. [DOI] [PubMed] [Google Scholar]
- 77.Wishart DS, Tzur D, Knox C, Eisner R, Guo AC, Young N, Cheng D, Jewell K, Arndt D, Sawhney S, Fung C, Nikolai L, Lewis M, Coutouly MA, Forsythe I, Tang P, Shrivastava S, Jeroncic K, Stothard P, Amegbey G, Block D, Hau DD, Wagner J, Miniaci J, Clements M, Gebremedhin M, Guo N, Zhang Y, Duggan GE, MacInnis GD, Weljie AM, Dowlatabadi R, Bamforth F, Clive D, Greiner R, Li L, Marrie T, Sykes BD, Vogel HJ, Querengesser L. 2007. HMDB: the human metabolome database. Nucleic Acids Res 35:D521–D526. doi: 10.1093/nar/gkl923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. 2016. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res 44:D457–D462. doi: 10.1093/nar/gkv1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Kanehisa M, Furumichi M, Tanabe M, Sato Y, Morishima K. 2017. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res 45:D353–D361. doi: 10.1093/nar/gkw1092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Saccenti E, Hoefsloot HCJ, Smilde AK, Westerhuis JA, Hendriks MMWB. 2014. Reflections on univariate and multivariate analysis of metabolomics data. Metabolomics 10:361–374. doi: 10.1007/s11306-013-0598-6. [DOI] [Google Scholar]
- 81.Kruskal JB, Wish M. 1978. Multidimensional scaling, p 7–28. In Uslaner EM (ed), Sage university paper series on quantitative applications in the social sciences, no. 07-011. Sage Publications, Thousand Oaks, CA. [Google Scholar]
- 82.Clarke KR. 1993. Non-parametric multivariate analyses of changes in community structure. Austral Ecol 18:117–143. doi: 10.1111/j.1442-9993.1993.tb00438.x. [DOI] [Google Scholar]
- 83.MacQueen J. 1967. Some methods for classification and analysis of multivariate observations. Proc Fifth Berkeley Symp Math Stat Probab 1:281–297. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
For all data analysis, we used R v4.0.0. Codes for figures, tables, and data analysis are found at https://github.com/kheal/Gradients1_SemiTargeted3. Raw data for metabolomics samples are deposited at Metabolomics Workbench (https://www.metabolomicsworkbench.org/): cultures are project ID ST001514, meridional transect samples are project ID ST001410, NPSG depth profile is project ID ST001372, and NPTZ depth profile is project ID ST001409. Project IDs for environmental samples also listed in Table S4 at https://doi.org/10.5061/dryad.brv15dv8s.