Abstract
Marine sponge holobionts, defined as filter-feeding sponge hosts together with their associated microbiomes, are prolific sources of natural products. The inventory of natural products that have been isolated from marine sponges is extensive. Here, using untargeted mass spectrometry, we demonstrate that sponges harbor a far greater diversity of low-abundance natural products that have evaded discovery. While these low-abundance natural products may not be feasible to isolate, insights into their chemical structures can be gleaned by careful curation of mass fragmentation spectra. Sponges are also some of the most complex, multi-organismal holobiont communities in the oceans. We overlay sponge metabolomes with their microbiome structures and detailed metagenomic characterization to discover candidate gene clusters that encode production of sponge-derived natural products. The multi-omic profiling strategy for sponges that we describe here enables quantitative comparison of sponge metabolomes and microbiomes to address, among other questions, the ecological relevance of sponge natural products and for the phylochemical assignment of previously undescribed sponge identities.
Keywords: sponge, natural product, glycosylation, metabolomics, metagenomics
1. Introduction
Sponges are filter-feeding sessile invertebrates that are present at all latitudes in the oceans. High rates of water filtration, up to several thousand times the sponge volume per day, make sponges primary players in the benthic-pelagic nutrient recirculation [1,2]. Sponges host a diverse bacterial microbiome and among high-microbial-abundance sponges such as those discussed in this study, the microbial population can comprise up to 35% weight of the sponge [3,4]. In addition to bacteria, sponges also host eukaryotic partners such as fungi and dinoflagellates [5]. Thus, from a molecular perspective, sponges can be conceptualized as a community of multiple interacting partners, referred to here as holobionts. In the marine environment, sponge holobionts are among the richest sources of specialized metabolites, colloquially termed natural products. The complexity of sponge natural product chemical structures has offered title compounds for total synthesis and catalyst development, and several sponge-derived molecules have progressed through pharmaceutical development pipelines to be approved for clinical use [6,7]. Uncoupled from these motivations, natural products are postulated to serve core ecological functions such as defending the sponge against predation and infection and shaping the structure of the benthic community [8]. Using next-generation sequencing and assembly of bacterial genomes from metagenomic data, it is well established that members of the sponge microbiome are responsible for the biogenesis of several sponge-derived natural products [9].
Traditionally, sponge natural products have been inventoried using isolation of individual molecules and structural elucidation using a combination of experimental and theoretical approaches. These efforts have built a foundation of more than 11,000 sponge-derived natural product structures [10]. The application of untargeted mass spectrometry has revealed that the natural product diversity from sponges has been severely underappreciated [11]. The analytical challenge, now, is two-fold: first, to gain structural insights into metabolites that due to their lesser abundance and the complexity of the sponge metabolomes will remain inaccessible to traditional spectroscopic characterization; second, to advance the characterization of sponge extracts from disjointed descriptions of individual natural products to an overall metabolomic description of the holobiont community. These two advances, when realized, will allow for comparative analyses to determine which fractions of sponge metabolomes stay conserved, or diverge, in response to ecological, pharmacological, and/or other metrics. Furthermore, sponge metabolomes could then be overlaid with the holobiont microbial community structures to determine how the overall holobiont community shapes, and responds to, differences in sponge metabolomes. Correlative conservation between metabolomic features and microbiome structures aid in metagenomic mining to search for natural product biosynthetic gene loci in the sponge holobiont.
In this study, we couple the metabolomic and metagenomic characterizations of marine sponges of the genus Melophlus that were collected in Guam (Apra Harbor). The Melophlus genus is typified by the large barrel-shaped sponge Melophlus sarasinorum (previously called Asteropus sarasinorum) that is widely distributed in the Indo-Pacific with natural product isolation studies realized from Indonesia, Palau, Guam, and the Solomon Islands. In this study, DNA barcoding, microbiome composition, and metabolomic comparison places an additional, morphologically unrecognizable sponge species within the Melophlus genus. We demonstrate that the chemical diversity of natural products in the Melophlus metabolome far exceeds the current inventory. Using metagenomics, we discover possible genetic routes for the production of characteristic Melophlus natural products. Aided by structural insights from mass spectrometric fragmentation, we detect the presence of new natural product classes previously unknown from Melophlus sponges. Finally, we compare and contrast the microbiome and metabolome architectures of dominant and abundant sponge genera in close geographical neighborhoods of each other to demonstrate that a shared ecological environment does not necessarily translate to metabolomic and microbiological overlap. Our findings demonstrate the applicability of contemporary omic technologies in providing a detailed chemical, genetic, and microbiological characterization of sponge holobionts.
2. Results
2.1. Sponge Phylogeny and Microbiome
The sponge M. sarasinorum was identified by numerous small ostia distributed around a large central osculum on a yellow, round ultrastructure at the Apra Harbor in Guam (Figure 1A). Two biological replicate specimens were collected, GUM_22 and GUM_59. Specimens were frozen dry, for chemical analyses, and in preserving solution RNAlater, for DNA extraction. Sponge specimens were barcoded by Sanger sequencing of PCR amplicons corresponding to the internal transcribed spacer-2 (ITS-2) region between 5.8S and 28S rRNA encoding genes, and the 5’ terminus of the 28S rRNA encoding gene (Figure S1). BLAST against the GenBank nr database identified that sequences with the highest similarity corresponded to the Melophlus genera [12,13].
A different sponge morphology, for which biological replicates GUM_133 and GUM_139 were collected, was not immediately recognizable (Figure 1B). The GUM_133 and GUM_139 28S rRNA gene amplicons, only 40% similar to GUM_22 and GUM_59, had the greatest similarity to Theonella genus sequences. However, the ITS-2 amplicon sequences were identical between the two groups, GUM_22/GUM_59 and GUM_133/GUM_139 (Table S1). Next, we determined the microbiome composition of all sponge specimens in duplicate, using next-generation sequencing of the v4 region of the 16S rRNA gene PCR amplicons and differentiated amplicon sequence variants (ASVs). Taxonomic assignment to the phylum level reveals that microbiome architectures between the GUM_22/GUM_59 and GUM_133/GUM_139 specimens were highly similar, with Chloroflexi, Proteobacteria, Acidobacteria, and Actinobacteria being the dominant bacterial phyla (Figure 1C). Given the high microbiome and metabolomic similarity (vide infra) between the two groups of sponges, we propose that GUM_133 and GUM_139 may constitute a single species among the Melophlus genus.
2.2. Overall Metabolomic Description
Untargeted liquid chromatography/high-resolution time-of-flight mass spectrometry (LC/MS) data were acquired with no prior derivatization or fractionation of organic extracts generated from sponge tissues. Each sponge extract was analyzed in duplicate. Of the Melophlus-derived natural products, two natural product families dominate. These are the glycosylated sterol molecules grouped as sarasinosides, and the polyketide/fatty acid tetramate conjugates- melophlins [14,15,16,17,18,19,20]. Manually curating previously described sarasinosides and melophlins, first, we dereplicated identities of already known congeners (Tables S2 and S3). Nearly all known sarasinosides and melophlins could be identified. Together with the LC/MS datasets, MS2 fragmentation spectra corresponding to sarasinoside and melophlin congeners were deposited to the Global Natural Products Social Molecular Networking (GNPS) database [21].
Next, metabolites detected in the LC/MS datasets were extracted as ‘features’ using the feature detection and alignment tool MZmine2 [22]. Extracted features comprise ion m/zs conjoined with MS2 spectra, and information corresponding to retention time and ion abundance is appended to them. Features were ‘deisotoped’, in that, only the most abundant isotopic ion is preserved in a feature. Using feature mining, we compared the relative abundances of known sarasinoside and melophlin congeners among the four sponge specimens (Figure 1D). The M. sarasinorum samples GUM_22/GUM_59 harbor a greater abundance and diversity of sarasinoside congeners while the Melophlus sp. samples GUM_133/GUM_139 were enriched for melophlins. In this approach, isomeric sarasinosides and melophlin congeners are grouped together as they cannot be easily differentiated based on MS2 fragmentation data alone. The overall metabolomes of M. sarasinorum and Melophlus sp. sponges were highly similar. Of the 1319 total unique features detected, 930 features (70.5%) are shared among the four sponge specimens (Figure 1E). Only 79 (6.0%) and 106 (8.0%) features were unique to M. sarasinorum and Melophlus sp. specimens, respectively. Of the 70 features that are shared among M. sarasinorum samples only, multiple features correspond to sarasinoside congeners, and of the 76 features shared among the Melophlus sp. samples only, features corresponding to melophlins can be detected (Table S4).
2.3. Sarasinoside Chemical Diversity
The detection of sarasinoside A1 (1, Figure 2A) from M. sarasinorum has extensive precedent. To verify the identity of the natural product from Melophlus sp., we chromatographically isolated it from GUM_133 and acquired 1H and 13C NMR spectra. Comparison to literature data established the identity of the molecule isolated from GUM_133 as 1 (Figures S2 and S3; Table S5). Molecule 1 is representative of a polyglycosylated sterol (glycosyl moieties: A-xylose, B1-N-acetylglucosamine, B2-N-acetylgalactosamine, C,D-glucose). Ions observed in the MS2 spectra for 1, which is dominated by glycosidic fragments, were structurally annotated (Figure S4). For sarasinoside B1 (2, Figure 2A), among other sarasinosides, the ‘C’ glycosyl moiety is changed from glucose to xylose, as compared to 1. The C23-oxo group is invariant among sarasinosides.
Next, we queried the chemical diversity of sarasinoside congeners detected in the LC/MS datasets using MS2 spectral matching-based molecular networking [21]. To connect nodes that remain disjointed by spectral overlap alone, spectral matching was supplemented with determination of spectral motifs by MS2LDA [23]. Here, MS2LDA motifs represent substructures characterized by discrete fragments and neutral losses present in MS2 spectra. Hence, nodes that are not connected in classical molecular network but contain a common substructure are searched using MS2LDA. A molecular network for sarasinoside congeners in which parent ions are represented as nodes that are connected by overlap of fragmentation spectra, as well as conservation of MS2LDA spectral motifs, is illustrated in Figure 2B. Nodes corresponding to 1 and 2 are labeled. Nodes for previously known sarasinoside congeners are shown as diamonds, and nodes for molecules that could not be dereplicated are shown as rounded squares. Based on previously described sarasinoside congeners, accurate mass, and manual annotation of MS2 fragmentation spectra, the network is divided among pentaglycosylated (unshaded), tetraglycosylated (shaded in brown), triglycosylated (gray), and diglycosylated sarasinosides (yellow). It is clear that pentaglycosylated congeners dominate the sarasinoside inventory (Figure 2C). Within each of these groups, the fraction of congeners that have been described in literature is minor. None of the diglycosylated sarasinosides were previously reported.
The node corresponding to 1 demonstrates the presence of multiple MS2LDA motifs (Figure 2B). Of these, motif_505, highlighted as a green edge, is conserved for each node in the sarasinoside molecular network (Figure 2B). The motif_505 is comprised of a single ion, m/z 204, which corresponds to the N-acetylhexose fragment generated by the cleavage of the glycosidic bond between A and B2 sugars (Figure 2D, left; complete inventory of MS2LDA motifs is provided as supplementary information). Conservation of motif_505 demonstrates that the A-B2 glycosylation pattern is conserved among all sarasinosides. Note that motif_505 now allows us to collate all sarasinoside nodes, including singletons that were otherwise disjointed by spectral overlap alone.
Nodes corresponding to 1 and 2 differ in the presence of motif_451 (brown edge, present only for 1) and motif_668 (magenta edge, present only for 2). Motifs _451 and _668 divide the pentaglycosylated sarasinosides into two subnetworks. The motif_668 comprises of a single MS2 fragment ion, m/z 498, the structure of which is rationalized to be the B1-C-D glycosidic chain for 2 (Figure 2D, middle). As the C glycosyl ring differs between 1 and 2 (glucose and xylose, respectively, Figure 2A), motif_668 is not conserved for 1. Hence, nodes demonstrating the presence of motif_668 should possess the glycosidic pattern identical to 2, and not 1. Structural annotation of motif_451 reveals the presence of ions corresponding to the A-B1-C-D and B1-C glycosidic fragments in which the C ring corresponds to glucose (Figure 2D, right). Hence, motif_451 allows for determination of the glycosylation pattern for these nodes to be identical to that for 1. It is instructive to observe that only a single methoxy group differentiates 1 and 2, which is then differentially propagated by motifs _668 and _451 to classify pentaglycosylated sarasinoside into two distinct families.
The analyses presented above condenses parent ions within a 0.01 Da range into a single node. For instance, the node labeled ‘A’ in Figure 2B is comprised of 28 MS2 spectra spanning a 0.75 min retention time window (Table S6). Rather than a single sarasinoside congener, it is likely that the spread of retention time represents the presence of isomers. By combining classical molecular networking with feature mining by MZmine2, feature-based molecular networking (FBMN) allows for the separation of isomeric molecules as distinct features, because features incorporate an additional metric of retention time [24]. FBMN reveals that the sarasinoside chemical diversity extends even further than that illustrated in Figure 2B. For example, at least four distinct features corresponding to m/z 1321.65 can be detected (Figure 2E). Organizing these features in a molecular network reveals that they indeed cluster together, along with other sarasinoside congeners.
Next, we focused on the determination of putative structures of unknown sarasinoside congeners. Illustratively, node labeled X in Figure 2B, [M + H]1+ m/z 1317.65, possesses an identical glycosylation pattern as 1 revealed by the conservation of motif_451 and motif_505. The mass increase of 27.99 Da between X and 1 corresponds to the addition of a methoxy group and a dehydrogenation as compared to 1 (+ MeO(− H) − 2H = 27.99 Da), as is evident by the comparison of the MS2 fragmentation ions for 1 and the sarasinoside congener corresponding to node X (Figure 3A). The sterol core of 1 is likely derived from the C30-demethylation of lanosterol which introduces the Δ14,15 unsaturation, as is observed in the structure of sarasinoside A3 (vide infra, Table S2). We posit that methoxylation at C12, as is observed in the structure of sarasinoside F, will furnish the natural product corresponding to node X. Likewise, for the node labeled Y in Figure 2B, [M + H]1+ m/z 1285.63, the glycosylation pattern is conserved as for 1. The molecular formula thus predicted can only be satisfied by an unprecedented two addition dehydrogenations on the sterol core as compared to 1 (Figure 3A). Rather than traversing each node individually, we asked whether structural differences can be curated on the basis of accurate mass differences. We could indeed traverse nodes in the sarasinoside molecular network based on mass differences corresponding to common modifications such as hydroxylation (15.99 Da), methylation (14.01 Da), and dehydrogenation (2.01 Da) (Figure 3B). In light of the characterization of the glycosylation patterns, a convergent theme emerges in which tailoring of the sterol core emerges as the principal driver of sarasinoside chemical diversity.
2.4. Putative Sterol Biosynthetic Gene Clusters in the Sponge Microbiome
Next, we searched for the sarasinoside biosynthetic potential in the M. sarasinorum holobiont. Supported by literature, a hypothetical enzymatic scheme furnishing sarasinosides is illustrated in Figure 4A [25,26]. In this scheme, enzyme squalene synthase converts two molecules of farnesyl pyrophosphate to squalene. Squalene epoxidase then furnishes 2,3-oxidosqualene followed by conversion to lanosterol by lanosterol synthase. Alternatively, 2,3-oxidosqualene can also be converted to cycloartenol by cycloartenol synthase. Specific for sarasinosides, C30-lanosterol is likely demethylated by lanosterol demethylase which also introduces the Δ14,15 unsaturation. Redox tailoring to establish the C23-oxo modification and reduction of the Δ14,15 unsaturation furnishes the aglycone which is then glycosylated to yield 1 and 2. The catalytic sequence for C30-demethylation and redox tailoring of the tetracyclic core can be reversed. Indeed, sterols bearing the C23-oxo moiety without C30-demethylation are detected from other marine sources [27,28]. In the event that the Δ14,15 unsaturation is not reduced, the aglycone corresponding to sarasinosides A3, B3, and C3 is produced (Table S2). Enzymatic glycosylation is expected to proceed via glycosyltransferase enzymes that employ sugar-diphosphate nucleosidic substrates. We used enzymes in this predictive biosynthetic scheme as diagnostic elements to mine the M. sarasinorum metagenome.
Total DNA isolated from the GUM_22 holobiont was packaged into a 500 bp Illumina TruSeq DNA library and sequenced using the 2 × 150 bp paired end protocol on an Illumina HiSeq 2500 sequencer. Adaptor trimming and quality filtering yielded 105,560,704 high quality reads which were assembled into 58,828 scaffolds. Scaffolds were phylogenetically binned according to procedures that we have described previously [29,30]. Scaffold bins were organized into distinct clusters according to their phylogenetic derivation, which included sponge host as well as associated microbial taxa. Immediately, we could observe the clustering of scaffolds derived from different microbial taxa on a coverage (y-axis) vs %GC content (x-axis) plot (Figure 4B). Predicted protein functional annotations were obtained with Prokka [31]. Hidden Markov Model based searches were performed to identify candidate sterol biosynthetic enzymes using PFAM database patterns PF00494 (squalene synthase), PF08491 (squalene epoxidase), and PF13243 and PF13249 (squalene cyclase) [32]. Sequences encoding glycosyltransferase candidate enzymes were detected using PFAM database pattern PF00535 and CAZy classified search patterns for glucose, xylose, and N-acetylhexose transferases [33]. For each of these search criteria, individually, multiple genes were detected in the GUM_22 metagenome. For instance, 109 candidate squalene synthases and 32 squalene epoxidases were detected, along with 592 N-acetylhexose and 473 glucose/xylose glycosyltransferases. To sort the large number of potential hits that were dispersed across the metagenome, we queried whether multiple candidate sterol biosynthetic genes and glycosyltransferase-encoding genes were present together, within a single microbial taxon. Only a solitary γ-proteobacterium satisfied this criterion, the tight grouping of metagenomic scaffolds for which is demonstrated in Figure 4C. The metagenomically assembled genome (MAG) of this γ-proteobacterial symbiont, divided among 57 scaffolds, is judged by CheckM to be 95% complete [34]. The full length 16S rRNA gene sequence recovered from the MAG demonstrates its closest sequenced relatives to be uncultivated sponge symbionts. The sterol biosynthetic genes, annotated within the Melophlus-derived sterol biosynthesis (msb) gene cluster and glycosyltransferases, present in the Melophlus-derived glycosyl biosynthesis (mgb) gene cluster, can be identified from metagenomic scaffolds derived from this γ-proteobacterium (Figure 4D).
Within the msb locus, gene msbE encodes a bi-functional squalene epoxidase/sterol cyclase. Sequence alignment identifies a key valine residue in the C-terminal MsbE cyclase domain that allows for its annotation as a lanosterol cyclase, rather than an oxidosqualene cyclase (Figure 4A) [25]. Residues involved in coordinating the flavin cofactor can be discerned in the N-terminal epoxidase domain. Gene msbC encodes an integral membrane sterol reductase, homologs of which catalyze the reduction of the Δ14,15 unsaturation while protein product of gene msbD demonstrates high homology to the lanosterol C30-demethylase. Overlapping genes msbA and msbB encode a NAD(P)-dependent oxidoreductase and a Rieske oxygenase, respectively. Homologs of MsbA and MsbB have been demonstrated to catalyze redox tailoring of the sterol scaffold, such as oxidative demethylation of 4,4-gemdimethyl [35]. We posit that, among other activities, MsbA and MsbB could participate in the installation of the C23-oxo moiety. Gene msbF encodes a SAM-dependent sterol methyltransferase that could participate in the biosynthesis of methoxylated sarasinoside congeners such as sarasinoside F, H2, I2 (Table S2). Genes at either terminus of the msb gene locus described here bear no resemblance to sterol biosynthetic genes. Separated from the msb locus is the mgb locus. Genes mgb7–9,13 all encode glycosyltransferases while genes mgb2,4–6 encode enzymes involve in the biosynthesis and tailoring of sugar-diphosphate nucleosides. Gene mgb1 encodes a transcriptional regulator. Other genes, such as mgb10 and mgb12 that encode SAM-dependent methyltransferases, have no readily discernable roles in sarasinoside biosynthesis.
2.5. Mining for Other Glycosylated Molecules
The abundance of glycosyltransferases identified in the M. sarasinorum metagenome translates to the Melophlus metabolomes being exceptionally enriched for glycosylated molecules. The MS2LDA motif_505, which corresponds to the m/z 204 fragment ion for N-acetylhexose moieties of 1 (Figure 2), alone, could be detected in 692 (28.7%) of the 2405 nodes in the molecular network for Melophlus LC/MS data used in this study. Glycosylated molecules are distributed across all four sponge specimens ranging in parent mass m/zs from 393.20 to 1889.98, implying that a wide diversity of structures undergo glycosylation in Melophlus holobionts.
Querying for other glycosylated molecules presents interesting results. For instance, for the subnetwork illustrated in Figure 5A, MS2 spectra for ions m/z 601.41 (green node) and m/z 763.46 (blue node) demonstrate neutral losses corresponding to one and two hexose moieties, respectively (Figure 5B). The presence of evenly distributed lower abundance fragmentation ions between 100 and 300 Da that differ by 14 Da points towards a long, linear alkyl chain. Higher abundance fragment ions, m/z 421.34 and m/z 439.35 correspond to the molecular formulae C30H45O+ and C30H47O2+, respectively, representing the aglycone structures for glycosylated parent ions m/z 601.41 and m/z 763.46. Mining the Melophlus metabolomes for m/z 421.34 and m/z 439.35 lead to the identification of another dedicated subnetwork in which molecular ions corresponding to these aglycone substructures are present (Figure 5C, nodes in red). Dereplicating these molecules using the MarinLit repository lead to hits to sponge-derived polyacetylenes with m/z 439.35 neatly corresponding to yakushynol F (Figure 5D) [36]. While the positions of the unsaturations can undoubtedly differ, this is the first instance in which polyacetylinic natural products have been detected from Melophlus sponges and to the best of our knowledge, glycosylated polyacetylenes from marine sponges overall [37]. Polyacetylinic natural products are widespread in marine metabolomes and based on accurate mass differences, other nodes in the subnetwork illustrated in Figure 5C can be rationalized as modifications such as hydroxylation (node 455.35), dehydrogenation (node 437.34), and reduction (node 441.36) of the C30 yakushynol F-like core structure.
2.6. Diversity of Melophlin Natural Products
We inventoried the diversity of the other class of natural products previously described from Melophlus sponges, the long alkyl chain tetramates-melophlins. To verify the identity of melophlins from Melophlus sp. samples GUM_133/GUM_139, we isolated the most abundant melophlin congener present in the GUM_139 extract and acquired 1H and 13C NMR data (Figures S5 and S6). Comparison to literature data established that the isolated melophlin corresponded to melophlin I (3, Figure 6A) [20]. Melophlin congeners can be divided among two families, one with and the other without methylation at C5, such as the isomeric melophlin J (4, Table S3). Biosynthetically, this difference likely translates to the tetramate heterocycle being derived from the condensation of glycine (Figure 6A, top) or alanine (Figure 6A, bottom) primary amine with the activated carboxylic acid of a β-keto acid (path a, Figure 6A) followed by Dieckmann cyclization of the enolate with the aminoacyl activated carboxylic acid (path b) [38,39,40,41]. Spectroscopically, C5-methylated and C5-demethyl melophlins can be distinguished by 13C shifts at 62.7 and 57.7 ppm, respectively (in CDCl3) [20].
The MS2 spectra for 3 was structurally annotated (Figure S7). In the molecular network, node corresponding to 3 occurs as a singleton and is associated with motif_437 (blue edge, Figure 6B) and motif_444 (green edge). The dominant MS2 product ions comprising motifs _437 and _444 were annotated (Figure 6C). Determining the presence of these two motifs in the molecular network identified a subnetwork (dashed) in which nodes corresponding to other melophlin congeners were detected (Figure 6B, nodes corresponding to known melophlins shown as diamonds). In this subnetwork, a third MS2LDA motif, motif_660 was identified (red edge, Figure 6B). The motif_660 is comprised of only two MS2 product ions that can be structurally annotated to be derived from both the C5-methylated and C5-desmethyl melophlins (Figure 6C). The motif_660 connects numerous other nodes in the network, including singletons that also possess motif_437 and motif_444, demonstrating that the structural diversity of melophlins, just as for sarasinosides, is far greater than that appreciated previously (Figure 6D). Within these potentially novel melophlin congeners are brominated derivatives with the bromine atoms predicated upon the alkyl chain, as determined by annotation of the fragmentation spectra (yellow nodes, Figure 6B).
2.7. Microbiome and Metabolome Comparison of Melophlus Sponges to Neighboring Sponges
Next, we aimed to decipher whether Melophlus sponges share any of their microbiome or metabolomic architecture with their geographical neighbors. In Apra Harbor (Guam), together with M. sarasinorum, the ‘elephant ear’ sponge Ianthella basta is abundantly present (Figure 7A). The microbiome of the I. basta specimen GUM_65 was sequenced in duplicate and compared to that of M. sarasinorum specimen GUM_22. The I. basta microbiome, with a mean Shannon diversity index of 1.25 is less diverse than that of M. sarasinorum (mean diversity index 6.85, Figure 7B). Individual components of the microbiome are also divergent, with the I. basta biome being dominated by Thaumarchaeota with lesser relative abundance of Chloroflexi, Actinobacteria, and Acidobacteria phyla that dominate the M. sarasinorum microbiome (Figure 7C). The holobiont architectural divergence is reflected in the respective metabolomes as well. The I. basta metabolome, just as for Melophlus, have been extensively mined to reveal a rich tapestry of brominated alkaloids collectively called the bastadins (Figure 7D) [42,43]. The metabolomes of four I. basta biological replicates were compared to metabolomes of the four Melophlus specimens used in this study. A comparative plot reveals that no metabolomic features are shared between Melophlus and I. basta (Figure 7D). In this analysis, features contributing the most to metabolomic divergence are members of natural product families dedicated to respective sponge taxa, sarasinosides and melophlins for Melophlus and bastadins for I. basta.
3. Discussion
While Melophlus sponges have been extensively mined for natural products, this is the first untargeted metabolomic characterization of their natural product profiles, their microbiome architectures, and their metagenomes. Contemporary omics tools are making it abundantly clear that the chemical diversity and biosynthetic potential for marine sponges far exceeds prior characterizations [11]. A paired multi-omic profiling of marine sponge holobionts provides several advances. Overlapping metabolomic and microbiome architectures allow for phylochemical insight into identities of novel sponges to assist phylogenetic assignments, especially in cases of non-concerted evolution of the rRNA genes as we observe for the Melophlus genus here [44]. Overall, grossly different sponge morphologies for species in the Melophlus genus do not correspond to different microbiomes and different metabolomes. We have previously recorded similar observation for the Lamellodysidsea genus of marine sponges [29]. How and why the microbiome structure is selected for at the genus level in marine sponges is presently not clear.
A multi-omic profiling strategy also enables biosynthetic insights. For sarasinosides, chemical diversity is largely realized via modifications on the sterol core demonstrating the participation of promiscuous redox catalysts. Extensive redox tailoring of the sterol tetracyclic core is broadly observed in marine sponges [45]. Accurate mass differences between key MS2 fragment ions allows for rapidly curating what these redox modifications are, as is illustrated for representative sarasinoside congeners in Figure 3. The complexity of the sponge holobiont complicates the discovery of these modifying enzymes. It is plausible that a central sterol core that is delivered by a sponge symbiont is modified by other members of the holobiont community. Thus, in this study, we have focused on the discovery of biogenetic routes for the delivery of the sterol core structure. While sterols are ubiquitous in sponge metabolomes, the msb locus represents, to the best of our knowledge, the very first microbially encoded sponge-derived gene cluster that likely encodes for sterol production. Sterol biosynthetic genes can be dispersed in bacterial genomes, as is also the case for the γ-proteobacterial symbiont from which the msb locus is derived [25]. A squalene synthase encoding gene, an enzyme which converts farnesyl pyrophosphate to squalene (Figure 4A), is missing from the msb locus and is indeed located on a different metagenomic scaffold derived from this symbiont. Lesser variability is observed in the sarasinoside glycosylation pattern. The xylose-glycosyltransferase which installs the ‘A’ sugar ring is likely tolerant towards a large diversity of sterol structures. The B1 and B2 N-acetylhexose sugars remains invariant. Promiscuity for the sugar nucleosidic substrate (glucose or xylose) is apparent for the C-sugar glycosyltransferase. In addition to a diversity of marine sources, glycosylated sterols are widely present in plant metabolomes, where, due to their amphipathic structures, they are postulated to serve roles in modulating membrane permeabilization [46,47].
Another primary advance afforded by multi-omic profiling of sponges are the insights gained into the overall metabolomic architecture of sponges. Instructive to observe is the enrichment of distinct natural product classes. The motif_505, which corresponds to the presence of N-acetylhexose sugars (Figure 2D), is conserved for greater than 25% of all ions detected in the Melophlus LC/MS datasets. The holobiont metagenome then reveals nearly 600 N-acetylhexose glycosyltransferases to be present. While undoubtedly some of these glycosyltransferases will be involved in primary metabolism, the abundance of glycosylated metabolites and the number of candidate glycosyltransferase enzymes detected in Melophlus is unprecedented. Along the same lines, our prior metabolomic profiling of the Indo-Pacific sponge Lamellodysidea herbacea found the sponge to be exceptionally rich in the diversity of polyhalogenated phenols, leading us to posit that the sponge holobiont harbors promiscuous halogenating enzymes [48]. The rules that govern Melophlus sponges’ preponderance towards glycosylated natural products while Lamellodysidea sponges select for polyhalogenated phenols are presently not clear. These two examples are in complete contrast to our metabolomic description of the Floridian sponge Smenospongia aurea, the metabolome of which, while being very diverse, is not enriched in one particular natural product class [49].
Organizing of sponge extracts as a collection of metabolomic features allows for quantitative comparisons to address questions of choice. For instance, here, we ask whether geographical neighbors that experience identical predatory and infectious pressure use shared metabolites to combat ecological stressors? Comparison of the Melophlus metabolomes with that of I. basta, another sponge that is co-dominant on the Apra Harbor seafloor, reveals no metabolomic overlap. Crucially, this analysis includes not only abundant natural products in respective sponges that have been described spectroscopically, but also lesser abundance metabolites that are outside the purview of traditional isolation-based characterization. If sponges do not share or conserve metabolites to address comparable ecological challenges, do each of these different natural product classes represent independent defense mechanisms? It is likely that the role of sponge natural products extends beyond the simple hypothesis of predator deterrence to include recruitment and maintenance of specific commensal associations, and other, as yet unidentified roles in holobiont physiology. While ecological insight is an obvious starting point, metabolomic overlap can be used to address other questions as well such as to discern identities of metabolites in different sponge extracts leading to similar pharmacological assay readouts. Geographic proximity also does not translate to overlapping microbiomes. While Proteobacteria are abundant in both high- and low-microbial-abundance sponges, high-microbial-abundance sponges such as M. sarasinorum are enriched in Chloroflexi while low-microbial-abundance sponges such as I. basta are enriched in archaeal symbionts (Figure 7B,C), a general trend that has been observed previously [50].
Our study design used less than 5 g of wet tissue collected for each sponge specimen. The multi-omic workflows described here present a starting point to prioritize sponge specimens for larger collections for targeted deliverables. Geographically mapping paired multi-omic data from biodiversity hotspots will also democratize the addressing of a greater diversity of scientific questions from a single resource.
4. Materials and Methods
4.1. DNA Extraction from Sponge Tissues
The DNA extraction protocol used in this study was adapted from our previous reports [29,30]. Sponge tissue frozen in RNAlater was thawed overnight at 4 °C and rinsed thrice with Tris-EDTA (TE) buffer (10 mM Tris-HCl pH 7.5, 1 mM EDTA-Na). Sponge tissue was homogenized using sterile scalpel and treated with lysozyme at 30 °C in TE buffer for 2 h, followed by treatment with proteinase K at 65 °C for 30 min. Impurities were extracted twice with buffered phenol:chloroform:isoamyl alcohol and DNA precipitated by addition of equal volume of ice cold isopropanol. DNA was pelleted by centrifugation, washed twice with 70% EtOH, dissolved in TE buffer, and further column purified using Qiagen DNA purification kit using manufacturer’s directions.
4.2. Molecular Determination of Sponge Phylogeny
The hypervariable ITS-2 region of the ribosomal gene was amplified from metagenomic DNA using Q5 high fidelity DNA polymerase with the forward primer SP58bF (5′-AATCATCGAGTCTTTGAACG-3′) and reverse primer SP28cR (5′-CTTTTCACCTTTCCCTCA-3′), as described previously [51]. The thermocycling conditions were as follows: initial denaturation of 30 s at 98 °C, 35 cycles each of 10 s at 98 °C, 30 s at 45 °C, and 1 min at 72 °C, and a final extension of 2 min at 72 °C. The PCR products were purified and concentrated using Zymo DNA clean and concentrator kit and clone libraries were generated using ligation-independent TA cloning with the pGEM-T Easy Vector System kit. Three clones for each sponge amplicon were Sanger sequenced and reads were filtered manually to remove the nucleotides from the pGEM-T Easy vector. Filtered sequences were used to search the GenBank nr/nt database using Basic Local Alignment Search Tool (BLAST). The D3-D5 region of the 28S rRNA gene was amplified using the primers NL4F (5′-GACCCGAAAGATGGTGAACTA-3′) and NL4R (5′-ACCTTGGAGACCTGATGCG-3′) [12]. The thermocycling conditions were as follows: initial denaturation of 2 min at 95 °C, 35 cycles each of 30 s at 95 °C, 30 s at 55 °C, 70 s at 72 °C, and a final extension of 2 min at 72 °C in a 25 µL reaction volume using GoTaq DNA polymerase. Purification, assembly into vectors, and Sanger sequencing proceeded as described above.
4.3. 16S rRNA Gene Amplification, Sequencing, and Data Analysis
Next-generation sequencing of the v4 region of the 16S rRNA gene was used to inventory sponge-associated microbiomes. Illumina MiSeq sequencing of the dual-indexed PCR amplicons was carried out as previously described [52]. The v4 primers 515F and 806R were used with both forward and reverse primers barcoded and appended with Illumina-specific adaptors according to established procedures [53]. The PCR reactions contained 1 µL 20 ng/µL template DNA, 0.5 µL each 20 µM forward and reverse primer, 0.5 µL 10 nM dNTPs, 0.25 µL of Q5 high fidelity DNA polymerase, reaction buffer, and molecular biology grade water to adjust the volume to 25 µL. The thermocycling conditions were as follows: initial denaturation of 30 s at 98 °C, 35 cycles each of 30 s at 98 °C, 30 s at 50 °C, 20 s at 72 °C, and a final extension of 2 min at 72 °C. Amplicons were purified and concentrated using Zymo DNA clean and concentrator kit, pooled in equimolar concentrations, and sequenced on an Illumina MiSeq platform using a 500 cycle kit with 10% PhiX DNA to increase read diversity. The raw sequences were imported into QIIME2 (v.2017.7; https://docs.qiime2.org/2019.7/) using the qiime tools import script with the input format as PairedEndFastqManifestPhred33, and the sequences were demultiplexed using the qiime demux script [54]. DADA2 software was used to obtain a set of observed sequence variants (SVs) [55]. Based on quality scores, the forward and reverse reads were truncated at 150 bp using the qiime dada2 denoise script. Taxonomy was assigned using the SILVA pre-trained classifier (silva-119-99-515-806-nb-classifier) using the qiime feature classifier plug-in [56]. Taxonomic distributions of the samples were calculated using the qiime taxa barplot script.
The α-diversity metrics associated with the M. sarasinorum (GUM_22) and I. basta (GUM_65) were computed using the qiime diversity core metrics script. The Shannon diversity index was used to the compare the microbial diversity as it takes both abundance and richness into consideration. The relative abundances of the different taxonomic phyla associated were represented through a heatmap. The reads corresponding to each individual phylum in a sample were normalized by dividing the value by the sum of the reads of all phyla in the given sample. The negative logarithm of the normalized values was computed to reduce the spread of the data, and the values were imported to R studio in a .csv file format. The corresponding heatmap was generated in RStudio using the following script:
#x<-read.csv(“Final XIC area.csv”, row.names = 1, check.names = FALSE)
#y<- data.matrix(x)
#pal<-choose_palette()
#heatmap.2(y, trace = ‘none’, margins = c(10,12), Rowv = FALSE, Colv = FALSE, col = pal, cexRow = 1.0,lwid = c(2,10),lhei = c(2,10), density.info = ‘none’, key.par = list(mar = c(4,4,4,10)))\
4.4. Sponge Chemical Extraction and Mass Spectrometry Data Acquisition
Sponge tissues were frozen and lyophilized to dryness. Dried tissues were soaked in 1:1 v/v DCM/MeOH (1 mL solvent/100 mg dry sponge tissue) for 48 h at room temperature. The DCM/MeOH extract was clarified by centrifugation, dried, resuspended in MeOH, and directly analyzed using an Agilent 1290 Infinity II UHPLC coupled to a Bruker ImpactII ultra-high-resolution Qq-TOF mass spectrometer equipped with an electron spray ionization source. A Kinetex 1.7 μm C18 reversed-phase UHPLC column (50 × 2.1 mm) was employed for chromatographic separation. MS spectra were acquired in positive ionization mode from m/z 50–2000 Da. An active exclusion of ‘2’ spectra was employed, implying, that an MS1 ion would not be selected for fragmentation after two consecutive MS2 spectra had been recorded for it in a 0.5 min time window. For acquiring MS2 data, eight most intense ions per MS1 spectra were selected. Chromatography solvent A: water + 0.1% v/v formic acid, solvent B: MeCN + 0.1% v/v formic acid. Flow rate was held constant at 0.5 mL/min throughout. The elution profile employed was: 5% solvent B for 3 min, a linear gradient from 5% to 50% B in 5 min, 50% B for 2 min, from 50% to 100% B in 5 min, 100% B for 3 min, from 100% to 5% B in 1 min, 5% B for 1 min, from 5% to 100% B in 1 min, 100% B for 3 min, from 100% to 5% B in 1 min, 5% B for 5 min.
4.5. Molecular Networking
Molecular networks were generated using the GNPS online workflow [21]. The MS2 raw data were clustered with MS-Cluster with a parent mass tolerance of 0.01 Da and a MS2 fragment ion tolerance of 0.05 Da to create consensus spectra. The edges were filtered to have a cosine score above 0.7 and more than 4 MS2 matched peaks were used. Edges between nodes were preserved if both nodes were within each other’s top 10 most similar nodes. The molecular network thus created was visualized using Cytoscape 3.6.1. The molecular networking job processed by the classical molecular networking workflow in GNPS was reanalyzed by MS2LDA using default parameters [23]. For feature-based molecular networking, raw data files were processed using MZmine2 for feature detection [22]. The MS1 data was filtered by assigning a threshold level for noise detection at 10,000, MS2 spectra were filtered using a noise level threshold of 500. The following are the parameters applied to the filtered data: chromatogram builder (minimum time span: 0.1 min; minimum intensity of the highest data point in the chromatogram: 1500; m/z tolerance: 15 ppm); chromatogram deconvolution (local minimum search, m/z range for MS2 scan pairing: 0.025 Da; retention time range for MS2 scan pairing: 0.2 min); isotopic peaks grouper (m/z tolerance: 15 ppm; retention time tolerance absolute: 0.1 min; maximum charge: 3; representative isotope: most intense); join aligner (m/z tolerance: 15 ppm; retention time tolerance: 0.1 min); feature list rows filter (m/z range: 1280 to 1350 Da; MS/MS filter; reset peak number ID); remove duplicate filter (retention time tolerance absolute: 0.1; m/z tolerance: 5 ppm); peak finder (intensity tolerance: 0.1; retention time tolerance absolute: 0.1; m/z tolerance: 15 ppm). The two output files of MZmine2 were a feature table with ion intensities (.csv file format) and a list of MS2 spectra (mgf file). These files were exported with MZmine2 to the GNPS feature based networking workflow [24]. The parameters were set with the precursor ion mass tolerance of 0.01 Da and the fragment ion mass tolerance of 0.05 Da. The cosine score was set at 0.7 and a minimum of 6 matched fragment ions were needed to generate the network. The network was visualized as before in Cytoscape 3.6.1 and used to generate the Venn diagram illustrating the metabolomic overlap between M. sarasinorum (samples GUM_22 and GUM_59) and Melophlus sp. (samples GUM_133 and GUM_139). Plot for comparison of Melophlus and I. basta metabolomes were generated by MetaboAnalyst using features extracted by MZmine2 [57]. The volcano plot was generated at a fold-change threshold of 3 and a p-value threshold of 0.05, after inter quartile data filtering and pareto scaling of the data.
4.6. Isolation and Structural Characterization of 1 and 3
Sponge extracts, generated as described above, were chromatographed on a Luna 5 μm C18 reversed-phase LC column (250×10 mm) using chromatography solvent A: water + 0.05% v/v trifluoracetic acid (TFA), solvent B: MeCN + 0.05% v/v TFA. Flow rate was held constant at 2 mL/min throughout. The elution profile employed for isolation of 1 was: 5% solvent B for 5 min, a linear gradient from 5% to 40% B in 5 min, from 40% to 60% B in 20 min, from 60% to 100% B in 5 min, 100% B for 3 min, from 100% to 5% B in 1 min, 5% B for 1 min, from 5% to 100% B in 1 min, 100% B for 3 min, from 100% to 5% B in 1 min, 5% B for 5 min. The elution profile employed for isolation of 3 was: 5% solvent B for 5 min, a linear gradient from 5% to 50% B in 5 min, 50% B for 10 min, from 50% to 90% B in 2 min, from 90% to 100% B in 22 min, 100% B for 5 min, from 100% to 5% B in 1 min, 5% B for 1 min, from 5% to 100% B in 1 min, 100% B for 1 min, from 100% to 5% B in 1 min, 5% B for 5 min. An additional isocratic purification step was employed for 3, 93% solvent B for 45 min. Solvents were removed in vacuo to afford dried molecules. Molecules were dissolved 1 in CD3OD and 3 in CDCl3 followed by NMR data acquisition using an 800 MHz Bruker Avance III HD spectrometer.
4.7. Quantification of Relative Abundances of Sarasinosides and Melophlins
The abundance of sarasinoside and melophlin congeners was calculated by the area under the extracted ion chromatogram (EIC) generated as a part of the feature table processing in MZmine2. Identical parameters as described above for feature table generation were used, except that the m/z range was changed to 250–1335 Da in the feature list rows filter module. Areas under the EIC for known sarasinoside and melophlin congeners were tabulated and normalized separately for sarasinosides and melophlins. These values were then converted into a negative logarithmic scale. Normalized logarithmic areas for absent congeners were assigned a value of 6 (since the negative logarithm of 0 is undefined). The value 6 was chosed based on the least abundant sarasinoside which had a negative logarithmic value of 4.41. Therefore, 6 is a fair estimation, when based on the logarithmic scale, to denote the abundance of sarasinosides that are not detected in the sponge metabolome. The script used to generate the heatmap in RStudio is described above.
4.8. Metagenomic Sequencing, Assembly, and Mining
Illumina HiSeq 2500 reads (2 × 150 paired end) were processed using Trimmomatic version 0.339 [58], then assembled using IDBA-UD version 1.1.117 [59]. Scaffolds were binned based on percent GC, nucleotide composition, coverage depth, and taxonomic classification by DarkHorse version 2.0 [60], as previously described [61]. Reads were mapped to binned scaffold groups using Bowtie2, version 2.218 [62], and coverage depth was determined with the idxstats module of samtools version 0.1.191 [63]. Second-round, targeted assemblies were performed to obtain MAGs from individual binned read subsets using Celera Assembler version 8.3 [64]. MAG completeness was assessed using CheckM version 1.07 [34].
5. Database Deposition Information
The mass spectrometry data collected were deposited in the MassIVE repository with the MassIVE ID# MSV000084824. Sponge 28S rRNA and ITS-2 rRNA amplicon data, 16S amplicon data, and metagenomic sequencing data were deposited to GenBank under BioProject accession number PRJNA602901. Sequences for the msb and mgb gene loci have been deposited to Genbank under accession numbers MT026193 and MT026194, respectively.
Acknowledgments
The authors thank Michelle A. Schorn for assistance during sponge collection and Olakunle A. Jaiyesimi for insightful discussions.
Supplementary Materials
The following are available online at https://www.mdpi.com/1660-3397/18/2/124/s1, Figure S1: Molecular determination of sponge phylogeny, Figure S2: 1H-NMR spectra for 1 in CD3OD, Figure S3: 13C-NMR spectra for 1 in CD3OD, Figure S4: Structural annotation of the MS2 spectra for 1, Figure S5: 1H-NMR for 3 in CDCl3, Figure S6: 13C-NMR for 3 in CDCl3, Figure S7: Structural annotation of the MS2 spectra for 3, Table S1: Percentage identity matrix for Melophlus sponges as generated by EMBL-EBI Clustal Omega, Table S2: Sarasinoside congeners detected in this study and deposited to GNPS, Table S3: Melophlins congeners detected in this study and deposited to GNPS, Table S4: Table representing representative features that are shared among sponge specimens used in this study as denoted in Figure 1E in the main text, Table S5: 13C and 1H NMR data for 1 in CD3OD, Table S6: Table of retention times and precursor intensities for parent ion m/z 1321.65 condensed into a single node in molecular networking, Table S7: 13C and 1H NMR data for 3 in CDCl3.
Author Contributions
V.A. designed research; I.M. and S.P. performed research; J.S.B. and N.G. provided access to sponges and analytical tools, respectively; I.M., S.P., E.E.A. and V.A. analyzed data; V.A. wrote the paper with contribution from all authors. All authors have read and agreed to the published version of the manuscript.
Funding
This research and the APC was funded by the National Institutes of Health grant numbers [R00-ES026620 to V.A. and R01-ES030316 to E.E.A.], the National Science Foundation grant number [OCE-1837116 to E.E.A.], and an Alfred P. Sloan Foundation fellowship in Ocean Sciences to V.A.
Conflicts of Interest
The authors declare no competing financial interests.
References
- 1.De Goeij J.M., van Oevelen D., Vermeij M.J.A., Osinga R., Middelburg J.J., de Goeij A.F.P.M., Admiraal W. Surviving in a Marine Desert: The Sponge Loop Retains Resources Within Coral Reefs. Science. 2013;342:108–110. doi: 10.1126/science.1241981. [DOI] [PubMed] [Google Scholar]
- 2.Maldonado M., Ribes M., van Duyl F.C. Chapter three—Nutrient Fluxes Through Sponges: Biology, Budgets, and Ecological Implications. In: Becerro M.A., Uriz M.J., Maldonado M., Turon X., editors. Advances in Marine Biology. Volume 62. Academic Press; Cambridge, MA, USA: 2012. pp. 113–182. [DOI] [PubMed] [Google Scholar]
- 3.Gloeckner V., Wehrl M., Moitinho-Silva L., Gernert C., Schupp P., Pawlik J.R., Lindquist N.L., Erpenbeck D., Wörheide G., Hentschel U. The HMA-LMA Dichotomy Revisited: An Electron Microscopical Survey of 56 Sponge Species. Biol. Bull. 2014;227:78–88. doi: 10.1086/BBLv227n1p78. [DOI] [PubMed] [Google Scholar]
- 4.Webster N.S., Taylor M.W. Marine sponges and their microbial symbionts: Love and other relationships. Environ. Microbiol. 2012;14:335–346. doi: 10.1111/j.1462-2920.2011.02460.x. [DOI] [PubMed] [Google Scholar]
- 5.Nguyen M.T.H.D., Thomas T. Diversity, host-specificity and stability of sponge-associated fungal communities of co-occurring sponges. PeerJ. 2018;6:e4965. doi: 10.7717/peerj.4965. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Gerwick W.H., Moore B.S. Lessons from the past and charting the future of marine natural products drug discovery and chemical biology. Chem. Biol. 2012;19:85–98. doi: 10.1016/j.chembiol.2011.12.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Molinski T.F., Dalisay D.S., Lievens S.L., Saludes J.P. Drug development from marine natural products. Nat. Rev. Drug Discov. 2009;8:69–85. doi: 10.1038/nrd2487. [DOI] [PubMed] [Google Scholar]
- 8.Loh T.-L., Pawlik J.R. Chemical defenses and resource trade-offs structure sponge communities on Caribbean coral reefs. Proc. Natl. Acad. Sci. USA. 2014;111:4151–4156. doi: 10.1073/pnas.1321626111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Morita M., Schmidt E.W. Parallel lives of symbionts and hosts: Chemical mutualism in marine animals. Nat. Prod. Rep. 2018;35:357–378. doi: 10.1039/C7NP00053G. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.MarinLit. [(accessed on 14 January 2020)]; Available online: http://pubs.rsc.org/marinlit/searchresult/compounds?AFreeText=porifera&FullText=True&Page=bib.
- 11.Paul V.J., Freeman C.J., Agarwal V. Chemical Ecology of Marine Sponges: New Opportunities through “-Omics”. Integr. Comp. Biol. 2019;59:765–776. doi: 10.1093/icb/icz014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Nichols S.A. An evaluation of support for order-level monophyly and interrelationships within the class Demospongiae using partial data from the large subunit rDNA and cytochrome oxidase subunit I. Mol. Phylogenet. Evol. 2005;34:81–96. doi: 10.1016/j.ympev.2004.08.019. [DOI] [PubMed] [Google Scholar]
- 13.Cárdenas P., Xavier J.R., Reveillaud J., Schander C., Rapp H.T. Molecular Phylogeny of the Astrophorida (Porifera, Demospongiaep) Reveals an Unexpected High Level of Spicule Homoplasy. PLoS ONE. 2011;6:e18318. doi: 10.1371/journal.pone.0018318. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Schmitz F.J., Ksebati M.B., Gunasekera S.P., Agarwal S. Sarasinoside A1: A saponin containing amino sugars isolated from a sponge. J. Org. Chem. 1988;53:5941–5947. doi: 10.1021/jo00260a025. [DOI] [Google Scholar]
- 15.Espada A., Jiménez C., Rodríguez J., Crews P., Riguera R. Sarasinosides D-G: Four new triterpenoid saponins from the sponge asteropus sarasinosum. Tetrahedron. 1992;48:8685–8696. doi: 10.1016/S0040-4020(01)89444-8. [DOI] [Google Scholar]
- 16.Dai H.-F., Edrada R.A., Ebel R., Nimtz M., Wray V., Proksch P. Norlanostane Triterpenoidal Saponins from the Marine Sponge Melophlus sarassinorum. J. Nat. Prod. 2005;68:1231–1237. doi: 10.1021/np050152d. [DOI] [PubMed] [Google Scholar]
- 17.Lee H.-S., Seo Y., Cho K.W., Rho J.-R., Shin J., Paul V.J. New Triterpenoid Saponins from the Sponge Melophlus isis. J. Nat. Prod. 2000;63:915–919. doi: 10.1021/np990589j. [DOI] [PubMed] [Google Scholar]
- 18.Aoki S., Higuchi K., Ye Y., Satari R., Kobayashi M. Melophlins A and B, Novel Tetramic Acids Reversing the Phenotype of ras-Transformed Cells, from the Marine Sponge Melophlus sarassinorum. Tetrahedron. 2000;56:1833–1836. doi: 10.1016/S0040-4020(00)00092-2. [DOI] [Google Scholar]
- 19.Xu J., Hasegawa M., Harada K.-I., Kobayashi H., Nagai H., Namikoshi M. Melophlins p, q, R, and s: Four New Tetramic Acid Derivatives, from Two Palauan Marine Sponges of the Genus Melophlus. Chem. Pharm. Bull. 2006;54:852–854. doi: 10.1248/cpb.54.852. [DOI] [PubMed] [Google Scholar]
- 20.Wang C.-Y., Wang B.-G., Wiryowidagdo S., Wray V., van Soest R., Steube K.G., Guan H.-S., Proksch P., Ebel R. Melophlins C−O, Thirteen Novel Tetramic Acids from the Marine Sponge Melophlus sarassinorum. J. Nat. Prod. 2003;66:51–56. doi: 10.1021/np0202778. [DOI] [PubMed] [Google Scholar]
- 21.Wang M.X., Carver J.J., Phelan V.V., Sanchez L.M., Garg N., Peng Y., Nguyen D.D., Watrous J., Kapono C.A., Luzzatto-Knaan T., et al. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat. Biotechnol. 2016;34:828–837. doi: 10.1038/nbt.3597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Pluskal T., Castillo S., Villar-Briones A., Orešič M. MZmine 2: Modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data. BMC Bioinform. 2010;11:395. doi: 10.1186/1471-2105-11-395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Van der Hooft J.J.J., Wandy J., Barrett M.P., Burgess K.E.V., Rogers S. Topic modeling for untargeted substructure exploration in metabolomics. Proc. Natl. Acad. Sci. USA. 2016;113:13738–13743. doi: 10.1073/pnas.1608041113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Nothias L.F., Petras D., Schmid R., Dührkop K., Rainer J., Sarvepalli A., Protsyuk I., Ernst M., Tsugawa H., Fleischauer M., et al. Feature-based Molecular Networking in the GNPS Analysis Environment. bioRxiv. 2019:812404. doi: 10.1101/812404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wei J.H., Yin X., Welander P.V. Sterol Synthesis in Diverse Bacteria. Front. Microbiol. 2016;7:990. doi: 10.3389/fmicb.2016.00990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Haubrich A.B. Microbial Sterolomics as a Chemical Biology Tool. Molecules. 2018;23:2768. doi: 10.3390/molecules23112768. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Li X.-C., Jacob M.R., Ding Y., Agarwal A.K., Smillie T.J., Khan S.I., Nagle D.G., Ferreira D., Clark A.M. Capisterones A and B, which Enhance Fluconazole Activity in Saccharomyces cerevisiae, from the Marine Green Alga Penicillus capitatus. J. Nat. Prod. 2006;69:542–546. doi: 10.1021/np050396y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Ebada S.S., Lin W., Proksch P. Bioactive Sesterterpenes and Triterpenes from Marine Sponges: Occurrence and Pharmacological Significance. Mar. Drugs. 2010;8:313–346. doi: 10.3390/md8020313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Agarwal V., Blanton J.M., Podell S., Taton A., Schorn M.A., Busch J., Lin Z., Schmidt E.W., Jensen P.R., Paul V.J., et al. Metagenomic discovery of polybrominated diphenyl ether biosynthesis by marine sponges. Nat. Chem. Biol. 2017;13:537–543. doi: 10.1038/nchembio.2330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Podell S., Blanton J.M., Neu A., Agarwal V., Biggs J.S., Moore B.S., Allen E.E. Pangenomic comparison of globally distributed Poribacteria associated with sponge hosts and marine particles. ISME J. 2019;13:468–481. doi: 10.1038/s41396-018-0292-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Seemann T. Prokka: Rapid prokaryotic genome annotation. Bioinformatics. 2014;30:2068–2069. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
- 32.El-Gebali S., Mistry J., Bateman A., Eddy S.R., Luciani A., Potter S.C., Qureshi M., Richardson L.J., Salazar G.A., Smart A., et al. The Pfam protein families database in 2019. Nucleic Acids Res. 2018;47:D427–D432. doi: 10.1093/nar/gky995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Lombard V., Golaconda Ramulu H., Drula E., Coutinho P.M., Henrissat B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2013;42:D490–D495. doi: 10.1093/nar/gkt1178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Parks D.H., Imelfort M., Skennerton C.T., Hugenholtz P., Tyson G.W. CheckM: Assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Lee A.K., Banta A.B., Wei J.H., Kiemle D.J., Feng J., Giner J.-L., Welander P.V. C-4 sterol demethylation enzymes distinguish bacterial and eukaryotic sterol synthesis. Proc. Natl. Acad. Sci. USA. 2018;115:5884–5889. doi: 10.1073/pnas.1802930115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Takanashi E., Takada K., Hashimoto M., Itoh Y., Ise Y., Ohtsuka S., Okada S., Matsunaga S. Cytotoxic linear acetylenes from a marine sponge Pleroma sp. Tetrahedron. 2015;71:9564–9570. doi: 10.1016/j.tet.2015.10.062. [DOI] [Google Scholar]
- 37.Zhou Z.-F., Menna M., Cai Y.-S., Guo Y.-W. Polyacetylenes of Marine Origin: Chemistry and Bioactivity. Chem. Rev. 2015;115:1543–1596. doi: 10.1021/cr4006507. [DOI] [PubMed] [Google Scholar]
- 38.Wu Q., Wu Z., Qu X., Liu W. Insights into Pyrroindomycin Biosynthesis Reveal a Uniform Paradigm for Tetramate/Tetronate Formation. J. Am. Chem. Soc. 2012;134:17342–17345. doi: 10.1021/ja304829g. [DOI] [PubMed] [Google Scholar]
- 39.Gui C., Li Q., Mo X., Qin X., Ma J., Ju J. Discovery of a New Family of Dieckmann Cyclases Essential to Tetramic Acid and Pyridone-Based Natural Products Biosynthesis. Org. Lett. 2015;17:628–631. doi: 10.1021/ol5036497. [DOI] [PubMed] [Google Scholar]
- 40.Kakule T.B., Zhang S., Zhan J., Schmidt E.W. Biosynthesis of the Tetramic Acids Sch210971 and Sch210972. Org. Lett. 2015;17:2295–2297. doi: 10.1021/acs.orglett.5b00715. [DOI] [PubMed] [Google Scholar]
- 41.Bihlmaier C., Welle E., Hofmann C., Welzel K., Vente A., Breitling E., Müller M., Glaser S., Bechthold A. Biosynthetic Gene Cluster for the Polyenoyltetramic Acid α-Lipomycin. Antimicrob. Agents Chemother. 2006;50:2113–2121. doi: 10.1128/AAC.00007-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Calcul L., Inman W.D., Morris A.A., Tenney K., Ratnam J., McKerrow J.H., Valeriote F.A., Crews P. Additional insights on the bastadins: Isolation of analogues from the sponge Ianthella cf. reticulata and exploration of the oxime configurations. J. Nat. Prod. 2010;73:365–372. doi: 10.1021/np9005986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Peng J., Li J., Hamann M.T. The marine bromotyrosine derivatives. Alkaloids Chem. Biol. 2005;61:59–262. doi: 10.1016/s1099-4831(05)61002-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Pereira T.J., Baldwin J.G. Contrasting evolutionary patterns of 28S and ITS rRNA genes reveal high intragenomic variation in Cephalenchus (Nematoda): Implications for species delimitation. Mol. Phylogenet. Evol. 2016;98:244–260. doi: 10.1016/j.ympev.2016.02.016. [DOI] [PubMed] [Google Scholar]
- 45.Sarma S.N., Krishna S.M., Rao R.S. Sterol Ring System Oxidation Pattern in Marine Sponges. Mar. Drugs. 2005;3:84–111. doi: 10.3390/md303084. [DOI] [Google Scholar]
- 46.Ferrer A., Altabella T., Arró M., Boronat A. Emerging roles for conjugated sterols in plants. Prog. Lipid Res. 2017;67:27–37. doi: 10.1016/j.plipres.2017.06.002. [DOI] [PubMed] [Google Scholar]
- 47.Stonik A.V., Stonik V.I. Sterol and Sphingoid Glycoconjugates from Microalgae. Mar. Drugs. 2018;16:514. doi: 10.3390/md16120514. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Agarwal V., Li J., Rahman I., Borgen M., Aluwihare L.I., Biggs J.S., Paul V.J., Moore B.S. Complexity of naturally produced polybrominated diphenyl ethers revealed via mass spectrometry. Environ. Sci. Technol. 2015;49:1339–1346. doi: 10.1021/es505440j. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Cantrell T.P., Freeman C.J., Paul V.J., Agarwal V., Garg N. Mass Spectrometry-Based Integration and Expansion of the Chemical Diversity Harbored Within a Marine Sponge. J. Am. Soc. Mass Spectrom. 2019;30:1373–1384. doi: 10.1007/s13361-019-02207-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Moitinho-Silva L., Steinert G., Nielsen S., Hardoim C.C.P., Wu Y.-C., McCormack G.P., López-Legentil S., Marchant R., Webster N., Thomas T., et al. Predicting the HMA-LMA Status in Marine Sponges by Machine Learning. Front. Microbiol. 2017;8:752. doi: 10.3389/fmicb.2017.00752. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Thacker R.W., Starnes S. Host specificity of the symbiotic cyanobacterium Oscillatoria spongeliae in marine sponges, Dysidea spp. Mar. Biol. 2003;142:643–648. doi: 10.1007/s00227-002-0971-x. [DOI] [Google Scholar]
- 52.Pratte Z.A., Longo G.O., Burns A.S., Hay M.E., Stewart F.J. Contact with turf algae alters the coral microbiome: Contact versus systemic impacts. Coral Reefs. 2018;37:1–13. doi: 10.1007/s00338-017-1615-4. [DOI] [Google Scholar]
- 53.Kozich J.J., Westcott S.L., Baxter N.T., Highlander S.K., Schloss P.D. Development of a Dual-Index Sequencing Strategy and Curation Pipeline for Analyzing Amplicon Sequence Data on the MiSeq Illumina Sequencing Platform. Appl. Environ. Microbiol. 2013;79:5112–5120. doi: 10.1128/AEM.01043-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Bolyen E., Rideout J.R., Dillon M.R., Bokulich N.A., Abnet C.C., Al-Ghalith G.A., Alexander H., Alm E.J., Arumugam M., Asnicar F., et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat. Biotechnol. 2019;37:852–857. doi: 10.1038/s41587-019-0209-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Callahan B.J., McMurdie P.J., Rosen M.J., Han A.W., Johnson A.J.A., Holmes S.P. DADA2: High-resolution sample inference from Illumina amplicon data. Nat. Methods. 2016;13:581–583. doi: 10.1038/nmeth.3869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Yilmaz P., Parfrey L.W., Yarza P., Gerken J., Pruesse E., Quast C., Schweer T., Peplies J., Ludwig W., Glockner F.O. The SILVA and “All-species Living Tree Project (LTP)” taxonomic frameworks. Nucleic Acids Res. 2014;42:D643–D648. doi: 10.1093/nar/gkt1209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Chong J., Wishart D.S., Xia J. Using MetaboAnalyst 4.0 for Comprehensive and Integrative Metabolomics Data Analysis. Curr. Protoc. Bioinform. 2019;68:e86. doi: 10.1002/cpbi.86. [DOI] [PubMed] [Google Scholar]
- 58.Bolger A.M., Lohse M., Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Peng Y., Leung H.C., Yiu S.M., Chin F.Y. IDBA-UD: A de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth. Bioinformatics. 2012;28:1420–1428. doi: 10.1093/bioinformatics/bts174. [DOI] [PubMed] [Google Scholar]
- 60.Podell S., Gaasterland T. DarkHorse: A method for genome-wide prediction of horizontal gene transfer. Genome Biol. 2007;8:R16. doi: 10.1186/gb-2007-8-2-r16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Podell S., Ugalde J.A., Narasingarao P., Banfield J.F., Heidelberg K.B., Allen E.E. Assembly-driven community genomics of a hypersaline microbial ecosystem. PLoS ONE. 2013;8:e61692. doi: 10.1371/journal.pone.0061692. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Langmead B., Salzberg S.L. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Myers E.W., Sutton G.G., Delcher A.L., Dew I.M., Fasulo D.P., Flanigan M.J., Kravitz S.A., Mobarry C.M., Reinert K.H.J., Remington K.A., et al. A Whole-Genome Assembly of Drosophila. Science. 2000;287:2196–2204. doi: 10.1126/science.287.5461.2196. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.