Abstract
Many marine sponges host highly diverse microbiomes that contribute to various aspects of host health. Although the putative function of individual groups of sponge symbionts has been increasingly described, the extreme diversity has generally precluded in‐depth characterization of entire microbiomes, including identification of syntrophic partnerships. The Indo‐Pacific sponge Ianthella basta is emerging as a model organism for symbiosis research, hosting only three dominant symbionts: a Thaumarchaeotum, a Gammaproteobacterium, and an Alphaproteobacterium and a range of other low abundance or transitory taxa. Here, we retrieved metagenome assembled genomes (MAGs) representing >90% of I. basta's microbial community, facilitating the metabolic reconstruction of the sponge's near complete microbiome. Through this analysis, we identified metabolic complementarity between microbes, including vitamin sharing, described the importance of low abundance symbionts, and characterized a novel microbe–host attachment mechanism in the Alphaproteobacterium. We further identified putative viral sequences, highlighting the role viruses can play in maintaining symbioses in I. basta through the horizontal transfer of eukaryotic‐like proteins, and complemented this data with metaproteomics to identify active metabolic pathways in bacteria, archaea, and viruses. This data provide the framework to adopt I. basta as a model organism for studying host–microbe interactions and provide a basis for in‐depth physiological experiments.
INTRODUCTION
Sponges are ecologically important components of aquatic ecosystems around the world, providing habitat and mediating biochemical cycles by filtering thousands of litres of water each day (Bell, 2008; de Goeij et al., 2013; Weisz et al., 2008). On coral reefs, for example, sponges recycle and retain nutrients through the ‘sponge‐loop’, in which sponges take up and transform dissolved organic matter (DOM) into biomass, which in turn is consumed by reef‐dwelling detritivores to complete the loop (de Goeij et al., 2013). Marine sponges host diverse and stable communities of microorganisms that are distinct from the surrounding seawater (Schmitt et al., 2012; Taylor et al., 2013) and contribute to holobiont health and overall function. For example, sponge symbionts are actively involved in the assimilation of DOM, recycle host waste products, metabolize toxic ammonia, and are enriched in restriction‐modification systems that could aid in defence against mobile genetic elements, such as viruses (Hudspith et al., 2021; Moeller et al., 2019; Pita et al., 2018; Robbins et al., 2021; Slaby et al., 2017).
Despite the importance of the sponge microbiome, its complexity hampers the in‐depth characterization of each microbial taxa, with many sponge species hosting up to 3000 distinct microbial species (Webster et al., 2010). Consequently, few sponge symbionts have been described in detail and hypotheses on microbial syntrophy have remained confined to select members of the community (Bayer et al., 2020; Moitinho‐Silva et al., 2017). Moreover, sponge species for which the function of all dominant microbial members has been inferred from omics data frequently lack data on symbionts present in low abundance (Gauthier et al., 2016). However, low abundance microbes can drive fundamentally different processes than high abundance microbes (Rivett & Bell, 2018). To systematically understand how the sponge holobiont functions and to allow for manipulative experiments to further unravel the mechanisms of establishment and maintenance of symbiosis, fully characterized model organisms are needed. Such models will also be extremely valuable for studying more complex animal–microbiota symbiotic relationships.
Ianthella basta is an abundant sponge throughout the Indo‐Pacific and hosts only three dominant symbionts: a Thaumarchaeotum, a Gammaproteobacterium, and an Alphaproteobacterium (Luter et al., 2010; Moeller et al., 2019), and a range of other low abundance taxa, making this sponge an ideal candidate for establishing a model for sponge symbiosis. Ianthella basta's microbiome is stable across different environments (Luter et al., 2010), with the dominant symbionts, which are vertically transmitted (Engelberts et al., 2022), typically comprising over 90% of the community. Previous metagenomic and physiological studies of I. basta have characterized the dominant Thaumarchaeotum and Gammaproteobacterium as key players within important pathways, such as ammonia oxidation and taurine metabolism, respectively (Moeller et al., 2019, 2022). Here, we complete the microbiome by recovering MAGs for the dominant Alphaproteobacterium and several low abundant microbes that are persistent across samples, which allowed us to not only describe the full genomically predictable metabolic potential of I. basta's microbiome, but also to reconstruct metabolic complementarity between these microbes. We further highlight the contribution of viruses to the metabolic capability of the sponge and complement all data with metaproteomic sequencing. This study provides an unprecedented framework within which to use I. basta in future manipulative experiments (Engelberts et al., 2021).
EXPERIMENTAL PROCEDURES
Sample collection
The sponge Ianthella basta presents in two colour morphotypes, purple and yellow (Freckelton et al., 2012), of which the tissue can be designated as either thick or thin. For this study, three yellow Ianthella basta individuals with thin tissue were collected in November 2019 from Orpheus Island, central Great Barrier Reef (18°37′ S, 146°30′ E) between 11 and 15 m depth using scuba. Sponges were kept in aquaria for 12 h, after which they were fragmented, snap frozen, and sent to the Australian Centre for Ecogenomics (ACE), Brisbane, for metagenomic sequencing and to the Centre for Environmental Research, Helmholtz (Leipzig, Germany), for metaproteomic analysis.
To determine whether the metagenomes recovered in this study were representative of the dominant symbionts found in different morphotypes of I. basta, we blasted previously recovered 16S rRNA gene amplicon sequences of the three dominant symbionts of a purple, thick morphotype (Engelberts et al., 2022) to our genomes using Blast v2.9.0+ (Camacho et al., 2009). A further three yellow, thin and three yellow, thick individuals were collected from Davies Reef (S 18°49.354′, E 147°38.253′) in February 2021, using SCUBA at depths between 12 and 15 m depth. For these individuals, 16S rRNA gene amplicon sequences of the three dominant symbionts were obtained following the methods described in Engelberts et al. (2022) and again blasted to the genomes from this study (see Table S2 for 16S rRNA gene amplicon sequences).
Microbial enrichment, DNA extraction, and metagenomic sequencing
Microbial cells were separated from the host tissue for all three I. basta individuals (~1 to 3 g sponge wet weight per individual) according to the methods described in Botté et al. (2019), with the exception that samples were ground on ice in 1X Calcium/Magnesium‐Free Sea Water (CMFSW) for 5 min to homogenize the sponge tissue. Samples were treated with DNase I (RNase‐free, New England Biolabs, M0303) to remove any remaining host and extracellular DNA, after which the microbial DNA was extracted using the phenol‐chloroform protocol described in Robbins et al. (2019), omitting all freeze–thaw cycles which could shear the DNA. We note that it is possible that the removal of host cells could remove intracellular symbionts, such as Chlamydiae (Engelberts et al., 2022). Quantity and quality of the extracted DNA were checked using Qubit assays and 16S/18S rRNA PCR. Libraries were prepared with the Illumina Nextera Flex library prep kit at Microba Life Sciences (Brisbane, Australia) and sequenced on the Illumina NovaSeq 6000 (2 × 250 bp). All samples (n = 3) were first sequenced at an average depth of 5.5 Gbp for differential coverage binning (Albertsen et al., 2013), followed by sequencing at a depth of 20 Gbp to increase the coverage of microorganisms present at low abundance.
Metagenome assembly, binning, and taxonomic assignment
Paired‐end DNA sequencing data were demultiplexed and adapters trimmed using Illumina BaseSpace Bcl2fastq2 v2.20, accepting one mismatch in index sequences. All reads were assembled using metaSPades v3.14.0 (Nurk et al., 2017). To supplement this dataset, microbial metagenomic reads previously retrieved from six sequencing runs on one purple, thick I. basta individual (sampled in October 2011; Moeller et al., 2019) were reassembled using metaSPAdes v3.14.0 (Nurk et al., 2017) and analysed together with data from this study. Binning of all data (separately for each sponge individual) was performed with the ‘recover’ pipeline of Aviary (https://github.com/rhysnewell/aviary), which uses the binning algorithms MetaBAT v1 (Kang et al., 2015), MetaBAT v2 (Kang et al., 2019), MaxBin v2 (Wu et al., 2015), CONCOCT (Alneberg et al., 2014), VAMB (Nissen et al., 2021), and Rosella (https://github.com/rhysnewell/rosella). DASTool v1.1.2 (Sieber et al., 2018) was used to select for the best representative metagenome assembled genome (MAG) for each sponge individual produced from the different binning algorithms. Completeness and contamination of each MAG were determined using CheckM v1.1.3 (Parks et al., 2015) and only MAGs that were over 50% complete with less than 5% contamination were retained for downstream analyses.
GTDB‐Tk v1.5.0 (Chaumeil et al., 2019), which is based on the Genome Taxonomy Database (GTDB, http://gtdb.ecogenomic.org) taxonomy Release 202, was used to assign taxonomy to each MAG. GTDB‐Tk uses FASTANI (Jain et al., 2018) and pplacer (Matsen et al., 2010) to classify MAGs based on their placement in a reference tree, which is inferred using a set of 120 bacterial and 122 archaeal markers.
MAGs were dereplicated at 95% identity with Drep v2.6.2 using otherwise default parameters (Olm et al., 2017) to remove duplicate MAGs recovered from I. basta individuals from our study and Moeller et al. (2019). To confirm that the recovered MAGs represented the majority of I. basta's microbial community, single‐copy marker genes were identified in all raw reads and MAGs and the fraction of marker genes captured in the MAGs was subsequently assessed using the ‘pipe’ and ‘appraise’ functions of SingleM v0.13.2 (https://github.com/wwood/singlem). To calculate the relative abundance of each MAG in all four I. basta individuals (three from this study, one study from Moeller et al., 2019), MAGs were first dereplicated at 95% identity with Drep v2.6.2 (Olm et al., 2017) to avoid arbitrary mapping between representatives of highly similar genomes, and reads were mapped to these MAGs with CoverM v0.2.0 (https://github.com/wwood/CoverM, min‐covered‐fraction set to 0.1). The final heatmap was visualized with R v4.0.5 (R Core Team, 2013), showing reads from the deep sequencing runs on the three I. basta individuals from this study, reads from one I. basta individual from Moeller et al. (2019), and MAGs that were consistently present over the four I. basta individuals (i.e. present in at least three out of the four individuals; Figure 1). These MAGs were used for the metabolic reconstruction of I. basta's microbiome. We further analysed a dataset originating from a 454‐pyrosequencing run from Moeller et al. (2019). From this dataset, a Thaumarchaeotal and Gammaproteobacterial MAG were retrieved with ANI values of 98.90% and 97.78% compared to the genomes recovered in this study. However, due to the low sequencing depth, this data was not included in Figure 1.
FIGURE 1.
Relative abundance of MAGs present in ≥3 I. basta individuals after de‐replication at 95% identity. For I. basta individual 4 the average relative abundance is shown, as this individual was sequenced six times in the study Moeller et al. (2019). ‘Unmapped’ represents reads that could not be mapped to any of the six MAGs. IB: Ianthella basta followed by the replicate number. The proposed species name (dominant symbionts; Moeller et al., 2019, 2022) or the GTDB‐Tk taxonomy (low abundant symbionts) is displayed.
Tree building
Archaeal and bacterial phylogenomic trees were inferred for each of I. basta's three dominant symbionts, using the ‘de_novo_wf’ workflow of GTDB‐Tk v1.5.0 (Chaumeil et al., 2019). This workflow calculates de novo trees using FastTree v2.1.9 (Price et al., 2010) using the WAG + GAMMA model. Each tree included all publicly available genomes from GTDB that belonged to the same phylum or class as the respective I. basta symbiont ‐ i.e. all Thaumarchaeota (a total of 564 MAGs), Gammaproteobacteria (8980 MAGs), or Alphaproteobacteria (7361 MAGs). Tree files were further visualized and refined with the Interactive Tree Of Life (iTOL) v6.4.1 (Letunic & Bork, 2021) and Inkscape v0.92.4 (https://inkscape.org/) for Figures [Link], [Link].
The closest relatives to the I. basta Thaumarchaeotal MAG and Gammaproteobacterial 16S rRNA gene amplicon sequence (see Moeller et al., 2022) came from the sponge Hexadella detritifera, which like I. basta, belongs to the Ianthellidae family. To elucidate whether the dominant Alphaproteobacterial and Gammaproteobacterial MAGs from I. basta were also most closely related to microbial symbionts found in H. detritifera, we assembled and binned previously published raw reads from H. detritifera (Zhang et al., 2019). The resulting Alpha‐ and Gammaproteobacterial MAGs were added to the phylogenomic trees (see Supplementary Note 1).
Functional annotation of MAGs
The EnrichM v0.6.3 (https://github.com/geronimp/enrichM) ‘annotate’ function was used to functionally annotate MAGs with the Kyoto Encyclopaedia of Genes and Genomes (KEGG) Orthologies (KOs) and PFAM databases (Kanehisa et al., 2015; Mistry et al., 2020), as well as with Hidden Markov Models (HMMs) from the database for Automated carbohydrate‐Active Enzyme Annotation (dbCAN) (Yin et al., 2012) to identify carbohydrate‐active enzymes (CAZYs). For carbon fixation, vitamin production, and amino acid biosynthesis pathways, EnrichM v0.6.3 ‘classify’ was used to identify pathways that were ≥60% complete and consequently assigned to a MAG. Notably, the KEGG module for cobalamin biosynthesis does not include the full pathway, thus the cobalamin module defined by Engelberts et al. (2020) was used. The presence of amino acid biosynthesis pathways was further validated using GapMind (Price et al., 2020). For ABC transporters to be positively assigned to a MAG, genes for both the substrate‐specific component and the integral membrane protein had to be present. Secondary metabolite clusters were predicted in each MAG using AntiSMASH v5.0 (Blin et al., 2019). Proteins with predicted signal peptides for the translocation across the cytoplasmic and outer membrane were identified using SignalP v5.0 (Almagro Armenteros et al., 2019).
Identification, cross‐validation, and annotation of viral sequences
Putative viral sequences were identified in all metagenomic assemblies using VirSorter v1.1 (Roux et al., 2015). To assign taxonomy, the putative viral sequences were aligned to the viral component of the RefSeq Nonredundant (Nr) database (https://www.ncbi.nlm.nih.gov/refseq/), with an E‐value cut‐off of 1e‐05. Sequences were also blasted against previously recovered viral sequences from I. basta (Laffy et al., 2018) using BlastP v2.9.0 (Camacho et al., 2009) for cross‐validation and to further confirm their taxonomic assignment (Table S6). Viral contigs were functionally annotated by calling open reading frames (ORFs) in the metagenomic assemblies using Prodigal v2.6.3 (Hyatt et al., 2010). ORFs were subsequently annotated with the KO and PFAM databases using EnrichM v0.6.3 (https://github.com/geronimp/enrichM).
Protein extraction and proteome analyses
Tissue from the same three I. basta individuals metagenomically sequenced in this study was sent to the Centre for Environmental Research, Helmholtz, for metaproteomic analysis. Sponge tissue was disrupted by bead beating (FastPrep‐24, MP Biomedicals, Sanra Ana, CA, USA; 5.5 ms, 1 min, 3 cycles) followed by ultra‐sonication (UP50H, Hielscher, Teltow, Germany; cycle 0.5, amplitude 60%) and centrifugation (10,000 × g, 10 min). The protein lysate was loaded on an SDS‐gel and run for 10 min. The gel piece was cut, washed, and incubated with 25 mM 1,4 dithiothreitol (in 20 mM ammonium bicarbonate) for 1 h and 100 mM iodoacetamide (in 20 mM ammonium bicarbonate) for 30 min, and destained, dehydrated, and proteolytically cleaved overnight at 37°C with trypsin (Promega). Digested peptides were extracted and desalted using ZipTip μC18 tips (Merck Millipore, Darmstadt, Germany). Peptide lysates were re‐suspended in 15 μl 0.1% formic acid and analysed by nanoliquid chromatography mass spectrometry (UltiMate 3000 RSLCnano, Dionex, Thermo Fisher Scientific). Mass spectrometric analyses of eluted peptide lysates were performed on a Q Exactive HF mass spectrometer (Thermo Fisher Scientific) coupled with a TriVersa NanoMate (Advion, Ltd., Harlow, UK). Peptide lysates were injected on a trapping column (Acclaim PepMap 100 C18, 3 μm, nanoViper, 75 μm × 2 cm, Thermo Fisher Scientific) with 5 μl/min by using 98% water/2% ACN 0.5% trifluoroacetic acid and separated on an analytical column (Acclaim PepMap 100 C18, 3 μm, nanoViper, 75 μm × 25 cm, Thermo Fisher Scientific) with a flow rate of 300 nl/min. Mobile phase was 0.1% formic acid in water (A) and 80% ACN/0.08% formic acid in water (B). Full MS spectra (350–1550 m/z) were acquired in the Orbitrap at a resolution of 120,000 with automatic gain control (AGC) target value of 3 × 106 ions.
Data resulting from LC–MS/MS measurements were analysed with the Proteome Discoverer v2.4 (Thermo Fischer Scientific) using SEQUEST HT. Protein identification was performed using a custom protein database, which consisted of proteins identified by Prodigal v2.6.3 (Hyatt et al., 2010) in all metagenomic assemblies and MAGs from the three I. basta individuals from this study and the one from Moeller et al. (2019). All protein sequences were functionally annotated with the KO and PFAM databases using EnrichM v0.6.3 (https://github.com/geronimp/enrichM). To avoid arbitrary mapping, only non‐redundant protein sequences were used for protein identification (clustered at an identity threshold of 100%). Searches were conducted with the following parameters: Trypsin as enzyme specificity and two missed cleavages allowed. A peptide ion tolerance of 10 ppm and an MS/MS tolerance of 0.02 Da were used. As modifications, oxidation (methionine) and carbamidomethylation (cysteine) were selected. Peptides that scored a q‐value >1% based on a decoy database and with a peptide rank of 1, were considered identified. Redundant proteins were grouped in protein groups by applying the strict parsimony principle. Only the protein groups that explain at least one unique identified peptide were reported through the Top3 approach implemented in the Proteome Discoverer v2.4.
This protein dataset was supplemented with previously recovered metaproteomics data from the I. basta individual sampled in Moeller et al. (2019). To this end, all 49 raw protein spectra from Moeller et al. (2019) (https://www.ebi.ac.uk/pride/archive/projects/PXD012484) were searched against the custom I. basta protein database using the same parameters as described above. However, as the MS‐analysis in Moeller et al. (2019) was performed on a LTQ Velos Orbitrap and MS/MS spectra were measured in ‘unit resolution’‐mode, protein reports were further filtered so that each listed protein had at least two peptides to prevent the identification of a protein based on one peptide.
RESULTS AND DISCUSSION
General MAG statistics and selection
In total, 43 microbial MAGs were recovered from four I. basta individuals, of which 23 originated from the three I. basta individuals collected in this study and 20 from an individual sampled in 2011 (Moeller et al., 2019). The recovered MAGs had an average completeness of 86% ± 13%, contamination of 1% ± 1%, and GC content of 61% ± 3% (Table S1). In contrast to the low GC percentage typically observed in seawater microbes (Giovannoni et al., 2005), a relatively high GC content (i.e. >50%) is commonly observed in sponge symbiont metagenomes (Horn et al., 2016; Moeller et al., 2019). This could be driven by a high environmental availability of nitrogen (Luo et al., 2015) derived from the concentration of nutrients (DOM and DON) and the release of taurine by the sponge host (Moeller et al., 2022; Rix et al., 2020; Robbins et al., 2021). De‐replication of the MAGs at 95% identity to remove duplicate MAGs between I. basta individuals reduced the total number to 10, including one archaeal MAG from the phylum Thaumarchaeota representing the previously characterized ‘Candidatus Nitrosospongia ianthellae' (Moeller et al., 2019) (Thermoproteota as per GTDB Release 202) and nine bacterial MAGs, belonging to the phyla Proteobacteria (classes Alpha‐ and Gammaproteobacteria; three MAGs and one MAG, respectively, the gammaproteobacterial MAG represents the previously analysed ‘Candidatus Taurinisymbion ianthellae’; Moeller et al., 2022), Planctomycetota (three MAGs), Cyanobacteria (1), and Acidobacteriota (1). All genomes of the three dominant symbionts recovered from the four I. basta individuals had an ANI between 96.1% and 99.9%. Analysis of the relative abundance of the MAGs in each I. basta individual showed that MAGs belonging to the phyla Acidobacteriota, Cyanobacteria, and Planctomycetetota (two MAGs) were absent in half of the individuals. Thus, these MAGs were omitted from further analyses. The resulting six MAGs that were consistently (i.e. in ≥3 of the 4 tested individuals) present across I. basta individuals had a relative abundance between 0.2% ± 0.6% and 51% ± 11% (Figure 1). Based on the presence of single‐copy marker genes identified by SingleM in both the raw reads and MAGs, these six symbionts represented 92% of the sponge's microbial community, allowing for the metabolic reconstruction of all major members of the I. basta microbiome (i.e. the near complete microbiome). The remaining 8% of the detected microbial community likely represent food bacteria or very low abundance symbionts.
Dominant symbionts fall within sponge‐specific symbiont clades
To determine whether metagenomes recovered in this study were representative of the dominant symbionts found in all morphotypes of I. basta, we blasted previously (purple, thick morphotype) and newly (yellow, thin and thick morphotype) recovered 16S rRNA gene amplicon sequences of the three dominant symbionts to the genomes from this study (yellow, thin morphotype). All 16S rRNA gene sequences mapped with ≥99.2% identity to their respective genome, showing that highly similar symbiont communities are found in geographically and morphologically different I. basta individuals (Table S2). To then determine the phylogenetic placement and closest relatives of I. basta's dominant symbionts, three phylogenomic trees were inferred for the Thaumarchaeotum ‘Candidatus Nitrosospongia ianthellae’ (Figure S1), the Gammaproteobacterium ‘Candidatus Taurinisymbion ianthellae’ (Figure S2), and the abundant Alphaproteobacterium (Figure S3). All three dominant symbionts clustered with previously recovered microbial MAGs from sponges, allowing for the extrapolation of our findings to other sponge species and supporting the wider applicability of I. basta as a model species for symbiosis research. The Thaumarchaeotum ('Ca. Nitrosospongia ianthellae'; Moeller et al., 2019), which was the most abundant symbiont in I. basta (Figure 1), was most closely related to MAGs from the genus Nitrosopumilus, associated with the sponge Hexadella dedritifera (Zhang et al., 2019) (64.00% average amino acid identity [AAI] to GB_GCA_003724325.1; Figure S1), which, like I. basta, is a member of the sponge family Ianthellidae. 'Ca. Nitrosospongia ianthellae' also clustered with Nitrosopumilus‐related MAGs from the marine demosponge Coelocarteria singaporensis (Botté et al., 2019). Ianthella basta's dominant Gammaproteobacterium (‘Ca. Taurinisymbion ianthellae’; Moeller et al., 2022) was most closely related to two MAGs from the LS‐SOB family which were recovered from the sponge H. dedritifera (79.83% AAI to Hexadella_Gamma_MAG_1 and 77.48% AAI to Hexadella_Gamma_MAG_2, see Supplementary Note 1). Together, these three genomes formed a sister group to one LS‐SOB MAG from the sponge Ircinia ramosa (Engelberts et al., 2020), which in turn formed a sister group to six other LS‐SOB MAGs from the sponges Coscinoderma matthewsi (Glasl et al., 2020), Lophophysema eversa (Tian et al., 2016), Mycale hentscheli (Rust et al., 2020; Storey et al., 2020), and Ircinia ramosa (Engelberts et al., 2020) (Figure S2). The Alphaproteobacterium (o_JABSOH01) had the lowest resolved taxonomy of the three dominant symbionts and was most closely related to an Alphaproteobacterial MAG recovered from H. dedritifera (71.68% AAI), which together formed a sister group to six genomes found in the deep‐sea sponge Vazella pourtalesii (Bayer et al., 2020) (Figure S3) belonging to the same order. Compared to the six Alphaproteobacterial MAGs from the deep‐sea sponge Vazella pourtalesii, the Alphaproteobacterial MAG in I. basta had highest AAI of 43.87% with GCA_014238935.1 (Figure S3), which was well below an AAI of 60%–80% that is typical for MAGs grouped at the same genus (Konstantinidis et al., 2017). Thus, we propose that this symbiont represents a new species within a new genus and tentatively named it ‘Ca. Luteria ianthellae’. The species name ianthellae refers to the host of this microorganism, the sponge Ianthella basta. The genus is named after Dr. Heidi Luter, who performed the foundational work on I. basta.
All three dominant symbionts of I. basta were most closely related to microbial symbionts of H. dedritifera. Both sponge species are members of the Ianthellidae family, suggesting that the three main symbionts were acquired by a common ancestor of these two sponge genera. Future research should focus on characterizing the microbial communities of other members of this sponge family to better understand the evolutionary history of the Ianthellidae microbiome.
General overview of metaproteomics data
In total, 11,543 expressed protein sequences (8675 non‐redundant) were recovered from the four I. basta individuals, (where three were sequenced in this study and one in the study by Moeller et al., 2019), of which 6480 could be functionally annotated, either by the (KEGG) Orthologies (KOs) or PFAM database (Table S3). A total of 1700 proteins were captured in the genomes of I. basta's six symbionts of which 1134 were functionally annotated: 631 (37%) protein sequences were assigned to the Gammaproteobacterium, 496 (29%) to the dominant Alphaproteobacterium, 354 (21%) to the Thaumarchaeotum, 124 (7%) to Ruegeria (Alphaproteobacterium), 88 (5%) to UBA1268 (Planctomycetota), and 7 (0.4%) to UBA2767 (Alphaproteobacterium; Table S3).
Metaproteogenomic analysis of Ianthella basta's microbiome
A metaproteogenomic analysis of I. basta's complete microbiome provided unique insights into the distribution of functions across the six symbionts (Figure 2, Tables S3 and S4). For example, microbial oxidation of ammonia can prevent its accumulation to toxic levels (Zhang et al., 2014), benefitting the sponge host. Functional analysis of the gene responsible for this step, ammonia monooxygenases (amoA), revealed that only Thaumarchaeota are capable of ammonia oxidation in sponges (Robbins et al., 2021), which was confirmed in I. basta with isotope‐based functional assays (Moeller et al., 2019), and further corroborated here (Figure 2). Interestingly, I. basta harbours no nitrite oxidizer and thus performs as a holobiont incomplete nitrification of ammonia to nitrite (Moeller et al., 2019). In addition, we show that low abundance microbes have the genomic potential to reduce nitrite and nitric oxide produced by the oxidation of ammonia. For example, one of the two low abundant Alphaproteobacteria (g_Ruegeria) encodes genes for the reduction of nitrite to dinitrogen through nitric and nitrous oxide (i.e. denitrification, catalysed by nirS, norBC, and nosZ), which could potentially release nitrogen from the system (Figure 2, Table S4). Notably, the first two steps of denitrification can also be performed by the Gammaproteobacterium (‘Ca. Taurinisymbion ianthellae’) through NirK and qNOR, of which the latter was further encoded in 'Ca. Luteria ianthellae'. Alternatively, nitric oxide could be detoxified to nitrate via the other low abundance alphaproteobacterium (g_UBA2767), catalysed by nitric oxide dioxygenase (Hmp; Figure 2, Table S4). Interestingly, denitrification and nitrate production were previously suggested to occur in I. basta based on an isotope‐based functional assays (Moeller et al., 2019), but the identity of the organisms responsible could not be confirmed. Based on our results, we suggest that the low abundance Alphaproteobacteria from the genera Ruegeria and UBA2767 are responsible for these key processes of removing and transforming nitrogen compounds from the sponge host.
FIGURE 2.
Metabolic reconstruction of I. basta's microbiome and virome. Arrowheads point to the hypothesized direction of transfer. Genes expressed in the metaproteomics data are indicated by a black dot. For secondary metabolite production and vitamin and amino acid biosynthesis, proteins were either not detected (secondary metabolite production) or detected for <60% of the genes in the metabolic pathway (vitamin and amino acid biosynthesis). The legend displays the proposed species name (for the dominant symbionts, Moeller et al., 2019, 2022) or the lowest resolved taxonomy as per GTDB taxonomy Release 202, with the three dominant symbionts highlighted by a star. HP‐HB cycle, hydroxypropionate‐hydroxybutyrate cycle; NPRS, non‐ribosomal peptide synthetase; PKS, polyketide synthase; RiPP, ribosomally synthesized and post‐translationally modified peptides; RRE, RiPP recognition element; TPR, tetratricopeptide repeats
Taurine is a naturally occurring compound in marine sponges, with concentrations up to 6 μmol/g wet weight measured in I. basta (Moeller et al., 2022). It has been shown that the gammaproteobacterium ‘Candidatus Taurinisymbion ianthellae’ has the metabolic potential to use taurine for energy conservation coupled with the release of sulfate and ammonia (Moeller et al., 2022). Interestingly, not only the abundant symbiont ‘Ca. Taurinisymbion ianthellae’ but also the low abundance Alphaproteobacterium (g_UBA2767) is well equipped for the use of taurine. The latter symbiont encodes the taurine transporter (tauABC) as well as genes for two taurine degradation pathways to convert taurine to sulfite (either via taurine dioxygenase [tauD] or taurine dehydrogenase [of which the large subunit TauY is encoded, but not the small subunit TauX] and sulfoacetaldehyde acetyltransferase [xsc]). This symbiont further encodes genes to subsequently convert sulfite into adenylyl sulfate (APS), via adenylylsulfate reductase (aprAB), and to reduce APS into 3′‐phosphoadenylyl sulfate (PAPS) via the bifunctional enzyme CysN/CysC (cysNC; Figure 2, Table S4). AprAB can further shuttle electrons from the oxidation of sulfite to the quinone pool. PAPS is an active sulfate donor, which can be used by microorganisms to synthesize sulfated compounds. PAPS can also be converted into sulfide via sulfite and incorporated into amino acids, such as cysteine. These reactions are catalysed by phosphoadenosine phosphosulfate reductase (cysH) and sulfite reductase (sir), respectively, of which the latter was found in the genomes of the Planctomycetota (g_UBA1268) and 'Ca. Nitrosospongia ianthellae' (Figure 2). Alternatively, APS can be converted to sulfate via sulfate adenylyltransferase (sat), generating ATP, thus allowing the low abundance Alphaproteobacterium (g_UBA2767) to use taurine for energy conservation. Moreover, two of I. basta's symbionts, the Alphaproteobacteria (g_UBA2767 and g_Ruegeria), are postulated to import sulfate from the surrounding seawater via sulfate permease (sulP). This environmentally obtained sulfate could also be reduced to PAPS for assimilation (Figure 2). However, it must be noted that sulfate permease can also export sulfate generated from taurine, which has been previously proposed to occur in 'Ca. Taurinisymbion ianthellae' (Moeller et al., 2022). Overall, these findings suggest that sulfur metabolism in I. basta's microbiome is predominantly geared towards gaining energy for the Alphaproteobacterium (g_UBA2767) and 'Ca. Taurinisymbion ianthellae' and towards gaining cellular sulfur for the other symbionts. Notably, the Alphaproteobacterium (g_UBA2767) is predicted to also be able to obtain energy through the oxidation of thiosulfate to sulfate through sulfur‐oxidizing proteins (soxAB, soxXYZ), called the SOX complex (Tian et al., 2014). This further indicates the importance of the Alphaproteobacterium (g_UBA2767) in the sulfur metabolism of I. basta, which was initially thought to be solely carried out by the dominant Gammaproteobacterium, which can also convert DMSP (Moeller et al., 2022).
The third low abundance symbiont, the Planctomycetota (g_UBA1268), is not predicted to be a major player in nitrogen and sulfur metabolism, but it plays an important role in catabolizing glycoconjugates and carbohydrates via glycosyl hydrolases (GHs) and carbohydrate esterases (CEs; Figure 2). Marine sponges continuously (re)cycle dissolved organic matter (DOM), a rich source of complex sugars, and a recent study showed that sponge symbionts are actively involved in utilizing this DOM (Campana et al., 2021). Similarly, free‐living Planctomycetota play an important role in metabolizing complex carbohydrates (Martinez‐Garcia et al., 2012; Orellana et al., 2022; Sichert et al., 2020) Here, we found that I. basta's Planctomycetota (g_UBA1268) encoded nine glycosyl hydrolases, acting amongst others on fucose (GH29), chitin (GH18), xylan (GH10), and sialic acids (GH33; Figure 2), compounds often found in marine algae or the sponge itself (Hsieh & Harris, 2019; Kappelmann et al., 2019; Morya et al., 2012; Mutsenko et al., 2017). These nine GHs were absent from all other symbionts, highlighting the unique role of the Planctomycetota (g_UBA1268) in breaking down a wide range of complex carbohydrates. Interestingly, the three Alphaproteobacterial symbionts and 'Ca. Taurinisymbion ianthellae' could import sialic acid through the sialic acid tripartite ATP‐independent periplasmic transporter (SiaPQM). GH29 and GH33 were also expressed in the metaproteome of I. basta's microbiome and have been identified previously as important hydrolase classes for many sponge symbionts and were often present in other sponge‐associated Planctomycetota (Robbins et al., 2021). The carbohydrate esterases that were identified in the genomes of I. basta's symbionts predominantly targeted xylan (CE1) and glycosamine (CE11), a structural component of sponges (Fernandez‐Busquets & Burger, 2003; Kamke et al., 2013), while CE10, an arylesterase, was further encoded and expressed in the proteome of the Planctomycetota (g_UBA1268; Figure 2, Table S3). Arylesterases are involved in detoxifying reactive oxygen species, acting as antioxidants in humans (Asare et al., 2018), and could potentially play similar roles in marine sponges. No GHs and CEs were identified in the Thaumarchaeotum, which was consistent with previous broad‐scale analyses on sponge associated Thaumarchaeota (Robbins et al., 2021). Notably, Thaumarchaeota are predominantly autotrophic and primarily obtain their carbon from fixation via the 3‐hydroxypropionate/4‐hydroxybutyrate cycle (Figure 2) (Burgsdorf et al., 2021; Engelberts et al., 2020; Moeller et al., 2019; Offre et al., 2013; Robbins et al., 2021; Spang et al., 2012), which likely omits the need for GHs and CEs to break down complex sugars. Furthermore, ammonia‐oxidizing archaea do not have peptidoglycan cell walls, but an S‐layer and as such do not require enzymes for the modification of peptidoglycans during growth.
In addition to its role in catabolizing glycoconjugates and carbohydrates, the Planctomycetota (g_UBA1268) also encoded a polyphosphate kinase (ppk). Polyphosphate kinase catalyses the reversible transfer of the terminal phosphate of ATP to form a long‐chain polyphosphate (polyP), which can contribute to the phosphorus sequestration in Ianthella basta (Zhang et al., 2015). This gene was further found in the Gammaproteobacterial symbiont and the two low abundance Alphaproteobacteria (g_Ruegeria and g_UBA2767) and expressed in the metaproteome of I. basta.
Ianthella basta's microbiome is characterized by syntrophy
Sponge symbionts do not only interact with their host, but they also have the potential to engage in complex interactions with each other. Microbes that are unable to synthesize a particular compound themselves (auxotrophy) have to live off products excreted by other species in the community (syntrophy) in order to persist in the environment. For example, in I. basta, all symbionts, apart from the Alphaproteobacterium (g_UBA2767), were auxotrophic for at least one of the six essential B‐vitamins, that is, thiamine (vitamin B1), riboflavin (B2), pantothenate (B5), pyridoxine (B6), biotin (B7), or cobalamin (B12). B‐vitamins are essential for the central metabolism of microorganisms (Combs Jr & McClung, 2016), including the biosynthesis of branched amino acids and methionine, and microbes that cannot synthesize these vitamins themselves must acquire them from the environment. Thiamine for example, could only be biosynthesized by the Alphaproteobacterium (g_UBA2767) and Thaumarchaeotum (Figure 2). The other symbionts could import thiamine through the thiamine transporter (tbpA, thiPQ), which was encoded and expressed in the proteomes of the three dominant symbionts and encoded in Ruegeria. All of the other essential B‐vitamins (i.e. B2, B5, B6, B7, and B12) could be biosynthesized by at least four of I. basta's symbionts, leaving up to two symbionts auxotrophic to the respective vitamin (Figure 2). For example, genes necessary for cobalamin production were identified in the three alphaproteobacterial symbionts and the Thaumarchaeotum (Figure 2), which was further supported by their unique ability to produce heme, a co‐factor necessary for cobalamin production. In contrast, the Gammaproteobacterium and Planctomycetota (g_UBA1268), cobalamin auxotrophs, require an external source of cobalamin to persist in the environment (Figure 2).
Microbial mobility and attachment
Symbionts must be able to avoid phagocytosis by host cells to maintain stable symbioses and it has been hypothesized that eukaryotic‐like proteins (ELPs), such as ankyrin (ARP), WD40, NHL, leucine‐rich (LRR), tetratricopeptide (TPR), and HEAT repeat families play an important role in this process (Jahn et al., 2019; Nguyen et al., 2014; Reynolds & Thomas, 2016). For example, ankyrin‐domain‐containing‐proteins encoded by phages could modulate the immune response of a eukaryotic host against bacteria (Jahn et al., 2019). All six symbionts in I. basta encoded at least one type of ELP and all ELPs were expressed in I. basta's microbiome (Tables S3 and S4). However, the distribution of ELPs varied between symbionts, where the Alphaproteobacterium (g_UBA2767) encoded ARP, WD40, LRR, NHL, TPR, and HEAT repeat families (50 copies in total), displaying the most diverse array of ELPs, the dominant Alphaproteobacterium ('Ca. Luteria ianthellae') only encoded TPR repeat families (19 copies). The factors driving these differences and whether each ELP serves a different function in maintaining stable symbiosis remains to be elucidated.
Persistence of microbes within the sponge can also be facilitated by direct attachment to host tissue or by attachment to other microbes that are attached to the host tissue. Proteins that mediate cell–cell adhesion and biofilm formation, such as cadherin domains and fibronectin‐binding proteins, were found to be enriched in sponge microbiomes (Robbins et al., 2021) and may play similarly important roles in maintaining symbiosis in I. basta. Cadherin domains can bind directly to cell surfaces or to cellulose, xylan, and related compounds, mediating not only attachment to the host tissue, but also the degradation of complex carbohydrates by microbes (Fraiberg et al., 2010; Fraiberg et al., 2011). Fibronectin‐binding proteins are proteins that bind to fibronectin III domains, which can be found in both eukaryotes and prokaryotes (Hymes & Klaenhammer, 2016; Schwarz‐Linek et al., 2003). Cadherin domains were encoded in the Planctomycetota (g_UBA1268) and Alphaproteobacterium (g_UBA2767) and encoded and expressed in the Gammaproteobacterium proteome (Figure 2, Tables S3 and S4). Interestingly, these three genomes also contained genes for GH10 and CE1 that degrade xylan, a process that could be facilitated by attachment to the carbohydrate itself through cadherin domains. Genes for fibronectin‐binding proteins were identified in the Planctomycetota (g_UBA1268), the Gammaproteobacterium, and the Thaumarchaeotum. Fibronectin III domains were encoded in the Planctomycetota (g_UBA1268) and encoded and expressed in the Gammaproteobacterium, allowing for biofilm formation. No fibronectin‐binding proteins were found in the two low abundance Alphaproteobacteria (g_UBA2767 and g_Ruegeria). Interestingly, these microbes encoded a flagellum and expressed the flagellar protein (fliL; Figure 2, Tables S3 and S4), suggesting these symbionts are motile. In addition to motility, flagella can support adhesion (Kimkes & Heinemann, 2020), where fliL could play an important role in sensing the presence of a surface (Lee & Belas, 2015). Thus, the two low abundance Alphaproteobacteria may use their flagella as an alternative mechanism to maintain stable symbiosis.
The dominant Alphaproteobacterium ('Ca. Luteria ianthellae') lacked attachment proteins, suggesting it uses alternative mechanisms to persist within the sponge tissue. This genome instead encoded and expressed secretion system III (Figure S4, Tables S3 and S4), which can be used by microorganisms to attach to their host and inject effector proteins to modulate its metabolism (Costa et al., 2015). These proteins often contain signal peptides that mark them for secretion. In the dominant Alphaproteobacterium, signal peptides were identified on TPR repeat proteins and ubiquitin‐like proteins (i.e. proteins that can modify other proteins after translation). As TPRs and ubiquitin‐like proteins have previously been linked to symbiont establishment within the host (Robbins et al., 2021; Thomas et al., 2010; Zhou & Zhu, 2015), injection of these proteins could aid the persistence of the Alphaproteobacterium within the sponge tissue by altering I. basta's metabolism (Figure S4). Additionally, secretion system III has been found to facilitate direct attachment of pathogenic bacteria to host tissue through the insertion of the so‐called virulence factor translocated intimin receptor (Tir) into the host membrane (Franzin & Sircili, 2015; Kenny et al., 1997). Once Tir is injected into the host's membrane through secretion system III, it can bind to an inverse autotransporter located in the membrane of the bacterium (Kenny et al., 1997). The dominant Alphaproteobacterium also encoded and expressed an inverse autotransporter, which would allow the symbiont to directly anchor itself to the host tissue if Tir is translocated by secretion system III (Figure S4).
To investigate whether the ability to attach to the host tissue through Tir and the inverse autotransporter was present in other sponge symbionts, we searched all available sponge symbiont genomes for secretion system III and the inverse autotransporter. Both proteins were found in three Alphaproteobacterial MAGs (GCA_014239005.1, SAMN15855071, and SAMN15855069) (Bayer et al., 2020; Robbins et al., 2021), all of which one belonged to the same order as 'Ca. Luteria ianthellae' (o_JABSOH01), and in two MAGs belonging to the Endozoicomonadaceae family (GCA_002238585.1 and SAMN15854979) (Robbins et al., 2021; Slaby et al., 2017), suggesting this mechanism aids attachment in a variety of sponge symbionts. The ability to anchor to host tissue would indicate a tight symbiosis and consistent with such a highly dependent lifestyle, the dominant Alphaproteobacterium in I. basta lacked a multitude of metabolic pathways, such as the pentose phosphate cycle, vitamin production (apart from cobalamin), fatty acid B‐oxidation, aromatic ring degradation, nitrogen metabolism, and dTDP‐l‐rhamnose biosynthesis (Figure S4). It further encoded and expressed a wide range of transporters used to import essential nutrients, including amino acid transporters and sugar transporters (Figure S4). Lastly, 'Ca. Luteria ianthellae' encoded several genes involved in DMSP metabolism, which can be a source of reduced carbon and sulfur for marine bacteria. DMSP metabolism was also identified in ‘Ca. Taurinisymbion ianthellae’ (Moeller et al., 2022), suggesting that DMSP, in addition to taurine, may be an important substrate for I. basta symbionts.
Low abundance microbes are sources of secondary metabolite clusters
Marine sponges are rich sources of bioactive and economically important compounds that are frequently produced by their microbial symbionts and potentially aid in the chemical defence of the holobiont (Rust et al., 2020; Tianero et al., 2019; Wilson et al., 2014). I. basta is known to produce bastadins (Kazlauskas et al., 1981; Kunze et al., 2013; Pordesimo & Schmitz, 1990), a biotechnologically relevant secondary metabolite that can act as an antifouling agent and has cytotoxic activity against human tumour cells (Bayer et al., 2011; Greve et al., 2008). Here, we searched the genomes of the six symbionts of I. basta for the ability to produce bioactive compounds and found a total of 22 biosynthetic gene clusters (BGCs), of which 21 were encoded in genomes from the low abundance members of the community. (Figure 2, Table S5). Notably, genome size correlated with the number of encoded BGCs, with Ruegeria having the largest genome and most BGCs (4.33 kb, 13 BGCs), followed by UBA2767 (4.30 kb, 4 BGCs), UBA1267 (3.26 kb, 4 BGCs), 'Ca. Taurinisymbion ianthellae' (2.08 kb, no BGCs), 'Ca. Luteria ianthellae' (2.01 kb, no BGCs), and 'Ca. Nitrosospongia ianthellae' (1.78 kb, 1 BGC). The most abundant BGCs were non‐ribosomal peptide synthases (NRPS/NRPS‐like, 6 clusters), hserlactones (4 clusters), and type I and III polyketide synthases (T1PKS and T3PKS, 3 clusters). Only five BGCs showed similarity to previously identified clusters, frequently with similarity scores below 30%, highlighting the novelty of the BGCs in I. basta (Table S5). One BGC, an NRPS‐like cluster encoded in the genome of the Planctomycetota (g_UBA1267), showed 100% similarity to a previously identified BGC thought to produce 1‐heptadecene. Interestingly, 1‐n‐heptadecane can be used by denitrifying bacteria to grow anaerobically (Spormann & Widdel, 2000), but whether this is the case for Planctomycetota or whether 1‐heptadecene is used by the other symbionts as an energy source remains to be determined. Although the novelty of I. basta's BGCs precluded further identification of the metabolites produced, the identification of BGCs is the first step towards the discovery of new bioactive compounds. Thus, we suggest future research focus on identifying the activity of these BGCs.
Ianthella basta's virome extends the metabolic capability of the holobiont
Marine sponges host distinct viral communities that shape their microbiome, alter host–microbe interactions (Jahn et al., 2019; Pascelli et al., 2020), and directly contribute to their host's (eukaryotic or prokaryotic) metabolism through auxiliary metabolic genes (AMGs). Viruses can further mediate horizontal gene transfer between members of the holobiont. Ankyrin repeat proteins, for example, were suggested to be horizontally transferred by viruses as these proteins are widespread in sponge symbionts from different sponge species (Reynolds & Thomas, 2016) and enriched in the sponge virome (Pascelli et al., 2020). Similarly, any of the previously described functions/proteins in I. basta's microbiome can theoretically be horizontally transferred by viruses or have a viral origin.
To identify putative viral sequences in I. basta, VirSorter was run on all metagenomic assemblies. A total of 276 viral contigs were identified, of which 250 were phage and 25 prophage (Table S6). Of the 250 contigs that were assigned as phage, 25 were within category 1 (‘most confident’ predictions), 157 in Category 2 (‘likely’ predictions), and 68 in Category 3 (‘possible’ predictions). For the prophages, 4 were in Category 2 and 21 in Category 3 (Table S6). Of all prophages, nine were captured in bacterial MAGs and one in the Thaumarchaeotal MAG, which encoded a Type IV toxin–antitoxin system. Interestingly, toxin–antitoxin systems were previously found to be enriched in sponge associated Thaumarchaeota (Haber et al., 2021; Moeller et al., 2019). Taxonomy was subsequently assigned to all viral contigs by blasting them against the viral RefSeq Nr database and their viral origin and taxonomy was further cross‐validated by blasting them against previously recovered viral sequences from I. basta. The most diverse viral taxa in I. basta were dsDNA viruses from the order Caudovirales, including the families Myoviridae, Podoviridae, and Siphoviridae and ssDNA viruses from the order Microviridae (Table S6). Interestingly, we identified a Lavidaviridae in I. basta's virome, which are known virophages that can coinfect eukaryotes with giant dsDNA viruses of the family Mimiviridae (Fischer, 2020). Although no contigs were assigned to the family Mimiviridae in this study, Mimiviridae were previously identified in I. basta and suggested to infect sponge amoeba‐like cells (Pascelli et al., 2020).
Viral contigs were functionally annotated to investigate the role of the virome in the overall metabolic capability of I. basta. A total of 854 genes were identified across all viral contigs, of which 89% could be functionally annotated (Table S6). Most annotated genes were replication related genes, such as transposable elements and viral DNA replication genes (Table S6). However, putative auxiliary metabolic genes (AMGs) were also detected (Figure 2, Table S6). For example, eukaryotic‐like proteins, such as TPR repeat proteins and WD40 repeat proteins, were found on seven viral contigs, of which two were captured in the MAGs of the Thaumarchaeotum (prophage, Category 3) and Alphaproteobacterium (g_Ruegeria; prophage, Category 2). Interestingly, TPR repeat proteins were previously suggested to be laterally transferred between sponge symbionts (Robbins et al., 2021), which could be mediated by viruses. Similarly, attachment proteins, including cadherins, are thought to be laterally transferred between sponge symbionts and were identified on the viral contigs in I. basta (Figure 2, Table S6). I. basta's virome also encoded various transporters, such as the spermidine/putrescine transport system (prophage Category 2, binned in the Alphaproteobacterium from the genus Ruegeria). Spermidine and putrescine are polyamines that play essential roles in cell differentiation and proliferation. Thus, the virus‐encoded spermidine/putrescine transport system could further aid in the overall transport of vital compounds in I. basta's microbiome.
CONCLUSION AND OUTLOOK
To fully understand the functioning of the sponge holobiont and establish a model species for future experiments, we present a metaproteogenomic analysis of >90% of I. basta's microbiome and virome, providing key insights into the unique functional role of each symbiont and their metabolic interplay. We show that low abundance microbes drive fundamentally different processes compared with the dominant symbionts (e.g. complex carbohydrate degradation and vitamin production) and are repositories of key functions involved in nitrogen cycling and secondary metabolite production. We further highlight the auxotrophic nature of sponge symbionts to vitamin production and their dependency on other microbes in the environment. Ianthella basta's symbionts were also predicted to use a variety of mechanisms to avoid host phagocytosis and to attach to the host tissue. These mechanisms, such as the presence of secretion system III, could be further investigated using cryo‐EM. Lastly, we characterized I. basta's virome by identifying putative viral contigs in its microbiome, cross‐validating each sequence against previously recovered viral sequences from the sponge. Functional characterization of I. basta's virome suggested a role for viruses in the lateral gene transfer of ELPs and attachment proteins and their ability to extend the metabolic capability of the holobiont through transporters. Taken together, this metaproteogenomic data provide the metabolic framework to adopt I. basta as a model organism for studying host–microbe interactions, particularly the establishment, development, and maintenance of sponge symbionts.
AUTHOR CONTRIBUTIONS
Pamela Engelberts: Conceptualization (lead); data curation (lead); formal analysis (lead); investigation (lead); methodology (lead); validation (lead); visualization (lead); writing – original draft (lead); writing – review and editing (equal). Steven Robbins: Data curation (supporting); formal analysis (supporting); investigation (supporting); methodology (supporting); supervision (lead); validation (supporting); writing – review and editing (supporting). Craig W Herbold: Formal analysis (equal); investigation (equal); methodology (equal); writing – review and editing (supporting). Florian Moeller: Investigation (supporting); validation (supporting). Nico Jehmlich: Data curation (supporting); formal analysis (supporting); investigation (equal); methodology (supporting); writing – review and editing (supporting). Patrick Laffy: Formal analysis (supporting); investigation (supporting); writing – review and editing (supporting). Michael Wagner: Resources (equal); supervision (supporting); writing – review and editing (equal). Nicole Webster: Resources (equal); supervision (supporting); writing – review and editing (supporting).
CONFLICT OF INTEREST
The authors declare no conflict of interest.
ETHICAL STATEMENT
All prevailing scientific ethical practices have been respected.
Supporting information
Figure S1. Phylogenomic tree based on 564 publicly available Thaumarchaeotal MAGs, showing the placement of the Thaumarchaeotal MAG from I. basta. The outgroup consists of MAGs belonging to the phylum Undinarchaeota. The tree was clustered on orders, only showing the closest relatives to the recovered MAG from this study (displayed in red) and their sponge hosts.
Figure S2. Phylogenomic tree based on 8980 publicly available Gammaproteobacterial MAGs and two Gammaproteobacterial MAGs recovered from Hexadella detritifera in this study, showing the placement of the Gammaproteobacterial MAG from I. basta. The outgroup consists of MAGs belonging to the phylum Chloroflexota. The tree was clustered on orders and families, only showing the closest relatives to the recovered MAG from this study (displayed in red) and their sponge hosts.
Figure S3. Phylogenomic tree based on 7361 publicly available Alphaproteobacterial MAGs and one Alphaproteobacterial MAG recovered from Hexadella detritifera in this study, showing the placement of the Alphaproteobacterial MAG from I. basta. The outgroup consists of MAGs belonging to the phylum Chloroflexota. The tree was clustered on orders, only showing the closest relatives to the recovered MAG from this study (displayed in red) and their sponge hosts.
Figure S4. Metabolic reconstruction of the dominant Alphaproteobacterium in Ianthella basta ('Ca. Luteria ianthellae'). Genes encoded in the genome are displayed in black, genes absent from the genome are red, and genes expressed in the metaproteomics dataset are blue. The Alphaproteobacterium lacks the pentose phosphate pathway, pathways to metabolize vitamins, fatty acid B‐oxidation, aromatic ring degradation, dTDP‐l‐rhamnose biosynthesis, and secondary metabolite clusters. The metabolic reconstruction of the dominant Thaumarchaeotum and Gammaproteobacterium can be found in Moeller et al. (2019) and Moeller et al. (2022), respectively.
Table S1. Taxonomy and statistics of all 43 MAGs retrieved from the four Ianthella basta individuals. The six MAGs used for metabolic reconstruction are highlighted in blue. Taxonomy as per GTDB taxonomy Release 202 is shown.
Table S2. Percentage identity between 16S rRNA gene amplicon sequences of the three dominant symbionts of the yellow (thin and thick, unpublished data) and purple (thick; Engelberts et al., n.d.) morphotypes and the genomes retrieved from this study (yellow, thin).
Table S3. Expressed proteins in the four Ianthella basta individuals. Each protein sequence was given a unique sequence ID (seqID). If the sequence was found to be part of another sequence (i.e. redundant), the ‘unique’ column received a value of ‘0’, with the ‘representative seqID’ column displaying the representative sequence. The columns ‘redundantGeneID’ and ‘uniqueGeneID’ are a numerical index. All seqIDs with the same redundantGeneID come from the same scaffold with the same coordinates and orientation. All SeqIDs with the same ‘uniqueGeneID’ have the exact same peptide sequence predicted regardless of which dataset/scaffold it comes from. Information on the source assembly, whether the protein was captured in a MAG, and the functional of the sequence are also included in the table.
Table S4. KO count for the six MAGs retrieved from the four Ianthella basta individuals used for metabolic reconstruction. MAG IDs consist of the MAG ID from Table S1 combined with the lowest resolved taxonomy as per GTDB taxonomy Release 202.
Table S5. Secondary metabolite clusters identified in each of the six MAGs retrieved from Ianthella basta used for metabolic reconstruction. MAG IDs consist of the MAG ID from Table S1 combined with the lowest resolved taxonomy as per GTDB taxonomy Release 202.
Table S6. Taxonomy and function of all putative viral contigs in the metagenomic assemblies of Ianthella basta. Metagenomic assemblies containing the codes H1, H2, or M1‐M4 originated from the six sequencing runs on Ianthella basta from Moeller et al. (2019). For metagenomic assemblies from this study: IB stands for Ianthella basta followed by the replicate number and whether the individual was deeply or shallowly sequenced. Taxonomy was included for each viral contig based on its match against previously recovered I. basta viral sequences and the viral component of the RefSeq NR database. Further information included in the table is whether the viral contig was binned in one of the symbiont genomes, whether the contig was phage or prophage and to what category it belonged. Viral contigs that contained multiple gene annotations are highlighted in pink.
Supplementary Note 1: Supporting Information
ACKNOWLEDGEMENTS
The authors thank Dr. Kenneth Wasmund, Dr. Bettina Glasl, and Dr. Paul Evans for their assistance in data analysis and interpretation. Michael Wagner thanks the European Research Council (ERC Advanced Grant NITRICARE 294343) and the Austrian Science Fund FWF (Wittgenstein Award Z‐383‐B) for financial support. Open access publishing facilitated by The University of Queensland, as part of the Wiley ‐ The University of Queensland agreement via the Council of Australian University Librarians.
Engelberts, J.P. , Robbins, S.J. , Herbold, C.W. , Moeller, F.U. , Jehmlich, N. , Laffy, P.W. et al. (2023) Metabolic reconstruction of the near complete microbiome of the model sponge Ianthella basta . Environmental Microbiology, 25(3), 646–660. Available from: 10.1111/1462-2920.16302
Funding information European Research Council, Grant/Award Number: 294343; Austrian Science Fund, Grant/Award Number: Z‐383‐B
DATA AVAILABILITY STATEMENT
Raw reads and MAGs are available at NCBI under Bioproject ID PRJNA807825. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE (Perez‐Riverol et al., 2022) partner repository with the dataset identifier PXD032278.
REFERENCES
- Albertsen, M. , Hugenholtz, P. , Skarshewski, A. , Nielsen, K.L. , Tyson, G.W. & Nielsen, P.H. (2013) Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes. Nature Biotechnology, 31(6), 533–538. [DOI] [PubMed] [Google Scholar]
- Almagro Armenteros, J.J. , Tsirigos, K.D. , Sønderby, C.K. , Petersen, T.N. , Winther, O. , Brunak, S. et al. (2019) SignalP 5.0 improves signal peptide predictions using deep neural networks. Nature Biotechnology, 37(4), 420–423. [DOI] [PubMed] [Google Scholar]
- Alneberg, J. , Bjarnason, B.S. , de Bruijn, I. , Schirmer, M. , Quick, J. , Ijaz, U.Z. et al. (2014) Binning metagenomic contigs by coverage and composition. Nature Methods, 11(11), 1144–1146. [DOI] [PubMed] [Google Scholar]
- Asare, G.A. , Andam, S.E. , Asare‐Anane, H. , Ammanquah, S. , Anang‐Quartey, Y. , Afriyie, D.K. et al. (2018) Lipid associated antioxidants: arylesterase and paraoxonase‐1 in benign prostatic hyperplasia treatment‐naïve patients. Prostate International, 6(1), 36–40. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bayer, K. , Busch, K. , Kenchington, E. , Beazley, L. , Franzenburg, S. , Michels, J. et al. (2020) Microbial strategies for survival in the glass sponge Vazella pourtalesii. Msystems, 5(4), e00473–e00420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bayer, M. , Hellio, C. , Maréchal, J.P. , Frank, W. , Lin, W. , Weber, H. et al. (2011) Antifouling bastadin congeners target mussel phenoloxidase and complex copper (II) ions. Marine Biotechnology, 13(6), 1148–1158. [DOI] [PubMed] [Google Scholar]
- Bell, J.J. (2008) The functional roles of marine sponges. Estuarine, Coastal and Shelf Science, 79(3), 341–353. [Google Scholar]
- Blin, K. , Shaw, S. , Steinke, K. , Villebro, R. , Ziemert, N. , Lee, S.Y. et al. (2019) antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline. Nucleic Acids Research, 47(W1), W81–W87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Botté, E. , Nielsen, S. , Abdul Wahab, M.A. , Webster, J. , Robbins, S. , Thomas, T. et al. (2019) Changes in the metabolic potential of the sponge microbiome under ocean acidification. Nature Communications, 10(1), 1–10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burgsdorf, I. , Sizikov, S. , Squatrito, V. , Britstein, M. , Slaby, B.M. , Cerrano, C. et al. (2021) Lineage‐specific energy and carbon metabolism of sponge symbionts and contributions to the host carbon pool. The ISME Journal, 16(4), 1163–1175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Camacho, C. , Coulouris, G. , Avagyan, V. , Ma, N. , Papadopoulos, J. , Bealer, K. et al. (2009) BLAST+: architecture and applications. BMC Bioinformatics, 10(1), 421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Campana, S. , Busch, K. , Hentschel, U. , Muyzer, G. & de Goeij, J.M. (2021) DNA‐stable isotope probing (DNA‐SIP) identifies marine sponge‐associated bacteria actively utilizing dissolved organic matter (DOM). Environmental Microbiology, 23, 4489–4504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chaumeil, P.‐A. , Mussig, A.J. , Hugenholtz, P. & Parks, D.H. (2019) GTDB‐Tk: a toolkit to classify genomes with the genome taxonomy database. Bioinformatics, 36(6), 1925–1927. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Combs, G.F., Jr. & McClung, J.P. (2016) The vitamins: fundamental aspects in nutrition and health, 5th edition. San Diego: Academic Press Inc. [Google Scholar]
- Costa, T.R. , Felisberto‐Rodrigues, C. , Meir, A. , Prevost, M.S. , Redzej, A. , Trokter, M. et al. (2015) Secretion systems in gram‐negative bacteria: structural and mechanistic insights. Nature Reviews. Microbiology, 13(6), 343–359. [DOI] [PubMed] [Google Scholar]
- de Goeij, J.M. , van Oevelen, D. , Vermeij, M.J.A. , Osinga, R. , Middelburg, J.J. , de Goeij, A.F.P.M. et al. (2013) Surviving in a marine desert: the sponge loop retains resources within coral reefs. Science, 342(6154), 108–110. [DOI] [PubMed] [Google Scholar]
- Engelberts, J.P. , Abdul, W.M.A. , Manuel, M. , Laura, R. , Emma, M. , Robbins Steven, J. et al. (2022) Microbes from mum: symbiont transmission in the tropical reef sponge Ianthella basta . ISME Communications, 2(1), 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Engelberts, J.P. , Robbins, S.J. , Damjanovic, K. & Webster, N.S. (2021) Integrating novel tools to elucidate the metabolic basis of microbial symbiosis in reef holobionts. Marine Biology, 168(12), 175. [Google Scholar]
- Engelberts, J.P. , Robbins, S.J. , de Goeij, J.M. , Aranda, M. , Bell, S.C. & Webster, N.S. (2020) Characterization of a sponge microbiome using an integrative genome‐centric approach. The ISME Journal, 15(5), 1100–1110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fernandez‐Busquets, X. & Burger, M. (2003) Circular proteoglycans from sponges: first members of the spongican family. Cellular and Molecular Life Sciences, 60(1), 88–112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fischer, M.G. (2020) The virophage family lavidaviridae. Current Issues in Molecular Biology, 40(1), 1–24. [DOI] [PubMed] [Google Scholar]
- Fraiberg, M. , Borovok, I. , Bayer, E.A. , Weiner, R.M. & Lamed, R. (2011) Cadherin domains in the polysaccharide‐degrading marine bacterium Saccharophagus degradans 2‐40 are carbohydrate‐binding modules. Journal of Bacteriology, 193(1), 283–285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fraiberg, M. , Borovok, I. , Weiner, R.M. & Lamed, R. (2010) Discovery and characterization of cadherin domains in Saccharophagus degradans 2‐40. Journal of Bacteriology, 192(4), 1066–1074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Franzin, F.M. & Sircili, M.P. (2015) Locus of enterocyte effacement: a pathogenicity Island involved in the virulence of enteropathogenic and enterohemorragic Escherichia coli subjected to a complex network of gene regulation. BioMed Research International, 2015, 534738. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Freckelton, M.L. , Luter, H.M. , Andreakis, N. , Webster, N.S. & Motti, C.A. (2012) Qualitative variation in colour morphotypes of Ianthella basta (porifera: Verongida). Hydrobiologia, 687(1), 191–203. [Google Scholar]
- Gauthier, M.‐E.A. , Watson, J.R. & Degnan, S.M. (2016) Draft genomes shed light on the dual bacterial symbiosis that dominates the microbiome of the coral reef sponge Amphimedon queenslandica . Frontiers in Marine Science, 3, 196. [Google Scholar]
- Giovannoni, S.J. , Tripp, H.J. , Givan, S. , Podar, M. , Vergin, K.L. , Baptista, D. et al. (2005) Genome streamlining in a cosmopolitan oceanic bacterium. Science, 309(5738), 1242–1245. [DOI] [PubMed] [Google Scholar]
- Glasl, B. , Robbins, S. , Frade, P.R. , Marangon, E. , Laffy, P.W. , Bourne, D.G. et al. (2020) Comparative genome‐centric analysis reveals seasonal variation in the function of coral reef microbiomes. The ISME Journal, 14(6), 1435–1450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greve, H. , Kehraus, S. , Krick, A. , Kelter, G. , Maier, A. , Fiebig, H.H. et al. (2008) Cytotoxic bastadin 24 from the Australian sponge Ianthella quadrangulata . Journal of Natural Products, 71(3), 309–312. [DOI] [PubMed] [Google Scholar]
- Haber, M. , Burgsdorf, I. , Handley, K.M. , Rubin‐Blum, M. & Steindler, L. (2021) Genomic insights into the lifestyles of Thaumarchaeota inside sponges. Frontiers in Microbiology, 11, 622824. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Horn, H. , Slaby, B.M. , Jahn, M.T. , Bayer, K. , Moitinho‐Silva, L. , Förster, F. et al. (2016) An enrichment of CRISPR and other defense‐related features in marine sponge‐associated microbial metagenomes. Frontiers in Microbiology, 7, 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hsieh, Y.S.Y. & Harris, P.J. (2019) Xylans of red and green algae: what is known about their structures and how they are synthesised? Polymers, 11(2), 354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hudspith, M. , Rix, L. , Achlatis, M. , Bougoure, J. , Guagliardo, P. , Clode, P.L. et al. (2021) Subcellular view of host–microbiome nutrient exchange in sponges: insights into the ecological success of an early metazoan–microbe symbiosis. Microbiome, 9(1), 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hyatt, D. , Chen, G.L. , LoCascio, P.F. , Land, M.L. , Larimer, F.W. & Hauser, L.J. (2010) Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics, 11(1), 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hymes, J.P. & Klaenhammer, T.R. (2016) Stuck in the middle: fibronectin‐binding proteins in gram‐positive bacteria. Frontiers in Microbiology, 7, 1504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jahn, M.T. , Arkhipova, K. , Markert, S.M. , Stigloher, C. , Lachnit, T. , Pita, L. et al. (2019) A phage protein aids bacterial symbionts in eukaryote immune evasion. Cell Host & Microbe, 26(4), 542–550 e5. [DOI] [PubMed] [Google Scholar]
- Jain, C. , Rodriguez‐R, L.M. , Phillippy, A.M. , Konstantinidis, K.T. & Aluru, S. (2018) High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nature Communications, 9(1), 5114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kamke, J. , Sczyrba, A. , Ivanova, N. , Schwientek, P. , Rinke, C. , Mavromatis, K. et al. (2013) Single‐cell genomics reveals complex carbohydrate degradation patterns in poribacterial symbionts of marine sponges. The ISME Journal, 7(12), 2287–2300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kanehisa, M. , Sato, Y. , Kawashima, M. , Furumichi, M. & Tanabe, M. (2015) KEGG as a reference resource for gene and protein annotation. Nucleic Acids Research, 44(D1), D457–D462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kang, D.D. , Froula, J. , Egan, R. & Wang, Z. (2015) MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ, 3, e1165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kang, D.D. , Li, F. , Kirton, E. , Thomas, A. , Egan, R. , An, H. et al. (2019) MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ, 7, e7359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kappelmann, L. , Krüger, K. , Hehemann, J.H. , Harder, J. , Markert, S. , Unfried, F. et al. (2019) Polysaccharide utilization loci of North Sea Flavobacteriia as basis for using SusC/D‐protein expression for predicting major phytoplankton glycans. The ISME Journal, 13(1), 76–91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kazlauskas, R. , Lidgard, R.O. , Murphy, P.T. , Wells, R.J. & Blount, J.F. (1981) Brominated tyrosine‐derived metabolites from the sponge Ianthella basta . Australian Journal of Chemistry, 34(4), 765–786. [Google Scholar]
- Kenny, B. , DeVinney, R. , Stein, M. , Reinscheid, D.J. , Frey, E.A. & Finlay, B.B. (1997) Enteropathogenic E. coli (EPEC) transfers its receptor for intimate adherence into mammalian cells. Cell, 91(4), 511–520. [DOI] [PubMed] [Google Scholar]
- Kimkes, T.E. & Heinemann, M. (2020) How bacteria recognise and respond to surface contact. FEMS Microbiology Reviews, 44(1), 106–122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Konstantinidis, K.T. , Rosselló‐Móra, R. & Amann, R. (2017) Uncultivated microbes in need of their own taxonomy. The ISME Journal, 11(11), 2399–2406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kunze, K. , Niemann, H. , Ueberlein, S. , Schulze, R. , Ehrlich, H. , Brunner, E. et al. (2013) Brominated skeletal components of the marine demosponges, Aplysina cavernicola and Ianthella basta: analytical and biochemical investigations. Marine Drugs, 11(4), 1271–1287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Laffy, P.W. , Wood‐Charlson, E.M. , Turaev, D. , Jutz, S. , Pascelli, C. , Botté, E.S. et al. (2018) Reef invertebrate viromics: diversity, host specificity and functional capacity. Environmental Microbiology, 20(6), 2125–2141. [DOI] [PubMed] [Google Scholar]
- Lee, Y.‐Y. & Belas, R. (2015) Loss of FliL alters Proteus mirabilis surface sensing and temperature‐dependent swarming. Journal of Bacteriology, 197(1), 159–173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Letunic, I. & Bork, P. (2021) Interactive tree of life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Research, 49(W1), W293–W296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luo, H. , Thompson, L.R. , Stingl, U. & Hughes, A.L. (2015) Selection maintains low genomic GC content in marine SAR11 lineages. Molecular Biology and Evolution, 32(10), 2738–2748. [DOI] [PubMed] [Google Scholar]
- Luter, H.M. , Whalan, S. & Webster, N.S. (2010) Exploring the role of microorganisms in the disease‐like syndrome affecting the sponge Ianthella basta . Applied and Environmental Microbiology, 76(17), 5736–5744. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martinez‐Garcia, M. , Brazel, D.M. , Swan, B.K. , Arnosti, C. , Chain, P.S.G. , Reitenga, K.G. et al. (2012) Capturing single cell genomes of active polysaccharide degraders: An unexpected contribution of Verrucomicrobia. PLoS One, 7(4), e35314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matsen, F.A. , Kodner, R.B. & Armbrust, E.V. (2010) Pplacer: linear time maximum‐likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree. BMC Bioinformatics, 11(1), 538. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mistry, J. , Chuguransky, S. , Williams, L. , Qureshi, M. , Salazar, G.A. , Sonnhammer, E.L.L. et al. (2020) Pfam: the protein families database in 2021. Nucleic Acids Research, 49(D1), D412–D419. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moeller, F.U. , Herbold, C.W. , Schintlmeister, A. , Mooshammer, M. , Motti, C. , Behnam, F. et al. (2022) Taurine as a key intermediate for host‐symbiont interaction in the tropical sponge Ianthella basta . bioRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moeller, F.U. , Webster, N.S. , Herbold, C.W. , Behnam, F. , Domman, D. , Albertsen, M. et al. (2019) Characterization of a thaumarchaeal symbiont that drives incomplete nitrification in the tropical sponge Ianthella basta . Environmental Microbiology, 21(10), 527234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moitinho‐Silva, L. , Díez‐Vives, C. , Batani, G. , Esteves, A.I.S. , Jahn, M.T. & Thomas, T. (2017) Integrated metabolism in sponge–microbe symbiosis revealed by genome‐centered metatranscriptomics. The ISME Journal, 11(7), 1651–1666. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morya, V.K. , Kim, J. & Kim, E.‐K. (2012) Algal fucoidan: structural and size‐dependent bioactivities and their perspectives. Applied Microbiology and Biotechnology, 93(1), 71–82. [DOI] [PubMed] [Google Scholar]
- Mutsenko, V.V. , Gryshkov, O. , Lauterboeck, L. , Rogulska, O. , Tarusin, D.N. , Bazhenov, V.V. et al. (2017) Novel chitin scaffolds derived from marine sponge Ianthella basta for tissue engineering approaches based on human mesenchymal stromal cells: biocompatibility and cryopreservation. International Journal of Biological Macromolecules, 104, 1955–1965. [DOI] [PubMed] [Google Scholar]
- Nguyen, M.T. , Liu, M. & Thomas, T. (2014) Ankyrin‐repeat proteins from sponge symbionts modulate amoebal phagocytosis. Molecular Ecology, 23(6), 1635–1645. [DOI] [PubMed] [Google Scholar]
- Nissen, J.N. , Johansen, J. , Allesøe, R.L. , Sønderby, C.K. , Armenteros, J.J.A. , Grønbech, C.H. et al. (2021) Improved metagenome binning and assembly using deep variational autoencoders. Nature Biotechnology, 39(5), 555–560. [DOI] [PubMed] [Google Scholar]
- Nurk, S. , Meleshko, D. , Korobeynikov, A. & Pevzner, P.A. (2017) metaSPAdes: a new versatile metagenomic assembler. Genome Research, 27(5), 824–834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Offre, P. , Spang, A. & Schleper, C. (2013) Archaea in biogeochemical cycles. Annual Review of Microbiology, 67(1), 437–457. [DOI] [PubMed] [Google Scholar]
- Olm, M.R. , Brown, C.T. , Brooks, B. & Banfield, J.F. (2017) dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de‐replication. The ISME Journal, 11(12), 2864–2868. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Orellana, L.H. , Francis, T.B. , Ferraro, M. , Hehemann, J.H. , Fuchs, B.M. & Amann, R.I. (2022) Verrucomicrobiota are specialist consumers of sulfated methyl pentoses during diatom blooms. The ISME Journal, 16(3), 630–641. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Parks, D.H. , Imelfort, M. , Skennerton, C.T. , Hugenholtz, P. & Tyson, G.W. (2015) CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Research, 25(7), 1043–1055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pascelli, C. , Laffy, P.W. , Botté, E. , Kupresanin, M. , Rattei, T. , Lurgi, M. et al. (2020) Viral ecogenomics across the Porifera. Microbiome, 8(1), 144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perez‐Riverol, Y. , Bai, J. , Bandla, C. , García‐Seisdedos, D. , Hewapathirana, S. , Kamatchinathan, S. et al. (2022) The PRIDE database resources in 2022: a hub for mass spectrometry‐based proteomics evidences. Nucleic Acid Research, 50(D1), D543–D552. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pita, L. , Rix, L. , Slaby, B.M. , Franke, A. & Hentschel, U. (2018) The sponge holobiont in a changing ocean: from microbes to ecosystems. Microbiome, 6(1), 46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pordesimo, E.O. & Schmitz, F.J. (1990) New bastadins from the sponge Ianthella basta . The Journal of Organic Chemistry, 55(15), 4704–4709. [Google Scholar]
- Price, M.N. , Dehal, P.S. & Arkin, A.P. (2010) FastTree 2–approximately maximum‐likelihood trees for large alignments. PLoS One, 5(3), e9490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Price, M.N. , Deutschbauer, A.M. & Arkin, A.P. (2020) GapMind: automated annotation of amino acid biosynthesis. mSystems, 5(3), e00291–e00220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- R Core Team . (2013) R: a language and environment for statistical computing .
- Reynolds, D. & Thomas, T. (2016) Evolution and function of eukaryotic‐like proteins from sponge symbionts. Molecular Ecology, 25(20), 5242–5253. [DOI] [PubMed] [Google Scholar]
- Rivett, D.W. & Bell, T. (2018) Abundance determines the functional role of bacterial phylotypes in complex communities. Nature Microbiology, 3(7), 767–772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rix, L. , Ribes, M. , Coma, R. , Jahn, M.T. , de Goeij, J.M. , van Oevelen, D. et al. (2020) Heterotrophy in the earliest gut: a single‐cell view of heterotrophic carbon and nitrogen assimilation in sponge‐microbe symbioses. The ISME Journal, 14(10), 1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robbins, S. , Singleton, C.M. , Chan, C.X. , Messer, L.F. , Geers, A.U. , Ying, H. et al. (2019) A genomic view of the reef‐building coral Porites lutea and its microbial symbionts. Nature Microbiology, 4(12), 2090–2100. [DOI] [PubMed] [Google Scholar]
- Robbins, S. et al. (2021) A genomic view of the microbiome of coral reef demosponges. The ISME Journal, 15(6), 1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roux, S. , Enault, F. , Hurwitz, B.L. & Sullivan, M.B. (2015) VirSorter: mining viral signal from microbial genomic data. PeerJ, 3, e985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rust, M. , Helfrich, E.J.N. , Freeman, M.F. , Nanudorn, P. , Field, C.M. , Rückert, C. et al. (2020) A multiproducer microbiome generates chemical diversity in the marine sponge Mycale hentscheli . Proceedings of the National Academy of Sciences, 117(17), 9508–9518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmitt, S. , Tsai, P. , Bell, J. , Fromont, J. , Ilan, M. , Lindquist, N. et al. (2012) Assessing the complex sponge microbiota: core, variable and species‐specific bacterial communities in marine sponges. The ISME Journal, 6(3), 564–576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schwarz‐Linek, U. , Werner, J.M. , Pickford, A.R. , Gurusiddappa, S. , Kim, J.H. , Pilka, E.S. et al. (2003) Pathogenic bacteria attach to human fibronectin through a tandem β‐zipper. Nature, 423(6936), 177–181. [DOI] [PubMed] [Google Scholar]
- Sichert, A. , Corzett, C.H. , Schechter, M.S. , Unfried, F. , Markert, S. , Becher, D. et al. (2020) Verrucomicrobia use hundreds of enzymes to digest the algal polysaccharide fucoidan. Nature Microbiology, 5(8), 1026–1039. [DOI] [PubMed] [Google Scholar]
- Sieber, C.M.K. , Probst, A.J. , Sharrar, A. , Thomas, B.C. , Hess, M. , Tringe, S.G. et al. (2018) Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy. Nature Microbiology, 3(7), 836–843. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Slaby, B.M. , Hackl, T. , Horn, H. , Bayer, K. & Hentschel, U. (2017) Metagenomic binning of a marine sponge microbiome reveals unity in defense but metabolic specialization. The ISME Journal, 11(11), 2465–2478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spang, A. , Poehlein, A. , Offre, P. , Zumbrägel, S. , Haider, S. , Rychlik, N. et al. (2012) The genome of the ammonia‐oxidizing Candidatus Nitrososphaera gargensis: insights into metabolic versatility and environmental adaptations. Environmental Microbiology, 14(12), 3122–3145. [DOI] [PubMed] [Google Scholar]
- Spormann, A.M. & Widdel, F. (2000) Metabolism of alkylbenzenes, alkanes, and other hydrocarbons in anaerobic bacteria. Biodegradation, 11(2), 85–105. [DOI] [PubMed] [Google Scholar]
- Storey, M.A. , Andreassend, S.K. , Bracegirdle, J. , Brown, A. , Keyzers, R.A. , Ackerley, D.F. et al. (2020) Metagenomic exploration of the marine sponge Mycale hentscheli uncovers multiple polyketide‐producing bacterial symbionts. MBio, 11(2), e02997–e02919. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Taylor, M.W. , Tsai, P. , Simister, R.L. , Deines, P. , Botte, E. , Ericson, G. et al. (2013) ‘Sponge‐specific’ bacteria are widespread (but rare) in diverse marine environments. The ISME Journal, 7(2), 438–443. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thomas, T. , Rusch, D. , DeMaere, M.Z. , Yung, P.Y. , Lewis, M. , Halpern, A. et al. (2010) Functional genomic signatures of sponge bacteria reveal unique and shared features of symbiosis. The ISME Journal, 4(12), 1557–1567. [DOI] [PubMed] [Google Scholar]
- Tian, R.M. , Sun, J. , Cai, L. , Zhang, W.P. , Zhou, G.W. , Qiu, J.W. et al. (2016) The deep‐sea glass sponge Lophophysema eversa harbours potential symbionts responsible for the nutrient conversions of carbon, nitrogen and sulfur. Environmental Microbiology, 18(8), 2481–2494. [DOI] [PubMed] [Google Scholar]
- Tian, R.M. , Wang, Y. , Bougouffa, S. , Gao, Z.M. , Cai, L. , Bajic, V. et al. (2014) Genomic analysis reveals versatile heterotrophic capacity of a potentially symbiotic sulfur‐oxidizing bacterium in sponge. Environmental Microbiology, 16(11), 3548–3561. [DOI] [PubMed] [Google Scholar]
- Tianero, M.D. , Balaich, J.N. & Donia, M.S. (2019) Localized production of defence chemicals by intracellular symbionts of Haliclona sponges. Nature Microbiology, 4(7), 1149–1159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Webster, N.S. , Taylor, M.W. , Behnam, F. , Lücker, S. , Rattei, T. , Whalan, S. et al. (2010) Deep sequencing reveals exceptional diversity and modes of transmission for bacterial sponge symbionts. Environmental Microbiology, 12(8), 2070–2082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weisz, J.B. , Lindquist, N. & Martens, C.S. (2008) Do associated microbial abundances impact marine demosponge pumping rates and tissue densities? Oecologia, 155(2), 367–376. [DOI] [PubMed] [Google Scholar]
- Wilson, M.C. , Mori, T. , Rückert, C. , Uria, A.R. , Helf, M.J. , Takada, K. et al. (2014) An environmental bacterial taxon with a large and distinct metabolic repertoire. Nature, 506(7486), 58–62. [DOI] [PubMed] [Google Scholar]
- Wu, Y.‐W. , Simmons, B.A. & Singer, S.W. (2015) MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics, 32(4), 605–607. [DOI] [PubMed] [Google Scholar]
- Yin, Y. , Mao, X. , Yang, J. , Chen, X. , Mao, F. & Xu, Y. (2012) dbCAN: a web resource for automated carbohydrate‐active enzyme annotation. Nucleic Acids Research, 40(W1), W445–W451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang, F. , Blasiak, L.C. , Karolin, J.O. , Powell, R.J. , Geddes, C.D. & Hill, R.T. (2015) Phosphorus sequestration in the form of polyphosphate by microbial symbionts in marine sponges. Proceedings of the National Academy of Sciences, 112(14), 4381–4386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang, F. , Pita, L. , Erwin, P.M. , Abaid, S. , López‐Legentil, S. & Hill, R.T. (2014) Symbiotic archaea in marine sponges show stability and host specificity in community structure and ammonia oxidation functionality. FEMS Microbiology Ecology, 90(3), 699–707. [DOI] [PubMed] [Google Scholar]
- Zhang, S. , Song, W. , Wemheuer, B. , Reveillaud, J. , Webster, N. & Thomas, T. (2019) Comparative genomics reveals ecological and evolutionary insights into sponge‐associated Thaumarchaeota. Msystems, 4(4), e00288–e00219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou, Y. & Zhu, Y. (2015) Diversity of bacterial manipulation of the host ubiquitin pathways. Cellular Microbiology, 17(1), 26–34. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figure S1. Phylogenomic tree based on 564 publicly available Thaumarchaeotal MAGs, showing the placement of the Thaumarchaeotal MAG from I. basta. The outgroup consists of MAGs belonging to the phylum Undinarchaeota. The tree was clustered on orders, only showing the closest relatives to the recovered MAG from this study (displayed in red) and their sponge hosts.
Figure S2. Phylogenomic tree based on 8980 publicly available Gammaproteobacterial MAGs and two Gammaproteobacterial MAGs recovered from Hexadella detritifera in this study, showing the placement of the Gammaproteobacterial MAG from I. basta. The outgroup consists of MAGs belonging to the phylum Chloroflexota. The tree was clustered on orders and families, only showing the closest relatives to the recovered MAG from this study (displayed in red) and their sponge hosts.
Figure S3. Phylogenomic tree based on 7361 publicly available Alphaproteobacterial MAGs and one Alphaproteobacterial MAG recovered from Hexadella detritifera in this study, showing the placement of the Alphaproteobacterial MAG from I. basta. The outgroup consists of MAGs belonging to the phylum Chloroflexota. The tree was clustered on orders, only showing the closest relatives to the recovered MAG from this study (displayed in red) and their sponge hosts.
Figure S4. Metabolic reconstruction of the dominant Alphaproteobacterium in Ianthella basta ('Ca. Luteria ianthellae'). Genes encoded in the genome are displayed in black, genes absent from the genome are red, and genes expressed in the metaproteomics dataset are blue. The Alphaproteobacterium lacks the pentose phosphate pathway, pathways to metabolize vitamins, fatty acid B‐oxidation, aromatic ring degradation, dTDP‐l‐rhamnose biosynthesis, and secondary metabolite clusters. The metabolic reconstruction of the dominant Thaumarchaeotum and Gammaproteobacterium can be found in Moeller et al. (2019) and Moeller et al. (2022), respectively.
Table S1. Taxonomy and statistics of all 43 MAGs retrieved from the four Ianthella basta individuals. The six MAGs used for metabolic reconstruction are highlighted in blue. Taxonomy as per GTDB taxonomy Release 202 is shown.
Table S2. Percentage identity between 16S rRNA gene amplicon sequences of the three dominant symbionts of the yellow (thin and thick, unpublished data) and purple (thick; Engelberts et al., n.d.) morphotypes and the genomes retrieved from this study (yellow, thin).
Table S3. Expressed proteins in the four Ianthella basta individuals. Each protein sequence was given a unique sequence ID (seqID). If the sequence was found to be part of another sequence (i.e. redundant), the ‘unique’ column received a value of ‘0’, with the ‘representative seqID’ column displaying the representative sequence. The columns ‘redundantGeneID’ and ‘uniqueGeneID’ are a numerical index. All seqIDs with the same redundantGeneID come from the same scaffold with the same coordinates and orientation. All SeqIDs with the same ‘uniqueGeneID’ have the exact same peptide sequence predicted regardless of which dataset/scaffold it comes from. Information on the source assembly, whether the protein was captured in a MAG, and the functional of the sequence are also included in the table.
Table S4. KO count for the six MAGs retrieved from the four Ianthella basta individuals used for metabolic reconstruction. MAG IDs consist of the MAG ID from Table S1 combined with the lowest resolved taxonomy as per GTDB taxonomy Release 202.
Table S5. Secondary metabolite clusters identified in each of the six MAGs retrieved from Ianthella basta used for metabolic reconstruction. MAG IDs consist of the MAG ID from Table S1 combined with the lowest resolved taxonomy as per GTDB taxonomy Release 202.
Table S6. Taxonomy and function of all putative viral contigs in the metagenomic assemblies of Ianthella basta. Metagenomic assemblies containing the codes H1, H2, or M1‐M4 originated from the six sequencing runs on Ianthella basta from Moeller et al. (2019). For metagenomic assemblies from this study: IB stands for Ianthella basta followed by the replicate number and whether the individual was deeply or shallowly sequenced. Taxonomy was included for each viral contig based on its match against previously recovered I. basta viral sequences and the viral component of the RefSeq NR database. Further information included in the table is whether the viral contig was binned in one of the symbiont genomes, whether the contig was phage or prophage and to what category it belonged. Viral contigs that contained multiple gene annotations are highlighted in pink.
Supplementary Note 1: Supporting Information
Data Availability Statement
Raw reads and MAGs are available at NCBI under Bioproject ID PRJNA807825. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE (Perez‐Riverol et al., 2022) partner repository with the dataset identifier PXD032278.