Abstract
In glacier-fed streams, ecological windows of opportunity allow complex microbial biofilms to develop and transiently form the basis of the food web, thereby controlling key ecosystem processes. Using metagenome-assembled genomes, we unravel strategies that allow biofilms to seize this opportunity in an ecosystem otherwise characterized by harsh environmental conditions. We observe a diverse microbiome spanning the entire tree of life including a rich virome. Various co-existing energy acquisition pathways point to diverse niches and the exploitation of available resources, likely fostering the establishment of complex biofilms during windows of opportunity. The wide occurrence of rhodopsins, besides chlorophyll, highlights the role of solar energy capture in these biofilms while internal carbon and nutrient cycling between photoautotrophs and heterotrophs may help overcome constraints imposed by oligotrophy in these habitats. Mechanisms potentially protecting bacteria against low temperatures and high UV-radiation are also revealed and the selective pressure of this environment is further highlighted by a phylogenomic analysis differentiating important components of the glacier-fed stream microbiome from other ecosystems. Our findings reveal key genomic underpinnings of adaptive traits contributing to the success of complex biofilms to exploit environmental opportunities in glacier-fed streams, which are now rapidly changing owing to global warming.
Subject terms: Water microbiology, Freshwater ecology
In glacier-fed streams, ecological windows of opportunity allow complex microbial biofilms to develop and transiently form the basis of the food web. Using metagenomics, this study reveals the metabolic strategies and key genomic underpinnings of adaptive traits that enable these biofilms to exploit environmental opportunities.
Introduction
Ecosystems and their constituent biota are finely tuned to the seasonal variations of their environment. This phenology is particularly pronounced in glacier-fed streams (hereafter GFSs), which are commonly enveloped by snow cover and darkness in winter, and subject to high flow and sediment mobilization in summer. Yet, ecological ‘windows of opportunity’ arise in spring and autumn1,2 when nutrient (N, P) and light availability is elevated and streamflow is moderate1–3. During the onset of spring snowmelt, inorganic N that has accumulated from atmospheric deposition and concentrated at the snowpack surface is washed into GFSs3,4, whereas hydrologic connectivity with various glacial sources (e.g., subglacial) can increase concentrations of phosphorus as the melt season progresses4,5. Following the height of the melt season in summer, discharge and turbidity decline in autumn, again elevating nutrient concentrations and light availability. These favorable conditions allow algae and cyanobacteria to rapidly develop into ‘green oases’ of phototrophic biofilms. Partially due to the absence of major terrestrial organic matter subsidies from the catchment, this punctuated exploitation of solar energy in an otherwise energy-limited ecosystem transiently forms the base of the GFS food web and ecosystem energetics1,6. Such windows of opportunity may therefore function as ‘ecosystem control points’7 with disproportionately high ecological processing rates affecting ecosystem dynamics relative to longer intervening time periods. These ecosystem control points are widely distributed across ecosystems and vary across spatial and temporal scales7. However, our understanding on the microbiology of the communities that facilitate ecosystem control points remains limited to date.
Owing to climate change, the mass balance and melting dynamics of mountain glaciers are rapidly changing worldwide, altering the annual distribution of runoff in GFSs8. Invigorated glacial melt increases discharge and sediment delivery, but after glaciers shrink past a certain point (i.e., ‘peak water’), GFSs are likely to become warmer, less turbid, and less hydrologically dynamic4. These changes are almost certain to have substantial impacts on GFS ecosystem structure and function by either contracting or extending the duration of these windows of opportunity. It is therefore critical to understand how benthic biofilms operate during these times in order to predict how these ecosystems are likely to change in the future4.
In streams, biofilms closely interact with the sedimentary environment9. For example, extracellular polymeric substances (EPS) produced by biofilms bind fine sediment grains together, which can locally stabilize substrata, reducing scour and vertical permeability10. Similarly, boulders resist flow-induced disturbance to promote biofilm growth11, and if protruding through the water column, may also increase light availability to further facilitate photosynthesis. Therefore, it seems advantageous for phototrophic biofilms to colonize boulders, which can be regarded as islands of stability in otherwise highly unstable GFS channels. These islands may allow biofilm growth to locally persist beyond the typical windows of opportunity (at least until snow cover), drive ecosystem energetics (i.e., gross primary production)12, and to sustain the GFS food web and related benthic biodiversity4,13.
The relationships between photoautotrophs (such as algae and cyanobacteria), prokaryotes and fungi regulate nutrient and carbon cycling, and therefore represent a fundamental ecological interface in aquatic ecosystems. This interface (i.e., the phycosphere) has received substantial attention in pelagic ecosystems over the last decades14–17, but less so in stream ecosystems. While early work on phototrophic biofilms colonizing the benthic zone in streams has highlighted the role of algal–bacterial interactions for carbon and nutrient fluxes18,19, we do not currently understand the fine-scale mechanisms of such interactions. For example, cyanobacteria produce pigments that protect the biofilm as a whole against harmful UV-radiation20, while mucilage-rich algal colonies (e.g., Hydrurus spp.) provide labile organic matter to heterotrophic microorganisms and facilitate their attachment. Such interactions may foster facultative interactions between photoautotrophs and other microorganisms, which, similarly to the phycosphere, may be particularly beneficial to microbial life in oligotrophic and harsh ecosystems such as GFSs. Unraveling the genomic and metabolic underpinnings of algal–bacterial relationships in biofilms helps to better understand the success of the biofilm mode of life in an extreme ecosystem.
Here we dissect the microbiome of GFSs and describe the genomic underpinnings of the adaptive mechanisms that potentially contribute to the success of complex biofilms. Using 16S rRNA and 18S rRNA gene amplicon sequencing, we assess the microbiome structure of biofilms associated with two sedimentary habitats that are common in GFSs, namely sandy sediments (i.e., epipsammic biofilms) and boulders (i.e., epilithic biofilms). We sampled geographically distant streams, transcending hemispheres (Southern Alps in New Zealand, NZ, and the Caucasus, CC), to draw more generalisable conclusions about microbiome structure and assembly. Furthermore, using genome-resolved metagenomics, we screen twenty-one epilithic biofilm microbiomes for energy pathways and cross-domain metabolic interactions. Our findings suggest the diversification of energy-acquiring pathways and metabolic interactions are relevant for epilithic biofilms to thrive during the ecological windows of opportunity, and beyond, within low-disturbance patches in GFSs. Moreover, our findings shed light on what the future biofilm mode of life in GFSs may look like as glaciers shrink and GFS ecosystems are predicted to become more autotrophic4.
Results and discussion
Sedimentary habitats affect microbiome structure and assembly
We used 16S rRNA and 18S rRNA gene amplicon sequencing to compare the microbiome structure of 48 epipsammic and epilithic biofilm samples from GFSs in NZ and CC collected during spring and autumn, respectively (Methods) (Fig. 1a; Supplementary Fig. 1a, b). These seasons broadly align with the windows of opportunity in these GFSs; however, we recognize that epilithic biofilms, in particular, may extend beyond these windows well into summer or until snow coverage. We found that both prokaryotic and eukaryotic communities differed between the two habitat types in terms of community structure and alpha diversity (Fig. 1b, c). Overall, taxonomic differences were even apparent at the phylum level, despite high inter-sample variability within the categories (Supplementary Fig. 1c, d). Geography (i.e., NZ versus CC) explained 11.5% and 12.9% of the variability in the prokaryotic and eukaryotic datasets (db-RDA, p < 0.05 for both datasets), while sedimentary habitats explained an additional 10% and 8.3% of the variability (db-RDA, p < 0.05 for prokaryotes and eukaryotes).
The estimated α-diversity (i.e., richness of amplicon sequence variants; ASVs) was higher for both prokaryotes and eukaryotes in epipsammic biofilms when compared to epilithic biofilms (2–3 fold differences, non-parametric t-tests, p < 0.001) (Fig. 1d, e). These observations are in accordance with findings by Tolotti and colleagues21 where α-diversity of the epipsammic habitats were higher than the epilithic biofilms in rock glacier- and groundwater/precipitation-fed waters21. It is plausible that continuous dispersal and mixing facilitated by the transport of fine sediments from various upstream sources (e.g., the subglacial environment and adjacent soils) leads to the greater diversity of the epipsammic biofilms. Overall, our results unravel distinct microbiome structures for both sediment habitats within the same GFS reaches. This agrees with previous studies21, and more generally with the relationship between streambed physical variation and spatial biodiversity dynamics22,23. Streambeds, including their biofilms, are understood as landscapes where dispersal among patches can shape biodiversity and resilience24–26. Therefore, we hypothesized that epilithic communities are partially structured by dispersal from epipsammic communities that typically dominate the GFS streambeds by area. Using Sloan’s neutral community model27, we instead found that the composition of the epilithic biofilms is not dictated by a source-sink relationship with the epipsammic communities (Supplementary Note). In other words, the epilithic biofilm communities are not determined by epipsammic communities that typically surround the boulders within the complex landscape of the GFS streambed.
Metagenomics unveils the complexity of epilithic biofilms
To unveil the full complexity of the epilithic biofilms, we performed whole genome shotgun metagenomics on 21 epilithic samples from four GFSs each in NZ and CC (Supplementary Fig. 1a, b); low biomass associated with sandy sediments precluded epipsammic biofilms from metagenomic analysis. Metagenomic sequencing, after quality filtering, yielded on average 1.2 × 108 (±1.4 × 107 s.d.) reads per sample which were assembled to obtain an average of 8.7 × 105 contigs per sample that were subsequently binned. Bacteria and eukaryotes dominated the biofilm communities across all samples (Supplementary Fig. 2a). Seventy-three (70 bacteria and three archaea) medium-to-high quality (>70% completion, < 5% contamination) metagenome-assembled genomes (MAGs) from a total of 662 MAGs formed the pool of the prokaryotes. As seen from the phylogenomic analysis, the high-quality MAGs (n = 49, >90% completion and <5% contamination) span the bacterial tree of life. Based on the phylogenomic analyses along with the taxonomic information (Fig. 2), we sought to further characterize these MAGs that could represent novel species or species that have not previously been reported (Fig. 2a). We found that only 30% of these high-quality MAGS were annotated up to the family level, whereas the remaining MAGs could be taxonomically labelled at the genus level. Only high-quality MAGs were used for the phylogenetic analyses to mitigate disparities arising from incomplete MAGs. Aggregated at the genus level, Polaromonas was both abundant and prevalent in the biofilms along with representatives of Flavobacterium, Cyanobacteria, and unclassified MAGs from the Bacteroidota and Candidate Phyla Radiation (CPR; Patescibacteria) (Fig. 2b). These taxa were found in over half of the samples, irrespective of geographic origin. The CPR bacteria have only recently been identified based on genomic data28, and Patescibacteria specifically have been reported from oligotrophic ecosystems, including groundwater29 and thermokarst lakes30. Their apparently minimal biosynthetic and metabolic pathways may help them dwell in these ecosystems, which is of equal relevance in GFSs.
Alongside these bacteria, archaea contributed less than 1% to the microbiome of epilithic biofilms, with representatives of Asgardarchaeota, Crenarchaeota and Nanoarchaeota. Intriguingly, the recently discovered lineages of Asgardarchaeota31,32 have been reported from freshwater sediments, yet not from cryospheric environments. Algae, mostly diatoms and Hydrurus (Ochrophyta phylum), as well as dinoflagellata, were the most important photoautotrophs of the eukaryotic domain (Fig. 2c). The prevalence of Hydrurus (~87% relative abundance) underscores the function of these filamentous algae as a resource to higher trophic levels in GFS33. Our metagenomic insights further support the notion that phototrophic biofilms are highly diverse with representatives from all three domains of life28.
In addition to the archaeal, bacterial and eukaryotic community members, we also found a diverse viral community associated with epilithic biofilms (Supplementary Fig. 2b). Most of the viruses were bacteriophages targeting abundant MAGs such as Flavobacterium, Pseudomonas, and Bacillus genera, but we also identified eukaryotic phages (i.e., Paramecium bursaria Chlorella virus). Few have studied viruses in stream biofilms to date34, potentially because it was common wisdom that the biofilm mode of life protects bacteria from viral infection. While viruses have previously been shown to be abundant in glaciers35,36, our findings provide evidence for a diverse and likely active viral community in GFS biofilms where they may influence bacterial growth and both carbon and nutrient cycling as on the glacier surface35.
Epilithic biofilms form the basis for a ‘green’ food web in glacier-fed streams
Cyanobacteria and eukaryotic algae dominated the photoautotrophs in the epilithic biofilms and hence form the basis of the ‘green’ food web during the windows of opportunity. While these photoautotrophs are well known to use chlorophyll to capture solar energy, little is known on retinal-based phototrophy using rhodopsins in GFSs. Intriguingly, we found that MAGs from sixteen out of twenty phyla in the epilithic biofilms, including the abundant groups, such as Proteobacteria (Polaromonas) and Bacteroidota (Flavobacterium), encoded for (bacterio-)rhodopsins (Fig. 3a). These also included genes encoding for light-harvesting complex 1 (LH1), reaction centre (RC) subunits (pufBALM), and transcriptional regulators (ppsR) required for aerobic anoxygenic phototrophs along with rhodopsins as a signature of energy-limitation adaptations (Fig. 3a). Recently, rhodopsins were also reported to serve as a photoprotectant in Flavobacterium from glaciers37. Collectively, our findings unveil multiple strategies of photoautotrophy, which may help cyanobacteria and algae to maximize their utilization of solar energy and to thrive on boulders in GFSs.
In order to exploit the favorable habitat provided by boulders during and beyond the windows of opportunity in GFS, rapid growth may be advantageous for primary producers such as cyanobacteria. Moreover, functional independence from other microorganisms could allow them to seize environmental opportunities. To test this hypothesis, we assessed the relationship between projected times of growth (doubling time in hours) with the median KEGG pathway completion within each MAG. Given the partial completeness of the MAGs, including possibly missing metabolic modules, we performed a linear regression between median KEGG pathway completion and projected time of growth, accounting for MAG completion as a fixed effect. Strikingly, 86% of the cyanobacterial MAGs (n = 38 out of 44) exhibited decreased projected times of growth with an increase in median KEGG module completion per MAG (rs = −0.47, Two way ANOVA, adj. p < 0.05). These observations suggest that when encoding all genes to form a complete KEGG pathway, phototrophic taxa within these epilithic biofilms may indeed grow rapidly and be self-sufficient, putatively autonomously from other microorganisms from other (micro)organisms and fostering growth.
Given the energetic constraints in GFSs, it would be beneficial for bacterial heterotrophs to interact with these photoautotrophic (micro)organisms for meeting their energy and nutrient demands. To investigate such cross-domain relationships, we used network analyses and identified key interacting taxa based on positively co-occurring nodes using all prokaryotic and eukaryotic MAGs (see Methods). Based on a null model assessment (see Methods), our interaction networks showed preferential attachment within the nodes, along with increased centralities (i.e., degree and edge-betweenness, Supplementary Fig. 3a, b), suggesting that the interactions within these networks were not random. More importantly, the largest connected component (based on degree and betweenness centralities) of the interaction network contained taxa spanning archaea, bacteria and eukaryotic domains (Fig. 3b and Supplementary Fig. 3b). Though Acidobacteria had a high degree of centrality, both Polaromonas and Methylotenera demonstrated strong interactions (>0.6 betweenness centrality) with primary producers (including eukaryotic algae) and fungi. Specifically, Polaromonas had a strong interaction with algae, while Methylotenera co-occurred with Chytridiomycetes (Fig. 3b). Interestingly, we found similarly connected nodes demonstrating cross-domain interactions within the largest component of the individual regions, i.e., NZ (Supplementary Fig. 3c, d) and CC (Supplementary Fig. 3e, f), albeit the two regions had varying numbers of edges (NZ = 205 and CC = 30). This suggests that inherent interactions within these GFS epilithic biofilms are conserved irrespective of geographic origins. These results also support our hypothesis of heterotrophic bacteria co-occurring with eukaryotes, primarily algae, for metabolic cross-feeding, similar to those occurring in the phycosphere15.
Furthermore, our results hint at the existence of a more cryptic interaction in epilithic biofilms between the parasitic fungi Chytridiomycetes and algae (mostly Ochrophyta). Fungal parasitism on pelagic algae has been recently reported to be more important than expected, even with consequences for carbon and nutrient cycling as mediated by the fungal shunt38,39. The possibility of fungal parasitism on algae in epilithic biofilms further supports the notion of photoautotrophs forming the foundation of a complex food web in GFS ecosystems.
Genomic underpinnings of algae–bacteria metabolic interactions
As photoautotrophs grow and senesce, they increasingly exude intracellular material into their ambient environment, where it can be metabolized by heterotrophic bacteria through extracellular enzymes40. To explore this metabolic cross-feeding between bacterial heterotrophs and algae, we assessed the MAGs for genes encoding five common extracellular enzymes required for cleaving complex polysaccharides, phosphomonoesters and proteins41. Not unexpectedly, these genes were associated with bacterial heterotrophs rather than with the photoautotrophs (Supplementary Fig. 4), which suggests adapted genomic traits to meet specific metabolic needs of the heterotrophs. However, based on the presence of extracellular enzyme genes among Cyanobacteria, we cannot discount the possibility of mixotrophy in the epilithic biofilms (Supplementary Fig. 4b). Additionally, genes associated with mixotrophy, such as those encoding for auto- and heterotrophic pathways, were also found in other abundant members of the epilithic microbiome (e.g., Proteobacteria). The widespread occurrence of mixotrophy in planktonic communities42, including members of the Cyanobacteria, and the ensuing food web dichotomy is considered as an adaptive strategy to oligotrophic and cold ecosystems (e.g., the polar sea42 and alpine lakes43). Therefore, we argue that mixotrophy may also be an important trait of Cyanobacteria within GFS biofilms.
Carbohydrate-active enzymes (CAZymes) are the primary tools used by heterotrophic bacteria to initiate the degradation of polysaccharides, largely algae-derived in the GFS epilithic biofilms. To shed light on this potential trophic interaction identified through specific extracellular enzyme activities (EEAs), we tested if all the CAZymes in the metagenomes covaried with the abundance of eukaryotes. Overall, we found positive correlations between eukaryote abundances and CAZymes, particularly carbohydrate-binding modules (CBM) and glycoside hydrolases (GH) (Supplementary Fig. 4d). More specifically, these correlations were particularly pronounced for GH and some of the algal groups (e.g., Ochrophyta, Haptophyta, Cryptophyta) that we found at relatively high abundances in the epilithic biofilms (Fig. 3c and Supplementary Fig. 4d). As some of these algae are known to copiously produce sulfated carbohydrates44, we suggest a similar involvement of CAZymes (Supplementary Data 1) in relation to polysaccharide degradation in GFS epilithic biofilms as recently reported from Verrucomicrobia isolates45. Given that sulfated carbohydrates are more resistant to bacterial degradation than other carbohydrates45, our findings suggest that they are still relevant to carbon turnover in an ecosystem that is inherently carbon limited.
In order to understand whether functions potentially geared towards cross-domain interactions were enriched in epilithic biofilms in GFSs, we compared the KEGG orthology (KO) annotations from our metagenomes to 105 metagenomes from a wide range of ecosystems (Supplementary Data 2). Strikingly, we found that whole metagenome comparisons revealed that KOs associated with quorum sensing, vitamin B12 (cobalamin) transporters and thiamine biosynthesis were enriched in epilithic GFS biofilms compared to other ecosystems (Supplementary Data 3). The associated pathways and their completion levels were evaluated using KEGGDecoder (Fig. 3d; Supplementary Fig. 5) indicating a high completion of pathways associated with cross-domain interactions. These findings are in line with previous genomic insights into algal–bacterial interactions46,47, specifically with the observed upregulation of vitamin biosynthesis in bacteria (Halomonas) growing in the presence of algal extracts.
Furthermore, several MAGs were found to encode genes (e.g., quorum sensing, cobalamin metabolism, tryptophan synthesis) potentially facilitating algal–bacterial interactions (Fig. 3a). Particularly, cobalamin metabolism may be relevant for nutrient acquisition in algal–bacterial relationships48, whereas tryptophan was reported as a key signalling molecule involved in interactions between bacteria and associated phytoplankton16,49. Collectively these genomic insights stress cross-domain interactions as an adaptive potential that the epilithic microorganisms have developed to exploit the window of opportunity in GFSs.
Energy acquisition and biogeochemical pathways in epilithic biofilm MAGs
The dominance (~88%) of MAGs encoding for organic carbon metabolism suggests a ‘baseline’ heterotrophy in GFSs likely supported by organic carbon subsidies from melting glaciers6,50,51 ‘green food web’ during the windows of opportunity, potentially sustaining metabolic interactions between primary producers and heterotrophs. Given the notoriously low concentrations of dissolved organic carbon in GFSs50,52,53, including our study sites in NZ (96.18 ± 21.35 µg C L−1) and CC (221.36 ± 31.01 µg C L−1), we suggest that the ‘green food web’ dominates over allochthonous subsidies.
Exploring the gene repertoire of the epilithic biofilms, we found that Cyanobacteria were one of the largest bacterial contributors to carbon fixation along with Bacteroidota and few Gammaproteobacteria (Fig. 4a). An in-depth analysis across the 662 MAGs revealed that 583 MAGs encoded genes involved in organic carbon oxidation, while 120 MAGs encoded genes involved in CO2 fixation. In line with the above findings, the majority of these MAGs was identified as Cyanobacteria along with few other phyla such as Proteobacteria, Asgardarchaeota, Crenarchaeota and Huberarchaeota. We also note that 351 MAGs encoded genes for fermentation (Fig. 4b) spanning several phyla, including Actinobacteriota, Bacteroidota, Patescibacteria, Planctomycetota and Verrucomicrobiota.
For biofilms to thrive in GFSs, particularly during the windows of opportunity, it appears opportune to diversify the exploitation of energy sources. Therefore, we performed an in-depth characterisation of chemolithotrophic pathways to explore the potential role of minerals derived from the glacial comminution of bedrock as an energy source for microorganisms54. The prevalence of the sox gene cluster in representatives of the Bacteriodota (UBA7662) and Bdellovibrionota reveals the potential importance of inorganic sulfur oxidation in epilithic biofilms. This notion is supported by the broad occurrence of sulfur dioxygenases (SDOs) across the various phyla that facilitate sulfur oxidation (Fig. 4c). Interestingly, Tranter and Raiswell suggested that sulfates derived from sulfide oxidation in comminuted bedrock55 potentially increase sulfur availability and acquisition in glacial meltwaters56. Sulfide oxidation can stimulate carbonate weathering with the resulting CO2 potentially being fixed by algae and cyanobacteria in the epilithic biofilms—a link that appears relevant given that GFSs are often undersaturated in CO257. Furthermore, we found that almost all MAGs encoded for group IV hydrogen dehydrogenases (NiFe_Gp4; Fig. 4c), which potentially serve as an alternate energy acquisition pathway. Hydrogen dehydrogenases have recently been reported to support primary production in various glacial and other extreme environments58,59. This suggests that lithogenic hydrogen may also contribute energy to bacteria within the epilithic biofilms.
Genomic insights into the nitrogen cycle revealed the Dissimilatory Nitrate Reduction to Ammonium (DNRA, or nitrite ammonification) and, to a lesser extent, denitrification, as major pathways (Fig. 4d). Relatively little is known regarding these two competing pathways in stream biofilms or sediments60, particularly in GFSs. This is in line with other ecosystems where DNRA is favoured over denitrification when alternate electron donors prevail over nitrate61. For instance, predicting metagenomes from 16S rRNA sequences, Ren et al.62 found DNRA to be an important pathway in GFSs, suggesting that bacteria use inorganic nitrogen more as an energy source than a source for biosynthesis. Our analyses revealed Burkholderiales (Gammaproteobacteria) as the largest contributor to nitrate assimilation and ammonia-oxidation genes (Fig. 4a, c). DNRA, if not conducive to N2O production, would enhance nitrogen recycling within epilithic biofilms through ammonia assimilation by algae and cyanobacteria, for instance. Our genomic evidence for nitrogen recycling that potentially overwhelms nitrogen losses through denitrification is corroborated by flux measurements from microbial mats in Antarctic GFSs63, and highlights recycling as a strategy to cope with nutrient limitation in glacier ecosystems63–65.
Strikingly, we found only few MAGs, mostly belonging to Deinococcota, Gammaproteobacteria, Beijerinckiaceae and Crenarchaeota, involved in the oxidation of ammonia and nitrite, potentially leading to the accumulation of nitrate. The involvement of archaea would be in line with recent studies showing ammonia oxidation by archaea in Arctic soils66 and with the observation that archaea couple ammonia oxidation with biomass formation (i.e., via CO2 fixation)67. Our finding that archaeal MAGs encode for carbon fixation genes (Fig. 4b) further highlight their role in ammonia oxidation and biomass accrual in epilithic biofilms. Overall, the overlap of metabolic capacities within the MAGs suggests that the epilithic biofilms efficiently recycle carbon and nutrients. Internal recycling in stream biofilms is thought to be facilitated by increased residence times of water and contained solutes within the biofilms compared to the overlying water68, which is certainly an advantage in a losing ecosystem such as GFSs.
Genomic underpinnings of adaptation to the extreme GFS environment
The GFS environment is extreme as illustrated by near-freezing temperatures, high UV-radiation, and high flow velocities. To assess potential adaptive traits of bacteria dwelling in epilithic biofilms, we first performed a phylogenomic analysis of Polaromonas spp., one of the most abundant and prevalent genera in the studied GFSs. Our analysis revealed that a few of the GFS Polaromonas formed clades that are distinct from Polaromonas identified in other environments (Methods), thus potentially comprising novel ‘species’ (Fig. 5a). This phylogenomic pattern indicates that Polaromonas has evolved traits that facilitates its success in GFS, both in NZ and CC. To identify such traits, we created a pangenome and performed an enrichment analysis for clusters of orthologous genes. We found three categories that were significantly enriched in GFS Polaromonas compared to those from other environments (Supplementary Data 4). Two categories are related to defense mechanisms, both general and transcription, and one to energy production (Fig. 5b). It is plausible that these mechanisms are related to high UV-radiation69,70 and oxidative stress71, as well as to cold stress responses as previously reported from other bacteria72–74. Furthermore, the presence of CRISPR-Cas proteins in the enriched clusters of orthologous genes (COGs) hint at defense mechanisms against phages (Supplementary Data 4), which we showed to be present in the epilithic biofilms. This is in accordance with reports demonstrating that cryospheric bacteria (such as Janthinobacterium spp.) develop defense strategies, including biofilm formation75 and extracellular vesicle formation76 to escape viruses. On the other hand, the transcription of ‘defense mechanism’ genes have been linked to cold adaptation in psychrophiles72. Cold-shock proteins regulate transcription at low temperature, while genes involved in membrane biogenesis77 and membrane transport proteins78, several of which are also enriched in the GFS Polaromonas genomes, are up-regulated. For example, in the psychrophilic Colwellia psychrerythraea 34H, adaptation to cold includes the maintenance of the cell membrane in a liquid-crystalline state via the expression of genes involved in polyunsaturated fatty acid synthesis79. Similarly, ATP-driven or proton motive secondary transport systems have been associated with solute transfers across membranes in bacteria and archaea as an adaptation to the cold78.
Our insights into the adaptive potential of Polaromonas to the GFS environment prompted us to expand our search for adaptive traits across all MAGs from the epilithic biofilms. Querying for 76 genetic traits spanning nine categories related to cold adaptation73, we indeed found distinct patterns of genomic adaptation across MAGs (Fig. 5c). Several MAGs encoded for genes associated with membrane and peptidoglycan alterations, cold and heat shock proteins, oxidative stress, and transcription/translation factors alongside DNA replication and repair. While all major phyla encoded for adaptive traits related to the outer membrane and cell wall, Proteobacteria were the predominant group with an overall higher copy number of genes (~5 copies/genome), albeit insignificant compared to other phyla, involved in counteracting osmotic and oxidative stress. This was followed by Bacteroidota, Cyanobacteria and Actinobacteriota with three, two and two copies per genome respectively. Interestingly, we found that Patescibacteria MAGs had significantly lower copies of cold adaptation genes, whilst both Actinobacteria and Asgardarchaeota demonstrated a significantly higher number of osmotic stress genes (Supplementary Data 7). This is in line with metagenomic studies reporting an enrichment of sigma B genes in Antarctic mats, allowing for surviving severe osmotic stress during freezing74. Similarly, Psychrobacter arcticus80 and Planococcus halocryophilus Or181 were shown to have specific genomic modifications, particularly with genes involved in putrescine and spermidine accumulation, both of which are associated with alleviating oxidative stress. Furthermore, MAGs from Proteobacteria were characterized by a high prevalence of genes potentially expressed in response to stressors, such as UV and reactive oxygen species (Fig. 5c).
Our genomic insights into possible adaptive traits of epilithic microorganisms may also contribute to our understanding of their adaptation beyond the windows of opportunity when the GFS environment is even harsher. In fact, with the onset of winter and during winter, GFSs partially freeze and become snow-covered thereby inhibiting primary production. Mixotrophy as observed within the Cyanobacteria (Supplementary Fig. 4b) would be advantageous during these periods.
Furthermore, it is recognized that cell membrane alterations and lipid composition allow for withstanding cold conditions73. Our observations regarding several MAGs encoding genes associated with membrane and peptidoglycan alterations are concordant with previous reports of increased membrane fluidity in Psychrobacter arcticus 273–480, Sphingopyxis alaskensis82, and Pseudomonas extremaustralis83. Simultaneously, at low temperatures oxygen solubility increases, potentially generating reactive oxygen species and subsequently leading to oxidative stress84. As reported above, we observed several MAGs encoding genes to counteract this phenomenon which may be even more critical as temperatures may decrease outside of the ‘warmer’ windows of opportunity. Overall, the diversity of the cold adaptation genes and their potential mechanisms within MAGs support the notion that these taxa are potentially equipped to deal with the even harsher GFS environment outside the windows of opportunity.
In conclusion, our genome-resolved metagenomic analyses have set the stage for a mechanistic understanding of how the diversification of energy and matter acquisition pathways, metabolic interactions, and genomic adaptations to harsh environmental conditions allow GFS biofilms to persist and thrive during windows of opportunity and beyond. We acknowledge that a metagenomic time series outside and throughout windows of opportunity would be required to substantiate some of our observations. Nevertheless, our findings shed light on boulders as important habitats that confer stability to biofilms even outside the typical windows of opportunity. GFSs count among the ecosystems that are most vulnerable to climate change. Therefore, our findings open a window into the future of how microbial life, with a strong photoautotrophic component, may look like in GFSs when the environmental conditions become more favorable for primary producers as glaciers shrink.
Methods
Sample collection
We sampled a total of eight GFSs from the Southern Alps in New Zealand Southern Alps and the Caucasus in Russia in early- and mid-2019, respectively, for a total of 27 epipsammic samples taken from sandy sediments and 21 epilithic biofilm samples from boulders adjacent to the epipsammic samples (Supplementary Data 5). In order to have comparable samples, the collection was largely constrained to the vernal and autumnal windows of opportunity, respectively. Epipsammic samples were collected from each GFS by first identifying three patches within a reach of ~5–10 m. From each patch, epipsammic samples were taken from the <5 cm surface of the streambed with a flame-sterilized metal scoop and sieved to retain the 250 μm to 3.15 mm size fraction. While three epipsammic samples were taken from each stream, epilithic samples were taken opportunistically from up to three boulders per reach (Supplementary Data 5) due to their heterogeneity within and among the streams due to the unequal presence of boulders in each GFS. Epilithic biofilms were sampled using a sterilized metal spatula. All samples were immediately flash-frozen in liquid nitrogen in the field and transported and stored frozen pending DNA extraction. Streamwater turbidity, conductivity, temperature, and pH were measured in situ during the sampling (Supplementary Data 5). Samples for the determination of streamwater dissolved organic carbon and inorganic nutrient concentrations were filtered through pre-combusted (450 °C) glass microfiber filters (GF/F, Whatman), frozen, and analyzed in the laboratory. DOC concentration was measured with a TOC carbon analyzer (Sievers M9 TOC Analyser, GE). Phosphate, ammonium, nitrite and nitrate were measured with a continuous flow injection analyzer (Lachat QuikChem 8500, methods 10-115-01-1-M (PO4), 10-107-04-1-B (NO3/NO2) and 10-107-06-3-D (NH3)) (Supplementary Data 5).
DNA extraction and purification
A previously established protocol85 was used to extract DNA from all samples. Briefly, 5 g of epipsammic and 0.05–0.1 g of epilithic biofilm were subjected to a phenol:chloroform-based extraction and purification method. The differential input volume for the DNA extractions were established to account for the differences in biomass between the epipsammic and epilithic biofilms. The samples were treated with a lysis buffer containing SDS along with 0.1 M Tris-HCl pH 7.5, 0.05 M EDTA pH 8, 1.25% SDS and RNase A (10 µl: 100 mg/ml). The samples were vortexed and incubated at 37 °C for 1 h. Proteinase K (100 µl; 20 mg/ml) was subsequently added and further incubated at 70 °C for 10 min. Samples were purified once with phenol/chloroform/isoamyl alcohol (ratio 25:24:1, pH 8) and the supernatant was subsequently extracted with a 24:1 ratio chloroform/isoamyl alcohol. Linear polyacrylamide (LPA) was used along with sodium acetate and ice-cold isopropanol for precipitating that DNA overnight at −20 °C. For epilithic biofilms, the entire protocol was adapted to a smaller scale due to the availability of higher DNA concentrations compared to sediment. The former was treated with 0.75 ml of lysis buffer (instead of 5 ml for sediment) and all subsequent volumes of reagents were adapted accordingly (see supplementary material). Furthermore, a mechanical lysis step of bead-beating was necessary along with a lysis buffer to facilitate DNA release from the more developed epilithic biofilms. Due to the higher DNA yields, the addition of LPA was omitted from the DNA precipitation step. DNA quantification was performed for all samples with the Qubit dsDNA HS kit (Invitrogen).
Metabarcoding library preparation and sequencing
The prokaryotic 16S rRNA gene metabarcoding library preparation was performed as described in Fodelianakis et al.86, targeting the V3-V4 hypervariable region of the 16S rRNA gene with the 341 F (5ʹ-CCTACGGGNGGCWGCAG-3ʹ) and 785R (5ʹ-GACTACHVGGGTATCTAATCC-3ʹ) primers and following Illumina guidelines for 16S metagenomic library preparation for the MiSeq system. The eukaryotic 18 S rRNA gene metabarcoding library preparation was performed likewise but using the TAReuk454F (5ʹ-CCAGCASCYGCGGTAATTCC-3ʹ) and TAReukREV3 (5ʹ- CTTTCGTTCTTGATYRA-3ʹ) primers to target the 18 S rRNA gene V4 loop87. Samples were sequenced using a 300-bp paired-end protocol partly in the Genomic Technologies Facility of the University of Lausanne (27 epipsammic samples) and partly at the Biological Core Lab of the King Abdullah University of Science and Technology (21 epilithic samples).
Metabarcoding analyses
The 16S rRNA gene metabarcoding data were analysed using a combination of Trimmomatic88 and QIIME289 as described in Fodelianakis et al.86, with the exception that here the latest SILVA database90 v138.1 was used for taxonomic classification of 16S rRNA and 18S rRNA gene amplicons. Non-bacterial ASVs including those affiliated to archaea, chloroplasts and mitochondria were discarded from the 16S rRNA amplicon dataset in all downstream analyses. ASVs observed only once were removed from both 16S rRNA and 18 S rRNA amplicon datasets. Diversity analyses were performed in R using the vegan91 and metacoder92 packages. For non-metric multidimensional scaling (nMDS) and distance-based redundancy (db-RDA) analyses data were log(x + 1) transformed and the capscale and ordiR2step (backwards direction, 200 permutations) functions from vegan were used. To test for a source-sink hypothesis from epipsammic to epilithic, the Sloan’s Neutral Community Model27 was used based on the R implementation developed by Burns et al.93.
Whole-genome shotgun libraries and sequencing
All epilithic biofilm DNA samples underwent random shotgun sequencing following library preparation using the NEBNext Ultra II FS library kit94. Briefly, 50 ng of DNA was used for constructing metagenomic libraries under 6 PCR amplification cycles, following enzymatic fragmentation of the input DNA for 12.5 min. The average insert size of the libraries was 450 bp. Qubit (Invitrogen) was used to quantify the libraries followed by quality assessment using the Bioanalyzer from Agilent. Sequencing was performed at the Functional Genomics Centre Zurich on a NovaSeq (Illumina) using a S4 flowcell.
Metagenomic preprocessing, assembly, binning, and analyses
For processing metagenomic sequence data, we used the Integrated Meta-omic Pipeline (IMP)95 workflow to process paired forward and reverse reads using version 3.0 (commit# 9672c874; available at https://git-r3lab.uni.lu/IMP/imp3)96. IMP’s workflow includes preprocessing, assembly, genome reconstructions and additional functional analysis of genes based on custom databases in a reproducible manner. Briefly, adapter trimming is followed by an iterative assembly using MEGAHIT v1.2.997. Concurrently, MetaBAT2 v2.12.198 and MaxBin2 v2.2.799 are used for binning in addition to an in-house method, binny100, for reconstructing metagenome-assembled genomes (MAGs). Binning was completed by selecting a non-redundant set of MAGs using DASTool101 based on a score threshold of 0.7. The quality of the MAGs was assessed using CheckM v1.1.3102, while taxonomy was assigned using the GTDB-toolkit v1.4.1103.
For the downstream analyses including identification of viruses, VIBRANT v1.2.1104 was used on the metagenomic assemblies. The output from this was used to identify the viral taxa using vConTACT2 v0.9.22105. Independently, the viral contigs were also validated using CheckV v0.7.0106. To estimate the overall abundances of eukaryotes along with prokaryotes including archaea, we used EUKulele v1.0.5107 with both the MMETSP and the PhyloDB databases, run separately, to confirm the detected eukaryotic profiles. To understand the overall metabolic and functional potential of the metagenome and reconstructed MAGs we used MANTIS108. Additionally, we used METABOLIC v4.0109, metabolisHMM v2.21110, and Lithogenie from MagicLamp v1.0 (https://github.com/Arkadiy-Garber/MagicLamp) to identify metabolic and biogeochemical pathways relevant for determining nutritional phenotypes of all MAGs along with the ‘anvi-estimate-metabolism’ function from anvi’o111. This information was manually validated based on the different tools to identify which MAGs encode for the respective pathways. Subsequently, to determine the growth rates of prokaryotes, we used codon usage statistics for detecting optimization of genes that are highly expressed, as an indicator of maximal growth rates with gRodon v1.0112. All the parameters, databases, and relevant code for the analyses described above are openly available at https://git-r3lab.uni.lu/susheel.busi/nomis_pipeline and included in the Code availability section.
Eukaryote assembly and binning
To obtain eukaryotic MAGs, an alternate, custom pipeline (https://github.com/Mass23/NOMIS_ENSEMBLE/tree/coassembly) was established for coassembling the twenty-one epilithic biofilm sequence data with subsequent binning. Individual samples were first preprocessed similar to the workflow used in IMP, i.e., using FastP v0.20.0113. Subsequently, the reads were deduplicated to avoid overlap and enhance computation efficiency using clumpify.sh from the BBmap suite v38.79114. Thereafter, any reads mapping to bacteria or viruses were removed by filtering the reads against a Kraken2 v2.0.9beta115 maxikraken database available at https://lomanlab.github.io/mockcommunity/mc_databases.html. Only reads that were unknown or mapping to eukaryotes were retained and concatenated. This was followed by another round of deduplication using clumpify.sh. The concatenated reads were assembled using MEGAHIT v1.2.7 with the following options: -kmin-1pass -m 0.9 -k-list 27,37,47,57,67,77,87 -min-contig-len 1000. Following assembly, EukRep v0.6.7116 was used for retrieving eukaryotic contigs with a minimum length of 2000 bp and the ‘-m strict’ flag. These contigs were used for binning into MAGs as described herein.
Eukaryotic MAGs were binned using CONCOCT v1.1.0117. To do this, coverages were estimated for the contigs by mapping the reads of all samples against the contigs using the coverm v0.6.1 (https://github.com/wwood/CoverM) to generate bam files. These files were then used to generate a table with coverage depth information per sample. The protein coding genes of the MAGs was predicted with MetaEuk v4.a0f584d118 with their in-house database made with MERC, MMETSP and Uniclust50 (http://wwwuser.gwdg.de/~compbiol/metaeuk/). The annotation was then subsequently done with eggNOG-mapper v2.1.0119. The completeness and contamination of the MAGs were assessed with Busco v5.0.0120 and the eukaryotic lineage (255 genes). We determined their taxonomy by comparing the results of the EUKulele v1.0.3107 and EukCC v0.3121 along with homology comparisons with publicly available genomes not included in the previous tools by protein BLAST v2.10.0122.
Co-occurrence interaction networks
Co-occurrence networks between the pro- and eukaryotic MAGs were constructed using an average of the distance matrices created from SparCC123, Spearman’s correlation and SpiecEasi124, where the networks were constructed using the ‘Meinshausen and Bühlmann (mb)’ method. Nodes with fewer than two degrees were discarded to identify cliques with three or more interactions, while negative edges were removed to visualize only mutualistic relationships. The matrix was visualised using the igraph125 R package. The largest component from the overall co-occurrence network was determined using the components module of the igraph package. Null model hypothesis was tested by assessing the distribution of the node degree and the respective probabilities of the occurrence network against those simulating the Erdos-Renyi, Barabasi-Albert, Stochastic-block null models126. The igraph package was also used for plotting the networks.
Phylogenomics and pangenomes
For the pangenome analyses, we collected all the bins taxonomically identified as Polaromonas spp. and used the pangenome workflow described by Meren et al. (http://merenlab.org/2016/11/08/pangenomics-v2) using anvi’o111, along with NCBI127 refseq genomes for comparison and an outgroup from the closely related Rhodoferax genus. The choice of Polaromonas spp. was based on its high abundance and prevalence within the epilithic biofilms. The accession IDs from the reference genomes obtained from NCBI are provided in the supplementary material. The pangenome was run using the -min-bit 0.5, -mcl-inflation 10 and -min-occurence 2 parameters, excluding the partial gene calls. A phylogenomic tree was built using MUSCLE v3.8.1551128 and FastTree2 v2.1.10129 on all single-copy gene clusters in the pangenome that were present in at least 30 genomes and had a functional homogeneity index below 0.9, and geometric homogeneity index above 0.9. The phylogenomic tree was used to order the genomes, the frequency of gene clusters (GC) to order the GC dendrogram. A phylogenomic bacterial tree of life containing the 47 high-quality MAGs along with 264 NCBI bacterial genomes was built based on a set of 74 single-copy genes using the GToTree v1.5.51130 pipeline with the -D parameter, allowing to retrieve taxonomic information for the NCBI accessions. Briefly, HMMER3 v3.3.2131 was used to retrieve the single-copy genes after gene-calling with Prodigal v2.6.3132 and aligned using TrimAl v1.4.rev15133. The entire workflow is based on GNU Parallel v20210222134.
Data analyses and figures
Figures for the study including visualizations derived from the taxonomic and functional components, were created using version 3.6 of the R statistical software package135. The maps indicating the collection sites were generated using the ggmap136 package in R. KEGGDecoder137 was used to assess enriched KEGG orthology (KO) IDs in comparison to 105 publicly available metagenome sampled in various ecosystems at a global scale (Supplementary Data 3 and 6), which were processed using the IMP workflow. DESeq2138 with FDR-adjustments for multiple testing were used to assess KOs significantly enriched in the GFS metagenomes compared to this comparison dataset. The volcano plot highlighting the significant KOs was generated using the EnhancedVolcano139 R package. Figures from metabarcoding data were also generated in Rv3.6 using the ggplot2140 package and were further annotated graphically using Inkscape141 while the network plots were generated using the igraph v1.2.2 package.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
This research was funded by The NOMIS Foundation to TJB. SBB was supported by the Synergia grant (CRSII5_180241: Swiss National Science Foundation) to TJB. LdN and PW are supported by the Luxembourg National Research Fund (FNR; PRIDE17/11823097). RM and DD are supported by King Abdullah University of Science and Technology through baseline research funds to DD. We are thankful for the assistance of Audrey Frachet Bour, Lea Grandmougin, Janine Habier, Laura Lebrun (LCSB) and Emmy Marie Oppliger (EPFL) for laboratory support. We are grateful to Alex Washburne for his feedback on the draft, and we also acknowledge the valuable input from Rashi Halder at the LCSB Sequencing Platform with respect to library preparation. We are equally grateful for the valuable insights into metagenomic processing from Patrick May, Anna Heintz-Buschart, and Cedric Christian Laczny, and especially Valentina Galata with the python scripts and Snakemake workflows. The computational analyses presented in this paper were carried out using the HPC facilities at the University of Luxembourg (https://hpc.uni.lu)142.
Source data
Author contributions
SBB, MB, SF, HP, PW, and TJB conceived of the project. MiST, MT, VDS, MaSc, and HP conducted the fieldwork. PP, SF and SBB extracted DNA, while SBB and PP prepared the metagenomic and metabarcoding libraries, and RM and DD performed the sequencing. SBB conceptualized the data analyses, while SBB, MB, SF, GM and LE performed the analyses. LdN contributed to the python scripts and Snakemake workflows for the analyses. SBB, MB, TJK, PW and TJB wrote the manuscript with significant input and editing from all coauthors.
Peer review
Peer review information
Nature Communications thanks the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Data availability
Raw sequencing data samples and the MAGs are available at NCBI’s sequence read archive under BioProject accession PRJNA733707. The Biosample accession IDs and the metadata associated with each sample are listed under Supplementary Data 6. A snippet of the results and source data generated and used in this study have been deposited in Zenodo at 10.5281/zenodo.5545722. Data used to generate the figures are also provided as a ‘Source Data’ file. Source data are provided with this paper.
Code availability
The detailed code used for the downstream functional and growth analyses is available at https://git-r3lab.uni.lu/susheel.busi/nomis_pipeline and 10.5281/zenodo.6372573. The custom pipeline for eukaryote analyses can be found here: https://github.com/Mass23/NOMIS_ENSEMBLE/tree/coassembly. Subsequent binning and manual refinement of eukaryotic MAGs was done as described here: https://git-r3lab.uni.lu/susheel.busi/nomis_pipeline/-/blob/master/workflow/notes/MiscEUKMAGs.md.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Susheel Bhanu Busi, Massimo Bourquin, Stilianos Fodelianakis.
Contributor Information
Paul Wilmes, Email: paul.wilmes@uni.lu.
Tom J. Battin, Email: tom.battin@epfl.ch
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-022-29914-0.
References
- 1.Uehlinger, U., Robinson, C. T., Hieber, M. & Zah, R. The physico-chemical habitat template for periphyton in alpine glacial streams under a changing climate. in Global Change and River Ecosystems—Implications for Structure, Function and EcosystemServices (eds. Stevenson, R. J. & Sabater, S.) 107–121 (Springer Netherlands, 2010).
- 2.Battin TJ, Wille A, Psenner R, Richter A. Large-scale environmental controls on microbial biofilms in high-alpine streams. Biogeosciences. 2004;1:159–171. doi: 10.5194/bg-1-159-2004. [DOI] [Google Scholar]
- 3.Kuhn M. The nutrient cycle through snow and ice, a review. Aquat. Sci. 2001;63:150–167. doi: 10.1007/PL00001348. [DOI] [Google Scholar]
- 4.Milner AM, et al. Glacier shrinkage driving global changes in downstream systems. Proc. Natl Acad. Sci. U. S. A. 2017;114:9770–9778. doi: 10.1073/pnas.1619807114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Tockner K, Malard F, Uehlinger U, Ward JV. Nutrients and organic matter in a glacial river-floodplain system (Val Roseg, Switzerland) Limnol. Oceanogr. 2002;47:266–277. doi: 10.4319/lo.2002.47.1.0266. [DOI] [Google Scholar]
- 6.Boix Canadell M, et al. Regimes of primary production and their drivers in Alpine streams. Freshw. Biol. 2021;66:1449–1463. doi: 10.1111/fwb.13730. [DOI] [Google Scholar]
- 7.Bernhardt ES, et al. Control points in ecosystems: moving beyond the hot spot hot moment concept. Ecosystems. 2017;20:665–682. doi: 10.1007/s10021-016-0103-y. [DOI] [Google Scholar]
- 8.Huss M, Hock R. Global-scale hydrological response to future glacier mass loss. Nat. Clim. Chang. 2018;8:135–140. doi: 10.1038/s41558-017-0049-x. [DOI] [Google Scholar]
- 9.Battin TJ, Besemer K, Bengtsson MM, Romani AM, Packmann AI. The ecology and biogeochemistry of stream biofilms. Nat. Rev. Microbiol. 2016;14:251–263. doi: 10.1038/nrmicro.2016.15. [DOI] [PubMed] [Google Scholar]
- 10.Roncoroni M, Brandani J, Battin TI, Lane SN. Ecosystem engineers: biofilms and the ontogeny of glacier floodplain ecosystems. WIREs Water. 2019;6:e1390. doi: 10.1002/wat2.1390. [DOI] [Google Scholar]
- 11.Hoyle JT, Kilroy C, Hicks DM, Brown L. The influence of sediment mobility and channel geomorphology on periphyton abundance. Freshw. Biol. 2017;62:258–273. doi: 10.1111/fwb.12865. [DOI] [Google Scholar]
- 12.Canadell, M. B. et al. Regimes of primary production and their drivers in Alpine streams. Freshwater Biol.10.1111/fwb.13730 (2021).
- 13.Fell SC, Carrivick JL, Brown LE. The multitrophic effects of climate change and glacier retreat in mountain rivers. Bioscience. 2017;67:897–911. doi: 10.1093/biosci/bix107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Cole JJ. Interactions between bacteria and algae in aquatic ecosystems. Annu. Rev. Ecol. Syst. 1982;13:291–314. doi: 10.1146/annurev.es.13.110182.001451. [DOI] [Google Scholar]
- 15.Seymour, J. R., Amin, S. A., Raina, J.-B. & Stocker, R. Zooming in on the phycosphere: the ecological interface for phytoplankton–bacteria relationships. Nat. Microbiol.2, 17065 (2017). [DOI] [PubMed]
- 16.Amin SA, et al. Interaction and signalling between a cosmopolitan phytoplankton and associated bacteria. Nature. 2015;522:98–101. doi: 10.1038/nature14488. [DOI] [PubMed] [Google Scholar]
- 17.Christie-Oleza JA, Sousoni D, Lloyd M, Armengaud J, Scanlan DJ. Nutrient recycling facilitates long-term stability of marine microbial phototroph–heterotroph interactions. Nat. Microbiol. 2017;2:1–10. doi: 10.1038/nmicrobiol.2017.100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Haack TK, McFeters GA. Nutritional relationships among microorganisms in an epilithic biofilm community. Microb. Ecol. 1982;8:115–126. doi: 10.1007/BF02010445. [DOI] [PubMed] [Google Scholar]
- 19.Kaplan LA, Bott TL. Diel fluctuations in bacterial activity on streambed substrata during vernal algal blooms: effects of temperature, water chemistry, and habitat. Limnol. Oceanogr. 1989;34:718–733. doi: 10.4319/lo.1989.34.4.0718. [DOI] [Google Scholar]
- 20.Vincent WF, Downes MT, Castenholz RW, Howard-Williams C. Community structure and pigment organisation of cyanobacteria-dominated microbial mats in Antarctica. Eur. J. Phycol. 1993;28:213–221. doi: 10.1080/09670269300650321. [DOI] [Google Scholar]
- 21.Tolotti M, et al. Alpine headwaters emerging from glaciers and rock glaciers host different bacterial communities: Ecological implications for the future. Sci. Total Environ. 2020;717:137101. doi: 10.1016/j.scitotenv.2020.137101. [DOI] [PubMed] [Google Scholar]
- 22.Besemer K, Singer G, Hödl I, Battin TJ. Bacterial community composition of stream biofilms in spatially variable-flow environments. Appl. Environ. Microbiol. 2009;75:7189–7195. doi: 10.1128/AEM.01284-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Risse‐Buhl U, et al. Near streambed flow shapes microbial guilds within and across trophic levels in fluvial biofilms. Limnol. Oceanogr. 2020;65:2261–2277. doi: 10.1002/lno.11451. [DOI] [Google Scholar]
- 24.Palmer MA, Swan CM, Nelson K, Silver P, Alvestad R. Streambed landscapes: evidence that stream invertebrates respond to the type and spatial arrangement of patches. Landsc. Ecol. 2000;15:563–576. doi: 10.1023/A:1008194130695. [DOI] [Google Scholar]
- 25.Battin TJ, et al. Microbial landscapes: new paths to biofilm research. Nat. Rev. Microbiol. 2007;5:76–81. doi: 10.1038/nrmicro1556. [DOI] [PubMed] [Google Scholar]
- 26.Dzubakova, K. et al. Environmental heterogeneity promotes spatial resilience of phototrophic biofilms in streambeds. Biol. Lett. 14, 20180432 (2018). [DOI] [PMC free article] [PubMed]
- 27.Sloan WT, et al. Quantifying the roles of immigration and chance in shaping prokaryote community structure. Environ. Microbiol. 2006;8:732–740. doi: 10.1111/j.1462-2920.2005.00956.x. [DOI] [PubMed] [Google Scholar]
- 28.Hug LA, et al. A new view of the tree of life. Nat. Microbiol. 2016;1:16048. doi: 10.1038/nmicrobiol.2016.48. [DOI] [PubMed] [Google Scholar]
- 29.Chaudhari, N. M., Overholt, W. A. & Figueroa-Gonzalez, P. A. The economical lifestyle of CPR bacteria in groundwater allows little preference for environmental drivers. bioRxiv. 16, 1–8 (2021). [DOI] [PMC free article] [PubMed]
- 30.Vigneron A, et al. Ultra‐small and abundant: candidate phyla radiation bacteria are potential catalysts of carbon transformation in a thermokarst lake ecosystem. Limnol. Oceanogr. Lett. 2020;5:212–220. doi: 10.1002/lol2.10132. [DOI] [Google Scholar]
- 31.Liu Y, et al. Expanded diversity of Asgard archaea and their relationships with eukaryotes. Nature. 2021;593:553–557. doi: 10.1038/s41586-021-03494-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Cai M, et al. Ecological features and global distribution of Asgard archaea. Sci. Total Environ. 2021;758:143581. doi: 10.1016/j.scitotenv.2020.143581. [DOI] [PubMed] [Google Scholar]
- 33.Niedrist GH, Füreder L. When the going gets tough, the tough get going: The enigma of survival strategies in harsh glacial stream environments. Freshw. Biol. 2018;63:1260–1272. doi: 10.1111/fwb.13131. [DOI] [Google Scholar]
- 34.Payne AT, et al. Widespread cryptic viral infections in lotic biofilms. Biofilms. 2020;2:100016. doi: 10.1016/j.bioflm.2019.100016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Anesio, A. M., Mindl, B., Laybourn-Parry, J., Hodson, A. J. & Sattler, B. Viral dynamics in cryoconite holes on a high Arctic glacier (Svalbard). J. Geophys. Res. 112, (2007).
- 36.Bellas CM, Schroeder DC, Edwards A, Barker G, Anesio AM. Flexible genes establish widespread bacteriophage pan-genomes in cryoconite hole ecosystems. Nat. Commun. 2020;11:4403. doi: 10.1038/s41467-020-18236-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Liu Q, et al. Light stimulates anoxic and oligotrophic growth of glacial Flavobacterium strains that produce zeaxanthin. ISME J. 2021;15:1844–1857. doi: 10.1038/s41396-020-00891-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Sánchez Barranco V, et al. Trophic position, elemental ratios and nitrogen transfer in a planktonic host-parasite-consumer food chain including a fungal parasite. Oecologia. 2020;194:541–554. doi: 10.1007/s00442-020-04721-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Klawonn, I. et al. Characterizing the ‘fungal shunt’: Parasitic fungi on diatoms affect carbon flow and bacterial communities in aquatic microbial food webs. Proc. Natl. Acad. Sci. U. S. A. 118, (2021). [DOI] [PMC free article] [PubMed]
- 40.Chróst, R. J. Microbial Enzymes in Aquatic Environments. (Springer-Verlag, 1991).
- 41.Sinsabaugh RL, Hill BH, Follstad Shah JJ. Ecoenzymatic stoichiometry of microbial organic nutrient acquisition in soil and sediment. Nature. 2009;462:795–798. doi: 10.1038/nature08632. [DOI] [PubMed] [Google Scholar]
- 42.Stoecker, D. K. & Lavrentyev, P. J. Mixotrophic plankton in the polar seas: a pan-arctic review. Front. Mar. Sci. 5, 292 (2018).
- 43.Waibel A, Peter H, Sommaruga R. Importance of mixotrophic flagellates during the ice-free season in lakes located along an elevational gradient. Aquat. Sci. 2019;81:45. doi: 10.1007/s00027-019-0643-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Avcı B, Krüger K, Fuchs BM, Teeling H, Amann RI. Polysaccharide niche partitioning of distinct Polaribacter clades during North Sea spring algal blooms. ISME J. 2020;14:1369–1383. doi: 10.1038/s41396-020-0601-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Sichert A, et al. Verrucomicrobia use hundreds of enzymes to digest the algal polysaccharide fucoidan. Nat. Microbiol. 2020;5:1026–1039. doi: 10.1038/s41564-020-0720-2. [DOI] [PubMed] [Google Scholar]
- 46.Zhou J, Lyu Y, Richlen M, Anderson DM, Cai Z. Quorum sensing is a language of chemical signals and plays an ecological role in algal–bacterial interactions. CRC Crit. Rev. Plant Sci. 2016;35:81–105. doi: 10.1080/07352689.2016.1172461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Croft MT, Lawrence AD, Raux-Deery E, Warren MJ, Smith AG. Algae acquire vitamin B12 through a symbiotic relationship with bacteria. Nature. 2005;438:90–93. doi: 10.1038/nature04056. [DOI] [PubMed] [Google Scholar]
- 48.Grossman A. Nutrient Acquisition: The Generation of Bioactive Vitamin B12 by Microalgae. Curr. Biol.: CB vol. 2016;26:R319–R321. doi: 10.1016/j.cub.2016.02.047. [DOI] [PubMed] [Google Scholar]
- 49.Segev, E. et al. Dynamic metabolic exchange governs a marine algal-bacterial interaction. Elife5, e17473 (2016). [DOI] [PMC free article] [PubMed]
- 50.Hood E, Battin TJ, Fellman J, O’Neel S, Spencer RGM. Storage and release of organic carbon from glaciers and ice sheets. Nat. Geosci. 2015;8:91–96. doi: 10.1038/ngeo2331. [DOI] [Google Scholar]
- 51.Fellman JB, et al. Evidence for the assimilation of ancient glacier organic carbon in a proglacial stream food web. Limnol. Oceanogr. 2015;60:1118–1128. doi: 10.1002/lno.10088. [DOI] [Google Scholar]
- 52.Singer GA, et al. Biogeochemically diverse organic matter in Alpine glaciers and its downstream fate. Nat. Geosci. 2012;5:710–714. doi: 10.1038/ngeo1581. [DOI] [Google Scholar]
- 53.Boix Canadell M, Escoffier N, Ulseth AJ, Lane SN, Battin TJ. Alpine glacier shrinkage drives shift in dissolved organic carbon export from quasi‐chemostasis to transport limitation. Geophys. Res. Lett. 2019;46:8872–8881. doi: 10.1029/2019GL083424. [DOI] [Google Scholar]
- 54.Anesio AM, Lutz S, Chrismas NAM, Benning LG. The microbiome of glaciers and ice sheets. NPJ Biofilms Microbiomes. 2017;3:10. doi: 10.1038/s41522-017-0019-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Tranter, M., Mills, R. & Raiswell, R. Chemical weathering reactions in Alpine glacial meltwaters. in International symposium on water-rock interaction. 687–690 (1989).
- 56.Tranter M, Brown G, Raiswell R, Sharp M, Gurnell A. A conceptual model of solute acquisition by Alpine glacial meltwaters. J. Glaciol. 1993;39:573–581. doi: 10.1017/S0022143000016464. [DOI] [Google Scholar]
- 57.St Pierre KA, et al. Proglacial freshwaters are significant and previously unrecognized sinks of atmospheric CO2. Proc. Natl Acad. Sci. U. S. A. 2019;116:17690–17695. doi: 10.1073/pnas.1904241116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Dunham, E. C., Dore, J. E., Skidmore, M. L., Roden, E. E. & Boyd, E. S. Lithogenic hydrogen supports microbial primary production in subglacial and proglacial environments. Proc. Natl. Acad. Sci. U. S. A. 118, (2021). [DOI] [PMC free article] [PubMed]
- 59.Hernández, M. et al. Reconstructing genomes of carbon monoxide oxidisers in volcanic deposits including members of the Class Ktedonobacteria. Microorganisms8, 1880 (2020). [DOI] [PMC free article] [PubMed]
- 60.Quick AM, et al. Nitrous oxide from streams and rivers: a review of primary biogeochemical pathways and environmental variables. Earth-Sci. Rev. 2019;191:224–262. doi: 10.1016/j.earscirev.2019.02.021. [DOI] [Google Scholar]
- 61.Kuypers MMM, Marchant HK, Kartal B. The microbial nitrogen-cycling network. Nat. Rev. Microbiol. 2018;16:263–276. doi: 10.1038/nrmicro.2018.9. [DOI] [PubMed] [Google Scholar]
- 62.Ren Z, Gao H, Elser JJ, Zhao Q. Microbial functional genes elucidate environmental drivers of biofilm metabolism in glacier-fed streams. Sci. Rep. 2017;7:12668. doi: 10.1038/s41598-017-13086-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Gooseff MN, McKnight DM, Runkel RL, Duff JH. Denitrification and hydrologic transient storage in a glacial meltwater stream, McMurdo Dry Valleys, Antarctica. Limnol. Oceanogr. 2004;49:1884–1895. doi: 10.4319/lo.2004.49.5.1884. [DOI] [Google Scholar]
- 64.Varin T, Lovejoy C, Jungblut AD, Vincent WF, Corbeil J. Metagenomic profiling of Arctic microbial mat communities as nutrient scavenging and recycling systems. Limnol. Oceanogr. 2010;55:1901–1911. doi: 10.4319/lo.2010.55.5.1901. [DOI] [Google Scholar]
- 65.Kohler TJ, et al. Patterns and drivers of extracellular enzyme activity in New Zealand glacier-fed streams. Front. Microbiol. 2020;11:591465. doi: 10.3389/fmicb.2020.591465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Alves RJE, et al. Ammonia oxidation by the Arctic terrestrial thaumarchaeote candidatus nitrosocosmicus arcticus is stimulated by increasing temperatures. Front. Microbiol. 2019;10:1571. doi: 10.3389/fmicb.2019.01571. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Könneke M, et al. Ammonia-oxidizing archaea use the most energy-efficient aerobic pathway for CO2 fixation. Proc. Natl Acad. Sci. U. S. A. 2014;111:8239–8244. doi: 10.1073/pnas.1402028111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Battin TJ, Kaplan LA, Denis Newbold J, Hansen CME. Contributions of microbial biofilms to ecosystem processes in stream mesocosms. Nature. 2003;426:439–442. doi: 10.1038/nature02152. [DOI] [PubMed] [Google Scholar]
- 69.Cockell CS, et al. Influence of ice and snow covers on the UV exposure of terrestrial microbial communities: dosimetric studies. J. Photochem. Photobiol. B. 2002;68:23–32. doi: 10.1016/S1011-1344(02)00327-5. [DOI] [PubMed] [Google Scholar]
- 70.Sommaruga R. The role of solar UV radiation in the ecology of alpine lakes. J. Photochem. Photobiol. B. 2001;62:35–42. doi: 10.1016/S1011-1344(01)00154-3. [DOI] [PubMed] [Google Scholar]
- 71.Margesin R, Collins T. Microbial ecology of the cryosphere (glacial and permafrost habitats): current knowledge. Appl. Microbiol. Biotechnol. 2019;103:2537–2549. doi: 10.1007/s00253-019-09631-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.De Maayer P, Anderson D, Cary C, Cowan DA. Some like it cold: understanding the survival strategies of psychrophiles. EMBO Rep. 2014;15:508–517. doi: 10.1002/embr.201338170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Tribelli, P. M. & López, N. I. Reporting key features in cold-adapted bacteria. Life8, 8 (2018). [DOI] [PMC free article] [PubMed]
- 74.Varin T, Lovejoy C, Jungblut AD, Vincent WF, Corbeil J. Metagenomic analysis of stress genes in microbial mat communities from Antarctica and the High Arctic. Appl. Environ. Microbiol. 2012;78:549–559. doi: 10.1128/AEM.06354-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Alonso-Sáez L, et al. Winter bloom of a rare betaproteobacterium in the Arctic Ocean. Front. Microbiol. 2014;5:425. doi: 10.3389/fmicb.2014.00425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Hornung C, et al. The Janthinobacterium sp. HH01 genome encodes a homologue of the V. cholerae CqsA and L. pneumophila LqsA autoinducer synthases. PLoS One. 2013;8:e55045. doi: 10.1371/journal.pone.0055045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Maillot NJ, Honoré FA, Byrne D, Méjean V, Genest O. Cold adaptation in the environmental bacterium Shewanella oneidensis is controlled by a J-domain co-chaperone protein network. Commun. Biol. 2019;2:323. doi: 10.1038/s42003-019-0567-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Konings WN, Albers S-V, Koning S, Driessen AJM. The cell membrane plays a crucial role in survival of bacteria and archaea in extreme environments. Antonie Van. Leeuwenhoek. 2002;81:61–72. doi: 10.1023/A:1020573408652. [DOI] [PubMed] [Google Scholar]
- 79.Methé BA, et al. The psychrophilic lifestyle as revealed by the genome sequence of Colwellia psychrerythraea 34H through genomic and proteomic analyses. Proc. Natl Acad. Sci. U. S. A. 2005;102:10913–10918. doi: 10.1073/pnas.0504766102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Ayala-del-Río HL, et al. The genome sequence of Psychrobacter arcticus 273-4, a psychroactive Siberian permafrost bacterium, reveals mechanisms for adaptation to low-temperature growth. Appl. Environ. Microbiol. 2010;76:2304–2312. doi: 10.1128/AEM.02101-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Mykytczuk NCS, et al. Bacterial growth at −15 °C; molecular insights from the permafrost bacterium Planococcus halocryophilus Or1. ISME J. 2013;7:1211–1226. doi: 10.1038/ismej.2013.8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Ting L, et al. Cold adaptation in the marine bacterium, Sphingopyxis alaskensis, assessed using quantitative proteomics. Environ. Microbiol. 2010;12:2658–2676. doi: 10.1111/j.1462-2920.2010.02235.x. [DOI] [PubMed] [Google Scholar]
- 83.Tribelli PM, et al. Novel essential role of ethanol oxidation genes at low temperature revealed by transcriptome analysis in the Antarctic bacterium Pseudomonas extremaustralis. PLoS One. 2015;10:e0145353. doi: 10.1371/journal.pone.0145353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Blagojevic DP, Grubor-Lajsic GN, Spasic MB. Cold defence responses: the role of oxidative stress. Front. Biosci. 2011;3:416–427. doi: 10.2741/s161. [DOI] [PubMed] [Google Scholar]
- 85.Busi SB, et al. Optimised biomolecular extraction for metagenomic analysis of microbial biofilms from high-mountain streams. PeerJ. 2020;8:e9973. doi: 10.7717/peerj.9973. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Fodelianakis, S. et al. Microdiversity characterizes prevalent phylogenetic clades in the glacier-fed stream microbiome. ISME J. 10.1038/s41396-021-01106-6 (2021). [DOI] [PMC free article] [PubMed]
- 87.Stoeck T, et al. Multiple marker parallel tag environmental DNA sequencing reveals a highly complex eukaryotic community in marine anoxic water. Mol. Ecol. 2010;19(Suppl 1):21–31. doi: 10.1111/j.1365-294X.2009.04480.x. [DOI] [PubMed] [Google Scholar]
- 88.Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Bolyen E, et al. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat. Biotechnol. 2019;37:852–857. doi: 10.1038/s41587-019-0209-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Quast C, et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2013;41:D590–D596. doi: 10.1093/nar/gks1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Dixon P. VEGAN, a package of R functions for community ecology. J. Vegetation Sci. 2003;14:927–930. doi: 10.1111/j.1654-1103.2003.tb02228.x. [DOI] [Google Scholar]
- 92.Foster ZSL, Sharpton TJ, Grünwald NJ. Metacoder: an R package for visualization and manipulation of community taxonomic diversity data. PLoS Comput. Biol. 2017;13:e1005404. doi: 10.1371/journal.pcbi.1005404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Burns AR, et al. Contribution of neutral processes to the assembly of gut microbial communities in the zebrafish over host development. ISME J. 2016;10:655–664. doi: 10.1038/ismej.2015.142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Gautreau, I. E7805 NEBNext® UltraTM II FS DNA Library Prep Kit for Illumina® Protocol for use with Inputs ≤ 100 ng. https://www.protocols.io/view/e7805-nebnext-ultra-ii-fs-dna-library-prep-kit-for-k8tczwn (2020).
- 95.Narayanasamy S, et al. IMP: a pipeline for reproducible reference-independent integrated metagenomic and metatranscriptomic analyses. Genome Biol. 2016;17:260. doi: 10.1186/s13059-016-1116-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Heintz-Buschart A, et al. Integrated multi-omics of the human gut microbiome in a case study of familial type 1 diabetes. Nat. Microbiol. 2016;2:16180. doi: 10.1038/nmicrobiol.2016.180. [DOI] [PubMed] [Google Scholar]
- 97.Li D, Liu C-M, Luo R, Sadakane K, Lam T-W. MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics. 2015;31:1674–1676. doi: 10.1093/bioinformatics/btv033. [DOI] [PubMed] [Google Scholar]
- 98.Kang DD, et al. MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies. PeerJ. 2019;7:e7359. doi: 10.7717/peerj.7359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Wu Y-W, Simmons BA, Singer SW. MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets. Bioinformatics. 2016;32:605–607. doi: 10.1093/bioinformatics/btv638. [DOI] [PubMed] [Google Scholar]
- 100.Hickl, O., Queirós, P., Wilmes, P., May, P. & Heintz-Buschart, A. binny: an automated binning algorithm to recover high-quality genomes from complex metagenomic datasets. bioRxiv 2021.12.22.473795 10.1101/2021.12.22.473795. (2021). [DOI] [PMC free article] [PubMed]
- 101.Sieber CMK, et al. Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy. Nat. Microbiol. 2018;3:836–843. doi: 10.1038/s41564-018-0171-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Chaumeil, P.-A., Mussig, A. J., Hugenholtz, P. & Parks, D. H. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics.10.1093/bioinformatics/btz848 (2019). [DOI] [PMC free article] [PubMed]
- 104.Kieft K, Zhou Z, Anantharaman K. VIBRANT: automated recovery, annotation and curation of microbial viruses, and evaluation of viral community function from genomic sequences. Microbiome. 2020;8:90. doi: 10.1186/s40168-020-00867-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Zablocki, O., Jang, H. B., Bolduc, B. & Sullivan, M. B. vConTACT 2: A tool to automate genome-based prokaryotic viral taxonomy. in Plant and Animal Genome XXVII Conference (January 12-16, 2019) (PAG, 2019).
- 106.Nayfach S, et al. CheckV assesses the quality and completeness of metagenome-assembled viral genomes. Nat. Biotechnol. 2021;39:578–585. doi: 10.1038/s41587-020-00774-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Krinos, A. I., Hu, S. K., Cohen, N. R. & Alexander, H. EUKulele: Taxonomic annotation of the unsung eukaryotic microbes. arXiv [q-bio.PE] (2020).
- 108.Queirós, P., Delogu, F., Hickl, O., May, P. & Wilmes, P. Mantis: flexible and consensus-driven genome annotation. bioRxiv (2020). [DOI] [PMC free article] [PubMed]
- 109.Zhou, Z. et al. METABOLIC: High-throughput profiling of microbial genomes for functional traits, biogeochemistry, and community-scale metabolic networks. bioRxiv 761643. 10.1101/761643 (2020). [DOI] [PMC free article] [PubMed]
- 110.McDaniel, E. A., Anantharaman, K. & McMahon, K. D. metabolisHMM: Phylogenomic analysis for exploration of microbial phylogenies and metabolic pathways. bioRxiv 2019.12.20.884627. 10.1101/2019.12.20.884627 (2019).
- 111.Eren AM, et al. Anvi’o: an advanced analysis and visualization platform for ‘omics data. PeerJ. 2015;3:e1319. doi: 10.7717/peerj.1319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Weissman, J. L., Hou, S. & Fuhrman, J. A. Estimating maximal microbial growth rates from cultures, metagenomes, and single cells via codon usage patterns. Proc. Natl. Acad. Sci. U. S. A. 118, (2021). [DOI] [PMC free article] [PubMed]
- 113.Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Bushnell, B. BBMap: A fast, accurate, splice-aware aligner. https://www.osti.gov/biblio/1241166 (2014).
- 115.Wood DE, Lu J, Langmead B. Improved metagenomic analysis with Kraken 2. Genome Biol. 2019;20:257. doi: 10.1186/s13059-019-1891-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.West PT, Probst AJ, Grigoriev IV, Thomas BC, Banfield JF. Genome-reconstruction for eukaryotes from complex natural microbial communities. Genome Res. 2018;28:569–580. doi: 10.1101/gr.228429.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Alneberg, J. et al. CONCOCT: Clustering cONtigs on COverage and ComposiTion. arXiv [q-bio.GN] (2013).
- 118.Levy Karin E, Mirdita M, Söding J. MetaEuk-sensitive, high-throughput gene discovery, and annotation for large-scale eukaryotic metagenomics. Microbiome. 2020;8:48. doi: 10.1186/s40168-020-00808-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Huerta-Cepas J, et al. eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2019;47:D309–D314. doi: 10.1093/nar/gky1085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Seppey M, Manni M, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness. Methods Mol. Biol. 2019;1962:227–245. doi: 10.1007/978-1-4939-9173-0_14. [DOI] [PubMed] [Google Scholar]
- 121.Saary P, Mitchell AL, Finn RD. Estimating the quality of eukaryotic genomes recovered from metagenomic analysis with EukCC. Genome Biol. 2020;21:244. doi: 10.1186/s13059-020-02155-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 123.Inferring Correlation Networks from Genomic Survey Data. https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002687. [DOI] [PMC free article] [PubMed]
- 124.Kurtz ZD, et al. Sparse and compositionally robust inference of microbial ecological networks. PLoS Comput. Biol. 2015;11:e1004226. doi: 10.1371/journal.pcbi.1004226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Csardi G, Nepusz T. The igraph software package for complex network research. InterJournal Complex. Syst. 2006;1695:1–9. [Google Scholar]
- 126.Dormann CF, Frund J, Bluthgen N, Gruber B. Indices, graphs and null models: analyzing bipartite ecological networks. Open Ecol. J. 2009;2:7–24. doi: 10.2174/1874213000902010007. [DOI] [Google Scholar]
- 127.Pruitt KD, Tatusova T, Maglott DR. NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 2007;35:D61–D65. doi: 10.1093/nar/gkl842. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Price MN, Dehal PS, Arkin AP. FastTree 2-approximately maximum-likelihood trees for large alignments. PLoS One. 2010;5:e9490. doi: 10.1371/journal.pone.0009490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 130.Lee MD. GToTree: a user-friendly workflow for phylogenomics. Bioinformatics. 2019;35:4162–4164. doi: 10.1093/bioinformatics/btz188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Eddy SR. Accelerated profile HMM searches. PLoS Comput. Biol. 2011;7:e1002195. doi: 10.1371/journal.pcbi.1002195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Hyatt D, et al. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinforma. 2010;11:119. doi: 10.1186/1471-2105-11-119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–1973. doi: 10.1093/bioinformatics/btp348. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 134.Tange, O. GNU Parallel 2018. (Lulu.com, 2018).
- 135.Team, R. C. & Others. R: A language and environment for statistical computing. (2013).
- 136.Kahle D, Wickham H. Ggmap: spatial visualization with ggplot2. R. J. 2013;5:144. doi: 10.32614/RJ-2013-014. [DOI] [Google Scholar]
- 137.Graham ED, Heidelberg JF, Tully BJ. Potential for primary productivity in a globally-distributed bacterial phototroph. ISME J. 2018;12:1861–1866. doi: 10.1038/s41396-018-0091-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 138.Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 1–21 (2014). [DOI] [PMC free article] [PubMed]
- 139.kevinblighe/EnhancedVolcano: Publication-ready volcano plots with enhanced colouring and labeling. https://github.com/kevinblighe/EnhancedVolcano.
- 140.Wickham H. ggplot2: ggplot2. Wiley Interdiscip. Rev. Comput. Stat. 2011;3:180–185. doi: 10.1002/wics.147. [DOI] [Google Scholar]
- 141.Bah, T. Inkscape: guide to a vector drawing program. (Prentice Hall Press, 2007).
- 142.Varrette, S., Bouvry, P., Cartiaux, H. & Georgatos, F. Management of an academic HPC cluster: The UL experience. In 2014 International Conference on High Performance Computing Simulation (HPCS) 959–967 (2014).
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Raw sequencing data samples and the MAGs are available at NCBI’s sequence read archive under BioProject accession PRJNA733707. The Biosample accession IDs and the metadata associated with each sample are listed under Supplementary Data 6. A snippet of the results and source data generated and used in this study have been deposited in Zenodo at 10.5281/zenodo.5545722. Data used to generate the figures are also provided as a ‘Source Data’ file. Source data are provided with this paper.
The detailed code used for the downstream functional and growth analyses is available at https://git-r3lab.uni.lu/susheel.busi/nomis_pipeline and 10.5281/zenodo.6372573. The custom pipeline for eukaryote analyses can be found here: https://github.com/Mass23/NOMIS_ENSEMBLE/tree/coassembly. Subsequent binning and manual refinement of eukaryotic MAGs was done as described here: https://git-r3lab.uni.lu/susheel.busi/nomis_pipeline/-/blob/master/workflow/notes/MiscEUKMAGs.md.