Abstract
Although microbes mediate much of the biogeochemical cycling in freshwater, the categories of carbon and nutrients currently used in models of freshwater biogeochemical cycling are too broad to be relevant on a microbial scale. One way to improve these models is to incorporate microbial data. Here, we analyze both genes and genomes from three metagenomic time series and propose specific roles for microbial taxa in freshwater biogeochemical cycles. Our metagenomic time series span multiple years and originate from a eutrophic lake (Lake Mendota) and a humic lake (Trout Bog Lake) with contrasting water chemistry. Our analysis highlights the role of polyamines in the nitrogen cycle, the diversity of diazotrophs between lake types, the balance of assimilatory vs. dissimilatory sulfate reduction in freshwater, the various associations between types of phototrophy and carbon fixation, and the density and diversity of glycoside hydrolases in freshwater microbes. We also investigated aspects of central metabolism such as hydrogen metabolism, oxidative phosphorylation, methylotrophy, and sugar degradation. Finally, by analyzing the dynamics over time in nitrogen fixation genes and Cyanobacteria genomes, we show that the potential for nitrogen fixation is linked to specific populations in Lake Mendota. This work represents an important step towards incorporating microbial data into ecosystem models and provides a better understanding of how microbes may participate in freshwater biogeochemical cycling.
Keywords: Freshwater, Metabolism, Carbon cycling, Nutrient cycling, Microbial communities
Introduction
Lakes receive nutrients from surrounding terrestrial ecosystems (Williamson et al., 2008), placing lakes as “hotspots” for carbon and nutrient cycling in the landscape (Butman et al., 2015). Approximately half of the carbon received by freshwater ecosystems from the terrestrial landscape is emitted as carbon dioxide (0.2 Pg C/year) or buried in sediments (0.8 Pg C/year) (Cole et al., 2007). Similarly, 20% of global denitrification is estimated to occur in freshwater, roughly equivalent to the amount of denitrification taking place in soils (22%) and about a third of the amount occurring in oceans (58%) (Seitzinger et al., 2006).
Most of this freshwater biogeochemical cycling is performed by microbial communities, yet the categories in the models and budgets used to study these cycles are too broad to incorporate microbial data. For example, carbon compounds are often classified as labile and recalcitrant (Guillemette & Del Giorgio, 2011), or autochthonous and allochthonous (Jonsson et al., 2001). While some work has been done on microbial responses to these carbon categories (Eiler et al., 2003; Kritzberg et al., 2004), using such broad categorizations masks the complexity of microbial ecophysiology. Incorporating microbially-mediated transformations of specific compounds in freshwater would significantly improve the accuracy and predictive power of biogeochemical cycling models.
However, linking microbial taxa to specific biogeochemical functions is a challenging task. Previous research has investigated substrate use by freshwater taxa using cultured isolates and microscopy fluorescence in situ hybridization coupled to microautoradiography to detect incorporation of labeled substrates in uncultured lineages (Hahn et al., 2012; Salcher, Posch & Pernthaler, 2013). While these techniques are definitive, they cannot be scaled to investigate many community members simultaneously. Other research has used scalable genomics techniques to link microbial taxa to predicted biogeochemical functions, generating hypotheses that can be tested using more targeted experiments. Sequencing data has previously been employed to great effect to analyze the distribution of functional marker genes in freshwater (Ramachandran & Walsh, 2015; Peura et al., 2015) and to predict metabolic potential in freshwater genomes (Salcher et al., 2015; Eiler et al., 2016; Hamilton et al., 2017; He et al., 2017; Cabello-Yeves et al., 2018).
In this research, we combined insights from both genes and genomes in three freshwater metagenomic time series to link function to taxonomy at the community level. Our metagenomic time series included multiple years of sampling for microbial DNA from two lakes in Wisconsin, USA: Lake Mendota, a large eutrophic lake, and Trout Bog Lake, a small humic lake. Mendota and Trout Bog are ideal sites for comparative time series metagenomics because of their contrasting limnological attributes and their history of extensive environmental sampling by the North Temperate Lakes–Long Term Ecological Research program (NTL–LTER, http://lter.limnology.wisc.edu) (Table 1; Table S1). They have also been the subjects of many prior efforts to document and understand freshwater bacterial community diversity and dynamics (Shade et al., 2007; Linz et al., 2017; Hall et al., 2017). We describe both predicted pathways in metagenome-assembled genomes (MAGs) and the distributions of functional marker genes to provide a comprehensive overview of microbially-mediated biogeochemical cycling in these two contrasting freshwater lakes.
Table 1. Characteristics of Lake Mendota and Trout Bog Lake.
Lake Mendota | Trout Bog Epilimnion | Trout Bog Hypolimnion | |
---|---|---|---|
Location | Madison, WI | Boulder Junction, WI | |
Coordinates | 43.107055, −89.411729 | 46.041172, −89.686297 | |
Depth of lake (m) | 25.3 | 7.9 | |
Surface area of lake (km2) | 39.61 | 0.01 | |
Microbial sampling depth range (m) | 0–12 | 0–2 | 2–7 |
Years sampled | 2008–2012 | 2007–2009 | 2007–2009 |
Oxygenation | Oxic | Oxic | Suboxic/Anoxic |
pH | 8.6 (0.4) | 5.0 (0.2) | 5.3 (0.2) |
Dissolved inorganic carbon (ppm) | 41 (5) | 2.6 (2.2) | 6.9 (3.1) |
Dissolved organic carbon (ppm) | 6.0 (6.2) | 18 (5) | 22 (6) |
Total dissolved nitrogen (ppb) | 923 (487) | 637 (204) | 1,392 (1,031) |
Total nitrogen (ppb) | 1,099 (521) | 831 (316) | 1,684 (1,563) |
Total dissolved phosphorus (ppb) | 44 (51) | 15 (14) | 69 (98) |
Total phosphorus (ppb) | 64 (52) | 32 (14) | 95 (126) |
Sulfate (ppm) | 17 (1) | 1.2 (0.3) | 0.9 (0.7) |
Note:
Water from Mendota and Trout Bog was sampled weekly during the ice-free periods using an integrated water column sampler, and bacteria were collected on a 0.22 micron filter. Metagenomic sequencing was performed on DNA extracted from filters collected in 2008–2012 from Mendota and in 2007–2009 from Trout Bog. The epilimnion (upper thermal layer) was sampled in both lakes, while the hypolimnion (bottom thermal layer) was sampled only in Trout Bog. Chemistry data were collected by NTL–LTER from depth discrete samples taken from zero to four meters for Mendota, zero meters for the Trout Bog epilimnion, and three and seven meters for the Trout Bog hypolimnion. Values reported here are the means of all measurements in the sampling time span for each lake, with standard deviations reported in parentheses.
Throughout this paper, we highlight several functional categories with particularly interesting results. We discuss differences in the identity and diversity of potential nitrogen fixing bacteria in Trout Bog vs. Mendota, as well as the high prevalence of genes related to polyamines, which are proposed to be an important component of the dissolved organic nitrogen pool. We observed that assimilatory sulfate reduction pathways were encoded more frequently than dissimilatory sulfate reduction pathways, in contrast to what is thought to be the case in marine systems. We split the broader category of primary production into different types of phototrophy, including photosynthesis performed by Cyanobacteria, green sulfur bacteria, and aerobic anoxygenic phototrophs, and analyzed their associated carbon fixation pathways (when present). Using annotations of carbohydrate-active enzymes, we compared the potential for complex carbon degradation and describe significant differences in the coding density and diversity of these encoded enzymes between lakes. To compare more basic properties of freshwater microbes, we assessed differences between lakes in central microbial metabolisms such as hydrogen metabolism, oxidative phosphorylation, methylotrophy, and degradation of low molecular weight carbon. Finally, we show how trends over time in the abundances of both nitrogen fixation marker genes and Cyanobacteria MAGs likely encoding nitrogen fixation were highly correlated, demonstrating how genomic data can reveal dynamics in both functions and taxa.
Methods
Sampling
Samples were collected from Lake Mendota and Trout Bog Lake as previously described (Bendall et al., 2016). Briefly, integrated samples of the water column were collected during the ice-free periods of 2007–2009 in Trout Bog and 2008–2012 in Mendota. In Mendota, the top 12 m of the water column were sampled, approximating the epilimnion (upper, oxygenated, and warm thermal layer). The epilimnion and hypolimnion (bottom, anoxic, and cold thermal layer) of Trout Bog were sampled separately at depths determined by measuring temperature and dissolved oxygen concentrations. The sampling depths were most often zero to two meters for the epilimnion and two to seven meters for the hypolimnion. DNA was collected by filtering 150 mL of the integrated water samples through 0.2-μm pore size polyethersulfone Supor filters (Pall Corp., Port Washington, NY, USA). Filters were stored at −80 °C until extraction using the FastDNA Spin Kit (MP Biomedicals, Burlingame, CA, USA) with minor modifications (Shade et al., 2007).
Sequencing
As previously described (Bendall et al., 2016; Roux et al., 2017), metagenomic sequencing was performed by the Department of Energy Joint Genome Institute (DOE JGI) (Walnut Creek, CA, USA). A total of 94 samples collected over 5 years were sequenced for Mendota, while 47 metagenomes collected over 3 years were sequenced for each layer in Trout Bog (Table S2). Samples were sequenced on the Illumina HiSeq 2500 platform (Illumina, San Diego, CA, USA), except for four libraries (two from each layer of Trout Bog) that were sequenced using the Illumina TruSeq protocol on the Illumina GAIIx platform; all samples were sequenced using paired ends with read lengths of 150 base pairs (Data S1). Paired-end sequencing reads were merged with FLASH v1.0.3 with a mismatch value of less than 0.25 and a minimum of 10 overlapping bases (Magooc & Salzberg, 2011). 16S rRNA gene amplicon sequencing was also performed on samples collected with the same method over the same time periods. These datasets are available under DOE JGI project IDs 1078703 and 1018581 for Trout Bog and Mendota, respectively. Samples from Trout Bog were sequenced on the 454 GS FLX-Titanium platform (Roche, Branford, CT, USA) targeting the V8 hypervariable region (primer 1392R: ACGGGCGGTGTGTRC) (Engelbrektson et al., 2010), and sequences were trimmed to 324 base pairs using VSEARCH (v2.3.4) (Rognes et al., 2016). Samples from Mendota were sequenced on an Illumina MiSeq, and the V4 region was targeted using paired-end sequencing (primers 525F: GTGCCAGCMGCCGCGGTAA and 806R: GGACTACHVGGGTWTCTAAT) (Caporaso et al., 2012). Both datasets were trimmed based on alignment quality and chimera checking using mothur v.1.39.5 (Schloss et al., 2009). Unclustered, unique sequences were classified using a custom database of freshwater 16S rRNA gene sequences (Newton et al., 2011) and the Greengenes database (DeSantis et al., 2006) with the classification pipeline TaxAss (Rohwer et al., 2018).
Assembly and binning
To recover MAGs, metagenomic reads from the same sampling sites (Mendota’s epilimnion, Trout Bog’s epilimnion, and Trout Bog’s hypolimnion) were pooled (Table S2) and then assembled as previously described (Bendall et al., 2016; Roux et al., 2017). In metagenomes from Trout Bog, this assembly was performed using SOAPdenovo2 at various k-mer sizes (Luo et al., 2012), and the resulting contigs were combined using Minimus (Sommer et al., 2007). In Mendota, merged reads were assembled using Ray v2.2.0 with a single k-mer size (Boisvert et al., 2012). Contigs from the combined assemblies were binned using MetaBAT (“-veryspecific” settings, minimum bin size of 20 kb, and minimum contig size of 2.5 kb) (Kang et al., 2015), and reads from individual metagenomes were mapped to the assembled contigs using the Burrows–Wheeler Aligner (≥95% sequence identity, n = 0.05) (Li & Durbin, 2010), which allowed time-series resolved binning (Table S2). DOE JGI’s Integrated Microbial Genome (IMG) database tool (https://img.jgi.doe.gov/mer/) (Markowitz et al., 2012) was used for gene prediction and annotation. Annotated MAGs can be retrieved directly from the IMG database and JGI’s Genome Portal using the IMG Genome ID provided (also known as IMG Taxon ID). MAG completeness and contamination/redundancy was estimated based on the presence of a core set of genes with CheckM (Rinke et al., 2013; Parks et al., 2015), and MAGs were taxonomically classified using Phylosift (Darling et al., 2014) or the phylogeny-based “guilt by association” method (Hamilton et al., 2017). As recommended by Bowers et al. (2017), only MAGs that were at least approximately 50% complete with less than 10% estimated contamination/redundancy (meeting the MIMARKS definition of a medium or high quality MAG) (Bowers et al., 2017) were included in the study.
A total of 193 medium to high quality bacterial MAGs were recovered from the three combined time series metagenomes in Trout Bog and Mendota: 99 from Mendota, 31 from Trout Bog’s epilimnion, and 63 from Trout Bog’s hypolimnion (Data S2). These population genomes ranged in estimated completeness from 50 to 99% based on CheckM estimates. Several MAGs from Trout Bog’s epilimnion and hypolimnion appeared to belong to the same population based on average nucleotide identities greater than 99% calculated using DOE JGI’s ANI calculator (Data S3) (Varghese et al., 2015). This is likely because assembly and binning were carried out separately for each thermal layer, even though some populations were present throughout the water column.
Functional marker gene analysis
To analyze functional marker genes in the unassembled, unpooled metagenomes, we used a curated database of reference protein sequences (Data S4) (Anantharaman et al., 2016) and identified open reading frames (ORFs) in our unassembled metagenomic time series using Prodigal (Hyatt et al., 2010). This analysis was conducted on merged reads. The protein sequences and ORFs were compared using BLASTx (Camacho et al., 2009) with a cutoff of 30% identity. Read abundance was normalized by metagenome size for plotting. We chose to perform this analysis because gene content in unassembled metagenomes is likely more quantitative and more representative of the entire microbial community than gene content in the MAGs, due to limitations of assembly and binning algorithms.
These comparisons were run between the epilimnia of Trout Bog and Mendota, and between the epilimnion and hypolimnion of Trout Bog. We did not compare Mendota’s epilimnion to Trout Bog’s hypolimnion, as the multitude of factors differing between these two sites make this comparison illogical. We aggregated marker genes by function (as several marker genes from a phylogenetic range were included in the database for each type of function) and tested for significant differences in distribution between lakes and layers using a Wilcoxon rank sum test in R with a Bonferroni correction for multiple pairwise testing.
Pathway prediction
Pathways were analyzed by exporting IMG’s functional annotations for the MAGs, including KEGG, COG, PFAM, and TIGRFAM annotations, and mapping to pathways in the KEGG and MetaCyc databases as previously described (He et al., 2017). To score presence, a pathway needed at least 50% of the required enzymes encoded by genes in a MAG, and if there were steps unique to a pathway, at least one gene encoding each unique step. Putative pathway presence was aggregated by lake and phylum in order to link potential functions identified in the metagenomes to taxonomic groups that may perform those functions in each lake. Glycoside hydrolases were identified using dbCAN2’s implementation of HMMER (Zhang et al., 2018). Nitrogen usage in amino acids was calculated by taking the average number of nitrogen atoms in translated ORF sequences across each MAG.
Data formatting and plotting was performed in R (R Core Team, 2017) using the following packages: ggplot2 (Wickham, 2009.), cowplot (Wilke, 2017), reshape2 (Wickham, 2007), and APE (Paradis, Claude & Strimmer, 2004). The datasets, scripts, and intermediate files used to predict pathway presence and absence are available at https://github.com/McMahonLab/MAGstravaganza. Any future updates or refinements to this dataset will be available at this link.
Results and Discussion
Community functional marker gene analysis
Due to the contrasting water chemistry of Mendota and Trout Bog (Table 1; Table S1), we expected that microbial metabolisms would differ between lakes, and that these differences would be reflected in metagenomic gene content. To assess the potential for differing microbial metabolisms by lake, we tested whether functional marker genes identified in the unassembled merged metagenomic reads appeared more frequently in one lake or layer compared to the others. Many functional markers were found to be significantly more abundant in specific sites; more will be reported in each of the following sections (Fig. 1; Table S3). The recovered MAGs represent a diverse set of genomes assigned to taxonomic groups typically observed in freshwater (Fig. S2).
Figure 1. Analysis of marker gene abundances reveals differences between lakes and layers.
To assess potential differences in microbial metabolisms in our study sites, we predicted open reading frames in unassembled metagenomes using Prodigal and compared the resulting ORFs to a custom database of metabolic marker genes using BLAST. In these boxplots, significant differences in numbers of gene hits between sites were tested using a pairwise Wilcoxon rank sum test with a Bonferroni correction; significance was considered to be p < 0.05. A total of 94 metagenomes were tested for Mendota, while 47 metagenomes were tested in each layer of Trout Bog. Significant differences between the Trout Bog and Mendota epilimnia and between the Trout Bog epilimnion and hypolimnion are indicated by a green or a purple star, respectively. Significant differences between the Trout Bog hypolimnion and the Mendota epilimnion were not tested, as the large number of variables differing in these sites makes the comparison less informative. This analysis revealed differences in the number of marker genes observed by lake for many metabolic processes involved in carbon, nitrogen, and sulfur cycling. p-values of markers described in Fig. 1 and elsewhere in the text are reported in Table S3.
Overview of the MAGs dataset
To identify the phylogenetic affiliations of the microbes carrying marker genes and the co-occurrences of key marker genes within the same population genomes, we used MAGs from each metagenomic time series to predict metabolic pathways based on genomic content. To assess the diversity of our MAGs, we constructed an approximate maximum likelihood tree of all the MAGs in FastTree (Price, Dehal & Arkin, 2010) using whole genome alignments (Fig. S1). The tree is not intended to infer detailed evolutionary history, but to provide an overall picture of similarity between genomes. MAGs recovered are a diverse set of genomes assigned to taxa typically observed in freshwater (Fig. S2).
We also compared 16S rRNA gene amplicon sequencing data from the same timeframe as the metagenomes to confirm that the microbial community composition for these lakes and years was not “abnormal” compared to previous published studies (Fig. S3). The observed taxonomic compositions were consistent with other 16S-based studies carried out on these lakes (Linz et al., 2017; Hall et al., 2017) and with freshwater bacterial community compositions in general (Newton et al., 2011).
Nitrogen cycling
Nitrogen availability is an important factor structuring freshwater microbial communities. It is often a determining factor in a lake’s trophic status and a risk factor for the development of toxic cyanobacterial blooms (Smith, 2003; Beversdorf, Miller & McMahon, 2013). Because of the significance of nitrogen in freshwater, we analyzed nitrogen-related marker genes and identified MAGs containing characteristic nitrogen cycling pathways. We discovered significant differences in the abundances of marker genes, along with differences in phylogenetic affiliations of the MAGs containing these pathways.
Genes encoding nitrogenase, the key enzyme in nitrogen fixation, were observed most frequently in metagenomes from Trout Bog’s hypolimnion, followed by Trout Bog’s epilimnion, and lastly by Mendota’s epilimnion (Fig. 1; Table S3). We analyzed MAGs predicted to fix nitrogen and found differences in the identities of putative diazotrophs between the two ecosystems (Fig. 2; Fig. S1 and Data S5). In Mendota, two-thirds of MAGs encoding the nitrogen fixation pathway were classified as Cyanobacteria, while the other third was assigned to Betaproteobacteria and Gammaproteobacteria. Although not all Cyanobacteria fix nitrogen, previous studies of nitrogen fixation in Mendota have reported a strong correlation between this pathway and the cyanobacterium affiliated with Aphanizomenon (Beversdorf, Miller & McMahon, 2013). MAGs containing genes encoding nitrogen fixation were more phylogenetically diverse in Trout Bog and included Deltaproteobacteria, Gammaproteobacteria, Epsilonproteobacteria, Acidobacteria, Verrucomicrobia, Chlorobi, and Bacteroidetes. The higher diversity of diazotrophs in Trout Bog compared to Mendota suggests that nitrogen fixation may be a more advantageous trait in humic lakes than in eutrophic lakes.
Figure 2. Metabolisms in Mendota and Trout Bog.
A pathway was considered present when at least 50% of enzymes in a pathway were encoded in the genome and all enzymes unique to or required for the pathway were present. Putative pathway presence was aggregated by lake and phylum. This analysis can link potential functions identified in the metagenomes to taxonomic groups that may perform those functions. For example, MAGs that putatively fix carbon also likely fix nitrogen in both lakes. Similarly, putative degradation pathways for rhamnose, fucose, and galactose were frequently encoded within the same MAGs. Proteobacteria was split into classes due to the high diversity of this phylum. The number of MAGs assigned to each phylum is indicated in parentheses after the phylum name. Data for each genome can be found in Data S6.
We noted a high frequency of genes related to polyamine biosynthesis and degradation in our MAGs. We found that 94% of MAGs encoded pathways for polyamine synthesis, and 87% encoded pathways for polyamine degradation. These pathways were predicted in diverse MAGs from both lakes, including Actinobacteria as previously observed (Ghylin et al., 2014; Hamilton et al., 2017). While there is some evidence for the importance of polyamines in aquatic systems (Mou et al., 2011), the ecological roles of these compounds in freshwater are not fully resolved. Polyamines are known to play a critical but poorly understood role in bacterial metabolism (Igarashi & Kashiwagi, 1999), and the exchange of these nitrogen compounds between populations may be a factor structuring freshwater microbial communities. Polyamines can also result from the decomposition of amino acids, so higher trophic levels such as fish or zooplankton may represent an additional polyamine source (Al Bulushi et al., 2009). The frequent appearance of polyamine-related pathways in our MAGs lends support to the hypothesis that these compounds are important but largely unrecognized parts of the dissolved organic nitrogen and carbon pool in freshwater.
We analyzed genes for denitrification, including reductases for nitrous oxide, nitrite, and nitrate. Denitrification genes were observed most frequently in Trout Bog’s hypolimnion, with the exception of nitrous oxide reductase, which was found more frequently in Mendota. Genes encoding urease were not identified more frequently in any site. Denitrification and urea degradation pathways were predicted in similar proportions of MAGs from both lakes.
Sulfur cycling
Sulfur is another essential element in freshwater that is cycled between oxidized and reduced forms by microbes. Our marker gene analysis demonstrated that genes encoding sulfide:quinone reductase (for sulfide oxidation) and the sox pathway (for thiosulfate oxidation) were significantly more abundant in Trout Bog compared to Mendota, with no significant differences between the layers of Trout Bog (Fig. 1; Table S3). Genes encoding sulfite reductases were the least abundant sulfur cycling marker genes in all sites. Dissimilatory sulfite reductase was observed only in MAGs from Trout Bog, especially those classified as Chlorobiales. Because this enzyme is thought to operate in reverse in green sulfur-oxidizing phototrophs such as Chlorobiales (Holkenbrink et al., 2011), this may indicate an oxidation process rather than a reductive sulfur pathway. Sulfur oxidation pathways were observed in MAGs classified as Betaproteobacteria from both lakes and Epsilonproteobacteria in Trout Bog’s hypolimnion. Assimilatory sulfate reduction was overall the most common sulfur-related pathway identified in the MAGs (Fig. 2; Data S5).
Assimilatory sulfate reduction was observed more frequently than dissimilatory sulfate reduction; this suggests that sulfate is more commonly used for biosynthesis, while reduced forms of sulfur are used as electron donors for energy mobilization in these populations. This is in contrast to marine systems, where sulfate reduction holds a central role as an energy source for organotrophic energy acquisition (Bowles et al., 2014), although sulfate reduction could also be occurring in Mendota’s hypolimnion.
Phototrophy
Primary production (the coupling of photosynthesis and carbon fixation) is a critical component of the freshwater carbon cycle. To identify differences in routes of primary production between freshwater environments, we compared marker genes for carbon fixation across sites. Ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO), the marker gene for carbon fixation via the Calvin–Benson–Bassham (CBB) pathway, was most frequently observed in Trout Bog’s epilimnion (Fig. 1; Table S3).
We assessed the MAGs for photoautotrophy, expecting to find differences between our two study sites based on the observed contrasts in the functional marker gene analysis (Fig. 2; Data S5). In Mendota, the majority of MAGs encoding phototrophic pathways were classified as Cyanobacteria. These MAGs contained genes encoding enzymes in the CBB pathway. In Trout Bog, most MAGs encoding phototrophy were classified as Chlorobium clathratiforme, a species of Chlorobiales widespread in humic lakes (Karhunen et al., 2013). The Chlorobiales MAGs in Trout Bog contained genes encoding citrate lyase and other key enzymes in the reductive tricarboxylic acid (TCA) cycle, an alternative carbon fixation method commonly found in green sulfur bacteria such as Chlorobi (Kanao et al., 2002; Tang & Blankenship, 2010). Although we found genes annotated as the RuBisCO large subunit (rbcL) in some of the Chlorobiales MAGs, the reductive TCA cycle is the only carbon fixation pathway known to be active in cultured representatives of Chlorobiales. Homologs of rbcL have been previously identified in isolates of Chlorobium, and were associated with sulfur metabolism and oxidative stress (Hanson & Tabita, 2001). Given this information, it seems likely that this rbcL homolog encodes a function other than carbon fixation in our Chlorobiales MAGs. MAGs affiliated with Cyanobacteria in Mendota and Chlorobi in Trout Bog also possessed genes encoding diazotrophy, providing a link between carbon and nitrogen fixation. As both Chlorobi and Cyanobacteria are often abundant members of freshwater communities (Eiler & Bertilsson, 2004; Peura et al., 2012), their fixation capabilities may be relevant even at the ecosystem scale.
The potential for photoheterotrophy via the aerobic anoxygenic phototrophic (AAP) pathway was identified in several MAGs from all lake environments, especially from epilimnia, based on the presence of genes annotated as pufABCLMX, puhA, and pucAB encoding the core reaction center RC-LH1 (Martinez-Garcia et al., 2012b). Betaproteobacteria and Gammaproteobacteria, particularly MAGs classified as Burkholderiales (including PnecC, LD28, and Zwartia alpina), most often contained these genes, although they were not broadly shared across the phylum (Fig. 2). As AAP has previously been associated with freshwater Proteobacteria (Martinez-Garcia et al., 2012b), these results are not surprising. However, an Acidobacteria MAG from the Trout Bog epilimnion also contained genes suggesting AAP, which to our knowledge has not previously been found in this phylum.
Another form of photoheterotrophy previously identified in freshwater is the use of light-activated proteins such as rhodopsins (Martinez-Garcia et al., 2012b). We observed genes encoding rhodopsins in MAGs from each lake environment, but more frequently in Actinobacteria and Bacteroidetes MAGs from Mendota (Fig. 2). Trout Bog, especially the hypolimnion, harbored fewer and less diverse MAGs encoding rhodopsins than those from Mendota.
Glycoside hydrolases
Degradation of high-complexity, recalcitrant carbon compounds requires specialized enzymes, but wide availability of these carbon compounds can make complex carbon degradation an advantageous trait. One way to predict the ability to degrade high-complexity carbon in microbial populations is by identifying genes annotated as glycoside hydrolases (GHs), which encode enzymes that break the glycosidic bonds found in complex carbohydrates. However, it is important to keep in mind that GHs can also play structural roles in microbial cells in addition to the degradation of complex carbon substrates (Henrissat & Davies, 1997). A previous study of Verrucomicrobia MAGs from our dataset found that the profiles of GHs differed between Mendota and Trout Bog, potentially reflecting the differences in available carbon sources (He et al., 2017). We expanded this analysis of GHs to all of the MAGs in our dataset to identify differences in how populations from our two study sites degrade complex carbohydrates.
We calculated the coding density of GHs, defined as the percentage of coding regions in a MAG annotated as a GH, to identify differences in carbon metabolism between MAGs from different lake environments (Fig. 3; Data S6). Our GH coding density metric was significantly correlated with the diversity of GHs identified (r2 = 0.92, p < 2.2 × 10−16), which is an indicator of the number of substrates an organism can utilize. The MAGs with the highest GH coding densities were classified as Bacteroidales, Ignavibacteriales, Sphingobacteriales, and Verrucomicrobiales from Trout Bog’s hypolimnion. Two of these orders, Sphingobacteriales and Verrucomicrobiales, also contained MAGs with high GH coding densities in Mendota and Trout Bog’s epilimnion. There were several additional orders with high GH coding density that were unique to Mendota, including Mycoplasmatales (Tenericutes), Cytophagales (Bacteroidetes), Planctomycetales (Planctomycetes), and Puniceicoccales (Verrucomicrobia). Members of Verrucomicrobia have been previously identified as potential polysaccharide degraders in freshwater, although our coding densities for this phylum are higher than previously reported (Martinez-Garcia et al., 2012a). This may be due to differences in trophic status between our lakes and those previously studied, or it may be that MAGs capture more pan-genomic content than isolate or single amplified genomes. In concordance with their ability to hydrolytically degrade biopolymers to sugars, MAGs with high GH coding densities also contained putative degradation pathways for a variety of sugars (Fig. 2). The increased diversity of these genes found in Trout Bog’s hypolimnion compared to the other study sites suggests differing diversity and complexity of the available organic carbon.
Figure 3. Glycoside hydrolase content in the MAGs.
Annotations of GHs were used as an indication of complex carbon degradation. Genes potentially encoding GHs were identified and assigned CAZyme annotations using dbCAN2. GH coding density was calculated for each MAG and averaged by order and lake. While a few orders contained genes encoding glycoside hydrolases in all three sites, many orders were unique to each site. The orders with the highest coding densities were all found in the Trout Bog hypolimnion. Glycoside hydrolase diversity, an indicator of the range of substrates an organism can degrade, was significantly correlated with coding density (r2 = 0.92, p < 2.2 × 10−16). Proteobacteria was split into classes due to the high diversity of this phylum.
Central metabolism and simple carbon degradation
Freshwater microbes are exposed to a great variety of low-complexity carbon sources such as carbohydrates, carboxylic acids, and single-carbon (C1) compounds. The central metabolic pathways shared by most living cells are often an entry point for the least complex carbon compounds. Therefore, the specific routing of central metabolism predicted in our MAGs may reveal how low complexity carbon compounds are used within freshwater populations.
We investigated the types of cytochrome oxidases encoded in our MAGs to compare oxidative phosphorylation between lakes and layers (Fig. 2; Data S5). Cytochrome c oxidases, both aa3- and cbb3-type, were widespread in all three lake environments and frequently co-occurred within MAGs. aa3-type cytochromes are associated with high oxygen concentrations, while cbb3-type cytochromes are associated with low oxygen concentrations (Gong et al., 2018). The presence of genes encoding both types suggests the flexibility to operate under a range of oxygen concentrations.
Similarly, hydrogen metabolism can influence and be influenced by other aspects of nutrient usage. Iron-only hydrogenases were found primarily in MAGs from Trout Bog’s hypolimnion (Fig. 2; Table S3), consistent with their previously identified presence in anaerobic, often fermentative bacteria (Peters et al., 2015). Group 3 [Ni–Fe] hydrogenases were identified in MAGs belonging to Cyanobacteria and Chlorobiales in both lakes. This finding is consistent with the proposed function of Group 3d, which is to remove excess electrons produced by photosynthesis (Peters et al., 2015).
Low molecular weight carbohydrates may be derived either from autochthonous (such as algae) or allochthonous (such as terrestrial plants) sources (Giroldo, Augusto & Vieira, 2005; Ramanan et al., 2016). The pathway for mannose degradation was encoded in many MAGs from all three sites (Fig. 2; Data S5). Predicted pathways for rhamnose, fucose, and galactose degradation were often found within the same MAGs (including members of Planctomycetes and Verrucomicrobia from Mendota, and members of Bacteroidetes, Ignavibacteria, and Verrucomicrobia from Trout Bog). Xylose is a common freshwater sugar which has already been proposed as a potential carbon source for streamlined Actinobacteria (Ghylin et al., 2014). We confirmed this in our MAGs and also identified Bacteroidetes, Planctomycetes, and Verrucomicrobia from Mendota and Bacteroidetes and Verrucomicrobia from Trout Bog as additional potential xylose degraders. Genes for the degradation of glycolate, an acid produced by algae and consumed by heterotrophic bacteria (Paver & Kent, 2010), were identified in Cyanobacteria and Betaproteobacteria MAGs from Mendota and in Acidobacteria, Verrucomicrobia, Alpha-, Beta-, Gamma-, and Epsilonproteobacteria MAGs from Trout Bog. The pathways predicted in our MAGs may inform us about which low molecular weight compounds are important carbon substrates in freshwater.
Methylotrophy, the ability to grow solely on C1 compounds such as methane or methanol, was predicted in MAGs from both Trout Bog and Mendota. Putative pathways for methanol and methylamine degradation were found in MAGs classified as Methylophilales (now merged with Nitrosomonadales; Boden, Hutt & Rae, 2017), while Methylococcales MAGs were potential methane degraders based on the presence of genes encoding methane monooxygenase. Methylococcales MAGs from Trout Bog also encoded the pathway for nitrogen fixation, consistent with reports of nitrogen fixation in cultured isolates of this taxon (Bowman, Sly & Stackebrandt, 1995). Methylotrophy in cultured freshwater isolates from Methylococcales and Nitrosomonadales is well-documented (Kalyuzhnaya et al., 2011; Salcher et al., 2015). We also found predicted pathways for methanol degradation in MAGs classified as Burkholderiales and Rhizobiales in Trout Bog. Methylotrophy has been identified in members of Rhizobiales, such as Methylobacterium and Methylocystaceae, and in Burkholderiales, including Methylibium (Auman et al., 2000; Chistoserdova et al., 2003; Kane et al., 2007). Our MAGs may represent populations related to these known methylotrophs.
Using MAGs to track population abundances over time
Because our metagenomes comprise a time series, we can investigate potential changes in function over time using our MAGs and functional marker genes. We analyzed nitrogen fixation over time in Cyanobacteria, known to be highly variable over time in Mendota. We found that in each year, one Cyanobacteria MAG was substantially more abundant (based on read coverage) than the rest; this single MAG was plotted for each year in Mendota (Figs. 4A–4E). We compared read coverage-based abundance of the dominant Cyanobacteria MAG to the normalized number of BLAST hits in the metagenomes from abundant functional marker genes encoding nitrogenase subunits (TIGR1282 (nifD), TIGR1286 (nifK specific for molybdenum–iron nitrogenase), and TIGR1287 (nifH, common among different types of nitrogenases)) (Figs. 4F–4J). As expected, we detected significant correlations (p < 0.05) between MAG abundance and nitrogen fixation marker genes in 2008, 2011, and 2012. In these years, the dominant Cyanobacteria MAGs were predicted to fix nitrogen based on gene content, while the dominant MAGs in 2009 and 2010 were not predicted to fix nitrogen. In agreement with this, the number of hits for the nitrogenase marker genes were an order of magnitude lower in 2009 and 2010 compared to 2008 and 2012. While genome incompleteness precludes us from concluding that the potential for nitrogen fixation in Mendota was lower in 2009 and 2010 because the dominant Cyanobacteria populations were not diazotrophic, it does suggest a strong link between Cyanobacteria dynamics and nitrogen fixation in this ecosystem (Beversdorf, Miller & McMahon, 2013). This could also have important implications for cyanotoxin production, since nitrogen stress has been linked to toxin production (Beversdorf et al., 2015).
Figure 4. Cyanobacteria and nitrogen fixation over time.
To investigate potential functional changes over time in Mendota, we compared the abundances of Cyanobacteria MAGs (approximated using read coverage normalized by genome length) to the abundances of nitrogen fixation marker genes (approximated using the number of BLAST hits to metagenomes normalized by metagenome size). Only the most abundant Cyanobacteria MAG is shown for each year (A–E) because a single MAG was much more abundant than the rest in each year. The marker genes used were TIGR1282, TIGR1286, and TIGR1287, encoding subunits of Mo–Fe nitrogenase, as these were the most frequently observed nitrogenase markers in the Mendota metagenomes (F–J). Significantly correlated trends over time were observed between the MAGs and the nitrogenase marker genes in 2008, 2011, and 2012. In years where there was no significant correlation, the dominant MAG did not contain genes indicative of the nitrogen fixation pathway. This suggests that Cyanobacteria dynamics may be linked to the potential for nitrogen fixation in Mendota.
Conclusions
Our analysis of functional marker genes indicated potentially significant differences in microbial biogeochemical cycling between Mendota’s epilimnion, Trout Bog’s epilimnion, and Trout Bog’s hypolimnion. We next used MAGs from multi-year metagenomic time series to propose specific roles in freshwater biogeochemical cycles for microbial taxa. In the nitrogen cycle, we predicted many pathways for the degradation and biosynthesis of polyamines, consistent with their hypothesized role in the dissolved organic nitrogen pool. We observed an association between nitrogen fixation and Cyanobacteria in Mendota, but observed a greater diversity of putative diazotrophs in Trout Bog. Assimilatory sulfate reduction pathways were predicted more frequently that dissimilatory sulfate reduction pathways, suggesting a bias towards using sulfate for biosynthesis. We identified several types of phototrophy, which in some but not all genomes co-occurred with carbon fixation via the Calvin Cycle or the reductive TCA cycle. We found the greatest diversity and density of GHs in MAGs from Trout Bog’s hypolimnion, suggesting a greater potential to degrade recalcitrant carbon in this region. Our combination of functional marker gene analysis and MAG pathway prediction provides insight into the complex metabolisms underpinning freshwater communities and how microbial processes scale to ecosystem functions.
We anticipate that this dataset will be a valuable community resource for other freshwater microbial ecologists to mine and incorporate into comparative studies across lakes around the world. As such, all data is publicly available at https://github.com/McMahonLab/MAGstravaganza. The results of this study can be used to guide efforts to build microbially-resolved models of freshwater carbon and nutrient cycles with better predictive power.
Supplemental Information
Additional chemistry data were collected by NTL-LTER (http://lter.limnology.wisc.edu) from depth discrete samples taken from 0 and 4 m for Mendota, 0 m for the Trout Bog epilimnion, and 3 and 7 m for the Trout Bog hypolimnion. Values reported here are the means of all measurements in the sampling time span for each lake, with standard deviations reported in parentheses.
Metagenomic samples were pooled by lake and layer to allow time-resolved binning. The Mendota time series spans 2008–2012, while the Trout Bog time series spans 2007–2009. Just under 200 medium to high quality metagenome-assembled genomes (MAGs) were produced.
A Wilcoxon rank sum test was used to non-parametrically test for significant differences in functional marker gene distributions between our study sites. P-values of less than 0.05 are considered significant.
This dataset includes information about the metagenomes used in this study including date collected, size in reads and base pairs, and their IMG Genome IDs (IMG Taxon ID).
Information about the completeness, size, and taxonomy of our MAGs, as well as their IMG OIDs, are presented here.
Average nucleotide identity (ANI) was calculated between all MAGs in our dataset. MAGs with extremely high ANIs (>97%) are likely from the same populations.
This dataset lists the TIGRFAM, COG, or PFAM IDs of sequences used as functional marker genes to analyze how gene content differs by site.
This dataset is the input to Fig. 2 and contains pathway completeness estimates for each MAG individually.
To assess the potential to degrade complex carbon compounds, we annotated carbohydrate active enzymes in our MAGs using dbCAN2. The output of dbCAN2 for each MAG is presented here.
To visualize the diversity of our MAGs, phylogenetic marker genes were extracted from each MAG and aligned using Phylosift. An approximate maximum-likelihood tree based on these alignments was constructed using FastTree. The potential for nitrogen fixation based on gene content is indicated on the branch tips.
We used read coverage normalized by MAG and metagenome size to approximate the abundance of our MAGs. MAGs were recovered from diverse freshwater phyla. The abundances of phyla represented by MAGs differed by lake and layer. MAGs were classified using Phylosift, and Proteobacteria was split into classes due to the high diversity of this phylum.
The community composition observed via 16S rRNA gene amplicon sequencing in our dataset is consistent with previously published analyses of freshwater community composition. This confirms that the years included in our study are not abnormal. The 16S V6–V8 region was targeted in Trout Bog, while the V4 region was targeted in Mendota. Proteobacteria was split into classes due to the high diversity of this phylum.
Acknowledgments
We thank the North Temperate Lakes—Long Term Ecological Research Program and Lake Mendota Microbial Observatory field crews, UW-Trout Lake Station, the UW Center for Limnology, and the Global Lakes Ecological Observatory Network for field and logistical support. We acknowledge efforts by many McMahon laboratory undergraduate students and technicians whose work has been related to sample collection and DNA extraction. We thank Emily Stanley and Joshua Hamilton for insightful comments on an early draft of this manuscript. Finally, we personally thank the individual program directors and leadership at the National Science Foundation for their commitment to continued support of long-term ecological research.
Funding Statement
This research was supported by the U.S. Department of Energy Joint Genome Institute through the Community Sequencing Program (CSP 394). The work conducted by the U.S. Department of Energy Joint Genome Institute, a DOE Office of Science User Facility, is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231. Katherine D. McMahon received funding from the United States National Science Foundation Microbial Observatories program (MCB-0702395), the Long Term Ecological Research Program (NTL–LTER DEB-1440297), and an INSPIRE award (DEB-1344254). Alexandra M. Linz was supported by a pre-doctoral fellowship provided by the University of Wisconsin–Madison Department of Bacteriology and by the National Science Foundation Graduate Research Fellowship Program under grant no. DGE-1256259 during this research. This material is also based upon work supported by the National Institute of Food and Agriculture, U.S. Department of Agriculture (Hatch Project 1002996). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Additional Information and Declarations
Competing Interests
The authors declare that they have no competing interests.
Author Contributions
Alexandra M. Linz conceived and designed the experiments, analyzed the data, prepared figures and/or tables, authored or reviewed drafts of the paper, approved the final draft.
Shaomei He analyzed the data, contributed reagents/materials/analysis tools, authored or reviewed drafts of the paper, approved the final draft.
Sarah L.R. Stevens analyzed the data, contributed reagents/materials/analysis tools, authored or reviewed drafts of the paper, approved the final draft.
Karthik Anantharaman analyzed the data, contributed reagents/materials/analysis tools, authored or reviewed drafts of the paper, approved the final draft.
Robin R. Rohwer analyzed the data, contributed reagents/materials/analysis tools, authored or reviewed drafts of the paper, approved the final draft.
Rex R. Malmstrom conceived and designed the experiments, authored or reviewed drafts of the paper, approved the final draft.
Stefan Bertilsson conceived and designed the experiments, authored or reviewed drafts of the paper, approved the final draft.
Katherine D. McMahon conceived and designed the experiments, prepared figures and/or tables, authored or reviewed drafts of the paper, approved the final draft.
DNA Deposition
The following information was supplied regarding the deposition of DNA sequences:
Metagenomes, pooled metagenome assemblies, and metagenome-assembled genomes (MAGs) described here are accessible through the Integrated Microbial Genomes (IMG) database. IMG Genome IDs for these many sequences can be found at https://github.com/McMahonLab/MAGstravaganza (also included as Supplemental Documents).
Data Availability
The following information was supplied regarding data availability:
McMahon Lab Github–MAGstravaganza
References
- Al Bulushi et al. (2009).Al Bulushi I, Poole S, Deeth HC, Dykes GA. Biogenic amines in fish: Roles in intoxication, spoilage, and nitrosamine formation—a review. Critical Reviews in Food Science and Nutrition. 2009;49(4):369–377. doi: 10.1080/10408390802067514. [DOI] [PubMed] [Google Scholar]
- Anantharaman et al. (2016).Anantharaman K, Brown CT, Hug LA, Sharon I, Castelle CJ, Probst AJ, Thomas BC, Singh A, Wilkins MJ, Karaoz U, Brodie EL, Williams KH, Hubbard SS, Banfield JF. Thousands of microbial genomes shed light on interconnected biogeochemical processes in an aquifer system. Nature Communications. 2016;7:13219. doi: 10.1038/ncomms13219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Auman et al. (2000).Auman AJ, Stolyar S, Costello AM, Lidstrom ME. Molecular characterization of methanotrophic isolates from freshwater lake sediment. Applied and Environmental Microbiology. 2000;66(12):5259–5266. doi: 10.1128/AEM.66.12.5259-5266.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bendall et al. (2016).Bendall ML, Stevens SL, Chan L-K, Malfatti S, Schwientek P, Tremblay J, Schackwitz W, Martin J, Pati A, Bushnell B, Froula J, Kang D, Tringe SG, Bertilsson S, Moran MA, Shade A, Newton RJ, McMahon KD, Malmstrom RR. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations. ISME Journal. 2016;10(7):1589–1601. doi: 10.1038/ismej.2015.241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beversdorf et al. (2015).Beversdorf LJ, Chaston SD, Miller TR, McMahon KD. Microcystin mcyA and mcyE gene abundances are not appropriate indicators of microcystin concentrations in lakes. PLOS ONE. 2015;10(5):e0125353. doi: 10.1371/journal.pone.0125353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beversdorf, Miller & McMahon (2013).Beversdorf LJ, Miller TR, McMahon KD. The role of nitrogen fixation in cyanobacterial bloom toxicity in a temperate, eutrophic lake. PLOS ONE. 2013;8(2):e56103. doi: 10.1371/journal.pone.0056103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boden, Hutt & Rae (2017).Boden R, Hutt LP, Rae AW. Reclassification of Thiobacillus aquaesulis (Wood & Kelly, 1995) as Annwoodia aquaesulis gen. nov., comb. nov., transfer of Thiobacillus (Beijerinck, 1904) from the Hydrogenophilales to the Nitrosomonadales, proposal of Hydrogenophilalia class. nov. within the ‘Proteobacteria,’ and four new families within the orders Nitrosomonadales and Rhodocyclales. International Journal of Systematic and Evolutionary Microbiology. 2017;67(5):1191–1205. doi: 10.1099/ijsem.0.001927. [DOI] [PubMed] [Google Scholar]
- Boisvert et al. (2012).Boisvert S, Raymond F, Godzaridis É, Laviolette F, Corbeil J. Ray Meta: scalable de novo metagenome assembly and profiling. Genome Biology. 2012;13:R122. doi: 10.1186/gb-2012-13-12-r122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bowers et al. (2017).Bowers RM, Kyrpides NC, Stepanauskas R, Harmon-Smith M, Doud D, Reddy TBK, Schulz F, Jarett J, Rivers AR, Eloe-Fadrosh EA, Tringe SG, Ivanova NN, Copeland A, Clum A, Becraft ED, Malmstrom RR, Birren B, Podar M, Bork P, Weinstock GM, Garrity GM, Dodsworth JA, Yooseph S, Sutton G, Glöckner FO, Gilbert JA, Nelson WC, Hallam SJ, Jungbluth SP, Ettema TJG, Tighe S, Konstantinidis KT, Liu WT, Baker BJ, Rattei T, Eisen JA, Hedlund B, McMahon KD, Fierer N, Knight R, Finn R, Cochrane G, Karsch-Mizrachi I, Tyson GW, Rinke C, Lapidus A, Meyer F, Yilmaz P, Parks DH, Eren AM, Schriml L, Banfield JF, Hugenholtz P, Woyke T. Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea. Nature Biotechnology. 2017;35(8):725–731. doi: 10.1038/nbt.3893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bowles et al. (2014).Bowles MW, Mogollon JM, Kasten S, Zabel M, Hinrichs K-U. Global rates of marine sulfate reduction and implications for sub-sea-floor metabolic activities. Science. 2014;344(6186):889–891. doi: 10.1126/science.1249213. [DOI] [PubMed] [Google Scholar]
- Bowman, Sly & Stackebrandt (1995).Bowman JP, Sly LI, Stackebrandt E. The phylogenetic position of the family Methylococcaceae. International Journal of Systematic Bacteriology. 1995;45(3):622. doi: 10.1099/00207713-45-3-622a. [DOI] [PubMed] [Google Scholar]
- Butman et al. (2015).Butman D, Stackpoole S, Stets E, McDonald CP, Clow DW, Striegl RG. Aquatic carbon cycling in the conterminous United States and implications for terrestrial carbon accounting. Proceedings of the National Academy of Sciences of the United States of America. 2015;113(1):58–63. doi: 10.1073/pnas.1512651112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cabello-Yeves et al. (2018).Cabello-Yeves PJ, Zemskaya TI, Rosselli R, Coutinho FH, Zakharenko AS, Blinov VV, Rodriguez-Valera F. Genomes of novel microbial lineages assembled from the sub-ice waters of Lake Baikal. Applied and environmental microbiology. 2018;84(1):e02132-17. doi: 10.1128/AEM.02132-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Camacho et al. (2009).Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10(1):421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caporaso et al. (2012).Caporaso JG, Lauber CL, Walters WA, Berg-Lyons D, Huntley J, Fierer N, Owens SM, Betley J, Fraser L, Bauer M, Gormley N, Gilbert JA, Smith G, Knight R. Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME Journal. 2012;6(8):1621–1624. doi: 10.1038/ismej.2012.8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chistoserdova et al. (2003).Chistoserdova L, Chen S-W, Lapidus A, Lidstrom ME. Methylotrophy in Methylobacterium extorquens AM1 from a genomic point of view. Journal of bacteriology. 2003;185(10):2980–2987. doi: 10.1128/JB.185.10.2980-2987.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cole et al. (2007).Cole JJ, Prairie YT, Caraco NF, McDowell WH, Tranvik LJ, Striegl RG, Duarte CM, Kortelainen P, Downing JA, Middelburg JJ, Melack J. Plumbing the global carbon cycle: integrating inland waters into the terrestrial carbon budget. Ecosystems. 2007;10(1):172–185. doi: 10.1007/s10021-006-9013-8. [DOI] [Google Scholar]
- Darling et al. (2014).Darling AE, Jospin G, Lowe E, Matsen FA, Bik HM, Eisen JA. PhyloSift: phylogenetic analysis of genomes and metagenomes. PeerJ. 2014;2:e243. doi: 10.7717/peerj.243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- DeSantis et al. (2006).DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, Huber T, Dalevi D, Hu P, Andersen GL. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Applied and Environmental Microbiology. 2006;72(7):5069–5072. doi: 10.1128/AEM.03006-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eiler & Bertilsson (2004).Eiler A, Bertilsson S. Composition of freshwater bacterial communities associated with cyanobacterial blooms in four Swedish lakes. Environmental Microbiology. 2004;6(12):1228–1243. doi: 10.1111/j.1462-2920.2004.00657.x. [DOI] [PubMed] [Google Scholar]
- Eiler et al. (2003).Eiler A, Langenheder S, Bertilsson S, Tranvik LJ. Heterotrophic bacterial growth efficiency and community structure at different natural organic carbon concentrations. Applied and Environmental Microbiology. 2003;69(7):3701–3709. doi: 10.1128/AEM.69.7.3701-3709.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eiler et al. (2016).Eiler A, Mondav R, Sinclair L, Fernandez-Vidal L, Scofield DG, Schwientek P, Martinez-Garcia M, Torrents D, McMahon KD, Andersson SG, Stepanauskas R, Woyke T, Bertilsson S. Tuning fresh: radiation through rewiring of central metabolism in streamlined bacteria. ISME Journal. 2016;10(8):1902–1914. doi: 10.1038/ismej.2015.260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Engelbrektson et al. (2010).Engelbrektson AL, Kunin V, Wrighton KC, Zvenigorodsky N, Chen F, Ochman H, Hugenholtz P. Experimental factors affecting PCR-based estimates of microbial species richness and evenness. ISME Journal. 2010;4(5):642–647. doi: 10.1038/ismej.2009.153. [DOI] [PubMed] [Google Scholar]
- Ghylin et al. (2014).Ghylin TW, Garcia SL, Moya F, Oyserman BO, Schwientek P, Forest KT, Mutschler J, Dwulit-Smith J, Chan L-K, Martinez-Garcia M, Sczyrba A, Stepanauskas R, Grossart H-P, Woyke T, Warnecke F, Malmstrom R, Bertilsson S, McMahon KD. Comparative single-cell genomics reveals potential ecological niches for the freshwater acI Actinobacteria lineage. ISME Journal. 2014;8(12):2503–2516. doi: 10.1038/ismej.2014.135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Giroldo, Augusto & Vieira (2005).Giroldo D, Augusto A, Vieira H. Polymeric and free sugars released by three phytoplanktonic species from a freshwater tropical eutrophic reservoir. Journal of Plankton Research. 2005;27(7):695–705. doi: 10.1093/plankt/fbi043. [DOI] [Google Scholar]
- Gong et al. (2018).Gong X, Garcia-Robledo E, Lund MB, Lehner P, Borisov SM, Klimant I, Revsbech N-P, Schramm A. Gene expression of terminal oxidases in two marine bacterial strains exposed to nanomolar oxygen concentrations. FEMS Microbiology Ecology. 2018;94(7):72. doi: 10.1093/femsec/fiy072/4983120. [DOI] [PubMed] [Google Scholar]
- Guillemette & Del Giorgio (2011).Guillemette F, Del Giorgio PA. Reconstructing the various facets of dissolved organic carbon bioavailability in freshwater ecosystems. Limnology and Oceanography. 2011;56(2):734–748. doi: 10.4319/lo.2011.56.2.0734. [DOI] [Google Scholar]
- Hahn et al. (2012).Hahn MW, Scheuerl T, Jezberová J, Koll U, Jezbera J, Šimek K, Vannini C, Petroni G, Wu QL. The passive yet successful way of planktonic life: genomic and experimental analysis of the ecology of a free-living polynucleobacter population. PLOS ONE. 2012;7:e32772. doi: 10.1371/journal.pone.0032772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hall et al. (2017).Hall MW, Rohwer RR, Perrie J, McMahon KD, Beiko RG. Ananke: temporal clustering reveals ecological dynamics of microbial communities. PeerJ. 2017;5:e3812. doi: 10.7717/peerj.3812. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hamilton et al. (2017).Hamilton JJ, Garcia SL, Brown BS, Oyserman BO, Moya-Flores F, Bertilsson S, Malmstrom RR, Forest KT, McMahon KD. Metabolic network analysis and metatranscriptomics reveal auxotrophies and nutrient sources of the cosmopolitan freshwater microbial lineage acI. mSystems. 2017;2(4):e00091-17. doi: 10.1128/mSystems.00091-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hanson & Tabita (2001).Hanson TE, Tabita FR. A ribulose-1,5-bisphosphate carboxylase/oxygenase (RubisCO)-like protein from chlorobium tepidum that is involved with sulfur metabolism and the response to oxidative stress. Proceedings of the National Academy of Sciences of the United States of America. 2001;98(8):4397–4402. doi: 10.1073/pnas.081610398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- He et al. (2017).He S, Stevens SLR, Chan L-K, Bertilsson S, Glavina Del Rio T, Tringe SG, Malmstrom RR, McMahon KD. Ecophysiology of freshwater Verrucomicrobia inferred from metagenome-assembled genomes. mSphere. 2017;2(5):e00277-17. doi: 10.1128/mSphere.00277-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henrissat & Davies (1997).Henrissat B, Davies G. Structural and sequence-based classification of glycoside hydrolases. Current Opinion in Structural Biology. 1997;7(5):637–644. doi: 10.1016/S0959-440X(97)80072-3. [DOI] [PubMed] [Google Scholar]
- Holkenbrink et al. (2011).Holkenbrink C, Barbas SO, Mellerup A, Otaki H, Frigaard N-U. Sulfur globule oxidation in green sulfur bacteria is dependent on the dissimilatory sulfite reductase system. Microbiology. 2011;157(4):1229–1239. doi: 10.1099/mic.0.044669-0. [DOI] [PubMed] [Google Scholar]
- Hyatt et al. (2010).Hyatt D, Chen G-L, LoCascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11(1):119. doi: 10.1186/1471-2105-11-119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Igarashi & Kashiwagi (1999).Igarashi K, Kashiwagi K. Polyamine transport in bacteria and yeast. Biochemical Journal. 1999;344(3):633–642. doi: 10.1042/bj3440633. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jonsson et al. (2001).Jonsson A, Meili M, Bergström A-K, Jansson M. Whole-lake mineralization of allochthonous and autochthonous organic carbon in a large humic lake (örträsket, N. Sweden) Limnology and Oceanography. 2001;46(7):1691–1700. doi: 10.4319/lo.2001.46.7.1691. [DOI] [Google Scholar]
- Kalyuzhnaya et al. (2011).Kalyuzhnaya MG, Beck DAC, Vorobev A, Smalley N, Kunkel DD, Lidstrom ME, Chistoserdova L. Novel methylotrophic isolates from lake sediment, description of Methylotenera versatilis sp. nov. and emended description of the genus methylotenera. International Journal of Systematic and Evolutionary Microbiology. 2011;62(1):106–111. doi: 10.1099/ijs.0.029165-0. [DOI] [PubMed] [Google Scholar]
- Kanao et al. (2002).Kanao T, Kawamura M, Fukui T, Atomi H, Imanaka T. Characterization of isocitrate dehydrogenase from the green sulfur bacterium Chlorobium limicola. European Journal of Biochemistry. 2002;269(7):1926–1931. doi: 10.1046/j.1432-1327.2002.02849.x. [DOI] [PubMed] [Google Scholar]
- Kane et al. (2007).Kane SR, Chakicherla AY, Chain PSG, Schmidt R, Shin MW, Legler TC, Scow KM, Larimer FW, Lucas SM, Richardson PM, Hristova KR. Whole-genome analysis of the methyl tert-butyl ether-degrading beta-proteobacterium Methylibium petroleiphilum PM1. Journal of Bacteriology. 2007;189(5):1931–1945. doi: 10.1128/JB.01259-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kang et al. (2015).Kang DD, Froula J, Egan R, Wang Z. MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities. PeerJ. 2015;3:e1165. doi: 10.7717/peerj.1165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karhunen et al. (2013).Karhunen J, Arvola L, Peura S, Tiirola M. Green sulphur bacteria as a component of the photosynthetic plankton community in small dimictic humic lakes with an anoxic hypolimnion. Aquatic Microbial Ecology. 2013;68(3):267–272. doi: 10.3354/ame01620. [DOI] [Google Scholar]
- Kritzberg et al. (2004).Kritzberg ES, Cole JJ, Pace ML, Granéli W, Bade DL. Autochthonous versus allochthonous carbon sources of bacteria: results from whole-lake 13C addition experiments. Limnology and Oceanography. 2004;49(2):588–596. doi: 10.4319/lo.2004.49.2.0588. [DOI] [Google Scholar]
- Li & Durbin (2010).Li H, Durbin R. Fast and accurate long-read alignment with Burrows–Wheeler transform. Bioinformatics. 2010;26(5):589–595. doi: 10.1093/bioinformatics/btp698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Linz et al. (2017).Linz AM, Crary BC, Shade A, Owens S, Gilbert JA, Knight R, McMahon KD. Bacterial community composition and dynamics spanning five years in freshwater bog lakes. mSphere. 2017;2:1–15. doi: 10.1101/127035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luo et al. (2012).Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, Tang J, Wu G, Zhang H, Shi Y, Liu Y, Yu C, Wang B, Lu Y, Han C, Cheung DW, Yiu S-M, Peng S, Xiaoqian Z, Liu G, Liao X, Li Y, Yang H, Wang J, Lam T-W, Wang J. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. GigaScience. 2012;1:1–6. doi: 10.1186/2047-217X-1-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Magooc & Salzberg (2011).Magooc T, Salzberg SL. FLASH: fast length adjustment of short reads to improve genome assemblies. Bioinformatics. 2011;27(21):2957–2963. doi: 10.1093/bioinformatics/btr507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Markowitz et al. (2012).Markowitz VM, Chen IMA, Palaniappan K, Chu K, Szeto E, Grechkin Y, Ratner A, Jacob B, Huang J, Williams P, Huntemann M, Anderson I, Mavromatis K, Ivanova NN, Kyrpides NC. IMG: the integrated microbial genomes database and comparative analysis system. Nucleic Acids Research. 2012;40(D1):D115–D122. doi: 10.1093/nar/gkr1044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martinez-Garcia et al. (2012a).Martinez-Garcia M, Brazel DM, Swan BK, Arnosti C, Chain PSG, Reitenga KG, Xie G, Poulton NJ, Gomez ML, Masland DED, Thompson B, Bellows WK, Ziervogel K, Lo C-C, Ahmed S, Gleasner CD, Detter CJ, Stepanauskas R. Capturing single cell genomes of active polysaccharide degraders: an unexpected contribution of Verrucomicrobia. PLOS ONE. 2012a;7(4):e35314. doi: 10.1371/journal.pone.0035314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martinez-Garcia et al. (2012b).Martinez-Garcia M, Swan BK, Poulton NJ, Gomez ML, Masland D, Sieracki ME, Stepanauskas R. High-throughput single-cell sequencing identifies photoheterotrophs and chemoautotrophs in freshwater bacterioplankton. ISME Journal. 2012b;6(1):113–123. doi: 10.1038/ismej.2011.84. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mou et al. (2011).Mou X, Vila-Costa M, Sun S, Zhao W, Sharma S, Moran MA. Metatranscriptomic signature of exogenous polyamine utilization by coastal bacterioplankton. Environmental Microbiology. 2011;3(6):798–806. doi: 10.1111/j.1758-2229.2011.00289.x. [DOI] [PubMed] [Google Scholar]
- Newton et al. (2011).Newton RJ, Jones SE, Eiler A, McMahon KD, Bertilsson S. A guide to the natural history of freshwater lake bacteria. Microbiology and Molecular Biology Reviews. 2011;75(1):14–49. doi: 10.1128/MMBR.00028-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paradis, Claude & Strimmer (2004).Paradis E, Claude J, Strimmer K. APE: analyses of phylogenetics and evolution in R language. Bioinformatics. 2004;20(2):289–290. doi: 10.1093/bioinformatics/btg412. [DOI] [PubMed] [Google Scholar]
- Parks et al. (2015).Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Research. 2015;25(7):1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paver & Kent (2010).Paver SF, Kent AD. Temporal patterns in glycolate-utilizing bacterial community composition correlate with phytoplankton population dynamics in humic lakes. Microbial Ecology. 2010;60(2):406–418. doi: 10.1007/s00248-010-9722-6. [DOI] [PubMed] [Google Scholar]
- Peters et al. (2015).Peters JW, Schut GJ, Boyd ES, Mulder DW, Shepard EM, Broderick JB, King PW, Adams MWW. [FeFe]- and [NiFe]-hydrogenase diversity, mechanism, and maturation. Biochimica et Biophysica Acta (BBA)-Molecular Cell Research. 2015;1853(6):1350–1369. doi: 10.1016/j.bbamcr.2014.11.021. [DOI] [PubMed] [Google Scholar]
- Peura et al. (2012).Peura S, Eiler A, Bertilsson S, Nykänen H, Tiirola M, Jones RI. Distinct and diverse anaerobic bacterial communities in boreal lakes dominated by candidate division OD1. ISME Journal. 2012;6(9):1640–1652. doi: 10.1038/ismej.2012.21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peura et al. (2015).Peura S, Sinclair L, Bertilsson S, Eiler A. Metagenomic insights into strategies of aerobic and anaerobic carbon and nitrogen transformation in boreal lakes. Scientific Reports. 2015;5(1):12102. doi: 10.1038/srep12102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Price, Dehal & Arkin (2010).Price MN, Dehal PS, Arkin AP. FastTree 2–approximately maximum-likelihood trees for large alignments. PLOS ONE. 2010;5(3):e9490. doi: 10.1371/journal.pone.0009490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramachandran & Walsh (2015).Ramachandran A, Walsh DA. Investigation of XoxF methanol dehydrogenases reveals new methylotrophic bacteria in pelagic marine and freshwater ecosystems. FEMS Microbiology Ecology. 2015;91(10):fiv105. doi: 10.1093/femsec/fiv105. [DOI] [PubMed] [Google Scholar]
- Ramanan et al. (2016).Ramanan R, Kim B-H, Cho D-H, Oh H-M, Kim H-S. Algae–bacteria interactions: evolution, ecology and emerging applications. Biotechnology Advances. 2016;34(1):14–29. doi: 10.1016/j.biotechadv.2015.12.003. [DOI] [PubMed] [Google Scholar]
- R Core Team (2017).R Core Team . R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2017. [Google Scholar]
- Rinke et al. (2013).Rinke C, Schwientek P, Sczyrba A, Ivanova NN, Anderson IJ, Cheng J-F, Darling AE, Malfatti S, Swan BK, Gies EA, Dodsworth JA, Hedlund BP, Tsiamis G, Sievert SM, Liu W-T, Eisen JA, Hallam SJ, Kyrpides NC, Stepanauskas R, Rubin EM, Hugenholtz P, Woyke T. Insights into the phylogeny and coding potential of microbial dark matter. Nature. 2013;499(7459):431–437. doi: 10.1038/nature12352. [DOI] [PubMed] [Google Scholar]
- Rognes et al. (2016).Rognes T, Flouri T, Nichols B, Quince C, Mahé F. VSEARCH: a versatile open source tool for metagenomics. PeerJ. 2016;4:e2584. doi: 10.7717/peerj.2584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rohwer et al. (2018).Rohwer RR, Hamilton JJ, Newton RJ, McMahon KD. TaxAss: leveraging a custom freshwater database achieves fine-scale taxonomic resolution. mSphere. 2018;3(5):e00327-18. doi: 10.1128/mSphere.00327-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roux et al. (2017).Roux S, Chan LK, Egan R, Malmstrom RR, McMahon KD, Sullivan MB. Ecogenomics of virophages and their giant virus hosts assessed through time series metagenomics. Nature Communications. 2017;8(1):858. doi: 10.1038/s41467-017-01086-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Salcher et al. (2015).Salcher MM, Neuenschwander SM, Posch T, Pernthaler J. The ecology of pelagic freshwater methylotrophs assessed by a high-resolution monitoring and isolation campaign. ISME Journal. 2015;9(11):2442–2453. doi: 10.1038/ismej.2015.55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Salcher, Posch & Pernthaler (2013).Salcher MM, Posch T, Pernthaler J. In situ substrate preferences of abundant bacterioplankton populations in a prealpine freshwater lake. ISME Journal. 2013;7(5):896–907. doi: 10.1038/ismej.2012.162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schloss et al. (2009).Schloss PD, Westcott SL, Ryabin T, Hall JR, Hartmann M, Hollister EB, Lesniewski RA, Oakley BB, Parks DH, Robinson CJ, Sahl JW, Stres B, Thallinger GG, Van Horn DJ, Weber CF. Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Applied and Environmental Microbiology. 2009;75(23):7537–7541. doi: 10.1128/AEM.01541-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seitzinger et al. (2006).Seitzinger S, Harrison JA, Böhlke JK, Bouwman AF, Lowrance R, Peterson B, Tobias C, Drecht G Van. Denitrification across landscapes and waterscapes: a synthesis. Ecological Applications. 2006;16(6):2064–2090. doi: 10.1890/1051-0761(2006)016[2064:DALAWA]2.0.CO;2. [DOI] [PubMed] [Google Scholar]
- Shade et al. (2007).Shade A, Kent AD, Jones SE, Newton RJ, Triplett EW, McMahon KD. Interannual dynamics and phenology of bacterial communities in a eutrophic lake. Limnology and Oceanography. 2007;52(2):487–494. doi: 10.4319/lo.2007.52.2.0487. [DOI] [Google Scholar]
- Smith (2003).Smith VH. Eutrophication of freshwater and coastal marine ecosystems a global problem. Environmental Science and Pollution Research. 2003;10(2):126–139. doi: 10.1065/espr2002.12.142. [DOI] [PubMed] [Google Scholar]
- Sommer et al. (2007).Sommer DD, Delcher AL, Salzberg SL, Pop M. Minimus: a fast, lightweight genome assembler. BMC Bioinformatics. 2007;8(1):64. doi: 10.1186/1471-2105-8-64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tang & Blankenship (2010).Tang KH, Blankenship RE. Both forward and reverse TCA cycles operate in green sulfur bacteria. Journal of Biological Chemistry. 2010;285(46):35848–35854. doi: 10.1074/jbc.M110.157834. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Varghese et al. (2015).Varghese NJ, Mukherjee S, Ivanova N, Konstantinidis KT, Mavrommatis K, Kyrpides NC, Pati A. Microbial species delineation using whole genome sequences. Nucleic Acids Research. 2015;43(14):6761–6771. doi: 10.1093/nar/gkv657. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wickham (2007).Wickham H. Reshaping data with the reshape package. Journal of Statistical Software. 2007;21(12):1–20. [Google Scholar]
- Wickham (2009).Wickham H. ggplot2: elegant graphics for data analysis. New York: Springer-Verlag; 2009. [Google Scholar]
- Wilke (2017).Wilke CO. cowplot: streamlined plot theme and plot annotations for “ggplot2”. R Package Version 0.9.2https://CRAN.R-project.org/package=cowplot 2017
- Williamson et al. (2008).Williamson CE, Dodds W, Kratz TK, Palmer MA. Lakes and streams as sentinels of environmental change in terrestrial and atmospheric processes. Frontiers in Ecology and the Environment. 2008;6(5):247–254. doi: 10.1890/070140. [DOI] [Google Scholar]
- Zhang et al. (2018).Zhang H, Yohe T, Huang L, Entwistle S, Wu P, Yang Z, Busk PK, Xu Y, Yin Y. dbCAN2: a meta server for automated carbohydrate-active enzyme annotation. Nucleic Acids Research. 2018;46(W1):W95–W101. doi: 10.1093/nar/gky418. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Additional chemistry data were collected by NTL-LTER (http://lter.limnology.wisc.edu) from depth discrete samples taken from 0 and 4 m for Mendota, 0 m for the Trout Bog epilimnion, and 3 and 7 m for the Trout Bog hypolimnion. Values reported here are the means of all measurements in the sampling time span for each lake, with standard deviations reported in parentheses.
Metagenomic samples were pooled by lake and layer to allow time-resolved binning. The Mendota time series spans 2008–2012, while the Trout Bog time series spans 2007–2009. Just under 200 medium to high quality metagenome-assembled genomes (MAGs) were produced.
A Wilcoxon rank sum test was used to non-parametrically test for significant differences in functional marker gene distributions between our study sites. P-values of less than 0.05 are considered significant.
This dataset includes information about the metagenomes used in this study including date collected, size in reads and base pairs, and their IMG Genome IDs (IMG Taxon ID).
Information about the completeness, size, and taxonomy of our MAGs, as well as their IMG OIDs, are presented here.
Average nucleotide identity (ANI) was calculated between all MAGs in our dataset. MAGs with extremely high ANIs (>97%) are likely from the same populations.
This dataset lists the TIGRFAM, COG, or PFAM IDs of sequences used as functional marker genes to analyze how gene content differs by site.
This dataset is the input to Fig. 2 and contains pathway completeness estimates for each MAG individually.
To assess the potential to degrade complex carbon compounds, we annotated carbohydrate active enzymes in our MAGs using dbCAN2. The output of dbCAN2 for each MAG is presented here.
To visualize the diversity of our MAGs, phylogenetic marker genes were extracted from each MAG and aligned using Phylosift. An approximate maximum-likelihood tree based on these alignments was constructed using FastTree. The potential for nitrogen fixation based on gene content is indicated on the branch tips.
We used read coverage normalized by MAG and metagenome size to approximate the abundance of our MAGs. MAGs were recovered from diverse freshwater phyla. The abundances of phyla represented by MAGs differed by lake and layer. MAGs were classified using Phylosift, and Proteobacteria was split into classes due to the high diversity of this phylum.
The community composition observed via 16S rRNA gene amplicon sequencing in our dataset is consistent with previously published analyses of freshwater community composition. This confirms that the years included in our study are not abnormal. The 16S V6–V8 region was targeted in Trout Bog, while the V4 region was targeted in Mendota. Proteobacteria was split into classes due to the high diversity of this phylum.
Data Availability Statement
The following information was supplied regarding data availability:
McMahon Lab Github–MAGstravaganza