Skip to main content
Applied and Environmental Microbiology logoLink to Applied and Environmental Microbiology
. 2022 Apr 19;88(9):e02464-21. doi: 10.1128/aem.02464-21

Metagenomic and Metatranscriptomic Insights into Population Diversity of Microcystis Blooms: Spatial and Temporal Dynamics of mcy Genotypes, Including a Partial Operon That Can Be Abundant and Expressed

Colleen E Yancey a, Derek J Smith a, Paul A Den Uyl b, Osama G Mohamed c,d,e, Fengan Yu d, Steven A Ruberg f, Justin D Chaffin g,h, Kelly D Goodwin i,j, Ashootosh Tripathi c,d,k, David H Sherman d,k, Gregory J Dick a,b,
Editor: Knut Rudil
PMCID: PMC9088275  PMID: 35438519

ABSTRACT

Cyanobacterial harmful algal blooms (cyanoHABs) degrade freshwater ecosystems globally. Microcystis aeruginosa often dominates cyanoHABs and produces microcystin (MC), a class of hepatotoxins that poses threats to human and animal health. Microcystin toxicity is influenced by distinct structural elements across a diversity of related molecules encoded by variant mcy operons. However, the composition and distribution of mcy operon variants in natural blooms remain poorly understood. Here, we characterized the variant composition of mcy genes in western Lake Erie Microcystis blooms from 2014 and 2018. Sampling was conducted across several spatial and temporal scales, including different bloom phases within 2014, extensive spatial coverage on the same day (2018), and frequent, autonomous sampling over a 2-week period (2018). Mapping of metagenomic and metatranscriptomic sequences to reference sequences revealed three Microcystis mcy genotypes: complete (all genes present [mcyA–J]), partial (truncated mcyA, complete mcyBC, and missing mcyD–J), and absent (no mcy genes). We also detected two different variants of mcyB that may influence the production of microcystin congeners. The relative abundance of these genotypes was correlated with pH and nitrate concentrations. Metatranscriptomic analysis revealed that partial operons were, at times, the most abundant genotype and expressed in situ, suggesting the potential biosynthesis of truncated products. Quantification of genetic divergence between genotypes suggests that the observed strains are the result of preexisting heterogeneity rather than de novo mutation during the sampling period. Overall, our results show that natural Microcystis populations contain several cooccurring mcy genotypes that dynamically shift in abundance spatiotemporally via strain succession and likely influence the observed diversity of the produced congeners.

IMPORTANCE Cyanobacteria are responsible for producing microcystins (MCs), a class of potent and structurally diverse toxins, in freshwater systems around the world. While microcystins have been studied for over 50 years, the diversity of their chemical forms and how this variation is encoded at the genetic level remain poorly understood, especially within natural populations of cyanobacterial harmful algal blooms (cyanoHABs). Here, we leverage community DNA and RNA sequences to track shifts in mcy genes responsible for producing microcystin, uncovering the relative abundance, expression, and variation of these genes. We studied this phenomenon in western Lake Erie, which suffers annually from cyanoHAB events, with impacts on drinking water, recreation, tourism, and commercial fishing.

KEYWORDS: freshwater harmful algal blooms, Microcystis, metagenomics, mcy operon, Lake Erie

INTRODUCTION

Cyanobacterial harmful algal blooms (cyanoHABs) dominated by Microcystis spp. are globally distributed and reported on every continent except Antarctica (1). These toxic blooms are expected to expand in frequency and severity due to eutrophication, rising atmospheric CO2, and continued human-induced global warming (24). Microcystis has a complex genome that encodes many diverse, potentially harmful secondary metabolites (59). These compounds can contribute to an array of negative outcomes, including impacts on human and environmental health (10, 11), threats to potable water (12), and disruption of ecosystem function (2).

Microcystis is capable of producing the hepatotoxin microcystin (MC), a cyclic heptapeptide with over 270 reported structural variants, termed congeners (13). It is generated by a nonribosomal peptide synthetase multienzyme complex, which is encoded by a set of 10 genes in the bidirectional mcy operon (14, 15). This gene cluster is ancient, having originated from a common ancestor of several cyanobacteria genera, including Microcystis, Dolichospermum, and Planktothrix (16). The mcy operon also has a polyphyletic distribution within the genus Microcystis, providing evidence for multiple independent mcy gene loss events (1618). Due to its prevalence and potential toxicity, the expression, regulation, and secretion of microcystin have been studied extensively in locations around the world (1, 6, 8, 19, 20).

The mcy operon displays extensive sequence diversity (14, 21, 22), and each allele is thought to encode distinct MC congeners (2124). However, many other factors influence the structural variations of microcystins, including the availability of amino acids in cells (25), the flexibility of amino acid adenylation domains comprising the biosynthetic enzyme complex (21), nitrogen form and availability (26), and C-to-N ratios of available nutrients (27). The genes mcyA, mcyB, and mcyC demonstrate the highest level of variation and a “mosaic”-like structure (21). Frequent recombination events in these genes may lead to unique gene sequences that encode distinct MC congeners, such as the replacement of the mcyB B1 domain with the mcyC C1 domain (see Fig. 1 for examples), which leads to the production of at least two MC congeners, microcystin-LR, and microcystin-RR (22).

FIG 1.

FIG 1

Metagenomic read mapping suggests three distinct mcy genotypes in western Lake Erie Microcystis populations at different times and locations. Genotypes were inferred by mapping metagenomic reads onto the reference mcy operon from Microcystis aeruginosa reference strain PCC 7806 (GenBank accession number AF183408.1). For each plot, the top panel indicates the percent identity of the read mapped to the reference operon, and the bottom panel shows coverage (number of reads mapped per position). The color of the genes indicates modular enzymes encoded by mcy genes. (A) Mapping results from a sample dominated by the complete genotype (WE12, 4 August). The asterisks indicate evidence of recombination in which the C1 domain replaced the B1 domain, creating a C1-like mcyB genotype, which is interpreted to dominate the metagenome of this sample (bottom operon). The lack of this recombination preserves the mcyB orientation for a B1-like mcyB genotype (top operon), as in the reference sequence from PCC 7806. (B) Sample dominated by the partial genotype (WE12, 23 September). (C) Absent genotype with no mcy genes present.

While numerous studies have addressed how environmental variables impact congener production under controlled conditions (21, 22, 26), few have assessed the variation of the mcy operon structure and sequence in natural populations. Genetic research has largely focused on laboratory studies of Microcystis cultures aimed at understanding the mcy operon structure and the production of microcystin (21, 22, 24, 28). Field studies show that blooms consist of nontoxic and toxic strains (29) but have relied on PCR or consensus sequences from metagenomic assemblies (30), both of which may fail to detect important population-level variation. Mapping of metagenomic and metatranscriptomic sequence reads to reference genomes yields valuable insights into the genomic diversity and functional activity of Microcystis (3133). However, these methods have not yet been widely applied to seasonal strain successions of natural Microcystis populations, which are an important control on bloom toxicity (18). Thus, the true environmental diversity of Microcystis populations remains poorly understood.

Here, we studied the diversity of the mcy operon in a natural Microcystis population in western Lake Erie, which experiences annual cyanoHABs (34). The western basin is the shallowest, warmest, and most populated of the Laurentian Great Lakes and provides drinking water to 11 million people as well as fishing and other recreation, which have substantial economic value (Western Lake Erie Basin Project, NRCS). In August 2014, Toledo, OH, experienced a drinking water crisis that left its nearly half a million residents without potable water as a result of microcystin concentrations exceeding the World Health Organization guideline of 1 microgram/L (12, 35). The 2014 western Lake Erie bloom was characterized by an early toxic phase and a late nontoxic phase that correlated with a succession of Microcystis strains (35, 36). This pattern of succession has been observed worldwide but is not fully understood with respect to environmental or ecological drivers (18).

We collected samples from Lake Erie during cyanoHABs in 2014 and 2018. Sampling of blooms during 2014 (35), 2018 (37), and 2018 via an autonomous 3rd-generation environmental sample processor housed within a long range-autonomous underwater vehicle (3G-ESP LRAUV) (P. A. Den Uyl, S. R. Chaganti, L. R. Thompson, R. M. Errera, C. M. Preston, W. Ussler III, C. E. Yancey, J. M. Birch, S. A. Ruberg, G. J. Doucette, G. J. Dick, C. A. Scholin, and K. D. Goodwin, submitted for publication) provided seasonal and spatial variation, high spatial resolution, and high temporal resolution, respectively. The 2014 cyanoHAB data provided insights into spatial and temporal shifts from weekly samples collected from three stations from July to October (35). The 2018 “HABs Grab” data set achieved higher spatial resolution via 25 samples collected from various points in the lake on the same day (37). The 2018 3G-ESP captured high temporal resolution through autonomous sample collection over a 2-week period (Den Uyl et al., submitted). Analysis of the mcy genes and strain diversity in these data sets was used to assess novel genotypes and their relative frequencies as well as high genomic variation across time and space within the western basin and determine the relationships between genotypes and environmental variables to determine patterns of ecological strain succession.

RESULTS

mcy operon coverage and genotypes.

Mapping of metagenomic sequence reads to the mcy operon from the reference genome Microcystis aeruginosa PCC 7806 revealed three major mcy genotypes in western Lake Erie samples. First, relatively even coverage across the operon indicated dominance by a complete genotype with all 10 mcy genes present (Fig. 1A). Second, a partial mcy genotype was inferred through the detectable coverage of half of the mcyA gene and the full mcyB and mcyC genes but essentially no coverage of other mcy genes (Fig. 1B). A third, “absent” genotype was inferred through no coverage of mcy genes despite the presence of Microcystis in a sample, indicating Microcystis genotypes in which no mcy genes were present (Fig. 1C). Where mcyB and mcyC genes were present, mapping resulted in a lack of coverage at the location of the B1 domain (∼2.5 kb of mcyB) and approximately double coverage of the C1 domain (∼2.5 kb of mcyC) (Fig. 1A). This is indicative of a recombination event in which the mcyB B1 domain is replaced by the mcyC C1 domain as described previously by Mikalsen et al. (22).

The relative abundances of the Microcystis populations that contained a complete, partial, or absent mcy genotype from each station were estimated by comparison of mcy gene coverage with Microcystis 16S rRNA gene coverage (see Materials and Methods). The estimated relative abundance of each mcy genotype in 2014 cyanoHAB samples shifted with location and bloom conditions (Fig. 2). Relative abundances are reported in unitless ratios from 0-1 with 0 indicating the absence of a gene or genotype, and 1 indicating a 1 to 1 ratio of the mcy gene to the 16s rRNA gene. At nearshore station WE12, complete genotypes accounted for approximately half (0.51 to 0.54) of the Microcystis population during the toxic, nitrate-replete bloom in July and early August, but absent and partial mcy genotypes dominated from late August through October, after microcystin and nitrate concentrations declined (Fig. 2C). At the other nearshore station, WE2, a similar transition from complete to absent and partial mcy genotypes was observed. During the peak phase of the bloom at WE2 (4 August), the complete genotype coverage ratio was 1.58, indicating that mcy genes were more abundant than 16S rRNA genes at this time. At the offshore station (WE4), mcy genotypes were more mixed (Fig. 2A). The mcyC1 recombination variant was the most abundant (e.g., replacing the mcyB1 genotype) at all times and stations (Fig. 2B).

FIG 2.

FIG 2

Relative abundances of Microcystis mcy genotypes from the 2014 cyanoHAB by station and date. Relative abundances of genotypes were calculated by taking the ratio of the coverage of mcy genes to the coverage of the V4 16S rRNA gene. (A) Proportion of the Microcystis population that contained a complete, partial, or absent mcy operon by sample. Values above 1 indicate higher coverage of mcy genes than the 16S rRNA genes (see Materials and Methods). (B) Proportion of the Microcystis population that contains either the mcyB C1-like or B1-like variant. (C) Concentrations of particulate microcystins (MCs), nitrate (NO3), and phycocyanin (PC) per sample (micrograms per liter).

We next calculated the relative abundances of mcy genotypes in the 2018 HABs Grab (37) and the 2018 3G-ESP LRAUV (Den Uyl et al., submitted) data sets to assess dynamics over narrow spatial and temporal scales, respectively, compared to the 2014 data set. In every 2018 sample, the most abundant mcy genotype was the absent genotype (Fig. 3A). Among mcyB genes, the C1 variant was most abundant (Fig. 3B). Samples from the HABs Grab effort were compared based on their distances from the mouth of the Maumee River, the major source of nutrient runoff for western Lake Erie (38). The relative abundance of the complete mcy operon genotype was highest (0.42) in samples 37 to 47 km from the mouth of the Maumee River, peaking at 37.5 km (Fig. 3).

FIG 3.

FIG 3

Relative abundances of mcy genotypes for Microcystis populations sampled during the 2018 HABs Grab and 2018 ESP monitoring efforts. Relative abundances were calculated by taking the ratio of the coverage of mcy genes to the coverage of the V4 16S rRNA gene. The x axis is the distance from the Maumee River (kilometers) from which the sample was collected. (A) Proportion of the population containing a complete, partial, or absent mcy operon. (B) Proportion of the population containing the C1-like or B1-like mcyB variant for both the 2018 HABs Grab and 3G-ESP sampling efforts.

3G-ESP data from 2018 also showed low relative abundances of complete and partial mcy genotypes (<0.20) throughout the 2-week sampling period. While strong spatial or temporal trends were not observed during this period, variability was observed across small spatial scales (<0.5 km) (Fig. 3).

mcy genotypes and environmental variable correlations.

The relative abundances of mcy genotypes in the 2014 Microcystis population showed significant correlations with several environmental variables (Table 1; see also Table S1 in the supplemental material). The relative abundance of the complete mcy operon had significant positive correlations with pH (Pearson’s R = 0.51 [P = 0.05]) and nitrate concentrations (Pearson’s R = 0.62 [P = 0.013]) and a nonsignificant relationship with ammonium concentrations. The relative abundance of the partial mcy genotype was significantly and negatively correlated with both nitrate and ammonium concentrations (Pearson’s R = −0.53 [P = 0.044] and Pearson’s R = −0.52 [P = 0.048], respectively). The relative abundance of the absent genotype was significantly and negatively correlated with pH (Pearson’s R = −0.54 [P = 0.037]) and nonsignificantly and positively correlated with ammonium concentrations (Pearson’s R = 0.40 [P = 0.14]) (Table 1). The relative abundance of the C1/B1 mcyB variant showed a positive, significant correlation with dissolved organic carbon (DOC) (Pearson’s R = 0.580 [P = 0.022]) (Table S2).

TABLE 1.

Pearson’s correlations generated for the relative proportion of the Microcystis population that had a complete, partial, or absent mcy operon as well as the ratio of C1/B1 variants against the environmental variables pH, nitrate concentration, and ammonium concentration (n = 15)

mcy genotype category pH
Nitrate concn (mg/L)
Ammonium concn (μg/L)
R P value R P value R P value
Complete 0.513 0.05 0.621 0.013 −0.082 0.77
Partial −0.0318 0.91 −0.525 0.044 −0.52 0.048
Absent −0.541 0.037 −0.359 0.19 0.403 0.14

Microcystis strain diversity.

To determine if mcy genotypes were a result of the presence of multiple strains or recent gene transfer events, we calculated genome-wide strain variation of Microcystis in the 2014 cyanoHAB metagenomes based on single nucleotide variants (SNVs) (39). Consensus average nucleotide identity (conANI), a measure of SNVs between samples (see Materials and Methods), was calculated for Microcystis in all samples. Hierarchical clustering of conANI values based on pairwise comparisons showed that samples clustered based on the bloom phase (Fig. 4). ConANI values between samples ranged from 97.3014% to 99.7936% (Table S2). These ranges provide strong evidence that these samples consist of multiple strains as a value of 99.999% has been shown to be a sufficiently stringent cutoff for ANI measurements in identifying identical strains within metagenomic data sets (39). There was a clear distinction in sample clustering between the peak phase of the bloom, where the populations were predominantly composed of strains that contained the complete mcy operon, and the mid- to late phases, where the partial and absent genotypes were more prevalent. These results suggested that the Microcystis strains present during different bloom phases were distinct in terms of their genome-wide nucleotide similarity and that changes in mcy genotype relative abundances cannot be explained by recent gene transfer events.

FIG 4.

FIG 4

Hierarchical clustering of 2014 metagenomic samples based on pairwise conANI comparisons. Clustering indicates that the Microcystis strain composition is more similar in samples from the same bloom phase than in those from different phases that tend to have the same dominant genotype. The bloom phase described as “middle” in the illustration above refers to samples collected in August and September where the Microcystis biomass and microcystin concentration are still detectable yet past peak concentrations.

ConANI comparisons measure the difference in SNVs present between two samples and, thus, represent a minimum estimate of the number of mutations that distinguish the Microcystis strains in two samples. To evaluate whether the observed SNV differences between samples could feasibly arise from mutations formed during the course of a single bloom season, we used upper and lower limits of bacterial mutation rates (40) to estimate the divergence time required to produce the observed conANI values. The results indicated this to be unlikely because thousands to millions of years would be required to account for the observed genome-wide differences, and hundreds to thousands of years would be required to account for nucleotide differences in mcy genes between samples containing complete and partial genotypes (Table 2).

TABLE 2.

Estimations of times of divergence for Microcystis strains observed in the 2014 cyanoHABa

Sequence Mutation rate (no. of substitutions/site/yr) Time of divergence (yrs)
Whole genome 1.00E−04 1.06E+06–1.39E+07
1.00E−07 1.06E+09–1.39E+10
mcyB 1.00E−04 3.53E+02–1.24E+04
1.00E−07 3.53E+05–1.24E+07
mcyC 1.00E−04 3.00E+02–1.24E+04
1.00E−07 3.00E+02–1.24E+07
a

Mutation rates included in this table represent the upper and lower limits of mutation rates found in several bacterial genomes based on an analysis completed by Gibson and Eyre-Walker (40). Times of divergence were calculated for the samples that had the lowest and highest conANI scores for whole genomes and the mcyB and mcyC genes compared pairwise.

mcy gene expression.

Metatranscriptomic data were analyzed to track the relative abundances of mcy gene transcripts over time and space in samples from the 2014 bloom. The transcript relative abundance of the complete mcy operon was highest at nearshore station WE12 on all three dates sampled (Fig. 5). The transcript relative abundance was low (near detection limits) during the early phases of the bloom, as seen at nearshore station WE2 on 21 July and offshore station WE4 on 29 July. When the partial mcy operon was more prevalent in the population than the complete operon genotype (Fig. 2), the partial genes were expressed (Fig. 5), as observed at station WE2 on 6 October and at station WE4 on 8 September.

FIG 5.

FIG 5

Heatmaps showing the relative abundances of metatranscriptomic reads mapped to the mcy operon. Relative abundance is determined by calculating the coverage of the gene of interest (number of transcript reads mapped divided by gene length) divided by the coverage of transcripts mapped to the entire genome divided by genome length.

DISCUSSION

We used metagenomic and metatranscriptomic data to reveal the dynamics of Microcystis strains across a variety of spatial and seasonal scales in 2014 and 2018 blooms in western Lake Erie. The 2014 bloom presented the opportunity to investigate a seasonal succession of Microcystis strains alongside changing environmental conditions (35). An early toxic phase of the bloom (i.e., high concentrations of MCs) that coincided with high nitrate (nitrate-N > 400 μg/L) conditions (41) gave way to a nontoxic phase of the bloom with nitrate-depleted conditions (nitrate-N < 200 μg/L later in the season) (Fig. 2 and Table 1). In addition to the widely known toxigenic and nontoxigenic Microcystis genotypes (all mcy genes present and absent, respectively), a genotype with a partial set of genes (mcyA–C) was found to be abundant and expressed in the nontoxic phase of the bloom.

The significant correlations between mcy genotype relative abundance estimates and concentrations of ammonium and nitrate and pH (Fig. 2 and Table 1) suggest that nitrogen availability and the photosynthetic rate of the bloom, which drives extreme changes in pH (33), may influence the competitive fitness of each genotype and thus shape the Microcystis strain composition in the blooms. Microcystis also showed microdiversity across the whole genome. Nucleotide similarity between samples was clustered by bloom phase (Fig. 4), indicating that discrete subpopulations (7) may be ecologically distinct as well. Based on known mutation rates, genetic divergence between subpopulations was estimated to require at least thousands of years, indicating that the observed succession involved primarily ecological selection rather than the evolution of genotypes within the season.

Our approach for estimating the relative abundances of genotypes through coverage ratios of metagenomic sequence reads that mapped to the mcy gene and the 16S rRNA gene was subject to several uncertainties. First, whereas it is commonly assumed that Microcystis genomes contain one copy of the mcy and rRNA operons (42, 43), all nine closed genomes of Microcystis that were publicly available have two copies of the 16S rRNA gene and one copy of the mcy operon (44). The gene copy numbers of mcy and 16S rRNA genes in native populations are uncertain (45) and may vary from strain to strain. Variation in coverage and, thus, our estimates of relative abundance could be affected by both the mcy gene copy number per genome as well as the relative abundance of cells with the various mcy genotypes. Second, rapid genome replication can result in gene dose effects wherein genes near the origin of replication are present in higher copy numbers than those at the terminus (4648). The presence of mcy genes near the origin of replication or multiple copies of mcy genes per genome may explain our reported coverage ratio value of 1.58 at WE2 on 4 August 2014 (Fig. 2), but the available data cannot determine whether these factors were actually at play in these western Lake Erie populations. Third, the observed gene counts could be affected by nonspecific matches to V4 16S rRNA and mcy genes from other cyanobacteria (e.g., Dolichospermum) or homologous biosynthetic genes. However, we used stringent thresholds and aligned all mapped mcy and 16S rRNA gene sequence reads against a custom universal database and the Silva SSU DB v.138 database, respectively, to confirm that there was no significant spurious mapping. Despite these limitations, the relative abundances measured here track major shifts in genotypes across spatiotemporal scales, and our values are consistent with previous findings in western Lake Erie blooms (29). While the read-level analysis conducted here has long been used to study the population-level diversity of microbial communities (49, 50), it has not been widely applied to studies of cyanobacterial blooms.

The results from this study reveal that natural Microcystis populations consist of multiple mcy genotypes, some of which have been described previously from studies of pure cultures, such as the complete and absent genotypes (21, 22, 51). In this study, the Microcystis population was dominated at times by genotypes that lack mcy genes altogether (throughout 2018) (Fig. 3) or by a genotype in which only the mcyA–C genes were present (late September in 2014) (Fig. 2). This partial mcy genotype is present in just 1 of 159 publicly available Microcystis genomes derived from culture (PCC 9717; NCBI assembly ASM31216v1), which was isolated from a water dam in Rochereau, La Vendée, France. To our knowledge, this partial genotype has been briefly mentioned only once in the literature, by Pearson et al. (6), who identified it in a PCC 9717 culture. Interestingly, we detected the partial genotype across multiple locations and dates spanning 4 years (Fig. 2 and 3), indicating that it is a persistent member of the Microcystis bloom community in Lake Erie. The partial genotype was the most abundant during the secondary bloom phase of 2014 where conditions were nutrient depleted (29, 52). This persistence and occasional dominance, together with the active transcription of genes from this partial operon (Fig. 5), suggest that this strain is ecologically successful and that the partial mcy operon may be functional. The significant negative correlation of the partial genotype ratio estimate with both nitrate and ammonium concentrations (Table 1) suggests that it is adapted to conditions of low N. Experiments on pure cultures will be helpful in addressing open questions on the ecology, biosynthetic potential, and toxigenicity of this partial mcy genotype.

The complete mcy genotype often dominated in the early stages of the bloom under nitrate-replete conditions (Fig. 2A and C). Only the complete mcy genotype coverage ratio was positively correlated with nitrate concentrations (Table 1). This observation was consistent with microcystin-producing strains having a higher demand for nitrogen (41, 52, 53) due to the high N content of microcystin metabolites as predicted by nutrient stoichiometry theory (54, 55). Similarly, only the complete mcy genotype was positively correlated with pH (Table 1). This finding suggests that MC-producing strains were associated with increased photosynthetic rates and/or high pH relative to non-MC-producing strains. Increasing photosynthetic rates greatly increase the pH and serve as a good proxy for bloom activity (56, 57). This agrees with previous results showing that faster-growing strains tend to be MC producers (54) and that shifts in Microcystis genotypes correlated strongly with pH, with potential links to carbon acquisition and concentrating mechanisms (33). Lower pH generally indicates cyanoHAB communities dominated by slow growth and maintenance, which may be a specialty of nontoxigenic strains (54, 57).

Our results show that several different mcy variants, reflecting the mosaic structure of the mcy operon identified in pure cultures, co-occur in natural environments. The study by Mikalsen et al. (22) demonstrated that the sequence of the mcyB1 domain (e.g., either a B1-like or a C1-like genotype) influences the MC congener produced within cultured isolates. Microcystis strains with the B1-like genotype produced LR isoforms, while those with C1-like genotypes produced both LR and RR isoforms (22). We detected both the B1 and C1 genotypes in western Lake Erie populations, providing genetic context for the production of MC-LR and -RR, two of the most common congeners in this system (37, 58). The LR- and RR-producing C1 variant was the most abundant form of the mcyB gene in western Lake Erie during both 2014 (Fig. 2) and 2018 (Fig. 3) as well as 2006 (59), showing that these genotypes are present interannually. While our study focused on mcyB variants, it is likely that other genotypes are present in these populations, especially within the hyperdynamic mcyABC region.

Given the dynamic nature of Microcystis genomes (5) and evidence of genome rearrangement in response to environmental conditions (31), one competing hypothesis explaining seasonal trends in genotypic diversity is active horizontal gene transfer (HGT) or loss of mcy genes during the season. However, the distinct clustering of samples by bloom phase based on genome-wide conANI scores (Fig. 4) suggested that the seasonal dynamics of mcy genotypes were due to shifts in the abundances of Microcystis strains rather than ongoing evolution at the time scale of this study. If the differential coverage ratios of mcy genotypes were due to active HGT and/or gene loss, the same strains would be present at all times, and the conANI would be at least 99.999% (39); in contrast, we found that the values ranged from 97.3014% to 99.7936% (see Table S2 in the supplemental material). We conclude that the samples contained distinct strains that shifted in abundance over the course of the bloom and diverged on the order of thousands of to several million years based on whole-genome analysis and a wide range of mutation accumulation rates (Table 2). While conANI results varied across genes, as expected due to different evolutionary histories and forces (6, 2123), even the shortest estimated divergence times were on the scale of hundreds to thousands of years. Rather than HGT, the results from this study suggested that specific genotypes were favored during the bloom according to shifting niche space, with an emphasis on nitrogen form and availability as well as pH (Table 1).

The co-occurrence of various mcy genotypes in natural cyanobacterial bloom communities highlights that genotypes commonly observed in culture (21, 22, 51) as well as novel genotypes that have scarcely been mentioned in the literature (6) are common in natural populations. Our results show that this genotype diversity shifts dynamically in time and space and suggest that it is likely responsible in part, along with the direct influence of environmental conditions, for the diversity of structural variants of MC observed in nature. The presence of an abundant, transcriptionally active, partial mcy operon of unknown biosynthetic potential merits further investigation. Its presence further highlights the potential pitfalls of using single mcy genes to assess MC-producing strains, as mcyA–C will not distinguish partial and complete genotypes, and the use of mcyD or mcyE will not capture the presence of the partial genotype. For example, the Phytotoxigene (Akron, OH, USA) quantitative PCR (qPCR) screening kit, which is commonly used to detect MC-producing potential, detects only the mcyE gene. Such an approach would not detect the partial operon identified in this study. Overall, this study shows the utility of shotgun omics approaches for resolving known and novel mcy genotypes that can dominate natural blooms with the potential for new phenotypic outcomes of concern to human and animal health.

MATERIALS AND METHODS

Sample collection.

The 2014 cyanoHAB samples were collected from NOAA Great Lakes Environmental Research Laboratory (GLERL) sampling stations WE2, WE4, and WE12 (Fig. 6) throughout the western basin of Lake Erie from mid-June through late October 2014 (60). WE2 is close to the inlet for the Maumee River (41°45.743′N, 83°19.874′W), WE4 is considered an offshore site closer to the center of the basin (41°49.595′N, 83°11.698′W), and WE12 is proximal to the Toledo drinking water inlet (41°42.535′N, 83°14.989′W). Given the propensity of Microcystis to migrate vertically through the water column and occur at varying depths, 20-L integrated-depth samples were collected from each station for biological and chemical analyses. Integrated depths were defined as the surface of the water to 1 m above the lake floor. While samples were collected, measurements of pH and water temperature were collected as well. At the time of collection, GLERL also collected samples to measure various environmental variables, including nutrient and pigment concentrations. In order to capture Microcystis aggregates, 2 L of integrated-depth-collected water was filtered through a 100-μm polycarbonate mesh filter. The biomass on the filter was then collected and filtered onto a 0.22-μm filter. The biomass on the 0.22-μm filter was preserved in 1 mL of RNAlater (Invitrogen, Ambion) and placed on ice. Upon arrival at the laboratory, samples were stored in a −80°C freezer until DNA and RNA extractions could be completed.

FIG 6.

FIG 6

Map of western Lake Erie HAB long-term monitoring stations, 2018 HABs Grab sample locations, and 2018 3G-ESP sample collection points. Western Lake Erie long-term monitoring stations are denoted with large red circles, 2018 HABs Grab samples are green circles, and 3G-ESP sample locations are orange circles. (The Open Street Map [OSM] [https://wiki.osmfoundation.org/wiki/Main_Page] was used as the basis for the map.)

We also analyzed shotgun data from samples collected during the western Lake Erie 2018 HABs Grab and 2018 ESP sampling campaigns (Fig. 6). These studies have been outlined elsewhere (37; Den Uyl et al., submitted). Briefly, the 2018 HABs Grab aimed at achieving high spatial sampling on a single day. Samples (n = 100) were collected on 9 August 2018, with a wide range of accompanying environmental data. Integrated-depth water samples were collected using a 2-m-long tube sampler for an integrated sample from the surface to 2 m deep, and the sample water from the tube sampler was deposited into a clean 20-L bucket. Two liters of water was poured into a polyethylene terephthalate glycol (PETG) bottle per station and stored in dark coolers until processing was completed at the laboratory later that day. EXO2 sonde (YSI Inc., Yellow Springs, OH, USA) was used to measure sample parameters, including temperature, pH, specific conductivity, turbidity, and chlorophyll and phycocyanin fluorescence, within 60 s of collection using an EXO calibration cup. A subset of samples (n = 25) was selected for DNA extraction off 25-mm 1.2-μm-pore-size filters (Versapor acrylic 297 copolymer, part number 66393; Pall, Port Washington, NY, USA).

The 2018 3rd-generation environmental sampling processer (3G-ESP) sampling effort was performed over a 2-week period spanning 24 August to 2 September 2018, with an autonomous sampling vehicle. The 3G-ESP was integrated into the payload of a Tethys-class long-range autonomous underwater vehicle (LRAUV) (61, 62). Water samples were collected via an intake tube, pumped through an internal cartridge within the 3G-ESP with water filtered onto stacked 5-μm and 0.22-μm filters, preserved with RNAlater (Invitrogen, Ambion) on the vehicle, and archived for retrieval once the vehicle was returned to the laboratory. The 5-μm filters were used for subsequent metagenomic analysis. Samples were collected 2 to 6 m below the surface to ensure the complete submersion of the vehicle, and water was continuously collected until each filter was clogged (23 to 100 mL). Filter clogging was determined by a pressure sensor setting within the instrument.

Sample processing and sequencing.

The 2014 bloom DNA extractions were completed using DNeasy blood and tissue extraction kits with QIAshredder columns according to the protocol provided by the manufacturer (Qiagen, Hilden, Germany). The extracted DNA concentration was determined using the Quant-iT PicoGreen double-stranded (dsDNA) assay kit. RNA extractions were completed using the Qiagen (Hilden, Germany) RNeasy kit according to the manufacturer’s protocol. Shotgun DNA and RNA sequencing was completed at the University of Michigan DNA Sequencing Core. Sequencing was completed on the Illumina HiSeq platform (2000 PE 100; Illumina Inc., San Diego, CA, USA).

For the 25 samples from the 2018 HABs Grab sampling effort DNA was extracted, quantified, and sequenced in a prior study as described previously (37).

The 2018 ESP data samples were processed and sequenced according to the following protocol. Filters were recovered from the archived cartridges upon the completion of the 2-week sampling effort and stored at −80°C until further processing was completed. To complete DNA sequencing, filters were removed from the freezer and subjected to the Qiagen DNeasy blood and tissue extraction protocol using QIAshredder columns according to the manufacturer’s directions (Qiagen, Hilden, Germany). The purity and quantity of DNA were determined using a NanoDrop Lite spectrophotometer (Thermo Scientific, Waltham, MA, USA). DNA samples were submitted to the University of Michigan Advanced Genomics Core for genomic library construction and sequencing. Sequencing was completed on an Illumina NovaSeq 6000 system equipped with an S4 flow cell with 300 cycles (150-bp paired ends).

mcy operon analysis.

Trimming, adaptor removal, and quality check (QC) of raw reads were completed using a suite of tools from the IMG-JGI BBTools package (63). Reads that satisfied quality check parameters were saved and used for subsequent analysis. QC forward and reverse reads were mapped or aligned to sequences using BLAST v.2.8.1. (64).

Given that all Microcystis cells contain a 16S rRNA gene but not all Microcystis cells have mcy genes, we used the ratio of the coverages of these genes to estimate the relative abundance of Microcystis cells that have mcy genes out of the total Microcystis population. The coverage ratio of mcy to V4 16S rRNA gene regions for Microcystis was calculated by dividing the number of reads mapped to mcy genes by the number of reads mapped to the Microcystis 16S rRNA gene V4 variable region, normalized by gene lengths (lengths are recorded in Table S4 in the supplemental material). Mapping databases for both mcy and V4 16S rRNA genes can be accessed at https://github.com/ceyancey/mcyGenotypes-databases. Reads that mapped to all 10 mcy genes (Fig. 1) and 16S rRNA V4 for Microcystis were quantified and normalized by gene length, as shown by the following equation:

Coverage ratio (Microcystis mcy/16S)=number of reads mapped to mcy gene X/length of mcy gene X (bp)number of reads mapped to Microcystis 16S V4/length of Microcystis 16S V4 (bp)

The 16S mapping database was constructed using the V4 regions available for the phylum Cyanobacteria using the criteria of 95% identity and 80% alignment length, with sequences accessed from the SILVA 138.1 database in February 2021 (65). The use of these cutoffs and the V4 region was shown to be sensitive, specific, and sufficient for 16S rRNA gene mapping as the V4 variable region is more specific to the target organism than the entire 16S rRNA gene sequence. Mapping analysis also demonstrated that these cutoffs ensure that only Microcystis reads mapped to the V4 region. To identify mcy genes in the shotgun metagenomic data, all mcy gene sequences found in the JGI IMG/MER (66) database (accessed February 2021) were used, as were sequences from closely related cyanobacteria to maintain competitive mapping. Reads were kept for downstream analysis if they were at least 95% identical to the reference gene, 80% in alignment length, and the singular top hit per read. These cutoff parameters, database metrics, and mapping tools were tested exhaustively to ensure specificity and sensitivity by cross-checking matches to the full SILVA database and a universal database (https://github.com/TealFurnholm/Universal_Microbiomics_Alignment_Database), and extensive analysis was completed on the mapping method to ensure accuracy and remove any erroneous, nonspecific read mapping (data not shown).

Coverage ratios were then used to estimate the relative abundances of complete, partial, or absent mcy operon genotypes. The complete, partial, and absent genotypes were initially identified by viewing the metagenomic coverage of the PCC 7806 mcy operon (GenBank accession number AF183408.1) (see Fig. 1 for examples). First, the relative proportions of the complete operon were determined by averaging the coverage ratios (mcy gene/Microcystis V4 16S rRNA) of the population that contained the mcyD–J genes. Next, the relative abundance of the partial operon genotype was estimated by subtracting the average mcy gene/V4 16S rRNA coverage ratio of the Microcystis population that had the complete operon from the average ratio of the population that contained the mcyA–C genes. Finally, the relative abundance of the population that did not contain any mcy genes, i.e., the absent genotype, was estimated by subtracting the coverage ratio of the population that contained the complete genotype plus the coverage ratio that contained the partial genotype from 1 (representing the entire population).

The relative abundances of mcyB B1 and C1 variants were quantified as follows. The PCC 7806 mcy operon contains an mcyB region with a B1 variant; thus, reads that mapped to the B1 domain specifically were counted and used to calculate the proportion of the population that contained the B1 variant using the equation listed above. The mcyB1 domain region is between bp coordinates 2156 and 3111 as described previously by Mikalsen et al. (22). The estimated relative abundance of Microcystis that contained the C1 variant was then determined by subtracting the relative abundance of the B1 variant from the total portion that contained the mcyB gene.

mcy genotypes and environmental variable correlation analysis.

To assess the relationships between the mcy genotypes and environmental conditions, a series of simple linear regressions were generated for the 2014 cyanoHAB data. The 2018 HABs Grab and ESP data were excluded from this data set because these data sets did not pair with similar environmental condition measurements. Pearson’s correlations were generated for the estimates of the population that contained the complete, partial, or absent mcy genotype as well as the mcyB B1 or C1 variant against several environmental variables, including pH, the mass of nitrogen as measured through nitrate concentration (micrograms per liter), and ammonium concentration (micrograms per liter) (n = 15). For each correlation, Pearson’s correlation coefficient, r, was generated, as was a P value, to determine the strength of the correlation (α = 0.05) respectively.

mcy operon expression analysis.

In order to analyze mcy gene expression, metatranscriptomic reads were mapped to the reference mcy operon database used for metagenomic analysis for a select number of samples with paired metatranscriptomes from the 2014 cyanoHAB. BLAST was used with parameters identical to those used for metagenomic mapping. In order to normalize gene expression, metatranscriptomic reads were also mapped to the entire reference genome Microcystis aeruginosa PCC 7806SL (GenBank accession number CP020771.1; GI:1181755937). In order to achieve competitive genome mapping, this database contained complete genomes from Anabaena sp. strain 90 (accession number GCA_000312705.1), Cyanobium sp. strain NIES-981 (accession number GCA_900088535.1), and Pseudanabaena sp. strain PCC 7367 (accession number GCA_000317065.1), which are other common cyanobacterial taxa found in western Lake Erie cyanoHABs. The following equation was used to normalize metatranscriptomic reads per gene/entire Microcystis genome expression (67):

Expression (relative transcript abundance)=number of reads mapped per gene X/length of gene X (bp)total number of reads mapped to PCC 7806SL genome/length of PCC7806 SL genome (bp)

Strain variation detection.

In order to determine if samples consisted of identical or distinct strains of Microcystis, inStrain software (39) was implemented. Briefly, metagenomic reads were mapped to a database that contained reference genomes of commonly occurring cyanobacteria in western Lake Erie, Microcystis aeruginosa PCC 7806SL (GenBank accession number CP020771.1; GI:1181755937), Anabaena sp. 90 (accession number GCA_000312705.1), Cyanobium sp. NIES-981 (accession number GCA_900088535.1), and Pseudanabaena sp. PCC 7367 (accession number GCA_000317065.1), using bbmap (63) with a 90% identity threshold. The generated bam files were profiled using default parameters in inStrain, and profiles were compared using the compare function, with default settings. Measurements of average nucleotide identity (ANI), specifically consensus, or conANI, were used to determine whether samples contained identical or distinct strains of Microcystis as discussed previously by Olm et al. (39). Olm et al. describe a cutoff of 99.999% as the threshold for identical strains as being a stringent and sufficient metric for metagenomic data and that samples that share the same strain but had undergone a recent transmission event would still have an ANI score of at least 99.999% (39). Therefore, we used this cutoff to conclude if our samples were comprised of identical or different strains. Using the conANI metric, a similarity matrix of samples was rendered, and a dendrogram was generated to cluster populations by samples. The same method was applied to determine pairwise conANI values for the mcyB and mcyC genes as well. The sample collected at WE12 on 8 July was omitted as it did not pass the coverage thresholds described by Olm et al. for comparison.

To estimate the time of divergence between samples for whole-genome comparisons and mcy gene comparisons, a few basic calculations were completed. Mutation rates for the 16S rRNA gene of cyanobacteria have been calculated to be 5.7 × 10−7 substitutions/site/year for a Microcoleus operational taxonomic unit (OTU) and 2.3 × 10−7 substitutions/site/year for a Geitlerninema OTU (68). An analysis completed by Gibson and Eyre-Walker demonstrated that the majority of mutation rates analyzed for 34 bacterial species fell in the range of 1 × 10−7 to 1 × 10−6 substitutions/site/year (40). Since Segawa et al. (68) considered only a slowly evolving, highly conserved gene, the 16S rRNA gene, in their mutation rate calculations, and most of the genome mutation rates for bacteria analyzed by Gibson and Eyre-Walker fell between 1 × 10−7 and 1 × 10−6 substitutions/site/year, this range likely captures the accumulation rates observed in Microcystis present in western Lake Erie. However, since this rate has not yet been determined for Microcystis, we consider a hypermutation rate observed in a few bacteria that is on the order of 1 × 10−4 substitutions/site/year (40). Therefore, in order to consider a wide range of mutation accumulation rates, we report the time of divergence for the range of 1 × 10−7 to 1 × 10−4 substitutions/site/year. The time of divergence was completed for each pairwise comparison between samples for the whole genome as well as the mcyB and mcyC genes.

Figure creation and statistical analysis.

All statistical analyses and plots were created using R and R Studio 3.5.1 (69). The R packages ggplot2 (70), ggthemes (71), gridExtra (72), viridis (73), and ggpubr (74) aided in figure creation. Adobe Illustrator v25.1 (75) was used to generate schematic figure drawings. Maps were rendered in QGIS using the Quick Map Services plug-in (76). Metadata can be found in Table S3.

Data availability.

Sequences have been deposited in the NCBI database under the BioProject accession number PRJNA702128. Metagenomic and metatranscriptomic raw reads can be found in the NCBI database under the BioProject accession numbers PRJNA464361 and PRJNA370007, respectively.

ACKNOWLEDGMENTS

We thank Teal Furnholm, Robert Hein, and Luke Thompson for their bioinformatic support. We also thank Dack Stuart and Kent Baker for field sampling support and the NOAA Great Lakes Environmental Research Laboratory and the Cooperative Institute for Great Lakes Research for allowing us to sample alongside their HAB monitoring efforts and providing seasonal nutrient data.

This work was supported by NIH and NSF awards to the Great Lakes Center for Fresh Waters and Human Health (NIH award 1P01ES028939-01 and NSF award OCE-1840715) and the Cooperative Institute of Great Lakes Research (NA17OAR4320152). This CIGLR contribution number is 1191. HABs Grab research was supported by the NOAA ECOHAB award (NA17NOS4780186). Support was also provided by the NOAA OAR ‘Omics and NOAA OAR Ocean Technology Development Initiative.

We declare no conflict of interest.

Footnotes

Supplemental material is available online only.

Supplemental file 1
Tables S1 to S4. Download aem.02464-21-s0001.pdf, PDF file, 0.2 MB (190.1KB, pdf)

Contributor Information

Gregory J. Dick, Email: gdick@umich.edu.

Knut Rudi, Norwegian University of Life Sciences.

REFERENCES

  • 1.Harke MJ, Steffen MM, Gobler CJ, Otten TG, Wilhelm SW, Wood SA, Paerl HW. 2016. A review of the global ecology, genomics, and biogeography of the toxic cyanobacterium, Microcystis spp. Harmful Algae 54:4–20. 10.1016/j.hal.2015.12.007. [DOI] [PubMed] [Google Scholar]
  • 2.Huisman J, Codd GA, Paerl HW, Ibelings BW, Verspagen JM, Visser PM. 2018. Cyanobacterial blooms. Nat Rev Microbiol 16:471–483. 10.1038/s41579-018-0040-1. [DOI] [PubMed] [Google Scholar]
  • 3.Watson SB, Miller C, Arhonditsis G, Boyer GL, Carmichael W, Charlton MN, Confesor R, Depew DC, Höök TO, Ludsin SA, Matisoff G, McElmurry SP, Murray MW, Peter Richards R, Rao YR, Steffen MM, Wilhelm SW. 2016. The re-eutrophication of Lake Erie: harmful algal blooms and hypoxia. Harmful Algae 56:44–66. 10.1016/j.hal.2016.04.010. [DOI] [PubMed] [Google Scholar]
  • 4.Sandrini G, Cunsolo S, Schuurmans JM, Matthijs HCP, Huisman J. 2015. Changes in gene expression, cell physiology and toxicity of the harmful cyanobacterium Microcystis aeruginosa at elevated CO2. Front Microbiol 6:401. 10.3389/fmicb.2015.00401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Humbert J-F, Barbe V, Latifi A, Gugger M, Calteau A, Coursin T, Lajus A, Castelli V, Oztas S, Samson G, Longin C, Medigue C, de Marsac NT. 2013. A tribute to disorder in the genome of the bloom-forming freshwater cyanobacterium Microcystis aeruginosa. PLoS One 8:e70747. 10.1371/journal.pone.0070747. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Pearson LA, Crosbie ND, Neilan BA. 2020. Distribution and conservation of known secondary metabolite biosynthesis gene clusters in the genomes of geographically diverse Microcystis aeruginosa strains. Mar Freshw Res 71:701. 10.1071/MF18406. [DOI] [Google Scholar]
  • 7.Pérez-Carrascal OM, Terrat Y, Giani A, Fortin N, Greer CW, Tromas N, Shapiro BJ. 2019. Coherence of Microcystis species revealed through population genomics. ISME J 13:2887–2900. 10.1038/s41396-019-0481-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Martins J, Saker ML, Moreira C, Welker M, Fastner J, Vasconcelos VM. 2009. Peptide diversity in strains of the cyanobacterium Microcystis aeruginosa isolated from Portuguese water supplies. Appl Microbiol Biotechnol 82:951–961. 10.1007/s00253-009-1877-z. [DOI] [PubMed] [Google Scholar]
  • 9.Wiegand C, Pflugmacher S. 2005. Ecotoxicological effects of selected cyanobacterial secondary metabolites: a short review. Toxicol Appl Pharmacol 203:201–218. 10.1016/j.taap.2004.11.002. [DOI] [PubMed] [Google Scholar]
  • 10.Stewart I, Seawright AA, Shaw GR. 2008. Cyanobacterial poisoning in livestock, wild mammals and birds—an overview. Adv Exp Med Biol 619:613–637. 10.1007/978-0-387-75865-7_28. [DOI] [PubMed] [Google Scholar]
  • 11.Pouria S, De Andrade A, Barbosa J, Cavalcanti RL, Barreto VTS, Ward CJ, Preiser W, Poon GK, Neild GH, Codd GA. 1998. Fatal microcystin intoxication in haemodialysis unit in Caruaru, Brazil. Lancet 352:21–26. 10.1016/S0140-6736(97)12285-1. [DOI] [PubMed] [Google Scholar]
  • 12.Steffen MM, Davis TW, McKay RML, Bullerjahn GS, Krausfeldt LE, Stough JMA, Neitzey ML, Gilbert NE, Boyer GL, Johengen TH, Gossiaux DC, Burtner AM, Palladino D, Rowe MD, Dick GJ, Meyer KA, Levy S, Boone BE, Stumpf RP, Wynne TT, Zimba PV, Gutierrez D, Wilhelm SW. 2017. Ecophysiological examination of the Lake Erie Microcystis bloom in 2014: linkages between biology and the water supply shutdown of Toledo, OH. Environ Sci Technol 51:6745–6755. 10.1021/acs.est.7b00856. [DOI] [PubMed] [Google Scholar]
  • 13.Bouaïcha N, Miles CO, Beach DG, Labidi Z, Djabri A, Benayache NY, Nguyen-Quang T. 2019. Structural diversity, characterization and toxicology of microcystins. Toxins (Basel) 11:714. 10.3390/toxins11120714. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Tillett D, Dittmann E, Erhard M, Von Döhren H, Börner T, Neilan BA. 2000. Structural organization of microcystin biosynthesis in Microcystis aeruginosa PCC7806: an integrated peptide-polyketide synthetase system. Chem Biol 7:753–764. 10.1016/s1074-5521(00)00021-1. [DOI] [PubMed] [Google Scholar]
  • 15.Pearson LA, Dittmann E, Mazmouz R, Ongley SE, D’Agostino PM, Neilan BA. 2016. The genetics, biosynthesis and regulation of toxic specialized metabolites of cyanobacteria. Harmful Algae 54:98–111. 10.1016/j.hal.2015.11.002. [DOI] [PubMed] [Google Scholar]
  • 16.Rantala A, Fewer DP, Hisbergues M, Rouhiainen L, Vaitomaa J, Börner T, Sivonen K. 2004. Phylogenetic evidence for the early evolution of microcystin synthesis. Proc Natl Acad Sci USA 101:568–573. 10.1073/pnas.0304489101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Meyer KA, Davis TW, Watson SB, Denef VJ, Berry MA, Dick GJ. 2017. Genome sequences of lower Great Lakes Microcystis sp. reveal strain-specific genes that are present and expressed in western Lake Erie blooms. PLoS One 12:e0183859. 10.1371/journal.pone.0183859. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Dick GJ, Duhaime MB, Evans JT, Errera RM, Godwin CM, Kharbush JJ, Nitschky HS, Powers MA, Vanderploeg HA, Schmidt KC, Smith DJ, Yancey CE, Zwiers CC, Denef VJ. 2021. The genetic and ecophysiological diversity of Microcystis. Environ Microbiol 23:7278–7313. 10.1111/1462-2920.15615. [DOI] [PubMed] [Google Scholar]
  • 19.Otten TG, Paerl HW, Dreher TW, Kimmerer WJ, Parker AE. 2017. The molecular ecology of Microcystis sp. blooms in the San Francisco Estuary. Environ Microbiol 19:3619–3637. 10.1111/1462-2920.13860. [DOI] [PubMed] [Google Scholar]
  • 20.Tang X, Krausfeldt LE, Shao K, LeCleir GR, Stough JMA, Gao G, Boyer GL, Zhang Y, Paerl HW, Qin B, Wilhelm SW. 2018. Seasonal gene expression and the ecophysiological implications of toxic Microcystis aeruginosa blooms in Lake Taihu. Environ Sci Technol 52:11049–11059. 10.1021/acs.est.8b01066. [DOI] [PubMed] [Google Scholar]
  • 21.Tooming-Klunderud A, Mikalsen B, Kristensen T, Jakobsen KS. 2008. The mosaic structure of the mcyABC operon in Microcystis. Microbiology (Reading) 154:1886–1899. 10.1099/mic.0.2007/015875-0. [DOI] [PubMed] [Google Scholar]
  • 22.Mikalsen B, Boison G, Skulberg OM, Fastner J, Davies W, Gabrielsen TM, Rudi K, Jakobsen KS. 2003. Natural variation in the microcystin synthetase operon mcyABC and impact on microcystin production in Microcystis strains. J Bacteriol 185:2774–2785. 10.1128/JB.185.9.2774-2785.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Welker M, Von Döhren H. 2006. Cyanobacterial peptides—Nature’s own combinatorial biosynthesis. FEMS Microbiol Rev 30:530–563. 10.1111/j.1574-6976.2006.00022.x. [DOI] [PubMed] [Google Scholar]
  • 24.Makower AK, Schuurmans JM, Groth D, Zilliges Y, Matthijs HCP, Dittmann E. 2015. Transcriptomics-aided dissection of the intracellular and extracellular roles of microcystin in Microcystis aeruginosa PCC 7806. Appl Environ Microbiol 81:544–554. 10.1128/AEM.02601-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Tonk L, Van De Waal DB, Slot P, Huisman J, Matthijs HCP, Visser PM. 2008. Amino acid availability determines the ratio of microcystin variants in the cyanobacterium Planktothrix agardhii. FEMS Microbiol Ecol 65:383–390. 10.1111/j.1574-6941.2008.00541.x. [DOI] [PubMed] [Google Scholar]
  • 26.Puddick J, Prinsep M, Wood S, Kaufononga S, Cary S, Hamilton D. 2014. High levels of structural diversity observed in microcystins from Microcystis CAWBG11 and characterization of six new microcystin congeners. Mar Drugs 12:5372–5395. 10.3390/md12115372. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Liu J, Van Oosterhout E, Faassen EJ, Lürling M, Helmsing NR, Van de Waal DB. 2016. Elevated pCO2 causes a shift towards more toxic microcystin variants in nitrogen-limited Microcystis aeruginosa. FEMS Microbiol Ecol 92:fiv159. 10.1093/femsec/fiv159. [DOI] [PubMed] [Google Scholar]
  • 28.Pearson LA, Hisbergues M, Börner T, Dittmann E, Neilan BA. 2004. Inactivation of an ABC transporter gene, mcyH, results in loss of microcystin production in the cyanobacterium Microcystis aeruginosa PCC 7806. Appl Environ Microbiol 70:6370–6378. 10.1128/AEM.70.11.6370-6378.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Rinta-Kanto JM, Konopko EA, DeBruyn JM, Bourbonniere RA, Boyer GL, Wilhelm SW. 2009. Lake Erie Microcystis: relationship between microcystin production, dynamics of genotypes and environmental parameters in a large lake. Harmful Algae 8:665–673. 10.1016/j.hal.2008.12.004. [DOI] [Google Scholar]
  • 30.Otten TG, Graham JL, Harris TD, Dreher TW. 2016. Elucidation of taste- and odor-producing bacteria and toxigenic cyanobacteria in a Midwestern drinking water supply reservoir by shotgun metagenomic analysis. Appl Environ Microbiol 82:5410–5420. 10.1128/AEM.01334-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Steffen MM, Dearth SP, Dill BD, Li Z, Larsen KM, Campagna SR, Wilhelm SW. 2014. Nutrients drive transcriptional changes that maintain metabolic homeostasis but alter genome architecture in Microcystis. ISME J 8:2080–2092. 10.1038/ismej.2014.78. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Steffen MM, Li Z, Effler TC, Hauser LJ, Boyer GL, Wilhelm SW. 2012. Comparative metagenomics of toxic freshwater cyanobacteria bloom communities on two continents. PLoS One 7:e44002. 10.1371/journal.pone.0044002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Pound HL, Wilhelm SW. 2020. Tracing the active genetic diversity of Microcystis and Microcystis phage through a temporal survey of Taihu. PLoS One 15:e0244482. 10.1371/journal.pone.0244482. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Davis TW, Stumpf R, Bullerjahn GS, McKay RML, Chaffin JD, Bridgeman TB, Winslow C. 2019. Science meets policy: a framework for determining impairment designation criteria for large waterbodies affected by cyanobacterial harmful algal blooms. Harmful Algae 81:59–64. 10.1016/j.hal.2018.11.016. [DOI] [PubMed] [Google Scholar]
  • 35.Berry MA, Davis TW, Cory RM, Duhaime MB, Johengen TH, Kling GW, Marino JA, Den Uyl PA, Gossiaux D, Dick GJ, Denef VJ. 2017. Cyanobacterial harmful algal blooms are a biological disturbance to western Lake Erie bacterial communities. Environ Microbiol 19:1149–1162. 10.1111/1462-2920.13640. [DOI] [PubMed] [Google Scholar]
  • 36.Berry MA, White JD, Davis TW, Jain S, Johengen TH, Dick GJ, Sarnelle O, Denef VJ. 2017. Are oligotypes meaningful ecological and phylogenetic units? A case study of Microcystis in freshwater lakes. Front Microbiol 8:365. 10.3389/fmicb.2017.00365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Chaffin JD, Bratton JF, Verhamme EM, Bair HB, Beecher AA, Binding CE, Birbeck JA, Bridgeman TB, Chang X, Crossman J, Currie WJS, Davis TW, Dick GJ, Drouillard KG, Errera RM, Frenken T, MacIsaac HJ, McClure A, McKay RM, Reitz LA, Domingo JWS, Stanislawczyk K, Stumpf RP, Swan ZD, Snyder BK, Westrick JA, Xue P, Yancey CE, Zastepa A, Zhou X. 2021. The Lake Erie HABs Grab: a binational collaboration to characterize the western basin cyanobacterial harmful algal blooms at an unprecedented high-resolution spatial scale. Harmful Algae 108:102080. 10.1016/j.hal.2021.102080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Jankowiak J, Hattenrath-Lehmann T, Kramer BJ, Ladds M, Gobler CJ. 2019. Deciphering the effects of nitrogen, phosphorus, and temperature on cyanobacterial bloom intensification, diversity, and toxicity in western Lake Erie. Limnol Oceanogr 64:1347–1370. 10.1002/lno.11120. [DOI] [Google Scholar]
  • 39.Olm MR, Crits-Christoph A, Bouma-Gregson K, Firek BA, Morowitz MJ, Banfield JF. 2021. inStrain profiles population microdiversity from metagenomic data and sensitively detects shared microbial strains. Nat Biotechnol 39:727–736. 10.1038/s41587-020-00797-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Gibson B, Eyre-Walker A. 2019. Investigating evolutionary rate variation in bacteria. J Mol Evol 87:317–326. 10.1007/s00239-019-09912-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Gobler CJ, Burkholder JAM, Davis TW, Harke MJ, Johengen T, Stow CA, Van de Waal DB. 2016. The dual role of nitrogen supply in controlling the growth and toxicity of cyanobacterial blooms. Harmful Algae 54:87–97. 10.1016/j.hal.2016.01.010. [DOI] [PubMed] [Google Scholar]
  • 42.Vaitomaa J, Rantala A, Halinen K, Rouhiainen L, Tallberg P, Mokelke L, Sivonen K. 2003. Quantitative real-time PCR for determination of microcystin synthetase E copy numbers for Microcystis and Anabaena in lakes. Appl Environ Microbiol 69:7289–7297. 10.1128/AEM.69.12.7289-7297.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Davis TW, Berry DL, Boyer GL, Gobler CJ. 2009. The effects of temperature and nutrients on the growth and dynamics of toxic and non-toxic strains of Microcystis during cyanobacteria blooms. Harmful Algae 8:715–725. 10.1016/j.hal.2009.02.004. [DOI] [Google Scholar]
  • 44.Smith DJ, Tan JY, Powers MA, Lin XN, Davis TW, Dick GJ. 2021. Individual Microcystis colonies harbour distinct bacterial communities that differ by Microcystis oligotype and with time. Environ Microbiol 23:3020–3036. 10.1111/1462-2920.15514. [DOI] [PubMed] [Google Scholar]
  • 45.Beversdorf LJ, Rude K, Weirich CA, Bartlett SL, Seaman M, Kozik C, Biese P, Gosz T, Suha M, Stempa C, Shaw C, Hedman C, Piatt JJ, Miller TR. 2018. Analysis of cyanobacterial metabolites in surface and raw drinking waters reveals more than microcystin. Water Res 140:280–290. 10.1016/j.watres.2018.04.032. [DOI] [PubMed] [Google Scholar]
  • 46.Roth JR, Benson N, Galitski T, Haack K, Lawrence JG, Miesel L. 1996. Rearrangements of the bacterial chromosome: formation and applications, p 2256–2276. In Neidhardt FC, Curtiss R, III, Ingraham JL, Lin ECC, Low KB, Magasanik B, Reznikoff WS, Riley M, Schaechter M, Umbarger HE (ed), Escherichia coli and Salmonella: cellular and molecular biology, 2nd ed. ASM Press, Washington, DC. [Google Scholar]
  • 47.Rocha EPC. 2004. The replication-related organization of bacterial genomes. Microbiology (Reading) 150:1609–1627. 10.1099/mic.0.26974-0. [DOI] [PubMed] [Google Scholar]
  • 48.Couturier E, Rocha EPC. 2006. Replication-associated gene dosage effects shape the genomes of fast-growing bacteria but only for transcription and translation genes. Mol Microbiol 59:1506–1518. 10.1111/j.1365-2958.2006.05046.x. [DOI] [PubMed] [Google Scholar]
  • 49.Tyson GW, Chapman J, Hugenholtz P, Allen EE, Ram RJ, Richardson PM, Solovyev VV, Rubin EM, Rokhsar DS, Banfield JF. 2004. Community structure and metabolism through reconstruction of microbial genomes from the environment. Nat 428:37–43. 10.1038/nature02340. [DOI] [PubMed] [Google Scholar]
  • 50.Allen EE, Banfield JF. 2005. Community genomics in microbial ecology and evolution. Nat Rev Microbiol 3:489–498. 10.1038/nrmicro1157. [DOI] [PubMed] [Google Scholar]
  • 51.Fewer DP, Tooming-Klunderud A, Jokela J, Wahlsten M, Rouhiainen L, Kristensen T, Rohrlack T, Jakobsen KS, Sivonen K. 2008. Natural occurrence of microcystin synthetase deletion mutants capable of producing microcystins in strains of the genus Anabaena (Cyanobacteria). Microbiology (Reading) 154(Part 4):1007–1014. 10.1099/mic.0.2007/016097-0. [DOI] [PubMed] [Google Scholar]
  • 52.Chaffin JD, Davis TW, Smith DJ, Baer MM, Dick GJ. 2018. Interactions between nitrogen form, loading rate, and light intensity on Microcystis and Planktothrix growth and microcystin production. Harmful Algae 73:84–97. 10.1016/j.hal.2018.02.001. [DOI] [PubMed] [Google Scholar]
  • 53.Davis TW, Harke MJ, Marcoval MA, Goleski J, Orano-Dawson C, Berry DL, Gobler CJ. 2010. Effects of nitrogenous compounds and phosphorus on the growth of toxic and non-toxic strains of Microcystis during cyanobacterial blooms. Aquat Microb Ecol 61:149–162. 10.3354/ame01445. [DOI] [Google Scholar]
  • 54.Van De Waal DB, Verspagen JMH, Lürling M, Van Donk E, Visser PM, Huisman J. 2009. The ecological stoichiometry of toxins produced by harmful cyanobacteria: an experimental test of the carbon-nutrient balance hypothesis. Ecol Lett 12:1326–1335. 10.1111/j.1461-0248.2009.01383.x. [DOI] [PubMed] [Google Scholar]
  • 55.Van de Waal DB, Smith VH, Declerck SAJ, Stam ECM, Elser JJ. 2014. Stoichiometric regulation of phytoplankton toxins. Ecol Lett 17:736–742. 10.1111/ele.12280. (Erratum, 18:983–984, 2015, .) [DOI] [PubMed] [Google Scholar]
  • 56.Kotak BG, Lam AK-Y, Prepas EE, Hrudey SE. 2000. Role of chemical and physical variables in regulating microcystin-LR. Can J Fish Aquat Sci 57:1584–1593. 10.1139/f00-091. [DOI] [Google Scholar]
  • 57.Wicks RJ, Thiel PG. 1990. Environmental factors affecting the production of peptide toxins in floating scums of the cyanobacterium Microcystis aeruginosa in a hypertrophic African reservoir. Environ Sci Technol 24:1413–1418. 10.1021/es00079a017. [DOI] [Google Scholar]
  • 58.Palagama DSW, Baliu-Rodriguez D, Snyder BK, Thornburg JA, Bridgeman TB, Isailovic D. 2020. Identification and quantification of microcystins in western Lake Erie during 2016 and 2017 harmful algal blooms. J Great Lakes Res 46:289–301. 10.1016/j.jglr.2020.01.002. [DOI] [Google Scholar]
  • 59.Dyble J, Fahnenstiel GL, Litaker RW, Millie DF, Tester PA. 2008. Microcystin concentrations and genetic diversity of Microcystis in the lower Great Lakes. Environ Toxicol 23:507–516. 10.1002/tox.20370. [DOI] [PubMed] [Google Scholar]
  • 60.Cooperative Institute for Great Lakes Research, University of Michigan; NOAA Great Lakes Environmental Research Laboratory. 2019. Physical, chemical, and biological water quality monitoring data to support detection of Harmful Algal Blooms (HABs) in western Lake Erie, collected by the Great Lakes Environmental Research Laboratory and the Cooperative Institute for Great Lakes Research since 2012. NOAA National Centers for Environmental Information. Dataset. 10.25921/11da-3x54. Accessed 5 April, 2022. [DOI] [Google Scholar]
  • 61.Yamahara KM, Preston CM, Birch J, Walz K, Marin R, Jensen S, Pargett D, Roman B, Ussler W, Zhang Y, Ryan J, Hobson B, Kieft B, Raanan B, Goodwin KD, Chavez FP, Scholin C. 2019. In situ autonomous acquisition and preservation of marine environmental DNA using an autonomous underwater vehicle. Front Mar Sci 6:373. 10.3389/fmars.2019.00373. [DOI] [Google Scholar]
  • 62.Zhang Y, Ryan JP, Hobson BW, Kieft B, Romano A, Barone B, Preston CM, Roman B, Raanan B-Y, Pargett D, Dugenne M, White AE, Freitas FH, Poulos S, Wilson ST, DeLong EF, Karl DM, Birch JM, Bellingham JG, Scholin CA. 2021. A system of coordinated autonomous robots for Lagrangian studies of microbes in the oceanic deep chlorophyll maximum. Sci Robot 6:eabb9138. 10.1126/scirobotics.abb9138. [DOI] [PubMed] [Google Scholar]
  • 63.Bushnell B. 2014. BBMap: a fast, accurate, splice-aware aligner. Abstr 9th Annu Genomics Energy Environ Meet, Walnut Creek, CA, 17 to 20 March 2014.
  • 64.Madden T. 2013. The BLAST sequence analysis tool. In The NCBI handbook, 2nd ed. NCBI, Bethesda, MD. [Google Scholar]
  • 65.Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glöckner FO. 2013. The SILVA ribosomal RNA gene database project: improved data processing and Web-based tools. Nucleic Acids Res 41:D590–D596. 10.1093/nar/gks1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Chen IMA, Chu K, Palaniappan K, Ratner A, Huang J, Huntemann M, Hajek P, Ritter S, Varghese N, Seshadri R, Roux S, Woyke T, Eloe-Fadrosh EA, Ivanova NN, Kyrpides NC. 2021. The IMG/M data management and analysis system v.6.0: new tools and advanced capabilities. Nucleic Acids Res 49:D751–D763. 10.1093/nar/gkaa939. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Dick GJ. 2018. Genomic approaches in earth and environmental sciences, 1st ed. John Wiley & Sons Ltd, Hoboken, NJ. [Google Scholar]
  • 68.Segawa T, Takeuchi N, Fujita K, Aizen VB, Willerslev E, Yonezawa T. 2018. Demographic analysis of cyanobacteria based on the mutation rates estimated from an ancient ice core. Heredity (Edinb) 120:562–573. 10.1038/s41437-017-0040-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Allaire JJ. 2015. RStudio: integrated development environment for R.
  • 70.Wickham H. 2016. ggplot2: elegant graphics for data analysis. Springer-Verlag, New York, NY. https://ggplot2.tidyverse.org. [Google Scholar]
  • 71.Arnold JB. 2021. ggthemes: extra themes, scales and geoms for ‘ggplot2’. R package version 4.2.4. https://CRAN.R-project.org/package=ggthemes.
  • 72.Auguie B. 2017. gridExtra: miscellaneous functions for “grid” graphics. R package version 2.3. https://CRAN.R-project.org/package=gridExtra.
  • 73.Garnier S, Ross N, Rudis R, Camargo AP, Sciaini M, Scherer C. 2021. Rvision—colorblind-friendly color maps for R. R package version 0.6.1.
  • 74.Kassambara A. 2020. ggpubr: ‘ggplot2’ based publication ready plots. R package version 0.4.0. https://CRAN.R-project.org/package=ggpubr.
  • 75.Adobe Inc. 2019. Adobe illustrator. Adobe Inc, San Jose, CA. https://adobe.com/products/illustrator. [Google Scholar]
  • 76.QGIS Development Team. 2020. QGIS geographic information system. Open Source Geospatial 742 Foundation Project. http://qgis.osgeo.org.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental file 1

Tables S1 to S4. Download aem.02464-21-s0001.pdf, PDF file, 0.2 MB (190.1KB, pdf)

Data Availability Statement

Sequences have been deposited in the NCBI database under the BioProject accession number PRJNA702128. Metagenomic and metatranscriptomic raw reads can be found in the NCBI database under the BioProject accession numbers PRJNA464361 and PRJNA370007, respectively.


Articles from Applied and Environmental Microbiology are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES