Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2020 Jan 16;9(3):e01086-19. doi: 10.1128/MRA.01086-19

Draft Genome Sequence of the Non-Microcystin-Producing Microcystis aeruginosa Strain KLA2, Isolated from a Freshwater Reservoir in Northern California, USA

Pia H Moisander a,b,, Mari Ochiai b,c
Editor: Jason E Stajichd
PMCID: PMC6965573  PMID: 31948955

We report a draft genome sequence for Microcystis aeruginosa KLA2. The total draft genome size is 5,213,465 bp with a GC content of 42.5%. The genome does not have genes indicative of microcystin production but does contain genes indicative of production of several other secondary metabolites.

ABSTRACT

We report a draft genome sequence for Microcystis aeruginosa KLA2. The total draft genome size is 5,213,465 bp with a GC content of 42.5%. The genome does not have genes indicative of microcystin production but does contain genes indicative of production of several other secondary metabolites.

ANNOUNCEMENT

Strain KLA2 of the unicellular colony- and bloom-forming cyanobacterium Microcystis aeruginosa was isolated from the Copco Reservoir on the Klamath River, California (41.984N, −122.325W), in July 2007. Water was spread on BG11 agar (1) with a reduced nitrate content of 1.76 mM (BG11N−), and plates were grown under a 15-h/9-h light/dark cycle. Picked colonies were subsequently grown in liquid BG11N−. The strain was considered a non-microcystin producer, as the mcyA gene (encoding part of the microcystin gene cluster) did not amplify (2) and microcystin was never detected in multiple tests conducted during the years before and after genome sequencing (enzyme-linked immunosorbent assay [ELISA] kit; Envirologix, Portland, ME, USA).

Strain KLA2 was first purified using fluorescence-activated cell sorting (BD Influx; Beckton Dickinson, USA). To prepare the culture for sequencing, cells were agitated briefly in a bead beater in tubes without beads to break up colonies and to collapse gas vacuoles. Cells were then filtered through a 10-μm filter to reduce aggregation. Cells (10,000 total) were then flow-sorted to nuclease-free water and frozen. The sorted cells directly underwent whole-genome amplification as described previously (REPLI-g midikit; Qiagen, USA) (3). Library preparation and sequencing were conducted at the University of California Santa Cruz Genome Sequencing Center on the Roche 454 Titanium platform (∼450-bp reads), and a subsequent SOLiD mate pair run was conducted to improve the assembly (sequencing was conducted in 2009 and 2010, respectively). The 659,000 454 reads (246 Mb total) were assembled with Newbler (454 Life Sciences, Branford, CT, USA), producing 810 contigs (497 contigs of >500 bp). The SOLiD mate pair sequencing yielded 76 million reads (2 × 50 bp per read, 2- to 3-kb insert size); these reads were mapped to the 454 contigs using LifeScope 2.5.1 (Thermo Fisher) with default parameters (Life Technologies Bioinformatics Service). PCR duplicates likely resulting from library generation were filtered out during the alignment, with only the primary alignment with mapping quality of 10 or greater considered. The mapping resulted in 15 million reads (7.5 million mate pairs). Application of MIPS Scaffolder 0.6 (4) resulted in 251 scaffolds of >500 bp with a total length of 5,213,465 bp, an N50 value of 30,960 bp (5), and a GC content of 42.5%. For scaffolding, the inferred median insert size (2,167 ± 581 bp [interquartile range, 1,586 to 2,748 bp]) determined from the data was used for the mate pairs that mapped to different contigs.

Annotation was conducted with the Prokaryotic Genome Annotation Pipeline (PGAP) (6) on the version submitted to NCBI. The annotation indicated 5,215 and 4,634 total and protein-coding genes, respectively; 1 each of 5S, 16S, and 23S rRNA; and 41 tRNAs. A comparative annotation using Rapid Annotation of microbial genomes using Subsystems Technology (RAST) 2.0 (7) identified 5,841 coding sequences and 263 subsystems. Based on both annotations, genes homologous to nitrate and urea ABC transporters and urease alpha-, beta-, and gamma-units were present, suggesting the capacity for using these nitrogen sources. Both annotations also identified genes homologous for gas vesicle production, suggesting a capacity for surface bloom formation.

A total of 9 Cas gene clusters and 35 CRISPR spacer regions with up to 8 CRISPR spacer repeats per region were detected with the CRISPRCasFinder (8). The KLA2 draft genome sequence has 95.96 and 97.12% average nucleotide identity (two-way) (9) with the genomes of M. aeruginosa NIES-843 (10) and PCC 7806 (11), respectively, which both contain the microcystin gene cluster. Microcystin genes were not found in KLA2. Several other entire or partial secondary metabolite biosynthetic gene clusters were found using antiSMASH 5.0 with default parameters (Table 1) (12). The potential for secondary metabolite production in a non-microcystin-producing strain should contribute to investigations of the evolution and roles of secondary metabolites in cyanobacteria (13) and their potential impacts on aquatic ecosystems (14).

TABLE 1.

Secondary metabolite biosynthetic gene clusters in the M. aeruginosa KLA2 draft genome sequence

Product typea Most similar known cluster Similarity (%)
Cyanobactin Piricyclamide 58b
Cyanobactin Piricyclamide 25b
Cyanobactin, LAP Aeruginosamide 100
Lanthipeptide, bacteriocin NAc NAb
Bacteriocin Yersiniabactin 2b
Microviridin Microviridin B 100
hglE-KS NA NAb
T1PKS Cylindrocyclophane 1b
T1PKS Puwainaphycins 30b
T1PKS Bartoloside 2/bartoloside 3/bartoloside 4 27b
T3PKS, T1PKS Merocyclophane C/merocyclophane D 44b
NRPS Aeruginosin 85b
NRPS Micropeptin 100b
NRPS, beta-lactone Anabaenopeptin 100b
Resorcinol Bartoloside 2/bartoloside 3/bartoloside 4 45
Terpene NA NAb
Terpene NA NA
Terpene NA NAb
a

LAP, linear azol(in)e-containing peptides; hglE-KS, heterocyst glycolipid synthase-like PKS; T1PKS, type 1 polyketide synthase; NRPS, nonribosomal peptide synthetase cluster.

b

Region of the gene cluster was on the contig edge, which may have reduced the similarity values of <100%.

c

NA, not available.

Data availability.

This whole-genome shotgun project has been deposited at DDBJ/ENA/GenBank under the accession number VTRR00000000. The version described in this paper is version VTRR01000000. The study has been deposited under BioProject number PRJNA561215 and BioSample number SAMN12611847. The raw sequence reads are in the Sequence Read Archive under the numbers SRR10053317 and SRR10079464.

ACKNOWLEDGMENTS

Funding was provided by the Klamath Watershed Institute and University of Massachusetts Dartmouth.

We thank Brandon Carter and Shellie Bench for technical assistance and Jonathan Zehr for sharing laboratory resources.

REFERENCES

  • 1.Rippka R, Deruelles J, Waterbury JB, Herdman M, Stanier RY. 1979. Generic assignments, strain histories and properties of pure cultures of cyanobacteria. J Gen Microbiol 111:1–61. doi: 10.1099/00221287-111-1-1. [DOI] [Google Scholar]
  • 2.Moisander PH, Lehman PW, Ochiai M, Corum S. 2009. Diversity of Microcystis aeruginosa in the Klamath River and San Francisco Bay delta, California, USA. Aquat Microb Ecol 57:19–31. doi: 10.3354/ame01320. [DOI] [Google Scholar]
  • 3.Bench SR, Ilikchyan IN, Tripp HJ, Zehr JP. 2011. Two strains of Crocosphaera watsonii with highly conserved genomes are distinguished by strain-specific features. Front Microbiol 2:261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Salmela L, Mäkinen V, Välimäki N, Ylinen J, Ukkonen E. 2011. Fast scaffolding with small independent mixed integer programs. Bioinformatics 27:3259–3265. doi: 10.1093/bioinformatics/btr562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, Edwards RA, Gerdes S, Parrello B, Shukla M, Vonstein V, Wattam AR, Xia F, Stevens R. 2014. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res 42:D206–D214. doi: 10.1093/nar/gkt1226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Couvin D, Bernheim A, Toffano-Nioche C, Touchon M, Michalik J, Neron B, Rocha EPC, Vergnaud G, Gautheret D, Pourcel C. 2018. CRISPRCasFinder, an update of CRISRFinder, includes a portable version, enhanced performance and integrates search for Cas proteins. Nucleic Acids Res 46:W246–W251. doi: 10.1093/nar/gky425. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Rodriguez-R L, Konstantinidis KT. 2016. The enveomics collection: a toolbox for specialized analyses of microbial genomes and metagenomes. PeerJ Prepr 4:e1900v1901 https://peerj.com/preprints/1900/. [Google Scholar]
  • 10.Kaneko T, Nakajima N, Okamoto S, Suzuki I, Tanabe Y, Tamaoki M, Nakamura Y, Kasai F, Watanabe A, Kawashima K, Kishida Y, Ono A, Shimizu Y, Takahashi C, Minami C, Fujishiro T, Kohara M, Katoh M, Nakazaki N, Nakayama S, Yamada M, Tabata S, Watanabe MM. 2007. Complete genomic structure of the bloom-forming toxic cyanobacterium Microcystis aeruginosa NIES-843. DNA Res 14:247–256. doi: 10.1093/dnares/dsm026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Frangeul L, Quillardet P, Castets AM, Humbert JF, Matthijs HCP, Cortez D, Tolonen A, Zhang CC, Gribaldo S, Kehr JC, Zilliges Y, Ziemert N, Becker S, Talla E, Latifi A, Billault A, Lepelletier A, Dittmann E, Bouchier C, de Marsac NT. 2008. Highly plastic genome of Microcystis aeruginosa PCC 7806, a ubiquitous toxic freshwater cyanobacterium. BMC Genomics 9:274. doi: 10.1186/1471-2164-9-274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Blin K, Shaw S, Steinke K, Villebro R, Ziemert N, Lee SY, Medema MH, Weber T. 2019. antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline. Nucleic Acids Res 47:W81–W87. doi: 10.1093/nar/gkz310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kehr J-C, Picchi DG, Dittmann E. 2011. Natural product biosyntheses in cyanobacteria: a treasure trove of unique enzymes. Beilstein J Org Chem 7:1622–1635. doi: 10.3762/bjoc.7.191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Le Manach S, Sotton B, Huet H, Duval C, Paris A, Marie A, Yepremian C, Catherine A, Matheron L, Vinh J, Edery M, Marie B. 2018. Physiological effects caused by microcystin-producing and nonmicrocystin producing Microcystis aeruginosa on medaka fish: a proteomic and metabolomic study on liver. Environ Pollut 234:523–537. doi: 10.1016/j.envpol.2017.11.011. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

This whole-genome shotgun project has been deposited at DDBJ/ENA/GenBank under the accession number VTRR00000000. The version described in this paper is version VTRR01000000. The study has been deposited under BioProject number PRJNA561215 and BioSample number SAMN12611847. The raw sequence reads are in the Sequence Read Archive under the numbers SRR10053317 and SRR10079464.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES