ABSTRACT
We report the genomic DNA sequence of the actinomycete Streptomyces auratus DSM 41897. The genome assembled as two linear chromosomal contigs (4,434,406 bp and 3,992,024 bp) and a circular plasmid (12,008 bp), with an overall GC content of 71.2%, and 33 predicted biosynthetic gene clusters for potential secondary metabolites.
KEYWORDS: Streptomyces auratus, genome, antibiotics
ANNOUNCEMENT
The Actinomycetota phylum is a source of diverse bacteria known to produce highly sought-after secondary metabolites for clinical and industrial use (1). Within this group, Streptomyces have particularly been exploited for their production of antibiotics and other bioactive compounds (2, 3). The type strain Streptomyces auratus DSM 41897 was sequenced as part of the “Sequencing the genomes of 1,000 Actinobacteria strains” project at the U.S. Department of Energy (DOE) Joint Genome Institute to provide insight into the pharmacological potential of this species.
S. auratus DSM 41897 was provided by the Leibniz Institute DSMZ and cultured in Trypticase soy broth (DSMZ medium 535) at 28°C for 3 days with shaking at 120 rpm (4). Genomic DNA was extracted using the MasterPure gram-positive DNA Purification Kit (Biosearch Technologies). Genomic DNA was sheared to 10 kb using Covaris g-Tube. An SMRTbell template library was sequenced on a Pacific Biosciences (PacBio) Sequel II sequencer using 8M 1 SMRT cells and Version 1.0 sequencing chemistry with 1 × 900 sequencing movie run times (5). PacBio’s SMRT Link software performed quality control and adapter trimming under default conditions. Read error correction was performed by the Flye assembler (6). A total of 5,108,190 filtered subreads (read N50 = 5855 bases) were assembled using Flye v.2.8.1 under the following settings: –asm-coverage 100 –plasmid –genome-size 5000000 t 64 –pacbio-raw (fasta list). The resulting assembly totaled 8,438,454 bp (71.2% GC) in three contigs: a 4,434,406 bp linear chromosomal contig, a 3,992,024 bp linear chromosomal contig, and a 12,008 bp circular plasmid. Circularity of the plasmid assembly was confirmed with Circlator (7). The average coverage was 2,279×. CheckM2 was used to assess genome completeness (100%) and possible contamination (0.32%) (8).
The S. auratus DSM 41897 genome was annotated by the integrated microbial genomes genome annotation pipeline version 5.0.23 (9). A total of 7,454 genes were identified, including 7,300 protein-coding genes, 19 rRNA genes, and 71 tRNA genes. The sequence showed 98.6% average nucleotide identity (10) to the S. auratus AGR0001 draft genome sequence (GenBank accession no. CP072931) (11), but the DSM 41897 assembly is consolidated to three scaffolds, compared to 238 for AGR0001. Accurate prediction of biosynthetic gene clusters (BGCs) is facilitated by high-quality genome assemblies (12). BGCs in S. auratus DSM 41897 were identified using antiSMASH v.7.1.0 (13) with default software parameters. The genome contains 33 high-confidence BGCs (Table 1) (14), including nine with 80% or greater similarity to known clusters in the MIBig v.4.0 database (15). These BGCs may encode bioactive compounds with potential for biotechnological research applications.
TABLE 1.
Biosynthetic gene cluster distribution of S. auratus DSM 41897
| BGC class | Total | Products of most similar known cluster(s) |
|---|---|---|
| Non-ribosomally synthesized peptides | 1 | No prediction |
| Polyketides | 9 | iso-migrastatin/migrastatin/dorrigocin A/dorrigocinB/13-epi-dorrigocin A; naringenin;https://mibig.secondarymetabolites.org/go/BGC0000242/1lysolipin I;https://mibig.secondarymetabolites.org/go/BGC0000123/1phoslactomycin B;https://mibig.secondarymetabolites.org/go/BGC0002051/1antipain |
| Ribosomally synthesized and post- translationally modified peptides |
6 | radamycin/globimycin;https://mibig.secondarymetabolites.org/go/BGC0001550/1citrulassin D |
| Terpenes | 3 | No prediction |
| Other | 10 | legonoxamine A/desferrioxamine B/legonoxamine B,https://mibig.secondarymetabolites.org/go/BGC0002052/1Ectoine |
| Hybrid | 4 | Ulleungdin |
ACKNOWLEDGMENTS
The work (proposal DOI [https://doi.org/10.46936/10.25585/60001044]) conducted by the U.S. Department of Energy Joint Genome Institute (https://ror.org/04xm1d337), a DOE Office of Science User Facility, is supported by the Office of Science of the U.S. Department of Energy operated under contract no. DE-AC02-05CH11231.
This announcement was largely prepared by undergraduate students, and we gratefully acknowledge JGI for initiating and supporting it as an educational project (the “Adopt-a-Genome” Project).
Contributor Information
Craig Stephens, Email: cstephens@scu.edu.
Julie C. Dunning Hotopp, University of Maryland School of Medicine, Baltimore, Maryland, USA
DATA AVAILABILITY
The genome of Streptomyces auratus DSM 41897 is deposited in GenBank under accession number NZ_JBNGCB000000000, with the respective annotated DNA sequences available as JBNGCB010000001.1, JBNGCB010000002.1, and JBNGCB010000003.1. Raw data are available as SRA accession numberhttps://www.ncbi.nlm.nih.gov/sra/SRX7194182%5Baccn%5DSRX7194182.
REFERENCES
- 1. Seshadri R, Roux S, Huber KJ, Wu D, Yu S, Udwary D, Call L, Nayfach S, Hahnke RL, Pukall R, et al. 2022. Expanding the genomic encyclopedia of Actinobacteria with 824 isolate reference genomes. Cell Genomics 2:100213. doi: 10.1016/j.xgen.2022.100213 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Goodfellow M, Kumar Y, Labeda DP, Sembiring L. 2007. The Streptomyces violaceusniger clade: a home for streptomycetes with rugose ornamented spores. Antonie Van Leeuwenhoek 92:173–199. doi: 10.1007/s10482-007-9146-6 [DOI] [PubMed] [Google Scholar]
- 3. Caicedo-Montoya C, Manzo-Ruiz M, Ríos-Estepa R. 2021. Pan-genome of the genus Streptomyces and prioritization of biosynthetic gene clusters with potential to produce antibiotic compounds. Front Microbiol 12:677558. doi: 10.3389/fmicb.2021.677558 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Koblitz J, Halama P, Spring S, Thiel V, Baschien C, Hahnke RL, Pester M, Overmann J, Reimer LC. 2023. Media dive : the expert-curated cultivation media database . Nucleic Acids Res 51:D1531–D1538. doi: 10.1093/nar/gkac803 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Eid J, Fehr A, Gray J, Luong K, Lyle J, Otto G, Peluso P, Rank D, Baybayan P, Bettman B, et al. 2009. Real-time DNA sequencing from single polymerase molecules. Science 323:133–138. doi: 10.1126/science.1162986 [DOI] [PubMed] [Google Scholar]
- 6. Kolmogorov M, Yuan J, Lin Y, Pevzner PA. 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. doi: 10.1038/s41587-019-0072-8 [DOI] [PubMed] [Google Scholar]
- 7. Hunt M, Silva ND, Otto TD, Parkhill J, Keane JA, Harris SR. 2015. Circlator: automated circularization of genome assemblies using long sequencing reads. Genome Biol 16:294. doi: 10.1186/s13059-015-0849-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Chen I-MA, Chu K, Palaniappan K, Pillay M, Ratner A, Huang J, Huntemann M, Varghese N, White JR, Seshadri R, Smirnova T, Kirton E, Jungbluth SP, Woyke T, Eloe-Fadrosh EA, Ivanova NN, Kyrpides NC. 2019. IMG/M v.5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes. Nucleic Acids Res 47:D666–D677. doi: 10.1093/nar/gky901 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Varghese NJ, Mukherjee S, Ivanova N, Konstantinidis KT, Mavrommatis K, Kyrpides NC, Pati A. 2015. Microbial species delineation using whole genome sequences. Nucleic Acids Res 43:6761–6771. doi: 10.1093/nar/gkv657 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Han X, Li M, Ding Z, Zhao J, Ji K, Wen M, Lu T. 2012. Genome sequence of Streptomyces auratus strain AGR0001, a phoslactomycin-producing actinomycete. J Bacteriol 194:5472–5473. doi: 10.1128/JB.01155-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Lee N, Hwang S, Kim J, Cho S, Palsson B, Cho BK. 2020. Mini review: genome mining approaches for the identification of secondary metabolite biosynthetic gene clusters in Streptomyces. Comput Struct Biotechnol J 18:1548–1556. doi: 10.1016/j.csbj.2020.06.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Blin K, Shaw S, Augustijn HE, Reitz ZL, Biermann F, Alanjary M, Fetter A, Terlouw BR, Metcalf WW, Helfrich EJN, van Wezel GP, Medema MH, Weber T. 2023. antiSMASH 7.0: new and improved predictions for detection, regulation, chemical structures and visualisation. Nucleic Acids Res 51:W46–W50. doi: 10.1093/nar/gkad344 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Jørgensen TS, Mohite OS, Sterndorff EB, Alvarez-Arevalo M, Blin K, Booth TJ, Charusanti P, Faurdal D, Hansen TØ, Nuhamunada M, Mourched A-S, Palsson BØ, Weber T. 2024. A treasure trove of 1034 actinomycete genomes. Nucleic Acids Res 52:7487–7503. doi: 10.1093/nar/gkae523 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Zdouc MM, Blin K, Louwen NLL, Navarro J, Loureiro C, Bader CD, Bailey CB, Barra L, Booth TJ, Bozhüyük KAJ, et al. 2025. MIBiG 4.0: advancing biosynthetic gene cluster curation through global collaboration. Nucleic Acids Res 53:D678–D690. doi: 10.1093/nar/gkae1115 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The genome of Streptomyces auratus DSM 41897 is deposited in GenBank under accession number NZ_JBNGCB000000000, with the respective annotated DNA sequences available as JBNGCB010000001.1, JBNGCB010000002.1, and JBNGCB010000003.1. Raw data are available as SRA accession numberhttps://www.ncbi.nlm.nih.gov/sra/SRX7194182%5Baccn%5DSRX7194182.
