Abstract
Streptomyces iranensis HM 35 has been shown to exhibit 72.7% DNA-DNA similarity to the important drug rapamycin (sirolimus)-producing Streptomyces rapamycinicus NRRL5491. Here, we report the genome sequence of HM 35, which represents a partially overlapping repertoire of secondary metabolite gene clusters with S. rapamycinicus, including the gene cluster for rapamycin biosynthesis.
GENOME ANNOUNCEMENT
Streptomyces iranensis HM 35 is a novel member of the Streptomyces violaceusniger-S. hygroscopicus group (1). Based on 16S rRNA gene sequencing, HM 35 is phylogenetically closely related to Streptomyces rapamycinicus NRRL 5491, followed by S. violaceusniger DSM 40563, Streptomyces yogyakartensis DSM 41766, and Streptomyces javensis DSM 41764 (1). Thus, this species is of special interest for the exploration of its genomic and chemical capacity, which may include interesting bioactive compounds, such as the antifungal and immunosuppressant drug rapamycin (2), but it also allows for the discovery of novel secondary metabolites and a study of their regulation (3, 4).
Genomic DNA from S. iranensis HM 35 (NCBI taxonomy ID, 576784; sample ID, DSM41954) was obtained from a sample cultured in tryptone soya broth (TSB) (5). DNA library preparation (paired-end 2 × 100 bp) and sequencing on Illumina HiSeq 2000 were performed at LGC Genomics (Berlin). The raw reads were adapter clipped (6), quality trimmed, and error corrected (7). The initial contigs were generated using Velvet (8). The contigs were shredded into overlapping 350-bp sequences, generating a coverage of 17×, and assembled using Newbler 2.6 (454 Life Sciences). Gaps in the resulting sequences were filled using the Beijing Genomics Institute (BGI) GapCloser software (9).
For ab initio gene prediction, GeneMark-ES (10) was applied. Functional annotation was performed using Blast2GO (11) and InterProScan (12). Gene descriptions were obtained by blasting the predicted protein sequences against those of Streptomyces bingchenggensis BCW1 (Genbank accession no. CP002047) and S. violaceusniger Tu4113 (Genbank accession no. CP002994). Matches with the lowest e-value below 10-5, 70% sequence identity, and a subject hit length of 70% were considered highly similar. Secondary metabolite gene clusters were predicted using antiSMASH (13).
DNA sequencing resulted in 17,692,354 raw reads, of which 16,014,549 reads passed the quality filter (estimated genome coverage, 122.7-fold) and were used for sequence assembly. The resulting assembly consists of 7 scaffolds and 12.1 Mbp (longest scaffold, 12.0 Mbp). The G+C content of the assembly is 70.9%. The final structural gene prediction resulted in 9,967 gene models. We assigned functional names to 7,398 transcripts, GO categories to 4,443 transcripts, and protein domains to 8,359 translated transcripts. A total of 2,120 proteins were predicted to contain transmembrane domains.
AntiSMASH predicted 171 enzymatic genes that may be involved in the synthesis of secondary metabolites. Based on these functional annotations, as well as on a polyketide synthase (PKS) and nonribosomal protein synthesis (NRPS) domain search, the existences of 11 PKS, 5 NRPS, and 3 NRPS-PKS (in total 19) belonging to different secondary metabolite gene clusters were predicted. Noteworthy is the high similarity of the genes SIRAN8002 to SIRAN8065 with the previously described rapamycin gene cluster (14). Indeed, the formation of rapamycin was verified by liquid chromatography mass spectrometry (LC-MS) analysis (data not shown).
Nucleotide sequence accession numbers.
This whole-genome shotgun project has been deposited in DDBJ/ENA/GenBank under the accession no. LK022848 to LK022854. The version described in this paper is the first version. Genome data and additional information are also available at the HKI Genome Resource (http://www.genome-resource.de/).
ACKNOWLEDGMENTS
This work was supported by the DFG-funded excellence graduate school Jena School for Microbial Communication (JSMC) and the DFG-funded CRC/Transregio 124 “Pathogenic fungi and their human host: Networks of interaction,” subproject INF.
Footnotes
Citation Horn F, Schroeckh V, Netzker T, Guthke R, Brakhage AA, Linde J. 2014. Draft genome sequence of Streptomyces iranensis. Genome Announc. 2(4):e00616-14. doi:10.1128/genomeA.00616-14.
REFERENCES
- 1. Hamedi J, Mohammadipanah F, Klenk HP, Pötter G, Schumann P, Spröer C, von Jan M, Kroppenstedt RM. 2010. Streptomyces iranensis sp. nov., isolated from soil. Int. J. Syst. Evol. Microbiol. 60:1504–1509. 10.1099/ijs.0.015339-0 [DOI] [PubMed] [Google Scholar]
- 2. Vézina C, Kudelski A, Sehgal SN. 1975. Rapamycin (AY-22,989), a new antifungal antibiotic. I. Taxonomy of the producing streptomycete and isolation of the active principle. J. Antibiot. (Tokyo) 28:721–726. 10.7164/antibiotics.28.721 [DOI] [PubMed] [Google Scholar]
- 3. Liu G, Chater KF, Chandra G, Niu G, Tan H. 2013. Molecular regulation of antibiotic biosynthesis in Streptomyces. Microbiol. Mol. Biol. Rev. 77:112–143. 10.1128/MMBR.00054-12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Park SR, Yoo YJ, Ban YH, Yoon YJ. 2010. Biosynthesis of rapamycin and its regulation: past achievements and recent progress. J. Antibiot. (Tokyo) 63:434–441. 10.1038/ja.2010.71 [DOI] [PubMed] [Google Scholar]
- 5. Kieser T, Bibb MJ, Buttner MJ, Chater KF, Hopwood DA. 2000. Practical Streptomyces genetics. John Innnes Foundation, Norwich, United Kingdom [Google Scholar]
- 6. Pearson WR, Wood T, Zhang Z, Miller W. 1997. Comparison of DNA sequences with protein sequences. Genomics 46:24–36. 10.1006/geno.1997.4995 [DOI] [PubMed] [Google Scholar]
- 7. Kelley DR, Schatz MC, Salzberg SL. 2010. Quake: quality-aware detection and correction of sequencing errors. Genome Biol. 11:R116. 10.1186/gb-2010-11-11-r116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18:821–829. 10.1101/gr.074492.107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, Tang J, Wu G, Zhang H, Shi Y, Liu Y, Yu C, Wang B, Lu Y, Han C, Cheung DW, Yiu SM, Peng S, Xiaoqian Z, Liu G, Liao X, Li Y, Yang H, Wang J, Lam TW, Wang J. 2012. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. GigaScience. 1:18. 10.1186/2047-217X-1-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Borodovsky M, Lomsadze A. 2011. Eukaryotic gene prediction using GeneMark.hmm-E and GeneMark-ES. Curr. Protoc. Bioinformatics Chapter 4:Unit 4.6.1-10. 10.1002/0471250953.bi0406s35 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Conesa A, Götz S, Garcia-Gomez JM, Terol J, Talon M, Robles M. 2005. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21:3674–3676. 10.1093/bioinformatics/bti610 [DOI] [PubMed] [Google Scholar]
- 12. Quevillon E, Silventoinen V, Pillai S, Harte N, Mulder N, Apweiler R, Lopez R. 2005. InterProScan: protein domains identifier. Nucleic Acids Res. 33:W116–W120. 10.1093/nar/gni118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Blin K, Medema MH, Kazempour D, Fischbach MA, Breitling R, Takano E, Weber T. 2013. antiSMASH 2.0—a versatile platform for genome mining of secondary metabolite producers. Nucleic Acids Res. 41:W204–W212. 10.1093/nar/gkt449 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Schwecke T, Aparicio JF, Molnár I, König A, Khaw LE, Haydock SF, Oliynyk M, Caffrey P, Cortés J, Lester JB, et al. 1995. The biosynthetic gene cluster for the polyketide immunosuppressant rapamycin. Proc. Natl. Acad. Sci. U. S. A. 92:7839–7843. 10.1073/pnas.92.17.7839 [DOI] [PMC free article] [PubMed] [Google Scholar]