Abstract
Streptomyces sp. Tü6071 is a soil-dwelling bacterium which has a highly active isoprenoid biosynthesis. Isoprenoids are important precursors for biopharmaceutical molecules such as antibiotics or anticancer agents, e.g., landomycin. Streptomyces sp. Tü6071 produces the industrially important terpene glycosides phenalinolactones, which have antibacterial activity against several Gram-positive bacteria. The availability of the genome sequence of Streptomyces sp. Tü6071 allows for understanding the biosynthesis of these pharmaceutical molecules and will facilitate rational genome modification to improve industrial use.
GENOME ANNOUNCEMENT
Streptomyces sp. Tü6071 was isolated from a soil sample collected in Cape Coast, Ghana. Today, this strain is the most important bacterium in producing natural drugs, such as phenalinolactones and landomycin (3, 4). Considering the fertility of the secondary metabolites produced by Tü6071, its genome sequence will provide fundamental understanding of the biosynthesis of these compounds and facilitate the bioengineering efforts of this species.
The whole genome shotgun sequence was determined by Roche/454 pyrosequencing (9) and Sanger paired-end sequencing (5) on cosmids. A cosmid library of Tü6071 was constructed in pWeb (Epicentre, Madison WI), with an average insert size of 35 kb. A 3-fold physical coverage of cosmids was chosen to order most of the Roche/454 contigs. Using standard primers and BigDye version 3.1 on the ABI3730xl genome analyzer, 672 clones were end sequenced from both sides. A total number of 1,298 reads with an average read length of 793 bases (Phred20/30) were generated. 454 contigs and paired-end cosmid reads were coassembled using GAP4 (Staden Package) to determine cosmids spanning gaps in the Newbler assembly. Gap closure was performed by sequencing on combinatorial PCR products generated from genomic DNA.
Putative protein-coding sequences were predicted using Glimmer3 (2) trained on annotated open reading frames of Streptomyces coelicolor A3(2) (1) and Streptomyces avermitilis (6). The annotation was accomplished by BlastP analyses of sequences in UniProt databases and by manual curation of the outputs of a variety of similarity searches. Nontranslating genes were predicted using tRNAscan-SE (8) and ARAGORN (7).
The genome of Streptomyces sp. Tü6071 consists of one linear chromosome composed of 7,359,294 bp (contig 1, contig 2) with 73.1% G+C content, as well as one linear plasmid (147,347 bp, 70.9% G+C content). It is shorter than the other streptomycete linear chromosomes of S. coelicolor A3(2) (8.7 Mbp), S. avermitilis MA-4680 (9.0 Mbp), and S. griseus IFO13350 (8.5 Mbp) (10).
Analysis of the Streptomyces Tü6071 genome revealed that its chromosome contains 6,466 protein-coding genes with at least 4,887 proteins with assigned putative functions based on 70% sequence identity and an amino acid alignment length of at least 100 residues. On the plasmid, from 176 predicted coding sequences, 73 have been functionally annotated. Furthermore, at least six rRNA operons and 74 tRNAs on the linear chromosome were predicted.
Isoprenoid-derived antibiotics, such as phenalinolactone A, produced by Tü6071 are synthesized by the condensation of isopentenyl diphosphate (IPP) and dimethylallyl diphosphate. The IPP units required for isoprene biosynthesis are generated by two distinct pathways. In eukaryotes, the mevalonate pathway is very common, while in prokaryotes, the 2-C-methyl-d-erylthritol 4-phosphate (MEP) pathway is mainly used. The phenalinolactone biosynthetic gene cluster (3) is located on the linear chromosome at megabase positions 1.69 to 1.73. In Tü6071, members of the MEP pathway are located upstream of the phenalinolactone biosynthetic gene cluster.
Nucleotide sequence accession number.
This whole-genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession number AFHJ00000000. The version described in this paper is the first version, AFHJ01000000.
Acknowledgments
We thank Berthold Fartmann (LGC Genomics Berlin, Germany) for technical assistance in genome sequencing and H.-P. Fiedler (University of Tübingen, Germany) for providing us this strain.
This research was supported by the Excellence-Initiative (Deutsche Forschungsgemeinschaft) in the excellence cluster System Analysis of Biogenic Drugs by Pharmaceutical Bioinformatics.
Footnotes
Published ahead of print on 24 June 2011.
REFERENCES
- 1. Bentley S. D., et al. 2002. Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2). Nature 417:141–147 [DOI] [PubMed] [Google Scholar]
- 2. Delcher A. L., Bratke K. A., Powers E. C., Salzberg S. L. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Dürr C., et al. 2006. Biosynthesis of the terpene phenalinolactone in Streptomyces sp. Tü6071: analysis of the gene cluster and generation of derivatives. Chem. Biol. 13:365–377 [DOI] [PubMed] [Google Scholar]
- 4. Erb A., Krauth C., Luzhetskyy A., Bechthold A. 2009. Differences in the substrate specificity of glycosyltransferases involved in landomycins A and E biosynthesis. Appl. Microbiol. Biotechnol. 83:1067–1076 [DOI] [PubMed] [Google Scholar]
- 5. Fleischmann R. D., et al. 1995. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science 269:496–512 [DOI] [PubMed] [Google Scholar]
- 6. Ikeda H., et al. 2003. Complete genome sequence and comparative analysis of the industrial microorganism Streptomyces avermitilis. Nat. Biotechnol. 21:526–531 [DOI] [PubMed] [Google Scholar]
- 7. Laslett D., Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res. 32:11–16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Lowe T. M., Eddy S. R. 1997. tRNAscan-SE: a program for improved detection of tRNA genes in genomic sequence. Nucleic Acids Res. 25:955–964 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Margulies M., et al. 2005. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437:376–380 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Ohnishi Y., et al. 2008. Genome sequence of the streptomycin-producing microorganism Streptomyces griseus IFO 13350. J. Bacteriol. 190:4050–4060 [DOI] [PMC free article] [PubMed] [Google Scholar]
