Abstract
We report the first high-quality draft genome sequence of an antibiotic (wuyiencin)-producing strain, Streptomyces ahygroscopicus subsp. wuyiensis CK-15, isolated from soil samples collected from Fujian Province, China. The 9.41-Mb genome comprises 8,311 protein-coding sequences, encodes 89 structural RNAs, and shows a G+C content of 72.25%.
GENOME ANNOUNCEMENT
Streptomyces ahygroscopicus subsp. wuyiensis CK-15, a Gram-positive filamentous bacterium isolated from a soil sample collected from the Wuyi Mountains in Fujian Province, China, is of interest, as it expresses the antibiotic wuyiencin, which has wide applications as an agricultural antibiotic to control various fungal diseases of vegetable and field crops (1). This strain has been preserved at the China General Microbiological Culture Collection Center (CGMCC no. 0730) and can be cultured at 28°C on mannitol-soybean (MS) agar or in a yeast extract-malt extract (YEME) liquid medium (2). The soluble fermentation medium comprises the following (per 100 mL): 4 g soybean flour, 2 g glucose, 3 g corn starch, 300 mg CaCO3, and 100 mg (NH4)2SO4.
Genome sequencing was performed on both the Illumina MiSeq platform (Illumina, Inc.), by generating a paired-end library with an insert size of about 500 bp, and on the Illumina HiSeq2000 platform (Illumina, Inc.), by generating a mate-pair library with an insert size of about 5 kb; sequencing was performed at Sangon Biotech Co., Ltd. (Shanghai, China). Paired-end reads were de novo assembled using Velvet version 1.2.07 (3), and GapCloser version 1.12 (4) was used to fill the gaps. Gene prediction was performed using the RAST annotation server (5). Gene functional annotation was based on a BLASTp comparison with the nonredundant (NR), Kyoto Encyclopedia of Genes and Genomes (KEGG) (6), Protein Families (PFAM) (7), and Clusters of Orthologous Groups (COG) databases (8).
The genome size of S. ahygroscopicus subsp. wuyiensis CK-15 was 9,410,232 bp, with a 72.25% G+C content, which is similar to the 72.2% G+C content of the Streptomyces griseus subsp. griseus NBRC 13350 (NC_010572.1). A total of 130 contigs were obtained. Analysis of the genome revealed that it contained 8,311 protein-coding genes (CDSs), which accounted for approximately 83.38% of the genome. Among the 8,311 open reading frames (ORFs), clear functions could be identified for about 7,914 (95.22%) ORFs, while no match could be found for the remaining 397 (4.78%) using the NR protein database. In addition, the genome encoded 89 structural RNAs, including 7 5S rRNAs, 6 16S rRNAs, 8 23S rRNAs, and 68 tRNAs.
The antibiotic wuyiencin synthesized by the CK-15 strain shows a high similarity with an antibiotic produced by Streptomyces nourser ATCC 11455 (nystatin). The DNA sequence analysis of the gene clusters related to antibiotic biosynthesis (including nysDI, nysDII, nysDIII, nysA, nysE, nysF, nysG, nysH, nysM, nysN, nysRI, ORF2, and ORF3) revealed at least an 80% sequence homology between the CK-15 and ATCC 11455 strains. Further analyses revealed that the function of certain genes was similar; for example, ORF2 in both strains enhanced antibiotic production (2). This draft genome sequence will assist in developing a deeper understanding of the regulatory mechanisms underlying antibiotic production, thereby allowing the construction of high-yield wuyiencin strains for large-scale industrial production.
Nucleotide sequence accession numbers.
The draft genome sequence of Streptomyces ahygroscopicus subsp. wuyiensis CK-15 has been deposited at DDBJ/EMBL/GenBank under the accession number JXYI00000000. The version described in this paper is the second version, JXYI02000000.
ACKNOWLEDGMENTS
This work was supported by grants from the National Natural Science Foundation (31401796, 3171985) and the Special Fund for Agro-Scientific Research in the Public Interest in China (201103002, 201303025).
Footnotes
Citation Ge B, Liu Y, Liu B, Zhang K. 2015. Draft genome sequence of Streptomyces ahygroscopicus subsp. wuyiensis CK-15, isolated from soil in Fujian Province, China. Genome Announc 3(5):e01125-15. doi:10.1128/genomeA.01125-15.
REFERENCES
- 1.Cui Z, Zhang K, She G, Lin Y, Sun L, Cheng Y, Tan B. 2010. Antimicrobial activity investigation on wuyiencin fractions of different polarity. Molecules 15:3179–3185. doi: 10.3390/molecules15053179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Liu Y, Ryu H, Ge B, Pan G, Sun L, Park K, Zhang K. 2014. Improvement of wuyiencin biosynthesis in Streptomyces wuyiensis CK-15 by identification of a key regulator, WysR. J Microbiol Biotechnol 24:1644–1653. doi: 10.4014/jmb.1405.05017. [DOI] [PubMed] [Google Scholar]
- 3.Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18:821–829. doi: 10.1101/gr.074492.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Rohde H, Qin J, Cui Y, Li D, Lornan NJ, Hentschke M, Chen W, Pu F, Peng Y, Li J, Xi F, Li S, Li Y, Zhang Z, Yang X, Zhao M, Wang P, Guan Y, Cen Z, Zhao X, Christner M, Kobbe R, Loos S, Oh J, Yang L, Danchin A, Gao GF, Song Y, Li Y, Yang H, Wang J, Xu J, Pallen MJ, Wang J, Aepfelbacher M, Yang R. E. coli O104:H4 Genome Analysis Crowd-Sourcing Consortium . 2011. Open-source genomic analysis of Shiga-toxin-producing E. coli O104:H4. N Engl J Med 365:718–724. [DOI] [PubMed] [Google Scholar]
- 5.Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. 2008. The RAST server: Rapid Annotations using Subsystems Technology. BMC Genomics 9:75. doi: 10.1186/1471-2164-9-75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ogata H, Goto S, Fujibuchi W, Kanehisa M. 1998. Computation with the KEGG pathway database. Biosystems 47:119–128. doi: 10.1016/S0303-2647(98)00017-3. [DOI] [PubMed] [Google Scholar]
- 7.Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J, Heger A, Holm L, Sonnhammer ELL, Eddy SR, Bateman A, Finn RD. 2012. The Pfam protein families database. Nucleic Acids Res 40:D290–D301. doi: 10.1093/nar/gkr1065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Tatusov RL, Galperin MY, Natale DA, Koonin EV. 2000. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res 28:33–36. doi: 10.1093/nar/28.1.33. [DOI] [PMC free article] [PubMed] [Google Scholar]
