Genome Announc. 2014 Jan 23;2(1):e01123-13. doi: 10.1128/genomeA.01123-13

Genome Sequence of Bacillus sp. Strain FJAT-14515

Guohong Liu a, Bo Liu a, Weiqi Tang b, Jianmei Che a, Yingzhi Lin a, Yujing Zhu a, Mingxing Su a, Jianyang Tang a
PMCID: PMC3900888  PMID: 24459256

Abstract

We report the draft genome sequence of Bacillus sp. strain FJAT-14515. The genome is 5.44 Mb in length. It contains 5,263 genes with an average length of 791 bp, has a G+C content of 37.06%, and contains 67 tRNAs, 31 small RNAs, and 5 rRNA loci.

GENOME ANNOUNCEMENT

Bacillus sp. strain FJAT-14515 (16S rRNA GenBank accession number JX262264), a mesophilic, endospore-forming bacterium, was isolated from a soil sample collected in Taiwan. On nutrient agar (NA), it grows optimally with 0% NaCl (range, 0 to 5%) at 30°C (range, 10 to 40°C) and pH 7.0 (range, pH 6 to 9). According to the EzTaxon-e database (http://eztaxon-e.ezbiocloud.net/) (1), the 16S rRNA gene sequence similarities between FJAT-14515 and its closest relatives, Bacillus muralis DSM 16288T and Bacillus simplex DSM 30646T, were <98%. Devereux et al. (2) and Fry et al. (3) proposed that a 16S rRNA sequence similarity of <98% should be considered evidence for separate species. Therefore, the genome of Bacillus sp. FJAT-14515 was sequenced to help determine whether the strain represents a novel species of the genus Bacillus.
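The <98% similarity criterion can be illustrated with a minimal Python sketch. It assumes two 16S rRNA gene sequences that have already been aligned to equal length, and it skips any column containing a gap; this is a simplification and not necessarily how EzTaxon-e computes similarity. The function names and the toy sequences are hypothetical.

```python
SPECIES_THRESHOLD = 98.0  # percent similarity cutoff cited in the text (refs. 2 and 3)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the ungapped columns of two pre-aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for base_a, base_b in zip(aligned_a.upper(), aligned_b.upper()):
        if base_a == "-" or base_b == "-":
            continue  # assumption: columns with a gap are not scored
        compared += 1
        if base_a == base_b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def below_species_threshold(aligned_a: str, aligned_b: str) -> bool:
    """True when the similarity falls under the 98% cutoff."""
    return percent_identity(aligned_a, aligned_b) < SPECIES_THRESHOLD


# Toy aligned fragments (hypothetical, far shorter than a real 16S gene).
toy_a = "ACGT-ACGTA"
toy_b = "ACGTTACGTT"
print(round(percent_identity(toy_a, toy_b), 2), below_species_threshold(toy_a, toy_b))
```

In practice the comparison would use near-full-length 16S rRNA gene sequences, and the alignment step itself strongly affects the resulting value.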

The genome sequence was determined using Illumina Solexa technology at the Beijing Genomics Institute (BGI) (Shenzhen, China). Assembly was performed using SOAPdenovo v2.04 (4).

The genome assembly of Bacillus sp. FJAT-14515 (G+C content, 37.06%) has approximately 98-fold coverage. It comprises 28 scaffolds totaling 5,443,019 bp (largest, 2,476,485 bp; smallest, 598 bp). The scaffolds consist of 43 contigs totaling 5,436,942 bp (largest, 793,370 bp; smallest, 204 bp). The scaffold N50 is 793,370 bp, and the contig N50 is 435,981 bp. All assembly data were deposited in the DDBJ/EMBL/GenBank nucleotide sequence database.
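For readers unfamiliar with the N50 statistic, the following Python sketch shows the usual definition: the length of the sequence at which the running total, taken from longest to shortest, first reaches half of the assembly size. The individual scaffold and contig lengths of this assembly are not listed here, so the input is a hypothetical toy list, and the function name is an illustrative choice rather than part of any assembler.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in bp)."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            # This sequence is the one that pushes the sum past 50% of the assembly.
            return length


# Hypothetical toy lengths, not the FJAT-14515 scaffolds.
toy_lengths = [500, 400, 300, 200, 100]
print(n50(toy_lengths))  # 400: 500 + 400 = 900 is the first sum >= 750
```

Applied to the 28 scaffold lengths or the 43 contig lengths of a real assembly, the same function would give the scaffold N50 and contig N50, respectively.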

Coding sequences (CDS) were predicted using Glimmer 3.02 (5) and further annotated by BLASTP searches against the UniProt, NCBI nr, COG, and KEGG databases. tRNAs, rRNAs, and small RNAs (sRNAs) were identified using tRNAscan-SE (6), RNAmmer (7), and Rfam (8), respectively.

The genome contains 5,263 CDS with an average length of 791 bp, which together represent 76.5% of the genome. Annotation showed that only 837 genes (15.9%) did not match any known protein in current public protein databases. Of the 5,263 genes, 2,735 and 2,812 encode proteins assigned COG functional categories and KEGG annotations, respectively. Additionally, 67 tRNAs, 31 sRNAs, 5 rRNAs, and 7 clustered regularly interspaced short palindromic repeats (CRISPRs) were identified in the genome.
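The percentages above can be checked with simple arithmetic. The Python sketch below is only an approximation: it takes the 791 bp average gene length given in the abstract and the total scaffold length of 5,443,019 bp, and it ignores any overlap between genes. The variable and function names are illustrative.

```python
GENOME_BP = 5_443_019     # total scaffold length reported for the assembly
CDS_COUNT = 5_263         # predicted coding sequences
MEAN_CDS_BP = 791         # average gene length given in the abstract
UNMATCHED_GENES = 837     # genes without a match in public protein databases


def percent(part: float, whole: float) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


coding_percent = percent(CDS_COUNT * MEAN_CDS_BP, GENOME_BP)
unmatched_percent = percent(UNMATCHED_GENES, CDS_COUNT)

print(coding_percent)     # 76.5
print(unmatched_percent)  # 15.9
```

Both values agree with the reported 76.5% coding fraction and 15.9% unmatched genes.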

Nucleotide sequence accession number.

This whole-genome shotgun project for Bacillus sp. strain FJAT-14515 has been deposited at DDBJ/EMBL/GenBank under the accession number AYSD00000000.

ACKNOWLEDGMENTS

This work was supported by the Agricultural Bio-resource Institute, Fujian Academy of Agricultural Sciences, People’s Republic of China. The work was financed by the 948 Project (2011-G25) from the Chinese Ministry of Agriculture as well as by the 973 Program Early Research Project (2011CB111607), the Project of Agriculture Science and Technology Achievement Transformation (2010GB2C400220), the International Cooperation Project (2012DFA31120) from the Chinese Ministry of Science and Technology, and the Natural Science Foundation of China (NSFC) (31370059).

Footnotes

Citation Liu G, Liu B, Tang W, Che J, Lin Y, Zhu Y, Su M, Tang J. 2014. Genome sequence of Bacillus sp. strain FJAT-14515. Genome Announc. 2(1):e01123-13. doi:10.1128/genomeA.01123-13.

REFERENCES

  • 1. Kim OS, Cho YJ, Lee K, Yoon SH, Kim M, Na H, Park SC, Jeon YS, Lee JH, Yi H, Won S, Chun J. 2012. Introducing EzTaxon-e: a prokaryotic 16S rRNA gene sequence database with phylotypes that represent uncultured species. Int. J. Syst. Evol. Microbiol. 62:716–721. doi:10.1099/ijs.0.038075-0
  • 2. Devereux R, He SH, Doyle CL, Orkland S, Stahl DA, LeGall J, Whitman WB. 1990. Diversity and origin of Desulfovibrio species: phylogenetic definition of a family. J. Bacteriol. 172:3609–3619
  • 3. Fry NK, Warwick S, Saunders NA, Embley TM. 1991. The use of 16S ribosomal RNA analyses to investigate the phylogeny of the family Legionellaceae. J. Gen. Microbiol. 137:1215–1222. doi:10.1099/00221287-137-5-1215
  • 4. Li R, Zhu H, Ruan J, Qian W, Fang X, Shi Z, Li Y, Li S, Shan G, Kristiansen K, Li S, Yang H, Wang J, Wang J. 2010. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 20:265–272. doi:10.1101/gr.097261.109
  • 5. Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Res. 27:4636–4641. doi:10.1093/nar/27.23.4636
  • 6. Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25:955–964. doi:10.1093/nar/25.5.0955
  • 7. Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T, Ussery DW. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35:3100–3108. doi:10.1093/nar/gkm160
  • 8. Gardner PP, Daub J, Tate JG, Nawrocki EP, Kolbe DL, Lindgreen S, Wilkinson AC, Finn RD, Griffiths-Jones S, Eddy SR, Bateman A. 2009. Rfam: updates to the RNA families database. Nucleic Acids Res. 37(Suppl 1):D136–D140. doi:10.1093/nar/gkn766

Articles from Genome Announcements are provided here courtesy of American Society for Microbiology (ASM)