Skip to main content
Genome Announcements logoLink to Genome Announcements
. 2016 Dec 15;4(6):e01423-16. doi: 10.1128/genomeA.01423-16

Draft Genome Sequences of Type Strains Bacillus drentensis DSM 15600T and Bacillus novalis DSM 15603T

Bo Liu 1,, Guo-Hong Liu 1, Yu-jing Zhu 1, Jie-Ping Wang 1, Jian-Mei Che 1, Qian-Qian Chen 1, Zheng Chen 1
PMCID: PMC5159591  PMID: 27979958

Abstract

Here, we report the draft genome sequences of Bacillus drentensis DSM 15600T and Bacillus novalis DSM 15603T with 5,305,306 bp and 5,667,584 bp, respectively, which will provide useful information for the functional gene mining and application of these two species. The average DNA G+C contents were 38.91% and 40.01%, respectively.

GENOME ANNOUNCEMENT

Type strains Bacillus drentensis DSM 15600T and Bacillus novalis DSM 15603T are Gram-positive, spore-forming, and aerobic bacteria, isolated from the soil of several disused hay fields in the Drentse A agricultural research area (the Netherlands). As a result of the recent decrease in the cost of genomic sequencing, it has been proposed that whole-genome sequencing information be combined with the main phenotypic characteristics as a polyphasic approach strategy (taxono-genomics) to describe new bacterial taxa (14). In this study, a high-quality genome sequence of B. drentensis DSM 15600T was sequenced, which may promote research in the genomic taxonomy of the Bacillus-like bacteria.

The genomes of B. drentensis DSM 15600T and B. novalis DSM 15603T were sequenced with massively parallel sequencing (MPS) Illumina technology. Two DNA libraries were constructed: a paired-end library with an insert size of 500 bp and a mate-pair library with an insert size of 5 kb. The 500-bp library and the 5-kb library were sequenced using an Illumina HiSeq 2500 by PE125 strategy. Library construction and sequencing were performed at the Beijing Novogene Bioinformatics Technology Co., Ltd. Quality control of both paired-end and mate-pair reads were performed using an in-house program. After this step, Illumina PCR adapter reads and low-quality reads were filtered. The filtered reads were assembled by SOAPdenovo (5, 6) to generate scaffolds. All reads were used for further gap closure. Through the data assembly, 5,305,306 bp within three scaffolds and 5,668,192 bp within two scaffolds were obtained, and the scaffold N50 values were 5,303,701 bp and 5,667,584 bp, respectively, for B. drentensis DSM 15600T and B. novalis DSM 15603T. The longest and shortest scaffolds of these two species were 5,303,701 bp and 689 bp and 5,667,584 bp and 608 bp, respectively.

For the genome assemblies of these two species, gene prediction was performed with GeneMarkS (7). Transfer RNA (tRNA) genes were predicted with tRNAscan-SE (8), rRNA genes were predicted with rRNAmmer (9) and short RNAs (sRNAs) were predicted by BLAST against the Rfam (10) database. PHAST (11) was used for prophage prediction and CRISPRFinder (12) was used for clustered regularly interspaced short palindromic repeat (CRISPR) identification. Totals of 5,516 and 5,986 genes of B. drentensis DSM 15600T and B. novalis DSM 15603T were predicted, including 5,337 and 5,827 coding sequences (CDS), respectively, and four sRNAs, 125 tRNAs, 50 rRNAs (17 5S, 16 16S, and 17 23S) and five sRNAs, 118 tRNAs, 36 rRNA (13 5S, 11 16S, and 12 23S), respectively. The average DNA G+C contents were 38.91% and 40.01%, respectively.

Accession number(s).

These whole-genome shotgun projects for B. drentensis DSM 15600T and B. novalis DSM 15603T have been deposited at DDBJ/EMBL/GenBank under the accession numbers LUUU00000000 and LUUR00000000, respectively. The versions described in this paper are versions LUUU00000000.1 and LUUR00000000.1.

ACKNOWLEDGMENTS

This work was financially supported by the National Natural Science Foundation of China (grant 31370059), the Scientific Research Foundation for Returned Scholars, Fujian Academy of Agricultural Sciences (grant YJRC2014-1), the Fujian key science and technology special projects–key agricultural science and technology special project (grant 2015NZ0003-1), and the Seed industry innovation project of Fujian Province—“Fujian Resource Preservation Center of the Bacillus-like Bacteria” in the Seed industry innovation and industrialization project of Fujian Province (FJZZZY-1544).

Footnotes

Citation Liu B, Liu G-H, Zhu Y-J, Wang J-P, Che J-M, Chen Q-Q, Chen Z. 2016. Draft genome sequences of type strains Bacillus drentensis DSM 15600T and Bacillus novalis DSM 15603T. Genome Announc 4(6):e01423-16. doi:10.1128/genomeA.01423-16.

REFERENCES

  • 1.Ramasamy D, Mishra AK, Lagier JC, Padhmanabhan R, Rossi M, Sentausa E, Raoult D, Fournier PE. 2014. A polyphasic strategy incorporating genomic data for the taxonomic description of novel bacterial species. Int J Syst Evol Microbiol 64:384–391. doi: 10.1099/ijs.0.057091-0. [DOI] [PubMed] [Google Scholar]
  • 2.Keita MB, Diene SM, Robert C, Raoult D, Fournier PE, Bittar F. 2013. Noncontiguous finished genome sequence and description of Bacillus massiliogorillae sp. nov. Stand Genomic Sci 9:93–105. doi: 10.4056/sigs.4388124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Mishra AK, Pfleiderer A, Lagier JC, Robert C, Raoult D, Fournier PE. 2013. Noncontiguous finished genome sequence and description of Bacillus massilioanorexius sp. nov. Stand Genomic Sci 8:465–479. doi: 10.4056/sigs.4087826. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Mishra AK, Lagier JC, Rivet R, Raoult D, Fournier PE. 2012. Noncontiguous finished genome sequence and description of Paenibacillus senegalensis sp. nov. Stand Genomic Sci 7:70–81. doi: 10.4056/sigs.3056450. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Li R, Zhu H, Ruan J, Qian W, Fang X, Shi Z, Li Y, Li S, Shan G, Kristiansen K, Li S, Yang H, Wang J, Wang J. 2010. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res 20:265–272. doi: 10.1101/gr.097261.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Li R, Li Y, Kristiansen K, Wang J. 2008. SOAP: short oligonucleotide alignment program. Bioinformatics 24:713–714. doi: 10.1093/bioinformatics/btn025. [DOI] [PubMed] [Google Scholar]
  • 7.Besemer J, Lomsadze A, Borodovsky M. 2001. GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions. Nucleic Acids Res 29:2607–2618. doi: 10.1093/nar/29.12.2607 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Lowe TM, Eddy SR. 1997. TRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi: 10.1093/nar/25.5.0955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Lagesen K, Hallin P, Rødland EA, Staerfeldt H-H, Rognes T, Ussery DW. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 35:3100–3108. doi: 10.1093/nar/gkm160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gardner PP, Daub J, Tate JG, Nawrocki EP, Kolbe DL, Lindgreen S, Wilkinson AC, Finn RD, Griffiths-Jones S, Eddy SR, Bateman A. 2009. Rfam: updates to the RNA families database. Nucleic Acids Res 37(Suppl 1):D136–D140. doi: 10.1093/nar/gkn766. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Zhou Y, Liang Y, Lynch KH, Dennis JJ, Wishart DS. 2011. PHAST: a fast phage search tool. Nucleic Acids Res 39:W347–W352. doi: 10.1093/nar/gkr485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Grissa I, Vergnaud G, Pourcel C. 2007. CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res 35:W52–W57. doi: 10.1093/nar/gkm360. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Genome Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES