ABSTRACT
We report here the draft genome sequence of Lysinibacillus sphaericus strain A1, a potential lignin-degrading bacterium isolated from municipal solid waste (MSW) soil and capable of enhancing gas release from lignocellulose-containing soil.
GENOME ANNOUNCEMENT
Lysinibacillus sphaericus strain A1 is a Gram-positive bacterium from the Firmicutes phylum that was isolated from municipal solid waste (MSW) soil in the United Kingdom (1). This lignin-degrading bacterium strain is particularly interesting since, under microscale anaerobic conditions, it presents enhanced methane release from lignocellulose-containing soil, suggesting the potential for in situ treatment and enhancement of landfill soil gas production (1).
We report here the genome sequence of L. sphaericus strain A1. We sequenced the genome of this strain on an Illumina HiSeq 2500 system at the CTBE next-generation sequencing (NGS) facility, generating 5,262,224 paired-end reads (insert size, 300 bp) and 4,454,143 mate pair reads (3 libraries with an insert size between 3 and 4 kb, 5 and 7 kb, and 8 and 11 kb). Paired-end reads were preprocessed with Trimmomatic (2) to remove the adapter and low-quality sequences, and mate pair reads were processed using NextClip (3), resulting in 5,178,788 and 1,227,816 high-quality reads, respectively.
The genome size of L. sphaericus was estimated to be 4.54 Mb based on k-mer count statistics accessed with KmerGenie (4), with an estimated coverage of approximately 200×. Genome assembly was carried out with SPAdes v3.6.2 (5) using an assembly of k-mer values (21, 33, 47, 55, and 77) and SSPACE (6). The presence of typical bacterial marker genes in the assembled genome was assessed using CheckM (7), which estimated the completeness of the genome to be 99.91%.
The resulting assembly for L. sphaericus has nine scaffolds with a total length of 4,517,188 bp and an N50 value of 3,305,581 bp. The average GC content of the genome is 37%.
Gene prediction and annotation were carried out using the Prokka prokaryotic genome annotation pipeline (8). A total of 4,371 genes were identified in the L. sphaericus genome; of these, there are 4,305 protein-encoding genes, 9 rRNAs, 56 tRNAs, and 1 transfer-messenger RNA (tmRNA). Regarding 16S rRNA, L. sphaericus strain A1 presents 99.92% sequence similarity with L. sphaericus KCTC 3346T, 99.76% with L. fusiformis NBRC 15717T, and 99.60% with L. mangiferihumi M-GX18T, as analyzed using the EzBioCloud Web server identity tool (9).
Accession number(s).
This whole-genome shotgun project has been deposited at DDBJ/ENA/GenBank under the accession number PGLV00000000. The version described in this paper is version PGLV01000000.
ACKNOWLEDGMENTS
We thank the Brazilian Bioethanol Science and Technology Laboratory CTBE NGS Sequencing Facility for generating the DNA sequencing described here.
T.D.H.B. is supported by BBSRC-FAPESP grant BB/P01738X/1. F.M.S. is supported by the FAPESP grant 2015/50590-4.
Footnotes
Citation Persinoti GF, Paixão DAA, Bugg TDH, Squina FM. 2018. Genome sequence of Lysinibacillus sphaericus, a lignin-degrading bacterium isolated from municipal solid waste soil. Genome Announc 6:e00353-18. https://doi.org/10.1128/genomeA.00353-18.
REFERENCES
- 1.Rashid GMM, Durán-Peña MJ, Rahmanpour R, Sapsford D, Bugg TDH. 2017. Delignification and enhanced gas release from soil containing lignocellulose by treatment with bacterial lignin degraders. J Appl Microbiol 123:159–171. doi: 10.1111/jam.13470. [DOI] [PubMed] [Google Scholar]
- 2.Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Leggett RM, Clavijo BJ, Clissold L, Clark MD, Caccamo M. 2014. NextClip: an analysis and read preparation tool for Nextera Long Mate Pair libraries. Bioinformatics 30:566–568. doi: 10.1093/bioinformatics/btt702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Chikhi R, Medvedev P. 2014. Informed and automated k-mer size selection for genome assembly. Bioinformatics 30:31. doi: 10.1093/bioinformatics/btt310. [DOI] [PubMed] [Google Scholar]
- 5.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W. 2011. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 27:578–579. doi: 10.1093/bioinformatics/btq683. [DOI] [PubMed] [Google Scholar]
- 7.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
- 9.Yoon S-H, Ha S-M, Kwon S, Lim J, Kim Y, Seo H, Chun J. 2017. Introducing EzBioCloud: a taxonomically united database of 16S rRNA gene sequences and whole-genome assemblies. Int J Syst Evol Microbiol 67:1613–1617. doi: 10.1099/ijsem.0.001755. [DOI] [PMC free article] [PubMed] [Google Scholar]
