Abstract
Clostridium neonatale is a Gram-positive endospore-forming obligate anaerobe first isolated from the feces of premature neonates during an intensive care unit outbreak of necrotizing enterocolitis (NEC) in a Canadian neonatal intensive care unit. Here, we announce the genome draft sequence of this bacterium. It is composed of 4,710,818 bp and contains 4,169 protein-coding genes and 80 RNA genes, including 11 rRNA genes.
GENOME ANNOUNCEMENT
Clostridium neonatale was first isolated from the blood and feces from neonates born between 30 weeks and term gestation in a context of a necrotizing enterocolitis epidemic affecting 8 neonates (1). C. neonatale was isolated from stool samples and rectal samples, which were inoculated onto prereduced blood agar supplemented with vitamin K and hemin plates and incubated at 37°C in an anaerobic chamber (1). The 16S rRNA sequence of this bacterium showed 98% identity using BLASTN (2) with the sequence of Clostridium saccharobutylicum strain NCP 262. The genus Clostridium has been associated with several diseases in adults and children, and the species Clostridium butyricum was specifically associated recently with cases of necrotizing enterocolitis (3). The presence of four toxins detected in the genomes of these C. butyricum strains was hypothesized to be responsible for the disease (3).
The genome of C. neonatale (ATCC BAA-265) was sequenced using MiSeq technology (Illumina, San Diego, CA, USA), with a mate-pair 250-bp strategy. Velvet version 1.1.06 (4) was used to perform a de novo assembly of the reads. The total estimated size of the C. neonatale genome is 4.71 Mb (coverage, 41×), composed of 20 contigs regrouped into 17 scaffolds, with an average G+C content of 28.42%. Using BLASTN and RNAmmer (5), the genome was shown to contain 80 RNA genes, including eight 5S rRNA genes, three 16S rRNA genes, and 69 tRNA genes.
The draft genome sequence of C. neonatale is smaller than those of Clostridium beijerinckii and Clostridium diolis (4.71, 6.00, and 5.85 Mb, respectively) but larger than those of Clostridium paraputrificum, Clostridium sartagoforme, and Clostridium perfringens (3.56, 3.98, and 3.26 MB, respectively). The G+C content of C. neonatale is smaller than those of C. paraputrificum and C. beijerinckii (29.64 and 29.86%, respectively) but larger than those of C. sartagoforme and C. perfringens (27.91 and 28.38%, respectively).
Potential coding sequences (CDSs) were predicted using the Prodigal software (6). Assignment of protein functions was performed by searching against the GenBank, Clusters of Orthologous Groups (COG), and Pfam (7) databases using BLASTP. A total of 4,249 genes were identified, representing a coding capacity of 3,907,356 bp (82.94% of the sequenced genome). Among these genes, 2,638 (63.27%) genes were associated with COG categories (8). The majority of the gene products are involved in the metabolism of amino acids and carbohydrates, ribosomal structure and biogenesis, and transcription and translation. We identified 1,780 proteins associated with a mobilome found in the phage database (PhAnToMe) and 119 toxin/antitoxin protein predictions. Interestingly, the four toxins described in C. butyricum and that are suspected to be the cause of necrotizing enterocolitis (3) were also identified in C. neonatale (identity percent ranging from 77% to 92%).
Nucleotide sequence accession numbers.
The C. neonatale genome was deposited in EMBL under the accession numbers CZLD01000001 to CZLD01000020.
ACKNOWLEDGMENT
This work was supported by the Programme Hospitalier de Recherche Clinique 2009.
Footnotes
Citation Benamar S, Cassir N, La Scola B. 2016. Genome sequence of a Clostridium neonatale strain isolated in a Canadian neonatal intensive care unit. Genome Announc 4(1):e01431-15. doi:10.1128/genomeA.01431-15.
REFERENCES
- 1.Alfa M, Robson D, Davi M, Bernard K, Van Caeseele P, Harding G. 2002. An outbreak of necrotizing enterocolitis associated with a novel Clostridium species in a neonatal intensive care unit. Clin Infect Dis 35:S101–S105. doi: 10.1086/341929. [DOI] [PubMed] [Google Scholar]
- 2.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 3.Cassir N, Benamar S, Khalil JB, Croce O, Saint-Faust M, Jacquot A, Million M, Azza S, Armstrong N, Henry M, Jardot P, Robert C, Gire C, Lagier J, Chabrière E, Ghigo E, Marchandin H, Sartor C, Boutte P, Cambonie G, Simeoni U, Raoult D, La Scola B. 2015. Clostridium butyricum strains and dysbiosis linked to necrotizing enterocolitis in preterm neonates. Clin Infect Dis 61:1107–1115. doi: 10.1093/cid/civ468. [DOI] [PubMed] [Google Scholar]
- 4.Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18:821–829. doi: 10.1101/gr.074492.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Lagesen K, Hallin PF, Rødland E, Stærfeldt HH, Rognes T, Ussery DW. 2007. RNammer: consistent annotation of rRNA genes in genomic sequences. Nucleic Acids Res 35:3100–3108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Hyatt D, Chen G, Locascio PF, Land ML, Larimer FW, Hauser LJ. 2010. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:119. doi: 10.1186/1471-2105-11-119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J, Heger A, Holm L, Sonnhammer ELL, Eddy SR, Bateman A, Finn RD. 2012. The Pfam protein families database. Nucleic Acids Res 40:D290–D301. doi: 10.1093/nar/gkr1065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Tatusov RL, Galperin MY, Natale DA, Koonin EV. 2000. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res 28:33–36. doi: 10.1093/nar/28.1.33. [DOI] [PMC free article] [PubMed] [Google Scholar]
