Skip to main content
Frontiers in Pharmacology logoLink to Frontiers in Pharmacology
. 2017 Nov 15;8:840. doi: 10.3389/fphar.2017.00840

Genomic Analysis of a Pathogenic Bacterium, Paeniclostridium sordellii CBA7122 Containing the Highest Number of rRNA Operons, Isolated from a Human Stool Sample

Joon Yong Kim 1, Yeon Bee Kim 1, Hye Seon Song 1, Won-Hyong Chung 2, Changsu Lee 1, Seung Woo Ahn 1, Se Hee Lee 1, Min Young Jung 1, Tae-Woon Kim 1, Young-Do Nam 2,3,*, Seong Woon Roh 1,*
PMCID: PMC5695201  PMID: 29187822

Introduction

Paeniclostridium sordellii was first isolated by Alfredo Sordelli in 1922 under the proposed name Bacillus oedematis, and was then renamed Bacillus sordellii in 1927 (Hall and Scott, 1927). Two years later, it was classified as Clostridium sordellii (Hall et al., 1929). Recently, this bacterium was reclassified as a species of the genus Paeniclostridium, named P. sordellii comb. nov. (Sasi Jyothsna et al., 2016). P. sordellii is an anaerobic, Gram-stain-positive, spore-forming rod bacterium with flagella. Most strains are non-pathogenic, but some strains have been associated with severe infections of humans and animals. In humans, P. sordellii is mainly associated with trauma, toxic shock, soft tissue skin infections, and gynecologic infections. Despite the serious consequences of infection with P. sordellii, treatment is difficult because of the rapid progression from recognition of the first symptoms to death (Aldape et al., 2006).

In this study, we performed whole-genome sequencing and genomic analysis of strain CBA7122 belonging to P. sordellii, which was isolated from the stool sample of an 85-year-old healthy female residing in the Republic of Korea. This genomic information of P. sordellii CBA7122 should motivate further research on related strains, which may provide new insight into the pathogenesis of P. sordellii toward development of new strategies for the control, prevention, and treatment of life-threatening infections.

Materials and methods

Strain isolation, culture, and DNA extraction

Strain CBA7122 was isolated from the stool sample of an 85-year-old healthy female by the standard dilution plating technique on modified Eggerth-Gagnon agar medium (containing per liter of distilled water: 10 g peptone, 4 g Na2HPO4, 2 g porcine gastric mucin, 50 ml sheep blood, 15 g agar) at 37°C for 24 h in an anaerobic chamber (Coy Laboratory Products) with an atmosphere of N2/CO2/H2 (90:5:5, by volume). Routine cultivation of strain CBA7122 was performed under the same conditions. Genomic DNA of strain CBA7122 was extracted using the QIAamp DNA extraction kit (Qiagen, USA) and QuickGene DNA tissue kit S (Kurabo, Japan), and quantified using the Quant-iT PicoGreen dsDNA Assay kit (Invitrogen, USA). The condition of extracted DNA was assessed by agarose gel electrophoresis on a 1% agarose gel.

Genome sequencing, assembly, and annotation

The genome sequencing of strain CBA7122 was performed using a PacBio RS II sequencing platform as described previously (Kim et al., 2016). The library based on the genomic DNA of strain CBA7122 was constructed according to the manufacturer's instructions, and sequenced using a Pacific Biosciences RS II instrument. The 150,292 generated reads were filtered and assembled using the HGAP 2 protocol with default parameters in SMRT Analysis version 2.3. Gene prediction and the basic annotation for the assembled genome of strain CBA7122 were performed using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) (Tatusova et al., 2016). In brief, 16S rRNAs and 23S rRNAs were predicted using BLASTn, and 5S rRNAs and small ncRNAs were predicted using cmsearch 1.1.1. tRNAscan-SE was used to predict tRNA gene sequences. Coding sequences (CDSs) were detected using GenMarkS+. Functional gene annotations of the genome of strain CBA7122 were performed against various databases, including the catalytic families (CatFam) (Yu et al., 2009), Clusters of Orthologous Groups (COG) (Tatusov et al., 2000), NCBI reference sequence (RefSeq) (O'Leary et al., 2016), and SEED (Overbeek et al., 2014) databases. CRISPRs were confirmed using CRISPRFinder (Grissa et al., 2007). Prophage analysis was performed by PHAST (Arndt et al., 2016).

Comparative genomic analysis

To find unique features of the genome of strain CBA7122, the genomes of the following P. sordellii and Paraclostridium bifermentans strains were selected to perform comparative genomic analysis using the NCBI genome database (http://www.ncbi.nlm.nih.gov/genome/): P. sordellii strains ATCC 9714T (GCA_000444075.1), VPI 9048 (GCA_000444095.1), and JGS6382 (GCA_000953555.1), and P. bifermentans ATCC 638T (GCA_000452245.1). Genome similarities between strain CBA7122 and the reference strains were determined using Orthologous Average Nucleotide Identity (OrthoANI) values, and used to reconstruct the phylogenetic tree using the Orthologous Average Nucleotide Identity Tool (Lee et al., 2016). For comparisons at the whole-genome level, the genomes of strain CBA7122 and related strains were aligned using the progressive MAUVE algorithm in the MAUVE multiple genome alignment software 2.4.0 (Darling et al., 2004). Pan-genome analysis was performed by BIOiPLUG (Chunlab, Korea).

Virulence factor identification

To determine the virulence factors of strain CBA7122, Basic Local Alignment Search Tool (BLAST) was used with the core dataset containing information on genes associated with experimentally verified virulence factors in the virulence factor database (VFDB) (Chen et al., 2016), with the expected e-value 0.0001.

Ethics statement

The study protocol was approved by the institutional review board of the Theragen ETEX Bio Institute (700062-20160804-JR-005-02). Before the current study, the purpose, experimental procedure, and benefits were fully explained to the participants. Oral consents were obtained from each volunteer and consent procedure was witnessed and documented on the research record.

Results

General genomic features of P. sordellii CBA7122

The genome of P. sordellii strain CBA7122 comprised 3 contigs and was 3,550,411 bp long. Based on NCBI PGAP, a total of 3,459 genes were predicted, including 17 16S-23S-5S rRNA operons (51 rRNAs), 105 tRNAs, and 4 ncRNAs. The circular map of the genome is displayed in Figure 1, and detailed genome features of strain CBA7122 are listed in Table 1. Among the COG categories, 2,919 genes were categorized to transcription (230 genes); amino acid transport and metabolism (223); energy production and conversion (174); translation, ribosomal structure, and biogenesis (164); signal transduction mechanisms (156); cell wall/membrane/envelope biogenesis (155); carbohydrate transport and metabolism (153); inorganic ion transport and metabolism (143); replication, recombination, and repair (133); coenzyme transport and metabolism (93); nucleotide transport and metabolism (86); posttranslational modification, protein turnover, and chaperones (82); lipid transport and metabolism (45); cell motility (43); cell cycle control, cell division, and chromosome partitioning (33); secondary metabolites biosynthesis, transport, and catabolism (22); and function unknown (984). In the SEED database, a total of 1,470 genes were matched to the subsystem as follows: protein metabolism (264 genes); amino acids and derivatives (228); cofactors, vitamins, prosthetic groups, and pigments (206); carbohydrates (189); cell wall and capsule (169); DNA metabolism (110); nucleosides and nucleotides (105); RNA metabolism (104); virulence, disease, and defense (86); membrane transport (80); fatty acids, lipids, and isoprenoids (80); stress response (80); dormancy and sporulation (69); motility and chemotaxis (65); phosphorus metabolism (42); respiration (39); cell division and cell cycle (38); phages, prophages, transposable elements, and plasmids (28); regulation and cell signaling (25); sulfur metabolism (18); iron acquisition and metabolism (17); miscellaneous (14); potassium metabolism (11); nitrogen metabolism (9); and metabolism of aromatic compounds (1). According to CRISPR analysis, strain CBA7122 did not have known CRISPRs. Strain CBA7122 had two intact phage genomes (30,978 bp long and 41,432 bp long), located in contig 2. Furthermore, 12,687 of the 16,833-bp-long contig 1 matched to incomplete phage genes.

Figure 1.

Figure 1

Circular map of the Paeniclostridium sordellii CBA7122 genome. RNA genes (red, rRNA; blue, tRNA), forward and reverse strands (colored according to COG categories) are indicated from the outer fringe to the center. Inner circles show the GC content in yellow and blue and the GC skew is shown with red and green indicating positive and negative values, respectively. This genome map was generated by CLgenomics 1.52 (Chun Lab Inc.).

Table 1.

General genome features of Paeniclostridium sordellii CBA7122.

Item Values
Finishing quality Draft
Sequencing platform PacBio RS II
Assembler PacBio SMRT Analysis 2.3.0
Methods reads 82,499
Genome coverage 298.06 X
Assembly size (bp) 3,550,411
N50 3,527,419
DNA G + C content (%) 27.8
Total contigs 3
Total genes 3,459
Total CDSs 3,299
RNA genes 160
    rRNAs 17 rRNA operons (16S-23S-5S)
    tRNAs 105
    ncRNAs 4

The number of rRNA gene copies of P. sordellii CBA7122

Interestingly, the genome of strain CBA7122 had 17 rRNA operons, which represented the highest number of rRNA operons known so far in the domain Bacteria. Of all P. sordellii genomes listed in the NCBI genome database, strain CBA7122 had the highest 16S and 23S rRNA genes copies, whereas other genomes had lower 16S (3.7 in average) and 23S (5.5 in average) rRNA genes copies (Supplementary Figure S1). In the case of the number of 5S rRNA genes, all genomes had an average of 13.7 gene copies, and strain CBA7122 had the second most number of gene copies. According to Roller et al. (2016), the number of rRNA operons is known to be associated with these two factors of reproduction, growth rate, and growth efficiency.

Comparative genomic analysis

Based on the OrthoANI values, strain CBA7122 had greatest sequence similarity with P. sordellii strain JGS6382 (98.74%), followed by P. sordellii strains VPI 9048 (98.69%) and ATCC 9714T (98.31%), and was most dissimilar to P. bifermentans ATCC 638T (81.14%). The phylogenetic tree based on orthoANI values supported that strain CBA7122 is closely related to P. sordellii (Supplementary Figure S2). Whole-genome comparison of strain CBA7122 with P. sordellii JGS6382, VPI 9048, and ATCC 9714T, and P. bifermentans ATCC 638T revealed that most of locally collinear blocks (LCBs) are closely homologous within the species rather than genus (Supplementary Figure S3). In the pan-genome analysis, a total of 4,856 pan-genome orthologous groups (POGs) were obtained from the 17,352 CDSs of the 5 genomes. As shown in Supplementary Figure S4, the core genome comprises 2,481 POGs and the genome of strain CBA7122 contains only 145 POGs as singletons. Of the 48 genes in these 145 singleton POGs, 36 were annotated to the SEED subsystem, including cell wall and capsule (8 genes); amino acids and derivatives (5); DNA metabolism (5); motility and chemotaxis (4); phages, prophages, transposable elements, and plasmids (2); carbohydrates (1); miscellaneous (1); nucleosides and nucleotides (1); phosphorus metabolism (1); RNA metabolism (1); and virulence, disease, and defense subsystem (1). The others were annotated as hypothetical or uncharacterized proteins (Supplementary Table S1).

Virulence factors

We identified several known virulence factors in the genome of strain CBA7122, including perfringolysin O (pfoA), sialidase (nanH), thiol-activated cytolysin (ALO), polysialic acid capsule biosynthesis protein SiaC (siaC), Hsp60, 60K heat shock protein HtpB (htpB), capsular polysaccharide synthesis enzyme Cap8D (cap8D), UDP-galactopyranose mutase (cpsI), UDP-glucose 6-dehydrogenase (hasB), ATPase EscN (escN), glycosyl transferase CpsE (cpsE), Listeria adhesion protein Lap (lap), ATP-dependent protease (clpE), sigma 54-dependent response regulator (fleR/flrC), collagenase (colA), and UDP-galactopyranose mutase (glf). However, there were no large clostridial cytotoxin (LCC) genes identified in the genome of strain CBA7122, whereas these genes were identified in strains ATCC 9714T and VPI 9048, which are known as the key factors of human infection leading to death.

Our data based on genomic analyses provides basic information of P. sordellii, which should serve as a useful reference for detailed studies focused on gaining a better understanding of the virulence factors in the genomes of these strains and their effects on human health. In addition, since strain CBA7122 contains a prophage genome, it is necessary to check whether the human infection status will be changed through phage therapy in further studies.

Data access

The genome sequence of Paeniclostridium sordellii CBA7122 were deposited in the DDBJ/ENA/GenBank under accession numbers BDJI01000001BDJI01000003.

Author contributions

SWR and Y-DN designed and coordinated all the experiments. HSS performed cultivation, DNA extraction, and purification. JYK, YBK, SHL, and MYJ performed the sequence assembly, gene prediction, gene annotation, comparative genomic analysis, and wrote manuscript. W-HJ, CL, SWA, and T-WK checked and edited the manuscript. All authors have read and approved the manuscript.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Footnotes

Funding. This research was supported by grants from the World Institute of Kimchi (KE1702-2), funded by the Ministry of Science and ICT, Korea Food Research Institute (E0170602-01), and Basic Science Research Program through the National Research Foundation of Korea (NRF) (2015R1D1A1A09061039), Republic of Korea.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphar.2017.00840/full#supplementary-material

References

  1. Aldape M. J., Bryant A. E., Stevens D. L. (2006). Clostridium sordellii infection: epidemiology, clinical findings, and current perspectives on diagnosis and treatment. Clin. Infect. Dis. 43, 1436–1446. 10.1086/508866 [DOI] [PubMed] [Google Scholar]
  2. Arndt D., Grant J. R., Marcu A., Sajed T., Pon A., Liang Y., et al. (2016). PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 44, W16–W21. 10.1093/nar/gkw387 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Chen L., Zheng D., Liu B., Yang J., Jin Q. (2016). VFDB 2016: hierarchical and refined dataset for big data analysis−10 years on. Nucleic Acids Res. 44, D694–D697. 10.1093/nar/gkv1239 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Darling A. C., Mau B., Blattner F. R., Perna N. T. (2004). Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 14, 1394–1403. 10.1101/gr.2289704 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Grissa I., Vergnaud G., Pourcel C. (2007). CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 35, W52–W57. 10.1093/nar/gkm360 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Hall I. C., Rymer M. R., Jungherr E. (1929). Comparative Study of Bacillus sordellii (Hall and Scott) and Clostridium Oedematoides (Meleney, Humphreys and Carp). J. Infect. Dis. 45, 42–60. 10.1093/infdis/45.1.42 [DOI] [Google Scholar]
  7. Hall I. C., Scott J. P. (1927). Bacillus Sordellii, a cause of malignant edema in man. J. Infect. Dis. 41, 329–355. 10.1093/infdis/41.5.329 [DOI] [Google Scholar]
  8. Kim J. Y., Song H. S., Kim Y. B., Kwon J., Choi J. S., Cho Y. J., et al. (2016). Genome sequence of a commensal bacterium, Enterococcus faecalis CBA7120, isolated from a Korean fecal sample. Gut Pathog. 8:62. 10.1186/s13099-016-0145-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Lee I., Kim Y. O., Park S. C., Chun J. (2016). OrthoANI: an improved algorithm and software for calculating average nucleotide identity. Int. J. Syst. Evol. Microbiol. 66, 1100–1103. 10.1099/ijsem.0.000760 [DOI] [PubMed] [Google Scholar]
  10. O'Leary N. A., Wright M. W., Brister J. R., Ciufo S., Haddad D., McVeigh R., et al. (2016). Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 44, D733–D745. 10.1093/nar/gkv1189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Overbeek R., Olson R., Pusch G. D., Olsen G. J., Davis J. J., Disz T., et al. (2014). The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res. 42, D206–D214. 10.1093/nar/gkt1226 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Roller B. R., Stoddard S. F., Schmidt T. M. (2016). Exploiting rRNA operon copy number to investigate bacterial reproductive strategies. Nat. Microbiol. 1:16160. 10.1038/nmicrobiol.2016.160 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Sasi Jyothsna T. S., Tushar L., Sasikala C., Ramana C. V. (2016). Paraclostridium benzoelyticum gen. nov. sp. nov., isolated from marine sediment and reclassification of Clostridium bifermentans as Paraclostridium bifermentans comb. nov. Proposal of a new genus Paeniclostridium gen. nov. to accommodate Clostridium sordellii and Clostridium ghonii. Int. J. Syst. Evol. Microbiol. 66, 2459–2459. 10.1099/ijsem.0.001144 [DOI] [PubMed] [Google Scholar]
  14. Tatusov R. L., Galperin M. Y., Natale D. A., Koonin E. V. (2000). The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 28, 33–36. 10.1093/nar/28.1.33 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Tatusova T., DiCuccio M., Badretdin A., Chetvernin V., Nawrocki E. P., Zaslavsky L., et al. (2016). NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res. 44, 6614–6624. 10.1093/nar/gkw569 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Yu C., Zavaljevski N., Desai V., Reifman J. (2009). Genome-wide enzyme annotation with precision control: catalytic families (CatFam) databases. Proteins 74, 449–460. 10.1002/prot.22167 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Frontiers in Pharmacology are provided here courtesy of Frontiers Media SA

RESOURCES