Skip to main content
New Microbes and New Infections logoLink to New Microbes and New Infections
. 2016 May 4;12:90–95. doi: 10.1016/j.nmni.2016.04.011

Noncontiguous finished genome sequence and description of Enterococcus massiliensis sp. nov.

S Le Page 1, T Cimmino 1, A Togo 1, M Million 1, C Michelle 1, S Khelaifia 1, J-C Lagier 1, D Raoult 1, J-M Rolain 1,
PMCID: PMC4913184  PMID: 27330820

Abstract

Enterococcus massiliensis strain sp. nov. (= CSUR P1927 = DSM 100308) is a new species within the genus Enterococcus. This strain was first isolated from a fresh stool sample of a man during culturomics study of intestinal microflora. Enterococcus massiliensis is a Gram-positive cocci, facultative anaerobic and motile. E. massiliensis is negative for mannitol and positive for β-galactosidase, contrary to E. gallinarum. The complete genome sequence is 2 712 841 bp in length with a GC content of 39.6% and contains 2617 protein-coding genes and 70 RNA genes, including nine rRNA genes.

Keywords: Culturomics, Enterococcus massiliensis, genome, new species, taxonogenomics

Introduction

Enterococcus massiliensis sp. nov. strain AM1T (= CSUR P1927 = DSM 100308) belongs to the genus Enterococcus. This bacterium is a Gram-positive, facultative anaerobic, motile and unpigmented. It was isolated from a fresh stool sample of a human in Marseille as part of culturomics study [1]. Currently, a polyphasic approach that combines proteomic by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) analysis [2], genomic data with 16S rRNA sequence identity and phylogeny [3], genomic G+C content diversity and DNA-DNA hybridization (DDH) and phenotypic characterization is used to describe new bacterial species [4].

Enterococcus were classified in the genus of Streptococcus because of the presence of the D antigen until 1984 [5]. But after analysis of the genome of Streptococcus faecalis and S. faecium, these strains have been transferred to the genus Enterococcus [6]. Members of the genus Enterococcus are components of the intestinal flora of humans and animals. There are opportunistic pathogens with two principal strains: Enterococcus faecalis and Enterococcus faecium, responsible for nosocomial infections [7], [8].

Here we present a summary classification and a set of features for E. massiliensis sp. nov. strain AM1T together with the description of the complete genome sequence and annotation. These characteristics support the circumscription of the species E. massiliensis.

Organism Information

A stool sample was collected in 2015 from a voluntary patient as a negative control and isolated on Columbia agar supplemented with 5% sheep's blood (bioMérieux, Marcy-l’Étoile, France) in aerobic and anaerobic condition using GasPak EZ Anaerobe Container System Sachets (Becton Dickinson (BD), San Diego, CA, USA) at 37°C. Enterococcus massiliensis was sequenced as part of a culturomics study aiming to isolate all bacterial species colonizing the human gut [9]. Enterococcus massiliensis strain AM1T (GenBank accession no. LN833866) exhibited a 97% 16S rRNA nucleotide sequence similarity with Enterococcus gallinarum (JF915769), the phylogenetically closest validly published bacterial species (Fig. 1) after comparison with National Center for Biotechnology Information (NCBI) database. This value is lower than 98.7% 16S rRNA gene sequence similarity set as a threshold recommended by Stackebrandt and Ebers [3] to delineate a new species without carrying out DNA-DNA hybridization.

Fig. 1.

Fig. 1

Consensus phylogenetic tree highlighting position of Enterococcus massiliensis relative to other type strains within genus Enterococcus by 16S. GenBank accession numbers appear in brackets. Sequences were aligned using CLUSTALW, and phylogenetic inferences were obtained using maximum-likelihood method in MEGA6 software package. Numbers at nodes are percentages of bootstrap values from 1000 replicates that support nodes. Streptococcus pneumoniae and Staphylococcus aureus were used as outgroups. Scale bar = 1% nucleotide sequence divergence.

Growth occurred between 25°C and 37°C, but optimal growth was observed at 37°C, 24 hours after inoculation. Colonies were smooth and whitish, approximately 1 mm in diameter on 5% sheep's blood–enriched agar (bioMérieux). Growth of the strain was tested under anaerobic and microaerophilic conditions using GasPak EZ Anaerobe pouch (BD) and CampyGen Compact (Oxoid, Basingstoke, UK) systems, respectively, and in aerobic conditions, with or without 5% of CO2. Growth was achieved under aerobic (with and without CO2), microaerophilic and anaerobic conditions. Gram staining showed Gram-positive cocci without sporulation (Fig. 2A). A motility test was positive and realized with API M Medium (bioMérieux), a semisolid medium with an inoculation performed by swabbing one colony into the medium. After 24 hours of incubation, the growth of E. massiliensis was away from this stabbed line, characteristic of positive motility. Cells grown on agar exhibited a mean diameter of 0.5 μm and a mean length ranging from 1.1 to 1.3 μm (mean 1.2 μm), determined by negative staining transmission electron microscopy (Fig. 2B).

Fig. 2.

Fig. 2

(A) Gram stain and (B) transmission electron micrograph of Enterococcus massiliensis strain taken by Technai G20 Cryo (FEI Company, Limeil-Brevannes, France) at operating voltage of 200 kV at 1000× magnification.

Differential phenotypic characteristics using API 50CH and API Zym system (bioMérieux) between E. massiliensis sp. nov. AM1T and other Enterococcus species [9] are presented in Table 1. Antibiotic susceptibility testing was performed by the disk diffusion method on Müller-Hinton agar with blood (bioMérieux). E. massiliensis is susceptible to vancomycin, teicoplanin, linezolid, gentamicin, ciprofloxacin, doxycycline, rifampicin and pristinamycin and resistant or intermediate to penicillin G, oxacillin, ceftriaxone, cefoxitin, trimethoprim/sulfamethoxazole, fosfomycin, erythromycin and clindamycin.

Table 1.

Differential characteristics of Enterococcus massiliensis sp. AM1, E. faecalis, E. casseliflavus, E. gallinarum, E. haemoperoxidus, E. cecorum, E. sulfureus and E. caccae

Property E. massiliensis E. faecalis E. casseliflavus E. gallinarum E. haemoperoxidus E. cecorum E. sulfureus E. caccae
Oxygen requirement Facultative anaerobic Facultative anaerobic Facultative anaerobic Facultative anaerobic Facultative anaerobic Facultative anaerobic Facultative anaerobic Facultative anaerobic
Gram stain Positive Positive Positive Positive Positive Positive Positive Positive
Motility Motile Motile Motile
Pigment + +
Production of:
 Alkaline phosphatase
 Catalase
 Oxydase
 β-Glucuronidase NA + NA NA
 α-Galactosidase NA NA NA
 β-Galactosidase + NA NA NA
 N-acetyl-glucosamine + + + + + +
Acid form:
 Mannitol + + + +
 Sorbose
 l-Arabinose + + +
 Sorbitol + v
 d-Raffinose + + + + +
 Xylose + + +
 d-Trehalose + + + + + + + +
G+C content (%) 39.6 37.3 42.7 40.7 35.8 36.3 37.8 35.8
Habitat Human stool Intestine of mammals Intestine of mammals Intestine of mammals Water Commensal chicken Plants Human stool

+, positive result; −, negative result; v, variable result; NA, data not available.

Extended Features Descriptions

MALDI-TOF MS protein analysis was carried out as previously described [2] using a Microflex spectrometer (Bruker Daltonics, Leipzig, Germany). Twelve distinct deposits were done for strain AM1T from 12 isolated colonies. Twelve distinct deposits were done for strain AM1T from 12 isolated colonies. Spectra were imported into the MALDI BioTyper software, version 2.0 (Bruker), and analysed by standard pattern matching against 7765 bacterial spectra, including 92 spectra from 31 Enterococcus species, in the BioTyper database. Interpretation of scores was as follows: a score of ≥2 enabled the identification at the species level, a score of ≥1.7 but <2 enabled the identification at the genus level and a score of <1.7 did not enable any identification (scores established by the manufacturer, Bruker). For strain AM1T, no significant MALDI-TOF MS score was obtained against the Bruker database, thus suggesting that our isolate was a new species. We incremented our database with the spectrum from strain AM1T (Fig. 3).

Fig. 3.

Fig. 3

Graphical circular map of chromosome. From outside to centre: genes on forward strand (coloured by COGs categories explain in Table 3), genes on reverse strand (coloured by COGs categories), RNA genes (tRNAs green, rRNAs red), G+C content, G+C skew.

Genome Sequencing Information

Enterococcus massiliensis sp. nov. (GenBank accession no. CVRN00000000) is the 54 species described within Enterococcus genus.

After DNA extraction by the phenol–chloroform method, genomic DNA of E. massiliensis was sequenced on a MiSeq instrument (Illumina, San Diego, CA, USA) using paired end and mate pair strategies.

For genome annotation, open reading frames (ORFs) were predicted using Prodigal (http://prodigal.oml.gov/) with default parameters, but the predicted ORFs were excluded if they spanned a sequencing gap region. The predicted bacterial protein sequences were searched for against the GenBank database (http://www.ncbi.nlm.nih.gov/genbank) and the Clusters of Orthologous Groups (COGs) databases using BLASTP. The tRNAScanSE tool [10] was used to find tRNA genes, whereas ribosomal RNAs were detected using RNAmmer [11] and BLASTn against the GenBank database.

The ARG-ANNOT database for acquired antibiotic resistance genes (ARGs) was used for a BLAST search using the Bio-Edit interface [12]. The assembled sequences were searched against the ARG database under moderately stringent conditions (e-value of 10−5) for the in silico ARG prediction. E. massiliensis presents the Lsa gene, encoding a putative ABC protein Lsa with an identity to 72% with Lsa family ABC-F of E. faecalis in NCBI, which phenotypically confirms its resistance to clindamycin.

Analysis of presence of polyketide synthase (PKS) and nonribosomal polyketide synthesis (NRPS) was performed by discriminating the gene with a large size using a database realized in our laboratory; predicted proteins were compared against the nonredundant (nr) GenBank database using BLASTP and finally examined using antiSMASH [13]. Analysis of the genome revealed the absence of NRPKs and PKS. Lipoprotein signal peptides and the number of transmembrane helices were predicted using SignalP [14] and TMHMM [15], respectively. ORFans were identified if their BLASTP E value was lower than 10−3 for alignment length >80 amino acids.

We used the Genome-to-Genome Distance calculator (GGDC) web server (http://ggdc.dsmz.de) to estimate the overall similarity among the compared genomes and to replace the wet-lab DDH by a digital DDH [16], [17]. GGDC 2.0 BLAST+ was chosen as alignment method, and the recommended formula 2 was taken into account to interpret the results.

We compared the genome of E. massiliensis with nine other genomes of Enterococcus strains. The genome is 2 712 841 bp long (one chromosome, no plasmid) with a GC content of 39.6% (Table 2). The properties and statistics of the genome are summarized in Table 2. The draft genome of E. massiliensis is smaller than those of E. moraviensis, E. haemoperoxidus, E. caccae, E. casseliflavus, E. gallinarum and E. faecalis (3.60, 3.58, 3.56, 3.43, 3.16 and 2.96 Mb, respectively), but larger than those of E. saccharolyticus, E. columbae, E. cecorum and E. sulfureus (2.60, 2.58, 2.34 and 2.31, respectively). The G+C content of E. massiliensis is lower than those of E. casseliflavus and E. gallinarum (42.8 and 40.7) but greater than those of E. moraviensis, E. haemoperoxidus, E. caccae, E. saccharolyticus, E. columbae, E. cecorum, E. sulfureus and E. faecalis (39.6, 36.1, 35.7, 35.8, 36.9, 36.6, 36.4, 38.0 and 37.5, respectively). Of the 2687 predicted chromosomal genes, 2617 were protein-coding genes and 70 were RNAs including 61 tRNAs and nine rRNAs (5S = 4, 23S = 2, 16S = 3). A total of 1889 genes (72.2%) were assigned to a putative function (Fig. 3, Table 3). Seventy-one genes were identified as ORFans (2.71%), and the remaining genes were annotated as hypothetical proteins. The distribution of genes into COGs functional categories is presented in Table 3.

Table 2.

Nucleotide content and gene count levels of genome

Attribute Value % of totala
Genome size (bp) 2 712 841 100
DNA G+C content (bp) 1 075 567 39.6
DNA coding region (bp) 2 408 151 88.77
Total genes 2687 100
RNA genes 70 2.60
Protein-coding genes 2617 97.39
Genes with function prediction 1889 72.18
Genes assigned to COGs 1863 71.19
Genes with peptide signals 250 9.55
Genes with transmembrane helices 630 24.07

COGs, Clusters of Orthologous Groups database.

a

Total is based on either size of genome in base pairs or total number of protein-coding genes in annotated genome.

Table 3.

Number of genes associated with 25 general COGs functional categories

graphic file with name fx1.jpg

Conclusion and Perspectives

On the basis of phenotypic, phylogenetic and genomic analyses, we formally propose the creation of Enterococcus massiliensis sp. nov. AM1T. This strain was isolated in Marseille, France.

Taxonomic and Nomenclatural Proposals

Description of Enterococcus massiliensis sp. nov.

Enterococcus massiliensis (massiliensis because this strain was isolated in Massilia, the Latin name of Marseille, where the strain was sequenced).

Colonies were whitish and approximately 1 mm diameter on 5% sheep's blood–enriched agar. Cells are Gram-positive, nonhaemolytic, facultative anaerobic with a mean length of 1.2 μm and a mean diameter of 0.6 μm. Growth occurred between 25°C to 37°C, but optimal growth was observed at 37°C. Alkaline phosphatase, esterase (C4), esterase lipase (C8), leucine arylamidase, acid phosphatase, β-galactosidase and N-acetyl-β-glucosaminidase activities were present. Esculin activity was also positive, but catalase, oxydase, β-galactosidase and N-acetyl-β-glucosaminidase were negative. Positive reaction were obtained for d-ribose, d-glucose, d-fructose, d-mannose and N-acetylglucosamine. E. massiliensis was susceptible to vancomycin, teicoplanin, linezolid, gentamicin, ciprofloxacin, doxycycline, rifampicin and pristinamycin, but resistant to trimethoprim/sulfamethoxazole, fosfomycin, erythromycin and clindamycin.

The G+C content of the genome is 39.6%. The 16S rRNA and genome sequences are deposited in GenBank under accession numbers LN833866 and CVRN00000000, respectively. The type strain AM1T (= CSUR P1927 = DSM 100308) was isolated from a fresh stool sample of a patient in Marseille, France.

Acknowledgements

The authors thank the Xegen Company (http://www.xegen.fr/) for automating the genomic annotation process.

Conflict of Interest

None declared.

References

  • 1.Lagier J.C., Hugon P., Khelaifia S., Fournier P.E., La Scola B., Raoult D. The rebirth of culture in microbiology through the example of culturomics to study human gut microbiota. Clin Microbiol Rev. 2015;28:237–264. doi: 10.1128/CMR.00014-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Seng P., Drancourt M., Gouriet F., La Scola B., Fournier P.E., Rolain J.M. Ongoing revolution in bacteriology: routine identification of bacteria by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Clin Infect Dis. 2009;49:543–551. doi: 10.1086/600885. [DOI] [PubMed] [Google Scholar]
  • 3.Stackebrandt E., Ebers J. Taxonomic parameters revisited: tarnished gold standards. Microbiol Today. 2006;33:152–155. [Google Scholar]
  • 4.Rosselo-Mora R. DNA-DNA reassociation methods applied to microbial taxonomy and their critical evaluation. In: Stackebrandt E., editor. Molecular identification, systematics, and population structure of prokaryotes. Springer; Berlin: 2015. pp. 23–50. [Google Scholar]
  • 5.Thiercelin E., Jouhaud L. Reproduction de l’enterocoque; taches centrales; granulations peripheriques et microblastes. Compt Rend Soc Biol. 1903;55:686–688. [Google Scholar]
  • 6.Schleifer K.H., Kilpper-Balz R. Transfer of Streptococcus faecalis and Streptococcus faecium to the genus Enterococcus nom. rev. as Enterococcus faecalis comb. nov. and Enterococcus faecium comb. nov. Int J Syst Bacteriol. 1984;34:31–34. [Google Scholar]
  • 7.Bereket W., Hemalatha K., Getenet B., Wondwossen T., Solomon A., Zeynudin A. Update on bacterial nosocomial infections. Eur Rev Med Pharmacol Sci. 2012;16:1039–1044. [PubMed] [Google Scholar]
  • 8.Miller W.R., Munita J.M., Arias C.A. Mechanisms of antibiotic resistance in enterococci. Expert Rev Anti Infect Ther. 2014;12:1221–1236. doi: 10.1586/14787210.2014.956092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Lagier J.C., Armougom F., Million M., Hugon P., Pagnier I., Robert C. Microbial culturomics: paradigm shift in the human gut microbiome study. Clin Microbiol Infect. 2012;18:1185–1193. doi: 10.1111/1469-0691.12023. [DOI] [PubMed] [Google Scholar]
  • 10.Lowe T.M., Eddy S.R. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997;25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lagesen K., Hallin P., Rodland E.A., Staerfeldt H.H., Rognes T., Ussery D.W. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 2007;35:3100–3108. doi: 10.1093/nar/gkm160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Gupta S.K., Padmanabhan B.R., Diene S.M., Lopez-Rojas R., Kempf M., Landraud L. ARG-ANNOT, a new bioinformatic tool to discover antibiotic resistance genes in bacterial genomes. Antimicrob Agents Chemother. 2014;58:212–220. doi: 10.1128/AAC.01310-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Weber T., Blin K., Duddela S., Krug D., Kim H.U., Bruccoleri R. antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters. Nucleic Acids Res. 2015;43:W237–W243. doi: 10.1093/nar/gkv437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Bendtsen J.D., Nielsen H., von Heijne G., Brunak S. Improved prediction of signal peptides: SignalP 3.0. J Mol Biol. 2004;340:783–795. doi: 10.1016/j.jmb.2004.05.028. [DOI] [PubMed] [Google Scholar]
  • 15.Krogh A., Larsson B., von Heijne G., Sonnhammer E.L. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001;305:567–580. doi: 10.1006/jmbi.2000.4315. [DOI] [PubMed] [Google Scholar]
  • 16.Meier-Kolthoff J.P., Auch A.F., Klenk H.P., Göker M. Genome sequence–based species delimitation with confidence intervals and improved distance functions. BMC Bioinformatics. 2016;21:14–60. doi: 10.1186/1471-2105-14-60. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Meier-Kolthoff J.P., Klenk H.P., Göker M. Taxonomic use of DNA G+C content and DNA-DNA hybridization in the genomic age. Int J Syst Evol Microbiol. 2014;64:352–356. doi: 10.1099/ijs.0.056994-0. [DOI] [PubMed] [Google Scholar]

Articles from New Microbes and New Infections are provided here courtesy of Elsevier

RESOURCES