Here, we report the draft genome sequence of the bacteriocin-producing Enterococcus faecium strain HY07, isolated from traditional Chinese fermented sausages. The genome comprises 2,585,631 bp with 2,624 coding sequences, as assigned by NCBI, which may provide fundamental molecular information on elucidating the adaption mechanism of Enterococcus faecium to the meat environment.
ABSTRACT
Here, we report the draft genome sequence of the bacteriocin-producing Enterococcus faecium strain HY07, isolated from traditional Chinese fermented sausages. The genome comprises 2,585,631 bp with 2,624 coding sequences, as assigned by NCBI, which may provide fundamental molecular information on elucidating the adaption mechanism of Enterococcus faecium to the meat environment.
ANNOUNCEMENT
Enterococci are found in many food products, especially those of animal origin, seafood, and dairy products. Enterococci are the most thermotolerant nonsporulating bacteria, and some can even survive pasteurization temperatures. Tolerance to environmental extremes explains their survival during the processing of cooked and uncooked cured meats, as well as their ability to multiply during fermentations (1). On the one hand, the species Enterococcus faecium was considered a harmless commensal of the mammalian gastrointestinal (GI) tract and is commonly used as a probiotic. On the other hand, some of the E. faecium strains are typical opportunistic pathogens that cause diseases, especially in nosocomial settings, which may, in part, be linked to the presence of virulence determinants and antibiotic resistance (2).
In this study, we report on the genome sequence of E. faecium HY07, which was isolated from traditional Chinese fermented sausage. We also found that E. faecium HY07 exhibited antimicrobial activity against Listeria, Salmonella, Shigella, Staphylococcus, Clostridium, and Bacillus species (3).
E. faecium HY07 was anaerobically cultured overnight at 37°C in de Man, Rogosa, and Sharpe (MRS) broth. Genomic DNA was extracted using phenol-chloroform extraction with bead beating, and the genomic libraries were prepared using a sequencing kit (SQK-MAP-006; Oxford Nanopore Technologies, Oxford, UK) according to the manufacturer’s instructions. Pacific Biosciences sequencing technology was then used to sequence the genomic DNA isolated from E. faecium HY07. Trimmomatic version 0.38 was used to trim low-quality reads. A total of 98,024 reads were generated at an average read length of 9,193 bp, with an average depth of coverage of 432.70-fold. Corrected reads were assembled using the Celera Assembler (4). Protein-coding sequences were predicted by GLIMMER software version 3.02 (5) and annotated using BLAST searches of nonredundant (Nr) protein sequences from the NCBI, clusters of orthologous groups (COG), Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases (6–8). tRNA genes were detected using tRNAscan-SE 1.23, and rRNA genes were detected using RNAmmer 1.2 (9, 10).
The complete genome of E. faecium HY07 is composed of a 2,585,631-bp chromosome with a G+C content of 38.34% and two plasmids, namely, a plasmid of 258,852 bp with a G+C content of 36.18% and another plasmid of 21,516 bp with a G+C content of 33.55%. Using GLIMMER, 2,838 open reading frames (ORFs) were predicted in E. faecium HY07, including 2,624 protein-coding genes, 67 tRNA-coding genes, and 18 rRNA-coding genes.
Several coding DNA sequences (CDSs) for the production of bacteriocins, namely, enterocin P (GenBank accession number AYA33186), which contains the consensus class IIa bacteriocin motif YGNGV in the N-terminal region, were found (11). The genes encoding bacteriocins (GenBank accession numbers AYA33274, AYA35563, and AYA35573) were found in the genome of E. faecium HY07.
A search for virulence factors associated with invasiveness and disease severity was conducted (12). The genes involved in the resistance to antimicrobials, such as copper, penicillin, tetronasin, and vancomycin, were found in the genome of E. faecium HY07.
These data provide insights into the genetic basis of bacteriocin-producing E. faecium HY07 in relation to the meat environment.
Data availability.
The complete genome sequence project has been deposited in GenBank under the accession numbers CP032307 (plasmid), CP032308 (chromosome), and CP032309 (plasmid). The versions described in this paper are the first versions, CP032307.1, CP032308.1, and CP032309.1. Raw sequencing reads are available in NCBI BioProject under the accession number PRJNA416245.
ACKNOWLEDGMENTS
This work was supported by the National Key Research and Development Program of China (number 2016YFF0202300), the National Natural Science Foundation of China (number 31571811), Research Projects of Hunan Provincial Education Department (number 16K043), and double first-class construction project of Hunan Agricultural University (number SYL201802002).
REFERENCES
- 1.Hugas M, Garriga M, Aymerich MT. 2003. Functionality of enterococci in meat products. Int J Food Microbiol 88:223–233. doi: 10.1016/S0168-1605(03)00184-3. [DOI] [PubMed] [Google Scholar]
- 2.Ogier JC, Serror P. 2008. Safety assessment of dairy microorganisms: the Enterococcus genus. Int J Food Microbiol 126:291–301. doi: 10.1016/j.ijfoodmicro.2007.08.017. [DOI] [PubMed] [Google Scholar]
- 3.Zhou H, Hu YM, Jiang LW, Zhou HL, Ma J, Liu CG. 2015. Antilisterial activity of bacteriocin HY07 from Enterococcus faecium HY07 isolated from Chinese sausages. Food Biotech 29:51–68. doi: 10.1080/08905436.2014.996893. [DOI] [Google Scholar]
- 4.Myers EW, Sutton GG, Delcher AL, Dew IM, Fasulo DP, Flanigan MJ, Kravitz SA, Mobarry CM, Reinert KH, Remington KA, Anson EL, Bolanos RA, Chou HH, Jordan CM, Halpern AL, Lonardi S, Beasley EM, Brandon RC, Chen L, Dunn PJ, Lai Z, Liang Y, Nusskern DR, Zhan M, Zhang Q, Zheng X, Rubin GM, Adams MD, Venter JC. 2000. A whole-genome assembly of Drosophila. Science 287:2196–2204. doi: 10.1126/science.287.5461.2196. [DOI] [PubMed] [Google Scholar]
- 5.Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improved microbial gene identification with GLIMMER. Nucleic Acids Res 27:4636–4641. doi: 10.1093/nar/27.23.4636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Tatusov RL, Galperin MY, Natale DA, Koonin EV. 2000. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res 28:33–36. doi: 10.1093/nar/28.1.33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G. 2000. Gene ontology: tool for the unification of biology. Nat Genet 25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Kanehisa M, Goto S, Kawashima S, Okuno Y, Hattori M. 2004. The KEGG resource for deciphering the genome. Nucleic Acids Res 32:D277–D280. doi: 10.1093/nar/gkh063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T, Ussery DW. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 35:3100–3108. doi: 10.1093/nar/gkm160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Chang S-Y, Chen Y-S, Pan S-F, Lee Y-S, Chang C-H, Chang C-H, Yu B, Wu H-C. 2013. Enterocin TW21, a novel bacteriocin from dochi-isolated Enterococcus faecium D081821. J Appl Microbiol 115:673–678. doi: 10.1111/jam.12265. [DOI] [PubMed] [Google Scholar]
- 12.Giridhara Upadhyaya PM, Umapathy BL, Ravikumar KL. 2010. Comparative study for the presence of enterococcal virulence factors gelatinase, hemolysin and biofilm among clinical and commensal isolates of Enterococcus faecalis. J Lab Physicians 2:100–104. doi: 10.4103/0974-2727.72159. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The complete genome sequence project has been deposited in GenBank under the accession numbers CP032307 (plasmid), CP032308 (chromosome), and CP032309 (plasmid). The versions described in this paper are the first versions, CP032307.1, CP032308.1, and CP032309.1. Raw sequencing reads are available in NCBI BioProject under the accession number PRJNA416245.