ABSTRACT
Mycobacterium setense CSUR47 is a rapidly growing Mycobacterium species strain isolated from pus collected from a left maxillary sinus in Marseille, France. Here, we report the complete 6,278,097-bp genome sequence of M. setense CSUR47, which exhibits a 66.40% GC content and encodes 5,863 protein-coding genes, 48 tRNAs, and 9 rRNAs.
GENOME ANNOUNCEMENT
Mycobacterium setense is a nontuberculous, rapidly growing Mycobacterium species belonging to the M. fortuitum complex of mycobacteria (1). This emerging species was initially isolated from a 52-year-old patient presenting with soft tissue infection and osteitis in France (1). In 2008, a second isolate named M. setense CSUR47 was isolated in Marseille from pus collected from the left maxillary sinus of a patient with a history of sinus osseous graft (2). We sequenced and analyzed the whole-genome sequence of strain CSUR47 in order to describe its genomic content and to determine its phylogenetic relationships for facilitating its identification.
M. setense CSUR47 was cultured on Middlebrook 7H11 agar supplemented with 10% (vol/vol) oleic acid-albumin-dextrose-catalase (Becton, Dickinson, Sparks, MD, USA). Genomic DNA was then sequenced with MiSeq technology (Illumina, Inc., San Diego, CA, USA) using a mate pair library. The index representation for M. setense CSUR47 was determined to be 10.10%. A total of 1,538,022 paired-end reads were filtered per the read qualities and then assembled using the Velvet tool (3). Contigs were combined using SSPACE (4) assisted by manual finishing and GapFiller (5). This yielded a 6,278,097-bp draft genome with a 66.40% GC content and 19 scaffolds composed of 63 contigs. Prodigal (6) was used with default parameters to predict open reading frames (ORFs). Functional annotation was achieved using a BLASTp search against the GenBank (7) and Clusters of Orthologous Groups (COGs) databases (8). When no hit was found, a second search was performed against the NR database using BLASTp. RNAmmer (9), ARAGORN (10), Rfam (11), Pfam (12), and Infernal (13) were used to predict noncoding genes and miscellaneous features. Of the 5,918 predicted genes, 5,863 were protein-coding genes and 55 were RNAs, including 3 5S rRNAs, 3 16S rRNAs, 3 23S rRNAs, and 48 tRNAs. A total of 4,507 genes (76.87% of protein-coding genes) were assigned a putative function, 94 genes were identified as ORFs with no homologs (ORFans) (1.60%), and 982 genes (16.75%) were annotated as hypothetical proteins.
Based on 16S rRNA gene sequence proximity, genomes were selected and incorporated with in silico DNA-DNA hybridization (DDH) (14). The DDH values were calculated using the GGDC version 2.0 online tool (15). This analysis yielded sequence similarities of 99.80% with the type strain M. setense DSM 45070 (GenBank accession number JTJW00000000), 89.80% with the nonpathogenic strain Manresensis (JTLZ00000000), 30.50% with M. farcinogenes DSM 43637 (CCAY000000000), 30.40% with M. senegalense CK2 (LDPU00000000), 29.70% with M. houstonense ATCC 49403 (NZ_FJVO00000000), 29.40 with M. fortuitum DSM 46621 (ALQB00000000), and 20.60 with Mycobacterium gilvum PYR-GCK (NC_009338). These results confirmed that strain CSUR47, also known as isolate 74023791 (2), belongs to the M. setense species within the M. fortuitum complex. Furthermore, M. setense CSUR47 was more closely related to the clinical M. setense DSM 45070 type strain than to the environmental Manresensis strain isolated from Cardener River in Manresa, Spain, and was shown to be less virulent than M. bovis BCG in immunocompromised mice (16).
Accession number(s).
The M. setense CSUR47 genome sequence has been deposited at EMBL under the accession number OEJY01000000.
ACKNOWLEDGMENT
This study was financially supported by IHU Méditerranée Infection, Marseille, France.
Footnotes
Citation Bouam A, Levasseur A, Drancourt M. 2018. Draft genome sequence of Mycobacterium setense CSUR47. Genome Announc 6:e01415-17. https://doi.org/10.1128/genomeA.01415-17.
REFERENCES
- 1.Lamy B, Marchandin H, Hamitouche K, Laurent F. 2008. Mycobacterium setense sp. nov., a Mycobacterium fortuitum-group organism isolated from a patient with soft tissue infection and osteitis. Int J Syst Evol Microbiol 58:486–490. doi: 10.1099/ijs.0.65222-0. [DOI] [PubMed] [Google Scholar]
- 2.Toro A, Adekambi T, Cheynet F, Fournier PE, Drancourt M. 2008. Mycobacterium setense infection in humans. Emerg Infect Dis 14:1330–1332. doi: 10.3201/eid1408.080179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res 18:821–829. doi: 10.1101/gr.074492.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W. 2011. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 27:578–579. doi: 10.1093/bioinformatics/btq683. [DOI] [PubMed] [Google Scholar]
- 5.Boetzer M, Pirovano W. 2012. Toward almost closed genomes with GapFiller. Genome Biol 13:R56. doi: 10.1186/gb-2012-13-6-r56. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Hyatt D, Chen GL, Locascio PF, Land ML, Larimer FW, Hauser LJ. 2010. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:119. doi: 10.1186/1471-2105-11-119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Benson DA, Karsch-Mizrachi I, Clark K, Lipman DJ, Ostell J, Sayers EW. 2012. GenBank. Nucleic Acids Res 40:D48–D53. doi: 10.1093/nar/gkr1202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Tatusov RL, Galperin MY, Natale DA, Koonin EV. 2000. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res 28:33–36. doi: 10.1093/nar/28.1.33. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T, Ussery DW. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 35:3100–3108. doi: 10.1093/nar/gkm160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Laslett D, Canback B. 2004. ARAGORN, a program to detect tRNA genes and tmRNA genes in nucleotide sequences. Nucleic Acids Res 32:11–16. doi: 10.1093/nar/gkh152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Griffiths-Jones S, Bateman A, Marshall M, Khanna A, Eddy SR. 2003. Rfam: an RNA family database. Nucleic Acids Res 31:439–441. doi: 10.1093/nar/gkg006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K, Ceric G, Clements J, Heger A, Holm L, Sonnhammer EL, Eddy SR, Bateman A, Finn RD. 2012. The Pfam protein families database. Nucleic Acids Res 40:D290–D301. doi: 10.1093/nar/gkr1065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Nawrocki EP, Kolbe DL, Eddy SR. 2009. Infernal 1.0: inference of RNA alignments. Bioinformatics 25:1335–1337. doi: 10.1093/bioinformatics/btp157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Richter M, Rosselló-Móra R. 2009. Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci U S A 106:19126–19131. doi: 10.1073/pnas.0906412106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Auch AF, von Jan M, Klenk HP, Göker M. 2010. Digital DNA-DNA hybridization for microbial species delineation by means of genome-to-genome sequence comparison. Stand Genomic Sci 2:117–134. doi: 10.4056/sigs.531120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Rech G, Vilaplana C, Velasco J, Pluvinet R, Santín S, Prat C, Julián E, Alcaide F, Comas I, Sumoy L, Cardona PJ. 2015. Draft genome sequences of Mycobacterium setense type strain DSM-45070 and the nonpathogenic strain Manresensis, isolated from the bank of the Cardener River in Manresa, Catalonia, Spain. Genome Announc 3(1):e01485-14. doi: 10.1128/genomeA.01485-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
