Abstract
We report an annotated draft genome sequence of the Mycobacterium smegmatis strain MKD8. This strain acts as a recipient during conjugation with the reference M. smegmatis strain mc2155. While the genomes of the two strains are colinear and have similar sizes, extensive genome-wide sequence variation suggests rich diversity within the M. smegmatis clade.
GENOME ANNOUNCEMENT
Mycobacteria are a biologically diverse group of bacteria that range from obligate pathogens, including Mycobacterium tuberculosis, to free-living saprophytes, such as Mycobacterium smegmatis. M. smegmatis is the model organism for all mycobacterial research because it is nonpathogenic, relatively fast-growing, genetically facile, and well suited for high-throughput analyses (1, 2). Natural isolates of M. smegmatis participate in a novel form of conjugation, termed distributive conjugal transfer, in which chromosomal DNA segments are transferred from a donor strain to a recipient strain (3, 4) and generate transconjugant progeny with highly mosaic genomes (T. A. Gray, J. Krywy, J. Harold, M. J. Palumbo, and K. M. Derbyshire, submitted for publication). Our standard experimental conjugation system pairs the widely used laboratory strain M. smegmatis mc2155 (a donor) with an independent strain, M. smegmatis MKD8 (a recipient). MKD8 is a spontaneous streptomycin-resistant subclone of M. smegmatis mc2874 (lysA ept [5]), derived from the original isolate, M. smegmatis PM5 (6). Determining the chromosomal sequence of this strain is necessary to accurately analyze the mosaic transconjugant genomes, identify those genes that distinguish donor and recipient functions, and begin to explore the rich genetic diversity of M. smegmatis strains.
The genomic sequence of M. smegmatis MKD8 was compiled from the data from three approaches: an initial 454 library, an Illumina ~250-bp paired-end library, and a second 454 approach yielding a ~3-kb paired-end library to join some of the repeat-separated contigs. Aggregate reads were assembled de novo using Celera Assembler with the Best Overlap Graph (CABOG) (7) and Velvet (8). The two scaffolds were separated by the two rRNA loci present in M. smegmatis. The assembled draft genome was annotated using the Annotation Engine service (http://ae.igs.umaryland.edu/cgi/index.cgi) and visualized using the Manatee genome curation and browsing tool (http://manatee.sourceforge.net).
The MKD8 genome is 7,092,137 bp long, with an overall G+C content of 67.3%, similar to that of mc2155 (6,988,209 bp; GenBank accession no. NC_008596). Whole-genome comparisons by Mauve (9) showed an overall colinearity between the two strains but significant sequence divergence (1.6% overall single-nucleotide polymorphism [SNP] frequency and 649 indels of >19 bp). One notable difference is that MKD8 lacks the 55.2-kb genome duplication present in mc2155 (msmeg1002 to msmeg1058). Subsets of these SNPs and indels are likely responsible for the many phenotypic differences displayed by these two strains (e.g., in donor and recipient activities, colony morphology, phage susceptibility, biofilm formation, and streptomycin resistance).
Conjugation in mycobacteria generates progeny that are genetic mosaics of the parental strains. Since multiple independent strains of M. smegmatis are conjugationally active (3, 10), mosaic transconjugant genomes could be generated with genes that are lost, acquired, replaced, or blended. Such mosaic transconjugant genotypes might combine phenotypes, or potentially create new phenotypes, to promote the colonization of different environmental niches or hosts. Therefore, transconjugants generated today may become the extant mycobacteria of tomorrow. Establishing the reference genomes of conjugationally active mycobacteria is essential for recognizing the products of conjugation, estimating the gene flow through the community, and identifying the genetic basis of the phenotypes that drive the evolution of mycobacteria. The determination of the MKD8 sequence is an important first step in that direction.
Nucleotide sequence accession number.
The assembled and annotated draft genome sequence of M. smegmatis MKD8 has been deposited in GenBank under the accession no. CM001762.
ACKNOWLEDGMENTS
We are grateful to the Institute for Genome Sciences at the University of Maryland for assistance in genome annotation and analyses and to the Wadsworth Bioinformatics Core.
We are also grateful to the Wadsworth Center and NIH grants AI42308 and R56AI80694 for funding.
Footnotes
Citation Gray TA, Palumbo MJ, Derbyshire KM. 2013. Draft genome sequence of MKD8, a conjugal recipient Mycobacterium smegmatis strain. Genome Announc. 1(2):e00148-13. doi:10.1128/genomeA.00148-13.
REFERENCES
- 1. Shiloh MU, DiGiuseppe Champion PA. 2009. To catch a killer. What can mycobacterial models teach us about Mycobacterium tuberculosis pathogenesis? Curr. Opin. Microbiol. 13:86–92 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Snapper SB, Melton RE, Mustafa S, Kieser T, Jacobs WR., Jr 1990. Isolation and characterization of efficient plasmid transformation mutants of Mycobacterium smegmatis. Mol. Microbiol. 4:1911–1919 [DOI] [PubMed] [Google Scholar]
- 3. Parsons LM, Jankowski CS, Derbyshire KM. 1998. Conjugal transfer of chromosomal DNA in Mycobacterium smegmatis. Mol. Microbiol. 28:571–582 [DOI] [PubMed] [Google Scholar]
- 4. Wang J, Karnati PK, Takacs CM, Kowalski JC, Derbyshire KM. 2005. Chromosomal DNA transfer in Mycobacterium smegmatis is mechanistically different from classical Hfr chromosomal DNA transfer. Mol. Microbiol. 58:280–288 [DOI] [PubMed] [Google Scholar]
- 5. Pavelka MS, Jr, Jacobs WR., Jr. 1996. Biosynthesis of diaminopimelate, the precursor of lysine and a component of peptidoglycan, is an essential function of Mycobacterium smegmatis. J. Bacteriol. 178:6496–6507 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Tokunaga T, Mizuguchi Y, Suga K. 1973. Genetic recombination in mycobacteria. J. Bacteriol. 113:1104–1111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Miller JR, Delcher AL, Koren S, Venter E, Walenz BP, Brownley A, Johnson J, Li K, Mobarry C, Sutton G. 2008. Aggressive assembly of pyrosequencing reads with mates. Bioinformatics 24:2818–2824 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Zerbino DR, Birney E. 2008. Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18:821–829 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Darling AC, Mau B, Blattner FR, Perna NT. 2004. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 14:1394–1403 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Mizuguchi Y, Suga K, Tokunaga T. 1976. Multiple mating types of Mycobacterium smegmatis. Jpn. J. Microbiol. 20:435–443 [DOI] [PubMed] [Google Scholar]