Skip to main content
Genome Biology and Evolution logoLink to Genome Biology and Evolution
. 2015 Jun 16;7(7):1871–1886. doi: 10.1093/gbe/evv111

Characterization of Three Mycobacterium spp. with Potential Use in Bioremediation by Genome Sequencing and Comparative Genomics

Sarbashis Das 1, BM Fredrik Pettersson 1, Phani Rama Krishna Behra 1, Malavika Ramesh 1, Santanu Dasgupta 1, Alok Bhattacharya 2, Leif A Kirsebom 1,*
PMCID: PMC4524478  PMID: 26079817

Abstract

We provide the genome sequences of the type strains of the polychlorophenol-degrading Mycobacterium chlorophenolicum (DSM43826), the degrader of chlorinated aliphatics Mycobacterium chubuense (DSM44219) and Mycobacterium obuense (DSM44075) that has been tested for use in cancer immunotherapy. The genome sizes of M. chlorophenolicum, M. chubuense, and M. obuense are 6.93, 5.95, and 5.58 Mb with GC-contents of 68.4%, 69.2%, and 67.9%, respectively. Comparative genomic analysis revealed that 3,254 genes are common and we predicted approximately 250 genes acquired through horizontal gene transfer from different sources including proteobacteria. The data also showed that the biodegrading Mycobacterium spp. NBB4, also referred to as M. chubuense NBB4, is distantly related to the M. chubuense type strain and should be considered as a separate species, we suggest it to be named Mycobacterium ethylenense NBB4. Among different categories we identified genes with potential roles in: biodegradation of aromatic compounds and copper homeostasis. These are the first nonpathogenic Mycobacterium spp. found harboring genes involved in copper homeostasis. These findings would therefore provide insight into the role of this group of Mycobacterium spp. in bioremediation as well as the evolution of copper homeostasis within the Mycobacterium genus.

Keywords: genome sequencing, biodegradation, Mycobacterium, oxygenases, copper homeostasis

Introduction

Bacteria of the genus Mycobacterium are acid fast, robust, and can inhabit various environmental reservoirs, for example, ground and tap water, soil, animals, and humans. This genus includes nonpathogenic environmental bacteria, opportunistic pathogens, and highly successful pathogens such as Mycobacterium tuberculosis (Mtb, the causative agent of tuberculosis). The diversity of ecological niches inhabited by Mycobacterium spp. demands widely varied life styles with different growth patterns and morphologies and ability to adapt to changes in the environment (Häggblom et al. 1994; Tortoli 2003, 2006; Primm et al. 2004; Vaerewijck et al. 2005; Falkinham 2009, 2015; Kazada et al. 2009; Whitman et al. 2012; Thoen et al. 2014). To understand the biology and versatility of Mycobacterium spp. we need to expand our knowledge about the genomic contents and their phenotypic expressions for different members of this genus.

Mycobacterium chlorophenolicum, Mycobacterium chubuense, and Mycobacterium obuense are classified as rapidly growing mycobacteria found in the same branch of 16S rRNA based phylogenetic trees. The first two are members of the Mycobacterium sphagni clade whereas M. obuense is positioned close but not adjacent to these two and belongs to the Mycobacterium parafortuitum clade (Apajalahti and Salkinoja-Salonen 1987; Miethling and Karlson 1996; McLellan et al. 2007; Whitman et al. 2012). These three species have been isolated from water, soil, and one isolate of M. obuense from the sputum of a patient. Strains related to these three species have the capacity to degrade different types of chlorinated pollutants. Hence they have the potential for use in bioremediation of contaminated soils (Whitman et al. 2012; Satsuma and Masuda 2012; see below).

Mycobacterium chlorophenolicum (originally referred to as Rhodococcus chlorophenolicus) was first isolated from a pentachlorophenol enrichment culture inoculated from chlorophenol-contaminated sediment from a lake in Finland (Briglia et al. 1994; Häggblom et al. 1994). It is a rapidly growing mesophilic Mycobacterium spp. with an optimal growth temperature of 28 °C and it produces yellow to orange colonies. It also shows “coccoid-to-rod-to-coccoid” morphological transitions during its growth cycle (Briglia et al. 1994; Häggblom et al. 1994). Mycobacterium chlorophenolicum can degrade polychlorinated phenol compounds such as pentachlorophenol (PCP; an environmental pollutant used in the past as a wood preservative agent [McLellan et al. 2007]) and its degradation products (Apajalahti and Salkinoja-Salonen 1987; Häggblom et al. 1994; Miethling and Karlson 1996). It has been used for in situ bioremediation of PCP-contaminated soils with some success (Briglia et al. 1994; Miethling and Karlson 1996). A recent study also shows that it can O-methylate tetrachlorobisphenol-A and tetrabromobisphenol-A, a brominated flame retardant used in consumer products, making them more lipophilic (George and Häggblom 2008).

Mycobacterium chubuense was first isolated from garden soil in Japan (Saito et al. 1977; Tsukamura and Mizuno 1977). It is rapidly growing, mesophilic, pigmented, and has rod- and-coccoid shaped cell morphologies and its position based on the 16S rRNA phylogenetic tree is close to M. obuense (98.5% sequence identity; Pitulle et al. 1992). Another isolate, originally referred to as M. chubuense NBB4 (and also Mycobacterium spp. NBB4, see below) has the potential for use in bioremediation. It is able to degrade chlorinated aliphatic compounds such as vinyl chloride and 1,2-dichloroethane, both intermediates in the production of polyvinyl chloride. In addition, a very broad range of hydrocarbons can promote its growth and the number of genes encoding mono-oxygenases is unusually high in this strain (Coleman et al. 2006; Le and Coleman 2011) as revealed by the complete genome sequence that was recently made available (acc no NC_018027.1).

Mycobacterium obuense was originally isolated from the sputum of a Japanese patient but it has also been isolated from soil samples and is not considered to be associated with any disease (Tsukamura and Mizuno 1971; Whitman et al. 2012). It has been described as a rod-shaped, rapidly growing bacterium that forms pigmented colonies. Recent data demonstrate that M. obuense too has the capacity, albeit limited, to reductively dechlorinate the insecticide methoxychlor that is used as an alternative to dichlorodiphenyltrichloro-ethane (DDT) (Masuda et al. 2012). Mycobacterium obuense as well as M. chubuense have been suggested as a stimulant of the immune system in bladder cancer therapeutics (Yuksel et al. 2011), and M. obuense is being evaluated in clinical trials for use in immunotherapy of several types of cancer (Fowler et al. 2011). Similar to M. chlorophenolicum and M. chubuense, M. obuense also has rod- andcoccoid-shaped cell morphologies (Saito et al. 1977). Moreover, both M. obuense and M. chubuense have been shown to be motile on agar surfaces (Agustí et al. 2008).

We decided to sequence the genomes of the type strains, M. chlorophenolicum DSM43826, M. chubuense DSM44219, and M. obuense DSM44075 (hereafter referred to as MchloDSM, MchuDSM, and MobuDSM, respectively), and to undertake comparative genomic analysis in order to understand how some of the characteristics of these genomes with respect to genome size, common and unique genes, horizontal gene transfer (HGT), and codon usage might be manifested as phenotypic differences. That these three Mycobacterium spp. do change their cell shape during cultivation (Saito et al. 1977; Häggblom et al. 1994) made such studies relevant to our interest in mechanisms of morphological changes seen in Mycobacterium spp. (Ghosh et al. 2009). We were also interested in understanding the evolutionary relationship between these Mycobacterium spp. and the Mycobacterium spp. NBB4 strain (hereafter referred to as MycNBB4), in particular the relation between this strain and the MchuDSM type strain. Here, we report a first analysis of their draft genomes. Interestingly, our data reveal that the MchuDSM type strain is phylogenetically closer to MchloDSM than it is to the MycNBB4 strain. This raises an important question about whether MchuDSM and MycNBB4 (also referred to as M. chubuense NBB4) belong to the same species. We also provide data showing the presence of putative genes encoding for oxygenases in all four species as well as for proteins involved in copper homeostasis in MchloDSM and in MycNBB4.

Materials and Methods

Strains

The M. chlorophenolicum DSM43826 (MchloDSM), M. chubuense DSM44219 (MchuDSM), and M. obuense DSM44075 (MobuDSM) type strains were obtained from the Deutsche Sammlung von Mikroorganismen und Zellkulturen in Germany and grown under conditions as recommended by the supplier.

Cultivation and DNA Isolation

Aliquots of the MchloDSM, MchuDSM, and MobuDSM type strains were taken from −80 °C stocks, plated on Middlebrook 7H10 media and incubated at 30 °C (MchloDSM) and 37 °C (MchuDSM and MobuDSM) under aerobic conditions. Extraction of DNA from these cultures and sequencing of the 16S rRNA genes after polymerase chain reaction amplification were consistent with the cultures being free from contaminations. Genomic DNA was isolated as previously described (Pettersson et al. 2014).

Genome Sequencing, Assembly, and Annotation

Whole-genome sequencing was performed at the SNP&SEQ Technology Platform of Uppsala University on a HiSeq2000 (Illumina) platform.

The genomes of the MchloDSM, MchuDSM, and MobuDSM type strains were sequenced at coverage of 157×, 482×, and 285×, respectively. These genomes were assembled with the A5 assembly pipeline (Tritt et al. 2012). The A5-pipeline included the quality filtering of the reads, error correction, scaffolding, and gap filling steps. Genome alignment and reordering of the scaffolds was done using the Mauve program (Darling et al. 2004) and plotted with the R-package genoPlotR (Guy et al. 2010). rRNA and tRNA genes were identified using the RNAmmer (Lagesen et al. 2007) and tRNAScan-SE (Lowe and Eddy 1997) programs, respectively.

To predict the presence of plasmids, scaffolds were aligned with the MycNBB4 complete genome (chromosome; acc no NC_018027.1). Scaffolds that did not align with the MycNBB4 genome were subjected to BLAST using the NCBI plasmid database. We considered that scaffolds originated from plasmids if more than 90% of the scaffold sequence aligned with the plasmid database. Prophage sequences were predicted using the PHAST server (Zhou et al. 2011).

Identification and annotation of coding sequences (CDS) was done using both the Prokka software (version 1.0.9) (Seemann 2014) and the RAST server (http://rast.nmpdr.org/, last accessed May 5, 2015; Aziz et al. 2008). Functional classification was done using the RAST subsystem classification that uses data both from “The Project to Annotate 1000 genomes” and a collection of protein families referred to as FIGfams. Finally, the listed CDS are those that were predicted by both the Prokka and the RAST server. This annotation program also predicted genes encoding transposases and insertion sequence (IS) elements.

Phylogenetic Analysis Based on Single and Multiple Genes

The sequences of the 16S ribosomal RNA genes (and rpoB, dprE1, and rnpB) were extracted from the draft MchloDSM, MchuDSM, and MobuDSM genomes and the publicly available MycNBB4 genome (acc no NC_018027.1). The homologous sequences of these genes present in other Mycobacterium spp. including MycNBB4 were downloaded from the NCBI database and aligned using the MAFFT (version 5) software (Katoh et al. 2005). Phylogenetic trees based on the multiple sequence alignment were computed using the FastTree software (Price et al. 2009) with 1,000 cycles of bootstrapping and the figures were generated with the FigTree software (http://tree.bio.ed.ac.uk/software/figtree/).

Average Nucleotide Identity and Core Gene Analysis

The average nucleotide identity (ANI) was calculated using the Jspecies tool (Richter and Rosselló-Móra 2009) based on the sequenced genomes to identify whether those belonged to the same species or not. Core gene analysis was performed on the translated protein sequences of all predicted CDS. Protein sequences were subjected to “all-versus-all” BLAST and homologous sequences (referred to as “core genes”) were identified using PanOct with an identity of 45% and query coverage of 65% (Fouts et al. 2012).

Horizontal Gene Transfer

Horizontally transferred genes were predicted based on BLAST best-hit approach using the newly available HGTector software, which follows a hybrid between “BLAST-based” and phylogenetic approaches (Zhu et al. 2014). This tool distributes the genes on the basis of the best BLAST hit into predefined hierarchical evolutionary categories: self, close, and distal based on NCBI taxonomy (as of July 2014). Genes that fall in the category “distal” are classified as putative HGT genes. We used the following stringent criteria: e-value set at <1e-100 for the BLAST hits, self = Mycobacterium (taxonomic_id 1763) and close = Actinomycetales (taxonomic_id 2037) groups (Zhu et al. 2014). Furthermore, common and unique putative horizontally transferred genes among the four genomes were identified using BLASTp with percentage identity of 45% and query coverage of 70%.

Codon Usage Analysis

Relative synonymous codon usage analysis was done on nucleotide sequence of all the predicted genes and HGT genes in MchloDSM, MchuDSM, MobuDSM, and MycNBB4 using the CodonW software (Peden 1999).

Accession Numbers

The genome sequences have been deposited at GenBank/DDBJ/EMBL under the following accession numbers: JYNL00000000 Mycobacterium chlorophenolicum DSM43826, JYNX00000000 Mycobacterium chubuense DSM44219, and JYNU00000000 Mycobacterium obuense DSM44075.

Results

Genome Assembly, Alignment, Annotation, and Overall Description

The draft genome sequences of the MchloDSM, MchuDSM, and MobuDSM type strains were based on 59, 95, and 55 scaffolds (fig. 1A; supplementary figs. S1A, C, E, and F, Supplementary Material online) and their genome sizes were calculated to be 6,925,482 (MchloDSM), 5,945,132 (MchuDSM), and 5,576,960 (MobuDSM) base pairs, respectively (fig. 1B). As expected for mycobacterial species the GC-contents were high ranging from 67.9% to 69.2% (fig. 1B; Goodfellow et al. 2012; see also below). The total number of predicted CDS was found to be highest in MchloDSM, which correlates with its larger genome size (fig. 1B). Recently, the complete genome of the MycNBB4 strain was released (acc no NC_018027.1). We therefore included the MycNBB4 genome data in our analysis. Comparison of MycNBB4 and MchuDSM suggested that their genome sizes differ: the complete MycNBB4 genome is approximately 0.4 Mb smaller than the MchuDSM genome.

Fig. 1.—

Fig. 1.—

Genome assemblies and annotations. (A) Barplots showing number of raw reads, read coverage, number of scaffolds, and assembly quality (N50) for the three genomes represented by different colors as indicated. (B) Bar plots represent genome size, GC-content in %, number of tRNA genes, number of CDS, number of rRNA operons and noncoding RNA (ncRNA) for the four genomes represented by different color codes. Bars marked with * indicate that these genomes contains one complete and one partial rRNA operon.

The presence of known plasmid sequences were identified in the MchloDSM and MobuDSM draft genomes while no sequence of plasmid origin could be detected for MchuDSM (see Materials and Methods). For MchloDSM, 454,038 bp in nine scaffolds were identified (supplementary fig. S1B, Supplementary Material online) and sequence alignment suggested that MchloDSM contains plasmid fragments of different origins similar to: 1) the MchuNBB4 pMYCCH.01 (acc. no NC_018022.1), 2) the M. gilvum Spyr1 plasmid (acc. no NC_014811.1), and 3) Mycobacterium smegmatis JS623 pMYCSM02 (acc. no NC_019958.1). For MobuDSM, 133,713 bp of plasmid origin located on three scaffolds were identified (supplementary fig. S1D, Supplementary Material online) and showed greater than 90% identity with sequences of pMKMS01 (acc no NC_008703.1), which is present in Mycobacterium spp. KMS. For MchloDSM, the plasmid fragments were predicted to carry 502 putative genes and 59% of these were annotated as hypothetical proteins. For functional annotation see supplementary table S1, Supplementary Material online.

One of the reasons of genome rearrangement is due to the presence of IS elements and the IS116/IS110/IS902-family (Moss et al. 1992; Kulakov et al. 1999) was identified to be present in the MchloDSM, MchuDSM, and MobuDSM genomes (the light brown diagonal lines in fig. 2A suggest genomic rearrangements involving IS elements). Moreover, in MchloDSM a total of 19 copies of genes encoding transposases were predicted and of these, seven were located on plasmid fragments. MchuDSM and Mobu carry fewer copies one and five, respectively. We emphasize that due to the presence of repeated sequences such as IS elements hinder the assembly of genomes into one single scaffold.

Fig. 2.—

Fig. 2.—

Whole genome and CDS alignment of the four genomes. (A) Whole genome alignment and (B) complete CDS alignment for the four Mycobacterium spp. as indicated. Each of the colored horizontal lines represents one genome and the vertical bars represent homologous regions. Light brown to dark vertical lines represent small to large homologous fragments and diagonal lines represent genomic rearrangements whereas red blocks below the black line which is connected with the blue diagonal lines mark inversions. (C) Gene synteny plot of the conserved small (marked with S; the large prophage sequences in MchloDSM is marked with L) prophage sequence predicted in the MchloDSM, MchuDSM, and MycNBB4. Black horizontal lines represent prophage sequences in the respective genomes. Blue and green arrows indicate predicted CDS of bacterial and phage origin, respectively. Vertical lines represent the attachment sites.

Prophage sequences including attachment sites were predicted in MchloDSM, MchuDSM, and MycNBB4 but not in MobuDSM. The MchloDSM genome carries two fragments, 25 and 17 kb covering 26 and 9 CDS, respectively (fig. 2A and supplementary fig. S1A, Supplementary Material online). The smaller fragment is conserved in MchuDSM and partially conserved in MycNBB4 and it was predicted to encode mainly phage proteins (fig. 2C). However, it also carries a gene encoding a protein belonging to the PE-PPE family of proteins, which are commonly present among mycobacteria such as M. tuberculosis (Cole et al. 1998). For genes predicted to be located on the large prophage fragment in MchloDSM see supplementary table S2, Supplementary Material online.

One complete and one partial ribosomal operon were identified in the three draft genomes whereas in the complete MycNBB4 genome two complete ribosomal RNA operons are present. We did, however, detect the presence of partial sequences that corresponded to rRNA operons (including two genes encoding 5S rRNA) in all the three draft genomes. Moreover, for all three genomes the average read depth of the genomic region carrying ribosomal RNA genes was 2-fold higher compared with the rest of the scaffold (supplementary fig. S2, Supplementary Material online). Together this suggested that MchloDSM, MchuDSM, and MobuDSM also have two complete ribosomal RNA operons in keeping with what is known for other rapidly growing Mycobacterium spp. (Ji et al. 1994); however see also (Stadthagen-Gomez et al. 2008).

The transfer RNA genes were annotated using the tRNAScan-SE (Lowe and Eddy 1997) and the numbers are shown in figure 1B. We identified 47 tRNA genes in MchloDSM, MchuDSM, and MobuDSM suggesting that the numbers of functional tRNA isoacceptors were the same in these three species. In MchloDSM, we also detected a tRNA-CGA pseudogene. The different tRNA genes are scattered around the chromosomes at roughly the same positions relative to the “oriC” in all four species (supplementary figs. S1A, C, E, and F, Supplementary Material online; the position of oriC is inferred from the position of dnaA [and dnaN] and rpmH [Gao et al. 2013 and references therein]; note that in MobuDSM the positioning of rpmH relative to dnaA is altered compared with the other three strains). Also comparing the positioning of the different tRNA isoacceptor genes revealed that the same isoacceptor genes cluster in a similar way in the four strains with just a few exceptions (supplementary figs. S1A, C, E, and F, Supplementary Material online). The complete list of tRNAs identified are shown in supplementary table S3, Supplementary Material online. Comparison with MycNBB4, however, indicated that one tRNA gene was missing in this strain. Interestingly, this corresponded to a gene encoding a tRNACys isoacceptor. (Note: many bacteria only have one gene encoding tRNACys [http://trna.bioinf.uni-leipzig.de]; see Discussion). All the other three strains have two genes encoding cysteine tRNA, cysT and cysU (cysU is marked with a * on the MchloDSM, MchuDSM, and MobuDSM chromosomes; supplementary fig. S1A, C, and E, Supplementary Material online). The cysU gene is located near the tRNALeu(CAA) isoacceptor gene in all three species between genes encoding an arabinose efflux permease family protein and a small multidrug resistance protein. Analysis of the gene synteny covering this region in all four strains revealed that in MycNBB4 cysU is missing at this location, possibly due to a deletion event (supplementary fig. S3A, Supplementary Material online). Moreover, sequence alignment of cysT and cysU revealed differences in: the amino acid acceptor-stem, the D-stem/ loop, the anticodon stem, the variable loop, and the T-loop. This might indicate possible differences in the amino acid charging of these two tRNACys isoacceptors. We also noted that the 3′-terminal CA sequence was not encoded in cysU suggesting that formation of the 3′CCA termini occurred posttranscriptionally possibly involving the enzyme nucleotidyl transferase (supplementary fig. S3B, Supplementary Material online; Martin et al. 2008). In this context, we would also like to emphasize that tRNA genes are implicated to be targets for integration of foreign DNA, for example, pathogenicity islands (for ref. see e.g., Hacker and Kaper 2000; Juhas et al. 2009).

Whole Genome Alignment Revealed Homologous and Unique Genomic Regions

Whole genome (excluding the plasmid fragments) and CDS alignments of the newly assembled genomes were generated using Mauve (see Materials and Methods; fig. 2). As apparent from figure 2A MchloDSM harbors the highest number of unique regions and genes (949; see also below), which was expected since its genome size was larger compared with the other genomes (fig. 1B). Similar patterns were also observed by whole CDS alignment (see fig. 2B). The genome wide ANI for the MchloDSM and MchuDSM type strains was higher (95.7%) compared with the ANI for MchuDSM and MycNBB4 (85%; supplementary fig. S4, Supplementary Material online). In fact, the ANI values comparing MchuDSM with MycNBB4 (85%) were very similar to the values comparing MchuDSM with MobuDSM (85.5%) (we emphasize that comparing MobuDSM and MycNBB4 resulted in an ANI value of 84.5%). Moreover, the MobuDSM showed low and similar ANI values when compared with MchuDSM and MchloDSM, 85.5% and 85.7%, respectively. The genomes were clustered based on the ANI values using hierarchical clustering and the result indicated that MchuDSM is closer to MchloDSM than it is to MycNBB4 (supplementary fig. S4, Supplementary Material online; see also below).

Mycobacterium chlorophenolicum DSM43826 and M. chubuense DSM44219 Show High Numbers of Homologous Genes

Homologous and nonhomologous chromosomal genes were identified among the four mycobacterial strains using “BLASTp” with 45% identity and 70% query coverage and e-value 1e-05 cut offs. Relative to the MchuDSM strain pairwise comparison of genes suggested that 90% of its genes are homologous to genes present in MchloDSM (supplementary fig. S5, Supplementary Material online; nMchuMchlo/ nMchu, where nMchuMchlo = 4,885 and nMchu = 5,421). In contrast, only 78% (nMchuMycNBB4/ nMycNBB4, where nMchuMycNBB4 = 3,887 and nMycNBB4 = 4,973; fig. 3 and supplementary fig. S5, Supplementary Material online) of the MycNBB4 genes have homologs that are present in MchuDSM. Interestingly, 80% of the MycNBB4 genes are homologous with genes present in the MchloDSM type strain and 70% with those in MobuDSM. Moreover, comparative analysis of the homologous genes in all the four Mycobacterium spp. suggested that 3,254 homologs are present in all four strains. These genes are referred to as core genes (fig. 3).

Fig. 3.—

Fig. 3.—

Venn diagram—presence of homologous and nonhomologous genes. The Venn diagram represents homologous and nonhomologous genes present in MchloDSM, MchuDSM, MobuDSM, and MycNBB4. The Venn diagram was generated as outlined in Materials and Methods and the different mycobacterial strains are color coded as indicated.

MchloDSM has the highest number of unique genes (n = 949) among the four mycobacterial strains. The number of unique genes in the MchuDSM and MobuDSM type strains are lower 344 and 800, respectively. In contrast, the MycNBB4 strain has a fairly large number of unique genes, almost 2-fold higher than that of the MchuDSM type strain, even though its genome size is the smallest of these four mycobacterial strains (fig. 3). Together this again indicates that the MchuDSM type strain is more distantly related to MycNBB4 than it is to MchloDSM.

Phylogenetic Analysis

We performed phylogenetic analysis of these four mycobacterial strains using a set of genes to understand their evolutionary positions with respect to other Mycobacterium spp. (see Materials and Methods). The genes selected for this analysis were: 1) the 16S rRNA and rpoB genes, which have been used extensively in phylogenetic analysis and 2) the rnpB and dprE1 genes, which have been used to a lesser extent but have been demonstrated to discriminate between different mycobacterial species as well as the 16S rRNA and rpoB genes, if not better (Incandela et al. 2013; Herrmann et al. 2014). Protein sequences were used for the analysis with rpoB and dprE1 whereas the DNA sequences were used for the other two. In addition, we used both the 3,254 core genes (protein sequence) that are present in all four mycobacterial strains (see above) and the 671 Mycobacterium core genes that corresponds to homologous genes present in all available Mycobacterium spp. deduced from complete genome sequences as indicated in figure 4 and supplementary figure S6, Supplementary Material online (see also supplementary table S4, Supplementary Material online).

Fig. 4.—

Fig. 4.—

Phylogenetic analysis. Phylogenetic trees were generated based on (A) 16S rRNA complete gene sequences and (B) core genes in Mycobacterium spp., for details see main text. (C) A phylogenetic tree where we used the 3,268 homologous genes that were identified to be present in the MchloDSM, MchuDSM, MobuDSM, and MycNBB4 genomes as indicated. Bootstrap values in percentage are shown at the common nodes.

The different phylogenetic trees were consistent with our current understanding of mycobacterial phylogeny. The results suggested that MchloDSM and MchuDSM are the closest neighbors whereas the MycNBB4 and MobuDSM strains are more distantly related to both MchloDSM and MchuDSM (fig. 4 and supplementary fig. S6, Supplementary Material online). This relationship also corroborate with the genomic distances derived from ANI values discussed above and raises the possibility that MchuDSM (which belongs to the M. sphagni clade [Whitman et al. 2012]) and MycNBB4 might belong to different clades.

Functional Classification of Common and Unique Genes

Functional classification of annotated genes was done using RAST subsystem classification for each genome as outlined in Materials and Methods. Number of genes in different functional/ subsystem categories was similar in all four mycobacterial strains with the exception of the two categories “photosynthesis” and “metabolism of aromatic compounds” (fig. 5A). In a preliminary analysis, 12 genes were annotated in the first category photosynthesis in both MchloDSM and MchuDSM whereas none was found either in MycNBB4 or in MobuDSM. The 12 genes included multiple copies of genes encoding octaprenyl diphosphate synthase, phytoene dehydrogenase, beta-carotene ketolase, and single copies of genes encoding proteorhodopsin, phytoene synthase, and lycopene beta cyclase. However, in depth analysis based on sequence similarity (see Materials and Methods) revealed that among these 12 genes only the proteorhodopsin gene, which is important for green light absorption, is unique in MchloDSM and MchuDSM. Both proteorhodopsin genes contain a domain which is 90% identical compared with bac_rhodopsin (bacterial rhodopsin like proteins, AccCdd:smart01021) at the protein level. Genes homologous to the other 11 genes in this category were identified to be present in all these four mycobacterial strains. Moreover, analysis of the gene synteny suggested that the gene encoding proteorhodopsin is missing in MycNBB4 and MobuDSM (fig. 6F; see also Discussion).

Fig. 5.—

Fig. 5.—

Functional classifications of total and unique genes. Bar plots representing functional classifications of genes in different categories: (A) Total predicted genes, (B) subclassification of genes that belong to the category metabolism of aromatic compounds, (C) number of genes encoding mono- and dioxygenases in the four Mycobacterium spp. as indicated, (D) unique genes in the four mycobacterial strains (MchloDSM, MchuDSM, MobuDSM, and MycNBB4), and (E) subclassification of the unique genes that belong to the category “virulence, disease, and defense.” Different colors represent different genomes as indicated. In (A) and (D) the x axis represents percentage of total genes whereas in (B), (C), and (E) the x axis corresponds to the number of genes.

Fig. 6.—

Fig. 6.—

Identification and characterization of horizontally transferred genes. (A) Horizontal bar plots showing number of HGT-genes identified in the MchloDSM, MchuDSM, MobuDSM, and MycNBB4 genomes. x axis represents number of HGT-genes and y axis shows the four genomes as indicated. (B) Venn diagram showing common and unique HGT-genes. (C) Box plot showing percentage GC-content of the HGT-genes in the MchloDSM, MchuDSM, MobuDSM, and MycNBB4 genomes. y axis represents percentage GC-content. The horizontal lines represent the first (25%), second (50%), third (75%), and forth (100%) quartiles. The thick horizontal line in the middle of each colored box represents the median value and filled squares are the outliers. (D) Heat map showing the probable source of the HGT-genes (see also supplementary fig. S8A, Supplementary Material online). Color code: dark brown refers to high while light colors to fewer numbers of genes. (E) Functional classification of the HGT-genes using subsystem classifications. x axis represents number of the HGT-genes in percentage. (F) Gene synteny plot of upstream and downstream of the photosynthetic gene encoding the homologous protein proteorhodopsin (see also supplementary fig. S8C, Supplementary Material online). The left column represents the mycobacterial strain, the locus tag of the first and last genes in the gene synteny plot is represented by the prefix “MCHLDSM_,” “MCHUDSM_,” and “Mycch_” for MchloDSM, MchuDSM, and MycNBB4, respectively.

With respect to the category metabolism of aromatic compounds a near 2-fold higher number of genes were identified in MchloDSM relative to the other mycobacterial strains. Irrespective of strain these genes encode proteins involved in the degradation of a number of different organic compounds. Here, the most apparent difference among the four strains is that the MchloDSM type strain harbors several genes encoding proteins that are involved in the central meta-cleavage pathway of aromatic compound degradation (fig. 5B). For a compilation of the genes in this category see supplementary table S5, Supplementary Material online. An important class of enzymes in this context is the mono- and dioxygenases (Bugg 2003; Fuchs et al. 2011) and MchloDSM has the highest number of genes encoding this class of enzymes (fig. 5C; see also the Discussion and supplementary fig. S7, Supplementary Material online). These genes were distributed around the MchloDSM chromosome and this is also the case for the other three Mycobacterium spp. (supplementary fig. S1, Supplementary Material online).

Next, we did a functional classification of the unique genes found in all four strains (fig. 5B). More than 22% of the total number of nonhomologous genes in the MycNBB4 strain was classified in the category “fatty acids, lipids, and isoprenoids.” For the other three species this value was lower; in particular for MchloDSM in which less than 5% of the unique genes belonging to this category. Moreover, the MchuDSM strain has only one gene in the functional category “stress response” whereas the others contain several, for example 5% in MycNBB4. Detailed analysis suggested that the unique stress response genes in MchloDSM are related to oxidative stress whereas in MycNBB4 and MobuDSM they are involved in both oxidative and osmotic stress response (not shown). It should also be noted that none of the unique genes in MchuDSM belongs to the “membrane transport” category, which is not the case for the other strains.

Although these mycobacteria are nonpathogenic it is interesting that all four strains were predicted to have several genes in the “virulence, disease, and defense” category (see also Discussion). Here, MchloDSM is suggested to have the highest number of unique genes in this category whereas MchuDSM the lowest. The majority of these genes were identified as homologs of mce (mammalian cell entry) genes, a class of genes encoding proteins involved in mycobacterial cell invasion and virulence (for a review see [Zhang and Xie 2011]). Moreover, genes encoding proteins with a putative role in copper homeostasis were identified. This was particularly apparent for MchloDSM in which 28 putative genes were detected on the chromosome (fig. 5D and E; one was predicted to be located on a plasmid fragment, see supplementary table S1, Supplementary Material online) and many of these genes are clustered near the oriC on the MchloDSM chromosome (supplementary fig. S1A, Supplementary Material online).

Identification of Horizontally Transferred Genes

Next, we predicted the number of horizontally transferred genes on the chromosome in each of the four mycobacterial strains following the criteria as outlined in Materials and Methods (genes carried on plasmids might also be classified as HGT-genes but these were not included here). As shown in figure 6A our data suggest that depending on species the number of horizontally transferred genes vary between 220 and 262. Of the predicted HGT-genes 81 were common while the highest number of unique HGT-genes were detected in MobuDSM (fig. 6B). Analysis of the GC-content of the HGT-genes in all four strains showed that roughly 25% of the HGT-genes have a lower GC-content compared with that of the total genome for the respective strain consistent with these genes being HGT-genes (figs. 1B and 6C). We also calculated the codon usage for the predicted HGT-genes and compared it with the codon usage for all CDS (supplementary fig. S9, Supplementary Material online). Here, we detected variations in the codon usage frequencies of translational stop codons irrespective of strain, in particular UAG and UGA. (Note that UGA also codes for selenocysteine in many bacteria [http://trna.bioinf.uni-leipzig.de] and this is also the case for the four strains studied here [supplementary table S3, Supplementary Material online]. However, incorporation of selenocysteine at UGA codons depends on a specific selenocysteine insertion sequence downstream of UGA [Thanbichler and Böck 2001].) Moreover, relative to the other strains MobuDSM appeared to differ most in codon frequency usage comparing predicted HGT-genes and total CDS (supplementary fig. S9, Supplementary Material online). The possible origins of the predicted HGT-genes in the four mycobacterial strains were then identified on the basis on BLAST best hits. As shown in figure 6D the results suggested that these HGT-genes might have originated from a large number of bacterial species that belong mainly to the groups Rhizobiales, Pseudomonadales, Burkholderiales, Solirubrobacteriales, and Bacillales.

Functional classification suggested that the HGT-genes belong to four main categories (including metabolism and degradation for the different categories): 1) amino acid and derivatives; 2) carbohydrates; 3) cofactors, vitamins, prosthetic groups, and pigments; and 4) fatty acids, lipids, and isoprenoids (fig. 6E). Interestingly, the genomic region harboring the proteorhodopsin gene was predicted to be horizontally transferred in MchloDSM (fig. 6F). This finding is consistent with that these genes were identified as unique in this species (see above). Moreover, as indicated in supplementary figure S8B, Supplementary Material online, the proteorhodopsin gene has also been identified in other Actinobacteria. Comparing the gene synteny for those and MchloDSM and MchuDSM indicated that within this region it is only the proteorhodopsin gene that is common between these Actinobacteria (supplementary fig. S8C, Supplementary Material online).

Classification of Genes Encoding Small RNAs and Regulatory RNA Motifs

Like other bacteria MchloDSM, MchuDSM, MobuDSM, and MycNBB4 do also encode ncRNAs (fig. 1B). These ncRNAs were classified and found to belong to different categories based on Rfam (12.0) annotation (fig. 7; supplementary table S6, Supplementary Material online [see also Arnvig and Young 2012]); 1) small RNAs, 2) antisense RNAs, 3) gene; ribozyme, 4) intron, 5) cis-regulatory riboswitches, 6) cis-regulatory thermoregulators, 7) cIS-reg RNAs, and 8) gene. Moreover, compared with the other strains MchuDSM lacks several ncRNAs while MchloDSM contains several copies of in particular Ms_IGR7. The analysis also revealed unique ncRNAs in MchloDSM, MobuDSM, and MycNBB4 whereas none was identified in MchuDSM.

Fig. 7.—

Fig. 7.—

Functional classification of noncoding RNA (ncRNAs). Classification of noncoding RNAs in the MchloDSM, MchuDSM, MobuDSM, and MycNBB4 strains, for details see the main text.

Discussion

We present the genome structure and functional correlation of three mycobacterial species considered to be closely related phylogenetically: the MchloDSM, MchuDSM, and MobuDSM type strains. Isolates of two of them, Mchlo and Mobu, show biodegrading properties (Whitman et al. 2012; Satsuma and Masuda 2012; see Introduction). For comparison we included the available complete genome of the Mycobacterium strain MycNBB4, which has also been referred to as the M. chubuense NBB4 strain. This strain was isolated from the environment on the basis of the presence of genes encoding soluble di-iron monooxygenases. It grows in mineral salt media with ethylene as the sole carbon source and 16S rDNA sequencing positioned MycNBB4 (99% sequence identity) close to M. chubuense and Mycobacterium wolinskyi (Coleman et al. 2006, 2011; Martin et al. 2014). Phylogenetic analysis based on 16S rDNA, rpoB, dprE1, and rnpB as well as 3,254 core genes present in all four strains and 671 genes present in Mycobacterium spp., for which complete genome sequences are available, suggested that MycNBB4 may not be a M. chubuense strain. In fact, our data show that the MchuDSM type strain is closer to MchloDSM than it is to MycNBB4. (Based on comparative analysis of the complete sequence of the 16S rDNA including M. wolinskyi it is likely that MycNBB4 is not a M. wolinskyi strain [not shown]). Thus, on the basis of our phylogenetic data we suggest that MycNBB4 should be considered as a separate species. Because it was originally isolated on ethylene (Coleman et al. 2006) a suitable name to consider would be Mycobacterium ethylenense NBB4. With respect to bacterial speciation we emphasize the difficulties and that high rDNA sequence identity can be misleading because it does not give the total genomic picture (Fraser et al. 2009; Wiedenbeck and Cohan 2011; Rahman et al. 2015).

In this context, we also note that a draft genome of the biodegrading Mycobacterium rufum strain JS14 has just been made available (acc no JROA00000000). Phylogeny based on complete 16S rDNA sequences suggested that this Mycobacterium spp. is close to MchloDSM, sharing 99% identity. However, phylogenetic trees generated using rpoB, rnpB, and dprE1 (not shown) as well as core (3111) genes present in M. rufum strain JS14, MchloDSM, MchuDSM, MobuDSM, and MycNBB4 suggested that MchuDSM is closer to MchloDSM than M. rufum strain JS14 (see supplementary fig. S6F, Supplementary Material online). Together this emphasizes the importance of using more than one gene for doing phylogenetic analysis (see above). Note also that the genome of M. rufum strain JS14 is considerably smaller than that of MchloDSM, 6.2 versus 6.9 Mb, respectively.

Bioremediation of organic and aromatic pollutants using bacteria such as MchloDSM and MycNBB4 has attracted much attention. However, plants are the main producers of aromatic compounds in nature but they lack degradation pathways. Microorganisms including those that we studied here play key roles in recycling of carbon derived from aromatic ring structures (Fuchs et al. 2011). Hence, identification of genes encoding enzymes involved in degrading aromatic and organic compounds in various microorganisms is of importance. In this context, both mono- and dioxygenases are key enzymes because they catalyze the oxidation of hydrocarbons by incorporating one and two oxygens, respectively, into the product (Boyd et al. 2003; Bugg 2003). Genes encoding both mono- and dioxygenases were previously identified in MycNBB4 (see e.g., Coleman et al. 2006; Martin et al. 2014 and references therein; see also the annotation of the available MycNBB4 genome). Genes encoding for oxygenases were also identified in the MchloDSM, MchuDSM, and MobuDSM genomes. For the four strains the number of genes encoding mono-oxygenases exceed the number of dioxygenase encoding genes (fig. 5C). Mono- and dioxygenase genes are also found in other Mycobacterium spp. such as M. smegmatis, Mycobacterium vanbaalenii as well as in species that are phylogenetically closely related to these on the basis of 16S rDNA (supplementary figs. S6 and S7, Supplementary Material online). In all these species the total number of oxygenase genes exceeds the number of genes in the four strains studied in this report. However, there is no correlation between the number of genes and genome size. Hence, as we do not have any explanation to this we can only speculate. The mycobacterial species with high number of oxygenase genes might have evolved in environments where these genes provide a growth advantage. In this context we also note that 16S rDNA from the ethene-degrading Mycobacterium spp. JS60 show 99% sequence identity compared with that of MycNBB4 and MobuDSM. However, a deeper analysis of the JS60 strain such as the one done in this report is needed to understand its position in the phylogenetic tree relative to MycNBB4 and MobuDSM. We also would like to emphasize that Nocardia farcinica, which like Mycobacterium belongs to the order Actinomycetales, carries a gene (nfa35380 or rox) encoding a monooxygenase that renders the bacteria resistant to the front line TB drug rifampicin (Hoshino et al. 2010; Goldstein 2014). Homologs to nfa35380 are present in several Mycobacterium species (data not shown). Whether the presence and expression of these homologs also result in rifampicin resistance remains to be determined.

Copper (Cu) is required in different biological processes. However, in excess, Cu is toxic (Prohaska and Lukasewycz 1981; Chillappagari et al. 2010; Grass et al. 2011). Recent data suggest that Cu also has some positive effects. Its concentration increases in macrophages upon bacterial infection and higher levels of Cu kills invading bacteria, including M. tuberculosis. In order to counter this, many bacteria are equipped with genes encoding enzymes such as multicopper oxidase that allow them to grow inside macrophages (Wolschendorf et al. 2011; Rowland and Niederweis 2013). Add to this the importance of copper-transporting ATPases for copper detoxification in bacteria (White et al. 2009; Shafeeq et al. 2011; Wakeman and Skaar 2012). For M. tuberculosis data suggest that copper resistance plays an important role for virulence and CtpV is a copper exporting ATPase needed for full virulence (Ward et al. 2010; Wolschendorf et al. 2011; Wakeman and Skaar 2012; Fu et al. 2014). Hence, it is interesting that 28 genes in the nonpathogenic MchloDSM were annotated on the chromosome (and one on a plasmid fragment) to encode proteins possibly involved in copper homeostasis and 7 of these were predicted to be horizontally transferred (fig. 5E; not shown). The genes were identified as putative copper ATP transporters, copper resistance protein D, multicopper oxidase, and copper chaperone encoding genes (supplementary fig. S1A, Supplementary Material online). On the basis of our phylogenetic analysis these 29 genes are suggested to have been acquired after MchloDSM and MchuDSM diverged. Perhaps this reflects that in the environment where MchloDSM grows, the presence of these genes confers some selective advantage. For example, such genes would allow growth and survival inside amoebas that engulf bacteria including Mycobacterium spp. through phagocytosis (Adékambi and Drancourt 2004; Ben Salah and Drancourt 2010). Nonetheless, we envision that our findings will be useful to understand the evolution of genes involved in copper homeostasis and possibly also with respect to the evolution of efflux pumps and multidrug resistance (Martinez et al. 2009).

The four Mycobacterium spp. studied here were isolated from water sediment and soil (see Introduction). In accordance with this many of the predicted horizontally transferred genes are homologous to genes present in other soil and aquatic living bacteria (fig. 6D). Interestingly, in both MchloDSM and MchuDSM we identified homologs to genes encoding proteorhodopsin, light absorbing proteins that act as proton pumps. This gene is also present in M. rufum JS14 (acc no JROA01000000) and in some other Actinobacteria (see also Sharma et al. 2006). The gene synteny in both MchloDSM and M. rufum JS14 is similar whereas it is different compared with the other Actinobacteria (supplementary fig. S8B and C, Supplementary Material online). Proteorhodopsin is the most abundant member of the rhodopsin family and microbial rhodopsin is divided into two categories, transporters and receptors (Sharma et al. 2006). It is found primarily among bacteria living in aquatic environments (Bamann et al. 2014). The proteorhodopsin gene shows high lateral genetic mobility among bacterioplanktons (Frigaard et al. 2006), which is consistent with our data identifying this gene as one of the predicted horizontally transferred genes in two of the mycobacterial strains, MchloDSM and MchuDSM. Interestingly, the proteorhodopsin gene is positioned close to tRNA genes, which are known to act as targets for integration of foreign DNA into bacterial chromosomes (see e.g., Hacker and Kaper 2000; Juhas et al. 2009). Moreover, it has been suggested that proteorhodopsin influences the response to stress in bacteria (DeLong and Béjà 2010). Whether this also applies to Mycobacterium spp. encoding proteorhodopsin and which function rhodopsin has in these mycobacterial species remains to be seen.

Most bacteria only have one gene encoding tRNACys (http://trna.bioinf.uni-leipzig.de). However, our analysis showed that MchloDSM, MchuDSM, and MobuDSM carry an additional tRNACys gene, cysU (supplementary fig. S3, Supplementary Material online). Further analysis revealed that many of the rapidly growing Mycobacterium spp. and some Rhodococcus spp. such as Rhodococcus erythropolis (but not Rhodococcus equi) are also having two tRNACys genes (not shown). Putative cysU orthologs were also detected in several mycobacterial bacteriophages and in different proteobacteria (not shown). The significance of these findings and why MchloDSM, MchuDSM, and MobuDSM (and other bacteria) have two tRNACys genes is at present not known but it might give a growth advantage. This needs further investigation.

Finally, in the MobuDSM genome we predicted the presence of regions encoding two hammerhead ribozyme structures of class II and III (Birikh et al. 1997; Hammann et al. 2012) similar to that seen in the M. vanbaalenii genome (supplementary fig. S10, Supplementary Material online). The putative class II ribozyme is located just at the 3′-end of a gene encoding the alpha subunit of ribonucleotide-diphosphate reductase whereas the other putative hammerhead ribozyme gene is close and positioned at the 3′-end of the neighboring gene, a gene encoding a transcriptional regulator. We also noted that regions encoding putative hammerheads, HH-II and HH-III, are also present in Mycobacterium intracellulare but at another location (data not shown). On the basis of the secondary structures of the corresponding hammerhead RNAs we believe that the class III hammerhead might indeed be able to act as a ribozyme. However, this remains to be tested as well as to investigate the biological function of these RNA structures.

Supplementary Material

Supplementary tables S1–S6 and figures S1–S10 are available at Genome Biology and Evolution online (http://www.gbe.oxfordjournals.org/).

Supplementary Data

Acknowledgments

The authors thank Dr S. Andersson for critical reading of the manuscript and Dr W.H. McClain for discussions. This work was funded by the Swedish Research Council (M), SIDA/SAREC, the Swedish Research Council-SIDA, the Swedish Research Council for Environment, Agricultural Sciences, and Spatial Planning (FORMAS), and Uppsala RNA Research Center (Swedish Research Council Linneus support). A. Bhattacharya acknowledges the Department of Biotechnology, India. Sequencing was performed by the SNP&SEQ Technology Platform in Uppsala. The platform is part of Science for Life Laboratory at Uppsala University and supported as a national infrastructure by the Swedish Research Council. The computations were performed on resources provided by SNIC through Uppsala Multidisciplinary Center for Advanced Computational Science (UPPMAX) under Project b2011072. L.A.K. is on the board of directors of Bioimics AB. L.A.K. and S.D. are holders of Swedish patent application number PCT/SE2008/051486.

Literature Cited

  1. Adékambi T, Drancourt M. 2004. Dissection of phylogenetic relationships among 19 rapidly growing Mycobacterium species by 16S rRNA, hsp65, sodA, recA and rpoB gene sequencing. Int J Syst Evol Microbiol. 54:2095–2105. [DOI] [PubMed] [Google Scholar]
  2. Agustí G, Astola O, Rodríguez-Güell E, Julián E, Luquin M. 2008. Surface spreading motility shown by a group of phylogenetically related, rapidly growing pigmented mycobacteria suggests that motility is a common property of mycobacterial species but is restricted to smooth colonies. J Bacteriol. 190:6894–6902. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Apajalahti JH, Salkinoja-Salonen MS. 1987. Dechlorination and para-hydroxylation of polychlorinated phenols by Rhodococcus chlorophenolicus. J Bacteriol. 169:675–681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Arnvig K, Young D. 2012. Non-coding RNA and its potential role in Mycobacterium tuberculosis pathogenesis. RNA Biol. 9:427–436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Aziz RK, et al. 2008. The RAST Server: rapid annotations using subsystems technology. BMC Genomics 9:75. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bamann C, Bamberg E, Wachtveitl J, Glaubitz C. 2014. Proteorhodopsin. Biochim Biophys Acta. 1837:614–625. [DOI] [PubMed] [Google Scholar]
  7. Ben Salah I, Drancourt M. 2010. Surviving within the amoebal exocyst: the Mycobacterium avium complex paradigm. BMC Microbiol. 10:99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Birikh KR, Heaton PA, Eckstein F. 1997. The structure, function and application of the hammerhead ribozyme. Eur J Biochem. 245:1–16. [DOI] [PubMed] [Google Scholar]
  9. Boyd DR, et al. 2003. Stereochemical and mechanistic aspects of dioxygenase-catalysed benzylic hydroxylation of indene and chromane substrates. Org Biomol Chem. 1:1298–1307. [DOI] [PubMed] [Google Scholar]
  10. Briglia M, Eggen RI, Van Elsas DJ, De Vos WM. 1994. Phylogenetic evidence for transfer of pentachlorophenol-mineralizing Rhodococcus chlorophenolicus PCP-I(T) to the genus Mycobacterium. Int J Syst Bacteriol. 44:494–498. [DOI] [PubMed] [Google Scholar]
  11. Bugg TDH. 2003. Dioxygenase enzymes: catalytic mechanisms and chemical models. Tetrahedron 59:7075-7101. [Google Scholar]
  12. Chillappagari S, et al. 2010. Copper stress affects iron homeostasis by destabilizing iron-sulfur cluster formation in Bacillus subtilis. J Bacteriol. 192:2512–2524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Cole ST, et al. 1998. Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature 393:537-544. [DOI] [PubMed] [Google Scholar]
  14. Coleman NV, Bui NB, Holmes AJ. 2006. Soluble di-iron monooxygenase gene diversity in soils, sediments and ethene enrichments. Environ Microbiol. 8:1228–1239. [DOI] [PubMed] [Google Scholar]
  15. Coleman NV, et al. 2011. Untangling the multiple monooxygenases of Mycobacterium chubuense strain NBB4, a versatile hydrocarbon degrader. Environ Microbiol Rep. 3:297–307. [DOI] [PubMed] [Google Scholar]
  16. Darling ACE, Mau B, Blattner FR, Perna NT. 2004. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 14:1394–1403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. DeLong EF, Béjà O. 2010. The light-driven proton pump proteorhodopsin enhances bacterial survival during tough times. PLoS Biol. 8:e1000359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Falkinham JO., 3rd 2009. Surrounded by mycobacteria: nontuberculous mycobacteria in human environment. J Appl Microbiol. 107:356–367. [DOI] [PubMed] [Google Scholar]
  19. Falkinham JO., 3rd 2015. Environmental sources of nontuberculous mycobacteria. Clin Chest Med. 36:35–41. [DOI] [PubMed] [Google Scholar]
  20. Fouts DE, Brinkac L, Beck E, Inman J, Sutton G. 2012. PanOCT: automated clustering of orthologs using conserved gene neighborhood for pan-genomic analysis of bacterial strains and closely related species. Nucleic Acids Res. 40:e172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Fowler DM, Araya CL, Gerard W, Fields S. 2011. Enrich: software for analysis of protein function by enrichment and depletion of variants. Bioinformatics 27:3430–3431. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Fraser C, Alm EJ, Polz MF, Spratt BG, Hanage WP. 2009. The bacterial species challenge: making sense of genetic and ecological diversity. Science 323:741–746. [DOI] [PubMed] [Google Scholar]
  23. Frigaard N-U, Martinez A, Mincer TJ, DeLong EF. 2006. Proteorhodopsin lateral gene transfer between marine planktonic Bacteria and Archaea. Nature 439:847–850. [DOI] [PubMed] [Google Scholar]
  24. Fu Y, Chang F-MJ, Giedroc DP. 2014. Copper transport and trafficking at the host-bacterial pathogen interface. Acc Chem Res. 47:3605–3613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Fuchs G, Boll M, Heider J. 2011. Microbial degradation of aromatic compounds—from one strategy to four. Nat Rev Microbiol. 9:803–816. [DOI] [PubMed] [Google Scholar]
  26. Gao F, Luo H, Zhang C-T. 2013. DoriC 5.0: an updated database of oriC regions in both bacterial and archaeal genomes. Nucleic Acids Res. 41:D90–D93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. George KW, Häggblom MM. 2008. Microbial O-methylation of the flame retardant tetrabromobisphenol-A. Environ Sci Technol. 42:5555–5561. [DOI] [PubMed] [Google Scholar]
  28. Ghosh J, et al. 2009. Sporulation in mycobacteria. Proc Natl Acad Sci U S A. 106:10781–10786. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Goldstein BP. 2014. Resistance to rifampicin: a review. J Antibiot. 67:625–630. [DOI] [PubMed] [Google Scholar]
  30. Grass G, Rensing C, Solioz M. 2011. Metallic copper as an antimicrobial surface. Appl Environ Microbiol. 77:1541–1547. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Guy L, Kultima JR, Andersson SGE. 2010. genoPlotR: comparative gene and genome visualization in R. Bioinformatics 26:2334–2335. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Hacker J, Kaper JB. 2000. Pathogenicity islands and the evolution of microbes. Annu Rev Microbiol. 54:641–679. [DOI] [PubMed] [Google Scholar]
  33. Häggblom MM, et al. 1994. Transfer of polychlorophenol-degrading Rhodococcus chlorophenolicus (Apajalahti et al. 1986) to the genus Mycobacterium as Mycobacterium chlorophenolicum comb. nov. Int J Syst Bacteriol. 44:485–493. [DOI] [PubMed] [Google Scholar]
  34. Hammann C, Luptak A, Perreault J, de la Peña M. 2012. The ubiquitous hammerhead ribozyme. RNA 18:871–885. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Herrmann B, et al. 2014. Differentiation and Phylogenetic Relationships in Mycobacterium spp with Special Reference to the RNase P RNA Gene rnpB. Curr Microbiol. 69:634–639. [DOI] [PubMed] [Google Scholar]
  36. Hoshino Y, et al. 2010. Monooxygenation of rifampicin catalyzed by the rox gene product of Nocardia farcinica: structure elucidation, gene identification and role in drug resistance. J Antibiot. 63:23–28. [DOI] [PubMed] [Google Scholar]
  37. Incandela ML, et al. 2013. DprE1, a new taxonomic marker in mycobacteria. FEMS Microbiol Lett. 348:66–73. [DOI] [PubMed] [Google Scholar]
  38. Ji YE, Colston MJ, Cox RA. 1994. The ribosomal RNA (rrn) operons of fast-growing mycobacteria: primary and secondary structures and their relation to rrn operons of pathogenic slow-growers. Microbiology 140 (Pt 1):2829–2840. [DOI] [PubMed] [Google Scholar]
  39. Juhas M, et al. 2009. Genomic islands: tools of bacterial horizontal gene transfer and evolution. FEMS Microbiol Rev. 33:376–393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Katoh K, Kuma K, Toh H, Miyata T. 2005. MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 33:511–518. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Kazada J, Pavlik I, Falkinham JO, 3rd, Hruska K. 2009. The ecology of mycobacteria: impact on animal’s and human's health. Dordrecht Heidelberg London, New York: Springer Verlag. [Google Scholar]
  42. Kulakov LA, Poelarends GJ, Janssen DB, Larkin MJ. 1999. Characterization of IS2112, a new insertion sequence from Rhodococcus, and its relationship with mobile elements belonging to the IS110 family. Microbiology 145:561–568. [DOI] [PubMed] [Google Scholar]
  43. Lagesen K, et al. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35:3100–3108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Le NB, Coleman NV. 2011. Biodegradation of vinyl chloride, cis-dichloroethene and 1,2-dichloroethane in the alkene/alkane-oxidising Mycobacterium strain NBB4. Biodegradation 22:1095–1108. [DOI] [PubMed] [Google Scholar]
  45. Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25:955–964. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Martin G, Doublié S, Keller W. 2008. Determinants of substrate specificity in RNA-dependent nucleotidyl transferases. Biochim Biophys Acta. 1779:206–216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Martin KE, Ozsvar J, Coleman NV. 2014. SmoXYB1C1Z of Mycobacterium sp. strain NBB4: a soluble methane monooxygenase (sMMO)-like enzyme, active on C2 to C4 alkanes and alkenes. Appl Environ Microbiol. 80:5801–5806. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Martinez JL, et al. 2009. Functional role of bacterial multidrug efflux pumps in microbial natural ecosystems. FEMS Microbiol Rev. 33:430–449. [DOI] [PubMed] [Google Scholar]
  49. Masuda T, et al. 2012. Cold-inducible RNA-binding protein (Cirp) interacts with Dyrk1b/Mirk and promotes proliferation of immature male germ cells in mice. Proc Natl Acad Sci U S A. 109:1–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. McLellan I, et al. 2007. The environmental behaviour of polychlorinated phenols and its relevance to cork forest ecosystems: a review. J Environ Monit. 9:1055–1063. [DOI] [PubMed] [Google Scholar]
  51. Miethling R, Karlson U. 1996. Accelerated mineralization of pentachlorophenol in soil upon inoculation with Mycobacterium chlorophenolicum PCP1 and Sphingomonas chlorophenolica RA2. Appl Environ Microbiol. 62:4361–4366. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Moss MT, et al. 1992. IS902, an insertion element of the chronic-enteritis-causing Mycobacterium avium subsp. silvaticum. J Gen Microbiol. 138:139–145. [DOI] [PubMed] [Google Scholar]
  53. Peden J. 1999. Analysis of codon usage. Thesis, Dept of Genetics, University of Nottingham. [Google Scholar]
  54. Pettersson BMF, et al. 2014. Draft genome sequence of Saccharopolyspora rectivirgula. Genome Announc. 2:e01117–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Pitulle C, Dorsch M, Kazda J, Wolters J, Stackenbrandt E. 1992. Phylogeny of rapidly growing members of the genus Mycobacterium. Int J Syst Bacteriol. 42:337–343. [DOI] [PubMed] [Google Scholar]
  56. Price MN, Dehal PS, Arkin AP. 2009. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol Biol Evol. 26:1641–1650. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Primm TP, Lucero CA, Falkinham JO., 3rd 2004. Health impacts of environmental mycobacteria. Clin Microbiol Rev. 17:98–106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Prohaska JR, Lukasewycz OA. 1981. Copper deficiency suppresses the immune response of mice. Science 213:559–561. [DOI] [PubMed] [Google Scholar]
  59. Rahman SA, et al. 2015. Reply to ‘“Mycobacterium indicus pranii” is a strain of Mycobacterium intracellulare': “M. indicus pranii” is a distinct strain, not derived from M. intracellulare, and is an organism at an evolutionary transition point between a fast grower and slow grower. MBio 6:e00352–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Richter M, Rosselló-Móra R. 2009. Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci U S A. 106:19126–19131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Rowland JL, Niederweis M. 2013. A multicopper oxidase is required for copper resistance in Mycobacterium tuberculosis. J Bacteriol. 195:3724–3733. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Saito H, et al. 1977. Cooperative numerical analysis of rapidly growing Mycobacteria: the second report. Int J Syst Bacteriol. 27:75–85. [Google Scholar]
  63. Satsuma K, Masuda M. 2012. Reductive dechlorination of methoxychlor by bacterial species of environmental origin: evidence for primary biodegradation of methoxychlor in submerged environments. J Agric Food Chem. 60:2018–2023. [DOI] [PubMed] [Google Scholar]
  64. Seemann T. 2014. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30:2068–2069. [DOI] [PubMed] [Google Scholar]
  65. Shafeeq S, et al. 2011. The cop operon is required for copper homeostasis and contributes to virulence in Streptococcus pneumoniae. Mol Microbiol. 81:1255–1270. [DOI] [PubMed] [Google Scholar]
  66. Sharma AK, Spudich JL, Doolittle WF. 2006. Microbial rhodopsins: functional versatility and genetic mobility. Trends Microbiol. 14:463–469. [DOI] [PubMed] [Google Scholar]
  67. Stadthagen-Gomez G, et al. 2008. The organization of two rRNA (rrn) operons of the slow-growing pathogen Mycobacterium celatum provides key insights into mycobacterial evolution. FEMS Microbiol Lett. 280:102–112. [DOI] [PubMed] [Google Scholar]
  68. Thanbichler M, Böck A. 2001. Functional analysis of prokaryotic SELB proteins. Biofactors 14:53–59. [DOI] [PubMed] [Google Scholar]
  69. Thoen CO, Steele JH, Kaneene JB. 2014. Zoonotic tuberculosis: Mycobacterium bovis and other pathogenic mycobacteria. 3rd ed Ames (IA): Wiley Blackwell. [Google Scholar]
  70. Tortoli E. 2003. Impact of genotypic studies on mycobacterial taxonomy: the New Mycobacteria of the 1990s. Clin Microbiol Rev. 16:319–354. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Tortoli E. 2006. The new mycobacteria: an update. FEMS Immunol Med Microbiol. 48:159–178. [DOI] [PubMed] [Google Scholar]
  72. Tritt A, Eisen JA, Facciotti MT, Darling AE. 2012. An integrated pipeline for de novo assembly of microbial genomes. PLoS One 7:e42304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Tsukamura M, Mizuno S. 1971. Mycobacterium obuense, a rapidly growing scotochromogenic Mycobacterium capable of forming a black product from p-aminosalicylate and salicylate. J Gen Microbiol. 68:129–134. [DOI] [PubMed] [Google Scholar]
  74. Tsukamura M, Mizuno S. 1977. Numerical analysis of relationships among rapidly growing, scotochromogenic mycobacteria. J Gen Microbiol. 98:511–517. [DOI] [PubMed] [Google Scholar]
  75. Vaerewijck MJ, Huys G, Palomino JC, Swings J, Portaels F. 2005. Mycobacteria in drinking water distribution systems: ecology and significance for human health. FEMS Microbiol Rev. 29:911–934. [DOI] [PubMed] [Google Scholar]
  76. Wakeman CA, Skaar EP. 2012. Metalloregulation of gram-positive pathogen physiology. Curr Opin Microbiol. 15:169–174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Ward SK, Abomoelak B, Hoye EA, Steinberg H, Talaat AM. 2010. CtpV: a putative copper exporter required for full virulence of Mycobacterium tuberculosis. Mol Microbiol. 77:1096–1110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. White C, Lee J, Kambe T, Fritsche K, Petris MJ. 2009. A role for the ATP7A copper-transporting ATPase in macrophage bactericidal activity. J Biol Chem. 284:33949–33956. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Whitman WB, et al. 2012. Bergey’s Manual® of Systematic Bacteriology. 2nd ed New York: Springer. [Google Scholar]
  80. Wiedenbeck J, Cohan FM. 2011. Origins of bacterial diversity through horizontal genetic transfer and adaptation to new ecological niches. FEMS Microbiol Rev. 35:957–976. [DOI] [PubMed] [Google Scholar]
  81. Wolschendorf F, et al. 2011. Copper resistance is essential for virulence of Mycobacterium tuberculosis. Proc Natl Acad Sci U S A. 108:1621–1626. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Yuksel ZS, et al. 2011. Mycobacterial strains that stimulate the immune system most efficiently as candidates for the treatment of bladder cancer. J Mol Microbiol Biotechnol. 20:24–28. [DOI] [PubMed] [Google Scholar]
  83. Zhang F, Xie J-P. 2011. Mammalian cell entry gene family of Mycobacterium tuberculosis. Mol Cell Biochem. 352:1–10. [DOI] [PubMed] [Google Scholar]
  84. Zhou Y, Liang Y, Lynch KH, Dennis JJ, Wishart DS. 2011. PHAST: a fast phage search tool. Nucleic Acids Res. 39:1–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Zhu Q, Kosoy M, Dittmar K. 2014. HGTector: an automated method facilitating genome-wide discovery of putative horizontal gene transfers. BMC Genomics 15:717. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Genome Biology and Evolution are provided here courtesy of Oxford University Press

RESOURCES