Skip to main content
Frontiers in Microbiology logoLink to Frontiers in Microbiology
. 2011 Dec 9;2:236. doi: 10.3389/fmicb.2011.00236

Genome-Wide Sequence Variation among Mycobacterium avium Subspecies paratuberculosis Isolates: A Better Understanding of Johne’s Disease Transmission Dynamics

Chung-Yi Hsu 1, Chia-Wei Wu 1, Adel M Talaat 1,2,*
PMCID: PMC3234532  PMID: 22163226

Abstract

Mycobacterium avium subspecies paratuberculosis (M. ap), the causative agent of Johne’s disease, infects many farmed ruminants, wild-life animals, and recently isolated from humans. To better understand the molecular pathogenesis of these infections, we analyzed the whole-genome sequences of several M. ap and M. avium subspecies avium (M. avium) isolates to gain insights into genomic diversity associated with variable hosts and environments. Using Next-generation sequencing technology, all six M. ap isolates showed a high percentage of similarity (98%) to the reference genome sequence of M. ap K-10 isolated from cattle. However, two M. avium isolates (DT 78 and Env 77) showed significant sequence diversity (only 87 and 40% similarity, respectively) compared to the reference strain M. avium 104, a reflection of the wide environmental niches of this group of mycobacteria. Within the M. ap isolates, genomic rearrangements (insertions/deletions) were not detected, and only unique single nucleotide polymorphisms (SNPs) were observed among M. ap isolates. While more of the SNPs (~100) in M. ap genomes were non-synonymous, a total of ~6,000 SNPs were detected among M. avium genomes, most of them were synonymous suggesting a differential selective pressure between M. ap and M. avium isolates. In addition, SNPs-based phylo-genomics had a enough discriminatory power to differentiate between isolates from different hosts but yet suggesting a bovine source of infection to other animals examined in this study. Interestingly, the human isolate (M. ap 4B) was closely related to a M. ap isolate from a dairy facility, suggesting a common source of infection. Overall, the identified phylo-genomes further supported the idea of a common ancestor to both M. ap and M. avium isolates. Genome-wide analysis described here could provide a strong foundation for a population genetic structure that could be useful for the analysis of mycobacterial evolution and for the tracking of Johne’s disease transmission among animals.

Keywords: Mycobacteria, paratuberculosis, Johne’s disease, whole-genome sequencing, genomics, pathogenesis

Introduction

Infection with Mycobacterium avium subspecies paratuberculosis (M. ap) causes Johne’s disease, or paratuberculosis, in a large number of ruminants and wild-life animals (Collins et al., 1994b). The combination of low milk yield and mortalities caused by Johne’s disease significantly impacts the economics of the dairy industry in the US and worldwide (Barrett et al., 2006; Raizman et al., 2009). Both infected and clinically ill animals can shed M. ap in their feces, a common source of infection, especially to young calves through a fecal–oral route (Collins et al., 1994a). The infected animals usually have a prolonged subclinical phase which eventually leads to severe gastroenteritis and death. The disease affects mainly ruminants with some evidence suggesting an association between M. ap and Crohn’s disease in humans (Naser et al., 2002, 2004; Over et al., 2011). As expected, M. ap isolates from variable hosts were the subject of several genetic analyses to decipher a potential role for genetic variations on M. ap virulence and pathogenesis. For example, approaches based on PCR amplification of specific targets (e.g., IS1311, IS900; Whittington et al., 2000) and short sequence repeats (SSR; Sevilla et al., 2008) revealed isolate variation on a genetic level. Recently, DNA microarrays were applied to examine variations on a whole-genome level (Paustian et al., 2005; Wu et al., 2006) which provided a comprehensive analysis of large scale differences among examined isolates. However, events of genomic rearrangements (insertions/deletions, Indels) were not easily identified. In this report, we resorted to a high throughput sequencing strategy to address our hypothesis linking genomic diversity to mycobacterial adaption to variable host and environments where they replicate and persist.

Several studies attempted to examine the genomic polymorphisms among M. avium complex (MAC) strains to better identify mycobacterial species associated with infections. In one study, dnaJ sequence revealed a limited genomic diversity among human and veterinary strains (Morita et al., 2004). However, comparative genomic hybridization revealed more diversity among M. avium, M. ap, and M. avium subsp. silvaticum (Semret et al., 2004; Paustian et al., 2005). Despite a 95% similarity at the nucleotide level between M. avium and M. ap, long oligonucleotide microarrays were able to assess genomic diversity among the genomes of MAC members (Semret et al., 2004; Wu et al., 2006). Fortunately, the complete genome sequence of M. ap K-10 was reported in 2005 and has been revised and re-sequenced recently (Li et al., 2005; Wu et al., 2009; Wynne et al., 2010) which allowed more detailed comparative analysis of several M. ap strains. For the M. tuberculosis complex, the presence of historical data and documented isolates collection helped in better understanding of the origin and evolution of members of this group (Behr and Small, 1999; Smith et al., 2009). Unfortunately, such records are not available for members of MAC.

With the advancements in next-generation sequencing technology (Bentley et al., 2008), we took advantage of Illumina-based technology to decipher the genome contents of M. ap isolates from various animals and their environments. This technology allows us to compare genomic sequences on unprecedented level, the nucleotide level, with high speed and accuracy. With the help of an array of bioinformatics tools, we were able to analyze the genomes of eight mycobacterial isolates including the ATCC 19698, a widely used M. ap isolate in several virulence and pathogenesis studies (Tanaka et al., 1994; Shin et al., 2006; Van et al., 2010). Most M. ap isolates had a high level of sequence similarity to the reference, M. ap K-10 strain on a genome-wide scale even when human isolates were analyzed (Wynne et al., 2011). However, the genomes of two M. avium isolates had lower level of sequence identity to the M. avium 104 genome, the reference genome for M. avium subsp. hominissuis (MAH). Overall, genomic rearrangements represented by large scale inversions and deletions were found between M. ap and M. avium genomes. However, single nucleotide polymorphisms (SNPs) were the most common variations observed among M. ap isolates from different animals despite the bovine origin for all of these isolates. The observed genomic polymorphism among MAC isolates provided us a better understanding for the evolutionary forces active on both closely related organisms with different characteristic phenotypes.

Materials and Methods

Bacterial strains

Six strains of M. avium subspecies paratuberculosis (M. ap) from different hosts and environments and two M. avium subspecies avium (M. avium) were selected (Table 1) for whole-genome sequencing. Laboratory strain of M. ap ATCC 19698, JTC 1281, JTC 1285, and DT 3 were all obtained from Johne’s testing center (JTC) at the University of Wisconsin–Madison. M. ap 4B, a human isolate, was provided by Dr. Saleh Naser at the University of Central Florida (Wu et al., 2006). Environmental isolates were obtained from the soil and utensils of dairy farms and provided by National Veterinary Service Laboratories at Ames, IA. The identity of each strain was confirmed and genotyped as M. ap versus M. avium by PCR analysis of three genes (16S rRNA, IS1311, hsp65) and growth phenotype in presence/absence of mycobactin J as outlined before (Wu et al., 2006). Each strain was cultured in Middlebrook 7H9 broth media supplemented with 10% ADC (2% glucose, 5% bovine serum albumin factor V, and 0.85% NaCl), 0.05% Tween 80 at 37°C. For M. ap isolates, 2 μg/ml of Mycobactin J was added.

Table 1.

A list of mycobacterial isolates used in this study.

Strain Organism Genes used to verify identity Host Sample origin
ATCC 19698 M. ap 16s rRNA, IS1311 Cow Feces
JTC 1281 M. ap 16s rRNA, IS1311 Oryx Lymph node
JTC 1285 M. ap 16s rRNA, IS1311 Goat Feces
M. ap 4B M. ap 16s rRNA, IS1311 Human Ileum
DT 3 M. ap 16s rRNA, IS1311 British red deer Feces
Env 210 M. ap 16s rRNA, IS1311 Dairy farm Environment
DT 78 M. avium 16s rRNA, Hsp65 Water buffalo Ileum
Env 77 M. avium 16s rRNA, Hsp65 Dairy farm Environment

Genomic DNA extraction

Five to 10 ml of bacterial cultures at mid-log phase were used for DNA extraction and isolation. Briefly, M. ap cultures were centrifuged down at 10,000 rpm for 3 min and pellets were then resuspended in sterile Tris–EDTA buffer. Bacterial cells were killed at 80°C for 20 min before adding lysozyme (10 μl of 100 mg/ml) for an overnight incubation at 37°C. After cell lysis, 12 μl of 20 mg/ml proteinase K/pellet was added and incubated at 65°C for 2–3 h. For DNA isolation, 100 μl of 5 M NaCl/pellet was added and incubated at 65°C for 10 min, followed by adding 80 μl of CTAB/NaCl and then incubated for another 10 min at 65°C. In addition, equal volume of phenol–chloroform–isoamyl alcohol (25:24:1) was added to each tube and centrifuge at 10,000 rpm for 5 min at room temperature. The aqueous upper layers were collected and transferred to a fresh tube for washing with equal volume of chloroform–isoamyl alcohol (24:1) and then with isopropanol followed by incubation at −20°C for at least 1 h. Genomic DNA was precipitated by centrifugation at 10,000 rpm for 15 min followed by washing in 75% ethanol. After wash, DNA pellets were dried in a Speed-Vac and resuspended in nuclease-free sterile water.

Whole-genome sequencing

Purified genomic DNA (1–5 μg) samples isolated from each target strain were sent to the Genomic Resource Center (GRC) at the University of Maryland for Illumina GAIIx whole-genome sequencing with multiplexing using the sample preparation oligonucleotide kit from Illumina. The integrity and concentration of DNA was checked by GRC again and followed by fragmentation of DNA using nebulization. DNA ends were repaired and A-tails and adaptors were added using Illumina protocols. The desired size of DNA (around 200 bp) were selected and then amplified with adaptor specific primers that contained four nucleotide barcode tags. The amplified DNA libraries were analyzed by Agilent Bioanalyzer to determine the size and the concentration of DNA fragments. DNA libraries were loaded onto eight channel flow cell. DNA fragments were denatured and hybridized to the oligonucleotides in flow cells followed by and amplification step to form clusters. The flow cells were then transferred to Genome analyzer II for sequencing. For paired-end sequence reads, the amplicons were flipped on the flowcells so the other end can be read as described before (Quail et al., 2008). At the end of each run, four images were collected and used for base-calling.

Sequence assembly and alignments

Raw sequences obtained from the Illumina GAIIx were analyzed using the CLC Genomic Workbench software (version 4.0.3, CLC Bio, Cambridge, MA, USA) to perform both de novo and comparative reference assembly. For the M. ap genomes, all sequences were assembled in reference to the revised M. ap K-10 sequences (Wynne et al., 2010). The M. avium DT 78 genome was assembled using the genome of M. avium subsp. hominissuis 104 (NCBI accession NC 008595) as a reference. The de novo assembly was used for the genome sequence of Env 77 strain because of the lack of significant similarity to other genomes. Additionally, the MAUVE algorithm was used to align paired or multiple genomes for comparative purposes, as outlined before (Perna et al., 1998; Darling et al., 2010). The gapped consensus sequence of each strain was imported to MAUVE for sequence alignment at default seed weight setting.

Single nucleotide polymorphisms analysis

For SNPs detection, we used algorithms implemented in CLC Bio Workstation. Criteria for identifying SNPs included a coverage range setting at 10–55 reads and a presence frequency in at least 50% of the reads before consideration for further analysis. A randomly selected number of SNPs were further analyzed using Sanger sequencing to confirm Next-Generation sequencing data. The primers were designed to cover 10 possible SNPs. The BigDye Terminator (Applied Biosystems, Foster City, CA, USA) version 3.1 cycle sequencing kit was used for sequencing. The sequencing PCR included an initial denaturation cycle at 95°C for 5 min followed by 35 cycles of 95°C for 20 s, 45°C for 30 s and 60°C for 2 min with a final extension at 72°C for 7 min. All samples were sent to the Biotechnology Center at the University of Wisconsin–Madison for sequencing on a ABI 3730XL machine (Applied Biosystems).

For the genome-wide phylogeny (phylo-genome analysis), the predicted SNPs from sequenced genomes (M. ap isolates) and the corresponding nucleotides in DT 78, M. ap K-10 and M. avium 104 were tabulated to create a concatenated sequences of each strain. The genome of M. avium Env 77 isolate was excluded from such analysis because of the low similarity to other genomes. The concatenated sequence of each strain was aligned using CLUSTALW, and phylogenetic trees were generated with MEGA version 5 using one of the following methods: maximum parsimony (MP), maximum likelihood (ML), maximum likelihood with molecular clock (MLK) assumption in addition to Neighbor-joining algorithm with a bootstrapping values of 1,000 replicates applied to all methods (Tamura et al., 2011).

Results

Characterization of mycobacterial isolates

Several mycobacterial species belonging to the M. avium complex are present in animal surroundings; each with different capacities to cause illness (e.g., M. ap, M. avium) and potential to spread to humans (Alvarez-Uria, 2010). Before initiating our genome analysis of members of the M. avium complex, we searched our collection of mycobacterial isolates originating from diverse hosts, diverse tissues as well as from environmental samples of dairy herds that might help in spreading the infection. Our selection scheme identified eight isolates that were subjected for further genotyping protocols to confirm their identity. Based on acid-fast staining and amplification of the 16S rRNA gene using mycobacteria-specific primers (Talaat et al., 1997), all eight isolates were shown to belong to the genus mycobacterium. Moreover, typing based on the hsp65 gene (Smole et al., 2002) confirmed the identity of two mycobacterial isolates, DT 78 and Env 77 as M. avium subspecies avium (M. avium) while the rest of the isolates were all M. ap. Identification of sheep or cattle types of M. ap was based on IS1311 amplification followed by HinfI digestion (data not shown). All of the six M. ap isolates belonged to the bovine origin (M. ap type II). A compiled list of all mycobacterial isolates used in this study and their origin is shown in Table 1.

Whole-genome sequencing of mycobacterial isolates

The Illumina sequencer generated an average read length of 50 nucleotides with an average coverage of 42–68× of each sequenced genome after reference assembly. The number of reads, mapped reads, and the length of consensus sequence are all listed in Table 2. The revised version of M. ap K-10 sequence (Wynne et al., 2010) and M. avium subspecies hominissuis (M. avium 104) were used as references for comparative genome assembly of the target isolates. As expected, all examined M. ap genomes showed a high sequence identity (up to 99%) to the M. ap K-10 genome. Lack of sequence coverage in some parts of the genome could explain some of the differences from the reference genome. Despite the presence of small deleted regions among M. ap genomes, only 2 gaps >1 kb had been seen among M. ap genomes, including the one isolated from human (M. ap 4B isolate), suggesting a high level of similarity to the M. ap K-10 strain isolated from cattle. On the other hand, the M. avium DT 78 strain had only 87% sequence identity to the M. avium 104 genome while it had a higher similarity (93%) to the M. ap K-10 genome, despite its established genotype as M. avium isolate. In the DT 78 genome, more gaps were present whether M. avium 104 or M. ap K-10 were used for reference alignment (Figure 1). The average gap size in this genome is ~4 kb.

Table 2.

A summary report for CLC Bio reference assembly of M. avium and M. ap isolates.

ATCC 19698 M. ap 4B JTC 1281 JTC 1285 DT 3 Env 210 DT 78
Reference organism M. ap K-10 M. ap K-10 M. ap K-10 M. ap K-10 M. ap K-10 M. ap K-10 M. avium 104
Reference length 4,832,589 4,832,589 4,832,589 4,832,589 4,832,589 4,832,589 5,475,491
Total read count 5,994,312 6,729,396 4,645,230 5,985,952 6,374,242 6,294,162 6,978,706
Matched read count 5,417,459 6,522,333 4,164,731 5,391,674 6,177,155 6,080,493 5,637,136
Non-specific match read counta 53,145 56,051 39,879 54,951 50,700 53,340 61,192
Consensus length 4,822,328 4,815,985 4.823.742 4,823,165 4,815,376 4,817,334 4,808,427
Homology (%)b 99.79 99.66 99.82 99.80 99.64 99.68 87.82
Average coveragec 55.77 68.71 42.87 55.50 65.07 64.05 51.16

aNon-specific match read counts are those reads that can be matched more than one place in the reference genome and such reads were randomly placed in one of the matched spots.

bHomology percentage was calculated as: consensus length divided by reference length and then multiplies 100.

cAverage coverage is the average of all the reads coverage in each area in the consensus sequence.

Figure 1.

Figure 1

A whole-genome alignment of M. avium DT 78, M. avium 104 and M. ap DT 78. MAUVE algorithm (Darling et al., 2010) was used for the alignment of the three genomes where white areas indicate low coverage gaps in the sequence of M. avium DT 78 genome, and about seven large region Indels were identified in M. avium DT 78. Regions with the same color indicate high similarity and connected by same color bars. The genomes were drawn to scale based on the reference M. avium 104 genome.

Among the sequenced genomes, the genome of M. avium Env 77 provided a significant challenge because of the low level of similarity to M. avium 104 genome during the reference assembly phase. Accordingly, we employed an algorithm for de novo assembly that generated 772 contigs. These contigs were used as queries in MegaBLAST search against the Mycobacteria genome database (blast.ncbi.nlm.nih.gov). The coverage of each contig is at least 20× and the average coverage of all contigs is around 30× for this strain. In fact, the Env 77 genome was sequenced twice with similar result for each sequencing run (data not shown). Interestingly, BLAST analysis showed only a third of the Env 77 genome with sequence similarity to the genomes of either the M. ap K-10 or M. avium 104 and to a lesser degree to other sequenced mycobacterial genomes, suggesting a mosaic genome structure (Figure 2). Detailed BLAST analysis of the Env 77 draft genome shared common conserved genes, mainly with four mycobacterium species, including ribosomal proteins, DNA polymerase, proteinase Clp, cell division protein Fts, and some transcription or translation regulatory factors. As indicated in Figure 2, the genome of M. avium Env 77 has higher similarity to M. avium 104 and M. ap K-10 than other mycobacterial species. Overall, the sequenced genomes from all strains, except Env 77, mapped to the reference genomes with a significantly high level of similarity. All sequenced genomes were deposited to GenBank database for download and further analysis. The accession numbers for the deposited sequences are listed in Table A1 in Appendix.

Figure 2.

Figure 2

Genome composition of M. avium Env 77. MegaBLAST algorithm was used to identify closely related bacteria to all contig sequences from the M. avium Env 77 isolate. Genomes with <10% homology were excluded from representation. Members of the M. tuberculosis complex included M. tuberculosis and M. bovis with sequence divergence <5%. The same criteria was used to formulate M. avium and M. ap groups.

Genomic rearrangements among M. ap isolates

A major goal of our investigation was to delineate events of insertions and deletions among mycobacterial genomes to better understand their evolutionary relationships. To identify large scale events of insertions/deletions (Indels), we compared the assembled genomes of the six M. ap isolates to the standard M. ap K-10 genome using MAUVE software (version 2.3.1; Darling et al., 2010; Figure 3). Among the potential Indels that could exist among these genomes, we identified only gaps that are <1 kb. A common gap area located at reference position 3,767,550–3,767,870 which is part of MAPK 3350 gene encoding a hypothetical protein has been seen among all six strains with a gap size ~300 bp. At this region, low or zero read coverage has observed among all six strains suggesting a problematic region for Illumina sequencer. The sequence in this gap region appeared to have high GC contents (82%) but no repetitive elements involved.

Figure 3.

Figure 3

Comparative analysis of M. ap and M. avium from animals and environmental sources. The gapped consensus sequence of each strain was used for comparison by MAUVE version 2.3.1. (A) A close-up depiction of a breaking point in the alignment of six M. ap genomes in comparison to M. ap K-10 reference genome. The white areas indicated low or zero reads. In this example, the flanking sequences of the breaking point contain high GC percentage sequence but not repetitive sequences. (B) Indels among M. ap and M. avium genomes. Notice genome rearrangements are usually surrounding the genome origin of replication.

Based on the MAUVE comparison, the consensus sequences of these six strains are closely matched to the M. ap K-10 genome and no inversions were observed (Figure 3). On the other hand, when MAUVE was used to compare the genome of M. ap isolates to the M. avium 104 or M. avium DT78 genomes, about seven large regions of Indels were identified, confirming earlier findings by our group when DNA microarray was used (Wu et al., 2006). For example, one 11 kb Indel was found in all six M. ap strains at position 2,318,400–2,333,740 (MAPK 2038–MAPK 2050) but absent from M. avium. This 11 kb region encodes mostly hypothetical proteins in M. ap K-10 genome with two exceptions, MAPK 2040 and MAPK 2050. MAPK 2040 is a predicted hydrolase and earlier analysis (Santema et al., 2009) also showed the absence of this gene in M. avium 104, but present in other M. avium strain (Table 3). In addition, a total of six genomic inversions spanning ~2.4 Mb were identified among all M. ap strains when compared to M. avium 104 genome, similar to our earlier analysis of only M. ap K-10 and M. avium 104 genomes (Wu et al., 2006).

Table 3.

A list of genes in the 11 kb island which is absent in M. avium 104.

New annotation (Wynne et al., 2010) Old annotation (Li et al., 2005) Length (bp) Function
MAPK 2038 MAP 1730c 1,023 Hypothetical protein
MAPK 2039 MAP 1729c 828 Hypothetical protein
MAPK 2040 MAP 1728c 723 YfnB-hydrolase
MAPK 2041 MAP 1727 906 Hypothetical protein
MAPK 2042 MAP 1726c 585 Hypothetical protein
MAPK 2043 MAP 1725c 1,029 Hypothetical protein
MAPK 2044 MAP 1724c 558 Hypothetical protein
MAPK 2045 MAP 1723 666 Hypothetical protein
MAPK 2046 MAP 1722 1,221 Hypothetical protein
MAPK 2047 MAP 1721c 672 Hypothetical protein
MAPK 2048 MAP 1720 1,020 Hypothetical protein
MAPK 2049 MAP 1719c 615 Hypothetical protein
MAPK 2050 MAP 1718c 456 MAP specific protein

SNPs among M. ap isolates

To better analyze genomic diversity among M. ap isolates, we also examined genomic variations on the nucleotide level. For SNPs analysis, we set stringent criteria for SNP detection (see Materials and Methods). The total number of SNPs among six M. ap genomes ranged from 56 to 131 (Figure 4), among which 17 were found in >1 genome (Table 4). The number of non-synonymous SNPs (nSNPs) is slightly higher than synonymous SNPs (sSNPs), suggesting a positive selective pressure on the identified genes. In addition, most genes harbored one SNP with exceptions of 23 genes that contained two or three SNPs (Table A2 in Appendix). Interestingly, GlnE and MAPK 4304 contained three SNPs each, all are nSNPs, suggesting a high selective pressure on these two genes. Majority of genes contained >1 SNP are larger than 1 kb in size with an average SNP density of 1 SNP per 1.44 kb. Remaining 232 genes that harbored only one SNP represented a similar SNP density of one SNP per 1.44 kb that was identified in other mycobacterium (Qi et al., 2009). For the M. ap JTC 1281 and M. ap 4B, the percentage of nSNPs were 52.68 and 51.76% respectively, and the rest of M. ap strains with >60% of SNPs were nSNPs. Interestingly, genes encoding the Cytochrome P450 proteins harbored a high number of alleles in three of the six examined genomes (Table 5), similar to the same family of genes in M. tuberculosis (Cole, 1999). Intergenic SNPs were identified and counted for <10% of total SNPs.

Figure 4.

Figure 4

The total number of single nucleotide polymorphism (SNP) among M. ap isolates. The number of nSNP (non-synonymous) and sSNP (synonymous) and SNPs in the intergenic regions are color coded as indicated. SNPs were detected using reference assembled sequences of each strain. About 60–130 SNPs were detected M. ap isolates. Percentage of nSNP is generally higher than sSNP which indicates a high selective pressure in these strains.

Table 4.

A list of non-synonymous SNPs in M. ap genome resulted in more than one strain.

Strains K-10 position K-10 allele Variation Gene Function
1 All 6 strains 3,259,329 C T MAPK 2850 Trypsin-like serine protease
2 All 6 strains 4,394,282 A G MAPK 3393 Fucose permease
3 All 6 strains 2,041,445 T C glnE Glutamine synthase
4 ATCC 198698, JTC 1281, JTC 1285, DT 3, Env 210 1,169,976 A C MAPK 1064 Hemolysin-like protein
5 ATCC 198698, JTC 1281, JTC 1285, DT 3, Env 210 91,310 A G nirB Nitrate reductase
6 JTC 1281, JTC 1285, M. ap 4B, Env 210 3,133,871 G A speE Spermidine synthase
7 JTC 1281, M. ap 4B, DT 3, Env 210 2,806,612 G T cydD ATP-binding protein ABC transporter CydD
8 ATCC 19698, JTC 1281, JTC 1285, DT 3 3,278,891 A T pyrH Uridylate kinase PyrH
9 ATCC 19698, JTC 1281, DT 3 1,204,735 T C bpoB Peroxidase BpoB
10 JTC 1281, JTC 1285, Env 210 4,206,587 C T pks2 Polyketide synthase Pks2
11 M. ap 4B, Env 210 1,50,857 G C lipW Esterase LipW
12 M. ap 4B, Env 210 2,25,551 C T fctA Transferase
13 M. ap 4B, Env 210 6,47,971 C A nuoL NADH dehydrogenase subunit L
14 M. ap 4B, Env 210 2,353,857 C A MAPK 2071, hspR Heat shock regulator protein
15 M. ap 4B, Env 210 3,981,515 G A pks13 Polyketide synthase Pks13
16 M. ap 4B, Env 210 4,262,844 T G MAPK 3814 Lipoprotein
17 ATCC 19698, DT 3 1,363,662 A C MAPK 1234 Arabinose efflux permease

Table 5.

A list of nSNP in cytochrome P450 proteins.

Strains K-10 position K-10 allele Variation Gene Amino acid change (functional consequence)
Env 210 1,227,540 A G MAPK 1119 Ile → Met (non-polar)
JTC 1285 1,301,615 C T MAPK 1184 Glu → Lys (Polar acidic → polar basic)
JTC 1285 2,024,939 G A MAPK 1789 Ala → Val (non-polar)
JTC 1281 1,973,792 A G MAPK 1738 Val → Ala (non-polar)
JTC 1281 3,841,168 G C MAPK 3424 Arg → Pro (polar basic → non-polar)

Generally, a modest number of SNPs were detected among genomes of M. ap isolates, unlike M. avium isolates. The M. avium DT 78 genome had a significantly high number of SNPs detected (6,278 SNPs) when compared to the standard M. avium 104 genome suggesting an earlier separation of this strain during its evolutionary pathway. In addition, >75% of the identified SNPs were synonymous, an indication of a higher stabilizing selective pressure for M. avium genes than those of M. ap. For the M. avium Env 77, SNP detection was not performed because the whole sequence aligned poorly with either M. ap K-10 or M. avium 104. Finally, 10 SNPs were randomly chosen for further confirmation using the Sanger sequencing method. The 10 SNPs were chosen based on the ATCC 19698 genome. The same 10 SNPs were also found in JTC 1281, while only 5 common SNPs were found in JTC 1285. All amplicons were sequenced from both forward and reverse strands (Table A3 in Appendix). Three SNPs were not detected in JTC 1285 based on the Sanger results, and is most likely caused by the Illumina sequencer error. Overall, Illumina sequencing was very beneficial in providing a high level of single nucleotide polymorphism in all examined genomes.

Phylo-genomic relationship among M. ap isolates

Single nucleotide polymorphisms of six M. ap strains were concatenated and used for phylogenetic analysis on a genome-wide (phylo-genome) level. The two reference strains, M. ap K-10 and M. avium 104, were included in the analysis. A total of 301 SNPs present among the six M. ap strains as well as in M. avium 104 and M. avium DT 78 genomes were included in this analysis using the Neighbor-joining method (Tamura et al., 2011). The un-rooted tree showed a strong discriminatory power of SNP for all examined isolates based on their origin (Figure 5A) while maintained branches of M. avium genomes separate from genomes of M. ap isolates. Such discriminatory power was not possible when single-gene genotypes were tried (see above). Nonetheless, when the tree was rooted to M. avium 104 genome, two distinct major branches within the M. ap genomes were easily discerned (Figure 5B).

Figure 5.

Figure 5

Phylogenomic analysis of M. ap and M. avium strains. (A) A dendrogram displaying an un-rooted, Neighbor-joining tree of the concatenated SNPs from all eight mycobacterial isolates under study. (B) A rooted Neighbor-joining tree using M. ah 104 genome as out group. The bootstrap consensus tree inferred from 1,000 replicates is taken to represent the evolutionary history of the taxa analyzed. The bootstrap replicates are marked on each branch and a less than 50% bootstrap replicates were collapsed. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test is shown next to the branches.

In one branch within M. ap genomes (Figure 5B), an isolate from red deer (M. ap DT 3) was closely related to the standard cattle strains (M. ap K-10 and ATCC 19698). On the other hand, isolates from goat and oryx (M. ap JTC 1281 and JTC 1285, respectively) were more closely related to the recently isolated cattle type strain (M. ap K-10) than to other laboratory strain (ATCC 19698), suggesting a cattle source of infection. In the other branch of the tree, M. ap 4B and M. ap Env 210 isolates from human and dairy farm, respectively, were closely related to each other. It is noteworthy to mention here that the association of M. avium DT 78 genome to the M. avium 104 strain based on phylo-genomic analysis confirmed our earlier identification of this isolate to belong M. avium group despites its overall higher similarity to the M. ap K-10. Finally, when we tried additional three methods for tree construction (MP, ML, MLK) on independent lists of sSNPs and nSNPs, a congregant topology was obtained for all trees with a high bootstrap support, similar to the one showed in Figure 5B. The Log Likelihood Ratio test for MLK consensus tree against ML tree indicated that the molecular clock assumption was not valid (p < 0.007). Overall, the identified tree topology suggests that M. avium 104 as a common ancestor from which M. ap likely emerged and diversified into two lineages: a lineage that clustered Env 210 with M. ap 4B (Human) while the second clustered all type II strains of M. ap. In both lineages, infected cows are the most likely reservoir for spreading the type II M. ap strains.

Discussion

Understanding the genome-wide variations among pathogenic Mycobacteria will improve our understanding of the pathogenesis and evolution of these important pathogens. Recently, Next-generation sequencing technologies provided us the opportunity to examine whole-genome variations on a much faster basis than traditional sequencing or DNA-microarray technologies. In this study, M. ap isolates were chosen from diverse hosts, sources, and locations to better assess the impact of these variations on pathogen genome composition. As expected, the six M. ap genomes sequenced in this study shared ~99% sequence similarity M. ap K-10 reference genome with a modest number of SNPs (~100) suggesting a stabilizing selective pressure. On the contrary, isolates of M. avium origin showed more diversity. For example, the M. avium DT 78 genome had a significant number of gaps and a large number of SNPs (~6,000) compared to M. avium 104 despite its significant similarity to M. ap K-10 (>90%) on a whole-genome level. This isolate is likely to represents an intermediate strain between M. avium 104 and M. ap K-10. Generally, M. avium replicate faster than M. ap and survive in a more diverse environments, those factors are likely to contribute to adaptive polymorphism. Previous analyses showed that M. avium has more diversity than M. ap strains (Turenne et al., 2006; Wu et al., 2006). Additionally, the M. avium Env 77 genome BLAST search indicated its complex and mosaic structure, another indication of diversity among M. avium isolates. Although standard genotyping protocols used here (based on hsp65 and IS1311) clearly typed DT78 and Env 77 isolates as M. avium, our genome sequencing approach question the validity of genotyping of Mycobacteria based on a single or a few genes and advocate for a whole-genome based approach.

Because of the close relatedness of M. ap genomes, SNPs from each strain provided valuable information on the divergence and evolutionary process that control members of MAC. A wide range of studies used SNPs for studying drug resistance mutations in organisms (Xu et al., 2008), analysis of genomic evolution (Filliol et al., 2006), and association of M. ap infection to Crohn’s disease patients (Wynne et al., 2011). In this study, SNPs were used for genome-wide typing of isolates to understand the dynamics of Johne’s disease transmission. Examining the modest number of SNPs detected in M. ap identified the presence of a higher percentage of nSNPs in all six M. ap isolates, suggesting a close relatedness among strains (Gutacker et al., 2002; Holden et al., 2004; Rocha et al., 2006). This close relationship among M. ap isolates could indicate a “spillover” infection from cattle to other animals (in this study red deer). However, the observed higher percentage of nSNPs could indicate adaptive evolution of M. ap to different hosts with positive selective pressure. A significant number of nSNPs were located in genes that encode hypothetical proteins while others in genes that encode proteins with enzymatic functions, some of them involved in metabolism and energy pathways, such as Pks proteins and NuoL protein. Interestingly, a SNP in the glnE gene were identified in all six M. ap genomes and additional SNPs in this gene were observed in ATCC 19698 and M. ap 4B separately. In M. tuberculosis, GlnE is an adenylyl transferase modulating glutamine synthetase activity and it is essential for bacterial growth under alternative nitrogen sources (Carroll et al., 2008). SNPs within this gene could likely be an indication of common evolutionary ancestor with environmental isolates. Similarity, SNPs in cytochrome P450 enzymes (Table 5) that catalyze mixed oxidation of hydrophobic compounds associated with free-living saprophyte (Arnold, 2007), another indication of a common environmental ancestor for M. ap. Finally, genes encoding cytochrome P450 were shown to play a role in the persistence of M. tuberculosis in tissues (McLean et al., 2010). SNPs found in M. ap counterpart could potentially contribute to the M. ap-host interactions.

An interesting outcome of the employed phylo-genomic analysis provided here is the further support provided to the hypothesis of presence of common origin to both subspecies of M. avium complex, namely, M. ap and M. avium subsp. avium. Such hypothesis was supported before based on large genomic regions of insertions/deletions (Wu et al., 2006; Alexander et al., 2009). This study provides further support to this hypothesis using SNPs on a genome-wide level. The whole-genome approach we employed here allowed us to explore the diversity among MAC isolates from different hosts and variable locations. It also provided more clues regarding the dynamic of mycobacterial transmission among animals. Sequencing the genome of more isolates will definitely enrich our understanding of the genome content and evolution of both environmental and pathogenic strains of mycobacteria and will eventually provide a comprehensive population genetic structure. Such knowledge base could elucidate the relationship between strains and host or some special environmental cues. In addition, sequencing more diverse isolates will help to evaluate the dynamic of disease transmission among animals or from animals to humans. Developing algorithms that can utilize the information gained from Next-generation sequencers will only improve the phylo-genomic analysis and is greatly needed to advance our understanding of microbe–host interactions.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

We would like to thank Meagan Cooney for reading the manuscript. We also like to thank Sarah Marcus and Eric Cabot for assisting with the CLC Bio assembling software. Illumina genome sequencing was performed by Genomic Resource center at University of Maryland, Baltimore. This project is supported by National Research Initiative of the U.S. Department of Agriculture Cooperative State Research, Education and Extension Service (grants #2007-35204-18400 and #2004-35605-14243 for Johne’s disease integrated program) and US-AID grant #1937.

Appendix

Table A1.

A list of accession numbers of genome deposited to GenBank.

Organisms Accession number
Mycobacterium avium subsp. paratuberculosis ATCC 19698 AGAR00000000
Mycobacterium avium subsp. paratuberculosis JTC 1281 AGAK00000000
Mycobacterium avium subsp. paratuberculosis JTC 1285 AGAL00000000
Mycobacterium avium subsp. paratuberculosis 4B AGAM00000000
Mycobacterium avium subsp. paratuberculosis DT 3 AGAN00000000
Mycobacterium avium subsp. paratuberculosis Env 210 AGAO00000000
Mycobacterium avium subsp. avium DT 78 AGAP00000000
Mycobacterium avium subsp. avium Env 77 AGAQ00000000

Table A2.

A list of genes that harbored >1 SNPs and its SNPs density.

No. K-10 position N/S Annotations Size of gene SNP density
1 38,870 N MAPK 0028 11,206 5,603
46,975 N MAPK 0028
2 1,42,825 N MAPK 0106 1,481 740.5
1,42,835 N MAPK 0106
3 1,57,866 N fadD4 1,517 758.5
1,57,867 S fadD4
4 2,05,752 S mecD 1,613 806.5
2,05,753 N mecD
5 5,09,508 S MAPK 0430 524 262
5,09,633 N MAPK 0430
6 6,93,293 S MAPK 0603 1,166 583
6,93,483 N MAPK 0603
7 1,233,762 N MAPK 1125 500 250
1,233,871 N MAPK 1125
8 1,603,684 N MAPK 1444 2,354 1,177
1,604,357 S MAPK 1444
9 1,910,469 S MAPK 1687 1,730 865
1,910,564 N MAPK 1687
10 1,994,322 S MAPK 1761 1,061 530.5
1,994,370 S MAPK 1761
11 2,040,806 N GlnE 2,996 998.7
2,040,946 N GlnE
2,041,445 N GlnE
12 2,150,741 N MAPK 1898 4,377 2188.5
2,151,370 S MAPK 1898
13 2,353,857 N MAPK 2071 380 190
2,353,858 S MAPK 2071
14 2,367,415 N MAPK 2083 1,577 788.5
2,367,416 N MAPK 2083
15 2,605,368 S MAPK 2303 1,454 727
2,605,372 N MAPK 2303
16 2,664,521 N MAPK 2348 19,154 9,577
2,676,784 S MAPK 2348
17 2,785,781 N MAPK 2437 2,600 1,300
2,786,976 S MAPK 2437
18 2,92,000 N MAPK 2539 1,196 598
2,920,661 S MAPK 2539
19 3,888,659 N MAPK 3467 689 344.5
3,888,879 N MAPK 3467
20 4,040,757 S MAPK 3602 1,604 802
4,041,118 N MAPK 3602
21 4,079,226 N MAPK 3645 905 452.5
4,079,231 S MAPK 3645
22 4,206,587 N pks2 6,296 3,148
4,211,613 S pks2
23 4,773,398 N MAPK 4304 1,208 402.7
4,774,040 N MAPK 4304
4,774,041 N MAPK 4304

N, non-synonymous SNP; S, synonymous SNP.

Table A3.

A list of 10 SNPs picked for Sanger sequencing.

No. Gene name K-10 position K-10 allele ATCC 19698 JTC 1285 JTC 1281 M. av ATCC 25291 M. av 104
1 MAP 2578 2,900,072 G C C N/D C C
2 MAP 2578 2,900,076 A T T N/D T T
3 MAP 3165 3,517,478 C A C N/D C C
4 MAP 3165 3,517,668 C G G G G G
5 MAP 3391c 3,767,913 T G G G G G
6 MAP 3391c 3,767,939 G C C C C C
7 Neighbor of rpoC 4,606,712 G G C N/D G G
8 rpoC 4,607,283 G A G N/D G G
9 MAP 4302c 4,771,441 A T T T T T
10 MAP 4302c 4,771,588 G A A A A A

N/D, not detected.

References

  1. Alexander D. C., Turenne C. Y., Behr M. A. (2009). Insertion and deletion events that define the pathogen Mycobacterium avium subsp. paratuberculosis. J. Bacteriol. 191, 1018–1025 10.1128/JB.01340-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alvarez-Uria G. (2010). Lung disease caused by nontuberculous mycobacteria. Curr. Opin. Pulm. Med. 16, 251–256 [DOI] [PubMed] [Google Scholar]
  3. Arnold C. (2007). Molecular evolution of Mycobacterium tuberculosis. Clin. Microbiol. Infect. 13, 120–128 10.1111/j.1469-0691.2006.01637.x [DOI] [PubMed] [Google Scholar]
  4. Barrett D. J., Good M., Hayes M., More S. J. (2006). The economic impact of Johne’s disease in an Irish dairy herd: a case study. Ir. Vet. J. 59, 282 [Google Scholar]
  5. Behr M. A., Small P. M. (1999). A historical and molecular phylogeny of BCG strains. Vaccine 17, 915–922 10.1016/S0264-410X(98)00277-1 [DOI] [PubMed] [Google Scholar]
  6. Bentley D. R., Balasubramanian S., Swerdlow H. P., Smith G. P., Milton J., Brown C. G., Hall K. P., Evers D. J., Barnes C. L., Bignell H. R., Boutell J. M., Bryant J., Carter R. J., Keira C. R., Cox A. J., Ellis D. J., Flatbush M. R., Gormley N. A., Humphray S. J., Irving L. J., Karbelashvili M. S., Kirk S. M., Li H., Liu X., Maisinger K. S., Murray L. J., Obradovic B., Ost T., Parkinson M. L., Pratt M. R., Rasolonjatovo I. M., Reed M. T., Rigatti R., Rodighiero C., Ross M. T., Sabot A., Sankar S. V., Scally A., Schroth G. P., Smith M. E., Smith V. P., Spiridou A., Torrance P. E., Tzonev S. S., Vermaas E. H., Walter K., Wu X., Zhang L., Alam M. D., Anastasi C., Aniebo I. C., Bailey D. M., Bancarz I. R., Banerjee S., Barbour S. G., Baybayan P. A., Benoit V. A., Benson K. F., Bevis C., Black P. J., Boodhun A., Brennan J. S., Bridgham J. A., Brown R. C., Brown A. A., Buermann D. H., Bundu A. A., Burrows J. C., Carter N. P., Castillo N., Chiara E., Catenazzi, Chang S., Neil C. R., Crake N. R., Dada O. O., Diakoumakos K. D., Dominguez-Fernandez B., Earnshaw D. J., Egbujor U. C., Elmore D. W., Etchin S. S., Ewan M. R., Fedurco M., Fraser L. J., Fuentes Fajardo K. V., Scott F. W., George D., Gietzen K. J., Goddard C. P., Golda G. S., Granieri P. A., Green D. E., Gustafson D. L., Hansen N. F., Harnish K., Haudenschild C. D., Heyer N. I., Hims M. M., Ho J. T., Horgan A. M., Hoschler K., Hurwitz S., Ivanov D. V., Johnson M. Q., James T., Huw Jones T. A., Kang G. D., Kerelska T. H., Kersey A. D., Khrebtukova I., Kindwall A. P., Kingsbury Z., Kokko-Gonzales P. I., Kumar A., Laurent M. A., Lawley C. T., Lee S. E., Lee X., Liao A. K., Loch J. A., Lok M., Luo S., Mammen R. M., Martin J. W., McCauley P. G., McNitt P., Mehta P., Moon K. W., Mullens J. W., Newington T., Ning Z., Ling N. B., Novo S. M., O’Neill M. J., Osborne M. A., Osnowski A., Ostadan O., Paraschos L. L., Pickering L., Pike A. C., Pike A. C., Chris P. D., Pliskin D. P., Podhasky J., Quijano V. J., Raczy C., Rae V. H., Rawlings S. R., Chiva R. A., Roe P. M., Rogers J., Rogert Bacigalupo M. C., Romanov N., Romieu A., Roth R. K., Rourke N. J., Ruediger S. T., Rusman E., Sanches-Kuiper R. M., Schenker M. R., Seoane J. M., Shaw R. J., Shiver M. K., Short S. W., Sizto N. L., Sluis J. P., Smith M. A., Ernest Sohna S. J., Spence E. J., Stevens K., Sutton N., Szajkowski L., Tregidgo C. L., Turcatti G., Vandevondele S., Verhovsky Y., Virk S. M., Wakelin S., Walcott G. C., Wang J., Worsley G. J., Yan J., Yau L., Zuerlein M., Rogers J., Mullikin J. C., Hurles M. E., McCooke N. J., West J. S., Oaks F. L., Lundberg P. L., Klenerman D., Durbin R., Smith A. J. (2008). Accurate whole human genome sequencing using reversible terminator chemistry. Nature 456, 53–59 10.1038/nature07517 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Carroll P., Pashley C. A., Parish T. (2008). Functional analysis of GlnE, an essential adenylyl transferase in Mycobacterium tuberculosis. J. Bacteriol. 190, 4894–4902 10.1128/JB.00166-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Cole S. T. (1999). Learning from the genome sequence of Mycobacterium tuberculosis H37Rv. FEBS Lett. 452, 7–10 10.1016/S0014-5793(99)00536-0 [DOI] [PubMed] [Google Scholar]
  9. Collins D. M., Radford A. J., De Lisle G. W., Billman-Jacobe H. (1994a). Diagnosis and epidemiology of bovine tuberculosis using molecular biological approaches. Vet. Microbiol. 40, 83–94 10.1016/0378-1135(94)90048-5 [DOI] [PubMed] [Google Scholar]
  10. Collins M. T., Sockett D. C., Goodger W. J., Conrad T. A., Thomas C. B., Carr D. J. (1994b). Herd prevalence and geographic distribution of, and risk factors for, bovine paratuberculosis in Wisconsin. J. Am. Vet. Med. Assoc. 204, 636–641 [PubMed] [Google Scholar]
  11. Darling A. E., Mau B., Perna N. T. (2010). progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS ONE 5, e11147. 10.1371/journal.pone.0011147 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Filliol I., Motiwala A. S., Cavatore M., Qi W., Hazbon M. H., Bobadilla d. V., Fyfe J., Garcia-Garcia L., Rastogi N., Sola C., Zozio T., Guerrero M. I., Leon C. I., Crabtree J., Angiuoli S., Eisenach K. D., Durmaz R., Joloba M. L., Rendon A., Sifuentes-Osornio J., Ponce de L. A., Cave M. D., Fleischmann R., Whittam T. S., Alland D. (2006). Global phylogeny of Mycobacterium tuberculosis based on single nucleotide polymorphism (SNP) analysis: insights into tuberculosis evolution, phylogenetic accuracy of other DNA fingerprinting systems, and recommendations for a minimal standard SNP set. J. Bacteriol. 188, 759–772 10.1128/JB.188.8.3162-3163.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Gutacker M. M., Smoot J. C., Migliaccio C. A. L., Ricklefs S. M., Hua S., Cousins D.V., Graviss E. A., Shashkina E., Kreiswirth B. N., Musser J. M. (2002). Genome-wide analysis of synonymous single nucleotide polymorphisms in Mycobacterium tuberculosis complex organisms: resolution of genetic relationships among closely related microbial strains. Genetics 162, 1533–1543 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Holden M. T., Feil E. J., Lindsay J. A., Peacock S. J., Day N. P., Enright M. C., Foster T. J., Moore C. E., Hurst L., Atkin R., Barron A., Bason N., Bentley S. D., Chillingworth C., Chillingworth T., Churcher C., Clark L., Corton C., Cronin A., Doggett J., Dowd L., Feltwell T., Hance Z., Harris B., Hauser H., Holroyd S., Jagels K., James K. D., Lennard N., Line A., Mayes R., Moule S., Mungall K., Ormond D., Quail M. A., Rabbinowitsch E., Rutherford K., Sanders M., Sharp S., Simmonds M., Stevens K., Whitehead S., Barrell B. G., Spratt B. G., Parkhill J. (2004). Complete genomes of two clinical Staphylococcus aureus strains: evidence for the rapid evolution of virulence and drug resistance. Proc. Natl. Acad. Sci. U.S.A. 101, 9786–9791 10.1073/pnas.0403302101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Li L., Bannantine J. P., Zhang Q., Amonsin A., May B. J., Alt D., Banerji N., Kanjilal S., Kapur V. (2005). The complete genome sequence of Mycobacterium avium subspecies paratuberculosis. Proc. Natl. Acad. Sci. U.S.A. 102, 12344–12349 10.1073/pnas.0504799102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. McLean K. J., Belcher J., Driscoll M. D., Fernandez C. C., Le V. D., Bui S., Golovanova M., Munro A. W. (2010). The Mycobacterium tuberculosis cytochromes P450: physiology, biochemistry and molecular intervention. Future Med. Chem. 2, 1339–1353 10.4155/fmc.10.216 [DOI] [PubMed] [Google Scholar]
  17. Morita Y., Maruyama S., Kabeya H., Nagai A., Kozawa K., Kato M., Nakajima T., Mikami T., Katsube Y., Kimura H. (2004). Genetic diversity of the dnaJ gene in the Mycobacterium avium complex. J. Med. Microbiol. 53, 813–817 10.1099/jmm.0.45601-0 [DOI] [PubMed] [Google Scholar]
  18. Naser S. A., Ghobrial G., Romero C., Valentine J. F. (2004). Culture of Mycobacterium avium subspecies paratuberculosis from the blood of patients with Crohn’s disease. Lancet 364, 1039–1044 10.1016/S0140-6736(04)17058-X [DOI] [PubMed] [Google Scholar]
  19. Naser S. A., Shafran I., Schwartz D., El Zaatari F., Biggerstaff J. (2002). In situ identification of mycobacteria in Crohn’s disease patient tissue using confocal scanning laser microscopy. Mol. Cell. Probes 16, 41–48 10.1006/mcpr.2001.0395 [DOI] [PubMed] [Google Scholar]
  20. Over K., Crandall P. G., O’Bryan C. A., Ricke S. C. (2011). Current perspectives on Mycobacterium avium subsp. paratuberculosis, Johne’s disease, and Crohn’s disease: a review. Crit. Rev. Microbiol. 37, 141–156 10.3109/1040841X.2010.532480 [DOI] [PubMed] [Google Scholar]
  21. Paustian M. L., Kapur V., Bannantine J. P. (2005). Comparative genomic hybridizations reveal genetic regions within the Mycobacterium avium complex that are divergent from Mycobacterium avium subsp. paratuberculosis isolates. J. Bacteriol. 187, 2406–2415 10.1128/JB.187.7.2406-2415.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Perna N. T., Mayhew G. F., Posfai G., Elliott S., Donnenberg M. S., Kaper J. B., Blattner F. R. (1998). Molecular evolution of a pathogenicity island from enterohemorrhagic Escherichia coli O157-H7. Infect. Immun. 66, 3810–3817 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Qi W., Kaser M., Roltgen K., Yeboah-Manu D., Pluschke G. (2009). Genomic diversity and evolution of Mycobacterium ulcerans revealed by next-generation sequencing. PLoS Pathog. 5, e1000580. 10.1371/journal.ppat.1000580 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Quail M. A., Kozarewa I., Smith F., Scally A., Stephens P. J., Durbin R., Swerdlow H., Turner D. J. (2008). A large genome center’s improvements to the Illumina sequencing system. Nat. Methods 5, 1005–1010 10.1038/nmeth.1270 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Raizman E. A., Fetrow J. P., Wells S. J. (2009). Loss of income from cows shedding Mycobacterium avium subspecies paratuberculosis prior to calving compared with cows not shedding the organism on two Minnesota dairy farms. J. Dairy Sci. 92, 4929–4936 10.3168/jds.2009-2133 [DOI] [PubMed] [Google Scholar]
  26. Rocha E. P., Smith J. M., Hurst L. D., Holden M. T., Cooper J. E., Smith N. H., Feil E. J. (2006). Comparisons of dN/dS are time dependent for closely related bacterial genomes. J. Theor. Biol. 239, 226–235 10.1016/j.jtbi.2005.08.037 [DOI] [PubMed] [Google Scholar]
  27. Santema W., Overdijk M., Barends J., Krijgsveld J., Rutten V., Koets A. (2009). Searching for proteins of Mycobacterium avium subspecies paratuberculosis with diagnostic potential by comparative qualitative proteomic analysis of mycobacterial tuberculins. Vet. Microbiol. 138, 191–196 10.1016/j.vetmic.2009.03.021 [DOI] [PubMed] [Google Scholar]
  28. Semret M., Zhai G., Mostowy S., Cleto C., Alexander D., Cangelosi G., Cousins D., Collins D. M., van Soolingen D., Behr M. A. (2004). Extensive genomic polymorphism within Mycobacterium avium. J. Bacteriol. 186, 6332–6334 10.1128/JB.186.18.6332-6334.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Sevilla I., Li L., Amonsin A., Garrido J. M., Geijo M. V., Kapur V., Juste R. A. (2008). Comparative analysis of Mycobacterium avium subsp. paratuberculosis isolates from cattle, sheep and goats by short sequence repeat and pulsed-field gel electrophoresis typing. BMC Microbiol. 8, 204. 10.1186/1471-2180-8-204 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Shin S. J., Wu C.-W., Steinberg H., Talaat A. M. (2006). Identification of novel virulence determinants in Mycobacterium paratuberculosis by screening a library of insertional mutants. Infect. Immun. 7, 3825–3833 10.1128/IAI.01742-05 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Smith N. H., Hewinson R. G., Kremer K., Brosch R., Gordon S. V. (2009). Myths and misconceptions: the origin and evolution of Mycobacterium tuberculosis. Nat. Rev. Microbiol. 7, 485–491 10.1038/nrmicro2165 [DOI] [PubMed] [Google Scholar]
  32. Smole S. C., McAleese F., Ngampasutadol J., von Reyn C. F., Arbeit R. D. (2002). Clinical and epidemiological correlates of genotypes within the Mycobacterium avium complex defined by restriction and sequence analysis of hsp65. J. Clin. Microbiol. 40, 3374–3380 10.1128/JCM.40.9.3374-3380.2002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Talaat A. M., Reimschuessel R., Trucksis M. (1997). Identification of mycobacteria infecting fish to the species level using polymerase chain reaction and restriction enzyme analysis. Vet. Microbiol. 58, 229–237 10.1016/S0378-1135(97)00120-X [DOI] [PubMed] [Google Scholar]
  34. Tamura K., Peterson D., Peterson N., Stecher G., Nei M., Kumar S. (2011). MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 28, 2731–2739 10.1093/molbev/msr121 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Tanaka S., Sato M., Taniguchi T., Yokomizo Y. (1994). Histopathological and morphometrical comparison of granulomatous lesions in BALB/c and C3H/HeJ mice inoculated with Mycobacterium paratuberculosis. J. Comp. Pathol. 110, 381–388 10.1016/S0021-9975(08)80315-5 [DOI] [PubMed] [Google Scholar]
  36. Turenne C. Y., Semret M., Cousins D. V., Collins D. M., Behr M. A. (2006). Sequencing of hsp65 distinguishes among subsets of the Mycobacterium avium complex. J. Clin. Microbiol. 44, 433–440 10.1128/JCM.44.2.433-440.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Van B. L., Coudijzer K., Vlaemynck G., Hendrickx M., Michiels C., Messens W., Herman L., De B. J. (2010). Localization of Mycobacterium avium subspecies paratuberculosis in artificially inoculated milk and colostrum by fractionation. J. Dairy Sci. 93, 4722–4729 10.3168/jds.2009-2895 [DOI] [PubMed] [Google Scholar]
  38. Whittington R. J., Hope A. F., Marshall D. J., Taragel C. A., Marsh I. (2000). Molecular epidemiology of Mycobacterium avium subsp. paratuberculosis: IS900 restriction fragment length polymorphism and IS1311 polymorphism analyses of isolates from animals and a human in Australia. J. Clin. Microbiol. 38, 3240–3248 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Wu C.-W., Glasner J., Collins M. T., Naser S., Talaat A. M. (2006). Whole genome plasticity among Mycobacterium avium subspecies: insights from comparative genomic hybridizations. J. Bacteriol. 188, 711–723 10.1128/JB.188.2.711-723.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Wu C. W., Schramm T. M., Zhou S., Schwartz D. C., Talaat A. M. (2009). Optical mapping of the Mycobacterium avium subspecies paratuberculosis genome. BMC Genomics 10, 25. 10.1186/1471-2164-10-25 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Wynne J. W., Bull T. J., Seemann T., Bulach D. M., Wagner J., Kirkwood C. D., Michalski W. P. (2011). Exploring the zoonotic potential of Mycobacterium avium subspecies paratuberculosis through comparative genomics. PLoS ONE 6, e22171. 10.1371/journal.pone.0022171 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Wynne J. W., Seemann T., Bulach D. M., Coutts S. A., Talaat A. M., Michalski W. P. (2010). Resequencing the Mycobacterium avium subsp. paratuberculosis K10 genome: improved annotation and revised genome sequence. J. Bacteriol. 192, 6319–6320 10.1128/JB.00972-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Xu C., Zhou Y. F., Deng J. Y., Deng X., Guo Y. C., Cui Z. Q., Zhang Z. P., Wei H. P., Bi L. J., Zhang X. E. (2008). On-chip ligation of multiplexing probe-pairs for identifying point mutations out of dense SNP loci. Biosens. Bioelectron. 24, 818–824 10.1016/j.bios.2008.07.009 [DOI] [PubMed] [Google Scholar]

Articles from Frontiers in Microbiology are provided here courtesy of Frontiers Media SA

RESOURCES