Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2011 Apr 4;108(16):6526–6531. doi: 10.1073/pnas.1016708108

Dispersal of Mycobacterium tuberculosis via the Canadian fur trade

Caitlin S Pepperell a,1,2, Julie M Granka b,1, David C Alexander c, Marcel A Behr d, Linda Chui e, Janet Gordon f, Jennifer L Guthrie c, Frances B Jamieson c, Deanne Langlois-Klassen g, Richard Long g, Dao Nguyen d, Wendy Wobeser h, Marcus W Feldman b
PMCID: PMC3080970  PMID: 21464295

Abstract

Patterns of gene flow can have marked effects on the evolution of populations. To better understand the migration dynamics of Mycobacterium tuberculosis, we studied genetic data from European M. tuberculosis lineages currently circulating in Aboriginal and French Canadian communities. A single M. tuberculosis lineage, characterized by the DS6Quebec genomic deletion, is at highest frequency among Aboriginal populations in Ontario, Saskatchewan, and Alberta; this bacterial lineage is also dominant among tuberculosis (TB) cases in French Canadians resident in Quebec. Substantial contact between these human populations is limited to a specific historical era (1710–1870), during which individuals from these populations met to barter furs. Statistical analyses of extant M. tuberculosis minisatellite data are consistent with Quebec as a source population for M. tuberculosis gene flow into Aboriginal populations during the fur trade era. Historical and genetic analyses suggest that tiny M. tuberculosis populations persisted for ∼100 y among indigenous populations and subsequently expanded in the late 19th century after environmental changes favoring the pathogen. Our study suggests that spread of TB can occur by two asynchronous processes: (i) dispersal of M. tuberculosis by minimal numbers of human migrants, during which small pathogen populations are sustained by ongoing migration and slow disease dynamics, and (ii) expansion of the M. tuberculosis population facilitated by shifts in host ecology. If generalizable, these migration dynamics can help explain the low DNA sequence diversity observed among isolates of M. tuberculosis and the difficulties in global elimination of tuberculosis, as small, widely dispersed pathogen populations are difficult both to detect and to eradicate.

Keywords: genetic drift, demography, Mycobacterium tuberculosis genetics, Native Americans, Approximate Bayesian Computation


Migration of human pathogens between host populations is of clear biomedical interest, because global spread of infectious diseases is predicated on pathogen dispersal. For obligate human pathogens, the paths of migration are shaped by social networks among hosts and, at a broader scale, by human migrations and patterns of comingling. The shape and magnitude of these processes may be inferred from genetic data from both extant populations of humans (1, 2) and microorganisms carried by humans (3). Statistical genetic inferences about pathogen migration have been buttressed by a variety of independent data types, such as linguistic (3), epidemiological (4), and ecological (5).

Human tuberculosis (TB) is caused by Mycobacterium tuberculosis. By current estimates, M. tuberculosis infects one third of the world's population; a minority of these infections progress to disease and account for 9–10 million new, transmissible cases per year (6). Genetic data from M. tuberculosis are characterized by low DNA sequence diversity (7) and significant population subdivision, both at continental (8, 9) and fine geographic scales (10). TB is characterized by variable transmissibility, depending on environmental conditions, and irregular disease dynamics (11, 12). Variation in temporal dynamics of disease is introduced by the phenomenon of clinical latency, whereby a host may be infected with M. tuberculosis for decades before reactivating the infection, becoming ill, and thus able to transmit the infection to others. These features and other aspects of disease ecology would be expected to affect migration of M. tuberculosis among host populations. Broad outlines of M. tuberculosis migratory history have been described based on the geographic range of specific bacterial clades and lineages (9). However, very little is known about M. tuberculosis migratory patterns and rates, population dynamics during migration, or effects of migratory history on M. tuberculosis population genetic diversity.

In this paper, we present evidence suggesting that M. tuberculosis dispersed into Canadian Aboriginal populations as a result of contact with European fur traders in the 18th century. Although contact between the populations involved small numbers of individuals, it was extensive enough to result in the development of the Métis society of both European and Aboriginal ancestry. (Additional historical details are given in SI Background Material.) Large-scale TB epidemics were not evident among Western Canadian Aboriginal populations until the late 19th and 20th centuries (10, 1315), suggesting that epidemics may be uncoupled from the process of M. tuberculosis dispersal.

Our analyses of historical, epidemiological, and M. tuberculosis genetic data suggest that (i) M. tuberculosis may be spread by small numbers of human migrants; (ii) M. tuberculosis populations can persist at low levels over historical time scales; (iii) these small bacterial populations may be sustained by ongoing migration and possibly by slow disease dynamics; and (iv) shifts in host ecology favoring the pathogen may be accompanied by bacterial population expansions, with marked effects on genetics of M. tuberculosis populations.

Results

Results of M. tuberculosis population screening for lineage-defining polymorphisms (16, 17) are shown in Fig. 1 and Table S1. DS6Quebec is the most frequent lineage among bacteria from all four populations: the French Canadian population of Quebec (QU), and Aboriginal populations in Ontario (ON), Saskatchewan (SK), and Alberta (AB). Within Saskatchewan, DS6Quebec was found in all of eight intraprovincial regions (fine-scale geographic data are available only for this population; see ref. 10 for definitions of within-Saskatchewan regions). Frequencies of other M. tuberculosis lineages were more variable among populations. A minimum spanning tree, based on 12 minisatellite loci from bacteria with the DS6Quebec polymorphism, reveals a distinctly star-like network structure (Fig. 2), with little evidence of population differentiation. Networks based on minisatellite data within other common lineages (Rd 182, Rd 219, and H37Rv-like) were not star-like (Fig. S1). Lineage-specific patterns also were evident in analyses of population genetic structure (Analysis of Molecular Variance, AMOVA; Table 1). Overall, a modest level of differentiation was observed among populations [fraction of minisatellite variation among populations (FST) = 0.08]. However, differentiation varied substantially between lineages, with the least variation seen within the DS6Quebec lineage (FST = 0.04) and the greatest within the H37Rv-like lineage (FST = 0.52). These results suggest that (i) the DS6Quebec lineage was introduced into these populations in the relatively remote past, resulting in a similarly high frequency across populations; (ii) M. tuberculosis belonging to the DS6Quebec lineage were, or are, exchanged freely among study populations; (iii) the DS6Quebec lineage underwent rapid population expansion, reflected in a star-like network topology, possibly coincident with its introduction into new host populations; and (iv) there has been temporal and/or spatial variation in patterns of bacterial migration among human populations, reflected in variable differentiation within bacterial lineages.

Fig. 1.

Fig. 1.

Fur trade geography, regional frequencies of M. tuberculosis lineages, and historical timeline. (A) Map of Canada from Natural Resources Canada (49). The main fur trade canoe route from the St. Lawrence River to the Beaufort Sea (Montreal route) is shown in light blue. Canoe routes from Hudson's Bay to the interior are shown in red. Geography of canoe routes is based on a map by A. Ray in ref. 24 and was checked against a map of fur trade posts (50). Geography of railways, steamship lines, and areas classified as remote/traditional on the basis of archival evidence (all ca. 1920) are from figure 38 in ref. 21 and ref. 24. Proportional lineage frequencies of M. tuberculosis isolates are shown as pie charts in the corresponding province (see also Table S1). Nomenclature of M. tuberculosis lineages is from refs. 16 and 17. Pie charts are the same size for clarity, although total sample sizes differed among populations. Lineage frequencies were unavailable for the MB population. (B) Events shown in the timeline are the founding of New France (Quebec) in 1608; incursion of fur traders to the Northwest around 1710; British conquest of New France (and end of migration from France to Quebec) in 1760; merger of North West Company and Hudson's Bay Company (HBC) in 1820 (with subsequent abandonment of the Montreal route in favor of Hudson's Bay routes); completion of the Canadian Pacific Railway (CPR) in 1885, by which time Western buffalo herds were severely depleted; and, finally, widespread use of bush planes to reach remote areas, starting in the 1930s. Gray boxes indicate the estimated timing of processes in M. tuberculosis demographic history: dispersal of M. tuberculosis to indigenous populations (light gray), and expansion of M. tuberculosis populations as a result of shifts in host ecology (dark gray).

Fig. 2.

Fig. 2.

Minimum spanning tree of minisatellite haplotypes (12 loci) from M. tuberculosis within the DS6Quebec lineage, in QU, ON, SK, and AB populations. The size of nodes is proportional to the frequency of TB cases associated with that M. tuberculosis haplotype. Scale is indicated by the line at the bottom of the figure, which represents a difference of one minisatellite repeat. Minisatellites were not available from the MB population.

Table 1.

Within- and between-lineage AMOVA based on 12 minisatellite loci for M. tuberculosis from Quebec, Ontario, Saskatchewan, and Alberta

Bacterial lineage Source of variation D.f. SS Variance component % Variation FST
DS6Quebec Among populations 3 16.25 0.04 3.81 0.04
Within populations 583 521.26 0.89 96.19
H37Rv-like Among populations 3 181.68 0.89 51.56 0.52
Within populations 306 256.44 0.84 48.44
Rd 182 Among populations 3 8.26 0.33 22.26 0.22
Within populations 59 67.95 1.15 77.74
Rd 219 Among populations 3 14.28 0.25 20.97 0.21
Within populations 65 60.68 0.93 79.03
All lineages Among populations 3 75.54 0.10 7.95 0.08
Within populations 1,040 1227.16 1.18 92.05

D.f., degrees of freedom; FST, fraction of variation among populations; SS, sum of squares.

Although the Quebec population is now geographically and socially isolated from Aboriginal populations in Ontario, Saskatchewan, and Alberta, there is a history of comingling in the context of the Canadian fur trade (Fig. 1). Table 2 outlines the expected history of the DS6Quebec lineage, assuming it was introduced to Canada by a French immigrant to Quebec and dispersed to Aboriginal populations via fur trade transportation and social networks. Estimates derived from M. tuberculosis minisatellite data of the time to most recent common ancestor (TMRCA) (18) and divergence time between bacterial populations (TD) (19) suggest that genetic exchange between QU and Western Aboriginal M. tuberculosis populations occurred over ∼100 y; timing of this exchange is consistent with historical human migrations connected with the trade in furs. (Additional estimates are given in SI Results.)

Table 2.

Genetic timing estimates for DS6Quebec M. tuberculosis lineage in calendar years

Parameter Historical correlate Point estimate(s)* Confidence interval Historical prediction
TMRCA (QU) French migration to Quebec 1740 1709–1771 1608–1760
TMRCA (SK) Fur trade expansion west 1797 1777–1815 1730–1870
TMRCA (AB) 1779 1751–1804 1750–1870
TD (QU/SK) Separation of populations 1789, 1884, 1910 1788–1979 1870
TD (QU/AB) 1779, 1901, 1885 1805–1966 1870

*Three point estimates are shown for TD. The first is the earliest bound for divergence, where a variance of 0 is assumed for minisatellite repeat number in the ancestral population. For the second estimate, we assume that variance in the ancestral population is equal to the variance of haplotypes found presently in the QU population and shared with the founded population (SK or AB). The third point estimate is based on an assumption that variance in the ancestral population was equal to the variance found presently in haplotypes in the founded population (SK or AB) that are also found in QU.

Results presented for TMRCA assume a star-like genealogy. TMRCA confidence intervals are larger if constant population size or exponential growth are assumed (SI Results).

Estimate of timing of M. tuberculosis population events, based on historical (nongenetic) data. References for date ranges are Charbonneau (51) for timing of migration to Quebec from France and Innis (24) for other dates. The earlier bounds for fur trade expansion into Saskatchewan and Alberta are based on the time of establishment of fur trade posts in these regions. Note that this timing is later than expansion into the Northwest as a whole (ca. 1710), which includes Western Ontario and Manitoba. Timing of separation of populations is based on analyses of Innis (24) indicating that the Hudson's Bay Company shifted to less labor-intensive methods of extracting furs (and thus had no need of French Canadian voyageurs), starting about this time. Analyses of interprovincial migration from the 1870s to the early 20th century are also consistent with minimal migration of French Canadians to the Western provinces (25).

Levels of overall genetic diversity of M. tuberculosis restriction fragment length polymorphism (RFLP) (20) haplotypes from four populations are shown in rarefaction curves in Fig. 3A. Published data were available from Manitoba (MB) and are included along with data from QU, SK, and AB; data from ON are not included because of the small sample size (SI Results). The number of distinct haplotypes is highest in QU, consistent with its being a source population for pathogen migration. Levels of diversity are lowest in SK and MB, with intermediate diversity in AB. Diversity thus does not decrease in an east-to-west pattern, as we might expect with a serial founder effect (1) originating in Quebec.

Fig. 3.

Fig. 3.

Genetic diversity of M. tuberculosis populations. (A) Number of distinct RFLP haplotypes as a function of the number of sampled chromosomes obtained using rarefaction (Materials and Methods). Populations include QU, MB, SK, and AB; ON is not included because of its low sample size (SI Results). Every fourth data point is presented for clarity. (B) RFLP haplotype diversity (Shannon index), correcting for sample size by repeatedly sampling the total number of isolates in the smallest sample from each population (NRS, n = 123). Boxplots indicate values obtained over all samples; value for the smallest sample (NRS) is indicated by a line. Populations are as in the left panel, except that SK is split into RS and NRS populations. Although Manitoba contains both remote and nonremote regions (Fig. 1), detailed geographic data were not available for the MB sample; diversity shown here is for the entire sample.

Epidemics of TB among Canadian indigenous populations occuring post-European contact have been associated with dramatic social, economic, and environmental changes that characterized the industrial era (late 19th century and beyond; SI Background Material). There is evidence of regional differences in the pace of these changes: Published analyses of archival fur trade documents indicate that some indigenous populations remained remote from the cash-based industrial economy as late as the 1920s (21). Regions within Saskatchewan and Manitoba fall into this historically remote/traditional category, whereas Alberta does not contain any such regions (Fig. 1A). Later development of transportation networks (starting with the widespread use of bush planes in the 1930s) allowed commercial development of previously remote regions (21).

Given associations between industrialization, attendant shifts in host ecology, and TB epidemic expansion among indigenous populations (SI Background Material), we hypothesized that M. tuberculosis diversity in Saskatchewan and Manitoba was low relative to Alberta because of the more recent expansion of the M. tuberculosis population in remote/traditional areas. To test this hypothesis, minisatellite and RFLP haplotypes from SK were divided into historically “remote” (RS) and “nonremote” (NRS) regions according to the classification of trading districts outlined in ref. 21. Consistent with this classification, diversity in NRS regions is similar to that in AB (Alberta contains no remote regions), with the lowest diversity observed in RS (Fig. 3B and Fig. S2). Permutation procedures identified RS and QU as having lower and greater diversity, respectively, than would be expected under a random distribution of minisatellite haplotypes among populations (Fig. S3).

We analyzed DS6Quebec minisatellite data with rejection sampling, an Approximate Bayesian Computation (ABC) method, to assess whether a demographic model allowing an expansion in RS (in which the historical bacterial effective population size, Ne, was smaller than the current Ne) is more likely than a null model of constant size. We estimated the parameter ω, the ratio of historical Ne to contemporary Ne (Materials and Methods and SI Materials and Methods). Results using this method indicate that the posterior probability of the expansion model (versus the null constant-size model) is 1; based on simulations under the null model, the P value of this posterior probability is 0. We estimate ω to be 0.08 (95% credibility interval 0.01–0.29; Fig. S4A). Translating these results to absolute numbers (current Ne = 30) generates an estimate of effective number of TB cases in RS before 1930 (the time fixed for the expansion, based on the social history described above) equal to 2.35 individuals (95% credibility interval 0.35–8.80). SI Results gives parameter estimations under alternative mutation rate assumptions, all of which reject the constant-size model in favor of the expansion model.

Based on the size and dynamics of the population involved in the French (Montreal-based) trade in the West before 1870 and the prevalence of TB in European cities in the 18th century, we estimated the absolute number of M. tuberculosis infections transmitted to Western indigenous populations (SI Results and Table S2). The historical estimate of Nem, the product of population size (Ne) and fraction of the indigenous M. tuberculosis population replaced by immigrants (m) per generation, is 0.16. In an island model, Nem <1 results in substantial population differentiation caused by genetic drift (22). Estimates of Nem derived from bacterial minisatellite data were higher than this historical estimate: For pooled lineages, Nem derived from FST (AMOVA) is 6.16, and the private alleles method of Barton and Slatkin (23) generated an estimate of 3.62 (empirical 95% confidence interval 2.47–4.72).

Discussion

The recent history of human migration to Western Canada informs the analyses of M. tuberculosis genetic data presented here. Most importantly, substantial contact between French Canadian and Western indigenous populations is limited to a specific historical period, the fur trade era. The early bound on this period of contact is provided by the dates of incursion of fur traders into the Western interior [∼1710 (24)], and the later bound is provided by historical and demographic analyses indicating that westward migration of French Canadians ceased in the latter half of the 19th century for economic and other reasons (25).

Based on the observed pattern of M. tuberculosis lineage frequencies in this historical context, as well as bacterial population genetic diversities consistent with Quebec as a source population, genetic timing estimates, and patterns of genetic differentiation between populations, we infer that the DS6Quebec lineage was dispersed to indigenous populations by French Canadian fur traders about a century before epidemic forms of TB were manifest in Western Aboriginal communities.

Several features of the human migration associated with trade in furs are noteworthy. First, the absolute number of human migrants was small: By our estimate, 5,419 individuals migrated from east to west during 160 y of trade between Montreal and Western Canadian indigenous populations (SI Results). These early migrations are dwarfed by population movements of the late 19th and early 20th centuries. Facilitated by massive interprovincial and international migration to the Canadian prairies—from which French Canadians were largely absent—the census of Western Canada grew from 110,000 in 1871 to 750,000 in 1911 (26). Between 1900 and 1917, a total of 1,671,414 foreign-born individuals migrated to Western Canada from the United Kingdom, the Ukraine, Russia, Germany, Austria, Hungary, Norway, Denmark, China, and elsewhere (27). Despite the enormous number of migrants from regions of high TB incidence, who would be expected to introduce a broad range of Euro-American and East Asian M. tuberculosis lineages (8), bacterial populations in Western indigenous communities are dominated by the DS6Quebec lineage, present at a frequency similar to that in the French Canadian population of Quebec (Fig. 1A).

The apparent lack of M. tuberculosis gene flow from 19th century homesteaders to indigenous populations may be explained by social distance between populations. International migrants to the prairies, particularly those facing language and cultural barriers to assimilation, were socially and geographically isolated (27). By the late 19th century, prairie indigenous populations also were socially segregated, at times forcibly (15, 27), from the non-Aboriginal population. This isolation is in contrast to the earlier fur trade era, which was characterized by intermarriage and trading collaborations between European immigrants and First Nations groups (ref. 28 and SI Background Material). Although contact between populations involved small numbers of individuals, we speculate that close social ties between the sending and receiving host populations permitted migration of M. tuberculosis through the fur trade. Given that transmission of M. tuberculosis requires sustained, close contact [exemplified by efficient transmission in high-density shared living environments such as prisons (11)], this observation is likely to be generalizable, with structure of global M. tuberculosis populations strongly influenced by the social architecture of host populations.

Although by no means a disease-free interval for Native peoples (there were devastating epidemics of smallpox and other infectious diseases), epidemics of TB were not a feature of the fur trade era (1710–1870). TB epidemics among Western Canadian indigenous populations occurred later, starting in the late 1800s (10, 1315). Expansion of rail and steamship networks into Western Canada in the late 19th century permitted agricultural development, industrial-scale extraction of natural resources, development of government institutions, and mass immigration. TB epidemics were among a chain of sequelae for Aboriginal populations that included displacement, loss of traditional food sources, crowding, and institutionalization (ref. 29 and SI Background Material).

Indigenous populations in some regions (e.g., Northern Saskatchewan and Manitoba; the shaded areas of Fig. 1A) remained remote from the evolving industrial economy into the 1920s (21). We find that the genetic diversity of M. tuberculosis in lowest in RS and is highest in QU (Fig. 3); we did not have detailed geographic data that would allow us to classify M. tuberculosis strains from Manitoba according to the scheme we used in Saskatchewan (i.e., NRS vs. RS). We note that molecular epidemiological studies of other historically remote/traditional regions are consistent with low genetic diversity of the M. tuberculosis population (30, 31).

Differences in genetic diversity may result from two distinct phenomena. First, the process of M. tuberculosis migration is likely to involve serial founder events, so that diversity is highest in the source population (QU) and lower in the subsequently founded populations (ON, RS, NRS, AB, and MB). Another explanation is that QU, the oldest M. tuberculosis population (founded by 17th and 18th century migrants from France; Table 2), may be closer to an equilibrium distribution of haplotype frequencies; the youngest populations still may show evidence of a “founder flush” (32). As an historically remote/traditional population, RS was probably shielded from at least some of the ecological antecedents of epidemic TB until the regional incursion of air travel networks in the 1930s (21) and thus had the most recent expansion in its pathogen population. Demographic modeling with ABC demonstrated a very high probability of expansion in the RS M. tuberculosis population since 1930; the magnitude of the bacterial expansion was ∼13-fold.

Although our historical estimates of M. tuberculosis migration rates were low (direct Nem = 0.16), the lack of genetic differentiation among populations (DS6Quebec FST = 0.04) and estimates of Nem from the genetic data (indirect Nem = 3.62) would suggest a high rate of M. tuberculosis migration. Slatkin (33, 34) has observed that indirect (genetic) estimates of gene flow may be higher than direct measurements of population dispersal, especially in populations characterized by colonization-extinction-recolonization dynamics, which can limit population differentiation even when rates of migration are low. Taken together, our results suggest that absolute numbers of M. tuberculosis migration events were low (Nem <1, an order-of-magnitude estimate based on historical data), and patterns of genetic differentiation have been affected by unstable dynamics of small, preindustrial M. tuberculosis populations.

We have delineated two asynchronous processes involved in the spread of TB from European immigrants to native Canadians (Fig. 1B). The first is dispersal of the etiologic agent, M. tuberculosis, populations of which were sustained at very low levels (Ne ≅ 2) for ∼100 y by small numbers of human migrants who had intimate, sustained contact with susceptible hosts. In addition to sustained migration, variable transmission dynamics of TB may have cushioned small bacterial populations against extinction (35). The second process is expansion of the bacterial population, following a shift in host ecology favoring the pathogen. We find evidence of bacterial population expansions in the DS6Quebec M. tuberculosis haplotype network (Fig. 2), patterns of genetic diversity (Fig. 3), and coalescent-based demographic analysis of bacterial minisatellite data.

This work is a study of a specific historical phenomenon, and it is unknown whether the observed patterns of M. tuberculosis migration are applicable to other settings. Asynchronicity of pathogen migration and epidemiological phenomena is likely to render control of TB more difficult and could help explain global persistence of the pathogen despite extensive efforts at eradication. Because M. tuberculosis is an obligate human pathogen that requires specific environmental conditions and sustained contact for transmission, barriers to geographic spread of M. tuberculosis are largely social and therefore somewhat fluid. Perseverance of tiny bacterial populations would permit gradual accrual of a large geographic range despite significant population subdivision. As a result, by the time the spread of TB is obvious, M. tuberculosis populations already may be well established.

Materials and Methods

Population Descriptions.

Clinical isolates of M. tuberculosis in this study are from five Canadian populations: the French Canadian population of Quebec (QU), and Aboriginal populations in Ontario (ON), Manitoba (MB), Saskatchewan (SK), and Alberta (AB). The QU sample (n = 297) is described in ref. 16. The ON sample (n = 45) derives from TB cases that occurred between 1997–2009 in First Nations communities in a single region of the province (total population ∼25,000). MB data (n = 163) are from a published study (36): The total number of TB cases, number of different bacterial RFLP haplotypes, number of singletons, and the configuration distribution of the five most common M. tuberculosis haplotypes from First Nations reserve communities in Manitoba (1992–1999) are reported. We made the conservative assumption that the remaining (unreported) haplotypes were doubletons. The SK sample (n = 444) is described in ref. 10. AB samples (n = 283) are from TB cases among First Nations individuals in the province, excluding urban areas, from 1990–2008.

Genotyping Methods.

Isolates of M. tuberculosis were genotyped by RFLP, based on the number and location of IS6110 elements (20). The number of tandem repeats at 12 minisatellite loci also was determined for each isolate using a standard methodology (37). M. tuberculosis lineage typing, based on genomic deletions (16, 17), was done with real-time PCR (details are given in SI Materials and Methods and Dataset S1).

Mutation Rate Estimate.

We estimated a mutation rate (μ) per transmission generation (10) for minisatellite loci based on simulations using empirical estimates of intrahost M. tuberculosis population dynamics and in vitro estimates of mutations per cell doubling for bacterial minisatellite loci. Given a known per annum RFLP mutation rate (38), we made additional estimates by examining the number of RFLP types per minisatellite haplotype and comparing estimates of θ (θ = 2 Neμ, product of effective population size, Ne, and mutation rate, μ) from RFLP and minisatellite data. All estimates were of a similar order of magnitude and were consistent with published estimates (39); we used 0.001 mutations per locus (SI Materials and Methods, Table S3).

Statistical Calculations.

Genetic differentiation.

AMOVA, implemented in Arlequin version 3.5 (40), was used to calculate FST values from M. tuberculosis minisatellite haplotype data.

Network analysis.

BioNumerics 5.0 (Applied Maths) was used to generate minimum spanning trees of M. tuberculosis minisatellite haplotypes. This program implements the Prim–Jarnik algorithm; the BURST priority rule maximizing single- and double-locus variants, was used during network searches.

Genetic timing estimates.

We estimated TMRCA of the DS6Quebec lineage in each population using minisatellite genotypes and the method of Ytime (18). We selected as the root the haplotype at the center of the DS6Quebec network (Fig. 2), present at high frequency in QU, SK, and AB populations. We obtained bootstrap confidence intervals with Ytime assuming a star-like genealogy and neutrality (SI Materials and Methods). Divergence times for DS6Quebec isolates between pairs of populations were estimated from minisatellite genotypes by the TD estimator, which is robust to population size changes and weak gene flow (SI Materials and Methods) (19). This procedure requires an estimate of the average variance in repeat number in the ancestral population at the time of divergence (V0); we estimated TD using three different values of V0 for each pair of populations (Results and SI Materials and Methods). Alternative mutation rate and demographic assumptions have little effect on our main conclusions (SI Results).

Population genetic diversity.

We assessed diversity of minisatellite and RFLP haplotypes in each population, for all lineages, by calculating the number of distinct haplotypes using rarefaction (41), calculating haplotype diversity with resampling to the lowest sample size to correct for unequal sample sizes (42), and implementing a permutation procedure to assess whether observed diversities of minisatellite haplotypes could be explained by a random partitioning of isolates from all populations (details are given in SI Materials and Methods).

Migration.

Under a simple island model, assuming an equilibrium population where each “island” (QU, SK, and AB) has equal size Ne, we applied Slatkin's private alleles method with minisatellite haplotype data from all lineages to estimate Nem, where m is the probability that an individual is a migrant in each generation (33, 43). Nem estimates were converted to FST values, and vice versa, using the standard expectation for a haploid population FST = 1/(1+2Nm) (43) (SI Materials and Methods).

Demographic Modeling.

Motivated by the results of the diversity calculations, we used minisatellite genotypes and rejection sampling, an ABC procedure, to estimate the factor (ω) by which the size of the DS6Quebec lineage in Remote Saskatchewan (RS) before its likely expansion is related to its current size.

Assuming a constant population size after the expansion, we estimated the current effective population size of RS as 544 (multiplying the harmonic mean of fluctuating DS6Quebec TB case counts in RS from 1986–2004 by 18, the number of years over which exhaustive samples were obtained). This epidemiological estimate of Ne was interpreted in the standard population genetic manner (discussed further in SI Materials and Methods). One million coalescent simulations of minisatellite haplotypes under a population-expansion model were performed using SIMCOAL 2.0 (44, 45). The time of expansion was fixed at 54 generations, with ω drawn from a prior Uniform distribution on [0.01, 1]. The simulated data were summarized with four statistics (number of distinct and singleton haplotypes, haplotype diversity, and mean variance in locus repeat sizes), which produced reliable estimates in our method validation (SI Materials and Methods) and have been used in previous studies (2, 4648). Values of ω resulting in simulations with summary statistics within a threshold Euclidean distance from those observed in RS were retained to produce a posterior distribution. We also estimated the posterior probabilities of the population expansion and null constant-size models by the acceptance probabilities of each model.

We tested the method's performance using simulated data with known expansion sizes. The method is most powerful for pronounced expansions (ω ∼ 0.1); with high accuracy, it both can estimate ω and can distinguish between the population-expansion and constant-size models (SI Materials and Methods and Fig. S5).

Supplementary Material

Supporting Information

Acknowledgments

We thank V. Hoeppner for providing data and bacterial strains, T. MacMillan and A.D. Popescu for data collection; A. Avelar, T. Van, E. Heinemeyer, and C.-Y. Wu for technical assistance; and G. Dolganov for help with real-time PCR assays. This work was supported by National Institutes of Health Grants 5K08AI67458-2 (to C.S.P.) and GM28016 (to M.W.F.). J.M.G. is the recipient of a National Science Foundation Graduate Research Fellowship.

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1016708108/-/DCSupplemental.

References

  • 1.Ramachandran S, et al. Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. Proc Natl Acad Sci USA. 2005;102:15942–15947. doi: 10.1073/pnas.0507611102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Ray N, et al. A statistical evaluation of models for the initial settlement of the american continent emphasizes the importance of gene flow with Asia. Mol Biol Evol. 2010;27:337–345. doi: 10.1093/molbev/msp238. [DOI] [PubMed] [Google Scholar]
  • 3.Moodley Y, et al. The peopling of the Pacific from a bacterial perspective. Science. 2009;323:527–530. doi: 10.1126/science.1166083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Viboud C, et al. Synchrony, waves, and spatial hierarchies in the spread of influenza. Science. 2006;312:447–451. doi: 10.1126/science.1125237. [DOI] [PubMed] [Google Scholar]
  • 5.Biek R, Henderson JC, Waller LA, Rupprecht CE, Real LA. A high-resolution genetic signature of demographic and spatial expansion in epizootic rabies virus. Proc Natl Acad Sci USA. 2007;104:7993–7998. doi: 10.1073/pnas.0700741104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.WHO . Global Tuberculosis Control. Geneva: World Health Organization; 2009. Global tuberculosis control: A short update to the 2009 report. [Google Scholar]
  • 7.Achtman M. Evolution, population structure, and phylogeography of genetically monomorphic bacterial pathogens. Annu Rev Microbiol. 2008;62:53–70. doi: 10.1146/annurev.micro.62.081307.162832. [DOI] [PubMed] [Google Scholar]
  • 8.Hirsh AE, Tsolaki AG, DeRiemer K, Feldman MW, Small PM. Stable association between strains of Mycobacterium tuberculosis and their human host populations. Proc Natl Acad Sci USA. 2004;101:4871–4876. doi: 10.1073/pnas.0305627101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hershberg R, et al. High functional diversity in Mycobacterium tuberculosis driven by genetic drift and human demography. PLoS Biol. 2008;6:e311. doi: 10.1371/journal.pbio.0060311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Pepperell C, et al. Bacterial genetic signatures of human social phenomena among M. tuberculosis from an Aboriginal Canadian population. Mol Biol Evol. 2010;27:427–440. doi: 10.1093/molbev/msp261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Iseman MD. A Clinician's Guide to Tuberculosis. Philadelphia: Lippincott Williams and Wilkins; 2000. [Google Scholar]
  • 12.Aparicio JP, Castillo-Chavez C. Mathematical modelling of tuberculosis epidemics. Math Biosci Eng. 2009;6:209–237. doi: 10.3934/mbe.2009.6.209. [DOI] [PubMed] [Google Scholar]
  • 13.Daschuk JW, Hackett P, MacNeil S. Treaties and tuberculosis: First Nations people in late 19th-century western Canada, a political and economic transformation. Can Bull Med Hist. 2006;23:307–330. doi: 10.3138/cbmh.23.2.307. [DOI] [PubMed] [Google Scholar]
  • 14.Ferguson R. Tuberculosis among the Indians of the Great Canadian Plains. London: Adlard and Sons Ltd.; 1928. [Google Scholar]
  • 15.Lux M. Medicine that Walks: Disease, Medicine, and Canadian Plains Native People, 1880-1940. Toronto: Univ of Toronto Press; 2001. [Google Scholar]
  • 16.Nguyen D, et al. Genomic characterization of an endemic Mycobacterium tuberculosis strain: Evolutionary and epidemiologic implications. J Clin Microbiol. 2004;42:2573–2580. doi: 10.1128/JCM.42.6.2573-2580.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Gagneux S, et al. Variable host-pathogen compatibility in Mycobacterium tuberculosis. Proc Natl Acad Sci USA. 2006;103:2869–2873. doi: 10.1073/pnas.0511240103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Behar DM, et al. Multiple origins of Ashkenazi Levites: Y chromosome evidence for both Near Eastern and European ancestries. Am J Hum Genet. 2003;73:768–779. doi: 10.1086/378506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Zhivotovsky LA. Estimating divergence time with the use of microsatellite genetic distances: Impacts of population growth and gene flow. Mol Biol Evol. 2001;18:700–709. doi: 10.1093/oxfordjournals.molbev.a003852. [DOI] [PubMed] [Google Scholar]
  • 20.van Embden JD, et al. Strain identification of Mycobacterium tuberculosis by DNA fingerprinting: Recommendations for a standardized methodology. J Clin Microbiol. 1993;31:406–409. doi: 10.1128/jcm.31.2.406-409.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ray AJ. The Canadian Fur Trade in the Industrial Age. Toronto: Univ of Toronto Press; 1990. [Google Scholar]
  • 22.Wright S. Evolution in Mendelian populations. Genetics. 1931;16:97–159. doi: 10.1093/genetics/16.2.97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Barton NH, Slatkin M. A quasi-equilibrium theory of the distribution of rare alleles in a subdivided population. Heredity. 1986;56:409–415. doi: 10.1038/hdy.1986.63. [DOI] [PubMed] [Google Scholar]
  • 24.Innis HA. The Fur Trade in Canada. Toronto: Univ of Toronto Press; 1930. Ed 1970 reprinted 1999 with a new introductory essay. [Google Scholar]
  • 25.Green A, MacKinnon M, Minns C. Conspicuous by their absence: French Canadians and the Settlement of the Canadian West. J Econ Hist. 2005;65:822–849. [Google Scholar]
  • 26.Urquhart MC, Buckley KAH, editors. Historical Statistics of Canada. Toronto: Macmillan; 1965. [Google Scholar]
  • 27.Waiser B. Saskatchewan: A New History. Calgary,AB, Canada: Fifth House; 2005. [Google Scholar]
  • 28.Francis D. Traders and Indians. In: Francis RD, editor. The Prairie West Historical Readings. Edmonton, AB, Canada: Pica Pica; 1985. pp. 58–70. [Google Scholar]
  • 29.Friesen G. The Canadian Prairies: A History. Toronto: Univ of Toronto Press; 1984. [Google Scholar]
  • 30.Kunimoto D, Chedore P, Allen R, Kasatiya S. Investigation of tuberculosis transmission in Canadian Arctic Inuit communities using DNA fingerprinting. Int J Tuberc Lung Dis. 2001;5:642–647. [PubMed] [Google Scholar]
  • 31.Nguyen D, et al. Tuberculosis in the Inuit community of Quebec, Canada. Am J Respir Crit Care Med. 2003;168:1353–1357. doi: 10.1164/rccm.200307-910OC. [DOI] [PubMed] [Google Scholar]
  • 32.Templeton A. Population Genetics and Microevolutionary Theory. Hoboken, NJ: John Wiley & Sons; 2006. [Google Scholar]
  • 33.Slatkin M. Gene flow in natural populations. Annu Rev Ecol Syst. 1985;16:393–430. [Google Scholar]
  • 34.Slatkin M. Gene flow and the geographic structure of natural populations. Science. 1987;236:787–792. doi: 10.1126/science.3576198. [DOI] [PubMed] [Google Scholar]
  • 35.Blaser MJ, Kirschner D. The equilibria that allow bacterial persistence in human hosts. Nature. 2007;449:843–849. doi: 10.1038/nature06198. [DOI] [PubMed] [Google Scholar]
  • 36.Blackwood KS, Al-Azem A, Elliott LJ, Hershfield ES, Kabani AM. Conventional and molecular epidemiology of tuberculosis in Manitoba. BMC Infect Dis. 2003 doi: 10.1186/1471-2334-3-18. 10.1186/1471-2334-3-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Supply P, et al. Proposal for standardization of optimized mycobacterial interspersed repetitive unit-variable-number tandem repeat typing of Mycobacterium tuberculosis. J Clin Microbiol. 2006;44:4498–4510. doi: 10.1128/JCM.01392-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Tanaka MM, Rosenberg NA. Optimal estimation of transposition rates of insertion sequences for molecular epidemiology. Stat Med. 2001;20:2409–2420. doi: 10.1002/sim.910. [DOI] [PubMed] [Google Scholar]
  • 39.Reyes JF, Tanaka MM. Mutation rates of spoligotypes and variable numbers of tandem repeat loci in Mycobacterium tuberculosis. Infect Genet Evol. 2010;10:1046–1051. doi: 10.1016/j.meegid.2010.06.016. [DOI] [PubMed] [Google Scholar]
  • 40.Excoffier L, Lischer HEL. Arlequin suite ver 3.5: A new series of programs to perform population genetics analyses under Linux and Windows. Molecular Ecology Resources. 2010 doi: 10.1111/j.1755-0998.2010.02847.x. in press. [DOI] [PubMed] [Google Scholar]
  • 41.Kalinowski ST. Counting alleles with rarefaction: Private alleles and hierarchical sampling designs. Conserv Genet. 2004;5:539–543. [Google Scholar]
  • 42.Shannon CE. A mathematical theory of communication. Bell Syst Tech J. 1948;27:379–423. [Google Scholar]
  • 43.Slatkin M, Barton NH. A comparison of three indirect methods for estimating average levels of gene flow. Evolution. 1989;43:1349–1368. doi: 10.1111/j.1558-5646.1989.tb02587.x. [DOI] [PubMed] [Google Scholar]
  • 44.Excoffier L, Novembre J, Schneider S. SIMCOAL: A general coalescent program for the simulation of molecular data in interconnected populations with arbitrary demography. J Hered. 2000;91:506–509. doi: 10.1093/jhered/91.6.506. [DOI] [PubMed] [Google Scholar]
  • 45.Laval G, Excoffier L. SIMCOAL 2.0: A program to simulate genomic diversity over large recombining regions in a subdivided population with a complex history. Bioinformatics. 2004;20:2485–2487. doi: 10.1093/bioinformatics/bth264. [DOI] [PubMed] [Google Scholar]
  • 46.Estoup A, Wilson IJ, Sullivan C, Cornuet JM, Moritz C. Inferring population history from microsatellite and enzyme data in serially introduced cane toads, Bufo marinus. Genetics. 2001;159:1671–1687. doi: 10.1093/genetics/159.4.1671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Beaumont MA, Zhang W, Balding DJ. Approximate Bayesian computation in population genetics. Genetics. 2002;162:2025–2035. doi: 10.1093/genetics/162.4.2025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Estoup A, Beaumont M, Sennedot F, Moritz C, Cornuet J-M. Genetic analysis of complex demographic scenarios: Spatially expanding populations of the cane toad, Bufo marinus. Evolution. 2004;58:2021–2036. doi: 10.1111/j.0014-3820.2004.tb00487.x. [DOI] [PubMed] [Google Scholar]
  • 49.NaturalResourcesCanada Lakes. Rivers and Names of Canada. 2007. Available at http://atlas.nrcan.gc.ca/site/english/maps/reference/outlinecanada/canada06. Accessed September 29, 2010.
  • 50.NaturalResourcesCanada Posts of the Canadian Fur Trade. 1974. Available at: http://atlas.nrcan.gc.ca/site/english/maps/archives/4thedition/historical/079_80. Accessed September 29, 2010.
  • 51.Charbonneau H, et al. The first French Canadians. Pioneers in the St. Lawrence Valley. Newark, NJ: Univ of Delaware Press; 1993. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information
1016708108_sd01.xlsx (11.6KB, xlsx)

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES