Skip to main content
Journal of the Royal Society Interface logoLink to Journal of the Royal Society Interface
. 2006 Sep 5;4(12):91–98. doi: 10.1098/rsif.2006.0154

Evaluating candidate agents of selective pressure for cystic fibrosis

Eric M Poolman 1,*, Alison P Galvani 1
PMCID: PMC2358959  PMID: 17015291

Abstract

Cystic fibrosis is the most common lethal single-gene mutation in people of European descent, with a carrier frequency upwards of 2%. Based upon molecular research, resistances in the heterozygote to cholera and typhoid fever have been proposed to explain the persistence of the mutation. Using a population genetic model parameterized with historical demographic and epidemiological data, we show that neither cholera nor typhoid fever provided enough historical selective pressure to produce the modern incidence of cystic fibrosis. However, we demonstrate that the European tuberculosis pandemic beginning in the seventeenth century would have provided sufficient historical, geographically appropriate selective pressure under conservative assumptions. Tuberculosis has been underappreciated as a possible selective agent in producing cystic fibrosis but has clinical, molecular and now historical, geographical and epidemiological support. Implications for the future trajectory of cystic fibrosis are discussed. Our result supports the importance of novel investigations into the role of arylsulphatase B deficiency in cystic fibrosis and tuberculosis.

Keywords: cystic fibrosis, tuberculosis, heterozygote advantage, cholera, typhoid fever

1. Introduction

Cystic fibrosis (CF) is one of the most common lethal single-gene genetic diseases in populations of European descent, with more than 1400 catalogued mutations and a carrier frequency upwards of 2% (Cystic Fibrosis Genetic Analysis Consortium 2005). The age of the most prevalent mutation, ΔF508, has been estimated to be approximately 600 generations (Wiuf 2001; Morris et al. 2002). Many hypotheses were originally forwarded to explain the high incidence of CF, including combinations of a high mutation rate, a founder effect, genetic drift and heterozygote advantage (Bertranpetit & Calafell 1996). Hypotheses that assume no heterozygote advantage have been found inconsistent with microsatellite and haplotype data (Romeo et al. 1989; Sereth et al. 1993; Wiuf 2001), the occurrence of CF in multiple countries in Europe (Serre et al. 1990), the population history of Europe (Bertranpetit & Calafell 1996) and the relative absence of CF outside of Europe (Bertranpetit & Calafell 1996).

By contrast, heterozygote advantage against an infectious disease has been repeatedly supported as a tenable explanation for the high level of allele persistence (Anderson et al. 1967; Hansson 1988; Bertranpetit & Calafell 1996; Welsh et al. 2001). The prevailing candidates for the selective agents are enteric diseases supported by molecular research: typhoid fever (Pier et al. 1998), and secretory diarrhoea including cholera (SDC) (Chao et al. 1994; Cuthbert et al. 1994; Gabriel et al. 1994). Tuberculosis has also been proposed originally on the basis of clinical observations (Anderson et al. 1967; Crawfurd 1972).

The correct selective agent must be plausible according to three lines of evidence: molecular; clinical; and historical–geographical (Galvani & Slatkin 2003). Firstly, molecular and cellular studies should demonstrate a mechanism through which heterozygosity provides resistance. Secondly, clinical studies should indicate correlations between host genotype and disease morbidity/mortality. Finally, on a historical–geographica level, the temporal and spatial distribution of the selective agent must be consistent with the observed allele frequency and incidence of the genetic disease in geographical regions of both high and low incidences. This third means of evaluation has been neglected. We develop a model that integrates demographic and population genetic dynamics which we parameterize using historical demographic and epidemiological data. This interdisciplinary approach provides a means to assess three diseases that have been considered candidates: typhoid fever; SDC; and tuberculosis. By evaluating the upper bounds of mortality and selective impact for all three diseases, we conclude that only tuberculosis resistance would have been a sufficient source of selective pressure to result in modern CF incidences.

The isolation of the CF gene product, the CF transmembrane conductance regulator (CFTR) (Kerem et al. 1989; Riordan et al. 1989; Rommens et al. 1989), has allowed numerous experiments to explore molecular mechanisms for the hypothesized heterozygote advantage. The discovery that the secretory toxin of enterotoxigenic Escherichia coli (ETEC) acts through the CFTR increased the interest in the idea that secretory diarrhoea played a role in the emergence of CF (Chao et al. 1994; Cuthbert et al. 1994; Goldstein et al. 1994). A similar toxin is produced in cholera, which also causes secretory diarrhoea (Gabriel et al. 1994). Experimental results regarding a difference in response to secretory toxins between wild-type and heterozygous mice have been contradictory: one study found a difference (Gabriel et al. 1994), while another has shown no difference (Cuthbert et al. 1995). Moreover, results in a study of human heterozygotes showed no differences in chloride secretion (Hogenauer et al. 2000). No study has incorporated clinical outcomes such as hospitalizations. Thus, clinical evidence for resistance to SDC is at best inconclusive, and there is no evidence of resistance to diarrhoea without a secretory toxin (e.g. amoebic, rotaviral, other bacterial diarrhoea).

The causative agent of typhoid, Salmonella enterica serovar Typhi, requires wild-type CFTR to enter mouse gastrointestinal submucosa (Pier et al. 1998). Intact CFTR is markedly reduced in mice heterozygous for ΔF508, suggesting a role for CF mutations in preventing typhoid fever (Pier et al. 1998). However, a recent clinical study in Indonesia, an area highly endemic for typhoid fever, found no ΔF508 mutations in their sample of 775 individuals (van de Vosse et al. 2005). Thus, clinical evidence supporting a role of CFTR mutations in typhoid resistance is lacking.

Resistance to tuberculosis in CF patients was recently postulated to arise from diminished activity of the enzyme arylsulphatase B (Tobacman 2003). Mycobacterium tuberculosis lacks intrinsic arylsulphatase activity, but the bacterium has an arylsulphotransferase that may require sulphate freed by host arylsulphatase to construct its cell wall (Rivera-Marrero et al. 2002). Cell wall sulphate content is in turn correlated with virulence (Goren et al. 1976). Thus, dysfunction of arylsulphatase activity in CF is postulated to protect against tuberculosis by depriving the bacteria of a necessary nutrient (Tobacman 2003).

There is also clinical evidence of an association between CF and resistance to tuberculosis. A lower tuberculosis mortality rate has been observed among heterozygotes versus controls (Crawfurd 1972). Additionally, the low incidence of tuberculosis in CF patients, particularly in relation to non-tubercular mycobacteria, has been noted by a number of investigators (Anderson et al. 1967; Smith et al. 1984; Hjelte et al. 1990; Kilby et al. 1992). Thus, tuberculosis is supported by both molecular and clinical evidence.

While molecular and clinical facets of the hypotheses of heterozygote advantage for the CFTR locus have been repeatedly addressed, quantitative assessment based upon historical or epidemiological data has been neglected. Calculations of selection coefficients for heterozygote advantage in CF (Meindl 1987; Bertranpetit & Calafell 1996) have neither used historical data, nor compared particular selective agents within a population genetic framework. Each candidate that we examine here has been responsible for notable historic mortality (figure 1). However, in addition to the magnitude of overall mortality, the age distribution of mortality as well as human demography are important determinants of selection (Samuelson 1978; Crow 1979; Galvani & Slatkin 2003; Galvani & Slatkin 2004).

Figure 1.

Figure 1

Age-specific probability of death owing to candidate diseases from historical records (Woods & Shelton 1997). Tuberculosis curve represents the peak of the European epidemic during 1600–1900, with 20% of all-cause mortality; the rates we used prior to 1600 were fractions of the same curve. SDC refers to secretory diarrhoea including cholera, and hence includes diarrhoea due to E. coli.

Cystic fibrosis follows a well-known geographical pattern, with an incidence in newborns of European descent of approximately 1 in 2500 (Welsh et al. 2001), lower incidences in the Middle East of 1 in 15 000 (Frossard et al. 1998), down to 1 in 40 000 in Indian populations (Powers et al. 1996) and vanishingly small incidences in African and East Asian populations (Yamashiro et al. 1997; Welsh et al. 2001). This is in stark contrast to geographical clines for typhoid fever and cholera, which are more common in tropical regions (Barua 1992; Crump et al. 2004). The historical–geographical distribution of tuberculosis more closely matches that of CF, i.e. while tuberculosis has been globally endemic for at least 15 000 years (Morse et al. 1964; Kapur et al. 1994; Rothschild et al. 2001; Hughes et al. 2002), a tuberculosis epidemic of record mortality broke out in Europe in the early seventeenth century (Twelfth Annual Report 1854; Grigg 1958; Heberden & Percival 1973; Bates & Stead 1993; Kipple 1993; Davis 2000). The epidemic did not spread to India, Africa and other parts of the world until the late nineteenth century (Bates & Stead 1993). Our model incorporates this historical outbreak and tests the correspondence of the historical–geographical course of the tuberculosis epidemic to the modern distribution of CF.

2. Material and methods

We used British mortality records from 1861 to 1870 to determine age-structured mortality rates owing to SDC (Woods & Shelton 1997). The original registers separately tabulated deaths from ‘diarrhoea and dysentery’ and ‘cholera’. We multiplied the mortality rate of the former by a parameter, ε, representing the proportion of diarrhoea that is secretory, so as to include only diarrhoea for which molecular evidence of CF resistance exists. A modern estimate of 18.5% for ε comes from Bangladesh, an area highly endemic for secretory diarrhoea (Albert et al. 1999). For our sensitivity analysis, we varied ε from half to twice this value, producing an estimate of 2% of deaths owing to SDC. As contemporary mortality rates are higher in urban environments, and the population became increasingly urbanized, we probably overestimated earlier mortality owing to SDC. It is debatable whether true cholera existed in Europe prior to 1817 (Barua 1992); we conservatively allow that it did and apply the record of 1861–1870 cholera mortality across all time.

For typhoid fever, a recent study showed that clinical diagnosis underestimates disease incidence among infants and children (Sinha et al. 1999). Therefore, we used recorded typhoid fever mortality as the lower bound in our sensitivity analysis (Woods & Shelton 1997). For our upper bound, we used the mortality rate from all fevers (Creighton 1965). As this includes death to typhus—which itself was often blamed for more than half of deaths owing to fever—and other fevers, it is necessarily an overestimate of death to typhoid fever. We used the higher mortality rate in determining the age-structured mortality, as it showed more deaths among the young; the lower figures produced on an average 39% of this mortality, or approximately 2% of all-cause mortality. As with SDC, higher and increasing rates of typhoid fever in urban areas suggest that earlier times suffered less mortality from this disease than that we modelled.

For tuberculosis, numerous sources ascribe to it 20–25% of all deaths from the sixteenth century to the early twentieth century, with peak death rates in excess of 750 per 100 000 annually (Twelfth Annual Report 1854; Grigg 1958; Heberden & Percival 1973; Bates & Stead 1993; Kipple 1993; Davis 2000). We used the lower figure of 20% as an average and the shorter term of 1600–1900. We conservatively assigned zero mortality to tuberculosis after 1900. Prior to 1600, we assigned tuberculosis responsibility for a variable proportion, τeuro<20%, of all deaths. We additionally modelled resistance to tuberculosis in India, where the modern epidemic started only several hundred years later (Bates & Stead 1993). For India, we used a peak share of mortality of 20% from 1800 to 1950 to reflect the shift in the start of the epidemic there. To construct the age distribution of deaths, we used figures from 1849 to 1853 in the United States (1854). The age-specific mortality rates used for the model are shown for comparison in figure 1.

We use an age-structured population to incorporate the different shapes of mortality curves from the candidate diseases, with nt,a,z females of age a and genotype z in the population at time t, and with nt,a total females across all genotypes (Crow 1979; Galvani & Slatkin 2003; Galvani & Slatkin 2004). Females bear female children at age-specific rate ma and suffer age- and time-specific all-cause mortality at rate μt,a. The mortality from the candidate disease is dt,a. The wild-type homozygote experiences the full mortality, whereas the heterozygote experiences mortality reduced by their fractional resistance, ρ. For initial modelling of all three candidate diseases, we assume that heterozygotes enjoy perfect resistance to the candidate diseases, ρ=1. The CF homozygote individuals experience 100% mortality prior to reproduction, which is consistent with the life-history curves for such patients even up to the beginning of the last century (Warwick & Monson 1967). Baseline mortality and fecundity rates were taken from data derived from English parish records from 1550 to 1900 (Wrigley & Schofield 1981). For our sensitivity analysis, we varied mean age at first childbirth from 23.4 to 26.0, the range for the period of 1600–1849 (Wrigley & Schofield 1981). The stable population distribution used to initiate model realizations was constructed from these rates (Preston et al. 2001).

Baseline mortality rates from the source life tables must necessarily include deaths owing to endemic diseases. For each disease, we adjusted the baseline mortality downwards from μ to μ′ so as not to double-count deaths,

μt,a=11μt,a1dt,a. (2.1)

Using these estimates for fecundity, baseline mortality and disease mortality, we evolve the population according to Hardy–Weinberg ratios, with pt,a representing the frequency of the wild-type allele at time t in age group a, and assuming that each female chooses a partner from a pool with allele frequency identical to that of her age group. We use 45 age classes of 1 year, consistent with the age-specific fertility rates (Wrigley & Schofield 1981). We ignore post-reproductive adults. The number of newly born wild-type homozygous (wt), heterozygous (het) and CF homozygous (cf) individuals is governed by the following difference equations:

pt,a=2nt,a,wt+nt,a,het2nt,a,nt+1,1,wt=a=145pt,a2nt,ama,nt+1,1,het=a=1452pt,a(1pt,a)nt,ama,nt+1,1,cf=a=145(1pt,a)2nt,ama.} (2.2)

And the existing population ages according to

nt+1,a+1,wt=(1μt,a)(1dt,a)nt,a,wt,nt+1,a+1,het=(1μt,a)(1(1ρ)dt,a)nt,a,het,nt+1,2,cf=0.} (2.3)

To address the hypotheses that ΔF508's high prevalence means it alone bestows heterozygote advantage, and that non-ΔF508 mutations are merely detrimental, we formulated an extended version of the above model with three alleles (e.g. wild-type, ΔF508 and non-ΔF508 mutation) and the six resultant genotypes. We then modelled a population in which a single ΔF508 mutation is introduced into the population with an equilibrium level of non-ΔF508 mutations (Wirth et al. 1997; Freeman & Herron 2004), assuming the former is protective against tuberculosis in the heterozygote and that both the mutant alleles produce CF in the homozygotes and mixed heterozygote. We tracked the ratio of ΔF508 to non-ΔF508 mutations over time, assuming that non-ΔF508 mutations continue to arise de novo at a rate of 6.7×10−7 (Wirth et al. 1997; Freeman & Herron 2004).

Our model of a freely mixing population facilitates the increase of an overdominant lethal recessive; hence, if any disease is to provide sufficient selective pressure, it must do so under these conditions (Nishino & Tajima 2005). Moreover, the well-studied ΔF508 mutation has a unique origin (Morral et al. 1994), indicating pan-European mixing of CFTR alleles. Each model realization follows the change in allele frequencies in a single region; thus, differences across Europe can be due to differences in selective pressure (e.g. force of infection of candidate selective agent.)

Target CF incidence in Europe, ieuro, was 1 in 3000 births (Welsh et al. 2001). We used the CF incidence from a study of Indian immigrants to the United Kingdoms, 1 in 40 000 births, for the target outside of Europe, iasia, as India experiences all three candidate diseases (Powers et al. 1996). Historic continuation of childbearing into the fifth decade of life produced mean generation times in our model from 31.9 to 32.8 years (Preston et al. 2001).

The model was initiated from 50 000 years before the present (yrBP) and run through to equilibria for both SDC and typhoid fever, affording these diseases maximal opportunity for selection. Model realizations for tuberculosis were initiated from a more conservative 18 000 yrBP, consistent with an intermediate estimate of the age of the most common CF mutation (Morris et al. 2002). Moreover, low endemic levels of tuberculosis mortality were applied until AD 1600, with the higher epidemic level from 1600 to 1900, and no tuberculosis mortality after AD 1900. This is consistent with both the estimate of 35 000 years since the origin of M. tuberculosis (Hughes et al. 2002) and historical records of the tuberculosis pandemic's origin in Europe.

3. Results

Neither SDC nor typhoid fever is able to produce the observed European incidence, given perfect resistance in the heterozygote and record disease mortality extended over all time. Perfect resistance to SDC, ρdia=1, produces an equilibrium CF incidence of 1 in 10 800 live births (1 in 20 100 to 1 in 4700 from sensitivity analysis), significantly below observed frequencies of greater than 1 in 3000. Perfect resistance to typhoid fever, ρtyph=1, produces an equilibrium incidence of 1 in 7600 live births (1 in 29 200 to 1 in 4100 from sensitivity analysis), again considerably below observed modern incidences (figure 2). At this incidence, the mortality owing to CF balances the mortality prevented owing to putative heterozygote advantage.

Figure 2.

Figure 2

Model realizations of CF incidence over time for candidate diseases, given 100% resistance (ρ=1) to the specified disease. Tuberculosis has been assigned responsibility for a fraction, τ=1.6%, of mortality prior to 1600. All the populations are initiated with 1 in 20 000 CF allele frequencies. Typhoid fever and SDC were initiated 50 000 yrBP. Tuberculosis was given the more restrictive start time of 18 000 yrBP. Under these conditions, tuberculosis resistance produces an observed modern European CF incidence of 1 in 3000 (arrow). The inflection point in the tuberculosis curve is due to the conservative assumption that tuberculosis rates jumped to their historically documented levels in 1600, rather than increasing gradually prior to 1600.

The tuberculosis pandemic originating in Europe in the seventeenth century provides adequate, appropriately located selective pressure to produce the modern CF incidence in Europe, greater than 1 in 3000, assuming that endemic tuberculosis prior to the seventeenth century was responsible for less than 2% of all-cause mortality, which is consistent with historical records, and only one-tenth of its mortality during the recent European epidemic (Bates & Stead 1993). Partial resistance to tuberculosis in the heterozygote, down to ρTB=15%, can similarly produce the modern CF incidence, given higher levels of endemic tuberculosis prior to the seventeenth century. Thus, tuberculosis provides sufficient selective pressure over a wide range of resistances and mortality levels (figure 3).

Figure 3.

Figure 3

Range of CF incidences given resistance to tuberculosis in the heterozygote. The independent variables are fraction of resistance to tuberculosis in the CF carrier and percent of all deaths owing to tuberculosis prior to the pandemic that began in the 1600s. From 1600 until 1900, 20% of all deaths are ascribed to tuberculosis, as it is consistent with historical records. Light grey regions indicate combinations of parameters that could have produced the observed CF incidence (greater than 1 in 3000) in Europe. Tuberculosis resistance is a sufficient cause of CF across a broad range of parameters. In contrast, neither SDC nor typhoid fever produces sufficient CF incidence in any region of their parameter space, even allowing peak mortality levels extending back indefinitely into the past—their entire graphs are dark grey (not shown).

We additionally modelled the CF incidence in India. If tuberculosis mortality in India had been approximately 70% of that in Europe, it would produce the observed Indian CF incidence. This lower level of tuberculosis in India is historically plausible (Bates & Stead 1993).

When we modelled two types of CFTR mutations, those with and without heterozygote advantage, the advantageous category of alleles rises to represent 98% of CFTR mutant alleles in the modern day. This rise occurs despite de novo disadvantageous mutations at a rate of 6.7×10−7 per generation (Wirth et al. 1997; Freeman & Herron 2004). This projected 98% contribution is significantly higher than the 66% modern frequency of ΔF508 among CFTR mutations (Bobadilla et al. 2002), suggesting that the heterozygote advantage in CF is not associated solely with ΔF508.

Tuberculosis remains a major cause of mortality worldwide (Nunn et al. 2005), but from a historical perspective, it is relatively well controlled in most developed countries. In situations where tuberculosis is no longer responsible for significant mortality among individuals of reproductive age, our model demonstrates that the incidence of CF will fall. The decline amounts to a 0.1% decrease in CF incidence annually over the next 100 years. To reduce the incidence by half requires approximately 20 generations.

4. Discussion

The tuberculosis pandemic, the ‘white plague’ that claimed over 20% of all European lives beginning in the 1600s, swept across the world over subsequent centuries (Bates & Stead 1993). However, its origin and duration in Europe meant that the greatest selective pressure was applied there—enough so that, according to our results, tuberculosis resistance in the CF heterozygote can explain the modern gradient in CF incidence between Europe and the rest of the world.

Our quantitative analysis revealed that secretory diarrhoea mortality—including both cholera and ETEC—was insufficient to account for the present CF frequencies in Europe, given perfect resistance over all time and liberal estimates of disease-associated mortality. Likewise, typhoid fever did not provide adequate mortality under similar assumptions. We were generous to these diseases in numerous aspects of our evaluations. Firstly, our estimates of the incidences of cholera, typhoid and ETEC are likely to be high; as all these diseases tend to increase with increasing population density, the urbanizing communities of the 1800s from which the mortality estimates were drawn would suffer the highest mortality rates in history from these diseases (Woods & Shelton 1997). In the case of typhoid fever, we broadly included all deaths by fever in our parameterization to avoid difficulties in retrospective diagnosis. Secondly, we used a high estimate for the mean age at first childbirth; lower estimates predict even lower CF incidences owing to these diseases, as the differential resistance in heterozygotes has less time over which to act. Thirdly, many regions in Europe have CF incidences higher than our target incidence, up to 1 in 1700 (Welsh et al. 2001); hence, we set a lenient standard for determining whether typhoid fever or SDC could produce European incidences. In contrast, tuberculosis is able to match even the higher target incidences (data not shown). Finally, neither typhoid fever nor SDC shows a historical or geographical distribution consistent with the modern distribution of CF.

Abundant literature attests to the ancient, persistent, worldwide presence of tuberculosis (Morse et al. 1964; Davis 2000; Rothschild et al. 2001). A tuberculosis contribution of approximately 2% of all mortality during endemic periods is adequate in our model and higher contributions are consistent if resistance is imperfect. Tuberculosis can also explain geographical differences in CF incidence, i.e. tuberculosis produced the observed CF incidence for India with tuberculosis mortality levels about 70% of that in Europe, consistent with the historical data (Bates & Stead 1993). Comparisons to Africa and Asia are more difficult, primarily because clear data on the incidence of CF are unavailable. Furthermore, other polymorphisms that lead to incomplete forms of CF, e.g. congenital bilateral absence of the vas deferens or chronic pancreatitis (Sharer et al. 1998), are common in parts of Asia and complicate the analysis there (Cuppens et al. 1998; Fujiki et al. 2004). While these and other factors may contribute to the difference in CF incidence between Europe and elsewhere, our results show that the timing and duration of the most recent tuberculosis pandemic provides a parsimonious explanation.

Tuberculosis supplies greater selective pressure than either typhoid fever or SDC primarily because it has been associated with mortality rates routinely of an order of magnitude higher than those owing to the other candidate diseases and secondarily because mortality closer to the onset of reproductive age removes relatively more reproductive potential per death (figure 1).

A founder effect has been proposed to explain the high incidence of CF (Klinger 1983). We modelled a putative bottleneck population consisting solely of heterozygotes. In such a population, CF incidence begins at 1 in 4 births and decays below the observed European incidence in 1800 years. Thus, a founder effect could not produce existing levels of CF, as there has been no such dramatic bottleneck for the entire European population within the past 1800 years.

The particular prevalence of the ΔF508 mutation has led to the suggestion that it alone provides heterozygote advantage (Welsh et al. 2001; Wiuf 2001) and that other CFTR mutations are purely detrimental. However, our results demonstrate that selective pressure sufficient to drive CF up to its present incidence would also drive the class of purely detrimental CFTR alleles down to less than 2% of all CFTR mutations. This is significantly lower than the 34% of all CFTR mutations that are not ΔF508 (Bobadilla et al. 2002), suggesting that ΔF508 is neither the sole beneficial CFTR allele nor a genetic hitchhiker (Macek et al. 1997). Additionally, the historical presence of tuberculosis provides a ready explanation for the range of ages suggested for ΔF508 (Wiuf 2001; Morris et al. 2002); it would have provided modest selective advantage against the endemic disease even prior to the seventeenth century epidemic.

As our model relates CF to tuberculosis, it predicts that the incidence of CF will fall in regions of the world where tuberculosis is well controlled at a rate of approximately 0.1% per year. In parts of the world where tuberculosis incidence is increasing (as owing to HIV and limited public health resources), the equilibrium incidence of CF may be higher or lower than present levels; prediction in such regions is outside the scope of this paper.

We were only able to use fertility and mortality data from one portion of Europe, and that assembled and compared across multiple sources. Moreover, we had to extrapolate relatively recent demographic parameters back in time to earlier periods with markedly different living conditions. Retrospective diagnosis of the causes of death is notoriously error-prone, but we took a generous approach of ascribing deaths to diarrhoea and typhoid fever. Even if our exact parameters are not correct, our basic finding holds: tuberculosis resistance is sufficient to explain CF across a wide range of parameters, while SDC and typhoid fever appear much more limited in the range of feasible parameters (in our case, none was found consistent with the historical data).

Our model assumes random mixing across the entire population. It is also deterministic rather than stochastic. Further, we include neither spatial nor temporal heterogeneity in the force of infection of the candidate diseases. All these factors could conspire to raise or lower the incidence of CF within a community, irrespective of selective pressures. However, we believe that the magnitude of these effects is insufficient to produce the broad incidence of CF across so many European populations.

We demonstrate that the two agents commonly invoked to explain the persistence of CF—secretory diarrhoea (including cholera) and typhoid fever—have not imposed sufficient historical selection. In contrast, tuberculosis combines ancient endemicity with a recent epidemic of European origin and severe mortality, particularly among young adults at the peak of their reproductive potential. Consequently, tuberculosis would provide sufficient selective pressure in the appropriate parts of the world. Additionally, tuberculosis is a plausible candidate based upon clinical observations and recent molecular hypotheses. Thus, investigation of the putative link between tuberculosis and CF may be an important area for inquiry, both through genetic studies of tuberculosis patients and examination of the proposed role of arylsulphatase B deficiency in both tuberculosis and CF.

Acknowledgments

We thank J. Medlock, T. Reluga, D. Paltiel, E. Kaplan, R. Stephenson-Padron, D. Thomas and J. Townsend for their comments on and assistance with the manuscript. This research was funded by the National Institute of Mental Health (R01 MH65869) and an award from the Notsew Orm Sands Foundation.

References

  1. Albert M.J, Faruque A.S, Faruque S.M, Sack R.B, Mahalanabis D. Case-control study of enteropathogens associated with childhood diarrhea in Dhaka, Bangladesh. J. Clin. Microbiol. 1999;37:3458–3464. doi: 10.1128/jcm.37.11.3458-3464.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Anderson C.M, Allan J, Johansen P.G. Comments on the possible existence and nature of a heterozygote advantage in cystic fibrosis. Bibl. Paediatr. 1967;86:381–387. [PubMed] [Google Scholar]
  3. Barua D. History of Cholera. In: Barua D, Greenough W, editors. Cholera. Plenum Medical Book Company; New York, NY: 1992. pp. 1–36. [Google Scholar]
  4. Bates J.H, Stead W.W. The history of tuberculosis as a global epidemic. In: Bass J.B Jr., editor. The medical clinics of North America: tuberculosis. vol. 77. Philadelphia; Harcourt Brace & Company: 1993. pp. 1205–1217. [DOI] [PubMed] [Google Scholar]
  5. Bertranpetit J, Calafell F. Genetic and geographical variability in cystic fibrosis: evolutionary considerations. Ciba Found. Symp. 1996;197:97–114. doi: 10.1002/9780470514887.ch6. discussion 114–8. [DOI] [PubMed] [Google Scholar]
  6. Bobadilla J.L, Macek M, Jr, Fine J.P, Farrell P.M. Cystic fibrosis: a worldwide analysis of CFTR mutations—correlation with incidence data and application to screening. Hum. Mutat. 2002;19:575–606. doi: 10.1002/humu.10041. [DOI] [PubMed] [Google Scholar]
  7. Chao A.C, de Sauvage F.J, Dong Y.J, Wagner J.A, Goeddel D.V, Gardner P. Activation of intestinal CFTR Cl-channel by heat-stable enterotoxin and guanylin via cAMP-dependent protein kinase. EMBO J. 1994;13:1065–1072. doi: 10.1002/j.1460-2075.1994.tb06355.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Crawfurd, M. A., 1972 A genetic study, including evidence for heterosis, of cystic fibrosis of the pancreas. In 168th Meeting of the Genetical Society of Great Britain, pp. 126. University of Leeds.
  9. Creighton C. Cass; London, UK: 1965. A history of epidemics in Britain. [Google Scholar]
  10. Crow J.F. Gene frequency and fitness change in an age-structured population. Ann. Hum. Genet. 1979;42:355–370. doi: 10.1111/j.1469-1809.1979.tb00669.x. [DOI] [PubMed] [Google Scholar]
  11. Crump J.A, Luby S.P, Mintz E.D. The global burden of typhoid fever. Bull. World Health Organ. 2004;82:346–353. [PMC free article] [PubMed] [Google Scholar]
  12. Cuppens H, et al. Polyvariant mutant cystic fibrosis transmembrane conductance regulator genes. The polymorphic (Tg)m locus explains the partial penetrance of the T5 polymorphism as a disease mutation. J. Clin. Invest. 1998;101:487–496. doi: 10.1172/JCI639. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Cuthbert A.W, Halstead J, Ratcliff R, Colledge W.H, Evans M.J. The genetic advantage hypothesis in cystic fibrosis heterozygotes: a murine study. J. Physiol. 1995;482:449–454. doi: 10.1113/jphysiol.1995.sp020531. (Pt 2) [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Cuthbert A.W, Hickman M.E, MacVinish L.J, Evans M.J, Colledge W.H, Ratcliff R, Seale P.W, Humphrey P.P. Chloride secretion in response to guanylin in colonic epithelial from normal and transgenic cystic fibrosis mice. Br. J. Pharmacol. 1994;112:31–36. doi: 10.1111/j.1476-5381.1994.tb13024.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Cystic Fibrosis Genetic Analysis Consortium. Cystic Fibrosis Mutation Database. 2005 See http://www.genet.sickkids.on.ca/cftr/. [Google Scholar]
  16. Davis A.L. History of Tuberculosis. In: Reichman L.B, Hershfield E.S, editors. Tuberculosis: a comprehensive international approach. Marcel Dekker, Inc; New York, NY: 2000. pp. 3–54. [Google Scholar]
  17. Freeman S, Herron J.C. Pearson Prentice Hall; Upper Saddle River, NJ: 2004. Evolutionary analysis. [Google Scholar]
  18. Frossard P.M, Girodon E, Dawson K.P, Ghanem N, Plassa F, Lestringant G.G, Goossens M. Identification of cystic fibrosis mutations in the United Arab Emirates. Mutations in brief no. 133. Online. Hum. Mutat. 1998;11:412–413. doi: 10.1002/(SICI)1098-1004(1998)11:5&#x0003c;412::AID-HUMU15&#x0003e;3.0.CO;2-O. [DOI] [PubMed] [Google Scholar]
  19. Fujiki K, et al. Genetic evidence for CFTR dysfunction in Japanese: background for chronic pancreatitis. J. Med. Genet. 2004;41:e55. doi: 10.1136/jmg.2003.014456. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Gabriel S.E, Brigman K.N, Koller B.H, Boucher R.C, Stutts M.J. Cystic fibrosis heterozygote resistance to cholera toxin in the cystic fibrosis mouse model. Science. 1994;266:107–109. doi: 10.1126/science.7524148. [DOI] [PubMed] [Google Scholar]
  21. Galvani A.P, Slatkin M. Evaluating plague and smallpox as historical selective pressures for the CCR5-Delta 32 HIV-resistance allele. Proc. Natl Acad. Sci. USA. 2003;100:15 276–15 279. doi: 10.1073/pnas.2435085100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Galvani A.P, Slatkin M. Intense selection in an age-structured population. Proc. R. Soc. B. 2004;271:171–176. doi: 10.1098/rspb.2003.2573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Goldstein J.L, Sahi J, Bhuva M, Layden T.J, Rao M.C. Escherichia coli heat-stable enterotoxin-mediated colonic Cl-secretion is absent in cystic fibrosis. Gastroenterology. 1994;107:950–956. doi: 10.1016/0016-5085(94)90218-6. [DOI] [PubMed] [Google Scholar]
  24. Goren M.B, D'Arcy Hart P, Young M.R, Armstrong J.A. Prevention of phagosome–lysosome fusion in cultured macrophages by sulfatides of Mycobacterium tuberculosis. Proc. Natl Acad. Sci. USA. 1976;73:2510–2514. doi: 10.1073/pnas.73.7.2510. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Grigg E.R. The arcana of tuberculosis with a brief epidemiologic history of the disease in the U.S.A. Am. Rev. Tuberc. 1958;78:151–172. doi: 10.1164/artpd.1958.78.2.151. [DOI] [PubMed] [Google Scholar]
  26. Hansson G.C. Cystic fibrosis and chloride-secreting diarrhoea. Nature. 1988;333:711. doi: 10.1038/333711c0. [DOI] [PubMed] [Google Scholar]
  27. Heberden W, Percival T. Gregg; Farnborough, UK: 1973. Population and disease in early industrial England. Pioneers of demography. [Google Scholar]
  28. Hjelte L, Petrini B, Kallenius G, Strandvik B. Prospective study of mycobacterial infections in patients with cystic fibrosis. Thorax. 1990;45:397–400. doi: 10.1136/thx.45.5.397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Hogenauer C, Santa Ana C.A, Porter J.L, Millard M, Gelfand A, Rosenblatt R.L, Prestidge C.B, Fordtran J.S. Active intestinal chloride secretion in human carriers of cystic fibrosis mutations: an evaluation of the hypothesis that heterozygotes have subnormal active intestinal chloride secretion. Am. J. Hum. Genet. 2000;67:1422–1427. doi: 10.1086/316911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Hughes A.L, Friedman R, Murray M. Genomewide pattern of synonymous nucleotide substitution in two complete genomes of Mycobacterium tuberculosis. Emerg. Infect. Dis. 2002;8:1342–1346. doi: 10.3201/eid0811.020064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kapur V, Whittam T.S, Musser J.M. Is Mycobacterium tuberculosis 15 000 years old? J. Infect. Dis. 1994;170:1348–1349. doi: 10.1093/infdis/170.5.1348. [DOI] [PubMed] [Google Scholar]
  32. Kerem B, Rommens J.M, Buchanan J.A, Markiewicz D, Cox T.K, Chakravarti A, Buchwald M, Tsui L.C. Identification of the cystic fibrosis gene: genetic analysis. Science. 1989;245:1073–1080. doi: 10.1126/science.2570460. [DOI] [PubMed] [Google Scholar]
  33. Kilby J.M, Gilligan P.H, Yankaskas J.R, Highsmith W.E, Jr, Edwards L.J, Knowles M.R. Nontuberculous mycobacteria in adult patients with cystic fibrosis. Chest. 1992;102:70–75. doi: 10.1378/chest.102.1.70. [DOI] [PubMed] [Google Scholar]
  34. Kipple K. Cambridge University Press; Cambridge, UK: 1993. The Cambridge world history of human disease. [Google Scholar]
  35. Klinger K.W. Cystic fibrosis in the Ohio Amish: gene frequency and founder effect. Hum. Genet. 1983;65:94–98. doi: 10.1007/BF00286641. [DOI] [PubMed] [Google Scholar]
  36. Macek M, Jr, et al. Possible association of the allele status of the CS.7/HhaI polymorphism 5′ of the CFTR gene with postnatal female survival. Hum. Genet. 1997;99:565–572. doi: 10.1007/s004390050407. [DOI] [PubMed] [Google Scholar]
  37. Meindl R.S. Hypothesis: a selective advantage for cystic fibrosis heterozygotes. Am. J. Phys. Anthropol. 1987;74:39–45. doi: 10.1002/ajpa.1330740104. [DOI] [PubMed] [Google Scholar]
  38. Morral N, et al. The origin of the major cystic fibrosis mutation (delta F508) in European populations. Nat. Genet. 1994;7:169–175. doi: 10.1038/ng0694-169. [DOI] [PubMed] [Google Scholar]
  39. Morris A.P, Whittaker J.C, Balding D.J. Fine-scale mapping of disease loci via shattered coalescent modeling of genealogies. Am. J. Hum. Genet. 2002;70:686–707. doi: 10.1086/339271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Morse D, Brothwell D.R, Ucko P.J. Tuberculosis in Ancient Egypt. Am. Rev. Respir. Dis. 1964;90:524–541. doi: 10.1164/arrd.1964.90.4.524. [DOI] [PubMed] [Google Scholar]
  41. Nishino J, Tajima F. Effect of population structure on the amount of polymorphism and the fixation probability under overdominant selection. Genes Genet. Syst. 2005;80:287–295. doi: 10.1266/ggs.80.287. [DOI] [PubMed] [Google Scholar]
  42. Nunn P, Williams B, Floyd K, Dye C, Elzinga G, Raviglione M. Tuberculosis control in the era of HIV. Nat. Rev. Immunol. 2005;5:819–826. doi: 10.1038/nri1704. [DOI] [PubMed] [Google Scholar]
  43. Pier G.B, Grout M, Zaidi T, Meluleni G, Mueschenborn S.S, Banting G, Ratcliff R, Evans M.J, Colledge W.H. Salmonella typhi uses CFTR to enter intestinal epithelial cells. Nature. 1998;393:79–82. doi: 10.1038/30006. [DOI] [PubMed] [Google Scholar]
  44. Powers C.A, Potter E.M, Wessel H.U, Lloyd-Still J.D. Cystic fibrosis in Asian Indians. Arch. Pediatr. Adolesc. Med. 1996;150:554–555. doi: 10.1001/archpedi.1996.02170300108024. [DOI] [PubMed] [Google Scholar]
  45. Preston S.H, Heuveline P, Guillot M. Blackwell Publishers Ltd; Oxford, UK: 2001. Demography: measuring and modeling population processes. [Google Scholar]
  46. Riordan J.R, et al. Identification of the cystic fibrosis gene: cloning and characterization of complementary DNA. Science. 1989;245:1066–1073. doi: 10.1126/science.2475911. [DOI] [PubMed] [Google Scholar]
  47. Rivera-Marrero C.A, Ritzenthaler J.D, Newburn S.A, Roman J, Cummings R.D. Molecular cloning and expression of a novel glycolipid sulfotransferase in Mycobacterium tuberculosis. Microbiology. 2002;148:783–792. doi: 10.1099/00221287-148-3-783. [DOI] [PubMed] [Google Scholar]
  48. Romeo G, Devoto M, Galietta L.J. Why is the cystic fibrosis gene so frequent? Hum. Genet. 1989;84:1–5. doi: 10.1007/BF00210660. [DOI] [PubMed] [Google Scholar]
  49. Rommens J.M, et al. Identification of the cystic fibrosis gene: chromosome walking and jumping. Science. 1989;245:1059–1065. doi: 10.1126/science.2772657. [DOI] [PubMed] [Google Scholar]
  50. Rothschild B.M, Martin L.D, Lev G, Bercovier H, Bar-Gal G.K, Greenblatt C, Donoghue H, Spigelman M, Brittain D. Mycobacterium tuberculosis complex DNA from an extinct bison dated 17 000 years before the present. Clin. Infect. Dis. 2001;33:305–311. doi: 10.1086/321886. [DOI] [PubMed] [Google Scholar]
  51. Samuelson P.A. Generalizing Fisher's “reproductive value”: overlapping and nonoverlapping generations with competing genotypes. Proc. Natl Acad. Sci. USA. 1978;75:4062–4066. doi: 10.1073/pnas.75.8.4062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Sereth H, Shoshani T, Bashan N, Kerem B.S. Extended haplotype analysis of cystic fibrosis mutations and its implications for the selective advantage hypothesis. Hum. Genet. 1993;92:289–295. doi: 10.1007/BF00244474. [DOI] [PubMed] [Google Scholar]
  53. Serre J.L, Simon-Bouy B, Mornet E, Jaume-Roig B, Balassopoulou A, Schwartz M, Taillandier A, Boue J, Boue A. Studies of RFLP closely linked to the cystic fibrosis locus throughout Europe lead to new considerations in populations genetics. Hum. Genet. 1990;84:449–454. doi: 10.1007/BF00195818. [DOI] [PubMed] [Google Scholar]
  54. Sharer N, Schwarz M, Malone G, Howarth A, Painter J, Super M, Braganza J. Mutations of the cystic fibrosis gene in patients with chronic pancreatitis. N. Engl. J. Med. 1998;339:645–652. doi: 10.1056/NEJM199809033391001. [DOI] [PubMed] [Google Scholar]
  55. Sinha A, et al. Typhoid fever in children aged less than 5 years. Lancet. 1999;354:734–737. doi: 10.1016/S0140-6736(98)09001-1. [DOI] [PubMed] [Google Scholar]
  56. Smith M.J, Efthimiou J, Hodson M.E, Batten J.C. Mycobacterial isolations in young adults with cystic fibrosis. Thorax. 1984;39:369–375. doi: 10.1136/thx.39.5.369. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Tobacman J.K. Does deficiency of arylsulfatase B have a role in cystic fibrosis? Chest. 2003;123:2130–2139. doi: 10.1378/chest.123.6.2130. [DOI] [PubMed] [Google Scholar]
  58. Twelfth Annual Report. Twelfth annual report relating to the registry of births, marriages, and deaths in Massachusetts. In: Gutmann Rosenkrantz B, editor. From consumption to tuberculosis: a documentary history. Garland; New York, NY: 1854. [Google Scholar]
  59. van de Vosse E, Ali S, Visser A.W, Surjadi C, Widjaja S, Vollaard A.M, Dissel J.T. Susceptibility to typhoid fever is associated with a polymorphism in the cystic fibrosis transmembrane conductance regulator (CFTR) Hum. Genet. 2005:1–3. doi: 10.1007/s00439-005-0005-0. [DOI] [PubMed] [Google Scholar]
  60. Warwick W.J, Monson S. Life table studies of mortality. Bibl. Paediatr. 1967;86:353–367. [PubMed] [Google Scholar]
  61. Welsh M.J, Ramsey B.W, Accurso F, Cutting G.R. Cystic fibrosis. In: Scriver C.R, Beaudet A.L, Valle D, Sly W.S, Vogelstein B, Childs B, Kinzler K.W, editors. The metabolic and molecular bases of inherited disease. McGraw-Hill; New York, NY: 2001. [Google Scholar]
  62. Wirth B, Schmidt T, Hahnen E, Rudnik-Schoneborn S, Krawczak M, Muller-Myhsok B, Schonling J, Zerres K. De novo rearrangements found in 2% of index patients with spinal muscular atrophy: mutational mechanisms, parental origin, mutation rate, and implications for genetic counseling. Am. J. Hum. Genet. 1997;61:1102–1111. doi: 10.1086/301608. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Wiuf C. Do delta F508 heterozygotes have a selective advantage? Genet. Res. 2001;78:41–47. doi: 10.1017/S0016672301005195. [DOI] [PubMed] [Google Scholar]
  64. Woods R, Shelton N. Liverpool University Press; Liverpool, UK: 1997. An atlas of Victorian mortality. [Google Scholar]
  65. Wrigley E.A, Schofield R.S. Harvard University Press; Cambridge, MA: 1981. The population history of England 1541–1871. [Google Scholar]
  66. Yamashiro Y, Shimizu T, Oguchi S, Shioya T, Nagata S, Ohtsuka Y. The estimated incidence of cystic fibrosis in Japan. J. Pediatr. Gastroenterol. Nutr. 1997;24:544–547. doi: 10.1097/00005176-199705000-00010. [DOI] [PubMed] [Google Scholar]

Articles from Journal of the Royal Society Interface are provided here courtesy of The Royal Society

RESOURCES