Skip to main content
American Journal of Human Genetics logoLink to American Journal of Human Genetics
letter
. 2016 Feb 4;98(2):396–397. doi: 10.1016/j.ajhg.2015.12.025

The Kalash Genetic Isolate? The Evidence for Recent Admixture

Garrett Hellenthal, Daniel Falush, Simon Myers, David Reich, George BJ Busby, Mark Lipson, Cristian Capelli, Nick Patterson
PMCID: PMC4746362  PMID: 26849116

To the editor: The recent paper “The Kalash Genetic Isolate: Ancient Divergence, Drift, and Selection,” by Ayub et al.1 suggests that the Kalash people of present-day Pakistan experienced “no detectable gene flow from their geographic neighbors in Pakistan or from other extant Eurasian populations” since their split from those populations over 8,000 years ago. They note that this finding of apparent genetic isolation contradicts the results of Hellenthal et al.,2 who inferred DNA introgression dated to 910–220 BCE in an overlapping sample of Kalash individuals. Hellenthal et al.2 inferred the (unknown) source of this DNA to have genetic similarities to a wide range of modern-day groups from West Asia and Europe, including Germany-Austria and Turkey, for example.

Ayub et al. apply methods using two fundamental sources of information, which differ in the types of admixture they are designed to detect. The first source of information, leveraged by ALDER, a method developed in work involving several of us,3 is unique patterns of linkage disequilibrium generated by admixture. Ayub et al. claim that ALDER “showed no evidence of gene flow into the Kalash.”1 However, this is an error, as the ALDER results reported in their Table S4 show highly significant evidence (p value < 10−10) in the Kalash when using Armenia and Chamar as surrogates. Eight other pairings of surrogates give p values < 10−5. In all cases, the surrogate pairs include one group from South Asia (Chamar, Kol) and the other from West Eurasia (Armenia, Adygei, Brahui, Hungarians, Palestinians, Tuscans), consistent with admixture from a West Eurasian source.2 The admixture date point estimates range from 92 to 125 generations ago (with SE < 20), consistent to that inferred in Hellenthal et al. using GLOBETROTTER (95% CI: 76–101 generations).2 Indeed, the original ALDER paper also found a highly significant signal of mixture in the Kalash (Table 1 of that paper).3

The second source of information is based on tests for population mixture, as implemented in the ADMIXTOOLS4 and TREEMIX5 software, that model allele frequency correlation patterns among populations but not correlation patterns along the genome. The authors fail to detect a signal using these methods, although previous work has shown that signals of population mixture using such methods can be masked by the effects of genetic drift3, 6 or model mis-specification.3 In detail, the negative f3-statistic test implemented in ADMIXTOOLS measures the correlation in allele frequency differences between a sampled group C and two other groups A and B. A significantly negative f3(C;A,B)—indicating that the frequencies of alleles in population C tend to be intermediate between A and B—can only arise if population C descends from a mixture of populations related (anciently) to A and B.4 However, as the developers of the negative f3-statistic test note, “a history of admixture does not always result in a negative f3(C; A,B) statistic. If population C has experienced a high degree of population-specific drift (perhaps due to founder events after admixture), it can mask the signal so that f3(C; A,B) might not be negative.”4, 6 In other words, allele frequencies in population C will have drifted enough that they will no longer tend to be intermediate between those of A and B. As Ayub et al. showed and has been reported previously, the Kalash experienced strong drift effects—among the highest of Eurasian populations studied to date.1, 3, 7, 8, 9 Thus, the failure to observe a negative f3-statistic does not provide meaningful evidence against admixture.

TREEMIX5 also does not provide evidence of admixture in the Kalash according to the analyses reported in Ayub et al. However, this might be explained by the enormous search space necessary to explain all potential population merges, split times, and migrations among the 30 sampled populations considered in Figure S3 of Ayub et al.1 In the original TREEMIX paper, those authors speculate that “in graphs with complex structure … several different histories will be compatible with the data.”5 As an example, that paper notes how the well-documented admixture from Neanderthals into non-African populations10 is missed when applying TREEMIX to data from Neanderthals, Denisovans, and the 53 world-wide populations of the Human Genome Diversity Panel (HGDP).5

These observations indicate that, contrary to the claim of Ayub et al. that the ancestors of the Kalash have been isolated from the ancestors of other extant populations for over 8,000 years, there is in fact strong evidence that they have not been isolated over this time frame. Hellenthal et al.2 also inferred a similar signal of ancient admixture from a West Eurasian source into several populations from neighboring regions to the Kalash (the Balochi, Brahui, Makrani, Pathan, Sindhi), suggesting this introgression might be shared as part of a broader signal (though they note the Kalash event appeared to involve a more European-like source relative to the other groups). Whether or not this admixture event involved the armies of Alexander the Great is an unresolved question. One promising direction for future insight is ancient DNA analysis of skeletal remains from northern Pakistan. Further studies of this unique group11 are important, and we hope that future studies will shed light on the contributing populations.

References

  • 1.Ayub, Q., Mezzavilla, M., Pagani, L., Haber, M., Mohyuddin, A., Khaliq, S., Mehdi, S.Q., and Tyler-Smith, C. Am. J. Hum. Genet. 96, 775–783. [DOI] [PMC free article] [PubMed]
  • 2.Hellenthal G., Busby G.B.J., Band G., Wilson J.F., Capelli C., Falush D., Myers S. Science. 2014;343:747–751. doi: 10.1126/science.1243518. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Loh P.-R., Lipson M., Patterson N., Moorjani P., Pickrell J.K., Reich D., Berger B. Genetics. 2013;193:1233–1254. doi: 10.1534/genetics.112.147330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Patterson N., Moorjani P., Luo Y., Mallick S., Rohland N., Zhan Y., Genschoreck T., Webster T., Reich D. Genetics. 2012;192:1065–1093. doi: 10.1534/genetics.112.145037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Pickrell J.K., Pritchard J.K. PLoS Genet. 2012;8:e1002967. doi: 10.1371/journal.pgen.1002967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Reich D., Thangaraj K., Patterson N., Price A.L., Singh L. Nature. 2009;461:489–494. doi: 10.1038/nature08365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Qamar R., Ayub Q., Mohyuddin A., Helgason A., Mazhar K., Mansoor A., Zerjal T., Tyler-Smith C., Mehdi S.Q. Am. J. Hum. Genet. 2002;70:1107–1124. doi: 10.1086/339929. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Quintana-Murci L., Chaix R., Wells R.S., Behar D.M., Sayar H., Scozzari R., Rengo C., Al-Zahery N., Semino O., Santachiara-Benerecetti A.S. Am. J. Hum. Genet. 2004;74:827–845. doi: 10.1086/383236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Li J.Z., Absher D.M., Tang H., Southwick A.M., Casto A.M., Ramachandran S., Cann H.M., Barsh G.S., Feldman M., Cavalli-Sforza L.L., Myers R.M. Science. 2008;319:1100–1104. doi: 10.1126/science.1153717. [DOI] [PubMed] [Google Scholar]
  • 10.Green R.E., Krause J., Briggs A.W., Maricic T., Stenzel U., Kircher M., Patterson N., Li H., Zhai W., Fritz M.H.-Y. Science. 2010;328:710–722. doi: 10.1126/science.1188021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Lines M. Emjay Books International; Peshawar, Pakistan: 1999. [Google Scholar]

Articles from American Journal of Human Genetics are provided here courtesy of American Society of Human Genetics

RESOURCES