Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2013 Jul 3;110(29):11668–11669. doi: 10.1073/pnas.1308435110

North Africans traveling north

Karl Skorecki a,b,1, Doron M Behar a
PMCID: PMC3718129  PMID: 23824295

Humans have always loved to move and to mate. In doing so, we generate complex patterns of diversity in language, custom, and culture across and within geographic regions. However, the richest and most durable signals of migration and admixture are evident in patterns of DNA sequence variation (1, 2). The study by Botigué et al. (3) reported in PNAS analyzes DNA sequence variation in Africa, the Near East, and Europe to advance our understanding of the peopling of Europe with a focus on the relatively recent contribution of North African ancestry to the contemporary population genetic structure of southern Europe.

The population history of Europe during the last ∼50,000 y corresponds to two major geologic time periods: the Pleistocene before ∼11,500 y ago, transitioning into the Holocene (Fig. 1). The major events of the Pleistocene include the initial pioneering colonization by humans during the Upper Paleolithic, as part of the overall dispersal of modern humans out of Africa (Fig. 1A). Subsequently, during the most recent major Ice Age (Last Glacial Maximum) extending from 25,000 to 19,500 y ago, people found refuge in a number of well-defined geographic areas, including southwest Europe (Franco-Cantabrian/ Iberian), along the Mediterranean coast including the Italian peninsula, the Balkans, the Levant, and on the east European steppes (Fig. 1B) (4, 5). The subsequent transition from Pleistocene to Holocene (disrupted briefly by a less severe cooling lasting about a millennium; Fig. 1C) also ushered in the dispersal to Europe of Near East migrants, bringing with them Neolithic skill sets and practices, including different forms of agriculture and animal domestication (Fig. 1D) (6). Both the recolonization from Ice Age population refugia and the Neolithic migration generated a backdrop for a reducing south to north European cline of DNA sequence diversity. Ever since, numerous migrations have followed along multiple intercontinental routes from the Copper Age to this day, including trans-Mediterranean movements (Fig. 1D), which are directly relevant to understanding population genetic structure in southern Europe (7).

Fig. 1.

Fig. 1.

Major stages in the peopling of Europe. A–E are illustrative of five major demographic stages that may have contributed to the current complex human population genetic structure of Europe. Time scales, expansion arrows, and specificity of geographic regions should not be regarded as pointing to the detailed migratory routes and dates.

Much of the genetic trace of the more remote and recent migratory events has been fruitfully elucidated by examining the distribution of ever more finely resolved phylogenies or haplogroups at the nonrecombining uniparental regions of the genome: namely, the Y chromosome and mitochondrial DNA, as has been recently summarized for Europe (5). However, population genetics studies corresponding to the Holocene, which initiated 10 millennia of relative climatic stability to this day, has particularly benefited from combining such uniparental lineage analysis with genomewide approaches (2). The rapid transition during the last decade to analysis of genomewide variation has enabled levels of resolution of population genetic structure, which eluded single locus analysis (811). This is because each independent locus has its own coalescent history and continues to add information until the point where the spacing of loci falls below the scale of linkage disequilibrium (the nonrandom association of neighboring markers) in the populations of interest. For more than a decade, it has been possible, and now straightforward and affordable using DNA BeadChip technology, to determine for any individual genome, the biparental allelic states (genotype) for a subset of almost all known single nucleotide variants (SNVs), or better still, the phased combination of genotypes for a group of SNVs at any and all given genomic regions of interest (haplotype). The technical ability to interrogate genomewide variation was accompanied by the development of analytic approaches based on either allele frequency or haplotype comparisons. The various categories of such analytic approaches, their strengths, and limitations have been summarized recently (2).

Using these approaches, multiple studies examining autosomal biparental variation among Europeans have yielded a set of consistent conclusions, which generate a multilayered pattern, to which Botigué et al. (3) validate an important North African layer (Fig. 1E). Within a global context, it is apparent that European populations share a distinctive background genetic layer that is readily separable from components dominating other continents, enabling quite reliable individual assignment of European ancestry, most evident in the Basque and Sardinian populations (1, 812). A next layer is evident as the consistent and reproducible distinction between “northern” and “southern” European population groups, with a clinal distribution of genetic variation consistent with a south to north expansion and/or a larger effective population size in southern than in northern Europe (10, 12, 13). Despite comparatively low average levels of genetic differentiation among Europeans, the decay of genetic similarity as a function of geographic distance within Europe gives rise to a rather amazing ability to overlay a geographical map of Europe with allele frequency-based measures of genomewide variation (9, 12). In certain cases, it is even possible to assign the ethnic affiliation of an individual with a high degree of fidelity for population isolates such as Basque, Sardinian, Finns, and Ashkenazi Jews (12, 14, 15), a genetic structure within a given geographic region (16), or even a village of origin (17).

In the European context, the question of the trans-Mediterranean gene flow has also been investigated. The maternal component of African lineages to the contemporary gene pool of Europeans could be readily estimated due to the clear partitioning of Sub-Saharan and European mtDNA lineages. Frequencies of such African lineages vary widely within Europe, ranging from 3% in southern Europe to 0.7% in central Europe and 0.5% in northern Europe (18, 19), with evidence that as much as 35% of African lineages form European-specific subclades, pointing to gene flow from Sub-Saharan Africa to Europe as early as 11,000 y ago (20). Similarly, the sharp discontinuities between northwestern Africa and the Iberian Peninsula at the level of the Y chromosome were followed by evidence of North African to south European migration, particularly to the Iberian Peninsula (21, 22). At the genomewide level, Auton et al. (23) reported the highest genomewide haplotype diversity of European samples in the Iberian Peninsula, together with the highest sharing of haplotypes with the Yoruba population of any European population examined. A more recent study reported that southern Europeans have inherited 1–3% African ancestry, with an average admixture timing of some 55 generations ago, coinciding with the end of the Roman Empire and subsequent Arab migrations (24).

This latter set of findings begs the question of whether this level of Sub-Saharan admixture is simply the carryover of Sub-Saharan African admixture embedded within a much larger recent North African contribution. It is this question that Botigué and colleagues successfully resolve (3). With the appropriate choice of sample sets, multiple independent analytic approaches converged on the conclusion that a relatively recent North African–specific, rather than Sub-Saharan, admixture has made a significant contribution to the population genomic structure of Europe, with a striking clinal pattern from prominence of such admixture in southwest Europe to vanishing in north and east Europe. With the reassuring exception of the Basque population isolate, the Iberian Peninsula showed the greatest imprint. Specifically, southwestern European populations averaged between 4% and 20% of their genomes assigned to a North African ancestral cluster, whereas this value did not exceed 2% in southeastern European populations. The study included an isolation-by-distance analysis, including a western Mediterranean waypoint. Isolation-by-distance resulting from a geographic barrier such as the Mediterranean can confound allele frequency based measures of admixture (25). The addition of an Eastern Nile Corridor waypoint might have provided an even more comprehensive picture, as would have a larger number of representative sources for Near East populations. The study also highlights the importance of combining allele frequency with haplotype-based approaches. Thus, the genomewide examination of long identical-by-descent haplotypes showed that the component of haplotype sharing between Europe and Sub-Saharan Africa appeared to be driven by the carryover of Sub-Saharan to northern Africa.

Botigué et al. (3) motivate further studies to unravel the complexity of trans-Mediterranean gene flow. First, it is important to clarify whether the strong North African signal results from multiple waves or continuous gene flow to Europe or rather can be mostly attributed to the strong hold of North African Moors in Iberia lasting for almost a millennium. Moreover, multiple complex scenarios distinguish direct gene flow from North Africa to Europe from indirect gene flow via a third region such as the Levant and/or by the combined movements of groups such as Jews and Phoenicians. Levant and Near Eastern gene flow to southern Europe could also have been either direct or via a North Africa route, much as that for the Arabian introgression first to North Africa and then to Iberia. Sorting this out is required as a next necessary step with careful comparison of adequately representative Near East with North African population sample sets.

Finally, Botigué et al. (3) make a valiant attempt to relate population genetic findings to patterns of common disease. While acknowledging the serious limitations of extrapolating a common disease risk genetic signature based on SNVs that are presumed to tag true risk causative mutations in one population (in this case a set of 53 tagging markers for multiple sclerosis in Europeans) to another population (North Africans) whose patterns of allele frequencies and linkage disequilibrium are markedly different. Nevertheless, the study goes on to report a higher than expected frequency of this very signature in North Africans. Given the disparity of actual multiple sclerosis disease favoring the populations of the temperate climates of northern Europe, the authors reach two major conclusions—both of which are valuable in their own right—but not likely a sequitur for the current study results. The first is that appropriate genomewide mapping of SNP signatures predicting predisposition to common disease requires delineation of the population genetic architecture of the population of interest. The second is that a true disease risk genomic signature does not equal a higher frequency of disease without the additional influence of gene-by-environment interactions.

Footnotes

The authors declare no conflict of interest.

See companion article on page 11791.

References

  • 1.Li JZ, et al. Worldwide human relationships inferred from genome-wide patterns of variation. Science. 2008;319(5866):1100–1104. doi: 10.1126/science.1153717. [DOI] [PubMed] [Google Scholar]
  • 2.Novembre J, Ramachandran S. Perspectives on human population structure at the cusp of the sequencing era. Annu Rev Genomics Hum Genet. 2011;12:245–274. doi: 10.1146/annurev-genom-090810-183123. [DOI] [PubMed] [Google Scholar]
  • 3.Botigué LR, et al. Gene flow from North Africa contributes to differential human genetic diversity in southern Europe. Proc Natl Acad Sci USA. 2013;110:11791–11796. doi: 10.1073/pnas.1306223110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Shennan S, Edinborough K. Prehistoric population history: From the Late Glacial to the Late Neolithic in Central and Northern Europe. J Archaeol Sci. 2007;34(8):1339–1345. [Google Scholar]
  • 5.Soares P, et al. The archaeogenetics of Europe. Curr Biol. 2010;20(4):R174–R183. doi: 10.1016/j.cub.2009.11.054. [DOI] [PubMed] [Google Scholar]
  • 6.Currat M, Excoffier L. The effect of the Neolithic expansion on European molecular diversity. Proc Biol Sci. 2005;272(1564):679–688. doi: 10.1098/rspb.2004.2999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Rubin Z. The Mediterranean and the dilemma of the Roman Empire in late Antiquity. Mediterr Hist Rev. 1986;1(1):13–62. [Google Scholar]
  • 8.Behar DM, et al. The genome-wide structure of the Jewish people. Nature. 2010;466(7303):238–242. doi: 10.1038/nature09103. [DOI] [PubMed] [Google Scholar]
  • 9.Novembre J, et al. Genes mirror geography within Europe. Nature. 2008;456(7218):98–101. doi: 10.1038/nature07331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Tian C, et al. Analysis and application of European genetic substructure using 300 K SNP information. PLoS Genet. 2008;4(1):e4. doi: 10.1371/journal.pgen.0040004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Yunusbayev B, et al. The Caucasus as an asymmetric semipermeable barrier to ancient human migrations. Mol Biol Evol. 2012;29(1):359–365. doi: 10.1093/molbev/msr221. [DOI] [PubMed] [Google Scholar]
  • 12.Lao O, et al. Correlation between genetic and geographic structure in Europe. Curr Biol. 2008;18(16):1241–1248. doi: 10.1016/j.cub.2008.07.049. [DOI] [PubMed] [Google Scholar]
  • 13.Seldin MF, et al. European population substructure: Clustering of northern and southern populations. PLoS Genet. 2006;2(9):e143. doi: 10.1371/journal.pgen.0020143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Need AC, Kasperaviciute D, Cirulli ET, Goldstein DB. A genome-wide genetic signature of Jewish ancestry perfectly separates individuals with and without full Jewish ancestry in a large random sample of European Americans. Genome Biol. 2009;10(1):R7. doi: 10.1186/gb-2009-10-1-r7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Sabatti C, et al. Genome-wide association analysis of metabolic traits in a birth cohort from a founder population. Nat Genet. 2009;41(1):35–46. doi: 10.1038/ng.271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Price AL, et al. The impact of divergence time on the nature of population structure: An example from Iceland. PLoS Genet. 2009;5(6):e1000505. doi: 10.1371/journal.pgen.1000505. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.O’Dushlaine C, et al. Genes predict village of origin in rural Europe. Eur J Hum Genet. 2010;18(11):1269–1270. doi: 10.1038/ejhg.2010.92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Pereira L, Cunha C, Alves C, Amorim A. African female heritage in Iberia: A reassessment of mtDNA lineage distribution in present times. Hum Biol. 2005;77(2):213–229. doi: 10.1353/hub.2005.0041. [DOI] [PubMed] [Google Scholar]
  • 19.Salas A, et al. The African diaspora: Mitochondrial DNA and the Atlantic slave trade. Am J Hum Genet. 2004;74(3):454–465. doi: 10.1086/382194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Cerezo M, et al. Reconstructing ancient mitochondrial DNA links between Africa and Europe. Genome Res. 2012;22(5):821–826. doi: 10.1101/gr.134452.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Cruciani F, et al. Tracing past human male movements in northern/eastern Africa and western Eurasia: New clues from Y-chromosomal haplogroups E-M78 and J-M12. Mol Biol Evol. 2007;24(6):1300–1311. doi: 10.1093/molbev/msm049. [DOI] [PubMed] [Google Scholar]
  • 22.Adams SM, et al. The genetic legacy of religious diversity and intolerance: Paternal lineages of Christians, Jews, and Muslims in the Iberian Peninsula. Am J Hum Genet. 2008;83(6):725–736. doi: 10.1016/j.ajhg.2008.11.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Auton A, et al. Global distribution of genomic diversity underscores rich complex history of continental human populations. Genome Res. 2009;19(5):795–803. doi: 10.1101/gr.088898.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Moorjani P, et al. The history of African gene flow into Southern Europeans, Levantines, and Jews. PLoS Genet. 2011;7(4):e1001373. doi: 10.1371/journal.pgen.1001373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Jensen JL, Bohonak AJ, Kelley ST. Isolation by distance, web service. BMC Genet. 2005 doi: 10.1186/1471-2156-6-13. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES