Abstract
The American continent currently accounts for ∼15% of the world population. Although first settled thousands of years ago and fitting its label as “the New World,” the European colonial expansion initiated in the late 15th century resulted in people from virtually every corner of the globe subsequently settling in the Americas. The arrival of large numbers of immigrants led to a dramatic decline of the Native American population and extensive population mixing. A salient feature of the current human population of the Americas is, thus, its great diversity. The genetic variation of the Native peoples that recent immigrants encountered had been shaped by demographic events acting since the initial peopling of the continent. Similarly, but on a compressed timescale, the colonial history of the Americas has had a major impact on the genetic makeup of the current population of the continent. A range of genetic analyses has been used to study both the ancient settlement of the continent and more recent history of population mixing. Here, I show how these two strands of research overlap and make use of results from other scientific disciplines to produce a fuller picture of the settlement of the continent at different time periods. The biological diversity of the Americas also provides prominent examples of the complex interaction between biological and social factors in constructing human identities and of the difficulties in defining human populations.
Studies of human genetic diversity at increasing levels of resolution have shed light on (1) the original ancient settlement of the American continent, and (2) the more recent population mixing due to European colonial expansion.
A multiplicity of research approaches have been used to explore the original settlement of the American continent, often focusing on three prominent questions: (1) the route of entry of the initial settlers, (2) their time of arrival, and (3) the pattern of subsequent migration. These questions have been approached with variable degrees of success using various types of genetic markers examined in “Native” populations, defined on anthropological grounds (particularly language). Early studies used information from blood groups and proteins (Cavalli-Sforza et al. 1994) and were followed by DNA analyses mainly of mitochondrial DNA (mtDNA) (Forster et al. 1996; Tamm et al. 2007; Fagundes et al. 2008; Kitchen et al. 2008) and the Y chromosome (Lell et al. 1997; Bianchi et al. 1998; Karafet et al. 1999; Bortolini et al. 2003). The more recent studies have examined the human genome at increasing levels of resolution, from analyses with restricted sets of markers (Wang et al. 2007; Ray et al. 2010) to ongoing studies based on full genome sequences. Although a range of scenarios for the initial peopling of the Americas have been envisaged, genetic evidence points to the continent being settled by people migrating into the northwestern tip of the continent from Asia. This migration would have been facilitated by the existence, at that time, of a land bridge connecting Siberia to Alaska, which later was submerged beneath the Bering Strait by the rising sea level at the end of the last glaciation, around 15,000 years ago (Fiedel 2000). Genetic support for an American settlement from Eastern Siberia includes the finding that Native Americans are genetically most similar to North Asians (Cavalli-Sforza et al. 1994; Wang et al. 2007) and the existence of a gradient of declining genetic diversity from northwest North America southward (Wang et al. 2007; Reich et al. 2012). This gradient extends beyond that seen in the “Old World” for populations at increasing distance from Africa, possibly resulting from a sequence of population contractions that occurred as small groups of humans moved from settled areas into uninhabited territories (Ramachandran et al. 2005; Handley et al. 2007; Wang et al. 2007). The American continent, being the last major landmass to have been settled by humans, shows a low genetic diversity as compared with all other continents (Wang et al. 2007).
Estimating the date of the initial settlement of the Americas has proven a difficult and contentious issue. Geological information provides a key reference point in that because of extensive ice sheets covering North America at the peak of the last glaciation (around 20,000 years ago), the continent would have been impenetrable then (Fig. 1). Therefore, this leaves two broad opportunities for settlement: before or after this last glacial maximum (LGM). Calculating the time of initial settlement of the continent from genetic information requires a number of assumptions of which the exact validity is difficult to assess, including variation in factors such as population demography, mutation rates, and the influence of selection. Perhaps, not surprisingly, the range of genetic estimates for the time of human settlement of America is quite wide, extending to both sides of the glacial maximum. It is, however, encouraging that most of the recent estimates, based on increasingly larger amounts of data and more sophisticated statistical methods, point to a settlement not long after the LGM. These estimates show greater consistency with the archaeological evidence, which, although itself not devoid of controversy, points to a human presence in the Americas by ∼14,000 years ago.
The pattern of migration into the continent has also been the subject of considerable disagreement. An influential model put forward in the mid-1980s posited that the settlement of the continent occurred in three sequential migratory waves from Asia, corresponding to the three major linguistic stocks in which the linguist Joseph Greenberg classified Native American languages (Greenberg et al. 1986; Greenberg 1987; Ruhlen 1991). The first migration would have given rise to a very large Amerind linguistic family comprising populations living all over the continent, whereas two subsequent migrations, restricted to North America and the Arctic, would be associated with populations speaking languages of the Na-Dene and Eskimo-Aleut linguistic families, respectively. Although early blood group and protein data were interpreted in support of the Greenberg model (Cavalli-Sforza et al. 1994), subsequent mtDNA and Y-chromosome analyses have been mostly interpreted as indicative of a single migration wave into the continent (Bonatto and Salzano 1997; Tamm et al. 2007; Fagundes et al. 2008; Kitchen et al. 2008). The recent genome-wide surveys of diversity with increasing resolution have, however, provided a different view. These are inconsistent with the single migration model and are more in line with the occurrence of multiple migrations (Fig. 1). Particularly strong support for several ancient migrations comes from a study based on a large survey of populations and using data for hundreds of thousands of genetic markers. With this type of data, it is possible to estimate the ancestry of every segment of DNA along the genome and state whether such a segment is of African, European, or Native American origin. Analyses can then focus only on the Native American segments of the genome (Fig. 2). This means that Native American individuals, and populations, that previously had to be excluded from study because of admixture with non-Natives can now be included, facilitating a more extensive population survey and reducing bias. These recent data have provided strong evidence that the Eskimo-Aleut, Na-Dene, and Amerind linguistic groups show evidence of differential gene flow from Asia, inconsistent with stemming from a discrete single colonization event with no subsequent migration (Fig. 1). Noticeably, although North American populations show evidence of multiple episodes of gene exchange with Asia, Native populations from Mexico to the Southern tip of South America appear to stem from one colonization wave with no subsequent Asian gene flow. This observation agrees with the highly controversial proposal of grouping widely separated Native American languages into a single “Amerindian” linguistic family (Greenberg 1987; Ruhlen 1991). These data also confirm the correlation of population diversity with distance from the Bering Strait, in agreement with settlement in a north-to-south direction. Interestingly, this correlation increases when considering the coasts as facilitators of population movement, suggesting an important role of the coast during the initial population dispersals on the continent. A phylogenetic tree relating the Native American populations examined in that survey is also consistent with the north-to-south settlement of the continent, as it shows a sequence of major population splits separating groups of populations mostly along a north-to-south axis (Figs. 1 and 3). Consistent with some degree of parallel evolution for languages and genes (Cavalli-Sforza et al. 1994), resulting from population separation followed by relative isolation, the major clusters of populations in this genetic tree show a broad correspondence with the linguistic affiliation of the populations (Fig. 3).
The recent study by Reich et al. (2012) illustrates the potential of high-density genotyping for extending studies focused on the original settlement of the Americas to Native individuals with evidence of admixture with recent immigrants. This admixture is extensive across the continent and involves not only Natives but also the general population, particularly in the countries of what is now referred to as Latin America. Historically, a major driver behind population mixing in this region was the fact that immigrants from Spain and Portugal, particularly in the early phases of the colonial expansion, were mostly men (Boyd-Bowman 1973). It is well documented that many Conquistadors had children with Native women, the most famous example possibly being that of the Conquistador of Mexico, Hernán Cortez, and the Nahua woman known as “Malinche” (Fig. 4). This “sex-biased” pattern of mating between immigrant men and Native women had been alluded to by historians (Morner 1967), but it was only with mtDNA and Y-chromosome studies that the full genetic impact of this feature became apparent. Because mtDNA and the Y chromosome are only transmitted by mothers and fathers, respectively, they allow the inference of the maternal (mtDNA) and paternal (Y-chromosome) ancestry of individuals. One of the first such studies was performed in Antioquia (Colombia), a population traditionally considered as mainly of Spanish descent. Consistent with this view, it was found that >90% of men in Antioquia had Y-chromosome lineages of European origin (Carvajal-Carmona et al. 2000). Surprisingly, when examining their mtDNA, a sharply different picture was observed. In 90% of individuals, maternal ancestry was Native American (Carvajal-Carmona et al. 2000). Similar observations have now been made in many Latin American countries (Alves-Silva et al. 2000; Green et al. 2000; Carvalho-Silva et al. 2001; Marrero et al. 2007), although with a considerable variation in ancestry proportions between them (Fig. 5). These studies, in addition, show a higher African ancestry with mtDNA than the Y chromosome, indicating that, historically, admixture with Africans has also mostly involved African women.
The large variation in ancestry seen across Latin America relates to differences in pre-Columbian Native population density and the pattern of recent immigration into specific regions of the continent. For instance, most studies performed so far have mainly focused on areas with little documented African immigration and consistently show a relatively low African genetic ancestry. In these population samples, the variation in individual European and Native American ancestry is very large, to the extent that it overlaps with that seen in Native population samples (Fig. 6). The variation in individual ancestry seen in these samples thus effaces their designation as “Native” or “non-Native.” This observation punctuates the interest of incorporating admixed Latin American populations, traditionally considered non-Native, into studies on the initial settlement of the continent. Similar to what has been performed in the recent survey by Reich et al. (2012), the inference of ancestry of each genome segment in Latin Americans could be used to focus solely on Native American segments of the genome. This is an avenue of research that is just beginning to be explored and shows great potential for the future. It promises to be of particular importance for the analysis of regions where anthropologically recognizable Native populations and individuals are virtually nonexistent, as they have been absorbed into the current mixed population. This is the case for the many areas that were relatively sparsely populated in pre-Columbian times and, subsequently, received a large flow of immigrants, such as from the Caribbean and many parts of North and South America. Consequently, estimation of individual ancestry along the genome will facilitate denser demographic history analyses across the Americas, as well as a reexamination of the original settlement of the continent based on a more comprehensive population sampling.
Other than being informative for addressing questions of population history, the study of Latin American populations promises to facilitate the genetic characterization of biological attributes differentiated among the populations that participated in admixture on the continent. For instance, a range of facial features differ between Native Americans and Europeans, and the genetic study of admixed Latin Americans promises to help in the identification of genes explaining variation in facial appearance. Such research is of interest for understanding disorders of craniofacial development and could also have forensic applications. Another example is type 2 diabetes (T2D), a disease that has a very high frequency in Native Americans and for which a higher risk is associated with increased Native American ancestry. This observation led to the proposal of the “thrifty genotype” hypothesis, which posits that the increased risk of T2D in Native Americans results from genetic adaption to a low-calorie/high-exercise way of life that became detrimental with the recent change to a high-calorie/low-exercise lifestyle (Pollard 2008). The study of large, carefully characterized samples from Latin American populations offers a unique opportunity for conducting a detailed assessment of this hypothesis. The identification of genes explaining the variable frequency of diseases between populations (such as T2D) will be an important step forward in the development of novel, more effective (even individualized) disease-management strategies that account for human population diversity.
The overlap of individual genetic ancestry estimates, seen in Latin and Native American populations (Fig. 5), raises the question of the relationship of these estimates to the perception that individuals have of their own ancestry. A recent analysis of a large sample of individuals from five Latin American countries found a highly significant correlation between self-perception and genetically estimated ancestry (Fig. 7). However, this study also found evidence that self-perception is biased. A particularly clear bias involves pigmentation: individuals with greater pigmentation tend to overestimate their Native and African ancestry, whereas individuals with lighter pigmentation tend to overestimate their European ancestry (AR Ruiz-Linares, in press). Statistically significant differences were also observed between countries, pointing to the influence of social factors in self-perception. Consistent with this observation, social scientists have argued that, in Latin America, self-identification as Native or non-Native is often strongly influenced by social cues (Wade 2010).
The insights into the initial settlement of the continent provided by the genetic study of Native Americans illustrate the fact that a population sampling that maximizes diversity based on anthropological grounds (such as language) can facilitate investigations concerned with the first settlement of the continent. However, the analysis of intercontinental admixture both in Native and non-Native Latin American populations show some of the complexities of defining human groups, with population labels suggesting a potentially misleading genetic singularity. The biological reality is that of a gradient in the genetic makeup of these populations (and individuals) involving various degrees of mixture between the initial settlers of the continent and more recent immigrants. The genetic diversity of Latin Americans is, thus, a prominent example of the fuzzy meaning of the labels used to refer to human populations. Although these labels can assist in study design and facilitate certain historical inferences, ethnicity, race, and other such terms are social constructs devoid of a clear-cut biological meaning.
Footnotes
Editors: Aravinda Chakravarti
Additional Perspectives on Human Variation available at www.cshperspectives.org
REFERENCES
- Alves-Silva J, da Silva Santos M, Guimarães PE, Ferreira AC, Bandelt HJ, Pena SD, Prado VF. 2000. The ancestry of Brazilian mtDNA lineages. Am J Hum Genet 67: 444–461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bianchi NO, Catanesi CI, Bailliet G, Martinez-Marignac VL, Bravi CM, Vidal-Rioja LB, Herrera RJ, López-Camelo JS. 1998. Characterization of ancestral and derived Y-chromosome haplotypes of New World native populations. Am J Hum Genet 63: 1862–1871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonatto SL, Salzano FM. 1997. A single and early migration for the peopling of the Americas supported by mitochondrial DNA sequence data. Proc Natl Acad Sci 94: 1866–1871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bortolini MC, Salzano FM, Thomas MG, Stuart S, Nasanen SP, Bau CH, Hutz MH, Layrisse Z, Petzl-Erler ML, Tsuneto LT, et al. 2003. Y-chromosome evidence for differing ancient demographic histories in the Americas. Am J Hum Genet 73: 524–539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boyd-Bowman P. 1973. Patterns of Spanish emigration to the New World (1493–1580). Council on International Studies, State University of New York at Buffalo, Buffalo, NY. [Google Scholar]
- Carvajal-Carmona LG, Soto ID, Pineda N, Ortíz-Barrientos D, Duque C, Ospina-Duque J, McCarthy M, Montoya P, Alvarez VM, Bedoya G, et al. 2000. Strong Amerind/white sex bias and a possible Sephardic contribution among the founders of a population in northwest Colombia. Am J Hum Genet 675: 1287–1295. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carvalho-Silva DR, Santos FR, Rocha J, Pena SD. 2001. The phylogeography of Brazilian Y-chromosome lineages 1. Am J Hum Genet 68: 281–286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cavalli-Sforza LL, Menozzi P, Piazza A. 1994. The history and geography of human genes. Princeton University Press, Princeton, NJ. [Google Scholar]
- Fagundes NJ, Kanitz R, Eckert R, Valls AC, Bogo MR, Salzano FM, Smith DG, Silva WA Jr, Zago MA, Ribeiro-dos-Santos AK, et al. 2008. Mitochondrial population genomics supports a single pre-Clovis origin with a coastal route for the peopling of the Americas. Am J Hum Genet 82: 583–592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiedel SJ. 2000. The peopling of the New World: Present evidence, new theories, and future directions. J Archaeol Res 8: 39–103. [Google Scholar]
- Forster P, Harding R, Torroni A, Bandelt HJ. 1996. Origin and evolution of Native American mtDNA variation: A reappraisal. Am J Hum Genet 59: 935–945. [PMC free article] [PubMed] [Google Scholar]
- Green LD, Derr JN, Knight A. 2000. mtDNA affinities of the peoples of North-Central Mexico. Am J Hum Genet 66: 989–998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greenberg JH. 1987. Language in the Americas. Stanford University Press, Stanford, CA. [Google Scholar]
- Greenberg JH, Turner CG II, Zegura SL. 1986. The settlement of the Americas: A comparison of the linguistic, dental, and genetic evidence. Curr Anthropol 27: 477–497. [Google Scholar]
- Handley LJ, Manica A, Goudet J, Balloux F. 2007. Going the distance: Human population genetics in a clinal world. Trends Genet 23: 432–439. [DOI] [PubMed] [Google Scholar]
- Karafet TM, Zegura SL, Posukh O, Osipova L, Bergen A, Long J, Goldman D, Klitz W, Harihara S, de Knijff P, et al. 1999. Ancestral Asian source(s) of new world Y-chromosome founder haplotypes. Am J Hum Genet 64: 817–831. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kitchen A, Miyamoto MM, Mulligan CJ. 2008. A three-stage colonization model for the peopling of the Americas. PLoS ONE 3: e1596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lell JT, Brown MD, Schurr TG, Sukernik RI, Starikovskaya YB, Torroni A, Moore LG, Troup GM, Wallace DC. 1997. Y chromosome polymorphisms in Native American and Siberian populations: Identification of Native American Y chromosome haplotypes. Hum Genet 100: 536–543. [DOI] [PubMed] [Google Scholar]
- Marrero AR, Bravi C, Stuart S, Long JC, Pereira das Neves Leite F, Kommers T, Carvalho CM, Pena SD, Ruiz-Linares A, Salzano FM, et al. 2007. Pre- and post-Columbian gene and cultural continuity: The case of the Gaucho from southern Brazil. Hum Hered 64: 160–171. [DOI] [PubMed] [Google Scholar]
- Morner M. 1967. Race mixture in the history of Latin America. Little Brown, New York. [Google Scholar]
- Pollard TM. 2008. Western diseases: An evolutionary perspective. Cambridge University Press, New York. [Google Scholar]
- Ramachandran S, Deshpande O, Roseman CC, Rosenberg NA, Feldman MW, Cavalli-Sforza LL. 2005. Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. Proc Natl Acad Sci 102: 15942–15947. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ray N, Wegmann D, Fagundes NJ, Wang S, Ruiz-Linares A, Excoffier L. 2010. A statistical evaluation of models for the initial settlement of the American continent emphasizes the importance of gene flow with Asia. Mol Biol Evol 27: 337–345. [DOI] [PubMed] [Google Scholar]
- Reich D, Patterson N, Campbell D, Tandon A, Mazieres S, Ray N, Parra MV, Rojas W, Duque C, Mesa N, et al. 2012. Reconstructing Native American population history. Nature 488: 370–374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ruhlen M. 1991. A guide to the world’s languages. Stanford University Press, Stanford, CA. [Google Scholar]
- Tamm E, Kivisild T, Reidla M, Metspalu M, Smith DG, Mulligan CJ, Bravi CM, Rickards O, Martinez-Labarga C, Khusnutdinova EK, et al. 2007. Beringian standstill and spread of Native American founders. PLoS ONE 2: 1–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wade P. 2010. Race and ethnicity in Latin America. Pluto, London. [Google Scholar]
- Wang S, Lewis CM, Jakobsson M, Ramachandran S, Ray N, Bedoya G, Rojas W, Parra MV, Molina JA, Gallo C, et al. 2007. Genetic variation and population structure in native Americans. PLoS Genet 3: e185. [DOI] [PMC free article] [PubMed] [Google Scholar]