Investigative Genetics. 2014 Nov 30;5:15. doi: 10.1186/s13323-014-0015-6

Different waves and directions of Neolithic migrations in the Armenian Highland

Anahit Hovhannisyan 1, Zaruhi Khachatryan 1, Marc Haber 2, Peter Hrechdakian 3, Tatiana Karafet 4, Pierre Zalloua 5,6, Levon Yepiskoposyan 1
PMCID: PMC4249771  PMID: 25452838

Abstract

Background

The peopling of Europe and the nature of the Neolithic agricultural migration as a primary issue in the modern human colonization of the globe is still widely debated. At present, much uncertainty is associated with the reconstruction of the routes of migration for the first farmers from the Near East. In this context, hospitable climatic conditions and the key geographic position of the Armenian Highland suggest that it may have served as a conduit for several waves of expansion of the first agriculturalists from the Near East to Europe and the North Caucasus.

Results

Here, we assess Y-chromosomal distribution in six geographically distinct populations of Armenians that roughly represent the extent of historical Armenia. Using the general haplogroup structure and the specific lineages representing putative genetic markers of the Neolithic Revolution, haplogroups R1b1a2, J2, and G, we identify distinct patterns of genetic affinity between the populations of the Armenian Highland and the neighboring ones north and west from this area.

Conclusions

Based on the results obtained, we offer new insight into the different routes and waves of Neolithic expansion of the first farmers through the Armenian Highland. We detected at least two principal migratory directions: (1) westward alongside the coastline of the Mediterranean Sea and (2) northward to the North Caucasus.

Electronic supplementary material

The online version of this article (doi:10.1186/s13323-014-0015-6) contains supplementary material, which is available to authorized users.

Keywords: Armenian Highland, Y chromosome, Neolithic migration

Background

The large-scale transition from hunter-gathering to farming, known as the Neolithic Revolution, is broadly recognized as one of the crucial demographic events in human prehistory. It is considered that the advent of the Neolithic lifestyle, which is characterized by the dominance of settlement sedentism and the domestication of wild animals and plants, led to obvious advantages of farmers over hunter-gatherers and, in particular, drove dramatic human population growth and dispersal [1-3].

Archaeological research has uncovered the independent emergence of agricultural homelands in many parts of the world at different subsequent times, initially ranging between approximately 10 and 5 KYA [2,4]. In terms of chronology, the Fertile Crescent, the region in the Middle East, spanning the Zagros Mountains of Iran and Southern Mesopotamia northward to Southeast Anatolia, is widely recognized as the earliest farming center where agriculture is known to have originated, dating to around 10 KYA [5,6]. From the Fertile Crescent, human populations, with their cultural resources and languages, migrated towards various destinations, including Europe, currently the most thoroughly investigated region by archaeologists and geneticists [3,7].

Since the advance of molecular techniques, genetic studies have been extensively applied to disentangle a long-standing question about the nature of the spread of agriculture from the Fertile Crescent [8-11]. Under the demic diffusion model [5,8,12], the extant genetic diversity of Europeans would have resulted mainly from the gene pool of the Near Eastern Neolithic farmers, whereas the cultural diffusion model asserts that European lineages descend mainly from indigenous hunter-gatherers [13-15]. In general, genetic studies based on different nuclear, mitochondrial, and Y-chromosomal markers and ancient DNA analysis differ considerably in their evaluation of the contribution of Paleolithic hunter-gatherers and Neolithic farmers to the composition of the modern European gene pool [16,17]. Recent discoveries indicating that a third population, the Northern Eurasians, contributed its genetic legacy to modern Europeans have further added to the complexity of these models [18]. Overall, previous studies highlight the entanglement and complexity of such historical events as farming dispersal and, ultimately, the peopling of Europe. The intricacies of these migratory events with varying patterns of cultural and demographic diffusion in different regions require the development of relevant models reflecting the process of Neolithic dispersal throughout Eurasia [7].

Despite the fast-growing application of the whole genomic sequencing approach on the reconstruction of human population history, convenient polymorphic markers of the non-recombining portion of the Y chromosome (NRY) still remain an indispensable and relatively simple tool for the patrilineal study of complex historic migration events that influenced modern-day Europeans’ genetic diversity [19-21]. In particular, relatively stable (in evolutionary terms) single-nucleotide polymorphisms (SNPs) with Y-chromosomal haplogroup defining characteristics and more rapidly mutating short tandem repeats (STRs) on the NRY locus are used in population genetic surveys for the detection of diversity among and within the studied populations [20]. Furthermore, among the useful features of the Y chromosome is its high level of geographic stratification and diversification, providing more specific inferences concerning population movement [22,23]. In addition to the frequency of classical genetic markers, the distribution of Y-chromosomal haplogroups shows broad clines across Europe, which was characterized as one of the main features of the European genetic landscape and regarded as evidence for the demic diffusion model [5]. Moreover, previous studies of Y-chromosomal haplogroup distribution reveal that the majority of contemporary European lineages fall into the haplogroups E, G, I, N, and R [20,24,25]. Further, it has been suggested that some Y-chromosomal haplogroups serve as specific markers of the Neolithic migration involving the first farmers from the Fertile Crescent, namely, E1b1b1-M35, J2-M172, G-M201, and R1b1a2-M269 lineages [22,24-26]. In particular, haplogroup R1b1a2-M269 is the most common Y-chromosomal lineage in Europe, encountered in 110 million European men, and increases in frequency westward [27,28]. Lately, the question of whether its origins were in the Paleolithic or Neolithic periods has become the subject of intense debate. In this context, Busby et al. claim that the existing data and methods are not capable of unambiguously estimating the age of its origin and the directions of its migration [29]. However, in some recent works, the observed explicit frequency cline of the haplogroup R1b1a2-M269 from Anatolia to Western Europe and its associated haplotype diversity cline in the opposite direction suggest that the lineage may have spread towards Europe with the migration of Neolithic farmers from the Near East [24,28]. Conversely, Y-chromosomal haplogroups G-M201 and J2-M172 are widely distributed in populations of the Caucasus, Near/Middle East, and Southern Europe, with the highest frequency in the North Caucasus [30,31]. These studies, however, did not consider the populations from the eastern regions of modern Turkey and the South Caucasus, roughly corresponding to the boundaries of the Armenian Highland, which could have served as a potential corridor for various Neolithic migrations.

Located at the crossroads of Europe and the Middle East, the Armenian Highland was a conduit for major waves of prehistoric and historic migrations [32], as well as a cradle for various ancient civilizations [33]. The unique geographic location of the plateau has garnered a great deal of scientific interest as a potential link between eastern and western Eurasian populations. Moreover, the variable climatic diversity and proximity to the Fertile Crescent likely contributed to the post-Last Glacial Maximum (LGM) Neolithic resettlements of the Armenian plateau, particularly by the first farmers from the Near East [32,34,35]. Dozens of archaeological and archaeobotanical artifacts related to agriculture and animal husbandry were discovered from the region, being consistent with the critical role of the Armenian Highland in the Neolithic farming migration from the Near East to Europe and the North Caucasus [36-38]. Though the area within the plateau is currently being studied by archaeologists, there is no convincing data enabling a proper description of the generalized pattern of Neolithic migrations through this region. However, it is possible to bridge this gap by applying the genetic study of populations indigenous to this geographic area. Here, we intended to identify the possible directions and waves of Neolithic migrations that had taken place via the Armenian Highland. To test the role of the region in the spread of Neolithic farmers, we studied the spatial frequency and diversity distribution of Y-chromosomal markers (drawing special attention to those linked with the spread of agriculturalists) in six geographically distinct Armenian populations, roughly covering the whole expanse of the Armenian Highland. 
Recently published genome-wide study results showing the absence of any significant admixture for Armenians over the past 4 KYA [39] justify using this population as a reference group for addressing the issue of Neolithic migration from the Near East to Europe and the North Caucasus.

Methods

Samples

Buccal swabs were collected with informed consent from a total of 757 unrelated (at the paternal grandfather level) self-identified ethnic Armenian males, representing four geographically distinct Armenian regions of the historical expanse of Armenia. These regions include Salmast (n = 199), eastern (Karabakh and Syunik) (n = 210), central (Alashkert and Bayazet) (n = 200), and western (n = 148) parts of the Armenian plateau. All subjects were informed about the aim of this study and gave their consent to participate. The study protocol was approved by the Ethics Committee of the Institute of Molecular Biology NAS RA (IORG number 0003427, Assurance number FWA00015042, and IRB number 00004079). Further, in order to roughly encompass the whole region for analysis, we used previously published data for Van (n = 103), Sasun (n = 104), the Ararat Valley (n = 110), and Gardman (n = 96) [35], with the latter two, along with Karabakh and Syunik, then included in one group representing the eastern part of the Armenian Highland (Figure 1). To assess the frequency and diversity distribution of encountered Y-chromosomal haplogroups, we combined our data with previously published comparative datasets representing the Near East, the North Caucasus, and Europe. Overall, the present study comprises data from 35 populations (see Additional file 1).

Figure 1. Geographic locations of the Armenian populations studied.

Y-SNP and Y-STR genotyping

The genotyping was performed in a hierarchical manner for the Y-chromosomal binary (SNP) markers and for STRs (see Additional file 2). The samples of western and central Armenia, Karabakh, and Syunik were genotyped at the Lebanese American University for 32 SNPs and 17 STRs. The genotyping of Salmast specimens was performed at the University of Arizona for 44 SNPs and 14 STRs. Nomenclature of Y-chromosomal haplogroups was assigned in accordance with ISOGG 2014 (http://www.isogg.org). In order to unify the number of haplogroups and STR markers for comparative analysis, we used 24 haplogroups for analysis within the Armenian populations (Figure 2), nine haplogroups in comparison with other ethnic groups (see Additional file 3), and the following eight common STRs for all other cross-comparisons: DYS19, DYS389I, DYS389b, DYS390, DYS391, DYS392, DYS393, and DYS439.

Figure 2. Phylogenetic relationships and Y-chromosome haplogroup frequencies in six Armenian populations.

Data analysis

Measures of pairwise genetic distances (FST) were calculated using the software package Arlequin 3.5 [40]. We also estimated the intra-population locus-specific variance, VL, and the intra-population genetic variance, VP, according to the formulae given in Kayser et al. [41]. Frequencies and microsatellite variances of the haplogroups were displayed using Surfer 10 (Golden Software) by the gridding method. Latitude and longitude values were calculated for the geographic centers of the sampling regions. Principal coordinate analysis (PCoA) was performed on distance matrices based on FST genetic distances using Genstat software. The phylogenetic relationships among eight-locus haplotypes from equal numbers of individuals from different populations within the haplogroups R1b1a2, J2, and G were ascertained using NETWORK 4.6.1.0 (available at http://www.fluxus-engineering.com) and Network Publisher software. Median-joining networks were generated by processing haplotypes with the reduced-median algorithm, followed by the median-joining method, with STR loci weighted in inverse proportion to their repeat variance. GENE-E software was used to graphically represent genetic similarities between populations by color coding pairwise FST values on a heatmap. To estimate differences in the haplogroup composition of the regions, correspondence analysis was conducted using the SPSS ver. 19 software package (SPSS Inc.).
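As a rough illustration of what a frequency-based FST measures, the sketch below computes a simplified pairwise statistic of the form (HT − HS)/HT from haplogroup counts. This is not Arlequin's estimator, which is not reproduced here; the function names and the toy counts are hypothetical and serve only to show the idea that populations with similar haplogroup composition yield values near zero.

```python
import numpy as np

def gene_diversity(freqs):
    """Expected diversity 1 - sum(p_i^2) for one haplogroup frequency vector."""
    freqs = np.asarray(freqs, dtype=float)
    return 1.0 - np.sum(freqs ** 2)

def pairwise_fst(counts_a, counts_b):
    """Simplified pairwise FST = (HT - HS) / HT (assumption: not Arlequin's method)."""
    pa = np.asarray(counts_a, dtype=float) / np.sum(counts_a)
    pb = np.asarray(counts_b, dtype=float) / np.sum(counts_b)
    hs = 0.5 * (gene_diversity(pa) + gene_diversity(pb))  # mean within-population diversity
    ht = gene_diversity(0.5 * (pa + pb))                  # diversity of the pooled frequencies
    return 0.0 if ht == 0 else (ht - hs) / ht

def fst_matrix(counts):
    """Symmetric matrix of pairwise values; rows are populations, columns haplogroups."""
    counts = np.asarray(counts, dtype=float)
    n = counts.shape[0]
    fst = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            fst[i, j] = fst[j, i] = pairwise_fst(counts[i], counts[j])
    return fst

# Hypothetical haplogroup counts for three populations (not data from this study).
toy_counts = [[30, 20, 10], [28, 22, 10], [5, 15, 40]]
toy_fst = fst_matrix(toy_counts)
```

In this toy input the first two populations have nearly the same composition, so their pairwise value is much smaller than either one's distance to the third.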

Results and discussion

Y-haplogroup frequency distribution

The phylogenetic relationships of Y-chromosomal markers and frequency distribution of the defined 24 haplogroups in the six Armenian populations are shown in Figure 2. The haplogroup R1b1a2-M269 is the most frequently encountered subclade in all Armenian samples, except Sasun, which differs from the others due to the predominance of haplogroup T (20%) [35]. Of the lineages within haplogroup R, its subclade R1a1a-M198 is linked to the spread of Indo-Aryan languages [42] and is detected at low frequencies or is even absent in the analyzed populations. The majority of the J-M304 samples belong to its J2-M172 branch, though in the population of Salmast, there is a nearly equal frequency distribution of J2 and J1 lineages. The haplogroup G is also observed at relatively high frequencies in all Armenian samples (Figure 2). On the whole, the results of analysis of patrilineal lineages revealed a prevalence of the Y-chromosomal haplogroups associated with the arrival of Neolithic farmers from the Near East. Three prospective genetic markers of agricultural migration, namely, the haplogroups J2, G, and R-M269, represent the most common lineages in all six Armenian populations, together accounting for 49%–70% of the sampled groups. It has previously been proposed that the wide presence of genetic markers attributed to agriculturalists, coupled with Neolithic archaeological artifacts, indicates continuous habitation of the Armenian Highland since the dawn of the Neolithic [32,35].

To obtain insight towards the question of the directions of movement for agriculturalists from the Near East, we used the PCoA method to visualize the FST genetic distances (based on absolute haplogroup frequencies in Additional file 3) between the Armenian and comparative datasets from the Near East, the North Caucasus, and Europe (see Additional file 4, sheet 1). The PCoA plot shows strong regional clustering indicating the separation of the populations from the Near East and Eastern Europe from those of the North Caucasus and Western Europe (Figure 3). In this context, populations of the Armenian Highland, the Near East, and Eastern Europe appear to be in one extensive cluster with a clear geographic gradient from the Levant towards the northwest. In fact, the closest population to the Near Eastern groups is Cyprus, the region settled by Neolithic farmers from the mainland shortly after the emergence of agriculture [43]. Moreover, the population of Crete hosts one of the oldest Neolithic settlements of Europe and underwent an agricultural transition from either the Anatolian coast or by sea from the Levant approximately 7–8 KYA [3,44]. The Cretan population within the cluster is centrally located between the populations of the Near East and Europe. This pattern is in accordance with previously found genetic affinity between human remains from Neolithic sites (based on aDNA data) and the modern populations of Cyprus and Crete, suggesting the leading role of pioneer seafaring colonization in the expansion towards the rest of Europe [17,45]. Specifically, our results of the PCoA analysis support a key role for Crete in the spread of Neolithic farmers through maritime routes from the Near East to Europe, which is also confirmed by pairwise FST value comparisons based on haplogroup frequencies (see Additional file 4, sheet 1). 
The plot in Figure 3 clearly separates the western European and North Caucasus populations from each other and from the Armenian cluster. These overall results further support the role of the Armenian Highland as a corridor between the two aforementioned regions and the Near East.
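For readers unfamiliar with PCoA, the following is a minimal sketch of classical principal coordinate analysis applied to a pairwise distance matrix. It is not the Genstat procedure used in the study; the function name `pcoa`, the choice to square the distances before double-centering, and the toy matrix are assumptions made for illustration.

```python
import numpy as np

def pcoa(dist, n_axes=2):
    """Classical PCoA: double-center the squared distances, then eigendecompose."""
    d = np.asarray(dist, dtype=float)
    n = d.shape[0]
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (d ** 2) @ centering
    eigvals, eigvecs = np.linalg.eigh(gram)      # ascending order for symmetric input
    order = np.argsort(eigvals)[::-1]            # reorder to descending
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    keep = np.clip(eigvals[:n_axes], 0.0, None)  # negative eigenvalues carry no real axis
    coords = eigvecs[:, :n_axes] * np.sqrt(keep)
    explained = keep / eigvals[eigvals > 0].sum()
    return coords, explained

# Hypothetical distances between three populations lying on one line (0, 1, 3).
toy_dist = np.array([[0.0, 1.0, 3.0],
                     [1.0, 0.0, 2.0],
                     [3.0, 2.0, 0.0]])
toy_coords, toy_explained = pcoa(toy_dist)
```

Because the toy distances are exactly one-dimensional, the first axis recovers them and carries essentially all of the variation; real FST matrices are not Euclidean in general, which is why negative eigenvalues are clipped here.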

Figure 3. PCoA plot based on pairwise FST genetic distances calculated from haplogroup frequencies in the populations of this study. The plot is based on FST pairwise genetic distances calculated from frequencies of nine common Y-chromosomal haplogroups (E1b1b1-M35, E(xE1b1b1), G-M201, I-M170, J2-M172, J(xJ2), L-M20, R1b1a2-M269, R(xR1b1a2)).

In order to provide a potential genetic explanation for the classification presented in Figure 3, we have conducted a correspondence analysis (Figure 4) on the haplogroup frequency data in the populations studied (Additional file 3). On the whole, the patterns of population distribution for correspondence analysis and PCoA are nearly identical. The European cluster, containing Basques, Sicilians, and Belgians, is associated with the haplogroups R1b1a2-M269 and I-M170, both widely spread in Europe, the former being a marker for the Neolithic migration. The Caucasus cluster, comprising Abkhazians, Georgians, and Ossetians, is found to be connected to the haplogroup G-M201, which is also a marker for the Neolithic migration. The presence of the outlying Armenian population of Sasun in the vicinity of the Caucasus cluster could be explained by the geographic peculiarities of this high-mountainous group, which led to its genetic isolation from other Armenians during the intervening centuries. Completing the analysis of the haplogroups associated with the Neolithic agriculturalists, the lineage J2-M172 appears in between the European and Caucasus clusters.
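Correspondence analysis places populations and haplogroups in a shared low-dimensional space, so that a population plots near the haplogroups that are over-represented in it. The sketch below shows the standard computation through a singular value decomposition of the standardized residuals of a count table. It is not the SPSS routine used in the study, and the function name and toy counts are hypothetical.

```python
import numpy as np

def correspondence_analysis(table, n_axes=2):
    """Simple correspondence analysis of a populations x haplogroups count table.

    Assumes every row and every column has a non-zero total.
    """
    counts = np.asarray(table, dtype=float)
    p = counts / counts.sum()                     # correspondence matrix
    row_mass = p.sum(axis=1)
    col_mass = p.sum(axis=0)
    expected = np.outer(row_mass, col_mass)       # independence model
    resid = (p - expected) / np.sqrt(expected)    # standardized residuals
    u, sing, vt = np.linalg.svd(resid, full_matrices=False)
    rows = (u * sing) / np.sqrt(row_mass)[:, None]    # principal coordinates of populations
    cols = (vt.T * sing) / np.sqrt(col_mass)[:, None] # principal coordinates of haplogroups
    inertia = sing ** 2                               # inertia carried by each axis
    return rows[:, :n_axes], cols[:, :n_axes], inertia

# Hypothetical counts: three populations (rows) by three haplogroups (columns).
toy_table = np.array([[40, 10, 5],
                      [12, 30, 8],
                      [6, 9, 35]])
toy_rows, toy_cols, toy_inertia = correspondence_analysis(toy_table)
```

The total inertia equals the chi-square statistic of the table divided by the grand total, which gives a quick consistency check on any implementation.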

Figure 4. Correspondence analysis plot based on the haplogroup frequency data in the populations studied.

The results of the PCoA and correspondence analysis show that the haplogroup composition of the Near Eastern populations is very similar to that found for the populations from the Anatolian and Armenian plateaus, as well as those of the Mediterranean islands. This is highly suggestive of a lengthy genetic continuity, persistent since at least the Neolithic. Apparently, the population migration of the first farmers from the Levant could have been both by land to Anatolia and the North Caucasus, and by maritime routes via eastern Mediterranean islands towards continental Europe. This scenario is supported by the result of the comparison of FST genetic distance values based on the frequencies of all haplogroups identified (see Additional file 4), showing that the populations of the Armenian Highland display an intermediate position between the Near East and Europe, and the Near East and the North Caucasus. Though previous work based on 15 autosomal STR loci from four Armenian populations (Ararat Valley, Gardman, Sasun, and Van) [46] derived a potential Balkan origin for one of these locations (Van), the results of our analysis not only support the transition zone model of the Armenian Highlands but also the potential gene flow of some Neolithic markers, shared among Armenians and Balkan populations, from the Near East through this region.

Further, in order to obtain deeper insight into the relationships between the populations observed and to analyze possible routes of expansion, we separately assessed the distribution patterns of putative Y-chromosomal tracers of the spread of the first agriculturalists. The values of frequency and genetic variance within each haplogroup among considered populations are provided in Additional file 5. Pairwise FST genetic distances and their statistical significances between the considered populations based on STR distribution within the haplogroups R1b1a2, J2, and G are available in Additional file 4 (sheets 2–4).
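The variance measures reported in Additional file 5 can be illustrated with a short sketch. The formulae of Kayser et al. [41] are not reproduced in this article, so the code below assumes that VL is the sample variance of repeat counts at each STR locus and that VP is the mean of VL across loci. It also shows one way to derive locus weights inversely proportional to repeat variance, as described for the network analysis; the `scale` and `floor` parameters and the toy haplotypes are hypothetical.

```python
import numpy as np

def locus_variances(haplotypes):
    """V_L (assumed definition): sample variance of repeat counts per locus.

    Rows are individuals, columns are STR loci.
    """
    h = np.asarray(haplotypes, dtype=float)
    return h.var(axis=0, ddof=1)

def population_variance(haplotypes):
    """V_P (assumed definition): mean of the locus-specific variances."""
    return float(locus_variances(haplotypes).mean())

def inverse_variance_weights(haplotypes, scale=10.0, floor=1e-6):
    """Locus weights proportional to 1 / V_L, rescaled so the largest equals `scale`."""
    vl = np.maximum(locus_variances(haplotypes), floor)  # guard against zero variance
    raw = 1.0 / vl
    return scale * raw / raw.max()

# Hypothetical repeat counts at three loci for four individuals.
toy_haplotypes = [[14, 24, 10],
                  [14, 23, 10],
                  [15, 24, 11],
                  [14, 25, 10]]
toy_vl = locus_variances(toy_haplotypes)
toy_vp = population_variance(toy_haplotypes)
toy_weights = inverse_variance_weights(toy_haplotypes)
```

In the toy data the middle locus varies most and therefore receives the smallest weight, which mirrors the intent of down-weighting fast-mutating loci when building a network.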

Haplogroup R1b1a2-M269

The spatial distribution of the main western European Y-chromosomal lineage, haplogroup R1b1a2-M269, shows a significant frequency cline from 7% in Lebanon to 82% in Ireland [24,47], though it is also present in trace amounts in the majority of the North Caucasus populations [30]. Among Armenian samples, the haplogroup is one of the most common lineages and is frequently encountered in the eastern part of the Armenian Highland and Van (see Additional file 5).

In contrast, a decreasing cline of microsatellite variance is detected from the Levant towards northwest and northeast. Furthermore, in comparison with all analyzed populations from the Near East, Europe, and Anatolia, the haplogroup R1b1a2-M269 occurs with the highest genetic variances in the western parts of the Armenian plateau, in Sasun and Salmast (Figure 5).

Figure 5. Geographical distribution maps of haplogroup frequencies and genetic variances (VP): (A) R1b1a2, (B) J2, and (C) G.

A heatmap plot of FST distances within haplogroup R1b1a2 (Figure 6) reveals two large clusters with low genetic distances. The first represents a genetic homogeneity of European populations, while the second encompasses all populations of the Near East. Generally, only the population of Sasun is slightly different within the last group, likely due to the long centuries of its aforementioned isolation by geographic barriers. Moreover, in contrast to other populations of the Near Eastern cluster, the populations of the western part of the Armenian Highland, Van, Turkey, and Lebanon show a moderate level of genetic affinity to the central European populations. Indeed, the actual estimates of the FST values for haplogroup R1b1a2 place the western region of the Armenian Highland in a transitional position between the Near East and Europe (see Additional file 4, sheet 2). Previous data on the limited Y-chromosomal and autosomal sharing among the Armenian and European populations [31,35] should be considered as a consequence of the absence, in their Armenian datasets, of the populations from the western region of the Armenian Highland.

Figure 6. Heatmap of pairwise FST genetic distances, ranging from low (red) to high (blue), calculated for the haplogroup R1b1a2.

To assess the relationship between the haplotypes, we have conducted a median-joining network analysis within the haplogroup R1b1a2-M269 for the populations of Lebanon, the western part of the Armenian Highland, Italy, and Ireland, roughly approximating the path of human Neolithic migrations (see Additional file 6). The haplotypes of western Armenian origin are widely scattered and mainly associated with haplotypes from the Near Eastern (Lebanese) population. In addition, there are four haplotypes shared between Armenians and Europeans (Ireland and Italy), which was not revealed in Herrera et al. [35].

Haplogroup J2-M172

The spatial distribution of haplogroup J2-M172 indicates highest encountered frequencies (>15%) in the areas between the Near East and the Mediterranean littoral [25,48]. Conversely, this lineage is also one of the most common haplogroups in the Caucasus (Figure 5) [30,49]. In particular, the lineage comprises 59% of the Y chromosomes in the Chechen population and occurs with the lowest STR variance (14%), likely representing a strong founder effect signal [30]. Moreover, the distribution pattern of the haplogroup is consistent with a Levantine/Anatolian dispersal route to southeastern Europe and the Caucasus [25]. By this definition, the notion of ‘Anatolia,’ taken from Cinnioğlu et al. [50], actually includes the western and central areas of the Armenian Highland.

The frequency analysis of the haplogroup J2-M172 data within the Armenian populations shows that it is the most commonly encountered clade in the western and central parts of historical Armenia (27.7% and 25.5%, respectively). Further, the western and eastern parts of the Armenian Highland have relatively high values of genetic variances, while the highest level among all populations was detected in Syria, in accordance with the suggested Near Eastern origin of this haplogroup (see Additional file 5) [25].

The heatmap plot of the FST values (see Additional file 7) within this haplogroup separates a distinct cluster of western Asian populations (Armenians, Turks, Lebanese, and Iranians). It also demonstrates a moderate level of genetic similarity between the majority of Armenian geographic groups (except Sasun) and the European populations. Our findings also indicate that western Armenians rather than eastern Armenians have a slightly closer genetic affinity with Greeks and Cretans based on the absolute values of pairwise FST distances. This result contradicts Herrera et al. [35], who demonstrated a segregation of Armenian populations from the European populations mentioned. In addition, eastern Armenians rather than western Armenians display closer genetic proximity to Ossets (relying on the FST values). On the whole, the comparison of FST genetic distances for haplogroup J2 indicates that the western Armenian population occupies an intermediate position between the Near East and Balkans on one hand, and Southern Europe on the other, while eastern Armenia serves as a genetic bridge between the Levant and the North Caucasus (see Additional file 4, sheet 3). Median-joining network analysis within the haplogroup J2 for the populations of Syria, western and eastern parts of the Armenian Highland, Crete, and Chechens also reflects the bidirectional split of the haplogroup J2 from the Near East westward and northward, mainly connecting western Armenia to Europe and eastern Armenia to the North Caucasus (see Additional file 8).

Haplogroup G-M201

The Y-chromosomal haplogroup G-M201 is widely distributed in the populations of the Caucasus, the Near East, and Southern Europe, with the highest frequencies occurring in the North Caucasus (Figure 5) [30,31]. Our observations indicate that in the central part of the Armenian Highland, the haplogroup occurs with a relatively high frequency (16%), being inferior by this rate only to the populations of the North Caucasus. At the same time, the Armenian sample from the central region of the Armenian Highland has a comparable value of haplotype diversity (74.5%) with that of the Near Eastern populations of Syria (88.6%) and Palestine (79.3%) (see Additional file 5). Thus, our results support the recently published data on the origin of this haplogroup in the neighboring areas of eastern Anatolia, Armenia, and Western Iran [51].

The heatmap plot of FST values for the haplogroup G (see Additional file 9) does not identify distinct clusters of western Asian or European populations. Though the comparison of FST values does not conclusively indicate the intermediate position of the central part of the Armenian Highland for the Neolithic migration from the Near East to the North Caucasus, it does not reject this possibility either (see Additional file 4, sheet 4).

The constructed median-joining network within the haplogroup G (Figure 7) reveals the highest level of scattering of central Armenian haplotypes as compared to various neighboring populations (Palestinians, Cherkessians, Iranians), which is expected under the assumption of the local origin of this lineage. Furthermore, the network clearly shows the presence of a founder effect among the Cherkessian population of the North Caucasus who share their ancestral haplogroup with Armenians.

Figure 7. Median-joining network of microsatellite haplotypes within the haplogroup G. Circles represent microsatellite haplotypes, the areas of the circles are proportional to haplotype frequency (smallest circle corresponds to one individual), and population is indicated by color.

Conclusions

Our observation of the Y-chromosomal structure in geographically different Armenian populations suggests that the Armenian Highland served as a transitional corridor for at least two distinct pathways of migration for Neolithic farmers from the Near East westward and northward. The movement to Europe took place predominantly via the western region of the Armenian Highland alongside the coastline of the Mediterranean Sea, which is supported by the spatial distribution pattern of the haplogroup R1b1a2-M269. The migration to the North Caucasus occurred mainly across the central and eastern regions of the Armenian Highland, which is shown by the geographical distribution of haplogroup G-M201. In addition, we identified a distinct Neolithic wave of bidirectional expansion to Europe and the North Caucasus associated with haplogroup J2-M172.

Thus, at the initial stage of the Neolithic migration from the Levant, different directions and waves of population movement could be identified in the Armenian Highland (Figure 8). This inference needs to be tested by further study of other indigenous populations of the region using higher resolution genotyping of Y-chromosome, mitochondrial, and autosomal DNA markers, as well as applying the data recovered from ancient DNA.

Figure 8. Different waves and directions of Neolithic migration from the Fertile Crescent.

Acknowledgements

We thank all the participants who donated their DNA samples and everyone who assisted in the sample collections. This work was supported by the State Committee Science MES RA, in the frame of research project no. SCS 13-1 F0221.

Abbreviations

KYA: Kilo years ago

NRY: Non-recombining portion of the Y chromosome

SNP: Single-nucleotide polymorphism

STR: Short tandem repeat

LGM: Last glacial maximum

PCoA: Principal coordinate analysis

Additional files

Additional file 1: (16.2KB, xlsx)

List of populations analyzed in this study. Samples used as comparative datasets representing the Near East, the North Caucasus, and Europe.

Additional file 2: (44.3KB, xlsx)

Typing results of the Y-chromosomal SNP and STR markers in the Armenian samples studied. List of haplogroup distribution and STRs for the haplogroups R1b1a2, J2, and G.

Additional file 3: (8.8KB, xlsx)

Absolute frequency distribution of haplogroups in the studied populations. Frequencies of the main Y-chromosomal haplogroups in the 18 populations included in the PCoA and correspondence analysis of Figures 3 and 4.

Additional file 4: (19KB, xlsx)

Pairwise FST genetic distances between the populations studied. Pairwise FST genetic distances between the populations studied based on all haplogroup frequencies, and for the haplogroups R1b1a2, J2, and G.

Additional file 5: (13.1KB, xlsx)

Distribution of the haplogroups R1b1a2, J2, and G. The values of frequency and genetic variance within the haplogroups R1b1a2, J2, and G among the considered populations.

Additional file 6: (132.8KB, pptx)

Median-joining network of microsatellite haplotypes within the haplogroup R1b1a2. Circles represent microsatellite haplotypes, the areas of the circles are proportional to haplotype frequency (smallest circle corresponds to one individual), and population is indicated by color.

Additional file 7: (104KB, pptx)

Heatmap of pairwise FST genetic distances between the studied populations calculated for the haplogroup J2. Pairwise FST genetic distances on the heatmap range from low (red) to high (blue).

Additional file 8: (124.8KB, pptx)

Median-joining network of microsatellite haplotypes within the haplogroup J2. Circles represent microsatellite haplotypes, the areas of the circles are proportional to haplotype frequency (smallest circle corresponds to one individual), and population is indicated by color.

Additional file 9: (101.8KB, pptx)

Heatmap of pairwise FST genetic distances between the studied populations calculated for the haplogroup G. Pairwise FST genetic distances on the heatmap range from low (red) to high (blue).

Footnotes

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

AH performed most statistical data analyses and drafted the manuscript. ZK, PH, and LY collected the samples and helped to draft the manuscript. LY conceived of the study and participated in its design and coordination. PZ, MH, and TK typed the Armenian DNA samples. All authors read and approved the final manuscript.

Contributor Information

Anahit Hovhannisyan, Email: hovhannisyananahit19@gmail.com.

Zaruhi Khachatryan, Email: z_khachatryan@mb.sci.am.

Marc Haber, Email: mh25@sanger.ac.uk.

Peter Hrechdakian, Email: peter.hrechdakian@gmail.com.

Tatiana Karafet, Email: tkarafet@email.arizona.edu.

Pierre Zalloua, Email: pierre.zalloua@lau.edu.lb.

Levon Yepiskoposyan, Email: lepiskop@yahoo.com.


Articles from Investigative Genetics are provided here courtesy of BMC
