Skip to main content
Meta Gene logoLink to Meta Gene
. 2014 Jun 21;2:427–438. doi: 10.1016/j.mgene.2014.04.011

Genetic ancestry of a Moroccan population as inferred from autosomal STRs

K Bentayebi a,, F Abada a, H Ihzmad b, S Amzazi a
PMCID: PMC4287812  PMID: 25606427

Abstract

Detecting population substructure and ancestry is a critical issue for both association studies of health behaviors and forensic genetics. Determining aspects of a population's genetic history as potential sources of substructure can aid in design of future genetic studies. Within this context, fifteen autosomal short tandem repeat (STR), were used to examine population genetic structure and hypotheses of the origin of the modern Moroccan population from individuals belonging to three different ethnical groups from Morocco (Arab, Berber and Sahrawi), by comparing their autosomal STR variation with that of neighboring and non-neighboring populations in North Africa, Europe and Middle East as well as proposed ancestral populations in Morocco (Berber). We report on the results that the gradient of North African ancestry accounts for previous observations of low levels of sharing with Near East and a substantially increased gene flow especially from Morocco and Spain.

Keywords: North Africa, Morocco, Population genetics, Iberia, STR, Ancestry, Gene flow

Introduction

The question of Moroccan ancestry has been addressed for ages to answer questions of human evolutionary, historical and medical significance. Human evolutionary or anthropological studies have focused on vast array of DNA-based genetic markers, each of which has different attributes but complementary roles (Garrigan and Hammer, 2006). The nuclear genome contains the higher rate of polymorphisms. These include microsatellites that consist of tandemly repeated DNA sequences with a variable number of repeats from one individual to another. Thousands of microsatellite polymorphisms have now been identified in human. Their mutation rate is much higher than that of single nucleotides, approaching 10− 3 per generation (White et al., 2007, Zhang and Hewitt, 2003). Because of this high mutation rate, microsatellites have the potential to provide information about recent evolutionary events which made them the markers of choice in population genetics and forensic applications (El Amri et al., 2011, Chbel et al., 2003). By comparing the autosomal and X chromosome, it has been clear that the lower mutation rate, smaller population size and higher linkage disequilibrium make the X chromosome less informative than autosomes but record substantially older histories than the Y chromosome (Bentayebi et al., 2012a, Bentayebi et al., 2012b). In the other hand, the mitochondrial genome as well as the Y chromosome offer a very different perspective on human evolution. The absence of recombination in these regions of the genome and their uniparental characteristics (maternal for the mtDNA and paternal for the Y-chromosomal DNA) allows researchers to infer past human behaviors and evolutionary events such as migrations, founder events, population bottlenecks or expansions, relative male and female contributions to an admixed population, marriage practices, and mode of transmission of languages (Underhill and Kivisild, 2007, Xue et al., 2008). Interestingly, a joint analysis has the potential of providing more robust information (Jorde et al., 1998), as these markers give an independent (for autosomal and X chromosome) and dependent (for mitochondrial DNA and Y chromosome) picture of sex-specific demography (Ségurel et al., 2008, Ouborg et al., 2008, White et al., 1998, Schaffner, 2004, Bentayebi et al., 2012a, Bentayebi et al., 2012b). Mitochondrial DNA studies of Moroccan population, at times, provided interesting results that together with autosomal and other genetic markers will help to understand the genetic landscape of the Moroccan population. In molecular evolution, a haplogroup is a group of similar haplotypes that share a common ancestor having the same polymorphism in all haplotypes. Some haplogroups exhibit specific geographic homelands (Aboukhalid et al., 2013). Haplogroup U6 in the mtDNA phylogenetic tree descends from the haplogroup U, who lived around 55,000 years ago. Recent studies based on the distribution of mtDNA of haplogroup U6 raised two theories about the origins of modern human populations in North Africa. The first put forward that groups of the proto-U6 lineage spread from the Near East to North Africa around 40–45 ka (thousands of years ago) (Olivierri et al., 2006), followed by some degree of regional continuity, while the second proposes a westward human migration from the Near East, followed by further demographic expansion at 22 ka centered on the Maghreb and associated with a microlithic bladelet culture known as the Iberomaurusian (Maca-Meyer et al., 2003, Pereira et al., 2003). In the other hand, a recent leading evolutionary theory envisages that the first modern humans in Europe, called the Cro-Magnon (Iberomaurusian), arrived from North-west Africa and are believed to have completely replaced the previous inhabitants “the Neanderthals” (Caramelli et al., 2008). Present day genetic diversity of North African populations has been revised through a wide variety of uniparental and autosomal genetic markers.

Morocco is an appropriate region for population genetic studies and a good example for answering and completing the previous questions and theories. Located in northwest Africa with a total of 13 living languages listed by Ethnologue (Gordon, 2005), Morocco is important for studies concerning human migration, because it is bounded by the Mediterranean Sea to the North with just 14 km away from Spain across the Gibraltar strait and the Atlantic Ocean to the west, which is part of the traditionally favored model of the migratory route out of Africa for anatomically modern humans (Bilgili and Weyel, 2009). Berber-speaking populations, as representative of the autochthonous inhabitants of North Africa, inhabit a wide ranging area extended from the Saharan desert to the Atlas Mountains and nowadays combining six different countries from Egypt to Mauritania, speaking 26 sorts of Berber dialects coming all from a unique and standard Berber language (Gordon, 2005, Hayward, 2000, Louali and Philippson, 2003). In spite of their self-identification by their common language, Berbers have a complex history of invasions, conquests and migrations, especially the Arabic conquest in the seventh century AD. The Arab-Muslim conquerors adopted a language policy that enabled them to spread Arabic and Islamic cultural values (Brunschvig R, 1975). However, today in Morocco, beside Standard Arabic, Berber language remains one of the two national languages of Morocco (Gordon, 2005).

The considerable ethnic and cultural diversity within Morocco reflects differential historic population influences and make the study of existing genetic diversity of the kingdom an attractive effort. Morocco has the most complex linguistic situation compared to the three other North African countries with a remarkably unequaled repartition of population: 90% of residents settle in the north part of Morocco. Although around half of the total population speaks one of the different Berber dialects, it is quite difficult to trace a clear line among Berber, Arabized Berber and Arabic people in Morocco (Abderrahman Z, 2013). All these historical events together with recent demographical movements have left an important imprinting in their current genetic background.

Previous genetic studies in Morocco have mainly focused on classical markers, Alu polymorphisms (Aboukhalid et al., 2013, Bentayebi et al., 2011), mitochondrial DNA (Bentayebi et al., 2012a, Bentayebi et al., 2012b), X chromosome STRs and SNPs (Picornell et al., 2011, Tomas et al., 2008) and Y-chromosome STRs and SNP (Aboukhalid et al., 2009, Aboukhalid et al., 2010) variation in Berber and Arabic speaking samples from small and limited parts of Morocco. The present work completes the previous studies investigating a wide range of samples from different part of Morocco (Arab, Berber and Sahrawi).

Analyses of autosomal STRs have become very useful tools, both in evolutionary studies and forensic casework (Gaibar et al., 2010). In the same way, AmpFlSTR Identifiler PCR amplification kit provides an easily reproducible and highly reliable method for typing 15 highly polymorphic STR loci for forensic and evolutionary purposes (Anslinger et al., 2001, Applied Biosystems, 2001). The aim of this study is to analyze STR genotypes in a Moroccan population that has special interest because of its particular geographical, cultural and historical characteristics (Fernández-Santander et al., 1999).

The present study characterizes autosomal STR genotypes from Morocco to examine population substructure and genetic relationships with other groups, including testing of the proposed genetic affinities between Morocco and populations in Southern Europe, North Africa and Middle East (Aboukhalid et al., 2009, Aboukhalid et al., 2010, Aboukhalid et al., 2013, Bentayebi et al., 2011, Bentayebi et al., 2012a, Bentayebi et al., 2012b, Picornell et al., 2011). We predict that if the actual Moroccan population share a common ancestor (or have experienced more recent migration and resultant gene flow) with either populations in Southern Europe or Middle East (Arab), allele frequencies of autosomal STR loci will be similar among Morocco and these proposed related populations and genetic distances between these populations will be low. Alternatively, if Morocco is an autochthonous North African population (with no recent gene flow from groups in Middle East or Europe), autosomal STR frequencies will be within the range of other populations in North Africa, and Moroccan will be more genetically similar to other North African groups with a Berber ancestry. Previous studies presenting STR data from the Moroccan population either used small samples that were often collected in urban areas of a single province to study relationships between Morocco and other populations (Chbel et al., 2003, El Ossmani et al., 2009, El Ossmani et al., 2010) (Table 1). This study represents one of the most comprehensive samples of Morocco yet analyzed for autosomal STR variation, with 377 male and female throughout the country.

Table 1.

Size and ethnical distribution of the Moroccan population in previous studies based on autosomal STRs.

Size Ethnicity Location
138 Unknown ethnicity Moroccan living in Brussel Abdin et al. (2003)
180 Arab, Berber and Saharawi Various parts of Morocco El Amri et al. (2011)
201 Berber Azrou (Atlas Montains) El Ossmani et al. (2010)
204 Arab Southern Morocco El Ossmani et al. (2009)
234 Arab Casablanca (Central Morocco) Chbel et al. (2003)
387 Arab Rabat, Sale, Zemmour, Zaer (Central Morocco) El Ossmani et al. (2008)

Materiel and methods

DNA samples

Blood samples and buccal swabs were collected from 320 healthy unrelated men and women (men n = 170, women n = 150) belonging to the three ethnic groups (Arab n = 116, Berber n = 104, Sahrawi n = 100) living in Morocco and including 145 samples already typed for 12 X-STRs (Bentayebi et al., 2012a, Bentayebi et al., 2012b), were analyzed. Genomic DNA was extracted using a DNA IQ™ System (Promega, Madison, USA) according to the manufacturer's instructions, and the DNA was quantitated using a NanoDrop 8000 spectrophotometer (Thermo Fisher Scientific, Wilmington, DE, USA), following the manufacturer's recommendation. Samples were obtained, with informed consent. All participants had all four of their grandparents born in the same region. The work was approved by the Ethical Committee of the University of Mohammed V-Agdal. Information about the linguistic origins was also recorded.

Polymerase chain reaction (PCR) amplification and genotyping

PCR amplification was performed according to the manufacturer's instruction (AmpFlSTR Identifiler™, Applied Biosystems, Foster City, CA, USA), to amplify the 15 autosomal short tandem repeats (CSF1PO, D2S1338, D3S1358, D5S818, D7S820, D8S1179, D13S317, D16S539, D18S51, D19S433, D21S11, FGA, TH01, TPOX, vWA) included in the AmpFlSTR Identifiler PCR Amplification Kit™. The samples were amplified in single reactions for all loci using fluorescently labeled primers. Electrophoresis was performed on an ABI Prism 3130 XL Genetic Analyser (Applied Biosystems, Foster City, CA).

Comparative population data

To test hypotheses of population origins, autosomal STR data from our sample were compared to geographically assumed-related populations. In this way, data on 13 STRs (D8S1179, D21S11, D7S820, CSF1PO, D3S1358, TH01, D13S317, VWA, TPOX, D18S51, FGA, D5S818, D16S539) in a total of 21 populations, were selected from the available literature and compared to our sample, including populations from North Africa (Moroccan Berber, Tunisian Berber, Tunisian Arabic and Libyan) (Cherni et al., 2005, El Ossmani et al., 2010, Khodjet-el-Kkil et al., 2012); Southern Europe (Italy, Spain and Greece) (Brisighelli et al., 2009, Camacho et al., 2007, Kovatsi et al., 2006); and Middle East (Palestine, Syria, Lebanon, UAE, Saudi Arabia, Oman, Yemen, Iraq, Kuwait, Egypt and Sudan) (Abdin et al., 2003, Abu Halima et al., 2009, Alenizi et al., 2008, Alshamali et al., 2005, Babiker et al., 2011, Barni et al., 2007, Chouery et al., 2010, Omran et al., 2009) and Western Asia (Pakistan, Iran, Fars) (Alshamali et al., 2005, Hedjazi et al., 2013, Rakha et al., 2009).

Statistical analysis

Arlequin software ver 3.11 (Excoffier et al., 2005) was used to calculate allele frequencies, population pairwise genetic distances (FST), AMOVA test, and also to assess departures from Hardy–Weinberg equilibrium. Pairwise FST values for each STR were calculated between 22 populations, based on 13 of the 15 studied STRs (D3S1358, vWA, FGA, D8S1179, D21S11, D18S51, D13S317, D7S820, TH01, TPOX, CSF1PO, D19S433, D2S1338). They were averaged over loci and represented in a multidimensional scaling (MDS) plot using the statistical package SPSS v.17.0 (Levesque, 2007).

Statistical parameters of forensic interest (PI, paternity index; MP, matching probability; PD, power of discrimination; PIC, polymorphic information content and PE, power of exclusion) were calculated using PowerStats v1.2 (Promega Corporation, Madison, WI) software package (Tereba, 1999). They are defined as follows (Houston, 1998):

Matching probability (MP), also known as probability of match (pM), is the number of individuals that may be surveyed before finding the same DNA pattern in a randomly selected individual. This is represented as:

MP=i=anj1npij2

where i and j represent the frequencies of all possible alleles a through n, Pij represents the frequencies of all possible genotypes.

The paternity index (PI) reflects how many more times likely it is that the person being tested is the biological father, rather than a randomly selected individual. The typical paternity index is assigned to a locus rather than an individual case. Generally, a PI of less than one is indicative of non-relatedness. The PI is represented by the following equation:

PI=1/2H

where H is the frequency of homozygotes.

The power of discrimination is defined as the probability of discrimination between two unrelated individuals. Thus, correlated to MP discussed above:

PD=1MP.

The power of exclusion (PE) is defined as the fraction of individuals having a DNA profile that is different from that of a randomly selected individual in a typical paternity case. The value for each individual case will vary. The average for a given locus is represented by the following equation:

PE=h212hH2

where h is the heterozygosity and H is the frequency of homozygotes.

The polymorphism information content (PIC) is the probability that one could identify which homologue of a given parent was transmitted to a given offspring, the other parent being genotyped as well.

PIC=1i=1npi2i=1nj=i+1n2pi2pj2.

Results

Allele frequencies of 15 autosomal STR loci (D3S1358, vWA, FGA, D8S1179, D21S11, D18S51, D5S818, D13S317, D7S820, TH01, TPOX, CSF1PO, D19S433, D2S1338, D16S539) as well as the values for forensic parameters are shown in Table 2. D21S11 was the most polymorphic autosomal STR, with 18 alleles, while TH01, TPOX, CSF1PO, D5S818, D7S820 and D16S539 were the least polymorphic with 6 alleles each. In particular, the most frequently observed alleles of the TH01, TPOX, CSF1PO, D5S818, D7S820 and D16S539 loci had 23.0%, 41.0%, 34.4%, 39.2%, 34.0% and 30.5% occurrences, respectively. Informativeness can be quantitatively measured by the polymorphism information content or PIC. Theoretically, PIC values can range from 0 to 1. At a PIC of 0, the marker has only one allele. At a PIC of 1, the marker would have an infinite number of alleles. A PIC value of greater than 0.7 is considered to be highly informative. Clearly markers with greater numbers of alleles tend to have higher PIC values and thus are more informative (Hildebrand et al., 1992). Using the equation for PIC, eleven autosomal markers included in the Identifiler are considered to be highly polymorphic: 73% of the offspring should be informative for D3S1358, 79% for vWA, 82.2% for FGA, 81.9% for TH01, 72.5% for D7S820, 80% for D8S1179, 80.1% for D21S11, 85.5% for D18S51, 72.7% for D16S539, 81.8% for D2S1338 and 80.1% for D19S433. While only four markers are considered to be moderately polymorphic: 68% of the offspring should be informative for TPOX, 66% for CSF1PO, 69.9% for D5S818 and 68.9% for D13S317. No significant deviations from the Hardy–Weinberg equilibrium were observed in these 15 markers (Table 2).

Table 2.

Allele frequencies and statistical parameters of AmpFlSTR Identifiler PCR Amplification Kit loci in Moroccan population.

D3S1358 vWA FGA TH01 TPOX CSF1PO D5S818 D13S317 D7S820 D8S1179 D21S11 D18S51 D16S539 D2S1338 D19S433
5 0.002
5.3 0.006
6 0.183 0.020
7 0.002 0.230 0.027 0.017 0.002 0.005
8 0.019 0.195 0.410 0.027 0.050 0.074 0.154 0.001 0.001 0.005
9 0.003 0.252 0.193 0.025 0.035 0.051 0.089 0.001 0.103
9.3 0.003 0.134 0.080
10 0.003 0.271 0.306 0.065 0.047 0.340 0.100 0.001 0.005 0.054 0.001
11 0.009 0.344 0.273 0.325 0.266 0.138 0.022 0.305 0.004 0.008
12 0.001 0.001 0.280 0.392 0.364 0.146 0.125 0.005 0.150 0.300 0.140
13 0.001 0.010 0.184 0.135 0.203 0.001 0.124 0.203 0.004 0.248
13.2 0.003 0.040
14 0.072 0.129 0.234 0.140 0.278
14.2 0.070
15 0.280 0.197 0.197 0.001 0.147 0.003 0.150
16 0.261 0.226 0.160 0.048 0.046
17 0.243 0.232 0.003 0.145 0.282 0.017
18 0.142 0.148 0.003 0.070 0.099 0.002
18.2 0.001
19 0.020 0.044 0.001 0.138
20 0.140 0.003 0.036 0.170 0.002
21 0.159 0.001 0.060
22 0.210
23 0.201 0.090
23.2 0.001
24 0.137 0.090
25 0.100
26 0.012
27 0.026
28 0.123
28.2
29 0.245
29.2
30 0.265
30.2
31 0.086
31.2 0.102
32
32.2 0.119
33
33.2
34
34.2 0.004
35 0.011
MP 0.096 0.043 0.049 0.050 0.134 0.145 0.115 0.110 0.099 0.055 0.055 0.030 0.102 0.053 0.070
PI 2.050 2.070 2.010 2.350 1.570 1.570 1.850 1.850 1.770 2.370 2.550 2.540 2.010 2.80 2.95
Ho 0.769 0.790 0.790 0.788 0.790 0.730 0.740 0.750 0.745 0.808 0.820 0.820 0.770 0.838 0.820
He 0.780 0.817 0.860 0.840 0.794 0.737 0.743 0.771 0.770 0.840 0.848 0.830 0.778 0.857 0.810
PD 0.904 0.957 0.951 0.950 0.866 0.855 0.885 0.890 0.901 0.945 0.945 0.970 0.898 0.947 0.941
PIC 0.730 0.792 0.822 0.819 0.680 0.660 0.699 0.689 0.725 0.800 0.801 0.855 0.727 0.818 0.801
PE 0.543 0.632 0.523 0.675 0.400 0.430 0.470 0.465 0.451 0.580 0.606 0.604 0.524 0.640 0.655

Ho, observed heterozygosity; He, expected heterozygosity; PI, paternity index; MP, matching probability; PD, power of discrimination; PIC, polymorphic information content and PE, power of exclusion.

The observed heterozygosity (Ho) ranges from 0.737 (CSF1PO) to 0.860 (FGA). The power of discrimination varies between 0.855 (CSF1PO) and 0.970 (D18S51), and the combined power of discrimination for the 15 STR loci is 0.99999999. The probability of excluding paternity (PE) varies between 0.400 (TPOX) and 0.675 (TH01), and the combined probability of excluding paternity for the 15 loci is 0.9999. The present work of the Moroccan population was conducted in order to complete the STR profiling of the different ethnic groups of the country. Previous studies were published on the Arab and Berber speaking groups (Chbel et al., 2003, El Ossmani et al., 2010) but this is the first large work that encloses both groups plus a small Sahraoui pool. All statistical values reported in this work showed no significant difference from those reported precedently (Table 2).

A first approach to population diversity by locus through the Fst statistic values range from — 0.56% for vWA between Morocco and Libya to an impressive 11.8% for TPOX between Morocco and Pakistan. A hierarchical AMOVA, assuming four geographical groups (‘Maghreb’, “South Europe”, “Middle East” and ‘Western Asia’), does not reveal any significant distances between the Mediterranean populations. The frequency variance between the four groups (Fct = 0.3%, P = 0.154) being clearly lower than the diversity among populations within groups (Fsc = 0.44%, P = 0.068) (Table 3).

Table 3.

Locus-by-locus analysis of molecular variance of 13 autosomal short tandem repeat loci.


Among groups
Among populations
Within populations
Locus Percent variance FCT P Percent variance FSC P Percent variance FST P
1 0.29 0.0029 0.0010 0.0000 0.0001 0.4448 99.71 0.0029 0.0176
2 0.27 0.0027 0.00000 0.08 0.00082 0.19844 99.65 0.00348 0.00000
3 0.05 0.0005 0.52688 1.57 0.01574 0.00000 98.38 0.01619 0.00000
4 0.71 0.0071 0.04301 1.02 0.01032 0.00000 98.27 0.01732 0.00000
5 1.03 0.0103 0.04692 1.38 0.01397 0.00000 97.59 0.02415 0.00000
6 0.17 0.0017 0.03617 99.54 0.00168 0.01564 0.17 0.00457 0.00000
7 0.07 0.0007 0.20821 0.35 0.00346 0.00098 99.58 0.00415 0.00000
8 0.13 0.0013 0.08309 0.22 0.00216 0.02444 99.65 0.00348 0.00000
9 0.31 0.0031 0.00098 0.17 0.00175 0.04790 99.51 0.00485 0.00000
10 0.53 0.0053 0.00098 0.16 0.00165 0.02053 99.31 0.00691 0.00000
11 0.03 0.0003 0.45846 0.14 0.00144 0.03617 99.82 0.00177 0.01564
12 − 0.02 − 0.0002 0.59922 0.30 0.00298 0.00000 99.73 0.00274 0.00000
13 0.37 0.0037 0.00098 0.11 0.00111 0.09580 99.52 0.00482 0.00000
Global estimates 0.0030 0.1540 0.0044 0.0680 0.0075 0.0030
Covariance estimates Va = 0.0012 Vb = 0.0018 Vc = 0.3944

*Abbreviations: FCT — fixation index among groups; FSC — fixation index among populations within groups; and FST — fixation index within populations.

The allele frequencies of the 15 autosomal STR loci were compared with the exact test of population differentiation. We found no significant differences between Moroccan and Spanish populations in 10 loci (P < 0.05); P-values of population comparison were 0.09910 (D3S1358), 0.15315 (vWA), 0.62162 (FGA), 0.09910 (TH01), 0.00000 (TPOX), 0.37838 (CSF1PO), 0.38739 (D5S818), 0.00901 (D13S317), 0.00000 (D7S820), 0.29730 (D8S1179), 0.90090 (D21S11), 0.88288 (D18S51) and 0.62162 (D16S539). While the Iraqi population showed no significant differences with both Moroccan and Spanish populations in 9 loci. However, the Pakistani population showed significant differences from the Moroccan population (P < 0.05) in most loci. The MDS analysis showed that the Moroccan population form a separated plot with the Iberian Peninsula, while populations from Near East and south east Europe were clustered together. Tunisian Arabic remains isolated at the opposite side of the MDS (Fig. 1).

Fig. 1.

Fig. 1

Multidimensional scaling analysis (MDS) calculated between 22 population for 13 autosomal STRs.

Stress: 0.14391 RSQ = 0.93846.

Discussion

Our study of the autosomal STR variation supports the hypotheses of a recent common ancestor between North Africa and Southern Europe (Morjani et al., 2011). The deviation of Arabic Tunisian was of particular concern.

This paper describes the pattern of the frequency distribution of 15 polymorphic STR of the autosomal chromosomes in the Moroccan population. In general, the allele frequencies found range within the general patterns described previously, but with a remarkable between population variations. In terms of variation, the samples were adjusted to a decreasing pattern of diversity from East to West (mean heterozygosity for Kuwait: 0.875, Syria: 0.822, Italy: 0.796 and Morocco: 0.785).

The hierarchical AMOVA analysis showed that only a small and non-significant part of the genetic variance could be attributed to the variation between East and West groups (FCT = 0.30%), indicating no particular genetic differentiation between both sides and arguing for a shared middle eastern ancestry. Botigué et al. have previously studied population relationships in the western and eastern Mediterranean basin using genome-wide SNP data from more than 2000 individuals. This survey indicated that Southwestern European population average between 4% and 20% of their genomes assigned to a North African ancestral whereas this value does not exceed 2% in southeastern European populations. They also found that migration event from North Africa to Europe would have occurred at least 6–10 generations ago (∼ 240–300 ya) in Spain, and at least 5–7 generations ago in France and Italy. In the other hand, SNP data showed that the haplotype sharing between Europe and the Near East follows a southeast to southwest gradient, whereas sharing between Europe and the Maghreb follows the opposite pattern (Botigué et al., 2013, Henn et al., 2012); this suggests that gene flow from the Near East cannot account for the sharing with North Africa. It is possible that these patterns reflect more ancient migrations, perhaps dating back to the Neolithic, which resulted in a low level of short Near Eastern haplotypes across much of Europe. A model of gene flow from the Near East into both Europe and North Africa, such as a strong demic wave during the Neolithic, could result in shared haplotypes between Europe and North Africa. In other ways, studies focusing on mitochondrial DNA concluded that groups of the proto U6 lineage spread from the Near East to North Africa around 40–45 ka, followed by some degree of regional continuity. In Europe, U6 lineage was only found in the Iberian Peninsula, pointing likely to a northward expansion from Africa (Maca-Meyer et al., 2003, González et al., 2003). This expansion could be attributed to either the Arab–Berber occupation that lasted seven centuries or to prehistoric immigrations of North Africans to Iberia (Ibn Abd-el-Hakem, 1858) probably during the Capsian diffusion in North Africa and suggesting important demographic movements in the western area of this continent. For instance and in line with our results, it was found using classical genetic markers, that the Near East showed smaller genetic distances with East Africa than with West Africa (especially Arab groups) and Iberian Peninsula (Bentayebi et al., 2011, Botigué et al., 2013, Henn et al., 2012). Moreover, Cultural and religious changes within the last two millennia appear to have facilitated and maintained admixture between culturally similar populations from the Near East and North Africa. The deviation of Arabic Tunisian was of particular concern. This could be explained by the history of the Tunisian population, reflecting the influence of the ancient Phoenician settlers of Carthage followed, among others, by Roman, Byzantine, Arab and French occupations, according to historical records. Notwithstanding, other explanations cannot be discarded, such as the relative heterogeneity within current Tunisian populations, and/or the limited sub-Saharan genetic influence in this region as compared with other North African areas, without excluding the possibility of the genetic drift and the small size of the sampled population.

The total number of markers used in this study is quite small in comparison to many other available studies, but due to higher mutation rates and number of alleles per locus STRs provide much more information, on average, than SNPs for population assignment and population stratification (Listman et al., 2010). The fifteen used autosomal markers reveal a high population differentiation; this marker set may be more useful for detecting recent admixture or founding events, such as those which formed the North African populations.

To sum up, our data on autosomal markers support, in general, the decreasing ancestry of the East–west populations described by other investigators (Henn et al., 2012, Maca-Meyer et al., 2003), providing, at the same time, detailed data of the frequency distribution of autosomal elements. These markers seem to perform well in fine-scale population differentiation studies. Also according to the statistical parameters the combined analysis of these 15 STR loci is a powerful tool for forensic identification and paternity testing in the Moroccan population.

References

  1. Abdin L., Shimada I., Brinkmann B., Hohoff C. Analysis of 15 short tandem repeats reveals significant differences between the Arabian populations from Morocco and Syria. Legal Med. 2003;5:S150–S155. doi: 10.1016/s1344-6223(02)00129-3. (Suppl.) [DOI] [PubMed] [Google Scholar]
  2. Aboukhalid R., Bouabdellah M., Abbassi M., Bentayebi K., Elmzibri M., Squalli D., Amzazi S. Haplotype frequencies for 17 Y-STR loci (AmpFISTR Y-filer) in a Moroccan population sample. Forensic Sci. Int. Genet. 2009;4:e73–e74. doi: 10.1016/j.fsigen.2009.06.004. [DOI] [PubMed] [Google Scholar]
  3. Aboukhalid R., Bouabdellah M., Lai R., Bradford N., El Ossmani H., Talbi J., Bentayebi K., Squalli D., Podini D., Amzazi S. Y chromosomal SNP analysis using the minisequencing strategy in a Moroccan population samples. J. Forensic Res. 2010;1(2):1000113. [Google Scholar]
  4. Aboukhalid R., Sturk-Andreaggi K., Bouabdellah M., Squalli D., Irwin J.A., Amzazi S. Mitochondrial DNA control region variation from samples of the Moroccan population. Int. J. Legal Med. 2013;127(4):757–759. doi: 10.1007/s00414-012-0813-4. (July) [DOI] [PubMed] [Google Scholar]
  5. Abu Halima M.S., Bernal L.P., Sharif F.A. Genetic variation of 15 autosomal short tandem repeat (STR) loci in the Palestinian population of Gaza Strip. Legal Med. 2009;11(4):203–204. doi: 10.1016/j.legalmed.2009.02.072. (July) [DOI] [PubMed] [Google Scholar]
  6. Alenizi M., Goodwin W., Ismael S., Hadi S. STR data for the AmpFlSTR Identifiler loci in Kuwaiti population. Legal Med. 2008;10(6):321–325. doi: 10.1016/j.legalmed.2008.05.003. (November) [DOI] [PubMed] [Google Scholar]
  7. Alshamali F., Alkhayat A.Q., Budowle B., Watson N.D. STR population diversity in nine ethnic populations living in Dubai. Forensic Sci. Int. 2005;152(2–3):267–279. doi: 10.1016/j.forsciint.2004.09.133. (September 10) [DOI] [PubMed] [Google Scholar]
  8. Anslinger K., Rolf B., Keil W. Evaluation and application of the AmpF/STR profiler plus PCR amplification kit in a Bavarian population sample. Int. J. Legal Med. 2001;114(4–5):278–280. doi: 10.1007/s004149900116. [DOI] [PubMed] [Google Scholar]
  9. Applied Biosystems . Applied Biosystems; USA: 2001. AmpFlSTR IdentifilerTM User's Manual. [Google Scholar]
  10. Babiker H.M., Schlebusch C.M., Hassan H.Y., Jakobsson M. Genetic variation and population structure of Sudanese populations as indicated by 15 Identifiler sequence-tagged repeat (STR) loci. Investig. Genet. 2011;2(1):12. doi: 10.1186/2041-2223-2-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Barni F., Berti A., Pianese A., Boccellino A., Miller M.P., Caperna A., Lago G. Allele frequencies of 15 autosomal STR loci in the Iraq population with comparisons to other populations from the middle-eastern region. Forensic Sci. Int. 2007;167(1):87–92. doi: 10.1016/j.forsciint.2006.03.005. (March 22) [DOI] [PubMed] [Google Scholar]
  12. Bentayebi K., Ramon M.M., Castro J.A., Barbaro A., Aboukhalid R., Amzazi S., Picornell A. Inferring ethnicity from the X-chromosome ALU insertions: data from Western Mediterranean human groups. Forensic Sci. Int. Genet. Suppl. Ser. 2011:e27–e28. [Google Scholar]
  13. Bentayebi K., Picornell A., Bouabdeallah M., Castro J.A., Aboukhalid R., Squalli D., Misericordia M.M., Amzazi S. Genetic diversity of 12 X-chromosomal short tandem repeats in the Moroccan population. Forensic Sci. Int. Genet. 2012;6:e48–e49. doi: 10.1016/j.fsigen.2011.03.008. [DOI] [PubMed] [Google Scholar]
  14. Bentayebi K., Picornell A., Amzazi S. Novapublishers; USA: 2012. Genetic Diversity: New research: Genetic History of Morocco; pp. 109–132. (Chapter 5) [Google Scholar]
  15. Bilgili O., Weyel S. Paper Series: Migration and Development Country Profiles. Maastricht Graduate School of Governance; 2009. Migration in Morocco: history, current trends and future prospects; p. 6. [Google Scholar]
  16. Botigué L.R., Henn B.M., Gravel S., Maples B.K., Gignoux C.R., Corona E., Atzmon G., Burns E., Ostrer H., Flores C., Bertranpetit J., Comas D., Bustamante C.D. Gene flow from North Africa contributes to differential human genetic diversity in southern Europe. Proc. Natl. Acad. Sci. 2013;110(29):11791–11796. doi: 10.1073/pnas.1306223110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Brisighelli F., Capelli C., Boschi I., Garagnani P., Lareu M.P., Pascali V.L., Carracedo A. Allele frequencies of fifteen STRs in a representative sample of the Italian population. Forensic Sci. Int. Genet. 2009;3(2):e29–e30. doi: 10.1016/j.fsigen.2008.05.002. (March) [DOI] [PubMed] [Google Scholar]
  18. Brunschvig R. Vol. 40. Túnez: J.Aloccio; Al-Andalus: 1975. Ibn Abd al-Hakam et la conquète de l'Afrique du Nord par les arabes; pp. 129–179. [Google Scholar]
  19. Camacho M.V., Benito C., Figueiras A.M. Allelic frequencies of the 15 STR loci included in the AmpFlSTR Identifiler PCR Amplification Kit in an autochthonous sample from Spain. Forensic Sci. Int. 2007;173(2–3):241–245. doi: 10.1016/j.forsciint.2007.02.006. (December 20) [DOI] [PubMed] [Google Scholar]
  20. Caramelli D., Milani L., Vai S., Modi A., Pecchioli E., Girardi M., Pilli E., Lari M., Lippi B., Ronchitelli A., Mallegni F., Casoli A., Bertorelle G., Barbujani G. A 28,000 years old Cro-Magnon mtDNA sequence differs from all potentially contaminating modern sequences. PLoS ONE. 2008;3(7):e27000. doi: 10.1371/journal.pone.0002700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Carl E. Hildebrand, Torney D.C., Wagner R.P. Vol. 20. Los Alamos Science; Los Alamos: 1992. Informativeness of Polymorphic DNA Markers; pp. 100–102. [Google Scholar]
  22. Chbel F., Nadifi S., Martinez-Bouzas C., Louahlia S., Azeddoug H., Martinez De Pancorbo M. Population genetic data of eight tetrameric short tandem repeats (STRs) in Casablanca resident population to use in forensic casework. Forensic Sci. Int. 2003;132(1):82–83. doi: 10.1016/s0379-0738(02)00433-4. (March 12) [DOI] [PubMed] [Google Scholar]
  23. Cherni L., Yaâcoubi B.L., Pereira L., Alves C., Khodjet-El-Khil H., El Gaaied A A.B., Amorim A. Data for 15 autosomal STR markers (Powerplex 16 System) from two Tunisian populations: Kesra (Berber) and Zriba (Arab) Forensic Sci. Int. 2005;147(1):101–106. doi: 10.1016/j.forsciint.2004.04.009. (January) [DOI] [PubMed] [Google Scholar]
  24. Chouery E., Coble M.D., Strouss K.M., Saunier J.L., Jalkh N., Medlej-Hashim M., Ayoub F., Mégarbané A. Population genetic data for 17 STR markers from Lebanon. Legal Med. 2010;12(6):32432–32436. doi: 10.1016/j.legalmed.2010.08.005. (November) [DOI] [PubMed] [Google Scholar]
  25. El Amri H., Squalli D., Gazzaz B., El Harrak H.H., El Ossmani H. Genetic structure of Moroccan population using 15 STRs of NGM kit. Forensic Sci. Int. Genet. Suppl. Ser. 2011;3(1):e534–e535. [Google Scholar]
  26. El Ossmani H., Talbi J., Bouchrif B., Chafik A. Allele frequencies of 15 autosomal STR loci in the southern Morocco population with phylogenetic structure among worldwide populations. Legal Med. 2009;11(3):155–158. doi: 10.1016/j.legalmed.2009.01.053. (May) [DOI] [PubMed] [Google Scholar]
  27. El Ossmani H., Talbi J., Bouchrif B., Chafik A. Exploitation de 15 STR autosomaux pour l'étude phylogénétique de la population Arabophone de Rabat-Salé-Zemmour-Zaer (Maroc) Antropo. 2008;17:15–23. [Google Scholar]
  28. El Ossmani H., Bouchrif B., Aboukhalid R., Bouabdillah M., Gazzaz B., Zaoui D., Chafik A., Talbi J. Assessment of phylogenetic structure of Berber-speaking population of Azrou using 15 STRs of Identifiler kit. Legal Med. 2010;12(1):52–56. doi: 10.1016/j.legalmed.2009.10.004. (January) [DOI] [PubMed] [Google Scholar]
  29. Excoffier L., Laval G., Schneider S. Arlequin ver.3.0: an integrated software package for population genetics data analysis. Evol. Bioinformatics Online. 2005;1:47–50. [PMC free article] [PubMed] [Google Scholar]
  30. Fernández-Santander A., Kandil M., Luna F., Esteban E., Giménez F., Zaoui D., Moral P. Genetic relationships between southeastern Spain and Morocco: new data on ABO, RH, MNSs, and DUFFY polymorphisms. Am. J. Hum. Biol. 1999;11(6):745–752. doi: 10.1002/(SICI)1520-6300(199911/12)11:6<745::AID-AJHB4>3.0.CO;2-W. (November) [DOI] [PubMed] [Google Scholar]
  31. Gaibar M., Esteban E., Moral P., Gómez-Gallego F., Santiago C., Bandrés F., Luna F., Fernández-Santander A. STR genetic diversity in a Mediterranean population from the south of the Iberian Peninsula. Ann. Hum. Biol. 2010;37(2):253–266. doi: 10.3109/03014460903341851. (April) [DOI] [PubMed] [Google Scholar]
  32. Garrigan D., Hammer M.F. Reconstructing human origins in the genomic era. Nat. Rev. Genet. 2006;7:669–680. doi: 10.1038/nrg1941. [DOI] [PubMed] [Google Scholar]
  33. González A.M., Brehm A., Pérez J.A., Maca-Meyer N., Flores C., Cabrera V.M. Mitochondrial DNA affinities at the Atlantic fringe of Europe. Am. J. Phys. Anthropol. 2003;120:391–404. doi: 10.1002/ajpa.10168. [DOI] [PubMed] [Google Scholar]
  34. Gordon R.G. Fifteenth edition. SIL International; Dallas, Texas: 2005. Ethnologue: Languages of the World. [Google Scholar]
  35. Hayward R.J. African Languages: An Introduction. In: Heine Bernd, Nurse Derek., editors. Afroasiatic. Cambridge University Press; Cambridge: 2000. pp. 74–98. [Google Scholar]
  36. Hedjazi A., Nikbakht A., Hosseini M., Hoseinzadeh A., Hosseini S.M. Allele frequencies for 15 autosomal STR loci in Fars province population, southwest of Iran. Legal Med. 2013;15(4):226–228. doi: 10.1016/j.legalmed.2013.01.005. [DOI] [PubMed] [Google Scholar]
  37. Henn B.M., Botigué L.R., Gravel S., Wang W., Brisbin A. Genomic ancestry of North Africans supports back-to-Africa migration. PLoS Genet. 2012;8(1):e1002397. doi: 10.1371/journal.pgen.1002397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Houston K.A. 1998. Statistical Analysis of STR Data; pp. 14–15. (Profiles in DNA, Gene PrintTM). [Google Scholar]
  39. Ibn Abd-el-Hakem . 1858. The Islamic Conquest of Spain. (Medieval Sourcebook: Ibn Abd-el-Hakem (870 CE)). [Google Scholar]
  40. Jorde L.B., Bamshad M., Rogers A.R. Using mitochondrial and nuclear DNA markers to reconstruct human evolution. Bioessays. 1998;20(2):126–136. doi: 10.1002/(SICI)1521-1878(199802)20:2<126::AID-BIES5>3.0.CO;2-R. [DOI] [PubMed] [Google Scholar]
  41. Khodjet-el-Kkil H., Fadhlaoui-Zid K., Gusmão L., Alves C., Benammar-Elgaaied A., Amorim A. Allele frequencies for 15 autosomal STR markers in the Libyan population. Ann. Hum. Biol. 2012;39(1):80–83. doi: 10.3109/03014460.2011.630678. (January) [DOI] [PubMed] [Google Scholar]
  42. Kovatsi L., Parsons T.J., Just R.S., Irwin J.A. Genetic variation for 15 autosomal STR loci (PowerPlex 16) in a population sample from northern Greece. Forensic Sci. Int. 2006;159(1):61–63. doi: 10.1016/j.forsciint.2005.05.019. (May) [DOI] [PubMed] [Google Scholar]
  43. Levesque R. Fourth edition. SPSS Inc.; Chicago Ill: 2007. SPSS Programming and Data Management: A Guide for SPSS and SAS Users. [Google Scholar]
  44. Listman J.B., Hasin D., Kranzler H.R., Malison R.T., Mutirangura A., Sughondhabirom A., Aharonovich E., Spivak B., Gelernter J. Identification of population substructure among Jews using STR markers and dependence on reference populations included. BMC Genet. 2010;11(48) doi: 10.1186/1471-2156-11-48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Louali N., Philippson G. GALF (Groupement des Anthropologues de Langue Française); Marrakech: 2003. Protoméditerranéens Capsiens sont-ils des protoberbères? Interrogations de linguiste; pp. 22–25. (September) [Google Scholar]
  46. Maca-Meyer N., González A.M., Pestano J., Flores C., Larruga J.M., Cabrera V.M. Mitochondrial DNA transit between West Asia and North Africa inferred from U6 phylogeography. BMC Genet. 2003;4(15) doi: 10.1186/1471-2156-4-15. (October 16) [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Morjani P., Patterson N., Hirschhorn J.N., Keinan A., Hao L., Atzmon G., Burns E., Ostrer H., Price A.L., Reich D. The history of African gene flow into Southern Europeans, Levantines, and Jews. PLoS Genet. 2011;7(4):e1001373. doi: 10.1371/journal.pgen.1001373. (April) [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Olivierri A., Achilli A., Pala M., Battaglia V., Fornarino S., Al-Zahery N., Scozzari R., Cruciani F., Behar D.M., Dugoujon J.M., Coudray C., Santachiara A.S., Semino O., Bandelt H.J., Torroni A. The mtDNA legacy of the Levantine early Upper Paleolithic in Africa. Science. 2006;314:1767–1770. doi: 10.1126/science.1135566. [DOI] [PubMed] [Google Scholar]
  49. Omran G.A., Rutty G.N., Jobling M.A. Genetic variation of 15 autosomal STR loci in Upper (Southern) Egyptians. Forensic Sci. Int. Genet. 2009;3(2):e39–e44. doi: 10.1016/j.fsigen.2008.05.007. (March) [DOI] [PubMed] [Google Scholar]
  50. Ouborg N.J., Piquot Y., Van Groenendael J.M. Population genetics, molecular markers and the study of dispersal in plants. J. Ecol. 2008;87(4):551–568. (September 18) [Google Scholar]
  51. Pereira L., Silva N.M., Franco-Duarte R., Fernandes V., Preira J.B., Costa M.B., Martins H., Soares P., Behar D.M., Richards M.B., Macauly V. New chronometric data from Ifri n Ammar (Morocco) and the chronostratigraphy of the Middle Paleolithic in the Western Maghreb. J. Hum. Evol. 2003;59:672–679. doi: 10.1016/j.jhevol.2010.07.024. [DOI] [PubMed] [Google Scholar]
  52. Picornell A., Bentayebi K., Bouabdeallah M., Castro J.A., Bouabdellah M., Squalli D., Misericordia M., Amzazi S. X-chromosomal haplotype frequencies of four linkage groups in a North African population. Forensic Sci. Int. Genet. Suppl. Ser. 2011:e19–e20. [Google Scholar]
  53. Rakha A., Yu B., Hadi S., Sheng-Bin L. Population genetic data on 15 autosomal STRs in a Pakistani population sample. Legal Med. 2009;11(6):305–307. doi: 10.1016/j.legalmed.2009.08.001. (November) [DOI] [PubMed] [Google Scholar]
  54. Schaffner S.F. The X chromosome in population genetics. Nat. Rev. Genet. 2004;5:43–51. doi: 10.1038/nrg1247. (January) [DOI] [PubMed] [Google Scholar]
  55. Ségurel L., Martínez-Cruz B., Quintana-Murci L., Balaresque P., Georges M., Hegay T., Aldashev A., Nasyrova F., Jobling M.A., Heyer E., Vitalis E.R. Sex-specific genetic structure and social organization in Central Asia: insights from a multi-locus study. PLoS Genet. 2008;4(9):e1000200. doi: 10.1371/journal.pgen.1000200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Tereba A. Vol. 2. 1999. Tools for analysis of population statistics; pp. 14–16. (Profiles DNA). [Google Scholar]
  57. Tomas C., Sanchez J.J., Barbaro A., Brandt-Casadevall C., Hernandez A., Ben Dhiab M., Ramon M., Morling N. X-chromosome SNP analyses in 11 human Mediterranean populations show a high overall genetic homogeneity except in North-west Africans (Moroccans) BMC Evol. Biol. 2008;8(75) doi: 10.1186/1471-2148-8-75. (pp. 8:75 (February 29)) [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Underhill P.A., Kivisild T. Use of y chromosome and mitochondrial DNA population structure in tracing human migrations. Annu. Rev. Genet. 2007;41:539–564. doi: 10.1146/annurev.genet.41.110306.130407. [DOI] [PubMed] [Google Scholar]
  59. White T.L., Adams W.T., Neale D.B. 2007. Genetic markers — morphological, biochemical and molecular markers; pp. 53–74. (Book: Forest Genetics). (Chapter 4) [Google Scholar]
  60. Xue F., Wang Y., Xu S., Zhang F., Wen B., Wu X., Lu M., Deka R., Qian J., Jin L. A spatial analysis of genetic structure of human populations in China reveals distinct difference between maternal and paternal lineages. Eur. J. Hum. Genet. 2008;16:705–717. doi: 10.1038/sj.ejhg.5201998. [DOI] [PubMed] [Google Scholar]
  61. Zhang D.X., Hewitt G.F. Nuclear DNA analyses in genetic studies of populations: practice, problems and prospects. Mol. Ecol. 2003;12:563–584. doi: 10.1046/j.1365-294x.2003.01773.x. [DOI] [PubMed] [Google Scholar]

Articles from Meta Gene are provided here courtesy of Elsevier

RESOURCES