Abstract
Crop-to-wild gene flow is a mechanism process widely documented, both in plants and animals. This can have positive or negative impacts on the evolution of admixed populations in natural environments, yet the phenomenon is still misunderstood in long-lived woody species, contrary to short-lived crops. Wild olive Olea europaea L. occurs in the same eco-geographical range as domesticated olive, i.e. the Mediterranean Basin (MB). Moreover, it is an allogamous and anemophilous species whose seeds are disseminated by birds, i.e. factors that drive gene flow between crops and their wild relatives. Here we investigated the genetic structure of western MB wild olive populations in natural environments assuming a homogenous gene pool with limited impact of cultivated alleles, as previously suggested. We used a target sequencing method based on annotated genes from the Farga reference genome to analyze 27 western MB olive tree populations sampled in natural environments in France, Spain and Morocco. We also target sequenced cultivated olive tree accessions from the Worldwide Olive Germplasm Bank of Marrakech and Porquerolles and from an eastern MB wild olive tree population. We combined PCA, sNMF, pairwise FST and TreeMix and clearly identified genuine wild olive trees throughout their natural distribution range along a north-south gradient including, for the first time, in southern France. However, contrary to our assumption, we highlighted more admixed than genuine wild olive trees. Our results raise questions regarding the admixed population evolution pattern in this environment, which might be facilitated by crop-to-wild gene flow.
1 | Introduction
Gene flows between domesticated species and their wild relatives have been identified in several studies in animals [1–3] and plant species [4, 5]. This phenomenon is noted when cultivated genomic variants occur in unmanaged naturally occurring populations in natural environments. These admixed populations raise the question of the impact of gene flow on the evolution of natural populations. For instance, the introgression of new genetic diversity inside wild genomes can accelerate their evolution by increasing the frequency of favorable alleles, but can also be detrimental resulting in outbreeding depression, i.e. a loss of fitness in hybrids compared to their parents [6–8].
Cultivated to wild introgressions variants may be found in several plant species. Crop-to-wild gene flow occurs in maize and teosinte, its closest relatives, for instance, which has been found to lead to the acquisition of herbicide resistance in teosinte and, consequently, to high frequency of teosinte forms in maize fields [4]. In some perennial species such as apple, major introgressions with the spread of alleles from the cultivated gene pool to wild populations in Europe have been documented [9]. The resulting admixed populations showed higher fitness than wild apple trees [9, 10]. Gene flow from domesticated relatives has also been reported in natural chestnut and poplar populations. Admixed poplar populations have been found in France [11] and along the Danube River in natural environment [12], while the same scenario has been observed in chestnuts in Japan [13]. All these studies have proposed conservation measures for in-situ and ex-situ preservation of genuine wild populations by limiting gene flow, by replanting genuine wild genotypes far from domesticated forms or by protecting the connections of wild metapopulations which can breed and thereby protect themselves from random genetic deterioration [10, 12]. The evolutionary consequences of crop-to-wild gene flows in the natural environment and on wild populations are still misunderstood, especially in perennial species.
Olive tree, (Olea europaea L.) is an iconic perennial species from the Mediterranean Basin (MB) which can live thousands of years. Cultivated (Olea europaea var. europaea) and wild (Olea europaea var. sylvestris) forms coexist within the same Mediterranean distribution range [14, 15]. Wild olive trees have an ancient evolutionary history in the MB [16] indicated that three plastid lineages with a probable common ancestor dating from the Middle to Upper Pleistocene had diversified long before the Last Glaciation Maximum (26,500 to 19,000 BP [17]). They were subsequently impacted by glaciation, while some wild populations persisted in refugia [16, 18]. Olive lineages have been isolated in two distant areas, which could explain the current population genetic structure profile of wild olive trees. According to previous genetic studies, two main gene pools are identified, one in the eastern and another in the western/central MB [16, 19–21]. This eastern/western genetic differentiation is also found in other plants in the MB [22, 23]. Cultivated olive trees emerged with the domestication of olive trees around 6,000 years BP [14, 24, 25]. It is generally considered that the center of primary olive domestication, from wild progenitors, is located in the Middle East, near the border between Turkey and Syria [14, 16, 24].
It is currently impossible to distinguish between genuine wild, admixed and cultivated olive trees in the natural environment because of the absence of easily measurable discriminating morphological traits in the field. The use of geometric component of shape stone allows to distinguish the wild (round-shaped stone) from cultivated morphotype [26, 27]. However, numerous intermediate morphotypes have been found, ranging from elliptical to more tapered, reflecting the complex history and evolution processes related to human-associated migration [19, 21, 26, 28]. Genuine wild and admixed olive trees can only distinguish using genetic markers [19, 21, 28–31]. The genetic diversity of cultivated olive trees is close to eastern MB wild olive trees [20, 30, 31], hence making it difficult to study gene flow between eastern MB wild and cultivated accessions. Conversely, in western MB, the genetic diversity of wild olive trees growing in natural areas is clearly different from cultivated accessions [16, 21, 28], thereby enabling the identification of genuine wild olive trees as previously reported by [31] using allozyme markers. The genetic pattern observed in naturally occurring populations was little impacted by crop-to-wild gene flow [21, 28, 30]. The well-known genetic differentiation makes it a relevant model for investigating the genetic structure of populations in their natural environments and to infer potential gene flow between cultivated and wild olive.
Here, we investigated the genetic diversity of naturally occurring olive tree populations in the western MB. We assumed that the genetic pool of wild olive tree in the western Mediterranean area has not been impacted by introgressions from domesticated forms. We addressed the following questions: (1) What is the genetic structure of spontaneous olive trees in the western MB? (2) Are there genuine wild olive populations in this range? (3) Is there crop-to-wild gene flow in this region? We analyze genome-wide SNPs in olive trees from 27 natural sites ranging from southern France, Spain and Morocco. We included DNA from wild trees previously sampled in southern Turkey for the purpose of comparing diversity in these populations with the genetic pattern in the eastern MB wild gene pool [30]. We sought to identify crop-to-wild gene flow and patterns of admixtures using data of cultivated accessions from western MB from the Worldwide Olive Germplasm Bank of Marrakech (WOGBM) and Porquerolles obtained with the same sequencing strategy [32, 33].
2 | Methods
2.1 | Sampling of wild olive trees on a north-south gradient in the western Mediterranean Basin
Sampling of 27 assumed wild olive tree sites was conducted along a north-south gradient from southern France to southern Morocco in 2021 and early 2022. Sites were selected via the Conservatoire Botanique National Simethis database (http://simethis.eu) for southern France and northern Spain whereas for Corsica, central and southern Spain and Morocco, the delineation of wild populations was based on plastid polymorphism as reported by [21]. In addition to information from the Simethis database, we used environmental criteria to limit sampling of admixed olive populations and disregard olive orchards, agricultural and urban environments [34]. This resulted in the selection of 27 sites (Table 1; Fig 1A). At each of them, 13 to 15 individuals were sampled, representing a total of 400 sampled wild olive trees. At each site and for each tree, leaves were collected and immediately dried in silica gel for subsequent DNA extractions.
Table 1. Summary of the localisation of the natural olive populations sampled for the study.
| Site | n | Country | Latitude | Longitude | Localisation | Source | |
|---|---|---|---|---|---|---|---|
| F01 | 15 | Continental France | 42.9368 | 3.0126 | Leucate | This study | |
| F02 | 15 | Continental France | 43.5407 | 3.3033 | Lac des olivettes—Valhan | This study | |
| F03 | 15 | Continental France | 43.7706 | 3.7919 | Cazevieille | This study | |
| F04 | 15 | Continental France | 43.8774 | 4.7328 | Avignon | This study | |
| F06 | 15 | Continental France | 43.3826 | 6.3611 | Plain des Maures—Gonfaron | This study | |
| F07 | 15 | Continental France | 43.6894 | 7.3029 | MontBoron—Nice | This study | |
| F08 | 15 | Corsica France | 42.6538 | 9.0638 | Ile Rousse | This study | |
| F09 | 15 | Corsica France | 42.4037 | 8.6989 | Manso—Calvi | This study | |
| F10 | 15 | Corsica France | 41.7503 | 8.8688 | Filitosa—Propriano | This study | |
| F11 | 15 | Corsica France | 41.3723 | 9.202 | Bonifacio | This study | |
| S12 | 15 | Spain | 42.2355 | 3.2188 | Roses | This study | |
| S13 | 15 | Spain | 41.4208 | 1.976 | Barcelone | This study | |
| S14 | 15 | Spain | 41.0202 | 0.9348 | L’Hospitalet del Infant | This study | |
| S15 | 15 | Spain | 40.3415 | 0.3858 | Peníscola | This study | |
| S16 | 15 | Spain | 38.8029 | 0.1952 | Xàbia | This study | |
| S17 | 15 | Spain | 38.3737 | -3.507 | Santa Elena | This study | |
| S18 | 15 | Spain | 37.2563 | -6.2085 | Aznalcar—Sevilla | This study | |
| S19 | 15 | Spain | 36.7656 | -3.8496 | Malaga | This study | |
| S20 | 15 | Spain | 36.0615 | -5.6695 | Tarifa | This study | |
| M21 | 15 | Morocco | 35.79 | -5.9248 | Cap Spartel | This study | |
| M22 | 15 | Morocco | 35.7828 | -5.5153 | Douar Dakchire | This study | |
| M23 | 13 | Morocco | 34.8685 | -5.3526 | Douar Nefzi | This study | |
| M24 | 15 | Morocco | 33.5341 | -5.9082 | Bouquachmir | This study | |
| M25 | 15 | Morocco | 33.0998 | -5.5883 | Moyen-Atlas—M’rirt | This study | |
| M28 | 15 | Morocco | 31.2104 | -8.0398 | Marrakech-Asni | This study | |
| M29 | 13 | Morocco | 30.6315 | -9.3704 | Ameskroud-Idmine | This study | |
| M30 | 15 | Morocco | 31.111 | -9.6907 | Agadir—Essaouira | This study | |
| OST | 15 | Turkey | 36.11363 | 33.43209 | Tisan | [16, 20, 28, 30, 31] | |
n: sample size
Fig 1. Result of the population structure analyses performed on the genome-wide SNPs diversity of natural populations of O. europaea L. collected in France (143), Spain (123), Morocco (96) and Turkey (13) and cultivated O. europaea L. from the western Mediterranean Basin (145), using 142,060 SNPs.
(A) Geographical distribution of the populations and proportion of genetic cluster assigning. Pie chart at each location represents the fraction of individuals belonging to each genetic cluster as inferred by sNMF (K = 4). (B) PCA inferred with LEA. C: Cultivated, W: Western wild, WE: Eastern wild. (C) Genetic structure inferred by sNMF, each horizontal bar indicates individual assignment to a genetic cluster with K being the number of genetic clusters (K from 2 to 4).
2.2 | Reference set of cultivated and eastern wild accessions
In addition to the sample sites described above, leaves from 10 cultivated olive varieties were added to the sampling. These varieties were selected because of their significant presence in the French sampling area of natural populations [33], which can be a potential source of introgression. Fifteen individual wild olive trees from Turkey in the eastern MB [30] were also added to create a genetically distinct group which will be considered as an outgroup (Fig 1A). Moreover, 135 cultivated varieties from the WOGBM, representative of the genetic diversity of olive resources in the western MB [32] were considered as reference varieties to assess the introgressions from cultivated olive into wild populations. This last dataset was developed in a parallel study by our group that is focusing on cultivated olive (S2 Table). Overall, the experiment included 561 individuals.
2.3 | Bait design
The cultivated Olea europaea var. europaea (cv. Farga) Oe9 genome assembly [29] was used as a reference to design target sequencing probes. This genome is 1.38 Gb. Baits were designed according to the following parameters: place 80 bp probes with 0.5x tilling targeting the first 640 bp of each of the 55,595 annotated genes available. For each gene, 1 to 4 baits were designed depending on its length. After quality filtration, this resulted in a total set of 210,367 baits representing 55,452 unique loci and a captured length of 16.8 Mb. The probes were designed and synthesized by Daicel Arbor Biosciences, Ann Arbor, Michigan, USA. For this study, we only retained sequencing data targeted on assembled chromosomes. This subset represented 102,126 baits with a captured length of 8.2 Mb (S1 Table).
2.4 | Library preparation and sequencing
DNA was extracted from leaves using a mixed alkyl trimethylammonium bromide buffer (MATAB) and NucleoMag Plant Kit (Macherey-Nagel, Düren, Germany) as already described by [35] (S1 File). Individual genomic libraries for the NGS experiments were constructed with the NEBNext® Ultra™ II FS DNA Library Prep Kit (New England Biolabs, Ipswich, MA) with inputs ≤ 100 ng (S1 File). DNA was enzymatically sheared at an average 160 bp length before being tagged with the Unique Dual Index. Enrichment by capture was performed with biotinylated RNA probes (80 bp) as recommended by the provider using myBaits kits (Arbor Biosciences). A single dose of bait was used on a bulk of 48 normalized libraries. The sequencing was performed by MGX-Montpellier GenomiX on an Illumina® NovaseqTM 6000 (Illumina Inc., San Diego, CA, USA) platform with an S4 flow cell. In addition to the target sequencing data set, we also sequenced four whole genomes (OES_E13_09, OES_F10_03, Picholine and Picholine Marocaine) to calculate the enrichment rate of the target sequencing method.
2.4.1 | SNP calling
Raw sequencing reads were first trimmed with FastP version 0.20.1 [36]. The resulting data were then mapped on the reference genome Farga Oe9 genome assembly [29] using bwa-mem2 version 2.0 [37]. The mapped reads were sorted with samtools version 1.10 [38]. Only primary alignment, properly paired and unique reads were kept. Duplicates were removed using picard-tools version 2.24.0 [39]. From this clean alignment, GATK version 4.2.0.0 [40] was used for the SNP calling according to GATK4 best practices (https://gatk.broadinstitute.org/hc/en-us/sections/360007226651-Best-Practices-Workflows).
2.4.2 | SNP filtering
After sequencing, we obtained 27,275,679 raw data. These were filtered with VCFtools version 0.1.16 [41]. The following filters were sequentially applied. With vcf-annotate, we first removed SNPs with a quality below 200 and clusters of 3 SNPs in 10 bases. VCFtools was used to remove indels, to only keep biallelic SNPs, to select SNPs with a minimum depth per site of 8 and a maximum mean depth per site of 400. Sites with >15% missing data were removed, then individuals with >20% missing data were also removed (S4 Table). After filtration, 35 individuals were removed (1 cultivated, 2 eastern wild plants and 32 western wild plants). Only sampling sites with at least 12 individuals were considered in this study. M23 had only 6 individuals left and was therefore removed from the data-set. Filtering was carried out to exclude positions with fixed heterozygosity (>85%) and the final filtering was done to keep at least one minor allele count per site. After all filtering steps, we obtained 142,060 SNPs in the final data set, these were located on all chromosomes (S3 Table).
2.5 | Genomic analysis
2.5.1 | Genetic diversity and genetic structure
Genetic diversity measure was examined for each of sampled sites, considered as distinct populations. We calculated diversity measures as expected heterozygosity (HE), observed heterozygosity (HO) and inbreeding coefficient (FIS) using Hierfstat version 05–11 [42].
Pairwise between population genetic differentiations were estimated with Weir and Cockeman fixation index (FST) using pairwise.WCfst function from the Hierfstat package version 05–11 [42]. Support values were calculated per locus, for each pair of population, based on bootstraps procedure (S5 Table).
Genetic structure analyses were conducted using sNMF and Principal Component Analysis (PCA) from the LEA package version 3.11.3 [43]. Both analyses were performed on the dataset. It included 135 individuals from the WOGBM, originating from Spain, France and Morocco according to their passport data, 10 cultivated samples from southern France, 362 western MB wild individuals and 13 eastern MB wild samples. For sNMF, five repetitions per clusters (K) considered, were performed with K ranging from 1 to 10.
2.5.2 | Admixture assessment
Inference of the population history, with the admixture and split pattern were done using TreeMix version 1.13 [44]. This software constructs admixture graphs using allele frequencies of current genetic populations to infer a graph of all ancestral populations related to a common ancestor. For this analysis, we grouped cultivated olives in four different genetic groups, depending on their sNMF assignment to genetic ancestral clusters: C0, C1, C3 and C4 (S1 File; S6 Table). With these clusters, the 27 populations collected in western MB and the Turkish population, we did 100 TreeMix runs with a random SNP block size between 100 and 1000, from 1 to 10 migrations each when considering the M29 population as an outgroup. We inferred the optimum number of migrations with multiple linear models and the Evanno method implemented in the OptM package version 0.1.6 (S1 and S2 Figs) [45, 46]. TreeMix analysis was performed with 500 bootstrap replicates, which were used to build a consensus tree with Phylip version 3.697 [47]. We used BITE packages version 1.2.0008 [48] to display the trees.
3 | Results
In this study, we analyzed 520 individuals, including 145 cultivars from Spain, France and Morocco, representing the MB olive diversity of cultivated olive trees (S2 Table), a set of 362 wild trees from France, Spain and Morocco collected in 27 natural sites and 13 wild trees from southern Turkey.
3.1 | Target sequencing efficiency
The average enrichment rate in the target sequencing experiment was 34 times higher than expected with whole-genome sequencing (S7 Table). Moreover, for the bait on the chromosome annotated genes, 63.5% of the filtered SNPs (90,157) were on-target SNPs. The remaining was off-target SNPs. The on-target SNPs corresponded to sequences targeted by the baits. Conversely, the off-target corresponded to nonspecific and unintended sequences that can arise through sequencing (S7 Table).
3.2 | Genetic diversity and genetic structure
The average inbreeding coefficient (FIS) calculated on the different populations was on average 0. This is in accordance with the outcrossing mating system of olive tree. For 3 populations we detected FIS values ranging from -0.086 to -0.101 (F03, F11, S13) while 3 other ones FIS values ranging from -0.192 to-0.206 (S16, S17 and M28) (Table 2). All of these populations might be resulting from admixture event.
Table 2. Summary information of genetic diversity for sampling sites of naturally occurring olive trees in the western and eastern Mediterranean Basin.
| n | H O | H E | F IS | |
|---|---|---|---|---|
| F01 | 15 | 0.142±0.195 | 0.141±0.178 | -0.003±0.263 |
| F02 | 13 | 0.147±0.199 | 0.144±0.180 | -0.014±0.265 |
| F03 | 15 | 0.145±0.213 | 0.130±0.177 | -0.086±0.234 |
| F04 | 15 | 0.134±0.202 | 0.124±0.174 | -0.056±0.247 |
| F06 | 15 | 0.148±0.202 | 0.142±0.180 | -0.032±0.249 |
| F07 | 13 | 0.154±0.218 | 0.141±0.184 | -0.070±0.262 |
| F08 | 15 | 0.127±0.197 | 0.121±0.173 | -0.029±0.262 |
| F09 | 15 | 0.121±0.194 | 0.118±0.172 | -0.017±0.273 |
| F10 | 13 | 0.135±0.205 | 0.126±0.176 | -0.053±0.271 |
| F11 | 14 | 0.147±0.222 | 0.129±0.181 | -0.101±0.252 |
| S12 | 15 | 0.158±0.214 | 0.146±0.184 | -0.057±0.242 |
| S13 | 14 | 0.155±0.225 | 0.136±0.182 | -0.098±0.246 |
| S14 | 15 | 0.154±0.204 | 0.149±0.184 | -0.026±0.254 |
| S15 | 13 | 0.148±0.205 | 0.142±0.182 | -0.024±0.264 |
| S16 | 12 | 0.187±0.274 | 0.144±0.187 | -0.206±0.319 |
| S17 | 14 | 0.168±0.264 | 0.131±0.187 | -0.195±0.314 |
| S18 | 12 | 0.147±0.204 | 0.141±0.182 | -0.028±0.271 |
| S19 | 14 | 0.140±0.199 | 0.140±0.184 | 0.001±0.280 |
| S20 | 14 | 0.123±0.201 | 0.115±0.174 | -0.043±0.269 |
| M21 | 12 | 0.139±0.206 | 0.129±0.175 | -0.051±0.262 |
| M22 | 15 | 0.141±0.196 | 0.135±0.173 | -0.030±0.246 |
| M24 | 15 | 0.139±0.202 | 0.131±0.174 | -0.045±0.247 |
| M25 | 13 | 0.127±0.197 | 0.123±0.174 | -0.020±0.270 |
| M28 | 14 | 0.175±0.262 | 0.139±0.189 | -0.192±0.295 |
| M29 | 12 | 0.114±0.198 | 0.111±0.176 | -0.018±0.307 |
| M30 | 15 | 0.121±0.198 | 0.115±0.174 | -0.031±0.268 |
| OST | 13 | 0.117±0.183 | 0.116±0.166 | -0.007±0.271 |
n, number of genotypes; HE, expected heterozygosity; HO, observed heterozygosity; F IS, population level deviation from Hardy-Weinberg heterozygosity
The pairwise differentiation values between the studied populations (FST) ranged from 0.01 to 0.42 (Fig 2). The Turkish eastern wild sampling site was the most genetically differentiated from the western MB sites, such as Corsica, F01 and F07 in Continental France, S14, S17, S18 and S20 in Spain, and all sites in Morocco (all above 0.2). Compared to the Turkish wild population, M29 from the southern limit of the olive distribution is the most differentiated population (FST = 0.42), while F04 from France was the least differentiated population (FST = 0.01). We revealed a high genetic differentiation between western MB wild populations and the eastern MB wild population.
Fig 2. Heatmap of pairwise FST performed on the genome-wide SNPs diversity of 27 natural populations of O. europaea L. collected in western Mediterranean Basin in France (143), Spain (123), Morocco (96) and in the eastern Mediterranean Basin in Turkey (13).
In the principal component analysis (PCA), the first axis PC1 accounted for 25.4% of the variation and revealed an eastern-western genetic structure between western MB wild olive trees and eastern MB wild olive trees, with cultivated accessions mainly related to the eastern MB wild populations (Figs 1B and 3). On the first axis, we observed accessions from sampling sites in Corsica (F08 to F11, Fig 3B), S20, a large part of S18 from Spain (Fig 3C) and from all the sites in Morocco (Fig 3D), with the notable exception of M28, were clearly separated from the cultivated accessions and eastern MB wild accessions (Fig 3). All individuals collected in central southern France (F02, F03, F04 and F06; Fig 3A), one from Morocco (M28, Fig 3D) and some from north-central Spain (S12, S13, S15 and S16; Fig 3C) clustered with cultivated accessions as shown in the Figs 1B and 3 (left side of the PCA). This profile suggests admixture events. Several other individuals collected in eastern and western France (F01 and F07; Fig 3A), in Morocco (M21, M22, M24 and M25; Fig 3D) and in Spain (S14, S17, S18 and S19; Fig 3C) also exhibited a pattern of admixture with the cultivated accessions. The second axis, i.e. PC2, accounted for 2.2% of the observed variability, highlighting two subgroups within cultivated trees. The first subgroup includes cultivated genotypes mostly from Spanish varieties such as “Picual” and three Moroccan varieties including “Picholine Marocaine”. The other cultivated group included several varieties from Spain, Morocco and France (Fig 1B).
Fig 3. Detailed genetic structure of O. europaea L. populations using PCA analysis.
Red boxes represent cultivated individuals, orange triangles represent Turkish population with the (A) French continental populations, (B) Corsican French populations, (C) Spain populations, (D) Moroccan populations.
A similar pattern was supported by the sNMF analyses. According to the cross-entropy criterion, only K from 2 to 4 were considered suitable to explain the western MB natural olive tree genetic pattern (Fig 1C; S3 Fig). The wild Turkish olive population (OST) was assigned to a specific cluster from K = 2 to K = 4 (in blue) regardless of the admixture model examined, thereby supporting the existence of a structure between western MB and eastern MB natural populations. At K = 2, the cultivated and eastern MB olive trees collected in natural sites were assigned to cluster 1 (in blue) and the western MB wild natural olive trees in Morocco, Corsica, South Spain, France (F01 and F07 sites) were mainly assigned to cluster 2 (in green). Olive trees from the F02, F03, F04 and F06 sites in France were mainly assigned to cluster 1 (in blue). Trees collected from S12, S13, S15, S16, S17 S19 and M28 sites spread in the two clusters. At K = 3, these sampling sites were mostly assigned to cluster 3 (in red), particularly for individuals from populations M28, S17, and S16. With K = 4, a fourth cluster (in dark blue) was noted within the cultivated cluster, corresponding to the same first subgroup described in the PCA results above (Figs 1B and 3).
By combining three analyses (i.e. PCA, sNMF and pairwise FST; Figs 1 and 3), particularly by considering the left and central part of the PCA (Figs 1B and 3) and the cluster 3 (in red; Fig 1C) from the sNMF analyses, we have several arguments strongly suggesting admixtures between natural olive trees and cultivated ones. It seems to be consistent with FST values, with much lower levels of differentiation between cultivated olive accessions and wild olive accessions (Fig 2). Accordingly, all olive trees mapped in the right side of the PCA (Figs 1B and 3) and assigned to the cluster (in green) regardless of the admixture model examined (Fig 1C) and considered as genuine wild.
3.3 | Inference of population admixture and gene flow
The tree inferred by TreeMix was ranked using M29 population. This population was chosen because all of the individuals collected in this site belong to the same cluster, referred as the western MB wild cluster (in green; Fig 1C). This genuinely western MB wild olive also appeared to be the most genetically distinct from eastern MB wild olive (FST = 0.42) and from the cultivated accessions (Fig 1).
The TreeMix analysis revealed the highest divergence between M29 and C3 (cultivated) and OST (Turkish population). The lowest divergence (below 0.015) from M29 were found for with almost all the Moroccan sites (except for M28), with S20, F08, F09, F10 and F11. A second group of sampling sites was found with a divergence from M29 of 0.017 to 0.024, including S18, M28, F07, F01 and S17. Accessions from sampling sites in Spain, except for S20, had a genetic divergence of >0.032 from M29 and were closer to cultivated groups (<0.012 genetic divergence between S16 and C3). Accessions from the French F06, F02, F03 and F04 sites were found to be grouped with the cultivated clusters C3 and C0. TreeMix inferred a low divergence between C4, C1 and OST (around 0.005). Two gene flow events were inferred (S1 and S2 Figs), with the first one being from cultivated and French populations from the center to M28, with a weight of (w = 0.425). The second one was from M22 to northern Spain sites (w = 0.485) (Fig 4).
Fig 4. Tree inferred by TreeMix analysis on natural O. europaea L. populations from the western and eastern Mediterranean Basin and cultivated accessions.
Genetic divergence is represented by the horizontal difference between populations. The vertical bars are only graphical representations and are not taken into account in the analysis. C0, C1, C3 and C4 are groups of cultivated accessions (S1 File).
4 | Discussion
Wild olive molecular identification and characterization are important to assess the genetic diversity of this species, in addition, to evolutionary history and to investigate local adaptation. This is particularly important for wild relative crops such as olive, where wild olives and cultivated olives coexist in the same area [14, 15] and cannot be clearly distinguish with morphological traits [26, 27] but only using genetic markers [19, 21, 28–31]. As crop-to-wild gene flows likely occur, deciphering their genetic relationships can help to understand the impact of crop on natural populations, their demographic histories and to explore new sources of genetic diversity. In this study, our aim was to identify genuine wild olive populations by investigating the genetic structure and diversity patterns of olive trees evolving in the natural environment in the western MB. We sampled allegedly wild olive trees according to past study [21] over a large geographic area in natural environment within the western MB. We hypothesis that all these populations are genuine wild olives. In addition, we included eastern MB genuine wild olive trees and cultivated olive trees. We analyzed this large panel of population using SNPs from target sequencing. This if the first time target sequencing has been used to study genomic variation in olive tree—it enabled a genome scan of many individuals while accessing more than 140,000 SNPs distributed throughout the genome. This represents a major advance over previous methods in similar studies using SSR-based molecular analyses [21, 28] or SNPs analyses from RNAseq [30]. We documented the presence of genuine wild olive tree populations in the western MB, in southern France, Spain and Morocco. However, contrary to our assumption, our analysis suggested that admixed populations are more frequent than genuine ones.
4.1 | Genetic variation patterns in natural and cultivated olive trees highlight the persistence of genuine wild olive populations in the western Mediterranean Basin
Based on a large set of SNP markers, located all over the genome, we identify a very strong genetic differentiation between the natural populations collected in the western MB and those in the eastern MB. This confirmed the results of previous studies using microsatellite markers highlighted two distinct gene pools in the eastern and western MB. In these previous studies, the western/central MB gene pool clustered wild populations from Greece to Morocco [21, 28] while the eastern MB gene pool clustered wild populations from Greece to the Levant. The eastern MB gene pool was represented here by one Turkish population (OST), i.e. a genuine wild population previously revealed using plastid DNA polymorphism [16, 21], SSR markers [28] and SNPs from RNAseq [30]. Our results reflects the typical long evolutionary history of Mediterranean species as described by [49].
Our study confirmed that the wild eastern MB olive population from Turkey is very closely related to cultivated olives, as shown by different approaches (PCA, sNMF, FST and TreeMix). This finding is consistent with the olive domestication history which was likely the results of genetic selection of eastern MB wild trees [16, 28, 30]. Moreover, we combined several approaches, i.e. PCA, sNMF, pairwise FST estimation and inference of splits in mixture in populations and clearly identified a genetic group of olive trees strongly differentiated from the eastern wild and cultivated olive as previously found by [30] which probably constitute genuine wild olive trees. Finally, within cultivated accessions, we identified two genetic groups. The first one similar to OST and which might be composed of varieties likely issued from two processes, primary selection in the east and secondary diversification in the central and western Mediterranean areas as proposed by Khadari & El Bakkali (2018). The second group essentially consisted of Spanish and Moroccan varieties that were highly differentiated from OST and which might be composed of western MB cultivated olives through a secondary diversification process mainly via selection involving crossing between ancient varieties such as Gordal Sevillana and Lechin de Granada, as shown by [19].
4.2 | Wild versus admixed olive trees: An evolutionary history impacted by domestication
As discussed above, we identified genuine wild olive trees in the western Mediterranean basin in Spain, Corsica (France) and Morocco as already shown by [21]. The genetic diversity observed in Corsican and Moroccan populations were very close suggesting a common ancestral history for these populations. We also discovered, for the first time, the presence of genuine wild olive trees in continental France at Mont-Boron (eastern-south; F07) and near Leucate (western-south; F01). The abundance of wild olives trees in the western MB has been well characterized [16, 20, 28, 30, 31]. However, populations initially characterized as genuine wilds according to plastid DNA polymorphism and SSR markers, for instance, in Spain [21] were found admixed in our study. These sites were in habitats considered to be little or not at all impacted by human activities, e.g. in natural reserves, remote from urban centres or areas with olive orchards. Even within some sites, from a genetic viewpoint, several individuals were considered genetically to be wild, while others were very admixed with cultivated olives. This high genetic admixture intensity was unexpected in the western MB based on previous studies [16, 21, 28, 30, 50]. This differing results compared to past study [21, 28] may arise from variations in the sampling distribution area and the size of the samples at each site, particularly with the presence of heterogeneous populations, including admixed and wild individuals. The SSRs should be able to detect a similar pattern of admixture. Our findings have provided new insights into the evolutionary history of olive trees in natural western MB habitat.
4.3 | What factors could influence the prevalence of admixed populations in natural environments?
We obtained clear evidence in this study on the substantial presence of admixed populations within the natural olive populations which were previously reported to be little impacted by crop-to-wild gene flow [21, 28]. Recent phylogenomic and population structure investigations revealed genetic admixtures during olive domestication thus highlighting the impact of domesticated alleles on two wild olive trees for the western MB [29]. However, these authors analyzed very limited sampling of wild olive trees (7 olive trees from the western MB), while in our investigations, we analyzed 362 trees sampled from 27 natural sites in Spain, Morocco and France—this sampling covered the olive natural distribution range in the western MB. Through TreeMix analysis and comparing genomic variation comparison in these 362 olive trees to the genuine Turkish natural population and to cultivated olive trees, we, therefore, were able to depict gene flow between wild and cultivated olives (see Fig 4). The observed patterns highlighted the two kinds of gene flow events: from cultivated to wild populations in Morocco, France and Spain and from wild to cultivated or already admixed populations in Spain.
These gene flows could have been driven by several factors. First, the mating system of olive, which, is allogamous, pollination mostly depends on wind (anemophilous pollination) and seed dissemination relies on birds (zoochorous dissemination). These forms of dissemination may occur over long geographic distances (>50 km) [51]. Second, cultivated and wild olive trees share the same climatic and ecological niches, the geographic proximity between them increases the possibility of gene flow and events of admixture [50]. Third, there could be cultivated versus wild olive tree pollen competition: monocultures and single-varietal olive orchards are responsible for broad dissemination of pollen from orchards (thousands of trees), whereas wild populations are often composed of few individuals. Wild olive pollen is thus less abundant. Fourth, gene flow between cultivated and wild olive trees may increase genetic diversity in admixed populations. Associated new variants or combination might be better adapted to the local environment, promoting an acceleration of local adaptation of a species to an environment is a recognized evolutionary force explaining the occurrence of admixed populations [8]. Evolutionary factors such as allogamy [52] and cultivated versus wild olive tree pollen competition (see above) may not be sufficient to explain the large frequency of admixed populations versus genuine wild populations. Here we assume that admixed olive trees could have a better adaptive potential to their natural environment, as this has been previously demonstrated in several short-lived and annual crops [3, 4, 10]. This assumption is supported by the findings of genetic investigations on natural olive trees in Australia [53]. These authors hypothesized that hybridization between two introduced Olea species, Olea europaea subsp. europaea and Olea europaea subsp. cuspidata, overcame the lack of diversity after their introduction bottleneck, thereby facilitating their establishment. Strikingly, to our knowledge, this assumption has yet to be investigated in long-lived woody plants such as olive trees, even though knowledge on the impact of admixture on the evolution of natural populations could help guide appropriate conservation strategies in forest areas and other natural ecosystems.
4.4 | Consequences of extensive hybridization of wild olive via domesticated olive introgression and conservation recommendations
The future of genuine wild genotypes might be threatened by the gene flow we highlighted here. For instance, extensive gene flow could ultimately lead to complete replacement of wild populations by admixed genotype [8]. However, in global change context, this gene flow could enhance adaptation to a changing environment. Our study offers new opportunities for more in-depth studies on this introgression process. We identified three different compartments, i.e. a genuine wild olive compartment, a cultivated one and an admixed one, that could be study to address this long-standing question. Conservation policies on wild olive trees should take into account the risk of introgression from cultivated alleles and by the impact of climate change. The naturally occurring olive trees sampled here were positioned on a north-south gradient with different environmental conditions, which could facilitate studies on their potential local adaptation to changing climatic conditions. Our study finding may provide a basis for designing new conservation measures to protect genuine wild genotypes, in-situ and ex-situ, including repositories of wild genetic diversity not impacted by artificial selection.
5 | Conclusion
In this study we assessed the genetic structure of natural olive populations from the western Mediterranean Basin. We confirmed that the western MB genuine wild olive is genetically well differentiated from eastern MB wild olive as well as cultivated forms. We detected its presence in France, Spain and Morocco. We also found many admixed populations resulting from strong crop-to-wild gene flow. The presence of admixed olive populations in the same distribution area as genuine wild populations raises questions on the reasons for their predominance in the natural environment and on designing conservation strategies for both compartments. Finally, the two genetic patterns revealed by our investigations could be considered as a suitable model for investigating two core questions, the first on the admixture nature, i.e. what domesticated genomic alleles/regions would be suitable for introgression in wild genomes? The second question is related to local adaptation: are wild better locally adapted than admixed olive trees?
Supporting information
Supplementary methods for the DNA extraction, Library preparation, assignment to create subgroup inside cultivated.
(DOCX)
One hundred TreeMix runs were done with a random SNPs block size between 100 and 1000, from 1 to 10 migrations each with M29 as outgroup. The analysis was performed with OptM package.
(TIF)
One hundred TreeMix runs were done with a random SNPs block size between 100 and 1000, from 1 to 10 migrations each with M29 as outgroup. The analysis was performed with OptM package.
(TIF)
(TIF)
The baits were designed based on annotated gene of Olea europaea var. europaea (cv. Farga) Oe9 genome assembly (Julca et al. 2020).
(XLSX)
(DOCX)
(DOCX)
(DOCX)
The upper limit ci is the upper part and the lower limit ci in the bottom part of the matrix.
(XLSX)
(DOCX)
(DOCX)
Acknowledgments
We thank Guillaume Perez, Magalie Delalande, Karine Loridon, Juliette Arzul, Sylvia Lochon-Menseau for helping us in wild olive sampling from in situ areas; Christian Pinatel for the sampling of French cultivated olive trees; Bénédicte Rhoné for her advices in the target sequencing design. Anaïs Fossot, Hélène Vignes and Ronan Rivallan for the laboratory help. We acknowledge the helpful comments from Joëlle Ronfort and the English corrections from David Manley.
A part of a field work has been supported by the Conservatoire Botanique National Méditerranéen de Porquerolles, l’Institut Agro Montpellier and INRA Morocco. This work has been realized with the support of MESO@LR-Platform at the University of Montpellier, the CIRAD, and the technical support of the bioinformatics group of the UMR AGAP Institute, member of the French Institute of Bioinformatics (IFB)—South Green Bioinformatics Platform.
Data Availability
All raw sequences of Olea europaea are available in the following database: ClimOliveMed; 2023;GenomiCOM: ClimOliveMed Genomic resources for research on adaptation of olive tree to climate change; European Nucleotide Archive; 2023-04-17; PRJEB61410. Snakemake workflow of the SNP calling is available here: https://forgemia.inra.fr/gautier.sarah/ClimOlivMedCapture.
Funding Statement
LZ received a PhD scholarship from the French government. This study was funded through Labex AGRO 2011-LABX-002, project ClimOliveMed n° 2003-001 (under I-Site Muse framework) coordinated by Agropolis Fondation. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Mary N, Iannuccelli N, Petit G, Bonnet N, Pinton A, Barasc H, et al. Genome-wide analysis of hybridization in wild boar populations reveals adaptive introgression from domestic pig. Evol Appl. 2022;15(7):1115–28. doi: 10.1111/eva.13432 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Anderson D, Negishi Y, Ishiniwa H, Okuda K, Hinton TG, Toma R, et al. Introgression dynamics from invasive pigs into wild boar following the March 2011 natural and anthropogenic disasters at Fukushima. Proc Biol Sci. 2021;288(1953):20210874. doi: 10.1098/rspb.2021.0874 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Johnsson M, Gering E, Willis P, Lopez S, Van Dorp L, Hellenthal G, et al. Feralisation targets different genomic loci to domestication in the chicken. Nat Commun. 2016;7:12950. doi: 10.1038/ncomms12950 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Le Corre V, Siol M, Vigouroux Y, Tenaillon MI, Délye C. Adaptive introgression from maize has facilitated the establishment of teosinte as a noxious weed in Europe. Proc Natl Acad Sci U S A. 2020;117(41):25618–27. doi: 10.1073/pnas.2006633117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hauser TP, Shim SI. Survival and flowering of hybrids between cultivated and wild carrots (Daucus carota) in Danish grasslands. Environ Biosafety Res. 2007;6(4):237–47. doi: 10.1051/ebr:2007044 [DOI] [PubMed] [Google Scholar]
- 6.Gering E, Incorvaia D, Henriksen R, Conner J, Getty T, Wright D. Getting Back to Nature: Feralization in Animals and Plants. Trends Ecol Evol. 2019;34(12):1137–51. doi: 10.1016/j.tree.2019.07.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Keller M, Kollmann J, Edwards PJ. Genetic introgression from distant provenances reduces fitness in local weed populations. J Appl Ecol. 2000;37(4):647–59. [Google Scholar]
- 8.Ellstrand NC, Prentice HC, Hancock JF. Gene Flow and Introgression from Domesticated Plants into their Wild Relatives. Annu Rev Ecol Syst. 1999;30(1):539–63. [Google Scholar]
- 9.Cornille A, Antolín F, Garcia E, Vernesi C, Fietta A, Brinkkemper O, et al. A Multifaceted Overview of Apple Tree Domestication. Trends Plant Sci. 2019;24(8):770–82. doi: 10.1016/j.tplants.2019.05.007 [DOI] [PubMed] [Google Scholar]
- 10.Feurtey A, Cornille A, Shykoff JA, Snirc A, Giraud T. Crop-to-wild gene flow and its fitness consequences for a wild fruit tree: Towards a comprehensive conservation strategy of the wild apple in Europe. Evol Appl. 2017;10(2):180–8. doi: 10.1111/eva.12441 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Chenault N, Arnaud-Haond S, Juteau M, Valade R, Almeida JL, Villar M, et al. SSR-based analysis of clonality, spatial genetic structure and introgression from the Lombardy poplar into a natural population of Populus nigra L. along the Loire River. Tree Genet Genomes. 2011;7(6):1249–62. [Google Scholar]
- 12.Jelić M, Patenković A, Skorić M, Mišić D, Kurbalija Novičić Z, Bordács S, et al. Indigenous forests of European black poplar along the Danube River: genetic structure and reliable detection of introgression. Tree Genet Genomes. 2015;11(5):89. [Google Scholar]
- 13.Nishio S, Takada N, Terakami S, Takeuchi Y, Kimura MK, Isoda K, et al. Genetic structure analysis of cultivated and wild chestnut populations reveals gene flow from cultivars to natural stands. Sci Rep. 2021;11(1):240. doi: 10.1038/s41598-020-80696-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Zohary D, Spiegel-roy P. Beginnings of Fruit Growing in the Old World. American Association for the Advancement of Science. 1975;187(4174):319–27. doi: 10.1126/science.187.4174.319 [DOI] [PubMed] [Google Scholar]
- 15.Rubio de Casas R, Besnard G, Schönswetter P, Balaguer L, Vargas P. Extensive gene flow blurs phylogeographic but not phylogenetic signal in Olea europaea L. Theor Appl Genet. 2006;113(4):575–83. doi: 10.1007/s00122-006-0306-2 [DOI] [PubMed] [Google Scholar]
- 16.Besnard G, Khadari B, Navascués M, Fernández-Mazuecos M, Bakkali AE, Arrigo N, et al. The complex history of the olive tree: From late quaternary diversification of mediterranean lineages to primary domestication in the northern Levant. Proceedings of the Royal Society B: Biological Sciences. 2013;280:1–7. doi: 10.1098/rspb.2012.2833 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Clark PU, Dyke AS, Shakun JD, Carlson AE, Clark J, Wohlfarth B, et al. The Last Glacial Maximum. Science. 2009;325(5941):710–4. doi: 10.1126/science.1172873 [DOI] [PubMed] [Google Scholar]
- 18.Carrión Y, Ntinou M, Badal E. Olea europaea L. in the North Mediterranean Basin during the Pleniglacial and the Early-Middle Holocene. Quat Sci Rev. 2010;29(7–8):952–68. [Google Scholar]
- 19.Diez CM, Trujillo I, Martinez-Urdiroz N, Barranco D, Rallo L, Marfil P, et al. Olive domestication and diversification in the Mediterranean Basin. New Phytol. 2015;206(1):436–47. doi: 10.1111/nph.13181 [DOI] [PubMed] [Google Scholar]
- 20.Besnard G, Baradat P, Breton C, Khadari B, Bervillé A. Olive domestication from structure of oleasters and cultivars using nuclear RAPDs and mitochondrial RFLPs. Genet Sel Evol. 2001;33:S251. [Google Scholar]
- 21.Besnard G, Bakkali AE, Haouane H, Baali-Cherif D, Moukhli A, Khadari B. Population genetics of Mediterranean and Saharan olives: Geographic patterns of differentiation and evidence for early generations of admixture. Ann Bot. 2013;112(7):1293–302. doi: 10.1093/aob/mct196 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Rodríguez-Sánchez F, Guzmán B, Valido A, Vargas P, Arroyo J. Late Neogene history of the laurel tree (LaurusL., Lauraceae) based on phylogeographical analyses of Mediterranean and Macaronesian populations. J Biogeogr. 2009;36(7):1270–81. [Google Scholar]
- 23.Arroyo-García R, Ruiz-García L, Bolling L, Ocete R, López MA, Arnold C, et al. Multiple origins of cultivated grapevine (Vitis vinifera L. ssp. sativa) based on chloroplast DNA polymorphisms. Mol Ecol. 2006;15(12):3707–14. doi: 10.1111/j.1365-294X.2006.03049.x [DOI] [PubMed] [Google Scholar]
- 24.Kaniewski D, Van Campo E, Boiy T, Terral JF, Khadari B, Besnard G. Primary domestication and early uses of the emblematic olive tree: Palaeobotanical, historical and molecular evidence from the Middle East. Biol Rev Camb Philos Soc. 2012;87(4):885–99. doi: 10.1111/j.1469-185X.2012.00229.x [DOI] [PubMed] [Google Scholar]
- 25.Newton C, Lorre C, Sauvage C, Ivorra S, Terral JF. On the origins and spread of Olea europaea L. (olive) domestication: Evidence for shape variation of olive stones at Ugarit, Late Bronze Age, Syria-a window on the Mediterranean Basin and on the westward diffusion of olive varieties. Veg Hist Archaeobot. 2014;23(5):567–75. [Google Scholar]
- 26.Terral, Alonso N, Buxó I Capdevila, Chatti N, Fabre L, Fiorentino G, et al. Historical biogeography of olive domestication (Olea europaea L.) as revealed by geometrical morphometry applied to biological and archaeological material. J Biogeogr. 2004;31(1):63–77. [Google Scholar]
- 27.Terral JF, Bonhomme V, Pagnoux C, Ivorra S, Newton C, Paradis L, et al. The Shape Diversity of Olive Stones Resulting from Domestication and Diversification Unveils Traits of the Oldest Known 6500-Years-Old Table Olives from Hishuley Carmel Site (Israel). Agronomy. 2021;11(11):2187. [Google Scholar]
- 28.Khadari B, El Bakkali A. Primary Selection and Secondary Diversification: Two Key Processes in the History of Olive Domestication. International Journal of Agronomy [Internet]. 2018;2018. Available from: 10.1155/2018/5607903 [DOI] [Google Scholar]
- 29.Julca I, Marcet-Houben M, Cruz F, Gómez-Garrido J, Gaut BS, Díez CM, et al. Genomic evidence for recurrent genetic admixture during the domestication of Mediterranean olive trees (Olea europaea L.). BMC Biol. 2020;18(1):1–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Gros-Balthazard M, Besnard G, Sarah G, Holtz Y, Leclercq J, Santoni S, et al. Evolutionary transcriptomics reveals the origins of olives and the genomic changes associated with their domestication. Plant J. 2019;100(1):143–57. doi: 10.1111/tpj.14435 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lumaret R, Ouazzani N. Ancient wild olives in Mediterranean forests. Nature. 2001;413(6857):700–700. [DOI] [PubMed] [Google Scholar]
- 32.El Bakkali A, Essalouh L, Tollon C, Rivallan R, Mournet P, Moukhli A, et al. Characterization of Worldwide Olive Germplasm Banks of Marrakech (Morocco) and Córdoba (Spain): Towards management and use of olive germplasm in breeding programs. PLoS One. 2019;14(10):e0223716. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Khadari B, El Bakkali A, Essalouh L, Tollon C, Pinatel C, Besnard G. Cultivated Olive Diversification at Local and Regional Scales: Evidence From the Genetic Characterization of French Genetic Resources. Front Plant Sci. 2019;10:1593. doi: 10.3389/fpls.2019.01593 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Copernicus Land Monitoring Service. CORINE Land Cover [Internet]. Land Copernicus Eu. 2020 [cited 2021 Mar 24]. Available from: https://land.copernicus.eu/pan-european/corine-land-cover/clc2018
- 35.Cormier F, Lawac F, Maledon E, Gravillon MC, Nudol E, Mournet P, et al. A reference high-density genetic map of greater yam (Dioscorea alata L.). Theor Appl Genet. 2019;132(6):1733–44. doi: 10.1007/s00122-019-03311-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34(17):i884–90. doi: 10.1093/bioinformatics/bty560 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Vasimuddin M, Misra S, Li H, Aluru S. Efficient Architecture-Aware Acceleration of BWA-MEM for Multicore Systems. In: 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS). ieeexplore.ieee.org; 2019. p. 314–24.
- 38.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–9. doi: 10.1093/bioinformatics/btp352 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Broad Institute. Picard toolkit. [Internet]. 2019. Available from: https://broadinstitute.github.io/picard/
- 40.Poplin R, Ruano-Rubio V, DePristo MA, Fennell TJ, Carneiro MO, Van der Auwera GA, et al. Scaling accurate genetic variant discovery to tens of thousands of samples [Internet]. bioRxiv. 2017. [cited 2023 Mar 28]. p. 201178. Available from: https://www.biorxiv.org/content/10.1101/201178v2 [Google Scholar]
- 41.Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27(15):2156–8. doi: 10.1093/bioinformatics/btr330 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Goudet J, Jombart T. Hierfstat: Estimation and tests of hierarchical F-statistics. R package version 0.5–11. [Internet] 2022. Available from: https://CRAN.R-project.org/package=hierf stat [Google Scholar]
- 43.Frichot E, François O. LEA: An R package for landscape and ecological association studies. Methods Ecol Evol. 2015;6(8):925–9. [Google Scholar]
- 44.Pickrell JK, Pritchard JK. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet. 2012;8(11):e1002967. doi: 10.1371/journal.pgen.1002967 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Fitak RR. OptM: estimating the optimal number of migration edges on population trees using Treemix. Biol Methods Protoc. 2021;6(1):bpab017. doi: 10.1093/biomethods/bpab017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005;14(8):2611–20. doi: 10.1111/j.1365-294X.2005.02553.x [DOI] [PubMed] [Google Scholar]
- 47.Felsenstein J. PHYLIP (phylogeny inference package), version 3.5 c. Joseph Felsenstein. [Internet] 2005. Available from: https://phylipweb.github.io/phylip/ [Google Scholar]
- 48.Milanesi M, Capomaccio S, Vajana E, Bomba L, Garcia JF, Ajmone-Marsan P, et al. BITE: an R package for biodiversity analyses [Internet]. bioRxiv. 2017. [cited 2023 Mar 2]. p. 181610. Available from: https://www.biorxiv.org/content/10.1101/181610v1 [Google Scholar]
- 49.Blondel J, Aronson J, Bodiou JY, Boeuf G. The Mediterranean Region: Biological Diversity in Space and Time. Oxford University Press; 2010. 376 p. [Google Scholar]
- 50.Besnard G, Terral JF, Cornille A. On the origins and domestication of the olive: A review and perspectives. Ann Bot. 2018;121(3):385–403. doi: 10.1093/aob/mcx145 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Spennemann DHR, Richard Allen L. The avian dispersal of olives Olea europaea: Implications for Australia. Emu. 2000;100(4):264–73. [Google Scholar]
- 52.Saumitou-Laprade P, Vernet P, Vekemans X, Billiard S, Gallina S, Essalouh L, et al. Elucidation of the genetic architecture of self-incompatibility in olive: Evolutionary consequences and perspectives for orchard management. Evol Appl. 2017;10(9):867–80. doi: 10.1111/eva.12457 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Besnard G, Dupuy J, Larter M, Cuneo P, Cooke D, Chikhi L. History of the invasive African olive tree in Australia and Hawaii: evidence for sequential bottlenecks and hybridization with the Mediterranean olive. Evol Appl. 2014;7(2):195–211. doi: 10.1111/eva.12110 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary methods for the DNA extraction, Library preparation, assignment to create subgroup inside cultivated.
(DOCX)
One hundred TreeMix runs were done with a random SNPs block size between 100 and 1000, from 1 to 10 migrations each with M29 as outgroup. The analysis was performed with OptM package.
(TIF)
One hundred TreeMix runs were done with a random SNPs block size between 100 and 1000, from 1 to 10 migrations each with M29 as outgroup. The analysis was performed with OptM package.
(TIF)
(TIF)
The baits were designed based on annotated gene of Olea europaea var. europaea (cv. Farga) Oe9 genome assembly (Julca et al. 2020).
(XLSX)
(DOCX)
(DOCX)
(DOCX)
The upper limit ci is the upper part and the lower limit ci in the bottom part of the matrix.
(XLSX)
(DOCX)
(DOCX)
Data Availability Statement
All raw sequences of Olea europaea are available in the following database: ClimOliveMed; 2023;GenomiCOM: ClimOliveMed Genomic resources for research on adaptation of olive tree to climate change; European Nucleotide Archive; 2023-04-17; PRJEB61410. Snakemake workflow of the SNP calling is available here: https://forgemia.inra.fr/gautier.sarah/ClimOlivMedCapture.




