Skip to main content
Evolutionary Applications logoLink to Evolutionary Applications
. 2012 Jul 18;6(2):243–252. doi: 10.1111/j.1752-4571.2012.00285.x

Genetic variation and risks of introgression in the wild Coffea arabica gene pool in south-western Ethiopian montane rainforests

Raf Aerts 1, Gezahegn Berecha 2,3, Pieter Gijbels 3, Kitessa Hundera 1,4, Sabine Glabeke 5, Katrien Vandepitte 3, Bart Muys 1, Isabel Roldán-Ruiz 5, Olivier Honnay 2
PMCID: PMC3689350  PMID: 23798974

Abstract

The montane rainforests of SW Ethiopia are the primary centre of diversity of Coffea arabica and the origin of all Arabica coffee cultivated worldwide. This wild gene pool is potentially threatened by forest fragmentation and degradation, and by introgressive hybridization with locally improved coffee varieties. We genotyped 703 coffee shrubs from unmanaged and managed coffee populations, using 24 microsatellite loci. Additionally, we genotyped 90 individuals representing 23 Ethiopian cultivars resistant to coffee berry disease (CBD). We determined population genetic diversity, genetic structure, and admixture of cultivar alleles in the in situ gene pool. We found strong genetic differentiation between managed and unmanaged coffee populations, but without significant differences in within-population genetic diversity. The widespread planting of coffee seedlings including CBD-resistant cultivars most likely offsets losses of genetic variation attributable to genetic drift and inbreeding. Mixing cultivars with original coffee genotypes, however, leaves ample opportunity for hybridization and replacement of the original coffee gene pool, which already shows signs of admixture. In situ conservation of the wild gene pool of C. arabica must therefore focus on limiting coffee production in the remaining wild populations, as intensification threatens the genetic integrity of the gene pool by exposing wild genotypes to cultivars.

Keywords: admixture, Afromontane rainforest, coffee, crop wild relative, ecosystem services, genetic erosion

Introduction

To improve quality, achieve higher yields, or create pest and disease-resistant or stress-tolerant varieties of crops, plant breeders often utilize crop wild relatives (CWRs) (Hoisington et al. 1999; Heywood et al. 2007; Lashermes et al. 2011). CWRs are progenitors of crops and wild plant taxa that have relatively close genetic relationships to crops, but that are not domesticated themselves (Meilleur and Hodgkin 2010; Maxted et al. 2006). CWRs may possess desirable characteristics that can be used to improve existing crops (Gur and Zamir 2004; Fernie et al. 2006; Maxted et al. 2007; Takeda and Matsuoka 2008). In particular, in the light of global climatic change, sustained agricultural production may increasingly rely on the genetic enhancement of crops using the diverse germplasm of CWRs (Heywood et al. 2007; Tester and Langridge 2010; Foley et al. 2011; Ford-Lloyd et al. 2011), and for that reason, the in situ conservation of the genetic diversity of CWRs is an important but often undervalued challenge (Mercer and Perales 2010; Honnay et al. 2012).

Arabica coffee is one of the world's most valuable agricultural commodities, accounting for two-thirds of the global coffee market (Labouisse et al. 2008). Despite the currently wide geographical range of arabica coffee cultivation, the number of cultivars used is very small: mainly Coffea arabica var. typica, C. arabica var. bourbon and hybrids of the two (Labouisse et al. 2008). The narrow genetic base of those cultivars (Anthony et al. 2002) has resulted in a crop with homogenous agronomic behaviour (Lashermes et al. 2009), but also with a high susceptibility to biotic and climatic hazards (Labouisse et al. 2008; Jaramillo et al. 2011), and a low adaptability in response to environmental changes or changing market demands.

The closest wild relative of cultivated arabica coffee is wild Coffea arabica, which has its origin and centre of diversity in south-western Ethiopia (Anthony et al. 2001, 2002). Wild arabica coffee is a unique potential source of genetic diversity for selection and breeding of enhanced arabica cultivars, including varieties with low caffeine content, increased yields, or increased resistance to pests and pathogens such as coffee berry disease (CBD, caused by Colletotrichum kahawae), coffee rust (caused by Hemileia vastatrix), Meloidogyne root nematodes and the coffee berry borer (Hypothenemus hampei) (Hein and Gatzweiler 2006; Silvestrini et al. 2007; Dessalegn et al. 2008; Boisseau et al. 2009). Despite its importance for the global coffee industry and for the livelihood of rural communities depending on coffee cultivation, the status of the wild gene pool of arabica coffee is largely unknown and potentially threatened (Labouisse et al. 2008), a fate shared with many other CWRs in the world (Heywood et al. 2007).

Two major anthropogenic processes may potentially threaten the diversity and integrity of the gene pool of wild Coffea arabica: (i) the fragmentation and intensive management of the natural Afromontane rainforests, and (ii) the large scale introduction of improved coffee varieties in natural coffee stands. First, as elsewhere in the tropics (Ahrends et al. 2010; DeFries et al. 2010), forest conversion to agriculture and other land uses related to urban population growth have resulted in the fragmentation of the Ethiopian montane forest (Gole et al. 2008). Furthermore, traditional forest coffee production practices in Ethiopia also alter forest structure and plant communities (Schmitt et al. 2009; Aerts et al. 2011). The intensity of management varies between so-called forest coffee (FC) systems, which undergo little or no intervention, and semi-forest coffee (SFC) systems, in which herbs, shrubs (other than coffee) and emerging tree seedlings in the understory are removed annually, the upper canopy is selectively thinned and coffee saplings are locally planted (Senbeta and Denich 2006; Schmitt et al. 2009; Aerts et al. 2011). Both forest fragmentation and forest degradation can have a negative impact on the genetic diversity of forest plant species through increased genetic drift, reduced gene flow, and alteration of mating patterns resulting in increased inbreeding (Young et al. 1996; Honnay et al. 2005; Eckert et al. 2010). Second, the widespread planting, since the 1970s, of a restricted set of locally improved coffee varieties, mainly genotypes resistant to coffee berry disease (CBD), in the forest and its surroundings may result in the replacement of a part of the wild gene pool with a small number of domesticated alleles (Ellstrand et al. 1999; Becker et al. 2006; Hooftman et al. 2007). This can result in loss of genetic variation from the original gene pool and may even have negative fitness consequences for the original populations (Ellstrand 2003).

The general aim of this study was to provide the first thorough assessment of population genetic diversity within the wild gene pool of Coffea arabica in its centre of origin, the south-western Ethiopian montane rainforests. We addressed the following specific questions: (i) Is there genetic erosion of the wild arabica gene pool in fragmented forests managed for coffee production, compared to continuous nonmanaged forests? (ii) Is the introduction of CBD-resistant genotypes posing a threat to the integrity of the wild arabica gene pool? The answers to these questions should help the in situ conservation of arabica coffee genetic resources.

Materials and methods

Study species

Coffea arabica L. (family Rubiaceae) is the only Coffea species occurring in Ethiopia and is geographically isolated from all other species in the genus (Silvestrini et al. 2007). It is a naturally occurring understory shrub of the Afromontane rainforest, a type of moist evergreen montane rainforest found in the south-western highlands between 1500 and 2600 m, with an annual rainfall between 700 and 1500 mm (Friis 1992). The canopy of the Afromontane rainforest typically consists of a mixture of broad-leaved species 10-30 m tall with emergent trees that may reach a height of 30–40 m (Demissew et al. 2004). Wild coffee generally occurs between 1500 and 1900 m, but cultivated plants are found over a much wider range, between 1000 and 2800 m (Hedberg et al. 2003; Gole et al. 2008). Flowering is induced by rains, with a short annual period of synchronous flowering usually in January. The species is self-compatible and mainly insect-pollinated, in Ethiopia typically by bees, which are attracted to the nectar (Fichtl and Admasu 1994). Coffea arabica fruits take about 1 year to reach maturity and are dispersed by birds, bats, monkeys, rodents and humans. Its population density varies by forest management intensity, with on average 3900 individuals (≥0.5 m in height and dbh ≥2 cm) ha−1 in the FC system compared to 18 500 individuals ha−1 in the SFC system (Schmitt et al. 2009). Within the genus, it is the only allotetraploid (2n = 4x = 44), formed by relatively recent natural hybridization between C. canophora and C. eugenioides (Lashermes et al. 1999). The recent origin and self-fertilization of C. arabica probably contribute to its relatively low genetic diversity compared to diploid Coffea species (Lashermes et al. 2000).

Sample collection and DNA extraction

Coffee leaf samples were collected in eleven coffee stands in montane rainforest in the Jimma zone of Oromia region in SW Ethiopia (Table 1, Fig. 1). Six stands were located in the remote Gera sector of the Belete–Gera National Forest Priority Area (NFPA) and were classified as FC. These stands showed no or only few signs of forest management. Five other stands were located in forest fragments (1–20 ha in size), which are managed as SFC since the 1970s and which are located in the coffee-producing agricultural landscape east of the NFPA. All these forest fragments showed clear evidence of tree thinning, understory removal and locally, of coffee planting activities (including CBD-resistant cultivars), as revealed, for instance, by the high density and regular spacing of coffee plants (Aerts et al. 2011).

Table 1.

Location, sample size, molecular variance (MV), expected heterozygosity corrected for sample size HE,C and population mean STRUCTURE cluster membership coefficients Q for eleven Coffea arabica stands and 23 cultivars in SW Ethiopia

Stand Lat (N) Long (E) Elev (m) N MV HE,C Q

I II SE
Forest coffee
Afalo (A10) 7.6307 36.2241 1825 62 23.27 0.509 0.27 0.73 0.042
Afalo (A6) 7.6404 36.2092 1889 73 13.77 0.565 0.56 0.44 0.112
Afalo (A4) 7.6395 36.2067 1987 73 20.90 0.561 0.42 0.58 0.042
Qacha (Q11) 7.7868 36.3238 2108 63 15.82 0.600 0.67 0.33 0.086
Qacha (Q3) 7.7817 36.3313 1920 88 30.41 0.574 0.46 0.54 0.107
Qacha (QY) 7.7865 36.3432 1926 24 21.28 0.623 0.80 0.20 0.043
Semi-forest coffee
Fetche (F1) 7.7144 36.7482 2085 60 25.06 0.552 0.78 0.22 0.003
Fetche (F8) 7.7106 36.7617 1908 61 14.80 0.554 0.81 0.19 0.031
Garuke (G10) 7.7368 36.7420 2025 57 17.63 0.560 0.69 0.31 0.069
Garuke (G11) 7.7373 36.7477 2040 82 23.99 0.547 0.57 0.43 0.138
Garuke (G24) 7.7256 36.7227 2062 60 20.01 0.500 0.62 0.38 0.144
Cultivars 90 23.25 0.621 0.58 0.42 0.053

Figure 1.

Figure 1

Afromontane rainforests in Southwest-Ethiopia and sampled Coffea arabica populations: forest coffee (closed circles) and semi-forest coffee (open circles). Insets show detail of the forest coffee (A) and of the semi-forest coffee landscape (B). Satellite imagery © 2012 DigitalGlobe, GeoEye and Cnes/Spot Image, via Google Earth.

In each stand, we established rectangular plots, containing approximately 65 coffee shrubs. Plots were at least 20 m away from the edge in SFC forest fragments. Because of the difference in coffee density between forest stands, these plots varied in size between 12 and 225 m2. All shrubs within a plot were sampled for young leaf material, totalling 703 samples across 11 plots. In one forest stand (QY, Table 1), coffee density was very low and we sampled only 24 individuals. In two other stands (G11 and Q3), coffee density was higher than estimated when establishing the plots and we sampled 82 and 88 individuals, respectively. The coffee plants within a plot are further referred to as a population. This set was complemented with leaf samples of 90 individuals representing 23 different CBD-resistant varieties (741, 744, 7440, 7454, 7487, 74110, 74112, 74140, 74148, 74158, 74165, 754, 75227, Ababuna, Bunawashi, Dessu, Gawe, Gesha, Melko-CH2, Me'oftu, Merdahereka, Wushwush and Yachi; hereafter called ‘cultivars’), which were locally developed by the Ethiopian Institute of Agricultural Research from genotypes collected throughout the Ethiopian montane rainforests and which have been released between the late 1970s and 1990s.

Leaf material was dried on silica gel. Before DNA extraction, leaves were freeze-dried for 48 h and homogenized with a mill (Mixel Mill MM 200; Retsch®, Haan, Germany). Genomic DNA was extracted from 20 mg homogenized leaf material using the NucleoSpin® Plant II kit (Machery-Nagel, Düren, Germany), with slight modifications of the standard CTAB protocol (we increased the incubation time during cell lysis to 60 min at 65°C and used a two-step elution procedure incubated at 70°C for optimal recovery of bound nucleic acids).

Simple sequence repeats genotyping

Twenty-four simple sequence repeats (SSRs, microsatellites) were amplified in six multiplex PCRs (Table S1) using a GeneAmp® PCR System 9700 thermal cycler (Applied Biosystems®, CA, USA) and a total sample volume of 10 μL containing 5 μL Qiagen® Multiplex PCR Master Mix (Qiagen, Valencia, CA, USA), 2 μL sample/template DNA and 0.2 μL of each primer (reverse and forward, 10 μm) in the multiplex combination complemented with RNase-free Milli-Q water. The multiplexes had equal thermocycling profiles with an initial Taq DNA polymerase heat-activation step at 95°C for 15 min; 25 cycles of 30 s at 94°C (denaturation step), 90 s at 57°C (annealing step) and 60 s at 72°C (extension step); and a final extension step of 30 min at 60°C. Then, 1 μL of the PCR was added to a solution of 8.8 μL formamide and 0.2 μL of the Applied Biosystems GeneScan™ 500 LIZ® size standard. Sized fragments were scored using GeneMapper® v4.0 (Applied Biosystems).

Genetic data analysis

The allotetraploid nature of the Coffea arabica genome limits the flexibility of the data analysis. We adopted two parallel approaches, one based on the codominantly scored data and allowing allele copy number ambiguity, and a second based on the scoring of each individual allele as present or absent, resulting in a dominantly scored data set comparable to the output of an amplified fragment length polymorphism (AFLP) marker approach. We used the R package polysat (Clark and Jasieniuk 2011) as a central data handling facility, that is, for importing the SSR data from the GeneMapper® software and converting the data. To assess the resolution of the microsatellite marker set we discriminated distinct multilocus genotypes (MLGs).

Genetic diversity and population differentiation

Population genetic diversity was quantified using the expected heterozygosity corrected for sample size (HE,C), and the molecular variance (MV) based on the within-population sum of squares (SSWP) and calculated as SSWP×(n − 1)−1. Among-population genetic differentiation (ΦPT) was calculated based on Euclidian genetic distances (Huff et al. 1993). HE,C is based on the tetraploid data set and was calculated in ATETRA 1.2.a (Van Puyvelde et al. 2010), whereas MV and ΦPT resulted from a hierarchical analysis of molecular variance (amova) approach on the dominantly scored data set as performed in GenAlEx 6.41 (Peakall and Smouse 2006). For the amova, we used the coffee production system (FC vs SFC) as the regional grouping variable. Genetic differentiation was further assessed using principal coordinates analyses (PCoA) calculated in GenAlEx on the pairwise ΦPT matrix. Effects of coffee forest management intensity (FC vs SFC) on population genetic diversity were analysed using Wilcoxon–Mann–Whitney U-tests. To test whether the mean genetic diversities recorded in FC and SFC populations differed from the diversity in the cultivar population, we used one-sample t-tests. Finally, also the average pairwise genetic differentiation (ΦPT) between populations in FC stands was compared with the differentiation between populations in SFC stands, using a Wilcoxon–Mann–Whitney U test. Statistical analyses were performed in SPSS 15.0 (SPSS Inc., Chicago, IL, USA).

Bayesian analysis of population structure

To investigate the presence of alleles from the cultivar gene pool across the coffee populations, genetic structure was assessed using Bayesian clustering analysis implemented in structure 2.3.3 (Pritchard et al. 2000, 2010; Falush et al. 2007). structure was run five times at K = 1–9 applying 10 000 burn-in cycles and 50 000 Markov Chain Monte Carlo (MCMC) iterations. Correlated allele frequencies and admixture were assumed. The degree of admixture was inferred from the data using an initial value of α = 1.0 and a maximum of 10. The value of K that best fitted our data was selected using the estimated log probability of data Pr(X|K) and the derived ΔK statistic (Evanno et al. 2005). We used clumpp 1.1.2 (Jakobsson and Rosenberg 2007) to match the five structure solutions and calculate average ancestry estimates, given as estimated membership coefficients Q for each individual and each population, in each of the K clusters.

Analysis of genome-wide admixture

We assessed genome-wide admixture of alleles from the cultivar gene pool into the FC and SFC populations using a hybrid index or admixture coefficient (Gompert and Buerkle 2009), as calculated in the R package Introgress (Gompert and Buerkle 2010). The SSR data of the populations A10 and Q3 and of the cultivar population were used as wild and cultivar parental data, respectively, and the SSR data of the remaining FC and all SFC populations were entered as potentially admixed individuals. For this analysis, A10 and Q3 were selected as pure wild populations, because these were the only populations where the owners guaranteed that they had not planted any cultivars, and thus, where the introduction of coffee plants had not taken place. The est.h function of Introgress was applied to calculate a maximum likelihood hybrid index estimate h for each potentially admixed individual (Gompert and Buerkle 2010). We compared hybrid index means of FC and SFC using the independent-samples t-test.

Results

Genetic diversity and differentiation

The 24 SSRs in the six multiplex combinations yielded a total of 159 alleles. The number of alleles ranged from 2 to 19 per locus. Only eleven individuals were assigned to four nonunique MLGs across populations. The expected heterozygosity corrected for sample size (HE,C) and the molecular variance (MV) did not differ significantly between FC (HE,C = 0.571, SE 0.016; MV = 20.91, SE 2.40) and SFC populations (HE,C = 0.552, SE 0.014; MV = 20.30, SE 1.92) (HE,C: U = 10, P = 0.429; MV: U = 15, P = 1.000). The sample of CBD-resistant varieties was genetically more diverse (HE,C = 0.621; MV = 23.25) than both the FC and SFC populations, but this difference was only significant for HE,C (t10 = −5.90, P < 0.001). Overall among-population genetic differentiation was high (ΦPT = 0.186, P < 0.001), with a genetic differentiation of 0.033 (ΦRT) between production systems (Table 2). Genetic differentiation among populations was significantly higher for the SFC (ΦPT = 0.176, SE 0.018) than for the FC (ΦPT = 0.131, SE 0.014) populations (U = 38, P = 0.040). Populations from FC and SFC clustered at opposite ends in the PCoA with some overlap among clusters (e.g. F1, Q3) (Fig. 2).

Table 2.

Hierarchical analysis of molecular variance of 159 alleles at 24 microsatellite loci for 703 Coffea arabica individuals distributed in 11 stands and two coffee production systems, forest coffee (FC) and semi-forest coffee (SFC), in SW Ethiopia

Source df SS MS EV %MV Φ-statistic P
Among systems (FC-SFC) 1 586.0 586.0 0.85 3.27 ΦRT 0.033 0.001
Among populations (stands) 9 2452.7 272.5 3.97 15.36 ΦPR 0.159 0.001
Within populations (stands) 696 14623.5 21.0 21.01 81.37 ΦPT 0.186 0.001
Total 706 17662.2 25.82

For each source of variation the following is given: the number of degrees of freedom (df), the sum of squared difference to the mean (SS) and the mean sum of squares (MS), the estimated variance (EV), the percentage of total molecular variance (%MV), the Φ-statistic and the associated probability.

Figure 2.

Figure 2

Principal coordinates (PCo) plot based on ΦPT calculated with 24 SSR markers for Coffea arabica, demonstrating population genetic differentiation between forest coffee (closed circles) and semi-forest coffee systems (open circles).

Genetic structure and admixture

The log probability of data increased with increasing K but was less pronounced when K > 2. This, together with the fact that the ΔK statistic reached its maximum at K = 2 (Fig. S1), suggested the existence of two clusters. The genotypes assigned to cluster I were by far the most widespread and dominated five SFC populations and two FC populations. Genotypes assigned to cluster II were dominant in the remote FC population A10. One SFC population (G11), three FC populations (A6, A4, Q3) and the cultivars had comparable proportions of individuals assigned to either cluster I or II, as well as individuals with an admixed genotype, that is, assigned to both genetic clusters with comparable probabilities (Table 1; Fig. 3).

Figure 3.

Figure 3

Population structure of Coffea arabica based on structure analysis of 24 SSR markers for forest coffee (N = 383, six populations), semi-forest coffee (N = 320, five populations) and cultivar (N = 90) samples for K = 2 clusters (Fig. S1). Individuals are represented by columns, with colours showing the average proportion (R = 5 runs) of their genome assigned to the different clusters, demonstrating the prevalence of a genotype associated to the coffee berry disease (CBD)-resistant gene pool in the semi-forest coffee (SFC) populations.

The admixture coefficients or hybrid indices varied between 0 (pure ‘wild’) and 1 (pure ‘cultivar’). In the SFC populations, hybrid indices were predominantly high, with h > 0.50 for 86.3% of the samples. In the FC populations, hybrid indices were low, with h < 0.50 for 82.4% of the samples (Fig. 4). The mean hybrid index was significantly higher in the SFC (mean hSFC = 0.74, SE 0.012; hFC = 0.30; SE 0.015) (t551 = 22.4, P < 0.001). These results indicate that alleles from the cultivar gene pool are more prevalent in the SFC than in the FC populations, and that most individuals in the SFC populations have more alleles from the cultivar gene pool than alleles from the wild gene pool in their genome.

Figure 4.

Figure 4

Maximum likelihood hybrid index estimates h for 320 Coffea arabica individuals from five semi-forest coffee populations and for 233 Coffea arabica individuals from four forest coffee populations in SW Ethiopia. The h index or admixture coefficient, based on frequencies for 159 alleles, a parental population of N = 150 wild individuals (from populations A10 and Q3) and a parental population of N = 90 specimens from 23 coffee berry disease (CBD)-resistant varieties, gives the fraction of the genome shared with the cultivated varieties for each individual.

Discussion

Genetic diversity and cryptic genetic erosion

Although SFC management is associated with major changes in forest structure and shrub and canopy species composition (Schmitt et al. 2009; Aerts et al. 2011), it had no negative impact on coffee genetic diversity within populations (HE,C, MV). This was surprising because negative effects on effective population size and genetic diversity of C. arabica populations could have been expected. Changes in forest microclimate and in availability of nesting sites for insect pollinators, for instance, may have negatively affected pollinator diversity (Klein et al. 2008), reducing cross-pollination (Klein et al. 2003), increasing selfing and inbreeding (Eckert et al. 2010), and reducing progeny vigour and effective population size (Honnay and Jacquemyn 2007). As SFC systems are already managed as such for some generations (∼40 years), it is unlikely that expected genetic changes still have to become apparent. More plausible, and in agreement with our other results, is that farmers have been introducing new genotypes from elsewhere, both CBD-resistant genotypes provided by local authorities and wild genotypes from neighbouring sites, compensating for the eventual loss of genetic variation through genetic drift and inbreeding.

The introduction of genetically diverse coffee genotypes from elsewhere into SFC populations by coffee farmers most likely represents a farmer-mediated evolutionary force. This is supported by the higher genetic differentiation (ΦPT) among SFC populations than among FC populations. The conspicuous presence of alleles from the pool of introduced CBD-resistant genotypes in almost all SFC populations (high h, Fig. 4) also suggests that genetic diversity in the original coffee populations is being displaced, which is a form of cryptic genetic erosion (see Fig. S2). The local introduction of CBD-resistant genotypes in the SFC area since the 1970s probably facilitated later hybridization through gene flow by pollen (Papa and Gepts 2003). We can, however, not exclude that regional differences contributed to the higher genetic differentiation within the SFC region but we expect that their effect is limited. First, given that FC populations have been sampled over a much larger area (25 × 25 km2, Fig. 1) than the SFC populations (5 × 5 km2), this higher genetic differentiation is unlikely to be caused through isolation-by-distance or by greater environmental variation among SFC populations. Second, gene flow through pollen or seed between SFC populations may have been impeded due to the more fragmented nature of the agricultural landscape near Jimma town, increasing the among-population genetic differentiation. If the latter was the case, however, we expect genetic differentiation to be paralleled by loss of genetic diversity through genetic drift, which was not observed in the SFC populations.

Introduction of alleles from CBD-resistant varieties

Gene flow from domesticated crops to wild relatives poses an important potential threat to CWRs, as repeated occurrences of hybridization may lead to the loss of the genetic integrity of the wild species (introgression), which becomes assimilated into the cultivar. This process has already increased the risk of extinction of the wild relatives of two of the world's 13 most important crops, rice (Ozyra sativa) and cotton (Gossypium hirsutum) (Ellstrand et al. 1999). Evidence of introgression from modern hybrid crop varieties into wild populations and populations of locally domesticated landraces is rapidly emerging (e.g. Sørensen et al. 2007; Bitocchi et al. 2009; Arnaud et al. 2010; Arrigo et al. 2011; Kwit et al. 2011), warning ecologists and plant breeders about the latent extinction of wild relative populations. The high genetic variation within the group of introduced CBD-resistant genotypes in this study (Table 1) strongly suggests that the improved varieties that have been released in Ethiopia did not yet undergo an extreme process of breeding and selection, and that the process of domestication so far was focussed on the capture and multiplication of genotypes with desirable traits. So far, coffee breeding in Ethiopia has been less intensive than other crops such as maize, which underwent a rigorous process of breeding and selection, yielding elite breeding pools with very little of the genetic diversity found in the maize wild relatives (Ortiz et al. 2010). In general, genetic domestication bottlenecks seem to be more limited in perennial fruit crops than in annual crops (Miller and Gross 2011), although commercial coffee cultivars grown in central America or Asia show important losses of genetic diversity (Lashermes et al. 1996; López-Gartner et al. 2009).

Our results show the striking presence of alleles from the CBD-resistant gene pool in all SFC populations, and to a much lesser extent in some FC populations. This suggests the possibility that introgression from recently introduced cultivars may be common in wild populations. Although the process of introgression may be slower in long-lived or clonal plants, such as coffee, postzygotic barriers to hybridization may be weaker than in domesticated annuals such as cereals, where hybrid seedlings have been shown to be maladapted to wild environments (McKey et al. ). The CBD-resistant cultivars were released starting in the 1970s, providing ample opportunities for hybridization, given that the generation time for coffee is ∼3 years. Hybridization between cultivars and wild individuals has been reported in a number of long-lived perennials, such as grapevine (Vitis vinifera, De Andrés et al. 2012), wild almond (Prunus orientalis, Delplancke et al. 2012) and wild apple (Malus sylvestris, Coart et al. 2003). However, we cannot rule out that the presence of these alleles in wild populations may simply reflect the shared ancestry between introduced and wild genotypes. CBD-resistant genotypes are derived from FC trees from different parts of Ethiopia, and at least some of the alleles shared between the cultivar gene pool and the SFC are expected to be identical by descent. Further research should focus in depth on gene flow between wild coffee and introduced cultivars, for example through parentage analysis (De Andrés et al. 2012).

Conclusions

Our results clearly show that SFC populations are genetically more similar to the pool of introduced CBD-resistant genotypes than FC populations, but that SFC populations are not less diverse than FC populations. These patterns can be explained by the anthropogenic introduction of genotypes (both wild genotypes and CBD-resistant cultivars) in SFC populations. Although we cannot provide direct evidence for hybridization and introgression, the practice of large scale planting of CBD-resistant genotypes by local farmers in SFC systems, intimately mixed with original coffee genotypes, offers ample opportunity for the exchange of alleles and the replacement of the original coffee gene pool.

Only the few remaining large forests with a FC cultivation system seem to be safe from the introduction of CBD-resistant cultivars so far. To ensure the in situ conservation of arabica coffee genetic resources in Ethiopia, we recommend to: (i) avoid establishing plantations with foreign coffee cultivars in the centre of origin of C. arabica; (ii) allow only the use of Ethiopian cultivars that did not undergo a very strict process of selection in the SFC systems; and (iii) protect a sufficiently large area of low-intensity FC systems. The latter can probably only be realized by establishing buffer zones of SFC surrounding more strict reserves of FC in the last remaining large forest blocks in the region.

Data archiving statement

Data for this study are available at Dryad. DOI: 10.5061/dryad.37r9p.

Acknowledgments

This research was carried out within the framework of the Institutional University Collaboration partnership between Jimma University and universities in Flanders (JU-IUC), which is funded by the University Development Cooperation of the Flemish Interuniversity Council (VLIR-UOS). R.A. and K.V. held postdoctoral fellowships of the Research Foundation – Flanders (FWO), G.B. and K.H. doctoral grants of JU-IUC and P.G. a research grant of IRO. We gratefully credit the farmer–owners of the coffee shrubs for their permission and guidance, and Alex Fontaine, Israel Delelegn, Abamecha Abadulesa, Tijani Ibrahim and Sabit Abadiga for their assistance during data collection. The useful suggestions of Hans Jacquemyn, associate editor Ingrid M. Parker and two anonymous reviewers are highly appreciated.

Supporting Information

Additional Supporting Information may be found in the online version of this article:

Figure S1. ΔK statistic for detection of the true number of groups K* or the value of K that best fit the data.

Figure S2. Relative frequency distribution of alleles for Coffea arabica microsattelite EU597612 (samples with missing values for this SSR were omitted), illustrating cryptic genetic erosion in the in situ arabica gene pool. The same four alleles are present in forest coffee (FC) and semi-forest coffee (SFC), but within the SFC two of these four alleles are rare. The φ-statistic and probability P show a significant association of two alleles to FC and one to SFC.

Table S1. Multiplex panels used for SSR genotyping of Coffea arabica.

eva0006-0243-SD1.doc (241KB, doc)

Please note: Wiley-Blackwell are not responsible for the content or functionality of any supporting materials supplied by the authors. Any queries (other than missing material) should be directed to the corresponding author for the article.

Literature cited

  1. Aerts R, Hundera K, Berecha G, Gijbels P, Baeten M, Van Mechelen M, Hermy M, et al. Semi-forest coffee cultivation and the conservation of Ethiopian Afromontane rainforest fragments. Forest Ecology and Management. 2011;261:1034–1041. [Google Scholar]
  2. Ahrends A, Burgess ND, Milledge SAH, Bulling MT, Fisher B, Smart JCR, Clarke GP, et al. Predictable waves of sequential forest degradation and biodiversity loss spreading from an African city. Proceedings of the National Academy of Sciences of the United States of America. 2010;107:14556–14561. doi: 10.1073/pnas.0914471107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Anthony F, Bertrand B, Quiros O, Wilches A, Lashermes P, Berthaud J, Charrier A. Genetic diversity of wild coffee (Coffea arabica L.) using molecular markers. Euphytica. 2001;118:53–65. [Google Scholar]
  4. Anthony F, Combes MC, Astorga C, Bertrand B, Graziosi G, Lashermes P. The origin of cultivated Coffea arabica L. varieties revealed by AFLP and SSR markers. Theoretical and Applied Genetics. 2002;104:894–900. doi: 10.1007/s00122-001-0798-8. [DOI] [PubMed] [Google Scholar]
  5. Arnaud J-F, Fénart S, Cordellier M, Cuguen J. Populations of weedy crop–wild hybrid beets show contrasting variation in mating system and population genetic structure. Evolutionary Applications. 2010;3:305–318. doi: 10.1111/j.1752-4571.2010.00121.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Arrigo N, Guadagnuolo R, Lappe S, Pasche S, Parisod C, Felber F. Gene flow between wheat and wild relatives: empirical evidence from Aegilops geniculata Ae. neglecta and Ae. triuncialis. Evolutionary Applications. 2011;4:685–695. doi: 10.1111/j.1752-4571.2011.00191.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Becker U, Reinhold T, Matthies D. Effects of pollination distance on reproduction and offspring performance in Hypochoeris radicata: experiments with plants from three European regions. Biological Conservation. 2006;132:109–118. [Google Scholar]
  8. Bitocchi E, Nanni L, Rossi M, Rau D, Bellucci E, Giardini A, Buonamici A, et al. Introgression from modern hybrid varieties into landrace populations of maize (Zea mays ssp mays L.) in central Italy. Molecular Ecology. 2009;18:603–621. doi: 10.1111/j.1365-294X.2008.04064.x. [DOI] [PubMed] [Google Scholar]
  9. Boisseau M, Aribi J, Carneiro FR, de Sousa Rmdg, Anthony F. Resistance to Meloidogyne paranaensis in wild Coffea arabica. Tropical Plant Pathology. 2009;34:38–41. [Google Scholar]
  10. Clark LV, Jasieniuk M. POLYSAT: an R package for polyploid microsatellite analysis. Molecular Ecology Resources. 2011;11:562–566. doi: 10.1111/j.1755-0998.2011.02985.x. [DOI] [PubMed] [Google Scholar]
  11. Coart E, Vekemans X, Smulders MJM, Wagner I, Van Huylenbroeck J, Van Bockstaele E, Roldán-Ruiz I. Genetic variation in the endangered wild apple (Malus sylvestris (L.) Mill.) in Belgium as revealed by amplified fragment length polymorphism and microsatellite markers. Molecular Ecology. 2003;12:845–857. doi: 10.1046/j.1365-294x.2003.01778.x. [DOI] [PubMed] [Google Scholar]
  12. De Andrés MT, Benito A, Pérez-Rivera G, Ocete R, Lopez MA, Gaforio L, Muñoz G, et al. Genetic diversity of wild grapevine populations in Spain and their genetic relationships with cultivated grapevines. Molecular Ecology. 2012;21:800–816. doi: 10.1111/j.1365-294X.2011.05395.x. [DOI] [PubMed] [Google Scholar]
  13. DeFries RS, Rudel T, Uriarte M, Hansen M. Deforestation driven by urban population growth and agricultural trade in the twenty-first century. Nature Geoscience. 2010;3:178–181. [Google Scholar]
  14. Delplancke M, Alvarez N, Espíndola A, Joly H, Benoit L, Brouck E, Arrigo N. Gene flow among wild and domesticated almond species: insights from chloroplast and nuclear markers. Evolutionary Applications. 2012;5:317–329. doi: 10.1111/j.1752-4571.2011.00223.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Demissew S, Cribb P, Rasmussen F. Field guide to Ethiopian orchids, Kew Field Guide. Kew: Royal Botanic Gardens; 2004. [Google Scholar]
  16. Dessalegn Y, Labuschagne MT, Osthoff G, Herselman L. Genetic diversity and correlation of bean caffeine content with cup quality and green bean physical characteristics in coffee (Coffea arabica L.) Journal of the Science of Food and Agriculture. 2008;88:1726–1730. [Google Scholar]
  17. Eckert CG, Kalisz S, Geber MA, Sargent R, Elle E, Cheptou PO, Goodwillie C, et al. Plant mating systems in a changing world. Trends in Ecology and Evolution. 2010;25:35–43. doi: 10.1016/j.tree.2009.06.013. [DOI] [PubMed] [Google Scholar]
  18. Ellstrand NC. Dangerous liaisons? When cultivated plants mate with their wild relatives. Baltimore: John Hopkins University Press; 2003. [Google Scholar]
  19. Ellstrand NC, Prentice HC, Hancock JF. Gene flow and introgression from domesticated plants into their wild relatives. Annual Review of Ecology and Systematics. 1999;30:539–563. [Google Scholar]
  20. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Molecular Ecology. 2005;14:2611–2620. doi: 10.1111/j.1365-294X.2005.02553.x. [DOI] [PubMed] [Google Scholar]
  21. Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: dominant markers and null alleles. Molecular Ecology Notes. 2007;7:574–578. doi: 10.1111/j.1471-8286.2007.01758.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Fernie AR, Tadmor Y, Zamir D. Natural genetic variation for improving crop quality. Current Opinion in Plant Biology. 2006;9:196–202. doi: 10.1016/j.pbi.2006.01.010. [DOI] [PubMed] [Google Scholar]
  23. Fichtl R, Admasu A. Honeybee flora of Ethiopia. Germany: The National Herbarium, Addis Ababa University, Addis Ababa and Deutscher Entwicklungsdienst (DED) Margraf Verlag, Weichersheim; 1994. [Google Scholar]
  24. Foley JA, Ramankutty N, Brauman KA, Cassidy ES, Gerber JS, Johnston M, Mueller ND, et al. Solutions for a cultivated planet. Nature. 2011;478:337–342. doi: 10.1038/nature10452. [DOI] [PubMed] [Google Scholar]
  25. Ford-Lloyd BV, Schmidt M, Armstrong SJ, Barazani O, Engels J, Hadas R, Hammer K, et al. Crop wild relatives-undervalued, underutilized and under threat? BioScience. 2011;61:559–565. [Google Scholar]
  26. Friis I. Forests and forest trees of northeast tropical Africa—their natural habitats and distribution patterns in Ethiopia, Djibouti and Somalia. Kew Bulletin Additional Series. 1992;15:1–396. [Google Scholar]
  27. Gole TW, Borsch T, Denich M, Teketay D. Floristic composition and environmental factors characterizing coffee forests in southwest Ethiopia. Forest Ecology and Management. 2008;255:2138–2150. [Google Scholar]
  28. Gompert Z, Buerkle CA. A powerful regression-based method for admixture mapping of isolation across the genome of hybrids. Molecular Ecology. 2009;18:1207–1224. doi: 10.1111/j.1365-294X.2009.04098.x. [DOI] [PubMed] [Google Scholar]
  29. Gompert Z, Buerkle CA. INTROGRESS: a software package for mapping components of isolation in hybrids. Molecular Ecology Resources. 2010;10:378–384. doi: 10.1111/j.1755-0998.2009.02733.x. [DOI] [PubMed] [Google Scholar]
  30. Gur A, Zamir D. Unused natural variation can lift yield barriers in plant breeding. PLoS Biology. 2004;2:1610–1615. doi: 10.1371/journal.pbio.0020245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Hedberg I, Edwards S, Nemomissa S. Flora of Ethiopia and Eritrea. Uppsala: Apiaceae to Dipsacaceae. The National Herbarium, Addis Ababa University, Addis Ababa and The Department of Systematic Botany, Uppsala University; 2003. Volume 4, part 1. [Google Scholar]
  32. Hein L, Gatzweiler F. The economic value of coffee (Coffea arabica) genetic resources. Ecological Economics. 2006;60:176–185. [Google Scholar]
  33. Heywood V, Casas A, Ford-Lloyd B, Kell S, Maxted N. Conservation and sustainable use of crop wild relatives. Agriculture Ecosystems and Environment. 2007;121:245–255. [Google Scholar]
  34. Hoisington D, Khairallah M, Reeves T, Ribaut JV, Skovmand B, Taba S, Warburton M. Plant genetic resources: what can they contribute toward increased crop productivity? Proceedings of the National Academy of Sciences of the United States of America. 1999;96:5937–5943. doi: 10.1073/pnas.96.11.5937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Honnay O, Jacquemyn H. Susceptibility of common and rare plant species to the genetic consequences of habitat fragmentation. Conservation Biology. 2007;21:823–831. doi: 10.1111/j.1523-1739.2006.00646.x. [DOI] [PubMed] [Google Scholar]
  36. Honnay O, Jacquemyn H, Bossuyt B, Hermy M. Forest fragmentation effects on patch occupancy and population viability of herbaceous plant species. New Phytologist. 2005;166:723–736. doi: 10.1111/j.1469-8137.2005.01352.x. [DOI] [PubMed] [Google Scholar]
  37. Honnay O, Jacquemyn H, Aerts R. Crop wild relatives: more common ground for breeders and ecologists. Frontiers in Ecology and the Environment. 2012;10:121. [Google Scholar]
  38. Hooftman DAP, Jong MJD, Oostermeijer JGB, Den Nijs HJCM. Modelling the long-term consequences of crop-wild relative hybridization: a case study using four generations of hybrids. Journal of Applied Ecology. 2007;44:1035–1045. [Google Scholar]
  39. Huff DR, Peakall R, Smouse PE. RAPD variation within and among populations of outcrossing buffalograss (Buchloe dactyloides (Nutt.) Engelm) Theoretical and Applied Genetics. 1993;86:927–934. doi: 10.1007/BF00211043. [DOI] [PubMed] [Google Scholar]
  40. Jakobsson M, Rosenberg NA. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics. 2007;23:1801–1806. doi: 10.1093/bioinformatics/btm233. [DOI] [PubMed] [Google Scholar]
  41. Jaramillo J, Muchugu E, Vega FE, Davis A, Borgemeister C, Chabi-Olaye A. Some like it hot: the influence and implications of climate change on coffee berry borer (Hypothenemus hampei) and coffee production in East Africa. PLoS ONE. 2011;6(9):0000. doi: 10.1371/journal.pone.0024528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Klein AM, Steffan-Dewenter I, Tscharntke T. Bee pollination and fruit set of Coffea arabica and C. canephora (Rubiaceae) American Journal of Botany. 2003;90:153–157. doi: 10.3732/ajb.90.1.153. [DOI] [PubMed] [Google Scholar]
  43. Klein AM, Cunningham SA, Bos M, Steffan-Dewenter I. Advances in pollination ecology from tropical plantation crops. Ecology. 2008;89:935–943. doi: 10.1890/07-0088.1. [DOI] [PubMed] [Google Scholar]
  44. Kwit C, Moon HS, Warwick SI, Stewart CN. Transgene introgression in crop relatives: molecular evidence and mitigation strategies. Trends in Biotechnology. 2011;29:284–293. doi: 10.1016/j.tibtech.2011.02.003. [DOI] [PubMed] [Google Scholar]
  45. Labouisse JP, Bellachew B, Kotecha S, Bertrand B. Current status of coffee (Coffea arabica L.) genetic resources in Ethiopia: implications for conservation. Genetic Resources and Crop Evolution. 2008;55:1079–1093. [Google Scholar]
  46. Lashermes P, Trouslot P, Anthony F, Combes MC, Charrier A. Genetic diversity for RAPD markers between cultivated and wild accessions of Coffea arabica. Euphytica. 1996;87:59–64. [Google Scholar]
  47. Lashermes P, Combes MC, Robert J, Trouslot P, D'Hont A, Anthony F, Charrier A. Molecular characterisation and origin of the Coffea arabica L. genome. Molecular and General Genetics. 1999;261:259–266. doi: 10.1007/s004380050965. [DOI] [PubMed] [Google Scholar]
  48. Lashermes P, Andrzejewski S, Bertrand B, Combes MC, Dussert S, Graziosi G, Trouslot P, et al. Molecular analysis of introgressive breeding in coffee (Coffea arabica L.) Theoretical and Applied Genetics. 2000;100:139–146. [Google Scholar]
  49. Lashermes P, Benoît B, Hervé E. Breeding coffee (Coffea arabica) for sustainable production. In: Jain M, Priyadarshan PM, editors. Breeding Plantation Tree Crops: Tropical Species. New York: Springer; 2009. pp. 525–544. [Google Scholar]
  50. Lashermes P, Combes MC, Ansaldi C, Gichuru E, Noir S. Analysis of alien introgression in coffee tree (Coffea arabica L.) Molecular Breeding. 2011;27:223–232. [Google Scholar]
  51. López-Gartner G, Cortina H, McCouch SR, Moncada MD. Analysis of genetic structure in a sample of coffee (Coffea arabica L.) using fluorescent SSR markers. Tree Genetics and Genomes. 2009;5:435–446. [Google Scholar]
  52. Maxted N, Ford-Lloyd BV, Jury S, Kell S, Scholten M. Towards a definition of a crop wild relative. Biodiversity and Conservation. 2006;15:2673–2685. [Google Scholar]
  53. Maxted N, Scholten M, Codd R, Ford-Lloyd B. Creation and use of a national inventory of crop wild relatives. Biological Conservation. 2007;140:142–159. [Google Scholar]
  54. McKey D, Elias M, Pujol B, Duputi A. The evolutionary ecology of clonally propagated domesticated plants. New Phytologist. 2010;186:318–332. doi: 10.1111/j.1469-8137.2010.03210.x. [DOI] [PubMed] [Google Scholar]
  55. Meilleur BA, Hodgkin T. In situ conservation of crop wild relatives: status and trends. Biodiversity and Conservation. 2004;13:663–684. [Google Scholar]
  56. Mercer KL, Perales HR. Evolutionary response of landraces to climate change in centers of crop diversity. Evolutionary Applications. 2010;3:480–493. doi: 10.1111/j.1752-4571.2010.00137.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Miller AJ, Gross BL. From forest to field: perennial fruit crop domestication. American Journal of Botany. 2011;98:1389–1414. doi: 10.3732/ajb.1000522. [DOI] [PubMed] [Google Scholar]
  58. Ortiz R, Taba S, Chavez VH, Mezzalama M, Xu YB, Yan JB, Crouch JH. Conserving and enhancing maize genetic resources as global public goods-a perspective from CIMMYT. Crop Science. 2010;50:13–28. [Google Scholar]
  59. Papa R, Gepts P. Asymmetry of gene flow and differential geographical structure of molecular diversity in wild and domesticated common bean (Phaseolus vulgaris L.) from Mesoamerica. Theoretical and Applied Genetics. 2003;106:239–250. doi: 10.1007/s00122-002-1085-z. [DOI] [PubMed] [Google Scholar]
  60. Peakall R, Smouse PE. GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Molecular Ecology Notes. 2006;6:288–295. doi: 10.1093/bioinformatics/bts460. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945–959. doi: 10.1093/genetics/155.2.945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Pritchard JK, Wen Xiaoquan, Falush D. 2010. Documentation for structure software: Version 2.3. Available from http://pritch.bsd.uchicago.edu/structure.html.
  63. Schmitt CB, Senbeta F, Denich M, Preisinger H, Boehmer HJ. Wild coffee management and plant diversity in the montane rainforest of southwestern Ethiopia. African Journal of Ecology. 2009;48:78–86. [Google Scholar]
  64. Senbeta F, Denich M. Effects of wild coffee management on species diversity in the Afromontane rainforests of Ethiopia. Forest Ecology and Management. 2006;232:68–74. [Google Scholar]
  65. Silvestrini M, Junqueira MG, Favarin AC, Guerreiro O, Maluf MP, Silvarolla MB, Colombo CA. Genetic diversity and structure of Ethiopian, Yemen and Brazilian Coffea arabica L. accessions using microsatellites markers. Genetic Resources and Crop Evolution. 2007;54:1367–1379. [Google Scholar]
  66. Sørensen BS, Kiær LP, Jørgensen RB, Hauser TP. The temporal development in a hybridizing population of wild and cultivated chicory (Cichorium intybus L.) Molecular Ecology. 2007;16:3292–3298. doi: 10.1111/j.1365-294X.2007.03346.x. [DOI] [PubMed] [Google Scholar]
  67. Takeda S, Matsuoka M. Genetic approaches to crop improvement: responding to environmental and population changes. Nature Reviews Genetics. 2008;9:444–457. doi: 10.1038/nrg2342. [DOI] [PubMed] [Google Scholar]
  68. Tester M, Langridge P. Breeding technologies to increase crop production in a changing world. Science. 2010;327:818–822. doi: 10.1126/science.1183700. [DOI] [PubMed] [Google Scholar]
  69. Van Puyvelde K, Van Geert A, Triest L. ATETRA, a new software program to analyse tetraploid microsatellite data: comparison with TETRA and TETRASAT. Molecular Ecology Resources. 2010;10:331–334. doi: 10.1111/j.1755-0998.2009.02748.x. [DOI] [PubMed] [Google Scholar]
  70. Young A, Boyle T, Brown T. The population genetic consequences of habitat fragmentation for plants. Trends in Ecology and Evolution. 1996;11:413–418. doi: 10.1016/0169-5347(96)10045-8. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

eva0006-0243-SD1.doc (241KB, doc)

Articles from Evolutionary Applications are provided here courtesy of Wiley

RESOURCES