Skip to main content
3 Biotech logoLink to 3 Biotech
. 2020 Nov 21;10(12):543. doi: 10.1007/s13205-020-02513-w

Comparative analysis of genomic- and EST-SSRs in European plum (Prunus domestica L.): implications for the diversity analysis of polyploids

Rosanna Manco 1, Pasquale Chiaiese 1, Boris Basile 1, Giandomenico Corrado 1,
PMCID: PMC7679426  PMID: 33235823

Abstract

Simple sequence repeats (SSRs) are among the most useful DNA markers in plant science. The aim of this study was to compare the features and usefulness of genomic SSR (gSSR) and EST-SSRs in European plum (Prunus domestica L.), an economically important, hexaploid stone fruit crop globally cultivated to produce fleshy fruits and derived foodstuff. The analysis of an ample set of morphologically diverse varieties indicated that gSSRs and EST-SSRs provide different estimates of some of the locus-based indicators of diversity. Moreover, the two classes of SSRs gave different, weakly correlated, estimations of distance-based parameters with gSSRs being more powerful for discriminating purposes. The two SSR classes provide complementary information in European plum, making the contribution of EST-SSRs useful not only as non-neutral markers. The differences between SSR classes are discussed considering the neutral and non-neutral evolution, and the polyploidy and asexual propagation of the cultivated tree varieties.

Electronic supplementary material

The online version of this article (10.1007/s13205-020-02513-w) contains supplementary material, which is available to authorized users.

Keywords: Stone fruit, Polyploidy, Polymorphism, Diversity, Microsatellites

Introduction

European plum (Prunus domestica L.) is an economically important stone fruit crop, globally cultivated in temperate areas for its fleshy fruits. These are mainly marketed fresh, canned or dried (Neumüller 2011). Further uses include the production of juices, fruit brandy, and flavours for jams, candies, sweets, and other baked foods. Commercially available European plum varieties present a range of phenotypic traits (e.g., fruits shape and size; skin and flesh colour; firmness; taste) that originated several classifications. Cultivated varieties are typically classified in different pomological groups by breeders and retailers, although the degree of overlap of morphological traits and the complex interspecific origin of the species do not always allow a clear distinction among all the different groups (Zhebentyayeva et al. 2019).

Considerable progress in describing and classifying the ample diversity of the European plum has been achieved with the introduction of DNA molecular markers (Decroocq et al. 2004; Dirlewanger et al. 2002; Li et al. 2010; Shimada et al. 1999). These analyses also confirmed that the European plum clade of P. domestica has a high level of diversity (Zhebentyayeva et al. 2019). Moreover, DNA markers have been used in plum for different purposes, such as map-based cloning (Claverie et al. 2004), population structure analysis (Horvath et al. 2011), phylogenetic relationships (Liu et al. 2007; Reales et al. 2010), landraces examination (Manco et al. 2019; Sehic et al. 2015), and discrimination of clones (Gharbi et al. 2014).

Microsatellites, also known in plant genetics as simple sequence repeats (SSRs), are one of the most suitable markers for Prunus diversity (Aranzana et al. 2019), being multiallelic and highly polymorphic. The diffusion of the SSRs in genetics is also due to their good transferability across similar species (Mnejja et al. 2010). SSRs are DNA sequences with a simple core motif of one to six nucleotides that is tandemly repeated. Microsatellites are frequently used for orphan plants (e.g., those without a publicly available reference genome sequence) and for polyploid and highly heterozygous species because of their codominant nature. Considering that SSRs have been traditionally isolated from genomic libraries enriched in repetitive sequences, they are usually treated as neutral markers (Goldstein and Schlötterer 1999; Ellegren 2004). For these reasons, SSRs are highly informative to investigate clonality, identify genotypes, and describe demographic process such as genetic drift and migration (Goldstein and Schlötterer 1999; Ellegren 2004).

Recent technical advances in genomics along with the improvement of computational statistical tools allow the evaluation of nucleotide polymorphisms at the genome scale, thus overcoming some of the limitations of the SSR analysis (e.g., cost of the SSR development, low-medium throughput, and risk of technical artefacts). High-throughput sequencing (HTS) is currently changing SSR profiling in human forensics (Børsting and Morling 2015; Parson et al. 2016). In spite of the problems of sequencing highly repetitive long DNA sequences (Treangen and Salzberg 2012), a key advancement of the HTS is the ability to directly reveal a high number of (short) microsatellites sequences with a single assay (Šarhanová et al. 2018). Moreover, HTS and more specifically, the RNA-Seq is a much more affordable approach to identify SSRs at a fraction of the cost of the traditional cloning-based strategy or the whole genome sequencing and assembly (Taheri et al. 2018; Martin et al. 2013). The number of the EST-SSRs (i.e. SSR markers part of a transcribed DNA sequence) identified from RNA-Seq efforts in orphan species has greatly increased, rendering less needed the use of transferable SSRs (Hodel et al. 2016a). EST-SSRs are therefore treated as non-neutral markers, useful for instance, to study adaptive genetic diversity (Ellis and Burke 2007). It is expected that the reduction of the Next Generation Sequencing (NGS) cost will increase the diffusion of this approach for non-model tree crops. This will allow to uncover DNA polymorphisms at an unprecedented scale by making available large data on both genomic (gSSRs) and EST-SSRs.

The present study was undertaken to evaluate the differences between the features and the information provided by different classes of SSRs in cultivated European plum. P. domestica is polyploid (2n=6x=46) and agamically propagated in agriculture. These features can significantly impact the evaluation of various genetic parameters (Meirmans et al. 2018). In order to provide information useful for the analysis of genetic diversity and for association studies in P. domestica, we investigated the differences between EST-SSRs and gSSRs in revealing genetic diversity considering a common large set of European plum varieties.

Materials and methods

DNA analysis

The DNA analysis was carried out from young leaves of 44 European plum (Prunus domestica L.) Italian varieties namely: ‘Biancolella di Ottaviano’ (Bian), ‘Botta a muro bianca’ (Bott), ‘Cacazzara’ (Caca), ‘Calavrice’ (Cala), ‘Coglie e astag bianca’ (CogB), ‘Coglie e astag nera’ (Cogn), ‘Core’ (Core), ‘Del Carmine’ (Delc), ‘Della Maddalena’ (Dell), ‘Di Spagna’ (Diso), ‘Fele’ (Fele), ‘Fiaschetta’ (Fias), ‘Fiocco bianco’ (Fiob), ‘Fiocco rosa’ (Fioc), ‘Genova giallo-verde’ (Geno), ‘Lecina tonda’ (Leci), ‘Marchigiana’ (Marc), ‘Mbriaca’ (Mbri), ‘Melella’ (Mele), ‘Nera tardiva’ (Nera), ‘Occhio di bue’ (Occh), ‘Pannaranese’ (Pann), ‘Pappacona’ (Papp), ‘Pappacona gialla’ (Papg), ‘Pappacona rossa’ (Papr), ‘Pappacona verde’ (Papv), ‘Pazza di Somma’ (Pazz), ‘Pezza rossa’ (Pezz), ‘Preta ‘e zucchero’ (Pret), ‘Prunarinia’ (Prun), ‘Rachele’ (Rach), ‘Riardo’ (Riar), ‘San Giovanni’ (Sang), ‘San Rafele’ (Sanr), ‘Santa Maria’ (Sanm), ‘Santangiolese’ (Sana), ‘Santa Paola’ (Sanp), ‘Scarrafona’ (Scar), ‘Scauratella’ (Scau), ‘Sile’ (Sile), ‘Turcona’ (Turc), ‘Uttaiana’ (Utta), ‘Zi’ ‘Augusto’ (Ziau), ‘Zuccarina’ (Zucc). Main morphological features of the plums are reported in Supplementary Table 1. Samples were obtained from the field collection of the ‘Azienda Agricola Sperimentale Regionale Improsta’ of the ‘Centro per la Ricerca Applicata in Agricoltura’ (C.R.A.A.) of the Campania region of Italy. We analysed two different plants per variety. DNA isolation, quantification and amplification were performed as previously described (Manco et al. 2019).

The molecular fingerprint was carried out using five SSRs from EST-libraries (EST-SSRs) and five SSRs from genomic libraries (gSSRs) selected from different sources to avoid possible bias related to their identification and selection. The EST-SSRs loci were EPPISF001, EPPISF004, EPPISF027 (Vendramin et al. 2007), ES4 and ES5 (Li et al. 2010). The gSSRs loci were BPPCT 004, BPPCT 014, BPPCT 028 (Dirlewanger et al. 2002), PS12A02 (Downey and Iezzoni 2000), and UDP98-409 (Cipriani et al. 1999). For each publication, loci were selected based on the number of alleles detected in P. domestica. Primer sequences and annealing temperature are reported in Supplementary Table 2. PCR amplification and capillary electrophoresis were performed as described (Manco et al. 2019) using an ABI Prism 3130 Genetic Analyzer system (Applied Biosystems). Raw sizes were calculated with the local Southern algorithm implemented in the GeneMapper 4.0 software (Applied Biosystems) using the GeneScan 500 LIZ Size Standard (Applied Biosystems). Values were rounded to integer and scaled according to the SSR-core motif (Supplementary Table 2), while minimizing the average offset of the alleles for each SSR within the instrumental resolution (± 1 bp).

Locus-based data analysis

For each SSR, we calculated the number of alleles; the Simpson's index of diversity (1-D), where D is (1-Σpi), with pi being the proportion of the i-th allele of a locus; and the evenness (Ev), calculated as: [(1/λ)-1]/[(e^H)-1], where 1/λ is Stoddart and Taylor’s index and H is the Shannon diversity. The genotype accumulation curve was created by randomly sampling (n = 10,000) loci to create the distribution, and then counting the number of multilocus genotypes (MLG) for an increasing number of SSRs (from 1 to n–1, where n is the maximum number of markers). These calculations were performed using the poppr library in the R environment (Kamvar et al. 2014).

The effective number of alleles (En; the number of alleles in a population weighed for their frequencies), the gametic heterozygosity (GH; the chance that two random alleles drawn from the individual are the same) (Moody et al. 1993), and the expected heterozygosity (Hs; the expected frequency of heterozygotes within a population) were calculated with the GenoDive software (Meirmans and Van Tienderen 2004).

The allelic richness (AR), expressed as the expected number of alleles among k gene copies (k = 79 EST-SSRs; k = 82 gSSRs), the gene diversity (GD) corrected for sample size, and the observed heterozygosity (Ho) were calculated with SPAGeDi v. 1.5 (Hardy and Vekemans 2002). For each locus, we also statistically evaluate the ratio (R) between the effective number of alleles and the total number of alleles using the z test statistics based on a random generation for the normal distribution (pnorm) implemented in R (Team R Core 2013).

Distance-based data analysis

Pairwise genetic distances were calculated with the Bruvo’s coefficient, because this index was developed to analyse microsatellite data from polyploids (Bruvo et al. 2004), utilizing the combinational model, implemented in the poppr library (Kamvar et al. 2014). The agglomerative hierarchical clustering was performed by applying the unweighted pair-group method with arithmetic averages (UPGMA) algorithm implemented in R using the Bruvo’s distances. To statistically test the linear correlation between the molecular distances, the two parallel matrices from gSSR and EST-SSRs were compared by a Mantel test with 9999 permutations. Tree topologies were compared considering the parallel matrices with cophenetic values. A tanglegram plot of a side by side tree representation was created with the dendextend library in the R environment (Galili 2015).

Results

Differences in locus-based parameters

The DNA fingerprint of the 44 cultivated varieties indicated that all the EST-SSRs were polymorphic in our germplasm collection. Main genetic parameters are presented in Table 1. The EST-derived SSRs generated on average 9.60 alleles per locus, ranging from a minimum of six (ES4) to a maximum of 12 (EPPISF001). The Simpson’s index of diversity (1–D) was high and displayed little variation across loci (CV = 5.74%), while a higher variation across loci was present for the evenness (CV = 15.6%), a parameter that measures the distribution of genotype abundance in a population. The number of alleles was positively correlated to the evenness (r = 0.717, p < 0.05; Pearson correlation coefficient) yet, the locus that distributed the alleles most uniformly in the varieties under investigation was the one with the least alleles (ES4). Considering that in a hexaploidy species the overall level of heterozygosity is not frequently due to full heterozygote loci, we calculated also the gametic heterozygosity (GD), a parameter that is weighted considering the different partial heterozygotes. As expected, the observed homozygosity was close to 1.0 for many loci, while the GD was lower and little differed across loci (CV = 8.3%). To account for the ploidy when estimating genetic diversity, the expected heterozygosity (Hs) was calculated with correction for missing dosage. This index was on average high (0.74) and did not correlate with the number of alleles in each locus.

Table 1.

Main genetic indices for the EST-SSRs (locus) and their average values (mean)

Locus An 1-D Ev AR GD Ho GH Hs En R
EPPISF001 12 0.82 0.73 11.07 0.79 0.99 0.66 0.81 5.15 0.43
EPPISF004 11 0.73 0.62 10.24 0.67 1.00 0.66 0.58 2.36 0.21**
EPPISF027 8 0.70 0.71 7.67 0.66 1.00 0.65 0.65 2.77 0.35*
ES4 6 0.75 0.83 6.00 0.72 0.98 0.47 0.69 3.14 0.52
ES5 11 0.71 0.55 10.88 0.65 0.78 1.00 0.97 2.90 0.26**
Mean 9.6 0.74 0.69 9.17 0.70 0.95 0.69 0.74 3.26 0.36

An number of alleles; 1-D: Simpson’s index of diversity; Ev genotypic evenness; AR allelic richness; GD gene diversity; Ho observed heterozygosity: GH gametic heterozygosity; Hs expected heterozygosity; En effective number of allele; R: En/An asterisks indicate a statistically significant deviation relative to a normal distribution (*p < 0.05; **p < 0.01)

The gSSR were all polymorphic. The number of alleles of the gSSR (n = 96) was significantly higher (p < 0.001; t test) and overall, double than the number of EST-SSRs (n = 48), but with a similar CV across loci (Table 2). The number of gSSR alleles ranged from a minimum of 14, a value higher than the maximum number of alleles for EST-SSRs. Likewise, also allelic richness, the Simpson’s index of diversity, and the gene diversity were significantly higher for gSSRs (p < 0.001; t test), indicating that the genetic diversity captured by the gSSRs is higher. On the other hand, the evenness did not display a significant difference between the two kinds of SSRs (p > 0.05; t test). The observed heterozygosity (Ho) was for many loci close to the maximum value, similarly to the EST-SSRs, and also the gametic heterozygosity did not differ between the two SSR classes (p > 0.05; t test). These data suggest that differences in the number of alleles did not strongly influence their distribution in the analysed population.

Table 2.

Main genetic indices for the gSSRs (locus) and their average values (mean)

Locus An 1-D Ev AR GD Ho GH Hs En R
BPPCT004 17 0.90 0.80 15.49 0.89 1.00 0.88 0.88 8.25 0.49
BPPCT014 26 0.92 0.70 24.13 0.91 0.94 0.59 0.85 6.24 0.24**
BPPCT028 14 0.77 0.54 14.00 0.74 0.73 1.00 1.00 2.45 0.17**
PS12A02 21 0.92 0.80 19.27 0.91 1.00 0.72 0.91 10.42 0.47
UDP98409 18 0.89 0.76 16.72 0.88 1.00 0.59 0.85 6.30 0.35*
Mean 19.2 0.88 0.72 17.92 0.87 0.94 0.76 0.90 6.73 0.34

An number of alleles; 1-D Simpson’s index of diversity; Ev genotypic evenness; AR allelic richness; GD gene diversity; Ho observed heterozygosity: GH gametic heterozygosity; Hs expected heterozygosity; En effective number of allele; R: En/An asterisks indicate a statistically significant deviation relative to a normal distribution (*p < 0.05; **p < 0.01)

The effective number of alleles (En) for both marker classes was substantially lower than the number of alleles, and did not significantly correlate with the observed number of alleles (r = 0,36; p > 0.05 for EST-SSRs; r = 0.42; p > 0.05 for gSSRs; Pearson correlation coefficient). However, in absolute terms, En differed between marker classes (p < 0.05; t test). To account for the different numbers of alleles between EST-SSRs and gSSRs, we evaluated the ratio (R) between En and An. As expected, there were locus-specific differences in both SSRs classes but the average R and the number of loci with a significantly low R did not differ between EST-SSRs and gSSRs.

To analyse the ability of the SSRs in discriminating between unique genotypes according to their number, we performed an accumulation analysis (Fig. 1). The genotype accumulation for an increasing number of EST-SSRs or gSSR indicated that gSSR are always able to capture a higher number of multilocus genotypes (MLG). As expected, this difference gets smaller when reaching the plateau, i.e., the maximum number of MLG (44). The data indicated that gSSRs have a greater discriminatory power in European plum but also that, for the EST-SSRs, the addition of a small number of markers can yield a similar genotyping ability.

Fig. 1.

Fig. 1

Genotype accumulation plot. The graph shows in different panels (representing increasing number of SSR loci as indicated in the top dark grey bar), a box-and-whisker plot of the number of multilocus genotypes obtained by randomly sampling loci without replacement (n = 10,000). The band inside the box represents the median (2nd quartile). Dots indicate outliers (i.e., values outside 1.5 times the interquartile range above the upper and below the lower quartile). EST-SSRs (resp., gSSRs) box-and-whisker plots are in salmon (resp., turquoise) colour

Differences in distance-based parameters

Due to their codominance, SSRs are highly valued to obtain matrices of genetic resemblance especially for highly heterozygotes genotypes, for instance, through the construction of dendrograms. To investigate the ability to describe the genetic differences among European plum genotypes, we calculated the Bruvo’s distances based on either EST-SSRs or gSSRs. The average Bruvo’s distance was higher for the gSSR (p < 0.001; t test) (Fig. 2). Moreover, gSSRs were able to discriminate all the varieties under investigation, while the EST-SSRs could not discriminate between two pairs of cultivars (‘Occhio di bue’ and ‘Fiocco bianco’; ‘Uttaiana e ‘Della Maddalena’). The distances based on gSSRs or EST-SSRs were both normally distributed (p > 0.05, Kolmogorov–Smirnov test of normality) (Supplementary Fig. 1). The correlation between the two distance matrices was low (r = 0.167) and statistically significant (p < 0.001; Mantel’s test). Moreover, a linear trend or a triangular shape of the scatterplot was not evident (Fig. 3). Lower pairwise distances obtained with one class of SSRs did not largely associate to lower pairwise distances with the other class of SSRs. Similarly, higher distances with one SRR class did not necessarily associate with higher distances obtained with the other thus, not giving rise to a triangular-shaped scatterplot expected when genotype pairs with low (respectively, high) level of diversity relative to one marker type show low (resp., high) level of diversity relative to another one. This implies that the higher distances provided by the gSSRs are not simply a linear extension of the genetic distance revealed by the EST-SSRs.

Fig. 2.

Fig. 2

Distribution of the pairwise genetic distance among varieties calculated with EST-SSRs (salmon) or gSSRs (turquoise). Each bar shows the number of pairwise comparisons between the 44 varieties that have a genetic distance (calculated with the Bruvo’s index) included in the class intervals reported on the x-axis

Fig. 3.

Fig. 3

Scatterplot displaying the correlation between the genetic distances calculated from EST-SSRs (x-axis) and gSSRs (y-axis). The two coordinates of each circle are the genetic distances of a pairwise comparison between varieties

To illustrate possible differences in describing genetic diversity, we built and compared dendrograms from hierarchical clustering based on the Bruvo’s distance. The cophenetic correlation between the trees was positive (r = 0.143; p < 0.05, Mantel) and slightly lower than the correlation of the genetic distances (Fig. 4). The result indicated that in plum the genetic resemblance depicted with EST-SSRs did not largely relate with that obtained with gSSRs.

Fig. 4.

Fig. 4

A comparison of the hierarchical cluster analysis (HCA) of the plum varieties using EST-SSRs (left) or gSSRs (right) data. HCA was performed with the UPGMA algorithm for both dendrograms, using the Bruvo’s genetic distances. To ease the comparison, coloured lines connect identical names. The different line type in the dendrograms highlights distinct edges in a tree (compared to the other one)

Discussion

SSRs are one of the highly polymorphic and versatile DNA markers for plants (Hodel et al. 2016b; Vieira et al. 2016). Although their genome coverage remains lower than that of SNPs, the advent of NGS technologies largely expanded the number of available microsatellites, especially those identified in transcribed sequences. The usefulness of microsatellite in European plum has been previously demonstrated (Decroocq et al. 2004; Dirlewanger et al. 2002; Li et al. 2010). Considering that advances in DNA sequencing provide the opportunity to select and analyse an ample number of different classes of SSRs, in this work, we studied and compared the features and usefulness of EST-SSRs and gSSRs. The data indicated that the two SSR classes reveal different levels of polymorphism in P. domestica. EST-SSRs had significantly lower values in the allele-based parameters such as number of alleles and the Simpson’s index of diversity. Nonetheless, the genotypic evenness and the observed heterozygosity were comparable. The latter can be explained by the hexaploidy of the European plum, because a polyploid locus harbours a larger amount of diversity than a diploid one, thus allowing a relatively reduced number of alleles to reveal a high level of heterozygosity in a population. The absence of differences in parameters that strongly depend on allele distribution, such as the evenness and expected heterozygosity, implies that the frequency of the alleles at each locus displays variation that cannot be explained considering the number of alleles. The lack of allele dosage in the molecular screening of polyploids poses limitations in the interpretation of frequency-based analysis (Meirmans et al. 2018), and it is expected to result in an overestimation of the genetic diversity. Nonetheless, the data may also suggest that, in European plum, the occurrence of rare SSR-alleles is not strongly dependant on the class of the marker.

Our study agrees with the literature highlighting the presence of a difference between EST-SSRs and gSSRs when the comparison is carried out within a plant species. gSSRs are generally considered more polymorphic, and this feature has been typically explained in an evolutionary framework related to the natural selection of molecular diversity. However, for a population of clonally selected varieties, it is predicted that clonal propagation will increase the non-random association of alleles at different loci (Birky 1996), making less pertinent the distinction between neutral and non-neutral markers. Our work indicated a large difference in the number of alleles, which is similar, in relative terms, to the one described in chestnut (Martin et al. 2010). In crop plants subjected to intense breeding, such as cereals, cucumber, and sugarcane, the number of gSSR alleles was slightly higher (Xinquan et al. 2005; Hu et al. 2011; Pinto et al. 2006) or similar to the EST-SSRs alleles (Parthiban et al. 2018). In tomato, EST-SSRs had more alleles than gSSRs (Zhou et al. 2015). Overall, it is likely that the amplitude of the difference between the number of EST-SSRs and gSSRs positively correlates with the genetic diversity of the populations/species under investigation, as also suggested in barley (Zhang et al. 2014). For cultivated P. domestica it is also necessary to consider that in a clonal population, the alleles at one locus independently accumulate mutations over time (Jarni et al. 2015). Many microsatellites were present in plant genomes long before domestication (Morgante et al. 2002). It is not possible to exclude that a pre-existing difference in the allele number of gSSR and EST-SSRs (dependent on the non-neutral selection of transcribed sequences) in a hypothetical European plum pre-domesticated population has been equally enlarged by asexual propagation, resulting in a large, yet different, number of alleles for both SSR classes. Clonal propagation in polyploids is expected to favour the maintenance of neo‐functionalized and functionally inactive variants of sequences present in multiple copies (Lynch and Conery 2000).

To evaluate the ability to designate relationships between individual, we produced matrices of resemblance to build dendrograms. The data indicated that gSSR have a higher discriminating power and allowed to distinguish all the genotypes under investigation. Nonetheless, the MLG accumulation curve indicated that adding few EST-SSRs can achieve a similar result than gSSRs. In barley, an equivalent rate of discrimination of 12 gSSRs was achieved with 17 EST-SSRs (Leigh et al. 2003). Overall, irrespective of their origin, a handful of SSRs is sufficient for discriminatory purposes in European plum, mostly because of the high number of alleles per locus. The genetic distances showed by the gSSR were on average higher than those revealed by EST-SSRs and this feature associates with the number of diverse alleles of each class. Unlike what is reported in herbaceous plants (Xinquan et al. 2005; Hu et al. 2011; Pinto et al. 2006), in our study, the distances and the cophenetic matrices weakly correlated, implying that the two SSRs classes sampled a different diversity. The fixation of agronomically valuable phenotypes through clonal selection and agamic propagation of the tree allows to preserve adaptive and neutral genetic diversity, but also makes difficult to disentangle the evolutionary selections of DNA variation. Further studies will have to clarify whether the different diversity highlighted by the two SSR classes (at least partially) correlates to different evolutionary forces driving the selection of DNA variation (e.g., natural vs. artificial).

In conclusion, detecting and quantifying differences in polyploid genomes still represent a challenge. In European plum, gSSR and EST-SSRs provide different estimates of some of the genetic parameters calculated at locus level. Larger differences are present with respect to the estimation of the pairwise genetic resemblance. The two classes of SSRs should be considered complementary. gSSRs should be preferred for measures of genetic diversity and for discrimination purposes, considering that highly polymorphic loci provided better estimates of genetic distances (Kalinowski 2002). Nonetheless, the data also indicated that the two SSR classes provided weakly correlated estimates of genetic distance, making useful the contribution of the EST-SSRs not only for facilitating the study of recent adaptation or specific plant traits but also for the effective description and conservation of genetic resources.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Funding

Open access funding provided by Università degli Studi di Napoli Federico II within the CRUI-CARE Agreement.

Compliance with ethical standards

Conflict of interest

The authors declare that they have no conflict of interests.

References

  1. Aranzana MJ, Decroocq V, Dirlewanger E, Eduardo I, Gao ZS, Gasic K, Iezzoni A, Jung S, Peace C, Prieto H. Prunus genetics and applications after de novo genome sequencing: achievements and prospects. Horticu Res. 2019;6(1):1–25. doi: 10.1038/s41438-019-0140-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Birky CW. Heterozygosity, Heteromorphy, and phylogenetic trees in asexual eukaryotes. Genetics. 1996;144(1):427–437. doi: 10.1093/genetics/144.1.427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Børsting C, Morling N. Next generation sequencing and its applications in forensic genetics. Fore Sci Internat Genet. 2015;18:78–89. doi: 10.1016/j.fsigen.2015.02.002. [DOI] [PubMed] [Google Scholar]
  4. Bruvo R, Michiels NK, Souza TG, Schulenburg H. A simple method for the calculation of microsatellite genotype distances irrespective of ploidy level. Mole Ecol. 2004;13(7):2101–2106. doi: 10.1111/j.1365-294X.2004.02209.x. [DOI] [PubMed] [Google Scholar]
  5. Cipriani G, Lot G, Huang W-G, Marrazzo M, Peterlunger E, Testolin R. AC/GT and AG/CT microsatellite repeats in peach [Prunus persica (L) Batsch]: isolation, characterisation and cross-species amplification in Prunus. Theor Appl Genet. 1999;99(1–2):65–72. [Google Scholar]
  6. Claverie M, Bosselut N, Lecouls A, Voisin R, Lafargue B, Poizat C, Kleinhentz M, Laigret F, Dirlewanger E, Esmenjaud D. Location of independent root-knot nematode resistance genes in plum and peach. Theor Appl Genet. 2004;108(4):765–773. doi: 10.1007/s00122-003-1463-1. [DOI] [PubMed] [Google Scholar]
  7. Decroocq V, Hagen L, Favé M-G, Eyquard J-P, Pierronnet A. Microsatellite markers in the hexaploid Prunus domestica species and parentage lineage of three European plum cultivars using nuclear and chloroplast simple-sequence repeats. Mol Breeding. 2004;13(2):135–142. [Google Scholar]
  8. Dirlewanger E, Cosson P, Tavaud M, Aranzana M, Poizat C, Zanetto A, Arús P, Laigret F. Development of microsatellite markers in peach [Prunus persica (L.) Batsch] and their use in genetic diversity analysis in peach and sweet cherry (Prunus avium L.) Theor Appl Genet. 2002;105(1):127–138. doi: 10.1007/s00122-002-0867-7. [DOI] [PubMed] [Google Scholar]
  9. Downey SL, Iezzoni AF. Polymorphic DNA markers in black cherry (Prunus serotina) are identified using sequences from sweet cherry, peach, and sour cherry. J Am Soc Hortic Sci. 2000;125(1):76–80. [Google Scholar]
  10. Ellegren H. Microsatellites: simple sequences with complex evolution. Nat Rev Genet. 2004;5(6):435–445. doi: 10.1038/nrg1348. [DOI] [PubMed] [Google Scholar]
  11. Ellis J, Burke J. EST-SSRs as a resource for population genetic analyses. Heredity. 2007;99(2):125–132. doi: 10.1038/sj.hdy.6801001. [DOI] [PubMed] [Google Scholar]
  12. Galili T. dendextend: an R package for visualizing, adjusting and comparing trees of hierarchical clustering. Bioinformatics. 2015;31(22):3718–3720. doi: 10.1093/bioinformatics/btv428. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Gharbi O, Wünsch A, Rodrigo J. Characterization of accessions of ‘Reine Claude Verte’ plum using Prunus SRR and phenotypic traits. Sci Hortic. 2014;169:57–65. [Google Scholar]
  14. Goldstein DB, Schlötterer C. Microsatellites: evolution and applications. Oxford: Oxford University Press; 1999. [Google Scholar]
  15. Hardy OJ, Vekemans X. SPAGeDi: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol Ecol Notes. 2002;2(4):618–620. [Google Scholar]
  16. Hodel RG, Gitzendanner MA, Germain-Aubrey CC, Liu X, Crowl AA, Sun M, Landis JB, Segovia-Salcedo MC, Douglas NA, Chen S. A new resource for the development of SSR markers: millions of loci from a thousand plant transcriptomes. Appl Plant Sci. 2016;4(6):1600024. doi: 10.3732/apps.1600024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Hodel RG, Segovia-Salcedo MC, Landis JB, Crowl AA, Sun M, Liu X, Gitzendanner MA, Douglas NA, Germain-Aubrey CC, Chen S. The report of my death was an exaggeration: a review for researchers using microsatellites in the 21st century. Appl Plant Sci. 2016;4(6):1600025. doi: 10.3732/apps.1600025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Horvath A, Balsemin E, Barbot J-C, Christmann H, Manzano G, Reynet P, Laigret F, Mariette S. Phenotypic variability and genetic structure in plum (Prunus domestica L.), cherry plum (P. cerasifera Ehrh.) and sloe (P. spinosa L) Sci Hortic. 2011;129(2):283–293. [Google Scholar]
  19. Hu J, Wang L, Li J. Comparison of genomic SSR and EST-SSR markers for estimating genetic diversity in cucumber. Biol Plant. 2011;55(3):577–580. [Google Scholar]
  20. Jarni K, Jakše J, Brus R. Vegetative propagation: Linear barriers and somatic mutation affect the genetic structure of a Prunus avium L. Stand Forestry. 2015 doi: 10.1093/forestry/cpv029. [DOI] [Google Scholar]
  21. Kalinowski ST. How many alleles per locus should be used to estimate genetic distances? Heredity. 2002;88(1):62–65. doi: 10.1038/sj.hdy.6800009. [DOI] [PubMed] [Google Scholar]
  22. Kamvar ZN, Tabima JF, Grünwald NJ. Poppr: an R package for genetic analysis of populations with clonal, partially clonal, and/or sexual reproduction. PeerJ. 2014;2:e281. doi: 10.7717/peerj.281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Leigh F, Lea V, Law J, Wolters P, Powell W, Donini P. Assessment of EST-and genomic microsatellite markers for variety discrimination and genetic diversity studies in wheat. Euphytica. 2003;133(3):359–366. [Google Scholar]
  24. Li X, Shangguan L, Song C, Wang C, Gao Z, Yu H, Fang J. Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers. BMC Genet. 2010;11(1):66. doi: 10.1186/1471-2156-11-66. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Liu W, Liu D, Zhang A, Feng C, Yang J, Yoon J, Li S. Genetic diversity and phylogenetic relationships among plum germplasm resources in China assessed with inter-simple sequence repeat markers. J Am Soc Hortic Sci. 2007;132(5):619–628. [Google Scholar]
  26. Lynch M, Conery JS. The evolutionary fate and consequences of duplicate genes. Science. 2000;290(5494):1151–1155. doi: 10.1126/science.290.5494.1151. [DOI] [PubMed] [Google Scholar]
  27. Manco R, Basile B, Capuozzo C, Scognamiglio P, Forlani M, Rao R, Corrado G. Molecular and phenotypic diversity of traditional european plum (Prunus domestica L.) germplasm of Southern Italy. Sustainability. 2019;11(15):4112. [Google Scholar]
  28. Martin MA, Mattioni C, Cherubini M, Taurchini D, Villani F. Genetic diversity in European chestnut populations by means of genomic and genic microsatellite markers. Tree Genet Genomes. 2010;6(5):735–744. [Google Scholar]
  29. Martin L, Fei Z, Giovannoni J, Rose JKC. Catalyzing plant science research with RNA-seq. Front Plant Sci. 2013;4:66. doi: 10.3389/fpls.2013.00066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Meirmans PG, Van Tienderen PH. GENOTYPE and GENODIVE: two programs for the analysis of genetic diversity of asexual organisms. Mol Ecol Notes. 2004;4(4):792–794. [Google Scholar]
  31. Meirmans PG, Liu S, van Tienderen PH. The analysis of polyploid genetic data. J Hered. 2018;109(3):283–296. doi: 10.1093/jhered/esy006. [DOI] [PubMed] [Google Scholar]
  32. Mnejja M, Garcia-Mas J, Audergon J-M, Arús P. Prunus microsatellite marker transferability across rosaceous crops. Tree Genet Genomes. 2010;6(5):689–700. [Google Scholar]
  33. Moody ME, Mueller L, Soltis D. Genetic variation and random drift in autotetraploid populations. Genetics. 1993;134(2):649–657. doi: 10.1093/genetics/134.2.649. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Morgante M, Hanafey M, Powell W. Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nat Genet. 2002;30(2):194–200. doi: 10.1038/ng822. [DOI] [PubMed] [Google Scholar]
  35. Neumüller M. Fundamental and applied aspects of plum (Prunus domestica) breeding Fruit. Veg Cereal Sci Biotechnol. 2011;5:139–156. [Google Scholar]
  36. Parson W, Ballard D, Budowle B, Butler JM, Gettings KB, Gill P, Gusmão L, Hares DR, Irwin JA, King JL. Massively parallel sequencing of forensic STRs: considerations of the DNA commission of the International Society for Forensic Genetics (ISFG) on minimal nomenclature requirements. Forensic Sci Internat Genet. 2016;22:54–63. doi: 10.1016/j.fsigen.2016.01.009. [DOI] [PubMed] [Google Scholar]
  37. Parthiban S, Govindaraj P, Senthilkumar S. Comparison of relative efficiency of genomic SSR and EST-SSR markers in estimating genetic diversity in sugarcane. Biotech. 2018;8(3):144. doi: 10.1007/s13205-018-1172-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Pinto LR, Oliveira KM, Marconi T, Garcia AAF, Ulian EC, De Souza A. Characterization of novel sugarcane expressed sequence tag microsatellites and their comparison with genomic SSRs. Plant Breeding. 2006;125(4):378–384. [Google Scholar]
  39. Reales A, Sargent DJ, Tobutt KR, Rivera D. Phylogenetics of Eurasian plums, Prunus L section Prunus (Rosaceae), according to coding and non-coding chloroplast DNA sequences. Tree Genet Genomes. 2010;6(1):37–45. [Google Scholar]
  40. Šarhanová P, Pfanzelt S, Brandt R, Himmelbach A, Blattner FR. SSR-seq: Genotyping of microsatellites using Next-Generation Sequencing reveals higher level of polymorphism as compared to traditional fragment size scoring. Ecol Evol. 2018;8(22):10817–10833. doi: 10.1002/ece3.4533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Sehic J, Nybom H, Hjeltnes S, Gaši F. Genetic diversity and structure of Nordic plum germplasm preserved ex situ and on-farm. Sci Hortic. 2015;190:195–202. [Google Scholar]
  42. Shimada T, Hayama H, Haji T, Yamaguchi M, Yoshida M. Genetic diversity of plums characterized by random amplified polymorphic DNA (RAPD) analysis. Euphytica. 1999;109(3):143. [Google Scholar]
  43. Taheri S, Lee Abdullah T, Yusop MR, Hanafi MM, Sahebi M, Azizi P, Shamshiri RR. Mining and development of novel SSR markers using next generation sequencing (NGS) data in plants. Molecules. 2018;23(2):399. doi: 10.3390/molecules23020399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Team R Core (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/.
  45. Treangen TJ, Salzberg SL. Repetitive DNA and next-generation sequencing: computational challenges and solutions. Nat Rev Genet. 2012;13(1):36–46. doi: 10.1038/nrg3117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Vendramin E, Dettori M, Giovinazzi J, Micali S, Quarta R, Verde I. A set of EST-SSRs isolated from peach fruit transcriptome and their transportability across Prunus species. Mol Ecol Notes. 2007;7(2):307–310. [Google Scholar]
  47. Vieira MLC, Santini L, Diniz AL, Munhoz CdF. Microsatellite markers: what they mean and why they are so useful. Genet Mole Biol. 2016;39(3):312–328. doi: 10.1590/1678-4685-GMB-2016-0027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Xinquan Y, Peng L, Zongfu H, Zhongfu N, Qixin S. Genetic diversity revealed by genomic-SSR and EST-SSR markers among common wheat, spelt and compactum. Prog Nat Sci. 2005;15(1):24–33. [Google Scholar]
  49. Zhang M, Mao W, Zhang G, Wu F. Development and characterization of polymorphic EST-SSR and genomic SSR markers for Tibetan annual wild barley. PLoS ONE. 2014;9:4. doi: 10.1371/journal.pone.0094881. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Zhebentyayeva T, Shankar V, Scorza R, Callahan A, Ravelonandro M, Castro S, DeJong T, Saski CA, Dardick C. Genetic characterization of worldwide Prunus domestica (plum) germplasm using sequence-based genotyping. Hortic Res. 2019;6(1):12. doi: 10.1038/s41438-018-0090-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Zhou R, Wu Z, Cao X, Jiang F. Genetic diversity of cultivated and wild tomatoes revealed by morphological traits and SSR markers. Genet Mol Res. 2015;14(4):13868–13879. doi: 10.4238/2015.October.29.7. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from 3 Biotech are provided here courtesy of Springer

RESOURCES