Abstract
Interspecific pear (Pyrus spp.) hybrid populations are often used to develop novel cultivars. Pear cultivar breeding is a lengthy process because of long juvenility and the subsequent time required for reliable fruit phenotyping. Molecular techniques such as genome-wide association (GWA) and genomic selection (GS) provide an opportunity to fast-forward the development of high-value cultivars. We evaluated the genetic architecture of 10 pear fruit phenotypes (including sensory traits) and the potential of GS using genotyping-by-sequencing of 550 hybrid seedlings from nine interrelated full-sib families. Results from GWA suggested a complex polygenic nature of all 10 traits as the maximum variance explained by each marker was less than 4% of the phenotypic variance. The effect-size of SNPs for each trait suggested many genes of small effect and few of moderate effect. Some genomic regions associated with pear sensory traits were similar to those reported for apple – possibly a result of high synteny between the apple and pear genomes. The average (across nine families) GS accuracy varied from 0.32 (for crispness) to 0.62 (for sweetness), with an across-trait average of 0.42. Further efforts are needed to develop larger genotype-phenotype datasets in order to predict fruit phenotypes of untested seedlings with sufficient efficiency.
Subject terms: Molecular biology, Plant sciences
Introduction
Pear is currently grown commercially in almost every continent of the world. The number of catalogued species in the genus Pyrus varies according to different studies, but commercial breeding has mainly focussed on three species: P. communis (European pear), and two Asian pears namely P. pyrifolia and P. x bretschneideri1. High flavour, buttery and juicy texture are among the key characteristics of European pears, while Asian pears generally have crisp texture and subtle flavour. Breeding programmes in New Zealand and elsewhere have combined Asian and European pears to develop crisp, juicy and highly flavoured hybrid cultivars.
Pear breeding programmes generally involve mating of selected parents to create hybrid seedling populations, selection amongst seedling populations, and testing of best performing seedlings. Traditional pear breeding is an expensive and lengthy process primarily because seedlings grown on their own roots typically have a long juvenile period. Reduction of generation time is a focus of many breeding programmes as this has the largest influence on the time taken for new products to reach market2. One way of reducing the juvenility period is by growing seedlings in the glasshouse to accelerate the growth rate before planting them into the orchard. New genomic technologies also offer the possibility of accelerating and increasing efficiencies and effectiveness of breeding programmes for new pear cultivars. Most pear fruit traits, such as texture, size, storage ability, resistance to scuffing, flavour and aroma are reported to be affected by numerous loci with small-to-moderate effects3–6.
A traditional MAS scheme ignores the contribution of all other genes, and thus could result in a lower response to selection, especially for polygenic traits. High-speed and reduced cost of genotyping technologies have facilitated the availability of large number of single nucleotide polymorphic (SNP) markers enabling researchers to study marker-trait associations across the whole genome. High-density genotyping platforms have also facilitated the implementation of genomic selection (GS). GS involves simultaneous estimation of genomewide SNPs effects to predict genomic breeding values7. The traditional MAS is best suited for monogenic or oligogenic traits whereas GS is ideal for traits controlled by many loci with small-to-moderate effects. Therefore, a two-stage selection strategy combining MAS and GS has the potential to accelerate breeding cycles and improve the efficiency of fruit breeding programmes8. There are reports of GWAS in germplasm populations of Japanese pear P. pyrifolia6,9 and Pyrus spp.5, but the evaluation of GS has only been attempted in Japanese pear6.
Despite sensory traits being primary selection criteria for developing new cultivars, there appears to be no report of GWA and GS for pear sensory traits – something that hinders the acceleration of breeding cycles. The main objectives of this study were to conduct GWA to find the candidate genomic regions for pear fruit traits including sensory eating quality traits, and evaluate the potential of GS using a hybrid population derived from crosses between Asian and European pears.
Results
Genetic parameters
The distribution of adjusted fruit phenotypes is shown in Supplementary Fig. S1. Estimates of narrow-sense heritability were low for sweetness (SWET, 0.16), but moderate–high (0.40–0.69) for all other traits (Table 1). Fruit weight (AVFW) was the most heritable trait at 0.69, followed by sourness (SOUR). The highest genetic correlation (0.81) was observed between sensory firmness (FIRM) and crispness (CRIS). Sourness (SOUR) was adversely correlated with sweetness (SWET: −0.31) and flavour intensity (FINT: −0.46), and SWET was favourably correlated (about 0.40) with juiciness (JUIC) and FINT. Fruit with high FIRM were relatively less susceptible to scuffing (SCUF) as there was a significant correlation (−0.24) between these traits. High russet (RUSS) fruit tended to display high SOUR and less JUIC. Estimated genetic correlation of AVFW with SOUR, FINT, SHAP and RUSS was found to be significant (Table 1).
Table 1.
FIRM | CRIS | JUIC | SWET | SOUR | FINT | SCUF | SHAP | RUSS | AVFW | |
---|---|---|---|---|---|---|---|---|---|---|
FIRM | 0.47 | |||||||||
CRIS | 0.81* | 0.40 | ||||||||
JUIC | 0.23* | 0.39* | 0.58 | |||||||
SWET | 0.09 | 0.21* | 0.41* | 0.16 | ||||||
SOUR | −0.09 | −0.11 | −0.02 | −0.31* | 0.62 | |||||
FINT | 0.17 | 0.10 | 0.26* | 0.38* | −0.46* | 0.46 | ||||
SCUF | −0.24* | −0.19* | −0.08 | 0.05 | 0.21* | −0.21* | 0.51 | |||
SHAP | −0.05 | −0.05 | −0.04 | −0.07 | 0.06 | −0.10 | −0.12 | 0.57 | ||
RUSS | −0.16 | −0.15 | −0.24* | −0.05 | 0.24* | −0.27* | −0.08 | 0.03 | 0.59 | |
AVFW | −0.10 | 0.04 | 0.05 | 0.12 | 0.30* | −0.32* | 0.04 | 0.33* | 0.28* | 0.69 |
Population structure and LD decay
Overall, a product-moment correlation of 0.69 was observed between pedigree-based and SNP-based estimates of pair-wise coefficient of relationships. The average SNP-based within-family pairwise relationship ranged from 0.50 (family p490) to 0.72 (p449), and the average relationships among seedlings from different families ranged from 0.30 and 0.52 (Fig. 1). Family p449 was derived from crossing two siblings, hence showed a relatively higher relationship coefficient. A plot of the first two principal components (PCs) of the SNP-based realized relationship matrix (G) grouped seedlings largely according to their familial relationships (Fig. 2). The first two PCs were used to account for population structure in GWA models. Some individuals did not cluster within their pedigree-assigned full-sib family groupings. For example, two individuals from family p493 clustered with p491, which suggested some pollen contamination or mislabelling. The pattern of LD (r2) decay in the genetically related population of 550 individuals showed a high degree of LD even at longer distances between markers. For example, the average r2 for SNPs separated by 0.5 cM, 1.0 cM, and 5.0 cM was 0.29, 0.26, and 0.19, respectively (Fig. 3).
Genetic architecture
Using single-locus GWA, significant (p < 0.001) SNP-trait association signals for FIRM and CRIS were identified on LG3 and LG10; for SCUF on LG2, LG4 and LG10; for SHAP on LG11 and LG15; for SOUR on LG1, LG6 and LG13; for SWET on LG4, LG5 and LG13; and for FINT on LG1 and LG8 (Fig. 4). The majority of SNPs individually explained about 0.5% of phenotypic variance (Supplementary Fig. S2), while the maximum effect-size varied between 2% (for AVFW and RUSS) and 3.5% (SWET) (Table 2). The largest-effect SNP was common between FIRM and CRISP on LG10, while the SNPs with largest effect on JUIC and SWET were located on LG5 (Table 2). The distribution of observed ASEs (presented as phenotypic standard deviation (PSD)) for each trait were moderately leptokurtic, suggesting only few SNPs with moderate effect, and the highest ASE varied between 0.41 and 0.68 (Fig. 5). The Kolmogorov-Smirnov test showed that these observed distributions were significantly (p < 0.05) different from normal and exponential density functions, but fitted best to a gamma distribution (i.e. the majority of the SNPs having a small effect and a few a moderate effect). The estimated shape and rate parameters of gamma distribution were about 1.25 and 15.0, respectively, for all traits.
Table 2.
Trait | SNP | Linkage group | Position (cM) | Significance (−log10p) | ASE | R2 (%) |
---|---|---|---|---|---|---|
FIRM | S764_78213 | 10 | 95.60 | 4.00 | 0.52 | 3.0 |
CRISP | S764_78213 | 10 | 95.60 | 3.64 | 0.48 | 2.6 |
JUIC | S210_30053 | 5 | 298.97 | 3.54 | 0.41 | 2.2 |
SWET | S182_250115 | 5 | 189.89 | 4.45 | 0.33 | 3.5 |
SOUR | S465_110990 | 1 | 187.40 | 4.42 | 0.44 | 3.0 |
FINT | S203_236962 | 8 | 214.20 | 4.63 | 0.41 | 3.3 |
SCUF | S29076_1553 | 10 | 184.92 | 3.88 | 0.44 | 2.4 |
SHAP | S4855_850 | 11 | 16.65 | 4.21 | 0.65 | 3.0 |
RUSS | S150_272459 | 9 | 123.17 | 3.77 | 0.48 | 2.0 |
AVFW | S340_202551 | 12 | 79.42 | 3.25 | 0.53 | 2.0 |
The number of significant (p < 0.001) SNPs identified using multi-locus methods MLMM and MRMLM were 79 and 77, respectively, compared to 67 identified from single-locus GWA (Supplementary Table S1). Majority of the significant SNPs identified using the two multi-locus methods were on the same genomic locations as those from single-locus method (GAPIT) for all traits (Supplementary Figs S3 and S4, Supplementary Table S1). MLMM and MRMLM identified SNPs significantly associated with RUSS on LG 4, 16 and 17, which were insignificant in single-locus GWA (Supplementary Table S1).
Genomic prediction accuracies
We applied a cross-validation scheme by using each full-sib family in turn as a validation population (VP), resulting in a nine-fold cross validation. The results are displayed using a boxplot graph (Fig. 6). The average (across nine families) accuracy varied from 0.32 (CRIS) to 0.62 (SWET), and the range of predicted accuracy was lowest (0.24) for SWET and highest (0.77) for SCUF. The higher prediction accuracy for SWET was partly due to low genetic and phenotypic variability (Table 1, Supplementary Fig. 1). Across all 10 traits, the prediction accuracy was lowest (0.32) for family p449 and the highest (0.51) for family p487.
Discussion
Understanding of the heritability (h2) of a selection trait is critical in designing molecular breeding strategies such as MAS and GS7,8. Using genomic relationship matrix (GRM) in mixed-model equations would provide improved estimation of h2 compared to the pedigree-based relationships10. For a quantitative trait controlled by many genes, the proportion of h2 explained by a SNP would indicate its worthiness for use in traditional MAS. Most traits in this study were under moderate-to-high genetic control, with GRM-based estimates of h2 being comparable with earlier published pedigree-based estimates. For example, h2 of fruit weight (0.68) and acid taste (0.62) were almost identical to those reported by Minamikawa et al.6. The, h2 for fruit firmness in interspecific populations has been reported to be around 0.60–0.7011,12, slightly higher than that found in this study (0.47). Heritability of FINT in interspecific populations was reported to be 0.5413, which is similar to that observed in this study (0.46). However, h2 of fruit sweetness as reported here (0.16) was half of that observed by Abe et al.14 in hybrid seedlings. In our study, the average (over traits) GRM-based h2 was 0.50, which was slightly lower than the pedigree-based estimate (0.60). In general, GRM-based h2 were shown to be in the similar range to pedigree-based estimates in various species15–17, which is consistent with results from this study.
LD is a measure of associations between SNP alleles and the alleles at QTL18. In practice, LD between pairs of SNP markers are obtained because the genomic coordinates of QTLs are generally unknown. The extent of LD obtained in the hybrid seedling populations in this study is higher than that in a population of Asian and European pear germplasm accessions5. For markers separated by 10 cM, the r2 (0.17) in our study is almost identical to that reported (0.18) by Minamikawa et al.6 in full-sib progenies of Japanese pear. The average r2 between the adjacent SNPs was 0.47, which is higher than that reported for pear (0.33)6 and apple (0.32)17.
The higher magnitude of short-range and long-range LD in our study could be a result of the genetic structure of the seedling population. Clustering patterns of families (Fig. 2) reflected sharing of parents between the various families. Strong relatedness between different families, and a bottleneck in the breeding history of parents of these families, could be among the factors underpinning the high observed LD9,18. High LD between the adjacent SNPs also plays a key role in improved accuracy of GS19.
Peak association signals for various traits were located close to genomic regions that have been previously identified. For example, putative QTLs identified for SCUF on LG2, 4 and 10 are in agreement with earlier reports5,20. Similar to our study, there are reports of QTLs for pear fruit shape index on LG2 and LG113,21. Cao et al.22 reported a large effect QTL for apple fruit shape index on LG11, suggesting an orthologous region between apple and pear genomes. A QTL for AVFW was mapped on LG721, which agrees with our results, but some studies3,4 mapped QTLs for AVFW on different linkage groups – suggesting this trait has a complex polygenic nature. As might have been expected given the high genetic correlation between FIRM and CRIS, these traits shared the largest-effect SNPs located on LG10 and LG3. Genomic locations on LG10 and LG3 have previously been shown to be associated with fruit firmness in apple8,17,23. QTLs influencing apple and pear fruit traits (e.g. fruit softening; harvest maturity) have been mapped on the same LGs of apple and pear genomes6,24 – further evidence for high synteny between pear and apple genomes25,26.
The same SNP marker was found to be associated with SWET and JUIC on LG5. A significant genetic correlation (0.41) between these traits (Table 1) would suggest some genes with pleiotropic effects. There are no reports on QTLs for pear SWET, but QTLs for soluble solids concentration (SSC) have been investigated as SSC is the best objective predictor of sensory sweetness27. In agreement with results of this study, QTLs for SSC have been mapped on LG44 and LG53,21. A high correlation was reported between malic acid content and SOUR taste in apple fruit27, hence common QTLs for these fruit traits would be expected. Minamikawa et al.6 mapped a large-effect QTL for acid content in pear fruit on LG6, which supports our GWA results. Unlike our study, there are no QTLs reported for SOUR taste on LG1 and LG13 – suggesting a possibility of population-specific QTL28. Similar to apple FINT QTL29, significant marker-trait associations were observed for pear FINT on LG1 and LG8. SNPs within the apple genes MdCXE4 (LG1) and MdMYB44 (LG8) were shown to be associated with apple FINT and/or fruit acid content29,30. Further work is needed to identify and test the functionality of putative candidate genes underpinning marker-trait associations reported in this study, and evaluate their synteny with apple genes.
Multi-locus GWA could be more powerful than single-locus GWA because they account for LD between SNPs, and a small-effect locus may be more apparent when other large-effect loci are already fitted in the model31,32. Similar to a previous study6 on pear GWA, a higher number of significant SNPs were detected in the multi-locus GWA compared to the single-locus analysis. However, the majority (nearly 90%) of the significant SNPs, especially the large-effect SNPs, were common between the two approaches in our study (Supplementary Table S1). As some small-effect SNPs (e.g. for RUSS) were not detected in the single-locus model, multi-locus GWA could be more powerful in such cases. However, it’s important to note that the threshold (p < 0.001) used to identify significant SNPs in our study is low and the false-discovery rate adjusted p-values were >0.05 for most traits. These results emphasise that an independent validation of the significant SNPs would be required irrespective of the GWA method used.
Understanding the genetic architecture of pear fruit phenotypes is facilitated by having information regarding the distribution of QTL effects. The variance explained by a SNP is mainly a function of the size of the QTL associated with the significant marker and LD between the marker and QTL7. Consequently, the distribution of estimated SNP effects should resemble the distribution of the underlying QTL effects. In this study, the maximum variance explained by a SNP for any trait was very small (<4%) and there were very few SNPs with ASE higher than 0.50 PSD, so gamma distributions best fitted to the observed SNP effects for all fruit traits. These results indicated that many QTLs of small effect and only a few moderate effect QTLs control the fruit traits investigated in this study. Hayes et al.33 also reported a gamma distribution of QTL effect sizes in dairy and pig breeding programmes; this is supported by similar observations in other species34,35. Our results suggest that the response from MAS in most pear fruit phenotypes would be small – hence GS could be a better selection tool. MAS would still be useful for traits controlled by major genes, such as pear red skin colour36.
The fundamental difference between MAS and GS is that the former only utilizes the SNPs that are significant in a GWAS, whereas the latter uses high-density genome-wide SNPs so that all QTLs are expected to be in LD with one or more SNPs. In the case of GS, potentially all the genetic variance for a trait can be tracked because the marker effect does not need to exceed a pre-determined significance threshold to be used to predict breeding value7. In fruit breeding programmes, the traditional MAS for major gene traits (e.g. disease resistance, skin/flesh colour) is followed by orchard testing for polygenic traits – hence the time required for developing new cultivars is not shortened to any great extent. A two-step approach, i.e. MAS for monogenic traits followed by GS for polygenic traits, obviates the need for Stage-1 seedling testing, hence fast-forwarding the development of new cultivars8. This strategy has been evaluated in the New Zealand-based PFR apple breeding programme, and it could be adapted to pear breeding programmes as well.
Various studies on apple have shown that the correlation between sensory and instrumental measures of firmness, acidity, and sweetness were about 0.75, 0.80 and 0.50, respectively27,37, suggesting that genomic prediction accuracies of sensory and instrumental measures could be similar. Minamikawa et al.6 used instrumental measures of some of the sensory traits used in our study. The average prediction accuracy of traits (fruit weight, firmness, sweetness, and sourness), that were common with Minamikawa et al.6, was almost identical (cf. 0.46). For the across-family validation scheme implemented in this study, the genetic relationships between the training and validation families and population-level LD are among the key drivers of genomic prediction accuracy. Genomic predictions were reported to be most accurate when models were trained with some individuals from the validation families because of the close relation between training and validation sets8,38. Prediction accuracies of pear fruit phenotypes could be improved further by increasing the training population size, and also by combining parental and breeding populations6.
When the training and validation samples are observed independently over different sites/environments, prediction accuracies can be lower depending on the magnitude of genotype-by-site interaction39. Training data would need to be obtained from different sites and years in order to develop robust genomic prediction models for pear fruit traits. A common approach for the evaluation of GS in fruit breeding programmes is to cross-validate using validation samples from the same generation6,8,39,40. The accuracy of predicting phenotypes of successive generations would be lower than within-generation accuracy due to marker-QTL LD decay19. Meuwissen et al.41 suggested that Bayesian GS models could outperform GBLUP because they capture marker-trait LD that persists in the successive breeding cycles. However, various studies42,43 based on empirical data suggested very little or no advantage of Bayesian GS models for most traits, so GBLUP method is widely adopted for GS in commercial breeding programmes44,45.
Similar to animal and plant species7,46,47, fruit breeding programmes need to develop unique multi-generation genotype-phenotype datasets to evaluate persistence of accuracy of genomic predictions over several generations under different environmental conditions. Based on the accuracy of GS in our study, we conclude that it shows strong potential to accelerate the pear breeding cycle by making selections prior to extensive fruit-quality phenotyping. Thus, a GWAS-GS combination could be an effective tool for increasing the efficiency of pear breeding programmes.
Method
Plant material
An interspecific pear breeding programme at Plant & Food Research Limited (PFR) New Zealand was initiated in 1986 using commercial cultivars of European, Chinese and Japanese pear as parents. Second-generation populations were created in 1996 using the best selections produced from the first-generation hybrid families as parents36. A subset of the third-generation families created during 2007–08 using best seedlings from the second-generation were used for this study. A total of 12 second-generation selections were used as parents to create nine families and details of mating design and relatedness among the 12 parents were reported earlier36.
As described earlier by Kumar et al.36, a random subset of seedlings that reached a minimum height after growing in a field nursery for 1 year were propagated and planted in PFR’s orchard in Motueka during 2011. Fruit were harvested during the fruiting season in 2015 and 2016 and a random sample of six fruit from each seedling was stored for 28 days at 3 °C, then a further 1 day at 20 °C before evaluation36. Sensory traits including firmness (FIRM), crispness (CRISP), juiciness (JUIC), sweetness (SWET), sourness (SOUR) and flavour intensity (FINT) were evaluated on a scale from 0 (=lowest) to 9 (=highest) by two trained assessors, and one overall score for each trait was given to each seedling. Russet (RUSS) was scored visually on a scale from 0 (=lowest) to 9 (=highest). Scuffing (SCUF) was rated on a 0–9 scale (0 = no darkening; 9 = solid brown or black colouration) following the method described by Brewer et al.48. Fruit shape index (SHAP) was measured using a two dimensional shape chart5 and fruit weight (AVFW) was measured as the average weight of the six fruits.
DNA extraction, variants discovery, and linkage map construction
Protocols for DNA extraction, genotyping-by-sequencing (GBS) library preparation, and variant calling were reported earlier as the SNP data used in this study are those used for fine-mapping of pear red skin colour gene in the same nine families36. Resulting SNPs were used to construct a consensus linkage map as reported earlier by Kumar et al.36. Briefly, a total of 16 paternal and maternal maps were constructed using Joinmap v4.1 software, and then common SNPs shared by at least three maps were selected as a bridge to merge the maps using R package MergeMap. Finally, 7,509 high quality SNPs were mapped and used for GWA and GS in this study.
Genomic BLUP model
Phenotypes adjusted for fixed effects (e.g. year and assessor effect) were used for estimation of variance components and best linear unbiased prediction (BLUP) of additive effects using the following model17:
1 |
where y is a vector of adjusted phenotypes; μ is an intercept, 1n is a vector of 1 s; Z is the known design matrix relating to a, the unknown vector of random additive genetic effects with a ~ N(0, G ). The scalar is the additive variance and e is a vector of independent random deviates with variance . A realized (or genomic) relationship matrix (G)49 was obtained using all available SNPs, and Eq. (1) was implemented in software ASReml v3.050. Estimates of variance components derived from Eq. (1) were used for calculating narrow-sense heritability (h2) as the ratio of additive () to phenotypic variance (). Product-moment correlations between breeding values were used as estimates of genetic correlation among various traits.
Model validation
The dataset in this study was composed of nine families, so each family in-turn was used as a validation population (VP) and the remning eight families were used as a training population (TP). GBLUP of seedlings in a VP were predicted by fitting Eq. (1) where their phenotypes were considered as missing values. Prediction accuracy was obtained as the correlation between observed and predicted BLUP of individuals in the VP. This process was repeated so that all nine families were predicted. The mean (averaged over nine validation sets) accuracy and its standard error were presented for each trait.
Estimation of linkage disequilibrium (LD) and genetic architecture
Estimates of LD (r2), derived from allele frequencies at a pair of loci, were calculated between SNPs located on the same linkage group36. The r2 estimates were corrected for population structure and cryptic relatedness, and then the average across LGs were plotted against pairwise genetic distance using R package LdcorSV51. The LD decay curve was fitted using a standard logarithmic function. Genetic architecture, in terms of allele substitution effects (ASE) of genome-wide SNPs, of each trait was investigated using the single-locus unified mixed linear model (MLM) approach that accounts for family structure and cryptic relationships as implemented in R package GAPIT52. The following MLM was implemented for single-locus GWA52:
2 |
where y is a vector of adjusted phenotypes; β is an unknown vector containing estimates of fixed effects (overall mean, ASE of the SNP, and population structure); X and Z are the known design matrices relating to β and a (the unknown vector of random additive genetic effects with variance G ), respectively. The scalar is the additive variance, G is the realised or genomic relationship matrix (GRM)49, and ε is a vector of independent random deviates with variance . The estimated ASEs of all SNPs were expressed in phenotypic standard deviation units and the observed distribution of SNP effects was compared with theoretical density functions (Normal, Gamma, Exponential) using Kolmogorov-Smirnov test implemented in R package ‘fitdistrplus’53. The proportion of phenotypic variance explained by each SNP was also calculated using R package GAPIT.
Multi-locus GWA methods could potentially be more powerful than single-locus GWA especially for complex traits. Two multi-locus methods were evaluated in this study; first, a multi-locus mixed model (MLMM31) which accounts for population structure and cryptic relatedness and uses a stepwise regression with forward inclusion and backward elimination of SNPs as fixed cofactors; second, a multi-locus random-SNP-effect mixed linear model (MRMLM32) which fits all SNPs simultaneously as random effects and also accounts for population structure and cryptic relatedness (https://cran.r-project.org/web/packages/mrMLM.GUI/index.html).
Supplementary information
Acknowledgements
This research was partly funded by the New Zealand Ministry of Business, Innovation and Employment (MBIE). Funding from the National Science Foundation of China (31820103012) is thankfully acknowledged. The PFR Markers & Mapping Team is thanked for their help in collecting leaf samples. Marlene Aldworth helped in fruit harvest and phenotyping. Constructive feedback from Margaret Carpenter and Luis Gea is gratefully acknowledged.
Author Contributions
S.K. and L.B. conceived and designed the study. C.W. and C.K. performed leaf collection and GBS library preparation. C.H.D. conducted bioinformatics analysis, and M.Q. and J.W. constructed linkage map. A.S. and L.B. led the phenotyping. S.K. performed GWA and genomic predictions and wrote the first draft. All authors helped to edit the manuscript, and read and approved the manuscript.
Competing Interests
The authors declare no competing interests.
Footnotes
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary information accompanies this paper at 10.1038/s41598-019-45618-w.
References
- 1.da Silva, G. J., Souza, T. M., Barbieri, R. L. & Oliveira, A. C. D. Origin, domestication, and dispersing of pear (Pyrus spp.). Adv. Agr. e541097 (2014).
- 2.Brewer LR, Palmer JW. Global pear breeding programmes: goals, trends and progress for new cultivars and new rootstocks. Acta Hortic. 2011;909:105–120. doi: 10.17660/ActaHortic.2011.909.10. [DOI] [Google Scholar]
- 3.Wu J, et al. High-density genetic linkage map construction and identification of fruit-related QTLs in pear using SNP and SSR markers. J. Exp. Bot. 2014;65:5771–5781. doi: 10.1093/jxb/eru311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Yamamoto T, et al. Identification of QTLs controlling harvest time and fruit skin colour in Japanese pear (Pyrus pyrifolia Nakai) Breed. Sci. 2014;64:351–361. doi: 10.1270/jsbbs.64.351. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kumar S, et al. Genotyping-by-sequencing of pear (Pyrus spp.) accessions unravels novel patterns of genetic diversity and selection footprints. Hortic. Res. 2017;4:17015. doi: 10.1038/hortres.2017.15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Minamikawa MF, et al. Genome-wide association study and genomic prediction using parental and breeding populations of Japanese pear (Pyrus pyrifolia Nakai) Sci. Rep. 2018;8:11994. doi: 10.1038/s41598-018-30154-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hayes BJ, Bowman PJ, Chamberlain AJ, Goddard ME. Invited review: Genomic selection in dairy cattle: Progress and challenges. J. Dairy Sci. 2009;92:433–443. doi: 10.3168/jds.2008-1646. [DOI] [PubMed] [Google Scholar]
- 8.Kumar S, et al. Genomic selection for fruit quality traits in apple (Malus × domestica Borkh.) Plos One. 2012;7:e36674. doi: 10.1371/journal.pone.0036674. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Iwata H, et al. Potential assessment of genome-wide association study and genomic selection in Japanese pear Pyrus pyrifolia. Breed. Sci. 2013;63:125–140. doi: 10.1270/jsbbs.63.125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Speed D, Hemani G, Johnson MR, Balding DJ. Improved heritability estimation from genome-wide SNPs. Am. J. Hum. Genet. 2012;91:1011–1021. doi: 10.1016/j.ajhg.2012.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Shin IS, Shin YU, Hwang HS. Heritability of fruit characters of interspecific hybrids between Pyrus pyrifolia and P. ussuriensis or P. breschneideri. Acta Hortic. 2008;800:535–540. doi: 10.17660/ActaHortic.2008.800.70. [DOI] [Google Scholar]
- 12.Saito T. Advances in Japanese pear breeding in Japan. Breed. Sci. 2016;66:46–59. doi: 10.1270/jsbbs.66.46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Bell RL, Janick J. Quantitative Genetic Analysis of Fruit Quality in Pear. J. Am. Soc. Hortic. Sci. 1990;115:829–834. doi: 10.21273/JASHS.115.5.829. [DOI] [Google Scholar]
- 14.Abe K, Saito Y, Kurihara A, Kotobuki K. Narrow-sense heritability of fruit characters in Japanese pear (Pyrus pyrifolia Nakai) Breed. Sci. 1995;45:1–5. [Google Scholar]
- 15.Yang J, et al. Genome partitioning of genetic variation for complex traits using common SNPs. Nat. Genet. 2011;43:519–525. doi: 10.1038/ng.823. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Fan B, et al. Genome-wide association study identifies loci for body composition and structural soundness traits in pigs. Plos One. 2011;6:e14726. doi: 10.1371/journal.pone.0014726. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Kumar S, et al. Novel genomic approaches unravel genetic architecture of complex traits in apple. BMC Genomic. 2013;14:93. doi: 10.1186/1471-2164-14-93. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Flint-Garcia SA, Thornsberry JM, Buckler ES. Structure of linkage disequilibrium in plants. Annu. Rev. Phys. Chem. 2002;53:319–348. doi: 10.1146/annurev.physchem.53.100301.131630. [DOI] [PubMed] [Google Scholar]
- 19.Habier D, Fernando RL, Dekkers JC. The impact of genetic relationship information on genome-assisted breeding values. Genetics. 2007;177:2389–2397. doi: 10.1534/genetics.107.081190. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Saeed M, et al. Genetic, metabolite and developmental determinism of fruit friction discolouration in pear. BMC Plant Biol. 2014;14:1. doi: 10.1186/s12870-014-0241-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Zhang RP, et al. An AFLP, SRAP, and SSR genetic linkage map and identification of QTLs for fruit traits in pear (Pyrus L.) Plant Mol. Biol. Rep. 2013;31:678–687. doi: 10.1007/s11105-012-0544-1. [DOI] [Google Scholar]
- 22.Cao K, et al. Candidate gene prediction via quantitative trait locus analysis of fruit shape index traits in apple. Euphytica. 2015;206:381–391. doi: 10.1007/s10681-015-1488-y. [DOI] [Google Scholar]
- 23.McClure KA, et al. A genome-wide association study of apple quality and scab resistance. Plant Genome. 2018;11:170075. doi: 10.3835/plantgenome2017.08.0075. [DOI] [PubMed] [Google Scholar]
- 24.Costa FA, et al. Map position and functional allelic diversity of Md-Exp7, a new putative expansin gene associated with fruit softening in apple (Malus × domestica Borkh.) and pear (Pyrus communis) Tree Genet. Genomes. 2008;4:575–586. doi: 10.1007/s11295-008-0133-5. [DOI] [Google Scholar]
- 25.Yamamoto T, et al. Genetic linkage maps of Japanese and European pears aligned to the apple consensus map. Acta Hortic. 2004;663:51–66. doi: 10.17660/ActaHortic.2004.663.2. [DOI] [Google Scholar]
- 26.Wu J, et al. The genome of the pear (Pyrus bretschneideri Rehd.) Genome Res. 2013;23:396–408. doi: 10.1101/gr.144311.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Harker FR, et al. Sensory interpretation of instrumental measurements 2: sweet and acid taste of apple fruit. Postharvest Biol. Tech. 2002;24:241–250. doi: 10.1016/S0925-5214(01)00157-0. [DOI] [Google Scholar]
- 28.Würschum T. Mapping QTL for agronomic traits in breeding populations. Theor. Appl. Genet. 2012;125:201–10. doi: 10.1007/s00122-012-1887-6. [DOI] [PubMed] [Google Scholar]
- 29.Kumar S, et al. Genome-wide scans reveal genetic architecture of apple flavour volatiles. Mol. Breed. 2015;35:118. doi: 10.1007/s11032-015-0312-7. [DOI] [Google Scholar]
- 30.Jia D, et al. Apple fruit acidity is genetically diversified by natural variations in three hierarchical epistatic genes Md SAUR 37, Md PP 2 CH and Md ALMTII. Plant J. 2018;95:427–443. doi: 10.1111/tpj.13957. [DOI] [PubMed] [Google Scholar]
- 31.Segura V, et al. An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations. Nat. Genet. 2012;44:825–830. doi: 10.1038/ng.2314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Wang SB, et al. Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology. Sci. Rep. 2016;6:19444. doi: 10.1038/srep19444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Hayes BE, Goddard ME. The distribution of the effects of genes affecting quantitative traits in livestock. Genet. Sel. Evol. 2001;33:209. doi: 10.1186/1297-9686-33-3-209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Albert AY, et al. The genetics of adaptive shape shift in stickleback: pleiotropy and effect size. Evolution. 2008;62:76–85. doi: 10.1111/j.1558-5646.2007.00259.x. [DOI] [PubMed] [Google Scholar]
- 35.Caballero A, Tenesa A, Keightley PD. The nature of genetic variation for complex traits revealed by GWAS and regional heritability mapping analyses. Genetics. 2015;201:1601–1613. doi: 10.1534/genetics.115.177220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kumar S, et al. Fine-mapping and validation of the genomic region underpinning pear red skin colour. Hortic. Res. 2019;6:29. doi: 10.1038/s41438-018-0112-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Oraguzie N, et al. Postharvest assessment of fruit quality parameters in apple using both instruments and an expert panel. Postharvest Biol. Tech. 2009;52:279–87. doi: 10.1016/j.postharvbio.2009.01.004. [DOI] [Google Scholar]
- 38.Legarra A, Robert-Granié C, Manfredi E, Elsen JM. Performance of genomic selection in mice. Genetics. 2008;180:611–618. doi: 10.1534/genetics.108.088575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Burgueño J, de los Campos G, Weigel K, Crossa J. Genomic prediction of breeding values when modeling genotype-environment interaction using pedigree and dense molecular markers. Crop Sci. 2012;52:707–719. doi: 10.2135/cropsci2011.06.0299. [DOI] [Google Scholar]
- 40.Biscarini F, et al. Genome-enabled predictions for fruit weight and quality from repeated records in European peach progenies. BMC Genomics. 2017;18:432. doi: 10.1186/s12864-017-3781-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Meuwissen TH, Hayes BJ, Goddard ME. Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001;157:1819–1829. doi: 10.1093/genetics/157.4.1819. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Zhang X, et al. Rapid cycling genomic selection in a multiparental tropical maize population. G3-Genes Genom. Genet. 2017;7:2315–2326. doi: 10.1534/g3.117.043141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hofheinz N, Borchardt D, Weissleder K, Frisch M. Genome-based prediction of test cross performance in two subsequent breeding cycles. Theor. Appl. Genet. 2012;125:1639–1645. doi: 10.1007/s00122-012-1940-5. [DOI] [PubMed] [Google Scholar]
- 44.Hickey JM, et al. Genomic prediction unifies animal and plant breeding programs to form platforms for biological discovery. Nat. Genet. 2017;49:1297–1302. doi: 10.1038/ng.3920. [DOI] [PubMed] [Google Scholar]
- 45.Wiggans GR, Cole JB, Hubbard SM, Sonstegard TS. Genomic selection in dairy cattle: the USDA experience. Annu. Rev. Anim. Biosci. 2017;5:309–327. doi: 10.1146/annurev-animal-021815-111422. [DOI] [PubMed] [Google Scholar]
- 46.Wolc A, et al. Persistence of accuracy of genomic estimated breeding values over generations in layer chickens. Genet. Sel. Evol. 2011;43:23. doi: 10.1186/1297-9686-43-23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Zhang X. Rapid cycling genomic selection in a multi-parental tropical maize population. G3-Genes Genom. Genet. 2017;7:2315–2326. doi: 10.1534/g3.117.043141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Brewer LR, Morgan C, Alspach PA, Volz RK. Heritability and parental breeding value estimates of abrasion-induced skin discolouration on pear fruit. Acta Hortic. 2011;909:127–135. doi: 10.17660/ActaHortic.2011.909.12. [DOI] [Google Scholar]
- 49.VanRaden PM. Efficient methods to compute genomic predictions. J. Dairy Sci. 2008;91:4414–4423. doi: 10.3168/jds.2007-0980. [DOI] [PubMed] [Google Scholar]
- 50.Gilmour, A. R., Gogel, B. J., Cullis, B. R. & Thompson, R. ASReml User Guide Release 3.0.VSN International Ltd, Hemel Hempstead, United Kingdom. (2009).
- 51.Mangin B, et al. Novel measures of linkage disequilibrium that corrects the bias due to population structure and relatedness. Heredity. 2012;108:285–291. doi: 10.1038/hdy.2011.73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Lipka AE, et al. GAPIT: Genome association and prediction integrated tool. Bioinformatics. 2012;28:2397–2399. doi: 10.1093/bioinformatics/bts444. [DOI] [PubMed] [Google Scholar]
- 53.Delignette-Muller ML, Dutang C. Fitdistrplus: An R package for fitting distributions. J. Stat. Software. 2015;64:1–34. doi: 10.18637/jss.v064.i04. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.