Skip to main content
PLOS One logoLink to PLOS One
. 2021 Feb 4;16(2):e0246468. doi: 10.1371/journal.pone.0246468

Allelic composition of carotenoid metabolic genes in 13 founders influences carotenoid composition in juice sac tissues of fruits among Japanese citrus breeding population

Hiroshi Fujii 1,#, Keisuke Nonaka 1,#, Mai F Minamikawa 2,#, Tomoko Endo 1,#, Aiko Sugiyama 3,#, Kosuke Hamazaki 2,#, Hiroyoshi Iwata 2,#, Mitsuo Omura 3,#, Takehiko Shimada 1,*,#
Editor: Chunxian Chen4
PMCID: PMC7861536  PMID: 33539435

Abstract

To enrich carotenoids, especially β-cryptoxanthin, in juice sac tissues of fruits via molecular breeding in citrus, allele mining was utilized to dissect allelic variation of carotenoid metabolic genes and identify an optimum allele on the target loci characterized by expression quantitative trait (eQTL) analysis. SNPs of target carotenoid metabolic genes in 13 founders of the Japanese citrus breeding population were explored using the SureSelect target enrichment method. An independent allele was determined based on the presence or absence of reliable SNPs, using trio analysis to confirm inheritability between parent and offspring. Among the 13 founders, there were 7 PSY alleles, 7 HYb alleles, 11 ZEP alleles, 5 NCED alleles, and 4 alleles for the eQTL that control the transcription levels of PDS and ZDS among the ancestral species, indicating that some founders acquired those alleles from them. The carotenoid composition data of 263 breeding pedigrees in juice sac tissues revealed that the phenotypic variance of carotenoid composition was similar to that in the 13 founders, whereas the mean of total carotenoid content increased. This increase in total carotenoid content correlated with the increase in either or both β-cryptoxanthin and violaxanthin in juice sac tissues. Bayesian statistical analysis between allelic composition of target genes and carotenoid composition in 263 breeding pedigrees indicated that PSY-a and ZEP-e alleles at PSY and ZEP loci had strong positive effects on increasing the total carotenoid content, including β-cryptoxanthin and violaxanthin, in juice sac tissues. Moreover, the pyramiding of these alleles also increased the β-cryptoxanthin content. Interestingly, the offset interaction between the alleles with increasing and decreasing effects on carotenoid content and the epistatic interaction among carotenoid metabolic genes were observed and these interactions complexed carotenoid profiles in breeding population. These results revealed that allele composition would highly influence the carotenoid composition in citrus fruits. The allelic genotype information for the examined carotenoid metabolic genes in major citrus varieties and the trio-tagged SNPs to discriminate the optimum alleles (PSY-a and ZEP-e) from the rest would promise citrus breeders carotenoid enrichment in fruit via molecular breeding.

Introduction

Carotenoids are lipophilic isoprenoid pigments biosynthesized from 5-carbon isoprene units and C40 carotenoids and their derived C30 apocarotenoids are the most abundant in nature. Most photosynthetic organisms produce carotenoids, which are essential for the survival of both plants and animals [1]. Carotenoids are involved in various functions in plants, including phyto-hormone precursor action [2] and environmental adaptation through the modulation of the photosynthetic apparatus [3]. Approximately 115 different carotenoids have been reported in citrus fruits, wherein the color of the fruit and peel are caused by carotenoid accumulation [4], and carotenoid content and composition vary among citrus varieties. For example, Satsuma mandarin (Citrus unshiu Marc.) and Ponkan mandarin (C. reticulata Blanco) predominantly accumulate β-cryptoxanthin in the juice sacs (1.5 mg/100FWG and 0.9 mg/100FWG, respectively), which are the major sources of β-cryptoxanthin in nature [5]. In contrast, Valencia orange (C. sinensis Osbeck) mainly accumulates violaxanthin isomers with 9-cis-violaxanthin (0.9 mg/100FWG) as the principal carotenoid. Lisbon lemon (C. limon Burm. f.) also accumulates β-cryptoxanthin as the principal carotenoid, but at a much lower level than Satsuma mandarin and Valencia orange. Epidemiologic studies have suggested that dietary intake of β-cryptoxanthin reduces the risks of eye diseases, certain cancers, osteoporosis and inflammation [69]. Therefore, in the Japanese citrus breeding program, the enrichment of carotenoids with health-promoting properties, especially β-cryptoxanthin, is an important objective with the aim of expanding citrus fruit consumption. Till date, our research institute has released β-cryptoxanthin enrichment varieties obtained by conventional cross breeding, such as ‘Seinanohikari’ (2.3mg/100gFWG) and ‘Tsunokagayaki’ (2.3mg/100gFWG) and ‘Tamami’ (2.0mg/100gFWG). The total carotenoid content and β-cryptoxanthin content of their parent varieties are not higher than those varieties, indicating a complicated regulation system involved in controlling the accumulation and composition of carotenoids by plural genetic loci in citrus fruits. The carotenoid composition varies among citrus varieties rather than species, and the carotenoid diversity in cultivated citrus is highly influenced by genetic factors [10]. Numerous carotenoid metabolic genes have been physiologically characterized at the molecular level [1113], including several transcription factors, such as CubHLH1 [14], an R2R3-MYB transcription factor, CrMYB68 [15], and CsMADS6 [16]. However, the molecular mechanism for how carotenoid composition extends the variation among cultivars remains unclear.

To advance molecular breeding for the enrichment of carotenoids, especially β-cryptoxanthin, Sugiyama et al. [17] examined quantitative trait locus (QTL) mapping for carotenoid composition using a bi-parental population. QTLs for β-cryptoxanthin and total carotenoid content overlapped at the Gn0005 locus in linkage group 6 of the pollen parent map, and their logarithm (base 10) of odds (LOD) value was around 3.0. This locus was later found to be adjacent to the locus of 9-cis-epoxycarotenoid dioxygenase (NCED) by comparing the linkage map and genome sequences of clementine mandarins (https://phytozome.jgi.doe.gov/pz/portal.html) [18] and Satsuma mandarin using the Mikan Genome Database (MiGD, https://mikan.dna.affrc.go.jp/) [19]. Various QTLs for other carotenoid components were also detected, but most of them expressed low LOD values (less than 2.5). Sugiyama et al. [20] extended the study to compare the genetic locus and expression QTL (eQTL) of carotenoid metabolic genes using a bi-parental population (Fig 1). Against the genetic loci of the major carotenoid metabolic genes, phytoene synthase (PSY) on linkage group (LG)-4, phytoene desaturase (PDS) on LG-3, ζ-carotene desaturase (ZDS) on LG-9, lycopene β-cyclase (LCYb), and β-ring hydroxylase (HYb) on LG-3, zeaxanthin epoxidase (ZEP) on LG-2, and NCED on LG-06, along with eQTLs of PSY, HYb, and ZEP could be mapped on the loci of their corresponding genes, revealing that their transcription is regulated primarily by cis-elements in their promoter regions. In contrast, the eQTLs of both PDS and ZDS were mapped around the genetic locus of the Tf0271 DNA marker on LG-8, indicating that transcription would be regulated by trans-elements in this genetic region. Thus, the mode of transcriptional regulation revealed a difference in each gene. There have been several reports that allelic differences in carotenoid metabolic genes have been involved in the modulation of carotenogenesis in crops. In wheat grain (Triticum turgidum L. subsp. durum (Desf.) Husn.), the allelic divergence of PSY may be responsible for the grain’s yellow pigment content [21]. In maize (Zea. mays subsp. mays (L.) Iltis), one PSY allele with an insertion in its promoter region increased the expression in endosperm and the carotenoid content of yellow maize [22]. In Satsuma mandarins, allelic combination has caused the transcriptional variation of PSY and ZEP in mature fruit [23, 24]. Considering that transcriptional variation among carotenoid metabolic genes is significantly associated with the carotenoid composition and content among varieties [5] and Japanese cultivars comprise admixture genomes derived from the limited ancestral species [25], it is possible that the allelic combinations among carotenoid metabolic genes would likely be one of the major factors influencing the carotenoid diversity across citrus varieties.

Fig 1. Carotenoid metabolic pathway in citrus fruits and 5 target genes in allele mining system for the enrichment of β-cryptoxanthin in juice sac tissues of fruits.

Fig 1

Notes: 5 target genes responsible for enrichment of carotenoids in juice sac tissues of fruits are selected from the past QTL and eQTL analyses [15, 17].

In this study, we applied an allele mining approach to understand the molecular mechanism by which cultivars extend the diversity of carotenoid composition as well as to develop an allele mining system for enrichment of carotenoids, especially β-cryptoxanthin, in juice sac tissues of fruits. Allele mining is a promising technique for dissecting the naturally occurring allelic variation in candidate genes controlling key agronomic traits that have potential applications in crop improvement programs [26]. Among the various carotenoid metabolic genes, the loci of PSY, HYb, ZEP, NCED, and a putative causative gene on eQTLs that control the transcription level of PDS and ZDS (TCL) were selected as major candidates to strongly influence the carotenoid content and composition in juice sac tissues of fruits, as determined from previous studies [15, 17]. The independent alleles of five target genes in 13 ancestral varieties (founder) of the Japanese citrus breeding population were explored based on single nucleotide polymorphisms (SNPs) by SureSelect target enrichment analysis. PSY, HYb, ZEP, and TCL possessed 7, 11, 5, and 4 independent alleles in 13 founders, respectively, while some of the 13 founders shared independent alleles derived from ancestral species. A Bayesian statistical analysis was applied to the association analysis between allelic composition and carotenoid composition in 263 breeding pedigrees. Our findings reveal PSY and ZEP as key genes that influenced the total carotenoid content and β-cryptoxanthin content in juice sac tissues of fruits. We further discussed the ability of the allele mining system to enrich carotenoid content in juice sac tissues of fruits.

Materials and methods

Plant materials

All plants used in the experiments were cultivated in the research field of the National Agriculture and Food Research Organization Institute of Fruit and Tea Tree Science, Citrus Research Center, Okitsu, Shizuoka, Japan (Table 1). Genomic DNA was extracted from fresh and fully expanded leaves of these plants according to the method described by Dellaporta et al. [27]. Juice sac tissues were collected from mature fruits at their harvest time and were immediately frozen by liquid nitrogen for liquid chromatography analysis of carotenoid components.

Table 1. Plant materials used in this study.

Code Name Academic name/ Parental combination Breeding generation
1 Dancy tangerine Citrus tangerina hort. ex Tanaka Founder
2 Grapefruit (cv. Duncan) C. paradisi Macfad. Founder
3 Kishu mikan (cv. Mukakukishu) C. kinokuni hort. ex Tanaka Founder
4 Buntan pumelo (cv. Tanikawa buntan) C. grandis Osbeck Founder
5 Hassaku C. hassaku hort. ex Tanaka Founder
6 Hyuganatsu C. tamurana hort. ex Tanaka Founder
7 Sweet orange (cv. Trovita) C. sinensis (L.) Osbeck Founder
8 Iyo C. iyo hort. ex Tanaka Founder
9 Kunenbo mandarin C. nobilis Lour. var. kunep Tanaka Founder
10 Ponkan mandarin (cv. Yoshida) C. reticulata Blanco Founder
11 Willowleaf mandarin C. deliciosa Ten. Founder
12 King mandarin C. nobilis Lour. Founder
13 Murcott Hybrid Founder
14 Clementine mandarin C. clementina hort. ex Tanaka (11 × 7) Natural cross between founders (G0)
15 Satsuma mandarin (cv. Miyagawa-Wase) C. unshiu Marc. (3 × 9) Natural cross between founders (G0)
16 Minneola 2 × 1 1st generation (G1)
17 Seminole 2 × 1 1st generation (G1)
18 Orland 2 × 1 1st generation (G1)
19 Southern Yellow 4 × 3 1st generation (G1)
20 Nankou 15 × 14 1st generation (G1)
21 Ariake 7 × 14 1st generation (G1)
22 Sweet Spring 15 × 5 1st generation (G1)
23 Awa orange 6 × 7 1st generation (G1)
24 JHG 15 × 6 1st generation (G1)
25 Kiyomi 15 × 7 1st generation (G1)
26 Aki-tangor 15 × 7 1st generation (G1)
27 HF9 15 × 7 1st generation (G1)
28 Hayaka 15 × 10 1st generation (G1)
29 Kankitsu Chukanbohon Nou 6 Gou 12 × 3 1st generation (G1)
30 Kara 15 × 12 1st generation (G1)
31 Encore 12 × 11 1st generation (G1)
32 Wilking 12 × 11 1st generation (G1)
33 Page 16 × 14 2nd generation (G2)
34 Robinson 14 × 18 2nd generation (G2)
35 Lee 14 × 18 2nd generation (G2)
36 Fairchild 14 × 18 2nd generation (G2)
37 Fortune 14 × 18 2nd generation (G2)
38 Nova 14 × 18 2nd generation (G2)
39 Osceola 14 × 18 2nd generation (G2)
40 Seihou 25 × 16 2nd generation (G2)
41 Akemi 25 × 17 2nd generation (G2)
42 Okitsu 46 gou 22 × 7 2nd generation (G2)
43 Nishinokaori 25 × 7 2nd generation (G2)
44 KyOw No.21 25 × 15 2nd generation (G2)
45 KyOw No.14 25 × 15 2nd generation (G2)
46 Tsunokaori 25 ×15 2nd generation (G2)
47 Shiranuhi 25 × 10 2nd generation (G2)
48 Youkou 25 × 10 2nd generation (G2)
49 Harumi 25 × 10 2nd generation (G2)
50 Setomi 25 × 10 2nd generation (G2)
51 EnOw No.21 31 ×15 2nd generation (G2)
52 Kuchinotsu 39 gou 31 ×15 2nd generation (G2)
53 Miho-core 15 × 31 2nd generation (G2)
54 KyEn No.4 25 × 31 2nd generation (G2)
55 KyEn No.5 25 × 31 2nd generation (G2)
56 Tsunonozomi 25 × 31 2nd generation (G2)
57 Amaka 25 × 31 2nd generation (G2)
58 Okitsu 45 gou 25 × 32 2nd generation (G2)
59 Tamami 25 × 32 2nd generation (G2)
60 Benibae 27 × 31 2nd generation (G2)
61 Hareyaka 31 × 10 2nd generation (G2)
62 Amakusa 45 × 33 3rd generation (G3)
63 Kuchinotsu 38 gou 44 × 34 3rd generation (G3)
64 Kankitsu Chukanbohon Nou 5 Gou 35 × 3 3rd generation (G3)
65 E-647 25 × 39 3rd generation (G3)
66 Southern Red 30 × 51 3rd generation (G3)
67 Kuchinotsu 28 gou 44 × 1 3rd generation (G3)
68 Haruhi 42 × 23 3rd generation (G3)
69 2700OIy-25 43 × 8 3rd generation (G3)
70 No.1408 51 × (25 × 8) 3rd generation (G3)
71 Kuchinotsu 18 gou 44 × 31 3rd generation (G3)
72 Kuchinotsu 35 gou 44 × 31 3rd generation (G3)
73 Kanpei 43 × 10 3rd generation (G3)
74 Okitsu 57 gou 42 × 49 3rd generation (G3)
75 Asumi 42 × 49 3rd generation (G3)
76 Asuki 42 × 49 3rd generation (G3)
77 Seinannohikari 51 × 48 3rd generation (G3)
78 Kuchinotsu 27 gou 51 × 48 3rd generation (G3)
79 Kuchinotsu 33 gou 45 × 31 3rd generation (G3)
80 Tsunokagayaki 45 × 31 3rd generation (G3)
81 Setoka 54 × 13 3rd generation (G3)
82 Kuchinotsu 36 gou ns × 13 3rd generation (G3)
83 Reikou ns × 13 3rd generation (G3)
84 Ehime Kashi No.28 20 × 62 4th generation (G4)
85 Kuchinotsu 49 gou 63 × 34 4th generation (G4)
86 Okitsu 56 gou 58 × 64 4th generation (G4)
87 No.1011 20 × 69 4th generation (G4)
88 Kuchinotsu 51 gou ns × 78 4th generation (G4)
89 Harehime 65 × 15 4th generation (G4)
90 Okitsu 63 gou 65 × 10 4th generation (G4)
91 Kuchinotsu 52 gou 80 × 21 4th generation (G4)
92 Mihaya 56 × 70 4th generation (G4)
93 Kuchinotsu 54 gou 82 × 29 4th generation (G4)
94 080716 84 × 77 5th genaration (G5)
95 Okitsu 67 gou 89 × 86 5th genaration (G5)
~105 10 strains 29 × 7 Hybrid Seedling
~120 15 strains 26 × 19 Hybrid Seedling
~138 18 strains 11 × 88 Hybrid Seedling
~154 16 strains C. tankan Hayata × 71 Hybrid Seedling
~175 21 strains 95 × 71 Hybrid Seedling
~201 26 strains 81 × 70 Hybrid Seedling
~212 11 strains 68 × Soren tangero Hybrid Seedling
~232 20 strains 7 × 65 Hybrid Seedling
~244 12 strains 68 × 3 Hybrid Seedling
~266 22 strains 26 × 52 Hybrid Seedling
~269 3 strains 76 × 11 Hybrid Seedling
~275 6 strains 14 × 4 Hybrid Seedling

SureSelect target enrichment of carotenoid metabolic genes in 13 founders

Genomic DNA of 13 founders (Table 1) was used for SureSelect target enrichment. The genomic DNA samples were randomly fragmented using a SureSelect QXT Reagent Kit (Agilent Technologies, Santa Clara, CA, USA) and amplified using SureSelect primer Mix (Agilent Technologies). The adapter-attached DNA libraries were hybridized to the SureSelectXT custom library, and the captured DNA was purified using Dynabeads MyOne Streptavidin T1 beads (Thermo Fisher, Waltham, MA, USA). The nucleic acids of the target carotenoid metabolic genes (PSY, HYb, ZEP, and NCED) were enriched with index primers. Index primers for each target gene were designed using the clementine mandarin genome sequence ver. 1.0 as reference (https://phytozome.jgi.doe.gov/pz/portal.html), as follows: PSY (Ciclev10011841m.g), HYb (Ciclev10005481m.g), ZEP (Ciclev10025089m.g), and NCED (Ciclev10019364m.g). The enriched fragments were purified using Agencourt AMPure XP (Beckman Coulter, Brea, CA, USA). The purified products were subjected to next-generation sequencing (NGS) analysis in a single lane on a HiSeq2500 system (Illumina, San Diego, CA, USA) with a paired-end read length of 100 bp. Low-quality bases and Illumina sequencing adapters were trimmed using cutadapt v1.1 (https://cutadapt.readthedocs.org/en/stable/) and Trimmomatic v0.32 (http://www.usadellab.org/cms/index.php?page=trimmomatic). Trimmed reads were mapped to the clementine genome sequence v1.0 using BWA v0.7.10 (http://bio-bwa.sourceforge.net/), GATK Lite v2.3.0 (https://www.broadinstitute.org/gatk/), and Picard v1.133 (http://broadinstitute.github.io/picard/). Mutation analysis was carried out using the Samtools v1.2 (http://www.htslib.org/man/Samtools/) and BCFtools ver. 1.2 (http://www.htslib.org/man/bcftools/). Library preparation, Illumina PE sequencing, mapping, and mutation analyses of PE reads were performed by Hokkaido System Science Co., Ltd. (Sapporo, Hokkaido, Japan).

SNP genotyping by GoldenGate assay and Fluidigm assay

SNP genotyping was carried out using the GoldenGate assay (Illumina) and a Fluidigm assay (Fluidigm, South San Francisco, CA, USA). For the GoldenGate assay, bead arrays were designed using the Illumina® Assay Design Tool according to the manufacturer’s instructions. SNP genotyping analysis was performed using the Goldengate Assay system on an iScan system (Illumina). The assay was performed according to the manufacturer’s instructions. The scanned image data were converted to genotype scores using the Genome Studio software by a function of the Genotyping module (Illumina).

For the Fluidigm assay, TaqMan minor groove binder (TaqMan-MGB) probe and primer sets were designed using Primer Express ver. 3.0 (Applied Biosystems, Foster City, CA, USA). 5-carboxyl-fluorescein (FAM) and hexachloro-fluorescein (HEX) were used to label the 5-end of the oligonucleotides. Genotyping was carried out using SNPType chemistry (Fluidigm Corp., South San Francisco, CA, USA) on a Fluidigm Nanofluidic 96.96 Dynamic Array [28], according to the manufacturer’s instructions. Thermal cycling comprised an initial thermal mix cycle (70°C for 30 min; 25°C for 10 min), followed by a hot-start Taq polymerase activation step (94°C for 15 min) and a touchdown amplification protocol, as follows: 10 cycles of 94°C for 20 s, 65°C for 1 min (decreasing 0.8°C per cycle); then 26–46 cycles of 94°C for 20 s, 57°C for 1 min, with a hold at 20°C for 30 s after every four cycles to collect end-point fluorescent images of the chip using the Biomark imager (Fluidigm Corp). Data were analyzed using the Fluidigm SNP Genotyping Analysis Software with dye settings MGB-FAM and MGB-HEX.

Filtering of inheritable SNPs in parent-offspring trios

The SNPs detected by the GoldenGate assay and Fluidigm assay were used to evaluate the inheritability and repulsion in gametes using MARCO software [29] at the condition without any discrepancy among 78 combinations of parent-offspring trios. A reliable SNP with confirmed heritability between parent and offspring was designated as a trio-tagged SNP.

Quantification of the carotenoid content in breeding materials

Quantification of eight representative carotenoids [phytoene, α-carotene, ζ-carotene, lutein, β-carotene, β-cryptoxanthin, zeaxanthin, and violaxanthin] was carried out according to a modification of the method proposed by Kato et al. [5]. Juice sac tissues were homogenized and extracted in a solution of hexane, acetone, and ethanol (50:25:25, v/v). The pigments partitioned into the hexane phase evaporated until dry. They were then dissolved in methyl tert-butyl ether containing 0.1% (w/v) 2,6-di-tertbutyl-4-methylphenol. The extracts containing carotenoids esterified to fatty acids were saponified with 10% (w/v) methanolic KOH. After saponification, water-soluble extracts were removed from the extract by adding NaCl-saturated water. The pigments were repartitioned into 2mL of the MTBE phase. An aliquot (20 μL) was obtained using a reverse-phase HPLC system (Jasco, Tokyo, Japan) fitted with a YMC Carotenoid S-5 column of 250 × 4.6-mm mm i.d. (Waters, Milford, MA) at a flow rate of 1 mL min−1. The eluent was monitored using a photodiode array detector (MD-910, Jasco, Tochigi, Japan). Each sample was analyzed using the gradient elution schedule proposed by Rouseff et al. [30]. The composition was alternated from 90% methanol, 5% MTBE, and 5% water through a linear gradient to 95% MeOH and 5% MTBE over 12 min, then to 86% MeOH and 14% MTBE over 8 min, 75% MeOH, and 25% MTBE over 10 min, and 50% MeOH and 50% MTBE over 20 min. The peaks were identified by comparing their specific retention times and absorption spectra with authentic standards. The concentrations of the standard solutions were estimated on the basis of the absorption coefficient at 286 nm for phytoene, 400 nm for ζ-carotene, 452 nm for t-violaxanthin, c-violaxanthin, lutein, β-cryptoxanthin, α-carotene, and zeaxanthin, and 453 nm for β-carotene [31]. The sample concentrations were estimated from standard curves. Violaxanthin and ζ-carotene were obtained as the sum of isomers and total CARs as the sum of the carotenoids. The carotenoid concentration was estimated by the standard curves and expressed as milligrams per hundred fresh weight grams (mg/100FWG).

Estimation for the genetic effects of alleles in the target gene regions by using a Bayesian regression model

To evaluate the genetic effects of alleles in the target genetic loci on carotenoid contents, we applied a multi-loci Bayesian linear regression model [34]. The observed phenotypic value for the ith variety yi is described as follows:

yi=α+jJlLjγj(xijl+xijl)βjl+ei, (Eq 1)

where α represents the intercept. Lj represents the number of alleles in the target gene region j(j = 1,2,…,J). xijl(xijl) denotes the maternal (paternal) allele in the target gene region j for variety i and equals to 1 if the maternal (paternal) allele is the lth allele (l = 1,…,Lj) and 0 otherwise. γj signifies the indicator variable and γj = 1 corresponds to the case where gene region j is included in the model as a QTL representative and γj = 0 implies exclusion. βjl denotes the allele effect associated with the allele l for gene region j, which was assumed to follow N(0,σβj2). The residual error ei was assumed to follow N(0,σe2). The genetic variance of gene region j, i.e., σβj2, and the residual variance, i.e., σe2, were assumed to follow the scaled inverse chi-square distributions Inv−χ2(υβ,Sβ) and Inv−χ2(υe,Se), respectively, as described by Iwata et al. [32]. A prior probability for γj was assumed to follow the Bernoulli distribution with probability π=λJ, as described by Iwata et al. [32]. Estimation of the parameters in the above model via the Bayesian regression was conducted as described by Iwata et al. [32]. MCMC sampling was used for the Bayesian inference on each parameter. For each dataset, MCMC cycles were repeated 13×104 times. The first 3×104 (burn-in) cycles were not used for the parameter estimation. The sampling was conducted every ten cycles to reduce serial correlation, such that the total number of samples we retained was 1×104. The hyperparameters of the model were set as νβ = 4, Sβ2=0.04, νe = −2, Se2=0 and π = 0.6 (λ = 3) to correspond to the model known as BayesB. This sampling scheme was based on a previously described evaluation of the convergence of MCMC cycles [32, 33]. In this analysis, we evaluated the allele effects (ζjl) in the target gene region by multiplying the product of the posterior means of γj and βjl. Further, we calculated the proportion of the expected variance explained by all alleles, the allele which had the highest effect, and the allele which had the largest variance to the phenotypic variance as the allele contribution. Significant gene regions were determined by the permutation test as described by Iwata et al. [34].

Minimal set of trio-tagged SNPs to discriminate the independent alleles of PSY and ZEP in 13 founders of Japanese breeding pedigrees

Minimal set of trio-tagged SNPs was calculated to discriminate the independent alleles of PSY and ZEP in 13 founders using MinimalMarker software [35].

Results

SureSelect target enrichment of carotenoid metabolic genes among the 13 founders

In previous studies of QTL and eQTL analyses using bi-parental populations [17, 20], each genetic locus for PSY, HYb, ZEP, and NCED, and eQTL for PDS and ZDS in linkage group 8 (TCL) were shown to have statically effects on carotenoid content and transcription level of carotenoid metabolic genes. In this study, we focused on them and explored an optimum allele associated with high carotenoid content in juice sac tissues of fruits, from a total of 26 alleles present in 13 ancestral species of the Japanese breeding population. Citrus varieties comprised complex mosaic genome structures of a limited number of ancient species via repeated natural cross [36]; therefore, the number of independent alleles that were shared by some of the 13 founders was estimated for five target genes based on SNPs. Since the genomic sequences of most founders are not available in the public database, a SureSelect target enrichment system was applied to acquire the SNP information of the target genes except for TCL. The SNP information was collected from the gene region because sequence variation in promoter region were complicate among 13 founders by the combination of insertion, deletion and point mutation among citrus varieties and it would be unfavorable to extract stably inherited SNPs among citrus varieties and their progenies. Because carotenoid metabolic genes have several isoforms in the genome, genome sequences in the gene region of Ciclev10011841m.g for PSY, Ciclev10005481m.g for HYb, Ciclev10025089m.g for ZEP, and Ciclev10014639m.g for NCED in the clementine mandarin genome sequence ver 1.0 were utilized to create custom bait libraries that covered the genomic region around target genes. These loci were confirmed to influence the carotenoid content and transcription levels of metabolic genes in previous studies. The constructed captured libraries enriching four metabolic genes were sequenced using a HiSeq2500 system (Illumina), and NGS data were aligned with the reference sequence of clementine mandarin genome sequence ver 1.0 using CLC Genomic Workbench 6.5.1 (CLC bio, Aarhus, Denmark). Numerous SNPs and indels were found in 4 target genes when overviewing 13 founders. The number of SNPs in PSY, HYb, ZEP, and NCED with more than readable SNP quality score (>150) according to manufacturer’s description were 107, 31, 54 and 19 respectively, when NGS data of 13 founders were compared with the corresponding reference gene of clementine mandarin. For TCL, the causative gene that controls the transcription levels of PDS and ZDS has not been characterized. Therefore, arbitrary SNPs on TCL were applied to estimate the number of independent alleles for uncharacterized causative genes on TCL given that the allele on common haplotype blocks derived from common founders would reveal the same SNP genotypes. The candidate SNPs on TCL were explored using the TASUKE program in MiGD [19]. Based on the SNP information, 17 SNP markers for PSY, 15 for HYb, 31 for ZEP, 8 NCED, and 5 for TCL were developed for a SNP genotyping assay using the GoldenGate assay system (Illumina) and Fluidigm BioMark™ HD assay system (Fluidigm). PSY catalyzes the first committed condensation step from GGPP to produce the first C40 carotene, phytoene. Carotenoid composition is highly influenced by the transcription balance among carotenoid metabolic genes [5]. In previous QTL and eQTL analyses, PSY, HYb, ZEP, NCED, and eQTL for PDS and ZEP have been shown to increase carotenoid content in juice sac tissues of fruits [17, 20]. These genes including a causative gene on TCL (denoted in red font within boxes) are the targets for allele mining in this study. PSY, phytoene synthase; PDS, phytoene desaturase; ZDS, ζ-carotene desaturase; LCYe, lycopene ε-cyclase; LCYb, lycopene β-cyclase; HYb, β-carotene hydroxylase; ZEP, zeaxanthin epoxidase; VDE, violaxanthin de-epoxidase; NSY, neoxanthin synthase; CCD, carotenoid cleavage dioxygenase; NCED, 9-cis-epoxycarotenoid dioxygenase.

Independent alleles in 13 founders based on the SNP genotype information of reliable SNP markers

A total of 76 SNP markers were applied to the genotyping of 13 founders and 78 offspring that had parent–offspring relationships among them. Modern citrus varieties reveal extensive sharing of haplotypes from ancient species [37]. Considering the admixture genomes and long cultivation history of 13 founders, numerous de novo mutations frequently occur in those sharing haplotypes; these would hamper the identification of a synonymous allele derived from common ancient species, out of the 26 alleles present in the 13 founders. Trio analysis was carried out to remove SNPs of de novo mutations and obtain reliable SNPs on the basis of inheritance in the lineages. Seventy-eight combinations of parent-offspring varieties were evaluated by the computer software “MARCO”. In total, 57 SNPs passed through the filtering by trio analysis and were valid as trio-tagged SNPs. Schematic diagram to generate 57 trio-tagged SNPs and their detail information are summarized in S1 Fig and S1 Table, respectively. Tentatively, each of the 26 alleles calculated in the13 founders was also named according to the style of abbreviation combination for gene, variety, and allele number (1 or 2), i.e., PSY alleles in grapefruit were named PSY-a-gf1 and PSY-a-gf2, and so on. The alleles with identical trio-tagged SNP genotypes were assigned as independent alleles, and 12, 10, 24, and 6 trio-tagged SNPs were used to find an independent allele for PSY, HYb, ZEP, and NCED alleles, respectively. The numbers of independent alleles in 13 founders were as follows: 7 alleles for PSY, 7 alleles for HYb, 11 alleles for ZEP, 5 alleles for NCED, and 4 alleles for TCL. The members of each synonymous allele and their genotypes of trio-tagged SNPs are summarized in Table 2. For example, the PSY-independent allele was defined based on the genotype of 12 trio-tagged SNPs, and there were 7 independent alleles from PSY-a to PSY-g. PSY-a-gf2 in grapefruit, PSY-a-ks1 in Kishu mikan, PSY-a-hs2 in hassaku, PSY-a-so2 in sweet orange, PSY-a-kb1 in Kunenbo mandarin, PSY-a-wl2 in willow leaf mandarin, PSY-a-kg2 in King mandarin, and PSY-a-mch in ‘Murcott’ revealed the same SNP genotype and were assigned to PSY-a. The trio-tagged SNP genotypes for PSY, HYb, ZEP, NCED, and TCL are summarized in S2 Table.

Table 2. Allelic composition of independent alleles of 5 target genes including a causative gene on TCL in 13 founders.

PSY HYb ZEP NCED TCL
Independent allele Founder Tentative allele Independent allele Founder Tentative allele Independent allele Founder Tentative allele Independent allele Founder Tentative allele Independent allele Founder Tentative allele
PSY-a Grapefruit PSY-a-gf2 HYb-a Dancy tangerine HYb-a-dc1 ZEP-a Dancy tangerine ZEP-a-dc1 NCED-a Dancy tangerine NCED-a-dc1 TCL-a Buntan pumelo TCL-a-tbh
Kishu mikan PSY-a-ks1 Kishu mikan HYb-a-ks2 Kishu mikan ZEP-a-ks1 Grapefruit NCED-a-gf2 Hassaku TCL-a-hsh
Hassaku PSY-a-hs2 Buntan pumelo HYb-a-tb2 Hassaku ZEP-a-hs1 Buntan pumelo NCED-a-tb2 Hyuganatsu TCL-a-hg2
Sweet orange PSY-a-so2 Hassaku HYb-a-hs2 Hyuganatsu ZEP-a-hg1 Hassaku NCED-a-hsh Sweet orange TCL-a-so2
Kunenbo mandarin PSY-a-kb1 Iyo HYb-a-iy2 Sweet orange ZEP-a-so1 Hyuganatsu NCED-a-hg1 Iyo TCL-a-iy2
Willowleaf mandarin PSY-a-wl2 Kunenbo mandarin HYb-a-kb2 Iyo ZEP-a-iy1 Sweet orange NCED-a-so2 Kunenbo TCL-a-kb2
King mandarin PSY-a-kg2 Ponkan mandarin HYb-a-pk1 Kunenbo mandarin ZEP-a-kb1 Iyo NCED-a-iy1 King mandarin TCL-a-kg2
Murcott PSY-a-mch Murcott HYb-a-mc2 Ponkan mandarin ZEP-a-pkh Kunenbo mandarin NCED-a-kb1 Murcott TCL-a-mc2
PSY-b Iyo PSY-b-iy1 HYb-b Grapefruit HYb-b-gf2 Willowleaf mandarin ZEP-a-wlh Ponkan mandarin NCED-a-pk1 TCL-b Dancy tangerine TCL-b-dc1
Ponkan mandarin PSY-b-pk1 Buntan pumelo HYb-b-tb1 ZEP-b Dancy tangerine ZEP-b-dc2 King mandarin NCED-a-kg1 Grapefruit TCL-b-gf1
Willowleaf mandarin PSY-b-wl1 Hyuganatsu HYb-b-hgh Kishu mikan ZEP-b-ks2 Murcott NCED-a-mch Iyo TCL-b-iy1
King mandarin PSY-b-kg1 Sweet orange HYb-b-so1 Murcott ZEP-b-mc2 NCED-b Hyuganatsu NCED-b-hg2 Murcott TCL-b-mc1
Dancy tangerine PSY-b-dc1 Kunenbo mandarin HYb-b-kb1 ZEP-c Kunenbo mandarin ZEP-c-kb2 Kunenbo mandarin NCED-b-kb2 Sweet orange TCL-b-so1
Buntan pumelo PSY-b-tb2 Willowleaf mandarin HYb-b-wlh King mandarin ZEP-c-kg2 Kishu mikan NCED-b-ksh Ponkan mandarin TCL-b-pk1
PSY-c Grapefruit PSY-c-gf1 King mandarin HYb-b-kg1 ZEP-d Buntan pumelo ZEP-d-tb2 Willowleaf mandarin NCED-b-wlh Willowleaf mandarin TCL-b-wl1
Buntan pumelo PSY-c-tb1 HYb-c Sweet orange HYb-c-so2 Iyo ZEP-d-iy2 NCED-c Dancy tangerine NCED-c-dc2 TCL-C Kishu mikan TCL-c-ksh
Hyuganatsu PSY-c-hg1 Murcott HYb-c-mc1 ZEP-e King mandarin ZEP-e-kg1 Iyo NCED-c-iy2 Dancy tangerine TCL-c-dc2
Sweet orange PSY-c-so1 King mandarin HYb-c-kg2 Murcott ZEP-e-mc1 Ponkan mandarin NCED-c-pk2 Hyuganatsu TCL-c-hg1
PSY-d Kishu mikan PSY-d-ks2 HYb-d Dancy tangerine HYb-d-dc2 ZEP-f Grapefruit ZEP-f-gf1 King mandarin NCED-c-kg2 Kunenbo TCL-c-kb1
Iyo PSY-d-iy2 Ponkan mandarin HYb-d-pk2 ZEP-g Grapefruit ZEP-g-gf2 NCED-d Grapefruit NCED-d-gf1 Ponkan TCL-c-pk2
Kunenbo mandarin PSY-d-kb2 HYb-e Iyo HYb-e-iy1 ZEP-h Hassaku ZEP-h-hs2 Sweet orange NCED-d-so1 Willowleaf mandarin TCL-c-wl2
PSY-e Dancy tangerine PSY-e-dc2 Hassaku HYb-e-hs1 ZEP-i Sweet orange ZEP-i-so2 NCED-e Buntan pumelo NCED-e-tb1 King mandarin TCL-c-kg1
Ponkan mandarin PSY-e-pk2 HYb-f Grapefruit HYb-f-gf1 ZEP-j Buntan pumelo ZEP-j-tb1 TCL-d Grapefruit TCL-d-gf2
PSY-f Hassaku PSY-f-hs1 HYb-g Kishu mikan HYb-g-ks1 ZEP-k Hyuganatsu ZEP-k-hg2
PSY-g Hyuganatsu PSY-g-hg2

Notes: PSY, phytoene synthase gene; HYb, β-ring hydroxylase gene; ZEP, zeaxanthin epoxidase gene; NCED, 9-cis-epoxycarotenoid dioxygenase gene; TCL, a putative causative gene on eQTLs that control the transcription level of phytoene desaturase (PDS) and ζ-carotene desaturase (ZDS). The numbers of independent alleles in 13 founders were as follows: 7 alleles for PSY, 7 alleles for HYb, 11 alleles for ZEP, 5 alleles for NCED, and 4 alleles for TCL. The members of each synonymous allele are summarized. For example, 7 PSY-independent alleles were named from PSY-a to PSY-g. Each of the 26 alleles in the13 founders was also named according to the style of abbreviation combination for allele, variety, and allele number (1 or 2), i.e., PSY alleles in grapefruit were named PSY-c-gf1 and PSY-a-gf2, and so on. PSY-a-gf2 in grapefruit, PSY-a-ks1 in Kishu mikan, PSY-a-hs2 in hassaku, PSY-a-so2 in sweet orange, PSY-a-kb1 in Kunenbo mandarin, PSY-a-wl2 in willow leaf mandarin, PSY-a-kg2 in King mandarin, and PSY-a-mch in ‘Murcott’ revealed the same SNP genotype and were assigned to PSY-a. The trio-tagged SNP genotypes for PSY, HYb, ZEP, NCED, and TCL are summarized in S2 Table.

Genotyping of trio-tagged SNPs in breeding populations and the frequency of alleles in the progress of breeding generation

SNP genotyping assay was carried out for 275 plant materials, including13 founders, and their allelic composition was characterized based on the SNP genotype data of 57 trio-tagged SNPs. The allelic genotypes of five target genes in major cultivars are summarized in S3 Table. There was no discrepancy in the transmission of all independent alleles from parents to offspring among the samples in the 78 parent–offspring combinations. Breeding populations have been generated by crossing with one another, and the most advanced breeding generations of the registered cultivars belong to the 4th generation from 13 founders in the breeding program. The numbers of the registered cultivars and preserved selection lines in each generation were as follows: founder, 13 varieties; zero generation (G0): 2 (Satsuma mandarin and Clementine mandarin), 1st generation (G1), 14 varieties and lines (i.e. ‘Kiyomi’, ‘Sweet spring’, etc.); 2nd generation (G2), 23 (i.e. ‘Harumi’, ‘Tamami’, etc.); 3rd generation (G3), 21 (i.e. ‘Setoka’, ‘Tsunokagayaki’, etc.); 4th generation (G4), 10 (i.e. ‘Harehime’, ‘Ehime Kashi No. 28’, etc.). In Japanese breeding programs, the enrichment of carotenoids in fruit is one of the most important breeding traits. The allele frequencies of the five target genes were traced during those breeding generations, except for G0 because Satsuma mandarin and clementine mandarin were generated by the natural cross of founders between Kunenbo mandarin and Kishu mikan or between Willowleaf mandarin and King mandarin, respectively (Fujii et al. 2016). Interestingly, several alleles of the five target genes were selected out in progress of breeding generation. For example, out of seven PSY alleles in founders, PSY-g and PSY-f disappeared from G2, and the occupied ratio of PSY-c decreased from 15.4% in founders to 5% in G4 (Fig 2A). Of the seven HYb alleles in founders, HYb-c, HYb-e, and HYb-f disappeared in the G4, and the occupied ratio of HYb-g increased from 3.8% in founders to 27.8% in G4 (Fig 2B). Of the 11 ZEP alleles in founders, ZEP-f, ZEP-g, ZEP-h, ZEP-j, and ZEP-k disappeared in G4, and the occupied ratio of ZEP-e increased from 7.7% in founders to 26.3% in G4 (Fig 2C). Of the five NCED alleles in founders, NCED-d and NCED-e disappeared in G4, and the occupied ratio of NCED-c reduced from 15.4% in founders to 5.0% in G4 (Fig 2D). Additionally, out of 4 TCL alleles in founders, TCL-d disappeared in G4, and the occupied ratio of TCL-a decreased from 38.5% in founders to 12.1% in G4 (Fig 2E). Thus, several alleles in founders disappeared during the breeding generation although these disappeared alleles may be revived by crossing with old varieties.

Fig 2. The frequency of independent alleles in 13 founders during the progress of Japanese breeding population.

Fig 2

Notes: The Japanese breeding population was developed from the germplasms of 13 founders. Several alleles of PSY (A), HYb (B), ZEP (C), NCED (D), and TCL (E) disappeared in the progressive breeding generation. G1: 1st generation, G2: 2nd generation, G3: 3rd generation, G4: 4th generation. A several alleles disappear in the progress of breeding generation in each gene.

Quantification of carotenoid components in juice sac tissues of mature fruits from 263 breeding pedigrees

Phytoene, t-violaxanthin, c-violaxanthin, lutein, β-cryptoxanthin, α-carotene, zeaxanthin, and β-carotene in the juice sac tissues of the mature fruits were measured among 263 breeding pedigrees including 11 founders, for which fruits were available, by high-pressure liquid chromatography (HPLC). The content of total carotenoid, β-cryptoxanthin and violaxanthin in the major cultivars is summarized in S3 Table. Fig 3 shows the carotenoid composition for each breeding generation from G1 to G4. The mean total carotenoid content in the 13 founders was 2.2 mg/100FWG, with a range 0.0 mg/100FWG to 6.0 mg/100FWG. The variance in total carotenoid content was 3.9. The fruits of Kishu mikan, King mandarin and ‘Murcott’ accumulated high concentrations of carotenoids and those of grapefruit, Hassaku and Hyuganatsu accumulated lower concentrations of carotenoids. The major carotenoids present in the 13 founders were either β-cryptoxanthin or violaxanthin, and they comprised more than 67.0% of the total carotenoids. In the breeding population, the mean total carotenoid content was 2.8 mg/100FWG, with a range of 0.2 mg/100FWG to 9.6 mg/100FWG. The variance of total carotenoid content was 2.7, which was less than that of the 13 founders. These results indicated that the variance of total carotenoids became smaller in the breeding population, whereas total carotenoids, β-cryptoxanthin, and VIO became more abundant owing to the removal of unnecessary alleles. For example, ‘Tsunokagayaki’ in G3 and Kuchinotsu 52 gou in G4 revealed a deep orange flesh color (Fig 4), and β-cryptoxanthin (total carotenoids) content in the juice sac tissues were 2.2 mg/100FWG (4.6 mg/100FWG) and 2.4 mg/100FWG (4.5 mg/100FWG), which were higher than 1.5 mg/100FWG (2.3 mg/100FWG) of Satsuma mandarin in G0. Interestingly, β-cryptoxanthin content and violaxanthin content were highly correlated with the total carotenoid content among breeding populations, and both of their correlation values were 0.85 (Fig 5). This result implied that the increase in total carotenoids was supported by the accumulation of these xanthophylls in mature fruits. These carotenoid profiles suggest that carotenoid enrichment was achieved by the exchange of allelic composition and by the targeted removal of the unnecessary alleles in conventional crossbreeding.

Fig 3. Carotenoid composition in juice sac tissues of fruit among 13 founders and 57 cultivars and lines within 5th generation.

Fig 3

Notes: Phytoene, α-carotene, β-carotene, ζ-carotene, lutein, β-cryptoxanthin, zeaxanthin, and violaxanthin in juice sac tissues were measured by high-pressure liquid chromatography. G0: natural cross, G1: 1st generation, G2: 2nd generation, G3: 3rd generation, G4: 4th generation.

Fig 4. Images of the fruits of ’Tsunokagayaki’ and Kuchinotsu 52 gou with PSY-a and ZEP-e.

Fig 4

Notes: The deep orange color in flesh color revealed high concentration of β-cryptoxanthin content in juice sac tissues of ‘Tsunokagayaki’ (2.2 mg/100FWG) and Kuchinotsu 52 gou (2.4 mg/100FWG).

Fig 5.

Fig 5

Correlation between total carotenoid content and β-cryptoxanthin content (A) and between total carotenoid content and violaxanthin content (B) among 263 breeding pedigrees. Notes: β-cryptoxanthin and violaxanthin are major xanthophylls that accumulate in mature fruits, and their increase leads to an increase in total carotenoids.

Association between the allelic composition of five target genes and carotenoid composition in juice sac tissues of fruits by Bayesian statistical analysis

The association between allelic composition of five target genes and carotenoid composition (S3 Table) was evaluated by Bayesian statistical analysis using the data obtained from the 263 pedigrees. The five target genes detected some significant associations with the measured eight kinds of carotenoids and total carotenoid content (Fig 6). PSY, ZEP, and NCED significantly associated with total carotenoid content, PSY and ZEP significantly associated with β-cryptoxanthin content, and PSY, HYb, ZEP, NCED, and TCL significantly associated with violaxanthin content. Based on the estimated effect size for each allele of the five target genes on the measured carotenoids, the allele with the value of more than one third of the max effect size was defined as the alleles with positive or negative effect (Fig 7). With respect to the independent alleles of the five target genes, PSY-a had a strong positive effect (ζ = 0.82) on increasing the total carotenoid content in juice sac tissues of fruits, whereas PSY-c and PSY-g had medium negative effects (ζ = -0.42 and -0.41, respectively) on decreasing the total carotenoid content (Fig 7A). Among the ZEP alleles, ZEP-e had a strong positive effect (ζ = 0.83) on increasing the total carotenoids in juice sac tissues of fruits, whereas three alleles, ZEP-a, ZEP-b, and ZEP-c, had medium or weak positive effects (ζ = 0.54, 0.38, and 0.66, respectively) on increasing the total carotenoid content. Four alleles of ZEP-f, ZEP-g, ZEP-h, and ZEP-k had medium or weak negative effects (ζ = -0.42, -0.68, -0.59, and -0.31, respectively) on decreasing the total carotenoid content in juice sac tissues of fruits. Moreover, among the NCED alleles, NCED-a had a weak positive effect (ζ = 0.34) on increasing the total carotenoid content in juice sac tissues of fruits. In contrast to those genes, HYb and TCL had unclear effect on the total carotenoid content in juice sac tissues of fruits. With regard to the β-cryptoxanthin content, PSY-a had a medium positive effect (ζ = 0.38), while PSY-c and PSY-g had weak negative effects (ζ = -0.29 and -0.26, respectively). In contrast, ZEP-a had a weak positive effect (ζ = 0.25). Other alleles had unclear effect on β-cryptoxanthin content in juice sac tissues of fruits. Finally, with regard to the violaxanthin content, PSY-a had a weak positive effect (ζ = 0.20), while ZEP-c and ZEP-e also had weak positive effects (ζ = 0.24 and 0.34, respectively). Other alleles had unclear effect on violaxanthin content in juice sac tissues of fruits.

Fig 6. Posterior probability of having a QTL (posterior average of γ) at 5 targeted gene regions, estimated for each carotenoid.

Fig 6

Notes: Horizontal dashed lines correspond to the threshold obtained from the random permutation procedure.

Fig 7.

Fig 7

Estimated effect size (ζ) of each allele in the five target gene regions for (A) total carotenoid, (B) β-cryptoxanthin, and (C) violaxanthin. Notes: Red, blue, and purple circle indicated that the allele among the 5 target regions had the highest effect size, the largest variance, and both the highest effect size and the largest variance, respectively. The characters in the brackets mean the product of the posterior means of γ.

Regarding the other carotenoids, several alleles in PSY, HYb, and ZEP had a weak effect on increasing their content (S2 Fig). Notably, PSY-a had a broad effect on multiple carotenoids, except for lutein.

Fig 8 shows the transmission of PSY and ZEP alleles in the lineage of ‘Tsunokagayaki’ in G3 and Kuchinotsu 52 gou in G4, which accumulate high concentrations of β-cryptoxanthin in juice sac tissues. The alleles of PSY and ZEP come from either allele in five founders of Kishu mikan (PSY-a/PSY-d; ZEP-a/ZEP-b), sweet orange (PSY-a/PSY-c; ZEP-a/ZEP-i), Kunenbo mandarin (PSY-a/PSY-d; ZEP-a/ZEP-c), willow leaf mandarin (PSY-a/PSY-b; ZEP-a/ZEP-a), and King mandarin (PSY-a/PSY-b; ZEP-c/ZEP-e). During breeding selection aiming for enrichment of β-cryptoxanthin, PSY-a and ZEP-e (or ZEP-a) were pyramided in those cultivars and breeding lines, thus confirming that PSY-a and ZEP-e had statically strong positive effects on the increase in total carotenoid content in juice sac tissues of fruits. These alleles would operate to increase β-cryptoxanthin and violaxanthin contents, although the effects were smaller compared to those exerted overall on the total carotenoid content. Of note, most alleles with negative effects on carotenoid content were, consistently, those alleles that disappeared during the progress of breeding generation.

Fig 8. Transmission of PSY and ZEP alleles in the lineage of ‘Tsunokagayaki’ and Kuchinotsu 52 gou with high concertation of β-cryptoxanthin.

Fig 8

Notes: G0: natural cross, G1: 1st generation, G2: 2nd generation, G3: 3rd generation, G4: 4th generation. PSY (PSY-a) and ZEP (ZEP-a and ZEP-e) alleles with positive effects are selected in the process of conventional breeding program for the enrichment of β-cryptoxanthin.

Verification of the effects of optimum PSY and ZEP alleles on increased carotenoid content in juice sac tissues of fruits

Box plot analysis was carried out to inspect the ability of PSY and ZEP optimum alleles with the strongest positive effects on total carotenoid content in juice sac tissues of fruits. Fig 9A revealed the distribution of total carotenoid content in six combinations of PSY allele as follows: condition 1: a pair of negative alleles (PSY-c or PSY-g), condition 2: a single of negative alleles without an optimum allele (PSY-a), condition 3: not including both optimum (PSY-a) and negative alleles (PSY-c or PSY-g), condition 4: a single optimum allele (PSY-a) and a single negative allele (PSY-c or PSY-g), condition 5: a single optimum allele (PSY-a) without any negative allele (PSY-c or PSY-g); and condition 6, 2 optimum alleles (PSY-a).When the mean values between conditions 2 and 3 were compared, it was clear that PSY-c and PSY-g had negative effects on reducing the carotenoid concentration. In contrast, the mean value of condition 6 with two PSY-a alleles was 4.2 mg/100FWG, which was higher than those of conditions 3, 4, and 5. Interestingly, the allelic interaction between alleles with positive effect and negative effect would be observed in condition 4

Fig 9. Box plot analysis to verify the effectiveness of optimum PSY and ZEP alleles for the enrichment of total carotenoids in juice sac tissues of fruits.

Fig 9

Notes: Box edges represent the upper and lower quantiles, with the median value shown as a bold line in the middle of the box. The circles indicate an outlier. (A) Comparisons of total carotenoid contents by the number of alleles with positive effect (PSY-a) and negative effect (PSY-c or -g). Condition 1, PSY-a = 0 & (PSY-c or -g) = 2 Condition 2, PSY-a = 0 & (PSY-c or -g) = 1; Condition 3, PSY-a = 0 & (PSY-c or -g) = 0; Condition 4, PSY-a = 1 & (PSY-c or -g) = 1; Condition 5, PSY-a = 1 & (PSY-c or -g) = 0; Condition 6, PSY-a = 2 & (PSY-c or -g) = 0. Means were tested for significant differences using Tukey’s HSD test at P < 0.05, following the normality of the distribution was confirmed not to be rejected at P = 0.05 by Kolmogorov-Smirnov’s one sample test. Carotenoid contents were analyzed as square root-transformed values. N/A indicates entry eliminated from the Tukey’s HSD test due to insufficient data. (B) Comparison of total carotenoid content by the number of alleles with a positive effect (ZEP-a, b, c, or e) under the PSY allele-type conditions (A). Means between ’+Allele = 1’ and ’+ Allele = 2’ were tested for significant differences using Welch’s two-sample t-test under conditions 3 and 5. *** indicates significance at P < 0.001, following the normality of the distribution was confirmed not to be rejected at P = 0.05 by Kolmogorov-Smirnov’s one sample test. Carotenoid contents were analyzed as square root-transformed values. Means under other conditions were eliminated from Welch’s two-sample t-test due to insufficient data.

In addition, the ability of 4 ZEP alleles (ZEP-a, ZEP-b, ZEP-c, and ZEP-e), which were identified to exert a positive effect on the total carotenoid content, was examined against the above six conditions of PSY alleles (Fig 9B). Under condition 3, which did not include any PSY-a, the mean values of total carotenoid content increased proportionately with the number of ZEP alleles. Under conditions 1 and 2, no consistent effect was observed along with an increase in the number of ZEP alleles, indicating an epistatic interaction between the PSY allele and ZEP allele. This result was understandable as PSY is located upstream of the carotenoid metabolic pathway compared to ZEP. Consequently, PSY alleles with negative effects could negate the effectiveness of ZEP alleles with positive effects. Under condition 5, the mean values of total carotenoid content increased proportionately with the number of optimum ZEP alleles. Under condition 6, the mean values of total carotenoid content with those ZEP alleles were higher than those without them. Thus, these results confirmed that the optimum alleles of PSY and ZEP possess the capability of increasing carotenoid content in juice sac tissues of fruits.

Box plot analysis was also carried out for β-cryptoxanthin and violaxanthin content (S3 and S4 Figs, respectively). With respect to β-cryptoxanthin, a slight effect of PSY-a was observed on increasing the mean value, while ZEP alleles with a positive effect clearly indicated that their numerical increase led to an increase in β-cryptoxanthin content. Furthermore, for violaxanthin, the increase in PSY-a led to an increase in its mean value. In contrast, the numerical increase of ZEP alleles with positive effects did not correlate with violaxanthin content, except for condition 5. These evaluation test results indicated that the optimum allele of PSY-a emphatically had the ability to strengthen the flux of carotenoid metabolic pathway and increase total carotenoid content, along with an increase in β-cryptoxanthin and violaxanthin. ZEP alleles with a positive effect also confirmed an increase in total carotenoids, and its effect preferentially increased β-cryptoxanthin content over violaxanthin content. It is considered that ZEP alleles with positive alleles did not have a strong effect on increasing β-cryptoxanthin content by themselves, and they would likely synergistically increase β-cryptoxanthin content in cooperation with PSY-a by the sandwich effect between upstream and downstream genes in the carotenoid pathway.

Minimal set of trio-tagged SNPs to identify the optimum alleles of PSY-a and ZEP-e for marker-assisted selection

Pyramiding for optimum alleles of PSY-a and ZEP-e would be promising for the enrichment of carotenoids, especially β-cryptoxanthin in juice sac tissues of fruits. To efficiently promote molecular breeding by marker-assisted selection (MAS), minimal sets of trio-tagged SNPs were evaluated using MinimalMarker software [35] in order to determine the optimum PSY and ZEP alleles among the others. For PSY, MinimalMarker software revealed 45 minimal sets, which comprised different combinations of six trio-tagged SNPs, to discriminate any combination of the seven PSY alleles. For example, PSY-SNP-05, PSY-SNP06, PSY-07, PSY-SNP08, PSY-SNP09, and PSY-SNP10 were one of the minimal sets to discriminate each other (Table 3). Fortunately, the optimum PSY-a allele could be discriminated with the other alleles by a single trio-tagged PSY-SNP06. For ZEP, MinimalMarker software revealed a single minimal set comprising seven trio-tagged SNPs (ZEP-SNP01, ZEP-SNP03, ZEP-SNP05, ZEP-SNP14, ZEP-SNP17, ZEP-SNP20, and ZEP-SNP21) to discriminate each other (Table 4). The optimum ZEP-e allele could be discriminated with the other alleles by two trio-tagged ZEP-SNP17 and ZEP-SNP21. When the allelic genotypes for PSY and ZEP alleles in the parent varieties are available, the number of SNPs for MAS in their progenies will be smaller than the above calculated minimal sets. For example, the genotypes of seedlings from the cross between the seed parent with ZEP-c and ZEP-e and pollen parent with ZEP-e and ZEP-e would reveal homozygous genotypes of ZEP-e or heterozygous genotypes of ZEP-c and ZEP-e. In this case, only one trio-tagged ZEP-SNP17 or ZEP-SNP20 was sufficient to discriminate the seedlings with homozygous genotypes of ZEP-e (Fig 10). Other trio-tagged SNPs, which were not included in the above minimal marker sets, are also applicable for MAS when they discriminated parent allelic genotypes. Recently, various SNP detection systems have been developed, such as the TaqMan-MGB SNP genotyping assay, Kompetitive Allele Specific PCR genotyping assay, and so on. SNP markers compatible with those systems could be customized based on the sequence information of trio-tagged SNPs listed in S1 Table.

Table 3. Minimal set of trio-tagged SNPs that discriminate 7 PSY independent alleles in 13 founders.

Independent allele Trio-tagged SNPs
PSY-SNP05 PSY-SNP06 PSY-SNP07 PSY-SNP08 PSY-SNP09 PSY-SNP10
PSY-a T A A A C T
PSY-b T G A G C T
PSY-c T G A G C C
PSY-d C G A A C T
PSY-e T G A A T T
PSY-f T G C A C T
PSY-g T G A A C T

Notes: PSY, phytoene synthase gene. 7 PSY-independent alleles were named from PSY-a to PSY-g. The all trio-tagged SNP genotypes for PSY are summarized in S2 Table.

Table 4. Minimal set of trio-tagged SNPs that discriminate 11 ZEP independent alleles in 13 founders.

Independent allele Trio-tagged SNPs
ZEP-SNP01 ZEP-SNP03 ZEP-SNP05 ZEP-SNP14 ZEP-SNP17 ZEP-SNP20 ZEP-SNP21
ZEP-a A C A C T A T
ZEP-b A C A C C A C
ZEP-c A C A C C C C
ZEP-d T C A C C C C
ZEP-e A C A C T A C
ZEP-f A C A C T C T
ZEP-g T A A C C A C
ZEP-h A C A T C C C
ZEP-i T C T C C C C
ZEP-j T A A C C C C
ZEP-k A C T C C C C

Notes: ZEP, zeaxanthin epoxidase gene. 11 ZEP-independent alleles were named from ZEP-a to ZEP-k. The all trio-tagged SNP genotypes for ZEP are summarized in S2 Table.

Fig 10. TaqMAN-MGB SNP genotyping using PSY-SNP06 and ZEP-SNP17 for marker-assisted selection.

Fig 10

Notes: Seedlings with homozygous PSY-a (A) and those with homozygous ZEP-e (B) are able to select in marker-assisted selection for the enrichment of carotenoids by a single DNA marker when allelic genotypes of parents are available. FAM fluorescent signal values are plotted on the x-axis, and HEX fluorescent signal values are plotted on the y-axis. NTC: Non-template control.

Discussion

The carotenoid profile in citrus fruits varies greatly across species and cultivars, wherein approximately 115 different carotenoids have been reported [4]. To address how carotenoid composition is highly extended among cultivars, we focused on the allelic diversities of carotenoid metabolic genes. Recent advances in NGS technologies provided a new insight that citrus varieties comprise a complex mosaic genome structures of a limited number of ancient species via repeated natural cross [35]. Ahmed et al. [38] further revealed interspecific mosaic genome structures in 53 citrus accessions that comprised large genomic fragments derived from four basic taxa using 15,946 diagnostic SNP markers generated by genotyping by sequencing assay. It is plausible that some of the founders in the Japanese breeding population share the common haplotype block derived from ancient species, where functionally synonymous genes are located. Citrus genomes are highly heterozygous and maintain high sequence variation among citrus varieties, resulting in numerous SNPs that can be detected in sequence comparisons made among them. The high frequency of SNPs among citrus varieties makes it difficult to explore the functional SNPs linked to important agricultural traits. In this study, trio-tagged SNPs, which were inherited in lineage, were used to classify the independent allele of 5 target genes, including a causative gene on TCL, in 13 founders. The number of independent alleles in the 13 founders was smaller than expected. This result was supported by previous reports that modern citrus varieties revealed extensive sharing of haplotypes from the ancient species among them [37]; the collapse of linkage disequilibrium (LD) structure was very slow [39], and a wide range of LD was observed in the breeding population [40]. Within the independent alleles of the five target genes, the genotypes of the trio-tagged SNPs were identical among the examined breeding populations. We considered that independent alleles of the five target genes would play synonymous functional roles in carotenoid metabolism because they are located on common haplotype blocks derived from common founders, even if numerous de novo mutations are scattered among them.

PSY and ZEP play a key role in influencing carotenoid content in juice sac tissues of fruits

Out of the five target genes responsible for high carotenoid content, PSY-a and ZEP-e were identified as the optimum alleles responsible for increasing carotenoid content in juice sac tissues of fruits. PSY is a rate-limiting enzyme in the carotenoid biosynthetic pathway, and the flux of the carotenoid pathway is generally controlled by isoforms or alleles of PSYs in plants [41]. Expression profiles of different PSY isoforms are likely to exhibit tissue specificity in rice (Oryza sativa L.) and tomato (Solanum lycopersicum L.) [42, 43]. In clementine mandarin, there are four PSY isoforms, and the locus of Ciclev10011841m.g in scaffold 6 is responsible for carotenoid biosynthesis in juice sac tissues of the fruit [24]. This locus possessed plural PSY alleles with sequence variation among citrus varieties, and many expressed sequence tags are registered in the public database. We confirmed that the allelic combination of PSYs on this locus influenced PSY transcription level and carotenoid content in the fruits of F1 plants from a cross between A255 and G434. One PSY allele of G434 had a negative effect on the transcription of PSY, and F1 plants with this allele tended to have lower carotenoids than those without it. Several reports have documented that an increase in the carotenoid pathway flux emphasized the carotenoid accumulation by overexpression of PSY in tomato, Arabidopsis, and white carrots (Daucus carota L) [44, 45]. Therefore, the flux of the carotenoid pathway is primarily controlled by the allelic combination of seven PSY alleles in the Japanese breeding population. Among these, PSY-a had a highly positive effect on increasing the total carotenoid content, while PSY-c and PSY-g had a negative effect on decreasing it. The PSY independent alleles in 13 founders were arranged based on the 12 trio-tagged SNPs. Interestingly, phylogenetic tree analysis revealed that the seven PSY alleles functionally diverged from the common PSY ancestor during the process of citrus species specification (S5 Fig). All trio-tagged SNPs located on the out of open reading frames and those SNPs were nor likely linked to influence enzymatic activity. PSY-a also had a positive effect on β-cryptoxanthin and violaxanthin accumulation, but the effects were weaker compared to those exerted overall on the total carotenoids. In citrus fruits, the major xanthophyll in the juice sac is either β-cryptoxanthin or violaxanthin, and its content increases during fruit maturation. We considered that the high positive effect of PSY-a on total carotenoids would be due to the sum of these effects on β-cryptoxanthin and violaxanthin.

ZEP metabolizes zeaxanthin to yield antheraxanthin and then violaxanthin. Violaxanthin is the substrate for the synthesis of neoxanthin and can be converted back to zeaxanthin by violaxanthin deepoxidase. This reversible epoxidation/deepoxidation is termed the xanthophyll cycle, in which deepoxidation to zeaxanthin is favored under high-light conditions, while epoxidation to violaxanthin predominates in moderate-light conditions. The rapid formation of zeaxanthin by the xanthophyll cycle is integral for the dissipation of excess energy by non-photochemical quenching [46]. ZEP-e had a strong effect on increasing the total carotenoid content, while ZEP-a, ZEP-b, and ZEP-c had medium or weak effects on it. In contrast, ZEP-g and ZEP-h had a negative effect on decreasing the total carotenoid content. The ZEP independent alleles in 13 founders were arranged based on the 24 trio-tagged SNPs. Phylogenetic tree analysis revealed that the ZEP alleles with positive effects (ZEP-a, ZEP-b, ZEP-c and ZEP-e) were functionally diverged from negative allele group, and ZEP-f might be further diverged from positive allele group (S6 Fig). Eight trio-tagged SNPs located on the open reading frames but the amino acid substitution that linked to influence enzymatic activity was not found between positive and negative alleles. In carrot, maize, and potato (Solanum tuberosum L.) [4749], ZEP was identified as a major candidate gene governing carotenoid accumulation. A lower expression level of ZEP resulted in the accumulation of zeaxanthin and upstream of carotenoids in these crops. Homozygosity of the recessive ZEP allele, with the insertion of a non-LTR retrotransposon in the promoter region, caused a reduced level of expression and could accumulate large amounts of zeaxanthin in potato. In citrus, Sugiyama et al. [23] reported that the ZEP allelic combination influenced the amounts and accumulation patterns of their transcripts, altering the carotenoid content and composition in juice sac tissues of fruits. The high transcription allele of ZEP-1o in the previous report was ZEP-g, with a negative effect in this study, whereas the low transcription alleles of ZEP-1m, ZEP-2m, and ZEP-2o in the previous report were ZEP-a, c, and e, respectively, with a positive effect in this study. Considering that ZEP is located downstream of the carotenoid biosynthesis pathway, ZEP alleles with positive effects would narrow the flux of the carotenoid pathway and increase the carotenoid content in juice sac tissues of fruits.

In this study, a weak positive effect of NCED-a was detected against total carotenoid content, but any effective allele was not detected on β-cryptoxanthin and violaxanthin. NCED is encoded by a multigene family, and stress-inducible members contribute to a rate-limiting step in abscisic acid (ABA) biosynthesis [50]. The functional characterization of NCED in response to water deficit has been well characterized in the vegetative tissues, whereas the contribution of other NCED family members appeared to be minor. In citrus, NCED has been characterized at the molecular level in fruits and leaves [51, 52]. They demonstrated a reciprocal relationship between the levels of NCED transcripts and 9-cis- violaxanthin (a substrate of NCED), suggesting that NCED activity affected carotenoid accumulation in citrus fruits via ABA synthesis. The oxidative cleavage of 9-cis-violaxanthin catalyzed by NCEDs affects the concentration of 9-cis-violaxanthin and ABA in fruits, and consequently the β, β-xanthophyll composition [11]. Ma et al. [53] reported that the low transcription level of NCED in the mature fruit played a key role in increasing β-cryptoxanthin and violaxanthin content in the mature fruit when comparing Satsuma mandarin and ‘Seinannohikari’ (EnOw No.21 × ‘Yoko’). NCED is also located at the end of the carotenoid metabolism pathway, and its allelic genotype also influences carotenoid accumulation. Because ‘Seinannohikari’ does not have optimum allele of PSY-a and ZEP-e (S3 Table), NCED would like to play a visible role to increase β-cryptoxanthin and violaxanthin content instead of them.

Thus, it was found that PSY and ZEP play key roles in influencing the carotenoid content in juice sac tissues of fruits. By pyramiding of PSY-a and ZEP-e alleles located at the beginning and end of the carotenoid pathway, a significant increase in carotenoid concentration could be expected in juice sac tissues of fruits. In this study, we could not detect the effects of HYb and TCL. As these genes are located in the middle of the carotenoid biosynthetic pathway, their effects may be hidden by the epistatic interaction between upstream and downstream allelic genotypes.

Other factors influencing carotenoid accumulation in citrus mature fruit

In this study, we provided a new insight that the carotenoid composition in fruit was highly influenced by the allelic composition of carotenoid metabolic genes. The expected variance explained by five target genes for phenotypic variance of total carotenoid content among 263 breeding pedigrees was approximately 0.3, which was similar to that of QTLs for total carotenoids and β-cryptoxanthin detected in bi-parental populations [17]. We considered this to be a reasonable value because the carotenoid pathway was highly influenced by its response to numerous factors, such as development, ABA, high light, salt, drought, and pathogen interaction temperature. In addition, a significant number of terpene synthase (TPS) genes are found in the citrus genome, which are responsible for various terpenoid products such as aroma and limonoid. Because these TPS genes share an isopentenyl precursor of isopentenyl pyrophosphate, the genetic composition of other TPS genes may influence carotenoid accumulation in fruits.

Recently, it has also become clear that several factors influence carotenoid accumulation in mature citrus fruits. Carotenoids are catabolized enzymatically by a family of carotenoid cleavage dioxygenases (CCDs). The Arabidopsis genome has nine CCD family genes, which are divided into two groups of CCDs (4 CCD genes) and NCEDs (5 NCED genes) that recognize different carotenoid substrates and are cleaved at different sites, producing various apocarotenoids, such as precursors of phytohormones (ABA and strigoractone), volatile compounds (β-ionone and related compound), pigments (β-citraurin and bixin), and so on [54]. In tomato, petunia (Petunia×hybrida), grape (Vitis vinifera), and peach (Prunus persica), the transcription of CCD1 and CCD4 is associated with the emission of carotenoid-derived volatile scent and pigmentation in flowers and fruit flesh [41]. Carotenoid accumulation in various plant species is generally found to be negatively correlated with the transcription of CCD1 and CCD4. In citrus, the clementine mandarin genome possesses 14 CCD family genes, which can be functionally divided into five subfamilies, namely, CCD1, CCD4, CCD7, CCD8, and NCED [55]. Citrus CCD1 has wide substrate specificity and generates carotenoid-derived volatile scent [51], whereas CCD4 generates β-citraurin from β-cryptoxanthin and zeaxanthin, which is responsible for the red pigmentation of citrus fruits [56]. There are several reports that CCD1 and CCD4 transcription do not correlate with carotenoid accumulation in morning glory and citrus [51, 57]. A recent recombinant protein assay of citrus CCD1 and CCD4 revealed that they could cleave the free β-cryptoxanthin but not the β-cryptoxanthin esters, indicating that the β-cryptoxanthin esters may be more stable than free β-cryptoxanthin in citrus fruits [58]. Furthermore, more than 80% of the β-cryptoxanthin in citrus fruits is esterified with fatty acids such as laurate, myristate, and palmitate. Therefore, esterification is an important process for the massive accumulation of carotenoids in chromoplasts, as esterification facilitates sequestration and enhances the stability of carotenoids in chromoplasts [59]. Xanthophyll esterase (XES) is a key enzyme responsible for the high carotenoid accumulation in the petals of tomato and petunia (Petunia × hybrida) [59, 60]. Therefore, it is possible that XES may also be involved in the regulation of carotenoid accumulation in citrus fruits.

In addition to the CCD family, orange protein (OR) is another factor that influences carotenoid accumulation in the mature fruit. It has been reported that OR increases carotenoid accumulation in various plant species through post-transcriptionally regulating PSY, promoting formation of carotenoid-sequestering structures, and preventing carotenoid degradation [61]. Carotenoids are abundant in the chromoplast, and chromoplast formation during fruit ripening is one of the main factors governing fruit pigmentation [62]. In citrus, little is known about the molecular function of OR; thus, it is necessary to investigate the interaction between the high concentration allele of PSY-a and the putative citrus OR. In addition, various transcription factors of CubHLH1 [14] (Endo et al. 2015), an R2R3-MYB transcription factor CrMYB68 [15] as well as CsMADS6 [16] have been implicated in the direct regulation of multiple genes involved in carotenoid metabolism during fruit development and maturation. Considering that their putative loci, which were predicted using the genome sequence in MiGD, were not associated with the previously detected QTL [17], these transcription factors may also have some effects on the regulation of carotenoid accumulation, although they prefer to coordinate transcription in response to environmental factors such as light and temperature during fruit development and maturation.

Allelic mining system aiming to enrich carotenoid concentration in fruit

To date, only a bi-parental population has been used to explore the genomic region responsible for high carotenoid content in juice sac tissues of fruits by QTL and eQTL analyses [17, 20]. The information from this approach would be limited when considering the abundance of the genes and their alleles among the 13 founders. In fact, the offset combination of alleles with positive and negative effects and the epistasis among metabolic genes confuse us to explore the alleles responsible for the enrichment of carotenoid in fruit. In Arabidopsis, tomato, maize, and rice, a multi-parental integrated population has been developed to increase the accuracy of genetic studies and lead to the successful elucidation of complex QTLs that are regulated by multiple genes, responsible for the transition from vegetative to reproductive stage, yield, grain quality, response to abiotic and biotic stress, and so on [6366]. The Japanese citrus breeding population comprises a similar structure of a multi-parental intermated population. Therefore, we hypothesized that association analysis focusing on the alleles of carotenoid metabolic genes would provide more accurate information that is applicable to whole breeding populations rather than QTL analysis using only bi-parental populations.

Recently, genome-wide association studies (GWAS) and genomic selection (GS) using NGS-based genome-wide markers have been widely used in genetic dissection of complex traits in various plants. In Arabidopsis and rice [67, 68], GWAS has been applied to find genotype-metabolite associations, providing deeper insights into the genetic basis of metabolic diversity. In citrus, it was confirmed that GWAS and GS revealed the effectiveness of genetic improvement of 17 fruit traits, including fruit weight, smoothness of pericarp, and sugar content [40]. GWAS is a powerful method to determine genotype-phenotype associations; however, the genetic interaction between epistasis or between loci and the environment sometimes prevents the detection of valid associations. In fact, epistatic interactions among carotenoid metabolic genes were also found in this study. In this regard, the developed allelic mining system for carotenoid enrichment would provide a more reliable association between allele composition and carotenoid composition. It is possible that these methods would detect another factor that influences the enrichment of carotenoids in fruit. The new findings obtained in this study are not contradictory to previous physiological and genetic studies.

Conclusion

We provided a new insight that 13 ancestral varieties of Japanese breeding pedigrees shared common alleles of carotenoid metabolic genes from ancestral species, and the carotenoid diversity observed in cultivars could be extended by exchanging the allelic combination among carotenoid metabolic genes. The offset interaction between the alleles with increasing and decreasing effects on carotenoid content and the epistatic interaction among carotenoid metabolic genes were observed and these interactions complexed carotenoid profiles in breeding population. Out of the examined carotenoid metabolic genes, we confirmed by association analysis on breeding population that PSY and ZEP play a key role in controlling the carotenoid flux in mature fruits. Consequently, a high concentration of carotenoids could be achieved by pyramiding the ideal alleles of PSY-a and ZEP-e. These important alleles were successfully genotyped using the TaqMAN-MGB SNP markers for MAS. The results obtained, along with the DNA markers used, can facilitate carotenoid metabolic engineering to potentially improve the nutritional quality of fruits.

Supporting information

S1 Fig. Schematic diagram to develop the trio-tagged SNPs in the alleles of 5 target genes in 13 founders.

(TIF)

S2 Fig

Estimated effect size (ζ) of each allele types in the 5 target genes regions for (A) α- carotene, (B) β-carotene, (C) lutein, (D) phytoene, (E) ζ-carotene, and (F) zeaxanthin, respectively. Red, blue, and purple circle indicated that the allele type among the 5 target genes had the highest effect size, the largest variance, and both the highest effect size and the largest variance, respectively. The characters in the brackets mean the product of the posterior means of γ.

(TIF)

S3 Fig. Box plot analysis to verify the effectiveness of optimum PSY and ZEP alleles for the enrichment of β-cryptoxanthin in juice sac tissues of fruits.

Box edges represent the upper and lower quantiles, with the median value shown as a bold line in the middle of the box. The circles indicate an outlier. (A) Comparison of β-cryptoxanthin content by the number of alleles with positive effect (PSY-a) and negative effect (PSY-c or -g). Condition 1, PSY-a = 0 & (PSY-c or -g) = 2 Condition 2, PSY-a = 0 & (PSY-c or -g) = 1; Condition 3, PSY-a = 0 & (PSY-c or -g) = 0; Condition 4, PSY-a = 1 & (PSY-c or -g) = 1; Condition 5, PSY-a = 1 & (PSY-c or -g) = 0; Condition 6, PSY-a = 2 & (PSY-c or -g) = 0. Means were tested for significant differences using Tukey’s HSD test at P < 0.05, following the normality of the distribution was confirmed not to be rejected at P = 0.05 by Kolmogorov-Smirnov’s one sample test. β-cryptoxanthin content was analyzed as square root-transformed values. N/A indicates values eliminated from the Tukey’s HSD test due to insufficient data. (B) Comparison of total carotenoid content by the number of alleles with a positive effect (ZEP-a, -b, -c, or -e) under the PSY allele-type conditions (A). Means were tested for significant differences using Tukey’s HSD test at P < 0.05 under conditions 2 and 5. Means between ’+Allele = 1’ and ’+ Allele = 2’ were tested for significant differences using Welch’s two-sample t-test under condition 3, following the normality of the distribution was confirmed not to be rejected at P = 0.05 by Kolmogorov-Smirnov’s one sample test. ’ns’ indicates not significant at P < 0.05. β-cryptoxanthin content was analyzed as square root-transformed values.

(TIF)

S4 Fig. Box plot analysis to verify the effectiveness of optimum PSY and ZEP alleles for the enrichment of violaxanthin in juice sac tissues of fruits.

Box edges represent the upper and lower quantiles, with the median value shown as a bold line in the middle of the box. The circles indicate an outlier. (A) Comparison of violaxanthin content by the number of alleles with positive effect (PSY-a) and negative effect (PSY-c or -g). Condition 1, PSY-a = 0 & (PSY-c or -g) = 2 Condition 2, PSY-a = 0 & (PSY-c or -g) = 1; Condition 3, PSY-a = 0 & (PSY-c or -g) = 0; Condition 4, PSY-a = 1 & (PSY-c or -g) = 1; Condition 5, PSY-a = 1 & (PSY-c or -g) = 0; Condition 6, PSY-a = 2 & (PSY-c or -g) = 0. Means were tested for significant differences using Tukey’s HSD test at P < 0.05, following the normality of the distribution was confirmed not to be rejected at P = 0.05 by Kolmogorov-Smirnov’s one sample test. Violaxanthin contents were analyzed as square root-transformed values. N/A indicates values eliminated from the Tukey’s HSD test due to insufficient data. (B) Comparison of total carotenoid content by the number of alleles with a positive effect (ZEP-a, -b, -c, or -e) under the PSY allele-type conditions (A). Means were tested for significant differences using Tukey’s HSD test at P < 0.05 under conditions 2 and 5. Means between ’+Allele = 1’ and ’+ Allele = 2’ were tested for significant differences using Welch’s two-sample t-test under condition 3. ns indicates not significant at P < 0.05, following the normality of the distribution was confirmed not to be rejected at P = 0.05 by Kolmogorov-Smirnov’s one sample test. Violaxanthin contents was analyzed as square root-transformed values.

(TIF)

S5 Fig. Phylogenetic tree analysis of 7 PSY independent alleles in 13 founders by the neighbor-joining method.

The tree is divided into two major nodes. The positive allele (PSY-a) and negative alleles (PSY-c and PSY-g) were clustered into different nodes, revealing that these alleles might be divergent from the common ancestral PSY. Numbers above branches are bootstrap values based on 1000 replicates and numbers under ranches are genetic distance for each pair of nodes.

(TIF)

S6 Fig. Phylogenetic tree analysis of 11 ZEP independent alleles in 13 founders by the neighbor-joining method.

The tree is divided into two major nodes. The positive alleles (ZEP-a, b, c, e) and negative alleles (ZEP-f, g, h, k) are mixed in those nodes, revealing that the functional divergence of those alleles might have occurred after the divergence of 2 functionally different ZEP ancestral alleles. Numbers above branches are bootstrap values based on 1000 replicates and numbers under ranches are genetic distance for each pair of nodes.

(TIF)

S1 Table. Information on reliable trio-tagged SNPs with lineage by trio-analysis.

(PDF)

S2 Table. SNP genotype of trio-tagged SNPs for 5 target genes.

(PDF)

S3 Table. Allelic genotype of 5 target genes in major varieties and cultivars.

(PDF)

Data Availability

All relevant data are within the manuscript and its Supporting Information files.

Funding Statement

This work was partially supported by a grant from the Ministry of Agriculture, Forestry, and Fisheries of Japan (Genomics-based Technology for Agricultural Improvement, HOR-2003, DNA-marker breeding project), by a grant from the Project of the Bio-oriented Technology Research Advancement Institution, NARO (the special scheme project on advanced research and development for next-generation technology, 150), and by a Grant-in-Aid for Scientific Research of Japan Society for the Promotion of Science (JSPS), Grant Number 23580055.

References

  • 1.Wong JC, Lambert RJ, Wurtzel ET, Roncherford TJ. QTL and candidate genes phytoene synthase and z-carotene desaturase associated with the accumulation of carotenoids in maize. Theor Appl Genet. 2004; 108: 349–359. 10.1007/s00122-003-1436-4 [DOI] [PubMed] [Google Scholar]
  • 2.Schwartz SH, Quin X, Zeevaart JD. Elucidation of the indirect pathway of abscisic acid biosynthesis by mutants, genes and enzymes. Plant Physiol. 2003; 131: 1591–1601. 10.1104/pp.102.017921 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Demmig-Adams B, Adams WW. Antioxidants in photosynthesis and human nutrition. Science. 2002; 298: 2149–2153. 10.1126/science.1078002 [DOI] [PubMed] [Google Scholar]
  • 4.Stewart I, Wheaton TA. Conversion of β-citraurin to reticulataxanthin and β-apo-8′-carotenal to citranaxanthin during the isolation of carotenoids from Citrus. Phytochemistry 1973; 12: 2947–2951. [Google Scholar]
  • 5.Kato M, Ikoma Y, Matsumoto H, Sugiura M, Hyodo H, Yano M. Accumulation of carotenoids and expression of carotenoid biosynthetic genes during maturation in citrus fruit. Plant Physiol. 2004; 134: 824–837. 10.1104/pp.103.031104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Pouchieu C, Galan P, Ducros V, Latino-Martel P, Hercberg S, Touvier M. Plasma carotenoids and retinol and overall and breast cancer risk: a nested case-control study. Nutr Cancer. 2014; 66: 980–988. 10.1080/01635581.2014.936952 [DOI] [PubMed] [Google Scholar]
  • 7.Sugiura M, Nakamura M, Ogawa K, Ikoma Y, Yano M. High serum carotenoids associated with lower risk for bone loss and osteoporosis in post-menopausal Japanese female subjects: Prospective cohort study. Plos One. 2012; 7: e52643 10.1371/journal.pone.0052643 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Takayanagi K, Morimoto S, Shirakura Y, Mukai K, Sugiyama T, Tokuji Y, et al. Mechanism of visceral fat reduction in Tsumura Suzuki obese, diabetes (TSOD) mice orally administered β-cryptoxanthin from Satsuma mandarin oranges (Citrus unshiu Marc). J Agric Food Chem. 2011; 59: 12342–12351. 10.1021/jf202821u [DOI] [PubMed] [Google Scholar]
  • 9.Yamaguchi M. Role of carotenoid β-cryptoxanthin in bone homeostasis. J Biomed Sci. 2012; 19: 36 10.1186/1423-0127-19-36 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Fancicullino AN, Dhuique-Mayer C, Luro F, Casanova J, Morillon R, Ollitrault P. (2006) Carotenoid diversity in cultivated citrus is highly influenced by genetic factors. J Agric Food Chem. 2006; 54: 4397–4406. 10.1021/jf0526644 [DOI] [PubMed] [Google Scholar]
  • 11.Kato M. Mechanism of carotenoid accumulation in citrus fruit. J Jpn Soc Hortic Sci. 2012; 81 (3): 219–233. [Google Scholar]
  • 12.Fanciullino AL, Dhuique-Mayer C, Luro F, Morillon R, Patrick Ollitrault P. Carotenoid Biosynthetic Pathway in the Citrus Genus:  Number of Copies and Phylogenetic Diversity of Seven Genes J. Agric. Food Chem. 2007; 55: 7405–7417. [DOI] [PubMed] [Google Scholar]
  • 13.Chen C, Costa MGC, Yu Q, Moore GA, Gmitter FG. Identification of novel members in sweet orange carotenoid biosynthesis gene families. Tree Genet. Genom.2010; 6:905–914. [Google Scholar]
  • 14.Endo T, Fujii H, Sugiyama A, Nakano M, Nkajima N, Ikoma Y, et al. Overexpression of a citrus basic helix-loop-helix transcription factor (CubHLH1), which is homologous to Arabidopsis activation-tagged bri1 suppressor 1 interacting factor genes, modulates carotenoid metabolism in transgenic tomato. Plant Sci. 2015; 243: 35–48. 10.1016/j.plantsci.2015.11.005 [DOI] [PubMed] [Google Scholar]
  • 15.Zhu F, Luo T, Liu C, Wang Y, Yang H, Yang W, et al. An R2R3-MYB transcription factor represses the transformation of a- and β-branch carotenoids by negatively regulating expression of CrBCH2 and CrNCED5 in flavedo of Citrus reticulate. New Phytologist. 2017; 216: 178–192. [DOI] [PubMed] [Google Scholar]
  • 16.Lu SW, Zhang Y, Zhu K, Yang W, Ye JL, Chai L, et al. The citrus transcription factor CsMADS6 modulates carotenoid metabolism by directly regulating carotenogenic genes. Plant Physiol. 2018; 176: 2657–2676. 10.1104/pp.17.01830 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Sugiyama A, Omura M, Matsumoto H, Shimada T, Fujii H, Endo T, et al. Quantitative trait loci (QTL) analysis of carotenoid content in citrus fruit. J Jpn Soc Hortic Sci. 2011; 80: 136–144. [Google Scholar]
  • 18.Wang J, Lin M, Crenshaw A, Hutchinson A, Hicks B, Yeager M, et al. High-throughput single nucleotide polymorphism genotyping using nanofluidic Dynamic Arrays. BMC Genomics. 2009; 10:561 10.1186/1471-2164-10-561 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Kawahara Y, Endo T, Omura M, Teramoto Y, Itoh T, Fujii H, et al. (2020) Mikan Genome Database (MiGD): integrated database of genome annotation, genomic diversity, and CAPS marker information for mandarin molecular breeding. Breeding. Science. 2020; 70 (2): 200–211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Sugiyama A, Omura M, Shimada T, Fujii H, Endo T, Shimizu T, et al. Expression quantitative trail loci analysis of carotenoid metabolism-related genes in Citrus. J Jpn Soc Hortic Sci. 2014; 83: 32–43. [Google Scholar]
  • 21.Zhang W, Dubcovsky J. Association between allelic variation at the Phytoene synthase 1 gene and yellow pigment content in the wheat grain. Theor Appl Genet, 2008; 116: 635–645. 10.1007/s00122-007-0697-8 [DOI] [PubMed] [Google Scholar]
  • 22.Palaisa KA, Morgante M, Williams M, Rafalski A. Contrasting effects of selection on sequence diversity and linkage disequilibrium at two phytoene synthase loci. Plant Cell. 2003; 15: 1795–1806. 10.1105/tpc.012526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Sugiyama A, Ikoma Y, Fujii H, Shimada T, Endo T, Shimizu T, et al. Structure and expression levels of alleles of citrus zeaxanthin epoxidase genes. J Jpn Soc Hortic Sci. 2010; 79: 263–274. [Google Scholar]
  • 24.Sugiyama A, Ikoma Y, Fujii H, Endo T, Nesumi H, Shimada T et al. Allelic diversity of phytoene synthase gene influences the transcription level in citrus fruit among a citrus F1 hybrid population. Breeding Sci. 2017; 67 (4): 382–392. 10.1270/jsbbs.17033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Imai A, Kuniga T, Yoshioka T, Nonaka K, Mitani N, Fukamachi H, et al. Genetic Background, Inbreeding, and Genetic Uniformity in the National Citrus Breeding Program, Japan. Hortic J. 2017; 86: 200–207. [Google Scholar]
  • 26.Kumar GR, Sakthivel K, Sundaram RM, Neeraja CN, Balachandran SM, Shobha N et al. Allele mining in crops: Prospects and potentials. Biotechnol Adv. 2010; 28: 451–461. 10.1016/j.biotechadv.2010.02.007 [DOI] [PubMed] [Google Scholar]
  • 27.Dellaporta SL, Wood J, Hicks JB. A plant DNA minipreparation: version II. Plant Mol Biol Rep. 1983; 1: 19–21. [Google Scholar]
  • 28.Wu GA, Prochnik S, Jenkins J, Salse J, Hellsten U, Murat F et al. Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication. Nature Biotechnol. 2014; Volume: 32: Pages: 656–662.Year published: 10.1038/nbt.2906 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Fujii H, Yamashita H, Hosaka F, Terakami S, Yamamoto T. Development of a software to presume the parent-child relationship using the result of DNA marker typing. J Jpn Soc Hortic Sci. 2010; 9: 34. [Google Scholar]
  • 30.Rouseff R, Ralay L. Application of diode array detection with a C-30 reversed phase column for the separation and identification of saponified orange juice carotenoids. J Agric Food Chem. 2009; 44: 2176–2181. [Google Scholar]
  • 31.Davies BH. Carotenoids In Chemistry and biochemistry of plant pigments. Ed. By Goodwin TW. Academic Press, London, New York and San Francisco: Vo. II, 38–165. [Google Scholar]
  • 32.Iwata H, Ebana K, Uga U, Hayashi T, Jannink JL. Genome-wide association study of grain shape variation among Oryza sativa L. germplasms based on elliptic Fourier analysis. Mol. Breed. 2010; 25: 203–215. [Google Scholar]
  • 33.Iwata H, Uga Y, Yoshioka Y, Ebana E, Hayashi T. Bayesian association mapping of multiple quantitative trait loci and its application to the analysis of genetic variation among Oryza sativa L. germplasms. Theor Appl Genet. 2007; 114: 1437–1449. 10.1007/s00122-007-0529-x [DOI] [PubMed] [Google Scholar]
  • 34.Iwata H, Hayashi T, Terakami S, Takada N, Sawamura Y, Yamamoto T. Potential assessment of genome-wide association study and genomic selection in Japanese pear Pyrus pyrifolia. Breed. Sci. 2013; 63: 125–140. 10.1270/jsbbs.63.125 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Fujii H, Ogata T, Shimada T, Endo T, Iketani H, Shimizu T, et al. MinimalMarker: an algorithm and computer program for the identification of minimal sets of discriminating DNA markers for efficient variety identification. J Comput Biol. 2013; 11: 1250022 10.1142/S0219720012500229 [DOI] [PubMed] [Google Scholar]
  • 36.Curk F, Ancillo G, García-Lor A, Luro F, Perrier X, Jacquemoud-Collet JP, et al. Next generation haplotyping to decipher nuclear genomic interspecific admixture in Citrus species: analysis of chromosome 2. BMC Genetics. 2014; 15: 152 10.1186/s12863-014-0152-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wu GA, Terol J, Ibanez V, López-García A, Pérez-Román E, Borredá C, et al. Genomics of the origin and evolution of Citrus. Nature. 2018; 554: 311–316. 10.1038/nature25447 [DOI] [PubMed] [Google Scholar]
  • 38.Ahmed D, Comte A, Curk F, Costantino G1, Luro F1, Dereeper A, et al. Genotyping by sequencing can reveal the complex mosaic genomes in gene pools resulting from reticulate evolution: a case study in diploid and polyploid citrus. Annals Botany. 2019; 123(7): 1231–1251. 10.1093/aob/mcz029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.García-Lor A, Luro F, Navarro L, Ollitrault P. Comparative use of InDel and SSR markers in deciphering the interspecific structure of cultivated citrus genetic diversity: A perspective for genetic association studies. Mol Genet Genomics. 2012; 287: 77–94. 10.1007/s00438-011-0658-4 [DOI] [PubMed] [Google Scholar]
  • 40.Minamikawa MF, Nonaka K, Kaminuma E, Kajiya-Kanegae H, Onogi A, Goto S, et al. nome-wide association study and genomic prediction in citrus: Potential of genomics-assisted breeding for fruit quality traits. Sci Rep. 2017; 7(1): 4721 10.1038/s41598-017-05100-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Nisar N, Li L, Lu S, Khin NC, Pogson BJ. Carotenoid metabolism in plants. Mol Plant. 2015; 8(1): 68–82. 10.1016/j.molp.2014.12.007 [DOI] [PubMed] [Google Scholar]
  • 42.Ruiz-Sola MA, Rodrı´guez-Concepcio´ na M. Carotenoid biosynthesis in Arabidopsis: a colorful pathway. Arabidopsis Book. 2012; 10: e0158 10.1199/tab.0158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Welsch R, Wust F, Bar C, Al-Babili S, Beyer P. A third phytoene synthase is devoted to abiotic stress-induced abscisic acid formation in rice and defines functional diversification of phytoene synthase genes. Plant Physiol. 2008; 147:367–380. 10.1104/pp.108.117028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Fraser PD, Enfissi EM, Halket JM, Truesdale MR, Yu D, Gerrish C, et al. Manipulation of phytoene levels in tomato fruit: effects on isoprenoids, plastids, and intermediary metabolism. Plant Cell. 2007; 19 (10): 3194–3211. 10.1105/tpc.106.049817 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Maass D, Arango J, Wust F, Beyer P, Welsch R. Carotenoid crystal formation in Arabidopsis and carrot roots caused by increased phytoene synthase protein levels. PLoS One. 2009; 4 (7); e6373 10.1371/journal.pone.0006373 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Jahns P, Holzwarth AR. The role of the xanthophyll cycle and of lutein in photoprotection of photosystem II. Biochim Biophys Acta. 2012; 1817: 182–193. 10.1016/j.bbabio.2011.04.012 [DOI] [PubMed] [Google Scholar]
  • 47.Galpaz N, Wang Q, Menda N, Zamir D, Hirschberg J. Abscisic acid deficiency in the tomato mutant high-pigment 3 leading to increased plastid number and higher fruit lycopene content. Plant J. 2008; 53:717–730. 10.1111/j.1365-313X.2007.03362.x [DOI] [PubMed] [Google Scholar]
  • 48.Vallabhaneni R, Wurtzel ET. Timing and biosynthetic potential for carotenoid accumulation in genetically diverse germplasm of maize. Plant Physiol. 2009; 50:562–572. 10.1104/pp.109.137042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Wolters AM, Uitdewilligen JAML, Kloosterman B, Hutten RB, Visser RF, van Eck H. Identification of alleles of carotenoid pathway genes important for zeaxanthin accumulation in potato tubers. Plant Mol. Biol. 2010; 73: 659–671. 10.1007/s11103-010-9647-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Finkelstein R. Abscisic acid synthesis and response. Arabidopsis Book. 2013; 11: e0166 10.1199/tab.0166 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Kato M, Matsumoto H, Ikoma Y, Okuda H, Yano M. The role of carotenoid cleavage dioxygenases in the regulation of carotenoid profiles during maturation in citrus fruit. J Exp Botany. 2006; 57:2153–2164. 10.1093/jxb/erj172 [DOI] [PubMed] [Google Scholar]
  • 52.Rodrigo MJ, Alquézar B, Zacarías L. Cloning and characterization of two 9-cis-epoxycarotenoid dioxygenase genes, differentially regulated during fruit maturation and under stress conditions, from orange (Citrus sinensis L. Osbeck). J Exp Botany 2006; 57: 633–643. [DOI] [PubMed] [Google Scholar]
  • 53.Ma G, Zhang L, Yungyuen W, Yahata M, Sato Y, Furuya T, et al. Accumulation of carotenoids in a novel citrus cultivar ’Seinannohikari’ during the fruit maturation. Plant Physiol Biochem. 2018; 129: 349–356. 10.1016/j.plaphy.2018.06.015 [DOI] [PubMed] [Google Scholar]
  • 54.Walter MH, Strack D. Carotenoids and their cleavage products: biosynthesis and functions. Nat Prod Rep 2011; 28: 663–692. 10.1039/c0np00036a [DOI] [PubMed] [Google Scholar]
  • 55.Zhang Y, Peng J, Zhu Y, Yang S, Wang X, Zhai W, et al. Genome wide Identification of CCD gene family in Citrus and effect of CcCCD4a on the color of citrus flesh. Sci Agric Sinica. 2020; 53: 1874–1889. [Google Scholar]
  • 56.Ma G, Zhang L, Matsuta A, Matsutani K, Yamawaki K, Yahata M, et al. Enzymatic formation of b-citraurin from b-cryptoxanthin and zeaxanthin by carotenoid cleavage dioxygenase4 in the flavedo of citrus fruit. Plant Physiol. 2013; 163: 682–695. 10.1104/pp.113.223297 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Yamamizo C, Kishimoto S, Ohmiya A. Carotenoid composition and carotenogenic gene expression during Ipomoea petal development. J Exp Botany. 2010; 61:709–719. 10.1093/jxb/erp335 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Ma G, Zhang L, Iida K, Madono Y, Yungyuen W, Yahata M, et al. Identification and quantitative analysis of β-cryptoxanthin and β-citraurin esters in Satsuma mandarin fruit during the ripening process. Food Chem. 2017; 234: 356–364. 10.1016/j.foodchem.2017.05.015 [DOI] [PubMed] [Google Scholar]
  • 59.Ariizumi T, Kishimoto S, Kakami R, Maoka T, Hirakawa H, Suzuki Y, et al. Identification of the carotenoid modifying gene Pale Yellow Petal 1 as an essential factor in xanthophyll esterification and yellow flower pigmentation in tomato (Solanum lycopersicum). Plant J. 2014; 79:453–465. 10.1111/tpj.12570 [DOI] [PubMed] [Google Scholar]
  • 60.Kishimoto S, Oda-Yamamizo C, Ohmiya A. Heterologous expression of xanthophyll esterase genes affects carotenoid accumulation in petunia corollas. Sci Rep. 2020; 10: 1299 10.1038/s41598-020-58313-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Osorio CE. The role of Orange gene in carotenoid accumulation: Manipulating chromoplasts toward a colored future. Plant Sci. 2019; 10:1235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Sun T, Yuan H, Cao H, Yazdani M, Tadmor Y, Li L. Carotenoid Metabolism in Plants: The Role of Plastids. Mol Plant. 2018; 11: 58–74. 10.1016/j.molp.2017.09.010 [DOI] [PubMed] [Google Scholar]
  • 63.Kover PX, Valdar W, Trakalo J, Scarcelli N, Ehrenreich IM, Purugganan MD, et al. A Multiparent advanced generation inter-cross to fine-map quantitative traits in Arabidopsis thaliana. PLoS Genet. 2009; 5(7): e1000551 10.1371/journal.pgen.1000551 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Pascual L, Albert E, Sauvage C, Duangjit J, Bouchet JP, Bitton F, et al. Dissecting quantitative trait variation in the resequencing era: complementarity of bi-parental, multi-parental and association panels. Plant Sci. 20116; 242: 120–130. [DOI] [PubMed] [Google Scholar]
  • 65.Bandillo N, Raghavan C, Muyco PA, Sevilla MAL, Lobina ITL, Dilla-Ermita CJ, et al. Multi-parent advanced generation inter-cross (MAGIC) populations in rice: progress and potential for genetics research and breeding. Rice (N Y). 2013; 6: 11 10.1186/1939-8433-6-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Meng L, Zhao X, Ponce K, Ye G, Leung H. QTL mapping for agronomic traits using multi-parent advanced generation intercross (MAGIC) populations derived from diverse elite indica rice lines. Field Crops Research. 2016; 189: 19–42. [Google Scholar]
  • 67.Chan EKF, Rowe HC, Hansen BG, Kliebenstein DJ. The complex genetic architecture of the metabolome. PLoS Genet. 2010; 6(11): e1001198 10.1371/journal.pgen.1001198 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Chen W, Gao Y, Xie W, Gong L, Lu K, Wans W, et al. Genome-wide association analyses provide genetic and biochemical insights into natural variation in rice metabolism. Nature Genet. 2014; 46(7): 714–21. 10.1038/ng.3007 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Chunxian Chen

10 Dec 2020

PONE-D-20-34918

Allelic composition of carotenoid metabolic genes in 13 founders influences carotenoid composition in fruits among Japanese citrus breeding population

PLOS ONE

Dear Dr. Shimada,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jan 24 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols

We look forward to receiving your revised manuscript.

Kind regards,

Chunxian Chen, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. PLOS requires an ORCID iD for the corresponding author in Editorial Manager on papers submitted after December 6th, 2016. Please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. Please see the following video for instructions on linking an ORCID iD to your Editorial Manager account: https://www.youtube.com/watch?v=_xcclfuvtxQ

3. Thank you for stating the following in the Acknowledgments Section of your manuscript:

"This work was partially supported by a grant from the Ministry of Agriculture,

Forestry, and Fisheries of Japan (Genomics-based Technology for Agricultural

Improvement, HOR-2003, DNA-marker breeding project), by a grant from the Project

of the Bio-oriented Technology Research Advancement Institution, NARO (the special

scheme project on advanced research and development for next-generation technology),

and by a Grant-in-Aid for Scientific Research of Japan Society for the Promotion of

Science (JSPS), Grant Number 23580055.".

i) We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form.

ii) Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows:

"The author(s) received no specific funding for this work.".

 iii) Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

Additional Editor Comments (if provided):

The paper was to identify alleles of carotenoid biosynthetic genes using citrus breeding parents and populations using the SureSelect target enrichment method. The results are of great value for understanding the diversity of carotenoid gene expressions and compositions. The paper was well written. The authors need attentions to some editorial issues in the paper.

Introduction:

Some early relevant works on citrus carotenoid biosynthetic genes/alleles and the pigment compositions should be cited, for example, those from Ollitrault’s group (e.g., Fanciullino et al. 2007) and Dr. Gmitter’s group (e.g., Chen et al. 2010).

Abstract and main text:

Abbreviations are needed only when they are used afterwards (no need of abbreviations otherwise). This rule is also applied to the main text.

Figures and tables:

They should be understood well without reference to the main text. Some legends and captions should have a bit more illustration. Likewise, abbreviations should be redefined provided that they are used there afterwards.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: I Don't Know

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: “Carotenoids are C40 lipophilic isoprenoid pigments biosynthesized from 5-carbon isoprene units” should be “Carotenoids are mostly C40 lipophilic isoprenoid pigments biosynthesized from 5-carbon isoprene units”. There are C30 carotenoids in citrus.

Valencia orange (Citrus sinensis Osbeck) and Lisbon lemon (Citrus limon Burm. f.) should be Valencia orange (C. sinensis Osbeck) and Lisbon lemon (C. limon Burm. f.)。

For a better understanding, the procedures in “SureSelect target enrichment of carotenoid metabolic genes in 13 founders” are suggested to be present with a diagram.

The abbreviations in the text, such as “PHY, 400 nm for ZCA, 452 nm for t-VIO, c-VIO, LUT, BCR, ACA, and ZEA, and 453 nm for BCA”, make it difficult to read and understand, full terms are suggested to be present.

I guess the sentences below Figure titles are notes, but not the text, so please add “Notes” before them. However, the ‘Fig. 1. Carotenoid metabolic pathway in citrus fruits’ should be better put in the section of ‘Introduction’, not in the ‘Results’. And QTL and eQTL sites in Fig.1 should be mentioned as specially generated from juice sacs, but not from the flavedo, albedo and segment membrane, for carotenoids and carotenogenesis in the later fruit tissues are quite different.

Please specify those negative, positive and optimum alleles were based on previous Estimated effect size from Bayesian statistical analysis.

Please rephrase the sentence of ‘When the mean values of conditions 1 and 2 were compared with those of condition 3, it was clear that PSY-c and PSY-g had negative effects on reducing the carotenoid concentration.’

Reviewer #2: 1. According to introduction, the authors mentioned “eQTLs of PSY, HYb, and ZEP could be mapped on the loci of their corresponding genes, revealing that their transcription is regulated primarily by cis-elements in their promoter regions.” So what are the polymorphism in the promoter region of these genes?

2. Filtering parameters for the SNP should be added. In the text, the authors mentioned that ”The number of SNPs in PSY, HYb, ZEP, and NCED with more than readable SNP quality score (>150) according to manufacturer’s description were 107, 31, 54 and 19 respectively”, and “Based on the SNP information, 17 SNP markers for PSY, 15 for HYb, 31 for ZEP, 8 NCED, and 5 for TCL were developed for a SNP genotyping assay using the GoldenGate assay system (Illumina) and Fluidigm BioMark™ HD assay system (Fluidigm).” in page 17. What kind of SNPs used? Nonsynomous mutations in coding regions or others?

3. Figure 9, the normal distribution and variance should be assessed before the Welch's test and the Tukey's HSD test were used for significance analysis, and the information should be added.

4. A mistake in page 7 “PYS, HYb, ZEP, and TCL possessed 7, 11, 5, and 4 independent alleles in 13 founders”, PYS should be PSY.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Dr. Juan Xu

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Feb 4;16(2):e0246468. doi: 10.1371/journal.pone.0246468.r002

Author response to Decision Letter 0


11 Jan 2021

Dear Reviewers

We are appreciated with careful checks, the meaningful comments and suggestions of reviewer, which are helpful to improve the quality of our manuscript. We describe our responses against each comment in below. All modifications in new manuscript are written in red characters.

Reviewer #1:

"Carotenoids are C40 lipophilic isoprenoid pigments biosynthesized from 5-carbon isoprene units" should be "Carotenoids are mostly C40 lipophilic isoprenoid pigments biosynthesized from 5-carbon isoprene units". There are C30 carotenoids in citrus.

Answer) The information of C30 apocarotenoids is added in ‘Introduction’

Valencia orange (Citrus sinensis Osbeck) and Lisbon lemon (Citrus limon Burm. f.) should be Valencia orange (C. sinensis Osbeck) and Lisbon lemon (C. limon Burm. f.)。

Answer) Academic names of them are changed according to reviewer’s comment.

For a better understanding, the procedures in "SureSelect target enrichment of carotenoid metabolic genes in 13 founders" are suggested to be present with a diagram.

Answer) According to reviewer’s comment, a new diagram is constructed and is added as a new Fig. S1 in new manuscript.

The abbreviations in the text, such as "PHY, 400 nm for ZCA, 452 nm for t-VIO, c-VIO, LUT, BCR, ACA, and ZEA, and 453 nm for BCA", make it difficult to read and understand, full terms are suggested to be present.

Answer) Full compound names are described in new manuscript.

I guess the sentences below Figure titles are notes, but not the text, so please add "Notes" before them. However, the 'Fig. 1.

Answer) ‘Note’ is added below the title of figure legends

Carotenoid metabolic pathway in citrus fruits' should be better put in the section of 'Introduction', not in the 'Results'. And QTL and eQTL sites in Fig.1 should be mentioned as specially generated from juice sacs, but not from the flavedo, albedo and segment membrane, for carotenoids and carotenogenesis in the later fruit tissues are quite different.

Answer) Fig. 1 is moved to ‘Introduction’ according to reviewer’s comment. In addition, the information which sample was used in fruit tissues was lacked in the previous manuscript. We used juice sac tissues in the analysis. This information is added at various places in new manuscript.

Please specify those negative, positive and optimum alleles were based on previous Estimated effect size from Bayesian statistical analysis.

Answer) We defied the allele with more than one third of the max effect size (ζ) based on results of Fig.7 as positive or negative alleles for the measured carotenoids. For an example in total carotenoid, the alleles having more than (ζ =) + 0.249 or – 0249 effect size are defined as the ‘positive’ or the ‘negative’ effect alleles.

On one hand, the optimum allele means to possess the strongest effect among positive alleles, that is PSY-a and ZEP-e for total carotenoid.

These explanations are added separately in the result of new manuscript.

Please rephrase the sentence of 'When the mean values of conditions 1 and 2 were compared with those of condition 3, it was clear that PSY-c and PSY-g had negative effects on reducing the carotenoid concentration.'

Answer) The precious sentence was not correct as reviewer suggested. This sentence is modified in new manuscript. In addition, a new sentence is added to describe allelic interaction between positive and negative alleles, which was observed in in condition 4.

Reviewer #2:

1. According to introduction, the authors mentioned "eQTLs of PSY, HYb, and ZEP could be mapped on the loci of their corresponding genes, revealing that their transcription is regulated primarily by cis-elements in their promoter regions." So what are the polymorphism in the promoter region of these genes?

Answer) Citrus varieties possess high heterozygosity and high polymorphism in their genomes. Promoter sequences of citrus varieties are highly varied and numerous polymorphisms could be detected when compared. We considered that some or most of those polymorphisms might be meaningless in contrast to homogenous plants with less polymorphism. Sequence variation in promoter region would like to occur highly by the combination of insertion, deletion and point mutation among citrus varieties, some of which might happen in the process of vegetative reproduction for a long time period. On one hand, functional cis-elements in promoter region should be conserved within the functionally synonymous alleles. However, comparing analysis of promoter sequence among citrus varieties cannot provide the definitive information to find the cis-element site regulating for the transcription level from the numerous polymorphisms. From these reasons, we carried out allele mining method using SNP in gene region. We add the reason why SNP is collected from gene region in ‘Result’ of new manuscript.

2. Filtering parameters for the SNP should be added. In the text, the authors mentioned that "The number of SNPs in PSY, HYb, ZEP, and NCED with more than readable SNP quality score (>150) according to manufacturer's description were 107, 31, 54 and 19 respectively", and "Based on the SNP information, 17 SNP markers for PSY, 15 for HYb, 31 for ZEP, 8 NCED, and 5 for TCL were developed for a SNP genotyping assay using the GoldenGate assay system (Illumina) and Fluidigm BioMark™ HD assay system (Fluidigm)." in page 17. What kind of SNPs used? Nonsynomous mutations in coding regions or others?

Answer) For better understanding of filtering of SNPs, new S1 Fig. is constructed. The condition of MARCO software is also added in new manuscript.

Regarding for the question of what kind of SNPs, we used all inheritable SNPs in gene region among all examined trio-sets (parents and their children), in which both of nonsynomous and synonymous mutation were included. To supplement this information, S1 Table is modified to add the information of amino acid information on SNP position.

3. Figure 9, the normal distribution and variance should be assessed before the Welch's test and the Tukey's HSD test were used for significance analysis, and the information should be added.

Answer) We confirmed the normality of the distribution by Kolmogorov-Smirnov’s one sample test before the significance analysis. This explanation was added in notes of Figure. 9, S3 Fig and S4 Fig.

4. A mistake in page 7 "PYS, HYb, ZEP, and TCL possessed 7, 11, 5, and 4 independent alleles in 13 founders", PYS should be PSY.

Answer) PYS is modified to PSY.

Attachment

Submitted filename: Letter for Editor (BCR)(R3.1.6).docx

Decision Letter 1

Chunxian Chen

20 Jan 2021

Allelic composition of carotenoid metabolic genes in 13 founders influences carotenoid composition in juice sac tissues of fruits among Japanese citrus breeding population

PONE-D-20-34918R1

Dear Dr. Shimada,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Chunxian Chen, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Chunxian Chen

22 Jan 2021

PONE-D-20-34918R1

Allelic composition of carotenoid metabolic genes in 13 founders influences carotenoid composition in juice sac tissues of fruits among Japanese citrus breeding population

Dear Dr. Shimada:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Chunxian Chen

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Schematic diagram to develop the trio-tagged SNPs in the alleles of 5 target genes in 13 founders.

    (TIF)

    S2 Fig

    Estimated effect size (ζ) of each allele types in the 5 target genes regions for (A) α- carotene, (B) β-carotene, (C) lutein, (D) phytoene, (E) ζ-carotene, and (F) zeaxanthin, respectively. Red, blue, and purple circle indicated that the allele type among the 5 target genes had the highest effect size, the largest variance, and both the highest effect size and the largest variance, respectively. The characters in the brackets mean the product of the posterior means of γ.

    (TIF)

    S3 Fig. Box plot analysis to verify the effectiveness of optimum PSY and ZEP alleles for the enrichment of β-cryptoxanthin in juice sac tissues of fruits.

    Box edges represent the upper and lower quantiles, with the median value shown as a bold line in the middle of the box. The circles indicate an outlier. (A) Comparison of β-cryptoxanthin content by the number of alleles with positive effect (PSY-a) and negative effect (PSY-c or -g). Condition 1, PSY-a = 0 & (PSY-c or -g) = 2 Condition 2, PSY-a = 0 & (PSY-c or -g) = 1; Condition 3, PSY-a = 0 & (PSY-c or -g) = 0; Condition 4, PSY-a = 1 & (PSY-c or -g) = 1; Condition 5, PSY-a = 1 & (PSY-c or -g) = 0; Condition 6, PSY-a = 2 & (PSY-c or -g) = 0. Means were tested for significant differences using Tukey’s HSD test at P < 0.05, following the normality of the distribution was confirmed not to be rejected at P = 0.05 by Kolmogorov-Smirnov’s one sample test. β-cryptoxanthin content was analyzed as square root-transformed values. N/A indicates values eliminated from the Tukey’s HSD test due to insufficient data. (B) Comparison of total carotenoid content by the number of alleles with a positive effect (ZEP-a, -b, -c, or -e) under the PSY allele-type conditions (A). Means were tested for significant differences using Tukey’s HSD test at P < 0.05 under conditions 2 and 5. Means between ’+Allele = 1’ and ’+ Allele = 2’ were tested for significant differences using Welch’s two-sample t-test under condition 3, following the normality of the distribution was confirmed not to be rejected at P = 0.05 by Kolmogorov-Smirnov’s one sample test. ’ns’ indicates not significant at P < 0.05. β-cryptoxanthin content was analyzed as square root-transformed values.

    (TIF)

    S4 Fig. Box plot analysis to verify the effectiveness of optimum PSY and ZEP alleles for the enrichment of violaxanthin in juice sac tissues of fruits.

    Box edges represent the upper and lower quantiles, with the median value shown as a bold line in the middle of the box. The circles indicate an outlier. (A) Comparison of violaxanthin content by the number of alleles with positive effect (PSY-a) and negative effect (PSY-c or -g). Condition 1, PSY-a = 0 & (PSY-c or -g) = 2 Condition 2, PSY-a = 0 & (PSY-c or -g) = 1; Condition 3, PSY-a = 0 & (PSY-c or -g) = 0; Condition 4, PSY-a = 1 & (PSY-c or -g) = 1; Condition 5, PSY-a = 1 & (PSY-c or -g) = 0; Condition 6, PSY-a = 2 & (PSY-c or -g) = 0. Means were tested for significant differences using Tukey’s HSD test at P < 0.05, following the normality of the distribution was confirmed not to be rejected at P = 0.05 by Kolmogorov-Smirnov’s one sample test. Violaxanthin contents were analyzed as square root-transformed values. N/A indicates values eliminated from the Tukey’s HSD test due to insufficient data. (B) Comparison of total carotenoid content by the number of alleles with a positive effect (ZEP-a, -b, -c, or -e) under the PSY allele-type conditions (A). Means were tested for significant differences using Tukey’s HSD test at P < 0.05 under conditions 2 and 5. Means between ’+Allele = 1’ and ’+ Allele = 2’ were tested for significant differences using Welch’s two-sample t-test under condition 3. ns indicates not significant at P < 0.05, following the normality of the distribution was confirmed not to be rejected at P = 0.05 by Kolmogorov-Smirnov’s one sample test. Violaxanthin contents was analyzed as square root-transformed values.

    (TIF)

    S5 Fig. Phylogenetic tree analysis of 7 PSY independent alleles in 13 founders by the neighbor-joining method.

    The tree is divided into two major nodes. The positive allele (PSY-a) and negative alleles (PSY-c and PSY-g) were clustered into different nodes, revealing that these alleles might be divergent from the common ancestral PSY. Numbers above branches are bootstrap values based on 1000 replicates and numbers under ranches are genetic distance for each pair of nodes.

    (TIF)

    S6 Fig. Phylogenetic tree analysis of 11 ZEP independent alleles in 13 founders by the neighbor-joining method.

    The tree is divided into two major nodes. The positive alleles (ZEP-a, b, c, e) and negative alleles (ZEP-f, g, h, k) are mixed in those nodes, revealing that the functional divergence of those alleles might have occurred after the divergence of 2 functionally different ZEP ancestral alleles. Numbers above branches are bootstrap values based on 1000 replicates and numbers under ranches are genetic distance for each pair of nodes.

    (TIF)

    S1 Table. Information on reliable trio-tagged SNPs with lineage by trio-analysis.

    (PDF)

    S2 Table. SNP genotype of trio-tagged SNPs for 5 target genes.

    (PDF)

    S3 Table. Allelic genotype of 5 target genes in major varieties and cultivars.

    (PDF)

    Attachment

    Submitted filename: Letter for Editor (BCR)(R3.1.6).docx

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting Information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES