Abstract
Atlantic Giant (AG) pumpkin (Cucurbita maxima) produces the world’s largest fruit. Elucidating the molecular mechanism of AG fruit formation is of scientific and practical importance. In this research, genome-wide resequencing of an F2 population produced by a cross between AG and its small-fruit ancestor Hubbard was used to identify quantitative trait loci (QTLs) and candidate genes. Transgressive segregation of fruit size-related traits was observed in the F2 population, suggesting that fruit size was a quantitative trait controlled by multiple genes. A genetic map with an average physical distance of 154 kb per marker was constructed, and 13 QTLs related to fruit size were identified using bin-map construction. RNA sequencing analysis revealed that pathways associated with assimilate accumulation into the fruit, including carbohydrate metabolism, were significantly enriched in differentially expressed genes. According to the predicted impact of mutation on the biological function of certain proteins, 13 genes were selected as candidate genes associated with fruit size, among which two phytohormone-related genes, CmaCh17G011340 (a flavin-containing monooxygenase) and CmaCh04G029660 (a leucine-rich repeat protein kinase) were chosen for further investigation. Finally, one insertion-deletion (inDel) and three single nucleotide polymorphisms (SNPs) were successfully transformed to Kompetitive Allele-Specific PCR (KASP) markers. The novel QTLs and candidate genes identified provide insights into the genetic mechanism of large fruit formation of AG, and the genetic map and tightly linked KASP markers developed in this study can be employed for marker-assisted breeding to alter fruit size of C. maxima.
Keywords: Atlantic Giant, fruit size, genome-wide resequencing, KASP marker, Cucurbita maxima
Introduction
The world’s largest fruit is produced by the Atlantic Giant (AG) pumpkin (Pan et al., 2021). Each year, for giant pumpkin growers, giant pumpkin weigh off is the last of the great outdoor festivals. Due to the efforts of these growers from both aspects of breeding and cultivation, new records are created for the weight of the largest pumpkin every 1–2 years. To date, the world record of the fruit weight of AG is 1226 kg (Guinness World Records, 2021). It is believed that the current cultivated AG originated from the Hubbard squash (Sanjur et al., 2002), and the dramatic increase in the fruit weight of the modern AG from its ancestor has occurred only in the last 100 years (Hu et al., 2011).
It is interesting and important to know the genetic and physiological changes between AG and Hubbard caused by artificial selection and crossing during the last century. To date, only several studies have focused on the differences in the morphology and physiology between the two varieties. Basically, on the source side, the leaf area difference between the two types of pumpkins is slight (Savage et al., 2015; Pan et al., 2021). The leaf photosynthetic rate (Pn) of the two varieties was not significantly different at anthesis stage (Savage et al., 2015). However, at the fruit fast-growing stage, the Pn of the fruit-carrying leaf was higher in AG than in Hubbard (Pan et al., 2021). Along the assimilate translocation pathway, the intersecting surface of both the xylem and the phloem in both the pedicels and petioles of Hubbard was smaller than that of AG, while the phloem flow rate was similar between the two pumpkins (Savage et al., 2015; Pan et al., 2021). On the sink side, data from both Nakata et al. (2012) and our study (Pan et al., 2021) indicated that the larger size of AG fruit is due to faster increases in both the number and size of the fruit cells. Logically, the abovementioned differences should be governed by genetic divergency between the two varieties. However, unfortunately, which genetic loci have been altered by breeding and selection from Hubbard to AG during the last century remains largely unclear. The only certainty is that polyploidy is not the reason for the large fruit formation of AG (Tatum et al., 2006; Nakata et al., 2012).
Fruit size is an important quality and yield trait of cucurbits and other fruiting crops. As a result, the genetic mechanism of fruit size has been widely investigated in different plants using different genetic tools. Basically, fruit size is determined by the number and size of fruit cells and derived by cell proliferation and cell expansion during organ growth. A number of key components controlling cell proliferation have been identified in different plants. In maize (Zea mays), cellulose synthase-like D1 promotes cell division by participating in cell wall polysaccharide formation (Li N. et al., 2018). In legumes, a KIX domain protein gene bigger organs and an ortholog of PEAPOD gene ELE1 interact with each other and negatively regulate organ size by repressing cell proliferation (Li et al., 2019). The cauliflower (Brassica oleracea) curd development-associated gene (CDAG1) can lead to increased cell number and enlarged organ size (Li et al., 2017). In Arabidopsis (Arabidopsis thaliana), AINTEGUMENTA and ErbB-3 binding protein 1 (ZmEBP1) can increase organ size by accelerating cell division (Wang Q. L. et al., 2016; Wang T. Y. et al., 2016; Ding et al., 2018). In Arabidopsis, soybean (Glycine max) and cucumber (Cucumis sativus), STERILE APETALA family genes also promote cell proliferation by modulating PPD or BIG SEEDS1 (Li W. Y. et al., 2018; Yang et al., 2018; Yin et al., 2020). Genes associated with cell expansion also play key roles in controlling organ size. Wang Q. L. et al. (2016) reported that the maize ADP-ribosylation factor ZmArf2 promotes cell expansion and enhances organ size in Arabidopsis. In addition, some uncharacterized genes, such as Cell Size Regulator in tomato (Solanum pimpinellifolium) and DUF4 in Arabidopsis, were reported to promote organ size by increasing cell size rather than cell number (Mu et al., 2017; Chen et al., 2018). Cytochrome P450 genes have been found to be involved in controlling organ size and development in Arabidopsis, rice, soybean, tomato (Solanum lycopersicum) and sweet cherry (Prunus avium; Chakrabarti et al., 2013; Yang et al., 2013; Zhao et al., 2016; Nobusawa et al., 2021). Phytohormones are important factors promoting cell division and growth. It has been reported that auxin, cytokinin, gibberellin, brassinosteroid (BR) and genes (Davies, 1987; Mandava, 1988; Mok and Mok, 1994; De León et al., 2004; Achard et al., 2009; Devoghalaere et al., 2012; Jiang et al., 2012; Qi et al., 2017) in their signaling pathways are closely related to organ size. In addition, polyploidy and locule number are associated with the fruit size of some species (Cheniclet et al., 2005; Muńos et al., 2011). Pan et al. (2020) surveyed several cucurbit genomes and identified more than 200 homologs of organ size-related genes, including cytochrome P450, CSR (cell size regulator) and CNR (cell number regulator).
The genomics research and genome sequencing of Cucurbita maxima have achieved great advances in recent years (Sun et al., 2017). Several high-density genetic maps were constructed using simple sequence repeat (SSR) markers or single nucleotide polymorphism (SNP), and quantitative trait loci (QTLs) of interested agronomic traits, such as dwarf vine, seed shape and weight, fruit carotenoid content and other fruit-associated traits were identified (Van Der Knaap et al., 2014; Zhang et al., 2015; Kaźmińska et al., 2018, 2020). These studies provide useful information and tools to unlock the mystery of the formation of world’s largest fruit. In this research, a population with 102 F2 individuals was generated from a cross between AG and Hubbard. The population was then resequenced, and a high-density genetic map was developed. Afterward, QTLs for fruit cell number, fruit cell volume, fruit weight and fruit volume were identified, and 4 Kompetitive Allele-Specific PCR (KASP) markers were established. These studies will facilitate the identification of key genes regulating fruit size. In addition, the high-density genetic map and linked markers will provide useful tools for marker-assisted selection (MAS) in C. maxima breeding programs.
Materials and methods
Plant materials and phenotyping
The seeds of two parents of C. maxima, Hubbard and Atlantic Giant (AG) were obtained from the Sustainable Seed Company (Covelo, California, United States). A segregating population of 102 F2 plants was constructed from a cross between the female parent AG and the male parent Hubbard. Self-pollination was further performed for these F2 individuals to generated F2:3 families. Two parents, F1 and F2:3 plants were grown in the spring of 2020 in plastic tunnels at Yangzhou University (119°26′E, 32°24′N), Yangzhou, China. Plant management was performed according to Nakata et al. (2012) with a few modifications. Basically, the row and plant spacing were 2 and 5 m, respectively. A hole 1 m in diameter and 50 cm deep was dug for every plant. Fertilizers, including 20 kg of cow manure, 10 kg of rapeseed lees, 10 kg of CaSO4 fertilizer and 10 kg of fertilizer (N:P2O5:K2O = 15:10:15), were applied to each plant. Half of the fertilizers were put into the hole, and the rest were spread over the field. Only one fruit remained per plant (usually between the 14th and 18th nodes from bottom to top of the plant). Water and fertilizer management was performed to ensure the consistency of the growth condition among all plants.
A randomized block design was performed to collected the phenotypic data. Basically, twenty individuals were planted for each line (two parents, the F1 population, 102 F2:3 families and six cultivars with distinct fruit sizes) as a plot (200 m2) and the experiment was repeated 3 times in different plastic tunnels. The weight (W1), volume (V1), cell volume (CV) and the number of sarcocarp cells (CN) of the fruit was measured according to our previous study (Pan et al., 2021). The weight (W2) and volume (V2) of the sarcocarp were obtained after pumpkin fruit was cut in half and the seeds and placenta were removed. Each phenotypic value was the mean of 3 replicates and the data of each replicate were calculated from 10 randomized selected plants in the plots. Correlations were calculated using the Pearson correlation coefficient by Statistica 12 software (Statsoft Inc., United States) at p ≤ 0.05.
Nucleic acid isolation and sequencing
Genomic DNA of the two parents and all 102 F2 plants was extracted from young leaves using a Plant Genomic DNA Extraction Kit (TIANGEN, Beijing, China) according to the manufacturer’s instructions.1 RNA was extracted from 20 days after anthesis (the exponential growth period) fruit sarcocarp tissues using RNAiso Plus (Takara, Dalian, China) according to the manufacturer’s instructions.2 For RNA extraction, each sample was a mix of nine parts from three fruits, and three independent biological replicates were performed (Pan et al., 2021). The purity and integrity of the RNA and DNA samples were determined by 1% agarose gel electrophoresis, and the concentration of nucleic acids was measured in a NanoDrop. High-throughput sequencing for DNA and RNA libraries was performed by BGI Tech3 using a BGISEQ-500/MGISEQ-2000 system or Novogene4 using an Illumina HiSeq TM 4000 system, respectively.
Data analysis and single nucleotide polymorphism genotyping
To obtain clean reads, the adapter sequences, low-quality reads (Q ≤ 10) and N > 5% reads were filtered out by the Soapnuke tool (1.5.6, BGI Tech). Then, using Burrows–Wheeler Aligner (BWA) software, all clean reads were mapped with the C. maxima genome sequence5 (Li and Durbin, 2009). Picard tools6 were used for the realignment of inDel regions and removal of PCR duplicates. Genome analysis toolkit (GATK) software was employed to call and filter genome-wide SNPs with default parameters (McKenna et al., 2010). The heterozygous alleles in both parents were removed during the process.
Bin-map construction and quantitative trait loci analysis
SNPs on the same scaffold with identical genotypes across the F2 population were concatenated into a bin using SNPbinner (Chen et al., 2014). Bins less than 5 kb were initially filtered out, and bins with an extreme segregation distortion (P < 0.001) by the χ2 test were eliminated. The filtered bin-markers were then employed for map construction by LepMAP3 (Rastas, 2017). Linkage groups (LGs) were constructed using likelihood odds (LOD) ratios ≥ 5.0 and genetic map distances were calculated with the Kosambi mapping function (Kosambi, 1944). R/qtl software was employed for QTL identification based on the inclusive composite interval mapping (ICIM) model (Broman et al., 2003). Calculations using 1000 permutations were used to set the significant LOD threshold of QTLs. QTLs of the six traits associated with fruit size, CN, CV, W1, V1, W2 and V2 mentioned above, were identified.
Bioinformatics analysis of RNA sequencing data
Raw reads were filtered to eliminate adaptor sequences, low-quality reads with > 10% unknown nucleotides and poly-N. The clean reads were mapped to the C. maxima genome (see text footnote 5) using hierarchical indexing for spliced alignment of transcripts 2 (HISAT2; Kim et al., 2019). Read counts of annotated genes were calculated using StringTie (Pertea et al., 2016). fragments per transcript kilobase per million fragments mapped (FPKM) was used to estimate gene expression levels (Trapnell and Salzberg, 2009). Genes meeting the log2 (fold change) ≥ 1 and false discovery rate (FDR) ≤ 0.01 were defined as differentially expressed genes (DEGs) between the two parents. The Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment was analyzed according to biological pathways.7
qRT–PCR for candidate genes
The expression of genes located in the QTL regions that were associated with more than one trait and differentially expressed between the two parents was further verified by Quantitative real-time polymerase chain reaction (qRT–PCR). The experiments were conducted using the same RNA samples as those used for RNA sequencing mentioned above. The primers for this study are listed in Supplementary Table 1.
Evaluation of biological function of single nucleotide polymorphisms
The impact of the “missense_variant” SNPs on the biological function of certain proteins was evaluated using PROVEAN (Choi et al., 2012), and only SNPs with a prediction of “Deleterious” calculated in alternative directions were selected.
Development and analysis of Kompetitive Allele-Specific PCR markers
Five missense SNPs in key candidate genes and inDels in the coding region of genes located in QTL regions were selected to develop KASP markers. Two allele-specific forward primers and one common reverse primer (Supplementary Table 2) for KASP™ assays were designed. The developed KASP markers were used to differentiate the polymorphism by genotyping the two parents, the entire F2 population and several other C. maxima cultivars with distinct fruit sizes.
In silico analysis
Sequences of Flavin-containing monooxygenase family protein (YUCCA) and leucine-rich repeat receptor-like protein kinase (LRR-RLK) proteins were collected from (see text footnote 5), TAIR,8 and previous studies (Abu-Zaitoon, 2014; Song et al., 2015). Multiple sequence alignments were carried out using DNAMAN 6.0 (LynnonBiosoft, United States), and phylogenetic relationships were established by neighbor joining using aligned amino acid sequences analyzed by Mega X (Institute of Molecular Evolutionary Genetics, The Pennsylvania State University, University Park, United States).
Results
Valuation of fruit size-associated traits
Although the fruit weight of AG in this study was much lighter than the world heaviest fruit record (1226 kg) due to the limited growth condition, several fruit-size associated traits, including V1, W1, V2 and W2 of AG, were still much higher than those of its small fruit ancestor Hubbard. All F1 plants showed an intermediate phenotype in these traits. Transgressive segregation was found in the F2:3 population for all investigated traits, and the average value was skewed toward the small fruit-size parent (Figure 1). The correlation analysis indicated that the six tested traits were closely correlated, except for the correlation between CN and CV (Supplementary Table 3).
Sequencing, genotyping, and genetic map construction
Genome-wide resequencing (GWRS) was performed for two parents (AG and Hubbard) and 102 F2 plants. As shown in Supplementary Table 4, approximately 44.32 and 44.52 million cleaned reads from parents Hubbard and AG, respectively, and an average of 4.23 million cleaned reads from F2 individuals were obtained. Among these reads, the average GC content was 38.56% and the Q30 score was more than 88.41%. The average depths of Hubbard, AG and F2 plants were 7, 7 and 1, respectively. Of these reads, 76.94% from AG, 74.34% from Hubbard and 78.87% from F2 plants were mapped to the genome of C. maxima and employed for SNP calling. Finally, a total of 208626 polymorphic SNPs between the two parents were obtained. The SNPs were found to be distributed across all 20 LGs, as illustrated in Figure 2. The detailed information of these SNPs is listed in Supplementary Table 5. Therefore, these SNPs were further combined into 2973 bin markers (Qi et al., 2021). These bin markers were used to construct the 20 LGs, and individual LGs consisted of 95 to 284 markers for LG7 and LG4, respectively, with an average of 148.25 markers per LG. The size of the LGs ranged from 69.63 to 189.56 cM for LG7 and LG4, respectively. The genetic map spans a total of 2337.88 cM, and the mean interval between bin markers was approximately 0.96 cM, and the maximum distance between the bins ranged from 4.42 to 18.44 cM. According to the pumpkin genome size (373.9 Mb), the map had an mean physical spacing of 154 kb per bin (Table 1). The recombination bin-map, heatmaps and synteny analysis are shown in Supplementary Figures 1–3, and the obtained results indicated that a genetic map with high quality and good collinearity was constructed.
TABLE 1.
LG ID | Spearman (R2) | Total distance (cM) | Total bin marker | Unique bin marker | Mean distance (cM) | Max Gap (cM) | Gap < 5 cM (%) |
LG01 | 0.999872983 | 150.27 | 170 | 148 | 1.02 | 10.44 | 98.65 |
LG02 | 0.999959086 | 121.221 | 170 | 151 | 0.8 | 8.92 | 98.68 |
LG03 | 0.999888817 | 130.137 | 155 | 134 | 0.97 | 15.15 | 98.51 |
LG04 | 0.997327309 | 189.561 | 284 | 216 | 0.88 | 12.51 | 99.54 |
LG05 | 0.999454392 | 114.41 | 160 | 128 | 0.89 | 8.92 | 96.88 |
LG06 | 0.999896691 | 109.936 | 132 | 108 | 1.02 | 8.41 | 98.15 |
LG07 | 0.999839022 | 69.633 | 95 | 87 | 0.8 | 4.42 | 100 |
LG08 | 0.938374327 | 133.1 | 132 | 94 | 1.42 | 18.44 | 94.68 |
LG09 | 0.999131348 | 103.992 | 129 | 108 | 0.96 | 4.92 | 100 |
LG10 | 0.999923118 | 93.901 | 109 | 92 | 1.02 | 11.99 | 97.83 |
LG11 | 0.999992659 | 128.076 | 187 | 175 | 0.73 | 9.42 | 99.43 |
LG12 | 0.999702745 | 110.412 | 173 | 140 | 0.79 | 8.92 | 99.29 |
LG13 | 0.999953664 | 99.808 | 110 | 98 | 1.02 | 11.47 | 96.94 |
LG14 | 0.9999501 | 139.965 | 181 | 155 | 0.9 | 10.44 | 98.71 |
LG15 | 0.99998174 | 106.182 | 122 | 109 | 0.97 | 10.44 | 96.33 |
LG16 | 0.999962663 | 97.211 | 141 | 111 | 0.88 | 9.42 | 99.1 |
LG17 | 0.999791751 | 110.711 | 151 | 99 | 1.12 | 13.03 | 96.97 |
LG18 | 0.999806121 | 112.865 | 147 | 110 | 1.03 | 5.41 | 98.18 |
LG19 | 0.999720432 | 96.247 | 103 | 91 | 1.06 | 8.92 | 97.8 |
LG20 | 0.999714443 | 120.244 | 114 | 91 | 1.32 | 12.51 | 94.51 |
Total | 2337.882 | 2965 | 2445 | 0.956 | 18.44 | 98.2 |
Max Gap: The largest Gap in the linkage group. The smaller the maximum Gap is, the more uniform the map is. Gap < 5 cM: the proportion of bin markers whose Gap is less than 5 cM to all bin markers. Spearman coefficient: The closer Spearman coefficient is to 1, the better the collinearity between the genetic graph and the physical graph.
Quantitative trait loci identification
Using joint analysis (LOD ≥ 4), based on the phenotyping data, 13 QTLs related to CN (2 QTLs), CV (2 QTLs), V1 (1 QTL), V2 (3 QTLs), W1 (2 QTLs) and W2 (3 QTLs) were identified (Figure 3). The information of each QTL, including LOD, map position, percentage of phenotypic variance explained (PVE) and threshold and LOD maximum values are presented in Table 2. Of note, several QTLs were found to be related to more than one trait, such as LG3 115.43 cM to 115.429 cM (related to both CN and V2) and LG4 188.58 cM to 189.561 cM (related to V1, V2, W1, and W2), confirming that there are intrinsic connections among these traits, as shown in Supplementary Table 3. These results indicated that fruit weight and volume, no matter whether include seeds and placenta, are closely correlated each other, and both CN and CV make important contributions of the large fruit of AG. Thus, genes located in these QTLs are worthy of our attention. Detailed gene information located in these QTLs is listed in Supplementary Table 6.
TABLE 2.
Triait | LG ID | Start (cM) | End (cM) | Min (LOD) | Max (LOD) | Min (PVE%) | Max (PVE%) | Start (bp) | End (bp) |
CN | LG03 | 115.43 | 115.429 | 4.483 | 4.483 | 16.09029329 | 18.32441785 | 7770003 | 7781760 |
CN | LG14 | 11.782 | 12 | 4.647 | 4.647 | 18.92850129 | 18.92890772 | 1298851 | 1343835 |
CV | LG04 | 182.7 | 183.678 | 9.743 | 10.246 | 35.590225 | 37.03511312 | 18536335 | 18644674 |
CV | LG20 | 115.83 | 116.322 | 4.377 | 4.913 | 15.01135274 | 18.35401604 | 8701462 | 8750248 |
V1 | LG04 | 188.58 | 189.071 | 5.164 | 5.393 | 17.83358354 | 21.61142548 | 19414367 | 19495938 |
V2 | LG03 | 115.43 | 115.429 | 4.605 | 4.605 | 17.06621369 | 18.77444921 | 7770003 | 7781760 |
V2 | LG04 | 188.58 | 189.561 | 4.767 | 5.463 | 19.36394567 | 21.86076583 | 19414367 | 19814128 |
V2 | LG17 | 90.09 | 91 | 4.485 | 5.194 | 18.33175041 | 20.90470268 | 7740462 | 7986582 |
W1 | LG04 | 188.58 | 189.561 | 4.697 | 5.36 | 19.11051215 | 21.49636912 | 19414367 | 19814128 |
W1 | LG17 | 90.09 | 91 | 4.485 | 4.999 | 18.33290159 | 20.20505156 | 7740462 | 7986582 |
W2 | LG04 | 188.58 | 189.561 | 4.339 | 5.38 | 17.79393599 | 21.56842962 | 19414367 | 19814128 |
W2 | LG14 | 137 | 138.495 | 4.375 | 4.425 | 17.16672181 | 18.10998139 | 13609485 | 14221370 |
W2 | LG17 | 90.58 | 90.58 | 4.458 | 4.4584 | 17.40514744 | 18.23280147 | 7749751 | 7822436 |
Differentially expressed genes in fruits between Hubbard and Atlantic Giant
To further reveal gene expression regulation in pumpkin fruit size, we employed RNA-seq technology to analyze the gene expression profiles of fruits for Hubbard and AG at the fast growing stage (20 days after anthesis, DAA). RNA-seq produced 76.59 MB and 69.14 MB clean reads from Hubbard and AG (Q30 score ≥ 93%), and 95.07 and 95.17% of them could be mapped to the C. maxima genome, respectively. Between the Hubbard and AG groups, 2964 DEGs were identified (Figure 4A). Compared with Hubbard, the expression levels of 1332 genes were upregulated, and 1632 genes were downregulated in AG (Figure 4A). To elucidate the transcriptome differences between the two types of pumpkins, KEGG (Kyoto Encyclopedia of Genes and Genomes) analysis was carried out. We observed that some pathways associated with assimilate accumulation and fruit enlargement, such as carbohydrate metabolism and BR signaling, were significantly enriched (Figure 4B). Analysis of the RNA-Seq data for all genes located in QTL regions suggested that 32 genes were differentially expressed between AG and Hubbard, among which 14 genes were related to more than one trait (Supplementary Table 7). The expression differences of these transcripts were confirmed with qRT–PCR (Figure 4C).
Candidate gene analysis
The SNP type among QTL genes was analyzed and only “missense_variant” was selected. Furthermore, 11 genes which have SNPs showed “Deleterious”calculated with PROVEAN were selected as candidate genes (Supplementary Table 8). In addition, insertion-deletion (inDel) mutations occurring in coding regions of QTL genes were also analyzed using PROVEAN, and two genes, CmaCh04G029620 (annotated as the Noc2p family) and CmaCh14G019420 (fatty acid hydroxylase superfamily), were also selected as candidate genes (Supplementary Table 9).
Validation of candidate single nucleotide polymorphisms and inDels with Kompetitive Allele-Specific PCR markers
Phytohormones have been reported closely associated with plant organ size. Therefore, two phytohormone-related genes, CmaCh17G011340 (annotated as YUCCA) and CmaCh04G029660 (annotated as leucine-rich repeat protein kinase family protein, LRR-RLK), were selected from candidate SNPs genes. Further analysis indicated that the “missense_variant” SNPs occurred in the CzcO domain of CmaCh17G011340 and the PKc_like superfamily domain of CmaCh04G029660. KASP analysis was designed for the selected SNPs and inDels mentioned above. Finally, three SNPs and 2 inDels were selected to develop KASP marker and employed to screen two parents, 40 F2 individual plants and six cultivars showing extreme phenotypes to confirm their polymorphisms (Supplementary Table 10). The obtained results showed that four out of five markers distinguished the bulks, parents and six selected cultivars successfully (Figure 5), suggesting that these KASP markers were applicable for MAS of pumpkin fruit size breeding. These results further confirm that CmaCh04G029660, CmaCh17G011340 and CmaCh04G029620 are important candidate genes for fruit size control of C. maxima.
Phylogenetic analysis and expression pattern of YUCCA and LRR-RLK family genes in Cucurbita maxima
Because little is known about the relationship between most candidate genes and organ size, only two phytohormone related genes CmaCh17G011340 and CmaCh04G029660 were further analyzed. Thirty-nine putative YUCCA from C. maxima (14), Arabidopsis (11) and rice (14) and 746 putative LRR-RLK family proteins from C. maxima (296), Arabidopsis (224) and maize (226) were collected for phylogenetic analysis (Supplementary Figures 4 and 6). Consistent with previous reports, the obtained results indicate that plant YUCCAs (Schlaich, 2007) are divided into three groups and LRR-RLK (Hosseini et al., 2020) into 23 groups. According to the function of genes closely related in the phylogenetic trees, CmaCh17G011340 is considered a typical YUCCA family gene, whose function is to catalyze the rate-limiting step in the indole-3-acetic acid (IAA) synthesis (Schlaich, 2007). CmaCh04G029660 belongs to the subfamily VI-1 group of LRR-RLKs. LRR-RLKs are the largest family of transmembrane receptors in plants and have been reported to be associated with various processes, including those associated with the BR and ERECTA (ER) signaling pathways (He et al., 2020). The functional domains of the two family genes and locations of SNPs are shown in Supplementary Figures 5 and 7. The evaluated impact of SNPs on the biological function of certain proteins (Supplementary Table 8) indicates that the isoforms of CmaCh17G011340 in AG and CmaCh04G029660 in Hubbard have higher activities. According to these results, it can be deduced that CmaCh17G011340 positively regulates pumpkin fruit size, while CmaCh04G029660 negatively regulates pumpkin fruit size. To further investigate the functions of the two candidate genes, the expression patterns of cascade genes in IAA, BR and ER signaling pathways were investigated. As shown in Figures 6A–H, the expression level differences of some important downstream genes, such as AUX/IAA, ARF and SAUR of the IAA signaling pathway, BIN2, BNR1/2 and TCH4 of the BR signaling pathway and MKK4/5 and SPCH of the ER pathway, between the two types of pumpkin were found, some of which were further confirmed by qRT–PCR. In addition, the IAA levels at the fruit fast-growing stage were also higher in AG fruits than in Hubbard fruits (Figure 6K). These results indicate that the SNPs occurring within CmaCh17G011340 and CmaCh04G029660 may improve fruit enlargement by altering the IAA, BR and ER signaling pathways. Of note, the mRNA abundance of CmaCh04G029660 was higher and that of CmaCh17G011340 was lower in AG than in Hubbard (Figure 4C), indicating that the alterations in the IAA, BR and ER pathways were not due to changes in gene expression levels. Therefore, the SNP in the coding region of these two genes between the two pumpkins may be a key reason to alter protein function and further affect the biosynthesis of IAA and the downstream signaling pathways of BR and ER. As shown in Supplementary Table 5, we also noticed SNPs in the promoter regions of the two candidate genes. Whether these promoter SNPs are reasons for the expression difference of the two genes between the two pumpkins needs to be further studied.
Discussion
In this study, all 102 F2 plants were used for genome-wide resequencing. Bulked segregation analysis (BSA) is another choice that has been widely used for constructing genetic linkage map and identifying QTL due to its advantages of cost-effectiveness and relatively low time consumption (Tang et al., 2017). However, because the space required for giant pumpkin growth is vast, the population size of F2 and F2:3 was limited in our study. The skewed normal distribution (skewed toward the small fruit parent) of all tested traits further led to the deficiency of F2 lines for big-fruit mixed poll construction (Figure 1B). Thus, all available F2 plants were used for SNP calling and QTL detection in this study. Furthermore, with the application of next-generation DNA sequencing (NGS), an increasing number of reduced representation sequencing methods have been applied in recent molecular research, such as genotyping by sequencing (GBS) and restriction-site associated DNA sequencing (RAD-seq; Meger et al., 2019). In comparison with these methods, a whole-genome resequencing (WGRS)-based approach could identify more polymorphism SNPs randomly distributed throughout the genome, which is beneficial for decreasing the candidate regions (Zhao et al., 2020). Therefore, WGRS was performed for all F2 lines in this study. In addition, bin mapping has been proven to be an efficient method for generating high-density genetic maps for identifying QTLs in many plant species (Wang et al., 2019). Finally, with the application of the WGRS-based bin-map strategy, we identified more SNP and bin markers than previous works of C. maxima genetic linkage map construction (Zhang et al., 2015; Wang et al., 2020). Therefore, we suggest that the strategy employed in this study is an efficient approach for genetic linkage map construction and with a high density of SNP and bin markers. The constructed map should be a powerful tool for the fine mapping of QTLs and marker-assisted breeding of important traits of C. maxima.
Perhaps due to their economic value, substantial QTLs and candidate genes associated with fruit size of cucumber, melon and watermelon have been identified (Pan et al., 2020). Although fruit weight variation among pumpkins is more fascinating, only few researches have been performed to elucidate the genetic mechanism of fruit weight variation in Cucurbita crops (Pan et al., 2020). Kaźmińska et al. (2020) used recombinant inbred lines (RILs) to identify QTLs related to the fruit weight. In this study, the fruit weight of one parent was approximately 5-fold higher than that of another parent. Six QTLs for fruit size were detected, and the major-effect QTL explaining 32 and 41% of the phenotypic variation in the two experiments was located on chromosome 4 (p ≤ 0.01). The remaining fruit weight QTLs were located on chromosomes 2, 10, 14 and 17. In our study, we also detected fruit size-associated QTLs on chromosomes 4, 14 and 17. However, most genomic positions of these QTLs were different between the two experiments. The only overlapping fragment of fruit weight-associated QTLs between the two experiments was from 13609485 to 13764003 on chromosome 14. Twenty-two genes (CmaCh14G019320 – CmaCh14G019560) were identified in this region (Supplementary Table 11). Among these genes, we noted that CmaCh14G019450, encoding a WD40 repeat-like superfamily protein, also showed a “Deleterious” prediction in PROVEAN calculation. Yang et al. (2018) also reported that LITTLELEAF encodes a WD40 repeat protein related to organ size in cucumber. Thus CmaCh14G019450 should be another important candidate gene and worthy further investigation.
IAA is the most common auxin and plays various roles in plant development and response to abiotic and biotic stress. In higher plants, the tryptophan-dependent pathway is the predominant IAA biosynthesis pathway, among which TRYPTOPHAN AMINOTRANSFERASE OF ARABIDOPSIS (TAA) and flavin monooxygenase (YUCCA) are two key catalytic enzymes (Zhao, 2014). After biosynthesis, the IAA function is achieved through several processes, including IAA transport, metabolism, and signal transduction (Devoghalaere et al., 2012). These processes were coordinated by a series of regulators, such as gretchen hagen 3 (GH3) family protein (Staswick et al., 2002; converting active IAA to inactive IAA),auxin-resistant 1/like auxin-resistant 1 (AUX1/LAX1) and PINformed 1 (PIN1; transports auxin between extracellular and intracellular regions), small auxin-up RNA (SAUR), auxin response factors (ARFs) and transport inhibitor response1/auxin signaling F-BOX protein (TIR1/AFB; Staswick et al., 2002; Paponov et al., 2008; Wabnik et al., 2010; Zažímalová et al., 2010; Lavy and Estelle, 2016; Leyser, 2018). The size of various plant organs has been reported to be affected by the IAA level, including fruits, seeds and roots (Chhun et al., 2007; Figueiredo and Köhler, 2018; Bu et al., 2020). IAA significantly increases organ size by promoting cell division and enlargement (Ljung et al., 2001). According to Supplementary Figures 4 and 8, the expression of two homologs of CmaCh17G011340 was not detected in fruits of either variety, further indicating the important role of CmaCh17G011340 during fruit enlargement.
Receptor-like kinases represent the largest family of cell surface receptors in plants, among which LRR-RLKs are one of the largest RLK subgroups (Schlaich, 2007). LRR-RLKs characteristically contain an intracellular kinase domain, a transmembrane domain and an extracellular LRR, whose function is to transduce signal in and out of cells (He et al., 2020). According to different phylogenetic analysis methods, the LRR-RLK subfamily could be divided into different number of clusters (Liu et al., 2016, 2017; Sun et al., 2018). Sakamoto et al. (2012) proposed that the LRR-RLKII subfamily contains three well-supported clades, named SERK (somatic embryogenesis receptor kinase), NIK (NSP-interacting kinase) and LRRIIc (unknown function). However, based on the combined extracellular and intracellular sequence matrix, Hosseini et al. (2020) redefined LRR-RLKIIs into five subgroups named LRR-RLKII 1–5. In the last decade, several LRR-RLK-associated signaling pathways have been reported in Arabidopsis and other plants (Zhang et al., 2012; He et al., 2020). A receptor complex containing two LRR-receptor kinases, BRASSINOSTEROID INSENSITIVE1 (BRI1, belonging to subfamily Xb-1) and BRI1ASSOCIATED KINASE-1 (BAK1), triggers phosphorylation of their intracellular domains (Hohmann et al., 2017). The phosphorylated kinases recruit and activate downstream transcription factors of BR signaling such as ASSINAZOLE-RESISTANT 1 (BZR1) and BRI1-EMS-SUPPRESSOR 1 (BES1), which eventually leads to vast changes in a variety of biological processes mediated by BR signaling (Wang et al., 2002; Yin et al., 2002). ER, another LRR family gene (subfamily XIIIb), functions with ER-LIKE1 (ERL1) and ERL2 synergistically to mediate multiple growth processes, including organ size, inflorescence architecture, petiole length, ovule development, and stomatal patterning (Shpak, 2004, 2013; Torii, 1996; Pillitteri et al., 2007). ZmRLK7, an LRR-RLK of Zea mays, negatively regulates plant organ size by influencing the SERKs, ER cascade and BR cascade (He et al., 2020). In our experiment, CmaCh04G029660 belongs to subfamily VI-1; unfortunately, little research has been conducted in this group. According to our results, AG fruit has a less active allele of this LRR-RLK than Hubbard fruit, and the expression of abundant downstream genes of the BR and ER pathways was different between the two types of pumpkins, indicating that CmaCh04G029660 may also act as a negative regulator of pumpkin fruit by affecting BRs and ER signaling pathways. If this is true, the exact mechanism of regulation is worthy of further study. We cannot rule out other pathways of the regulation conducted by CmaCh04G029660. In addition, several other LRR genes, which have high homology with CmaCh04G029660, were also expressed in AG and Hubbard fruits (Supplementary Figures 6 and 9). The role of these LRRs in fruit enlargement needs to be further investigated.
In this study, a genetic map with an average physical interval of 154 kb per marker was constructed using GWRS and Bin marker technology. Thirteen fruit size-related QTLs were further identified on chromosomes 3, 4, 14, and 17 of C. maxima. Additional RNA-Seq data and impact analysis of SNP and inDel suggested several important candidate genes, such as CmaCh04G029620, CmaCh04G029660, CmaCh14G019420, CmaCh14G019450 and CmaCh17G011340. In addition, four KASP markers that were closely linked to fruit size were developed to confirm the SNP and inDel mutations and provide an MAS tool for C. maxima breeding. These data can be used to understand the molecular mechanisms underlying the formation of the world’s largest fruit and provide candidate genes for fruit size-related breeding of C. maxima. Furthermore, two SNPs were identified in the key domains of CmaCh17G011340 and CmaCh04G029660, and significant differences in the IAA level and RNA abundance of several genes in the IAA, BR and ER signaling pathways were found between large and small pumpkins, indicating that IAA, BR and ER may play important roles in the AG fruit enlargement. In addition, compared with Hubbard, AG has several other morphological and physiological characteristics facilitating its fast fruit growth, such as larger leaves, higher leaf stachyose level, higher net photosynthetic rate, and larger peduncle vascular cross area (Pan et al., 2021). Whether IAA, BR and ER play roles in these biological processes is worth further study. Functional verification by gene editing of two candidate genes is being carried out in our lab.
Data availability statement
The original contributions presented in the study are publicly available. This data can be found here: NCBI, PRJNA797700, and PRJNA797701.
Author contributions
MM designed the experiments and coordinated the project. LP, MW, YY, and CC performed the experiments, analyzed the data, and drew conclusions based on the results. MM wrote the manuscript. HD, ZZ, and BH helped to perform the analysis with the constructive discussions. All authors read and approved the final manuscript.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Footnotes
Funding
This research was supported by the National Key Research and Development Program (2019YFD1000300 and 2018YFD1000800) and the National Natural Science Foundation of China (32072579, 31872107, and 31672160).
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2022.942004/full#supplementary-material
References
- Abu-Zaitoon Y. M. (2014). Phylogenetic analysis of putative genes involved in the tryptophan-dependent pathway of auxin biosynthesis in rice. Appl. Biochem. 172 2480–2495. 10.1007/s12010-013-0710-4 [DOI] [PubMed] [Google Scholar]
- Achard P., Gusti A., Cheminant S., Alioua M., Dhondt S., Coppens F., et al. (2009). Gibberellin signaling controls cell proliferation rate in Arabidopsis. Curr. Biol. 19 1188–1193. 10.1016/j.cub.2009.05.059 [DOI] [PubMed] [Google Scholar]
- Broman K. W., Wu H., Sen S., Churchill G. A. (2003). R/qtl: QTL mapping in experimental crosses. Bioinformatics 19 889–890. 10.1093/bioinformatics/btg112 [DOI] [PubMed] [Google Scholar]
- Bu H. D., Yu W. Q., Yuan H., Yue P. T., Wei Y., Wang A. D. (2020). Endogenous auxin content contributes to larger size of apple fruit. Front. Plant Sci. 11:592540. 10.3389/fpls.2020.592540 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chakrabarti M., Zhang N., Sauvage C., Munos S., Blanca J., Canizares J., et al. (2013). A cytochrome P450 regulates a domestication trait in cultivated tomato. Proc. Natl. Acad. Sci. U.S.A. 110 17125–17130. 10.1073/pnas.1307313110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen G. X., Cao X., Ma Z. X., Tang Y., Zeng Y. J., Chen L. Q., et al. (2018). Overexpression of the nuclear protein gene AtDUF4 increases organ size in Arabidopsis thaliana and Brassica napus. J. Genet. Genomics 45 459–462. 10.1016/j.jgg.2018.05.009 [DOI] [PubMed] [Google Scholar]
- Chen Z. L., Wang B. B., Dong X. M., Liu H., Ren L. H., Chen J., et al. (2014). An ultra-high density bin-map for rapid QTL mapping for tassel and ear architecture in a large F2 maize population. BMC Genomics 15:433. 10.1186/1471-2164-15-433 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheniclet C., Rong W. Y., Causse M., Frangne N., Bolling L., Carde J. P., et al. (2005). Cell expansion and endoreduplication show a large genetic variability in pericarp and contribute strongly to tomato fruit growth. Plant Physiol. 139 1984–1994. 10.1104/pp.105.068767 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chhun T., Uno Y., Taketa S., Azuma T., Ichii M., Okamoto T., et al. (2007). Saturated humidity accelerates lateral root development in rice (Oryza sativa L.) seedlings by increasing phloem-based auxin transport. J. Exp. Bot. 58 1695–1704. 10.1093/jxb/erm026 [DOI] [PubMed] [Google Scholar]
- Choi Y. W., Sims G. E., Murphy S., Miller J. R., Chan A. P. (2012). Predicting the functional effect of amino acid substitutions and indels. PLoS One 7:e46688. 10.1371/journal.pone.0046688 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Davies P. J. (1987). Plant Hormones and their Role in Plant Growth and Development. Dordrecht: Springer. [Google Scholar]
- De León B. G. P., Zorrilla J. M. F., Rubio V., Dahiya P., Paz-Ares J., Leyva A. (2004). Interallelic complementation at the Arabidopsis CRE1 locus uncovers independent pathways for the proliferation of vascular initials and canonical cytokinin signalling. Plant J. 38 70–79. 10.1111/j.1365-313x.2004.02023.x [DOI] [PubMed] [Google Scholar]
- Devoghalaere F., Doucen T., Guitton B., Keeling J., Payne W., Ling T. J., et al. (2012). A genomics approach to understanding the role of auxin in apple (Malus x domestica) fruit size control. BMC Plant Biol. 12:7. 10.1186/1471-2229-12-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ding Q., Cui B., Li J. J., Li H. Y., Zhang Y. H., Lv X. H., et al. (2018). Ectopic expression of a Brassica rapa AINTEGUMENTA gene (BrANT-1) increases organ size and stomatal density in Arabidopsis. Sci. Rep. 8:10528. 10.1038/s41598-018-28606-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Figueiredo D. D., Köhler C. (2018). Auxin: a molecular trigger of seed development. Gen. Dev. 32 479–490. 10.1101/gad.312546.118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guinness World Records (2021). Heaviest Pumpkin. Available online at: https://www.guinnessworldrecords.com/world-records/heaviest-pumpkin (accessed September 26, 2021). [Google Scholar]
- He C. M., Wang J., Dong R., Guan H. Y., Liu T. S., Liu C. X. (2020). Overexpression of an antisense RNA of maize receptor-like kinase gene ZmRLK7 enlarges the organ and seed size of transgenic Arabidopsis Plants. Front. Plant Sci. 11:579120. 10.3389/fpls.2020.579120 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hohmann U., Lau K., Hothorn M. (2017). The structural basis of ligand perception and signal activation by receptor kinases. Annu. Rev. Plant Biol. 68 109–137. 10.1146/annurev-arplant-042916-040957 [DOI] [PubMed] [Google Scholar]
- Hosseini S., Schmidt E. D. L., Bakker F. T. (2020). Leucine-rich repeat receptor-like kinase II phylogenetics reveals five main clades throughout the plant kingdom. Plant J. 103 547–560. 10.1111/tpj.14749 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hu D. L., Richards P., Alexeev A. (2011). The growth of giant pumpkins: how extreme weight influences shape. Int. J. Nonlin. Mech. 46 637–647. 10.1016/j.ijnonlinmec.2010.12.013 [DOI] [Google Scholar]
- Jiang Y. H., Bao L., Jeong S. Y., Kim S. K., Xu C. G., Li X. H., et al. (2012). XIAO is involved in the control of organ size by contributing to the regulation of signaling and homeostasis of brassinosteroids and cell cycling in rice. Plant J. 70 398–408. 10.1111/j.1365-313x.2011.04877.x [DOI] [PubMed] [Google Scholar]
- Kaźmińska K., Hallmann E., Korzeniewska A., Niemirowicz-Szczytt K., Bartoszewski G. (2020). Identifification of Fruit-Associated QTLs in Winter Squash (Cucurbita maxima Duchesne) Using Recombinant Inbred Lines. Genes 11:419. 10.3390/genes11040419 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kaźmińska K., Hallmann E., Rusaczonek A., Korzeniewska A., Sobczak M., Filipczak J., et al. (2018). Genetic mapping of ovary color and quantitative trait loci for carotenoid content in the fruit of Cucurbita maxima Duchesne. Mol. Breed. 38:114. 10.1007/s11032-018-0869-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim D., Paggi J. M., Park C., Bennett C., Salzberg S. L. (2019). Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 37 907–915. 10.1038/s41587-019-0201-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kosambi D. D. (1944). The estimation of map distances from recombination values. Ann. Eugen. 12 172–175. [Google Scholar]
- Lavy M., Estelle M. (2016). Mechanisms of auxin signaling. Development 143 3226–3229. 10.1242/dev.131870 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leyser O. (2018). Auxin signaling. Plant Physiol. 176 465–479. 10.1104/pp.17.00765 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H., Durbin R. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25 1754–1760. 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H., Liu Q., Zhang Q. L., Qin E. J., Jin C., Wang Y., et al. (2017). Curd development associated gene (CDAG1) in cauliflower (Brassica oleracea L. var. botrytis) could result in enlarged organ size and increased biomass. Plant Sci. 254 82–94. 10.1016/j.plantsci.2016.10.009 [DOI] [PubMed] [Google Scholar]
- Li N., Liu Z. P., Wang Z. B., Ru L. C., Gonzalez N., Baekelandt A., et al. (2018). STERILE APETALA modulates the stability of a repressor protein complex to control organ size in Arabidopsis thaliana. PLoS Genet. 14:e1007218. 10.1371/journal.pgen.1007218 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li W. Y., Yang Z. X., Yao J. Y., Li J. S., Song W. B., Yang X. H. (2018). Cellulose synthase-like D1 controls organ size in maize. BMC Plant Biol. 18:239. 10.1186/s12870-018-1453-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li X., Liu W., Zhuang L. L., Zhu Y., Wang F., Chen T., et al. (2019). BIGGER ORGANS and ELEPHANT EAR-LIKE LEAF1 control organ size and floral organ internal asymmetry in pea. J. Exp. Bot. 70 179–191. 10.1093/jxb/ery352 [DOI] [PubMed] [Google Scholar]
- Liu P. L., Du L., Huang Y., Gao S. M., Yu M. (2017). Origin and diversification of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in plants. BMC Evol. Biol. 17:47–62. 10.1186/s12862-017-0891-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu P. L., Xie L. L., Li P. W., Mao J. F., Liu H., Gao S. M., et al. (2016). Duplication and aivergence of leucine-rich repeat receptor-like protein kinase (LRR-RLK) genes in basal angiosperm Amborella trichopoda. Front. Plant Sci. 7:1952. 10.3389/fpls.2016.01952 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ljung K., Bhalerao R. P., Sandberg G. (2001). Sites and homeostatic control of auxin biosynthesis in Arabidopsis during vegetative growth. Plant J. 28 465–474. 10.1046/j.1365-313x.2001.01173.x [DOI] [PubMed] [Google Scholar]
- Mandava N. B. (1988). Plant growth-promoting brassinosteroids. Annu. Rev. Plant Phys. 39 23–52. 10.1146/annurev.pp.39.060188.000323 [DOI] [Google Scholar]
- McKenna A., Hanna M., Banks E., Sivachenko A., Cibulskis K., Kernytsky A., et al. (2010). The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20 1297–1303. 10.1101/gr.107524.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meger J., Ulaszewski B., Vendramin G. G., Burczyk J. (2019). Using reduced representation libraries sequencing methods to identify cpDNA polymorphisms in European beech (Fagus sylvatica L). Tree Genet. Genomes 15:7. 10.1007/s11295-018-1313-6 [DOI] [Google Scholar]
- Mok D. W. S., Mok M. C. (1994). Cytokinins. New York, NY: CRC Press. [Google Scholar]
- Mu Q., Huang Z. J., Chakrabarti M., Illa-Berenguer E., Liu X. X., Wang Y. P., et al. (2017). Fruit weight is controlled by Cell Size Regulator encoding a novel protein that is expressed in maturing tomato fruits. PLoS Genet. 13:e1006930. 10.1371/journal.pgen.1006930 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Muńos S., Ranc N., Botton E., Berard A., Rolland S., Duffe P., et al. (2011). Increase in tomato locule number is controlled by two single-nucleotide polymorphisms located near WUSCHEL. Plant Physiol. 156 2244–2254. 10.1104/pp.111.173997 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nakata Y., Taniguchi G., Takazaki S. (2012). Comparative analysis of cells and proteins of pumpkin plants for the control of fruit size. J Biosci Bioeng. 114 334–341. 10.1016/j.jbiosc.2012.04.005 [DOI] [PubMed] [Google Scholar]
- Nobusawa T., Kamei M., Ueda H., Matsushima N., Yamatani H., Kusaba M. (2021). Highly pleiotropic functions of CYP78As and AMP1 are regulated in non-cell-autonomous/organ-specific manners. Plant Physiol. 186 767–781. 10.1093/plphys/kiab067 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pan L., Chen C., Wang M., Shen Y. F., Yang Y. T., Wang A. H., et al. (2021). Comparative analysis of assimilate synthesis, translocation and partitioning between two Cucurbita maxima cultivars “Atlantic giant” and “Hubbard”. Sci. Hortic. 289:11041. 10.1016/j.scienta.2021.110411 [DOI] [Google Scholar]
- Pan Y., Wang Y., McGregor C., Liu S., Luan F., Gao M., et al. (2020). Genetic architecture of fruit size and shape variation in cucurbits: a comparative perspective. Theor. Appl. Genet. 133 1–21. 10.1007/s00122-019-03481-3 [DOI] [PubMed] [Google Scholar]
- Paponov I. A., Paponov M., Teale W., Menges M., Chakrabortee S., Murray J. A., et al. (2008). Comprehensive transcriptome analysis of auxin responses in Arabidopsis. Mol. Plant 1 321–337. 10.1093/mp/ssm021 [DOI] [PubMed] [Google Scholar]
- Pertea M., Kim D., Pertea G., Leek J. T., Salzberg S. L. (2016). Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie, and Ballgown. Nat. Protoc. 11 1650–1667. 10.1038/nprot.2016.095 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pillitteri L. J., Bemis S. M., Shpak E. D., Torii K. U. (2007). Haploinsufficiency after successive loss of signaling reveals a role for ERECTA-family genes in Arabidopsis ovule development. Development 134 3099–3109. 10.1242/dev.004788 [DOI] [PubMed] [Google Scholar]
- Qi X. L., Liu C. L., Song L. L., Li Y. H., Li M. (2017). PaCYP78A9, a Cytochrome P450, Regulates Fruit Size in Sweet Cherry (Prunus avium L.). Front. Plant Sci. 8 2076. 10.3389/fpls.2017.02076 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qi X. P., Ogden L. E., Bostan H., Sargent J. D., Ward J., Gilbert J., et al. (2021). High-density linkage map construction and QTL identification in a diploid blueberry mapping population. Front. Plant Sci. 12:692628. 10.3389/fpls.2021.692628 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rastas P. (2017). Lep-MAP3: robust linkage mapping even for low-coverage whole genome sequencing data. Bioinformatics 33 3726–3732. 10.1093/bioinformatics/btx494 [DOI] [PubMed] [Google Scholar]
- Sakamoto T., Deguchi M., Brustolini O. J. B., Santos A. A., Silva F. F., Fontes E. P. B. (2012). The tomato RLK superfamily, phylogeny and functional predictions about the role of the LRR-RLKII subfamily in antiviral defense. BMC Plant Biol. 12:229. 10.1186/1471-2229-12-229 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sanjur O. I., Piperno D. R., Andres T. C. (2002). Phylogenetic relationships among domesticated and wild species of Cucurbita (Cucurbitaceae) inferred from mitochondrial gene: implications for crop plant evolution and areas of origin. Proc. Natl. Acad. Sci. U.S.A. 99 535–540. 10.1073/pnas.012577299 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Savage J. A., Haines D. F., Holbrook N. M. (2015). The making of giant pumpkins: how selective breeding changed the phloem of Cucurbita maxima from source to sink. Plant Cell Environ. 38 1543–1554. 10.1111/pce.12502 [DOI] [PubMed] [Google Scholar]
- Schlaich N. L. (2007). Flavin-containing monooxygenases in plants: looking beyond detox. Trends Plant Sci. 12 412–418. 10.1016/j.tplants.2007.08.009 [DOI] [PubMed] [Google Scholar]
- Shpak E. D. (2004). Synergistic interaction of three ERECTA-family receptor-like kinases controls Arabidopsis organ growth and flower development by promoting cell proliferation. Development 131 1491–1501. 10.1242/dev.01028 [DOI] [PubMed] [Google Scholar]
- Shpak E. D. (2013). Diverse roles of ERECTA family genes in plant development. J. Integr. Plant Biol. 55 1238–1250. 10.1111/jipb.12108 [DOI] [PubMed] [Google Scholar]
- Song W., Wang B. Q., Li X. H., Wei J. F., Chen L., Zhang D. M., et al. (2015). Identification of Immune Related LRR-containing genes in maize (Zea mays L.) by genome-wide sequence analysis. Int. J. Genomics 2015:231358. 10.1155/2015/231358 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Staswick P. E., Tiryaki I., Rowe M. (2002). Jasmonate Response Locus JAR1 and several related Arabidopsis Genes encode enzymes of the firefly luciferase superfamily that show activity on Jasmonic, Salicylic, and Indole-3-Acetic Acids in an Assay for Adenylation. Plant Cell 14 1405–1415. 10.1105/tpc.000885 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun H., Wu S., Zhang G., Jiao C., Guo S., Ren Y., et al. (2017). Karyotype stability and unbiased fractionation in the Paleo-Allotetraploid Cucurbita Genomes. Mol. Plant 10 1293–1306. 10.1016/j.molp.2017.09.003 [DOI] [PubMed] [Google Scholar]
- Sun R. B., Wang S. H., Ma D., Liu C. L. (2018). Genome-wide analysis of LRR-RLK gene family in four Gossypium Species and expression analysis during cotton development and stress responses. Genes 9:592. 10.3390/genes9120592 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tang Q. W., Tian M. Y., An G. H., Zhang W. Y., Chen J. J., Yan C. H. (2017). Rapid identification of the purple stem (Ps) gene of Chinese kale (Brassica oleracea var. alboglabra) in a segregation distortion population by bulked segregant analysis and RNA sequencing. Mol. Breed. 37:153. 10.1007/s11032-017-0752-3 [DOI] [Google Scholar]
- Tatum T. C., Nunez L., Kushad M. M., Rayburn A. L. (2006). Genome size variation in pumpkin (Cucurbita sp.). Ann. Appl. Biol. 149 145–151. 10.1111/j.1744-7348.2006.00079.x [DOI] [Google Scholar]
- Torii K. U. (1996). The Arabidopsis ERECTA Gene Encodes a putative receptor protein kinase with extracellular Leucine-Rich Repeats. Plant Cell 8 735–746. 10.2307/3870348 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trapnell C., Salzberg S. L. (2009). How to map billions of short reads onto genomes. Nat. Biotechnol. 27 455–457. 10.1038/nbt0509-455 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Der Knaap E., Chakrabarti M., Chu Y. H., Clevenger J. P., Illa-Berenguer E., Huang Z., et al. (2014). What lies beyond the eye: the molecular mechanisms regulating tomato fruit weight and shape. Front. Plant Sci. 5:227. 10.3389/fpls.2014.00227 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wabnik K., Kleine-Vehn J., Balla J., Sauer M., Naramoto S., Reinöhl V., et al. (2010). Emergence of tissue polarization from synergy of intracellular and extracellular auxin signaling. Mol. Syst. Biol. 6:447. 10.1038/msb.2010.103 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Q. L., Xue X. J., Li Y. L., Dong Y. B., Zhang L., Zhou Q., et al. (2016). A maize ADP-ribosylation factor ZmArf2 increases organ and seed size by promoting cell expansion in Arabidopsis. Physiol. Plant. 156 97–107. 10.1111/ppl.12359 [DOI] [PubMed] [Google Scholar]
- Wang T. Y., Sui Z. P., Liu X. Y., Li Y. Y., Li H. J., Xing J. W., et al. (2016). Ectopic expression of a maize hybrid up-regulated gene, ErbB-3 binding Protein 1 (ZmEBP1), increases organ size by promoting cell proliferation in Arabidopsis. Plant Sci. 243 23–34. 10.1016/j.plantsci.2015.11.002 [DOI] [PubMed] [Google Scholar]
- Wang Y. L., Wang C. J., Han H. Y., Luo Y. S., Wang Z. C., Yan C. D., et al. (2020). Construction of a high-density genetic map and analysis of seed-related traits using specific length amplified fragment sequencing for Cucurbita maxima. Front. Plant Sci. 10:1782. 10.3389/fpls.2019.01782 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Z. L., Wang J., Peng J. X., Du X. F., Jiang M. S., Li Y. F., et al. (2019). QTL mapping for 11 agronomic traits based on a genome-wide Bin-map in a large F2 population of foxtail millet (Setaria italica (L.) P. Beauv). Mol. Breed. 39:18. 10.1007/s11032-019-0930-6 [DOI] [Google Scholar]
- Wang Z. Y., Nakano T., Gendron J., He J., Chen M., Vafeados D., et al. (2002). Nuclear-localized BZR1 mediates brassinosteroid-induced growth and feedback suppression of brassinosteroid biosynthesis. Dev. Cell 2 505–513. 10.1016/s1534-5807(02)00153-3 [DOI] [PubMed] [Google Scholar]
- Yang L. M., Liu H. Q., Zhao J. Y., Pan Y. P., Cheng S. Y., Lietzow C. D., et al. (2018). LITTLELEAF (LL) encodes a WD40 repeat domain-containing protein associated with organ size variation in cucumber. Plant J. 95 834–847. 10.1111/tpj.13991 [DOI] [PubMed] [Google Scholar]
- Yang W. B., Gao M. J., Yin X., Liu J. Y., Xu Y. G., Zeng L. J., et al. (2013). Control of Rice Embryo Development, Shoot Apical Meristem Maintenance, and Grain Yield by a Novel Cytochrome P450. Mol. Plant 6 1945–1960. 10.1093/mp/sst107 [DOI] [PubMed] [Google Scholar]
- Yin P. C., Ma Q. X., Wang H., Feng D., Wang X. B., Pei Y. X., et al. (2020). SMALL LEAF AND BUSHY1 controls organ size and lateral branching by modulating the stability of BIG SEED1 in Medicago truncatula. New Phytol. 226 1399–1412. 10.1111/nph.16449 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yin Y., Wang Z. Y., Mora-Garcia S., Li J., Yoshida S., Asami T., et al. (2002). BES1 accumulates in the nucleus in response to brassinosteroids to regulate gene expression and promote stem elongation. Cell 109 181–191. 10.1016/s0092-8674(02)00721-3 [DOI] [PubMed] [Google Scholar]
- Zažímalová E., Murphy A. S., Yang H., Hoyerová K., Hošek P. (2010). Auxin transporters—why so many? Cold Spring Harb. Perspect. Biol. 2:a001552. 10.1101/cshperspect.a001552 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang G., Ren Y., Sun H., Guo S., Zhang F., Zhang J., et al. (2015). A high-density genetic map for anchoring genome sequences and identifying QTLs associated with dwarf vine in pumpkin (Cucurbita maxima Duch.). BMC Genomics 16:1101. 10.1186/s12864-015-2312-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang X., Facette M., Humphries A. J., Shen Z., Park Y., Sutimantanapi D., et al. (2012). Identification of PAN2 by quantitative proteomics as a leucine-rich repeat-receptor-like kinase acting upstream of PAN1 to polarize cell division in Maize. Plant Cell 24 4577–4589. 10.1105/tpc.112.104125 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao B. T., Dai A. H., Wei H. C., Yang S. X., Wang B. S., Jiang N., et al. (2016). Arabidopsis KLU homologue GmCYP78A72 regulates seed size in soybean. Plant Mol. Biol. 90 33–47. 10.1007/s11103-015-0392-0 [DOI] [PubMed] [Google Scholar]
- Zhao Q. L., Li D. K., Huang X. W., Ren B. N., Yue L. M., Dua B. H., et al. (2020). Identifying gene-environment interactions on the efficacy of folic acid therapy for hyperhomocysteinemia based on prediction model. Nutr. Res. 77 54–61. 10.1016/j.nutres.2020.03.001 [DOI] [PubMed] [Google Scholar]
- Zhao Y. (2014). Auxin biosynthesis. Arabidopsis Book 12:e0173. 10.1199/tab.0173 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The original contributions presented in the study are publicly available. This data can be found here: NCBI, PRJNA797700, and PRJNA797701.