Abstract
Genetic improvement of seed yield per plant (SY) is one of the major objectives in Brassica napus breeding programme. SY, being a complex quantitative trait is directly and indirectly influenced by yield-component traits such as siliqua length (SL), number of seeds per siliqua (NSS), and thousand seed weight (TSW). Therefore, concurrent improvement in SL, NSS and TSW can lead to higher SY in B. napus. This study was conducted to identify significant SNPs and putative candidate genes governing SY and its component traits (SL, NSS, TSW). All these traits were evaluated in a diverse set of 200 genotypes representing diversity from wide geographical locations. Of these, a set of 125 genotypes were chosen based on pedigree diversity and multi-location trait variation for genotyping by sequencing (GBS). Best linear unbiased predictors (BLUPs) of all the traits were used for genome-wide association study (GWAS) with 85,126 SNPs obtained from GBS. A total of 16, 18, 27 and 18 SNPs were found to be significantly associated for SL, NSS, TSW and SY respectively. Based on linkage disequilibrium decay analysis, 150 kb genomic region flanking the SNP was used for the identification of underlying candidate genes for each test trait. Important candidate genes involved in phytohormone signaling (WAT1, OSR1, ARR8, CKX1, REM7, REM9, BG1) and seed storage proteins (Cruciferin) were found to have significant influence on seed weight and yield. Genes involved in sexual reproduction and fertilization (PERK7, PERK13, PRK3, GATA15, NFD6) were found to determine the number of seeds per siliqua. Several genes found in this study namely ATS3A, CKX1, SPL2, SPL6, SPL9, WAT1 showed pleiotropic effect with yield component traits. Significant SNPs and putative candidate genes identified for SL, NSS, TSW and SY could be used in marker-assisted breeding for improvement of crop yield in B. napus. Genotypes identified with high SL, NSS, TSW and SY could serve as donors in crop improvement programs in B. napus.
Supplementary Information
The online version contains supplementary material available at 10.1007/s12298-021-01060-9.
Keywords: Brassica napus, GWAS, SNPs, Siliqua length, Number of seeds per siliqua, Thousand seed weight, Seed yield
Introduction
Rapeseed (Brassica napus subsp. napus) is one of the most important oilseed crops. It is the third largest source of vegetable oil accounting for approximately 15% of the total vegetable oil used for human consumption (Carruthers et al. 2017; Kaur et al. 2020). It is an allotetraploid oilseed plant species (AACC, 2n = 38), originated through the hybridization of two diploid species namely, B. rapa (AA, 2n = 20) and B. oleracea (CC, 2n = 18) about 7,500 years ago (Nagaharu 1935). It is being grown mainly under sub-humid areas with irrigation. In India, it is largely cultivated from October to April (winter season) in north-western parts comprising states of Punjab, Himachal Pradesh, Haryana and Delhi under assured irrigated conditions (Priyamedha et al. 2015).
Breeding for genetic improvement of seed yield in B. napus is an important objective to increase productivity of rapeseed. Seed yield is a complex quantitative trait governed by various components, and highly influenced by genotype-environment interaction (Yang et al. 2016; Zhu et al. 2020). Siliqua number, number of seeds per siliqua and seed weight have been well reported as important determinants of seed yield (Yang et al. 2012). Seeds per siliqua and seed weight have further been found to be significantly correlated with siliqua length (Chay and Thurling 1989; Lebowitz 1989; Diepenbrock 2000) that contributed towards higher productivity in B. napus. X-ray studies suggested failure of double fertilization as the main factor leading to fewer seeds per siliqua (Pechan and Morgan 1985). Therefore, concurrent improvement in siliqua length, number of seeds per siliqua and seed weight can lead to higher seed yield in B. napus. Hence, there is a need for marker-assisted approach to accelerate commercial breeding program for enhanced crop yield.
In B. napus, number of QTLs for these traits have been identified using bi-parental mapping population and genome wide association studies (GWAS). More than 100 QTLs were reported for number of seeds per siliqua (NSS), out of which, three major QTLs located on chromosomes A6, A7 and C9 have been fine mapped (Shi et al. 2015; Yang et al. 2016; Zhu et al. 2020). Based on trait-by-trait meta-analysis, Zhang et al. (2011) reported 26 QTLs for siliqua-traits, out of which, five for siliqua length, two for seeds per siliqua and three for seed weight were consensus QTL. Liu et al. (2015a) reported co-located genes for seed weight and siliqua length in a major QTL on chromosome A9 of B. napus and increase in these traits due to a 165-bp deletion in the auxin-response factor 18 (ARF18).
Li et al. (2019) reported significant differential expression of genes related to pod development, seed development, cell division, nutrient reservoir and ribosomal proteins in the pod wall and seeds of the large-seed pool and cloned two pleiotropic major QTLs which indirectly influenced seed weight via their effects on siliqua length. Recently, one QTL (cqSW.A03-2) explaining 8.46–13.7% of the phenotypic variation was reported for seed weight (Wang et al. 2020). Zhu et al. (2020) validated and characterized seed number per siliqua QTL qSN.A7 and demonstrated that the difference in single nucleotide polymorphisms (SNPs) between the two homologous near-isogenic lines was determined by the embryonic genotypic effect. However, still more efforts are required to unfold genetic variation for these traits particularly in the diverse population, identify significant QTLs and associated markers that can be used in marker-assisted breeding programme. With availability of whole genome sequence of important crop plants, next-generation sequencing technologies based high-throughput genotyping platforms, GWAS has emerged as an important approach to identify putative QTLs for complex quantitative traits in diverse set of populations. In this study, a collection of geographically diverse germplasm, which was maintained through sowing and continuous selfing in alternate years, was evaluated in multi-environment across locations and years to identify putative genetic loci influencing seed yield and its associated traits using GWAS.
Materials and methods
Plant material
A diverse set of 200 genotypes comprised of 113 lines from different countries namely, Australia (26), Canada (7), China (37), France (23), Poland (2), South Korea (1), Sweden (6), United States (11), India (85 advanced breeding lines and two commercial cultivars developed at Punjab Agricultural University (PAU), Ludhiana) (Table 1, Table S1, Fig. 1). The stock is being maintained through continuous selfing. The germplasm set was evaluated for test traits from October to April during 2017–18 (Y1) and 2018–19 (Y2) at two locations in India viz. Punjab Agricultural University (PAU, Latitude: 30.90° N, Longitude: 75.81° E, Altitude: 247 m) marked as LDH_Y1 and LDH_Y2, respectively and at Regional Research Station, PAU, Bathinda (Latitude: 30.21° N, Longitude: 74.95° E, Altitude: 210 m) marked as BTI_Y1 and BTI_Y2, respectively. These two locations represent the different agroclimatic zones of Punjab state in India- Ludhiana located in the central plain zone and Bathinda in western zone. The soil is sandy loam to clayey with normal reaction (pH from 7.8 to 8.5). Bathinda (BTI) receives relatively low average annual rainfall and higher maximum and minimum temperatures than Ludhiana (LDH) (Kaur and Hundal 2008; Kaur and Kaur 2015). Variation of minimum, maximum temperature and rainfall pattern between Ludhiana and Bathinda from October to April in year 2018–19 was presented in Fig. S1.
Table 1.
Number of genotypes collected from different countries for developing a diverse set for GWAS
| S. no. | Country | No. of genotypes |
|---|---|---|
| 1 | Australia | 26 |
| 2 | Canada | 7 |
| 3 | China | 37 |
| 4 | France | 23 |
| 5 | Poland | 2 |
| 6 | South Korea | 1 |
| 7 | Sweden | 6 |
| 8 | United states | 11 |
| 9 | India | 87 |
| Total | 200 |
Fig. 1.
Geographical distribution of germplasm stock of 200 genotypes
Phenotyping of germplasm stock for test traits
In all the four environments (LDH_Y1, LDH_Y2, BTI_Y1 and BTI_Y2), the field experiments were conducted. In each experiment, each genotype of 200 germplasm stock was sown in paired rows of two meters row length at a row to row and plant to plant spacing of 45 cm and 10 cm, respectively in alpha lattice design with two replications (Anonymous 2020). At maturity, the crop was harvested manually. Five siliquae were taken from the middle of the main shoot of five plants of each genotype and siliqua length (SL, cm) was measured in centimeteres and averaged. Number of seeds per siliqua (NSS) was counted as an average number of seeds from ten siliqua borne on the middle of the main shoot of five plants. Thousand seed weight (TSW, g) was measured as weight in grams of thousand fully developed seeds from the main shoot. The seed weight values for five plants were averaged. Seed yield was recorded on a plot basis. Seed yield per plant (SY, g) was calculated by dividing total plot yield by the number of plants in the plot. TSW and SY were recorded after proper drying of seeds (Sandhu et al. 2019).
Statistical analysis of phenotypic data
Analysis of variance (ANOVA), coefficient of variance (CV) and BLUPs (best linear and unbiased predictors) were computed using META-R version 6.0 statistical software (Alvarado et al. 2020). BLUPs for each genotype were calculated for each environment viz. LDH_Y1, LDH_Y2, BTI_Y1 and BTI_Y2 as well as pooled over years for each location (E1: Ludhiana, E2: Bathinda) and used for GWAS analysis. Following linear model for analysing individual environments was used:
where Yijk is the trait of interest, μ is the overall mean effect, Repi is the effect of the ith replicate, Blockj(Repi) is the effect of the jth incomplete block within the ith replicate, Genk is the effect of the kth genotype and εijk is the effect of the error associated with the ith replication, jth incomplete block, and kth genotype, which is assumed to be iid normal with mean zero and variance .
For, a combined analysis across years, following linear model was used:
where, the new terms and are the effects of the ith environment and the environment × genotype (G × E) interaction, respectively. In both models, all effects, except the overall mean, are declared to be random and iid normal with mean zero and effect-specific variances. The random assumption for the genotype effects allowed to calculate BLUPs.
Genotyping of GWAS panel
DNA extraction, GBS and SNP calling
A representative diverse panel of 125 lines out of stock of 200 genotypes was selected based on geographic origin, pedigree and variation in recorded values of test traits. Genomic DNA was isolated from young leaf tissue of 125 genotypes using CTAB (cetyl trimethyl ammonium bromide) method (Doyle et al. 1990). Agarose gel electrophoresis (0.8%) and NanoDrop 8000 spectrophotometer (Thermo Scientific Waltham, Massachusetts, United States) were used to examine the quantity and quality of DNA. Genotyping by sequencing (GBS) was outsourced from “Nucleome Informatics Pvt Ltd, India” and pair-end sequencing was performed on Illumina HiSeq™ 2500 platform. BWA software was used for alignment of quality clean sequencing data of each accession with B. napus ‘ZS11’ v2.0 reference genome {chromosome assembly was downloaded from NCBI (http://www.ncbi.nlm.nih.gov/genome/term=txid3708orgn)} to build the consensus genomic sequence of each genotype (Li and Durbin 2009). SAMtools were used for handling alignment files and variants were called based on the discrepancies between the consensus sequence and the reference genome using BCFtools (Li et al. 2009). SNPs having MAF < 0.05, missing data > 10%, and multiple alleles were filtered out using VCFtools (Danecek et al. 2011). Further, SNPs having up to 50% heterozygosity were only used for the GWAS analysis.
Genome-wide association analyses
A total of 85,126 filtered SNPs as genotypic data and BLUPs datasets as phenotypic data of test traits were used for GWAS using FarmCPU (Fixed and Random Model Circulating Unification) statistical method implemented through GAPIT version 3 (Genomic Association and Prediction Integrated Tool) (Lipka et al. 2012; Liu et al. 2016). FarmCPU is a multi-locus model approach and addresses the confounding problems of mixed linear models (MLM) by using both fixed effect model and the random effect model iteratively. R version 3.6.3 was implemented in RStudio software (R Core Team 2018; RStudio Team 2020). Principal components (PCs) and kinship computed from genotypic SNP data were incorporated in the FarmCPU model. Uniform threshold (−log10(p) = 3.74) was set to control the genome-wide type 1 error rate and to identify significant SNPs associated with test traits in all environment datasets (Duggal et al. 2008; Yang et al. 2014).
Identification of candidate genes
Linkage disequilibrium (LD) analysis was performed across the genome using PopLDdecay version 3.41 (Zhang et al. 2019) and a cut off of r2 = 0.1 was used for determining LD decay. DNA sequence from 150 kb genomic region on either side of significant SNPs was considered as confidence interval for comparison with previous studies. Information about genes present in the confidence interval of SNPs was obtained from NCBI (https://www.ncbi.nlm.nih.gov/genome/browse/#!/proteins/203/335272%7CBrassica%20napus/) and functions of the predicted candidate genes were reviewed to establish their importance for improving test traits in B. napus. The list of softwares/tools used in this study was given in Table S2.
Results
Variation for test traits in germplasm stock of B. napus in multiple environments
(a) Siliqua length (SL): Genotypic variance and genotype-by environment interaction variance was significant (p < 0.001) for SL in E1, E2 and across all environments (Table 2). Comparative variation of SL between two locations (Ludhiana and Bathinda) was presented in Fig. 2a. Top ten genotypes with highest SL were given in Table 3. Maximum SL was 8.26 cm for genotype “BLN-33–51” from Australia origin.
(b) Number of seeds per siliqua (NSS): Genotypic variance and genotype-by environment interaction variance was significant (p < 0.001) for NSS in E1, E2 and across all environments (Table 2). Comparative variation of NSS between two locations (Ludhiana and Bathinda) was presented in Fig. 2b. Top ten genotypes with highest NSS were given in Table 3. Maximum NSS was 27 for genotype “PN-78–7-2” from advanced breeding lines.
(c) Thousand seed weight (TSW): Genotypic variance and genotype-by environment interaction variance were significant (p < 0.001) for TSW in E1, E2 and across all environments (Table 2). Comparative variation of TSW between two locations (Ludhiana and Bathinda) was presented in Fig. 2c. Top ten genotypes with highest TSW were given in Table 3. Maximum TSW was 4.61 g for genotype “CHARLTON” from Australia origin.
(d) Seed yield per plant (SY): Genotypic variance and genotype-by environment interaction variance were significant (p < 0.001) for TSW in E1, E2 and across all environments (Table 2). Comparative variation of SY between two locations (Ludhiana and Bathinda) was presented in Fig. 2d. Top ten genotypes with the highest SY were given in Table 3. Maximum SY was 15.52 g for genotype “BOOMER” from Canada origin.
Table 2.
Analysis of variance (ANOVA) of seed yield and component traits in multi-environments
| Trait† | Env§ | Mean | LSD (5%) | CV | ||
|---|---|---|---|---|---|---|
| SL | E1 | 0.20*** | 0.07*** | 6.76 | 0.48 | 6.39 |
| E2 | 0.21*** | 0.09*** | 6.93 | 0.48 | 5.12 | |
| Across | 0.21*** | 0.07*** | 6.85 | 0.36 | 5.8 | |
| NSS | E1 | 4.03*** | 5.46*** | 22.28 | 2.84 | 11.14 |
| E2 | 4.89*** | 6.7*** | 21.87 | 3.05 | 10.56 | |
| Across | 4.89*** | 5.64*** | 22.08 | 2.4 | 10.9 | |
| TSW | E1 | 0.15*** | 0.02*** | 3.44 | 0.23 | 3.96 |
| E2 | 0.28*** | 0.12*** | 3.43 | 0.47 | 6.21 | |
| Across | 0.11*** | 0.17*** | 3.43 | 0.36 | 5.21 | |
| SY | E1 | 5.97*** | 0.09 | 12.58 | 1.12 | 8.23 |
| E2 | 3.73*** | 1.12*** | 11.71 | 1.64 | 8.63 | |
| Across | 3.75*** | 1.72*** | 12.14 | 1.4 | 8.49 |
= Genotype variance, = G X E variance, LSD = Least significant difference, CV = Coefficient of variance
*** = Significance at p < 0.001
†SL: Siliqua length (cm), NSS: Number of seeds per siliqua, TSW: Thousand-seed weight (g), SY: Seed yield per plant (g)
§ENV: Environment, E1: Ludhiana pooled over two years, E2: Bathinda pooled over two years, Across: Pooled across all environments
Fig. 2.
a Boxplots presenting variation between Ludhiana (E1) and Bathinda (E2) pooled over two years for siliqua length (SL). b Boxplots presenting variation between Ludhiana (E1) and Bathinda (E2) pooled over two years for number of seeds per siliqua (NSS). c Boxplots presenting variation between Ludhiana (E1) and Bathinda (E2) pooled over two years for thousand-seed weight (TSW). d Boxplots presenting variation between Ludhiana (E1) and Bathinda (E2) pooled over two years for seed yield per plant (SY)
Table 3.
Top ten genotypes with highest observations for test traits based on BLUPs (across four environments: two years at two locations)
| S. no. | Genotype | SL† | Genotype | NSS† | Genotype | TSW† | Genotype | SY† |
|---|---|---|---|---|---|---|---|---|
| 1 | BLN-33–51 | 8.26* | PN-78–7-2 | 27* | CHARLTON | 4.61* | BOOMER | 15.52* |
| 2 | ZING-821 | 8.03* | MYSTIC | 26* | ZY-0–14 | 4.28* | P-617 | 15.07* |
| 3 | P-617 | 7.75* | 03-P-74–6 | 26* | USCN-56 | 4.13* | HYOLA-75 | 14.82 |
| 4 | ACN-39 | 7.63* | 06–06-3737 | 26* | P-617 | 4.06* | CHARLTON | 14.76 |
| 5 | BLN-33–44 | 7.57 | EC-28960 | 26* | ACN-50 | 4.06* | LCN-10 | 14.73 |
| 6 | PN-47–1 | 7.56 | EC-609308 | 25* | PN-84–2 | 4.03* | ACN-17 | 14.45 |
| 7 | DING-474 | 7.54 | FM-24 | 25* | ZY-0–12-4 | 3.95* | VCN-62 | 14.41 |
| 8 | FM-7 | 7.49 | FM-35 | 25* | BCN-58 | 3.92* | ZING-SHU-ANG-4 | 14.39 |
| 9 | MYSTIC | 7.49 | BLN-33–50 | 25* | PN-87–3 | 3.92* | NE-371 | 14.39 |
| 10 | RR-009 | 7.47 | P-617 | 25* | FAN-023 | 3.89* | FAN-028 | 14.09 |
| Check | GSC-7 | 7.22 | GSC-7 | 20 | GSC-7 | 3.49 | GSC-7 | 13.54 |
| LSD (5%) | 0.36 | LSD (5%) | 2.40 | LSD (5%) | 0.36 | LSD (5%) | 1.40 |
*Significantly different from check with α = 0.05
†SL, Siliqua length (cm); NSS, Number of seeds per siliqua; TSW, Thousand seed weight (g); SY, Seed yield per plant (g)
Performance of important productivity component traits (i.e., SL, NSS, TSW) for ten high productivity (SY) elite genotypes across the environments was given in Table 4. These genotypes were from diverse origins i.e., Canada (BOOMER), China (P-617 and ZING-SHU-ANG-4), Australia (HYOLA-75 and CHARLTON), France (FAN-028), United States (NE-371) and India (LCN-10, ACN-17, VCN-62). Pearson correlation coefficient was presented in Table 5. SYs have significant positive correlation with NSS (r = 0.15) and TSW (r = 0.23). SLs have weak correlation with SY. However, SLs have significant positive correlation with NSS (r = 0.38) and TSW (r = 0.16).
Table 4.
Trait performance of elite genotypes across four environments (two years at two locations)
| S. no. | Genotype | Origin | Traits† | |||
|---|---|---|---|---|---|---|
| SY | NSS | SL | TSW | |||
| 1 | BOOMER | Canada | 15.52* | 22.48* | 7.19 | 3.00 |
| 2 | P-617 | China | 15.07* | 25.17* | 7.75* | 4.06* |
| 3 | HYOLA-75 | Australia | 14.82 | 22.45* | 6.84 | 3.16 |
| 4 | CHARLTON | Australia | 14.76 | 16.00 | 5.89 | 4.61* |
| 5 | LCN-10 | India | 14.73 | 20.62 | 6.54 | 3.33 |
| 6 | ACN-17 | India | 14.45 | 22.40* | 6.90 | 3.50 |
| 7 | VCN-62 | India | 14.41 | 19.47 | 6.38 | 3.02 |
| 8 | ZING-SHU-ANG-4 | China | 14.39 | 21.12 | 6.94 | 3.61 |
| 9 | NE-371 | United states | 14.39 | 22.96* | 6.90 | 3.51 |
| 10 | FAN-028 | France | 14.09 | 22.91* | 7.00 | 3.45 |
| Check | GSC-7 | India | 13.54 | 19.91 | 7.22 | 3.49 |
| LSD (5%) | 1.40 | 2.40 | 0.36 | 0.36 | ||
*Significantly different from check with α = 0.05
†SY: Seed yield per plant (g), SL: Siliqua length (cm), NSS: Number of seeds per siliqua, TSW: Thousand seed weight (g)
Table 5.
Pearson correlation coefficients of test traits in B. napus
| Traits† | SL | NSS | TSW |
|---|---|---|---|
| NSS | 0.38* | ||
| TSW | 0.16* | − 0.08 | |
| SY | 0.05 | 0.15* | 0.23* |
*Significance at α = 0.05
†SL, Siliqua length (cm); NSS, Number of seeds per siliqua, TSW, Thousand seed weight (g); SY, Seed yield per plant (g)
Genotype-by-sequencing of B. napus genotypes
A diverse association mapping panel of 125 genotypes was used for GWAS studies (Fig. 3a). A total of 245.6 to 964.78 million raw reads were obtained through the GBS of association mapping panel (AMP). GC percent ranged from 35.13% to 38.89% with Q20 and Q30 values ranging from 94.75 to 98.33 and 87.37 to 95.64, respectively. After alignment with B. napus reference genome and variant calling, a total 18,41,123 variants (SNPs and Indels) from the whole genome of B. napus were obtained. Out of which, 1,03,274 bi-allelic SNPs having < 10% missing data and MAF ≥ 5% were obtained after filtering. These SNPs were located on 19 chromosomes of B. napus. Out of these, 18,148 SNPs were excluded due to more than 50% heterozygosity and 85,126 SNPs were finally used for GWAS. SNPs were found to be almost equally distributed over the two genomes (A and C) of B. napus (Table 6). Total 42,252 SNPs were obtained on ten chromosomes of genome A with the highest number on chromosome A03 and the least number on chromosome A04. Similarly, on genome C, total 42,874 SNPs were found on nine chromosomes with maximum SNPs obtained on chromosome C03, whereas the minimum number of SNPs found on chromosome C06. Linkage disequilibrium (LD) decay in population was estimated to be ~ 150 kb where r2 value drops below 0.1 (Fig. 3b).
Fig. 3.

a Distribution of 125 genotypes from different geographical locations based on Principle Component Analysis (PCA) plot of 85,126 SNPs. b Linkage disequilibrium (LD) decay plot in the genome of B. napus based on 85,126 SNPs in 125 B. napus diverse lines
Table 6.
Genome-wide distribution of SNPs on all chromosomes of B. napus
| Chromosome | Size (Mb)§ | Number of markers |
|---|---|---|
| A01 | 35.82 | 4287 |
| A02 | 35.33 | 4791 |
| A03 | 49.14 | 5727 |
| A04 | 23.56 | 2834 |
| A05 | 31.47 | 4267 |
| A06 | 36.08 | 4867 |
| A07 | 27.43 | 4021 |
| A08 | 27.74 | 3006 |
| A09 | 45.94 | 5298 |
| A10 | 22.22 | 3154 |
| C01 | 50.78 | 4682 |
| C02 | 68.4 | 4782 |
| C03 | 80.38 | 7214 |
| C04 | 70.55 | 6402 |
| C05 | 44.14 | 3336 |
| C06 | 45.51 | 3256 |
| C07 | 62.44 | 5691 |
| C08 | 46.35 | 4086 |
| C09 | 51.7 | 3425 |
| Total | 854.98 | 85,126 |
§Size of B. napus reference genome in this study
Marker-trait associations for test traits
Siliqua length (SL)
Total 20 significant SNPs were identified for SL, out of which six were found in E1, eight in E2 and six in pooled across environment BLUPs (Table 7). Out of 20 SNPs, one SNP (A05.1643587) from E1 and three SNPs (C03.56028089, C06.29644578, C07.36219105) from E2 were consistently identified in pooled across environment BLUP. For 12 SNPs minor allele had favourable effect (allelic effect: 0.18 to 0.28) and for eight SNPs major allele had favourable effect (allelic effect: 0.18 to 0.4). Manhattan plots depicting marker-traits associations of SL were presented in Fig. 4a.
Table 7.
Significant marker-trait associations for siliqua length based on BLUPs (E1, E2, Pooled across four environments: two years at two locations)
| Env† | SNPs associated | Chromosome | Position | MAF | Effect§ | −log10(p) |
|---|---|---|---|---|---|---|
| E1 | A09.18413957 | A09 | 18,413,957 | 0.39 | 0.19 | 4.09 |
| A05.16435872 | A05 | 1,643,587 | 0.38 | 0.18 | 4.04 | |
| A01.1223282 | A01 | 1,223,282 | 0.47 | − 0.18 | 4.01 | |
| A08.12100271 | A08 | 12,100,271 | 0.20 | 0.22 | 3.89 | |
| C04.17855316 | C04 | 17,855,316 | 0.19 | 0.27 | 3.81 | |
| A01.1108249 | A01 | 1,108,249 | 0.28 | − 0.18 | 3.80 | |
| E2 | C02.46888810 | C02 | 46,888,810 | 0.15 | − 0.29 | 4.24 |
| C03.560280893 | C03 | 56,028,089 | 0.10 | − 0.38 | 4.21 | |
| A05.31053566 | A05 | 31,053,566 | 0.37 | − 0.22 | 4.16 | |
| C07.43257273 | C07 | 43,257,273 | 0.27 | 0.27 | 4.09 | |
| C07.48993159 | C07 | 48,993,159 | 0.26 | 0.24 | 3.90 | |
| C07.362191053 | C07 | 36,219,105 | 0.13 | − 0.32 | 3.86 | |
| C06.296445783 | C06 | 29,644,578 | 0.28 | 0.18 | 3.85 | |
| C09.50218707 | C09 | 50,218,707 | 0.16 | 0.28 | 3.78 | |
| Pooled | C07.362191053 | C07 | 36,219,105 | 0.13 | -0.35 | 3.98 |
| C06.296445783 | C06 | 29,644,578 | 0.28 | 0.20 | 3.97 | |
| C03.560280893 | C03 | 56,028,089 | 0.10 | − 0.40 | 3.95 | |
| A08.24703686 | A08 | 24,703,686 | 0.41 | 0.20 | 3.82 | |
| A05.16435872 | A05 | 1,643,587 | 0.38 | 0.20 | 3.79 | |
| A01.14870209 | A01 | 14,870,209 | 0.28 | 0.21 | 3.75 |
†ENV, Environment; E1, Ludhiana pooled over two years, E2, Bathinda pooled over two years; Across, Pooled across all environments
2Common SNP between E1 and pooled, 3Common SNP between E2 and pooled
§A positive allelic effect estimate indicates that the minor allele is favorable and a negative allelic effect estimate indicates that the major allele is favourable
Fig. 4.
Manhattan plot showing significant marker-trait associations (MTAs) for E1, E2 and pooled across environments: a Siliqua length (SL), b Number of seeds per siliqua (NSS), c Thousand-seed weight (TSW), d Seed yield per plant (SY). *Chromosome 1 to 10 represents A genome mentioned as A01 to A10 and chromosome 11 to 19 represents C genome mentioned as C01 to C09 in manuscript
Number of seeds per siliqua (NSS)
There were a total of 23 significant SNPs for NSS, out of which seven were found in both E1 and E2 and nine in pooled (Table 8). Out of 23, four SNPs were consistent across environments. one SNP (C03.496706371) was present across all environments (E1, E2, pooled). Two SNPs (A04.5535255, C02.24437477) were common between E1 and pooled datasets. One SNP (A08.18254593) was common between E2 and pooled datasets. For 12 SNPs minor allele had favourable effect (allelic effect: 0.61 to 1.79) and for 11 SNPs major allele had favourable effect (allelic effect 0.6 to 1.25). Manhattan plots depicting marker-traits associations of NSS were presented in Fig. 4b.
Table 8.
Significant marker-trait associations for number of seeds per silique based on BLUPs (E1, E2, Pooled across four environments: two years at two locations)
| Env† | SNPs associated | Chromosome | Position | MAF | Effect§ | −log10(p) |
|---|---|---|---|---|---|---|
| E1 | C02.244374772 | C02 | 24,437,477 | 0.36 | 0.64 | 4.07 |
| A04.55352552 | A04 | 5,535,255 | 0.30 | –0.66 | 4.04 | |
| A10.22094031 | A10 | 22,094,031 | 0.34 | 0.63 | 4.02 | |
| C01.10935066 | C01 | 10,935,066 | 0.32 | 0.65 | 3.98 | |
| C03.496706371,2 | C03 | 49,670,637 | 0.32 | –0.60 | 3.87 | |
| C02.24309423 | C02 | 24,309,423 | 0.36 | -0.60 | 3.86 | |
| A06.2046187 | A06 | 2,046,187 | 0.43 | 0.67 | 3.76 | |
| E2 | A04.19663920 | A04 | 19,663,920 | 0.19 | –0.89 | 4.31 |
| C03.496706371,3 | C03 | 49,670,637 | 0.32 | –0.67 | 4.05 | |
| C04.38665294 | C04 | 38,665,294 | 0.17 | –0.97 | 4.04 | |
| A02.29797443 | A02 | 29,797,443 | 0.09 | 1.24 | 3.93 | |
| A08.182545933 | A08 | 18,254,593 | 0.40 | 0.62 | 3.90 | |
| A03.4833681 | A03 | 4,833,681 | 0.12 | 1.14 | 3.89 | |
| A09.18412313 | A09 | 18,412,313 | 0.49 | 0.61 | 3.76 | |
| Pooled | C03.496706372.3 | C03 | 49,670,637 | 0.32 | –0.89 | 5.37 |
| A08.182545933 | A08 | 18,254,593 | 0.40 | 0.78 | 4.61 | |
| C01.36201849 | C01 | 36,201,849 | 0.08 | 1.79 | 4.38 | |
| C02.244374772 | C02 | 24,437,477 | 0.36 | 0.81 | 4.21 | |
| C04.52890124 | C04 | 52,890,124 | 0.19 | –1.25 | 4.08 | |
| A04.55352552 | A04 | 5,535,255 | 0.30 | –0.83 | 4.08 | |
| A08.18028056 | A08 | 18,028,056 | 0.36 | –0.77 | 3.94 | |
| C08.38958037 | C08 | 38,958,037 | 0.12 | –1.22 | 3.75 | |
| A04.5535337 | A04 | 5,535,337 | 0.30 | 0.79 | 3.75 |
†ENV, Environment; E1: Ludhiana pooled over two years; E2: Bathinda pooled over two years; Across, Pooled across all environments
1Common SNP between E1 and E2,
2Common SNP between E1 and pooled,
3Common SNP between E2 and pooled
§A positive allelic effect estimate indicates that the minor allele is favorable and a negative allelic effect estimate indicates that the major allele is favourable
Thousand seed weight (TSW)
Total 30 SNPs were identified for TSW, out of which 11 were found each in both E1 and pooled while eight were in E2 (Table 9). Out of 30 SNPs, three (A03.42737659, A04.3187095, A10.91859) were common between E2 and pooled datasets. Major allele was favourable for 14 SNPs (allelic effect: 0.16 to 0.37) and 16 SNPs have favourable minor allele (allelic effect: 0.14 to 0.37). Manhattan plots depicting marker-traits associations for TSW were presented in Fig. 4c.
Table 9.
Significant marker-trait associations for thousand seed weight based on BLUPs (E1, E2, Pooled across four environments: two years at two locations)
| Env† | SNPs associated | Chromosome | Position | MAF | Effect§ | −log10(p) |
|---|---|---|---|---|---|---|
| E1 | C02.41257489 | C02 | 41,257,489 | 0.18 | 0.21 | 5.10 |
| C05.13774545 | C05 | 13,774,545 | 0.09 | − 0.29 | 4.58 | |
| C02.42717465 | C02 | 42,717,465 | 0.17 | 0.19 | 4.52 | |
| A03.13569864 | A03 | 13,569,864 | 0.20 | − 0.17 | 4.47 | |
| C02.43603089 | C02 | 43,603,089 | 0.12 | 0.23 | 4.36 | |
| C08.24798665 | C08 | 24,798,665 | 0.29 | 0.16 | 4.35 | |
| A03.13570024 | A03 | 13,570,024 | 0.21 | 0.16 | 4.26 | |
| C02.39280275 | C02 | 39,280,275 | 0.13 | − 0.20 | 4.21 | |
| C02.43441429 | C02 | 43,441,429 | 0.13 | − 0.20 | 4.01 | |
| C02.43540356 | C02 | 43,540,356 | 0.16 | 0.19 | 3.95 | |
| C02.41059840 | C02 | 41,059,840 | 0.14 | − 0.20 | 3.85 | |
| E2 | A03.427376593 | A03 | 42,737,659 | 0.27 | 0.33 | 5.68 |
| A04.31870953 | A04 | 3,187,095 | 0.17 | 0.37 | 4.88 | |
| A10.918593 | A10 | 91,859 | 0.29 | 0.36 | 4.48 | |
| A03.40403702 | A03 | 40,403,702 | 0.28 | 0.30 | 4.23 | |
| A10.9206788 | A10 | 9,206,788 | 0.20 | 0.35 | 4.20 | |
| A09.18695626 | A09 | 18,695,626 | 0.30 | − 0.26 | 4.13 | |
| C03.32913880 | C03 | 32,913,880 | 0.19 | − 0.37 | 3.99 | |
| C05.13684652 | C05 | 13,684,652 | 0.23 | − 0.35 | 3.98 | |
| Pooled | A10.918593 | A10 | 91,859 | 0.29 | 0.20 | 4.70 |
| A03.427376593 | A03 | 42,737,659 | 0.27 | 0.17 | 4.58 | |
| A04.14864106 | A04 | 14,864,106 | 0.14 | − 0.20 | 4.23 | |
| A03.34381273 | A03 | 34,381,273 | 0.12 | − 0.20 | 4.09 | |
| A03.40590580 | A03 | 40,590,580 | 0.22 | − 0.16 | 4.02 | |
| A04.31870953 | A04 | 3,187,095 | 0.17 | 0.18 | 4.00 | |
| C07.43724006 | C07 | 43,724,006 | 0.10 | − 0.20 | 3.99 | |
| A10.630419 | A10 | 630,419 | 0.40 | 0.14 | 3.91 | |
| C02.41257479 | C02 | 41,257,479 | 0.18 | − 0.17 | 3.84 | |
| A03.34381456 | A03 | 34,381,456 | 0.12 | − 0.19 | 3.82 | |
| C06.20084594 | C06 | 20,084,594 | 0.08 | 0.27 | 3.80 |
†ENV, Environment; E1, Ludhiana pooled over two years; E2, Bathinda pooled over two years; Across, Pooled across all environments
3Common SNP between E2 and pooled
§A positive allelic effect estimate indicates that the minor allele is favourable and a negative allelic effect estimate indicates that the major allele is favourable
Seed yield per plant (SY)
There are a total 18 SNPs that were found to be significant in this case. Out of the total of 18 significant SNPs, eight were identified in E1 and E2 each, while two SNPs were identified in the pooled dataset (Table 10). Major allele was favourable for eight SNPs (allelic effect: 0.91 to 1.34) and minor allele was favourable for ten SNPs (allelic effect: 0.14 to 0.37). Manhattan plots depicting marker-traits associations for SY were presented in Fig. 4d. No common SNPs were found across any environment data set in this case.
Table 10.
Significant marker-trait associations for seed yield per plant based on BLUPs (E1, E2, Pooled across four environments: two years at two locations)
| Env† | SNPs associated | Chromosome | Position | MAF | Effect§ | −log10(p) |
|---|---|---|---|---|---|---|
| E1 | A02.19604813 | A02 | 19,604,813 | 0.14 | 1.72 | 4.13 |
| C09.6453403 | C09 | 6,453,403 | 0.23 | − 1.34 | 4.05 | |
| C07.59073470 | C07 | 59,073,470 | 0.39 | − 1.19 | 3.98 | |
| A10.15563864 | A10 | 15,563,864 | 0.06 | 2.52 | 3.97 | |
| C02.67336115 | C02 | 67,336,115 | 0.19 | − 1.08 | 3.94 | |
| C03.27718223 | C03 | 27,718,223 | 0.50 | 1.01 | 3.88 | |
| A02.19263168 | A02 | 19,263,168 | 0.18 | 1.47 | 3.88 | |
| C02.17346965 | C02 | 17,346,965 | 0.43 | − 1.12 | 3.79 | |
| E2 | C01.47142794 | C01 | 47,142,794 | 0.15 | 1.34 | 4.57 |
| A08.27703652 | A08 | 27,703,652 | 0.11 | 1.13 | 4.32 | |
| A03.46280715 | A03 | 46,280,715 | 0.06 | 1.86 | 4.28 | |
| A10.11535749 | A10 | 11,535,749 | 0.24 | 0.73 | 4.08 | |
| A03.43617362 | A03 | 43,617,362 | 0.06 | 1.47 | 4.04 | |
| A10.19925371 | A10 | 19,925,371 | 0.29 | − 0.96 | 3.93 | |
| A02.23510581 | A02 | 23,510,581 | 0.28 | − 0.91 | 3.92 | |
| C08.45226147 | C08 | 45,226,147 | 0.17 | − 1.07 | 3.81 | |
| Pooled | C02.22109942 | C02 | 22,109,942 | 0.14 | − 1.11 | 4.09 |
| C01.41279341 | C01 | 41,279,341 | 0.14 | 0.99 | 3.84 |
†ENV, Environment; E1, Ludhiana pooled over two years; E2, Bathinda pooled over two years; Across, Pooled across all environments
§A positive allelic effect estimate indicates that the minor allele is favourable and a negative allelic effect estimate indicates that the major allele is favourable
Identification of candidate genes
LD decay tends to be stable when the distance is 150 kb, therefore, the candidate region is defined as the genomic region located within 150 kb upstream and downstream of the genome-wide significant SNPs. The candidate genomic region of significant SNPs was inspected for putative candidate genes for each test trait and presented in Table 11. For SL, four putative candidate genes viz. SPL2, ATS3A, CKX1 and SPL9 were identified on two chromosomes A09 (SNP: A09.18413957) and A05 (SNP: A05.1643587). For NSS, five candidate genes were identified located on chromosomes A06 and C02. Genes PERK7 and GATA15 were identified on chromosome A06 in the candidate region of SNP “A06.2046187”. Chromosome C02 has two significant SNPs, which have overlapping candidate regions. Genes NFD6 and PERK13 were identified in the candidate region of SNP “C02. 24,309,423”, while gene PRK3 was identified in the candidate region of SNP “C02. 24,437,477”.
Table 11.
Gene annotation in the confidence interval of significant SNPs associated with test traits
| Traits† | Chr§ | Position | Locus | Gene | Protein Name | |
|---|---|---|---|---|---|---|
| Start | End | |||||
| SL | A09 | 18,412,161 | 18,413,643 | LOC106447058 | SPL2 | squamosa promoter-binding-like protein 2 |
| SL | A05 | 1,497,493 | 1,498,487 | LOC106450668 | ATS3A | embryo-specific protein ATS3A-like |
| SL | A05 | 1,508,609 | 1,510,590 | LOC106450662 | CKX1 | cytokinin dehydrogenase 1-like |
| SL | A05 | 1,693,622 | 1,695,604 | LOC106450633 | SPL9 | squamosa promoter-binding-like protein 9 |
| NSS | A06 | 1,946,277 | 1,949,436 | LOC106351125 | PERK7 | proline-rich receptor-like protein kinase PERK7 |
| NSS | A06 | 2,075,018 | 2,075,610 | LOC106346114 | GATA15 | GATA transcription factor 15-like |
| NSS | C02 | 24,188,687 | 24,189,457 | LOC106404610 | NFD6 | protein NUCLEAR FUSION DEFECTIVE 6 |
| NSS | C02 | 24,245,382 | 24,250,169 | LOC106402973 | PERK13 | proline-rich receptor-like protein kinase PERK13 |
| NSS | C02 | 24,467,574 | 24,469,776 | LOC106357053 | PRK3 | pollen receptor-like kinase 3 |
| TSW | A03 | 13,420,280 | 13,421,490 | LOC106438015 | SRC2 | protein SRC2 homolog |
| TSW | A03 | 13,451,242 | 13,452,029 | LOC106438007 | WAT1 | WAT1-related protein At2g40900-like |
| TSW | A03 | 13,489,152 | 13,492,022 | LOC106438000 | AVT1A | amino acid transporter AVT1A |
| TSW | A03 | 13,524,554 | 13,524,829 | LOC106443562 | OSR1 | protein ORGAN SIZE RELATED 1-like |
| TSW | A03 | 13,550,794 | 13,552,034 | LOC106443560 | ARR8 | two-component response regulator ARR8-like |
| TSW | A03 | 13,609,966 | 13,612,015 | LOC106437984 | CKX1 | cytokinin dehydrogenase 1-like |
| TSW | A03 | 13,652,828 | 13,655,279 | LOC106428105 | At2g41710 | AP2-like ethylene-responsive transcription factor At2g41710 |
| TSW | A03 | 42,703,613 | 42,704,499 | LOC106444685 | REM9 | B3 domain-containing protein REM9-like isoform X2 |
| TSW | A03 | 42,703,613 | 42,704,474 | LOC106444685 | REM7 | B3 domain-containing protein REM7-like isoform X1 |
| TSW | C02 | 40,921,964 | 40,923,997 | LOC106383582 | BnC1 | cruciferin BnC1-like |
| TSW | C02 | 40,924,993 | 40,930,838 | LOC111203134 | BnC2 | cruciferin BnC2-like |
| SY | A02 | 19,489,729 | 19,490,450 | LOC106411812 | VPS45 | vacuolar protein sorting-associated protein 45 homolog |
| SY | A02 | 19,727,985 | 19,730,528 | LOC106397017 | WAT1 | WAT1-related protein At1g44800-like |
| SY | C02 | 21,998,836 | 21,999,801 | LOC111202902 | BG1 | protein BIG GRAIN 1-like E |
| SY | C02 | 22,027,303 | 22,028,455 | LOC111202903 | SPL6 | squamosa promoter-binding-like protein 6 |
| SY | C02 | 22,037,557 | 22,038,378 | LOC106441608 | AGL82 | agamous-like MADS-box protein AGL82 |
| SY | C02 | 22,192,124 | 22,195,820 | LOC106372743 | LHT1 | lysine histidine transporter 1 |
†NSS = number of seeds per silique, SL: silique length, TSW: thousand seed weight, SY: seed yield per plant
§Chr: Chromosome
For seed weight, presented as TSW, 13 candidate genes were identified on three chromosomes viz. A03, A09 and C02. Total nine candidate genes were identified on chromosome A03, out of which, seven genes viz. SRC2, WAT1, AVT1A, OSR1, ARR8, CKX1 and At2g41710 were located in the candidate region of SNP “A03.13569864”, while two genes REM9 and REM7 were located in the candidate region of SNP “A03.42737659”. Both chromosomes A09 and C02 have two candidate genes. LTP1 and PAH2 were present in the candidate region of SNP “A09.18695626”, while BnC1 and BnC2 were located in the candidate region of SNP “C02.41059840”. For SY, six genes were identified on two chromosomes viz. A02 and C02. Out of these, two genes viz. VPS45 and WAT1 were located on chromosome A02 in the candidate region of SNP “A02.19604813” and four genes viz. BG1, SPL6, AGL82 and LHT1 were identified in the candidate region of SNP C02.22109942 located on chromosome C02.
Discussion
B. napus being domesticated only during the recent 400–500 years in Brassicaceae family (Prakash et al. 2012), has narrow genetic base and hence, a hurdle for extensive genetic enhancement. The dual bottlenecks of polyploidy and domestication, and participation of a small number of progenitor plants in the ancient hybridization events are the probable reasons put forward. Intensive plant breeding activities for canola quality may have further eroded the variation for traits of high breeding interest (Hasan et al. 2006) particularly seed yield. Siliqua length (SL), number of seeds per siliqua (NSS), thousand-seed weight (TSW) are important components of seed yield per plant (SY) in B. napus which were investigated in a set of diverse germplasm stock to identify potential donors for these traits, associated SNPs and candidate genes. Out of 200, ten elite genotypes having the highest yield were presented in Table 4. Out of these, genotype P-617 from China has the highest NSS (~ 25), SL (7.75 cm) and second-highest TSW (4.06 g) and SY (15.07 g). Based on traits studied P-617 was recommended for commercial breeding programme. Based on variation for test traits across environments, a set of diverse germplasm stock collected from different geographical locations was used to constitute AMP, based on variation for test traits across environments. The presence of G X E interaction for test traits was found to be highly significant in ANOVA. Therefore, BLUPs were calculated and were used for GWAS.
GWAS and candidate gene identification
Significant variation and higher rate of LD decay (150 kb) presented in this panel made it highly suitable for GWAS analysis. FarmCPU uses the most significant markers as covariates in the GWAS model (Liu et al. 2016), therefore, SNPs are rarely identified within the same LD block for an environment-specific dataset (Steketee et al. 2019). Comparative studies of various models for GWAS analysis have demonstrated better performance and statistical power of FarmCPU in controlling both false positives and false negatives and indicated consistent results in identifying significant SNPs (Kaler et al. 2020). This study identified 11 loci (four for SL and NSS each, three for TSW), which were consistently identified both in individual environments (E1, E2) as well as in pooled datasets (Table 7a-d). For SL, SNP “C03.56028089” had the largest allelic effect (−0.40) with favourable major allele and was consistently identified in E2 and pooled datasets. For NSS, the largest allelic effect was 1.79 and minor allele was favourable for SNP “C01.36201849”. For TSW, SNPs “A04.3187095” and “C03.32913880” had largest allelic effect (0.37 each) with favourable minor and major alleles, respectively. SNP “A04.3187095” had significant association in both E2 and pooled datasets. SNP “A10.15563864” had the largest allelic effect (2.52) with favourable minor allele for SY.
The development of seeds in flowering plants is placed under complex interactions between maternal tissues, the embryo, and the endosperm. Functional annotation of the associated SNPs and surrounding genome space(s) helped to predict candidate genes for test traits. Important among these were the genes encoding auxin response factors, regulators of cytokinin signalling, seed storage proteins, transports related proteins and transcription factors and protein involved in successful fertilization. Important candidate genes were discussed as follows:
Candidate genes for SL
Total four candidate genes were identified for siliqua length, one of them was located on chromosome A09 and the other three were located on chromosome A05. Out of which, two genes encoding Squamosa promoter binding protein-like (SPL) proteins were identified in this study i.e., SPL2 located on chromosome A09 (SNP: A09.18413957) and SPL9 located on chromosome A05 (SNP: A05.1643587).
SPL proteins are plant-specific transcription factors and play critical roles in plant growth and development. The functions of many SPL gene family members were well characterized in various plant species, such as Arabidopsis (Cardon et al. 1999), rice (Xie et al. 2006; Wang et al. 2012) and wheat (Zhang et al. 2014). Cao et al. (2019) investigated ectopic expression of TaSPL16 gene from wheat in Arabidopsis thaliana and reported significantly longer siliquae, more seeds per siliqua and larger seed size. In Arabidopsis, siliquae of agl15, hda3, tub6, spl15, at5g10625, and at3g01323 mutants showed a significant reduction in length (Mizzotti et al. 2018).
On chromosome A05, gene ATS3A was an embryo-specific gene and expressed in a pattern similar to the Arabidopsis seed storage protein genes (Nuccio and Thomas 1999). Another gene CKX1 (LOC106450662) codes for cytokinin dehydrogenase, which catalyses the oxidation of cytokinin. Liu et al. (2018) reported higher expression of cytokinin dehydrogenase in reproductive organs e.g., buds, flowers or siliquae and their role in siliqua length, siliqua development and stress responses in B. napus.
Bennett et al. (2011) reported that siliqua is not only a photosynthetic source organ, but also serves to coordinate seed filling, regulates the reallocation of reserves, and protects seeds against biotic and abiotic stresses. Li et al. (2019) cloned two pleiotropic major QTLs, that acted indirectly on seed weight via their effects on siliqua length. Previous studies have reported pleiotropic effect of candidate genes, identified in this study, for SL viz., SPL gene family (Cao et al. 2019), ATS3A (Nuccio and Thomas 1999), CKX1 (Liu et al. 2018). Pleiotropic effect of identified candidate genes and significant positive correlation of SL with yield-component traits such as NSS (r = 0.38) and TSW (r = 0.16) suggested the importance of SL for SY improvement through marker assisted selection (MAS).
Candidate genes for TSW
Seed size that determines the seed weight, is one of the key traits for seed yield. The growth regulator auxin is known to have an important role in the regulation of different cellular processes involved in seed development (Cao et al. 2020). Auxin response factors (ARFs) are the transcription factors having the capability of recognizing AUXIN RESPONSE ELEMENTs (AuxREs) within the promoter regions of downstream genes and regulating their expression (Ulmasov 1997, 1999). Previous studies reported the role of ARFs in influencing seed size in various crops e.g., Camelina sativa (Na et al. 2019), rice (Liu et al. 2015b) and B. Napus (Liu et al. 2015a). In the previous study, Liu et al. (2015a) identified the role of ARF18 in determining final seed weight by regulating the classic auxin signalling pathway during siliqua development. Our study identified two genes REM9 and REM7 coding for B3 domain containing proteins ARF23 and ARF25 in the candidate region of SNP “A03.42737659” associated with TSW.
This study identified a gene (At2g40900-like) coding for WAT1-related protein in the candidate region of SNPs “A03.13569864” and “A03.13570024” associated with TSW. Ranocha et al. (2013) have established that Arabidopsis thaliana WAT1 (WALLS ARE THIN1) encodes a novel auxin transporter in plants and showed that WAT1 exports IAA from the vacuole to the cytoplasm in plant cells.
The high level of cytokinin activity found in endosperm suggested that the plant hormone cytokinin regulates growth of seed components and controls seed mass or yield (Letham 1973; Mok and Mok 2001; Day et al. 2008). In our study, two genes related to cytokinin signalling were identified in candidate regions of SNPs “A03.13569864” and “A03.13570024”. Gene ARR8-like (LOC106443560) codes for protein “two-component response regulator” which are primary cytokinin response targets and possibly involved in cytokinin signalling (Li et al. 2013). Siliqua pericarp provides photosynthates to the developing seeds and therefore, genes regulating siliqua length may lead to increased seed weight in B. napus (Liu et al. 2015a). In our study, CKX1 gene encoding cytokinin dehydrogenase was identified for TSW on chromosome A03 (LOC106437984) as well as for SL on chromosome A05 (LOC106450662).
Feng et al. (2011) suggested the role of phytohormones in regulating organ growth and final organ size by modifying the expression of different ORGAN SIZE RELATED (OSR) genes such as OSR1, ARGOS and ARL. ARGOS is induced by auxin and cytokinin, ARL is induced by brassinosteroid (BR), while OSR1 is induced by ethylene but repressed by abscisic acid and brassinosteroid. Our study identified gene (LOC106443562) located on chromosome A03 encoding protein ORGAN SIZE RELATED 1-like in the candidate region of SNPs “A03.13569864” and “A03.13570024” associated with TSW. Another gene (LOC106428105) encoding “AP2-like ethylene-responsive transcription factor At2g41710 '' was identified 128 kb downstream of OSR1.
The seeds of B. napus contain 25% seed storage proteins (SSPs), out of which, cruciferin (60%) is major storage protein followed by napin (20%) and other minor proteins (Gehrig et al. 1996). Our study identified two genes viz. BnC1 and BnC2 coding for seed storage protein “Cruciferin” in the confidence interval of SNP “C02.41059840” associated with TSW. Seed storage proteins are synthesized in the endoplasmic reticulum (ER) and transported to protein storage vacuoles (PSV) where they are primarily accumulated (Hohl et al. 1996; Hara-Nishimura et al. 1998). Our study identified gene SRC2 (LOC106438015) coding for “Protein SRC2 homolog” in the candidate region of co-localized SNPs “A03.13569864” and “A03.13570024” associated with TSW. Oufattole et al. (2005) identified a type II endoplasmic reticulum (ER) membrane protein (At)SRC2 in the protein storage vacuole (PSV) of mature seeds of Arabidopsis thaliana and identified its preferential interaction with a targeting motif specific for the ER-to-vacuole pathway. In the same candidate region, gene AVT1A coding for amino acid transporter was also identified, its role was reported in vacuolar amino acid transport in Arabidopsis thaliana (Fujiki et al. 2017).
Candidate genes for NSS
Five important candidate genes viz. PERK7, GATA transcription factor 15-like, NFD6, PERK13 were identified for NSS. Out of these, PERK7 and GATA transcription factor 15-like” were located in the candidate region of SNP “A06.2046187” and PERK13 and NFD6 in the candidate region of SNP “C02.24437477″. Gene PERK7 encodes proline-rich receptor-like kinase protein and NFD6 encodes NUCLEAR FUSION DEFECTIVE 6. Haffani et al. (2006) have studied ectopic expression and antisense suppression using the BnPERK1 cDNA and reported significant increase in the number of ovules per pistil and seeds per siliqua in Arabidopsis thaliana. Wang et al. (2009) cloned the rice gene NECK LEAF 1 (NL1) which encodes a GATA-type transcription factor and reported smaller panicles with overgrown bracts due to mutation in NL1.
In plant sexual reproduction, karyogamy or nuclear fusion is essential for sexual reproduction. Drews and Yadegari (2002) suggested segregation distortion and reduced seed set due to mutation affecting karyogamy. In Arabidopsis, Portereiko et al. (2006) identified nine female gametophyte mutants viz. nuclear fusion defective1 (nfd1) to nfd9, that are defective in fusion of the polar nuclei. In the mutants nfd1 to nfd6, failure of fusion of the polar nuclei is the only defect detected during mega-gametogenesis.
In A. thaliana, Takeuchi and Higashiyama (2016) have reported function of AtPRK3, along with AtPRK1, AtPRK6, and AtPRK8, as the receptors that sense the AtLURE1.2 peptide and discovered vital role of AtPRK3 in pollen tube growth, plant fertility, and plant reproduction. Chakraborty et al. (2018) have reported structural similarity of extracellular domain of pollen receptor kinase 3 to the SERK (somatic embryogenesis receptor kinase) family of co-receptors. Recently, Lee and Goring (2020) reported reduced seed set due to synergistic effect of mutation in SERKs and Leucine rich repeat receptor kinases (LRR-VIII-2 RKs) in A. thaliana.
Candidate genes for SY
This study identified “vacuolar protein sorting-associated protein 45 homolog” in the candidate region of SNP “A02.19604813” associated with SY. Bassham and Raikhel (1998) have demonstrated the role of AtVPS45p homolog in protein transport to the vacuole in Arabidopsis thaliana. Zouhar et al. (2009) suggested the role of AtVPS45 in pollen growth and cell expansion. Also, WAT1 and SPL6 (squamosa promoter-binding-like protein 6) were found to be associated with SY. Role of TaSPL3 and TaSPL6 was reported in regulating flowering time and promoting biomass accumulation in Arabidopsis (Cao et al. 2019). Role of similar genes in governing seed weight (WAT1) and siliqua length (SPL2, SPL9) was already discussed above.
Another gene (BG1) encoding protein “BIG GRAIN 1-like E” was identified on chromosome C02 in the candidate region of SNP “C02.22109942” associated with SY. In both Arabidopsis and rice, BG1 improved plant biomass, seed weight, and yield through its involvement in auxin transport and auxin response (Liu et al. 2015b). Gene AGL82 encoding agamous-like MADS-box proteins have a role in maintaining the proper function of the central cell in the pollen tube attraction (Li et al. 2015). Another gene (LHT1) encoding protein “lysine histidine transporter 1” was identified on chromosome C02. Hirner et al. (2006) showed that mutation of a single transporter gene (LHT1) is sufficient to inhibit plant growth and interfere with both the uptake and distribution of amino acids in Arabidopsis. LHT1 predominantly expresses in nonvascular tissues, including root surface and leaf mesophyll.
Conclusion
This study identified SNPs having significant association with key yield component traits viz. SL, NSS, TSW and SY in Brassica napus. Candidate gene analysis in the genomic region around SNPs (within 150 kb LD region) identified important candidate genes having significant role in phytohormone signaling, seed storage proteins, sexual reproduction and fertilization. Genotypes identified with high SL, NSS, TSW and SY suggested to serve as donors in crop improvement programs. As environments have significant influence on trait performance, multi-environment GWAS is to be preferred to capture consistent SNPs across environments with high allelic effect. The candidate regions will help to develop KASP (kompetitive allele specific polymerase chain reaction) based markers which can be used in marker assisted breeding. It is suggested to carry further research work on validation of SNPs, identified in present study, to develop gene-based markers for their utilization in marker assisted selection (MAS) in crop improvement.
Supplementary Information
Below is the link to the electronic supplementary material.
Acknowledgements
The B. napus germplasm and advance breeding lines, used in this study, were collected/developed and maintained by ICAR National Professor Dr. S. S. Banga. The authors duly acknowledge Dr. S. S. Banga for providing germplasm and valuable guidance. Authors also acknowledge Dr. Paramjit Singh (Director, Regional Research Station, Bathinda) for providing necessary facilities required for conducting field experiments at Bathinda. The fellowship from University Grant Commission (UGC) under CSIR-UGC JRF for the Ph.D. programme of Lalit Pal is duly acknowledged.
Abbreviations
- SL
Siliqua length
- NSS
Number of seeds per siliqua
- TSW
Thousand seed weight
- SY
Seed yield per plant
- BLUPs
Best linear unbiased predictors
- GWAS
Genome wide association study
- SNPs
Single nucleotide polymorphisms
- PAU
Punjab Agricultural University
- BTI
Bathinda
- LDH
Ludhiana
- ANOVA
Analysis of variance
- CV
Coefficient of variance
- CTAB
Cetyl trimethyl ammonium bromide
- GBS
Genotyping by sequencing
- FarmCPU
Fixed and Random Model Circulating Unification
- GAPIT
Genomic Association and Prediction Integrated Tool
- PCs
Principal Components
- LD
Linkage disequilibrium
- ARF
Auxin response factor
- AuxREs
Auxin response elements
- OSR
ORGAN SIZE RELATED
- BR
Brassinosteroid
- SSP
Seed storage proteins
- ER
Endoplasmic reticulum
- PSV
Protein storage vacuoles
- SERK
Somatic embryogenesis receptor kinase
- LRR-VII-2 RKs
Leucine rich repeat receptor kinases
- SPL
Squamosa promoter binding like protein
Declarations
Conflict of interest
The authors declare that there is no conflict of interest.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- Alvarado G, Rodríguez FM, Pacheco A, Burgueño J, Crossa J, Vargas M, Pérez-Rodríguez P, Lopez-Cruz MA. META-R: A software to analyze data from multi-environment plant breeding trials. Crop J. 2020 doi: 10.1016/j.cj.2020.03.010. [DOI] [Google Scholar]
- Anonymous (2020) Package of Practices for Rabi Crops. Pp 20–31. Punjab Agricultural University, Ludhiana
- Bassham DC, Raikhel NV. An Arabidopsis VPS45p homolog implicated in protein transport to the vacuole. Plant Physiol. 1998;117(2):407–415. doi: 10.1104/pp.117.2.407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bennett EJ, Roberts JA, Wagstaff C. The role of the pod in seed development: strategies for manipulating yield. New Phytol. 2011;190:838–853. doi: 10.1111/j.1469-8137.2011.03714.x. [DOI] [PubMed] [Google Scholar]
- Cao J, Li G, Qu D, Li X, Wang Y. Into the seed: Auxin controls seed development and grain yield. Int J Mol Sci. 2020;21(1662):1–17. doi: 10.3390/ijms21051662. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cao R, Guo L, Ma M, Zhang W, Liu X, Zhao H. Identification and functional characterization of Squamosa promoter binding protein-like gene TaSPL16 in wheat (Triticum aestivum L.) Front Plant Sci. 2019;10:212. doi: 10.3389/fpls.2019.00212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cardon G, Höhmann S, Klein J, Nettesheim K, Saedler H, Huijser P. Molecular characterisation of the Arabidopsis SBP-box genes. Gene. 1999;237(1):91–104. doi: 10.1016/S0378-1119(99)00308-X. [DOI] [PubMed] [Google Scholar]
- Carruthers JM, Cook SM, Wright GA, Osborne JL, Clark SJ, Swain JL, Haughton AJ. Oilseed rape (Brassica napus) as a resource for farmland insect pollinators: quantifying floral traits in conventional varieties and breeding systems. GCB Bioenergy. 2017;9(8):1370–1379. doi: 10.1111/gcbb.12438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chakraborty S, Pan H, Tang Q, Woolard C, Xu G. The Extracellular Domain of Pollen Receptor Kinase 3 is structurally similar to the SERK family of co-receptors. Sci Rep. 2018;8(1):1–2. doi: 10.1038/s41598-018-21218-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chay P, Thurling N. Identification of genes controlling pod length in spring rapeseed, Brassica napus L., and their utilization for yield improvement. Plant Breed. 1989;103:54–62. doi: 10.1111/j.1439-0523.1989.tb00351.x. [DOI] [Google Scholar]
- Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST, McVean G. The variant call format and VCFtools. J Bioinform. 2011;27(15):2156–2158. doi: 10.1093/bioinformatics/btr330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Day RC, Herridge RP, Ambrose BA, Macknight RC. Transcriptome analysis of proliferating Arabidopsis endosperm reveals biological implications for the control of syncytial division, cytokinin signaling, and gene expression regulation. Plant Physiol. 2008;148(4):1964–1984. doi: 10.1104/pp.108.128108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Diepenbrock W. Yield analysis of winter oilseed rape (Brassica napus L.): a review. Field Crops Res. 2000;67:35–49. doi: 10.1016/S0378-4290(00)00082-4. [DOI] [Google Scholar]
- Doyle JJ, Doyle JL, Hortoriun LB. Isolation of plant DNA from fresh tissue. Focus. 1990;12:13–15. [Google Scholar]
- Drews GN, Yadegari R. Development and function of the angiosperm female gametophyte. Annu Rev Genet. 2002;36:99–124. doi: 10.1146/annurev.genet.36.040102.131941. [DOI] [PubMed] [Google Scholar]
- Duggal P, Gillanders EM, Holmes TN, Bailey-Wilson JE. Establishing an adjusted p-value threshold to control the family-wide type 1 error in genome wide association studies. BMC Genom. 2008;9:1–8. doi: 10.1186/1471-2164-9-516. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Feng G, Qin Z, Yan J, Zhang X, Hu Y. Arabidopsis ORGAN SIZE RELATED1 regulates organ growth and final organ size in orchestration with ARGOS and ARL. New Phytol. 2011;191(3):635–646. doi: 10.1111/j.1469-8137.2011.03710.x. [DOI] [PubMed] [Google Scholar]
- Fujiki Y, Teshima H, Kashiwao S, Kawano-Kawada M, Ohsumi Y, Kakinuma Y, Sekito T. Functional identification of AtAVT3, a family of vacuolar amino acid transporters. Arabidopsis FEBS Lett. 2017;591(1):5–15. doi: 10.1002/1873-3468.12507. [DOI] [PubMed] [Google Scholar]
- Gehrig PM, Krzyzaniak A, Barciszewski J, Biemann K. Mass spectrometric amino acid sequencing of a mixture of seed storage proteins (napin) from Brassica napus, products of a multigene family. Proc Natl Acad Sci USA. 1996;93:3647–3652. doi: 10.1073/pnas.93.8.3647. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haffani Y, Silva-Gagliardi N, Sewter S, Aldea MG, Zhao Z, Nakhamchik A, Cameron R, Goring D. Altered expression of PERK receptor kinases in Arabidopsis leads to changes in growth and floral organ formation. Plant Signal Behav. 2006;1(5):251–260. doi: 10.4161/psb.1.5.3324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hara-Nishimura I, Shimada T, Hatano K, Takeuchi Y, Nishimura M. Transport of storage proteins to protein storage vacuoles is mediated by large precursor-accumulating vesicles. Plant Cell. 1998;10:825–836. doi: 10.1105/tpc.10.5.825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hasan M, Seyis F, Badani AG, Pons-Kuhnemann J, Friedt W, Luhs W, Snowdon RJ. Analysis of genetic diversity in the Brassica napus L. gene pool using SSR markers. Genet Resour Crop Evol. 2006;53:793–802. doi: 10.1007/s10722-004-5541-2. [DOI] [Google Scholar]
- Hirner A, Ladwig F, Stransky H, Okumoto S, Keinath M, Harms A, Frommer WB, Koch W. Arabidopsis LHT1 is a high-affinity transporter for cellular amino acid uptake in both root epidermis and leaf mesophyll. Plant Cell. 2006;18(8):1931–1946. doi: 10.1105/tpc.106.041012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hohl I, Robinson DG, Chrispeels MJ, Hinz G. Transport of storage proteins to the vacuole is mediated by vesicles without a clathrin coat. J Cell Sci. 1996;109:2539–2550. doi: 10.1242/jcs.109.10.2539. [DOI] [PubMed] [Google Scholar]
- Kaler AS, Gillman JD, Beissinger T, Purcell LC. Comparing different statistical models and multiple testing corrections for association mapping in soybean and maize. Front Plant Sci. 2020;10:1794. doi: 10.3389/fpls.2019.01794. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kaur H, Kaur P. Temperature features in different agroclimatic zones of Punjab. Agricultural Res J. 2015;52(4):32–35. doi: 10.5958/2395-146X.2015.00057.5. [DOI] [Google Scholar]
- Kaur H, Wang L, Stawniak N, Sloan R, Van Erp H, Eastmond P, Bancroft I. The impact of reducing fatty acid desaturation on the composition and thermal stability of rapeseed oil. Plant Biotechnol J. 2020;18(4):983–991. doi: 10.1111/pbi.13263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kaur P, Hundal SS. Climatic trends in different agroclimatic zones of Punjab: II Rainfall features and its variability. J Res. 2008;45(1–2):41–50. [Google Scholar]
- Lebowitz R. Image analysis measurements and repeatability estimates of siliqua morphological traits in Brassica campestris L. Euphytica. 1989;43:113–116. doi: 10.1007/BF00037903. [DOI] [Google Scholar]
- Lee HK, Goring D. Two groups of Arabidopsis receptor kinases preferentially regulate the growth of intra-species pollen tubes in the female reproductive tract. Curr Bio. 2020 doi: 10.2139/ssrn.3564989. [DOI] [Google Scholar]
- Letham DS. Cytokinins from Zea mays. Phytochem. 1973;12(10):2445–2455. doi: 10.1016/0031-9422(73)80453-4. [DOI] [Google Scholar]
- Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinform. 2009;25(14):1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. The sequence alignment/map format and SAMtools. Bioinform. 2009;25(16):2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li HJ, Zhu SS, Zhang MX, Wang T, Liang L, Xue Y, Shi DQ, Liu J, Yang WC. Arabidopsis CBP1 is a novel regulator of transcription initiation in central cell-mediated pollen tube guidance. Plant Cell. 2015;27(10):2880–28893. doi: 10.1105/tpc.15.00370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li J, Nie X, Tan JL, Berger F. Integration of epigenetic and genetic controls of seed size by cytokinin in Arabidopsis. Proc Natl Acad Sci USA. 2013;110(38):15479–15484. doi: 10.1073/pnas.1305175110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li N, Song D, Peng W, Zhan J, Shi J, Wang X, Liu G, Wang H. Maternal control of seed weight in rapeseed (Brassica napus L.): the causal link between the size of pod (mother, source) and seed (offspring, sink) Plant Biotechnol J. 2019;17(4):736–749. doi: 10.1111/pbi.13011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lipka AE, Tian F, Wang Q, Peiffer J, Li M, Bradbury PJ, Gore MA, Buckler ES, Zhang Z. GAPIT: genome association and prediction integrated tool. Bioinform. 2012;28:2397–2399. doi: 10.1093/bioinformatics/bts444. [DOI] [PubMed] [Google Scholar]
- Liu J, Hua W, Hu Z, Yang H, Zhang L, Li R, Deng L, Sun X, Wang X, Wang H. Natural variation in ARF18 gene simultaneously affects seed weight and silique length in polyploid rapeseed. Proc Natl Acad Sci. 2015;112(37):E5123–5132. doi: 10.1073/pnas.1502160112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu L, Tong H, Xiao Y, Che R, Xu F, Hu B, Liang C, Chu J, Li J, Chu C. Activation of Big Grain1 significantly improves grain size by regulating auxin transport in rice. Proc Natl Acad Sci. 2015;112(35):11102–11107. doi: 10.1073/pnas.1512748112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu P, Zhang C, Ma JQ, Zhang LY, Yang B, Tang XY, Huang L, Zhou XT, Lu K, Li JN. Genome-wide identification and expression profiling of cytokinin oxidase/dehydrogenase (CKX) genes reveal likely roles in pod development and stress responses in oilseed rape (Brassica napus L.) Genes. 2018;9(3):168. doi: 10.3390/genes9030168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu X, Huang M, Fan B, Buckler ES, Zhang Z. Iterative usage of fixed and random effect models for powerful and efficient genome-wide association studies. PLoS Genet. 2016;12(2):1005767. doi: 10.1371/journal.pgen.1005767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mizzotti C, Rotasperti L, Moretto M, Tadini L, Resentini F, Galliani BM, Galbiati M, Engelen K, Pesaresi P, Masiero S. Time-course transcriptome analysis of Arabidopsis siliques discloses genes essential for fruit development and maturation. Plant Physiol. 2018;178(3):1249–1268. doi: 10.1104/pp.18.00727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mok DW, Mok MC. Cytokinin metabolism and action. Annu Rev Plant Biol. 2001;52:89–118. doi: 10.1146/annurev.arplant.52.1.89. [DOI] [PubMed] [Google Scholar]
- Na G, Mu X, Grabowski P, Schmutz J, Lu C. Enhancing micro-RNA 167A expression in seed decreases the α-linolenic acid content and increases seed size in Camelina sativa. Plant J. 2019;98(2):346–358. doi: 10.1111/tpj.14223. [DOI] [PubMed] [Google Scholar]
- Nagaharu U. Genome analysis in Brassica with special reference to the experimental formation of B. napus and peculiar mode of fertilization. J Jap Bot. 1935;7:389–452. [Google Scholar]
- Nuccio ML, Thomas TL. ATS1 and ATS3: Two novel embryo-specific genes in Arabidopsis thaliana. Plant Mol Biol. 1999;391(1):1153–1163. doi: 10.1023/A:1006101404867. [DOI] [PubMed] [Google Scholar]
- Oufattole M, Park JH, Poxleitner M, Jiang L, Rogers JC. Selective membrane protein internalization accompanies movement from the endoplasmic reticulum to the protein storage vacuole pathway in Arabidopsis. Plant Cell. 2005;17(11):3066–3080. doi: 10.1105/tpc.105.035212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pechan PA, Morgan DG. Defoliation and its effects on pod and seed development in oil seed rape (Brassica napus L.) J Exp Bot. 1985;36(3):458–468. doi: 10.1093/jxb/36.3.458. [DOI] [Google Scholar]
- Portereiko MF, Sandaklie-Nikolova L, Lloyd A, Dever CA, Otsuga D, Drews GN. NUCLEAR FUSION DEFECTIVE1 encodes the Arabidopsis RPL21M protein and is required for karyogamy during female gametophyte development and fertilization. Plant Physiol. 2006;141(3):957–965. doi: 10.1104/pp.106.079319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prakash S, Wu X, Bhat SR. History, evolution and domestication of Brassica crops. Plant Breed Rev. 2012;35:19–84. [Google Scholar]
- Priyamedha BK, Thomas L, Bala M, Singh VV, Singh D. Status and perspective of canola quality rapeseed-mustard cultivation in India: a review. J Oilseed Brassica. 2015;1(1):142–151. [Google Scholar]
- R Core Team (2018) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/
- Ranocha P, Dima O, Nagy R, Felten J, Corratgé-Faillie C, Novák O, Morreel K, Lacombe B, Martinez Y, Pfrunder S, Jin X. Arabidopsis WAT1 is a vacuolar auxin transport facilitator required for auxin homoeostasis. Nat Commun. 2013;4(1):1–9. doi: 10.1038/ncomms3625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- RStudio Team (2020) RStudio: Integrated Development for R. RStudio, Inc., Boston, MA URL http://www.rstudio.com/
- Sandhu SK, Pal L, Kaur J, Bhatia D. Genome wide association studies for yield and its component traits under terminal heat stress in Indian mustard (Brassica juncea L.) Euphytica. 2019;215(11):1–5. doi: 10.1007/s10681-019-2489-z. [DOI] [Google Scholar]
- Shi J, Zhan J, Yang Y, Ye J, Huang S, Li R, Wang X, Liu G, Wang H. Linkage and regional association analysis reveal two new tightly-linked major-QTLs for pod number and seed number per pod in rapeseed (Brassica napus L.) Sci Rep. 2015;5(1):1–8. doi: 10.1038/srep14481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steketee CJ, Sinclair TR, Riar MK, Schapaugh WT, Li Z. Unraveling the genetic architecture for carbon and nitrogen related traits and leaf hydraulic conductance in soybean using genome-wide association analyses. BMC Genom. 2019;20(1):1–8. doi: 10.1186/s12864-019-6170-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takeuchi H, Higashiyama T. Tip-localized receptors control pollen tube growth and LURE sensing in Arabidopsis. Nature. 2016;531:245–248. doi: 10.1038/nature17413. [DOI] [PubMed] [Google Scholar]
- Ulmasov T, Hagen G, Guilfoyle TJ. ARF1, a transcription factor that binds to auxin response elements. Science. 1997;276(5320):1865–1868. doi: 10.1126/science.276.5320.1865. [DOI] [PubMed] [Google Scholar]
- Ulmasov T, Hagen G, Guilfoyle TJ. Dimerization and DNA binding of auxin response factors. Plant J. 1999;19(3):309–319. doi: 10.1046/j.1365-313X.1999.00538.x. [DOI] [PubMed] [Google Scholar]
- Wang H, Yan M, Xiong M, Wang P, Liu Y, Xin Q, Wan L, Yang G, Hong D. Genetic dissection of thousand-seed weight and fine mapping of cqSW. A03–2 via linkage and association analysis in rapeseed (Brassica napus L.) Theor Appl Genet. 2020;133(4):1321–1335. doi: 10.1007/s00122-020-03553-9. [DOI] [PubMed] [Google Scholar]
- Wang L, Yin H, Qian Q, Yang J, Huang C, Hu X, Luo D. NECK LEAF 1, a GATA type transcription factor, modulates organogenesis by regulating the expression of multiple regulatory genes during reproductive development in rice. Cell Res. 2009;19(5):598–611. doi: 10.1038/cr.2009.36. [DOI] [PubMed] [Google Scholar]
- Wang S, Wu K, Yuan Q, Liu X, Liu Z, Lin X, et al. Control of grain size, shape and quality by OsSPL16 in rice. Nat Genet. 2012;44:950–954. doi: 10.1038/ng.2327. [DOI] [PubMed] [Google Scholar]
- Xie K, Wu C, Xiong L. Genomic organization, differential expression, and interaction of SQUAMOSA promoter-binding-like transcription factors and microRNA156 in rice. Plant Physiol. 2006;142:280–293. doi: 10.1104/pp.106.084475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang P, Shu C, Chen L, Xu J, Wu J, Liu K. Identification of a major QTL for silique length and seed weight in oilseed rape (Brassica napus L.) Theor Appl Genet. 2012;125(2):285–296. doi: 10.1007/s00122-012-1833-7. [DOI] [PubMed] [Google Scholar]
- Yang W, Guo Z, Huang C, Duan L, Chen G, Jiang N, Fang W, Feng H, Xie W, Lian X, Wang G. Combining high-throughput phenotyping and genome-wide association studies to reveal natural genetic variation in rice. Nat Commun. 2014;5:5087. doi: 10.1038/ncomms6087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang Y, Shi J, Wang X, Liu G, Wang H. Genetic architecture and mechanism of seed number per pod in rapeseed: elucidated through linkage and near-isogenic line analysis. Sci Rep. 2016;6(1):1–10. doi: 10.1038/s41598-016-0001-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang B, Liu X, Zhao G, Mao X, Li A, Jing R. Molecular characterization and expression analysis of Triticum aestivum squamosa promoter binding protein-box genes involved in ear development. J Integr Plant Biol. 2014;56:571–581. doi: 10.1111/jipb.12153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang C, Dong SS, Xu JY, He WM, Yang TL. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinform. 2019;35(10):1786–1788. doi: 10.1093/bioinformatics/bty875. [DOI] [PubMed] [Google Scholar]
- Zhang L, Yang G, Liu P, Hong D, Li S, He Q. Genetic and correlation analysis of silique-traits in Brassica napus L. by quantitative trait locus mapping. Theor Appl Genet. 2011;122(1):21–31. doi: 10.1007/s00122-010-1419-1. [DOI] [PubMed] [Google Scholar]
- Zhu Y, Ye J, Zhan J, Zheng X, Zhang J, Shi J, Wang X, Liu G, Wang H. Validation and characterization of a seed number per silique quantitative trait locus qSN A7 in Rapeseed (Brassica napus L.) Front Plant Sci. 2020;11:68. doi: 10.3389/fpls.2020.00068. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zouhar J, Rojo E, Bassham DC. AtVPS45 is a positive regulator of the SYP41/SYP61/VTI12 SNARE complex involved in trafficking of vacuolar cargo. Plant Physiol. 2009;149(4):1668–1678. doi: 10.1104/pp.108.134361. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




