Abstract
Seed traits in bread wheat are valuable to breeders and farmers, thus it is important exploring putative QTLs responsible for key traits to be used in breeding programs. GWAS was carried out using 298 bread wheat landraces and cultivars from Iran to uncover the genetic basis of seed characteristics in both rain-fed and well-watered environments. The analyses of linkage disequilibrium (LD) between marker pairs showed that the largest number of significant LDs in landraces (427,017) and cultivars (370,359) was recorded in genome B, and the strongest LD was identified on chromosome 4A (0.318). LD decay was higher in the B and A genomes, compared to the D genome. Mapping by using mrMLM (LOD > 3) and MLM (0.05/m, Bonferroni) led to 246 and 67 marker-trait associations (MTAs) under rain-fed, as well as 257 and 74 MTAs under well-watered conditions, respectively. The study found that 3VmrMLM correctly detected all types of loci and estimated their effects in an unbiased manner, with high power and accuracy and a low false positive rate, which led to the identification of 140 MTAs (LOD > 3) in all environments. Gene ontology revealed that 10 and 10 MTAs were found in protein-coding regions for rain-fed and well-watered conditions, respectively. The findings suggest that landraces studied in Iranian bread wheat germplasm possess valuable alleles, which are responsive to water-limited conditions. MTAs uncovered in this study can be exploited in the genome-mediated development of novel wheat cultivars.
Subject terms: Genetic association study, Genetic markers
Introduction
Bread wheat (Triticum aestivum L.) is a strategic cereal worldwide and can feed approximately 30% of the global population and provide 25% of the calorie consumed by humans1. Owing to rapid population growth, climate change, and abiotic stress incidence in the world, wheat productivity needs a 2.5% of yield increase yearly. Therefore, to meet future demand, plant breeders face the challenge of increasing wheat production up to 70% by the 2050s2. Drought stress adversely influences wheat productivity by disrupting a variety of bio-physiological and metabolic activities, and thereby giving rise to yield loss by a diminution in biomass3. Seed morphometric properties are explored as basic parameters in digital seed analysis4, enhancing the understanding of seed response to drought stress and providing data for research on wheat breeding in water-limited conditions5–7. There are only a few reports on investigating seed physical traits in the previous researches8, and such data have been focused on energy dissipation (thermodynamic) or shape, volume, surface area, sphericity, aspect ratio, density, and moisture (dimensional) properties. Seed physical traits, such as shape and size, are found effective for grain storage and processing. These features may be helpful, among others, for food scientists, processors, and engineers. The composition of the seed is influenced by the seed number, cultivar, water availability, temperature, light, and maturity9.
Genomics-by-sequencing (GBS) is a method for evaluating genetic variation and discovering new markers based on the advent of next-generation sequencing technologies10. This approach has been used to discover the complicated agronomical properties of wheat using molecular markers such as single nucleotide polymorphisms (SNP). They have also been recognized as key elements in genome-wide association studies11. This approach is aimed at detecting genomic regions that are either QTLs, genes, or markers related to important traits for gene introgression, gene discovery, or marker-assisted breeding12. Genetic markers detected by GWAS enable the dissection of genetic structure and diversity across many loci. This can enable wheat breeders to discover and use genomic loci controlling drought tolerance13.
Exploring the genetic basis of complicated quantitative traits by innovative technologies is critical to wheat breeding programs14. Genome-wide association mapping (GWAS), an efficient approach to dissecting the genetic foundation of complex traits, first genotypes a large collection of accessions with a lot of single-nucleotide polymorphisms (SNPs) distributed throughout the genome and then tests their associations with agronomic traits15. Association mapping has been successfully utilized to evaluate several agronomic traits in a range of plants/crops, including alfalfa16, sorghum17, soybean18, maize19, and rice20. Although GWAS has been widely adopted to examine agronomic characteristics in wheat, only a few studies used this approach for seed-related properties in drought-stressed wheat genotypes. In an attempt, Rahimi et al.21 demonstrated that bread wheat landraces from Iran possess favorable alleles, which are adaptive to water deficit. They also observed marker-trait associations (MTAs) within protein-coding regions that can be used in the molecular breeding of novel wheat cultivars. Such studies also provide important data about MTAs, which can assist plant breeders in the marker-assisted selection schedules22.
The objective of this study was to perform a genome-wide association analysis for seed morphometric traits in Iranian bread wheat. Seed morphometric traits were utilized in association studies to uncover putative QTLs responsible for key seed traits in water-limited conditions.
Results
Phenotypic data summary
The effects of genotype and genotype × environment for seed morphometric traits in the whole population were significant at 0.001 probability level (Table 1). The results of the box plot of 34 morphometric traits of wheat seeds for cultivars and native landraces in favorable conditions (well-watered) and stress (rain-fed) are shown in Fig. 1. The means of all traits under stress conditions decreased compared to normal conditions in both cultivars and native landraces. In both environments, the highest length (Feret) and width (Breadth) were found in landraces and cultivars, respectively. There was no significant difference between cultivars and landraces for the most important morphometric traits of seeds, i.e. Area, Area.1, Area.2, Volume, thickness, and 1000-kernel weight (TKW). Overall, the diversity and distribution among native landraces were higher than those of cultivars in both well-watered and rain-fed environments.
Table 1.
Trait | Well-watered | Rain-fed | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Mean | CV (%) | H2 | Mean squares | Mean | CV (%) | H2 | Mean squares | |||||||
Env | Rep (env) | Gen | Gen × Env | Env | Rep (Env) | Gen | Gen × Env | |||||||
ArBBox | 22.37 | 6.543 | 0.831 | * | ** | *** | *** | 18.89 | 7.926 | 0.732 | ** | ** | *** | *** |
ArBBox_1 | 24.07 | 7.595 | 0.848 | * | * | *** | *** | 20.16 | 8.676 | 0.686 | ** | ** | *** | *** |
ArBBox_2 | 10.79 | 8.027 | 0.801 | * | * | *** | *** | 7.963 | 10.43 | 0.660 | ** | ** | *** | *** |
Area | 17.44 | 6.461 | 0.826 | * | * | *** | *** | 14.49 | 7.811 | 0.719 | ** | ** | *** | *** |
Area_1 | 18.22 | 7.495 | 0.839 | * | * | *** | *** | 15.06 | 8.419 | 0.681 | ** | ** | *** | *** |
Area_2 | 8.322 | 7.965 | 0.808 | * | * | *** | *** | 6.133 | 10.03 | 0.619 | ** | ** | *** | *** |
ArEquivD | 4.702 | 3.260 | 0.835 | * | * | *** | *** | 4.283 | 3.910 | 0.738 | ** | ** | *** | *** |
Aspectratio | 2.166 | 4.170 | 0.881 | * | * | *** | *** | 2.504 | 5.253 | 0.843 | ** | ** | *** | *** |
Breadth | 3.213 | 4.454 | 0.799 | * | * | *** | *** | 2.750 | 5.815 | 0.701 | ** | ** | *** | *** |
CArea | 17.82 | 6.411 | 0.830 | * | * | *** | *** | 14.93 | 7.699 | 0.728 | ** | ** | *** | *** |
CHull | 16.74 | 3.030 | 0.878 | ns | ns | *** | *** | 15.97 | 3.335 | 0.840 | ns | ns | *** | *** |
Circ | 219.1 | 6.461 | 0.826 | * | ** | *** | *** | 182.1 | 7.811 | 0.719 | ** | ** | *** | *** |
Compactness | 0.680 | 2.030 | 0.882 | ns | ns | *** | *** | 0.629 | 2.478 | 0.830 | ns | * | *** | *** |
Concavity | 0.382 | 22.18 | 0.504 | * | * | *** | *** | 0.437 | 12.92 | 0.631 | ** | ** | *** | *** |
Convexity | 0.948 | 0.865 | 0.374 | * | ** | *** | *** | 0.947 | 0.415 | 0.519 | ** | ** | *** | *** |
EquivEllAr | 17.53 | 6.561 | 0.830 | * | ** | *** | *** | 14.79 | 8.001 | 0.732 | ** | ** | *** | *** |
Frete | 6.931 | 3.237 | 0.901 | ns | ns | *** | *** | 6.829 | 3.414 | 0.869 | * | * | *** | *** |
MaxR | 3.495 | 3.151 | 0.899 | * | ** | *** | *** | 3.439 | 3.334 | 0.869 | ** | ** | *** | *** |
MBCRadius | 5.123 | 3.057 | 0.868 | ns | ns | *** | *** | 4.831 | 3.485 | 0.811 | * | * | *** | *** |
MinR | 1.628 | 4.357 | 0.803 | ns | ns | *** | *** | 1.392 | 5.634 | 0.706 | ns | ns | *** | *** |
ModRatio | 1.481 | 1.401 | 0.890 | ns | ns | *** | *** | 1.417 | 1.568 | 0.842 | * | * | *** | *** |
PerEquivD | 5.551 | 6.461 | 0.826 | ns | ns | *** | *** | 4.613 | 7.811 | 0.719 | ns | * | *** | *** |
Perim | 17.67 | 3.292 | 0.869 | ns | ns | *** | *** | 16.88 | 3.410 | 0.837 | ns | * | *** | *** |
Perim_1 | 18.41 | 3.353 | 0.879 | ns | ns | *** | *** | 17.46 | 4.073 | 0.794 | * | * | *** | *** |
Perim_2 | 10.17 | 4.135 | 0.800 | ns | * | *** | *** | 8.681 | 5.201 | 0.672 | * | * | *** | *** |
Rectang | 0.780 | 1.207 | 0.488 | * | ** | *** | *** | 0.768 | 1.332 | 0.707 | ns | ns | *** | *** |
RFactor | 0.770 | 1.049 | 0.882 | * | ** | *** | *** | 0.745 | 1.077 | 0.828 | * | * | *** | *** |
Roundness | 2.156 | 4.605 | 0.871 | * | ** | *** | *** | 2.456 | 5.872 | 0.846 | * | * | *** | *** |
Shape | 17.99 | 3.778 | 0.770 | ns | * | *** | *** | 19.82 | 3.150 | 0.820 | * | * | *** | *** |
Solidity | 0.978 | 0.463 | 0.416 | ** | ** | *** | *** | 0.970 | 0.381 | 0.556 | ** | ** | *** | *** |
Sphericity | 3.155 | 2.744 | 0.880 | ns | * | *** | *** | 3.490 | 3.567 | 0.849 | * | * | *** | *** |
Thickness | 3.312 | 5.445 | 0.801 | ns | * | *** | *** | 2.847 | 6.379 | 0.607 | * | ** | *** | *** |
TKW | 42.10 | 1.193 | 0.796 | * | ns | *** | *** | 28.18 | 1.955 | 0.591 | * | ns | *** | *** |
Volume | 38.92 | 8.379 | 0.837 | *** | *** | *** | *** | 28.38 | 10.71 | 0.688 | ** | ** | *** | *** |
*, **, *** and ns are significant at the probability levels of 5%, 1%, 0.1%, and non-significant, respectively.
The highest correlation was observed between TKW and volume under stress (r = 0.76**) and normal (r = 0.85**) conditions. There was also a high correlation between circ and TKW in rain-fed (r = 0.73**) and well-watered (r = 0.83**) environments, which indicates that the more round the seed, the heavier it will be (Fig. 2; Supplementary Table 1).
Assessment of SNPs
The number of imputed SNPs includes 15,951, 21,864, and 5710 markers for genomes A, B, and D, respectively, including 36.7, 50.2, and 13.1% of total SNPs (Fig. 3A). The highest number of markers used in all chromosomes except chromosome 4 is related to genome A. The highest number of markers 4034 is related to chromosome 3A and the lowest number of markers 270 is related to chromosome 4D (Fig. 3B).
Linkage disequilibrium (LD)
LD assessment indicated that this indicator varies between chromosomes and across each chromosome and it usually decreases with rising distances between SNP locations. A total of 1,858,425 marker pairs with r2 = 0.211 were identified in cultivars, of which 700,991 (37.72%) harbored significant linkages at P < 0.001. The strongest LD was recorded between marker pairs on chromosome 4A (r2 = 0.367). Based on the observations, most of the significant marker pairs were found at a distance of < 10 cM. Genomes D and B possessed the lowest and highest number of significant marker pairs (63,924) and (370,359), respectively. A similar analysis on landraces identified a total of 1,867,575 marker pairs with r2 = 0.182, of which 847,725 (45.39%) harbored significant linkages at P < 0.001. Similar to cultivars, marker pairs on chromosome 4A showed the strongest LD (r2 = 0.369). Moreover, most of the significant marker pairs were found at a distance of < 10 cM. Genomes D and B possessed the lowest and highest number of marker pairs (92,702 and 427,017), respectively (Supplementary Table 2).
Population structure and Kinship matrix
The genetic relationship of accessions in the wheat population was assayed via the Kinship matrix derived from imputed SNPs. Population structure analysis indicated the highest value of ΔK for K = 3 (Fig. 4A,B). The estimated principal components for the population revealed that PC1, PC2, and PC3 explain 16.94, 6.34, and 2.30% of genotypic variations, respectively (Fig. 4C). As expected, a population structure was identified in the Iranian wheat landraces, with the first five eigenvalues accounting for 30.50% of genetic diversity.
The clustering analysis determined three major groups with different levels of admixture, where Group I consists of 6 cultivars and 107 landraces, Group II includes 4 landraces and 70 cultivars, Group III includes 14 cultivars, and 97 landraces (Fig. 4D). From the imputed SNP data, a total of 19 Cultivars appeared to be mixed with the two native landrace groups. The admixed Cultivars originated include the Sivand, Neishabour, Ghods, Azadi, Mahdavi, 4820, and Shahi. A neighbor-joining tree indicated that both cultivars and landraces were divided into two groups based on the imputed SNPs (Supplementary Fig. 1). In an analysis based on cultivars, two groups with 42 and 48 accessions were obtained. Native landraces were also divided into two groups with 98 and 110 accessions. The reason for each group's location can be due to the characteristics of the parents and the place where they came from.
Genome-wide association studies for morphometric seed traits using mrMLM, 3VmrMLM, and MLM
A total of 257 and 74 MTAs were identified by mrMLM and MLM models under well-watered conditions, respectively, using the imputed SNPs at a significance value of LOD > 3 (mrMLM) and 0.05/m (MLM). Of the total MTAs in the mrMLM method, 95, 99, and 63 MTAs were related to genomes A, B, and D, respectively. Out of 74 MTAs in the MLM method, 27, 31, and 16 MTAs belonged to genomes A, B, and D, respectively. Genome B with 38.5% (mrMLM) and 41.9% (MLM) had the highest number of significant MTAs. Therefore, the mrMLM approach led to the most MATs. The number of significant MTAs for Frete, Breadth, Thickness, Area, Perim, Circ, Volume, and TKW traits by using the mrMLM method were 9, 8, 9, 7, 7, 7, 12, and 10, respectively, and according to the MLM method were 5, 1, 0, 2, 4 2, 0, and zero, respectively. Based on mrMLM and MLM methods, the highest number of significant MTAs was related to ArBBox.1 and Concavity (18 and 10 MTAs, respectively) (Fig. 5A,B).
More significant MTAs were identified in rain-fed than well-irrigated conditions, i.e., a total of 246 and 67 MTAs were recorded based on mrMLM and MLM methods, respectively. Of these MTAs, 110, 105, and 31 mrMLM-based MTAs, as well as 30, 33, and 4 MLM-based MTAs were related to genomes A, B, and D, respectively. Genome A and B had the highest percentage of significant MTAs with 44.7% and 49.6% based on mrMLM and MLM, respectively. The number of significant MTAs for Frete, Breadth, Thickness, Area, Perim, Circ, Volume, and TKW traits according to the mrMLM method were 8, 4, 10, 7, 4, 7, 8, and 8, respectively, and according to the MLM method were 3, 3, 2, 0, 0, 0, 1, and 1, respectively. Based on mrMLM and MLM methods, the highest number of significant MTAs were related to Concavity (11 and 7 MTAs, respectively) (Fig. 5C,D). Circular Manhattan plots were plotted for common regions associated with seed traits (Fig. 6; Supplementary Fig. 2).
In this study, we adopted a three-variance component mixed model method, 3VmrMLM, for detecting QTNs and QTN-by-environment (QEIs). A total of 140 MTAs were identified by 3VmrMLM model, using the imputed SNPs at a significance value of LOD > 3. A total of 64, 60, and 16 MTAs based on 3VmrMLM, were related to genomes A, B, and D, respectively. Genome A with 45.7% had the highest number of significant MTAs. Therefore, the 3VmrMLM approach led to the most MATs. The number of significant MTAs for Frete, Breadth, Thickness, Area, Circ, Volume, and TKW traits by using the mrMLM method were 2, 3, 0, 6, 6, 5, and 4, respectively (Fig. 5E). QTN-by-environment interactions using 3VmrMLM for Area, Perim, Frete, Breadth, and TKW are reported in Table 2 and other traits in Supplementary Table 3.
Table 2.
Trait name | Marker | Sequence | Chro | Position (cM) | LOD (QE) | add*env1 | dom*env1 | add*env2 | dom*env2 | add*env3 | dom*env3 | add*env4 | dom*env4 | variance | r2 (%) | P-value | Method | Transcript ID | Biological process | Molecular process | Cellular component |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Area | rs38918 | TGCAGCTGCCTGCTCATCCTCACCATCTCAATCTGTGTACGAGTAGATCCCATGGTTTACCAGT | 2A | 11.39 | 22.1318 | − 0.3807 | − 0.5515 | − 0.2525 | − 0.6141 | 0.3785 | 0.3653 | 0.2548 | 0.8002 | 0.1164 | 2.0318 | 9.99E−20 | MLM | TraesCS2A02G045800 | Protein binding | ||
rs47935 | TGCAGGCGCCACTTATCCTAATCGGCCAATAAGGCCAACTCCAACGCGCACCCCCATCATGTCT | 2B | 59.184 | 13.7438 | 0.3024 | 0.3799 | 0.2175 | 0.4805 | − 0.296 | − 0.4631 | − 0.224 | − 0.3972 | 0.0711 | 1.2412 | 9.63E−12 | MLM | – | – | – | – | |
rs24480 | TGCAGCCCTGATGGCTCTGCTGAAACATTCTGGACCCGAAACATGAACTCTACTGCCAGTCGGA | 2B | 111.506 | 10.603 | − 0.0823 | 0.0286 | − 0.3229 | − 0.5606 | 0.1143 | − 0.2102 | 0.2909 | 0.7421 | 0.0545 | 0.9517 | 8.08E−09 | – | – | – | – | – | |
rs38828 | TGCAGCTGCCGAGGTACGCTTCGACTTGAAGTTCGAGAATGTGCCCTTGGTGATCCTCTACATG | 4A | 147.563 | 6.9031 | 0.1636 | 0.0899 | 0.2148 | − 0.0506 | − 0.2024 | 0.0791 | − 0.1759 | − 0.1183 | 0.0352 | 0.6152 | 1.79E−05 | – | – | – | – | – | |
rs62825 | TGCAGTTCAAAGGAGTTCAATGGAAAGGGCGGGTCGGGGCTTATAAACAGGTCCTGCCGCTCCT | 7A | 0.5685 | 5.5317 | 0.2002 | 0.0931 | 0.1358 | − 0.1428 | − 0.1489 | − 0.0209 | − 0.1871 | 0.0706 | 0.0282 | 0.4917 | 0.000279 | – | – | – | – | – | |
rs5421 | TGCAGAGAGAAAAAGCACCATCATCAAAGCAATATCCAGCAATAACCAATGAAAAAAAACCCAC | 7D | 71.943 | 14.4177 | 0.3461 | − 0.7677 | 0.1448 | 0.2718 | − 0.1211 | 0.749 | − 0.3697 | − 0.253 | 0.0747 | 1.3038 | 2.24E−12 | – | – | – | – | – | |
Frete | rs20215 | TGCAGCATTATTGCTTGTTATGTCGGTGTCCCTTGCCGCACTAGAGGGTGTTCATGGCGTTTGC | 2B | 67.141 | 13.8519 | -0.0533 | -0.0424 | -0.0708 | 0.0811 | 0.0623 | 0.0422 | 0.0618 | − 0.0809 | 0.0039 | 1.335 | 7.63E−12 | – | TraesCS2B02G461219 | Systemic acquired resistance | Fatty acid binding | |
rs53985 | TGCAGGTGCCGTGGGTGCAAAAACAACTCCTGCAACATTTTCATCCGTGCTTAAAGCTCTAAAT | 7D | 150.625 | 15.0054 | 0.0994 | – | 0.0148 | – | − 0.0453 | – | − 0.0689 | – | 0.0042 | 1.4494 | 6.65E−15 | – | – | – | – | – | |
Breadth | rs2102 | TGCAGAAGCTTGGGATACTTGGAGTATAGCAAAGGGCGGCATGTGTATAAAATTTAGTTTTGTA | 1A | 34.278 | 6.3891 | 0.005 | − 0.1148 | 0.0005 | − 0.191 | 0.0018 | 0.166 | − 0.0072 | 0.1399 | 0.0006 | 0.5544 | 5.06E−05 | – | – | – | – | – |
rs5425 | TGCAGAGAGAAAGAGGGGTTTCGGGCTACGCGGTTAACCGTTGGAGTACCAAACGACCTCCAAA | 2B | 59.184 | 6.0309 | 0.0295 | 0.1095 | 0.0135 | 0.0079 | − 0.0208 | − 0.0358 | − 0.0222 | − 0.0816 | 0.0006 | 0.5229 | 0.000104 | – | – | – | – | – | |
rs33742 | TGCAGCGTGCCTTGCAATGGCGTGCGCAACCTACAAGCAAAATTACACAAGATGCCAAATTACA | 6A | 99.391 | 10.1792 | − 0.0216 | − 0.023 | − 0.0381 | 0.0122 | 0.0212 | − 0.0233 | 0.0384 | 0.034 | 0.0009 | 0.8897 | 1.98E−08 | mrMLM | TraesCS6A02G402300 | Proteolysis | – | – | |
rs35884 | TGCAGCTCACGTAGAAGGAGACCCGACCGACAGCGCGATTCGCAAGACAGTCGACGAGCGCTTT | 1B | 75.148 | 16.9999 | − 0.0001 | 0.0074 | − 0.0001 | − 0.0222 | 0.0004 | 0.0071 | − 0.0002 | 0.0077 | 0 | 3.2964 | 8.08E−15 | – | – | – | – | – | |
rs13332 | TGCAGCACAACAACACAAGATGCGCCACAAAGATTAGAAACAGGCCCGAAAGCAAGCAAGTAAA | 3A | 43.437 | 6.3256 | − 0.0002 | − 0.0086 | − 0.0001 | 0.0004 | 0.0001 | 0.0025 | 0.0002 | 0.0058 | 0 | 1.2014 | 5.75E−05 | – | TraesCS3A02G084300 | snRNA import into nucleus | - | Nucleus, cytoplasm | |
rs7643 | TGCAGATACCCCTACCGACACCATGTCTCTTTCTTACTTATTTCTTAATCAATTGTGATCTTTT | 3B | 22.764 | 39.7108 | 0.0028 | − 0.0007 | − 0.0001 | 0.0022 | − 0.0011 | 0.0005 | − 0.0015 | − 0.002 | 0 | 8.052 | 8.34E−37 | – | – | – | – | – | |
rs10599 | TGCAGATTTTGGGTTTAGTCCCACTTTTAACATTGTAGATTTGAAGCCTTATTTGGGTGACGAG | 3B | 48.935 | 12.9176 | − 0.0008 | − 0.0239 | 0.0002 | 0.0086 | 0.0004 | 0.0071 | 0.0003 | 0.0082 | 0 | 2.485 | 5.73E−11 | – | – | – | – | – | |
rs15115 | TGCAGCACGGAAGGAGAGGGCCTAGCCACCTGGTACACTTGCTGGGGCGCGGGCAGACGCGGAC | 5B | 145.902 | 12.7877 | − 0.0003 | − 0.015 | 0 | 0.0087 | 0.0003 | 0.0037 | 0 | 0.0025 | 0 | 2.4594 | 7.57E−11 | – | – | – | – | – | |
rs31905 | TGCAGCGGCCGCCTAGACACCCAGGGCCAGGGATCTGACGGTTAGCTTTAGCTTAATCGAACGG | 5B | 145.902 | 6.0769 | 0.0004 | − 0.0065 | 0.0003 | 0.0023 | − 0.0005 | 0.002 | − 0.0002 | 0.0022 | 0 | 1.1536 | 9.46E−05 | – | – | – | – | – | |
rs49346 | TGCAGGCTTTCCAGCTGAACAGCTGAATCACAGGGTATACACTGCGTTAGGATTCTTAACTTTG | 7A | 69.63 | 11.3531 | 0 | − 0.0148 | 0.0004 | 0.0133 | − 0.0002 | -0.0018 | − 0.0002 | 0.0034 | 0 | 2.1774 | 1.64E−09 | – | – | – | – | – | |
TKW | rs56585 | TGCAGTACTTGCGTCAAACCGCCTCCATCTTGCACTGCATCAGGGCCTCCGTCTCTGCTGGCGG | 1A | 44.512 | 25.3101 | 0.9973 | 1.4715 | 1.0725 | 1.2849 | − 1.033 | − 1.3295 | − 1.0368 | − 1.427 | 1.0857 | 1.3583 | 8.62E−23 | MLM | – | – | – | – |
rs29065 | TGCAGCGATATCTCCCTACAACAATCAGACGGCTCCACGACGGGATGGCGGCTCCTCCATGAGG | 6A | 25.146 | 11.9358 | − 0.1777 | − 3.2922 | − 0.0918 | -3.3345 | 0.0989 | 3.5286 | 0.1706 | 3.0981 | 0.4988 | 0.624 | 4.71E−10 | – | – | – | – | – | |
rs31607 | TGCAGCGGCAAGAACATGGATTAGTCCTTCCAGGAGACAGAAACCACACGGATCGATCGACCCG | 6A | 99.391 | 15.5325 | − 0.8601 | − 0.0068 | − 0.8259 | 0.0254 | 0.8562 | − 0.0187 | 0.8299 | 0.0001 | 0.6537 | 0.8178 | 1.99E−13 | MLM | TraesCS6A02G399400 | Phosphorelay signal transduction system, signal transduction, response to ethylene | Protein binding, phosphorelay sensor kinase activity, metal ion binding, ethylene binding | Endoplasmic reticulum membrane, integral component of membrane | |
rs56117 | TGCAGTACAATTTTCCCACGGCCACACCAGCCAACCAGGATACAACACAGGTAACGCAAGGAAA | 7A | 63.946 | 14.2069 | − 0.577 | 3.2412 | − 0.5815 | 3.0985 | 0.5794 | − 3.2171 | 0.5791 | − 3.1226 | 0.5963 | 0.746 | 3.54E−12 | – | TraesCS7B02G121200 | Protein complex oligomerization, iron ion homeostasis | ATPase activator activity, chaperone binding | – |
Gene ontology
The markers with the highest significance and pleiotropy were studied in more detail. A total of 10 high-significance markers were identified in well-watered plants, most of which were located on chromosomes 1A, 1D, 2B, 2D, 3B, 4A, and 7A. Genes encoding proteins from MTAs were involved in molecular/biological processes such as metal ion binding, ATP binding, calcium ion binding, DNA binding, positive regulation of protein catabolic process, protein ubiquitination, ionotropic glutamate receptor, ligand-gated ion channel, lipid binding, and transport, protein phosphorylation, protein kinase, oxidation–reduction, and lipid biosynthesis (Table 3). In the rain-fed plants, 10 high-significance markers were identified with the highest pleiotropy, most of those were located on the wheat chromosomes 1A, 1B, 2A, 2D, 3B, 4A, and 6A. Protein-encoded genes from MTAs were responsible for molecular/biological processes such as metal ion binding, Fe ion binding, lipid binding, and transport, oxidation–reduction, lipid biosynthesis, oxidoreductase activity, and DNA-binding transcription factor (Table 3).
Table 3.
No. | Environments | SNP | Sequence | Chromosome | Position | Trait- Index | Transcript ID | Biological process | Molecular process | Cellular component | P-value | QTL reported in previous studies | Identified genes in QTL region in Triticum species |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | Well-watered | rs5540 | TGCAGAGAGGAAACGGTGGCCGTGCCTGATATCTCGGCGTTGGCTCTCGCTTTGAGGCTCCTGA | 1A | 3415 | Concavity, Convexity, Perim | TraesCS1A02G008500 | - | Acyltransferase activity, transferring groups other than amino-acyl groups | – | 0.0000946 | 27 | Gamma gliadin-A1, gamma gliadin-A3, gamma gliadin-A4, and LMW-A2 genes |
2 | rs14628 | TGCAGCACCTCTTTACTGAAGCTAGTGGTACTGCTCGCTGCGTATGATGTGGACCGCCATGGCA | 1A | 31,851 | Rfactor, Roundness, ModRatio | TraesCS1A02G399700 | transcription, DNA-templated, regulation of transcription, DNA-templated | DNA binding | Nucleus | 0.00000146 | 28 | B3 domain-containing protein Os03g0619600-like (LOC123183123) | |
3 | rs15519 | TGCAGCACTCTGCAAGAAAAACGTCAAAGTAAGAACCACCTACCCACATCTGCTCCAATTCAAA | 1D | 47,767 | Feret, Perim,Volume, TKW, CArea, Chull, Area, Circ, PerEquivD, ArEquivD, MBCRadius, ArBBox, EquivEllAr | TraesCS1D02G147800 | Lipid biosynthetic process | Iron ion binding, oxidoreductase activity | Integral component of membrane | 0.0000297551 | 29 | Very-long-chain aldehyde decarbonylase GL1-5-like (LOC123181004) | |
4 | rs2432 | TGCAGAATAAGAATATTAAGTTGATCAACATCCAGATCAACGCGCCCGAGAACAGCCCAAACAC | 2B | 69,414 | Perim.1 | TraesCS2B02G491300 | Carbohydrate metabolic process, cell wall organization | Polygalacturonase activity | Extracellular region | 0.000441697 | 27 | Exopolygalacturonase-like (LOC123042122) | |
5 | rs59777 | TGCAGTCTTTCAGAAGTGCAGATGTAAACGTATTGCTATATCAGTGGTTTGAACTACATGGTAA | 2D | 58,883 | Area.2, Perim.2, Feret, Volume, TKW | TraesCS2D02G152500 | Protein ubiquitination, positive regulation of protein catabolic process | Protein binding | – | 0.0000444241 | 29,30,31,32 | Exopolygalacturonase-like (LOC123042122) | |
6 | rs51999 | TGCAGGGTTCTGGTGGCGATGCCGACCGTCGTCGAGAGGTTCGTGGACTGGGCGGTGTTGTGGC | 3B | 67,127 | ArBBox, EquivEllAr, MBCRadius, Perim, Chull, Carea, Area, Circ, PerEquivD, ArEquivD, Perim.1, MaxR, Frete | TraesCS2D02G480600 | – | – | Integral component of membrane | 0.000248572 | 27,33 | HVA22-like protein i (LOC123103004) | |
7 | rs7487 | TGCAGATAAACAGCATCAGCTCAGTTCACGGATCGATCGAGCATGTAAAATGGCGACAAACAGT | 4A | 61,015 | Solidity | TraesCS4A02G186500 | l-Arabinose metabolic process | Alpha-l-arabinofuranosidase activity | – | 0.000230482 | 29,34 | Alpha-l-arabinofuranosidase 1-like (LOC123086376), transcript variant X2, | |
8 | rs31569 | TGCAGCGGATGGCTTCAACGTGCTGATGCCCAGCAACATAGACACAACCATAGCCGAGATCGGA | 4A | 61,015 | Solidity | TraesCS4A02G187400 | – | Protein binding | Nucleus | 0.000143907 | 29,33 | Protein HEAT STRESS TOLERANT DWD 1-like (LOC123086390) | |
9 | rs52301 | TGCAGGTACGAAACACCGAGGTGCTGCTGCTGCTGATGATGAACATTTTCGCTCCCAAAGGCCG | 7A | 0 | ArBBox.2, Area.2, Breadth, Perim.2, MinR, Concavity, Solidity, Convexity | TraesCS7A02G015000 | Autophagy, protein transport, COPII vesicle coating | – | Golgi membrane, endoplasmic reticulum | 0.000228123 | 28 | Protein transport protein SEC16A homolog (LOC123150363) | |
10 | rs1418 | TGCAGAACGAAAAACAGAGCATGTACTCAGTTTCTTATAATAAAAAGCTTCAAATATCATCAGA | 4A | 152,121 | Concavity, Perim | TraesCS4A02G495100 | Triterpenoid biosynthetic process | Lanosterol synthase activity, intramolecular transferase activity, beta-amyrin synthase activity | Lipid droplet | 0.000313328 0.000179158 | 35 | Cycloartenol Synthase-like (LOC123088646), transcript variant X3 | |
1 | Rain-fed | rs61378 | TGCAGTGGCAGGAGAAATTCTAACGTTTTGTGGCGTGCGATAGCGAGACTGGCGGGAAAGTACC | 1A | 75,213 |
Frete, Perim, Chull, Perim.1, MaxR , MBCRadius |
TraesCS1A02G361100 | Protein-glycosylation | Glycosyltransferase-activity | Integral-component-of-membrane | 0.000660236 | 27 | Glucosamine inositolphosphorylceramide transferase 1 |
2 | rs40522 | TGCAGCTTACGTGCTCCACATTGGTACTCTTGCGCTGATAATCATGAACCGCAAACTCTATGCA | 1B | 45,574 | MaxR, Frete, CHull | TraesCS1B02G176000 | Cytokinin-metabolic-process | Catalytic-activity,-oxidoreductase-activity,-cytokinin-dehydrogenase-activity,-flavin-adenine-dinucleotide-binding,-FAD-binding | Extracellular-space | 0.000485937 | 28 | Cytokinin oxidase/dehydrogenase 8 (CKX8) gene | |
3 | rs39898 | TGCAGCTGGGGCTGTAGTGCCCCAGCGAACGCCCCTGGATGCGGCGAGTACCACGGCAGGGCCG | 1B | 45,574 | MaxR, Chull, Frete, Perim | TraesCS1B02G178900 | – | Kinase-activity | – | 0.000335132 | 27,28 | antifreeze protein Maxi-like (LOC123110907) | |
4 | rs37640 | TGCAGCTCGTCATCACCGCTCGCCCGCCCGCGTGGATGCAGAAGTGCTCGAACGCCGTGCGGAA | 6A | 55,893 | TKW | TraesCS1B02G421100 | Fatty-acid-biosynthetic-process | Acyltransferase-activity,acyltransferase-activity,-transferring-groups-other-than-aminoacyl-groups | Membrane | 0.000503086 | 27,28,28 | 3-ketoacyl-CoA synthase 5-like (LOC123128295) | |
5 | rs45754 | TGCAGGATTTTTATTCAAGTTTGACGTACTTATTTAGCTATATATCCTGATGAATATGGGAAGT | 2A | 52,385 | Aspect.ratio,Compactness,Concavity,ModRatio,Rfactor,Roundness,Shape,Sphericity | TraesCS2A02G115700 | – | Membrane,integral-component-of-membrane | 0.0000000503 | 27 | Protein SRC2-like (LOC123184774) | ||
6 | rs51900 | TGCAGGGTGGGGGCGGAGAAAAAGGAGGAGGGGCGGCCGAGATCGGAAGAGCGGGATCACCGA | 2D | 28,183 | TKW | TraesCS2D02G082900 | Vesiclemediated-transport | Protein-binding | Plasma-membrane,-integral-component-of-membrane | 0.0000429131 | 29,27,28,29 | Syntaxin-binding protein 5-like (LOC123043515) | |
7 | rs16023 | TGCAGCAGAGGTGGTTTGGAGGTTTGGTGGCGGCAGGATTCCCCTCCCGCGGGCGGCTCGGCTC | 3B | 56,892 | TKW | TraesCS3B02G373500 | Auxinactivated-signaling-pathwa,-transmembrane-transport,-intracellular-auxin-transport | Membrane,-integral-component-of-membrane | 0.0000599241 | 27,29 | Protein PIN-LIKES 6-like (LOC123071203) | ||
8 | rs1934 | TGCAGAAGCAGTCCATCCCCACCAACCCAGCCAGCGCCGCCGCAACTACTCCTACGAGCGAAGC | 3B | 114,516 | ArBBox,Area,ArEquivD,Carea,Circ,EquivEllAr,MBCRadius,PerEquivD | TraesCS3B02G581600 | – | Metal-ion-binding | – | 0.000605311 | 27 | PH, RCC1 and FYVE domains-containing protein 1-like (LOC123072730) | |
9 | rs15576 | TGCAGCACTGGAAATTCTGGAGATGTGTAGGTCCAGACATAGTTTCTGTCGTCAATCAACTGTC | 4A | 149,842 |
Perim, Chull, MBCRadius, MaxR , Frete |
TraesCS4A02G494000 | – | Protein-binding | – | 0.000454063 | 27 | F-box protein At4g00755-like (LOC123083653), transcript variant X3 | |
10 | rs15577 | TGCAGCACTGGAAATTCTGGAGATGTGTAGGTCCAGACATAGTTTCTGTCGTCAATTAACCGTC | 4A | 149,842 | ArBBox,ArEquivD,Carea,Chull,EquivEllAr,Frete,MaxR,MBCRadius,Perim | TraesCS4A02G494000 | Protein-binding | – | 0.0000166 | 27,29 | F-box protein At4g00755-like (LOC123083653), transcript variant X3 |
Based on blast gene IDs identified from the wheat reference genome, the following pathways were discovered: metabolic pathways (Supplementary Fig. 3), ubiquitin-mediated proteolysis (Supplementary Fig. 4), oxidative phosphorylation (Supplementary Fig. 5), carbon metabolism (Supplementary Fig. 6), biosynthesis of amino acids (Fig. 7a), pentose phosphate (Supplementary Fig. 7), ascorbate and aldarate metabolism (Fig. 7b), sulfur metabolism (Supplementary Fig. 8), and fatty acid elongation (Supplementary Fig. 9)23–25 (www.kegg.jp/kegg/kegg1.html).
In addition, using RNA-seq data from Rahimi et al.26 the DGEs belonging to the different transcription factor (TFs) families totaled 1,377. In this study, 443 genes encoding transcription factors were identified that showed differential expression between stress and normal treatments, Approximately the same number of TFs were identified among susceptible and tolerant genotypes (356 and 328 TFs, respectively). The difference between 9 and 18 days of water deficit was associated with 250 TFs. As genotype specific TFs, the majority of these TFs belong to the MYB, AP2/ERF-ERF related, MADS-M, B3, and, bHLH classes. There were, however, other TFs that were specific to long and short-term water deficits, including bZIP, C2H2, WRKY, NAC, and MYB. Furthermore, transcriptional regulators such as TAZ, TRAF, SNF2, and mTERF were identified. A summary of identified TFs among the different sets of DEGs in wheat is given in Supplementary Table 4.
Discussion
A total of 298 Iranian wheat accessions including 208 landraces and 90 cultivars were assembled as a natural population for mapping QTLs related to seed traits using GWAS. A high level of variation found in wheat seed traits suggests the potential of GWAS for uncovering QTLs, as reported by Rahimi et al.21.
Most plant populations are structured because of artificial selection, isolation, or, nonrandom mating. As a result, genetic loci may be falsely related to traits when there is no authentic associations15. The possibility of false positives can increase in GWAS if population structure is not suitably accounted for, therefore evaluation of population structure is critical for any association mapping36,37. The panel of Iranian wheat accessions in this study was stratified into three groups. Cultivars made up one group, while landraces made up the other two groups, regardless of their geographic origins. Rahimi et al.21 observed the same groups on these Iranian wheat accessions. This mixture can be derived from grain exchanges between farmers in different local markets throughout the country15. As reported previously21, most Iranian cultivars originated from the International Maize and Wheat Improvement Center and only a small number of the cultivars derived from Iran, suggesting relatively narrow exploitation of native landraces in developing the new/old cultivars. Therefore, Iranian cultivars are suffered from a remarkable genetic bottleneck.
In accordance with previous reports, genome D indicated a low number of SNPs while most SNPs we located on the genomes B and A21,38. A similar situation was also uncovered for the number of marker pairs in LD, i.e., SNPs mapped to the genome B were about four times more common than those located on the genome D. The 3B and 2B chromosomes possess the most significant marker pairs, as reported previously21. The higher variation uncovered in the B and A genomes can be due to two reasons: (i) gene flow from T. turgidum as opposed to its absence from Ae. tauschii to T. aestivum; (ii) the output of older evolutionary history of the genomes B and A relative to genome D39,40. Furthermore, bottleneck impacts have likely happened owing to intense selection in native landraces during breeding schedules and this might lead to further impacts on genome D15. These impacts lead to a decrease in the effective population sizes, which in turn increase the loss of rare alleles in genomes B and A. A higher rate of low-frequency alleles in the D genome indicates a decrease in its allelic variant41. Of the observations in this study, most of the significant markers were present at a distance of less than 10 cM. Marker distances and LD throughout the genomes B and A were much lower than in the D genome. The higher level of linkage across three genomes in wheat cultivars reflects the impact of selection in the breeding history of those Cultivars. Population relatedness, mating systems, genetic drift, mutation, recombination, and selection are major forces affecting LD42–44. The fact that cultivars revealed higher LD in contrast to landraces, particularly in the genome D, is presumably a consequence of selection throughout the time of breeding efforts for key traits45.
A total of 10 and 10 MTAs by mrMLM, 3VmrMLM and MLM methods in the well-watered and rain-fed environments, respectively, were found within coding regions with P-value < 0.001. To remove any false-positive association, the most strongly markers were selected. Some MTAs discovered in this study are in line with previous reports.The GWAS identified 8 MTAs underlying seven putative QTL associated with grain perim on chromosomes 1A27, 1B27,28, 1D29, 2B27, 2D29–32, 3B27,33, and 7A28. Thus, MTA on Ch. 4A has not been reported and they are new for wheat seed perim. Six MTAs for area were found on Ch. 4A, 7A, 3B, 1D, 2D, and 3D. Earlier reports have detected MTAs/QTLs for area on Ch. 4A27,29, 7A28, 5B, 3B27, 1D29, 2D30,31. Therefore, MTAs on Ch. 3D are novel for area. Four MTAs for grain frete were recorded on Ch. 1A, 1B, 1D, and 4A in this study. Earlier research efforts have discovered MTAs/QTLs for frete on wheat Ch. 1A28, 1B28, 1D29, and 4A27. For seed breadth, two MTAs were revealed on Ch. 4A, and 7A. Previous research exhibited that this trait is linked with genomic regions on Ch. 2B, 4A29,33,34, 4B, 6A, and 7A28. The GWAS identified 4 MTAs underlying seven putative QTL associated with grain compactness and solidity on chromosomes 1A27, 2A27, 4A29,33,34, and 7A28. For instance, we detected QTLs on chromosomes 1A, 2A, 2B, 2D, 3A, 4A, 5A, 5B, and 7A for TKW under well-irrigated conditions. These observations agree with previously determined QTLs for TKW46. For rain-fed conditions, we also detected QTLs on chromosomes 1A, 1B, 2B, 2D, 3B, 4A, 5B, 6A, and 6B for TKW. These outputs are in agreement with the report by Ain et al.47 for TKW. Moreover, Gao et al.48 mapped a TKW QTL, namely QTKW.caas-7AL, in various conditions using an F8 population of Chinese spring wheat. Yan et al.22 revealed the TaGW8 gene is associated with seed size in wheat by using GWAS. Breseghello and Sorrells35 revealed a QTL on chromosome 5B that affects seed length, with a moderate impact on seed size, under normal and stress conditions. They also reported QTLs for seed sphericity on 2D, 5B, QTLs for surface on1B, 2B, 4A, and QTLs for volume on 1B, 4A, 5B, 7B in wheat. Ma et al.49 located the TaCYP78A3 gene, encoding cytochrome CYP78A3 P450, on the 7DS, 7BS, and 7AS, related to wheat seed shape and size. The authors demonstrated that silencing the TaCYP78A3 gene could reduce the seed shape and size. Earlier reports have detected MTAs/QTLs for seed traits on Ch. 7D50, 7B51, 5B52, 3B50, 3A52,53, 2D54, 2B50,51,54, 2A50, and 1A51–54. Therefore, MTAs on Ch. 5A, 1B, 6B, and 1D are novel for seed traits.
In the recent study, the flanking sequences of imputed SNPs were identified and aligned versus the RefSeq v2.055. The results indicated that most genes detected are responsible for key biosynthetic pathways. In a closer look, the proteins encoded by these genes are responsible for metal ion binding, peroxidase activity, ATP-binding, DNA-binding, protein kinase activity, enzyme inhibitor activity, etc. Such marker-trait associations have also been uncovered in previous reports56,57. These genes are found in genomic regions, which exhibit strong associations with key seed characteristics, suggesting that the genes can be regarded as favorable target genes for breeding efforts in future programs.
Analysis of RNA sequencing revealed genetic variations among genotypes as well as drought-responsive genes26. Our goal is to identify wheat genes that respond consistently to drought in dry, long-term conditions. Interestingly, we found a significantly higher number of genotype-specific DEGs in the susceptible genotype under normal and stress environments than in the tolerant genotypes, which is consistent with previous findings by Mia et al.58, and Fracasso et al.59 who both found similar expression pattern changes in susceptible materials.
From the gene network, several pathways were discovered in this study. Synthesis and elongation of fatty acids also are useful in response to drought in oats60. Protein phosphorylation contributes to a key role in wheat response to drought conditions61. Peptidase activity, DNA repair, DNA-binding transcription factor activity, and transmembrane transport were possibly responsible for drought tolerance26. Wheat avoids from oxidative stress and maintains cellular functions under drought by non-enzymatic antioxidants (ascorbate, etc.) and ROS scavenging enzymes (SOD, CAT, etc.)62. The role of ubiquitination in metabolic pathways of tea in response to drought has also been proved by Xie et al.63. Such essential roles for the biosynthesis of secondary metabolites have also been reported64. A metabolic pathway that is associated with drought stress tolerance involves genes such as ABA-responsive element-binding factor, sucrose synthase, and sucrose-phosphate synthase in the metabolism of ascorbate and aldarate65. Drought stimulates energy-intensive processes such as osmolyte production and oxidative phosphorylation, as well as increases respiratory rates66. Proline is an amino acid produced by the amino acid pathway. Proline has been linked to a number of osmoprotective properties, such as the ability to regulate humidity and activate genes that produce antioxidizing enzymes that scavenge reactive oxygen species (ROS)67,68. In drought-stressed genotypes, proline levels increased faster and by a greater proportion than those of their sensitive counterparts, emphasizing its importance for drought tolerance breeding. Proline-controlling genes have cumulative effects on proline content69,70. These findings are similar to the previous report21. Oxidative damage is induced by the production of the reactive oxygen species (ROSs), including OH–, O2–, and H2O267. These ROSs in high concentrations are detrimental and degrade photosynthetic pigments, proteins, etc. In the context of osmotic tolerance, crops generate proline osmolyte to adjust water status69,70. Crops also adopt tissue tolerance by using the scavenging system to alleviate ROSs effects. The first enzyme committed to remove ROSs is the superoxide dismutase (SOD), which can dismutate O2– to H2O2. H2O2, in turn, is catalyzed by peroxidase (POD) and catalase (CAT) to O2 and H2O 67,68. Expressed wheat-originated CAT and SOD in Arabidopsis can enhance tolerance to multiple abiotic stimuli, such as high-drought conditions62. APX, GPX, and PPO enzymes are other key components of non-enzymatic scavenging systems in crops68.
Conclusion
Of the current findings, new QTLs were uncovered in the panel of Iranian wheat landraces in multi-environment phenotypic data, i.e., rain-fed (drought) and well-watered (normal). Data from multi-environment, multi-year phenotypic experiments could reveal QTL that are stable across environments. Major QTLs controlling seed traits were uncovered on the genome B (1B and 3B) and chromosomes 1A and 4A. QTL for grain shape traits were identified in chromosome regions in which major QTL or/and genes were detected in previous studies. Using digital image analysis is a non-invasive and inexpensive alternative to trait evaluations.
Methods
Plant materials and experimental conditions
A total of 208 wheat landraces and 90 cultivars (Supplementary Table 5) were analyzed in an alpha-lattice experiment with two repeats during two crop seasons (2018–2019 and 2019–2020) under rain-fed (drought) and well-watered (normal) conditions. In the field, the plots consisted of four rows (1*1 m2) at 0.5 m intervals. The irrigation threshold in the well-watered crops was considered according to 40 mm evaporation from an evaporation pan. The crop coefficient [KC] and reference crop evapotranspiration [ET0 = Epan × Kpan; where Epan is the evaporation depth from the pan surface (40 mm) and Kpan is a pan coefficient (0.8) for each month] were utilized to measure evapotranspiration (ETC = KC × ET0). The irrigation time was determined according to the ratio of the assigned water for 1400 m2 (the cultivation area of 298 genotypes in two repeats) to water discharge (10.8 m3/h). The volume of water needed for each hectare (m3/ha) was determined by the depth of ET0 (mm) multiplied by 10. The wheat cultivated under the rain-fed regime was only exposed to rainfall, the only available water source. The pattern of monthly rainfall for the cropping seasons is presented in Table 4. The authors declare that all study complies with relevant institutional, national, and international guidelines and legislation for plant ethics in the “Methods” section. Samples are provided from the Gene Bank of Agronomy and Plant Breeding Group and these samples are available at USDA with the USDA PI number (Supplementary Table 5). The authors declare that all that permissions or licenses were obtained to collect the wheat plant.
Table 4.
Year | Month | Max Temperature °C | Min Temperature °C | Average Temperature °C | Average rainfall, mm | Average relative humidity | Sunny hours | Evaporation, mm |
---|---|---|---|---|---|---|---|---|
2018–2019 | November | 14.561 | 4.104 | 10.900 | 0.031 | 45.810 | 6.893 | 3.068 |
December | 9.242 | − 0.119 | 4.671 | 1.326 | 60.134 | 5.065 | 0.000 | |
January | 8.406 | − 0.613 | 3.668 | 0.485 | 57.750 | 6.652 | 0.000 | |
February | 7.871 | − 2.254 | 2.536 | 0.965 | 61.429 | 6.868 | 0.000 | |
March | 14.216 | 4.623 | 9.271 | 1.240 | 56.847 | 5.942 | 0.179 | |
April | 21.093 | 9.563 | 15.110 | 1.555 | 49.954 | 6.587 | 4.497 | |
May | 29.229 | 14.261 | 21.935 | 0.710 | 38.722 | 10.435 | 7.377 | |
June | 34.159 | 17.597 | 26.083 | 0.000 | 32.304 | 12.763 | 11.676 | |
2019–2020 | November | 17.080 | 6.383 | 11.520 | 0.021 | 43.479 | 6.960 | 3.189 |
December | 12.303 | 1.652 | 6.671 | 0.152 | 50.419 | 7.226 | 0.000 | |
January | 9.077 | − 0.055 | 4.052 | 0.640 | 54.476 | 6.526 | 0.000 | |
February | 10.739 | 2.039 | 6.464 | 1.094 | 64.755 | 5.829 | 0.000 | |
March | 20.558 | 8.377 | 14.652 | 0.455 | 38.952 | 7.303 | 0.000 | |
April | 19.983 | 7.793 | 13.633 | 1.527 | 51.413 | 7.563 | 6.714 | |
May | 25.513 | 12.061 | 18.432 | 1.841 | 54.907 | 8.287 | 6.161 | |
June | 33.807 | 17.347 | 25.583 | 0.241 | 37.492 | 11.100 | 11.143 |
Month | ET0 (mm) | KC | ETC (mm) | Required water per ha (m3/ha) | Required water for 1377 m2 (m3) | Water discharge (m3/h) | Period of irrigation (h) | |
---|---|---|---|---|---|---|---|---|
2019 and 2020 | March | 40 | 0.92 | 36.8 | 368 | 50.69 | 10.8 | 4.69 |
April | 40 | 1.33 | 53.2 | 532 | 73.28 | 10.8 | 6.79 | |
May | 40 | 1.15 | 46 | 46 | 63.37 | 10.8 | 5.87 | |
June | 40 | 0.58 | 23.2 | 232 | 31.96 | 10.8 | 2.96 |
Digital image analysis
The digital images of wheat seeds were provided by a camera (Canon SX540 HS) equipped with 800 dpi resolution. After imaging, the pictures were analyzed and processed via the Python 3.7 software6,71,72 to evaluate a total of 34 morphometric variables in bread wheat seeds (Table 5; Fig. 8).
Table 5.
Seed mode | Parameter (unit) | Description |
---|---|---|
Dorsal | Perim (mm) | Perimeter, calculated from the centers of the boundary pixel |
Area (mm2) | Area inside the polygon defined by the perimeter | |
MinR (mm) | Radius of the inscribed circle centered at the middle of mass | |
MaxR (mm) | Radius of the enclosing circle centered at the middle of mass | |
Feret (mm) | Largest axis length | |
Breadth (mm) | Largest axis perpendicular to the Feret | |
Chull (mm) | Convex hull or convex polygon calculated from pixel centers | |
CArea (mm2) | Area of the convex hull polygon | |
MBCRadius (mm) | Radius of the minimal bounding circle | |
AspRatio | Aspect ratio = Feret/Breadth | |
Circ | Circularity = 4·π·Area/Perimeter2 | |
Roundness | Roundness = 4·Area/(π·Feret2 ) | |
ArEquivD | Area equivalent diameter = √((4/π)·Area) | |
PerEquivD | Perimeter equivalent diameter = Area/π | |
EquivEllAr | Equivalent ellipse area = (π·Feret·Breadth)/4 | |
Compactness | Compactness = √((4/π)·Area)/Feret | |
Solidity | Solidity = Area/Convex_Area | |
Concavity | Concavity = Convex_Area-Area | |
Convexity | Convexity = Convex_hull/Perimeter | |
Shape | Shape = Perimeter2/Area | |
RFactor | RFactor = Convex_Hull/(Feret·π) | |
ModRatio | Modification ratio = (2·MinR)/Feret | |
Sphericity | phericity = MinR/MaxR | |
ArBBox (mm2) | Area of the bounding box along the feret diameter = Feret·Breadth | |
Rectang | Rectangularity = Area/ArBBox | |
Lateral | Perim.1 (mm) | Perimeter, calculated from the centers of the boundary pixel |
Area.1 (mm2) | Area inside the polygon defined by the perimeter | |
ArBBox.1 (mm2) | Area of the bounding box along the feret diameter = Feret· Thickness | |
Thickness (mm) | Largest axis perpendicular to the Feret | |
Vertical | Perim.2 (mm) | Perimeter, calculated from the centers of the boundary pixel |
Area.2 (mm2) | Area inside the polygon defined by the perimeter | |
ArBBox.2 (mm2) | Area of the bounding box along the feret diameter = Breadth · Thickness | |
Volume (mm3) | Volume = (4/3) π (Feret/2)(Breadth/2)( Thickness/2) | |
TKW (gr) | The weight of one thousand seeds |
GBS and imputation
The establishment and sequencing of the sequence library for the wheat accessions were carried out following the procedure as elucidated by Alipour et al.73. After trimming reads to 64 bp and categorizing them into tags, single-nucleotide polymorphisms (SNPs) were discovered via internal alignments, which permitting for mismatch up to 3 bp. The pipeline UNEAKGBS was utilized for SNP calling, where SNPs with low minor allele frequency < 1% and reads with a low-quality score (< 15) were discarded to keep away from false-positive markers, which are derived from errors in the sequencing process. The imputation was performed according to available allele frequency calculated after accounting for the haplotype phase74 in BEAGLE version 3.3.2. The reference genome W7984 was specified that harboring the highest imputation accuracy75 among four various reference genomes during imputation. The linkage disequilibrium decay of various chromosomes was obtained based on LOESS regression and RStudio, the ggplot2 package76.
Population structure and Kinship matrix
Population structure was assayed in the Iranian wheat landraces and cultivars through STRUCTURE version 2.3.437. A simulation phase consisted of 10,000 steps for K = 1 up to 10 along with an admixture model was used in this study. ΔK was utilized to estimate the most likely number of subpopulations in this study. To measure LD among markers, the expected and observed allele frequencies were exerted in TASSEL version 577. Q-matrix was used as a structural matrix for the association study. A neighbor-joining tree was formed according to a pairwise distance matrix counted in TASSEL77 and visualized using Archaeopteryx to explore the relationships between the Iranian wheat landraces and cultivars.
Genome-wide association study
MLM78, mrMLM79,80 and 3VmrMLM81 approaches were used to estimate the marker effect. IIIVmrMLM82 was used to identify QTN and QEI in this study. The first approach led to the most accurate marker-trait association. The K, Q, and Q + K versions of the MLM approach were utilized to modulate both effects of more diffused relationships (K) among accessions and population structure (Q) via TASSEL. The association mapping for the MLM, mrMLM, and 3VmrMLM models was performed using the package GAPIT and IIIVmrMLM in Rstudio. In the MLM approach, accessions are regarded as a random effect and the relevance among them was transferred by a kinship matrix. The elements in this matrix were utilized as similarities and the resultant clusters were visualized using a UPGMA-based heatmap via the GAPIT package. A Manhattan plot was derived from a comparison scenario using the package GAPIT to explore the association between genotype and phenotype, SNPs were ordered according to their base-pair positions and chromosomes. In the Manhattan plot, the y-axis represented the negative logarithm of P-value derived from the F-test and the x-axis represented the SNP genomic position.
Annotation of genes
Sequences around all significantly associated SNPs were provided from the 90 K SNP database of wheat. These sequences were utilized for the gene annotation via aligning to the IWGSC RefSeq V2.0 (URGI-INRA) using the database gramene (http://www.gramene.org/). The functions of putative genes were discovered via evaluating the pathways including the encoded enzymes. After aligning SNPs sequences to the reference, overlapped genes with the largest identity percentages and blast scores were picked out for further analysis. The ensemble-gramene database was used to extract the molecular functions and biological processes of genes in the gene ontology. Moreover, the sequences of significant SNPs were utilized in the enrichment analysis of gene ontology via KOBAS version 2.0 to test for statistically enriched pathways in the database KEGG (https://www.genome.jp/kegg/; www.kegg.jp/kegg/kegg1.html).
Identification of candidate genes via BLASTn
Identification of gene IDs was based on sequences of genes associated with seed traits (https://plants.ensembl.org/index.html). An analysis of whole CDS sequences of candidate genes was conducted using BLASTn analysis (nucleotide Basic Local Alignment Search Tool) from the National Center for Biotechnology Information (NCBI; https://www.ncbi.nlm.nih.gov/). In this alignment, default settings are used to align these sequences to Triticum aestivume species. An ideal match or high similarity was determined when all queries were covered, the Expect (E) value was zero, and the identity was greater than 99%. In addition, RNA-seq data from Rahimi et al., which were based on the tolerant and sensitive genotypes selected from this field experiment, were used to identify the genes involved in the drought stress path.
Statistical analysis
The descriptive statistics, variance analysis (ANOVA) and correlation of Seed imaging data were performed via SAS version 9.4 and RStudio separately for the two conditions, rain-fed (drought) and well-watered (normal). For advanced linear analysis, the adjusted means were derived from an alpha-lattice experiment using GLM and MLM models. Correlation and box-plot analysis were carried out in RStudio using the corrplot and ggpubr packages to assay the relationship and distribution of wheat seed morphometric traits.
Permission for land study
The authors declare that all land experiments and studies were carried out according to authorized rules.
Supplementary Information
Author contributions
M.R.B. proposed the idea and helped to provide the plant materials, E.R. performed field trial, analyzed the data and wrote draft version of the manuscript, H.A. proposed the idea and helped in the genomic data analysis, M.E.M. and V.M. were involved in designing and conducting the experiment. All authors contributed to revising and editing the manuscript. All authors have read and approved of the final manuscript.
Data availability
The datasets generated and analyzed during the current study are available in the Figshare repository [10.6084/m9.figshare.18774476.v1].
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-022-22607-0.
References
- 1.Rabieyan E, Alipour H. NGS-based multiplex assay of trait-linked molecular markers revealed the genetic diversity of Iranian bread wheat landraces and cultivars. Crop Pasture Sci. 2021;72:173–182. doi: 10.1071/CP20362. [DOI] [Google Scholar]
- 2.Li P, Ma B, Palta JA, Ding T, Cheng Z, Lv G, Xiong Y. Wheat breeding highlights drought tolerance while ignores the advantages of drought avoidance: A meta-analysis. Eur. J. Agron. 2021;122:126196. doi: 10.1016/j.eja.2020.126196. [DOI] [Google Scholar]
- 3.Guo X, Xin Z, Yang T, Ma X, Zhang Y, Wang Z, Lin T. Metabolomics response for drought stress tolerance in Chinese wheat genotypes (Triticum aestivum) Plants. 2020;9:520. doi: 10.3390/plants9040520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Sandeep Varma V, Kanaka Durga K, Keshavulu K. Seed image analysis: Its applications in seed science research. Int. J. Res. Agric. Sci. 2013;1:30–36. [Google Scholar]
- 5.Schmidt J, Claussen J, Wörlein N, Eggert A, Fleury D, Garnett T, Gerth S. Drought and heat stress tolerance screening in wheat using computed tomography. Plant Methods. 2020;16:1–12. doi: 10.1186/s13007-020-00565-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Rabieyan, E., Bihamta, M. R., Moghaddam, M. E., Mohammadi, V., & Alipour, H. Morpho-colorimetric seed traits for the discrimination, classification and prediction of yield in wheat genotypes under rainfed and well-watered conditions. Crop Pasture Sci.73 (2022)
- 7.Williams K, Munkvold J, Sorrells M. Comparison of digital image analysis using elliptic Fourier descriptors and major dimensions to phenotype seed shape in hexaploid wheat (Triticum aestivum L.) Euphytica. 2013;190:99–116. doi: 10.1007/s10681-012-0783-0. [DOI] [Google Scholar]
- 8.Sau S, Ucchesu M, D’hallewin G, Bacchetta G. Potential use of seed morpho-colourimetric analysis for Sardinian apple cultivar characterisation. Comput. Electron. Agric. 2019;162:373–379. doi: 10.1016/j.compag.2019.04.027. [DOI] [Google Scholar]
- 9.Liu Y, Barot S, El-Kassaby YA, Loeuille N. Impact of temperature shifts on the joint evolution of seed dormancy and size. Ecol. Evol. 2017;7:26–37. doi: 10.1002/ece3.2611. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Sabzehzari M, Zeinali M, Naghavi MR. CRISPR-based metabolic editing: Next-generation metabolic engineering in plants. Gene. 2020;759:144993. doi: 10.1016/j.gene.2020.144993. [DOI] [PubMed] [Google Scholar]
- 11.Mwadzingeni L, Shimelis H, Rees DJG, Tsilo TJ. Genome-wide association analysis of agronomic traits in wheat under drought-stressed and non-stressed conditions. PLoS ONE. 2017;12:e0171692. doi: 10.1371/journal.pone.0171692. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Rabbi SMHA, Kumar A, Mohajeri Naraghi S, Simsek S, Sapkota S, Solanki S, Alamri MS, Elias EM, Kianian S, Missaoui A, Mergoum M. Genome-wide association mapping for yield and related traits under drought stressed and non-stressed environments in wheat. Front. Genet. 2021;12:649988. doi: 10.3389/fgene.2021.649988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Gahlaut V, Jaiswal V, Singh S, et al. Multi-locus genome wide association mapping for yield and its contributing traits in hexaploid wheat under different water regimes. Sci. Rep. 2019;9:19486. doi: 10.1038/s41598-019-55520-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Kehel Z, Sanchez-Garcia M, El Baouchi A, Aberkane H, Tsivelikas A, Charles C, Amri A. Predictive characterization for seed morphometric traits for genebank accessions using genomic selection. Front. Ecol. Evol. 2020;8:32. doi: 10.3389/fevo.2020.00032. [DOI] [Google Scholar]
- 15.Alemu A, Feyissa T, Tuberosa R, Maccaferri M, Sciara G, Letta T, Abeyo B. Genome-wide association mapping for grain shape and color traits in Ethiopian durum wheat (Triticum turgidum) Crop J. 2020;8:757–768. doi: 10.1016/j.cj.2020.01.001. [DOI] [Google Scholar]
- 16.Yu LX, Zheng P, Zhang T, Rodringuez J, Main D. Genotyping-by-sequencing-based genome-wide association studies on Verticillium wilt resistance in autotetraploid alfalfa (Medicago sativa L.) Mol. Plant Pathol. 2017;18:187–194. doi: 10.1111/mpp.12389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Mace ES, Tai S, Gilding EK, Li Y, Prentis PJ, Bian L, Wang J. Whole-genome sequencing reveals untapped genetic potential in Africa’s indigenous cereal crop sorghum. Nat. Commun. 2013;4:1–9. doi: 10.1038/ncomms3320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Hwang EY, Song Q, Jia G, Specht JE, Hyten DL, Costa J, Cregan PB. A genome-wide association study of seed protein and oil content in soybean. BMC Genomics. 2014;15:1–12. doi: 10.1186/1471-2164-15-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Wang X, Wang H, Liu S, Ferjani A, Li J, Yan J, Qin F. Genetic variation in ZmVPP1 contributes to drought tolerance in maize seedlings. Nat. Genet. 2016;48:1233–1241. doi: 10.1038/ng.3636. [DOI] [PubMed] [Google Scholar]
- 20.Liu MH, Kang H, Xu Y, Peng Y, Wang D, Gao L, Wang GL. Genome-wide association study identifies an NLR gene that confers partial resistance to Magnaporthe oryzae in rice. Plant Biotechnol. J. 2020;18:1376–1383. doi: 10.1111/pbi.13300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Rahimi Y, Bihamta MR, Taleei A, Alipour H, Ingvarsson PK. Genome-wide association study of agronomic traits in bread wheat reveals novel putative alleles for future breeding programs. BMC Plant Biol. 2019;19:1–19. doi: 10.1186/s12870-019-2165-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Yan X, Zhao L, Ren Y, Dong Z, Cui D, Chen F. Genome-wide association study revealed that the TaGW8 gene was associated with kernel size in Chinese bread wheat. Sci Rep. 2019;9:1–10. doi: 10.1038/s41598-019-38570-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Kanehisa M, Goto S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30. doi: 10.1093/nar/28.1.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kanehisa M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 2019;28:1947–1951. doi: 10.1002/pro.3715. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Kanehisa M, Furumichi M, Sato Y, Ishiguro-Watanabe M, Tanabe M. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 2021;49:545–551. doi: 10.1093/nar/gkaa970. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Rahimi Y, Ingvarsson PK, Bihamta MR, Alipour H, Taleei A, Abadi SKJ. Characterization of dynamic regulatory gene and protein networks in wheat roots upon perceiving water deficit through comparative transcriptomics survey. Front. Plant Sci. 2021;12:867. doi: 10.3389/fpls.2021.710867. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Campbell KG, Bergman CJ, Gualberto DG, Anderson JA, Giroux MJ, Hareland G, Finney PL. Quantitative trait loci associated with kernel traits in a soft× hard wheat cross. Crop Sci. 1999;39:1184–1195. doi: 10.2135/cropsci1999.0011183X003900040039x. [DOI] [Google Scholar]
- 28.Aoun M, Carter AH, Ward BP, Morris CF. Genome-wide association mapping of the ‘super-soft’kernel texture in white winter wheat. Theor. Appl. Genet. 2021;134:2547–2559. doi: 10.1007/s00122-021-03841-y. [DOI] [PubMed] [Google Scholar]
- 29.Tekeu H, Ngonkeu EL, Bélanger S, Djocgoué PF, Abed A, Torkamaneh D, Belzile F. GWAS identifies an ortholog of the rice D11 gene as a candidate gene for grain size in an international collection of hexaploid wheat. Sci. Rep. 2021;11:1–13. doi: 10.1038/s41598-021-98626-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Touzy G, Rincent R, Bogard M, Lafarge S, Dubreuil P, Mini A, Praud S. Using environmental clustering to identify specific drought tolerance QTLs in bread wheat (T. aestivum L) Theor. Appl. Genet. 2019;132:2859–2880. doi: 10.1007/s00122-019-03393-2. [DOI] [PubMed] [Google Scholar]
- 31.Gahlaut V, Jaiswal V, Singh S, Balyan HS, Gupta PK. Multi-locus genome wide association mapping for yield and its contributing traits in hexaploid wheat under different water regimes. Sci. Rep. 2019;9:1–15. doi: 10.1038/s41598-019-55520-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Bhatta M, Morgounov A, Belamkar V, Baenziger PS. Genome-wide association study reveals novel genomic regions for grain yield and yield-related traits in drought-stressed synthetic hexaploid wheat. Int. J. Mol. Sci. 2018;19:3011. doi: 10.3390/ijms19103011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Williams K, Sorrells ME. Three-dimensional seed size and shape QTL in hexaploid wheat (Triticum aestivum L.) populations. Crop Sci. 2014;54:98–110. doi: 10.2135/cropsci2012.10.0609. [DOI] [Google Scholar]
- 34.Muhammad A, Hu W, Li Z, Li J, Xie G, Wang J, Wang L. Appraising the genetic architecture of kernel traits in hexaploid wheat using GWAS. Int. J. Mol. Sci. 2020;21:5649. doi: 10.3390/ijms21165649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Breseghello F, Sorrells ME. QTL analysis of kernel size and shape in two hexaploid wheat mapping populations. Field Crops Res. 2007;101:172–179. doi: 10.1016/j.fcr.2006.11.008. [DOI] [Google Scholar]
- 36.Alipour H, Abdi H, Rahimi Y, Bihamta MR. Dissection of the genetic basis of genotype-by-environment interactions for grain yield and main agronomic traits in Iranian bread wheat landraces and cultivars. Sci. Rep. 2021;11:1–17. doi: 10.1038/s41598-021-96576-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945–959. doi: 10.1093/genetics/155.2.945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Berkman PJ, Visendi P, Lee HC, Stiller J, Manoli S, Lorenc MT, Edwards D. Dispersion and domestication shaped the genome of bread wheat. Plant Bioethanol. J. 2013;11:564–571. doi: 10.1111/pbi.12044. [DOI] [PubMed] [Google Scholar]
- 39.Jordan KW, Wang S, Lun Y, Gardiner LJ, MacLachlan R, Hucl P, Akhunov E. A haplotype map of allohexaploid wheat reveals distinct patterns of selection on homoeologous genomes. Genome Biol. 2015;16:1–18. doi: 10.1186/s13059-015-0606-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Dvorak J, Akhunov ED, Akhunov AR, Deal KR, Luo MC. Molecular characterization of a diagnostic DNA marker for domesticated tetraploid wheat provides evidence for gene flow from wild tetraploid wheat to hexaploid wheat. Mol. Biol. Evol. 2006;23:1386–1396. doi: 10.1093/molbev/msl004. [DOI] [PubMed] [Google Scholar]
- 41.Chao S, Zhang W, Akhunov E, Sherman J, Ma Y, Luo MC, Dubcovsky J. Analysis of gene-derived SNP marker polymorphism in US wheat (Triticum aestivum L.) cultivars. Mol. Breed. 2009;23:23–33. doi: 10.1007/s11032-008-9210-6. [DOI] [Google Scholar]
- 42.Sabzehzari M, Zeinali M, Naghavi MR. Alternative sources and metabolic engineering of Taxol: Advances and future perspectives. Biotechnol. Adv. 2020;43:107569. doi: 10.1016/j.biotechadv.2020.107569. [DOI] [PubMed] [Google Scholar]
- 43.Liu H, Zhou H, Wu Y, Li X, Zhao J, Zuo T, Pan G. The impact of genetic relationship and linkage disequilibrium on genomic selection. PLoS ONE. 2015;10:e0132379. doi: 10.1371/journal.pone.0132379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Yan J, Shah T, Warburton ML, Buckler ES, McMullen MD, Crouch J. Genetic characterization and linkage disequilibrium estimation of a global maize collection using SNP markers. PLoS ONE. 2009;4:e8451. doi: 10.1371/journal.pone.0008451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Abdullaev AA, Salakhutdinov IB, Egamberdiev SS, Khurshut EE, Rizaeva SM, Ulloa M, Abdurakhmonov IY. Genetic diversity, linkage disequilibrium, and association mapping analyses of Gossypium barbadense L. germplasm. PLoS ONE. 2017;12:e0188125. doi: 10.1371/journal.pone.0188125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Groos C, Robert N, Bervas E, Charmet G. Genetic analysis of grain protein-content, grain yield and thousand-kernel weight in bread wheat. Theor. Appl. Genet. 2003;106:1032–1040. doi: 10.1007/s00122-002-1111-1. [DOI] [PubMed] [Google Scholar]
- 47.Ain QU, Rasheed A, Anwar A, Mahmood T, Imtiaz M, He Z, Quraishi UM. Genome-wide association for grain yield under rainfed conditions in historical wheat cultivars from Pakistan. Front. Plant Sci. 2015;6:743. doi: 10.3389/fpls.2015.00743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Gao F, Wen W, Liu J, Rasheed A, Yin G, Xia X, He Z. Genome-wide linkage mapping of QTL for yield components, plant height and yield-related physiological traits in the Chinese wheat cross Zhou 8425B/Chinese Spring. Front. Plant Sci. 2015;6:1099. doi: 10.3389/fpls.2015.01099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Ma M, Wang Q, Li Z, Cheng H, Li Z, Liu X, Zhao H. Expression of TaCYP78A3, a gene encoding cytochrome P450 CYP78A3 protein in wheat (Triticum aestivum L.), affects seed size. Plant J. 2015;83(2):312–325. doi: 10.1111/tpj.12896. [DOI] [PubMed] [Google Scholar]
- 50.Sukumaran S, Lopes M, Dreisigacker S, Reynolds M. Genetic analysis of multi-environmental spring wheat trials identifies genomic regions for locus-specific trade-offs for grain weight and grain number. Theor. Appl. Genet. 2018;131:985–998. doi: 10.1007/s00122-017-3037-7. [DOI] [PubMed] [Google Scholar]
- 51.Neumann K, Kobiljski B, Dencic S, Varshney RK, Börner A. Genome-wide association mapping: A case study in bread wheat (Triticum aestivum L.) Mol. Breed. 2011;27:37–58. doi: 10.1007/s11032-010-9411-7. [DOI] [Google Scholar]
- 52.Sun C, Zhang F, Yan X, Zhang X, Dong Z, Cui D, Chen F. Genome-wide association study for 13 agronomic traits reveals distribution of superior alleles in bread wheat from the Yellow and Huai Valley of China. Plant Biotechnol. J. 2017;15:953–969. doi: 10.1111/pbi.12690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Lozada DN, Mason RE, Babar MA, Carver BF, Guedira GB, Merrill K, Arguello MN, Acuna A, Vieira L, Holder A, Dreisigacker S. Association mapping reveals loci associated with multiple traits that affect grain yield and adaptation in soft winter wheat. Euphytica. 2017;213:1–15. doi: 10.1007/s10681-017-2005-2. [DOI] [Google Scholar]
- 54.Ogbonnaya FC, Rasheed A, Okechukwu EC, Jighly A, Makdis F, Wuletaw T, Hagras A, Uguru MI, Agbo CU. Genome-wide association study for agronomic and physiological traits in spring wheat evaluated in a range of heat prone environments. Theor. Appl. Genet. 2017;130:1819–1835. doi: 10.1007/s00122-017-2927-z. [DOI] [PubMed] [Google Scholar]
- 55.Appels R, Eversole K, Feuille C, Keller B, Rogers J, Stein N. The international wheat genome sequencing consortium (IWGSC): Shifting the limits in wheat research and breeding using a fully annotated reference genome. Science. 2018;361:10–1126. doi: 10.1126/science.aar7191. [DOI] [PubMed] [Google Scholar]
- 56.Sukumaran S, Dreisigacker S, Lopes M, Chavez P, Reynolds MP. Genome-wide association study for grain yield and related traits in an elite spring wheat population grown in temperate irrigated environments. Theor. Appl. Genet. 2015;128:353–363. doi: 10.1007/s00122-014-2435-3. [DOI] [PubMed] [Google Scholar]
- 57.Neumann K, Kobiljski B, Denčić S, Varshney RK, Börner A. Genome-wide association mapping: A case study in bread wheat (Triticum aestivum L.) Mol. Breed. 2011;27:37–58. doi: 10.1007/s11032-010-9411-7. [DOI] [Google Scholar]
- 58.Mia MS, Liu H, Wang X, Zhang C, Yan G. Root transcriptome profiling of contrasting wheat genotypes provides an insight to their adaptive strategies to water deficit. Sci. Rep. 2020;10:1–11. doi: 10.1038/s41598-020-61680-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Fracasso A, Trindade LM, Amaducci S. Drought stress tolerance strategies revealed by RNA-Seq in two sorghum genotypes with contrasting WUE. BMC Plant Biol. 2016;16:1–18. doi: 10.1186/s12870-016-0800-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Sánchez-Martín J, Canales FJ, Tweed JK, Lee MR, Rubiales D, Gómez-Cadenas A, Prats E. Fatty acid profile changes during gradual soil water depletion in oats suggests a role for jasmonates in coping with drought. Front. Plant Sci. 2018;9:1077. doi: 10.3389/fpls.2018.01077. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Luo F, Deng X, Liu Y, Yan Y. Identification of phosphorylation proteins in response to water deficit during wheat flag leaf and grain development. Bot. Stud. 2018;59:1–17. doi: 10.1186/s40529-018-0245-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Sun J, Qiu C, Ding Y, Wang Y, Sun L, Fan K, Ding Z. Fulvic acid ameliorates drought stress-induced damage in tea plants by regulating the ascorbate metabolism and flavonoids biosynthesis. BMC Genomics. 2020;21:1–13. doi: 10.1186/s12864-020-06815-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Xie H, Wang Y, Ding Y, Qiu C, Sun L, Gai Z, Ding Z. Global ubiquitome profiling revealed the roles of ubiquitinated proteins in metabolic pathways of tea leaves in responding to drought stress. Sci. Rep. 2019;9:1–12. doi: 10.1038/s41598-019-41041-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Jogawat A, Yadav B, Lakra N, Singh AK, Narayan OP. Crosstalk between phytohormones and secondary metabolites in the drought stress tolerance of crop plants: A review. Physiol. Plant. 2021;172:1106–1132. doi: 10.1111/ppl.13328. [DOI] [PubMed] [Google Scholar]
- 65.Wu KC, Wei LP, Huang CM, Wei YW, Cao HQ, Xu L, Li YR. Transcriptome reveals differentially expressed genes in Saccharum spontaneum GX83–10 leaf under drought stress. Sugar Tech, 2018;20:756–764. doi: 10.1007/s12355-018-0608-0. [DOI] [Google Scholar]
- 66.Florez-Sarasa ID, Bouma TJ, Medrano H, Azcon-Bieto J, Ribas-Carbo M. Contribution of the cytochrome and alternative pathways to growth respiration and maintenance respiration in Arabidopsis thaliana. Physiol. Plant. 2007;129:143–151. doi: 10.1111/j.1399-3054.2006.00796.x. [DOI] [Google Scholar]
- 67.de Carvalho K, de Campos MKF, Domingues DS, Pereira LFP, Vieira LGE. The accumulation of endogenous proline induces changes in gene expression of several antioxidant enzymes in leaves of transgenic Swingle citrumelo. Mol. Biol. Rep. 2013;40:3269–3279. doi: 10.1007/s11033-012-2402-5. [DOI] [PubMed] [Google Scholar]
- 68.Zadehbagheri M, Azarpanah A, Javanmardi S. Proline metabolite transport an efficient approach in corn yield improvement as response to drought conditions. Nature. 2014;566:76–485. [Google Scholar]
- 69.Maleki A, Jalal S, Shekari F. Inheritance of proline content in bread wheat (Triticum aestivum L.) under rainfed conditions. J. Food Agric. Environ. 2010;8:155–157. [Google Scholar]
- 70.Mwadzingeni L, Figlan S, Shimelis H, Mondal S, Tsilo TJ. Genetic resources and breeding methodologies for improving drought tolerance in wheat. J. Crop Improv. 2017;31:648–672. doi: 10.1080/15427528.2017.1345816. [DOI] [Google Scholar]
- 71.Rabieyan E, Bihamta MR, Moghaddam ME, Mohammadi V, Alipour H. Imaging-based screening of wheat seed characteristics towards distinguishing drought-responsive Iranian landraces and cultivars. Crop Pasture Sci. 2022;73:337–355. doi: 10.1071/CP21500. [DOI] [Google Scholar]
- 72.Rabieyan E, Bihamta MR, Moghaddam ME, Mohammadi V, Alipour H. Genome-wide association mapping and genomic prediction for pre-harvest sprouting resistance, low α-amylase and seed color in Iranian bread wheat. BMC Plant Biol. 2022;22:1–23. doi: 10.1186/s12870-022-03628-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Alipour H, Bihamta MR, Mohammadi V, Peyghambari SA, Bai G, Zhang G. Genotyping-by-sequencing (GBS) revealed molecular genetic diversity of Iranian wheat landraces and cultivars. Front. Plant Sci. 2017;8:1293. doi: 10.3389/fpls.2017.01293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Browning BL, Browning SR. A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. Am. J. Hum. Genet. 2009;84:210–223. doi: 10.1016/j.ajhg.2009.01.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Alipour H, Bai G, Zhang G, Bihamta MR, Mohammadi V, Peyghambari SA. Imputation accuracy of wheat genotyping-by-sequencing (GBS) data using barley and wheat genome references. PLoS ONE. 2019;14:e0208614. doi: 10.1371/journal.pone.0208614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Team, R. RStudio: Integrated Development for R. RStudio, Inc. http://www.rstudio.com (2015).
- 77.Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007;23:2633–2635. doi: 10.1093/bioinformatics/btm308. [DOI] [PubMed] [Google Scholar]
- 78.Lipka AE, Tian F, Wang Q, Peiffer J, Li M, Bradbury PJ, Zhang Z. GAPIT: Genome association and prediction integrated tool. Bioinformatics. 2012;28:2397–2399. doi: 10.1093/bioinformatics/bts444. [DOI] [PubMed] [Google Scholar]
- 79.Zhang YW, Tamba CL, Wen YJ, Li P, Ren WL, Ni YL, Zhang YM. mrMLM v.4 0.2: An R platform for multi-locus genome-wide association studies. Genom. Proteom. Bioinform. 2020;18:481–487. doi: 10.1016/j.gpb.2020.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Wang SB, Feng JY, Ren WL, Huang B, Zhou L, Wen YJ, Zhang YM. Improving power and accuracy of genome-wide association studies via a multi-locus mixed linear model methodology. Sci. Rep. 2016;6:1–10. doi: 10.1038/srep19444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Li M, Zhang YW, Zhang ZC, Xiang Y, Liu MH, Zhou YH, Zhang YM. A compressed variance component mixed model for detecting QTNs and QTN-by-environment and QTN-by-QTN interactions in genome-wide association studies. Mol. Plant. 2022;15:630–650. doi: 10.1016/j.molp.2022.02.012. [DOI] [PubMed] [Google Scholar]
- 82.Li M, Zhang YW, Xiang Y, Liu MH, Zhang YM. IIIVmrMLM: The R and C++ tools associated with 3VmrMLM, a comprehensive GWAS method for dissecting quantitative traits. Mol. Plant. 2022;15:1251–1253. doi: 10.1016/j.molp.2022.06.002. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets generated and analyzed during the current study are available in the Figshare repository [10.6084/m9.figshare.18774476.v1].