Abstract
Shading in combination with extended photoperiods can cause exaggerated stem elongation (ESE) in soybean, leading to lodging and reduced yields when planted at high-density in high-latitude regions. However, the genetic basis of plant height in adaptation to these regions remains unclear. Here, through a genome-wide association study, we identify a plant height regulating gene on chromosome 13 (PH13) encoding a WD40 protein with three main haplotypes in natural populations. We find that an insertion of a Ty1/Copia-like retrotransposon in the haplotype 3 leads to a truncated PH13H3 with reduced interaction with GmCOP1s, resulting in accumulation of STF1/2, and reduced plant height. In addition, PH13H3 allele has been strongly selected for genetic improvement at high latitudes. Deletion of both PH13 and its paralogue PHP can prevent shade-induced ESE and allow high-density planting. This study provides insights into the mechanism of shade-resistance and offers potential solutions for breeding high-yielding soybean cultivar for high-latitude regions.
Subject terms: Plant breeding, Natural variation in plants, Light responses, Agricultural genetics
How plant height is adaptive to high latitudes and high density planting is unclear. Here, the authors report a retrotransposon insertion in a WD40 protein encoding gene PH13 affects its interaction with GmCOP1s and increases the abundance of STF1/2, leading to reduced soybean height and increased shade resistance.
Introduction
Soybean (Glycine max (L.) Merr.) is an economically important crop, accounting for 59% of the oilseed production and providing 70% of the plant protein for human and animal consumption worldwide (SoyStats, 2023)1. To meet the demands of an ever-growing population and to continuously improve the living standards, it is estimated that the global soybean yield must be doubled by 20502,3. Soybean is a short-day plant (SDP) that originated in the temperate regions of China (between 32 and 40°N)4. However, traditional soybean varieties are restricted to narrow range of latitudes due to their high photoperiod sensitivity, resulting in traits such as maturity time and plant architecture being affected5,6. The introduction of new genetic resources has enabled the expansion of cultivation worldwide. In particular, the long juvenile trait (LJ) has been successfully utilized since the 1970s for the breeding of cultivars suitable for lower latitudes (below 22°), making Brazil the largest producer of soybean in the world7–9.
High latitudes are also essential areas for soybean production. In 2021, China’s three northeastern provinces (Heilongjiang, Jilin, and Liaoning at latitudes of over 40°N) accounted for 48.71% of the country’s total soybean production (NBSC, 2022). However, the worldwide cultivation of soybean at high latitudes (over 40°N) only produced less than 20% of the total yield, suggesting a great potential to boost global yield by cultivation of soybean in these regions including the far Russia East, Canada, the Northern United States, and the Northeast of China. High latitudes have long-day photoperiods which can induce an extended maturity period and excessive stem growth, impeding the adaptation of soybean varieties to their short frost-free farm seasons. Several flowering time genes, including the early flowering locus (E1-E4)10–13, Tof514, two homologs of PSEUDO-RESPONSE-REGULATOR3, Tof11 and Tof1215–17, have been reported to be utilized for breeding of soybean varieties suitable for planting in northern regions of China. However, much less is known about the genetic basis of plant height essential for soybean adaptation to high latitudes.
Plant height in soybean is determined by the number of nodes and the length of the internodes, and is highly sensitive to the variation in the light conditions18. For instance, long-day photoperiods at high latitude and low blue light (LBL) under high-density planting and intercropping conditions can induce exaggerated stem elongation (ESE) syndrome, decrease the mechanical strength of the stem, result in lodging and increase susceptibility to insects and pathogens6,19,20. The above situations seriously constrain the improvement of soybean yield by high-density planting in high-latitude regions. It has been revealed that the blue light receptor GmCRY1s repress the LBL-induced ESE syndrome by stabilizing the bZIP transcription factors STF1 and STF221, and modulating the components of the GmCRY1-mediated signaling pathway can improve shade-tolerant traits and improve yield in soybean22,23.
In this study, we identify a plant height determining gene PH13, which has been selected in breeding soybean cultivars suitable for high latitudes. We show that a Ty1/Copia-like retrotransposon causes a truncation in PH13H3 protein, resulting in a weaker interaction with GmCOP1a and GmCOP1b, which consequently leads to accumulation of bZIP transcription factors STF1/2 and a short plant stature. We knock out both PH13 and its paralogous gene PHP to generate a ph13php double (phd) mutant that displays an ideal type of shade-resistance. Our findings provide insights into the genetic basis for the breeding of modern soybeans suitable for high latitudes and offer alleles to improve soybean yield through high-density planting.
Results
Identification of plant height loci in soybean
To identify genetic loci that can attenuate the extent of ESE, we phenotyped a collection of 2214 previously genotyped soybean accessions24 for the plant height trait in a normal cultivating system in ten locations over two or three years. The best linear unbiased prediction (BLUP) for each accession was estimated to support association studies (Supplementary Data 1). We conducted a Genome-wide Association Study (GWAS) using 540 spring-sown improved cultivars with 3.47 million SNPs (minor allele frequency >5%) and a Transcriptome-wide Association Study (TWAS) for 488 soybean accessions (Supplementary Fig. 1a) to identify SNPs using FarmCPU25 and genes associated with plant height. The GWAS identified eleven plant height loci, while the TWAS identified seven genes (Supplementary Data 2, 3). Of these, fourteen have not been reported previously as soybean height loci and three were refined loci associated with known candidate genes (E2, Dt1 and Dt2)11,26,27, validating the effectiveness of our strategy to find targeting genes (Fig. 1a, b). Only one gene (Glyma.13G276700) located on chromosome 13 was identified by both GWAS and TWAS. Hence, we named this candidate gene as Plant Height 13 (PH13) for further analysis. The surrounding genetic region of PH13 has repeatedly been identified as a plant height-associated QTL in previous studies28–32, supporting a role of PH13 in regulating plant height.
The leading single nucleotide polymorphism (SNP), Gm13-37757704, identified by GWAS has a physical distance of 55,493 bp from PH13 and a linkage disequilibrium R2 of 0.86 with the exonic SNP (Gm13-37816013) within PH13 (Fig. 1a and Supplementary Fig. 1b). The TWAS identified the sixth exon of PH13 within this locus (Fig. 1b). Since FarmCPU considers associated markers as covariants, which can lead to false negatives of neighboring coexpressed genes of PH13 in TWAS, we also used a mixed linear model to perform TWAS, which confirmed that PH13 is the only gene within this locus that displayed an association signal (Supplementary Fig. 1c).
A homologue of SUPPRESSOR OF PHYA is the candidate of PH13 gene
The PH13 gene encoding a WD40 protein which is likely homologous to the suppressor of the phyA-105 (SPA) family protein in Arabidopsis33. Phylogenetic analysis indicated that PH13 is grouped closely with SPA3/4 protein (Supplementary Fig. 2a and Supplementary Data 4). We then transformed the 35S::PH13-3×Flag construct into the spa134 mutant34. The phenotypic results indicated that ectopic expression of PH13 can at least partially rescued the dwarf phenotype of the spa134 mutant at both seedling and adult vegetative stage, supporting that PH13 is homologous to the SPA family proteins (Supplementary Fig. 2b–e).
The TWAS analysis of 488 accessions revealed two forms of PH13 transcripts (Supplementary Fig. 3), one of which had no detectable expression from shortly after the beginning of the fifth exon to the 3’ end (Fig. 1c). To understand the cause of this difference, we examined the coding DNA sequence (CDS) of PH13 in 1254 accessions by re-sequencing and PCR assay (Supplementary Fig. 4a, b). Besides the exonic SNP (Gm13-37816013), a 5404 bp fragment containing two 431 bp long terminal repeats (LTRs) and a 3984 bp open reading frame (ORF) encoding Gag-protease-integrase-RT-RNaseH domains was found at the start of the fifth exon. This fragment belongs to the Ty1/Copia-like retrotransposon (Fig. 1d, Supplementary Fig. 5 and Supplementary Data 5, 6).
Three main haplotypes (PH13H1-PH13H3) were detected in the PH13 CDS of the 1254 accessions based on the SNPs and insertion fragment variation (Table 1). The haplotype 3 (PH13H3) harbors the Ty1/Copia-like retrotransposon insertion, which was predicted to produce a truncated PH13 protein (742 amino acids) lacking the 3’ part of the WD40 domain (Supplementary Fig. 6). This hypothesis was confirmed by immunoblot results showing that the molecular weight of PH13H3−3×Flag was about 10 kDa lower than that of PH13H1−3×Flag or PH13H2 −3×Flag when ectopically expressed in tobacco leaves (Supplementary Fig. 7). Moreover, PH13H3 accessions were found to have significantly reduced plant height compared to accessions carrying PH13H1 and PH13H2 under field conditions in ten field locations over two or three years (Fig. 1e and Supplementary Data 7), suggesting that the retrotransposon insertion in PH13H3 is responsible for the reduction in plant height.
Table 1.
Haplotypes | Position in PH13 | |
---|---|---|
2444 | 3411 | |
PH13H1 | A | – |
PH13H2 | G | – |
PH13H3 | G | 5404 bp insertion |
The haplotype analysis of PH13 was conducted using a core collection of 1254 soybean accessions (Supplementary Data 7). The haplotype 3 of PH13 (PH13H3) harbors a 5404 bp retrotransposon insertion.
Genetic validation of PH13 function in regulating plant height
Tissue-specific expression analysis showed that PH13 was expressed at a higher level in the shoots and stems compared to other tested tissues, including roots, cotyledons, unifoliate and trifoliate leaves, and flowers, supporting PH13 is involved in regulation of plant height (Supplementary Fig. 8). To investigate its biological role in soybean, two independent gRNAs targeting the first exon were used to generate loss-of-function mutants in the Williams 82 background (W82H1, harboring PH13H1) using CRISPR/Cas9 technology. Three independent mutants, ph13-1, ph13-2, and ph13-3, were identified with base deletion or/and insertion (Supplementary Fig. 9). The phenotypic results demonstrated that the loss-of-function of PH13 caused a significant dwarf phenotype with a more than 30% reduction in plant height compared to the wild type (WT) W82H1 (Fig. 2a, b and Supplementary Fig. 10a, b). Additionally, we generated three independent PH13H1−3×Flag overexpression lines (H1-OE1/TL1H3, H1-OE2/TL1H3 and H1-OE3/TL1H3) in the TL1H3 background, which exhibited a significant increase in plant height (Fig. 2a and Supplementary Fig. 10d–f). The reduction in plant height of the ph13 mutants was mainly due to a decrease in internode length (Fig. 2b and Supplementary Fig. 11) which was correlated with the reduced cell length (Supplementary Fig. 12). Moreover, the ph13 mutants showed 5 days earlier flowering time and 2 fewer nodes than W82H1, which also contributed to the reduction in plant height (Supplementary Figs. 10c and 11b).
To assess the functionality of PH13H3, three independent mutants were generated using CRISPR-Cas9 technology in a Tianlong 1 backgrounds (TL1H3, harboring PH13H3 with the retrotransposon insertion). Phenotypic analysis revealed that the mutants had a slightly reduced plant height of around 10% compared to the WT TL1H3 (Fig. 2a, b), indicating that the PH13H3 retained some of its ability to promote plant height. In addition, near-isogenic lines (NILs) were created by crossing the W82H1 cultivar (with an average plant height of 110 cm) with the TL1H3 cultivar (with an average plant height of 80 cm) (Supplementary Fig. 13). The resulting NILH3 plants had a significantly lower plant height than that of NILH1 (Fig. 2c). Collectively, these results demonstrate that PH13 functions as a plant height enhancer by promoting stem elongation and increasing node number, and the Ty1/Copia-like retrotransposon insertion had partially comprised the function of PH13H3 in regulating plant height.
PH13H3 was artificially selected for the improvement of soybean at high latitude
Given that plant height is one of the essential agronomic traits determining yield35–37, we investigated whether PH13 alleles have been utilized during soybean domestication and improvement. We analyzed the frequency and geographic distribution of the three main haplotypes among 121 accessions of G. soja, 715 landraces, and 418 improved cultivars, all with known genotypes and origin sites around the world (Supplementary Data 7). We found that G. soja predominantly contain PH13H1 (99.2%) with a small amount of PH13H2 (Fig. 3a). Meanwhile, PH13H3 was absent in G. soja but occurred at a low frequency of 1.4% in landraces (Fig. 3a). Interestingly, the proportion of PH13H3 accessions among all improved cultivars increased to 32.7%, suggesting that the PH13H3 allele was subject to intensive artificial selection during genetic improvement.
The increase in the proportion of PH13H3 in improved cultivars compared to landraces prompted us to investigate its geographical distribution in the world. We found that a large proportion of the PH13H3 accessions were concentrated in high-latitude regions, while the other two haplotypes were dispersed more evenly (Fig. 3b). This geographical bias was further analyzed in China where the proportion of PH13H3 among all cultivars increased from near zero in the lower latitude regions (below 40°N) to 48.7% in the higher latitude regions (above 40°N) (Supplementary Fig. 14a–d). Out of 78 cultivars carrying PH13H3, 76 were located in higher latitude regions (Supplementary Fig. 14e), demonstrating that the PH13H3 allele has been used successfully in breeding programs to improve soybean adaptation in high latitudes.
The protein products of PH13H3 have reduced binding affinity with two GmCOP1 E3 ubiquitin ligases
PH13, being a SPA homologous protein in soybean, possesses a conserved kinase domain, coiled-coil domain, and WD40 domain, while the protein products of PH13H3 lacks the intact WD40 domain (Supplementary Figs. 6 and 7). In order to assess the impact of the retrotransposon insertion on PH13’s function, we conduced RT-qPCR analysis on W82H1 and TL1H3 cultivars. The results revealed that the insertion did not affect the transcription of the sequence before the insertion site, but abolished the transcription of the carboxyl terminus (Fig. 4a, b), which is consistent with the expression patterns observed in natural population (Fig. 1c). Further analysis of the subcellular location of different PH13 haplotypes showed that the retrotransposon did not alter the location of the truncated PH13H3 protein in both nucleus and cytoplasm (Supplementary Fig. 15).
The SPA family is known to suppress photomorphogenesis, as part of a complex with COP1, serving as E3 ubiquitin ligases to target multiple transcription factors for degradation in Arabidopsis38–40. A previous study showed that two soybean COP1 orthologs, GmCOP1a and GmCOP1b, play a pivotal role in controlling plant height in soybean22. Here we found that the diurnal transcription pattern of PH13 is in line with that of GmCOP1a and GmCOP1b in soybean, peaking at dawn and declining at dusk (Fig. 4c), suggesting that PH13 may act as an evolutionarily conserved factor to form a complex with these GmCOP1s to degrade target proteins in soybean. This prompted us to investigate whether the retrotransposon affects the interaction of PH13 with GmCOP1s. Yeast-two hybrid (Y2H) and Co-Immunoprecipitation (Co-IP) experiments revealed that these GmCOP1s strongly interact with PH13H1 and with PH13H2 but weakly with PH13H3 (Fig. 4d–f and Supplementary Fig. 16). Domain-specific Y2H assays indicated that PH13 interacts with GmCOP1b via their coil-coil domains, but the WD40 domain of PH13 can significantly enhance their interaction strength (Supplementary Fig. 17a–c). These results together demonstrate that the absence of the WD40 domain in PH13H3 reduces the interaction strength between PH13 and GmCOP1s in soybean.
PH13 and its paralogue PHP function together to decrease STF1/2 abundance
Given previous evidence that GmCOP1s mediate the degradation of STF1/2 transcription factors which are homologous to Arabidopsis HY5 and responsible for inhibiting stem elongation in legume21,22,41,42, we sought to determine whether the altered interaction of the truncated PH13 with GmCOP1s affects the accumulation of STF1/2 proteins. Our results showed that the overall protein levels of STF1/2 increased by 39–144% during a diurnal cycle in the NILH3 line compared to NILH1 line grown under long day condition in growth chamber (Fig. 4g, h and Supplementary Fig. 18). This indicates that the insertion retrotransposon in NILH3 enhances the accumulation of STF1/2. Additionally, the pattern of STF1/2 protein accumulation, which peaked in the middle of the day (Fig. 4h), was opposite to that of the GmCOP1s and PH13 transcripts peaking at night (Fig. 4c). This suggests that the GmCOP1s-PH13 E3 ligases play a role in modulating the abundance of the STF1/2 protein in response to photoperiod.
While the abundance of the STF1/2 protein increased in the NILH3 line, its expression pattern remained unchanged and its level still decreased during the night period (Fig. 4h and Supplementary Fig. 18), suggesting a possibility that some homologous genes may act redundantly or additively with PH13 to regulate the degradation of STF1/2. Indeed, the soybean genome contains a PH13 paralogue Glyma.12G224600, which we hereafter name PHP, encoding a protein with 96.6% amino acid similarity to PH13 (Supplementary Figs. 2a and 19). We then generated multiple php and ph13/php double (phd) mutant lines using CRISPR/Cas9 technology in the TL1H3 background (Supplementary Fig. 20). Phenotypic analysis revealed a progressive reduction in plant height in the order of TL1H3, ph13, php, and phd (Supplementary Fig. 21), indicating that the PH13 and PHP genes act cooperatively in regulating plant height. We observed at least 3-fold increase in STF1/2 protein abundance in the phd-1 mutant compared to the WT TL1H3 (Supplementary Fig. 22) especially at night. To verify whether PH13 and PHP act as upstream regulators of STF1/2, we crossed the phd-1 mutant with the stf1/2 mutant43 (Supplementary Fig. 23a). Phenotypic assessment showed partial rescue of the dwarf phenotype of the phd-1 mutant by the stf1/2 mutant (Supplementary Fig. 23b, c), suggesting that STF1/2 act as downstream targets of PH13 and PHP to influence plant height.
PH13 and PHP redundantly regulate LBL-induced exaggerated stem elongation
Considering that low blue light (LBL) triggers degradation of STF1/2 and leads to ESE syndrome in soybean21, we hypothesize that the increased abundances of STF1/2 in the ph13, php and phd mutants might result in an appropriate extent of stem elongation for high yield under shade conditions. To test this, we compared the performance of each mutant line with the WT TL1H3 in response to simulated shade regimes (Fig. 5a and Supplementary Fig. 24). Our results showed that LBL efficiently induced ESE syndrome in the ph13 and php mutants as well as in TL1H3, but not in the phd mutant (Fig. 5b).
The blue light receptor, GmCRY1s, was observed to mainly mediated LBL-induced shade avoidance syndrome (SAS) in soybean21. Next, we crossed the CRISPR-Cas9-engineered Gmcry1s quadruple (Gmcry1s-qm) mutant21 displaying constitutive ESE syndrome, with the phd-1 mutant carrying stocky phenotype, to obtain the phd-1/Gmcry1s-qm sextuple mutant (Supplementary Fig. 25a). We found that the phd mutations partially rescued the ESE syndrome of the Gmcry1-qm mutant in field conditions (Supplementary Fig. 25b–d), suggesting that PH13 and PHP may be genetically downstream of GmCRY1s in regulating of LBL-induced stem elongation. Furthermore, the phd mutant was found to display a semi-dwarf phenotype due to decreased internode length, rather than reduced node number in comparison with TL1H3, under LBL conditions (Fig. 5b). Moreover, the phd mutant was distinguished by its thicker stem compared to other lines under all light conditions (Fig. 5a and Supplementary Fig. 26). Collectively, these results demonstrate that the phd mutant is characterized by ideal shade-resistant traits, which may be advantageous for high-density planting or intercropping at high latitudes.
Knockout of PH13 and PHP enable TL1H3 to adapt to high latitudes
Generally, the allopatric planting of lower-latitude soybean cultivars in higher-latitude region can induce excessive stem growth and lead to severe lodging due to the extended vegetative phase (Supplementary Fig. 27). To verify if the phd mutants could be utilized to enhance the adaptability of cultivars to higher latitudes, we conducted field trails in Changchun (125°19′E, 43°53′N) and Xuchang (104°31′E, 34°10′N) and compared the extent of stem elongation between the phd mutants and TL1H3 (Supplementary Fig. 28a, b). The results showed that the phd mutant had a lower fold change in stem elongation between the two latitudes than TL1H3 (Supplementary Fig. 28c). Moreover, the phd mutants had a lower lodging rate, earlier maturity time, more branches, and a higher pod number per plant than the ph13, php mutants, and WT TL1H3 (Supplementary Fig. 29). In addition, we observed that the phd mutants exhibit a larger root system at seedling stage in growth chamber and at maturity stage in Beijing field conditions (Supplementary Fig. 30). Above results further suggesting the potential of using the phd mutant to improve the adaptability of soybean to high-latitude regions.
The phd mutants is suitable for high-density planting or intercropping
We evaluated the yield performance of the phd mutants at different planting densities (30 cm, 20 cm, 10 cm, and 5 cm plant space, equivalent to approximately 67,000, 100,000, 200,000, and 400,000 plants per hectare) in Changchun (Supplementary Fig. 31). The results showed that the main stem length of TL1H3 increased with the increase of planting density (IPD), resulting in severe lodging at all planting densities, whereas the phd mutants were insensitive to IPD and no lodging occurred (Fig. 5c, d and Supplementary Figs. 31c, 32c). While the yield per plant decreased with the IPD for TL1H3, the phd mutants consistently outperformed it (Fig. 5e and Supplementary Fig. 32). The plot yields of the phd mutants increased significantly in response to the IPD, while remaining unchanged or even decreased for TL1H3 (Fig. 5f). Moreover, the phd mutant lines showed better performance than a local elite cultivar Jiyu202 (JY202) at high planting density (400,000 plants per hectare) (Supplementary Fig. 33a). Despite being shorter than JY202, the phd mutants did not display any difference in node number (Supplementary Fig. 33d). Additionally, the phd mutants had a thicker stem, lower lodging rate, more branches, more pods per plant, as well as a 15.8% increase in grain yield per plant compared to JY202 (Supplementary Fig. 33b–h).
Furthermore, we evaluated the shade-tolerant ability of the phd mutants under a maize-soybean relay intercropping system at even higher latitudes in Harbin (127°50’E, 45°70’N). The results showed that the phd mutants were significantly shorter than TL1H3 as well as LK317 and LK18-842 (Supplementary Fig. 34a–c), two elite cultivars used for maize-soybean intercropping in Harbin. Moreover, lodging did not occur in the phd mutants, while nearly 100%, 40% and 80% lodging rates were observed in TL1, LK317 and LK18-842, respectively (Supplementary Fig. 34d). The plot yield of the phd mutants was at par with that of LK317 and LK18-842 (Supplementary Fig. 34e). These results together demonstrate that knockout of PH13 and PHP can improve the adaptability of TL1H3, a cultivar of mid latitude, to higher latitude for high density planting or intercropping.
Discussion
The global demand for soybeans is expected to continue rise in the future due to several factors such as the rising consumption of meat and soy-based health products, growing populations, and a more favorable policy of biofuels. Currently, the majority of soybean cultivation is located in mid to low latitude countries, including the United States, Brazil, and Argentina, which together contribute 80% of the total global yield44. Thus, there present a great opportunity for soybean production in high-latitude regions, such as Canada, northern US, northern China, and Russia, which could potentially serve as the primary source of increased soybean production in the future.
However, high-latitude regions have short frost-free periods and a long-day photoperiod, which can lead to excessive stem elongation and lodging. Those require the development of soybean varieties with a shorter growth period and reduced plant height. In this study, a natural variant of the PH13 gene (PH13H3) was identified that has been unintentionally utilized in the breeding of modern soybean varieties in northern latitude regions. The underlying mechanism of the reduced plant height was determined to be the insertion of the Ty1/Copia-like retrotransposon in the PH13 gene, causing a partial loss of the WD40 domain in the PH13 protein (Fig. 4d) and weakening its interaction with GmCOP1s and allowing the accumulation of downstream transcription factors STF1/2 (Fig. 4g, h), which subsequently inhibit internode elongation. The interaction between PH13 and GmCOP1b is mediated by their coil-coil domains, and enhanced by the WD40 domain of PH13. This is likely different from Arabidopsis, where the interaction between SPA1 (a homologous protein of SPA3/4 and PH13) and COP1 is also mediated by their coil-coil domains, and the absence of the WD40 domain of SPA1 does not impair the interaction with COP145. Furthermore, COP1 and SPAs have been reported to co-localize and function together within the nucleus in Arabidopsis46,47, whereas PH13 lacks an NLS (Nuclear Localization Signal) and is uniformly distributed in both the nucleus and cytoplasm (Supplementary Figs. 15 and 19). This suggests that PH13 may be involved in a distinct mechanism for regulating soybean growth and development.
Although the PH13H3 allele has been strongly selected in northern latitudes, its ability to reduce plant height may be insufficient and needs to be combined with early flowering loci to suit the environmental requirements of higher latitude regions14,48,49. This is evidenced by the TL1H3 cultivar, which harbors the PH13H3 haplotype but exhibits severe ESE and lodging in the northern regions (Fig. 5c and Supplementary Fig. 27). To address this problem, the PH13 gene and its paralog gene PHP were simultaneously eliminated to produce the phd mutants in TL1H3 background, which exhibit both reduced plant height and early maturity suitable for planting at high latitudes (Fig. 5d and Supplementary Fig. 29f). Additionally, many current soybean variates in the north latitudes are not suitable for high-density planting and soybean-maize intercropping which can exacerbate the problem of lodging caused by long-day photoperiods20,50. Low blue light (LBL) is the main shade avoidance signal that induces the ESE syndrome in soybean21. The phd mutant is insensitive to LBL and lodging resistant (Fig. 5a, b), thus significantly increasing yield under high-density planting conditions at high latitudes (Fig. 5e, f). Moreover, the phd mutations can also benefit intercropping by reducing the lodging-induced yield reduction (Supplementary Fig. 34d, e). Although the phd mutant in the TL1 background exhibited lower yield potential compared to the elite cultivars LK317 and LK18-842 in Northern China under norm planting conditions, it displayed a lower lodging rate and a similar grain yield potential relative to the elite cultivars LK317 and LK18-842 under maize-soybean relay intercropping conditions. In conclusion, this study identified a pair of plant height regulatory genes that can improve soybean adaptation to high latitude and provided a strategy for breeding high-yield varieties suitable for dense planting and intercropping at high latitudes (Fig. 6).
Methods
Plant materials and growth conditions
The high-quality genome sequences of 2214 accessions have been published before24. Among these materials, 540 improved cultivars with phenotype data were utilized for GWAS, resulting in the discovery of an association between genotype and phenotype variation. Furthermore, TWAS analysis uncovered an association between gene between gene expression and phenotype variation in 488 soybean accessions that possessed both phenotype and expression data. Detailed information of these materials is available in Supplementary Data 1.
In this study, the soybean (Glycine max (L.) Merr.) cultivar Tianlong 1 (TL1) and Williams 82 (W82) served as control groups to generate transgenic lines. To evaluate plant height, we cultivated WT, ph13, php, and phd mutants, as well as PH13 overexpression lines under long day conditions (16 h light/8 h dark at 26 °C) in a controlled growth chamber. For the field experiment, the aforementioned transgenic materials were grown naturally from May to October in three different locations: the Institute of Crop Science, Chinese Academy of Agricultural Science, Beijing (116°23′E, 39°54′N), Jilin Agricultural University, Changchun (125°19′E, 43°53′N), Beidahuang KenFeng Seed Co., Ltd, Harbin (127°50’E, 45°70’N).
To assess the plant density effects, four treatments with three replicates each were conducted in Changchun. Seeds were sown in May with varying plant spacings (30 cm, 20 cm, 10 cm, and 5 cm, resulting in plant densities of 66,700, 100,000, 200,000, and 400,000 plants per hectare, respectively). The plots were organized in 3 m long rows with 0.5 m between each row, covering a total area of 7.5 square meters. Harvesting was performed in October 2021.
For the maize-soybean relay intercropping experiments, seeds were sown in early May in Harbin, with a row-to-row distance of 40 cm between maize plants, and between soybean plants, and 60 cm between maize and soybean. The soybean was spaced 5 cm apart in rows of 5 m in length (equivalent to 400,000 plants per hectare), while the maize was spaced 10 cm apart in rows of 5 m in length. To maintain uniformity, all lines under different densities were manually planted with 3 seeds per hole. Once the seeds geminated and the unifoliate leaf fully unfolded, only one healthy seedling was retained per hole. Failed seedlings were removed and replaced with a healthy seedling of same genotype via transplanting to maintain uniformity. The plot size was 19.7 square meters, and two biological replicates were conducted for each experimental material. Harvesting was performed in October 2022. At the R8 stage (when 95% of pods had reached maturity), the following agronomic traits were measured: plant height, length of the internodes, number of branches, node number, height of the center of gravity point, pods per plant, lodging rate, grain yield per plant, and grain yield per plot were measured. Soybean lodging resistance was evaluated based on the height of the center of gravity point, a measure of the balance point at which mature plants were placed in a horizontal position. At least ten randomly selected plants within each plot were included in the phenotypic analysis for each measured trait.
GWAS and TWAS assays
A panel of 857 soybean accessions (Supplementary Fig. 1a) were genotyped and phenotyped for plant height for association analysis in a previous study24. The plant materials were planted in ten different environments and evaluated for a minimun of two years to record plant height data. These environments included Harbin (45°45’N, 126°41’E) from 2017 to 2019, Changchun (43°88’N, 125°25’E) in 2018 and 2019, Tonghua (41°74’N, 125°94’E) from 2017 to 2019, Shijiazhuang (38°05’N, 114°52’E) in 2018 and 2019, Liaocheng (36°46’N, 115°99’E) from 2017 to 2019, Xuzhou (34°27’N, 117°19’E) in 2018 and 2019, Nanjing (31°05’N, 118°78’E) in 2018 and 2019, Hefei (31°88’N, 117°17’E) in 2018 and 2019, Wuhan (30°58’N, 114°32’E) in 2018 and 2019, and Nanchang (28°32′N, 116°1’E) from 2017 to 2019. Each cultivar was planted in a 1.8 m × 0.8 m plot with two rows and a spacing of 10 cm between seedlings in each row. The soybean lines’ plant height Best Linear Unbiased Predictions (BLUPs) were calculated using a mixed linear model (1)51. Here, Y represents the observed plant height, X is the fixed effects, β is the vector of fixed effect coefficients, Z is the random effects, u is the random effect coefficients, and e is the residual errors.
1 |
In the model, the fixed effect represents the mean plant height across all lines, locations, and years. Random effects account for variations within each line, location, year, and their interactions (Line: Location and Line: Year). To address incomplete data in some lines across experimental trials, both Year and Location were treated as random effects.
The re-sequencing of 2214 soybean accessions generated 8,785,134 SNPs, which were imputed by beagle24. After filtering the imputed SNPs for a minor allele frequency of >5% among the genotyped and phenotyped 540 soybean accessions, the remaining 3,469,934 SNPs were retained and used for plant height GWAS. The GWAS was performed using the fixed and random model Circulating Probability Unification (FarmCPU)25, with population structure controlled by the first three components from SNPs principal components analysis (PLINKv1.90)52. The resulted p-values were adjusted with Bonferroni correction at a level of α = 0.05, resulting in a cutoff of 1.44E−08.
A natural panel (PRJCA014188) of previously published RNA-Seq data from tissues above cotyledonary node at the V2 stage was employed for TWAS. A total of 488 soybean accessions, possessing both expression and phenotype data, were included in the analysis and underwent TWAS using FarmCPU25. Genes and exons with an average Transcripts Per Million (TPM) > 0.1 were considered to be expressed as described by Li et al. 53. Population structure was controlled by the first three components from the expression principal components analysis. The resulting p-values were adjusted using Bonferroni correction at a level of α = 0.05. However, a potential issue of the FarmCPU model is its integration of leading markers as co-variants, which can remove co-expressed and result in false-negative TWAS results. To address this issue, TWAS was also conducted using a compressed mixed linear model implemented in GAPIT54,55.
DNA isolation and detection of PH13 haplotype
The DNA of 1254 accessions (out of 2214 genome sequenced accessions) were extracted from leaves individually using the modified CTAB method56. The status of a SNP (A or G) located 2444 bp downstream of the translation start site (TSS) was determined using the resequencing data24. To identify the insertion of the Ty1/Copia-like retrotransposon, three primers were designed for polymerase chain reaction (PCR) detection (Supplementary Data 8). The presence of the fragment insertion was confirmed by agarose gel electrophoresis.
Construction of near isogenic lines
The creation of a pair of near-isogenic lines (NILs), consisting of NILH1 and NILH3, was achieved through the crossbreeding of Williams82 carrying Hap1 (W82H1) and Tianlong1 carrying Hap3 (TL1H3). In the progenies of the F7 generation, a heterozygous line at the PH13 gene was selected, and the F8 generation segregating groups, carrying homozygous H1 (NILH1) and homozygous H3 (NILH3) were used for phenotypic analysis (Supplementary Fig. 13). Segregation of the PH13 haplotypes was analyzed by PCR using the three primers designed for the Ty1/Copia-like retrotransposon insertion, and confirmed through agarose gel electrophoresis. Seeds heterozygous in the targeted region from the F8 generation were grown in the phytotron or field, and the plant height phenotypes of the segregating progeny were recorded at the R8 stage.
Plasmid construction and plant transformation
To generate CRISPR/Cas9 engineered mutants, multiple gRNAs were designed for each gene on the basis of their specificity and off-target effects using the CRISPR direct website (http://crispr.dbcls.jp/)57. CRISPR/Cas9 vectors were constructed using multiple target gRNA for each gene in order to improve editing efficiency and minimize off-target effects16. The editing efficiency of each construct was evaluated using the soybean hairy root system58, and at least two vectors independently with high editing efficiency for each gene were selected for soybean transformation. The selected CRISPR/Cas9 vectors were introduced into the Agrobacterium tumefaciens strain EHA105 through electroporation, and then separately transformed into W82H1 and TL1H3 using the cotyledon-node method59.
To construct the overexpression vector, the coding DNA sequence (CDS) of PH13 was amplified from cDNA derived from young W82H1 seedlings via PCR. The CDS was then cloned into the 35 S::3×Flag vector with an XhoI site, which was constructed based on the pFGC5941 plasmid containing the (2×35 S)-(3×Flag)-NOS cassette inserted between EcoRI and HindIII sites60. The newly generated construct, 35 S::PH13H1−3×Flag, was then introduced into Agrobacterium strain EHA105 for the transformation of TL116. Additionally, the 35 S::PH13H1−3×Flag vector was introduced into the Arabidopsis spa134 mutant34 for ectopic expression. The applied primers were listed in the Supplementary Data 8.
RNA extraction and gene expression analysis
To investigate the transcriptional dynamics of PH13 in different genotypes and assess the co-expression of PH13 and GmCOP1s, we grew W82H1 and TL1H3 separately under long-day conditions for 20 days. The second fully expanded trifoliolate leaves were harvested every 4 hours over a day. Two pairs of primers were designed for qPCR to detect the expression levels of different PH13 alleles, one targeting exon 3, and the other targeting the 3’UTR. Total RNA was extracted using Trizol Reagent (TIANGEN), and then treated with DNase. A reverse transcription kit (TransGen Biotech) was used to synthesized cDNA from 3 μg of total RNA in a 20 μl reaction. RT-qPCR was performed on 384-well optical plate using the ABI Q7 equipment and the SYBR Green RT-PCR kit (Vazyme Biotech). All primers used for the indicated genes were listed in the Supplementary Data 8. Three independent biological replicates were carried out for each sample.
Multiple alignment and phylogenetic analysis
The protein sequences of AtSPAs (AtSPA1 AT2G46340, AtSPA2 AT4G11110, AtSPA3 AT3G15354, AtSPA4 AT1G53090) were retrieved from TAIR (https://www.arabidopsis.org/index.jsp). Homologous SPA protein sequences were sourced from Phytozome (https://phytozome.jgi.doe.gov/pz/portal.html), with the list of the homologous SPA protein sequences utilized in this study outlined in Supplementary Data 4. The amino acid sequences of the SPA proteins and their homologous proteins were aligned by ClustalW in MEGA7 with manually adjustments. The phylogenetic tree was constructed using the neighbor joining method in MEGA7 software.
Subcellular location in protoplasts
To investigate the subcellular location of different PH13 haplotype proteins, the CDSs of PH13H1, PH13H2, and PH13H3 were inserted into the pA7-YFP vector at the BamHI and SmaI sites using the In-Fusion system, respectively (TransGen Biotech). The resulting PH13Hs-YFP transient expression constructs were driven by the 35 S promoter. Control experiments were performed utilizing the empty pA7-YFP vector. Additionally the GmMYB29-RFP fusion protein was employed as a nucleus marker61. The plasmids were transformed into Arabidopsis mesophyll protoplasts and the resultant subcellular localization was imaged using a Zeiss LSM980 confocal laser scanning microscope. ZEN 2009 Light Edition software was used to process the images. All primers used for vector construction were listed in the Supplementary Data 8.
Yeast two-hybrid assays
The yeast two-hybrid assays were conducted following the manufacturer’s instructions (Yeast Handbook Clontech). The CDSs of PH13H1, PH13H2, PH13H3, PH13N, PH13CT, PH13cc, and PH13WD40 were individually cloned in frame with the GAL4 DNA binding domain in the bait vector pGBKT7 (Clontech, catalog no. 631604). While the CDSs of GmCOP1a and GmCOP1b were similarly fused with the GAL4 transcription activation domain in the prey vector pGADT7 (Clontech, catalog no. K1612-1). The bait and prey plasmids were then co-transformed into the yeast strain Saccharomyces cerevisiae AH109 (Clontech). The transformed yeast cells were grown on SD/-Leu-Trp (-LW) minimal medium. Positive clones were selected and grown on SD/-His-Leu-Trp-Ade (-LWHA) selection medium at 30 °C for 3–5 days to evaluate protein interactions.
The β-galactosidase activity assay was performed as previously reported62. Briefly, yeast colonies were selected and cultured at 180 rpm and 28 °C in an incubator until the optical density (OD600) of the culture reached 0.1 in a 10 mL flask containing 4 ml of SD medium (-Leu/-Trp). 2 mL of yeast culture was then transferred into 8 mL YPDA culture solution and cultured at 180 rpm at 28 °C until the OD600 reached 0.5-0.8 prior to the β-galactosidase assay. The relative bait-prey interaction was presented as β-gal units (2), where T is the response time (min) and V represents 0.1 × concentration factor.
2 |
Co-immunoprecipitation assays
To evaluate the strength of interactions between different haplotypes of PH13 and GmCOP1b, a co-immunoprecipitation (Co-IP) assay was conducted in N. benthamiana. The PH13H1-Flag, PH13H2-Flag or PH13H3-Flag constructs were co-transformed with the GmCOP1b-YFP construct as indicated into N. benthamiana leaves. The YFP protein co-expressed with PH13H1-Flag was used as a negative control. Following infiltration, the N. benthamiana plants were incubated at 25 °C for 12 h in the dark and then transferred to light growth conditions for an additional 36 h before IP analysis. The samples were harvested, ground, and treated with lysis buffer containing 1 mM MgCl2, 10 mM EDTA [pH 8.0], 1 mM PMSF, and 5 mM DTT, and Roche protease inhibitor cocktail. After centrifugation, the supernatant was incubated with anti-GFP trap agarose (Chromotek, catalog number gta-20) overnight at 4 °C, and rinsed three times with lysis buffer. The samples were boiled in SDS-PAGE sample buffer then analyzed using immunoblotting with anti-GFP antibody (at a 1:2500 dilution) followed by anti-Flag antibody (at a 1:2500 dilution).
Light regimes
White light (WL), blue light (400–499 nm), red light (600–699 nm), and far-red light (700–750 nm) LED panels (HiPoint brand, 14005-11145, Made in TAIWAN) were used separately or in combination as indicated (Supplementary Fig. 24). Low blue light (LBL) was achieved by filtering WL through two layers of yellow filters (no. 101, Lee Filters, CA), while low red: far-red light (L R:FR) was achieved by supplementing WL with far-red light21. The height of the LED was adjusted to maintain the Photosynthetic Photon Flux Density (PPFD) at about 500 μmol m−2 s−1. The quality and intensity of light were measured by placing a HiPoint HR−350 spectrometer on top of the leaves.
Statistical analyses
For phenotypic investigation, at least five individual plants per accession were analyzed. The exact numbers of individuals (n) varying depending on the experiment were presented in the figure legends. The expression analysis were conducted by pooling at least three individual plants per tissue sample and performing at least three RT-qPCR reactions (technical replicates) for three biological replicates. Multiple comparisons were performed using GraphPad Prism 8.0 software with a two-way ANOVA and a two-sided Tukey test. For comparisons between two groups, two-tailed Student’s t-tests were conducted in Microsoft Excel to obtain p-values. The figure legends provide details on the statistical tests utilized for each experiment.
Primers and accession number
All primers used in this study are listed in Supplementary Data 8. Gene sequences are available at the Phytozome database (https://phytozome-next.jgi.doe.gov/info/Gmax_Wm82_a2_v1): PH13/Glyma.13G276700, PHP/Glyma.12G224600, GmCOP1a/Glyma.02G267800, GmCOP1b/Glyma.14G049700, STF1/Glyma.08G302500, STF2/Glyma.18G117100, GmCRY1a/Glyma.04G101500, GmCRY1b/Glyma.06G103200, GmCRY1c/Glyma.14G174200 and GmCRY1d/Glyma.13G089200.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Source data
Acknowledgements
This work was supported by the National Key Research and Development Plan (grant no. 2021YFF1001201, 2021YFD1201601), the National Natural Science Foundation of China (grant no. 31422041, 31871705, 32201759), the Innovation Program of Chinese Academy of Agricultural Sciences, the Agricultural Science and Technology Innovation Program (ASTIP) of the Chinese Academy of Agricultural Sciences (grant no. CAAS-ZDRW202109), the Central Public-Interest Scientific Institution Basal Research Fund, and the earmarked fund for CARS (grant no. CARS-04-PS01). We are grateful to Dr. Huihui Li (Institute of Crop Sciences, Chinese Academy of Agriculture Sciences) for the fruitful discussions.
Author contributions
B.L., L.-J.Q., F.K. and Y.-H.L. designed the research. C.Q. and D.L. performed the experiments. X.L., R.J. and Q.C. provided relevant experimental materials. Y.Z., X.Z.W., Q.W., Y.W., W.H., Q. Z., L.L., X.W., G.X., G.H., Z.S. and R.W., assisted in collecting the phenotypic data. Z.J., X.J.L., Y.L., H.L., H.Y.L, T.Z., J.L. and X.H designed the field experiment. C.Q., X.Z., D.L. and L.K. analyzed data. B.L., L.-J.Q., F.K., Y.-H.L., D.L. and C.Q. wrote the manuscript.
Peer review
Peer review information
Nature Communications thanks Julin Maloof and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Data availability
The raw sequence data24 and gene expression data63 reported elsewhere are available at NCBI Sequence Read Archive under accession PRJNA681974 and the Genome Sequence Archive (GSA) database of the BIG Data Center under accession PRJCA014188, respectively. The sequences of the three PH13 alleles reported in this study are available at GenBank of NCBI: PH13H1/OR637868, PH13H2/OR637869, and PH13H3/OR637870. Requests for materials should be addressed to B.L. or L.-J.Q. Source data are provided with this paper.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Chao Qin, Ying-hui Li, Delin Li.
Contributor Information
Fanjiang Kong, Email: kongfj@gzhu.edu.cn.
Li-juan Qiu, Email: qiulijuan@caas.cn.
Bin Liu, Email: liubin05@caas.cn.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-023-42608-5.
References
- 1.SoyStat. http://www.soystats.com/ (2023).
- 2.Ray DK, Ramankutty N, Mueller ND, West PC, Foley JA. Recent patterns of crop yield growth and stagnation. Nat. Commun. 2012;3:1293. doi: 10.1038/ncomms2296. [DOI] [PubMed] [Google Scholar]
- 3.Ray DK, Mueller ND, West PC, Foley JA. Yield trends are insufficient to double global crop production by 2050. PLoS ONE. 2013;8:e66428. doi: 10.1371/journal.pone.0066428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Zhang SR, et al. Photoperiodism dynamics during the domestication and improvement of soybean. Sci. China Life Sci. 2017;60:1416–1427. doi: 10.1007/s11427-016-9154-x. [DOI] [PubMed] [Google Scholar]
- 5.Lin X, Liu B, Weller JL, Abe J, Kong F. Molecular mechanisms for the photoperiodic regulation of flowering in soybean. J. Integr. Plant Biol. 2021;63:981–994. doi: 10.1111/jipb.13021. [DOI] [PubMed] [Google Scholar]
- 6.Yang Q, et al. Environmental and genetic regulation of plant height in soybean. BMC Plant Biol. 2021;21:63. doi: 10.1186/s12870-021-02836-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hartwig EE, Kiihl R. Identification and utilization of a delayed flowering character in soybeans for short-day conditions. Field Crops Res. 1979;2:145–151. doi: 10.1016/0378-4290(79)90017-0. [DOI] [Google Scholar]
- 8.Sinclair TR, Hinson K. Soybean flowering in response to the long-juvenile trait. Crop Sci. 1992;32:5. doi: 10.2135/cropsci1992.0011183X003200050036x. [DOI] [Google Scholar]
- 9.Neumaier N, James AT. Exploiting the long-juvenile trait to improve adaptation of soybeans to the tropics. Food Legume Newsl. 1993;8:12–14. [Google Scholar]
- 10.Xia Z, et al. Positional cloning and characterization reveal the molecular basis for soybean maturity locus E1 that regulates photoperiodic flowering. Proc. Natl Acad. Sci. USA. 2012;109:E2155–E2164. doi: 10.1073/pnas.1117982109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Watanabe S, et al. A map-based cloning strategy employing a residual heterozygous line reveals that the GIGANTEA gene is involved in soybean maturity and flowering. Genetics. 2011;188:395–407. doi: 10.1534/genetics.110.125062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Watanabe S, et al. Map-based cloning of the gene associated with the soybean maturity locus E3. Genetics. 2009;182:1251–1262. doi: 10.1534/genetics.108.098772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Liu B, et al. Genetic redundancy in soybean photoresponses associated with duplication of the phytochrome A gene. Genetics. 2008;180:995–1007. doi: 10.1534/genetics.108.092742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Dong L, et al. Parallel selection of distinct Tof5 alleles drove the adaptation of cultivated and wild soybean to high latitudes. Mol. Plant. 2022;15:308–321. doi: 10.1016/j.molp.2021.10.004. [DOI] [PubMed] [Google Scholar]
- 15.Lu S, et al. Stepwise selection on homeologous PRR genes controlling flowering and maturity during soybean domestication. Nat. Genet. 2020;52:428–436. doi: 10.1038/s41588-020-0604-7. [DOI] [PubMed] [Google Scholar]
- 16.Li C, et al. A domestication-associated gene GmPRR3b regulates the circadian clock and flowering time in soybean. Mol. Plant. 2020;13:745–759. doi: 10.1016/j.molp.2020.01.014. [DOI] [PubMed] [Google Scholar]
- 17.Wang L, et al. Natural variation and CRISPR/Cas9-mediated mutation in GmPRR37 affect photoperiodic flowering and contribute to regional adaptation of soybean. Plant Biotechnol. J. 2020;18:1869–1881. doi: 10.1111/pbi.13346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Garner WW, Allard HA. Effect of the relative length of day and night and other factors of the environment on growth and reproduction in plants. J. Agric. Res. 1920;18:157–158. [Google Scholar]
- 19.Yang F, et al. Growth of soybean seedlings in relay strip intercropping systems in relation to light quantity and red:far-red ratio. Field Crops Res. 2014;155:245–253. doi: 10.1016/j.fcr.2013.08.011. [DOI] [Google Scholar]
- 20.Raza MA, et al. Growth and development of soybean under changing light environments in relay intercropping system. PeerJ. 2019;7:e7262. doi: 10.7717/peerj.7262. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Lyu X, et al. GmCRY1s modulate gibberellin metabolism to regulate soybean shade avoidance in response to reduced blue light. Mol. Plant. 2021;14:298–314. doi: 10.1016/j.molp.2020.11.016. [DOI] [PubMed] [Google Scholar]
- 22.Ji R, et al. Induced mutation in GmCOP1b enhances the performance of soybean under dense planting conditions. Int. J. Mol. Sci. 2022;23:5394. doi: 10.3390/ijms23105394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Mu R, et al. GmBICs modulate low blue light-induced stem elongation in soybean. Front. Plant Sci. 2022;13:803122. doi: 10.3389/fpls.2022.803122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Li Y, et al. Genome-wide signatures of the geographic expansion and breeding of soybean. Sci. China Life Sci. 2023;66:350–365. doi: 10.1007/s11427-022-2158-7. [DOI] [PubMed] [Google Scholar]
- 25.Liu X, et al. Iterative usage of fixed and random effect models for powerful and efficient genome-wide association studies. PLoS Genet. 2016;12:e1005767. doi: 10.1371/journal.pgen.1005767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Liu B, et al. The soybean stem growth habit gene Dt1 is an ortholog of Arabidopsis TERMINAL FLOWER1. Plant Physiol. 2010;153:198–210. doi: 10.1104/pp.109.150607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ping J, et al. Dt2 is a gain-of-function MADS-domain factor gene that specifies semideterminacy in soybean. Plant Cell. 2014;26:2831–2842. doi: 10.1105/tpc.114.126938. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Kabelka E, et al. Putative alleles for increased yield from soybean plant introductions. Crop Sci. 2004;44:784–791. doi: 10.2135/cropsci2004.7840. [DOI] [Google Scholar]
- 29.Reinprecht Y, et al. Seed and agronomic QTL in low linolenic acid, lipoxygenase-free soybean (Glycine max (L.) Merrill) germplasm. Genome. 2006;32:1510–1527. doi: 10.1139/g06-112. [DOI] [PubMed] [Google Scholar]
- 30.Diers B, et al. Genetic architecture of soybean yield and agronomic traits. G3. 2018;8:3367–3375. doi: 10.1534/g3.118.200332. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Oki N, et al. Quantitative trait loci associated with short inter-node length in soybean. Breed. Sci. 2018;68:554–560. doi: 10.1270/jsbbs.18087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Wang X, et al. Increased copy number of gibberellin 2-oxidase 8 genes reduced trailing growth and shoot length during soybean domestication. Plant J. 2021;107:1739–1755. doi: 10.1111/tpj.15414. [DOI] [PubMed] [Google Scholar]
- 33.Hoecker U, Xu Y, Quail H. SPA1: A new genetic locus involved in phytochrome A–specific signal transduction. Plant Cell. 1998;10:19–33. doi: 10.1105/tpc.10.1.19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Fittinghoff K, et al. Functional and expression analysis of Arabidopsis SPA genes during seedling photomorphogenesis and adult growth. Plant J. 2006;47:577–590. doi: 10.1111/j.1365-313X.2006.02812.x. [DOI] [PubMed] [Google Scholar]
- 35.Wang B, Smith S, Li J. Genetic regulation of shoot architecture. Annu. Rev. Plant Biol. 2018;69:437–468. doi: 10.1146/annurev-arplant-042817-040422. [DOI] [PubMed] [Google Scholar]
- 36.Guo W, et al. Altering plant architecture to improve performance and resistance. Trends Plant Sci. 2020;25:1154–1170. doi: 10.1016/j.tplants.2020.05.009. [DOI] [PubMed] [Google Scholar]
- 37.Hwang S, Lee T. Integration of lodging resistance QTL in soybean. Sci. Rep. 2019;9:6540. doi: 10.1038/s41598-019-42965-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Ponnu J, Hoecker U. Illuminating the COP1/SPA ubiquitin ligase: Fresh insights into its structure and functions during plant photomorphogenesis. Front. Plant Sci. 2021;12:662793. doi: 10.3389/fpls.2021.662793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Podolec R, Ulm R. Photoreceptor-mediated regulation of the COP1/SPA E3 ubiquitin ligase. Curr. Opin. Plant Biol. 2018;45:18–25. doi: 10.1016/j.pbi.2018.04.018. [DOI] [PubMed] [Google Scholar]
- 40.Menon C, Sheerin DJ, Hiltbrunner A. SPA proteins: SPAnning the gap between visible light and gene expression. Planta. 2016;244:297–312. doi: 10.1007/s00425-016-2509-3. [DOI] [PubMed] [Google Scholar]
- 41.Yong H, et al. STF1 is a novel TGACG-binding factor with a zinc-finger motif and a bZIP domain which heterodimerizes with GBF proteins. Plant J. 1998;15:199–209. doi: 10.1046/j.1365-313X.1998.00197.x. [DOI] [PubMed] [Google Scholar]
- 42.Weller J, et al. Light regulation of gibberellin biosynthesis in pea is mediated through the COP1/HY5 pathway. Plant Cell. 2009;21:800–813. doi: 10.1105/tpc.108.063628. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Ji H, et al. Differential light-dependent regulation of soybean nodulation by papilionoid-specific HY5 homologs. Curr. Biol. 2022;32:783–795.e5. doi: 10.1016/j.cub.2021.12.041. [DOI] [PubMed] [Google Scholar]
- 44.Vivek Voora, C.L. Steffany Bermudez. Global market report: Soybeans. Sustainable Commodities Marketplace (2020).
- 45.Hoecker U, Quail PH. The phytochrome A-specific signaling intermediate SPA1 interacts directly with COP1, a constitutive repressor of light signaling in Arabidopsis. J. Biol. Chem. 2001;276:38173–38178. doi: 10.1074/jbc.M103140200. [DOI] [PubMed] [Google Scholar]
- 46.Balcerowicz M, et al. SPA proteins affect the subcellular localization of COP1 in the COP1/SPA ubiquitin ligase complex during photomorphogenesis. Plant Physiol. 2017;174:1314–1321. doi: 10.1104/pp.17.00488. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Sheerin DJ, et al. Light-activated phytochrome A and B interact with members of the SPA family to promote photomorphogenesis in Arabidopsis by reorganizing the COP1/SPA complex. Plant Cell. 2015;27:189–201. doi: 10.1105/tpc.114.134775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Dong L, et al. The genetic basis of high-latitude adaptation in wild soybean. Curr. Biol. 2022;33:252–262. doi: 10.1016/j.cub.2022.11.061. [DOI] [PubMed] [Google Scholar]
- 49.Chen L, et al. Soybean adaption to high-latitude regions is associated with natural variations of GmFT2b, an ortholog of FLOWERING LOCUS T. Plant Cell Environ. 2020;43:934–944. doi: 10.1111/pce.13695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Zhai H, et al. GmMDE genes bridge the maturity gene E1 and florigens in photoperiodic regulation of flowering in soybean. Plant Physiol. 2022;00:1–16. doi: 10.1093/plphys/kiac092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Robinson GK. BLUP is a good thing: the estimation of random effects. Stat. Sci. 1991;1:15–32. [Google Scholar]
- 52.Gaunt TR, Rodriguez S, Day IN. Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool ‘CubeX’. BMC Bioinforma. 2007;8:428. doi: 10.1186/1471-2105-8-428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Li D, Liu Q, Schnable PS. TWAS results are complementary to and less affected by linkage disequilibrium than GWAS. Plant Physiol. 2021;186:1800–1811. doi: 10.1093/plphys/kiab161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Zhang Z, et al. Mixed linear model approach adapted for genome-wide association studies. Nat. Genet. 2010;42:355–360. doi: 10.1038/ng.546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Lipka A, et al. GAPIT: genome association and prediction integrated tool. Bioinformatics. 2012;28:2397–2399. doi: 10.1093/bioinformatics/bts444. [DOI] [PubMed] [Google Scholar]
- 56.Murray MG, Thompson WF. Rapid isolation of high molecular weight plant DNA. Nucleic Acids Res. 1980;8:4321–4326. doi: 10.1093/nar/8.19.4321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Naito Y, Hino K, Bono H, Ui-Tei K. CRISPRdirect: software for designing CRISPR/Cas guide RNA with reduced off-target sites. Bioinformatics. 2015;31:1120–1123. doi: 10.1093/bioinformatics/btu743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Sun X, et al. Targeted mutagenesis in soybean using the CRISPR-Cas9 system. Sci. Rep. 2015;5:10342. doi: 10.1038/srep10342. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Paz M, Wang K, et al. Improved cotyledonary node method using an alternative explant derived from mature seed for efficient Agrobacterium-mediated soybean transformation. Plant Cell Rep. 2006;25:206–213. doi: 10.1007/s00299-005-0048-7. [DOI] [PubMed] [Google Scholar]
- 60.Kerschen A, et al. Effectiveness of RNA interference in transgenic plants. FEBS Lett. 2004;566:223–228. doi: 10.1016/j.febslet.2004.04.043. [DOI] [PubMed] [Google Scholar]
- 61.Chu S, et al. An R2R3-type MYB transcription factor, GmMYB29, regulates isoflavone biosynthesis in soybean. PLoS Genet. 2017;13:e1006770. doi: 10.1371/journal.pgen.1006770. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Qin C, et al. GmEID1 modulates light signaling through the Evening Complex to control flowering time and yield in soybean. Proc. Natl Acad. Sci. USA. 2023;120:e2212468120. doi: 10.1073/pnas.2212468120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Li, D. et al. Transcriptome brings variations of gene expression, alternative splicing, and structural variations into gene-scale trait dissection in soybean. Preprint at 10.1101/2023.07.03.545230 (2023).
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw sequence data24 and gene expression data63 reported elsewhere are available at NCBI Sequence Read Archive under accession PRJNA681974 and the Genome Sequence Archive (GSA) database of the BIG Data Center under accession PRJCA014188, respectively. The sequences of the three PH13 alleles reported in this study are available at GenBank of NCBI: PH13H1/OR637868, PH13H2/OR637869, and PH13H3/OR637870. Requests for materials should be addressed to B.L. or L.-J.Q. Source data are provided with this paper.