Abstract
Phytochrome-interacting factor 4 (PIF4) participates in light signaling by interacting with the photoreceptors phytochromes and cryptochromes. Although well characterized in Arabidopsis, PIF4's role in crop plants is unknown. Here we performed the first integrated genomics, transcriptomics, and molecular characterization of PIF4 in soybean (Glycine max) plants. Fifteen identified Glycine max PIFs (GmPIFs) grouped into PIF3, PIF4, and PIF8 subfamilies based on their phylogenetic relationships. The GmPIF4 subfamily formed two distinct clades (GmPIF4 I and GmPIF4 II) with different amino acid sequences in the conserved bHLH region. Quantitative transcriptional analysis of soybean plants exposed to different photoperiods and temperatures indicated that all GmPIF4 I clade members retained PIF4-like expression. Three out of four GmPIF4 transcripts of the GmPIF4 I clade increased at 35 °C compared to 25 °C under short day conditions. RNA sequencing of soybeans undergoing floral transition showed differential regulation of GmPIF4b, and ectopic GmPIF4b expression in wild type Arabidopsis resulted in an early flowering phenotype. Complementation with GmPIF4b in Arabidopsis pif4-101 mutants partially rescued the mutant phenotype. PIF4 protein levels peaked before dawn, and a GmPIF4b protein variant was observed in soybean plants treated at high temperatures.
Introduction
Environmental factors such as light and temperature profoundly affect plant physiology and development, through not only their presence but also the duration of exposure1. The photoperiod (light and dark phase length) influences molecular signaling2, with the circadian clock synchronizing these environmental signals with endogenous rhythms to ensure optimal development and reproduction2,3.
High-throughput sequencing and genetic analyses have revealed that phytochrome interacting factors (PIFs), a class of basic helix-loop-helix (bHLH) transcription factors, play crucial roles in integrating photoperiodic signals through interactions with the photoreceptors phytochrome and cryptochrome. In the model plant Arabidopsis thaliana, PIFs belong to the bHLH superfamily of proteins, with the PIF subfamily consisting of seven members (PIF1, PIF3, PIF4, PIF5, PIF6, PIF7, and PIF8)4. The bHLH domain contains a stretch of 50–60 amino acids that comprises two segments: a stretch of around 40 amino acids forming two amphipathic α-helices separated by a variable-length loop, and a basic domain of 10–15 amino acids with DNA-binding capacity5.
PIF proteins have predominantly been studied in Arabidopsis shade avoidance responses6,7. PIFs interact with the light-activated form of phytochromes (Pfr) through their highly conserved active phytochrome-binding (APB) motifs6. PIFs typically accumulate in the dark, peak at dawn, and are then degraded in the presence of light through interaction with Pfr and ubiquitin-proteasome-mediated degradation6. PIF transcription is regulated by the evening circadian clock complex, with the ELF3-ELF4-LUX complex directly binding to PIF4 and PIF5 promoters to suppress their expression and regulate circadian responses8. It has recently been suggested that PIF4 acts as an integrating hub for light and temperature-related signals and the evening circadian clock-expressed factor TOC1 to regulate thermoresponsive plant growth9. PIF4 is also a central phytochrome regulator during Arabidopsis flowering under short day conditions3 through control of hormonal networks10,11. In Arabidopsis, PIF4 also controls auxin (indole acetic acid, IAA) signaling by modulating the expression of SMALL AUXIN-UP RNA (SAUR) genes at high temperatures10. PIF4 interacts with the blue light receptor CRYPTOCHROME 1 (CRY1) to regulate high temperature-mediated hypocotyl elongation by increasing IAA concentrations through stimulation of YUC8 (YUCCA8) and TRYPTOPHAN AMINOTRANSFERASE OF ARABIDOPSIS 1 (TAA1) gene expression12. PIF4 and PIF5 together play a crucial role in leaf senescence, activating ETHYLENE INSENSITIVE 3 (EIN3), ABSCISIC ACID INSENSITIVE 5 (ABI5), and ENHANCED EM LEVEL (EEL) gene expression to produce the senescence hormones ethylene and abscisic acid13. Clearly, PIFs have pleiotropic roles in model plants, but their roles in other plants of commercial value are less well characterized.
Soybean (Glycine max (L.) Merrill) is a leguminous crop that is mainly used as a source of protein and vegetable oil and that can fix atmospheric nitrogen via a symbiotic relationship with soil-borne microorganisms. The soybean genome is complex due to two genome duplication events estimated to have occurred 59 and 13 million years ago14. The paleopolyploid soybean genome presents the exciting opportunity to explore evolutionary diversification in gene function occurring due to chromosomal rearrangements during duplication. The presence of multiple forms/copies of a gene is often linked to the acquisition of new functions (neo-functionalization) or to partitioning of the original function among the copies (sub-functionalization) in a species. These gene diversification events lay the foundation for phenotypic variability and adaptability in plants15. Soybean flowering and pod set are dependent on the photoperiod16. Hence, soybean cultivars are divided into different maturity groups depending on day length requirements, and some of the quantitative trait loci that affect soybean flowering have recently been reported17–19.
The roles of PIFs and their interactions with phytochromes during soybean flowering have yet to be investigated. Moreover, the functions of genes related to temperature and light perception in soybean are unknown. The recent sequencing of the soybean genome has provided the means to examine the genes participating in soybean flowering pathways. To explore PIF4’s roles in soybean plants, especially short day-specific signaling in soybean flowering, we studied all the GmPIF sequences present in the soybean genome. Phylogeny, conserved protein motifs, and expression profiles of these genes were comprehensively analyzed using bioinformatics approaches. Further, gene expression patterns under flowering non-inductive (long day) and flowering inductive (short day) light conditions and at elevated temperatures were quantitatively analyzed. The function of differentially regulated GmPIF4 (GmPIF4b) was studied by ectopic expression in Arabidopsis Col-0 and in pif4-101 mutants. We reveal structural and functional divergence in soybean PIF4 genes and proteins.
Results
Identification, phylogeny, and subcellular localization of GmPIF genes
Systematic and comprehensive database searches of the available genome sequences of leguminous plants revealed multiple PIF family members. To investigate the phylogenetic relationship between different PIFs and their evolutionary conservation, four leguminous plants with sequenced genomes were considered. The Phytozome search and phylogenetic analysis grouped fifteen PIF-like sequences into PIF4, PIF3, and PIF8 clades. There is strong evidence that soybean has undergone two whole genome duplication events during evolution. Based on the chromosomal evidence, soybean’s recent lineage-specific palaeotetraploidization was probably an allotetraploidy event14 preceded by an early legume duplication event occurring near the origins of the papilionoid lineage20. Recently, the Legume Phylogeny Working Group (LPWG) refined the classification of the Leguminosae family into six subfamilies: Caesalpinioideae, Cercidoideae, Detarioideae, Dialioideae, Duparquetioideae, and Faboideae21, with soybean assigned to the subfamily Faboideae.
To establish the phylogenetic relatedness of legume PIF proteins, soybean (Glycine max), common bean (Phaseolus vulgaris), barrel clover (Medicago truncatula), and peanut (Arachis duranensis) sequences were extracted. All these plants belong to the Faboideae subfamily; soybean, common bean, and peanut are short day plants, whereas Medicago is a long day plant22. Phylogenetic analysis using the neighbor-joining algorithm revealed that different soybean PIFs group into different clades (PIF4, PIF3, and PIF8; Fig. 1A) and include the signature PIF4 sequence of Arabidopsis. Soybean PIF4s grouped into two clades, GmPIF4 I and GmPIF4 II (marked with asterisks in Fig. 1A), with GmPIF4a, GmPIF4b, GmPIF4c, and GmPIF4d grouping into GmPIF4 I and GmPIF4e, GmPIF4f, and GmPIF4g grouping into GmPIF4 II. Similarly, PIF3 and PIF8 were classified based on their relatedness to the signature PIF3 and PIF8 Arabidopsis sequences. Their position in the tree indicated that these multiple PIF copies in soybean may have evolved at different evolutionary points. Some of the PIF4s in soybean retain family-specific relatedness because of the early legume genome duplication event, while the other PIFs arose more recently due to a soybean-specific duplication event. Soybean PIFs grouped more closely with the common bean PIFs than with those of Medicago and peanut, consistent with the common bean being a closer relative21.
Analysis of GmPIF protein sequence motifs
Ten motifs were identified and designated motifs 1–10 (Supplementary Figure 1). Motifs 5, 7, and 9 mainly distinguished GmPIF4 I from GmPIF4 II proteins (Fig. 1A). The absence of motifs 5 and 9 and the presence of motif 8 were characteristic features of PIF3s. Furthermore, motifs 3, 5, 8, and 9 were absent in PIF8s. Motif patterns help to distinguish sequences, as motif location and frequency are important for protein folding during translation. Motifs also act as recognition sequences for molecules involved in important processes such as post-translational modifications, subcellular transport and localization, and translation start and termination23.
Two whole genome duplications contributed to GmPIF gene family expansion
The genomic survey showed an uneven distribution of fifteen GmPIF genes on 11 soybean chromosomes (Fig. 1B). Chromosomes 3 and 19 had two genes each, chromosome 10 had three genes, and the other nine genes were located on chromosomes 1, 2, 8, 13, 14, 18, and 20. Two main gene duplication types occur during evolution: tandem duplication, resulting in gene clusters; and segmental duplication, which gives rise to members scattered across the genome. A total of 5,671 putative soybean transcription factor genes have been identified, of which 9.5% show tandem duplication14. Detailed analysis of GmPIF genes revealed that two gene pairs of the PIF3 subfamily were tandemly duplicated (GmPIF3a-GmPIF3c and GmPIF3b-GmPIF3f; Fig. 1B).
We next estimated the possible duplication time of GmPIF gene pairs according to their pairwise distances (Ks values) based on previous soybean studies14. Ks values of 0.06–0.39 correspond to the 13 million years ago (Mya) Glycine lineage-specific genome duplication, Ks values of 0.40–0.80 correspond to the 59 Mya early legume whole genome duplication, and Ks values greater than 1.5 mostly correspond to the most ancient gamma event14. Based on this, four GmPIF pairs were associated with the 13 Mya Glycine lineage-specific duplication and 13 pairs were associated with the 59 Mya early legume duplication (Supplementary Table 2). Ka/Ks calculations were also performed to estimate the selection pressure on GmPIF sequences, which indicated that all GmPIFs were subjected to purifying selection pressure (Fig. 1B; Ka/Ks = 1, neutral selection; Ka/Ks < 1, purifying selection; Ka/Ks > 1, positive selection). Purifying selection results in the selective removal of deleterious alleles24.
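The Ks ranges and Ka/Ks interpretation given above can be illustrated with a minimal Python sketch. The function names and the example values are hypothetical and are not taken from Supplementary Table 2. Ks values that fall outside the quoted ranges are deliberately left unassigned.

```python
from typing import Optional


def duplication_event(ks: float) -> Optional[str]:
    """Map a Ks value to the duplication event ranges quoted in the text."""
    if 0.06 <= ks <= 0.39:
        return "Glycine lineage-specific duplication (~13 Mya)"
    if 0.40 <= ks <= 0.80:
        return "early legume duplication (~59 Mya)"
    if ks > 1.5:
        return "ancient gamma event"
    # Outside the ranges given in the text: make no assignment.
    return None


def selection_mode(ka: float, ks: float, tol: float = 1e-9) -> str:
    """Interpret Ka/Ks: <1 purifying, =1 neutral, >1 positive selection."""
    if ks <= 0:
        raise ValueError("Ks must be positive to form the Ka/Ks ratio")
    ratio = ka / ks
    if abs(ratio - 1.0) <= tol:
        return "neutral"
    return "purifying" if ratio < 1.0 else "positive"


# Hypothetical gene pair, for illustration only.
print(duplication_event(0.12), "|", selection_mode(0.03, 0.12))
```

Under these assumptions, a pair with Ks = 0.12 and Ka = 0.03 would be assigned to the Glycine lineage-specific duplication and classified as being under purifying selection.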
Finally, we investigated duplication blocks between the soybean PIF genes and those of its close relative, the common bean (Phaseolus vulgaris). Fourteen GmPIF genes formed putative orthologous relationships with four PvPIF genes. All showed Ka/Ks values < 1, indicating purifying selection (Fig. 1C). Duplication events thus played a significant role in the expansion of the legume PIF gene family.
bHLH domain alignment shows conserved and unique amino acid residues in GmPIFs
Protein sequence alignment revealed the presence of the highly conserved bHLH domain in all soybean GmPIF proteins. Plant bHLH proteins bind to their target sequences at G-box (5′-CACGTG-3′) motifs, a subset of the E-box motif (5′-CANNTG-3′). This binding event is characterized by contact of a glutamic acid residue (E) at position 9 of the basic stretch with the CA nucleotides of the E/G box25,26. While the E residue at position 5 of the basic stretch was conserved in all GmPIF4s and GmPIF3s, it was replaced by alanine (A) in GmPIF8s. Alignment also indicated the presence of different amino acid residues in the basic region of the bHLH domain, with the basic amino acid arginine (R) conserved in all GmPIFs except GmPIF4g, in which it was replaced with histidine (H). Another difference was the presence of asparagine (N) in GmPIF4a, GmPIF4b, GmPIF4c, and GmPIF4d, which was replaced by serine (S) or glycine (G) in the remaining soybean PIFs. These differences in basic region amino acids allow the bHLH proteins to discriminate their target DNA (Fig. 2)27.
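The relationship between the two motifs (every G-box is an E-box, but not the reverse) can be shown with a short Python sketch that scans a DNA string for CANNTG and labels the CACGTG subset. The function name and the test sequence are hypothetical and serve only as an illustration; they do not represent a promoter analyzed in this study.

```python
import re
from typing import List, Tuple

# E-box consensus CANNTG; the lookahead allows overlapping sites to be found.
E_BOX_PATTERN = re.compile(r"(?=(CA[ACGT]{2}TG))")
G_BOX = "CACGTG"


def find_boxes(seq: str) -> List[Tuple[int, str, str]]:
    """Return (0-based position, motif, label) for every E-box in seq."""
    hits = []
    for match in E_BOX_PATTERN.finditer(seq.upper()):
        motif = match.group(1)
        label = "G-box" if motif == G_BOX else "E-box"
        hits.append((match.start(), motif, label))
    return hits


# Hypothetical sequence containing one G-box and one non-G-box E-box.
for hit in find_boxes("ttCACGTGaaCAGCTGtt"):
    print(hit)
```

Only the forward strand is scanned here; since both consensus sequences are palindromic in their fixed positions, a reverse-strand scan would mainly matter for the variable NN positions.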
Differential response of GmPIFs during floral transition and transcript abundance in different soybean tissues
The RNA sequencing analysis for floral transition was performed using plants grown for ten days in long photoperiod and then exposed for one day to short photoperiod (SD) for floral induction. Leaf and shoot apical meristem (SAM) samples were collected at SD-0, SD-1, SD-2, and SD-4. Thirteen of the fifteen GmPIFs were differentially regulated during floral transition. GmPIF4f, GmPIF4g, and GmPIF3c were abundantly expressed in the SAM, while GmPIF4a-e were expressed in the leaves. GmPIF3a, b, and f were previously shown to be highly regulated in leaves28. Tissue-specific expression analysis showed that GmPIF8b, GmPIF3b, GmPIF4c, and GmPIF4d were expressed in leaves, while GmPIF8a, GmPIF4g, GmPIF4f, and GmPIF3a were present in leaves, flowers, and young pods. GmPIF3f was the only transcript observed in seeds, and no GmPIF transcript was observed at late developmental stages (Fig. 3)29.
Long day specific diurnal rhythm of GmPIF4 transcripts
Soybean leaves were sampled every four hours to examine whether soybean PIF4s were diurnally regulated. Under long days, all transcripts showed differential responses during the day and night. During the light period, GmPIF4b transcript abundance significantly declined at 8 h compared to 0 h, and GmPIF4c and GmPIF4g showed significant decreases at 12 h. However, these transcripts were re-expressed in the last four hours of the day, i.e., between 12 and 16 h (Fig. 4A–F). This is consistent with the long day behavior of Arabidopsis thaliana PIF4 transcripts, which re-accumulate on prolonged exposure to light, and indicates that decreases in PIF4 levels upon light exposure are transient6. GmPIF4a and GmPIF4d did not show typical PIF4-like expression under the long day photoperiod.
Short day specific diurnal rhythm of GmPIF4 transcripts
Samples were collected every 4 hours to study PIF4 transcription patterns after one short day treatment. One short day treatment was sufficient to alter the expression of GmPIF4a, GmPIF4b, and GmPIF4c (Fig. 4G–I), which accumulated during the dark (just before the day breaks), consistent with previous reports on the expression of PIF4 in Arabidopsis during short days30. However, GmPIF4f and GmPIF4g showed no diurnal fluctuations under short day conditions (Fig. 4K,L). Often, duplication events silence the function of an ancient gene, with selective pressure giving rise to homologs with new functions14.
Expression of GmPIF4 transcripts at different temperatures under long and short day conditions
A temperature-dependent role for PIF4 in flowering and blue light responses in Arabidopsis has been reported12. To investigate how soybean PIF4 genes respond to temperature under flower-inducing short photoperiod conditions, the expression levels of soybean PIF4 transcripts were analyzed at 25 °C, 30 °C, and 35 °C under long and short-day conditions. There were no significant changes in transcript levels under long day conditions except for GmPIF4f and GmPIF4g, which showed an increase at 30 °C compared to 25 °C (Fig. 4M–R). However, under short day conditions, GmPIF4a, GmPIF4c, GmPIF4d, and GmPIF4g transcripts significantly increased at 35 °C compared to 25 °C (Fig. 4M–R). According to the thermosensory activation model of flowering in Arabidopsis, PIF4 integrates short day photoperiod signals and combines them with the ambient temperature signal30 under the control of the endogenous clock. Kumar et al.30 proposed that, at higher temperatures, PIF4 directly interacts with FLOWERING LOCUS T (FT, florigen) to activate the flowering pathway in Arabidopsis. Further, temperature-based changes in PIF4 transcripts are rate limiting for the biological response, because H2A.Z nucleosomes decrease the accessibility of PIF4 to the FT promoter at cool temperatures30. Since soybean is a warm climate plant requiring short day conditions for floral induction, the increase in GmPIF4 transcript abundance at 35 °C (short day) indicates a possible role for soybean PIF4s in high temperature-mediated initiation of flowering.
Ectopic expression of GmPIF4b in Arabidopsis Col-0 plants
Analysis of RNA-seq data of soybean plants undergoing floral transition showed that GmPIF4b was differentially regulated in leaves. Hence, to further characterize gene function, GmPIF4b was expressed ectopically in Arabidopsis Col-0 plants, and transgenic lines were studied under long day 22 °C and short day 25 °C conditions. Transgenic lines had longer hypocotyls at SD-25 °C and flowered 8–10 days earlier than wild type lines under short day conditions. However, expression of GmPIF4b had no effect under long day conditions, indicating a conserved function for GmPIF4 (Fig. 5)31.
Complementation of GmPIF4b in the Arabidopsis pif4-101 mutant background
pif4-101 mutants have a T-DNA insertion in exon 5 of the Arabidopsis PIF4 gene. These mutant plants have shorter hypocotyls in the dark and a compact rosette (reduced petiole length) phenotype6. We transformed the pif4-101 Arabidopsis mutant with the 35S::GmPIF4b::polyA construct for a gain-of-function analysis. Hypocotyl length was recorded in seedlings, and petioles were measured to assess rosette size. GmPIF4b partially rescued the mutant phenotype for both hypocotyl and petiole lengths under short day 25 °C conditions. Petiole length was almost 8 mm in wild-type, 2.6 mm in pif4-101, and 6 mm in complemented lines, and hypocotyl length in complemented lines was 0.86 times that of the WT (Fig. 6).
GmPIF4b protein levels peak four hours before dawn under both long and short-day conditions
To study the diurnal rhythm of PIF4 protein, protein expression was assessed every 4 h under long and short day conditions. GmPIF4b transcript was more abundant in the leaves of the plants grown under short day conditions compared to long day conditions. However, GmPIF4b transcripts followed a strict diurnal rhythm under both photoperiod conditions. For both conditions, protein levels peaked four hours before dawn (Fig. 7). Arabidopsis PIF4 levels are known to peak during the night due to superimposition of the clock and photoperiodic pathways3, and PIF4 is thought to be under the control of the evening complex. Further, the TOC1 component of the clock binds to PIF4 in the evening and inactivates it in Arabidopsis9. Here, the GmPIF4b protein expression rhythm in soybean was similar to that in Arabidopsis. GmPIF4b protein also showed the highest expression in soybean leaves at SD-1, suggesting involvement in floral transition. RNA-seq studies have previously indicated major reprogramming during floral transition, especially when the SAM converts from the vegetative to the reproductive stage after 4–6 short day treatments32.
A GmPIF4b variant observed at elevated temperatures shows unique temperature adaptations in soybean
Arabidopsis lines containing the 35S::PIF4:HA construct have been reported to contain slightly higher PIF4 protein levels at 27 °C than at 12 °C and 22 °C30. To study the effect of temperature on PIF4 protein expression, soybean plants were treated with a range of temperatures (25 °C to 35 °C), reflecting that soybean is a warm temperature crop, with ambient temperatures for soybean grown at different latitudes often exceeding 30 °C. A GmPIF4b variant of different molecular weight was observed following exposure of plants to higher temperatures (Fig. 7C). The higher molecular weight variant observed in response to higher temperature might reflect a protein modification that merits further experimental evaluation.
Discussion
Soybean is a major leguminous crop used to produce a significant amount of vegetable oil and protein for human consumption and fodder for animals. Soy products are increasingly used as meat and milk substitutes globally. Hence, the demand for breeding high-yield varieties of this commercially important crop in our changing environment is increasing. To refine yields, a full understanding of the key regulators of flowering and development is essential. PIF4 is a bHLH transcription factor that is thought to act as an integrating hub for light and temperature signals in Arabidopsis. However, its role in important crops such as soybean, a paleopolyploid, has yet to be investigated.
Two gene duplication events occurred in the soybean genome nearly 59 and 13 million years ago, which were followed by gene diversification, loss, and numerous chromosomal rearrangements leading to 75% of soybean genes being present as multiple copies14. Here we extracted fifteen GmPIF transcription factor genes from the Phytozome database and compared their sequences at both the nucleotide and amino acid levels. GmPIFs could be grouped into three significant subfamily clades (GmPIF4, GmPIF3, and GmPIF8) based on their conserved protein sequences. GmPIF4 could be further divided into two groups, GmPIF4 I and GmPIF4 II, based on sequence motif organization. This sequence-level observation supports the hypothesis that these transcription factors have undergone significant changes during evolution. Overall, there are estimated to be 31,264 gene paralogs in soybean, which may have developed from substitution and transversion events14.
PIF transcription factors use their bHLH domain to bind DNA and regulate their downstream targets. Our detailed comparison of this conserved domain for all the GmPIF protein sequences highlighted amino acid variations within the bHLH domains of these proteins. These variations in conserved domains suggest that these transcription factors are likely to have different protein binding specificities.
Gene duplication analysis of the GmPIF family revealed that GmPIF genes expanded during both Glycine lineage-specific and early legume duplication events nearly 13 Mya and 59 Mya, respectively. Synteny of GmPIFs with common bean (Phaseolus vulgaris) PvPIFs was also evaluated to study the selective pressure on these genes, which showed that the Ka/Ks ratios for all Gm-Pv gene pairs were below 1, confirming purifying selection pressure.
Gene duplication serves as a mechanism to increase functional diversity33. In a paleopolyploid plant such as the soybean, these duplication events often lead to divergent expression patterns of closely related genes34. We found that the expression of these transcription factors varied in response to photoperiod and temperature stimuli. In Arabidopsis thaliana, PIF4 transcription has been studied under both short day and warm conditions30. Soybean is a facultative short-day plant requiring warm temperatures for floral initiation. Hence, we focused on studying GmPIF4 transcription under short day conditions, under which four GmPIF4s showed similar expression to Arabidopsis PIF4, i.e., peaking at the end of the night phase (at dawn). All four GmPIF4s belong to the GmPIF4 I group; however, two GmPIF4s belonging to the GmPIF4 II clade did not follow a typical diurnal rhythm. A coincidence model has been proposed to understand short day-specific flowering in Arabidopsis, where PIF4 accumulates at the end of the night on short days due to coincidence between the internal (circadian rhythm) and external (photoperiod) cues3. During the light phase, PIF4 interacts with phytochromes and is degraded to switch on phytochrome signaling-mediated downstream processes7. In soybean, short days promote a shift from the vegetative to reproductive phase and hence control flowering32. Our data on GmPIF4 I group transcription are consistent with the coincidence model, thus pointing towards conservation of gene function. Consistent with this, ectopic expression in Arabidopsis Col-0 plants of GmPIF4b, which is differentially regulated during soybean floral transition, resulted in longer hypocotyls and an early flowering phenotype under short day 25 °C conditions, and GmPIF4b partially restored the hypocotyl length and compact rosette phenotypes of Arabidopsis pif4-101 mutants.
Protein expression of GmPIF4b peaked four hours before dawn under both long and short photoperiod conditions, indicating superimposition of the biological clock in controlling GmPIF4 expression in soybean plants. A unique higher molecular weight GmPIF4 variant was observed following treatment of soybean plants at higher temperatures, indicating involvement of post-translational modifications in regulating GmPIF4b protein levels at high temperatures.
Hence, apart from the general functions of PIF4 in plants, this protein may participate in novel legume-specific development and function in soybean plants. Further detailed interaction analyses and metabolomic and proteomic-based studies are needed. Functional analysis of individual PIF4 genes would uncover their specific roles in soybean development. This study paves the way for future research into specific biological functions of GmPIF4s in soybean development and floral transition.
Methods
Identification, phylogenetic analysis and sub-cellular localization prediction of soybean PIF family
PIF genes were identified by searching the keywords “PIF” and “Phytochrome Interacting factors” and by BLAST searches against Arabidopsis PIFs in the proteome database of the latest version of the soybean genome (Wm82.a2.v1) in Phytozome. Subsequently, all sequences with an E-value below 0.01 were kept and checked for the presence of the conserved basic helix-loop-helix (bHLH) domain using the Hidden Markov Model (HMM) profile PF00010 in the Pfam database, http://pfam.xfam.org/. A self-BLAST was performed on the resulting sequence list, and all redundant sequences were removed. Similarly, PIF sequences for other legumes such as common bean (Phaseolus vulgaris), barrel clover (Medicago truncatula), and peanut (Arachis duranensis) were also searched. The resulting sequences were listed in a table (Supplementary Table 1) and aligned using the ClustalW program with default parameters in the alignment window of MEGA7 software, http://www.megasoftware.net/ (Kumar, Stecher, and Tamura 2015). A phylogenetic tree was constructed using the PIF sequences of all the legumes and Arabidopsis using a neighbor-joining algorithm, the JTT model, and partial deletion parameters. Based on the phylogenetic analysis, the putative soybean PIFs were named according to their respective clades. The subcellular localizations of GmPIF proteins were predicted using the LOCALIZER tool of the Commonwealth Scientific and Industrial Research Organisation (CSIRO), Australia35.
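The candidate selection steps described above (E-value cutoff, bHLH domain check, redundancy removal) can be sketched in Python as follows. This is a simplified illustration under stated assumptions: the `Hit` record, the function name, and the gene identifiers are hypothetical, and redundancy is approximated by identical identifiers rather than by the self-BLAST comparison used in the study.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List


@dataclass(frozen=True)
class Hit:
    """Hypothetical record for one search hit."""
    gene_id: str
    e_value: float
    pfam_domains: FrozenSet[str]


def filter_candidates(hits: Iterable[Hit],
                      max_e: float = 0.01,
                      required_domain: str = "PF00010") -> List[str]:
    """Keep hits below the E-value cutoff that carry the bHLH profile."""
    seen = set()
    kept = []
    for hit in hits:
        if hit.e_value >= max_e:
            continue  # fails the E-value < 0.01 criterion
        if required_domain not in hit.pfam_domains:
            continue  # no bHLH (PF00010) domain detected
        if hit.gene_id in seen:
            continue  # assumption: duplicates share an identifier
        seen.add(hit.gene_id)
        kept.append(hit.gene_id)
    return kept


# Made-up hits for illustration only.
example = [
    Hit("geneA", 1e-40, frozenset({"PF00010"})),
    Hit("geneB", 0.5, frozenset({"PF00010"})),
    Hit("geneC", 1e-12, frozenset()),
    Hit("geneA", 1e-38, frozenset({"PF00010"})),
]
print(filter_candidates(example))
```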
Conserved protein motif search
A MEME search (http://meme-suite.org/tools/meme) was used to identify and compare protein motifs36. Motif length was restricted to 6–100 amino acids. Motifs were detected using the ZOOPS model, which assumes that each motif occurs zero or one time per sequence. A maximum of 10 motifs was searched.
Analysis of chromosome distribution, gene duplication and synteny with common bean
The chromosome distribution of soybean PIF genes was obtained from Phytozome, and duplicated genes were obtained from the Plant Genome Duplication Database (PGDD) (http://chibba.agtec.uga.edu/duplication/) by downloading the dataset of duplicated blocks in the soybean genome37. Duplicated PIF gene pairs were searched in the dataset, and their ratios of nucleotide non-synonymous (Ka) to synonymous (Ks) substitutions (Ka/Ks) were also calculated (Supplementary Table 2). Ks values were used to estimate the duplication time for soybean PIFs. Similarly, syntenic blocks between soybean and common bean were also searched. PGDD uses BLASTP to search for potential anchors (E < 1e-5, top 5 matches) between every possible pair of chromosomes in the genomes considered. The input for the MCscan synteny search tool is the set of homologous pairs38. The built-in scoring scheme for MCscan is min(−log10 E, 40) for every matching gene pair and −1 for each 10 kb distance between anchors, similar to the DAGchainer synteny tool39, and blocks that have scores >200 are kept. The resulting syntenic chains are evaluated using a procedure in ColinearScan, with E-value < 1e-10 as a significance cutoff. The data for duplicated PIF gene pairs within soybean and their putative orthologs in common bean are listed in Supplementary Table 2.
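The scoring scheme quoted above can be illustrated with a short Python sketch that scores one already-ordered chain of anchors. This is not the MCscan implementation, which also searches for the best chains. The function names are hypothetical, the distance is measured on a single chromosome only, and the gap penalty is counted in whole 10 kb units, both of which are simplifying assumptions.

```python
import math
from typing import List, Tuple

# (anchor position in bp on one chromosome, BLASTP E-value)
Anchor = Tuple[int, float]


def match_score(e_value: float, cap: float = 40.0) -> float:
    """min(-log10(E), 40); an E-value of 0 is assigned the cap."""
    if e_value <= 0.0:
        return cap
    return min(-math.log10(e_value), cap)


def chain_score(anchors: List[Anchor], gap_unit_bp: int = 10_000) -> float:
    """Sum match scores and subtract 1 per full 10 kb between neighbors."""
    score = 0.0
    previous_position = None
    for position, e_value in anchors:
        score += match_score(e_value)
        if previous_position is not None:
            score -= abs(position - previous_position) // gap_unit_bp
        previous_position = position
    return score


def keep_block(anchors: List[Anchor], threshold: float = 200.0) -> bool:
    """Blocks are retained only when their score exceeds 200."""
    return chain_score(anchors) > threshold


# Hypothetical three-anchor chain, for illustration only.
chain = [(0, 1e-50), (25_000, 1e-30), (40_000, 1e-45)]
print(chain_score(chain), keep_block(chain))
```

Because each anchor contributes at most 40, at least six closely spaced, highly significant anchors are needed under this scheme before a block can pass the threshold of 200.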
Multiple sequence alignment of the bHLH domain of GmPIFs
The bHLH domain was identified after aligning the sequences of all 15 PIFs using the Clustal alignment option in Jalview software40. The logo of the bHLH domain was obtained from the Pfam database of protein HMMs41.
RNA seq data analysis for soybean undergoing floral transition and expression in different tissues
The RNA seq data for the soybean undergoing floral transition was obtained from previously published research (Wong et al. 2013). RNA sequencing data for the expression of GmPIFs in different tissues was obtained from soybase https://www.soybase.org/29. The RNA sequencing reads have been listed in Supplementary Table 3. The heat maps were constructed using MORPHEUS tool of the Broad Institute (https://software.broadinstitute.org/morpheus/).
Plant material, treatments, and expression analysis using qRT-PCR
For photoperiod-dependent expression analysis, two sets of soybean (Glycine max [L.] Merrill) cv. Bragg plants were grown for 10 long days (16 h light, 8 h dark) at 25 °C and 400 µmol m−2 s−1 light intensity. On the 11th day, one set of plants was subjected to one short-day (8 h light, 16 h dark) treatment. Leaf samples (from three different plants within a set) from both sets were harvested every 4 hours for 24 hours.
For temperature dependent expression analysis, six sets of the plants were grown for 10 days under long day conditions. On the 11th day, three sets were subjected to long-day at 25 °C, 30 °C and 35 °C and the other three sets were subjected to short-day at 25 °C, 30 °C and 35 °C respectively. Samples were collected at the end of the night. All the expression analyses were performed using three biologically replicated experiments.
Total RNA was extracted from the leaf samples using the Trizol method, and cDNA was synthesized using Superscript III reverse transcriptase (Invitrogen). SYBR-Green master mix from Agilent Technologies was used. The expression data for GmPIF4a, GmPIF4b, GmPIF4c, GmPIF4d, GmPIF4f, and GmPIF4g transcripts were normalized against the expression of the Glycine max Actin gene (Glyma.08G146500.1)42. Supplementary Figure 4 shows that this actin gene is not regulated diurnally or in response to heat treatment.
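The text does not state which quantification model was used for normalization. As one common possibility, and purely as an assumption, the sketch below applies the 2^(−ΔCt) calculation, in which the mean Ct of the reference gene (here actin) is subtracted from the mean Ct of the target transcript. The function name and the Ct values are hypothetical.

```python
from statistics import mean
from typing import Sequence


def relative_expression(target_ct: Sequence[float],
                        reference_ct: Sequence[float]) -> float:
    """Relative expression as 2^(-dCt), with dCt = mean target Ct - mean reference Ct.

    Assumption: roughly 100% amplification efficiency for both primer pairs.
    """
    if not target_ct or not reference_ct:
        raise ValueError("at least one Ct value is needed for each gene")
    delta_ct = mean(target_ct) - mean(reference_ct)
    return 2.0 ** (-delta_ct)


# Made-up technical replicates for one sample.
print(relative_expression([24.1, 23.9, 24.0], [20.0, 20.1, 19.9]))
```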
GmPIF4b over-expression construct, Arabidopsis transformation, and transgenic line analysis
Total RNA was extracted from soybean leaf tissue. The amplified DNA was cloned downstream of the constitutive 35S promoter, and the resulting 35S::GmPIF4b::polyA construct was used for plant transformation. Arabidopsis plants (wild type and mutants) were grown in soil under a long day photoperiod at 22 °C for 4 weeks (until flowering started). The first inflorescence was cut off to promote flowering on lateral branches because the floral dip method was used for Arabidopsis transformation43. Arabidopsis seeds obtained from the T0 generation were grown for 7 days and, on the 8th day, the seedlings were sprayed with the herbicide glufosinate (Basta) to select transgenic lines. A strong YFP signal was observed in the surviving plants (observed in blue light). This generation (T1) was examined for the expression of GmPIF4b by qRT-PCR. Similarly, complemented lines were obtained by transforming Arabidopsis pif4-101 mutants with the GmPIF4b over-expression construct. Hypocotyl lengths were analyzed using ImageJ software (114 pixels were scaled to 1 cm).
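The image scale stated above (114 pixels per 1 cm) corresponds to a simple unit conversion, sketched here in Python. The function name and the example pixel length are hypothetical. In practice the conversion is done by setting the scale within ImageJ.

```python
PIXELS_PER_CM = 114.0  # scale stated in the text: 114 pixels = 1 cm


def pixels_to_mm(length_px: float) -> float:
    """Convert a length measured in pixels to millimetres."""
    if length_px < 0:
        raise ValueError("length in pixels cannot be negative")
    return length_px / PIXELS_PER_CM * 10.0


# A hypothetical hypocotyl traced as 57 pixels corresponds to about 5 mm.
print(round(pixels_to_mm(57), 2))
```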
Production of rabbit polyclonal antibody against the GmPIF4b protein, plant nuclear protein extraction, and immunoblotting
The GmPIF4b gene construct, codon-optimized by GenScript (Hong Kong), was used to express the protein in E. coli. The recombinant protein was quantified on a BSA standard curve and used for immunization of rabbits. The antibody was purified from total sera (Supplementary Figure 2). Nuclear protein was extracted according to Haerizadeh et al.44. For immunoblotting, 1.5 µg of total nuclear protein from soybean was probed with primary antibody (anti-GmPIF4b, developed in our lab) and secondary antibody (anti-rabbit IgG). The blots were imaged using a LI-COR western blot imager (800 nm channel). Two independent experiments were performed to check the validity of the western blots. Dot blot analysis of Arabidopsis transgenic lines over-expressing GmPIF4b showed reactivity with the GmPIF4b antibody, whereas wild type Arabidopsis (Col-0) showed no reactivity (Supplementary Figure 6).
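Quantification on a BSA standard curve usually means fitting a line to the signals of known BSA amounts and reading the unknown sample off that line. The sketch below assumes a linear response and an ordinary least-squares fit; the assay type, the function names, and all numeric values are assumptions, since the text does not give them.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two paired points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("standards must span more than one concentration")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def concentration_from_signal(signal, slope, intercept):
    """Invert the standard curve to estimate protein concentration."""
    return (signal - intercept) / slope


if __name__ == "__main__":
    # Hypothetical BSA standards (concentration) and measured signals
    bsa_conc = [0.0, 1.0, 2.0, 3.0]
    bsa_signal = [0.1, 0.6, 1.1, 1.6]
    slope, intercept = fit_line(bsa_conc, bsa_signal)
    print(concentration_from_signal(0.85, slope, intercept))
```

The estimate is only meaningful for signals that fall within the range covered by the standards.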
Data availability
Data described in this study can be obtained from the corresponding author by request.
Acknowledgements
This work was supported by The Australian Research Council Discovery Grant, ARC DP0988972.
Author Contributions
P.L.B. and M.B.S. conceived and designed the experiments; H.A. performed the experiments; H.A., M.B.S. and P.L.B. wrote the manuscript.
Competing Interests
The authors declare no competing interests.
Footnotes
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Supplementary information accompanies this paper at 10.1038/s41598-018-30043-2.
References
- 1. Al-Sady B, Ni W, Kircher S, Schafer E, Quail PH. Photoactivated phytochrome induces rapid PIF3 phosphorylation prior to proteasome-mediated degradation. Molecular cell. 2006;23:439–446. doi: 10.1016/j.molcel.2006.06.011.
- 2. McClung CR. Plant Circadian Rhythms. The Plant Cell. 2006;18:792–803. doi: 10.1105/tpc.106.040980.
- 3. Nomoto Y, Kubozono S, Yamashino T, Nakamichi N, Mizuno T. Circadian clock- and PIF4-controlled plant growth: a coincidence mechanism directly integrates a hormone signaling network into the photoperiodic control of plant architectures in Arabidopsis thaliana. Plant and Cell Physiology. 2012;53:1950–1964. doi: 10.1093/pcp/pcs137.
- 4. Leivar P, Monte E. PIFs: systems integrators in plant development. The Plant Cell Online. 2014;26:56–78. doi: 10.1105/tpc.113.120857.
- 5. Pires N, Dolan L. Origin and diversification of basic-helix-loop-helix proteins in plants. Molecular biology and evolution. 2010;27:862–874. doi: 10.1093/molbev/msp288.
- 6. Lorrain S, Allen T, Duek PD, Whitelam GC, Fankhauser C. Phytochrome-mediated inhibition of shade avoidance involves degradation of growth-promoting bHLH transcription factors. The Plant journal: for cell and molecular biology. 2008;53:312–323. doi: 10.1111/j.1365-313X.2007.03341.x.
- 7. Gommers CMM, Visser EJW, Onge KRS, Voesenek LACJ, Pierik R. Shade tolerance: when growing tall is not an option. Trends in Plant Science. 2013;18:65–71. doi: 10.1016/j.tplants.2012.09.008.
- 8. Nusinow DA, et al. The ELF4-ELF3-LUX complex links the circadian clock to diurnal control of hypocotyl growth. Nature. 2011;475:398–402. doi: 10.1038/nature10182.
- 9. Zhu J-Y, Oh E, Wang T, Wang Z-Y. TOC1–PIF4 interaction mediates the circadian gating of thermoresponsive growth in Arabidopsis. Nature Communications. 2016;7:13692. doi: 10.1038/ncomms13692.
- 10. Franklin KA, et al. Phytochrome-interacting factor 4 (PIF4) regulates auxin biosynthesis at high temperature. Proceedings of the National Academy of Sciences. 2011;108:20231–20235. doi: 10.1073/pnas.1110682108.
- 11. Galvão VC, Collani S, Horrer D, Schmid M. Gibberellic acid signaling is required for ambient temperature‐mediated induction of flowering in Arabidopsis thaliana. The Plant Journal. 2015;84:949–962. doi: 10.1111/tpj.13051.
- 12. Ma D, et al. Cryptochrome 1 interacts with PIF4 to regulate high temperature-mediated hypocotyl elongation in response to blue light. Proceedings of the National Academy of Sciences. 2016;113:224–229. doi: 10.1073/pnas.1511437113.
- 13. Sakuraba Y, et al. Phytochrome-interacting transcription factors PIF4 and PIF5 induce leaf senescence in Arabidopsis. Nature Communications. 2014;5:5636. doi: 10.1038/ncomms5636.
- 14. Schmutz J, et al. Genome sequence of the palaeopolyploid soybean. Nature. 2010;463:178–183. doi: 10.1038/nature08670.
- 15. Chandna R, et al. Class-Specific Evolution and Transcriptional Differentiation of 14-3-3 Family Members in Mesohexaploid Brassica rapa. Frontiers in Plant Science. 2016;7:12. doi: 10.3389/fpls.2016.00012.
- 16. Watanabe S, Harada K, Abe J. Genetic and molecular bases of photoperiod responses of flowering in soybean. Breeding Science. 2012;61:531–543. doi: 10.1270/jsbbs.61.531.
- 17. Xia Z, et al. Positional cloning and characterization reveal the molecular basis for soybean maturity locus E1 that regulates photoperiodic flowering. Proceedings of the National Academy of Sciences. 2012;109:E2155–E2164. doi: 10.1073/pnas.1117982109.
- 18. Watanabe S, et al. Map-Based Cloning of the Gene Associated With the Soybean Maturity Locus E3. Genetics. 2009;182:1251–1262. doi: 10.1534/genetics.108.098772.
- 19. Lu S, et al. Natural variation at the soybean J locus improves adaptation to the tropics and enhances yield. Nature genetics. 2017;49:773–779. doi: 10.1038/ng.3819.
- 20. Lavin M, Herendeen PS, Wojciechowski MF. Evolutionary Rates Analysis of Leguminosae Implicates a Rapid Diversification of Lineages during the Tertiary. Systematic Biology. 2005;54:575–594. doi: 10.1080/10635150590947131.
- 21. Azani N, et al. A new subfamily classification of the Leguminosae based on a taxonomically comprehensive phylogeny. Taxon. 2017;66:44–77. doi: 10.12705/661.3.
- 22. Weller JL, Ortega R. Genetic control of flowering time in legumes. Frontiers in Plant Science. 2015;6:207. doi: 10.3389/fpls.2015.00207.
- 23. Mills CL, Beuning PJ, Ondrechen MJ. Biochemical functional predictions for protein structures of unknown or uncertain function. Computational and Structural Biotechnology Journal. 2015;13:182–191. doi: 10.1016/j.csbj.2015.02.003.
- 24. Lynch M, Conery JS. The evolutionary fate and consequences of duplicate genes. Science. 2000;290:1151–1155. doi: 10.1126/science.290.5494.1151.
- 25. Ferre-D’Amare AR, Prendergast GC, Ziff EB, Burley SK. Recognition by Max of its cognate DNA through a dimeric b/HLH/Z domain. Nature. 1993;363:38–45. doi: 10.1038/363038a0.
- 26. Giuliano G, et al. An evolutionarily conserved protein binding sequence upstream of a plant light-regulated gene. Proceedings of the National Academy of Sciences of the United States of America. 1988;85:7089–7093. doi: 10.1073/pnas.85.19.7089.
- 27. Atchley WR, Terhalle W, Dress A. Positional dependence, cliques, and predictive motifs in the bHLH protein domain. Journal of molecular evolution. 1999;48:501–516. doi: 10.1007/PL00006494.
- 28. Wong CE, Singh MB, Bhalla PL. The dynamics of soybean leaf and shoot apical meristem transcriptome undergoing floral initiation process. PLOS ONE. 2013;8(6):e65319. doi: 10.1371/journal.pone.0065319.
- 29. Severin AJ, et al. RNA-Seq Atlas of Glycine max: A guide to the soybean transcriptome. BMC plant biology. 2010;10:160. doi: 10.1186/1471-2229-10-160.
- 30. Kumar SV, et al. Transcription factor PIF4 controls the thermosensory activation of flowering. Nature. 2012;484:242–245. doi: 10.1038/nature10928.
- 31. Wigge PA. Ambient temperature signalling in plants. Current Opinion in Plant Biology. 2013;16:661–666. doi: 10.1016/j.pbi.2013.08.004.
- 32. Wong CE, Singh MB, Bhalla PL. Molecular processes underlying the floral transition in the soybean shoot apical meristem. The Plant journal: for cell and molecular biology. 2009;57:832–845. doi: 10.1111/j.1365-313X.2008.03730.x.
- 33. Qin C, et al. Genome-Wide Identification and Expression Analysis of the 14-3-3 Family Genes in Medicago truncatula. Frontiers in Plant Science. 2016;7:320. doi: 10.3389/fpls.2016.00320.
- 34. Yue R, et al. Identification and expression profiling analysis of calmodulin-binding transcription activator genes in maize (Zea mays L.) under abiotic and biotic stresses. Frontiers in Plant Science. 2015;6:576. doi: 10.3389/fpls.2015.00576.
- 35. Sperschneider J, et al. LOCALIZER: subcellular localization prediction of both plant and effector proteins in the plant cell. Scientific Reports. 2017;7:44598. doi: 10.1038/srep44598.
- 36. Bailey TL, et al. MEME Suite: tools for motif discovery and searching. Nucleic Acids Research. 2009;37:W202–W208. doi: 10.1093/nar/gkp335.
- 37. Lee T-H, Tang H, Wang X, Paterson AH. PGDD: a database of gene and genome duplication in plants. Nucleic Acids Research. 2013;41:D1152–D1158. doi: 10.1093/nar/gks1104.
- 38. Tang H, et al. Synteny and Collinearity in Plant Genomes. Science. 2008;320:486–488. doi: 10.1126/science.1153917.
- 39. Haas BJ, Delcher AL, Wortman JR, Salzberg SL. DAGchainer: a tool for mining segmental genome duplications and synteny. Bioinformatics. 2004;20:3643–3646. doi: 10.1093/bioinformatics/bth397.
- 40. Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ. Jalview Version 2—a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009;25:1189–1191. doi: 10.1093/bioinformatics/btp033.
- 41. Finn RD, et al. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Research. 2016;44:D279–D285. doi: 10.1093/nar/gkv1344.
- 42. Liew LC, Singh MB, Bhalla PL. A novel role of the soybean clock gene LUX ARRHYTHMO in male reproductive development. Scientific Reports. 2017;7:10605. doi: 10.1038/s41598-017-10823-y.
- 43. Zhang X, Henriques R, Lin S-S, Niu Q-W, Chua N-H. Agrobacterium-mediated transformation of Arabidopsis thaliana using the floral dip method. Nature Protocols. 2006;1:641. doi: 10.1038/nprot.2006.97.
- 44. Haerizadeh F, Singh MB, Bhalla PL. Transcriptional Repression Distinguishes Somatic from Germ Cell Lineages in a Plant. Science. 2006;313:496–499. doi: 10.1126/science.1125526.