Abstract
Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development.
Introduction
Homeobox genes are known to play an important role in body plan specification of higher organisms during early stages of embryogenesis. Initially, homeobox genes were isolated from the fruit fly, Drosophila melanogaster, but later these genes were identified in diverse organisms, like nematodes, fungi, plants and humans [1,2]. Homeobox genes encode a conserved 60 amino acid (aa) long DNA-binding domain, known as homeodomain (HD). In animal and plant genomes, homeobox genes are represented by a large gene family [2,3]. The characteristic three-dimensional structure of HD contains three alpha-helices, of which the second and third helices form a helix-turn-helix DNA-binding motif [4,5].
Based on the conserved amino acid sequence of HD along with the presence of other characteristic motifs, homeobox genes have been categorized into different groups. There are “typical” HD, characterized by a length of 60 amino acids and “atypical” HD having variation in amino acid length [6]. One such “atypical” HD has been named TALE (Three Amino acid Loop Extension) which is of 63 aa, having three extra residues between helices 1 and 2 [7,8]. Earlier, HD proteins were classified into the following classes; namely, KNOX, BELL, ZM-HOX, HAT, AT-HB8 and GL2 [9]. In another study, homeobox genes in rice were classified into ten subfamilies, namely HD-Zip I, HD-Zip II, HD-Zip III, HD-Zip IV, BLH, KNOX I, KNOX II, WOX, ZF-HD and PHD [10]. Furthermore, a comprehensive study on plant homeobox genes was also conducted where they were classified into 14 classes, including some new classes, such as NDX, DDT, PHD, LD, SAWADEE and PINTOX [3].
Members of plant homeobox gene family are known to participate in several developmental processes. Many members of HD-Zip I class are critical components regulating cotyledon development, leaf cell fate determination and blue light signalling [11,12]. Some HD-Zip II class members are involved in shade avoidance responses [13]. The members of HD-Zip III class are involved in apical meristem formation, vascular development and maintenance of adaxial or abaxial polarity of leaves and embryo [14]. HD-Zip IV proteins play primary role in the formation of outer cell layers of plant organs, in addition to controlling processes of anthocyanin pigmentation and maintenance of epidermal layer [15,16]. KNOX family members are known to have well-defined roles in shoot apical meristem maintenance [17]. They have been reported to interact with BEL family members to regulate hormone homeostasis [18]. WOX family members in Arabidopsis mark cell fate during early embryonic patterning and some of the members are known to be involved in stem cell maintenance and organogenesis [19,20]. WUSCHEL protein has been linked with cell differentiation during anther development [21]. The ZF-HD family members have been implicated in floral developmental processes in Arabidopsis [22]. Interestingly, a member of NDX class in soybean showed cell-specific expression pattern in nodules, highlighting its role in nodule development [23].
Legumes are important crop plants possessing the unique ability to fix atmospheric nitrogen. In addition, being rich source of proteins, legumes are very important for human diet. Although, homeobox genes have been identified in various plant species and characterized to some extent [3,10,24–26], their genome level analysis in legumes is lacking as of now. Recently, genome sequences of many legume plants have become available, which provided an opportunity for detailed characterization of homeobox genes in this important family of plants. In the present study, we identified homeobox genes in five legumes, including chickpea (Cicer arietinum), soybean (Glycine max), Medicago (Medicago truncatula), Lotus (Lotus japonicus) and pigeonpea (Cajanus cajan). On the basis of domain architecture and phylogenetic relationship, homeobox genes identified in legumes were classified into 14 different classes. Expression profiles of these genes were examined in different tissues/organs during various stages of development and in response to environmental cues. Moreover, analysis of whole genome duplication events provided insights into the expansion of soybean homeobox gene family. This study furnishes valuable information about homeobox gene family in legumes to facilitate functional analysis.
Materials and Methods
Screening of genomic resources for identification of homeobox genes in legumes
Homeobox genes in rice and Arabidopsis were retrieved from the previous studies [3, 10], and a non-redundant set of homeobox genes were retained for analysis (S1 Table). Proteome sequences of legume crops, chickpea (CGAP_v1.0: http://nipgr.res.in/CGAP/home.php) [27], soybean (Gmax_189; ftp://ftp.jgi-psf.org/pub/compgen/phytozome/v9.0/Gmax) [28], Medicago (Mtruncatula_198; ftp://ftp.jgi-psf.org/pub/compgen/phytozome/v9.0/Mtruncatula) [29], Lotus (build 2.5; www.kazusa.or.jp/Lotus/) [30] and pigeonpea (v-5.0; www.icrisat.org/gt-bt/iipg/home.html) [31] were downloaded from their respective databases. The homeobox protein sequences from Arabidopsis and rice were taken as query and searched against proteomes of different legumes via BLASTP. In addition, proteomes of legumes were searched against hidden Markov model (HMM) profiles of homeobox domain (PF00046) and zinc-finger HD (PF04770) via HMMER. Both similarity searches were performed at an e-value cut-off of ± 1e-05. The protein sequences obtained from above two approaches were concatenated and redundant entries were removed in order to create a non-redundant set of putative homeobox proteins for each legume. To confirm the presence of HD and identify other conserved domains, homeobox proteins from each legume were further subjected to domain search via SMART (www.smart.embl-heidelberg.de) and Pfam (www.pfam.sanger.ac.uk).
Phylogenetic analysis and identification of conserved motifs
Multiple sequence alignment tool, CLUSTALX (v2.1; www.clustal.org/clustal2), was employed and phylogenetic trees were constructed using the neighbour-joining (NJ) method [32]. A gap open penalty of 10 and gap extension penalty of 0.2 were used for sequence alignment. Bootstrap analysis was performed using 1000 replicates and the tree was visualized using FigTree (v1.4.0; www.tree.bio.ed.ac.uk/software/figtree/). Conserved motifs other than HD, present in different classes of HD proteins were identified using MEME (Multiple EM for Motif Elicitation) Suite and viewed by MAST (Motif Alignment and Search Tool).
Expression profiling of homeobox genes in legumes
We analyzed the expression patterns of homeobox genes from RNA-seq experiments conducted previously in soybean [33,34] and chickpea [27,35]. To study the expression profiles of soybean homeobox genes in response to biotic and abiotic stresses, the microarray data, available in Genevestigator v.3 (https://www.genevestigator.com/gv/plant.jsp) and Gene Expression Omnibus (accession number GSE40627) were used. The expression patterns of homeobox genes in Lotus and Medicago were analyzed using Lotus japonicus gene expression atlas (LjGEA) [36] and Medicago truncatula gene expression atlas (MtGEA) [37], respectively. Probe set IDs corresponding to Lotus homeobox genes were identified by BLASTN utility available at LjGEA, whereas probe set IDs corresponding to Medicago homeobox genes were identified using online Plexdb Blast (BLASTN) utility (http://www.plexdb.org/). For genes with more than one probe set ID, the probe showing better e-value and higher identity was considered (S1 Table). For Medicago, we analyzed RNA-seq data of vegetative and reproductive tissues also from a previous study [29]. Normalized data obtained from different studies were log2 transformed to generate heat-maps using MultiExperiment Viewer (MeV) software (v4.8.1).
For quantitative reverse transcription polymerase chain reaction (qRT-PCR) analysis, chickpea (Cicer arietinum L. genotype ICC4958) seeds were grown as described previously [38]. Different chickpea tissues/organs (shoot, root, stem, mature leaf, mature flower and young pod) were collected from plants as described [27]. For desiccation and salinity stress treatments, 10-day-old chickpea seedlings were transferred on folds of tissue paper and beaker containing 150 mM NaCl solution, respectively, at 22±1°C. For cold treatment, the seedlings were kept in water at 4±1°C. The control seedlings were kept at 22±1°C as described [35]. Root and shoot tissues were collected from stressed and control seedlings after 5 h of treatment. At least two independent biological replicates of each tissue sample were harvested and total RNA was isolated using TRI reagent (Sigma Life Sciences) according to the manufacturer’s instructions. Assessment of the quality and quantity of each RNA sample was done using NanoVue (GE Healthcare). Sequences of primer pairs used in this study have been listed in S2 Table. The qRT-PCR was performed following the protocol described previously [38]. The transcript level of each gene in different tissue samples was normalized with the transcript level of the most suitable internal control gene, elongation factor 1-alpha (EF-1a) [38]. The correlation between expression profiles of selected genes obtained from qRT-PCR and RNA-seq analysis was determined using R programming environment.
Identification of cis-regulatory elements in chickpea and soybean
Genomic coordinates of chickpea and soybean genes were determined from genome annotation file (gff file) and the promoter sequence (2 kb) of each gene was retrieved using in-house perl script from their respective genome sequences. Cis-regulatory elements present in the promoter sequence of homeobox genes were scanned at PLACE web server (http://www.dna.affrc.go.jp/). In addition, the known binding sites/motifs of HD-Zip I (AH1, CAAT(A/T)ATTG and/or AH2, CAAT(C/G)ATTG) and of HD-Zip II (AH2) class homeobox proteins were identified in the promoter sequences of all chickpea and soybean genes using custom perl script. Further, coexpression analysis of HD-Zip I and HD-Zip II subfamily genes in chickpea and soybean with genes harbouring AH1 and/or AH2 motifs in their promoter regions was carried out using R programming [33–35]. The genes with a Pearson correlation coefficient ≥ 0.7 and p-value of ≤ 0.05 were designated as significantly correlated.
Genome localization and gene duplication
To determine the location of homeobox genes onto chromosomes, coordinates of individual genes were obtained from genome annotation file (gff file) of respective legumes. The list of homeobox genes in duplicated genomic regions and Ka/Ks values for each duplicated gene for soybean were retrieved from batch download option of Plant Genome Duplication Database (PGDD; http://chibba.agtec.uga.edu/). The Ks values have been calculated using Nei-Gojobori method implemented in PAML package following CLUSTALW and PAL2NAL alignments. The duplicated homeobox genes in soybean were visualized using Circos software (http://circos.ca/). The expression patterns of homeologous homeobox genes were extracted from RNA-seq data as described earlier.
Results and Discussion
Homeobox genes in legumes and their classification
The non-redundant set of homeobox genes from Arabidopsis (113) and rice (113) belonging to 14 classes were extracted from previous reports (S1 Table). Based on BLASTP and HMM profile searches followed by confirmation of the presence of HD, we identified a total of 89 homeobox genes in chickpea, 276 in soybean, 82 in Medicago, 92 in Lotus and 137 in pigeonpea (Table 1). The homeobox genes in legumes were found to be distributed in 14 different classes, including two superclasses, i.e. HD-Zip (HD-Zip I, HD-Zip II, HD-Zip III and HD-Zip IV) and TALE (KNOX and BEL), and eight classes, i.e. PLINC (ZF-HD), WOX, DDT, PHD, NDX, LD, PINTOX and SAWADEE as reported previously for plants [3]. A schematic representation of domain composition of different classes of homeobox genes is depicted in Fig. 1. The total number of homeobox genes identified in soybean was the highest (276 genes). At least one member was identified for each of the 14 classes in soybean, Medicago and pigeonpea, whereas homeobox genes of chickpea were distributed in 12 classes (no member in NDX and SAWADEE classes). In Lotus, there was no member identified for LD and NDX classes, and one member was kept under the category of “unclassified”, as it did not possess any known characteristic domain other than HD (Table 1). Identification of lesser number of homeobox genes in chickpea (89) and Lotus (93), may be due to their incomplete (~70%) draft genome sequence available as of now.
Table 1. Classification of homeobox gene family members in different legumes, Arabidopsis and rice.
Class | Chickpea | Soybean | Medicago | Lotus | Pigeonpea | Arabidopsis | Rice |
---|---|---|---|---|---|---|---|
HD-ZIP I | 12 | 35 | 11 | 9 | 18 | 17 | 14 |
HD-ZIP II | 9 | 27 | 7 | 9 | 14 | 10 | 14 |
HD-ZIP III | 5 | 12 | 5 | 4 | 6 | 5 | 9 |
HD-ZIP IV | 8 | 31 | 15 | 16 | 15 | 16 | 12 |
PLINC | 14 | 51 | 14 | 23 | 20 | 17 | 14 |
WOX | 14 | 33 | 10 | 12 | 19 | 16 | 14 |
BEL | 12 | 34 | 4 | 6 | 17 | 13 | 14 |
KNOX | 5 | 28 | 6 | 5 | 16 | 8 | 12 |
DDT | 6 | 13 | 5 | 3 | 5 | 4 | 3 |
PHD | 2 | 6 | 1 | 2 | 2 | 2 | 2 |
PINTOX | 1 | 2 | 1 | 1 | 1 | 1 | 1 |
LD | 1 | 1 | 1 | 0 | 1 | 1 | 1 |
NDX | 0 | 1 | 2 | 0 | 1 | 1 | 1 |
SAWADEE | 0 | 2 | 1 | 2 | 2 | 2 | 2 |
Unclassified | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
Total | 89 | 276 | 83 | 93 | 137 | 113 | 113 |
Number of members identified in each class (based on domain composition and phylogenetic relationship) are given for each plant.
The members of homeobox gene family have been predicted in different legumes and are available in databases, namely PlantTFDB and LegumeTFDB. However, the number of homeobox genes reported in our study is much higher. PlantTFDB reports the existence of 77 homeobox genes in chickpea, 183 in soybean, 62 in Medicago, 64 in Lotus and 82 in pigeonpea, whereas LegumeTFDB shows the presence of 260 homeobox genes in soybean, 62 in Medicago and 80 in Lotus. Moreover, SoybeanTFDB reports the existence of 269 homeobox proteins in soybean as compared to 276 homeobox proteins identified in our study. Overall, these differences may be due to the robust methodology of identification employed or latest versions of the genome annotation used in our study. The complete details, including gene identifier, classification, conserved domain(s), genomic location, protein length and genomic coordinates of the homeobox genes identified in different legumes are enlisted in S1 Table.
Super-class HD-Zip was found to have the maximum representative members among homeobox genes in legumes similar to other plants. We identified 105 members of HD-Zip superclass in soybean as compared to 88 members in a previous report [39]. This difference in number of HD-Zip proteins may be attributed to more robust methodology employed for identification in our study. In HD-Zip superclass, leucine-Zipper (LZ) domain is known to mediate protein-protein interactions. Additional characteristic domains present besides HD are known to perform specific functions as well. CPSCE motif in HD-Zip II class acts as a redox sensor [40], ZIBEL motif mediates interaction between HD-Zip II proteins and BEL HD proteins or similar targets [3]. In HD-Zip III class, MEKHLA domain is speculated to be involved in oxygen redox and light signalling [41]. HD-Zip IV proteins containing START (STeroidogenic Acute Regulatory protein-related lipid Transfer) domain and HD-SAD (START Associated conserved Domain) domain (Fig. 1) possess putative lipid binding capability [42] and transcriptional activation property, respectively [43].
The second largest class of homeobox proteins was represented by PLINC (Plant Zinc Finger, previously called ZF-HD). These proteins contain two highly conserved zinc-finger-like motifs upstream to HD (Fig. 1), which are involved in protein-protein interaction by mediating homo- and hetero-dimerization [44]. Maximum members of PLINC class are present in soybean (51) followed by Lotus (23) and pigeonpea (20) (Table 1). KNOX and BEL class HD proteins belonging to the superclass TALE harbor three extra amino acid residues (just before PYP) in the loop connecting the first and second helices of the HD (Fig. 2). In total, 73 and 60 proteins of legumes were classified into BEL and KNOX classes, respectively (Table 1). BEL or BEL1-like homeobox (BLH) class proteins harbor a domain of unknown function, called POX domain (S1 Table) towards N-terminal of HD. It has been proposed that this co-domain is a bipartite domain composed of BEL-A and BEL-B (Fig. 1) [3]. We detected a highly conserved 10 aa motif named “ZIBEL” present at both the ends (C-terminal and N-terminal) of BEL proteins (Fig. 1). KNOX domain residing towards N-terminal, is composed of two conserved stretches (KNOX A/I and KNOX B/II, separated by a variable region) and ELK domain upstream to HD (Fig. 1) [7]. The ELK, KNOX A and KNOX B domains are required for nuclear localization, target gene suppression and homo-dimerization, respectively [45,46]. Notably, the number of TALE superclass proteins was significantly lesser (10) in Medicago as compared to other legumes (Table 1). The biological significance of this difference in Medicago remains to be elucidated.
WOX proteins contain one extra residue between helices 1 and 2, and four extra residues between helices 2 and 3 (Fig. 2). WUS-box motif, a sequence of eight conserved residues (TLPLFPMH) is present towards C-terminus of HD (Fig. 1) [20]. WOX proteins showed presence of an acidic amino acid stretch between HD and WUS box apart from other distinctive conserved motifs. DDT homeobox proteins harbor eight additional conserved motifs, named D-TOX A to H, distributed over the entire length of the protein, in addition to the DDT domain (Fig. 1). This class consists of longest plant homeobox proteins with sequences of length up to ~1800 aa (S1 Table). In plants, classification of DDT proteins was done into three subclasses (D-TOX 1, D-TOX 2 and D-TOX 3) [3]. Among these, D-TOX 3 was considered as eudicot-specific that has lost all characteristic motifs of DDT class except D-TOX A motif. We also made similar observations within DDT class, where one of the two major clades of DDT (adjacent to BEL in S1 Fig.) had members harboring only D-TOX A motif. The maximum length of this group of DDT proteins was of 546 aa, thereby deviating from basic characteristics of DDT HD class.
Chromosomal localization and phylogenetic analysis
Chromosomal localization of all the homeobox genes was analyzed in different legumes (S1 Table). It was seen that Arabidopsis and rice homeobox genes were unevenly distributed on the chromosomes (S1 Table). In legumes, apart from soybean and Medicago, a large number of homeobox genes in chickpea, Lotus and pigeonpea were found to be mapped to unanchored scaffolds. Out of 89 chickpea homeobox genes, less than 45% (38 genes) were assigned to eight chickpea linkage groups, while the rest were located on scaffolds. This percentage was higher for pigeonpea, where 82 of the 137 homeobox genes mapped onto eleven linkage groups (S1 Table). This may be due to availability of incomplete genome sequence of these legumes. All but two soybean homeobox genes were located on the twenty chromosomes. On the other hand, 82 of 83 homeobox genes of Medicago were found distributed on eight chromosomes, with chromosome 6 harboring only two genes (S1 Table). Evidently, some chromosomes in these legumes show sparsely situated homeobox genes, whereas some chromosomes possess dense distribution of homeobox genes.
To study the evolutionary relationship among homeobox proteins, an unrooted phylogenetic tree was constructed after multiple sequence alignment of full-length 791 homeobox proteins from five legumes and Arabidopsis using CLUSTALX. The members of homeobox gene family were distinctly clustered into 14 classes (Fig. 3), supporting our domain composition based classification. The detailed phylogenetic tree with bootstrap values and gene identifiers has been presented in S1 Fig. The phylogenetic tree generated using only the homeobox domain sequences also supported the clustering of almost all the proteins in 14 classes (S2 Fig.). Phylogenetic relationship also revealed that many homeobox proteins representing various classes showed very high homology within or across the species (S1 and S2 Figs.). It has been suggested that proteins with higher homology within a class/subfamily may perform similar functions [47].
Differential expression of homeobox genes during development
To gain insights into the putative function of homeobox genes in different legumes, their expression patterns were analyzed in various tissues/organs/developmental stages. In chickpea, RNA-seq data [35] analysis revealed differential expression of homeobox genes in several tissues and/or organs. Homeobox genes of four classes, namely DDT, LD, PHD and PINTOX showed more or less uniform expression pattern across all the tissues/organs analyzed (Fig. 4A). The members of HD-Zip I class were found to be highly up-regulated in mature flower, thereby suggesting their role in flower development. Previously, it has been reported that Arabidopsis HD-Zip I members are expressed in diverse developmental stages, with only few genes showing tissue/organ specific expression. For example, ATHB53 was specifically expressed in roots and flowers and ATHB13 was detected in seedling, leaves and flowers only [11]. Many of the HD-Zip II, HD-Zip IV and WOX class genes were found to be expressed at lower level in the chickpea tissues/organs analyzed. Most prominently, expression patterns of HD-Zip IV genes showed considerable down-regulation in roots of chickpea (Fig. 4A). A few chickpea homeobox genes exhibited tissue-specific/preferential gene expression as well. For example, Ca_17060 (HD-Zip IV member) was expressed in young pod, Ca_02032 and Ca01318 (WOX family members) were expressed in young pod and flower, and root, respectively. HD-Zip IV members are known to be involved in shoot and reproductive developmental processes besides maintenance of epidermal cell layer [15]. GLABRA2 (GL2), a HD-Zip IV member in Arabidopsis, was found to play a crucial role in root hair development in addition to leaf epidermis patterning [48]. Interestingly, many WOX class members have also been implicated in root and flower development in Arabidopsis [49]. To validate the results of differential gene expression analysis obtained from RNA-seq data, we performed qRT-PCR analysis of at least 11 randomly selected differentially expressed genes in six different tissues/organs of chickpea. qRT-PCR analysis revealed similar expression patterns of all the selected genes as observed in RNA-seq data. The statistical analysis also showed a very good agreement (correlation coefficient of 0.75) between the results of qRT-PCR and RNA-seq data analysis (Fig. 4B).
For soybean, we investigated the global expression profile of homeobox genes using the previously reported RNA-seq data [33,34]. The homeobox genes in soybean also exhibited diverse expression patterns, including low to high level, tissue-specific and/or preferential expression in one or more tissue sample analyzed. Except members of HD-Zip superclass (HD-Zip I-IV) and few members of PLINC and DDT class, homeobox proteins were found to be expressed at low levels in root tip (S3 Fig.). Nearly all the HD-Zip IV proteins, except Glyma08g09430, Glyma15g13950, Glyma09g02990, Glyma09g03000 and Glyma08g09440, were up-regulated in shoot apical meristem, thereby suggesting their role in shoot apical meristem maintenance. In Arabidopsis, HD-Zip III members were found to be responsible for shoot apical meristem maintenance and polarisation of leaf cell primordia [14,50]. Only PLINC and few HD-Zip I members were found to be active in different stages of seed and pod shell development in soybean. It was previously also reported that members of PLINC class coordinate floral development in Arabidopsis [22]. In addition, a fraction of soybean BEL proteins were expressed at moderate levels in later stages of seed development (S3 Fig.). However, BEL proteins were highly expressed in nodule, flower and leaf. In fact, BEL family members, along with HD-Zip-I, KNOX and DDT genes were found to be up-regulated in nodules, thereby suggesting their active participation in root nodule development related biological processes. WOX proteins were least expressed among soybean homeobox gene family members in different tissue/organ/developmental stages (S3 Fig.). Interestingly, it has recently been reported that a BEL1-type homeobox gene, SH5, induces seed shattering by development of abscission zone and suppression of lignin biosynthesis [51].
For expression profiling of Medicago and Lotus homeobox proteins, microarray data from their respective gene expression atlases were analyzed. In Lotus, expression analysis of different tissue/organ and developmental stages of pods and seeds were undertaken. Interestingly, one member of PLINC class of Lotus, chr2.CM1835.100.r2.m, was highly up-regulated in developmental stages of pod and seed (S4A Fig.). Similarly, few members of PLINC class in Medicago were also found to be up-regulated in developmental stages of pods and seeds (S4B Fig.). These observations suggest that PLINC proteins may have a conclusive role pertaining to seed setting in legume pods. A recent study in Medicago truncatula highlighted the role of several members of HD-Zip class, WOX and KNOX class in early embryo development [52].
Genome duplication and expression patterns of duplicated homeobox genes
Whole genome duplication events in plants have been considered as a mechanism of diversification and adaptation to the environment [53]. However, functions of duplicated genes are poorly understood. In legumes, soybean has undergone genome duplication two times (59 and 13 million years ago), which resulted in emergence of multiple copies of ~75% of soybean genes [28]. Relatively higher number of homeobox genes identified in soybean as compared to other legume species may be due to these whole genome duplication events. We found that members of homeobox gene family are distributed preferentially in duplicated blocks in soybean (S4 Table). A total of 246 (89.1%) homeobox genes in soybean were found located on duplicated chromosomal blocks. Interestingly, we could not locate even a single event of tandem duplication in soybean. These observations suggest that segmental duplication has played an important role in expansion of soybean homeobox gene family, since this process allows retention of numerous duplicated genes in the genome [54]. The duplicated gene pairs of respective classes of homeobox gene family have been represented pictorially in Fig. 5A. The non-synonymous/synonymous substitution ratio (Ka/Ks) tells us about the selective evolutionary pressure acting on a gene. Majority (96.3%) of the gene pairs were found to have Ka/Ks < 1 suggesting their evolution to be under the influence of purifying selection (S4 Table). The purifying selection has also been previously observed for HD-Zip proteins in soybean [39] and poplar [26].
Events of gene duplication may serve as a crucial mechanism to increase the functional diversity of gene family due to spatial and/or temporal changes in gene expression. Differences in gene expression pattern may result from non-functionalization, sub-functionalization or neo-functionalization of duplicated genes. Similar evidences have been reported in other model plants like Arabidopsis [55]. We observed that majority of soybean homeobox family duplicated genes were differentially expressed in tissue/organ/developmental stages analyzed (S5 Fig.). Likewise, differential expression of nearly 50% of 17547 duplicated genes in soybean was observed across seven tissues, thereby suggesting sub-functionalization [56]. Based on gene expression patterns, we observed three types of functional variations in homeologous gene pairs in soybean. For instance, sub-functionalization was observed in Glyma01g38650/Glyma02g06730 with no expression of Glyma02g06730 in root hairs. In Glyma10g10040/Glyma12g10030 gene pair, Glyma12g10030 expression decreased to basal level in root, root hair and nodule in contrast to Glyma10g10040. In addition, instances of neo-functionalization were observed in Glyma04g01150/Glyma04g03150 and Glyma04g01150/Glyma06g03200 gene pairs, where expression of Glyma04g03150 and Glyma06g03200 could be detected in root hairs and nodules, contrary to Glyma04g01150. The phenomenon of non-functionalization was exhibited by Glyma04g33640/Glyma19g02610 gene pair, where Glyma19g02610 was expressed in reproductive tissues, however, Glyma04g33640, did not show expression in most of the developmental stages analyzed (Fig. 5B). These observations imply that the evolutionary fate of soybean homeobox genes have been closely regulated by gene duplication events. Overall, these analyses indicate that purifying selection has majorly contributed in retention and maintenance of duplicated gene pairs during evolution. Moreover, expression profiling of duplicated soybean homeobox proteins highlighted that majority of them have undergone sub-functionalization. Such observations are consistent with other plant species, where closely related genes, have been shown to display diverse expression patterns [10,39].
Differential expression of homeobox genes in response to abiotic and biotic stresses
Crop production is often adversely affected by several abiotic stress factors like desiccation, salinity and extremes of temperatures. Since homeobox genes are known to play an important role in abiotic stress responses, we analyzed the expression profile of chickpea homeobox genes in root and shoot tissues subjected to desiccation, salinity and cold stresses using RNA-seq data [35]. Out of 89 chickpea homeobox genes, 44 were found to be significantly differentially regulated in root and/or shoot tissues subjected to at least one of the abiotic stress conditions. Overall, more number of homeobox genes were up-regulated in root tissues subjected to salinity stress as compared to desiccation and cold stresses. All HD-Zip II members were reasonably up-regulated under salinity stress in root tissues, thereby suggesting their possible role in salinity stress responses (Fig. 6A). However, HD-Zip II genes showed no change in their expression pattern in root tissues under desiccation stress. Highest fold change in root tissue was recorded for Ca_08006, a member of PLINC class, when subjected to salinity stress. Cold stress did not alter the expression level of most homeobox genes. Only a few members showed differential expression in root tissues in response to cold. However, no considerable alteration in transcript levels could be detected in shoot tissues under cold stress. Under desiccation stress, HD-Zip I genes were highly up-regulated in shoot tissues as compared to root tissues (Fig. 6A). Similar observations have been made in Arabidopsis, where transcript levels of HD-Zip I members (ATHB7 and ATHB12) increased tremendously in response to desiccation stress [57]. However, Ca_06148 and Ca_00550 were found to be downregulated in response to all the abiotic stress conditions analyzed in either of the tissues. Highest upregulation in shoot tissue was recorded for Ca_19899, a member of HD-Zip I class, when subjected to desiccation stress. However, this gene was greatly up-regulated during salinity stress as compared to desiccation stress in root tissues (Fig. 6A). It has been reported that a cotton homeobox gene, GhHB1, is specifically expressed in root tissues and gets up-regulated under exogenous salinity treatment [58]. Similarly, differential expression of many homeobox genes during abiotic stress conditions has been reported in various plant species [10,24,35,59]. We performed qRT-PCR analysis of six randomly selected differentially expressed homeobox genes in root and shoot tissues of chickpea during desiccation, salinity and cold stress conditions to validate the results obtained from RNA-seq data (Fig. 6B). The qRT-PCR analysis revealed similar differential expression patterns of all the selected genes as observed in RNA-seq data showing good correlation between the results of qRT-PCR and RNA-seq data analysis. These results suggest that homeobox genes may prove to be suitable candidates for engineering abiotic stress tolerance in crop plants.
In a previous study, genome-wide transcriptome analysis reported the differential expression of several genes in soybean leaf tissue under drought stress at late developmental stages [60]. We utilised the microarray data from this study in order to understand the role of soybean homeobox genes in abiotic stress responses. Of the 276 soybean homeobox genes, 50 genes were found to be significantly differentially expressed in at least one of the conditions analysed. Among them, 17 genes were specifically differentially expressed at either late vegetative stage or full bloom reproductive stage and 16 homeobox genes were commonly differentially expressed at both the developmental stages of soybean (Fig. 7A).
Apart from abiotic stress factors, a wide range of biotic stress factors, like virus, bacteria, fungi, and nematode severely damage the crop productivity. Transcript levels of homeobox genes are altered under biotic stresses as well [61,62]. Hence, we analyzed the expression profile of soybean homeobox genes under biotic stress conditions using microarray data from Genevestigator v.3. Many soybean homeobox genes, were found to be significantly differentially regulated in response to at least one of the conditions analyzed. A maximum number of HD-Zip I class homeobox genes followed by BEL class and HD-Zip II class members were differentially expressed due to biotic stress factors (Fig. 7B). For example, among HD-Zip I proteins, Glyma16g02390 was found to be highly upregulated by Aphis glycines and Phytopthora sojae infection, whereas Glyma08g40970, Glyma18g01830, Glyma19g01300 and Glyma01g05230 were significantly downregulated in response to Heterodera glycines and P. sojae infection. Several KNOX members, namely, Glyma17g14180, Glyma04g06810, Glyma09g01000 and Glyma14gg37550 were differentially expressed in response to H. glycines, Phakopsora pachyrhizi and P. sojae infection. In addition, elevated transcript levels of some PLINC class members (Glyma01g03060, Glyma01g05810 and Glyma05g01060) in response to P. pachyrhizi, P. sojae and H. glycines infection, was also observed. The above expression profiling suggests a distinctive role of homeobox genes in biotic stress responses. As of now, only some reports have provided preliminary evidence of biotic stress-responsiveness of homeobox genes across various plant species and speculate their role in pathogen resistance [61,62]. Thus, the involvement of homeobox genes in pathogen-related responses needs to be explored in greater detail.
Transcription factors can act as master regulators as they can regulate the expression of several genes via binding to their promoter sequences. However, transcription factors may themselves be under the control of other upstream regulators, which may bind to promoter region of homeobox genes thereby regulating the cascade of reactions occurring during various biological processes in plants. We carried out a cis-regulatory element search in promoter regions (1 kb upstream) of homeobox genes in chickpea and soybean. Several cis-regulatory elements primarily known to be involved in various processes of plant development like leaf, shoot and root development were detected in the promoter sequences analysed (S3 Table). In addition, some seed specific cis-regulatory elements have been found in promoter regions of many homeobox genes. Further, existence of characteristic stress-responsive cis-regulatory elements, like ABRE, DRE and/or LTRE suggested stress-responsive regulation of these genes. Interestingly, auxin-responsive elements were also detected in the promoters of some homeobox genes in chickpea and soybean. The presence of such cis-regulatory elements suggests that homeobox genes may play pivotal role in various developmental processes, hormonal crosstalk and abiotic stress responses in legumes as well.
Overall, homeobox genes have been established as critical regulators of plant development. So far, many among them have emerged to play significant role in specific stress responses at various stages of plant development [10,25,57,62,63]. We also observed the differential/specific expression of many homeobox genes in different tissues/organ/developmental stage and abiotic/biotic stress conditions. Thus, homeobox genes are speculated to coordinate both developmental processes and stress-adaptive pathways in plants [10,64].
Identification of putative downstream targets of HD-Zip I and HD-Zip II proteins
During abiotic stress conditions, some members of HD-Zip superclass are reported to bind specifically to cis-regulatory elements, thereby regulating the action of several downstream genes [65]. Investigations of DNA-binding specificities and dimerization properties of HD-Zip family members in Arabidopsis revealed that HD-Zip I members have the ability to bind to CAAT(A/T)ATTG (AH1) and/or CAAT(C/G)ATTG (AH2) motif(s), whereas HD-Zip II members can bind to AH2 motif [66,67].
Since, the binding sites of HD-Zip I and HD-Zip II class homeobox proteins are well documented, we scanned 2 kb upstream sequences of all the genes in soybean and chickpea to identify the presence of AH1 and/or AH2 motif(s). In total, 3,971 soybean genes were found to harbor at least one or more of these motifs, signifying the potential downstream targets of HD-Zip I and HD-Zip II class members. AH1 and/or AH2 motifs were present in promoters of 2671 and 1379 genes, respectively. A total of 1320 genes were found to harbor these motifs in chickpea. These genes are speculated to be probable target genes of HD-Zip I and HD-Zip II proteins. GOSlim analysis revealed that genes involved in various developmental processes, response to abiotic and biotic stress, and various enzymatic activities were most represented among the AH1 and/or AH2 motif harboring genes in their promoters (S6 Fig.), suggesting that these genes might be the putative targets of homeobox transcription factors. Coexpression analysis revealed 21 chickpea homeobox genes to be significantly (with Pearson correlation coefficient cut-off of 0.7 and p-value ≤ 0.05) coexpressed with 816 other chickpea genes harbouring AH1 and/or AH2 motifs, whereas, 152 soybean homeobox genes were found to be significantly coexpressed with 2093 soybean genes harbouring AH1 and/or AH2 motifs (S5 Table). Notably, in chickpea and soybean, 83 and 292 genes were found to be highly positively (≥ 0.95) correlated. At least, 113 chickpea genes showed significant negative correlation with homeobox genes (≤ -0.95) respectively, whereas not a single gene exhibited negative correlation ≤ (-0.90) with homeobox genes (S7 Fig.). The coexpression of a large number of other genes suggests the involvement of HD-Zip I and HD-Zip II class proteins in a complex transcriptional regulatory network responsible for various cellular processes. Due to lack of definite knowledge of binding specificity and suitable experimental evidence, such analysis could not be carried out for homeobox proteins belonging to other classes.
Interestingly, in rice, previous reports have established that HD-Zip I and HD-Zip II proteins bind to AH1 and/or AH2 motifs [68]. Very recently, it was also identified that an abiotic stress-responsive gene belonging to HD-Zip I class, Oshox22, could bind to either and/or AH2 motif(s) suggesting that homeobox genes across different plant species govern regulation of many downstream target genes via binding to AH1 and/or AH2 motif(s) [69]. In addition to abiotic stress responses, homeobox proteins possess ability to bind to cis-regulatory elements due to hormonal induction too. In Medicago truncatula, during lateral root emergence, a HD-Zip I family member, MtHB1, was shown to bind to AH1 motif located in promoter of LOB-like gene, LBD1, which is regulated transcriptionally by auxin [70]. The presence of such motifs in promoters of numerous genes suggests a preferential regulation via homeobox transcription factors. In future, these in-silico identifications of cis-regulatory elements would require experimental validation.
In conclusion, the comprehensive analysis of homeobox gene family members in legumes has generated a rich repertoire of knowledge for future investigation. Transcript profiling in legumes reiterated the diverse role of homeobox genes in biology of various tissues/organs/developmental stages and stress responses in legumes. Gene duplication analysis revealed that whole genome duplication events have resulted in expansion of homeobox gene family in soybean, which may have seemingly contributed to functional diversification in course of evolution. This fact was substantiated by analysis of expression profiles of duplicated soybean homeobox genes. Overall, the current study has built a foundation to initiate detailed investigations pertaining to biological functions of homeobox genes in legumes.
Supporting Information
Acknowledgments
AB acknowledges the award of research fellowship from the Council of Scientific and Industrial Research, New Delhi. RG acknowledges INSPIRE Faculty Award from the Department of Science and Technology, Government of India.
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
The study was funded by the Department of Biotechnology, Government of India, under the Next Generation Challenge Programme on Chickpea Genomics (grant number BT/PR12919/AGR/02/676/2009 from 2009–2014). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Gehring WJ, Affolter M, Burglin TR. Homeodomain proteins. Annu Rev Biochem. 1994; 63: 487–526. [DOI] [PubMed] [Google Scholar]
- 2. Nam J, Nei M. Evolutionary change of the numbers of homeobox genes in bilateral animals. Mol Biol Evol. 2005; 22: 2386–2394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Mukherjee K, Brocchieri L, Burglin TR. A comprehensive classification and evolutionary analysis of plant homeobox genes. Mol Biol Evol. 2009; 26: 2775–2794. 10.1093/molbev/msp201 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Desplan C, Theis J, O’Farrell PH. The sequence specificity of homeodomain-DNA interaction. Cell 1988; 54: 1081–1090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Otting G, Qian YQ, Billeter M, Muller M, Affolter M, Gehring W, et al. Protein-DNA contacts in the structure of a homeodomain–DNA complex determined by nuclear magnetic resonance spectroscopy in solution. EMBO J. 1990; 9: 3085–3092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Burglin TR. A comprehensive classification of homeobox genes In: Duboue D, editor. Guidebook to the homeobox genes. Oxford: Oxford University Press; 1994. pp. 25–71. [Google Scholar]
- 7. Burglin TR. Analysis of TALE superclass homeobox genes (MEIS, PBC, KNOX, Iroquois, TGIF) reveals a novel domain conserved between plants and animals. Nucleic Acids Res. 1997; 25: 4173–4180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Chen H, Rosin FM, Prat S, Hannapel DJ. Interacting transcription factors from the three-amino acid loop extension superclass regulate tuber formation. Plant Physiol. 2003; 132: 1391–1404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Bharathan G, Janssen BJ, Kellogg EA, Sinha N. Did homeodomain proteins duplicate before the origin of angiosperms, fungi, and metazoa? Proc Natl Acad Sci USA. 1997; 94: 13749–13753. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Jain M, Tyagi AK, Khurana JP. Genome-wide identification, classification, evolutionary expansion and expression analyses of homeobox genes in rice. FEBS J. 2008; 275: 2845–2861. 10.1111/j.1742-4658.2008.06424.x [DOI] [PubMed] [Google Scholar]
- 11. Henriksson E, Olsson A, Johannesson H, Hanson J, Engstrom P. Homeodomain leucine zipper class I genes in Arabidopsis expression patterns and phylogenetic relationships. Plant Physiol. 2005; 139: 509–518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Wang Y, Henriksson E, Soderman E, Henriksson KN, Sundberg E, Engstrom P. The Arabidopsis homeobox gene, ATHB16, regulates leaf development and the sensitivity to photoperiod in Arabidopsis . Dev Biol. 2003; 264: 228–239. [DOI] [PubMed] [Google Scholar]
- 13. Steindler C, Matteucci A, Sessa G, Weimar T, Ohgishi M, Aoyama T, et al. Shade avoidance responses are mediated by the ATHB-2 HD-Zip protein, a negative regulator of gene expression. Development 1999; 126: 4235–4245. [DOI] [PubMed] [Google Scholar]
- 14. Prigge MJ, Otsuga D, Alonso JM, Ecker JR, Drews GN, Clark SE. Class III homeodomain leucine zipper gene family members have overlapping, antagonistic and distinct roles in Arabidopsis development. Plant Cell 2005; 17: 61–76. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Nakamura M, Katsumata H, Abe M, Yabe N, Komeda Y, Yamamoto KT, et al. Characterization of the class IV homeodomain leucine zipper gene family in Arabidopsis . Plant Physiol. 2006; 141: 1363–1375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Chew W, Hrmova M, Lopato S. Role of homeodomain leucine zipper (HD-Zip) IV transcription factors in plant development and plant protection from deleterious environmental factors. Int J Mol Sci. 2013; 14: 8122–8147. 10.3390/ijms14048122 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Hay A, Tsiantis M. KNOX genes: Versatile regulators of plant development and diversity. Development 2010; 137: 3153–3165. 10.1242/dev.030049 [DOI] [PubMed] [Google Scholar]
- 18. Hake S, Smith HM, Holtan H, Magnani E, Mele G, Ramirez J. The role of knox genes in plant development. Annu Rev Cell Dev Biol. 2004; 20: 125–151. [DOI] [PubMed] [Google Scholar]
- 19. van der Graaff E, Laux T, Rensing SA. The WUS homeobox containing (WOX) protein family. Genome Biol. 2009; 10: 248 10.1186/gb-2009-10-12-248 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Haecker A, Gross-Hardt R, Geiges B, Sarkar A, Breuninger H, et al. Expression dynamics of WOX genes mark cell fate decisions during early embryonic patterning in Arabidopsis thaliana . Development 2004; 131: 657–668. [DOI] [PubMed] [Google Scholar]
- 21. Deyhle F, Sarkar AK, Tucker EJ, Laux T. WUSCHEL regulates cell differentiation during anther development. Dev Biol. 2007; 302: 154–159. [DOI] [PubMed] [Google Scholar]
- 22. Tan QK, Irish VF. The Arabidopsis zinc finger-homeodomain genes encode proteins with unique biochemical properties that are coordinately expressed during floral development. Plant Physiol. 2006; 140: 1095–1108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Jorgensen JE, Gronlund M, Pallisgaard N, Larsen K, Marcker KA, Jensen EO. A new class of plant homeobox genes is expressed in specific regions of determinate symbiotic root nodules. Plant Mol Biol. 1999; 40: 65–77. [DOI] [PubMed] [Google Scholar]
- 24. Deng X, Phillips J, Meijer AH, Salamini F, Bartels D. Characterization of five novel dehydration responsive homeodomain leucine zipper genes from the resurrection plant Craterostigma plantagineum . Plant Mol Biol. 2002; 49: 601–610. [DOI] [PubMed] [Google Scholar]
- 25. Bhattacharjee A, Jain M. Homeobox genes as potential candidates for crop improvement under abiotic stress In: Tuteja N, Gill SSS, editors. Plant Acclimation to Environmental Stress. New York, USA: Springer Science+Business Media; 2013. pp. 163–176. [Google Scholar]
- 26. Hu R, Chi X, Chai G, Kong Y, He G, Wang X, et al. Genome-wide identification, evolutionary expansion, and expression profile of homeodomain-leucine zipper gene family in Poplar (Populus trichocarpa). PLoS One 2012; 7: e31149 10.1371/journal.pone.0031149 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Jain M, Misra G, Patel RK, Priya P, Jhanwar S, Khan AW, et al. A draft genome sequence of the pulse crop chickpea (Cicer arietinum L.). Plant J. 2013; 74: 715–729. 10.1111/tpj.12173 [DOI] [PubMed] [Google Scholar]
- 28. Schmutz J, Cannon SB, Schlueter J, Ma J, Mitros T, Nelson W, et al. Genome sequence of the palaeopolyploid soybean. Nature 2010; 463: 178–183. 10.1038/nature08670 [DOI] [PubMed] [Google Scholar]
- 29. Young ND, Debelle F, Oldroyd GE, Geurts R, Cannon SB, Udvardi MK, et al. The Medicago genome provides insight into the evolution of rhizobial symbioses. Nature 2011; 480: 520–524. 10.1038/nature10625 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Sato S, Nakamura Y, Kaneko T, Asamizu E, Kato T, Nakao M, et al. Genome structure of the legume, Lotus japonicus . DNA Res. 2008; 15: 227–239. 10.1093/dnares/dsn008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Varshney RK, Chen W, Li Y, Bharti AK, Saxena RK, Schlueter JA, et al. Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop of resource-poor farmers. Nat Biotechnol. 2011; 30: 83–89. 10.1038/nbt.2022 [DOI] [PubMed] [Google Scholar]
- 32. Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987; 4: 406–425. [DOI] [PubMed] [Google Scholar]
- 33. Libault M, Farmer A, Joshi T, Takahashi K, Langley RJ, Franklin LD, et al. An integrated transcriptome atlas of the crop model Glycine max, and its use in comparative analyses in plants. Plant J. 2010; 63: 86–99. 10.1111/j.1365-313X.2010.04222.x [DOI] [PubMed] [Google Scholar]
- 34. Severin AJ, Woody JL, Bolon YT, Joseph B, Diers BW, Farmer AD, et al. RNA-Seq atlas of Glycine max: A guide to the soybean transcriptome. BMC Plant Biol. 2010; 10: 160 10.1186/1471-2229-10-160 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Garg R, Bhattacharjee A, Jain M. Genome-scale transcriptomic insights into molecular aspects of abiotic stress responses in chickpea. Plant Mol Biol Rep. 2014; DOI 10.1007/s11105-014-0753-x. [DOI]
- 36. Verdier J, Torres-Jerez I, Wang M, Andriankaja A, Allen SN, He J, et al. Establishment of the Lotus japonicus Gene Expression Atlas (LjGEA) and its use to explore legume seed maturation. Plant J. 2013; 74: 351–362. 10.1111/tpj.12119 [DOI] [PubMed] [Google Scholar]
- 37. Benedito VA, Torres-Jerez I, Murray JD, Andriankaja A, Allen S, Kakar K, et al. A gene expression atlas of the model legume Medicago truncatula . Plant J. 2008; 55: 504–513. 10.1111/j.1365-313X.2008.03519.x [DOI] [PubMed] [Google Scholar]
- 38. Garg R, Sahoo A, Tyagi AK, Jain M. Validation of internal control genes for quantitative gene expression studies in chickpea (Cicer arietinum L.). Biochem Biophys Res Commun. 2010; 396: 283–288. 10.1016/j.bbrc.2010.04.079 [DOI] [PubMed] [Google Scholar]
- 39. Chen X, Chen Z, Zhao H, Zhao Y, Cheng B, Xiang Y, et al. Genome-wide analysis of soybean HD-Zip gene family and expression profiling under salinity and desiccation treatments. PLoS One 2014; 9: e87156 10.1371/journal.pone.0087156 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Chan RL, Gago GM, Palena CM, Gonzalez DH. Homeoboxes in plant development. Biochim Biophys Acta. 1998; 1442: 1–19. [DOI] [PubMed] [Google Scholar]
- 41. Mukherjee K, Burglin TR. MEKHLA, a novel domain with similarity to PAS domains, is fused to plant homeodomain leucine zipper III proteins. Plant Physiol. 2006; 140: 1142–1150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Schrick K, Nguyen D, Karlowski WM, Mayer KF. START lipid/sterol-binding domains are amplified in plants and are predominantly associated with homeodomain transcription factors. Genome Biol. 2004; 5: R41 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. De Caestecker MP, Yahata T, Wang D, Parks WT, Huang S, Hilli CS, et al. The Smad 4 activation domain (SAD) is a proline-rich, p300-dependent transcriptional activation domain. J Biol Chem. 2000; 3: 2115–2122. [DOI] [PubMed] [Google Scholar]
- 44. Windhovel A, Hein I, Dabrowa R, Stockhaus J. Characterization of a novel class of plant homeodomain proteins that bind to the C4 phosphoenolpyruvate carboxylase gene of Flaveria trinervia . Plant Mol Biol. 2001; 45: 201–214. [DOI] [PubMed] [Google Scholar]
- 45. Hofer J, Gourlay C, Michael A, Ellis TH. Expression of a class 1 knotted1-like homeobox gene is down-regulated in pea compound leaf primordial. Plant Mol Biol. 2001; 45: 387–398. [DOI] [PubMed] [Google Scholar]
- 46. Nagasaki H, Sakamoto T, Sato Y, Matsuoka M. Functional analysis of the conserved domains of a rice KNOX homeodomain protein, OSH15. Plant Cell 2001; 13: 2085–2098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Jain M, Kaur N, Tyagi AK, Khurana JP. The auxin-responsive GH3 gene family in rice (Oryza sativa). Funct Integr Genomics. 2006; 6: 36–46. [DOI] [PubMed] [Google Scholar]
- 48. Masucci JD, Rerie WG, Foreman DR, Zhang M, Galway ME, Marks MD, et al. The homeobox gene GLABRA2 is required for position dependent cell differentiation in the root epidermis of Arabidopsis thaliana . Development 1996; 122: 1253–1260. [DOI] [PubMed] [Google Scholar]
- 49. Deveaux Y, Nioche CT, Claisse G, Thareau V, Morin H, Laufs P, et al. Genes of the most conserved WOX clade in plants affect root and flower development in Arabidopsis. BMC Evol Biol. 2008; 8:291 10.1186/1471-2148-8-291 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. McConnell JR, Emery J, Eshed Y, Bao N, Bowman J, Barton MK. Role of PHABULOSA and PHAVOLUTA in determining radial patterning in shoots. Nature 2001; 411: 709–713. [DOI] [PubMed] [Google Scholar]
- 51. Yoon J, Cho LH, Kim SL, Choi H, Koh HJ, An G. The BEL1-type homeobox gene SH5 induces seed shattering by enhancing abscission-zone development and inhibiting lignin biosynthesis. Plant J. 2014; 79:717–728. 10.1111/tpj.12581 [DOI] [PubMed] [Google Scholar]
- 52. Kurdyukov S, Song Y, Sheahan M, Rose RJ. Transcriptional regulation of early embryo development in the model legume Medicago truncatula . Plant Cell 2014; 33: 349–362. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Taylor JS, Raes J. Duplication and divergence: the evolution of new genes and old ideas. Annu Rev Genet. 2004; 38: 615–643. [DOI] [PubMed] [Google Scholar]
- 54. Cannon SB, Mitra A, Baumgarten A, Young ND, May G. The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana . BMC Plant Biol. 2004; 4:10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Duarte JM, Cui L, Wall PK, Zhang Q, Zhang X, Leebens-Mack J, et al. Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalisation in regulatory genes of Arabidopsis . Mol Biol Evol. 2006; 23: 469–478. [DOI] [PubMed] [Google Scholar]
- 56. Roulin A, Auer PL, Libault M, Schlueter J, Farmer A, May G, et al. The fate of duplicated genes in a polyploid plant genome. Plant J. 2013; 73: 143–153. [DOI] [PubMed] [Google Scholar]
- 57. Olsson A, Engstrom P, Soderman E. The homeobox genes ATHB12 and ATHB7 encode potential regulators of growth in response to water deficit in Arabidopsis . Plant Mol Biol. 2004; 55: 663–677. [DOI] [PubMed] [Google Scholar]
- 58. Ni Y, Wang X, Li D, Wu Y, Xu W, Li X. Novel cotton homeobox gene and its expression profiling in root development and in response to stresses and phytohormones. Acta Biochim Biophy Sin. 2008; 40: 78–84. [DOI] [PubMed] [Google Scholar]
- 59. Gago GM, Almoguera C, Jordano J, Gonzalez DH, Chan RL. Hahb-4, a homeobox-leucine zipper gene potentially involved in abscisic acid-dependent responses to water stress in sunflower. Plant Cell Environ. 2002; 25: 633–640. [Google Scholar]
- 60. Le DT, Nishiyama R, Watanabe Y, Tanaka M, Seki M, Ham LH, et al. Differential gene expression in soybean leaf tissues at late developmental stages under drought stress revealed by genome-wide transcriptome analysis. PLoS One 2012; 7: e49522 10.1371/journal.pone.0049522 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Wang YJ, Li YD, Luo GZ, Tian AG, Wang HW, Zhang JS, et al. Cloning and characterization of an HD-Zip I gene GmHZ1 from soybean. Planta 2005; 221: 831–843. [DOI] [PubMed] [Google Scholar]
- 62. Luo H, Song F, Zheng Z. Overexpression in transgenic tobacco reveals different roles for the rice homeodomain gene OsBIHD1 in biotic and abiotic stress responses. J Exp Bot. 2005; 56: 2673–2682. [DOI] [PubMed] [Google Scholar]
- 63. Ariel FD, Manavella PA, Dezar CA, Chan RL. The true story of the HD-Zip family, Trends Plant Sci. 2007; 12: 419–426. [DOI] [PubMed] [Google Scholar]
- 64. Dezar CA, Gago GM, Gonzalez DH, Chan RL. Hahb-4, a sunflower homeobox-leucine zipper gene, is a developmental regulator and confers drought tolerance to Arabidopsis thaliana plants. Transgenic Res. 2005; 14: 429–440. [DOI] [PubMed] [Google Scholar]
- 65. Liu JH, Peng T, Dai W. Critical cis-acting elements and interacting transcription factors: key players associated with abiotic stress responses in plants. Plant Mol Biol Rep. 2014; 32: 303–317. [Google Scholar]
- 66. Sessa G, Morelli G, Ruberti I. The ATHB-1 and -2 HD-Zip domains homodimerize forming complexes of different DNA binding specificities. EMBO J. 1993; 12: 3507–3517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Johannesson H, Wang Y, Engstrom P. DNA-binding and dimerization preferences of Arabidopsis homeodomain-leucine zipper transcription factors in vitro. Plant Mol Biol. 2001; 45: 63–73. [DOI] [PubMed] [Google Scholar]
- 68. Zhang S, Haider I, Kohlen W, Jiang L, Bouwmeester H, Meijer AH, et al. Function of the HD-Zip I gene Oshox22 in ABA-mediated drought and salt tolerances in rice. Plant Mol Biol. 2012; 80: 571–585. 10.1007/s11103-012-9967-1 [DOI] [PubMed] [Google Scholar]
- 69. Meijer AH, de Kam RJ, d’ Ehrfurth I, Shen W, Hoge JHC. HD-Zip proteins of families I and II from rice: interactions and functional properties. Mol Genet Genomics 2000; 263: 12–21. [DOI] [PubMed] [Google Scholar]
- 70. Ariel F, Diet A, Verdenaud M, Gruber V, Frugier F, Chan R, et al. Environmental regulation of lateral root emergence in Medicago truncatula requires the HD-Zip I transcription factor HB1. Plant Cell 2010; 22: 2171–2183. 10.1105/tpc.110.074823 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.