Skip to main content
Genes logoLink to Genes
. 2019 Jun 30;10(7):502. doi: 10.3390/genes10070502

Global Dynamics in Protein Disorder during Maize Seed Development

Jesús Alejandro Zamora-Briseño 1, Alejandro Pereira-Santana 2, Sandi Julissa Reyes-Hernández 1, Enrique Castaño 3, Luis Carlos Rodríguez-Zapata 1,*
PMCID: PMC6678312  PMID: 31262071

Abstract

Intrinsic protein disorder is a physicochemical attribute of some proteins lacking tridimensional structure and is collectively known as intrinsically disordered proteins (IDPs). Interestingly, several IDPs have been associated with protective functions in plants and with their response to external stimuli. To correlate the modulation of the IDPs content with the developmental progression in seed, we describe the expression of transcripts according to the disorder content of the proteins that they codify during seed development, from the early embryogenesis to the beginning of the desiccation tolerance acquisition stage. We found that the total expression profile of transcripts encoding for structured proteins is highly increased during middle phase. However, the relative content of protein disorder is increased as seed development progresses. We identified several intrinsically disordered transcription factors that seem to play important roles throughout seed development. On the other hand, we detected a gene cluster encoding for IDPs at the end of the late phase, which coincides with the beginning of the acquisition of desiccation tolerance. In conclusion, the expression pattern of IDPs is highly dependent on the developmental stage, and there is a general reduction in the expression of transcripts encoding for structured proteins as seed development progresses. We proposed maize seeds as a model to study the regulation of protein disorder in plant development and its involvement in the acquisition of desiccation tolerance in plants.

Keywords: Intrinsically disordered proteins, dehydration, seed development, maize

1. Introduction

Over the last few years, new information has been generated regarding the role of intrinsically disordered proteins (IDPs) which has changed our understanding of protein biochemistry. These findings led to the categorization of IDPs as an interesting group of proteins which differ from globular proteins in terms of action modes and physicochemical characteristics. IDPs possess low complexity and a biased composition of amino acid (aa), being almost depleted in hydrophobic and aromatic residues while enriched in polar and charged aa [1]. These features in the primary structure confer them with a high net charge and low mean hydrophobicity [2]. For these reasons, such protein sequences are unable to fold into stable, rigid, globular, three-dimensional structures. In contrast, they are dynamically interconvertible ensembles of spatial conformations ranging from extended statistical coils to collapsed globules [1,3]. Some proteins are predicted to be entirely disordered (IDPs in sensu stricto), while others are not intrinsically disordered throughout, but have disordered segments (intrinsically disordered regions IDRs) [4]. The collection of IDPs/IDRs of a given organism in a specific condition can be collectively referred as disordome [5].

IDPs are enriched in signaling and regulation of key cellular processes, such as control of cell division, apoptosis, post-translational modification, transcription, etc. Bioinformatics analysis of the Swiss-Prot database reveals that at least 238 function keywords of cellular process is given between IDPs/IDRs [6,7,8]. Flexible conformation of intrinsic disorder confers advantages to proteins, such as an increased speed of interaction, the combination of specificity with weak and reversible binding, and the ability to carry out more than one function [9]. In this context, several advantages have been proposed for IDPs/IDRs, which include, economizing genome and protein resources, overcoming steric restrictions in binding, achieving high specificity with low affinity, facilitating post-translational modifications, enabling flexible linkers, preventing aggregation, providing resistance to non-native conditions and finally, allowing compatibility with more available sequences [6].

In plants the genomic-wide information available regarding IDPs is rather limited in comparison to other eukaryotic organisms and it is circumscribed to Arabidopsis thaliana [10,11], and a few other plant models [12,13,14]. The fact that many IDPs can serve as signal integrators in signaling cascades and stress-response processes, allows the supposition that their role can be especially important in plant development and adaptation to environmental conditions, probably by providing them with a fast mechanism to obtain complex and highly interconnected molecular networks [15]. It has been pointed out that several IDPs in plants play key roles in response to different stresses [10,11], in particular water stress [16,17,18]. It has been suggested that plants may use IDPs independently to adapt quickly and efficiently to environmental changes from which they cannot escape [10].

Although important progress has been achieved in our comprehension of the functions of disordered proteins in plants, the answers to many interesting questions remain elusive. An important feature to be evaluated is their role in desiccation tolerance (DT) in plants. When cellular water content is reduced, the density of macromolecules is increased, an effect known as macromolecular crowding [18]. In some cases, this density must be significantly adjusted, for example, in orthodox seeds in dry state.

The end of orthodox seed development is typified by a programmed period of dehydration leading to the loss of bulk water from the entire structure [19]. Such dehydration causes a reduction in the cellular volume. The macromolecular crowding resulting from the cytoplasmic compaction, provide an environment amenable to numerous undesirable interactions that can lead to organelle-cell membrane fusion, and denaturation and aggregation of certain proteins especially susceptible to such conditions [20]. Acquisition of desiccation tolerance, or the ability to withstand these very low water potentials and consequent macromolecular crowding, has been correlated with the accumulation of various protective compounds including protective proteins and sugars [19]. Additionally, it has been proposed that a possible mechanism to partially help cope with this effect is the induction of proteins less prone to the loss of function induced by denaturalization [5]. Furthermore, the induction of IDPs could represent a compensatory mechanism to mitigate the effect of the macromolecular crowding in the cell, since these proteins are not able to lose their function by a denaturalization-mediated mechanism. In addition, a reduction in the content of structured proteins with propensity to denaturalization, mediated by the lack of water, is also feasible.

Understanding the spatio-temporal expression pattern of genes represents an invaluable source of information regardless if a gene contributes or not to its function in the cellular context [21,22]. Therefore, the expression profile of coding transcripts under a desiccation context can shed light on the relationship of protein intrinsic disorder with such characteristics. In these sense, orthodox seeds may be particularly useful models to evaluate association between the overall regulations of protein disorder under a dehydration context, since these are specialized structures which have developed evolutionary conserved adaptations to resist the extreme loss of water [23]. A very well study orthodox seed is the maize caryopsis. This is an amenable plant model, with a fully sequenced and well annotated genome [24], with several available public transcriptomic datasets of seed development, as well as a detailed description of the anatomical, physiological and biochemical changes occurring during this period [25,26,27,28].

For these reasons, in this work we tested the hypothesis that a desiccation context favors the accumulation of proteins less prone to denaturation mediated by water loss. In this sense, we considered that the increase of protein disorder could represent a strategy to increase the general resilience of the proteome as a response to extreme water loss conditions. To evaluate this idea, we analyze the expression of genes according to the disorder content of their coded proteins during maize seed development. We found that the overall disorder content differs throughout the process of seed development and it is closely associated with the different developmental stages in which maize seed is divided. Interestingly, a general reduction in the content of genes encoding for structured proteins can be observed. We identified several disordered transcription factors (TF) involved in key regulatory activities during seed development. We also identified clusters of highly disordered proteins, the expression of which is correlated with the onset of the program which leads to the acquisition of dehydration tolerance in seed. Therefore, we consider maize seed as a good model to study the IDPs roles and understand their implications during development as well as in drought stress response in general. Finally, we proposed a general model that involve an overall reduction of the structured proteins and an increase in the content of IDPs as a mechanism to reduce the negative impact of extreme water loss in the cell.

2. Materials and Methods

2.1. Transcriptomic Data Acquisition

To correlate if there is a dependency between the disorder content of the proteins codified by transcripts during seed development progression, we used the available transcriptomic data of whole maize seed during its development. Transcriptome data was downloaded from the EMBL-EBI Expression Atlas (https://www.ebi.ac.uk/gxa/home). This data was generated by Chen et al. [25] and corresponds to the experiment identified as “Transcription profiling by high throughput sequencing of maize embryo and endosperm during development”. This data includes the counts of transcripts standardized in Transcripts per million readings (TPM) of maize seed from day 0 until day 38 of post-pollination (DAP).

The identifiers of each transcript were retrieved from the expression table to obtain the protein sequences codified by each transcript. The maize genome B73 version 4 was used in this study, and the file annotations were downloaded from Ensembl Plants (plants.ensembl.org/index.html).

2.2. Protein Disorder Analysis and Data Clustering

The prediction of the protein intrinsic disorder was performed using Espritz [29] setting X-ray and Best Sw as parameters. The longest isoforms of each transcript were used and cataloged according to the percentage of structural disorder of their encoded protein. Thus, transcripts were cataloged into four different categories from lower to higher percentage of structural disorder (their encoded proteins) as follows: (I) 0–25, (II) 25–50, (III) 50–75, and (IV) 75–100 of disorder percentage. The overall protein sequences of each category were analyzed to quantify the relative composition of aa. The percentage of aa composition of each category were plotted as a heatmap. This data was also used to perform Principal Component Analysis (PCA) to identify if the categories could be distinguished from each other. Both analyses were performed with ClustVis [30]. Proteins were annotated with Blast2GO [31] and gene ontology (GO) enrichment analysis was performed with WEGO [32]. Enriched GO terms were exported and plotted as a heatmap with ClustVis [30]. To correlate protein length and structural disorder, we classified the proteins into six categories according to their length: (I) 30–250, (II) 250–500, (III) 500–750, (IV) 750–1000, (V) 1000–1500, and (VI) > 1500 aa. Subsequently, we classified our four protein disordered categories according to their length.

2.3. Transcriptome Mining and Differential Expression Analysis

To obtain the proportion of transcripts which encode for disordered proteins with respect to the structured proteins, we obtained the relative content of transcripts of each category. The total sum of TPM of each category was obtained and divided by the total number of TPM per day and multiplied by 100, and the results were plotted with respect to the time (0–38 DAP). These kinetics were divided into five different developmental stages (R1, R2, R3, R4, and R5), according to Hanway [33]. Additionally, the number of expressed genes of each category (considering as expressed genes those with a TPM > 0) was quantified and represented as percentage. To confirm the correlation between the developmental stage and the global expression profile of IDPs, a Pearson’s correlation analysis of transcripts encoding for proteins with a disorder content > 25% was depicted.

Venn diagrams of the expressed transcripts (TPM > 0) were constructed to identify intersected and specific transcripts between developmental stages. To simplify the clustering of the data, the transcripts were divided into transcripts encoding for structured proteins (0–25% of disorder) and transcripts encoding for IDPs (>25% of disorder). To obtain transcripts that encode for highly disordered proteins (>75% of disorder) which are also differentially expressed, the EdgeR package was used [34,35], using a cut-off value of false discovery rate (FDR) of 0.001. From these data, we defined two groups of transcripts which encode for IDPs, the transcripts that are positively regulated and those that are negatively regulated throughout the development of the seed. Additionally, a functional enrichment analysis was carried out, for both the up- and the down-regulated genes identified. This analysis was performed with Blast2GO [31].

2.4. Gene Co-Expression Network Analysis

To find out if functional specialization of TFs exists based on their level of disorder, we inferred the regulatory networks formed according to structural disorder of TFs identified as up-regulated in early (R1), middle (R2 and R3) and late (R4 and R5) stages. Up-regulated transcription factors were obtained from the differential analysis with EdgeR, and TF expressed in seed identified previously [25].

Transcript per million reading values of genes corresponding to seed dataset were filtered to reduce complexity and data noise. For the co-expression matrix training, we just used those genes with at least one count per time-point treatment and with a total sum of counts per treatment greater than 21 (there were 21 time-point treatments). To build the pairwise co-expression matrix, we use the GENIE3 Bioconductor package [36,37], which is based on the Random Forest machine-learning model [38] with default parameters. We build four different co-expression networks using as “Regulator nodes” the 73 differential expressed transcription factors (DETF) found in our differential analysis and the filtered seed TPM matrix as “Target nodes”. The 73 DETF were grouped according to their disorder level (0–25, 25–50, 50–75, and 75–100% of intrinsic disorder). For the first network, we used as regulator nodes the five DETF with 0–25% of disorder; for the second network we used 34 DEFT within a 25–50% of intrinsic disorder; the third network was built on 22 DETF with 50–75% of intrinsic disorder; and the fourth network was built on 12 DETF within a 75–100% of intrinsic disorder. The resulting files from GENIE3 software were loaded as tables into Cytoscape v.3.6.0 [39]. All the co-expression networks were taken as undirected but weighted. The networks were clustered by using the GLay clustering algorithm plug-in in Cytoscape [40] according to topological edge connections. Finally, transcripts clusters were separated according to TF disorder level and stage of development in which their expression is induced.

3. Results

For the analyzed dataset, we found a clear relationship in the protein length and the protein disorder content (Figure 1). The proportion of proteins with smaller length is greater when the disorder increases. Thus, the IDPs tend to be shorter in comparison with the structured proteins. From a functional point of view, there is a clear separation between the enriched functions of structured (=<25% of disorder) and disordered proteins. Disordered proteins are enriched in regulatory function, in contrast to structured proteins which are enriched in catalytic functions (Figure 2A). From a structural point of view, it is possible to separate four categories based on aa composition, since we can differentiate structured proteins (< 25% of protein disorder) from moderately disordered (25–50% and 50–75% categories) and highly disordered proteins (>75% of protein disorder; Figure 2B). This separation is clearly explained given that structured proteins tend to be enriched in hydrophobic residues, while polar and hydrophilic aa are over-represented in highly disordered proteins (Figure 2C). However, during the progression of seed development, there is a reduction in the number of expressed genes (TPM > 0) encoding for structured proteins (Figure 3). Apparently, the genes that encode for structured proteins are activated in the first stages of seed development and they are inhibited later in the course of development (Figure 3A). In contrast, this does not occur for transcripts that encode for proteins with >25% structural disorder. In this case, there is a subset of genes that are specifically activated in each of the developmental stages (Figure 3B). According to our analysis, the relative content of transcripts encoding for structured proteins (=<25% of disorder) and IDPs is not constant throughout seed development. If we divide the dataset into the five different phases of seed development (R1, R2, R3, R4, and R5), which include the period of time analyzed in this work (0–38 DAP), it is highly interesting to note that in the R2 and R3 phases, there is an important increase in the proportion of transcript counts of structured proteins (=<25% of protein disorder; Figure 4A). Therefore, the most abundant transcripts of these phases were identified, and the 15 most abundant ones were selected. Thus, the following genes were identified: alpha-zein A20 (Zm00001d019155), alpha-zein 19D1 (Zm00001d030855), zein 15kD (Zm00001d035760), zein 10 kD (Zm00001d045937), alpha-zein3 22kD (Zm00001d048809), alpha-zein4 22kD (Zm00001d048812), alpha-zein (Zm00001d048813), zein 3 (Zm00001d048817), alpha-zein 19B1 (Zm00001d048848), alpha-zein (Zm00001d048849), alpha-zein PMS1 (Zm00001d048850), alpha-zein z1C2 (Zm00001d049243), and alpha-zein Z1A (Zm00001d049476). Total counts of these highly expressed transcripts represent up to 39.3 ± 7.94 (mean ± SD) percent of total TPMs in middle phase (R2 and R3), while in early and late phases their counts represent 0.62 ± 0.84 and 2.70 ± 3.43, respectively.

Figure 1.

Figure 1

Relationship between the level of disorder and the protein length. It can be observed that the more disordered the proteins, the greater the relative proportion of smaller sized proteins constituting each category.

Figure 2.

Figure 2

General characteristics of the proteins according to the disorder content of each category defined in this work. (A) Heatmap of the enriched Gene Ontology (GO) terms between the four defined categories for a p-value < 0.01. Colors represent the percentage of each GO category with scaling applied to rows and the values centered to zero. (B) Principal Component Analysis (PCA) analysis of the aa composition of the proteins that constitute each category. Unit variance scaling is applied to relative aa composition of the proteins of each category; Singular Value Decomposition (SVD) with imputation is used to calculate principal components. X and Y axis show principal component 1 and principal component 2 which explain 89.9% and 5.8% of the total variance, respectively. (C) Heatmap of the aa composition of the different categories of protein disorder. The rows were centered, and scaling was applied. The rows were then clustered using correlation distance and complete linkage.

Figure 3.

Figure 3

Venn diagrams of the shared expressed genes (TPM -Transcripts Per Kilobase Million- > 0) of each stage, according to the disordered content of the proteins codified by each transcript. (A) Transcripts encoding for structured proteins (0–25% of disorder content) is expressed in the early (0–8 Days After Pollination -DAP-) and then are turned off in the following stages. (B) There are transcripts encoding for intrinsically disordered proteins (IDPs) which are expressed specifically in each stage.

Figure 4.

Figure 4

Expression patterns of transcripts according to the disorder content of the protein that they encode. (A) Percentage of TPM of transcripts encoding for proteins with different degrees of protein disorder. There is an induction in the overall expression of transcripts encoding for structured proteins during the middle phase of seed development. (B) Pearson’s correlation analysis of the expression of transcripts encoding for proteins with > 25% of disorder. There is a clear separation of the different phases of development, which suggests a marked specific expression of transcripts during each of the developmental stages.

Moreover, the expression of IDPs (>25% of protein disorder) is highly dependent on the developmental phase, and the global expression of this type of transcripts is correlated with developmental time (Figure 4B). The complete list of expressed transcripts classified in the different categories of disorder, as well as their expression values are shown in Supplementary Tables S1–S4.

It is evident that transcriptomes from 0 to 10 DAP (R1), 12 to 28 DAP (R2 and R3), and 30 to 38 DAP (R4 and R5) formed clusters, which correspond to early, middle, and late phases of development, respectively [25]. R1 is an active period of cell division and cell elongation, while R2 and R3 correspond to morphogenesis and maturation phases of development, respectively. During the R2 and R3 stage, the embryo undergoes active DNA synthesis, cell division, and differentiation, followed by synthesis of storage reserve and desiccation [41]. The distinct cluster after R4 and R5 stages (30 to 38 DAP) encompass the end of storage compound accumulation in the endosperm and the activation of biological processes involved in dormancy and dehydration [25].

In fact, at the end of the R5 stage the proportion of expressed transcripts encoding IDPs is increased with respect to the transcripts encoding for structured proteins (Figure 5) with an atypical peak of transcripts encoding for structured proteins at 22 DAP. Thus, it is expected that the mean length of the proteins in the late phase of seed development tends to become shorter in comparison with the beginning of the embryogenesis.

Figure 5.

Figure 5

Dynamics in the relative composition of transcripts expressed throughout seed development. It is a clear reduction in the relative composition of structured proteins. The atypical data observed in the 22 DAP coincides with the peak of seed filling.

These results suggest that the changes in the proportion of structural disorder are a consequence of the functional changes occurring at the different developmental phases. When seed development begins, a large quantity of cellular resources is advocated to cell division and cell differentiation. This process requires an important gene regulation. However, when the seed enters the middle phase of development, the metabolism becomes primarily biosynthetic, given that several enzymes are needed for the synthesis of starch and other storage compounds, and storage proteins synthesis take place. At the end of the filling phase and the beginning of the late stage of development (R5), the onset of the dehydration program takes place. For this reason, we were advised to identify the up-regulated transcripts that codify for IDPs. We identified 1015 transcripts codifying for IDPs (>25% of disorder) which are positively expressed at the end of the R5 stage. Between these IDPs-coding transcripts, we identified 193 up-regulated transcripts that codify for highly disordered proteins (> 75% of protein disorder). In contrast, 157 transcripts were identified as down-regulated transcripts which codify for this type of proteins, but are highly expressed during early embryogenesis (Figure 6A,B). Interestingly, the GO enrichment analysis of both sets of proteins reflects the switch in the metabolism occurring during maize seed development (Figure 6C). The gene ontologies associated with the up-regulated genes are clearly different from those associated with the down-regulated genes. In the down-regulated group, several GO terms associated with cell development and cell differentiation are enriched. Among these GO terms, some are representative, such as “Cellular developmental process”, “Chromosome organization”, “Cell differentiation”, and “DNA conformation change”. The up-regulated transcript set shows enrichment of GO terms associated with plant adaptation to drought conditions which become obvious. Some over-represented processes are “response to abiotic stimulus”, “Response to water deprivation”, “Response to ABA”, “Seed germination”, and “Embryo development”.

Figure 6.

Figure 6

Gene expression profile of highly disordered IDPs with more than 75% of disorder in its sequence. (A) and (B) There is a group of IDPs which are induced and another group which are actively repressed throughout the period evaluated. (C) Functional enrichment analysis for each up and down-regulated gene encoding for highly IDPs. The size is proportional to the number of enriched genes among the total number of their GO term associated genes.

In the group of down-regulated genes, there were identified genes encoding disordered proteins involved in DNA regulation and RNA regulation such as Histone H1 (Zm00001d013066 and Zm00001d013067), and Histone H1a (Zm00001d018981 and Zm00001d034479), Histone H2A (Zm00001d044246), High mobility group protein 3 (Zm00001d051427), High mobility group family A (Zm00001d032239), nucleosome/chromatin assembly factor D (Zm00001d052749), H/ACA ribonucleoprotein complex subunit 3-like protein (Zm00001d051116), H/ACA ribonucleoprotein complex subunit 1-like protein 1 (Zm00001d052952). Interestingly, there were identified several transcripts participating in cell signaling and TFs involved in development such as Calmodulin binding protein (Zm00001d002630 and Zm00001d028841), Calreticulin-2 (Zm00001d005460), Calmodulin binding protein isoform 2 (Zm00001d038838), Protodermal factor 1 (Zm00001d043588), Early nodulin 75 protein (Zm00001d031878), HMG-transcription factor 13 (Zm00001d021433), Homeobox-transcription factor 41 (Zm00001d017422), and WRKY-transcription factor 48 (Zm00001d015515). Meanwhile in the list of up-regulated genes encoding for highly disordered proteins, we identified several TF involved in stress response and dormancy, including bHLH-167 (Zm00001d003677), MYB-139 (Zm00001d005300), bZIP-91 (Zm00001d007042), MYB-related-111 (Zm00001d026017), DRE-binding protein 1 (Zm00001d032295), bZIP-29 (Zm00001d034571), bHLH87 (Zm00001d038863), ZF-HD-9 (Zm00001d044662), bZIP (Zm00001d052562), HSF-11 (Zm00001d034433), as well as several proteins involved in seed maturation, protection of cellular components, and water stress response, such as Dormancy-associated protein homolog 3 (Zm00001d047503), LEA 3 (Zm00001d043709 and Zm00001d038870), Xero 1 (Zm00001d043730), Seed maturation proteins (Zm00001d044022, Zm00001d024414, Zm00001d033782, and Zm00001d035000), and glycine-rich cell wall structural protein (Zm00001d017033). The complete list of up-regulated and down-regulated genes encoding for highly disordered proteins (> 75% of protein disorder) from day 0 to day 38 post-pollination are presented in Supplementary Table S5; Table S6.

Using the co-expression networks analysis, an important number of key transcription factors were identified forming regulatory clusters of transcripts. As expected, most of the TFs identified forming clusters have a certain level of structural disorder (Figure 7). Although during the middle phase there is an increase in the total content of transcripts that encode for structured proteins, in this period we did not identify any structured TF forming regulatory clusters. Likewise, structural disorder seems to play a preponderant role in some TF families, including AP2, NAC, bZIP, HSF, Opaque 2, and Myb families.

Figure 7.

Figure 7

Co-expression networks analysis using up-regulated TF as regulators. Differentially expressed TFs of each phase were used to infer the key TFs, according to their disorder content. TFs are preponderantly IDPs. At early stage, TFs families are associated with developmental process. During the middle phase, key TFs are advocated to control of biosynthetic processes, such as accumulation of storage proteins. At the beginning of late phase, TFs families are preponderantly focused on the control of desiccation acquisition mechanisms. Only TFs with more than two target genes were included in the figure.

Interestingly, in this analysis we identified several TFs of NAC, ethylene-insensitive, HSF, and bZIP families as key regulators of the late phase. The general list of over-regulated TF that form regulatory clusters, as well as their target genes, is presented in the Supplementary Table S7.

In general, the data generated in this work suggest that there is an active adjustment in the content of the disorder content during the maize seed development.

4. Discussion

A potent feature of the transcriptomic studies is the ability to derive conclusions of the mechanisms that are being regulated under a defined biological condition. A complementary approach would be to analyze the gene expression in relation to predicted conformational properties of the proteins they encode. However, the use of such approach is not so exploited [42]. This kind of information is not obvious and must be quantified. The consideration of the global structural characteristics of the proteins that are synthesized under a given condition could give us additional information regarding the adaptive mechanisms used by organisms to respond to their environment. For example, recently it was corroborated that proteins, whose expression levels increase by heat in A. thaliana, are enriched in charged aa and have a low proportion of polar and hydrophobic aa, in comparison with the proteins that are repressed11. The regulation of the types of proteins based on their aa composition could represent a kind of adaptive response to environmental changes which have not been previously taken into account.

There are enough examples of the accumulation of IDPs in response to loss of water, to suggest that the increase in structural disorder is part of the responses to water stress [20,43,44,45,46,47]. For example, in orthodox seeds, LEA proteins (a group of very well characterized IDPs) are highly abundant during the late stages of plant seed development when the embryo becomes desiccation tolerant and its induction coincides with the onset of desiccation tolerance [44,48]. Previously, it was pointed out that there is a strong correlation between the desiccation and intrinsic disorder in proteins, since IDPs are involved in vitrification, water replacement, molecular shielding, membrane stabilization, preservation of cellular organization and structure, water retention, and scavenging of reactive oxygen species [46]. Another feasible measure that cells can implement to reduce the effects of lack of water, is the reduction of proteins which are more sensitive to loss of function mediated by lack of water. Based on these ideas, in this work we analyzed the regulation of the expression of transcripts according to the disorder content of the encoded proteins along the maize seed development, including the beginning of late phase.

During maize seed development there is a progression of a morphogenetic program which includes the acquisition of desiccation tolerance at the end of the developmental program. This is accompanied by a sharp reduction in the relative content of water. Interestingly, in this work we found that this is also accompanied by a reduction in the number of genes encoding for structured proteins. To our knowledge, this is the first global survey of the expression of transcripts according to the structural disorder of their encoded proteins under a context of seed development and water reduction.

We first described general features of the different categories in which proteins were divided according to their disorder content. It is interesting to note that there is a clear relationship between the protein disorder level and the protein length. To our knowledge such association has not been pointed out previously. However, it was found that the percentage of occurrence of most of the residue types depends significantly on protein dimension [49], but this dependency was not explained. The functional specialization of the IDPs with respect to the structured proteins may help to explain such observations. The GO enrichment analysis showed that the protein specialization is dependent on the level of protein disorder. IDPs are specialized in regulatory activities, while the structured proteins are enriched in catalytic functions. It has been observed that transcription factors possess a high degree of disorder in the form of intrinsically disordered regions (IDRs) [50]. In fact, disorder predictions show that 83–94% of all known TFs possess extended regions of disordered residues [51]. This explains why most of the TFs identified in the co-expression analyzes fall into some category of disorder.

Disorder is less frequent in enzymes and many proteins involved in catalytic activity are structured [52]. This functional bias can be explained if we take into consideration that intrinsic disorder is a direct consequence of the particularity of the amino-acid composition of IDPs. Compositional bias is a common attribute of IDPs [2,8,53,54]. IDPs are generally rich in Arg, Gln, Glu, Lys, Pro, and Ser, while they are deficient in Cys, Ile, Leu, Phe, Trp, Tyr, and Val [4]. These data are in accordance with our predictions, which show a clear difference for each protein category defined in this work.

We found that transcripts encoding for structured proteins are actively induced during the middle seed filling phase (R2 and R3 stage), and coincides with the biosynthesis and accumulation of reserves25. During this phase, the conversion of imported sucrose and aa into starch and storage proteins can account for about 90% of kernel total dry matter, with the most rapid grain-filling occurring between 21 and 25 DAP, regardless of the genetic background [26,55,56,57]. A phenomenon that characterizes this period is the accumulation of zeins in endosperm. Zeins are a class of storage proteins which have been classified as α-, β-, γ-, and δ-zeins based on its solubility and its structural properties [58,59,60,61]. The α-zein protein group is the largest among them, consisting of ∼ 80% of the total zeins proteins [62]. They are unusually rich in hydrophobic amino acids including leucine, alanine, and proline, alongside the hydrophilic glutamine [59]. In general, zeins are highly insoluble in water, and are structured proteins [60,63]. For example, 22kDa and 19kDa zeins are conformed of nine contiguous, topologically antiparallel helices grouped within a distorted cylinder [64]. In this work it was found that a large part of the increase in the content of transcripts that encode for structured proteins can be explained by the incredible increase in the expression of the genes of the zein family. The transcriptional control of genes encoding 22kD zein proteins is mediated by opaque-2 (o2), a well-known TF that is a key transcriptional regulator during the middle phase [65]. Interestingly, Opaque-2 (Zm00001d018971) was identified as a regulator of several zeins in our networks analysis which help to support our predictions.

On the other hand, the number of expressed genes encoding for structured proteins shows a continuous reduction after middle phase. At the beginning of the maturation phase (R5 stage), there is a subset of highly disordered proteins which is actively induced. However, the regulation of these genes starts at the R4. The regulation of the desiccation program begins immediately after the end of the most active phase of biosynthesis, before the acquisition of tolerance to desiccation becomes a visible characteristic. This is a phase that can be considered to be preparatory. Prior to R5, kernels along the length of the ear begin to dent or dry on top. Moreover, in the R5 stage the kernels are drying and moisture content is severely reduced, reaching less than the 50% [5,33]. In this stage, a decreasing grain-filling rate culminates in physiological maturity and black-layer formation, and is related to seed dormancy [26]. In addition, our data suggest that there is a continuous reduction in the number of transcripts encoding for structured proteins. Interestingly, there is an atypical peak in the relative content of this kind of transcripts at 22 DAP, which coincides with the maximum starch accumulation rate, occurring at 21 DAP [26,57].

The early stage characteristically involves cell division, after which the endosperm cells enlarge and as a result of several metabolic processes acquire storage proteins and starch [66]. In the final phase of embryogenesis, desiccation tolerance is acquired, and dormancy is established [67]. Therefore, differential expression analysis is very useful to define the IDPs associated with the first developmental process and those associated with the beginning of the maturity phase. Within these two types of transcripts, the specialization of the IDPs can be differentiated, since their ontologies are involved in clearly different processes. While the down-regulated IDPs genes are dedicated to developmental-associated processes, the induced ones are involved in functions related to the abiotic stress and environment stimulus responses.

Within the IDPs identified in the first category, it is interesting that there are several proteins associated with nucleic acid packing, such as histones and high mobility group (HMG) proteins. It is recognized that many protein functional classes are heavily dependent on intrinsic disorder. Among these disorder-centric functions are interactions with nucleic acids and protein complex assembly. Previously, it has been demonstrated that all the members of the histone family are IDPs. In fact, intrinsic disorder is necessary for various histone functions, starting from heterodimerization to formation of higher order oligomers, to interactions with DNA and other proteins, and to post-translational modifications [68]. HMG proteins are nuclear proteins that binds transiently to nucleosomes, changes the local architecture of the chromatin fiber, and affects several DNA-related activities such as transcription, replication, and DNA repair [69]. HMGs are highly disordered proteins, and they are mainly random coiled in solution, but they are subjected to folding upon binding with DNA or other interators [70,71].

In addition, we identified many down-regulated transcripts encoding proteins involved in plant development and calcium-mediated signaling, such as auxin-repressed protein (ARP) and Calreticulin. Evidence has emerged for ARP family members as IDPs. ARP genes are responsive to hormones involved in responses to biotic stress, such as salicylic acid (SA) and methyl jasmonate (MeJA), as well as to hormones that regulate plant growth and development, including auxins [72,73,74]. Calreticulin (CALR) is well recognized as a Ca2+-binding protein molecular chaperone that assists the folding of newly synthesized glycoproteins and modulates the Ca2+ homeostasis in the endoplasmic reticulum (ER) lumen [75]. In the ER, CALR binds not only Ca2+, but also interacts with many ER proteins, and with some mRNAs, thereby determining their fate. The potential functions of CALR are so numerous that identification of all of them is becoming a nightmare. Interestingly, its functional versatility is provided by its intrinsic disorder properties [76,77,78,79].

In contrast, up-regulated genes are over-represented by an important number of IDPs families, such as LEA proteins, dehydrins, glycine-rich proteins, proline rich proteins (PRPs) and Seed Maturation Proteins (SMP), which are implied in the DT and storability of vital seeds [80,81,82]. For example, LEA proteins, especially group 3 are highly hydrophilic, IDPs, whose expression are associated with the acquisition of desiccation tolerance in maturing seeds [20,81,83]. In maize, LEA3 gene was discovered to be responsive to ABA and hyperosmolarity and afterwards it was proved to be able to reduce cell shrinkage effects under dehydration [84]. On the other hand, PRPs are IDPs that were first identified as proteins that accumulate in the cell wall in response to physical damage. Members of the PRP gene family are expressed during leaf, stem, root, and seed coat development, seedling growth, and in cell types associated with lignification [85]. Functionally, PRPs are insolubilized in the cell walls with the involvement of reactive oxygen species (ROS)-mediated cross-linking and it play a role in the structural integrity of plant cells, participate in defense related activities, and plant cell surface interactions [85].

Glycine-rich proteins genes regulate diverse cellular processes in plant development and stress response. In tobacco, NtGRPs are highly regulated under osmotic stress and AtGRPs and OsGRPs have been identified acting as RNA chaperones that regulate mRNA export from the nucleus to the cytoplasm under stress condition. Also, GRPs proteins are important for the regulation of some steps in RNA post-transcriptional processing as splicing and polyadenylation [86,87,88]. The detection of these recognized IDPs, whose involvement in DT has been widely studied, can help us to support our disordered predictions. However, there are other IDPs that are induced in this phase, such as remorins and metallothioneins which have been associated with drought tolerance [89,90,91]; however, information regarding their relationship with DT is still limited, so that a more detailed analysis of its implications in the acquisition of tolerance to desiccation should be addressed.

In our co-expression analysis, we detected several key TFs for each stage of development. Key TFs identified in the early stage are mainly specialized in coordinating processes related to cell development and cell differentiation, such as MADS and MYB TFs. MADS TFs are recognized to be involved in controlling many developmental processes in flowering plants, ranging from pollen and embryo sac development to root, flower, and fruit development [92]. Likewise, MYB proteins are key factors in regulatory networks controlling development, but also metabolism and responses to biotic and abiotic stresses [93].

In the middle phase, we identified several key intrinsically disordered TFs involved in the accumulation of storage reserves. Within this group of TFs, the identification of opaque2 is particularly important, due to its importance in the regulation of the accumulation of reserve proteins and its incredible capacity to integrate N and C metabolism [94]. Opaque 2 belongs to the basic leucine zipper (bZIP) class that is specifically expressed in the endosperm activating the expression of 22-kDa α-zein and 15-kDa β-zein genes. Opaque2 also directly or indirectly regulates several other non-storage protein genes, such lysine-ketoglutarate reductase and heat shock protein 70 [95,96]. In our network analysis, these and others important proteins, such as ABA receptor PYL5, NAC87, 60S ribosomal protein L7-1, 40S ribosomal protein S24, giberellin-2 oxidase 1, aconitase1, TFDII subunit 9, among others were predicted to be regulated by opaque2 TF.

During the late phase, most of the regulator identified belongs to NAC TF family. The NAC TFs comprise one of the largest family of TFs in plants [97,98]. The NAC TFs play a vital role in the complex signaling networks during plant stress responses. It appears that an important proportion of NAC genes function in stress response according to the expression data from global expression analyses in many plants [99,100,101]. Interestingly, Heat shock factor 1 (HSF1) (Zm00001d005888, 50–75% of disorder), appears to be involved in the regulation of oxidative stress, as many redox-related proteins were identified as regulated by this TF. For example, we identified proteins such as wound induced protein, respiratory burst oxidase, glutaredoxin subgroup III protein, glutaredoxin, and 17.4 kDa class I HSP. In tomatoes was found that HsfA1 is the master regulator in response to heat shock stress [102], and it has been proposed that some members of HSFs family may act as redox sensors [103].

On the other hand, TF bZIP7 seems to play an essential role in this stage. Moreover, to our knowledge, this gene has not been functionally characterized. In general, few bZIP genes have been functionally characterized in maize [104]. In plants, the bZIP TFs regulate diverse functions, including processes such as plant development and stress response. In our networks analysis, bZIP7 appears as regulator of genes involved in desiccation tolerance, such as glycine-rich protein, hidroxy-proline rich protein, GRAS46, metallothionein 2, glutanione transferase 19, gluthatione transferase 37, anther specific proline rich protein, cupins, SMPs, etc. Therefore, an interesting topic could be to characterize the role of this TF or its homologs.

Unfortunately, in this work we are limited by the lack of data regarding the transcriptomic profile of stage R6, where maturation of the seed ends and the process of desiccation of the caryopsis is almost completed. Further analyses must be carried out during the end of the late embryogenesis phase to uncover the global pattern of disordome in R6 stage.

However, we consider that with the data presented in this work is possible to draw a general model about the alterations that occur during the adjustment process of the disordome as a mechanism of tolerance to desiccation in orthodox seeds. In this proposed model, the content of structured proteins is higher during the most active metabolic stages of seed development, just when the water content is higher. Subsequently, there is a gradual reduction in the number of structured proteins, as well as in their total content. At the same time, the relative content of IDPs increases and with this, the proportion of proteins less prone to losing its function due to loss of structure. This mechanism may facilitate the maintenance of the integrity of the proteome, due to the fact that there would be a lower amount of proteins susceptible to losing its spatial structure, and its molecular function. The general view of such ideas is summarized in Figure 8.

Figure 8.

Figure 8

Proposed model of disorder adjustment in the development of orthodox seeds. At the beginning of the development of the seed, the biosynthesis of the reserve compounds and the molecules required to grow and develop is active, the available water is abundant, and the content of structured proteins is high (A). As the development program progresses, there is a significant reduction in water content, cell volume is reduced, and macromolecular crowding increases. This effect causes unwanted interactions between proteins, loss of tertiary structure, and protein aggregation. To reduce this negative effect, cells activate protective mechanisms (such as the biosynthesis of compatible solutes and the induction of chaperones). However, we propose that there is also a sharp reduction of the most labile proteins, without the replacement of other types of structured proteins. In contrast, the content of the IDPs increases and all the genes expressed de novo encodes for IDPs (B). The number of structural proteins continues to decrease, as does their content. It is considered that all this complements the typical mechanisms of tolerance to desiccation. In general, this can be considered to be a strategy to make the proteome more resilient to water-limiting conditions by a mechanism mediated by overall adjustment of the protein disorder.

5. Conclusions

This study represents the first survey of the dynamics of the expression of transcripts encoding for IDPs in maize seeds. The data obtained in this work help to establish a relationship between global modulation of the overall protein disorder (disordome), and progression of seed development, including the onset of the desiccation tolerance program. In part, such behavior could be explained by the fact that under a desiccation context there is a general reduction in the biosynthetic activities of storage compounds. During this period, we identified several key TFs controlling each stage of seed development and desiccation tolerance acquisition. Interestingly, most of these TFs are IDPs.

Here, we provide evidence to propose that the onset of desiccation program in seed provoke an actively reduction of proteins that are more prone to loss their function in a water-limiting environment, and an induction of IDPs. The proposed model, although general, could represent an elegant strategy to partially circumvent the negative effects of the compaction effect induced by loss of water, and can be easily extrapolated to other plant and non-plants models in which desiccation tolerance take place. In this sense, the methodological strategy used in this work may be very useful for further studies.

Supplementary Materials

The following are available online at https://www.mdpi.com/2073-4425/10/7/502/s1, Table S1: Expression profile of transcripts encoding structured proteins (<25% of protein disorder) during seed development, and their prediction of disorder content, Table S2: Expression profile of transcripts encoding intrinsically disordered proteins (25–50% of protein disorder) during seed development, and their prediction of disorder content, Table S3: Expression profile of transcripts encoding intrinsically disordered proteins (50–75% of protein disorder) during seed development, and their prediction of disorder content, Table S4: Expression profile of transcripts encoding highly disordered proteins (>75% of protein disorder) during seed development, and their prediction of disorder content, Table S5: Complete list of up-regulated transcripts encoding highly intrinsically disordered proteins (>75% of protein disorder) during seed development, Table S6: Complete list of down-regulated transcripts encoding highly intrinsically disordered proteins (>75% of protein disorder) during seed development, Table S7: Complete list of up-regulated key TFs, their target genes and their disorder category that they belong to.

Author Contributions

Conceptualization: J.A.Z.-B., A.P.-S., S.J.R.-H., L.C.R.-Z.; Writing and editing: J.A.Z.-B., A.P.-S., S.J.R.-H., E.C., L.C.R.-Z.; Project administration: L.C.R.-Z., E.C.; Data preparation: J.A.Z.-B.; Formal analysis: J.A.Z.B., A.P.-S.; Funding acquisition: L.C.R.-Z., E.C.; Methodology: J.A.Z.-B., A.P.-S.; Supervision: L.C.R.-Z.

Funding

This research was supported by Conacyt grant numbers (FC2016-1572, 221208, and S52089-Z).

Conflicts of Interest

The authors declare no conflict of interest.

References

  • 1.Wright P.E., Dyson H.J. Intrinsically disordered proteins in cellular signalling and regulation. Nat. Rev. Mol. Cell Biol. 2015 doi: 10.1038/nrm3920. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Uversky V.N., Gillespie J.R., Fink A.L. Why are ‘natively unfolded’ proteins unstructured under physiologic conditions? Proteins Struct. Funct. Genet. 2000 doi: 10.1002/1097-0134(20001115)41:3&#x0003c;415::AID-PROT130&#x0003e;3.0.CO;2-7. [DOI] [PubMed] [Google Scholar]
  • 3.Tompa P., Kovacs D. Intrinsically disordered chaperones in plants and animals. Biochem. Cell Biol. 2010 doi: 10.1139/O09-163. [DOI] [PubMed] [Google Scholar]
  • 4.Romero P., Obradovic Z., Li X., Garner E.C., Brown C.J., Dunker A.K. Sequence complexity of disordered protein. Proteins Struct. Funct. Genet. 2001 doi: 10.1002/1097-0134(20010101)42:1&#x0003c;38::AID-PROT50&#x0003e;3.0.CO;2-3. [DOI] [PubMed] [Google Scholar]
  • 5.Zamora-Briseño J.A., Reyes-Hernández S.J., Zapata L.C.R. Does water stress promote the proteome-wide adjustment of intrinsically disordered proteins in plants? Cell Stress Chaperones. 2018 doi: 10.1007/s12192-018-0918-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Liu Z., Huang Y. Advantages of proteins being disordered. Protein Sci. 2014 doi: 10.1002/pro.2443. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Iakoucheva L.M., Radivojac P., Brown C.J., O’Connor T.R., Sikes J.G., Obradovic Z., Dunker A.K. The importance of intrinsic disorder for protein phosphorylation. Nucleic Acids Res. 2004 doi: 10.1093/nar/gkh253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Iakoucheva L.M., Brown C.J., Lawson J.D., Obradović Z., Dunker A.K. Intrinsic disorder in cell-signaling and cancer-associated proteins. J. Mol. Biol. 2002 doi: 10.1016/S0022-2836(02)00969-5. [DOI] [PubMed] [Google Scholar]
  • 9.Tompa P., Szász C., Buday L. Structural disorder throws new light on moonlighting. Trends Biochem. Sci. 2005 doi: 10.1016/j.tibs.2005.07.008. [DOI] [PubMed] [Google Scholar]
  • 10.Pietrosemoli N., García-Martín J.A., Solano R., Pazos F. Genome-wide analysis of protein disorder in Arabidopsis thaliana: Implications for plant environmental adaptation. PLoS ONE. 2013 doi: 10.1371/journal.pone.0055524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Alvarez-Ponce D., Ruiz-González M.X., Vera-Sirera F., Feyertag F., Perez-Amador M.A., Fares M.A. Arabidopsis heat stress-induced proteins are enriched in electrostatically charged amino acids and intrinsically disordered regions. Int. J. Mol. Sci. 2018;19:2276. doi: 10.3390/ijms19082276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Yruela I., Contreras-Moreira B. Protein disorder in plants: A view from the chloroplast. BMC Plant Biol. 2012;12:165. doi: 10.1186/1471-2229-12-165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Vincent M., Schnell S. A collection of intrinsic disorder characterizations from eukaryotic proteomes. Sci. Data. 2016 doi: 10.1038/sdata.2016.45. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Liu Y., Wu J., Sun N., Tu C., Shi X., Cheng H., Liu S., Li S., Wang Y., Zheng Y., et al. Intrinsically disordered proteins as important players during desiccation stress of soybean radicles. J. Proteome Res. 2017 doi: 10.1021/acs.jproteome.6b01045. [DOI] [PubMed] [Google Scholar]
  • 15.Pazos F., Pietrosemoli N., García-Martín J.A., Solano R. Protein intrinsic disorder in plants. Front. Plant Sci. 2013 doi: 10.3389/fpls.2013.00363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Kovacs D., Kalmar E., Torok Z., Tompa P. Chaperone activity of ERD10 and ERD14, two disordered stress-related plant proteins. Plant Physiol. 2008 doi: 10.1104/pp.108.118208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kovacs D., Agoston B., Tompa P. Disordered plant LEA proteins as molecular chaperones. Plant Signal. Behav. 2008 doi: 10.4161/psb.3.9.6434. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Mouillon J.M., Eriksson S.K., Harryson P. Mimicking the plant cell interior under water stress by macromolecular crowding: Disordered dehydrin proteins are highly resistant to structural collapse. Plant Physiol. 2008 doi: 10.1104/pp.108.124099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Manfre A.J., LaHatte G.A., Climer C.R., Marcotte W.R. Seed dehydration and the establishment of desiccation tolerance during seed maturation is altered in the Arabidopsis thaliana mutant atem6-1. Plant Cell Physiol. 2009 doi: 10.1093/pcp/pcn185. [DOI] [PubMed] [Google Scholar]
  • 20.Chakrabortee S., Boschetti C., Walton L.J., Sarkar S., Rubinsztein D.C., Tunnacliffe A. Hydrophilic protein associated with desiccation tolerance exhibits broad protein stabilization function. Proc. Natl. Acad. Sci. USA. 2007 doi: 10.1073/pnas.0706964104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Das R.K., Ruff K.M., Pappu R.V. Relating sequence encoded information to form and function of intrinsically disordered proteins. Curr. Opin. Struct. Biol. 2015 doi: 10.1016/j.sbi.2015.03.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Pereira-Santana A., Alvarado-Robledo E.J., Zamora-Briseño J.A., Ayala-Sumuano J.T., Gonzalez-Mendoza V.M., Espadas-Gil F., Alcaraz L.D., Castaño E., Keb-Llanes M.A., Sanchez-Teyer F., et al. Transcriptional profiling of sugarcane leaves and roots under progressive osmotic stress reveals a regulated coordination of gene expression in a spatiotemporal manner. PLoS ONE. 2017 doi: 10.1371/journal.pone.0189271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Moore J.P., Le N.T., Brandt W.F., Driouich A., Farrant J.M. Towards a systems-based understanding of plant desiccation tolerance. Trends Plant Sci. 2009 doi: 10.1016/j.tplants.2008.11.007. [DOI] [PubMed] [Google Scholar]
  • 24.Nannas N.J., Kelly Dawe R. Genetic and genomic toolbox of Zea mays. Genetics. 2015 doi: 10.1534/genetics.114.165183. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Chen J., Zeng B., Zhang M., Xie S., Wang G., Hauck A., Lai J. Dynamic transcriptome landscape of maize embryo and endosperm development. Plant Physiol. 2014 doi: 10.1104/pp.114.240689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Prioul J.L., Méchin V., Lessard P., Thévenot C., Grimmer M., Chateau-Joubert S., Coates S., Hartings H., Kloiber-Maitz M., Murigneux A., et al. A joint transcriptomic, proteomic and metabolic analysis of maize endosperm development and starch filling. Plant Biotechnol. J. 2008 doi: 10.1111/j.1467-7652.2008.00368.x. [DOI] [PubMed] [Google Scholar]
  • 27.Yi F., Gu W., Chen J., Song N., Gao X., Zhang X., Zhou Y., Ma X., Song W., Zhao H., et al. High temporal-resolution transcriptome landscape of early maize seed development. Plant Cell. 2019 doi: 10.1105/tpc.18.00961. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Silva-Sanchez C., Chen S., Li J., Chourey P.S. A comparative glycoproteome study of developing endosperm in the hexose-deficient miniature1 (mn1) seed mutant and its wild type Mn1 in maize. Front. Plant Sci. 2014 doi: 10.3389/fpls.2014.00063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Walsh I., Martin A.J., Di Domenico T., Tosatto S.C. Espritz: Accurate and fast prediction of protein disorder. Bioinformatics. 2012 doi: 10.1093/bioinformatics/btr682. [DOI] [PubMed] [Google Scholar]
  • 30.Metsalu T., Vilo J. ClustVis: A web tool for visualizing clustering of multivariate data using principal Component Analysis and heatmap. Nucleic Acids Res. 2015 doi: 10.1093/nar/gkv468. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Conesa A., Götz S., García-Gómez J.M., Terol J., Talón M., Robles M. Blast2GO: A universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005 doi: 10.1093/bioinformatics/bti610. [DOI] [PubMed] [Google Scholar]
  • 32.Ye J., Fang L., Zheng H., Zhang Y., Chen J., Zhang Z., Wang J., Li S., Li R., Bolund L., et al. WEGO: A web tool for plotting GO annotations. Nucleic Acids Res. 2006 doi: 10.1093/nar/gkl031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Hanway J. How a Corn Plant Develops. [(accessed on 24 June 2019)];Sci. Technol. 1966 38 Special Report. 38. Available online: http://lib.dr.iastate.edu/specialreports/3. [Google Scholar]
  • 34.McCarthy D.J., Chen Y., Smyth G.K. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Res. 2012 doi: 10.1093/nar/gks042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Robinson M.D., McCarthy D.J., Smyth G.K. edgeR: A Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2009 doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Aibar S., González-Blas C.B., Moerman T., Huynh-Thu V., Imrichova H., Hulselmans G., Rambow F., Marine J.C., Geurts P., Aerts J., et al. SCENIC: Single-cell regulatory network inference and clustering. Nat. Methods. 2017 doi: 10.1038/nmeth.4463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Huynh-Thu V.A., Irrthum A., Wehenkel L., Geurts P. Inferring regulatory networks from expression data using tree-based methods. PLoS ONE. 2010 doi: 10.1371/journal.pone.0012776. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Breiman L. Statistical Modeling: The two cultures (with comments and a rejoinder by the author) Stat. Sci. 2002 doi: 10.1214/ss/1009213726. [DOI] [Google Scholar]
  • 39.Shannon P., Markiel A., Ozier O., Baliga N.S., Wang J.T., Ramage D., Amin N., Schwikowski B., Ideker T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003 doi: 10.1101/gr.1239303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Su G., Kuchinsky A., Morris J.H., States D.J., Meng F. GLay: Community structure analysis of biological networks. Bioinformatics. 2010 doi: 10.1093/bioinformatics/btq596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Vernoud V., Hajduch M., Khaled A.S., Depège N., Rogowsky P.M. Maize embryogenesis. Maydica. 2005;50:469. [Google Scholar]
  • 42.Arvidsson G., Wright A.P.H. A protein intrinsic disorder approach for characterising differentially expressed genes in transcriptome data: Analysis of cell-adhesion regulated gene expression in lymphoma cells. Int. J. Mol. Sci. 2018;19:3101. doi: 10.3390/ijms19103101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Chakrabortee S., Meersman F., Kaminski Schierle G.S., Bertoncini C.W., McGee B., Kaminski C.F., Tunnacliffe A. Catalytic and chaperone-like functions in an intrinsically disordered protein associated with desiccation tolerance. Proc. Natl. Acad. Sci. USA. 2010 doi: 10.1073/pnas.1006276107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Sun X., Rikkerink EH A., Jones W.T., Uversky V.N. Multifarious roles of intrinsic disorder in proteins illustrate its broad impact on plant biology. Plant Cell. 2013 doi: 10.1105/tpc.112.106062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Boothby T.C., Tapia H., Brozena A.H., Piszkiewicz S., Smith A.E., Giovannini I., Rebecchi L., Pielak G.J., Koshland D., Goldstein B. Tardigrades use intrinsically disordered proteins to survive desiccation. Mol. Cell. 2017 doi: 10.1016/j.molcel.2017.02.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Boothby T.C., Pielak G.J. Intrinsically isordered proteins and desiccation tolerance: Elucidating functional and mechanistic underpinnings of anhydrobiosis. BioEssays. 2017 doi: 10.1002/bies.201700119. [DOI] [PubMed] [Google Scholar]
  • 47.Wetzler D.E., Fuchs Wightman F., Bucci H.A., Rinaldi J., Caramelo J.J., Iusem N.D., Ricardi M.M. Conformational plasticity of the intrinsically disordered protein asr1 modulates its function as a drought stress-responsive gene. PLoS ONE. 2018 doi: 10.1371/journal.pone.0202808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Zamora-Briseño J.A., de Jiménez E.S. A LEA 4 protein up-regulated by ABA is involved in drought response in maize roots. Mol. Biol. Rep. 2016 doi: 10.1007/s11033-016-3963-5. [DOI] [PubMed] [Google Scholar]
  • 49.Carugo O. Amino acid composition and protein dimension. Protein Sci. 2008 doi: 10.1110/ps.037762.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Niklas K.J., Bondos S.E., Dunker A.K., Newman S.A. Rethinking gene regulatory networks in light of alternative splicing, intrinsically disordered protein domains, and post-translational modifications. Front. Cell Dev. Biol. 2015 doi: 10.3389/fcell.2015.00008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Minezaki Y., Homma K., Kinjo A.R., Nishikawa K. Human transcription factors contain a high fraction of intrinsically disordered regions essential for transcriptional regulation. J. Mol. Biol. 2006 doi: 10.1016/j.jmb.2006.04.016. [DOI] [PubMed] [Google Scholar]
  • 52.Yizhi Z., Hélène L., Antoine S., Régine L., Brigitte G. Exploring intrinsically disordered proteins in chlamydomonas reinhardtii. Sci. Rep. 2018 doi: 10.1038/s41598-018-24772-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Uversky V.N. Brenner’s Encyclopedia of Genetics. 2nd ed. Academic Press; San Diego, CA, USA: 2013. Intrinsically disordered proteins. [DOI] [Google Scholar]
  • 54.Williams R.M., Obradovi Z., Mathura V., Braun W., Garner E.C., Young J., Takayama S., Brown C.J., Dunker A.K. The protein non-folding problem: amino acid determinants of intrinsic order and disorder; Proceedings of the Pacific Symposium on Biocomputing; Mauna Lani, HI, USA. 3–7 January 2001; [DOI] [PubMed] [Google Scholar]
  • 55.Méchin V., Balliau T., Château-Joubert S., Davanture M., Langella O., Négroni L., Prioul J.L., Thévenot C., Zivy M., Damerval C. A two-dimensional proteome map of maize endosperm. Phytochemistry. 2004 doi: 10.1016/j.phytochem.2004.04.035. [DOI] [PubMed] [Google Scholar]
  • 56.Liu Z.H., Ji H.Q., Cui Z.T., Wu X., Duan L.J., Feng X.X., Tang J.H. QTL detected for grain-filling rate in maize using a RIL population. Mol. Breed. 2011 doi: 10.1007/s11032-010-9410-8. [DOI] [Google Scholar]
  • 57.Jin X., Fu Z., Ding D., Li W., Liu Z., Tang J. Proteomic identification of genes associated with maize grain-filling rate. PLoS ONE. 2013 doi: 10.1371/journal.pone.0059353. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Esen A. A proposed nomenclature for the alcohol-soluble proteins (zeins) of maize (Zea mays L.) J. Cereal Sci. 1987 doi: 10.1016/S0733-5210(87)80015-2. [DOI] [Google Scholar]
  • 59.Coleman C.E., Herman E.M., Takasaki K., Larkins B.A. The Maize g-Zein Sequesters a-Zein and Stabilizes Its Accumulation in Protein Bodies of Transgenic Tobacco Endosperm. Plant Cell. 2007 doi: 10.2307/3870472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Wilson C.M. Serial analysis of Zein by isoelectric focusing and sodium dodecyl sulfate gel electrophoresis. Plant Physiol. 2008 doi: 10.1104/pp.82.1.196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Woo Y.M., Hu D.W., Larkins B.A., Jung R. Genomics analysis of genes expressed in maize endosperm identifies novel seed proteins and clarifies patterns of zein gene expression. Plant Cell. 2001;13:2297–2317. doi: 10.1105/tpc.010240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Li J.S., Vasal S.K. Encyclopedia of Food Grains. 2nd ed. Academic Press; Cambridge, MA, USA: 2015. Maize: Quality protein maize. eBook. [DOI] [Google Scholar]
  • 63.Meng Y., Cloutier S. Microencapsulation in the Food Industry. Academic Press; Cambridge, MA, USA: 2014. Gelatin and Other Proteins for Microencapsulation. eBook. [DOI] [Google Scholar]
  • 64.Song R., Llaca V., Linton E., Messing J. Sequence, regulation, and evolution of the maize 22-kD alpha zein gene family. Genome Res. 2001;11:1817–1825. doi: 10.1101/gr.197301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Schmidt R.J., Ketudat M., Aukerman M.J., Hoschek G. Opaque-2 is a transcriptional activator that recognizes a specific target site in 22-kD zein genes. Plant Cell. 2007 doi: 10.2307/3869527. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Gibbon B.C., Larkins B.A. Molecular genetic approaches to developing quality protein maize. Trends Genet. 2005 doi: 10.1016/j.tig.2005.02.009. [DOI] [PubMed] [Google Scholar]
  • 67.Devic M., Roscoe T. Seed maturation: Simplification of control networks in plants. Plant Sci. 2016 doi: 10.1016/j.plantsci.2016.08.012. [DOI] [PubMed] [Google Scholar]
  • 68.Peng Z., Mizianty M.J., Xue B., Kurgan L., Uversky V.N. More than just tails: Intrinsic disorder in histone proteins. Mol. Biosyst. 2012 doi: 10.1039/c2mb25102g. [DOI] [PubMed] [Google Scholar]
  • 69.Reeves R. HMG nuclear proteins: Linking chromatin structure to cellular phenotype. Biochim. Biophys. Acta. 2009 doi: 10.1016/j.bbagrm.2009.09.001. [DOI] [Google Scholar]
  • 70.Slama-Schwok A., Zakrzewska K., Léger G., Leroux Y., Takahashi M., Käs E., Debey P. Structural changes induced by binding of the high-mobility group I protein to a mouse satellite DNA sequence. Biophys. J. 2000 doi: 10.1016/S0006-3495(00)76799-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Love J.J., Li X., Chung J., Dyson H.J., Wright P.E. The LEF-1 high-mobility group domain undergoes a disorder-to-order transition upon formation of a complex with cognate DNA. Biochemistry. 2004 doi: 10.1021/bi049591m. [DOI] [PubMed] [Google Scholar]
  • 72.Zhao Y., Li C., Ge J., Xu M., Zhu Q., Wu T., Guo A., Xie J., Dong H. Recessive Mutation Identifies Auxin-Repressed Protein ARP1, Which Regulates Growth and Disease Resistance in Tobacco. Mol. Plant Microbe Interact. 2014 doi: 10.1094/MPMI-08-13-0250-R. [DOI] [PubMed] [Google Scholar]
  • 73.Shi H.Y., Zhang Y.X., Chen L. Two pear auxin-repressed protein genes, PpARP1 and PpARP2, are predominantly expressed in fruit and involved in response to salicylic acid signaling. Plant Cell Tissue Organ Cult. 2013 doi: 10.1007/s11240-013-0321-3. [DOI] [Google Scholar]
  • 74.Wu L., Yu M., Holowachuk J., Sharpe A., Lydiate D., Dwayne H., Gruber M. Evaluation of a Brassica napus auxin-repressed gene induced by flea beetle damage and Sclerotinia sclerotiorum infection. Am. J. Plant Sci. 2017 doi: 10.4236/ajps.2017.88130. [DOI] [Google Scholar]
  • 75.Borisjuk N., Sitailo L., Adler K., Malysheva L., Tewes A., Borisjuk L., Manteuffel R. Calreticulin expression in plant cells: Developmental regulation, tissue specificity and intracellular distribution. Planta. 1998 doi: 10.1007/s004250050427. [DOI] [PubMed] [Google Scholar]
  • 76.High S., Lecomte F.J.L., Russell S.J., Abell B.M., Oliver J.D. Glycoprotein folding in the endoplasmic reticulum: A tale of three chaperones? FEBS Lett. 2000 doi: 10.1016/S0014-5793(00)01666-5. [DOI] [PubMed] [Google Scholar]
  • 77.Varricchio L., Falchi M., Dall’Ora M., De Benedittis C., Ruggeri A., Uversky V.N., Migliaccio A.R. Calreticulin: Challenges posed by the intrinsically disordered nature of calreticulin to the study of its function. Front. Cell Dev. Biol. 2017 doi: 10.3389/fcell.2017.00096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Qiu Y., Xi J., Du L., Poovaiah B.W. The function of calreticulin in plant immunity: New discoveries for an old protein. Plant Signal. Behav. 2012 doi: 10.4161/psb.20721. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Michalak M., Groenendyk J., Szabo E., Gold L.I., Opas M. Calreticulin, a multi-process calcium-buffering chaperone of the endoplasmic reticulum. Biochem. J. 2009 doi: 10.1042/BJ20081847. [DOI] [PubMed] [Google Scholar]
  • 80.Dure L., Galau G.A. Developmental biochemistry of cottonseed embryogenesis and germination 1. Plant Physiol. 1981 doi: 10.1007/BF01578378. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Hundertmark M., Hincha D.K. LEA (late embryogenesis abundant) proteins and their encoding genes in Arabidopsis thaliana. BMC Genom. 2008 doi: 10.1186/1471-2164-9-118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Delahaie J., Hundertmark M., Bove J., Leprince O., Rogniaux H., Buitink J. LEA polypeptide profiling of recalcitrant and orthodox legume seeds reveals ABI3-regulated LEA protein abundance linked to desiccation tolerance. J. Exp. Bot. 2013 doi: 10.1093/jxb/ert274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Olvera-Carrillo Y., Luis Reyes J., Covarrubias A.A. Late embryogenesis abundant proteins. Plant Signal. Behav. 2011 doi: 10.4161/psb.6.4.15042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Thomann E.B., Sollinger J., White C., Rivin C.J. Accumulation of group 3 late embryogenesis abundant proteins in Zea mays embryos: Roles of abscisic acid and the viviparous-1 gene product. Plant Physiol. 2008 doi: 10.1104/pp.99.2.607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Kavi Kishor P.B. Role of proline in cell wall synthesis and plant development and its implications in plant ontogeny. Front. Plant Sci. 2015 doi: 10.3389/fpls.2015.00544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Park S.J., Kwak K.J., Oh T.R., Kim Y.O., Kang H. Cold shock domain proteins affect seed germination and growth of Arabidopsis thaliana under abiotic stress conditions. Plant Cell Physiol. 2009 doi: 10.1093/pcp/pcp037. [DOI] [PubMed] [Google Scholar]
  • 87.Lu Y., Sun J., Yang Z., Zhao C., Zhu M., Ma D., Dong T., Zhou Z., Liu M., Yang D., et al. Genome-wide identification and expression analysis of glycine-rich RNA-binding protein family in sweet potato wild relative Ipomoea trifida. Gene. 2019 doi: 10.1016/j.gene.2018.11.044. [DOI] [PubMed] [Google Scholar]
  • 88.Kim J.Y., Kim W.Y., Kwak K.J., Oh S.H., Han Y.S., Kang H. Glycine-rich RNA-binding proteins are functionally conserved in Arabidopsis thaliana and Oryza sativa during cold adaptation process. J. Exp. Bot. 2010 doi: 10.1093/jxb/erq058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Benešová M., Holá D., Fischer L., Jedelský P.L., Hnilička F., Wilhelmová N., Rothová O., Kočová M., Procházková D., Honnerová J., et al. The physiology and proteomics of drought tolerance in Maize: Early stomatal closure as a cause of lower tolerance to short-term dehydration? PLoS ONE. 2012 doi: 10.1371/journal.pone.0038017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Checker V.G., Khurana P. Molecular and functional characterization of mulberry EST encoding remorin (MiREM) involved in abiotic stress. Plant Cell Rep. 2013 doi: 10.1007/s00299-013-1483-5. [DOI] [PubMed] [Google Scholar]
  • 91.Jaiswal P.S., Mittal N., Randhawa G.S. Cyamopsis tetragonoloba type 1 metallothionein (CtMT1) gene is upregulated under drought stress and its protein product has an additional C-X-C motif and unique metal binding pattern. Int. J. Biol. Macromol. 2018 doi: 10.1016/j.ijbiomac.2018.08.027. [DOI] [PubMed] [Google Scholar]
  • 92.Theißen G., Gramzow L. Chapter 8—Structure and Evolution of Plant MADS Domain Transcription Factors A2. In: Gonzalez D.H., editor. Plant Transcription Factors. Academic Press; Cambridge, MA, USA: 2016. eBook. [DOI] [Google Scholar]
  • 93.Dubos C., Stracke R., Grotewold E., Weisshaar B., Martin C., Lepiniec L. MYB transcription factors in Arabidopsis. Trends Plant Sci. 2010 doi: 10.1016/j.tplants.2010.06.005. [DOI] [PubMed] [Google Scholar]
  • 94.Hartings H., Lauria M., Lazzaroni N., Pirona R., Motto M. The Zea mays mutants opaque-2 and opaque-7 disclose extensive changes in endosperm metabolism as revealed by protein, amino acid, and transcriptome-wide analyses. BMC Genom. 2011 doi: 10.1186/1471-2164-12-41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Motto M., Thompson R., Salamini F. Cellular and Molecular Biology of Plant Seed Development. Springer; Heidelberg, Germany: 1997. Genetic regulation of carbohydrate and protein accumulation in seeds. [DOI] [Google Scholar]
  • 96.Brochetto-Braga M.R., Leite A., Arruda P. Partial purification and characterization of lysine-ketoglutarate reductase in normal and opaque-2 Maize endosperms. Plant Physiol. 2008 doi: 10.1104/pp.98.3.1139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Nuruzzaman M., Sharoni A.M., Kikuchi S. Roles of NAC transcription factors in the regulation of biotic and abiotic stress responses in plants. Front. Microbiol. 2013 doi: 10.3389/fmicb.2013.00248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Pereira-Santana A., Alcaraz L.D., Castaño E., Sanchez-Calderon L., Sanchez-Teyer F., Rodriguez-Zapata L. Comparative genomics of NAC transcriptional factors in angiosperms: Implications for the adaptation and diversification of flowering plants. PLoS ONE. 2015 doi: 10.1371/journal.pone.0141866. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Le D.T., Nishiyama R., Watanabe Y., Mochida K., Yamaguchi-Shinozaki K., Shinozaki K., Tran L.S. Genome-wide survey and expression analysis of the plant-specific NAC transcription factor family in soybean during development and dehydration stress. DNA Res. 2011 doi: 10.1093/dnares/dsr015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Fang Y., You J., Xie K., Xie W., Xiong L. Systematic sequence analysis and identification of tissue-specific or stress-responsive genes of NAC transcription factor family in rice. Mol. Genet. Genom. 2008 doi: 10.1007/s00438-008-0386-6. [DOI] [PubMed] [Google Scholar]
  • 101.Hu W., Wei Y., Xia Z., Yan Y., Hou X., Zou M., Lu C., Wang W., Peng M. Genome-wide identification and expression analysis of the NAC transcription factor family in Cassava. PLoS ONE. 2015 doi: 10.1371/journal.pone.0136993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Mishra S.K., Tripp J., Winkelhaus S., Tschiersch B., Theres K., Nover L., Scharf K.D. In the complex family of heat stress transcription factors, HsfA1 has a unique role as master regulator of thermotolerance in tomato. Genes Dev. 2002 doi: 10.1101/gad.228802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Miller G., Mittler R. Could heat shock transcription factors function as hydrogen peroxide sensors in plants? Ann. Bot. 2006 doi: 10.1093/aob/mcl107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Ying S., Zhang D.F., Fu J., Shi Y.S., Song Y.C., Wang T.Y., Li Y. Cloning and characterization of a maize bZIP transcription factor, ZmbZIP72, confers drought and salt tolerance in transgenic Arabidopsis. Planta. 2012 doi: 10.1007/s00425-011-1496-7. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials


Articles from Genes are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES