Skip to main content
G3: Genes | Genomes | Genetics logoLink to G3: Genes | Genomes | Genetics
. 2018 Jun 22;8(8):2781–2803. doi: 10.1534/g3.118.200423

Cotton Late Embryogenesis Abundant (LEA2) Genes Promote Root Growth and Confer Drought Stress Tolerance in Transgenic Arabidopsis thaliana

Richard Odongo Magwanga *,†,1, Pu Lu *,1, Joy Nyangasi Kirungu *,1, Qi Dong *, Yangguang Hu *, Zhongli Zhou *, Xiaoyan Cai *, Xingxing Wang *, Yuqing Hou *, Kunbo Wang *,2, Fang Liu *,2
PMCID: PMC6071604  PMID: 29934376

Abstract

Late embryogenesis abundant (LEA) proteins play key roles in plant drought tolerance. In this study, 157, 85 and 89 candidate LEA2 proteins were identified in G. hirsutum, G. arboreum and G. raimondii respectively. LEA2 genes were classified into 6 groups, designated as group 1 to 6. Phylogenetic tree analysis revealed orthologous gene pairs within the cotton genome. The cotton specific LEA2 motifs identified were E, R and D in addition to Y, K and S motifs. The genes were distributed on all chromosomes. LEA2s were found to be highly enriched in non-polar, aliphatic amino acid residues, with leucine being the highest, 9.1% in proportion. The miRNA, ghr-miR827a/b/c/d and ghr-miR164 targeted many genes are known to be drought stress responsive. Various stress-responsive regulatory elements, ABA-responsive element (ABRE), Drought-responsive Element (DRE/CRT), MYBS and low-temperature-responsive element (LTRE) were detected. Most genes were highly expressed in leaves and roots, being the primary organs greatly affected by water deficit. The expression levels were much higher in G. tomentosum as opposed to G. hirsutum. The tolerant genotype had higher capacity to induce more of LEA2 genes. Over expression of the transformed gene Cot_AD24498 showed that the LEA2 genes are involved in promoting root growth and in turn confers drought stress tolerance. We therefore infer that Cot_AD24498, CotAD_20020, CotAD_21924 and CotAD_59405 could be the candidate genes with profound functions under drought stress in upland cotton among the LEA2 genes. The transformed Arabidopsis plants showed higher tolerance levels to drought stress compared to the wild types. There was significant increase in antioxidants, catalase (CAT), peroxidase (POD) and superoxide dismutase (SOD) accumulation, increased root length and significant reduction in oxidants, Hydrogen peroxide (H2O2) and malondialdehyde (MDA) concentrations in the leaves of transformed lines under drought stress condition. This study provides comprehensive analysis of LEA2 proteins in cotton thus forms primary foundation for breeders to utilize these genes in developing drought tolerant genotypes.

Keywords: LEA2 proteins, miRNAs, Drought stress, Expression analysis, Transgenic plant, Oxidants, Antioxidants


Drought stress is one of the major abiotic stress factors with deleterious effects in plant growth and development (Sofia et al. 2013). With the ever changing environmental condition and erratic precipitation levels, plant production is projected to undergo further decline, that meeting the demands and needs of the growing population will be a challenge in the near future (Tilman et al. 2011). Plants being sessile, the effects caused by the various abiotic stresses are enormous thus threatening their existence (Rejeb et al. 2014). Plants have developed various coping strategies for continued survival under these extreme conditions, one of which is through the induction of various transcriptome factors (TFs) with the aim of boosting their tolerance level (Xiong and Ishitani 2006). One of the transcriptome factor (TF) that has a functional role under various abiotic stress conditions is a member of the late embryogenesis abundant (LEA) proteins (Rodriguez-Salazar et al. 2017). LEA proteins are basically grouped into eight (8) sub families, named as LEA1, LEA2, LEA3, LEA4, LEA5, LEA6, seed maturation proteins (SMPs) and dehydrins (Battaglia and Covarrubias 2013). In several studies conducted on the genome wide identification, the proteins encoding the late embryogenesis abundant (LEA) genes have been found to be the most abundant among all the other LEA protein families (Yang and Xia 2011).

LEA2 proteins are the members of a larger protein family of the late embryogenesis abundant (LEA) (Hundertmark and Hincha 2008). As the name suggests, this group of proteins are found to in large quantities in seeds at the late stages of embryo development (Dure et al. 1983). Even though, the LEA proteins are synonymous with the seeds, a number of LEA proteins have been detected in the other plant tissues, such as the vegetative tissues (de Nazaré Monteiro Costa et al. 2011). The distribution of LEA proteins is not restricted to plants only, but have been found in animals (10) (Denekamp et al. 2010) and in bacteria (11) (Espelund et al. 1992). The LEA protein families basically have universal structural architecture, high hydrophilicity, low proportion of cysteine (Cys) and tryptophan (Trp) residues and high contents of arginine (Arg), lysine (Lys), glutamate (Glu), alanine (Ala), threonine (Thr) and glycine (Gly). Due to the unique and common features of the LEA proteins, the LEA proteins are mainly referred as hydrophilins with a hydrophilicity index of more than 1 and a glycine (Gly) content of more than 6% (Battaglia et al. 2008).

The late embryogenesis abundant (LEA) proteins have been positively correlated with several of abiotic stress, and have been found to confer tolerance in plants such as Brassica napus (Dalal et al. 2009), rice (He et al. 2012) and Fagus sylvatica (Jiménez et al. 2008). For instance, overexpression of Arabidopsis LEA gene, AtLEA3 have been found to enhance tolerance to drought and salinity stresses (Zhao et al. 2011). Overexpression of a rice LEA gene type, OsLEA3-1 was found to confer drought tolerance (Xiao et al. 2007). Similarly, the LEA gene HVA1 LEA gene from barley, was found to confer dehydration tolerance in transgenic rice (Babu et al. 2004). In addition, SiLEA14, a novel gene was found to be highly expressed in the roots of foxtail millet under drought condition (Wang et al. 2014). However, the precise roles of LEA proteins are still not well understood. A number of proposals have been made to explain the possible roles of the LEA proteins in plants during water deficit conditions, such as enzyme protection (Hand et al. 2011), molecular shield (Furuki et al. 2011), hydration buffer (Hundertmark et al. 2012) and membrane interactions (Olvera-Carrillo et al. 2011). To date, a number of studies have been conducted in trying to determine the distribution and characterization of the LEA proteins in various plants, for instance Arabidopsis (Hundertmark and Hincha 2008), Brassica napus (Dalal et al. 2009), water melon (Celik Altunoglu et al. 2017) among other plants. Despite all the significance of the LEA genes, little has been done to investigate their putative role in cotton in relation to drought stress tolerance.

Cotton (Gossypium hirsutum) is an economically important fiber and oil crop cultivated in many tropical and subtropical areas of the world, where they are constantly exposed to a range of abiotic stresses which includes drought, extreme temperature and high salinity (Mahajan et al. 2005). The completion and publication of the draft genome sequences of upland cotton G. hirsutum (Li et al. 2015b), Gossypium arboreum (Li et al. 2015c) and Gossypium raimondii (Wang et al. 2012) has become a valuable tool in elucidating the transcriptome factors (TFs) in cotton genomes. There is a paucity of information available about LEA2 sub family in upland cotton. Therefore, in this study we carried out the identification, characterization of the LEA2 genes in three cotton genomes and transformed a novel LEA2 gene, Cot_AD24498 into Arabidopsis thaliana, in which we further investigated the expression levels of the transformed gene in both the transgenic lines and the wild type (WT) under drought stress condition.

Materials and methods

Identification, Sequence Analysis, Phylogenetic Tree Analysis and Subcellular Location Prediction of The LEA2 Proteins In Cotton

G. hirsutum, tetraploid (AD) genome LEA2 protein sequences were downloaded from the Cotton Research Institute website (http://mascotton.njau.edu.cn). The G. arboreum of A genome LEA2 protein sequences were downloaded from the Beijing Genome Institute database (https://www.bgi.com/), and G. raimondii of D genome was obtained from Phytozome (http://www.phytozome.net/). The conserved domain of LEA2 protein (PF03168) was downloaded from Pfam protein families (http://pfam.xfam.org). The hidden Markov model analysis (HMM) profile of LEA2 protein was queried to carry out the HMMER search (http://hmmer.janelia.org/) (Finn et al. 2011) against G. hirsutum, G. raimondii and G. arboreum protein sequences. The amino acids sequences were analyzed for the presence of the LEA2 protein domains by ScanProsite tool (http://prosite.expasy.org/scanprosite/) and SMART program (http://smart.embl-heidelberg.de/). The three cotton genomes LEA2 proteins together with the LEA2 proteins from Arabidopsis (http://www.arabidopsis.org/) and rice (http://rice.plantbiology.msu.edu/index.shtml) were used to investigate the evolutionary history and patterning in relation to orthology or paralogy among the proteins encoding LEA2 genes. A phylogenetic tree was constructed, the multiple sequence alignments of all the LEA2 proteins were done by Clustal omega, MEGA 7.0 software using default parameters as described by Higgins et al., (Higgins et al. 1996). The physiochemical characteristics of all the obtained LEA2 proteins were determined through an online ExPASy Server tool (http://www.web.xpasy.org/compute_pi/). In addition, subcellular location prediction for all the upland cotton LEA2 proteins were determined through Wolfpsort (https://www.wolfpsort.hgc.jp/) (Horton et al. 2007). The subcellular prediction results were further validated through other two online tools TargetP1.1 server (Emanuelsson et al. 2007) and Protein Prowler Subcellular Localization Predictor version 1.2 (http://www.bioinf.scmb.uq.edu.au/pprowler_webapp_1-2/) (Bodén and Hawkins 2005).

Analysis of promoter regions, chromosomal locations and miRNA target prediction of LEA2 genes

To identify the presence of drought stress-responsive cis-acting regulatory elements in LEA2 promoter regions, 1 kb up and down stream region from the translation start site of the LEA2 genes were analyzed using the PLACE database (http://www.dna.affrc.go.jp/place/signalscan.html) (Higo et al. 1999). The physical locations in base pair (bp) of each LEA2 genes were determined through BLASTN searching against the local database. Mapchart software (https://www.wur.nl/en/show/Mapchart.htm) (Voorrips 2002), was used to plot the gene loci on G. hirsutum, G. arboreum and G.raimondii chromosomes. Finally we analyzed the miRNA targeting the LEA2 genes by submitting all the coding sequences (CDS) of all the LEA2 genes to the psRNATarget database (http://plantgrn.noble.org/psRNATarget/).

Expression analysis of LEA2 genes and determination of the gene to be transformed

The qRT-PCR analysis was used to determine the expression changes of the LEA2 genes in response to drought stress in the two parental lines used. the upland elite cultivar, G. hirsutum is known to be drought sensitive while the wild tetraploid cotton, G. tomentosum is a drought tolerant (Zheng et al. 2016). The two cotton genotypes were treated for drought stress for 14 days. The samples for RNA extraction were obtained from the leaves, stem and roots, at 0, 7 and 14 days of stress exposure. All the samples were taken in three biological replicates in both control and treated seedlings. In order to get the best sets of the LEA2 genes for carrying out qRT-PCR validation, we had to rely on the RNA-sequencing data profiled under drought stress condition. The RNA-Sequence data were downloaded from cotton research institute website (http://mascotton.njau.edu.cn/html/Data). RNAs were reversely transcribed to first strand cDNA by use of TransCript-All-in-One-First-Strand cDNA synthesis Super Mix for qPCR (TransGen, Beijing, China). The fluorescent quantitative primers were designed for the selected genes (24 up and 24 down regulated genes) using Primer Premier 5 (Supplemental Table S1). Actin gene served as a reference. The synthesized cDNA was pre-incubated at 95° for 15 sec, followed by 40 cycles of denaturation at 95° for 5 sec and extension at 60° for 34 sec. The fluorescence quantitative assay was used to analyze expression level of the LEA2 genes in root, leaves and stem tissues of cotton plant, and expression changes in G. hirsutum and G. tomentosum under drought stress. The assay was designed with three replicates and the results were analyzed with the double delta Ct method.

Transformation and Screening of Novel gene Cot_AD24498 (LEA2) in the Model Plant Arabidopsis thaliana (Ecotype Colombia-0) Lines

The gene was transformed into model plant, A. thaliana ecotype Colombia-0 (Col-0). The upland cotton, G. hirsutum, accession number CRI-12 (G09091801–2) was used to confirm for the presence of the Cot_AD24498 gene in various tissues. The pWM101-35S:Cot_AD24498 (LEA2) construct in Agrobacterium tumefaciens GV3101 was confirmed by gene specific primer, the forward primer sequence Cot_AD24498 (5′CGGATCCATGTCGGTAAAAGAGTGCGGC3′) and reverse primer sequence pair of Cot_AD24498 (5′GGTCGACTTACACGCTAACACTGCATCT3′), synthesized from Invitrogen, Beijing, China. The Arabidopsis Wild-type (WT) plants were transformed by use of floral dip method (Clough SJ und Bent A 1998). Infiltration media mainly composed of 4.3 g/l, sucrose 50 g/l (5%), 2-(4-morpholino) ethane sulfonic acid (MES) 0.5 g/l, Silwet-77 200 µl/l (0.02%), 6-benzylaminopurine (6-BA) 0.01 mg/l with pH of 5.7. Transformed lines of A. thaliana were selected by germinating seeds on 50% (0.5) MS (PhytoTechnology Laboratories, Lenexa, USA), containing 50 mg/l hygromycin B (Roche Diagnostics GmbH, Mannheim, Germany) for a duration of three (3) days at temperature of 4° to optimize germination. Upon which the seedlings were transferred to Arabidopsis conditioned growth room set at 16 hr light and 8 hr dark. After 7 days in selection medium, and at three true leaves stage, the seedlings were transplanted into small plastic containers filled with vermiculite and humus in equal ratios. The seedlings at generation T0 were grown to set seeds, the seeds obtained were generation T1. The T1 seeds were germinated in selective antibiotic medium; the one-copy lines were identified by determining the segregation ratio of 3:1 of the antibiotics-selectable marker. The 3:1 ratio of the segregated lines (T2) seeds were again germinated in antibiotics-selective medium, only the lines with 100% were selected for the development of T3 generation. The T3 homozygous progeny was bred from a T2 population after real-time quantitative reverse transcription polymerase chain reaction (qRT-PCR) and the selection of three out of the eight successfully transformed overexpressed lines (L2, L3, and L4) was done by using Cot_AD24498 (LEA2) forward primer sequence (5′CGAACATCCATCCCTCCAAC3′) and Cot_AD24498 (LEA2) reverse primer sequence (5′ATCATCAAGAAAACCGACCC3′) with total complementary DNA (cDNA) as template. The phenotypic investigations were carried out in T3 homozygous generation.

qRT-PCR Analysis of the Expression of Drought-Responsive Genes in Transgenic Arabidopsis

We assessed the action of the transformed gene in the transgenic lines and the wild type of the model plant, A. thaliana by carrying out expression analysis of two drought responsive genes. ABRE-binding factor 4 (ABF4) gene; forward sequence 5′AACAACTTAGGAGGTGGTGGTCAT3′ and reverse sequence 5′TGTAGCAGCTGGCGCAGAAGTCAT3′ and responsive to desiccation 29A (RD29A) gene with forward sequence 5′TGAAAGGAGGAGGAGGAATGGTTGG3′ and the reverse sequence 5′ACAAAACACACATAAACATCCAAAGT3′. Total RNA was isolated from four-week-old transgenic Arabidopsis seedlings and wild type (Columbia ecotype) grown under normal conditions (CK) and 15% PEG6000 treatments for 4 days. RNA extraction and real-time RT-PCR (qRT-PCR) analyzed was applied as described in the section” Expression analysis of LEA2 genes and determination of the gene to be transformed”, cotton Actin2 forward sequence 5′ATCCTCCGTCTTGACCTTG3′ and reverse sequence 5′TGTCCGTCAGGCAACTCAT3′ applied as the reference gene.

Quantification of oxidant and antioxidants in transgenic lines and the wild type

When plants are exposed to any form of stress, there are drastic changes which occurs both at molecular and cellular level in order to tolerate the stress factors (Gill et al. 2016). Reactive oxygen species is an oxidant substance being produced continuously from the respiring cells, and plants have an elaborate mechanism to keep the level within nontoxic limit, but when stresses such as drought sets in, the ROS equilibrium shifts leading to excessive production. In this research work, we undertook to evaluate the various oxidants and antioxidants levels between the transgenic lines (L1, L2 and L3) compared to the wild type when exposed to drought stress condition. Catalase (CAT), superoxide dismutase (SOD), peroxidase (POD), Malondialdehyde (MDA) and hydrogen peroxide (H2O2) levels were quantified according to the method described by Bartosz (Bartosz 2005). The seeds for transgenic and the wild types were grown in0.5 MS for eight (8) days, then transferred to small conical containers filled with a mixture vermiculite and sand in the ratio of 1:1 and grown for 21 days. After 21 days, water was totally withdrawn from drought treated plants for a period of 8 days, while the controlled plants were watered normally. The leaf samples were then harvested for antioxidants and oxidant determination after 8 days of post stress exposure. The samples were obtained in triplicate, in which each represented a biological repeat.

Availability of Data Statement

The author do affirms that all the data supporting the conclusions of this research work are represented fully within the manuscripts and its supplementary files. Supplemental material available at Figshare: https://doi.org/10.25387/g3.6626849.

Results and discussion

LEA2 protein encoding genes in the cotton genome and other plants

In the identification of the LEA2 proteins in the three cotton genomes, we employed the Hidden Markov Model (HMM profile) of the Pfam LEA2 domains PF03168, as keyword to search the three cotton genome sequences databases. Based on the Pfam domain search, we obtained 200 LEA2 genes in G. hirsutum of AD genome, 101 LEA2 genes in G. raimondii of D genome and 110 LEA genes in G. arboreum of A genome. In order to ascertain the various genes obtained for the three cotton genomes, we carried out manual search through SMART (http://smart.embl.de/smart/) and PFAM database (http://pfam.xfam.org) to verify the presence of the LEA2 gene domain. Upon removal of the redundant sequences with no functional domain or those that lacked the LEA2 domains, we eventually obtained 157, 85 and 89 LEA2 proteins in G. hirsutum, G. arboreum and G. raimondii, respectively. The confirmed domains of the LEA2 proteins in the three cotton genomes were further analyzed for their functional domain attributes of the LEA2 proteins, by use of an online tool, conserved domain database (CDD) tool hosted in the NCBI database. The results showed that the LEA2 proteins were members of c112118 super family with E values ranging from 0 to 0.008 (Supplementary Table S2) and all contained transmembrane domain (Supplementary Table S3) The association of the LEA2s with transmembrane domain could possibly explain the reason why the LEA proteins are found in high concentrations in seeds at late stages of seed development, this possibly to aid in maintaining the stability of the cell membrane under dehydration state. Similar results have also been reported in some of the drought and salt enhancing genes such as Salicornia brachiata SNARE-like superfamily protein (SbSLSP), has been reported to be localized in the plasma membrane (Singh et al. 2016). LEA2 proteins could be playing an integral role in maintaining non-lethal level of reactive oxygen species (ROS homeostasis) in order to minimize oxidative damages to cellular membranous and macromolecules, in addition, LEA2s could also be playing similar roles as the aquaporin’s, the water channel proteins, which are responsible in the regulation of water movement channels such as plasmodesmata and xylem vessels (Buckley 2015). Aquaporin’s (AQPs) have been associated with salt and drought stress tolerance in plants, the aquaporin’s share similar functional domain with LEAs, being basically membrane proteins (Li et al. 2015a).

The number of proteins encoding the LEA2 genes found in G. arboreum, G. raimondii and G. hirsutum were relatively higher than the number recorded in other plants, the entire repertoire of LEA proteins in the 8 LEA families outlined in (Hundertmark and Hincha 2008) have been found to be 34 in rice (Wang et al. 2007), 30 in Chinese plum (Du et al. 2013), 27 in tomatoes (Cao and Li 2014), 53 in poplar (Lan et al. 2013) and 29 in potatoes (Charfeddine et al. 2015), which is far below the individual numbers of LEA2 in the three cotton genome. The abundance of cotton proteins encoding the LEA2 genes could be possibly due to their unique characteristics of being more hydrophobic than other LEA2 proteins from other species and or they could have evolved much later after other transcriptome factors. The genome size of plants and animal is constant, and high abundance of a particular gene family gives an indication of their integral role in enhancing the survival of the plants. The ever changing environmental conditions, plants are constantly faced with hearse environmental condition and disadvantaged by their sessile nature. The survival of the plants under these extreme environmental conditions therefore is through the increase of more stress tolerance genes or integrating a more complex gene interaction in initiating adaptive response mechanisms aimed at increased tolerance levels (Avramova 2015).

Phylogenetic analyses of LEA2 proteins in G. hirsutum, G. arboreum and G. raimondii

Phylogenetic tree analysis provides valuable knowledge on the lines of evolutionary descent of different genes or proteins from a common ancestor, since its inception, it has remained a powerful tool for structuring classifications, biological diversity and for providing insight into events that occurred during gene evolution (Gregory 2008). In this study a total of 157, 85 and 89 LEA2 proteins were identified from G. hirsutum, G. arboreum and G. raimondii, respectively (Table 1). All the LEA2 proteins were aligned by the neighbor joining (NJ) method in ClustalW. The various LEA2 proteins from upland cotton, G. arboreum, G. raimondii, A. thaliana, T. cacao and G. max were analyzed. The inclusion of A. thaliana, T. cacao and G. max in the analysis of the cotton LEA2s was due to fact that Theobroma cacao share ancestral origins with cotton, A. thaliana and G. max have undergone whole genome duplication similar to cotton plant. The resulting phylogenetic tree showed that the cotton LEA genes tend to cluster together. Based on the clustering pattern, the LEA2 genes were sub-divided into 6 groups, namely group 1 with three sub-groups, group 2, group 3 with two sub-groups, group 4, group 5 and finally group 6 with 5 sub-groups. Groups 1, 2, 4 and 5 were entirely LEA2 proteins from the three cotton genomes.

Table 1. The identified LEA2 genes and their nomenclatural description.

In this work Hundertmark & Hincha (2008) G. hirsutum G. arboreum G. raimondii V. vinifera B.napus G. max Arabidopsis
LEA2 LEA_2 157 85 89 1 4 5 3

The LEA2s seems to have evolved later among all the LEA genes, in the analysis of the LEA genes in sweet orange, the highest among all the 8 members of the LEA genes were members of the LEA2 (Muniz Pedrosa et al. 2015), this kind of observation was replicated in a number of plants. More than a half of the phylogenetic tree was mainly covered by the cotton LEA2 proteins, with no presence of LEA2s from other plants used in the analysis of the phylogenetic tree. Theobroma cacao, being evolutionary related to cotton, a few members of the LEA proteins clustered with cotton, while majority of the proteins encoding the LEA2 genes from Theobroma cacao clustered together.

The late embryogenesis abundant (LEA2) proteins from A. thaliana were found to cluster with those of cotton LEA2s in group 3 and 6 (3-2 and 6-1) while Glycine max LEA2 proteins were predominantly found in group 6-1 (Figure 1). No ortholog gene pairs were detected between the proteins encoding the cotton LEA2 genes of cotton to any of the plants used. All the ortholog gene pairs occurred between G. hirsutum and G. arboreum, G. hirsutum and G. raimondii and G. arboreum and G. raimondii. Interestingly, even Theobroma cacao, which is evolutionary related to Gossypium species, had their LEA2 proteins clustered together.

Figure 1.

Figure 1

Phylogenetic relationship of LEA2 genes in three cotton species with Arabidopsis, T. cacao and G. max. Neighbor-joining phylogeny of 157 genes for G. hirsutum, 85 genes for G. arboreum, 89 genes for G. raimondii, 9 genes for T. cacao, 5 G. max and 3 Arabidopsis LEA protein sequences, as constructed by MEGA7.0.

The abundance of LEA2s in plants can be explained by either being the last members of the LEA genes to evolve and or due to duplication. Upland cotton is a tetraploid cotton, having emerged through whole genome duplication (WGD) between the two diploid cotton of A and D genomes. The high number of LEA2 genes, have also been observed in Arabidopsis (Hundertmark and Hincha 2008). Therefore, we could infer that LEA2 proteins might have evolved later after species divergence and the presence of ortholog genes in the cotton genome could be due to the whole genome duplication event coupled with chromosome rearrangement. It is generally assumed that ortholog genes have the same biological functions in different species (Tatusov 1997), and duplication makes room for paralogous gene pairs to evolve new functions (Ohno 1970). LEA2 genes could be functionally-oriented ortholog groups consisting of orthologous pair which plays the same biological role in the three different cotton genomes.

Physio-chemical analysis, subcellular localization and amino acid composition of the LEA2 genes in upland cotton

In the analysis of the physio-chemical properties of the LEA2 genes in upland cotton, the proteins encoding the LEA2 genes had varied molecular formulae though with similar elemental composition, carbon (C), hydrogen (H), oxygen (O), nitrogen (N) and sulfur (S) in varying proportions. Molecular weights ranged from 11.5384 to 73.5831 kD, Pl values from 4.63 to 10.35, aliphatic index from 19.78 to 65.4, instability index from 6.91 to 63.52, protein lengths ranged from 100 to 661 bp and the grand average of hydropathy (GRAVY) values ranged from 0.574 to 1.04. The grand average hydropathy (GRAVY) values showed that almost all the LEA2s are hydrophobic proteins, the hydrophobic nature of proteins is integral for their biological functions, allows the proteins to fold spontaneously into complex three-dimensional structures that are significant for biological activity (Gosline et al. 2002). The hydrophobic nature of the proteins enables the removal of nonpolar amino acids from solvent and their burial in the core of the protein, this attribute is common among the aquaporin’s (AQPs), water channel proteins, are highly hydrophobic and known to have a functional role in water and salt stress tolerance in plants (Sreedharan et al. 2013). In the sub cellular localization prediction, 10 different sites were detected, in which majority of the LEA2 proteins were found to be localized within the chloroplast with 73 genes. Further analysis by TargetP and Pprowler, more than 70% of the genes were found to be associated with secretory pathway and chloroplast (Table 2 and Supplementary Table S4). The high number of these genes in chloroplast explains their significant role in drought stress, since chloroplast plays a central role in plant response to stress (Gläßer et al. 2014). The connection between different stress responses and organellar signaling pathways such as reactive oxygen species, emanate from the chloroplast (Kmiecik et al. 2016). Chloroplasts being semi-autonomous organelles provide complex communication channel that allow for effective coordination of gene expression since most plastid localized proteins are nuclear-encoded, thus ensuring an effective functioning of overall cellular metabolism (Pfannschmidt et al. 2009). Numerous and vital cellular processes such as aromatic amino acids, fatty acids and carotenoids biosynthesis and sulfate assimilation pathways are harbored within the chloroplast, in addition to photosynthesis, these cellular processes are known to be key factors in plants response to stress. The chloroplast acts as a sensor to abiotic stress thus initiates different cell functions in response to stress factor, enhancing adaptability of the plant to the environmental stress (Mittler 2006). Higher proportions of LEA2 genes were found to be localized within the cytoplasm, nucleus and mitochondrion, with 24, 20 and 16 genes respectively, which further provided a stronger evidence of the importance of these genes in enhancing drought tolerance ability in cotton. The following cell structures contained low numbers of LEA2 genes, endoplasmic reticulum (E.R) with 3, extracellular structures with 5, Golgi body 6, plasma 4 and vacuole with 3 genes each. The result obtained for the subcellular localization of the LEA2 genes is in agreement to previous findings in which the highest proportions of LEA2 genes were found to be localized within the cytoplasm and chloroplast, accounting for 35.7% and 30.9% of the total LEA2 genes in sweet orange, while others were found to target endoplasmic reticulum (E.R) and mitochondrion (Muniz Pedrosa et al. 2015). Similarly, abiotic stress related gene, plasma membrane protein 3 (PMP3), a member of the small hydrophobic polypeptides with high sequence similarity, and have been functionally characterized to be responsible for salt, drought, cold, and abscisic acid, have been found to be sub localized in the nucleus, cytoplasm, and cell membrane (Fu et al. 2012).

Table 2. Physiochemical properties of LEA2 gene in upland cotton, G. hirsutum, subcellular location prediction and chromosome position.

Gene Id Molecular Formula Atoms Numbers Instability Index Aliphatic Index Gravy Length (Aa) Pl Mw (Aa) Chr No Sub Cellular Localization
Wolfpsort TargetP Prowler
CotAD_ 00275 C2550H4266N832O1061S220 8929 49.24 24.58 0.824 274 10 29834.66 Dt09_chr23 chlo S sp
CotAD_ 00465 C2809H4694N922O1183S186 9794 38.68 27.5 0.704 304 10 33689.28 Dt09_chr23 chlo C sp
CotAD_ 00799 C3119H5215N1021O1297S196 10848 42.14 31.89 0.776 337 9 38982.02 scaffold26.1 golg C sp
CotAD_ 00808 C2114H3538N688O893S149 7382 38.49 25.51 0.698 226 10 26011.22 scaffold26.1 cyto _ sp
CotAD_ 01033 C1868H3118N616O781S132 6515 37.57 27.69 0.749 202 9 22587.14 Dt10_ch20 chlo S sp
CotAD_ 01298 C1996H3326N664O833S142 6961 35.29 27.79 0.754 218 10 24021.4 Dt10_ch20 cyto _ other
CotAD_ 01321 C2138H3550N724O880S189 7481 48.5 25.62 0.855 238 10 26020.28 Dt10_ch20 cyto S sp
CotAD_ 01385 C2253H3753N751O944S189 7890 53.4 22.96 0.757 247 7 27497.03 Dt09_chr23 cyto S sp
CotAD_ 01700 C2382H3972N790O976S223 8343 52.12 25.76 0.914 260 9 28399.83 Dt09_chr23 cyto _ sp
CotAD_ 02652 C2022H3396N646O835S184 7083 63.52 25.47 0.898 212 10 23764.43 Dt09_chr23 mito S sp
CotAD_ 03037 C2465H4132N796O1011S239 8643 54.69 25.19 0.943 262 9 28472.57 Dt05_chr19 cyto S sp
CotAD_ 03649 C2938H4904N970O1220S232 10264 42.02 27.07 0.811 320 10 35345.6 At_chr09 cyto S sp
CotAD_ 03784 C1076H1792N358O453S53 3732 26.71 31.74 0.644 116 7 13537.66 Dt07_chr16 chlo _ other
CotAD_ 05724 C1834H3065N601O771S128 6399 46.85 26.71 0.719 197 10 22442.51 At_chr09 chlo _ sp
CotAD_ 05725 C2229H3732N724O935S169 7789 50.4 25.76 0.755 238 10 27552.78 At_chr09 nucl _ sp
CotAD_ 06037 C1893H3159N625O802S134 6613 45.22 24.24 0.668 205 10 22125.81 Dt13_ch18 chlo _ sp
CotAD_ 07087 C1926H3222N628O819S106 6701 43.9 28.43 0.622 206 10 22853.64 At_chr02 plas _ other
CotAD_ 08181 C1864H3110N616O780S135 6505 43.71 26.87 0.745 202 9 22460.02 Dt09_chr23 cyto S sp
CotAD_ 08350 C1894H3182N604O790S142 6612 49.06 27.91 0.802 198 5 22266.98 scaffold190.1 chlo _ sp
CotAD_ 08837 C2300H3853N745O961S220 8079 55.29 20.73 0.825 245 9 26376.34 scaffold280.1 golg S sp
CotAD_ 09578 C2381H3970N790O977S223 8341 50.46 25.38 0.905 260 9 28406.84 At_chr09 chlo _ sp
CotAD_ 09685 C2306H3847N763O928S220 8064 61.17 29.7 1.024 251 10 27153.8 Dt09_chr23 chlo _ sp
CotAD_ 09732 C2198H3688N706O923S164 7679 47.07 26.14 0.755 232 9 25906.5 Dt09_chr23 chlo C sp
CotAD_ 10376 C2568H4293N841O1038S271 9011 60.05 25.86 1.033 277 10 30152.74 Dt01_chr15 chlo S sp
CotAD_ 11658 C2438H4075N799O1007S165 8484 34.86 31.99 0.823 263 10 29835.19 Dt08_chr24 cyto _ sp
CotAD_ 11875 C1627H2717N535O682S94 5655 33.71 30.96 0.706 175 7 20070.28 scaffold42.1 chlo S sp
CotAD_ 11876 C1942H3245N637O798S180 6802 50.01 25.51 0.904 209 10 23563.32 scaffold42.1 chlo _ other
CotAD_ 11878 C2121H3552N688O886S165 7412 55.97 26.24 0.785 226 10 25841.73 scaffold42.1 chlo S sp
CotAD_ 11879 C1215H2031N397O519S61 4223 41.32 28.35 0.574 129 10 15037.05 scaffold42.1 chlo S sp
CotAD_ 12375 C1765H2948N580O727S157 6177 61.3 25.95 0.879 190 9 21328.78 At_chr09 chlo _ other
CotAD_ 13115 C1791H2994N586O760S122 6253 39.07 24.83 0.659 192 9 20770.35 Dt08_chr24 extr _ sp
CotAD_ 13584 2310H3858N760O957S190 8075 46.59 26.65 0.832 250 10 28048.83 Dt06_chr25 golg S sp
CotAD_ 13827 C3342H5592N1090O1370S299 11693 55.07 27.48 0.922 360 8 40945.87 Dt12_ch26 E.R. _ sp
CotAD_ 14147 C2022H3396N646O838S180 7082 61.99 25.16 0.871 212 10 23855.54 At_chr07 mito S sp
CotAD_ 15892 C2861H4789N931O1209S186 9976 40.47 27.23 0.688 307 8 34741.21 Dt12_ch26 chlo _ sp
CotAD_ 16731 C2370H3954N784O980S202 8290 47.46 26.09 0.845 258 10 28519.44 Dt09_chr23 chlo S sp
CotAD_ 17044 C1387H2309N463O581S100 4840 43.02 26.25 0.725 151 5 16422.87 At_chr07 cyto _ other
CotAD_ 17045 C2199H3654N742O907S185 7687 48.71 26.49 0.838 219 10 23930.18 At_chr07 cyto _ other
CotAD_ 17062 C2047H3416N676O852S170 7161 50.56 25.07 0.802 244 10 27393.16 At_chr07 chlo S sp
CotAD_ 17101 C1958H3277N637O811S177 6860 53.57 24.41 0.86 222 9 25294.09 At_chr06 mito _ sp
CotAD_ 17102 C2435H4063N805O1008S182 8493 41.98 29.02 0.817 209 10 23661.48 At_chr06 nucl _ sp
CotAD_ 17103 C2213H3709N715O930S187 7754 61.46 22.72 0.767 265 7 30299.29 At_chr06 mito _ sp
CotAD_ 17649 C1849H3077N619O759S131 6435 37.22 31.77 0.843 235 9 26726.9 At_chr10 chlo S sp
CotAD_ 18210 C1850H3079N619O757S134 6439 35.57 32.09 0.865 203 10 22501.33 scaffold377.1 cyto _ other
CotAD_ 18233 C1630H2729N529O675S118 5681 40.59 29.98 0.822 203 10 22406.26 scaffold377.1 chlo _ other
CotAD_ 18546 C2571H4299N841O1038S270 9019 58.81 26.34 1.04 173 10 19695.85 Dt09_chr23 chlo _ sp
CotAD_ 18729 C1990H3320N658O828S137 6933 43.32 29.42 0.772 277 10 30227.97 scaffold336.1 chlo S sp
CotAD_ 19078 C1684H2807N559O714S128 5892 45.25 22.26 0.669 216 10 24007.7 At_chr12 nucl S sp
CotAD_ 19107 C2766H4629N901O1165S184 9645 42.16 27.59 0.71 183 9 20031.24 At_chr12 chlo _ other
CotAD_ 19205 C941H1570N310O394S56 3271 35.65 30.19 0.703 297 7 33395.7 At_chr12 chlo _ sp
CotAD_ 19213 C1704H2853N553O707S109 5926 45.8 32.12 0.793 100 10 11538.35 At_chr10 chlo _ sp
CotAD_ 19214 C2114H3541N685O887S125 7352 35.76 30.89 0.719 181 9 20628.72 At_chr10 nucl C other
CotAD_ 19375 C2310H3858N760O958S187 8073 46.32 26.78 0.823 225 9 25956.2 Dt11_ch21 golg S sp
CotAD_ 20020 C1807H3029N583O761S123 6303 36.84 27.19 0.717 250 10 27947.68 At_chr06 mito S sp
CotAD_ 20308 C2201H3658N742O909S184 7694 46.29 26.35 0.83 191 10 21054.44 Dt06_chr25 chlo _ sp
CotAD_ 21731 C2426H4054N796O986S230 8492 60.23 27.71 0.975 244 10 27381.21 Dt05_chr19 nucl S sp
CotAD_ 21924 C1845H3069N619O756S138 6427 38.31 30.96 0.86 262 10 28411.4 Dt11_ch21 nucl S sp
CotAD_ 23646 C2458H4115N799O1036S200 8608 53.5 22.84 0.738 204 10 21921.93 Dt07_chr16 nucl _ other
CotAD_ 24019 C1624H2711N535O680S94 5644 33.71 31.14 0.711 203 10 22391.06 Dt06_chr25 mito S sp
CotAD_ 24497 C1941H3243N637O796S181 6798 50.1 25.83 0.916 263 9 29247.79 Dt10_ch20 chlo S sp
CotAD_ 24499 C2118H3546N688O883S170 7405 56.27 25.95 0.801 175 8 20026.25 scaffold238.1 chlo _ sp
CotAD_ 25271 C2240H3751N727O937S188 7843 48.67 24 0.79 209 10 23559.33 scaffold238.1 nucl S sp
CotAD_ 26038 C1695H2826N562O718S127 5928 45.53 22.86 0.673 226 9 25852.71 scaffold238.1 chlo _ sp
CotAD_ 26981 C1423H2384N460O593S106 4966 51.34 27.73 0.79 274 10 29936.66 At_chr09 chlo C sp
CotAD_ 27453 C2034H3390N676O861S160 7121 44.82 21.96 0.686 239 10 26994.13 scaffold477.1 mito _ sp
CotAD_ 27789 C2367H3951N781O998S201 8298 53.3 21.31 0.731 184 9 20135.39 scaffold699.1 E.R. _ sp
CotAD_ 28249 C2260H3788N730O947S140 7865 33.99 30.49 0.736 150 9 16764.6 At_chr09 nucl _ sp
CotAD_ 28252 C2177H3646N706O916S180 7625 48.77 22.87 0.752 222 9 24982.77 At_chr07 mito S sp
CotAD_ 28872 C1387H2306N466O578S109 4846 48.91 25.43 0.764 257 9 26949.97 Dt03_chr17 nucl _ sp
CotAD_ 29279 C1875H3141N607O784S137 6544 47.67 27.44 0.769 305 10 34588.47 Dt13_ch18 chlo _ other
CotAD_ 31344 C2277H3795N757O932S181 7942 42.47 30.2 0.887 101 6 11711.01 scaffold1346.1 chlo S sp
CotAD_ 31535 C2944H4916N970O1223S231 10284 41.3 27.17 0.809 240 8 27649.86 At_chr05 vacu S sp
CotAD_ 31536 C2047H3416N676O854S171 7164 52.02 24.33 0.789 210 9 23875.63 scaffold1346.1 plas S sp
CotAD_ 31537 C1956H3273N637O809S177 6852 54.13 24.72 0.868 254 10 27558.52 scaffold1841.1 nucl _ sp
CotAD_ 31780 C2649H4422N874O1100S195 9240 40.01 28.67 0.799 310 10 34525.38 Dt08_chr24 chlo _ sp
CotAD_ 31782 C1944H3258N628O812S139 6781 46.14 28.27 0.774 210 8 23638.39 Dt09_chr23 chlo S sp
CotAD_ 31860 C4139H6916N1360O1727S338 14480 44.89 25.26 0.795 206 10 22839.69 scaffold257.1 cyto _ sp
CotAD_ 31906 C1914H3198N628O804S148 6692 47.49 24.6 0.739 232 10 26256.38 scaffold769.1 cyto C sp
CotAD_ 31936 C2627H4393N859O1089S219 9187 47.93 26.37 0.838 152 5 16462.97 Dt01_chr15 mito S sp
CotAD_ 32487 C1940H3238N640O815S167 6800 51.19 21.79 0.753 305 10 33718.76 At_chr11 mito _ sp
CotAD_ 32645 C1845H3066N622O771S148 6452 42.79 24.35 0.753 199 9 22785.41 Dt06_chr25 chlo S sp
CotAD_ 32847 C1752H2928N574O730S100 6084 39.49 32.87 0.745 249 10 27707.74 At_chr09 extr S sp
CotAD_ 33143 C3449H5767N1129O1433S246 12024 46.47 29.55 0.8 305 10 34544.43 Dt02_chr14 chlo S sp
CotAD_ 33144 C1970H3298N640O818S163 6889 54.12 26.18 0.83 240 9 27655.92 Dt05_chr19 chlo _ sp
CotAD_ 34476 C2374H3959N787O982S206 8308 47.91 25.48 0.844 320 10 35579.84 Dt09_chr23 cyto _ sp
CotAD_ 34798 C2925H4884N964O1214S245 10232 51.69 25.78 0.826 222 9 25253.03 Dt06_chr25 nucl S sp
CotAD_ 35069 C2296H3827N763O944S159 7989 48.49 32.19 0.84 209 10 23628.4 Dt06_chr25 chlo S sp
CotAD_ 35091 C2037H3411N661O855S133 7097 42.17 28.83 0.728 288 7 32755.52 Dt06_chr25 extr _ sp
CotAD_ 35514 C1704H2853N553O708S110 5928 46.78 31.58 0.785 206 6 23420.27 Dt05_chr19 mito S sp
CotAD_ 36328 C1970H3298N640O819S162 6889 53.13 26.02 0.821 450 5 49131.5 scaffold821.1 chlo C other
CotAD_ 36446 C1628H2725N529O673S119 5674 44.25 30.17 0.833 231 10 24949.39 Dt08_chr24 chlo _ other
CotAD_ 36583 C2954H4936N970O1224S234 10318 40.36 27.69 0.829 206 9 22761.2 scaffold821.1 chlo _ sp
CotAD_ 37776 C1843H3062N622O768S149 6444 46.57 24.84 0.769 202 9 22357.93 Dt09_chr23 chlo S sp
CotAD_ 37888 C2554H4274N832O1063S219 8942 50.77 24.7 0.823 283 10 31410.18 At_chr08 chlo S sp
CotAD_ 38978 C2819H4711N925O1184S205 9844 42.1 26.22 0.734 210 10 22644.27 Dt08_chr24 nucl S sp
CotAD_ 39064 C969H1623N313O399S81 3385 56.09 27.65 0.874 210 10 23699.74 Dt01_chr15 chlo S sp
CotAD_ 39719 C1971H3300N640O818S160 6889 54.8 26.8 0.83 191 6 20961.07 Dt01_chr15 nucl S sp
CotAD_ 40324 C2364H3954N772O959S228 8277 54.49 27.79 0.995 204 10 21780.76 At_chr07 plas _ sp
CotAD_ 41569 C2875H4808N940O1188S244 10055 50.55 26.76 0.862 208 10 22559.45 At_chr13 chlo _ sp
CotAD_ 41571 C1947H3252N640O803S171 6813 59.76 26.02 0.87 270 10 30627.54 Dt09_chr23 chlo _ sp
CotAD_ 41925 C1928H3226N628O816S110 6708 46.13 29.07 0.656 188 9 21941.4 scaffold1231.1 nucl _ other
CotAD_ 42599 C2794H4661N925O1169S206 9755 43.38 26.65 0.752 373 10 43118.75 scaffold1231.1 cyto _ other
CotAD_ 44357 C2819H4711N925O1183S209 9847 44.41 26 0.743 210 9 23874.6 scaffold1088.1 cyto C sp
CotAD_ 45324 C2259H3786N730O944S141 7860 34.69 31.04 0.754 256 10 28431.93 Dt11_ch21 chlo S sp
CotAD_ 46873 C2117H3529N703O871S205 7425 56.14 23.68 0.894 259 10 28603.52 At_chr09 vacu S sp
CotAD_ 47322 C1862H3106N616O776S139 6499 43.14 27.2 0.773 220 10 24666.72 At_chr03 chlo S sp
CotAD_ 47454 C1973H3304N640O818S176 6911 53.21 24.61 0.854 661 6 73583.12 scaffold1851.1 cysk S sp
CotAD_ 47495 C1754H2923N583O719S178 6157 55.01 23.06 0.922 318 10 35234.15 Dt07_chr16 chlo S sp
CotAD_ 47749 C1922H3208N634O818S131 6713 42.78 23.89 0.636 251 9 27769.63 Dt07_chr16 chlo S sp
CotAD_ 48050 C2571H4320N820O1053S198 8962 50.08 32.15 0.921 217 9 24968.87 Dt10_ch20 mito _ sp
CotAD_ 48069 C2356H3932N778O994S159 8219 43.83 26.42 0.689 181 10 20577.73 Dt10_ch20 extr S sp
CotAD_ 48336 C2036H3400N670O835S177 7118 47.96 27.69 0.9 211 9 23479.93 Dt04_chr22 nucl S sp
CotAD_ 48753 C6218H10441N1993O2614S448 21714 47.81 27.02 0.752 210 9 23676.69 At_chr06 mito _ sp
CotAD_ 48769 C1998H3351N643O829S165 6986 56.41 26.68 0.843 304 10 33675.21 At_chr09 nucl _ sp
CotAD_ 49818 C2811H4698N922O1186S183 9800 36.96 27.39 0.691 317 5 35274.16 scaffold2616.1 cyto S sp
CotAD_ 53045 C2922H4881N961O1224S173 10161 37.17 30.97 0.72 206 8 22650.27 Dt10_ch20 cyto S sp
CotAD_ 53263 C1938H3246N628O811S135 6758 44.09 28.27 0.756 251 10 27168.81 At_chr09 chlo _ other
CotAD_ 53981 C2316H3867N763O933S219 8098 61.7 29.83 1.021 247 7 27715.29 scaffold3326.1 mito _ sp
CotAD_ 54337 C2251H3749N751O943S189 7883 54.62 22.96 0.757 152 5 16453.02 At_chr07 chlo _ sp
CotAD_ 55224 C1390H2312N466O579S109 4856 50.61 25.65 0.768 210 10 23769.83 Dt03_chr17 mito S sp
CotAD_ 56356 C1954H3266N640O822S101 6783 33.97 32.13 0.677 173 10 19737.98 At_chr09 chlo _ other
CotAD_ 56696 C1963H3275N649O822S113 6822 33.29 31.22 0.71 213 10 23750.48 Dt03_chr17 nucl S sp
CotAD_ 58358 C1600H2547N445O483S11 5086 61.19 65.4 209 10 23626.51 Dt12_ch26 chlo S sp
CotAD_ 59405 C1936H3233N637O793S189 6788 54.41 24.72 0.93 320 10 35457.72 Dt05_chr19 chlo _ sp
CotAD_ 60279 C2316H3879N751O968S220 8134 53.86 20.83 0.82 247 9 26619.63 scaffold2414.1 chlo S sp
CotAD_ 60435 C2292H3819N763O938S163 7975 49.59 32.72 0.869 251 10 27952.81 At_chr01 chlo S sp
CotAD_ 60617 C1977H3312N640O820S177 6926 54.15 24.45 0.855 210 10 23780.9 Dt01_chr15 mito S sp
CotAD_ 61173 C1964H3271N655O821S137 6848 38.49 27.72 0.739 215 10 24043 At_chr04 chlo _ other
CotAD_ 61391 C1753H2921N583O718S179 6154 54.1 23.06 0.928 191 6 20884.97 Dt01_chr15 chlo S sp
CotAD_ 62996 C2926H4886N964O1214S245 10235 51.58 25.88 0.828 318 10 35356.25 At_chr01 nucl S sp
CotAD_ 63174 C3526H5909N1141O1460S281 12317 44.13 28.18 0.85 377 10 41228.93 scaffold3177.1 E.R. S sp
CotAD_ 64004 C2020H3371N667O833S157 7048 48.11 29.02 0.845 219 10 23825.02 Dt07_chr16 chlo _ sp
CotAD_ 64120 C2001H3336N664O837S143 6981 33.36 27.19 0.743 218 10 24050.43 At_chr12 chlo S other
CotAD_ 64346 C1963H3284N640O817S168 6872 52.92 24.61 0.818 210 9 23572.5 Dt06_chr25 chlo _ other
CotAD_ 64347 C2142H3567N715O883S231 7538 59.98 19.78 0.901 235 9 26111.93 Dt06_chr25 plas _ sp
CotAD_ 64657 C2431H4064N796O990S225 8506 59.04 27.96 0.961 262 10 28516.58 At_chr11 vacu _ sp
CotAD_ 65119 C1908H3186N628O800S147 6669 42.93 25.08 0.747 206 9 22733.19 Dt08_chr24 golg S sp
CotAD_ 65370 C1019H1668N278O359S3 3327 61.44 49 326 10 36098.18 scaffold3528.1 chlo S sp
CotAD_ 66245 C4148H6934N1360O1732S337 14511 45.38 25.26 0.792 450 5 48836.2 Dt08_chr24 chlo C sp
CotAD_ 66538 C1991H3337N643O823S168 6962 59.16 26.99 0.866 211 10 23424.96 At_chr04 chlo S sp
CotAD_ 66551 C2086H3485N685O872S114 7242 18.66 32.8 0.72 225 9 25226.24 scaffold3976.1 cyto _ sp
CotAD_ 66774 C1993H3326N658O830S137 6944 42.93 29.27 0.768 216 10 24090.84 Dt08_chr24 chlo S sp
CotAD_ 66775 C2066H3445N685O872S139 7207 32.12 26.21 0.682 225 10 25078.29 Dt08_chr24 chlo _ other
CotAD_ 67823 C2035H3392N676O841S191 7135 53.53 23.44 0.861 222 10 23928.26 At_chr08 cyto S sp
CotAD_ 68063 C2031H3396N664O856S167 7114 50.86 22.36 0.733 218 9 23245.72 At_chr03 cyto _ sp
CotAD_ 68189 C1936H3242N628O808S135 6749 44.73 28.91 0.772 206 7 22579.21 At_chr10 chlo S sp
CotAD_ 69737 C1966H3281N649O821S117 6834 32.83 31.38 0.732 213 10 23867.69 scaffold2095.1 chlo S sp
CotAD_ 69738 C1956H3270N640O824S101 6791 32.31 31.82 0.669 210 10 23893.04 scaffold2095.1 chlo S sp
CotAD_ 70003 C1807H3029N583O761S120 6300 6.91 27.71 0.713 191 10 20942.44 At_chr12 cyto _ sp
CotAD_ 70190 C3927H6552N1300O1658S217 13654 30.66 30.05 0.661 430 5 48185.02 scaffold4817.1 cyto _ other
CotAD_ 70192 C1226H2050N400O509S77 4262 34.45 31.91 0.776 130 5 14420.49 scaffold4817.1 nucl C other
CotAD_ 71431 C1743H2916N568O719S152 6098 46.46 26.33 0.874 186 10 20579.98 Dt05_chr19 extr C sp
CotAD_ 72458 C1788H2988N586O760S119 6241 39.96 24.83 0.644 192 10 20613.31 scaffold3083.1 cysk _ sp
CotAD_ 72913 C2901H4845N955O1214S173 10088 38.37 31.06 0.726 315 5 35071.89 scaffold4398.1 cysk _ other
CotAD_ 73966 C2955H4938N970O1228S230 10321 41.06 27.38 0.809 320 10 35484.73 At_chr12 chlo S sp
CotAD_ 74713 C1998H3351N643O829S165 6986 56.41 26.68 0.843 211 9 23479.93 Dt08_chr24 golg S sp
CotAD_ 76129 C1937H3235N637O793S190 6792 54.41 24.72 0.935 209 10 23626.51 At_chr12 chlo _ sp

The cell compartmentalization of stress related genes is fundamental to their functional role (Osman et al. 2009), the presence of the proteins encoding LEA2 genes in the chloroplast, could be responsible for maintaining osmotic balance and suppression of reactive oxygen species (ROS) production in the guard cells (Wang et al. 2016), while those present in the membrane, could be responsible for the protection of the membrane integrity (Guo et al. 2009). In addition, the sub cellular localized proteins encoding LEA2 genes embedded in the channeling or transporter organelles such endoplasmic reticulum, are likely to aid in the process of the ions sequestration (Porcel et al. 2005). Based on various findings, the LEA protein families are known to have a universal structure, with varying proportions of the various amino acids (Hong-bo et al. 2005). In order to verify the LEA2 proteins due to their unique hydrophobic property, we found that the LEA2s are rich in non-polar aliphatic amino acid residues, in which the highest proportion was noted in leucine with 9.2%, Valine with 8.2%, isoleucine (6.3%), alanine (5.9%) and the least was proline (5.7%). The high proportions of the non-polar residues, indicated that the LEA2 proteins are mainly embedded within the membrane, non-polar amino acids are found in the center of water soluble proteins while the polar amino acids are found at the surface (Petukhov et al. 1998). The second in proportions were the polar, non-charged residues such as serine (8.9%), threonine (6.4%), cysteine 1.9%), methionine (2.2%), asparagine (5.0%) and glutamine (3.4%) The high proportions of the polar residues have been found to be predominant among the stress related proteins, such as the heat shock proteins (HSPs) (Wang et al. 2004), therefore the presence of the polar residue, indicated that the LEA2 proteins could be responsible for coating the cellular macromolecules with a cohesive water layer and in turn protect the membrane and the membrane bounds multiprotein complexes from unfolding and aggregation during drought stress condition.

Genomic organization and motif detection of LEA2 proteins in cotton

Analysis of the exon-intron structure of all the 157 LEA2 genes was done using the gene structure displayer (http://gsds.cbi.pku.edu.cn/), a greater percentage of the LEA2 genes and their exons were highly conserved within the group (Supplementary Figure S1). Most of the LEA2 genes were intronless, with 114 genes, accounting for over 73%, of the LEA2s found to be intronless. The existence of introns in a genome is argued to cause enormous burden on the host (Wahl et al. 2009). The burden is because the introns requires a spliceosome, which is among the largest molecular complexes in the cell, comprising of 5 small nuclear RNAs and more than 150 proteins (Wahl et al. 2009). Intron transcription is costly in terms of time and energy (Lane and Martin 2010). Due to various stresses in which the plants are exposed to, the energy demand for survival is relatively high, thus various gene actions within the plant has to function under conserved energy demand threshold (Timperio et al. 2008). A plant under stress condition requires to survive the effects caused by overload of excessive production of reactive oxygen species (ROS), 3,4-Methylenedioxyamphetamine (MDA) and low levels of Peroxidase (PODs) activities, therefore most of the genes responsible for stress tolerance either lack introns or possess significantly reduced number of introns within their gene structure (Jeffares et al. 2008). Being the transcription process of the intron laden genes requires a lot of time and energy, which is hypothesized to cause or results into deleterious effect on gene expression (Calderwood et al. 2003). Conserved motifs in the 157 LEA2 proteins were identified through an online tool MEME (Supplementary Figure S1). The motif lengths identified by MEME (http://meme-suite.org/), were between 14 and 112 amino acids in LEA2 proteins, similar results of conserved motif with lengths between 11 and 164 amino acids were obtained in cotton MYBs protein (He et al. 2016). The homology in motif lengths with that of MYBs provided significant evidence supporting the possible role of the LEA2s in response to water stress which includes the regulation of stomatal movement, the control of suberin and cuticular waxes synthesis and the regulation of flower development (He et al. 2016). Most of the LEA2 proteins had distinctive motifs, which are valuable for their identification, the common motifs identified for the cotton LEA proteins were; motif 1 (FFVLFSVFSLILWGASRPQKPKITMKSIKFENFKIQAGSDFSGVPTDMITMNSTVKMTYRNTATFFGVHVTSTPLDLSYSQJTIASG), motif 2 (WLVFRPKKPKFSLQSVTVYAL), motif 3 (NFQVTVTARNPNKRIG IYYD), motif 5 (TVKNPNFGSFKYDNSTVSVNYRGKVVGEA) and motif 14 (RRRSCCCCCCLWTLJ) (Supplementary Figure S2).

The number of the conserved motifs in each LEA2s varied between 1 and 7. The majority of close members in the phylogenetic tree exhibited common motif compositions, which suggested they have a functional similarity within the same subgroup. The alignment results of the LEA2 proteins showed various segments such as Y-segment, K-segment and S-segments (Supplementary Figure S3), which have been previously described in dehydrins (Hanin et al. 2011). Other unique segments identified were E, R and D segments. The K segment has been found to form an amphipathic α-helix (Monera et al. 1995). The K-segments assumes α-helical structure identical to class A2 amphipathic α-helices mainly found in apolipoproteins, apolipoproteins facilitate the transportation of water-insoluble lipids in plasma, and α-synucleins (Rorat 2006). The conformation of the protein structure in turn leads to functional change (Dyson and Wright 2005). Drought stress alters the protein ambient microenvironment, leading to protein conformational and functional changes (Mahdieh et al. 2008). The amphipathic α-helices have the ability to interact with the dehydrated surfaces of various other proteins and biomembranes (Cornell and Taneva 2006). The binding of dehydrins to the dehydrated surface of other proteins enhances formation of amphipathic α-helices which protects other proteins from further loss of water. The presence of this K segment in LEA2 revealed the significant role played by these proteins in plants during drought stress. It has been suggested that the protective role of the LEA proteins is due to their ability to form α-helices which enables them to interact with other proteins and or biomembranes (Koag 2003). Kovacs et al., (Kovacs et al. 2008), reported the protective activities of two dehydrin proteins isolated from A. thaliana, early response to dehydration 10 (ERD10) and early response to dehydration 14 (ERD14), against thermal inactivation of alcohol dehydrogenase and thermal aggregation of citrate synthase.

Chromosomal location and duplication events of cotton LEA2 genes

A gene’s location on a chromosome plays a significant role in shaping how an organism’s traits vary and evolve (Lazazzera and Hughes 2015). Chromosomes hold thousands of genes, with some situated in the middle of their linear structure and others at either end (Bickmore and Van Steensel 2013). Therefore, for us to understand the gene distribution and mapping positions of the LEA2 genes, the positions of each LEA2 genes were mapped on the A, D and AD cotton chromosome by carrying out homology search against the full-lengths of G. arboreum (A-genome), G. raimondii (D-genome) and G. hirsutum (AD genome) assembly. The LEA2 genes were mapped in all the 26 chromosomes in G. hirsutum, 13 chromosomes in G. arboreum and 12 chromosomes in G.raimondii. In diploid cotton genome, G. arboreum and G. raimondii, the gene distribution pattern was almost identical to the tetraploid cotton gene distribution (Supplementary Figure S4). In chromosome 9 in G. arboreum and its homolog chromosome in G. raimondii, a significant level of gene loss was observed in which only a single gene was contained in chr09 of G. arboreum compared to 10 genes in chr09. But more interestingly, there was total gene loss in chr13 of G. raimondii. The lack of LEA2 genes in chr13 in G. raimondii could only be accounted for due to either gene loss or gene deletion, for most of the LEA genes are found in every chromosome. The occurrences of LEA2 genes on every chromosome indicated that the genes are widely distribution on the entire cotton genome. However, the density of these loci was variable across the 26 chromosomes of upland and 13 chromosomes in A and D diploid cotton. The largest number of genes were located on chromosomes At09 (chr09) and Dt09 (chr23), with 12 and 14 genes respectively, followed by chromosome, Dt08 (chr24) with 10 genes, Dt 06 (chr25) with 9 genes, At07 and At12 with 12 genes each. The lowest loci ranged from 1 to 5 genes, with chromosome At02, At05, At09, Dt02 (chr14) and Dt04 (chr22) had a single gene each (Supplementary Figure S5). A total of 39 genes were not mapped and thus grouped as scaffold. The distribution of the genes on the chromosomes appeared to be uneven.

In general, the central sections of chromosomes were located with less LEA2 genes and relatively high densities of upland cotton LEA2s were observed in the top and bottom sections of most chromosomes. Similar gene loci clustering pattern was also observed in GrMYB genes distribution in which most of the genes were clumped either on the upper or lower regions of the chromosomes (He et al. 2016). A gene’s location on a chromosome plays a significant role in shaping how an organism’s traits vary and evolve (Sexton and Cavalli 2015). It has been found that evolution is less a function of what a physical trait is, but more of where the genes that affect that trait are located in the genome (Sexton and Cavalli 2015). The distribution of this subset of LEA genes across the whole cotton genome provided a significant role played by these genes within the plant.

The main cause of gene expansion in a genome or organism is either due to segmental or tandem duplication (Cannon et al. 2004). Two or more genes located on the same chromosome, one following the other, confirms a tandem duplication event, while gene duplication on different chromosomes is designated as segmental duplication event (Yu et al. 2005). In the present study, cluster formations by the LEA2 genes explained the mechanism behind their expansion in cotton. Most of the duplicated genes were between G. hirsutum and its ancestors, G. arboreum (53) and G. raimondii (11) (Table 3). The tetraploid cotton, G. hirsutum evolved due to whole genome duplication resulting into polyploidy cotton. The Ka/Ks values ranged from 0 to 2.17333, with an average value of 0.4238, which implied that majority of the gene pair had Ka/Ks values of less than 1, which indicated that the LEA2 genes have been influenced extensively by purifying selection during the process of their evolution.

Table 3. Gene duplication, synonymous (Ks), nonsynonymous (Ka) and Ka/Ks values calculated for paralogous LEA2 gene pairs in cotton genome.

Gene type Paralogous gene pairs Length (aa) Ka Ks Ka/Ks Negative/purifying selection P-Value (Fisher)
LEA2 CotAD_59405 CotAD_76129 627 0 0.00654 0 YES 0
LEA2 CotAD_20020 Cotton_A_01845 750 0 0.00568 0 YES 0
LEA2 CotAD_19078 Cotton_A_23172 648 0 0.00672 0 YES 0
LEA2 CotAD_08181 Cotton_A_27543 606 0 0.00697 0 YES 0
LEA2 CotAD_48976 Cotton_A_29779 660 0 0.00642 0 YES 0
LEA2 CotAD_35514 Gorai.010G176400.1 543 0 0.00822 0 YES 0
LEA2 CotAD_31536 Cotton_A_13470 627 0.00211 0.03373 0.06246 YES 0.00360292
LEA2 CotAD_37888 Cotton_A_08663 960 0.04378 0.55839 0.07841 YES 1.73E-37
LEA2 CotAD_03649 CotAD_37888 960 0.04522 0.54142 0.08352 YES 9.32E-36
LEA2 CotAD_03649 Cotton_A_14478 960 0.04592 0.52972 0.08668 YES 3.29E-35
LEA2 CotAD_03649 CotAD_73966 960 0.04597 0.527 0.08723 YES 4.70E-35
LEA2 CotAD_17102 CotAD_31536 627 0.00422 0.03365 0.12547 YES 0.0107355
LEA2 CotAD_44941 Gorai.005G203000.1 720 0.00175 0.01368 0.12779 YES 0.0998325
LEA2 CotAD_08181 CotAD_46550 606 0.00654 0.04975 0.1315 YES 0.00250188
LEA2 CotAD_17101 Cotton_A_13469 666 0.00195 0.01318 0.14805 YES 0.121749
LEA2 CotAD_09578 Cotton_A_02196 780 0.0903 0.59944 0.15064 YES 7.07E-24
LEA2 CotAD_35069 CotAD_62996 954 0.00551 0.03643 0.15116 YES 0.0017334
LEA2 CotAD_59405 Cotton_A_40363 627 0.00636 0.04016 0.15842 YES 0.00848415
LEA2 CotAD_17045 Cotton_A_14354 657 0.00201 0.01262 0.15958 YES 0.13409
LEA2 CotAD_09685 CotAD_53981 753 0.00711 0.04386 0.16211 YES 0.00252472
LEA2 CotAD_01700 Cotton_A_02196 780 0.09992 0.58986 0.16939 YES 8.68E-22
LEA2 CotAD_17062 CotAD_21731 732 0.00719 0.04161 0.17276 YES 0.00506705
LEA2 CotAD_35069 Cotton_A_24356 954 0.00551 0.03178 0.17329 YES 0.00508945
LEA2 CotAD_10376 Cotton_A_05625 831 0.00645 0.03444 0.18723 YES 0.00723285
LEA2 CotAD_21924 Cotton_A_18919 786 0.01028 0.05219 0.19697 YES 0.00026749
LEA2 CotAD_31535 Gorai.006G150200.1 666 0.00391 0.01981 0.19743 YES 0.082505
LEA2 CotAD_25271 Cotton_A_14676 405 0.00647 0.03234 0.20023 YES 0.085476
LEA2 CotAD_09685 Cotton_A_05444 753 0.0089 0.04387 0.20282 YES 0.00516244
LEA2 CotAD_46888 Cotton_A_09596 573 0.00922 0.0453 0.20351 YES 0.0147038
LEA2 CotAD_08181 Gorai.009G305100.1 606 0.00435 0.02103 0.20672 YES 0.090366
LEA2 CotAD_19078 CotAD_66774 648 0.01009 0.04842 0.20844 YES 0.00834864
LEA2 CotAD_32487 Cotton_A_13240 630 0.00425 0.01917 0.22185 YES 0.103356
LEA2 CotAD_23118 CotAD_74061 1215 0.01611 0.06882 0.23405 YES 5.00E-05
LEA2 CotAD_36328 CotAD_64346 630 0.01777 0.07564 0.23489 YES 0.000973496
LEA2 CotAD_32847 CotAD_39064 612 0.01106 0.0461 0.23994 YES 0.0153075
LEA2 CotAD_46873 CotAD_60617 630 0.00835 0.03452 0.24185 YES 0.0372109
LEA2 CotAD_46873 Cotton_A_09615 630 0.00835 0.03452 0.24185 YES 0.0372109
LEA2 CotAD_18546 CotAD_37776 519 0.01016 0.04195 0.24212 YES 0.0375368
LEA2 CotAD_19375 Cotton_A_06435 675 0.01345 0.05541 0.24268 YES 0.00759106
LEA2 CotAD_46888 CotAD_61391 573 0.01387 0.05313 0.26111 YES 0.0175133
LEA2 CotAD_23118 Cotton_A_38117 1215 0.01611 0.06077 0.26514 YES 0.000321992
LEA2 CotAD_19214 Cotton_A_30889 543 0.00237 0.0083 0.28598 YES 0.347253
LEA2 CotAD_31535 Cotton_A_13469 666 0.01377 0.04718 0.2919 YES 0.0234164
LEA2 CotAD_21924 CotAD_64657 786 0.01373 0.04693 0.29247 YES 0.0120925
LEA2 CotAD_31140 Cotton_A_15998 747 0.00174 0.0058 0.30099 YES 0.356655
LEA2 CotAD_30219 Cotton_A_32495 597 0.01105 0.03626 0.30482 YES 0.0618481
LEA2 CotAD_46873 Gorai.001G124400.1 630 0.00208 0.00674 0.30909 YES 0.361889
LEA2 CotAD_46888 Gorai.001G122700.1 573 0.0046 0.0148 0.31039 YES 0.238274
LEA2 CotAD_28252 CotAD_53263 492 0.01356 0.04285 0.31656 YES 0.069282
LEA2 CotAD_14147 Cotton_A_02370 636 0.00416 0.01312 0.3169 YES 0.244174
LEA2 CotAD_23646 Cotton_A_27300 609 0.04249 0.13135 0.32348 YES 0.000630664
LEA2 CotAD_09578 Cotton_A_07036 780 0.00342 0.01037 0.33004 YES 0.256013
LEA2 CotAD_17045 CotAD_64004 657 0.02247 0.06523 0.34445 YES 0.0157104
LEA2 CotAD_37888 CotAD_73966 960 0.01528 0.0442 0.34576 YES 0.0157353
LEA2 CotAD_37888 Cotton_A_14478 960 0.01247 0.03528 0.3534 YES 0.0321315
LEA2 CotAD_23646 Gorai.006G199800.1 609 0.04249 0.11411 0.37237 YES 0.00460089
LEA2 CotAD_17062 Cotton_A_14370 732 0.0099 0.02648 0.37402 YES 0.0618224
LEA2 CotAD_02652 Cotton_A_02370 636 0.01256 0.03311 0.37934 YES 0.101339
LEA2 CotAD_19214 CotAD_35514 543 0.00955 0.02509 0.38065 YES 0.19023
LEA2 CotAD_21731 Cotton_A_14370 732 0.00899 0.02354 0.38192 YES 0.138838
LEA2 CotAD_13584 Cotton_A_01845 750 0.00878 0.02294 0.38264 YES 0.139381
LEA2 CotAD_17101 CotAD_31535 666 0.01576 0.04026 0.3915 YES 0.077316
LEA2 CotAD_35091 CotAD_60435 753 0.03016 0.07689 0.39221 YES 0.0144267
LEA2 CotAD_20308 CotAD_70003 573 0.00915 0.02291 0.39918 YES 0.206152
LEA2 CotAD_50359 CotAD_66538 633 0.01677 0.04094 0.40958 YES 0.0891624
LEA2 CotAD_01700 Cotton_A_07036 780 0.01551 0.03701 0.41916 YES 0.0752732
LEA2 CotAD_02652 CotAD_14147 636 0.00835 0.01974 0.42281 YES 0.226532
LEA2 CotAD_35513 Cotton_A_30890 651 0.02193 0.05102 0.42978 YES 0.0738291
LEA2 CotAD_35514 Cotton_A_30889 543 0.00716 0.01659 0.43135 YES 0.312651
LEA2 CotAD_28872 Gorai.005G203000.1 720 0.01233 0.02762 0.44641 YES 0.170613
LEA2 CotAD_56699 Cotton_A_38534 639 0.02021 0.04493 0.44988 YES 0.106618
LEA2 CotAD_01700 CotAD_09578 780 0.01202 0.02634 0.45642 YES 0.151105
LEA2 CotAD_40972 Cotton_A_29659 591 0.96659 2.0709 0.46675 YES 0.00123143
LEA2 CotAD_40972 CotAD_38978 591 0.96025 2.04193 0.47026 YES 0.00125135
LEA2 CotAD_17101 Gorai.006G150200.1 666 0.01977 0.04018 0.49197 YES 0.209339
LEA2 CotAD_50359 Cotton_A_33548 633 0.01678 0.03388 0.49528 YES 0.175709
LEA2 CotAD_74713 Cotton_A_33548 633 0.01678 0.03388 0.49528 YES 0.175709
LEA2 CotAD_03649 CotAD_31344 960 0.01103 0.02214 0.49798 YES 0.177084
LEA2 CotAD_13584 CotAD_20020 750 0.00878 0.01711 0.51348 YES 0.287642
LEA2 CotAD_13115 Cotton_A_31059 576 0.0207 0.0379 0.54625 YES 0.312514
LEA2 CotAD_19214 Gorai.010G176400.1 543 0.00955 0.01664 0.57418 YES 0.403293
LEA2 CotAD_20308 Cotton_A_17625 573 0.01375 0.02296 0.59881 YES 0.347235
LEA2 CotAD_25271 CotAD_48769 405 0.00647 0.01063 0.6094 YES 0.539117
LEA2 CotAD_12681 Cotton_A_08212 432 0.03121 0.04928 0.63319 YES 0.35887
LEA2 CotAD_19623 CotAD_36999 282 0.03296 0.04771 0.69081 YES 0.631725
LEA2 CotAD_23646 Cotton_A_27282 609 0.02587 0.03738 0.69204 YES 0.542393
LEA2 Cotton_A_13471 CotAD_17103 180.94 2.32397 1.11323 0.77822 YES 837
LEA2 CotAD_53438 CotAD_68189 618 0.02341 0.02898 0.80786 YES 0.519399
LEA2 CotAD_56696 Cotton_A_38535 630 0.01838 0.02269 0.80979 YES 0.670475
LEA2 CotAD_44941 Cotton_A_17986 720 0.01233 0.01369 0.90056 YES 0.874489
LEA2 CotAD_22539 Cotton_A_25195 408 1.23265 1.24112 0.99317 YES 1
LEA2 CotAD_28872 CotAD_44941 720 0.0141 0.01369 1.03042 NO 0.900519
LEA2 CotAD_17103 Cotton_A_13471 837 2.58712 2.32397 1.11323 NO 0.778217
LEA2 CotAD_13827 Cotton_A_18645 1104 2.12092 1.89653 1.11832 NO 0.642563
LEA2 CotAD_10044 Cotton_A_09473 1902 0.00274 0.00228 1.20458 NO 0.731531
LEA2 Cotton_A_31083 CotAD_35069 939 2.2748 1.83858 1.23726 NO 0.447623
LEA2 CotAD_30219 Gorai.006G104100.1 597 0.00884 0.00707 1.25015 NO 0.743557
LEA2 CotAD_03649 Cotton_A_08663 960 0.00549 0.00437 1.25606 NO 0.744588
LEA2 CotAD_11658 Cotton_A_40499 789 0.02309 0.01751 1.3191 NO 0.985982
LEA2 CotAD_12375 CotAD_42408 597 2.42062 1.68288 1.43838 NO 0.288342
LEA2 CotAD_35091 Cotton_A_24371 699 3.50309 1.61186 2.17333 NO 0.036477

Cis element prediction in LEA2 proteins

Transcription factors (TFs) and cis-acting regulatory elements contained in stress-responsive promoter regions function not only as molecular switches for gene expression, but also as terminal points of signal transduction in the signaling processes (Chang et al. 2008). The cis-regulatory promoters are located on the upstream of genes and functions as binding sites for transcription factors (TFs) which play essential functions in determining the tissue-specificity or stress-responsive expression patterns of the genes (Yamaguchi-Shinozaki and Shinozaki 2005). For better understanding of the potential roles of the LEA2 genes, 1000 bp regions upstream of the transcriptional start site were extracted and used in the identification of cis-regulatory promoters and other important motifs. Abiotic stress-related cis-elements were found in the putative promoters of LEA2 genes in upland cotton, G. hirsutum, (Figure 2) and (Supplementary table S5). For instance, MYBCORE, is known to have a functional role in drought and regulation of flavonoid biosynthesis (Solano et al. 1995). ABRELATERD1, ABRE-like sequence and ACGTATERD1 are responsive to dehydration (Simpson et al. 2003). ACGTATERD1 is associated to early responsive to dehydration (Simpson et al. 2003). The presence of the stress promoter elements strongly supported the possible role of upland cotton LEA2 proteins in enhancing drought tolerance in cotton. The high proportion of cis promoter elements in LEA2 proteins, could possibly explain why genes encoding LEA proteins are highly expressed under abiotic stress, as was found in the root tissues of Arabidopsis under drought stress (Dalal et al. 2009; Candat et al. 2014). It is also important to mention that various transcription factors (TFs) and cis-acting regulatory elements contained in stress-responsive promoter regions function not only as molecular switches for gene expression, but also as terminal points of signal transduction in the signaling processes (Yamaguchi-Shinozaki and Shinozaki 2005).

Figure 2.

Figure 2

Average number of the cis-elements in promoter region of upland cotton G. hirsutum LEA2 genes. The cis-elements were analyzed in the 1 kb upstream promoter region of translation start site using the PLACE database.

Prediction of LEA genes targeted by miRNAs

Drought is a recurring climate feature in most parts of the world (Kang et al. 2009). The sessile nature of the plants, has made the plants to developed their own defense systems to cope up with perennial and erratic adverse climatic conditions (Bartwal et al. 2013). One of the defense mechanisms used by the plants toward the effect of drought stress is the reprogramming of gene expression by microRNAs (Ferdous et al. 2015). The small RNAs (miRNAs) are known as the small noncoding RNAs with approximately 22 nucleotides length. The miRNAs are mainly involved in the regulation of genes at post-transcriptional levels in a range of organisms (Grivna et al. 2006). Large groups of small RNAs have been reported as regulators in plant adaptation to abiotic stresses (Xie et al. 2015). To get more information on the LEA2 genes functions, we determined the prediction of miRNAs targets on LEA2 genes by the use of psRNATarget, the same as been applied for other functional genes in cotton (Dai and Zhao 2011). Out of 157 upland cotton LEA2 genes, 63 genes were found to be targeted by 48 miRNAs, representing 40% of all the LEA2 genes (Supplementary Table S6). The highest levels of target was detected for the following genes with more than 6 miRNAs, CotAD_00799 being targeted by ghr-miR2948-5p, ghr-miR7492a, ghr-miR7492b, ghr-miR7492c, ghr-miR7494 and ghr-miR7510b. CotAD_19205 targeted by ghr-miR390a, ghr-miR390b, ghr-miR390c, ghr-miR7492a, ghr-miR7492b and ghr-miR7492c. CotAD_31936 targeted by ghr-miR7492a, ghr-miR7492b, ghr-miR7492c, ghr-miR827a, ghr-miR827b and ghr-miR827c. CotAD_32487 targeted by ghr-miR156a, ghr-miR156b, ghr-miR156d, ghr-miR7507 and ghr-miR7509. CotAD_33143 targeted by ghr-miR2948-5p, ghr-miR482a, ghr-miR7492a, ghr-miR7492b, ghr-miR7492c and ghr-miR7510b. CotAD_41925 targeted ghr-miR396a, ghr-miR396b, ghr-miR7492a, ghr-miR7492b, ghr-miR7492c, ghr-miR827a, ghr-miR827b and ghr-miR827c. The rest of the genes were either targeted by 1 or 5 miRNAs. The high number of miRNAs targeting LEA2 genes could possibly have direct or indirect correlation to their stress tolerance levels to abiotic stress more so drought. Some specific miRNAs had high level of target to various genes such as ghr-miR164 (4 genes), ghr-miR2949a-3p (4 genes), ghr-miR2950 (8 genes), ghr-miR7492a (10 genes), ghr-miR7492b (10 genes), ghr-miR7492c (10 genes), ghr-miR7504a (5 genes), ghr-miR7507 (5 genes), ghr-miR7510a (6 genes), ghr-miR7510b (10 genes), ghr-miR827b (4 genes) and lastly ghr-miR827c (4 genes). It has been found that miRNAs might be playing a role in response to drought and salinity stresses through targeting a series of stress-related genes.

The plant specific transcriptome factors such as NAC gene family have been found to have varied functional roles in plant growth and development (Pereira-Santana et al. 2015), myeloblastosis (MYB) is highly correlated to various stress factors (Ambawat et al. 2013). The detection of some the LEA2 genes being targeted by specific miRNA linked to mitogen-activated protein kinase (MAPK), N-acetyl-L-cysteine (NAC) and myeloblastosis (MYB) provided a stronger indication of the significance contributions of the LEA2s in enhancing drought tolerance in plants. The micro/small RNAs mediated post-transcriptional processes have been linked to response to water deficit condition. Plant miRNAs are involved in multi-complex and arrays of processes, including but not limited to response to stress, nutrient limitation, development, pattern formation, flowering time, hormone regulation, and even self-regulation of the miRNA biogenesis pathway (Yamaguchi-Shinozaki and Shinozaki 2005). It is important to note that most of the miRNA target genes encode transcription factors, which place miRNAs at the focal point of gene regulatory networks. Moreover, the availability of genome-wide characterization of cotton miRNA genes enabled us to perform the prediction of the miRNA targets involved in drought response.

Expression Patterns of LEA2 Genes in Different Tissues of Upland cotton as determined Through RNA sequence

Analysis of the RNA expression profile provides an indicator of the functional role of the genes in the plant. We therefore carried the RNA expression analysis (RPKM > 1) in various tissues of the cotton plant, out of the entire 157 LEA2 genes in upland cotton, G. hirsutum, 117 (75%) of all the LEA2 genes showed differential expression in various tissues, such as the leaves, roots, stem, petal, pistil, stamen, torus and calycle (Figure 3). Based on their expression profiling, the genes were clustered into three broad groups. Group 1 members with 29 genes were highly up regulated under drought and salt conditions. Under salt and drought stress, CotAD_33321, CotAD_41571, CotAD_11876, CotAD_24498 and CotAD_59405 showed the highest expression levels, Similarly CotAD_11876, CotAD_24498 and CotAD_59405 were equally significantly up regulated in all the tissues tested. A total of 23 genes were highly up regulated in 5 tissues, which provided a strong evidence of the functional role of the LEA2 genes in enhancing stress tolerance in plants. Majority of the analyzed genes, showed relatively lower expression levels in the root tissues, but CotAD_11876, CotAD_59405 and CotAD_24498 exhibited significant higher expression levels, with expression values of more than 2. A unique observation was made, among the moderately up regulated genes in the roots, the genes exhibited significant up regulation in the calyx. The up regulation of these genes in the reproductive tissues could be an indication of their functional role in the fiber development process.

Figure 3.

Figure 3

Expression profile analysis of LEA2 genes in 5 upland cotton tissues. The LEA2 genes expressed (RPKM > 1) in leaf, stem, root, calyx and petal were represented according to their tissue specificity: (A): LEA2 genes RNA seq. expression profile under drought and salt stress. (B): LEA2 expression in the 8 different tissues and (C): Venn diagram quantification and common genes expressed among the 5 tissues.

In the validation of the expression profile of the LEA2 genes under drought stress condition, CotAD_24498, CotAD_21924, CotAD_20020 and CotAD_59405 were highly up regulated in root, stem and roots tissues under drought stress condition. However, the expression levels were much higher in G. tomentosum as opposed to G. hirsutum, suggesting that, these genes could be the key genes.

qRT-PCR Expression profiling of the LEA2 genes in leaf, stem and roots of upland cotton

Based on the results obtained from the RNA sequence data, 48 genes were selected for qRT-PCR validation. Two cotton genotypes were used, G hirsutum an elite cultivar, majorly grown around the world; it covers more than 90% the cotton growing regions in China but susceptible to drought stress condition. The second plant used was the G. tomentosum, wild cotton, native to the Hawaiian island, it is known for its high ability to tolerate salinity and drought stress conditions. The two cotton plants were grown in the greenhouse, and at three leaf stage, were exposed to drought for a period of 14 days. The roots stem and leaves were obtained for RNA extraction and qRT-PCR analysis. In the analysis of qRT-PCR profiling of various tissues, the results indicated high variability in transcript abundance of LEA2 genes in upland cotton (Figure 4). In G. tomentosum and G. hirsutum, majority of these genes showed relatively high expression in the root and leaf, except in stem. Leaves and roots are the main plant organs affected by drought stress (Alexandersson et al. 2005). The plant leaf is the site for photosynthesis; drought stress might possibly be the cause of excess release of reactive oxygen species (ROS). ROS are toxic to the plants, the genes with high expression in the leaves, could perhaps be involved in the ubiquitin of the ROS, thus preventing the damage and maintain the normal functions of the photosynthetic cells. The high osmotic potential generated in the cytoplasm of guard cells during stomatal opening could probably lead to accumulation of LEA2s in leaf tissue. Increased osmotic potential within the guard cells necessitates mass flow of water into the guard cells, leading to its turgidity and thus opening of the stomatal pore, but during drought stress, the osmotic potential is never offset, and thus dehydration stress on the nucleus. The LEA2s increased accumulation within the leaf tissues, could be due to maintaining structural integrity and preventing the membranes from dehydration stress. The finding is consistent to proposed functions of the LEA genes, which is the protective role during abiotic stresses (Nylander et al. 2001). The roots are the connection point between the water reservoir and the plants. High up regulation of LEA2 genes in the roots indicated that these genes could be involved in the water balance in the roots. Increased or high up regulation of LEA2s in the roots, further augment the primary role of LEA genes in plants, the protective function, roots are the very first plant organs to be affected by drought stress.

Figure 4.

Figure 4

Venn den diagram of differential expressions of LEA2 genes in different plants tissues. A. tissues of G. hirsutum and B. tissues of G. tomentosum.

Expression profiles of LEA2 genes Under drought treatment in G. hirsutum and G. tomentosum

Gene expression profile provides vital information of the roles played by the genes in plants (Movahedi et al. 2012). In order to determine the expression pattern of the LEA2 genes in tolerant and non-tolerant upland cotton genotypes, we carried the qRT-PCR validation of 48 LEA2 genes in leaves, roots and stem tissues. The 48 genes were selected based on the RNA sequence expression profile, 24 genes were up regulated while the other half were down regulated. The samples for qRT-PCR were collected at 0, 7 and 14th day of stress exposure, in which 0 day (control) was used as the reference point. More genes were up regulated in all the tissues of the drought tolerant genotype, G. tomentosum as compared to the drought sensitive genotype, G. hirsutum (Figure 5). The result obtained denotes that the drought resistant genotype have the potential to mobilize more drought related genes, when exposed to drought tolerance as opposed to the less tolerant genotypes, thus the higher expression levels, similar results were obtained in the expression for cold tolerance genes in Arabidopsis with varying tolerance levels, more genes were up regulated in the cold tolerant and in the cold susceptible genotype (Hannah et al. 2006).

Figure 5.

Figure 5

Differential expression of upland cotton LEA2 genes under drought stress. The heat map was visualized using Mev.exe program. (Showed by log2 values) under control and in treated samples for 7 and 14 days after drought treatment (i) G. tomentosum and (ii) G. hirsutum. Red–up regulated, green-down regulated and black–no expression. Red box indicate the cloned gene.

The up regulation of LEA2 genes under drought stress, could possibly explain their protective role in plants tissues under dehydration stress. For instance, HVA1, a LEA gene from barley (Hordeum vulgare L) was found to confer drought stress in transgenic rice (Babu et al. 2004). Interestingly, some phylogenetic LEA2 gene pairs, orthologous genes were found to have differential expression pattern in either of the cotton genotypes (Figure 6), for instance, CotAD_71431 and CotAD_51205 exhibited varied expression pattern under drought and salt stress conditions as evident in the RNA expression analysis. The result suggests that even if these genes are cladded together; they could have developed different biological function over time. Orthologous genes are members of the genes with a common evolutionary origin and share greater percentage of sequence similarity (Nehrt et al. 2011). According to the expression pattern of LEA2 genes in different tissues, it would be interesting to functionally characterize these genes in upland cotton, G. hirsutum. Majority of the LEA2 genes showed higher expression level in leaf and root tissues, which indicated the functional conservation of the gene sub family. The variation in expression between G. hirsutum and G. tomentosum could be due to broad changes in environmental conditions, G. tomentosum exhibits divergence signals that are associated with directionally selected traits and are functionally related to stress responses. These results suggest that stress adaptation in G. tomentosum might have involved the evolution of protein-coding sequences and thus these genes can be introgressed in to elite upland cotton, in order to boost their performance in the current face of declining fresh water and precipitation.

Figure 6.

Figure 6

Quantitative PCR analysis of the selected LEA2 genes. Abbreviations: 7d-7 days and 14d-14 days of stress. GhG. hirsutum and GtG. tomentosum. Y-axis: relative expression (2-ΔΔCT. The enclosure indicated the cloned gene.

qRT-PCR Analysis of the Transformed Gene in Upland Cotton Tissues

Based on the expression analysis of the LEA2 genes in the various tissues of G. tomentosum (drought susceptible) and G. hirsutum (drought susceptible). We identified a single gene with significant expression in the various tissues and transformed the gene into the model plant, A. thaliana (Colombia ecotype-0). The gene CotAD_24498 was analyzed in various tissues of the upland cotton, G. hirsutum. This was carried out in order to determine its relative abundance within the plant. We found that the gene was more abundantly expressed in the reproductive tissues, more specifically in the petal and stamen (Figure 7A). In addition, we further carried out treatment on cotton seedlings after three true leaves stage under drought stress (PEG6000_15%) the samples for RNA extraction and qRT-PCR analysis were obtained from leaf, root and leaves at intervals of 0 h, 3 hr, 6 hr, 12 hr and 24 hr of post stress treatment. In all the three tissues, 6 hr marked the peak up-regulation of the gene, and then a gradual decline was observed with increase in time of stress exposure. The gene exhibited a significant up regulation in the root as compare to leaf and stem tissues (Figure 7B). We successfully transformed 9 lines with overexpressed gene CotAD_24498 (Figure 7C), out the nine (9) lines, three (3) lines showed the highest level of overexpression and were further used in the investigation of the potential of the gene in the transgenic lines under drought stress conditions (Figure 7D).

Figure 7.

Figure 7

The qRT-PCR analysis of the expression of the cloned gene CotAD_24498 (A) Total RNA isolated from various tissue of cotton plant under normal conditions; (B) Total RNA extracted from drought-stressed cotton seedlings; (C) Polymerase chain reaction (PCR) analysis performed to check 630bp coding sequence (CDS) integration in transformed T1 generation, number 1–10 transgenic lines, 11 positive control (pWM101- CotAD_24498 and 12 is the negative control (wild-type, WT). (D) The transcripts expression levels of the CotAD_24498 of T2 transgenic lines analyzed through qRT-PCR.

Overexpression of CotAD_24498 in plants promote root growth and confers tolerance to drought stress tolerance

Increased primary root growth and overall plant fresh biomass are indicators of tolerance to various abiotic stresses in which plants are exposed to (Verslues et al. 2006; Jisha et al. 2013). We sought to investigate the response of the transgenic lines and the wilt type to drought stress condition in relation to primary root length elongation and fresh biomass accumulation. The transgenic lines showed enhanced performance with relatively increased primary root growth and with higher fresh biomass increment compared to the wild type under drought stress condition. The drought stress was imposed by exposing the transgenic lines to different concentrations of mannitol 0 mM, 100 mM, 200 mM and 300 mM for a period of six (6) days. Under osmotic stress, highest level of root length assays and fresh biomass accumulations was observed at 100 mM of mannitol concentration (Figure 8B). The transgenic lines had significantly higher primary root length and fresh biomass accumulation (Figure 8C), an indication that the photosynthetic processes were not impaired by the drought stress as compared to the wilt type.

Figure 8.

Figure 8

Overexpression of CotAD_24498 enhances root growth and drought stress tolerance in Arabidopsis transgenic lines (A) CotAD_24498 overexpressing and WT plants were grown vertically in 0.5 Murashige and Skoog (MS) medium supplemented with 0, 100, 200 and 300 mM mannitol and incubated for 6 days. (B). Root elongation comparisons on 0.5 MS put at normal and osmotic stress for 6 days. The seedlings were scored and photographed after 6 days post germination. (C). Quantitative determination of fresh weight biomass of wild-type (WT) and both transgenic lines (L2, L3 and L3) after 6 days post germination at normal and drought stress condition. In (B, C,), each experiment was repeated three times. Bar indicates standard error (SE). Different letters indicate significant differences between wild-type and transgenic lines (ANOVA; P < 0.05). CK: normal conditions.

Transcripts Investigation of Drought Stress-Responsive Genes

The root appears to be the most relevant organ for breeding drought stress tolerance (Henry 2013). Underlying the ABA-mediated stress responses is the transcriptional regulation of stress-responsive gene expression (Giraudat et al. 1994). Numerous genes have been reported that are up-regulated under stress conditions in vegetative tissues, these include a class of genes known as LEA genes, which are expressed abundantly in developing seed under normal conditions, osmolyte biosynthetic genes, and genes of general cellular metabolism. We undertook to check the expression of two known abiotic stress responsive genes on the transgenic lines (L2, L3 and L4) and the wild types when the plants are exposed to drought condition. The result showed that the stress responsive genes were highly up-regulated in the transgenic lines as opposed to the wild type (Figure 9). The result obtained was in agreement to the result obtained when the various LEA2 genes were analyzed through qRT-PCR on the tissues obtained from two upland cotton genotypes. More genes were found to be up regulated on the various tissues of the more tolerant genotype as opposed to the less tolerant. Constitutive expression of RD29A and ABF4 demonstrated enhanced drought tolerance in the transgenic Arabidopsis plants.

Figure 9.

Figure 9

Expression levels of drought stress-responsive genes (ABF4 and RD29A) in transgenic lines and wild-type. Arabidopsis ACTIN2 was used as the reference gene mean values with ± SD. * P < 0.05 as calculated by Student’s t-test.

Oxidants and antioxidant determination in the transgenic lines

In order to understand the role of the transformed LEA2 genes in the transgenic lines in relation to drought stress. We carried out the analysis of the various oxidants and antioxidants measurements in the leaves of the transgenic lines and the wild type. The levels of oxidants were significantly reduced in the transgenic lines compared to the wild type (Figure 10A-B). When plants are exposed to drought the level of ROS increases, which results into oxidative stress. MDA concentration provides a measure on the damage caused on the membrane lipids due to oxidative stress (Jain et al. 2001). The significant reduction in MDA and H2O2 in the leaf tissues of the transgenic lines showed that the transformed gene had a regulatory role in controlling various biological pathways geared toward detoxification of the reactive oxygen species in the cells. In addition, we quantified the levels of various antioxidants, SOD, POD and CAT. In all the three antioxidants, there was significant increased levels in the transgenic lines (L1, L2 and L3) compared to the wild type (Figure 10 C-D). The increased levels of the antioxidants showed that the transgenic lines had a higher ability to tolerant drought stress compared to the wild types. The results obtained in this research, correlates to previous findings, in which drought stressed wheat plants were found to have higher accumulation of oxidants levels (Luna et al. 2005). More tolerant plants genotypes have ability to induct more of the antioxidants such as the CAT, POD and SOD in order to scavenge on the excess ROS and other deleterious molecules released by the cells due to stress condition (Bian and Jiang 2009).

Figure 10.

Figure 10

determination of the oxidants and antioxidants in the transgenic lines under stress condition (A) Determination of hydrogen peroxide (H2O2) accumulation in leaves of wild-type (WT) and both transgenic lines (L2, L3, and L4) after 8-days drought stress (B) Determination of MDA accumulation in leaves of wild-type (WT) and both transgenic lines (L2, L3, and L4) after 8-days drought stress; (C) Catalase (CAT) activity, (D) peroxidase (POD) activity and (E) superoxide dismutase (SOD) activity. Data are means ± SE calculated from three replicates. Different letters indicate a significant difference between the WT and both transgenic lines (ANOVA; P < 0.05).

Conclusions

In this study, the identification, phylogenetic relationships, miRNA targets, cis promoter analysis, GO functional annotation and exon/intron structures of LEA2 genes family members were evaluated in upland cotton, Gossypium hirsutum, and the tissue expression pattern of the two tetraploid cotton species, G. hirsutum (drought sensitive) and G. tomentosum (drought tolerant) were detected under drought stress. The abundance of LEA2 genes and unique gene structure reported in this work provide a solid foundation for future research to understand the evolution of LEA2 gene family and the potential functional role of the 157 LEA2 genes in plants under drought stress condition. Since the discovery of LEA genes, little work has been reported on LEA genes as a whole in upland cotton. The transformation and expression analysis of the transformed LEA2 gene indicated that the LEA2 genes have a profound role in enhancing drought stress tolerance. The transgenic lines L2, L3 and L4 exhibited superior performance compared to the wild type. The roots were significantly longer than the wild type under drought stress condition; similarly, the levels of oxidants in the levels were significantly reduced while the antioxidants levels were higher in the leaves of the transgenic lines compared to the wild type. An indication that the transgenic plants had a higher capacity to regulate the oxidative stress as opposed to the wild type (WT). The genes could be promoting growth of the root cells under limited water condition. Primary root growth is linked to drought stress tolerance; due to increased surface area of the roots thus improving its ability maximally absorb any little moisture available. Deep or extensive root growth is a trait known for most of the xerophytic plants (Brunner et al. 2015).

Acknowledgments

This research was financially supported by the National Natural Science Foundation of China (31671745, 31530053) and the National Key Research and Development plan (2016YFD0100306)

ROM and WK designed the experiment, ROM, PL and JNK, implemented and collected the data. ROM analyzed the results and prepared the manuscript. JNK, PL, QD, FL, WXX, CX, ZZ, YH and WK revised the manuscript. All authors revised and approved the final manuscript.

Footnotes

Supplemental material available at Figshare: https://doi.org/10.25387/g3.6626849.

Communicating editor: K. McKim

Literature Cited

  1. Alexandersson E., Fraysse L., Sjövall-Larsen S., Gustavsson S., Fellert M., et al. , 2005.  Whole gene family expression and drought stress regulation of aquaporins. Plant Mol. Biol. 59: 469–484. 10.1007/s11103-005-0352-1 [DOI] [PubMed] [Google Scholar]
  2. Ambawat S., Sharma P., Yadav N. R., Yadav R. C., 2013.  MYB transcription factor genes as regulators for plant responses: An overview. Physiol. Mol. Biol. Plants 19: 307–321. 10.1007/s12298-013-0179-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Avramova Z., 2015.  Transcriptional “memory” of a stress: Transient chromatin and memory (epigenetic) marks at stress-response genes. Plant J. 83: 149–159. 10.1111/tpj.12832 [DOI] [PubMed] [Google Scholar]
  4. Babu R. C., Zhang J., Blum A., Ho T. H. D., Wu R., et al. , 2004.  HVA1, a LEA gene from barley confers dehydration tolerance in transgenic rice (Oryza sativa L.) via cell membrane protection. Plant Sci. 166: 855–862. 10.1016/j.plantsci.2003.11.023 [DOI] [Google Scholar]
  5. Bartosz G., 2005.  Superoxide Dismutases and Catalase. Oxid. Antioxid. Def. Syst. 2: 109–149. [Google Scholar]
  6. Bartwal A., Mall R., Lohani P., Guru S. K., Arora S., 2013.  Role of Secondary Metabolites and Brassinosteroids in Plant Defense Against Environmental Stresses. J. Plant Growth Regul. 32: 216–232. 10.1007/s00344-012-9272-x [DOI] [Google Scholar]
  7. Battaglia M., Covarrubias A. A., 2013.  Late Embryogenesis Abundant (LEA) proteins in legumes. Front. Plant Sci. 4: 190 10.3389/fpls.2013.00190 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Battaglia M., Olvera-Carrillo Y., Garciarrubio A., Campos F., Covarrubias A. A., 2008.  The Enigmatic LEA Proteins and Other Hydrophilins. Plant Physiol. 148: 6–24. 10.1104/pp.108.120725 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bian S., Jiang Y., 2009.  Reactive oxygen species, antioxidant enzyme activities and gene expression patterns in leaves and roots of Kentucky bluegrass in response to drought stress and recovery. Sci. Hortic. (Amsterdam) 120: 264–270. 10.1016/j.scienta.2008.10.014 [DOI] [Google Scholar]
  10. Bickmore W. A., Van Steensel B., 2013.  Genome architecture: Domain organization of interphase chromosomes. Cell 152: 1270–1284. 10.1016/j.cell.2013.02.001 [DOI] [PubMed] [Google Scholar]
  11. Bodén M., Hawkins J., 2005.  Prediction of subcellular localization using sequence-biased recurrent networks. Bioinformatics 21: 2279–2286. 10.1093/bioinformatics/bti372 [DOI] [PubMed] [Google Scholar]
  12. Brunner I., Herzog C., Dawes M. A., Arend M., Sperisen C., 2015.  How tree roots respond to drought. Front. Plant Sci. 6: 547 10.3389/fpls.2015.00547 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Buckley T. N., 2015.  The contributions of apoplastic, symplastic and gas phase pathways for water transport outside the bundle sheath in leaves. Plant Cell Environ. 38: 7–22. 10.1111/pce.12372 [DOI] [PubMed] [Google Scholar]
  14. Calderwood M. S., Gannoun-Zaki L., Wellems T. E., Deitsch K. W., 2003.  Plasmodium falciparum var Genes Are Regulated by Two Regions with Separate Promoters, One Upstream of the Coding Region and a Second within the Intron. J. Biol. Chem. 278: 34125–34132. 10.1074/jbc.M213065200 [DOI] [PubMed] [Google Scholar]
  15. Candat A., Paszkiewicz G., Neveu M., Gautier R., Logan D. C., et al. , 2014.  The Ubiquitous Distribution of Late Embryogenesis Abundant Proteins across Cell Compartments in Arabidopsis Offers Tailored Protection against Abiotic Stress. Plant Cell 26: 3148–3166. 10.1105/tpc.114.127316 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Cannon S. B., Mitra A., Baumgarten A., Young N. D., May G., 2004.  The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol. 4: 10 10.1186/1471-2229-4-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Cao J., Li X., 2014.  Identification and phylogenetic analysis of late embryogenesis abundant proteins family in tomato (Solanum lycopersicum). Planta 241: 757–772. 10.1007/s00425-014-2215-y [DOI] [PubMed] [Google Scholar]
  18. Celik Altunoglu Y., Baloglu M. C., Baloglu P., Yer E. N., Kara S., 2017.  Genome-wide identification and comparative expression analysis of LEA genes in watermelon and melon genomes. Physiol. Mol. Biol. Plants 23: 5–21. 10.1007/s12298-016-0405-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Chang W. C., Lee T. Y., Huang H. D., Huang H. Y., Pan R. L., 2008.  PlantPAN: Plant promoter analysis navigator, for identifying combinatorial cis-regulatory elements with distance constraint in plant gene groups. BMC Genomics 9: 561 10.1186/1471-2164-9-561 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Charfeddine S., Saïdi M. N., Charfeddine M., Gargouri-Bouzid R., 2015.  Genome-wide identification and expression profiling of the late embryogenesis abundant genes in potato with emphasis on dehydrins. Mol. Biol. Rep. 42: 1163–1174. 10.1007/s11033-015-3853-2 [DOI] [PubMed] [Google Scholar]
  21. Clough S. J., Bent A. F., 1998.  Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 16: 735–743. 10.1046/j.1365-313x.1998.00343.x [DOI] [PubMed] [Google Scholar]
  22. Cornell R. B., Taneva S. G., 2006.  Amphipathic helices as mediators of the membrane interaction of amphitropic proteins, and as modulators of bilayer physical properties. Curr. Protein Pept. Sci. 7: 539–552. 10.2174/138920306779025675 [DOI] [PubMed] [Google Scholar]
  23. Dai X., Zhao P. X., 2011.  PsRNATarget: A plant small RNA target analysis server. Nucleic Acids Res. 39: W155–W159. 10.1093/nar/gkr319 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Dalal M., Tayal D., Chinnusamy V., Bansal K. C., 2009.  Abiotic stress and ABA-inducible Group 4 LEA from Brassica napus plays a key role in salt and drought tolerance. J. Biotechnol. 139: 137–145. 10.1016/j.jbiotec.2008.09.014 [DOI] [PubMed] [Google Scholar]
  25. Denekamp N. Y., Reinhardt R., Kube M., Lubzens E., 2010.  Late Embryogenesis Abundant (LEA) Proteins in Nondesiccated, Encysted, and Diapausing Embryos of Rotifers1. Biol. Reprod. 82: 714–724. 10.1095/biolreprod.109.081091 [DOI] [PubMed] [Google Scholar]
  26. Du D., Zhang Q., Cheng T., Pan H., Yang W., et al. , 2013.  Genome-wide identification and analysis of late embryogenesis abundant (LEA) genes in Prunus mume. Mol. Biol. Rep. 40: 1937–1946. 10.1007/s11033-012-2250-3 [DOI] [PubMed] [Google Scholar]
  27. Dure L., III, Pyle J. B., Chlan C. A., Baker J. C., Galau G. A., 1983.  Developmental biochemistry of cottonseed embryogenesis and germination : XVII. Developmental expression of genes for the principal storage proteins. Plant Mol. Biol. 2: 199–206. 10.1007/BF01578379 [DOI] [PubMed] [Google Scholar]
  28. Dyson H. J., Wright P. E., 2005.  Intrinsically unstructured proteins and their functions. Nat. Rev. Mol. Cell Biol. 6: 197–208. 10.1038/nrm1589 [DOI] [PubMed] [Google Scholar]
  29. Emanuelsson O., Brunak S., von Heijne G., Nielsen H., 2007.  Locating proteins in the cell using TargetP, SignalP and related tools. Nat. Protoc. 2: 953–971. 10.1038/nprot.2007.131 [DOI] [PubMed] [Google Scholar]
  30. Espelund M., Sæbøe‐Larssen S., Hughes D. W., Galau G. A., Larsen F., et al. , 1992.  Late embryogenesis‐abundant genes encoding proteins with different numbers of hydrophilic repeats are regulated differentially by abscisic acid and osmotic stress. Plant J. 2: 241–252. [PubMed] [Google Scholar]
  31. Ferdous J., Hussain S. S., Shi B. J., 2015.  Role of microRNAs in plant drought tolerance. Plant Biotechnol. J. 13: 293–305. 10.1111/pbi.12318 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Finn R. D., Clements J., Eddy S. R., 2011.  HMMER web server: Interactive sequence similarity searching. Nucleic Acids Res. 39: W29–W37. 10.1093/nar/gkr367 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Fu J., Zhang D. F., Liu Y. H., Ying S., Shi Y. S., et al. , 2012.  Isolation and characterization of maize PMP3 genes involved in salt stress tolerance. PLoS One 7: e31101 10.1371/journal.pone.0031101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Furuki T., Shimizu T., Kikawada T., Okuda T., Sakurai M., 2011.  Salt effects on the structural and thermodynamic properties of a group 3 LEA protein model peptide. Biochemistry 50: 7093–7103. 10.1021/bi200719s [DOI] [PubMed] [Google Scholar]
  35. Gill S. S., Anjum N. A., Gill R., Tuteja N., 2016, pp. 1–12 in Abiotic stress signaling in plants-an overview, in Abiotic Stress Response in Plants. Wiley VCH Verlag GmbH & Co., Weinheim, Germany. [Google Scholar]
  36. Giraudat J., Parcy F., Bertauche N., Gosti F., Leung J., et al. , 1994.  Current Advances in Abscisic-Acid Action and Signaling. Plant Mol. Biol. 26: 1557–1577. 10.1007/BF00016490 [DOI] [PubMed] [Google Scholar]
  37. Gläßer C., Haberer G., Finkemeier I., Pfannschmidt T., Kleine T., et al. , 2014.  Meta-analysis of retrograde signaling in Arabidopsis thaliana reveals a core module of genes embedded in complex cellular signaling networks. Mol. Plant 7: 1167–1190. 10.1093/mp/ssu042 [DOI] [PubMed] [Google Scholar]
  38. Gosline J., Lillie M., Carrington E., Guerette P., Ortlepp C., et al. , 2002.  Elastic proteins: biological roles and mechanical properties. Philos. Trans. R. Soc. Lond. B Biol. Sci. 357: 121–132. 10.1098/rstb.2001.1022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Gregory T. R., 2008.  Understanding Evolutionary Trees. Understanding Evolutionary Trees. Evol. Educ. Outreach 1: 121–137. 10.1007/s12052-008-0035-x [DOI] [Google Scholar]
  40. Grivna S. T., Beyret E., Wang Z., Lin H., 2006.  A novel class of small RNAs in mouse spermatogenic cells. Genes Dev. 20: 1709–1714. 10.1101/gad.1434406 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Guo P., Baum M., Grando S., Ceccarelli S., Bai G., et al. , 2009.  Differentially expressed genes between drought-tolerant and drought-sensitive barley genotypes in response to drought stress during the reproductive stage. 60: 3531–3544. [DOI] [PMC free article] [PubMed]
  42. Hand S. C., Menze M. A., Toner M., Boswell L., Moore D., 2011.  LEA Proteins During Water Stress: Not Just for Plants Anymore. Annu. Rev. Physiol. 73: 115–134. 10.1146/annurev-physiol-012110-142203 [DOI] [PubMed] [Google Scholar]
  43. Hanin M., Brini F., Ebel C., Toda Y., Takeda S., et al. , 2011.  Plant dehydrins and stress tolerance: versatile proteins for complex mechanisms. Plant Signal. Behav. 6: 1503–1509. 10.4161/psb.6.10.17088 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Hannah M., Wiese D., Freund S., Fiehn O., Heyer A. G., et al. , 2006.  Natural genetic variation of freezing tolerance in Arabidopsis. Plant Physiol. 142: 98–112. 10.1104/pp.106.081141 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. He Q., Jones D. C., Li W., Xie F., Ma J., et al. , 2016.  Genome-Wide Identification of R2R3-MYB Genes and Expression Analyses During Abiotic Stress in Gossypium raimondii. Sci. Rep. 6: 22980 10.1038/srep22980 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. He S., Tan L., Hu Z., Chen G., Wang G., et al. , 2012.  Molecular characterization and functional analysis by heterologous expression in E. coli under diverse abiotic stresses for OsLEA5, the atypical hydrophobic LEA protein from Oryza sativa L. Mol. Genet. Genomics 287: 39–54. 10.1007/s00438-011-0660-x [DOI] [PubMed] [Google Scholar]
  47. Henry A., 2013.  IRRI’s drought stress research in rice with emphasis on roots: accomplishments over the last 50 years. Plant Root 7: 92–106. 10.3117/plantroot.7.92 [DOI] [Google Scholar]
  48. Higgins D. G., Thompson J. D., Gibson T. J., 1996.  Using CLUSTAL for multiple sequence alignments. Methods Enzymol. 266: 383–402. 10.1016/S0076-6879(96)66024-8 [DOI] [PubMed] [Google Scholar]
  49. Higo K., Ugawa Y., Iwamoto M., Korenaga T., 1999.  Plant cis-acting regulatory DNA elements (PLACE) database: 1999. Nucleic Acids Res. 27: 297–300. 10.1093/nar/27.1.297 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Hong-bo S., Zong-suo L., Ming-an S., 2005.  LEA proteins in higher plants : Structure, function, gene expression and regulation. 45: 131–135. [DOI] [PubMed]
  51. Horton P., Park K. J., Obayashi T., Fujita N., Harada H., et al. , 2007.  WoLF PSORT: Protein localization predictor. Nucleic Acids Res. 35: W585–W587. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Hundertmark M., Hincha D. K., 2008.  LEA (Late Embryogenesis Abundant) proteins and their encoding genes in Arabidopsis thaliana. 22: 1–22. [DOI] [PMC free article] [PubMed]
  53. Hundertmark M., Popova A. V., Rausch S., Seckler R., Hincha D. K., 2012.  Influence of drying on the secondary structure of intrinsically disordered and globular proteins. Biochem. Biophys. Res. Commun. 417: 122–128. 10.1016/j.bbrc.2011.11.067 [DOI] [PubMed] [Google Scholar]
  54. Jain M., Mathur G., Koul S., Sarin N. B., 2001.  Ameliorative effects of proline on salt stress-induced lipid peroxidation in cell lines of groundnut (Arachis hypogaea L.). Plant Cell Rep. 20: 463–468. 10.1007/s002990100353 [DOI] [Google Scholar]
  55. Jeffares D. C., Penkett C. J., Bähler J., 2008.  Rapidly regulated genes are intron poor. Trends Genet. 24: 375–378. 10.1016/j.tig.2008.05.006 [DOI] [PubMed] [Google Scholar]
  56. Jiménez J. A., Alonso-Ramírez A., Nicolás C., 2008.  Two cDNA clones (FsDhn1 and FsClo1) up-regulated by ABA are involved in drought responses in Fagus sylvatica L. seeds. J. Plant Physiol. 165: 1798–1807. 10.1016/j.jplph.2007.11.013 [DOI] [PubMed] [Google Scholar]
  57. Jisha K. C., Vijayakumari K., Puthur J. T., 2013.  Seed priming for abiotic stress tolerance: An overview. Acta Physiol. Plant. 35: 1381–1396. 10.1007/s11738-012-1186-5 [DOI] [Google Scholar]
  58. Kang Y., Khan S., Ma X., 2009.  Climate change impacts on crop yield, crop water productivity and food security - A review. Prog. Nat. Sci. 19: 1665–1674. 10.1016/j.pnsc.2009.08.001 [DOI] [Google Scholar]
  59. Kmiecik P., Leonardelli M., Teige M., 2016.  Novel connections in plant organellar signalling link different stress responses and signalling pathways. J. Exp. Bot. 67: 3793–3807. 10.1093/jxb/erw136 [DOI] [PubMed] [Google Scholar]
  60. Koag M.-C., 2003.  The binding of Maize DHN1 to Lipid Vesicles. Gain of Structure and Lipid Specificity. Plant Physiol. 131: 309–316. 10.1104/pp.011171 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Kovacs D., Kalmar E., Torok Z., Tompa P., 2008.  Chaperone Activity of ERD10 and ERD14, Two Disordered Stress-Related Plant Proteins. Plant Physiol. 147: 381–390. 10.1104/pp.108.118208 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Lan T., Gao J., Zeng Q. Y., 2013.  Genome-wide analysis of the LEA (late embryogenesis abundant) protein gene family in Populus trichocarpa. Tree Genet. Genomes 9: 253–264. 10.1007/s11295-012-0551-2 [DOI] [Google Scholar]
  63. Lane N., Martin W., 2010.  The energetics of genome complexity. Nature 467: 929–934. 10.1038/nature09486 [DOI] [PubMed] [Google Scholar]
  64. Lazazzera B. A., Hughes D., 2015.  Genetics: Location affects sporulation. Nature 525: 42–43. 10.1038/nature15207 [DOI] [PubMed] [Google Scholar]
  65. Li J., Ban L., Wen H., Wang Z., Dzyubenko N., et al. , 2015a An aquaporin protein is associated with drought stress tolerance. Biochem. Biophys. Res. Commun. 459: 208–213. 10.1016/j.bbrc.2015.02.052 [DOI] [PubMed] [Google Scholar]
  66. Li F., Fan G., Lu C., Xiao G., Zou C., et al. , 2015b Genome sequence of cultivated Upland cotton (Gossypium hirsutum TM-1) provides insights into genome evolution. Nat. Biotechnol. 33: 524–530. 10.1038/nbt.3208 [DOI] [PubMed] [Google Scholar]
  67. Li C., Ng C. K. Y., Fan L. M., 2015c MYB transcription factors, active players in abiotic stress signaling. Environ. Exp. Bot. 114: 80–91. 10.1016/j.envexpbot.2014.06.014 [DOI] [Google Scholar]
  68. Luna C. M., Pastori G. M., Driscoll S., Groten K., Bernard S., et al. , 2005.  Drought controls on H2O2accumulation, catalase (CAT) activity and CAT gene expression in wheat, pp. 417–423 in Journal of Experimental Botany,. [DOI] [PubMed] [Google Scholar]
  69. Mahajan S., Mahajan S., Tuteja N., Tuteja N., 2005.  Cold, salinity and drought stresses: an overview. Arch. Biochem. Biophys. 444: 139–158. 10.1016/j.abb.2005.10.018 [DOI] [PubMed] [Google Scholar]
  70. Mahdieh M., Mostajeran A., Horie T., Katsuhara M., 2008.  Drought stress alters water relations and expression of PIP-type aquaporin genes in Nicotiana tabacum plants. Plant Cell Physiol. 49: 801–813. 10.1093/pcp/pcn054 [DOI] [PubMed] [Google Scholar]
  71. Mittler R., 2006.  Abiotic stress, the field environment and stress combination. Trends Plant Sci. 11: 15–19. 10.1016/j.tplants.2005.11.002 [DOI] [PubMed] [Google Scholar]
  72. Monera O. D., Sereda T. J., Zhou N. E., Kay C. M., Hodges R. S., 1995.  Relationship of sidechain hydrophobicity and α‐helical propensity on the stability of the single‐stranded amphipathic α‐helix. J. Pept. Sci. 1: 319–329. 10.1002/psc.310010507 [DOI] [PubMed] [Google Scholar]
  73. de Nazaré Monteiro Costa C. N., Brígida A. B. S., do Borges B. N., de Neto M. A. M., Carvalho L. J. C. B., et al. , 2011.  Levels of MeLEA3, a cDNA Sequence Coding for an Atypical Late Embryogenesis Abundant Protein in Cassava, Increase Under In Vitro Salt Stress Treatment. Plant Mol. Biol. Rep. 29: 997–1005. 10.1007/s11105-011-0292-7 [DOI] [Google Scholar]
  74. Movahedi S., Van Bel M., Heyndrickx K. S., Vandepoele K., 2012.  Comparative co-expression analysis in plant biology. Plant Cell Environ. 35: 1787–1798. 10.1111/j.1365-3040.2012.02517.x [DOI] [PubMed] [Google Scholar]
  75. Muniz Pedrosa A., Martins C. D. P. S., Gonçalves L. P., Costa M. G. C., 2015.  Late Embryogenesis Abundant (LEA) constitutes a large and diverse family of proteins involved in development and abiotic stress responses in sweet orange (Citrus sinensis L. Osb.). PLoS One 10: e0145785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Nehrt N. L., Clark W. T., Radivojac P., Hahn M. W., 2011.  Testing the ortholog conjecture with comparative functional genomic data from mammals. PLOS Comput. Biol. 7: e1002073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Nylander M., Svensson J., Palva E. T., Welin B. V., 2001.  Stress-induced accumulation and tissue-specific localization of dehydrins in Arabidopsis thaliana. Plant Mol. Biol. 45: 263–279. 10.1023/A:1006469128280 [DOI] [PubMed] [Google Scholar]
  78. Ohno S., 1970.  Evolution by Gene Duplication. (1970).
  79. Olvera-Carrillo Y., Luis Reyes J., Covarrubias A. A., 2011.  Late Embryogenesis Abundant proteins: Versatile players in the plant adaptation to water limiting environments. Plant Signal. Behav. 6: 586–589. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Osman C., Merkwirth C., Langer T., 2009.  Prohibitins and the functional compartmentalization of mitochondrial membranes. J. Cell Sci. 122: 3823–3830. 10.1242/jcs.037655 [DOI] [PubMed] [Google Scholar]
  81. Pereira-Santana A., Alcaraz L. D., Castaño E., Sanchez-Calderon L., Sanchez-Teyer F., et al. , 2015.  Comparative genomics of NAC transcriptional factors in angiosperms: Implications for the adaptation and diversification of flowering plants. PLoS One 10: e0141866 10.1371/journal.pone.0141866 [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Petukhov M., Muñoz V., Yumoto N., Yoshikawa S., Serrano L., 1998.  Position dependence of non-polar amino acid intrinsic helical propensities. J. Mol. Biol. 278: 279–289. 10.1006/jmbi.1998.1682 [DOI] [PubMed] [Google Scholar]
  83. Pfannschmidt T., Bräutigam K., Wagner R., Dietzel L., Schröter Y., et al. , 2009.  Potential regulation of gene expression in photosynthetic cells by redox and energy state: Approaches towards better understanding. Ann. Bot. 103: 599–607. 10.1093/aob/mcn081 [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Porcel R., Azcón R., Ruiz-Lozano J. M., 2005.  Evaluation of the role of genes encoding for dehydrin proteins (LEA D-11) during drought stress in arbuscular mycorrhizal Glycine max and Lactuca sativa plants. J. Exp. Bot. 56: 1933–1942. 10.1093/jxb/eri188 [DOI] [PubMed] [Google Scholar]
  85. Rejeb I., Pastor V., Mauch-Mani B., 2014.  Plant Responses to Simultaneous Biotic and Abiotic Stress: Molecular Mechanisms. Plants 3: 458–475. 10.3390/plants3040458 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Rodriguez-Salazar J., Moreno S., Espín G., 2017.  LEA proteins are involved in cyst desiccation resistance and other abiotic stresses in Azotobacter vinelandii. Cell Stress Chaperones 22: 397–408. 10.1007/s12192-017-0781-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Rorat T., 2006.  Plant dehydrins — Tissue location, structure and function. Cell. Mol. Biol. Lett. 11: 536–556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Sexton T., Cavalli G., 2015.  The role of chromosome domains in shaping the functional genome. Cell 160: 1049–1059. 10.1016/j.cell.2015.02.040 [DOI] [PubMed] [Google Scholar]
  89. Simpson S. D., Nakashima K., Narusaka Y., Seki M., Shinozaki K., et al. , 2003.  Two different novel cis-acting elements of erd1, a clpA homologous Arabidopsis gene function in induction by dehydration stress and dark-induced senescence. Plant J. 33: 259–270. 10.1046/j.1365-313X.2003.01624.x [DOI] [PubMed] [Google Scholar]
  90. Singh D., Yadav N. S., Tiwari V., Agarwal P. K., Jha B., 2016.  A SNARE-Like Superfamily Protein SbSLSP from the Halophyte Salicornia brachiata Confers Salt and Drought Tolerance by Maintaining Membrane Stability, K+/Na+ Ratio, and Antioxidant Machinery. Front. Plant Sci. 7: 737 10.3389/fpls.2016.00737 [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Sofia A., de Almeida A. M., da Silva A. B., da Silva J. M., Paula A., et al. , 2013.  Abiotic Stress Responses in Plants: Unraveling the Complexity of Genes and Networks to Survive. Abiotic Stress - Plant Responses Appl. Agric. 49–101. [Google Scholar]
  92. Solano R., Nieto C., Avila J., Cañas L., Diaz I., et al. , 1995.  Dual DNA binding specificity of a petal epidermis-specific MYB transcription factor (MYB.Ph3) from Petunia hybrida. EMBO J. 14: 1773–1784. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Sreedharan S., Shekhawat U. K. S., Ganapathi T. R., 2013.  Transgenic banana plants overexpressing a native plasma membrane aquaporin MusaPIP1;2 display high tolerance levels to different abiotic stresses. Plant Biotechnol. J. 11: 942–952. 10.1111/pbi.12086 [DOI] [PubMed] [Google Scholar]
  94. Tatusov R. L., 1997.  A Genomic Perspective on Protein Families. Science (80-.). 278: 631–637. 10.1126/science.278.5338.631 [DOI] [PubMed] [Google Scholar]
  95. Tilman D., Balzer C., Hill J., Befort B. L., 2011.  Global food demand and the sustainable intensification of agriculture. Proc. Natl. Acad. Sci. USA 108: 20260–20264. 10.1073/pnas.1116437108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Timperio A. M., Egidi M. G., Zolla L., 2008.  Proteomics applied on plant abiotic stresses: Role of heat shock proteins (HSP). J. Proteomics 71: 391–411. 10.1016/j.jprot.2008.07.005 [DOI] [PubMed] [Google Scholar]
  97. Verslues P. E., Agarwal M., Katiyar-agarwal S., Zhu J., Zhu J., 2006.  Methods and concepts in quantifying resistance to drought, salt and freezing, abiotic stresses that affect plant water status. 523–539. [DOI] [PubMed]
  98. Voorrips R. E., 2002.  MapChart: Software for the Graphical Presentation of Linkage Maps and QTLs. J. Hered. 93: 77–78. 10.1093/jhered/93.1.77 [DOI] [PubMed] [Google Scholar]
  99. Wahl M. C., Will C. L., Lührmann R., 2009.  The Spliceosome: Design Principles of a Dynamic RNP Machine. Cell 136: 701–718. 10.1016/j.cell.2009.02.009 [DOI] [PubMed] [Google Scholar]
  100. Wang M., Li P., Li C., Pan Y., Jiang X., et al. , 2014.  SiLEA14, a novel atypical LEA protein, confers abiotic stress resistance in foxtail millet. 1–16. [DOI] [PMC free article] [PubMed]
  101. Wang W., Vinocur B., Shoseyov O., Altman A., 2004.  Role of plant heat-shock proteins and molecular chaperones in the abiotic stress response. Trends Plant Sci. 9: 244–252. 10.1016/j.tplants.2004.03.006 [DOI] [PubMed] [Google Scholar]
  102. Wang Z., Wang F., Hong Y., Huang J., Shi H., et al. , 2016.  Two Chloroplast Proteins Suppress Drought Resistance by Affecting ROS Production in Guard Cells. Plant Physiol. 172: 2491–2503. 10.1104/pp.16.00889 [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Wang K., Wang Z., Li F., Ye W., Wang J., et al. , 2012.  The draft genome of a diploid cotton Gossypium raimondii. Nat. Genet. 44: 1098–1103. 10.1038/ng.2371 [DOI] [PubMed] [Google Scholar]
  104. Wang X.-S., Zhu H.-B., Jin G.-L., Liu H.-L., Wu W.-R., et al. , 2007.  Genome-scale identification and analysis of LEA genes in rice (Oryza sativa L.). Plant Sci. 172: 414–420. 10.1016/j.plantsci.2006.10.004 [DOI] [Google Scholar]
  105. Xiao B., Huang Y., Tang N., Xiong L., 2007.  Over-expression of a LEA gene in rice improves drought resistance under the field conditions. Theor. Appl. Genet. 115: 35–46. 10.1007/s00122-007-0538-9 [DOI] [PubMed] [Google Scholar]
  106. Xie F., Wang Q., Sun R., Zhang B., 2015.  Deep sequencing reveals important roles of microRNAs in response to drought and salinity stress in cotton. 66: 789–804. [DOI] [PMC free article] [PubMed]
  107. Xiong L., Ishitani M., 2006.  Stress signal transduction: Components, pathways and network integration, pp. 3–29 in Abiotic Stress Tolerance in Plants. Springer-Verlag, New York. [Google Scholar]
  108. Yamaguchi-Shinozaki K., Shinozaki K., 2005.  Organization of cis-acting regulatory elements in osmotic- and cold-stress-responsive promoters. Trends Plant Sci. 10: 88–94. 10.1016/j.tplants.2004.12.012 [DOI] [PubMed] [Google Scholar]
  109. Yang L., Xia J., 2011.  Prediction of late embryogenesis abundant proteins with chaos games representations. 9: 4–6.
  110. Yu J., Wang J., Lin W., Li S., Li H., et al. , 2005.  The genomes of Oryza sativa: A history of duplications. PLoS Biol. 3: 0266–0281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. Zhao P., Liu F., Ma M., Gong J., Wang Q., et al. , 2011.  Overexpression of AtLEA3–3 confers resistance to cold stress in Escherichia coli and provides enhanced osmotic stress tolerance and ABA sensitivity in Arabidopsis thaliana. Mol. Biol. 45: 785–796. 10.1134/S0026893311050165 [DOI] [PubMed] [Google Scholar]
  112. Zheng J. Y., Oluoch G., Riaz Khan M. K., Wang X. X., Cai X. Y., et al. , 2016.  Mapping QTLs for drought tolerance in an F2:3 population from an inter-specific cross between Gossypium tomentosum and Gossypium hirsutum. Genet. Mol. Res. 15 10.4238/gmr.15038477 [DOI] [PubMed] [Google Scholar]

Articles from G3: Genes|Genomes|Genetics are provided here courtesy of Oxford University Press

RESOURCES