Skip to main content
International Journal of Molecular Sciences logoLink to International Journal of Molecular Sciences
. 2024 Nov 3;25(21):11814. doi: 10.3390/ijms252111814

Origin, Evolution, and Diversification of the Expansin Family in Plants

Zhizhan Wang 1, Jinbiao Cao 1, Nan Lin 1, Jiaming Li 1, Yazhou Wang 1, Weibin Liu 1, Wen Yao 1, Yang Li 1,*
Editor: Abir U Igamberdiev1
PMCID: PMC11547041  PMID: 39519364

Abstract

The cell wall is a crucial feature that allows ancestral streptophyte green algae to colonize land. Expansin, an extracellular protein that mediates cell wall loosening in a pH-dependent manner, could be a powerful tool for studying cell wall evolution. However, the evolutionary trajectory of the expansin family remains largely unknown. Here, we conducted a comprehensive identification of 2461 expansins across 64 sequenced species, ranging from aquatic algae to terrestrial plants. Expansins originated in chlorophyte algae and may have conferred the ability to loosen cell walls. The four expansin subfamilies originated independently: α-expansin appeared first, followed by β-expansin, and then expansin-like A and expansin-like B, reflecting the evolutionary complexity of plant expansins. Whole genome duplication/segmental duplication and tandem duplication events greatly contributed to expanding the expansin family. Despite notable changes in sequence characteristics, the intron distribution pattern remained relatively conserved among different subfamilies. Phylogenetic analysis divided all the expansins into five clades, with genes from the same subfamily tending to cluster together. Transcriptome data from 16 species across ten lineages and qRT-PCR analysis revealed varying expression patterns of expansin genes, suggesting functional conservation and diversification during evolution. This study enhances our understanding of the evolutionary conservation and dynamics of the expansin family in plants, providing insight into their roles as cell wall-loosening factors.

Keywords: expansin, cell wall, origin, evolution, diversification

1. Introduction

Green plants (Viridiplantae) are widely distributed in aquatic and terrestrial ecosystems. During the colonization of land, ancestral green plants evolved their cell walls to adapt to terrestrial environments. The plant cell wall is an elaborate structure that surrounds the plasma membrane, providing structural support and protection to the plant cell. Major components of the cell wall include cellulose, hemicellulose, and pectin, which are organized into primary and secondary cell walls. The primary cell wall, which controls cell expansion and elongation, has a thickness ranging from approximately 0.1 to 10 μm [1]. Following the cessation of cell growth, a robust secondary cell wall begins to deposit inside the primary cell wall, providing additional protection, rigidity, and strength. Currently, many cell wall-related genes have been documented. In plants, cellulose, composed of β-1,4 glucans, is an important biopolymer in the cell wall. Plasma membrane-embedded cellulose synthases A (CesAs) directly catalyze the elongation of the β-1,4 glucan chains and align them into protofibrils [2]. The cryo-electron microscopy (cryo-EM) structures of PttCesA8 and GhCesA7 have been elucidated to enhance our understanding of the molecular mechanism underlying plant cellulose synthesis [3,4]. Xyloglucan is the most abundant hemicellulose in the primary wall of most seed plants [5]. Cellulose synthase like-C (CSLC) in Arabidopsis thaliana, which is involved in synthesizing the xyloglucan backbone, has been identified [6]. Homogalacturonan (HG), a main type of pectin in the primary cell wall, is methyl-esterified in the Golgi and demethyl-esterified after secretion into the cell wall [7]. Pectin methylesterase (PME) and pectin methylesterase inhibitor (PMEI) regulate the demethylesterification of HG in two ways: non-blockwise and blockwise [8].

In addition to the three major cell wall components (cellulose, hemicellulose, and pectin), there are cell wall-related proteins, such as expansin, wall-associated kinase, and pectate lyase, that play numerous important roles in cell wall dynamics [9,10,11]. Expansin, first identified in cucumber hypocotyls, is a classic and important cell wall protein that facilitates cell wall loosening without causing the lysis of wall polymers [12,13]. Research has shown that ZmEXPB1 specifically binds to glucuronoarabinoxylan to loosen the cell wall in maize [14]. Typical expansin proteins possess two conserved functional domains: the DPBB domain (N-terminal domain, 120–135 amino acids), which is similar to family-45 glycosyl hydrolase (GH45); and the CBM63 domain (C-terminal domain, 90–120 amino acids), which is homologous to the family-63 carbohydrate-binding module (CBM63) [13,15]. Both domains are preceded by a signal peptide consisting of 20–30 amino acids. Based on phylogenetic analysis and sequence characteristics, the expansin family is divided into four subfamilies: α-expansin (EXPA), β-expansin (EXPB), expansin-like A (EXLA), and expansin-like B (EXLB). To date, EXPA and EXPB have been associated with cell enlargement, while EXLA and EXLB have received comparatively less attention [16,17].

In plants, immunolocalization studies have demonstrated that expansins are mainly localized to the cell wall [18,19]. Expansins are highly expressed in developing tissues and organs where cell wall loosening or disassembly occurs, such as in growing roots, expanding leaves, and developing seeds, fruits, or fibers [20,21,22,23,24]. In many plant species, the biological functions of expansins have been confirmed, including their roles in regulating root growth, leaf/pollen tube development, grain size, fruit ripening, and stress tolerance. During lateral root development in Arabidopsis, the auxin-responsive gene AtEXPA1 affects the division and expansion of pericycle cells [22]. In wheat, the EXPA gene TaExpA6, which is normally expressed in roots, was ectopically expressed in early developing seeds, resulting in larger grains without affecting the grain number [21]. Tomato Exp1 and endoglucanase Cel2 work together to enhance fruit softening and facilitate cell wall disassembly [23]. Additionally, expansins play a key role in responding to abiotic and biotic stresses in crops such as rice, wheat, maize, soybean, peanut, and cotton [17,25,26,27,28,29].

Advances in sequencing techniques for plant genomics over the past decade have been impressive, enabling the elucidation of the evolution and phylogeny of some pivotal cell wall-related genes in plants, such as cellulose synthase, PME/PMEI, and glucuronic acid substitution of xylan (GUX) [30,31,32]. However, a comprehensive understanding of the evolution of the plant expansin family at the genomic level is still lacking. In this study, we investigated the presence and absence of expansins by mining the genomes of 64 species, ranging from aquatic algae to land plants. Expansins were found in Chlorophyta and Streptophyta but are missing in all the red algae studied. Furthermore, we explored the expansion mechanism, sequence characteristics, and intron distribution pattern of expansins, as well as their evolutionary conservation. Finally, we focused on the functional conservation and diversification of the expansin family during evolution using transcriptome data from 16 representative species and conducting qRT-PCR analysis.

2. Results

2.1. The Origination of Expansin Genes in Plants

To better investigate the origin of expansin genes in plants, we selected 64 representative species, including red algae (Rhodophyta), green algae (Chlorophyta), streptophyte algae (Charophyta), and land plants (Embryophyta) (Table S1). The rhodophytes included two species, the chlorophytes included five species, the streptophyte algae included seven species, and the land plants included 50 species classified into eight lineages (i.e., liverworts, mosses, lycophytes, ferns, gymnosperms, basal angiosperms, monocots, and eudicots) [33]. A total of 2461 expansin proteins belonging to four subfamilies were identified utilizing BLAST, HMMER 3.3.1, and NCBI CDD (Table S2). Of the 64 species examined, the number of expansin proteins varied significantly, ranging from 0 to 247 (Figure 1). No expansin proteins were detected in two rhodophytes (Cyanidioschyzon merolae and Chondrus crispus), three chlorophytes (Ostreococcus lucimarinus, Micromonas pusilla and Chlamydomonas reinhardtii), and two streptophyte algae (Mesostigma viride and Chlorokybus atmophyticus). To gain additional insights into the origin of expansin genes, the genomes of the seven species were analyzed. We identified 17 protein sequences encoding the DPBB domain with significant e-values (<1 × 10−5) in the genomes of four species (C. crispus, O. lucimarinus, M. pusilla and C. reinhardtii), whereas the CBM63 domain was not detected (Figure S1, Table S3). Surprisingly, the rhodophyte C. merolae and the two streptophyte algae, M. viride and C. atmophyticus, were found to lack both the DPBB and CBM63 domains (Figure S1). Subsequently, one expansin protein was identified in the green algae Volvox carteri and four in Coccomyxa subellipsoidea. In the land plants, expansin proteins were found to be ubiquitously present, suggesting that expansin proteins first appeared in chlorophytes during the evolutionary process.

Figure 1.

Figure 1

Occurrence of the expansin family among green plants. The number of expansin genes for each species is shown. The four previously defined expansin subfamilies (EXPA, EXPB, EXLA, and EXLB) are indicated. The species tree was constructed using TimeTree 5 (http://www.timetree.org/) [34]. The scale bar represents divergence time (million years ago, MYA).

The published literature shows that expansin proteins are grouped into four subfamilies (EXPA, EXPB, EXLA, and EXLB) based on sequence characteristics [13,15]. We next investigated the origin of the four subfamilies (Figure 1). EXPA was first identified in the chlorophytes (V. carteri and C. subellipsoidea) and appeared widely in the streptophyte algae and land plants. EXPB was first present in the moss (Physcomitrium patens), comprising six members. EXLA and EXLB were absent in the chlorophytes, streptophyte algae, liverworts, mosses, lycophytes, and ferns, but were first found in the gymnosperms (Picea abies and Ginkgo biloba). All four subfamilies were present in the seed plants (gymnosperms, basal angiosperms, monocots, and eudicots). These data suggest that the emergence times of EXPA, EXPB, and EXLA/EXLB were inconsistent, with EXPA occurring earliest in chlorophytes. EXPB appeared after the divergence of liverworts and mosses, while EXLA and EXLB were limited to seed plants.

2.2. Evolutionary Enlargement of the Expansin Family

Gene-family expansion is a well-known contributor to the diversity of gene functional characteristics during evolution. To understand the enlargement of the expansin family in plants, we first observed the number of expansin proteins across different lineages (Table S4). The average number of expansin proteins per lineage varied significantly, ranging from 2.5 members in the chlorophytes to 14.6 in the streptophyte algae, 34 in the liverworts, 40 the in mosses, 18 in the lycophytes, 21 in the ferns, 45.5 in the gymnosperms, 19.5 in the basal angiosperms, 81.18 in the monocots, and 40.23 in the eudicots. The expansin proteins showed a trend of gradual expansion from chlorophyte algae to mosses, contrasting with the more variable expansion observed in the other six lineages of land plants during evolution.

Natural diploids and polyploids are prevalent in plants. Significant variation in the number of expansin genes was observed among 43 diploid species, ranging from 13 in Amborella trichopoda to 94 in Zea mays, reflecting the occurrence of gene expansion events (Figure 1). When the expansin numbers in three tetraploid species (Panicum virgatum, Brassica napus and Arachis hypogaea) and one hexaploid species (Triticum aestivum) were normalized to the diploid level, T. aestivum, P. virgatum, and B. napus still had more than 50 members each.

By investigating the number of each expansin subfamily, we found that the number of EXPA was significantly larger than those of EXPB, EXLA, and EXLB in most of the species studied. Exceptions to this trend were observed in Brachypodium distachyon, T. aestivum, Hordeum vulgare, Sorghum bicolor, and Z. mays (Figure 1). The largest average number of EXPB was found in Gramineae plants, with 41,875 members. Notably, EXLB was absent in four Gramineae crops (T. aestivum, Hordeum vulgare, S. bicolor, and Z. mays), which is somewhat surprising.

Whole genome duplication (WGD), segmental duplication, and tandem duplication have been proven critical for gene family expansion during genome evolution. To trace the expansion history of the expansin family in plants, we identified 21,922 collinear blocks containing 1771 expansin genes through intergenomic comparison within 44 angiosperm species (two basal angiosperms, 11 monocots, and 31 eudicots) using MCScanX. It indicated that these genes were generated through WGD or segmental duplication. From the collinear data analyses, we detected 18,619 (EXPA-EXPA), 3078 (EXPB-EXPB), 915 (EXLA-EXLA), and 1546 (EXLB-EXLB) pairs of collinear genes (Figure 2, Table S5). The number of collinear expansin pairs between two angiosperm species varied widely, from 1 to 194 (B. napus versus Brassica rapa). The number of collinear expansin pairs in the eudicots was larger than that in the basal angiosperms and monocots. Distinct from angiosperms, the expansin genes in the chlorophyte algae, streptophyte algae, liverworts, mosses, lycophytes, ferns, and gymnosperms showed no collinearity with each other (Figure 2). In addition, no evidence of collinearity of expansin genes was found between the angiosperms and chlorophyte algae, streptophyte algae, liverworts, mosses, lycophytes, ferns, or gymnosperms.

Figure 2.

Figure 2

Schematic representations of the collinear relationships of expansin genes among 29 of the 57 species. (A) EXPA subfamily; (B) EXPB subfamily; (C) EXLA subfamily; (D) EXLB subfamily. The length of each colored bar in the circle indicates the number of expansin genes in a species. Expansin genes within each species are arranged clockwise along a colored bar, according to their genomic coordinates. Each colored line represents a colinear expansin gene pair. The species names are as follows: Volvox carteri (Vc), Coccomyxa subellipsoidea (Cs), Klebsormidium nitens (Kn), Chara braunii (Cb), Spirogloea muscicola (Smu), Mesotaenium endlicherianum (Me), Penium margaritaceum (Pm), Marchantia polymorpha (Mp), Physcomitrium patens (Pp), Selaginella moellendorffii (Smo), Azolla filiculoides (Af), Picea abies (Pa), Ginkgo biloba (Gb), Amborella trichopoda (Atr), Nymphaea colorata (Nc), Phoenix dactylifera (Pd), Oryza sativa (Os), Hordeum vulgare (Hv), Zea mays (Zm), Aquilegia coerulea (Aco), Actinidia chinensis (Ach), Solanum lycopersicum (Sl), Eucalyptus grandis (Eg), Gossypium raimondii (Gr), Carica papaya (Cp), Arabidopsis thaliana (Ath), Populus trichocarpa (Pt), Glycine max (Gm), and Malus domestica (Md).

Next, the intraspecies collinear blocks were analyzed (Table S6). We found that 1142 expansin genes from 46 species, including Chara braunii, Spirogloea muscicola, P. patens, and 43 angiosperms, were present in 751 collinear blocks, which was significantly lower than the number observed in intergenomic comparisons. Over 98.73% of the collinear expansin pairs resulted from angiosperms. However, the collinearity of expansin genes was not detected in A. trichopoda, one of the two basal angiosperms. Our results suggest that two expansin genes (GBG70289 and GBG74093) in C. braunii might have evolved through WGD or segmental duplication. However, Nishiyama et al. demonstrated that WGD was absent in C. braunii [35]. Therefore, it is likely that GBG70289 and GBG74093 resulted from segmental duplication. Moreover, nine collinear expansin pairs in S. muscicola could be attributed to a whole genome triplication event [36]. In P. patens, two WGD events (WGD1 and WGD2) occurred approximately 40–48 and 27–35 million years ago [37], which may have contributed to the increase in expansin gene numbers (Pp3c18_19690V3.2 and Pp3c21_1890V3.1, Pp3c8_13450V3.1 and Pp3c24_9250V3.1, Pp3c1_37980V3.2 and Pp3c2_3390V3.1).

To gain further insight into the enlargement of the expansin family, tandem duplication genes (TDGs) were identified (Table S7). A total of 753 expansin genes from 50 species were grouped into 256 TDG clusters. In particular, TDG clusters in eight Gramineae plants (an average of ~18 TDG clusters) were larger than those in the other 42 species (an average of ~2.67 TDG clusters). The largest TDG cluster, containing 10 expansin genes, was mapped on the 3B chromosome of wheat, likely resulting from unequal crossing-over events [38]. Additionally, we found that tandem duplication primarily contributed to expansin enlargement in the chlorophyte algae, streptophyte algae, liverworts, mosses, lycophytes, ferns, and gymnosperms. In contrast, WGD or segmental duplication played more significant roles in the angiosperms. These results suggest that the expansin family has undergone extensive expansion through both WGD/segmental duplication and tandem duplication over evolutionary time.

2.3. Sequence Characteristics, Gene Structure, and Intron Distribution Pattern of Expansin Genes in Plants

To explore how the sequence characteristics of expansin genes vary from chlorophyte algae to eudicots, we further analyzed 2461 expansin genes from 57 studied species. Generally, the gene length, protein length, molecular weight, and isoelectric point (pI) changed substantially (Table S2). For 68.14% of all the expansin genes, the gene length ranged from 1000 bp to 2500 bp. Consistent with previously identified expansin proteins, 91.47% of the total expansins had a protein length from 200 to 300 amino acids. The averages of the protein length and molecular weight were highly conserved within the four expansin subfamilies but varied widely between the chlorophyte algae or streptophyte algae and the eight lineages of land plants (Table S8 and Table S9).

Based on exon-intron organization, gene structure is crucial for elucidating the evolutionary processes of gene families. Among the 2461 expansin genes, the exon and intron lengths varied significantly, especially the intron lengths, which ranged from 0 to 40,949 bp (Table S2). At the subfamily level, the average changes in the exon and intron lengths were basically conserved, in contrast to the variations observed at the lineage level (Tables S8 and S9). Additionally, a correlation analysis was performed to examine the relationship between gene length and exon or intron length. Exon length had a slight influence on gene length (r2 = 0.22, p < 0.01), whereas intron length had a stronger impact (r2 = 0.87, p < 0.01), implying that changes in gene length are closely related to changes in intron length.

Subsequently, intron distribution patterns within the DPBB and CBM63 domains were analyzed (Figure 3, Table S10). A total of 2409 (97.89%) expansin genes featured the six most common patterns: Pattern I (47.26%), Pattern II (18.89%), Pattern III (11.82%), Pattern IV (8.09%), Pattern V (6.30%), and Pattern VI (5.53%). Pattern I, characterized by two introns in the DPBB domain, was highly conserved in the EXPA from the liverworts to eudicots, indicating that it might have been inherited from the common ancestor of land plants. Pattern II, which features one intron in the DPBB domain, was the second most common pattern in EXPA. EXPB exhibited various distribution patterns during evolution. However, Pattern III, with three introns in the DPBB domain, was the most prevalent in the EXPB from the basal angiosperms to eudicots. In EXLA and EXLB, the distribution patterns were generally conserved, featuring three introns in the DPBB domain and one intron in the CBM63 domain (Pattern V), except for the EXLB in the eudicots. Expansin genes from the chlorophyte algae possessed at least six and up to eleven introns in the two domains, differing significantly from those in the streptophyte algae and land plants. The intron distribution patterns in EXPA, EXPB, and EXLA/EXLB were highly conserved in the land plants, angiosperms, and seed plants, respectively. The change in intron number in the DPBB and CBM63 domains suggested that intron gain and loss events were common in the expansin family, from unicellular algae to angiosperms.

Figure 3.

Figure 3

Intron distribution patterns within the DPBB and CBM63 domains across ten lineages. A simplified phylogeny of green plants was drawn according to Leebens-Mack JH et al. and Jia Q et al. [33,39]. Introns in the DPBB and CBM63 domains are indicated by red vertical lines. The red numerals represent the number of introns corresponding to the most common pattern (i.e., the largest percentage) in each subfamily across different lineages.

2.4. Evolutionary Conservation of the Expansin Family

To determine the phylogenetic relationship of plant expansins, 2461 expansin proteins were used to construct an unrooted phylogenetic tree using the maximum likelihood (ML) approach in the IQ-TREE tool. The phylogenetic analysis classified these proteins into five clades: EXPA I, EXPA II, EXPB, EXLA, and EXLB (Figure 4A). Among these, EXPA I was an ancestral clade and clearly distinct from the other clades. Clade EXPA I, the earliest diverging clade, included not only chlorophyte algae (V. carteri and C. subellipsoidea) but also streptophyte algae (Klebsormidium nitens, C. braunii, S. muscicola, Mesotaenium endlicherianum, and Penium margaritaceum). These results indicate that all the expansin genes from the land plants were derived from the common ancestors of both chlorophyte algae and streptophyte algae.

Figure 4.

Figure 4

Phylogenetic analysis of expansins in green plants. (A) Phylogenetic tree topology of 2461 expansins across 57 species. Different clades within the EXPA subfamily are denoted by Roman numerals. The tree was constructed using IQ-TREE with the best-fitting model (Q.pfam + R9) and visualized using iTOL v5. The scale bar indicates an evolutionary distance of 0.5 nucleotides per position in the sequence. (B) Spiral diagram of the phylogenetic tree of 2461 expansins across 57 species. Each subfamily is labeled with a specific color. The tree was constructed using IQ-TREE with the best-fitting model (Q.pfam + R9) and visualized using the R package spiralize (v1.1.0). The phylogenetic tree, including all bootstrap values, is presented in Figure S2.

In parallel, the spiral phylogenetic tree clearly divided all the expansin proteins into four groups: the EXPA group, EXPB group, EXLA group, and EXLB group, demonstrating that expansin proteins from the same subfamily clustered together (Figure 4B and Figure S2). A total of 1610 EXPA members, 557 EXPB members, 139 EXLA members, and 155 EXLB members were assigned to their respective groups (Table S2). The phylogenetic classification was completely consistent with the genome-wide identification of the four subfamilies, revealing that the functional domains from different subfamilies are highly conserved.

In addition, the conservation of two domains (DPBB domain and CBM63 domain) was analyzed by sequence alignment (Figure S3). Among the four subfamilies, the DPBB domain exhibited the highest degree of conservation, possessing a GACG motif and a CGAC motif interspaced by ~30 amino acids. A highly conserved HDF motif was present in the EXPA and EXPB subfamilies, while the TDF motif was observed in the EXLA and EXLB subfamilies. Similar to previous results [15], the CDRC motif in the N-terminus was found only in the EXLA subfamily. We speculate that these conserved motifs are likely involved in the biological functions of the expansin family.

2.5. Strong Negative Selection for Expansin Genes

The ratio of nonsynonymous substitution rate (Ka) to synonymous substitution rate (Ks) is commonly used to study the type of Darwinian selection. To elucidate the potential selective pressure on expansin genes in plants, we calculated the Ka/Ks ratio for 1130 paralogous gene pairs from 56 species (except for V. carteri). The Ka/Ks values for these gene pairs ranged from 0.001 to 1.78945 (Figure 5, Table S11). Notably, the paralogous gene pair Phvul.010G070900-Phvul.010G071000 from Phaseolus vulgaris had the highest Ka/Ks value. The overwhelming majority of gene pairs (99.12%) had Ka/Ks values lower than 1, with 524 pairs showing values less than 0.1. These results indicate that most paralogs underwent negative selection with limited functional divergence. Only ten gene pairs (0.88%) from eight species (one in mosses, two in monocots, and five in eudicots) displayed Ka/Ks values greater than 1, suggesting that these genes experienced positive selection. Overall, expansin genes have been strongly conserved during evolution, reflecting their crucial role in cell wall loosening in plant lineages.

Figure 5.

Figure 5

Distribution of Ka/Ks values for expansin genes. A Ka/Ks analysis was performed on paralogous gene pairs across 56 species. The horizontal dashed line indicates the threshold for distinguishing between negative and positive selection in expansin genes.

2.6. Expression Pattern Analysis of the Expansin Family from Green Algae to Eudicots

Changes in gene expression patterns are associated with the evolution of gene families. To understand the evolution of the expansin family at the expression level, we analyzed transcriptome data from 16 species across ten lineages (Table S12). Of the 506 expansin genes identified in these species, 198 (39.13%) were expressed (FPKM ≥ 1), 97 (19.17%) were expressed at extremely weak levels (0 < FPKM < 1), and 211 (41.70%) were not expressed (FPKM = 0). At the species level, only one expansin gene in V. carteri, Vocar.0022s0205.1 (EXPA1), was found to have extremely weak expression (FPKM = 0.7031) (Figure 6, Table S13). In comparison, C. subellipsoidea had three additional expansin genes. All four expansin genes in C. subellipsoidea showed a large change in their expression levels, with FPKM values ranging from 0 to 169.8159. When analyzing the expression levels of expansin genes from the remaining 14 species, we observed a similar trend to that in C. subellipsoidea, except for P. margaritaceum. This trend was characterized by one or several expansin genes with very high expression levels. These genes were defined as specifically or preferentially expressed expansin genes that played key roles in development and function. Together with the expansion of the expansin family, we propose that changes in the expression levels of expansin genes were accompanied by functional conservation and diversification during species evolution.

Figure 6.

Figure 6

Expression analysis of the expansin family. FPKM values of 506 expansin genes from four different subfamilies across 16 species were analyzed. Expansin genes in each species were renamed based on the order in their genomic coordinates (e.g., EXPA1, EXPA2, etc.). Gray quadrilaterals indicate no genes in the corresponding species. The phylogenetic tree of the 16 species was constructed using TimeTree 5 [34]. The scale bar indicates divergence time (million years ago, MYA).

2.7. qRT-PCR Validation of Expansin Genes in Various Maize Tissues

To further validate the expression patterns of expansin genes in leaves, six genes (ZmEXPA14, ZmEXPA19, ZmEXPA39, ZmEXPB8, ZmEXPB28, and ZmEXPB37) with high expression levels in maize leaves were selected based on the transcriptome data. We employed qRT-PCR to analyze the expression characteristics of these expansin genes across five tissues: root, leaf, anther, silk, and seed, in the maize inbred B73. The results demonstrate that the expression patterns of these genes were consistent with the transcriptome data, showing high expression levels in maize leaves (Figure 7). Additionally, various transcript levels were observed for the six genes across the different tissues. For example, five genes (ZmEXPA14, ZmEXPA19, ZmEXPA39, ZmEXPB8, and ZmEXPB37) showed high expression in the seeds, especially ZmEXPB8, suggesting that expansin genes may play a role in seed development. ZmEXPA39 also exhibited the highest expression in the silk. However, ZmEXPB28 and ZmEXPB37 were absent in the anther and silk, respectively. These findings enhance our understanding of the functional conservation and diversification of expansin genes during evolution.

Figure 7.

Figure 7

qRT-PCR analysis of the expression patterns of six expansin genes in five maize tissues. All expression levels were normalized to ZmEF1α. The rose-red and blue bars represent the relative expression levels of expansin genes in leaves and the other four tissues, respectively. Error bars indicate the standard deviation of three biological replicates. Different letters represent statistically significant differences at p < 0.05 based on ANOVA (Duncan’s multiple comparison test).

3. Discussion

Expansin, a plant-specific protein that facilitates cell wall loosening, has critical functions in various biological processes by promoting cell elongation and expansion, as well as regulating development. Despite the recognized importance of expansins, the origin, evolution, and diversification of the expansin family in plants have yet to be systematically analyzed. In this study, we identified a total of 2461 expansin proteins from 64 species across eleven lineages at the genome-wide level. By investigating the collinearity and phylogenetic relationships, sequence characteristics, and gene expression patterns, our aim was to deepen the understanding of their evolutionary history and functional diversification in green plants.

The genome-wide gene identification suggested that expansins first appeared in Chlorophyta, followed by a significant increase in the size of the expansin family in Streptophyta (Figure 1, Table S2). Unexpectedly, homologous genes to Arabidopsis EXPA were found only in two chlorophyte algae (V. carteri and C. subellipsoidea) and five streptophyte algae (K. nitens, C. braunii, S. muscicola, M. endlicherianum, and P. margaritaceum) but not in the other three chlorophyte algae (O. lucimarinus, M. pusilla, and C. reinhardtii) and two streptophyte algae (M. viride and C. atmophyticus). Based on the phylogenetic relationship of the expansin family (Figure 4A), we speculate that Clade EXPA I might be the common ancestor of land plant expansins. Additionally, it is highly probable that these two chlorophyte algae and five streptophyte algae employed EXPA for cell wall loosening, similar to land plants. Therefore, we presume that EXPA in chlorophyte algae and streptophyte algae laid the foundation for the cell wall loosening function in land plants.

In green plants, horizontal gene transfer (HGT) is a crucial way to acquire novel genes from bacteria or fungi [40]. In recent years, over 200 horizontally acquired genes or gene families have been identified across various plant taxa [41]. For example, TAL-type transaldolase (associated with vascular development), peroxiredoxin (involved in stress response), and several cell wall-related genes, including glycoside hydrolase subfamily GH5_11 (related to cellulose degradation), β-glucosidase (involved in cellulose degradation), and glycoside hydrolase family 43 β-xylosidase (associated with xylan degradation) [41]. The DPBB domain and the CBM63 domain are two canonical expansin domains [13]. We found that the DPBB domain, but not the CBM63 domain, was present in one red alga (C. crispus) and three chlorophyte algae (O. lucimarinus, M. pusilla, and C. reinhardtii), all of which lacked expansin proteins (Figure S1, Table S3). This finding implies that the DPBB domain may have originated in red algae or chlorophyte algae. However, it is also possible that the CBM63 domain in chlorophyte algae resulted from an HGT event from an unknown bacteria or fungi.

Gene duplication contributes to increased genome complexity during evolution [42]. WGD, segmental duplication, and tandem duplication are common duplication events that drive gene family expansion. Many studies suggest that both recent and ancient polyploidization events have repeatedly occurred across all major land plant lineages, with WGD being particularly prevalent in angiosperms [43]. In this study, tandem duplication was identified as the major driving force behind the enlargement of the expansin family in chlorophyte algae, streptophyte algae, liverworts, mosses, lycophytes, ferns, and gymnosperms (Table S7). In contrast, in angiosperms, the expansin family expanded mainly through WGD/segmental duplication (Figure 2, Tables S5 and S6), indicating that WGD/segmental duplication had important consequences for gene family expansion during angiosperm evolution, aligning with previous findings [43].

Gene loss is important in green plant evolution. It is not only associated with gene redundancy but also with phenotypic diversity [44]. With the sequencing of numerous plant genomes, gene loss has been observed in various plant species, such as Cuscuta australis, three Brassica species [45,46]. By far, the role of the EXLB subfamily remains poorly understood in green plants. It has been reported that the entire EXLB subfamily was lost in three aquatic plant species, likely as an adaptation to their aquatic environment [47]. Among the eleven monocots studied, the complete absence of the EXLB subfamily was found only in four Gramineae crops: T. aestivum, H. vulgare, S. bicolor, and Z. mays (Figure 1). One reason for this loss may be the development of specialized physiological traits that are better suited to their living environment. Another possibility is that EXLB may have become dispensable for the growth and development of the four Gramineae crops. The underlying cause of EXLB loss, however, necessitates further exploration.

Gene duplication and loss can drive divergence in gene expression and promote functional diversification. Among the 16 species studied, with the exceptions of V. carteri and M. endlicherianum, expansin members exhibited different expression patterns (Figure 6). Expansin genes with high expression levels were considered to play important roles in regulating growth and development. Conversely, the 211 non-expressed expansin genes (FPKM = 0) were likely pseudogenes or may lack functionality in cell wall loosening. However, it is also possible that these genes could be expressed under specific conditions. They are thought to serve as a reservoir of genetic material, offering organisms adaptability to changing conditions and environmental pressures [48]. Additionally, a qRT-PCR analysis of various maize tissues confirmed the functional conservation and diversification of expansin genes during evolution (Figure 7).

In summary, our study reveals the early origin and evolutional divergence of the expansin family, while also uncovering the patterns of gene duplication and expression among plant expansins. This comprehensive analysis enhances our understanding of their functions from an evolutionary perspective.

4. Materials and Methods

4.1. Genome-Wide Identification of Expansin Genes among 64 Species

The genome files of 64 species were downloaded from various databases between 2022 and 2023 (Table S1). To identify expansin genes across these species, 35 expansin proteins from Arabidopsis thaliana (excluding the pseudogene AtEXPA19) were collected from TAIR (https://www.arabidopsis.org/index.jsp, accessed on 7 January 2022) and used as queries to search for homologous genes in the other 63 species via BLASP (v 2.10.1+) analysis. The parameters for BLASP were set to “sequence coverage >30%, identity >30%, and e-value < 1 × 10−3”, while other parameters were maintained at the default values [49]. Subsequently, hidden Markov model profiles for PF03330 (DPBB domain) and PF01357 (CBM63 domain) were obtained from the Pfam database (http://pfam.xfam.org/, accessed on 13 January 2022). HMMER 3.3.1 was used to detect protein domains with an e-value < 1 × 10−5 [50]. The protein domains were further searched using the NCBI CDD in default mode (https://www.ncbi.nlm.nih.gov/cdd/, accessed on 23 January 2022 and 8 July 2023) [51]. Only protein sequences containing both the DPBB and CBM63 domains were retained. Due to alternative splicing, some expansin genes had multiple transcripts. Thus, the proteins with the longest lengths were chosen, except for six proteins with appropriate lengths (Table S14). Moreover, eleven proteins with misannotated flanking regions were reannotated using the gene prediction software FGENESH (v2.6) and assigned new names (Table S15) [52]. Finally, 2461 expansin genes were identified for further analysis (Table S2). To visualize changes in the gene copy number across the 64 species, a species tree with divergence times was constructed using TimeTree 5 (http://www.timetree.org/, accessed on 10 July 2023) [34] and visualized using Evolview v3 [53].

4.2. Phylogenetic Analysis

A total of 2461 expansin protein sequences were aligned using the Muscle5 (v5.1) program with the parameters “muscle -align fasta.file -output aligned_fasta.file -perturb 0 -perm none -consiters 2 -refineiters 100” [54]. The phylogenetic tree was constructed using the IQ-TREE (v1.6.12) software with the best-fitting model (Q.pfam + R9) [55]. The parameters for IQ-TREE were set to “-m MFP -bb 1000 -bnni -nt AUTO -cmax 15 -redo”. The bootstrap value was 1000. The topology of the phylogenetic tree was visualized using iTOL v5 [56]. The R package spiralize (v1.1.0) was used to visualize the phylogenetic tree, displaying 2461 genes on the spiral [57].

4.3. Gene Structure and Sequence Conservation Analysis

The R package rtracklayer (v1.48.0) was used to obtain general feature format version 3 (GFF3) files from 57 species genomes [58]. The exon-intron structure information of the expansin genes was then extracted from these GFF3 files. The gene length, protein length, exon length (total exon length per gene), exon number, intron length (total intron length per gene), and intron number were calculated. The molecular weight and pI of the putative expansin proteins were calculated using the online tool, IPC 1.0 (http://isoelectric.org/, accessed on 29 January 2022 and 11 July 2023) [59]. Furthermore, the intron distribution within the DPBB and CBM63 domains was investigated as described by Xu et al. [60].

To investigate the conservation of the two domains, multiple sequence alignment was performed using Muscle5 (v5.1) [54]. Signal peptides and poorly conserved amino acids in the predicted expansin proteins were manually removed. The conserved amino acids within the protein sequences were visualized using the R package ggseqlogo (v0.1) [61].

4.4. Gene Duplication and Synteny Analysis

The chromosomal positions of the expansin genes were identified using GFF3 files. Tandem duplication genes were defined as genes separated by no more than one intervening gene on the same chromosome [49]. Following Maher et al. [62], 20 protein-coding genes located upstream and downstream of each expansin gene were extracted to determine large-scale duplication events. The collinearity among genomic regions containing 41 protein-coding genes was analyzed using MCScanX with default parameters [63].

4.5. Calculation of Ka/Ks Values

A multiple sequence alignment of 2461 expansin proteins was prepared for Ka/Ks estimation using Muscle5 (v5.1) and ParaAT (v2.0) [54,64]. The Ka/Ks ratio for each paralogous gene pair was then calculated with KaKs_Calculator 3.0 [65]. To avoid the risk of saturation effects, gene pairs with Ks values greater than 2.0 were discarded [66]. A Ka/Ks ratio greater than one indicates a positive or diversifying selection, while a ratio lower than one represents a negative or purifying selection. Ka/Ks = 1 indicates neutral evolution.

4.6. Transcriptome Data Analysis

Transcriptome data from sixteen species across ten lineages were downloaded from the NCBI SRA database (https://www.ncbi.nlm.nih.gov/sra/?term=, accessed on 5 March 2022 and 13 July 2023) (Table S12). For V. carteri, C. subellipsoidea, M. endlicherianum, and P. margaritaceum, whole plant (unicellular organism) was selected. In C. braunii and Azolla filiculoides, the thallus was chosen. For P. patens, Selaginella moellendorffii, A. filiculoides and P. abies, the protonema, vascular leaf, sporophytes, and needles were selected, respectively. In A. trichopoda, Nymphaea colorata, O. sativa, Z. mays, A. thaliana, and Gossypium raimondii, the leaf was selected. Quality control of the raw reads was performed using Trimmomatic-0.39 [67]. The clean reads were further aligned to the genomes using HISAT2 (Version 2.2.0) [68]. The gene expression levels were estimated using StringTie2 (Version 2.1.4) and normalized by FPKM [69]. A heatmap was drawn using shinyCircos-V2.0 based on the gene expression level data (FPKM) [70]. A species tree with divergence times was constructed using TimeTree 5 (http://www.timetree.org/, accessed on 18 July 2023) [34].

4.7. RNA Isolation and qRT-PCR Analysis

In this study, maize inbred B73 was used to analyze the expression patterns of expansin genes across five tissues: primary roots (8 days after sowing in a growth chamber), young leaves (18 days after sowing in a growth chamber), anthers (field-grown maize), silks (field-grown maize), and seeds (10 days after pollination in the field). Following the user guide, TRIzolTM Reagent (Catalog Number 15596026, Invitrogen, Carlsbad, CA, USA) was used to extract the total RNA from the five maize tissues. Subsequently, cDNA was synthesized from 1 μg of total RNA per sample using SuperScript™ IV (Catalog Number 18090010, Invitrogen, Carlsbad, CA, USA). qRT-PCR was performed on an ABI StepOne Plus Real Time PCR System (Applied Biosystems® Inc., Foster City, CA, USA) using MonAmpTM SYBR Green qPCR Mix (Catalog Number MQ10201S, Monad, Wuhan, China). ZmEF1α served as an internal control, and the primers used for qRT-PCR are listed in Table S16.

5. Conclusions

Expansin is a pivotal factor in loosening the cell wall in green plants. In this study, we examined the origin, evolution, and diversification of the plant expansin family at the genome level. Our findings indicate that plant expansin genes originated from chlorophyte algae, with subsequent expansion of the gene family occurring through WGD/segmental duplication and tandem duplication events. A gene structure analysis revealed relatively conserved intron distribution patterns across different subfamilies. A phylogenetic analysis supported the origin of expansins and the conservation of four subfamilies. The expression patterns of expansin genes across 16 species, along with qRT-PCR analysis in maize, indicated that the functional conservation and diversification of expansin genes were accompanied by gene expansion during evolution. This study elucidates the molecular evolution of plant expansins from a bioinformatics perspective, enhancing our understanding of their biological functions and cell wall evolution.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms252111814/s1.

Author Contributions

Y.L. conceived and designed this experiment. Z.W. and Y.L. collected the genomic data. Z.W., Y.L., J.C., N.L., J.L., Y.W., W.L. and W.Y. analyzed the data using bioinformatics and performed the experiments. Y.L. wrote the manuscript. All authors have read and agreed to the published version of the manuscript.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article or Supplementary Materials.

Conflicts of Interest

The authors declare no conflicts of interest.

Funding Statement

This research was supported by grants from the National Natural Science Foundation of China (32101745); the Research start-up fund to young talents of Henan Agricultural University (30500941); and the Key Specialized Research and Development Breakthrough Program in Henan Province (232102110237).

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

  • 1.Popper Z.A. Evolution and diversity of green plant cell walls. Curr. Opin. Plant Biol. 2008;11:286–292. doi: 10.1016/j.pbi.2008.02.012. [DOI] [PubMed] [Google Scholar]
  • 2.Lampugnani E.R., Flores-Sandoval E., Tan Q.W., Mutwil M., Bowman J.L., Persson S. Cellulose synthesis—Central components and their evolutionary relationships. Trends Plant Sci. 2019;24:402–412. doi: 10.1016/j.tplants.2019.02.011. [DOI] [PubMed] [Google Scholar]
  • 3.Purushotham P., Ho R., Zimmer J. Architecture of a catalytically active homotrimeric plant cellulose synthase complex. Science. 2020;369:1089–1094. doi: 10.1126/science.abb2978. [DOI] [PubMed] [Google Scholar]
  • 4.Zhang X., Xue Y., Guan Z., Zhou C., Nie Y., Men S., Wang Q., Shen C., Zhang D., Jin S., et al. Structural insights into homotrimeric assembly of cellulose synthase CesA7 from Gossypium hirsutum. Plant Biotechnol. J. 2021;19:1579–1587. doi: 10.1111/pbi.13571. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Zhang B., Gao Y., Zhang L., Zhou Y. The plant cell wall: Biosynthesis, construction, and functions. J. Integr. Plant Biol. 2021;63:251–272. doi: 10.1111/jipb.13055. [DOI] [PubMed] [Google Scholar]
  • 6.Kim S.-J., Chandrasekar B., Rea Anne C., Danhof L., Zemelis-Durfee S., Thrower N., Shepard Zachary S., Pauly M., Brandizzi F., Keegstra K. The synthesis of xyloglucan, an abundant plant cell wall polysaccharide, requires CSLC function. Proc. Natl. Acad. Sci. USA. 2020;117:20316–20324. doi: 10.1073/pnas.2007245117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Levesque-Tremblay G., Müller K., Mansfield S.D., Haughn G.W. Highly methyl esterified seeds is a pectin methyl esterase involved in embryo development. Plant Physiol. 2015;167:725–737. doi: 10.1104/pp.114.255604. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Dauphin B.G., Ranocha P., Dunand C., Burlat V. Cell-wall microdomain remodeling controls crucial developmental processes. Trends Plant Sci. 2022;27:1033–1048. doi: 10.1016/j.tplants.2022.05.010. [DOI] [PubMed] [Google Scholar]
  • 9.Cosgrove D.J. Building an extensible cell wall. Plant Physiol. 2022;189:1246–1277. doi: 10.1093/plphys/kiac184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Zhang Z., Huo W., Wang X., Ren Z., Zhao J., Liu Y., He K., Zhang F., Li W., Jin S., et al. Origin, evolution, and diversification of the wall-associated kinase gene family in plants. Plant Cell Rep. 2023;42:1891–1906. doi: 10.1007/s00299-023-03068-9. [DOI] [PubMed] [Google Scholar]
  • 11.Uluisik S., Seymour G.B. Pectate lyases: Their role in plants and importance in fruit ripening. Food Chem. 2020;309:125559. doi: 10.1016/j.foodchem.2019.125559. [DOI] [PubMed] [Google Scholar]
  • 12.McQueen-Mason S., Durachko D.M., Cosgrove D.J. Two endogenous proteins that induce cell wall extension in plants. Plant Cell. 1992;4:1425–1433. doi: 10.1105/tpc.4.11.1425. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Cosgrove D.J. Plant expansins: Diversity and interactions with plant cell walls. Curr. Opin. Plant Biol. 2015;25:162–172. doi: 10.1016/j.pbi.2015.05.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Wang T., Chen Y., Tabuchi A., Hong M., Cosgrove D.J. The target of β-expansin EXPB1 in maize cell walls from binding and solid-state NMR studies. Plant Physiol. 2016;172:2107–2119. doi: 10.1104/pp.16.01311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Sampedro J., Cosgrove D.J. The expansin superfamily. Genome Biol. 2005;6:242. doi: 10.1186/gb-2005-6-12-242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Cosgrove D.J. Diffuse Growth of Plant Cell Walls. Plant Physiol. 2018;176:16–27. doi: 10.1104/pp.17.01541. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Brasileiro A.C.M., Lacorte C., Pereira B.M., Oliveira T.N., Ferreira D.S., Mota A.P.Z., Saraiva M.A.P., Araujo A.C.G., Silva L.P., Guimaraes P.M. Ectopic expression of an expansin-like B gene from wild Arachis enhances tolerance to both abiotic and biotic stresses. Plant J. 2021;107:1681–1696. doi: 10.1111/tpj.15409. [DOI] [PubMed] [Google Scholar]
  • 18.Balestrini R., Cosgrove D.J., Bonfante P. Differential location of α-expansin proteins during the accommodation of root cells to an arbuscular mycorrhizal fungus. Planta. 2005;220:889–899. doi: 10.1007/s00425-004-1431-2. [DOI] [PubMed] [Google Scholar]
  • 19.Fudali S., Janakowski S., Sobczak M., Griesser M., Grundler F.M.W., Golinowski W. Two tomato α-expansins show distinct spatial and temporal expression patterns during development of nematode-induced syncytia. Physiol. Plant. 2008;132:370–383. doi: 10.1111/j.1399-3054.2007.01017.x. [DOI] [PubMed] [Google Scholar]
  • 20.Li Y., Tu L., Pettolino F.A., Ji S., Hao J., Yuan D., Deng F., Tan J., Hu H., Wang Q., et al. GbEXPATR, a species-specific expansin, enhances cotton fibre elongation through cell wall restructuring. Plant Biotechnol. J. 2016;14:951–963. doi: 10.1111/pbi.12450. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Calderini D.F., Castillo F.M., Arenas M.A., Molero G., Reynolds M.P., Craze M., Bowden S., Milner M.J., Wallington E.J., Dowle A., et al. Overcoming the trade-off between grain weight and number in wheat by the ectopic expression of expansin in developing seeds leads to increased yield potential. New Phytol. 2021;230:629–640. doi: 10.1111/nph.17048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Ramakrishna P., Ruiz Duarte P., Rance Graham A., Schubert M., Vordermaier V., Vu Lam D., Murphy E., Vilches Barro A., Swarup K., Moirangthem K., et al. EXPANSIN A1-mediated radial swelling of pericycle cells positions anticlinal cell divisions during lateral root initiation. Proc. Natl. Acad. Sci. USA. 2019;116:8597–8602. doi: 10.1073/pnas.1820882116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Su G., Lin Y., Wang C., Lu J., Liu Z., He Z., Shu X., Chen W., Wu R., Li B., et al. Expansin SlExp1 and endoglucanase SlCel2 synergistically promote fruit softening and cell wall disassembly in tomato. Plant Cell. 2024;36:709–726. doi: 10.1093/plcell/koad291. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Fan N., Xu Q., Yang Z., Zhuang L., Yu J., Huang B. Identification of expansin genes as promoting or repressing factors for leaf elongation in tall fescue. Physiol. Plant. 2023;175:e13861. doi: 10.1111/ppl.13861. [DOI] [PubMed] [Google Scholar]
  • 25.Liu B., Zhang B., Yang Z., Liu Y., Yang S., Shi Y., Jiang C., Qin F. Manipulating ZmEXPA4 expression ameliorates the drought-induced prolonged anthesis and silking interval in maize. Plant Cell. 2021;33:2058–2071. doi: 10.1093/plcell/koab083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Tan J., Wang M., Shi Z., Miao X. OsEXPA10 mediates the balance between growth and resistance to biotic stress in rice. Plant Cell Rep. 2018;37:993–1002. doi: 10.1007/s00299-018-2284-7. [DOI] [PubMed] [Google Scholar]
  • 27.Yang J., Zhang G., An J., Li Q., Chen Y., Zhao X., Wu J., Wang Y., Hao Q., Wang W., et al. Expansin gene TaEXPA2 positively regulates drought tolerance in transgenic wheat (Triticum aestivum L.) Plant Sci. 2020;298:110596. doi: 10.1016/j.plantsci.2020.110596. [DOI] [PubMed] [Google Scholar]
  • 28.Zhang B., Chang L., Sun W., Ullah A., Yang X. Overexpression of an expansin-like gene, GhEXLB2 enhanced drought tolerance in cotton. Plant Physiol. Biochem. 2021;162:468–475. doi: 10.1016/j.plaphy.2021.03.018. [DOI] [PubMed] [Google Scholar]
  • 29.Zhou J., Xie J., Liao H., Wang X. Overexpression of β-expansin gene GmEXPB2 improves phosphorus efficiency in soybean. Physiol. Plant. 2014;150:194–204. doi: 10.1111/ppl.12077. [DOI] [PubMed] [Google Scholar]
  • 30.Wang M., Yuan D., Gao W., Li Y., Tan J., Zhang X. A comparative genome analysis of PME and PMEI families reveals the evolution of pectin metabolism in plant cell walls. PLoS ONE. 2013;8:e72082. doi: 10.1371/journal.pone.0072082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Little A., Schwerdt J.G., Shirley N.J., Khor S.F., Neumann K., O’Donovan L.A., Lahnstein J., Collins H.M., Henderson M., Fincher G.B., et al. Revised phylogeny of the cellulose synthase gene superfamily: Insights into cell wall evolution. Plant Physiol. 2018;177:1124–1141. doi: 10.1104/pp.17.01718. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Gallinari R.H., Coletta R.D., Araújo P., Menossi M., Nery M.F. Bringing to light the molecular evolution of GUX genes in plants. Genet. Mol. Biol. 2020;43:e20180208. doi: 10.1590/1678-4685-gmb-2018-0208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Leebens-Mack J.H., Barker M.S., Carpenter E.J., Deyholos M.K., Gitzendanner M.A., Graham S.W., Grosse I., Li Z., Melkonian M., Mirarab S., et al. One thousand plant transcriptomes and the phylogenomics of green plants. Nature. 2019;574:679–685. doi: 10.1038/s41586-019-1693-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Kumar S., Suleski M., Craig J.M., Kasprowicz A.E., Sanderford M., Li M., Stecher G., Hedges S.B. TimeTree 5: An expanded resource for species divergence times. Mol. Biol. Evol. 2022;39:msac174. doi: 10.1093/molbev/msac174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Nishiyama T., Sakayama H., de Vries J., Buschmann H., Saint-Marcoux D., Ullrich K.K., Haas F.B., Vanderstraeten L., Becker D., Lang D., et al. The Chara genome: Secondary complexity and implications for plant terrestrialization. Cell. 2018;174:448–464.e424. doi: 10.1016/j.cell.2018.06.033. [DOI] [PubMed] [Google Scholar]
  • 36.Cheng S., Xian W., Fu Y., Marin B., Keller J., Wu T., Sun W., Li X., Xu Y., Zhang Y., et al. Genomes of subaerial Zygnematophyceae provide insights into land plant evolution. Cell. 2019;179:1057–1067.e1014. doi: 10.1016/j.cell.2019.10.019. [DOI] [PubMed] [Google Scholar]
  • 37.Lang D., Ullrich K.K., Murat F., Fuchs J., Jenkins J., Haas F.B., Piednoel M., Gundlach H., Van Bel M., Meyberg R., et al. The Physcomitrella patens chromosome-scale assembly reveals moss genome structure and evolution. Plant J. 2018;93:515–533. doi: 10.1111/tpj.13801. [DOI] [PubMed] [Google Scholar]
  • 38.Panchy N., Lehti-Shiu M., Shiu S.-H. Evolution of gene duplication in plants. Plant Physiol. 2016;171:2294–2316. doi: 10.1104/pp.16.00523. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Jia Q., Brown R., Köllner Tobias G., Fu J., Chen X., Wong Gane K.-S., Gershenzon J., Peters Reuben J., Chen F. Origin and early evolution of the plant terpene synthase family. Proc. Natl. Acad. Sci. USA. 2022;119:e2100361119. doi: 10.1073/pnas.2100361119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Soucy S.M., Huang J., Gogarten J.P. Horizontal gene transfer: Building the web of life. Nat. Rev. Genet. 2015;16:472–482. doi: 10.1038/nrg3962. [DOI] [PubMed] [Google Scholar]
  • 41.Chen R., Huangfu L., Lu Y., Fang H., Xu Y., Li P., Zhou Y., Xu C., Huang J., Yang Z. Adaptive innovation of green plants by horizontal gene transfer. Biotechnol. Adv. 2021;46:107671. doi: 10.1016/j.biotechadv.2020.107671. [DOI] [PubMed] [Google Scholar]
  • 42.Wang Y., Tan X., Paterson A.H. Different patterns of gene structure divergence following gene duplication in Arabidopsis. BMC Genom. 2013;14:652. doi: 10.1186/1471-2164-14-652. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Zhang Z., Ma X., Liu Y., Yang L., Shi X., Wang H., Diao R., Zhong B. Origin and evolution of green plants in the light of key evolutionary events. J. Integr. Plant Biol. 2022;64:516–535. doi: 10.1111/jipb.13224. [DOI] [PubMed] [Google Scholar]
  • 44.Albalat R., Cañestro C. Evolution by gene loss. Nat. Rev. Genet. 2016;17:379–391. doi: 10.1038/nrg.2016.39. [DOI] [PubMed] [Google Scholar]
  • 45.Sun G., Xu Y., Liu H., Sun T., Zhang J., Hettenhausen C., Shen G., Qi J., Qin Y., Li J., et al. Large-scale gene losses underlie the genome evolution of parasitic plant Cuscuta australis. Nat. Commun. 2018;9:2683. doi: 10.1038/s41467-018-04721-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Bayer P.E., Scheben A., Golicz A.A., Yuan Y., Faure S., Lee H., Chawla H.S., Anderson R., Bancroft I., Raman H., et al. Modelling of gene loss propensity in the pangenomes of three Brassica species suggests different mechanisms between polyploids and diploids. Plant Biotechnol. J. 2021;19:2488–2500. doi: 10.1111/pbi.13674. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Hepler N.K., Bowman A., Carey R.E., Cosgrove D.J. Expansin gene loss is a common occurrence during adaptation to an aquatic environment. Plant J. 2020;101:666–680. doi: 10.1111/tpj.14572. [DOI] [PubMed] [Google Scholar]
  • 48.Yang F., Luo J., Guo W., Zhang Y., Liu Y., Yu Z., Sun Y., Li M., Ma F., Zhao T. Origin and early divergence of tandem duplicated sorbitol transporter genes in Rosaceae: Insights from evolutionary analysis of the SOT gene family in angiosperms. Plant J. 2024;117:856–872. doi: 10.1111/tpj.16533. [DOI] [PubMed] [Google Scholar]
  • 49.Pei L., Zhang L., Li J., Shen C., Qiu P., Tu L., Zhang X., Wang M. Tracing the origin and evolution history of methylation-related genes in plants. BMC Plant Biol. 2019;19:307. doi: 10.1186/s12870-019-1923-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Potter S.C., Luciani A., Eddy S.R., Park Y., Lopez R., Finn R.D. HMMER web server: 2018 update. Nucleic Acids Res. 2018;46:W200–W204. doi: 10.1093/nar/gky448. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Lu S., Wang J., Chitsaz F., Derbyshire M.K., Geer R.C., Gonzales N.R., Gwadz M., Hurwitz D.I., Marchler G.H., Song J.S., et al. CDD/SPARCLE: The conserved domain database in 2020. Nucleic Acids Res. 2020;48:D265–D268. doi: 10.1093/nar/gkz991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Solovyev V., Kosarev P., Seledsov I., Vorobyev D. Automatic annotation of eukaryotic genes, pseudogenes and promoters. Genome Biol. 2006;7:S10. doi: 10.1186/gb-2006-7-s1-s10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Subramanian B., Gao S., Lercher M.J., Hu S., Chen W.-H. Evolview v3: A webserver for visualization, annotation, and management of phylogenetic trees. Nucleic Acids Res. 2019;47:W270–W275. doi: 10.1093/nar/gkz357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Edgar R.C. Muscle5: High-accuracy alignment ensembles enable unbiased assessments of sequence homology and phylogeny. Nat. Commun. 2022;13:6968. doi: 10.1038/s41467-022-34630-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Nguyen L.-T., Schmidt H.A., von Haeseler A., Minh B.Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015;32:268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Letunic I., Bork P. Interactive Tree Of Life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021;49:W293–W296. doi: 10.1093/nar/gkab301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Gu Z., Hübschmann D. spiralize: An R package for visualizing data on spirals. Bioinformatics. 2021;38:1434–1436. doi: 10.1093/bioinformatics/btab778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Lawrence M., Gentleman R., Carey V. rtracklayer: An R package for interfacing with genome browsers. Bioinformatics. 2009;25:1841–1842. doi: 10.1093/bioinformatics/btp328. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Kozlowski L.P. IPC—Isoelectric point calculator. Biol. Direct. 2016;11:55. doi: 10.1186/s13062-016-0159-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Xu Y., Zhang H., Zhong Y., Jiang N., Zhong X., Zhang Q., Chai S., Li H., Zhang Z. Comparative genomics analysis of bHLH genes in cucurbits identifies a novel gene regulating cucurbitacin biosynthesis. Hortic. Res. 2022;9:uhac038. doi: 10.1093/hr/uhac038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Wagih O. ggseqlogo: A versatile R package for drawing sequence logos. Bioinformatics. 2017;33:3645–3647. doi: 10.1093/bioinformatics/btx469. [DOI] [PubMed] [Google Scholar]
  • 62.Maher C., Stein L., Ware D. Evolution of Arabidopsis microRNA families through duplication events. Genome Res. 2006;16:510–519. doi: 10.1101/gr.4680506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Wang Y., Tang H., DeBarry J.D., Tan X., Li J., Wang X., Lee T.-h., Jin H., Marler B., Guo H., et al. MCScanX: A toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40:e49. doi: 10.1093/nar/gkr1293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Zhang Z., Xiao J., Wu J., Zhang H., Liu G., Wang X., Dai L. ParaAT: A parallel tool for constructing multiple protein-coding DNA alignments. Biochem. Biophys. Res. Commun. 2012;419:779–781. doi: 10.1016/j.bbrc.2012.02.101. [DOI] [PubMed] [Google Scholar]
  • 65.Zhang Z. KaKs_Calculator 3.0: Calculating selective pressure on coding and non-coding sequences. Genom. Proteom. Bioinform. 2022;20:536–540. doi: 10.1016/j.gpb.2021.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Blanc G., Wolfe K.H. Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell. 2004;16:1667–1678. doi: 10.1105/tpc.021345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Bolger A.M., Lohse M., Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Kim D., Langmead B., Salzberg S.L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods. 2015;12:357–360. doi: 10.1038/nmeth.3317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Kovaka S., Zimin A.V., Pertea G.M., Razaghi R., Salzberg S.L., Pertea M. Transcriptome assembly from long-read RNA-seq alignments with StringTie2. Genome Biol. 2019;20:278. doi: 10.1186/s13059-019-1910-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Wang Y., Jia L., Tian G., Dong Y., Zhang X., Zhou Z., Luo X., Li Y., Yao W. shinyCircos-V2.0: Leveraging the creation of Circos plot with enhanced usability and advanced features. iMeta. 2023;2:e109. doi: 10.1002/imt2.109. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

Data are contained within the article or Supplementary Materials.


Articles from International Journal of Molecular Sciences are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES