Abstract
While roots and leaves have evolved independently in lycophytes, ferns and seed plants, there is still confusion regarding the morphological evolution of ferns, especially in whisk ferns, which lack true leaves and roots and instead only exhibit leaf-like appendages and absorptive rhizoids. In this study, analyses of comparative transcriptomics on positively selected genes were performed to provide insights into the adaptive evolution of whisk fern morphologies. Significantly clustered gene families specific to whisk ferns were mainly enriched in Gene Ontology (GO) terms “binding proteins” and “transmembrane transporter activity”, and positive selection was detected in genes involved in transmembrane transporter activities and stress response (e.g., sodium/hydrogen exchanger and heat shock proteins), which could be related to the adaptive evolution of tolerance to epiphytic environments. The analysis of TF/TR gene family sizes indicated that some rapidly evolving gene families (e.g., the GRF and the MADS-MIKC families) related to the development of morphological organs were commonly reduced in whisk ferns and ophioglossoid ferns. Furthermore, the WUS homeobox-containing (WOX) gene family and the knotted1-like homeobox (KNOX) gene family, both associated with root and leaf development, were phylogenetically conserved in whisk ferns and ophioglossoid ferns. In general, our results suggested that adaptive evolution to epiphytic environments might have occurred in whisk ferns. We propose that the simplified and reduced leaf and root system in whisk ferns is the result of reduction from the common ancestor of whisk ferns and ophioglossoid ferns, rather than an independent origin.
Keywords: whisk ferns, transcriptome, positive selection, morphological evolution, root, leaf
1. Introduction
Leaves, roots and stems are the three major vegetative organs of vascular plants, mainly functional for photosynthesis, absorption and anchoring, respectively [1]. Leaves and roots evolved multiple times over plant evolutionary history [2,3,4,5,6,7,8,9]. Fossil evidence and studies on developmental pathways indicated several independent origins of tracheophytes [4,5,6]. Roots had evolved firstly in lycophytes in a bifurcating form in the Early Devonian [5,7,9]. The second origin event occurred independently in the ancestor of euphyllophytes in the Middle Devonian, which resulted in the genesis of adventitious roots and lateral roots in the monilophytes [5,7,9]. The third root-origin event occurred in the seed plants, resulting in primary roots in addition to adventitious and lateral roots [5,10]. On the other hand, based on fossil evidence and the distinct developmental mechanisms of extant vascular plants, it is now accepted that leaves originated in lycophytes and euphyllophytes independently, producing microphylls and megaphylls, respectively [2,3,9]. In addition, the comparison between fossils of the most ancient ferns and modern seed plants suggested that the clades following parallel traces of evolution may also be of independent origins [8].
Although the key functional organs such as roots and leaves have definitely evolved in Paleozoic vascular plants, their structures can be highly variable, as a result of special functional requirements and adaption to specific environments. Psilotaceae (whisk ferns), including Psilotum and Tmesipteris, do not produce leaves and roots, and only exhibit leaf-like appendages and absorptive rhizoids [11,12] (Figure 1). Due to the lack of Mesozoic and Paleozoic fossil records, the origin of the two plant groups, Ophioglossales (ophioglossoid ferns) and the Psilotales, is still unclear, and whether the morphological differences are caused by independent origins or transformations that postdate a shared origin is unknown [12,13]. Schneider proposed that the body plan of whisk ferns may have evolved through a secondary reduction from the common ancestor with Ophioglossales rather than primary absence [12]. Nonetheless, the key developmental mechanism of whisk ferns in leaves and roots is poorly understood and based on limited evidence.
Figure 1.
The distinct morphologies of whisk ferns and ophioglossoid ferns. (A,B), the whisk ferns, Psilotum nudum (A) and Tmesipteris tannensis (B) are characterized by the lack of roots, the reduction of leaves, the leaf-like appendages and the absorptive rhizoids. (C,D), the ophioglossoid ferns, Ophioglossum vulgatum (C) and Botrychium japonicum (D), are characterized by simple or pinnatifid leaves and a reduced root system.
Ophioglossales and whisk ferns are phylogenetically close and both have simplified leaf and root structures [12,14], which makes them good models for studying species with distinct organ structure. Comparative analysis of gene families, especially transcription factors (TFs) and transcriptional regulators (TRs), which play an important role in plant growth and organ development, has been widely used in identification of genes of interest [15]. In addition, as an important component of comparative genomics, phylogenetic analysis could provide insight into the functional differentiation of genes; for example, the WUS homeobox-containing (WOX) gene family and the knotted1-like homeobox (KNOX) gene family function in root initiation and leaf development, respectively [16,17,18,19]. Furthermore, selective pressure analysis of orthologous genes could provide important information on how whisk ferns proceed with structural simplifications by adaptive evolution under epiphytic environments.
In this study, we conducted phylogenetic analyses on single-copy protein-coding genes from 11 species, including Physcomitrium patens, Selaginella moellendorffii, Arabidopsis thaliana, Equisetum arvense, Angiopteris fokiensis, Ophioglossum pendulum, O. vulgatum, Botrychium japonicum, Psilotum nudum, Tmesipteris tannensis and Dipteris chinensis. By conducting comparative and evolutionary transcriptomics analyses, including the detection of lineage-specific gene families, genes under positive selection on different branches, TF/TR variation and phylogenetic reconstruction of gene families, we aimed to acquire insights into the genetic basis of adaptive evolution to epiphytic environments and of the highly reduced body plans of whisk ferns at the genomic level.
2. Results
2.1. Assessment of Transcriptomes and Homolog Clusters
In total, genomic data were collected for 11 species, of which three were downloaded from the phytozome (v13) database, five were retrieved from Sequence Read Archive (SRA), and the remaining three transcriptomes (D. chinensis, O. pendulum and O. vulgatum) were newly sequenced in this study (Supplementary Table S1). The GC content of each sample was 44.11–52.63% with a contig N50 between 1056–1698 bp. The completeness of acquired data was assessed by estimating the coverage of the gene space based on gene mapping to the core plant genes’ datasets, all of which supported a high quality of the assemblies: (1) 70.2–99.7% of conservatized complete genes, (2) 55.8–99.5% of complete single-copy genes. Only single-copy genes between 0.1–16.5% were classified as missing, indicating fine coverage and high quality of the peptides for these species. The number of non-redundant protein sequences in each species ranged from 18,850 in O. pendulum to 50,890 in B. japonicum. A total of 348,947 non-redundant protein sequences were sorted into 30,051 orthogroups, which covered 82.3% of genes in all non-redundant sequences. Among the 30,051 orthogroups, 65 were single-copy orthogroups.
2.2. Comparative Transcriptome Analyses and Functional Enrichment
Comparison of gene families between whisk ferns (P. nudum and T. tannensis) and ophioglossoid ferns (O. pendulum, O. vulgatum and B. japonicum) revealed 6998 core orthologs (shared by all five species) (Figure 2). In contrast, 1323 and 1588 gene families were lineage-specific to P. nudum and T. tannensis, respectively. GO functional annotation and enrichment were conducted on the lineage-specific gene families. The top 20 enriched GO terms were selected and are shown in Supplementary Figures S1 and S2. These GO terms constituted genes for binding proteins (e.g., GO:0003730, GO:2001070, GO:0001871) and transmembrane transporters (e.g., GO:0080161, GO:0015171, GO:0010328), which potentially play important roles in the defense of limited water and poor nutrient supply of epiphytes.
Figure 2.
Venn diagram showing unique and shared gene families among members of whisk ferns (Psilotum nudum and Tmesipteris tannensis) and ophioglossoid ferns (Ophioglossum vulgatum, O. pendulum and Botrychium japonicum). The number in each region represents the number of gene families belonging to that category.
2.3. Phylogeny Reconstruction and Analysis on TF/TR Variation Related to Morphological Development
For each of the 65 single-copy gene families, alignment was conducted on protein sequences from the 11 plant species. The resultant 65 alignments were concatenated to generate a supermatrix, which was used for phylogeny analysis. The maximum likelihood species tree illustrated a well-resolved phylogeny with most nodes having a 100% bootstrap value (BSV) except two nodes with a BSV at 99% and 76%, respectively (Supplementary Figure S3). As expected, the whisk fern clade consisting of P. nudum and T. tannensis was clustered with the ophioglossoid fern clade consisting of O. pendulum, O. vulgatum and B. japonicum.
Based on the constructed species tree, a computational analysis of TF/TR family sizes was conducted (Supplementary Table S2) to study the expansion and contraction of TF/TR gene families involved in morphological development during the evolution of whisk ferns and their related species (Figure 3 and Supplementary Table S3). The result showed that 26 and 47 TF/TR gene families were under expansion in P. nudum and T. tannensis, respectively, compared to 12, 10 and 71 in O. pendulum, O. vulgatum and B. japonicum, respectively. On the other hand, 37 and 20 TF/TR gene families were under contraction in P. nudum and T. tannensis, respectively, compared to 44, 43 and 9 in O. pendulum, O. vulgatum and B. japonicum, respectively. To explore the most significantly changed TF/TR involved in morphological development, TF/TR gene families with a rapid evolution were selected and are listed in Table 1. Some TF/TR gene families involved in morphological development were found to reduce in family size in whisk ferns. For example, the GRF gene family, whose expression was strongly upregulated in actively growing and developing tissues such as shoot tips, flower buds and roots, had significant contraction in both P. nudum and T. tannensis. As another example, the MADS-MIKC gene family, type II of MADS-box genes in plants, was reduced by five in P. nudum. Interestingly, the family sizes of GRF and MADS-MIKC were also reduced in O. pendulum.
Figure 3.
Phylogenetic tree showing the evolution of TF/TR gene family sizes in 11 plant species. The whisk fern clade consisting of Psilotum nudum and Tmesipteris tannensis is clustered with the ophioglossoid fern clade consisting of Ophioglossum pendulum, O. vulgatum and Botrychium japonicum.
Table 1.
A summary of rapidly evolved TF/TR families identified by CAFÉ.
| Species | Rapidly Evolved TF/TR Families |
|---|---|
| Physcomitrium patens | null |
| Selaginella moellendorffii | null |
| Arabidopsis thaliana | MADS-MIKC [+25 *] |
| Equisetum arvense | AUX-IAA [+14 *], HB-BELL [+12 *], SBP [+16 *], GATA-Tify [+17 *] |
| Angiopteris fokiensis | BES1 [+8 *], HD-ZIP [+16 *], Jumonji [+12 *], MADS-MIKC [+28 *], TRAF [+25 *] |
| Tmesipteris tannensis | Dof [+7 *], GATA-Tify [+9 *], BES1 [−4 *], GRF [−3 *] |
| Psilotum nudum | GRF [−5 *], Dof [−8 *], MADS-MIKC [−5 *] |
| Botrychium japonicum | BES1 [+6 *], GARP-G2-like [+15 *], HB-other [+11 *], IWS1 [+10 *], MYB-related [+17 *], TRAF [+17 *] |
| Ophioglossum vulgatum | null |
| Ophioglossum pendulum | GRF [−3 *], HB-other [−6 *], MADS-MIKC [−5 *] |
| Dipteris chinensis | Dof [−15 *], GARP-G2-like [−17 *], HB-HD-ZIP [−14 *], Jumonji [−11 *], MYB-related [−22 *], SNF2 [−20 *], TRAF [−14 *] |
Note: The [+number] indicates the number of gained genes, the [−number] indicates the number of lost genes, and * indicates a rapid change compared to other changes in that gene family in other lineages. The change information of whisk ferns and ophioglossoid ferns is highlighted.
2.4. Phylogenetic Analysis of WOX and KNOX Genes
Whisk ferns are characterized by rootless and leaf-like appendages, and applying phylogenetic strategies, especially from sequences of gene families involved in root initiation and leaf development, will help us understand the evolutionary history of the root and leaf. Based on the sequences of WOX and KNOX proteins, evolutionary trees of multiple land plants including whisk ferns were constructed (Figure 4). The phylogeny of WOX genes was divided into three subfamilies, the ancient-clade-WOX (AC-WOX), the WUS-clade-WOX (WC-WOX) and the intermediate-clade WUSCHEL-RELATED HOMEOBOX (IC-WOX). In the WC-WOX subfamily, the homologous genes of whisk ferns (P. nudum) and ophioglossoid ferns (B. japonicum and O. vulgatum) were clustered into a clade (Figure 4A). According to previous studies, the plant KNOX proteins could be divided into three subfamilies, Class I, Class II and KNATM [20]. The KNOX tree generated by this study also had three branches representing the three subfamilies (Figure 4B). The homologous genes of whisk ferns (P. nudum) and ophioglossoid ferns (O. vulgatum) were clustered into a clade in the Class I subfamily (Figure 4B).
Figure 4.
Grouping of WOX and KNOX homologs using the neighbor-joining method. (A) A collection of 50 WOX proteins from 9 distinctive plant species. The WC-WOX sequences of whisk ferns and ophioglossoid ferns are highlighted. (B) A collection of 58 KNOX proteins from 9 distinctive plant species. The class I sequences of whisk ferns and ophioglossoid ferns are highlighted.
2.5. Genes under Positive Selection Detected in Whisk Ferns
To check whether P. nudum and T. tannensis had the same adaptive evolution, genes under positive selection were detected in the branch of P. nudum and the branch of the common ancestor of P. nudum and T. tannensis (Figure 5). Using the branch-site model of PAML, positive selection was detected in 803 single-copy orthologs. When P. nudum was labeled as a foreground alone, positive selection was detected in 74 genes. Among these 74 genes, 62 were annotated, with most of them carrying functions involved in transmembrane transporter activities and stress response (e.g., sodium/hydrogen exchanger and heat shock proteins) (Supplementary Table S4). In the common ancestral branch of P. nudum and T. tannensis, positive selection was detected in 79 genes. Among the 79 genes, 68 were annotated, with most of them carrying functions involved in transmembrane transporter activity and stress response as well (e.g., BT1 protein, Mpv17/PMP22 protein and proteins for chromosome structure maintenance) (Supplementary Table S5). Though the numbers and functions of positive selection genes were not identical, the majority of these genes were involved in cellular response to water and nutrient deficiency caused by the epiphytic environment, secondary wall biosynthesis and DNA damage repair.
Figure 5.
The phylogenetic relationships of whisk ferns and ophioglossoid ferns used to detect genes under positive selection. (A) Selection pressure analysis where Psilotum nudum was labeled as the foreground. (B) Selection pressure analysis where the common ancestral branch of whisk ferns was labeled as the foreground. Branches in red color indicate foreground branches for detection of genes under positive selection. The color of the circle represents the ecological habit of each species, with green for epiphytic habit and blue for geophytic habit.
3. Discussion
Early understanding of the origin of whisk ferns was based on the well-preserved fossils of rhyniophytes, in which whisk ferns were regarded as direct descendants of the rhyniophytes due to their root-less and leaf-less morphologies [21,22]. However, both DNA barcoding and phylogenomic analyses indicated that whisk ferns are sisters of Ophioglossales [14,23]. In this study, we performed analyses on comparative transcriptomics and positively selected genes to illustrate the adaptive evolution of whisk ferns in epiphytic environments and provided insights into the origin of the simplified root and leaf of whisk ferns.
Significant numbers of unique genes in whisk ferns (P. nudum and T. tannensis) were enriched in the GO terms’ “binding proteins”, such as hormone binding, auxin binding and kinase binding, as well as “transmembrane transporter activity”, such as auxin transmembrane transporter activity, inorganic anion transmembrane transporter activity and phosphate ion transmembrane transporter activity (Supplementary Figures S1 and S2). Previous studies indicated that whisk ferns are often epiphytic, with rhizomes having symbiotic mycorrhizal associations with other plants [21,24]. Due to the arboreal habitation and direct interaction with the atmosphere interface, epiphytes are subjected to multiple abiotic and biotic stresses [25,26]. For example, living in the canopy environments, the major challenges to the survival and growth of epiphytes are water deficit and obtaining nourishment directly from the environment [26,27]. Thus, genes related to transmembrane transporter and binding protein activities might contribute to adaptive evolution for water deficit stress, nutritional stress and other abiotic as well as biotic stress.
Constructing a stable phylogenetic topology structure is the basis for exploring how plant evolution related to morphological innovations was driven by alterations in the TF/TR gene families [28]. In this study, we performed a reconstruction of TF/TR to identify TF/TR genes that had changed corresponding to morphological development in whisk ferns and ophioglossoid ferns. Our results showed that the family sizes of some TF/TR related to morphological development (e.g., the GRF family and the MADS-MIKC family) were reduced in epiphytic whisk ferns (P. nudum and T. tannensis) and epiphytic O. pendulum (Table 1). Therefore, we hypothesized that the root-less and leaf-less whisk ferns and the simplified ophioglossoid ferns had a similar adaptive evolutionary trajectory that could be traced. Of the TF/TR genes with rapid evolution, the GRF family, one of the plant-specific TF families, plays an important regulatory role in cotyledon and leaf growth [29,30]. In Arabidopsis, overexpression of two of the nine GRFs resulted in larger leaf size; in contrast, insertional null mutations in three GRFs resulted in smaller leaves and cotyledons due to a decrease in cell size [30]. MADS-MIKC, type II of the MADS-box TF family, has been divided into a range of subfamilies such as ANR1, AGL12, SVP, TM3 and FLC [31,32]. Specifically, the AGL12 genes involved in root cell differentiation and the ANR1 genes involved in regulating lateral root development might have experienced a reduction in epiphytes compared to terrestrials [33,34]. We also found significant differences between whisk ferns and ophioglossoid ferns in gene families including Dof, GATA-Tify and BES1. It is worth noting that a similar adaptive evolution mechanism for root and leaf reduction in whisk ferns and ophioglossoid ferns does not imply that all TF/TR genes had the same evolutionary trajectories.
Gene expression and protein functional studies indicated that the WOX family members fulfilled specialized functions in the processes of plant root formation [35]. IC-WOX and WC-WOX were considered to be important participants in the process of root organogenesis in the common ancestor of ferns and seed plants [5]. In our analysis, homologous WOX genes were clustered with whisk ferns (P. nudum) and ophioglossoid ferns (B. japonicum and O. vulgatum) in a clade in the WC-WOX subfamily, suggesting the conservation of genes related to root formation in whisk ferns and ophioglossoid ferns, and supporting the hypothesis that roots of whisk ferns and ophioglossoid ferns had a common origin. The KNOX family, including three subfamilies, Class I, Class II and KNATM, is a group of TFs that play important roles in leaf and meristem development [17,20]. In this study, phylogeny based on the KNOX gene family gathered the simple leaf-shaped whisk ferns and ophioglossoid ferns together and clearly distinguished them from other plants with compound leaves such as A. fokiensis and A. thaliana (Figure 5B), which also indicates that the evolution of the KNOX family was conserved in whisk ferns and ophioglossoid ferns. In general, the evolutionary conservation of WOX and KNOX elucidates a common origin of whisk ferns and ophioglossoid ferns.
We hypothesized that adaptive evolution to epiphytism increased tolerance to water and nutrient deficits. Thus, genes under positive selection were detected in the ancestral branch of whisk ferns (P. nudum and T. tannensis) and the branch of P. nudum, to reveal whether adaptive evolution to epiphytism occurred within the two branches. Positive selection was detected in 79 and 74 genes in the branches of whisk ferns and P. nudum, respectively. Some of the annotated genes identified from the two branches are involved in transmembrane transporter activity and stress response (Supplementary Tables S4 and S5). Whisk ferns inhabiting epiphytic and exposure environments could be exposed to heat stress, water and nutrient deficits. In this way, genes subjected to positive selection and carrying transmembrane transporter activities might contribute to adaptive evolution to the epiphytic environment. In addition, several genes under positive selection were commonly detected from both the ancestral branch of whisk ferns and the branch of P. nudum. These genes are also functional in response to epiphytic environments. For example, the UDP-glycosyltransferase has been found to play an important role in cell wall pentose biosynthesis and secondary wall biosynthesis [36,37], which helps rhizome and aerial branch development to adapt to specific epiphytic habitats.
Naturally, it cannot be denied that there are still some limitations in this research, and further research and discussion are needed. For example, this paper only considers the expressed genes for the species whose transcriptomes were sequenced, but in real life, RNAseq data could very easily miss genes if the RNA for a given gene is not expressed under the growth conditions or in the tissues used for RNA analysis. In addition, although one replicate per sample is reasonable for phylogenetic analysis in this research, it could result in incomplete or missing data for comparative transcriptome analysis. Moreover, in the absence of other valid validations, there is still uncertainty about GO annotations of gene functions with reference to EggNOG database. Nevertheless, our findings still preliminarily illustrate adaptive evolutionary patterns of whisk ferns in leaf-less and root-less species. Therefore, more research is still needed, and more genomic data needs to be collected to establish a trend given the uncertainties in the experiments.
4. Materials and Methods
4.1. Transcriptome Data Collection and BUSCO Analysis
Transcriptome or genome data from 11 species, including whisk ferns and ophioglossoid ferns, along with other related species that occupy distinctive phylogenetic positions, were obtained as described below (Supplementary Table S1). The whole protein sequences of P. patens, S. moellendorffii and A. thaliana were downloaded from the Phytozome (v.13) database [38], and the raw reads of the transcriptomic sequences of E. arvense, A. fokiensis, P. nudum, T. tannensis and B. japonicum were downloaded from the National Center for Biotechnology Information (NCBI) database [23,39]. The other three field-collected adult species, including terrestrial D. chinensis, epiphytic O. pendulum and terrestrial O. vulgatum, were newly sequenced in this study. For transcriptome sequencing, the total plant transcriptomic RNA was isolated from mixed fresh tissues of leaf, stem and rhizome using the Tiangen Polysaccharide&Polyphenolics-rich RNAprep Pure Plant Kit. Illumina library preparation, and RNA-sequencing was performed on the Majorbio next-generation sequencing (NGS) platform (Shanghai, China). To standardize, all the raw reads were cleaned using Trimmomatic (v.0.36) [40] and de novo assembled using Trinity (v.2.6.6) [41]. Coding sequences (CDSs) of the transcripts and amino acid sequences of the encoded proteins were predicted by TransDecoder (v.5.50) (https://github.com/TransDecoder, accessed on 10 August 2021) with default parameters. The cd-hit (v.4.6.2) [42] software was used to remove redundantly assembled transcripts with a sequence identity threshold at 98% and a minimum coverage at 90%.
The genomic assembly completeness of the 11 species in total were assessed by estimating the coverage of the gene space based on blast against the core plant homologous gene database (www.orthodb.org, accessed on 5 August 2021).
4.2. Phylogenetic Analysis
The OrthoFinder (v.2.2.6) [43] software was used to cluster transcripts into orthologous genes (OGs), which infer core-orthogroups based on all-against-all BLASTP (v.2.9.0+) [44] searches with an e-value cutoff at 10−5. In total, 65 single-copy OGs were identified. Orthologs in each of the 65 single-copy OGs were aligned using MAFFT (v.7.471) [45]. For the concatenation analysis, the Utensils phylogenetic tool (https://github.com/ballesterus/Utensils, accessed on 21 October 2021) was used to generate a supermatrix from the alignments. A phylogenetic tree was reconstructed from the supermatrix using IQ-TREE (v.2.0.3) [46] with 1000 bootstrapping repeats and using -m MFP parameter for automatically extended model selection followed by tree inference. The resultant maximum likelihood tree was subjected to subsequent analyses.
4.3. Comparative Transcriptome Analysis
Using the OrthoVenn2 web platform [47], all protein sequences from the 5 species were compared. The output of OrthoVenn2 was an interactive occurrence pattern table, and within the occurrence results, the functional annotations and summaries of the disjunctions and intersections of clusters between the species were displayed through a Venn diagram. The gene ontology (GO) term enrichment was performed using the GO enrichment tool included in the OrthoVenn2 platform. Additionally, the iTAK (v.18.12) program [15] was used for prediction and classification of plant TFs and TRs from all the protein sequences. Based on the constructed evolutionary tree, expansion and contraction of TFs and TRs relative to the most recent common ancestor were analyzed using the CAFÉ (v.4.2.1) program [48]. The value of the birth and death parameter (λ) was 0.00253807 and the p-value was 0.01.
4.4. Identification and Gene Tree Reconstruction of the WOX and KNOX Families
Two methods, BLASTP based on the protein homology alignment and hidden Markov model (HMM) search based on the HMM files, were used to identify the WOX and KNOX proteins. BLASTP and HMM search was conducted using local BLAST (v.2.9.0+) and HMMER (v.3.2.1), respectively. Arabidopsis WOX and KNOX proteins were obtained from the TAIR database (https://www.arabidopsis.org, accessed on 13 November 2021) and used to construct a local BLAST database, against which BLASTP was conducted using the protein sequences from the 11 species as queries with a cut-off e-value equaling 10−5. Domain PF00046 (homeobox domain) from the Pfam database (http://pfam.xfam.org, accessed on 13 November 2021) was used for HMM search with an e-value at 10−5. Then, the NCBI Conserved Domain Database and SMART (the Simple Modular Architecture Research Tool; http://smart.embl-heidelberg.de, accessed on 13 November 2021) were used to confirm the putative proteins. To study the evolutionary relationship between species based on the sequences of WOX and KNOX proteins, an evolutionary tree was constructed using sequences of A. thaliana and related species by the PhyloSuite (v.1.2.2) [49] platform, with the following built-in software with default parameters: MAFFT for the alignment of all identified sequences; ModelFinder for the automatic detection of evolutionary models; and IQ-TREE for the construction of unrooted maximum-likelihood phylogenetic trees.
4.5. Detection of Genes under Positive Selection
Presence of positive selection was detected using the CodeML program of the PAML software package [50], which implements the branch site model. CodeML computes ω, the ratio of nonsynonymous over synonymous substitution rates (dN/dS). In this study, two models were fitted and tested based on the null and alternative hypotheses of sequence evolution. The null hypothesis assumes that the gene evolved neutrally and ω is expected to have a value of 1 in all branches (null model: model = 2, NSsites = 2, fix_omega = 0, omega = 1), while the alternative hypothesis assumes that the gene is under positive selection and ω value is greater than 1 in the foreground branch (model A: model = 2, NSsites = 2, fix_omega = 1, omega = 1). For each of the two models, goodness of fit between the expected and observed ω was determined by the p-values of chi-squared test. Based on their annotation, the positively selected genes were assigned to a Cluster of Orthologous Groups (COG) category by searching the COG EggNOG database [51], and the enrichment analysis was performed using the OmicShare tool, a free online platform for data analysis (https://www.omicshare.com/tools, accessed on 8 December 2021).
5. Conclusions
In this study, we performed analyses on comparative transcriptomics and positive selection of genes to explore the adaptive evolution of whisk ferns. We identified gene families specific to whisk ferns and genes showing signs of adaption to epiphytic environments. The gene family size of TF/TR related to morphological development (e.g., the GRF family and the MADS-MIKC family) was reduced in epiphytic whisk ferns (P. nudum and T. tannensis) and epiphytic O. pendulum. Phylogenetic analysis revealed that both the WOX and KNOX gene families had a conserved evolutionary history in whisk ferns and ophioglossoid ferns. Genes under positive selection were detected in the ancestral branch of whisk ferns and the branch of P. nudum, and these genes were mainly involved in the stress response (e.g., heat shock proteins and proteins for chromosome structure maintenance) and transmembrane transporter activities (e.g., BT1 protein and sodium/hydrogen exchanger), which suggested that the positively selected changes on these genes might be associated with adaptation to epiphytic habitats and nutrient deficiency. Overall, we proposed that the simplified and reduced leaves and roots in whisk ferns are the result of reduction from the common ancestor of whisk ferns and ophioglossoid ferns, rather than an independent origin, and that the adaptation to the epiphytic environment in these ferns could be assessed at the gene level.
Acknowledgments
We thank Lijun Chen, Xinyi Wu, and Guohua Zhao for helping with transcriptome material collection and visualization. We thank Chengwei Chen for providing the images of whisk ferns, and thank Xile Zhou for providing the images of ophioglossoid ferns. Informed consent was obtained from all subjects involved in the study. We appreciate the editors and anonymous reviewers for carefully correcting the manuscript.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants11091198/s1, Figure S1: The GO enrichment of P. nudum specific gene families; Figure S2: The GO enrichment of T. tannensis specific gene families; Figure S3: Phylogenetic tree constructed from single-copy orthogroups; Table S1: A summary of the sequencing, assembly, BUSCO assessment and related data included in the phylogenomic analysis; Table S2: A summary of the number of TF/TR; Table S3: Summary of variations in the number of TF/TR; Table S4: The results of paml analyses for genes, where positive selection was detected in the branch of P. nudum; Table S5: The results of paml analyses for genes, where positive selection was detected in the common ancestral branch of P. nudum and T. tannensis.
Author Contributions
Y.Y. and H.S. managed the project; Z.X., Z.W. and L.L. conceived this research; Z.X., L.L. and F.W. collected the plant material; Z.X. and Z.W. sequenced the raw data; Z.X. performed the experiments and analyzed the data; Z.X. wrote the manuscript; F.W. and H.S. modified the manuscript. All authors contributed to the revision of the manuscript. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The raw sequence data reported in this paper have been deposited in the SRA under Bioproject number PRJNA762181.
Conflicts of Interest
The named authors declare that there is no conflict of interest, financial or otherwise.
Funding Statement
This research was funded by the Strategic Priority Research Program of the Chinese Academy of Sciences (No. XDA19050404).
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Haberlandt G. Physiological Plant Anatomy. Macmillan; London, UK: 1914. [Google Scholar]
- 2.Boyce C.K., Knoll A.H. Evolution of developmental potential and the multiple independent origins of leaves in Paleozoic vascular plants. Paleobiology. 2002;28:70–100. doi: 10.1666/0094-8373(2002)028<0070:EODPAT>2.0.CO;2. [DOI] [Google Scholar]
- 3.Floyd S.K., Bowman J.L. Distinct developmental mechanisms reflect the independent origins of leaves in vascular plants. Curr. Biol. 2006;16:1911–1917. doi: 10.1016/j.cub.2006.07.067. [DOI] [PubMed] [Google Scholar]
- 4.Kenrick P., Strullu-Derrien C. The origin and early evolution of roots. Plant Physiol. 2014;166:570–580. doi: 10.1104/pp.114.244517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Liu W., Xu L. Recruitment of IC-WOX genes in root evolution. Trends Plant Sci. 2018;23:490–496. doi: 10.1016/j.tplants.2018.03.011. [DOI] [PubMed] [Google Scholar]
- 6.Pires N.D., Dolan L. Morphological evolution in land plants: New designs with old genes. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2012;367:508–518. doi: 10.1098/rstb.2011.0252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Raven J.A., Edwards D. Roots: Evolutionary origins and biogeochemical significance. J. Exp. Bot. 2001;52:381–401. doi: 10.1093/jxb/52.suppl_1.381. [DOI] [PubMed] [Google Scholar]
- 8.Sanders H., Rothwell G.W., Wyatt S.E. Key morphological alterations in the evolution of leaves. Int. J. Plant Sci. 2009;170:860–868. doi: 10.1086/600135. [DOI] [Google Scholar]
- 9.Tomescu A.M.F. Megaphylls, microphylls and the evolution of leaf development. Trends Plant Sci. 2009;14:5–12. doi: 10.1016/j.tplants.2008.10.008. [DOI] [PubMed] [Google Scholar]
- 10.Ge Y., Fang X., Liu W., Sheng L., Xu L. Adventitious lateral rooting: The plasticity of root system architecture. Physiol. Plant. 2019;165:39–43. doi: 10.1111/ppl.12741. [DOI] [PubMed] [Google Scholar]
- 11.Foster A.S., Gifford E.M. Comparative Morphology of Vascular Plants. W. H. Freeman and Company; San Francisco, CA, USA: 1974. [Google Scholar]
- 12.Schneider H. Evolutionary morphology of ferns (monilophytes) Annu. Plant Rev. 2013;45:115–140. [Google Scholar]
- 13.Taylor E.L., Taylor T.N., Krings M. Paleobotany: The Biology and Evolution of Fossil Plants. Academic Press; Cambridge, MA, USA: 2009. [Google Scholar]
- 14.PPG I.(The Pteridophyte Phylogeny Group) A community-derived classification for extant lycophytes and ferns. J. Syst. Evol. 2016;54:563–603. doi: 10.1111/jse.12229. [DOI] [Google Scholar]
- 15.Zheng Y., Jiao C., Sun H., Rosli H., Pombo M.A., Zhang P., Banf M., Dai X., Martin G.B., Giovannoni J.J. iTAK: A program for genome-wide prediction and classification of plant transcription factors, transcriptional regulators, and protein kinases. Mol. Plant. 2016;9:1667–1670. doi: 10.1016/j.molp.2016.09.014. [DOI] [PubMed] [Google Scholar]
- 16.Hake S., Smith H.M., Holtan H., Magnani E., Mele G., Ramirez J. The role of knox genes in plant development. Annu. Rev. Cell Dev. Biol. 2004;20:125–151. doi: 10.1146/annurev.cellbio.20.031803.093824. [DOI] [PubMed] [Google Scholar]
- 17.Hay A., Tsiantis M. A KNOX family tale. Curr. Opin. Plant Biol. 2009;12:593–598. doi: 10.1016/j.pbi.2009.06.006. [DOI] [PubMed] [Google Scholar]
- 18.Sakakibara K., Nishiyama T., Deguchi H., Hasebe M. Class 1 KNOX genes are not involved in shoot development in the moss Physcomitrella patens but do function in sporophyte development. Evol. Dev. 2008;10:555–566. doi: 10.1111/j.1525-142X.2008.00271.x. [DOI] [PubMed] [Google Scholar]
- 19.Yu J., Zhang Y., Liu W., Wang H., Wen S., Zhang Y., Xu L. Molecular evolution of auxin-mediated root initiation in plants. Mol. Biol. Evol. 2020;37:1387–1393. doi: 10.1093/molbev/msz202. [DOI] [PubMed] [Google Scholar]
- 20.Gao J., Yang X., Zhao W., Lang T., Samuelsson T. Evolution, diversification, and expression of KNOX proteins in plants. Front. Plant Sci. 2015;6:882. doi: 10.3389/fpls.2015.00882. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Simpson M.G. Plant Systematics. Academic Press; Cambridge, MA, USA: 2019. [Google Scholar]
- 22.Kenrick P., Crane P.R. The origin and early evolution of plants on land. Nature. 1997;389:33–39. doi: 10.1038/37918. [DOI] [Google Scholar]
- 23.Shen H., Jin D., Shu J.P., Zhou X.-L., Lei M., Wei R., Shang H., Wei H.J., Zhang R., Liu L. Large-scale phylogenomic analysis resolves a backbone phylogeny in ferns. GigaScience. 2018;7:gix116. doi: 10.1093/gigascience/gix116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Schuettpelz E. The Evolution and Diversification of Epiphytic Ferns. Duke University; Durham, NC, USA: 2007. [Google Scholar]
- 25.Adibah M.R., Ainuddin A. Epiphytic plants responses to light and water stress. Asian J. Plant Sci. 2011;10:97–107. doi: 10.3923/ajps.2011.97.107. [DOI] [Google Scholar]
- 26.Bartels S.F., Chen H.Y. Mechanisms regulating epiphytic plant diversity. CRC Crit. Rev. Plant Sci. 2012;31:391–400. doi: 10.1080/07352689.2012.680349. [DOI] [Google Scholar]
- 27.Miller G., Hartzell S., Porporato A. Ecohydrology of epiphytes: Modelling water balance, CAM photosynthesis, and their climate impacts. Ecohydrology. 2021;14:e2275. doi: 10.1002/eco.2275. [DOI] [Google Scholar]
- 28.Lai X., Chahtane H., Martin-Arevalillo R., Zubieta C., Parcy F. Contrasted evolutionary trajectories of plant transcription factors. Curr. Opin. Plant Biol. 2020;54:101–107. doi: 10.1016/j.pbi.2020.03.002. [DOI] [PubMed] [Google Scholar]
- 29.Debernardi J.M., Mecchia M.A., Vercruyssen L., Smaczniak C., Kaufmann K., Inze D., Rodriguez R.E., Palatnik J.F. Post-transcriptional control of GRF transcription factors by micro RNA miR396 and GIF co-activator affects leaf size and longevity. Plant J. 2014;79:413–426. doi: 10.1111/tpj.12567. [DOI] [PubMed] [Google Scholar]
- 30.Kim J.H., Choi D., Kende H. The AtGRF family of putative transcription factors is involved in leaf and cotyledon growth in Arabidopsis. Plant J. 2003;36:94–104. doi: 10.1046/j.1365-313X.2003.01862.x. [DOI] [PubMed] [Google Scholar]
- 31.Arora R., Agarwal P., Ray S., Singh A.K., Singh V.P., Tyagi A.K., Kapoor S. MADS-box gene family in rice: Genome-wide identification, organization and expression profiling during reproductive development and stress. BMC Genom. 2007;8:242. doi: 10.1186/1471-2164-8-242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Parenicová L., de Folter S., Kieffer M., Horner D.S., Favalli C., Busscher J., Cook H.E., Ingram R.M., Kater M.M., Davies B. Molecular and phylogenetic analyses of the complete MADS-box transcription factor family in Arabidopsis: New openings to the MADS world. Plant Cell. 2003;15:1538–1551. doi: 10.1105/tpc.011544. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Tapia-López R., García-Ponce B., Dubrovsky J.G., Garay-Arroyo A., Pérez-Ruíz R.V., Kim S.-H., Acevedo F., Pelaz S., Alvarez-Buylla E.R. An AGAMOUS-related MADS-box gene, XAL1 (AGL12), regulates root meristem cell proliferation and flowering transition in Arabidopsis. Plant Physiol. 2008;146:1182–1192. doi: 10.1104/pp.107.108647. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Zhang G.-Q., Liu K.-W., Li Z., Lohaus R., Hsiao Y.-Y., Niu S.-C., Wang J.-Y., Lin Y.-C., Xu Q., Chen L.-J., et al. The Apostasia genome and the evolution of orchids. Nature. 2017;549:379–383. doi: 10.1038/nature23897. Erratum in Nature 2020, 583, E30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Van der Graaff E., Laux T., Rensing S.A. The WUS homeobox-containing (WOX) protein family. Genome Biol. 2009;10:248. doi: 10.1186/gb-2009-10-12-248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Samac D.A., Litterer L., Temple G., Jung H.-J.G., Somers D.A. Expression of UDP-glucose dehydrogenase reduces cell-wall polysaccharide concentration and increases xylose content in alfalfa stems. Appl. Biochem. Biotechnol. 2004;116:1167–1182. doi: 10.1385/ABAB:116:1-3:1167. [DOI] [PubMed] [Google Scholar]
- 37.Klinghammer M., Tenhaken R. Genome-wide analysis of the UDP-glucose dehydrogenase gene family in Arabidopsis, a key enzyme for matrix polysaccharides in cell walls. J. Exp. Bot. 2007;58:3609–3621. doi: 10.1093/jxb/erm209. [DOI] [PubMed] [Google Scholar]
- 38.Goodstein D.M., Shu S., Howson R., Neupane R., Hayes R.D., Fazo J., Mitros T., Dirks W., Hellsten U., Putnam N. Phytozome: A comparative platform for green plant genomics. Nucleic Acids Res. 2012;40:D1178–D1186. doi: 10.1093/nar/gkr944. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Huang C.H., Qi X., Chen D., Qi J., Ma H. Recurrent genome duplication events likely contributed to both the ancient and recent rise of ferns. J. Integr. Plant Biol. 2020;62:433–455. doi: 10.1111/jipb.12877. [DOI] [PubMed] [Google Scholar]
- 40.Bolger A.M., Lohse M., Usadel B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Haas B.J., Papanicolaou A., Yassour M., Grabherr M., Blood P.D., Bowden J., Couger M.B., Eccles D., Li B., Lieber M., et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 2013;8:1494–1512. doi: 10.1038/nprot.2013.084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Li W.Z., Godzik A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006;22:1658–1659. doi: 10.1093/bioinformatics/btl158. [DOI] [PubMed] [Google Scholar]
- 43.Emms D.M., Kelly S. OrthoFinder: Phylogenetic orthology inference for comparative genomics. Genome Biol. 2019;20:238. doi: 10.1186/s13059-019-1832-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Camacho C., Coulouris G., Avagyan V., Ma N., Papadopoulos J., Bealer K., Madden T.L. BLAST+: Architecture and applications. BMC Bioinform. 2009;10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Katoh K., Standley D.M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Minh B.Q., Schmidt H.A., Chernomor O., Schrempf D., Woodhams M.D., Von Haeseler A., Lanfear R. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 2020;37:1530–1534. doi: 10.1093/molbev/msaa015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Xu L., Dong Z., Fang L., Luo Y., Wei Z., Guo H., Zhang G., Gu Y.Q., Coleman-Derr D., Xia Q. OrthoVenn2: A web server for whole-genome comparison and annotation of orthologous clusters across multiple species. Nucleic Acids Res. 2019;47:W52–W58. doi: 10.1093/nar/gkz333. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.De Bie T., Cristianini N., Demuth J.P., Hahn M.W. CAFE: A computational tool for the study of gene family evolution. Bioinformatics. 2006;22:1269–1271. doi: 10.1093/bioinformatics/btl097. [DOI] [PubMed] [Google Scholar]
- 49.Zhang D., Gao F., Jakovlić I., Zou H., Zhang J., Li W.X., Wang G.T. PhyloSuite: An integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol. Ecol. Resour. 2020;20:348–355. doi: 10.1111/1755-0998.13096. [DOI] [PubMed] [Google Scholar]
- 50.Yang Z. PAML: Phylogenetic Analysis by Maximum Likelihood. Mol. Biol. Evol. 2007;24:1586–1591. doi: 10.1093/molbev/msm088. [DOI] [PubMed] [Google Scholar]
- 51.Huerta-Cepas J., Szklarczyk D., Heller D., Hernández-Plaza A., Forslund S.K., Cook H., Mende D.R., Letunic I., Rattei T., Jensen L.J. eggNOG 5.0: A hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res. 2019;47:D309–D314. doi: 10.1093/nar/gky1085. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw sequence data reported in this paper have been deposited in the SRA under Bioproject number PRJNA762181.





