Skip to main content
Plant Communications logoLink to Plant Communications
. 2024 Apr 1;5(7):100891. doi: 10.1016/j.xplc.2024.100891

A reference-grade genome of the xerophyte Ammopiptanthus mongolicus sheds light on its evolution history in legumes and drought-tolerance mechanisms

Lei Feng 1,2, Fei Teng 3, Na Li 1, Jia-Cheng Zhang 1, Bian-Jiang Zhang 1, Sau-Na Tsai 1, Xiu-Le Yue 4, Li-Fei Gu 1, Guang-Hua Meng 1, Tian-Quan Deng 3, Suk-Wah Tong 1, Chun-Ming Wang 1, Yan Li 5, Wei Shi 3, Yong-Lun Zeng 6, Yue-Ming Jiang 7, Weichang Yu 8, Sai-Ming Ngai 1, Li-Zhe An 4,9,, Hon-Ming Lam 1,∗∗, Jun-Xian He 1,∗∗∗
PMCID: PMC11287142  PMID: 38561965

Abstract

Plants that grow in extreme environments represent unique sources of stress-resistance genes and mechanisms. Ammopiptanthus mongolicus (Leguminosae) is a xerophytic evergreen broadleaf shrub native to semi-arid and desert regions; however, its drought-tolerance mechanisms remain poorly understood. Here, we report the assembly of a reference-grade genome for A. mongolicus, describe its evolutionary history within the legume family, and examine its drought-tolerance mechanisms. The assembled genome is 843.07 Mb in length, with 98.7% of the sequences successfully anchored to the nine chromosomes of A. mongolicus. The genome is predicted to contain 47 611 protein-coding genes, and 70.71% of the genome is composed of repetitive sequences; these are dominated by transposable elements, particularly long-terminal-repeat retrotransposons. Evolutionary analyses revealed two whole-genome duplication (WGD) events at 130 and 58 million years ago (mya) that are shared by the genus Ammopiptanthus and other legumes, but no species-specific WGDs were found within this genus. Ancestral genome reconstruction revealed that the A. mongolicus genome has undergone fewer rearrangements than other genomes in the legume family, confirming its status as a “relict plant”. Transcriptomic analyses demonstrated that genes involved in cuticular wax biosynthesis and transport are highly expressed, both under normal conditions and in response to polyethylene glycol-induced dehydration. Significant induction of genes related to ethylene biosynthesis and signaling was also observed in leaves under dehydration stress, suggesting that enhanced ethylene response and formation of thick waxy cuticles are two major mechanisms of drought tolerance in A. mongolicus. Ectopic expression of AmERF2, an ethylene response factor unique to A. mongolicus, can markedly increase the drought tolerance of transgenic Arabidopsis thaliana plants, demonstrating the potential for application of A. mongolicus genes in crop improvement.

Key words: Ammopiptanthus mongolicus, genome sequencing, genome evolution, drought tolerance, cuticular wax, ethylene


This study reports a high-quality chromosome-level genome assembly for the xerophytic legume shrub Ammopiptanthus mongolicus. Comparative genomic analyses provide insights into the genome expansion and chromosome evolution of this relict plant. Analyses of its cuticular wax biosynthesis and phytohormone signaling pathways shed light on its mechanisms of adaptation to desert environments.

Introduction

The genus Ammopiptanthus (family Leguminosae or Fabaceae, subfamily Papilionoideae) comprises only two species, Ammopiptanthus mongolicus and Ammopiptanthus nanus (Cheng, 1959). The model species, A. mongolicus (Figure 1A), is distributed throughout the southern Gobi Desert and adjacent areas in central Asia, whereas A. nanus is restricted to Kashgar, Xinjiang Province, China (Zhao and Zhu, 2003). Geographical research suggests that deserts have existed in eastern central Asia for approximately 22 million years (Xie and Yang, 2012). The genus Ammopiptanthus, an endemic plant, is considered a relict descendant of the ancient Tethyan flora that evolved to survive in desert environments (Xie and Yang, 2012). Despite growing in extremely arid and cold environments, Ammopiptanthus spp. are evergreen broadleaf shrubs (Zhao et al., 2016a). These evergreen shrubs play an essential role in maintaining the semi-desert ecosystem, as they can obstruct drifting sand and prevent further desertification (Liu and Qiu, 1982). However, the harsh environment of the Gobi Desert leads to low seed germination rates and increased growth pressure. Therefore, both Ammopiptanthus species have been listed as endangered and second-class state-protected wild plants in China (Ge et al., 2005).

Figure 1.

Figure 1

Morphology, karyotyping, Hi-C interaction heatmap, and genomic landscape of A. mongolicus.

(A) A population of A. mongolicus grown in a field on Alxa Plateau, Inner Mongolia, China (July 2015).

(B) Karyotyping of A. mongolicus by fluorescence in situ hybridization (FISH) using probes from repeat sequences of rDNAs (green, 18S rDNA; red, 5S rDNA). Scale bar, 5 μm.

(C) Assembly of A. mongolicus chromosomes using Hi-C data. Bin size was set to 200 kb when plotting the Hi-C interaction heatmap.

(D) Genomic landscape of A. mongolicus. i, intragenomic collinear blocks between or within chromosomes, with the most dramatic one highlighted in pink; ii, nine pseudo-chromosomes (unit, Mb); iii, GC content in each 500-kb bin of the genome; iv, gene density along the chromosomes, expressed as the number of genes per bin; v, repeat ratio of each bin; vi, number of simple short repeats (SSRs) in each bin; vii, percentage of LTR_Copia among TEs; viii, percentage of LTR_Gypsy among TEs. All density or ratio information was determined in non-overlapping 500-kb windows.

Phylogenetic analyses based on the nuclear ribosomal internal transcribed spacer (nrITS) marker and the plastid matK gene have shown that Ammopiptanthus is monophyletic and has a sibling genetic relationship with a clade including Sophora and Ammodendron (Xie and Yang, 2012). A study of chloroplast intergenic spacers and ITS markers estimated the divergence time between A. mongolicus and A. nanus at approximately 0.77 million years ago (mya) (Su et al., 2016). Our recent chloroplast and mitochondrial sequence analyses revealed that these two species have almost the same organelle gene content, with only a few insertions and deletions in non-coding regions, suggesting that they diverged very recently (Feng et al., 2017, 2019).

Mechanisms of abiotic stress tolerance in this genus are receiving increasing attention. Morphological studies have shown that its leaf surfaces are covered with an exceptionally thick waxy cuticle (up to 18 μm) that helps to limit transpiration and maintain plant water status (Han and Li, 1992; Li et al., 2023). There are dense silvery trichomes on both leaf surfaces that protect against heat and cold (Liu et al., 2004). A. mongolicus leaves also have a lower stomatal density than leaves of Arabidopsis (Li et al., 2015), further reducing the rate of transpiration. Notably, the palisade mesophyll of A. mongolicus has nine or 10 cell layers, resulting in succulent leaves and thus an increased water storage capacity. Gene expression profiling of this genus under drought, cold, and salinity stress has identified hundreds of abiotic-stress-inducible genes (Zhou et al., 2012; Liu et al., 2013; Pang et al., 2013, 2015; Wu et al., 2014; Gao et al., 2015; Yang et al., 2022b). However, the exact stress-tolerance mechanisms of this unique desert flora remain unclear.

Legumes (family Fabaceae) form the third largest angiosperm family in the plant kingdom, with 751 genera and approximately 19 500 species (Bruneau et al., 2013). With recent advances in genome sequencing technologies, genomes of more than 20 legume species have been sequenced (Wang et al., 2017; Bertioli et al., 2019; Xie et al., 2019; Zhuang et al., 2019; Xu et al., 2020; Yang et al., 2022a; Chang et al., 2022). These sequenced genomes are invaluable resources for functional studies of individual plant species and comparative and evolutionary studies of eudicots. However, the sequenced genomes represent only a small portion of the large legume family. An 823-Mb draft genome of A. nanus was recently assembled using only Pacific Biosciences (PacBio) sequencing data (Gao et al., 2018); it provides basic information on the genome but is insufficient for a detailed understanding of its genomic features, downstream functional studies, and comparative genomic analyses.

Here, we assembled a high-quality reference-grade genome sequence for A. mongolicus by combining Illumina shotgun sequencing, PacBio long-read sequencing, Bionano optical mapping, and high-throughput chromosome conformation capture (Hi-C) sequencing. The final assembly is 843.07 Mb in length, with ∼98.7% anchored to nine chromosomes. Transcriptome sequencing and gene expression analyses revealed that ethylene signaling and cuticular wax biosynthesis play important roles in the drought tolerance of this desert plant. Our results provide new insights into the genomic characteristics, evolutionary history, and drought-tolerance mechanisms of A. mongolicus.

Results

Genome sequencing and assembly

Previous karyotyping studies have revealed that the genus Ammopiptanthus is diploid (2n = 2X = 18) (Feng and Song, 1988), and this was confirmed by our fluorescence in situ hybridization (FISH) analysis, which clearly showed nine pairs of chromosomes in A. mongolicus (Figure 1B). Two approaches were used to predict the genome size of A. mongolicus. The size predicted by flow cytometry was ∼979 Mb (Supplemental Figure 1), and that estimated from 60 Gb of next-generation sequencing (NGS) data (Illumina HiSeq 2000) from a 180-bp-insert-size DNA library was ∼926 Mb, with a GC content of 37% and a repetitive sequence content of 66% (Supplemental Figure 2). Deep sequencing of the A. mongolicus genome was performed using both Illumina NGS technology and single-molecule real-time (SMRT) sequencing on the PacBio Sequel II platform. Illumina sequencing based on multiple paired-end DNA libraries generated ∼395 Gb of data, representing a sequencing depth of >400× (Supplemental Table 1). PacBio sequencing generated 214 Gb of long reads (average length = 16.9 kb) (Supplemental Table 2), representing a sequencing coverage of >100×. All PacBio long reads were self-corrected and used for assembly with Canu v2.1 (Koren et al., 2017), and 114 Gb of shotgun reads from NGS libraries (insert sizes: 200, 500, and 800 bp) were used for error correction with Pilon v1.22 (Walker et al., 2014). To further increase genome quality, we performed Bionano optical mapping sequencing to enable better scaffolding. In total, 130.35 Gb of data were generated by the Bionano Saphyr system (N50 = 328.5 kb for reads >150 kb, and N50 = 214.5 kb for reads >20 kb; Supplemental Tables 3 and 4). After scaffolding, the contig N50 length was increased to 3.13 Mb. Super-scaffolding (or chromosome-level scaffolding) was achieved using Hi-C sequencing, which generated 316.42 million paired-end reads (94.93 Gb of clean data; Supplemental Table 5). Approximately half (49.5%) of the Hi-C reads (156.70 million paired-end reads) were uniquely mapped to the genome, and 30.7% (97.18 million paired-end reads) turned out to be valid interaction pairs (Supplemental Table 6). The 3D-DNA Hi-C assembly pipeline was then used to cluster scaffolds with the agglomerative hierarchical clustering method (Dudchenko et al., 2017). After clustering and orientation, the scaffolds were clustered into nine chromosomes or superscaffolds (Figure 1C). The final assembly was 843.07 Mb in length, including 498 scaffolds and 1374 contigs. The N50 length of the scaffolds was 92.57 Mb, and that of the contigs was 2.90 Mb. The lengths of the nine chromosomes ranged from 70.9 to 108.8 Mb (Supplemental Table 7), with a total length of 831.99 Mb, accounting for ∼98.7% of the overall assembly (Table 1). Notably, all telomere sequences were found at both ends of the nine chromosomes, and the identified telomere repeat motif in A. mongolicus (TTTAGGG)n is also conserved among most terrestrial plants, including Arabidopsis thaliana. The chromosome sizes of the final assembly (Supplemental Table 7) were consistent with the karyotyping results obtained using FISH (Figure 1B).

Table 1.

Global statistics of the A. mongolicus genome and gene predictions.

Number or ratio Size
Assembly feature

Estimated genome size 926 Mbp
Total chromosomes 9 831.99 Mbp
Chromosome N50 92.57 Mbp
Longest chromosome 108.85 Mbp
Total scaffolds (>1000 bp) 498 843.07 Mbp
Scaffold N50 92.57 Mbp
Longest scaffold 108.85 Mbp
Total contigs 1374 842.10 Mbp
Contig N50 2.90 Mbp
Longest contig 14.38 Mbp
GC content 36.77%

Genome annotation

Total repetitive sequences 70.71% 596.16 Mbp
Transposable elements 69.55% 586.35 Mbp
Gene models 47 611 222.31 Mbp
Non-coding RNAs 5701 1.17 Mbp

The quality of the assembled genome was assessed using Benchmarking Universal Single-Copy Orthologs (BUSCOs). The BUSCO database provides 255 single-copy orthologous eukaryotic genes for evaluation of genome quality (https://busco.ezlab.org, eukaryota_odb10); 250 of these BUSCOs (98.1%) were completely mapped to the A. mongolicus genome, and two additional BUSCOs (0.8%) were partially mapped. BUSCO also provided 5366 single-copy orthologous genes from legumes (fabales_odb10); 4937 of these single-copy genes (92.0%) were completely represented in the A. mongolicus genome, and 92 more BUSCOs (1.7%) were fragmented in the genome (Supplemental Table 8).

Genome annotation

Repetitive sequences were annotated using homology-based and ab initio methods. The genome of A. mongolicus is highly repetitive, with repetitive sequences accounting for 70.71% of its total genome sequence (Supplemental Table 9). Approximately half of the repeats (35.39% of the genome) matched well with known transposable elements (TEs). As reported for many land-plant genomes, long-terminal-repeat retrotransposons (LTR-RTs) are the dominant TE type in A. mongolicus (∼84.64% of total TEs). Most LTR-RTs were from the Ty1/Copia and Ty3/Gypsy superfamilies, which accounted for 9.35% and 20.14% of the genome, respectively. Non-TE repeats (e.g., simple short repeats, transfer RNA [tRNAs], small nuclear RNAs [snRNAs], and rRNAs) constituted 1.16% of the genome (Supplemental Table 9).

Protein-coding genes were predicted using a combination of homology-based, transcriptome-based, and ab initio gene prediction strategies, which yielded 47 611 protein-coding genes (Supplemental Table 10) and 3542 non-coding RNAs (Supplemental Table 11). The average coding sequence (CDS) length of protein-coding genes was 1488 bp, with an average of 5.5 exons per gene. The total length of the genic regions was 222.31 Mb (26.37% of the genome) (Supplemental Table 12). Functional annotation assigned putative functions to 91.44% of the protein-coding genes (43 536) on the basis of sequence similarity searches against public databases (Supplemental Table 13). The 3542 non-coding RNAs included 1212 tRNAs, 380 rRNAs, 1779 snRNAs, and 171 microRNAs (Supplemental Table 11). The GC content and repeat ratio were higher in centromeric and pericentromeric regions than in the chromosome arms, whereas protein-coding genes were distributed mainly in the chromosome arms (Figure 1D). Among the TEs, simple sequence repeats (SSRs) had distribution patterns similar to those of protein-coding genes, whereas LTR-Gypsy sequences were enriched in centromeric and pericentromeric regions, and LTR-Copia sequences were more dispersed throughout the genome (Figure 1D). These distribution characteristics of the A. mongolicus genome are consistent with those of the archetypal eukaryotic genome, which is segregated into A/B compartments, with A compartments (i.e., active) located in chromosome arms and B compartments (i.e., inactive) in centromeric and pericentromeric regions (Wang et al., 2021).

Comparative genomic analyses revealed ancient polyploidy of the common legume ancestor but no species-specific genome duplications in Ammopiptanthus

Legumes evolved from a common tetraploid ancestor approximately 58 mya (Young et al., 2011). The ancestor underwent two polyploidization events at 130 and 58 mya, followed by more recent events in individual legume lineages, such as the soybean-specific tetraploidy event at ∼13 mya (Schmutz et al., 2010; Wang et al., 2017). As a result, numerous syntenic blocks were detected between the A. mongolicus genome and those of other closely related legumes such as soybean (Glycine max) and barrel medic (Medicago truncatula) (Figure 2A). For each syntenic block associated with barrel medic, there is usually one corresponding block in A. mongolicus and two in soybean (Figure 2A). The larger number of intragenomic syntenic blocks in soybean is indicative of its more recent specific polyploidization event at 13 mya. In addition, 325 collinear blocks were identified when the A. mongolicus genome was compared with itself (Supplemental Table 13), reflecting the ancient polyploidizations of its legume ancestors. On average, each collinear block contained 13.81 paralogous gene pairs; the most dramatic collinear block was found in Chr01, with 236 paralogous gene pairs (Supplemental Table 14; Figure 1D). The divergence times between homologous gene pairs were predicted from the four-fold synonymous third-codon transversion (4DTv) rates. The 4DTv distribution of paralogous gene pairs from intragenomic collinear blocks in the soybean genome showed peaks at 0.4, 0.18, and 0.04, corresponding to three whole-genome duplication (WGD) events at 130, 58, and 13 mya, respectively (Figure 2B). However, paralogous gene pairs in intragenomic collinear blocks of the A. mongolicus genome showed only two significant peaks around 0.4 and 0.18, indicating that no WGD events had occurred since 58 mya, and similar results were obtained for A. nanus (Figure 2B). The 4DTv distributions of paralogous genes in A. mongolicus and A. nanus almost completely overlapped (Figure 2B). When considering orthologous gene pairs from intergenomic collinear blocks between A. mongolicus and other legumes, the 4DTv distribution of orthologous gene pairs between A. mongolicus and A. nanus showed a peak at 0.01, suggesting a divergence time for these two species of ∼3.25 mya (Figure 2C). The ratio of the WGD time/4DTv peak values of paralogous gene pairs in G. max was used to calibrate the divergence time. Synonymous substitution rates (Ks) were also calculated for syntenic gene pairs in both intragenomic (Supplemental Figure 3A) and intergenomic regions (Supplemental Figure 3B), revealing patterns consistent with those of the 4DTv distributions (Figure 2B and 2C). The Ks analysis thus supports the conclusion that no species-specific genome duplications have occurred in the genus Ammopiptanthus since 58 mya.

Figure 2.

Figure 2

Synteny, orthology, and polyploidization analyses of A. mongolicus and closely related legume species.

(A) Intergenomic syntenic blocks identified among A. mongolicus, M. truncatula, and G. max. The connections that are specific to chromosome 1 of M. truncatula are highlighted. The purple ribbons indicate syntenic blocks shared by M. truncatula and A. mongolicus, and the red and green ribbons denote those shared by M. truncatula (barrel medic) and G. max (soybean); each syntenic block in M. truncatula usually corresponds to two blocks in G. max.

(B) Four-fold synonymous third-codon transversion (4DTv) rates of paralogous gene pairs in intragenomic collinear regions of selected plant genomes. The numbers I–III indicate WGD or triplication events during evolution. I, γ event (whole-genome triplication) shared by eudicots ∼130 million years ago (mya); II, WGD event shared by legumes 58 mya; III, WGD event specific to the genus Glycine 13 mya. Amo, A. mongolicus; Ana, A. nanus; Cca, C. cajan; Gma, G. max; Mtr, M. truncatula.

(C) 4DTv distributions calculated from orthologous gene pairs in intergenomic collinear regions between different species.

(D) Numbers of orthologous gene groups among selected plants.

The phylogenetic relationships among A. mongolicus and other legumes

To better understand the evolutionary history of A. mongolicus, we performed phylogenetic analyses of 15 closely related legumes along with A. mongolicus (Supplemental Table 15). Orthologous gene groups were identified on the basis of sequence similarity using the OrthoFinder pipeline and were clustered using the Markov Cluster Algorithm (Emms and Kelly, 2018). A Venn diagram highlighted the overlaps of orthologous groups among A. mongolicus, three other legumes, and A. thaliana (Figure 2D). The enriched biological process Gene Ontology (GO) terms in the A. mongolicus-specific gene groups included DNA/RNA-mediated transposition, regulation of root development, meristem growth, and aldonic acid metabolic process (Supplemental Table 16), which may enhance survival of the species or adaptation to severe abiotic stress.

A phylogenetic analysis based on 322 single-copy orthologous genes from these 15 legume species confirmed their phylogenetic relationships (Figure 3A). The topology of the phylogenetic tree aligns with previous analyses of legume species (Wang et al., 2006, 2018). The estimated divergence time between A. mongolicus and A. nanus was 2.4 mya (Figure 3A), which was slightly more recent but similar to the estimate from 4DTv analysis (3.25 mya) (Figure 2C). The divergence time between A. mongolicus and Lupinus angustifolius was calculated to be 27.79 mya, predating the whole-genome triplication event in the genus Lupinus at around 25 mya (Hane et al., 2017).

Figure 3.

Figure 3

Inference of divergence times and MRCAs of A. mongolicus.

(A) A phylogenetic tree inferred from single-copy orthologous genes of legume species. Numbers on the nodes represent the divergence times from the present (mya). The divergence time between G. max and P. vulgaris (20.1 [19.0–21.0] mya) was used as the standard for calibration (Zheng et al., 2016). Polyploidization events are marked by red stars on the tree. The background colors of different lineages represent different legume clades. A. thaliana was used as the outgroup taxon.

(B) Reconstructed ancestral genomes of six legume species. The phylogenetic tree and timescale were redrawn from (A). WGD, whole-genome duplication (2×, tetraploidy; 3×, hexaploidy); CRE ratio, completely rearranged endpoint; EA, estimated accuracy; AEA, accumulated estimated accuracy. The numbers of shuffling events (fissions and fusions) are labeled beside each branch. The shuffling events marked by an asterisk (∗) in peanut were calculated against ancestor 1.

We also examined the expansion and contraction of gene families. The orthologous genes were classified into two groups: group I, which included genes that expanded/contracted in both A. mongolicus and A. nanus, and group II, which included genes that expanded/contracted only in A. mongolicus. Group I contained 155 expanded gene families and 31 contracted families, representing 3749 expanded genes in A. mongolicus and 1311 expanded genes in A. nanus. Group II comprised 219 expanded orthologous groups and 128 contracted groups (Supplemental Figure 4). Functional annotation of the expanded genes indicated that most were TE related, such as reverse transcriptases, the DNA/RNA polymerase superfamily, and helicases. Other gene groups included senescence-associated proteins, UDP-glycosyltransferases, the NAD(P)-binding Rossmann-fold superfamily, the cytochrome P450 2C family, the pectin lyase-like superfamily, disease-resistance proteins, and osmotins (Supplemental Tables 17 and 18). Members of the cytochrome P450 family may be involved in cuticular wax biosynthesis, which helps to limit transpiration and reduce water loss (Pandian et al., 2020; Zhang et al., 2020). Osmotins, members of the pathogenesis-related protein family, may be critical for adaptation to osmotic stress (Bashir et al., 2020). The expanded gene groups were enriched in GO terms associated with transpositional recombination, anatomical structure homeostasis, ATP biosynthetic pathways, proton transmembrane transport, and abiotic stimuli (Supplemental Figure 5; Supplemental Tables 19 and 20).

Further analyses of single-copy orthologous genes revealed that 719 genes were under positive selection in A. mongolicus. Functional annotations indicated that some were related to stress responses, such as osmotin-like proteins, the ATP-binding cassette transporter family, ATPases, membrane transporter proteins, cadmium-transporting ATPases, sugar transporters, peroxidases, and genes related to long-chain fatty acid biosynthesis (Supplemental Table 21), and they were enriched in GO terms such as ion transmembrane transport, peroxisomal transport, hormone biosynthesis, and membrane lipid biosynthetic processes (Supplemental Table 22; Supplemental Figure 6). These positively selected genes may have played important roles in the evolution of abiotic stress resistance and adaptation in A. mongolicus.

Reconstruction of ancestral legume genomes

The chromosome-level genome assembly enabled us to characterize the evolution of the A. mongolicus genome from those of its legume ancestor(s). To this end, we selected A. mongolicus and five other legumes—soybean (G. max), common bean (Phaseolus vulgaris), barrel medic (M. truncatula), white lupin (Lupinus albus), and a wild diploid peanut species (Arachis ipaensis)—and reconstructed the most recent common ancestors (MRCAs) of these legume species. Protein-coding genes that could be anchored to the chromosomes of these six species were aligned and clustered using OrthoFinder. A total of 26 038 ortho-groups were identified, each containing an average of 8.6 orthologous genes (Supplemental Table 23), and DRIMM-synteny analyses revealed 220 syntenic blocks shared by these plants (Supplemental Tables 24 and 25). Six intermediate ancestral genomes (Figure 3B) were computed using the Inferring Ancestor Genome Structure (IAGS) package (Gao et al., 2022). Overall, a large number of rearrangements (including both fusions and fissions) between a species and its ancestor, as in the cases of white lupin and soybean, indicates the occurrence of paleopolyploidy and post-polyploid diploidization. Compared with the other legumes in the analysis, A. mongolicus had the fewest rearrangements, with only 22 fissions and 22 fusions between it and ancestor 1. This finding provides strong molecular support for the claim that A. mongolicus is a “relict plant” or “living fossil” (Xu et al., 2002).

LTR-RT bursts caused the expansion of the A. mongolicus genome

TEs are important driving forces in genome evolution, as they can induce genomic rearrangements such as translocations, inversions, and duplications through recombination (Pereira and Ryan, 2019). We therefore performed a comprehensive analysis of repetitive sequences in 17 legume species together with A. thaliana (Supplemental Table 15). In general, legume genomes had a much higher proportion of repetitive sequences than A. thaliana (Figure 4A), and the A. mongolicus genome had the second highest proportion (70.71%), only slightly lower than that of A. ipaensis (71.65%). Most repetitive sequences in the legume genomes were TEs. In A. mongolicus, 35.39% of the genome consisted of intact TEs, with the dominant type being LTR-RTs (∼84.64% of total TEs); 9.35% of the genome was occupied by Ty1/Copia LTR-RTs and 20.14% by Ty3/Gypsy LTR-RTs (Figure 4A). The actual proportions of Ty1/Copia and Ty3/Gypsy LTR-RTs may be even higher, because many older LTR-RTs have experienced severe deletions or fragmentation during genome evolution through processes such as unequal homologous recombination and illegitimate recombination (Du et al., 2010). Intact LTR-RTs were identified on the basis of their structural features using LTR_retriever; they included 3054 Ty1/Copia, 3019 Ty3/Gypsy, and 1076 non-autonomous LTR-RTs, with an average length of 7.65 kb (Supplemental Figure 7). Insertion time analysis indicated that LTR-RTs have greatly expanded in A. mongolicus during the last 2.5 million years (Figure 4B). A recent systematic survey of LTR-RTs classified Ty1/Copia and Ty3/Gypsy into 16 and 14 lineages, respectively. The 16 Ty1/Copia lineages include Ale, Alesia, Angela, Bianca, Bryco, Lyco, Gymco-I–IV, Ikeros, Ivana, Osser, SIRE, TAR, and Tork, and the 14 Ty3/Gypsy lineages include chromoviruses (CRM, Chlamyvir, Galadriel, Tcn1, Reina, and Tekay) and non-chromovirus elements (Athila, Tat-I,II,III, Ogre, Retand, Phygy, and Selgy) (Neumann et al., 2019). In A. mongolicus, the Ty1/Copia type included eight lineages (Ale, Angela, Bianca, Ikeros, Ivana, SIRE, TAR, and Tork) (Figure 4C), although the Bianca lineage was represented by only three LTR-RTs, echoing the case of soybean, in which the entire Bianca lineage has been lost (Du et al., 2010). The Ty3/Gypsy LTR-RTs of A. mongolicus comprised six lineages: Athila, CRM, Ogre, Reina, Retand, and Tekay (Figure 4D). The Ty1/Copia LTR-RTs were further sorted into 458 clusters based on their reverse transcriptase (RT) domain sequences using the cd-hit package, and the Ty3/Gypsy LTR-RTs were sorted into 336 clusters (Figure 4E and 4F). The results of phylogenetic analyses of these two LTR-RT groups were consistent with the classification using TEsorter and provide detailed features of TE expansion during the evolution of the A. mongolicus genome.

Figure 4.

Figure 4

Classification of repetitive sequences in legumes and evolution of TEs in A. mongolicus.

(A) Percentages of transposable elements (TEs): Ty1/Copia, Ty3/Gypsy, other TEs (unknown types), non-TE repeats (microsatellites, low-complexity sequences, rRNAs, snRNAs, etc.), and non-repeat sequences in the genomes of the respective species.

(B) Insertion times of long-terminal-repeat retrotransposons (LTR-RTs), calculated as T = K/(2r), where T is insertion time, K is the number of base substitutions per site between the two LTRs of an LTR-RT, and r is the synonymous nucleotide substitution rate, expressed as the number of synonymous mutations per site per year. For legumes, r is 7e−9 (Jing et al., 2005).

(C and D) Clade-level classification of LTR-RTs using TEsorter (https://github.com/zhangrengang/TEsorter). (C) Ty1/Copia. (D) Ty1/Gypsy.

(E) Classification of Ty1/Copia into sub-lineages.

(F) Classification of Ty3/Gypsy into sub-lineages. The phylogenetic trees in (E) and (F) were constructed from the RT domains of the LTR-RTs.

To investigate the potential contribution of the LTR-RT expansion to the drought tolerance of A. mongolicus, we performed a comprehensive GO analysis. Genes that overlapped with or were in close proximity to LTR-RTs were identified, resulting in the discovery of 7125 LTR-RT-related genes (Supplemental Table 26). GO enrichment analysis revealed that these genes were significantly enriched in various biological processes, including retrotransposition, DNA transposition, and metabolic processes related to D-gluconate, sphingolipids, and aldonic acid (Supplemental Figure 8A). In terms of gene expression, 75% of the LTR-RT-related genes showed no transcriptional activity, whereas 50% of non-LTR-RT-related genes were unexpressed (Supplemental Figure 8B). However, there were no distinct patterns between LTR-RT-related and non-LTR-RT-related genes in most leaf samples (Supplemental Figure 8C). Further studies are needed to fully understand the involvement of LTR-RTs and their associated genes in the drought tolerance of A. mongolicus.

The involvement of cuticular wax in the drought-stress tolerance of A. mongolicus

Leaves of A. mongolicus are rhombic-elliptic or broad-lanceolate in shape, measuring 2.0–3.8 cm in length and 0.6–2.7 cm in width (Han and Li, 1992). Both surfaces of the leaves are coated with a waxy cuticle that can reach a thickness of 18 μm (Han and Li, 1992). This cuticle thickness exceeds that observed in other desert plants and may therefore provide significant protection from excessive transpiration. Cuticular waxes are mixtures of very-long-chain aliphatic compounds such as fatty acids, primary alcohols, acyl esters, alkanes, aldehydes, secondary alcohols, and ketones (Dimopoulos et al., 2020). Very-long-chain-fatty-acids (VLCFAs) are produced from fatty acyl-coenzyme A (CoA)-thioesters through a process catalyzed by the multienzyme fatty acid elongase (FAE) complex. To produce aliphatic compounds, VLCFA-CoAs are modified by enzymes in the endoplasmic reticulum (Figure 5). By combining the genome sequence and transcriptomic data, we identified genes related to cuticular wax biosynthesis and transport in A. mongolicus and analyzed changes in their expression during polyethylene glycol (PEG)-induced dehydration stress. Overall, these genes had higher expression levels in leaves than in roots, and most were induced by PEG treatment, including KCR (encoding beta-ketoacyl reductase, a member of the FAE family), WSD1 (encoding wax ester synthase and diacylglycerol acyltransferase), CER3 (encoding ECERIFERUM 3, a transmembrane protein with similarity to the sterol desaturase family), and LTPs (encoding lipid transfer proteins). LTPs, together with ATP-binding cassette transporters, transport cuticular wax mixtures (Figure 5; Supplemental Table 27). Interestingly, many of the cuticular wax biosynthetic genes had high copy numbers and were tandemly duplicated in the A. mongolicus genome, including the KCR genes (Chr03.g17075, Chr03.g17076, Chr04.g22540, Chr04.g22542) and CER4 genes (Chr04.g18365, Chr04.g18366, Chr07.g34606, Chr07.g34607, and Chr07.g34608). Most notably, the MAH1 genes (encoding midchain alkane hydroxylases) had more than 100 copies in the A. mongolicus genome, and some were also tandemly duplicated (Figure 5; Supplemental Tables 27 and 28).

Figure 5.

Figure 5

Genes involved in cuticular wax biosynthesis and transport in A. mongolicus and their responses to simulated drought stress.

Heatmaps show the expression levels of genes involved in the cuticular wax biosynthesis and transport pathways upon drought treatment simulated by polyethylene glycol (PEG) at 0, 2, 24, and 48 h in leaves and roots of 2-month-old A. mongolicus plants. Gene expression values were normalized as log10(FPKM + 1). LACS, long-chain fatty acid acyl-CoA synthetase; FAE, fatty acid elongase; VLFCA, very-long-chain fatty acids; FAR, fatty acyl-CoA reductase; CER, eceriferum; WSD, wax ester synthase and diacylglycerol acyltransferase; MAH, midchain alkane hydroxylase.

Ethylene and other stress phytohormones play important roles in the stress tolerance of A. mongolicus

In the drought-responsive signaling pathway, messenger molecules such as stress hormones and/or second messengers (e.g., inositol phosphates and reactive oxygen species) that regulate intracellular Ca2+ levels are released upon stress signal perception. Changes in Ca2+ levels then initiate phosphorylation cascades, activating diverse transcription factors that act upon downstream stress-tolerance effector genes (Xiong et al., 2002). In our transcriptomic study, we observed significant induction of genes related to biosynthesis or signaling of several stress hormones, including ethylene, abscisic acid (ABA), and jasmonic acid (JA) (Supplemental Table 29). Among these hormone-related genes, two ethylene biosynthetic genes, Chr06.g29235 and Chr06.g29237, which are homologous to Arabidopsis 1-Aminocyclopropane-1-Carboxylic Acid Oxidase (ACO), caught our attention because their expression was induced by more than 39.8-fold and 4-fold, respectively, after 24 h of PEG treatment (Figure 6A). In addition to the two ACO genes, several other ethylene biosynthesis and signaling genes were also significantly upregulated by PEG-induced dehydration stress, particularly at 24 h (Figure 6B). For example, Chr01.g00583, Chr06.g30417, and Chr01.g01008, three genes homologous to Arabidopsis 1-Aminocyclopropane-1-Carboxylic Acid Synthase (ACS), and genes encoding ethylene receptors (ETR1 and ETR2), key mediator Ethylene Insensitive 2 (EIN2), key transcription factors (EIN3, EIN3-like 3 [EIL3], and EIN4), and downstream ethylene response factors (ERFs) were all activated (Figure 6B).

Figure 6.

Figure 6

Ethylene-related genes play important roles in stress resistance of A. mongolicus.

(A) Both transcriptomic and qRT–PCR analyses indicated marked induction of the ethylene biosynthetic genes EFE1/ACO and EFE2/ACO. Data are from 2-month-old A. mongolicus plants treated with 15% PEG for 24 h. The housekeeping gene AmEIF was used as an internal control in qRT–PCR analysis, and the significance of changes was evaluated by Student’s t-test (∗∗p < 0.01).

(B) Expression profiles of genes related to ethylene biosynthesis and signaling in leaves of 2-month-old A. mongolicus plants during PEG treatment. The expression levels of each gene at different treatment time points are displayed in the heatmaps by converting FPKM values into Z scores. ACS, ACC synthase; ACO, ACC oxidase; ETR, ethylene receptor; EIN, ethylene insensitive; EIL, EIN3-like; ERF, ethylene response factors; ERT, ethylene-responsive transcriptional coactivator-like protein; EGY, ethylene-dependent gravitropism-deficient and yellow-green-like; AP2, ethylene-responsive element-binding protein 2; ERS, ethylene response sensor 1.

(C) Microsynteny analysis of collinear regions of the ERF2 gene and its flanking regions among A. mongolicus, A. nanus, M. truncatula, C. cajan, G. max, and V. vinifera. Blue and green boxes represent gene loci in the chromosome regions. Gray ribbons represent orthologous gene relationships among the six species. A red ribbon highlights the ERF2 locus in A. mongolicus and A. nanus. ALA2, Aminophospholipid ATPase 2 (a homolog of Arabidopsis AT5G44240); TAT2, Tyrosine aminotransferase 2 (a homolog of AT5G36160).

(D) Western blot validation of 35S::AmERF2-6Myc transgenic Arabidopsis plants using an anti-Myc antibody. Ponceau S staining of the Rubisco large subunit band was used as a loading control. WT, wild-type (Col-0); OE1, OE2, and OE3, three independent AmERF2-overexpressing lines.

(E) Drought stress assay of 35S::AmERF2-6Myc transgenic A. thaliana plants. Ten-day-old seedlings were transplanted to soil and subjected to drought treatment for 16 days without watering in a 22°C growth chamber with a 16-h light/8-h dark photoperiod. After the drought treatment, the plants were rewatered for 2 days to allow for recovery.

Among the PEG-induced ERF genes, ERF2 was found to be unique to the genus Ammopiptanthus. In a phylogenetic tree of ERF2 and its homologs in legume species, as well as the outgroup taxa grape and Arabidopsis, AmERF2 (Chr07.g34904) of A. mongolicus and AnERF2 (EVM0018613) of A. nanus formed a distinct clade, separate from homologous genes in other legume species (Supplemental Figure 9). Despite conservation of the AP2 domain among these plants, ERF2 displayed unique sequence patterns in its C terminus (Supplemental Figure 10). Microsynteny analysis revealed that ERF2 was positioned between the Aminophospholipid ATPase 2 (ALA2) and Tyrosine aminotransferase 2 (TAT2) genes in grape (Vitis vinifera), A. mongolicus, and A. nanus. By contrast, no genes were found between ALA2 and TAT2 in syntenic regions of typical legume plants such as G. max, M. truncatula, and Cajanus cajan (Figure 6B). Grape, which has not undergone WGD since the formation of eudicots, is often considered to have retained ancestral traits and genetic organization (Jaillon et al., 2007). By contrast, legumes have experienced at least one WGD since 58 mya, resulting in the presence of two copies of ancestral genes. Phylogenetic and synteny analyses suggested that ERF2 is most likely an ancestral gene that has lost one copy during the evolution of most legume species, whereas both copies (Chr07.g34904 and Chr04.g18700) were retained during the evolution of the genus Ammopiptanthus. We therefore heterologously expressed a Myc-tagged AmERF2 (Chr07.g34904) gene in Arabidopsis (Col-0) under the control of the CaMV 35S promoter and evaluated the drought-tolerance of the resulting transgenic plants. Three successful homozygous transgenic lines (OE1, OE2, and OE3) were confirmed by western blot analysis of transgene expression (Figure 6D). After 10-day-old soil-grown seedlings of the wild type (WT, Col-0) and the AmERF2-overexpressing lines were exposed to drought for 16 days and rewatered for 2 days for recovery, the AmERF2-6Myc transgenic plants all survived, whereas the WT and empty-vector-transformed control plants (35S::6Myc) all died (Figure 6E). To determine whether the high drought tolerance of the AmERF2-overexpressing lines was due to high levels of ethylene, we determined ethylene content by measuring the ethylene precursor 1-aminocyclopropane-1-carboxylic acid (ACC) by gas chromatography–tandem mass spectrometry under normal and drought-stress conditions. The results showed that ACC levels were much higher in the AmERF2-overexpressing lines than in the WT under normal conditions (Supplemental Figure 11). Under drought conditions, ACC content increased further in the WT plants, but no further increase was observed in the AmERF2-overexpressing lines (Supplemental Figure 11). These results suggested that overexpression of AmERF2 resulted in a high level of ethylene, which contributed to the better drought tolerance of AmERF2-overexpressing lines.

ABA is known to be involved in the regulation of plant water status. Indeed, PEG treatment of A. mongolicus resulted in significant upregulation of two ABA biosynthetic genes, NCED3 (Chr09.g44316) and AAO3 (Chr07.g34300), and an ABA-inducible gene, HAI3 (Chr06.g29412), as well as downregulation of the ABA catabolic gene CYP707A2 (Chr06.g31923) (Supplemental Table 29). Similarly, expression of the JA biosynthesis gene AOS (Chr06.g31606) was also significantly upregulated during osmotic stress. By contrast, expression of genes related to the biosynthesis of growth-promoting hormones such as brassinosteroid (BR) and auxin was suppressed by osmotic-stress treatment. These included two BR biosynthetic genes, DWF4 (Chr01.g05194) and CPD (Chr02.g10433), and two auxin biosynthetic genes, TAA1 (Chr08.g41747) and YUC (Chr07.g38214) (Supplemental Table 29). To confirm the changes in expression of these hormone-related genes during drought treatment, 15 genes involved in phytohormone biosynthesis, catabolism, and signaling were selected for validation by real-time qRT–PCR (Supplemental Tables 29 and 30). The expression patterns of these genes were consistent in the qRT–PCR and transcriptomic analyses (Supplemental Figure 12), indicating the reliability of our transcriptomic data (Supplemental Table 29).

Discussion

By combining multiple sequencing strategies, we successfully assembled a chromosome-level genome of A. mongolicus. Initial Illumina shotgun reads did not yield satisfactory assembly results, with a contig N50 of only 138.6 kb (Supplemental Table 31), and PacBio long-read sequencing data were therefore used to improve the contig N50 to 2.9 Mb (Table 1). Hi-C sequencing was performed to generate chromosome-scale scaffolds, and 98.63% of the assembly was anchored onto the nine chromosomes of A. mongolicus (Supplemental Table 6). This approach effectively captured chromosomal interactions and led to a high rate of valid interaction pairs. The resulting high-level chromosomal assembly highlights the importance of capturing interacting chromatin in Hi-C-based genome assemblies.

Repetitive sequences dominate the A. mongolicus genome, and retrotransposons are the dominant type of repetitive element, among which Ty3/Gypsy elements are twice as abundant as Ty1/Copia elements (Figure 4A). By contrast, a higher Ty1/Copia-to-Ty3/Gypsy ratio was observed in the L. angustifolius genome (Hane et al., 2017). Hence, the burst of LTR-RTs in A. mongolicus must have occurred after its divergence from L. angustifolius at ∼28 mya. Indeed, insertion time analyses of LTR-RTs indicated that they have expanded mainly within the last 2.5 million years. We also observed that A. mongolicus has a larger average chromosome size than other legumes such as soybean and barrel medic (Figure 2A). The larger genome size of A. mongolicus may be due to the expansion of retrotransposons, similar to the LTR-RT expansion in Pisum sativum (Kreplak et al., 2019). However, the two species have different LTR-RT compositions. The major LTR-RT lineage in P. sativum is Ty3/Gypsy Ogre, whereas the dominant LTR-RT lineages in A. mongolicus are Ty3/Gypsy Tekay and Reina. Genome sequencing of Ginkgo biloba, another representative relict plant, revealed that over 80% of its genome consists of repetitive sequences (Liu et al., 2021). By contrast, a recent study on the relict plant Cyclocarya paliurus revealed only 48.1% repetitive sequences in its genome (Qu et al., 2023). These findings indicate that a high percentage of repetitive sequences is not a universal characteristic of relict plants. Functional enrichment and expression analyses were performed for LTR-RT-related genes to explore their potential relevance to stress tolerance in A. mongolicus. However, no clear patterns were observed between LTR-RT-related genes and non-LTR-RT-related genes in the majority of leaf samples (Supplemental Figure 8C). Further studies are necessary to gain a comprehensive understanding of the role of LTR-RTs and their associated genes in enhancing the drought tolerance of A. mongolicus.

Several transcriptomic studies have been performed on A. mongolicus under different abiotic stresses (Zhou et al., 2012; Liu et al., 2013; Pang et al., 2013, 2015; Wu et al., 2014; Gao et al., 2015, 2016). Significant changes in gene expression have been observed under single or combined stress conditions, and some biological pathways have been identified. Nonetheless, RNA-sequencing-based transcript assemblies have inevitably led to incorrect gene-model predictions owing to the abundance of paralogous genes in plant genomes, resulting in large discrepancies in reported gene numbers. For example, 11 357 unigenes have been reported for A. mongolicus (Pang et al., 2015), which is far fewer than the average gene number (40 200) of legume species (Supplemental Table 15). The reference-grade genome for A. mongolicus thus enables more accurate gene structure annotations and forms a reliable basis for the study of epigenetic regulation, which is another important mechanism of plant adaptation to harsh environments.

As evergreen xerophytes, Ammopiptanthus species have unique morphological features compared with other legumes. A thick waxy cuticle and sparse stomata are excellent transpiration barriers, and the succulent leaves formed by multiple mesophyll cell layers increase water storage (Han and Li, 1992). Interestingly, Rhazya stricta, another evergreen shrub in the Dogbane family that is mainly found in arid regions of the Middle East, has a leaf structure similar to that of Ammopiptanthus (Schuster, 2016). This morphological similarity could be the result of convergent evolution and implies that cuticular wax is a key factor in plant adaptation to arid environments. In this study, most genes involved in cuticular wax biosynthesis or transport in A. mongolicus were specifically expressed in leaves and were significantly upregulated by PEG-induced dehydration stress. Some of these genes, such as KCR, CER4, and MAH1, were expanded as tandem gene duplicates (Figure 5). LTP genes were also highly expressed in leaves, even under normal growth conditions, and were further upregulated by PEG treatment. LTPs are required for the assembly of a waterproof lipid barrier, as they can transfer and deposit cutin and cuticular waxes (Edqvist et al., 2018). Cuticular wax consists of complex mixtures of hydrophobic VLCFAs and their derivatives. Triterpenoids are the major components of cuticular wax in plants such as Sorghum bicolor (Busta et al., 2021). Recently, Ma et al. (2024) analyzed genes related to cuticular wax biosynthesis and transport in several xerophytic plants, including Zygophyllum xanthoxylum and A. nanus. They observed a slight expansion of LACS and cytochrome b5 (CYBT5) gene copy number compared with that in the non-xerophyte plants Arabidopsis and grape. The expansion of these wax-related genes may contribute to the water-retention capacity of these xerophytes in desert environments (Ma et al., 2024). Further studies on the metabolome or chemical analyses of A. mongolicus should provide more details about the development of its cuticular wax.

Ethylene is a gaseous phytohormone released in response to various biotic and abiotic stresses/stimuli (Pei et al., 2017). In higher plants, ethylene is synthesized from S-adenosylmethionine (SAM), an activated form of methionine (Met), via ACC. Two key enzymes, ACC synthase (ACS) and ACC oxidase (ACO), are responsible for the synthesis of ACC and its conversion into ethylene, respectively (Yang and Hoffman, 1984). The expression of three ACS genes (Chr01.g00583, Chr01.g01008, and Chr06.g030417) and three ACO genes (Chr01.g00435, Chr06.g29235, and Chr06.g29237) was significantly induced by PEG treatment in A. mongolicus (Figure 6A and 6B), indicating that ethylene biosynthesis was induced by dehydration stress. Drought-induced ethylene accumulation due to upregulation of ACS genes has been reported previously in Arabidopsis (Urano et al., 2017). Ethylene action is triggered by signal perception by membrane-localized receptors (Hall et al., 2012), which initiate a signal transduction pathway involving the key transcription factors Ethylene Insensitive 3 (EIN3) and EIN3-like1 (EIL1), as well as downstream ERFs, resulting in the expression of ethylene-responsive genes (Merchante et al., 2013). Members of the ERF subfamily have been shown to regulate plant resistance to various abiotic stresses, including drought, salinity, and chilling stress (Quan et al., 2010; Rong et al., 2014; Wu et al., 2022). In the present study, we found that the AmERF2 gene was ancestral and unique to A. mongolicus (Figure 6C, Supplemental Figure 9) and that its expression was significantly upregulated under PEG-induced dehydration stress (Figure 6B). We therefore speculate that this unique gene may play an important role in the drought tolerance of A. mongolicus. Indeed, heterologous overexpression of AmERF2 in Arabidopsis significantly enhanced the drought tolerance of the transgenic plants (Figure 6D and 6E). These results demonstrate the important role of ethylene in inducing plant drought resistance.

Significant induction under PEG treatment was also observed for genes related to other stress hormones such as ABA and JA (Supplemental Figure 12). All these results indicate that the drought tolerance of A. mongolicus is controlled by sophisticated mechanisms, likely involving the action of multiple stress hormones. The complete mechanisms of A. mongolicus drought tolerance await more detailed analyses of all the drought-responsive genes and pathways.

Methods

Plant materials and growth conditions

A. mongolicus seeds were collected in Bayanhot, Inner Mongolia, China, from July 2015 to July 2019. They were surface sterilized with 70% ethanol for 10 min, washed three times with distilled water, placed in bleach for 30 min, and washed five times with distilled water. Sterilized seeds were germinated on agar plates supplemented with half-strength Murashige and Skoog (½ MS) medium (20 seeds/Petri dish, 10 cm × 25 mm) in the dark at 28°C for 2 days. Germinated seedlings were transferred into Erlenmeyer flasks with ½ MS agar and grown in the dark for 7 days before being flash-frozen in liquid nitrogen and stored at −80°C.

Illumina shotgun sequencing

Total DNA was extracted from etiolated seedlings using a modified Cetyltrimethylammonium bromide (CTAB) method (Feng et al., 2019). Five grams of tissue powder ground in liquid nitrogen were mixed with 15 mL of extraction buffer (100 mM Tris–HCl [pH 8.0], 150 mM NaCl, 1% [w/v] SDS, 100 mM EDTA, and 4% [w/v] polyvinylpyrrolidone) and incubated in a 65°C water bath for 1 h. An equal volume (15 mL) of phenol:chloroform:isoamyl alcohol (25:24:1) was added to the tissue mixture for DNA isolation. After centrifugation at 12 000 g for 10 min, the supernatant was transferred into a new centrifuge tube and mixed with half the volume of precooled isopropanol. Crude DNA was pelleted by centrifugation at 12 000 g for 10 min and re-dissolved in 1 mL of TE buffer (10 mM Tris, 1 mM EDTA, pH 8.0). RNA contamination was removed by addition of 50 μL RNase A (Omega Bio-Tek, catalog no. AC117, 25 mg/mL) to 1 mL DNA solution. DNA quantity and quality were assessed using a Qubit 3.0 Fluorometer (Thermo Fisher Scientific, Waltham, MA, USA) and agarose gel electrophoresis, respectively. DNA was then purified using the Aurora DNA Clean-up System (Boreal Genomics, Vancouver, BC, Canada) following the manufacturer’s protocol. The purified DNA was dissolved in 100 μL of 0.25× TBE buffer (89 mM Tris, 89 mM boric acid, and 2 mM EDTA) for later use.

Eight laddered DNA libraries were constructed, with insert sizes of 180 bp, 260 bp, 500 bp, 800 bp, 2 kb, 5 kb, 10 kb, and 20 kb. Small-insert libraries (180–800 bp) were constructed and sequenced by Macrogen (South Korea). For these libraries, 10 μg of DNA was sheared into fragments of 180–800 bp using a Bioruptor Pico. After end repair, A-tailing, and adapter ligation, the ligated fragments were size selected at 180, 260, 500, and 800 bp on an agarose gel and amplified by ligation-mediated PCR to produce the corresponding short-insert libraries for sequencing on the HiSeq 2500 system (Illumina, San Diego, CA, USA) in PE150 mode.

Mate-pair libraries were constructed by BGI Shenzhen (Beijing Genomics Institute, Shenzhen, China). In brief, 20–40 μg of genomic DNA was sheared to the desired insert size using nebulization (for 2 kb) or HydroShear (Covaris, Woburn, MA, USA) (for 5, 10, and 20 kb). Next, the DNA fragments were end repaired using biotinylated nucleotide analogs, size selected, and circularized via intramolecular ligation. Circular DNA molecules were then sheared with Adaptive Focused Acoustics (Covaris, Woburn, MA, USA) to an average size of 500 bp. Biotinylated fragments were purified with magnetic beads (Invitrogen, CA, USA), end repaired, A-tailed, ligated to Illumina paired-end adapters, size selected again, and purified by ligation-mediated PCR. Sequencing was performed on the HiSeq 2500 system in PE100 mode by BGI Shenzhen.

PacBio SMRT sequencing

For PacBio long-read sequencing, cell nuclei were first extracted from young leaves (1.5 g) of a single 1-year-old plant as described previously (Weng et al., 2018). Nuclear DNA was isolated using CTAB extraction buffer (100 mM Tris–HCl, 25 mM EDTA, 1.5 M NaCl, and 3% CTAB). DNA libraries for SMRT PacBio genome sequencing were constructed following standard protocols (Pacific Biosciences, Menlo Park, CA, USA). In brief, genomic DNA was sheared to ∼20-kb targeted size, followed by damage repair and end repair, blunt-end adaptor ligation, and size selection. Finally, the libraries were sequenced on the PacBio Sequel II platform (Beijing Berry Genomics, China).

Bionano optical mapping

To identify a suitable nicking enzyme, the draft genome sequence was scanned using LabelDensityCalculator v1.3.0 (https://bionano.com/software-downloads/), and the enzyme Nt.BspQl (New England Biolabs, Ipswich, MA, USA) was chosen for labeling high-molecular-weight DNA. A label density of 9–15 labels per 100 kb of DNA was preferred. After loading onto the IrysChip, nicked and fluorescently labeled long DNA strands were linearized and imaged using the Irys single-molecule genome mapping system (Bionano Genomics, San Diego, CA, USA) to generate single-molecule physical maps. Nb.BssSI (New England Biolabs) was also used for labeling genomic DNA, which was then read using the Bionano Saphyr system at Berry Genomics.

Hi-C sequencing

For Hi-C sequencing, 2 g of young leaves from 2-month-old A. mongolicus was cross-linked with 40 mL of 2% formaldehyde at room temperature for 15 min and then quenched by adding 4.324 mL of 2.5 M glycine. A Hi-C library was constructed according to the method described in Shi et al. (2019). DNA was digested overnight with a 4-bp cutter restriction enzyme (400 units of MboI) at 37°C on a rocking platform. Hi-C libraries were amplified by 10–12 cycles of PCR and sequenced using the Illumina HiSeq instrument in 2 × 150-bp read mode. Sequencing was performed by ANNOROAD Gene Technology Beijing (China).

Genome assembly

All PacBio long-read data were self-corrected and used for assembly using Canu v2.1 (Koren et al., 2017) with the parameters genomeSize = 930m, useGrid = false, errorRate = 0.02, corOutCoverage = 30, corMhapSensitivity = normal, minReadLength = 4000, and corMinEvidenceLength = 4000. Because the PacBio reads had an error rate of ∼15%, 114 Gb of Illumina shotgun reads from paired-end libraries were used to correct assembly errors from the PacBio reads with Pilon v1.22 (Walker et al., 2014). Bionano Solve v3.3 was used for scaffold assembly with the optical mapping data. To anchor scaffolds onto superscaffolds (or chromosomes), the Hi-C reads were mapped to scaffolds using Juicer v1.6 (https://github.com/aidenlab/juicer) with default parameters. The Hi-C assembly pipeline 3D-DNA v180922 (https://github.com/theaidenlab/3d-dna) was applied to cluster scaffolds using the agglomerative hierarchical clustering method with an interaction matrix between sequences (Dudchenko et al., 2017). The Hi-C interaction map was imported into Juicebox (https://github.com/aidenlab/Juicebox/wiki/Download) for manual checking and adjustment as described in https://www.youtube.com/watch?v=Nj7RhQZHM18. The Hi-C data were then re-aligned to the nine pseudo-chromosomes with HiC-Pro v2.11.4 (Servant et al., 2015). Finally, an interaction heatmap was generated using Juicebox_1.11.08 (https://github.com/aidenlab/juicer/wiki/Download).

Identification of repetitive sequences

Repetitive DNA regions include TEs (or interspersed repeats) and tandem repeats. RepeatModeler v2.0.1 (www.repeatmasker.org) was used with default parameters for ab initio prediction of repetitive DNA regions in the genome (Price et al., 2005). These DNA repeats were then mapped to Repbase (Dfam v3.2, https://www.dfam.org/releases/Dfam_3.2/families/Dfam.h5.gz) to determine whether they had homologous sequences in other species using RepeatMasker (Tarailo-Graovac and Chen, 2009).

Gene prediction

Gene prediction was performed on the repeat-masked genome using the EVidenceModeler pipeline (Haas et al., 2008), integrating protein sequences from closely related or remote species and RNA sequencing (RNA-seq) data from A. mongolicus for homology-based gene prediction using GeMoMa-1.6.1 (Keilwagen et al., 2016). Several well-annotated legume genome and protein datasets from six closely related species, G. max (Gmax_508_Wm82.42.v1.protein.fasta, Phytozome v13), Glycine soja (v1, http://www.wildsoydb.org/Gsoja_W05/), M. truncatula (Mtruncatula_285_Mt4.0v1.protein.fasta, Phytozome v13), Lotus japonicus (v3.0, http://www.plantgdb.org/XGDB/phplib/download.php?GDB=Lj), L. angustifolius (LupAngTanjil_v1.0.pep.all.fa.gz, https://plants.ensembl.org), and A. nanus (http://gigadb.org/dataset/100466). The PacBio isoform sequencing data from A. mongolicus were processed using SMRT analysis and clustered to reduce redundancy (Supplemental Table 32). Transcriptomic shotgun reads from roots and leaves were aligned to the genome using HISAT v2.0.4 (Pertea et al., 2016), and transcripts were reconstructed with Cufflinks v2.2.1 (Trapnell et al., 2012). Open reading frames (ORFs) were predicted with PASA v2.0.1 (Haas et al., 2003) using isoform sequences and RNA-seq transcripts as input and with the criteria for each ORF set to contain 100–1000 amino acids and at least two exons. The cDNA sequences of ORFs were used as a training set for downstream gene prediction. Ab initio gene prediction was performed with Augustus v3.0.3, SNAP, GlimmerHMM, and GeneMark-ET v4.57. All homology-based and ab initio-based gene predictions were integrated using EVidenceModeler (Haas et al., 2008).

Prediction of non-coding RNAs

snRNAs and microRNAs in the genome were identified by searching against the Rfam database (v14.1) with INFERNAL v1.1.2 (Nawrocki and Eddy, 2013) using the following parameters: -Z 1686 --cut_ga --rfam --notrunc –nohmmonly. tRNAs were detected with tRNAscan v2.0.9 using the general tRNA model (corresponding parameter: -G) (Chan and Lowe, 2019). Because rRNA genes (5S, 5.8S, 18S, and 28S) are highly conserved among plants, they were predicted in the genome by searching for the rRNA genes of G. max (GenBank: X15199.1, X02623.1, and XR_003264281.1) and A. thaliana (AT2G01020.1) using BLASTn v2.7.1 (Camacho et al., 2009).

Identification of syntenic blocks

Syntenic blocks between genomes were identified using the JCVI pipeline (https://github.com/tanghaibao/jcvi). All-versus-all BLASTP was performed to identify paralogous or orthologous gene pairs with an E-value cutoff of 1e−5. Collinear blocks containing at least five genes were identified using MCScanX with default parameters (Wang et al., 2012).

Identification of orthologous genes

Sixteen plant species were used for comparative analyses, including the legumes A. ipaensis, L. angustifolius, A. mongolicus, A. nanus, M. truncatula, Cicer arietinum, P. sativum, Trifolium pratense, L. japonicus, Vigna angularis, V. unguiculata, V. radiata, G. max, G. soja, and P. vulgaris and the outgroup A. thaliana. OrthoFinder v2.5.2 (Emms and Kelly, 2018) was used with default parameters to detect orthologous groups among the 16 plant species. All-versus-all protein alignment with a default E-value cutoff of 1e−3 was performed using Diamond v0.9.22 (Buchfink et al., 2015). Orthologous protein groups were clustered using Markov Cluster Algorithm v14-137 (Enright et al., 2002).

Calculation of 4DTv rates and Ks rates

The protein sequences of each orthologous/paralogous gene pair were aligned using MAFFT v7.427 with the following parameters: –quiet –clustalout (Katoh and Standley, 2013). The corresponding codon alignment was obtained using PAL2NAL v14 (Suyama et al., 2006), and columns with gaps were removed using the parameter –nogap. 4DTv rates between gene pairs located in syntenic blocks were calculated using custom Perl scripts (https://github.com/JinfengChen/Scripts/blob/master/FFgenome/03.evolution/distance_kaks_4dtv/bin/calculate_4DTV_correction.pl). The 4DTv value of each orthologous/paralogous gene pair was calculated as the ratio of transversion numbers in four-fold third codons to the total number of four-fold third codons. After obtaining the 4DTv values of all gene pairs, their distribution curve was visualized using the geom_density function of the ggplot2 package in RStudio. The Ks value of each paralogous/orthologous gene pair was calculated using KaKs Calculator v3.0 (Zhang, 2022).

Phylogenetic tree construction and divergence time estimation

The protein sequences of all 322 single-copy orthologous genes inferred by OrthoFinder were concatenated into a supergene for each species using custom shell scripts. The supergenes of the 16 species were aligned using MAFFT v7.427 with default parameters (output format: fasta). After alignment and gap removal, 74 865 amino acids remained for each species. RAxML (Randomized Axelerated Maximum Likelihood) v8.2.9 was used to build a phylogenetic tree with the following parameters: raxmlHPC-PTHREADS -s proteins.alignment.fasta -n raxml.out -m PROTGAMMAAUTO -p 12345 -x 12345 -# 100 -f ad -T 20. Divergence times were estimated using the mcmctree function in PAML with the following parameters: burn-in = 5 000 000, sample-number = 1 000 000, sample-frequency = 50. The divergence time between soybean (G. max) and common bean (P. vulgaris) in the Milletoid clade at 20.1 (19.0–21.0) mya was used to calibrate the systematic analysis (Zheng et al., 2016).

Identification of positively selected genes

Single-copy genes from eight legumes (A. ipaensis, A. mongolicus, P. sativum, M. truncatula, L. japonicus, V. angularis, P. vulgaris, and C. cajan) were identified using OrthoFinder with default parameters. Codon alignment of the resulting 3195 single-copy genes was performed using Clustal W2 and PAL2NAL. An unrooted phylogenetic tree for the eight legumes was used to detect positively selected genes, with the A. mongolicus lineage set as the foreground branch. Positive selection was detected using the branch-site model in the Codeml package of PAML (Liu et al., 2014).

Reconstruction of ancestral legume genomes

Ancestral legume genomes were reconstructed according to the method of Yang et al. (2021). In addition to A. mongolicus, five other legume species were also used for ancestral genome reconstruction: soybean (G. max), common bean (P. vulgaris), barrel medic (M. truncatula), white lupin (L. albus), and the wild peanut ancestor (A. ipaensis) (Supplemental Table 15). Only proteins that were anchored to chromosomes were retained for identification of orthologous genes using OrthoFinder v2.5.2 (https://github.com/davidemms/OrthoFinder/releases) with the following parameters: -t 8 -S diamond. Next, syntenic blocks were identified using DRIMM-synteny (Pham and Pevzner, 2010) with a cycle length threshold of 20 and a dust threshold of 19. The output file from DRIMM-synteny was converted into a GRIMM-like format (Tesler, 2002) using processDRIMM (https://github.com/xjtu-omics/processDrimm). Intermediate ancestor genomes were inferred using IAGS (https://github.com/xjtu-omics/IAGS). On the basis of the phylogeny and WGD events in legumes, the multicopy GHHP model was used for ancestor 4 and ancestor 6, whereas the other four ancestors were inferred using the multicopy GMP model (Supplemental Table 33).

Identification of intact LTR-RTs

LTR-RT libraries were constructed using LTR_FINDER_parallel v1.1 (https://github.com/oushujun/LTR_FINDER_parallel) and LTRharvest v1.6.1 (Ellinghaus et al., 2008). The parameters of LTR_FINDER_parallel were: -threads 10 -harvest_out -size 1 000 000 -time 300. The parameters of LTRharvest were: -minlenltr 100 -maxlenltr 7000 -mintsd 4 -maxtsd 6 -motif TGCA -motifmis 1 -similar 85 -vic 10 -seed 20 -seqids yes. Intact LTR-RT sequences were then identified and isolated using LTR_retriever v2.9.0 (Ou and Jiang, 2018) with the following parameters: -inharvest -threads 10.

Estimation of LTR-RT insertion time

For each intact LTR-RT, the DNA sequences of the two LTRs were extracted using bedtools v2.30.0 (https://github.com/arq5x/bedtools2) on the basis of the LTR_retriever results. These two DNA sequences were then aligned using Muscle v3.8.31 (https://www.drive5.com/muscle/) with default parameters. The distance between the two sequences was calculated using the Kimura two-parameter model. The insertion time of the LTR-RT was calculated following the manual at https://github.com/SIWLab/Lab_Info/wiki/Ageing-LTR-insertions using the formula T = K/(2r), where T is the insertion time, K is the divergence distance between two LTRs in an LTR-RT, and r is the synonymous nucleotide substitution rate, or synonymous mutations/site/year (Jedlicka et al., 2020). In legumes, r has been determined to be 7e−9 (Jing et al., 2005). The distribution of insertion times was plotted using the geom_histogram function of the ggplot2 package.

Classification and phylogenetic tree construction of LTR-RTs

The clade-level classification of LTR-RTs was performed using TEsorter v1.3 (https://github.com/zhangrengang/TEsorter; Zhang et al., 2019) with the following parameters: -db rexdb-plant. TEsorter was also used to identify functional domains in LTR-RTs, such as the three most conserved polyprotein domains: RT, ribonuclease H, and integrase. The RT domains of LTR-RTs were collected and clustered using cd-hit v4.8.1 (Li and Godzik, 2006) with the parameter -sc. Evolutionary analyses were performed using MEGA X (Kumar et al., 2018), and the phylogenetic tree was inferred by the maximum likelihood method with the Jones-Taylor-Thornton (JTT) matrix-based model. Phylogenetic trees were polished using FigTree v1.4.4 (https://github.com/rambaut/figtree/).

Transcriptome sequencing and data analysis

A. mongolicus seeds were surface sterilized with bleach for 30 min and germinated in ½ MS for 3 days at 28°C. Young seedlings were transplanted into pots filled with vermiculite and watered with half-strength Hoagland’s solution. The pots were placed in a greenhouse at 25°C under long-day conditions (16-h light/8-h dark). Two months after germination, the seedlings were divided into two groups: one group was transferred to fresh half-strength Hoagland’s solution (as a control), and the treatment group was transferred to half-strength Hoagland’s solution supplemented with 15% PEG6000 to mimic drought stress. During treatment, the leaves and roots of the seedlings were harvested at 0, 2, 24, and 48 h, with three independent biological replicates per treatment and time point. Samples were collected for RNA-seq, flash frozen in liquid nitrogen, and stored at −80°C.

Total RNA was extracted from frozen plant tissues using TRIzol reagent (Takara Bio, Beijing, China) according to the manufacturer’s protocol, dissolved in Diethyl pyrocarbonate (DEPC)-treated water, and stored at −80°C. RNA quality was measured using a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA) and an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA). Strand-specific RNA-seq libraries were constructed according to the method described by Horvath et al. (2015). In brief, cDNA libraries were constructed using the Illumina TruSeq RNA preparation kit (Illumina, San Diego, CA, USA). After second-strand cDNA synthesis, the cDNAs were fragmented, ligated with adapters, and amplified by PCR. Quantitated cDNA libraries were mixed, loaded into two lanes of a flow cell, and sequenced on an Illumina HiSeq 4000 system in paired-end 100-bp mode (BGI, Shenzhen, China).

Transcriptome data were analyzed as described by Feng et al. (2023). In brief, high-quality reads were mapped to the assembled reference genome using HISAT2 v2.0.5 (--rf --trim5 10) (Kim et al., 2015). The fragments per kilobase per million mapped reads (FPKM) value of each gene was calculated using StringTie v1.3.3 (-e -B) (https://ccb.jhu.edu/software/stringtie/). Comparisons between groups were performed with the Ballgown package in Bioconductor. Differentially expressed genes (DEGs) were defined as |Fold Change| > 1.5 with p < 0.05.

Gene sequences were mapped to the SwissProt database (ftp://ftp.ncbi.nih.gov/blast/db/FASTA/swissprot.gz) using the blastx function in Diamond (Buchfink et al., 2015). Each gene annotated by SwissProt was then associated with GO terms using idmapping datasets (ftp://ftp.pir.georgetown.edu/databases/idmapping/idmapping.tb.gz). The GO annotation of A. mongolicus was constructed using the AnnotationForge package in Bioconductor. GO enrichment analyses of the DEGs were performed with the clusterProfiler package in Bioconductor.

Identification of putative genes involved in ethylene biosynthesis and signaling

Ethylene biosynthesis and signaling pathways have been well studied in model plants such as A. thaliana (Dubois et al., 2018). A specific set of ethylene-related genes from Arabidopsis, including SAM (AT1G02500), ACS (AT4G11280), ACO (AT1G62380), ETR1/EIN1 (AT1G66340), CTR1 (AT5G03730), EIN2 (AT5G03280), and EIN3 (AT3G20770), was chosen as the reference dataset. To identify ethylene-related genes in A. mongolicus, a BLASTp search (E value <1e−10) was conducted against the A. mongolicus proteome database using these reference genes from A. thaliana. The ERF gene family was identified by searching for the AP2 domain (Pfam accession: PF00847) within the A. mongolicus proteome database using the hmmscan function in HMMER version 3.1b2.

Identification of cuticular wax biosynthesis genes

Wax-related genes in A. mongolicus were identified by a BLASTp search (E value <1e−10) against the A. mongolicus proteome database using previously reported wax-related genes from A. thaliana (Dimopoulos et al., 2020). The biosynthetic pathway shown in Figure 5 was redrawn from Bernard and Joubès (2013), Zhao et al. (2016b), and Xue et al. (2017). A heatmap of gene expression levels in PEG-treated 2-month-old A. mongolicus leaves and roots was plotted using the pheatmap function in R 4.0.3. Gene expression values in all samples were normalized using the formula log10(FPKM + 1).

Validation of DEGs by real-time qRT–PCR

Reverse transcription and qRT–PCR were performed according to the methods described by Feng et al. (2023). Total RNA was extracted from plant tissues using the EZNA Total RNA Kit II (Omega BIO-TEK). After quantification with the NanoDrop 2000 instrument, 1 μg of total RNA from each sample was reverse transcribed with random hexamer primers using the RevertAid First Strand cDNA Synthesis Kit (Thermo Scientific) according to the manufacturer’s instructions. The cDNA products were diluted 10-fold with sterilized Milli-Q water and used as templates. Each reaction mixture (20 μL) contained 2 μL of cDNA template, 4 μL of forward primer (1 μM), 4 μL of reverse primer (1 μM), and 10 μL of 2× premixed reagent (SYBR Premix Ex Taq II, Tli RNase H Plus, TaKaRa). A two-step PCR protocol was used in the CFX96 Touch Real-Time PCR Detection System (Bio-Rad) with the following procedure: 95°C for 3 min, followed by 39 cycles of 95°C for 10 s and 60°C for 30 s. The housekeeping gene AmEIF (eukaryotic translation initiation factor, Chr02.g10509) was used as the internal standard. Primer sequences for qRT–PCR are listed in Supplemental Table 30. Three independent biological replicates were prepared for each sample. The relative expression of each gene of interest was calculated using the 2−ΔΔCt method (Livak and Schmittgen, 2001). Student’s t-test was used to evaluate the significance of changes.

Construction of transgenic plants and assessment of drought resistance

The cDNA sequence of AmERF2 (Chr07.g34904) was amplified and cloned into the pCanG-MYC vector to construct 35S::AmERF2-6Myc plasmids, which were then introduced into Agrobacterium strain GV1301 for transformation of A. thaliana ecotype Col-0 using the floral dip method (Zhang et al., 2006). The primers used for PCR amplification of the gene sequences are listed in Supplemental Table 34. Successful transgenic plants were validated by western blotting using mouse anti-Myc primary antibody (catalog no. 631206, Clontech) and anti-mouse IgG (H + L) secondary antibody (catalog no. 32430, Thermo Scientific). The Ponceau S-stained Rubisco large subunit band on the western blot membrane was used as a loading control.

Ten-day-old Arabidopsis seedlings (WT [Col-0], empty vector-transformed [35S::6Myc], and AmERF2 overexpressors) were transplanted to soil and cultivated in a 22°C growth chamber (16-h light/8-h dark) without watering for 16 days. Watering was resumed for 2 days, and the survival rates and phenotypic responses were recorded.

Detection of ACC content

One-week-old seedlings of WT (Col-0) and AmERF2-overexpressing lines were planted in potted soil and grown at 22°C with a 16-h light/8-h dark cycle. No additional water was provided until all the WT plants had become dehydrated. The leaves were collected and frozen in liquid nitrogen, and 50 mg of tissue was used for ACC detection by gas chromatography–tandem mass spectrometry according to previously described methods (Peng et al., 2022). A stable isotope analog of ACC ([2H4]ACC) was used as an internal standard in each sample. A series of gradient-diluted ACC standards were used as references for quantification. The significance of changes was evaluated using Student’s t-test. The experiments were repeated three times.

Data and code availability

The data supporting the findings of this study are provided in the supplemental information files. The genome assembly and annotations have been deposited at the Chinese National Genomics Data Center (https://ngdc.cncb.ac.cn/) under BioProject accession number PRJCA024714. Raw sequencing data have been deposited in the NCBI Sequence Read Archive database (http://www.ncbi.nlm.nih.gov/sra) under accession number PRJNA1085185. The genome sequences and annotations have also been posted on our website https://ammopiptanthus.github.io/ for data download.

Funding

This work was supported by the National Natural Science Foundation of China (NSFC) (no. 91125027); GRF grants (CUHK codes 14148916 and 14104521); AoE grants (AoE/M-05/12 and AoE/M-403/16) from the Research Grants Council (RGC) of Hong Kong; the NSFC-RGC Joint Scheme (N_CUHK452/17); the National Key Research and Development Program, Key Innovative and Collaborative Science and Technology Scheme for Hong Kong, Macau, and Taiwan (2017YFE0191100); direct grants from the Chinese University of Hong Kong; and the China Postdoctoral Science Foundation (2023M741234).

Author contributions

J.-X.H., H.-M.L., and L.-Z.A. conceived the project. L.-Z.A., J.-X.H., L.F., X.-L.Y., L.-F.G., G.-H.M., C.-M.W., S.-W.T., S.-N.T., J.-C.Z., and Y.L. collected the seed resources and prepared samples for sequencing. B.-J.Z. and W.-C.Y. performed flow cytometry and FISH experiments. L.F. and S.-W.T. performed the high-molecular-weight DNA purification and part of the Bionano optical mapping experiments. N.L. and L.F. performed gene overexpression experiments in Arabidopsis. L.F., F.T., and T.-Q.D. assembled the draft genomes. L.F. and J.-C.Z. performed chromosome assembly, genome annotation, evolutionary analyses, transcriptome data analyses, and qRT–PCR assays and organized all data visualization. W.S. provided logistics and coordination between the project team and the sequencing company (BGI). L.F. and J.-X.H. wrote the manuscript. L.-Z.A., S.-M.N., Y.-M.J., and Y.-L.Z. provided critical discussions and comments on the manuscript. J.-X.H. and H.-M.L. supervised the study and approved the final manuscript.

Acknowledgments

We thank Mr. Zhiyuan Li and Mr. Lingyuan Zeng for their help in collecting A. mongolicus and A. nanus seeds in Inner Mongolia. We thank Mr. Aisi Fu and Ms. Qiong Ding (Wuhan Institute of Biotechnology, China) for their assistance in the early stages of PacBio sequencing. We thank Mr. Weijian Huang and Mr. Wanli Liang (Beijing Berry Genomics Co. Ltd.) for assistance in genome annotation. We thank Prof. Caiji Gao (South China Normal University) for kindly providing laboratory resources and suggestions for this project. Ms. Jee-Yan Chu copy edited the manuscript. No conflict of interest is declared.

Published: April 1, 2024

Footnotes

Published by the Plant Communications Shanghai Editorial Office in association with Cell Press, an imprint of Elsevier Inc., on behalf of CSPB and CEMPS, CAS.

Supplemental information is available at Plant Communications Online.

Contributor Information

Li-Zhe An, Email: lizhean@lzu.edu.cn.

Hon-Ming Lam, Email: honming@cuhk.edu.hk.

Jun-Xian He, Email: jxhe@cuhk.edu.hk.

Supplemental information

Document S1. Supplemental Figures 1–12
mmc1.pdf (1.8MB, pdf)
Data S1. Supplemental Tables 1–34
mmc2.zip (12.4MB, zip)
Document S2. Article plus supplemental information
mmc3.pdf (6.1MB, pdf)

References

  1. Bashir M.A., Silvestri C., Ahmad T., Hafiz I.A., Abbasi N.A., Manzoor A., Cristofori V., Rugini E. Osmotin: a cationic protein leads to improve biotic and abiotic stress tolerance in plants. Plants. 2020;9:992. doi: 10.3390/plants9080992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Bernard A., Joubès J. Arabidopsis cuticular waxes: Advances in synthesis, export and regulation. Prog. Lipid Res. 2013;52:110–129. doi: 10.1016/j.plipres.2012.10.002. [DOI] [PubMed] [Google Scholar]
  3. Bertioli D.J., Jenkins J., Clevenger J., Dudchenko O., Gao D., Seijo G., Leal-Bertioli S.C.M., Ren L., Farmer A.D., Pandey M.K., et al. The genome sequence of segmental allotetraploid peanut Arachis hypogaea. Nat. Genet. 2019;51:877–884. doi: 10.1038/s41588-019-0405-z. [DOI] [PubMed] [Google Scholar]
  4. Bruneau A., Doyle J.J., Herendeen P., Hughes C., Kenicer G., Lewis G., Mackinder B., Pennington R.T., Sanderson M.J., Wojciechowski M.F., et al. Legume phylogeny and classification in the 21st century: Progress, prospects and lessons for other species-rich clades. Taxon. 2013;62:217–248. doi: 10.12705/622.8. [DOI] [Google Scholar]
  5. Buchfink B., Xie C., Huson D.H. Fast and sensitive protein alignment using DIAMOND. Nat. Methods. 2015;12:59–60. doi: 10.1038/nmeth.3176. [DOI] [PubMed] [Google Scholar]
  6. Busta L., Schmitz E., Kosma D.K., Schnable J.C., Cahoon E.B. A co-opted steroid synthesis gene, maintained in sorghum but not maize, is associated with a divergence in leaf wax chemistry. Proc. Natl. Acad. Sci. USA. 2021;118 doi: 10.1073/pnas.2022982118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Camacho C., Coulouris G., Avagyan V., Ma N., Papadopoulos J., Bealer K., Madden T.L. BLAST+: architecture and applications. BMC Bioinf. 2009;10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chan P.P., Lowe T.M. In: Gene Prediction: Methods and Protocols. Kollmar M., editor. Springer New York; 2019. tRNAscan-SE: Searching for tRNA Genes in Genomic Sequences; pp. 1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chang D., Gao S., Zhou G., Deng S., Jia J., Wang E., Cao W. The chromosome-level genome assembly of Astragalus sinicus and comparative genomic analyses provide new resources and insights for understanding legume-rhizobial interactions. Plant Commun. 2022;3 doi: 10.1016/j.xplc.2021.100263. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cheng S.H. Ammopiptanthus Cheng f. A new genus of Leguminosae from central Asia. J Bot USSR. 1959;44:1381–1386. [Google Scholar]
  11. Dimopoulos N., Tindjau R., Wong D.C.J., Matzat T., Haslam T., Song C., Gambetta G.A., Kunst L., Castellarin S.D. Drought stress modulates cuticular wax composition of the grape berry. J. Exp. Bot. 2020;71:3126–3141. doi: 10.1093/jxb/eraa046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Du J., Tian Z., Hans C.S., Laten H.M., Cannon S.B., Jackson S.A., Shoemaker R.C., Ma J. Evolutionary conservation, diversity and specificity of LTR-retrotransposons in flowering plants: insights from genome-wide analysis and multi-specific comparison. Plant J. 2010;63:584–598. doi: 10.1111/j.1365-313X.2010.04263.x. [DOI] [PubMed] [Google Scholar]
  13. Dubois M., Van den Broeck L., Inzé D. The pivotal role of ethylene in plant growth. Trends Plant Sci. 2018;23:311–323. doi: 10.1016/j.tplants.2018.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Dudchenko O., Batra S.S., Omer A.D., Nyquist S.K., Hoeger M., Durand N.C., Shamim M.S., Machol I., Lander E.S., Aiden A.P., Aiden E.L. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science. 2017;356:92–95. doi: 10.1126/science.aal3327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Edqvist J., Blomqvist K., Nieuwland J., Salminen T.A. Plant lipid transfer proteins: are we finally closing in on the roles of these enigmatic proteins? J. Lipid Res. 2018;59:1374–1382. doi: 10.1194/jlr.R083139. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Ellinghaus D., Kurtz S., Willhoeft U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinf. 2008;9 doi: 10.1186/1471-2105-9-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Emms D.M., Kelly S. OrthoFinder2: fast and accurate phylogenomic orthology analysis from gene sequences. bioRxiv. 2018 doi: 10.1101/466201. Preprint at. [DOI] [Google Scholar]
  18. Enright A.J., Van Dongen S., Ouzounis C.A. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 2002;30:1575–1584. doi: 10.1093/nar/30.7.1575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Feng L., Gu L.F., Luo J., Fu A.S., Ding Q., Yiu S.M., He J.X. Complete plastid genomes of the genus Ammopiptanthus and identification of a novel 23-kb rearrangement. Conserv. Genet. Resour. 2017;9:647–650. doi: 10.1007/s12686-017-0747-8. [DOI] [Google Scholar]
  20. Feng L., Li N., Yang W., Li Y., Wang C.M., Tong S.W., He J.X. Analyses of mitochondrial genomes of the genus Ammopiptanthus provide new insights into the evolution of legume plants. Plant Systemat. Evol. 2019;305:385–399. doi: 10.1007/s00606-019-01578-2. [DOI] [Google Scholar]
  21. Feng L., Li Y., Zhou Y.L., Meng G.H., Ji Z.L., Lin W.H., He J.X. Integrative transcriptomic and proteomic analyses reveal a positive role of BES1 in salt tolerance in Arabidopsis. Front. Plant Sci. 2023;14 doi: 10.3389/fpls.2023.1034393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Feng X., Song Y. Karyotype studies for Ammopiptanthus mongolicus. Ningxia Journal of Agriculture and Forestry Science and Technology. 1988;3 [Google Scholar]
  23. Gao F., Wang J., Wei S., Li Z., Wang N., Li H., Feng J., Li H., Zhou Y., Zhang F. Transcriptomic analysis of drought stress responses in Ammopiptanthus mongolicus leaves using the RNA-Seq technique. PLoS One. 2015;10 doi: 10.1371/journal.pone.0124382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Gao F., Wang N., Li H., Liu J., Fu C., Xiao Z., Wei C., Lu X., Feng J., Zhou Y. Identification of drought-responsive microRNAs and their targets in Ammopiptanthus mongolicus by using high-throughput sequencing. Sci. Rep. 2016;6 doi: 10.1038/Srep34601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Gao F., Wang X., Li X., Xu M., Li H., Abla M., Sun H., Wei S., Feng J., Zhou Y. Long-read sequencing and de novo genome assembly of Ammopiptanthus nanus, a desert shrub. GigaScience. 2018;7 doi: 10.1093/gigascience/giy074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Gao S., Yang X., Sun J., Zhao X., Wang B., Ye K. IAGS: Inferring Ancestor Genome Structure under a wide range of evolutionary scenarios. Mol. Biol. Evol. 2022;39:msac041. doi: 10.1093/molbev/msac041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Ge X.J., Yu Y., Yuan Y.M., Huang H.W., Yan C. Genetic diversity and geographic differentiation in endangered Ammopiptanthus (Leguminosae) populations in desert regions of northwest China as revealed by ISSR analysis. Ann. Bot. 2005;95:843–851. doi: 10.1093/aob/mci089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Haas B.J., Salzberg S.L., Zhu W., Pertea M., Allen J.E., Orvis J., White O., Buell C.R., Wortman J.R. Automated eukaryotic gene structure annotation using EVidenceModeler and the program to assemble spliced alignments. Genome Biol. 2008;9:R7. doi: 10.1186/gb-2008-9-1-r7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Haas B.J., Delcher A.L., Mount S.M., Wortman J.R., Smith R.K., Hannick L.I., Maiti R., Ronning C.M., Rusch D.B., Town C.D., et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 2003;31:5654–5666. doi: 10.1093/nar/gkg770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Hall B.P., Shakeel S.N., Amir M., Ul Haq N., Qu X., Schaller G.E. Histidine kinase activity of the ethylene receptor ETR1 facilitates the ethylene response in Arabidopsis. Plant Physiol. 2012;159:682–695. doi: 10.1104/pp.112.196790. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Han S.H., Li J.S. Characteristics of leaf structure of Ammopiptanthus mongolicus and its relation with cold resistance. Sci. Silvae Sin. 1992;28:198–201. [Google Scholar]
  32. Hane J.K., Ming Y., Kamphuis L.G., Nelson M.N., Garg G., Atkins C.A., Bayer P.E., Bravo A., Bringans S., Cannon S., et al. A comprehensive draft genome sequence for lupin (Lupinus angustifolius), an emerging health food: insights into plant-microbe interactions and legume evolution. Plant Biotechnol. J. 2017;15:318–330. doi: 10.1111/pbi.12615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Horvath D.P., Hansen S.A., Moriles-Miller J.P., Pierik R., Yan C., Clay D.E., Scheffler B., Clay S.A. RNAseq reveals weed-induced PIF3-like as a candidate target to manipulate weed stress response in soybean. New Phytol. 2015;207:196–210. doi: 10.1111/nph.13351. [DOI] [PubMed] [Google Scholar]
  34. Jaillon O., Aury J.-M., Noel B., Policriti A., Clepet C., Casagrande A., Choisne N., Aubourg S., Vitulo N., Jubin C., et al. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature. 2007;449:463–467. doi: 10.1038/nature06148. [DOI] [PubMed] [Google Scholar]
  35. Jedlicka P., Lexa M., Kejnovsky E. What can long terminal repeats tell us about the age of LTR retrotransposons, gene conversion and ectopic recombination? Front. Plant Sci. 2020;11:644. doi: 10.3389/fpls.2020.00644. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Jing R., Knox M.R., Lee J.M., Vershinin A.V., Ambrose M., Ellis T.H.N., Flavell A.J. Insertional polymorphism and antiquity of PDR1 retrotransposon insertions in Pisum species. Genetics. 2005;171:741–752. doi: 10.1534/genetics.105.045112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Katoh K., Standley D.M. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 2013;30:772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Keilwagen J., Wenk M., Erickson J.L., Schattat M.H., Grau J., Hartung F. Using intron position conservation for homology-based gene prediction. Nucleic Acids Res. 2016;44:e89. doi: 10.1093/nar/gkw092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Kim D., Landmead B., Salzberg S.L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods. 2015;12:357–U121. doi: 10.1038/Nmeth.3317. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Koren S., Walenz B.P., Berlin K., Miller J.R., Bergman N.H., Phillippy A.M. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27:722–736. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Kreplak J., Madoui M.A., Cápal P., Novák P., Labadie K., Aubert G., Bayer P.E., Gali K.K., Syme R.A., Main D., et al. A reference genome for pea provides insight into legume genome evolution. Nat. Genet. 2019;51:1411–1422. doi: 10.1038/s41588-019-0480-1. [DOI] [PubMed] [Google Scholar]
  42. Kumar S., Stecher G., Li M., Knyaz C., Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across computing platforms. Mol. Biol. Evol. 2018;35:1547–1549. doi: 10.1093/molbev/msy096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Li H.J., Bai W.P., Liu L.B., Liu H.S., Wei L., Garant T.M., Kalinger R.S., Deng Y.X., Wang G.N., Bao A.K., et al. Massive increases in C31 alkane on Zygophyllum xanthoxylum leaves contribute to its excellent abiotic stress tolerance. Ann. Bot. 2023;131:723–736. doi: 10.1093/aob/mcad038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Li J.L., Han L., Jin M., Zhang H., Su Y.H. Characteristics analysis of Ammopiptanthus mongolicus stomata and inward potassium channels in guard cell of Ammopiptanthus mongolicus. Genom. Appl. Biol. 2015;11 [Google Scholar]
  45. Li W., Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006;22:1658–1659. doi: 10.1093/bioinformatics/btl158. [DOI] [PubMed] [Google Scholar]
  46. Liu H., Wang X., Wang G., Cui P., Wu S., Ai C., Hu N., Li A., He B., Shao X., et al. The nearly complete genome of Ginkgo biloba illuminates gymnosperm evolution. Nat. Plants. 2021;7:748–756. doi: 10.1038/s41477-021-00933-x. [DOI] [PubMed] [Google Scholar]
  47. Liu J.Q., Qiu M.X. Ecological, physiological and anatomical traits of Ammopiptanthus mongolicus grown in desert of China. Acta Bot. Sin. 1982;24 [Google Scholar]
  48. Liu M.J., Zhao J., Cai Q.L., Liu G.C., Wang J.R., Zhao Z.H., Liu P., Dai L., Yan G.J., Wang W.J., et al. The complex jujube genome provides insights into fruit tree biology. Nat. Commun. 2014;5 doi: 10.1038/ncomms6315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Liu M.Q., Lu C.F., Yin W.L. Research progress of biological characteristics and abiotic stress tolerant mechanisms of rare & endangered plant Ammopiptanthus. Chin. J. Appl. Environ. Biol. 2004;10:384–388. doi: 10.3321/j.issn:1006-687X.2004.03.028. [DOI] [Google Scholar]
  50. Liu M., Shi J., Lu C. Identification of stress-responsive genes in Ammopiptanthus mongolicus using ESTs generated from cold- and drought-stressed seedlings. BMC Plant Biol. 2013;13:88. doi: 10.1186/1471-2229-13-88. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Livak K.J., Schmittgen T.D. Analysis of relative gene expression data using real-time quantitative PCR and the 2ˆ(ΔΔCt) Method. Methods. 2001;25:402–408. doi: 10.1006/meth.2001.1262. [DOI] [PubMed] [Google Scholar]
  52. Ma Q., Liu H.S., Li H.J., Bai W.P., Gao Q.F., Wu S.D., Yin X.X., Chen Q.Q., Shi Y.Q., Gao T.G., et al. Genomic analysis reveals phylogeny of Zygophyllales and mechanism for water retention of a succulent xerophyte. Plant Physiol. 2024 doi: 10.1093/plphys/kiae040. Preprint at. [DOI] [PubMed] [Google Scholar]
  53. Merchante C., Alonso J.M., Stepanova A.N. Ethylene signaling: simple ligand, complex regulation. Curr. Opin. Plant Biol. 2013;16:554–560. doi: 10.1016/j.pbi.2013.08.001. [DOI] [PubMed] [Google Scholar]
  54. Nawrocki E.P., Eddy S.R. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics. 2013;29:2933–2935. doi: 10.1093/bioinformatics/btt509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Neumann P., Novak P., Hostakova N., Macas J. Systematic survey of plant LTR-retrotransposons elucidates phylogenetic relationships of their polyprotein domains and provides a reference for element classification. Mobile DNA-Uk. 2019;10 doi: 10.1186/s13100-018-0144-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Ou S., Jiang N. LTR_retriever: a highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol. 2018;176:1410–1422. doi: 10.1104/pp.17.01310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Pandian B.A., Sathishraj R., Djanaguiraman M., Prasad P.V.V., Jugulam M. Role of cytochrome P450 enzymes in plant stress response. Antioxidants. 2020;9:454. doi: 10.3390/antiox9050454. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Pang T., Ye C.Y., Xia X., Yin W. De novo sequencing and transcriptome analysis of the desert shrub, Ammopiptanthus mongolicus, during cold acclimation using Illumina/Solexa. BMC Genom. 2013;14:488. doi: 10.1186/1471-2164-14-488. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Pang T., Guo L., Shim D., Cannon N., Tang S., Chen J., Xia X., Yin W., Carlson J.E. Characterization of the transcriptome of the xerophyte Ammopiptanthus mongolicus leaves under drought stress by 454 pyrosequencing. PLoS One. 2015;10 doi: 10.1371/journal.pone.0136495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Pei H.X., Wang H.L., Wang L.J., Zheng F.F., Dong C.H. In: Mechanism of Plant Hormone Signaling under Stress. Pandey G.K., editor. John Wiley & Sons; 2017. Regulatory function of ethylene in plant responses to drought, cold, and salt stresses; pp. 327–344. [DOI] [Google Scholar]
  61. Peng Y., Chen Y., Yuan Y., Liu B., Yu P., Song S., Yi Y., Teng Z., Yi Z., Zhang J., et al. Post-anthesis saline-alkali stress inhibits grain filling by promoting ethylene production and signal transduction. Food Energy Secur. 2022;11 doi: 10.1002/fes3.384. [DOI] [Google Scholar]
  62. Pereira J.F., Ryan P.R. The role of transposable elements in the evolution of aluminium resistance in plants. J. Exp. Bot. 2019;70:41–54. doi: 10.1093/jxb/ery357. [DOI] [PubMed] [Google Scholar]
  63. Pertea M., Kim D., Pertea G.M., Leek J.T., Salzberg S.L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 2016;11:1650–1667. doi: 10.1038/nprot.2016.095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Pham S.K., Pevzner P.A. DRIMM-Synteny: decomposing genomes into evolutionary conserved segments. Bioinformatics. 2010;26:2509–2516. doi: 10.1093/bioinformatics/btq465. [DOI] [PubMed] [Google Scholar]
  65. Price A.L., Jones N.C., Pevzner P.A. De novo identification of repeat families in large genomes. Bioinformatics. 2005;21:i351–i358. doi: 10.1093/bioinformatics/bti1018. [DOI] [PubMed] [Google Scholar]
  66. Qu Y., Shang X., Zeng Z., Yu Y., Bian G., Wang W., Liu L., Tian L., Zhang S., Wang Q., et al. Whole-genome duplication reshaped adaptive evolution in a relict plant species, Cyclocarya paliurus. Dev. Reprod. Biol. 2023;21:455–469. doi: 10.1016/j.gpb.2023.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Quan R., Hu S., Zhang Z., Zhang H., Zhang Z., Huang R. Overexpression of an ERF transcription factor TSRF1 improves rice drought tolerance. Plant Biotechnol. J. 2010;8:476–488. doi: 10.1111/j.1467-7652.2009.00492.x. [DOI] [PubMed] [Google Scholar]
  68. Rong W., Qi L., Wang A., Ye X., Du L., Liang H., Xin Z., Zhang Z. The ERF transcription factor TaERF3 promotes tolerance to salt and drought stresses in wheat. Plant Biotechnol. J. 2014;12:468–479. doi: 10.1111/pbi.12153. [DOI] [PubMed] [Google Scholar]
  69. Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.L., Song Q., Thelen J.J., Cheng J., et al. Genome sequence of the palaeopolyploid soybean. Nature. 2010;463:178–183. doi: 10.1038/nature08670. [DOI] [PubMed] [Google Scholar]
  70. Schuster A.C. Graduate School of Life Sciences. Julius-Maximilians-Universität Würzburg; 2016. Chemical and Functional Analyses of the Plant Cuticle as Leaf Transpiration Barrier. [Google Scholar]
  71. Servant N., Varoquaux N., Lajoie B.R., Viara E., Chen C.J., Vert J.P., Heard E., Dekker J., Barillot E. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 2015;16 doi: 10.1186/s13059-015-0831-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Shi J., Ma X., Zhang J., Zhou Y., Liu M., Huang L., Sun S., Zhang X., Gao X., Zhan W., et al. Chromosome conformation capture resolved near complete genome assembly of broomcorn millet. Nat. Commun. 2019;10 doi: 10.1038/s41467-018-07876-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Su Z., Pan B., Zhang M., Shi W. Conservation genetics and geographic patterns of genetic variation of endangered shrub Ammopiptanthus (Fabaceae) in northwestern China. Conserv. Genet. 2016;17:485–496. doi: 10.1007/s10592-015-0798-x. [DOI] [Google Scholar]
  74. Suyama M., Torrents D., Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006;34:W609–W612. doi: 10.1093/nar/gkl315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Tarailo-Graovac M., Chen N. Using RepeatMasker to identify repetitive elements in genomic sequences. Curr. Protoc. Bioinformatics. 2009;Chapter 4:4.10.1–4.10.14. doi: 10.1002/0471250953.bi0410s25. [DOI] [PubMed] [Google Scholar]
  76. Tesler G. GRIMM: genome rearrangements web server. Bioinformatics. 2002;18:492–493. doi: 10.1093/bioinformatics/18.3.492. [DOI] [PubMed] [Google Scholar]
  77. Trapnell C., Roberts A., Goff L., Pertea G., Kim D., Kelley D.R., Pimentel H., Salzberg S.L., Rinn J.L., Pachter L. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 2012;7:562–578. doi: 10.1038/nprot.2012.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Urano K., Maruyama K., Jikumaru Y., Kamiya Y., Yamaguchi-Shinozaki K., Shinozaki K. Analysis of plant hormone profiles in response to moderate dehydration stress. Plant J. 2017;90:17–36. doi: 10.1111/tpj.13460. [DOI] [PubMed] [Google Scholar]
  79. Walker B.J., Abeel T., Shea T., Priest M., Abouelliel A., Sakthikumar S., Cuomo C.A., Zeng Q.D., Wortman J., Young S.K., et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One. 2014;9 doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Wang H.C., Sun H., Compton J.A., Yang J.B. A phylogeny of Thermopsideae (Leguminosae : Papilionoideae) inferred from nuclear ribosomal internal transcribed spacer (ITS) sequences. Bot. J. Linn. Soc. 2006;151:365–373. doi: 10.1111/j.1095-8339.2006.00512.x. [DOI] [Google Scholar]
  81. Wang J., Sun P., Li Y., Liu Y., Yu J., Ma X., Sun S., Yang N., Xia R., Lei T., et al. Hierarchically aligning 10 legume genomes establishes a family-level genomics platform. Plant Physiol. 2017;174:284–300. doi: 10.1104/pp.16.01981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Wang L., Jia G., Jiang X., Cao S., Chen Z.J., Song Q. Altered chromatin architecture and gene expression during polyploidization and domestication of soybean. Plant Cell. 2021;33:1430–1446. doi: 10.1093/plcell/koab081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Wang Y.H., Wicke S., Wang H., Jin J.J., Chen S.Y., Zhang S.D., Li D.Z., Yi T.S. Plastid genome evolution in the early-diverging legume subfamily Cercidoideae (Fabaceae) Front. Plant Sci. 2018;9:138. doi: 10.3389/Fpls.2018.00138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Wang Y.P., Tang H.B., DeBarry J.D., Tan X., Li J.P., Wang X.Y., Lee T.H., Jin H.Z., Marler B., Guo H., et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40 doi: 10.1093/nar/gkr1293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Weng X.Y., Zhou S.L., Zong W., Ouyang Y.D. ChIP assay in rice. Bio-protocol. 2018 doi: 10.21769/BioProtoc.1010135. Preprint at. [DOI] [Google Scholar]
  86. Wu Y., Li X., Zhang J., Zhao H., Tan S., Xu W., Pan J., Yang F., Pi E. ERF subfamily transcription factors and their function in plant responses to abiotic stresses. Front. Plant Sci. 2022;13 doi: 10.3389/fpls.2022.1042084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Wu Y., Wei W., Pang X., Wang X., Zhang H., Dong B., Xing Y., Li X., Wang M. Comparative transcriptome profiling of a desert evergreen shrub, Ammopiptanthus mongolicus, in response to drought and cold stresses. BMC Genom. 2014;15:671. doi: 10.1186/1471-2164-15-671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Xie L., Yang Y. Miocene origin of the characteristic broad-leaved evergreen shrub Ammopiptanthus (Leguminosae) in the desert flora of eastern central Asia. Int. J. Plant Sci. 2012;173:944–955. doi: 10.1086/667232. [DOI] [Google Scholar]
  89. Xie M., Chung C.Y.L., Li M.W., Wong F.L., Wang X., Liu A., Wang Z., Leung A.K.Y., Wong T.H., Tong S.W., et al. A reference-grade wild soybean genome. Nat. Commun. 2019;10:1216. doi: 10.1038/s41467-019-09142-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Xiong L., Schumaker K.S., Zhu J.K. Cell signaling during cold, drought, and salt stress. Plant Cell. 2002;14:S165–S183. doi: 10.1105/tpc.000596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Xu S., An L., Feng H., Wang X., Li X. The seasonal effects of water stress on Ammopiptanthus mongolicus in a desert environment. J. Arid Environ. 2002;51:437–447. doi: 10.1006/jare.2001.0949. [DOI] [Google Scholar]
  92. Xu W., Zhang Q., Yuan W., Xu F., Muhammad Aslam M., Miao R., Li Y., Wang Q., Li X., Zhang X., et al. The genome evolution and low-phosphorus adaptation in white lupin. Nat. Commun. 2020;11:1069. doi: 10.1038/s41467-020-14891-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Xue D., Zhang X., Lu X., Chen G., Chen Z.H. Molecular and evolutionary mechanisms of cuticular wax for plant drought tolerance. Front. Plant Sci. 2017;8:621. doi: 10.3389/fpls.2017.00621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Yang S.F., Hoffman N.E. Ethylene biosynthesis and its regulation in higher plants. Annu. Rev. Plant Physiol. 1984;35:155–189. doi: 10.1146/annurev.pp.35.060184.001103. [DOI] [Google Scholar]
  95. Yang T., Liu R., Luo Y., Hu S., Wang D., Wang C., Pandey M.K., Ge S., Xu Q., Li N., et al. Improved pea reference genome and pan-genome highlight genomic features and evolutionary characteristics. Nat. Genet. 2022;54:1553–1563. doi: 10.1038/s41588-022-01172-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Yang X., Gao S., Guo L., et al. Three chromosome-scale Papaver genomes reveal punctuated patchwork evolution of the morphinan and noscapine biosynthesis pathway. Nat. Commun. 2021;12:6030. doi: 10.1038/s41467-021-26330-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Yang Z., Liu Y., Han H., Zhao X., Chen S., Li G., Shi S., Feng J. Physiological and transcriptome analyses reveal the response of Ammopiptanthus mongolicus to extreme seasonal temperatures in a cold plateau desert ecosystem. Sci. Rep. 2022;12 doi: 10.1038/s41598-022-14402-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Young N.D., Debellé F., Oldroyd G.E.D., Geurts R., Cannon S.B., Udvardi M.K., Benedito V.A., Mayer K.F.X., Gouzy J., Schoof H., et al. The Medicago genome provides insight into the evolution of rhizobial symbioses. Nature. 2011;480:520–524. doi: 10.1038/nature10625. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Zhang D., Yang H., Wang X., Qiu Y., Tian L., Qi X., Qu L.Q. Cytochrome P450 family member CYP96B5 hydroxylates alkanes to primary alcohols and is involved in rice leaf cuticular wax synthesis. New Phytol. 2020;225:2094–2107. doi: 10.1111/nph.16267. [DOI] [PubMed] [Google Scholar]
  100. Zhang R.G., Wang Z.X., Ou S., Li G.Y. TEsorter: lineage-level classification of transposable elements using conserved protein domains. bioRxiv. 2019 doi: 10.1101/800177. Preprint at. [DOI] [Google Scholar]
  101. Zhang X., Henriques R., Lin S.S., Niu Q.W., Chua N.H. Agrobacterium-mediated transformation of Arabidopsis thaliana using the floral dip method. Nat. Protoc. 2006;1:641–646. doi: 10.1038/nprot.2006.97. [DOI] [PubMed] [Google Scholar]
  102. Zhang Z. KaKs_Calculator 3.0: Calculating Selective Pressure on Coding and Non-coding Sequences. Dev. Reprod. Biol. 2022;20:536–540. doi: 10.1016/j.gpb.2021.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Zhao L., Wang X., Huo H., Yuan G., Sun Y., Zhang D., Cao Y., Xu L., Wei G. Phylogenetic diversity of Ammopiptanthus rhizobia and distribution of rhizobia associated with Ammopiptanthus mongolicus in diverse regions of northwest China. Microb. Ecol. 2016;72:231–239. doi: 10.1007/s00248-016-0759-z. [DOI] [PubMed] [Google Scholar]
  104. Zhao X.H., Zhang R., Li L., Fu X.L., Chen X.D., Li D.M., Xiao W., Gao D.S. Advances of plant cuticles biosynthesis and transport. Plant Physiology Journal. 2016;52:1128–1134. doi: 10.13592/j.cnki.ppj.2016.0114. [DOI] [Google Scholar]
  105. Zhao Y.Z., Zhu Z.Y. The endemic genera of desert region in the centre of Asia. Acta Bot. Yunnanica. 2003;25:113–122. [Google Scholar]
  106. Zheng F., Wu H., Zhang R., Li S., He W., Wong F.L., Li G., Zhao S., Lam H.M. Molecular phylogeny and dynamic evolution of disease resistance genes in the legume family. BMC Genom. 2016;17:402. doi: 10.1186/s12864-016-2736-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Zhou Y., Gao F., Liu R., Feng J., Li H. De novo sequencing and analysis of root transcriptome using 454 pyrosequencing to discover putative genes associated with drought tolerance in Ammopiptanthus mongolicus. BMC Genom. 2012;13:266. doi: 10.1186/1471-2164-13-266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Zhuang W., Chen H., Yang M., Wang J., Pandey M.K., Zhang C., Chang W.C., Zhang L., Zhang X., Tang R., et al. The genome of cultivated peanut provides insight into legume karyotypes, polyploid evolution and crop domestication. Nat. Genet. 2019;51:865–876. doi: 10.1038/s41588-019-0402-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Supplemental Figures 1–12
mmc1.pdf (1.8MB, pdf)
Data S1. Supplemental Tables 1–34
mmc2.zip (12.4MB, zip)
Document S2. Article plus supplemental information
mmc3.pdf (6.1MB, pdf)

Data Availability Statement

The data supporting the findings of this study are provided in the supplemental information files. The genome assembly and annotations have been deposited at the Chinese National Genomics Data Center (https://ngdc.cncb.ac.cn/) under BioProject accession number PRJCA024714. Raw sequencing data have been deposited in the NCBI Sequence Read Archive database (http://www.ncbi.nlm.nih.gov/sra) under accession number PRJNA1085185. The genome sequences and annotations have also been posted on our website https://ammopiptanthus.github.io/ for data download.


Articles from Plant Communications are provided here courtesy of Elsevier

RESOURCES