Abstract
Plants produce a remarkable diversity of structurally and functionally diverse natural chemicals that serve as adaptive compounds throughout their life cycles. However, unlocking this metabolic diversity is significantly impeded by the size, complexity, and abundant repetitive elements of typical plant genomes. As genome sequencing becomes routine, we anticipate that links between metabolic diversity and genetic variation will be strengthened. In addition, an ever-increasing number of plant genomes have revealed that biosynthetic gene clusters are not only a hallmark of microbes and fungi; gene clusters for various classes of compounds have also been found in plants, and many are associated with important agronomic traits. We present recent examples of plant metabolic diversification that have been discovered through the exploration and exploitation of various genomic and pan-genomic data. We also draw attention to the fundamental genomic and pan-genomic basis of plant chemodiversity and discuss challenges and future perspectives for investigating metabolic diversity in the coming pan-genomics era.
Key words: metabolic diversity, gene and genome duplication, biosynthetic gene clusters, genomic structural variations, pan-genomics
Plants are excellent chemists, producing a vast array of structurally diverse, specialized metabolites with various functions. However, our ability to access these diverse metabolites is impeded by their narrow taxonomic distribution, complex biosynthetic routes, and little known genomic basis. This review summarizes classical and emerging genetic drivers of metabolic diversification and draws attention to the use of genomics and pan-genomics to dissect complex metabolic traits in plants.
Introduction
A hallmark of plants is their ability to synthesize an extraordinarily large number (∼0.2–1 million) of diverse small molecules (Dixon and Strack, 2003; Fang et al., 2019). Although a small portion of these serve as primary metabolites that are essential for plant growth and reproduction, the majority are recognized as secondary or specialized metabolites (SMs) that play important ecological roles in plant defense against various biotic and abiotic stresses (Hartmann, 2007; Chae et al., 2014; Weng, 2014). In addition, although the functions of SMs were initially thought to be “secondary,” they can sometimes play important “primary” roles, as in the regulation of plant growth and development (Erb and Kliebenstein, 2020). From a biochemical viewpoint, plant SMs are mainly classified as phenylpropanoids, terpenoids, and alkaloids. Different classes of compounds can also be included, adding enormous chemical diversity to plant repertoires. Other types of plant SMs, such as unusual peptides and highly modified fatty acids, have drawn increasing attention in recent years (Kersten and Weng, 2018; Jeon et al., 2020). SMs are often found in limited groups of plant lineages or clades, and they can therefore reveal how plants have adapted to various ecological niches. For example, the sulfur-containing glucosinolate defense compounds are found exclusively in the Brassicales (which evolved roughly 90 million years ago), and their diversification is thought to reflect co-evolution with butterflies in an evolutionary arms-race (Edger et al., 2015). Thalianin (T10), a triterpene found only in the model plant Arabidopsis thaliana, has been shown to play important roles in modulating root growth and shaping a healthy rhizosphere bacterial community (Huang et al., 2019; Bai et al., 2021). In addition to the large number of metabolites synthesized by plants and their patchy distributions across phylogenetic taxa, plant metabolic diversity is also reflected in metabolic variation within plant organs and developmental stages. This variation may be even greater when examined at the level of specific cell types, single cells, or organelles (Misra et al., 2014; de Souza et al., 2020). Although the obstacles mentioned above challenge our ability to dissect and understand plant metabolic diversity, combinatorial approaches, including comparative genomics and transcriptomics, appear to offer powerful strategies for revealing the DNA fingerprints that lead to metabolite divergence in related species (Fernie and Tohge, 2017; van der Hooft et al., 2020). For example, comparing the genomes of the sister species A. thaliana and Arabidopsis lyrata has revealed that pathway genes for the biosynthesis of thalianin (T10) are also present in the A. lyrata genome. However, A. lyrata lacks two additional reductase genes, resulting in the production of the new triterpenoid epi-thalianin (T17, an epimer of thalianin) in its roots (Liu et al., 2020a).
Genome sequencing is now becoming routine, revolutionizing our interpretation of the genomic basis of metabolic diversification in a wide range of plant taxa. Conventionally, gene and genome duplications are the main types of genomic variants observed in surveys of related plant genomes (Moore and Purugganan, 2005; Lichman et al., 2020). Genetic drift and natural selection then drive the retention and diversification of these genetic foundations (Ohno, 1970; Panchy et al., 2016), translating into a spectacular diversity of SMs among plant species. In addition, re-sequencing and de novo assembly of representative accessions within a species (referred to as pan-genomics) have enabled the identification of other, underexplored genomic structural variations, such as presence–absence variations and gene copy-number variations (Ho et al., 2020), enabling us to uncover the molecular mechanisms that underlie plant metabolic diversity with greater resolution. Pan-genomic variants found at the intra-species level are more evolutionarily recent than genomic variants found in inter-species comparisons. Therefore, the investigation of both genomic and pan-genomic variations associated with a metabolic trait can shed light on the comprehensive evolutionary history of variations in plant chemical profiles.
Here, we review recent advances linking phytochemical innovations to genetic variations at both shallow (genomic) and deep (pan-genomic) phylogenetic scales, with a particular emphasis on genome structural variations, some of which have only recently become available following the construction of plant pan-genomes. Another key development in the field of specialized plant metabolism is the discovery of biosynthetic gene clusters (BGCs) that serve as the genomic basis for various types of natural products (Polturak and Osbourn, 2021). Metabolic diversity generated by diversification of gene clusters will therefore also be covered. Finally, we summarize and discuss challenges and perspectives on genomic and pan-genomic variation and their connections to the pan-metabolomes they encode.
Classical drivers of metabolic diversity: Gene and genome duplication
Since the first plant genome was sequenced from the model plant Arabidopsis thaliana (The Arabidopsis Genome Initiative, 2000), the genomes of several hundred plant species have been sequenced and published according to the National Center for Biotechnology Information, although they may differ in sequencing depth and assembly quality. These high-quality genomes were then analyzed by comparative genomics, often coupled with transcriptomics and metabolomics, methods that are key to revealing the phylogenomic basis for chemical innovation and metabolite distribution among species. It is widely accepted that gene and genome duplication provide the basic genomic foundations that support the expansion of plant metabolism (Ober, 2005, 2010; Lichman et al., 2020). For example, the expansion of the cytochrome P450 family, one of the largest enzyme families in plants, is well correlated with the diversification of metabolites in the plant kingdom (Nelson and Werck-Reichhart, 2011; Hansen et al., 2021). Duplications of genes encoding other types of enzymes, such as acyltransferases, methyltransferases, and glycosyltransferases, have all been shown to be excellent markers of plant metabolic diversification (Barakat et al., 2011; Leong and Last, 2017; Louveau and Osbourn, 2019). Recently, high-quality genomes have been produced in some non-model plants, and the expansion of specific functional enzymes has been identified. For example, analysis of the Coptis chinensis genome revealed amplification of the Ranunculales clade-specific CYP719 family, which is associated with the diversification of protoberberine-type alkaloids (Liu et al., 2021). Similarly, amplification of chalcone synthase (CHS) genes appears to be associated with urushiol biosynthesis in Mangifera indica (mango) (Wang et al., 2020), and the expansion of the CYP71D family is proposed to have driven the diversification of tanshinone biosynthesis in Salvia miltiorrhiza (Ma et al., 2021).
Although genome sequencing has undergone impressive advances in recent years, the sequencing and assembly of non-model plant genomes can be expensive and time consuming. Alternatively, RNA-seq analysis can be used to identify biosynthetic genes and reveal the evolutionary basis of targeted metabolic pathways. For instance, based on biochemical data on kavalactones (with potent anxiolytic and analgesic activity) isolated from kava (Piper methysticum), Pluskal et al. (2019) hypothesized that a CHS-like enzyme (a type of polyketide synthase) probably participates in the biosynthesis of the styrylpyrone backbone of kavalactone. They subsequently identified three CHS-like genes in a kava transcriptome assembled de novo from leaf and root tissues. One of these genes was characterized as a chalcone synthase for flavokavains, but the other two appear to be recently duplicated and have undergone neofunctionalization to catalyze the formation of the styrylpyrone backbone of kavalactones specific to kava. Metabolic pathways can also be elucidated via comparative transcriptomics in related plant species or tissues/organs of the same species by correlating differential gene expression patterns with chemical profiles in related samples (Tzfadia et al., 2015; Delli-Ponti et al., 2020; Nett et al., 2020; Song et al., 2021). In addition to singleton gene duplication (e.g. tandem duplication and dispersed duplication), whole-genome duplication (WGD), or polyploidization, also plays important roles in plant function and evolution (Clark and Donoghue, 2018). As genome sequencing becomes more affordable, more and more genomes representing a wider variety of plant families have been sequenced, and the resulting data consistently support the finding that WGD has occurred frequently throughout plant evolutionary history (Qiao et al., 2019; Soltis and Soltis, 2021). Commonly used methods for inferring WGD in plants include (often in combination) synteny analysis (Wang et al., 2012; Qiao et al., 2019), analysis of synonymous substitutions per synonymous site (Ks) among paralogs (Cui et al., 2006), phylogenomic (gene tree) reconciliation (Jiao et al., 2011), and a likelihood-based gene-count method (Rabier et al., 2014). Although WGD is pervasive in plant genomes, assessing its effect on plant metabolism remains a challenge. WGD is often associated with an ancient polyploidization event, followed by loss of most duplicated genes over a few million years (Lynch and Conery, 2000). The complexity of metabolic pathways, including those that produce very complex chemical structures or that interact with other pathways, adds additional barriers to investigating WGD-generated metabolic diversification. Only a few examples of WGD-driven metabolic diversity have been reported in plants (Figure 1). Representative examples include WGD-generated CYP79s for glucosinolate biosynthesis in Brassicaceae (Hofberger et al., 2013; Edger et al., 2015), morphinan branches in opium poppy (Li et al., 2020), N-methylputrescine oxidase (MPO1) for nicotine biosynthesis in wild tobacco (Xu et al., 2017), and fatty acid desaturase 2 (FAD2) for oil biosynthesis in wild olive (Unver et al., 2017). In some cases, a whole metabolic pathway can be duplicated via WGD, leading to metabolite diversification in plants that diverged before and after the WGD event. Su et al. (2021) identified a recent tribe-specific WGD dated at 13.5–27.1 million years ago in the apple tribe. They then traced the evolution of the entire metabolic pathway (including one triterpene cyclase, one CYP716A, and one CYP716C) for biosynthesis of major triterpenes in Gillenia trifoliata and loquat, which pre- and postdate the apple tribe WGD, respectively. Their results demonstrated that WGD-generated gene duplicates for the entire pathway were retained and co-opted for the production of exceptionally high levels of ursane-type triterpenes in loquat, providing a clear example of WGD-driven metabolic diversification in plants.
Figure 1.
Whole genome duplication (WGD) underlies metabolic diversification in plants.
WGD-associated metabolic diversification has been found for various classes of compounds in plants. The CYP79 gene family expansion that originated from the second major WGD of Brassicales laid the foundation for glucosinolate diversity in Brassicaceae (e.g. Arabidopsis thaliana) (Hofberger et al., 2013; Edger et al., 2015). The differential accumulation of oleic and linoleic acids produced in olive (Olea europaea) compared with the closely related oil crop sesame is due in part to the divergence of the fatty acid desaturase FAD2 gene, generated by WGD (Unver et al., 2017). N-methylputrescine oxidase (MPO1), required for the biosynthesis of nicotine in Nicotiana attenuata, is proposed to have been generated via whole-genome triplication in Solanaceae (Xu et al., 2017). In Papaver somniferum, three genes (T6ODM, CODM, and COR) are likely to have been duplicated via WGD, and their encoded enzymes catalyze the final steps of the morphine biosynthetic pathway (Li et al., 2020). Retention and co-option of WGD-generated sister pathways underlie the high levels of bioactive triterpenes in Eriobotrya japonica (loquat) (Su et al., 2021).
Synergistic drivers of chemical innovation: BGCs
Genes were once thought to be randomly organized in plant genomes. The clustering in close proximity of multiple non-homologous genes that collectively encode metabolic pathway(s) offers a clear rebuttal of this notion (Figure 2) (Field and Osbourn, 2008; Nützmann et al., 2016). The so-called BGCs are no longer a hallmark of only microbes and fungi: they have also been found for various classes of compounds in plants, including terpenes, alkaloids, benzoxazinoids, cyanogenic glucosides, polyketides, fatty acids, and phenylpropanoids (Polturak and Osbourn, 2021). Although the clustering of metabolic genes in plants is an exception to the more common scenario of non-clustered pathways (Wisecaver et al., 2017), the study of plant BGCs has hugely facilitated the functional identification of previously unknown enzymes and pathways, thus advancing our understanding of plant genome evolution (Nützmann et al., 2016; Polturak et al., 2022). Plant BGCs are commonly composed of signature and tailoring genes (which generate and modify the backbone, respectively), although atypical types have also been reported (Polturak and Osbourn, 2021). The distinct genomic features of BGCs have led to the development of computer algorithms for mining of metabolic genes and pathways, some of which use only plant genomic data (Kautsar et al., 2017; Töpfer et al., 2017). Combining genome sequences with other types of data, such as transcriptomic and metabolomic data, further amplifies the power of genome mining for the investigation of metabolic diversity (Medema et al., 2021).
Figure 2.
Biosynthetic gene clusters and auxiliary genes.
(A) Examples showing auxiliary genes located in the cluster region, including OsPDX3, encoding a cofactor for hydroxycinnamoyl tyramine production in rice, and two transporters for the thebaine cluster in opium poppy and the dhurrin cluster in Sorghum bicolor, respectively.
(B) Auxiliary genes of different gene types (e.g. transcription factor, protein kinase, and non-coding RNA) may be explored by developing computational methods such as clustering mining (plantiSMASH, PhytoClust), association studies (with transcriptome and chemical features), and machine learning.
A recent emerging theme for plant BGCs is that auxiliary genes not directly involved in biosynthesis per se are, in some cases, clustered with the core cluster (Figure 2A). Examples include cognate transporters in the noscapine cluster and dhurrin cluster (Darbani et al., 2016; Dastmalchi et al., 2019) and a cofactor that is used to support the biosynthesis of hydroxycinnamoyl tyramine in rice (Shen et al., 2021). Clustering of genes with auxiliary functions thus extends the boundaries of gene clustering in plant genomes and has opened up new windows for mining BGC-related metabolic diversity (Figure 2B). Liu et al. (2020b) carried out a systematic genomic neighborhood (GN) association analysis across 13 Brassicaceae genomes, centered on oxidosqualene cyclase (OSC) genes that generate the triterpene skeleton. Pfam domains in the OSC genomic region (5- or 10-gene distance on each side of an OSC gene) were retrieved, and their presence was compared with that at the whole-genome level by fitting to a hypergeometric distribution. The OSC-centric GN analysis involved various statistical analyses to capture significant co-occurrence of OSCs with any type of coding gene, either enzymatic or non-enzymatic. This analysis identified in the OSC-GNs not only clustered metabolic genes (some of which were later functionally characterized as BGCs) but also, for example, F-box and wall-associated receptor kinase genes, although whether these genes are truly associated with auxiliary functions of triterpene biosynthesis awaits further investigation (Liu et al., 2020b). GN association analysis thus also highlights the importance of using comparative genomics based on conservation of local gene content to discover novel metabolism-related genes or pathways. Indeed, microsynteny-based phylogeny has recently been successfully tested across 50 different plant families and 30 orders in angiosperms (Zhao et al., 2021), and it may offer a great opportunity for the application of GN analysis to wider plant families for the discovery of new metabolic pathways.
Pan-genomics: A new frontier for mining metabolic diversity in plants
The pan-genome, which refers to the entire conserved (core genome) and diversified (dispensable genome) genetic information of a species, has recently drawn significant attention (Bayer et al., 2020; Lei et al., 2021). Previously, population-level genomic variation was limited mainly to single-nucleotide polymorphisms (SNPs) and small insertion and deletion (<50 bp) variations, which could be picked up by short-read sequencing (Ikegawa, 2012; 1001 Genomes Consortium, 2016). Recent advances in long-read genome sequencing and de novo assembly have enabled long-overlooked genomic structural variations, such as presence–absence variations (PAVs) (>50 bp), gene copy-number variations, chromosomal inversions (CIs), and genomic translocations, to be included in pan-genomes (Ho et al., 2020). These structural variations have significantly revolutionized our understanding of the molecular mechanisms that underlie plant survival and adaptation (Li et al., 2014; Tao et al., 2019; Barragan and Weigel, 2021). To date, about 30 plant pan-genomes have been created, although there are large differences in the number of sequenced accessions in individual pan-genomes (Bayer et al., 2020; Jiao and Schneeberger, 2020; Song et al., 2020; Qin et al., 2021). Li et al. (2020) sequenced 10 opium poppy cultivars and identified gene copy-number variations that underlie marked differences in benzylisoquinoline alkaloid production among the sequenced cultivars. Interestingly, benzylisoquinoline alkaloid pathway genes within BGCs are more likely to exhibit gene copy-number variations than those outside of BGCs. Gao et al. (2019) generated a pan-genome for tomato by sequencing 725 accessions and identified a PAV in the promoter region of TomLoxC (solyc01g006540), a 13-lipoxygenase gene linked to tomato flavor. Based on eight high-quality genomes for rapeseed (Brassica napus), Song et al. (2020) identified a large number of PAVs that were suitable for a PAV-based genome-wide association study (PAV-GWAS). This enabled the direct identification of causal structural variations for several important crop traits, including silique length, seed weight, and flowering time. PAV-GWAS has also been applied to pigeon pea (Cajanus cajan) (Zhao et al., 2020). It is thus tempting to speculate that PAV-GWAS may be widely used in the near future to identify causative genes not only for qualitative traits (as mentioned above) but also for metabolic diversification in natural populations. This is also likely to complement the widely used SNP-based mGWAS (SNP-mGWAS) approach (Luo, 2015), leading to greater resolution in the dissection of plant metabolic plasticity and complexity.
CIs play important roles in evolution (Kirkpatrick, 2010). Although it has been observed that CIs are ubiquitous across plant genomes (Huang and Rieseberg, 2020), CI-associated metabolic diversity in plants is rarely reported. Liu et al. (2020a) retrieved 22 genomic regions for the thalianol cluster from an ongoing pan-genome project in Arabidopsis thaliana. Comparative analysis revealed that CI acted as a shuffling mechanism for relocating distant metabolic genes into the core cluster region, thus forming a more compact gene cluster. The compacted thalianol cluster was present in ∼80% of the examined genomes, indicating that there are advantages to tighter metabolic gene clustering (Liu et al., 2020a). Further investigations are required, however, to clarify whether these CIs lead to chemical diversification in natural accessions of A. thaliana.
Concluding remarks and perspectives
Great advances in genome sequencing have propelled biology and biological research into a new digital era, leading to the identification of numerous genetic and genomic structural variations among and within species (Figure 3). The rapidly growing number of sequenced plant genomes has strengthened our appreciation of the importance of gene and genome duplications for the diversification of plant metabolism. In a phylogenomic context, metabolic gene duplicates derived from random, dispersed, or whole-genome duplications can be distinguished (Moore et al., 2019; Qiao et al., 2019), and the selective forces driving these variants in shaping metabolic flexibility and variability can be inferred with greater resolution. Powerful statistical methods and phylogeny-guided approaches that can be used to detect signatures of selection at the gene and population level have been the subject of recent extensive reviews (Scossa and Fernie, 2020; Schenck and Busta, 2021). However, the application of comparative genomics beyond closely related plant species remains a challenge, particularly when the same or similar compounds are synthesized convergently or independently by distant or early diverged species (Pichersky and Lewinsohn, 2011; Lou et al., 2021). New tools are therefore required that can delineate conserved (i.e., sharing a common ancestor) and independent evolution of metabolic traits at the genomic level.
Figure 3.
(pan)-genomic basis underlying metabolic diversification in plants.
We summarize the major types of (pan)-genomic variations associated with metabolic diversity in plants by showing representative examples. (A) Gene and genome duplications. SGD, single gene duplication. In A. thaliana, CYP98A8 and CYP98A9 are tandemly duplicated from a common ancestor CYP98A8/9′ whose product has dual 3′-hydroxylase and 5′-hydroxylase activity on phenolamide N1,N5,N10-tri-coumaroyl-spermidine. During the course of evolution, the child copies CYP98A8 and CYP98A9 became specialized to perform one of the ancestral functions of CYP98A8/9′ (Liu et al., 2016). WGD, whole genome duplication. In Eriobotrya japonica (loquat), the biosynthetic pathways of the major triterpene corosolic acid were generated by WGD (Su et al., 2021).
(B) Organization of metabolic pathway genes. Genes for plant metabolic pathways can be organized tightly (as clusters) or loosely in the genome.
(C) Structural variations based on pan-genomes. In Papaver somniferum, copy-number variations (CNVs) of T6ODM between different cultivars were associated with the level of morphine production (Li et al., 2020). In A. thaliana, compaction of the thalianol cluster within natural accessions is associated with chromosome inversion (CI) (Liu et al., 2020a). In Oryza sativa (rice), presence–absence variations (PAVs) of the metabolic genes TPS28, CYP71Z21, and CYP71Z2 are responsible for variations in casbene among rice accessions (Zhan et al., 2020).
High-quality genome sequences provide an entrance for functional genomics and subsequent characterization of metabolic diversity between plant species. However, for many recently sequenced non-model plants, genome annotations are incomplete and sometimes even incorrect, thereby hindering functional analysis. Integrating genome sequencing with other omics techniques (e.g. gene co-expression analysis, proteomics, and metabolomics) and additional synergistic approaches (e.g. phylogenetic analysis, gene enrichment analysis, evolutionary principles, and functional validation) is likely to enable effective and efficient annotation and characterization of metabolic diversity in the near future.
It is also becoming evident that metabolic gene clustering is widely distributed across different classes of natural products and, putatively, in all sequenced plant genomes. On average, approximately 30 BGC candidates have been identified in each sequenced plant genome (a rough estimate based on plantiSMASH) (Kautsar et al., 2017), but the vast majority have not been functionally characterized and are therefore only putative BGCs. Functional characterization of more in silico–predicted BGCs will in turn be used to adjust methods for the genome mining of BGCs. The accuracy of BGC prediction also depends on the contiguity of the host genome assembly, and assembly currently faces challenges in resolving the rich repetitive sequences that are commonly identified in plant BGC regions (Field et al., 2011; Shen et al., 2021). Improving long-read sequencing with high accuracy is likely to conquer this problem. In addition to the localization of gene clusters, co-expression of the clustered genes is another important feature of BGCs and has been leveraged for the discovery of biosynthetic pathways (Nützmann et al., 2016; Polturak and Osbourn, 2021). However, the molecular mechanisms that underlie the co-regulation of BGC genes are not fully understood, impeding investigations of their encoded metabolic diversity across various developing tissues of host plants (Yu et al., 2016; Nützmann et al., 2020).
Although plant genome sequencing is becoming routine, the creation and visualization of plant pan-genomes are still challenging (Lei et al., 2021). Calling variants in pan-genomes is heavily reliant on the degree of their contiguity, completeness, and accuracy. Association studies based on structural variants are further constrained by other factors, including (1) the size and highly repetitive nature of plant pan-genomes; (2) the pervasive occurrence of polyploidization and gene duplication events; and (3) the need to sequence an adequate number of accessions that are sufficiently variable for association analysis at a reasonable time and cost. As metabolic traits are often quantitative, the challenges mentioned above are likely to be magnified in mGWAS analysis, e.g. PAV-mGWAS enabled by pan-genomes. Compared with the pan-genomes of microbes, in which the concept of pan-genome was first conceived (Tettelin et al., 2005), the pan-genomes of plants have developed much more slowly because of the challenges mentioned above. However, progress in plant pan-genomics can be achieved by taking inspiration from lower organisms; for instance, the most frequently used software for plant BGCs, plantiSMASH, is derived from antiSMASH, which was initially developed to mine gene clusters from bacterial and fungal genomes (Medema et al., 2011).
In summary, the surge in the genome sequencing field has greatly expanded our genomic and pan-genomic understanding of metabolic diversification in plants. Massive progress in the identification of genomic structural variations will provide rich materials for the interpretation of complex metabolic variations that differ widely among plants and will facilitate the elucidation of their largely uncharacterized ecological roles in the relationships between plants and other associated organisms.
Funding
The Z.L. laboratory is supported by a startup grant provided by Shanghai Jiao Tong University, School of Agriculture and Biology and the Shanghai Pujiang Program (20PJ1405900).
Acknowledgments
We apologize to those authors whose work could not be discussed owing to space limitations. We thank Dr. Guy Polturak (John Innes Centre) for useful comments. No conflict of interest is declared.
Published: January 20, 2022
Footnotes
Published by the Plant Communications Shanghai Editorial Office in association with Cell Press, an imprint of Elsevier Inc., on behalf of CSPB and CEMPS, CAS.
References
- 1001 Genomes Consortium 1,135 genomes reveal the global pattern of polymorphism in Arabidopsis thaliana. Cell. 2016;166:481–491. doi: 10.1016/j.cell.2016.05.063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bai Y., et al. Modulation of Arabidopsis root growth by specialized triterpenes. New Phytol. 2021;230:228–243. doi: 10.1111/nph.17144. [DOI] [PubMed] [Google Scholar]
- Barakat A., Choi A., Yassin N.B.M., Park J.S., Sun Z., Carlson J.E. Comparative genomics and evolutionary analyses of the O-methyltransferase gene family in Populus. Gene. 2011;479:37–46. doi: 10.1016/j.gene.2011.02.008. [DOI] [PubMed] [Google Scholar]
- Barragan A.C., Weigel D. Plant NLR diversity: the known unknowns of pan-NLRomes. Plant Cell. 2021;33:814–831. doi: 10.1093/plcell/koaa002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bayer P.E., Golicz A.A., Scheben A., Batley J., Edwards D. Plant pan-genomes are the new reference. Nat. Plants. 2020;6:914–920. doi: 10.1038/s41477-020-0733-0. [DOI] [PubMed] [Google Scholar]
- Chae L., Kim T., Nilo-Poyanco R., Rhee S.Y. Genomic signatures of specialized metabolism in plants. Science. 2014;344:510–513. doi: 10.1126/science.1252076. [DOI] [PubMed] [Google Scholar]
- Clark J.W., Donoghue P.C.J. Whole-genome duplication and plant macroevolution. Trends Plant Sci. 2018;23:933–945. doi: 10.1016/j.tplants.2018.07.006. [DOI] [PubMed] [Google Scholar]
- Cui L., et al. Widespread genome duplications throughout the history of flowering plants. Genome Res. 2006;16:738–749. doi: 10.1101/gr.4825606. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Darbani B., Motawia M.S., Olsen C.E., Nour-Eldin H.H., Møller B.L., Rook F. The biosynthetic gene cluster for the cyanogenic glucoside dhurrin in Sorghum bicolor contains its co-expressed vacuolar MATE transporter. Sci. Rep. 2016;6:37079. doi: 10.1038/srep37079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dastmalchi M., Chang L., Chen R., Yu L., Chen X., Hagel J.M., Facchini P.J. Purine permease-type benzylisoquinoline alkaloid transporters in opium poppy. Plant Physiol. 2019;181:916–933. doi: 10.1104/pp.19.00565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Delli-Ponti R., Shivhare D., Mutwil M. Using gene expression to study specialized metabolism-a practical guide. Front. Plant Sci. 2020;11:625035. doi: 10.3389/fpls.2020.625035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dixon R.A., Strack D. Phytochemistry meets genome analysis, and beyond. Phytochemistry. 2003;62:815–816. doi: 10.1016/s0031-9422(02)00712-4. [DOI] [PubMed] [Google Scholar]
- Edger P.P., et al. The butterfly plant arms-race escalated by gene and genome duplications. Proc. Natl. Acad. Sci. U S A. 2015;112:8362–8366. doi: 10.1073/pnas.1503926112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Erb M., Kliebenstein D.J. Plant secondary metabolites as defenses, regulators, and primary metabolites: the blurred functional trichotomy. Plant Physiol. 2020;184:39–52. doi: 10.1104/pp.20.00433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fang C., Fernie A.R., Luo J. Exploring the diversity of plant metabolism. Trends Plant Sci. 2019;24:83–98. doi: 10.1016/j.tplants.2018.09.006. [DOI] [PubMed] [Google Scholar]
- Fernie A.R., Tohge T. The genetics of plant metabolism. Annu. Rev. Genet. 2017;51:287–310. doi: 10.1146/annurev-genet-120116-024640. [DOI] [PubMed] [Google Scholar]
- Field B., Osbourn A.E. Metabolic diversification—independent assembly of operon-like gene clusters in different plants. Science. 2008;320:543–547. doi: 10.1126/science.1154990. [DOI] [PubMed] [Google Scholar]
- Field B., Fiston-Lavier A.-S., Kemen A., Geisler K., Quesneville H., Osbourn A.E. Formation of plant metabolic gene clusters within dynamic chromosomal regions. Proc. Natl. Acad. Sci. U S A. 2011;108:16116–16121. doi: 10.1073/pnas.1109273108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gao L., et al. The tomato pan-genome uncovers new genes and a rare allele regulating fruit flavor. Nat. Genet. 2019;51:1044–1051. doi: 10.1038/s41588-019-0410-2. [DOI] [PubMed] [Google Scholar]
- Hansen C.C., Nelson D.R., Møller B.L., Werck-Reichhart D. Plant cytochrome P450 plasticity and evolution. Mol. Plant. 2021;14:1244–1265. doi: 10.1016/j.molp.2021.06.028. [DOI] [PubMed] [Google Scholar]
- Hartmann T. From waste products to ecochemicals: fifty years research of plant secondary metabolism. Phytochemistry. 2007;68:2831–2846. doi: 10.1016/j.phytochem.2007.09.017. [DOI] [PubMed] [Google Scholar]
- Ho S.S., Urban A.E., Mills R.E. Structural variation in the sequencing era. Nat. Rev. Genet. 2020;21:171–189. doi: 10.1038/s41576-019-0180-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hofberger J.A., Lyons E., Edger P.P., Chris Pires J., Eric Schranz M. Whole genome and tandem duplicate retention facilitated glucosinolate pathway diversification in the mustard family. Genome Biol. Evol. 2013;5:2155–2173. doi: 10.1093/gbe/evt162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van der Hooft J.J.J., Mohimani H., Bauermeister A., Dorrestein P.C., Duncan K.R., Medema M.H. Linking genomics and metabolomics to chart specialized metabolic diversity. Chem. Soc. Rev. 2020;49:3297–3314. doi: 10.1039/d0cs00162g. [DOI] [PubMed] [Google Scholar]
- Huang K., Rieseberg L.H. Frequency, origins, and evolutionary role of chromosomal inversions in plants. Front. Plant Sci. 2020;11:296. doi: 10.3389/fpls.2020.00296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang A.C., Jiang T., Liu Y.-X., Bai Y.-C., Reed J., Qu B., Goossens A., Nützmann H.-W., Bai Y., Osbourn A. A specialized metabolic network selectively modulates Arabidopsis root microbiota. Science. 2019;364:eaau6389. doi: 10.1126/science.aau6389. [DOI] [PubMed] [Google Scholar]
- Ikegawa S. A short history of the genome-wide association study: where we were and where we are going. Genomics Inform. 2012;10:220–225. doi: 10.5808/GI.2012.10.4.220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jeon J.E., Kim J.G., Fischer C.R., Mehta N., Dufour-Schroif C., Wemmer K., Mudgett M.B., Sattely E. A pathogen-responsive gene cluster for highly modified fatty acids in tomato. Cell. 2020;180:176–187.e19. doi: 10.1016/j.cell.2019.11.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiao W.-B., Schneeberger K. Chromosome-level assemblies of multiple Arabidopsis genomes reveal hotspots of rearrangements with altered evolutionary dynamics. Nat. Commun. 2020;11:989. doi: 10.1038/s41467-020-14779-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiao Y., et al. Ancestral polyploidy in seed plants and angiosperms. Nature. 2011;473:97–100. doi: 10.1038/nature09916. [DOI] [PubMed] [Google Scholar]
- Kautsar S.A., Suarez Duran H.G., Blin K., Osbourn A., Medema M.H. plantiSMASH: automated identification, annotation and expression analysis of plant biosynthetic gene clusters. Nucleic Acids Res. 2017;45:W55–W63. doi: 10.1093/nar/gkx305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kersten R.D., Weng J.-K. Gene-guided discovery and engineering of branched cyclic peptides in plants. Proc. Natl. Acad. Sci. U S A. 2018;115:E10961–E10969. doi: 10.1073/pnas.1813993115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirkpatrick M. How and why chromosome inversions evolve. PLoS Biol. 2010;8:e1000501. doi: 10.1371/journal.pbio.1000501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lei L., Goltsman E., Goodstein D., Wu G.A., Rokhsar D.S., Vogel J.P. Plant pan-genomics comes of age. Annu. Rev. Plant Biol. 2021;72:411–435. doi: 10.1146/annurev-arplant-080720-105454. [DOI] [PubMed] [Google Scholar]
- Leong B.J., Last R.L. Promiscuity, impersonation and accommodation: evolution of plant specialized metabolism. Curr. Opin. Struct. Biol. 2017;47:105–112. doi: 10.1016/j.sbi.2017.07.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Y., et al. De novo assembly of soybean wild relatives for pan-genome analysis of diversity and agronomic traits. Nat. Biotechnol. 2014;32:1045–1052. doi: 10.1038/nbt.2979. [DOI] [PubMed] [Google Scholar]
- Li Q., Ramasamy S., Singh P., Hagel J.M., Dunemann S.M., Chen X., Chen R., Yu L., Tucker J.E., Facchini P.J., et al. Gene clustering and copy number variation in alkaloid metabolic pathways of opium poppy. Nat. Commun. 2020;11:1190. doi: 10.1038/s41467-020-15040-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lichman B.R., Godden G.T., Buell C.R. Gene and genome duplications in the evolution of chemodiversity: perspectives from studies of Lamiaceae. Curr. Opin. Plant Biol. 2020;55:74–83. doi: 10.1016/j.pbi.2020.03.005. [DOI] [PubMed] [Google Scholar]
- Liu Z., Tavares R., Forsythe E.S., André F., Lugan R., Jonasson G., Boutet-Mercey S., Tohge T., Beilstein M.A., Werck-Reichhart D., et al. Evolutionary interplay between sister cytochrome P450 genes shapes plasticity in plant metabolism. Nat. Commun. 2016;7:1–10. doi: 10.1038/ncomms13026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Z., Cheema J., Vigouroux M., Hill L., Reed J., Paajanen P., Yant L., Osbourn A. Formation and diversification of a paradigm biosynthetic gene cluster in plants. Nat. Commun. 2020;11:5354. doi: 10.1038/s41467-020-19153-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Z., Suarez Duran H.G., Harnvanichvech Y., Stephenson M.J., Schranz M.E., Nelson D., Medema M.H., Osbourn A. Drivers of metabolic diversification: how dynamic genomic neighbourhoods generate new biosynthetic pathways in the Brassicaceae. New Phytol. 2020;227:1109–1123. doi: 10.1111/nph.16338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Y., et al. Analysis of the Coptis chinensis genome reveals the diversification of protoberberine-type alkaloids. Nat. Commun. 2021;12:3276. doi: 10.1038/s41467-021-23611-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lou Y.-R., Anthony T.M., Fiesel P.D., Arking R.E., Christensen E.M., Jones A.D., Last R.L. It happened again: convergent evolution of acylglucose specialized metabolism in black nightshade and wild tomato. Sci. Adv. 2021;7:eabj8726. doi: 10.1126/sciadv.abj8726. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Louveau T., Osbourn A. The sweet side of plant-specialized metabolism. Cold Spring Harb. Perspect. Biol. 2019;11:a034744. doi: 10.1101/cshperspect.a034744. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luo J. Metabolite-based genome-wide association studies in plants. Curr. Opin. Plant Biol. 2015;24:31–38. doi: 10.1016/j.pbi.2015.01.006. [DOI] [PubMed] [Google Scholar]
- Lynch M., Conery J.S. The evolutionary fate and consequences of duplicate genes. Science. 2000;290:1151–1155. doi: 10.1126/science.290.5494.1151. [DOI] [PubMed] [Google Scholar]
- Ma Y., et al. Expansion within the CYP71D subfamily drives the heterocyclization of tanshinones synthesis in Salvia miltiorrhiza. Nat. Commun. 2021;12:685. doi: 10.1038/s41467-021-20959-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Medema M.H., Blin K., Cimermancic P., de Jager V., Zakrzewski P., Fischbach M.A., Weber T., Takano E., Breitling R. antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 2011;39:W339–W346. doi: 10.1093/nar/gkr466. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Medema M.H., de Rond T., Moore B.S. Mining genomes to illuminate the specialized chemistry of life. Nat. Rev. Genet. 2021;22:553–571. doi: 10.1038/s41576-021-00363-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Misra B.B., Assmann S.M., Chen S. Plant single-cell and single-cell-type metabolomics. Trends Plant Sci. 2014;19:637–646. doi: 10.1016/j.tplants.2014.05.005. [DOI] [PubMed] [Google Scholar]
- Moore R.C., Purugganan M.D. The evolutionary dynamics of plant duplicate genes. Curr. Opin. Plant Biol. 2005;8:122–128. doi: 10.1016/j.pbi.2004.12.001. [DOI] [PubMed] [Google Scholar]
- Moore B.M., Wang P., Fan P., Leong B., Schenck C.A., Lloyd J.P., Lehti-Shiu M.D., Last R.L., Pichersky E., Shiu S.-H. Robust predictions of specialized metabolism genes through machine learning. Proc. Natl. Acad. Sci. U S A. 2019;116:2344–2353. doi: 10.1073/pnas.1817074116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nelson D., Werck-Reichhart D. A P450-centric view of plant evolution. Plant J. 2011;66:194–211. doi: 10.1111/j.1365-313X.2011.04529.x. [DOI] [PubMed] [Google Scholar]
- Nett R.S., Lau W., Sattely E.S. Discovery and engineering of colchicine alkaloid biosynthesis. Nature. 2020;584:148–153. doi: 10.1038/s41586-020-2546-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nützmann H.W., Huang A., Osbourn A. Plant metabolic clusters—from genetics to genomics. New Phytol. 2016;211:771–789. doi: 10.1111/nph.13981. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nützmann H.-W., Doerr D., Ramírez-Colmenero A., Sotelo-Fonseca J.E., Wegel E., Di Stefano M., Wingett S.W., Fraser P., Hurst L., Fernandez-Valverde S.L., et al. Active and repressed biosynthetic gene clusters have spatially distinct chromosome states. Proc. Natl. Acad. Sci. U S A. 2020;117:13800–13809. doi: 10.1073/pnas.1920474117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ober D. Seeing double: gene duplication and diversification in plant secondary metabolism. Trends Plant Sci. 2005;10:444–449. doi: 10.1016/j.tplants.2005.07.007. [DOI] [PubMed] [Google Scholar]
- Ober D. Gene duplications and the time thereafter—examples from plant secondary metabolism. Plant Biol. 2010;12:570–577. doi: 10.1111/j.1438-8677.2009.00317.x. [DOI] [PubMed] [Google Scholar]
- Ohno S. Springer-Verlag; 1970. Evolution by Gene Duplication. [Google Scholar]
- Panchy N., Lehti-Shiu M., Shiu S.-H. Evolution of gene duplication in plants. Plant Physiol. 2016;171:2294–2316. doi: 10.1104/pp.16.00523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pichersky E., Lewinsohn E. Convergent evolution in plant specialized metabolism. Annu. Rev. Plant Biol. 2011;62:549–566. doi: 10.1146/annurev-arplant-042110-103814. [DOI] [PubMed] [Google Scholar]
- Pluskal T., Torrens-Spence M.P., Fallon T.R., De Abreu A., Shi C.H., Weng J.-K. The biosynthetic origin of psychoactive kavalactones in kava. Nat. Plants. 2019;5:867–878. doi: 10.1038/s41477-019-0474-0. [DOI] [PubMed] [Google Scholar]
- Polturak G., Osbourn A. The emerging role of biosynthetic gene clusters in plant defense and plant interactions. Plos Pathog. 2021;17:e1009698. doi: 10.1371/journal.ppat.1009698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Polturak G., Liu Z., Osbourn A. New and emerging concepts in the evolution and function of plant biosynthetic gene clusters. Curr. Opin. Green. Sustain. Chem. 2022;33:100568. [Google Scholar]
- Qiao X., Li Q., Yin H., Qi K., Li L., Wang R., Zhang S., Paterson A.H. Gene duplication and evolution in recurring polyploidization–diploidization cycles in plants. Genome Biol. 2019;20:38. doi: 10.1186/s13059-019-1650-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Qin P., et al. Pan-genome analysis of 33 genetically diverse rice accessions reveals hidden genomic variations. Cell. 2021;184:3542–3558.e16. doi: 10.1016/j.cell.2021.04.046. [DOI] [PubMed] [Google Scholar]
- Rabier C.-E., Ta T., Ané C. Detecting and locating whole genome duplications on a phylogeny: a probabilistic approach. Mol. Biol. Evol. 2014;31:750–762. doi: 10.1093/molbev/mst263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schenck C.A., Busta L. Using interdisciplinary, phylogeny-guided approaches to understand the evolution of plant metabolism. Plant Mol. Biol. 2021 doi: 10.1007/s11103-021-01220-1. [DOI] [PubMed] [Google Scholar]
- Scossa F., Fernie A.R. The evolution of metabolism: how to test evolutionary hypotheses at the genomic level. Comput. Struct. Biotechnol. J. 2020;18:482–500. doi: 10.1016/j.csbj.2020.02.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shen S., et al. An Oryza-specific hydroxycinnamoyl tyramine gene cluster contributes to enhanced disease resistance. Sci. Bull. 2021;66:2369–2380. doi: 10.1016/j.scib.2021.03.015. [DOI] [PubMed] [Google Scholar]
- Soltis P.S., Soltis D.E. Plant genomes: markers of evolutionary history and drivers of evolutionary change. Plants, People, PLANET. 2021;3:74–82. [Google Scholar]
- Song J.-M., et al. Eight high-quality genomes reveal pan-genome architecture and ecotype differentiation of Brassica napus. Nat. Plants. 2020;6:34–45. doi: 10.1038/s41477-019-0577-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Song J.-J., et al. A 2-oxoglutarate dependent dioxygenase converts dihydrofuran to furan in Salvia diterpenoids. Plant Physiol. 2021:kiab567. doi: 10.1093/plphys/kiab567. [DOI] [PMC free article] [PubMed] [Google Scholar]
- de Souza L.P., Borghi M., Fernie A. Plant single-cell metabolomics-challenges and perspectives. Int. J. Mol. Sci. 2020;21:8987. doi: 10.3390/ijms21238987. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Su W., et al. Polyploidy underlies co-option and diversification of biosynthetic triterpene pathways in the apple tribe. Proc. Natl. Acad. Sci. U. S. A. 2021;118 doi: 10.1073/pnas.2101767118. e2101767118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tao Y., Zhao X., Mace E., Henry R., Jordan D. Exploring and exploiting pan-genomics for crop improvement. Mol. Plant. 2019;12:156–169. doi: 10.1016/j.molp.2018.12.016. [DOI] [PubMed] [Google Scholar]
- Tettelin H., et al. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial “pan-genome. Proc. Natl. Acad. Sci. U S A. 2005;102:13950–13955. doi: 10.1073/pnas.0506758102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000;408:796–815. doi: 10.1038/35048692. [DOI] [PubMed] [Google Scholar]
- Töpfer N., Fuchs L.-M., Aharoni A. The PhytoClust tool for metabolic gene clusters discovery in plant genomes. Nucleic Acids Res. 2017;45:7049–7063. doi: 10.1093/nar/gkx404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tzfadia O., Diels T., De Meyer S., Vandepoele K., Aharoni A., Van de Peer Y. CoExpNetViz: comparative co-expression networks construction and visualization tool. Front. Plant Sci. 2015;6:1194. doi: 10.3389/fpls.2015.01194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Unver T., et al. Genome of wild olive and the evolution of oil biosynthesis. Proc. Natl. Acad. Sci. U S A. 2017;114:E9413–E9422. doi: 10.1073/pnas.1708621114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Y., Tang H., Debarry J.D., Tan X., Li J., Wang X., Lee T., Jin H., Marler B., Guo H., et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40:e49. doi: 10.1093/nar/gkr1293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang P., et al. The genome evolution and domestication of tropical fruit mango. Genome Biol. 2020;21:60. doi: 10.1186/s13059-020-01959-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weng J.K. The evolutionary paths towards complexity: a metabolic perspective. New Phytol. 2014;201:1141–1149. doi: 10.1111/nph.12416. [DOI] [PubMed] [Google Scholar]
- Wisecaver J.H., Borowsky A.T., Tzin V., Jander G., Kliebenstein D.J., Rokas A. A global coexpression network approach for connecting genes to specialized metabolic pathways in plants. Plant Cell. 2017;29:944–959. doi: 10.1105/tpc.17.00009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu S., et al. Wild tobacco genomes reveal the evolution of nicotine biosynthesis. Proc. Natl. Acad. Sci. U S A. 2017;114:6133–6138. doi: 10.1073/pnas.1700073114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu N., Nützmann H.-W., MacDonald J.T., Moore B., Field B., Berriri S., Trick M., Rosser S.J., Kumar S.V., Freemont P.S., et al. Delineation of metabolic gene clusters in plant genomes by chromatin signatures. Nucleic Acids Res. 2016;44:2255–2265. doi: 10.1093/nar/gkw100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhan C., et al. Selection of a subspecies-specific diterpene gene cluster implicated in rice disease resistance. Nat. Plants. 2020;6:1447–1454. doi: 10.1038/s41477-020-00816-7. [DOI] [PubMed] [Google Scholar]
- Zhao J., Bayer P.E., Ruperao P., Saxena R.K., Khan A.W., Golicz A.A., Nguyen H.T., Batley J., Edwards D., Varshney R.K. Trait associations in the pangenome of pigeon pea (Cajanus cajan) Plant Biotechnol. J. 2020;18:1946–1954. doi: 10.1111/pbi.13354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao T., Zwaenepoel A., Xue J.-Y., Kao S.-M., Li Z., Schranz M.E., Van de Peer Y. Whole-genome microsynteny-based phylogeny of angiosperms. Nat. Commun. 2021;12:3498. doi: 10.1038/s41467-021-23665-0. [DOI] [PMC free article] [PubMed] [Google Scholar]